; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007844 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007844
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionRNase H domain-containing protein
Genome locationChr10:15239241..15244132
RNA-Seq ExpressionHG10007844
SyntenyHG10007844
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_010675428.1 PREDICTED: uncharacterized protein LOC104891430 [Beta vulgaris subsp. vulgaris]1.5e-1039.5Show/hide
Query:  EMR-EAQLQEAMV-ERWMPPGGGDMKLNIDASCVSNVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVR
        EMR EA L  A V E+W PPGG  +K N DA+  ++  +G G V+R+S  +V+ A  +   G  ++  AE  A    ++ A DMG  N+ +ESDS KLV 
Subjt:  EMR-EAQLQEAMV-ERWMPPGGGDMKLNIDASCVSNVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVR

Query:  AIQNYSMQNSSIGGILEEI
         +Q   ++NS  GG++ +I
Subjt:  AIQNYSMQNSSIGGILEEI

XP_021840770.1 uncharacterized protein LOC110780758, partial [Spinacia oleracea]2.8e-0935.34Show/hide
Query:  GGKQVREGDEMREAQLQEAMVERWMPPGGGDMKLNIDASCVSNVGVGWGAVIRDSQRNVV-----KAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTN
        GG   R   E R  Q  +     W  PG G +K+N+DA C+  VG G GAV+RD+   V+     +   ++E  V     AE  A+LF ++++ +MG   
Subjt:  GGKQVREGDEMREAQLQEAMVERWMPPGGGDMKLNIDASCVSNVGVGWGAVIRDSQRNVV-----KAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTN

Query:  LWVESDSQKLVRAIQNYSMQNSSIGGILEEIRD
        + VE D   +V+A+Q     NSS+  +LE+IRD
Subjt:  LWVESDSQKLVRAIQNYSMQNSSIGGILEEIRD

XP_039832956.1 uncharacterized protein LOC120693663 isoform X1 [Panicum virgatum]6.2e-0939.81Show/hide
Query:  VERWMPPGGGDMKLNIDASCVSNVGV-GWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSI
        VE W  P    +K+N D S   N G  GWGAVIR+S   VVKA        ++    EV+A L  ++MA   G+TN+ +E+D+  L  A+ N S + SS+
Subjt:  VERWMPPGGGDMKLNIDASCVSNVGV-GWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSI

Query:  GGILEEIR
        GG++ EI+
Subjt:  GGILEEIR

XP_042950614.1 uncharacterized protein LOC122282698 [Carya illinoinensis]2.8e-0935.85Show/hide
Query:  RWMPPGGGDMKLNIDASCVSN-VGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSIGG
        +W PP  G +KLNID +  S+    G GAV+RD +  V+ AA+K E  + + +  E+LA+L  +Q+   +G+T+L VESDS   V+ ++      +  GG
Subjt:  RWMPPGGGDMKLNIDASCVSN-VGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSIGG

Query:  ILEEIR
        +++E++
Subjt:  ILEEIR

XP_042958214.1 uncharacterized protein LOC122293826 [Carya illinoinensis]1.1e-0835.04Show/hide
Query:  REAQLQEAMVERWMPPGGGDMKLNIDASCVSNVG-VGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQ
        +++++Q     RW  P    +KLN+D +  S +G +G GA++RD+   V+ AA+  E  V +    E+LALL  +Q    MG+  + VESD   ++ A+Q
Subjt:  REAQLQEAMVERWMPPGGGDMKLNIDASCVSNVG-VGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQ

Query:  NYSMQNSSIGGILEEIR
          SM NS+ G +  EI+
Subjt:  NYSMQNSSIGGILEEIR

TrEMBL top hitse value%identityAlignment
A0A2I4FLN6 uncharacterized protein LOC1090001954.3e-0836.79Show/hide
Query:  RWMPPGGGDMKLNIDASCVSNVG-VGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSIGG
        +W PP  G +KLN+D +  ++VG  G G V+RD    VV AA K E+ V      E+LA+L  +Q    +G+ NL VESD   +V+ +Q+     SS G 
Subjt:  RWMPPGGGDMKLNIDASCVSNVG-VGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSIGG

Query:  ILEEIR
        ++++++
Subjt:  ILEEIR

A0A7J6F1G1 Uncharacterized protein1.3e-0734.95Show/hide
Query:  WMPPGGGDMKLNIDASCV-SNVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSIGGI
        W PP  G   +N DAS +    G G G +IRD    +V AA  +  G ++VL AE LA+   +++A    L N+++ SD+Q ++ A++  +  N+  G I
Subjt:  WMPPGGGDMKLNIDASCV-SNVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSIGGI

Query:  LEE
        LE+
Subjt:  LEE

A0A7J6GEF1 RNase H domain-containing protein9.6e-0834.95Show/hide
Query:  WMPPGGGDMKLNIDASCV-SNVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSIGGI
        W PP  G   +N DAS +    G G G +IRD    +V AA  +  G ++VL AE LA+   +++A    L N+++ SD+Q ++ A++  +  N+  G I
Subjt:  WMPPGGGDMKLNIDASCV-SNVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSIGGI

Query:  LEE
        LE+
Subjt:  LEE

A0A7J6H3K7 Uncharacterized protein1.3e-0734.95Show/hide
Query:  WMPPGGGDMKLNIDASCV-SNVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSIGGI
        W PP  G   +N DAS +    G G G +IRD    +V AA  +  G ++VL AE LA+   +++A    L N+++ SD+Q ++ A++  +  N+  G I
Subjt:  WMPPGGGDMKLNIDASCV-SNVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSIGGI

Query:  LEE
        LE+
Subjt:  LEE

A0A803QGC3 Uncharacterized protein1.3e-0734.26Show/hide
Query:  VERWMPPGGGDMKLNIDASCV-SNVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSI
        +  W PP  G   +N DAS +  + G G  AVIRDS+  +V AA  F  G ++VL A+   +L  I +A    ++N+ V SDSQ +++A+   +  ++  
Subjt:  VERWMPPGGGDMKLNIDASCV-SNVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSI

Query:  GGILEEIR
        G ++ EI+
Subjt:  GGILEEIR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein8.9e-0631.48Show/hide
Query:  RWMPPGGGDMKLNIDASCVSN---VGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSI
        +W PP  G +K N D+           GW   IR+   ++V   N   +     L AE L  L  +Q+ +  GL  +W ESDS+ LV  I N    +S +
Subjt:  RWMPPGGGDMKLNIDASCVSN---VGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSI

Query:  GGILEEIR
        G ++ +IR
Subjt:  GGILEEIR

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.9e-0427.52Show/hide
Query:  REGDEMREAQLQEAMVERWMPPGGGDMKLNIDASCVS---NVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDS
        +  D ++  Q + +  ++W  PG   +K N D S  +   + G+ W  +IR+SQ   +       +G   +  AE  AL++ IQ A+D+G   +  E D+
Subjt:  REGDEMREAQLQEAMVERWMPPGGGDMKLNIDASCVS---NVGVGWGAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDS

Query:  QKLVRAIQN
          + R I+N
Subjt:  QKLVRAIQN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCCGAGAGGTGGCTGAAATTTTAAGGACTATGGTTGGCAAAGTTGAAGAGATTGAAGGGGAGAGCAAAAGTAGATGGACGAGCCCATTCATGAGGATACGATTTCA
AATTGATGTCACTCAACCATTGCGTAGAGGATGCAAATTAAGAACTAAGGAAAGAAAAGAGGCGCAGCGACAGAATGGTGGTAAGCAAGTGCGTGAAGGGGATGAAATGC
GAGAGGCCCAGCTGCAGGAGGCAATGGTGGAGAGGTGGATGCCACCGGGGGGAGGGGATATGAAGTTGAATATCGATGCTTCATGTGTGTCTAATGTGGGAGTGGGGTGG
GGGGCTGTTATTCGTGACTCTCAAAGAAATGTGGTGAAAGCTGCGAATAAATTTGAGGAGGGAGTGGTGAATGTTTTAGCTGCTGAAGTGTTAGCGCTCTTGTTTGAAAT
TCAAATGGCCTTTGATATGGGTCTAACTAATTTGTGGGTAGAATCTGATTCTCAGAAATTAGTTAGAGCCATTCAAAATTATTCTATGCAAAATTCTTCAATTGGTGGAA
TTCTAGAAGAGATTAGAGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCCGAGAGGTGGCTGAAATTTTAAGGACTATGGTTGGCAAAGTTGAAGAGATTGAAGGGGAGAGCAAAAGTAGATGGACGAGCCCATTCATGAGGATACGATTTCA
AATTGATGTCACTCAACCATTGCGTAGAGGATGCAAATTAAGAACTAAGGAAAGAAAAGAGGCGCAGCGACAGAATGGTGGTAAGCAAGTGCGTGAAGGGGATGAAATGC
GAGAGGCCCAGCTGCAGGAGGCAATGGTGGAGAGGTGGATGCCACCGGGGGGAGGGGATATGAAGTTGAATATCGATGCTTCATGTGTGTCTAATGTGGGAGTGGGGTGG
GGGGCTGTTATTCGTGACTCTCAAAGAAATGTGGTGAAAGCTGCGAATAAATTTGAGGAGGGAGTGGTGAATGTTTTAGCTGCTGAAGTGTTAGCGCTCTTGTTTGAAAT
TCAAATGGCCTTTGATATGGGTCTAACTAATTTGTGGGTAGAATCTGATTCTCAGAAATTAGTTAGAGCCATTCAAAATTATTCTATGCAAAATTCTTCAATTGGTGGAA
TTCTAGAAGAGATTAGAGACTAG
Protein sequenceShow/hide protein sequence
MSREVAEILRTMVGKVEEIEGESKSRWTSPFMRIRFQIDVTQPLRRGCKLRTKERKEAQRQNGGKQVREGDEMREAQLQEAMVERWMPPGGGDMKLNIDASCVSNVGVGW
GAVIRDSQRNVVKAANKFEEGVVNVLAAEVLALLFEIQMAFDMGLTNLWVESDSQKLVRAIQNYSMQNSSIGGILEEIRD