; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014341 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014341
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionRNase H domain-containing protein
Genome locationChr02:9729418..9729807
RNA-Seq ExpressionHG10014341
SyntenyHG10014341
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VAH20608.1 unnamed protein product [Triticum turgidum subsp. durum]6.7e-0734.58Show/hide
Query:  AEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYPTWIIY
        AE++ +R GL  A      +L++ESDS   +D+I+ +D        ++E  + L   F  V    C R  N VAD IAKNA S   ++ W    P +I  
Subjt:  AEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYPTWIIY

Query:  QVQNDLS
        Q+ NDL+
Subjt:  QVQNDLS

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.8e-0736.28Show/hide
Query:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAK-NAKSCYLNVDWLVDYPTWI
        LAEI  + EGL+FA+     +L VESDS   I LI ++         W+ +I+ L+  F  + F    R CN  A  +AK    S      WL ++PTW+
Subjt:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAK-NAKSCYLNVDWLVDYPTWI

Query:  IYQVQNDLSSNLA
        +  VQ D  SN A
Subjt:  IYQVQNDLSSNLA

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia]3.2e-0932.17Show/hide
Query:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYPTWII
        LA+IL +REGL  A+      ++VE+DS + ++LI D   W   A  W+EDI+  +  F  + F    R  N VA H+ +   S      W +D+P W+ 
Subjt:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYPTWII

Query:  YQVQNDLSSNLALEA
           +    +++AL A
Subjt:  YQVQNDLSSNLALEA

XP_027101192.1 uncharacterized protein LOC113720678 [Coffea arabica]1.1e-0635.71Show/hide
Query:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIH--DDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYPTW
        + E L +R  +  A       +  ESD K  ID I+  +DD+  ++    L DIKRL LSF    F F  R  N V+ HIAK A +     +W  D+P W
Subjt:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIH--DDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYPTW

Query:  IIYQVQNDLSSN
        ++  VQ D  S+
Subjt:  IIYQVQNDLSSN

XP_027103277.1 uncharacterized protein LOC113724589 [Coffea arabica]6.1e-0833.63Show/hide
Query:  GIP-LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYP
        G+P + E   +R+ L F  +   MN++++SD +  ID I +  +    A   L DIK LS  F S IF F  R  N VA H+AK A +    + W+  +P
Subjt:  GIP-LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYP

Query:  TWIIYQVQNDLSS
         W+    +ND+ +
Subjt:  TWIIYQVQNDLSS

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248748.5e-0836.28Show/hide
Query:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAK-NAKSCYLNVDWLVDYPTWI
        LAEI  + EGL+FA+     +L VESDS   I LI ++         W+ +I+ L+  F  + F    R CN  A  +AK    S      WL ++PTW+
Subjt:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAK-NAKSCYLNVDWLVDYPTWI

Query:  IYQVQNDLSSNLA
        +  VQ D  SN A
Subjt:  IYQVQNDLSSNLA

A0A6J1DZK3 uncharacterized protein LOC1110249681.6e-0932.17Show/hide
Query:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYPTWII
        LA+IL +REGL  A+      ++VE+DS + ++LI D   W   A  W+EDI+  +  F  + F    R  N VA H+ +   S      W +D+P W+ 
Subjt:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYPTWII

Query:  YQVQNDLSSNLALEA
           +    +++AL A
Subjt:  YQVQNDLSSNLALEA

A0A6P6T067 uncharacterized protein LOC1136964901.2e-0628.21Show/hide
Query:  KRGGGIPLAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLV
        ++ G     E L +R  L  A       + V+SD +  +  I+  ++        LEDI+ L  SF+S IF F  RS N  +  +A+ A      +DW  
Subjt:  KRGGGIPLAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLV

Query:  DYPTWIIYQVQNDLSSN
         +PTW+    + D+  N
Subjt:  DYPTWIIYQVQNDLSSN

A0A6P6VGX4 uncharacterized protein LOC1137206785.5e-0735.71Show/hide
Query:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIH--DDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYPTW
        + E L +R  +  A       +  ESD K  ID I+  +DD+  ++    L DIKRL LSF    F F  R  N V+ HIAK A +     +W  D+P W
Subjt:  LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIH--DDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYPTW

Query:  IIYQVQNDLSSN
        ++  VQ D  S+
Subjt:  IIYQVQNDLSSN

A0A6P6VMX3 uncharacterized protein LOC1137245892.9e-0833.63Show/hide
Query:  GIP-LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYP
        G+P + E   +R+ L F  +   MN++++SD +  ID I +  +    A   L DIK LS  F S IF F  R  N VA H+AK A +    + W+  +P
Subjt:  GIP-LAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYP

Query:  TWIIYQVQNDLSS
         W+    +ND+ +
Subjt:  TWIIYQVQNDLSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein2.7e-0632.58Show/hide
Query:  AEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVD
        AE+  +R  +   S F    ++ ESDS+  I+++++D+IW S     ++D++RL   F  V F F  R  N +A+ +A+ + S +LN D
Subjt:  AEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCCTAAAGATCAAAAGGGGTGGGGGTATACCTTTGGCTGAAATCCTTGTGGTCCGTGAAGGTCTTCGTTTTGCTAGCAATTTTCCTAATATGAACCTTATGGT
TGAATCAGATAGTAAGCAAACGATCGATCTAATTCACGATGATGATATCTGGGCGAGTAGTGCTGATTGCTGGCTTGAAGATATAAAGAGGCTGTCATTGTCCTTTAATT
CGGTGATTTTTTGTTTTTGCCATAGAAGTTGTAATGTTGTAGCGGACCACATTGCCAAAAATGCAAAATCTTGTTATCTTAATGTTGATTGGCTGGTCGATTATCCCACG
TGGATAATTTATCAAGTCCAAAACGACTTGTCTTCAAATTTGGCCCTTGAGGCGCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTCCTAAAGATCAAAAGGGGTGGGGGTATACCTTTGGCTGAAATCCTTGTGGTCCGTGAAGGTCTTCGTTTTGCTAGCAATTTTCCTAATATGAACCTTATGGT
TGAATCAGATAGTAAGCAAACGATCGATCTAATTCACGATGATGATATCTGGGCGAGTAGTGCTGATTGCTGGCTTGAAGATATAAAGAGGCTGTCATTGTCCTTTAATT
CGGTGATTTTTTGTTTTTGCCATAGAAGTTGTAATGTTGTAGCGGACCACATTGCCAAAAATGCAAAATCTTGTTATCTTAATGTTGATTGGCTGGTCGATTATCCCACG
TGGATAATTTATCAAGTCCAAAACGACTTGTCTTCAAATTTGGCCCTTGAGGCGCTCTAA
Protein sequenceShow/hide protein sequence
MAFLKIKRGGGIPLAEILVVREGLRFASNFPNMNLMVESDSKQTIDLIHDDDIWASSADCWLEDIKRLSLSFNSVIFCFCHRSCNVVADHIAKNAKSCYLNVDWLVDYPT
WIIYQVQNDLSSNLALEAL