; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007681 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007681
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr10:9990794..9991474
RNA-Seq ExpressionHG10007681
SyntenyHG10007681
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN79190.1 hypothetical protein VITISV_000232 [Vitis vinifera]5.8e-3837.33Show/hide
Query:  DAASFTTSTITKGAYSLSILFTLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELID
        D+    +  +  G++S+S+ F +      WL+ V GP++   R+ F  EL D++CL    W +GGDFN+IR   E    GR   +M   + FI   ELID
Subjt:  DAASFTTSTITKGAYSLSILFTLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELID

Query:  FPISKGIFTWSDFRTPPTHSRIDRFLHT---ESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPG
         P+    FTWS+ +  P   R+DRFL++   E +  +S+   L R    TSDH+P++      + GP+PFRFENMWL+  SF     +WW      G  G
Subjt:  FPISKGIFTWSDFRTPPTHSRIDRFLHT---ESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPG

Query:  HGFINKLNGLKQAIKEWNITTFGNI
        H F+ KL  LK  +KEWN   FG++
Subjt:  HGFINKLNGLKQAIKEWNITTFGNI

RVW26402.1 hypothetical protein CK203_086098 [Vitis vinifera]1.3e-3739.29Show/hide
Query:  DAASFTTSTITKGAYSLSILFTLADGFDLWL-TGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELI
        D  S     + +G +S+S  F       +W+ TGV GP + + RE    E   +  L EE W +GGDFN I +  E S +GR   AM  F   I+   L+
Subjt:  DAASFTTSTITKGAYSLSILFTLADGFDLWL-TGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELI

Query:  DFPISKGIFTWSDFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHG
        DFP+  G FTWS+     +  R+DRFL T S +++      +RL +PTSDH+P+L   G  R GPSPF+FENMWL  + F   +  WW      G P + 
Subjt:  DFPISKGIFTWSDFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHG

Query:  FINKLNGLKQAIKEWNITTFGNIE
           KL GLKQ +K WN   FG +E
Subjt:  FINKLNGLKQAIKEWNITTFGNIE

RVX17959.1 Protein MICRORCHIDIA 4 [Vitis vinifera]2.6e-3838.84Show/hide
Query:  DAASFTTSTITKGAYSLSILFTLADGFDLWL-TGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELI
        D  S     + +G +S+S  F   D   +W+ TGV GP + ++RE    EL  +  L EE W +GGDFN+  +  + + +GR   AM  F   I+   L+
Subjt:  DAASFTTSTITKGAYSLSILFTLADGFDLWL-TGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELI

Query:  DFPISKGIFTWSDFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHG
        D P+  G FTWS      T +R+DRFL T   +++      +RL +PTSDHYP+L   G  R GPSPF+FENMWL  + F   +  WW      G P + 
Subjt:  DFPISKGIFTWSDFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHG

Query:  FINKLNGLKQAIKEWNITTFGNIE
           KL GLKQ +K WN   FG +E
Subjt:  FINKLNGLKQAIKEWNITTFGNIE

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]4.1e-4446.31Show/hide
Query:  KGAYSLSILFTLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELIDFPISKGIFTWS
        +G +SL+I F L+DGF  W++G+ GPST +    F  EL DL  L E  W + GDFN+ RW +E SN     ++M  FNSFI    LID P++ G  TWS
Subjt:  KGAYSLSILFTLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELIDFPISKGIFTWS

Query:  DFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHGFINKLNGLKQAI
              + S ID FL T   I+K      KR+ + TSDH+P+L   G    G +PFRFENMWL+ K+F PF+  WW N P  G PGHG + KL  LK AI
Subjt:  DFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHGFINKLNGLKQAI

Query:  KEW
        K W
Subjt:  KEW

XP_028075126.1 uncharacterized protein LOC114277426 [Camellia sinensis]1.7e-3740.38Show/hide
Query:  ITKGAYSLSILFTLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELIDFPISKGIFT
        +  G +SLSI F   D  + WLTGV GP+   +R+ +  EL  L+ L    W +G DFN++R   E  N   TNR+M  F+SFI   ELID P+S   FT
Subjt:  ITKGAYSLSILFTLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELIDFPISKGIFT

Query:  WSDFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHGFINKLNGLKQ
        WS+ +      R+DRFLHT       ++   +   +  SDH+P++      + GPSPFRFENMWL  +SF  F  +WW N    G  G  FI KL G+K 
Subjt:  WSDFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHGFINKLNGLKQ

Query:  AIKEWNITTFGNI
         +K W    FG++
Subjt:  AIKEWNITTFGNI

TrEMBL top hitse value%identityAlignment
A0A2N9ED35 Reverse transcriptase domain-containing protein4.8e-3842.18Show/hide
Query:  GAYSLSILF-TLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELIDFPISKGIFTWS
        G YSLS  F ++ D F+   +GV GP++D  R     EL  L    +  W IGGDFN++R+P E S     + AM+ F+ FI    L+D P+  G+FTWS
Subjt:  GAYSLSILF-TLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELIDFPISKGIFTWS

Query:  DFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHGFINKLNGLKQAI
        + R     SRIDRFL +    +     + +RL +  SDHYP+L   G    G SPFRFENMWL    F+  V  WW+++   G P H F +KL  LK  +
Subjt:  DFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHGFINKLNGLKQAI

Query:  KEWNITTFGNI
        K+WNI  FGNI
Subjt:  KEWNITTFGNI

A0A2N9GVD9 Reverse transcriptase domain-containing protein3.6e-3842.18Show/hide
Query:  GAYSLSILF-TLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELIDFPISKGIFTWS
        G YSLS  F ++ D F+   TGV GP++D  R     EL  L    +  W IGGDFN++R+P E S     + AM+ F+ FI    L+D P+  G+FTWS
Subjt:  GAYSLSILF-TLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELIDFPISKGIFTWS

Query:  DFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHGFINKLNGLKQAI
        + R     SRIDRFL +    +     + +RL +  SDHYP+L   G    G SPFRFENMWL    F+  V  WW+++   G P H F +KL  LK  +
Subjt:  DFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHGFINKLNGLKQAI

Query:  KEWNITTFGNI
        K+WN   FGNI
Subjt:  KEWNITTFGNI

A0A438K9R4 Protein MICRORCHIDIA 41.3e-3838.84Show/hide
Query:  DAASFTTSTITKGAYSLSILFTLADGFDLWL-TGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELI
        D  S     + +G +S+S  F   D   +W+ TGV GP + ++RE    EL  +  L EE W +GGDFN+  +  + + +GR   AM  F   I+   L+
Subjt:  DAASFTTSTITKGAYSLSILFTLADGFDLWL-TGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELI

Query:  DFPISKGIFTWSDFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHG
        D P+  G FTWS      T +R+DRFL T   +++      +RL +PTSDHYP+L   G  R GPSPF+FENMWL  + F   +  WW      G P + 
Subjt:  DFPISKGIFTWSDFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHG

Query:  FINKLNGLKQAIKEWNITTFGNIE
           KL GLKQ +K WN   FG +E
Subjt:  FINKLNGLKQAIKEWNITTFGNIE

A0A6J1E2G6 uncharacterized protein LOC1110254052.0e-4446.31Show/hide
Query:  KGAYSLSILFTLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELIDFPISKGIFTWS
        +G +SL+I F L+DGF  W++G+ GPST +    F  EL DL  L E  W + GDFN+ RW +E SN     ++M  FNSFI    LID P++ G  TWS
Subjt:  KGAYSLSILFTLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELIDFPISKGIFTWS

Query:  DFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHGFINKLNGLKQAI
              + S ID FL T   I+K      KR+ + TSDH+P+L   G    G +PFRFENMWL+ K+F PF+  WW N P  G PGHG + KL  LK AI
Subjt:  DFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHGFINKLNGLKQAI

Query:  KEW
        K W
Subjt:  KEW

A5BQD9 Reverse transcriptase domain-containing protein2.8e-3837.33Show/hide
Query:  DAASFTTSTITKGAYSLSILFTLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELID
        D+    +  +  G++S+S+ F +      WL+ V GP++   R+ F  EL D++CL    W +GGDFN+IR   E    GR   +M   + FI   ELID
Subjt:  DAASFTTSTITKGAYSLSILFTLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELID

Query:  FPISKGIFTWSDFRTPPTHSRIDRFLHT---ESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPG
         P+    FTWS+ +  P   R+DRFL++   E +  +S+   L R    TSDH+P++      + GP+PFRFENMWL+  SF     +WW      G  G
Subjt:  FPISKGIFTWSDFRTPPTHSRIDRFLHT---ESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPG

Query:  HGFINKLNGLKQAIKEWNITTFGNI
        H F+ KL  LK  +KEWN   FG++
Subjt:  HGFINKLNGLKQAIKEWNITTFGNI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAGATGCAGCTTCTTTCACAACATCCACCATCACTAAAGGTGCATATTCCTTAAGTATCCTTTTCACACTTGCTGACGGCTTTGATCTTTGGCTAACAGGGGT
GTGTGGACCATCCACTGATCAAAACAGAGAACAATTCATCCTTGAGCTTCATGATCTTTACTGTCTTGTGGAAGAAATCTGGACCATCGGGGGCGATTTCAATCTCATCC
GTTGGCCTTTCGAAAATTCCAACAGTGGTAGAACTAACAGAGCTATGTCAGCTTTTAATTCCTTTATCAATCATCGAGAACTCATTGATTTTCCCATTAGTAAAGGTATT
TTTACATGGTCTGATTTTCGTACGCCACCAACTCACTCAAGGATTGACCGTTTTCTTCACACAGAATCTATCATTAACAAATCCATTGATGCATCTTTGAAGAGATTAGA
CAAACCCACATCAGACCATTATCCCCTTCTCTCGACTCTTGGAAATCAAAGATCAGGTCCATCACCTTTCAGATTTGAAAACATGTGGCTCAATCGCAAAAGCTTCCTTC
CTTTTGTTACAAATTGGTGGAACAATCATCCATGTTTTGGGCATCCTGGGCATGGCTTTATCAACAAATTGAATGGGCTGAAACAGGCGATTAAAGAATGGAATATTACC
ACTTTTGGTAATATTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAGATGCAGCTTCTTTCACAACATCCACCATCACTAAAGGTGCATATTCCTTAAGTATCCTTTTCACACTTGCTGACGGCTTTGATCTTTGGCTAACAGGGGT
GTGTGGACCATCCACTGATCAAAACAGAGAACAATTCATCCTTGAGCTTCATGATCTTTACTGTCTTGTGGAAGAAATCTGGACCATCGGGGGCGATTTCAATCTCATCC
GTTGGCCTTTCGAAAATTCCAACAGTGGTAGAACTAACAGAGCTATGTCAGCTTTTAATTCCTTTATCAATCATCGAGAACTCATTGATTTTCCCATTAGTAAAGGTATT
TTTACATGGTCTGATTTTCGTACGCCACCAACTCACTCAAGGATTGACCGTTTTCTTCACACAGAATCTATCATTAACAAATCCATTGATGCATCTTTGAAGAGATTAGA
CAAACCCACATCAGACCATTATCCCCTTCTCTCGACTCTTGGAAATCAAAGATCAGGTCCATCACCTTTCAGATTTGAAAACATGTGGCTCAATCGCAAAAGCTTCCTTC
CTTTTGTTACAAATTGGTGGAACAATCATCCATGTTTTGGGCATCCTGGGCATGGCTTTATCAACAAATTGAATGGGCTGAAACAGGCGATTAAAGAATGGAATATTACC
ACTTTTGGTAATATTGAATGA
Protein sequenceShow/hide protein sequence
MVSDAASFTTSTITKGAYSLSILFTLADGFDLWLTGVCGPSTDQNREQFILELHDLYCLVEEIWTIGGDFNLIRWPFENSNSGRTNRAMSAFNSFINHRELIDFPISKGI
FTWSDFRTPPTHSRIDRFLHTESIINKSIDASLKRLDKPTSDHYPLLSTLGNQRSGPSPFRFENMWLNRKSFLPFVTNWWNNHPCFGHPGHGFINKLNGLKQAIKEWNIT
TFGNIE