; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022807 (gene) of Snake gourd v1 genome

Gene IDTan0022807
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG02:66950409..66953698
RNA-Seq ExpressionTan0022807
SyntenyTan0022807
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037515.1 uncharacterized protein E6C27_scaffold277G001260 [Cucumis melo var. makuwa]1.1e-7468.1Show/hide
Query:  FSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPAD----AEVW---
        + GVMPPR+SR+ RQ++  TQDPTQGQ  RGSS  R      ++  A S++E+GRPE AGPSDPEK YGI+RL+KL ATVF GSTD AD    AE W   
Subjt:  FSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPAD----AEVW---

Query:  ----RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLV
            R DARTL WQTFR IFE+KYYP+T  EAKRDEFLELKQGSLS+AEYERKYTE S+YA +I+ASESDRC RFERGLR EIRTPVT I KWT+FSQLV
Subjt:  ----RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLV

Query:  ETALRVEQSIAEEKSVVEPSRGALTTRSFRDP
        ETALRVEQSI EEKS +E SRG  TT   R P
Subjt:  ETALRVEQSIAEEKSVVEPSRGALTTRSFRDP

KAA0041108.1 reverse transcriptase [Cucumis melo var. makuwa]2.3e-7260.84Show/hide
Query:  CFPIFSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVW---
        C  +  GVMPPRT R+ RQ+Q G Q PTQG     SS   V  G  N++ A +++E+GR + A PSDPEK YGI+RL+KLGATVF GSTDPADAE W   
Subjt:  CFPIFSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVW---

Query:  -------------------------------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVAS
                                             R DAR L WQTFR IFEDKYYPST  EAKRDEFL LKQGSLSVAEYERKYTE S+YADVI+AS
Subjt:  -------------------------------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVAS

Query:  ESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR
        ESDRCRRFERGLR EIRTPVT I KWT+FSQLVETALRVEQSI EEKS VE SRG  T   FR
Subjt:  ESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR

KAA0051980.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.4e-7761.73Show/hide
Query:  LGMSISFIHMCHFILSTLSVLRGNNVV--VSVG--RLYSSFFQCF--PIF---------SGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVE
        L ++  F+H+  F   T  V+ G ++V  VSVG  R+      C+   +F          GVMPPRTSR+ RQ+Q   QDPTQGQ  RGSS  R      
Subjt:  LGMSISFIHMCHFILSTLSVLRGNNVV--VSVG--RLYSSFFQCF--PIF---------SGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVE

Query:  NKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERK
        ++    S++E+GR E A PSDPEK YGI+RL++LGATVF GSTD ADAEVW  DARTL WQTFR IFE+KYYP+TC EAKRDEFLELKQGSLSVA+Y+RK
Subjt:  NKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERK

Query:  YTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR
        YTE S YA+VI+ASESDRCRRFERGL  EIRTPVT I KWTDFSQL+ETALRVEQSI EEKS +E SRG  TT   R
Subjt:  YTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR

KAA0056353.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]8.9e-7270.83Show/hide
Query:  VMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQ
        VMPPRTS++ RQ+Q GTQDPTQGQ  RGSS  R      ++  + S++E+GRPE AGPSDPEK YGI+RL+KL ATVF GSTD ADAEVWR DARTL WQ
Subjt:  VMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQ

Query:  TFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEK
        TFR IFE+KYYP+T  EAKRDEFLELKQ SLSV EYERK      YA++IVA ESDRC R ERGLR E RTPVT ITKW DFSQLVETALRVEQSI EEK
Subjt:  TFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEK

Query:  SVVEPSRGALTTRSFR
        SV+E SRG  TT   R
Subjt:  SVVEPSRGALTTRSFR

KAA0056684.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]8.9e-7261.63Show/hide
Query:  SGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVW--------
        +GVMPPRT R+ RQ+Q G Q PTQG     SS   V  G  N++ A +++E+GR + A PSDPEK YGI+RL+KLGATVF GSTDPADAE W        
Subjt:  SGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVW--------

Query:  --------------------------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRC
                                        R DAR L WQTFR IFEDKYYPST  EAKRDEFL LKQGSLSVAEYERKYTE S+YADVI+ASESDRC
Subjt:  --------------------------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRC

Query:  RRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR
        RRFERGLR EIRTPVT I KWT+FSQLVETALRVEQSI EEKS VE SRG  T   FR
Subjt:  RRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR

TrEMBL top hitse value%identityAlignment
A0A5A7TDR2 Reverse transcriptase1.1e-7260.84Show/hide
Query:  CFPIFSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVW---
        C  +  GVMPPRT R+ RQ+Q G Q PTQG     SS   V  G  N++ A +++E+GR + A PSDPEK YGI+RL+KLGATVF GSTDPADAE W   
Subjt:  CFPIFSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVW---

Query:  -------------------------------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVAS
                                             R DAR L WQTFR IFEDKYYPST  EAKRDEFL LKQGSLSVAEYERKYTE S+YADVI+AS
Subjt:  -------------------------------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVAS

Query:  ESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR
        ESDRCRRFERGLR EIRTPVT I KWT+FSQLVETALRVEQSI EEKS VE SRG  T   FR
Subjt:  ESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR

A0A5A7U9X4 DNA/RNA polymerases superfamily protein6.9e-7861.73Show/hide
Query:  LGMSISFIHMCHFILSTLSVLRGNNVV--VSVG--RLYSSFFQCF--PIF---------SGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVE
        L ++  F+H+  F   T  V+ G ++V  VSVG  R+      C+   +F          GVMPPRTSR+ RQ+Q   QDPTQGQ  RGSS  R      
Subjt:  LGMSISFIHMCHFILSTLSVLRGNNVV--VSVG--RLYSSFFQCF--PIF---------SGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVE

Query:  NKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERK
        ++    S++E+GR E A PSDPEK YGI+RL++LGATVF GSTD ADAEVW  DARTL WQTFR IFE+KYYP+TC EAKRDEFLELKQGSLSVA+Y+RK
Subjt:  NKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERK

Query:  YTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR
        YTE S YA+VI+ASESDRCRRFERGL  EIRTPVT I KWTDFSQL+ETALRVEQSI EEKS +E SRG  TT   R
Subjt:  YTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR

A0A5A7UKD3 DNA/RNA polymerases superfamily protein4.3e-7270.83Show/hide
Query:  VMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQ
        VMPPRTS++ RQ+Q GTQDPTQGQ  RGSS  R      ++  + S++E+GRPE AGPSDPEK YGI+RL+KL ATVF GSTD ADAEVWR DARTL WQ
Subjt:  VMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQ

Query:  TFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEK
        TFR IFE+KYYP+T  EAKRDEFLELKQ SLSV EYERK      YA++IVA ESDRC R ERGLR E RTPVT ITKW DFSQLVETALRVEQSI EEK
Subjt:  TFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEK

Query:  SVVEPSRGALTTRSFR
        SV+E SRG  TT   R
Subjt:  SVVEPSRGALTTRSFR

A0A5A7UNA3 Reverse transcriptase4.3e-7261.63Show/hide
Query:  SGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVW--------
        +GVMPPRT R+ RQ+Q G Q PTQG     SS   V  G  N++ A +++E+GR + A PSDPEK YGI+RL+KLGATVF GSTDPADAE W        
Subjt:  SGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVW--------

Query:  --------------------------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRC
                                        R DAR L WQTFR IFEDKYYPST  EAKRDEFL LKQGSLSVAEYERKYTE S+YADVI+ASESDRC
Subjt:  --------------------------------RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRC

Query:  RRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR
        RRFERGLR EIRTPVT I KWT+FSQLVETALRVEQSI EEKS VE SRG  T   FR
Subjt:  RRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFR

A0A5D3BSA6 Retrotrans_gag domain-containing protein5.4e-7568.1Show/hide
Query:  FSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPAD----AEVW---
        + GVMPPR+SR+ RQ++  TQDPTQGQ  RGSS  R      ++  A S++E+GRPE AGPSDPEK YGI+RL+KL ATVF GSTD AD    AE W   
Subjt:  FSGVMPPRTSRQCRQDQGGTQDPTQGQYGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPAD----AEVW---

Query:  ----RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLV
            R DARTL WQTFR IFE+KYYP+T  EAKRDEFLELKQGSLS+AEYERKYTE S+YA +I+ASESDRC RFERGLR EIRTPVT I KWT+FSQLV
Subjt:  ----RRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVAEYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLV

Query:  ETALRVEQSIAEEKSVVEPSRGALTTRSFRDP
        ETALRVEQSI EEKS +E SRG  TT   R P
Subjt:  ETALRVEQSIAEEKSVVEPSRGALTTRSFRDP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGGAAGTTACTTTAAAGTTCAGGTCAAGTTCAGGATGCAGCAGGAAAACGAATATGGAAATAACATAAGCTTATTTTTAAGGAAGCACCTTTCGAAGTATTTACG
GACATATCCTTTGGGAATGTCAATTTCGTTTATTCATATGTGTCATTTTATATTGTCGACGTTGAGTGTACTCCGTGGCAACAATGTTGTCGTGAGTGTTGGGCGGCTTT
ATAGCTCATTCTTTCAGTGTTTTCCAATTTTTTCAGGAGTCATGCCACCACGTACTAGCAGACAATGCAGGCAGGATCAGGGCGGGACACAGGATCCTACCCAAGGCCAA
TATGGGAGGGGTTCTAGTGCCTCGAGAGTTCTGACTGGGGTCGAAAATAAAGAGCATGCTAGTTCCTCAGAGGAGGTAGGTAGGCCAGAGACAGCAGGGCCAAGTGATCC
AGAGAAAACATATGGAATAAAACGCCTGGAGAAATTAGGAGCCACAGTGTTTGGGGGTTCCACAGATCCAGCTGACGCCGAGGTTTGGCGCAGGGATGCACGTACTTTAT
ACTGGCAAACTTTCAGAAGCATATTCGAGGATAAGTATTACCCTAGCACGTGTCGCGAGGCAAAGAGGGATGAGTTTTTAGAGTTAAAGCAAGGGTCACTTTCAGTGGCT
GAGTACGAGAGGAAGTATACCGAGTTCTCGCAGTATGCTGATGTGATTGTGGCATCCGAGAGTGACAGATGTCGAAGGTTTGAAAGAGGATTACGTCCTGAGATACGTAC
CCCAGTCACAGTTATTACTAAGTGGACTGACTTTTCTCAGCTAGTAGAGACTGCTCTACGTGTTGAGCAGAGTATAGCAGAGGAGAAGTCAGTAGTGGAGCCTAGTCGTG
GGGCTTTGACAACAAGAAGTTTTCGAGATCCAGGTGCTACACATTCCTTTGTTTCTAGTATATTCCTAACCAAGCTGAATAGGAAGCTAAAGCCTTTACTGAGAGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTGGAAGTTACTTTAAAGTTCAGGTCAAGTTCAGGATGCAGCAGGAAAACGAATATGGAAATAACATAAGCTTATTTTTAAGGAAGCACCTTTCGAAGTATTTACG
GACATATCCTTTGGGAATGTCAATTTCGTTTATTCATATGTGTCATTTTATATTGTCGACGTTGAGTGTACTCCGTGGCAACAATGTTGTCGTGAGTGTTGGGCGGCTTT
ATAGCTCATTCTTTCAGTGTTTTCCAATTTTTTCAGGAGTCATGCCACCACGTACTAGCAGACAATGCAGGCAGGATCAGGGCGGGACACAGGATCCTACCCAAGGCCAA
TATGGGAGGGGTTCTAGTGCCTCGAGAGTTCTGACTGGGGTCGAAAATAAAGAGCATGCTAGTTCCTCAGAGGAGGTAGGTAGGCCAGAGACAGCAGGGCCAAGTGATCC
AGAGAAAACATATGGAATAAAACGCCTGGAGAAATTAGGAGCCACAGTGTTTGGGGGTTCCACAGATCCAGCTGACGCCGAGGTTTGGCGCAGGGATGCACGTACTTTAT
ACTGGCAAACTTTCAGAAGCATATTCGAGGATAAGTATTACCCTAGCACGTGTCGCGAGGCAAAGAGGGATGAGTTTTTAGAGTTAAAGCAAGGGTCACTTTCAGTGGCT
GAGTACGAGAGGAAGTATACCGAGTTCTCGCAGTATGCTGATGTGATTGTGGCATCCGAGAGTGACAGATGTCGAAGGTTTGAAAGAGGATTACGTCCTGAGATACGTAC
CCCAGTCACAGTTATTACTAAGTGGACTGACTTTTCTCAGCTAGTAGAGACTGCTCTACGTGTTGAGCAGAGTATAGCAGAGGAGAAGTCAGTAGTGGAGCCTAGTCGTG
GGGCTTTGACAACAAGAAGTTTTCGAGATCCAGGTGCTACACATTCCTTTGTTTCTAGTATATTCCTAACCAAGCTGAATAGGAAGCTAAAGCCTTTACTGAGAGGTTGA
Protein sequenceShow/hide protein sequence
MIGSYFKVQVKFRMQQENEYGNNISLFLRKHLSKYLRTYPLGMSISFIHMCHFILSTLSVLRGNNVVVSVGRLYSSFFQCFPIFSGVMPPRTSRQCRQDQGGTQDPTQGQ
YGRGSSASRVLTGVENKEHASSSEEVGRPETAGPSDPEKTYGIKRLEKLGATVFGGSTDPADAEVWRRDARTLYWQTFRSIFEDKYYPSTCREAKRDEFLELKQGSLSVA
EYERKYTEFSQYADVIVASESDRCRRFERGLRPEIRTPVTVITKWTDFSQLVETALRVEQSIAEEKSVVEPSRGALTTRSFRDPGATHSFVSSIFLTKLNRKLKPLLRG