; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007396 (gene) of Snake gourd v1 genome

Gene IDTan0007396
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG03:15950801..16017035
RNA-Seq ExpressionTan0007396
SyntenyTan0007396
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054351.1 reverse transcriptase [Cucumis melo var. makuwa]6.5e-0938.33Show/hide
Query:  TIVASEIDRCLIL-LSTLSVLRDNSAVIELPGSIT-----------RHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHA
        ++   E+D+ +I  LSTLSVLRDN AV+E+  S+             ++  W  L F+ ++V                         G++RG+NVCWLHA
Subjt:  TIVASEIDRCLIL-LSTLSVLRDNSAVIELPGSIT-----------RHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHA

Query:  VSRAKLACGSGGGVTTWYQS
        V RA+ A G GGGVTTWYQS
Subjt:  VSRAKLACGSGGGVTTWYQS

KAA0058280.1 uncharacterized protein E6C27_scaffold274G006090 [Cucumis melo var. makuwa]2.1e-0739.47Show/hide
Query:  EIDRCLIL-LSTLSVLRDNSAVIELP---------GSITRHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHAVSRAKLA
        E+D+ +I  LSTLSVLRDN+ VIELP          S   ++  W  L F+ ++V                         G++RG++VCWLHAV RAK  
Subjt:  EIDRCLIL-LSTLSVLRDNSAVIELP---------GSITRHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHAVSRAKLA

Query:  CGSGGGVTTWYQSS
         G GGGVTTW  S+
Subjt:  CGSGGGVTTWYQSS

KAA0066738.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.0e-0643.56Show/hide
Query:  TIVASEIDRCLIL-LSTLSVLRDNSAV-------------IELPGSITRHNLLWKLL-MFDMLYV-----VGVLRGNNVCWLHAVSRAKLACGSGGGVTT
        ++   E+D+ +I  LSTLS+LRDN  +             IELPG  T   L +K   +    YV      G++RG++VCWLHAV RAK A G GGGVTT
Subjt:  TIVASEIDRCLIL-LSTLSVLRDNSAV-------------IELPGSITRHNLLWKLL-MFDMLYV-----VGVLRGNNVCWLHAVSRAKLACGSGGGVTT

Query:  W
        W
Subjt:  W

TYK11835.1 uncharacterized protein E5676_scaffold152G00520 [Cucumis melo var. makuwa]5.5e-0838.66Show/hide
Query:  TIVASEIDRCLIL-LSTLSVLRDNSAVIELP---------GSITRHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHAVS
        ++   E+D+ +I  LSTLSVLRDN+AVIELP          S   ++  W  L F+ ++V                         G++RG++VCWLHAV 
Subjt:  TIVASEIDRCLIL-LSTLSVLRDNSAVIELP---------GSITRHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHAVS

Query:  RAKLACGSGGGVTTWYQSS
        RAK   G GGGVTTW  S+
Subjt:  RAKLACGSGGGVTTWYQSS

TYK14494.1 uncharacterized protein E5676_scaffold15G00050 [Cucumis melo var. makuwa]3.0e-0648.57Show/hide
Query:  STLSVLRDNSAVIELPGSITRHNLLWKLLMFDMLYVVGVLRGNNVCWLHAVSRAKLACGSGGGVTTWYQS
        STLSVL DN AV+E+   +                  G++R ++VCWLH+V RAK A G GGGVTTWYQS
Subjt:  STLSVLRDNSAVIELPGSITRHNLLWKLLMFDMLYVVGVLRGNNVCWLHAVSRAKLACGSGGGVTTWYQS

TrEMBL top hitse value%identityAlignment
A0A5A7UT17 CCHC-type domain-containing protein1.0e-0739.47Show/hide
Query:  EIDRCLIL-LSTLSVLRDNSAVIELP---------GSITRHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHAVSRAKLA
        E+D+ +I  LSTLSVLRDN+ VIELP          S   ++  W  L F+ ++V                         G++RG++VCWLHAV RAK  
Subjt:  EIDRCLIL-LSTLSVLRDNSAVIELP---------GSITRHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHAVSRAKLA

Query:  CGSGGGVTTWYQSS
         G GGGVTTW  S+
Subjt:  CGSGGGVTTWYQSS

A0A5D3CJX3 CCHC-type domain-containing protein2.6e-0838.66Show/hide
Query:  TIVASEIDRCLIL-LSTLSVLRDNSAVIELP---------GSITRHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHAVS
        ++   E+D+ +I  LSTLSVLRDN+AVIELP          S   ++  W  L F+ ++V                         G++RG++VCWLHAV 
Subjt:  TIVASEIDRCLIL-LSTLSVLRDNSAVIELP---------GSITRHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHAVS

Query:  RAKLACGSGGGVTTWYQSS
        RAK   G GGGVTTW  S+
Subjt:  RAKLACGSGGGVTTWYQSS

A0A5D3CMM1 Reverse transcriptase3.1e-0938.33Show/hide
Query:  TIVASEIDRCLIL-LSTLSVLRDNSAVIELPGSIT-----------RHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHA
        ++   E+D+ +I  LSTLSVLRDN AV+E+  S+             ++  W  L F+ ++V                         G++RG+NVCWLHA
Subjt:  TIVASEIDRCLIL-LSTLSVLRDNSAVIELPGSIT-----------RHNLLWKLLMFDMLYV------------------------VGVLRGNNVCWLHA

Query:  VSRAKLACGSGGGVTTWYQS
        V RA+ A G GGGVTTWYQS
Subjt:  VSRAKLACGSGGGVTTWYQS

A0A5D3CU23 Retrotrans_gag domain-containing protein1.5e-0648.57Show/hide
Query:  STLSVLRDNSAVIELPGSITRHNLLWKLLMFDMLYVVGVLRGNNVCWLHAVSRAKLACGSGGGVTTWYQS
        STLSVL DN AV+E+   +                  G++R ++VCWLH+V RAK A G GGGVTTWYQS
Subjt:  STLSVLRDNSAVIELPGSITRHNLLWKLLMFDMLYVVGVLRGNNVCWLHAVSRAKLACGSGGGVTTWYQS

A0A5D3DVW9 DNA/RNA polymerases superfamily protein5.0e-0743.56Show/hide
Query:  TIVASEIDRCLIL-LSTLSVLRDNSAV-------------IELPGSITRHNLLWKLL-MFDMLYV-----VGVLRGNNVCWLHAVSRAKLACGSGGGVTT
        ++   E+D+ +I  LSTLS+LRDN  +             IELPG  T   L +K   +    YV      G++RG++VCWLHAV RAK A G GGGVTT
Subjt:  TIVASEIDRCLIL-LSTLSVLRDNSAV-------------IELPGSITRHNLLWKLL-MFDMLYV-----VGVLRGNNVCWLHAVSRAKLACGSGGGVTT

Query:  W
        W
Subjt:  W

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGATTACTTGAATTCCTTATTGCAAGAAGGGATTGTATCGTGGAATAATCATGGACACAGGATGCCCCCACTCGCATGTCAACCACATGAACGAGTCAGATCACC
TCGTTTGTATCTAATACAAATATGCGATGTAGGGTCGATGACTATAATAGAATATGAGAAGAAGTTCACAAAGTTGTCAAAGTATGCTAGCACTATTGTTGCAAGCGAGA
TAGATCGATGTCTTATTCTTTTGTCGACATTGAGTGTACTCCGTGACAACAGCGCTGTCATCGAGCTCCCTGGCTCGATAACTCGCCATAATCTGCTTTGGAAGCTTCTC
ATGTTTGATATGTTGTACGTGGTTGGTGTGTTAAGAGGTAATAATGTCTGTTGGCTTCATGCCGTCTCCCGTGCTAAGTTAGCATGTGGTTCGGGAGGGGGTGTGACAAC
TTGGTATCAGAGCAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGATTACTTGAATTCCTTATTGCAAGAAGGGATTGTATCGTGGAATAATCATGGACACAGGATGCCCCCACTCGCATGTCAACCACATGAACGAGTCAGATCACC
TCGTTTGTATCTAATACAAATATGCGATGTAGGGTCGATGACTATAATAGAATATGAGAAGAAGTTCACAAAGTTGTCAAAGTATGCTAGCACTATTGTTGCAAGCGAGA
TAGATCGATGTCTTATTCTTTTGTCGACATTGAGTGTACTCCGTGACAACAGCGCTGTCATCGAGCTCCCTGGCTCGATAACTCGCCATAATCTGCTTTGGAAGCTTCTC
ATGTTTGATATGTTGTACGTGGTTGGTGTGTTAAGAGGTAATAATGTCTGTTGGCTTCATGCCGTCTCCCGTGCTAAGTTAGCATGTGGTTCGGGAGGGGGTGTGACAAC
TTGGTATCAGAGCAGTTAG
Protein sequenceShow/hide protein sequence
MRDYLNSLLQEGIVSWNNHGHRMPPLACQPHERVRSPRLYLIQICDVGSMTIIEYEKKFTKLSKYASTIVASEIDRCLILLSTLSVLRDNSAVIELPGSITRHNLLWKLL
MFDMLYVVGVLRGNNVCWLHAVSRAKLACGSGGGVTTWYQSS