; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010489 (gene) of Snake gourd v1 genome

Gene IDTan0010489
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG11:33453196..33454154
RNA-Seq ExpressionTan0010489
SyntenyTan0010489
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037289.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]8.0e-2335.14Show/hide
Query:  EAKKTERFIMGLDENVRGFIQALAPRLRLRKITI---------------NNRSGNSGGEIGHSSKE----------------CGKSRLGKCMALVRACFK
        +A +TE+F+ GL  N++G ++AL P      + I                +R    GG      +E                CG+   G+C+A    CF+
Subjt:  EAKKTERFIMGLDENVRGFIQALAPRLRLRKITI---------------NNRSGNSGGEIGHSSKE----------------CGKSRLGKCMALVRACFK

Query:  CGKEGHRANISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRI-----------LCMDWLAQ
        C + GH A+   R KP    P   S       A QQGR +A+T ++   +  VVTGTLP+LGH AFVLFDS    SFIS +           L MDWL+ 
Subjt:  CGKEGHRANISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRI-----------LCMDWLAQ

Query:  NHACIDCFRKEVVFTPPNQPGY
        NHA IDCF KEV+F  P++P +
Subjt:  NHACIDCFRKEVVFTPPNQPGY

KAA0039148.1 pol protein [Cucumis melo var. makuwa]2.2e-2032.31Show/hide
Query:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSGGEIGHSSKECGKSRL----------GKCMALVRACFKCGKEGHRANI
        EA + E+F+ GL  +++G ++AL P            L L +   ++++   G  +G + K   +  +          G+C+A    CF+C +  H A+ 
Subjt:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSGGEIGHSSKECGKSRL----------GKCMALVRACFKCGKEGHRANI

Query:  SARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRILC-----------------------MDWL
          + KP  + P   S       A QQGR +A+T ++   + ++VTGTLP+L H AFVLFDS    SFIS +                         MDWL
Subjt:  SARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRILC-----------------------MDWL

Query:  AQNHACIDCFRKEVVFTPPNQPGYSSEAQ
        + NHACIDCF KEVVF PP+ P +    Q
Subjt:  AQNHACIDCFRKEVVFTPPNQPGYSSEAQ

KAA0054634.1 pol protein [Cucumis melo var. makuwa]1.3e-2032.81Show/hide
Query:  EAKKTERFIMGLDENVRGFIQALAP-------RLRL-----------------------RKI-----TINNRSGNSGGEIGHSSKE--------------
        EA +TE+F+ GL  +++G ++AL P       R+ L                       RK+      +  R+  SGG      +E              
Subjt:  EAKKTERFIMGLDENVRGFIQALAP-------RLRL-----------------------RKI-----TINNRSGNSGGEIGHSSKE--------------

Query:  --CGKSRLGKCMALVRACFKCGKEGHRANISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISR
          CG+   G+C+A    CF+C + GH A++  R   +T+ PQ  +S        QQGR +A+T ++   S +VVTGTLP+LGH AFVLFDS    SFIS 
Subjt:  --CGKSRLGKCMALVRACFKCGKEGHRANISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISR

Query:  -----------------------------ILCMDWLAQNHACIDCFRKEVVFTPPN
                                     IL MDWL+ NHA IDCF KEVVF PP+
Subjt:  -----------------------------ILCMDWLAQNHACIDCFRKEVVFTPPN

KAA0061201.1 pol protein [Cucumis melo var. makuwa]1.4e-2235.24Show/hide
Query:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSG-----GEIGHSSKE------CGKSRLGKCMALVRACFKCGKEGHRAN
        EA +TE+F+ GL  +++G ++AL P            L L +   ++++   G        G + +E      CG+   G+C+A    CF+C + GH A+
Subjt:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSG-----GEIGHSSKE------CGKSRLGKCMALVRACFKCGKEGHRAN

Query:  ISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISR-----------------------------
        +  R   +T+ PQ  +S        QQGR YA+T ++   + +VVTGTLP+LGH AFVLFDS    SFIS                              
Subjt:  ISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISR-----------------------------

Query:  ILCMDWLAQNHACIDCFRKEVVFTPPN
        IL MDWL+ NHA IDCF KEVVF PP+
Subjt:  ILCMDWLAQNHACIDCFRKEVVFTPPN

TYK26964.1 pol protein [Cucumis melo var. makuwa]2.2e-2032.31Show/hide
Query:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSGGEIGHSSKECGKSRL----------GKCMALVRACFKCGKEGHRANI
        EA + E+F+ GL  +++G ++AL P            L L +   ++++   G  +G + K   +  +          G+C+A    CF+C +  H A+ 
Subjt:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSGGEIGHSSKECGKSRL----------GKCMALVRACFKCGKEGHRANI

Query:  SARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRILC-----------------------MDWL
          + KP  + P   S       A QQGR +A+T ++   + ++VTGTLP+L H AFVLFDS    SFIS +                         MDWL
Subjt:  SARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRILC-----------------------MDWL

Query:  AQNHACIDCFRKEVVFTPPNQPGYSSEAQ
        + NHACIDCF KEVVF PP+ P +    Q
Subjt:  AQNHACIDCFRKEVVFTPPNQPGYSSEAQ

TrEMBL top hitse value%identityAlignment
A0A5A7T8E0 Pol protein1.1e-2032.31Show/hide
Query:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSGGEIGHSSKECGKSRL----------GKCMALVRACFKCGKEGHRANI
        EA + E+F+ GL  +++G ++AL P            L L +   ++++   G  +G + K   +  +          G+C+A    CF+C +  H A+ 
Subjt:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSGGEIGHSSKECGKSRL----------GKCMALVRACFKCGKEGHRANI

Query:  SARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRILC-----------------------MDWL
          + KP  + P   S       A QQGR +A+T ++   + ++VTGTLP+L H AFVLFDS    SFIS +                         MDWL
Subjt:  SARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRILC-----------------------MDWL

Query:  AQNHACIDCFRKEVVFTPPNQPGYSSEAQ
        + NHACIDCF KEVVF PP+ P +    Q
Subjt:  AQNHACIDCFRKEVVFTPPNQPGYSSEAQ

A0A5A7UHL7 Reverse transcriptase6.2e-2132.81Show/hide
Query:  EAKKTERFIMGLDENVRGFIQALAP-------RLRL-----------------------RKI-----TINNRSGNSGGEIGHSSKE--------------
        EA +TE+F+ GL  +++G ++AL P       R+ L                       RK+      +  R+  SGG      +E              
Subjt:  EAKKTERFIMGLDENVRGFIQALAP-------RLRL-----------------------RKI-----TINNRSGNSGGEIGHSSKE--------------

Query:  --CGKSRLGKCMALVRACFKCGKEGHRANISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISR
          CG+   G+C+A    CF+C + GH A++  R   +T+ PQ  +S        QQGR +A+T ++   S +VVTGTLP+LGH AFVLFDS    SFIS 
Subjt:  --CGKSRLGKCMALVRACFKCGKEGHRANISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISR

Query:  -----------------------------ILCMDWLAQNHACIDCFRKEVVFTPPN
                                     IL MDWL+ NHA IDCF KEVVF PP+
Subjt:  -----------------------------ILCMDWLAQNHACIDCFRKEVVFTPPN

A0A5A7V5X5 Pol protein6.6e-2335.24Show/hide
Query:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSG-----GEIGHSSKE------CGKSRLGKCMALVRACFKCGKEGHRAN
        EA +TE+F+ GL  +++G ++AL P            L L +   ++++   G        G + +E      CG+   G+C+A    CF+C + GH A+
Subjt:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSG-----GEIGHSSKE------CGKSRLGKCMALVRACFKCGKEGHRAN

Query:  ISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISR-----------------------------
        +  R   +T+ PQ  +S        QQGR YA+T ++   + +VVTGTLP+LGH AFVLFDS    SFIS                              
Subjt:  ISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISR-----------------------------

Query:  ILCMDWLAQNHACIDCFRKEVVFTPPN
        IL MDWL+ NHA IDCF KEVVF PP+
Subjt:  ILCMDWLAQNHACIDCFRKEVVFTPPN

A0A5D3C3E0 Ty3-gypsy retrotransposon protein3.9e-2335.14Show/hide
Query:  EAKKTERFIMGLDENVRGFIQALAPRLRLRKITI---------------NNRSGNSGGEIGHSSKE----------------CGKSRLGKCMALVRACFK
        +A +TE+F+ GL  N++G ++AL P      + I                +R    GG      +E                CG+   G+C+A    CF+
Subjt:  EAKKTERFIMGLDENVRGFIQALAPRLRLRKITI---------------NNRSGNSGGEIGHSSKE----------------CGKSRLGKCMALVRACFK

Query:  CGKEGHRANISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRI-----------LCMDWLAQ
        C + GH A+   R KP    P   S       A QQGR +A+T ++   +  VVTGTLP+LGH AFVLFDS    SFIS +           L MDWL+ 
Subjt:  CGKEGHRANISARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRI-----------LCMDWLAQ

Query:  NHACIDCFRKEVVFTPPNQPGY
        NHA IDCF KEV+F  P++P +
Subjt:  NHACIDCFRKEVVFTPPNQPGY

A0A5D3DTY0 Pol protein1.1e-2032.31Show/hide
Query:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSGGEIGHSSKECGKSRL----------GKCMALVRACFKCGKEGHRANI
        EA + E+F+ GL  +++G ++AL P            L L +   ++++   G  +G + K   +  +          G+C+A    CF+C +  H A+ 
Subjt:  EAKKTERFIMGLDENVRGFIQALAPR-----------LRLRKITINNRSGNSGGEIGHSSKECGKSRL----------GKCMALVRACFKCGKEGHRANI

Query:  SARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRILC-----------------------MDWL
          + KP  + P   S       A QQGR +A+T ++   + ++VTGTLP+L H AFVLFDS    SFIS +                         MDWL
Subjt:  SARIKPKTSHPQIMSSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRILC-----------------------MDWL

Query:  AQNHACIDCFRKEVVFTPPNQPGYSSEAQ
        + NHACIDCF KEVVF PP+ P +    Q
Subjt:  AQNHACIDCFRKEVVFTPPNQPGYSSEAQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGGAATTTACCCGTCTGTCCCGCTTTGCCCCGAGCCGGTGGATACGAAGCAAAGAAGACCGAGCGCTTCATCATGGGGCTAGATGAGAATGTTCGAGGCTTTAT
TCAAGCTTTAGCACCTCGATTACGCCTCCGCAAAATCACCATCAATAACAGATCGGGGAATAGCGGCGGAGAGATAGGCCACAGCAGCAAGGAATGTGGTAAATCGCGAC
TGGGTAAATGTATGGCCTTGGTCCGAGCGTGTTTCAAGTGTGGAAAGGAAGGGCACAGAGCTAATATCTCCGCTCGAATCAAGCCAAAGACTAGCCACCCACAGATTATG
AGTAGTGGGCCTGCTGTGAAGCCGGCCATACAACAGGGGAGAGCCTATGCGAGCACCAGTCGAGACATCTATGGCTCAGACTCAGTGGTTACAGGTACACTTCCATTACT
TGGACACCTCGCCTTTGTACTATTTGATTCTCGGTTCTACGACTCTTTTATCTCTCGCATCCTTTGCATGGACTGGTTAGCGCAGAACCATGCTTGTATTGATTGTTTCA
GGAAGGAGGTAGTGTTCACTCCTCCTAACCAACCTGGCTACAGTTCAGAGGCACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGGAATTTACCCGTCTGTCCCGCTTTGCCCCGAGCCGGTGGATACGAAGCAAAGAAGACCGAGCGCTTCATCATGGGGCTAGATGAGAATGTTCGAGGCTTTAT
TCAAGCTTTAGCACCTCGATTACGCCTCCGCAAAATCACCATCAATAACAGATCGGGGAATAGCGGCGGAGAGATAGGCCACAGCAGCAAGGAATGTGGTAAATCGCGAC
TGGGTAAATGTATGGCCTTGGTCCGAGCGTGTTTCAAGTGTGGAAAGGAAGGGCACAGAGCTAATATCTCCGCTCGAATCAAGCCAAAGACTAGCCACCCACAGATTATG
AGTAGTGGGCCTGCTGTGAAGCCGGCCATACAACAGGGGAGAGCCTATGCGAGCACCAGTCGAGACATCTATGGCTCAGACTCAGTGGTTACAGGTACACTTCCATTACT
TGGACACCTCGCCTTTGTACTATTTGATTCTCGGTTCTACGACTCTTTTATCTCTCGCATCCTTTGCATGGACTGGTTAGCGCAGAACCATGCTTGTATTGATTGTTTCA
GGAAGGAGGTAGTGTTCACTCCTCCTAACCAACCTGGCTACAGTTCAGAGGCACAATGA
Protein sequenceShow/hide protein sequence
MRGNLPVCPALPRAGGYEAKKTERFIMGLDENVRGFIQALAPRLRLRKITINNRSGNSGGEIGHSSKECGKSRLGKCMALVRACFKCGKEGHRANISARIKPKTSHPQIM
SSGPAVKPAIQQGRAYASTSRDIYGSDSVVTGTLPLLGHLAFVLFDSRFYDSFISRILCMDWLAQNHACIDCFRKEVVFTPPNQPGYSSEAQ