CuGenDBv2

Tan0020986 (gene) of Snake gourd v1 genome

Gene ID: Tan0020986
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase
Genome location: LG03:1166965..1169943
RNA-Seq Expression: Tan0020986
Synteny: Tan0020986
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily
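
The genome location above is written as scaffold:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; this page does not state its convention), the string can be parsed and the genomic span computed as follows. The function name parse_location is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("LG03:1166965..1169943")
# Assumption: both endpoints are included in the feature.
span = end - start + 1
print(scaffold, start, end, span)  # LG03 1166965 1169943 2979
```

Under a 0-based, half-open convention the span would instead be end - start, so the convention should be confirmed before comparing lengths.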


Homology
GenBank top hits (e value, % identity, alignment)
KAA0042295.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 8.6e-25; % identity: 59.29)
Query:  WLNLIEKCFRILPTCMIST-IKARKLLKKGCTTYLAYVVCALVSKLKPEDIPDGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIY
        W +  E+ F+ L   +++T I A  +  K C  Y       L   L    + DG VIAYASRQLK+HECNYPTHDLELAAVVLALKIWRHYLFG+KC I+
Subjt:  WLNLIEKCFRILPTCMIST-IKARKLLKKGCTTYLAYVVCALVSKLKPEDIPDGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIY

Query:  TDHKSLKYIFEQK
        TDHKSLKYIF+QK
Subjt:  TDHKSLKYIFEQK

KAA0042295.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 7.1e-11; % identity: 45.54)
Query:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI
        R G+R ++ + G Q  T    +  S GESS          + F R TQ +  +  +  +DPEK YGIERLK LGAT FEG+T+PADAE WLN++EKCF +
Subjt:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI

Query:  L
        +
Subjt:  L

KAA0042295.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 8.6e-25; % identity: 59.29)
Query:  WLNLIEKCFRILPTCMIST-IKARKLLKKGCTTYLAYVVCALVSKLKPEDIPDGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIY
        W +  E+CF+ L   +++T I A  +  K    Y       L   L    + DGKVIAYASR+LK+HECNYPTHDLELAAVVLALKIWRHYLFG+KC I+
Subjt:  WLNLIEKCFRILPTCMIST-IKARKLLKKGCTTYLAYVVCALVSKLKPEDIPDGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIY

Query:  TDHKSLKYIFEQK
        TDHKSLKYIF+QK
Subjt:  TDHKSLKYIFEQK

KAA0054447.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 4.3e-24; % identity: 43.09)
Query:  PADAEIWLNLIEKCFRILPTCMISTIKARKLLKKGCTTYLAYVV-----------------------------------------------CALVSKL--
        P  AE+   +++   +I+P  +IS +KA K  +KGCT +LAY+V                                                AL SK+  
Subjt:  PADAEIWLNLIEKCFRILPTCMISTIKARKLLKKGCTTYLAYVV-----------------------------------------------CALVSKL--

Query:  ---------KPEDIP-----DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK
                 +  DIP       KVIAYAS QLKKHECNYPTHDLELAAVVLALKIWRHYLF +KC I+TDHKSLKYIF QK
Subjt:  ---------KPEDIP-----DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK

KAA0060066.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 2.1e-10; % identity: 45.54)
Query:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI
        R G+R ++ + G Q  T    +  S GESS          + FTR TQ +        +DPEK YGIERLK LGAT FEG+T+PAD E WLN++EKCF +
Subjt:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI

Query:  L
        +
Subjt:  L

KAA0060066.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 7.8e-02; % identity: 58.33)
Query:  MISTIKARKLLKKGCTTYLAYVVCALVSKLKPEDIP
        +IS +KA KLL+KGCT +LA++V     KLKPED+P
Subjt:  MISTIKARKLLKKGCTTYLAYVVCALVSKLKPEDIP

KAA0060066.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 3.3e-24; % identity: 91.8)
Query:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK
        DG VIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFG+KC I+TDHKSLKYIF QK
Subjt:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK

KAA0067829.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa] (e value: 1.6e-07; % identity: 39.6)
Query:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI
        R  +R ++ + G Q  T    +  S GESS  +       + F R  Q +        +DP++ YGIERLK LGAT FEG+ +PA+AE WLN++EKCF +
Subjt:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI

Query:  L
        +
Subjt:  L

KAA0067829.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa] (e value: 1.9e-24; % identity: 91.8)
Query:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK
        DG VIAYASRQLK+HECNYPTHDLELAAVVLALKIWRHYLFGKKC I+TDHKSLKYIF+QK
Subjt:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TLH7 Reverse transcriptase (e value: 3.4e-11; % identity: 45.54)
Query:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI
        R G+R ++ + G Q  T    +  S GESS          + F R TQ +  +  +  +DPEK YGIERLK LGAT FEG+T+PADAE WLN++EKCF +
Subjt:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI

Query:  L
        +
Subjt:  L

A0A5A7TLH7 Reverse transcriptase (e value: 9.3e-25; % identity: 91.8)
Query:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK
        DG VIAYASRQLK+HECNYPTHDLELAAVVLALKIWRHYLFGKKC I+TDHKSLKYIF+QK
Subjt:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK

A0A5A7UIP8 DNA/RNA polymerases superfamily protein (e value: 2.1e-24; % identity: 43.09)
Query:  PADAEIWLNLIEKCFRILPTCMISTIKARKLLKKGCTTYLAYVV-----------------------------------------------CALVSKL--
        P  AE+   +++   +I+P  +IS +KA K  +KGCT +LAY+V                                                AL SK+  
Subjt:  PADAEIWLNLIEKCFRILPTCMISTIKARKLLKKGCTTYLAYVV-----------------------------------------------CALVSKL--

Query:  ---------KPEDIP-----DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK
                 +  DIP       KVIAYAS QLKKHECNYPTHDLELAAVVLALKIWRHYLF +KC I+TDHKSLKYIF QK
Subjt:  ---------KPEDIP-----DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK

A0A5A7V0R0 Reverse transcriptase (e value: 9.9e-11; % identity: 45.54)
Query:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI
        R G+R ++ + G Q  T    +  S GESS          + FTR TQ +        +DPEK YGIERLK LGAT FEG+T+PAD E WLN++EKCF +
Subjt:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI

Query:  L
        +
Subjt:  L

A0A5A7V0R0 Reverse transcriptase (e value: 3.8e-02; % identity: 58.33)
Query:  MISTIKARKLLKKGCTTYLAYVVCALVSKLKPEDIP
        +IS +KA KLL+KGCT +LA++V     KLKPED+P
Subjt:  MISTIKARKLLKKGCTTYLAYVVCALVSKLKPEDIP

A0A5A7V0R0 Reverse transcriptase (e value: 1.6e-24; % identity: 91.8)
Query:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK
        DG VIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFG+KC I+TDHKSLKYIF QK
Subjt:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK

A0A5A7VRP7 Retrotransposon protein, putative, Ty3-gypsy subclass (e value: 4.2e-25; % identity: 59.29)
Query:  WLNLIEKCFRILPTCMIST-IKARKLLKKGCTTYLAYVVCALVSKLKPEDIPDGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIY
        W +  E+CF+ L   +++T I A  +  K    Y       L   L    + DGKVIAYASR+LK+HECNYPTHDLELAAVVLALKIWRHYLFG+KC I+
Subjt:  WLNLIEKCFRILPTCMIST-IKARKLLKKGCTTYLAYVVCALVSKLKPEDIPDGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIY

Query:  TDHKSLKYIFEQK
        TDHKSLKYIF+QK
Subjt:  TDHKSLKYIFEQK

A0A5A7VRP7 Retrotransposon protein, putative, Ty3-gypsy subclass (e value: 7.9e-08; % identity: 39.6)
Query:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI
        R  +R ++ + G Q  T    +  S GESS  +       + F R  Q +        +DP++ YGIERLK LGAT FEG+ +PA+AE WLN++EKCF +
Subjt:  RGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRI

Query:  L
        +
Subjt:  L

A0A5A7VRP7 Retrotransposon protein, putative, Ty3-gypsy subclass (e value: 4.2e-25; % identity: 59.29)
Query:  WLNLIEKCFRILPTCMIST-IKARKLLKKGCTTYLAYVVCALVSKLKPEDIPDGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIY
        W +  E+ F+ L   +++T I A  +  K C  Y       L   L    + DG VIAYASRQLK+HECNYPTHDLELAAVVLALKIWRHYLFG+KC I+
Subjt:  WLNLIEKCFRILPTCMIST-IKARKLLKKGCTTYLAYVVCALVSKLKPEDIPDGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIY

Query:  TDHKSLKYIFEQK
        TDHKSLKYIF+QK
Subjt:  TDHKSLKYIFEQK

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 4.4e-08; % identity: 44.26)
Query:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK
        DG  ++Y SR L +HE NY T + EL A+V A K +RHYL G+   I +DH+ L +++  K
Subjt:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK

P10394 Retrovirus-related Pol polyprotein from transposon 412 (e value: 4.6e-05; % identity: 44.44)
Query:  IAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIF
        +AYASR   K E N  T + ELAA+  A+  +R Y++GK   + TDH+ L Y+F
Subjt:  IAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIF

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 1.4e-06; % identity: 40.98)
Query:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK
        +G  I++ SR L  HE NY   + EL A+V A K +RHYL G++  I +DH+ L+++   K
Subjt:  DGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCTCGAGGTGGCAAACGAGGTAAGAAAGTGGAGGCCGGAACTCAAGAAGCTACTGGTGATAGAGGAAGAGAGGCATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAGATCTTCACGAGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCAGCACAAGCAGATCCAGAAAAAAAGTATGGCATTGAAAGACTGAAGG
CCTTAGGTGCAACAACATTTGAAGGCACGACAAATCCCGCTGATGCTGAGATTTGGTTAAATCTGATTGAGAAGTGTTTTAGGATATTGCCTACTTGTATGATATCAACT
ATTAAAGCTAGAAAATTATTGAAGAAAGGTTGTACAACTTATCTTGCATATGTAGTTTGTGCACTAGTGAGTAAATTAAAACCTGAAGACATTCCAGATGGGAAAGTGAT
CGCTTATGCATCAAGACAGTTGAAAAAGCATGAATGTAATTATCCTACTCATGATCTGGAGCTCGCAGCAGTTGTGTTAGCTTTAAAGATTTGGAGGCATTATCTGTTTG
GTAAGAAGTGTTGTATTTACACAGATCATAAAAGTTTAAAATATATCTTCGAACAAAAATAG
mRNA sequence
ATGGCTCGAGGTGGCAAACGAGGTAAGAAAGTGGAGGCCGGAACTCAAGAAGCTACTGGTGATAGAGGAAGAGAGGCATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAGATCTTCACGAGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCAGCACAAGCAGATCCAGAAAAAAAGTATGGCATTGAAAGACTGAAGG
CCTTAGGTGCAACAACATTTGAAGGCACGACAAATCCCGCTGATGCTGAGATTTGGTTAAATCTGATTGAGAAGTGTTTTAGGATATTGCCTACTTGTATGATATCAACT
ATTAAAGCTAGAAAATTATTGAAGAAAGGTTGTACAACTTATCTTGCATATGTAGTTTGTGCACTAGTGAGTAAATTAAAACCTGAAGACATTCCAGATGGGAAAGTGAT
CGCTTATGCATCAAGACAGTTGAAAAAGCATGAATGTAATTATCCTACTCATGATCTGGAGCTCGCAGCAGTTGTGTTAGCTTTAAAGATTTGGAGGCATTATCTGTTTG
GTAAGAAGTGTTGTATTTACACAGATCATAAAAGTTTAAAATATATCTTCGAACAAAAATAG
Protein sequence
MARGGKRGKKVEAGTQEATGDRGREASEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQADPEKKYGIERLKALGATTFEGTTNPADAEIWLNLIEKCFRILPTCMIST
IKARKLLKKGCTTYLAYVVCALVSKLKPEDIPDGKVIAYASRQLKKHECNYPTHDLELAAVVLALKIWRHYLFGKKCCIYTDHKSLKYIFEQK
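
The CDS and protein sequences above are related by codon translation. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1; the page does not state which table was used). It translates the first 30 and last 12 nucleotides of the CDS so the result can be compared with the two ends of the protein sequence. The function name translate and the variable names are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGGCTCGAGGTGGCAAACGAGGTAAGAAA"  # first 30 nt of the CDS above
cds_end = "GAACAAAAATAG"                      # last 12 nt of the CDS above
print(translate(cds_start))  # MARGGKRGKK
print(translate(cds_end))    # EQK
```

Both fragments agree with the listed protein, which begins MARGGKRGKK and ends EQK. Passing the full CDS to the same function would allow a complete comparison.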