; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021773 (gene) of Snake gourd v1 genome

Gene IDTan0021773
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG03:14758472..14759086
RNA-Seq ExpressionTan0021773
SyntenyTan0021773
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026280.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.8e-2758.88Show/hide
Query:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSSE-------
        +EEQ+  R+ QRL   +R  Q+D EKKYGIERLKALGATTF GTT+P DAE WL LIEKCF+V RC E++KV+L AF+LQ GA+DWW +  S        
Subjt:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSSE-------

Query:  -WVEFSK
         W EF K
Subjt:  -WVEFSK

KAA0035225.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.1e-2944.77Show/hide
Query:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKG----ADDWWKI-------
        MEEQ+  R+ QRL   +RS Q+DPEKKYGIERLKALGATTF GTT+PADAE WL LIEKCFRV RCPE++KV+L +F+LQ G     +DWW++       
Subjt:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKG----ADDWWKI-------

Query:  --------------------------------------------------TSSEWVEFSKLVEMALRVERSL
                                                          T ++W +FSKLVE ALRVE+SL
Subjt:  --------------------------------------------------TSSEWVEFSKLVEMALRVERSL

KAA0036813.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.1e-2743.64Show/hide
Query:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSS--------
        +EEQ+  R+ QRL   +RS Q+DPEKKYG ERLKALGATTF GTT+P D E WL LIEKCFRV R  E++KV+L AF+LQ  A+DWW++  S        
Subjt:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSS--------

Query:  ----------------------------------------------EWVEFSKLVEMALRVERSL
                                                      +W +FSKLVE+ALRVE+SL
Subjt:  ----------------------------------------------EWVEFSKLVEMALRVERSL

KAA0060484.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa]1.4e-2767.39Show/hide
Query:  SVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSS--------EWVEFSK
        S  STQ+DPEKKYGIERLKALGATTF GTT+PADAE WL LIEKCFRV RCPE++KV+L AF+LQ GA+DWW++  S         W EF K
Subjt:  SVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSS--------EWVEFSK

XP_038896416.1 uncharacterized protein LOC120084680 [Benincasa hispida]1.1e-3264.04Show/hide
Query:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSSEWVEFSKL
        ME+++F RITQRLA SV S Q DPEKK+GIERLKALGATTF+GTTDP DAE+WL LIEKCF+VMRCPE+ KV+L  F+LQKG + WW           KL
Subjt:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSSEWVEFSKL

Query:  VEMALRVERSLVDD
        VE A+RVE S+  +
Subjt:  VEMALRVERSLVDD

TrEMBL top hitse value%identityAlignment
A0A5A7SJ99 DNA/RNA polymerases superfamily protein8.9e-2858.88Show/hide
Query:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSSE-------
        +EEQ+  R+ QRL   +R  Q+D EKKYGIERLKALGATTF GTT+P DAE WL LIEKCF+V RC E++KV+L AF+LQ GA+DWW +  S        
Subjt:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSSE-------

Query:  -WVEFSK
         W EF K
Subjt:  -WVEFSK

A0A5A7T1M0 Reverse transcriptase5.2e-2843.64Show/hide
Query:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSS--------
        +EEQ+  R+ QRL   +RS Q+DPEKKYG ERLKALGATTF GTT+P D E WL LIEKCFRV R  E++KV+L AF+LQ  A+DWW++  S        
Subjt:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSS--------

Query:  ----------------------------------------------EWVEFSKLVEMALRVERSL
                                                      +W +FSKLVE+ALRVE+SL
Subjt:  ----------------------------------------------EWVEFSKLVEMALRVERSL

A0A5A7UZM6 Gag protease polyprotein-like protein6.8e-2867.39Show/hide
Query:  SVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSS--------EWVEFSK
        S  STQ+DPEKKYGIERLKALGATTF GTT+PADAE WL LIEKCFRV RCPE++KV+L AF+LQ GA+DWW++  S         W EF K
Subjt:  SVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSS--------EWVEFSK

A0A5D3BB91 Reverse transcriptase1.2e-2757.94Show/hide
Query:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSS--------
        +EEQ+  R+ QRL   +RS Q+DPEKKYG ERLKALGATTF GTT+P D E WL LIEKCFRV R  E++KV+L AF+LQ  A+DWW++  S        
Subjt:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSS--------

Query:  EWVEFSK
         W EF K
Subjt:  EWVEFSK

A0A5D3DES5 DNA/RNA polymerases superfamily protein5.6e-3044.77Show/hide
Query:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKG----ADDWWKI-------
        MEEQ+  R+ QRL   +RS Q+DPEKKYGIERLKALGATTF GTT+PADAE WL LIEKCFRV RCPE++KV+L +F+LQ G     +DWW++       
Subjt:  MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKG----ADDWWKI-------

Query:  --------------------------------------------------TSSEWVEFSKLVEMALRVERSL
                                                          T ++W +FSKLVE ALRVE+SL
Subjt:  --------------------------------------------------TSSEWVEFSKLVEMALRVERSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAACAGATCTTTACGAGGATAACTCAAAGATTAGCTGAAAGTGTTAGATCAACACAAGCAGATCCAGAAAAGAAGTATGGCATTGAAAGACTGAAAGCCTTAGG
TGCAACAACATTTGAAGGCACGACAGATCCCGCTGATGCTGAGGTTTGGTTAAATCTGATTGAGAAGTGCTTTAGGGTCATGCGATGCCCTGAAGAAAAGAAGGTCGATT
TAGTAGCATTCATACTTCAGAAAGGAGCAGATGATTGGTGGAAGATAACAAGTTCTGAGTGGGTTGAGTTCTCCAAGCTTGTGGAGATGGCATTACGAGTAGAGCGAAGC
CTAGTAGATGACATAATGGGAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAACAGATCTTTACGAGGATAACTCAAAGATTAGCTGAAAGTGTTAGATCAACACAAGCAGATCCAGAAAAGAAGTATGGCATTGAAAGACTGAAAGCCTTAGG
TGCAACAACATTTGAAGGCACGACAGATCCCGCTGATGCTGAGGTTTGGTTAAATCTGATTGAGAAGTGCTTTAGGGTCATGCGATGCCCTGAAGAAAAGAAGGTCGATT
TAGTAGCATTCATACTTCAGAAAGGAGCAGATGATTGGTGGAAGATAACAAGTTCTGAGTGGGTTGAGTTCTCCAAGCTTGTGGAGATGGCATTACGAGTAGAGCGAAGC
CTAGTAGATGACATAATGGGAAAATGA
Protein sequenceShow/hide protein sequence
MEEQIFTRITQRLAESVRSTQADPEKKYGIERLKALGATTFEGTTDPADAEVWLNLIEKCFRVMRCPEEKKVDLVAFILQKGADDWWKITSSEWVEFSKLVEMALRVERS
LVDDIMGK