; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010017 (gene) of Snake gourd v1 genome

Gene IDTan0010017
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG07:31693818..31694726
RNA-Seq ExpressionTan0010017
SyntenyTan0010017
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039476.1 uncharacterized protein E6C27_scaffold64G002900 [Cucumis melo var. makuwa]9.3e-4343.27Show/hide
Query:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC
        SAQ++P+KKYG ERLKALGAT+F  TT+P D+E                               A+DWW++ E+ +      SW +F+KAF DK+Y  S 
Subjt:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC

Query:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV
        RDAKRNEFL L QGSM V EYEKK+TELS YA+ ++ +E++R KRFE+GLR EIRT VT  ++W +FSKLVE  L V +SL ++R  +        T+  
Subjt:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV

Query:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER
         + R+R  +  + RF PGV  +G+FK          +G+GG  +R
Subjt:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER

KAA0060484.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa]2.2e-4444.08Show/hide
Query:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC
        S Q++PEKKYG ERLKALGAT+F  TT+PAD E                               A+DWW++ E+ +      SW +F+KAF DK+Y  S 
Subjt:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC

Query:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV
        RDAKRNEFL L QGSM + EYEKK+TELS YA+ ++ +E++RCKRFE+GLR EIRT VT  ++W +FSKLVE  L VE+SL ++R  +        T+  
Subjt:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV

Query:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER
         + R+R  +  + RF PGVS +G+FK      +  ++G+ G  +R
Subjt:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER

TYJ95881.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]3.9e-4942Show/hide
Query:  GKRGRQVEAETQEATGD--RGRKVSEGESSHPQQEVNMEEQIFTRITQRLAERVVSAQANPEKKYGTERLKALGATSFEATTDPADIE------------
        GK  +  +AE   A  +   G   S+ ESS P+ E N+EEQ+  R+ QRL   + SAQ++PEKKYG ERLKALGAT+F  TT+P D+E            
Subjt:  GKRGRQVEAETQEATGD--RGRKVSEGESSHPQQEVNMEEQIFTRITQRLAERVVSAQANPEKKYGTERLKALGATSFEATTDPADIE------------

Query:  -------------------ADDWWKIIENIKEEA--WSWKDFRKAFEDKYYSSSCRDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKR
                           A+DWW++ E+ +      SW +F+KAF DK+Y  S RDAK NEF+ L QG+M V EYEKK+TELS YA+ ++ +E +RCKR
Subjt:  -------------------ADDWWKIIENIKEEA--WSWKDFRKAFEDKYYSSSCRDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKR

Query:  FEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFVDLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER
        FE+GLR EIRT VT  ++W +FSKLVE  L VE+SL ++R  +        T+   + R+R  +  + RF P VS +GSFK      +  ++ +GG  +R
Subjt:  FEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFVDLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER

TYK15233.1 uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa]9.3e-4343.27Show/hide
Query:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC
        SAQ++P+KKYG ERLKALGAT+F  TT+P D+E                               A+DWW++ E+ +      SW +F+KAF DK+Y  S 
Subjt:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC

Query:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV
        RDAKRNEFL L QGSM V EYEKK+TELS YA+ ++ +E++R KRFE+GLR EIRT VT  ++W +FSKLVE  L V +SL ++R  +        T+  
Subjt:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV

Query:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER
         + R+R  +  + RF PGV  +G+FK          +G+GG  +R
Subjt:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER

XP_038890030.1 uncharacterized protein LOC120079741 [Benincasa hispida]2.3e-4147.71Show/hide
Query:  VSEGESSHPQQEV--NMEEQIFTRITQRLAERVVSAQANPEKKYGTERLKALGATSFEATTDPADI-------------------------------EAD
        +SEGESS PQ      +E+ +F RI QRLA  V   +AN EKKY  ER KALGA +FE TTDP +                                EA+
Subjt:  VSEGESSHPQQEV--NMEEQIFTRITQRLAERVVSAQANPEKKYGTERLKALGATSFEATTDPADI-------------------------------EAD

Query:  DWWKII--ENIKEEAWSWKDFRKAFEDKYYSSSCRDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVE
         WWK+I       +A  W +F+KA +DKY  SS RDAKR+EFL L QG+M V EYE+KFTELS YA  I+A E DRC++FE GLR EI+T VT+++ W++
Subjt:  DWWKII--ENIKEEAWSWKDFRKAFEDKYYSSSCRDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVE

Query:  FSKLVETTLWVERSLVDD
        F++LVE    VERSL DD
Subjt:  FSKLVETTLWVERSLVDD

TrEMBL top hitse value%identityAlignment
A0A5A7SX06 DNA/RNA polymerases superfamily protein2.7e-4040.92Show/hide
Query:  RGGKRGRQVEAETQEATGDRGRKVSEGESSHPQQEVNMEEQIFTRITQRLAERVVSAQANPEKKYGTERLKALGATSFEATTDPADI-------------
        R G+R RQ +   Q  T    +  S GESS          + FTR TQ +     +  +NP+K YG ERLK LGAT FE +TDP D              
Subjt:  RGGKRGRQVEAETQEATGDRGRKVSEGESSHPQQEVNMEEQIFTRITQRLAERVVSAQANPEKKYGTERLKALGATSFEATTDPADI-------------

Query:  ------------------EADDWWKIIENIKEE--AWSWKDFRKAFEDKYYSSSCRDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKR
                          EA+ WWK I   + +  A  W+ FR  FEDKYY S+  +AKR+EFLGL QGS+ V EYEKK+TELS YA  IVA+E DRC+R
Subjt:  ------------------EADDWWKIIENIKEE--AWSWKDFRKAFEDKYYSSSCRDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKR

Query:  FEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFVDLPRDRFQRGDNRRFTPG--VSGKGSFKPHRGRQTTPRTGTGGRE
        FE GLR EIRT VT  ++W  FS+LVET L VE+S+ +++       G  TT       + F+  + RRFTPG  +S +  FK   G Q +     G   
Subjt:  FEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFVDLPRDRFQRGDNRRFTPG--VSGKGSFKPHRGRQTTPRTGTGGRE

Query:  ERQ
        +RQ
Subjt:  ERQ

A0A5A7TBS0 CCHC-type domain-containing protein4.5e-4343.27Show/hide
Query:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC
        SAQ++P+KKYG ERLKALGAT+F  TT+P D+E                               A+DWW++ E+ +      SW +F+KAF DK+Y  S 
Subjt:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC

Query:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV
        RDAKRNEFL L QGSM V EYEKK+TELS YA+ ++ +E++R KRFE+GLR EIRT VT  ++W +FSKLVE  L V +SL ++R  +        T+  
Subjt:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV

Query:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER
         + R+R  +  + RF PGV  +G+FK          +G+GG  +R
Subjt:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER

A0A5A7UZM6 Gag protease polyprotein-like protein1.1e-4444.08Show/hide
Query:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC
        S Q++PEKKYG ERLKALGAT+F  TT+PAD E                               A+DWW++ E+ +      SW +F+KAF DK+Y  S 
Subjt:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC

Query:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV
        RDAKRNEFL L QGSM + EYEKK+TELS YA+ ++ +E++RCKRFE+GLR EIRT VT  ++W +FSKLVE  L VE+SL ++R  +        T+  
Subjt:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV

Query:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER
         + R+R  +  + RF PGVS +G+FK      +  ++G+ G  +R
Subjt:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER

A0A5D3BB91 Reverse transcriptase1.9e-4942Show/hide
Query:  GKRGRQVEAETQEATGD--RGRKVSEGESSHPQQEVNMEEQIFTRITQRLAERVVSAQANPEKKYGTERLKALGATSFEATTDPADIE------------
        GK  +  +AE   A  +   G   S+ ESS P+ E N+EEQ+  R+ QRL   + SAQ++PEKKYG ERLKALGAT+F  TT+P D+E            
Subjt:  GKRGRQVEAETQEATGD--RGRKVSEGESSHPQQEVNMEEQIFTRITQRLAERVVSAQANPEKKYGTERLKALGATSFEATTDPADIE------------

Query:  -------------------ADDWWKIIENIKEEA--WSWKDFRKAFEDKYYSSSCRDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKR
                           A+DWW++ E+ +      SW +F+KAF DK+Y  S RDAK NEF+ L QG+M V EYEKK+TELS YA+ ++ +E +RCKR
Subjt:  -------------------ADDWWKIIENIKEEA--WSWKDFRKAFEDKYYSSSCRDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKR

Query:  FEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFVDLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER
        FE+GLR EIRT VT  ++W +FSKLVE  L VE+SL ++R  +        T+   + R+R  +  + RF P VS +GSFK      +  ++ +GG  +R
Subjt:  FEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFVDLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER

A0A5D3CTK6 CCHC-type domain-containing protein4.5e-4343.27Show/hide
Query:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC
        SAQ++P+KKYG ERLKALGAT+F  TT+P D+E                               A+DWW++ E+ +      SW +F+KAF DK+Y  S 
Subjt:  SAQANPEKKYGTERLKALGATSFEATTDPADIE-------------------------------ADDWWKIIENIKEEAW--SWKDFRKAFEDKYYSSSC

Query:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV
        RDAKRNEFL L QGSM V EYEKK+TELS YA+ ++ +E++R KRFE+GLR EIRT VT  ++W +FSKLVE  L V +SL ++R  +        T+  
Subjt:  RDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHTTYFV

Query:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER
         + R+R  +  + RF PGV  +G+FK          +G+GG  +R
Subjt:  DLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCGAGGTGGCAAACGAGGCAGGCAAGTGGAGGCCGAAACTCAAGAAGCTACTGGTGATAGAGGAAGAAAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAAATCTTTACGAGGATAACTCAAAGATTAGCTGAAAGGGTTGTATCAGCACAAGCAAATCCAGAAAAAAAGTATGGCACTGAAAGACTGAAGG
CCTTAGGTGCAACATCATTTGAAGCCACGACAGATCCCGCTGATATTGAGGCAGATGATTGGTGGAAGATAATAGAGAACATAAAAGAGGAAGCTTGGAGTTGGAAAGAT
TTTCGAAAGGCCTTTGAAGATAAGTACTATTCGAGCTCTTGTCGTGATGCAAAGAGGAACGAGTTTCTAGGACTTGTTCAGGGATCGATGAGAGTAACAGAATATGAGAA
GAAGTTCACAGAGTTATCAAATTATGCTAGCACTATTGTTGCAAACGAGATAGATCGATGCAAGAGGTTCGAGGATGGTTTGCGAACAGAGATCCGAACGCTTGTGACAA
CAAGTTCGGAGTGGGTTGAGTTCTCGAAGCTTGTGGAGACGACATTATGGGTAGAGCGAAGCCTAGTAGATGACAGAATGGGAAAAGGAGCTGTAGGTGGTGGTCATACC
ACTTATTTTGTTGATTTACCCCGAGACCGATTCCAACGAGGAGATAATAGGAGATTCACTCCAGGTGTCTCTGGAAAAGGAAGCTTTAAACCCCATCGTGGTAGGCAAAC
TACTCCAAGGACTGGTACAGGTGGACGTGAGGAAAGACAGAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTCGAGGTGGCAAACGAGGCAGGCAAGTGGAGGCCGAAACTCAAGAAGCTACTGGTGATAGAGGAAGAAAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAAATCTTTACGAGGATAACTCAAAGATTAGCTGAAAGGGTTGTATCAGCACAAGCAAATCCAGAAAAAAAGTATGGCACTGAAAGACTGAAGG
CCTTAGGTGCAACATCATTTGAAGCCACGACAGATCCCGCTGATATTGAGGCAGATGATTGGTGGAAGATAATAGAGAACATAAAAGAGGAAGCTTGGAGTTGGAAAGAT
TTTCGAAAGGCCTTTGAAGATAAGTACTATTCGAGCTCTTGTCGTGATGCAAAGAGGAACGAGTTTCTAGGACTTGTTCAGGGATCGATGAGAGTAACAGAATATGAGAA
GAAGTTCACAGAGTTATCAAATTATGCTAGCACTATTGTTGCAAACGAGATAGATCGATGCAAGAGGTTCGAGGATGGTTTGCGAACAGAGATCCGAACGCTTGTGACAA
CAAGTTCGGAGTGGGTTGAGTTCTCGAAGCTTGTGGAGACGACATTATGGGTAGAGCGAAGCCTAGTAGATGACAGAATGGGAAAAGGAGCTGTAGGTGGTGGTCATACC
ACTTATTTTGTTGATTTACCCCGAGACCGATTCCAACGAGGAGATAATAGGAGATTCACTCCAGGTGTCTCTGGAAAAGGAAGCTTTAAACCCCATCGTGGTAGGCAAAC
TACTCCAAGGACTGGTACAGGTGGACGTGAGGAAAGACAGAGGTAG
Protein sequenceShow/hide protein sequence
MARGGKRGRQVEAETQEATGDRGRKVSEGESSHPQQEVNMEEQIFTRITQRLAERVVSAQANPEKKYGTERLKALGATSFEATTDPADIEADDWWKIIENIKEEAWSWKD
FRKAFEDKYYSSSCRDAKRNEFLGLVQGSMRVTEYEKKFTELSNYASTIVANEIDRCKRFEDGLRTEIRTLVTTSSEWVEFSKLVETTLWVERSLVDDRMGKGAVGGGHT
TYFVDLPRDRFQRGDNRRFTPGVSGKGSFKPHRGRQTTPRTGTGGREERQR