; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008519 (gene) of Snake gourd v1 genome

Gene IDTan0008519
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG04:50966998..50967873
RNA-Seq ExpressionTan0008519
SyntenyTan0008519
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060484.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa]8.1e-4449.3Show/hide
Query:  SAQADPEKKSAIERLKALGATTFE--------------------------------AAFLLQKGADDWWKITESRKGEAW--SWKEFRKAFEDKYYPSSY
        S Q+DPEKK  IERLKALGATTF                                 AAFLLQ GA+DWW++ ESR+      SW EF+KAF DK+YP S+
Subjt:  SAQADPEKKSAIERLKALGATTFE--------------------------------AAFLLQKGADDWWKITESRKGEAW--SWKEFRKAFEDKYYPSSY

Query:  RDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSV
        RDAKRNEFL L Q  M + EYEKK+T LS YA+ ++ +E++RCKRFE+GLR EIRTPVTA ++W +FSKLVE ALRVE+S +++R  +        T+S 
Subjt:  RDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSV

Query:  GLSRDRFQQEIIG
         + R+R  +E  G
Subjt:  GLSRDRFQQEIIG

TYJ95881.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]5.8e-5044.48Show/hide
Query:  MARGGKRGR-QVEVGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE------------------
        M RG  R     E          G   S+ ESS P+ E N+EEQ+  R+ QRLV  + SAQ+DPEKK   ERLKALGATTF                   
Subjt:  MARGGKRGR-QVEVGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE------------------

Query:  --------------AAFLLQKGADDWWKITESRKGEA--WSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDR
                      AAFLLQ  A+DWW++ ESR+      SW EF+KAF DK+YP S+RDAK NEF+ L Q  M V EYEKK+T LSKYA+ ++ +E +R
Subjt:  --------------AAFLLQKGADDWWKITESRKGEA--WSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDR

Query:  CKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSVGLSRDRFQQEIIGDSLQVFLEKGALNPIVVGS
        CKRFE+GLR EIRTPVTA ++W +FSKLVE ALRVE+S+ + +  +EA      T+S  + R+R  +E  G  +     +G+      GS
Subjt:  CKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSVGLSRDRFQQEIIGDSLQVFLEKGALNPIVVGS

XP_038877272.1 uncharacterized protein LOC120069556 [Benincasa hispida]2.6e-4244.49Show/hide
Query:  RGRQVEVGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE-------------------------
        R R+  V T +AT  +  E  +GESSHPQ E   +EQ+  R  + L EN+G    DP+K  +IERLKALGA+TFE                         
Subjt:  RGRQVEVGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE-------------------------

Query:  -------AAFLLQKGADDWWKITESRKG--EAWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDG
               A FLLQK A+DWW++ + R+   E  +W+EF KAF +K+YP ++RDAKRNEFL LVQ  + V EYE+K+T L KYAS I+ +E +R +RF++G
Subjt:  -------AAFLLQKGADDWWKITESRKG--EAWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDG

Query:  LRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHT-TYSVGL
        LR EIRT VTAS +  +F KLVE ALRVE S+    + + A   G +  YS+G+
Subjt:  LRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHT-TYSVGL

XP_038877572.1 uncharacterized protein LOC120069826 [Benincasa hispida]7.3e-4549.29Show/hide
Query:  MEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE---------------AAFLLQKGADDWWKITESRKG--EAWSWKEFRKAFEDKYYPSS
        ME+++  RI QRLV  VGS Q DPEKK  +ERLKALGATTF+               A FLLQKG   WWK+  +R+   EA SW EFRK FEDKYYPS+
Subjt:  MEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE---------------AAFLLQKGADDWWKITESRKG--EAWSWKEFRKAFEDKYYPSS

Query:  YRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYS
        +R+AKR+EFL L Q  M   +YE++F+ LS+YA  I+A E +RC+RF +GLR  I TP+T+++ WV F++LV TA++VE+S          V G HTT  
Subjt:  YRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYS

Query:  VGLSRDRFQQE
         G ++D  + E
Subjt:  VGLSRDRFQQE

XP_038890030.1 uncharacterized protein LOC120079741 [Benincasa hispida]5.2e-4348.62Show/hide
Query:  VSEGESSHPQQEV--NMEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE--------------------------------AAFLLQKGAD
        +SEGESS PQ      +E+ +F RI QRL  +VG  +A+ EKK  IER KALGA TFE                                A FLLQK A+
Subjt:  VSEGESSHPQQEV--NMEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE--------------------------------AAFLLQKGAD

Query:  DWWKITESRKG--EAWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVE
         WWK+   R+   +A  W EF+KA +DKY PSS+RDAKR+EFL L Q  M V EYE+KFT LS+YA  I+A E DRC++FE GLR EI+TPVT+++ W++
Subjt:  DWWKITESRKG--EAWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVE

Query:  FSKLVETALRVEQSVVDD
        F++LVE A RVE+S+ DD
Subjt:  FSKLVETALRVEQSVVDD

TrEMBL top hitse value%identityAlignment
A0A5A7TBS0 CCHC-type domain-containing protein3.7e-4248.83Show/hide
Query:  SAQADPEKKSAIERLKALGATTFE--------------------------------AAFLLQKGADDWWKITESRKGEAW--SWKEFRKAFEDKYYPSSY
        SAQ+DP+KK  IERLKALGATTF                                 AAFLLQ GA+DWW++ ESR+      SW EF+KAF DK+YP S+
Subjt:  SAQADPEKKSAIERLKALGATTFE--------------------------------AAFLLQKGADDWWKITESRKGEAW--SWKEFRKAFEDKYYPSSY

Query:  RDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSV
        RDAKRNEFL L Q  M V EYEKK+T LSKYA+ ++ +E++R KRFE+GLR EIRT VTA ++W +FSKLVE ALRV +S +++R  +        T+S 
Subjt:  RDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSV

Query:  GLSRDRFQQEIIG
         + R+R  +E  G
Subjt:  GLSRDRFQQEIIG

A0A5A7UZM6 Gag protease polyprotein-like protein3.9e-4449.3Show/hide
Query:  SAQADPEKKSAIERLKALGATTFE--------------------------------AAFLLQKGADDWWKITESRKGEAW--SWKEFRKAFEDKYYPSSY
        S Q+DPEKK  IERLKALGATTF                                 AAFLLQ GA+DWW++ ESR+      SW EF+KAF DK+YP S+
Subjt:  SAQADPEKKSAIERLKALGATTFE--------------------------------AAFLLQKGADDWWKITESRKGEAW--SWKEFRKAFEDKYYPSSY

Query:  RDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSV
        RDAKRNEFL L Q  M + EYEKK+T LS YA+ ++ +E++RCKRFE+GLR EIRTPVTA ++W +FSKLVE ALRVE+S +++R  +        T+S 
Subjt:  RDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSV

Query:  GLSRDRFQQEIIG
         + R+R  +E  G
Subjt:  GLSRDRFQQEIIG

A0A5A7V0R0 Reverse transcriptase6.2e-4244.36Show/hide
Query:  RGGKRGRQVEVGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE---------------------
        R G+R RQ + G Q  T    +  S GESS          + FTR TQ +        +DPEK   IERLK LGAT FE                     
Subjt:  RGGKRGRQVEVGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE---------------------

Query:  -----------AAFLLQKGADDWWKITESRKGE--AWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKR
                   A FLLQK A+ WWK   +R+ +  A  W+ FR  FEDKYYPS+Y +AKRNEFLGL Q  + V EYE+K+T LS+YA  IVA+E DRC+R
Subjt:  -----------AAFLLQKGADDWWKITESRKGE--AWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKR

Query:  FEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSV-GLSRDRFQQEIIGDSLQVF
        FE GLR EIRTPVTA ++W  FS+LVETALRVEQS+ +++   E   G        G  + RF  EI   S Q F
Subjt:  FEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSV-GLSRDRFQQEIIGDSLQVF

A0A5D3BB91 Reverse transcriptase2.8e-5044.48Show/hide
Query:  MARGGKRGR-QVEVGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE------------------
        M RG  R     E          G   S+ ESS P+ E N+EEQ+  R+ QRLV  + SAQ+DPEKK   ERLKALGATTF                   
Subjt:  MARGGKRGR-QVEVGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFE------------------

Query:  --------------AAFLLQKGADDWWKITESRKGEA--WSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDR
                      AAFLLQ  A+DWW++ ESR+      SW EF+KAF DK+YP S+RDAK NEF+ L Q  M V EYEKK+T LSKYA+ ++ +E +R
Subjt:  --------------AAFLLQKGADDWWKITESRKGEA--WSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDR

Query:  CKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSVGLSRDRFQQEIIGDSLQVFLEKGALNPIVVGS
        CKRFE+GLR EIRTPVTA ++W +FSKLVE ALRVE+S+ + +  +EA      T+S  + R+R  +E  G  +     +G+      GS
Subjt:  CKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSVGLSRDRFQQEIIGDSLQVFLEKGALNPIVVGS

A0A5D3CTK6 CCHC-type domain-containing protein3.7e-4248.83Show/hide
Query:  SAQADPEKKSAIERLKALGATTFE--------------------------------AAFLLQKGADDWWKITESRKGEAW--SWKEFRKAFEDKYYPSSY
        SAQ+DP+KK  IERLKALGATTF                                 AAFLLQ GA+DWW++ ESR+      SW EF+KAF DK+YP S+
Subjt:  SAQADPEKKSAIERLKALGATTFE--------------------------------AAFLLQKGADDWWKITESRKGEAW--SWKEFRKAFEDKYYPSSY

Query:  RDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSV
        RDAKRNEFL L Q  M V EYEKK+T LSKYA+ ++ +E++R KRFE+GLR EIRT VTA ++W +FSKLVE ALRV +S +++R  +        T+S 
Subjt:  RDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTTYSV

Query:  GLSRDRFQQEIIG
         + R+R  +E  G
Subjt:  GLSRDRFQQEIIG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCGAGGTGGCAAACGAGGCAGGCAAGTAGAGGTTGGAACTCAAGAAGCTACTGGTGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAAATCTTTACGAGGATAACTCAAAGATTAGTTGAAAATGTTGGATCAGCACAAGCAGATCCAGAAAAGAAGTCTGCCATTGAAAGACTGAAGG
CCTTAGGTGCAACAACATTTGAAGCAGCATTCTTGCTCCAGAAAGGAGCAGATGATTGGTGGAAGATAACAGAGAGCAGAAAAGGGGAAGCTTGGAGCTGGAAAGAGTTT
CGAAAGGCCTTTGAAGATAAGTATTATCCGAGCTCTTATCGTGATGCAAAGAGGAACGAGTTTCTAGGACTTGTTCAGAGATTGATGATAGTAACAGAATATGAGAAGAA
GTTCACAGTGTTATCAAAGTATGCTAGCACTATTGTTGCAAACGAGATAGATCGATGCAAGAGGTTCGAGGATGGTCTACGAACAGAGATCCGAACGCCTGTGACAGCAA
GTTCGGAGTGGGTTGAGTTCTCCAAGCTTGTGGAGACGGCATTACGGGTAGAGCAAAGCGTAGTAGATGACAGAATGGGAAAAGAAGCTGTAGGTGGTGGTCATACCACT
TATTCTGTTGGTCTATCACGAGACCGATTCCAGCAGGAGATAATAGGAGATTCACTCCAGGTGTTTCTGGAAAAGGGAGCTTTAAACCCCATCGTGGTGGGCAGTCTACT
CCAAGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTCGAGGTGGCAAACGAGGCAGGCAAGTAGAGGTTGGAACTCAAGAAGCTACTGGTGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAAATCTTTACGAGGATAACTCAAAGATTAGTTGAAAATGTTGGATCAGCACAAGCAGATCCAGAAAAGAAGTCTGCCATTGAAAGACTGAAGG
CCTTAGGTGCAACAACATTTGAAGCAGCATTCTTGCTCCAGAAAGGAGCAGATGATTGGTGGAAGATAACAGAGAGCAGAAAAGGGGAAGCTTGGAGCTGGAAAGAGTTT
CGAAAGGCCTTTGAAGATAAGTATTATCCGAGCTCTTATCGTGATGCAAAGAGGAACGAGTTTCTAGGACTTGTTCAGAGATTGATGATAGTAACAGAATATGAGAAGAA
GTTCACAGTGTTATCAAAGTATGCTAGCACTATTGTTGCAAACGAGATAGATCGATGCAAGAGGTTCGAGGATGGTCTACGAACAGAGATCCGAACGCCTGTGACAGCAA
GTTCGGAGTGGGTTGAGTTCTCCAAGCTTGTGGAGACGGCATTACGGGTAGAGCAAAGCGTAGTAGATGACAGAATGGGAAAAGAAGCTGTAGGTGGTGGTCATACCACT
TATTCTGTTGGTCTATCACGAGACCGATTCCAGCAGGAGATAATAGGAGATTCACTCCAGGTGTTTCTGGAAAAGGGAGCTTTAAACCCCATCGTGGTGGGCAGTCTACT
CCAAGGATAG
Protein sequenceShow/hide protein sequence
MARGGKRGRQVEVGTQEATGDRGREVSEGESSHPQQEVNMEEQIFTRITQRLVENVGSAQADPEKKSAIERLKALGATTFEAAFLLQKGADDWWKITESRKGEAWSWKEF
RKAFEDKYYPSSYRDAKRNEFLGLVQRLMIVTEYEKKFTVLSKYASTIVANEIDRCKRFEDGLRTEIRTPVTASSEWVEFSKLVETALRVEQSVVDDRMGKEAVGGGHTT
YSVGLSRDRFQQEIIGDSLQVFLEKGALNPIVVGSLLQG