CuGenDBv2

Tan0006496 (gene) of Snake gourd v1 genome

Gene ID: Tan0006496
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase
Genome location: LG05:66412239..66413165
RNA-Seq Expression: Tan0006496
Synteny: Tan0006496
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
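The genome location field follows a `SEQID:START..END` pattern. A minimal Python sketch for splitting it into parts is shown here; the function names `parse_location` and `span_length` are hypothetical, and the 1-based, inclusive coordinate convention is an assumption that the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG05:66412239..66413165")
print(seqid, start, end, span_length(start, end))
```

Under a 0-based, half-open convention the span would be one base shorter, so the convention should be checked before the coordinates are used to extract sequence.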


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039476.1 uncharacterized protein E6C27_scaffold64G002900 [Cucumis melo var. makuwa] (e value: 4.4e-48; % identity: 49.14)
Query:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS
        SAQ+DP+KKYGIERLKALGATTF  TT+P                                  GA+DWW++ ESR+   GD  SW EF+KAF DK+YP S
Subjt:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS

Query:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS
        +RDAKRNEFL L QGSMT+ EYEKK+T+LSKYA+ ++  E++R KRFE+GLR EIRT VT  + W  FSKLVE ALRV +SL ++R R+        T+S
Subjt:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS

Query:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGG
          + R+R  +    RF P V  +G+FK    G
Subjt:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGG

KAA0060484.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa] (e value: 7.2e-51; % identity: 48.02)
Query:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS
        S Q+DPEKKYGIERLKALGATTF  TT+P                                  GA+DWW++ ESR+   GD  SW EF+KAF DK+YP S
Subjt:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS

Query:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS
        +RDAKRNEFL L QGSMTI EYEKK+T+LS YA+ ++  E++RCKRFE+GLR EIRTPVT  + W  FSKLVE ALRVE+SL ++R ++        T+S
Subjt:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS

Query:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGGQTTPRTAQWAWGKTEAAGSS
          + R+R  +    RF P VS +G+FK    G +  ++   + G   ++GSS
Subjt:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGGQTTPRTAQWAWGKTEAAGSS

TYJ95881.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa] (e value: 2.8e-55; % identity: 46.23)
Query:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQTDPEKKYGIERLKALGATTFECTTDP-------------
        M RG  R     E       +  G   S+ ESS P+ E N+EEQ+  R+ QRL   + SAQ+DPEKKYG ERLKALGATTF  TT+P             
Subjt:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQTDPEKKYGIERLKALGATTFECTTDP-------------

Query:  --------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEID
                              A+DWW++ ESR+   GD  SW EF+KAF DK+YP S+RDAK NEF+ L QG+MT+ EYEKK+T+LSKYA+ ++  E +
Subjt:  --------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEID

Query:  RCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYSVGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGGQT
        RCKRFE+GLR EIRTPVT  + W  FSKLVE ALRVE+SL ++R R+        T+S  + R+R  +    RF P VS +GSFK    G +
Subjt:  RCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYSVGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGGQT

TYK15233.1 uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa] (e value: 4.4e-48; % identity: 49.14)
Query:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS
        SAQ+DP+KKYGIERLKALGATTF  TT+P                                  GA+DWW++ ESR+   GD  SW EF+KAF DK+YP S
Subjt:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS

Query:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS
        +RDAKRNEFL L QGSMT+ EYEKK+T+LSKYA+ ++  E++R KRFE+GLR EIRT VT  + W  FSKLVE ALRV +SL ++R R+        T+S
Subjt:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS

Query:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGG
          + R+R  +    RF P V  +G+FK    G
Subjt:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGG

XP_038896366.1 uncharacterized protein LOC120084630 [Benincasa hispida] (e value: 3.7e-47; % identity: 44.36)
Query:  SEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWW
        SE  SSHP+++   EEQ+  + TQRL  S+ + Q D +KKYGIERLKALGAT FE T DP                                 K A+DWW
Subjt:  SEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWW

Query:  KITESRKG--DAWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSK
        K+ ++R+G  +   W+EFRKAF +++YP S+RD K++EFL L+QG+MTI+EYE K+TKLSKYA  IV  E +RC+RFE GLR EIRTPVT +++W  FS+
Subjt:  KITESRKG--DAWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSK

Query:  LVETALRVERSLVDDRMRKGAVGGGHTTYSVGLPRDRFQRGDDRRFTPCVSGKGSFK
        L+E  +RVERSL+ + +R                  R++R   R FTP V  +  +K
Subjt:  LVETALRVERSLVDDRMRKGAVGGGHTTYSVGLPRDRFQRGDDRRFTPCVSGKGSFK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TBS0 CCHC-type domain-containing protein (e value: 2.1e-48; % identity: 49.14)
Query:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS
        SAQ+DP+KKYGIERLKALGATTF  TT+P                                  GA+DWW++ ESR+   GD  SW EF+KAF DK+YP S
Subjt:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS

Query:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS
        +RDAKRNEFL L QGSMT+ EYEKK+T+LSKYA+ ++  E++R KRFE+GLR EIRT VT  + W  FSKLVE ALRV +SL ++R R+        T+S
Subjt:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS

Query:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGG
          + R+R  +    RF P V  +G+FK    G
Subjt:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGG

A0A5A7UZM6 Gag protease polyprotein-like protein (e value: 3.5e-51; % identity: 48.02)
Query:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS
        S Q+DPEKKYGIERLKALGATTF  TT+P                                  GA+DWW++ ESR+   GD  SW EF+KAF DK+YP S
Subjt:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS

Query:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS
        +RDAKRNEFL L QGSMTI EYEKK+T+LS YA+ ++  E++RCKRFE+GLR EIRTPVT  + W  FSKLVE ALRVE+SL ++R ++        T+S
Subjt:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS

Query:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGGQTTPRTAQWAWGKTEAAGSS
          + R+R  +    RF P VS +G+FK    G +  ++   + G   ++GSS
Subjt:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGGQTTPRTAQWAWGKTEAAGSS

A0A5A7V0R0 Reverse transcriptase (e value: 1.2e-46; % identity: 44.33)
Query:  RGGKRGRQVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQTDPEKKYGIERLKALGATTFECTTDP----------------
        R G+R RQ + G Q  T    +  S GESS          + FTR TQ +        +DPEK YGIERLK LGAT FE +TDP                
Subjt:  RGGKRGRQVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQTDPEKKYGIERLKALGATTFECTTDP----------------

Query:  -----------------KGADDWWKITESRKGD--AWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKR
                         K A+ WWK   +R+ D  A  W+ FR  FEDKYYPS+Y +AKRNEFLGL QGS+++ EYE+K+T+LS+YA  IV SE DRC+R
Subjt:  -----------------KGADDWWKITESRKGD--AWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKR

Query:  FEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYSVGLPRDRFQRGDDRRFTP--CVSGKGSFKPYRGGQTT
        FE GLR EIRTPVT  +KW  FS+LVETALRVE+S+ ++   K AV     T +       F+  + RRFTP   +S +  FK   GGQ +
Subjt:  FEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYSVGLPRDRFQRGDDRRFTP--CVSGKGSFKPYRGGQTT

A0A5D3BB91 Reverse transcriptase (e value: 1.4e-55; % identity: 46.23)
Query:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQTDPEKKYGIERLKALGATTFECTTDP-------------
        M RG  R     E       +  G   S+ ESS P+ E N+EEQ+  R+ QRL   + SAQ+DPEKKYG ERLKALGATTF  TT+P             
Subjt:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQTDPEKKYGIERLKALGATTFECTTDP-------------

Query:  --------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEID
                              A+DWW++ ESR+   GD  SW EF+KAF DK+YP S+RDAK NEF+ L QG+MT+ EYEKK+T+LSKYA+ ++  E +
Subjt:  --------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSSYRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEID

Query:  RCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYSVGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGGQT
        RCKRFE+GLR EIRTPVT  + W  FSKLVE ALRVE+SL ++R R+        T+S  + R+R  +    RF P VS +GSFK    G +
Subjt:  RCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYSVGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGGQT

A0A5D3CTK6 CCHC-type domain-containing protein (e value: 2.1e-48; % identity: 49.14)
Query:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS
        SAQ+DP+KKYGIERLKALGATTF  TT+P                                  GA+DWW++ ESR+   GD  SW EF+KAF DK+YP S
Subjt:  SAQTDPEKKYGIERLKALGATTFECTTDP---------------------------------KGADDWWKITESRK---GDAWSWKEFRKAFEDKYYPSS

Query:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS
        +RDAKRNEFL L QGSMT+ EYEKK+T+LSKYA+ ++  E++R KRFE+GLR EIRT VT  + W  FSKLVE ALRV +SL ++R R+        T+S
Subjt:  YRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTYS

Query:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGG
          + R+R  +    RF P V  +G+FK    G
Subjt:  VGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found
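The % identity values reported for the hits above can be illustrated with a short Python sketch. It assumes the usual BLAST convention (identical columns divided by the total alignment length, gap columns included), which this record does not state explicitly. The function name `percent_identity` and the two short aligned strings are hypothetical and are not taken from the alignments in this record.

```python
def percent_identity(query, subject):
    """Percent of alignment columns where both rows carry the same residue.

    Both arguments are rows of one pairwise alignment, so they must have
    equal length; '-' marks a gap. Gap columns count toward the alignment
    length but never toward the identities (an ASSUMED convention).
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    if not query:
        raise ValueError("alignment is empty")
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return round(100.0 * identical / len(query), 2)


# Hypothetical toy alignment: 4 identical columns out of 6.
print(percent_identity("MKT-AY", "MRTGAY"))
```

Under this convention, 114 identical columns in a 232-column alignment would give 49.14, though the identity counts themselves are not shown on this page.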

Sequences
CDS sequence
ATGGCTCGAGGTGGAAAACGAGGCAGGCAAGTGGAGACTGGAACTCAAGAAGCTACTAGTGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAGATCTTTACGCGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCAGCACAAACAGATCCAGAAAAGAAGTATGGCATTGAAAGATTGAAAG
CCTTAGGTGCAACAACATTTGAATGCACGACAGATCCCAAAGGAGCAGATGATTGGTGGAAGATAACAGAGAGCAGAAAAGGGGATGCTTGGAGTTGGAAAGAGTTTCGA
AAGGCCTTTGAAGATAAGTATTACCCGAGCTCATATCGTGATGCGAAGAGAAACGAGTTTCTAGGACTCGTTCAGGGGTCGATGACTATAATAGAATATGAGAAGAAGTT
CACAAAGTTATCAAAGTATGCTAGCACTATTGTTACAAGTGAGATAGATCGATGCAAGAGGTTCGAGGATGGTTTACGAGCAGAGATCCGAACACCTGTGACAACAAGTT
CTAAGTGGGTTGCGTTCTCTAAGCTTGTGGAGACGGCATTACGAGTAGAGCGAAGCCTAGTAGATGACAGAATGAGAAAAGGAGCTGTAGGTGGTGGTCATACCACTTAT
TCTGTTGGCTTACCTCGAGACCGATTCCAACGAGGAGATGATAGGAGATTCACTCCATGTGTCTCTGGAAAAGGAAGCTTTAAACCCTATCGTGGTGGGCAAACTACTCC
AAGGACTGCACAGTGGGCGTGGGGAAAGACAGAGGCGGCGGGATCAAGTTTTCGATAG
mRNA sequence
ATGGCTCGAGGTGGAAAACGAGGCAGGCAAGTGGAGACTGGAACTCAAGAAGCTACTAGTGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAGATCTTTACGCGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCAGCACAAACAGATCCAGAAAAGAAGTATGGCATTGAAAGATTGAAAG
CCTTAGGTGCAACAACATTTGAATGCACGACAGATCCCAAAGGAGCAGATGATTGGTGGAAGATAACAGAGAGCAGAAAAGGGGATGCTTGGAGTTGGAAAGAGTTTCGA
AAGGCCTTTGAAGATAAGTATTACCCGAGCTCATATCGTGATGCGAAGAGAAACGAGTTTCTAGGACTCGTTCAGGGGTCGATGACTATAATAGAATATGAGAAGAAGTT
CACAAAGTTATCAAAGTATGCTAGCACTATTGTTACAAGTGAGATAGATCGATGCAAGAGGTTCGAGGATGGTTTACGAGCAGAGATCCGAACACCTGTGACAACAAGTT
CTAAGTGGGTTGCGTTCTCTAAGCTTGTGGAGACGGCATTACGAGTAGAGCGAAGCCTAGTAGATGACAGAATGAGAAAAGGAGCTGTAGGTGGTGGTCATACCACTTAT
TCTGTTGGCTTACCTCGAGACCGATTCCAACGAGGAGATGATAGGAGATTCACTCCATGTGTCTCTGGAAAAGGAAGCTTTAAACCCTATCGTGGTGGGCAAACTACTCC
AAGGACTGCACAGTGGGCGTGGGGAAAGACAGAGGCGGCGGGATCAAGTTTTCGATAG
Protein sequence
MARGGKRGRQVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSAQTDPEKKYGIERLKALGATTFECTTDPKGADDWWKITESRKGDAWSWKEFR
KAFEDKYYPSSYRDAKRNEFLGLVQGSMTIIEYEKKFTKLSKYASTIVTSEIDRCKRFEDGLRAEIRTPVTTSSKWVAFSKLVETALRVERSLVDDRMRKGAVGGGHTTY
SVGLPRDRFQRGDDRRFTPCVSGKGSFKPYRGGQTTPRTAQWAWGKTEAAGSSFR
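The relationship between the CDS and the protein sequence can be checked by translating the CDS. The sketch below is a minimal Python illustration that assumes the standard genetic code (NCBI translation table 1) and frame 0; the function name `translate` is hypothetical. Only the first 24 and last 9 nucleotides of the CDS above are used as inputs, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 nt and last 9 nt of the CDS listed above.
print(translate("ATGGCTCGAGGTGGAAAACGAGGC"))
print(translate("TTTCGATAG"))
```

Passing the full CDS, with the line breaks removed, to `translate` should reproduce the protein sequence listed above if the standard-code assumption holds.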