; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020535 (gene) of Snake gourd v1 genome

Gene IDTan0020535
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationLG05:71097160..71097801
RNA-Seq ExpressionTan0020535
SyntenyTan0020535
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]2.2e-4649.54Show/hide
Query:  MANASSEFFLSSIGAPHFSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERAS-----SSSSEPSTV
        MANA       S+ +  FS+PPLNQ+LNQ+ ++KL+R N+LLWK LAL IL+ YKL+ HL G   CP+ F+   S+   T++E  +     +SSS    +
Subjt:  MANASSEFFLSSIGAPHFSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERAS-----SSSSEPSTV

Query:  INPLYESWV----------------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLI
        +N L+E WV                      MG    +DLWDA Q  FGVQSR EEDFLRQ+ Q TRKGN KM EYL +MK +V++LGQ GSPVP R+LI
Subjt:  INPLYESWV----------------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLI

Query:  SQVLLGLDEEYNPIIVGI
        SQVLLGLDE YN +IV I
Subjt:  SQVLLGLDEEYNPIIVGI

KAA0067279.1 uncharacterized protein E6C27_scaffold418G001000 [Cucumis melo var. makuwa]3.6e-3345.95Show/hide
Query:  FSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERA------SSSSSEPSTVINPLYESWV-------
        F++P LNQ+LNQ+T+IKL+RGN+LLWK LAL IL+SYKL  HL G   C    +   +    +I E A      +SSSS     +NP YE W+       
Subjt:  FSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERA------SSSSSEPSTVINPLYESWV-------

Query:  ---------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLIS
                       MG    KDLW+A Q LFG+QSR +EDFL Q FQ T+KGN+ M EYLR MK +V +LGQA S VP+ +++S
Subjt:  ---------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLIS

TYJ96311.1 uncharacterized protein E5676_scaffold1970G00140 [Cucumis melo var. makuwa]3.1e-4047.52Show/hide
Query:  MANASSEFFLSSIGAPHFSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERAS-----SSSSEPSTV
        MANA       S+ +  FS+PPLNQ+LNQ+ ++KL+R N+LLWK LAL IL+ YKL+ HL G   CP+ F+   S+   T++E  +     +SSS    +
Subjt:  MANASSEFFLSSIGAPHFSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERAS-----SSSSEPSTV

Query:  INPLYESWV----------------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLI
        +N L+E WV                      MG    +DLWDA Q  FGVQSR EEDFLRQ+ Q TRKGN KM EYL +MK +V++LGQ GSPVP R+LI
Subjt:  INPLYESWV----------------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLI

Query:  SQ
        SQ
Subjt:  SQ

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]4.1e-4552.28Show/hide
Query:  FSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKAC-PTMFLPQGSTEGITISERASSSSSEPSTVINPLYESW-------------
        F+SPPLNQLLNQ+TSIK++RGNFLLW+NLAL ILRSYKL  +L G K C PT  +P  +   I       S+SS+ S  +NP YE+W             
Subjt:  FSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKAC-PTMFLPQGSTEGITISERASSSSSEPSTVINPLYESW-------------

Query:  ---------VMGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLISQVLLGLDEEYNPIIVGI
                 VMG + +++LW A+Q LFGVQSR E D+L+QVFQQT KG+++M EYL++MK H ++L  AGS V  R L+SQVL GLDEEYNPI+V +
Subjt:  ---------VMGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLISQVLLGLDEEYNPIIVGI

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]1.0e-3547.06Show/hide
Query:  TSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMF----------LPQGSTEGI--------TISER---ASSSSSEPSTVINPLYES-------
        T+IKL++ N+LLW+NLAL ILRSY+L+ HL G   CP  F          +P G   G+        +++ +    ++S+S P   +NP YES       
Subjt:  TSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMF----------LPQGSTEGI--------TISER---ASSSSSEPSTVINPLYES-------

Query:  ---W------------VMGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLISQVLLGLDEEYNPI
           W            VMG    K LW AIQ LFG+QSR  ED+LRQVFQQT KG MKM EYLR+MK H ++LG  GSPVPTR+L+SQVLLGLDEE+NP 
Subjt:  ---W------------VMGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLISQVLLGLDEEYNPI

Query:  IVGI
        +  I
Subjt:  IVGI

TrEMBL top hitse value%identityAlignment
A0A5A7SIT7 Uncharacterized protein1.1e-4649.54Show/hide
Query:  MANASSEFFLSSIGAPHFSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERAS-----SSSSEPSTV
        MANA       S+ +  FS+PPLNQ+LNQ+ ++KL+R N+LLWK LAL IL+ YKL+ HL G   CP+ F+   S+   T++E  +     +SSS    +
Subjt:  MANASSEFFLSSIGAPHFSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERAS-----SSSSEPSTV

Query:  INPLYESWV----------------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLI
        +N L+E WV                      MG    +DLWDA Q  FGVQSR EEDFLRQ+ Q TRKGN KM EYL +MK +V++LGQ GSPVP R+LI
Subjt:  INPLYESWV----------------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLI

Query:  SQVLLGLDEEYNPIIVGI
        SQVLLGLDE YN +IV I
Subjt:  SQVLLGLDEEYNPIIVGI

A0A5A7VPY0 Uncharacterized protein1.8e-3345.95Show/hide
Query:  FSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERA------SSSSSEPSTVINPLYESWV-------
        F++P LNQ+LNQ+T+IKL+RGN+LLWK LAL IL+SYKL  HL G   C    +   +    +I E A      +SSSS     +NP YE W+       
Subjt:  FSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERA------SSSSSEPSTVINPLYESWV-------

Query:  ---------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLIS
                       MG    KDLW+A Q LFG+QSR +EDFL Q FQ T+KGN+ M EYLR MK +V +LGQA S VP+ +++S
Subjt:  ---------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLIS

A0A5D3BCH9 Uncharacterized protein1.5e-4047.52Show/hide
Query:  MANASSEFFLSSIGAPHFSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERAS-----SSSSEPSTV
        MANA       S+ +  FS+PPLNQ+LNQ+ ++KL+R N+LLWK LAL IL+ YKL+ HL G   CP+ F+   S+   T++E  +     +SSS    +
Subjt:  MANASSEFFLSSIGAPHFSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERAS-----SSSSEPSTV

Query:  INPLYESWV----------------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLI
        +N L+E WV                      MG    +DLWDA Q  FGVQSR EEDFLRQ+ Q TRKGN KM EYL +MK +V++LGQ GSPVP R+LI
Subjt:  INPLYESWV----------------------MGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLI

Query:  SQ
        SQ
Subjt:  SQ

A0A6J1D5J0 uncharacterized protein LOC1110175016.2e-3159.38Show/hide
Query:  SSSSSEPSTVINPLYESW----------------------VMGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQA
        SSSS      INPLYESW                      VMG     DLW AIQ LFGVQS+ EED+LRQVFQQTRKG++KM+++LR+MK H ++LGQA
Subjt:  SSSSSEPSTVINPLYESW----------------------VMGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQA

Query:  GSPVPTRSLISQVLLGLDEEYNPIIVGI
        GSPVPTRSLISQVLLGLDEEYNP++  I
Subjt:  GSPVPTRSLISQVLLGLDEEYNPIIVGI

A0A6J1DCW4 uncharacterized protein LOC1110195982.0e-4552.28Show/hide
Query:  FSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKAC-PTMFLPQGSTEGITISERASSSSSEPSTVINPLYESW-------------
        F+SPPLNQLLNQ+TSIK++RGNFLLW+NLAL ILRSYKL  +L G K C PT  +P  +   I       S+SS+ S  +NP YE+W             
Subjt:  FSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKAC-PTMFLPQGSTEGITISERASSSSSEPSTVINPLYESW-------------

Query:  ---------VMGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLISQVLLGLDEEYNPIIVGI
                 VMG + +++LW A+Q LFGVQSR E D+L+QVFQQT KG+++M EYL++MK H ++L  AGS V  R L+SQVL GLDEEYNPI+V +
Subjt:  ---------VMGCNITKDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLISQVLLGLDEEYNPIIVGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACGCCTCGTCTGAATTTTTCCTGTCTTCTATCGGAGCACCTCACTTCAGCAGTCCTCCGTTAAATCAACTACTTAACCAGGTAACTTCGATAAAATTAGAGAG
GGGAAACTTTCTACTGTGGAAGAATTTAGCTCTTTCCATCCTTCGGAGCTACAAACTCAAATGCCATCTCCTTGGGACCAAAGCTTGCCCAACCATGTTTCTACCTCAAG
GATCTACCGAAGGAATTACAATTTCTGAAAGAGCATCCTCCTCAAGCTCAGAGCCATCGACTGTGATCAATCCACTGTATGAGTCTTGGGTAATGGGGTGCAACATAACT
AAAGACCTTTGGGATGCCATTCAAACCTTGTTTGGGGTCCAGTCCAGAGTTGAAGAGGATTTTCTGCGTCAAGTCTTCCAACAGACACGCAAAGGTAATATGAAAATGTC
AGAATACTTACGAATCATGAAATGTCATGTTGAAAGTCTTGGTCAAGCAGGGAGTCCAGTGCCCACTAGGTCTCTAATTTCGCAGGTTCTACTTGGATTAGATGAAGAAT
ACAACCCTATTATTGTTGGAATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAACGCCTCGTCTGAATTTTTCCTGTCTTCTATCGGAGCACCTCACTTCAGCAGTCCTCCGTTAAATCAACTACTTAACCAGGTAACTTCGATAAAATTAGAGAG
GGGAAACTTTCTACTGTGGAAGAATTTAGCTCTTTCCATCCTTCGGAGCTACAAACTCAAATGCCATCTCCTTGGGACCAAAGCTTGCCCAACCATGTTTCTACCTCAAG
GATCTACCGAAGGAATTACAATTTCTGAAAGAGCATCCTCCTCAAGCTCAGAGCCATCGACTGTGATCAATCCACTGTATGAGTCTTGGGTAATGGGGTGCAACATAACT
AAAGACCTTTGGGATGCCATTCAAACCTTGTTTGGGGTCCAGTCCAGAGTTGAAGAGGATTTTCTGCGTCAAGTCTTCCAACAGACACGCAAAGGTAATATGAAAATGTC
AGAATACTTACGAATCATGAAATGTCATGTTGAAAGTCTTGGTCAAGCAGGGAGTCCAGTGCCCACTAGGTCTCTAATTTCGCAGGTTCTACTTGGATTAGATGAAGAAT
ACAACCCTATTATTGTTGGAATATAA
Protein sequenceShow/hide protein sequence
MANASSEFFLSSIGAPHFSSPPLNQLLNQVTSIKLERGNFLLWKNLALSILRSYKLKCHLLGTKACPTMFLPQGSTEGITISERASSSSSEPSTVINPLYESWVMGCNIT
KDLWDAIQTLFGVQSRVEEDFLRQVFQQTRKGNMKMSEYLRIMKCHVESLGQAGSPVPTRSLISQVLLGLDEEYNPIIVGI