; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015049 (gene) of Snake gourd v1 genome

Gene IDTan0015049
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationLG03:73259451..73259840
RNA-Seq ExpressionTan0015049
SyntenyTan0015049
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142327.1 uncharacterized protein LOC111012468 [Momordica charantia]2.6e-2956.2Show/hide
Query:  LDGDSSTIQPTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEIS
        +   S+T      S++E Y NPYYLH SD T+L+ VSDLL E+NY SWSR           +GF+DGS+S PS A  NSW+ICN++V AW+LN+ +KEIS
Subjt:  LDGDSSTIQPTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEIS

Query:  ASLNFTDSARDIWLDLQQRYQ
        AS+ F+DSARDIWLDLQ+RYQ
Subjt:  ASLNFTDSARDIWLDLQQRYQ

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]2.2e-2551.79Show/hide
Query:  PTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEISASLNFTDSA
        P +P V+E + NPY+LH SD T+L+ VSDLLT+ NY SWSR           +GF+DGS+S P+D   +SW ICN++V +WI N+ +K+ISAS+ F+DSA
Subjt:  PTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEISASLNFTDSA

Query:  RDIWLDLQQRYQ
         +IWLDL++R+Q
Subjt:  RDIWLDLQQRYQ

XP_038874906.1 uncharacterized protein LOC120067409 [Benincasa hispida]1.3e-2550.75Show/hide
Query:  PNPPTVLDGDSS-------TIQPTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWS-----------RVGFIDGSLSIPSDALRNSWRICNSMV
        P PP     +SS       T  P   S ++ Y   Y+LH SD T+L+ VSDLLT+SNY+SWS           ++GFIDG+L  P   L+NSW ICNS+V
Subjt:  PNPPTVLDGDSS-------TIQPTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWS-----------RVGFIDGSLSIPSDALRNSWRICNSMV

Query:  TAWILNATTKEISASLNFTDSARDIWLDLQQRYQ
        T WI NA +K+I+AS+NF+DS R+IWLDLQQRYQ
Subjt:  TAWILNATTKEISASLNFTDSARDIWLDLQQRYQ

XP_038875043.1 uncharacterized protein LOC120067569 [Benincasa hispida]5.0e-2547.62Show/hide
Query:  PTVLDGDSS--TIQPTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWS-----------RVGFIDGSLSIPSDALRNSWRICNSMVTAWILNAT
        PT  +  +S    + + PS++E Y+NPY+LHP D T+L+ +S+LLTESNY SWS           ++GFI+  +  PS  L +SW ICN +VTAWILN+ 
Subjt:  PTVLDGDSS--TIQPTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWS-----------RVGFIDGSLSIPSDALRNSWRICNSMVTAWILNAT

Query:  TKEISASLNFTDSARDIWLDLQQRYQ
        +KEIS S+NF++S ++IW+D Q+RYQ
Subjt:  TKEISASLNFTDSARDIWLDLQQRYQ

XP_038902375.1 uncharacterized protein LOC120089012 [Benincasa hispida]1.3e-2550.38Show/hide
Query:  SPNPPTVLDGDSSTIQPT---HPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSRV-----------GFIDGSLSIPSDALRNSWRICNSMVTAW
        +PN PT       +  P+   +P+V+E Y N Y+LH SD TNL+ VS+LLTE+NY SWS+V           GF+DGS+   +  L +SW IC+ +VTAW
Subjt:  SPNPPTVLDGDSSTIQPT---HPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSRV-----------GFIDGSLSIPSDALRNSWRICNSMVTAW

Query:  ILNATTKEISASLNFTDSARDIWLDLQQRYQ
        ILN+  KEIS S+NF DSAR IW+DLQ+RYQ
Subjt:  ILNATTKEISASLNFTDSARDIWLDLQQRYQ

TrEMBL top hitse value%identityAlignment
A0A6J1CMF8 uncharacterized protein LOC1110124681.2e-2956.2Show/hide
Query:  LDGDSSTIQPTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEIS
        +   S+T      S++E Y NPYYLH SD T+L+ VSDLL E+NY SWSR           +GF+DGS+S PS A  NSW+ICN++V AW+LN+ +KEIS
Subjt:  LDGDSSTIQPTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEIS

Query:  ASLNFTDSARDIWLDLQQRYQ
        AS+ F+DSARDIWLDLQ+RYQ
Subjt:  ASLNFTDSARDIWLDLQQRYQ

A0A6J1DIP8 uncharacterized protein LOC1110203995.9e-2454.72Show/hide
Query:  IELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEISASLNFTDSARDIWLD
        IE Y NPY+LH SD T+L+ VSD LT  NY SWSR           VGF+DGS+  P+  L +SW ICN++V +WILN+ +KEISAS+ F+DSAR+IWLD
Subjt:  IELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEISASLNFTDSARDIWLD

Query:  LQQRYQ
        L++R++
Subjt:  LQQRYQ

A0A6J1DKR8 uncharacterized protein LOC1110218311.4e-2046.72Show/hide
Query:  VLDGDSSTIQPTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEI
        VLD  SS    +  S ++   NPYYLH +D T L+ V+  LTE NY+SWSR           +GFIDGS+S P   L  +W   N +V AWILN+ +KEI
Subjt:  VLDGDSSTIQPTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEI

Query:  SASLNFTDSARDIWLDLQQRYQ
        S+S+ F++SARDIW+DL++R++
Subjt:  SASLNFTDSARDIWLDLQQRYQ

A0A6J1DNP7 uncharacterized protein LOC1110220651.1e-2551.79Show/hide
Query:  PTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEISASLNFTDSA
        P +P V+E + NPY+LH SD T+L+ VSDLLT+ NY SWSR           +GF+DGS+S P+D   +SW ICN++V +WI N+ +K+ISAS+ F+DSA
Subjt:  PTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEISASLNFTDSA

Query:  RDIWLDLQQRYQ
         +IWLDL++R+Q
Subjt:  RDIWLDLQQRYQ

A0A6J1DW89 uncharacterized protein LOC1110237021.2e-1946.15Show/hide
Query:  PNPPTVLDGDSSTIQPTHPSVIE---LYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWI
        P PPT L  +SS+  P   S +       NPYYLH +D T L+ V+ LLTE NY SWSR           + FIDG +  PS  L  +W   N +V AWI
Subjt:  PNPPTVLDGDSSTIQPTHPSVIE---LYENPYYLHPSDGTNLIHVSDLLTESNYASWSR-----------VGFIDGSLSIPSDALRNSWRICNSMVTAWI

Query:  LNATTKEISASLNFTDSARDIWLDLQQRYQ
        LN+ +KEISAS+ F++SARDIW+DL +R++
Subjt:  LNATTKEISASLNFTDSARDIWLDLQQRYQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCCTAACCCTCCTACAGTCCTTGATGGCGATTCTTCTACAATTCAACCGACTCATCCCTCAGTTATTGAATTGTATGAAAATCCCTACTACCTCCATCCTTCTGA
CGGTACCAATTTGATTCATGTCTCCGACTTGTTGACAGAATCGAATTATGCTTCATGGAGTCGTGTTGGTTTCATCGATGGTTCTCTCTCGATTCCTTCCGATGCTTTAC
GCAATTCCTGGAGAATTTGCAATAGCATGGTTACGGCATGGATCTTAAACGCAACAACGAAGGAAATTTCAGCAAGTTTGAATTTCACGGACTCTGCTCGAGATATCTGG
CTTGATCTACAACAGCGGTATCAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCCTAACCCTCCTACAGTCCTTGATGGCGATTCTTCTACAATTCAACCGACTCATCCCTCAGTTATTGAATTGTATGAAAATCCCTACTACCTCCATCCTTCTGA
CGGTACCAATTTGATTCATGTCTCCGACTTGTTGACAGAATCGAATTATGCTTCATGGAGTCGTGTTGGTTTCATCGATGGTTCTCTCTCGATTCCTTCCGATGCTTTAC
GCAATTCCTGGAGAATTTGCAATAGCATGGTTACGGCATGGATCTTAAACGCAACAACGAAGGAAATTTCAGCAAGTTTGAATTTCACGGACTCTGCTCGAGATATCTGG
CTTGATCTACAACAGCGGTATCAGTGA
Protein sequenceShow/hide protein sequence
MSPNPPTVLDGDSSTIQPTHPSVIELYENPYYLHPSDGTNLIHVSDLLTESNYASWSRVGFIDGSLSIPSDALRNSWRICNSMVTAWILNATTKEISASLNFTDSARDIW
LDLQQRYQ