CuGenDBv2

Cucsat.G15699 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G15699
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Transposon Ty3-G Gag-Pol polyprotein
Genome location: ctg2009:6084293..6088594
RNA-Seq Expression: Cucsat.G15699
Synteny: Cucsat.G15699
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0010158 - abaxial cell fate specification (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR000953 - Chromo/chromo shadow domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR023780 - Chromo domain
IPR036397 - Ribonuclease H superfamily
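
As an illustration only, the genome location string above can be split into a contig name and two coordinates. This is a minimal sketch: the function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

contig, start, end = parse_location("ctg2009:6084293..6088594")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the locus covers 4302 bp on contig ctg2009.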


Homology
GenBank top hits | e value | % identity
KAA0033666.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa] | 1.96e-212 | 79.44
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        A G+NVIMVVVDRL+KY+YFI+++HPFSAK VA  FID+IV +H  P SII+DRDKIFLSNFWKELFA+MGT+LKRS   HPQTDGQTERVN+CLETYLR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD
        CFCNEQP KWD+ IPWAELWYNTTFHASTK TPF+ VYGRPPPPLLSYG KKT NNEV+ +LKERDLA+NALKENL +AQNRMKK  D  RRELK KVG+
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD

Query:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG
        EVYLKL+ YRQ SLARKK EKLAP+YYGPYK+IEEIGAVAYRL LP EA+IHNVFHISQLK KLG QQ VQHQ PMLTE FELQLW E VLG+RWN+EL 
Subjt:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG

Query:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV
         NEWLIKW+GLP+S+ATWESV+QMNQQFP FH+EDKVN+EPRGIVRPPI+HTYKRR + V
Subjt:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV

KAA0048169.1 disease resistance protein [Cucumis melo var. makuwa] | 2.13e-207 | 77.78
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        A G+NVIMVVVDRL+KY+YFI+++HPFS K VA  FID+IV +H  P SII+D DKIFLSNFW+ELFA+MGT+LKRS   HPQTDGQTERVN+CLETYLR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD
        CFCNEQP KWD+ IPWAELWYNTTFHASTK TPF+ VYGR PPPLLSYG KKT NNEV+ +LKERD A+NALKENL +AQNRMKK AD  RRELK KVG+
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD

Query:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG
        EVYLKL+ YRQRSLARKK EKLAP+YYGPYK+IEEIGAVAY+L LP EA+IHNVFHISQLK KLG QQ VQHQ PMLTE FELQL  E VLG+RWN+ELG
Subjt:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG

Query:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV
         NEWL+KW+GL +S+ATWESV+QMNQQFP FH+EDKVN+EPRGIVRPPI+HTYKRR + V
Subjt:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV

KAA0057186.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa] | 1.68e-207 | 78.77
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        A G+NVIMVVVDRLSKY+YF++M+HPFSAK VA  FID+IV +H IPKSII+DRDKIFLSNFWKELF  M TILKRS   HPQTDGQTERVN+CLETYLR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD
        CFCNEQP KW + IPWAELWYNTTFH+ST+ TPFQ VYGRPPPPL+SYG KKTPN+EV+T+LKERDLAI+ALKENL +AQNRMKK AD  RRELKFKVGD
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD

Query:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG
        EVYLKLR YRQRSLA+K+ EKLAPKYYGPY + E IG VAYRL  P EASIHNVFHISQLKLKLG Q  +Q QQP LT EFELQLW E VLG+RW+ ELG
Subjt:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG

Query:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGK
         NEWL+KWKGLP+SEATWESV+ MNQQFP FH+EDKV LEP+GIVRPPII+ YKR+GK
Subjt:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGK

KAE8637598.1 hypothetical protein CSA_022681 [Cucumis sativus] | 1.15e-216 | 81.72
Query:  LAEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYL
        +A G NVIMVVVDRLSKYSYF+ ++HP++AK VASIF++++VSKH IPKSIITDRDKIFLSNFWKELF TMGTILKRS   HPQTDGQTERVNRCLETYL
Subjt:  LAEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYL

Query:  RCFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVG
        RCFCNEQPKKWD+LIPWAELWYNTTFHASTK TP+Q+V+GR PPPLLSYG+K++PNN+V+ +LKERDLA+NAL+ENLC+AQNRMKKMADRNRRELKFK+G
Subjt:  RCFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVG

Query:  DEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQEL
        DEVYLKLR YRQRSLARKKCEKL+PK+YGPY++IEEIG VAYRL LP EA+IHNVFH+SQLKLKLG Q   Q QQP+LTE+FELQLW E VLG+RWN+EL
Subjt:  DEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQEL

Query:  GGNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV
        GGNEWLIKWK LPDSEATWESV+ +NQQFP FH+EDKVNLEPRGIVRPPIIHTY+RRG+ V
Subjt:  GGNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV

TYK08087.1 disease resistance protein [Cucumis melo var. makuwa] | 1.11e-209 | 78.61
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        A G+NVIMVVVDRL+KY+YFI+++HPFSAK VA  FID+IV +H  P SII+DRDKIFLSNFWKELFA+MGT+LKRS   HPQTDGQTERVN+CLETYLR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD
        CFCNEQP KWD+ IPWAELWYNTTFHASTK TPF+ VYGR PPPLLSYG KKT NNEV+ +LKERD A+NALKENL +AQNRMKK AD  RRELK KVG+
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD

Query:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG
        EVYLKL+ YRQRSLARKK EKLAP+YYGPYK+IEEIGAVAY+L LP EA+IHNVFHISQLK KLG QQ VQHQ PMLTE FELQL  E VLG+RWN+ELG
Subjt:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG

Query:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV
         NEWL+KW+GL +S+ATWESV+QMNQQFP FH+EDKVN+EPRGIVRPPI+HTYKRR + V
Subjt:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV

TrEMBL top hits | e value | % identity
A0A5A7SUN8 Transposon Ty3-G Gag-Pol polyprotein | 9.48e-213 | 79.44
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        A G+NVIMVVVDRL+KY+YFI+++HPFSAK VA  FID+IV +H  P SII+DRDKIFLSNFWKELFA+MGT+LKRS   HPQTDGQTERVN+CLETYLR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD
        CFCNEQP KWD+ IPWAELWYNTTFHASTK TPF+ VYGRPPPPLLSYG KKT NNEV+ +LKERDLA+NALKENL +AQNRMKK  D  RRELK KVG+
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD

Query:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG
        EVYLKL+ YRQ SLARKK EKLAP+YYGPYK+IEEIGAVAYRL LP EA+IHNVFHISQLK KLG QQ VQHQ PMLTE FELQLW E VLG+RWN+EL 
Subjt:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG

Query:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV
         NEWLIKW+GLP+S+ATWESV+QMNQQFP FH+EDKVN+EPRGIVRPPI+HTYKRR + V
Subjt:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV

A0A5A7U3Z6 Disease resistance protein | 1.03e-207 | 77.78
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        A G+NVIMVVVDRL+KY+YFI+++HPFS K VA  FID+IV +H  P SII+D DKIFLSNFW+ELFA+MGT+LKRS   HPQTDGQTERVN+CLETYLR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD
        CFCNEQP KWD+ IPWAELWYNTTFHASTK TPF+ VYGR PPPLLSYG KKT NNEV+ +LKERD A+NALKENL +AQNRMKK AD  RRELK KVG+
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD

Query:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG
        EVYLKL+ YRQRSLARKK EKLAP+YYGPYK+IEEIGAVAY+L LP EA+IHNVFHISQLK KLG QQ VQHQ PMLTE FELQL  E VLG+RWN+ELG
Subjt:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG

Query:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV
         NEWL+KW+GL +S+ATWESV+QMNQQFP FH+EDKVN+EPRGIVRPPI+HTYKRR + V
Subjt:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV

A0A5A7USP1 Transposon Ty3-G Gag-Pol polyprotein | 8.11e-208 | 78.77
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        A G+NVIMVVVDRLSKY+YF++M+HPFSAK VA  FID+IV +H IPKSII+DRDKIFLSNFWKELF  M TILKRS   HPQTDGQTERVN+CLETYLR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD
        CFCNEQP KW + IPWAELWYNTTFH+ST+ TPFQ VYGRPPPPL+SYG KKTPN+EV+T+LKERDLAI+ALKENL +AQNRMKK AD  RRELKFKVGD
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD

Query:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG
        EVYLKLR YRQRSLA+K+ EKLAPKYYGPY + E IG VAYRL  P EASIHNVFHISQLKLKLG Q  +Q QQP LT EFELQLW E VLG+RW+ ELG
Subjt:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG

Query:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGK
         NEWL+KWKGLP+SEATWESV+ MNQQFP FH+EDKV LEP+GIVRPPII+ YKR+GK
Subjt:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGK

A0A5D3C7Z4 Disease resistance protein | 5.38e-210 | 78.61
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        A G+NVIMVVVDRL+KY+YFI+++HPFSAK VA  FID+IV +H  P SII+DRDKIFLSNFWKELFA+MGT+LKRS   HPQTDGQTERVN+CLETYLR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD
        CFCNEQP KWD+ IPWAELWYNTTFHASTK TPF+ VYGR PPPLLSYG KKT NNEV+ +LKERD A+NALKENL +AQNRMKK AD  RRELK KVG+
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD

Query:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG
        EVYLKL+ YRQRSLARKK EKLAP+YYGPYK+IEEIGAVAY+L LP EA+IHNVFHISQLK KLG QQ VQHQ PMLTE FELQL  E VLG+RWN+ELG
Subjt:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG

Query:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV
         NEWL+KW+GL +S+ATWESV+QMNQQFP FH+EDKVN+EPRGIVRPPI+HTYKRR + V
Subjt:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGKNV

A0A5D3DI23 Transposon Ty3-G Gag-Pol polyprotein | 2.02e-207 | 78.21
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        A  +NVIMVVVDRLSKY+YF++M+HPFSAK VA  FID+IV +H IPKSII+DRDKIFLSNFWKELF  M TILKRS   HPQTDGQTERVN+CLETYLR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD
        CFCNEQP KW + IPWAELWYNTTF++ST+ TPFQ VYGRPPPPL+SYG KKTPN+EV+T+LKERDLAI+ALKENL +AQNRMKK  D  RRELKFKVGD
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGD

Query:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG
        EVYLKLR Y QRSLA+K+ EKLAPKYYGPY + E IG VAYRL LP EASIHNVFHISQLKLKLG Q  +Q QQP LT EFELQLW E VLG+RW+ ELG
Subjt:  EVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELG

Query:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGK
         NEWL+KWKGLPDSEATWESV+ MNQQFP FH+EDKV LEP+GIVRPPII+ YKR+GK
Subjt:  GNEWLIKWKGLPDSEATWESVFQMNQQFPDFHVEDKVNLEPRGIVRPPIIHTYKRRGK

SwissProt top hits | e value | % identity
P0CT34 Transposon Tf2-1 polyprotein | 2.1e-32 | 33.33
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        + G N + VVVDR SK +  +      +A+  A +F  R+++    PK II D D IF S  WK+       ++K S+   PQTDGQTER N+ +E  LR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPP---PPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRREL-KF
        C C+  P  W   I   +  YN   H++T++TPF+ V+   P   P  L     KT  N  +T+          +KE+L     +MKK  D   +E+ +F
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPP---PPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRREL-KF

Query:  KVGDEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLP--LEASIHNVFHISQLK
        + GD V +K    R ++    K  KLAP + GP+ ++++ G   Y L LP  ++    + FH+S L+
Subjt:  KVGDEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLP--LEASIHNVFHISQLK

P0CT35 Transposon Tf2-2 polyprotein | 2.1e-32 | 33.33
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        + G N + VVVDR SK +  +      +A+  A +F  R+++    PK II D D IF S  WK+       ++K S+   PQTDGQTER N+ +E  LR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPP---PPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRREL-KF
        C C+  P  W   I   +  YN   H++T++TPF+ V+   P   P  L     KT  N  +T+          +KE+L     +MKK  D   +E+ +F
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPP---PPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRREL-KF

Query:  KVGDEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLP--LEASIHNVFHISQLK
        + GD V +K    R ++    K  KLAP + GP+ ++++ G   Y L LP  ++    + FH+S L+
Subjt:  KVGDEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLP--LEASIHNVFHISQLK

P0CT36 Transposon Tf2-3 polyprotein | 2.1e-32 | 33.33
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        + G N + VVVDR SK +  +      +A+  A +F  R+++    PK II D D IF S  WK+       ++K S+   PQTDGQTER N+ +E  LR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPP---PPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRREL-KF
        C C+  P  W   I   +  YN   H++T++TPF+ V+   P   P  L     KT  N  +T+          +KE+L     +MKK  D   +E+ +F
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPP---PPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRREL-KF

Query:  KVGDEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLP--LEASIHNVFHISQLK
        + GD V +K    R ++    K  KLAP + GP+ ++++ G   Y L LP  ++    + FH+S L+
Subjt:  KVGDEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLP--LEASIHNVFHISQLK

P0CT41 Transposon Tf2-12 polyprotein | 2.1e-32 | 33.33
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        + G N + VVVDR SK +  +      +A+  A +F  R+++    PK II D D IF S  WK+       ++K S+   PQTDGQTER N+ +E  LR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPP---PPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRREL-KF
        C C+  P  W   I   +  YN   H++T++TPF+ V+   P   P  L     KT  N  +T+          +KE+L     +MKK  D   +E+ +F
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPP---PPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRREL-KF

Query:  KVGDEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLP--LEASIHNVFHISQLK
        + GD V +K    R ++    K  KLAP + GP+ ++++ G   Y L LP  ++    + FH+S L+
Subjt:  KVGDEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLP--LEASIHNVFHISQLK

Q9UR07 Transposon Tf2-11 polyprotein | 2.1e-32 | 33.33
Query:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR
        + G N + VVVDR SK +  +      +A+  A +F  R+++    PK II D D IF S  WK+       ++K S+   PQTDGQTER N+ +E  LR
Subjt:  AEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLR

Query:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPP---PPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRREL-KF
        C C+  P  W   I   +  YN   H++T++TPF+ V+   P   P  L     KT  N  +T+          +KE+L     +MKK  D   +E+ +F
Subjt:  CFCNEQPKKWDRLIPWAELWYNTTFHASTKVTPFQTVYGRPP---PPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRREL-KF

Query:  KVGDEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLP--LEASIHNVFHISQLK
        + GD V +K    R ++    K  KLAP + GP+ ++++ G   Y L LP  ++    + FH+S L+
Subjt:  KVGDEVYLKLRLYRQRSLARKKCEKLAPKYYGPYKLIEEIGAVAYRLMLP--LEASIHNVFHISQLK

Arabidopsis top hits | e value | % identity
No hits found
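
The alignment blocks above follow the usual BLAST pairwise layout, in which the unlabeled middle line repeats a residue where the two sequences are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch of how identity and positive percentages could be recomputed from one Query/midline pair, the hypothetical helper below counts those symbols. It assumes the midline has already been stripped of its leading indentation so that it lines up with the query string, and it works per block, so it is not expected to reproduce the reported whole-alignment values exactly.

```python
def midline_identity(query, midline):
    """Return (% identical, % positive) for one aligned Query/midline pair.

    Assumes `midline` starts at the same column as `query`; trailing
    spaces that were stripped from the midline are padded back.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query segment")
    midline = midline.ljust(len(query))
    # A midline letter equal to the query residue marks an identical position.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # '+' marks a conservative substitution; positives include identities.
    positives = identical + midline.count("+")
    length = len(query)  # gap characters in the query count toward the length
    return 100 * identical / length, 100 * positives / length

# First five columns of the first GenBank alignment shown above.
print(midline_identity("AEGVN", "A G+N"))
```

Summing identical positions over every block of a hit and dividing by the total alignment length would give a figure comparable to the % identity column.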

Sequences
CDS sequence
CTAGCTGAAGGGGTTAATGTGATCATGGTGGTAGTTGACCGGCTGAGTAAGTATTCTTACTTTATATCAATGAGACATCCGTTTTCAGCTAAGCACGTTGCCTCCATATT
CATCGACAGAATAGTAAGCAAGCACGACATTCCTAAGTCGATTATCACAGACCGGGATAAAATCTTTCTTAGTAATTTTTGGAAAGAACTGTTTGCAACTATGGGGACTA
TTCTAAAGAGGAGTATGGTGCTTCACCCACAAACTGATGGGCAGACAGAAAGGGTAAATAGATGTCTTGAAACTTATCTGAGATGCTTCTGTAATGAACAACCAAAAAAA
TGGGATAGACTGATTCCATGGGCAGAATTGTGGTATAATACCACATTCCATGCATCCACCAAAGTGACCCCTTTTCAGACGGTATACGGTAGACCGCCTCCTCCCCTGTT
ATCCTATGGTTACAAGAAAACTCCTAATAATGAGGTTGACACACTGCTGAAAGAACGTGACTTAGCCATCAATGCCTTAAAGGAAAACCTGTGTGTAGCCCAGAACAGGA
TGAAGAAAATGGCTGATCGAAATAGAAGAGAGTTAAAATTTAAGGTAGGAGATGAAGTCTACTTGAAATTGAGACTTTACCGGCAACGCTCCTTAGCTCGGAAGAAGTGT
GAGAAGTTAGCACCTAAATATTACGGACCCTACAAACTTATTGAAGAAATAGGAGCAGTTGCGTACAGATTGATGTTACCCCTGGAAGCCAGTATCCACAATGTGTTCCA
TATTTCTCAGTTAAAACTAAAACTAGGTAATCAACAAGCTGTTCAACACCAGCAACCTATGCTAACGGAAGAGTTTGAGTTACAATTATGGCTAGAAATAGTGTTGGGAG
TCCGTTGGAATCAGGAATTGGGAGGAAATGAATGGTTAATCAAATGGAAAGGGCTGCCGGACAGTGAAGCAACTTGGGAATCTGTTTTCCAAATGAACCAACAATTCCCT
GACTTTCACGTTGAGGACAAGGTGAACCTGGAACCGAGGGGTATTGTAAGGCCCCCTATTATCCATACATACAAAAGGAGGGGTAAAAACGTAAAAGCTCACGCAGCATA
A
mRNA sequence
CTAGCTGAAGGGGTTAATGTGATCATGGTGGTAGTTGACCGGCTGAGTAAGTATTCTTACTTTATATCAATGAGACATCCGTTTTCAGCTAAGCACGTTGCCTCCATATT
CATCGACAGAATAGTAAGCAAGCACGACATTCCTAAGTCGATTATCACAGACCGGGATAAAATCTTTCTTAGTAATTTTTGGAAAGAACTGTTTGCAACTATGGGGACTA
TTCTAAAGAGGAGTATGGTGCTTCACCCACAAACTGATGGGCAGACAGAAAGGGTAAATAGATGTCTTGAAACTTATCTGAGATGCTTCTGTAATGAACAACCAAAAAAA
TGGGATAGACTGATTCCATGGGCAGAATTGTGGTATAATACCACATTCCATGCATCCACCAAAGTGACCCCTTTTCAGACGGTATACGGTAGACCGCCTCCTCCCCTGTT
ATCCTATGGTTACAAGAAAACTCCTAATAATGAGGTTGACACACTGCTGAAAGAACGTGACTTAGCCATCAATGCCTTAAAGGAAAACCTGTGTGTAGCCCAGAACAGGA
TGAAGAAAATGGCTGATCGAAATAGAAGAGAGTTAAAATTTAAGGTAGGAGATGAAGTCTACTTGAAATTGAGACTTTACCGGCAACGCTCCTTAGCTCGGAAGAAGTGT
GAGAAGTTAGCACCTAAATATTACGGACCCTACAAACTTATTGAAGAAATAGGAGCAGTTGCGTACAGATTGATGTTACCCCTGGAAGCCAGTATCCACAATGTGTTCCA
TATTTCTCAGTTAAAACTAAAACTAGGTAATCAACAAGCTGTTCAACACCAGCAACCTATGCTAACGGAAGAGTTTGAGTTACAATTATGGCTAGAAATAGTGTTGGGAG
TCCGTTGGAATCAGGAATTGGGAGGAAATGAATGGTTAATCAAATGGAAAGGGCTGCCGGACAGTGAAGCAACTTGGGAATCTGTTTTCCAAATGAACCAACAATTCCCT
GACTTTCACGTTGAGGACAAGGTGAACCTGGAACCGAGGGGTATTGTAAGGCCCCCTATTATCCATACATACAAAAGGAGGGGTAAAAACGTAAAAGCTCACGCAGCATA
A
Protein sequence
LAEGVNVIMVVVDRLSKYSYFISMRHPFSAKHVASIFIDRIVSKHDIPKSIITDRDKIFLSNFWKELFATMGTILKRSMVLHPQTDGQTERVNRCLETYLRCFCNEQPKK
WDRLIPWAELWYNTTFHASTKVTPFQTVYGRPPPPLLSYGYKKTPNNEVDTLLKERDLAINALKENLCVAQNRMKKMADRNRRELKFKVGDEVYLKLRLYRQRSLARKKC
EKLAPKYYGPYKLIEEIGAVAYRLMLPLEASIHNVFHISQLKLKLGNQQAVQHQQPMLTEEFELQLWLEIVLGVRWNQELGGNEWLIKWKGLPDSEATWESVFQMNQQFP
DFHVEDKVNLEPRGIVRPPIIHTYKRRGKNVKAHAA
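
The protein sequence above can be checked against the CDS by translating it with the standard genetic code. The following is a minimal sketch rather than the database's own pipeline: the names `CODON_TABLE` and `translate` are hypothetical, the standard nuclear code is assumed, and translation simply starts at the first base and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("CTAGCTGAAGGGGTTAATGTG"))
```

Applied to the opening codons of the CDS, this yields `LAEGVNV`, which matches the start of the protein sequence. Note that the listed CDS begins with CTA (leucine) rather than an ATG start codon.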