; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPIUnG00290 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPIUnG00290
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationScaffold000101:118937..121530
RNA-Seq ExpressionCSPIUnG00290
SyntenyCSPIUnG00290
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056980.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.4e-30066.96Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M FGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYS+D  THLEHLT+VF+LLRQH LFAN +KC FAKDRIEYLGHWVS KGVEADQEKI+ M+EW
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS
        PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLT+LTKKNNFRW+EE   AFE LK+AMVTLPVLALPDFQLPFE+ETDASG GLGAVL+QNK+PIAYFS
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS

Query:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV
        QKLSETA+E+S+YERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHI EQREI+PG+QKW+MKLIGFDFEI YRAG +NKA DALSR+P EA+LN I V
Subjt:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV

Query:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I
        PSLLD+  I                                                                                 G  KD++  +
Subjt:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I

Query:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL
        +  H     + +A  P     PLPIPNRIW+DISMDF+EGLPRSKGFDTILVVVD LSKYAHF+TLGHPFSAK VA +FIKEVVRLHGYPRSIVSDRDR+
Subjt:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL

Query:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND
        FLSHFW+E+FRLQG QLKRSTAYHPQTDGQ+EVVNK +ELYL C C EK  +WS+K+AWAEYWYNT + + I++TPYA                G TPND
Subjt:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND

Query:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH
        SVE QLQ+RDE+L  LK HLQHAQ +M+KFA++H  DVVFDIGD +YLK+QPYRQQS+AKKRCEKLS +YFGPY VLGRVGEVAYLLDLP+TA+IHPVFH
Subjt:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH

Query:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI
        VSQLKKA                                N+ ERDWEYLV+WKDQP+HEATWESYA+L +Q+P  HLEDKV+LLR GIARPPI
Subjt:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI

KAA0058899.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.7e-30270.45Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M FGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYS+D  THLEHLT+VF+LLRQH LFAN +KC FAKDRIEYLGHW+S KGVEADQEKI+ M+EW
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS
        PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLT+LTKKNNFRW+EE   AFE LK+AMVTLPVLALPDFQLPFE+ETDASG GLGAVL+QNK+PIAYFS
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS

Query:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV
        QKLSETA+E+S+YERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHI EQREI+PG+QKW+MKLIGFDFEI YRAG +NKA DALSR+P EA+LN I V
Subjt:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV

Query:  PSLLDLETI------------------------------------GSPKDVQ--INSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSK
        PSLLD+  I                                    G  KD++  ++  H     + +A  P      LPIPNRIW+DISMDF+EGLPRSK
Subjt:  PSLLDLETI------------------------------------GSPKDVQ--INSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSK

Query:  GFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCF
        GFDTILVVVD LSKYAHF+TLGHPFSAK VA +FIKEVVRLHGYPRSIVSDRDR+FLSHFW+E+FRLQG QLKRSTAYHPQTDGQ+EVVNK +ELYL C 
Subjt:  GFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCF

Query:  CHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPNDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDL
        C EK  +WS+K+AWAEYWYNT + + I++TPYA                G T NDSVE QLQ+RDE+L  LK HLQHAQ +MKKFA++H  DVVFDIGD 
Subjt:  CHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPNDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDL

Query:  LYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFHVSQLKKA--------------------------------NDTERD
        +YLK+QPYRQQS+AKKRCEKLS +YFGPY VLGRVGEVAYLLDLP+TA+IHPVFHVSQLKKA                                N+ ERD
Subjt:  LYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFHVSQLKKA--------------------------------NDTERD

Query:  WEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI
        WEYLV+WK+QP+HEATWESY++L +Q+P  HLEDKV+LLR GIARPPI
Subjt:  WEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI

TYK21209.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.1e-30066.96Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M FGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYS+D  THLEHLT+VF+LLRQH LFAN +KC FAKDRIEYLGHWVS KGVEADQEKI+ M+EW
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS
        PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLT+LTKKNNFRW+EE   AFE LK+AMVTLPVLALPDFQLPFE+ETDASG GLGAVL+QNK+PIAYFS
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS

Query:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV
        QKLSETA+E+S+YERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHI EQREI+PG+QKW+MKLIGFDFEI YRAG +NKA DALSR+P EA+LN I V
Subjt:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV

Query:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I
        PSLLD+  I                                                                                 G  KD++  +
Subjt:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I

Query:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL
        +  H     + +A  P     PLPIPNRIW+DISMDF+EGLPRSKGFDTILVVVD LSKYAHF+TLGHPFSAK VA +FIKEVVRLHGYPRSIVSDRDR+
Subjt:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL

Query:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND
        FLSHFW+E+FRLQG QLKRSTAYHPQTDGQ+EVVNK +ELYL C C EK  +WS+K+AWAEYWYNT + + I++TPYA                G TPND
Subjt:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND

Query:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH
        SVE QLQ+RDE+L  LK HLQHAQ +M+KFA++H  DVVFDIGD +YLK+QPYRQQS+AKKRCEKLS +YFGPY VLGRVGEVAYLLDLP+TA+IHPVFH
Subjt:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH

Query:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI
        VSQLKKA                                N+ ERDWEYLV+WKDQP+HEATWESYA+L +Q+P  HLEDKV+LLR GIARPPI
Subjt:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI

TYK23724.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-29966.83Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M FGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYS+D  THLEHLT+VF+LLRQH LFAN +KC FAKDRIEYLGHWVS KGVEADQEKI+ M+EW
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS
        PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLT+LTKKNNFRW+EE   AFE LK+AMVTLPVLALPDFQLPFE+ETDASG GLGAVL+QNK+PIAYFS
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS

Query:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV
        QKLSETA+E+S+YERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHI EQREI+PG+QKW+MKLIGFDFEI YRAG +NKA DALSR+P EA+LN I V
Subjt:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV

Query:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I
        PSLLD+  I                                                                                 G  KD++  +
Subjt:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I

Query:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL
        +  H     + +A  P     PLPIPNRIW+DISMDF+EGLPRSKGFDTILVVVD LSKYAHF+TLGHPFSAK VA +FIKEVVRLHGYPRSIVSDRDR+
Subjt:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL

Query:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND
        FLSHFW+E+FRLQG QLKRSTAYHPQTDGQ+EVVNK +ELYL C C EK  +WS+K+AWAEYWYNT + + I++TPYA                G TPND
Subjt:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND

Query:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH
        SVE QLQ+RDE+L  LK HLQHAQ +MKKFA++H  DVVFDIGD +YLK+QPYRQQS+AKKRCEKLS +YFGPY VLGRVGEVAYLLDLP+TA+IHPVFH
Subjt:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH

Query:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI
        VSQLKKA                                N+ ERDWEYLV+WK+QP+HEATWESY++L +Q+P  HLEDKV+LLR GIARPPI
Subjt:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI

TYK26407.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.1e-30066.96Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M FGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYS+D  THLEHLT+VF+LLRQH LFAN +KC FAKDRIEYLGHWVS KGVEADQEKI+ M+EW
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS
        PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLT+LTKKNNFRW+EE   AFE LK+AMVTLPVLALPDFQLPFE+ETDASG GLGAVL+QNK+PIAYFS
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS

Query:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV
        QKLSETA+E+S+YERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHI EQREI+PG+QKW+MKLIGFDFEI YRAG +NKA DALSR+P EA+LN I V
Subjt:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV

Query:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I
        PSLLD+  I                                                                                 G  KD++  +
Subjt:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I

Query:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL
        +  H     + +A  P     PLPIPNRIW+DISMDF+EGLPRSKGFDTILVVVD LSKYAHF+TLGHPFSAK VA +FIKEVVRLHGYPRSIVSDRDR+
Subjt:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL

Query:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND
        FLSHFW+E+FRLQG QLKRSTAYHPQTDGQ+EVVNK +ELYL C C EK  +WS+K+AWAEYWYNT + + I++TPYA                G TPND
Subjt:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND

Query:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH
        SVE QLQ+RDE+L  LK HLQHAQ +M+KFA++H  DVVFDIGD +YLK+QPYRQQS+AKKRCEKLS +YFGPY VLGRVGEVAYLLDLP+TA+IHPVFH
Subjt:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH

Query:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI
        VSQLKKA                                N+ ERDWEYLV+WKDQP+HEATWESYA+L +Q+P  HLEDKV+LLR GIARPPI
Subjt:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI

TrEMBL top hitse value%identityAlignment
A0A5A7UPE1 Ty3/gypsy retrotransposon protein2.6e-30066.96Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M FGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYS+D  THLEHLT+VF+LLRQH LFAN +KC FAKDRIEYLGHWVS KGVEADQEKI+ M+EW
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS
        PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLT+LTKKNNFRW+EE   AFE LK+AMVTLPVLALPDFQLPFE+ETDASG GLGAVL+QNK+PIAYFS
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS

Query:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV
        QKLSETA+E+S+YERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHI EQREI+PG+QKW+MKLIGFDFEI YRAG +NKA DALSR+P EA+LN I V
Subjt:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV

Query:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I
        PSLLD+  I                                                                                 G  KD++  +
Subjt:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I

Query:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL
        +  H     + +A  P     PLPIPNRIW+DISMDF+EGLPRSKGFDTILVVVD LSKYAHF+TLGHPFSAK VA +FIKEVVRLHGYPRSIVSDRDR+
Subjt:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL

Query:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND
        FLSHFW+E+FRLQG QLKRSTAYHPQTDGQ+EVVNK +ELYL C C EK  +WS+K+AWAEYWYNT + + I++TPYA                G TPND
Subjt:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND

Query:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH
        SVE QLQ+RDE+L  LK HLQHAQ +M+KFA++H  DVVFDIGD +YLK+QPYRQQS+AKKRCEKLS +YFGPY VLGRVGEVAYLLDLP+TA+IHPVFH
Subjt:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH

Query:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI
        VSQLKKA                                N+ ERDWEYLV+WKDQP+HEATWESYA+L +Q+P  HLEDKV+LLR GIARPPI
Subjt:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI

A0A5A7UZE8 Ty3/gypsy retrotransposon protein2.8e-30270.45Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M FGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYS+D  THLEHLT+VF+LLRQH LFAN +KC FAKDRIEYLGHW+S KGVEADQEKI+ M+EW
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS
        PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLT+LTKKNNFRW+EE   AFE LK+AMVTLPVLALPDFQLPFE+ETDASG GLGAVL+QNK+PIAYFS
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS

Query:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV
        QKLSETA+E+S+YERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHI EQREI+PG+QKW+MKLIGFDFEI YRAG +NKA DALSR+P EA+LN I V
Subjt:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV

Query:  PSLLDLETI------------------------------------GSPKDVQ--INSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSK
        PSLLD+  I                                    G  KD++  ++  H     + +A  P      LPIPNRIW+DISMDF+EGLPRSK
Subjt:  PSLLDLETI------------------------------------GSPKDVQ--INSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSK

Query:  GFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCF
        GFDTILVVVD LSKYAHF+TLGHPFSAK VA +FIKEVVRLHGYPRSIVSDRDR+FLSHFW+E+FRLQG QLKRSTAYHPQTDGQ+EVVNK +ELYL C 
Subjt:  GFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCF

Query:  CHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPNDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDL
        C EK  +WS+K+AWAEYWYNT + + I++TPYA                G T NDSVE QLQ+RDE+L  LK HLQHAQ +MKKFA++H  DVVFDIGD 
Subjt:  CHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPNDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDL

Query:  LYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFHVSQLKKA--------------------------------NDTERD
        +YLK+QPYRQQS+AKKRCEKLS +YFGPY VLGRVGEVAYLLDLP+TA+IHPVFHVSQLKKA                                N+ ERD
Subjt:  LYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFHVSQLKKA--------------------------------NDTERD

Query:  WEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI
        WEYLV+WK+QP+HEATWESY++L +Q+P  HLEDKV+LLR GIARPPI
Subjt:  WEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI

A0A5D3DD68 Ty3/gypsy retrotransposon protein2.0e-30066.96Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M FGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYS+D  THLEHLT+VF+LLRQH LFAN +KC FAKDRIEYLGHWVS KGVEADQEKI+ M+EW
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS
        PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLT+LTKKNNFRW+EE   AFE LK+AMVTLPVLALPDFQLPFE+ETDASG GLGAVL+QNK+PIAYFS
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS

Query:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV
        QKLSETA+E+S+YERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHI EQREI+PG+QKW+MKLIGFDFEI YRAG +NKA DALSR+P EA+LN I V
Subjt:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV

Query:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I
        PSLLD+  I                                                                                 G  KD++  +
Subjt:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I

Query:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL
        +  H     + +A  P     PLPIPNRIW+DISMDF+EGLPRSKGFDTILVVVD LSKYAHF+TLGHPFSAK VA +FIKEVVRLHGYPRSIVSDRDR+
Subjt:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL

Query:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND
        FLSHFW+E+FRLQG QLKRSTAYHPQTDGQ+EVVNK +ELYL C C EK  +WS+K+AWAEYWYNT + + I++TPYA                G TPND
Subjt:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND

Query:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH
        SVE QLQ+RDE+L  LK HLQHAQ +M+KFA++H  DVVFDIGD +YLK+QPYRQQS+AKKRCEKLS +YFGPY VLGRVGEVAYLLDLP+TA+IHPVFH
Subjt:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH

Query:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI
        VSQLKKA                                N+ ERDWEYLV+WKDQP+HEATWESYA+L +Q+P  HLEDKV+LLR GIARPPI
Subjt:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI

A0A5D3DJA9 Ty3/gypsy retrotransposon protein5.8e-30066.83Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M FGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYS+D  THLEHLT+VF+LLRQH LFAN +KC FAKDRIEYLGHWVS KGVEADQEKI+ M+EW
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS
        PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLT+LTKKNNFRW+EE   AFE LK+AMVTLPVLALPDFQLPFE+ETDASG GLGAVL+QNK+PIAYFS
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS

Query:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV
        QKLSETA+E+S+YERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHI EQREI+PG+QKW+MKLIGFDFEI YRAG +NKA DALSR+P EA+LN I V
Subjt:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV

Query:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I
        PSLLD+  I                                                                                 G  KD++  +
Subjt:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I

Query:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL
        +  H     + +A  P     PLPIPNRIW+DISMDF+EGLPRSKGFDTILVVVD LSKYAHF+TLGHPFSAK VA +FIKEVVRLHGYPRSIVSDRDR+
Subjt:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL

Query:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND
        FLSHFW+E+FRLQG QLKRSTAYHPQTDGQ+EVVNK +ELYL C C EK  +WS+K+AWAEYWYNT + + I++TPYA                G TPND
Subjt:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND

Query:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH
        SVE QLQ+RDE+L  LK HLQHAQ +MKKFA++H  DVVFDIGD +YLK+QPYRQQS+AKKRCEKLS +YFGPY VLGRVGEVAYLLDLP+TA+IHPVFH
Subjt:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH

Query:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI
        VSQLKKA                                N+ ERDWEYLV+WK+QP+HEATWESY++L +Q+P  HLEDKV+LLR GIARPPI
Subjt:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI

A0A5D3DRT3 Ty3/gypsy retrotransposon protein2.0e-30066.96Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M FGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYS+D  THLEHLT+VF+LLRQH LFAN +KC FAKDRIEYLGHWVS KGVEADQEKI+ M+EW
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS
        PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLT+LTKKNNFRW+EE   AFE LK+AMVTLPVLALPDFQLPFE+ETDASG GLGAVL+QNK+PIAYFS
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEE---AFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFS

Query:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV
        QKLSETA+E+S+YERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHI EQREI+PG+QKW+MKLIGFDFEI YRAG +NKA DALSR+P EA+LN I V
Subjt:  QKLSETAKERSIYERELMAIVLAVEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINV

Query:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I
        PSLLD+  I                                                                                 G  KD++  +
Subjt:  PSLLDLETI---------------------------------------------------------------------------------GSPKDVQ--I

Query:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL
        +  H     + +A  P     PLPIPNRIW+DISMDF+EGLPRSKGFDTILVVVD LSKYAHF+TLGHPFSAK VA +FIKEVVRLHGYPRSIVSDRDR+
Subjt:  NSNHPPHFPRFRAWGP--FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRL

Query:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND
        FLSHFW+E+FRLQG QLKRSTAYHPQTDGQ+EVVNK +ELYL C C EK  +WS+K+AWAEYWYNT + + I++TPYA                G TPND
Subjt:  FLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYA---------------PGLTPND

Query:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH
        SVE QLQ+RDE+L  LK HLQHAQ +M+KFA++H  DVVFDIGD +YLK+QPYRQQS+AKKRCEKLS +YFGPY VLGRVGEVAYLLDLP+TA+IHPVFH
Subjt:  SVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFH

Query:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI
        VSQLKKA                                N+ ERDWEYLV+WKDQP+HEATWESYA+L +Q+P  HLEDKV+LLR GIARPPI
Subjt:  VSQLKKA--------------------------------NDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPI

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.4e-8830.36Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M +G++ APA FQ  +N +        ++ + DDIL++SK  S H++H+  V + L+  NL  N  KC+F + +++++G+ +SEKG    QE I  +L+W
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKN-NFRWT---EEAFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKK-----
          PKN +ELR FLG   Y R+F+     +  PL  L KK+  ++WT    +A E +K+ +V+ PVL   DF     +ETDAS   +GAVLSQ        
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKN-NFRWT---EEAFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKK-----

Query:  PIAYFSQKLSETAKERSIYERELMAIVLAVEKWRHYLLG--HRFVVYTDQKAL--RHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIP
        P+ Y+S K+S+     S+ ++E++AI+ +++ WRHYL      F + TD + L  R   E       + +W + L  F+FEI YR G  N   DALSRI 
Subjt:  PIAYFSQKLSETAKERSIYERELMAIVLAVEKWRHYLLG--HRFVVYTDQKAL--RHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIP

Query:  AEAQ----------LNVINVPSLLD-----------------------------------------LETIGSPKDVQINSN------------HPP----
         E +          +N +N  S+ D                                          + I  P D Q+               HP     
Subjt:  AEAQ----------LNVINVPSLLD-----------------------------------------LETIGSPKDVQINSN------------HPP----

Query:  ---HFPRF---------------------------RAWGPFRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATI
              RF                           + +GP + P+P   R W+ +SMDFI  LP S G++ + VVVD  SK A  +      +A+  A +
Subjt:  ---HFPRF---------------------------RAWGPFRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATI

Query:  FIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTP--
        F + V+   G P+ I++D D +F S  WK+        +K S  Y PQTDGQ+E  N++VE  L C C   P TW + ++  +  YN   H+  + TP  
Subjt:  FIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTP--

Query:  ----YAPGLTP------NDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDV-VFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRV
            Y+P L+P      +D  +   Q   ++  T+K HL     +MKK+ D+   ++  F  GDL+ +K    R ++    +  KL+  + GP+ VL + 
Subjt:  ----YAPGLTP------NDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDV-VFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRV

Query:  GEVAYLLDLPDTAR--IHPVFHVSQLKK
        G   Y LDLPD+ +      FHVS L+K
Subjt:  GEVAYLLDLPDTAR--IHPVFHVSQLKK

P0CT35 Transposon Tf2-2 polyprotein1.4e-8830.36Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M +G++ APA FQ  +N +        ++ + DDIL++SK  S H++H+  V + L+  NL  N  KC+F + +++++G+ +SEKG    QE I  +L+W
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKN-NFRWT---EEAFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKK-----
          PKN +ELR FLG   Y R+F+     +  PL  L KK+  ++WT    +A E +K+ +V+ PVL   DF     +ETDAS   +GAVLSQ        
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKN-NFRWT---EEAFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKK-----

Query:  PIAYFSQKLSETAKERSIYERELMAIVLAVEKWRHYLLG--HRFVVYTDQKAL--RHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIP
        P+ Y+S K+S+     S+ ++E++AI+ +++ WRHYL      F + TD + L  R   E       + +W + L  F+FEI YR G  N   DALSRI 
Subjt:  PIAYFSQKLSETAKERSIYERELMAIVLAVEKWRHYLLG--HRFVVYTDQKAL--RHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIP

Query:  AEAQ----------LNVINVPSLLD-----------------------------------------LETIGSPKDVQINSN------------HPP----
         E +          +N +N  S+ D                                          + I  P D Q+               HP     
Subjt:  AEAQ----------LNVINVPSLLD-----------------------------------------LETIGSPKDVQINSN------------HPP----

Query:  ---HFPRF---------------------------RAWGPFRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATI
              RF                           + +GP + P+P   R W+ +SMDFI  LP S G++ + VVVD  SK A  +      +A+  A +
Subjt:  ---HFPRF---------------------------RAWGPFRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATI

Query:  FIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTP--
        F + V+   G P+ I++D D +F S  WK+        +K S  Y PQTDGQ+E  N++VE  L C C   P TW + ++  +  YN   H+  + TP  
Subjt:  FIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTP--

Query:  ----YAPGLTP------NDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDV-VFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRV
            Y+P L+P      +D  +   Q   ++  T+K HL     +MKK+ D+   ++  F  GDL+ +K    R ++    +  KL+  + GP+ VL + 
Subjt:  ----YAPGLTP------NDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDV-VFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRV

Query:  GEVAYLLDLPDTAR--IHPVFHVSQLKK
        G   Y LDLPD+ +      FHVS L+K
Subjt:  GEVAYLLDLPDTAR--IHPVFHVSQLKK

P0CT36 Transposon Tf2-3 polyprotein1.4e-8830.36Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M +G++ APA FQ  +N +        ++ + DDIL++SK  S H++H+  V + L+  NL  N  KC+F + +++++G+ +SEKG    QE I  +L+W
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKN-NFRWT---EEAFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKK-----
          PKN +ELR FLG   Y R+F+     +  PL  L KK+  ++WT    +A E +K+ +V+ PVL   DF     +ETDAS   +GAVLSQ        
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKN-NFRWT---EEAFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKK-----

Query:  PIAYFSQKLSETAKERSIYERELMAIVLAVEKWRHYLLG--HRFVVYTDQKAL--RHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIP
        P+ Y+S K+S+     S+ ++E++AI+ +++ WRHYL      F + TD + L  R   E       + +W + L  F+FEI YR G  N   DALSRI 
Subjt:  PIAYFSQKLSETAKERSIYERELMAIVLAVEKWRHYLLG--HRFVVYTDQKAL--RHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIP

Query:  AEAQ----------LNVINVPSLLD-----------------------------------------LETIGSPKDVQINSN------------HPP----
         E +          +N +N  S+ D                                          + I  P D Q+               HP     
Subjt:  AEAQ----------LNVINVPSLLD-----------------------------------------LETIGSPKDVQINSN------------HPP----

Query:  ---HFPRF---------------------------RAWGPFRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATI
              RF                           + +GP + P+P   R W+ +SMDFI  LP S G++ + VVVD  SK A  +      +A+  A +
Subjt:  ---HFPRF---------------------------RAWGPFRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATI

Query:  FIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTP--
        F + V+   G P+ I++D D +F S  WK+        +K S  Y PQTDGQ+E  N++VE  L C C   P TW + ++  +  YN   H+  + TP  
Subjt:  FIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTP--

Query:  ----YAPGLTP------NDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDV-VFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRV
            Y+P L+P      +D  +   Q   ++  T+K HL     +MKK+ D+   ++  F  GDL+ +K    R ++    +  KL+  + GP+ VL + 
Subjt:  ----YAPGLTP------NDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDV-VFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRV

Query:  GEVAYLLDLPDTAR--IHPVFHVSQLKK
        G   Y LDLPD+ +      FHVS L+K
Subjt:  GEVAYLLDLPDTAR--IHPVFHVSQLKK

P0CT37 Transposon Tf2-4 polyprotein1.4e-8830.36Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M +G++ APA FQ  +N +        ++ + DDIL++SK  S H++H+  V + L+  NL  N  KC+F + +++++G+ +SEKG    QE I  +L+W
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKN-NFRWT---EEAFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKK-----
          PKN +ELR FLG   Y R+F+     +  PL  L KK+  ++WT    +A E +K+ +V+ PVL   DF     +ETDAS   +GAVLSQ        
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKN-NFRWT---EEAFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKK-----

Query:  PIAYFSQKLSETAKERSIYERELMAIVLAVEKWRHYLLG--HRFVVYTDQKAL--RHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIP
        P+ Y+S K+S+     S+ ++E++AI+ +++ WRHYL      F + TD + L  R   E       + +W + L  F+FEI YR G  N   DALSRI 
Subjt:  PIAYFSQKLSETAKERSIYERELMAIVLAVEKWRHYLLG--HRFVVYTDQKAL--RHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIP

Query:  AEAQ----------LNVINVPSLLD-----------------------------------------LETIGSPKDVQINSN------------HPP----
         E +          +N +N  S+ D                                          + I  P D Q+               HP     
Subjt:  AEAQ----------LNVINVPSLLD-----------------------------------------LETIGSPKDVQINSN------------HPP----

Query:  ---HFPRF---------------------------RAWGPFRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATI
              RF                           + +GP + P+P   R W+ +SMDFI  LP S G++ + VVVD  SK A  +      +A+  A +
Subjt:  ---HFPRF---------------------------RAWGPFRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATI

Query:  FIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTP--
        F + V+   G P+ I++D D +F S  WK+        +K S  Y PQTDGQ+E  N++VE  L C C   P TW + ++  +  YN   H+  + TP  
Subjt:  FIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTP--

Query:  ----YAPGLTP------NDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDV-VFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRV
            Y+P L+P      +D  +   Q   ++  T+K HL     +MKK+ D+   ++  F  GDL+ +K    R ++    +  KL+  + GP+ VL + 
Subjt:  ----YAPGLTP------NDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDV-VFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRV

Query:  GEVAYLLDLPDTAR--IHPVFHVSQLKK
        G   Y LDLPD+ +      FHVS L+K
Subjt:  GEVAYLLDLPDTAR--IHPVFHVSQLKK

P0CT41 Transposon Tf2-12 polyprotein1.4e-8830.36Show/hide
Query:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW
        M +G++ APA FQ  +N +        ++ + DDIL++SK  S H++H+  V + L+  NL  N  KC+F + +++++G+ +SEKG    QE I  +L+W
Subjt:  MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEW

Query:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKN-NFRWT---EEAFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKK-----
          PKN +ELR FLG   Y R+F+     +  PL  L KK+  ++WT    +A E +K+ +V+ PVL   DF     +ETDAS   +GAVLSQ        
Subjt:  PIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKN-NFRWT---EEAFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKK-----

Query:  PIAYFSQKLSETAKERSIYERELMAIVLAVEKWRHYLLG--HRFVVYTDQKAL--RHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIP
        P+ Y+S K+S+     S+ ++E++AI+ +++ WRHYL      F + TD + L  R   E       + +W + L  F+FEI YR G  N   DALSRI 
Subjt:  PIAYFSQKLSETAKERSIYERELMAIVLAVEKWRHYLLG--HRFVVYTDQKAL--RHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIP

Query:  AEAQ----------LNVINVPSLLD-----------------------------------------LETIGSPKDVQINSN------------HPP----
         E +          +N +N  S+ D                                          + I  P D Q+               HP     
Subjt:  AEAQ----------LNVINVPSLLD-----------------------------------------LETIGSPKDVQINSN------------HPP----

Query:  ---HFPRF---------------------------RAWGPFRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATI
              RF                           + +GP + P+P   R W+ +SMDFI  LP S G++ + VVVD  SK A  +      +A+  A +
Subjt:  ---HFPRF---------------------------RAWGPFRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATI

Query:  FIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTP--
        F + V+   G P+ I++D D +F S  WK+        +K S  Y PQTDGQ+E  N++VE  L C C   P TW + ++  +  YN   H+  + TP  
Subjt:  FIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTDGQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTP--

Query:  ----YAPGLTP------NDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDV-VFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRV
            Y+P L+P      +D  +   Q   ++  T+K HL     +MKK+ D+   ++  F  GDL+ +K    R ++    +  KL+  + GP+ VL + 
Subjt:  ----YAPGLTP------NDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDV-VFDIGDLLYLKVQPYRQQSLAKKRCEKLSSKYFGPYMVLGRV

Query:  GEVAYLLDLPDTAR--IHPVFHVSQLKK
        G   Y LDLPD+ +      FHVS L+K
Subjt:  GEVAYLLDLPDTAR--IHPVFHVSQLKK

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein4.7e-3657.25Show/hide
Query:  LEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLG--HWVSEKGVEADQEKIRNMLEWPIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFR
        + HL +V ++  QH  +AN KKC F + +I YLG  H +S +GV AD  K+  M+ WP PKN  ELRGFLGLTGYYRRFV NYG I  PLT L KKN+ +
Subjt:  LEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLG--HWVSEKGVEADQEKIRNMLEWPIPKNIRELRGFLGLTGYYRRFVANYGAIATPLTRLTKKNNFR

Query:  WTEE---AFELLKKAMVTLPVLALPDFQLPF
        WTE    AF+ LK A+ TLPVLALPD +LPF
Subjt:  WTEE---AFELLKKAMVTLPVLALPDFQLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTCGGTCTCACGAACGCTCCTGCGACGTTCCAAGCTCTAATGAATCAGGTATTTAGACCTTACCTGCGTAAATTCTTGCTGGTATTTTTCGATGACATTCTGGT
CTACAGTAAAGACGCTTCAACCCACTTAGAGCATCTCACCTTGGTGTTTCGATTATTAAGACAGCACAACCTATTCGCAAATAGCAAGAAGTGCCAGTTTGCCAAGGACC
GGATTGAGTACTTGGGCCATTGGGTGTCGGAAAAAGGGGTAGAAGCTGATCAAGAGAAGATTAGGAATATGCTGGAATGGCCAATTCCGAAAAACATAAGAGAATTGCGA
GGGTTCTTAGGATTAACGGGCTACTATAGACGTTTTGTGGCGAATTATGGTGCCATCGCCACCCCCCTAACTCGGTTGACAAAGAAAAACAACTTTCGATGGACAGAAGA
AGCATTTGAATTATTGAAGAAAGCAATGGTCACGTTACCAGTCCTAGCACTACCAGATTTTCAATTACCATTTGAAATAGAAACGGACGCTTCAGGGTTCGGTCTCGGGG
CGGTTTTATCACAAAACAAGAAACCTATTGCCTACTTTAGCCAGAAACTTTCCGAGACCGCCAAAGAGAGATCCATTTATGAGAGAGAATTGATGGCAATTGTGTTAGCA
GTAGAGAAATGGCGACACTACTTGTTGGGGCACCGCTTTGTGGTATACACTGACCAAAAAGCCTTGCGGCACATTTGGGAGCAAAGGGAGATTTTGCCGGGCATCCAAAA
ATGGGTAATGAAATTGATCGGCTTCGATTTCGAAATTTTTTATCGAGCAGGACAGAAGAACAAAGCAGTAGATGCACTTTCGCGAATTCCAGCCGAGGCACAGCTGAATG
TGATTAACGTACCCTCCTTGTTGGACCTCGAGACAATTGGTTCTCCCAAAGATGTCCAGATTAATTCCAACCATCCTCCACACTTTCCACGATTCCGTGCTTGGGGGCCA
TTCAGGACACCCCTCCCCATTCCCAACAGAATTTGGAAAGATATATCGATGGACTTTATAGAAGGGTTGCCGCGATCCAAGGGATTTGACACGATCCTTGTGGTGGTGGA
TCACCTTAGTAAGTACGCTCACTTCTTAACATTGGGACATCCATTTTCTGCCAAAACAGTAGCAACGATATTCATAAAGGAAGTGGTTCGTCTCCATGGGTACCCCCGCT
CCATTGTATCCGATAGGGACCGACTGTTCTTGAGTCACTTTTGGAAGGAGATGTTCCGTTTGCAAGGAATCCAATTGAAGAGAAGCACAGCCTACCACCCGCAAACGGAT
GGGCAGTCGGAAGTGGTGAACAAGAGTGTGGAGTTGTATTTGTGGTGTTTTTGTCACGAAAAACCAACGACATGGAGTGAAAAACTAGCTTGGGCAGAGTACTGGTACAA
CACCACCTTCCACGCGTTCATACGAAGCACCCCTTATGCACCAGGGCTTACTCCAAATGATTCAGTAGAATCGCAGTTGCAGGCCAGAGACGAGCTGCTGATTACTTTAA
AGAATCACTTACAACACGCCCAAGCACAAATGAAGAAATTTGCGGATGTACACCATTGGGACGTGGTGTTCGACATTGGCGATTTGCTCTATTTAAAAGTACAGCCGTAT
AGACAGCAGTCATTGGCTAAAAAACGATGCGAAAAGTTATCCTCAAAATATTTTGGACCCTACATGGTATTAGGAAGGGTAGGGGAAGTAGCCTATCTGCTGGATCTACC
AGACACTGCACGAATCCACCCTGTGTTCCACGTGTCACAACTGAAAAAGGCGAATGATACCGAACGTGATTGGGAGTATTTAGTTGAGTGGAAGGATCAACCGGCACACG
AAGCCACGTGGGAATCGTACGCGGTGCTTACAAACCAATATCCCGAATTCCACCTTGAGGACAAGGTGTCTCTTCTCCGAGAGGGTATTGCAAGGCCCCCAATCATCCAT
AGTAGTAGGATGGAGCTATATAGTCTGGTATTTTGTGCTAGGGTTGTTATTGAAAACATTAGGTATAATTTTGGGATGAACTTTGTTTCATTGGGAGAGACACAGCCCTC
TTATTTGGCTGGAATCTTATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTCGGTCTCACGAACGCTCCTGCGACGTTCCAAGCTCTAATGAATCAGGTATTTAGACCTTACCTGCGTAAATTCTTGCTGGTATTTTTCGATGACATTCTGGT
CTACAGTAAAGACGCTTCAACCCACTTAGAGCATCTCACCTTGGTGTTTCGATTATTAAGACAGCACAACCTATTCGCAAATAGCAAGAAGTGCCAGTTTGCCAAGGACC
GGATTGAGTACTTGGGCCATTGGGTGTCGGAAAAAGGGGTAGAAGCTGATCAAGAGAAGATTAGGAATATGCTGGAATGGCCAATTCCGAAAAACATAAGAGAATTGCGA
GGGTTCTTAGGATTAACGGGCTACTATAGACGTTTTGTGGCGAATTATGGTGCCATCGCCACCCCCCTAACTCGGTTGACAAAGAAAAACAACTTTCGATGGACAGAAGA
AGCATTTGAATTATTGAAGAAAGCAATGGTCACGTTACCAGTCCTAGCACTACCAGATTTTCAATTACCATTTGAAATAGAAACGGACGCTTCAGGGTTCGGTCTCGGGG
CGGTTTTATCACAAAACAAGAAACCTATTGCCTACTTTAGCCAGAAACTTTCCGAGACCGCCAAAGAGAGATCCATTTATGAGAGAGAATTGATGGCAATTGTGTTAGCA
GTAGAGAAATGGCGACACTACTTGTTGGGGCACCGCTTTGTGGTATACACTGACCAAAAAGCCTTGCGGCACATTTGGGAGCAAAGGGAGATTTTGCCGGGCATCCAAAA
ATGGGTAATGAAATTGATCGGCTTCGATTTCGAAATTTTTTATCGAGCAGGACAGAAGAACAAAGCAGTAGATGCACTTTCGCGAATTCCAGCCGAGGCACAGCTGAATG
TGATTAACGTACCCTCCTTGTTGGACCTCGAGACAATTGGTTCTCCCAAAGATGTCCAGATTAATTCCAACCATCCTCCACACTTTCCACGATTCCGTGCTTGGGGGCCA
TTCAGGACACCCCTCCCCATTCCCAACAGAATTTGGAAAGATATATCGATGGACTTTATAGAAGGGTTGCCGCGATCCAAGGGATTTGACACGATCCTTGTGGTGGTGGA
TCACCTTAGTAAGTACGCTCACTTCTTAACATTGGGACATCCATTTTCTGCCAAAACAGTAGCAACGATATTCATAAAGGAAGTGGTTCGTCTCCATGGGTACCCCCGCT
CCATTGTATCCGATAGGGACCGACTGTTCTTGAGTCACTTTTGGAAGGAGATGTTCCGTTTGCAAGGAATCCAATTGAAGAGAAGCACAGCCTACCACCCGCAAACGGAT
GGGCAGTCGGAAGTGGTGAACAAGAGTGTGGAGTTGTATTTGTGGTGTTTTTGTCACGAAAAACCAACGACATGGAGTGAAAAACTAGCTTGGGCAGAGTACTGGTACAA
CACCACCTTCCACGCGTTCATACGAAGCACCCCTTATGCACCAGGGCTTACTCCAAATGATTCAGTAGAATCGCAGTTGCAGGCCAGAGACGAGCTGCTGATTACTTTAA
AGAATCACTTACAACACGCCCAAGCACAAATGAAGAAATTTGCGGATGTACACCATTGGGACGTGGTGTTCGACATTGGCGATTTGCTCTATTTAAAAGTACAGCCGTAT
AGACAGCAGTCATTGGCTAAAAAACGATGCGAAAAGTTATCCTCAAAATATTTTGGACCCTACATGGTATTAGGAAGGGTAGGGGAAGTAGCCTATCTGCTGGATCTACC
AGACACTGCACGAATCCACCCTGTGTTCCACGTGTCACAACTGAAAAAGGCGAATGATACCGAACGTGATTGGGAGTATTTAGTTGAGTGGAAGGATCAACCGGCACACG
AAGCCACGTGGGAATCGTACGCGGTGCTTACAAACCAATATCCCGAATTCCACCTTGAGGACAAGGTGTCTCTTCTCCGAGAGGGTATTGCAAGGCCCCCAATCATCCAT
AGTAGTAGGATGGAGCTATATAGTCTGGTATTTTGTGCTAGGGTTGTTATTGAAAACATTAGGTATAATTTTGGGATGAACTTTGTTTCATTGGGAGAGACACAGCCCTC
TTATTTGGCTGGAATCTTATCTTGA
Protein sequenceShow/hide protein sequence
MSFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSKDASTHLEHLTLVFRLLRQHNLFANSKKCQFAKDRIEYLGHWVSEKGVEADQEKIRNMLEWPIPKNIRELR
GFLGLTGYYRRFVANYGAIATPLTRLTKKNNFRWTEEAFELLKKAMVTLPVLALPDFQLPFEIETDASGFGLGAVLSQNKKPIAYFSQKLSETAKERSIYERELMAIVLA
VEKWRHYLLGHRFVVYTDQKALRHIWEQREILPGIQKWVMKLIGFDFEIFYRAGQKNKAVDALSRIPAEAQLNVINVPSLLDLETIGSPKDVQINSNHPPHFPRFRAWGP
FRTPLPIPNRIWKDISMDFIEGLPRSKGFDTILVVVDHLSKYAHFLTLGHPFSAKTVATIFIKEVVRLHGYPRSIVSDRDRLFLSHFWKEMFRLQGIQLKRSTAYHPQTD
GQSEVVNKSVELYLWCFCHEKPTTWSEKLAWAEYWYNTTFHAFIRSTPYAPGLTPNDSVESQLQARDELLITLKNHLQHAQAQMKKFADVHHWDVVFDIGDLLYLKVQPY
RQQSLAKKRCEKLSSKYFGPYMVLGRVGEVAYLLDLPDTARIHPVFHVSQLKKANDTERDWEYLVEWKDQPAHEATWESYAVLTNQYPEFHLEDKVSLLREGIARPPIIH
SSRMELYSLVFCARVVIENIRYNFGMNFVSLGETQPSYLAGILS