; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G20130 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G20130
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr3:15925829..15937128
RNA-Seq ExpressionCSPI03G20130
SyntenyCSPI03G20130
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0030154 - cell differentiation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR011989 - Armadillo-like helical
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8637598.1 hypothetical protein CSA_022681 [Cucumis sativus]6.8e-29665.47Show/hide
Query:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------
        ML  GVIRPS SPYS+PVLLVKKKDG WRFCVDYRKLNQVT +DKFPIPVIEELLDELHGAT FSKLDLKSGYHQIRM+EED E                
Subjt:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------

Query:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ
          G  +A  T +   N +                     DIDEH KHLGMVFA+LRD++LFANR KCVIAHS++QYLGH IS RGVEADE+KI+SM  W 
Subjt:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ

Query:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ
        +PKD+T LRGFLGLTGY RRFVK YGEIAAPL+KLLQKN+F W EEAT AF+ LKLAMTTLPVLALPDW+ PF +ETDASG+GLG VLS++GHP+AFFSQ
Subjt:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ

Query:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ
        KLS RAQ KSIYERELMAVVL+VQKWRHYLLGRKFTI+SDQKALKFLLEQREVQ QFQKWLT LLGYDFEILYQPG QNKVADALSRK+   ELN++TT 
Subjt:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ

Query:  GIVDMEI-----EKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ
        GIVD+EI     E D+ELQKII +L+   ++  KYQW  GRLLYKGR+VLP+ SSLIPSLLHTFHDSILGGHSG LRTYKRMSGEL W+GMK D+K+YV+
Subjt:  GIVDMEI-----EKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ

Query:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF
        +C+ CQRNKFEATKP GVLQP+PIP++ILEDWTMDFIEG                                  VA +F+++VVSKHGIPKS+I+ RDK+F
Subjt:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF

Query:  LSNFWKELFASMGTILKRSTTFHPQTDDRLK----------------------EFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV
        LSNFWKELF +MGTILKRST FHPQTD + +                      + IPWAELWYNTTFHASTK TP+Q+V+G  PPPLLSYG +++PNN+V
Subjt:  LSNFWKELFASMGTILKRSTTFHPQTDDRLK----------------------EFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV

Query:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGPRPGIFQL----YREELGLPQVLLCRVHFL
        E +LKERD+A++AL+ENLC+A NRMKKMAD  RRELKFK GDEV+LKLR Y QRSLA+++CEKL PK+YGP   I ++    YR +L  P+  +  V  +
Subjt:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGPRPGIFQL----YREELGLPQVLLCRVHFL

Query:  KEMLL
         ++ L
Subjt:  KEMLL

TYJ96663.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.7e-29467.7Show/hide
Query:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------
        ML  G+IRPSHSP+S+PVLLVKKKDG WRFCVDYRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EED E                
Subjt:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------

Query:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ
          G  +A  T +   N +                    +DI EHEKHLGMVFA LRDNQL+ANRKKCV AHS+I YLGH ISK GVEAD++K+KSM +W 
Subjt:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ

Query:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ
        +PKDVT LRGFLGLTGY RRFVKGYGEIAAPL+KLLQKN+F+W E AT AFE+LK AM+T+PVLALPDW+LPFM+ETDASG GLG VLS+N HP+AFFSQ
Subjt:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ

Query:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ
        KLS+RAQ KSIYERELMAVVL+VQKWRHYLLGR+FTI+SDQKALKFLLEQREVQ QFQKWLT LLGYDFEILYQPG QNK ADALSR +   EL +L+T 
Subjt:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ

Query:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ
        GIVDM     E+EKDEELQ +I++L+ N     KY    G L+YKGRVVL K+SS+IPSLLHTFHDSILGGHSG LRTYKRMSGEL W+GMK D+KKYV+
Subjt:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ

Query:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF
        QCEICQRNK EATKP GVLQPLPIP+RILEDWTMDFIEG                                  VA+ FID++V +HGIPKS+IS RDK+F
Subjt:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF

Query:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV
        +SNFWKELF +M TILKRST FHPQTD                      ++  +FIPWAELWYNTTFH+ST+ TPFQ VYG PPPPL+SYGD+KTPN+EV
Subjt:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV

Query:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP
        E +LKERD+AISALKENL +A NRMKK AD KRRELKFK GDEV+LKLR Y QRSLA++R EKL PKYYGP
Subjt:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP

TYK21035.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.7e-29467.7Show/hide
Query:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------
        ML  G+IRPSHSP+S+PVLLVKKKDG WRFCVDYRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EED E                
Subjt:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------

Query:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ
          G  +A  T +   N +                    +DI EHEKHLGMVFA LRDNQL+ANRKKCV AHS+I YLGH ISK GVEAD++K+KSM +W 
Subjt:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ

Query:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ
        +PKDVT LRGFLGLTGY RRFVKGYGEIAAPL+KLLQKN+F+W E AT AFE+LK AM+T+PVLALPDW+LPFM+ETDASG GLG VLS+N HP+AFFSQ
Subjt:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ

Query:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ
        KLS+RAQ KSIYERELMAVVL+VQKWRHYLLGR+FTI+SDQKALKFLLEQREVQ QFQKWLT LLGYDFEILYQPG QNK ADALSR +   EL +L+T 
Subjt:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ

Query:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ
        GIVDM     E+EKDEELQ +I++L+ N     KY    G L+YKGRVVL K+SS+IPSLLHTFHDSILGGHSG LRTYKRMSGEL W+GMK D+KKYV+
Subjt:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ

Query:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF
        QCEICQRNK EATKP GVLQPLPIP+RILEDWTMDFIEG                                  VA+ FID++V +HGIPKS+IS RDK+F
Subjt:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF

Query:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV
        +SNFWKELF +M TILKRST FHPQTD                      ++  +FIPWAELWYNTTFH+ST+ TPFQ VYG PPPPL+SYGD+KTPN+EV
Subjt:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV

Query:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP
        E +LKERD+AISALKENL +A NRMKK AD KRRELKFK GDEV+LKLR Y QRSLA++R EKL PKYYGP
Subjt:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP

TYK27058.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.7e-29467.7Show/hide
Query:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------
        ML  G+IRPSHSP+S+PVLLVKKKDG WRFCVDYRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EED E                
Subjt:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------

Query:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ
          G  +A  T +   N +                    +DI EHEKHLGMVFA LRDNQL+ANRKKCV AHS+I YLGH ISK GVEAD++K+KSM +W 
Subjt:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ

Query:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ
        +PKDVT LRGFLGLTGY RRFVKGYGEIAAPL+KLLQKN+F+W E AT AFE+LK AM+T+PVLALPDW+LPFM+ETDASG GLG VLS+N HP+AFFSQ
Subjt:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ

Query:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ
        KLS+RAQ KSIYERELMAVVL+VQKWRHYLLGR+FTI+SDQKALKFLLEQREVQ QFQKWLT LLGYDFEILYQPG QNK ADALSR +   EL +L+T 
Subjt:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ

Query:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ
        GIVDM     E+EKDEELQ +I++L+ N     KY    G L+YKGRVVL K+SS+IPSLLHTFHDSILGGHSG LRTYKRMSGEL W+GMK D+KKYV+
Subjt:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ

Query:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF
        QCEICQRNK EATKP GVLQPLPIP+RILEDWTMDFIEG                                  VA+ FID++V +HGIPKS+IS RDK+F
Subjt:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF

Query:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV
        +SNFWKELF +M TILKRST FHPQTD                      ++  +FIPWAELWYNTTFH+ST+ TPFQ VYG PPPPL+SYGD+KTPN+EV
Subjt:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV

Query:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP
        E +LKERD+AISALKENL +A NRMKK AD KRRELKFK GDEV+LKLR Y QRSLA++R EKL PKYYGP
Subjt:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.7e-29467.7Show/hide
Query:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------
        ML  G+IRPSHSP+S+PVLLVKKKDG WRFCVDYRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EED E                
Subjt:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------

Query:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ
          G  +A  T +   N +                    +DI EHEKHLGMVFA LRDNQL+ANRKKCV AHS+I YLGH ISK GVEAD++K+KSM +W 
Subjt:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ

Query:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ
        +PKDVT LRGFLGLTGY RRFVKGYGEIAAPL+KLLQKN+F+W E AT AFE+LK AM+T+PVLALPDW+LPFM+ETDASG GLG VLS+N HP+AFFSQ
Subjt:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ

Query:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ
        KLS+RAQ KSIYERELMAVVL+VQKWRHYLLGR+FTI+SDQKALKFLLEQREVQ QFQKWLT LLGYDFEILYQPG QNK ADALSR +   EL +L+T 
Subjt:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ

Query:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ
        GIVDM     E+EKDEELQ +I++L+ N     KY    G L+YKGRVVL K+SS+IPSLLHTFHDSILGGHSG LRTYKRMSGEL W+GMK D+KKYV+
Subjt:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ

Query:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF
        QCEICQRNK EATKP GVLQPLPIP+RILEDWTMDFIEG                                  VA+ FID++V +HGIPKS+IS RDK+F
Subjt:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF

Query:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV
        +SNFWKELF +M TILKRST FHPQTD                      ++  +FIPWAELWYNTTFH+ST+ TPFQ VYG PPPPL+SYGD+KTPN+EV
Subjt:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV

Query:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP
        E +LKERD+AISALKENL +A NRMKK AD KRRELKFK GDEV+LKLR Y QRSLA++R EKL PKYYGP
Subjt:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP

TrEMBL top hitse value%identityAlignment
A0A5D3BBH7 Ty3/gypsy retrotransposon protein1.8e-29467.7Show/hide
Query:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------
        ML  G+IRPSHSP+S+PVLLVKKKDG WRFCVDYRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EED E                
Subjt:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------

Query:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ
          G  +A  T +   N +                    +DI EHEKHLGMVFA LRDNQL+ANRKKCV AHS+I YLGH ISK GVEAD++K+KSM +W 
Subjt:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ

Query:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ
        +PKDVT LRGFLGLTGY RRFVKGYGEIAAPL+KLLQKN+F+W E AT AFE+LK AM+T+PVLALPDW+LPFM+ETDASG GLG VLS+N HP+AFFSQ
Subjt:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ

Query:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ
        KLS+RAQ KSIYERELMAVVL+VQKWRHYLLGR+FTI+SDQKALKFLLEQREVQ QFQKWLT LLGYDFEILYQPG QNK ADALSR +   EL +L+T 
Subjt:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ

Query:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ
        GIVDM     E+EKDEELQ +I++L+ N     KY    G L+YKGRVVL K+SS+IPSLLHTFHDSILGGHSG LRTYKRMSGEL W+GMK D+KKYV+
Subjt:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ

Query:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF
        QCEICQRNK EATKP GVLQPLPIP+RILEDWTMDFIEG                                  VA+ FID++V +HGIPKS+IS RDK+F
Subjt:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF

Query:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV
        +SNFWKELF +M TILKRST FHPQTD                      ++  +FIPWAELWYNTTFH+ST+ TPFQ VYG PPPPL+SYGD+KTPN+EV
Subjt:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV

Query:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP
        E +LKERD+AISALKENL +A NRMKK AD KRRELKFK GDEV+LKLR Y QRSLA++R EKL PKYYGP
Subjt:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP

A0A5D3DU86 Ty3/gypsy retrotransposon protein1.8e-29467.7Show/hide
Query:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------
        ML  G+IRPSHSP+S+PVLLVKKKDG WRFCVDYRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EED E                
Subjt:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------

Query:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ
          G  +A  T +   N +                    +DI EHEKHLGMVFA LRDNQL+ANRKKCV AHS+I YLGH ISK GVEAD++K+KSM +W 
Subjt:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ

Query:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ
        +PKDVT LRGFLGLTGY RRFVKGYGEIAAPL+KLLQKN+F+W E AT AFE+LK AM+T+PVLALPDW+LPFM+ETDASG GLG VLS+N HP+AFFSQ
Subjt:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ

Query:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ
        KLS+RAQ KSIYERELMAVVL+VQKWRHYLLGR+FTI+SDQKALKFLLEQREVQ QFQKWLT LLGYDFEILYQPG QNK ADALSR +   EL +L+T 
Subjt:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ

Query:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ
        GIVDM     E+EKDEELQ +I++L+ N     KY    G L+YKGRVVL K+SS+IPSLLHTFHDSILGGHSG LRTYKRMSGEL W+GMK D+KKYV+
Subjt:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ

Query:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF
        QCEICQRNK EATKP GVLQPLPIP+RILEDWTMDFIEG                                  VA+ FID++V +HGIPKS+IS RDK+F
Subjt:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF

Query:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV
        +SNFWKELF +M TILKRST FHPQTD                      ++  +FIPWAELWYNTTFH+ST+ TPFQ VYG PPPPL+SYGD+KTPN+EV
Subjt:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV

Query:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP
        E +LKERD+AISALKENL +A NRMKK AD KRRELKFK GDEV+LKLR Y QRSLA++R EKL PKYYGP
Subjt:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP

A0A5D3DWA9 Ty3/gypsy retrotransposon protein1.8e-29467.7Show/hide
Query:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------
        ML  G+IRPSHSP+S+PVLLVKKKDG WRFCVDYRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EED E                
Subjt:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------

Query:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ
          G  +A  T +   N +                    +DI EHEKHLGMVFA LRDNQL+ANRKKCV AHS+I YLGH ISK GVEAD++K+KSM +W 
Subjt:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ

Query:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ
        +PKDVT LRGFLGLTGY RRFVKGYGEIAAPL+KLLQKN+F+W E AT AFE+LK AM+T+PVLALPDW+LPFM+ETDASG GLG VLS+N HP+AFFSQ
Subjt:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ

Query:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ
        KLS+RAQ KSIYERELMAVVL+VQKWRHYLLGR+FTI+SDQKALKFLLEQREVQ QFQKWLT LLGYDFEILYQPG QNK ADALSR +   EL +L+T 
Subjt:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ

Query:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ
        GIVDM     E+EKDEELQ +I++L+ N     KY    G L+YKGRVVL K+SS+IPSLLHTFHDSILGGHSG LRTYKRMSGEL W+GMK D+KKYV+
Subjt:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ

Query:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF
        QCEICQRNK EATKP GVLQPLPIP+RILEDWTMDFIEG                                  VA+ FID++V +HGIPKS+IS RDK+F
Subjt:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF

Query:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV
        +SNFWKELF +M TILKRST FHPQTD                      ++  +FIPWAELWYNTTFH+ST+ TPFQ VYG PPPPL+SYGD+KTPN+EV
Subjt:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV

Query:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP
        E +LKERD+AISALKENL +A NRMKK AD KRRELKFK GDEV+LKLR Y QRSLA++R EKL PKYYGP
Subjt:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP

A0A5D3DZK6 Ty3/gypsy retrotransposon protein1.8e-29467.7Show/hide
Query:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------
        ML  G+IRPSHSP+S+PVLLVKKKDG WRFCVDYRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EED E                
Subjt:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------

Query:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ
          G  +A  T +   N +                    +DI EHEKHLGMVFA LRDNQL+ANRKKCV AHS+I YLGH ISK GVEAD++K+KSM +W 
Subjt:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ

Query:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ
        +PKDVT LRGFLGLTGY RRFVKGYGEIAAPL+KLLQKN+F+W E AT AFE+LK AM+T+PVLALPDW+LPFM+ETDASG GLG VLS+N HP+AFFSQ
Subjt:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ

Query:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ
        KLS+RAQ KSIYERELMAVVL+VQKWRHYLLGR+FTI+SDQKALKFLLEQREVQ QFQKWLT LLGYDFEILYQPG QNK ADALSR +   EL +L+T 
Subjt:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ

Query:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ
        GIVDM     E+EKDEELQ +I++L+ N     KY    G L+YKGRVVL K+SS+IPSLLHTFHDSILGGHSG LRTYKRMSGEL W+GMK D+KKYV+
Subjt:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ

Query:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF
        QCEICQRNK EATKP GVLQPLPIP+RILEDWTMDFIEG                                  VA+ FID++V +HGIPKS+IS RDK+F
Subjt:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF

Query:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV
        +SNFWKELF +M TILKRST FHPQTD                      ++  +FIPWAELWYNTTFH+ST+ TPFQ VYG PPPPL+SYGD+KTPN+EV
Subjt:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV

Query:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP
        E +LKERD+AISALKENL +A NRMKK AD KRRELKFK GDEV+LKLR Y QRSLA++R EKL PKYYGP
Subjt:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP

A0A5D3E325 Ty3/gypsy retrotransposon protein1.8e-29467.7Show/hide
Query:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------
        ML  G+IRPSHSP+S+PVLLVKKKDG WRFCVDYRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EED E                
Subjt:  MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED---------------

Query:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ
          G  +A  T +   N +                    +DI EHEKHLGMVFA LRDNQL+ANRKKCV AHS+I YLGH ISK GVEAD++K+KSM +W 
Subjt:  --GFQDAQGTLRIYGNAI-------------------CADIDEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQ

Query:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ
        +PKDVT LRGFLGLTGY RRFVKGYGEIAAPL+KLLQKN+F+W E AT AFE+LK AM+T+PVLALPDW+LPFM+ETDASG GLG VLS+N HP+AFFSQ
Subjt:  QPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQ

Query:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ
        KLS+RAQ KSIYERELMAVVL+VQKWRHYLLGR+FTI+SDQKALKFLLEQREVQ QFQKWLT LLGYDFEILYQPG QNK ADALSR +   EL +L+T 
Subjt:  KLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSRKEQLPELNSLTTQ

Query:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ
        GIVDM     E+EKDEELQ +I++L+ N     KY    G L+YKGRVVL K+SS+IPSLLHTFHDSILGGHSG LRTYKRMSGEL W+GMK D+KKYV+
Subjt:  GIVDM-----EIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKRMSGELHWQGMKTDVKKYVQ

Query:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF
        QCEICQRNK EATKP GVLQPLPIP+RILEDWTMDFIEG                                  VA+ FID++V +HGIPKS+IS RDK+F
Subjt:  QCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEG----------------------------------VAVLFIDRVVSKHGIPKSLISGRDKVF

Query:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV
        +SNFWKELF +M TILKRST FHPQTD                      ++  +FIPWAELWYNTTFH+ST+ TPFQ VYG PPPPL+SYGD+KTPN+EV
Subjt:  LSNFWKELFASMGTILKRSTTFHPQTD----------------------DRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEV

Query:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP
        E +LKERD+AISALKENL +A NRMKK AD KRRELKFK GDEV+LKLR Y QRSLA++R EKL PKYYGP
Subjt:  ETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLAKRRCEKLGPKYYGP

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein7.0e-7829.43Show/hide
Query:  LAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED----------------
        L  G+IR S +  + PV+ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+T+F+KLDLKS YH IR+++ D                   
Subjt:  LAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED----------------

Query:  -GFQDAQGTLRIYGNAICADI-------------------DEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQQ
         G   A    + + N I  +                     EH KH+  V   L++  L  N+ KC    S+++++G+ IS++G    +E I  + +W+Q
Subjt:  -GFQDAQGTLRIYGNAICADI-------------------DEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQQ

Query:  PKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKN-SFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENG-----HPV
        PK+   LR FLG   Y R+F+    ++  PL+ LL+K+  ++W    TQA E +K  + + PVL   D++   ++ETDAS + +G VLS+       +PV
Subjt:  PKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKN-SFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENG-----HPV

Query:  AFFSQKLSSRAQTKSIYERELMAVVLAVQKWRHYLLG--RKFTIISDQKAL--KFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSR----
         ++S K+S      S+ ++E++A++ +++ WRHYL      F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR    
Subjt:  AFFSQKLSSRAQTKSIYERELMAVVLAVQKWRHYLLG--RKFTIISDQKAL--KFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSR----

Query:  KEQLPE------LNSLTTQGIVD-------MEIEKDEELQKIIKKLEMNHEETSKYQWEKGRLL-YKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRT
         E +P+      +N +    I D        E   D +L  ++   +   EE    Q + G L+  K +++LP  + L  +++  +H+     H G+   
Subjt:  KEQLPE------LNSLTTQGIVD-------MEIEKDEELQKIIKKLEMNHEETSKYQWEKGRLL-YKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRT

Query:  YKRMSGELHWQGMKTDVKKYVQQCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFI----------------------------------EGVAVLF
           +     W+G++  +++YVQ C  CQ NK    KP G LQP+P  ER  E  +MDFI                                  E  A +F
Subjt:  YKRMSGELHWQGMKTDVKKYVQQCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFI----------------------------------EGVAVLF

Query:  IDRVVSKHGIPKSLISGRDKVFLSNFWKELFASMGTILKRSTTFHPQTD
          RV++  G PK +I+  D +F S  WK+       ++K S  + PQTD
Subjt:  IDRVVSKHGIPKSLISGRDKVFLSNFWKELFASMGTILKRSTTFHPQTD

P0CT41 Transposon Tf2-12 polyprotein7.0e-7829.43Show/hide
Query:  LAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED----------------
        L  G+IR S +  + PV+ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+T+F+KLDLKS YH IR+++ D                   
Subjt:  LAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED----------------

Query:  -GFQDAQGTLRIYGNAICADI-------------------DEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQQ
         G   A    + + N I  +                     EH KH+  V   L++  L  N+ KC    S+++++G+ IS++G    +E I  + +W+Q
Subjt:  -GFQDAQGTLRIYGNAICADI-------------------DEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQQ

Query:  PKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKN-SFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENG-----HPV
        PK+   LR FLG   Y R+F+    ++  PL+ LL+K+  ++W    TQA E +K  + + PVL   D++   ++ETDAS + +G VLS+       +PV
Subjt:  PKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKN-SFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENG-----HPV

Query:  AFFSQKLSSRAQTKSIYERELMAVVLAVQKWRHYLLG--RKFTIISDQKAL--KFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSR----
         ++S K+S      S+ ++E++A++ +++ WRHYL      F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR    
Subjt:  AFFSQKLSSRAQTKSIYERELMAVVLAVQKWRHYLLG--RKFTIISDQKAL--KFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSR----

Query:  KEQLPE------LNSLTTQGIVD-------MEIEKDEELQKIIKKLEMNHEETSKYQWEKGRLL-YKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRT
         E +P+      +N +    I D        E   D +L  ++   +   EE    Q + G L+  K +++LP  + L  +++  +H+     H G+   
Subjt:  KEQLPE------LNSLTTQGIVD-------MEIEKDEELQKIIKKLEMNHEETSKYQWEKGRLL-YKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRT

Query:  YKRMSGELHWQGMKTDVKKYVQQCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFI----------------------------------EGVAVLF
           +     W+G++  +++YVQ C  CQ NK    KP G LQP+P  ER  E  +MDFI                                  E  A +F
Subjt:  YKRMSGELHWQGMKTDVKKYVQQCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFI----------------------------------EGVAVLF

Query:  IDRVVSKHGIPKSLISGRDKVFLSNFWKELFASMGTILKRSTTFHPQTD
          RV++  G PK +I+  D +F S  WK+       ++K S  + PQTD
Subjt:  IDRVVSKHGIPKSLISGRDKVFLSNFWKELFASMGTILKRSTTFHPQTD

P0CT42 Transposon Tf2-7 polyprotein4.1e-7829.43Show/hide
Query:  LAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED----------------
        L  G+IR S +  + PV+ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+T+F+KLDLKS YH IR+++ D                   
Subjt:  LAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED----------------

Query:  -GFQDAQGTLRIYGNAICADI-------------------DEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQQ
         G   A    + + N I  ++                    EH KH+  V   L++  L  N+ KC    S+++++G+ IS++G    +E I  + +W+Q
Subjt:  -GFQDAQGTLRIYGNAICADI-------------------DEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQQ

Query:  PKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKN-SFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENG-----HPV
        PK+   LR FLG   Y R+F+    ++  PL+ LL+K+  ++W    TQA E +K  + + PVL   D++   ++ETDAS + +G VLS+       +PV
Subjt:  PKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKN-SFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENG-----HPV

Query:  AFFSQKLSSRAQTKSIYERELMAVVLAVQKWRHYLLG--RKFTIISDQKAL--KFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSR----
         ++S K+S      S+ ++E++A++ +++ WRHYL      F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR    
Subjt:  AFFSQKLSSRAQTKSIYERELMAVVLAVQKWRHYLLG--RKFTIISDQKAL--KFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSR----

Query:  KEQLPE------LNSLTTQGIVD-------MEIEKDEELQKIIKKLEMNHEETSKYQWEKGRLL-YKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRT
         E +P+      +N +    I D        E   D +L  ++   +   EE    Q + G L+  K +++LP  + L  +++  +H+     H G+   
Subjt:  KEQLPE------LNSLTTQGIVD-------MEIEKDEELQKIIKKLEMNHEETSKYQWEKGRLL-YKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRT

Query:  YKRMSGELHWQGMKTDVKKYVQQCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFI----------------------------------EGVAVLF
           +     W+G++  +++YVQ C  CQ NK    KP G LQP+P  ER  E  +MDFI                                  E  A +F
Subjt:  YKRMSGELHWQGMKTDVKKYVQQCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFI----------------------------------EGVAVLF

Query:  IDRVVSKHGIPKSLISGRDKVFLSNFWKELFASMGTILKRSTTFHPQTD
          RV++  G PK +I+  D +F S  WK+       ++K S  + PQTD
Subjt:  IDRVVSKHGIPKSLISGRDKVFLSNFWKELFASMGTILKRSTTFHPQTD

P0CT43 Transposon Tf2-8 polyprotein4.1e-7829.43Show/hide
Query:  LAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED----------------
        L  G+IR S +  + PV+ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+T+F+KLDLKS YH IR+++ D                   
Subjt:  LAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED----------------

Query:  -GFQDAQGTLRIYGNAICADI-------------------DEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQQ
         G   A    + + N I  ++                    EH KH+  V   L++  L  N+ KC    S+++++G+ IS++G    +E I  + +W+Q
Subjt:  -GFQDAQGTLRIYGNAICADI-------------------DEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQQ

Query:  PKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKN-SFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENG-----HPV
        PK+   LR FLG   Y R+F+    ++  PL+ LL+K+  ++W    TQA E +K  + + PVL   D++   ++ETDAS + +G VLS+       +PV
Subjt:  PKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKN-SFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENG-----HPV

Query:  AFFSQKLSSRAQTKSIYERELMAVVLAVQKWRHYLLG--RKFTIISDQKAL--KFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSR----
         ++S K+S      S+ ++E++A++ +++ WRHYL      F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR    
Subjt:  AFFSQKLSSRAQTKSIYERELMAVVLAVQKWRHYLLG--RKFTIISDQKAL--KFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSR----

Query:  KEQLPE------LNSLTTQGIVD-------MEIEKDEELQKIIKKLEMNHEETSKYQWEKGRLL-YKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRT
         E +P+      +N +    I D        E   D +L  ++   +   EE    Q + G L+  K +++LP  + L  +++  +H+     H G+   
Subjt:  KEQLPE------LNSLTTQGIVD-------MEIEKDEELQKIIKKLEMNHEETSKYQWEKGRLL-YKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRT

Query:  YKRMSGELHWQGMKTDVKKYVQQCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFI----------------------------------EGVAVLF
           +     W+G++  +++YVQ C  CQ NK    KP G LQP+P  ER  E  +MDFI                                  E  A +F
Subjt:  YKRMSGELHWQGMKTDVKKYVQQCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFI----------------------------------EGVAVLF

Query:  IDRVVSKHGIPKSLISGRDKVFLSNFWKELFASMGTILKRSTTFHPQTD
          RV++  G PK +I+  D +F S  WK+       ++K S  + PQTD
Subjt:  IDRVVSKHGIPKSLISGRDKVFLSNFWKELFASMGTILKRSTTFHPQTD

Q9UR07 Transposon Tf2-11 polyprotein4.1e-7829.43Show/hide
Query:  LAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED----------------
        L  G+IR S +  + PV+ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+T+F+KLDLKS YH IR+++ D                   
Subjt:  LAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDRED----------------

Query:  -GFQDAQGTLRIYGNAICADI-------------------DEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQQ
         G   A    + + N I  ++                    EH KH+  V   L++  L  N+ KC    S+++++G+ IS++G    +E I  + +W+Q
Subjt:  -GFQDAQGTLRIYGNAICADI-------------------DEHEKHLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQQ

Query:  PKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKN-SFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENG-----HPV
        PK+   LR FLG   Y R+F+    ++  PL+ LL+K+  ++W    TQA E +K  + + PVL   D++   ++ETDAS + +G VLS+       +PV
Subjt:  PKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKN-SFQWIEEATQAFETLKLAMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENG-----HPV

Query:  AFFSQKLSSRAQTKSIYERELMAVVLAVQKWRHYLLG--RKFTIISDQKAL--KFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSR----
         ++S K+S      S+ ++E++A++ +++ WRHYL      F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR    
Subjt:  AFFSQKLSSRAQTKSIYERELMAVVLAVQKWRHYLLG--RKFTIISDQKAL--KFLLEQREVQLQFQKWLTNLLGYDFEILYQPGPQNKVADALSR----

Query:  KEQLPE------LNSLTTQGIVD-------MEIEKDEELQKIIKKLEMNHEETSKYQWEKGRLL-YKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRT
         E +P+      +N +    I D        E   D +L  ++   +   EE    Q + G L+  K +++LP  + L  +++  +H+     H G+   
Subjt:  KEQLPE------LNSLTTQGIVD-------MEIEKDEELQKIIKKLEMNHEETSKYQWEKGRLL-YKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRT

Query:  YKRMSGELHWQGMKTDVKKYVQQCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFI----------------------------------EGVAVLF
           +     W+G++  +++YVQ C  CQ NK    KP G LQP+P  ER  E  +MDFI                                  E  A +F
Subjt:  YKRMSGELHWQGMKTDVKKYVQQCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFI----------------------------------EGVAVLF

Query:  IDRVVSKHGIPKSLISGRDKVFLSNFWKELFASMGTILKRSTTFHPQTD
          RV++  G PK +I+  D +F S  WK+       ++K S  + PQTD
Subjt:  IDRVVSKHGIPKSLISGRDKVFLSNFWKELFASMGTILKRSTTFHPQTD

Arabidopsis top hitse value%identityAlignment
AT1G12930.1 ARM repeat superfamily protein2.2e-2664.13Show/hide
Query:  IFQLYREELGLPQVLLCRVHFLKEMLLLPSLSTGDEKVIGGLACLFSEVGQAAPSLIVDASAEALALADALLSCVAFPSEDWEIADSTLQFW
        + +L      LPQVLL +V FL++ LL P+L   D K+I GLACL SE+GQAAP LIV+AS+EAL L DA+LSCV FPSEDWEIADST+QFW
Subjt:  IFQLYREELGLPQVLLCRVHFLKEMLLLPSLSTGDEKVIGGLACLFSEVGQAAPSLIVDASAEALALADALLSCVAFPSEDWEIADSTLQFW

ATMG00850.1 DNA/RNA polymerases superfamily protein5.3e-0467.86Show/hide
Query:  MLAVGVIRPSHSPYSNPVLLVKKKDGEW
        ML   +I+PS SPYS+PVLLV+KKDG W
Subjt:  MLAVGVIRPSHSPYSNPVLLVKKKDGEW

ATMG00860.1 DNA/RNA polymerases superfamily protein1.4e-3657.69Show/hide
Query:  HLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQ--ISKRGVEADEEKIKSMTKWQQPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWI
        HLGMV  +   +Q +ANRKKC     +I YLGH+  IS  GV AD  K+++M  W +PK+ T LRGFLGLTGY RRFVK YG+I  PL++LL+KNS +W 
Subjt:  HLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQ--ISKRGVEADEEKIKSMTKWQQPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWI

Query:  EEATQAFETLKLAMTTLPVLALPDWTLPFM
        E A  AF+ LK A+TTLPVLALPD  LPF+
Subjt:  EEATQAFETLKLAMTTLPVLALPDWTLPFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGCAGTCGGAGTTATTAGACCAAGTCATAGCCCTTACTCCAACCCAGTCCTATTAGTAAAGAAGAAAGATGGGGAATGGAGGTTTTGTGTAGATTATAGAAAACT
GAACCAAGTTACAACTTCTGATAAATTTCCCATTCCAGTCATAGAAGAATTACTAGATGAACTGCATGGAGCTACTGTTTTTTCAAAGCTTGATCTGAAATCAGGATACC
ACCAAATCAGAATGAAGGAGGAAGACAGAGAAGACGGCTTTCAGGACGCACAAGGGACATTACGAATTTATGGTAATGCCATTTGCGCTGATATCGATGAGCATGAGAAA
CACTTGGGGATGGTTTTTGCTGTTTTGAGGGATAATCAACTCTTTGCAAACAGAAAGAAGTGTGTAATAGCCCACTCCAAGATCCAGTACTTGGGACATCAAATCTCCAA
AAGAGGAGTAGAAGCGGATGAGGAAAAAATTAAAAGCATGACAAAGTGGCAGCAACCAAAAGATGTGACCAGTCTGAGAGGATTTTTGGGCCTAACAGGATACTGTCGAA
GATTTGTTAAAGGGTATGGAGAGATTGCGGCACCTTTGTCGAAATTGTTGCAAAAGAATTCCTTCCAATGGATTGAAGAAGCCACACAGGCTTTCGAAACACTGAAACTT
GCTATGACTACTCTTCCGGTGCTGGCTTTACCTGATTGGACACTCCCCTTTATGGTGGAAACGGATGCTTCAGGAATCGGGTTAGGGGTGGTGCTATCTGAAAACGGCCA
TCCTGTAGCATTCTTCAGCCAAAAGTTATCATCAAGGGCTCAAACCAAATCCATATATGAAAGAGAGCTGATGGCAGTAGTCCTTGCAGTACAAAAATGGAGACACTATC
TACTGGGGAGGAAATTCACCATTATTTCAGATCAGAAGGCGTTAAAATTCTTGCTGGAGCAGAGGGAAGTGCAGCTGCAATTCCAGAAATGGTTAACTAATCTCCTCGGG
TATGATTTTGAAATACTATACCAACCGGGTCCTCAAAATAAAGTGGCGGACGCTTTATCAAGAAAGGAACAACTACCAGAATTGAACTCTTTGACAACGCAAGGGATTGT
TGACATGGAGATAGAAAAGGATGAAGAACTTCAGAAAATTATTAAGAAGTTGGAAATGAATCATGAGGAAACCAGCAAATATCAGTGGGAAAAAGGAAGATTATTATACA
AAGGTAGGGTGGTGCTTCCTAAAACATCCTCATTGATACCCAGCCTTCTTCACACCTTCCATGATTCGATTCTAGGGGGTCATTCCGGAGTTTTAAGGACATATAAACGA
ATGAGTGGAGAATTGCATTGGCAAGGCATGAAGACTGATGTAAAGAAATATGTGCAGCAGTGTGAGATCTGTCAGAGGAACAAGTTTGAAGCCACCAAACCAGTGGGAGT
TTTGCAGCCTCTGCCCATTCCAGAAAGAATTTTAGAAGACTGGACCATGGACTTTATAGAGGGCGTGGCTGTTTTGTTTATAGATAGGGTCGTGAGTAAGCATGGAATTC
CAAAATCCCTAATATCAGGCCGAGACAAAGTGTTCCTTAGCAACTTCTGGAAGGAATTGTTCGCTTCTATGGGAACCATCTTGAAGAGAAGCACAACCTTTCACCCGCAA
ACAGATGACAGACTGAAAGAATTCATTCCATGGGCTGAGCTATGGTATAATACCACCTTCCATGCCTCTACTAAGATCACTCCCTTTCAAGCTGTATATGGCATACCACC
CCCACCCCTGTTATCATACGGAGATCAAAAGACCCCAAATAATGAAGTGGAAACAATATTGAAAGAAAGGGACATGGCCATTAGTGCCTTGAAAGAAAACCTTTGTCTAG
CACATAATCGGATGAAGAAGATGGCTGACCTCAAAAGAAGGGAGCTCAAATTCAAGGAAGGTGATGAAGTGTTCTTGAAACTAAGACTCTATTGGCAGAGATCATTAGCA
AAGAGGAGGTGTGAAAAACTTGGTCCCAAATACTACGGACCCCGACCAGGGATATTCCAGCTCTATAGAGAGGAGTTGGGGTTGCCTCAGGTCTTGCTGTGCAGAGTTCA
TTTTCTTAAGGAAATGCTTCTTTTGCCTTCTCTTAGTACTGGAGACGAGAAAGTAATCGGTGGTCTGGCATGCTTGTTCTCAGAAGTTGGGCAAGCAGCACCATCCTTAA
TTGTAGATGCCAGTGCTGAAGCCCTTGCTCTTGCTGATGCTCTCTTGAGTTGTGTGGCTTTTCCAAGTGAAGATTGGGAGATTGCTGACTCAACATTACAATTTTGGTAT
TGTCATCTTCTTGCTAAAATTCTGAATTTCTTT
mRNA sequenceShow/hide mRNA sequence
ATGCTTGCAGTCGGAGTTATTAGACCAAGTCATAGCCCTTACTCCAACCCAGTCCTATTAGTAAAGAAGAAAGATGGGGAATGGAGGTTTTGTGTAGATTATAGAAAACT
GAACCAAGTTACAACTTCTGATAAATTTCCCATTCCAGTCATAGAAGAATTACTAGATGAACTGCATGGAGCTACTGTTTTTTCAAAGCTTGATCTGAAATCAGGATACC
ACCAAATCAGAATGAAGGAGGAAGACAGAGAAGACGGCTTTCAGGACGCACAAGGGACATTACGAATTTATGGTAATGCCATTTGCGCTGATATCGATGAGCATGAGAAA
CACTTGGGGATGGTTTTTGCTGTTTTGAGGGATAATCAACTCTTTGCAAACAGAAAGAAGTGTGTAATAGCCCACTCCAAGATCCAGTACTTGGGACATCAAATCTCCAA
AAGAGGAGTAGAAGCGGATGAGGAAAAAATTAAAAGCATGACAAAGTGGCAGCAACCAAAAGATGTGACCAGTCTGAGAGGATTTTTGGGCCTAACAGGATACTGTCGAA
GATTTGTTAAAGGGTATGGAGAGATTGCGGCACCTTTGTCGAAATTGTTGCAAAAGAATTCCTTCCAATGGATTGAAGAAGCCACACAGGCTTTCGAAACACTGAAACTT
GCTATGACTACTCTTCCGGTGCTGGCTTTACCTGATTGGACACTCCCCTTTATGGTGGAAACGGATGCTTCAGGAATCGGGTTAGGGGTGGTGCTATCTGAAAACGGCCA
TCCTGTAGCATTCTTCAGCCAAAAGTTATCATCAAGGGCTCAAACCAAATCCATATATGAAAGAGAGCTGATGGCAGTAGTCCTTGCAGTACAAAAATGGAGACACTATC
TACTGGGGAGGAAATTCACCATTATTTCAGATCAGAAGGCGTTAAAATTCTTGCTGGAGCAGAGGGAAGTGCAGCTGCAATTCCAGAAATGGTTAACTAATCTCCTCGGG
TATGATTTTGAAATACTATACCAACCGGGTCCTCAAAATAAAGTGGCGGACGCTTTATCAAGAAAGGAACAACTACCAGAATTGAACTCTTTGACAACGCAAGGGATTGT
TGACATGGAGATAGAAAAGGATGAAGAACTTCAGAAAATTATTAAGAAGTTGGAAATGAATCATGAGGAAACCAGCAAATATCAGTGGGAAAAAGGAAGATTATTATACA
AAGGTAGGGTGGTGCTTCCTAAAACATCCTCATTGATACCCAGCCTTCTTCACACCTTCCATGATTCGATTCTAGGGGGTCATTCCGGAGTTTTAAGGACATATAAACGA
ATGAGTGGAGAATTGCATTGGCAAGGCATGAAGACTGATGTAAAGAAATATGTGCAGCAGTGTGAGATCTGTCAGAGGAACAAGTTTGAAGCCACCAAACCAGTGGGAGT
TTTGCAGCCTCTGCCCATTCCAGAAAGAATTTTAGAAGACTGGACCATGGACTTTATAGAGGGCGTGGCTGTTTTGTTTATAGATAGGGTCGTGAGTAAGCATGGAATTC
CAAAATCCCTAATATCAGGCCGAGACAAAGTGTTCCTTAGCAACTTCTGGAAGGAATTGTTCGCTTCTATGGGAACCATCTTGAAGAGAAGCACAACCTTTCACCCGCAA
ACAGATGACAGACTGAAAGAATTCATTCCATGGGCTGAGCTATGGTATAATACCACCTTCCATGCCTCTACTAAGATCACTCCCTTTCAAGCTGTATATGGCATACCACC
CCCACCCCTGTTATCATACGGAGATCAAAAGACCCCAAATAATGAAGTGGAAACAATATTGAAAGAAAGGGACATGGCCATTAGTGCCTTGAAAGAAAACCTTTGTCTAG
CACATAATCGGATGAAGAAGATGGCTGACCTCAAAAGAAGGGAGCTCAAATTCAAGGAAGGTGATGAAGTGTTCTTGAAACTAAGACTCTATTGGCAGAGATCATTAGCA
AAGAGGAGGTGTGAAAAACTTGGTCCCAAATACTACGGACCCCGACCAGGGATATTCCAGCTCTATAGAGAGGAGTTGGGGTTGCCTCAGGTCTTGCTGTGCAGAGTTCA
TTTTCTTAAGGAAATGCTTCTTTTGCCTTCTCTTAGTACTGGAGACGAGAAAGTAATCGGTGGTCTGGCATGCTTGTTCTCAGAAGTTGGGCAAGCAGCACCATCCTTAA
TTGTAGATGCCAGTGCTGAAGCCCTTGCTCTTGCTGATGCTCTCTTGAGTTGTGTGGCTTTTCCAAGTGAAGATTGGGAGATTGCTGACTCAACATTACAATTTTGGTAT
TGTCATCTTCTTGCTAAAATTCTGAATTTCTTT
Protein sequenceShow/hide protein sequence
MLAVGVIRPSHSPYSNPVLLVKKKDGEWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDREDGFQDAQGTLRIYGNAICADIDEHEK
HLGMVFAVLRDNQLFANRKKCVIAHSKIQYLGHQISKRGVEADEEKIKSMTKWQQPKDVTSLRGFLGLTGYCRRFVKGYGEIAAPLSKLLQKNSFQWIEEATQAFETLKL
AMTTLPVLALPDWTLPFMVETDASGIGLGVVLSENGHPVAFFSQKLSSRAQTKSIYERELMAVVLAVQKWRHYLLGRKFTIISDQKALKFLLEQREVQLQFQKWLTNLLG
YDFEILYQPGPQNKVADALSRKEQLPELNSLTTQGIVDMEIEKDEELQKIIKKLEMNHEETSKYQWEKGRLLYKGRVVLPKTSSLIPSLLHTFHDSILGGHSGVLRTYKR
MSGELHWQGMKTDVKKYVQQCEICQRNKFEATKPVGVLQPLPIPERILEDWTMDFIEGVAVLFIDRVVSKHGIPKSLISGRDKVFLSNFWKELFASMGTILKRSTTFHPQ
TDDRLKEFIPWAELWYNTTFHASTKITPFQAVYGIPPPPLLSYGDQKTPNNEVETILKERDMAISALKENLCLAHNRMKKMADLKRRELKFKEGDEVFLKLRLYWQRSLA
KRRCEKLGPKYYGPRPGIFQLYREELGLPQVLLCRVHFLKEMLLLPSLSTGDEKVIGGLACLFSEVGQAAPSLIVDASAEALALADALLSCVAFPSEDWEIADSTLQFWY
CHLLAKILNFF