CuGenDBv2

CSPI04G10920 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G10920
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: Chr4:9233644..9235322
RNA-Seq Expression: CSPI04G10920
Synteny: CSPI04G10920
Gene Ontology terms: NA
InterPro domains:
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
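The genome location in this record is written as `chromosome:start..end`. A minimal sketch of parsing that string is shown here; the names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'Chr4:9233644..9235322' style string into a Location."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("Chr4:9233644..9235322")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval for CSPI04G10920 spans 1679 positions on Chr4.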


Homology
GenBank top hits | e value | % identity | Alignment
KAA0058816.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | 8.9e-152 | 53.53
Query:  MLFILN-EEGGNEGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGKAA----------------LVEERHILKEEGTLFGVTIGN
        MLFILN EE   EGE S+    E VE+ +L   E+  IE R ITS ++KGTMKL G+VKGK                  LV+ER I     T FG+TIG+
Subjt:  MLFILN-EEGGNEGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGKAA----------------LVEERHILKEEGTLFGVTIGN

Query:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF
         T CKG+G+C +VE++L  L ++ D L V LG++DVVLGMQWLDTTGTMK+HWPSLTMVF     ++ LKG P LI+AEC LKTL+KTWE +DQGFL  +
Subjt:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF

Query:  QHYETKLE------SEYEEEQEG--------------------LPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS
        Q YE + E      + +  ++EG                    LPPKR+IDH+ILTLPG K +NV PYKYGH QKEEIEKLV EML TG+IRPSHSP+SS
Subjt:  QHYETKLE------SEYEEEQEG--------------------LPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS

Query:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------
         VL VKKKDGGWRFC                                   LDLKSGYHQIRM+EEDIEKTAFRT                          
Subjt:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------

Query:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG
                              +DI EHEKHLGMVFA LRDNQL+ANRKKCV  HS+I YLGH ISK  VE D++K+KSM+ W +PKDVTGLRGFLGL G
Subjt:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG

Query:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVKLMLQES
        YYRRFVKGYGEIA PLTKLLQKN FKWDE AT AFE LK AM+T+PVLALPDW+LPFM++    +S
Subjt:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVKLMLQES

TYK14439.1 uncharacterized protein E5676_scaffold186G00980 [Cucumis melo var. makuwa] | 9.8e-159 | 56.07
Query:  MLFILNEEGGNEGENSK-ENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN
        M FI+NEE   E  +SK E TE  +ELK L+LTE  +IEL T+TS +SKGTMKL G V+ K                 AL EE  +  E+ T FG TIGN
Subjt:  MLFILNEEGGNEGENSK-ENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN

Query:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF
         T+CKGKGVC+RVELKL  +TIIADFLAVELG+VD VLGMQWLDTTGTM++HWPSLTM+F    +QI LKG P+LIKAEC LKTL+KTW+DDDQGFL  +
Subjt:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF

Query:  QHYETKLESEYEEEQE--------------------------GLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS
         + E   E +YE ++E                          GLPPKR IDH+ILT+P  + +NV PYKYGHVQKEEIE LV EML  G+IRPSHSPYSS
Subjt:  QHYETKLESEYEEEQE--------------------------GLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS

Query:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------
         VL VK+KDGGWRFC                                   LDLKSGYHQIRMKEEDIEKTAFRT                          
Subjt:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------

Query:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG
                               DI+EHEKHLGMVFA+LRDNQL+AN+KKCV  HSKIQYLGH ISK+ VE DEEKIKSM++W +P DV+ LRGFLGL G
Subjt:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG

Query:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK
        YYRRFVKGY +IATPLTKLLQKN FKW+EEA  AF KLK+AMTT+PVLALPDWTLPF ++
Subjt:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK

TYK14624.1 uncharacterized protein E5676_scaffold1275G00160 [Cucumis melo var. makuwa] | 4.0e-152 | 54.11
Query:  MLFILNEEGGN-EGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN
        M FI+NEE  N EG++ +E TE IVELK L+LTE   IEL+T+T FSSKGTMKL G ++ K                 +L  +  +  E+ T FG TIGN
Subjt:  MLFILNEEGGN-EGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN

Query:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF
         T+CKGKG+C+RVE+KL  +TIIADFLAVELGSVD VLGMQWLDT GTMK+HWPSLTM F    +QI LKG P LIKAEC L+TL+KTW++DDQGFL  +
Subjt:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF

Query:  QHYETKLESEYEEEQ--------------------------EGLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS
         + E ++E  Y+ ++                          +GLPPKR IDH+ILTLP  + +NV PYKYGHVQK EIE LV EML  G+IRPS SPYSS
Subjt:  QHYETKLESEYEEEQ--------------------------EGLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS

Query:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------
         VL VKKKDGGWRFC                                   LDLKSGYHQIRMKEEDIEKTAFRT                          
Subjt:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------

Query:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG
                              +D+ EHEKHLGM+FAVLRDNQL+AN KKCV  HS+IQYLGHQISK  VE D++KI+SM+NW +P DVT LRGFLGL G
Subjt:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG

Query:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK
        YYRRFVKGY  I TPLTKLLQKN FKW+EEA   F KLK+AMTT+PVLALPDW+LPF ++
Subjt:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK

TYK28905.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | 5.2e-152 | 54.46
Query:  MLFILNEEGGN-EGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN
        M FI+NEE  N EG N +E TEE VELK L+LTE   IEL+T+T  SSKGTMKL G ++ K                 +L  +  +  E+ T FG TIGN
Subjt:  MLFILNEEGGN-EGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN

Query:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF
         T+CKG+G+C+RVE+KL  +TIIADFLAVELGSVD VLGMQWLDTTGTMK+HWPSLTM F    KQI LKG P+LIKAEC L+TL+KTW++DDQGFL  +
Subjt:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF

Query:  QHYETKLESEYEEEQ--------------------------EGLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS
         + E + E  Y+ ++                          +GLPPKR IDH+I+T+P  + +NV PYKYGHVQK EIEKLVTEML  G+IRPS SPYSS
Subjt:  QHYETKLESEYEEEQ--------------------------EGLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS

Query:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------
         VL VKKKDGGW FC                                   LDLKSGYHQIRMKEEDIEKTAFRT                          
Subjt:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------

Query:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG
                              +DI EHEKHLGMVF  LRDNQL+AN KKCV  HSKIQYLGHQISK  VE DE+KI+SM+NW +P DVT LRGFLGL+G
Subjt:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG

Query:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK
        YYRRFV+GY  IAT LTKLLQKN FKW+EEA  AF KLK+AMTT+PVLALPDW LPF ++
Subjt:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | 1.2e-151 | 53.93
Query:  MLFILN-EEGGNEGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGKAA----------------LVEERHILKEEGTLFGVTIGN
        MLFILN EE   EGE S+    E VE+ +L   E+  IE R ITS ++KGTMKL G+VKGK                  LV+ER I     T FG+TIG+
Subjt:  MLFILN-EEGGNEGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGKAA----------------LVEERHILKEEGTLFGVTIGN

Query:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF
         T CKG+G+C +VE++L  L ++ D L V LG++DVVLGMQWLDTTGTMK+HWPSLTMVF     ++ LKG P LI+AEC LKTL+KTWE +DQGFL  +
Subjt:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF

Query:  QHYETKLE------SEYEEEQEG--------------------LPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS
        Q YE + E      + +  ++EG                    LPPKR+IDH+ILTLPG K +NV PYKYGH QKEEIEKLV EML TG+IRPSHSP+SS
Subjt:  QHYETKLE------SEYEEEQEG--------------------LPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS

Query:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------
         VL VKKKDGGWRFC                                   LDLKSGYHQIRM+EEDIEKTAFRT                          
Subjt:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------

Query:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG
                              +DI EHEKHLGMVFA LRDNQL+ANRKKCV  HS+I YLGH ISK  VE D++K+KSM+ W +PKDVTGLRGFLGL G
Subjt:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG

Query:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK
        YYRRFVKGYGEIA PLTKLLQKN FKWDE AT AFE LK AM+T+PVLALPDW+LPFM++
Subjt:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK

TrEMBL top hits | e value | % identity | Alignment
A0A5A7UXB4 Ty3/gypsy retrotransposon protein | 4.3e-152 | 53.53
Query:  MLFILN-EEGGNEGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGKAA----------------LVEERHILKEEGTLFGVTIGN
        MLFILN EE   EGE S+    E VE+ +L   E+  IE R ITS ++KGTMKL G+VKGK                  LV+ER I     T FG+TIG+
Subjt:  MLFILN-EEGGNEGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGKAA----------------LVEERHILKEEGTLFGVTIGN

Query:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF
         T CKG+G+C +VE++L  L ++ D L V LG++DVVLGMQWLDTTGTMK+HWPSLTMVF     ++ LKG P LI+AEC LKTL+KTWE +DQGFL  +
Subjt:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF

Query:  QHYETKLE------SEYEEEQEG--------------------LPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS
        Q YE + E      + +  ++EG                    LPPKR+IDH+ILTLPG K +NV PYKYGH QKEEIEKLV EML TG+IRPSHSP+SS
Subjt:  QHYETKLE------SEYEEEQEG--------------------LPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS

Query:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------
         VL VKKKDGGWRFC                                   LDLKSGYHQIRM+EEDIEKTAFRT                          
Subjt:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------

Query:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG
                              +DI EHEKHLGMVFA LRDNQL+ANRKKCV  HS+I YLGH ISK  VE D++K+KSM+ W +PKDVTGLRGFLGL G
Subjt:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG

Query:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVKLMLQES
        YYRRFVKGYGEIA PLTKLLQKN FKWDE AT AFE LK AM+T+PVLALPDW+LPFM++    +S
Subjt:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVKLMLQES

A0A5D3BBH7 Ty3/gypsy retrotransposon protein | 5.6e-152 | 53.93
Query:  MLFILN-EEGGNEGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGKAA----------------LVEERHILKEEGTLFGVTIGN
        MLFILN EE   EGE S+    E VE+ +L   E+  IE R ITS ++KGTMKL G+VKGK                  LV+ER I     T FG+TIG+
Subjt:  MLFILN-EEGGNEGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGKAA----------------LVEERHILKEEGTLFGVTIGN

Query:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF
         T CKG+G+C +VE++L  L ++ D L V LG++DVVLGMQWLDTTGTMK+HWPSLTMVF     ++ LKG P LI+AEC LKTL+KTWE +DQGFL  +
Subjt:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF

Query:  QHYETKLE------SEYEEEQEG--------------------LPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS
        Q YE + E      + +  ++EG                    LPPKR+IDH+ILTLPG K +NV PYKYGH QKEEIEKLV EML TG+IRPSHSP+SS
Subjt:  QHYETKLE------SEYEEEQEG--------------------LPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS

Query:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------
         VL VKKKDGGWRFC                                   LDLKSGYHQIRM+EEDIEKTAFRT                          
Subjt:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------

Query:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG
                              +DI EHEKHLGMVFA LRDNQL+ANRKKCV  HS+I YLGH ISK  VE D++K+KSM+ W +PKDVTGLRGFLGL G
Subjt:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG

Query:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK
        YYRRFVKGYGEIA PLTKLLQKN FKWDE AT AFE LK AM+T+PVLALPDW+LPFM++
Subjt:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK

A0A5D3CUL0 Reverse transcriptase domain-containing protein | 1.9e-152 | 54.11
Query:  MLFILNEEGGN-EGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN
        M FI+NEE  N EG++ +E TE IVELK L+LTE   IEL+T+T FSSKGTMKL G ++ K                 +L  +  +  E+ T FG TIGN
Subjt:  MLFILNEEGGN-EGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN

Query:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF
         T+CKGKG+C+RVE+KL  +TIIADFLAVELGSVD VLGMQWLDT GTMK+HWPSLTM F    +QI LKG P LIKAEC L+TL+KTW++DDQGFL  +
Subjt:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF

Query:  QHYETKLESEYEEEQ--------------------------EGLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS
         + E ++E  Y+ ++                          +GLPPKR IDH+ILTLP  + +NV PYKYGHVQK EIE LV EML  G+IRPS SPYSS
Subjt:  QHYETKLESEYEEEQ--------------------------EGLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS

Query:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------
         VL VKKKDGGWRFC                                   LDLKSGYHQIRMKEEDIEKTAFRT                          
Subjt:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------

Query:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG
                              +D+ EHEKHLGM+FAVLRDNQL+AN KKCV  HS+IQYLGHQISK  VE D++KI+SM+NW +P DVT LRGFLGL G
Subjt:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG

Query:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK
        YYRRFVKGY  I TPLTKLLQKN FKW+EEA   F KLK+AMTT+PVLALPDW+LPF ++
Subjt:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK

A0A5D3CW02 Uncharacterized protein | 4.8e-159 | 56.07
Query:  MLFILNEEGGNEGENSK-ENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN
        M FI+NEE   E  +SK E TE  +ELK L+LTE  +IEL T+TS +SKGTMKL G V+ K                 AL EE  +  E+ T FG TIGN
Subjt:  MLFILNEEGGNEGENSK-ENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN

Query:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF
         T+CKGKGVC+RVELKL  +TIIADFLAVELG+VD VLGMQWLDTTGTM++HWPSLTM+F    +QI LKG P+LIKAEC LKTL+KTW+DDDQGFL  +
Subjt:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF

Query:  QHYETKLESEYEEEQE--------------------------GLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS
         + E   E +YE ++E                          GLPPKR IDH+ILT+P  + +NV PYKYGHVQKEEIE LV EML  G+IRPSHSPYSS
Subjt:  QHYETKLESEYEEEQE--------------------------GLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS

Query:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------
         VL VK+KDGGWRFC                                   LDLKSGYHQIRMKEEDIEKTAFRT                          
Subjt:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------

Query:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG
                               DI+EHEKHLGMVFA+LRDNQL+AN+KKCV  HSKIQYLGH ISK+ VE DEEKIKSM++W +P DV+ LRGFLGL G
Subjt:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG

Query:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK
        YYRRFVKGY +IATPLTKLLQKN FKW+EEA  AF KLK+AMTT+PVLALPDWTLPF ++
Subjt:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK

A0A5D3DZZ7 Ty3/gypsy retrotransposon protein | 2.5e-152 | 54.46
Query:  MLFILNEEGGN-EGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN
        M FI+NEE  N EG N +E TEE VELK L+LTE   IEL+T+T  SSKGTMKL G ++ K                 +L  +  +  E+ T FG TIGN
Subjt:  MLFILNEEGGN-EGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGK----------------AALVEERHILKEEGTLFGVTIGN

Query:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF
         T+CKG+G+C+RVE+KL  +TIIADFLAVELGSVD VLGMQWLDTTGTMK+HWPSLTM F    KQI LKG P+LIKAEC L+TL+KTW++DDQGFL  +
Subjt:  DTKCKGKGVCKRVELKLNTLTIIADFLAVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGF

Query:  QHYETKLESEYEEEQ--------------------------EGLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS
         + E + E  Y+ ++                          +GLPPKR IDH+I+T+P  + +NV PYKYGHVQK EIEKLVTEML  G+IRPS SPYSS
Subjt:  QHYETKLESEYEEEQ--------------------------EGLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSS

Query:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------
         VL VKKKDGGW FC                                   LDLKSGYHQIRMKEEDIEKTAFRT                          
Subjt:  SVLFVKKKDGGWRFC-----------------------------------LDLKSGYHQIRMKEEDIEKTAFRT--------------------------

Query:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG
                              +DI EHEKHLGMVF  LRDNQL+AN KKCV  HSKIQYLGHQISK  VE DE+KI+SM+NW +P DVT LRGFLGL+G
Subjt:  ----------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQISKR-VETDEEKIKSMINWHQPKDVTGLRGFLGLMG

Query:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK
        YYRRFV+GY  IAT LTKLLQKN FKW+EEA  AF KLK+AMTT+PVLALPDW LPF ++
Subjt:  YYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVK

SwissProt top hits | e value | % identity | Alignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | 2.0e-21 | 26.85
Query:  YKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSSSVLFVKKK---DGGWRF-------------------------------------CLDLKSGYHQIRM
        Y Y    ++E+E  + +ML+ G+IR S+SPY+S +  V KK    G  +F                                      +DL  G+HQI M
Subjt:  YKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSSSVLFVKKK---DGGWRF-------------------------------------CLDLKSGYHQIRM

Query:  KEEDIEKTAFRT------------------------------------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLG
          E + KTAF T                                                  ++EH + LG+VF  L    L     KC     +  +LG
Subjt:  KEEDIEKTAFRT------------------------------------------------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLG

Query:  HQIS-KRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIATPLTKLLQKNYFKWD---EEATHAFEKLKLAMTTLPVLALPDWTLPF
        H ++   ++ + EKI+++  +  P     ++ FLGL GYYR+F+  + +IA P+TK L+KN  K D    E   AF+KLK  ++  P+L +PD+T  F
Subjt:  HQIS-KRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIATPLTKLLQKNYFKWD---EEATHAFEKLKLAMTTLPVLALPDWTLPF

P20825 Retrovirus-related Pol polyprotein from transposon 297 | 7.6e-21 | 26.44
Query:  QGFLHGFQHYETKLESEYEEEQEGLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSSSVLFVKKK---DGGWRF--
        +G L+ F++ E K       E E L     I H +L    +  +    Y      + E+E  V EML+ G+IR S+SPY+S    V KK    G  ++  
Subjt:  QGFLHGFQHYETKLESEYEEEQEGLPPKRAIDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSSSVLFVKKK---DGGWRF--

Query:  -----------------------------------CLDLKSGYHQIRMKEEDIEKTAFRT----------------------------------------
                                            +DL  G+HQI M EE I KTAF T                                        
Subjt:  -----------------------------------CLDLKSGYHQIRMKEEDIEKTAFRT----------------------------------------

Query:  --------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQIS-KRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIAT
                  + EH   + +VF  L D  L     KC     +  +LGH ++   ++ +  K+K+++++  P     +R FLGL GYYR+F+  Y +IA 
Subjt:  --------ADINEHEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQIS-KRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIAT

Query:  PLTKLLQKNYFKWDE---EATHAFEKLKLAMTTLPVLALPDWTLPFMV
        P+T  L+K   K D    E   AFEKLK  +   P+L LPD+   F++
Subjt:  PLTKLLQKNYFKWDE---EATHAFEKLKLAMTTLPVLALPDWTLPFMV

P92523 Uncharacterized mitochondrial protein AtMg00860 | 4.6e-34 | 54.14
Query:  HLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQ---ISKRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIATPLTKLLQKNYFKWD
        HLGMV  +   +Q +ANRKKC     +I YLGH+     + V  D  K+++M+ W +PK+ T LRGFLGL GYYRRFVK YG+I  PLT+LL+KN  KW 
Subjt:  HLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQ---ISKRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIATPLTKLLQKNYFKWD

Query:  EEATHAFEKLKLAMTTLPVLALPDWTLPFMVKL
        E A  AF+ LK A+TTLPVLALPD  LPF+ ++
Subjt:  EEATHAFEKLKLAMTTLPVLALPDWTLPFMVKL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 5.6e-24 | 28.66
Query:  LESEYEE-EQEGLPPKRA------IDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSSSVLFVKKKDGGWRFC----------
        L+ +Y E  +  LPP+ A      + H I   PG +   + PY      ++EI K+V ++L    I PS SP SS V+ V KKDG +R C          
Subjt:  LESEYEE-EQEGLPPKRA------IDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSSSVLFVKKKDGGWRFC----------

Query:  -------------------------LDLKSGYHQIRMKEEDIEKTAFRTA---------------------------------------DI-------NE
                                 LDL SGYHQI M+ +D  KTAF T                                        DI        E
Subjt:  -------------------------LDLKSGYHQIRMKEEDIEKTAFRTA---------------------------------------DI-------NE

Query:  HEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQIS-KRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIATPLTKLLQKNYFKW
        H KHL  V   L++  L   +KKC     + ++LG+ I  +++   + K  ++ ++  PK V   + FLG++ YYRRF+    +IA P+ +L   +  +W
Subjt:  HEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQIS-KRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIATPLTKLLQKNYFKW

Query:  DEEATHAFEKLKLAMTTLPVL
         E+   A EKLK A+   PVL
Subjt:  DEEATHAFEKLKLAMTTLPVL

Q99315 Transposon Ty3-G Gag-Pol polyprotein | 2.8e-23 | 28.35
Query:  LESEYEE-EQEGLPPKRA------IDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSSSVLFVKKKDGGWRFC----------
        L+ +Y E  +  LPP+ A      + H I   PG +   + PY      ++EI K+V ++L    I PS SP SS V+ V KKDG +R C          
Subjt:  LESEYEE-EQEGLPPKRA------IDHQILTLPGHKSVNVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSSSVLFVKKKDGGWRFC----------

Query:  -------------------------LDLKSGYHQIRMKEEDIEKTAFRTA---------------------------------------DI-------NE
                                 LDL SGYHQI M+ +D  KTAF T                                        DI        E
Subjt:  -------------------------LDLKSGYHQIRMKEEDIEKTAFRTA---------------------------------------DI-------NE

Query:  HEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQIS-KRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIATPLTKLLQKNYFKW
        H KHL  V   L++  L   +KKC     + ++LG+ I  +++   + K  ++ ++  PK V   + FLG++ YYRRF+    +IA P+ +L   +  +W
Subjt:  HEKHLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQIS-KRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIATPLTKLLQKNYFKW

Query:  DEEATHAFEKLKLAMTTLPVL
         E+   A +KLK A+   PVL
Subjt:  DEEATHAFEKLKLAMTTLPVL

Arabidopsis top hits | e value | % identity | Alignment
AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding | 3.0e-04 | 28.74
Query:  HILKEEGTLFGVTIGNDTKCKGKGVCKRVELKLNTLTIIADFLAVEL--GSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITL
        +IL++E   F   + +D K      C+ + L++N + I+ D+   +L    VDV+LG +WL   G  +V+W + +  F   +  +TL
Subjt:  HILKEEGTLFGVTIGNDTKCKGKGVCKRVELKLNTLTIIADFLAVEL--GSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITL

ATMG00850.1 DNA/RNA polymerases superfamily protein | 3.5e-05 | 50
Query:  VQKEEIEKLVTEMLSTGVIRPSHSPYSSSVLFVKKKDGGW
        +++  ++  + EML   +I+PS SPYSS VL V+KKDGGW
Subjt:  VQKEEIEKLVTEMLSTGVIRPSHSPYSSSVLFVKKKDGGW

ATMG00860.1 DNA/RNA polymerases superfamily protein | 3.3e-35 | 54.14
Query:  HLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQ---ISKRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIATPLTKLLQKNYFKWD
        HLGMV  +   +Q +ANRKKC     +I YLGH+     + V  D  K+++M+ W +PK+ T LRGFLGL GYYRRFVK YG+I  PLT+LL+KN  KW 
Subjt:  HLGMVFAVLRDNQLFANRKKCVITHSKIQYLGHQ---ISKRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIATPLTKLLQKNYFKWD

Query:  EEATHAFEKLKLAMTTLPVLALPDWTLPFMVKL
        E A  AF+ LK A+TTLPVLALPD  LPF+ ++
Subjt:  EEATHAFEKLKLAMTTLPVLALPDWTLPFMVKL


Sequences
CDS sequence
ATGCTCTTCATTCTCAATGAAGAAGGTGGGAATGAAGGAGAAAATTCGAAAGAAAATACAGAGGAGATAGTGGAATTGAAAAAACTAGATCTTACAGAAAAGGCGAAAAT
TGAGCTACGAACTATCACTAGCTTTTCATCAAAAGGGACCATGAAACTGATGGGGATGGTGAAGGGGAAGGCAGCTCTAGTAGAAGAAAGACACATCTTAAAAGAAGAGG
GCACTCTGTTCGGGGTCACGATTGGCAACGACACCAAGTGCAAAGGAAAAGGGGTGTGTAAGAGAGTGGAACTAAAGCTGAATACGTTGACTATTATTGCTGATTTCTTA
GCTGTGGAACTGGGAAGTGTAGATGTGGTATTGGGAATGCAGTGGCTAGATACTACAGGAACTATGAAGGTCCATTGGCCTTCACTAACCATGGTATTTCGGGTGGGAGA
GAAACAAATCACTCTCAAGGGATATCCAACCCTCATTAAAGCTGAATGCTTATTGAAAACATTGAAAAAAACATGGGAAGATGACGACCAAGGGTTCCTCCATGGCTTCC
AACATTATGAAACTAAATTAGAAAGTGAATATGAAGAGGAACAGGAAGGATTGCCCCCCAAGAGAGCCATTGACCATCAAATATTAACACTCCCCGGACATAAATCGGTT
AACGTTCACCCATATAAATACGGTCATGTGCAGAAAGAAGAAATTGAAAAGTTAGTTACAGAAATGCTTTCAACAGGAGTTATTAGGCCAAGCCATAGCCCTTATTCCAG
CTCAGTCCTATTCGTAAAGAAGAAAGATGGGGGATGGAGGTTCTGCCTTGATCTGAAATCGGGATATCATCAAATCCGAATGAAGGAGGAAGACATAGAGAAGACTGCCT
TCAGGACTGCTGATATCAACGAGCATGAGAAACACTTAGGAATGGTTTTTGCTGTTTTGAGGGACAATCAACTATTTGCAAACAGAAAGAAATGTGTAATAACTCATTCT
AAGATCCAATACTTAGGGCATCAAATTTCCAAAAGAGTAGAAACGGATGAGGAGAAAATTAAAAGCATGATAAATTGGCATCAGCCAAAAGACGTGACCGGTCTGCGAGG
ATTTTTGGGCCTAATGGGGTACTATCGAAGATTTGTAAAAGGATACGGAGAGATTGCAACACCGTTGACAAAATTGTTGCAAAAGAATTACTTCAAATGGGATGAAGAAG
CCACACATGCTTTCGAGAAACTAAAACTTGCTATGACTACTCTCCCTGTGTTGGCTTTACCTGATTGGACTCTCCCTTTTATGGTGAAATTGATGCTTCAGGAATCGGCT
TAG
mRNA sequence
ATGCTCTTCATTCTCAATGAAGAAGGTGGGAATGAAGGAGAAAATTCGAAAGAAAATACAGAGGAGATAGTGGAATTGAAAAAACTAGATCTTACAGAAAAGGCGAAAAT
TGAGCTACGAACTATCACTAGCTTTTCATCAAAAGGGACCATGAAACTGATGGGGATGGTGAAGGGGAAGGCAGCTCTAGTAGAAGAAAGACACATCTTAAAAGAAGAGG
GCACTCTGTTCGGGGTCACGATTGGCAACGACACCAAGTGCAAAGGAAAAGGGGTGTGTAAGAGAGTGGAACTAAAGCTGAATACGTTGACTATTATTGCTGATTTCTTA
GCTGTGGAACTGGGAAGTGTAGATGTGGTATTGGGAATGCAGTGGCTAGATACTACAGGAACTATGAAGGTCCATTGGCCTTCACTAACCATGGTATTTCGGGTGGGAGA
GAAACAAATCACTCTCAAGGGATATCCAACCCTCATTAAAGCTGAATGCTTATTGAAAACATTGAAAAAAACATGGGAAGATGACGACCAAGGGTTCCTCCATGGCTTCC
AACATTATGAAACTAAATTAGAAAGTGAATATGAAGAGGAACAGGAAGGATTGCCCCCCAAGAGAGCCATTGACCATCAAATATTAACACTCCCCGGACATAAATCGGTT
AACGTTCACCCATATAAATACGGTCATGTGCAGAAAGAAGAAATTGAAAAGTTAGTTACAGAAATGCTTTCAACAGGAGTTATTAGGCCAAGCCATAGCCCTTATTCCAG
CTCAGTCCTATTCGTAAAGAAGAAAGATGGGGGATGGAGGTTCTGCCTTGATCTGAAATCGGGATATCATCAAATCCGAATGAAGGAGGAAGACATAGAGAAGACTGCCT
TCAGGACTGCTGATATCAACGAGCATGAGAAACACTTAGGAATGGTTTTTGCTGTTTTGAGGGACAATCAACTATTTGCAAACAGAAAGAAATGTGTAATAACTCATTCT
AAGATCCAATACTTAGGGCATCAAATTTCCAAAAGAGTAGAAACGGATGAGGAGAAAATTAAAAGCATGATAAATTGGCATCAGCCAAAAGACGTGACCGGTCTGCGAGG
ATTTTTGGGCCTAATGGGGTACTATCGAAGATTTGTAAAAGGATACGGAGAGATTGCAACACCGTTGACAAAATTGTTGCAAAAGAATTACTTCAAATGGGATGAAGAAG
CCACACATGCTTTCGAGAAACTAAAACTTGCTATGACTACTCTCCCTGTGTTGGCTTTACCTGATTGGACTCTCCCTTTTATGGTGAAATTGATGCTTCAGGAATCGGCT
TAG
Protein sequence
MLFILNEEGGNEGENSKENTEEIVELKKLDLTEKAKIELRTITSFSSKGTMKLMGMVKGKAALVEERHILKEEGTLFGVTIGNDTKCKGKGVCKRVELKLNTLTIIADFL
AVELGSVDVVLGMQWLDTTGTMKVHWPSLTMVFRVGEKQITLKGYPTLIKAECLLKTLKKTWEDDDQGFLHGFQHYETKLESEYEEEQEGLPPKRAIDHQILTLPGHKSV
NVHPYKYGHVQKEEIEKLVTEMLSTGVIRPSHSPYSSSVLFVKKKDGGWRFCLDLKSGYHQIRMKEEDIEKTAFRTADINEHEKHLGMVFAVLRDNQLFANRKKCVITHS
KIQYLGHQISKRVETDEEKIKSMINWHQPKDVTGLRGFLGLMGYYRRFVKGYGEIATPLTKLLQKNYFKWDEEATHAFEKLKLAMTTLPVLALPDWTLPFMVKLMLQESA
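
The CDS and protein sequences listed in this record can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and reading frame 0; the function name `translate` is hypothetical and is not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A/C/G/T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides and last 12 nucleotides of the CDS shown in this record.
print(translate("ATGCTCTTCATTCTCAATGAAGAAGGTGGG"))
print(translate("GAATCGGCTTAG"))
```

To compare the full record, paste the complete CDS (line breaks included) into `translate` and compare the result with the protein sequence after removing its line breaks.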