; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G27060 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G27060
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr3:24478117..24480538
RNA-Seq ExpressionCSPI03G27060
SyntenyCSPI03G27060
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8637598.1 hypothetical protein CSA_022681 [Cucumis sativus]1.5e-28181.27Show/hide
Query:  MQWLDSTGTMKVHWPSLTMTFWMKGRRIILK----------------------------------VNYEGESETEVELKGKEEGLPMVQRLLEQYADVFR
        MQWLDSTGTMKVHWPSLTMTFWMKGRRIILK                                  VNYEGESETE ELKGKEEGLPMVQRLLEQYADVFR
Subjt:  MQWLDSTGTMKVHWPSLTMTFWMKGRRIILK----------------------------------VNYEGESETEVELKGKEEGLPMVQRLLEQYADVFR

Query:  LPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELL
        LPTGLPPRRAIDHRILTVADQKPINVRPYKY HVQKEEI KLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELL
Subjt:  LPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELL

Query:  DELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAIL
        DELHGAT FSKLDL SGYHQIRMREEDVEKTAF THEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAIL
Subjt:  DELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAIL

Query:  RDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLK
        RDHELFANRSKCVIAHSQVQYLGHLIS+RGVEADE KIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQ                  
Subjt:  RDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLK

Query:  LAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQ----
                                  KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQ    
Subjt:  LAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQ----

Query:  ----------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDIEIIEKE
                                    DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDIEIIEKE
Subjt:  ----------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDIEIIEKE

Query:  VEMDQELQKIIAELKGEVDQGGKYQWNNGR
        VEMDQELQKIIAELKGEVDQGGKYQWNNGR
Subjt:  VEMDQELQKIIAELKGEVDQGGKYQWNNGR

TYJ99303.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.0e-26162.91Show/hide
Query:  EEDHKREEDTEDEA-SEVIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG
        EE+   E D+++E   E +EL  LEL  +  IEL+ +  ++SKGTMK KG +  +EVV+LIDSGAT+NFI   L   L+L +DP T FG TIGNG +C G
Subjt:  EEDHKREEDTEDEA-SEVIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG

Query:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK------------------------------VNYEGE
         GIC+RV+VKL E+TI+ADFLAVELG+VD VLGMQWLD+TGTMK+HWPSLTM+FW  GR+I+LK                               N + E
Subjt:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK------------------------------VNYEGE

Query:  SETEVELKGKEEG----LPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK
        ++   +   KEEG    +PM++ LL+QY D+F  P GLPP+R IDHRILT+ DQKPINVRPYKY H+QK EI KLV EMLQ GVIRPSRSPYSSPVLLVK
Subjt:  SETEVELKGKEEG----LPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK

Query:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF
        KK+GGWRFCVDYRKLNQ T++DKFPIPVIEELLDEL+GA VFSKLDL SGYHQIRM+EED+EKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMN+VFKPF
Subjt:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF

Query:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV
        LRRCVLVFFYDILVYS DI EH KHLGMVFA+LRD++L+AN  KCV AHS++QYLGH IS  GVEADE KIRSMVNWPRP D+T LRGFL LTGYYRRFV
Subjt:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV

Query:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD
        K Y  IA PLTKLLQKNAF WNE+A                            ET                AF +LK+AMTT+PVLALPDWS PFTIETD
Subjt:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD

Query:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ
        ASG GLGAVLSQ GHPIAF+SQKLS RAQ                                DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPG  
Subjt:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ

Query:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNGR
        NK ADALSR +  +EL+ MTT+GIVDI ++ +EV+ D+ LQKI+A+LK E +  GK++W NGR
Subjt:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNGR

TYK14439.1 uncharacterized protein E5676_scaffold186G00980 [Cucumis melo var. makuwa]5.1e-26663.25Show/hide
Query:  EEDHKREEDTEDEASE-VIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG
        EE+   E  ++DE +E  +EL +LEL  D  IEL  +  +TSKGTMK KG V  +E+V+LIDSGAT+NFI Q L ++LQ+ ++  T+FG TIGNG +C+G
Subjt:  EEDHKREEDTEDEASE-VIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG

Query:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK----------------------------------VN
         G+C+RV++KLKE+TI+ADFLAVELGTVD VLGMQWLD+TGTM++HWPSLTM FW +GR+I+LK                                  ++
Subjt:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK----------------------------------VN

Query:  YEGESETEVELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK
         E + ET+ E+KG E  +PM++ LL+QYAD+F  P GLPP+R IDHRILT+ DQ+PINVRPYKY HVQKEEI  LV EMLQ G+IRPS SPYSSPVLLVK
Subjt:  YEGESETEVELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK

Query:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF
        +KDGGWRFCVDYRKLNQ TV+DKFPIPVIEELLDELHGA VFSKLDL SGYHQIRM+EED+EKTAFRTHEGHYEFLVMPFGLTNA ATFQSLMN+VFKPF
Subjt:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF

Query:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV
        LRRCVLVF YDILVYS DI EH KHLGMVFA+LRD++L+AN+ KCV AHS++QYLGH IS +GVEADE KI+SMV+WPRP D++ LRGFLGLTGYYRRFV
Subjt:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV

Query:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD
        K Y +IA PLTKLLQKNAF W EEA                            ET                AF +LK+AMTT+PVLALPDW+ PFTIETD
Subjt:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD

Query:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ
        ASG GLGAVLSQ GHPIAFFSQKLS RAQ                                DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGL 
Subjt:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ

Query:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNG
        NK ADALSR D  VELN MTT+GIVD+  + +EVE D+ELQ+II +LK E + GGK++W NG
Subjt:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNG

XP_031742100.1 uncharacterized protein LOC116404055 [Cucumis sativus]6.9e-27981.05Show/hide
Query:  EEDHKREEDTEDEASEVIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEGS
        EEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI GVTSKGTMK KGH                            LSIDPGTRFGVTIGN NQCEGS
Subjt:  EEDHKREEDTEDEASEVIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEGS

Query:  GICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK----------------------------------VNY
        GICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK                                  VNY
Subjt:  GICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK----------------------------------VNY

Query:  EGESETEVELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKK
        EGESETE ELKG+EEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKY HVQKEEI KLVLEMLQAGVIRPSRSPYSSPVLLVK 
Subjt:  EGESETEVELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKK

Query:  KDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFL
        KDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGAT FSKLDL SGYHQIRMREEDVEKTAF THEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFL
Subjt:  KDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFL

Query:  RRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVK
        RRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLIS+RGVEADE KIRSMVNWPRPKDITGLRGFLGLTGYYRRFVK
Subjt:  RRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVK

Query:  SYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDA
        SYGEIAAPLTKLLQ                                            KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDA
Subjt:  SYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDA

Query:  SGVGLGAVLSQDGHPIAFFSQKLSPRAQ
        SGVGLGAVLSQDGHPIAFFSQKLSPRAQ
Subjt:  SGVGLGAVLSQDGHPIAFFSQKLSPRAQ

XP_031745419.1 uncharacterized protein LOC116405630 [Cucumis sativus]2.8e-29676.37Show/hide
Query:  EEDHKREEDTEDEASEVIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEGS
        EEDHKREEDTEDEASEVIELNHLELNMDNPIELRLI GVTSKGTMK KGH                            LSIDPGTRFGVTIGN NQCEGS
Subjt:  EEDHKREEDTEDEASEVIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEGS

Query:  GICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKVNYEGESETEVELKGKEEGLPMVQRLLEQYADVFRLP
        GICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKV                                                        
Subjt:  GICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKVNYEGESETEVELKGKEEGLPMVQRLLEQYADVFRLP

Query:  TGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDE
         GLPPRRAIDHRILTVADQKPINVRPYKY HVQKEEI KLVLEMLQAGVIRPSRSPYSSPVLLVK KDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDE
Subjt:  TGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDE

Query:  LHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRD
        LHGAT FSKLDL SGYHQIRMREEDVEKTAF THEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRD
Subjt:  LHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRD

Query:  HELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLA
        HELFANRSKCVIAHSQVQYLGHLIS+RGVEADE KIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQ                    
Subjt:  HELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLA

Query:  MTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQ------
                                KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQ      
Subjt:  MTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQ------

Query:  --------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVE
                                  DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVE
Subjt:  --------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVE

Query:  MDQELQKIIAELKGEVDQGGKYQWNNGR
        MDQELQKIIAELKGEVDQGGKYQWNNGR
Subjt:  MDQELQKIIAELKGEVDQGGKYQWNNGR

TrEMBL top hitse value%identityAlignment
A0A5A7UM77 Ty3/gypsy retrotransposon protein4.1e-26162.91Show/hide
Query:  EEDHKREEDTEDEASE-VIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG
        EE+   E D+++E  E  +EL  LEL  +  IEL+ +  ++SKGTMK KG +  +EVV+LIDSGAT+NFI   L   L+L +DP T FG TIGNG +C G
Subjt:  EEDHKREEDTEDEASE-VIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG

Query:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK------------------------------VNYEGE
         GIC+RV+VKL E+TI+ADFLAVELG+VD VLGMQWLD+TGTMK+HWPSLTM+FW  GR+I+LK                               N + E
Subjt:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK------------------------------VNYEGE

Query:  SETEVELKGKEEG----LPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK
        ++   +   KEEG    +PM++ LL+QY D+F  P GLPP+R IDHRILT+ DQKPINVRPYKY H+QK EI KLV EMLQ GVIRPSRSPYSSPVLLVK
Subjt:  SETEVELKGKEEG----LPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK

Query:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF
        KK+GGWRFCVDYRKLNQ T++DKFPIPVIEELLDEL+GA VFSKLDL SGYHQIRM+EED+EKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMN+VFKPF
Subjt:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF

Query:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV
        LRRCVLVFFYDILVYS DI EH KHLGMVFA+LRD++L+AN  KCV AHS++QYLGH IS  GVEADE KIRSMVNWPRP D+T LRGFL LTGYYRRFV
Subjt:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV

Query:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD
        K Y  IA PLTKLLQKNAF WNE+A                            ET                AF +LK+AMTT+PVLALPDWS PFTIETD
Subjt:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD

Query:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ
        ASG GLGAVLSQ GHPIAF+SQKLS RAQ                                DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPG  
Subjt:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ

Query:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNGR
        NK ADALSR +  +EL+ MTT+GIVDI ++ +EV+ D+ LQKI+A+LK E +  GK++W NGR
Subjt:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNGR

A0A5A7UYM1 Ty3/gypsy retrotransposon protein5.4e-26163.04Show/hide
Query:  EEDHKREEDTEDEA-SEVIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG
        EE+   E D+++E   E +EL  LEL  +  IEL+ +  ++SKGTMK KG +  +EVV+LIDSGAT+NFI   L  +L+L +DP T FG TIGNG +C G
Subjt:  EEDHKREEDTEDEA-SEVIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG

Query:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK------------------------------VNYEGE
         GIC+RV+VKL E+TI+ADFLAVELG+VD VLGMQWLD+TGTMK++WPSLTM+FW  GR+I+LK                               N E E
Subjt:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK------------------------------VNYEGE

Query:  SETEVELKGKEEG----LPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK
        ++   +   KEEG    +PM++ LL+QY D+F  P GLPP+R IDHRILT+ DQKPINVRPYKY H+QK EI KLV EMLQ GVIRPSRSPYSSPVLLVK
Subjt:  SETEVELKGKEEG----LPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK

Query:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF
        KKDGGWRFCVDYRKLNQ T++DKFPIPVIEELLDEL+GA VFSKLDL SGYHQIRM+EEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMN+VFKPF
Subjt:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF

Query:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV
        LRRCVLVFF DILVYS DI E  KHLGMVFA+LRD++L+AN  KCV AHS++QYLGH IS  GVEADE KIRSMVNWPRP D+T LRGFLGLTGYYRRFV
Subjt:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV

Query:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD
        K Y  IA PLTKLLQKNAF WNE+A                            ET                AF +LK+AMTT+PVLALPDWS PFTIETD
Subjt:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD

Query:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ
        ASG GLGAVLSQ GHPIAF+SQKLS RAQ                                DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPG  
Subjt:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ

Query:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNGR
        NK ADALSR +  +EL+ MTT+GIVDI ++ +EV+ D+ LQKI+A+LK E +  GK++W NGR
Subjt:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNGR

A0A5D3BJ50 Ty3/gypsy retrotransposon protein2.4e-26162.91Show/hide
Query:  EEDHKREEDTEDEA-SEVIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG
        EE+   E D+++E   E +EL  LEL  +  IEL+ +  ++SKGTMK KG +  +EVV+LIDSGAT+NFI   L   L+L +DP T FG TIGNG +C G
Subjt:  EEDHKREEDTEDEA-SEVIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG

Query:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK------------------------------VNYEGE
         GIC+RV+VKL E+TI+ADFLAVELG+VD VLGMQWLD+TGTMK+HWPSLTM+FW  GR+I+LK                               N + E
Subjt:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK------------------------------VNYEGE

Query:  SETEVELKGKEEG----LPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK
        ++   +   KEEG    +PM++ LL+QY D+F  P GLPP+R IDHRILT+ DQKPINVRPYKY H+QK EI KLV EMLQ GVIRPSRSPYSSPVLLVK
Subjt:  SETEVELKGKEEG----LPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK

Query:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF
        KK+GGWRFCVDYRKLNQ T++DKFPIPVIEELLDEL+GA VFSKLDL SGYHQIRM+EED+EKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMN+VFKPF
Subjt:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF

Query:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV
        LRRCVLVFFYDILVYS DI EH KHLGMVFA+LRD++L+AN  KCV AHS++QYLGH IS  GVEADE KIRSMVNWPRP D+T LRGFL LTGYYRRFV
Subjt:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV

Query:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD
        K Y  IA PLTKLLQKNAF WNE+A                            ET                AF +LK+AMTT+PVLALPDWS PFTIETD
Subjt:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD

Query:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ
        ASG GLGAVLSQ GHPIAF+SQKLS RAQ                                DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPG  
Subjt:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ

Query:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNGR
        NK ADALSR +  +EL+ MTT+GIVDI ++ +EV+ D+ LQKI+A+LK E +  GK++W NGR
Subjt:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNGR

A0A5D3CW02 Uncharacterized protein2.5e-26663.25Show/hide
Query:  EEDHKREEDTEDEASE-VIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG
        EE+   E  ++DE +E  +EL +LEL  D  IEL  +  +TSKGTMK KG V  +E+V+LIDSGAT+NFI Q L ++LQ+ ++  T+FG TIGNG +C+G
Subjt:  EEDHKREEDTEDEASE-VIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG

Query:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK----------------------------------VN
         G+C+RV++KLKE+TI+ADFLAVELGTVD VLGMQWLD+TGTM++HWPSLTM FW +GR+I+LK                                  ++
Subjt:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK----------------------------------VN

Query:  YEGESETEVELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK
         E + ET+ E+KG E  +PM++ LL+QYAD+F  P GLPP+R IDHRILT+ DQ+PINVRPYKY HVQKEEI  LV EMLQ G+IRPS SPYSSPVLLVK
Subjt:  YEGESETEVELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK

Query:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF
        +KDGGWRFCVDYRKLNQ TV+DKFPIPVIEELLDELHGA VFSKLDL SGYHQIRM+EED+EKTAFRTHEGHYEFLVMPFGLTNA ATFQSLMN+VFKPF
Subjt:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF

Query:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV
        LRRCVLVF YDILVYS DI EH KHLGMVFA+LRD++L+AN+ KCV AHS++QYLGH IS +GVEADE KI+SMV+WPRP D++ LRGFLGLTGYYRRFV
Subjt:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV

Query:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD
        K Y +IA PLTKLLQKNAF W EEA                            ET                AF +LK+AMTT+PVLALPDW+ PFTIETD
Subjt:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD

Query:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ
        ASG GLGAVLSQ GHPIAFFSQKLS RAQ                                DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGL 
Subjt:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQ--------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQ

Query:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNG
        NK ADALSR D  VELN MTT+GIVD+  + +EVE D+ELQ+II +LK E + GGK++W NG
Subjt:  NKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNG

A0A5D3E0Q7 Transposon Ty3-G Gag-Pol polyprotein5.4e-26164.24Show/hide
Query:  EEDHKREEDTEDEASE-VIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG
        EE+   E D+ +E +E ++EL  LEL  D  IEL+ +   +SKGTMK KG +  +E+VILIDSGAT+NFI Q L   L+L ++  T+FG TIGNG +C+G
Subjt:  EEDHKREEDTEDEASE-VIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEG

Query:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK------------------------------VNYEGE
         GIC+RV+VKL+E+TI+ADFLAVELG+VD VLGMQWLD+ GTMK+HWPSLTM+FW +GR+IILK                               N E E
Subjt:  SGICKRVKVKLKELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILK------------------------------VNYEGE

Query:  SE----TEVELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK
         E    T+ + KG E  +PM++ LL+QY D+F  P GLPP+R IDHRILT+ +Q+PINVRPYKY HVQK EI  LV EMLQ G+IRPSRSPYSS VLLVK
Subjt:  SE----TEVELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVK

Query:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF
        KKDGGWRFCVDYRKLNQ TV+DKFPIPVIEELLDEL+GA VFSKLDL SGYHQIRM+EED+EKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMN+VFKPF
Subjt:  KKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPF

Query:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV
        LRRCVLVFF DILVYS D+ EH KHLGM+FA+LRD++L+AN  KCV AHS++QYLGH IS  GVEAD+ KIRSMVNWPRP D+T LRGFLGLTGYYR+FV
Subjt:  LRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFV

Query:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD
        K Y  IA PLTKLLQKNAF WNEEA                            ET                AF +LK+AMTT+ VLALPDWS PFTIETD
Subjt:  KSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETD

Query:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQDQ----------KALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTT
        ASG GLGAVLSQ GHPIAF++QKLS RAQ +          KALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGL NK ADALSR +  +EL+ MTT+
Subjt:  ASGVGLGAVLSQDGHPIAFFSQKLSPRAQDQ----------KALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTT

Query:  GIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNGR
        GIVDI ++ +EV+ D+ELQKI+A+LK E + GGK++W NGR
Subjt:  GIVDIEIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNGR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.5e-7935.98Show/hide
Query:  YKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGG-----WRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRM
        Y Y    ++E+   + +ML  G+IR S SPY+SP+ +V KK        +R  +DYRKLN++TV D+ PIP ++E+L +L     F+ +DL  G+HQI M
Subjt:  YKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGG-----WRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRM

Query:  REEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLG
          E V KTAF T  GHYE+L MPFGL NAPATFQ  MN++ +P L +  LV+  DI+V+S  +DEH++ LG+VF  L    L     KC     +  +LG
Subjt:  REEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLG

Query:  HLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETD
        H+++  G++ +  KI ++  +P P     ++ FLGLTGYYR+F+ ++ +IA P+TK L+KN              +K+  T                   
Subjt:  HLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETD

Query:  ASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPR-----------------------------
                 N E   AF +LK  ++  P+L +PD+++ FT+ TDAS V LGAVLSQDGHP+++ S+ L+                               
Subjt:  ASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPR-----------------------------

Query:  ---AQDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSR
           + D + L +L   ++   +  +W  KL  +DF+I Y  G +N VADALSR
Subjt:  ---AQDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSR

P10401 Retrovirus-related Pol polyprotein from transposon gypsy4.3e-6627.79Show/hide
Query:  GREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELTIVADFLAVE-LGTVDLVLGMQWLDSTGTMKVHWPSLTM
        GR + +LID+ A  N+I    V +L+  +   + F V     +   GS   K  K  +K    ++ F  ++ L   D ++G+  L   G +K++    ++
Subjt:  GREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELTIVADFLAVE-LGTVDLVLGMQWLDSTGTMKVHWPSLTM

Query:  TFWMKGRRI----ILKVNYEGESETEVELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQA
         +     ++       VN+   ++  V    K+E    + R  + ++        LP   A+   I TV D +P+  R Y       + +   V ++L+ 
Subjt:  TFWMKGRRI----ILKVNYEGESETEVELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQA

Query:  GVIRPSRSPYSSPVLLVKKK------DGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFL
        G+IRPSRSPY+SP  +V KK      +   R  +D+RKLN+ T+ D++P+P I  +L  L  A  F+ LDL SGYHQI + E D EKT+F  + G YEF 
Subjt:  GVIRPSRSPYSSPVLLVKKK------DGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFL

Query:  VMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVN
         +PFGL NA + FQ  +++V +  + +   V+  D++++S +  +H++H+  V   L D  +  ++ K       V+YLG ++S  G ++D  K++++  
Subjt:  VMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVN

Query:  WPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQL
        +P P  +  +R FLGL  YYR F+K +  IA P+T +L+                                 +  ++    S+K    +NE    AF +L
Subjt:  WPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQL

Query:  KLAMTTLPV-LALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQ---------------------------------DQKALKFLLEQRE
        +  + +  V L  PD+ +PF + TDAS  G+GAVLSQ+G PI   S+ L    Q                                 D + L F +  R 
Subjt:  KLAMTTLPV-LALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQ---------------------------------DQKALKFLLEQRE

Query:  VQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEM
           + ++W + +  ++ ++ Y+PG +N VADALSR++    LN +      D   I  E+ +
Subjt:  VQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEM

P20825 Retrovirus-related Pol polyprotein from transposon 2972.9e-7835.12Show/hide
Query:  ILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGG-----WRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVF
        +L      PI  + Y      + E+   V EML  G+IR S SPY+SP  +V KK        +R  +DYRKLN++T+ D++PIP ++E+L +L     F
Subjt:  ILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGG-----WRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVF

Query:  SKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANR
        + +DL  G+HQI M EE + KTAF T  GHYE+L MPFGL NAPATFQ  MN + +P L +  LV+  DI+++S  + EH+  + +VF  L D  L    
Subjt:  SKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANR

Query:  SKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVL
         KC     +  +LGH+++  G++ +  K++++V++P P     +R FLGLTGYYR+F+ +Y +IA P+T  L+K             D  KL        
Subjt:  SKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVL

Query:  ALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPR---------------
                                 E   AF++LK  +   P+L LPD+ + F + TDAS + LGAVLSQ+GHPI+F S+ L+                 
Subjt:  ALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPR---------------

Query:  -----------------AQDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSR
                         A D + L++L   +E   + ++W  +L  Y F+I Y  G +N VADALSR
Subjt:  -----------------AQDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.3e-7433.73Show/hide
Query:  MVQRLLEQYADVFRLP-TGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKK-----DGGWRFCVDYR
        ++  LL ++  +F  P +G+    A+   I T   Q PI  + Y Y    + E+ + + E+LQ G+IRPS SPY+SP+ +V KK     +  +R  VD++
Subjt:  MVQRLLEQYADVFRLP-TGLPPRRAIDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKK-----DGGWRFCVDYR

Query:  KLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDIL
        +LN VT+ D +PIP I   L  L  A  F+ LDL SG+HQI M+E D+ KTAF T  G YEFL +PFGL NAPA FQ +++++ +  + +   V+  DI+
Subjt:  KLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDIL

Query:  VYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKL
        V+S D D H K+L +V A L    L  N  K     +QV++LG++++  G++AD  K+R++   P P  +  L+ FLG+T YYR+F++ Y ++A PLT L
Subjt:  VYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKL

Query:  LQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQD
                                T  + A         I++  S K     +E A  +F+ LK  + +  +LA P +++PF + TDAS   +GAVLSQD
Subjt:  LQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQD

Query:  ----GHPIAFFSQKLSPRAQ---------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALS
              PIA+ S+ L+   +                                 D + L F L  R    + ++W  ++  Y+ E++Y+PG  N VADALS
Subjt:  ----GHPIAFFSQKLSPRAQ---------------------------------DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALS

Query:  R
        R
Subjt:  R

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.6e-6533.27Show/hide
Query:  LLEQYADVFRLPTGLPPRRA------IDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQ
        L ++Y ++ R    LPPR A      + H I      +   ++PY      ++EI K+V ++L    I PS+SP SSPV+LV KKDG +R CVDYR LN+
Subjt:  LLEQYADVFRLPTGLPPRRA------IDHRILTVADQKPINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQ

Query:  VTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSV
         T++D FP+P I+ LL  +  A +F+ LDL SGYHQI M  +D  KTAF T  G YE+ VMPFGL NAP+TF   M + F+    R V V+  DIL++S 
Subjt:  VTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSV

Query:  DIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKN
          +EH KHL  V   L++  L   + KC  A  + ++LG+ I  + +   + K  ++ ++P PK +   + FLG+  YYRRF+ +  +IA P+ +L   +
Subjt:  DIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKN

Query:  AFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPI
           W E+     D+LK A+   PVL   +    + + TDAS+       EE       + +       L     + P      A  + L  ++    H  
Subjt:  AFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPI

Query:  AFFSQKLSPRAQDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELK
             K      D  +L  L  + E   + Q+WL  L  YDF + Y  G +N VADA+SR  +++   T  T+  +D E  +   + D     ++  +K
Subjt:  AFFSQKLSPRAQDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDIEIIEKEVEMDQELQKIIAELK

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein2.8e-1228.67Show/hide
Query:  RLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELTIVADFLAVELG--TVDLVL
        +L++ +T    M+  G +   +VV+ IDSGAT+NFI   L   L+L      +  V +G     +  G C  +++ ++E+ I  +FL ++L    VD++L
Subjt:  RLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELTIVADFLAVELG--TVDLVL

Query:  GMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKVNYEGESETEVELKGKEE
        G +WL   G   V+W +   +F    + I L   +E   +   ++K K E
Subjt:  GMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKVNYEGESETEVELKGKEE

AT3G30770.1 Eukaryotic aspartyl protease family protein1.4e-0626.62Show/hide
Query:  MKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELTIVADFLAVEL--GTVDLVLGMQWLDSTGTM
        M+  G ++  +VV++IDSGATNNFIS  L   L+L      +  V +G     +  G C  + + ++E+ I  +FL ++L    VD++LG     +    
Subjt:  MKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKLKELTIVADFLAVEL--GTVDLVLGMQWLDSTGTM

Query:  KVHWPSLTMTFWMKGRRIILKVNYEGESETEVELKGKEE
         + W +   +F+   + + L    +   +   ++K K E
Subjt:  KVHWPSLTMTFWMKGRRIILKVNYEGESETEVELKGKEE

ATMG00850.1 DNA/RNA polymerases superfamily protein1.0e-0679.31Show/hide
Query:  EMLQAGVIRPSRSPYSSPVLLVKKKDGGW
        EML+A +I+PS SPYSSPVLLV+KKDGGW
Subjt:  EMLQAGVIRPSRSPYSSPVLLVKKKDGGW

ATMG00860.1 DNA/RNA polymerases superfamily protein5.0e-4154.36Show/hide
Query:  MKHLGMVFAILRDHELFANRSKCVIAHSQVQYLG--HLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFH
        M HLGMV  I   H+ +ANR KC     Q+ YLG  H+IS  GV AD AK+ +MV WP PK+ T LRGFLGLTGYYRRFVK+YG+I  PLT+LL+KN+  
Subjt:  MKHLGMVFAILRDHELFANRSKCVIAHSQVQYLG--HLISNRGVEADEAKIRSMVNWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFH

Query:  WNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEA
        W E A + F  LK A+TTLPVLALPD   PF         + F   E+A
Subjt:  WNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAAGAAGACCACAAAAGGGAAGAGGATACAGAGGATGAGGCAAGCGAAGTAATAGAACTGAATCATCTGGAATTGAATATGGATAACCCTATTGAATTGAGGTTGATCAT
GGGAGTTACATCAAAGGGAACGATGAAACGAAAAGGACATGTGAACGGAAGGGAAGTAGTTATTCTGATTGACAGCGGGGCGACCAATAACTTCATCAGTCAAGTGTTGG
TAGATAAACTACAGCTGAGCATCGATCCGGGAACTCGTTTTGGAGTTACCATTGGGAATGGCAACCAATGTGAAGGAAGTGGGATTTGCAAGAGGGTGAAGGTGAAGTTA
AAAGAGTTAACAATCGTAGCAGATTTCCTAGCGGTAGAGTTAGGAACGGTAGACTTGGTGCTTGGGATGCAATGGCTAGATTCGACAGGGACTATGAAAGTTCACTGGCC
ATCTCTAACCATGACGTTTTGGATGAAGGGTAGAAGAATAATCCTAAAAGTGAACTATGAAGGAGAATCGGAAACAGAAGTTGAATTGAAAGGAAAGGAGGAAGGATTAC
CCATGGTTCAGCGGCTGCTCGAGCAATATGCAGACGTTTTTAGGTTGCCCACGGGTTTACCGCCAAGGAGAGCCATAGACCATCGCATTCTGACCGTGGCCGATCAGAAA
CCAATTAATGTAAGACCATATAAGTATCAGCATGTACAAAAGGAAGAGATTGGAAAATTGGTGTTAGAAATGTTACAAGCTGGGGTGATTCGTCCAAGTCGTAGCCCATA
TTCGAGCCCGGTCCTCTTAGTGAAGAAAAAAGATGGAGGGTGGAGATTTTGTGTAGATTACAGGAAACTCAATCAAGTAACGGTGGCTGATAAATTTCCAATTCCCGTGA
TCGAAGAACTCTTAGATGAACTTCATGGTGCGACAGTTTTCTCAAAGTTAGACCTTATATCTGGATACCACCAGATTAGGATGAGGGAGGAAGATGTGGAGAAAACAGCT
TTCCGCACGCATGAAGGACATTATGAGTTCTTGGTGATGCCTTTCGGCCTTACGAATGCTCCTGCCACCTTCCAATCTCTCATGAACGAGGTGTTCAAACCATTCCTTCG
AAGGTGTGTCCTGGTTTTTTTTTATGACATTCTAGTTTATAGTGTGGACATAGATGAGCACATGAAACATTTAGGAATGGTTTTTGCTATCTTGAGGGACCATGAATTGT
TTGCAAATAGGTCTAAATGTGTCATTGCTCATTCCCAAGTTCAATATTTGGGTCATCTGATTTCCAACAGAGGAGTGGAGGCTGATGAGGCCAAGATTCGCAGTATGGTA
AATTGGCCACGGCCGAAGGATATAACTGGGCTGAGGGGATTCCTTGGATTGACTGGGTATTATAGAAGATTTGTGAAAAGCTATGGAGAAATAGCTGCACCCTTAACCAA
ATTACTTCAGAAAAATGCATTCCATTGGAATGAGGAAGCCACAATAGTGTTCGACCAGCTGAAGCTAGCAATGACAACCTTACCAGTATTAGCATTGCCGGATTGGTCTA
AGCCCTTCACAATCGAAACTGATGCTTCAAGAAAAAATGCATTCCATTGGAATGAGGAAGCTACAATAGCGTTTGACCAGCTGAAGCTAGCAATGACAACCTTACCGGTA
TTAGCATTGCCGGATTGGTCTCAGCCCTTCACAATCGAAACTGATGCTTCAGGAGTAGGTTTAGGCGCAGTTTTATCACAGGATGGTCATCCCATCGCATTTTTCAGCCA
GAAACTGTCCCCAAGAGCCCAGGATCAGAAGGCTCTGAAATTTCTGTTAGAACAGAGGGAAGTTCAGCCTCAATTCCAAAAGTGGCTCACAAAACTTTTGGGATATGACT
TTGAGATTTTATACCAGCCGGGTCTACAGAATAAGGTTGCGGATGCTCTCTCAAGGAAAGACCATTCGGTGGAGTTAAATACAATGACAACCACAGGCATAGTTGATATA
GAGATAATAGAGAAGGAAGTCGAAATGGATCAAGAACTTCAGAAAATTATTGCCGAACTTAAGGGAGAGGTGGATCAAGGTGGGAAATACCAGTGGAACAATGGCAGGAA
GGATGGTGCTGCCGCGAAATTCTTCCCTCATTCTGAGACTTCTACACACGTTCCATGA
mRNA sequenceShow/hide mRNA sequence
GAAGAAGACCACAAAAGGGAAGAGGATACAGAGGATGAGGCAAGCGAAGTAATAGAACTGAATCATCTGGAATTGAATATGGATAACCCTATTGAATTGAGGTTGATCAT
GGGAGTTACATCAAAGGGAACGATGAAACGAAAAGGACATGTGAACGGAAGGGAAGTAGTTATTCTGATTGACAGCGGGGCGACCAATAACTTCATCAGTCAAGTGTTGG
TAGATAAACTACAGCTGAGCATCGATCCGGGAACTCGTTTTGGAGTTACCATTGGGAATGGCAACCAATGTGAAGGAAGTGGGATTTGCAAGAGGGTGAAGGTGAAGTTA
AAAGAGTTAACAATCGTAGCAGATTTCCTAGCGGTAGAGTTAGGAACGGTAGACTTGGTGCTTGGGATGCAATGGCTAGATTCGACAGGGACTATGAAAGTTCACTGGCC
ATCTCTAACCATGACGTTTTGGATGAAGGGTAGAAGAATAATCCTAAAAGTGAACTATGAAGGAGAATCGGAAACAGAAGTTGAATTGAAAGGAAAGGAGGAAGGATTAC
CCATGGTTCAGCGGCTGCTCGAGCAATATGCAGACGTTTTTAGGTTGCCCACGGGTTTACCGCCAAGGAGAGCCATAGACCATCGCATTCTGACCGTGGCCGATCAGAAA
CCAATTAATGTAAGACCATATAAGTATCAGCATGTACAAAAGGAAGAGATTGGAAAATTGGTGTTAGAAATGTTACAAGCTGGGGTGATTCGTCCAAGTCGTAGCCCATA
TTCGAGCCCGGTCCTCTTAGTGAAGAAAAAAGATGGAGGGTGGAGATTTTGTGTAGATTACAGGAAACTCAATCAAGTAACGGTGGCTGATAAATTTCCAATTCCCGTGA
TCGAAGAACTCTTAGATGAACTTCATGGTGCGACAGTTTTCTCAAAGTTAGACCTTATATCTGGATACCACCAGATTAGGATGAGGGAGGAAGATGTGGAGAAAACAGCT
TTCCGCACGCATGAAGGACATTATGAGTTCTTGGTGATGCCTTTCGGCCTTACGAATGCTCCTGCCACCTTCCAATCTCTCATGAACGAGGTGTTCAAACCATTCCTTCG
AAGGTGTGTCCTGGTTTTTTTTTATGACATTCTAGTTTATAGTGTGGACATAGATGAGCACATGAAACATTTAGGAATGGTTTTTGCTATCTTGAGGGACCATGAATTGT
TTGCAAATAGGTCTAAATGTGTCATTGCTCATTCCCAAGTTCAATATTTGGGTCATCTGATTTCCAACAGAGGAGTGGAGGCTGATGAGGCCAAGATTCGCAGTATGGTA
AATTGGCCACGGCCGAAGGATATAACTGGGCTGAGGGGATTCCTTGGATTGACTGGGTATTATAGAAGATTTGTGAAAAGCTATGGAGAAATAGCTGCACCCTTAACCAA
ATTACTTCAGAAAAATGCATTCCATTGGAATGAGGAAGCCACAATAGTGTTCGACCAGCTGAAGCTAGCAATGACAACCTTACCAGTATTAGCATTGCCGGATTGGTCTA
AGCCCTTCACAATCGAAACTGATGCTTCAAGAAAAAATGCATTCCATTGGAATGAGGAAGCTACAATAGCGTTTGACCAGCTGAAGCTAGCAATGACAACCTTACCGGTA
TTAGCATTGCCGGATTGGTCTCAGCCCTTCACAATCGAAACTGATGCTTCAGGAGTAGGTTTAGGCGCAGTTTTATCACAGGATGGTCATCCCATCGCATTTTTCAGCCA
GAAACTGTCCCCAAGAGCCCAGGATCAGAAGGCTCTGAAATTTCTGTTAGAACAGAGGGAAGTTCAGCCTCAATTCCAAAAGTGGCTCACAAAACTTTTGGGATATGACT
TTGAGATTTTATACCAGCCGGGTCTACAGAATAAGGTTGCGGATGCTCTCTCAAGGAAAGACCATTCGGTGGAGTTAAATACAATGACAACCACAGGCATAGTTGATATA
GAGATAATAGAGAAGGAAGTCGAAATGGATCAAGAACTTCAGAAAATTATTGCCGAACTTAAGGGAGAGGTGGATCAAGGTGGGAAATACCAGTGGAACAATGGCAGGAA
GGATGGTGCTGCCGCGAAATTCTTCCCTCATTCTGAGACTTCTACACACGTTCCATGA
Protein sequenceShow/hide protein sequence
EEDHKREEDTEDEASEVIELNHLELNMDNPIELRLIMGVTSKGTMKRKGHVNGREVVILIDSGATNNFISQVLVDKLQLSIDPGTRFGVTIGNGNQCEGSGICKRVKVKL
KELTIVADFLAVELGTVDLVLGMQWLDSTGTMKVHWPSLTMTFWMKGRRIILKVNYEGESETEVELKGKEEGLPMVQRLLEQYADVFRLPTGLPPRRAIDHRILTVADQK
PINVRPYKYQHVQKEEIGKLVLEMLQAGVIRPSRSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTVADKFPIPVIEELLDELHGATVFSKLDLISGYHQIRMREEDVEKTA
FRTHEGHYEFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRSKCVIAHSQVQYLGHLISNRGVEADEAKIRSMV
NWPRPKDITGLRGFLGLTGYYRRFVKSYGEIAAPLTKLLQKNAFHWNEEATIVFDQLKLAMTTLPVLALPDWSKPFTIETDASRKNAFHWNEEATIAFDQLKLAMTTLPV
LALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTTTGIVDI
EIIEKEVEMDQELQKIIAELKGEVDQGGKYQWNNGRKDGAAAKFFPHSETSTHVP