; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG02G009630 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG02G009630
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionTy3/gypsy retrotransposon protein
Genome locationCG_Chr02:14985682..14991550
RNA-Seq ExpressionClCG02G009630
SyntenyClCG02G009630
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046359.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.1e-22138.7Show/hide
Query:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT
        EE +     +   + +SKYKKLEM +F  EN +SWVY+ E+FFEI  L +A+KVKVAV     D      +   R + E    + T       D G K  
Subjt:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT

Query:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQL--------------VGRIREKPNRDV------
         ARLIRI+QEG+Y DY+KKFV YSAPLP+MAESVL DAF+  L+P ++AEV SR    LE CM  AQL              VG I  K +  V      
Subjt:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQL--------------VGRIREKPNRDV------

Query:  ---KCIQR----------------------------------RLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV
           K + R                                  RLDKGLCFRCNEK+ HGH CKI+E REL +FI   E+  EE E     +    E+  +
Subjt:  ---KCIQR----------------------------------RLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV

Query:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD
        E  E   +  R++   +  GTMKL+  VK KEV                L  E ++   +  ++G+ IGNG +   +GI         G   V D +V  
Subjt:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD

Query:  RGT-----------TSGTV-------------EGLHLEI-GDNH------------------------DFIEEEKKKEDVQLSE-----------MIVTL
         GT           T+GT+             EG  + + GD                          D+   E + ED                MI  L
Subjt:  RGT-----------TSGTV-------------EGLHLEI-GDNH------------------------DFIEEEKKKEDVQLSE-----------MIVTL

Query:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF
        L    D+F  P  LPPK  IDH I     Q P+N+ PYKY H+QKEEIEKLV E  + G+     S  ++ + L         FCVDYR+LN++TIADKF
Subjt:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF

Query:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------
         IPVIEEL DELH + V+SKLDL+SGYHQIRMKE DIEKTAFRTHEGHY+F+VMPF        FQ+++ +++                           
Subjt:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------

Query:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV
                                              GVEAD +K++ M +WP PK V  LR FLGL GYYRRFV++YG I AP+TKLL KN F WDE 
Subjt:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV

Query:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------
           AF  LKSAM T+ VL LPD+SL F+IETDASG GL A+L QNS  IA+FSQ LS RAQAKSIYERELM VV                          
Subjt:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------

Query:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG
                     W+TKLLG DFEILYQ  LQNK A ALSR+   +E+                +V +DEELQ +I  L+ NP  + K+    G L+Y G
Subjt:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG

Query:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------
        R+VLSKSSS+IPS+LH +HDS++GGHSGFLRTYKRM+GEL+W GMK D++KYVE CEICQRNK++A KPA + QPLPIP ++ ED TMDF          
Subjt:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------

Query:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN
           MVVVDRLSKY +FV +KHP+ AKQV   F+ +IV+ HG    +   R +     F    F   +   +      P    +  R    +   LRCFCN
Subjt:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN

Query:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD----
        EQP +W +++PWA+LW     + S R             PL  Y            + + +DLA++ALKE+L +AQ+RMKK A+  RR+L+F+VGD    
Subjt:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD----

Query:  --------------------------------EEV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---
                                        EEV           IHNVFH+SQLK K+G +  VQ     LT +FELQ+  + V+GIRW+ + GA   
Subjt:  --------------------------------EEV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---

Query:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEK----TLEKESGMKSKG
                                          +   GIVRP I   YKRRGKK K      E+E+G +  G
Subjt:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEK----TLEKESGMKSKG

KAA0055376.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.1e-22037.16Show/hide
Query:  EGKWGRSVRKREILREKAKGFRESKIGISRVQKVTYGKVFGSGSSIDKFLIFGSFDAILGEEIESEQ-NENAGSLS-KSKYKKLEMSIFTEENLKSWVYK
        EG     +  +E+L E  K     ++     +  +Y K   SG++ D     GS   + G+  E+E   E  G+L+ ++KYKKLEM +F  EN +SWVY+
Subjt:  EGKWGRSVRKREILREKAKGFRESKIGISRVQKVTYGKVFGSGSSIDKFLIFGSFDAILGEEIESEQ-NENAGSLS-KSKYKKLEMSIFTEENLKSWVYK

Query:  VEYFFEIRELSKAKKVKVAVPHLKSD-YDVGRAEEEDVRIISTDD-------------GKDTRARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDA
         E+FFEI  LS+++KVKVAV     D  D  R      ++ S +D              K   ARLIRI+QEG+Y +Y+KKFV YSAPLP MAESVL+DA
Subjt:  VEYFFEIRELSKAKKVKVAVPHLKSD-YDVGRAEEEDVRIISTDD-------------GKDTRARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDA

Query:  FMNELKPDIRAEVTSR----LERCMREAQLV------------------------------GRIREKPNRDVKC--------------------------
        F+  L+P ++AEV SR    LE CMREAQLV                              G + + P R                              
Subjt:  FMNELKPDIRAEVTSR----LERCMREAQLV------------------------------GRIREKPNRDVKC--------------------------

Query:  -IQRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVELVADMEE--AELTVVEVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV-----
          + RLD+GLCFRCN+K+  GH CK +E REL  FI   E+ +EE +   ++ E   EL  +E+ +   + L+++   S  GTMKLK  ++ KEV     
Subjt:  -IQRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVELVADMEE--AELTVVEVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV-----

Query:  ----------RLAEELELSFTKMMKYGVVIGNGDAINSQGI-------HGGSLAVVDHVVYDRGT-----------TSGT--------------------
                   LA +LEL       +G  IGNG     +GI             + D +  + G+           T+GT                    
Subjt:  ----------RLAEELELSFTKMMKYGVVIGNGDAINSQGI-------HGGSLAVVDHVVYDRGT-----------TSGT--------------------

Query:  -----------------------------VEGLHLEIGDNHDFIEEEKKKEDVQLSEMIVTLLKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPY
                                     +E  ++++  +  +  +EK++ D     MI  LL+   D+F  P+ LPPK DIDH I    DQ P+N+ PY
Subjt:  -----------------------------VEGLHLEIGDNHDFIEEEKKKEDVQLSEMIVTLLKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPY

Query:  KYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIE
        KY H QK EIEKLV E  + G+     S  ++ + L         FCVDYR+LNQ TI+DKF IPVIEEL DEL+ + V+SKLDL+SGYHQIRMKE DIE
Subjt:  KYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIE

Query:  KTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY----------------------------------------------------------------I
        KTAFRTHEGHY+FLVMPF        FQ+++ +++                                                                 
Subjt:  KTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY----------------------------------------------------------------I

Query:  GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGL
        GVEAD +KIR+M  WP P  V ELR FL L GYYRRFV+ Y  I  P+TKLL KN F W+E    AF +LK AM T+ VL LPD+SL F IETDASG GL
Subjt:  GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGL

Query:  EAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV---------------------------------------WMTKLLGNDFEILYQLRLQNKPAYA
         A+L Q    IA++SQ LS+RAQAKSIYERELM VV                                       W+TKLLG DFEILYQ    NK A A
Subjt:  EAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV---------------------------------------WMTKLLGNDFEILYQLRLQNKPAYA

Query:  LSRVPPLLEIH---------------KVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLGRLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAG
        LSRV P +E+H               +V +DE LQKI+  LK+  E   KF+W  G LLY GRLVL ++SSLIP +LH +HDSV+GGHSGFLRTYKRM+G
Subjt:  LSRVPPLLEIH---------------KVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLGRLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAG

Query:  ELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF-------------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVK
        ELYW GMK D++KYVE CE+CQRNK +A KPA + QPLPIP ++ ED TMDF             MVVVDRL+KY +F+ LKHP+ AKQV   F+ +IV+
Subjt:  ELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF-------------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVK

Query:  HHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCNEQPLRWSKWLPWAKLW----------------LY-MSD
         HG  N +   R +     F    F       +      P    +  R    +   LRCFCNEQP +W +++PWA+LW                +Y  S 
Subjt:  HHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCNEQPLRWSKWLPWAKLW----------------LY-MSD

Query:  RPLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD------------------------------------------
         PL  Y           ++ + +D AL+ALKE+L +AQ+RMKK A+  RR+L+ +VG+                                          
Subjt:  RPLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD------------------------------------------

Query:  -----EEVIHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA-------------------------------------IREG
             E  IHNVFH+SQLK K+G +  VQH   +LT  FELQ+  + V+GIRWN + GA                                     +   
Subjt:  -----EEVIHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA-------------------------------------IREG

Query:  GIVRPLIQYTYKRRGKKEKTLEKESG
        GIVRP I +TYKRR +K   L+K  G
Subjt:  GIVRPLIQYTYKRRGKKEKTLEKESG

KAA0061073.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.9e-22337.22Show/hide
Query:  EGKWGRSVRKREILREKAKGFRESKIGISRVQKVTYGKVFGSGSSIDKFLIFGSFDAILG--EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYK
        EG     +  +E+L E  K     ++     +  +Y K   SG++ D     GS   + G  +E E     N     +SKYKKLEM +F  EN +SWVY+
Subjt:  EGKWGRSVRKREILREKAKGFRESKIGISRVQKVTYGKVFGSGSSIDKFLIFGSFDAILG--EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYK

Query:  VEYFFEIRELSKAKKVKVAVPHLKSD-YDVGRAEEEDVRIISTDD-------------GKDTRARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDA
         E+FFEI  LS+++KVKVAV     D  D  R      ++ S +D              K   ARLIRI+QEG+Y +Y+KKFV YSAPLP MAESVL+DA
Subjt:  VEYFFEIRELSKAKKVKVAVPHLKSD-YDVGRAEEEDVRIISTDD-------------GKDTRARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDA

Query:  FMNELKPDIRAEVTSR----LERCMREAQLV------------------------------GRIREKPNRDVKC--------------------------
        F+  L+P ++AEV SR    LE CMREAQLV                              G + + P R                              
Subjt:  FMNELKPDIRAEVTSR----LERCMREAQLV------------------------------GRIREKPNRDVKC--------------------------

Query:  -IQRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVELVADM--EEAELTVVEVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV-----
          + RLD+GLCFRCN+K+  GH CK +E REL  FI   E+ +EE +   ++  E  EL  +E+ +   + L+++   S  GTMKLK  ++ KEV     
Subjt:  -IQRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVELVADM--EEAELTVVEVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV-----

Query:  ----------RLAEELELSFTKMMKYGVVIGNGDAINSQGI-------HGGSLAVVDHVVYDRGT-----------TSGT--------------------
                   LA ELEL       +G  IGNG     +GI             + D +  + G+           T+GT                    
Subjt:  ----------RLAEELELSFTKMMKYGVVIGNGDAINSQGI-------HGGSLAVVDHVVYDRGT-----------TSGT--------------------

Query:  -----------------------------VEGLHLEIGDNHDFIEEEKKKEDVQLSEMIVTLLKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPY
                                     +E  ++E+  +  +  +EK++ D     MI  LL+  +D+F  P+ LPPK DIDH I    DQ P+N+ PY
Subjt:  -----------------------------VEGLHLEIGDNHDFIEEEKKKEDVQLSEMIVTLLKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPY

Query:  KYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIE
        KY H QK EIEKLV E  + G+     S  ++ + L         FCVDYR+LNQ TI+DKF IPVIEEL DEL+ + V+SKLDL+SGYHQIRMKE D+E
Subjt:  KYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIE

Query:  KTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY----------------------------------------------------------------I
        KTAFRTHEGHY+FLVMPF        FQ+++ +++                                                                 
Subjt:  KTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY----------------------------------------------------------------I

Query:  GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGL
        GVEAD +KIR+M  WP P  V ELR FLGL GYYRRFV+ Y  I  P+TKLL KN F W+E    AF +LK AM T+ VL LPD+SL F IETDASG GL
Subjt:  GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGL

Query:  EAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV---------------------------------------WMTKLLGNDFEILYQLRLQNKPAYA
         A+L Q    IA++SQ LS+RAQAKSIYERELM VV                                       W+TKLLG DFEILYQ    NK A A
Subjt:  EAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV---------------------------------------WMTKLLGNDFEILYQLRLQNKPAYA

Query:  LSRVPPLLEIH---------------KVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLGRLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAG
        LSRV P +E+H               +V +DE LQKI+  LK+  E   KF+W  G LLY GRLVL ++SSLIP +LH +HDSV+GGHSGFLRTYKRM+G
Subjt:  LSRVPPLLEIH---------------KVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLGRLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAG

Query:  ELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF-------------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVK
        ELYW GMK D++KYVE CE+CQRNK +A KPA + QPLPIP ++ ED TMDF             MVVVDRL+KY +F+ LKHP+ AKQV   F+ +IV+
Subjt:  ELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF-------------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVK

Query:  HHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCNEQPLRWSKWLPWAKLW----------------LY-MSD
         HG  N +   R +     F    F       +      P    +  R    +   LRCFCNEQP +W +++PWA+LW                +Y  S 
Subjt:  HHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCNEQPLRWSKWLPWAKLW----------------LY-MSD

Query:  RPLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD------------------------------------------
         PL  Y           ++ + +D AL+ALKE+L +AQ+RMKK A+  RR+L+ +VG+                                          
Subjt:  RPLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD------------------------------------------

Query:  -----EEVIHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA-------------------------------------IREG
             E  IHNVFH+SQLK K+G +  VQH   +LT  FELQ+  + V+GIRWN + GA                                     +   
Subjt:  -----EEVIHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA-------------------------------------IREG

Query:  GIVRPLIQYTYKRRGKKEKTLEKESG
        GIVRP I +TYKRR +K   L+K  G
Subjt:  GIVRPLIQYTYKRRGKKEKTLEKESG

TYJ96663.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.2e-22138.42Show/hide
Query:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT
        EE +     +   + +SKYKKLEM +F  EN +SWVY+ E+FFEI  L +A+KVKVAV     D      +   R + E    + T       D G K  
Subjt:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT

Query:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQLV-------------------------------
         ARLIRI+QEG+Y DY+KKFV YSAPLP+MAESVL DAF+  L+P ++AEV SR    LE CM  AQLV                               
Subjt:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQLV-------------------------------

Query:  ----GRIR-----------------EKPNRDVKCI-----QRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV
            G +R                 +K    VK +     + RLDKGLCFRCNEK+ HGH CKI+E REL +FI   E+  EE E     + E  E+  +
Subjt:  ----GRIR-----------------EKPNRDVKCI-----QRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV

Query:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD
           E   +  R++   +  GTMKL+  VK KEV                L +E ++   +  ++G+ IG+G +   +GI         G   V D +V  
Subjt:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD

Query:  RGT-----------TSGTV-------------EGLHLEI---------------------GDNHDFIEEEKKKE---------------DVQLSEMIVTL
         GT           T+GT+             EG  + +                      ++  F+ + ++ E               D +   MI  L
Subjt:  RGT-----------TSGTV-------------EGLHLEI---------------------GDNHDFIEEEKKKE---------------DVQLSEMIVTL

Query:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF
        L    D+FK P  LPPK  IDH I     Q P+N+ PYKY H+QKEEIEKLV E  + G+     S  ++ + L         FCVDYR+LN++TIADKF
Subjt:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF

Query:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------
         IPVIEEL DELH + V+SKLDL+SGYHQIRM+E DIEKTAFRTHEGHY+F+VMPF        FQ+++ +++                           
Subjt:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------

Query:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV
                                              GVEAD +K+++M +WP PK V  LR FLGL GYYRRFV+ YG I AP+TKLL KN F WDE 
Subjt:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV

Query:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------
           AF  LKSAM T+ VL LPD+SL F+IETDASG GL A+L QNS  IA+FSQ LS RAQAKSIYERELM VV                          
Subjt:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------

Query:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG
                     W+TKLLG DFEILYQ  LQNK A ALSR+   +E+                +V +DEELQ +I  L+ NP  + K+    G L+Y G
Subjt:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG

Query:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------
        R+VLSKSSS+IPS+LH +HDS++GGHSGFLRTYKRM+GEL+W GMK D++KYVE CEICQRNK++A KPA + QPLPIP ++ ED TMDF          
Subjt:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------

Query:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN
           MVVVDRLSKY +FV +KHP+ AKQV   F+ +IV+ HG    +   R +     F    F       +      P    +  R    +   LRCFCN
Subjt:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN

Query:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGDE---
        EQP +W +++PWA+LW     + S R             PL  Y            + + +DLA++ALKE+L +AQ+RMKK A+  RR+L+F+VGDE   
Subjt:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGDE---

Query:  ---------------------------------EV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---
                                         EV           IHNVFH+SQLK K+G +  VQ     LT +FELQ+  + V+GIRW+ + GA   
Subjt:  ---------------------------------EV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---

Query:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEKTL----EKESG
                                          +   GIVRP I   YKRRGKK K      E+E+G
Subjt:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEKTL----EKESG

TYK27058.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]7.2e-22138.5Show/hide
Query:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT
        EE +     +   + +SKYKKLEM +F  EN +SWVY+ E+FFEI  L +A+KVKVAV     D      +   R + E    + T       D G K  
Subjt:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT

Query:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQLV-------------------------------
         ARLIRI+QEG+Y DY+KKFV YSAPLP+MAESVL DAF+  L+P ++AEV SR    LE CM  AQLV                               
Subjt:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQLV-------------------------------

Query:  ----GRIR-----------------EKPNRDVKCI-----QRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV
            G +R                 +K    VK +     + RLDKGLCFRCNEK+ HGH CKI+E REL +FI   E+  EE E     + E  E+  +
Subjt:  ----GRIR-----------------EKPNRDVKCI-----QRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV

Query:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD
           E   +  R++   +  GTMKL+  VK KEV                L +E ++   +  ++G+ IG+G +   +GI         G   V D +V  
Subjt:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD

Query:  RGT-----------TSGTV-------------EGLHLEI---------------------GDNHDFIEEEKKKE---------------DVQLSEMIVTL
         GT           T+GT+             EG  + +                      ++  F+ + ++ E               D +   MI  L
Subjt:  RGT-----------TSGTV-------------EGLHLEI---------------------GDNHDFIEEEKKKE---------------DVQLSEMIVTL

Query:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF
        L    D+FK P  LPPK  IDH I     Q P+N+ PYKY H+QKEEIEKLV E  + G+     S  ++ + L         FCVDYR+LN++TIADKF
Subjt:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF

Query:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------
         IPVIEEL DELH + V+SKLDL+SGYHQIRM+E DIEKTAFRTHEGHY+F+VMPF        FQ+++ +++                           
Subjt:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------

Query:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV
                                              GVEAD +K+++M +WP PK V  LR FLGL GYYRRFV+ YG I AP+TKLL KN F WDE 
Subjt:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV

Query:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------
           AF  LKSAM T+ VL LPD+SL F+IETDASG GL A+L QNS  IA+FSQ LS RAQAKSIYERELM VV                          
Subjt:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------

Query:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG
                     W+TKLLG DFEILYQ  LQNK A ALSR+   +E+                +V +DEELQ +I  L+ NP  + K+    G L+Y G
Subjt:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG

Query:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------
        R+VLSKSSS+IPS+LH +HDS++GGHSGFLRTYKRM+GEL+W GMK D++KYVE CEICQRNK++A KPA + QPLPIP ++ ED TMDF          
Subjt:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------

Query:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN
           MVVVDRLSKY +FV +KHP+ AKQV   F+ +IV+ HG    +   R +     F    F       +      P    +  R    +   LRCFCN
Subjt:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN

Query:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGDE---
        EQP +W +++PWA+LW     + S R             PL  Y            + + +DLA++ALKE+L +AQ+RMKK A+  RR+L+F+VGDE   
Subjt:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGDE---

Query:  ---------------------------------EV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---
                                         EV           IHNVFH+SQLK K+G +  VQ     LT +FELQ+  + V+GIRW+ + GA   
Subjt:  ---------------------------------EV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---

Query:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEK
                                          +   GIVRP I   YKRRGKK K
Subjt:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEK

TrEMBL top hitse value%identityAlignment
A0A5A7TYN3 Ty3/gypsy retrotransposon protein5.3e-22238.7Show/hide
Query:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT
        EE +     +   + +SKYKKLEM +F  EN +SWVY+ E+FFEI  L +A+KVKVAV     D      +   R + E    + T       D G K  
Subjt:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT

Query:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQL--------------VGRIREKPNRDV------
         ARLIRI+QEG+Y DY+KKFV YSAPLP+MAESVL DAF+  L+P ++AEV SR    LE CM  AQL              VG I  K +  V      
Subjt:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQL--------------VGRIREKPNRDV------

Query:  ---KCIQR----------------------------------RLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV
           K + R                                  RLDKGLCFRCNEK+ HGH CKI+E REL +FI   E+  EE E     +    E+  +
Subjt:  ---KCIQR----------------------------------RLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV

Query:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD
        E  E   +  R++   +  GTMKL+  VK KEV                L  E ++   +  ++G+ IGNG +   +GI         G   V D +V  
Subjt:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD

Query:  RGT-----------TSGTV-------------EGLHLEI-GDNH------------------------DFIEEEKKKEDVQLSE-----------MIVTL
         GT           T+GT+             EG  + + GD                          D+   E + ED                MI  L
Subjt:  RGT-----------TSGTV-------------EGLHLEI-GDNH------------------------DFIEEEKKKEDVQLSE-----------MIVTL

Query:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF
        L    D+F  P  LPPK  IDH I     Q P+N+ PYKY H+QKEEIEKLV E  + G+     S  ++ + L         FCVDYR+LN++TIADKF
Subjt:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF

Query:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------
         IPVIEEL DELH + V+SKLDL+SGYHQIRMKE DIEKTAFRTHEGHY+F+VMPF        FQ+++ +++                           
Subjt:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------

Query:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV
                                              GVEAD +K++ M +WP PK V  LR FLGL GYYRRFV++YG I AP+TKLL KN F WDE 
Subjt:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV

Query:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------
           AF  LKSAM T+ VL LPD+SL F+IETDASG GL A+L QNS  IA+FSQ LS RAQAKSIYERELM VV                          
Subjt:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------

Query:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG
                     W+TKLLG DFEILYQ  LQNK A ALSR+   +E+                +V +DEELQ +I  L+ NP  + K+    G L+Y G
Subjt:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG

Query:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------
        R+VLSKSSS+IPS+LH +HDS++GGHSGFLRTYKRM+GEL+W GMK D++KYVE CEICQRNK++A KPA + QPLPIP ++ ED TMDF          
Subjt:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------

Query:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN
           MVVVDRLSKY +FV +KHP+ AKQV   F+ +IV+ HG    +   R +     F    F   +   +      P    +  R    +   LRCFCN
Subjt:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN

Query:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD----
        EQP +W +++PWA+LW     + S R             PL  Y            + + +DLA++ALKE+L +AQ+RMKK A+  RR+L+F+VGD    
Subjt:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD----

Query:  --------------------------------EEV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---
                                        EEV           IHNVFH+SQLK K+G +  VQ     LT +FELQ+  + V+GIRW+ + GA   
Subjt:  --------------------------------EEV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---

Query:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEK----TLEKESGMKSKG
                                          +   GIVRP I   YKRRGKK K      E+E+G +  G
Subjt:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEK----TLEKESGMKSKG

A0A5A7UM77 Ty3/gypsy retrotransposon protein1.0e-22037.16Show/hide
Query:  EGKWGRSVRKREILREKAKGFRESKIGISRVQKVTYGKVFGSGSSIDKFLIFGSFDAILGEEIESEQ-NENAGSLS-KSKYKKLEMSIFTEENLKSWVYK
        EG     +  +E+L E  K     ++     +  +Y K   SG++ D     GS   + G+  E+E   E  G+L+ ++KYKKLEM +F  EN +SWVY+
Subjt:  EGKWGRSVRKREILREKAKGFRESKIGISRVQKVTYGKVFGSGSSIDKFLIFGSFDAILGEEIESEQ-NENAGSLS-KSKYKKLEMSIFTEENLKSWVYK

Query:  VEYFFEIRELSKAKKVKVAVPHLKSD-YDVGRAEEEDVRIISTDD-------------GKDTRARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDA
         E+FFEI  LS+++KVKVAV     D  D  R      ++ S +D              K   ARLIRI+QEG+Y +Y+KKFV YSAPLP MAESVL+DA
Subjt:  VEYFFEIRELSKAKKVKVAVPHLKSD-YDVGRAEEEDVRIISTDD-------------GKDTRARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDA

Query:  FMNELKPDIRAEVTSR----LERCMREAQLV------------------------------GRIREKPNRDVKC--------------------------
        F+  L+P ++AEV SR    LE CMREAQLV                              G + + P R                              
Subjt:  FMNELKPDIRAEVTSR----LERCMREAQLV------------------------------GRIREKPNRDVKC--------------------------

Query:  -IQRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVELVADMEE--AELTVVEVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV-----
          + RLD+GLCFRCN+K+  GH CK +E REL  FI   E+ +EE +   ++ E   EL  +E+ +   + L+++   S  GTMKLK  ++ KEV     
Subjt:  -IQRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVELVADMEE--AELTVVEVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV-----

Query:  ----------RLAEELELSFTKMMKYGVVIGNGDAINSQGI-------HGGSLAVVDHVVYDRGT-----------TSGT--------------------
                   LA +LEL       +G  IGNG     +GI             + D +  + G+           T+GT                    
Subjt:  ----------RLAEELELSFTKMMKYGVVIGNGDAINSQGI-------HGGSLAVVDHVVYDRGT-----------TSGT--------------------

Query:  -----------------------------VEGLHLEIGDNHDFIEEEKKKEDVQLSEMIVTLLKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPY
                                     +E  ++++  +  +  +EK++ D     MI  LL+   D+F  P+ LPPK DIDH I    DQ P+N+ PY
Subjt:  -----------------------------VEGLHLEIGDNHDFIEEEKKKEDVQLSEMIVTLLKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPY

Query:  KYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIE
        KY H QK EIEKLV E  + G+     S  ++ + L         FCVDYR+LNQ TI+DKF IPVIEEL DEL+ + V+SKLDL+SGYHQIRMKE DIE
Subjt:  KYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIE

Query:  KTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY----------------------------------------------------------------I
        KTAFRTHEGHY+FLVMPF        FQ+++ +++                                                                 
Subjt:  KTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY----------------------------------------------------------------I

Query:  GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGL
        GVEAD +KIR+M  WP P  V ELR FL L GYYRRFV+ Y  I  P+TKLL KN F W+E    AF +LK AM T+ VL LPD+SL F IETDASG GL
Subjt:  GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGL

Query:  EAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV---------------------------------------WMTKLLGNDFEILYQLRLQNKPAYA
         A+L Q    IA++SQ LS+RAQAKSIYERELM VV                                       W+TKLLG DFEILYQ    NK A A
Subjt:  EAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV---------------------------------------WMTKLLGNDFEILYQLRLQNKPAYA

Query:  LSRVPPLLEIH---------------KVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLGRLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAG
        LSRV P +E+H               +V +DE LQKI+  LK+  E   KF+W  G LLY GRLVL ++SSLIP +LH +HDSV+GGHSGFLRTYKRM+G
Subjt:  LSRVPPLLEIH---------------KVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLGRLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAG

Query:  ELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF-------------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVK
        ELYW GMK D++KYVE CE+CQRNK +A KPA + QPLPIP ++ ED TMDF             MVVVDRL+KY +F+ LKHP+ AKQV   F+ +IV+
Subjt:  ELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF-------------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVK

Query:  HHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCNEQPLRWSKWLPWAKLW----------------LY-MSD
         HG  N +   R +     F    F       +      P    +  R    +   LRCFCNEQP +W +++PWA+LW                +Y  S 
Subjt:  HHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCNEQPLRWSKWLPWAKLW----------------LY-MSD

Query:  RPLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD------------------------------------------
         PL  Y           ++ + +D AL+ALKE+L +AQ+RMKK A+  RR+L+ +VG+                                          
Subjt:  RPLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD------------------------------------------

Query:  -----EEVIHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA-------------------------------------IREG
             E  IHNVFH+SQLK K+G +  VQH   +LT  FELQ+  + V+GIRWN + GA                                     +   
Subjt:  -----EEVIHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA-------------------------------------IREG

Query:  GIVRPLIQYTYKRRGKKEKTLEKESG
        GIVRP I +TYKRR +K   L+K  G
Subjt:  GIVRPLIQYTYKRRGKKEKTLEKESG

A0A5A7UYM1 Ty3/gypsy retrotransposon protein2.8e-22337.22Show/hide
Query:  EGKWGRSVRKREILREKAKGFRESKIGISRVQKVTYGKVFGSGSSIDKFLIFGSFDAILG--EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYK
        EG     +  +E+L E  K     ++     +  +Y K   SG++ D     GS   + G  +E E     N     +SKYKKLEM +F  EN +SWVY+
Subjt:  EGKWGRSVRKREILREKAKGFRESKIGISRVQKVTYGKVFGSGSSIDKFLIFGSFDAILG--EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYK

Query:  VEYFFEIRELSKAKKVKVAVPHLKSD-YDVGRAEEEDVRIISTDD-------------GKDTRARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDA
         E+FFEI  LS+++KVKVAV     D  D  R      ++ S +D              K   ARLIRI+QEG+Y +Y+KKFV YSAPLP MAESVL+DA
Subjt:  VEYFFEIRELSKAKKVKVAVPHLKSD-YDVGRAEEEDVRIISTDD-------------GKDTRARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDA

Query:  FMNELKPDIRAEVTSR----LERCMREAQLV------------------------------GRIREKPNRDVKC--------------------------
        F+  L+P ++AEV SR    LE CMREAQLV                              G + + P R                              
Subjt:  FMNELKPDIRAEVTSR----LERCMREAQLV------------------------------GRIREKPNRDVKC--------------------------

Query:  -IQRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVELVADM--EEAELTVVEVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV-----
          + RLD+GLCFRCN+K+  GH CK +E REL  FI   E+ +EE +   ++  E  EL  +E+ +   + L+++   S  GTMKLK  ++ KEV     
Subjt:  -IQRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVELVADM--EEAELTVVEVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV-----

Query:  ----------RLAEELELSFTKMMKYGVVIGNGDAINSQGI-------HGGSLAVVDHVVYDRGT-----------TSGT--------------------
                   LA ELEL       +G  IGNG     +GI             + D +  + G+           T+GT                    
Subjt:  ----------RLAEELELSFTKMMKYGVVIGNGDAINSQGI-------HGGSLAVVDHVVYDRGT-----------TSGT--------------------

Query:  -----------------------------VEGLHLEIGDNHDFIEEEKKKEDVQLSEMIVTLLKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPY
                                     +E  ++E+  +  +  +EK++ D     MI  LL+  +D+F  P+ LPPK DIDH I    DQ P+N+ PY
Subjt:  -----------------------------VEGLHLEIGDNHDFIEEEKKKEDVQLSEMIVTLLKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPY

Query:  KYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIE
        KY H QK EIEKLV E  + G+     S  ++ + L         FCVDYR+LNQ TI+DKF IPVIEEL DEL+ + V+SKLDL+SGYHQIRMKE D+E
Subjt:  KYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIE

Query:  KTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY----------------------------------------------------------------I
        KTAFRTHEGHY+FLVMPF        FQ+++ +++                                                                 
Subjt:  KTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY----------------------------------------------------------------I

Query:  GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGL
        GVEAD +KIR+M  WP P  V ELR FLGL GYYRRFV+ Y  I  P+TKLL KN F W+E    AF +LK AM T+ VL LPD+SL F IETDASG GL
Subjt:  GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGL

Query:  EAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV---------------------------------------WMTKLLGNDFEILYQLRLQNKPAYA
         A+L Q    IA++SQ LS+RAQAKSIYERELM VV                                       W+TKLLG DFEILYQ    NK A A
Subjt:  EAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV---------------------------------------WMTKLLGNDFEILYQLRLQNKPAYA

Query:  LSRVPPLLEIH---------------KVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLGRLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAG
        LSRV P +E+H               +V +DE LQKI+  LK+  E   KF+W  G LLY GRLVL ++SSLIP +LH +HDSV+GGHSGFLRTYKRM+G
Subjt:  LSRVPPLLEIH---------------KVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLGRLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAG

Query:  ELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF-------------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVK
        ELYW GMK D++KYVE CE+CQRNK +A KPA + QPLPIP ++ ED TMDF             MVVVDRL+KY +F+ LKHP+ AKQV   F+ +IV+
Subjt:  ELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF-------------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVK

Query:  HHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCNEQPLRWSKWLPWAKLW----------------LY-MSD
         HG  N +   R +     F    F       +      P    +  R    +   LRCFCNEQP +W +++PWA+LW                +Y  S 
Subjt:  HHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCNEQPLRWSKWLPWAKLW----------------LY-MSD

Query:  RPLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD------------------------------------------
         PL  Y           ++ + +D AL+ALKE+L +AQ+RMKK A+  RR+L+ +VG+                                          
Subjt:  RPLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGD------------------------------------------

Query:  -----EEVIHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA-------------------------------------IREG
             E  IHNVFH+SQLK K+G +  VQH   +LT  FELQ+  + V+GIRWN + GA                                     +   
Subjt:  -----EEVIHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA-------------------------------------IREG

Query:  GIVRPLIQYTYKRRGKKEKTLEKESG
        GIVRP I +TYKRR +K   L+K  G
Subjt:  GIVRPLIQYTYKRRGKKEKTLEKESG

A0A5D3BBH7 Ty3/gypsy retrotransposon protein2.0e-22138.42Show/hide
Query:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT
        EE +     +   + +SKYKKLEM +F  EN +SWVY+ E+FFEI  L +A+KVKVAV     D      +   R + E    + T       D G K  
Subjt:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT

Query:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQLV-------------------------------
         ARLIRI+QEG+Y DY+KKFV YSAPLP+MAESVL DAF+  L+P ++AEV SR    LE CM  AQLV                               
Subjt:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQLV-------------------------------

Query:  ----GRIR-----------------EKPNRDVKCI-----QRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV
            G +R                 +K    VK +     + RLDKGLCFRCNEK+ HGH CKI+E REL +FI   E+  EE E     + E  E+  +
Subjt:  ----GRIR-----------------EKPNRDVKCI-----QRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV

Query:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD
           E   +  R++   +  GTMKL+  VK KEV                L +E ++   +  ++G+ IG+G +   +GI         G   V D +V  
Subjt:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD

Query:  RGT-----------TSGTV-------------EGLHLEI---------------------GDNHDFIEEEKKKE---------------DVQLSEMIVTL
         GT           T+GT+             EG  + +                      ++  F+ + ++ E               D +   MI  L
Subjt:  RGT-----------TSGTV-------------EGLHLEI---------------------GDNHDFIEEEKKKE---------------DVQLSEMIVTL

Query:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF
        L    D+FK P  LPPK  IDH I     Q P+N+ PYKY H+QKEEIEKLV E  + G+     S  ++ + L         FCVDYR+LN++TIADKF
Subjt:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF

Query:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------
         IPVIEEL DELH + V+SKLDL+SGYHQIRM+E DIEKTAFRTHEGHY+F+VMPF        FQ+++ +++                           
Subjt:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------

Query:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV
                                              GVEAD +K+++M +WP PK V  LR FLGL GYYRRFV+ YG I AP+TKLL KN F WDE 
Subjt:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV

Query:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------
           AF  LKSAM T+ VL LPD+SL F+IETDASG GL A+L QNS  IA+FSQ LS RAQAKSIYERELM VV                          
Subjt:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------

Query:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG
                     W+TKLLG DFEILYQ  LQNK A ALSR+   +E+                +V +DEELQ +I  L+ NP  + K+    G L+Y G
Subjt:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG

Query:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------
        R+VLSKSSS+IPS+LH +HDS++GGHSGFLRTYKRM+GEL+W GMK D++KYVE CEICQRNK++A KPA + QPLPIP ++ ED TMDF          
Subjt:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------

Query:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN
           MVVVDRLSKY +FV +KHP+ AKQV   F+ +IV+ HG    +   R +     F    F       +      P    +  R    +   LRCFCN
Subjt:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN

Query:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGDE---
        EQP +W +++PWA+LW     + S R             PL  Y            + + +DLA++ALKE+L +AQ+RMKK A+  RR+L+F+VGDE   
Subjt:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGDE---

Query:  ---------------------------------EV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---
                                         EV           IHNVFH+SQLK K+G +  VQ     LT +FELQ+  + V+GIRW+ + GA   
Subjt:  ---------------------------------EV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---

Query:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEKTL----EKESG
                                          +   GIVRP I   YKRRGKK K      E+E+G
Subjt:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEKTL----EKESG

A0A5D3DU86 Ty3/gypsy retrotransposon protein3.5e-22138.5Show/hide
Query:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT
        EE +     +   + +SKYKKLEM +F  EN +SWVY+ E+FFEI  L +A+KVKVAV     D      +   R + E    + T       D G K  
Subjt:  EEIESEQNENAGSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSD------YDVGRAEEEDVRIIST-------DDG-KDT

Query:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQLV-------------------------------
         ARLIRI+QEG+Y DY+KKFV YSAPLP+MAESVL DAF+  L+P ++AEV SR    LE CM  AQLV                               
Subjt:  RARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVLIDAFMNELKPDIRAEVTSR----LERCMREAQLV-------------------------------

Query:  ----GRIR-----------------EKPNRDVKCI-----QRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV
            G +R                 +K    VK +     + RLDKGLCFRCNEK+ HGH CKI+E REL +FI   E+  EE E     + E  E+  +
Subjt:  ----GRIR-----------------EKPNRDVKCI-----QRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVE--LVADMEEAELTVV

Query:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD
           E   +  R++   +  GTMKL+  VK KEV                L +E ++   +  ++G+ IG+G +   +GI         G   V D +V  
Subjt:  EVEENVEVSLRSLLGFSRIGTMKLKEEVKNKEV---------------RLAEELELSFTKMMKYGVVIGNGDAINSQGIHG-------GSLAVVDHVVYD

Query:  RGT-----------TSGTV-------------EGLHLEI---------------------GDNHDFIEEEKKKE---------------DVQLSEMIVTL
         GT           T+GT+             EG  + +                      ++  F+ + ++ E               D +   MI  L
Subjt:  RGT-----------TSGTV-------------EGLHLEI---------------------GDNHDFIEEEKKKE---------------DVQLSEMIVTL

Query:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF
        L    D+FK P  LPPK  IDH I     Q P+N+ PYKY H+QKEEIEKLV E  + G+     S  ++ + L         FCVDYR+LN++TIADKF
Subjt:  LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----SDQAATLTL---------FCVDYRRLNQVTIADKF

Query:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------
         IPVIEEL DELH + V+SKLDL+SGYHQIRM+E DIEKTAFRTHEGHY+F+VMPF        FQ+++ +++                           
Subjt:  LIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEIY---------------------------

Query:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV
                                              GVEAD +K+++M +WP PK V  LR FLGL GYYRRFV+ YG I AP+TKLL KN F WDE 
Subjt:  -------------------------------------IGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEV

Query:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------
           AF  LKSAM T+ VL LPD+SL F+IETDASG GL A+L QNS  IA+FSQ LS RAQAKSIYERELM VV                          
Subjt:  QVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVV--------------------------

Query:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG
                     W+TKLLG DFEILYQ  LQNK A ALSR+   +E+                +V +DEELQ +I  L+ NP  + K+    G L+Y G
Subjt:  -------------WMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEI---------------HKVAEDEELQKIITLLKENPEGKSKFQWLQGMLLYLG

Query:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------
        R+VLSKSSS+IPS+LH +HDS++GGHSGFLRTYKRM+GEL+W GMK D++KYVE CEICQRNK++A KPA + QPLPIP ++ ED TMDF          
Subjt:  RLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----------

Query:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN
           MVVVDRLSKY +FV +KHP+ AKQV   F+ +IV+ HG    +   R +     F    F       +      P    +  R    +   LRCFCN
Subjt:  ---MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGE---RHLNIVSPFVHRRMRRPRGSI-IVLRCFCN

Query:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGDE---
        EQP +W +++PWA+LW     + S R             PL  Y            + + +DLA++ALKE+L +AQ+RMKK A+  RR+L+F+VGDE   
Subjt:  EQPLRWSKWLPWAKLW----LYMSDR-------------PLRYY----------RMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGDE---

Query:  ---------------------------------EV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---
                                         EV           IHNVFH+SQLK K+G +  VQ     LT +FELQ+  + V+GIRW+ + GA   
Subjt:  ---------------------------------EV-----------IHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVIGIRWNGQ-GA---

Query:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEK
                                          +   GIVRP I   YKRRGKK K
Subjt:  ----------------------------------IREGGIVRPLIQYTYKRRGKKEK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.2e-3428.61Show/hide
Query:  HNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----------------SDQAATLTLF--CVDYRRLNQVTIADKFLIPVIEELFDELHRSV
        H IN   + P  +   Y Y    ++E+E  +++    G+                    A+    F   +DYR+LN++T+ D+  IP ++E+  +L R  
Subjt:  HNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGL-----------------SDQAATLTLF--CVDYRRLNQVTIADKFLIPVIEELFDELHRSV

Query:  VYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEI-----------YI-------------------------------
         ++ +DL  G+HQI M    + KTAF T  GHY++L MPF        FQ  + +I           Y+                               
Subjt:  VYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQAILEEI-----------YI-------------------------------

Query:  ----------------------GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKN------NFDWDEVQVEAFGKLKS
                              G++ + EKI A++++P P    E++ FLGL GYYR+F+ N+  I  P+TK L KN      N ++D     AF KLK 
Subjt:  ----------------------GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKN------NFDWDEVQVEAFGKLKS

Query:  AMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVVWMTK-----LLGNDFEI
         +    +L +PDF+  F + TDAS   L A+L Q+   ++Y S+ L+E     S  E+EL+ +VW TK     LLG  FEI
Subjt:  AMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVVWMTK-----LLGNDFEI

P20825 Retrovirus-related Pol polyprotein from transposon 2974.3e-3529.75Show/hide
Query:  LLKNNRDL-FKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGLSDQA-------------------ATLTLFCVDYRRLNQ
        LL   R+L +K   KL     I H +N + +  P+    Y      + E+E  V+E    GL  ++                   A      +DYR+LN+
Subjt:  LLKNNRDL-FKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGLSDQA-------------------ATLTLFCVDYRRLNQ

Query:  VTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQ----------------AILEEIYI---
        +TI D++ IP ++E+  +L +   ++ +DL  G+HQI M E  I KTAF T  GHY++L MPF        FQ                  L++I I   
Subjt:  VTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD----ELDFQ----------------AILEEIYI---

Query:  ---------------------------------------------GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHK-
                                                     G++ +  K++A+  +P P    E+R FLGL GYYR+F+ NY  I  P+T  L K 
Subjt:  ---------------------------------------------GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHK-

Query:  NNFDWDEVQ-VEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVVWMTK-----LLGNDFEI
           D  +++ +EAF KLK+ ++   +L LPDF   FV+ TDAS   L A+L QN   I++ S+ L++     S  E+EL+ +VW TK     LLG  F I
Subjt:  NNFDWDEVQ-VEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVVWMTK-----LLGNDFEI

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.0e-4125.57Show/hide
Query:  IDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEE--------------SYKLGLSDQAATLTLFCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYS
        + H+I I        + PY    K ++EI K+V++              S  + L  +       CVDYR LN+ TI+D F +P I+ L   +  + +++
Subjt:  IDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEE--------------SYKLGLSDQAATLTLFCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYS

Query:  KLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD-----------------ELDF-QAILEEIYIGVEADNE------------------------
         LDL SGYHQI M+  D  KTAF T  G Y++ VMPF                  +L F    L++I I  E+  E                        
Subjt:  KLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD-----------------ELDF-QAILEEIYIGVEADNE------------------------

Query:  ------------------------KIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLP
                                K  A++ +P PK+V + + FLG+I YYRRF+ N   I  P+ +L   +   W E Q +A  KLK+A+    VL   
Subjt:  ------------------------KIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLP

Query:  DFSLHFVIETDASGFGLEAML--VQNSKR----IAYFSQALSERAQAKSIYERELMVVV---------------------------------------WM
        +   ++ + TDAS  G+ A+L  V N  +    + YFS++L    +     E EL+ ++                                       W+
Subjt:  DFSLHFVIETDASGFGLEAML--VQNSKR----IAYFSQALSERAQAKSIYERELMVVV---------------------------------------WM

Query:  TKLLGNDFEILYQLRLQNKPAYALSRV----------PPLLEIHK--VAEDEELQKIITLLKE------NPEGKSKFQWLQ-----------------GM
          L   DF + Y    +N  A A+SR           P   E  K     D     ++  +KE       PE  S F+  Q                  M
Subjt:  TKLLGNDFEILYQLRLQNKPAYALSRV----------PPLLEIHK--VAEDEELQKIITLLKE------NPEGKSKFQWLQ-----------------GM

Query:  LLYLGRLVLSKSSSLIPSILHMYHD-SVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----
        + Y  RLV+        +++ +YHD ++ GGH G   T  +++   YW  ++  + +Y+ +C  CQ  K+   +   L QPLPI    W D++MDF    
Subjt:  LLYLGRLVLSKSSSLIPSILHMYHD-SVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----

Query:  ----------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTR
                  +VVVDR SK  HF+  +    A Q++++  + I  +HGF   ++  R
Subjt:  ----------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus6.9e-3327.84Show/hide
Query:  VDYRRLNQVTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFDELD----FQAILEEI-------------
        VD++RLN VTI D + IP I      L  +  ++ LDL SG+HQI MKE DI KTAF T  G Y+FL +PF   +    FQ ++++I             
Subjt:  VDYRRLNQVTIADKFLIPVIEELFDELHRSVVYSKLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFDELD----FQAILEEI-------------

Query:  -----------------------------------------------YI----GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAP
                                                       YI    G++AD +K+RA+   PPP SV EL+ FLG+  YYR+F+Q+Y  +  P
Subjt:  -----------------------------------------------YI----GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAP

Query:  VTKLLH------------KNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQN----SKRIAYFSQALSERAQAKSIYERE
        +T L              K     DE  +++F  LKS + +  +L  P F+  F + TDAS + + A+L Q+     + IAY S++L++  +  +  E+E
Subjt:  VTKLLH------------KNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQN----SKRIAYFSQALSERAQAKSIYERE

Query:  LMVVVW-----------------------MTKLLGN-----------------DFEILYQLRLQNKPAYALSRVPPLLEIHKVAEDEELQKIITLLKENP
        ++ ++W                       +T  LGN                 + E++Y+    N  A ALSR+PP           +L ++ T L  NP
Subjt:  LMVVVW-----------------------MTKLLGN-----------------DFEILYQLRLQNKPAYALSRVPPLLEIHKVAEDEELQKIITLLKENP

Query:  EGKSKFQWLQGMLLYLGRLVLSKSSSLIPSI
        E   +        L      L  SS LIP +
Subjt:  EGKSKFQWLQGMLLYLGRLVLSKSSSLIPSI

Q99315 Transposon Ty3-G Gag-Pol polyprotein4.0e-4125.27Show/hide
Query:  IDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEE--------------SYKLGLSDQAATLTLFCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYS
        + H+I I        + PY    K ++EI K+V++              S  + L  +       CVDYR LN+ TI+D F +P I+ L   +  + +++
Subjt:  IDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEE--------------SYKLGLSDQAATLTLFCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYS

Query:  KLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD-----------------ELDF-QAILEEIYIGVEADNE------------------------
         LDL SGYHQI M+  D  KTAF T  G Y++ VMPF                  +L F    L++I I  E+  E                        
Subjt:  KLDLRSGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFD-----------------ELDF-QAILEEIYIGVEADNE------------------------

Query:  ------------------------KIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLP
                                K  A++ +P PK+V + + FLG+I YYRRF+ N   I  P+ +L   +   W E Q +A  KLK A+    VL   
Subjt:  ------------------------KIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLP

Query:  DFSLHFVIETDASGFGLEAML--VQNSKR----IAYFSQALSERAQAKSIYERELMVVV---------------------------------------WM
        +   ++ + TDAS  G+ A+L  V N  +    + YFS++L    +     E EL+ ++                                       W+
Subjt:  DFSLHFVIETDASGFGLEAML--VQNSKR----IAYFSQALSERAQAKSIYERELMVVV---------------------------------------WM

Query:  TKLLGNDFEILYQLRLQNKPAYALSRVPPLL--EIHKVAEDEELQ----------------KIITLLKENPEGKSKFQWLQ-----------------GM
          L   DF + Y    +N  A A+SR    +  E  +  + E  +                K +T     PE  S F+  Q                  M
Subjt:  TKLLGNDFEILYQLRLQNKPAYALSRVPPLL--EIHKVAEDEELQ----------------KIITLLKENPEGKSKFQWLQ-----------------GM

Query:  LLYLGRLVLSKSSSLIPSILHMYHD-SVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----
        + Y  RLV+        +++ +YHD ++ GGH G   T  +++   YW  ++  + +Y+ +C  CQ  K+   +   L QPLPI    W D++MDF    
Subjt:  LLYLGRLVLSKSSSLIPSILHMYHD-SVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPLPIPYQVWEDLTMDF----

Query:  ----------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTR
                  +VVVDR SK  HF+  +    A Q++++  + I  +HGF   ++  R
Subjt:  ----------MVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.1e-2257.78Show/hide
Query:  GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFV
        GV AD  K+ AM  WP PK+  ELR FLGL GYYRRFV+NYG IV P+T+LL KN+  W E+   AF  LK A+ T+ VL LPD  L FV
Subjt:  GVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQVEAFGKLKSAMMTVRVLTLPDFSLHFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATGTCCGGAAAAGGGAAAGGTTAAGAGAGAAAGTGAAAGGTTCAGAGAGAAAGCGAACCTTCAGAGTGAAATCGAAAGGGTCGGAGAGAAAGCGAAAGCGACAAG
GTTGGAAGAGGGAAAATGGGGACGTTCAGTGAGAAAGCGGGAAATTTTGAGAGAGAAAGCGAAAGGTTTTCGGGAGAGTAAAATTGGTATTTCACGTGTCCAAAAGGTCA
CTTATGGTAAAGTTTTCGGGAGTGGATCATCTATAGATAAGTTTTTAATTTTTGGATCATTTGATGCAATTTTGGGTGAAGAAATCGAATCAGAACAGAATGAGAATGCT
GGTAGTTTAAGTAAGAGCAAATATAAGAAGCTTGAAATGTCGATCTTCACCGAGGAGAATCTGAAATCGTGGGTCTACAAAGTAGAATATTTTTTTGAAATACGCGAGTT
GAGTAAGGCTAAAAAGGTTAAGGTGGCTGTACCACACCTGAAATCTGATTATGACGTGGGAAGAGCTGAAGAAGAGGATGTTCGAATAATTTCAACCGACGATGGAAAGG
ACACTAGGGCGAGATTAATCAGGATTAAGCAAGAAGGAACGTACATCGACTACCTGAAGAAATTCGTGGCGTACTCAGCTCCACTGCCGAACATGGCGGAGAGTGTGCTA
ATCGACGCCTTTATGAATGAGTTGAAGCCGGATATTAGGGCAGAGGTGACGAGCAGGCTAGAAAGGTGTATGAGGGAGGCACAATTAGTGGGCCGAATAAGGGAGAAGCC
TAATCGGGATGTAAAGTGCATACAGAGAAGGCTGGATAAGGGGCTGTGTTTTCGTTGTAATGAAAAATTTCACCATGGACATTGGTGTAAAATTCGTGAGAATAGGGAAC
TCAACATTTTCATCGCCAGAGGAGAGAAGGGCGAAGAGGAAGTTGAATTGGTAGCCGACATGGAAGAAGCTGAATTAACGGTAGTTGAAGTGGAAGAAAACGTAGAGGTA
TCGTTAAGGTCTCTGTTGGGATTTTCGAGAATTGGGACCATGAAACTCAAAGAGGAGGTGAAGAACAAAGAGGTGAGGCTGGCGGAAGAGCTGGAATTATCGTTCACTAA
AATGATGAAGTACGGTGTGGTGATTGGAAACGGGGATGCCATTAACAGTCAAGGAATTCATGGGGGTTCATTGGCCGTTGTTGACCATGTTGTTTACGATAGGGGAACCA
CAAGTGGCACTGTAGAGGGACTCCATCTGGAAATCGGAGATAACCACGACTTTATCGAAGAGGAAAAGAAAAAAGAAGATGTACAATTGTCGGAGATGATAGTAACACTA
CTCAAGAACAATCGAGATTTGTTCAAAGTGCCAAGGAAGCTTCCCCCAAAGTGGGACATAGACCACAACATTAATATCAGTGAAGATCAACCCCCAATGAACATTTGTCC
TTATAAATATGAGCATAAACAAAAGGAAGAGATTGAAAAACTCGTGGAGGAAAGCTACAAGTTGGGATTATCCGACCAAGCTGCAACCCTTACTCTATTTTGTGTTGACT
ATCGAAGGTTAAATCAAGTAACGATTGCTGATAAGTTTCTGATCCCAGTAATTGAGGAGTTATTTGATGAATTACATAGGTCTGTCGTGTACTCCAAACTAGATCTAAGG
TCTGGATATCACCAAATCCGAATGAAGGAAATGGACATCGAGAAAACAGCGTTTCGAACACACGAGGGACATTACAAGTTCTTAGTCATGCCTTTTGATGAATTAGATTT
TCAAGCCATTCTTGAGGAGATTTATATTGGGGTTGAGGCCGACAATGAAAAGATCAGAGCTATGAAGAGGTGGCCACCACCCAAATCTGTTTTTGAGCTGAGAGAATTTC
TGGGGTTGATAGGGTATTATAGGAGGTTTGTTCAAAACTATGGAGTAATAGTAGCCCCAGTAACCAAATTACTTCACAAGAACAACTTCGATTGGGATGAGGTACAAGTG
GAAGCCTTTGGAAAGCTTAAGAGTGCAATGATGACTGTTCGAGTGTTGACTCTTCCCGATTTCTCCCTCCATTTTGTGATCGAGACAGATGCATCTGGATTTGGTTTAGA
GGCAATGTTAGTGCAGAATTCGAAGCGTATAGCTTATTTTAGTCAGGCCTTGTCAGAGAGAGCTCAAGCGAAATCCATCTATGAACGTGAGCTAATGGTCGTGGTGTGGA
TGACAAAGTTATTGGGAAATGATTTTGAAATTTTGTACCAACTTAGACTTCAAAATAAGCCGGCTTATGCCTTATCAAGGGTGCCACCATTGCTTGAAATTCATAAAGTG
GCAGAGGATGAAGAGCTGCAAAAGATCATCACCCTACTCAAAGAGAACCCGGAAGGAAAGTCGAAATTCCAATGGTTGCAAGGAATGTTATTGTATTTAGGGAGGTTAGT
TTTATCTAAATCCTCTTCACTTATCCCTTCGATTTTACACATGTATCATGATTCGGTGATGGGAGGACACTCAGGTTTTTTGAGGACATATAAAAGAATGGCAGGAGAGC
TCTACTGGTCAGGTATGAAAACAGATGTCAGGAAATATGTCGAGAGTTGTGAAATTTGTCAGAGGAACAAGACTGACGCGCTCAAACCAGCAAGTCTTTTCCAACCATTA
CCCATACCCTACCAGGTCTGGGAAGATCTCACTATGGACTTTATGGTGGTGGTGGATCGTCTAAGTAAGTACAACCATTTTGTGTTGTTAAAGCATCCTTATTTTGCCAA
ACAGGTTGTCGAGGTCTTTGTACAAGAAATAGTCAAACACCATGGGTTCCAAAATCCATTATCTCAAACACGAATAAGATTTTCCTCAGTAACTTTTGGAGGGAGTTATT
TTCGACATGGGGAACGACACTTAAACATAGTATCGCCTTTCGTCCACAGACGGATGCGCAGACCGAGAGGGTCAATCATTGTGTTGAGATGCTTTTGTAATGAGCAGCCG
CTGAGATGGAGCAAGTGGTTACCCTGGGCGAAATTATGGTTGTATATGAGCGACCGCCCCCTCCGATATTATCGTATGGTGAGAAGAAGACAAGACTTAGCCCTGAACGC
CTTAAAGGAGCACCTGCGGGTAGCTCAAGACAGGATGAAGAAACAAGCTAATCAATCAAGGAGAGACCTAGAGTTTGAAGTGGGAGATGAGGAAGTTATTCACAATGTGT
TCCATGTTTCCCAACTTAAGAAGAAGATAGGTCAAAGGGTTCAAGTCCAACACCATCCTTTAGTGTTGACTAGACAGTTCGAGTTGCAGGTAATACGAAAGGAAGTCATT
GGAATCCGTTGGAACGGACAAGGTGCAATTCGCGAAGGGGGTATCGTTAGACCTCTAATTCAATATACATACAAGAGAAGGGGTAAAAAGGAGAAAACACTTGAAAAGGA
ATCAGGGATGAAATCCAAAGGTATTGTTCCTTTGCCTGAGACCTTATTGAATGTTATTAATGAGATTATTGATCTTAGTGGTATTAGTACTAATAGTGAAGCTAGGGTTT
CTGGTATTGAGATTGGGTCTGGTACATCTGTAGATGTTGATAATGTTTATGTGGTTTATAAATATGTTGATAAAATTGTGCAAAGTCAGAAAGGTAGTAGTTTAAATGAC
GTGGTTAAAAATGGTGTAGCTTCTACTTCCTCATCTTCTCTTGATGTTACTGTCATTGTATCTGATACAAATGTTGGTTTGCCATCTGGTGTGGTAAATGTTGAGTTACA
TGCTAAAGATTCGTCTATTAGTTTAAGCCCTTGCCCAAAGATATTCAACAAAAAAATGATCATCTCGTCAGGAGTCCTGGAGAGGTTTGAGTTAGATGATAATGTCCCTA
CAAGTGTCTTGAGCAGAAGAGGAAGAAGAAAATTATATACTGATCAAGTTGCTTCTGAAAATGAATATCATGAATCTGATGATTTTATGGGTGATGATTCATTGTGTGAT
GATGATAGCAGGAATTTGTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATGTCCGGAAAAGGGAAAGGTTAAGAGAGAAAGTGAAAGGTTCAGAGAGAAAGCGAACCTTCAGAGTGAAATCGAAAGGGTCGGAGAGAAAGCGAAAGCGACAAG
GTTGGAAGAGGGAAAATGGGGACGTTCAGTGAGAAAGCGGGAAATTTTGAGAGAGAAAGCGAAAGGTTTTCGGGAGAGTAAAATTGGTATTTCACGTGTCCAAAAGGTCA
CTTATGGTAAAGTTTTCGGGAGTGGATCATCTATAGATAAGTTTTTAATTTTTGGATCATTTGATGCAATTTTGGGTGAAGAAATCGAATCAGAACAGAATGAGAATGCT
GGTAGTTTAAGTAAGAGCAAATATAAGAAGCTTGAAATGTCGATCTTCACCGAGGAGAATCTGAAATCGTGGGTCTACAAAGTAGAATATTTTTTTGAAATACGCGAGTT
GAGTAAGGCTAAAAAGGTTAAGGTGGCTGTACCACACCTGAAATCTGATTATGACGTGGGAAGAGCTGAAGAAGAGGATGTTCGAATAATTTCAACCGACGATGGAAAGG
ACACTAGGGCGAGATTAATCAGGATTAAGCAAGAAGGAACGTACATCGACTACCTGAAGAAATTCGTGGCGTACTCAGCTCCACTGCCGAACATGGCGGAGAGTGTGCTA
ATCGACGCCTTTATGAATGAGTTGAAGCCGGATATTAGGGCAGAGGTGACGAGCAGGCTAGAAAGGTGTATGAGGGAGGCACAATTAGTGGGCCGAATAAGGGAGAAGCC
TAATCGGGATGTAAAGTGCATACAGAGAAGGCTGGATAAGGGGCTGTGTTTTCGTTGTAATGAAAAATTTCACCATGGACATTGGTGTAAAATTCGTGAGAATAGGGAAC
TCAACATTTTCATCGCCAGAGGAGAGAAGGGCGAAGAGGAAGTTGAATTGGTAGCCGACATGGAAGAAGCTGAATTAACGGTAGTTGAAGTGGAAGAAAACGTAGAGGTA
TCGTTAAGGTCTCTGTTGGGATTTTCGAGAATTGGGACCATGAAACTCAAAGAGGAGGTGAAGAACAAAGAGGTGAGGCTGGCGGAAGAGCTGGAATTATCGTTCACTAA
AATGATGAAGTACGGTGTGGTGATTGGAAACGGGGATGCCATTAACAGTCAAGGAATTCATGGGGGTTCATTGGCCGTTGTTGACCATGTTGTTTACGATAGGGGAACCA
CAAGTGGCACTGTAGAGGGACTCCATCTGGAAATCGGAGATAACCACGACTTTATCGAAGAGGAAAAGAAAAAAGAAGATGTACAATTGTCGGAGATGATAGTAACACTA
CTCAAGAACAATCGAGATTTGTTCAAAGTGCCAAGGAAGCTTCCCCCAAAGTGGGACATAGACCACAACATTAATATCAGTGAAGATCAACCCCCAATGAACATTTGTCC
TTATAAATATGAGCATAAACAAAAGGAAGAGATTGAAAAACTCGTGGAGGAAAGCTACAAGTTGGGATTATCCGACCAAGCTGCAACCCTTACTCTATTTTGTGTTGACT
ATCGAAGGTTAAATCAAGTAACGATTGCTGATAAGTTTCTGATCCCAGTAATTGAGGAGTTATTTGATGAATTACATAGGTCTGTCGTGTACTCCAAACTAGATCTAAGG
TCTGGATATCACCAAATCCGAATGAAGGAAATGGACATCGAGAAAACAGCGTTTCGAACACACGAGGGACATTACAAGTTCTTAGTCATGCCTTTTGATGAATTAGATTT
TCAAGCCATTCTTGAGGAGATTTATATTGGGGTTGAGGCCGACAATGAAAAGATCAGAGCTATGAAGAGGTGGCCACCACCCAAATCTGTTTTTGAGCTGAGAGAATTTC
TGGGGTTGATAGGGTATTATAGGAGGTTTGTTCAAAACTATGGAGTAATAGTAGCCCCAGTAACCAAATTACTTCACAAGAACAACTTCGATTGGGATGAGGTACAAGTG
GAAGCCTTTGGAAAGCTTAAGAGTGCAATGATGACTGTTCGAGTGTTGACTCTTCCCGATTTCTCCCTCCATTTTGTGATCGAGACAGATGCATCTGGATTTGGTTTAGA
GGCAATGTTAGTGCAGAATTCGAAGCGTATAGCTTATTTTAGTCAGGCCTTGTCAGAGAGAGCTCAAGCGAAATCCATCTATGAACGTGAGCTAATGGTCGTGGTGTGGA
TGACAAAGTTATTGGGAAATGATTTTGAAATTTTGTACCAACTTAGACTTCAAAATAAGCCGGCTTATGCCTTATCAAGGGTGCCACCATTGCTTGAAATTCATAAAGTG
GCAGAGGATGAAGAGCTGCAAAAGATCATCACCCTACTCAAAGAGAACCCGGAAGGAAAGTCGAAATTCCAATGGTTGCAAGGAATGTTATTGTATTTAGGGAGGTTAGT
TTTATCTAAATCCTCTTCACTTATCCCTTCGATTTTACACATGTATCATGATTCGGTGATGGGAGGACACTCAGGTTTTTTGAGGACATATAAAAGAATGGCAGGAGAGC
TCTACTGGTCAGGTATGAAAACAGATGTCAGGAAATATGTCGAGAGTTGTGAAATTTGTCAGAGGAACAAGACTGACGCGCTCAAACCAGCAAGTCTTTTCCAACCATTA
CCCATACCCTACCAGGTCTGGGAAGATCTCACTATGGACTTTATGGTGGTGGTGGATCGTCTAAGTAAGTACAACCATTTTGTGTTGTTAAAGCATCCTTATTTTGCCAA
ACAGGTTGTCGAGGTCTTTGTACAAGAAATAGTCAAACACCATGGGTTCCAAAATCCATTATCTCAAACACGAATAAGATTTTCCTCAGTAACTTTTGGAGGGAGTTATT
TTCGACATGGGGAACGACACTTAAACATAGTATCGCCTTTCGTCCACAGACGGATGCGCAGACCGAGAGGGTCAATCATTGTGTTGAGATGCTTTTGTAATGAGCAGCCG
CTGAGATGGAGCAAGTGGTTACCCTGGGCGAAATTATGGTTGTATATGAGCGACCGCCCCCTCCGATATTATCGTATGGTGAGAAGAAGACAAGACTTAGCCCTGAACGC
CTTAAAGGAGCACCTGCGGGTAGCTCAAGACAGGATGAAGAAACAAGCTAATCAATCAAGGAGAGACCTAGAGTTTGAAGTGGGAGATGAGGAAGTTATTCACAATGTGT
TCCATGTTTCCCAACTTAAGAAGAAGATAGGTCAAAGGGTTCAAGTCCAACACCATCCTTTAGTGTTGACTAGACAGTTCGAGTTGCAGGTAATACGAAAGGAAGTCATT
GGAATCCGTTGGAACGGACAAGGTGCAATTCGCGAAGGGGGTATCGTTAGACCTCTAATTCAATATACATACAAGAGAAGGGGTAAAAAGGAGAAAACACTTGAAAAGGA
ATCAGGGATGAAATCCAAAGGTATTGTTCCTTTGCCTGAGACCTTATTGAATGTTATTAATGAGATTATTGATCTTAGTGGTATTAGTACTAATAGTGAAGCTAGGGTTT
CTGGTATTGAGATTGGGTCTGGTACATCTGTAGATGTTGATAATGTTTATGTGGTTTATAAATATGTTGATAAAATTGTGCAAAGTCAGAAAGGTAGTAGTTTAAATGAC
GTGGTTAAAAATGGTGTAGCTTCTACTTCCTCATCTTCTCTTGATGTTACTGTCATTGTATCTGATACAAATGTTGGTTTGCCATCTGGTGTGGTAAATGTTGAGTTACA
TGCTAAAGATTCGTCTATTAGTTTAAGCCCTTGCCCAAAGATATTCAACAAAAAAATGATCATCTCGTCAGGAGTCCTGGAGAGGTTTGAGTTAGATGATAATGTCCCTA
CAAGTGTCTTGAGCAGAAGAGGAAGAAGAAAATTATATACTGATCAAGTTGCTTCTGAAAATGAATATCATGAATCTGATGATTTTATGGGTGATGATTCATTGTGTGAT
GATGATAGCAGGAATTTGTCGTGA
Protein sequenceShow/hide protein sequence
MKCPEKGKVKRESERFREKANLQSEIERVGEKAKATRLEEGKWGRSVRKREILREKAKGFRESKIGISRVQKVTYGKVFGSGSSIDKFLIFGSFDAILGEEIESEQNENA
GSLSKSKYKKLEMSIFTEENLKSWVYKVEYFFEIRELSKAKKVKVAVPHLKSDYDVGRAEEEDVRIISTDDGKDTRARLIRIKQEGTYIDYLKKFVAYSAPLPNMAESVL
IDAFMNELKPDIRAEVTSRLERCMREAQLVGRIREKPNRDVKCIQRRLDKGLCFRCNEKFHHGHWCKIRENRELNIFIARGEKGEEEVELVADMEEAELTVVEVEENVEV
SLRSLLGFSRIGTMKLKEEVKNKEVRLAEELELSFTKMMKYGVVIGNGDAINSQGIHGGSLAVVDHVVYDRGTTSGTVEGLHLEIGDNHDFIEEEKKKEDVQLSEMIVTL
LKNNRDLFKVPRKLPPKWDIDHNINISEDQPPMNICPYKYEHKQKEEIEKLVEESYKLGLSDQAATLTLFCVDYRRLNQVTIADKFLIPVIEELFDELHRSVVYSKLDLR
SGYHQIRMKEMDIEKTAFRTHEGHYKFLVMPFDELDFQAILEEIYIGVEADNEKIRAMKRWPPPKSVFELREFLGLIGYYRRFVQNYGVIVAPVTKLLHKNNFDWDEVQV
EAFGKLKSAMMTVRVLTLPDFSLHFVIETDASGFGLEAMLVQNSKRIAYFSQALSERAQAKSIYERELMVVVWMTKLLGNDFEILYQLRLQNKPAYALSRVPPLLEIHKV
AEDEELQKIITLLKENPEGKSKFQWLQGMLLYLGRLVLSKSSSLIPSILHMYHDSVMGGHSGFLRTYKRMAGELYWSGMKTDVRKYVESCEICQRNKTDALKPASLFQPL
PIPYQVWEDLTMDFMVVVDRLSKYNHFVLLKHPYFAKQVVEVFVQEIVKHHGFQNPLSQTRIRFSSVTFGGSYFRHGERHLNIVSPFVHRRMRRPRGSIIVLRCFCNEQP
LRWSKWLPWAKLWLYMSDRPLRYYRMVRRRQDLALNALKEHLRVAQDRMKKQANQSRRDLEFEVGDEEVIHNVFHVSQLKKKIGQRVQVQHHPLVLTRQFELQVIRKEVI
GIRWNGQGAIREGGIVRPLIQYTYKRRGKKEKTLEKESGMKSKGIVPLPETLLNVINEIIDLSGISTNSEARVSGIEIGSGTSVDVDNVYVVYKYVDKIVQSQKGSSLND
VVKNGVASTSSSSLDVTVIVSDTNVGLPSGVVNVELHAKDSSISLSPCPKIFNKKMIISSGVLERFELDDNVPTSVLSRRGRRKLYTDQVASENEYHESDDFMGDDSLCD
DDSRNLS