; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G011000 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G011000
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionTy3/gypsy retrotransposon protein
Genome locationCG_Chr05:12731052..12735522
RNA-Seq ExpressionClCG05G011000
SyntenyClCG05G011000
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039128.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-23339.23Show/hide
Query:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW
        MAQ+Q+EE++E  EK + S+KE+LL + K +++LTDE++E+   +K  ES TTS+GS +K+KGKM E            +  KYKKLEM +F GEN +SW
Subjt:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW

Query:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL
        V             E EKVKV VVSF +DEV+W+ WSH R    +WE+                    LI I+Q+ +Y +YVKKFV+YSA LP M ESVL
Subjt:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL

Query:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN
        +D F+ GL+P ++ E ++         M+E + V                                                         KGE   +R 
Subjt:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN

Query:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEE-------------EIKLKANTED--IEL----RVVDKRT-----W--------
         ++      D G      +  S G      E REL  FI   EE++E             E+K    TED  IEL    R+  KRT     W        
Subjt:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEE-------------EIKLKANTED--IEL----RVVDKRT-----W--------

Query:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM
         +  GAT+NFI+Q LA +L+L   +  ++G  IGN      +G+C+ V + L  + + A+   ++LG V     M +        +HWP LTMSF  G  
Subjt:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM

Query:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEME------------GNQDMLVKSKRYLKKHGCVQSTDR-ASSKRETDCHI-TIVNGQP---
        Q+ILKGDPS  +++ SL+   K W +EDQGFL+E+ ++E+E            G++  +   +  L+++  + +T +    KR+ D  I T+ N +P   
Subjt:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEME------------GNQDMLVKSKRYLKKHGCVQSTDR-ASSKRETDCHI-TIVNGQP---

Query:  --------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------
                 K EIEKLV EMLQ GIIRPS+SPYS+PVLLV+KKD                  DKFPI VI ELLDEL G+                    
Subjt:  --------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------

Query:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRD------------------------
                   GHYEFLVMPFGLTNAPAT QSLMNQ+FK FLRR +LVFF DIL Y SDIT HK+HL +VF VLRD                        
Subjt:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRD------------------------

Query:  ---------------------SDFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--
                             SD  EL GFLGLTG+YRRFV+ Y  IA PLTKLL KN+F WNEEA  AF RLK AM ++PVL +P+++LPF IETDA  
Subjt:  ---------------------SDFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--

Query:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A
                                     +   ELM VVL++Q+W++YLLG KF+++SDQKALKFL+EQRE                            A
Subjt:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A

Query:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR
        AD L R+ P +E+H +T   ++D+ V+  EV++DEELQKI+A LK++ E   KF+W  G+LLYK RL                           LRTYKR
Subjt:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR

Query:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------
        M+GELYW GMK D++KYV+  E CQRNK E+ K AG+LQPLPIP ++ E+ T+DFIEGLPK  G + IMVV   L   ++  +LKH              
Subjt:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------

Query:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------
            H  P SIISDRDKIFLSNFW+                       TERVN+C+ETYLRCFCNEQP  W +++PW                       
Subjt:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------

Query:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L
                   KKTTN+ V   L  RD ALNALKE+L +AQNRMKK AD  RRE++ +VG+EV LKL+PYRQRSLA+K+                    +
Subjt:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L

Query:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTERQ
        AYRLDLPPE  IHNVFH SQLK K+                FELQL PE ++GI WN EL A EWL+KW+G  E +
Subjt:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTERQ

KAA0055376.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.4e-23138.6Show/hide
Query:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW
        MAQ+Q+EE++E  EK + S+KE+LL + K +E+L DE++E+   +K  ES TTS+GS +K+KGKM E            + +KYKKLEM +F GEN ESW
Subjt:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW

Query:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL
        V             E EKVKV VVSF +DEV+W+ WSH R    +WE+                    LI I+QE +Y +YVKKFV+YSA LP M ESVL
Subjt:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL

Query:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN
        +D F+ GL+P ++ E ++         M+E + V                                                         KGE   +R 
Subjt:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN

Query:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------
         ++      D G      +  S G      E REL  FI   EE++EE                        I+LK  T       +  + W        
Subjt:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------

Query:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM
         +  GAT+NFI+  LA +LEL       +G  IGN      +G+C+ V + L  + + A+   ++LG V     M +        +HWP LTMSF  G  
Subjt:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM

Query:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG
        Q++LKGDPS  +++ SL+   K W +EDQGFL+E+ ++++            EG++  +   +  L+++  + +T +    KR+ D  I        +N 
Subjt:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG

Query:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------
        +P       K EIEKLV EMLQ G+IRPS+SPYS+PVLLV+KK+                  DKFPI VI ELLDEL G+                    
Subjt:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------

Query:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDS-----------------------
                   GHYEFLVMPFGLTNAPAT QSLMNQ+FK FLRR +LVFFYDILVY  DIT H++HL +VF VLRD+                       
Subjt:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDS-----------------------

Query:  ----------------------DFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--
                              D  EL GFL LTG+YRRFV+ Y  IA PLTKLL KN+F WNE+A  AF RLK AM ++PVLA+P++SLPF IETDA  
Subjt:  ----------------------DFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--

Query:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A
                                     +   ELMAVVL++Q+W++YLLG KF+++SDQKALKFL+EQRE                            A
Subjt:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A

Query:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR
        AD L R+ P +E+H +T   ++D+ V+  EV++DE LQKI+A LK++ E   KF+W  G+LLYK RL                           LRTYKR
Subjt:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR

Query:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------
        M+GELYW GMK D++KYVE  E CQRNK E+ KPAG+LQPLPIP ++ E+ T+DFIEGLPK  G + IMVV   L   ++  +LKH              
Subjt:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------

Query:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------
            H IP SIISDRDKIFLSNFW+                       TERVN+C+ETYLRCFCNEQP  W +++PW                       
Subjt:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------

Query:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L
                   KK TN+ V   L  RD AL+ALKE+L +AQNRMKK AD  RRE++ +VG+EV LKL+PYRQRSLA+K+                    +
Subjt:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L

Query:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE
        AYRLDLPPE  IHNVFH SQLK K+                FELQL PE ++GI WN EL A EWL+KW+G  E
Subjt:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE

KAA0061073.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.9e-23038.67Show/hide
Query:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW
        MAQ+Q+EE++E  EK + S+KE+LL + K +E+L DE++E+   +K  ES TTS+GS +K+KGKM E            +  KYKKLEM +F GEN ESW
Subjt:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW

Query:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL
        V             E EKVKV VVSF +DEV+W+ WSH R    +WE+                    LI I+QE +Y +YVKKFV+YSA LP M ESVL
Subjt:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL

Query:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN
        +D F+ GL+P ++ E ++         M+E + V                                                         KGE   +R 
Subjt:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN

Query:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------
         ++      D G      +  S G      E REL  FI   EE++EE                        I+LK  T       +  + W        
Subjt:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------

Query:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM
         +  GAT+NFI+  LA ELEL       +G  IGN      +G+C+ V + L  + + A+   ++LG V     M +        ++WP LTMSF  G  
Subjt:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM

Query:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG
        Q++LKGDPS  +++ SL+   K W +EDQGFL+E+ ++E+            EG++  +   +  L+++  + +T +    KR+ D  I        +N 
Subjt:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG

Query:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------
        +P       K EIEKLV EMLQ G+IRPS+SPYS+PVLLV+KKD                  DKFPI VI ELLDEL G+                    
Subjt:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------

Query:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRD------------------------
                   GHYEFLVMPFGLTNAPAT QSLMNQ+FK FLRR +LVFF DILVY  DI   ++HL +VF VLRD                        
Subjt:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRD------------------------

Query:  ---------------------SDFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--
                             SD  EL GFLGLTG+YRRFV+ Y  IA PLTKLL KN+F WNE+A  AF RLK AM ++PVLA+P++SLPF IETDA  
Subjt:  ---------------------SDFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--

Query:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A
                                     +   ELMAVVL++Q+W++YLLG KF+++SDQKALKFL+EQRE                            A
Subjt:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A

Query:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR
        AD L R+ P +E+H +T   ++D+ V+  EV++DE LQKI+A LK++ E   KF+W  G+LLYK RL                           LRTYKR
Subjt:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR

Query:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------
        M+GELYW GMK D++KYVE  E CQRNK E+ KPAG+LQPLPIP ++ E+ T+DFIEGLPK  G + IMVV   L   ++  +LKH              
Subjt:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------

Query:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------
            H IP SIISDRDKIFLSNFW+                       TERVN+C+ETYLRCFCNEQP  W +++PW                       
Subjt:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------

Query:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L
                   KKT N+ V   L  RD AL+ALKE+L +AQNRMKK AD  RRE++ +VG+EV LKL+PYRQRSLA+K+                    +
Subjt:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L

Query:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE
        AYRLDLPPE  IHNVFH SQLK K+                FELQL PE ++GI WN EL A EWL+KW+G  E
Subjt:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE

TYJ99303.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.4e-23138.6Show/hide
Query:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW
        MAQ+Q+EE++E  EK + S+KE+LL + K +E+L DE++E+   +K  ES TTS+GS +K+KGKM E            + +KYKKLEM +F GEN ESW
Subjt:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW

Query:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL
        V             E EKVKV VVSF +DEV+W+ WSH R    +WE+                    LI I+QE +Y +YVKKFV+YSA LP M ESVL
Subjt:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL

Query:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN
        +D F+ GL+P ++ E ++         M+E + V                                                         KGE   +R 
Subjt:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN

Query:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------
         ++      D G      +  S G      E REL  FI   EE++EE                        I+LK  T       +  + W        
Subjt:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------

Query:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM
         +  GAT+NFI+  LA +LEL       +G  IGN      +G+C+ V + L  + + A+   ++LG V     M +        +HWP LTMSF  G  
Subjt:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM

Query:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG
        Q++LKGDPS  +++ SL+   K W +EDQGFL+E+ ++++            EG++  +   +  L+++  + +T +    KR+ D  I        +N 
Subjt:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG

Query:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------
        +P       K EIEKLV EMLQ G+IRPS+SPYS+PVLLV+KK+                  DKFPI VI ELLDEL G+                    
Subjt:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------

Query:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDS-----------------------
                   GHYEFLVMPFGLTNAPAT QSLMNQ+FK FLRR +LVFFYDILVY  DIT H++HL +VF VLRD+                       
Subjt:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDS-----------------------

Query:  ----------------------DFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--
                              D  EL GFL LTG+YRRFV+ Y  IA PLTKLL KN+F WNE+A  AF RLK AM ++PVLA+P++SLPF IETDA  
Subjt:  ----------------------DFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--

Query:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A
                                     +   ELMAVVL++Q+W++YLLG KF+++SDQKALKFL+EQRE                            A
Subjt:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A

Query:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR
        AD L R+ P +E+H +T   ++D+ V+  EV++DE LQKI+A LK++ E   KF+W  G+LLYK RL                           LRTYKR
Subjt:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR

Query:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------
        M+GELYW GMK D++KYVE  E CQRNK E+ KPAG+LQPLPIP ++ E+ T+DFIEGLPK  G + IMVV   L   ++  +LKH              
Subjt:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------

Query:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------
            H IP SIISDRDKIFLSNFW+                       TERVN+C+ETYLRCFCNEQP  W +++PW                       
Subjt:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------

Query:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L
                   KK TN+ V   L  RD AL+ALKE+L +AQNRMKK AD  RRE++ +VG+EV LKL+PYRQRSLA+K+                    +
Subjt:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L

Query:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE
        AYRLDLPPE  IHNVFH SQLK K+                FELQL PE ++GI WN EL A EWL+KW+G  E
Subjt:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.1e-22938.78Show/hide
Query:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGESTTSEGSQLKMKGKMVE----------EAEFDKYKKLEMLIFAGENLESWV
        MAQ+Q+EE++E  EK + SLKE++L + K++++L DE+++ +  +K  ES TS+GS +KMKGKM E          + +  KYKKLEM +F GEN ESWV
Subjt:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGESTTSEGSQLKMKGKMVE----------EAEFDKYKKLEMLIFAGENLESWV

Query:  -------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVLI
                     E EKVKV VVSF +DEV+W+ WSH R    +WE+                    LI I+QE +Y+DYVKKFV+YSA LP+M ESVL 
Subjt:  -------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVLI

Query:  DTFMKGLKPDIRVE--------------------------KLAMKELEHVRPNKGEAEAERNRNSNDETSDDSGEGELSE--------------------
        D F+ GL+P ++ E                          KLA  E+  + P + E+   + + +N++      E ++ +                    
Subjt:  DTFMKGLKPDIRVE--------------------------KLAMKELEHVRPNKGEAEAERNRNSNDETSDDSGEGELSE--------------------

Query:  -----------------EGTSDG------ENRELNVFIAGGEE--DEEEIKLKANTEDIEL-------------RVVDKRT------------------W
                         E  S G      E REL +FI   EE  +E E     NTE +E+             R +   T                   
Subjt:  -----------------EGTSDG------ENRELNVFIAGGEE--DEEEIKLKANTEDIEL-------------RVVDKRT------------------W

Query:  LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITL--LRLV---VTANLSPIDLGLVMHY----RIYGVHWPLLTMSFVVGEMQ
        +  GAT+NFI+ +L KE ++   +  ++G+ IG+       G+C  V I L  LR+V   +   L  ID+ L M +        +HWP LTM F     +
Subjt:  LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITL--LRLV---VTANLSPIDLGLVMHY----RIYGVHWPLLTMSFVVGEMQ

Query:  VILKGDPSFTKSKVSLKATSKVWK-EDQGFLIEFQHLEME------------GNQDMLVKSKRYLKKHGCV-QSTDRASSKRETDCHITIVNGQ------
        V+LKGDP+  +++ SLK   K W+ EDQGFL+++Q  E+E            G+++ L   +  L ++  V +S      KR  D  I  + GQ      
Subjt:  VILKGDPSFTKSKVSLKATSKVWK-EDQGFLIEFQHLEME------------GNQDMLVKSKRYLKKHGCV-QSTDRASSKRETDCHITIVNGQ------

Query:  ------PSKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST--------------------
                KEEIEKLV EMLQ GIIRPS SP+S+PVLLV+KKD                  DKFPI VI ELLDEL+G+T                    
Subjt:  ------PSKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST--------------------

Query:  -------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDFFE--------------------
                  GHYEF+VMPFGLTNAPAT QSLMNQ+FK FLRR +LVFF DILVY SDIT H++HL +VF  LRD+  +                     
Subjt:  -------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDFFE--------------------

Query:  -------------------------LMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA---
                                 L GFLGLTG+YRRFV+ YG IAAPLTKLL KN+F W+E A  AFE LK AM ++PVLA+P++SLPF+IETDA   
Subjt:  -------------------------LMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA---

Query:  ----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------AA
                                    +   ELMAVVL++QKW++YLLG +FT++SDQKALKFL+EQRE                            AA
Subjt:  ----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------AA

Query:  DELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKRM
        D L RM   +E+ +L+   ++D++V+ +EVE+DEELQ +I  L+ +   + K+    G L+YK R+                           LRTYKRM
Subjt:  DELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKRM

Query:  TGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVVLQTSS---------------------FRKS
        +GEL+W GMK D++KYVE  E CQRNK E+ KPAG+LQPLPIP ++ E+ T+DFIEGLPK  G + IMVV+   S                       K 
Subjt:  TGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVVLQTSS---------------------FRKS

Query:  LKHHEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW------------------------
        ++ H IPKSIISDRDKIF+SNFWK                       TERVN+C+ETYLRCFCNEQP+ W +++PW                        
Subjt:  LKHHEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW------------------------

Query:  ----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------LA
                  KKT ND V   L  RDLA++ALKE+L +AQNRMKK AD  RRE++F+VGDEV LKLRPYRQRSLA+KR                    +A
Subjt:  ----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------LA

Query:  YRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKG
        YRLDLPPE  IHNVFH SQLK K+               EFELQL PE ++GI W+ EL A EWLVKWKG
Subjt:  YRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKG

TrEMBL top hitse value%identityAlignment
A0A5A7T8C0 Ty3/gypsy retrotransposon protein6.0e-23439.23Show/hide
Query:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW
        MAQ+Q+EE++E  EK + S+KE+LL + K +++LTDE++E+   +K  ES TTS+GS +K+KGKM E            +  KYKKLEM +F GEN +SW
Subjt:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW

Query:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL
        V             E EKVKV VVSF +DEV+W+ WSH R    +WE+                    LI I+Q+ +Y +YVKKFV+YSA LP M ESVL
Subjt:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL

Query:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN
        +D F+ GL+P ++ E ++         M+E + V                                                         KGE   +R 
Subjt:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN

Query:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEE-------------EIKLKANTED--IEL----RVVDKRT-----W--------
         ++      D G      +  S G      E REL  FI   EE++E             E+K    TED  IEL    R+  KRT     W        
Subjt:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEE-------------EIKLKANTED--IEL----RVVDKRT-----W--------

Query:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM
         +  GAT+NFI+Q LA +L+L   +  ++G  IGN      +G+C+ V + L  + + A+   ++LG V     M +        +HWP LTMSF  G  
Subjt:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM

Query:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEME------------GNQDMLVKSKRYLKKHGCVQSTDR-ASSKRETDCHI-TIVNGQP---
        Q+ILKGDPS  +++ SL+   K W +EDQGFL+E+ ++E+E            G++  +   +  L+++  + +T +    KR+ D  I T+ N +P   
Subjt:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEME------------GNQDMLVKSKRYLKKHGCVQSTDR-ASSKRETDCHI-TIVNGQP---

Query:  --------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------
                 K EIEKLV EMLQ GIIRPS+SPYS+PVLLV+KKD                  DKFPI VI ELLDEL G+                    
Subjt:  --------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------

Query:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRD------------------------
                   GHYEFLVMPFGLTNAPAT QSLMNQ+FK FLRR +LVFF DIL Y SDIT HK+HL +VF VLRD                        
Subjt:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRD------------------------

Query:  ---------------------SDFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--
                             SD  EL GFLGLTG+YRRFV+ Y  IA PLTKLL KN+F WNEEA  AF RLK AM ++PVL +P+++LPF IETDA  
Subjt:  ---------------------SDFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--

Query:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A
                                     +   ELM VVL++Q+W++YLLG KF+++SDQKALKFL+EQRE                            A
Subjt:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A

Query:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR
        AD L R+ P +E+H +T   ++D+ V+  EV++DEELQKI+A LK++ E   KF+W  G+LLYK RL                           LRTYKR
Subjt:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR

Query:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------
        M+GELYW GMK D++KYV+  E CQRNK E+ K AG+LQPLPIP ++ E+ T+DFIEGLPK  G + IMVV   L   ++  +LKH              
Subjt:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------

Query:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------
            H  P SIISDRDKIFLSNFW+                       TERVN+C+ETYLRCFCNEQP  W +++PW                       
Subjt:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------

Query:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L
                   KKTTN+ V   L  RD ALNALKE+L +AQNRMKK AD  RRE++ +VG+EV LKL+PYRQRSLA+K+                    +
Subjt:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L

Query:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTERQ
        AYRLDLPPE  IHNVFH SQLK K+                FELQL PE ++GI WN EL A EWL+KW+G  E +
Subjt:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTERQ

A0A5A7UM77 Ty3/gypsy retrotransposon protein2.2e-23138.6Show/hide
Query:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW
        MAQ+Q+EE++E  EK + S+KE+LL + K +E+L DE++E+   +K  ES TTS+GS +K+KGKM E            + +KYKKLEM +F GEN ESW
Subjt:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW

Query:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL
        V             E EKVKV VVSF +DEV+W+ WSH R    +WE+                    LI I+QE +Y +YVKKFV+YSA LP M ESVL
Subjt:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL

Query:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN
        +D F+ GL+P ++ E ++         M+E + V                                                         KGE   +R 
Subjt:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN

Query:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------
         ++      D G      +  S G      E REL  FI   EE++EE                        I+LK  T       +  + W        
Subjt:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------

Query:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM
         +  GAT+NFI+  LA +LEL       +G  IGN      +G+C+ V + L  + + A+   ++LG V     M +        +HWP LTMSF  G  
Subjt:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM

Query:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG
        Q++LKGDPS  +++ SL+   K W +EDQGFL+E+ ++++            EG++  +   +  L+++  + +T +    KR+ D  I        +N 
Subjt:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG

Query:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------
        +P       K EIEKLV EMLQ G+IRPS+SPYS+PVLLV+KK+                  DKFPI VI ELLDEL G+                    
Subjt:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------

Query:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDS-----------------------
                   GHYEFLVMPFGLTNAPAT QSLMNQ+FK FLRR +LVFFYDILVY  DIT H++HL +VF VLRD+                       
Subjt:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDS-----------------------

Query:  ----------------------DFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--
                              D  EL GFL LTG+YRRFV+ Y  IA PLTKLL KN+F WNE+A  AF RLK AM ++PVLA+P++SLPF IETDA  
Subjt:  ----------------------DFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--

Query:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A
                                     +   ELMAVVL++Q+W++YLLG KF+++SDQKALKFL+EQRE                            A
Subjt:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A

Query:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR
        AD L R+ P +E+H +T   ++D+ V+  EV++DE LQKI+A LK++ E   KF+W  G+LLYK RL                           LRTYKR
Subjt:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR

Query:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------
        M+GELYW GMK D++KYVE  E CQRNK E+ KPAG+LQPLPIP ++ E+ T+DFIEGLPK  G + IMVV   L   ++  +LKH              
Subjt:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------

Query:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------
            H IP SIISDRDKIFLSNFW+                       TERVN+C+ETYLRCFCNEQP  W +++PW                       
Subjt:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------

Query:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L
                   KK TN+ V   L  RD AL+ALKE+L +AQNRMKK AD  RRE++ +VG+EV LKL+PYRQRSLA+K+                    +
Subjt:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L

Query:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE
        AYRLDLPPE  IHNVFH SQLK K+                FELQL PE ++GI WN EL A EWL+KW+G  E
Subjt:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE

A0A5A7UYM1 Ty3/gypsy retrotransposon protein2.4e-23038.67Show/hide
Query:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW
        MAQ+Q+EE++E  EK + S+KE+LL + K +E+L DE++E+   +K  ES TTS+GS +K+KGKM E            +  KYKKLEM +F GEN ESW
Subjt:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW

Query:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL
        V             E EKVKV VVSF +DEV+W+ WSH R    +WE+                    LI I+QE +Y +YVKKFV+YSA LP M ESVL
Subjt:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL

Query:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN
        +D F+ GL+P ++ E ++         M+E + V                                                         KGE   +R 
Subjt:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN

Query:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------
         ++      D G      +  S G      E REL  FI   EE++EE                        I+LK  T       +  + W        
Subjt:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------

Query:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM
         +  GAT+NFI+  LA ELEL       +G  IGN      +G+C+ V + L  + + A+   ++LG V     M +        ++WP LTMSF  G  
Subjt:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM

Query:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG
        Q++LKGDPS  +++ SL+   K W +EDQGFL+E+ ++E+            EG++  +   +  L+++  + +T +    KR+ D  I        +N 
Subjt:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG

Query:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------
        +P       K EIEKLV EMLQ G+IRPS+SPYS+PVLLV+KKD                  DKFPI VI ELLDEL G+                    
Subjt:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------

Query:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRD------------------------
                   GHYEFLVMPFGLTNAPAT QSLMNQ+FK FLRR +LVFF DILVY  DI   ++HL +VF VLRD                        
Subjt:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRD------------------------

Query:  ---------------------SDFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--
                             SD  EL GFLGLTG+YRRFV+ Y  IA PLTKLL KN+F WNE+A  AF RLK AM ++PVLA+P++SLPF IETDA  
Subjt:  ---------------------SDFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--

Query:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A
                                     +   ELMAVVL++Q+W++YLLG KF+++SDQKALKFL+EQRE                            A
Subjt:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A

Query:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR
        AD L R+ P +E+H +T   ++D+ V+  EV++DE LQKI+A LK++ E   KF+W  G+LLYK RL                           LRTYKR
Subjt:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR

Query:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------
        M+GELYW GMK D++KYVE  E CQRNK E+ KPAG+LQPLPIP ++ E+ T+DFIEGLPK  G + IMVV   L   ++  +LKH              
Subjt:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------

Query:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------
            H IP SIISDRDKIFLSNFW+                       TERVN+C+ETYLRCFCNEQP  W +++PW                       
Subjt:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------

Query:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L
                   KKT N+ V   L  RD AL+ALKE+L +AQNRMKK AD  RRE++ +VG+EV LKL+PYRQRSLA+K+                    +
Subjt:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L

Query:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE
        AYRLDLPPE  IHNVFH SQLK K+                FELQL PE ++GI WN EL A EWL+KW+G  E
Subjt:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE

A0A5D3BBH7 Ty3/gypsy retrotransposon protein5.3e-23038.78Show/hide
Query:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGESTTSEGSQLKMKGKMVE----------EAEFDKYKKLEMLIFAGENLESWV
        MAQ+Q+EE++E  EK + SLKE++L + K++++L DE+++ +  +K  ES TS+GS +KMKGKM E          + +  KYKKLEM +F GEN ESWV
Subjt:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGESTTSEGSQLKMKGKMVE----------EAEFDKYKKLEMLIFAGENLESWV

Query:  -------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVLI
                     E EKVKV VVSF +DEV+W+ WSH R    +WE+                    LI I+QE +Y+DYVKKFV+YSA LP+M ESVL 
Subjt:  -------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVLI

Query:  DTFMKGLKPDIRVE--------------------------KLAMKELEHVRPNKGEAEAERNRNSNDETSDDSGEGELSE--------------------
        D F+ GL+P ++ E                          KLA  E+  + P + E+   + + +N++      E ++ +                    
Subjt:  DTFMKGLKPDIRVE--------------------------KLAMKELEHVRPNKGEAEAERNRNSNDETSDDSGEGELSE--------------------

Query:  -----------------EGTSDG------ENRELNVFIAGGEE--DEEEIKLKANTEDIEL-------------RVVDKRT------------------W
                         E  S G      E REL +FI   EE  +E E     NTE +E+             R +   T                   
Subjt:  -----------------EGTSDG------ENRELNVFIAGGEE--DEEEIKLKANTEDIEL-------------RVVDKRT------------------W

Query:  LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITL--LRLV---VTANLSPIDLGLVMHY----RIYGVHWPLLTMSFVVGEMQ
        +  GAT+NFI+ +L KE ++   +  ++G+ IG+       G+C  V I L  LR+V   +   L  ID+ L M +        +HWP LTM F     +
Subjt:  LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITL--LRLV---VTANLSPIDLGLVMHY----RIYGVHWPLLTMSFVVGEMQ

Query:  VILKGDPSFTKSKVSLKATSKVWK-EDQGFLIEFQHLEME------------GNQDMLVKSKRYLKKHGCV-QSTDRASSKRETDCHITIVNGQ------
        V+LKGDP+  +++ SLK   K W+ EDQGFL+++Q  E+E            G+++ L   +  L ++  V +S      KR  D  I  + GQ      
Subjt:  VILKGDPSFTKSKVSLKATSKVWK-EDQGFLIEFQHLEME------------GNQDMLVKSKRYLKKHGCV-QSTDRASSKRETDCHITIVNGQ------

Query:  ------PSKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST--------------------
                KEEIEKLV EMLQ GIIRPS SP+S+PVLLV+KKD                  DKFPI VI ELLDEL+G+T                    
Subjt:  ------PSKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST--------------------

Query:  -------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDFFE--------------------
                  GHYEF+VMPFGLTNAPAT QSLMNQ+FK FLRR +LVFF DILVY SDIT H++HL +VF  LRD+  +                     
Subjt:  -------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDFFE--------------------

Query:  -------------------------LMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA---
                                 L GFLGLTG+YRRFV+ YG IAAPLTKLL KN+F W+E A  AFE LK AM ++PVLA+P++SLPF+IETDA   
Subjt:  -------------------------LMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA---

Query:  ----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------AA
                                    +   ELMAVVL++QKW++YLLG +FT++SDQKALKFL+EQRE                            AA
Subjt:  ----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------AA

Query:  DELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKRM
        D L RM   +E+ +L+   ++D++V+ +EVE+DEELQ +I  L+ +   + K+    G L+YK R+                           LRTYKRM
Subjt:  DELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKRM

Query:  TGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVVLQTSS---------------------FRKS
        +GEL+W GMK D++KYVE  E CQRNK E+ KPAG+LQPLPIP ++ E+ T+DFIEGLPK  G + IMVV+   S                       K 
Subjt:  TGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVVLQTSS---------------------FRKS

Query:  LKHHEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW------------------------
        ++ H IPKSIISDRDKIF+SNFWK                       TERVN+C+ETYLRCFCNEQP+ W +++PW                        
Subjt:  LKHHEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW------------------------

Query:  ----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------LA
                  KKT ND V   L  RDLA++ALKE+L +AQNRMKK AD  RRE++F+VGDEV LKLRPYRQRSLA+KR                    +A
Subjt:  ----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------LA

Query:  YRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKG
        YRLDLPPE  IHNVFH SQLK K+               EFELQL PE ++GI W+ EL A EWLVKWKG
Subjt:  YRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKG

A0A5D3BJ50 Ty3/gypsy retrotransposon protein2.2e-23138.6Show/hide
Query:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW
        MAQ+Q+EE++E  EK + S+KE+LL + K +E+L DE++E+   +K  ES TTS+GS +K+KGKM E            + +KYKKLEM +F GEN ESW
Subjt:  MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGES-TTSEGSQLKMKGKMVEE----------AEFDKYKKLEMLIFAGENLESW

Query:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL
        V             E EKVKV VVSF +DEV+W+ WSH R    +WE+                    LI I+QE +Y +YVKKFV+YSA LP M ESVL
Subjt:  V-------------EVEKVKVPVVSFTEDEVNWFCWSHTRSPTTTWEE--------------------LIGIKQEETYADYVKKFVSYSALLPNMVESVL

Query:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN
        +D F+ GL+P ++ E ++         M+E + V                                                         KGE   +R 
Subjt:  IDTFMKGLKPDIRVEKLA---------MKELEHVRP------------------------------------------------------NKGEAEAERN

Query:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------
         ++      D G      +  S G      E REL  FI   EE++EE                        I+LK  T       +  + W        
Subjt:  RNSNDETSDDSGEGELSEEGTSDG------ENRELNVFIAGGEEDEEE------------------------IKLKANTEDIELRVVDKRTW--------

Query:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM
         +  GAT+NFI+  LA +LEL       +G  IGN      +G+C+ V + L  + + A+   ++LG V     M +        +HWP LTMSF  G  
Subjt:  -LFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLV-----MHY----RIYGVHWPLLTMSFVVGEM

Query:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG
        Q++LKGDPS  +++ SL+   K W +EDQGFL+E+ ++++            EG++  +   +  L+++  + +T +    KR+ D  I        +N 
Subjt:  QVILKGDPSFTKSKVSLKATSKVW-KEDQGFLIEFQHLEM------------EGNQDMLVKSKRYLKKHGCVQSTDRA-SSKRETDCHITI------VNG

Query:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------
        +P       K EIEKLV EMLQ G+IRPS+SPYS+PVLLV+KK+                  DKFPI VI ELLDEL G+                    
Subjt:  QP------SKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST-------------------

Query:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDS-----------------------
                   GHYEFLVMPFGLTNAPAT QSLMNQ+FK FLRR +LVFFYDILVY  DIT H++HL +VF VLRD+                       
Subjt:  --------NTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDS-----------------------

Query:  ----------------------DFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--
                              D  EL GFL LTG+YRRFV+ Y  IA PLTKLL KN+F WNE+A  AF RLK AM ++PVLA+P++SLPF IETDA  
Subjt:  ----------------------DFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDA--

Query:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A
                                     +   ELMAVVL++Q+W++YLLG KF+++SDQKALKFL+EQRE                            A
Subjt:  -----------------------------LDFEELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQRE----------------------------A

Query:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR
        AD L R+ P +E+H +T   ++D+ V+  EV++DE LQKI+A LK++ E   KF+W  G+LLYK RL                           LRTYKR
Subjt:  ADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERL---------------------------LRTYKR

Query:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------
        M+GELYW GMK D++KYVE  E CQRNK E+ KPAG+LQPLPIP ++ E+ T+DFIEGLPK  G + IMVV   L   ++  +LKH              
Subjt:  MTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVV---LQTSSFRKSLKH--------------

Query:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------
            H IP SIISDRDKIFLSNFW+                       TERVN+C+ETYLRCFCNEQP  W +++PW                       
Subjt:  ----HEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHWSKWLPW-----------------------

Query:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L
                   KK TN+ V   L  RD AL+ALKE+L +AQNRMKK AD  RRE++ +VG+EV LKL+PYRQRSLA+K+                    +
Subjt:  -----------KKTTNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKR--------------------L

Query:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE
        AYRLDLPPE  IHNVFH SQLK K+                FELQL PE ++GI WN EL A EWL+KW+G  E
Subjt:  AYRLDLPPEGVIHNVFHESQLKKKI---------------EFELQLIPEEIIGICWNSELAAKEWLVKWKGKTE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.64.1e-3027.44Show/hide
Query:  QPSKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKDE----------------------DKFPILVIVELLDEL------------------------
        Q  ++E+E  +++ML  GIIR S SPY++P+ +V KK +                      D+ PI  + E+L +L                        
Subjt:  QPSKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKDE----------------------DKFPILVIVELLDEL------------------------

Query:  ---NGSTNTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVL----------------RDSDFF-----
              +   GHYE+L MPFGL NAPAT Q  MN I +  L +  LV+  DI+V+ + +  H + L +VF  L                +++ F      
Subjt:  ---NGSTNTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVL----------------RDSDFF-----

Query:  ------------------------ELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNS--FAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDAL
                                E+  FLGLTG+YR+F+ N+  IA P+TK L KN      N E   AF++LK  +   P+L +P+F+  F + TDA 
Subjt:  ------------------------ELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNS--FAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDAL

Query:  DF-------------------------------EELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQREAADELLR
        D                                +EL+A+V A + +++YLLG  F + SD + L +L   ++   +L R
Subjt:  DF-------------------------------EELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQREAADELLR

P0CT34 Transposon Tf2-1 polyprotein7.8e-2922.76Show/hide
Query:  VEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST---------------------------NTRGHYEF
        + + L++GIIR S++  + PV+ V KK+                  + +P+ +I +LL ++ GST                             RG +E+
Subjt:  VEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST---------------------------NTRGHYEF

Query:  LVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDFF------------------------------------
        LVMP+G++ APA  Q  +N I        ++ +  DIL++    + H +H+  V   L++++                                      
Subjt:  LVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDFF------------------------------------

Query:  ---------ELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKN-SFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDALDF--------------
                 EL  FLG   + R+F+     +  PL  LL K+  + W      A E +K  ++S PVL   +FS   ++ETDA D               
Subjt:  ---------ELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKN-SFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDALDF--------------

Query:  ----------------------EELMAVVLAIQKWKYYLLG--NKFTVISDQKAL-------------------------KFLIEQR-----EAADELLR
                              +E++A++ +++ W++YL      F +++D + L                          F I  R       AD L R
Subjt:  ----------------------EELMAVVLAIQKWKYYLLG--NKFTVISDQKAL-------------------------KFLIEQR-----EAADELLR

Query:  MTPMLE----------IHSLTILSLL-------------DVKVILREVEEDEELQKIIAILKEDLEGKSKFQWV-----------------QGKLLYKER
        +    E          I+ +  +S+              D K++     ED+ +++ I  LK+ L   SK Q +                 +GKL++   
Subjt:  MTPMLE----------IHSLTILSLL-------------DVKVILREVEEDEELQKIIAILKEDLEGKSKFQWV-----------------QGKLLYKER

Query:  LLRT---YKRMTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVVL------------------
         L T    +R T    W G++  +++YV++  TCQ NK  + KP G LQP+P   + WE L++DFI  LP+ +GY+++ VV+                  
Subjt:  LLRT---YKRMTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVVL------------------

Query:  -QTSSF--RKSLKHHEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHW
         QT+    ++ + +   PK II+D D IF S  WK                       TER N+ VE  LRC C+  P+ W
Subjt:  -QTSSF--RKSLKHHEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHW

P0CT35 Transposon Tf2-2 polyprotein7.8e-2922.76Show/hide
Query:  VEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST---------------------------NTRGHYEF
        + + L++GIIR S++  + PV+ V KK+                  + +P+ +I +LL ++ GST                             RG +E+
Subjt:  VEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST---------------------------NTRGHYEF

Query:  LVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDFF------------------------------------
        LVMP+G++ APA  Q  +N I        ++ +  DIL++    + H +H+  V   L++++                                      
Subjt:  LVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDFF------------------------------------

Query:  ---------ELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKN-SFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDALDF--------------
                 EL  FLG   + R+F+     +  PL  LL K+  + W      A E +K  ++S PVL   +FS   ++ETDA D               
Subjt:  ---------ELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKN-SFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDALDF--------------

Query:  ----------------------EELMAVVLAIQKWKYYLLG--NKFTVISDQKAL-------------------------KFLIEQR-----EAADELLR
                              +E++A++ +++ W++YL      F +++D + L                          F I  R       AD L R
Subjt:  ----------------------EELMAVVLAIQKWKYYLLG--NKFTVISDQKAL-------------------------KFLIEQR-----EAADELLR

Query:  MTPMLE----------IHSLTILSLL-------------DVKVILREVEEDEELQKIIAILKEDLEGKSKFQWV-----------------QGKLLYKER
        +    E          I+ +  +S+              D K++     ED+ +++ I  LK+ L   SK Q +                 +GKL++   
Subjt:  MTPMLE----------IHSLTILSLL-------------DVKVILREVEEDEELQKIIAILKEDLEGKSKFQWV-----------------QGKLLYKER

Query:  LLRT---YKRMTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVVL------------------
         L T    +R T    W G++  +++YV++  TCQ NK  + KP G LQP+P   + WE L++DFI  LP+ +GY+++ VV+                  
Subjt:  LLRT---YKRMTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVVL------------------

Query:  -QTSSF--RKSLKHHEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHW
         QT+    ++ + +   PK II+D D IF S  WK                       TER N+ VE  LRC C+  P+ W
Subjt:  -QTSSF--RKSLKHHEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHW

P0CT41 Transposon Tf2-12 polyprotein7.8e-2922.76Show/hide
Query:  VEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST---------------------------NTRGHYEF
        + + L++GIIR S++  + PV+ V KK+                  + +P+ +I +LL ++ GST                             RG +E+
Subjt:  VEEMLQAGIIRPSQSPYSNPVLLVRKKD-----------------EDKFPILVIVELLDELNGST---------------------------NTRGHYEF

Query:  LVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDFF------------------------------------
        LVMP+G++ APA  Q  +N I        ++ +  DIL++    + H +H+  V   L++++                                      
Subjt:  LVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDFF------------------------------------

Query:  ---------ELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKN-SFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDALDF--------------
                 EL  FLG   + R+F+     +  PL  LL K+  + W      A E +K  ++S PVL   +FS   ++ETDA D               
Subjt:  ---------ELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKN-SFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDALDF--------------

Query:  ----------------------EELMAVVLAIQKWKYYLLG--NKFTVISDQKAL-------------------------KFLIEQR-----EAADELLR
                              +E++A++ +++ W++YL      F +++D + L                          F I  R       AD L R
Subjt:  ----------------------EELMAVVLAIQKWKYYLLG--NKFTVISDQKAL-------------------------KFLIEQR-----EAADELLR

Query:  MTPMLE----------IHSLTILSLL-------------DVKVILREVEEDEELQKIIAILKEDLEGKSKFQWV-----------------QGKLLYKER
        +    E          I+ +  +S+              D K++     ED+ +++ I  LK+ L   SK Q +                 +GKL++   
Subjt:  MTPMLE----------IHSLTILSLL-------------DVKVILREVEEDEELQKIIAILKEDLEGKSKFQWV-----------------QGKLLYKER

Query:  LLRT---YKRMTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVVL------------------
         L T    +R T    W G++  +++YV++  TCQ NK  + KP G LQP+P   + WE L++DFI  LP+ +GY+++ VV+                  
Subjt:  LLRT---YKRMTGELYWSGMKADVRKYVESWETCQRNKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVVL------------------

Query:  -QTSSF--RKSLKHHEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHW
         QT+    ++ + +   PK II+D D IF S  WK                       TER N+ VE  LRC C+  P+ W
Subjt:  -QTSSF--RKSLKHHEIPKSIISDRDKIFLSNFWK-----------------------TERVNRCVETYLRCFCNEQPSHW

P20825 Retrovirus-related Pol polyprotein from transposon 2975.4e-3027.11Show/hide
Query:  QPSKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKDE----------------------DKFPILVIVELLDEL------------------------
        Q  + E+E  V+EML  G+IR S SPY++P  +V KK +                      D++PI  + E+L +L                        
Subjt:  QPSKEEIEKLVEEMLQAGIIRPSQSPYSNPVLLVRKKDE----------------------DKFPILVIVELLDEL------------------------

Query:  ---NGSTNTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDF----------------------
              +   GHYE+L MPFGL NAPAT Q  MN I +  L +  LV+  DI+++ + +T H   + +VF  L D++                       
Subjt:  ---NGSTNTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLRDSDF----------------------

Query:  -----------------------FELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNE--EAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDAL
                                E+  FLGLTG+YR+F+ NY  IA P+T  L K +    +  E   AFE+LK  ++  P+L +P+F   FV+ TDA 
Subjt:  -----------------------FELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNE--EAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDAL

Query:  DF-------------------------------EELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQREAADELLRMTPMLEIHSLTI
        +                                +EL+A+V A + +++YLLG +F + SD + L++L   +E   +L R    L  +   I
Subjt:  DF-------------------------------EELMAVVLAIQKWKYYLLGNKFTVISDQKALKFLIEQREAADELLRMTPMLEIHSLTI

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.6e-1949.47Show/hide
Query:  LVYGSDITTHKRHLAVVFNVLRDSDFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFV
        ++ G  ++     L  +       +  EL GFLGLTG+YRRFV+NYG I  PLT+LL KNS  W E AA AF+ LKGA+ ++PVLA+P+  LPFV
Subjt:  LVYGSDITTHKRHLAVVFNVLRDSDFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACAGAAACAATTAGAGGAAAAGATAGAGGTGAATGAAAAGAATATGCCAAGCTTGAAAGAACTTTTGCTCGGGCTCTGTAAAAATATCGAGAAGTTAACCGATGA
AGTTCAAGAAAGTACTCTGAGTAGGAAAATGGGAGAGTCGACGACATCGGAGGGATCGCAACTGAAAATGAAAGGTAAGATGGTGGAAGAAGCGGAGTTTGACAAATATA
AGAAGTTGGAAATGCTGATCTTTGCTGGTGAGAACTTGGAATCATGGGTCGAAGTAGAGAAGGTTAAGGTACCTGTGGTAAGTTTTACTGAAGATGAAGTCAACTGGTTC
TGTTGGAGCCATACGCGTAGTCCGACAACGACATGGGAAGAGCTTATCGGGATAAAACAAGAAGAAACCTATGCCGATTATGTTAAGAAGTTTGTATCGTACTCCGCTCT
GTTGCCAAACATGGTAGAGAGTGTGTTGATCGACACGTTTATGAAAGGATTGAAGCCGGACATACGGGTGGAAAAGCTGGCGATGAAGGAATTAGAGCATGTTAGGCCCA
ACAAAGGGGAAGCCGAAGCCGAAAGGAACAGAAACTCCAACGATGAAACAAGTGACGATTCCGGTGAAGGGGAATTAAGTGAAGAGGGAACCTCCGATGGAGAGAATAGG
GAATTGAATGTATTCATAGCCGGGGGAGAAGAGGACGAAGAGGAGATTAAACTCAAAGCCAACACAGAAGACATAGAATTAAGAGTTGTGGATAAGAGAACGTGGTTGTT
CTGTGGAGCAACAAACAATTTTATCAACCAGAAGTTAGCGAAGGAGCTGGAACTATCATTTACTAAGACACTAAAGTATGGTGTGGTGATCGGGAACAAGAATGAGATTA
ACGATAGGGGAGTATGCCAAGCAGTCGTGATCACTTTACTAAGGCTTGTGGTGACTGCTAACCTATCGCCGATTGACTTGGGCCTGGTTATGCACTACCGAATTTATGGG
GTTCATTGGCCGTTGTTGACTATGTCATTTGTTGTGGGGGAAATGCAAGTAATATTGAAGGGTGACCCATCGTTCACTAAATCCAAGGTCTCGTTGAAGGCTACAAGTAA
GGTTTGGAAAGAAGATCAAGGGTTCTTGATAGAATTCCAACACCTGGAAATGGAAGGTAATCAGGACATGTTAGTAAAGAGCAAACGCTACTTAAAGAAACATGGATGTG
TACAAAGTACTGACAGAGCTTCTTCCAAGAGAGAGACGGATTGCCACATCACCATCGTCAATGGACAGCCCTCGAAAGAGGAAATTGAGAAACTTGTGGAGGAAATGTTG
CAAGCTGGCATCATTCGACCTAGCCAGAGTCCTTATTCCAACCCAGTCTTGTTGGTTCGAAAGAAGGACGAAGACAAATTTCCCATTCTGGTTATTGTGGAGTTACTAGA
TGAACTAAACGGTTCAACGAACACACGAGGGCATTATGAGTTCTTAGTTATGCCTTTCGGATTAACAAACGCACCGGCTACATTACAATCGTTAATGAACCAGATTTTTA
AACTGTTTCTGAGGAGACTTATTTTGGTCTTTTTTTATGACATTTTAGTGTATGGCTCCGACATTACAACACACAAGAGGCATTTGGCAGTGGTCTTTAACGTTCTAAGA
GATAGCGACTTTTTTGAGTTGATGGGATTTCTTGGTTTAACGGGGTTTTACCGGAGATTCGTGCAAAACTATGGAGTGATAGCAGCTCCATTAACCAAGCTACTGCATAA
GAACAGCTTTGCATGGAATGAAGAAGCAGCGGGAGCTTTTGAGAGGTTGAAGGGAGCTATGATGTCAATGCCGGTGTTGGCTATCCCTAACTTTTCATTGCCCTTTGTTA
TTGAGACAGATGCCTTAGATTTCGAGGAATTAATGGCAGTGGTGTTGGCAATCCAAAAATGGAAATATTATCTCCTTGGCAACAAATTCACCGTGATTTCAGACCAAAAG
GCCCTTAAGTTTCTGATTGAGCAAAGAGAGGCGGCAGATGAATTATTGAGGATGACCCCTATGCTTGAGATTCACAGCTTGACCATTCTTTCCTTACTGGATGTAAAAGT
CATTCTCAGAGAAGTGGAAGAAGATGAAGAATTACAGAAGATCATTGCAATACTCAAGGAGGACCTAGAGGGGAAGTCGAAATTCCAGTGGGTGCAAGGAAAATTGCTAT
ATAAAGAAAGGCTCCTAAGGACATATAAAAGAATGACAGGGGAGCTATATTGGTCAGGTATGAAAGCTGATGTCAGAAAGTATGTCGAGAGTTGGGAGACTTGTCAAAGA
AACAAAATTGAATCTCTCAAGCCTGCCGGATTATTGCAACCATTACCCATACCATATCAAGTTTGGGAGGAACTGACAATAGATTTTATAGAGGGACTGCCAAAATTAGC
CGGATATGATTCCATTATGGTGGTGTTGCAAACCTCTTCATTCAGGAAGTCTTTAAAGCACCATGAAATTCCAAAGTCTATTATATCGGATAGGGACAAGATCTTCCTCA
GCAACTTCTGGAAAACCGAGAGAGTAAATCGTTGTGTTGAGACGTATTTGCGGTGTTTTTGTAACGAGCAACCTTCGCATTGGAGTAAGTGGTTGCCATGGAAGAAGACA
ACCAATGATGTAGTGGGACAGCAATTGGTGGCTAGGGATTTAGCTTTGAATGCTTTGAAAGAACATTTGAGGGTAGCTCAAAACCGCATGAAGAAACAGGCGGATCAATT
GAGGCGGGAAGTGGAGTTTGAAGTTGGCGATGAGGTTTCTTTGAAGTTAAGACCTTACAGACAGAGATCTTTAGCTAAGAAGAGGTTAGCATACCGATTGGATCTTCCTC
CTGAAGGAGTTATACATAATGTCTTCCATGAGTCACAACTTAAAAAGAAGATTGAATTTGAGTTACAGTTGATACCAGAAGAGATTATAGGGATTTGTTGGAACAGTGAA
TTAGCAGCAAAGGAATGGTTGGTGAAATGGAAAGGAAAGACAGAAAGACAAGGTGCAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCACAGAAACAATTAGAGGAAAAGATAGAGGTGAATGAAAAGAATATGCCAAGCTTGAAAGAACTTTTGCTCGGGCTCTGTAAAAATATCGAGAAGTTAACCGATGA
AGTTCAAGAAAGTACTCTGAGTAGGAAAATGGGAGAGTCGACGACATCGGAGGGATCGCAACTGAAAATGAAAGGTAAGATGGTGGAAGAAGCGGAGTTTGACAAATATA
AGAAGTTGGAAATGCTGATCTTTGCTGGTGAGAACTTGGAATCATGGGTCGAAGTAGAGAAGGTTAAGGTACCTGTGGTAAGTTTTACTGAAGATGAAGTCAACTGGTTC
TGTTGGAGCCATACGCGTAGTCCGACAACGACATGGGAAGAGCTTATCGGGATAAAACAAGAAGAAACCTATGCCGATTATGTTAAGAAGTTTGTATCGTACTCCGCTCT
GTTGCCAAACATGGTAGAGAGTGTGTTGATCGACACGTTTATGAAAGGATTGAAGCCGGACATACGGGTGGAAAAGCTGGCGATGAAGGAATTAGAGCATGTTAGGCCCA
ACAAAGGGGAAGCCGAAGCCGAAAGGAACAGAAACTCCAACGATGAAACAAGTGACGATTCCGGTGAAGGGGAATTAAGTGAAGAGGGAACCTCCGATGGAGAGAATAGG
GAATTGAATGTATTCATAGCCGGGGGAGAAGAGGACGAAGAGGAGATTAAACTCAAAGCCAACACAGAAGACATAGAATTAAGAGTTGTGGATAAGAGAACGTGGTTGTT
CTGTGGAGCAACAAACAATTTTATCAACCAGAAGTTAGCGAAGGAGCTGGAACTATCATTTACTAAGACACTAAAGTATGGTGTGGTGATCGGGAACAAGAATGAGATTA
ACGATAGGGGAGTATGCCAAGCAGTCGTGATCACTTTACTAAGGCTTGTGGTGACTGCTAACCTATCGCCGATTGACTTGGGCCTGGTTATGCACTACCGAATTTATGGG
GTTCATTGGCCGTTGTTGACTATGTCATTTGTTGTGGGGGAAATGCAAGTAATATTGAAGGGTGACCCATCGTTCACTAAATCCAAGGTCTCGTTGAAGGCTACAAGTAA
GGTTTGGAAAGAAGATCAAGGGTTCTTGATAGAATTCCAACACCTGGAAATGGAAGGTAATCAGGACATGTTAGTAAAGAGCAAACGCTACTTAAAGAAACATGGATGTG
TACAAAGTACTGACAGAGCTTCTTCCAAGAGAGAGACGGATTGCCACATCACCATCGTCAATGGACAGCCCTCGAAAGAGGAAATTGAGAAACTTGTGGAGGAAATGTTG
CAAGCTGGCATCATTCGACCTAGCCAGAGTCCTTATTCCAACCCAGTCTTGTTGGTTCGAAAGAAGGACGAAGACAAATTTCCCATTCTGGTTATTGTGGAGTTACTAGA
TGAACTAAACGGTTCAACGAACACACGAGGGCATTATGAGTTCTTAGTTATGCCTTTCGGATTAACAAACGCACCGGCTACATTACAATCGTTAATGAACCAGATTTTTA
AACTGTTTCTGAGGAGACTTATTTTGGTCTTTTTTTATGACATTTTAGTGTATGGCTCCGACATTACAACACACAAGAGGCATTTGGCAGTGGTCTTTAACGTTCTAAGA
GATAGCGACTTTTTTGAGTTGATGGGATTTCTTGGTTTAACGGGGTTTTACCGGAGATTCGTGCAAAACTATGGAGTGATAGCAGCTCCATTAACCAAGCTACTGCATAA
GAACAGCTTTGCATGGAATGAAGAAGCAGCGGGAGCTTTTGAGAGGTTGAAGGGAGCTATGATGTCAATGCCGGTGTTGGCTATCCCTAACTTTTCATTGCCCTTTGTTA
TTGAGACAGATGCCTTAGATTTCGAGGAATTAATGGCAGTGGTGTTGGCAATCCAAAAATGGAAATATTATCTCCTTGGCAACAAATTCACCGTGATTTCAGACCAAAAG
GCCCTTAAGTTTCTGATTGAGCAAAGAGAGGCGGCAGATGAATTATTGAGGATGACCCCTATGCTTGAGATTCACAGCTTGACCATTCTTTCCTTACTGGATGTAAAAGT
CATTCTCAGAGAAGTGGAAGAAGATGAAGAATTACAGAAGATCATTGCAATACTCAAGGAGGACCTAGAGGGGAAGTCGAAATTCCAGTGGGTGCAAGGAAAATTGCTAT
ATAAAGAAAGGCTCCTAAGGACATATAAAAGAATGACAGGGGAGCTATATTGGTCAGGTATGAAAGCTGATGTCAGAAAGTATGTCGAGAGTTGGGAGACTTGTCAAAGA
AACAAAATTGAATCTCTCAAGCCTGCCGGATTATTGCAACCATTACCCATACCATATCAAGTTTGGGAGGAACTGACAATAGATTTTATAGAGGGACTGCCAAAATTAGC
CGGATATGATTCCATTATGGTGGTGTTGCAAACCTCTTCATTCAGGAAGTCTTTAAAGCACCATGAAATTCCAAAGTCTATTATATCGGATAGGGACAAGATCTTCCTCA
GCAACTTCTGGAAAACCGAGAGAGTAAATCGTTGTGTTGAGACGTATTTGCGGTGTTTTTGTAACGAGCAACCTTCGCATTGGAGTAAGTGGTTGCCATGGAAGAAGACA
ACCAATGATGTAGTGGGACAGCAATTGGTGGCTAGGGATTTAGCTTTGAATGCTTTGAAAGAACATTTGAGGGTAGCTCAAAACCGCATGAAGAAACAGGCGGATCAATT
GAGGCGGGAAGTGGAGTTTGAAGTTGGCGATGAGGTTTCTTTGAAGTTAAGACCTTACAGACAGAGATCTTTAGCTAAGAAGAGGTTAGCATACCGATTGGATCTTCCTC
CTGAAGGAGTTATACATAATGTCTTCCATGAGTCACAACTTAAAAAGAAGATTGAATTTGAGTTACAGTTGATACCAGAAGAGATTATAGGGATTTGTTGGAACAGTGAA
TTAGCAGCAAAGGAATGGTTGGTGAAATGGAAAGGAAAGACAGAAAGACAAGGTGCAGTTTGA
Protein sequenceShow/hide protein sequence
MAQKQLEEKIEVNEKNMPSLKELLLGLCKNIEKLTDEVQESTLSRKMGESTTSEGSQLKMKGKMVEEAEFDKYKKLEMLIFAGENLESWVEVEKVKVPVVSFTEDEVNWF
CWSHTRSPTTTWEELIGIKQEETYADYVKKFVSYSALLPNMVESVLIDTFMKGLKPDIRVEKLAMKELEHVRPNKGEAEAERNRNSNDETSDDSGEGELSEEGTSDGENR
ELNVFIAGGEEDEEEIKLKANTEDIELRVVDKRTWLFCGATNNFINQKLAKELELSFTKTLKYGVVIGNKNEINDRGVCQAVVITLLRLVVTANLSPIDLGLVMHYRIYG
VHWPLLTMSFVVGEMQVILKGDPSFTKSKVSLKATSKVWKEDQGFLIEFQHLEMEGNQDMLVKSKRYLKKHGCVQSTDRASSKRETDCHITIVNGQPSKEEIEKLVEEML
QAGIIRPSQSPYSNPVLLVRKKDEDKFPILVIVELLDELNGSTNTRGHYEFLVMPFGLTNAPATLQSLMNQIFKLFLRRLILVFFYDILVYGSDITTHKRHLAVVFNVLR
DSDFFELMGFLGLTGFYRRFVQNYGVIAAPLTKLLHKNSFAWNEEAAGAFERLKGAMMSMPVLAIPNFSLPFVIETDALDFEELMAVVLAIQKWKYYLLGNKFTVISDQK
ALKFLIEQREAADELLRMTPMLEIHSLTILSLLDVKVILREVEEDEELQKIIAILKEDLEGKSKFQWVQGKLLYKERLLRTYKRMTGELYWSGMKADVRKYVESWETCQR
NKIESLKPAGLLQPLPIPYQVWEELTIDFIEGLPKLAGYDSIMVVLQTSSFRKSLKHHEIPKSIISDRDKIFLSNFWKTERVNRCVETYLRCFCNEQPSHWSKWLPWKKT
TNDVVGQQLVARDLALNALKEHLRVAQNRMKKQADQLRREVEFEVGDEVSLKLRPYRQRSLAKKRLAYRLDLPPEGVIHNVFHESQLKKKIEFELQLIPEEIIGICWNSE
LAAKEWLVKWKGKTERQGAV