; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G20900 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G20900
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr3:16957550..16965312
RNA-Seq ExpressionCSPI03G20900
SyntenyCSPI03G20900
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW53010.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.2e-13429.8Show/hide
Query:  PKDQPKTLSPISVSSEESDDLSEV--DPKLDSEIEG----RGLNVLFNEEAIPILLLAMMKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKES
        P    ++++P+S SSE + +L  +   PK + +IEG    R   V    +++ I + +  K       GL    KR  +K F+ +  PDVVMIQE+KKE 
Subjt:  PKDQPKTLSPISVSSEESDDLSEV--DPKLDSEIEG----RGLNVLFNEEAIPILLLAMMKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKES

Query:  LNSVFIKSLWSSKDIGWEFVASVKSSGGILTI------------------------------------------KRKLVWPELSSLDVCAAAARCVGGDF
         +   + S+WS ++  W  + +  +SGGIL I                                           RK  W ELS +   +    CVGGDF
Subjt:  LNSVFIKSLWSSKDIGWEFVASVKSSGGILTI------------------------------------------KRKLVWPELSSLDVCAAAARCVGGDF

Query:  NITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPF
        N+ R + E+    R T  M  F++FI    L++ PL++  +TWS    +P    LDRFL + +W+ +F  S      R  SDH+P++LE   F WGP+PF
Subjt:  NITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPF

Query:  RFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW------------------------------------------KIEAQAKIRKN
        RF N WL  +   +      +     GW G     +L+ VK  +K W                                          K E + + ++ 
Subjt:  RFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW------------------------------------------KIEAQAKIRKN

Query:  V--------------------------------TLVAQFTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFI
        +                                 L + FT EEI+KA+  +  +KA GPDGFT+      W + K+    +  +FH +G +N     +FI
Subjt:  V--------------------------------TLVAQFTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFI

Query:  CLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKI
         L+ KK  +  + DFRPIS  T  YK++AKVL+  L+ V++  I+ +Q AF++GR ILD +LIANE V++ R   ++G + K+D EKA+  + W+FL+ +
Subjt:  CLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKI

Query:  LSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYD
        L  K FS +W  W+ GC+ S  +++ +NG  +G + ASRG+RQGDPLSPFLF +V++VL  ++ K       EGF VG++   +S LQFADDT+ F    
Subjt:  LSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYD

Query:  VNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNLLG----------------------------------------------GGRHTLY
           L  LK  +++F   SG KVN +KS + G+N+  + L++   +L                                               GGR TL 
Subjt:  VNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNLLG----------------------------------------------GGRHTLY

Query:  NSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSIQFGG-
         S    +P Y+LSLF IP +VAA +ER+ ++F W G    K +HLV W+ V  P   GGL  G I + N ALL KW WRY +E  ALW +++ SI +G  
Subjt:  NSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSIQFGG-

Query:  -----------WLTALPPLTLVTVLE----------------------------LG--FGQTLELVTRSSSLVSLELLFCPP------VQRSLKDEEIVE
                   W    P   +  V +                            LG  + + L +VT  ++L+S  L +  P       +R+L D EI +
Subjt:  -----------WLTALPPLTLVTVLE----------------------------LG--FGQTLELVTRSSSLVSLELLFCPP------VQRSLKDEEIVE

Query:  FQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPMNKRLFSA---------------IWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC
         +SL+  L    +  S  D RSWSI     + + K  F A               +W S  P +    +W++  + LN++++LQ + P+K LSP+IC LC
Subjt:  FQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPMNKRLFSA---------------IWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC

Query:  LKATRQRAEIMRAVDLNAAAWRSL
        +K       +     L    W  L
Subjt:  LKATRQRAEIMRAVDLNAAAWRSL

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]9.9e-13530.36Show/hide
Query:  LLLAMMKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSSGGI--------------------LTIK-
        L L  MKI+SWNTRGL    KR  +++F+   NPD+VM+QE+K+E+ +  F+ S+W  K + W  + +  +SGGI                    +T+K 
Subjt:  LLLAMMKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSSGGI--------------------LTIK-

Query:  ---------------------RKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSL
                             RK  W EL  L        CVGGDFN+ R   E+    R T  M  F++FI    L++ PL+N  FTWS     P    
Subjt:  ---------------------RKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSL

Query:  LDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAWKI-----
        LDRFL + +WD  F  S      R  SDH P+ LE     WGP+PFRF N WLL  E  +            GW G     +L+ VK  +K W I     
Subjt:  LDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAWKI-----

Query:  ----------------------------------------------EAQAKIRKNVTLVAQ---------------------------------------
                                                      E Q + +  V  + +                                       
Subjt:  ----------------------------------------------EAQAKIRKNVTLVAQ---------------------------------------

Query:  ----------------------------------------FTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQEN
                                                FT EE+ +A+  L   KA GPDGFT+    + W + K+    +  +FH NG +N      
Subjt:  ----------------------------------------FTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQEN

Query:  FICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLE
        FI LV KK  +  + D+RPIS  T  YK++AKVLS  L++V++  I+ SQ AF+EGRHILD +LIANE V++ R   ++G + K+D EKA+  VDW FL+
Subjt:  FICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLE

Query:  KILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCK
         +L  K FS KW  WI GC+ S  F++ +NG  +G + ASRG+RQGDPLSPFLF LV++VL  ++ +   +   EGF VG+D   +SLLQFADDT+ F K
Subjt:  KILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCK

Query:  YDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNL----------------LG------------------------------GGRHT
          +  L  LK  +++F   SG K+N EKS +SG+N   + L+   ++                LG                              GGR T
Subjt:  YDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNL----------------LG------------------------------GGRHT

Query:  LYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---
        L  S    +P+Y+LSLF IP ++A+ +E++ + F W G    K +HLV+W  VS P + GGL  G I + N ALL KW WR+ +E   LW K++ SI   
Subjt:  LYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---

Query:  QFGGWLTALP-------PLTLVTVLELGFGQTLELVTRSSSLV-------------------------------------SLELLFCPPVQRSLKDEEIV
           GW   +        P   +  +   F   + LV  +   +                                     S  L +    +R+L D EI 
Subjt:  QFGGWLTALP-------PLTLVTVLELGFGQTLELVTRSSSLV-------------------------------------SLELLFCPPVQRSLKDEEIV

Query:  EFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPM-----------NKRLF---SAIWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC
          Q L+S LS+ +   S  D R+WS+ S   + +           N  LF     +W S  P +   L WI+    +N+++ LQ + P K L P  C LC
Subjt:  EFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPM-----------NKRLF---SAIWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC

RVX11537.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]4.7e-13731.34Show/hide
Query:  KIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSSGGI--------------------LTIK-------
        KI+SWNTRGL    KR  +++F+   NPD+VM+QE+K+E+ +  F+ S+W  K + W  + +  + GGI                    +T+K       
Subjt:  KIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSSGGI--------------------LTIK-------

Query:  ---------------RKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLI
                       RK  W EL  L        CVGGDFN+ R   E+    R T  M  F++FI    L++ PL+N  FTWS     P    LDRFL 
Subjt:  ---------------RKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLI

Query:  NQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAWKI-----------
        + +WD  F  S      R  SDH P+ LE     WGP+PFRF N WLL  E  +            GW G     +L+ VK  +K W I           
Subjt:  NQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAWKI-----------

Query:  ----------------------------------------EAQAKIRKNVTLVAQ--------------------------------------FTVEEIF
                                                E Q + +  V  + +                                      FT EE+ 
Subjt:  ----------------------------------------EAQAKIRKNVTLVAQ--------------------------------------FTVEEIF

Query:  KALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIIN
        +A+  L   KA GPDGFT+    + W + K+    +  +FH NG +N      FI LV KK  +  + D+RPIS  T  YK++AKVLS  L++V++  I+
Subjt:  KALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIIN

Query:  PSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGD
         SQ AF+EGRHILD +LIANE V++ R   ++G + K+D EKA+  VDW FL+ +L  K FS KW SWI GC+ S  F++ +NG  +G + ASRG+RQGD
Subjt:  PSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGD

Query:  PLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNL
        PLSPFLF LV++VL  ++ +   +   EGF VG+D   +SLLQFADDT+ F K  +  L  LK  +++F   SG K+N EKS +SG+N   + L+   ++
Subjt:  PLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNL

Query:  ----------------LG------------------------------GGRHTLYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHL
                        LG                              GGR TL  S    +P+Y+LSLF IP ++A+ +E++ + F W      K +HL
Subjt:  ----------------LG------------------------------GGRHTLYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHL

Query:  VKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---QFGGWLTALP-------PLTLVTVLELGFGQTLELVTRSSSLV----
        V+W  VS P + GGL  G I + N ALL KW WR+ +E   LW K++ SI      GW   +        P   +  +   F   + LV  +   +    
Subjt:  VKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---QFGGWLTALP-------PLTLVTVLELGFGQTLELVTRSSSLV----

Query:  ---------------------------------SLELLFCPPVQRSLKDEEIVEFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPM-----------NK
                                         S  L +    +R+L D EI   Q L+S LS+ +   S  D R+WS+ S   + +           N 
Subjt:  ---------------------------------SLELLFCPPVQRSLKDEEIVEFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPM-----------NK

Query:  RLF---SAIWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC
         LF     +W S  P +   L WI+    +N+++ LQ + P K L P  C LC
Subjt:  RLF---SAIWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC

RVX23043.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]7.8e-14030.71Show/hide
Query:  IPPLDSFQNNTIKHSVSVNNPKLLEVYSSKSKSLS--QGSSCKADFSPKDQPKTLSPISVSSEE--------------SDDLSEVDPKLDSEIEGRG---
        IP    F N+ + H V +  P       S S+SL+  +       FS K   +   P+ + S E              S+ ++ +   L S  +      
Subjt:  IPPLDSFQNNTIKHSVSVNNPKLLEVYSSKSKSLS--QGSSCKADFSPKDQPKTLSPISVSSEE--------------SDDLSEVDPKLDSEIEGRG---

Query:  --LNVLFNEEAIPILLLAM--------------MKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSS
          L  LF EE   +  +                MKI+SWNTRGL    KR  +K F+ +  PDVVM QE+KKE  +  F+ S+W++++  W  + +    
Subjt:  --LNVLFNEEAIPILLLAM--------------MKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSS

Query:  GGILTIKRKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLF
            +  RK  W ELS +   A+   CVGGDFN+ R + E+    R T  M  F+ FI    L+++PL++  FTW     +P    LDRFL + +W+  F
Subjt:  GGILTIKRKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLF

Query:  ENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW-----------KIEAQAKIRK
          S      R  SDH+P++LE   F WGP+PFRF N WL      +            GW G     +L+ VK  +K W           K E      K
Subjt:  ENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW-----------KIEAQAKIRK

Query:  NVT------------------------LVAQFTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKE
          T                        L + FT EEIFKA+  +  +KA GPDGFT+      W + K+    +  +FH +G +N     +FI L+ KK 
Subjt:  NVT------------------------LVAQFTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKE

Query:  NATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFS
         +  + DFRPIS  T  YK++AKVL+  ++ V++  I+ +Q AF++GR ILD +LIANE V++ R   ++G + K+D EKA+  V W+FL+ +L  K F 
Subjt:  NATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFS

Query:  PKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKL
         +W  W+ GC+ S  F++ +NG  +  + ASRG+RQGDPLSPFLF +V++VL  ++ K       EGF VG++   +S LQF DDT+ F       +  L
Subjt:  PKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKL

Query:  KEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNLLG----------------------------------------------GGRHTLYNSVFVSL
        K  +++F   SG KVN +KS + G+N+  + L++   +L                                               GGR TL  S    +
Subjt:  KEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNLLG----------------------------------------------GGRHTLYNSVFVSL

Query:  PTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSIQFGG--------
        P Y+LSLF IP +VAA +ER+ + F W G    K +HLV W+ V  P   GGL  G I I N ALL KW WRY +E  ALW +++ SI +G         
Subjt:  PTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSIQFGG--------

Query:  ----WLTALPPLTLVTV------------------------------LELGFGQTLELVTRSSSLVSLELLFCPP------VQRSLKDEEIVEFQSLLSL
            W    P   +  V                              L + + + L +VT  ++ +S  L +  P       +R+L D EI   + L+  
Subjt:  ----WLTALPPLTLVTV------------------------------LELGFGQTLELVTRSSSLVSLELLFCPP------VQRSLKDEEIVEFQSLLSL

Query:  LSTEKVVTS-DDFRSWSID------------SMDAYPMNKRLFSA--IWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLCLK
             + +S  D RSWS+             ++  Y  +  +F    +W +  P +    +W++  + +N++++LQ + P K LSP IC LC+K
Subjt:  LSTEKVVTS-DDFRSWSID------------SMDAYPMNKRLFSA--IWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLCLK

RVX23556.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]4.7e-13730.58Show/hide
Query:  MKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSS--GGILTIKRKLVWPELSSLDVCAAAARCVGGD
        MKI+SWN RGL   +KR  +K F+    PDVVMIQE+KKE  +   + S+W+ ++  W+F+ +  +S  G      RK  W EL  +        CVGGD
Subjt:  MKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSS--GGILTIKRKLVWPELSSLDVCAAAARCVGGD

Query:  FNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSP
        FN+ R + E+    R T  M  F+ FI    L++ PL+N  FTWS    SP    LDRFL + +W  LF         R  SDH+P+ L+   F WGP+P
Subjt:  FNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSP

Query:  FRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW---------------------------------KIEAQAKIRKN--------
        FRF N WL      +            GW G     +L+ VK   K W                                 ++  Q  +RK         
Subjt:  FRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW---------------------------------KIEAQAKIRKN--------

Query:  -----------------------------------------------------------------------------------------VTLVAQFTVEE
                                                                                                 ++LVA FT EE
Subjt:  -----------------------------------------------------------------------------------------VTLVAQFTVEE

Query:  IFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAI
        I KA+  +  +KA GPDGFT+      W + K+    +  +FH +G +N     +FI L+ KK     + DFRPIS  T  YK++AKVLS  L+ V++  
Subjt:  IFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAI

Query:  INPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQ
        I+ +Q AF++GR I+D +LIANE V++ R   ++G + K+D EKA+  V W+FL+++L  K FSPKW  W+ GC+ S  +++ +NG  +G + ASRG+RQ
Subjt:  INPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQ

Query:  GDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTT
        GDPLSPFLF LV++VL  ++ +       EGF VG++   +S LQFADDT+ F       L  LK  ++ F   SG KVN +KS + G+N+    +++  
Subjt:  GDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTT

Query:  NLLG----------------------------------------------GGRHTLYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKIN
          LG                                              GGR TL +S    +P YYLSLF +P +VAA +ER+ + F W G    K +
Subjt:  NLLG----------------------------------------------GGRHTLYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKIN

Query:  HLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---QFGGW-----------------LTALPPLTLVTVLELGFGQTLEL
        HLV+W+ V +P + GGL  G I + N ALL KW WRY +E  ALW +++ SI      GW                        +  T   +G G+ +  
Subjt:  HLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---QFGGW-----------------LTALPPLTLVTVLELGFGQTLEL

Query:  VTRSSSLVSLELLFCPPVQRSLKDEEIVEFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPMNKRLFSA---------------IWKSGSPRRTNILIWI
             S      L+    +R+L D EI + + L+  L    +  S  D RSW + S   + + K  F A               +W S  P +    IW+
Subjt:  VTRSSSLVSLELLFCPPVQRSLKDEEIVEFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPMNKRLFSA---------------IWKSGSPRRTNILIWI

Query:  MIFESLNSSEILQKKSPNKYLSPSICPLCLKATRQRAEIMRAVDLNAAAWRSL
        +  + +N++++LQ + P K LSP IC LC+K       I     L    W  L
Subjt:  MIFESLNSSEILQKKSPNKYLSPSICPLCLKATRQRAEIMRAVDLNAAAWRSL

TrEMBL top hitse value%identityAlignment
A0A438EZ36 LINE-1 retrotransposable element ORF2 protein1.1e-13429.8Show/hide
Query:  PKDQPKTLSPISVSSEESDDLSEV--DPKLDSEIEG----RGLNVLFNEEAIPILLLAMMKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKES
        P    ++++P+S SSE + +L  +   PK + +IEG    R   V    +++ I + +  K       GL    KR  +K F+ +  PDVVMIQE+KKE 
Subjt:  PKDQPKTLSPISVSSEESDDLSEV--DPKLDSEIEG----RGLNVLFNEEAIPILLLAMMKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKES

Query:  LNSVFIKSLWSSKDIGWEFVASVKSSGGILTI------------------------------------------KRKLVWPELSSLDVCAAAARCVGGDF
         +   + S+WS ++  W  + +  +SGGIL I                                           RK  W ELS +   +    CVGGDF
Subjt:  LNSVFIKSLWSSKDIGWEFVASVKSSGGILTI------------------------------------------KRKLVWPELSSLDVCAAAARCVGGDF

Query:  NITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPF
        N+ R + E+    R T  M  F++FI    L++ PL++  +TWS    +P    LDRFL + +W+ +F  S      R  SDH+P++LE   F WGP+PF
Subjt:  NITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPF

Query:  RFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW------------------------------------------KIEAQAKIRKN
        RF N WL  +   +      +     GW G     +L+ VK  +K W                                          K E + + ++ 
Subjt:  RFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW------------------------------------------KIEAQAKIRKN

Query:  V--------------------------------TLVAQFTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFI
        +                                 L + FT EEI+KA+  +  +KA GPDGFT+      W + K+    +  +FH +G +N     +FI
Subjt:  V--------------------------------TLVAQFTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFI

Query:  CLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKI
         L+ KK  +  + DFRPIS  T  YK++AKVL+  L+ V++  I+ +Q AF++GR ILD +LIANE V++ R   ++G + K+D EKA+  + W+FL+ +
Subjt:  CLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKI

Query:  LSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYD
        L  K FS +W  W+ GC+ S  +++ +NG  +G + ASRG+RQGDPLSPFLF +V++VL  ++ K       EGF VG++   +S LQFADDT+ F    
Subjt:  LSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYD

Query:  VNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNLLG----------------------------------------------GGRHTLY
           L  LK  +++F   SG KVN +KS + G+N+  + L++   +L                                               GGR TL 
Subjt:  VNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNLLG----------------------------------------------GGRHTLY

Query:  NSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSIQFGG-
         S    +P Y+LSLF IP +VAA +ER+ ++F W G    K +HLV W+ V  P   GGL  G I + N ALL KW WRY +E  ALW +++ SI +G  
Subjt:  NSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSIQFGG-

Query:  -----------WLTALPPLTLVTVLE----------------------------LG--FGQTLELVTRSSSLVSLELLFCPP------VQRSLKDEEIVE
                   W    P   +  V +                            LG  + + L +VT  ++L+S  L +  P       +R+L D EI +
Subjt:  -----------WLTALPPLTLVTVLE----------------------------LG--FGQTLELVTRSSSLVSLELLFCPP------VQRSLKDEEIVE

Query:  FQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPMNKRLFSA---------------IWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC
         +SL+  L    +  S  D RSWSI     + + K  F A               +W S  P +    +W++  + LN++++LQ + P+K LSP+IC LC
Subjt:  FQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPMNKRLFSA---------------IWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC

Query:  LKATRQRAEIMRAVDLNAAAWRSL
        +K       +     L    W  L
Subjt:  LKATRQRAEIMRAVDLNAAAWRSL

A0A438FWU5 LINE-1 retrotransposable element ORF2 protein4.8e-13530.36Show/hide
Query:  LLLAMMKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSSGGI--------------------LTIK-
        L L  MKI+SWNTRGL    KR  +++F+   NPD+VM+QE+K+E+ +  F+ S+W  K + W  + +  +SGGI                    +T+K 
Subjt:  LLLAMMKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSSGGI--------------------LTIK-

Query:  ---------------------RKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSL
                             RK  W EL  L        CVGGDFN+ R   E+    R T  M  F++FI    L++ PL+N  FTWS     P    
Subjt:  ---------------------RKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSL

Query:  LDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAWKI-----
        LDRFL + +WD  F  S      R  SDH P+ LE     WGP+PFRF N WLL  E  +            GW G     +L+ VK  +K W I     
Subjt:  LDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAWKI-----

Query:  ----------------------------------------------EAQAKIRKNVTLVAQ---------------------------------------
                                                      E Q + +  V  + +                                       
Subjt:  ----------------------------------------------EAQAKIRKNVTLVAQ---------------------------------------

Query:  ----------------------------------------FTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQEN
                                                FT EE+ +A+  L   KA GPDGFT+    + W + K+    +  +FH NG +N      
Subjt:  ----------------------------------------FTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQEN

Query:  FICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLE
        FI LV KK  +  + D+RPIS  T  YK++AKVLS  L++V++  I+ SQ AF+EGRHILD +LIANE V++ R   ++G + K+D EKA+  VDW FL+
Subjt:  FICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLE

Query:  KILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCK
         +L  K FS KW  WI GC+ S  F++ +NG  +G + ASRG+RQGDPLSPFLF LV++VL  ++ +   +   EGF VG+D   +SLLQFADDT+ F K
Subjt:  KILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCK

Query:  YDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNL----------------LG------------------------------GGRHT
          +  L  LK  +++F   SG K+N EKS +SG+N   + L+   ++                LG                              GGR T
Subjt:  YDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNL----------------LG------------------------------GGRHT

Query:  LYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---
        L  S    +P+Y+LSLF IP ++A+ +E++ + F W G    K +HLV+W  VS P + GGL  G I + N ALL KW WR+ +E   LW K++ SI   
Subjt:  LYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---

Query:  QFGGWLTALP-------PLTLVTVLELGFGQTLELVTRSSSLV-------------------------------------SLELLFCPPVQRSLKDEEIV
           GW   +        P   +  +   F   + LV  +   +                                     S  L +    +R+L D EI 
Subjt:  QFGGWLTALP-------PLTLVTVLELGFGQTLELVTRSSSLV-------------------------------------SLELLFCPPVQRSLKDEEIV

Query:  EFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPM-----------NKRLF---SAIWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC
          Q L+S LS+ +   S  D R+WS+ S   + +           N  LF     +W S  P +   L WI+    +N+++ LQ + P K L P  C LC
Subjt:  EFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPM-----------NKRLF---SAIWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC

A0A438JRF4 LINE-1 retrotransposable element ORF2 protein2.3e-13731.34Show/hide
Query:  KIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSSGGI--------------------LTIK-------
        KI+SWNTRGL    KR  +++F+   NPD+VM+QE+K+E+ +  F+ S+W  K + W  + +  + GGI                    +T+K       
Subjt:  KIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSSGGI--------------------LTIK-------

Query:  ---------------RKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLI
                       RK  W EL  L        CVGGDFN+ R   E+    R T  M  F++FI    L++ PL+N  FTWS     P    LDRFL 
Subjt:  ---------------RKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLI

Query:  NQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAWKI-----------
        + +WD  F  S      R  SDH P+ LE     WGP+PFRF N WLL  E  +            GW G     +L+ VK  +K W I           
Subjt:  NQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAWKI-----------

Query:  ----------------------------------------EAQAKIRKNVTLVAQ--------------------------------------FTVEEIF
                                                E Q + +  V  + +                                      FT EE+ 
Subjt:  ----------------------------------------EAQAKIRKNVTLVAQ--------------------------------------FTVEEIF

Query:  KALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIIN
        +A+  L   KA GPDGFT+    + W + K+    +  +FH NG +N      FI LV KK  +  + D+RPIS  T  YK++AKVLS  L++V++  I+
Subjt:  KALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIIN

Query:  PSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGD
         SQ AF+EGRHILD +LIANE V++ R   ++G + K+D EKA+  VDW FL+ +L  K FS KW SWI GC+ S  F++ +NG  +G + ASRG+RQGD
Subjt:  PSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGD

Query:  PLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNL
        PLSPFLF LV++VL  ++ +   +   EGF VG+D   +SLLQFADDT+ F K  +  L  LK  +++F   SG K+N EKS +SG+N   + L+   ++
Subjt:  PLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNL

Query:  ----------------LG------------------------------GGRHTLYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHL
                        LG                              GGR TL  S    +P+Y+LSLF IP ++A+ +E++ + F W      K +HL
Subjt:  ----------------LG------------------------------GGRHTLYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHL

Query:  VKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---QFGGWLTALP-------PLTLVTVLELGFGQTLELVTRSSSLV----
        V+W  VS P + GGL  G I + N ALL KW WR+ +E   LW K++ SI      GW   +        P   +  +   F   + LV  +   +    
Subjt:  VKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---QFGGWLTALP-------PLTLVTVLELGFGQTLELVTRSSSLV----

Query:  ---------------------------------SLELLFCPPVQRSLKDEEIVEFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPM-----------NK
                                         S  L +    +R+L D EI   Q L+S LS+ +   S  D R+WS+ S   + +           N 
Subjt:  ---------------------------------SLELLFCPPVQRSLKDEEIVEFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPM-----------NK

Query:  RLF---SAIWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC
         LF     +W S  P +   L WI+    +N+++ LQ + P K L P  C LC
Subjt:  RLF---SAIWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLC

A0A438KPB4 LINE-1 retrotransposable element ORF2 protein3.8e-14030.71Show/hide
Query:  IPPLDSFQNNTIKHSVSVNNPKLLEVYSSKSKSLS--QGSSCKADFSPKDQPKTLSPISVSSEE--------------SDDLSEVDPKLDSEIEGRG---
        IP    F N+ + H V +  P       S S+SL+  +       FS K   +   P+ + S E              S+ ++ +   L S  +      
Subjt:  IPPLDSFQNNTIKHSVSVNNPKLLEVYSSKSKSLS--QGSSCKADFSPKDQPKTLSPISVSSEE--------------SDDLSEVDPKLDSEIEGRG---

Query:  --LNVLFNEEAIPILLLAM--------------MKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSS
          L  LF EE   +  +                MKI+SWNTRGL    KR  +K F+ +  PDVVM QE+KKE  +  F+ S+W++++  W  + +    
Subjt:  --LNVLFNEEAIPILLLAM--------------MKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSS

Query:  GGILTIKRKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLF
            +  RK  W ELS +   A+   CVGGDFN+ R + E+    R T  M  F+ FI    L+++PL++  FTW     +P    LDRFL + +W+  F
Subjt:  GGILTIKRKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLF

Query:  ENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW-----------KIEAQAKIRK
          S      R  SDH+P++LE   F WGP+PFRF N WL      +            GW G     +L+ VK  +K W           K E      K
Subjt:  ENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW-----------KIEAQAKIRK

Query:  NVT------------------------LVAQFTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKE
          T                        L + FT EEIFKA+  +  +KA GPDGFT+      W + K+    +  +FH +G +N     +FI L+ KK 
Subjt:  NVT------------------------LVAQFTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKE

Query:  NATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFS
         +  + DFRPIS  T  YK++AKVL+  ++ V++  I+ +Q AF++GR ILD +LIANE V++ R   ++G + K+D EKA+  V W+FL+ +L  K F 
Subjt:  NATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFS

Query:  PKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKL
         +W  W+ GC+ S  F++ +NG  +  + ASRG+RQGDPLSPFLF +V++VL  ++ K       EGF VG++   +S LQF DDT+ F       +  L
Subjt:  PKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKL

Query:  KEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNLLG----------------------------------------------GGRHTLYNSVFVSL
        K  +++F   SG KVN +KS + G+N+  + L++   +L                                               GGR TL  S    +
Subjt:  KEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNLLG----------------------------------------------GGRHTLYNSVFVSL

Query:  PTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSIQFGG--------
        P Y+LSLF IP +VAA +ER+ + F W G    K +HLV W+ V  P   GGL  G I I N ALL KW WRY +E  ALW +++ SI +G         
Subjt:  PTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSIQFGG--------

Query:  ----WLTALPPLTLVTV------------------------------LELGFGQTLELVTRSSSLVSLELLFCPP------VQRSLKDEEIVEFQSLLSL
            W    P   +  V                              L + + + L +VT  ++ +S  L +  P       +R+L D EI   + L+  
Subjt:  ----WLTALPPLTLVTV------------------------------LELGFGQTLELVTRSSSLVSLELLFCPP------VQRSLKDEEIVEFQSLLSL

Query:  LSTEKVVTS-DDFRSWSID------------SMDAYPMNKRLFSA--IWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLCLK
             + +S  D RSWS+             ++  Y  +  +F    +W +  P +    +W++  + +N++++LQ + P K LSP IC LC+K
Subjt:  LSTEKVVTS-DDFRSWSID------------SMDAYPMNKRLFSA--IWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLCLK

A0A438KQT0 Transposon TX1 uncharacterized 149 kDa protein2.3e-13730.58Show/hide
Query:  MKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSS--GGILTIKRKLVWPELSSLDVCAAAARCVGGD
        MKI+SWN RGL   +KR  +K F+    PDVVMIQE+KKE  +   + S+W+ ++  W+F+ +  +S  G      RK  W EL  +        CVGGD
Subjt:  MKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIGWEFVASVKSS--GGILTIKRKLVWPELSSLDVCAAAARCVGGD

Query:  FNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSP
        FN+ R + E+    R T  M  F+ FI    L++ PL+N  FTWS    SP    LDRFL + +W  LF         R  SDH+P+ L+   F WGP+P
Subjt:  FNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLFENSRVSHKARIFSDHFPLMLEAGAFLWGPSP

Query:  FRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW---------------------------------KIEAQAKIRKN--------
        FRF N WL      +            GW G     +L+ VK   K W                                 ++  Q  +RK         
Subjt:  FRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAW---------------------------------KIEAQAKIRKN--------

Query:  -----------------------------------------------------------------------------------------VTLVAQFTVEE
                                                                                                 ++LVA FT EE
Subjt:  -----------------------------------------------------------------------------------------VTLVAQFTVEE

Query:  IFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAI
        I KA+  +  +KA GPDGFT+      W + K+    +  +FH +G +N     +FI L+ KK     + DFRPIS  T  YK++AKVLS  L+ V++  
Subjt:  IFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAI

Query:  INPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQ
        I+ +Q AF++GR I+D +LIANE V++ R   ++G + K+D EKA+  V W+FL+++L  K FSPKW  W+ GC+ S  +++ +NG  +G + ASRG+RQ
Subjt:  INPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQ

Query:  GDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTT
        GDPLSPFLF LV++VL  ++ +       EGF VG++   +S LQFADDT+ F       L  LK  ++ F   SG KVN +KS + G+N+    +++  
Subjt:  GDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTT

Query:  NLLG----------------------------------------------GGRHTLYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKIN
          LG                                              GGR TL +S    +P YYLSLF +P +VAA +ER+ + F W G    K +
Subjt:  NLLG----------------------------------------------GGRHTLYNSVFVSLPTYYLSLFSIPENVAASLERIMQKFFWEGHAGSKIN

Query:  HLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---QFGGW-----------------LTALPPLTLVTVLELGFGQTLEL
        HLV+W+ V +P + GGL  G I + N ALL KW WRY +E  ALW +++ SI      GW                        +  T   +G G+ +  
Subjt:  HLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSI---QFGGW-----------------LTALPPLTLVTVLELGFGQTLEL

Query:  VTRSSSLVSLELLFCPPVQRSLKDEEIVEFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPMNKRLFSA---------------IWKSGSPRRTNILIWI
             S      L+    +R+L D EI + + L+  L    +  S  D RSW + S   + + K  F A               +W S  P +    IW+
Subjt:  VTRSSSLVSLELLFCPPVQRSLKDEEIVEFQSLLSLLSTEKVVTS-DDFRSWSIDSMDAYPMNKRLFSA---------------IWKSGSPRRTNILIWI

Query:  MIFESLNSSEILQKKSPNKYLSPSICPLCLKATRQRAEIMRAVDLNAAAWRSL
        +  + +N++++LQ + P K LSP IC LC+K       I     L    W  L
Subjt:  MIFESLNSSEILQKKSPNKYLSPSICPLCLKATRQRAEIMRAVDLNAAAWRSL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.8e-2630.69Show/hide
Query:  TVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKD-FRPISPTTLTYKVVAKVLSECLKQ
        T  EI   + +L + K+ GPDGFT EF  ++          L       G L     E  I L+ K    T  K+ FRPIS   +  K++ K+L+  ++Q
Subjt:  TVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKD-FRPISPTTLTYKVVAKVLSECLKQ

Query:  VINAIINPSQSAFIEGRHILDPILIANEAVEDY-RAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITA
         I  +I+  Q  FI G      I  +   ++   RAK K   I+ +D EKAF ++   F+ K L+       ++  I      P  ++ +NG+       
Subjt:  VINAIINPSQSAFIEGRHILDPILIANEAVEDY-RAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITA

Query:  SRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKS
          G RQG PLSP LF +V EVL   I      ++ +G  +GK+ + LSL  FADD +++ +  +     L + I  F   SG K+N +KS
Subjt:  SRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKS

O10341 Uncharacterized 29.3 kDa protein1.0e-0947.26Show/hide
Query:  TPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTP
        T   LPSP     PTPS  P+P     PTP+  P+P     PTP+  P+P   L PTP+  PSP   L PTPS  P+P     PTPS  PSP     PTP
Subjt:  TPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTP

Query:  SNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLLSPPHS
        S  PSP     PTPS  P+P SP  PTPS  PSP     PTPS  PSP     PTPS  PSP     PTPS  PSP  P  PTP   PSPL   +  P S
Subjt:  SNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLLSPPHS

Query:  L
        +
Subjt:  L

P11369 LINE-1 retrotransposable element ORF2 protein3.2e-2730.27Show/hide
Query:  EEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQK-KENATLVKDFRPISPTTLTYKVVAKVLSECLKQVI
        +EI   + +L + K+ GPDGF+ EF          +   L     + G L     E  I L+ K +++ T +++FRPIS   +  K++ K+L+  +++ I
Subjt:  EEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQK-KENATLVKDFRPISPTTLTYKVVAKVLSECLKQVI

Query:  NAIINPSQSAFIEG-------RHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRG
         AII+P Q  FI G       R  ++ I   N      + K K   I+ LD EKAF ++   F+ K+L        +++ I      P  ++ +NG+   
Subjt:  NAIINPSQSAFIEG-------RHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRG

Query:  RITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKS
         I    G RQG PLSP+LF +V EVL   I      ++ +G  +GK+ + +SLL  ADD +++     N   +L   I  F    G K+N  KS
Subjt:  RITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKS

P14381 Transposon TX1 uncharacterized 149 kDa protein7.3e-3226.16Show/hide
Query:  RKNVTLVAQFTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKDFRPISPTTLTYKVVA
        R+   L    T++E+ +AL+ +  NK+ G DG T+EF    W      F  ++ +    G+L    +   + L+ KK +  L+K++RP+S  +  YK+VA
Subjt:  RKNVTLVAQFTVEEIFKALKALGSNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKDFRPISPTTLTYKVVA

Query:  KVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFING
        K +S  LK V+  +I+P QS  + GR I D + +  + +   R        L LD EKAF RVD  +L   L    F P++V ++     S +  + IN 
Subjt:  KVLSECLKQVINAIINPSQSAFIEGRHILDPILIANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFING

Query:  KPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKSA-
             +   RG+RQG PLS  L+ L  E    ++ K     +  G ++ +  + + L  +ADD +L  + D+  L + +E   ++   S  ++NW KS+ 
Subjt:  KPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQFEGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKSA-

Query:  --------------------------LSGVNVGADELTQTTNLLG---------------------GGRHTLYNSVFVSLPTYYLSLFSIPENVAASLER
                                    GV + A+E   + N +                       GR  + N +  S   Y L   S  +   A ++R
Subjt:  --------------------------LSGVNVGADELTQTTNLLG---------------------GGRHTLYNSVFVSLPTYYLSLFSIPENVAASLER

Query:  IMQKFFWEGHAGSKINHLVKWNKVSSPLKDGG
         +  F W G       H V     S PLK+GG
Subjt:  IMQKFFWEGHAGSKINHLVKWNKVSSPLKDGG

Q9FPQ6 Vegetative cell wall protein gp12.0e-1344.59Show/hide
Query:  SPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSP
        SP  PL P+P+  PSP  P+ P+PS  P P SP  P+P+  PSP  P+ P+P+  PSP  P+ P+P+  PSP  P+ P+P+  PSP SP  P+P + P+P
Subjt:  SPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSP

Query:  LSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNL---PS
         SP  P P + P P SP  P+P+  PSP  P  P P + P P  P  P P+N P P SP    PS  PSP  P  PTP + PSP SP+ P+P+ +   P+
Subjt:  LSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNL---PS

Query:  PLSPLLSPPHSLRPPLSGSQVFQQPSPFSSP
        P SP  SPP S  PP         PSP  SP
Subjt:  PLSPLLSPPHSLRPPLSGSQVFQQPSPFSSP

Arabidopsis top hitse value%identityAlignment
AT3G19020.1 Leucine-rich repeat (LRR) family protein3.0e-0434.65Show/hide
Query:  PLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLH-PTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSP
        P  P+PS   +  +     P + P P  P+H  P  + SP  P+H  P  + SP  P+H P P  + SP  P+H  P  + SP  P+H  P  + SP  P
Subjt:  PLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLH-PTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSP

Query:  LHPTPSNLPSPLSPLHPTPSNLPSPLSPLH-PTPSNLPSPLSPLH-PTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSP
        +H  P  + SP     P P   P P +P++ P P  + SP  P+H P P  + SP  P+H  P  + SP  P+H  P  + SP     P PS + SP  P
Subjt:  LHPTPSNLPSPLSPLHPTPSNLPSPLSPLH-PTPSNLPSPLSPLH-PTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSP

Query:  LLSPPHSLRPPL--SGSQVFQQPSPFSS
        + SPP     PL  + S +   P+P SS
Subjt:  LLSPPHSLRPPL--SGSQVFQQPSPFSS

AT3G19430.1 late embryogenesis abundant protein-related / LEA protein-related1.3e-0443.14Show/hide
Query:  TPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPS-NLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPT
        TP     P+SP  PTPS +PSP  P+ P P   P+P  P  PTP     P+SP  PTP+ ++PSP  P+ P P   P+P  P  PTP   P P +P    
Subjt:  TPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPS-NLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPT

Query:  PSNLPSPLSPLHPTPSNLPSPLSPLHPTPS-NLPSP--LSPLHPTPSNLPSP--LSPLHPTPSNLPSP--LSPLHPTPSNLPSPL-SPLLSPPHSLRPPL
        PS  P+P  P  P PS  P P+SP  PTP+ ++PSP  ++P  PTPS +PSP  ++P  PTPS +PSP  ++P  PTP ++P+P  SP   PP S     
Subjt:  PSNLPSPLSPLHPTPSNLPSPLSPLHPTPS-NLPSP--LSPLHPTPSNLPSP--LSPLHPTPSNLPSP--LSPLHPTPSNLPSPL-SPLLSPPHSLRPPL

Query:  SGSQ
        +G++
Subjt:  SGSQ

AT4G18670.1 Leucine-rich repeat (LRR) family protein2.9e-0741.59Show/hide
Query:  SPLSPLH-PTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPS
        SP  P+  P+P   PSP     P+PS +PSP S    TPS    P SP  PTP   P P SP  PTP   P P SP  PTP   P P SP  P+P   P 
Subjt:  SPLSPLH-PTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPS

Query:  PLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSN-LPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSP
          S     P  +PSP S    TP++  SP SP  PTPS+ +PSP     PTPS  P+P+SP   +P  +PSP     P+P + PSP     P P  +PSP
Subjt:  PLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSN-LPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSP

Query:  LSPLLSPPHSLRPPLSGSQVFQQPSP
          P++ P  S  PP + + V+  P P
Subjt:  LSPLLSPPHSLRPPLSGSQVFQQPSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAATTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCT
CCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCT
CTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACT
CCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTC
CCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACC
TCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCTCTCTCCACCCCACTCCCTCCGACCTCCCCTTTCCGGCAGCCAG
GTTTTTCAGCAGCCCTCTCCATTTAGTTCACCATGGAGGTGGTTAGTTGTAAAATTGACCAGTCACACTATTGCATTTGACAAGAAGGAGATCGTCGCTTTATTGAAGAT
TTTTTCGGTAGAGAGACATGTTAGGAAACATAGAGATATTGACAGAGGCAGATTGAGTATTTCAAAGTTTCAAGCTTCGTTGGGTTGGATACTGAGTTGTGATCACTGGC
CATATTCAGGGGGTCTCTTTAATATCAGAGTGTGCTTAGGTAGCAAGAAGCAAGGTTGGAAATTGTTTTGCTCGATGCTAGATAAATTTGTTGAAAAACTGGATTATGCT
AGATGGTTTGCTGAAAATTCCCTCAATATTCCTCCTCCGATTTTGAATAGGAAACTAAGTTATGGTGAAATGGCAAAATCAGTTCTTTCCAATTCCCCTGTTCCAAAACC
TAATTTTAAGGTTACTAACCAGCCTCCTAGTCAAAAAAGGAAGAATGCCCTCATTAGCATTGACGAAGGATCGATTTTAGATCTCATACATGAGGAAGGAAAATGGCAAG
CCTGGGGAATTTTCATCTCAAATTCGAAAGATGGAATAACTTTAATCACAGTCGGCCTCTTGTTCAAAAAGGCTATGGCGGTTGGCTCAAAATTAAAAAACTTCCCTTGG
ATTATTGGTGAAGTACCTTTGAGGATTAGAGAAGTTATGATTGATGAAGATCTTGATCCCTCCTTTCCTCTAGTCCAGAAAATCCCGAAATCAGTTGAAGAAGCATTGAA
TGATCGGGCAACTGAACAGGCCCCTCTTTTGAATTTTACTAAACCCAAGTCAAACAGCTGTTGCAAAAAGGCAGTGGGGTCCACTCCTAATGATCCTGTTTTTAATATTC
CAGTAGCCGAAGAGTCCTCCTTTGAATCTTTAAATGAAAGGAATAAAGGAAAAGGAGTCCACTTCAATAAATCTTTGAGTTATAACCATTTTTGCCCAAGTTTTTCAGAA
GACAACTCAACAAATTTATTGCAGAAAAGGTTACAAATCGTTTCTGGTTTTAATAATCCAGACAGCTCAGCGGATTTATTGCCAAAACCATTGCAAAATGTTTCAGTTTC
TGATTCTTTTCTTTATGTTAGAACGGGGGCCATCTTTCATCAGAACCCTGCTAAATCTGAATTAAATGGGATTTGTGACCTTAAAATTCCCCCTTTGGACTCTTTTCAGA
ATAACACCATCAAGCATTCAGTTTCTGTCAATAACCCAAAACTTCTGGAAGTTTACAGCTCTAAATCCAAGTCCCTGAGTCAAGGCTCTTCATGCAAAGCTGACTTTAGT
CCTAAAGACCAGCCCAAAACTTTATCTCCTATCAGTGTTAGTAGTGAAGAATCAGATGATCTTAGTGAAGTTGACCCTAAATTGGATTCAGAAATTGAAGGGAGGGGCTT
AAATGTTTTGTTTAATGAGGAGGCCATCCCGATCCTTCTCCTAGCCATGATGAAGATTGTCTCATGGAACACTAGGGGCTTAAGTGATTTCTCTAAACGTGCAGCACTCA
AAAAGTTCATCATAAACTGTAATCCAGATGTGGTCATGATTCAAGAATCTAAGAAAGAAAGTTTGAATTCAGTTTTCATTAAATCACTGTGGAGCTCCAAGGATATTGGC
TGGGAATTTGTGGCATCTGTTAAGTCATCCGGGGGAATCCTTACAATAAAAAGAAAATTGGTTTGGCCAGAATTGTCTTCCTTAGATGTTTGTGCGGCAGCAGCTAGGTG
TGTTGGTGGGGATTTTAATATAACTCGATGGGCACATGAAAGATTTCCCTTTGGCAGGAGCACTAGAGGTATGTCTTTGTTCAATAAGTTCATTGAGTTGGGTAATCTCA
TGGAAATTCCTTTGCAAAATGGCAGGTTCACTTGGTCTAGAGAAGGGTTTTCTCCTTCTAGATCACTATTGGACAGATTCTTAATAAATCAGCAGTGGGACGATCTTTTC
GAAAACTCTCGGGTGTCACACAAAGCACGTATTTTCTCGGATCACTTTCCTCTAATGTTGGAGGCTGGAGCTTTTCTTTGGGGACCCTCTCCTTTTCGATTTTGTAACAG
TTGGTTGTTGTCCAATGAATGTAATCAGATTATAGTTGAAACAGTCAATAATTCTAACTTCTACGGTTGGGCTGGATTTATTTTACAAGAACAGTTGAGATCAGTTAAAT
TGGCAGTTAAAGCTTGGAAAATTGAAGCACAAGCCAAGATTAGAAAAAATGTAACTCTTGTTGCTCAATTTACCGTAGAGGAAATATTTAAGGCTCTGAAGGCACTTGGC
AGCAATAAAGCTTCGGGCCCAGATGGTTTTACAGTGGAGTTCCTCCTCAAACACTGGTCTATTTTCAAGGATATGTTTGAATCCTTAATGGATGATTTCCACATAAACGG
AAAATTAAACGCTTGCATTCAAGAAAATTTCATTTGTTTAGTGCAGAAAAAGGAGAACGCAACTCTAGTCAAGGATTTCCGTCCAATCAGCCCTACTACCTTAACATACA
AGGTAGTTGCAAAGGTTCTTTCGGAATGCTTGAAACAAGTTATCAATGCAATTATAAATCCTTCACAAAGTGCTTTTATTGAAGGTAGGCATATTCTTGACCCTATATTA
ATTGCTAATGAGGCAGTGGAAGACTATAGGGCAAAACGAAAAAAGGGATGGATCTTAAAACTTGATCTAGAAAAAGCCTTTCATAGGGTGGATTGGAATTTTCTAGAAAA
GATATTATCTTTTAAAATATTCAGCCCCAAGTGGGTCTCATGGATAATGGGTTGCATAAAAAGTCCAAAATTCTCATTATTCATTAATGGTAAGCCAAGAGGCCGTATCA
CAGCATCCCGAGGCATCCGCCAAGGGGATCCCCTCTCTCCCTTCCTATTTCTCTTAGTAAGTGAGGTTTTAGGAGAAATCATAAGCAAGCTTCATAGCAGCAGACAATTT
GAAGGTTTTCTAGTTGGTAAAGATTCGATCCACCTCTCTTTGCTGCAATTTGCCGATGACACTCTTCTATTTTGTAAGTATGATGTGAATGTGCTTTTTAAGCTTAAGGA
AGCCATAATCTTATTCAAATGGCGTTCGGGGCAGAAAGTAAACTGGGAGAAATCTGCCCTCAGTGGTGTTAATGTGGGGGCAGACGAATTGACTCAAACAACAAATCTAC
TAGGAGGTGGAAGACATACCTTGTACAACTCAGTTTTTGTTAGTCTTCCCACATACTACCTGTCTTTATTTTCAATTCCTGAAAATGTGGCTGCCTCATTAGAGAGAATC
ATGCAGAAATTTTTTTGGGAAGGTCATGCTGGCAGCAAAATCAACCACCTAGTTAAGTGGAACAAGGTTTCATCTCCTTTAAAAGATGGAGGTCTCAGCCTAGGAGGAAT
TAAAATCCATAACTCTGCTTTGCTTGCTAAATGGGGTTGGAGATACTCGAAAGAAGAGTTAGCTCTTTGGAGGAAAATAGTTAGAAGCATCCAGTTTGGAGGTTGGTTGA
CTGCCTTGCCTCCTTTAACCTTGGTAACAGTATTAGAATTGGGTTTTGGACAGACCCTTGAGTTGGTAACAAGGAGCAGTTCTCTAGTCTCTTTAGAATTGCTCTTTTGC
CCTCCGGTTCAAAGAAGCTTGAAAGATGAAGAAATTGTAGAATTTCAATCACTTCTGTCCCTCCTTTCAACAGAAAAAGTGGTTACTTCTGACGACTTTCGATCATGGTC
CATTGATTCCATGGACGCCTATCCGATGAACAAACGGCTTTTTTCAGCAATCTGGAAATCAGGCAGCCCTAGGAGAACTAACATATTGATCTGGATAATGATCTTCGAGT
CCCTTAATAGTTCAGAAATTTTGCAAAAGAAATCCCCGAATAAATATCTTTCTCCTTCGATCTGTCCTCTTTGCTTGAAGGCTACAAGGCAAAGGGCAGAAATTATGCGT
GCAGTAGATCTTAATGCTGCAGCTTGGCGCTCATTAAAAAAGGAATTTTATGGTGCAATGCAGTTTAAGCTCTTGTTTGAGATCTGGCTTGAACATAATCAACGAGTTTT
TCATAATCGAACTTTTCAGTTCAATGCTGGTAATCATGACTCCTTGCACGAAACCAAACACTGTCTAGAACTCGCTGCGGGCCCATTGCTTTTTCAAAGGCATTATACTC
CTTGGTTGATTGATTGGGATCGATTCAGTCCAGATTTATCCCCACCCTTGAAGATGGGGATTGATGCTGTAATCTATAACACCAACTGCCTCCATAAAACTTTTGTACAC
AGCCAAAAAATAGCCAGGGCACATCTGGGTGTTCTTCAACTTGAAGTTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAAATTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCT
CCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCT
CTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACT
CCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTC
CCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACC
TCCCCTCTCCCCTCTCCCCTCTCCACCCTACTCCCTCCAACCTCCCCTCTCCCCTCTCCCCTCTCCTCTCTCCACCCCACTCCCTCCGACCTCCCCTTTCCGGCAGCCAG
GTTTTTCAGCAGCCCTCTCCATTTAGTTCACCATGGAGGTGGTTAGTTGTAAAATTGACCAGTCACACTATTGCATTTGACAAGAAGGAGATCGTCGCTTTATTGAAGAT
TTTTTCGGTAGAGAGACATGTTAGGAAACATAGAGATATTGACAGAGGCAGATTGAGTATTTCAAAGTTTCAAGCTTCGTTGGGTTGGATACTGAGTTGTGATCACTGGC
CATATTCAGGGGGTCTCTTTAATATCAGAGTGTGCTTAGGTAGCAAGAAGCAAGGTTGGAAATTGTTTTGCTCGATGCTAGATAAATTTGTTGAAAAACTGGATTATGCT
AGATGGTTTGCTGAAAATTCCCTCAATATTCCTCCTCCGATTTTGAATAGGAAACTAAGTTATGGTGAAATGGCAAAATCAGTTCTTTCCAATTCCCCTGTTCCAAAACC
TAATTTTAAGGTTACTAACCAGCCTCCTAGTCAAAAAAGGAAGAATGCCCTCATTAGCATTGACGAAGGATCGATTTTAGATCTCATACATGAGGAAGGAAAATGGCAAG
CCTGGGGAATTTTCATCTCAAATTCGAAAGATGGAATAACTTTAATCACAGTCGGCCTCTTGTTCAAAAAGGCTATGGCGGTTGGCTCAAAATTAAAAAACTTCCCTTGG
ATTATTGGTGAAGTACCTTTGAGGATTAGAGAAGTTATGATTGATGAAGATCTTGATCCCTCCTTTCCTCTAGTCCAGAAAATCCCGAAATCAGTTGAAGAAGCATTGAA
TGATCGGGCAACTGAACAGGCCCCTCTTTTGAATTTTACTAAACCCAAGTCAAACAGCTGTTGCAAAAAGGCAGTGGGGTCCACTCCTAATGATCCTGTTTTTAATATTC
CAGTAGCCGAAGAGTCCTCCTTTGAATCTTTAAATGAAAGGAATAAAGGAAAAGGAGTCCACTTCAATAAATCTTTGAGTTATAACCATTTTTGCCCAAGTTTTTCAGAA
GACAACTCAACAAATTTATTGCAGAAAAGGTTACAAATCGTTTCTGGTTTTAATAATCCAGACAGCTCAGCGGATTTATTGCCAAAACCATTGCAAAATGTTTCAGTTTC
TGATTCTTTTCTTTATGTTAGAACGGGGGCCATCTTTCATCAGAACCCTGCTAAATCTGAATTAAATGGGATTTGTGACCTTAAAATTCCCCCTTTGGACTCTTTTCAGA
ATAACACCATCAAGCATTCAGTTTCTGTCAATAACCCAAAACTTCTGGAAGTTTACAGCTCTAAATCCAAGTCCCTGAGTCAAGGCTCTTCATGCAAAGCTGACTTTAGT
CCTAAAGACCAGCCCAAAACTTTATCTCCTATCAGTGTTAGTAGTGAAGAATCAGATGATCTTAGTGAAGTTGACCCTAAATTGGATTCAGAAATTGAAGGGAGGGGCTT
AAATGTTTTGTTTAATGAGGAGGCCATCCCGATCCTTCTCCTAGCCATGATGAAGATTGTCTCATGGAACACTAGGGGCTTAAGTGATTTCTCTAAACGTGCAGCACTCA
AAAAGTTCATCATAAACTGTAATCCAGATGTGGTCATGATTCAAGAATCTAAGAAAGAAAGTTTGAATTCAGTTTTCATTAAATCACTGTGGAGCTCCAAGGATATTGGC
TGGGAATTTGTGGCATCTGTTAAGTCATCCGGGGGAATCCTTACAATAAAAAGAAAATTGGTTTGGCCAGAATTGTCTTCCTTAGATGTTTGTGCGGCAGCAGCTAGGTG
TGTTGGTGGGGATTTTAATATAACTCGATGGGCACATGAAAGATTTCCCTTTGGCAGGAGCACTAGAGGTATGTCTTTGTTCAATAAGTTCATTGAGTTGGGTAATCTCA
TGGAAATTCCTTTGCAAAATGGCAGGTTCACTTGGTCTAGAGAAGGGTTTTCTCCTTCTAGATCACTATTGGACAGATTCTTAATAAATCAGCAGTGGGACGATCTTTTC
GAAAACTCTCGGGTGTCACACAAAGCACGTATTTTCTCGGATCACTTTCCTCTAATGTTGGAGGCTGGAGCTTTTCTTTGGGGACCCTCTCCTTTTCGATTTTGTAACAG
TTGGTTGTTGTCCAATGAATGTAATCAGATTATAGTTGAAACAGTCAATAATTCTAACTTCTACGGTTGGGCTGGATTTATTTTACAAGAACAGTTGAGATCAGTTAAAT
TGGCAGTTAAAGCTTGGAAAATTGAAGCACAAGCCAAGATTAGAAAAAATGTAACTCTTGTTGCTCAATTTACCGTAGAGGAAATATTTAAGGCTCTGAAGGCACTTGGC
AGCAATAAAGCTTCGGGCCCAGATGGTTTTACAGTGGAGTTCCTCCTCAAACACTGGTCTATTTTCAAGGATATGTTTGAATCCTTAATGGATGATTTCCACATAAACGG
AAAATTAAACGCTTGCATTCAAGAAAATTTCATTTGTTTAGTGCAGAAAAAGGAGAACGCAACTCTAGTCAAGGATTTCCGTCCAATCAGCCCTACTACCTTAACATACA
AGGTAGTTGCAAAGGTTCTTTCGGAATGCTTGAAACAAGTTATCAATGCAATTATAAATCCTTCACAAAGTGCTTTTATTGAAGGTAGGCATATTCTTGACCCTATATTA
ATTGCTAATGAGGCAGTGGAAGACTATAGGGCAAAACGAAAAAAGGGATGGATCTTAAAACTTGATCTAGAAAAAGCCTTTCATAGGGTGGATTGGAATTTTCTAGAAAA
GATATTATCTTTTAAAATATTCAGCCCCAAGTGGGTCTCATGGATAATGGGTTGCATAAAAAGTCCAAAATTCTCATTATTCATTAATGGTAAGCCAAGAGGCCGTATCA
CAGCATCCCGAGGCATCCGCCAAGGGGATCCCCTCTCTCCCTTCCTATTTCTCTTAGTAAGTGAGGTTTTAGGAGAAATCATAAGCAAGCTTCATAGCAGCAGACAATTT
GAAGGTTTTCTAGTTGGTAAAGATTCGATCCACCTCTCTTTGCTGCAATTTGCCGATGACACTCTTCTATTTTGTAAGTATGATGTGAATGTGCTTTTTAAGCTTAAGGA
AGCCATAATCTTATTCAAATGGCGTTCGGGGCAGAAAGTAAACTGGGAGAAATCTGCCCTCAGTGGTGTTAATGTGGGGGCAGACGAATTGACTCAAACAACAAATCTAC
TAGGAGGTGGAAGACATACCTTGTACAACTCAGTTTTTGTTAGTCTTCCCACATACTACCTGTCTTTATTTTCAATTCCTGAAAATGTGGCTGCCTCATTAGAGAGAATC
ATGCAGAAATTTTTTTGGGAAGGTCATGCTGGCAGCAAAATCAACCACCTAGTTAAGTGGAACAAGGTTTCATCTCCTTTAAAAGATGGAGGTCTCAGCCTAGGAGGAAT
TAAAATCCATAACTCTGCTTTGCTTGCTAAATGGGGTTGGAGATACTCGAAAGAAGAGTTAGCTCTTTGGAGGAAAATAGTTAGAAGCATCCAGTTTGGAGGTTGGTTGA
CTGCCTTGCCTCCTTTAACCTTGGTAACAGTATTAGAATTGGGTTTTGGACAGACCCTTGAGTTGGTAACAAGGAGCAGTTCTCTAGTCTCTTTAGAATTGCTCTTTTGC
CCTCCGGTTCAAAGAAGCTTGAAAGATGAAGAAATTGTAGAATTTCAATCACTTCTGTCCCTCCTTTCAACAGAAAAAGTGGTTACTTCTGACGACTTTCGATCATGGTC
CATTGATTCCATGGACGCCTATCCGATGAACAAACGGCTTTTTTCAGCAATCTGGAAATCAGGCAGCCCTAGGAGAACTAACATATTGATCTGGATAATGATCTTCGAGT
CCCTTAATAGTTCAGAAATTTTGCAAAAGAAATCCCCGAATAAATATCTTTCTCCTTCGATCTGTCCTCTTTGCTTGAAGGCTACAAGGCAAAGGGCAGAAATTATGCGT
GCAGTAGATCTTAATGCTGCAGCTTGGCGCTCATTAAAAAAGGAATTTTATGGTGCAATGCAGTTTAAGCTCTTGTTTGAGATCTGGCTTGAACATAATCAACGAGTTTT
TCATAATCGAACTTTTCAGTTCAATGCTGGTAATCATGACTCCTTGCACGAAACCAAACACTGTCTAGAACTCGCTGCGGGCCCATTGCTTTTTCAAAGGCATTATACTC
CTTGGTTGATTGATTGGGATCGATTCAGTCCAGATTTATCCCCACCCTTGAAGATGGGGATTGATGCTGTAATCTATAACACCAACTGCCTCCATAAAACTTTTGTACAC
AGCCAAAAAATAGCCAGGGCACATCTGGGTGTTCTTCAACTTGAAGTTCCTTAG
Protein sequenceShow/hide protein sequence
MRNSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPT
PSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLHPTPSNLPSPLSPLLSPPHSLRPPLSGSQ
VFQQPSPFSSPWRWLVVKLTSHTIAFDKKEIVALLKIFSVERHVRKHRDIDRGRLSISKFQASLGWILSCDHWPYSGGLFNIRVCLGSKKQGWKLFCSMLDKFVEKLDYA
RWFAENSLNIPPPILNRKLSYGEMAKSVLSNSPVPKPNFKVTNQPPSQKRKNALISIDEGSILDLIHEEGKWQAWGIFISNSKDGITLITVGLLFKKAMAVGSKLKNFPW
IIGEVPLRIREVMIDEDLDPSFPLVQKIPKSVEEALNDRATEQAPLLNFTKPKSNSCCKKAVGSTPNDPVFNIPVAEESSFESLNERNKGKGVHFNKSLSYNHFCPSFSE
DNSTNLLQKRLQIVSGFNNPDSSADLLPKPLQNVSVSDSFLYVRTGAIFHQNPAKSELNGICDLKIPPLDSFQNNTIKHSVSVNNPKLLEVYSSKSKSLSQGSSCKADFS
PKDQPKTLSPISVSSEESDDLSEVDPKLDSEIEGRGLNVLFNEEAIPILLLAMMKIVSWNTRGLSDFSKRAALKKFIINCNPDVVMIQESKKESLNSVFIKSLWSSKDIG
WEFVASVKSSGGILTIKRKLVWPELSSLDVCAAAARCVGGDFNITRWAHERFPFGRSTRGMSLFNKFIELGNLMEIPLQNGRFTWSREGFSPSRSLLDRFLINQQWDDLF
ENSRVSHKARIFSDHFPLMLEAGAFLWGPSPFRFCNSWLLSNECNQIIVETVNNSNFYGWAGFILQEQLRSVKLAVKAWKIEAQAKIRKNVTLVAQFTVEEIFKALKALG
SNKASGPDGFTVEFLLKHWSIFKDMFESLMDDFHINGKLNACIQENFICLVQKKENATLVKDFRPISPTTLTYKVVAKVLSECLKQVINAIINPSQSAFIEGRHILDPIL
IANEAVEDYRAKRKKGWILKLDLEKAFHRVDWNFLEKILSFKIFSPKWVSWIMGCIKSPKFSLFINGKPRGRITASRGIRQGDPLSPFLFLLVSEVLGEIISKLHSSRQF
EGFLVGKDSIHLSLLQFADDTLLFCKYDVNVLFKLKEAIILFKWRSGQKVNWEKSALSGVNVGADELTQTTNLLGGGRHTLYNSVFVSLPTYYLSLFSIPENVAASLERI
MQKFFWEGHAGSKINHLVKWNKVSSPLKDGGLSLGGIKIHNSALLAKWGWRYSKEELALWRKIVRSIQFGGWLTALPPLTLVTVLELGFGQTLELVTRSSSLVSLELLFC
PPVQRSLKDEEIVEFQSLLSLLSTEKVVTSDDFRSWSIDSMDAYPMNKRLFSAIWKSGSPRRTNILIWIMIFESLNSSEILQKKSPNKYLSPSICPLCLKATRQRAEIMR
AVDLNAAAWRSLKKEFYGAMQFKLLFEIWLEHNQRVFHNRTFQFNAGNHDSLHETKHCLELAAGPLLFQRHYTPWLIDWDRFSPDLSPPLKMGIDAVIYNTNCLHKTFVH
SQKIARAHLGVLQLEVP