; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0012692 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0012692
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionTransposase
Genome locationchr10:25080113..25086442
RNA-Seq ExpressionIVF0012692
SyntenyIVF0012692
Gene Ontology termsNA
InterPro domainsIPR004242 - Transposon, En/Spm-like
IPR025312 - Domain of unknown function DUF4216
IPR025452 - Domain of unknown function DUF4218
IPR029480 - Transposase-associated domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046884.1 transposase [Cucumis melo var. makuwa]0.051.58Show/hide
Query:  MDRSWMHKSRLSKDYGLGVENFISFGFSNTKDASIRCSCLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGEQ-LPESSLYEESSKFDTHMYEENDVG
        MDRSWMHKSRL KDY LGVENFISFGFSNTKDASIRC CLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGEQ LPESSLYEESSKFDTHMYE NDVG
Subjt:  MDRSWMHKSRLSKDYGLGVENFISFGFSNTKDASIRCSCLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGEQ-LPESSLYEESSKFDTHMYEENDVG

Query:  SINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGALGMSYEKIH
         INE IEVAHEEYSKDPNEFEKLLNDAEKSLYE CKKFTKLSTLVKLYNLKVRYGWSDI FSELLKTLKEILPT NEIPTS+YEAKKTLGALGMSYEKIH
Subjt:  SINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGALGMSYEKIH

Query:  ACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLIDLKWPDFG
        AC NDCCLYRKEHANATEC ECGESRWKYANNAN GKKQIP KVVWYFP IPRFKRLFRSI+NAKNLIWHSNERVI GKLRHPADSPAWKLIDLKWPDFG
Subjt:  ACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLIDLKWPDFG

Query:  SEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDIVIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHL
        SEPRNIRLALSAD INPH EMSSKYSCWPVVIVI+NLPPWLCMKRKFMMLS+LISGPRQPGDDI                                    
Subjt:  SEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDIVIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHL

Query:  TVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLY
                              G Y+  L                     ED++                  L+ES            GV          
Subjt:  TVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLY

Query:  QAHRYVINGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGH
                      E YD N+    N   ++  T            I D S YG        N + F+V       +E    + IDD        K +G 
Subjt:  QAHRYVINGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGH

Query:  KSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLVE
             ++     +  +VE P   R                   + G  ++                        +D+             R  GERKLVE
Subjt:  KSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLVE

Query:  YNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEKIYTIVEV-------------------------------------------------
        YNEDGVPIGENGAKLNSFI SCVHYHIPIIYATWIDVPAELKEKIYTIVE                                                  
Subjt:  YNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEKIYTIVEV-------------------------------------------------

Query:  ----------------------KRKLQQERRKKNKYNHRLSRKGYANLREEL---------------------------------------NDISKTCAD
                              KRKLQQERRKKNKYNHRLSRKGYANL+EEL                                       ++ISKTCAD
Subjt:  ----------------------KRKLQQERRKKNKYNHRLSRKGYANLREEL---------------------------------------NDISKTCAD

Query:  KEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHIST----------------------------
        KE SPNDVLTQALGTRESSGRVRGVG FVTPSTYFHTAKRSKKRNEEIDKLS+ENEKLR +    + +++                              
Subjt:  KEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHIST----------------------------

Query:  ---------------------QSTPTSAHGS--------------CSQTKIG----IRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMA
                                P S  G               C+  ++     + Y++YLHSQD+LSNYIFV PSLISVGHNTQEVRARNLCSRLMA
Subjt:  ---------------------QSTPTSAHGS--------------CSQTKIG----IRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMA

Query:  SKPNQLVLAPFNP---WA--------------------------------ITIFRSQKNIQTSRKQPIWKTVK
        SK NQLVLAPFNP   WA                                ITIF SQKNIQTSRKQPIWKTVK
Subjt:  SKPNQLVLAPFNP---WA--------------------------------ITIFRSQKNIQTSRKQPIWKTVK

KAA0054128.1 pol protein [Cucumis melo var. makuwa]0.077.41Show/hide
Query:  MKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHRYV-------------------
        MKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHRYV                   
Subjt:  MKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHRYV-------------------

Query:  --------------------------------------------------------------INGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKN
                                                                      INGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKN
Subjt:  --------------------------------------------------------------INGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKN

Query:  PIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDE
        PIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDE
Subjt:  PIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDE

Query:  LGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEK
        LGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEK
Subjt:  LGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEK

Query:  IYTIVE-----------------------------------------------------------------------VKRKLQQERRKKNKYNHRLSRKG
        IYTIVE                                                                       VKRKLQQERRKKNKYNHRLSRKG
Subjt:  IYTIVE-----------------------------------------------------------------------VKRKLQQERRKKNKYNHRLSRKG

Query:  YANLREELNDISKTCADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHISTQSTPTSAHGSC
        YANLREELNDISKTCADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHISTQSTPTSAHGSC
Subjt:  YANLREELNDISKTCADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHISTQSTPTSAHGSC

Query:  SQTKIGIRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNPWAITIFRSQKNIQTSRKQPIWKTVKEYYVVVS
        SQTKIGIRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNPWAITIFRSQKNIQTSRKQPIWKTVK    V+S
Subjt:  SQTKIGIRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNPWAITIFRSQKNIQTSRKQPIWKTVKEYYVVVS

TYK03264.1 transposase [Cucumis melo var. makuwa]0.053.08Show/hide
Query:  MDRSWMHKSRLSKDYGLGVENFISFGFSNTKDASIRCSCLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGEQ-LPESSLYEESSKFDTHMYEENDVG
        MDRSWMHKSRL KDY LGVENFISFGFSNTKDASIRC CLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGEQ LPESSLYEESSKFDTHMYE NDVG
Subjt:  MDRSWMHKSRLSKDYGLGVENFISFGFSNTKDASIRCSCLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGEQ-LPESSLYEESSKFDTHMYEENDVG

Query:  SINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGALGMSYEKIH
         INE IEVAHEEYSKDPNEFEKLLNDAEKSLYE CKKFTKLSTLVKLYNLKVRYGWSDI FSELLKTLKEILPT NEIPTS+YEAKKTLGALGMSYEKIH
Subjt:  SINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGALGMSYEKIH

Query:  ACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLIDLKWPDFG
        AC NDCCLYRKEHANATEC ECGESRWKYANNAN GKKQIP KVVWYFP IPRFKRLFRSI+NAKNLIWHSNERVI GKLRHPADSPAWKLIDLKWPDFG
Subjt:  ACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLIDLKWPDFG

Query:  SEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDIVIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHL
        SEPRNIRLALSAD INPH EMSSKYSCWPVVIVI+NLPPWLCMKRKFMMLS+LISGPRQPGDDI                                    
Subjt:  SEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDIVIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHL

Query:  TVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLY
                              G Y+  L                     ED++                  L+ES            GV          
Subjt:  TVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLY

Query:  QAHRYVINGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGH
                      E YD N+    N   ++  T            I D S YG        N + F+V       +E    + IDD        K +G 
Subjt:  QAHRYVINGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGH

Query:  KSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLVE
             ++     +  +VE P   R                   + G  ++                        +D+             R  GERKLVE
Subjt:  KSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLVE

Query:  YNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEKIYTIVEV-------------------------------------------------
        YNEDGVPIGENGAKLNSFI SCVHYHIPIIYATWIDVPAELKEKIYTIVE                                                  
Subjt:  YNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEKIYTIVEV-------------------------------------------------

Query:  ----------------------KRKLQQERRKKNKYNHRLSRKGYANLREEL---------------------------------------NDISKTCAD
                              KRKLQQERRKKNKYNHRLSRKGYANL+EEL                                       ++ISKTCAD
Subjt:  ----------------------KRKLQQERRKKNKYNHRLSRKGYANLREEL---------------------------------------NDISKTCAD

Query:  KEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHIST----------------------------
        KE SPNDVLTQALGTRESSGRVRGVG FVTPSTYFHTAKRSKKRNEEIDKLS+ENEKLR +    + +++                              
Subjt:  KEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHIST----------------------------

Query:  ---------------------QSTPTSAHGS--------------CSQTKIG----IRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMA
                                P S  G               C+  ++     + Y++YLHSQD+LSNYIFV PSLISVGHNTQEVRARNLCSRLMA
Subjt:  ---------------------QSTPTSAHGS--------------CSQTKIG----IRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMA

Query:  SKPNQLVLAPFNPWAITIFRSQKNIQTSRKQPIWKTVK
        SK NQLVLAPFNP AITIF SQKNIQTSRKQPIWKTVK
Subjt:  SKPNQLVLAPFNPWAITIFRSQKNIQTSRKQPIWKTVK

TYK21059.1 transposase [Cucumis melo var. makuwa]0.051.6Show/hide
Query:  MYEENDVGSINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGAL
        MYEENDVGSINE IEVAHEEYSKDPNEFEKLLNDAEK +YE CKKFTKLSTLVKLYNLKVR                                       
Subjt:  MYEENDVGSINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGAL

Query:  GMSYEKIHACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLI
                          KEHANATEC ECGESRWKY NNAN GKKQIP KVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPAD PAWKLI
Subjt:  GMSYEKIHACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLI

Query:  DLKWPDFGSEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDIVIDVQQLEKLEEDIVVTICLLEKYFPPS
        DLKWPDFGSEPRNIRLAL                                   KFMMLS+LI GPRQPGDDI                            
Subjt:  DLKWPDFGSEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDIVIDVQQLEKLEEDIVVTICLLEKYFPPS

Query:  FFTIMMHLTVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSF
                                                                                    GIDRL  SLDNSSFGRPLSAGVSF
Subjt:  FFTIMMHLTVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSF

Query:  KPEQDLLYQAHRYV---------------------------------------------------------------------------------INGCR
        KP+QDLLYQAHRYV                                                                                 INGCR
Subjt:  KPEQDLLYQAHRYV---------------------------------------------------------------------------------INGCR

Query:  YHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQA
        YHTESYDKNRSVQN+GVSL+AKTMQVSSSKDKN IIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNN MKI DLGFVLVDLKRIGHKSDSFIMATQA
Subjt:  YHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQA

Query:  RQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTR-----------------------
        RQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGV SDMPNVD GNDLDENMSTYD LFTHGANDS R                       
Subjt:  RQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTR-----------------------

Query:  ------------------RSRQGGERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEKIYTIVEVKRKLQQERRKKNKYNH---
                          R R   ERKLVEYN DGVPIGENGAKLNSFI SCVHYHI IIYATWIDVPAELKEKIYTIVE  + +  E  + ++ +    
Subjt:  ------------------RSRQGGERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEKIYTIVEVKRKLQQERRKKNKYNH---

Query:  -RLSRKG-YAN--LREELN-DISKTCADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHIST
         R+ +KG Y N  ++E +N +ISKTCADKEPSPN+VLTQAL                        AKRSKKRNEEIDKLSEENEKLRLRVQELENIHIST
Subjt:  -RLSRKG-YAN--LREELN-DISKTCADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHIST

Query:  QSTPTSAHGSCSQTKIG-----------------------------------------------------------------------------------
        QSTPT AHGSCS+ ++                                                                                    
Subjt:  QSTPTSAHGSCSQTKIG-----------------------------------------------------------------------------------

Query:  -------IRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNP---WA-------------------------------
               + Y  YLHSQDELSNYIFVDPSLI VGHNTQEVRARNL SRLMASKPNQLVLAPFNP   WA                               
Subjt:  -------IRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNP---WA-------------------------------

Query:  -ITIFRSQKNIQTSRKQPIWKTVK
         ITIFRSQKNIQTSRKQPIWKT+K
Subjt:  -ITIFRSQKNIQTSRKQPIWKTVK

TYK22869.1 hypothetical protein E5676_scaffold334G00040 [Cucumis melo var. makuwa]0.039.84Show/hide
Query:  MYEENDVGSINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGAL
        MYEENDVG+I E +E+AHE+YSKDP+ FEKLLND EK LYE CKKFTKLSTLVKLYNLKVR+GWS+I FSELLK LK+ILP+ N++PTSMYEAKK LGAL
Subjt:  MYEENDVGSINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGAL

Query:  GMSYEKIHACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLI
        GM YEKIHAC NDCCLYRKE+ANA  C ECGESRWKY  +AN  KK+IP K++WYFP IPRF+R+FRS++ AKNL WH+ ER ID KLRHPADSPAWKL+
Subjt:  GMSYEKIHACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLI

Query:  DLKWPDFGSEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDI----------------------------
        D  WP+F SEPRN+RLALSADGINP+ +MSSKYSCWPVV+VI+NLPPWLCMKRKFMMLSILISGP+QPGDDI                            
Subjt:  DLKWPDFGSEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDI----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------VIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHLTVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCS
               VIDV Q+EKL+EDIV+T+CLLEKYFPPSFFTIM+HLTV++VREVKLC  +YLRW+YPF R+MKV+KN VRNR+RPEGCIAE YI+EE +EFCS
Subjt:  -------VIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHLTVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCS

Query:  DFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHRYV---------------------------------------------------
        +FL  V+P+GLG  +L ++ D S  GRPLS+GV+  PE++LLYQAHRYV                                                   
Subjt:  DFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHRYV---------------------------------------------------

Query:  ------------------------------INGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDW
                                      INGC YHT+S+DKN++VQN+GVSL+AKTMQV SSKDKNPIIG+MSFYGVI++IWELNYN+F VA+F+CDW
Subjt:  ------------------------------INGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDW

Query:  VENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTY-
        VEN+ G+K D+LGFVLVDL R+GHK+DSFI ATQA+QVF+VEDPSD+RWSIVLTPPQRD  DQ NDDELGD +L+CQG+     +++   DLDEN  TY 
Subjt:  VENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTY-

Query:  -----------------------------DILFTHGANDSTRRSRQG-------------GERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYAT
                                     D L    A+ + ++  +G             GE+ +VEYNE+G+PIGENG KL SFI SCVH+HIPI +A+
Subjt:  -----------------------------DILFTHGANDSTRRSRQG-------------GERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYAT

Query:  WIDVPAELKEKIYTIVEVK------------------------------------------------------------------------RKLQQERRK
        W  VP ELKEKI ++VE                                                                          R+ QQ+RRK
Subjt:  WIDVPAELKEKIYTIVEVK------------------------------------------------------------------------RKLQQERRK

Query:  KNKYNHRLSRKGYANLREEL------------------------------------NDISKTC--ADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPST
        ++KY H +SR+GYANL E++                                    N+I +    A  E SPND LTQALGT E  GRVRGVGGF+TP+ 
Subjt:  KNKYNHRLSRKGYANLREEL------------------------------------NDISKTC--ADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPST

Query:  YFHTAK-RSKKRNEEIDKLSEENEKLRLRVQELENIHISTQSTPTSAHGSCSQTKI--------------------------------------------
        YFH AK R  K+ +   ++ +ENE LR R++ELE      QS PTS HGSCS++K+                                            
Subjt:  YFHTAK-RSKKRNEEIDKLSEENEKLRLRVQELENIHISTQSTPTSAHGSCSQTKI--------------------------------------------

Query:  -----------------------------------------------------------GIRYSMYLHSQD-----------------------------
                                                                   GI    Y+  +D                             
Subjt:  -----------------------------------------------------------GIRYSMYLHSQD-----------------------------

Query:  -ELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNP---W--------------------------------AITIFRSQKNIQTSRKQ
         ++S Y+FVDPSLIS GH+T+E+RARNLCSRLM SK +QLV+AP+NP   W                                A+TIF+SQKN++ +RK 
Subjt:  -ELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNP---W--------------------------------AITIFRSQKNIQTSRKQ

Query:  PIWKTV
          WK V
Subjt:  PIWKTV

TrEMBL top hitse value%identityAlignment
A0A5A7TUX7 Transposase3.5e-27951.36Show/hide
Query:  MDRSWMHKSRLSKDYGLGVENFISFGFSNTKDASIRCSCLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDVG
        MDRSWMHKSRL KDY LGVENFISFGFSNTKDASIRC CLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGE QLPESSLYEESSKFDTHMYE NDVG
Subjt:  MDRSWMHKSRLSKDYGLGVENFISFGFSNTKDASIRCSCLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDVG

Query:  SINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGALGMSYEKIH
         INE IEVAHEEYSKDPNEFEKLLNDAEKSLYE CKKFTKLSTLVKLYNLKVRYGWSDI FSELLKTLKEILPT NEIPTS+YEAKKTLGALGMSYEKIH
Subjt:  SINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGALGMSYEKIH

Query:  ACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLIDLKWPDFG
        AC NDCCLYRKEHANATEC ECGESRWKYANNAN GKKQIP KVVWYFP IPRFKRLFRSI+NAKNLIWHSNERVI GKLRHPADSPAWKLIDLKWPDFG
Subjt:  ACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLIDLKWPDFG

Query:  SEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDIVIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHL
        SEPRNIRLALSAD INPH EMSSKYSCWPVVIVI+NLPPWLCMKRKFMMLS+LISGPRQPGDDI                       Y  P         
Subjt:  SEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDIVIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHL

Query:  TVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLY
           ++ ++KL                                                              L+ES            GV          
Subjt:  TVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLY

Query:  QAHRYVINGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNV-AVFRCDWVENNNGMKIDDLGFVLVDLKRIG
                      E YD N+    N   ++  T            I D S YG        N + F+V ++F                           
Subjt:  QAHRYVINGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNV-AVFRCDWVENNNGMKIDDLGFVLVDLKRIG

Query:  HKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLV
        H   + ++  Q ++V   E        +V+       E      + G  ++                        +D+  TH         R  GERKLV
Subjt:  HKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLV

Query:  EYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEKIYTIVEV------------------------------------------------
        EYNEDGVPIGENGAKLNSFI SCVHYHIPIIYATWIDVPAELKEKIYTIVE                                                 
Subjt:  EYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEKIYTIVEV------------------------------------------------

Query:  -----------------------KRKLQQERRKKNKYNHRLSRKGYANLREEL---------------------------------------NDISKTCA
                               KRKLQQERRKKNKYNHRLSRKGYANL+EEL                                       ++ISKTCA
Subjt:  -----------------------KRKLQQERRKKNKYNHRLSRKGYANLREEL---------------------------------------NDISKTCA

Query:  DKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHIST---------------------------
        DKE SPNDVLTQALGTRESSGRVRGVG FVTPSTYFHTAKRSKKRNEEIDKLS+ENEKLR +    + +++                             
Subjt:  DKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHIST---------------------------

Query:  ----------------------QSTPTSAHG--------------SCSQTKIG----IRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLM
                                 P S  G               C+  ++     + Y++YLHSQD+LSNYIFV PSLISVGHNTQEVRARNLCSRLM
Subjt:  ----------------------QSTPTSAHG--------------SCSQTKIG----IRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLM

Query:  ASKPNQLVLAPFNP---W--------------------------------AITIFRSQKNIQTSRKQPIWKTVK
        ASK NQLVLAPFNP   W                                AITIF SQKNIQTSRKQPIWKTVK
Subjt:  ASKPNQLVLAPFNP---W--------------------------------AITIFRSQKNIQTSRKQPIWKTVK

A0A5A7UGF9 Pol protein9.5e-29377.41Show/hide
Query:  MKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHR---------------------
        MKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHR                     
Subjt:  MKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHR---------------------

Query:  ------------------------------------------------------------YVINGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKN
                                                                    YVINGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKN
Subjt:  ------------------------------------------------------------YVINGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKN

Query:  PIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDE
        PIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDE
Subjt:  PIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDE

Query:  LGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEK
        LGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEK
Subjt:  LGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEK

Query:  IYTIV-----------------------------------------------------------------------EVKRKLQQERRKKNKYNHRLSRKG
        IYTIV                                                                       EVKRKLQQERRKKNKYNHRLSRKG
Subjt:  IYTIV-----------------------------------------------------------------------EVKRKLQQERRKKNKYNHRLSRKG

Query:  YANLREELNDISKTCADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHISTQSTPTSAHGSC
        YANLREELNDISKTCADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHISTQSTPTSAHGSC
Subjt:  YANLREELNDISKTCADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHISTQSTPTSAHGSC

Query:  SQTKIGIRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNPWAITIFRSQKNIQTSRKQPIWKTVKEYYVVVS
        SQTKIGIRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNPWAITIFRSQKNIQTSRKQPIWKTVK    V+S
Subjt:  SQTKIGIRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNPWAITIFRSQKNIQTSRKQPIWKTVKEYYVVVS

A0A5D3BVS7 Transposase2.8e-28452.85Show/hide
Query:  MDRSWMHKSRLSKDYGLGVENFISFGFSNTKDASIRCSCLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDVG
        MDRSWMHKSRL KDY LGVENFISFGFSNTKDASIRC CLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGE QLPESSLYEESSKFDTHMYE NDVG
Subjt:  MDRSWMHKSRLSKDYGLGVENFISFGFSNTKDASIRCSCLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGE-QLPESSLYEESSKFDTHMYEENDVG

Query:  SINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGALGMSYEKIH
         INE IEVAHEEYSKDPNEFEKLLNDAEKSLYE CKKFTKLSTLVKLYNLKVRYGWSDI FSELLKTLKEILPT NEIPTS+YEAKKTLGALGMSYEKIH
Subjt:  SINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGALGMSYEKIH

Query:  ACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLIDLKWPDFG
        AC NDCCLYRKEHANATEC ECGESRWKYANNAN GKKQIP KVVWYFP IPRFKRLFRSI+NAKNLIWHSNERVI GKLRHPADSPAWKLIDLKWPDFG
Subjt:  ACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLIDLKWPDFG

Query:  SEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDIVIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHL
        SEPRNIRLALSAD INPH EMSSKYSCWPVVIVI+NLPPWLCMKRKFMMLS+LISGPRQPGDDI                       Y  P         
Subjt:  SEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDIVIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHL

Query:  TVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLY
           ++ ++KL                                                              L+ES            GV          
Subjt:  TVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLY

Query:  QAHRYVINGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNV-AVFRCDWVENNNGMKIDDLGFVLVDLKRIG
                      E YD N+    N   ++  T            I D S YG        N + F+V ++F                           
Subjt:  QAHRYVINGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNV-AVFRCDWVENNNGMKIDDLGFVLVDLKRIG

Query:  HKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLV
        H   + ++  Q ++V   E        +V+       E      + G  ++                        +D+  TH         R  GERKLV
Subjt:  HKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLV

Query:  EYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEKIYTIVEV------------------------------------------------
        EYNEDGVPIGENGAKLNSFI SCVHYHIPIIYATWIDVPAELKEKIYTIVE                                                 
Subjt:  EYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWIDVPAELKEKIYTIVEV------------------------------------------------

Query:  -----------------------KRKLQQERRKKNKYNHRLSRKGYANLREEL---------------------------------------NDISKTCA
                               KRKLQQERRKKNKYNHRLSRKGYANL+EEL                                       ++ISKTCA
Subjt:  -----------------------KRKLQQERRKKNKYNHRLSRKGYANLREEL---------------------------------------NDISKTCA

Query:  DKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHIST---------------------------
        DKE SPNDVLTQALGTRESSGRVRGVG FVTPSTYFHTAKRSKKRNEEIDKLS+ENEKLR +    + +++                             
Subjt:  DKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKLSEENEKLRLRVQELENIHIST---------------------------

Query:  ----------------------QSTPTSAHG--------------SCSQTKIG----IRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLM
                                 P S  G               C+  ++     + Y++YLHSQD+LSNYIFV PSLISVGHNTQEVRARNLCSRLM
Subjt:  ----------------------QSTPTSAHG--------------SCSQTKIG----IRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLM

Query:  ASKPNQLVLAPFNPWAITIFRSQKNIQTSRKQPIWKTVK
        ASK NQLVLAPFNP AITIF SQKNIQTSRKQPIWKTVK
Subjt:  ASKPNQLVLAPFNPWAITIFRSQKNIQTSRKQPIWKTVK

A0A5D3CA82 Transposase1.1e-26450.46Show/hide
Query:  ASIRCSCLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDVGSINETIEVAHEEYSKDPNEFEKLLNDAEKSLYE
        A++ CS  +C      SR  +RDHLYVNGIDESYKIWFWHGE LP SS Y E SKFDTH  EENDVGS+ E IEVAHEEYSKDPN FEKLL DAEK LYE
Subjt:  ASIRCSCLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDVGSINETIEVAHEEYSKDPNEFEKLLNDAEKSLYE

Query:  ECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGALGMSYEKIHACHNDCCLYRKEHANATECLECGESRWKYANNA
         CKK+TKLSTLVKLYNLKVRYGWSDI FSELLKTLKEILPT+NE+P S+YEAKKTLGALGM YEKIHAC N+CCLYRKE ANATEC ECG+SRWK   + 
Subjt:  ECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGALGMSYEKIHACHNDCCLYRKEHANATECLECGESRWKYANNA

Query:  NGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLIDLKWPDFGSEPRNIRLALSADGINPHGEMSSKYSCWPVVIV
        N  +KQIP KV+WYFP IPRFKRLFRSI+ A+NL WH++ER+ DGKLRHPADSPAWKL+D KWPDFGSEPRN+RLALSADG+NPHG+MSSKYSCWP+V+V
Subjt:  NGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLIDLKWPDFGSEPRNIRLALSADGINPHGEMSSKYSCWPVVIV

Query:  IHNLPPWLCMKRKFMMLSILISGPRQPGDDI---------------------------------------------------------------------
        I+NLPPWLCMKRK+MMLS+LISGP+QPGDDI                                                                     
Subjt:  IHNLPPWLCMKRKFMMLSILISGPRQPGDDI---------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------VIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHLTVYIVREVKLCRLVYLRWVYPFG
                                                 V+D QQL+KLEEDIVVT+CL EKYFPPSFFTIM+HLTV+IVREVKLC  +YLRW+YPF 
Subjt:  -----------------------------------------VIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHLTVYIVREVKLCRLVYLRWVYPFG

Query:  RYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHRYV-----------------
        R+MKV+KN VRNR+RPEGCIAESY++EE IEFCSDFLSGV+PVGLG  +  + LD S+ GRPLS GV FKPEQ+LL QAHRYV                 
Subjt:  RYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHRYV-----------------

Query:  ----------------INGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGF
                        INGCRYHT+S +K+RSVQN+GVSL+AKTMQVSSSKDKNP+IGDMSFYGVIQ+IWELNYNTFNV VF+CDWV+N+ G++ID+LG+
Subjt:  ----------------INGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGF

Query:  VLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTY
         LVDL R+GHKSDSFI+A+QA+QVFYVEDPSD RWS+VLTPPQRD ED+ NDDELGD +L C+G+ +DMP+V   NDLDEN+STY
Subjt:  VLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTY

A0A5D3DHZ4 ULP_PROTEASE domain-containing protein2.3e-27839.78Show/hide
Query:  MYEENDVGSINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGAL
        MYEENDVG+I E +E+AHE+YSKDP+ FEKLLND EK LYE CKKFTKLSTLVKLYNLKVR+GWS+I FSELLK LK+ILP+ N++PTSMYEAKK LGAL
Subjt:  MYEENDVGSINETIEVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGAL

Query:  GMSYEKIHACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLI
        GM YEKIHAC NDCCLYRKE+ANA  C ECGESRWKY  +AN  KK+IP K++WYFP IPRF+R+FRS++ AKNL WH+ ER ID KLRHPADSPAWKL+
Subjt:  GMSYEKIHACHNDCCLYRKEHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLI

Query:  DLKWPDFGSEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDI----------------------------
        D  WP+F SEPRN+RLALSADGINP+ +MSSKYSCWPVV+VI+NLPPWLCMKRKFMMLSILISGP+QPGDDI                            
Subjt:  DLKWPDFGSEPRNIRLALSADGINPHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDI----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------VIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHLTVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCS
               VIDV Q+EKL+EDIV+T+CLLEKYFPPSFFTIM+HLTV++VREVKLC  +YLRW+YPF R+MKV+KN VRNR+RPEGCIAE YI+EE +EFCS
Subjt:  -------VIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHLTVYIVREVKLCRLVYLRWVYPFGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCS

Query:  DFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHRYV---------------------------------------------------
        +FL  V+P+GLG  +L ++ D S  GRPLS+GV+  PE++LLYQAHRYV                                                   
Subjt:  DFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHRYV---------------------------------------------------

Query:  ------------------------------INGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDW
                                      INGC YHT+S+DKN++VQN+GVSL+AKTMQV SSKDKNPIIG+MSFYGVI++IWELNYN+F VA+F+CDW
Subjt:  ------------------------------INGCRYHTESYDKNRSVQNNGVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDW

Query:  VENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTY-
        VEN+ G+K D+LGFVLVDL R+GHK+DSFI ATQA+QVF+VEDPSD+RWSIVLTPPQRD  DQ NDDELGD +L+CQG+     +++   DLDEN  TY 
Subjt:  VENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTPPQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTY-

Query:  -----------------------------DILFTHGANDSTRRSRQG-------------GERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYAT
                                     D L    A+ + ++  +G             GE+ +VEYNE+G+PIGENG KL SFI SCVH+HIPI +A+
Subjt:  -----------------------------DILFTHGANDSTRRSRQG-------------GERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYAT

Query:  WIDVPAELKEKIYTIVE------------------------------------------------------------------------VKRKLQQERRK
        W  VP ELKEKI ++VE                                                                          R+ QQ+RRK
Subjt:  WIDVPAELKEKIYTIVE------------------------------------------------------------------------VKRKLQQERRK

Query:  KNKYNHRLSRKGYANLREEL------------------------------------NDISKTC--ADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPST
        ++KY H +SR+GYANL E++                                    N+I +    A  E SPND LTQALGT E  GRVRGVGGF+TP+ 
Subjt:  KNKYNHRLSRKGYANLREEL------------------------------------NDISKTC--ADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPST

Query:  YFHTAK-RSKKRNEEIDKLSEENEKLRLRVQELENIHISTQSTPTSAHGSCSQTKIG-------------------------------------------
        YFH AK R  K+ +   ++ +ENE LR R++ELE      QS PTS HGSCS++K+                                            
Subjt:  YFHTAK-RSKKRNEEIDKLSEENEKLRLRVQELENIHISTQSTPTSAHGSCSQTKIG-------------------------------------------

Query:  --------------------------------------------------------------------------------------IRYSMYLHS----Q
                                                                                              + Y  YL+S     
Subjt:  --------------------------------------------------------------------------------------IRYSMYLHS----Q

Query:  DELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNP---W--------------------------------AITIFRSQKNIQTSRKQ
         ++S Y+FVDPSLIS GH+T+E+RARNLCSRLM SK +QLV+AP+NP   W                                A+TIF+SQKN++ +RK 
Subjt:  DELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNP---W--------------------------------AITIFRSQKNIQTSRKQ

Query:  PIWKTVKE----YYVV
          WK V      YYV+
Subjt:  PIWKTVKE----YYVV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAGATCATGGATGCACAAGAGTAGGTTATCGAAAGATTATGGATTGGGTGTAGAAAACTTCATAAGTTTTGGATTTTCTAATACAAAAGATGCCTCTATT
CGTTGTTCTTGTTTGAAATGTGGGAATTGTGAAAAGCAAAGTCGTACTACTATTAGAGATCACTTATATGTCAATGGAATAGATGAAAGTTATAAAATTTGGTTT
TGGCATGGTGAACAACTACCTGAGTCATCCTTATACGAGGAATCTTCTAAGTTTGACACTCATATGTATGAAGAGAATGATGTTGGAAGTATAAATGAAACGATT
GAAGTTGCTCATGAGGAGTATTCAAAAGATCCAAATGAATTTGAGAAATTGCTTAATGATGCCGAAAAATCACTATATGAAGAATGCAAAAAATTCACCAAGTTG
TCCACATTAGTGAAGTTGTATAATTTAAAAGTTAGGTATGGATGGAGTGATATTAGATTTTCGGAACTACTGAAAACTTTAAAGGAAATTTTGCCTACTTCCAAT
GAGATCCCAACATCCATGTATGAAGCAAAGAAAACTTTAGGTGCATTAGGAATGAGTTATGAAAAGATTCATGCATGCCATAATGATTGTTGTTTGTATAGAAAA
GAACATGCTAATGCAACTGAATGTCTTGAATGTGGTGAATCAAGGTGGAAGTATGCTAATAATGCAAATGGAGGGAAAAAACAGATTCCTGGAAAAGTTGTATGG
TATTTTCCATCGATTCCACGTTTCAAAAGATTGTTTCGAAGTATTGACAATGCTAAAAATTTGATTTGGCATTCTAATGAACGAGTAATTGATGGAAAATTACGA
CATCCTGCAGACTCTCCAGCTTGGAAATTAATAGATTTGAAGTGGCCAGACTTTGGTTCTGAACCTAGAAATATTCGTTTAGCATTGTCAGCGGATGGAATCAAC
CCACACGGTGAAATGAGTTCTAAATATAGTTGTTGGCCCGTAGTGATTGTTATTCACAATCTTCCACCATGGTTGTGCATGAAAAGAAAGTTCATGATGTTATCA
ATTTTGATATCGGGTCCAAGGCAACCAGGGGATGACATTGTCATCGATGTGCAACAACTGGAAAAGTTGGAAGAAGATATTGTGGTGACTATATGTTTGCTAGAG
AAATACTTTCCTCCTTCATTTTTTACCATTATGATGCATCTCACTGTATACATTGTCAGAGAAGTCAAACTTTGTAGGCTTGTATATCTTAGATGGGTGTATCCC
TTCGGAAGATACATGAAAGTTCTAAAAAACTATGTGAGAAATAGACATCGTCCAGAGGGTTGTATTGCTGAGAGTTATATAGTAGAAGAAGATATTGAGTTTTGT
TCAGACTTCTTATCTGGGGTTGAGCCTGTTGGACTAGGCATTGATAGATTATATGAATCTTTAGACAATTCAAGCTTTGGTAGACCATTATCTGCAGGAGTTTCC
TTCAAACCTGAACAAGATCTTCTATACCAAGCTCATCGTTATGTCATTAATGGATGTCGCTATCACACGGAGTCTTATGATAAGAATCGAAGTGTTCAAAATAAT
GGAGTTAGTTTAATGGCCAAAACAATGCAAGTGTCTAGTTCGAAAGATAAAAATCCCATCATTGGAGATATGTCCTTTTATGGAGTGATTCAAGACATATGGGAA
CTTAATTACAATACGTTCAATGTTGCAGTGTTTAGATGCGATTGGGTTGAGAACAATAATGGTATGAAGATTGATGACCTTGGTTTTGTCTTAGTTGATTTAAAA
AGAATAGGTCATAAGTCTGATTCATTTATAATGGCAACCCAAGCAAGACAAGTTTTTTACGTTGAAGATCCAAGTGATGCTAGATGGTCAATTGTGCTTACTCCA
CCACAGAGAGATTGTGAAGATCAATCAAATGACGATGAACTTGGAGATATCATGTTACATTGTCAAGGAGTATCTAGTGATATGCCAAATGTCGATGGAGGTAAC
GATTTGGATGAGAATATGTCAACATATGATATTCTCTTTACACATGGAGCCAACGATAGTACGAGACGATCAAGACAAGGAGGTGAGAGAAAATTGGTTGAATAC
AACGAAGATGGCGTACCCATTGGAGAAAATGGGGCGAAGCTTAACTCCTTTATCGAGTCGTGTGTACACTACCATATCCCTATTATCTATGCAACATGGATAGAT
GTGCCTGCAGAATTGAAGGAAAAGATATATACCATAGTTGAGGTCAAAAGAAAACTTCAACAAGAGAGACGAAAGAAGAATAAGTATAACCATAGACTTTCAAGA
AAAGGTTATGCTAATCTTCGAGAGGAATTGAATGATATCTCCAAAACTTGTGCTGATAAAGAGCCGTCTCCAAACGATGTTTTGACACAAGCTTTAGGTACACGA
GAGAGTAGTGGACGTGTACGTGGAGTTGGCGGGTTTGTCACCCCAAGTACTTATTTTCACACAGCAAAACGCTCAAAGAAACGGAATGAAGAAATTGACAAGCTT
AGTGAAGAGAATGAAAAGCTTCGTTTGAGAGTTCAAGAATTAGAGAATATTCACATATCAACACAATCTACACCGACGTCTGCACATGGATCTTGCTCCCAGACC
AAGATTGGAATACGATATTCAATGTATTTGCATTCACAAGACGAACTTTCAAATTACATCTTCGTTGATCCATCCCTTATATCTGTTGGACACAACACCCAAGAA
GTTAGAGCTCGCAATCTTTGCAGTAGATTAATGGCTTCAAAACCAAACCAACTCGTTCTTGCTCCATTCAATCCATGGGCTATTACTATCTTCCGATCTCAGAAG
AATATTCAAACAAGTCGTAAACAACCAATATGGAAAACAGTGAAGGAATACTATGTGGTTGTTTCTTGGAAATATGCATTATATGTGGTTGTTTCTTGGAAATAT
GCATTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATAGATCATGGATGCACAAGAGTAGGTTATCGAAAGATTATGGATTGGGTGTAGAAAACTTCATAAGTTTTGGATTTTCTAATACAAAAGATGCCTCTATT
CGTTGTTCTTGTTTGAAATGTGGGAATTGTGAAAAGCAAAGTCGTACTACTATTAGAGATCACTTATATGTCAATGGAATAGATGAAAGTTATAAAATTTGGTTT
TGGCATGGTGAACAACTACCTGAGTCATCCTTATACGAGGAATCTTCTAAGTTTGACACTCATATGTATGAAGAGAATGATGTTGGAAGTATAAATGAAACGATT
GAAGTTGCTCATGAGGAGTATTCAAAAGATCCAAATGAATTTGAGAAATTGCTTAATGATGCCGAAAAATCACTATATGAAGAATGCAAAAAATTCACCAAGTTG
TCCACATTAGTGAAGTTGTATAATTTAAAAGTTAGGTATGGATGGAGTGATATTAGATTTTCGGAACTACTGAAAACTTTAAAGGAAATTTTGCCTACTTCCAAT
GAGATCCCAACATCCATGTATGAAGCAAAGAAAACTTTAGGTGCATTAGGAATGAGTTATGAAAAGATTCATGCATGCCATAATGATTGTTGTTTGTATAGAAAA
GAACATGCTAATGCAACTGAATGTCTTGAATGTGGTGAATCAAGGTGGAAGTATGCTAATAATGCAAATGGAGGGAAAAAACAGATTCCTGGAAAAGTTGTATGG
TATTTTCCATCGATTCCACGTTTCAAAAGATTGTTTCGAAGTATTGACAATGCTAAAAATTTGATTTGGCATTCTAATGAACGAGTAATTGATGGAAAATTACGA
CATCCTGCAGACTCTCCAGCTTGGAAATTAATAGATTTGAAGTGGCCAGACTTTGGTTCTGAACCTAGAAATATTCGTTTAGCATTGTCAGCGGATGGAATCAAC
CCACACGGTGAAATGAGTTCTAAATATAGTTGTTGGCCCGTAGTGATTGTTATTCACAATCTTCCACCATGGTTGTGCATGAAAAGAAAGTTCATGATGTTATCA
ATTTTGATATCGGGTCCAAGGCAACCAGGGGATGACATTGTCATCGATGTGCAACAACTGGAAAAGTTGGAAGAAGATATTGTGGTGACTATATGTTTGCTAGAG
AAATACTTTCCTCCTTCATTTTTTACCATTATGATGCATCTCACTGTATACATTGTCAGAGAAGTCAAACTTTGTAGGCTTGTATATCTTAGATGGGTGTATCCC
TTCGGAAGATACATGAAAGTTCTAAAAAACTATGTGAGAAATAGACATCGTCCAGAGGGTTGTATTGCTGAGAGTTATATAGTAGAAGAAGATATTGAGTTTTGT
TCAGACTTCTTATCTGGGGTTGAGCCTGTTGGACTAGGCATTGATAGATTATATGAATCTTTAGACAATTCAAGCTTTGGTAGACCATTATCTGCAGGAGTTTCC
TTCAAACCTGAACAAGATCTTCTATACCAAGCTCATCGTTATGTCATTAATGGATGTCGCTATCACACGGAGTCTTATGATAAGAATCGAAGTGTTCAAAATAAT
GGAGTTAGTTTAATGGCCAAAACAATGCAAGTGTCTAGTTCGAAAGATAAAAATCCCATCATTGGAGATATGTCCTTTTATGGAGTGATTCAAGACATATGGGAA
CTTAATTACAATACGTTCAATGTTGCAGTGTTTAGATGCGATTGGGTTGAGAACAATAATGGTATGAAGATTGATGACCTTGGTTTTGTCTTAGTTGATTTAAAA
AGAATAGGTCATAAGTCTGATTCATTTATAATGGCAACCCAAGCAAGACAAGTTTTTTACGTTGAAGATCCAAGTGATGCTAGATGGTCAATTGTGCTTACTCCA
CCACAGAGAGATTGTGAAGATCAATCAAATGACGATGAACTTGGAGATATCATGTTACATTGTCAAGGAGTATCTAGTGATATGCCAAATGTCGATGGAGGTAAC
GATTTGGATGAGAATATGTCAACATATGATATTCTCTTTACACATGGAGCCAACGATAGTACGAGACGATCAAGACAAGGAGGTGAGAGAAAATTGGTTGAATAC
AACGAAGATGGCGTACCCATTGGAGAAAATGGGGCGAAGCTTAACTCCTTTATCGAGTCGTGTGTACACTACCATATCCCTATTATCTATGCAACATGGATAGAT
GTGCCTGCAGAATTGAAGGAAAAGATATATACCATAGTTGAGGTCAAAAGAAAACTTCAACAAGAGAGACGAAAGAAGAATAAGTATAACCATAGACTTTCAAGA
AAAGGTTATGCTAATCTTCGAGAGGAATTGAATGATATCTCCAAAACTTGTGCTGATAAAGAGCCGTCTCCAAACGATGTTTTGACACAAGCTTTAGGTACACGA
GAGAGTAGTGGACGTGTACGTGGAGTTGGCGGGTTTGTCACCCCAAGTACTTATTTTCACACAGCAAAACGCTCAAAGAAACGGAATGAAGAAATTGACAAGCTT
AGTGAAGAGAATGAAAAGCTTCGTTTGAGAGTTCAAGAATTAGAGAATATTCACATATCAACACAATCTACACCGACGTCTGCACATGGATCTTGCTCCCAGACC
AAGATTGGAATACGATATTCAATGTATTTGCATTCACAAGACGAACTTTCAAATTACATCTTCGTTGATCCATCCCTTATATCTGTTGGACACAACACCCAAGAA
GTTAGAGCTCGCAATCTTTGCAGTAGATTAATGGCTTCAAAACCAAACCAACTCGTTCTTGCTCCATTCAATCCATGGGCTATTACTATCTTCCGATCTCAGAAG
AATATTCAAACAAGTCGTAAACAACCAATATGGAAAACAGTGAAGGAATACTATGTGGTTGTTTCTTGGAAATATGCATTATATGTGGTTGTTTCTTGGAAATAT
GCATTATAG
Protein sequenceShow/hide protein sequence
MDRSWMHKSRLSKDYGLGVENFISFGFSNTKDASIRCSCLKCGNCEKQSRTTIRDHLYVNGIDESYKIWFWHGEQLPESSLYEESSKFDTHMYEENDVGSINETI
EVAHEEYSKDPNEFEKLLNDAEKSLYEECKKFTKLSTLVKLYNLKVRYGWSDIRFSELLKTLKEILPTSNEIPTSMYEAKKTLGALGMSYEKIHACHNDCCLYRK
EHANATECLECGESRWKYANNANGGKKQIPGKVVWYFPSIPRFKRLFRSIDNAKNLIWHSNERVIDGKLRHPADSPAWKLIDLKWPDFGSEPRNIRLALSADGIN
PHGEMSSKYSCWPVVIVIHNLPPWLCMKRKFMMLSILISGPRQPGDDIVIDVQQLEKLEEDIVVTICLLEKYFPPSFFTIMMHLTVYIVREVKLCRLVYLRWVYP
FGRYMKVLKNYVRNRHRPEGCIAESYIVEEDIEFCSDFLSGVEPVGLGIDRLYESLDNSSFGRPLSAGVSFKPEQDLLYQAHRYVINGCRYHTESYDKNRSVQNN
GVSLMAKTMQVSSSKDKNPIIGDMSFYGVIQDIWELNYNTFNVAVFRCDWVENNNGMKIDDLGFVLVDLKRIGHKSDSFIMATQARQVFYVEDPSDARWSIVLTP
PQRDCEDQSNDDELGDIMLHCQGVSSDMPNVDGGNDLDENMSTYDILFTHGANDSTRRSRQGGERKLVEYNEDGVPIGENGAKLNSFIESCVHYHIPIIYATWID
VPAELKEKIYTIVEVKRKLQQERRKKNKYNHRLSRKGYANLREELNDISKTCADKEPSPNDVLTQALGTRESSGRVRGVGGFVTPSTYFHTAKRSKKRNEEIDKL
SEENEKLRLRVQELENIHISTQSTPTSAHGSCSQTKIGIRYSMYLHSQDELSNYIFVDPSLISVGHNTQEVRARNLCSRLMASKPNQLVLAPFNPWAITIFRSQK
NIQTSRKQPIWKTVKEYYVVVSWKYALYVVVSWKYAL