; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004474 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004474
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF668)
Genome locationchr6:4239785..4243968
RNA-Seq ExpressionLag0004474
SyntenyLag0004474
Gene Ontology termsGO:0045927 - positive regulation of growth (biological process)
InterPro domainsIPR007700 - Domain of unknown function DUF668
IPR021864 - Domain of unknown function DUF3475
IPR045021 - Protein PSK SIMULATOR


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598505.1 Protein PSK SIMULATOR 2, partial [Cucurbita argyrosperma subsp. sororia]3.1e-26183.79Show/hide
Query:  MGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNSGFFT
        MGGVCSNGI K+ F SEK TQTSEDRKGNSCL+SEA DP ++PQ+SRSGV LL SPPSKTGSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMN GFFT
Subjt:  MGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNSGFFT

Query:  GTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARL
          ASNGREISILAFEVANTISK+ANLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYF+RL
Subjt:  GTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARL

Query:  DLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        DLNDSS+KQAREARAA+QEL VLAQSTSELYHEL ALERFEQDYRRKVDEVES+NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEKLVIV
Subjt:  DLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEP
        VTWINQTIAKAF D+NTG         KTL IEDRSNGQKLG+VGLALHYANIISQINLIACRPTSIPSNMRDALYRALP SVK  LRSRL+ VD SEEP
Subjt:  VTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEP

Query:  TYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSP
        TY+ VKAEMD+IL+WLVPIAANTSKAHQACGRIGEWA+QSKE SKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH  IHL KQQ QRFTSLRC+SP
Subjt:  TYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSP

Query:  TPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        TPK+MA+PQP NARRIQFKSQIIKAN DG             TP IRKRDP    GTE+ +N+NKDKGIWTLSKG SVSTLSSL+R
Subjt:  TPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

KAG7029441.1 hypothetical protein SDJN02_07780, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-26383.9Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDRKGNSCL+SEA DP ++PQ+SRSGV LL SPPSKTGSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMN 
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFT  ASNGREISILAFEVANTISK+ANLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQSTSELYHEL ALERFEQDYRRKVDEVES+NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAKAF D+NTG         KTL IEDRSNGQKLG+VGLALHYANIISQINLIACRPTSIPSNMRDALYRALP SVK  LRSRL+ VD 
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMD+IL+WLVPIAANTSKAHQACGRIGEWA+QSKE SKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH  IHL KQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPTPK+MA+PQP NARRIQFKSQIIKAN DG             TP IRKRDP    GTE+ +N+NKDKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

XP_022961804.1 uncharacterized protein LOC111462458 isoform X1 [Cucurbita moschata]1.4e-26183.73Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDRKGNSCL+SEA DP ++PQ+SRSGV LL SPPSKTGSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMNS
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG ASNGREISILAFEVANTISKVANLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLL EVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQSTSELYHEL ALERFEQDYRRKVDEVES+NQAG GESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAK F D+NTG         KTL IEDRSNGQKLG+VGLALHYA IISQINLIACRPTSIPSNMRDALYRALPTSVK  LRSRL+ V+ 
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMD+IL+WLVPIAANTSKAHQACGRIGEWA+QSKEHSKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH  IHL KQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPT K+MA+PQP NARRIQFKSQIIKAN DG             TP IRKRDP    GTE+ +N NKDKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

XP_022996976.1 uncharacterized protein LOC111492046 isoform X1 [Cucurbita maxima]1.1e-26183.9Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDR GNSCLNSEA D  ++PQ+SRSGV LL SPPSK GSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMN 
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG ASNG EISILAFEVANTISKV NLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQSTSELYHEL ALERFEQDYRRKVDEVESLNQ GIGESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAKAFGD+NTG         KTL IEDRS GQKLG+VGLALHYANIISQINLIACRP SIPSNMRDALYRALPTSVK  LRSRL+ VD 
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMD+IL+WLVPIAANTSKAHQACGRIGEWA+QSKEHSKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH LIHL+KQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPTPK+MA+PQP NARRIQFKSQIIKAN DG             TP  RKRDP    GTE+ KNEN DKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

XP_023546943.1 uncharacterized protein LOC111805888 [Cucurbita pepo subsp. pepo]9.6e-26383.9Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDRKGNSCLNSEA DP ++ Q+SRSG  LL SPPSKTGSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMN 
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG ASNGREISILAFEVANTISKVANLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQSTSELYHEL ALERFEQDYRRKVDEVES+NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAK F D+NTG         KTL IEDRSNGQKLG+VGLALHYANIISQINLIACRPTSIPSNMRDALYRALP SVK  LRSRL+ V  
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMD+IL+WLVPIAANTSKAHQACGRIGEWA+QSKEHSKGRATQSNPIRLQTL+HAD+ KTEQ ILELVTLLH LIHLAKQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPTPK+MA+PQP NARRIQFKSQIIKAN DG             TP IRKRDP    GTE+ +N+ KDKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

TrEMBL top hitse value%identityAlignment
A0A5A7VI62 Uncharacterized protein1.4e-24881.27Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGIVK+DF+SEK TQ SEDRKGNS LN EASDP ++P+KS SGVILL SPPSK GSNKVAPMN+QAGA GRAVDL KTIG SVS LH+N+
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFT    +GREISILAFEVANTISKVANLS+SLSEEN+Q LKKEL QSEGIKQL+S SLEELLSIAAADKRQEF V+LREVIRFG QCKD +WHNLDQY
Subjt:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLD NDSSQKQAREARAA+QELTVLAQ+TSELYHELQALER EQDYRR+V+EVE LNQAGIGE+LSIFQGELNVQR+LVRSFQSKCLWSRNLDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSV
        LVIVVTWINQTI K FG +NT KTLLI+DRSNGQKLGAVGLALHYANIISQINLIA RPTSIPSNMRDALYRALPTS+K ALRSRL+AVDA EEPTY  V
Subjt:  LVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSV

Query:  KAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQS-NPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSPTPKD
        KAEMD+IL+WLVPIAANTSKAHQ+CGRIGEWA+QSKE SKGRATQS NP RLQTLYHADK KTE+ ILELV LLH LIH AK QQ+R +SLRCRSPTPKD
Subjt:  KAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQS-NPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSPTPKD

Query:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTPIRKR----DPGTETSKNENKDKGIWTLSKGVSVSTLSSLSRV
        MA    N++RRIQFKSQII+  KDG P D  N P A Q+PIRKR      G E+ KNEN  KGIWTLSK VSVSTL ++ RV
Subjt:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTPIRKR----DPGTETSKNENKDKGIWTLSKGVSVSTLSSLSRV

A0A6J1CSY9 uncharacterized protein LOC111014009 isoform X13.7e-25283.22Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTS-EDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMN
        MGNTMGGVCSNGI K+DF+ EKKT+ S +DRKGNSCL SEA+DP+ LPQKSRSGVILL SPPSKTGSNKVAPMN Q GA GRAV+L KTIGNSVS LH N
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTS-EDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMN

Query:  SGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQ
        +GFFTGTASNGREISILAFEVANTISK+ANLSQSLSEE+IQFLKKEL QSEGIKQLVS ++EELLSIAAADKRQEFD LLREVIRFGKQCKDPQWHNLDQ
Subjt:  SGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQ

Query:  YFARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVE
        YF+RLDLNDSSQKQAREARAA+QELTVLAQ TSELYHELQALERFEQDYRRK+DEVE LNQAGIGESL+IFQGELNVQRKLVRS QSKCLWSRNLDEIV 
Subjt:  YFARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVE

Query:  KLVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLS
        KLV +VTWI QTIAKAFGDN++G+  LI DRS+GQKLG+VGLALHYANII+QINLIACRPTSIPS+MRDALYRALPTSVK  LRSRLQAVDASEEPTYLS
Subjt:  KLVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLS

Query:  VKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSPTPKD
        VKAEMDR LQWLVPIAANTSKAHQACGRIGEWASQSKE S+GRA Q+N IRLQTLYHADKEKTEQ ILELVT LH +IHLAKQQQQRFTSLRCRSPTPKD
Subjt:  VKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSPTPKD

Query:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTPIRKRDPGT----ETSKNENKD-KGIWTLSKGVSVSTLS
        +A PQ    RRIQF+S II      A A+N       QTPIRKRDPG     ETSKNEN+D KGIWTL  G   S+LS
Subjt:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTPIRKRDPGT----ETSKNENKD-KGIWTLSKGVSVSTLS

A0A6J1HF33 uncharacterized protein LOC111462458 isoform X16.7e-26283.73Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDRKGNSCL+SEA DP ++PQ+SRSGV LL SPPSKTGSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMNS
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG ASNGREISILAFEVANTISKVANLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLL EVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQSTSELYHEL ALERFEQDYRRKVDEVES+NQAG GESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAK F D+NTG         KTL IEDRSNGQKLG+VGLALHYA IISQINLIACRPTSIPSNMRDALYRALPTSVK  LRSRL+ V+ 
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMD+IL+WLVPIAANTSKAHQACGRIGEWA+QSKEHSKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH  IHL KQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPT K+MA+PQP NARRIQFKSQIIKAN DG             TP IRKRDP    GTE+ +N NKDKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

A0A6J1K3J2 uncharacterized protein LOC111492046 isoform X15.1e-26283.9Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDR GNSCLNSEA D  ++PQ+SRSGV LL SPPSK GSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMN 
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG ASNG EISILAFEVANTISKV NLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQSTSELYHEL ALERFEQDYRRKVDEVESLNQ GIGESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAKAFGD+NTG         KTL IEDRS GQKLG+VGLALHYANIISQINLIACRP SIPSNMRDALYRALPTSVK  LRSRL+ VD 
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMD+IL+WLVPIAANTSKAHQACGRIGEWA+QSKEHSKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH LIHL+KQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPTPK+MA+PQP NARRIQFKSQIIKAN DG             TP  RKRDP    GTE+ KNEN DKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

A0A6J1KT53 uncharacterized protein LOC111496177 isoform X15.0e-24984.04Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNT+GGVCSNGIVK+DF++EKK + SEDRKGNSCLN +ASDP++LP K RSGVILL SPPSKTGSNKVAP N+  GA  +AVD LKT GNSVS +H N+
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG AS+GREISILAFEVANTISKVANLS+SLSEENIQ LKKELSQSEG+KQLVS ++EELLSIAAADKRQEFDVLLREV RFG +CKDPQWHNLDQ+
Subjt:  GFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDS+QKQAREARAAMQELTVLAQ TSELYHE+Q LERFEQDYRRKVDEVE +NQAGIGESLSIFQGELNVQRKLVRSFQ+K LWSRNLDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDAS-EEPTYLS
        LV+VVTWINQ IAKAFGD NT K L+IE+RS GQKLG+VGLALHY+NIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVD S EEPTYL 
Subjt:  LVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDAS-EEPTYLS

Query:  VKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSPTPKD
        VK EMD+IL+WLVPIA NT+KAHQACGRIGEWASQSKE SKGRATQSNPIRLQTLYHADK KTEQ I+ELVTLLH LIHLAK QQQR TSLRCRSPTP++
Subjt:  VKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSPTPKD

Query:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTPIRKRD
        MAI QPN ARRIQ+ SQ +K  KDGAPAD  NRPSAGQTPI KR+
Subjt:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTPIRKRD

SwissProt top hitse value%identityAlignment
P0DO24 Protein PSK SIMULATOR 31.8e-8642.92Show/hide
Query:  SKTGSNKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSIS
        ++T  +KV   +   G    GRA D+L T+G+S++ L  + GF +G A+ G E+ ILAFEVANTI K +NL +SLS+ NI+ LK  +  SEG++ LVS  
Subjt:  SKTGSNKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSIS

Query:  LEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQARE-ARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESL
         +ELL + AADKRQE  V   EV+RFG + KD QWHNL +YF R+    + Q+Q +E A   + +L VL Q T+ELY ELQ L R E+DY +K  E E+ 
Subjt:  LEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQARE-ARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESL

Query:  NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFG--DNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIA
          +  G+ L+I + EL  QRK+V+S + K LWSR  +E++EKLV +V ++   I   FG  D+   K    E     ++LG  GLALHYANII QI+ + 
Subjt:  NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFG--DNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIA

Query:  CRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYH
         R +SI SN RD+LY++LP  +K ALRS++++ +  +E +   +K EM+R L WLVP+A NT+KAH   G +GEWA+   + +  + +  + +R++TLYH
Subjt:  CRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYH

Query:  ADKEKTEQHILELVTLLHTLIHLAKQQQQ---RFTSLRCRSPTPKDMAIPQP
        A KEKTE +IL  +  L  L+  AK   +   R +S++    T     I +P
Subjt:  ADKEKTEQHILELVTLLHTLIHLAKQQQQ---RFTSLRCRSPTPKDMAIPQP

Q9SA91 Protein PSK SIMULATOR 21.4e-11248.24Show/hide
Query:  MGGVCSNGIVKEDFLSEKKTQTSEDRKG-------NSCLNSEASDP-------EKLPQKSRSGVILLR-------SPPSKTGSNKVAPMNS---QAGAYG
        MGGVCS   V +D   +KK ++++D K         S   S+ SD            + S+   ++          PP +  S K    NS   +AG  G
Subjt:  MGGVCSNGIVKEDFLSEKKTQTSEDRKG-------NSCLNSEASDP-------EKLPQKSRSGVILLR-------SPPSKTGSNKVAPMNS---QAGAYG

Query:  --RAVDLLKTIGNSVSTLHMNSGFFTG-TASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDV
          +AV++L T+G+S++ ++ ++ + +G T+S G +++ILAFEVANTI+K A L QSLSEEN++F+KK++  SE +K+LVS    EL  +AA+DKR+E D+
Subjt:  --RAVDLLKTIGNSVSTLHMNSGFFTG-TASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDV

Query:  LLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNV
           EVIRFG  CKD QWHNLD+YF +LD  +S  K  + +A A MQEL  LA+ TSELYHELQAL+RFEQDYRRK+ EVESLN    GE + I Q EL  
Subjt:  LLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNV

Query:  QRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPT
        Q+KLV+S Q K LWS+NL EI+EKLV VV++I QTI + FG+N        E     ++LG  GL+LHYAN+I QI+ IA RP+S+PSN+RD LY ALP 
Subjt:  QRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPT

Query:  SVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRA---TQSNPIRLQTLYHADKEKTEQHILELVTLL
        +VKTALR RLQ +D  EE +   +KAEM++ LQWLVP A NT+KAHQ  G +GEWA+   E  KG+       NP RLQTL+HADK   + ++LELV  L
Subjt:  SVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRA---TQSNPIRLQTLYHADKEKTEQHILELVTLL

Query:  HTLIHLAKQQ
        H L+  +K++
Subjt:  HTLIHLAKQQ

Q9XID5 Protein PSK SIMULATOR 16.3e-10045.82Show/hide
Query:  LPQKSRSGVILLRSPPSKTGSNKVAPMNSQAG-----AYGRAVDLLKTIGNSVSTLHMNSGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQ
        L QKSRS      +   +    KV+ ++S  G       G+AVD+L T+G+S++ L+++ GF + T   G +ISIL+FEVANTI K ANL  SLS+++I 
Subjt:  LPQKSRSGVILLRSPPSKTGSNKVAPMNSQAG-----AYGRAVDLLKTIGNSVSTLHMNSGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQ

Query:  FLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQSTSELYHELQ
         LK+ +  SEG++ L+S  ++ELL IAAADKR+E  +   EV+RFG +CKDPQ+HNLD++F RL    + QK  + EA   M ++      T++LYHEL 
Subjt:  FLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQSTSELYHELQ

Query:  ALERFEQDYRRKVDEVE--SLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLG
        AL+RFEQDY+RK+ E E  S  Q G+G++L+I + EL  Q+K VR+ + K LWSR L+E++EKLV VV +++  I +AFG  +  K    +   N +KLG
Subjt:  ALERFEQDYRRKVDEVE--SLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLG

Query:  AVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKE
        + GLALHYANII+QI+ +  R +++P++ RDALY+ LP S+K+ALRSR+Q+    EE T   +KAEM++ LQWLVP+A NT+KAH   G +GEWAS   E
Subjt:  AVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKE

Query:  HSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSPTPKDMAIPQPNNARRIQFKS
         ++  A Q+  +R+ TL+HADKEKTE +IL+LV  LH L+     Q +  T    RSP    +  P   N + IQ  S
Subjt:  HSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSPTPKDMAIPQPNNARRIQFKS

Arabidopsis top hitse value%identityAlignment
AT1G30755.1 Protein of unknown function (DUF668)1.0e-11348.24Show/hide
Query:  MGGVCSNGIVKEDFLSEKKTQTSEDRKG-------NSCLNSEASDP-------EKLPQKSRSGVILLR-------SPPSKTGSNKVAPMNS---QAGAYG
        MGGVCS   V +D   +KK ++++D K         S   S+ SD            + S+   ++          PP +  S K    NS   +AG  G
Subjt:  MGGVCSNGIVKEDFLSEKKTQTSEDRKG-------NSCLNSEASDP-------EKLPQKSRSGVILLR-------SPPSKTGSNKVAPMNS---QAGAYG

Query:  --RAVDLLKTIGNSVSTLHMNSGFFTG-TASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDV
          +AV++L T+G+S++ ++ ++ + +G T+S G +++ILAFEVANTI+K A L QSLSEEN++F+KK++  SE +K+LVS    EL  +AA+DKR+E D+
Subjt:  --RAVDLLKTIGNSVSTLHMNSGFFTG-TASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDV

Query:  LLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNV
           EVIRFG  CKD QWHNLD+YF +LD  +S  K  + +A A MQEL  LA+ TSELYHELQAL+RFEQDYRRK+ EVESLN    GE + I Q EL  
Subjt:  LLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNV

Query:  QRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPT
        Q+KLV+S Q K LWS+NL EI+EKLV VV++I QTI + FG+N        E     ++LG  GL+LHYAN+I QI+ IA RP+S+PSN+RD LY ALP 
Subjt:  QRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPT

Query:  SVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRA---TQSNPIRLQTLYHADKEKTEQHILELVTLL
        +VKTALR RLQ +D  EE +   +KAEM++ LQWLVP A NT+KAHQ  G +GEWA+   E  KG+       NP RLQTL+HADK   + ++LELV  L
Subjt:  SVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRA---TQSNPIRLQTLYHADKEKTEQHILELVTLL

Query:  HTLIHLAKQQ
        H L+  +K++
Subjt:  HTLIHLAKQQ

AT1G34320.1 Protein of unknown function (DUF668)4.5e-10145.82Show/hide
Query:  LPQKSRSGVILLRSPPSKTGSNKVAPMNSQAG-----AYGRAVDLLKTIGNSVSTLHMNSGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQ
        L QKSRS      +   +    KV+ ++S  G       G+AVD+L T+G+S++ L+++ GF + T   G +ISIL+FEVANTI K ANL  SLS+++I 
Subjt:  LPQKSRSGVILLRSPPSKTGSNKVAPMNSQAG-----AYGRAVDLLKTIGNSVSTLHMNSGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQ

Query:  FLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQSTSELYHELQ
         LK+ +  SEG++ L+S  ++ELL IAAADKR+E  +   EV+RFG +CKDPQ+HNLD++F RL    + QK  + EA   M ++      T++LYHEL 
Subjt:  FLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQSTSELYHELQ

Query:  ALERFEQDYRRKVDEVE--SLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLG
        AL+RFEQDY+RK+ E E  S  Q G+G++L+I + EL  Q+K VR+ + K LWSR L+E++EKLV VV +++  I +AFG  +  K    +   N +KLG
Subjt:  ALERFEQDYRRKVDEVE--SLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLLIEDRSNGQKLG

Query:  AVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKE
        + GLALHYANII+QI+ +  R +++P++ RDALY+ LP S+K+ALRSR+Q+    EE T   +KAEM++ LQWLVP+A NT+KAH   G +GEWAS   E
Subjt:  AVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKE

Query:  HSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSPTPKDMAIPQPNNARRIQFKS
         ++  A Q+  +R+ TL+HADKEKTE +IL+LV  LH L+     Q +  T    RSP    +  P   N + IQ  S
Subjt:  HSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSPTPKDMAIPQPNNARRIQFKS

AT3G23160.1 Protein of unknown function (DUF668)6.2e-2624.11Show/hide
Query:  ISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQK
        I IL+FEVAN +SK  +L +SLS+  I  LK E+  SEG+++LVS     LL ++ ++K  +   +   V R GK+C +P     +  +  +       +
Subjt:  ISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQK

Query:  Q----AREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWI
        +     ++  + ++++     +T  LY E++ +   EQ        V+        ES+  F+ +L  QR+ V+S +   LW++  D++VE L   V  I
Subjt:  Q----AREARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWI

Query:  NQTIAKAFG-------------------------------------------------------------------------------------------
           I   FG                                                                                           
Subjt:  NQTIAKAFG-------------------------------------------------------------------------------------------

Query:  ------DNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKT----ALRSRLQAVDASEEPTYLSVKAEMDRI
               N  G    +   ++   +G   L+LHYAN++  +  +   P  I    RD LY+ LPTS+KT    +LRS L+ +   + P     K  +D I
Subjt:  ------DNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKT----ALRSRLQAVDASEEPTYLSVKAEMDRI

Query:  LQWLVPIAANTSKAHQACGRIGEWASQSK-EHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQ
        L WL P+A N  +          W S+   E       ++N + LQTLY AD+EKTE  I +L+  L+ + H  +QQ
Subjt:  LQWLVPIAANTSKAHQACGRIGEWASQSK-EHSKGRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQ

AT5G08660.1 Protein of unknown function (DUF668)1.3e-8742.92Show/hide
Query:  SKTGSNKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSIS
        ++T  +KV   +   G    GRA D+L T+G+S++ L  + GF +G A+ G E+ ILAFEVANTI K +NL +SLS+ NI+ LK  +  SEG++ LVS  
Subjt:  SKTGSNKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSIS

Query:  LEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQARE-ARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESL
         +ELL + AADKRQE  V   EV+RFG + KD QWHNL +YF R+    + Q+Q +E A   + +L VL Q T+ELY ELQ L R E+DY +K  E E+ 
Subjt:  LEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQARE-ARAAMQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESL

Query:  NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFG--DNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIA
          +  G+ L+I + EL  QRK+V+S + K LWSR  +E++EKLV +V ++   I   FG  D+   K    E     ++LG  GLALHYANII QI+ + 
Subjt:  NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFG--DNNTGKTLLIEDRSNGQKLGAVGLALHYANIISQINLIA

Query:  CRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYH
         R +SI SN RD+LY++LP  +K ALRS++++ +  +E +   +K EM+R L WLVP+A NT+KAH   G +GEWA+   + +  + +  + +R++TLYH
Subjt:  CRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYH

Query:  ADKEKTEQHILELVTLLHTLIHLAKQQQQ---RFTSLRCRSPTPKDMAIPQP
        A KEKTE +IL  +  L  L+  AK   +   R +S++    T     I +P
Subjt:  ADKEKTEQHILELVTLLHTLIHLAKQQQQ---RFTSLRCRSPTPKDMAIPQP

AT5G51670.1 Protein of unknown function (DUF668)2.5e-1923.31Show/hide
Query:  HMNSGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDP---Q
        H +S   + T S+   + +L+FEVA  ++K+ +L+ SL++ N+   +      EG+ ++V+      LS+  A+           V R   +C       
Subjt:  HMNSGFFTGTASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDP---Q

Query:  WHNLDQYFARLDLNDSS-QKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRK--------VDEVESLNQAGIGESLSIFQGELNVQRKLVRSF
        +H L   FA +  +        ++  A  +++      T+ LY E++ +   E   R++         +E +  N+  + + + + Q ++  Q++ V+  
Subjt:  WHNLDQYFARLDLNDSS-QKQAREARAAMQELTVLAQSTSELYHELQALERFEQDYRRK--------VDEVESLNQAGIGESLSIFQGELNVQRKLVRSF

Query:  QSKCLWSRNLDEIVEKLV-------------------------IVVTWINQTIAKAFG--------------DNNTGKTLLIEDRSNGQK-----LGAVG
        + + LW+++ D +V  L                           VV+ + ++++ +                D  T  +  +E+ S   K     LG  G
Subjt:  QSKCLWSRNLDEIVEKLV-------------------------IVVTWINQTIAKAFG--------------DNNTGKTLLIEDRSNGQK-----LGAVG

Query:  LALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAV--DASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEH
        +ALHYAN+I  +  +  +P  +  + RD LY  LP SV+++LRSRL+ V   A++       KA + RIL+WL+P+A N  +          W S+    
Subjt:  LALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAV--DASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEH

Query:  SKGRATQSNP----IRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQR-FTSLRC
         +  AT +N     + +QTL  ADK KTE  I EL+  L+ +    ++   +   +L+C
Subjt:  SKGRATQSNP----IRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQR-FTSLRC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAACACAATGGGAGGTGTATGTTCCAATGGAATAGTTAAGGAGGATTTCTTGTCAGAGAAGAAAACGCAAACGTCTGAGGATCGGAAAGGGAATTCGTGTTTGAA
TTCTGAGGCAAGTGATCCGGAGAAATTGCCGCAGAAGTCTCGTTCTGGTGTGATACTATTGCGCTCACCTCCGTCTAAGACAGGAAGCAATAAGGTTGCACCAATGAACT
CACAAGCAGGGGCTTACGGGAGGGCAGTTGATTTATTGAAAACAATTGGGAATAGTGTGTCAACTTTGCACATGAACAGTGGGTTTTTTACAGGCACGGCTTCAAATGGT
AGGGAGATTTCTATATTGGCTTTTGAAGTAGCAAACACAATAAGCAAAGTAGCAAATTTATCCCAATCTCTCTCAGAAGAAAACATCCAGTTCCTCAAAAAGGAACTTTC
ACAATCAGAAGGGATAAAACAATTAGTCTCAATAAGTTTAGAGGAATTACTAAGCATTGCAGCTGCAGACAAAAGGCAGGAATTTGACGTTCTCTTACGGGAGGTAATAC
GATTTGGAAAGCAGTGCAAGGATCCACAGTGGCACAATCTGGATCAGTACTTTGCAAGACTAGATTTGAATGATTCAAGTCAAAAACAAGCTCGAGAGGCCAGAGCAGCC
ATGCAGGAACTAACTGTTTTAGCTCAGAGTACTTCTGAATTATACCATGAATTACAAGCATTGGAAAGATTTGAGCAAGATTACAGAAGGAAAGTTGACGAAGTGGAGTC
CTTGAACCAAGCAGGAATAGGAGAAAGTCTCTCAATATTCCAAGGAGAATTAAACGTACAAAGAAAGCTCGTAAGGAGCTTTCAAAGCAAGTGCCTTTGGTCCAGAAATT
TAGATGAGATTGTGGAAAAGCTCGTTATTGTTGTAACATGGATAAATCAAACAATAGCCAAAGCATTTGGTGACAACAACACAGGTAAAACATTGCTTATCGAGGACAGA
AGTAATGGCCAGAAACTGGGCGCTGTTGGTCTTGCCTTACATTATGCAAACATAATCAGCCAGATAAATCTAATTGCGTGTCGTCCAACCTCTATTCCTTCAAATATGAG
AGATGCATTATACCGGGCATTGCCAACAAGTGTTAAAACAGCTCTGCGCTCTCGATTGCAGGCTGTGGATGCCAGTGAGGAGCCAACTTATCTTAGTGTCAAAGCTGAAA
TGGATAGGATCCTTCAATGGCTTGTTCCAATTGCTGCAAACACGAGCAAAGCACATCAAGCTTGTGGCCGGATTGGAGAATGGGCATCTCAAAGTAAGGAACACAGCAAA
GGCAGAGCCACACAGAGCAACCCAATCCGCCTTCAAACGCTGTACCACGCAGACAAAGAAAAAACAGAGCAACACATTCTCGAGCTGGTCACATTGCTCCACACTCTCAT
CCATTTGGCAAAACAGCAACAACAACGCTTCACATCGCTCCGTTGCCGATCTCCAACTCCCAAAGATATGGCAATTCCTCAGCCAAACAATGCTCGTCGGATCCAATTCA
AGAGCCAAATCATCAAAGCCAACAAGGACGGAGCTCCGGCCGACAACAACAACAGACCATCGGCCGGCCAAACTCCGATCAGAAAGAGGGATCCAGGTACGGAAACCTCT
AAAAATGAGAATAAAGATAAAGGAATTTGGACTTTAAGTAAAGGGGTTTCAGTTTCGACCTTGAGTTCTCTTAGTAGAGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGAACACAATGGGAGGTGTATGTTCCAATGGAATAGTTAAGGAGGATTTCTTGTCAGAGAAGAAAACGCAAACGTCTGAGGATCGGAAAGGGAATTCGTGTTTGAA
TTCTGAGGCAAGTGATCCGGAGAAATTGCCGCAGAAGTCTCGTTCTGGTGTGATACTATTGCGCTCACCTCCGTCTAAGACAGGAAGCAATAAGGTTGCACCAATGAACT
CACAAGCAGGGGCTTACGGGAGGGCAGTTGATTTATTGAAAACAATTGGGAATAGTGTGTCAACTTTGCACATGAACAGTGGGTTTTTTACAGGCACGGCTTCAAATGGT
AGGGAGATTTCTATATTGGCTTTTGAAGTAGCAAACACAATAAGCAAAGTAGCAAATTTATCCCAATCTCTCTCAGAAGAAAACATCCAGTTCCTCAAAAAGGAACTTTC
ACAATCAGAAGGGATAAAACAATTAGTCTCAATAAGTTTAGAGGAATTACTAAGCATTGCAGCTGCAGACAAAAGGCAGGAATTTGACGTTCTCTTACGGGAGGTAATAC
GATTTGGAAAGCAGTGCAAGGATCCACAGTGGCACAATCTGGATCAGTACTTTGCAAGACTAGATTTGAATGATTCAAGTCAAAAACAAGCTCGAGAGGCCAGAGCAGCC
ATGCAGGAACTAACTGTTTTAGCTCAGAGTACTTCTGAATTATACCATGAATTACAAGCATTGGAAAGATTTGAGCAAGATTACAGAAGGAAAGTTGACGAAGTGGAGTC
CTTGAACCAAGCAGGAATAGGAGAAAGTCTCTCAATATTCCAAGGAGAATTAAACGTACAAAGAAAGCTCGTAAGGAGCTTTCAAAGCAAGTGCCTTTGGTCCAGAAATT
TAGATGAGATTGTGGAAAAGCTCGTTATTGTTGTAACATGGATAAATCAAACAATAGCCAAAGCATTTGGTGACAACAACACAGGTAAAACATTGCTTATCGAGGACAGA
AGTAATGGCCAGAAACTGGGCGCTGTTGGTCTTGCCTTACATTATGCAAACATAATCAGCCAGATAAATCTAATTGCGTGTCGTCCAACCTCTATTCCTTCAAATATGAG
AGATGCATTATACCGGGCATTGCCAACAAGTGTTAAAACAGCTCTGCGCTCTCGATTGCAGGCTGTGGATGCCAGTGAGGAGCCAACTTATCTTAGTGTCAAAGCTGAAA
TGGATAGGATCCTTCAATGGCTTGTTCCAATTGCTGCAAACACGAGCAAAGCACATCAAGCTTGTGGCCGGATTGGAGAATGGGCATCTCAAAGTAAGGAACACAGCAAA
GGCAGAGCCACACAGAGCAACCCAATCCGCCTTCAAACGCTGTACCACGCAGACAAAGAAAAAACAGAGCAACACATTCTCGAGCTGGTCACATTGCTCCACACTCTCAT
CCATTTGGCAAAACAGCAACAACAACGCTTCACATCGCTCCGTTGCCGATCTCCAACTCCCAAAGATATGGCAATTCCTCAGCCAAACAATGCTCGTCGGATCCAATTCA
AGAGCCAAATCATCAAAGCCAACAAGGACGGAGCTCCGGCCGACAACAACAACAGACCATCGGCCGGCCAAACTCCGATCAGAAAGAGGGATCCAGGTACGGAAACCTCT
AAAAATGAGAATAAAGATAAAGGAATTTGGACTTTAAGTAAAGGGGTTTCAGTTTCGACCTTGAGTTCTCTTAGTAGAGTCTAG
Protein sequenceShow/hide protein sequence
MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEASDPEKLPQKSRSGVILLRSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNSGFFTGTASNG
REISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAREARAA
MQELTVLAQSTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLLIEDR
SNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDRILQWLVPIAANTSKAHQACGRIGEWASQSKEHSK
GRATQSNPIRLQTLYHADKEKTEQHILELVTLLHTLIHLAKQQQQRFTSLRCRSPTPKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNNRPSAGQTPIRKRDPGTETS
KNENKDKGIWTLSKGVSVSTLSSLSRV