; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G20910 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G20910
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionProtein of unknown function (DUF668)
Genome locationChr2:18445465..18449100
RNA-Seq ExpressionCSPI02G20910
SyntenyCSPI02G20910
Gene Ontology termsGO:0043434 - response to peptide hormone (biological process)
GO:0045927 - positive regulation of growth (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR007700 - Domain of unknown function DUF668
IPR021864 - Domain of unknown function DUF3475
IPR045021 - Protein PSK SIMULATOR


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065159.1 uncharacterized protein E6C27_scaffold82G004950 [Cucumis melo var. makuwa]2.0e-28992.11Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
        MGGVCSNGIVK+DFVSEK+ QASEDRKGNS+LN EA DPNEMPEKS S VILLPSPPSK GSNKVAPMNAQAGARGRAVDLWKTIGISVSN HIN+G  T
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST

Query:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
         M PSGREISILAFEVANTISKVANLSKSLSEEN+QLLK ELLQSE IKQLIS SLEELLSIAAADKRQEFGVILRE+IRFGN+CKDS+WHNLDQYFSRL
Subjt:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        DSNDSSQKQAREARAA+QELTVLAQNTSELYHELQALERLEQDYRRRVEEVE LNQAGIGETLSIFQGELNVQR+LVRSFQSKCLWSRNLDEIVEKLVIV
Subjt:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM
        V+WINQTI+KEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIIS INLIA RPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVK EM
Subjt:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM

Query:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT
        DKILEWLVPIAANTSKAHQSCGRIGEWATQSKE SKGRATQ+NN NRLQTLY+ADKVKTE QILELV LLHHLIH AKHQQRRSSSLRCRSPTPKDMAN+
Subjt:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT

Query:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV
        SRRIQFKSQIIRTTKDGFPTDNIP   Q+PIRK+VLGNKKGMESYKNENKGIWTLSKAVSVSTLR++GRV
Subjt:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV

XP_004152648.2 uncharacterized protein LOC101204577 isoform X3 [Cucumis sativus]6.6e-23293.7Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
        MGGVCSNGIVK+DFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST

Query:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
        GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
Subjt:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
Subjt:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM
        V+WINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIIS INLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM
Subjt:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM

Query:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTE
        DKILEWLVPIAANTSKAHQSCGRIGEWATQ     +  A    +   +QT +    ++T+
Subjt:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTE

XP_011649665.1 uncharacterized protein LOC101204577 isoform X1 [Cucumis sativus]0.0e+0099.47Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
        MGGVCSNGIVK+DFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST

Query:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
        GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
Subjt:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
Subjt:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM
        V+WINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIIS INLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM
Subjt:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM

Query:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT
        DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT
Subjt:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT

Query:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV
        SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV
Subjt:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV

XP_031736339.1 uncharacterized protein LOC101204577 isoform X2 [Cucumis sativus]7.0e-28292.98Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
        MGGVCSNGIVK+DFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST

Query:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
        GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
Subjt:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
Subjt:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM
        V+WINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIIS INLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE          
Subjt:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM

Query:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT
                    + S   Q+  R+G      KEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT
Subjt:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT

Query:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV
        SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV
Subjt:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV

XP_038886160.1 protein PSK SIMULATOR 2-like isoform X1 [Benincasa hispida]2.5e-26384.56Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
        MGGVCSNGIVK+DFVSEKI Q SED KGNS  N EA DPNEM +KS S VILLPSPPSKTGSNKVAP+NAQAGARGRAVDLWKTIGIS+ N H+N+GV T
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST

Query:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
        GMA +GREISILAFEVANTISKVANLS+SLSE NIQLLK ELLQSE +KQLIS SLEELLSIAAADKRQE  V+ RE+IRFGN CKDS+WHNLDQYFSRL
Subjt:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        D ND SQKQAREARAAL+ELTVL+QNTSELYHELQALER EQDYRR+V+EVEFLNQAGIGE+LSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
Subjt:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM
        V+WINQTI K FG DNTDKTLL +DRSNGQKLGAVGLALHYANIIS INLIACRPTSIPSNMRDALYRALPTS+KIALRSRLR V+A EE TY+DVK E+
Subjt:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM

Query:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT
        DKILEWL PIAANT+KAHQSCGRIGEWATQSKEHSKGRA Q NN  RLQTLY+ADKVKTE QILEL+TLLHHLIHLAK QQ+RS+SLRCRSPTPKDMAN 
Subjt:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT

Query:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV
         RRIQFKSQIIR+ KDG P ++ PS GQTPIRK+ L N KG ESYK+E KGIWTLSKAVSVST+ SLGRV
Subjt:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV

TrEMBL top hitse value%identityAlignment
A0A0A0LLM8 Uncharacterized protein0.0e+0099.47Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
        MGGVCSNGIVK+DFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST

Query:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
        GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
Subjt:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
Subjt:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM
        V+WINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIIS INLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM
Subjt:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM

Query:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT
        DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT
Subjt:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT

Query:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV
        SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV
Subjt:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV

A0A5A7VI62 Uncharacterized protein9.8e-29092.11Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
        MGGVCSNGIVK+DFVSEK+ QASEDRKGNS+LN EA DPNEMPEKS S VILLPSPPSK GSNKVAPMNAQAGARGRAVDLWKTIGISVSN HIN+G  T
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST

Query:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
         M PSGREISILAFEVANTISKVANLSKSLSEEN+QLLK ELLQSE IKQLIS SLEELLSIAAADKRQEFGVILRE+IRFGN+CKDS+WHNLDQYFSRL
Subjt:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        DSNDSSQKQAREARAA+QELTVLAQNTSELYHELQALERLEQDYRRRVEEVE LNQAGIGETLSIFQGELNVQR+LVRSFQSKCLWSRNLDEIVEKLVIV
Subjt:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM
        V+WINQTI+KEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIIS INLIA RPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVK EM
Subjt:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEM

Query:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT
        DKILEWLVPIAANTSKAHQSCGRIGEWATQSKE SKGRATQ+NN NRLQTLY+ADKVKTE QILELV LLHHLIH AKHQQRRSSSLRCRSPTPKDMAN+
Subjt:  DKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANT

Query:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV
        SRRIQFKSQIIRTTKDGFPTDNIP   Q+PIRK+VLGNKKGMESYKNENKGIWTLSKAVSVSTLR++GRV
Subjt:  SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENKGIWTLSKAVSVSTLRSLGRV

A0A6J1HF33 uncharacterized protein LOC111462458 isoform X11.8e-23076.41Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
        MGGVCSNGI K+ F SEKI Q SEDRKGNS L+SEA DPNEMP++SRS V LL SPPSKTGSNKVAP+N+QAG+RGRA+DL KTIG SVSN H+NSG  T
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST

Query:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
        GMA +GREISILAFEVANTISKVANLS+SLSEENIQLLK ELLQSE IKQL+S S EELLSIAAADKRQEF V+L E+IRFG +CKD QWHNLDQYFSRL
Subjt:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        D NDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALER EQDYRR+V+EVE +NQAG GE+LSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEKLVIV
Subjt:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VSWINQTIIKEFGVDN---------TDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEP
        V+WINQTI K F   N         TDKTL I+DRSNGQKLG+VGLALHYA IIS INLIACRPTSIPSNMRDALYRALPTS+KI LRSRLR V+  EEP
Subjt:  VSWINQTIIKEFGVDN---------TDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEP

Query:  TYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRS
        TY  VK EMDKILEWLVPIAANTSKAHQ+CGRIGEWATQSKEHSKGRATQ+N   RLQTLY+AD++KTE QILELVTLLHH IHL K Q +R +SLRC+S
Subjt:  TYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRS

Query:  PTPKDMA----NTSRRIQFKSQIIRTTKDGFPTDNIPSPGQTP-IRKKVLGNKKGMESYKNEN--KGIWTLSKAVSVSTLRSLGR
        PT K+MA      +RRIQFKSQII+   DG           TP IRK+   N KG ESY+N N  KGIWTLSK  SVSTL SL R
Subjt:  PTPKDMA----NTSRRIQFKSQIIRTTKDGFPTDNIPSPGQTP-IRKKVLGNKKGMESYKNEN--KGIWTLSKAVSVSTLRSLGR

A0A6J1K3J2 uncharacterized protein LOC111492046 isoform X13.2e-23276.92Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
        MGGVCSNGI K+ F SEKI Q SEDR GNS LNSEA D NEMP++SRS V LLPSPPSK GSNKVAP+N+QAG+RGRA+DL KTIG SVSN H+N G  T
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST

Query:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
        GMA +G EISILAFEVANTISKV NLS+SLSEENIQLLK ELLQSE IKQL+S S EELLSIAAADKRQEF V+LRE+IRFG +CKD QWHNLDQYFSRL
Subjt:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        D NDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALER EQDYRR+V+EVE LNQ GIGE+LSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEKLVIV
Subjt:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VSWINQTIIKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEP
        V+WINQTI K FG  NT         DKTL I+DRS GQKLG+VGLALHYANIIS INLIACRP SIPSNMRDALYRALPTS+KI LRSRLR VD  EEP
Subjt:  VSWINQTIIKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEP

Query:  TYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRS
        TY  VK EMDKILEWLVPIAANTSKAHQ+CGRIGEWATQSKEHSKGRATQ+N   RLQTLY+AD+VKTE QILELVTLLHHLIHL+K Q +R +SLRC+S
Subjt:  TYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRS

Query:  PTPKDMA----NTSRRIQFKSQIIRTTKDGFPTDNIPSPGQTP-IRKKVLGNKKGMESYKNEN--KGIWTLSKAVSVSTLRSLGR
        PTPK+MA      +RRIQFKSQII+   DG           TP  RK+   N KG ESYKNEN  KGIWTLSK  SVSTL SL R
Subjt:  PTPKDMA----NTSRRIQFKSQIIRTTKDGFPTDNIPSPGQTP-IRKKVLGNKKGMESYKNEN--KGIWTLSKAVSVSTLRSLGR

A0A6J1KT53 uncharacterized protein LOC111496177 isoform X11.7e-22577.06Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
        +GGVCSNGIVK+DFV+EK  + SEDRKGNS LN +A DP+E+P K RS VILLPSPPSKTGSNKVAP NA  GAR +AVD  KT G SVSN H N+G  T
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST

Query:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
        GMA  GREISILAFEVANTISKVANLS+SLSEENIQLLK EL QSE +KQL+S ++EELLSIAAADKRQEF V+LRE+ RFGNRCKD QWHNLDQ+FSRL
Subjt:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        D NDS+QKQAREARAA+QELTVLAQ TSELYHE+Q LER EQDYRR+V+EVEF+NQAGIGE+LSIFQGELNVQRKLVRSFQ+K LWSRNLDEIVEKLV+V
Subjt:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAV-DAREEPTYYDVKTE
        V+WINQ I K FG  NTDK L+I++RS GQKLG+VGLALHY+NIIS INLIACRPTSIPSNMRDALYRALPTS+K ALRSRL+AV D+ EEPTY  VK E
Subjt:  VSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAV-DAREEPTYYDVKTE

Query:  MDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMA-
        MDKIL WLVPIA NT+KAHQ+CGRIGEWA+QSKE SKGRATQ+N   RLQTLY+ADK KTE QI+ELVTLLHHLIHLAKHQQ+R +SLRCRSPTP++MA 
Subjt:  MDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMA-

Query:  ---NTSRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKK
           N +RRIQ+ SQ ++TTKDG P DN PS GQTPI K+   N++
Subjt:  ---NTSRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKK

SwissProt top hitse value%identityAlignment
P0DO24 Protein PSK SIMULATOR 32.0e-8242.19Show/hide
Query:  SKTGSNKVAPMNAQAG--ARGRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISAS
        ++T  +KV   +   G    GRA D+  T+G S+++   + G ++G+A  G E+ ILAFEVANTI K +NL +SLS+ NI+ LK  +L SE ++ L+S  
Subjt:  SKTGSNKVAPMNAQAG--ARGRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISAS

Query:  LEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFL
         +ELL + AADKRQE  V   E++RFGNR KD QWHNL +YF R+    + Q+Q +E A   + +L VL Q T+ELY ELQ L RLE+DY ++  E E  
Subjt:  LEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFL

Query:  NQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACR
          +  G+ L+I + EL  QRK+V+S + K LWSR  +E++EKLV +V ++   I   FG    D+          ++LG  GLALHYANII  I+ +  R
Subjt:  NQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACR

Query:  PTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYA
         +SI SN RD+LY++LP  IK+ALRS++++ +  +E +   +K EM++ L WLVP+A NT+KAH   G +GEWA    + +        +  R++TLY+A
Subjt:  PTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYA

Query:  DKVKTELQILELVTLLHHLIHLAKHQQR---RSSSLRCRSPTPKDMAN
         K KTE+ IL  +  L HL+  AK   R   R SS++    +P D  N
Subjt:  DKVKTELQILELVTLLHHLIHLAKHQQR---RSSSLRCRSPTPKDMAN

Q9SA91 Protein PSK SIMULATOR 21.1e-10441.43Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDR------KGNSYLNSEARDP-------NEMPEKSRSDVILL-------PSPPSKTGSNKVAPMNAQAGARG----
        MGGVCS  + K+D   +K+    +D+      K  S   S+  D            + S+ D ++        P PP +  S K    N+  G  G    
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDR------KGNSYLNSEARDP-------NEMPEKSRSDVILL-------PSPPSKTGSNKVAPMNAQAGARG----

Query:  -RAVDLWKTIGISVSNFHINSGVSTGMAPS-GREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVI
         +AV++  T+G S++  + ++   +G+  S G +++ILAFEVANTI+K A L +SLSEEN++ +K ++L SE +K+L+S    EL  +AA+DKR+E  + 
Subjt:  -RAVDLWKTIGISVSNFHINSGVSTGMAPS-GREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVI

Query:  LREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQ
          E+IRFGN CKD QWHNLD+YF +LD+ +S  K  + +A A +QEL  LA+ TSELYHELQAL+R EQDYRR++ EVE LN    GE + I Q EL  Q
Subjt:  LREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQ

Query:  RKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTS
        +KLV+S Q K LWS+NL EI+EKLV VVS+I QTI++ FG +        +     ++LG  GL+LHYAN+I  I+ IA RP+S+PSN+RD LY ALP +
Subjt:  RKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTS

Query:  IKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNAN--RLQTLYYADKVKTELQILELVTLLH
        +K ALR RL+ +D  EE +  ++K EM+K L+WLVP A NT+KAHQ  G +GEWA    E  KG+    NN N  RLQTL++ADK   +  +LELV  LH
Subjt:  IKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNAN--RLQTLYYADKVKTELQILELVTLLH

Query:  HLIHLAK---HQQRRSSSLRCRSPTPKDMANT----SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKK---VLGNKKGMESYKNENKGIWTLSKAVSV
         L+  +K   H  +   +     P  + ++NT    S    +K+Q+  + +D    D + S    P   K   ++G KK  + +K     IW LS++   
Subjt:  HLIHLAK---HQQRRSSSLRCRSPTPKDMANT----SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKK---VLGNKKGMESYKNENKGIWTLSKAVSV

Query:  S
        S
Subjt:  S

Q9XID5 Protein PSK SIMULATOR 12.5e-9344.91Show/hide
Query:  NKVAPMNAQAGAR--GRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELL
        ++V+ +  +AG    G+AVD+  T+G S++N +++ G S+     G +ISIL+FEVANTI K ANL  SLS+++I  LK  +L SE ++ LIS  ++ELL
Subjt:  NKVAPMNAQAGAR--GRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELL

Query:  SIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVE--FLNQA
         IAAADKR+E  +   E++RFGNRCKD Q+HNLD++F RL S  + QK  + EA   + ++      T++LYHEL AL+R EQDY+R+++E E     Q 
Subjt:  SIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVE--FLNQA

Query:  GIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTS
        G+G+TL+I + EL  Q+K VR+ + K LWSR L+E++EKLV VV +++  I + FG  + DK        N +KLG+ GLALHYANII+ I+ +  R ++
Subjt:  GIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTS

Query:  IPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKV
        +P++ RDALY+ LP SIK ALRSR+++   +EE T   +K EM+K L+WLVP+A NT+KAH   G +GEWA+   E ++  A Q     R+ TL++ADK 
Subjt:  IPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKV

Query:  KTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANTSRR-IQFKS
        KTE  IL+LV  LHHL+     Q R ++    RSP    + + +++ IQ  S
Subjt:  KTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANTSRR-IQFKS

Arabidopsis top hitse value%identityAlignment
AT1G30755.1 Protein of unknown function (DUF668)7.8e-10641.43Show/hide
Query:  MGGVCSNGIVKEDFVSEKIIQASEDR------KGNSYLNSEARDP-------NEMPEKSRSDVILL-------PSPPSKTGSNKVAPMNAQAGARG----
        MGGVCS  + K+D   +K+    +D+      K  S   S+  D            + S+ D ++        P PP +  S K    N+  G  G    
Subjt:  MGGVCSNGIVKEDFVSEKIIQASEDR------KGNSYLNSEARDP-------NEMPEKSRSDVILL-------PSPPSKTGSNKVAPMNAQAGARG----

Query:  -RAVDLWKTIGISVSNFHINSGVSTGMAPS-GREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVI
         +AV++  T+G S++  + ++   +G+  S G +++ILAFEVANTI+K A L +SLSEEN++ +K ++L SE +K+L+S    EL  +AA+DKR+E  + 
Subjt:  -RAVDLWKTIGISVSNFHINSGVSTGMAPS-GREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVI

Query:  LREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQ
          E+IRFGN CKD QWHNLD+YF +LD+ +S  K  + +A A +QEL  LA+ TSELYHELQAL+R EQDYRR++ EVE LN    GE + I Q EL  Q
Subjt:  LREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQ

Query:  RKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTS
        +KLV+S Q K LWS+NL EI+EKLV VVS+I QTI++ FG +        +     ++LG  GL+LHYAN+I  I+ IA RP+S+PSN+RD LY ALP +
Subjt:  RKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTS

Query:  IKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNAN--RLQTLYYADKVKTELQILELVTLLH
        +K ALR RL+ +D  EE +  ++K EM+K L+WLVP A NT+KAHQ  G +GEWA    E  KG+    NN N  RLQTL++ADK   +  +LELV  LH
Subjt:  IKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNAN--RLQTLYYADKVKTELQILELVTLLH

Query:  HLIHLAK---HQQRRSSSLRCRSPTPKDMANT----SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKK---VLGNKKGMESYKNENKGIWTLSKAVSV
         L+  +K   H  +   +     P  + ++NT    S    +K+Q+  + +D    D + S    P   K   ++G KK  + +K     IW LS++   
Subjt:  HLIHLAK---HQQRRSSSLRCRSPTPKDMANT----SRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKK---VLGNKKGMESYKNENKGIWTLSKAVSV

Query:  S
        S
Subjt:  S

AT1G34320.1 Protein of unknown function (DUF668)1.8e-9444.91Show/hide
Query:  NKVAPMNAQAGAR--GRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELL
        ++V+ +  +AG    G+AVD+  T+G S++N +++ G S+     G +ISIL+FEVANTI K ANL  SLS+++I  LK  +L SE ++ LIS  ++ELL
Subjt:  NKVAPMNAQAGAR--GRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELL

Query:  SIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVE--FLNQA
         IAAADKR+E  +   E++RFGNRCKD Q+HNLD++F RL S  + QK  + EA   + ++      T++LYHEL AL+R EQDY+R+++E E     Q 
Subjt:  SIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVE--FLNQA

Query:  GIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTS
        G+G+TL+I + EL  Q+K VR+ + K LWSR L+E++EKLV VV +++  I + FG  + DK        N +KLG+ GLALHYANII+ I+ +  R ++
Subjt:  GIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACRPTS

Query:  IPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKV
        +P++ RDALY+ LP SIK ALRSR+++   +EE T   +K EM+K L+WLVP+A NT+KAH   G +GEWA+   E ++  A Q     R+ TL++ADK 
Subjt:  IPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKV

Query:  KTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANTSRR-IQFKS
        KTE  IL+LV  LHHL+     Q R ++    RSP    + + +++ IQ  S
Subjt:  KTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANTSRR-IQFKS

AT3G23160.1 Protein of unknown function (DUF668)1.4e-2522.54Show/hide
Query:  ISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQK
        I IL+FEVAN +SK  +L +SLS+  I  LK E+  SE +++L+S+    LL ++ ++K  +   +   + R G +C +      +  +  + +     +
Subjt:  ISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQK

Query:  Q----AREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVSWI
        +     ++  + ++++      T  LY E++ +  LEQ        V+        E++  F+ +L  QR+ V+S +   LW++  D++VE L   V  I
Subjt:  Q----AREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVSWI

Query:  N---QTIIKEFGVDNTDKTLLIKDRSNGQ-----------------------------------------------------------------------
            +T+    G+       L +DRS  +                                                                       
Subjt:  N---QTIIKEFGVDNTDKTLLIKDRSNGQ-----------------------------------------------------------------------

Query:  -----------------------KLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTS----IKIALRSRLRAVDAREEPTYYDVKTEMDKI
                                +G   L+LHYAN++  +  +   P  I    RD LY+ LPTS    +K +LRS L+ +   + P  +D K  +D I
Subjt:  -----------------------KLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTS----IKIALRSRLRAVDAREEPTYYDVKTEMDKI

Query:  LEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRS
        L WL P+A N  +          W ++     + +  +  N   LQTLY+AD+ KTE  I +L+  L+++ H   ++Q++++ L C S
Subjt:  LEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRS

AT5G08660.1 Protein of unknown function (DUF668)1.4e-8342.19Show/hide
Query:  SKTGSNKVAPMNAQAG--ARGRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISAS
        ++T  +KV   +   G    GRA D+  T+G S+++   + G ++G+A  G E+ ILAFEVANTI K +NL +SLS+ NI+ LK  +L SE ++ L+S  
Subjt:  SKTGSNKVAPMNAQAG--ARGRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISAS

Query:  LEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFL
         +ELL + AADKRQE  V   E++RFGNR KD QWHNL +YF R+    + Q+Q +E A   + +L VL Q T+ELY ELQ L RLE+DY ++  E E  
Subjt:  LEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFL

Query:  NQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACR
          +  G+ L+I + EL  QRK+V+S + K LWSR  +E++EKLV +V ++   I   FG    D+          ++LG  GLALHYANII  I+ +  R
Subjt:  NQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISHINLIACR

Query:  PTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYA
         +SI SN RD+LY++LP  IK+ALRS++++ +  +E +   +K EM++ L WLVP+A NT+KAH   G +GEWA    + +        +  R++TLY+A
Subjt:  PTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANRLQTLYYA

Query:  DKVKTELQILELVTLLHHLIHLAKHQQR---RSSSLRCRSPTPKDMAN
         K KTE+ IL  +  L HL+  AK   R   R SS++    +P D  N
Subjt:  DKVKTELQILELVTLLHHLIHLAKHQQR---RSSSLRCRSPTPKDMAN

AT5G51670.1 Protein of unknown function (DUF668)3.1e-2223.71Show/hide
Query:  ISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDS---QWHNLDQYFSRLDSNDS
        + +L+FEVA  ++K+ +L+ SL++ N+   ++  L  E + ++++      LS+  A+           + R  NRC  +    +H L   F+ +  +  
Subjt:  ISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDS---QWHNLDQYFSRLDSNDS

Query:  S-QKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRR--------VEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
              ++  A  +++      T+ LY E++ +  LE   R++         EE ++ N+  + + + + Q ++  Q++ V+  + + LW+++ D +V  
Subjt:  S-QKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRR--------VEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LV-------------------------IVVSWINQTIIKEFGVDN--------------TDKTLLIKDRSNGQK-----LGAVGLALHYANIISHINLIA
        L                           VVS + +++       N              T  +  +++ S   K     LG  G+ALHYAN+I  +  + 
Subjt:  LV-------------------------IVVSWINQTIIKEFGVDN--------------TDKTLLIKDRSNGQK-----LGAVGLALHYANIISHINLIA

Query:  CRPTSIPSNMRDALYRALPTSIKIALRSRLRAV--DAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANR---
         +P  +  + RD LY  LP S++ +LRSRL+ V   A +     + K  + +IL WL+P+A N  +          W ++     +  AT  N+ NR   
Subjt:  CRPTSIPSNMRDALYRALPTSIKIALRSRLRAV--DAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRATQNNNANR---

Query:  LQTLYYADKVKTELQILELVTLLHHL
        +QTL +ADKVKTE  I EL+  L+++
Subjt:  LQTLYYADKVKTELQILELVTLLHHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGGGTTTGTTCTAATGGAATAGTTAAGGAGGATTTTGTATCTGAGAAGATAATTCAAGCATCTGAGGATCGGAAAGGGAATTCGTATTTGAATTCTGAGGCTAG
GGATCCGAATGAAATGCCAGAGAAGTCTCGTTCGGATGTGATACTATTGCCTTCACCTCCGTCTAAGACTGGAAGCAATAAGGTTGCACCAATGAACGCACAAGCAGGGG
CTCGCGGGAGGGCAGTTGATTTATGGAAAACAATTGGGATCAGTGTGTCAAATTTCCATATCAACAGTGGGGTTTCTACAGGCATGGCGCCTAGTGGTAGGGAGATTTCT
ATATTGGCTTTTGAAGTAGCAAACACAATAAGCAAAGTAGCAAATTTGTCAAAATCTCTCTCAGAAGAAAATATTCAGCTCCTTAAAAACGAACTATTACAATCAGAAGC
GATAAAACAATTGATCTCAGCAAGTTTAGAGGAATTGCTTAGCATTGCAGCTGCTGACAAAAGGCAGGAATTTGGCGTCATCTTACGGGAGATAATACGATTTGGAAATC
GGTGCAAGGATTCACAGTGGCATAATCTGGATCAGTACTTTTCAAGACTAGATTCGAATGATTCAAGTCAAAAACAAGCTCGAGAGGCCAGAGCAGCCTTACAGGAACTA
ACCGTTTTAGCTCAGAATACTTCTGAATTATACCACGAATTACAAGCATTGGAAAGACTTGAGCAAGATTATAGACGGAGAGTTGAGGAAGTGGAGTTCTTGAACCAAGC
AGGAATAGGAGAAACTCTCTCAATCTTCCAAGGAGAACTAAACGTACAAAGAAAACTTGTAAGGAGTTTCCAAAGCAAGTGTCTTTGGTCCAGAAATCTAGATGAGATTG
TGGAAAAGCTCGTTATCGTTGTATCATGGATAAATCAAACAATAATCAAAGAATTTGGAGTTGACAATACAGATAAAACATTGCTTATCAAGGACAGAAGTAACGGCCAG
AAACTGGGTGCCGTTGGTCTTGCCTTGCATTATGCAAACATAATCAGCCATATAAATCTCATTGCGTGCCGTCCAACCTCCATTCCTTCAAACATGAGGGATGCATTATA
CCGGGCATTGCCTACAAGCATTAAAATAGCTCTGCGTTCTCGATTGCGGGCTGTGGATGCAAGGGAGGAGCCAACTTATTATGACGTCAAAACTGAAATGGATAAGATCC
TTGAATGGCTTGTTCCGATTGCTGCAAACACGAGCAAAGCGCATCAATCTTGTGGCAGAATTGGAGAATGGGCAACCCAAAGTAAGGAACACAGCAAAGGCAGAGCTACA
CAAAACAACAATGCAAACCGCCTTCAAACGCTCTACTATGCAGACAAAGTAAAAACAGAGTTACAAATTCTTGAATTAGTCACATTGCTTCACCATCTCATCCATTTAGC
AAAACACCAACAACGACGCTCCTCATCTCTCCGTTGCCGATCACCAACTCCCAAGGACATGGCAAATACTTCTCGTCGTATCCAATTCAAGAGCCAAATTATCAGAACCA
CCAAGGATGGATTTCCGACCGACAATATACCATCGCCCGGCCAAACTCCGATCAGAAAAAAGGTGCTCGGTAACAAAAAAGGAATGGAATCTTACAAAAATGAGAATAAA
GGAATTTGGACGTTAAGTAAAGCAGTTTCGGTTTCAACCTTGAGGTCTCTTGGTAGAGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGGGTTTGTTCTAATGGAATAGTTAAGGAGGATTTTGTATCTGAGAAGATAATTCAAGCATCTGAGGATCGGAAAGGGAATTCGTATTTGAATTCTGAGGCTAG
GGATCCGAATGAAATGCCAGAGAAGTCTCGTTCGGATGTGATACTATTGCCTTCACCTCCGTCTAAGACTGGAAGCAATAAGGTTGCACCAATGAACGCACAAGCAGGGG
CTCGCGGGAGGGCAGTTGATTTATGGAAAACAATTGGGATCAGTGTGTCAAATTTCCATATCAACAGTGGGGTTTCTACAGGCATGGCGCCTAGTGGTAGGGAGATTTCT
ATATTGGCTTTTGAAGTAGCAAACACAATAAGCAAAGTAGCAAATTTGTCAAAATCTCTCTCAGAAGAAAATATTCAGCTCCTTAAAAACGAACTATTACAATCAGAAGC
GATAAAACAATTGATCTCAGCAAGTTTAGAGGAATTGCTTAGCATTGCAGCTGCTGACAAAAGGCAGGAATTTGGCGTCATCTTACGGGAGATAATACGATTTGGAAATC
GGTGCAAGGATTCACAGTGGCATAATCTGGATCAGTACTTTTCAAGACTAGATTCGAATGATTCAAGTCAAAAACAAGCTCGAGAGGCCAGAGCAGCCTTACAGGAACTA
ACCGTTTTAGCTCAGAATACTTCTGAATTATACCACGAATTACAAGCATTGGAAAGACTTGAGCAAGATTATAGACGGAGAGTTGAGGAAGTGGAGTTCTTGAACCAAGC
AGGAATAGGAGAAACTCTCTCAATCTTCCAAGGAGAACTAAACGTACAAAGAAAACTTGTAAGGAGTTTCCAAAGCAAGTGTCTTTGGTCCAGAAATCTAGATGAGATTG
TGGAAAAGCTCGTTATCGTTGTATCATGGATAAATCAAACAATAATCAAAGAATTTGGAGTTGACAATACAGATAAAACATTGCTTATCAAGGACAGAAGTAACGGCCAG
AAACTGGGTGCCGTTGGTCTTGCCTTGCATTATGCAAACATAATCAGCCATATAAATCTCATTGCGTGCCGTCCAACCTCCATTCCTTCAAACATGAGGGATGCATTATA
CCGGGCATTGCCTACAAGCATTAAAATAGCTCTGCGTTCTCGATTGCGGGCTGTGGATGCAAGGGAGGAGCCAACTTATTATGACGTCAAAACTGAAATGGATAAGATCC
TTGAATGGCTTGTTCCGATTGCTGCAAACACGAGCAAAGCGCATCAATCTTGTGGCAGAATTGGAGAATGGGCAACCCAAAGTAAGGAACACAGCAAAGGCAGAGCTACA
CAAAACAACAATGCAAACCGCCTTCAAACGCTCTACTATGCAGACAAAGTAAAAACAGAGTTACAAATTCTTGAATTAGTCACATTGCTTCACCATCTCATCCATTTAGC
AAAACACCAACAACGACGCTCCTCATCTCTCCGTTGCCGATCACCAACTCCCAAGGACATGGCAAATACTTCTCGTCGTATCCAATTCAAGAGCCAAATTATCAGAACCA
CCAAGGATGGATTTCCGACCGACAATATACCATCGCCCGGCCAAACTCCGATCAGAAAAAAGGTGCTCGGTAACAAAAAAGGAATGGAATCTTACAAAAATGAGAATAAA
GGAATTTGGACGTTAAGTAAAGCAGTTTCGGTTTCAACCTTGAGGTCTCTTGGTAGAGTCTAGATTTTGAGTTGAAGAAGGATGAAGCTTGGAATTGGTTTAATGTGGAT
AATAACAATTTTGAGTACCTCACACGTTTATACCAAAAATAAGTACAGGGAATTTTTTAAAGATTTTTTGACCGTGTCTTCTAGTTGAATTCCTTTGTGTATATTACTCT
TTTGGGTGAGGTTTTGTTCCGAATTCCCCACTTGTTTTGTATATTAATTGCACCTCTCCAGAATAATGGTCAACGTTCTTAGTTACATGCGCTATATACGATTTGCATAA
AATTG
Protein sequenceShow/hide protein sequence
MGGVCSNGIVKEDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVSTGMAPSGREIS
ILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAREARAALQEL
TVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVSWINQTIIKEFGVDNTDKTLLIKDRSNGQ
KLGAVGLALHYANIISHINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEPTYYDVKTEMDKILEWLVPIAANTSKAHQSCGRIGEWATQSKEHSKGRAT
QNNNANRLQTLYYADKVKTELQILELVTLLHHLIHLAKHQQRRSSSLRCRSPTPKDMANTSRRIQFKSQIIRTTKDGFPTDNIPSPGQTPIRKKVLGNKKGMESYKNENK
GIWTLSKAVSVSTLRSLGRV