CuGenDBv2

Clc04G09680 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G09680
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: protein PTST homolog 3, chloroplastic isoform X2
Genome location: ClcChr04:23226417..23235217
RNA-Seq Expression: Clc04G09680
Synteny: Clc04G09680
Gene Ontology terms:
  GO:0010581 - regulation of starch biosynthetic process (biological process)
  GO:0009507 - chloroplast (cellular component)
InterPro domains:
  IPR013783 - Immunoglobulin-like fold
  IPR014756 - Immunoglobulin E-set
  IPR032640 - AMP-activated protein kinase, glycogen-binding domain
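
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below splits that string and computes the length of the gene region. The helper name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("ClcChr04:23226417..23235217")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the convention is worth confirming before relying on the number.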


Homology
GenBank top hits (e value, % identity, alignment)
XP_008462485.1 PREDICTED: uncharacterized protein LOC103500828 isoform X1 [Cucumis melo] (e value: 4.3e-286; % identity: 77.36)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF
        MATLSHFPSLLSLSSRN SFLDQL T N HPKFHCFGHHHH HPR+ S VC CSIR+SRAS+R KSNEELCNDIREFIRSVGLP+DH+PSTKELSQHGR 
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF

Query:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLT-GQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCI
                           TDLANIVRRRGHKL+RELL+TNST  VE DCDLGNITLT GQDGEA DVVED        VLEN S+ SN +H FSFDDC 
Subjt:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLT-GQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCI

Query:  SAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNM
        S PT T NSS EEELSNDLI HDEYN S+REN+ENIE VE D S KTE  ASEDC  S+NI LG+ C+D+TGE+ E SKNLS+EN ENSLECQSEVT++ 
Subjt:  SAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNM

Query:  VDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKY
        VDES WS EV  GENYMINST DEYLDMHDH ++PLLL   SS KEE L YSNEQVEKEDN VDDVSLSA MTIIDDQSSGLNID+AL   ESS +L+KY
Subjt:  VDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKY

Query:  SEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPST
        SE+LSLAEKVARFIQNGDLDI+DDNF++T SESGAG+GNG  TA NAEESEINFHVEAFSEDTTASRGS+MASNGSASEF+D++STT VGQ IRDDQPST
Subjt:  SEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPST

Query:  EALNGQIDKVLGAEA-FCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVH
        EALNGQI+KV GAE  FCLSMVQIK+SENQVEIDRLKF+LHQKELELSQLKEQIERDKLALS  QSKA AE+S AQKLILE+D+ELV AEE L GLEEV 
Subjt:  EALNGQIDKVLGAEA-FCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVH

Query:  IHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR
        IHY GEGEIVEVAG FNGWH +IKMDPQPSS+ +DS +SKK   WSTVLWLYPGVYEIKF+VDGHWKIDPHRES TKGAI+NNILRVGR
Subjt:  IHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR

XP_008462487.1 PREDICTED: uncharacterized protein LOC103500828 isoform X2 [Cucumis melo] (e value: 1.7e-287; % identity: 77.47)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF
        MATLSHFPSLLSLSSRN SFLDQL T N HPKFHCFGHHHH HPR+ S VC CSIR+SRAS+R KSNEELCNDIREFIRSVGLP+DH+PSTKELSQHGR 
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF

Query:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCIS
                           TDLANIVRRRGHKL+RELL+TNST  VE DCDLGNITLTGQDGEA DVVED        VLEN S+ SN +H FSFDDC S
Subjt:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCIS

Query:  APTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNMV
         PT T NSS EEELSNDLI HDEYN S+REN+ENIE VE D S KTE  ASEDC  S+NI LG+ C+D+TGE+ E SKNLS+EN ENSLECQSEVT++ V
Subjt:  APTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNMV

Query:  DESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKYS
        DES WS EV  GENYMINST DEYLDMHDH ++PLLL   SS KEE L YSNEQVEKEDN VDDVSLSA MTIIDDQSSGLNID+AL   ESS +L+KYS
Subjt:  DESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKYS

Query:  EELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPSTE
        E+LSLAEKVARFIQNGDLDI+DDNF++T SESGAG+GNG  TA NAEESEINFHVEAFSEDTTASRGS+MASNGSASEF+D++STT VGQ IRDDQPSTE
Subjt:  EELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPSTE

Query:  ALNGQIDKVLGAEA-FCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHI
        ALNGQI+KV GAE  FCLSMVQIK+SENQVEIDRLKF+LHQKELELSQLKEQIERDKLALS  QSKA AE+S AQKLILE+D+ELV AEE L GLEEV I
Subjt:  ALNGQIDKVLGAEA-FCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHI

Query:  HYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR
        HY GEGEIVEVAG FNGWH +IKMDPQPSS+ +DS +SKK   WSTVLWLYPGVYEIKF+VDGHWKIDPHRES TKGAI+NNILRVGR
Subjt:  HYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR

XP_008462488.1 PREDICTED: uncharacterized protein LOC103500828 isoform X3 [Cucumis melo] (e value: 5.4e-281; % identity: 76.45)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF
        MATLSHFPSLLSLSSRN SFLDQL T N HPKFHCFGHHHH HPR+ S VC CSIR+SRAS+R KSNEELCNDIREFIRSVGLP+DH+PSTKELSQHGR 
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF

Query:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLT-GQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCI
                           TDLANIVRRRGHKL+RELL+TNST  VE DCDLGNITLT GQDGEA DVVED        VLEN S+ SN +H FSFDDC 
Subjt:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLT-GQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCI

Query:  SAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNM
        S PT T NSS EEELSNDLI HDEYN S+REN+ENIE VE D S KTE  ASEDC  S+NI LG+ C+D+TGE+ E SKNLS+EN ENSLECQSEVT++ 
Subjt:  SAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNM

Query:  VDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKY
        VDES WS EV  GENYMINST DEYLDMHDH ++PLLL   SS KEE L YSNEQVEKEDN VDDVSLSA MTIIDDQSSGLNID+AL   ESS +L+KY
Subjt:  VDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKY

Query:  SEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPST
        SE+LSLAEKVARFIQNGDLDI+DDNF++T SESGAG+GNG  TA NAEESEINFHVEAFSEDTTASRGS+MASNGSASEF+D++STT VGQ IRDDQPST
Subjt:  SEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPST

Query:  EALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHI
        EALNGQI+KV GAE        IK+SENQVEIDRLKF+LHQKELELSQLKEQIERDKLALS  QSKA AE+S AQKLILE+D+ELV AEE L GLEEV I
Subjt:  EALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHI

Query:  HYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR
        HY GEGEIVEVAG FNGWH +IKMDPQPSS+ +DS +SKK   WSTVLWLYPGVYEIKF+VDGHWKIDPHRES TKGAI+NNILRVGR
Subjt:  HYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR

XP_038882848.1 protein PTST homolog 3, chloroplastic isoform X1 [Benincasa hispida] (e value: 0.0e+00; % identity: 85.78)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRK-SSVCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF
        MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPR+ S+VCTCSIRNSR S++TKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGR 
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRK-SSVCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF

Query:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCIS
                           TDLANIVRRRGHKLIRELLVTNSTN VESDC+LGNITLTGQDGEA DVVEDLSS NKVLVLENLSH SN  HTFSFDDCIS
Subjt:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCIS

Query:  APTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCN--DYTGEITESSKNLSVENPENSLECQSEVTDN
        APTTT NSSVEEELSNDLI HDEYNESH ENN+N+ TVEDDTSMKTE TASEDC  S NI LG+ CN  DYTGE+TE SKNL +ENPE SLECQ EV   
Subjt:  APTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCN--DYTGEITESSKNLSVENPENSLECQSEVTDN

Query:  MVDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIK
         V ESP SPEVL G+NYMINSTVDEYLDMHDHVDKPLLLITGSS+KEEDL+YSNEQV+KEDNNVDD+SLSA MTII DQSSGLNIDKALE  ESSYKLIK
Subjt:  MVDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIK

Query:  YSEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPS
        YSEELSLAE+V RFIQNGDLDIIDDNF++T SESGAGKGNGSFTAVNAEESEINFH EAFSEDTTASRGSVMASNGSASEFED+MSTTTVGQ  RDDQPS
Subjt:  YSEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPS

Query:  TEALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVH
        TEALNGQI+KV GAE        IKISENQVEI+RLKF+LHQKELELSQLK+QIERDKLALSALQSKA AEISKAQKLILEKDTELVAAEESLSGLEEVH
Subjt:  TEALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVH

Query:  IHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR
        IHYGGEGEIVEVAGSFNGWH+RIKMDPQPSSNP+DS +SKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRES TKGAISNNILRVGR
Subjt:  IHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR

XP_038882852.1 protein PTST homolog 3, chloroplastic isoform X2 [Benincasa hispida] (e value: 3.7e-309; % identity: 83.6)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRK-SSVCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF
        MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPR+ S+VCTCSIRNSR S++TKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGR 
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRK-SSVCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF

Query:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCIS
                           TDLANIVRRRGHKLIRELLVTNSTN VESDC+LGNITLTGQDGEA DVVEDLSS NKVLVLENLSH SN  HTFSFDDCIS
Subjt:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCIS

Query:  APTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCN--DYTGEITESSKNLSVENPENSLECQSEVTDN
        APTTT NSSVEEELSNDLI HDEYNESH ENN+N+ TVEDDTSMKTE TASEDC  S NI LG+ CN  DYTGE+TE SKNL +ENPE SLECQ EV   
Subjt:  APTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCN--DYTGEITESSKNLSVENPENSLECQSEVTDN

Query:  MVDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIK
         V ESP SPEVL G+NYMINSTVDEYLDMHDHVDKPLLLITGSS+KEEDL+YSNEQV+KEDNNVDD+SLSA MTII DQSSGLNIDKALE  ESSYKLIK
Subjt:  MVDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIK

Query:  YSEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPS
        YSEELSLAE+V RFIQNGDLDIIDDNF++T SESGAGKGNGSFTAVNAEESEINFH EAFSEDTTASRGSVMASNGSASEFED+MSTTTVGQ  RDDQPS
Subjt:  YSEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPS

Query:  TEALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVH
        TEALNGQI+KV GAE                         HQKELELSQLK+QIERDKLALSALQSKA AEISKAQKLILEKDTELVAAEESLSGLEEVH
Subjt:  TEALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVH

Query:  IHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR
        IHYGGEGEIVEVAGSFNGWH+RIKMDPQPSSNP+DS +SKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRES TKGAISNNILRVGR
Subjt:  IHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KF60 AMPK1_CBM domain-containing protein (e value: 2.6e-281; % identity: 77.7)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSSVCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRFI
        MATLSHF SLLSLSSRN SFLDQL TQN HPKFHC GH H R  R+S VCTCSI NSRAS+R KSNEELCNDIREFIRSVGLPEDH+PSTKELSQHGR  
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSSVCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRFI

Query:  RFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCISA
                          TDLANIVRRRGHK +RELL+ NST  VE DCDLGNITLTGQDGEA DVVED       LVLEN SH SN +H FSF+DCIS 
Subjt:  RFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCISA

Query:  PTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNMVD
        PTTT NSS EEELS DLI HDEYN S+REN+ENIETVE D S KTE  ASEDC  S++I LG  C+D TGE+ E SKNLSVEN ENSLECQSEVT+N  D
Subjt:  PTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNMVD

Query:  ESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKYSE
        ES WS EV+  ENY+INST DEYLDMHDH + PLLL   SSSKEE L+YSNEQVEKEDN VD VSLSA MTIIDD+SSGLNID+AL  E+SS KL+KYSE
Subjt:  ESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKYSE

Query:  ELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPSTEA
        ELSLAEKVARFIQNGDLDI+DDNF++T SESGAGKGNGS  A NAEESEINFHVEAFSEDTTASRGS+MASNGSASEF+D +STTTVGQ IRD+QPSTEA
Subjt:  ELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPSTEA

Query:  LNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHY
        LNGQI+KV GAE        IK+SENQVEIDRLKF+LHQKELELSQLKEQIERDKLALSA QSKA AEIS AQKLILE+D+ELVAAEE L GLEEV IHY
Subjt:  LNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHY

Query:  GGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR
         GEGEIVEVAGSFNGWH +IKMDPQPSSN +DS +SKK R WSTVLWLYPGVYEIKFVVDGHWKIDPHRES TKGAISNNILRVGR
Subjt:  GGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR

A0A1S3CH30 uncharacterized protein LOC103500828 isoform X2 (e value: 8.4e-288; % identity: 77.47)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF
        MATLSHFPSLLSLSSRN SFLDQL T N HPKFHCFGHHHH HPR+ S VC CSIR+SRAS+R KSNEELCNDIREFIRSVGLP+DH+PSTKELSQHGR 
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF

Query:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCIS
                           TDLANIVRRRGHKL+RELL+TNST  VE DCDLGNITLTGQDGEA DVVED        VLEN S+ SN +H FSFDDC S
Subjt:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCIS

Query:  APTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNMV
         PT T NSS EEELSNDLI HDEYN S+REN+ENIE VE D S KTE  ASEDC  S+NI LG+ C+D+TGE+ E SKNLS+EN ENSLECQSEVT++ V
Subjt:  APTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNMV

Query:  DESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKYS
        DES WS EV  GENYMINST DEYLDMHDH ++PLLL   SS KEE L YSNEQVEKEDN VDDVSLSA MTIIDDQSSGLNID+AL   ESS +L+KYS
Subjt:  DESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKYS

Query:  EELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPSTE
        E+LSLAEKVARFIQNGDLDI+DDNF++T SESGAG+GNG  TA NAEESEINFHVEAFSEDTTASRGS+MASNGSASEF+D++STT VGQ IRDDQPSTE
Subjt:  EELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPSTE

Query:  ALNGQIDKVLGAEA-FCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHI
        ALNGQI+KV GAE  FCLSMVQIK+SENQVEIDRLKF+LHQKELELSQLKEQIERDKLALS  QSKA AE+S AQKLILE+D+ELV AEE L GLEEV I
Subjt:  ALNGQIDKVLGAEA-FCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHI

Query:  HYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR
        HY GEGEIVEVAG FNGWH +IKMDPQPSS+ +DS +SKK   WSTVLWLYPGVYEIKF+VDGHWKIDPHRES TKGAI+NNILRVGR
Subjt:  HYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR

A0A1S3CH40 uncharacterized protein LOC103500828 isoform X3 (e value: 2.6e-281; % identity: 76.45)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF
        MATLSHFPSLLSLSSRN SFLDQL T N HPKFHCFGHHHH HPR+ S VC CSIR+SRAS+R KSNEELCNDIREFIRSVGLP+DH+PSTKELSQHGR 
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF

Query:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLT-GQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCI
                           TDLANIVRRRGHKL+RELL+TNST  VE DCDLGNITLT GQDGEA DVVED        VLEN S+ SN +H FSFDDC 
Subjt:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLT-GQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCI

Query:  SAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNM
        S PT T NSS EEELSNDLI HDEYN S+REN+ENIE VE D S KTE  ASEDC  S+NI LG+ C+D+TGE+ E SKNLS+EN ENSLECQSEVT++ 
Subjt:  SAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNM

Query:  VDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKY
        VDES WS EV  GENYMINST DEYLDMHDH ++PLLL   SS KEE L YSNEQVEKEDN VDDVSLSA MTIIDDQSSGLNID+AL   ESS +L+KY
Subjt:  VDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKY

Query:  SEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPST
        SE+LSLAEKVARFIQNGDLDI+DDNF++T SESGAG+GNG  TA NAEESEINFHVEAFSEDTTASRGS+MASNGSASEF+D++STT VGQ IRDDQPST
Subjt:  SEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPST

Query:  EALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHI
        EALNGQI+KV GAE        IK+SENQVEIDRLKF+LHQKELELSQLKEQIERDKLALS  QSKA AE+S AQKLILE+D+ELV AEE L GLEEV I
Subjt:  EALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHI

Query:  HYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR
        HY GEGEIVEVAG FNGWH +IKMDPQPSS+ +DS +SKK   WSTVLWLYPGVYEIKF+VDGHWKIDPHRES TKGAI+NNILRVGR
Subjt:  HYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR

A0A1S3CIK3 uncharacterized protein LOC103500828 isoform X1 (e value: 2.1e-286; % identity: 77.36)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF
        MATLSHFPSLLSLSSRN SFLDQL T N HPKFHCFGHHHH HPR+ S VC CSIR+SRAS+R KSNEELCNDIREFIRSVGLP+DH+PSTKELSQHGR 
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSS-VCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF

Query:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLT-GQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCI
                           TDLANIVRRRGHKL+RELL+TNST  VE DCDLGNITLT GQDGEA DVVED        VLEN S+ SN +H FSFDDC 
Subjt:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLT-GQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCI

Query:  SAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNM
        S PT T NSS EEELSNDLI HDEYN S+REN+ENIE VE D S KTE  ASEDC  S+NI LG+ C+D+TGE+ E SKNLS+EN ENSLECQSEVT++ 
Subjt:  SAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNM

Query:  VDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKY
        VDES WS EV  GENYMINST DEYLDMHDH ++PLLL   SS KEE L YSNEQVEKEDN VDDVSLSA MTIIDDQSSGLNID+AL   ESS +L+KY
Subjt:  VDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKY

Query:  SEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPST
        SE+LSLAEKVARFIQNGDLDI+DDNF++T SESGAG+GNG  TA NAEESEINFHVEAFSEDTTASRGS+MASNGSASEF+D++STT VGQ IRDDQPST
Subjt:  SEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPST

Query:  EALNGQIDKVLGAEA-FCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVH
        EALNGQI+KV GAE  FCLSMVQIK+SENQVEIDRLKF+LHQKELELSQLKEQIERDKLALS  QSKA AE+S AQKLILE+D+ELV AEE L GLEEV 
Subjt:  EALNGQIDKVLGAEA-FCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVH

Query:  IHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR
        IHY GEGEIVEVAG FNGWH +IKMDPQPSS+ +DS +SKK   WSTVLWLYPGVYEIKF+VDGHWKIDPHRES TKGAI+NNILRVGR
Subjt:  IHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR

A0A6J1EJK2 protein PTST homolog 3, chloroplastic isoform X2 (e value: 6.3e-275; % identity: 76.31)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPK-FHCFGHHHHRHPRKS-SVCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGR
        MAT SHFPS  SLSS N SFLDQLQT+N H K  HCFG HHHR  R+  SVC CSIRNSRAS+RTKSNEELCNDIREFI SVGLP+DH+PSTK+L QHGR
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPK-FHCFGHHHHRHPRKS-SVCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGR

Query:  FIRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCI
                             DLANIVRRRGHKLIRELLVTNSTN V++DCDL N+ LTGQDGE  DVVEDLSS NKV VLENLS  SN +H F+ ++ I
Subjt:  FIRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCI

Query:  SAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNM
         APT+T + SV EEL ND I HDEYNESH ENNEN+E VEDDTS+KTE T S D   S+NI   +  ND+TGE TE +KNLSVENPENSLECQ EV DNM
Subjt:  SAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNM

Query:  VDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKY
        VDES  SP+VL GENYMINSTVD YLD+HDH DKPLLL TGSSSKEE  + SNEQ++K DNNVDD SLS  MT++DDQSS L+I K LE EES+YKLIKY
Subjt:  VDESPWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKY

Query:  SEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPST
        SEELSLAEKVARFIQNGDL++IDDNF++T SE GAG+GNGS TAVNAEESEIN HVEA SEDTT +RGSVMASNGSASEFE  +S+TTVGQ IRDDQPST
Subjt:  SEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPST

Query:  EALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHI
        EA N QIDKVLGAE        IKI+ENQV+IDRLKF+LHQKELELSQLKEQIERDKLALSALQSKA AEISKAQKLILEKDTELVAAEESLSGLEEVHI
Subjt:  EALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHI

Query:  HYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR
        HYGGEGEIVEVAGSFNGWH RIKMDPQPS N +DS +SKK RLWSTVLWLYPGVYEIKFVVDG WKIDPHRES  KGAISNNILRVGR
Subjt:  HYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRVGR

SwissProt top hits (e value, % identity, alignment)
F4KFB3 Protein PTST homolog 3, chloroplastic (e value: 1.6e-65; % identity: 34.34)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSSVC-TCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF
        MAT+S  P   S+S     F       +Q   F  + +   +H   S +C  CS + +R  KR KSNEEL ++I +F+   GLPE H+PS KELS HGR 
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSSVC-TCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF

Query:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTN----STNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFD
                            DLANIVRRRG+K I+EL+  +      N++ +D +  N  +  + G +   +ED S+        +LS  +    + S D
Subjt:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTN----STNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFD

Query:  DCISAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVT
        +     +  G  S+E   SN  +    ++    E    IE+VE +     E ++SE     A++F      +++ ++ ++S    +E    S+    EV 
Subjt:  DCISAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVT

Query:  DNMVDES-PWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKE---DNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEES
        D   D S  +     P  N+  +      L+   HVD    + TGSS    DL   N     E   +  +DD++ +          SG   D  +E E++
Subjt:  DNMVDES-PWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKE---DNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEES

Query:  SYKLIKYSEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTI
         +     S   S+ EK  RFIQNG LD +           GA + +    +   E SE     E   +     R      NGSA   ++ +  T V  + 
Subjt:  SYKLIKYSEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTI

Query:  RDDQPSTEALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLS
        R+        + Q D  +G +       + +  ENQVEIDRL+F+L QKELELS+LKEQIE++KL+LS LQ +A  EI KAQ LI EK+ EL  AEESLS
Subjt:  RDDQPSTEALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLS

Query:  GLEEVHIHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRV
        GL+EV I Y G+G  VEV GSFNGW  R+ M+ Q S +        K + WST+LWLYPG YEIKF+VDG W  DP ++S T+G ISNNIL+V
Subjt:  GLEEVHIHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRV

Q10F03 Protein FLOURY ENDOSPERM 6, chloroplastic (e value: 3.1e-13; % identity: 34.48)
Query:  KELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKP
        +E ++ Q +E++   +  ++ L+ K A EI +  K+I EK   L  AE++LS L  V+I +      V + GSF+GW S+ +M+                
Subjt:  KELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKP

Query:  RLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRV
          +S  L LYPG YEIKF+VDG W+ DP R   +     NN+L V
Subjt:  RLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRV

Q94AX2 Protein PTST, chloroplastic (e value: 3.8e-11; % identity: 31.21)
Query:  QIKISENQVEIDRLKFLLHQKELE-LSQLKEQIER-------DKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHYGGEGEIVEVAG
        Q+K  E+++   + +  L + E++ L +L E+I          K++   +QS   + +   QK + E+   + AA+      +EVH+ + G  E V+V G
Subjt:  QIKISENQVEIDRLKFLLHQKELE-LSQLKEQIER-------DKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHYGGEGEIVEVAG

Query:  SFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRV
        SF+GW  R  + P+ S     +  +K    +ST L+L PG YE+KF+VDG W+I P   +  +G + NN+L V
Subjt:  SFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRV

Q9LFY0 Protein PTST homolog 2, chloroplastic (e value: 3.1e-13; % identity: 34.97)
Query:  KELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKP
        KE E+   + ++   +  L+ L+ K A  I  AQ+++ EK   +  A  +L  L    I +      V + GSF+GW ++ KM  + + N V S   K  
Subjt:  KELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKP

Query:  RLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNIL
                LYPG YEIKF+VDG WK+DP R   T G   NN+L
Subjt:  RLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNIL

Arabidopsis top hits (e value, % identity, alignment)
AT1G27070.1 5'-AMP-activated protein kinase-related (e value: 2.2e-14; % identity: 34.97)
Query:  KELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKP
        KE E+   + ++   +  L+ L+ K A  I  AQ+++ EK   +  A  +L  L    I +      V + GSF+GW ++ KM  + + N V S   K  
Subjt:  KELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKP

Query:  RLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNIL
                LYPG YEIKF+VDG WK+DP R   T G   NN+L
Subjt:  RLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNIL

AT5G03420.1 5'-AMP-activated protein kinase-related (e value: 1.1e-66; % identity: 34.34)
Query:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSSVC-TCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF
        MAT+S  P   S+S     F       +Q   F  + +   +H   S +C  CS + +R  KR KSNEEL ++I +F+   GLPE H+PS KELS HGR 
Subjt:  MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSSVC-TCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRF

Query:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTN----STNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFD
                            DLANIVRRRG+K I+EL+  +      N++ +D +  N  +  + G +   +ED S+        +LS  +    + S D
Subjt:  IRFLNASVTVLHLQAGVVVTDLANIVRRRGHKLIRELLVTN----STNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFD

Query:  DCISAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVT
        +     +  G  S+E   SN  +    ++    E    IE+VE +     E ++SE     A++F      +++ ++ ++S    +E    S+    EV 
Subjt:  DCISAPTTTGNSSVEEELSNDLIFHDEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVT

Query:  DNMVDES-PWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKE---DNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEES
        D   D S  +     P  N+  +      L+   HVD    + TGSS    DL   N     E   +  +DD++ +          SG   D  +E E++
Subjt:  DNMVDES-PWSPEVLPGENYMINSTVDEYLDMHDHVDKPLLLITGSSSKEEDLFYSNEQVEKE---DNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEES

Query:  SYKLIKYSEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTI
         +     S   S+ EK  RFIQNG LD +           GA + +    +   E SE     E   +     R      NGSA   ++ +  T V  + 
Subjt:  SYKLIKYSEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSFTAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTI

Query:  RDDQPSTEALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLS
        R+        + Q D  +G +       + +  ENQVEIDRL+F+L QKELELS+LKEQIE++KL+LS LQ +A  EI KAQ LI EK+ EL  AEESLS
Subjt:  RDDQPSTEALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQIERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLS

Query:  GLEEVHIHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRV
        GL+EV I Y G+G  VEV GSFNGW  R+ M+ Q S +        K + WST+LWLYPG YEIKF+VDG W  DP ++S T+G ISNNIL+V
Subjt:  GLEEVHIHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRV

AT5G39790.1 5'-AMP-activated protein kinase-related (e value: 6.0e-12; % identity: 32.56)
Query:  ISENQVEIDRLKFLLHQKELE---LSQLKEQIER-------DKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHYGGEGEIVEVAGS
        +   + EI  +K  L   ELE   L +L E+I          K++   +QS   + +   QK + E+   + AA+      +EVH+ + G  E V+V GS
Subjt:  ISENQVEIDRLKFLLHQKELE---LSQLKEQIER-------DKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHYGGEGEIVEVAGS

Query:  FNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRV
        F+GW  R  + P+ S     +  +K    +ST L+L PG YE+KF+VDG W+I P   +  +G + NN+L V
Subjt:  FNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVDGHWKIDPHRESGTKGAISNNILRV


Sequences
CDS sequence
ATGGCTACGCTCTCTCATTTTCCCTCATTGCTGTCTCTCTCTTCTCGGAATCTCTCCTTCCTAGACCAACTGCAAACGCAGAATCAGCACCCGAAGTTTCACTGTTTCGG
CCACCATCACCACCGTCATCCAAGAAAGTCTTCTGTCTGTACTTGTTCAATCAGGAATTCCAGGGCTAGTAAACGGACGAAGAGTAATGAGGAGCTCTGCAACGACATTC
GCGAGTTCATTAGATCGGTCGGACTTCCGGAGGATCATATACCTTCCACAAAGGAGCTTTCGCAGCATGGAAGGTTTATAAGGTTTCTAAATGCCTCTGTTACTGTTTTA
CATTTGCAAGCTGGAGTTGTAGTGACTGACCTGGCGAATATTGTCAGACGAAGGGGTCACAAACTTATACGAGAGCTTCTTGTTACTAACTCAACCAATAAAGTTGAAAG
TGATTGTGACTTGGGGAACATTACTCTGACAGGCCAGGATGGTGAGGCGAATGATGTAGTTGAAGACCTTTCTTCACTAAACAAAGTTCTGGTCTTGGAAAATCTCTCTC
ATGGTTCAAATAACCACCACACTTTTAGTTTCGATGACTGCATTTCGGCTCCTACAACTACAGGCAATTCATCAGTGGAGGAAGAGTTGTCAAATGATCTAATATTTCAT
GATGAATACAATGAATCACATAGGGAAAATAATGAGAACATTGAGACAGTTGAAGATGATACATCCATGAAAACTGAAGCCACTGCTTCAGAAGATTGTCTTATTAGTGC
AAATATCTTTTTAGGGATTATGTGTAATGACTACACTGGGGAGATAACTGAATCTTCAAAGAATTTATCAGTGGAAAATCCTGAAAATTCATTGGAATGTCAAAGTGAGG
TAACAGACAATATGGTTGATGAATCTCCCTGGTCACCTGAAGTCCTTCCTGGAGAGAATTATATGATAAATTCTACAGTTGATGAATATCTCGACATGCATGACCATGTT
GACAAACCTCTGCTTTTGATAACTGGTTCCTCCTCAAAGGAAGAGGACTTGTTTTATTCAAATGAACAGGTCGAGAAAGAAGATAACAATGTTGATGACGTCTCCTTATC
AGCCGGAATGACTATCATAGATGACCAATCTAGTGGTTTAAATATTGATAAGGCTCTTGAACCTGAAGAGTCTAGTTACAAGTTGATAAAGTATTCTGAGGAGTTATCCC
TAGCAGAGAAGGTGGCTAGGTTTATACAAAATGGAGATCTGGATATAATAGATGATAATTTCGAGTCCACATTCAGCGAGAGTGGTGCTGGAAAAGGCAATGGATCCTTT
ACAGCAGTAAATGCAGAAGAATCTGAAATAAACTTCCATGTTGAGGCATTCTCAGAAGATACCACCGCAAGTAGAGGTTCTGTGATGGCATCAAATGGGAGTGCATCTGA
ATTTGAGGATCATATGTCTACCACAACTGTCGGTCAGACAATTAGGGATGATCAACCTTCAACTGAAGCTTTGAATGGTCAAATTGACAAGGTGTTAGGTGCTGAGGCTT
TCTGTCTCTCTATGGTACAGATAAAAATATCTGAGAATCAAGTTGAGATTGACCGTCTCAAATTTTTGTTGCATCAGAAGGAGCTGGAACTGTCTCAGCTGAAGGAACAG
ATTGAGAGGGACAAGCTTGCTTTGTCTGCTTTGCAAAGCAAGGCTGCAGCAGAGATCAGCAAGGCCCAAAAGCTCATTTTGGAAAAAGATACGGAGTTAGTTGCTGCTGA
AGAGAGTCTTTCTGGATTGGAGGAGGTTCACATCCATTACGGTGGGGAAGGAGAAATTGTTGAGGTAGCTGGTAGCTTCAATGGTTGGCACAGTAGGATTAAGATGGATC
CACAGCCATCGTCCAATCCTGTGGATTCCTTTGATTCAAAGAAACCCAGGCTTTGGTCAACAGTGTTGTGGCTTTATCCTGGAGTTTATGAGATAAAATTCGTCGTTGAT
GGACACTGGAAGATTGATCCTCACAGAGAGTCCGGGACCAAGGGAGCAATAAGTAACAACATCCTCCGGGTTGGAAGATGA
mRNA sequence
CATAAACAAGAATCGGCTCACGACAAAAGCTAAGCGCTCACAAAGCTGCAAATGCTCGCTCAGAGATTTCGCGCGCGGTGAGAGACTCAGAGTTTTGCTTTTCCATGGCT
ACGCTCTCTCATTTTCCCTCATTGCTGTCTCTCTCTTCTCGGAATCTCTCCTTCCTAGACCAACTGCAAACGCAGAATCAGCACCCGAAGTTTCACTGTTTCGGCCACCA
TCACCACCGTCATCCAAGAAAGTCTTCTGTCTGTACTTGTTCAATCAGGAATTCCAGGGCTAGTAAACGGACGAAGAGTAATGAGGAGCTCTGCAACGACATTCGCGAGT
TCATTAGATCGGTCGGACTTCCGGAGGATCATATACCTTCCACAAAGGAGCTTTCGCAGCATGGAAGGTTTATAAGGTTTCTAAATGCCTCTGTTACTGTTTTACATTTG
CAAGCTGGAGTTGTAGTGACTGACCTGGCGAATATTGTCAGACGAAGGGGTCACAAACTTATACGAGAGCTTCTTGTTACTAACTCAACCAATAAAGTTGAAAGTGATTG
TGACTTGGGGAACATTACTCTGACAGGCCAGGATGGTGAGGCGAATGATGTAGTTGAAGACCTTTCTTCACTAAACAAAGTTCTGGTCTTGGAAAATCTCTCTCATGGTT
CAAATAACCACCACACTTTTAGTTTCGATGACTGCATTTCGGCTCCTACAACTACAGGCAATTCATCAGTGGAGGAAGAGTTGTCAAATGATCTAATATTTCATGATGAA
TACAATGAATCACATAGGGAAAATAATGAGAACATTGAGACAGTTGAAGATGATACATCCATGAAAACTGAAGCCACTGCTTCAGAAGATTGTCTTATTAGTGCAAATAT
CTTTTTAGGGATTATGTGTAATGACTACACTGGGGAGATAACTGAATCTTCAAAGAATTTATCAGTGGAAAATCCTGAAAATTCATTGGAATGTCAAAGTGAGGTAACAG
ACAATATGGTTGATGAATCTCCCTGGTCACCTGAAGTCCTTCCTGGAGAGAATTATATGATAAATTCTACAGTTGATGAATATCTCGACATGCATGACCATGTTGACAAA
CCTCTGCTTTTGATAACTGGTTCCTCCTCAAAGGAAGAGGACTTGTTTTATTCAAATGAACAGGTCGAGAAAGAAGATAACAATGTTGATGACGTCTCCTTATCAGCCGG
AATGACTATCATAGATGACCAATCTAGTGGTTTAAATATTGATAAGGCTCTTGAACCTGAAGAGTCTAGTTACAAGTTGATAAAGTATTCTGAGGAGTTATCCCTAGCAG
AGAAGGTGGCTAGGTTTATACAAAATGGAGATCTGGATATAATAGATGATAATTTCGAGTCCACATTCAGCGAGAGTGGTGCTGGAAAAGGCAATGGATCCTTTACAGCA
GTAAATGCAGAAGAATCTGAAATAAACTTCCATGTTGAGGCATTCTCAGAAGATACCACCGCAAGTAGAGGTTCTGTGATGGCATCAAATGGGAGTGCATCTGAATTTGA
GGATCATATGTCTACCACAACTGTCGGTCAGACAATTAGGGATGATCAACCTTCAACTGAAGCTTTGAATGGTCAAATTGACAAGGTGTTAGGTGCTGAGGCTTTCTGTC
TCTCTATGGTACAGATAAAAATATCTGAGAATCAAGTTGAGATTGACCGTCTCAAATTTTTGTTGCATCAGAAGGAGCTGGAACTGTCTCAGCTGAAGGAACAGATTGAG
AGGGACAAGCTTGCTTTGTCTGCTTTGCAAAGCAAGGCTGCAGCAGAGATCAGCAAGGCCCAAAAGCTCATTTTGGAAAAAGATACGGAGTTAGTTGCTGCTGAAGAGAG
TCTTTCTGGATTGGAGGAGGTTCACATCCATTACGGTGGGGAAGGAGAAATTGTTGAGGTAGCTGGTAGCTTCAATGGTTGGCACAGTAGGATTAAGATGGATCCACAGC
CATCGTCCAATCCTGTGGATTCCTTTGATTCAAAGAAACCCAGGCTTTGGTCAACAGTGTTGTGGCTTTATCCTGGAGTTTATGAGATAAAATTCGTCGTTGATGGACAC
TGGAAGATTGATCCTCACAGAGAGTCCGGGACCAAGGGAGCAATAAGTAACAACATCCTCCGGGTTGGAAGATGACACTATGATGGAAAATGGCATTAAGTAATTGGCGC
CTTAGTTACATTGACAAGAACAAACTACCACAAAGTCGTGCATGGTCGACTGGAATGAGCTCAGAATCCTACCAGCTTACACATGGGAGACATGAACCTTATTTCTTCTA
GAGGCTATTTCTAATTGTTGGGCGAATCGAAAGATTTCTATAGGGTTCACTTGTAATATAATCATGTACGTGCTTGTATATGGCAGCTACTTTCTTTCTAATACATTGCA
TGATTTGTTAAGCAAAATAGTGTAGAATTCATGAATTTTGCCATCAGTGAGATGTGCATACCAAGGAA
Protein sequence
MATLSHFPSLLSLSSRNLSFLDQLQTQNQHPKFHCFGHHHHRHPRKSSVCTCSIRNSRASKRTKSNEELCNDIREFIRSVGLPEDHIPSTKELSQHGRFIRFLNASVTVL
HLQAGVVVTDLANIVRRRGHKLIRELLVTNSTNKVESDCDLGNITLTGQDGEANDVVEDLSSLNKVLVLENLSHGSNNHHTFSFDDCISAPTTTGNSSVEEELSNDLIFH
DEYNESHRENNENIETVEDDTSMKTEATASEDCLISANIFLGIMCNDYTGEITESSKNLSVENPENSLECQSEVTDNMVDESPWSPEVLPGENYMINSTVDEYLDMHDHV
DKPLLLITGSSSKEEDLFYSNEQVEKEDNNVDDVSLSAGMTIIDDQSSGLNIDKALEPEESSYKLIKYSEELSLAEKVARFIQNGDLDIIDDNFESTFSESGAGKGNGSF
TAVNAEESEINFHVEAFSEDTTASRGSVMASNGSASEFEDHMSTTTVGQTIRDDQPSTEALNGQIDKVLGAEAFCLSMVQIKISENQVEIDRLKFLLHQKELELSQLKEQ
IERDKLALSALQSKAAAEISKAQKLILEKDTELVAAEESLSGLEEVHIHYGGEGEIVEVAGSFNGWHSRIKMDPQPSSNPVDSFDSKKPRLWSTVLWLYPGVYEIKFVVD
GHWKIDPHRESGTKGAISNNILRVGR