; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019167 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019167
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionChlorophyll a-b binding protein, chloroplastic
Genome locationtig00153293:282114..293705
RNA-Seq ExpressionSgr019167
SyntenySgr019167
Gene Ontology termsGO:0009416 - response to light stimulus (biological process)
GO:0009768 - photosynthesis, light harvesting in photosystem I (biological process)
GO:0018298 - protein-chromophore linkage (biological process)
GO:0005634 - nucleus (cellular component)
GO:0009522 - photosystem I (cellular component)
GO:0009523 - photosystem II (cellular component)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0016168 - chlorophyll binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR001344 - Chlorophyll A-B binding protein, plant and chromista
IPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR022796 - Chlorophyll A-B binding protein
IPR023329 - Chlorophyll a/b binding domain superfamily
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052558.1 transcription factor bHLH144 [Cucumis melo var. makuwa]4.9e-11186.48Show/hide
Query:  SDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQNNFCI
        SDL+FYLKKA  PTA QGDDN MQIPLSSAFPSVLPPGRKNLGPFN VEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKL+GPT NMCSKYIQ NFC+
Subjt:  SDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQNNFCI

Query:  DEEHYVDREISSPLMEDLDDIDALLSLEDED---LDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLKVKKM
        ++ HY DREISSPLMEDLDDIDALLSLE+E+   LDGSED+E+STARS+LNYGN+SPDSSSSS YSSKPRKN +FNP+ +SSSSGSSC SDMKQLK+KKM
Subjt:  DEEHYVDREISSPLMEDLDDIDALLSLEDED---LDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLKVKKM

Query:  VRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN
        VRKLREILPGGYQMTTV VLDEAVKYLKSLKDEVQKLGV GLEN
Subjt:  VRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN

KAG5613834.1 hypothetical protein H5410_013658 [Solanum commersonii]4.9e-11177.1Show/hide
Query:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKS--VSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRW
        A+S +  A ++ +  P   E+ GNGR++MRK+  K+   SSGSPWYGPDRVKYLGPFSGE PSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIH RW
Subjt:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKS--VSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRW

Query:  AMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPE
        AMLGALGCVFPELL+RNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSL+HAQSILAIWACQVVLMGAVEGYRIAGGPLGEV DP+YPGGSFDPLGLADDPE
Subjt:  AMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPE

Query:  AFAELKVKELKNGRLPCSPCSASSFRPSSPERVHWRTWQTTLPIQSTTMLGPMPQTLFPESE
        AFAELKVKE+KNGRL          +    ERVHWRT  TTLP Q T   G +PQT FPE++
Subjt:  AFAELKVKELKNGRLPCSPCSASSFRPSSPERVHWRTWQTTLPIQSTTMLGPMPQTLFPESE

XP_008439654.1 PREDICTED: transcription factor bHLH144 [Cucumis melo]5.8e-11285.48Show/hide
Query:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN
        + + SDL+FYLKKA  PTA QGDDN MQIPLSSAFPSVLPPGRKNLGPFN VEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKL+GPT NMCSKYIQ 
Subjt:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN

Query:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLEDED---LDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLK
        NFC+++ HY DREISSPLMEDLDDIDALLSLE+E+   LDGSED+E+STARS+LNYGN+SPDSSSSS YSSKPRKN +FNP+ +SSSSGSSC SDMKQLK
Subjt:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLEDED---LDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLK

Query:  VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN
        +KKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGV GLEN
Subjt:  VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN

XP_022142228.1 transcription factor bHLH144 [Momordica charantia]5.2e-11388.31Show/hide
Query:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN
        + + SDL FYLKKA+LPTANQGDDN MQIPLSSAFPSVLPPGRKNLGPFNAVE QPSE+CPQNFIIFDHTDNRSQIMFHPAIANKL+GPT NMCSKYIQN
Subjt:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN

Query:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLEDEDLDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKP-RKNQTFNPIQR--SSSSGSSCDSDMKQLK
        N C+++EHY DR+ISSPLM+DLD IDALLSLEDEDLD SEDEEISTARS LNYGNRSPDSSSSS YSSKP RKNQTFNPIQ+  SSS GS CDSDMKQLK
Subjt:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLEDEDLDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKP-RKNQTFNPIQR--SSSSGSSCDSDMKQLK

Query:  VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN
        VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGV GLEN
Subjt:  VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN

XP_038881999.1 transcription factor bHLH144 [Benincasa hispida]4.3e-11587.85Show/hide
Query:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN
        + + SDL+FYLK+A+ P ANQGDDN MQIPLSSAFPSVLPPGRKNLGPFN VEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKL+GPT NMCSKYIQ 
Subjt:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN

Query:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLED---EDLDGSEDEEISTARSYLNYGNRSPDSSSSSYSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLKV
        NFCI++EHY DREISSPLMEDLDDIDALLSL+D   EDLDGSEDEE STARS+LNYGN+SPDSSSSSYSSKPRKN++FNP+ +SSSSGSSCDSDMKQLKV
Subjt:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLED---EDLDGSEDEEISTARSYLNYGNRSPDSSSSSYSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLKV

Query:  KKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN
        KKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGV GLEN
Subjt:  KKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN

TrEMBL top hitse value%identityAlignment
A0A0A0KLX4 BHLH domain-containing protein3.1e-11184.27Show/hide
Query:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN
        + + SDL+FYLKKA  PTA QGDDN MQIPLSSAFPSVLPPGRKNLGPFN VEFQPSEVCPQNFIIFDHTDNRSQIMFHPA+ANKL+GPT NMCSKYIQ 
Subjt:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN

Query:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLED---EDLDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLK
        NFC++++H+ DREISSPLMEDLDDIDALLSLE+   EDLDGSED+E+STARS+LNYGN+SPDSSSSS YSSKPRKN +FNP+ +SSSSGSSC+SD+KQLK
Subjt:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLED---EDLDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLK

Query:  VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN
        +KKMVRKLREILPGGYQMTTV VLDEAVKYLKSLKDEVQKLGV GLEN
Subjt:  VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN

A0A1S3AYV1 transcription factor bHLH1442.8e-11285.48Show/hide
Query:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN
        + + SDL+FYLKKA  PTA QGDDN MQIPLSSAFPSVLPPGRKNLGPFN VEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKL+GPT NMCSKYIQ 
Subjt:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN

Query:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLEDED---LDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLK
        NFC+++ HY DREISSPLMEDLDDIDALLSLE+E+   LDGSED+E+STARS+LNYGN+SPDSSSSS YSSKPRKN +FNP+ +SSSSGSSC SDMKQLK
Subjt:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLEDED---LDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLK

Query:  VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN
        +KKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGV GLEN
Subjt:  VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN

A0A2G3A7I7 Chlorophyll a-b binding protein, chloroplastic3.8e-10969.7Show/hide
Query:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM
        A+S    A ++ +  P   E+ GNGR+SMRK+V K  SSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIH RWAM
Subjt:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM

Query:  LGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPEAF
        LGALGCVFPELL+RNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSL+HAQSILAIWACQVVLMGAVEGYRIAGGPLGEV DP+YPGGSFDPLGLADDPEAF
Subjt:  LGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPEAF

Query:  AELKVKELKNGRLPC---------SPCSASSFRPSSPERVHWRTWQTTLPIQSTTMLGPMPQTLFPESENLRIQSMKKNFCLVKPLMVVEIRILPGA
        AELKVKE+KNGRL           +  +      +  + +   T  TTL IQ TT  GP PQTL PE+E  +  ++ K  CL+     +++++  GA
Subjt:  AELKVKELKNGRLPC---------SPCSASSFRPSSPERVHWRTWQTTLPIQSTTMLGPMPQTLFPESENLRIQSMKKNFCLVKPLMVVEIRILPGA

A0A5A7UBJ4 Transcription factor bHLH1442.4e-11186.48Show/hide
Query:  SDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQNNFCI
        SDL+FYLKKA  PTA QGDDN MQIPLSSAFPSVLPPGRKNLGPFN VEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKL+GPT NMCSKYIQ NFC+
Subjt:  SDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQNNFCI

Query:  DEEHYVDREISSPLMEDLDDIDALLSLEDED---LDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLKVKKM
        ++ HY DREISSPLMEDLDDIDALLSLE+E+   LDGSED+E+STARS+LNYGN+SPDSSSSS YSSKPRKN +FNP+ +SSSSGSSC SDMKQLK+KKM
Subjt:  DEEHYVDREISSPLMEDLDDIDALLSLEDED---LDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKPRKNQTFNPIQRSSSSGSSCDSDMKQLKVKKM

Query:  VRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN
        VRKLREILPGGYQMTTV VLDEAVKYLKSLKDEVQKLGV GLEN
Subjt:  VRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN

A0A6J1CMS2 transcription factor bHLH1442.5e-11388.31Show/hide
Query:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN
        + + SDL FYLKKA+LPTANQGDDN MQIPLSSAFPSVLPPGRKNLGPFNAVE QPSE+CPQNFIIFDHTDNRSQIMFHPAIANKL+GPT NMCSKYIQN
Subjt:  NLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPAIANKLNGPTVNMCSKYIQN

Query:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLEDEDLDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKP-RKNQTFNPIQR--SSSSGSSCDSDMKQLK
        N C+++EHY DR+ISSPLM+DLD IDALLSLEDEDLD SEDEEISTARS LNYGNRSPDSSSSS YSSKP RKNQTFNPIQ+  SSS GS CDSDMKQLK
Subjt:  NFCIDEEHYVDREISSPLMEDLDDIDALLSLEDEDLDGSEDEEISTARSYLNYGNRSPDSSSSS-YSSKP-RKNQTFNPIQR--SSSSGSSCDSDMKQLK

Query:  VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN
        VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGV GLEN
Subjt:  VKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN

SwissProt top hitse value%identityAlignment
P07369 Chlorophyll a-b binding protein 3C, chloroplastic9.3e-10585.12Show/hide
Query:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSV--GKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRW
        A+S +  A ++ +  P + E+ GNGRV+MRK+    K  SSGSPWYGPDRVKYLGPFSGE PSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIH RW
Subjt:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSV--GKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRW

Query:  AMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPE
        AMLGALGCVFPELL+RNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSL+HAQSILAIWACQVVLMGAVEGYRIAGGPLGEV DP+YPGGSFDPLGLADDPE
Subjt:  AMLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPE

Query:  AFAELKVKELKNGRL
        AFAELKVKE+KNGRL
Subjt:  AFAELKVKELKNGRL

P07370 Chlorophyll a-b binding protein 1B, chloroplastic1.1e-10585.45Show/hide
Query:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM
        A+S    A ++ +  P A E+ GNGR++MRK+V KS  S SPWYGPDRVKYLGPFSGE PSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIH RWAM
Subjt:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM

Query:  LGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPEAF
        LGALGCVFPELL+RNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSL+HAQSILAIWACQVVLMGAVEGYRIAGGPLGEV DP+YPGGSFDPLGLA+DPEAF
Subjt:  LGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPEAF

Query:  AELKVKELKNGRL
        AELKVKE+KNGRL
Subjt:  AELKVKELKNGRL

P08221 Chlorophyll a-b binding protein of LHCII type I, chloroplastic (Fragment)2.1e-10991.26Show/hide
Query:  ATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAMLGALGCV
        A ++ +  P APE+QGN + +MRK+  KSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAMLGALGCV
Subjt:  ATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAMLGALGCV

Query:  FPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPEAFAELKVKE
        FPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSL+HAQSILAIWACQVVLMGAVEGYRIAGGPLGEV+DPIYPGGSFDPLGLADDPEAFAELKVKE
Subjt:  FPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPEAFAELKVKE

Query:  LKNGRL
        LKNGRL
Subjt:  LKNGRL

P09756 Chlorophyll a-b binding protein 3, chloroplastic7.6e-10789.2Show/hide
Query:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM
        A+S    A ++ +  P APE+   GRVSMRK+V K VSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM
Subjt:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM

Query:  LGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPEAF
        LGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWA QV+LMGAVEGYRIAGGPLGEV+DPIYPGGSFDPLGLADDPEAF
Subjt:  LGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPEAF

Query:  AELKVKELKNGRL
        AELKVKELKNGRL
Subjt:  AELKVKELKNGRL

P27493 Chlorophyll a-b binding protein 21, chloroplastic7.6e-10785.92Show/hide
Query:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM
        A+S    A ++ +  P APE+ GNGRVSMRK+V K V+S SPWYGPDRVKYLGPFSGE PSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIH RWAM
Subjt:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM

Query:  LGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPEAF
        LGALGCVFPELL+RNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSL+HAQSILAIWACQV+LMGAVEGYR+AGGPLGEV DP+YPGGSFDPLGLA+DPEAF
Subjt:  LGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPEAF

Query:  AELKVKELKNGRL
        AELKVKE+KNGRL
Subjt:  AELKVKELKNGRL

Arabidopsis top hitse value%identityAlignment
AT1G29910.1 chlorophyll A/B binding protein 32.7e-9981.86Show/hide
Query:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVS-SGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWA
        A+S    A ++    P A E+ G+GRV+MRK+V K    SGSPWYG DRVKYLGPFSGE PSYLTGEFPGDYGWDTAGLSADPETFA+NRELEVIHSRWA
Subjt:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVS-SGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWA

Query:  MLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAG-GPLGEVSDPIYPGGSFDPLGLADDPE
        MLGALGCVFPELL+RNGVKFGEAVWFKAGSQIFS+GGLDYLGNPSL+HAQSILAIWA QV+LMGAVEGYR+AG GPLGE  D +YPGGSFDPLGLA DPE
Subjt:  MLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAG-GPLGEVSDPIYPGGSFDPLGLADDPE

Query:  AFAELKVKELKNGRL
        AFAELKVKELKNGRL
Subjt:  AFAELKVKELKNGRL

AT1G29920.1 chlorophyll A/B-binding protein 22.7e-9981.86Show/hide
Query:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVS-SGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWA
        A+S    A ++    P A E+ G+GRV+MRK+V K    SGSPWYG DRVKYLGPFSGE PSYLTGEFPGDYGWDTAGLSADPETFA+NRELEVIHSRWA
Subjt:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVS-SGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWA

Query:  MLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAG-GPLGEVSDPIYPGGSFDPLGLADDPE
        MLGALGCVFPELL+RNGVKFGEAVWFKAGSQIFS+GGLDYLGNPSL+HAQSILAIWA QV+LMGAVEGYR+AG GPLGE  D +YPGGSFDPLGLA DPE
Subjt:  MLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAG-GPLGEVSDPIYPGGSFDPLGLADDPE

Query:  AFAELKVKELKNGRL
        AFAELKVKELKNGRL
Subjt:  AFAELKVKELKNGRL

AT1G29930.1 chlorophyll A/B binding protein 12.1e-9981.86Show/hide
Query:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVS-SGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWA
        A+S    A ++ +  P A E+ G+GRV+MRK+V K    SGSPWYG DRVKYLGPFSGE PSYLTGEFPGDYGWDTAGLSADPETFA+NRELEVIHSRWA
Subjt:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVS-SGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWA

Query:  MLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAG-GPLGEVSDPIYPGGSFDPLGLADDPE
        MLGALGCVFPELL+RNGVKFGEAVWFKAGSQIFS+GGLDYLGNPSL+HAQSILAIWA QV+LMGAVEGYR+AG GPLGE  D +YPGGSFDPLGLA DPE
Subjt:  MLGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAG-GPLGEVSDPIYPGGSFDPLGLADDPE

Query:  AFAELKVKELKNGRL
        AFAELKVKELKNGRL
Subjt:  AFAELKVKELKNGRL

AT2G34420.1 photosystem II light harvesting complex gene B1B22.7e-9986.5Show/hide
Query:  PLAPELQGNGRVSMRKSVGKSVS-SGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAMLGALGCVFPELLSR
        P A ++ G+GRV+MRK+V K    SGSPWYG DRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFA+NRELEVIHSRWAMLGALGCVFPELL+R
Subjt:  PLAPELQGNGRVSMRKSVGKSVS-SGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAMLGALGCVFPELLSR

Query:  NGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAG-GPLGEVSDPIYPGGSFDPLGLADDPEAFAELKVKELKNGRL
        NGVKFGEAVWFKAGSQIFS+GGLDYLGNPSL+HAQSILAIWA QV+LMGAVEGYR+AG GPLGE  D +YPGGSFDPLGLA DPEAFAELKVKELKNGRL
Subjt:  NGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAG-GPLGEVSDPIYPGGSFDPLGLADDPEAFAELKVKELKNGRL

AT2G34430.1 light-harvesting chlorophyll-protein complex II subunit B12.7e-9980.84Show/hide
Query:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM
        A+S      ++ +  P A E+ G GR++MRK+   +  SGSPWYG DRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFA+NRELEVIHSRWAM
Subjt:  AVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAM

Query:  LGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAG-GPLGEVSDPIYPGGSFDPLGLADDPEA
        LGALGCVFPELL+RNGVKFGEAVWFKAGSQIFS+GGLDYLGNPSL+HAQSILAIWA QV+LMGAVEGYR+AG GPLGE  D +YPGGSFDPLGLA DPEA
Subjt:  LGALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAG-GPLGEVSDPIYPGGSFDPLGLADDPEA

Query:  FAELKVKELKNGRL
        FAELKVKELKNGRL
Subjt:  FAELKVKELKNGRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTGCCTAAGATTTTTGGAGCAGTAAGTGGCACTGAGGAAGCCACGCGGAGCTGTGAAGCTCTCCCCCTCGCCCCTGAGCTTCAGGGCAATGGAAGAGTCAGCAT
GAGGAAGTCTGTCGGCAAGTCAGTTTCCTCCGGCAGCCCATGGTACGGCCCCGACCGTGTCAAGTACTTGGGTCCATTCTCCGGCGAGCCACCATCCTACCTCACCGGAG
AATTCCCCGGTGACTACGGCTGGGACACTGCTGGCCTTTCAGCCGATCCCGAGACCTTCGCCAAGAACCGTGAGCTCGAGGTGATCCACTCCAGATGGGCCATGCTTGGA
GCTCTGGGGTGTGTCTTCCCCGAGCTTCTGTCCCGCAATGGCGTGAAGTTCGGCGAGGCGGTGTGGTTCAAGGCTGGATCGCAGATCTTCAGCGAGGGTGGTCTCGACTA
CTTGGGCAACCCCAGCTTGATCCATGCACAGAGCATCTTGGCCATCTGGGCCTGTCAGGTCGTGTTGATGGGCGCCGTCGAGGGCTACCGTATTGCCGGCGGTCCACTCG
GCGAGGTCAGCGACCCCATCTACCCAGGTGGAAGCTTCGACCCACTGGGTCTGGCCGACGACCCAGAGGCCTTCGCTGAACTGAAGGTGAAGGAGCTCAAGAATGGAAGG
TTGCCATGTTCTCCATGTTCGGCTTCTTCGTTCAGGCCATCGTCACCGGAAAGGGTCCATTGGAGAACTTGGCAGACCACCTTGCCGATCCAGTCAACAACAATGCTTGG
GCCTATGCCACAAACTTTGTTCCCGGAAAGTGAGAATCTTCGGATTCAGAGTATGAAAAAGAACTTTTGTTTGGTTAAACCATTGATGGTGGTGGAAATTCGGATATTAC
CTGGAGCCAAATATCCAATAGTGCCTTCGAAAGCTGATGAAGAGGAAATGCTGCTTGCGTCTTGGCTTTGCACCCCCAAAACTCTAGCTGTTCCAAAATCACTCACATGG
GCTCTTGAGTCTCCATCCAGAAGAATGTTTGAAGAAAGGTCTAAAGAGTACAAGTGCTCCAGATTTGCCAATTCTTCTGGGATTTCACCAGCTATTTTGTTCCGTGAAAG
GTTCAAGGTTGTGAGCATATTCATTTCAATAAAAGCTTTACCTGGAAGCATACCAGAAAGATTATTTCCTGACAGATCAAGAAAGAACAAATTTCTACAACCTCCAATTG
TTGCGGGAATAATCCCAGTGAGATTGTTGTTTGAGAAGTCAATGGATTGTATCATTTGCAACAACCCAAGCTCAGTTGGAATATCTCCTACCAGGAAATTGTACGAGAAG
TTCATGCATAGCTGCATATTTTTCATACCTGAGATTAAAACCCCAGGGATTGATCCAGAAAGATGGTCTACGACATTAAGAGATGAGCAATTAAAGAAGTCATCTGGGAT
TTCACCAGAAATCCTATTTGATCCAAGGAAGAGGGAAGTAAGATTCTGCAACTTCCCAAATCCCAGAGATGGAATTGAACCCACAAGGAGATTGCCGCTCAATGTCAATC
TCCTCAGGTCCGTCCAATCTGAAAGTGCTCCCAGTGGGTCGAAATGGATGGAACTCTTGAAGGCCTTCAAGGCCTCGATCGGAACAGGATACAGCAATTGGTTGTGTGTG
AATCCACCATACTGCGGACGAAACGCGGTATTCAGTGAACCTGTGTGTGAGAACGGAGAGTGGAAGCAATTTGCAAGAAGAGGCCAAAATCTTATCGCGAGTGACCTCGA
CTTTTACCTCAAAAAGGCAGTGCTCCCAACAGCAAACCAAGGGGATGACAATCGCATGCAGATTCCTCTGTCATCTGCCTTTCCTTCTGTTTTACCTCCTGGGAGAAAGA
ACTTGGGGCCTTTCAATGCTGTTGAATTTCAACCTTCTGAAGTTTGTCCCCAAAATTTCATCATCTTTGACCACACTGATAATCGAAGCCAAATTATGTTCCATCCTGCA
ATTGCCAACAAACTTAATGGTCCTACTGTGAATATGTGCTCGAAGTACATCCAAAATAATTTTTGCATCGATGAAGAGCATTATGTGGATAGAGAAATATCATCTCCTTT
GATGGAGGATTTGGATGATATTGATGCATTATTGAGCTTGGAAGACGAAGACCTTGATGGATCTGAAGATGAAGAGATCAGCACTGCAAGGTCTTATTTGAATTATGGGA
ATAGATCTCCTGATTCCTCATCCTCCTCTTATAGTTCAAAACCCAGGAAGAATCAAACATTTAATCCTATCCAAAGGTCGTCAAGCAGCGGAAGCAGCTGTGACAGTGAT
ATGAAACAGTTGAAAGTGAAGAAAATGGTGAGAAAACTGAGAGAGATTCTCCCTGGCGGTTACCAAATGACAACGGTCACCGTACTCGACGAAGCTGTTAAATACCTGAA
ATCCCTCAAGGACGAAGTGCAGAAGCTCGGAGTCCGGGGCTTGGAGAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTGCCTAAGATTTTTGGAGCAGTAAGTGGCACTGAGGAAGCCACGCGGAGCTGTGAAGCTCTCCCCCTCGCCCCTGAGCTTCAGGGCAATGGAAGAGTCAGCAT
GAGGAAGTCTGTCGGCAAGTCAGTTTCCTCCGGCAGCCCATGGTACGGCCCCGACCGTGTCAAGTACTTGGGTCCATTCTCCGGCGAGCCACCATCCTACCTCACCGGAG
AATTCCCCGGTGACTACGGCTGGGACACTGCTGGCCTTTCAGCCGATCCCGAGACCTTCGCCAAGAACCGTGAGCTCGAGGTGATCCACTCCAGATGGGCCATGCTTGGA
GCTCTGGGGTGTGTCTTCCCCGAGCTTCTGTCCCGCAATGGCGTGAAGTTCGGCGAGGCGGTGTGGTTCAAGGCTGGATCGCAGATCTTCAGCGAGGGTGGTCTCGACTA
CTTGGGCAACCCCAGCTTGATCCATGCACAGAGCATCTTGGCCATCTGGGCCTGTCAGGTCGTGTTGATGGGCGCCGTCGAGGGCTACCGTATTGCCGGCGGTCCACTCG
GCGAGGTCAGCGACCCCATCTACCCAGGTGGAAGCTTCGACCCACTGGGTCTGGCCGACGACCCAGAGGCCTTCGCTGAACTGAAGGTGAAGGAGCTCAAGAATGGAAGG
TTGCCATGTTCTCCATGTTCGGCTTCTTCGTTCAGGCCATCGTCACCGGAAAGGGTCCATTGGAGAACTTGGCAGACCACCTTGCCGATCCAGTCAACAACAATGCTTGG
GCCTATGCCACAAACTTTGTTCCCGGAAAGTGAGAATCTTCGGATTCAGAGTATGAAAAAGAACTTTTGTTTGGTTAAACCATTGATGGTGGTGGAAATTCGGATATTAC
CTGGAGCCAAATATCCAATAGTGCCTTCGAAAGCTGATGAAGAGGAAATGCTGCTTGCGTCTTGGCTTTGCACCCCCAAAACTCTAGCTGTTCCAAAATCACTCACATGG
GCTCTTGAGTCTCCATCCAGAAGAATGTTTGAAGAAAGGTCTAAAGAGTACAAGTGCTCCAGATTTGCCAATTCTTCTGGGATTTCACCAGCTATTTTGTTCCGTGAAAG
GTTCAAGGTTGTGAGCATATTCATTTCAATAAAAGCTTTACCTGGAAGCATACCAGAAAGATTATTTCCTGACAGATCAAGAAAGAACAAATTTCTACAACCTCCAATTG
TTGCGGGAATAATCCCAGTGAGATTGTTGTTTGAGAAGTCAATGGATTGTATCATTTGCAACAACCCAAGCTCAGTTGGAATATCTCCTACCAGGAAATTGTACGAGAAG
TTCATGCATAGCTGCATATTTTTCATACCTGAGATTAAAACCCCAGGGATTGATCCAGAAAGATGGTCTACGACATTAAGAGATGAGCAATTAAAGAAGTCATCTGGGAT
TTCACCAGAAATCCTATTTGATCCAAGGAAGAGGGAAGTAAGATTCTGCAACTTCCCAAATCCCAGAGATGGAATTGAACCCACAAGGAGATTGCCGCTCAATGTCAATC
TCCTCAGGTCCGTCCAATCTGAAAGTGCTCCCAGTGGGTCGAAATGGATGGAACTCTTGAAGGCCTTCAAGGCCTCGATCGGAACAGGATACAGCAATTGGTTGTGTGTG
AATCCACCATACTGCGGACGAAACGCGGTATTCAGTGAACCTGTGTGTGAGAACGGAGAGTGGAAGCAATTTGCAAGAAGAGGCCAAAATCTTATCGCGAGTGACCTCGA
CTTTTACCTCAAAAAGGCAGTGCTCCCAACAGCAAACCAAGGGGATGACAATCGCATGCAGATTCCTCTGTCATCTGCCTTTCCTTCTGTTTTACCTCCTGGGAGAAAGA
ACTTGGGGCCTTTCAATGCTGTTGAATTTCAACCTTCTGAAGTTTGTCCCCAAAATTTCATCATCTTTGACCACACTGATAATCGAAGCCAAATTATGTTCCATCCTGCA
ATTGCCAACAAACTTAATGGTCCTACTGTGAATATGTGCTCGAAGTACATCCAAAATAATTTTTGCATCGATGAAGAGCATTATGTGGATAGAGAAATATCATCTCCTTT
GATGGAGGATTTGGATGATATTGATGCATTATTGAGCTTGGAAGACGAAGACCTTGATGGATCTGAAGATGAAGAGATCAGCACTGCAAGGTCTTATTTGAATTATGGGA
ATAGATCTCCTGATTCCTCATCCTCCTCTTATAGTTCAAAACCCAGGAAGAATCAAACATTTAATCCTATCCAAAGGTCGTCAAGCAGCGGAAGCAGCTGTGACAGTGAT
ATGAAACAGTTGAAAGTGAAGAAAATGGTGAGAAAACTGAGAGAGATTCTCCCTGGCGGTTACCAAATGACAACGGTCACCGTACTCGACGAAGCTGTTAAATACCTGAA
ATCCCTCAAGGACGAAGTGCAGAAGCTCGGAGTCCGGGGCTTGGAGAACTAG
Protein sequenceShow/hide protein sequence
MELPKIFGAVSGTEEATRSCEALPLAPELQGNGRVSMRKSVGKSVSSGSPWYGPDRVKYLGPFSGEPPSYLTGEFPGDYGWDTAGLSADPETFAKNRELEVIHSRWAMLG
ALGCVFPELLSRNGVKFGEAVWFKAGSQIFSEGGLDYLGNPSLIHAQSILAIWACQVVLMGAVEGYRIAGGPLGEVSDPIYPGGSFDPLGLADDPEAFAELKVKELKNGR
LPCSPCSASSFRPSSPERVHWRTWQTTLPIQSTTMLGPMPQTLFPESENLRIQSMKKNFCLVKPLMVVEIRILPGAKYPIVPSKADEEEMLLASWLCTPKTLAVPKSLTW
ALESPSRRMFEERSKEYKCSRFANSSGISPAILFRERFKVVSIFISIKALPGSIPERLFPDRSRKNKFLQPPIVAGIIPVRLLFEKSMDCIICNNPSSVGISPTRKLYEK
FMHSCIFFIPEIKTPGIDPERWSTTLRDEQLKKSSGISPEILFDPRKREVRFCNFPNPRDGIEPTRRLPLNVNLLRSVQSESAPSGSKWMELLKAFKASIGTGYSNWLCV
NPPYCGRNAVFSEPVCENGEWKQFARRGQNLIASDLDFYLKKAVLPTANQGDDNRMQIPLSSAFPSVLPPGRKNLGPFNAVEFQPSEVCPQNFIIFDHTDNRSQIMFHPA
IANKLNGPTVNMCSKYIQNNFCIDEEHYVDREISSPLMEDLDDIDALLSLEDEDLDGSEDEEISTARSYLNYGNRSPDSSSSSYSSKPRKNQTFNPIQRSSSSGSSCDSD
MKQLKVKKMVRKLREILPGGYQMTTVTVLDEAVKYLKSLKDEVQKLGVRGLEN