; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh01G006840 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh01G006840
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Descriptionprotein CHUP1, chloroplastic
Genome locationCma_Chr01:3574303..3576753
RNA-Seq ExpressionCmaCh01G006840
SyntenyCmaCh01G006840
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607325.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]6.3e-21396.76Show/hide
Query:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN
        MPMEEDEELAMEI ALKRELEISLQKSNFLEKENQELKQELARFKSH+QSLK HNNDRKSILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSN
Subjt:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN

Query:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
        WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKA NGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
Subjt:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN

Query:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
        RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLE EVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Subjt:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN

Query:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR
        VERTREFNC KYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE+QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL L+QR
Subjt:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR

Query:  K
        K
Subjt:  K

KAG7037002.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]6.6e-19490.12Show/hide
Query:  MEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSNWA
        MEEDEELAMEI ALKRELEISLQKSNFLEKENQELKQELARFKSH+QSLK HNNDRKSILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSNWA
Subjt:  MEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSNWA

Query:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
        VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKA NGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
Subjt:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL

Query:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAE-----VCSFRENPKEETNAMLKRAQALQ-DRLEQ
        IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLE E     V       K+      +  + L+  RLEQ
Subjt:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAE-----VCSFRENPKEETNAMLKRAQALQ-DRLEQ

Query:  SVSNVERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLL
        SVSNVERTREFNC KYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE+QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL 
Subjt:  SVSNVERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLL

Query:  LSQRK
        L+QRK
Subjt:  LSQRK

XP_022948306.1 protein CHUP1, chloroplastic [Cucurbita moschata]3.5e-21196.26Show/hide
Query:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN
        MPMEEDEELAMEI ALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKSILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSN
Subjt:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN

Query:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
        WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKA NGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
Subjt:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN

Query:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
        RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLE EVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Subjt:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN

Query:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR
        VERTREFNC KYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL L+QR
Subjt:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR

Query:  K
        K
Subjt:  K

XP_022998607.1 protein CHUP1, chloroplastic [Cucurbita maxima]9.7e-222100Show/hide
Query:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN
        MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN
Subjt:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN

Query:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
        WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
Subjt:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN

Query:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
        RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Subjt:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN

Query:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR
        VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR
Subjt:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR

Query:  K
        K
Subjt:  K

XP_023523072.1 protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]1.4e-21798Show/hide
Query:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN
        MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSH+QSLK HNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN
Subjt:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN

Query:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
        WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKA NGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
Subjt:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN

Query:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
        RLIREVEAAAPRDIAEVERFVKWLDGEL SLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Subjt:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN

Query:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR
        VERTREFNC KYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE+QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL L+QR
Subjt:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR

Query:  K
        K
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A0A0LVK7 Uncharacterized protein6.2e-16676.67Show/hide
Query:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPP--ATDKWETTRTQKQ
        MP EEDE LAMEI+ LK+ELEISLQKS FLEKENQEL+QEL R +S +QS K  NN+RKSILWKKFH+S+D++VAG DS P SP   A DK E+T++ KQ
Subjt:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPP--ATDKWETTRTQKQ

Query:  SNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        S+W  VKE+ RM     +P PPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK  +GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+F
Subjt:  SNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSV
        VN LI+EVE  APRDI+EVERFVKWLDG+LASLVDERAVLK+FPRWPE KADALREAAFSY+DLK LE++VC FR+NPKEE N +LKRAQALQDR+EQSV
Subjt:  VNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSV

Query:  SNVERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLS
        SN+ERTREFNC KY  FQIPCQWM DS LP Q+K+S+LRL KE M RIT+ELQ  ETPQ ENLFLQG RFAYRVHQYAGGFDSE I AFEG+K+ G L S
Subjt:  SNVERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLS

Query:  QRK
        QRK
Subjt:  QRK

A0A1S3C4V9 protein CHUP1, chloroplastic isoform X11.3e-16376.67Show/hide
Query:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSP--PATDKWETTRTQKQ
        MP E+DEELAMEID LK++LEISLQKS FLE+ENQEL+ EL R KS +QSLK  NN+RKSILWKKFH+SMD+AVAG DS P +P   A DK E T+  KQ
Subjt:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSP--PATDKWETTRTQKQ

Query:  SNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        S+W  VKE+QRM A   +  PPPPPPLP KLLGGSKAVRRVPEVL+LYR +TKRDAQKENK  +GG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSV
        VN LI+EVE  APRDI+E E+FVKWLD +LASLVDERAVLKHFPRWPE KADALREAAFSY+DLKSLE++VC FR+NPKEE N +LKRAQALQDR+EQSV
Subjt:  VNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSV

Query:  SNVERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLS
        SN+ERTREFNC KY  FQIPCQWM DS LP Q+KLS+LRL KE M RIT+EL+  ET Q ENLFLQGVRFAYRVHQYAGGFDSEAI AFEG+K+ G L S
Subjt:  SNVERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLS

Query:  QRK
        QRK
Subjt:  QRK

A0A6J1DC83 protein CHUP1, chloroplastic4.0e-16577.23Show/hide
Query:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQ-KQS
        MP EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS+LWKKF+NSMD          +SPPATDK E T++  KQ 
Subjt:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQ-KQS

Query:  NWAVVKENQRMAAAAPTPAP-PPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
         W  VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEF
Subjt:  NWAVVKENQRMAAAAPTPAP-PPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSV
        VN LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE+EVCSFR+NPKEE   +LKRAQALQDRLEQSV
Subjt:  VNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSV

Query:  SNVERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLL
        SNVE+TREF+CNKY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+ELQ ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VG L 
Subjt:  SNVERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLL

Query:  SQRK
        SQRK
Subjt:  SQRK

A0A6J1G8X0 protein CHUP1, chloroplastic1.7e-21196.26Show/hide
Query:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN
        MPMEEDEELAMEI ALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKSILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSN
Subjt:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN

Query:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
        WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKA NGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
Subjt:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN

Query:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
        RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLE EVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Subjt:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN

Query:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR
        VERTREFNC KYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL L+QR
Subjt:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR

Query:  K
        K
Subjt:  K

A0A6J1K8G4 protein CHUP1, chloroplastic4.7e-222100Show/hide
Query:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN
        MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN
Subjt:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN

Query:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
        WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
Subjt:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN

Query:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
        RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Subjt:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN

Query:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR
        VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR
Subjt:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQR

Query:  K
        K
Subjt:  K

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic1.3e-6448.43Show/hide
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+   L +++EQSV  + RTR+  
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN

Query:  CNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
         ++Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  EL      ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

Arabidopsis top hitse value%identityAlignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown3.9e-9649.36Show/hide
Query:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN
        +P  ED+    ++  L +EL+  L +++ LEKEN EL+QE+AR ++ + +LK H N+RKS+LWKK  +S D   + TD S    P + K   T+ Q+  N
Subjt:  MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSN

Query:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
                 +   +    PPPPPPLP+K   G ++VRR PEV+E YR +TKR++   NK    G  + AF +NMIGEIENRS YLS IKS+ + H + ++
Subjt:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN

Query:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
         LI +VEAA   DI+EVE FVKW+D EL+SLVDERAVLKHFP+WPE K D+LREAA +YK  K+L  E+ SF++NPK+     L+R Q+LQDRLE+SV+N
Subjt:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN

Query:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQV
         E+ R+    +Y  FQIP +WMLD+GL  Q+K SSLRL +E M+RI KEL+ N + +  NL LQGVRFAY +HQ+AGGFD E +  F  +K++
Subjt:  VERTREFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQV

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein9.5e-6648.43Show/hide
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+   L +++EQSV  + RTR+  
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN

Query:  CNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
         ++Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  EL      ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein9.5e-6648.43Show/hide
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+   L +++EQSV  + RTR+  
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN

Query:  CNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
         ++Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  EL      ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein9.5e-6648.43Show/hide
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+   L +++EQSV  + RTR+  
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN

Query:  CNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
         ++Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  EL      ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.0e-7250.87Show/hide
Query:  AAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
        + +  P PPPPPP P  L   S  VRRVPEV+E Y  + +RD+    + + GG  A A         ++MIGEIENRS YL AIK++VET G+F+  LI+
Subjt:  AAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR

Query:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERT
        EV  AA  DI +V  FVKWLD EL+ LVDERAVLKHF  WPE KADALREAAF Y DLK L +E   FRE+P++ +++ LK+ QAL ++LE  V ++ R 
Subjt:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERT

Query:  REFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNE--TPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        RE    K+  FQIP  WML++G+ +Q+KL+S++L  + M+R++ EL+  E   P+ E L +QGVRFA+RVHQ+AGGFD+E + AFE ++
Subjt:  REFNCNKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNE--TPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAATGGAAGAAGATGAAGAATTGGCCATGGAGATCGACGCCTTGAAAAGGGAATTGGAAATTTCACTGCAGAAATCGAATTTTCTCGAGAAAGAAAATCAA
GAACTCAAACAAGAATTGGCTCGATTCAAATCCCACCTTCAGTCTCTGAAGCCTCATAATAATGACAGAAAATCCATTCTTTGGAAGAAATTTCACAATTCCATG
GATGTCGCCGTCGCCGGAACTGACTCGTCACCACAGAGTCCGCCGGCGACTGACAAATGGGAGACTACCAGAACGCAGAAACAGAGTAATTGGGCTGTTGTGAAA
GAGAATCAGAGAATGGCGGCGGCGGCACCGACCCCGGCTCCTCCACCGCCGCCGCCACTTCCGACGAAGCTGCTCGGGGGATCTAAGGCAGTGCGGCGAGTCCCG
GAAGTGTTGGAGCTGTACCGTTTAGTGACGAAAAGGGATGCCCAGAAGGAAAATAAGGCCACAAACGGAGGATTTCCGGCGGTGGCGTTCACCAAAAACATGATC
GGCGAAATCGAAAACCGATCAGCCTATCTCTCAGCGATAAAATCGGAGGTGGAGACACATGGGGAGTTCGTGAACCGGCTGATCAGAGAAGTGGAAGCGGCAGCG
CCAAGAGACATAGCAGAGGTGGAGAGGTTCGTGAAGTGGCTAGACGGGGAGCTGGCATCGCTCGTGGACGAGAGGGCGGTGCTCAAGCACTTTCCACGGTGGCCG
GAGGGGAAGGCAGACGCACTGCGGGAGGCGGCATTCAGCTACAAGGATCTGAAGAGCTTGGAAGCTGAAGTGTGTTCGTTTAGAGAGAATCCAAAGGAGGAGACG
AATGCTATGTTGAAGAGGGCTCAGGCCTTGCAAGACAGGTTGGAGCAGAGCGTGAGCAATGTGGAGAGGACGAGGGAGTTCAACTGTAACAAGTACAACAAGTTT
CAAATCCCTTGCCAATGGATGCTGGACTCTGGCTTGCCAGCCCAGATGAAGCTGAGCTCATTGAGGCTAGTGAAGGAATGCATGCGTAGGATAACAAAAGAGCTA
CAATTGAACGAAACACCACAAACAGAAAACCTTTTTCTTCAAGGGGTTCGCTTTGCTTACAGGGTGCACCAGTATGCAGGTGGTTTTGATTCGGAAGCTATAGTG
GCTTTTGAAGGAATGAAGCAAGTTGGGCTGCTGCTTAGTCAAAGAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGCCAAAAAGGGAAGCAAATTATTGTTGGTTGGTTGGATTCTCCTTCAGCCCATCTCTAAGAAAGACCACAAAATCAGAGGAGAATGCCAATGGAAGAAGA
TGAAGAATTGGCCATGGAGATCGACGCCTTGAAAAGGGAATTGGAAATTTCACTGCAGAAATCGAATTTTCTCGAGAAAGAAAATCAAGAACTCAAACAAGAATT
GGCTCGATTCAAATCCCACCTTCAGTCTCTGAAGCCTCATAATAATGACAGAAAATCCATTCTTTGGAAGAAATTTCACAATTCCATGGATGTCGCCGTCGCCGG
AACTGACTCGTCACCACAGAGTCCGCCGGCGACTGACAAATGGGAGACTACCAGAACGCAGAAACAGAGTAATTGGGCTGTTGTGAAAGAGAATCAGAGAATGGC
GGCGGCGGCACCGACCCCGGCTCCTCCACCGCCGCCGCCACTTCCGACGAAGCTGCTCGGGGGATCTAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCTGTA
CCGTTTAGTGACGAAAAGGGATGCCCAGAAGGAAAATAAGGCCACAAACGGAGGATTTCCGGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATCGAAAACCG
ATCAGCCTATCTCTCAGCGATAAAATCGGAGGTGGAGACACATGGGGAGTTCGTGAACCGGCTGATCAGAGAAGTGGAAGCGGCAGCGCCAAGAGACATAGCAGA
GGTGGAGAGGTTCGTGAAGTGGCTAGACGGGGAGCTGGCATCGCTCGTGGACGAGAGGGCGGTGCTCAAGCACTTTCCACGGTGGCCGGAGGGGAAGGCAGACGC
ACTGCGGGAGGCGGCATTCAGCTACAAGGATCTGAAGAGCTTGGAAGCTGAAGTGTGTTCGTTTAGAGAGAATCCAAAGGAGGAGACGAATGCTATGTTGAAGAG
GGCTCAGGCCTTGCAAGACAGGTTGGAGCAGAGCGTGAGCAATGTGGAGAGGACGAGGGAGTTCAACTGTAACAAGTACAACAAGTTTCAAATCCCTTGCCAATG
GATGCTGGACTCTGGCTTGCCAGCCCAGATGAAGCTGAGCTCATTGAGGCTAGTGAAGGAATGCATGCGTAGGATAACAAAAGAGCTACAATTGAACGAAACACC
ACAAACAGAAAACCTTTTTCTTCAAGGGGTTCGCTTTGCTTACAGGGTGCACCAGTATGCAGGTGGTTTTGATTCGGAAGCTATAGTGGCTTTTGAAGGAATGAA
GCAAGTTGGGCTGCTGCTTAGTCAAAGAAAATAGGGTTCTTTTGGCGAAAAGTTATAGGTAAGAATCAACATTGCAGCAGACCACATTCAAAAAAGGATGTAATA
TGAATGATTGAATGGGAAGTTTCTATACATAATCAATCCTATATGCTTATTGCAACTTA
Protein sequenceShow/hide protein sequence
MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKSILWKKFHNSMDVAVAGTDSSPQSPPATDKWETTRTQKQSNWAVVK
ENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAA
PRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFNCNKYNKF
QIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKELQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLLLSQRK