CuGenDBv2

Carg07057 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg07057
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: protein CHUP1, chloroplastic
Genome location: Carg_Chr01:3651136..3653488
RNA-Seq Expression: Carg07057
Synteny: Carg07057
Gene Ontology terms:
    GO:0009658 - chloroplast organization (biological process)
    GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domains: IPR040265 - Protein CHUP1-like
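
The genome location field follows a `sequence:start..end` pattern. As a minimal sketch, the snippet below splits that string into its parts and reports the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the names `Location` and `parse_location` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates ASSUMED to be 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(seqid, start, end)


if __name__ == "__main__":
    loc = parse_location("Carg_Chr01:3651136..3653488")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span is longer than the CDS listed under Sequences, which would be consistent with the gene containing intronic sequence.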


Homology
GenBank top hits    e value    % identity    Alignment
KAG6607325.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]    1.0e-202    93.3
Query:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV
        MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV
Subjt:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV

Query:  KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
        KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
Subjt:  KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR

Query:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSV
        EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGE     V       K+      +  + L+  RLEQSV
Subjt:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSV

Query:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
        SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
Subjt:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN

Query:  QRK
        QRK
Subjt:  QRK

KAG7037002.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]    7.2e-225    100
Query:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV
        MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV
Subjt:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV

Query:  KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
        KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
Subjt:  KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR

Query:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSV
        EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSV
Subjt:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSV

Query:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
        SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
Subjt:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN

Query:  QRK
        QRK
Subjt:  QRK

XP_022948306.1 protein CHUP1, chloroplastic [Cucurbita moschata]    8.0e-200    92.06
Query:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV
        MEEDEELAMEIHALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV
Subjt:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV

Query:  KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
        KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
Subjt:  KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR

Query:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSV
        EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGE     V       K+      +  + L+  RLEQSV
Subjt:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSV

Query:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
        SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
Subjt:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN

Query:  QRK
        QRK
Subjt:  QRK

XP_022998607.1 protein CHUP1, chloroplastic [Cucurbita maxima]    1.7e-194    90.12
Query:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSNWA
        MEEDEELAMEI ALKRELEISLQKSNFLEKENQELKQELARFKSH+QSLK HNNDRKSILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSNWA
Subjt:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSNWA

Query:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
        VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKA NGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
Subjt:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL

Query:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQ
        IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLE E     V       K+      +  + L+  RLEQ
Subjt:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQ

Query:  SVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQ
        SVSNVERTREFNC KYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE+QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL 
Subjt:  SVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQ

Query:  LNQRK
        L+QRK
Subjt:  LNQRK

XP_023523072.1 protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]    3.7e-197    90.86
Query:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSNWA
        MEEDEELAMEI ALKRELEISLQKSNFLEKENQELKQELARFKSH+QSLK HNNDRKSILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSNWA
Subjt:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSNWA

Query:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
        VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
Subjt:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL

Query:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQ
        IREVEAAAPRDIAEVERFVKWLDGEL SLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLE E     V       K+      +  + L+  RLEQ
Subjt:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQ

Query:  SVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQ
        SVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE+QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQ
Subjt:  SVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQ

Query:  LNQRK
        LNQRK
Subjt:  LNQRK

TrEMBL top hits    e value    % identity    Alignment
A0A0A0LVK7 Uncharacterized protein    3.1e-149    71.92
Query:  EEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMD--VAGNDSTPQSPP--ATDKWETTRTQKQSNW
        EEDE LAMEI+ LK+ELEISLQKS FLEKENQEL+QEL R +S +QS K  NN+RKSILWKKFH+S+D  VAG DS P SP   A DK E+T++ KQS+W
Subjt:  EEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMD--VAGNDSTPQSPP--ATDKWETTRTQKQSNW

Query:  AVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNR
          VKE+ RM     +P PPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK A+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+FVN 
Subjt:  AVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNR

Query:  LIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLE
        LI+EVE  APRDI+EVERFVKWLDG+LASLVDERAVLK+FPRWPE KADALREAAFSY+DLK LE     S V    +  K+   V  +  + L+  R+E
Subjt:  LIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLE

Query:  QSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL
        QSVSN+ERTREFNC+KY  FQIPCQWM DS LP Q+K+S+LRL KE M RIT+E+Q  ETPQ ENLFLQG RFAYRVHQYAGGFDSE I AFEG+K+ GL
Subjt:  QSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL

Query:  QLNQRK
          +QRK
Subjt:  QLNQRK

A0A1S3C4V9 protein CHUP1, chloroplastic isoform X1    2.5e-146    71.67
Query:  EEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMD--VAGNDSTPQSP--PATDKWETTRTQKQSNW
        E+DEELAMEI  LK++LEISLQKS FLE+ENQEL+ EL R KS +QSLK  NN+RKSILWKKFH+SMD  VAG DS P +P   A DK E T+  KQS+W
Subjt:  EEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMD--VAGNDSTPQSP--PATDKWETTRTQKQSNW

Query:  AVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNR
          VKE+QRM A   +  PPPPPPLP KLLGGSKAVRRVPEVL+LYR +TKRDAQKENK A+GG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN 
Subjt:  AVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNR

Query:  LIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLE
        LI+EVE  APRDI+E E+FVKWLD +LASLVDERAVLKHFPRWPE KADALREAAFSY+DLKSLE     S V    +  K+   V  +  + L+  R+E
Subjt:  LIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLE

Query:  QSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL
        QSVSN+ERTREFNCKKY  FQIPCQWM DS LP Q+KLS+LRL KE M RIT+E++  ET Q ENLFLQGVRFAYRVHQYAGGFDSEAI AFEG+K+ GL
Subjt:  QSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL

Query:  QLNQRK
          +QRK
Subjt:  QLNQRK

A0A6J1DC83 protein CHUP1, chloroplastic    1.4e-149    72.35
Query:  EEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQ-KQSNWAVV
        EEDEELAMEI +L++EL+I++ KS+FLEKENQEL+QEL R KS +QSLK HNNDRKS+LWKKF+NSMD        +SPPATDK E T++  KQ  W  V
Subjt:  EEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQ-KQSNWAVV

Query:  KENQRMAAAAPTPAP-PPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLI
        KE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEFVN LI
Subjt:  KENQRMAAAAPTPAP-PPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLI

Query:  REVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQS
        +EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE E     V    +  K+   V  +  + L+  RLEQS
Subjt:  REVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQS

Query:  VSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQ
        VSNVE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E+Q ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VGL 
Subjt:  VSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQ

Query:  LNQRK
         +QRK
Subjt:  LNQRK

A0A6J1G8X0 protein CHUP1, chloroplastic    3.9e-200    92.06
Query:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV
        MEEDEELAMEIHALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV
Subjt:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV

Query:  KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
        KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
Subjt:  KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR

Query:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSV
        EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGE     V       K+      +  + L+  RLEQSV
Subjt:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSV

Query:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
        SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
Subjt:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN

Query:  QRK
        QRK
Subjt:  QRK

A0A6J1K8G4 protein CHUP1, chloroplastic    8.4e-195    90.12
Query:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSNWA
        MEEDEELAMEI ALKRELEISLQKSNFLEKENQELKQELARFKSH+QSLK HNNDRKSILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSNWA
Subjt:  MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSNWA

Query:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
        VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKA NGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
Subjt:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL

Query:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQ
        IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLE E     V       K+      +  + L+  RLEQ
Subjt:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQ

Query:  SVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQ
        SVSNVERTREFNC KYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE+QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL 
Subjt:  SVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQ

Query:  LNQRK
        L+QRK
Subjt:  LNQRK

SwissProt top hits    e value    % identity    Alignment
Q9LI74 Protein CHUP1, chloroplastic    2.2e-59    46.42
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVSNVE
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE    + V   V +          ++ K LE  ++EQSV  + 
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVSNVE

Query:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        RTR+    +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  E+      ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

Arabidopsis top hits    e value    % identity    Alignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown    1.8e-88    46.95
Query:  EEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVK
        E+D +L      L +EL+  L +++ LEKEN EL+QE+AR ++ V +LK H N+RKS+LWKK  +S D +  D +    P + K   T+ Q+  N     
Subjt:  EEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVK

Query:  ENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIRE
            +   +    PPPPPPLP+K   G ++VRR PEV+E YR +TKR++   NK    G  + AF +NMIGEIENRS YLS IKS+ + H + ++ LI +
Subjt:  ENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIRE

Query:  VEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVS
        VEAA   DI+EVE FVKW+D EL+SLVDERAVLKHFP+WPE K D+LREAA +YK  K+L G  + S   +  + + Q+ +    L+      RLE+SV+
Subjt:  VEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVS

Query:  NVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQV
        N E+ R+   K+Y  FQIP +WMLD+GL  Q+K SSLRL +E M+RI KE++ N + +  NL LQGVRFAY +HQ+AGGFD E +  F  +K++
Subjt:  NVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQV

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein    1.6e-60    46.42
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVSNVE
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE    + V   V +          ++ K LE  ++EQSV  + 
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVSNVE

Query:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        RTR+    +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  E+      ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein    1.6e-60    46.42
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVSNVE
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE    + V   V +          ++ K LE  ++EQSV  + 
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVSNVE

Query:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        RTR+    +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  E+      ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein    1.6e-60    46.42
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVSNVE
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE    + V   V +          ++ K LE  ++EQSV  + 
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVSNVE

Query:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        RTR+    +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  E+      ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQ----LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein    4.1e-61    40.21
Query:  KRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPA
        K E+E   + SN    E       L+  +S V  +      R   L     N  D     S P  PP                 ++++     + +  P 
Subjt:  KRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAAPTPA

Query:  PPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAP
        PPPPPP P  L   S  VRRVPEV+E Y  + +RD+    + + GG  A A         ++MIGEIENRS YL AIK++VET G+F+  LI+EV  AA 
Subjt:  PPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAP

Query:  RDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVSNVERTR
         DI +V  FVKWLD EL+ LVDERAVLKHF  WPE KADALREAAF Y DLK L  E  +       +  + S     +++   E  +LE  V ++ R R
Subjt:  RDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVSNVERTR

Query:  EFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNE--TPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        E    K+  FQIP  WML++G+ +Q+KL+S++L  + M+R++ E++  E   P+ E L +QGVRFA+RVHQ+AGGFD+E + AFE ++
Subjt:  EFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEVQLNE--TPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK


Sequences
CDS sequence
ATGGAAGAAGATGAAGAATTGGCCATGGAGATCCACGCCTTGAAAAGGGAATTGGAAATTTCACTGCAAAAATCGAATTTTCTTGAGAAAGAAAATCAAGAACTCAAACA
AGAATTGGCTCGATTCAAATCCCACGTTCAGTCTCTGAAAGTTCATAATAATGACAGGAAATCCATTCTTTGGAAGAAATTTCACAATTCCATGGATGTCGCCGGAAATG
ACTCCACGCCGCAGAGTCCGCCGGCGACTGACAAATGGGAGACTACCAGAACACAGAAACAGAGTAATTGGGCTGTTGTGAAAGAGAACCAGAGAATGGCGGCGGCGGCA
CCGACCCCGGCTCCTCCACCGCCGCCGCCACTTCCGACGAAGCTGCTCGGGGGGTCTAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCTGTACCGTTTAGTGACGAA
AAGGGATGCCCAGAAGGAAAATAAGGCCGCAAACGGAGGATTTCCAGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATCGAAAACCGATCAGCCTATCTCTCAGCGA
TAAAATCGGAGGTGGAGACCCATGGGGAGTTTGTGAACCGGCTGATCAGAGAAGTGGAGGCGGCAGCGCCAAGAGACATAGCGGAGGTGGAGAGGTTCGTGAAGTGGCTA
GACGGGGAGCTGGCATCGCTCGTGGACGAGAGGGCGGTGCTCAAGCACTTCCCACGGTGGCCGGAGGGGAAGGCAGACGCACTGCGGGAGGCAGCGTTCAGCTACAAGGA
TCTGAAGAGCTTGGAAGGCGAGCATGTACAATCTGTTGTATTTCATGTTTCTAATAAGGTGAAGCAGAGTTGGGAAGTGACTGATGAACTAGAGAAATGCTTGGAACTGT
GCAGGTTGGAGCAGAGCGTGAGCAATGTGGAGAGGACGAGGGAGTTCAACTGTAAGAAGTACAACAAGTTTCAAATCCCTTGCCAATGGATGCTGGACTCTGGCTTGCCA
GCCCAGATGAAGCTGAGCTCATTGAGGCTAGTGAAGGAATGCATGCGGAGGATAACAAAAGAGGTACAATTGAACGAAACCCCACAAACAGAAAACCTTTTTCTTCAAGG
GGTTCGCTTCGCTTACAGGGTGCACCAGTATGCAGGTGGTTTTGATTCGGAAGCTATAGTGGCTTTTGAAGGAATGAAGCAAGTTGGGCTGCAGCTTAATCAAAGAAAAT
AG
mRNA sequence
ATGGAAGAAGATGAAGAATTGGCCATGGAGATCCACGCCTTGAAAAGGGAATTGGAAATTTCACTGCAAAAATCGAATTTTCTTGAGAAAGAAAATCAAGAACTCAAACA
AGAATTGGCTCGATTCAAATCCCACGTTCAGTCTCTGAAAGTTCATAATAATGACAGGAAATCCATTCTTTGGAAGAAATTTCACAATTCCATGGATGTCGCCGGAAATG
ACTCCACGCCGCAGAGTCCGCCGGCGACTGACAAATGGGAGACTACCAGAACACAGAAACAGAGTAATTGGGCTGTTGTGAAAGAGAACCAGAGAATGGCGGCGGCGGCA
CCGACCCCGGCTCCTCCACCGCCGCCGCCACTTCCGACGAAGCTGCTCGGGGGGTCTAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCTGTACCGTTTAGTGACGAA
AAGGGATGCCCAGAAGGAAAATAAGGCCGCAAACGGAGGATTTCCAGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATCGAAAACCGATCAGCCTATCTCTCAGCGA
TAAAATCGGAGGTGGAGACCCATGGGGAGTTTGTGAACCGGCTGATCAGAGAAGTGGAGGCGGCAGCGCCAAGAGACATAGCGGAGGTGGAGAGGTTCGTGAAGTGGCTA
GACGGGGAGCTGGCATCGCTCGTGGACGAGAGGGCGGTGCTCAAGCACTTCCCACGGTGGCCGGAGGGGAAGGCAGACGCACTGCGGGAGGCAGCGTTCAGCTACAAGGA
TCTGAAGAGCTTGGAAGGCGAGCATGTACAATCTGTTGTATTTCATGTTTCTAATAAGGTGAAGCAGAGTTGGGAAGTGACTGATGAACTAGAGAAATGCTTGGAACTGT
GCAGGTTGGAGCAGAGCGTGAGCAATGTGGAGAGGACGAGGGAGTTCAACTGTAAGAAGTACAACAAGTTTCAAATCCCTTGCCAATGGATGCTGGACTCTGGCTTGCCA
GCCCAGATGAAGCTGAGCTCATTGAGGCTAGTGAAGGAATGCATGCGGAGGATAACAAAAGAGGTACAATTGAACGAAACCCCACAAACAGAAAACCTTTTTCTTCAAGG
GGTTCGCTTCGCTTACAGGGTGCACCAGTATGCAGGTGGTTTTGATTCGGAAGCTATAGTGGCTTTTGAAGGAATGAAGCAAGTTGGGCTGCAGCTTAATCAAAGAAAAT
AG
Protein sequence
MEEDEELAMEIHALKRELEISLQKSNFLEKENQELKQELARFKSHVQSLKVHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAAAA
PTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVKWL
DGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEHVQSVVFHVSNKVKQSWEVTDELEKCLELCRLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLP
AQMKLSSLRLVKECMRRITKEVQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK