; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G007120 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G007120
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionprotein CHUP1, chloroplastic
Genome locationCmo_Chr01:3654001..3656554
RNA-Seq ExpressionCmoCh01G007120
SyntenyCmoCh01G007120
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607325.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.2e-21898.75Show/hide
Query:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWA
        MPMEEDEELAMEIHALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWA
Subjt:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWA

Query:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
        VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
Subjt:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL

Query:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE
        IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE
Subjt:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE

Query:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
        RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
Subjt:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK

KAG7037002.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-19992.06Show/hide
Query:  MEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV
        MEEDEELAMEIHALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV
Subjt:  MEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVV

Query:  KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
        KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
Subjt:  KENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR

Query:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGE-----VCSFRENPKEETNAMLKRAQALQ-DRLEQSV
        EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGE     V       K+      +  + L+  RLEQSV
Subjt:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGE-----VCSFRENPKEETNAMLKRAQALQ-DRLEQSV

Query:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
        SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
Subjt:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN

Query:  QRK
        QRK
Subjt:  QRK

XP_022948306.1 protein CHUP1, chloroplastic [Cucurbita moschata]1.6e-221100Show/hide
Query:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWA
        MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWA
Subjt:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWA

Query:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
        VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
Subjt:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL

Query:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE
        IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE
Subjt:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE

Query:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
        RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
Subjt:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK

XP_022998607.1 protein CHUP1, chloroplastic [Cucurbita maxima]2.6e-21196.26Show/hide
Query:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSN
        MPMEEDEELAMEI ALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKSILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSN
Subjt:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSN

Query:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
        WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKA NGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
Subjt:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN

Query:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
        RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLE EVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Subjt:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN

Query:  VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQR
        VERTREFNC KYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL L+QR
Subjt:  VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQR

Query:  K
        K
Subjt:  K

XP_023523072.1 protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]5.7e-21497.51Show/hide
Query:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSN
        MPMEEDEELAMEI ALKRELEISLQKS FLEKENQELKQELARFKSHI SLKAHNNDRKSILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSN
Subjt:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSN

Query:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
        WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
Subjt:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN

Query:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
        RLIREVEAAAPRDIAEVERFVKWLDGEL SLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLE EVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Subjt:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN

Query:  VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQR
        VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQR
Subjt:  VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQR

Query:  K
        K
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A0A0LVK7 Uncharacterized protein1.2e-16476.92Show/hide
Query:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMD--VAGNDSTPQSPP--ATDKWETTRTQKQ
        MP EEDE LAMEI+ LK+ELEISLQKSIFLEKENQEL+QEL R +S I S KA NN+RKSILWKKFH+S+D  VAG DS P SP   A DK E+T++ KQ
Subjt:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMD--VAGNDSTPQSPP--ATDKWETTRTQKQ

Query:  SNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        S+W  VKE+ RM     +P PPPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK A+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG+F
Subjt:  SNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSV
        VN LI+EVE  APRDI+EVERFVKWLDG+LASLVDERAVLK+FPRWPE KADALREAAFSY+DLK LE +VC FR+NPKEE N +LKRAQALQDR+EQSV
Subjt:  VNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSV

Query:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
        SN+ERTREFNC+KY  FQIPCQWM DS LP Q+K+S+LRL KE M RIT+E Q  ETPQ ENLFLQG RFAYRVHQYAGGFDSE I AFEG+K+ GL  +
Subjt:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN

Query:  QRK
        QRK
Subjt:  QRK

A0A1S3C4V9 protein CHUP1, chloroplastic isoform X19.3e-16276.67Show/hide
Query:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMD--VAGNDSTPQSP--PATDKWETTRTQKQ
        MP E+DEELAMEI  LK++LEISLQKSIFLE+ENQEL+ EL R KS I SLKA NN+RKSILWKKFH+SMD  VAG DS P +P   A DK E T+  KQ
Subjt:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMD--VAGNDSTPQSP--PATDKWETTRTQKQ

Query:  SNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF
        S+W  VKE+QRM A   +  PPPPPPLP KLLGGSKAVRRVPEVL+LYR +TKRDAQKENK A+GG P VAFTKNMIGEIENRSAYLSAIKSEVETHGEF
Subjt:  SNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEF

Query:  VNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSV
        VN LI+EVE  APRDI+E E+FVKWLD +LASLVDERAVLKHFPRWPE KADALREAAFSY+DLKSLE +VC FR+NPKEE N +LKRAQALQDR+EQSV
Subjt:  VNRLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSV

Query:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN
        SN+ERTREFNCKKY  FQIPCQWM DS LP Q+KLS+LRL KE M RIT+E +  ET Q ENLFLQGVRFAYRVHQYAGGFDSEAI AFEG+K+ GL  +
Subjt:  SNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLN

Query:  QRK
        QRK
Subjt:  QRK

A0A6J1DC83 protein CHUP1, chloroplastic4.4e-16477.36Show/hide
Query:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQ-KQSNW
        MP EEDEELAMEI +L++EL+I++ KS FLEKENQEL+QEL R KS I SLKAHNNDRKS+LWKKF+NSMD        +SPPATDK E T++  KQ  W
Subjt:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQ-KQSNW

Query:  AVVKENQRMAAAAPTPAP-PPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
          VKE+QRM   AP PAP PPPPPLPTKLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GGFPAVAFTKNMIGEIENRSAYL+AIKSEVETHGEFVN
Subjt:  AVVKENQRMAAAAPTPAP-PPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN

Query:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
         LI+EVE AAPRDI EVERFV WLD EL SLVDERAVLKHFPRWPEGKADALREAAFSY+DLKSLE EVCSFR+NPKEE   +LKRAQALQDRLEQSVSN
Subjt:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN

Query:  VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQ
        VE+TREF+C KY  F+IPC+WM +SGL  QMKLSSLRL KE MRRIT+E Q ++ T Q +NL LQGVRFAYRVHQYAGGFDS+AI AFEG+K+VGL  +Q
Subjt:  VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQ-LNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQ

Query:  RK
        RK
Subjt:  RK

A0A6J1G8X0 protein CHUP1, chloroplastic8.0e-222100Show/hide
Query:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWA
        MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWA
Subjt:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWA

Query:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
        VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
Subjt:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL

Query:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE
        IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE
Subjt:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE

Query:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
        RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK
Subjt:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK

A0A6J1K8G4 protein CHUP1, chloroplastic1.3e-21196.26Show/hide
Query:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSN
        MPMEEDEELAMEI ALKRELEISLQKS FLEKENQELKQELARFKSH+ SLK HNNDRKSILWKKFHNSMD  VAG DS+PQSPPATDKWETTRTQKQSN
Subjt:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMD--VAGNDSTPQSPPATDKWETTRTQKQSN

Query:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
        WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKA NGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN
Subjt:  WAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVN

Query:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
        RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLE EVCSFRENPKEETNAMLKRAQALQDRLEQSVSN
Subjt:  RLIREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSN

Query:  VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQR
        VERTREFNC KYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE QLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGL L+QR
Subjt:  VERTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQR

Query:  K
        K
Subjt:  K

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic8.6e-6448.08Show/hide
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+   L +++EQSV  + RTR+  
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN

Query:  CKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
          +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  E       ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

Arabidopsis top hitse value%identityAlignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown7.9e-9749.1Show/hide
Query:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWA
        +P  ED+    ++  L +EL+  L ++  LEKEN EL+QE+AR ++ + +LK+H N+RKS+LWKK  +S D +  D +    P + K   T+ Q+  N  
Subjt:  MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWA

Query:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL
               +   +    PPPPPPLP+K   G ++VRR PEV+E YR +TKR++   NK    G  + AF +NMIGEIENRS YLS IKS+ + H + ++ L
Subjt:  VVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRL

Query:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE
        I +VEAA   DI+EVE FVKW+D EL+SLVDERAVLKHFP+WPE K D+LREAA +YK  K+L  E+ SF++NPK+     L+R Q+LQDRLE+SV+N E
Subjt:  IREVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVE

Query:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQV
        + R+   K+Y  FQIP +WMLD+GL  Q+K SSLRL +E M+RI KE + N + +  NL LQGVRFAY +HQ+AGGFD E +  F  +K++
Subjt:  RTREFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQV

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein6.1e-6548.08Show/hide
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+   L +++EQSV  + RTR+  
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN

Query:  CKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
          +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  E       ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein6.1e-6548.08Show/hide
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+   L +++EQSV  + RTR+  
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN

Query:  CKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
          +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  E       ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein6.1e-6548.08Show/hide
Query:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA
        P   PPPPPP P  L    GG   V R PE++E Y+ + KR+++KE   +  ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A
Subjt:  PTPAPPPPPPLPTKL---LGGSKAVRRVPEVLELYRLVTKRDAQKE---NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEA

Query:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN
        ++  DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF ++P       LK+   L +++EQSV  + RTR+  
Subjt:  AAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFN

Query:  CKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
          +Y +F IP  W+ D+G+  ++KLSS++L K+ M+R+  E       ++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  CKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKE----KQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-7150.52Show/hide
Query:  AAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR
        + +  P PPPPPP P  L   S  VRRVPEV+E Y  + +RD+    + + GG  A A         ++MIGEIENRS YL AIK++VET G+F+  LI+
Subjt:  AAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIR

Query:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERT
        EV  AA  DI +V  FVKWLD EL+ LVDERAVLKHF  WPE KADALREAAF Y DLK L  E   FRE+P++ +++ LK+ QAL ++LE  V ++ R 
Subjt:  EVEAAAPRDIAEVERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERT

Query:  REFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNE--TPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        RE    K+  FQIP  WML++G+ +Q+KL+S++L  + M+R++ E +  E   P+ E L +QGVRFA+RVHQ+AGGFD+E + AFE ++
Subjt:  REFNCKKYNKFQIPCQWMLDSGLPAQMKLSSLRLVKECMRRITKEKQLNE--TPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAATGGAAGAAGATGAAGAATTGGCCATGGAGATCCACGCCTTGAAAAGGGAATTGGAAATTTCACTGCAAAAATCTATTTTTCTCGAGAAAGAAAATCAAGAACT
CAAACAAGAATTGGCTCGATTCAAATCTCACATTCACTCTCTGAAAGCTCATAATAATGACAGAAAATCCATTCTTTGGAAGAAATTTCACAATTCCATGGATGTCGCCG
GAAATGACTCCACGCCGCAGAGTCCACCGGCGACTGACAAATGGGAGACTACCAGAACACAGAAACAGAGTAATTGGGCTGTTGTGAAAGAGAACCAGAGAATGGCGGCG
GCGGCACCGACCCCGGCTCCTCCACCGCCGCCGCCACTTCCGACGAAGCTTCTCGGGGGATCTAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCTGTACCGTTTAGT
GACGAAAAGGGATGCCCAGAAGGAAAATAAGGCCGCAAACGGAGGATTTCCGGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATCGAAAACCGATCAGCCTATCTCT
CAGCGATAAAATCGGAGGTGGAGACACATGGGGAGTTTGTGAACCGGCTGATCAGAGAAGTGGAAGCGGCAGCGCCAAGAGACATAGCGGAGGTGGAGAGGTTCGTGAAG
TGGCTAGACGGGGAGCTGGCATCGCTCGTGGACGAGAGGGCGGTGCTCAAGCACTTCCCACGGTGGCCGGAGGGGAAGGCAGACGCACTGCGGGAGGCGGCATTCAGCTA
CAAGGATCTGAAGAGCTTGGAAGGTGAAGTGTGTTCGTTTAGAGAGAATCCAAAGGAGGAGACGAATGCAATGTTGAAGAGGGCTCAGGCCTTGCAAGACAGGTTGGAGC
AGAGCGTGAGCAATGTGGAGAGGACGAGGGAGTTCAACTGTAAGAAGTACAACAAGTTTCAAATCCCTTGCCAATGGATGCTGGACTCTGGCTTGCCAGCCCAGATGAAG
CTGAGCTCATTGAGGCTAGTGAAGGAATGCATGCGGAGGATAACAAAAGAGAAACAATTGAACGAAACCCCACAAACAGAAAACCTTTTTCTTCAAGGGGTTCGCTTTGC
TTACAGGGTGCACCAGTATGCAGGAGGTTTTGATTCGGAAGCTATAGTGGCTTTTGAAGGAATGAAGCAAGTTGGGCTGCAGCTTAATCAAAGAAAATAG
mRNA sequenceShow/hide mRNA sequence
GGAAGCAAATTATTGTTGGTTGGTTGGCTTCTCCTTCAGCCCATTTCAGAGAAAGACCACAAAATCAGAGTAGAATGCCAATGGAAGAAGATGAAGAATTGGCCATGGAG
ATCCACGCCTTGAAAAGGGAATTGGAAATTTCACTGCAAAAATCTATTTTTCTCGAGAAAGAAAATCAAGAACTCAAACAAGAATTGGCTCGATTCAAATCTCACATTCA
CTCTCTGAAAGCTCATAATAATGACAGAAAATCCATTCTTTGGAAGAAATTTCACAATTCCATGGATGTCGCCGGAAATGACTCCACGCCGCAGAGTCCACCGGCGACTG
ACAAATGGGAGACTACCAGAACACAGAAACAGAGTAATTGGGCTGTTGTGAAAGAGAACCAGAGAATGGCGGCGGCGGCACCGACCCCGGCTCCTCCACCGCCGCCGCCA
CTTCCGACGAAGCTTCTCGGGGGATCTAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCTGTACCGTTTAGTGACGAAAAGGGATGCCCAGAAGGAAAATAAGGCCGC
AAACGGAGGATTTCCGGCGGTGGCGTTCACCAAAAACATGATCGGCGAAATCGAAAACCGATCAGCCTATCTCTCAGCGATAAAATCGGAGGTGGAGACACATGGGGAGT
TTGTGAACCGGCTGATCAGAGAAGTGGAAGCGGCAGCGCCAAGAGACATAGCGGAGGTGGAGAGGTTCGTGAAGTGGCTAGACGGGGAGCTGGCATCGCTCGTGGACGAG
AGGGCGGTGCTCAAGCACTTCCCACGGTGGCCGGAGGGGAAGGCAGACGCACTGCGGGAGGCGGCATTCAGCTACAAGGATCTGAAGAGCTTGGAAGGTGAAGTGTGTTC
GTTTAGAGAGAATCCAAAGGAGGAGACGAATGCAATGTTGAAGAGGGCTCAGGCCTTGCAAGACAGGTTGGAGCAGAGCGTGAGCAATGTGGAGAGGACGAGGGAGTTCA
ACTGTAAGAAGTACAACAAGTTTCAAATCCCTTGCCAATGGATGCTGGACTCTGGCTTGCCAGCCCAGATGAAGCTGAGCTCATTGAGGCTAGTGAAGGAATGCATGCGG
AGGATAACAAAAGAGAAACAATTGAACGAAACCCCACAAACAGAAAACCTTTTTCTTCAAGGGGTTCGCTTTGCTTACAGGGTGCACCAGTATGCAGGAGGTTTTGATTC
GGAAGCTATAGTGGCTTTTGAAGGAATGAAGCAAGTTGGGCTGCAGCTTAATCAAAGAAAATAGGGTTCTTTGGTGATAAGTTATAGTTAACAGCACTTGTAAGAATCAA
CATTGCAGCAGACCACATTCAGAAAAGGGATGTAATATGAATGATTGAATGGGAAGTTCTATACACAATCAATCCTATGCTTATTGCAACTTATTC
Protein sequenceShow/hide protein sequence
MPMEEDEELAMEIHALKRELEISLQKSIFLEKENQELKQELARFKSHIHSLKAHNNDRKSILWKKFHNSMDVAGNDSTPQSPPATDKWETTRTQKQSNWAVVKENQRMAA
AAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKENKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEVERFVK
WLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEGEVCSFRENPKEETNAMLKRAQALQDRLEQSVSNVERTREFNCKKYNKFQIPCQWMLDSGLPAQMK
LSSLRLVKECMRRITKEKQLNETPQTENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKQVGLQLNQRK