; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G2487 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G2487
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Descriptionprotein CHUP1, chloroplastic
Genome locationctg1002:5823955..5826074
RNA-Seq ExpressionCucsat.G2487
SyntenyCucsat.G2487
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
GO:0005525 - GTP binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052630.1 protein CHUP1 [Cucumis melo var. makuwa]0.098.04Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEE KQSVEEERRESQER+KAMEGE++ELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
        NLIRNLKRATKCSDAVVNQDNHKVEHPE KKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTS SSSSTGSSADIEKAIPA
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA

Query:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTK MPPPPPPPSKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVT+PPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRESAAKRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKY
        VQCQNQQQHK+
Subjt:  VQCQNQQQHKY

XP_004134665.1 protein CHUP1, chloroplastic [Cucumis sativus]0.0100Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
        NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA

Query:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKYVWSSRPTTC
        VQCQNQQQHKYVWSSRPTTC
Subjt:  VQCQNQQQHKYVWSSRPTTC

XP_008439756.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo]0.097.9Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEE KQSVEEERRESQERIKAMEGE++ELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
        NLIRNLKRATKCSDAVVNQDNHKVEHPE KKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVP+PPPKPSSSSSSSATTS  SSSTGSSADIEKAIPA
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA

Query:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTK MPPPPPPPSKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVT+PPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IR LIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRESAAKRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKYVWSSRPTTC
        VQCQNQQQHKYVWSSRPTTC
Subjt:  VQCQNQQQHKYVWSSRPTTC

XP_022926872.1 protein CHUP1, chloroplastic-like [Cucurbita moschata]0.092.77Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPK STPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEE KQSVEE+RRESQER+KAMEGE+AELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
        NLIR+LKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSSAT+S+SS+STGSS D EK IPA
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA

Query:  PPPVPTKAMPPPPPPP-SKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTK  PPPPPPP SKSAPPPPPPPPKGKR  PAKVRRIPEVVEFYHSLMRRDSRR+ GSGVTEPPS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKAMPPPPPPP-SKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQQHKYVWSS-RPTTC
        HVQCQNQQ HKYV SS RPTTC
Subjt:  HVQCQNQQQHKYVWSS-RPTTC

XP_038883847.1 protein CHUP1, chloroplastic [Benincasa hispida]0.096.29Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRV+VEE KQSVEEERRESQERIKAME E+AELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
        NLIRNLKR TKCS+AVVNQDNHK EHPEAKKEEVETERPRHSRCNSEELAE TLSNIKSRIPRVPKPPPKPSSSSSSSA  +TSSSSTGSS D+EKAIPA
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA

Query:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTK MPPPPPPPSKSAPPPPPPPPKGKR MP KVRRIPEVVEFYHSLMRRDSRRDSGS VT+PPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRESA KRYKAFQIPVEWMLD GIV QIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKYVWSSRPTTC
        VQCQNQQ HKYVWSSRPTTC
Subjt:  VQCQNQQQHKYVWSSRPTTC

TrEMBL top hitse value%identityAlignment
A0A0A0KHU8 Uncharacterized protein0.0100Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
        NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA

Query:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKYVWSSRPTTC
        VQCQNQQQHKYVWSSRPTTC
Subjt:  VQCQNQQQHKYVWSSRPTTC

A0A1S3AZH3 protein CHUP1, chloroplastic0.097.9Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEE KQSVEEERRESQERIKAMEGE++ELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
        NLIRNLKRATKCSDAVVNQDNHKVEHPE KKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVP+PPPKPSSSSSSSATTS  SSSTGSSADIEKAIPA
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA

Query:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTK MPPPPPPPSKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVT+PPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IR LIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRESAAKRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKYVWSSRPTTC
        VQCQNQQQHKYVWSSRPTTC
Subjt:  VQCQNQQQHKYVWSSRPTTC

A0A5D3CMM2 Protein CHUP10.098.04Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEE KQSVEEERRESQER+KAMEGE++ELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
        NLIRNLKRATKCSDAVVNQDNHKVEHPE KKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTS SSSSTGSSADIEKAIPA
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA

Query:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTK MPPPPPPPSKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVT+PPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRESAAKRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKY
        VQCQNQQQHK+
Subjt:  VQCQNQQQHKY

A0A6J1EFK1 protein CHUP1, chloroplastic-like0.092.77Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPK STPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEE KQSVEE+RRESQER+KAMEGE+AELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
        NLIR+LKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSSAT+S+SS+STGSS D EK IPA
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA

Query:  PPPVPTKAMPPPPPPP-SKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTK  PPPPPPP SKSAPPPPPPPPKGKR  PAKVRRIPEVVEFYHSLMRRDSRR+ GSGVTEPPS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKAMPPPPPPP-SKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQQHKYVWSS-RPTTC
        HVQCQNQQ HKYV SS RPTTC
Subjt:  HVQCQNQQQHKYVWSS-RPTTC

A0A6J1KWU6 protein CHUP1, chloroplastic-like0.092.77Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPA RKVESSPK STPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEE KQSVEE+RRESQER+KAMEGE+AELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA
        NLIR+LKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR NSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSAT+S+SS+STGSS D EK IPA
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPA

Query:  PPPVPTKAMPPPPPPP-SKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTK  PPPPPPP SKSAPPPPPPPPKGKR   AKVRRIPEVVEFYHSLMRRDSRR+ GSGVTEPPS+ANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  PPPVPTKAMPPPPPPP-SKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQQHKYVWSS-RPTTC
        HVQCQNQQ HKYV SS RPTTC
Subjt:  HVQCQNQQQHKYVWSS-RPTTC

SwissProt top hitse value%identityAlignment
Q1PEB4 Uncharacterized protein At4g049801.3e-0522.73Show/hide
Query:  PKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRIL
        P   TP    P +  +S     S S  ++  R+ A  +  P D+      +   RD E+  +T +   +  +ES         EI  ++ E E     +L
Subjt:  PKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRIL

Query:  FLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKV
          E   + ++ +        E E  E+++  +  E E            R+E      E+ A    +   +   + + I  L+   +  DA  + ++ + 
Subjt:  FLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKV

Query:  EHP--EAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAMPPPPPPP------
        EH   E  + E E E   HS   + E  +ST S+ K  +P    PPP  +S  + S T ST +    + + +    P PPP P    P PPPPP      
Subjt:  EHP--EAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAMPPPPPPP------

Query:  ------------------SKSAPPPPPPPPKGKRLMPA--KVRRIPEVVEFYHSL------------MRRDSRRDSGSGVTEPPSTANA--RDMIGEIEN
                          + S P PP PP  G+ L  A  K+RR  ++   Y +L             ++ S+  +      P   A +   D + E+  
Subjt:  ------------------SKSAPPPPPPPPKGKRLMPA--KVRRIPEVVEFYHSL------------MRRDSRRDSGSGVTEPPSTANA--RDMIGEIEN

Query:  RSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQ-WPEQKADALREAAFGYCDLKKLESEASSFRGDARQPC
        RS++   I+ DV+     I  L   + +    D+++++ F   ++  L  L DE  VL  F+ +PE+K + +R A   Y  L  +  E  +++     P 
Subjt:  RSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQ-WPEQKADALREAAFGYCDLKKLESEASSFRGDARQPC

Query:  GSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVS---VKLAMKYMKRVSAELETVGGGPEEEE---LIVQGVRFAFRVH
           L K++    K +  +  + R ++  AK +K + I +++ +   +   +  VS   ++LA+K  +  + E +       +EE    + +  +FAF+V+
Subjt:  GSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVS---VKLAMKYMKRVSAELETVGGGPEEEE---LIVQGVRFAFRVH

Query:  QFAGGFD
         FAGG D
Subjt:  QFAGGFD

Q9LI74 Protein CHUP1, chloroplastic4.7e-8547.97Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +Q N   E  E K  E           N+  + +  L +I+ R PRV
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPA----KVRRIPEVVEFYHSLMRRDSRRDS
        P+PPP+ +    S+   S      G         P PPP P    PPPPP       PPPPPPP G     A    KV R PE+VEFY SLM+R+S+++ 
Subjt:  PKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPA----KVRRIPEVVEFYHSLMRRDSRRDS

Query:  GSGVTEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAA
           +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELSFLVDERAVLKHF WPE KADALREAA
Subjt:  GSGVTEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGG
        F Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D G+V +IKL SV+LA KYMKRV+ EL++V G 
Subjt:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGG

Query:  ---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
           P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Q9LI74 Protein CHUP1, chloroplastic2.3e+0124.87Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQS--VEEERRESQERIKAMEGEVAE
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L+   EE  Q+  V +E   ++ +IK ++ ++  
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQS--VEEERRESQERIKAMEGEVAE

Query:  LKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI
          ++  ++++ +L+L    +S+ Q     M+     N    ++R  K   AV + +   +E     +E    +R    + +S E   +TLSN+
Subjt:  LKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI

Arabidopsis top hitse value%identityAlignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown4.9e-6144.32Show/hide
Query:  ERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIE---KAIPAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRL
        E+  H          + +SN+KS          K   SS   + T  S+     S       + +  P P PT          + + PPPPPP P  + L
Subjt:  ERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIE---KAIPAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRL

Query:  MPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFL
            VRR PEVVEFY +L +R+S   +        S A  R+MIGEIENRS +L  IK+D +   D I  LI +VE A+FTDI +V  FVKW+D+ELS L
Subjt:  MPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFL

Query:  VDERAVLKHF-QWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQI
        VDERAVLKHF +WPE+K D+LREAA  Y   K L +E  SF+ + +     AL+++Q+L ++LE  V N  +MR+S  KRYK FQIP EWMLD G++ Q+
Subjt:  VDERAVLKHF-QWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQI

Query:  KLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELR
        K  S++LA +YMKR++ ELE+ G G +E  L++QGVRFA+ +HQFAGGFD ET+  F EL+
Subjt:  KLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELR

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein3.4e-8647.97Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +Q N   E  E K  E           N+  + +  L +I+ R PRV
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPA----KVRRIPEVVEFYHSLMRRDSRRDS
        P+PPP+ +    S+   S      G         P PPP P    PPPPP       PPPPPPP G     A    KV R PE+VEFY SLM+R+S+++ 
Subjt:  PKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPA----KVRRIPEVVEFYHSLMRRDSRRDS

Query:  GSGVTEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAA
           +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELSFLVDERAVLKHF WPE KADALREAA
Subjt:  GSGVTEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGG
        F Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D G+V +IKL SV+LA KYMKRV+ EL++V G 
Subjt:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGG

Query:  ---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
           P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein1.6e+0024.87Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQS--VEEERRESQERIKAMEGEVAE
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L+   EE  Q+  V +E   ++ +IK ++ ++  
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQS--VEEERRESQERIKAMEGEVAE

Query:  LKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI
          ++  ++++ +L+L    +S+ Q     M+     N    ++R  K   AV + +   +E     +E    +R    + +S E   +TLSN+
Subjt:  LKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein3.4e-8647.97Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +Q N   E  E K  E           N+  + +  L +I+ R PRV
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPA----KVRRIPEVVEFYHSLMRRDSRRDS
        P+PPP+ +    S+   S      G         P PPP P    PPPPP       PPPPPPP G     A    KV R PE+VEFY SLM+R+S+++ 
Subjt:  PKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPA----KVRRIPEVVEFYHSLMRRDSRRDS

Query:  GSGVTEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAA
           +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELSFLVDERAVLKHF WPE KADALREAA
Subjt:  GSGVTEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGG
        F Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D G+V +IKL SV+LA KYMKRV+ EL++V G 
Subjt:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGG

Query:  ---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
           P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein1.6e+0024.87Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQS--VEEERRESQERIKAMEGEVAE
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L+   EE  Q+  V +E   ++ +IK ++ ++  
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQS--VEEERRESQERIKAMEGEVAE

Query:  LKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI
          ++  ++++ +L+L    +S+ Q     M+     N    ++R  K   AV + +   +E     +E    +R    + +S E   +TLSN+
Subjt:  LKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein3.4e-8647.97Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +Q N   E  E K  E           N+  + +  L +I+ R PRV
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPA----KVRRIPEVVEFYHSLMRRDSRRDS
        P+PPP+ +    S+   S      G         P PPP P    PPPPP       PPPPPPP G     A    KV R PE+VEFY SLM+R+S+++ 
Subjt:  PKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKGKRLMPA----KVRRIPEVVEFYHSLMRRDSRRDS

Query:  GSGVTEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAA
           +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELSFLVDERAVLKHF WPE KADALREAA
Subjt:  GSGVTEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGG
        F Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D G+V +IKL SV+LA KYMKRV+ EL++V G 
Subjt:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETVGGG

Query:  ---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
           P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-17359.06Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPK------TSTPAQPSPSSGKVS-------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRMVEELRDR
        MVAGKV+V MG  KSP+++K +  P          P    PSSG  +        K  F+RSFGVYFPR+SAQV            V+EL R VEELR+R
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPK------TSTPAQPSPSSGKVS-------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRMVEELRDR

Query:  EARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILEN
        EA LKT+ LE KLL+ESV+++P+LE++I+ K+ EI+   K    L  +NERLR + + +    EE RRE + R K ME E+ EL+K+    S      ++
Subjt:  EARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILEN

Query:  DELSASQRFQGLMEVSGKSNLIRNLKRA---TKCSDAVVNQDNHKVEHPEA--------KKEEVETERPRHSR-CNSEELAE-STLSNIKSRIPRVPKPP
          LS SQRFQGLM+VS KSNLIR+LKR        + + NQ+N       +        +K+E+E+    +SR  NSEEL E S+LS ++SR+PRVPKPP
Subjt:  DELSASQRFQGLMEVSGKSNLIRNLKRA---TKCSDAVVNQDNHKVEHPEA--------KKEEVETERPRHSR-CNSEELAE-STLSNIKSRIPRVPKPP

Query:  PKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAM---PPPPPPPSKS-APPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRD---SRRDSG
        PK S S          S+   +    +K+IP PPP P   +   PPPPP  SK+  PPPPPPPPK   +  AKVRR+PEVVEFYHSLMRRD   SRRDS 
Subjt:  PKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAM---PPPPPPPSKS-APPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRD---SRRDSG

Query:  SGVTEPP----STANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAA
         G         + +NARDMIGEIENRS +LLAIKTDVETQGDFIRFLIKEV NA+F+DIEDVVPFVKWLDDELS+LVDERAVLKHF+WPEQKADALREAA
Subjt:  SGVTEPP----STANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETV-GG
        F Y DLKKL SEAS FR D RQ   SALKKMQAL EKLEHGVY+LSRMRESAA ++K+FQIPV+WML+ GI SQIKL SVKLAMKYMKRVSAELE + GG
Subjt:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAELETV-GG

Query:  GPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQ-QQHKYVWSSRP
        GPEEEELIVQGVRFAFRVHQFAGGFD ETM+AF+ELRDKA SCHVQCQ+Q  QHK  + S P
Subjt:  GPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQ-QQHKYVWSSRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGCTGGGAAGGTGAAGGTCGCAATGGGGTTGCAGAAGTCTCCGGCGAGTAGAAAGGTTGAGAGCTCACCGAAGACATCGACGCCGGCGCAGCCTTCTCCGAGCTC
CGGTAAGGTTTCTCAGAAAACGGTTTTCTCTCGCTCATTTGGTGTCTATTTCCCTCGTTCTTCTGCTCAGGTTCAGCCTCGACCGCCTGACGTCACGGAGCTCCTTCGTA
TGGTTGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCATAAGTTGTTGAAGGAATCTGTCGCCATTGTGCCTGTGCTTGAGAATGAGATATCT
ACGAAAGATGCGGAGATTGAAAGAGCGTCTAAGCGGATTCTGTTCTTGGAGGCGGAGAATGAGCGGTTGAGAGTTCAAGTGGAGGAAGCTAAACAGAGTGTTGAGGAGGA
GAGGAGAGAAAGTCAAGAGAGAATAAAGGCAATGGAAGGTGAAGTCGCTGAGCTGAAGAAAATGGCGTTGGATCGAAGCAGAATGGAGCTTATTTTGGAGAACGATGAGC
TTTCGGCGTCACAGAGGTTTCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACTAAATGTTCGGATGCTGTTGTTAACCAAGAC
AATCATAAGGTCGAACATCCGGAGGCAAAGAAAGAAGAAGTTGAAACCGAGAGGCCGAGACACTCGCGTTGTAACTCGGAAGAACTTGCTGAGTCCACCCTCTCCAACAT
AAAATCGAGAATACCTAGGGTTCCAAAACCTCCTCCGAAACCTTCTTCATCTTCCTCTTCTTCTGCCACTACCTCCACCTCTTCCTCATCAACTGGCTCTTCTGCTGACA
TAGAGAAAGCGATCCCAGCCCCACCCCCTGTTCCAACCAAGGCAATGCCGCCTCCTCCGCCGCCACCATCGAAATCCGCACCCCCTCCCCCTCCACCACCTCCCAAGGGT
AAGAGGCTCATGCCAGCGAAGGTACGGCGAATACCGGAGGTTGTTGAGTTCTATCATTCGTTAATGCGGAGAGATTCCCGGCGAGATTCCGGCTCCGGCGTTACGGAGCC
GCCATCGACCGCCAATGCTCGGGACATGATCGGAGAGATTGAGAACCGGTCGGCTCACTTACTCGCTATAAAGACAGATGTAGAAACTCAGGGAGATTTCATAAGATTCT
TGATAAAAGAAGTCGAAAATGCTTCATTTACTGACATTGAAGACGTTGTACCATTTGTCAAATGGTTGGATGATGAGCTCTCATTTCTGGTAGATGAAAGAGCCGTGCTT
AAACACTTCCAGTGGCCGGAGCAAAAGGCTGATGCTCTGCGTGAGGCTGCATTTGGCTATTGCGATCTGAAGAAGCTGGAATCCGAAGCCTCATCGTTTCGTGGTGATGC
CCGCCAGCCCTGCGGATCGGCTCTCAAGAAGATGCAAGCTTTGCTTGAAAAGTTGGAGCATGGCGTATACAATTTGTCTAGAATGCGTGAATCTGCCGCTAAGAGATATA
AAGCATTTCAAATTCCAGTGGAATGGATGCTTGATGGTGGAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCAGAA
CTTGAAACAGTCGGTGGTGGACCTGAAGAAGAAGAGCTGATTGTTCAAGGCGTTAGATTTGCCTTCCGTGTGCATCAGTTTGCTGGAGGGTTTGATGTAGAAACAATGAG
GGCATTTCAAGAGCTGAGGGACAAGGCAAGTTCATGTCATGTACAATGCCAAAACCAGCAACAACATAAGTACGTGTGGAGTAGCAGGCCTACAACTTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGCTGGGAAGGTGAAGGTCGCAATGGGGTTGCAGAAGTCTCCGGCGAGTAGAAAGGTTGAGAGCTCACCGAAGACATCGACGCCGGCGCAGCCTTCTCCGAGCTC
CGGTAAGGTTTCTCAGAAAACGGTTTTCTCTCGCTCATTTGGTGTCTATTTCCCTCGTTCTTCTGCTCAGGTTCAGCCTCGACCGCCTGACGTCACGGAGCTCCTTCGTA
TGGTTGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCATAAGTTGTTGAAGGAATCTGTCGCCATTGTGCCTGTGCTTGAGAATGAGATATCT
ACGAAAGATGCGGAGATTGAAAGAGCGTCTAAGCGGATTCTGTTCTTGGAGGCGGAGAATGAGCGGTTGAGAGTTCAAGTGGAGGAAGCTAAACAGAGTGTTGAGGAGGA
GAGGAGAGAAAGTCAAGAGAGAATAAAGGCAATGGAAGGTGAAGTCGCTGAGCTGAAGAAAATGGCGTTGGATCGAAGCAGAATGGAGCTTATTTTGGAGAACGATGAGC
TTTCGGCGTCACAGAGGTTTCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACTAAATGTTCGGATGCTGTTGTTAACCAAGAC
AATCATAAGGTCGAACATCCGGAGGCAAAGAAAGAAGAAGTTGAAACCGAGAGGCCGAGACACTCGCGTTGTAACTCGGAAGAACTTGCTGAGTCCACCCTCTCCAACAT
AAAATCGAGAATACCTAGGGTTCCAAAACCTCCTCCGAAACCTTCTTCATCTTCCTCTTCTTCTGCCACTACCTCCACCTCTTCCTCATCAACTGGCTCTTCTGCTGACA
TAGAGAAAGCGATCCCAGCCCCACCCCCTGTTCCAACCAAGGCAATGCCGCCTCCTCCGCCGCCACCATCGAAATCCGCACCCCCTCCCCCTCCACCACCTCCCAAGGGT
AAGAGGCTCATGCCAGCGAAGGTACGGCGAATACCGGAGGTTGTTGAGTTCTATCATTCGTTAATGCGGAGAGATTCCCGGCGAGATTCCGGCTCCGGCGTTACGGAGCC
GCCATCGACCGCCAATGCTCGGGACATGATCGGAGAGATTGAGAACCGGTCGGCTCACTTACTCGCTATAAAGACAGATGTAGAAACTCAGGGAGATTTCATAAGATTCT
TGATAAAAGAAGTCGAAAATGCTTCATTTACTGACATTGAAGACGTTGTACCATTTGTCAAATGGTTGGATGATGAGCTCTCATTTCTGGTAGATGAAAGAGCCGTGCTT
AAACACTTCCAGTGGCCGGAGCAAAAGGCTGATGCTCTGCGTGAGGCTGCATTTGGCTATTGCGATCTGAAGAAGCTGGAATCCGAAGCCTCATCGTTTCGTGGTGATGC
CCGCCAGCCCTGCGGATCGGCTCTCAAGAAGATGCAAGCTTTGCTTGAAAAGTTGGAGCATGGCGTATACAATTTGTCTAGAATGCGTGAATCTGCCGCTAAGAGATATA
AAGCATTTCAAATTCCAGTGGAATGGATGCTTGATGGTGGAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCAGAA
CTTGAAACAGTCGGTGGTGGACCTGAAGAAGAAGAGCTGATTGTTCAAGGCGTTAGATTTGCCTTCCGTGTGCATCAGTTTGCTGGAGGGTTTGATGTAGAAACAATGAG
GGCATTTCAAGAGCTGAGGGACAAGGCAAGTTCATGTCATGTACAATGCCAAAACCAGCAACAACATAAGTACGTGTGGAGTAGCAGGCCTACAACTTGTTAA
Protein sequenceShow/hide protein sequence
MVAGKVKVAMGLQKSPASRKVESSPKTSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEIS
TKDAEIERASKRILFLEAENERLRVQVEEAKQSVEEERRESQERIKAMEGEVAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQD
NHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSTSSSSTGSSADIEKAIPAPPPVPTKAMPPPPPPPSKSAPPPPPPPPKG
KRLMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVL
KHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESAAKRYKAFQIPVEWMLDGGIVSQIKLVSVKLAMKYMKRVSAE
LETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQQHKYVWSSRPTTC