CuGenDBv2

Lsi01G002510 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G002510
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: protein CHUP1, chloroplastic
Genome location: chr01:2260527..2272260
RNA-Seq Expression: Lsi01G002510
Synteny: Lsi01G002510
Gene Ontology terms:
  GO:0009658 - chloroplast organization (biological process)
  GO:0009707 - chloroplast outer membrane (cellular component)
  GO:0005525 - GTP binding (molecular function)
InterPro domains:
  IPR011719 - Conserved hypothetical protein CHP02058
  IPR037103 - Tubulin/FtsZ-like, C-terminal domain
  IPR040265 - Protein CHUP1-like


Homology
GenBank top hits (e value, % identity, alignment)
KAA0052630.1 protein CHUP1 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 96.5
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRV+VEEVKQSVEEERRESQER+KAMEGEI+ELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP
        NLIRNLKRATKCSDAVVNQDNHKVEHPE KKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATT  SSSSSTGSS+D+EKAIP
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP

Query:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        APPPVPTK MPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTD PSTANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKFRSYPLSNHLSSSAHLICSLTRPRISRLFKVTAASSMEVEQGGKSAPVGSPVPPMKLLFVEMGVGYDQHGQDITAAAMRACRDAISSNS
        HVQCQN QQHKFRSYP SNHLSS  HLICSLTRPRISRL KVTA SSMEVEQGG+SAPV S VPPMKLLFVEMGVGYDQHGQDITAAAMRACRDAISSNS
Subjt:  HVQCQN-QQHKFRSYPLSNHLSSSAHLICSLTRPRISRLFKVTAASSMEVEQGGKSAPVGSPVPPMKLLFVEMGVGYDQHGQDITAAAMRACRDAISSNS

Query:  IPAFRRGSIPGVTFGEMKLQIKLGVPQSLQQSLDIEKVKSVFP
        IPAFRRGSIPGV+FGEMKLQIKLGVP SLQQSLDIEKVKSVFP
Subjt:  IPAFRRGSIPGVTFGEMKLQIKLGVPQSLQQSLDIEKVKSVFP
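
Each alignment block on this page is a triplet: the query row, an unlabeled middle row, and the subject row. The sketch below assumes the standard BLAST pairwise convention for the middle row (a letter marks an identical residue, "+" a conservative substitution, a space a mismatch or gap) and tallies those symbols for the final block of the alignment above. The function name `midline_stats` and the two constants are illustrative and are not part of CuGenDB. The per-block figure need not equal the hit's reported % identity, which is computed over the full alignment.

```python
# Final block of the KAA0052630.1 alignment, copied from the record above.
QUERY = "IPAFRRGSIPGVTFGEMKLQIKLGVPQSLQQSLDIEKVKSVFP"
MIDLINE = "IPAFRRGSIPGV+FGEMKLQIKLGVP SLQQSLDIEKVKSVFP"


def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives in one BLAST-style alignment block."""
    # Text dumps often strip trailing spaces (mismatches) from the middle row,
    # so pad it back to the width of the query row before counting.
    midline = midline.ljust(len(query))
    aligned = len(query)                      # alignment columns, gaps included
    identical = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {
        "aligned": aligned,
        "identical": identical,
        "positives": identical + conservative,
        "percent_identity": 100.0 * identical / aligned if aligned else 0.0,
    }


if __name__ == "__main__":
    stats = midline_stats(QUERY, MIDLINE)
    print(stats["identical"], "of", stats["aligned"], "columns identical")
```

To cover a whole hit rather than one block, the same tallies could be summed over every block of that hit before dividing.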

KAG6594554.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 88.99
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPKPSTPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRVEVEEVKQSVEE+RRES+ER+KAMEGEIAELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP
        NLIR+LKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR NSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSA TSSSSS+STGSS D EK IP
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP

Query:  APPPVPTK-SMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQG
        APPPVPTK + PPPPPPPSKSAPPPPPPPPKGKRP PAKVRRIPEVVEFYHSLMRRDSRR+ GS VT+ PS+ANARDMIGEIENRSAHLLAIKTDVETQG
Subjt:  APPPVPTK-SMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQG

Query:  DFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHG
        DFIRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPC SALKKMQALLEKLEHG
Subjt:  DFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHG

Query:  VYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        VYNLSRMRESATKRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Subjt:  VYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Query:  CHVQCQNQQHKFRSYPL--------------------SNHLSSSAHLICSLTRPRISRLFKVTAASSMEVEQGGKSAPVGSPVPPMKLLFVEMGVGYDQH
        CHVQCQNQQHK+ SY                      +NH SSSAHLIC    PR+ RL K  AA SMEVEQG +SAPV S   PMKLLFVEMGVGYDQH
Subjt:  CHVQCQNQQHKFRSYPL--------------------SNHLSSSAHLICSLTRPRISRLFKVTAASSMEVEQGGKSAPVGSPVPPMKLLFVEMGVGYDQH

Query:  GQDITAAAMRACRDAISSNSIPAFRRGSIPGVTFGEMKLQIKLGVPQSLQQSLDIEKVKSVFP
        GQDITAAAMRACRDAI SNSIPAFRRG+IPGV+FGEMKLQIKLGVPQSLQQSLD+EKVKSVFP
Subjt:  GQDITAAAMRACRDAISSNSIPAFRRGSIPGVTFGEMKLQIKLGVPQSLQQSLDIEKVKSVFP

KAG7026530.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 87.55
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPKPSTPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRVEVEEVKQSVEE+RRES+ER+KAMEGEIAELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP
        NLIRNLKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR NSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSA TSSSSS+STGSS D EK IP
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP

Query:  APPPVPTK-SMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQG
        APPPVPTK + PPPPPPPSKSAPPPPPPPPKGKRP PAKVRRIPEVVEFYHSLMRRDSRR+ GS VT+ PS+ANARDMIGEIENRSAHLLAIKTDVETQG
Subjt:  APPPVPTK-SMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQG

Query:  DFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHG
        DFIRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPC SALKKMQALLEKLEHG
Subjt:  DFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHG

Query:  VYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        VYNLSRMRESATKRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Subjt:  VYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Query:  CHVQCQNQQHK-----------------------------------FRSYPLS-NHLSSSAHLICSLTRPRISRLFKVTAASSMEVEQGGKSAPVGSPVP
        CHVQCQNQQHK                                   F SY  S NH SSSAHLIC    PR+ RL K  AA SMEVEQG +SAPV S   
Subjt:  CHVQCQNQQHK-----------------------------------FRSYPLS-NHLSSSAHLICSLTRPRISRLFKVTAASSMEVEQGGKSAPVGSPVP

Query:  PMKLLFVEMGVGYDQHGQDITAAAMRACRDAISSNSIPAFRRGSIPGVTFGEMKLQIKLGVPQSLQQSLDIEKVKSVFP
        PMKLLFVEMGVGYDQHGQDITAAAMRACRDAI SNSIPAFRRG+IPGV+FGEMKLQIKLGVPQSLQQSLD+EKVKSVFP
Subjt:  PMKLLFVEMGVGYDQHGQDITAAAMRACRDAISSNSIPAFRRGSIPGVTFGEMKLQIKLGVPQSLQQSLDIEKVKSVFP

XP_004134665.1 protein CHUP1, chloroplastic [Cucumis sativus] | e value: 0.0e+00 | % identity: 97.06
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRV+VEE KQSVEEERRESQERIKAMEGE+AELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP
        NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATT S+SSSSTGSS+D+EKAIP
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP

Query:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        APPPVPTK+MPPPPPPPSKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVT+ PSTANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLD GI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKF
        HVQCQN QQHK+
Subjt:  HVQCQN-QQHKF

XP_008439756.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo] | e value: 0.0e+00 | % identity: 96.9
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRV+VEEVKQSVEEERRESQERIKAMEGEI+ELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP
        NLIRNLKRATKCSDAVVNQDNHKVEHPE KKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVP+PPPKPSSSSSSSATT   SSSSTGSS+D+EKAIP
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP

Query:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        APPPVPTK MPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTD PSTANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIR LIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKF
        HVQCQN QQHK+
Subjt:  HVQCQN-QQHKF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KHU8 Uncharacterized protein | e value: 0.0e+00 | % identity: 97.06
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRV+VEE KQSVEEERRESQERIKAMEGE+AELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP
        NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATT S+SSSSTGSS+D+EKAIP
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP

Query:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        APPPVPTK+MPPPPPPPSKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVT+ PSTANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLD GI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKF
        HVQCQN QQHK+
Subjt:  HVQCQN-QQHKF

A0A1S3AZH3 protein CHUP1, chloroplastic | e value: 0.0e+00 | % identity: 96.9
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRV+VEEVKQSVEEERRESQERIKAMEGEI+ELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP
        NLIRNLKRATKCSDAVVNQDNHKVEHPE KKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVP+PPPKPSSSSSSSATT   SSSSTGSS+D+EKAIP
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP

Query:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        APPPVPTK MPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTD PSTANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIR LIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKF
        HVQCQN QQHK+
Subjt:  HVQCQN-QQHKF

A0A5D3CMM2 Protein CHUP1 | e value: 0.0e+00 | % identity: 96.5
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS
        IVPVLENEISTKDAEIERASKRILFLEAENERLRV+VEEVKQSVEEERRESQER+KAMEGEI+ELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP
        NLIRNLKRATKCSDAVVNQDNHKVEHPE KKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATT  SSSSSTGSS+D+EKAIP
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP

Query:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        APPPVPTK MPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTD PSTANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  APPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKFRSYPLSNHLSSSAHLICSLTRPRISRLFKVTAASSMEVEQGGKSAPVGSPVPPMKLLFVEMGVGYDQHGQDITAAAMRACRDAISSNS
        HVQCQN QQHKFRSYP SNHLSS  HLICSLTRPRISRL KVTA SSMEVEQGG+SAPV S VPPMKLLFVEMGVGYDQHGQDITAAAMRACRDAISSNS
Subjt:  HVQCQN-QQHKFRSYPLSNHLSSSAHLICSLTRPRISRLFKVTAASSMEVEQGGKSAPVGSPVPPMKLLFVEMGVGYDQHGQDITAAAMRACRDAISSNS

Query:  IPAFRRGSIPGVTFGEMKLQIKLGVPQSLQQSLDIEKVKSVFP
        IPAFRRGSIPGV+FGEMKLQIKLGVP SLQQSLDIEKVKSVFP
Subjt:  IPAFRRGSIPGVTFGEMKLQIKLGVPQSLQQSLDIEKVKSVFP

A0A6J1EFK1 protein CHUP1, chloroplastic-like | e value: 7.2e-301 | % identity: 93.79
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPKPSTPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRVEVEEVKQSVEE+RRESQER+KAMEGEIAELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP
        NLIR+LKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSSA TSSSSS+STGSS D EK IP
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP

Query:  APPPVPTK-SMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQG
        APPPVPTK + PPPPPPPSKSAPPPPPPPPKGKRP PAKVRRIPEVVEFYHSLMRRDSRR+ GSGVT+ PS+ANARDMIGEIENRS HLLAIKTDVETQG
Subjt:  APPPVPTK-SMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQG

Query:  DFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHG
        DFIRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHG
Subjt:  DFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHG

Query:  VYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        VYNLSRMRESATKRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Subjt:  VYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Query:  CHVQCQNQQHKF
        CHVQCQNQQHK+
Subjt:  CHVQCQNQQHKF

A0A6J1KWU6 protein CHUP1, chloroplastic-like | e value: 6.1e-300 | % identity: 93.79
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPA RKVESSPKPSTPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRVEVEEVKQSVEE+RRESQER+KAMEGEIAELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP
        NLIR+LKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR NSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSA TSSSSS+STGSS D EK IP
Subjt:  NLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIP

Query:  APPPVPTK-SMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQG
        APPPVPTK + PPPPPPPSKSAPPPPPPPPKGKRP  AKVRRIPEVVEFYHSLMRRDSRR+ GSGVT+ PS+ANARDMIGEIENRSAHLLAIKTDVETQG
Subjt:  APPPVPTK-SMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQG

Query:  DFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHG
        DFIRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHG
Subjt:  DFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHG

Query:  VYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        VYNLSRMRESATKRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Subjt:  VYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Query:  CHVQCQNQQHKF
        CHVQCQNQQHK+
Subjt:  CHVQCQNQQHKF

SwissProt top hits (e value, % identity, alignment)
Q9LI74 Protein CHUP1, chloroplastic | e value: 2.1e-84 | % identity: 47.47
Query:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +Q N   E  E K  E           N+  + +  L +I+ R PRV
Subjt:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIPAPPPVPTKSMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRIPEVVEFYHSLMR
        P+PPP+ +    S+   S                  A PP+P    PPPPPPP    P      PPPPPPP G     A    KV R PE+VEFY SLM+
Subjt:  PKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIPAPPPVPTKSMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRIPEVVEFYHSLMR

Query:  RDSRRD-------SGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWP
        R+S+++       SG+G     S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELSFLVDERAVLKHF WP
Subjt:  RDSRRD-------SGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWP

Query:  EQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKR
        E KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G++ +IKL SV+LA KYMKR
Subjt:  EQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKR

Query:  VSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        V+ EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  VSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Q9LI74 Protein CHUP1, chloroplastic | e value: 2.1e+01 | % identity: 24.61
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L+ E+ +    V +E   ++ +IK ++ +I    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELK

Query:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI
        ++  ++ + +L+L    +S+ Q     M+     N    ++R  K   AV + +   +E     +E    +R    + +S E   +TLSN+
Subjt:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI

Arabidopsis top hits (e value, % identity, alignment)
AT1G48280.1 hydroxyproline-rich glycoprotein family protein | e value: 3.2e-59 | % identity: 38.67
Query:  PVLENEISTKDAEIERASKRILFLEA---ENERLRVEVEEVKQSVEEER-RESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSG
        P +  +       I R S+  +   A   + +R R+E  E K  V E   ++ Q ++  ++ E+ E +        +EL L N +LS     Q L+    
Subjt:  PVLENEISTKDAEIERASKRILFLEA---ENERLRVEVEEVKQSVEEER-RESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSG

Query:  K-SNLIRNLKRATKCSDA----VVNQDNHKVEHPEAKKE-EVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSS
        K S+L  N K A +  ++    +      K+E P+ KKE  VE+ R             S  S   SR+P  P P PK   S +SS      +SS     
Subjt:  K-SNLIRNLKRATKCSDA----VVNQDNHKVEHPEAKKE-EVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSS

Query:  SDVEKAIPAPPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANA--RDMIGEIENRSAHLLA
                          PP PPPP    PPPPPP P  K    A+ ++ P V + +  L ++D+ R+    V    S  N+    ++GEI+NRSAHL+A
Subjt:  SDVEKAIPAPPPVPTKSMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANA--RDMIGEIENRSAHLLA

Query:  IKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQ
        IK D+ET+G+FI  LI++V    F+D+EDV+ FV WLD EL+ L DERAVLKHF+WPE+KAD L+EAA  Y +LKKLE E SS+  D     G ALKKM 
Subjt:  IKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQ

Query:  ALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEE---EELIVQGVRFAFRVHQFAGGFDVETM
         LL+K E  +  L R+R S+ + Y+ F+IPVEWMLDSG++ +IK  S+KLA  YM RV+ EL++      E   E L++QGVRFA+R HQFAGG D ET+
Subjt:  ALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEE---EELIVQGVRFAFRVHQFAGGFDVETM

Query:  RAFQELRDKASS
         A +E++ +  S
Subjt:  RAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein | e value: 1.5e-85 | % identity: 47.47
Query:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +Q N   E  E K  E           N+  + +  L +I+ R PRV
Subjt:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIPAPPPVPTKSMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRIPEVVEFYHSLMR
        P+PPP+ +    S+   S                  A PP+P    PPPPPPP    P      PPPPPPP G     A    KV R PE+VEFY SLM+
Subjt:  PKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIPAPPPVPTKSMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRIPEVVEFYHSLMR

Query:  RDSRRD-------SGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWP
        R+S+++       SG+G     S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELSFLVDERAVLKHF WP
Subjt:  RDSRRD-------SGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWP

Query:  EQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKR
        E KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G++ +IKL SV+LA KYMKR
Subjt:  EQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKR

Query:  VSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        V+ EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  VSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein | e value: 1.5e+00 | % identity: 24.61
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L+ E+ +    V +E   ++ +IK ++ +I    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELK

Query:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI
        ++  ++ + +L+L    +S+ Q     M+     N    ++R  K   AV + +   +E     +E    +R    + +S E   +TLSN+
Subjt:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein | e value: 1.5e-85 | % identity: 47.47
Query:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +Q N   E  E K  E           N+  + +  L +I+ R PRV
Subjt:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIPAPPPVPTKSMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRIPEVVEFYHSLMR
        P+PPP+ +    S+   S                  A PP+P    PPPPPPP    P      PPPPPPP G     A    KV R PE+VEFY SLM+
Subjt:  PKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIPAPPPVPTKSMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRIPEVVEFYHSLMR

Query:  RDSRRD-------SGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWP
        R+S+++       SG+G     S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELSFLVDERAVLKHF WP
Subjt:  RDSRRD-------SGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWP

Query:  EQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKR
        E KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G++ +IKL SV+LA KYMKR
Subjt:  EQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKR

Query:  VSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        V+ EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  VSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein | e value: 1.5e+00 | % identity: 24.61
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L+ E+ +    V +E   ++ +IK ++ +I    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELK

Query:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI
        ++  ++ + +L+L    +S+ Q     M+     N    ++R  K   AV + +   +E     +E    +R    + +S E   +TLSN+
Subjt:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNI

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein | e value: 1.5e-85 | % identity: 47.47
Query:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +Q N   E  E K  E           N+  + +  L +I+ R PRV
Subjt:  KMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNQDNHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIPAPPPVPTKSMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRIPEVVEFYHSLMR
        P+PPP+ +    S+   S                  A PP+P    PPPPPPP    P      PPPPPPP G     A    KV R PE+VEFY SLM+
Subjt:  PKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIPAPPPVPTKSMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRIPEVVEFYHSLMR

Query:  RDSRRD-------SGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWP
        R+S+++       SG+G     S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELSFLVDERAVLKHF WP
Subjt:  RDSRRD-------SGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWP

Query:  EQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKR
        E KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G++ +IKL SV+LA KYMKR
Subjt:  EQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKR

Query:  VSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        V+ EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  VSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 2.1e-172 | % identity: 59.13
Query:  MVAGKVKVAMGLQKSPASRKVESSPK------PSTPAQPSPSSGKVS-------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRMVEELRDR
        MVAGKV+V MG  KSP+++K +  P       P  P    PSSG  +        K  F+RSFGVYFPR+SAQV            V+EL R VEELR+R
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPK------PSTPAQPSPSSGKVS-------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRMVEELRDR

Query:  EARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILEN
        EA LKT+ LE KLL+ESV+++P+LE++I+ K+ EI+   K    L  +NERLR E +      EE RRE + R K ME EI EL+K+           ++
Subjt:  EARLKTDLLEHKLLKESVAIVPVLENEISTKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILEN

Query:  DELSASQRFQGLMEVSGKSNLIRNLKRA---TKCSDAVVNQDNHKVEHPEA--------KKEEVETERPRHSR-CNSEELAE-STLSNIKSRIPRVPKPP
          LS SQRFQGLM+VS KSNLIR+LKR        + + NQ+N       +        +K+E+E+    +SR  NSEEL E S+LS ++SR+PRVPKPP
Subjt:  DELSASQRFQGLMEVSGKSNLIRNLKRA---TKCSDAVVNQDNHKVEHPEA--------KKEEVETERPRHSR-CNSEELAE-STLSNIKSRIPRVPKPP

Query:  PKPSSSSSSSATTSSSSSSSTGSSSDVEKAIPAPPPVPTKSM---PPPPPPPSKS-APPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRD---SRRDS
        PK S         S   S+   +    +K+IP PPP P   +   PPPPP  SK+  PPPPPPPPK      AKVRR+PEVVEFYHSLMRRD   SRRDS
Subjt:  PKPSSSSSSSATTSSSSSSSTGSSSDVEKAIPAPPPVPTKSM---PPPPPPPSKS-APPPPPPPPKGKRPMPAKVRRIPEVVEFYHSLMRRD---SRRDS

Query:  GSG----VTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREA
          G       + + +NARDMIGEIENRS +LLAIKTDVETQGDFIRFLIKEV NA+F+DIEDVVPFVKWLDDELS+LVDERAVLKHF+WPEQKADALREA
Subjt:  GSG----VTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAVLKHFQWPEQKADALREA

Query:  AFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETV-G
        AF Y DLKKL SEAS FR D RQ   SALKKMQAL EKLEHGVY+LSRMRESA  ++K+FQIPV+WML++GI SQIKL SVKLAMKYMKRVSAELE + G
Subjt:  AFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETV-G

Query:  GGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQHK----FRSYP
        GGPEEEELIVQGVRFAFRVHQFAGGFD ETM+AF+ELRDKA SCHVQCQ+Q H+    FRS P
Subjt:  GGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQHK----FRSYP


Sequences
CDS sequence
ATGGTAGCTGGAAAGGTGAAAGTCGCAATGGGGCTGCAGAAGTCTCCGGCGAGTAGAAAGGTCGAGAGCTCACCGAAGCCATCGACGCCGGCGCAGCCTTCTCCGAGCTC
CGGTAAGGTTTCTCAGAAAACAGTCTTCTCCCGCTCGTTTGGTGTCTATTTCCCTCGCTCTTCTGCTCAGGTTCAGCCTCGACCGCCTGACGTCACGGAGCTCCTTCGTA
TGGTTGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTGTTGAAGGAATCTGTCGCCATTGTTCCTGTGCTTGAGAACGAGATCTCT
ACGAAAGATGCGGAGATTGAAAGAGCGTCTAAGCGGATACTGTTCTTGGAGGCGGAGAATGAGCGATTGAGAGTTGAAGTGGAGGAAGTTAAACAGAGTGTTGAGGAGGA
GAGGAGAGAGAGTCAAGAGAGAATAAAGGCAATGGAAGGTGAAATCGCGGAGCTGAAGAAAATGGCATTGGATCGAGGCAGAATGGAGCTTATTTTGGAGAACGACGAGC
TTTCGGCGTCGCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACCAAATGTTCGGATGCTGTTGTTAACCAAGAC
AATCATAAGGTTGAACATCCAGAGGCAAAGAAAGAAGAAGTTGAAACCGAGAGACCGAGACACTCGCGGTGTAACTCGGAAGAACTCGCCGAGTCCACTCTCTCTAACAT
AAAATCGCGAATACCTAGGGTTCCAAAACCTCCTCCGAAACCTTCTTCATCTTCCTCTTCTTCTGCCACTACTTCCTCCTCCTCCTCCTCATCAACTGGCTCTTCTAGTG
ACGTAGAGAAAGCGATCCCAGCCCCACCCCCTGTCCCAACCAAGTCAATGCCGCCTCCTCCTCCGCCACCTTCGAAATCCGCACCTCCTCCCCCTCCACCGCCACCCAAG
GGTAAGAGGCCGATGCCGGCGAAGGTGCGGCGAATACCGGAGGTTGTTGAGTTCTATCATTCGTTAATGCGGAGGGATTCCCGGCGAGATTCCGGCTCCGGCGTTACGGA
CCTGCCGTCGACCGCCAATGCTCGCGACATGATCGGAGAGATTGAGAACCGGTCCGCTCACTTACTCGCTATAAAGACAGATGTAGAGACACAGGGGGATTTCATAAGGT
TCTTGATAAAAGAAGTCGAAAATGCTTCATTTACTGACATTGAGGACGTTGTGCCGTTTGTCAAATGGTTGGATGATGAACTCTCATTTCTGGTAGATGAAAGAGCCGTG
CTTAAACACTTCCAGTGGCCCGAGCAAAAGGCCGACGCTCTGCGTGAGGCTGCATTTGGCTATTGCGACTTAAAGAAGCTGGAATCCGAAGCCTCATCGTTTCGTGGTGA
TGCCCGCCAGCCCTGCGGATCGGCTCTCAAGAAGATGCAAGCTTTGCTTGAAAAGTTGGAGCATGGCGTATACAATTTGTCTAGAATGCGTGAATCTGCCACTAAGAGAT
ACAAAGCATTTCAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTATGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCA
GAGCTTGAAACAGTCGGTGGTGGACCTGAAGAAGAAGAGCTGATTGTTCAAGGCGTTAGATTTGCCTTCCGTGTGCATCAGTTCGCAGGAGGGTTTGATGTAGAAACAAT
GAGGGCATTTCAAGAGCTGAGAGATAAGGCAAGTTCATGTCATGTACAATGCCAAAACCAGCAACACAAGTTCCGCTCTTACCCACTTTCTAATCATCTTTCTTCATCTG
CACATTTGATTTGCTCTCTTACACGCCCTCGAATTTCTCGCCTTTTCAAAGTTACTGCTGCATCGTCCATGGAGGTCGAGCAAGGTGGAAAATCTGCACCTGTCGGTAGC
CCAGTTCCGCCCATGAAGCTCTTATTCGTCGAGATGGGAGTTGGCTACGATCAACATGGCCAAGATATCACGGCGGCTGCAATGCGAGCCTGCAGGGATGCCATCTCTTC
CAATTCGATTCCAGCATTCCGTAGAGGTTCCATTCCTGGAGTCACATTTGGAGAGATGAAACTACAGATCAAACTTGGAGTTCCACAGTCTCTTCAACAATCCTTGGATA
TTGAAAAAGTCAAGTCCGTCTTCCCATAG
mRNA sequence
CTGAAATTTAATTTTAATTTTCACAAACTCCAACTTCCTTTATCTCTCTTTCTCGCTTTCTCTCTCAGATGAGACGAAGACAAGGACGTTTCGTTCACACTCTAAAGTAG
ATTGAAAATTGGAGGACTGTAAAACTCAGTCCCGACCTTACCTCCAAGCACAGAGGGAGCGACGCGAGAGAGACACACACACAGTGAGAGTTTAGACATGGTAGCTGGAA
AGGTGAAAGTCGCAATGGGGCTGCAGAAGTCTCCGGCGAGTAGAAAGGTCGAGAGCTCACCGAAGCCATCGACGCCGGCGCAGCCTTCTCCGAGCTCCGGTAAGGTTTCT
CAGAAAACAGTCTTCTCCCGCTCGTTTGGTGTCTATTTCCCTCGCTCTTCTGCTCAGGTTCAGCCTCGACCGCCTGACGTCACGGAGCTCCTTCGTATGGTTGAGGAGTT
GCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTGTTGAAGGAATCTGTCGCCATTGTTCCTGTGCTTGAGAACGAGATCTCTACGAAAGATGCGG
AGATTGAAAGAGCGTCTAAGCGGATACTGTTCTTGGAGGCGGAGAATGAGCGATTGAGAGTTGAAGTGGAGGAAGTTAAACAGAGTGTTGAGGAGGAGAGGAGAGAGAGT
CAAGAGAGAATAAAGGCAATGGAAGGTGAAATCGCGGAGCTGAAGAAAATGGCATTGGATCGAGGCAGAATGGAGCTTATTTTGGAGAACGACGAGCTTTCGGCGTCGCA
GAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACCAAATGTTCGGATGCTGTTGTTAACCAAGACAATCATAAGGTTG
AACATCCAGAGGCAAAGAAAGAAGAAGTTGAAACCGAGAGACCGAGACACTCGCGGTGTAACTCGGAAGAACTCGCCGAGTCCACTCTCTCTAACATAAAATCGCGAATA
CCTAGGGTTCCAAAACCTCCTCCGAAACCTTCTTCATCTTCCTCTTCTTCTGCCACTACTTCCTCCTCCTCCTCCTCATCAACTGGCTCTTCTAGTGACGTAGAGAAAGC
GATCCCAGCCCCACCCCCTGTCCCAACCAAGTCAATGCCGCCTCCTCCTCCGCCACCTTCGAAATCCGCACCTCCTCCCCCTCCACCGCCACCCAAGGGTAAGAGGCCGA
TGCCGGCGAAGGTGCGGCGAATACCGGAGGTTGTTGAGTTCTATCATTCGTTAATGCGGAGGGATTCCCGGCGAGATTCCGGCTCCGGCGTTACGGACCTGCCGTCGACC
GCCAATGCTCGCGACATGATCGGAGAGATTGAGAACCGGTCCGCTCACTTACTCGCTATAAAGACAGATGTAGAGACACAGGGGGATTTCATAAGGTTCTTGATAAAAGA
AGTCGAAAATGCTTCATTTACTGACATTGAGGACGTTGTGCCGTTTGTCAAATGGTTGGATGATGAACTCTCATTTCTGGTAGATGAAAGAGCCGTGCTTAAACACTTCC
AGTGGCCCGAGCAAAAGGCCGACGCTCTGCGTGAGGCTGCATTTGGCTATTGCGACTTAAAGAAGCTGGAATCCGAAGCCTCATCGTTTCGTGGTGATGCCCGCCAGCCC
TGCGGATCGGCTCTCAAGAAGATGCAAGCTTTGCTTGAAAAGTTGGAGCATGGCGTATACAATTTGTCTAGAATGCGTGAATCTGCCACTAAGAGATACAAAGCATTTCA
AATTCCAGTGGAATGGATGCTTGATAGTGGAATTATGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCAGAGCTTGAAACAG
TCGGTGGTGGACCTGAAGAAGAAGAGCTGATTGTTCAAGGCGTTAGATTTGCCTTCCGTGTGCATCAGTTCGCAGGAGGGTTTGATGTAGAAACAATGAGGGCATTTCAA
GAGCTGAGAGATAAGGCAAGTTCATGTCATGTACAATGCCAAAACCAGCAACACAAGTTCCGCTCTTACCCACTTTCTAATCATCTTTCTTCATCTGCACATTTGATTTG
CTCTCTTACACGCCCTCGAATTTCTCGCCTTTTCAAAGTTACTGCTGCATCGTCCATGGAGGTCGAGCAAGGTGGAAAATCTGCACCTGTCGGTAGCCCAGTTCCGCCCA
TGAAGCTCTTATTCGTCGAGATGGGAGTTGGCTACGATCAACATGGCCAAGATATCACGGCGGCTGCAATGCGAGCCTGCAGGGATGCCATCTCTTCCAATTCGATTCCA
GCATTCCGTAGAGGTTCCATTCCTGGAGTCACATTTGGAGAGATGAAACTACAGATCAAACTTGGAGTTCCACAGTCTCTTCAACAATCCTTGGATATTGAAAAAGTCAA
GTCCGTCTTCCCATAGATACCAAGCAGTCAAGCCCAGCATAACCCAAAAAACAGCTTTTGAAAAAGTGAATAAAGAACAGAGAATAAAGCAAGCAGACCATTAGTATACT
GTGAAGTGGGAGACTTTGAATCAGGAAATTTGAGGAAAAGAAAAAGGGAAAAGAACTCGCCTGGAAAATCACAGTAGAGAGTGACTCCCTCCTGACTATGGAGGTTTTGG
CTTTTGGCGCCGACGCTTGAATCATACATGTTAAGGACTGCCACATGTTAAGACTCAGCGAGTCGCCTACAAACATTATCTTCTTCCCTCTCCACCTCCTCAGAAGCTCC
AACCCGTCAAACCTTCAAAGAAATGAACACATTTTTCTGTGTGAGAGAGGGCTGCTTTGGTTTTTCTCCCACATTCTCAGAGAAAAGAGCTTGACAAATGACATTTGCTT
AACAATGTCGCATGGAGGAAAGCAGGAATATGTCTAAAATGGAAAACCTTAGAAGGAGACAAAAGAATGGAAGAAAAATGAACGAGAAGTAGTACCTTGGAAGATTACAG
AAGTCAGGTTTCCAAGTGTACTTGAGGTAGGATCGATCTGGTCTGCCATATTTTTGGCAGTTAAACTCAGGGTCTATAAAAGGACAGCTTGAAGATTCATAAAGAGGTAA
AGAAGGATCAAAAACCCATTTGCCTTCAAACAAATTGCACGTCCTCTCTTGCTTCCTACTTCCCACATTGCTTATGTTGTAGAAATCTTCAGCTTTTGCAGTTTCTAGTA
AAAAGAGAAACAGAACTTGCAAAAGCAAGAGGGACAGAGCTCTGAATCGAAAACCCATCTCAGATTTTGGGCTCTGACTGAAAATAAAGGAGGACTTTGGGGAATAGAAG
TAGACAGAGGCAAGTGGGTCTATATAGAGACAAACAAGAGAGAGAGTGAGAAAGAACCTTCCCTGATTATTATCATTATTATTACTGTTATTTTAATCCTTTAAAATTTC
ATGCCATGAGCTGGGAATTGAATTATCTCGGCTGGATTGATTAAGGGAAATACAAGAGATTGTGG
Protein sequence
MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPVLENEIS
TKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEERRESQERIKAMEGEIAELKKMALDRGRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNQD
NHKVEHPEAKKEEVETERPRHSRCNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATTSSSSSSSTGSSSDVEKAIPAPPPVPTKSMPPPPPPPSKSAPPPPPPPPK
GKRPMPAKVRRIPEVVEFYHSLMRRDSRRDSGSGVTDLPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSFLVDERAV
LKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSA
ELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQHKFRSYPLSNHLSSSAHLICSLTRPRISRLFKVTAASSMEVEQGGKSAPVGS
PVPPMKLLFVEMGVGYDQHGQDITAAAMRACRDAISSNSIPAFRRGSIPGVTFGEMKLQIKLGVPQSLQQSLDIEKVKSVFP
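
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate_cds` is hypothetical, and it assumes the standard nuclear code (NCBI translation table 1) and a CDS that starts in frame at its first base.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate_cds(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in directly. Codons containing
    characters other than A/C/G/T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases and last 15 bases of the CDS listed for Lsi01G002510.
print(translate_cds("ATGGTAGCTGGAAAGGTGAAAGTC"))  # MVAGKVKV
print(translate_cds("TCCGTCTTCCCATAG"))           # SVFP
```

Passing the full CDS text to `translate_cds` and comparing the result with the protein sequence (after joining its wrapped lines) gives a quick consistency check between the two entries.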