; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg011845 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg011845
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein CHUP1, chloroplastic
Genome locationscaffold1:2503000..2507086
RNA-Seq ExpressionSpg011845
SyntenySpg011845
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
GO:0005525 - GTP binding (molecular function)
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052630.1 protein CHUP1 [Cucumis melo var. makuwa]1.4e-30795.05Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPS+ KVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEISTKD EIERASKRILFLEAENERLRV+VEEVK S+EE+RRESQER+KA+EGEI+E+KKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSA-TSSSSSTGSSGDVEKTIPVP
        NLIRNLKRATKCSDAVV+QDNHKVEHPE KKEEVETERPRHSRC+SEELAESTLSNIKSRIPRVPKPPPKPSSSS+SSA TSSSSSTGSS D+EK IP P
Subjt:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSA-TSSSSSTGSSGDVEKTIPVP

Query:  PPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFI
        PPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRR+PEVVEFYHSLMRRDSRR+ GSG+TDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFI
Subjt:  PPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFI

Query:  RFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYN
        RFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGVYN
Subjt:  RFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYN

Query:  LSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHV
        LSRMRESA KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA LETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHV
Subjt:  LSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHV

Query:  QCQSQQ
        QCQ+QQ
Subjt:  QCQSQQ

XP_004134665.1 protein CHUP1, chloroplastic [Cucumis sativus]3.6e-30893.55Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPS+ KVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEISTKD EIERASKRILFLEAENERLRV+VEE K S+EE+RRESQER+KA+EGE+AE+KKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSAT--SSSSSTGSSGDVEKTIPV
        NLIRNLKRATKCSDAVV+QDNHKVEHPEAKKEEVETERPRHSRC+SEELAESTLSNIKSRIPRVPKPPPKPSSSS+SSAT  +SSSSTGSS D+EK IP 
Subjt:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSAT--SSSSSTGSSGDVEKTIPV

Query:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTK MPPPPPPPSKSAPPPPPPPPKGKR MPAKVRR+PEVVEFYHSLMRRDSRR+ GSG+T+PPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRESA KRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSA LETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQSQQP-KYVCSSRPTTC
        VQCQ+QQ  KYV SSRPTTC
Subjt:  VQCQSQQP-KYVCSSRPTTC

XP_008439756.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo]0.0e+0094.34Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPS+ KVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEISTKD EIERASKRILFLEAENERLRV+VEEVK S+EE+RRESQER+KA+EGEI+E+KKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPP
        NLIRNLKRATKCSDAVV+QDNHKVEHPE KKEEVETERPRHSRC+SEELAESTLSNIKSRIPRVP+PPPKPSSSS+SSAT+SSSSTGSS D+EK IP PP
Subjt:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPP

Query:  PVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIR
        PVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRR+PEVVEFYHSLMRRDSRR+ GSG+TDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIR
Subjt:  PVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIR

Query:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNL
         LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGVYNL
Subjt:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNL

Query:  SRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQ
        SRMRESA KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA LETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQ
Subjt:  SRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQ

Query:  CQSQQP-KYVCSSRPTTC
        CQ+QQ  KYV SSRPTTC
Subjt:  CQSQQP-KYVCSSRPTTC

XP_022926872.1 protein CHUP1, chloroplastic-like [Cucurbita moschata]4.5e-30392.75Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPKPSTPAQPSPS+ K+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEI+TKD EIERASKRILFLEAENERLRVEVEEVK S+EEQRRESQER+KA+EGEIAE+KKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSS--STGSSGDVEKTIPV
        NLIR+LKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR +SEELAESTLSN+KSRIPRVPKPPPKPSSSS+SSATSSSS  STGSSGD EK IP 
Subjt:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSS--STGSSGDVEKTIPV

Query:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP PAKVRR+PEVVEFYHSLMRRDSRRE GSG+T+PPS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA LETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQSQQPKYVCSS-RPTTC
        HVQCQ+QQ KYVCSS RPTTC
Subjt:  HVQCQSQQPKYVCSS-RPTTC

XP_038883847.1 protein CHUP1, chloroplastic [Benincasa hispida]0.0e+0094.17Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPS+ KVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEISTKD EIERASKRILFLEAENERLRVEVEEVK S+EE+RRESQER+KA+E EIAE+KKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPP
        NLIRNLKR TKCS+AVV+QDNHK EHPEAKKEEVETERPRHSRC+SEELAE TLSNIKSRIPRVPKPPPKPSSSS+SSA +SSSSTGSSGD+EK IP PP
Subjt:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPP

Query:  PVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIR
        PVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMP KVRR+PEVVEFYHSLMRRDSRR+ GS +TDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIR
Subjt:  PVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIR

Query:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNL
        FLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGVYNL
Subjt:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNL

Query:  SRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQ
        SRMRESATKRYKAFQIPVEWMLDSGIV QIKLVSVKLAMKYMKRVSA LETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQ
Subjt:  SRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQ

Query:  CQSQQPKYVCSSRPTTC
        CQ+QQ KYV SSRPTTC
Subjt:  CQSQQPKYVCSSRPTTC

TrEMBL top hitse value%identityAlignment
A0A0A0KHU8 Uncharacterized protein1.7e-30893.55Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPS+ KVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEISTKD EIERASKRILFLEAENERLRV+VEE K S+EE+RRESQER+KA+EGE+AE+KKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSAT--SSSSSTGSSGDVEKTIPV
        NLIRNLKRATKCSDAVV+QDNHKVEHPEAKKEEVETERPRHSRC+SEELAESTLSNIKSRIPRVPKPPPKPSSSS+SSAT  +SSSSTGSS D+EK IP 
Subjt:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSAT--SSSSSTGSSGDVEKTIPV

Query:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTK MPPPPPPPSKSAPPPPPPPPKGKR MPAKVRR+PEVVEFYHSLMRRDSRR+ GSG+T+PPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRESA KRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSA LETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQSQQP-KYVCSSRPTTC
        VQCQ+QQ  KYV SSRPTTC
Subjt:  VQCQSQQP-KYVCSSRPTTC

A0A1S3AZH3 protein CHUP1, chloroplastic0.0e+0094.34Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPS+ KVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEISTKD EIERASKRILFLEAENERLRV+VEEVK S+EE+RRESQER+KA+EGEI+E+KKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPP
        NLIRNLKRATKCSDAVV+QDNHKVEHPE KKEEVETERPRHSRC+SEELAESTLSNIKSRIPRVP+PPPKPSSSS+SSAT+SSSSTGSS D+EK IP PP
Subjt:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPP

Query:  PVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIR
        PVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRR+PEVVEFYHSLMRRDSRR+ GSG+TDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIR
Subjt:  PVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIR

Query:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNL
         LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGVYNL
Subjt:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNL

Query:  SRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQ
        SRMRESA KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA LETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQ
Subjt:  SRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQ

Query:  CQSQQP-KYVCSSRPTTC
        CQ+QQ  KYV SSRPTTC
Subjt:  CQSQQP-KYVCSSRPTTC

A0A5D3CMM2 Protein CHUP16.6e-30895.05Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK STPAQPSPS+ KVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEISTKD EIERASKRILFLEAENERLRV+VEEVK S+EE+RRESQER+KA+EGEI+E+KKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSA-TSSSSSTGSSGDVEKTIPVP
        NLIRNLKRATKCSDAVV+QDNHKVEHPE KKEEVETERPRHSRC+SEELAESTLSNIKSRIPRVPKPPPKPSSSS+SSA TSSSSSTGSS D+EK IP P
Subjt:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSA-TSSSSSTGSSGDVEKTIPVP

Query:  PPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFI
        PPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRR+PEVVEFYHSLMRRDSRR+ GSG+TDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFI
Subjt:  PPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFI

Query:  RFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYN
        RFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGVYN
Subjt:  RFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYN

Query:  LSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHV
        LSRMRESA KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA LETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHV
Subjt:  LSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHV

Query:  QCQSQQ
        QCQ+QQ
Subjt:  QCQSQQ

A0A6J1EFK1 protein CHUP1, chloroplastic-like2.2e-30392.75Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPKPSTPAQPSPS+ K+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEI+TKD EIERASKRILFLEAENERLRVEVEEVK S+EEQRRESQER+KA+EGEIAE+KKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSS--STGSSGDVEKTIPV
        NLIR+LKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR +SEELAESTLSN+KSRIPRVPKPPPKPSSSS+SSATSSSS  STGSSGD EK IP 
Subjt:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSS--STGSSGDVEKTIPV

Query:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP PAKVRR+PEVVEFYHSLMRRDSRRE GSG+T+PPS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA LETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQSQQPKYVCSS-RPTTC
        HVQCQ+QQ KYVCSS RPTTC
Subjt:  HVQCQSQQPKYVCSS-RPTTC

A0A6J1KWU6 protein CHUP1, chloroplastic-like1.9e-30292.75Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPA RKVESSPKPSTPAQPSPS+ K+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEI+TKD EIERASKRILFLEAENERLRVEVEEVK S+EEQRRESQER+KA+EGEIAE+KKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSS--STGSSGDVEKTIPV
        NLIR+LKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR +SEELAESTLSNIKSRIPRVPKPPPKPSSSS+SSATSSSS  STGSSGD EK IP 
Subjt:  NLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSS--STGSSGDVEKTIPV

Query:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP  AKVRR+PEVVEFYHSLMRRDSRRE GSG+T+PPS+ANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA LETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQSQQPKYVCSS-RPTTC
        HVQCQ+QQ KYVCSS RPTTC
Subjt:  HVQCQSQQPKYVCSS-RPTTC

SwissProt top hitse value%identityAlignment
Q1PEB4 Uncharacterized protein At4g049802.0e-0623.72Show/hide
Query:  PKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISTKDVEIERASKRIL
        P P TP    P    +S     S S  ++  R+ A  +  P D+      +   RD E+  +T +   +  +ES         EI  ++ E E     +L
Subjt:  PKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISTKDVEIERASKRIL

Query:  FLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVHQDNHKV
          E   + ++ E    + S  E   E+++  +  E E            R+E      E+ A    +   +   + + I  L+   +  DA  H ++ + 
Subjt:  FLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVHQDNHKV

Query:  EHP--EAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPMPPPPPPPSK------
        EH   E  + E E E   HS   + E  +ST S+ K  +P    PPP  +S  T S T S+ +T SS   +   P P P    P PPPPPP SK      
Subjt:  EHP--EAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPMPPPPPPPSK------

Query:  ----------------SAPPPPPPPPKGK--RPMPAKVRRVPEVVEFYHSL------------MRRDSRREPGSGITDPPSTANA--RDMIGEIENRSAH
                        S P PP PP  G+  +   +K+RR  ++   Y +L             ++ S+ +       P   A +   D + E+  RS++
Subjt:  ----------------SAPPPPPPPPKGK--RPMPAKVRRVPEVVEFYHSL------------MRRDSRREPGSGITDPPSTANA--RDMIGEIENRSAH

Query:  LLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ-WPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSAL
           I+ DV+     I  L   + +    D+++++ F   ++  L  L DE  VL  F+ +PE+K + +R A   Y  L  +  E  +++     P    L
Subjt:  LLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ-WPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSAL

Query:  KKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYM----KRVSAALETVGGGPEEE------ELIVQGVRFAFRVH
         K++    K +  +  + R ++   K +K + I +    D  ++ Q+K   V ++   M    K    A E    G E +      + + +  +FAF+V+
Subjt:  KKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYM----KRVSAALETVGGGPEEE------ELIVQGVRFAFRVH

Query:  QFAGGFD
         FAGG D
Subjt:  QFAGGFD

Q9LI74 Protein CHUP1, chloroplastic9.8e-8347.73Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A   Q N   E  E K  E           ++  + +  L +I+ R PRV
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRREPGSGI
        P+PPP+ +    S+   S+      G        PPP P  P   PPPPP    PPPPPPP    R      KV R PE+VEFY SLM+R+S++E    +
Subjt:  PKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRREPGSGI

Query:  TDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYC
               S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF Y 
Subjt:  TDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYC

Query:  DLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGG---P
        DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+  L++V G    P
Subjt:  DLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGG---P

Query:  EEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
          E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  EEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Q9LI74 Protein CHUP1, chloroplastic4.2e-0123.56Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K VEI+  +  I  L+AE ++L+ E+ +  + + ++   ++ ++K L+ +I    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMK

Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNI
        ++  ++++ +L+L    +S+ Q  +   E   K   +    +A +  +  V +   K    + +K E+        + DS E   +TLSN+
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNI

Arabidopsis top hitse value%identityAlignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein3.7e-6138.79Show/hide
Query:  IERASKRILFLEA---ENERLRVEVEEVKLSIEEQR-RESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGK-SNLIRNLKRAT
        I R S+  +   A   + +R R+E  E KL + E   ++ Q ++  L+ E+ E +      S +EL L N +LS     Q L+    K S+L  N K A 
Subjt:  IERASKRILFLEA---ENERLRVEVEEVKLSIEEQR-RESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGK-SNLIRNLKRAT

Query:  KCSDA----VVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKP
        +  ++    +      K+E P+ KKE                +  S LS       R+P  PP P         S +SS G   +            + P
Subjt:  KCSDA----VVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKP

Query:  MPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANA--RDMIGEIENRSAHLLAIKTDVETQGDFIRFLIK
          PP PPP    PPPPPP P  K    A+ ++ P V + +  L ++D+ R     +    S  N+    ++GEI+NRSAHL+AIK D+ET+G+FI  LI+
Subjt:  MPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANA--RDMIGEIENRSAHLLAIKTDVETQGDFIRFLIK

Query:  EVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMR
        +V    F+D+EDV+ FV WLD EL+ L DERAVLKHF+WPE+KAD L+EAA  Y +LKKLE E SS+  D     G ALKKM  LL+K E  +  L R+R
Subjt:  EVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMR

Query:  ESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEE---EELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
         S+ + Y+ F+IPVEWMLDSG++ +IK  S+KLA  YM RV+  L++      E   E L++QGVRFA+R HQFAGG D ET+ A +E++ +  S
Subjt:  ESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGGPEE---EELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein7.0e-8447.73Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A   Q N   E  E K  E           ++  + +  L +I+ R PRV
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRREPGSGI
        P+PPP+ +    S+   S+      G        PPP P  P   PPPPP    PPPPPPP    R      KV R PE+VEFY SLM+R+S++E    +
Subjt:  PKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRREPGSGI

Query:  TDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYC
               S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF Y 
Subjt:  TDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYC

Query:  DLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGG---P
        DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+  L++V G    P
Subjt:  DLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGG---P

Query:  EEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
          E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  EEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein3.0e-0223.56Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K VEI+  +  I  L+AE ++L+ E+ +  + + ++   ++ ++K L+ +I    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMK

Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNI
        ++  ++++ +L+L    +S+ Q  +   E   K   +    +A +  +  V +   K    + +K E+        + DS E   +TLSN+
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNI

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein7.0e-8447.73Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A   Q N   E  E K  E           ++  + +  L +I+ R PRV
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRREPGSGI
        P+PPP+ +    S+   S+      G        PPP P  P   PPPPP    PPPPPPP    R      KV R PE+VEFY SLM+R+S++E    +
Subjt:  PKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRREPGSGI

Query:  TDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYC
               S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF Y 
Subjt:  TDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYC

Query:  DLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGG---P
        DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+  L++V G    P
Subjt:  DLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGG---P

Query:  EEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
          E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  EEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein3.0e-0223.56Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K VEI+  +  I  L+AE ++L+ E+ +  + + ++   ++ ++K L+ +I    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMK

Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNI
        ++  ++++ +L+L    +S+ Q  +   E   K   +    +A +  +  V +   K    + +K E+        + DS E   +TLSN+
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNI

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein7.0e-8447.73Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRV
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A   Q N   E  E K  E           ++  + +  L +I+ R PRV
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVHQDNHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRV

Query:  PKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRREPGSGI
        P+PPP+ +    S+   S+      G        PPP P  P   PPPPP    PPPPPPP    R      KV R PE+VEFY SLM+R+S++E    +
Subjt:  PKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRREPGSGI

Query:  TDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYC
               S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF Y 
Subjt:  TDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYC

Query:  DLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGG---P
        DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+  L++V G    P
Subjt:  DLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETVGGG---P

Query:  EEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
          E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  EEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.9e-17058.55Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVES--SPKPSTPAQPSPSNAKVS-----------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRMVEELRDR
        MVAGKV+V MG  KSP+++K +   SP P  P  P P     S            K  F+RSFGVYFPR+SAQV            V+EL R VEELR+R
Subjt:  MVAGKVKVAMGLQKSPASRKVES--SPKPSTPAQPSPSNAKVS-----------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRMVEELRDR

Query:  EARLKTDLLEHKLLKESVAIVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILEN
        EA LKT+ LE KLL+ESV+++P+LE++I+ K+ EI+   K    L  +NERLR E +      EE RRE + R K +E EI E++K+    S      ++
Subjt:  EARLKTDLLEHKLLKESVAIVPMLENEISTKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILEN

Query:  DELSASQRFQGLMEVSGKSNLIRNLKRA---TKCSDAVVHQDNHKVEHPEA--------KKEEVETERPRHSR-CDSEELAE-STLSNIKSRIPRVPKPP
          LS SQRFQGLM+VS KSNLIR+LKR        + + +Q+N       +        +K+E+E+    +SR  +SEEL E S+LS ++SR+PRVPKPP
Subjt:  DELSASQRFQGLMEVSGKSNLIRNLKRA---TKCSDAVVHQDNHKVEHPEA--------KKEEVETERPRHSR-CDSEELAE-STLSNIKSRIPRVPKPP

Query:  PKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPM---PPPPPPPSKS-APPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDS----RREPGS
        PK S S        S+   +    +K+IP PPP P  P+   PPPPP  SK+  PPPPPPPPK      AKVRRVPEVVEFYHSLMRRDS    R   G 
Subjt:  PKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPM---PPPPPPPSKS-APPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDS----RREPGS

Query:  GITDPP---STANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFG
        G        + +NARDMIGEIENRS +LLAIKTDVETQGDFIRFLIKEV NA+F+DIEDVVPFVKWLDDELSYLVDERAVLKHF+WPEQKADALREAAF 
Subjt:  GITDPP---STANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFG

Query:  YCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETV-GGGP
        Y DLKKL +EAS FR D RQ   SALKKMQAL EKLEHGVY+LSRMRESA  ++K+FQIPV+WML++GI SQIKL SVKLAMKYMKRVSA LE + GGGP
Subjt:  YCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALETV-GGGP

Query:  EEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQSQQPKYVCSSRPTTC
        EEEELIVQGVRFAFRVHQFAGGFD ETM+AF+ELRDKA SCHVQCQSQ  ++    R T C
Subjt:  EEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQSQQPKYVCSSRPTTC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGCTGGGAAGGTGAAGGTCGCCATGGGGCTGCAGAAGTCTCCGGCCAGTAGAAAGGTTGAGAGCTCGCCCAAGCCATCCACGCCGGCGCAGCCTTCTCCGAGCAA
CGCTAAGGTTTCTCAGAAAACAGTATTCTCCCGCTCGTTTGGTGTCTACTTCCCTCGCTCTTCTGCTCAGGTTCAGCCTCGACCGCCTGACGTCACGGAGCTCCTCCGTA
TGGTCGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTGTTGAAGGAATCTGTCGCCATTGTTCCTATGCTTGAGAACGAGATCTCT
ACGAAGGATGTGGAGATTGAGAGAGCTTCTAAGCGGATACTGTTCTTGGAGGCGGAGAATGAGAGATTGAGAGTTGAAGTGGAGGAAGTTAAACTGAGTATCGAGGAACA
GAGGAGAGAGAGTCAGGAGAGATTAAAGGCACTGGAAGGCGAAATCGCGGAGATGAAGAAAATGGCGTTGGATCGAAGCAGAATGGAGCTTATTTTGGAGAACGACGAGC
TTTCGGCGTCGCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACCAAATGTTCGGATGCTGTTGTTCACCAAGAC
AATCATAAGGTTGAACATCCAGAGGCAAAGAAAGAAGAAGTTGAAACCGAGAGACCGAGACACTCGCGATGCGACTCTGAAGAACTCGCCGAATCCACTCTATCTAACAT
TAAATCGCGTATACCTAGGGTTCCAAAACCTCCTCCGAAACCTTCTTCATCTTCCACTTCTTCTGCCACTTCTTCCTCCTCATCAACGGGCTCTTCTGGTGACGTAGAGA
AAACGATCCCAGTCCCACCCCCTGTCCCAACCAAGCCAATGCCGCCGCCTCCTCCGCCACCTTCGAAGTCGGCTCCACCTCCCCCTCCACCGCCTCCCAAGGGTAAGAGG
CCGATGCCGGCGAAGGTGCGGCGAGTGCCGGAGGTTGTTGAGTTCTATCATTCATTAATGCGGAGGGACTCCCGGCGAGAACCCGGCTCCGGCATTACGGACCCGCCGTC
GACCGCCAATGCTCGTGACATGATCGGAGAGATCGAGAACCGGTCCGCTCACTTACTCGCTATTAAGACGGATGTAGAGACTCAAGGGGATTTCATAAGGTTCTTGATAA
AAGAAGTTGAAAATGCTTCATTTACTGACATTGAGGACGTTGTGCCATTTGTCAAATGGTTAGATGATGAGCTATCATATCTGGTGGATGAAAGAGCCGTGCTTAAACAC
TTCCAGTGGCCGGAGCAAAAGGCCGACGCTTTGCGTGAGGCTGCATTTGGCTATTGCGATCTTAAGAAGCTGGAAACTGAAGCGTCGTCGTTTCGTGGTGATGCCCGCCA
GCCTTGTGGTTCGGCTCTCAAGAAGATGCAAGCTTTGCTTGAAAAATTGGAGCATGGTGTATACAATTTGTCTCGAATGCGTGAATCTGCAACTAAGAGATACAAAGCAT
TTCAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAGAGGGTATCCGCAGCGCTTGAA
ACGGTCGGTGGTGGACCTGAAGAAGAAGAGCTGATCGTCCAAGGCGTTCGATTTGCTTTCCGTGTGCATCAGTTTGCAGGAGGGTTTGATGTGGAAACGATGAGGGCATT
TCAAGAGCTGAGAGATAAAGCAAGTTCATGTCACGTACAATGCCAAAGCCAACAACCCAAGTACGTGTGCAGTAGCAGGCCTACAACTTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGCTGGGAAGGTGAAGGTCGCCATGGGGCTGCAGAAGTCTCCGGCCAGTAGAAAGGTTGAGAGCTCGCCCAAGCCATCCACGCCGGCGCAGCCTTCTCCGAGCAA
CGCTAAGGTTTCTCAGAAAACAGTATTCTCCCGCTCGTTTGGTGTCTACTTCCCTCGCTCTTCTGCTCAGGTTCAGCCTCGACCGCCTGACGTCACGGAGCTCCTCCGTA
TGGTCGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTGTTGAAGGAATCTGTCGCCATTGTTCCTATGCTTGAGAACGAGATCTCT
ACGAAGGATGTGGAGATTGAGAGAGCTTCTAAGCGGATACTGTTCTTGGAGGCGGAGAATGAGAGATTGAGAGTTGAAGTGGAGGAAGTTAAACTGAGTATCGAGGAACA
GAGGAGAGAGAGTCAGGAGAGATTAAAGGCACTGGAAGGCGAAATCGCGGAGATGAAGAAAATGGCGTTGGATCGAAGCAGAATGGAGCTTATTTTGGAGAACGACGAGC
TTTCGGCGTCGCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACCAAATGTTCGGATGCTGTTGTTCACCAAGAC
AATCATAAGGTTGAACATCCAGAGGCAAAGAAAGAAGAAGTTGAAACCGAGAGACCGAGACACTCGCGATGCGACTCTGAAGAACTCGCCGAATCCACTCTATCTAACAT
TAAATCGCGTATACCTAGGGTTCCAAAACCTCCTCCGAAACCTTCTTCATCTTCCACTTCTTCTGCCACTTCTTCCTCCTCATCAACGGGCTCTTCTGGTGACGTAGAGA
AAACGATCCCAGTCCCACCCCCTGTCCCAACCAAGCCAATGCCGCCGCCTCCTCCGCCACCTTCGAAGTCGGCTCCACCTCCCCCTCCACCGCCTCCCAAGGGTAAGAGG
CCGATGCCGGCGAAGGTGCGGCGAGTGCCGGAGGTTGTTGAGTTCTATCATTCATTAATGCGGAGGGACTCCCGGCGAGAACCCGGCTCCGGCATTACGGACCCGCCGTC
GACCGCCAATGCTCGTGACATGATCGGAGAGATCGAGAACCGGTCCGCTCACTTACTCGCTATTAAGACGGATGTAGAGACTCAAGGGGATTTCATAAGGTTCTTGATAA
AAGAAGTTGAAAATGCTTCATTTACTGACATTGAGGACGTTGTGCCATTTGTCAAATGGTTAGATGATGAGCTATCATATCTGGTGGATGAAAGAGCCGTGCTTAAACAC
TTCCAGTGGCCGGAGCAAAAGGCCGACGCTTTGCGTGAGGCTGCATTTGGCTATTGCGATCTTAAGAAGCTGGAAACTGAAGCGTCGTCGTTTCGTGGTGATGCCCGCCA
GCCTTGTGGTTCGGCTCTCAAGAAGATGCAAGCTTTGCTTGAAAAATTGGAGCATGGTGTATACAATTTGTCTCGAATGCGTGAATCTGCAACTAAGAGATACAAAGCAT
TTCAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAGAGGGTATCCGCAGCGCTTGAA
ACGGTCGGTGGTGGACCTGAAGAAGAAGAGCTGATCGTCCAAGGCGTTCGATTTGCTTTCCGTGTGCATCAGTTTGCAGGAGGGTTTGATGTGGAAACGATGAGGGCATT
TCAAGAGCTGAGAGATAAAGCAAGTTCATGTCACGTACAATGCCAAAGCCAACAACCCAAGTACGTGTGCAGTAGCAGGCCTACAACTTGTTAA
Protein sequenceShow/hide protein sequence
MVAGKVKVAMGLQKSPASRKVESSPKPSTPAQPSPSNAKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIS
TKDVEIERASKRILFLEAENERLRVEVEEVKLSIEEQRRESQERLKALEGEIAEMKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVHQD
NHKVEHPEAKKEEVETERPRHSRCDSEELAESTLSNIKSRIPRVPKPPPKPSSSSTSSATSSSSSTGSSGDVEKTIPVPPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKR
PMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKH
FQWPEQKADALREAAFGYCDLKKLETEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAALE
TVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQSQQPKYVCSSRPTTC