CuGenDBv2

Carg18215 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg18215
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: protein CHUP1, chloroplastic-like
Genome location: Carg_Chr07:959792..967556
RNA-Seq Expression: Carg18215
Synteny: Carg18215
Gene Ontology terms:
  GO:0009658 - chloroplast organization (biological process)
  GO:0009707 - chloroplast outer membrane (cellular component)
  GO:0005525 - GTP binding (molecular function)
InterPro domains:
  IPR011719 - Conserved hypothetical protein CHP02058
  IPR037103 - Tubulin/FtsZ-like, C-terminal domain
  IPR040265 - Protein CHUP1-like


Homology

GenBank top hits: e value, % identity, alignment
KAA0052630.1 protein CHUP1 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 86.89
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPK STPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEEVKQSVEE+RRES+ER+KAMEGEI+ELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIRNLKR TK SD VV QDNHKVE PE KKEEVETERPRHSR NSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSAT+SSSS STGSS D EK IPA
Subjt:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP PAKVRRIPEVVEFYHSLMRRDSRR+ GS VT+PPS+ANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPC SALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQHKYGNPCKSPFGQWTWICSFLTHLATRQLASSLGDFRFHSYLRSNNHPSSSAHLIC----PRVFRLVKAAAALSMEVEQGAQSAPVSSTALP
        HVQCQNQQ                                    +F SY  S NH SS  HLIC    PR+ RL+K  A  SMEVEQG +SAPV ST  P
Subjt:  HVQCQNQQHKYGNPCKSPFGQWTWICSFLTHLATRQLASSLGDFRFHSYLRSNNHPSSSAHLIC----PRVFRLVKAAAALSMEVEQGAQSAPVSSTALP

Query:  MKLLFVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP
        MKLLFVEMGVGYDQHGQDITAAAMRACRDAI SNSIPAFRRG+IPGVSFGEMKLQIKLGVP SLQQSLD+EKVKSVFP
Subjt:  MKLLFVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP

KAG6594554.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 96.94
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIR+LKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
Subjt:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQHKYGNPCKSPFGQWTWICSFLTHLATRQLASSLGDFRFHSYLRSNNHPSSSAHLICPRVFRLVKAAAALSMEVEQGAQSAPVSSTALPMKLL
        HVQCQNQQHKY       FG                 ASSLGDFRFHSYLRSNNHPSSSAHLICPRVFRLVKAAAALSMEVEQGAQSAPVSSTALPMKLL
Subjt:  HVQCQNQQHKYGNPCKSPFGQWTWICSFLTHLATRQLASSLGDFRFHSYLRSNNHPSSSAHLICPRVFRLVKAAAALSMEVEQGAQSAPVSSTALPMKLL

Query:  FVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMG
        FVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMG
Subjt:  FVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMG

Query:  DKNDDCYIVNAAVYVGY
        DKNDDCYIVNAAVYVGY
Subjt:  DKNDDCYIVNAAVYVGY

KAG7026530.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 100
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
Subjt:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQHKYGNPCKSPFGQWTWICSFLTHLATRQLASSLGDFRFHSYLRSNNHPSSSAHLICPRVFRLVKAAAALSMEVEQGAQSAPVSSTALPMKLL
        HVQCQNQQHKYGNPCKSPFGQWTWICSFLTHLATRQLASSLGDFRFHSYLRSNNHPSSSAHLICPRVFRLVKAAAALSMEVEQGAQSAPVSSTALPMKLL
Subjt:  HVQCQNQQHKYGNPCKSPFGQWTWICSFLTHLATRQLASSLGDFRFHSYLRSNNHPSSSAHLICPRVFRLVKAAAALSMEVEQGAQSAPVSSTALPMKLL

Query:  FVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMG
        FVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMG
Subjt:  FVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMG

Query:  DKNDDCYIVNAAVYVGY
        DKNDDCYIVNAAVYVGY
Subjt:  DKNDDCYIVNAAVYVGY

XP_022926872.1 protein CHUP1, chloroplastic-like [Cucurbita moschata] | e value: 0.0e+00 | % identity: 99.02
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRES+ERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIR+LKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
Subjt:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGS VTEPPSSANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPC SALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQHKY
        HVQCQNQQHKY
Subjt:  HVQCQNQQHKY

XP_023003548.1 protein CHUP1, chloroplastic-like [Cucurbita maxima] | e value: 0.0e+00 | % identity: 99.02
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPA RKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRES+ERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIR+LKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
Subjt:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPT AKVRRIPEVVEFYHSLMRRDSRRELGS VTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPC SALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQHKY
        HVQCQNQQHKY
Subjt:  HVQCQNQQHKY

TrEMBL top hits: e value, % identity, alignment
A0A0A0KHU8 Uncharacterized protein | e value: 3.4e-296 | % identity: 92.97
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPK STPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEE KQSVEE+RRES+ER+KAMEGE+AELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIRNLKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR NSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSAT+S+SS+STGSS D EK IPA
Subjt:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTK   PPPPPPPSKSAPPPPPPPPKGKR  PAKVRRIPEVVEFYHSLMRRDSRR+ GS VTEPPS+ANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPC SALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKY
        HVQCQN QQHKY
Subjt:  HVQCQN-QQHKY

A0A1S3AZH3 protein CHUP1, chloroplastic | e value: 3.4e-296 | % identity: 93.14
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPK STPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEEVKQSVEE+RRES+ER+KAMEGEI+ELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIRNLKR TK SD VV QDNHKVE PE KKEEVETERPRHSR NSEELAESTLSNIKSRIPRVP+PPPKPSSSSSSSAT+SSS  STGSS D EK IPA
Subjt:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP PAKVRRIPEVVEFYHSLMRRDSRR+ GS VT+PPS+ANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
        FIR LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPC SALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKY
        HVQCQN QQHKY
Subjt:  HVQCQN-QQHKY

A0A5D3CMM2 Protein CHUP1 | e value: 0.0e+00 | % identity: 86.89
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPK STPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEEVKQSVEE+RRES+ER+KAMEGEI+ELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIRNLKR TK SD VV QDNHKVE PE KKEEVETERPRHSR NSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSAT+SSSS STGSS D EK IPA
Subjt:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP PAKVRRIPEVVEFYHSLMRRDSRR+ GS VT+PPS+ANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPC SALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQHKYGNPCKSPFGQWTWICSFLTHLATRQLASSLGDFRFHSYLRSNNHPSSSAHLIC----PRVFRLVKAAAALSMEVEQGAQSAPVSSTALP
        HVQCQNQQ                                    +F SY  S NH SS  HLIC    PR+ RL+K  A  SMEVEQG +SAPV ST  P
Subjt:  HVQCQNQQHKYGNPCKSPFGQWTWICSFLTHLATRQLASSLGDFRFHSYLRSNNHPSSSAHLIC----PRVFRLVKAAAALSMEVEQGAQSAPVSSTALP

Query:  MKLLFVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP
        MKLLFVEMGVGYDQHGQDITAAAMRACRDAI SNSIPAFRRG+IPGVSFGEMKLQIKLGVP SLQQSLD+EKVKSVFP
Subjt:  MKLLFVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP

A0A6J1EFK1 protein CHUP1, chloroplastic-like | e value: 0.0e+00 | % identity: 99.02
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRES+ERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIR+LKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
Subjt:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGS VTEPPSSANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPC SALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQHKY
        HVQCQNQQHKY
Subjt:  HVQCQNQQHKY

A0A6J1KWU6 protein CHUP1, chloroplastic-like | e value: 0.0e+00 | % identity: 99.02
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPA RKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRES+ERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIR+LKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
Subjt:  NLIRNLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPT AKVRRIPEVVEFYHSLMRRDSRRELGS VTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPC SALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQHKY
        HVQCQNQQHKY
Subjt:  HVQCQNQQHKY

SwissProt top hits: e value, % identity, alignment
Q9LI74 Protein CHUP1, chloroplastic | e value: 4.9e-82 | % identity: 47.75
Query:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       +V+T    Q N   E  E K  E           N+  + +  L +I+ R P
Subjt:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP

Query:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRREL
        RVP+PPP+ +    S+   S+     G           PPP P  P   PPPPP     PPPPPP   G+      KV R PE+VEFY SLM+R+S++E 
Subjt:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRREL

Query:  GSSVTEP---PSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA
          S+       SSA   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAA
Subjt:  GSSVTEP---PSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG
        F Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V G 
Subjt:  FGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG

Query:  ---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
           P  E L+++GVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Q9LI74 Protein CHUP1, chloroplastic | e value: 4.7e+00 | % identity: 22.99
Query:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESEERVK
        ++  L QLV+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L          R E+E  +  ++E +R+ +    
Subjt:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESEERVK

Query:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVTQDNHKVEQPEAK
          +G++  LK+     +  E    N +    ++ + + ++  +   +  LKR  +       + + K++  EA+
Subjt:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVTQDNHKVEQPEAK

Arabidopsis top hits: e value, % identity, alignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein | e value: 2.1e-59 | % identity: 36.68
Query:  ELRDREARLKTDLLEH-KLLKESVAIVPMLENEIATKDAEIE-----------RASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAE
        ++++  A+ ++ LL+  K  +E +A++         + A +E           ++ + ++   A  +  R  +EE    +EE+   +E  +K ++ ++  
Subjt:  ELRDREARLKTDLLEH-KLLKESVAIVPMLENEIATKDAEIE-----------RASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAE

Query:  LKKMALDRR--RMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVTQD-----NHKVEQPEAKKE-EVETERPRHSRSNSEELAESTLSN
        LK    + R   +EL L N +LS     Q L+    K + + +  +P K       +D       K+EQP+ KKE  VE+ R             S  S 
Subjt:  LKKMALDRR--RMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVTQD-----NHKVEQPEAKKE-EVETERPRHSRSNSEELAESTLSN

Query:  IKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRD
          SR+P  P P PK   S +SS      ++S                    P  PP PPPP    PPPPPP P  K    A+ ++ P V + +  L ++D
Subjt:  IKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRD

Query:  SRRELGSSVTEPPSSANA--RDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADAL
        + R L  SV    S  N+    ++GEI+NRSAHL+AIK D+ET+G+FI  LI++V    F+D+EDV+ FV WLD EL+ L DERAVLKHF+WPE+KAD L
Subjt:  SRRELGSSVTEPPSSANA--RDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADAL

Query:  REAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELET
        +EAA  Y +LKKLE E SS+  D       ALKKM  LL+K E  +  L R+R S+ + Y+ F+IPVEWMLDSG++ +IK  S+KLA  YM RV+ EL++
Subjt:  REAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELET

Query:  VGGGPEE---EELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
              E   E L+++GVRFA+R HQFAGG D ET+ A +E++ +  S
Subjt:  VGGGPEE---EELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein | e value: 3.5e-83 | % identity: 47.75
Query:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       +V+T    Q N   E  E K  E           N+  + +  L +I+ R P
Subjt:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP

Query:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRREL
        RVP+PPP+ +    S+   S+     G           PPP P  P   PPPPP     PPPPPP   G+      KV R PE+VEFY SLM+R+S++E 
Subjt:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRREL

Query:  GSSVTEP---PSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA
          S+       SSA   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAA
Subjt:  GSSVTEP---PSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG
        F Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V G 
Subjt:  FGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG

Query:  ---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
           P  E L+++GVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein | e value: 3.4e-01 | % identity: 22.99
Query:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESEERVK
        ++  L QLV+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L          R E+E  +  ++E +R+ +    
Subjt:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESEERVK

Query:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVTQDNHKVEQPEAK
          +G++  LK+     +  E    N +    ++ + + ++  +   +  LKR  +       + + K++  EA+
Subjt:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVTQDNHKVEQPEAK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein | e value: 3.5e-83 | % identity: 47.75
Query:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       +V+T    Q N   E  E K  E           N+  + +  L +I+ R P
Subjt:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP

Query:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRREL
        RVP+PPP+ +    S+   S+     G           PPP P  P   PPPPP     PPPPPP   G+      KV R PE+VEFY SLM+R+S++E 
Subjt:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRREL

Query:  GSSVTEP---PSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA
          S+       SSA   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAA
Subjt:  GSSVTEP---PSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG
        F Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V G 
Subjt:  FGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG

Query:  ---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
           P  E L+++GVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein | e value: 3.4e-01 | %identity: 22.99
Query:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESEERVK
        ++  L QLV+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L          R E+E  +  ++E +R+ +    
Subjt:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESEERVK

Query:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVTQDNHKVEQPEAK
          +G++  LK+     +  E    N +    ++ + + ++  +   +  LKR  +       + + K++  EA+
Subjt:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVTQDNHKVEQPEAK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein | e value: 3.5e-83 | %identity: 47.75
Query:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       +V+T    Q N   E  E K  E           N+  + +  L +I+ R P
Subjt:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP

Query:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRREL
        RVP+PPP+ +    S+   S+     G           PPP P  P   PPPPP     PPPPPP   G+      KV R PE+VEFY SLM+R+S++E 
Subjt:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRREL

Query:  GSSVTEP---PSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA
          S+       SSA   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAA
Subjt:  GSSVTEP---PSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG
        F Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V G 
Subjt:  FGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGG

Query:  ---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
           P  E L+++GVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 3.8e-170 | %identity: 59.33
Query:  MVAGKVKLAMGLQKSPASRKVESSPK------PSTPAQPSPSSGKIS-------QKTVFSRSFGVYFPRSSAQVQPRLPD------VTELLQLVEELRDR
        MVAGKV++ MG  KSP+++K +  P       P  P    PSSG  +        K  F+RSFGVYFPR+SAQV            V+EL + VEELR+R
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPK------PSTPAQPSPSSGKIS-------QKTVFSRSFGVYFPRSSAQVQPRLPD------VTELLQLVEELRDR

Query:  EARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILEN
        EA LKT+ LE KLL+ESV+++P+LE++IA K+ EI+   K    L  +NERLR E +      EE RRE E R K ME EI EL+K+           ++
Subjt:  EARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILEN

Query:  DELSASQRFQGLMEVSGKSNLIRNLKRP---TKFSDTVVTQDNHKVEQPEA--------KKEEVETERPRHSR-SNSEELAE-STLSNIKSRIPRVPKPP
          LS SQRFQGLM+VS KSNLIR+LKR        + +  Q+N       +        +K+E+E+    +SR SNSEEL E S+LS ++SR+PRVPKPP
Subjt:  DELSASQRFQGLMEVSGKSNLIRNLKRP---TKFSDTVVTQDNHKVEQPEA--------KKEEVETERPRHSR-SNSEELAE-STLSNIKSRIPRVPKPP

Query:  PKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKS---APPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRD---SRREL-
        PK S        S   ST   +    +K IP PPP P  P    PPPPPS S    PPPPPPPPK      AKVRR+PEVVEFYHSLMRRD   SRR+  
Subjt:  PKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKS---APPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRD---SRREL-

Query:  --GSSVTEP-PSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA
          G++  E   +++NARDMIGEIENRS +LLAIKTDVETQGDFIRFLIKEV NA+F+DIEDVVPFVKWLDDELSYLVDERAVLKHF+WPEQKADALREAA
Subjt:  --GSSVTEP-PSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETV-GG
        F Y DLKKL +EAS FR D RQ  +SALKKMQAL EKLEHGVY+LSRMRESA  ++K+FQIPV+WML++GI SQIKL SVKLAMKYMKRVSAELE + GG
Subjt:  FGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETV-GG

Query:  GPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQHKY
        GPEEEELIV+GVRFAFRVHQFAGGFD ETM+AF+ELRDKA SCHVQCQ+Q H++
Subjt:  GPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQHKY
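
The %identity values listed for each hit can be related to the alignment blocks with a small calculation. The sketch below assumes the usual BLAST convention that a letter on the middle line marks an identical residue, while "+" and blanks mark conservative substitutions and mismatches or gaps. The function name `percent_identity` is hypothetical and is not part of any database tool. It works on a single block, so its result for a fragment will generally differ from the per-hit figure, which covers the whole alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity for one alignment block.

    query   -- the aligned query string (letters and '-' gap characters)
    midline -- the BLAST-style middle line for the same block
    """
    if not query:
        raise ValueError("empty alignment block")
    # Trailing blanks of the middle line are often lost when a page is
    # copied as text, so pad it back to the length of the aligned query.
    padded = midline.ljust(len(query))
    # Only letters on the middle line count as identities; '+' and ' ' do not.
    identities = sum(1 for ch in padded[: len(query)] if ch.isalpha())
    return 100.0 * identities / len(query)


# First 11 columns of the AT4G18570.1 alignment shown above.
fragment_identity = percent_identity("MVAGKVKLAMG", "MVAGKV++ MG")
print(f"{fragment_identity:.2f}% identity over this fragment")
```

To approximate a per-hit figure, the identities and column counts of all blocks belonging to one hit would be summed before dividing.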


Sequences
CDS sequence
ATGGTAGCTGGGAAGGTGAAGCTCGCAATGGGGCTGCAGAAGTCTCCGGCGAGTAGAAAGGTTGAGAGTTCACCGAAGCCATCCACCCCGGCACAGCCTTCTCCGAGCTC
TGGTAAAATTTCTCAGAAAACTGTCTTCTCCCGCTCGTTTGGTGTGTATTTCCCGCGCTCTTCTGCTCAGGTTCAGCCTCGACTGCCTGACGTGACGGAGCTCCTCCAGT
TGGTGGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTGTTGAAGGAATCTGTCGCCATCGTTCCTATGCTTGAGAACGAGATCGCT
ACGAAAGATGCGGAGATTGAGAGAGCGTCTAAGCGGATACTGTTCTTGGAGGCGGAGAATGAGAGATTAAGAGTTGAAGTGGAGGAAGTTAAACAGAGTGTTGAGGAACA
GAGGAGAGAGAGTGAAGAGAGAGTAAAAGCAATGGAAGGTGAAATCGCGGAGCTGAAGAAAATGGCGTTGGATCGACGCAGAATGGAGCTTATTTTGGAGAACGACGAGC
TTTCGGCGTCGCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAATCTAACCTAATCAGGAACCTGAAAAGACCGACCAAGTTTTCGGATACTGTTGTTACTCAAGAC
AATCATAAGGTTGAACAACCAGAGGCGAAGAAAGAAGAGGTTGAAACCGAGAGACCGAGACACTCGCGAAGTAATTCCGAAGAACTCGCCGAGTCCACTTTATCTAACAT
AAAATCGCGAATACCTAGGGTTCCAAAACCTCCTCCCAAACCTTCTTCCTCTTCCTCTTCTTCTGCCACTTCTTCCTCCTCCTCCACATCAACTGGCTCTTCTGGTGACG
CTGAGAAAAAGATCCCAGCCCCACCTCCTGTCCCAACCAAGCCAACGCCACCGCCGCCTCCTCCGCCACCTTCGAAGTCGGCTCCGCCTCCTCCTCCACCGCCTCCCAAG
GGTAAGAGGCCGACGCCAGCGAAGGTGCGACGAATACCGGAGGTTGTGGAGTTCTATCACTCATTAATGCGGAGGGATTCCCGGCGAGAACTCGGCTCCAGTGTTACGGA
ACCGCCGTCCTCCGCCAATGCTCGTGACATGATCGGAGAGATCGAGAACCGGTCCGCTCACTTGCTCGCTATAAAAACGGATGTGGAGACTCAAGGGGATTTCATAAGGT
TCTTGATCAAAGAAGTTGAAAATGCTTCATTTACTGACATTGAGGACGTTGTGCCATTTGTGAAATGGTTGGATGATGAGCTCTCATATCTGGTGGATGAAAGAGCCGTG
CTTAAACACTTCCAGTGGCCGGAGCAGAAGGCCGATGCTCTACGTGAGGCTGCATTTGGCTATTGTGATCTAAAGAAGCTGGAAGCCGAAGCGTCGTCCTTTCGTGGTGA
TGCCCGCCAACCTTGTGCTTCGGCTCTCAAGAAGATGCAAGCTTTGCTTGAAAAGTTGGAGCATGGAGTATACAATCTGTCTAGAATGCGTGAATCTGCAACTAAGAGAT
ACAAAGCATTTCAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAAAGAGTATCCGCA
GAGCTTGAAACAGTGGGTGGTGGACCTGAAGAAGAAGAGTTGATTGTTCGAGGCGTTCGATTTGCTTTCCGTGTGCATCAGTTTGCTGGAGGGTTTGATGTGGAAACAAT
GAGGGCATTTCAAGAGCTGAGAGACAAGGCAAGTTCATGTCACGTACAATGCCAAAACCAGCAACATAAGTACGGTAATCCTTGTAAATCCCCGTTCGGACAGTGGACTT
GGATTTGTAGCTTTCTTACGCATTTGGCAACGCGTCAGCTGGCCAGTTCATTGGGTGATTTCAGGTTTCATTCTTACCTTCGTTCCAATAATCATCCTTCATCTTCTGCG
CATTTGATCTGCCCTCGAGTTTTTCGCCTTGTCAAAGCTGCTGCTGCGTTGTCCATGGAGGTCGAGCAAGGTGCACAATCTGCTCCTGTTAGCAGCACAGCTCTGCCCAT
GAAGCTCTTGTTTGTCGAGATGGGCGTTGGCTACGATCAACATGGTCAAGATATCACGGCTGCTGCAATGCGAGCCTGCAGGGATGCCATATGCTCCAATTCGATTCCAG
CATTCCGTAGAGGTACCATTCCTGGAGTCTCATTTGGAGAGATGAAACTACAGATCAAACTTGGAGTTCCACAGTCGCTTCAACAATCCTTGGATGTTGAAAAAGTCAAG
TCTGTCTTTCCATATGGGAAGATTCTGAATGTTGAGGTTGTCGATGGTGGCTTAATATGTTCCAGTGGTGTGCATGTGGAAGAAATGGGAGACAAAAATGATGACTGTTA
CATAGTAAATGCCGCCGTATATGTTGGCTACTAA
mRNA sequence
ATGGTAGCTGGGAAGGTGAAGCTCGCAATGGGGCTGCAGAAGTCTCCGGCGAGTAGAAAGGTTGAGAGTTCACCGAAGCCATCCACCCCGGCACAGCCTTCTCCGAGCTC
TGGTAAAATTTCTCAGAAAACTGTCTTCTCCCGCTCGTTTGGTGTGTATTTCCCGCGCTCTTCTGCTCAGGTTCAGCCTCGACTGCCTGACGTGACGGAGCTCCTCCAGT
TGGTGGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTGTTGAAGGAATCTGTCGCCATCGTTCCTATGCTTGAGAACGAGATCGCT
ACGAAAGATGCGGAGATTGAGAGAGCGTCTAAGCGGATACTGTTCTTGGAGGCGGAGAATGAGAGATTAAGAGTTGAAGTGGAGGAAGTTAAACAGAGTGTTGAGGAACA
GAGGAGAGAGAGTGAAGAGAGAGTAAAAGCAATGGAAGGTGAAATCGCGGAGCTGAAGAAAATGGCGTTGGATCGACGCAGAATGGAGCTTATTTTGGAGAACGACGAGC
TTTCGGCGTCGCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAATCTAACCTAATCAGGAACCTGAAAAGACCGACCAAGTTTTCGGATACTGTTGTTACTCAAGAC
AATCATAAGGTTGAACAACCAGAGGCGAAGAAAGAAGAGGTTGAAACCGAGAGACCGAGACACTCGCGAAGTAATTCCGAAGAACTCGCCGAGTCCACTTTATCTAACAT
AAAATCGCGAATACCTAGGGTTCCAAAACCTCCTCCCAAACCTTCTTCCTCTTCCTCTTCTTCTGCCACTTCTTCCTCCTCCTCCACATCAACTGGCTCTTCTGGTGACG
CTGAGAAAAAGATCCCAGCCCCACCTCCTGTCCCAACCAAGCCAACGCCACCGCCGCCTCCTCCGCCACCTTCGAAGTCGGCTCCGCCTCCTCCTCCACCGCCTCCCAAG
GGTAAGAGGCCGACGCCAGCGAAGGTGCGACGAATACCGGAGGTTGTGGAGTTCTATCACTCATTAATGCGGAGGGATTCCCGGCGAGAACTCGGCTCCAGTGTTACGGA
ACCGCCGTCCTCCGCCAATGCTCGTGACATGATCGGAGAGATCGAGAACCGGTCCGCTCACTTGCTCGCTATAAAAACGGATGTGGAGACTCAAGGGGATTTCATAAGGT
TCTTGATCAAAGAAGTTGAAAATGCTTCATTTACTGACATTGAGGACGTTGTGCCATTTGTGAAATGGTTGGATGATGAGCTCTCATATCTGGTGGATGAAAGAGCCGTG
CTTAAACACTTCCAGTGGCCGGAGCAGAAGGCCGATGCTCTACGTGAGGCTGCATTTGGCTATTGTGATCTAAAGAAGCTGGAAGCCGAAGCGTCGTCCTTTCGTGGTGA
TGCCCGCCAACCTTGTGCTTCGGCTCTCAAGAAGATGCAAGCTTTGCTTGAAAAGTTGGAGCATGGAGTATACAATCTGTCTAGAATGCGTGAATCTGCAACTAAGAGAT
ACAAAGCATTTCAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAAAGAGTATCCGCA
GAGCTTGAAACAGTGGGTGGTGGACCTGAAGAAGAAGAGTTGATTGTTCGAGGCGTTCGATTTGCTTTCCGTGTGCATCAGTTTGCTGGAGGGTTTGATGTGGAAACAAT
GAGGGCATTTCAAGAGCTGAGAGACAAGGCAAGTTCATGTCACGTACAATGCCAAAACCAGCAACATAAGTACGGTAATCCTTGTAAATCCCCGTTCGGACAGTGGACTT
GGATTTGTAGCTTTCTTACGCATTTGGCAACGCGTCAGCTGGCCAGTTCATTGGGTGATTTCAGGTTTCATTCTTACCTTCGTTCCAATAATCATCCTTCATCTTCTGCG
CATTTGATCTGCCCTCGAGTTTTTCGCCTTGTCAAAGCTGCTGCTGCGTTGTCCATGGAGGTCGAGCAAGGTGCACAATCTGCTCCTGTTAGCAGCACAGCTCTGCCCAT
GAAGCTCTTGTTTGTCGAGATGGGCGTTGGCTACGATCAACATGGTCAAGATATCACGGCTGCTGCAATGCGAGCCTGCAGGGATGCCATATGCTCCAATTCGATTCCAG
CATTCCGTAGAGGTACCATTCCTGGAGTCTCATTTGGAGAGATGAAACTACAGATCAAACTTGGAGTTCCACAGTCGCTTCAACAATCCTTGGATGTTGAAAAAGTCAAG
TCTGTCTTTCCATATGGGAAGATTCTGAATGTTGAGGTTGTCGATGGTGGCTTAATATGTTCCAGTGGTGTGCATGTGGAAGAAATGGGAGACAAAAATGATGACTGTTA
CATAGTAAATGCCGCCGTATATGTTGGCTACTAATTCTTGTTGCCTTATATTCCTTTCGAAGACCTTGAGGATTATTTACCAGTCATCTTTGATTGTAACCATACAGTTA
GTTATGCTTGATGTATACATTGACGTTTCAGTTTCCTTGGAGCTGTTGCTAGAACCGCAGGGTTATTTCTGGGCTCGATTAATCGCTCAATTTTAGTATTTCTTGGTTTC
AACGTTCTGCAATATACTGTTTCAAACTCATTTTTTGGTTATCATCATCTATGATCATT
Protein sequence
MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIA
TKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESEERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRPTKFSDTVVTQD
NHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPK
GKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSSVTEPPSSANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAV
LKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCASALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA
ELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQHKYGNPCKSPFGQWTWICSFLTHLATRQLASSLGDFRFHSYLRSNNHPSSSA
HLICPRVFRLVKAAAALSMEVEQGAQSAPVSSTALPMKLLFVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVK
SVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIVNAAVYVGY
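
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch in plain Python. The helper name `translate` is hypothetical, and the standard nuclear code (NCBI translation table 1) is assumed rather than stated on this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    # Remove line breaks and spaces so a multi-line paste works unchanged.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Codons containing anything other than A, C, G, T become 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First ten codons of the Carg18215 CDS listed above.
print(translate("ATGGTAGCTGGGAAGGTGAAGCTCGCAATG"))  # MVAGKVKLAM
# Last four codons of the CDS, ending in the TAA stop codon.
print(translate("GTTGGCTACTAA"))  # VGY*
```

Passing the full CDS should give the listed protein sequence followed by a single trailing `*` for the stop codon.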