; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh07G001890 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh07G001890
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionprotein CHUP1, chloroplastic-like
Genome locationCmo_Chr07:969192..979978
RNA-Seq ExpressionCmoCh07G001890
SyntenyCmoCh07G001890
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
GO:0005525 - GTP binding (molecular function)
InterPro domainsIPR011719 - Conserved hypothetical protein CHP02058
IPR037103 - Tubulin/FtsZ-like, C-terminal domain
IPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052630.1 protein CHUP1 [Cucumis melo var. makuwa]0.0e+0087.37Show/hide
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPK STPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEEVKQSVEE+RRESQER+KAMEGEI+ELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIR+LKR TK SD VV QDNHKVE PE KKEEVETERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSSAT+SSSS STGSS D EK IPA
Subjt:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP PAKVRRIPEVVEFYHSLMRRDSRR+ GSGVT+PPS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  H-QSQLQRALFFGCKDHLSYAFGNASAASSLGDFRFHSYLHSNNHPSSSAHLIC----PRVFRLVKAAAALSMEVEKGAQSAPVGSTALPMKLLFVEMGV
        H Q Q Q+                          +F SY  S NH SS  HLIC    PR+ RL+K  A  SMEVE+G +SAPV ST  PMKLLFVEMGV
Subjt:  H-QSQLQRALFFGCKDHLSYAFGNASAASSLGDFRFHSYLHSNNHPSSSAHLIC----PRVFRLVKAAAALSMEVEKGAQSAPVGSTALPMKLLFVEMGV

Query:  GYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP
        GYDQHGQDITAAAMRACRDAI SNSIPAFRRG+IPGVSFGEMKLQIKLGVP SLQQSLD+EKVKSVFP
Subjt:  GYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP

KAG6594554.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0097.25Show/hide
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRES+ERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
Subjt:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGS VTEPPSSANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPC SALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  H-QSQLQRALFFGCKDHLSYAFGNASAASSLGDFRFHSYLHSNNHPSSSAHLICPRVFRLVKAAAALSMEVEKGAQSAPVGSTALPMKLLFVEMGVGYDQ
        H Q Q Q+        +LSYAFGNASAASSLGDFRFHSYL SNNHPSSSAHLICPRVFRLVKAAAALSMEVE+GAQSAPV STALPMKLLFVEMGVGYDQ
Subjt:  H-QSQLQRALFFGCKDHLSYAFGNASAASSLGDFRFHSYLHSNNHPSSSAHLICPRVFRLVKAAAALSMEVEKGAQSAPVGSTALPMKLLFVEMGVGYDQ

Query:  HGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP
        HGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP
Subjt:  HGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP

KAG7026530.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0094.83Show/hide
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRES+ERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIR+LKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
Subjt:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGS VTEPPSSANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPC SALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  H-QSQLQRALFFG-CKDHLSY---------AFGNASAASSLGDFRFHSYLHSNNHPSSSAHLICPRVFRLVKAAAALSMEVEKGAQSAPVGSTALPMKLL
        H Q Q Q+  +   CK                     ASSLGDFRFHSYL SNNHPSSSAHLICPRVFRLVKAAAALSMEVE+GAQSAPV STALPMKLL
Subjt:  H-QSQLQRALFFG-CKDHLSY---------AFGNASAASSLGDFRFHSYLHSNNHPSSSAHLICPRVFRLVKAAAALSMEVEKGAQSAPVGSTALPMKLL

Query:  FVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP
        FVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP
Subjt:  FVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP

XP_022926872.1 protein CHUP1, chloroplastic-like [Cucurbita moschata]0.0e+00100Show/hide
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
Subjt:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  H
        H
Subjt:  H

XP_023518440.1 protein CHUP1, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0099.5Show/hide
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLS++KSRIPRVPKPPPKPSSSSSSSATSSSSS+STGSSGDAEKKIPA
Subjt:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  H
        H
Subjt:  H

TrEMBL top hitse value%identityAlignment
A0A0A0KHU8 Uncharacterized protein6.3e-29293.01Show/hide
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPK STPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEE KQSVEE+RRESQER+KAMEGE+AELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIR+LKR TK SD VV QDNHKVE PEAKKEEVETERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSSAT+S+SS+STGSS D EK IPA
Subjt:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPTK   PPPPPPPSKSAPPPPPPPPKGKR  PAKVRRIPEVVEFYHSLMRRDSRR+ GSGVTEPPS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  H
        H
Subjt:  H

A0A1S3AZH3 protein CHUP1, chloroplastic8.2e-29293.18Show/hide
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPK STPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEEVKQSVEE+RRESQER+KAMEGEI+ELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIR+LKR TK SD VV QDNHKVE PE KKEEVETERPRHSR NSEELAESTLSN+KSRIPRVP+PPPKPSSSSSSSAT+SSS  STGSS D EK IPA
Subjt:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP PAKVRRIPEVVEFYHSLMRRDSRR+ GSGVT+PPS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIR LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  H
        H
Subjt:  H

A0A5D3CMM2 Protein CHUP10.0e+0087.37Show/hide
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPK STPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEEVKQSVEE+RRESQER+KAMEGEI+ELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIR+LKR TK SD VV QDNHKVE PE KKEEVETERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSSAT+SSSS STGSS D EK IPA
Subjt:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP PAKVRRIPEVVEFYHSLMRRDSRR+ GSGVT+PPS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  H-QSQLQRALFFGCKDHLSYAFGNASAASSLGDFRFHSYLHSNNHPSSSAHLIC----PRVFRLVKAAAALSMEVEKGAQSAPVGSTALPMKLLFVEMGV
        H Q Q Q+                          +F SY  S NH SS  HLIC    PR+ RL+K  A  SMEVE+G +SAPV ST  PMKLLFVEMGV
Subjt:  H-QSQLQRALFFGCKDHLSYAFGNASAASSLGDFRFHSYLHSNNHPSSSAHLIC----PRVFRLVKAAAALSMEVEKGAQSAPVGSTALPMKLLFVEMGV

Query:  GYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP
        GYDQHGQDITAAAMRACRDAI SNSIPAFRRG+IPGVSFGEMKLQIKLGVP SLQQSLD+EKVKSVFP
Subjt:  GYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP

A0A6J1EFK1 protein CHUP1, chloroplastic-like0.0e+00100Show/hide
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
Subjt:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  H
        H
Subjt:  H

A0A6J1KWU6 protein CHUP1, chloroplastic-like0.0e+0099.33Show/hide
Query:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPA RKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
        NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA
Subjt:  NLIRSLKRPTKFSDTVVTQDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPA

Query:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPT AKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  H
        H
Subjt:  H

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic1.6e-8247.76Show/hide
Query:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       +V+T    Q N   E  E K  E           N+  + +  L +++ R P
Subjt:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIP

Query:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRRE-
        RVP+PPP+ +    S+   S+     G           PPP P  P   PPPPP     PPPPPP   G+      KV R PE+VEFY SLM+R+S++E 
Subjt:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRRE-

Query:  ----LGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALRE
            + SG     SSA   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALRE
Subjt:  ----LGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALRE

Query:  AAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVG
        AAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V 
Subjt:  AAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVG

Query:  GG---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        G    P  E L+++GVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  GG---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Q9LI74 Protein CHUP1, chloroplastic2.0e+0023.56Show/hide
Query:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESQERVK
        ++  L QLV+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L          R E+E  +  ++E +R+ Q    
Subjt:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESQERVK

Query:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQDNHKVEQPEAK
          +G++  LK+     +  E    N +    ++ + + ++  +   +  LKR  +       + + K++  EA+
Subjt:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQDNHKVEQPEAK

Arabidopsis top hitse value%identityAlignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein1.5e-5936.5Show/hide
Query:  ELRDREARLKTDLLEH-KLLKESVAIVPMLENEIATKDAEIE-----------RASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAE
        ++++  A+ ++ LL+  K  +E +A++         + A +E           ++ + ++   A  +  R  +EE    +EE+   ++  +K ++ ++  
Subjt:  ELRDREARLKTDLLEH-KLLKESVAIVPMLENEIATKDAEIE-----------RASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAE

Query:  LKKMALDRR--RMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQD-----NHKVEQPEAKKE-EVETERPRHSRSNSEELAESTLSN
        LK    + R   +EL L N +LS     Q L+    K + + S  +P K       +D       K+EQP+ KKE  VE+ R             S  S 
Subjt:  LKKMALDRR--RMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQD-----NHKVEQPEAKKE-EVETERPRHSRSNSEELAESTLSN

Query:  VKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRD
          SR+P  P P PK   S +SS      ++S                    P  PP PPPP    PPPPPP P  K    A+ ++ P V + +  L ++D
Subjt:  VKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRD

Query:  SRRELGSGVTEPPSSANA--RDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADAL
        + R L   V    S  N+    ++GEI+NRS HL+AIK D+ET+G+FI  LI++V    F+D+EDV+ FV WLD EL+ L DERAVLKHF+WPE+KAD L
Subjt:  SRRELGSGVTEPPSSANA--RDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADAL

Query:  REAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELET
        +EAA  Y +LKKLE E SS+  D     G ALKKM  LL+K E  +  L R+R S+ + Y+ F+IPVEWMLDSG++ +IK  S+KLA  YM RV+ EL++
Subjt:  REAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELET

Query:  VGGGPEE---EELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
              E   E L+++GVRFA+R HQFAGG D ET+ A +E++ +  S
Subjt:  VGGGPEE---EELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein1.1e-8347.76Show/hide
Query:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       +V+T    Q N   E  E K  E           N+  + +  L +++ R P
Subjt:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIP

Query:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRRE-
        RVP+PPP+ +    S+   S+     G           PPP P  P   PPPPP     PPPPPP   G+      KV R PE+VEFY SLM+R+S++E 
Subjt:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRRE-

Query:  ----LGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALRE
            + SG     SSA   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALRE
Subjt:  ----LGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALRE

Query:  AAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVG
        AAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V 
Subjt:  AAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVG

Query:  GG---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        G    P  E L+++GVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  GG---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein1.4e-0123.56Show/hide
Query:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESQERVK
        ++  L QLV+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L          R E+E  +  ++E +R+ Q    
Subjt:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESQERVK

Query:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQDNHKVEQPEAK
          +G++  LK+     +  E    N +    ++ + + ++  +   +  LKR  +       + + K++  EA+
Subjt:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQDNHKVEQPEAK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein1.1e-8347.76Show/hide
Query:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       +V+T    Q N   E  E K  E           N+  + +  L +++ R P
Subjt:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIP

Query:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRRE-
        RVP+PPP+ +    S+   S+     G           PPP P  P   PPPPP     PPPPPP   G+      KV R PE+VEFY SLM+R+S++E 
Subjt:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRRE-

Query:  ----LGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALRE
            + SG     SSA   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALRE
Subjt:  ----LGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALRE

Query:  AAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVG
        AAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V 
Subjt:  AAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVG

Query:  GG---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        G    P  E L+++GVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  GG---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein1.4e-0123.56Show/hide
Query:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESQERVK
        ++  L QLV+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L          R E+E  +  ++E +R+ Q    
Subjt:  DVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERL----------RVEVEEVKQSVEEQRRESQERVK

Query:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQDNHKVEQPEAK
          +G++  LK+     +  E    N +    ++ + + ++  +   +  LKR  +       + + K++  EA+
Subjt:  AMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQDNHKVEQPEAK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein1.1e-8347.76Show/hide
Query:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       +V+T    Q N   E  E K  E           N+  + +  L +++ R P
Subjt:  KMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVT----QDNHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIP

Query:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRRE-
        RVP+PPP+ +    S+   S+     G           PPP P  P   PPPPP     PPPPPP   G+      KV R PE+VEFY SLM+R+S++E 
Subjt:  RVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPKGKRPTPA-KVRRIPEVVEFYHSLMRRDSRRE-

Query:  ----LGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALRE
            + SG     SSA   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALRE
Subjt:  ----LGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALRE

Query:  AAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVG
        AAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ EL++V 
Subjt:  AAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVG

Query:  GG---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        G    P  E L+++GVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  GG---PEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.0e-16659.32Show/hide
Query:  MVAGKVKLAMGLQKSPASRKVESSPK------PSTPAQPSPSSGKIS-------QKTVFSRSFGVYFPRSSAQVQPRLPD------VTELLQLVEELRDR
        MVAGKV++ MG  KSP+++K +  P       P  P    PSSG  +        K  F+RSFGVYFPR+SAQV            V+EL + VEELR+R
Subjt:  MVAGKVKLAMGLQKSPASRKVESSPK------PSTPAQPSPSSGKIS-------QKTVFSRSFGVYFPRSSAQVQPRLPD------VTELLQLVEELRDR

Query:  EARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILEN
        EA LKT+ LE KLL+ESV+++P+LE++IA K+ EI+   K    L  +NERLR E +      EE RRE + R K ME EI EL+K+           ++
Subjt:  EARLKTDLLEHKLLKESVAIVPMLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILEN

Query:  DELSASQRFQGLMEVSGKSNLIRSLKRP---TKFSDTVVTQDNHKVEQPEA--------KKEEVETERPRHSR-SNSEELAE-STLSNVKSRIPRVPKPP
          LS SQRFQGLM+VS KSNLIRSLKR        + +  Q+N       +        +K+E+E+    +SR SNSEEL E S+LS V+SR+PRVPKPP
Subjt:  DELSASQRFQGLMEVSGKSNLIRSLKRP---TKFSDTVVTQDNHKVEQPEA--------KKEEVETERPRHSR-SNSEELAE-STLSNVKSRIPRVPKPP

Query:  PKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKS---APPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDS----RREL
        PK S        S   ST   +    +K IP PPP P  P    PPPPPS S    PPPPPPPPK      AKVRR+PEVVEFYHSLMRRDS    R   
Subjt:  PKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKS---APPPPPPPPKGKRPTPAKVRRIPEVVEFYHSLMRRDS----RREL

Query:  GSGVTEPP---SSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA
        G G        +++NARDMIGEIENRS +LLAIKTDVETQGDFIRFLIKEV NA+F+DIEDVVPFVKWLDDELSYLVDERAVLKHF+WPEQKADALREAA
Subjt:  GSGVTEPP---SSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETV-GG
        F Y DLKKL +EAS FR D RQ   SALKKMQAL EKLEHGVY+LSRMRESA  ++K+FQIPV+WML++GI SQIKL SVKLAMKYMKRVSAELE + GG
Subjt:  FGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETV-GG

Query:  GPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        GPEEEELIV+GVRFAFRVHQFAGGFD ETM+AF+ELRDKA SCH
Subjt:  GPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGCTGGGAAGGTGAAGCTCGCAATGGGGCTGCAGAAGTCTCCGGCGAGTAGAAAGGTTGAGAGTTCACCGAAGCCATCCACCCCGGCACAGCCTTCTCCGAGCTC
TGGTAAAATTTCTCAGAAAACTGTCTTCTCCCGCTCGTTTGGTGTGTATTTCCCGCGCTCTTCTGCTCAGGTTCAGCCTCGACTGCCTGACGTGACGGAGCTCCTCCAGT
TGGTGGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTGTTGAAGGAATCTGTCGCCATTGTTCCTATGCTTGAGAACGAGATCGCT
ACGAAAGATGCGGAGATTGAGAGAGCGTCTAAGCGGATACTGTTTTTGGAGGCGGAGAATGAGAGATTAAGAGTTGAAGTGGAGGAAGTTAAACAGAGTGTTGAGGAACA
GAGGAGAGAGAGTCAAGAGAGAGTAAAAGCAATGGAAGGTGAAATCGCGGAGCTGAAGAAAATGGCGTTGGATCGACGCAGAATGGAGCTTATTTTGGAGAACGACGAGC
TTTCGGCGTCGCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAATCTAACCTAATCAGGAGCCTCAAAAGACCGACCAAGTTTTCGGATACTGTTGTTACTCAAGAC
AATCATAAGGTTGAACAACCAGAGGCGAAGAAAGAAGAGGTTGAAACCGAGAGACCGAGACACTCGCGAAGTAATTCTGAAGAACTCGCCGAGTCCACTCTATCTAACGT
AAAATCGCGAATACCTAGGGTTCCAAAACCTCCTCCCAAACCTTCTTCCTCTTCCTCTTCCTCTGCCACTTCCTCCTCCTCCTCCACATCAACTGGCTCTTCTGGTGATG
CTGAGAAAAAGATCCCAGCCCCACCCCCTGTCCCAACCAAGCCAACGCCACCGCCGCCTCCTCCTCCACCTTCGAAGTCGGCTCCGCCTCCTCCTCCACCGCCTCCCAAG
GGTAAGAGGCCGACGCCGGCGAAGGTGCGACGAATACCGGAGGTTGTGGAGTTCTATCACTCATTAATGCGGAGGGATTCTCGGCGAGAACTCGGCTCCGGTGTTACGGA
ACCGCCGTCCTCCGCCAATGCTCGTGACATGATCGGAGAGATCGAGAACCGGTCCACCCACTTGCTCGCTATAAAGACGGATGTAGAGACTCAAGGGGATTTCATAAGGT
TCTTGATCAAAGAAGTTGAAAATGCTTCATTTACTGACATCGAGGACGTTGTGCCATTTGTGAAATGGTTGGATGATGAGCTCTCATATCTGGTGGATGAAAGAGCCGTG
CTTAAGCACTTCCAGTGGCCGGAGCAGAAGGCCGATGCTCTACGTGAGGCTGCATTTGGCTATTGTGATCTAAAGAAGCTGGAAGCCGAAGCGTCATCCTTTCGTGGTGA
TGCCCGCCAGCCCTGTGGTTCGGCTCTCAAGAAGATGCAAGCTTTGCTTGAAAAGTTGGAGCATGGAGTATACAATCTGTCTAGAATGCGTGAATCTGCAACTAAGAGAT
ACAAAGCATTTCAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAAAGAGTATCCGCA
GAGCTTGAAACAGTGGGTGGTGGACCTGAAGAAGAAGAGTTGATTGTTCGAGGCGTTCGATTTGCTTTCCGTGTGCATCAGTTTGCTGGAGGGTTTGATGTGGAAACAAT
GAGGGCATTTCAAGAGCTGAGAGACAAGGCAAGTTCATGTCACCAATCTCAGCTTCAGAGGGCTCTGTTTTTTGGGTGTAAAGATCATCTTTCTTACGCATTTGGCAACG
CGTCAGCTGCCAGTTCATTGGGTGATTTCAGGTTTCATTCTTACCTTCATTCCAATAATCATCCTTCATCTTCTGCGCATTTGATCTGCCCTCGAGTTTTTCGCCTTGTC
AAAGCTGCTGCTGCGTTGTCCATGGAGGTCGAGAAAGGTGCACAATCTGCTCCTGTTGGCAGCACAGCTCTGCCCATGAAGCTCTTGTTTGTCGAGATGGGCGTTGGCTA
CGATCAACATGGTCAAGATATCACGGCTGCTGCAATGCGAGCCTGCAGGGATGCCATATGCTCCAATTCGATTCCAGCATTCCGTAGAGGTACCATTCCTGGAGTTTCAT
TTGGAGAGATGAAACTACAGATCAAACTTGGAGTTCCACAGTCGCTTCAACAATCCTTGGATGTTGAAAAAGTCAAGTCCGTCTTTCCATAG
mRNA sequenceShow/hide mRNA sequence
CTTTTTCAAATTGGCGAATGAGATCCAGGGAAATACGACCGTTCCTCGTGATCGACGCCATTGCCATCTTGCGTCTGTTGTCTTCGTCGCTTCTCCCACCCAAACCCAAA
CGTTCTCTGTTACTTACTTTTTTTTCCTTTTAACAATTCTGAAATTTTTTATTTACACAAACTCCAACTTTCTTTATCTCTCTTTCTCTCTCAGATGACACGAAGACAAG
GACGTTTCGTTCACACTCTAAAGTAGATTGAAAATTGGAGGACTGTAAAACCTCAGTCCCGACCTCCTTACCCCGAAGCTCAGAGAGATACAGACAGAGCGAGAGAGAGA
GCTTAGAAATGGTAGCTGGGAAGGTGAAGCTCGCAATGGGGCTGCAGAAGTCTCCGGCGAGTAGAAAGGTTGAGAGTTCACCGAAGCCATCCACCCCGGCACAGCCTTCT
CCGAGCTCTGGTAAAATTTCTCAGAAAACTGTCTTCTCCCGCTCGTTTGGTGTGTATTTCCCGCGCTCTTCTGCTCAGGTTCAGCCTCGACTGCCTGACGTGACGGAGCT
CCTCCAGTTGGTGGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTGTTGAAGGAATCTGTCGCCATTGTTCCTATGCTTGAGAACG
AGATCGCTACGAAAGATGCGGAGATTGAGAGAGCGTCTAAGCGGATACTGTTTTTGGAGGCGGAGAATGAGAGATTAAGAGTTGAAGTGGAGGAAGTTAAACAGAGTGTT
GAGGAACAGAGGAGAGAGAGTCAAGAGAGAGTAAAAGCAATGGAAGGTGAAATCGCGGAGCTGAAGAAAATGGCGTTGGATCGACGCAGAATGGAGCTTATTTTGGAGAA
CGACGAGCTTTCGGCGTCGCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAATCTAACCTAATCAGGAGCCTCAAAAGACCGACCAAGTTTTCGGATACTGTTGTTA
CTCAAGACAATCATAAGGTTGAACAACCAGAGGCGAAGAAAGAAGAGGTTGAAACCGAGAGACCGAGACACTCGCGAAGTAATTCTGAAGAACTCGCCGAGTCCACTCTA
TCTAACGTAAAATCGCGAATACCTAGGGTTCCAAAACCTCCTCCCAAACCTTCTTCCTCTTCCTCTTCCTCTGCCACTTCCTCCTCCTCCTCCACATCAACTGGCTCTTC
TGGTGATGCTGAGAAAAAGATCCCAGCCCCACCCCCTGTCCCAACCAAGCCAACGCCACCGCCGCCTCCTCCTCCACCTTCGAAGTCGGCTCCGCCTCCTCCTCCACCGC
CTCCCAAGGGTAAGAGGCCGACGCCGGCGAAGGTGCGACGAATACCGGAGGTTGTGGAGTTCTATCACTCATTAATGCGGAGGGATTCTCGGCGAGAACTCGGCTCCGGT
GTTACGGAACCGCCGTCCTCCGCCAATGCTCGTGACATGATCGGAGAGATCGAGAACCGGTCCACCCACTTGCTCGCTATAAAGACGGATGTAGAGACTCAAGGGGATTT
CATAAGGTTCTTGATCAAAGAAGTTGAAAATGCTTCATTTACTGACATCGAGGACGTTGTGCCATTTGTGAAATGGTTGGATGATGAGCTCTCATATCTGGTGGATGAAA
GAGCCGTGCTTAAGCACTTCCAGTGGCCGGAGCAGAAGGCCGATGCTCTACGTGAGGCTGCATTTGGCTATTGTGATCTAAAGAAGCTGGAAGCCGAAGCGTCATCCTTT
CGTGGTGATGCCCGCCAGCCCTGTGGTTCGGCTCTCAAGAAGATGCAAGCTTTGCTTGAAAAGTTGGAGCATGGAGTATACAATCTGTCTAGAATGCGTGAATCTGCAAC
TAAGAGATACAAAGCATTTCAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAAAGAG
TATCCGCAGAGCTTGAAACAGTGGGTGGTGGACCTGAAGAAGAAGAGTTGATTGTTCGAGGCGTTCGATTTGCTTTCCGTGTGCATCAGTTTGCTGGAGGGTTTGATGTG
GAAACAATGAGGGCATTTCAAGAGCTGAGAGACAAGGCAAGTTCATGTCACCAATCTCAGCTTCAGAGGGCTCTGTTTTTTGGGTGTAAAGATCATCTTTCTTACGCATT
TGGCAACGCGTCAGCTGCCAGTTCATTGGGTGATTTCAGGTTTCATTCTTACCTTCATTCCAATAATCATCCTTCATCTTCTGCGCATTTGATCTGCCCTCGAGTTTTTC
GCCTTGTCAAAGCTGCTGCTGCGTTGTCCATGGAGGTCGAGAAAGGTGCACAATCTGCTCCTGTTGGCAGCACAGCTCTGCCCATGAAGCTCTTGTTTGTCGAGATGGGC
GTTGGCTACGATCAACATGGTCAAGATATCACGGCTGCTGCAATGCGAGCCTGCAGGGATGCCATATGCTCCAATTCGATTCCAGCATTCCGTAGAGGTACCATTCCTGG
AGTTTCATTTGGAGAGATGAAACTACAGATCAAACTTGGAGTTCCACAGTCGCTTCAACAATCCTTGGATGTTGAAAAAGTCAAGTCCGTCTTTCCATAGGTCCCTGTTG
CACTGAAAGTTATTGGATCACTTTTCAATCCATGTTTTCAATTTCTGGTTCCTTGCTTATCAAAAATTAAGCGTGACAAGCTTTCACCTCACTATAGTGGGAAGATTCTG
AATGTTGAGGTTGTCGAGGGTGGCTTAATATGCTCCAGCGGTGTTCATGTAGAAGAAATGGGAGACAAAAATGATGACTGTTACATAGTAAATGCCGCCGTATATGTTGG
CTACTAATTCTTGTTGCCTTATTTTCCTTTCGAAGACCTTGAGGATTATTTACCAGTCATCTTTGATTGTAACTACACAGTTAGTTATGCTTGATGCATACATTGACGTT
TCAGTTTCCTTGGAGATATTGCTAGAACCGCAGGGTTATTTCTGGGCTCGATTAATCGCTCAATTTTAGTATTCCTTGGTTTCAACGTTCTGCAATATACTGTTTCAAAC
TCATTTTTTGGTTATCAGCATCTATGATCAATAAACGAACTGCCCATATCCAGTTCGTGGAATGATTCAATCGGACCCATATCTTAAAGATGCGCAATAGAGTATTTTAT
TCCAGTTTATTACCAAGCAAGATGCCATCTTGATACCATCTCACTTGTATTCTTTCCTTATCGATTTGAAACAATTTTCTGAAACTGCAACAAATAGAATCATCAATTAC
TTTGTACAATTGCTTTGTTGCCATTACATTTTTTTTCTTCCAATTCCCGTTAAGGTGGTTAATTAAATGAATGAAGTTCTTTCTTCTAACACTTCCTGTGAGGAATTGCA
GAGACGAAAAGAACCCCAAGTGTTTCAGGCTTGGCCTTCACATGGTAAAAACTGCATAGAGAAGCTGGTTCCAAGTGTCGGGCAAGCCGGGAAGGCACCAGTGGCTGCAA
TCTGGTCCTCCGCCTTCCCCGCTGTAAGTAGAGGGATGTGCATCTTTCCGGAGTTGCGACAACGTTGTGATGTCCAGCAAGAAAACGGGCTTCCTTATTCTACTCAGCAC
TCTCTTCACAATTTCCACCGCTGGTGGCGCCCCTGCTGGATACAGTGACCCTGAGAGCGGTACGCTTTCTCCATTGCAACTCCTCCTTGGTTGGTTCCAATCCTTTCCCC
TGGTAAAAAGTGTTAACATCAATTTTTCCAGTTGAGAAAGAAAAACAAAGTTATATTCTTAAATTCTGCAGAAACTAGTTGAGCAGGTATAACTAACGGAGTTTTCTGGT
GTGAAAATTACTGCTCAACAGCAAAATAGTTAAAAAGAAAACGAGGAATAGCAATGCATTAAATAGGTAGACACAAGTAGGACATTGTTTTTCGTCGGCAGATGAAAAAG
CGATTTGCATGTCAATTTCCATTTTCATGTATACACTTTTACAGACCTCCGATGACAGGCGTGAGTTTTACTTACTCATAATGAGTGGGGGAGATTCCCTGGAAAATGAC
TCTGGTTTTGCTTGGATCAACATTCATGTCCACCCATCTTGCCCAGGTGGTCAGCCCTTGATAGAAGGCCTCCAGACGATCCATATCTTTCTTCACTGCATTTCCTACTT
GCACATAATCCCACCTGAGCAACAGCACTCGCAACCATCAGATTTTACCTTACCATCACCAACCACATCAGCATGATGAAGAAAAAAAAAGAGGTTGATCTTACGCTTGG
GATCTTCCAGTGTGGGTCCACCAGTGCCAAGAGTTGAATATGAGCACGTCCATTCCCTTCCACACATTGCCTCCTTCAATGGAATCAAGCTTTAGTACTCGTCCAACCTT
CTCTCTCACTACGTCCACAAGGTAGGGCGTTCTGTGAAGAAGCAAGCTTACTCCATACTCCTGCACACACCAGATTGTTCCCTCTTCTTCATTGGTTAGGAGAACCCGAA
AAACAGCTTTTGAAAAAGTGAATAAAGAATAGAGAATAAAGAAGCAGACAATTAGTATACTGTGAAGTGGGAGATTTTGAAACAGAAAATTGGAGATAAAGAAAAAGGGG
AAAGAACTCGCCTCGAAAATCACAGTAGAGAGTGATTCCCTCCTGACTATGGAGATTTTGGCTTTTGGTGCCGACGCTAGGATCATACATGTCAAGGATTGCCACATGTT
CAGACTCAGTGAGTCGCCTACAAACATTATCCTCTTCCCTCTCCACCTACTCAGAAGCTCCAGCCCATCAAACCTTCAAAGAAATGAACACATTTTCTTAGACAGACAGA
GATAGCTGCTTTGATTTTGCTCCCACATTTTCAGAGAAGGAAGCTTGACTTTTGCAAGGAGAGCAGGATTGGCTAAAAATGGAAAACCTTACAAGAAGGAAAGAAAAGAA
TGGAGGAGTAGTACCTTGGAAGATCACAGAAGTCAGGTTTCCAAGTGTACTTGAGGTAAGATCGATCTGGTCTGCCATACTTTTGGCAGTTAAACTCAGGGTCTATGAAT
GGACAGTTTGAAGATTCATAAAGAGGCAAAGAAGGATCAAAAACCCATTTGCCTTGGAACAAATTGCACCCCCTCCCTAGCTTGCTACTTCCCACATTGCTTATGTTATA
GAAATCCTCAGCTTTAGCAGCTCCAAGCAAAAAGAGAAACAGAAGTTGTAAAAGCAGGAGAGACAGAGCTCTGAATCGAAAACCCATCTCAGATTTTGGGCTCTGACTGC
AAAGAAAGGGAGGGGAGCTTTGGGTAATAGAAATAGAATAGAAAGAAGCAAGTGGGTCTATATAGAGACAAACAAGAGAAAAGCCTGATTATTATCATTATCATTATCAT
TACTATTATTTAAATCTTTTAAAACTTTATTCCATGAGCTGGGAATTGAATTATCTTGGCTGGATTGATGAAGGCAGATACAATTGTGGGAGTTGAAATTGAGAGTGAGT
TGCAGTGCGCTGGTTTGCACATCGAGGGCCATAGCATATTACCCACCAAGCAATGATGCACATGGCTCAATGATGTTTTGGTTATCTAACTACCCAACTTTATTTCTTAT
TTTTTTATTCTACCTTTCAGTTTCTTCATTCAAATATCCATATAATATATCGATGCAATTACATGTATTAAAGGATTCCAAAGTGAGTTTGAATGATTTGGTGGAATCAA
GTATGTCCCATCAGCCTAACGTGTTGCTAAAGCTATGTTGATGTTCTTATGTATGATAACTTTATCA
Protein sequenceShow/hide protein sequence
MVAGKVKLAMGLQKSPASRKVESSPKPSTPAQPSPSSGKISQKTVFSRSFGVYFPRSSAQVQPRLPDVTELLQLVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIA
TKDAEIERASKRILFLEAENERLRVEVEEVKQSVEEQRRESQERVKAMEGEIAELKKMALDRRRMELILENDELSASQRFQGLMEVSGKSNLIRSLKRPTKFSDTVVTQD
NHKVEQPEAKKEEVETERPRHSRSNSEELAESTLSNVKSRIPRVPKPPPKPSSSSSSSATSSSSSTSTGSSGDAEKKIPAPPPVPTKPTPPPPPPPPSKSAPPPPPPPPK
GKRPTPAKVRRIPEVVEFYHSLMRRDSRRELGSGVTEPPSSANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAV
LKHFQWPEQKADALREAAFGYCDLKKLEAEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA
ELETVGGGPEEEELIVRGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHQSQLQRALFFGCKDHLSYAFGNASAASSLGDFRFHSYLHSNNHPSSSAHLICPRVFRLV
KAAAALSMEVEKGAQSAPVGSTALPMKLLFVEMGVGYDQHGQDITAAAMRACRDAICSNSIPAFRRGTIPGVSFGEMKLQIKLGVPQSLQQSLDVEKVKSVFP