; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021588 (gene) of Snake gourd v1 genome

Gene IDTan0021588
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein CHUP1, chloroplastic
Genome locationLG03:78833126..78836844
RNA-Seq ExpressionTan0021588
SyntenyTan0021588
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
GO:0005525 - GTP binding (molecular function)
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134665.1 protein CHUP1, chloroplastic [Cucumis sativus]0.0e+0094.68Show/hide
Query:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPA+RKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEIS KD EIERASKRILFLEAENERLRV+VEE KQS+EE+RRESQERIKAMEGE+AELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA
        NLIRNLKRAT+ SD+VVNQDNHKVEHPEAKKEEVETERPRHSR NSEELAESTLSNIKSRIPRVPKPPPKPSSSSS SAT+S+SSSSTGSS ++EK IPA
Subjt:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA

Query:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTK MPPPPPPPSKSAPPPPPPPPKGKR MPAKVRR+PEVVEFYHSLMRRDSRR+ GSG+TEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRE+A KRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKYVCSSRPTTC
        VQCQNQQQHKYV SSRPTTC
Subjt:  VQCQNQQQHKYVCSSRPTTC

XP_008439756.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo]0.0e+0094.68Show/hide
Query:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPA+RKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEIS KD EIERASKRILFLEAENERLRV+VEEVKQS+EE+RRESQERIKAMEGEI+ELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA
        NLIRNLKRAT+ SD+VVNQDNHKVEHPE KKEEVETERPRHSR NSEELAESTLSNIKSRIPRVP+PPPKPSSSSS SAT  +SSSSTGSS ++EK IPA
Subjt:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA

Query:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRR+PEVVEFYHSLMRRDSRR+ GSG+T+PPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IR LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRE+A KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKYVCSSRPTTC
        VQCQNQQQHKYV SSRPTTC
Subjt:  VQCQNQQQHKYVCSSRPTTC

XP_022926872.1 protein CHUP1, chloroplastic-like [Cucurbita moschata]1.6e-30894.21Show/hide
Query:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPA+RKVESSPKPSTPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEI+ KD EIERASKRILFLEAENERLRVEVEEVKQS+EEQRRESQER+KAMEGEIAELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA
        NLIR+LKR T+FSD+VV QDNHKVE PEAKKEEVETERPRHSRSNSEELAESTLSN+KSRIPRVPKPPPKPSSSSS SATSSSSS+STGSSG+ EK IPA
Subjt:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA

Query:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP PAKVRR+PEVVEFYHSLMRRDSRRE GSG+TEPPS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRE+ATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQQHKYVCSS-RPTTC
        HVQCQN QQHKYVCSS RPTTC
Subjt:  HVQCQNQQQHKYVCSS-RPTTC

XP_023518440.1 protein CHUP1, chloroplastic [Cucurbita pepo subsp. pepo]2.1e-30894.37Show/hide
Query:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPA+RKVESSPKPSTPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEI+ KD EIERASKRILFLEAENERLRVEVEEVKQS+EEQRRESQER+KAMEGEIAELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA
        NLIR+LKR T+FSD+VV QDNHKVE PEAKKEEVETERPRHSRSNSEELAESTLS+IKSRIPRVPKPPPKPSSSSS SATSSSSSSSTGSSG+ EK IPA
Subjt:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA

Query:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP PAKVRR+PEVVEFYHSLMRRDSRRE GSG+TEPPS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRE+ATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQQHKYVCSS-RPTTC
        HVQCQN QQHKYVCSS RPTTC
Subjt:  HVQCQNQQQHKYVCSS-RPTTC

XP_038883847.1 protein CHUP1, chloroplastic [Benincasa hispida]2.8e-30894.19Show/hide
Query:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPA+RKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEIS KD EIERASKRILFLEAENERLRVEVEEVKQS+EE+RRESQERIKAME EIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA
        NLIRNLKR T+ S++VVNQDNHK EHPEAKKEEVETERPRHSR NSEELAE TLSNIKSRIPRVPKPPPKPSSSSS SA  ++SSSSTGSSG++EK IPA
Subjt:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA

Query:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMP KVRR+PEVVEFYHSLMRRDSRR+ GS +T+PPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRE+ATKRYKAFQIPVEWMLDSGIV QIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKYVCSSRPTTC
        VQCQN QQHKYV SSRPTTC
Subjt:  VQCQNQQQHKYVCSSRPTTC

TrEMBL top hitse value%identityAlignment
A0A0A0KHU8 Uncharacterized protein0.0e+0094.68Show/hide
Query:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPA+RKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEIS KD EIERASKRILFLEAENERLRV+VEE KQS+EE+RRESQERIKAMEGE+AELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA
        NLIRNLKRAT+ SD+VVNQDNHKVEHPEAKKEEVETERPRHSR NSEELAESTLSNIKSRIPRVPKPPPKPSSSSS SAT+S+SSSSTGSS ++EK IPA
Subjt:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA

Query:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTK MPPPPPPPSKSAPPPPPPPPKGKR MPAKVRR+PEVVEFYHSLMRRDSRR+ GSG+TEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRE+A KRYKAFQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKYVCSSRPTTC
        VQCQNQQQHKYV SSRPTTC
Subjt:  VQCQNQQQHKYVCSSRPTTC

A0A1S3AZH3 protein CHUP1, chloroplastic0.0e+0094.68Show/hide
Query:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPA+RKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEIS KD EIERASKRILFLEAENERLRV+VEEVKQS+EE+RRESQERIKAMEGEI+ELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA
        NLIRNLKRAT+ SD+VVNQDNHKVEHPE KKEEVETERPRHSR NSEELAESTLSNIKSRIPRVP+PPPKPSSSSS SAT  +SSSSTGSS ++EK IPA
Subjt:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA

Query:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRR+PEVVEFYHSLMRRDSRR+ GSG+T+PPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IR LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRE+A KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKYVCSSRPTTC
        VQCQNQQQHKYV SSRPTTC
Subjt:  VQCQNQQQHKYVCSSRPTTC

A0A5D3CMM2 Protein CHUP13.0e-30894.93Show/hide
Query:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPA+RKVESSPK STPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVP+LENEIS KD EIERASKRILFLEAENERLRV+VEEVKQS+EE+RRESQER+KAMEGEI+ELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA
        NLIRNLKRAT+ SD+VVNQDNHKVEHPE KKEEVETERPRHSR NSEELAESTLSNIKSRIPRVPKPPPKPSSSSS SAT +SSSSSTGSS ++EK IPA
Subjt:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA

Query:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
        PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRR+PEVVEFYHSLMRRDSRR+ GSG+T+PPSTANARDMIGEIENRSAHLLAIKTDVETQGDF
Subjt:  PPPVPTKPMPPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVY

Query:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
        NLSRMRE+A KRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH
Subjt:  NLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCH

Query:  VQCQNQQQHKY
        VQCQNQQQHK+
Subjt:  VQCQNQQQHKY

A0A6J1EFK1 protein CHUP1, chloroplastic-like7.8e-30994.21Show/hide
Query:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPA+RKVESSPKPSTPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEI+ KD EIERASKRILFLEAENERLRVEVEEVKQS+EEQRRESQER+KAMEGEIAELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA
        NLIR+LKR T+FSD+VV QDNHKVE PEAKKEEVETERPRHSRSNSEELAESTLSN+KSRIPRVPKPPPKPSSSSS SATSSSSS+STGSSG+ EK IPA
Subjt:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA

Query:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP PAKVRR+PEVVEFYHSLMRRDSRRE GSG+TEPPS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRE+ATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQQHKYVCSS-RPTTC
        HVQCQN QQHKYVCSS RPTTC
Subjt:  HVQCQNQQQHKYVCSS-RPTTC

A0A6J1KWU6 protein CHUP1, chloroplastic-like5.1e-30894.37Show/hide
Query:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKLAMGLQKSPA RKVESSPKPSTPAQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS
        IVPMLENEI+ KD EIERASKRILFLEAENERLRVEVEEVKQS+EEQRRESQER+KAMEGEIAELKKMALDR RMELILENDELSASQRFQGLMEVSGKS
Subjt:  IVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA
        NLIR+LKR T+FSD+VV QDNHKVE PEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSS SATSSSSS+STGSSG+ EK IPA
Subjt:  NLIRNLKRATRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPA

Query:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD
        PPPVPTKP  PPPPPPPSKSAPPPPPPPPKGKRP  AKVRR+PEVVEFYHSLMRRDSRRE GSG+TEPPS+ANARDMIGEIENRSAHLLAIKTDVETQGD
Subjt:  PPPVPTKPM-PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLEHGV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGV

Query:  YNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRE+ATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQQHKYVCSS-RPTTC
        HVQCQN QQHKYVCSS RPTTC
Subjt:  HVQCQNQQQHKYVCSS-RPTTC

SwissProt top hitse value%identityAlignment
Q1PEB4 Uncharacterized protein At4g049809.1e-0423.01Show/hide
Query:  PKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISAKDVEIERASKRIL
        P P TP    P +  +S     S S  ++  R+ A  +  P D+      +   RD E+  +T +   +  +ES         EI A++ E E     +L
Subjt:  PKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISAKDVEIERASKRIL

Query:  FLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRF-SDSVVNQDNHK
          E   + ++ E    + S  E   E+++  +  E E            R+E      E+ A    +   +   + + I  L+      ++  +     +
Subjt:  FLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRF-SDSVVNQDNHK

Query:  VEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPMPPPPPPPSK-----
          H E  + E E E   HS + + E  +ST S+ K  +P    PPP  +S  +PS T   S+ +T SS   +   P P P    P PPPPPP SK     
Subjt:  VEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPMPPPPPPPSK-----

Query:  -----------------SAPPPPPPPPKGK--RPMPAKVRRVPEVVEFYHSL------------MRRDSRREPGSGITEPPSTANA--RDMIGEIENRSA
                         S P PP PP  G+  +   +K+RR  ++   Y +L             ++ S+ +       P   A +   D + E+  RS+
Subjt:  -----------------SAPPPPPPPPKGK--RPMPAKVRRVPEVVEFYHSL------------MRRDSRREPGSGITEPPSTANA--RDMIGEIENRSA

Query:  HLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ-WPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSA
        +   I+ DV+     I  L   + +    D+++++ F   ++  L  L DE  VL  F+ +PE+K + +R A   Y  L  +  E  +++     P    
Subjt:  HLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ-WPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSA

Query:  LKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVS---VKLAMKYMKRVSAELETVGGGPEEEE---LIVQGVRFAFRVHQFA
        L K++    K +  +  + R ++   K +K + I +++ +   +   +  VS   ++LA+K  +  + E +       +EE    + +  +FAF+V+ FA
Subjt:  LKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVS---VKLAMKYMKRVSAELETVGGGPEEEE---LIVQGVRFAFRVHQFA

Query:  GGFD
        GG D
Subjt:  GGFD

Q9LI74 Protein CHUP1, chloroplastic4.7e-8547.79Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVV----NQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       SV+    +Q N   E  E K  E           N+  + +  L +I+ R P
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVV----NQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP

Query:  RVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRVPEVVEFYHSLM
        RVP+PPP+              S+  G S N+     A PP+P    PPPPPPP    P      PPPPPPP G     A    KV R PE+VEFY SLM
Subjt:  RVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRVPEVVEFYHSLM

Query:  RRDSRREPGSGITEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQK
        +R+S++E    +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE K
Subjt:  RRDSRREPGSGITEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQK

Query:  ADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA
        ADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ 
Subjt:  ADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA

Query:  ELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Q9LI74 Protein CHUP1, chloroplastic4.2e-0124.14Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISAKDVEIERASKRILFLEAENERL----------RVEVEEVKQSIEEQRRESQERIK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K VEI+  +  I  L+AE ++L          R E+E  +  I+E +R+ Q    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISAKDVEIERASKRILFLEAENERL----------RVEVEEVKQSIEEQRRESQERIK

Query:  AMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVVNQDNHKVEHPEAK
          +G++  LK+        E    N +    ++ + + ++  +   +  LKR  R       + + K++  EA+
Subjt:  AMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVVNQDNHKVEHPEAK

Arabidopsis top hitse value%identityAlignment
AT1G48280.1 hydroxyproline-rich glycoprotein family protein6.4e-6138.06Show/hide
Query:  RASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALD--RSRMELILENDELSASQRFQGLMEVSGK-SNLIRNLKRA-----
        ++ + ++   A  +  R  +EE    +EE+   ++  IK ++ ++  LK    +   S +EL L N +LS     Q L+    K S+L  N K A     
Subjt:  RASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALD--RSRMELILENDELSASQRFQGLMEVSGK-SNLIRNLKRA-----

Query:  TRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPM
        +RF D +      K+E P+ KKE                +  S LS       R+P  PP P    SP+++      +                  + P 
Subjt:  TRFSDSVVNQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPM

Query:  PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANA--RDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKE
         PP PPP    PPPPPP P  K    A+ ++ P V + +  L ++D+ R     +    S  N+    ++GEI+NRSAHL+AIK D+ET+G+FI  LI++
Subjt:  PPPPPPPSKSAPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANA--RDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKE

Query:  VENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRE
        V    F+D+EDV+ FV WLD EL+ L DERAVLKHF+WPE+KAD L+EAA  Y +LKKLE E SS+  D     G ALKKM  LL+K E  +  L R+R 
Subjt:  VENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMRE

Query:  AATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEE---EELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        ++ + Y+ F+IPVEWMLDSG++ +IK  S+KLA  YM RV+ EL++      E   E L++QGVRFA+R HQFAGG D ET+ A +E++ +  S
Subjt:  AATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEE---EELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein3.4e-8647.79Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVV----NQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       SV+    +Q N   E  E K  E           N+  + +  L +I+ R P
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVV----NQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP

Query:  RVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRVPEVVEFYHSLM
        RVP+PPP+              S+  G S N+     A PP+P    PPPPPPP    P      PPPPPPP G     A    KV R PE+VEFY SLM
Subjt:  RVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRVPEVVEFYHSLM

Query:  RRDSRREPGSGITEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQK
        +R+S++E    +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE K
Subjt:  RRDSRREPGSGITEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQK

Query:  ADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA
        ADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ 
Subjt:  ADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA

Query:  ELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein3.0e-0224.14Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISAKDVEIERASKRILFLEAENERL----------RVEVEEVKQSIEEQRRESQERIK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K VEI+  +  I  L+AE ++L          R E+E  +  I+E +R+ Q    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISAKDVEIERASKRILFLEAENERL----------RVEVEEVKQSIEEQRRESQERIK

Query:  AMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVVNQDNHKVEHPEAK
          +G++  LK+        E    N +    ++ + + ++  +   +  LKR  R       + + K++  EA+
Subjt:  AMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVVNQDNHKVEHPEAK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein3.4e-8647.79Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVV----NQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       SV+    +Q N   E  E K  E           N+  + +  L +I+ R P
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVV----NQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP

Query:  RVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRVPEVVEFYHSLM
        RVP+PPP+              S+  G S N+     A PP+P    PPPPPPP    P      PPPPPPP G     A    KV R PE+VEFY SLM
Subjt:  RVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRVPEVVEFYHSLM

Query:  RRDSRREPGSGITEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQK
        +R+S++E    +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE K
Subjt:  RRDSRREPGSGITEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQK

Query:  ADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA
        ADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ 
Subjt:  ADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA

Query:  ELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein3.0e-0224.14Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISAKDVEIERASKRILFLEAENERL----------RVEVEEVKQSIEEQRRESQERIK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K VEI+  +  I  L+AE ++L          R E+E  +  I+E +R+ Q    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEISAKDVEIERASKRILFLEAENERL----------RVEVEEVKQSIEEQRRESQERIK

Query:  AMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVVNQDNHKVEHPEAK
          +G++  LK+        E    N +    ++ + + ++  +   +  LKR  R       + + K++  EA+
Subjt:  AMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVVNQDNHKVEHPEAK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein3.4e-8647.79Show/hide
Query:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVV----NQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP
        K+A++R +   I    + + ++RF G + +  K   +  LK       SV+    +Q N   E  E K  E           N+  + +  L +I+ R P
Subjt:  KMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVV----NQDNHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIP

Query:  RVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRVPEVVEFYHSLM
        RVP+PPP+              S+  G S N+     A PP+P    PPPPPPP    P      PPPPPPP G     A    KV R PE+VEFY SLM
Subjt:  RVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPMPPPPPPPSKSAP------PPPPPPPKGKRPMPA----KVRRVPEVVEFYHSLM

Query:  RRDSRREPGSGITEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQK
        +R+S++E    +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE K
Subjt:  RRDSRREPGSGITEP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQK

Query:  ADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA
        ADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYMKRV+ 
Subjt:  ADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA

Query:  ELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  ELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.6e-17159.37Show/hide
Query:  MVAGKVKLAMGLQKSPANRKVESSPK------PSTPAQPSPSSGKVS-------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRMVEELRDR
        MVAGKV++ MG  KSP+ +K +  P       P  P    PSSG  +        K  F+RSFGVYFPR+SAQV            V+EL R VEELR+R
Subjt:  MVAGKVKLAMGLQKSPANRKVESSPK------PSTPAQPSPSSGKVS-------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRMVEELRDR

Query:  EARLKTDLLEHKLLKESVAIVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILEN
        EA LKT+ LE KLL+ESV+++P+LE++I+ K+ EI+   K    L  +NERLR E +      EE RRE + R K ME EI EL+K+    S      ++
Subjt:  EARLKTDLLEHKLLKESVAIVPMLENEISAKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILEN

Query:  DELSASQRFQGLMEVSGKSNLIRNLKRA---TRFSDSVVNQDNHKVEHPEA--------KKEEVETERPRHSR-SNSEELAE-STLSNIKSRIPRVPKPP
          LS SQRFQGLM+VS KSNLIR+LKR        + + NQ+N       +        +K+E+E+    +SR SNSEEL E S+LS ++SR+PRVPKPP
Subjt:  DELSASQRFQGLMEVSGKSNLIRNLKRA---TRFSDSVVNQDNHKVEHPEA--------KKEEVETERPRHSR-SNSEELAE-STLSNIKSRIPRVPKPP

Query:  PKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPM---PPPPPPPSKS-APPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDS----RREP
        PK S S   S  + +           +K+IP PPP P  P+   PPPPP  SK+  PPPPPPPPK      AKVRRVPEVVEFYHSLMRRDS    R   
Subjt:  PKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPM---PPPPPPPSKS-APPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDS----RREP

Query:  GSGITEPP---STANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA
        G G        + +NARDMIGEIENRS +LLAIKTDVETQGDFIRFLIKEV NA+F+DIEDVVPFVKWLDDELSYLVDERAVLKHF+WPEQKADALREAA
Subjt:  GSGITEPP---STANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAA

Query:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETV-GG
        F Y DLKKL SEAS FR D RQ   SALKKMQAL EKLEHGVY+LSRMRE+A  ++K+FQIPV+WML++GI SQIKL SVKLAMKYMKRVSAELE + GG
Subjt:  FGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETV-GG

Query:  GPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQ-QQHKYVCSSRP
        GPEEEELIVQGVRFAFRVHQFAGGFD ETM+AF+ELRDKA SCHVQCQ+Q  QHK    S P
Subjt:  GPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQ-QQHKYVCSSRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGCTGGGAAGGTGAAGCTCGCAATGGGGCTGCAGAAGTCTCCGGCGAATAGAAAGGTGGAGAGCTCTCCGAAACCATCGACGCCGGCGCAGCCTTCTCCGAGCTC
CGGTAAGGTTTCTCAGAAAACAGTCTTCTCCCGCTCGTTTGGTGTGTATTTCCCTCGCTCTTCTGCTCAGGTGCAACCTCGACCGCCGGACGTGACGGAGCTTCTCCGTA
TGGTTGAGGAGTTGCGTGACAGAGAGGCACGATTGAAGACTGACCTACTGGAGCACAAGCTTTTGAAGGAATCTGTCGCTATTGTTCCTATGCTTGAGAACGAGATCTCT
GCGAAGGATGTGGAGATTGAAAGAGCTTCTAAGCGGATATTGTTCTTGGAGGCTGAGAATGAGAGATTGAGAGTTGAAGTGGAGGAAGTTAAACAGAGTATTGAGGAACA
GAGGAGAGAGAGTCAAGAGAGAATAAAGGCAATGGAAGGTGAAATCGCGGAGCTGAAGAAAATGGCGTTGGATCGTAGCAGAATGGAGCTTATTTTGGAGAACGACGAGC
TTTCGGCGTCGCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACAAGATTTTCGGATTCTGTTGTTAACCAAGAC
AATCATAAGGTTGAACATCCAGAGGCGAAGAAAGAAGAAGTTGAAACCGAGAGACCCAGACACTCGCGAAGTAACTCTGAAGAACTCGCCGAATCCACTCTCTCTAACAT
AAAATCGCGAATACCTAGGGTTCCAAAACCTCCTCCGAAACCCTCTTCGTCTTCCTCTCCTTCTGCCACTTCTTCTTCCTCCTCCTCATCAACTGGCTCTTCTGGTAACG
TAGAGAAAACGATCCCAGCCCCACCCCCTGTTCCAACCAAGCCAATGCCACCACCTCCTCCGCCGCCTTCGAAGTCGGCCCCGCCTCCCCCTCCACCGCCTCCCAAGGGT
AAGAGGCCGATGCCGGCGAAGGTCCGGCGAGTACCTGAGGTTGTTGAGTTCTATCATTCATTAATGCGGAGGGATTCCCGACGAGAACCAGGCTCCGGCATTACGGAACC
GCCGTCGACCGCCAATGCTCGTGACATGATCGGAGAGATCGAGAACCGGTCCGCCCACTTACTTGCTATAAAGACGGACGTAGAGACTCAAGGGGATTTCATAAGGTTCT
TGATAAAAGAAGTTGAAAATGCTTCATTTACTGACATTGAGGACGTTGTTCCATTTGTCAAATGGTTGGATGATGAGCTCTCGTATCTGGTAGATGAAAGAGCCGTGCTT
AAACACTTCCAGTGGCCGGAGCAAAAGGCCGACGCTCTGCGTGAGGCTGCCTTTGGCTACTGCGATTTAAAGAAGCTGGAATCTGAAGCATCCTCGTTTCGTGGTGATGC
CCGCCAGCCCTGTGGTTCGGCTCTCAAGAAGATGCAAGCTTTGCTTGAAAAGTTGGAGCATGGTGTATACAATTTGTCTAGAATGCGTGAAGCTGCAACTAAGAGATACA
AAGCATTTCAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCAGAG
CTTGAAACAGTCGGTGGTGGACCTGAAGAAGAAGAGCTGATTGTCCAAGGCGTTCGATTTGCCTTCCGTGTGCATCAGTTTGCAGGAGGGTTTGATGTGGAAACGATGAG
GGCGTTTCAAGAGCTGAGAGATAAAGCAAGTTCATGTCACGTACAATGCCAAAACCAGCAACAACATAAGTACGTGTGCAGTAGCAGGCCTACAACTTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATTTTCACAAACTCCAACTTTCTTTATCTCTCTCTCTTTCTTGCTTTCTCTCTCAGATGAGACGAAGACAAGGACGTTTCGTTCACACTCTAAAGTAGATTGAAAGTTGG
AGGACTGTAAAAACTCAGTCCCGACCTTACCCAAGCACACAGAGAGAGAGAGATACAGAGAGAGCTTAGACATGGTAGCTGGGAAGGTGAAGCTCGCAATGGGGCTGCAG
AAGTCTCCGGCGAATAGAAAGGTGGAGAGCTCTCCGAAACCATCGACGCCGGCGCAGCCTTCTCCGAGCTCCGGTAAGGTTTCTCAGAAAACAGTCTTCTCCCGCTCGTT
TGGTGTGTATTTCCCTCGCTCTTCTGCTCAGGTGCAACCTCGACCGCCGGACGTGACGGAGCTTCTCCGTATGGTTGAGGAGTTGCGTGACAGAGAGGCACGATTGAAGA
CTGACCTACTGGAGCACAAGCTTTTGAAGGAATCTGTCGCTATTGTTCCTATGCTTGAGAACGAGATCTCTGCGAAGGATGTGGAGATTGAAAGAGCTTCTAAGCGGATA
TTGTTCTTGGAGGCTGAGAATGAGAGATTGAGAGTTGAAGTGGAGGAAGTTAAACAGAGTATTGAGGAACAGAGGAGAGAGAGTCAAGAGAGAATAAAGGCAATGGAAGG
TGAAATCGCGGAGCTGAAGAAAATGGCGTTGGATCGTAGCAGAATGGAGCTTATTTTGGAGAACGACGAGCTTTCGGCGTCGCAGAGGTTCCAGGGATTAATGGAGGTCT
CGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACAAGATTTTCGGATTCTGTTGTTAACCAAGACAATCATAAGGTTGAACATCCAGAGGCGAAGAAAGAAGAA
GTTGAAACCGAGAGACCCAGACACTCGCGAAGTAACTCTGAAGAACTCGCCGAATCCACTCTCTCTAACATAAAATCGCGAATACCTAGGGTTCCAAAACCTCCTCCGAA
ACCCTCTTCGTCTTCCTCTCCTTCTGCCACTTCTTCTTCCTCCTCCTCATCAACTGGCTCTTCTGGTAACGTAGAGAAAACGATCCCAGCCCCACCCCCTGTTCCAACCA
AGCCAATGCCACCACCTCCTCCGCCGCCTTCGAAGTCGGCCCCGCCTCCCCCTCCACCGCCTCCCAAGGGTAAGAGGCCGATGCCGGCGAAGGTCCGGCGAGTACCTGAG
GTTGTTGAGTTCTATCATTCATTAATGCGGAGGGATTCCCGACGAGAACCAGGCTCCGGCATTACGGAACCGCCGTCGACCGCCAATGCTCGTGACATGATCGGAGAGAT
CGAGAACCGGTCCGCCCACTTACTTGCTATAAAGACGGACGTAGAGACTCAAGGGGATTTCATAAGGTTCTTGATAAAAGAAGTTGAAAATGCTTCATTTACTGACATTG
AGGACGTTGTTCCATTTGTCAAATGGTTGGATGATGAGCTCTCGTATCTGGTAGATGAAAGAGCCGTGCTTAAACACTTCCAGTGGCCGGAGCAAAAGGCCGACGCTCTG
CGTGAGGCTGCCTTTGGCTACTGCGATTTAAAGAAGCTGGAATCTGAAGCATCCTCGTTTCGTGGTGATGCCCGCCAGCCCTGTGGTTCGGCTCTCAAGAAGATGCAAGC
TTTGCTTGAAAAGTTGGAGCATGGTGTATACAATTTGTCTAGAATGCGTGAAGCTGCAACTAAGAGATACAAAGCATTTCAAATTCCAGTGGAATGGATGCTTGATAGTG
GAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCAGAGCTTGAAACAGTCGGTGGTGGACCTGAAGAAGAAGAGCTG
ATTGTCCAAGGCGTTCGATTTGCCTTCCGTGTGCATCAGTTTGCAGGAGGGTTTGATGTGGAAACGATGAGGGCGTTTCAAGAGCTGAGAGATAAAGCAAGTTCATGTCA
CGTACAATGCCAAAACCAGCAACAACATAAGTACGTGTGCAGTAGCAGGCCTACAACTTGTTAATCTGCAGCAATCTCAGCTTCAGAGGGGGCTCCTGTGTTTTGGTTGT
AAAGATCATGTTTGTGAATATTAATGGCAGCGGCTGTCTTGGGTGGGATGGAACTCCAATTTATTCATTTAAAAAGAAAATGAACTGAGGTG
Protein sequenceShow/hide protein sequence
MVAGKVKLAMGLQKSPANRKVESSPKPSTPAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPMLENEIS
AKDVEIERASKRILFLEAENERLRVEVEEVKQSIEEQRRESQERIKAMEGEIAELKKMALDRSRMELILENDELSASQRFQGLMEVSGKSNLIRNLKRATRFSDSVVNQD
NHKVEHPEAKKEEVETERPRHSRSNSEELAESTLSNIKSRIPRVPKPPPKPSSSSSPSATSSSSSSSTGSSGNVEKTIPAPPPVPTKPMPPPPPPPSKSAPPPPPPPPKG
KRPMPAKVRRVPEVVEFYHSLMRRDSRREPGSGITEPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVL
KHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLEHGVYNLSRMREAATKRYKAFQIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAE
LETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQQHKYVCSSRPTTC