CuGenDBv2

Moc01g32460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g32460
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein CHUP1, chloroplastic
Genome location: chr1:22841562..22848992
RNA-Seq Expression: Moc01g32460
Synteny: Moc01g32460
Gene Ontology terms:
  GO:0009658 - chloroplast organization (biological process)
  GO:0009707 - chloroplast outer membrane (cellular component)
  GO:0005525 - GTP binding (molecular function)
InterPro domains:
  IPR011719 - Conserved hypothetical protein CHP02058
  IPR037103 - Tubulin/FtsZ-like, C-terminal domain
  IPR040265 - Protein CHUP1-like


Homology
GenBank top hits (e value, %identity, alignment)
KAA0052630.1 protein CHUP1 [Cucumis melo var. makuwa]  e value: 0.0e+00  %identity: 91.41
Query:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRK++SSPK ST AQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEEVK S EE+R ESQER+KAMEGEI+ELKKMALDRSRMELIL+NDELSASQRFQGLMEVSGKS
Subjt:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIP
        NLIRNLKRATKCSDAVVN DNHKVE+PE KKEEVE   ERPRHSRCNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSS+A++SSSSSTGSS DIE AIP
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIP

Query:  APPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQGD
        APPPVPTKPMP PPPPPSKS PPPPPPPPKGKRPMPAKVRR+PEVVEFYHSLMRRDSRRD GSG++DPPSTANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  APPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGV
        FI+FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGS LKKMQALLEKLEHGV
Subjt:  FIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISS
        HVQCQN QQHKFRSY  S  N  SS  H +CSLTRP ISRLIK TA SSMEVE+GG SA V ST  PMKLLFVEMGVGYDQHGQD+TAAAMRACRDAISS
Subjt:  HVQCQN-QQHKFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISS

Query:  NSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFP
        NSIPAFRRGSIPGV+F +MKLQIKLGVP +LQQSLDIEKVKSVFP
Subjt:  NSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFP

KAG6594554.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]  e value: 0.0e+00  %identity: 87.13
Query:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRK++SSPKPST AQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL+ VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS
        IVP+LENEIATKDAEIERASKRILFLEAENERLRVEVEEVK S EEQR ES+ER+KAMEGEIAELKKMALDR RMELIL+NDELSASQRFQGLMEVSGKS
Subjt:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTA-SSSSSSSTGSSGDIENAI
        NLIR+LKR TK SD VV  DNHKVE PEAKKEEVE   ERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSS+A SSSSS+STGSSGD E  I
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTA-SSSSSSSTGSSGDIENAI

Query:  PAPPPVPTKPM-PRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQ
        PAPPPVPTKP  P PPPPPSKS PPPPPPPPKGKRP PAKVRR+PEVVEFYHSLMRRDSRR+ GS +++PPS+ANARDMIGEIENRS HLLAIKTDVETQ
Subjt:  PAPPPVPTKPM-PRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQ

Query:  GDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEH
        GDFI+FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPC S LKKMQALLEKLEH
Subjt:  GDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEH

Query:  GVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAS
        GVYNLSRMRESATKRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKAS
Subjt:  GVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAS

Query:  SCHVQCQNQQHK-------------------FRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYD
        SCHVQCQNQQHK                   F SYL SN N  SSSAH +C    P + RL+KA AA SMEVE+G +SA V STA PMKLLFVEMGVGYD
Subjt:  SCHVQCQNQQHK-------------------FRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYD

Query:  QHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIV
        QHGQD+TAAAMRACRDAI SNSIPAFRRG+IPGV+F +MKLQIKLGVPQ+LQQSLD+EKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIV
Subjt:  QHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMGDKNDDCYIV

Query:  NAAVYVGY
        NAAVYVGY
Subjt:  NAAVYVGY

KAG7026530.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]  e value: 0.0e+00  %identity: 85.56
Query:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRK++SSPKPST AQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL+ VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS
        IVP+LENEIATKDAEIERASKRILFLEAENERLRVEVEEVK S EEQR ES+ER+KAMEGEIAELKKMALDR RMELIL+NDELSASQRFQGLMEVSGKS
Subjt:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTA-SSSSSSSTGSSGDIENAI
        NLIRNLKR TK SD VV  DNHKVE PEAKKEEVE   ERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSS+A SSSSS+STGSSGD E  I
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTA-SSSSSSSTGSSGDIENAI

Query:  PAPPPVPTKPM-PRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQ
        PAPPPVPTKP  P PPPPPSKS PPPPPPPPKGKRP PAKVRR+PEVVEFYHSLMRRDSRR+ GS +++PPS+ANARDMIGEIENRS HLLAIKTDVETQ
Subjt:  PAPPPVPTKPM-PRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQ

Query:  GDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEH
        GDFI+FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPC S LKKMQALLEKLEH
Subjt:  GDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEH

Query:  GVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAS
        GVYNLSRMRESATKRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKAS
Subjt:  GVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAS

Query:  SCHVQCQNQQHK-----------------------------------FRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGST
        SCHVQCQNQQHK                                   F SYL SN N  SSSAH +C    P + RL+KA AA SMEVE+G +SA V ST
Subjt:  SCHVQCQNQQHK-----------------------------------FRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGST

Query:  ASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSG
        A PMKLLFVEMGVGYDQHGQD+TAAAMRACRDAI SNSIPAFRRG+IPGV+F +MKLQIKLGVPQ+LQQSLD+EKVKSVFPYGKILNVEVVDGGLICSSG
Subjt:  ASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSG

Query:  VHVEEMGDKNDDCYIVNAAVYVGY
        VHVEEMGDKNDDCYIVNAAVYVGY
Subjt:  VHVEEMGDKNDDCYIVNAAVYVGY

XP_008439756.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo]  e value: 4.9e-302  %identity: 93.14
Query:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRK++SSPK ST AQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEEVK S EE+R ESQERIKAMEGEI+ELKKMALDRSRMELIL+NDELSASQRFQGLMEVSGKS
Subjt:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIP
        NLIRNLKRATKCSDAVVN DNHKVE+PE KKEEVE   ERPRHSRCNSEELAESTLSN+KSRIPRVP+PPPKPSSSSSS+A ++SSSSTGSS DIE AIP
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIP

Query:  APPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQGD
        APPPVPTKPMP PPPPPSKS PPPPPPPPKGKRPMPAKVRR+PEVVEFYHSLMRRDSRRD GSG++DPPSTANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  APPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGV
        FI+ LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGS LKKMQALLEKLEHGV
Subjt:  FIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKF
        HVQCQN QQHK+
Subjt:  HVQCQN-QQHKF

XP_038883847.1 protein CHUP1, chloroplastic [Benincasa hispida]  e value: 2.9e-302  %identity: 92.96
Query:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRK++SSPKPST AQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRVEVEEVK S EE+R ESQERIKAME EIAELKKMALDRSRMELIL+NDELSASQRFQGLMEVSGKS
Subjt:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIP
        NLIRNLKR TKCS+AVVN DNHK E+PEAKKEEVE   ERPRHSRCNSEELAE TLSN+KSRIPRVPKPPPKPSSSSSS+A ++SSSSTGSSGD+E AIP
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIP

Query:  APPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQGD
        APPPVPTKPMP PPPPPSKS PPPPPPPPKGKRPMP KVRR+PEVVEFYHSLMRRDSRRD GS ++DPPSTANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  APPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGV
        FI+FLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGS LKKMQALLEKLEHGV
Subjt:  FIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESATKRYKAFQIPVEWMLDSGI+ QIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQNQQHKF
        HVQCQNQQHK+
Subjt:  HVQCQNQQHKF

TrEMBL top hits (e value, %identity, alignment)
A0A0A0KHU8 Uncharacterized protein  e value: 1.7e-300  %identity: 92.66
Query:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRK++SSPK ST AQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEE K S EE+R ESQERIKAMEGE+AELKKMALDRSRMELIL+NDELSASQRFQGLMEVSGKS
Subjt:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTA-SSSSSSSTGSSGDIENAI
        NLIRNLKRATKCSDAVVN DNHKVE+PEAKKEEVE   ERPRHSRCNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSS+A +S+SSSSTGSS DIE AI
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTA-SSSSSSSTGSSGDIENAI

Query:  PAPPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQG
        PAPPPVPTK MP PPPPPSKS PPPPPPPPKGKR MPAKVRR+PEVVEFYHSLMRRDSRRD GSG+++PPSTANARDMIGEIENRS HLLAIKTDVETQG
Subjt:  PAPPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQG

Query:  DFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHG
        DFI+FLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGS LKKMQALLEKLEHG
Subjt:  DFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHG

Query:  VYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        VYNLSRMRESA KRYKAFQIPVEWMLD GI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
Subjt:  VYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Query:  CHVQCQN-QQHKF
        CHVQCQN QQHK+
Subjt:  CHVQCQN-QQHKF

A0A1S3AZH3 protein CHUP1, chloroplastic  e value: 2.4e-302  %identity: 93.14
Query:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRK++SSPK ST AQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEEVK S EE+R ESQERIKAMEGEI+ELKKMALDRSRMELIL+NDELSASQRFQGLMEVSGKS
Subjt:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIP
        NLIRNLKRATKCSDAVVN DNHKVE+PE KKEEVE   ERPRHSRCNSEELAESTLSN+KSRIPRVP+PPPKPSSSSSS+A ++SSSSTGSS DIE AIP
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIP

Query:  APPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQGD
        APPPVPTKPMP PPPPPSKS PPPPPPPPKGKRPMPAKVRR+PEVVEFYHSLMRRDSRRD GSG++DPPSTANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  APPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGV
        FI+ LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGS LKKMQALLEKLEHGV
Subjt:  FIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKF
        HVQCQN QQHK+
Subjt:  HVQCQN-QQHKF

A0A5D3CMM2 Protein CHUP1  e value: 0.0e+00  %identity: 91.41
Query:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRK++SSPK ST AQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS
        IVP+LENEI+TKDAEIERASKRILFLEAENERLRV+VEEVK S EE+R ESQER+KAMEGEI+ELKKMALDRSRMELIL+NDELSASQRFQGLMEVSGKS
Subjt:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIP
        NLIRNLKRATKCSDAVVN DNHKVE+PE KKEEVE   ERPRHSRCNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSS+A++SSSSSTGSS DIE AIP
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIP

Query:  APPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQGD
        APPPVPTKPMP PPPPPSKS PPPPPPPPKGKRPMPAKVRR+PEVVEFYHSLMRRDSRRD GSG++DPPSTANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  APPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGV
        FI+FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGS LKKMQALLEKLEHGV
Subjt:  FIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGV

Query:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
        YNLSRMRESA KRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC
Subjt:  YNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSC

Query:  HVQCQN-QQHKFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISS
        HVQCQN QQHKFRSY  S  N  SS  H +CSLTRP ISRLIK TA SSMEVE+GG SA V ST  PMKLLFVEMGVGYDQHGQD+TAAAMRACRDAISS
Subjt:  HVQCQN-QQHKFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAVGSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISS

Query:  NSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFP
        NSIPAFRRGSIPGV+F +MKLQIKLGVP +LQQSLDIEKVKSVFP
Subjt:  NSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFP

A0A6J1EFK1 protein CHUP1, chloroplastic-like  e value: 7.4e-296  %identity: 91.68
Query:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRK++SSPKPST AQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL+ VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS
        IVP+LENEIATKDAEIERASKRILFLEAENERLRVEVEEVK S EEQR ESQER+KAMEGEIAELKKMALDR RMELIL+NDELSASQRFQGLMEVSGKS
Subjt:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTA-SSSSSSSTGSSGDIENAI
        NLIR+LKR TK SD VV  DNHKVE PEAKKEEVE   ERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSS+A SSSSS+STGSSGD E  I
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTA-SSSSSSSTGSSGDIENAI

Query:  PAPPPVPTKPM-PRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQ
        PAPPPVPTKP  P PPPPPSKS PPPPPPPPKGKRP PAKVRR+PEVVEFYHSLMRRDSRR+ GSG+++PPS+ANARDMIGEIENRSTHLLAIKTDVETQ
Subjt:  PAPPPVPTKPM-PRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQ

Query:  GDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEH
        GDFI+FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGS LKKMQALLEKLEH
Subjt:  GDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEH

Query:  GVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAS
        GVYNLSRMRESATKRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKAS
Subjt:  GVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAS

Query:  SCHVQCQNQQHKF
        SCHVQCQNQQHK+
Subjt:  SCHVQCQNQQHKF

A0A6J1KWU6 protein CHUP1, chloroplastic-like  e value: 6.9e-294  %identity: 91.19
Query:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPA RK++SSPKPST AQPSPSSGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL+ VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS
        IVP+LENEIATKDAEIERASKRILFLEAENERLRVEVEEVK S EEQR ESQER+KAMEGEIAELKKMALDR RMELIL+NDELSASQRFQGLMEVSGKS
Subjt:  IVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTA-SSSSSSSTGSSGDIENAI
        NLIR+LKR TK SD VV  DNHKVE PEAKKEEVE   ERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSS+A SSSSS+STGSSGD E  I
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTA-SSSSSSSTGSSGDIENAI

Query:  PAPPPVPTKPM-PRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQ
        PAPPPVPTKP  P PPPPPSKS PPPPPPPPKGKRP  AKVRR+PEVVEFYHSLMRRDSRR+ GSG+++PPS+ANARDMIGEIENRS HLLAIKTDVETQ
Subjt:  PAPPPVPTKPM-PRPPPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQ

Query:  GDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEH
        GDFI+FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGS LKKMQALLEKLEH
Subjt:  GDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEH

Query:  GVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAS
        GVYNLSRMRESATKRYKAFQIPVEWMLDSGI+SQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKAS
Subjt:  GVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAS

Query:  SCHVQCQNQQHKF
        SCHVQCQNQQHK+
Subjt:  SCHVQCQNQQHKF

SwissProt top hits (e value, %identity, alignment)
Q9LI74 Protein CHUP1, chloroplastic  e value: 1.5e-83  %identity: 47.18
Query:  KMALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIP
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +  N   E  E K  E             N+  + +  L +++ R P
Subjt:  KMALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIP

Query:  RVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRRDPG
        RVP+PPP+ +    ST   S+       G        PPP P  P   PPPPP    PPPPPPP    R      KV R PE+VEFY SLM+R+S+++  
Subjt:  RVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRRDPG

Query:  SGI---SDPPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAF
          +       S+A   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF
Subjt:  SGI---SDPPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAF

Query:  GYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGG-
         Y DL KLE + +SF  D    C   LKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G++ +IKL SV+LA KYMKRV+ EL++V G  
Subjt:  GYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGG-

Query:  --PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
          P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  --PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

Q9LI74 Protein CHUP1, chloroplastic  e value: 2.0e+00  %identity: 24.37
Query:  DVTELLRTVEELRDREARLKTDLLEHKLLKESVAIVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELK
        ++  L + V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L+ E+         Q G  ++ ++    +I EL+
Subjt:  DVTELLRTVEELRDREARLKTDLLEHKLLKESVAIVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELK

Query:  K---MALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHS-RCNSEELAESTLSNL
        +   +  ++++ +L+L    +S+ Q     M+     N    ++R  K    +      +V+  E K++  E + E+   S + +S E   +TLSN+
Subjt:  K---MALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHS-RCNSEELAESTLSNL

Arabidopsis top hits (e value, %identity, alignment)
AT1G48280.1 hydroxyproline-rich glycoprotein family protein  e value: 3.6e-61  %identity: 38.09
Query:  RASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALD--RSRMELILQNDELSASQRFQGLMEVSGK-SNLIRNLKRATKCSD
        ++ + ++   A  +  R  +EE+    EE+   ++  IK ++ ++  LK    +   S +EL L N +LS     Q L+    K S+L  N K A +   
Subjt:  RASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALD--RSRMELILQNDELSASQRFQGLMEVSGK-SNLIRNLKRATKCSD

Query:  AVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIP-RVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKPMPRP
            H N +    +  +  + +++E+P   +   E   ES+  +  S  P R+P  PP P    S  +S             EN+ P  PP         
Subjt:  AVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIP-RVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKPMPRP

Query:  PPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANA--RDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVEN
        PPPP    PPPPPP P  K    A+ ++ P V + +  L ++D+ R+    ++   S  N+    ++GEI+NRS HL+AIK D+ET+G+FI  LI++V  
Subjt:  PPPPSKSVPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANA--RDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVEN

Query:  ASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESAT
          F+D+EDV+ FV WLD EL+ L DERAVLKHF+WPE+KAD L+EAA  Y +LKKLE E SS+  D     G  LKKM  LL+K E  +  L R+R S+ 
Subjt:  ASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESAT

Query:  KRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEE---EELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
        + Y+ F+IPVEWMLDSG++ +IK  S+KLA  YM RV+ EL++      E   E L++QGVRFA+R HQFAGG D ET+ A +E++ +  S
Subjt:  KRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGGPEE---EELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein  e value: 1.0e-84  %identity: 47.18
Query:  KMALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIP
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +  N   E  E K  E             N+  + +  L +++ R P
Subjt:  KMALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIP

Query:  RVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRRDPG
        RVP+PPP+ +    ST   S+       G        PPP P  P   PPPPP    PPPPPPP    R      KV R PE+VEFY SLM+R+S+++  
Subjt:  RVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRRDPG

Query:  SGI---SDPPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAF
          +       S+A   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF
Subjt:  SGI---SDPPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAF

Query:  GYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGG-
         Y DL KLE + +SF  D    C   LKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G++ +IKL SV+LA KYMKRV+ EL++V G  
Subjt:  GYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGG-

Query:  --PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
          P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  --PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein  e value: 1.4e-01  %identity: 24.37
Query:  DVTELLRTVEELRDREARLKTDLLEHKLLKESVAIVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELK
        ++  L + V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L+ E+         Q G  ++ ++    +I EL+
Subjt:  DVTELLRTVEELRDREARLKTDLLEHKLLKESVAIVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELK

Query:  K---MALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHS-RCNSEELAESTLSNL
        +   +  ++++ +L+L    +S+ Q     M+     N    ++R  K    +      +V+  E K++  E + E+   S + +S E   +TLSN+
Subjt:  K---MALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHS-RCNSEELAESTLSNL

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein  e value: 1.0e-84  %identity: 47.18
Query:  KMALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIP
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +  N   E  E K  E             N+  + +  L +++ R P
Subjt:  KMALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIP

Query:  RVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRRDPG
        RVP+PPP+ +    ST   S+       G        PPP P  P   PPPPP    PPPPPPP    R      KV R PE+VEFY SLM+R+S+++  
Subjt:  RVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRRDPG

Query:  SGI---SDPPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAF
          +       S+A   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF
Subjt:  SGI---SDPPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAF

Query:  GYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGG-
         Y DL KLE + +SF  D    C   LKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G++ +IKL SV+LA KYMKRV+ EL++V G  
Subjt:  GYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGG-

Query:  --PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
          P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  --PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein  e value: 1.4e-01  %identity: 24.37
Query:  DVTELLRTVEELRDREARLKTDLLEHKLLKESVAIVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELK
        ++  L + V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L+ E+         Q G  ++ ++    +I EL+
Subjt:  DVTELLRTVEELRDREARLKTDLLEHKLLKESVAIVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELK

Query:  K---MALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHS-RCNSEELAESTLSNL
        +   +  ++++ +L+L    +S+ Q     M+     N    ++R  K    +      +V+  E K++  E + E+   S + +S E   +TLSN+
Subjt:  K---MALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVEYPEAKKEEVEAEIERPRHS-RCNSEELAESTLSNL

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein (e value: 1.0e-84, % identity: 47.18)
Query:  KMALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIP
        K+A++R +   I    + + ++RF G + +  K   ++  KR    S   A  +  N   E  E K  E             N+  + +  L +++ R P
Subjt:  KMALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCS--DAVVNHDNHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIP

Query:  RVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRRDPG
        RVP+PPP+ +    ST   S+       G        PPP P  P   PPPPP    PPPPPPP    R      KV R PE+VEFY SLM+R+S+++  
Subjt:  RVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKPMPRPPPPPSKSVPPPPPPPPKGKRPMPA--KVRRVPEVVEFYHSLMRRDSRRDPG

Query:  SGI---SDPPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAF
          +       S+A   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF
Subjt:  SGI---SDPPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAF

Query:  GYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGG-
         Y DL KLE + +SF  D    C   LKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G++ +IKL SV+LA KYMKRV+ EL++V G  
Subjt:  GYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETVGGG-

Query:  --PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS
          P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A +
Subjt:  --PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASS

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 6.6e-172, % identity: 59.15)
Query:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQP------SPSSGKVS-------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRTVEELRDR
        MVAGKV+V MG  KSP+++K    P P     P       PSSG  +        K  F+RSFGVYFPR+SAQV            V+EL R VEELR+R
Subjt:  MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQP------SPSSGKVS-------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRTVEELRDR

Query:  EARLKTDLLEHKLLKESVAIVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQN
        EA LKT+ LE KLL+ESV+++PLLE++IA K+ EI+   K    L  +NERLR E +      EE R E + R K ME EI EL+K+    S       +
Subjt:  EARLKTDLLEHKLLKESVAIVPLLENEIATKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQN

Query:  DELSASQRFQGLMEVSGKSNLIRNLKRA---TKCSDAVVNHDNHKVEYPEA--------KKEEVEAEIERPRHSR-CNSEELAE-STLSNLKSRIPRVPK
          LS SQRFQGLM+VS KSNLIR+LKR        + + N +N       +        +K+E+E+      +SR  NSEEL E S+LS ++SR+PRVPK
Subjt:  DELSASQRFQGLMEVSGKSNLIRNLKRA---TKCSDAVVNHDNHKVEYPEA--------KKEEVEAEIERPRHSR-CNSEELAE-STLSNLKSRIPRVPK

Query:  PPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKP-MPRPPPPPSKS---VPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRD---SRRDP
        PPPK S       S   S+   +    + +IP PPP P  P + +PPPPPS S    PPPPPPPPK      AKVRRVPEVVEFYHSLMRRD   SRRD 
Subjt:  PPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKP-MPRPPPPPSKS---VPPPPPPPPKGKRPMPAKVRRVPEVVEFYHSLMRRD---SRRDP

Query:  GSGISDPP----STANARDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREA
          G +       + +NARDMIGEIENRS +LLAIKTDVETQGDFI+FLIKEV NA+F+DIEDVVPFVKWLDDELSYLVDERAVLKHF+WPEQKADALREA
Subjt:  GSGISDPP----STANARDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREA

Query:  AFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETV-G
        AF Y DLKKL SEAS FR D RQ   S LKKMQAL EKLEHGVY+LSRMRESA  ++K+FQIPV+WML++GI SQIKL SVKLAMKYMKRVSAELE + G
Subjt:  AFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSAELETV-G

Query:  GGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQHKFR
        GGPEEEELIVQGVRFAFRVHQFAGGFD ETM+AF+ELRDKA SCHVQCQ+Q H+ +
Subjt:  GGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQHKFR


Sequences
CDS sequence
ATGGTAGCTGGGAAGGTGAAGGTTGCAATGGGGCTGCAGAAGTCCCCCGCGAGCAGAAAGCTTGACAGCTCACCGAAGCCATCCACGCAGGCGCAGCCTTCTCCGAGCTC
CGGTAAGGTTTCTCAGAAAACGGTCTTCTCCCGCTCATTTGGTGTCTATTTTCCTCGCTCCTCTGCTCAGGTCCAGCCTCGACCGCCTGACGTGACGGAGCTTCTCCGTA
CGGTCGAGGAGTTGCGTGACAGAGAGGCGCGATTAAAGACTGACCTATTGGAGCACAAGCTTTTGAAGGAGTCTGTCGCCATTGTTCCTCTGCTTGAGAACGAAATCGCT
ACGAAGGACGCTGAGATTGAAAGAGCATCCAAGCGGATATTATTCTTGGAGGCGGAGAATGAGAGATTGAGAGTTGAGGTGGAGGAAGTTAAACTTAGTTTTGAGGAACA
GAGGGGGGAGAGTCAGGAAAGAATAAAGGCAATGGAAGGGGAAATCGCGGAGCTGAAGAAAATGGCGTTGGATCGAAGCAGGATGGAGCTTATTTTGCAGAACGACGAGC
TTTCGGCGTCCCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACCAAATGTTCGGATGCTGTTGTTAACCACGAC
AATCATAAGGTTGAATATCCAGAGGCAAAGAAAGAAGAAGTTGAAGCTGAAATCGAGAGACCGAGACACTCGCGGTGTAACTCTGAAGAACTCGCCGAGTCCACTCTCTC
TAACCTAAAATCACGAATACCTAGGGTTCCCAAACCTCCTCCCAAACCTTCCTCGTCTTCCTCTTCTACTGCCTCTTCTTCCTCCTCCTCATCGACTGGCTCTTCTGGTG
ACATAGAGAATGCGATCCCAGCCCCACCCCCTGTCCCTACGAAGCCAATGCCGCGGCCTCCACCGCCGCCTTCGAAGTCCGTTCCGCCTCCCCCTCCGCCGCCTCCCAAG
GGTAAGAGGCCGATGCCAGCGAAGGTGCGGCGAGTACCGGAGGTTGTTGAGTTCTATCATTCGTTAATGCGGAGGGACTCCCGGCGAGATCCCGGCTCCGGAATTTCCGA
CCCGCCGTCGACCGCCAATGCTCGTGACATGATAGGGGAGATCGAGAACCGGTCCACTCACCTACTCGCTATAAAGACGGATGTAGAGACTCAAGGGGATTTCATAAAGT
TTCTGATTAAAGAAGTTGAAAATGCTTCATTTACTGACATCGAGGACGTTGTGCCATTTGTCAAATGGTTGGACGATGAGCTCTCATATCTGGTGGACGAAAGAGCCGTC
CTTAAACACTTTCAGTGGCCGGAGCAAAAGGCCGACGCTCTGCGTGAGGCCGCCTTTGGCTATTGCGATCTAAAGAAGCTGGAGTCCGAAGCGTCATCGTTTCGTGGTGA
TGCCCGCCAGCCCTGTGGTTCCGTTCTCAAGAAGATGCAAGCTCTGCTTGAAAAATTGGAGCATGGCGTATACAATCTCTCGAGAATGCGTGAATCTGCAACTAAGAGAT
ACAAAGCTTTTCAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTATGAGTCAGATCAAGCTTGTCTCTGTGAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCG
GAGCTTGAAACAGTCGGTGGTGGACCTGAGGAAGAAGAGCTGATTGTCCAAGGCGTTAGATTTGCCTTCCGTGTGCATCAGTTTGCCGGAGGATTTGATGTGGAAACGAT
GAGGGCATTTCAAGAGCTGAGAGATAAAGCAAGTTCATGCCACGTTCAATGCCAAAACCAGCAACATAAGTTTCGTTCTTACCTTCATTCCAATAAGAATCGATCATCAT
CATCTGCGCATTTTCTCTGCTCTCTCACACGGCCTCCAATTTCTCGCCTTATCAAAGCTACTGCTGCGTCGTCCATGGAGGTTGAGGAAGGTGGAAGATCTGCAGCTGTT
GGCAGCACAGCTTCGCCCATGAAGCTCTTGTTCGTCGAGATGGGTGTCGGATACGATCAACATGGCCAAGATGTCACGGCAGCTGCTATGCGGGCCTGCAGGGATGCCAT
ATCTTCCAATTCGATTCCAGCATTCCGTAGAGGTTCCATTCCTGGAGTCACATTTGATCAGATGAAACTTCAGATCAAACTTGGAGTACCACAGACACTTCAACAATCCT
TGGATATTGAAAAAGTCAAGTCTGTCTTCCCATATGGAAAGATTCTGAATGTTGAGGTCGTTGATGGTGGCTTAATATGCTCCAGCGGTGTGCACGTGGAAGAAATGGGA
GACAAAAATGATGACTGTTATATAGTAAATGCTGCTGTATATGTTGGCTATTAA
mRNA sequence
ATGGTAGCTGGGAAGGTGAAGGTTGCAATGGGGCTGCAGAAGTCCCCCGCGAGCAGAAAGCTTGACAGCTCACCGAAGCCATCCACGCAGGCGCAGCCTTCTCCGAGCTC
CGGTAAGGTTTCTCAGAAAACGGTCTTCTCCCGCTCATTTGGTGTCTATTTTCCTCGCTCCTCTGCTCAGGTCCAGCCTCGACCGCCTGACGTGACGGAGCTTCTCCGTA
CGGTCGAGGAGTTGCGTGACAGAGAGGCGCGATTAAAGACTGACCTATTGGAGCACAAGCTTTTGAAGGAGTCTGTCGCCATTGTTCCTCTGCTTGAGAACGAAATCGCT
ACGAAGGACGCTGAGATTGAAAGAGCATCCAAGCGGATATTATTCTTGGAGGCGGAGAATGAGAGATTGAGAGTTGAGGTGGAGGAAGTTAAACTTAGTTTTGAGGAACA
GAGGGGGGAGAGTCAGGAAAGAATAAAGGCAATGGAAGGGGAAATCGCGGAGCTGAAGAAAATGGCGTTGGATCGAAGCAGGATGGAGCTTATTTTGCAGAACGACGAGC
TTTCGGCGTCCCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACCAAATGTTCGGATGCTGTTGTTAACCACGAC
AATCATAAGGTTGAATATCCAGAGGCAAAGAAAGAAGAAGTTGAAGCTGAAATCGAGAGACCGAGACACTCGCGGTGTAACTCTGAAGAACTCGCCGAGTCCACTCTCTC
TAACCTAAAATCACGAATACCTAGGGTTCCCAAACCTCCTCCCAAACCTTCCTCGTCTTCCTCTTCTACTGCCTCTTCTTCCTCCTCCTCATCGACTGGCTCTTCTGGTG
ACATAGAGAATGCGATCCCAGCCCCACCCCCTGTCCCTACGAAGCCAATGCCGCGGCCTCCACCGCCGCCTTCGAAGTCCGTTCCGCCTCCCCCTCCGCCGCCTCCCAAG
GGTAAGAGGCCGATGCCAGCGAAGGTGCGGCGAGTACCGGAGGTTGTTGAGTTCTATCATTCGTTAATGCGGAGGGACTCCCGGCGAGATCCCGGCTCCGGAATTTCCGA
CCCGCCGTCGACCGCCAATGCTCGTGACATGATAGGGGAGATCGAGAACCGGTCCACTCACCTACTCGCTATAAAGACGGATGTAGAGACTCAAGGGGATTTCATAAAGT
TTCTGATTAAAGAAGTTGAAAATGCTTCATTTACTGACATCGAGGACGTTGTGCCATTTGTCAAATGGTTGGACGATGAGCTCTCATATCTGGTGGACGAAAGAGCCGTC
CTTAAACACTTTCAGTGGCCGGAGCAAAAGGCCGACGCTCTGCGTGAGGCCGCCTTTGGCTATTGCGATCTAAAGAAGCTGGAGTCCGAAGCGTCATCGTTTCGTGGTGA
TGCCCGCCAGCCCTGTGGTTCCGTTCTCAAGAAGATGCAAGCTCTGCTTGAAAAATTGGAGCATGGCGTATACAATCTCTCGAGAATGCGTGAATCTGCAACTAAGAGAT
ACAAAGCTTTTCAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTATGAGTCAGATCAAGCTTGTCTCTGTGAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCG
GAGCTTGAAACAGTCGGTGGTGGACCTGAGGAAGAAGAGCTGATTGTCCAAGGCGTTAGATTTGCCTTCCGTGTGCATCAGTTTGCCGGAGGATTTGATGTGGAAACGAT
GAGGGCATTTCAAGAGCTGAGAGATAAAGCAAGTTCATGCCACGTTCAATGCCAAAACCAGCAACATAAGTTTCGTTCTTACCTTCATTCCAATAAGAATCGATCATCAT
CATCTGCGCATTTTCTCTGCTCTCTCACACGGCCTCCAATTTCTCGCCTTATCAAAGCTACTGCTGCGTCGTCCATGGAGGTTGAGGAAGGTGGAAGATCTGCAGCTGTT
GGCAGCACAGCTTCGCCCATGAAGCTCTTGTTCGTCGAGATGGGTGTCGGATACGATCAACATGGCCAAGATGTCACGGCAGCTGCTATGCGGGCCTGCAGGGATGCCAT
ATCTTCCAATTCGATTCCAGCATTCCGTAGAGGTTCCATTCCTGGAGTCACATTTGATCAGATGAAACTTCAGATCAAACTTGGAGTACCACAGACACTTCAACAATCCT
TGGATATTGAAAAAGTCAAGTCTGTCTTCCCATATGGAAAGATTCTGAATGTTGAGGTCGTTGATGGTGGCTTAATATGCTCCAGCGGTGTGCACGTGGAAGAAATGGGA
GACAAAAATGATGACTGTTATATAGTAAATGCTGCTGTATATGTTGGCTATTAA
Protein sequence
MVAGKVKVAMGLQKSPASRKLDSSPKPSTQAQPSPSSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELRDREARLKTDLLEHKLLKESVAIVPLLENEIA
TKDAEIERASKRILFLEAENERLRVEVEEVKLSFEEQRGESQERIKAMEGEIAELKKMALDRSRMELILQNDELSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHD
NHKVEYPEAKKEEVEAEIERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSTASSSSSSSTGSSGDIENAIPAPPPVPTKPMPRPPPPPSKSVPPPPPPPPK
GKRPMPAKVRRVPEVVEFYHSLMRRDSRRDPGSGISDPPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIKFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAV
LKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSVLKKMQALLEKLEHGVYNLSRMRESATKRYKAFQIPVEWMLDSGIMSQIKLVSVKLAMKYMKRVSA
ELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKASSCHVQCQNQQHKFRSYLHSNKNRSSSSAHFLCSLTRPPISRLIKATAASSMEVEEGGRSAAV
GSTASPMKLLFVEMGVGYDQHGQDVTAAAMRACRDAISSNSIPAFRRGSIPGVTFDQMKLQIKLGVPQTLQQSLDIEKVKSVFPYGKILNVEVVDGGLICSSGVHVEEMG
DKNDDCYIVNAAVYVGY