CuGenDBv2

Sed0016456 (gene) of Chayote v1 genome

Gene ID: Sed0016456
Organism: Sechium edule (Chayote v1)
Description: protein CHUP1, chloroplastic
Genome location: LG14:23388876..23393904
RNA-Seq Expression: Sed0016456
Synteny: Sed0016456
Gene Ontology terms:
  GO:0009658 - chloroplast organization (biological process)
  GO:0009707 - chloroplast outer membrane (cellular component)
  GO:0005525 - GTP binding (molecular function)
InterPro domains: IPR040265 - Protein CHUP1-like
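
The genome location field above uses a `sequence:start..end` layout. As a minimal sketch of how that string could be split into its parts, the snippet below parses it and reports the span length. The names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of any database API, and the length assumes 1-based inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'LG14:23388876..23393904'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Normalize so that start <= end, whatever order the input used.
    if start > end:
        start, end = end, start
    return GenomeLocation(seqid, start, end)


loc = parse_location("LG14:23388876..23393904")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 5029 positions on LG14.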


Homology
GenBank top hits    e value    % identity    Alignment
XP_004134665.1 protein CHUP1, chloroplastic [Cucumis sativus]    9.4e-277    84.35
Query:  MVAGKVKLAMGLHKPPASKPVETSP-APPPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA
        MVAGKVK+AMGL K PAS+ VE+SP    P QPSP+SGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELR+REARL T+LLEHKLLKESVA
Subjt:  MVAGKVKLAMGLHKPPASKPVETSP-APPPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA

Query:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS
        IVP+LE  ISTKD EIERA KRILFLEAENERLRV++EE KQS EE+RR S+ER+ AMEGE+ ELK+MALDRSRME++ END+ S   RFQ LMEVSGKS
Subjt:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS

Query:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTS------SSSSTSSSGNVEKTIPA
        NLIRNLKRATKCS++V+NQ+NHKVE PEAKK EVETER R+S+C +EE+A+STLSN+KSRIPRVPKPPPKPSSSS+S      SSSST SS ++EK IPA
Subjt:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTS------SSSSTSSSGNVEKTIPA

Query:  PPPVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDF
        PPPVPT  +    PPP KSAPPPPPPPPKGK  + +KVRRIPEVVEFYHSLMRRDSRR+SG+GV E PSTANARDMIGEIENRS HLLAIKTDVETQGDF
Subjt:  PPPVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLE GVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVY

Query:  NLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCH
        NLSRMRESA K+YK FQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGP+EEELIVQGVRFAFRVHQFAGGFDVETM+AFQELRDKASSCH
Subjt:  NLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCH

Query:  VQCQN-QQHKYVCSNRPTTC
        VQCQN QQHKYV S+RPTTC
Subjt:  VQCQN-QQHKYVCSNRPTTC
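
In BLAST-style pairwise output, the unlabeled middle line marks each column: a letter is an identical residue, `+` is a conservative substitution, and a space is a mismatch or gap. The sketch below counts those marks for a single alignment block. `midline_stats` is a hypothetical helper, and the figures it gives for one block will not match the whole-alignment % identity reported in the hit row.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Summarize one alignment block from its query line and midline.

    The query line (gaps included) defines the block length, because
    trailing spaces of the midline are often lost in copied text.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    # Letters on the midline mark identical residues.
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative (positive-scoring) substitution.
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": 100.0 * identities / length if length else 0.0,
    }


# Last block of the first GenBank hit, copied from the alignment above.
stats = midline_stats("VQCQN-QQHKYVCSNRPTTC", "VQCQN QQHKYV S+RPTTC")
print(stats)
```

Summing `identities` and `length` over every block of a hit and dividing once at the end would give a whole-alignment figure comparable to the % identity column.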

XP_008439756.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo]    2.5e-277    84.63
Query:  MVAGKVKLAMGLHKPPASKPVETSP-APPPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA
        MVAGKVK+AMGL K PAS+ VE+SP    P QPSP+SGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELR+REARL T+LLEHKLLKESVA
Subjt:  MVAGKVKLAMGLHKPPASKPVETSP-APPPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA

Query:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS
        IVP+LE  ISTKD EIERA KRILFLEAENERLRV++EEVKQS EE+RR S+ER+ AMEGEI ELK+MALDRSRME++ END+ S   RFQ LMEVSGKS
Subjt:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS

Query:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSS----TSSSSSTSSSGNVEKTIPAPP
        NLIRNLKRATKCS++V+NQ+NHKVE PE KK EVETER R+S+C +EE+A+STLSN+KSRIPRVP+PPPKPSSSS    T+SSSST SS ++EK IPAPP
Subjt:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSS----TSSSSSTSSSGNVEKTIPAPP

Query:  PVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIR
        PVPT  +    PPP KSAPPPPPPPPKGK P+ +KVRRIPEVVEFYHSLMRRDSRR+SG+GV + PSTANARDMIGEIENRS HLLAIKTDVETQGDFIR
Subjt:  PVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIR

Query:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNL
         LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLE GVYNL
Subjt:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNL

Query:  SRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCHVQ
        SRMRESA K+YK FQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGP+EEELIVQGVRFAFRVHQFAGGFDVETM+AFQELRDKASSCHVQ
Subjt:  SRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCHVQ

Query:  CQN-QQHKYVCSNRPTTC
        CQN QQHKYV S+RPTTC
Subjt:  CQN-QQHKYVCSNRPTTC

XP_022926872.1 protein CHUP1, chloroplastic-like [Cucurbita moschata]    1.4e-275    84.54
Query:  MVAGKVKLAMGLHKPPASKPVETSPAP-PPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA
        MVAGKVKLAMGL K PAS+ VE+SP P  P QPSP+SGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL+ VEELR+REARL T+LLEHKLLKESVA
Subjt:  MVAGKVKLAMGLHKPPASKPVETSPAP-PPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA

Query:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS
        IVPMLE  I+TKD EIERA KRILFLEAENERLRVE+EEVKQS EEQRR S+ERV AMEGEI ELK+MALDR RME++ END+ S   RFQ LMEVSGKS
Subjt:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS

Query:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSS------STSSSSSTSSSGNVEKTIPA
        NLIR+LKR TK S++V+ Q+NHKVE PEAKK EVETER R+S+  +EE+A+STLSNVKSRIPRVPKPPPKPSSS      S+SSS+ST SSG+ EK IPA
Subjt:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSS------STSSSSSTSSSGNVEKTIPA

Query:  PPPVPTMRI-----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPT        PPP KSAPPPPPPPPKGK P  +KVRRIPEVVEFYHSLMRRDSRRE G+GV E PS+ANARDMIGEIENRSTHLLAIKTDVETQGD
Subjt:  PPPVPTMRI-----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLE GV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGV

Query:  YNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSC
        YNLSRMRESATK+YK FQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGP+EEELIV+GVRFAFRVHQFAGGFDVETM+AFQELRDKASSC
Subjt:  YNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSC

Query:  HVQCQNQQHKYVC-SNRPTTC
        HVQCQNQQHKYVC SNRPTTC
Subjt:  HVQCQNQQHKYVC-SNRPTTC

XP_023518440.1 protein CHUP1, chloroplastic [Cucurbita pepo subsp. pepo]    3.0e-275    84.38
Query:  MVAGKVKLAMGLHKPPASKPVETSPAP-PPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA
        MVAGKVKLAMGL K PAS+ VE+SP P  P QPSP+SGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL+ VEELR+REARL T+LLEHKLLKESVA
Subjt:  MVAGKVKLAMGLHKPPASKPVETSPAP-PPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA

Query:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS
        IVPMLE  I+TKD EIERA KRILFLEAENERLRVE+EEVKQS EEQRR S+ERV AMEGEI ELK+MALDR RME++ END+ S   RFQ LMEVSGKS
Subjt:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS

Query:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSS------STSSSSSTSSSGNVEKTIPA
        NLIR+LKR TK S++V+ Q+NHKVE PEAKK EVETER R+S+  +EE+A+STLS++KSRIPRVPKPPPKPSSS      S+SSSSST SSG+ EK IPA
Subjt:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSS------STSSSSSTSSSGNVEKTIPA

Query:  PPPVPTMRI-----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPT        PPP KSAPPPPPPPPKGK P  +KVRRIPEVVEFYHSLMRRDSRRE G+GV E PS+ANARDMIGEIENRSTHLLAIKTDVETQGD
Subjt:  PPPVPTMRI-----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLE GV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGV

Query:  YNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSC
        YNLSRMRESATK+YK FQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGP+EEELIV+GVRFAFRVHQFAGGFDVETM+AFQELRDKASSC
Subjt:  YNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSC

Query:  HVQCQNQQHKYVC-SNRPTTC
        HVQCQNQQHKYVC SNRPTTC
Subjt:  HVQCQNQQHKYVC-SNRPTTC

XP_038883847.1 protein CHUP1, chloroplastic [Benincasa hispida]    1.7e-278    84.93
Query:  MVAGKVKLAMGLHKPPASKPVETSPAP-PPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA
        MVAGKVK+AMGL K PAS+ VE+SP P  P QPSP+SGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELR+REARL T+LLEHKLLKESVA
Subjt:  MVAGKVKLAMGLHKPPASKPVETSPAP-PPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA

Query:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS
        IVP+LE  ISTKD EIERA KRILFLEAENERLRVE+EEVKQS EE+RR S+ER+ AME EI ELK+MALDRSRME++ END+ S   RFQ LMEVSGKS
Subjt:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS

Query:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTS----SSSSTSSSGNVEKTIPAPP
        NLIRNLKR TKCSE+V+NQ+NHK E PEAKK EVETER R+S+C +EE+A+ TLSN+KSRIPRVPKPPPKPSSSS+S    SSSST SSG++EK IPAPP
Subjt:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTS----SSSSTSSSGNVEKTIPAPP

Query:  PVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIR
        PVPT  +    PPP KSAPPPPPPPPKGK P+ +KVRRIPEVVEFYHSLMRRDSRR+SG+ V + PSTANARDMIGEIENRS HLLAIKTDVETQGDFIR
Subjt:  PVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIR

Query:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNL
        FLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLE GVYNL
Subjt:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNL

Query:  SRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCHVQ
        SRMRESATK+YK FQIPVEWMLD GIV QIKLVSVKLAMKYMKRVSAELETVGGGP+EEELIVQGVRFAFRVHQFAGGFDVETM+AFQELRDKASSCHVQ
Subjt:  SRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCHVQ

Query:  CQNQQHKYVCSNRPTTC
        CQNQQHKYV S+RPTTC
Subjt:  CQNQQHKYVCSNRPTTC

TrEMBL top hits    e value    % identity    Alignment
A0A0A0KHU8 Uncharacterized protein    4.5e-277    84.35
Query:  MVAGKVKLAMGLHKPPASKPVETSP-APPPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA
        MVAGKVK+AMGL K PAS+ VE+SP    P QPSP+SGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELR+REARL T+LLEHKLLKESVA
Subjt:  MVAGKVKLAMGLHKPPASKPVETSP-APPPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA

Query:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS
        IVP+LE  ISTKD EIERA KRILFLEAENERLRV++EE KQS EE+RR S+ER+ AMEGE+ ELK+MALDRSRME++ END+ S   RFQ LMEVSGKS
Subjt:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS

Query:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTS------SSSSTSSSGNVEKTIPA
        NLIRNLKRATKCS++V+NQ+NHKVE PEAKK EVETER R+S+C +EE+A+STLSN+KSRIPRVPKPPPKPSSSS+S      SSSST SS ++EK IPA
Subjt:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTS------SSSSTSSSGNVEKTIPA

Query:  PPPVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDF
        PPPVPT  +    PPP KSAPPPPPPPPKGK  + +KVRRIPEVVEFYHSLMRRDSRR+SG+GV E PSTANARDMIGEIENRS HLLAIKTDVETQGDF
Subjt:  PPPVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDF

Query:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVY
        IRFLIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLE GVY
Subjt:  IRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVY

Query:  NLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCH
        NLSRMRESA K+YK FQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGP+EEELIVQGVRFAFRVHQFAGGFDVETM+AFQELRDKASSCH
Subjt:  NLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCH

Query:  VQCQN-QQHKYVCSNRPTTC
        VQCQN QQHKYV S+RPTTC
Subjt:  VQCQN-QQHKYVCSNRPTTC

A0A1S3AZH3 protein CHUP1, chloroplastic    1.2e-277    84.63
Query:  MVAGKVKLAMGLHKPPASKPVETSP-APPPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA
        MVAGKVK+AMGL K PAS+ VE+SP    P QPSP+SGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELR+REARL T+LLEHKLLKESVA
Subjt:  MVAGKVKLAMGLHKPPASKPVETSP-APPPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA

Query:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS
        IVP+LE  ISTKD EIERA KRILFLEAENERLRV++EEVKQS EE+RR S+ER+ AMEGEI ELK+MALDRSRME++ END+ S   RFQ LMEVSGKS
Subjt:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS

Query:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSS----TSSSSSTSSSGNVEKTIPAPP
        NLIRNLKRATKCS++V+NQ+NHKVE PE KK EVETER R+S+C +EE+A+STLSN+KSRIPRVP+PPPKPSSSS    T+SSSST SS ++EK IPAPP
Subjt:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSS----TSSSSSTSSSGNVEKTIPAPP

Query:  PVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIR
        PVPT  +    PPP KSAPPPPPPPPKGK P+ +KVRRIPEVVEFYHSLMRRDSRR+SG+GV + PSTANARDMIGEIENRS HLLAIKTDVETQGDFIR
Subjt:  PVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIR

Query:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNL
         LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLE GVYNL
Subjt:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNL

Query:  SRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCHVQ
        SRMRESA K+YK FQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGP+EEELIVQGVRFAFRVHQFAGGFDVETM+AFQELRDKASSCHVQ
Subjt:  SRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCHVQ

Query:  CQN-QQHKYVCSNRPTTC
        CQN QQHKYV S+RPTTC
Subjt:  CQN-QQHKYVCSNRPTTC

A0A5D3CMM2 Protein CHUP1    4.7e-274    84.92
Query:  MVAGKVKLAMGLHKPPASKPVETSP-APPPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA
        MVAGKVK+AMGL K PAS+ VE+SP    P QPSP+SGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLR VEELR+REARL T+LLEHKLLKESVA
Subjt:  MVAGKVKLAMGLHKPPASKPVETSP-APPPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA

Query:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS
        IVP+LE  ISTKD EIERA KRILFLEAENERLRV++EEVKQS EE+RR S+ER+ AMEGEI ELK+MALDRSRME++ END+ S   RFQ LMEVSGKS
Subjt:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS

Query:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSS-----TSSSSSTSSSGNVEKTIPAP
        NLIRNLKRATKCS++V+NQ+NHKVE PE KK EVETER R+S+C +EE+A+STLSN+KSRIPRVPKPPPKPSSSS     TSSSSST SS ++EK IPAP
Subjt:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSS-----TSSSSSTSSSGNVEKTIPAP

Query:  PPVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFI
        PPVPT  +    PPP KSAPPPPPPPPKGK P+ +KVRRIPEVVEFYHSLMRRDSRR+SG+GV + PSTANARDMIGEIENRS HLLAIKTDVETQGDFI
Subjt:  PPVPTMRI----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFI

Query:  RFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYN
        RFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLE GVYN
Subjt:  RFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYN

Query:  LSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCHV
        LSRMRESA K+YK FQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGP+EEELIVQGVRFAFRVHQFAGGFDVETM+AFQELRDKASSCHV
Subjt:  LSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCHV

Query:  QCQN-QQHKY
        QCQN QQHK+
Subjt:  QCQN-QQHKY

A0A6J1EFK1 protein CHUP1, chloroplastic-like    6.6e-276    84.54
Query:  MVAGKVKLAMGLHKPPASKPVETSPAP-PPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA
        MVAGKVKLAMGL K PAS+ VE+SP P  P QPSP+SGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL+ VEELR+REARL T+LLEHKLLKESVA
Subjt:  MVAGKVKLAMGLHKPPASKPVETSPAP-PPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA

Query:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS
        IVPMLE  I+TKD EIERA KRILFLEAENERLRVE+EEVKQS EEQRR S+ERV AMEGEI ELK+MALDR RME++ END+ S   RFQ LMEVSGKS
Subjt:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS

Query:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSS------STSSSSSTSSSGNVEKTIPA
        NLIR+LKR TK S++V+ Q+NHKVE PEAKK EVETER R+S+  +EE+A+STLSNVKSRIPRVPKPPPKPSSS      S+SSS+ST SSG+ EK IPA
Subjt:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSS------STSSSSSTSSSGNVEKTIPA

Query:  PPPVPTMRI-----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPT        PPP KSAPPPPPPPPKGK P  +KVRRIPEVVEFYHSLMRRDSRRE G+GV E PS+ANARDMIGEIENRSTHLLAIKTDVETQGD
Subjt:  PPPVPTMRI-----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLE GV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGV

Query:  YNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSC
        YNLSRMRESATK+YK FQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGP+EEELIV+GVRFAFRVHQFAGGFDVETM+AFQELRDKASSC
Subjt:  YNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSC

Query:  HVQCQNQQHKYVC-SNRPTTC
        HVQCQNQQHKYVC SNRPTTC
Subjt:  HVQCQNQQHKYVC-SNRPTTC

A0A6J1KWU6 protein CHUP1, chloroplastic-like    3.3e-275    84.22
Query:  MVAGKVKLAMGLHKPPASKPVETSPAP-PPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA
        MVAGKVKLAMGL K PA + VE+SP P  P QPSP+SGK+SQKTVFSRSFGVYFPRSSAQVQPR PDVTELL+ VEELR+REARL T+LLEHKLLKESVA
Subjt:  MVAGKVKLAMGLHKPPASKPVETSPAP-PPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVA

Query:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS
        IVPMLE  I+TKD EIERA KRILFLEAENERLRVE+EEVKQS EEQRR S+ERV AMEGEI ELK+MALDR RME++ END+ S   RFQ LMEVSGKS
Subjt:  IVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKS

Query:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSS------STSSSSSTSSSGNVEKTIPA
        NLIR+LKR TK S++V+ Q+NHKVE PEAKK EVETER R+S+  +EE+A+STLSN+KSRIPRVPKPPPKPSSS      S+SSS+ST SSG+ EK IPA
Subjt:  NLIRNLKRATKCSESVINQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSS------STSSSSSTSSSGNVEKTIPA

Query:  PPPVPTMRI-----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGD
        PPPVPT        PPP KSAPPPPPPPPKGK P S+KVRRIPEVVEFYHSLMRRDSRRE G+GV E PS+ANARDMIGEIENRS HLLAIKTDVETQGD
Subjt:  PPPVPTMRI-----PPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGD

Query:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGV
        FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFRGDARQPCGSALKKMQALLEKLE GV
Subjt:  FIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGV

Query:  YNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSC
        YNLSRMRESATK+YK FQIPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGP+EEELIV+GVRFAFRVHQFAGGFDVETM+AFQELRDKASSC
Subjt:  YNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSC

Query:  HVQCQNQQHKYVC-SNRPTTC
        HVQCQNQQHKYVC SNRPTTC
Subjt:  HVQCQNQQHKYVC-SNRPTTC

SwissProt top hits    e value    % identity    Alignment
Q1PEB4 Uncharacterized protein At4g04980    4.1e-09    22.9
Query:  EKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTEN--DDFSPPHRFQALMEVSGKSNLI
        E  I  ++ E E     +L  E   + ++ E        E +    +      EG   E  R+    +  EI  ++  +DF   H     +E        
Subjt:  EKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTEN--DDFSPPHRFQALMEVSGKSNLI

Query:  RNLKRATKCSESVINQENHKVEDPEAKKNEV---ETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTIPAPPP---
                 +E  I+  +H +ED E + + +   ETE    ++ ++E     T S   S    VP PPP  S  + S + ST ++ +  ++ P PPP   
Subjt:  RNLKRATKCSESVINQENHKVEDPEAKKNEV---ETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTIPAPPP---

Query:  ---VPTMRIPPPLKSA-----------------------PPPPPPPPKGKS--PVSSKVRRIPEVVEFYHSLM-RRDSRRESGA---------GVAELPS
            P    PPP+  A                       P PP PP  G+S    +SK+RR  ++   Y +L  + + R   G           VAE   
Subjt:  ---VPTMRIPPPLKSA-----------------------PPPPPPPPKGKS--PVSSKVRRIPEVVEFYHSLM-RRDSRRESGA---------GVAELPS

Query:  TANAR----DMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ-WPEQKADALREAAFGYCDLK
           AR    D + E+  RS++   I+ DV+     I  L   + +    D+++++ F   ++  L  L DE  VL  F+ +PE+K + +R A   Y  L 
Subjt:  TANAR----DMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ-WPEQKADALREAAFGYCDLK

Query:  KLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVS---VKLAMKYMKRVSAELETVGGGPDEE
         +  E  +++     P    L K++    K +  +  + R ++   K +K + I +++ +   +   +  VS   ++LA+K  +  + E +       +E
Subjt:  KLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVS---VKLAMKYMKRVSAELETVGGGPDEE

Query:  E---LIVQGVRFAFRVHQFAGGFD
        E    + +  +FAF+V+ FAGG D
Subjt:  E---LIVQGVRFAFRVHQFAGGFD

Q9LI74 Protein CHUP1, chloroplastic    3.4e-88    51.58
Query:  NQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTIPAPPPVPTMRIPPPLKSAPPPPPPP
        +Q N   E  E K +E            A  +    L +++ R PRVP+PPP+ +    S++  ++         P PPP P    PPP    PPPPPPP
Subjt:  NQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTIPAPPPVPTMRIPPPLKSAPPPPPPP

Query:  PKG---KSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAEL---PSTANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVP
        P      +   +KV R PE+VEFY SLM+R+S++E    +       S+A   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ 
Subjt:  PKG---KSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAEL---PSTANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVP

Query:  FVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVE
        FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E+ VY L R R+ A  +YK F IPV+
Subjt:  FVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVE

Query:  WMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASS
        W+ D G+V +IKL SV+LA KYMKRV+ EL++V G    P+ E L++QGVRFAFRVHQFAGGFD E+MKAF+ELR +A +
Subjt:  WMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASS

Q9LI74 Protein CHUP1, chloroplastic    6.0e+00    24.71
Query:  DVTELLRTVEELREREARLMTELLEHKLLKESVAIVPMLEKSISTKDVEIERACKRILFLEAENERL----------RVEMEEVKQSNEEQRRVSEERVM
        ++  L + V+EL ERE +L  ELLE+  LKE  + +  L++ +  K VEI+     I  L+AE ++L          R E+E  +   +E +R  +    
Subjt:  DVTELLRTVEELREREARLMTELLEHKLLKESVAIVPMLEKSISTKDVEIERACKRILFLEAENERL----------RVEMEEVKQSNEEQRRVSEERVM

Query:  AMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKSNLIRNLKRATKCSESVINQENHKVEDPEAK
          +G+++ LK+        E    N D     + +A+ ++  +   +  LKR  +  +    + + K++  EA+
Subjt:  AMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKSNLIRNLKRATKCSESVINQENHKVEDPEAK

Arabidopsis top hits    e value    % identity    Alignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown    4.6e-64    42.09
Query:  SNLIRNLK--RATKCSESVINQENHKVEDPEAK-----KNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTI
        S+L+R +K  +A       + +ENH++    A+      N    E  R S  + +  +    SN      + P+          S  S+T       + +
Subjt:  SNLIRNLK--RATKCSESVINQENHKVEDPEAK-----KNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTI

Query:  PAPPPVPTMRIPPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIR
          P P PT++      + PPPPPP P  ++     VRR PEVVEFY +L +R+S   +      + S A  R+MIGEIENRS +L  IK+D +   D I 
Subjt:  PAPPPVPTMRIPPPLKSAPPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIR

Query:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHF-QWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYN
         LI +VE A+FTDI +V  FVKW+D+ELS LVDERAVLKHF +WPE+K D+LREAA  Y   K L +E  SF+ + +     AL+++Q+L ++LE  V N
Subjt:  FLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHF-QWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYN

Query:  LSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELR
          +MR+S  K+YK FQIP EWMLD G++ Q+K  S++LA +YMKR++ ELE+ G G  E  L++QGVRFA+ +HQFAGGFD ET+  F EL+
Subjt:  LSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELR

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein    2.4e-89    51.58
Query:  NQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTIPAPPPVPTMRIPPPLKSAPPPPPPP
        +Q N   E  E K +E            A  +    L +++ R PRVP+PPP+ +    S++  ++         P PPP P    PPP    PPPPPPP
Subjt:  NQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTIPAPPPVPTMRIPPPLKSAPPPPPPP

Query:  PKG---KSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAEL---PSTANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVP
        P      +   +KV R PE+VEFY SLM+R+S++E    +       S+A   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ 
Subjt:  PKG---KSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAEL---PSTANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVP

Query:  FVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVE
        FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E+ VY L R R+ A  +YK F IPV+
Subjt:  FVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVE

Query:  WMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASS
        W+ D G+V +IKL SV+LA KYMKRV+ EL++V G    P+ E L++QGVRFAFRVHQFAGGFD E+MKAF+ELR +A +
Subjt:  WMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASS

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein    4.3e-01    24.71
Query:  DVTELLRTVEELREREARLMTELLEHKLLKESVAIVPMLEKSISTKDVEIERACKRILFLEAENERL----------RVEMEEVKQSNEEQRRVSEERVM
        ++  L + V+EL ERE +L  ELLE+  LKE  + +  L++ +  K VEI+     I  L+AE ++L          R E+E  +   +E +R  +    
Subjt:  DVTELLRTVEELREREARLMTELLEHKLLKESVAIVPMLEKSISTKDVEIERACKRILFLEAENERL----------RVEMEEVKQSNEEQRRVSEERVM

Query:  AMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKSNLIRNLKRATKCSESVINQENHKVEDPEAK
          +G+++ LK+        E    N D     + +A+ ++  +   +  LKR  +  +    + + K++  EA+
Subjt:  AMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKSNLIRNLKRATKCSESVINQENHKVEDPEAK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein    2.4e-89    51.58
Query:  NQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTIPAPPPVPTMRIPPPLKSAPPPPPPP
        +Q N   E  E K +E            A  +    L +++ R PRVP+PPP+ +    S++  ++         P PPP P    PPP    PPPPPPP
Subjt:  NQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTIPAPPPVPTMRIPPPLKSAPPPPPPP

Query:  PKG---KSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAEL---PSTANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVP
        P      +   +KV R PE+VEFY SLM+R+S++E    +       S+A   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ 
Subjt:  PKG---KSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAEL---PSTANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVP

Query:  FVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVE
        FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E+ VY L R R+ A  +YK F IPV+
Subjt:  FVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVE

Query:  WMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASS
        W+ D G+V +IKL SV+LA KYMKRV+ EL++V G    P+ E L++QGVRFAFRVHQFAGGFD E+MKAF+ELR +A +
Subjt:  WMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASS

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein    4.3e-01    24.71
Query:  DVTELLRTVEELREREARLMTELLEHKLLKESVAIVPMLEKSISTKDVEIERACKRILFLEAENERL----------RVEMEEVKQSNEEQRRVSEERVM
        ++  L + V+EL ERE +L  ELLE+  LKE  + +  L++ +  K VEI+     I  L+AE ++L          R E+E  +   +E +R  +    
Subjt:  DVTELLRTVEELREREARLMTELLEHKLLKESVAIVPMLEKSISTKDVEIERACKRILFLEAENERL----------RVEMEEVKQSNEEQRRVSEERVM

Query:  AMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKSNLIRNLKRATKCSESVINQENHKVEDPEAK
          +G+++ LK+        E    N D     + +A+ ++  +   +  LKR  +  +    + + K++  EA+
Subjt:  AMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKSNLIRNLKRATKCSESVINQENHKVEDPEAK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein    2.4e-89    51.58
Query:  NQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTIPAPPPVPTMRIPPPLKSAPPPPPPP
        +Q N   E  E K +E            A  +    L +++ R PRVP+PPP+ +    S++  ++         P PPP P    PPP    PPPPPPP
Subjt:  NQENHKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTIPAPPPVPTMRIPPPLKSAPPPPPPP

Query:  PKG---KSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAEL---PSTANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVP
        P      +   +KV R PE+VEFY SLM+R+S++E    +       S+A   +MIGEIENRST LLA+K DVETQGDF++ L  EV  +SFTDIED++ 
Subjt:  PKG---KSPVSSKVRRIPEVVEFYHSLMRRDSRRESGAGVAEL---PSTANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVP

Query:  FVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVE
        FV WLD+ELS+LVDERAVLKHF WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E+ VY L R R+ A  +YK F IPV+
Subjt:  FVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVE

Query:  WMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASS
        W+ D G+V +IKL SV+LA KYMKRV+ EL++V G    P+ E L++QGVRFAFRVHQFAGGFD E+MKAF+ELR +A +
Subjt:  WMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGG---PDEEELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASS

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 1.4e-174 | % identity: 59.12
Query:  MVAGKVKLAMGLHKPPASK-------PVETSPAPPPPQPSPTSGKVS-------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRTVEELRER
        MVAGKV++ MG HK P++K       P+   P PPPP   P+SG  +        K  F+RSFGVYFPR+SAQV            V+EL R VEELRER
Subjt:  MVAGKVKLAMGLHKPPASK-------PVETSPAPPPPQPSPTSGKVS-------QKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRTVEELRER

Query:  EARLMTELLEHKLLKESVAIVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTEN
        EA L TE LE KLL+ESV+++P+LE  I+ K+ EI+   K    L  +NERLR E +     +EE RR  E R   ME EIVEL+++        + +E+
Subjt:  EARLMTELLEHKLLKESVAIVPMLEKSISTKDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTEN

Query:  DD--FSPPHRFQALMEVSGKSNLIRNLKRA---TKCSESVINQENHK--------VEDPEAKKNEVET-ERSRNSQCYAEEIADSTLSNVKSRIPRVPKP
        DD   S   RFQ LM+VS KSNLIR+LKR        E + NQEN           +    +K+E+E+  RS NS+   E    S+LS V+SR+PRVPKP
Subjt:  DD--FSPPHRFQALMEVSGKSNLIRNLKRA---TKCSESVINQENHK--------VEDPEAKKNEVET-ERSRNSQCYAEEIADSTLSNVKSRIPRVPKP

Query:  PPKPSSSSTSSSSSTSSSGNVEKTI---PAPPPVPTMRIPPPLKSA-----PPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRD---SRRES----GA
        PPK  S S   S+   +    +K+I   P PPP P ++ PPP  S      PPPPPPPPK  S  S+KVRR+PEVVEFYHSLMRRD   SRR+S     A
Subjt:  PPKPSSSSTSSSSSTSSSGNVEKTI---PAPPPVPTMRIPPPLKSA-----PPPPPPPPKGKSPVSSKVRRIPEVVEFYHSLMRRD---SRRES----GA

Query:  GVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCD
            + + +NARDMIGEIENRS +LLAIKTDVETQGDFIRFLIKEV NA+F+DIEDVVPFVKWLDDELSYLVDERAVLKHF+WPEQKADALREAAF Y D
Subjt:  GVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCD

Query:  LKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETV-GGGPDEE
        LKKL SEAS FR D RQ   SALKKMQAL EKLE GVY+LSRMRESA  K+K FQIPV+WML+ GI SQIKL SVKLAMKYMKRVSAELE + GGGP+EE
Subjt:  LKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETV-GGGPDEE

Query:  ELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCHVQCQNQQHKYVCSNRPTTC
        ELIVQGVRFAFRVHQFAGGFD ETMKAF+ELRDKA SCHVQCQ+Q H++    R T C
Subjt:  ELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCHVQCQNQQHKYVCSNRPTTC


Sequences
CDS sequence
ATGGTAGCTGGGAAGGTCAAGCTCGCAATGGGGCTACACAAGCCTCCGGCGAGTAAACCGGTCGAGACCTCGCCGGCGCCGCCGCCGCCGCAGCCTTCTCCGACCTCGGG
CAAGGTTTCTCAGAAAACAGTCTTCTCCCGCTCGTTTGGCGTGTATTTCCCTCGCTCTTCCGCTCAGGTTCAGCCTCGGCCGCCGGACGTGACGGAGCTCCTCCGTACAG
TCGAGGAGCTGCGAGAAAGAGAAGCGCGGTTGATGACGGAGCTATTGGAGCACAAGCTGTTGAAGGAATCTGTCGCCATTGTTCCTATGCTTGAGAAGTCAATCTCCACG
AAGGATGTGGAGATTGAAAGAGCGTGTAAGCGGATACTGTTCTTGGAGGCGGAGAATGAGAGGTTGAGAGTTGAAATGGAGGAAGTGAAGCAGAGTAATGAAGAACAGAG
GAGAGTGAGTGAAGAGAGAGTAATGGCAATGGAAGGTGAAATCGTGGAGTTGAAGAGAATGGCGTTGGATCGAAGCAGAATGGAGATTGTTACGGAGAATGACGATTTCT
CGCCGCCGCATAGGTTTCAGGCATTAATGGAGGTTTCTGGAAAATCTAACCTAATCAGGAACTTGAAAAGAGCGACGAAATGTTCGGAATCTGTAATCAATCAAGAGAAT
CACAAGGTTGAAGATCCAGAGGCGAAGAAAAATGAAGTTGAAACCGAGAGATCGAGAAACTCGCAATGTTACGCTGAAGAAATCGCCGATTCCACTCTCTCGAATGTAAA
ATCGCGAATACCTAGGGTTCCGAAACCTCCTCCCAAACCTTCTTCATCTTCCACTTCTTCTTCATCATCAACCAGTTCTTCAGGTAATGTAGAGAAAACGATTCCCGCTC
CACCGCCTGTTCCAACCATGCGGATTCCGCCGCCGTTGAAATCGGCGCCGCCGCCTCCTCCGCCGCCTCCCAAAGGTAAGAGCCCGGTGTCTTCGAAGGTGCGGCGGATT
CCGGAGGTTGTTGAGTTCTATCATTCGTTAATGCGGAGGGATTCCCGGCGAGAATCCGGCGCCGGCGTTGCGGAACTGCCGTCGACCGCCAATGCTCGTGACATGATCGG
AGAGATTGAGAACCGGTCCACTCACTTACTCGCTATAAAGACGGATGTAGAGACTCAAGGGGATTTCATTAGGTTCTTGATAAAAGAAGTTGAAAATGCTTCATTTACTG
ACATTGAGGACGTTGTACCATTTGTTAAATGGTTGGATGATGAGCTCTCGTATCTGGTGGACGAAAGAGCCGTGCTTAAACACTTCCAGTGGCCGGAGCAAAAGGCCGAC
GCTCTGCGTGAGGCTGCATTTGGCTATTGCGATCTAAAGAAGCTGGAATCCGAAGCGTCATCGTTTCGTGGCGATGCCCGCCAGCCCTGTGGTTCGGCTCTCAAGAAGAT
GCAAGCTTTGCTGGAAAAGTTGGAGCGTGGCGTCTATAATTTGTCTAGAATGCGTGAGTCTGCAACTAAGAAATACAAAGGGTTTCAAATTCCAGTGGAATGGATGCTTG
ATTGTGGAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCGGCAGAGCTTGAAACAGTCGGTGGTGGACCAGATGAAGAA
GAGCTGATTGTTCAAGGCGTTAGATTTGCTTTCCGTGTGCATCAGTTTGCAGGAGGGTTTGATGTGGAAACGATGAAGGCATTTCAAGAACTGAGAGATAAAGCGAGTTC
ATGTCATGTACAATGCCAAAACCAGCAACATAAGTACGTGTGCAGTAATAGGCCTACAACTTGTTAA
mRNA sequence
CTTCTCTCTGTTACCTATTTTTTCTTTGAAACACTCTCTGTTAATCTGTATTTTCTCAAACTCCAACTTTCTTTATCACTCTCAGATGAGAACTAAATTGAAAATTGAAG
AACTGTAAGAACTGAGTCCCAACCAACGAGAGCTAAGCCATGGTAGCTGGGAAGGTCAAGCTCGCAATGGGGCTACACAAGCCTCCGGCGAGTAAACCGGTCGAGACCTC
GCCGGCGCCGCCGCCGCCGCAGCCTTCTCCGACCTCGGGCAAGGTTTCTCAGAAAACAGTCTTCTCCCGCTCGTTTGGCGTGTATTTCCCTCGCTCTTCCGCTCAGGTTC
AGCCTCGGCCGCCGGACGTGACGGAGCTCCTCCGTACAGTCGAGGAGCTGCGAGAAAGAGAAGCGCGGTTGATGACGGAGCTATTGGAGCACAAGCTGTTGAAGGAATCT
GTCGCCATTGTTCCTATGCTTGAGAAGTCAATCTCCACGAAGGATGTGGAGATTGAAAGAGCGTGTAAGCGGATACTGTTCTTGGAGGCGGAGAATGAGAGGTTGAGAGT
TGAAATGGAGGAAGTGAAGCAGAGTAATGAAGAACAGAGGAGAGTGAGTGAAGAGAGAGTAATGGCAATGGAAGGTGAAATCGTGGAGTTGAAGAGAATGGCGTTGGATC
GAAGCAGAATGGAGATTGTTACGGAGAATGACGATTTCTCGCCGCCGCATAGGTTTCAGGCATTAATGGAGGTTTCTGGAAAATCTAACCTAATCAGGAACTTGAAAAGA
GCGACGAAATGTTCGGAATCTGTAATCAATCAAGAGAATCACAAGGTTGAAGATCCAGAGGCGAAGAAAAATGAAGTTGAAACCGAGAGATCGAGAAACTCGCAATGTTA
CGCTGAAGAAATCGCCGATTCCACTCTCTCGAATGTAAAATCGCGAATACCTAGGGTTCCGAAACCTCCTCCCAAACCTTCTTCATCTTCCACTTCTTCTTCATCATCAA
CCAGTTCTTCAGGTAATGTAGAGAAAACGATTCCCGCTCCACCGCCTGTTCCAACCATGCGGATTCCGCCGCCGTTGAAATCGGCGCCGCCGCCTCCTCCGCCGCCTCCC
AAAGGTAAGAGCCCGGTGTCTTCGAAGGTGCGGCGGATTCCGGAGGTTGTTGAGTTCTATCATTCGTTAATGCGGAGGGATTCCCGGCGAGAATCCGGCGCCGGCGTTGC
GGAACTGCCGTCGACCGCCAATGCTCGTGACATGATCGGAGAGATTGAGAACCGGTCCACTCACTTACTCGCTATAAAGACGGATGTAGAGACTCAAGGGGATTTCATTA
GGTTCTTGATAAAAGAAGTTGAAAATGCTTCATTTACTGACATTGAGGACGTTGTACCATTTGTTAAATGGTTGGATGATGAGCTCTCGTATCTGGTGGACGAAAGAGCC
GTGCTTAAACACTTCCAGTGGCCGGAGCAAAAGGCCGACGCTCTGCGTGAGGCTGCATTTGGCTATTGCGATCTAAAGAAGCTGGAATCCGAAGCGTCATCGTTTCGTGG
CGATGCCCGCCAGCCCTGTGGTTCGGCTCTCAAGAAGATGCAAGCTTTGCTGGAAAAGTTGGAGCGTGGCGTCTATAATTTGTCTAGAATGCGTGAGTCTGCAACTAAGA
AATACAAAGGGTTTCAAATTCCAGTGGAATGGATGCTTGATTGTGGAATTGTGAGTCAGATCAAGCTTGTCTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCG
GCAGAGCTTGAAACAGTCGGTGGTGGACCAGATGAAGAAGAGCTGATTGTTCAAGGCGTTAGATTTGCTTTCCGTGTGCATCAGTTTGCAGGAGGGTTTGATGTGGAAAC
GATGAAGGCATTTCAAGAACTGAGAGATAAAGCGAGTTCATGTCATGTACAATGCCAAAACCAGCAACATAAGTACGTGTGCAGTAATAGGCCTACAACTTGTTAATCTG
CAGCAATCTAGCCTTCAGAGGGCTGCTCTGTTTTTTGGTCGGAAAGATGATGTTTGTGAATATCAATAACAGCGGCAGTTTTGGGATGGAAACCCCATTTATTCATTTAT
AAAGAAAGTGAACTGAGGTGGTAATATTATATTGGATAAAGGAAGGGAAGCCATTTTCTTCACTAAATATGCATTCAACTTCAATTCGGTTTGAGATGAACCTCATATAC
GGCCTAATAATAGGTTCGTTATTGTAATACGTGTTTGCCTTCTCGGGTTGGCGTAATGATTGAGCACCAGGGCTTTGAGGCGATTGTGAAAAATAGTTTTCTAGACGATC
TCAATTTTATTTTTGATCTTAGGACATATCTGATGCCACAGTCTAGAAATGAATCAGTG
Protein sequence
MVAGKVKLAMGLHKPPASKPVETSPAPPPPQPSPTSGKVSQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRTVEELREREARLMTELLEHKLLKESVAIVPMLEKSIST
KDVEIERACKRILFLEAENERLRVEMEEVKQSNEEQRRVSEERVMAMEGEIVELKRMALDRSRMEIVTENDDFSPPHRFQALMEVSGKSNLIRNLKRATKCSESVINQEN
HKVEDPEAKKNEVETERSRNSQCYAEEIADSTLSNVKSRIPRVPKPPPKPSSSSTSSSSSTSSSGNVEKTIPAPPPVPTMRIPPPLKSAPPPPPPPPKGKSPVSSKVRRI
PEVVEFYHSLMRRDSRRESGAGVAELPSTANARDMIGEIENRSTHLLAIKTDVETQGDFIRFLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKAD
ALREAAFGYCDLKKLESEASSFRGDARQPCGSALKKMQALLEKLERGVYNLSRMRESATKKYKGFQIPVEWMLDCGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPDEE
ELIVQGVRFAFRVHQFAGGFDVETMKAFQELRDKASSCHVQCQNQQHKYVCSNRPTTC