; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr007036 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr007036
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein CHUP1, chloroplastic
Genome locationtig00005158:417..6391
RNA-Seq ExpressionSgr007036
SyntenySgr007036
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
GO:0005525 - GTP binding (molecular function)
InterPro domainsIPR011719 - Conserved hypothetical protein CHP02058
IPR037103 - Tubulin/FtsZ-like, C-terminal domain
IPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052630.1 protein CHUP1 [Cucumis melo var. makuwa]0.0e+0084.07Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS
        IVP+LENEI TKDAEIERASKRILFLEAENERLRV+VEE++ +++E+RRESQER+KAMEGEI+ELKKMALDR+RMEL LEND+LSASQRFQGLMEVSGKS
Subjt:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI
        NLIRNLKRATKCSDAVVN DNHKV+ HPE KKEEVETERPRHSRCNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS ++SSSSS  TGS  D + AI
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI

Query:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG
        P PPPVPTKPM PPPPP SKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRRD GSG+TDPPSTANARDMIGEIENRSAHLLAIKTDVETQG
Subjt:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG

Query:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG
        DFI+ LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DARQPC SALKKMQ+LLEKLEHG
Subjt:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG

Query:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT-
        VYNLSRMRESA KRYK F+IPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKA+ 
Subjt:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT-

Query:  -------------------------------------------TAASFMEVEQGGKSAAVSSTAPPMKLLFVEM-------GQDVTAAAMRACRDAISSN
                                                   TA S MEVEQGG+SA V ST PPMKLLFVEM       GQD+TAAAMRACRDAISSN
Subjt:  -------------------------------------------TAASFMEVEQGGKSAAVSSTAPPMKLLFVEM-------GQDVTAAAMRACRDAISSN

Query:  SIPAFRRGMFPWVSISSLYLFI
        SIPAFRRG  P VS   + L I
Subjt:  SIPAFRRGMFPWVSISSLYLFI

KAG6594554.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]5.9e-29679.95Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPKPST AQPSPSSGK+ QKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS
        IVP+LENEI TKDAEIERASKRILFLEAENERLRVEVEE++ +++EQRRES+ER+KAMEGEIAELKKMALDR RMEL LEND+LSASQRFQGLMEVSGKS
Subjt:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI
        NLIR+LKR TK SD VV  DNHKV+  PEAKKEEVETERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS +SSSSS+S TGS  D +  I
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI

Query:  PPPPPVPTKPM-TPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQ
        P PPPVPTKP   PPPPP SKSAPPPPPPPPKGKR  PAKVRRIPEVVEFYHSLMRRDSRR+ GS +T+PPS+ANARDMIGEIENRSAHLLAIKTDVETQ
Subjt:  PPPPPVPTKPM-TPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQ

Query:  GDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEH
        GDFI+ LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFR DARQPC+SALKKMQ+LLEKLEH
Subjt:  GDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEH

Query:  GVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT
        GVYNLSRMRESATKRYK F+IPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKA+
Subjt:  GVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT

Query:  T-----------------------------------------------------------AASFMEVEQGGKSAAVSSTAPPMKLLFVEM-------GQD
        +                                                           AA  MEVEQG +SA VSSTA PMKLLFVEM       GQD
Subjt:  T-----------------------------------------------------------AASFMEVEQGGKSAAVSSTAPPMKLLFVEM-------GQD

Query:  VTAAAMRACRDAISSNSIPAFRRGMFPWVSISSLYLFI
        +TAAAMRACRDAI SNSIPAFRRG  P VS   + L I
Subjt:  VTAAAMRACRDAISSNSIPAFRRGMFPWVSISSLYLFI

KAG7026530.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-29478.38Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPKPST AQPSPSSGK+ QKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS
        IVP+LENEI TKDAEIERASKRILFLEAENERLRVEVEE++ +++EQRRES+ER+KAMEGEIAELKKMALDR RMEL LEND+LSASQRFQGLMEVSGKS
Subjt:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI
        NLIRNLKR TK SD VV  DNHKV+  PEAKKEEVETERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS +SSSSS+S TGS  D +  I
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI

Query:  PPPPPVPTKPM-TPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQ
        P PPPVPTKP   PPPPP SKSAPPPPPPPPKGKR  PAKVRRIPEVVEFYHSLMRRDSRR+ GS +T+PPS+ANARDMIGEIENRSAHLLAIKTDVETQ
Subjt:  PPPPPVPTKPM-TPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQ

Query:  GDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEH
        GDFI+ LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFR DARQPC+SALKKMQ+LLEKLEH
Subjt:  GDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEH

Query:  GVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT
        GVYNLSRMRESATKRYK F+IPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKA+
Subjt:  GVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT

Query:  T---------------------------------------------------------------------------AASFMEVEQGGKSAAVSSTAPPMK
        +                                                                           AA  MEVEQG +SA VSSTA PMK
Subjt:  T---------------------------------------------------------------------------AASFMEVEQGGKSAAVSSTAPPMK

Query:  LLFVEM-------GQDVTAAAMRACRDAISSNSIPAFRRGMFPWVSISSLYLFI
        LLFVEM       GQD+TAAAMRACRDAI SNSIPAFRRG  P VS   + L I
Subjt:  LLFVEM-------GQDVTAAAMRACRDAISSNSIPAFRRGMFPWVSISSLYLFI

XP_004134665.1 protein CHUP1, chloroplastic [Cucumis sativus]1.2e-28891.67Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS
        IVP+LENEI TKDAEIERASKRILFLEAENERLRV+VEE + +++E+RRESQERIKAMEGE+AELKKMALDR+RMEL LEND+LSASQRFQGLMEVSGKS
Subjt:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI
        NLIRNLKRATKCSDAVVN DNHKV+ HPEAKKEEVETERPRHSRCNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS ++S+SSSS TGS  D + AI
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI

Query:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG
        P PPPVPTK M PPPPP SKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRD GSG+T+PPSTANARDMIGEIENRSAHLLAIKTDVETQG
Subjt:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG

Query:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG
        DFI+ LIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DARQPC SALKKMQ+LLEKLEHG
Subjt:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG

Query:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT
        VYNLSRMRESA KRYK F+IPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKA++
Subjt:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT

XP_008439756.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo]2.4e-28991.83Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS
        IVP+LENEI TKDAEIERASKRILFLEAENERLRV+VEE++ +++E+RRESQERIKAMEGEI+ELKKMALDR+RMEL LEND+LSASQRFQGLMEVSGKS
Subjt:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI
        NLIRNLKRATKCSDAVVN DNHKV+ HPE KKEEVETERPRHSRCNSEELAESTLSN+KSRIPRVP+PPPKPSSSSSSS ++SSSS   TGS  D + AI
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI

Query:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG
        P PPPVPTKPM PPPPP SKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRRD GSG+TDPPSTANARDMIGEIENRSAHLLAIKTDVETQG
Subjt:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG

Query:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG
        DFI+LLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DARQPC SALKKMQ+LLEKLEHG
Subjt:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG

Query:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT
        VYNLSRMRESA KRYK F+IPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKA++
Subjt:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT

TrEMBL top hitse value%identityAlignment
A0A0A0KHU8 Uncharacterized protein5.7e-28991.67Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS
        IVP+LENEI TKDAEIERASKRILFLEAENERLRV+VEE + +++E+RRESQERIKAMEGE+AELKKMALDR+RMEL LEND+LSASQRFQGLMEVSGKS
Subjt:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI
        NLIRNLKRATKCSDAVVN DNHKV+ HPEAKKEEVETERPRHSRCNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS ++S+SSSS TGS  D + AI
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI

Query:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG
        P PPPVPTK M PPPPP SKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRD GSG+T+PPSTANARDMIGEIENRSAHLLAIKTDVETQG
Subjt:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG

Query:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG
        DFI+ LIKEVENASFTDIEDVVPFVKWLDDELS+LVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DARQPC SALKKMQ+LLEKLEHG
Subjt:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG

Query:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT
        VYNLSRMRESA KRYK F+IPVEWMLD GIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKA++
Subjt:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT

A0A1S3AZH3 protein CHUP1, chloroplastic1.2e-28991.83Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS
        IVP+LENEI TKDAEIERASKRILFLEAENERLRV+VEE++ +++E+RRESQERIKAMEGEI+ELKKMALDR+RMEL LEND+LSASQRFQGLMEVSGKS
Subjt:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI
        NLIRNLKRATKCSDAVVN DNHKV+ HPE KKEEVETERPRHSRCNSEELAESTLSN+KSRIPRVP+PPPKPSSSSSSS ++SSSS   TGS  D + AI
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI

Query:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG
        P PPPVPTKPM PPPPP SKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRRD GSG+TDPPSTANARDMIGEIENRSAHLLAIKTDVETQG
Subjt:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG

Query:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG
        DFI+LLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DARQPC SALKKMQ+LLEKLEHG
Subjt:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG

Query:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT
        VYNLSRMRESA KRYK F+IPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKA++
Subjt:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT

A0A5D3CMM2 Protein CHUP10.0e+0084.07Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVKVAMGLQKSPASRKVESSPK ST AQPSPSSGKV QKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS
        IVP+LENEI TKDAEIERASKRILFLEAENERLRV+VEE++ +++E+RRESQER+KAMEGEI+ELKKMALDR+RMEL LEND+LSASQRFQGLMEVSGKS
Subjt:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI
        NLIRNLKRATKCSDAVVN DNHKV+ HPE KKEEVETERPRHSRCNSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS ++SSSSS  TGS  D + AI
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI

Query:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG
        P PPPVPTKPM PPPPP SKSAPPPPPPPPKGKR MPAKVRRIPEVVEFYHSLMRRDSRRD GSG+TDPPSTANARDMIGEIENRSAHLLAIKTDVETQG
Subjt:  PPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQG

Query:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG
        DFI+ LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR DARQPC SALKKMQ+LLEKLEHG
Subjt:  DFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHG

Query:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT-
        VYNLSRMRESA KRYK F+IPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKA+ 
Subjt:  VYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT-

Query:  -------------------------------------------TAASFMEVEQGGKSAAVSSTAPPMKLLFVEM-------GQDVTAAAMRACRDAISSN
                                                   TA S MEVEQGG+SA V ST PPMKLLFVEM       GQD+TAAAMRACRDAISSN
Subjt:  -------------------------------------------TAASFMEVEQGGKSAAVSSTAPPMKLLFVEM-------GQDVTAAAMRACRDAISSN

Query:  SIPAFRRGMFPWVSISSLYLFI
        SIPAFRRG  P VS   + L I
Subjt:  SIPAFRRGMFPWVSISSLYLFI

A0A6J1EFK1 protein CHUP1, chloroplastic-like1.2e-27889.35Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPASRKVESSPKPST AQPSPSSGK+ QKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS
        IVP+LENEI TKDAEIERASKRILFLEAENERLRVEVEE++ +++EQRRESQER+KAMEGEIAELKKMALDR RMEL LEND+LSASQRFQGLMEVSGKS
Subjt:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI
        NLIR+LKR TK SD VV  DNHKV+  PEAKKEEVETERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS +SSSSS+S TGS  D +  I
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI

Query:  PPPPPVPTKPM-TPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQ
        P PPPVPTKP   PPPPP SKSAPPPPPPPPKGKR  PAKVRRIPEVVEFYHSLMRRDSRR+ GSG+T+PPS+ANARDMIGEIENRS HLLAIKTDVETQ
Subjt:  PPPPPVPTKPM-TPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQ

Query:  GDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEH
        GDFI+ LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFR DARQPC SALKKMQ+LLEKLEH
Subjt:  GDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEH

Query:  GVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT
        GVYNLSRMRESATKRYK F+IPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKA+
Subjt:  GVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT

Query:  T
        +
Subjt:  T

A0A6J1KWU6 protein CHUP1, chloroplastic-like1.0e-27789.18Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA
        MVAGKVK+AMGLQKSPA RKVESSPKPST AQPSPSSGK+ QKTVFSRSFGVYFPRSSAQVQPR PDVTELL++VEELRDREARLKTDLLEHKLLKESVA
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVA

Query:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS
        IVP+LENEI TKDAEIERASKRILFLEAENERLRVEVEE++ +++EQRRESQER+KAMEGEIAELKKMALDR RMEL LEND+LSASQRFQGLMEVSGKS
Subjt:  IVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKS

Query:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI
        NLIR+LKR TK SD VV  DNHKV+  PEAKKEEVETERPRHSR NSEELAESTLSN+KSRIPRVPKPPPKPSSSSSSS +SSSSS+S TGS  D +  I
Subjt:  NLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAI

Query:  PPPPPVPTKPM-TPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQ
        P PPPVPTKP   PPPPP SKSAPPPPPPPPKGKR   AKVRRIPEVVEFYHSLMRRDSRR+ GSG+T+PPS+ANARDMIGEIENRSAHLLAIKTDVETQ
Subjt:  PPPPPVPTKPM-TPPPPPSSKSAPPPPPPPPKGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQ

Query:  GDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEH
        GDFI+ LIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLE+EASSFR DARQPC SALKKMQ+LLEKLEH
Subjt:  GDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALREAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEH

Query:  GVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT
        GVYNLSRMRESATKRYK F+IPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIV+GVRFAFRVHQFAGGFDVETMRAFQELRDKA+
Subjt:  GVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKAT

Query:  T
        +
Subjt:  T

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic1.1e-8245.08Show/hide
Query:  KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEEL
        K+++  + E      DR+++ +  E       D + ++RF G + +  K   ++  +             +++ +   E K  E           N+  +
Subjt:  KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEEL

Query:  AESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPA-KVRRIPEVVEF
         +  L +++ R PRVP+PPP+ +    S+   S+    P G          PPPP P     PPPPP     PPPPPP   G+      KV R PE+VEF
Subjt:  AESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPA-KVRRIPEVVEF

Query:  YHSLMRRDSRRDPGSGITDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ
        Y SLM+R+S+++    +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF 
Subjt:  YHSLMRRDSRRDPGSGITDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ

Query:  WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYM
        WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYM
Subjt:  WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYM

Query:  KRVSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT
        KRV+ EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A T
Subjt:  KRVSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT

Q9LI74 Protein CHUP1, chloroplastic6.0e-0124.52Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFLEAENERL----------RVEVEELQLNIDEQRRESQERIK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L          R E+E  +  I E +R+ Q    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFLEAENERL----------RVEVEELQLNIDEQRRESQERIK

Query:  AMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKSNLIRNLKRATK
          +G++  LK+        E    N D    ++ + + ++  +   +  LKR  +
Subjt:  AMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKSNLIRNLKRATK

Arabidopsis top hitse value%identityAlignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown6.9e-6145.9Show/hide
Query:  ERPRHSRCNSEELAESTLSNLKS-RIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDT-DNAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRL
        E+  H          + +SNLKS    R      K  SS   S +  S+  +P     +T    +  P P PT         S+ + PPPPPP P  + L
Subjt:  ERPRHSRCNSEELAESTLSNLKS-RIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDT-DNAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRL

Query:  MPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYL
            VRR PEVVEFY +L +R+S            S A  R+MIGEIENRS +L  IK+D +   D I +LI +VE A+FTDI +V  FVKW+D+ELS L
Subjt:  MPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYL

Query:  VDERAVLKHF-QWPEQKADALREAAFGYCDLKKLESEASSFRD-ARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQI
        VDERAVLKHF +WPE+K D+LREAA  Y   K L +E  SF+D  +   + AL+++QSL ++LE  V N  +MR+S  KRYK F+IP EWMLD+G++ Q+
Subjt:  VDERAVLKHF-QWPEQKADALREAAFGYCDLKKLESEASSFRD-ARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQI

Query:  KLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT
        K  S++LA +YMKR++ ELE+ G G +E  L++QGVRFA+ +HQFAGGFD ET+  F EL+ K TT
Subjt:  KLVSVKLAMKYMKRVSAELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein7.6e-8445.08Show/hide
Query:  KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEEL
        K+++  + E      DR+++ +  E       D + ++RF G + +  K   ++  +             +++ +   E K  E           N+  +
Subjt:  KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEEL

Query:  AESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPA-KVRRIPEVVEF
         +  L +++ R PRVP+PPP+ +    S+   S+    P G          PPPP P     PPPPP     PPPPPP   G+      KV R PE+VEF
Subjt:  AESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPA-KVRRIPEVVEF

Query:  YHSLMRRDSRRDPGSGITDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ
        Y SLM+R+S+++    +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF 
Subjt:  YHSLMRRDSRRDPGSGITDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ

Query:  WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYM
        WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYM
Subjt:  WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYM

Query:  KRVSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT
        KRV+ EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A T
Subjt:  KRVSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein4.3e-0224.52Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFLEAENERL----------RVEVEELQLNIDEQRRESQERIK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L          R E+E  +  I E +R+ Q    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFLEAENERL----------RVEVEELQLNIDEQRRESQERIK

Query:  AMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKSNLIRNLKRATK
          +G++  LK+        E    N D    ++ + + ++  +   +  LKR  +
Subjt:  AMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKSNLIRNLKRATK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein7.6e-8445.08Show/hide
Query:  KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEEL
        K+++  + E      DR+++ +  E       D + ++RF G + +  K   ++  +             +++ +   E K  E           N+  +
Subjt:  KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEEL

Query:  AESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPA-KVRRIPEVVEF
         +  L +++ R PRVP+PPP+ +    S+   S+    P G          PPPP P     PPPPP     PPPPPP   G+      KV R PE+VEF
Subjt:  AESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPA-KVRRIPEVVEF

Query:  YHSLMRRDSRRDPGSGITDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ
        Y SLM+R+S+++    +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF 
Subjt:  YHSLMRRDSRRDPGSGITDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ

Query:  WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYM
        WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYM
Subjt:  WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYM

Query:  KRVSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT
        KRV+ EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A T
Subjt:  KRVSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein4.3e-0224.52Show/hide
Query:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFLEAENERL----------RVEVEELQLNIDEQRRESQERIK
        ++  L ++V+EL +RE +L+ +LLE+  LKE  + +  L+ ++  K  EI+  +  I  L+AE ++L          R E+E  +  I E +R+ Q    
Subjt:  DVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFLEAENERL----------RVEVEELQLNIDEQRRESQERIK

Query:  AMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKSNLIRNLKRATK
          +G++  LK+        E    N D    ++ + + ++  +   +  LKR  +
Subjt:  AMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKSNLIRNLKRATK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein7.6e-8445.08Show/hide
Query:  KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEEL
        K+++  + E      DR+++ +  E       D + ++RF G + +  K   ++  +             +++ +   E K  E           N+  +
Subjt:  KAMEGEIAELKKMALDRNRMELSLEND-----DLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHDNHKVDYHPEAKKEEVETERPRHSRCNSEEL

Query:  AESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPA-KVRRIPEVVEF
         +  L +++ R PRVP+PPP+ +    S+   S+    P G          PPPP P     PPPPP     PPPPPP   G+      KV R PE+VEF
Subjt:  AESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPPKGKRLMPA-KVRRIPEVVEF

Query:  YHSLMRRDSRRDPGSGITDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ
        Y SLM+R+S+++    +       S+A   +MIGEIENRS  LLA+K DVETQGDF++ L  EV  +SFTDIED++ FV WLD+ELS+LVDERAVLKHF 
Subjt:  YHSLMRRDSRRDPGSGITDP---PSTANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQ

Query:  WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYM
        WPE KADALREAAF Y DL KLE + +SF  D    C  ALKKM  LLEK+E  VY L R R+ A  RYK F IPV+W+ D+G+V +IKL SV+LA KYM
Subjt:  WPEQKADALREAAFGYCDLKKLESEASSF-RDARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYM

Query:  KRVSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT
        KRV+ EL++V G    P  E L++QGVRFAFRVHQFAGGFD E+M+AF+ELR +A T
Subjt:  KRVSAELETVGGG---PEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATT

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-16759.56Show/hide
Query:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQP------SPSSGKV-------PQKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRMVEELRDR
        MVAGKV+V MG  KSP+++K +  P P     P       PSSG         P K  F+RSFGVYFPR+SAQV            V+EL R VEELR+R
Subjt:  MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQP------SPSSGKV-------PQKTVFSRSFGVYFPRSSAQVQPRPPD------VTELLRMVEELRDR

Query:  EARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLEN
        EA LKT+ LE KLL+ESV+++PLLE++I  K+ EI+   K    L  +NERLR E +      +E RRE + R K ME EI EL+K+        +S E+
Subjt:  EARLKTDLLEHKLLKESVAIVPLLENEICTKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLEN

Query:  DD--LSASQRFQGLMEVSGKSNLIRNLKRA---TKCSDAVVNHDNHKVDYHPEA-------KKEEVETERPRHSR-CNSEELAE-STLSNLKSRIPRVPK
        DD  LS SQRFQGLM+VS KSNLIR+LKR        + + N +N                +K+E+E+    +SR  NSEEL E S+LS ++SR+PRVPK
Subjt:  DD--LSASQRFQGLMEVSGKSNLIRNLKRA---TKCSDAVVNHDNHKVDYHPEA-------KKEEVETERPRHSR-CNSEELAE-STLSNLKSRIPRVPK

Query:  PPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAIPPPPPVPTKPM--TPPPPPSSKSAPPPPPPPPKGKRL--MPAKVRRIPEVVEFYHSLMRRD---SRR
        PPPK S S   ST + +              +IPPPPP P  P+   PPPPPS   APPPPPPPP  K L    AKVRR+PEVVEFYHSLMRRD   SRR
Subjt:  PPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAIPPPPPVPTKPM--TPPPPPSSKSAPPPPPPPPKGKRL--MPAKVRRIPEVVEFYHSLMRRD---SRR

Query:  DPGSGITDPP----STANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALR
        D   G         + +NARDMIGEIENRS +LLAIKTDVETQGDFI+ LIKEV NA+F+DIEDVVPFVKWLDDELSYLVDERAVLKHF+WPEQKADALR
Subjt:  DPGSGITDPP----STANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERAVLKHFQWPEQKADALR

Query:  EAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETV
        EAAF Y DLKKL SEAS FR D RQ  SSALKKMQ+L EKLEHGVY+LSRMRESA  ++K F+IPV+WML++GI SQIKL SVKLAMKYMKRVSAELE +
Subjt:  EAAFGYCDLKKLESEASSFR-DARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSAELETV

Query:  -GGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKA
         GGGPEEEELIVQGVRFAFRVHQFAGGFD ETM+AF+ELRDKA
Subjt:  -GGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGCTGGGAAGGTGAAGGTCGCAATGGGGCTGCAGAAGTCCCCGGCGAGCAGAAAGGTTGAGAGCTCACCGAAACCGTCGACGCAGGCGCAGCCTTCTCCGAGCTC
TGGTAAGGTTCCTCAGAAAACGGTGTTCTCCCGCTCGTTTGGGGTCTATTTTCCTCGCTCTTCTGCTCAGGTCCAGCCTCGACCGCCTGATGTGACGGAGCTTCTCCGCA
TGGTCGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTTTTGAAGGAGTCTGTCGCCATTGTTCCTCTGCTTGAGAACGAGATCTGT
ACGAAGGATGCTGAGATTGAGAGAGCGTCTAAGCGGATACTGTTCTTGGAGGCAGAGAATGAGAGGTTGAGAGTTGAGGTGGAGGAACTTCAACTGAATATTGACGAACA
GAGGAGAGAGAGTCAGGAAAGAATAAAGGCAATGGAAGGTGAAATCGCAGAGCTGAAGAAAATGGCGTTGGATCGAAACAGAATGGAGCTTAGTTTGGAGAACGACGACC
TTTCGGCCTCCCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACCAAATGTTCGGATGCTGTTGTTAACCACGAC
AATCATAAGGTTGACTATCATCCAGAGGCAAAGAAAGAAGAAGTTGAAACCGAGAGACCGAGACATTCGCGATGTAACTCTGAAGAACTCGCCGAGTCCACTCTCTCTAA
CCTAAAATCACGAATACCTAGGGTTCCAAAACCTCCTCCGAAACCTTCCTCATCTTCCTCTTCTTCTACCTCTTCCTCCTCCTCCTCTTCCTCACCAACTGGCTCTTGTG
TTGACACAGATAACGCGATCCCTCCCCCACCCCCTGTCCCAACAAAGCCAATGACGCCGCCTCCCCCGCCAAGTTCGAAGTCGGCTCCGCCTCCCCCTCCGCCGCCTCCC
AAGGGTAAGAGGCTGATGCCAGCGAAGGTGCGGCGAATACCGGAGGTAGTTGAGTTCTATCATTCATTAATGCGGAGGGACTCCCGACGAGATCCCGGTTCCGGCATTAC
GGACCCGCCGTCGACCGCCAATGCTCGTGACATGATAGGAGAGATCGAGAACCGGTCCGCTCACTTGCTCGCTATAAAGACGGATGTAGAGACTCAAGGGGATTTCATAA
AGTTGTTGATCAAAGAAGTTGAAAATGCTTCATTTACTGACATCGAGGACGTTGTGCCATTTGTCAAATGGTTGGATGATGAGCTCTCATATCTGGTAGATGAAAGAGCC
GTGCTTAAACACTTCCAGTGGCCGGAGCAAAAGGCCGACGCTCTGCGTGAGGCTGCCTTCGGCTATTGCGATCTAAAGAAGCTGGAATCTGAAGCGTCATCGTTTCGTGA
TGCCCGCCAGCCCTGTAGTTCAGCTCTCAAGAAGATGCAATCTTTGCTTGAAAAATTGGAGCATGGCGTATACAATCTGTCGAGAATGCGCGAATCTGCAACTAAGAGAT
ACAAAGTGTTTAAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGATCAAACTTGTGTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCA
GAGCTCGAGACAGTCGGTGGTGGACCTGAAGAAGAAGAGCTGATTGTCCAAGGCGTTAGGTTTGCCTTCCGTGTGCATCAGTTTGCAGGAGGGTTTGATGTGGAAACGAT
GAGGGCATTTCAAGAGCTGAGAGATAAAGCAACTACTGCTGCGTCGTTCATGGAGGTCGAGCAAGGTGGAAAATCTGCTGCTGTTAGCAGCACAGCTCCGCCTATGAAGC
TCTTGTTCGTTGAGATGGGCCAAGATGTCACGGCTGCTGCAATGCGAGCCTGTAGGGATGCCATATCTTCCAATTCGATTCCAGCATTCCGTCGAGGTATGTTCCCGTGG
GTTTCCATATCCTCACTCTATTTGTTTATCTCTGAGCAGTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTAGCTGGGAAGGTGAAGGTCGCAATGGGGCTGCAGAAGTCCCCGGCGAGCAGAAAGGTTGAGAGCTCACCGAAACCGTCGACGCAGGCGCAGCCTTCTCCGAGCTC
TGGTAAGGTTCCTCAGAAAACGGTGTTCTCCCGCTCGTTTGGGGTCTATTTTCCTCGCTCTTCTGCTCAGGTCCAGCCTCGACCGCCTGATGTGACGGAGCTTCTCCGCA
TGGTCGAGGAGTTGCGTGACAGAGAGGCGCGATTGAAGACTGACCTATTGGAGCACAAGCTTTTGAAGGAGTCTGTCGCCATTGTTCCTCTGCTTGAGAACGAGATCTGT
ACGAAGGATGCTGAGATTGAGAGAGCGTCTAAGCGGATACTGTTCTTGGAGGCAGAGAATGAGAGGTTGAGAGTTGAGGTGGAGGAACTTCAACTGAATATTGACGAACA
GAGGAGAGAGAGTCAGGAAAGAATAAAGGCAATGGAAGGTGAAATCGCAGAGCTGAAGAAAATGGCGTTGGATCGAAACAGAATGGAGCTTAGTTTGGAGAACGACGACC
TTTCGGCCTCCCAGAGGTTCCAGGGATTAATGGAGGTCTCGGGAAAGTCTAACCTAATCAGGAACTTGAAAAGAGCGACCAAATGTTCGGATGCTGTTGTTAACCACGAC
AATCATAAGGTTGACTATCATCCAGAGGCAAAGAAAGAAGAAGTTGAAACCGAGAGACCGAGACATTCGCGATGTAACTCTGAAGAACTCGCCGAGTCCACTCTCTCTAA
CCTAAAATCACGAATACCTAGGGTTCCAAAACCTCCTCCGAAACCTTCCTCATCTTCCTCTTCTTCTACCTCTTCCTCCTCCTCCTCTTCCTCACCAACTGGCTCTTGTG
TTGACACAGATAACGCGATCCCTCCCCCACCCCCTGTCCCAACAAAGCCAATGACGCCGCCTCCCCCGCCAAGTTCGAAGTCGGCTCCGCCTCCCCCTCCGCCGCCTCCC
AAGGGTAAGAGGCTGATGCCAGCGAAGGTGCGGCGAATACCGGAGGTAGTTGAGTTCTATCATTCATTAATGCGGAGGGACTCCCGACGAGATCCCGGTTCCGGCATTAC
GGACCCGCCGTCGACCGCCAATGCTCGTGACATGATAGGAGAGATCGAGAACCGGTCCGCTCACTTGCTCGCTATAAAGACGGATGTAGAGACTCAAGGGGATTTCATAA
AGTTGTTGATCAAAGAAGTTGAAAATGCTTCATTTACTGACATCGAGGACGTTGTGCCATTTGTCAAATGGTTGGATGATGAGCTCTCATATCTGGTAGATGAAAGAGCC
GTGCTTAAACACTTCCAGTGGCCGGAGCAAAAGGCCGACGCTCTGCGTGAGGCTGCCTTCGGCTATTGCGATCTAAAGAAGCTGGAATCTGAAGCGTCATCGTTTCGTGA
TGCCCGCCAGCCCTGTAGTTCAGCTCTCAAGAAGATGCAATCTTTGCTTGAAAAATTGGAGCATGGCGTATACAATCTGTCGAGAATGCGCGAATCTGCAACTAAGAGAT
ACAAAGTGTTTAAAATTCCAGTGGAATGGATGCTTGATAGTGGAATTGTGAGTCAGATCAAACTTGTGTCTGTAAAATTAGCAATGAAGTACATGAAGAGAGTATCCGCA
GAGCTCGAGACAGTCGGTGGTGGACCTGAAGAAGAAGAGCTGATTGTCCAAGGCGTTAGGTTTGCCTTCCGTGTGCATCAGTTTGCAGGAGGGTTTGATGTGGAAACGAT
GAGGGCATTTCAAGAGCTGAGAGATAAAGCAACTACTGCTGCGTCGTTCATGGAGGTCGAGCAAGGTGGAAAATCTGCTGCTGTTAGCAGCACAGCTCCGCCTATGAAGC
TCTTGTTCGTTGAGATGGGCCAAGATGTCACGGCTGCTGCAATGCGAGCCTGTAGGGATGCCATATCTTCCAATTCGATTCCAGCATTCCGTCGAGGTATGTTCCCGTGG
GTTTCCATATCCTCACTCTATTTGTTTATCTCTGAGCAGTTATAG
Protein sequenceShow/hide protein sequence
MVAGKVKVAMGLQKSPASRKVESSPKPSTQAQPSPSSGKVPQKTVFSRSFGVYFPRSSAQVQPRPPDVTELLRMVEELRDREARLKTDLLEHKLLKESVAIVPLLENEIC
TKDAEIERASKRILFLEAENERLRVEVEELQLNIDEQRRESQERIKAMEGEIAELKKMALDRNRMELSLENDDLSASQRFQGLMEVSGKSNLIRNLKRATKCSDAVVNHD
NHKVDYHPEAKKEEVETERPRHSRCNSEELAESTLSNLKSRIPRVPKPPPKPSSSSSSSTSSSSSSSSPTGSCVDTDNAIPPPPPVPTKPMTPPPPPSSKSAPPPPPPPP
KGKRLMPAKVRRIPEVVEFYHSLMRRDSRRDPGSGITDPPSTANARDMIGEIENRSAHLLAIKTDVETQGDFIKLLIKEVENASFTDIEDVVPFVKWLDDELSYLVDERA
VLKHFQWPEQKADALREAAFGYCDLKKLESEASSFRDARQPCSSALKKMQSLLEKLEHGVYNLSRMRESATKRYKVFKIPVEWMLDSGIVSQIKLVSVKLAMKYMKRVSA
ELETVGGGPEEEELIVQGVRFAFRVHQFAGGFDVETMRAFQELRDKATTAASFMEVEQGGKSAAVSSTAPPMKLLFVEMGQDVTAAAMRACRDAISSNSIPAFRRGMFPW
VSISSLYLFISEQL