; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS023209 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS023209
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Description4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase
Genome locationscaffold78:984764..989051
RNA-Seq ExpressionMS023209
SyntenyMS023209
Gene Ontology termsGO:0016114 - terpenoid biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0050515 - 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity (molecular function)
InterPro domainsIPR004424 - 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
IPR006204 - GHMP kinase N-terminal domain
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR036554 - GHMP kinase, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK09939.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase [Cucumis melo var. makuwa]1.8e-19987.19Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCK---
        MA C+IP +SQLQF +ISFRKNFA NS GSHGS AF SR K +K       AITCNSTASKQQ EIVY+PDERINKLADEVDRDAPLSRLTLFSPCK   
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCK---

Query:  ----INVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNA
            INVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTGS+ FFWIHLDKKVPTGAGLGGGSSNA
Subjt:  ----INVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNA

Query:  ATALWAANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKN
        ATALWAAN+FSGCLATEKDLQEWS EIGSDIPFFFS+GAA+CTGRGE+VQN+PPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS +DPLSLLD+ITKN
Subjt:  ATALWAANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKN

Query:  GISQDVCINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSE
        GISQDVCINDLEPPAF+VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WYQEPAS+SACSPPSE
Subjt:  GISQDVCINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSE

Query:  HLDSAT
        H +S++
Subjt:  HLDSAT

XP_008451044.1 PREDICTED: 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X1 [Cucumis melo]1.4e-20188.72Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MA C+IP +SQLQF +ISFRKNFA NS GSHGS AF SR K +K       AITCNSTASKQQ EIVY+PDERINKLADEVDRDAPLSRLTLFSPCKINV
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTGS+ FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC
        N+FSGCLATEKDLQEWS EIGSDIPFFFS+GAA+CTGRGE+VQN+PPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS +DPLSLLD+ITKNGISQDVC
Subjt:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC

Query:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT
        INDLEPPAF+VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WYQEPAS+SACSPPSEH +S++
Subjt:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT

XP_022147682.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic [Momordica charantia]1.4e-228100Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC
        NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC
Subjt:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC

Query:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT
        INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT
Subjt:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT

XP_022933435.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like isoform X1 [Cucurbita moschata]1.1e-19888.47Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MA C I F+S+L+FP++S RKNFA +S G  GSF F SRSK++K+ LIQK AI CNSTASKQQVEIVY+ DERINKLADEVDRDAPLSRLTLFSPCKINV
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNL-IIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NL IIKALNLYRKKTG + FFWIHLDKKVPTGAGLGGGSSNAATALWA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNL-IIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWA

Query:  ANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDV
        AN+FSGCLATEKDLQ+WSSEIGSDIPFFFSEGAA+CTGRGE VQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLD+TS +DPLSLLD+ITKNGISQDV
Subjt:  ANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDV

Query:  CINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSA
        CINDLEPPAF+VLPSLKRLKQRI+SASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYND+EFQDVFLAEANFLTREANQWY+EPA+ASACS PSE  +SA
Subjt:  CINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSA

XP_022933436.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like isoform X2 [Cucurbita moschata]4.7e-20088.69Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MA C I F+S+L+FP++S RKNFA +S G  GSF F SRSK++K+ LIQK AI CNSTASKQQVEIVY+ DERINKLADEVDRDAPLSRLTLFSPCKINV
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTG + FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC
        N+FSGCLATEKDLQ+WSSEIGSDIPFFFSEGAA+CTGRGE VQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLD+TS +DPLSLLD+ITKNGISQDVC
Subjt:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC

Query:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSA
        INDLEPPAF+VLPSLKRLKQRI+SASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYND+EFQDVFLAEANFLTREANQWY+EPA+ASACS PSE  +SA
Subjt:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSA

TrEMBL top hitse value%identityAlignment
A0A1S3BQ16 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase7.0e-20288.72Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MA C+IP +SQLQF +ISFRKNFA NS GSHGS AF SR K +K       AITCNSTASKQQ EIVY+PDERINKLADEVDRDAPLSRLTLFSPCKINV
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTGS+ FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC
        N+FSGCLATEKDLQEWS EIGSDIPFFFS+GAA+CTGRGE+VQN+PPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS +DPLSLLD+ITKNGISQDVC
Subjt:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC

Query:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT
        INDLEPPAF+VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WYQEPAS+SACSPPSEH +S++
Subjt:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT

A0A5A7UKM7 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase7.0e-20288.72Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MA C+IP +SQLQF +ISFRKNFA NS GSHGS AF SR K +K       AITCNSTASKQQ EIVY+PDERINKLADEVDRDAPLSRLTLFSPCKINV
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTGS+ FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC
        N+FSGCLATEKDLQEWS EIGSDIPFFFS+GAA+CTGRGE+VQN+PPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS +DPLSLLD+ITKNGISQDVC
Subjt:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC

Query:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT
        INDLEPPAF+VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WYQEPAS+SACSPPSEH +S++
Subjt:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT

A0A5D3CI45 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase8.6e-20087.19Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCK---
        MA C+IP +SQLQF +ISFRKNFA NS GSHGS AF SR K +K       AITCNSTASKQQ EIVY+PDERINKLADEVDRDAPLSRLTLFSPCK   
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCK---

Query:  ----INVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNA
            INVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTGS+ FFWIHLDKKVPTGAGLGGGSSNA
Subjt:  ----INVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNA

Query:  ATALWAANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKN
        ATALWAAN+FSGCLATEKDLQEWS EIGSDIPFFFS+GAA+CTGRGE+VQN+PPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS +DPLSLLD+ITKN
Subjt:  ATALWAANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKN

Query:  GISQDVCINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSE
        GISQDVCINDLEPPAF+VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WYQEPAS+SACSPPSE
Subjt:  GISQDVCINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSE

Query:  HLDSAT
        H +S++
Subjt:  HLDSAT

A0A6J1D320 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase6.7e-229100Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC
        NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC
Subjt:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC

Query:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT
        INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT
Subjt:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT

A0A6J1EZ14 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase2.3e-20088.69Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MA C I F+S+L+FP++S RKNFA +S G  GSF F SRSK++K+ LIQK AI CNSTASKQQVEIVY+ DERINKLADEVDRDAPLSRLTLFSPCKINV
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTG + FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC
        N+FSGCLATEKDLQ+WSSEIGSDIPFFFSEGAA+CTGRGE VQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLD+TS +DPLSLLD+ITKNGISQDVC
Subjt:  NKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVC

Query:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSA
        INDLEPPAF+VLPSLKRLKQRI+SASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYND+EFQDVFLAEANFLTREANQWY+EPA+ASACS PSE  +SA
Subjt:  INDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSA

SwissProt top hitse value%identityAlignment
O81014 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic2.2e-16071.32Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MA  + PF S L F   SF+         S  SF          S  + +  ++ +  AS++QVEIV++PDER+NK+ D+VD++APLSRL LFSPCKINV
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWA
        FLRIT KREDG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV GVP+D +NLIIKALNLYRKKTGS  FFWIHLDKKVPTGAGLGGGSSNAATALWA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWA

Query:  ANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDV
        AN+ +G L TE +LQ+WSSEIGSDIPFFFS GAAYCTGRGE+VQ++PPP PLD+PMVLIKP+EACSTAEVYKRLRLDQTS I+PL+LL+ +T NG+SQ +
Subjt:  ANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDV

Query:  CINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSE
        C+NDLEPPAF VLPSLKRLKQRII++ RGE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +  +E
Subjt:  CINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSE

P56848 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic3.2e-15169.58Show/hide
Query:  FNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
        +NS+  F + +       +S   +GS +F  + ++ +  +I+  A   + T  + Q+E+VY+ + ++NKLADEVDR+A +SRLTLFSPCKINVFLRIT K
Subjt:  FNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK

Query:  REDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAANKFSGC
        REDG+HDLASLFHVISLGD IKFSLSPSK      TNV GVPLD+KNLIIKALNL+RKKTG++  FWIHLDKKVPTGAGLGGGSSNAATALWAAN+FSGC
Subjt:  REDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAANKFSGC

Query:  LATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVCINDLEP
        +ATEKDLQEWS EIGSDIPFFFS GAAYCTGRGEVV++IPPPVP D+ MVL+KPQEAC T EVYKRLRLDQTS IDPL LL++I+K GISQDVC+NDLEP
Subjt:  LATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVCINDLEP

Query:  PAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPAS
        PAF+V+PSLKRLKQRI +A R ++DAVFMSGSGSTIVG+GSPDPP F+Y+ DE++++F +EA F+TR ANQWY EP S
Subjt:  PAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPAS

P93841 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic (Fragment)2.2e-16075.55Show/hide
Query:  HGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKF
        HGS  F    + R++S +   A    S  SK+QVEI YNP+E+ NKLADEVDR+A LSRLTLFSPCKINVFLRIT KR+DGYHDLASLFHVISLGD IKF
Subjt:  HGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKF

Query:  SLSPSK-KDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAANKFSGCLATEKDLQEWSSEIGSDIPFFFS
        SLSPSK KDRLSTNV+GVPLD++NLIIKALNLYRKKTG++N+FWIHLDKKVPTGAGLGGGSSNAAT LWAAN+FSGC+ATEK+LQEWS EIGSDIPFFFS
Subjt:  SLSPSK-KDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAANKFSGCLATEKDLQEWSSEIGSDIPFFFS

Query:  EGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVCINDLEPPAFDVLPSLKRLKQRIISASRGE
         GAAYCTGRGEVVQ+IP P+P D+PMVLIKPQ+ACSTAEVYKR +LD +S +DPLSLL++I+ +GISQDVC+NDLEPPAF+VLPSLKRLKQR+I+A RG+
Subjt:  EGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVCINDLEPPAFDVLPSLKRLKQRIISASRGE

Query:  FDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSE
        +DAVFMSGSGSTIVG+GSPDPP F+Y+D+E++DVFL+EA+F+TR AN+WY EP S S      E
Subjt:  FDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSE

Q6MAT6 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase1.4e-5843.1Show/hide
Query:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGG
        + LFSP KIN+FL++  KR DGYH+L+SLF  IS GD + F       D L+ +   +P DD NL++KA+ L+R KTG +    IHLDK++P+ AGLGGG
Subjt:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGG

Query:  SSNAATALWAANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQ
        SSNAAT LWA N+ +G + T ++L +W SEIG+DIPFFFS+G A+CTGRGE V ++ P     +   ++KP    ST EVYK L   Q +        + 
Subjt:  SSNAATALWAANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQ

Query:  ITKNGISQDVCINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWY
               +    NDLE  AF++ P LK LK  ++S+    FD V MSGSGS+   IG    P                A F+ R +N+WY
Subjt:  ITKNGISQDVCINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWY

Q8S2G0 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic1.7e-14973.96Show/hide
Query:  ITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDD
        +  ++   ++QVE+ Y+   + NKLAD++D++A ++RL LFSPCKINVFLRIT KR DG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV+GVP+D+
Subjt:  ITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDD

Query:  KNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPL
         NLIIKALNLYRKKTG++NFFWIHLDKKVPTGAGLGGGSSNAATALWAAN+FSGC+A+EK+LQEWS EIGSDIPFFFS+GAAYCTGRGE+V++I  P+P 
Subjt:  KNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPL

Query:  DVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVCINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPP
        ++PMVL+KP EACSTAEVYKRLRL+ TS  DPL LL +IT+NGISQD C+NDLEPPAF+VLPSLKRLK+RII+A+RG++DAVFMSGSGSTIVGIGSPDPP
Subjt:  DVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVCINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPP

Query:  GFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACS
         F+Y+DD+++D F++EA FLTR  N+WY+EP S+   S
Subjt:  GFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACS

Arabidopsis top hitse value%identityAlignment
AT2G26930.1 4-(cytidine 5'-phospho)-2-C-methyl-D-erithritol kinase1.6e-16171.32Show/hide
Query:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MA  + PF S L F   SF+         S  SF          S  + +  ++ +  AS++QVEIV++PDER+NK+ D+VD++APLSRL LFSPCKINV
Subjt:  MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWA
        FLRIT KREDG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV GVP+D +NLIIKALNLYRKKTGS  FFWIHLDKKVPTGAGLGGGSSNAATALWA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWA

Query:  ANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDV
        AN+ +G L TE +LQ+WSSEIGSDIPFFFS GAAYCTGRGE+VQ++PPP PLD+PMVLIKP+EACSTAEVYKRLRLDQTS I+PL+LL+ +T NG+SQ +
Subjt:  ANKFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDV

Query:  CINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSE
        C+NDLEPPAF VLPSLKRLKQRII++ RGE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +  +E
Subjt:  CINDLEPPAFDVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCTGTACTATCCCTTTCAATTCGCAGCTCCAATTTCCTGCCATTTCGTTTAGGAAGAATTTCGCCTGCAATTCTGTTGGGTCTCACGGTTCGTTCGCTTTTCC
CTCGAGGTCGAAGAATCGGAAGAGTTCACTCATCCAGAAGACCGCCATAACATGTAATTCCACCGCTTCCAAACAACAAGTCGAGATAGTTTATAATCCTGATGAAAGGA
TAAACAAGTTGGCTGATGAAGTGGACCGGGATGCTCCTCTTTCGAGGCTCACTCTGTTCTCGCCTTGCAAGATTAATGTTTTCTTGAGAATAACTAAGAAGAGGGAAGAT
GGATATCATGATTTGGCATCTCTTTTTCATGTGATAAGCTTAGGGGATACAATTAAATTTTCGCTGTCGCCATCGAAGAAGGACCGTCTTTCTACCAATGTATCTGGGGT
ACCCCTTGATGATAAAAATTTGATCATAAAGGCTCTTAACCTTTACAGGAAAAAGACTGGCAGTGAGAATTTTTTCTGGATCCATCTCGACAAAAAGGTACCAACTGGAG
CGGGGCTTGGCGGAGGAAGCAGTAATGCTGCAACCGCGCTGTGGGCTGCCAATAAGTTCAGTGGATGTCTTGCTACTGAGAAAGATCTTCAAGAATGGTCAAGTGAAATA
GGATCTGATATTCCCTTCTTCTTTTCCGAAGGGGCAGCCTACTGCACCGGGCGAGGTGAAGTTGTACAGAATATTCCACCTCCAGTACCCTTGGACGTTCCAATGGTTCT
TATAAAGCCCCAGGAAGCATGCTCTACTGCTGAAGTTTATAAGCGCCTACGGCTGGATCAAACAAGCGGGATCGATCCTTTATCGTTGTTGGATCAAATCACAAAGAATG
GAATATCCCAAGATGTGTGTATCAATGATTTGGAACCTCCGGCTTTTGATGTCCTCCCGTCTCTTAAAAGGTTGAAACAGCGTATAATCTCTGCCAGCCGTGGAGAGTTT
GATGCCGTTTTTATGTCTGGGAGTGGTAGCACAATTGTAGGGATCGGGTCCCCAGATCCTCCAGGCTTCATATATAATGATGATGAATTCCAGGATGTATTTTTGGCAGA
GGCGAACTTTCTCACTCGTGAAGCTAATCAATGGTATCAAGAACCCGCTTCGGCATCAGCTTGTAGTCCGCCCTCCGAGCATCTGGATTCGGCTACA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTCTGTACTATCCCTTTCAATTCGCAGCTCCAATTTCCTGCCATTTCGTTTAGGAAGAATTTCGCCTGCAATTCTGTTGGGTCTCACGGTTCGTTCGCTTTTCC
CTCGAGGTCGAAGAATCGGAAGAGTTCACTCATCCAGAAGACCGCCATAACATGTAATTCCACCGCTTCCAAACAACAAGTCGAGATAGTTTATAATCCTGATGAAAGGA
TAAACAAGTTGGCTGATGAAGTGGACCGGGATGCTCCTCTTTCGAGGCTCACTCTGTTCTCGCCTTGCAAGATTAATGTTTTCTTGAGAATAACTAAGAAGAGGGAAGAT
GGATATCATGATTTGGCATCTCTTTTTCATGTGATAAGCTTAGGGGATACAATTAAATTTTCGCTGTCGCCATCGAAGAAGGACCGTCTTTCTACCAATGTATCTGGGGT
ACCCCTTGATGATAAAAATTTGATCATAAAGGCTCTTAACCTTTACAGGAAAAAGACTGGCAGTGAGAATTTTTTCTGGATCCATCTCGACAAAAAGGTACCAACTGGAG
CGGGGCTTGGCGGAGGAAGCAGTAATGCTGCAACCGCGCTGTGGGCTGCCAATAAGTTCAGTGGATGTCTTGCTACTGAGAAAGATCTTCAAGAATGGTCAAGTGAAATA
GGATCTGATATTCCCTTCTTCTTTTCCGAAGGGGCAGCCTACTGCACCGGGCGAGGTGAAGTTGTACAGAATATTCCACCTCCAGTACCCTTGGACGTTCCAATGGTTCT
TATAAAGCCCCAGGAAGCATGCTCTACTGCTGAAGTTTATAAGCGCCTACGGCTGGATCAAACAAGCGGGATCGATCCTTTATCGTTGTTGGATCAAATCACAAAGAATG
GAATATCCCAAGATGTGTGTATCAATGATTTGGAACCTCCGGCTTTTGATGTCCTCCCGTCTCTTAAAAGGTTGAAACAGCGTATAATCTCTGCCAGCCGTGGAGAGTTT
GATGCCGTTTTTATGTCTGGGAGTGGTAGCACAATTGTAGGGATCGGGTCCCCAGATCCTCCAGGCTTCATATATAATGATGATGAATTCCAGGATGTATTTTTGGCAGA
GGCGAACTTTCTCACTCGTGAAGCTAATCAATGGTATCAAGAACCCGCTTCGGCATCAGCTTGTAGTCCGCCCTCCGAGCATCTGGATTCGGCTACA
Protein sequenceShow/hide protein sequence
MAFCTIPFNSQLQFPAISFRKNFACNSVGSHGSFAFPSRSKNRKSSLIQKTAITCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKRED
GYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDKNLIIKALNLYRKKTGSENFFWIHLDKKVPTGAGLGGGSSNAATALWAANKFSGCLATEKDLQEWSSEI
GSDIPFFFSEGAAYCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSGIDPLSLLDQITKNGISQDVCINDLEPPAFDVLPSLKRLKQRIISASRGEF
DAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYQEPASASACSPPSEHLDSAT