CuGenDBv2

Tan0020696 (gene) of Snake gourd v1 genome

Gene ID: Tan0020696
Organism: Trichosanthes anguina (Snake gourd v1)
Description: 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase
Genome location: LG09:61787554..61792485
RNA-Seq Expression: Tan0020696
Synteny: Tan0020696
Gene Ontology terms:
GO:0016114 - terpenoid biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0050515 - 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity (molecular function)
InterPro domains:
IPR004424 - 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
IPR006204 - GHMP kinase N-terminal domain
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR036554 - GHMP kinase, C-terminal domain superfamily
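The genome location field in this record uses a `linkage group:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (common for GFF-style annotation, but not stated on this page), the span covered by the gene could be computed as follows. `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("LG09:61787554..61792485")
print(chrom, length)
```

Under that assumption the locus spans more bases than the coding sequence listed further down, which would be consistent with the gene containing introns.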


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7033454.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 8.2e-205 | identity: 90.68%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNIP  S  +F SISFR+N  F+  GPHGS  FA   K+QKNPL+Q+V  CNS ASKQQ EIVY+PDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQN+PPPV LDVPMVLIKPQEACSTAEVYKRLRLDQTS +DPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA
        NDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFL EANFLTREANQWYREPASASA SPPSE PE A
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA

XP_022147682.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic [Momordica charantia] | e-value: 6.7e-207 | identity: 90.98%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQK-VIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MA C IPF SQL+FP+ISFRKN A +++G HGSF F  RSK +K+ L+QK  I CNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQK-VIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTGS+ FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC
        N+FSGCLATEKDLQEWSSEIGSDIPFFFSEGAA+CTGRGEVVQNIPPPV LDVPMVLIKPQEACSTAEVYKRLRLDQTS IDPLSLLD+ITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESAT
        INDLEPPAF+VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFL EANFLTREANQWY+EPASASACSPPSE  +SAT
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESAT

XP_022933435.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like isoform X1 [Cucurbita moschata] | e-value: 3.3e-206 | identity: 91.21%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNI F S+L FPS+S RKN A  + GP GSF FA RSK+QKN L+QK I+CNSTASKQQVEIVY+ DERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL IIKALNLYRKKTG DQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC
        NQFSGCLATEKDLQ+WSSEIGSDIPFFFSEGAAFCTGRGE VQNIPPPV LDVPMVLIKPQEACSTAEVYKRLRLD+TS++DPLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA
        INDLEPPAFEVLPSLKRLKQRI+SASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYND+EFQDVFL EANFLTREANQWYREPA+ASACS PSER ESA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA

XP_022933436.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like isoform X2 [Cucurbita moschata] | e-value: 1.3e-207 | identity: 91.44%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNI F S+L FPS+S RKN A  + GP GSF FA RSK+QKN L+QK I+CNSTASKQQVEIVY+ DERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTG DQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQ+WSSEIGSDIPFFFSEGAAFCTGRGE VQNIPPPV LDVPMVLIKPQEACSTAEVYKRLRLD+TS++DPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA
        NDLEPPAFEVLPSLKRLKQRI+SASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYND+EFQDVFL EANFLTREANQWYREPA+ASACS PSER ESA
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA

XP_023515762.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic [Cucurbita pepo subsp. pepo] | e-value: 8.2e-205 | identity: 90.43%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNIP  S  +F SISFR+N  F+  GPHGS  FA   K+QKNPL+Q+V  CNS ASKQQ EIVY+PDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI
        QFSGC+ATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQN+PPPV LD+PMVLIKPQEACSTAEVYKRLRLDQTS +DPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA
        NDLEPPAFEVLPSLKRLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFL EANFLTREANQWYREPASASA SPPSE PE A
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1D320 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | e-value: 3.2e-207 | identity: 90.98%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQK-VIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
        MA C IPF SQL+FP+ISFRKN A +++G HGSF F  RSK +K+ L+QK  I CNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQK-VIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTGS+ FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC
        N+FSGCLATEKDLQEWSSEIGSDIPFFFSEGAA+CTGRGEVVQNIPPPV LDVPMVLIKPQEACSTAEVYKRLRLDQTS IDPLSLLD+ITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESAT
        INDLEPPAF+VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFL EANFLTREANQWY+EPASASACSPPSE  +SAT
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESAT

A0A6J1EZ14 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | e-value: 6.5e-208 | identity: 91.44%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNI F S+L FPS+S RKN A  + GP GSF FA RSK+QKN L+QK I+CNSTASKQQVEIVY+ DERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTG DQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQ+WSSEIGSDIPFFFSEGAAFCTGRGE VQNIPPPV LDVPMVLIKPQEACSTAEVYKRLRLD+TS++DPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA
        NDLEPPAFEVLPSLKRLKQRI+SASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYND+EFQDVFL EANFLTREANQWYREPA+ASACS PSER ESA
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA

A0A6J1F4W1 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | e-value: 1.6e-206 | identity: 91.21%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNI F S+L FPS+S RKN A  + GP GSF FA RSK+QKN L+QK I+CNSTASKQQVEIVY+ DERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL IIKALNLYRKKTG DQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC
        NQFSGCLATEKDLQ+WSSEIGSDIPFFFSEGAAFCTGRGE VQNIPPPV LDVPMVLIKPQEACSTAEVYKRLRLD+TS++DPLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA
        INDLEPPAFEVLPSLKRLKQRI+SASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYND+EFQDVFL EANFLTREANQWYREPA+ASACS PSER ESA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA

A0A6J1HBW0 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | e-value: 6.8e-205 | identity: 90.43%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNIP  S  +F SISFR+N  F+  GPHGS  FA   K+QKNPL+Q+V  CNS ASKQQ EIVY+PDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI
        QFSGC+ATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQN+PPPV LDVPMVLIKPQEACSTAEVYKRLRLDQTS +DPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA
        NDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFL EANFLTREANQWYREPASASA SPPSE PE A
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA

A0A6J1HR79 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | e-value: 3.4e-204 | identity: 89.92%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MAS NI F S+L FPS+S RKN A  + GP GSF FA RSK+QKNPL+QK I+CNSTASKQQVEIVY+ DERINKLA+EVDR+APLSRLTLFSPCKINVF
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRITKKREDGYHDLASLFHVI+LGDTIKFSLSP KKDRLSTNVSGVPLDDRNLIIKALNLYRKKTG DQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGE VQNIPPPV LDVPMVLIKPQEACSTAEVYKRLRLD+TS++DPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA
        NDLEPPAFEVLPSLKRLKQRI+SASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYND+EF DVFL EANFLTREANQWYREPA+ASACSP SE  +SA
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESA

SwissProt top hits (e-value, % identity, alignment)
O81014 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic | e-value: 6.8e-162 | identity: 71.5%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MA+ + PF S L F                H SF  +  S +    LL+ ++  +  AS++QVEIV++PDER+NK+ D+VD++APLSRL LFSPCKINVF
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRIT KREDG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS++FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC
        N+ +G L TE +LQ+WSSEIGSDIPFFFS GAA+CTGRGE+VQ++PPP  LD+PMVLIKP+EACSTAEVYKRLRLDQTSNI+PL+LL+ +T NG+SQ +C
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSE
        +NDLEPPAF VLPSLKRLKQRII++ RGE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +  +E
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSE

P56848 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic | e-value: 7.6e-153 | identity: 70.37%
Query:  FGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPL-LQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
        + S+  F S +      F +  P+GS  F  R K Q + + + +    + T  + Q+E+VY+ + ++NKLADEVDR+A +SRLTLFSPCKINVFLRIT K
Subjt:  FGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPL-LQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK

Query:  REDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC
        REDG+HDLASLFHVISLGD IKFSLSPSK      TNV GVPLD++NLIIKALNL+RKKTG+D+ FWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC
Subjt:  REDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC

Query:  LATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCINDLEP
        +ATEKDLQEWS EIGSDIPFFFS GAA+CTGRGEVV++IPPPV  D+ MVL+KPQEAC T EVYKRLRLDQTS+IDPL LL+KI+K GISQDVC+NDLEP
Subjt:  LATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCINDLEP

Query:  PAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPAS
        PAFEV+PSLKRLKQRI +A R ++DAVFMSGSGSTIVG+GSPDPP F+Y+ DE++++F +EA F+TR ANQWY EP S
Subjt:  PAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPAS

P93841 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic (Fragment) | e-value: 2.3e-162 | identity: 75.2%
Query:  PHGSFGFALRSKYQKNPLLQKVIKCN-STASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIK
        PHGS  F    ++++N  +  ++K + S  SK+QVEI YNP+E+ NKLADEVDR+A LSRLTLFSPCKINVFLRIT KR+DGYHDLASLFHVISLGD IK
Subjt:  PHGSFGFALRSKYQKNPLLQKVIKCN-STASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIK

Query:  FSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFF
        FSLSPSK KDRLSTNV+GVPLD+RNLIIKALNLYRKKTG+D +FWIHLDKKVPTGAGLGGGSSNAAT LWAANQFSGC+ATEK+LQEWS EIGSDIPFFF
Subjt:  FSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFF

Query:  SEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRG
        S GAA+CTGRGEVVQ+IP P+  D+PMVLIKPQ+ACSTAEVYKR +LD +S +DPLSLL+KI+ +GISQDVC+NDLEPPAFEVLPSLKRLKQR+I+A RG
Subjt:  SEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRG

Query:  EFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESAT
        ++DAVFMSGSGSTIVG+GSPDPP F+Y+D+E++DVFL+EA+F+TR AN+WY EP S S      ++PE +T
Subjt:  EFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESAT

Q6MAT6 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase | e-value: 2.0e-60 | identity: 45.21%
Query:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGG
        + LFSP KIN+FL++  KR DGYH+L+SLF  IS GD + F       D L+ +   +P DD NL++KA+ L+R KTG D    IHLDK++P+ AGLGGG
Subjt:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGG

Query:  SSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQ--TSNIDPLSLL
        SSNAAT LWA NQ +G + T ++L +W SEIG+DIPFFFS+G A CTGRGE V ++ P     +   ++KP    ST EVYK L   Q   +N D  S  
Subjt:  SSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQ--TSNIDPLSLL

Query:  DKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWY
        +K            NDLE  AFE+ P LK LK  ++S+    FD V MSGSGS+   IG    P                A F+ R +N+WY
Subjt:  DKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWY

Q8S2G0 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic | e-value: 1.7e-149 | identity: 72.99%
Query:  IKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDD
        +  ++   ++QVE+ Y+   + NKLAD++D++A ++RL LFSPCKINVFLRIT KR DG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV+GVP+D+
Subjt:  IKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDD

Query:  RNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSL
         NLIIKALNLYRKKTG+D FFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+A+EK+LQEWS EIGSDIPFFFS+GAA+CTGRGE+V++I  P+  
Subjt:  RNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSL

Query:  DVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPP
        ++PMVL+KP EACSTAEVYKRLRL+ TS  DPL LL +IT+NGISQD C+NDLEPPAFEVLPSLKRLK+RII+A+RG++DAVFMSGSGSTIVGIGSPDPP
Subjt:  DVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPP

Query:  GFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESAT
         F+Y+DD+++D F++EA FLTR  N+WYREP S+   S     PE A+
Subjt:  GFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESAT

Arabidopsis top hits (e-value, % identity, alignment)
AT2G26930.1 4-(cytidine 5'-phospho)-2-C-methyl-D-erithritol kinase | e-value: 4.8e-163 | identity: 71.5%
Query:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MA+ + PF S L F                H SF  +  S +    LL+ ++  +  AS++QVEIV++PDER+NK+ D+VD++APLSRL LFSPCKINVF
Subjt:  MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRIT KREDG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS++FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC
        N+ +G L TE +LQ+WSSEIGSDIPFFFS GAA+CTGRGE+VQ++PPP  LD+PMVLIKP+EACSTAEVYKRLRLDQTSNI+PL+LL+ +T NG+SQ +C
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSE
        +NDLEPPAF VLPSLKRLKQRII++ RGE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +  +E
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSE


Sequences
CDS sequence
ATGGCGTCCTGTAACATCCCTTTTGGTTCACAGCTCGAATTTCCTTCCATTTCGTTTAGAAAGAATTTGGCCTTCCATGCTCTTGGGCCTCACGGGTCTTTCGGGTTTGC
CTTGAGGTCGAAATACCAGAAGAATCCACTCCTTCAGAAGGTCATAAAATGCAATTCCACAGCTTCCAAACAACAAGTTGAGATAGTTTATAATCCTGATGAAAGGATAA
ACAAGTTAGCTGATGAAGTGGACCGGGATGCTCCTCTTTCAAGGCTTACTCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACTAAGAAGAGAGAAGATGGA
TATCATGATTTGGCATCTCTCTTTCATGTGATAAGCTTAGGGGATACAATTAAATTCTCTTTGTCGCCATCAAAGAAGGATCGTCTTTCTACCAATGTATCAGGGGTACC
ACTTGATGATAGAAATTTGATAATCAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACCAATTTTTCTGGATTCATCTCGACAAAAAGGTACCGACTGGAGCTG
GGCTTGGTGGAGGAAGCAGTAATGCTGCAACTGCACTGTGGGCAGCCAATCAGTTCAGTGGATGTCTTGCTACTGAAAAGGACCTTCAAGAATGGTCCAGTGAGATAGGA
TCCGATATTCCCTTCTTTTTCTCGGAAGGGGCGGCCTTCTGCACCGGGAGAGGTGAGGTTGTACAAAATATTCCACCGCCTGTATCCTTGGACGTTCCGATGGTTCTCAT
AAAGCCCCAGGAAGCATGCTCTACAGCAGAAGTTTATAAGCGCTTACGATTGGATCAAACAAGCAACATCGATCCTTTATCATTGCTGGATAAAATCACAAAGAATGGAA
TATCTCAAGATGTGTGTATCAACGATTTGGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAAAGATTGAAACAGCGTATAATCTCTGCCAGCCGTGGAGAGTTCGAT
GCTGTTTTTATGTCCGGGAGTGGTAGCACAATAGTAGGGATTGGGTCCCCAGATCCTCCAGGCTTCATATATAATGACGATGAATTCCAGGACGTATTTTTGACAGAGGC
CAACTTTCTCACTCGTGAAGCAAATCAATGGTATCGAGAACCCGCTTCGGCATCCGCTTGTAGTCCGCCTTCCGAGCGTCCCGAATCAGCTACATAA
mRNA sequence
ATGGCGTCCTGTAACATCCCTTTTGGTTCACAGCTCGAATTTCCTTCCATTTCGTTTAGAAAGAATTTGGCCTTCCATGCTCTTGGGCCTCACGGGTCTTTCGGGTTTGC
CTTGAGGTCGAAATACCAGAAGAATCCACTCCTTCAGAAGGTCATAAAATGCAATTCCACAGCTTCCAAACAACAAGTTGAGATAGTTTATAATCCTGATGAAAGGATAA
ACAAGTTAGCTGATGAAGTGGACCGGGATGCTCCTCTTTCAAGGCTTACTCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACTAAGAAGAGAGAAGATGGA
TATCATGATTTGGCATCTCTCTTTCATGTGATAAGCTTAGGGGATACAATTAAATTCTCTTTGTCGCCATCAAAGAAGGATCGTCTTTCTACCAATGTATCAGGGGTACC
ACTTGATGATAGAAATTTGATAATCAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACCAATTTTTCTGGATTCATCTCGACAAAAAGGTACCGACTGGAGCTG
GGCTTGGTGGAGGAAGCAGTAATGCTGCAACTGCACTGTGGGCAGCCAATCAGTTCAGTGGATGTCTTGCTACTGAAAAGGACCTTCAAGAATGGTCCAGTGAGATAGGA
TCCGATATTCCCTTCTTTTTCTCGGAAGGGGCGGCCTTCTGCACCGGGAGAGGTGAGGTTGTACAAAATATTCCACCGCCTGTATCCTTGGACGTTCCGATGGTTCTCAT
AAAGCCCCAGGAAGCATGCTCTACAGCAGAAGTTTATAAGCGCTTACGATTGGATCAAACAAGCAACATCGATCCTTTATCATTGCTGGATAAAATCACAAAGAATGGAA
TATCTCAAGATGTGTGTATCAACGATTTGGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAAAGATTGAAACAGCGTATAATCTCTGCCAGCCGTGGAGAGTTCGAT
GCTGTTTTTATGTCCGGGAGTGGTAGCACAATAGTAGGGATTGGGTCCCCAGATCCTCCAGGCTTCATATATAATGACGATGAATTCCAGGACGTATTTTTGACAGAGGC
CAACTTTCTCACTCGTGAAGCAAATCAATGGTATCGAGAACCCGCTTCGGCATCCGCTTGTAGTCCGCCTTCCGAGCGTCCCGAATCAGCTACATAA
Protein sequence
MASCNIPFGSQLEFPSISFRKNLAFHALGPHGSFGFALRSKYQKNPLLQKVIKCNSTASKQQVEIVYNPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDG
YHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIG
SDIPFFFSEGAAFCTGRGEVVQNIPPPVSLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFD
AVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLTEANFLTREANQWYREPASASACSPPSERPESAT
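The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1). `translate` is a hypothetical helper written for illustration, and the two short inputs are the first nine and last five codons of the CDS shown in this record.

```python
BASES = "TCAG"
# Amino acids for the standard genetic code, ordered by codon with bases in T, C, A, G order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGGCGTCCTGTAACATCCCTTTTGGT"))  # start of the CDS
print(translate("GAATCAGCTACATAA"))              # end of the CDS, including the stop codon
```

Pasting the full CDS into `translate` and removing the trailing `*` should reproduce the protein sequence if the two listings are consistent.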