CuGenDBv2

Sgr000689 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr000689
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase
Genome location: tig00000447:4814..11348
RNA-Seq Expression: Sgr000689
Synteny: Sgr000689
Gene Ontology terms:
GO:0016114 - terpenoid biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0050515 - 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity (molecular function)
InterPro domains:
IPR004424 - 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
IPR006204 - GHMP kinase N-terminal domain
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR036554 - GHMP kinase, C-terminal domain superfamily
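
The genome location field uses a `contig:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span of the gene model. It assumes the coordinates are 1-based and inclusive, which this page does not state; `Location` and `parse_location` are hypothetical names chosen for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval on a named contig."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(contig, start, end)


loc = parse_location("tig00000447:4814..11348")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the span covers the whole gene model on the contig, introns included, so it is expected to be longer than the CDS.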


Homology
GenBank top hits | e-value | % identity | Alignment
KAG7033454.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic [Cucurbita argyrosperma subsp. argyrosperma] | 4.2e-190 | 91.58
Query:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV
        MASCNIPC+S  QF SISFR+NFVFN  GPHGS AFAS  KH +N L+Q+   TCNS  SKQQ EI YDPDERINKLADEVDR APLSRLTLFSPCKINV
Subjt:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC
        NQFSGCLATEKDLQEWSSEIGSDI FFFSEGAAFCTGRGEVVQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKV+PLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
        INDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
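
In each alignment block, the unlabeled middle line appears to follow the usual BLAST convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks any other mismatch or a gap. A minimal sketch, assuming that convention, counts identities and positives for the final block of the alignment above. `midline_stats` is a hypothetical helper, and the per-block figure need not equal the % identity in the hit table, which is computed over the whole alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, length) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # An identity is a position where the midline repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA"
midline = "INDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA"

identities, positives, length = midline_stats(query, midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```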

XP_008451044.1 PREDICTED: 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X1 [Cucumis melo] | 3.6e-189 | 91.3
Query:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV
        MASC+IPC+SQLQF SISFRKNF FNS G HGS AFASR K       Q+ AITCNST SKQQ EI YDPDERINKLADEVDR APLSRLTLFSPCKINV
Subjt:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC
        NQFSGCLATEKDLQEWS EIGSDI FFFS+GAAFCTGRGE+VQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKV+PLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
        INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA

XP_022147682.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic [Momordica charantia] | 6.9e-193 | 92.12
Query:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV
        MA C IP NSQLQFP+ISFRKNF  NS+G HGSFAF SRSK+ ++SLIQK AITCNST SKQQVEI Y+PDERINKLADEVDR APLSRLTLFSPCKINV
Subjt:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTGS+NFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC
        N+FSGCLATEKDLQEWSSEIGSDI FFFSEGAA+CTGRGEVVQNIPPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTS ++PLSLLD+ITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
        INDLEPPAF+VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA

XP_022960704.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like [Cucurbita moschata] | 7.2e-190 | 91.3
Query:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV
        MASCNIPC+S  QF SISFR+NFVFN  GPHGS AFAS  KH +N L+Q+   TCNS  SKQQ EI YDPDERINKLADEVDR APLSRLTLFSPCKINV
Subjt:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC
        NQFSGC+ATEKDLQEWSSEIGSDI FFFSEGAAFCTGRGEVVQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKV+PLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
        INDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA

XP_023515762.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic [Cucurbita pepo subsp. pepo] | 2.5e-190 | 91.85
Query:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV
        MASCNIPC+S  QF SISFR+NFVFN  GPHGS AFAS  KH +N L+Q+   TCNS  SKQQ EI YDPDERINKLADEVDR APLSRLTLFSPCKINV
Subjt:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC
        NQFSGC+ATEKDLQEWSSEIGSDI FFFSEGAAFCTGRGEVVQN+PPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKV+PLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
        INDLEPPAFEVLPSLKRLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA

TrEMBL top hits | e-value | % identity | Alignment
A0A1S3BQ16 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | 1.7e-189 | 91.3
Query:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV
        MASC+IPC+SQLQF SISFRKNF FNS G HGS AFASR K       Q+ AITCNST SKQQ EI YDPDERINKLADEVDR APLSRLTLFSPCKINV
Subjt:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC
        NQFSGCLATEKDLQEWS EIGSDI FFFS+GAAFCTGRGE+VQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKV+PLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
        INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA

A0A5A7UKM7 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | 1.7e-189 | 91.3
Query:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV
        MASC+IPC+SQLQF SISFRKNF FNS G HGS AFASR K       Q+ AITCNST SKQQ EI YDPDERINKLADEVDR APLSRLTLFSPCKINV
Subjt:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC
        NQFSGCLATEKDLQEWS EIGSDI FFFS+GAAFCTGRGE+VQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKV+PLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
        INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA

A0A6J1D320 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | 3.4e-193 | 92.12
Query:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV
        MA C IP NSQLQFP+ISFRKNF  NS+G HGSFAF SRSK+ ++SLIQK AITCNST SKQQVEI Y+PDERINKLADEVDR APLSRLTLFSPCKINV
Subjt:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTGS+NFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC
        N+FSGCLATEKDLQEWSSEIGSDI FFFSEGAA+CTGRGEVVQNIPPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTS ++PLSLLD+ITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
        INDLEPPAF+VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA

A0A6J1EZ14 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | 3.3e-188 | 91.03
Query:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV
        MASCNI  +S+L+FPS+S RKNF  +S GP GSF FASRSKH +N LIQK AI CNST SKQQVEI YD DERINKLADEVDR APLSRLTLFSPCKINV
Subjt:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTG D FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC
        NQFSGCLATEKDLQ+WSSEIGSDI FFFSEGAAFCTGRGE VQNIPPPVPLD+PMVLIKPQEACSTAEVYKRLRLD+TS V+PLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
        INDLEPPAFEVLPSLKRLKQRI+SASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYND+EFQDVFLAEA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA

A0A6J1HBW0 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | 3.5e-190 | 91.3
Query:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV
        MASCNIPC+S  QF SISFR+NFVFN  GPHGS AFAS  KH +N L+Q+   TCNS  SKQQ EI YDPDERINKLADEVDR APLSRLTLFSPCKINV
Subjt:  MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC
        NQFSGC+ATEKDLQEWSSEIGSDI FFFSEGAAFCTGRGEVVQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKV+PLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
        INDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA

SwissProt top hits | e-value | % identity | Alignment
O81014 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic | 8.3e-149 | 71.2
Query:  HGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKF
        H SF  +S S    +  + +  ++ +   S++QVEI +DPDER+NK+ D+VD+ APLSRL LFSPCKINVFLRIT KREDG+HDLASLFHVISLGDTIKF
Subjt:  HGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKF

Query:  SLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIAFFFS
        SLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS+ FFWIHLDKKVPTGAGLGGGSSNAATALWAAN+ +G L TE +LQ+WSSEIGSDI FFFS
Subjt:  SLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIAFFFS

Query:  EGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGE
         GAA+CTGRGE+VQ++PPP PLD+PMVLIKP+EACSTAEVYKRLRLDQTS +NPL+LL+ +T NG+SQ +C+NDLEPPAF VLPSLKRLKQRII++ RGE
Subjt:  EGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGE

Query:  FDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEATPPFCNPAIEPLLKLAGTDATNGCASTRV
        +DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EA       A E   + A  +AT   A +R+
Subjt:  FDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEATPPFCNPAIEPLLKLAGTDATNGCASTRV

P56848 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic | 1.5e-145 | 71.19
Query:  NSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINVFLRITKKR
        NS+  F S +      F+S  P+GS +F  + +  R  +I+ AA   + TT + Q+E+ YD + ++NKLADEVDR A +SRLTLFSPCKINVFLRIT KR
Subjt:  NSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINVFLRITKKR

Query:  EDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
        EDG+HDLASLFHVISLGD IKFSLSPSK      TNV GVPLD++NLIIKALNL+RKKTG+D  FWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+
Subjt:  EDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVCINDLEPP
        ATEKDLQEWS EIGSDI FFFS GAA+CTGRGEVV++IPPPVP D+ MVL+KPQEAC T EVYKRLRLDQTS ++PL LL+KI+K GISQDVC+NDLEPP
Subjt:  ATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA
        AFEV+PSLKRLKQRI +A R ++DAVFMSGSGSTIVG+GSPDPP F+Y+ DE++++F +EA
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEA

P93841 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic (Fragment) | 4.6e-155 | 76.6
Query:  PSISFRKNFVFN---SLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINVFLRITKKREDG
        P +   K  VF    S  PHGS  F    +  RNS +   A    S TSK+QVEI Y+P+E+ NKLADEVDR A LSRLTLFSPCKINVFLRIT KR+DG
Subjt:  PSISFRKNFVFN---SLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINVFLRITKKREDG

Query:  YHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATE
        YHDLASLFHVISLGD IKFSLSPSK KDRLSTNV+GVPLD+RNLIIKALNLYRKKTG+DN+FWIHLDKKVPTGAGLGGGSSNAAT LWAANQFSGC+ATE
Subjt:  YHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATE

Query:  KDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVCINDLEPPAFE
        K+LQEWS EIGSDI FFFS GAA+CTGRGEVVQ+IP P+P DIPMVLIKPQ+ACSTAEVYKR +LD +SKV+PLSLL+KI+ +GISQDVC+NDLEPPAFE
Subjt:  KDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVCINDLEPPAFE

Query:  VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEAT
        VLPSLKRLKQR+I+A RG++DAVFMSGSGSTIVG+GSPDPP F+Y+D+E++DVFL+EA+
Subjt:  VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEAT

Q6MAT6 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase | 7.5e-57 | 46.95
Query:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGG
        + LFSP KIN+FL++  KR DGYH+L+SLF  IS GD + F       D L+ +   +P DD NL++KA+ L+R KTG D    IHLDK++P+ AGLGGG
Subjt:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGG

Query:  SSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDK
        SSNAAT LWA NQ +G + T ++L +W SEIG+DI FFFS+G A CTGRGE V ++ P     I   ++KP    ST EVYK L   Q ++ N       
Subjt:  SSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDK

Query:  ITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPP
               +    NDLE  AFE+ P LK LK  ++S+    FD V MSGSGS+   IG    P
Subjt:  ITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPP

Q8S2G0 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic | 1.5e-142 | 75.71
Query:  ITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDD
        +  ++   ++QVE+ YD   + NKLAD++D++A ++RL LFSPCKINVFLRIT KR DG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV+GVP+D+
Subjt:  ITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDD

Query:  RNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPL
         NLIIKALNLYRKKTG+DNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+A+EK+LQEWS EIGSDI FFFS+GAA+CTGRGE+V++I  P+P 
Subjt:  RNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIAFFFSEGAAFCTGRGEVVQNIPPPVPL

Query:  DIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPP
        ++PMVL+KP EACSTAEVYKRLRL+ TS+ +PL LL +IT+NGISQD C+NDLEPPAFEVLPSLKRLK+RII+A+RG++DAVFMSGSGSTIVGIGSPDPP
Subjt:  DIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPP

Query:  GFIYNDDEFQDVFLAEA
         F+Y+DD+++D F++EA
Subjt:  GFIYNDDEFQDVFLAEA

Arabidopsis top hits | e-value | % identity | Alignment
AT2G26930.1 4-(cytidine 5'-phospho)-2-C-methyl-D-erythritol kinase | 5.9e-150 | 71.2
Query:  HGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKF
        H SF  +S S    +  + +  ++ +   S++QVEI +DPDER+NK+ D+VD+ APLSRL LFSPCKINVFLRIT KREDG+HDLASLFHVISLGDTIKF
Subjt:  HGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKF

Query:  SLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIAFFFS
        SLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS+ FFWIHLDKKVPTGAGLGGGSSNAATALWAAN+ +G L TE +LQ+WSSEIGSDI FFFS
Subjt:  SLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIAFFFS

Query:  EGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGE
         GAA+CTGRGE+VQ++PPP PLD+PMVLIKP+EACSTAEVYKRLRLDQTS +NPL+LL+ +T NG+SQ +C+NDLEPPAF VLPSLKRLKQRII++ RGE
Subjt:  EGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGE

Query:  FDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEATPPFCNPAIEPLLKLAGTDATNGCASTRV
        +DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EA       A E   + A  +AT   A +R+
Subjt:  FDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEATPPFCNPAIEPLLKLAGTDATNGCASTRV


Sequences
CDS sequence
ATGGCTTCCTGTAATATCCCTTGCAATTCACAGCTCCAATTTCCTTCCATTTCGTTTAGAAAGAATTTCGTCTTCAATTCGCTTGGGCCGCACGGTTCATTCGCTTTTGC
CTCGAGGTCGAAGCACTGGAGGAACTCACTTATCCAGAAGGCCGCCATAACCTGTAATTCCACCACTTCCAAACAACAAGTTGAGATAGCTTATGATCCTGATGAAAGGA
TAAACAAGCTAGCTGATGAAGTGGACCGGAGTGCTCCTCTTTCGAGGCTCACTCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACTAAGAAGAGGGAAGAT
GGATATCATGATTTGGCATCTCTCTTTCATGTGATAAGCTTAGGGGATACAATTAAATTCTCTTTGTCGCCATCGAAGAAGGACCGTCTTTCTACCAATGTATCTGGGGT
ACCCCTTGATGATAGAAATTTGATTATCAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACAATTTTTTCTGGATCCATCTTGACAAGAAGGTACCAACTGGAG
CAGGGCTTGGTGGAGGAAGCAGTAACGCTGCAACGGCACTGTGGGCGGCCAATCAGTTCAGTGGATGTCTTGCTACTGAAAAGGATCTTCAAGAATGGTCAAGTGAGATA
GGATCTGATATTGCCTTCTTTTTTTCTGAAGGGGCAGCCTTCTGCACCGGGCGAGGTGAGGTTGTACAGAATATTCCACCTCCAGTACCCTTGGACATTCCAATGGTTCT
CATAAAGCCCCAGGAAGCATGCTCTACTGCAGAAGTTTATAAGCGCCTACGGTTGGATCAAACAAGCAAGGTCAATCCTTTATCATTGTTGGATAAAATCACAAAGAACG
GAATATCCCAAGATGTGTGTATCAATGATTTGGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAAAGATTAAAACAGCGTATAATTTCTGCCAGCCGTGGAGAGTTC
GATGCTGTTTTTATGTCCGGAAGTGGTAGCACAATTGTAGGCATCGGGTCCCCAGATCCTCCAGGTTTCATATATAATGATGATGAATTCCAAGATGTGTTTTTGGCAGA
AGCAACTCCTCCATTTTGCAATCCAGCAATTGAACCTCTATTGAAGTTAGCGGGCACTGACGCCACAAATGGCTGTGCATCCACACGAGTGTTCCAATCCACCGGAGAAG
ATGCCCCTGATTGGGTTGAACTTGCTCCTGAAAATAGTCCAAGTGAGGAAGCAGCAGCTATGGAAGACGATGCACCTGTTCTGCTCCATGAAGTATTGGGACAAAATGAT
GCTGCTGCTGCTGATACTGAACTTGTGGGTAGATCTGATTTAAGATGGGTGCCGGACCTGCACAACCGAACTTGGTTCTCTCACTGGACCAGATTGAAGGCTATTGAATT
CGGTTCCAAATGTGCTGATGCTAAAGAGAAAGAAATGGGGCTAGCTCCTGCACTTGGCCACCTTTTCTCGGCGGTTGTGATTTCATTACCAGGAAACTTACTACTGCTTA
GAGCAGTTGGAGGATCACCAGTTTCTTCTGGCTTAGGAGGAGCAGCAGGCGGATCTAGCTTCAATTCTCTCAAGGATTCAGCTGCCTTCTCCAGATCACCTTCACAAGTA
ACAACAGCTCTCTCAACTTCTTGTTTTGTACACTTGTACTGTGATTCCAAATCTGATATACGAGCAAGCTCTTCTGATATAGGTGAAGCAACAGATACTGGCGATTCAAG
TGCATGGAAGGTTCCTGAAAGAGGATTATATGCACTTGCTGGAACACTACTACTACCAGCAGCAGCAGGTCCTGTCGGCTTTGCTGTAGGCTTCTGTAGCTCTTTGCTGG
CTTTCTTGTCTTTTGATTTAGATCTGGATGCTGGAGACATCTCTACAGCTATAAAACGCCTTCTGAAACATGCAATACACAACATCAAATTTTGGTAA
mRNA sequence
ATGGCTTCCTGTAATATCCCTTGCAATTCACAGCTCCAATTTCCTTCCATTTCGTTTAGAAAGAATTTCGTCTTCAATTCGCTTGGGCCGCACGGTTCATTCGCTTTTGC
CTCGAGGTCGAAGCACTGGAGGAACTCACTTATCCAGAAGGCCGCCATAACCTGTAATTCCACCACTTCCAAACAACAAGTTGAGATAGCTTATGATCCTGATGAAAGGA
TAAACAAGCTAGCTGATGAAGTGGACCGGAGTGCTCCTCTTTCGAGGCTCACTCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACTAAGAAGAGGGAAGAT
GGATATCATGATTTGGCATCTCTCTTTCATGTGATAAGCTTAGGGGATACAATTAAATTCTCTTTGTCGCCATCGAAGAAGGACCGTCTTTCTACCAATGTATCTGGGGT
ACCCCTTGATGATAGAAATTTGATTATCAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACAATTTTTTCTGGATCCATCTTGACAAGAAGGTACCAACTGGAG
CAGGGCTTGGTGGAGGAAGCAGTAACGCTGCAACGGCACTGTGGGCGGCCAATCAGTTCAGTGGATGTCTTGCTACTGAAAAGGATCTTCAAGAATGGTCAAGTGAGATA
GGATCTGATATTGCCTTCTTTTTTTCTGAAGGGGCAGCCTTCTGCACCGGGCGAGGTGAGGTTGTACAGAATATTCCACCTCCAGTACCCTTGGACATTCCAATGGTTCT
CATAAAGCCCCAGGAAGCATGCTCTACTGCAGAAGTTTATAAGCGCCTACGGTTGGATCAAACAAGCAAGGTCAATCCTTTATCATTGTTGGATAAAATCACAAAGAACG
GAATATCCCAAGATGTGTGTATCAATGATTTGGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAAAGATTAAAACAGCGTATAATTTCTGCCAGCCGTGGAGAGTTC
GATGCTGTTTTTATGTCCGGAAGTGGTAGCACAATTGTAGGCATCGGGTCCCCAGATCCTCCAGGTTTCATATATAATGATGATGAATTCCAAGATGTGTTTTTGGCAGA
AGCAACTCCTCCATTTTGCAATCCAGCAATTGAACCTCTATTGAAGTTAGCGGGCACTGACGCCACAAATGGCTGTGCATCCACACGAGTGTTCCAATCCACCGGAGAAG
ATGCCCCTGATTGGGTTGAACTTGCTCCTGAAAATAGTCCAAGTGAGGAAGCAGCAGCTATGGAAGACGATGCACCTGTTCTGCTCCATGAAGTATTGGGACAAAATGAT
GCTGCTGCTGCTGATACTGAACTTGTGGGTAGATCTGATTTAAGATGGGTGCCGGACCTGCACAACCGAACTTGGTTCTCTCACTGGACCAGATTGAAGGCTATTGAATT
CGGTTCCAAATGTGCTGATGCTAAAGAGAAAGAAATGGGGCTAGCTCCTGCACTTGGCCACCTTTTCTCGGCGGTTGTGATTTCATTACCAGGAAACTTACTACTGCTTA
GAGCAGTTGGAGGATCACCAGTTTCTTCTGGCTTAGGAGGAGCAGCAGGCGGATCTAGCTTCAATTCTCTCAAGGATTCAGCTGCCTTCTCCAGATCACCTTCACAAGTA
ACAACAGCTCTCTCAACTTCTTGTTTTGTACACTTGTACTGTGATTCCAAATCTGATATACGAGCAAGCTCTTCTGATATAGGTGAAGCAACAGATACTGGCGATTCAAG
TGCATGGAAGGTTCCTGAAAGAGGATTATATGCACTTGCTGGAACACTACTACTACCAGCAGCAGCAGGTCCTGTCGGCTTTGCTGTAGGCTTCTGTAGCTCTTTGCTGG
CTTTCTTGTCTTTTGATTTAGATCTGGATGCTGGAGACATCTCTACAGCTATAAAACGCCTTCTGAAACATGCAATACACAACATCAAATTTTGGTAA
Protein sequence
MASCNIPCNSQLQFPSISFRKNFVFNSLGPHGSFAFASRSKHWRNSLIQKAAITCNSTTSKQQVEIAYDPDERINKLADEVDRSAPLSRLTLFSPCKINVFLRITKKRED
GYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDNFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEI
GSDIAFFFSEGAAFCTGRGEVVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVNPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEF
DAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEATPPFCNPAIEPLLKLAGTDATNGCASTRVFQSTGEDAPDWVELAPENSPSEEAAAMEDDAPVLLHEVLGQND
AAAADTELVGRSDLRWVPDLHNRTWFSHWTRLKAIEFGSKCADAKEKEMGLAPALGHLFSAVVISLPGNLLLLRAVGGSPVSSGLGGAAGGSSFNSLKDSAAFSRSPSQV
TTALSTSCFVHLYCDSKSDIRASSSDIGEATDTGDSSAWKVPERGLYALAGTLLLPAAAGPVGFAVGFCSSLLAFLSFDLDLDAGDISTAIKRLLKHAIHNIKFW
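
The protein sequence should be the conceptual translation of the CDS sequence. The sketch below checks this for the first 48 and the last 12 nucleotides of the CDS listed above. It assumes the standard nuclear genetic code (NCBI translation table 1), which is the usual choice for a plant nuclear gene but is not stated on this page; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 48 nt and last 12 nt of the CDS sequence listed above.
cds_start = "ATGGCTTCCTGTAATATCCCTTGCAATTCACAGCTCCAATTTCCTTCC"
cds_end = "AAATTTTGGTAA"

print(translate(cds_start))  # expected to match the start of the protein: MASCNIPCNSQLQFPS
print(translate(cds_end))    # expected to match the end of the protein: KFW
```

Applying `translate` to the full CDS, joined into a single string without line breaks, gives a direct way to confirm that the three sequence fields are mutually consistent.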