; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007374 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007374
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase
Genome locationChr10:4261053..4266171
RNA-Seq ExpressionHG10007374
SyntenyHG10007374
Gene Ontology termsGO:0016114 - terpenoid biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0050515 - 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity (molecular function)
InterPro domainsIPR004424 - 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
IPR006204 - GHMP kinase N-terminal domain
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR036554 - GHMP kinase, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK09939.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase [Cucumis melo var. makuwa]3.3e-21494.22Show/hide
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-------INV
        MASC+IPCSSQ QFHSISFRKNFAFNSHGSHGSLAFASRLKQQ+AITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCK       INV
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-------INV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC
        NQFSGCLATE DLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

XP_004144086.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X1 [Cucumis sativus]5.2e-21294.88Show/hide
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
        MASC+I CSSQ QFHSISFRKNFAFNSHGS GSLAFASRLKQQRAITCNSTASKQQFE+VYDPDERI+KLADEVDRDAPLSRLTLFSPCKINVFLRITKK
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK

Query:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
        REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD FFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
Subjt:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFS+GAAFCTGRGEVVQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDP SLLDKITKNGISQDVCINDLEPP
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        AFEVLPSLKRLKQRIISASRG FDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

XP_008451044.1 PREDICTED: 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X1 [Cucumis melo]2.7e-21695.91Show/hide
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
        MASC+IPCSSQ QFHSISFRKNFAFNSHGSHGSLAFASRLKQQ+AITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK

Query:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
        REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
Subjt:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

XP_038879880.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X1 [Benincasa hispida]2.8e-21395.41Show/hide
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
        MASCNIPCSSQ QFHSISFRKNF FNSH SH S AFASRLK+QRAITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK

Query:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
        REDGYHDLASLFHVISLGDTIKFS+SPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC 
Subjt:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFSEGAAFCTGRGEVVQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS VDPL+LLDKITKNGISQDVCINDLE P
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR
        AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEF+DVFLAEANFLTREANQWYQEPASASA SPP EHPESAR
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR

XP_038879881.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X2 [Benincasa hispida]2.6e-21195.15Show/hide
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
        MASCNIPCSSQ QFHSISFRKNF FNSH SH S AFASRLK+QRAITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK

Query:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
        REDGYHDLASLFHVISLGDTIKFS+SPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC 
Subjt:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFSEGAAFCTGRGEVVQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS VDPL+LLDKITKNGISQDVCINDL  P
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR
        AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEF+DVFLAEANFLTREANQWYQEPASASA SPP EHPESAR
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR

TrEMBL top hitse value%identityAlignment
A0A0A0LZC4 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase2.5e-21294.88Show/hide
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
        MASC+I CSSQ QFHSISFRKNFAFNSHGS GSLAFASRLKQQRAITCNSTASKQQFE+VYDPDERI+KLADEVDRDAPLSRLTLFSPCKINVFLRITKK
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK

Query:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
        REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD FFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
Subjt:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFS+GAAFCTGRGEVVQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDP SLLDKITKNGISQDVCINDLEPP
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        AFEVLPSLKRLKQRIISASRG FDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

A0A1S3BQ16 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase1.3e-21695.91Show/hide
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
        MASC+IPCSSQ QFHSISFRKNFAFNSHGSHGSLAFASRLKQQ+AITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK

Query:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
        REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
Subjt:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

A0A5A7UKM7 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase1.3e-21695.91Show/hide
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
        MASC+IPCSSQ QFHSISFRKNFAFNSHGSHGSLAFASRLKQQ+AITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK

Query:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
        REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
Subjt:  REDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

A0A5D3CI45 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase1.6e-21494.22Show/hide
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-------INV
        MASC+IPCSSQ QFHSISFRKNFAFNSHGSHGSLAFASRLKQQ+AITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCK       INV
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-------INV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC
        NQFSGCLATE DLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

A0A6J1HBW0 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase3.2e-20792.46Show/hide
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQ------QRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNIPCSS FQF SISFR+NF FN  G HGSLAFAS LK       QR  TCNS ASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQ------QRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGC+ATE DLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR
        NDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEF+DVFLAEANFLTREANQWY+EPASASA SPP EHPE AR
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR

SwissProt top hitse value%identityAlignment
O81014 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic6.3e-16074.32Show/hide
Query:  NFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTI
        +F  +S  +  S +F+ +L +   ++ +  AS++Q E+V+DPDER+NK+ D+VD++APLSRL LFSPCKINVFLRIT KREDG+HDLASLFHVISLGDTI
Subjt:  NFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTI

Query:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF
        KFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS++FFWIHLDKKVPTGAGLGGGSSNAATALWAAN+ +G L TEN+LQ+WSSEIGSDIPFF
Subjt:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF

Query:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR
        FS GAA+CTGRGE+VQ+LP P PLD+PMVLIKP+EACSTAEVYKRLRLDQTS ++PL+LL+ +T NG+SQ +C+NDLEPPAF VLPSLKRLKQRII++ R
Subjt:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR

Query:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPE
        GE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +   E
Subjt:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPE

P56848 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic3.7e-15269.25Show/hide
Query:  SSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQ-----RAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKRED
        +S+  F+S +      F+S   +GS +F  +L+       RA   + T  + Q EVVYD + ++NKLADEVDR+A +SRLTLFSPCKINVFLRIT KRED
Subjt:  SSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQ-----RAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKRED

Query:  GYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLAT
        G+HDLASLFHVISLGD IKFSLSPSK      TNV GVPLD++NLIIKALNL+RKKTG+DK FWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+AT
Subjt:  GYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLAT

Query:  ENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAF
        E DLQEWS EIGSDIPFFFS GAA+CTGRGEVV+++P PVP D+ MVL+KPQEAC T EVYKRLRLDQTS +DPL LL+KI+K GISQDVC+NDLEPPAF
Subjt:  ENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAF

Query:  EVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPE
        EV+PSLKRLKQRI +A R ++DAVFMSGSGSTIVG+GSPDPP F+Y+ DE++++F +EA F+TR ANQWY EP S       P+  E
Subjt:  EVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPE

P93841 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic (Fragment)5.7e-16174.86Show/hide
Query:  SHGSHGSLAFASRLKQQR-----AITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTI
        S+  HGS  F   ++ +R          S  SK+Q E+ Y+P+E+ NKLADEVDR+A LSRLTLFSPCKINVFLRIT KR+DGYHDLASLFHVISLGD I
Subjt:  SHGSHGSLAFASRLKQQR-----AITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTI

Query:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF
        KFSLSPSK KDRLSTNV+GVPLD+RNLIIKALNLYRKKTG+D +FWIHLDKKVPTGAGLGGGSSNAAT LWAANQFSGC+ATE +LQEWS EIGSDIPFF
Subjt:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF

Query:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR
        FS GAA+CTGRGEVVQ++PSP+P D+PMVLIKPQ+ACSTAEVYKR +LD +SKVDPLSLL+KI+ +GISQDVC+NDLEPPAFEVLPSLKRLKQR+I+A R
Subjt:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR

Query:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPES
        G++DAVFMSGSGSTIVG+GSPDPP F+Y+D+E++DVFL+EA+F+TR AN+WY EP S S     PE   S
Subjt:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPES

Q6MAT6 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase3.7e-5945.02Show/hide
Query:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGG
        + LFSP KIN+FL++  KR DGYH+L+SLF  IS GD + F       D L+ +   +P DD NL++KA+ L+R KTG D    IHLDK++P+ AGLGGG
Subjt:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGG

Query:  SSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPL-DVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLD
        SSNAAT LWA NQ +G + T  +L +W SEIG+DIPFFFS+G A CTGRGE V +L    PL    + ++KP    ST EVYK L   Q ++       +
Subjt:  SSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPL-DVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLD

Query:  KITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWY
                +    NDLE  AFE+ P LK LK  ++S+    FD V MSGSGS+   IG    P        F+      A F+ R +N+WY
Subjt:  KITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWY

Q8S2G0 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic5.9e-15071.15Show/hide
Query:  SLAFASRLKQQRAITCNSTAS----KQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPS
        S+   S   ++RA+     AS    ++Q EV YD   + NKLAD++D++A ++RL LFSPCKINVFLRIT KR DG+HDLASLFHVISLGDTIKFSLSPS
Subjt:  SLAFASRLKQQRAITCNSTAS----KQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPS

Query:  K-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAF
        K KDRLSTNV+GVP+D+ NLIIKALNLYRKKTG+D FFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+A+E +LQEWS EIGSDIPFFFS+GAA+
Subjt:  K-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAF

Query:  CTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVF
        CTGRGE+V+++ +P+P ++PMVL+KP EACSTAEVYKRLRL+ TS+ DPL LL +IT+NGISQD C+NDLEPPAFEVLPSLKRLK+RII+A+RG++DAVF
Subjt:  CTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVF

Query:  MSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        MSGSGSTIVGIGSPDPP F+Y+DD+++D F++EA FLTR  N+WY+EP S+   S     PE A
Subjt:  MSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

Arabidopsis top hitse value%identityAlignment
AT2G26930.1 4-(cytidine 5'-phospho)-2-C-methyl-D-erithritol kinase4.5e-16174.32Show/hide
Query:  NFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTI
        +F  +S  +  S +F+ +L +   ++ +  AS++Q E+V+DPDER+NK+ D+VD++APLSRL LFSPCKINVFLRIT KREDG+HDLASLFHVISLGDTI
Subjt:  NFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTI

Query:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF
        KFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS++FFWIHLDKKVPTGAGLGGGSSNAATALWAAN+ +G L TEN+LQ+WSSEIGSDIPFF
Subjt:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF

Query:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR
        FS GAA+CTGRGE+VQ+LP P PLD+PMVLIKP+EACSTAEVYKRLRLDQTS ++PL+LL+ +T NG+SQ +C+NDLEPPAF VLPSLKRLKQRII++ R
Subjt:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR

Query:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPE
        GE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +   E
Subjt:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTGCAATATCCCTTGCAGTTCGCAGTTCCAATTTCACTCTATTTCGTTTAGAAAGAATTTCGCCTTCAATTCTCATGGCTCTCACGGGTCACTCGCTTTTGC
CTCGAGGTTGAAGCAACAAAGGGCCATTACATGCAATTCCACCGCTTCCAAACAACAATTTGAGGTAGTTTATGATCCTGATGAAAGGATTAACAAGTTAGCTGATGAAG
TGGACCGGGATGCTCCTCTTTCGAGGCTTACTCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACAAAGAAGAGGGAAGATGGATATCATGATTTGGCATCT
CTCTTTCATGTGATAAGCTTAGGGGATACTATTAAATTCTCTTTGTCGCCATCAAAGAAGGATCGTCTTTCTACCAACGTATCAGGGGTGCCCCTTGATGATAGAAATTT
GATTATCAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACAAATTTTTCTGGATACATCTCGACAAAAAAGTACCAACTGGAGCAGGGCTTGGTGGAGGAAGCA
GTAATGCTGCAACTGCACTGTGGGCGGCGAATCAGTTCAGTGGATGTCTTGCTACTGAAAATGACCTTCAAGAATGGTCGAGCGAGATAGGATCTGATATTCCCTTCTTT
TTCTCGGAAGGGGCGGCCTTTTGCACCGGGAGAGGTGAGGTTGTACAAAATCTTCCATCTCCAGTACCTTTGGACGTTCCAATGGTTCTCATAAAACCCCAGGAAGCATG
CTCTACAGCAGAAGTTTATAAGCGCTTACGGTTGGATCAAACGAGCAAGGTTGATCCTTTATCATTGTTGGATAAAATCACAAAGAATGGAATATCCCAAGATGTGTGTA
TCAACGATTTGGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAAAGATTGAAGCAACGTATAATTTCTGCCAGCCGTGGAGAATTCGATGCCGTATTTATGTCCGGG
AGTGGTAGCACAATAGTAGGGATTGGGTCCCCAGATCCTCCGGGCTTCATATATAATGACGATGAATTCCGGGACGTATTTTTGGCAGAGGCCAACTTTCTCACTCGTGA
AGCGAATCAATGGTATCAAGAACCTGCTTCGGCATCCGCTTGTAGTCCGCCTCCCGAGCATCCTGAATCAGCTAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCTGCAATATCCCTTGCAGTTCGCAGTTCCAATTTCACTCTATTTCGTTTAGAAAGAATTTCGCCTTCAATTCTCATGGCTCTCACGGGTCACTCGCTTTTGC
CTCGAGGTTGAAGCAACAAAGGGCCATTACATGCAATTCCACCGCTTCCAAACAACAATTTGAGGTAGTTTATGATCCTGATGAAAGGATTAACAAGTTAGCTGATGAAG
TGGACCGGGATGCTCCTCTTTCGAGGCTTACTCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACAAAGAAGAGGGAAGATGGATATCATGATTTGGCATCT
CTCTTTCATGTGATAAGCTTAGGGGATACTATTAAATTCTCTTTGTCGCCATCAAAGAAGGATCGTCTTTCTACCAACGTATCAGGGGTGCCCCTTGATGATAGAAATTT
GATTATCAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACAAATTTTTCTGGATACATCTCGACAAAAAAGTACCAACTGGAGCAGGGCTTGGTGGAGGAAGCA
GTAATGCTGCAACTGCACTGTGGGCGGCGAATCAGTTCAGTGGATGTCTTGCTACTGAAAATGACCTTCAAGAATGGTCGAGCGAGATAGGATCTGATATTCCCTTCTTT
TTCTCGGAAGGGGCGGCCTTTTGCACCGGGAGAGGTGAGGTTGTACAAAATCTTCCATCTCCAGTACCTTTGGACGTTCCAATGGTTCTCATAAAACCCCAGGAAGCATG
CTCTACAGCAGAAGTTTATAAGCGCTTACGGTTGGATCAAACGAGCAAGGTTGATCCTTTATCATTGTTGGATAAAATCACAAAGAATGGAATATCCCAAGATGTGTGTA
TCAACGATTTGGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAAAGATTGAAGCAACGTATAATTTCTGCCAGCCGTGGAGAATTCGATGCCGTATTTATGTCCGGG
AGTGGTAGCACAATAGTAGGGATTGGGTCCCCAGATCCTCCGGGCTTCATATATAATGACGATGAATTCCGGGACGTATTTTTGGCAGAGGCCAACTTTCTCACTCGTGA
AGCGAATCAATGGTATCAAGAACCTGCTTCGGCATCCGCTTGTAGTCCGCCTCCCGAGCATCCTGAATCAGCTAGATAG
Protein sequenceShow/hide protein sequence
MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLAS
LFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF
FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSG
SGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR