CuGenDBv2

Lsi06G004190 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi06G004190
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase
Genome location: chr06:4612192..4618134
RNA-Seq Expression: Lsi06G004190
Synteny: Lsi06G004190
Gene Ontology terms:
GO:0016114 - terpenoid biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0050515 - 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity (molecular function)
InterPro domains:
IPR004424 - 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
IPR006204 - GHMP kinase N-terminal domain
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR036554 - GHMP kinase, C-terminal domain superfamily
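
The genome location field above can be parsed to get the span of the gene on chr06. This is a minimal sketch, not part of the database record: `parse_location` is a hypothetical helper name, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end) with integer coordinates."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr06:4612192..4618134")

# Assumption: 1-based inclusive coordinates, so both end positions are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region covers 5943 bp, which includes introns and untranslated regions as well as the coding sequence.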


Homology
GenBank top hits: e value, % identity, alignment
TYK09939.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase [Cucumis melo var. makuwa] (e value: 7.6e-197; % identity: 88.44)
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------
        MASC+IPCSSQ QFHSISFRKNFAFNSHGSHGSLAFASRLKQQ+AITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCK          
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------

Query:  --------------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA
                            VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  --------------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC
        NQFSGCLATE DLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

XP_004144086.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X1 [Cucumis sativus] (e value: 2.3e-193; % identity: 89)
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------
        MASC+I CSSQ QFHSISFRKNFAFNSHGS GSLAFASRLKQQRAITCNSTASKQQFE+VYDPDERI+KLADEVDRDAPLSRLTLFSPCK          
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------

Query:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
                     VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD FFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
Subjt:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFS+GAAFCTGRGEVVQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDP SLLDKITKNGISQDVCINDLEPP
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        AFEVLPSLKRLKQRIISASRG FDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

XP_008451044.1 PREDICTED: 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X1 [Cucumis melo] (e value: 1.2e-197; % identity: 90.03)
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------
        MASC+IPCSSQ QFHSISFRKNFAFNSHGSHGSLAFASRLKQQ+AITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCK          
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------

Query:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
                     VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
Subjt:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

XP_038879880.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X1 [Benincasa hispida] (e value: 1.2e-194; % identity: 89.54)
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------
        MASCNIPCSSQ QFHSISFRKNF FNSH SH S AFASRLK+QRAITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCK          
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------

Query:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
                     VISLGDTIKFS+SPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC 
Subjt:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFSEGAAFCTGRGEVVQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS VDPL+LLDKITKNGISQDVCINDLE P
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR
        AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEF+DVFLAEANFLTREANQWYQEPASASA SPP EHPESAR
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR

XP_038879881.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X2 [Benincasa hispida] (e value: 1.1e-192; % identity: 89.29)
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------
        MASCNIPCSSQ QFHSISFRKNF FNSH SH S AFASRLK+QRAITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCK          
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------

Query:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
                     VISLGDTIKFS+SPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC 
Subjt:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFSEGAAFCTGRGEVVQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS VDPL+LLDKITKNGISQDVCINDL  P
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR
        AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEF+DVFLAEANFLTREANQWYQEPASASA SPP EHPESAR
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR

TrEMBL top hits: e value, % identity, alignment
A0A0A0LZC4 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase (e value: 1.1e-193; % identity: 89)
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------
        MASC+I CSSQ QFHSISFRKNFAFNSHGS GSLAFASRLKQQRAITCNSTASKQQFE+VYDPDERI+KLADEVDRDAPLSRLTLFSPCK          
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------

Query:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
                     VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD FFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
Subjt:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFS+GAAFCTGRGEVVQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDP SLLDKITKNGISQDVCINDLEPP
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        AFEVLPSLKRLKQRIISASRG FDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

A0A1S3BQ16 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase (e value: 5.7e-198; % identity: 90.03)
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------
        MASC+IPCSSQ QFHSISFRKNFAFNSHGSHGSLAFASRLKQQ+AITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCK          
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------

Query:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
                     VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
Subjt:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

A0A5A7UKM7 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase (e value: 5.7e-198; % identity: 90.03)
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------
        MASC+IPCSSQ QFHSISFRKNFAFNSHGSHGSLAFASRLKQQ+AITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCK          
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------

Query:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
                     VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL
Subjt:  -------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCL

Query:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
        ATE DLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP
Subjt:  ATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPP

Query:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  AFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

A0A5D3CI45 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase (e value: 3.7e-197; % identity: 88.44)
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------
        MASC+IPCSSQ QFHSISFRKNFAFNSHGSHGSLAFASRLKQQ+AITCNSTASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCK          
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----------

Query:  --------------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA
                            VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  --------------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC
        NQFSGCLATE DLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EFRDVFLAEANFLTRE N+WYQEPAS+SACSPP EHPES+
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

A0A6J1HBW0 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase (e value: 4.8e-189; % identity: 86.93)
Query:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQ------QRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----
        MASCNIPCSS FQF SISFR+NF FN  G HGSLAFAS LK       QR  TCNS ASKQQFE+VYDPDERINKLADEVDRDAPLSRLTLFSPCK    
Subjt:  MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQ------QRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----

Query:  -------------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
                           VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  -------------------VISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGC+ATE DLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLP PVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR
        NDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEF+DVFLAEANFLTREANQWY+EPASASA SPP EHPE AR
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR

SwissProt top hits: e value, % identity, alignment
O81014 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic (e value: 1.1e-142; % identity: 68.58)
Query:  NFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-----------------------VISLGDTI
        +F  +S  +  S +F+ +L +   ++ +  AS++Q E+V+DPDER+NK+ D+VD++APLSRL LFSPCK                       VISLGDTI
Subjt:  NFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-----------------------VISLGDTI

Query:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF
        KFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS++FFWIHLDKKVPTGAGLGGGSSNAATALWAAN+ +G L TEN+LQ+WSSEIGSDIPFF
Subjt:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF

Query:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR
        FS GAA+CTGRGE+VQ+LP P PLD+PMVLIKP+EACSTAEVYKRLRLDQTS ++PL+LL+ +T NG+SQ +C+NDLEPPAF VLPSLKRLKQRII++ R
Subjt:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR

Query:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPE
        GE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +   E
Subjt:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPE

P56848 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic (e value: 6.6e-135; % identity: 63.82)
Query:  SSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQ-----RAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-------------
        +S+  F+S +      F+S   +GS +F  +L+       RA   + T  + Q EVVYD + ++NKLADEVDR+A +SRLTLFSPCK             
Subjt:  SSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQ-----RAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-------------

Query:  ----------VISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLAT
                  VISLGD IKFSLSPSK      TNV GVPLD++NLIIKALNL+RKKTG+DK FWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+AT
Subjt:  ----------VISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLAT

Query:  ENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAF
        E DLQEWS EIGSDIPFFFS GAA+CTGRGEVV+++P PVP D+ MVL+KPQEAC T EVYKRLRLDQTS +DPL LL+KI+K GISQDVC+NDLEPPAF
Subjt:  ENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAF

Query:  EVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPE
        EV+PSLKRLKQRI +A R ++DAVFMSGSGSTIVG+GSPDPP F+Y+ DE++++F +EA F+TR ANQWY EP S       P+  E
Subjt:  EVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPE

P93841 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic (Fragment) (e value: 2.3e-143; % identity: 69.19)
Query:  SHGSHGSLAFASRLKQQR-----AITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-----------------------VISLGDTI
        S+  HGS  F   ++ +R          S  SK+Q E+ Y+P+E+ NKLADEVDR+A LSRLTLFSPCK                       VISLGD I
Subjt:  SHGSHGSLAFASRLKQQR-----AITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-----------------------VISLGDTI

Query:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF
        KFSLSPSK KDRLSTNV+GVPLD+RNLIIKALNLYRKKTG+D +FWIHLDKKVPTGAGLGGGSSNAAT LWAANQFSGC+ATE +LQEWS EIGSDIPFF
Subjt:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF

Query:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR
        FS GAA+CTGRGEVVQ++PSP+P D+PMVLIKPQ+ACSTAEVYKR +LD +SKVDPLSLL+KI+ +GISQDVC+NDLEPPAFEVLPSLKRLKQR+I+A R
Subjt:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR

Query:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPES
        G++DAVFMSGSGSTIVG+GSPDPP F+Y+D+E++DVFL+EA+F+TR AN+WY EP S S     PE   S
Subjt:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPES

Q6MAT6 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase (e value: 2.0e-46; % identity: 42.86)
Query:  LFSPCKVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQ
        L S  + IS GD + F       D L+ +   +P DD NL++KA+ L+R KTG D    IHLDK++P+ AGLGGGSSNAAT LWA NQ +G + T  +L 
Subjt:  LFSPCKVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQ

Query:  EWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPL-DVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLP
        +W SEIG+DIPFFFS+G A CTGRGE V +L    PL    + ++KP    ST EVYK L   Q ++       +        +    NDLE  AFE+ P
Subjt:  EWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVPL-DVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLP

Query:  SLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWY
         LK LK  ++S+    FD V MSGSGS+   IG    P        F+      A F+ R +N+WY
Subjt:  SLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWY

Q8S2G0 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic (e value: 2.8e-133; % identity: 65.66)
Query:  SLAFASRLKQQRAITCNSTAS----KQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-----------------------VISLGDTIKFSLSPS
        S+   S   ++RA+     AS    ++Q EV YD   + NKLAD++D++A ++RL LFSPCK                       VISLGDTIKFSLSPS
Subjt:  SLAFASRLKQQRAITCNSTAS----KQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-----------------------VISLGDTIKFSLSPS

Query:  K-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAF
        K KDRLSTNV+GVP+D+ NLIIKALNLYRKKTG+D FFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+A+E +LQEWS EIGSDIPFFFS+GAA+
Subjt:  K-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAF

Query:  CTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVF
        CTGRGE+V+++ +P+P ++PMVL+KP EACSTAEVYKRLRL+ TS+ DPL LL +IT+NGISQD C+NDLEPPAFEVLPSLKRLK+RII+A+RG++DAVF
Subjt:  CTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVF

Query:  MSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA
        MSGSGSTIVGIGSPDPP F+Y+DD+++D F++EA FLTR  N+WY+EP S+   S     PE A
Subjt:  MSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPEHPESA

Arabidopsis top hits: e value, % identity, alignment
AT2G26930.1 4-(cytidine 5'-phospho)-2-C-methyl-D-erithritol kinase (e value: 8.0e-144; % identity: 68.58)
Query:  NFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-----------------------VISLGDTI
        +F  +S  +  S +F+ +L +   ++ +  AS++Q E+V+DPDER+NK+ D+VD++APLSRL LFSPCK                       VISLGDTI
Subjt:  NFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK-----------------------VISLGDTI

Query:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF
        KFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS++FFWIHLDKKVPTGAGLGGGSSNAATALWAAN+ +G L TEN+LQ+WSSEIGSDIPFF
Subjt:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFF

Query:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR
        FS GAA+CTGRGE+VQ+LP P PLD+PMVLIKP+EACSTAEVYKRLRLDQTS ++PL+LL+ +T NG+SQ +C+NDLEPPAF VLPSLKRLKQRII++ R
Subjt:  FSEGAAFCTGRGEVVQNLPSPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASR

Query:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPE
        GE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +   E
Subjt:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFRDVFLAEANFLTREANQWYQEPASASACSPPPE
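
In the pairwise alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap column. The sketch below counts those three cases for one midline string. `midline_stats` is a hypothetical helper, and the example string is only the first 21 columns of the first GenBank alignment, so its ratio is illustrative and will not reproduce the % identity reported for the full hit.

```python
def midline_stats(midline):
    """Return (identical, positive, columns) for one BLAST-style midline string."""
    identical = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positive = identical + midline.count("+")
    return identical, positive, len(midline)

# First 21 columns of the midline for hit TYK09939.1 (copied from the page).
mid = "MASC+IPCSSQ QFHSISFRK"
identical, positive, columns = midline_stats(mid)
print(f"identities: {identical}/{columns}, positives: {positive}/{columns}")
```

To approximate a whole-hit value, the midlines of all blocks for that hit would need to be concatenated at their full aligned width, including trailing blanks that a text copy may have trimmed.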


Sequences
CDS sequence
ATGGCTTCCTGCAATATCCCTTGCAGTTCGCAGTTCCAATTTCACTCTATTTCGTTTAGAAAGAATTTCGCCTTCAATTCTCATGGCTCTCACGGGTCACTCGCTTTTGC
CTCGAGGTTGAAGCAACAAAGGGCCATTACATGCAATTCCACCGCTTCCAAACAACAATTTGAGGTAGTTTATGATCCTGATGAAAGGATTAACAAGTTAGCTGATGAAG
TGGACCGGGATGCTCCTCTTTCGAGGCTTACTCTGTTCTCACCTTGCAAGGTGATAAGCTTAGGGGATACTATTAAATTCTCTTTGTCGCCATCAAAGAAGGATCGTCTT
TCTACCAACGTATCAGGGGTGCCCCTTGATGATAGAAATTTGATTATCAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACAAATTTTTCTGGATACATCTCGA
CAAAAAAGTACCAACTGGAGCAGGGCTTGGTGGAGGAAGCAGTAATGCTGCAACTGCACTGTGGGCGGCGAATCAGTTCAGTGGATGTCTTGCTACTGAAAATGACCTTC
AAGAATGGTCGAGCGAGATAGGATCTGATATTCCCTTCTTTTTCTCGGAAGGGGCGGCCTTTTGCACCGGGAGAGGTGAGGTTGTACAAAATCTTCCATCTCCAGTACCT
TTGGACGTTCCAATGGTTCTCATAAAACCCCAGGAAGCATGCTCTACAGCAGAAGTTTATAAGCGCTTACGGTTGGATCAAACGAGCAAGGTTGATCCTTTATCATTGTT
GGATAAAATCACAAAGAATGGAATATCCCAAGATGTGTGTATCAACGATTTGGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAAAGATTGAAGCAACGTATAATTT
CTGCCAGCCGTGGAGAATTCGATGCCGTATTTATGTCCGGGAGTGGTAGCACAATAGTAGGGATTGGGTCCCCAGATCCTCCGGGCTTCATATATAATGACGATGAATTC
CGGGACGTATTTTTGGCAGAGGCCAACTTTCTCACTCGTGAAGCGAATCAATGGTATCAAGAACCTGCTTCGGCATCCGCTTGTAGTCCGCCTCCCGAGCATCCTGAATC
AGCTAGATAG
mRNA sequence
CTTCGATCCCCAGCGGAAAAGAAGAAAAATAATAACTTATGTGAGTTTGTGAGAAAATTGACCGTCCAAATCGAAATGTGAAGGCTCTCTATTAGTATTAAACCTTACTT
AATTCCAAGCTTTCATGGAAGGAAACTGACATGGATGCCAACAAAAAAAAACCCTGAATTCAGCGATATAAATACGTTACTCTCCAAATTACCATGAAAAAAACTCGACC
CATTATACGAAAACTTCAACTTCTTCTTAATCTTGCTCGAAATTCTTCAATTTCTCATTTCTGATCATCTGGGTCTGTAGCCAGTGAAGAATCTTGTACTCATCAATGGC
TTCCTGCAATATCCCTTGCAGTTCGCAGTTCCAATTTCACTCTATTTCGTTTAGAAAGAATTTCGCCTTCAATTCTCATGGCTCTCACGGGTCACTCGCTTTTGCCTCGA
GGTTGAAGCAACAAAGGGCCATTACATGCAATTCCACCGCTTCCAAACAACAATTTGAGGTAGTTTATGATCCTGATGAAAGGATTAACAAGTTAGCTGATGAAGTGGAC
CGGGATGCTCCTCTTTCGAGGCTTACTCTGTTCTCACCTTGCAAGGTGATAAGCTTAGGGGATACTATTAAATTCTCTTTGTCGCCATCAAAGAAGGATCGTCTTTCTAC
CAACGTATCAGGGGTGCCCCTTGATGATAGAAATTTGATTATCAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACAAATTTTTCTGGATACATCTCGACAAAA
AAGTACCAACTGGAGCAGGGCTTGGTGGAGGAAGCAGTAATGCTGCAACTGCACTGTGGGCGGCGAATCAGTTCAGTGGATGTCTTGCTACTGAAAATGACCTTCAAGAA
TGGTCGAGCGAGATAGGATCTGATATTCCCTTCTTTTTCTCGGAAGGGGCGGCCTTTTGCACCGGGAGAGGTGAGGTTGTACAAAATCTTCCATCTCCAGTACCTTTGGA
CGTTCCAATGGTTCTCATAAAACCCCAGGAAGCATGCTCTACAGCAGAAGTTTATAAGCGCTTACGGTTGGATCAAACGAGCAAGGTTGATCCTTTATCATTGTTGGATA
AAATCACAAAGAATGGAATATCCCAAGATGTGTGTATCAACGATTTGGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAAAGATTGAAGCAACGTATAATTTCTGCC
AGCCGTGGAGAATTCGATGCCGTATTTATGTCCGGGAGTGGTAGCACAATAGTAGGGATTGGGTCCCCAGATCCTCCGGGCTTCATATATAATGACGATGAATTCCGGGA
CGTATTTTTGGCAGAGGCCAACTTTCTCACTCGTGAAGCGAATCAATGGTATCAAGAACCTGCTTCGGCATCCGCTTGTAGTCCGCCTCCCGAGCATCCTGAATCAGCTA
GATAGCTTCCTCCTCGGACGAAATAGAGGTTCAGCTTATGTTTCCTGTTCTAGTGAGAATTTCTGTATCACAGTTACAACTACATCCTCCATTGATAAACTTGGGAGGTG
ATCTTGAGTTCTCTTTCTTTAGCCACAACCATCTTAAATCTTTGGTCTGAAAAAAAAAAAAAAACTATGACCAAAGAGTGCAAAATTTGGAATTAGCACCAACAACATAA
GTATTGAAAATGAAAATGTCATGAATATAAACAAGGGTAATAATCTTACAATGTTAAAATGAATTCTATCTATTAATTTGGAATGATCTTAATTTTTATATGATTTTCGT
GATCATTAATATAACATTATCAACAAATTTGAAACTTTGACGTTTTTTCCGAACAAAATTTAATTTAAAAAACTACTACAAACAAAGAAAATATCAAATTACTGTATATC
ATTCAAGCAATAAAATTTTTCTATATTTATAAATAGTTTATCAAATTACTGTATATCATTCAAG
Protein sequence
MASCNIPCSSQFQFHSISFRKNFAFNSHGSHGSLAFASRLKQQRAITCNSTASKQQFEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKVISLGDTIKFSLSPSKKDRL
STNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPSPVP
LDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEF
RDVFLAEANFLTREANQWYQEPASASACSPPPEHPESAR
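
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch under two assumptions that the page does not state: the standard genetic code applies, and translation stops at the first in-frame stop codon. `translate` and `CODON_TABLE` are illustrative names, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped listing
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 21 nt (in frame) of the CDS listed above.
n_term = translate("ATGGCTTCCTGCAATATCCCTTGCAGTTCG")
c_term = translate("CATCCTGAATCAGCTAGATAG")
print(n_term, c_term)
```

Both fragments agree with the ends of the listed protein sequence (`MASCNIPCSS...` and `...HPESAR`), and passing the full wrapped CDS text to `translate` applies the same check to the whole sequence.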