CuGenDBv2

Spg021824 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg021824
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase
Genome location: scaffold2:10702201..10706886
RNA-Seq Expression: Spg021824
Synteny: Spg021824
Gene Ontology terms:
GO:0016114 - terpenoid biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0050515 - 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity (molecular function)
InterPro domains:
IPR004424 - 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
IPR006204 - GHMP kinase N-terminal domain
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR036554 - GHMP kinase, C-terminal domain superfamily
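
The genome location field above packs the scaffold name and both coordinates into one string. A minimal Python sketch of splitting it apart is given here; the names `GeneLocation` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class GeneLocation(NamedTuple):
    """Hypothetical container for a 'scaffold:start..end' location."""
    scaffold: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GeneLocation:
    """Split a string such as 'scaffold2:10702201..10706886' into its parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return GeneLocation(match.group(1), start, end)


loc = parse_location("scaffold2:10702201..10706886")
print(loc.scaffold, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans more bases than the CDS listed further down, which would be consistent with introns in the genomic region.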


Homology
GenBank top hits (hit | e value | % identity; alignment below each hit)
KAG7033454.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic [Cucurbita argyrosperma subsp. argyrosperma] | 3.5e-211 | 92.7
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNIPCSS  QF SISFR+N  FN   PHGS  F S LKHQKNPLVQR  TCNS ASKQQ E+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQEWS EIGSDIPFFFSEGAAFC+GRGE+VQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA
        NDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEHPE A
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA

XP_008451044.1 PREDICTED: 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X1 [Cucumis melo] | 2.1e-208 | 91.23
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASC+IPCSSQLQF SISFRKN AFNS   HGS  F S+LK QK      A+TCNSTASKQQ E+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQEWSGEIGSDIPFFFS+GAAFC+GRGEIVQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESAES
        NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WY+EPAS+SACSPPSEHPES+ +
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESAES

XP_022960704.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like [Cucurbita moschata] | 5.9e-211 | 92.44
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNIPCSS  QF SISFR+N  FN   PHGS  F S LKHQKNPLVQR  TCNS ASKQQ E+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGC+ATEKDLQEWS EIGSDIPFFFSEGAAFC+GRGE+VQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA
        NDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEHPE A
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA

XP_022988144.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like [Cucurbita maxima] | 8.5e-210 | 92.19
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNIPCSS  QF SISFR+N+AFN   PHGS  F S LKHQKNPL+QR  TCNS ASKQQ E+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGC+ATEKDLQEWS EIGSDIPFFFSEGAAFC+GRGE+VQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA
        NDLEPPAFEVLPSLKRLKQRII +SR EFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEHPE A
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA

XP_023515762.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic [Cucurbita pepo subsp. pepo] | 2.0e-211 | 92.95
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNIPCSS  QF SISFR+N  FN   PHGS  F S LKHQKNPLVQR  TCNS ASKQQ E+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGC+ATEKDLQEWS EIGSDIPFFFSEGAAFC+GRGE+VQN+PPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA
        NDLEPPAFEVLPSLKRLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEHPE A
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA

TrEMBL top hits (hit | e value | % identity; alignment below each hit)
A0A1S3BQ16 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | 1.0e-208 | 91.23
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASC+IPCSSQLQF SISFRKN AFNS   HGS  F S+LK QK      A+TCNSTASKQQ E+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQEWSGEIGSDIPFFFS+GAAFC+GRGEIVQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESAES
        NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WY+EPAS+SACSPPSEHPES+ +
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESAES

A0A5A7UKM7 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | 1.0e-208 | 91.23
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASC+IPCSSQLQF SISFRKN AFNS   HGS  F S+LK QK      A+TCNSTASKQQ E+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQEWSGEIGSDIPFFFS+GAAFC+GRGEIVQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESAES
        NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WY+EPAS+SACSPPSEHPES+ +
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESAES

A0A5D3CI45 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | 1.2e-206 | 89.66
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----
        MASC+IPCSSQLQF SISFRKN AFNS   HGS  F S+LK QK      A+TCNSTASKQQ E+VYDPDERINKLADEVDRDAPLSRLTLFSPCK    
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCK----

Query:  ---INVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAA
           INVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAA
Subjt:  ---INVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAA

Query:  TALWAANQFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNG
        TALWAANQFSGCLATEKDLQEWSGEIGSDIPFFFS+GAAFC+GRGEIVQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNG
Subjt:  TALWAANQFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNG

Query:  ISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEH
        ISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WY+EPAS+SACSPPSEH
Subjt:  ISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEH

Query:  PESAES
        PES+ +
Subjt:  PESAES

A0A6J1HBW0 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | 2.9e-211 | 92.44
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNIPCSS  QF SISFR+N  FN   PHGS  F S LKHQKNPLVQR  TCNS ASKQQ E+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGC+ATEKDLQEWS EIGSDIPFFFSEGAAFC+GRGE+VQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA
        NDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEHPE A
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA

A0A6J1JIS3 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase | 4.1e-210 | 92.19
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNIPCSS  QF SISFR+N+AFN   PHGS  F S LKHQKNPL+QR  TCNS ASKQQ E+VYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD+FFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGC+ATEKDLQEWS EIGSDIPFFFSEGAAFC+GRGE+VQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA
        NDLEPPAFEVLPSLKRLKQRII +SR EFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEHPE A
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA

SwissProt top hits (hit | e value | % identity; alignment below each hit)
O81014 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic | 2.9e-160 | 70.74
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MA+ + P  S L F   SF+ +          S  F  K       L++  L+ +  AS++Q+E+V+DPDER+NK+ D+VD++APLSRL LFSPCKINVF
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRIT KREDG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS++FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC
        N+ +G L TE +LQ+WS EIGSDIPFFFS GAA+C+GRGEIVQ++PPP PLD+PMVLIKP+EACSTAEVYKRLRLDQTS ++PL+LL+ +T NG+SQ +C
Subjt:  NQFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSE
        +NDLEPPAF VLPSLKRLKQRII++ RGE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +  +E
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSE

P56848 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic | 1.9e-156 | 72.53
Query:  FNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLG
        F+S +P+GS  F  KL+  +  ++ RA   + T  + QLEVVYD + ++NKLADEVDR+A +SRLTLFSPCKINVFLRIT KREDG+HDLASLFHVISLG
Subjt:  FNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLG

Query:  DTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSGEIGSDI
        D IKFSLSPSK      TNV GVPLD++NLIIKALNL+RKKTG+D+ FWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+ATEKDLQEWSGEIGSDI
Subjt:  DTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSGEIGSDI

Query:  PFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIIS
        PFFFS GAA+C+GRGE+V++IPPPVP D+ MVL+KPQEAC T EVYKRLRLDQTS +DPL LL+KI+K GISQDVC+NDLEPPAFEV+PSLKRLKQRI +
Subjt:  PFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIIS

Query:  ASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESAE
        A R ++DAVFMSGSGSTIVG+GSPDPP F+Y+ DE++++F +EA F+TR ANQWY EP S    SP    P+ AE
Subjt:  ASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESAE

P93841 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic (Fragment) | 1.6e-163 | 75.47
Query:  SLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDT
        S RPHGS  F   ++ ++N  V       S  SK+Q+E+ Y+P+E+ NKLADEVDR+A LSRLTLFSPCKINVFLRIT KR+DGYHDLASLFHVISLGD 
Subjt:  SLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDT

Query:  IKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSGEIGSDIPF
        IKFSLSPSK KDRLSTNV+GVPLD+RNLIIKALNLYRKKTG+D +FWIHLDKKVPTGAGLGGGSSNAAT LWAANQFSGC+ATEK+LQEWSGEIGSDIPF
Subjt:  IKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSGEIGSDIPF

Query:  FFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISAS
        FFS GAA+C+GRGE+VQ+IP P+P DIPMVLIKPQ+ACSTAEVYKR +LD +SKVDPLSLL+KI+ +GISQDVC+NDLEPPAFEVLPSLKRLKQR+I+A 
Subjt:  FFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISAS

Query:  RGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPES
        RG++DAVFMSGSGSTIVG+GSPDPP F+Y+D+E++DVFL+EA+F+TR AN+WY EP S S      E   S
Subjt:  RGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPES

Q6MAT6 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase | 3.8e-59 | 43.79
Query:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGG
        + LFSP KIN+FL++  KR DGYH+L+SLF  IS GD + F       D L+ +   +P DD NL++KA+ L+R KTG D    IHLDK++P+ AGLGGG
Subjt:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGG

Query:  SSNAATALWAANQFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDK
        SSNAAT LWA NQ +G + T ++L +W  EIG+DIPFFFS+G A C+GRGE V ++ P     I   ++KP    ST EVYK L   Q ++       + 
Subjt:  SSNAATALWAANQFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDK

Query:  ITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWY
               +    NDLE  AFE+ P LK LK  ++S+    FD V MSGSGS+   IG    P                A F+ R +N+WY
Subjt:  ITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWY

Q8S2G0 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic | 2.4e-151 | 74.01
Query:  QRALTCNSTAS----KQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNV
        +RAL     AS    ++Q+EV YD   + NKLAD++D++A ++RL LFSPCKINVFLRIT KR DG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV
Subjt:  QRALTCNSTAS----KQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNV

Query:  SGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQN
        +GVP+D+ NLIIKALNLYRKKTG+D FFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+A+EK+LQEWSGEIGSDIPFFFS+GAA+C+GRGEIV++
Subjt:  SGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQN

Query:  IPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVG
        I  P+P ++PMVL+KP EACSTAEVYKRLRL+ TS+ DPL LL +IT+NGISQD C+NDLEPPAFEVLPSLKRLK+RII+A+RG++DAVFMSGSGSTIVG
Subjt:  IPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVG

Query:  IGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA
        IGSPDPP F+Y+DD+++D F++EA FLTR  N+WYREP S+   S     PE A
Subjt:  IGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESA

Arabidopsis top hits (hit | e value | % identity; alignment below each hit)
AT2G26930.1 4-(cytidine 5'-phospho)-2-C-methyl-D-erithritol kinase | 2.1e-161 | 70.74
Query:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MA+ + P  S L F   SF+ +          S  F  K       L++  L+ +  AS++Q+E+V+DPDER+NK+ D+VD++APLSRL LFSPCKINVF
Subjt:  MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRIT KREDG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS++FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC
        N+ +G L TE +LQ+WS EIGSDIPFFFS GAA+C+GRGEIVQ++PPP PLD+PMVLIKP+EACSTAEVYKRLRLDQTS ++PL+LL+ +T NG+SQ +C
Subjt:  NQFSGCLATEKDLQEWSGEIGSDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSE
        +NDLEPPAF VLPSLKRLKQRII++ RGE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +  +E
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSE
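
In the alignment blocks above, the unlabeled middle line marks each aligned column: a letter for an identical residue, `+` for a conservative substitution, and a blank for a mismatch or gap. The Python sketch below counts those marks for one block. The function name `midline_stats` is hypothetical, and whether the % identity values listed on this page were computed exactly this way (identities divided by alignment length) is an assumption.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    `query` is the aligned query segment (gaps written as '-');
    `midline` is the marker line printed beneath it.
    """
    length = len(query)
    # Scraped text often loses trailing blanks, so pad the midline back out.
    midline = midline.ljust(length)[:length]
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": identities + conservative,
        "pct_identity": round(100.0 * identities / length, 2),
    }


# Last block of the A0A5D3CI45 alignment shown above.
stats = midline_stats("PESAES", "PES+ +")
print(stats)
```

A figure for a whole hit would need the counts summed over every block of that hit before dividing, rather than averaging per-block percentages.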


Sequences
CDS sequence
ATGGCTTCCTGCAACATCCCTTGTAGTTCACAGCTCCAATTTTGCTCCATTTCGTTTAGAAAGAATCTCGCCTTCAATTCACTTCGGCCTCACGGTTCATTCGGGTTTGG
CTCGAAGTTGAAGCACCAGAAGAATCCACTCGTTCAGAGAGCCTTAACATGCAATTCCACCGCTTCCAAACAGCAACTCGAGGTAGTTTATGATCCTGATGAAAGGATAA
ACAAGTTAGCTGATGAAGTAGACCGAGATGCTCCTCTTTCGAGGCTTACTCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACTAAGAAGAGGGAAGATGGA
TATCATGATTTGGCATCTCTCTTTCATGTGATAAGCTTAGGCGACACTATTAAATTCTCTTTGTCGCCATCGAAGAAGGATCGTCTTTCTACCAATGTATCAGGGGTACC
CCTTGATGATAGAAATTTGATTATAAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACCAATTTTTCTGGATCCATCTCGACAAGAAGGTACCAACTGGAGCAG
GGCTTGGTGGAGGAAGCAGTAATGCTGCAACTGCACTGTGGGCGGCCAATCAGTTCAGTGGATGTCTTGCTACTGAAAAGGACCTTCAAGAATGGTCAGGTGAGATAGGA
TCTGATATTCCCTTCTTTTTCTCGGAAGGGGCGGCCTTCTGCAGCGGGAGAGGTGAGATTGTACAGAATATTCCACCTCCAGTACCCTTGGACATTCCAATGGTTCTCAT
AAAGCCCCAGGAAGCATGCTCTACAGCAGAAGTTTACAAGAGATTACGGTTGGATCAAACAAGCAAGGTTGATCCTTTATCATTGTTGGATAAAATCACAAAGAATGGAA
TATCCCAAGATGTGTGTATCAACGATTTAGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAAAGATTGAAACAGCGTATAATTTCTGCCAGCCGTGGAGAGTTCGAC
GCTGTTTTTATGTCCGGGAGTGGTAGCACAATAGTAGGGATTGGGTCCCCAGATCCTCCAGGCTTCATATATAATGATGACGAATTCCAGGACGTATTTTTGGCAGAGGC
CAACTTTCTCACTCGTGAAGCAAATCAATGGTATCGAGAACCTGCTTCGGCATCTGCTTGTAGTCCACCTTCCGAGCATCCCGAATCGGCAGAATCGACTAGATAA
mRNA sequence
ATGGCTTCCTGCAACATCCCTTGTAGTTCACAGCTCCAATTTTGCTCCATTTCGTTTAGAAAGAATCTCGCCTTCAATTCACTTCGGCCTCACGGTTCATTCGGGTTTGG
CTCGAAGTTGAAGCACCAGAAGAATCCACTCGTTCAGAGAGCCTTAACATGCAATTCCACCGCTTCCAAACAGCAACTCGAGGTAGTTTATGATCCTGATGAAAGGATAA
ACAAGTTAGCTGATGAAGTAGACCGAGATGCTCCTCTTTCGAGGCTTACTCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACTAAGAAGAGGGAAGATGGA
TATCATGATTTGGCATCTCTCTTTCATGTGATAAGCTTAGGCGACACTATTAAATTCTCTTTGTCGCCATCGAAGAAGGATCGTCTTTCTACCAATGTATCAGGGGTACC
CCTTGATGATAGAAATTTGATTATAAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACCAATTTTTCTGGATCCATCTCGACAAGAAGGTACCAACTGGAGCAG
GGCTTGGTGGAGGAAGCAGTAATGCTGCAACTGCACTGTGGGCGGCCAATCAGTTCAGTGGATGTCTTGCTACTGAAAAGGACCTTCAAGAATGGTCAGGTGAGATAGGA
TCTGATATTCCCTTCTTTTTCTCGGAAGGGGCGGCCTTCTGCAGCGGGAGAGGTGAGATTGTACAGAATATTCCACCTCCAGTACCCTTGGACATTCCAATGGTTCTCAT
AAAGCCCCAGGAAGCATGCTCTACAGCAGAAGTTTACAAGAGATTACGGTTGGATCAAACAAGCAAGGTTGATCCTTTATCATTGTTGGATAAAATCACAAAGAATGGAA
TATCCCAAGATGTGTGTATCAACGATTTAGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAAAGATTGAAACAGCGTATAATTTCTGCCAGCCGTGGAGAGTTCGAC
GCTGTTTTTATGTCCGGGAGTGGTAGCACAATAGTAGGGATTGGGTCCCCAGATCCTCCAGGCTTCATATATAATGATGACGAATTCCAGGACGTATTTTTGGCAGAGGC
CAACTTTCTCACTCGTGAAGCAAATCAATGGTATCGAGAACCTGCTTCGGCATCTGCTTGTAGTCCACCTTCCGAGCATCCCGAATCGGCAGAATCGACTAGATAA
Protein sequence
MASCNIPCSSQLQFCSISFRKNLAFNSLRPHGSFGFGSKLKHQKNPLVQRALTCNSTASKQQLEVVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDG
YHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSGEIG
SDIPFFFSEGAAFCSGRGEIVQNIPPPVPLDIPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFD
AVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHPESAESTR
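
The protein sequence above should follow from the CDS by codon-wise translation. The Python sketch below checks this on two short excerpts copied from the CDS (its first 30 and last 33 nucleotides). It assumes the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical, and this is not necessarily how the database derived its protein records.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# Excerpts copied from the CDS listed above (start and end of the sequence).
cds_start = "ATGGCTTCCTGCAACATCCCTTGTAGTTCA"
cds_end = "GAGCATCCCGAATCGGCAGAATCGACTAGATAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two printed fragments can be compared by eye with the first and last ten residues of the protein sequence listed above.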