CuGenDBv2

Carg06137 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg06137
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase
Genome location: Carg_Chr04:22149355..22153728
RNA-Seq Expression: Carg06137
Synteny: Carg06137
Gene Ontology terms:
GO:0016114 - terpenoid biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0050515 - 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity (molecular function)
InterPro domains:
IPR004424 - 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
IPR006204 - GHMP kinase N-terminal domain
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR036554 - GHMP kinase, C-terminal domain superfamily
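
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the following Python splits such a string and derives the span of the gene. The helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return seqid, start, end, end - start + 1


# Location string copied from the record above.
seqid, start, end, length = parse_location("Carg_Chr04:22149355..22153728")
print(seqid, start, end, length)
```

Under the inclusive-coordinate assumption, the reported span covers the whole gene locus, so it is longer than the CDS alone.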


Homology
GenBank top hits (e value, % identity, alignment)
KAG7033454.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.3e-229; % identity: 100)
Query:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
        MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
Subjt:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN

Query:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
        VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
Subjt:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA

Query:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
        ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
Subjt:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV

Query:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
        CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
Subjt:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR

XP_008451044.1 PREDICTED: 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic isoform X1 [Cucumis melo] (e value: 1.4e-204; % identity: 91.39)
Query:  MASCNIPCSSGFQFRSISFRRNFVFN--GPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASC+IPCSS  QF SISFR+NF FN  G HGSLAFAS LK QK        TCNS ASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSGFQFRSISFRRNFVFN--GPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPE
        NDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WY+EPAS+SA SPPSEHPE
Subjt:  NDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPE

XP_022960704.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like [Cucurbita moschata] (e value: 2.2e-229; % identity: 99.75)
Query:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
        MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
Subjt:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN

Query:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
        VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
Subjt:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA

Query:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
        ANQFSGC+ATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
Subjt:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV

Query:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
        CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
Subjt:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR

XP_022988144.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like [Cucurbita maxima] (e value: 1.6e-224; % identity: 97.75)
Query:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
        M SSMASCNIPCSSGFQFRSISFRRN  FNGPHGSLAFASGLKHQKNPL+QRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
Subjt:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN

Query:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
        VFLRIT KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
Subjt:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA

Query:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
        ANQFSGC+ATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
Subjt:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV

Query:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
        CINDLEPPAFEVLPSL+RLKQRII SSR EFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
Subjt:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR

XP_023515762.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic [Cucurbita pepo subsp. pepo] (e value: 3.1e-228; % identity: 99)
Query:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
        M SSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
Subjt:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN

Query:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
        VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
Subjt:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA

Query:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
        ANQFSGC+ATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
Subjt:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV

Query:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
        CINDLEPPAFEVLPSL+RLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
Subjt:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BQ16 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase (e value: 6.8e-205; % identity: 91.39)
Query:  MASCNIPCSSGFQFRSISFRRNFVFN--GPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASC+IPCSS  QF SISFR+NF FN  G HGSLAFAS LK QK        TCNS ASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSGFQFRSISFRRNFVFN--GPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPE
        NDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WY+EPAS+SA SPPSEHPE
Subjt:  NDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPE

A0A5A7UKM7 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase (e value: 6.8e-205; % identity: 91.39)
Query:  MASCNIPCSSGFQFRSISFRRNFVFN--GPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASC+IPCSS  QF SISFR+NF FN  G HGSLAFAS LK QK        TCNS ASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNIPCSSGFQFRSISFRRNFVFN--GPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPE
        NDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WY+EPAS+SA SPPSEHPE
Subjt:  NDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPE

A0A5D3CI45 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase (e value: 8.3e-203; % identity: 89.8)
Query:  MASCNIPCSSGFQFRSISFRRNFVFN--GPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCK----
        MASC+IPCSS  QF SISFR+NF FN  G HGSLAFAS LK QK        TCNS ASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCK    
Subjt:  MASCNIPCSSGFQFRSISFRRNFVFN--GPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCK----

Query:  ---INVFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAA
           INVFLRIT+KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAA
Subjt:  ---INVFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAA

Query:  TALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNG
        TALWAANQFSGCLATEKDLQEWS EIGSDIPFFFS+GAAFCTGRGE+VQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNG
Subjt:  TALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNG

Query:  ISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEH
        ISQDVCINDLEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WY+EPAS+SA SPPSEH
Subjt:  ISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEH

Query:  PE
        PE
Subjt:  PE

A0A6J1HBW0 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase (e value: 1.0e-229; % identity: 99.75)
Query:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
        MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
Subjt:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN

Query:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
        VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
Subjt:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA

Query:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
        ANQFSGC+ATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
Subjt:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV

Query:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
        CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
Subjt:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR

A0A6J1JIS3 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase (e value: 7.7e-225; % identity: 97.75)
Query:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
        M SSMASCNIPCSSGFQFRSISFRRN  FNGPHGSLAFASGLKHQKNPL+QRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN
Subjt:  MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKIN

Query:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
        VFLRIT KREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA
Subjt:  VFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWA

Query:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
        ANQFSGC+ATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV
Subjt:  ANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDV

Query:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
        CINDLEPPAFEVLPSL+RLKQRII SSR EFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
Subjt:  CINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR

SwissProt top hits (e value, % identity, alignment)
O81014 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic (e value: 1.7e-160; % identity: 77.52)
Query:  LVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSG
        L++ + + +  AS++Q EIV+DPDER+NK+ D+VD++APLSRL LFSPCKINVFLRIT KREDG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV G
Subjt:  LVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSG

Query:  VPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLP
        VP+D RNLIIKALNLYRKKTGS++FFWIHLDKKVPTGAGLGGGSSNAATALWAAN+ +G L TE +LQ+WSSEIGSDIPFFFS GAA+CTGRGE+VQ+LP
Subjt:  VPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLP

Query:  PPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIG
        PP PLD+PMVLIKP+EACSTAEVYKRLRLDQTS ++PL+LL+ +T NG+SQ +C+NDLEPPAF VLPSL+RLKQRII+S RGE+DAVFMSGSGSTI+GIG
Subjt:  PPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIG

Query:  SPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSE
        SPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +  +E
Subjt:  SPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSE

P56848 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic (e value: 7.6e-153; % identity: 72.96)
Query:  PHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTIKF
        P+GS +F   L+  +  ++ R    +    + Q E+VYD + ++NKLADEVDR+A +SRLTLFSPCKINVFLRIT KREDG+HDLASLFHVISLGD IKF
Subjt:  PHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTIKF

Query:  SLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFS
        SLSPSK      TNV GVPLD++NLIIKALNL+RKKTG+DK FWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+ATEKDLQEWS EIGSDIPFFFS
Subjt:  SLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFS

Query:  EGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGE
         GAA+CTGRGEVV+++PPPVP D+ MVL+KPQEAC T EVYKRLRLDQTS +DPL LL+KI+K GISQDVC+NDLEPPAFEV+PSL+RLKQRI ++ R +
Subjt:  EGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGE

Query:  FDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPAS
        +DAVFMSGSGSTIVG+GSPDPP F+Y+ DE++++F +EA F+TR ANQWY EP S
Subjt:  FDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPAS

P93841 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic (Fragment) (e value: 4.7e-163; % identity: 76.88)
Query:  NGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTI
        N PHGS  F   ++ ++N  V  V    S  SK+Q EI Y+P+E+ NKLADEVDR+A LSRLTLFSPCKINVFLRIT KR+DGYHDLASLFHVISLGD I
Subjt:  NGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTI

Query:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFF
        KFSLSPSK KDRLSTNV+GVPLD+RNLIIKALNLYRKKTG+D +FWIHLDKKVPTGAGLGGGSSNAAT LWAANQFSGC+ATEK+LQEWS EIGSDIPFF
Subjt:  KFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFF

Query:  FSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSR
        FS GAA+CTGRGEVVQ++P P+P D+PMVLIKPQ+ACSTAEVYKR +LD +SKVDPLSLL+KI+ +GISQDVC+NDLEPPAFEVLPSL+RLKQR+I++ R
Subjt:  FSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSR

Query:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASAS
        G++DAVFMSGSGSTIVG+GSPDPP F+Y+D+E++DVFL+EA+F+TR AN+WY EP S S
Subjt:  GEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASAS

Q6MAT6 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase (e value: 5.8e-60; % identity: 44.48)
Query:  LTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGG
        + LFSP KIN+FL++  KR DGYH+L+SLF  IS GD + F       D L+ +   +P DD NL++KA+ L+R KTG D    IHLDK++P+ AGLGGG
Subjt:  LTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGG

Query:  SSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDK
        SSNAAT LWA NQ +G + T ++L +W SEIG+DIPFFFS+G A CTGRGE V +L P     +   ++KP    ST EVYK L   Q ++       + 
Subjt:  SSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDK

Query:  ITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWY
               +    NDLE  AFE+ P L+ LK  ++SS    FD V MSGSGS+   IG    P                A F+ R +N+WY
Subjt:  ITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWY

Q8S2G0 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic (e value: 7.9e-150; % identity: 74.34)
Query:  KQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKAL
        ++Q E+ YD   + NKLAD++D++A ++RL LFSPCKINVFLRIT KR DG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV+GVP+D+ NLIIKAL
Subjt:  KQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKAL

Query:  NLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIK
        NLYRKKTG+D FFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+A+EK+LQEWS EIGSDIPFFFS+GAA+CTGRGE+V+++  P+P ++PMVL+K
Subjt:  NLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIK

Query:  PQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDE
        P EACSTAEVYKRLRL+ TS+ DPL LL +IT+NGISQD C+NDLEPPAFEVLPSL+RLK+RII+++RG++DAVFMSGSGSTIVGIGSPDPP F+Y+DD+
Subjt:  PQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDE

Query:  FQDVFLAEANFLTREANQWYREPASASARSPPSEHPELA
        ++D F++EA FLTR  N+WYREP S+   S     PE+A
Subjt:  FQDVFLAEANFLTREANQWYREPASASARSPPSEHPELA

Arabidopsis top hits (e value, % identity, alignment)
AT2G26930.1 4-(cytidine 5'-phospho)-2-C-methyl-D-erythritol kinase (e value: 1.2e-161; % identity: 77.52)
Query:  LVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSG
        L++ + + +  AS++Q EIV+DPDER+NK+ D+VD++APLSRL LFSPCKINVFLRIT KREDG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV G
Subjt:  LVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITEKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSG

Query:  VPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLP
        VP+D RNLIIKALNLYRKKTGS++FFWIHLDKKVPTGAGLGGGSSNAATALWAAN+ +G L TE +LQ+WSSEIGSDIPFFFS GAA+CTGRGE+VQ+LP
Subjt:  VPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSEIGSDIPFFFSEGAAFCTGRGEVVQNLP

Query:  PPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIG
        PP PLD+PMVLIKP+EACSTAEVYKRLRLDQTS ++PL+LL+ +T NG+SQ +C+NDLEPPAF VLPSL+RLKQRII+S RGE+DAVFMSGSGSTI+GIG
Subjt:  PPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGEFDAVFMSGSGSTIVGIG

Query:  SPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSE
        SPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +  +E
Subjt:  SPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSE
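
The % identity values reported for the hits above can be related to the alignment rows themselves. In BLAST-style pairwise output, the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identical columns from a query row and its middle line; the helper name `count_identities` is hypothetical, and the fragment is the first 29 columns of the last alignment row of the XP_008451044.1 hit, so its result describes that fragment only, not the whole alignment.

```python
def count_identities(query: str, midline: str) -> tuple[int, int]:
    """Return (identical_columns, alignment_length) for one alignment row.

    A column counts as identical when the midline repeats the query
    residue. '+' (conservative substitution), blanks (mismatch or gap)
    and gap characters in the query are not counted as identities.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline rows must have equal length")
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return identical, len(query)


# Fragment copied from the XP_008451044.1 alignment above.
query_row = "NDLEPPAFEVLPSLRRLKQRIISSSRGEF"
mid_row = "NDLEPPAFEVLPSL+RLKQRIIS+SRGEF"

identical, length = count_identities(query_row, mid_row)
print(f"{identical}/{length} identical = {100.0 * identical / length:.2f}%")
```

To approximate a whole-hit figure, the counts from every row of that hit would be summed before dividing, with gap columns included in the alignment length.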


Sequences
CDS sequence
ATGTTCTCATCAATGGCTTCCTGTAACATCCCGTGCAGTTCAGGGTTCCAATTTCGGTCCATTTCTTTTAGAAGGAATTTCGTCTTCAATGGGCCACACGGTTCGCTCGC
TTTTGCCTCGGGGTTGAAGCACCAGAAGAATCCACTCGTTCAAAGGGTCACTACATGCAATTCCGCCGCTTCCAAACAACAATTTGAGATAGTTTATGATCCTGATGAAC
GGATAAACAAGTTGGCTGATGAAGTAGACCGGGATGCTCCACTTTCGAGGCTTACTTTGTTCTCACCTTGCAAGATTAATGTGTTCTTGAGAATAACTGAGAAGAGGGAA
GATGGATATCATGATTTGGCCTCTCTTTTCCATGTGATAAGCTTAGGGGATACTATTAAGTTCTCATTGTCGCCATCGAAGAAGGACCGTCTTTCTACCAATGTATCGGG
AGTACCCCTCGATGACAGAAATTTGATTATCAAAGCTCTTAACCTCTACAGGAAGAAGACTGGTAGTGACAAGTTTTTCTGGATCCATCTCGACAAGAAGGTACCGACTG
GAGCGGGGCTTGGTGGAGGAAGCAGTAATGCTGCCACAGCACTGTGGGCGGCCAATCAGTTCAGTGGATGTCTTGCTACTGAAAAGGACCTTCAAGAATGGTCGAGCGAG
ATAGGCTCTGATATTCCCTTCTTTTTCTCGGAAGGGGCGGCGTTTTGTACCGGGAGAGGCGAGGTTGTACAGAATCTTCCACCTCCAGTACCCTTGGACGTTCCAATGGT
TCTCATAAAGCCCCAGGAAGCATGCTCTACAGCAGAAGTTTATAAGCGCTTACGATTGGATCAAACAAGCAAGGTCGATCCTTTGTCGTTGTTGGATAAAATCACCAAGA
ACGGAATATCCCAAGATGTGTGTATCAACGATTTGGAACCTCCTGCTTTCGAGGTCCTCCCGTCTCTTAGAAGATTGAAACAGCGTATAATTTCTTCCAGCCGTGGCGAA
TTCGACGCTGTGTTCATGTCTGGGAGTGGTAGCACAATAGTAGGAATTGGGTCCCCAGATCCTCCAGGCTTCATATATAACGACGATGAATTTCAGGACGTATTTTTGGC
AGAGGCAAACTTTCTCACCCGTGAAGCAAATCAATGGTATCGAGAGCCTGCATCGGCATCAGCTCGTAGTCCGCCTTCCGAGCATCCTGAATTGGCTAGATAG
mRNA sequence
TTTGAATATTCGACTAATCTTCCCTATTCCCACATCATTTGATTGCTCACTCAATTCCAAGAGTTCATAAAAGGAAACCAACAATGGATGCCAAAAACCCTGAATTCAGC
GATGTAGATATCAATCCGTTCGCGAAATCTCATTAAAAAGACACCATCCATATTACAAATCTTCATCTTACTCTTAATCCTGTTTGAAATTCTTCAATTTTTCATTTCTA
ATCATCAGGGGATCGGTAGCTGCTGGTGAAGAATCATGTTCTCATCAATGGCTTCCTGTAACATCCCGTGCAGTTCAGGGTTCCAATTTCGGTCCATTTCTTTTAGAAGG
AATTTCGTCTTCAATGGGCCACACGGTTCGCTCGCTTTTGCCTCGGGGTTGAAGCACCAGAAGAATCCACTCGTTCAAAGGGTCACTACATGCAATTCCGCCGCTTCCAA
ACAACAATTTGAGATAGTTTATGATCCTGATGAACGGATAAACAAGTTGGCTGATGAAGTAGACCGGGATGCTCCACTTTCGAGGCTTACTTTGTTCTCACCTTGCAAGA
TTAATGTGTTCTTGAGAATAACTGAGAAGAGGGAAGATGGATATCATGATTTGGCCTCTCTTTTCCATGTGATAAGCTTAGGGGATACTATTAAGTTCTCATTGTCGCCA
TCGAAGAAGGACCGTCTTTCTACCAATGTATCGGGAGTACCCCTCGATGACAGAAATTTGATTATCAAAGCTCTTAACCTCTACAGGAAGAAGACTGGTAGTGACAAGTT
TTTCTGGATCCATCTCGACAAGAAGGTACCGACTGGAGCGGGGCTTGGTGGAGGAAGCAGTAATGCTGCCACAGCACTGTGGGCGGCCAATCAGTTCAGTGGATGTCTTG
CTACTGAAAAGGACCTTCAAGAATGGTCGAGCGAGATAGGCTCTGATATTCCCTTCTTTTTCTCGGAAGGGGCGGCGTTTTGTACCGGGAGAGGCGAGGTTGTACAGAAT
CTTCCACCTCCAGTACCCTTGGACGTTCCAATGGTTCTCATAAAGCCCCAGGAAGCATGCTCTACAGCAGAAGTTTATAAGCGCTTACGATTGGATCAAACAAGCAAGGT
CGATCCTTTGTCGTTGTTGGATAAAATCACCAAGAACGGAATATCCCAAGATGTGTGTATCAACGATTTGGAACCTCCTGCTTTCGAGGTCCTCCCGTCTCTTAGAAGAT
TGAAACAGCGTATAATTTCTTCCAGCCGTGGCGAATTCGACGCTGTGTTCATGTCTGGGAGTGGTAGCACAATAGTAGGAATTGGGTCCCCAGATCCTCCAGGCTTCATA
TATAACGACGATGAATTTCAGGACGTATTTTTGGCAGAGGCAAACTTTCTCACCCGTGAAGCAAATCAATGGTATCGAGAGCCTGCATCGGCATCAGCTCGTAGTCCGCC
TTCCGAGCATCCTGAATTGGCTAGATAGCTTGCTCATTGGATGAAATAGAGGTTCAGCTTATGTTCCTTGTTCTAGTGAGGATTCTTGTATGACAGTTAAACTACATCCT
CCATTGATAAACTTGGGAAGGGATCTTGAGCTCTTTTTTCTATGCCAAAAGAGAGTTATTTAAGAAACTTATGTGATTTCCATGACCATTGCTAGAACATTTTCGTAAGT
TAGAGAGCTCGCTAACG
Protein sequence
MFSSMASCNIPCSSGFQFRSISFRRNFVFNGPHGSLAFASGLKHQKNPLVQRVTTCNSAASKQQFEIVYDPDERINKLADEVDRDAPLSRLTLFSPCKINVFLRITEKRE
DGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDKFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQEWSSE
IGSDIPFFFSEGAAFCTGRGEVVQNLPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSKVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLRRLKQRIISSSRGE
FDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASARSPPSEHPELAR
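
The protein sequence above corresponds to the CDS sequence read codon by codon. As a hedged sketch, the following Python translates a CDS with the standard genetic code (NCBI translation table 1); using the standard table for this nuclear gene is an assumption, and the function name `translate` is illustrative. Only the first 30 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT input
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 15 nucleotides of the CDS sequence listed above.
cds_start = "ATGTTCTCATCAATGGCTTCCTGTAACATC"
cds_end = "GAATTGGCTAGATAG"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Joining the CDS lines into one string and passing it to `translate` gives a simple consistency check against the listed protein sequence.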