; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0006982 (gene) of Chayote v1 genome

Gene IDSed0006982
OrganismSechium edule (Chayote v1)
Description4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase
Genome locationLG08:9892899..9897512
RNA-Seq ExpressionSed0006982
SyntenySed0006982
Gene Ontology termsGO:0016114 - terpenoid biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0050515 - 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity (molecular function)
InterPro domainsIPR004424 - 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
IPR006204 - GHMP kinase N-terminal domain
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR036554 - GHMP kinase, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033454.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic [Cucurbita argyrosperma subsp. argyrosperma]4.5e-20390.13Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR
        MA+CN P SS  QF S+SFR+NF   GPH S A AS  KHQKNPL+ R   CNS ASKQQ EIVY+PDERINKLADEVD+DAPLSRLTLFSPCKINVFLR
Subjt:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR

Query:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF
        ITEKR+DGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD++FWIHLDKKVPTGAGLGGGSSNAATALWAANQF
Subjt:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF

Query:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND
        SGCLATE DLQEWS EIGSDIPFFFSEGAAFCTGRGEVVQN+PPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS +DP SLLDKITKNGISQDVCIND
Subjt:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND

Query:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA
        LEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEH E A
Subjt:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA

XP_022147682.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic [Momordica charantia]1.8e-19988.47Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFA--SLGPHCSFASASRFKHQKNPLLPR-PIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINV
        MA C  P +S LQF ++SFRKNFA  S+G H SFA  SR K++K+ L+ +  I CNSTASKQQVEIVYNPDERINKLADEVD+DAPLSRLTLFSPCKINV
Subjt:  MAACNFPLSSHLQFHSVSFRKNFA--SLGPHCSFASASRFKHQKNPLLPR-PIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINV

Query:  FLRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRIT+KR+DGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTGS+ +FWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVC
        N+FSGCLATE DLQEWS EIGSDIPFFFSEGAA+CTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS IDP SLLD+ITKNGISQDVC
Subjt:  NQFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPAT
        INDLEPPAF+VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWY+EPASASACSPPSEH + AT
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPAT

XP_022960704.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like [Cucurbita moschata]7.6e-20389.87Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR
        MA+CN P SS  QF S+SFR+NF   GPH S A AS  KHQKNPL+ R   CNS ASKQQ EIVY+PDERINKLADEVD+DAPLSRLTLFSPCKINVFLR
Subjt:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR

Query:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF
        ITEKR+DGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD++FWIHLDKKVPTGAGLGGGSSNAATALWAANQF
Subjt:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF

Query:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND
        SGC+ATE DLQEWS EIGSDIPFFFSEGAAFCTGRGEVVQN+PPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS +DP SLLDKITKNGISQDVCIND
Subjt:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND

Query:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA
        LEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEH E A
Subjt:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA

XP_022988144.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like [Cucurbita maxima]1.6e-20089.37Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR
        MA+CN P SS  QF S+SFR+N A  GPH S A AS  KHQKNPL+ R   CNS ASKQQ EIVY+PDERINKLADEVD+DAPLSRLTLFSPCKINVFLR
Subjt:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR

Query:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF
        IT KR+DGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD++FWIHLDKKVPTGAGLGGGSSNAATALWAANQF
Subjt:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF

Query:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND
        SGC+ATE DLQEWS EIGSDIPFFFSEGAAFCTGRGEVVQN+PPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS +DP SLLDKITKNGISQDVCIND
Subjt:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND

Query:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA
        LEPPAFEVLPSLKRLKQRII +SR EFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEH E A
Subjt:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA

XP_023515762.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic [Cucurbita pepo subsp. pepo]4.5e-20389.87Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR
        MA+CN P SS  QF S+SFR+NF   GPH S A AS  KHQKNPL+ R   CNS ASKQQ EIVY+PDERINKLADEVD+DAPLSRLTLFSPCKINVFLR
Subjt:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR

Query:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF
        ITEKR+DGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD++FWIHLDKKVPTGAGLGGGSSNAATALWAANQF
Subjt:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF

Query:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND
        SGC+ATE DLQEWS EIGSDIPFFFSEGAAFCTGRGEVVQN+PPPVPLD+PMVLIKPQEACSTAEVYKRLRLDQTS +DP SLLDKITKNGISQDVCIND
Subjt:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND

Query:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA
        LEPPAFEVLPSLKRLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEH E A
Subjt:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA

TrEMBL top hitse value%identityAlignment
A0A5A7UKM7 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase1.2e-19887.94Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFA--SLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVF
        MA+C+ P SS LQFHS+SFRKNFA  S G H S A ASR K QK       I CNSTASKQQ EIVY+PDERINKLADEVD+DAPLSRLTLFSPCKINVF
Subjt:  MAACNFPLSSHLQFHSVSFRKNFA--SLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVF

Query:  LRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KR+DGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD++FWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCI
        QFSGCLATE DLQEWSGEIGSDIPFFFS+GAAFCTGRGE+VQN+PPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS +DP SLLDKITKNGISQDVCI
Subjt:  QFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPAT
        NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY+D+EF+DVFLAEANFLTRE N+WY+EPAS+SACSPPSEH E ++
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPAT

A0A6J1D320 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase8.5e-20088.47Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFA--SLGPHCSFASASRFKHQKNPLLPR-PIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINV
        MA C  P +S LQF ++SFRKNFA  S+G H SFA  SR K++K+ L+ +  I CNSTASKQQVEIVYNPDERINKLADEVD+DAPLSRLTLFSPCKINV
Subjt:  MAACNFPLSSHLQFHSVSFRKNFA--SLGPHCSFASASRFKHQKNPLLPR-PIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINV

Query:  FLRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRIT+KR+DGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTGS+ +FWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVC
        N+FSGCLATE DLQEWS EIGSDIPFFFSEGAA+CTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS IDP SLLD+ITKNGISQDVC
Subjt:  NQFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPAT
        INDLEPPAF+VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWY+EPASASACSPPSEH + AT
Subjt:  INDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPAT

A0A6J1EZ14 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase1.2e-19888.92Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFA--SLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVF
        MA+CN   SS L+F SVS RKNFA  S GP  SF  ASR KHQKN L+ + I CNSTASKQQVEIVY+ DERINKLADEVD+DAPLSRLTLFSPCKINVF
Subjt:  MAACNFPLSSHLQFHSVSFRKNFA--SLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVF

Query:  LRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRIT+KR+DGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTG DQ+FWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCI
        QFSGCLATE DLQ+WS EIGSDIPFFFSEGAAFCTGRGE VQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLD+TS++DP SLLDKITKNGISQDVCI
Subjt:  QFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA
        NDLEPPAFEVLPSLKRLKQRI+SASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYND+EFQDVFLAEANFLTREANQWYREPA+ASACS PSE  E A
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA

A0A6J1HBW0 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase3.7e-20389.87Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR
        MA+CN P SS  QF S+SFR+NF   GPH S A AS  KHQKNPL+ R   CNS ASKQQ EIVY+PDERINKLADEVD+DAPLSRLTLFSPCKINVFLR
Subjt:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR

Query:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF
        ITEKR+DGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD++FWIHLDKKVPTGAGLGGGSSNAATALWAANQF
Subjt:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF

Query:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND
        SGC+ATE DLQEWS EIGSDIPFFFSEGAAFCTGRGEVVQN+PPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS +DP SLLDKITKNGISQDVCIND
Subjt:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND

Query:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA
        LEPPAFEVLPSL+RLKQRIIS+SRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEH E A
Subjt:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA

A0A6J1JIS3 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase7.7e-20189.37Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR
        MA+CN P SS  QF S+SFR+N A  GPH S A AS  KHQKNPL+ R   CNS ASKQQ EIVY+PDERINKLADEVD+DAPLSRLTLFSPCKINVFLR
Subjt:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLR

Query:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF
        IT KR+DGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSD++FWIHLDKKVPTGAGLGGGSSNAATALWAANQF
Subjt:  ITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQF

Query:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND
        SGC+ATE DLQEWS EIGSDIPFFFSEGAAFCTGRGEVVQN+PPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTS +DP SLLDKITKNGISQDVCIND
Subjt:  SGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCIND

Query:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA
        LEPPAFEVLPSLKRLKQRII +SR EFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASA SPPSEH E A
Subjt:  LEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPA

SwissProt top hitse value%identityAlignment
O81014 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic3.4e-16171.94Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNST-ASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFL
        MA  + P  S L F   SF+            +S+S F    +P L RP++  S  AS++QVEIV++PDER+NK+ D+VD++APLSRL LFSPCKINVFL
Subjt:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNST-ASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFL

Query:  RITEKRKDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        RIT KR+DG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS+++FWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  RITEKRKDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCI
        + +G L TEN+LQ+WS EIGSDIPFFFS GAA+CTGRGE+VQ++PPP PLD+PMVLIKP+EACSTAEVYKRLRLDQTSNI+P +LL+ +T NG+SQ +C+
Subjt:  QFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSE
        NDLEPPAF VLPSLKRLKQRII++ RGE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +  +E
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSE

P56848 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic5.4e-15170.59Show/hide
Query:  SHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPL-LPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLRITEKRKDG
        SH    + +    F+S  P+ S  S+ R K Q + + + R    + T  + Q+E+VY+ + ++NKLADEVD++A +SRLTLFSPCKINVFLRIT KR+DG
Subjt:  SHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPL-LPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLRITEKRKDG

Query:  YHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATE
        +HDLASLFHVISLGD IKFSLSPSK      TNV GVPLD++NLIIKALNL+RKKTG+D++FWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+ATE
Subjt:  YHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATE

Query:  NDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCINDLEPPAFE
         DLQEWSGEIGSDIPFFFS GAA+CTGRGEVV++IPPPVP D+ MVL+KPQEAC T EVYKRLRLDQTS+IDP  LL+KI+K GISQDVC+NDLEPPAFE
Subjt:  NDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCINDLEPPAFE

Query:  VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPAS
        V+PSLKRLKQRI +A R ++DAVFMSGSGSTIVG+GSPDPP F+Y+ DE++++F +EA F+TR ANQWY EP S
Subjt:  VLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPAS

P93841 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic (Fragment)8.3e-16079.71Show/hide
Query:  STASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLI
        S  SK+QVEI YNP+E+ NKLADEVD++A LSRLTLFSPCKINVFLRIT KR DGYHDLASLFHVISLGD IKFSLSPSK KDRLSTNV+GVPLD+RNLI
Subjt:  STASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLI

Query:  IKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPM
        IKALNLYRKKTG+D YFWIHLDKKVPTGAGLGGGSSNAAT LWAANQFSGC+ATE +LQEWSGEIGSDIPFFFS GAA+CTGRGEVVQ+IP P+P D+PM
Subjt:  IKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPM

Query:  VLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY
        VLIKPQ+ACSTAEVYKR +LD +S +DP SLL+KI+ +GISQDVC+NDLEPPAFEVLPSLKRLKQR+I+A RG++DAVFMSGSGSTIVG+GSPDPP F+Y
Subjt:  VLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIY

Query:  NDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHS
        +D+E++DVFL+EA+F+TR AN+WY EP S S      E S
Subjt:  NDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHS

Q6MAT6 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase4.4e-6044.86Show/hide
Query:  LTLFSPCKINVFLRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGG
        + LFSP KIN+FL++  KR DGYH+L+SLF  IS GD + F       D L+ +   +P DD NL++KA+ L+R KTG D +  IHLDK++P+ AGLGGG
Subjt:  LTLFSPCKINVFLRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGG

Query:  SSNAATALWAANQFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQ--TSNIDPSSLL
        SSNAAT LWA NQ +G + T  +L +W  EIG+DIPFFFS+G A CTGRGE V ++ P     +   ++KP    ST EVYK L   Q   +N D +S  
Subjt:  SSNAATALWAANQFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQ--TSNIDPSSLL

Query:  DKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWY
        +K            NDLE  AFE+ P LK LK  ++S+    FD V MSGSGS+   IG    P                A F+ R +N+WY
Subjt:  DKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWY

Q8S2G0 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic1.3e-14974.26Show/hide
Query:  IICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDD
        +  ++   ++QVE+ Y+   + NKLAD++DQ+A ++RL LFSPCKINVFLRIT KR DG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV+GVP+D+
Subjt:  IICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLRITEKRKDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDD

Query:  RNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPL
         NLIIKALNLYRKKTG+D +FWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+A+E +LQEWSGEIGSDIPFFFS+GAA+CTGRGE+V++I  P+P 
Subjt:  RNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPL

Query:  DVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPP
        ++PMVL+KP EACSTAEVYKRLRL+ TS  DP  LL +IT+NGISQD C+NDLEPPAFEVLPSLKRLK+RII+A+RG++DAVFMSGSGSTIVGIGSPDPP
Subjt:  DVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPP

Query:  GFIYNDDEFQDVFLAEANFLTREANQWYREPASASACS
         F+Y+DD+++D F++EA FLTR  N+WYREP S+   S
Subjt:  GFIYNDDEFQDVFLAEANFLTREANQWYREPASASACS

Arabidopsis top hitse value%identityAlignment
AT2G26930.1 4-(cytidine 5'-phospho)-2-C-methyl-D-erithritol kinase2.4e-16271.94Show/hide
Query:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNST-ASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFL
        MA  + P  S L F   SF+            +S+S F    +P L RP++  S  AS++QVEIV++PDER+NK+ D+VD++APLSRL LFSPCKINVFL
Subjt:  MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNST-ASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFL

Query:  RITEKRKDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        RIT KR+DG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTGS+++FWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  RITEKRKDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCI
        + +G L TEN+LQ+WS EIGSDIPFFFS GAA+CTGRGE+VQ++PPP PLD+PMVLIKP+EACSTAEVYKRLRLDQTSNI+P +LL+ +T NG+SQ +C+
Subjt:  QFSGCLATENDLQEWSGEIGSDIPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSE
        NDLEPPAF VLPSLKRLKQRII++ RGE+DAVFMSGSGSTI+GIGSPDPP FIY+D+E+++VFL+EANF+TREAN+WY+EPASA+A +  +E
Subjt:  NDLEPPAFEVLPSLKRLKQRIISASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCCTGTAACTTTCCTCTTAGTTCGCATCTCCAATTTCATTCAGTTTCATTTAGAAAGAATTTCGCCTCACTTGGGCCTCACTGTTCGTTCGCTTCTGCCTCGAG
GTTCAAACACCAGAAGAATCCACTCCTTCCGAGACCCATAATATGCAATTCCACCGCTTCCAAACAACAAGTTGAGATAGTTTATAATCCTGATGAAAGGATAAACAAGT
TAGCTGATGAAGTCGACCAGGATGCTCCTCTTTCGAGGCTTACTCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACTGAGAAGAGGAAAGATGGATATCAT
GACTTGGCATCTCTCTTTCATGTGATTAGCTTAGGGGATACGATTAAATTTTCTTTGTCGCCATCAAAGAAGGATCGCCTTTCCACCAATGTATCAGGGGTACCCCTTGA
TGATAGAAATTTGATTATCAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACCAATATTTCTGGATTCATCTTGACAAAAAGGTACCGACTGGCGCTGGGCTTG
GTGGAGGAAGCAGTAATGCTGCAACTGCACTGTGGGCAGCGAATCAGTTCAGTGGATGTCTTGCTACTGAAAATGACCTTCAAGAATGGTCAGGGGAGATAGGATCTGAT
ATTCCCTTCTTTTTCTCAGAAGGGGCGGCCTTCTGCACCGGAAGAGGTGAGGTTGTGCAGAATATTCCACCTCCAGTACCCTTGGACGTTCCAATGGTTCTCATAAAGCC
CCAGGAAGCATGCTCTACAGCAGAAGTTTATAAGCGCTTGCGGTTGGATCAAACAAGCAACATTGATCCTTCATCGTTGTTGGATAAAATCACAAAGAATGGAATATCCC
AAGATGTATGTATCAACGATTTGGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAGAGATTGAAACAGCGTATTATCTCTGCCAGCCGCGGAGAGTTCGATGCTGTT
TTTATGTCGGGAAGTGGTAGCACAATAGTAGGGATTGGGTCCCCAGATCCTCCAGGCTTCATATATAATGACGATGAATTCCAGGACGTGTTTTTGGCAGAGGCCAACTT
TCTCACCCGTGAAGCAAATCAATGGTATCGAGAACCCGCTTCGGCATCTGCTTGTAGCCCGCCTTCCGAACACTCTGAACCAGCTACATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCCTGTAACTTTCCTCTTAGTTCGCATCTCCAATTTCATTCAGTTTCATTTAGAAAGAATTTCGCCTCACTTGGGCCTCACTGTTCGTTCGCTTCTGCCTCGAG
GTTCAAACACCAGAAGAATCCACTCCTTCCGAGACCCATAATATGCAATTCCACCGCTTCCAAACAACAAGTTGAGATAGTTTATAATCCTGATGAAAGGATAAACAAGT
TAGCTGATGAAGTCGACCAGGATGCTCCTCTTTCGAGGCTTACTCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACTGAGAAGAGGAAAGATGGATATCAT
GACTTGGCATCTCTCTTTCATGTGATTAGCTTAGGGGATACGATTAAATTTTCTTTGTCGCCATCAAAGAAGGATCGCCTTTCCACCAATGTATCAGGGGTACCCCTTGA
TGATAGAAATTTGATTATCAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCAGTGACCAATATTTCTGGATTCATCTTGACAAAAAGGTACCGACTGGCGCTGGGCTTG
GTGGAGGAAGCAGTAATGCTGCAACTGCACTGTGGGCAGCGAATCAGTTCAGTGGATGTCTTGCTACTGAAAATGACCTTCAAGAATGGTCAGGGGAGATAGGATCTGAT
ATTCCCTTCTTTTTCTCAGAAGGGGCGGCCTTCTGCACCGGAAGAGGTGAGGTTGTGCAGAATATTCCACCTCCAGTACCCTTGGACGTTCCAATGGTTCTCATAAAGCC
CCAGGAAGCATGCTCTACAGCAGAAGTTTATAAGCGCTTGCGGTTGGATCAAACAAGCAACATTGATCCTTCATCGTTGTTGGATAAAATCACAAAGAATGGAATATCCC
AAGATGTATGTATCAACGATTTGGAACCTCCTGCTTTTGAGGTCCTCCCATCTCTTAAGAGATTGAAACAGCGTATTATCTCTGCCAGCCGCGGAGAGTTCGATGCTGTT
TTTATGTCGGGAAGTGGTAGCACAATAGTAGGGATTGGGTCCCCAGATCCTCCAGGCTTCATATATAATGACGATGAATTCCAGGACGTGTTTTTGGCAGAGGCCAACTT
TCTCACCCGTGAAGCAAATCAATGGTATCGAGAACCCGCTTCGGCATCTGCTTGTAGCCCGCCTTCCGAACACTCTGAACCAGCTACATAA
Protein sequenceShow/hide protein sequence
MAACNFPLSSHLQFHSVSFRKNFASLGPHCSFASASRFKHQKNPLLPRPIICNSTASKQQVEIVYNPDERINKLADEVDQDAPLSRLTLFSPCKINVFLRITEKRKDGYH
DLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGSDQYFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATENDLQEWSGEIGSD
IPFFFSEGAAFCTGRGEVVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDQTSNIDPSSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIISASRGEFDAV
FMSGSGSTIVGIGSPDPPGFIYNDDEFQDVFLAEANFLTREANQWYREPASASACSPPSEHSEPAT