; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg18023 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg18023
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase
Genome locationCarg_Chr11:3765757..3770987
RNA-Seq ExpressionCarg18023
SyntenyCarg18023
Gene Ontology termsGO:0016114 - terpenoid biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0050515 - 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase activity (molecular function)
InterPro domainsIPR004424 - 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
IPR006204 - GHMP kinase N-terminal domain
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR036554 - GHMP kinase, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7022034.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic [Cucurbita argyrosperma subsp. argyrosperma]3.1e-217100Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQQWSRTFMFSLLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSL
        QFSGCLATEKDLQQWSRTFMFSLLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSL
Subjt:  QFSGCLATEKDLQQWSRTFMFSLLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSL

Query:  KRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA
        KRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA
Subjt:  KRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA

XP_022933435.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like isoform X1 [Cucurbita moschata]1.0e-20794.75Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC
        NQFSGCLATEKDLQQWS        F FS          +FVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA
        INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA
Subjt:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA

XP_022933436.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like isoform X2 [Cucurbita moschata]4.1e-20994.99Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQQWS        F FS          +FVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA
        NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA
Subjt:  NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA

XP_022967572.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like isoform X1 [Cucurbita maxima]1.7e-19991.73Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
        MAS NISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKN LIQKAIRCNSTASKQQVEIVYDADERINKLA+EVDR+APLSRLTLFSPCKINVF
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRITKKREDGYHDLASLFHVI+LGDTIKFSLSP KKDRLSTNVSGVPLDDRNL IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC
        NQFSGCLATEKDLQ+WS        F FS          +FVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAA
        INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEF DVFLAEANFLTREANQWYREPATASACS  SE R+SAA
Subjt:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAA

XP_022967582.1 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic-like isoform X2 [Cucurbita maxima]6.9e-20191.96Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
        MAS NISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKN LIQKAIRCNSTASKQQVEIVYDADERINKLA+EVDR+APLSRLTLFSPCKINVF
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRITKKREDGYHDLASLFHVI+LGDTIKFSLSP KKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQ+WS        F FS          +FVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAA
        NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEF DVFLAEANFLTREANQWYREPATASACS  SE R+SAA
Subjt:  NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAA

TrEMBL top hitse value%identityAlignment
A0A6J1D320 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase1.2e-18283.92Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQK-AIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINV
        MA C I F+S+L+FP++S RKNFA +S G  GSF F SRSK++K+ LIQK AI CNSTASKQQVEIVY+ DERINKLADEVDRDAPLSRLTLFSPCKINV
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQK-AIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINV

Query:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDD+NLIIKALNLYRKKTG + FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  FLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC
        N+FSGCLATEKDLQ+WS        F FS          + VQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLD+TS +DPLSLLD+ITKNGISQDVC
Subjt:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESA
        INDLEPPAF+VLPSLKRLKQRI+SASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYND+EFQDVFLAEANFLTREANQWY+EPA+ASACS PSE  +SA
Subjt:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESA

A0A6J1EZ14 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase2.0e-20994.99Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQQWS        F FS          +FVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA
        NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA
Subjt:  NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA

A0A6J1F4W1 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase4.8e-20894.75Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
        MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC
        NQFSGCLATEKDLQQWS        F FS          +FVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA
        INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA
Subjt:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA

A0A6J1HR69 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase8.3e-20091.73Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
        MAS NISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKN LIQKAIRCNSTASKQQVEIVYDADERINKLA+EVDR+APLSRLTLFSPCKINVF
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRITKKREDGYHDLASLFHVI+LGDTIKFSLSP KKDRLSTNVSGVPLDDRNL IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNL-IIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC
        NQFSGCLATEKDLQ+WS        F FS          +FVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC
Subjt:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAA
        INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEF DVFLAEANFLTREANQWYREPATASACS  SE R+SAA
Subjt:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAA

A0A6J1HR79 4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase3.4e-20191.96Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
        MAS NISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKN LIQKAIRCNSTASKQQVEIVYDADERINKLA+EVDR+APLSRLTLFSPCKINVF
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
        LRITKKREDGYHDLASLFHVI+LGDTIKFSLSP KKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAAN

Query:  QFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI
        QFSGCLATEKDLQ+WS        F FS          +FVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI
Subjt:  QFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCI

Query:  NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAA
        NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEF DVFLAEANFLTREANQWYREPATASACS  SE R+SAA
Subjt:  NDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAA

SwissProt top hitse value%identityAlignment
O81014 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic2.2e-14165.32Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
        MA+ +  F S L F + S  K  +  SF P+               L++  +  +  AS++QVEIV+D DER+NK+ D+VD++APLSRL LFSPCKINVF
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRIT KREDG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTG ++FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC
        N+ +G L TE +LQ WS        F FS          + VQ++PPP PLD+PMVLIKP+EACSTAEVYKRLRLD+TS+++PL+LL+ +T NG+SQ +C
Subjt:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERR
        +NDLEPPAF VLPSLKRLKQRI+++ RGE+DAVFMSGSGSTI+GIGSPDPP FIY+D E+++VFL+EANF+TREAN+WY+EPA+A+A ++ +E R
Subjt:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERR

P56848 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic3.6e-13664.43Show/hide
Query:  SFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK
        S++S+  F S +        SF P GS +F  + +  + + I +A   + T  + Q+E+VYD + ++NKLADEVDR+A +SRLTLFSPCKINVFLRIT K
Subjt:  SFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKK

Query:  REDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC
        REDG+HDLASLFHVISLGD IKFSLSPSK      TNV GVPLD++NLIIKALNL+RKKTG D+ FWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC
Subjt:  REDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC

Query:  LATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCINDLEP
        +ATEKDLQ+WS        F FS          + V++IPPPVP D+ MVL+KPQEAC T EVYKRLRLD+TSD+DPL LL+KI+K GISQDVC+NDLEP
Subjt:  LATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCINDLEP

Query:  PAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREP-ATASACSTPSE
        PAFEV+PSLKRLKQRI +A R ++DAVFMSGSGSTIVG+GSPDPP F+Y+ +E++++F +EA F+TR ANQWY EP +T  + S P +
Subjt:  PAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREP-ATASACSTPSE

P93841 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic/chromoplastic (Fragment)2.9e-14170.41Show/hide
Query:  PRGSFNFASRSKHQKN-LLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIK
        P GS  F    + ++N  +I KA    S  SK+QVEI Y+ +E+ NKLADEVDR+A LSRLTLFSPCKINVFLRIT KR+DGYHDLASLFHVISLGD IK
Subjt:  PRGSFNFASRSKHQKN-LLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIK

Query:  FSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQQWSR------TFMF
        FSLSPSK KDRLSTNV+GVPLD+RNLIIKALNLYRKKTG D +FWIHLDKKVPTGAGLGGGSSNAAT LWAANQFSGC+ATEK+LQ+WS        F F
Subjt:  FSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQQWSR------TFMF

Query:  S--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIVSASRG
        S          + VQ+IP P+P D+PMVLIKPQ+ACSTAEVYKR +LD +S VDPLSLL+KI+ +GISQDVC+NDLEPPAFEVLPSLKRLKQR+++A RG
Subjt:  S--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIVSASRG

Query:  EFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSE
        ++DAVFMSGSGSTIVG+GSPDPP F+Y+D E++DVFL+EA+F+TR AN+WY EP + S      E
Subjt:  EFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSE

Q6MAT6 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase7.3e-4438.62Show/hide
Query:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGG
        + LFSP KIN+FL++  KR DGYH+L+SLF  IS GD + F       D L+ +   +P DD NL++KA+ L+R KTG D    IHLDK++P+ AGLGGG
Subjt:  LTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGG

Query:  SSNAATALWAANQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDK
        SSNAAT LWA NQ +G + T ++L QW         F FS          + V ++ P     +   ++KP    ST EVYK L   + ++       + 
Subjt:  SSNAATALWAANQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDK

Query:  ITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWY
               +    NDLE  AFE+ P LK LK  ++S+    FD V MSGSGS+   IG    P                A F+ R +N+WY
Subjt:  ITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWY

Q8S2G0 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase, chloroplastic5.4e-13268.64Show/hide
Query:  IRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDD
        +  ++   ++QVE+ YD   + NKLAD++D++A ++RL LFSPCKINVFLRIT KR DG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV+GVP+D+
Subjt:  IRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDD

Query:  RNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPL
         NLIIKALNLYRKKTG D FFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGC+A+EK+LQ+WS        F FS          + V++I  P+P 
Subjt:  RNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPL

Query:  DVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPP
        ++PMVL+KP EACSTAEVYKRLRL+ TS  DPL LL +IT+NGISQD C+NDLEPPAFEVLPSLKRLK+RI++A+RG++DAVFMSGSGSTIVGIGSPDPP
Subjt:  DVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPP

Query:  GFIYNDNEFQDVFLAEANFLTREANQWYREPATASACS
         F+Y+D++++D F++EA FLTR  N+WYREP ++   S
Subjt:  GFIYNDNEFQDVFLAEANFLTREANQWYREPATASACS

Arabidopsis top hitse value%identityAlignment
AT2G26930.1 4-(cytidine 5'-phospho)-2-C-methyl-D-erithritol kinase1.6e-14265.32Show/hide
Query:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF
        MA+ +  F S L F + S  K  +  SF P+               L++  +  +  AS++QVEIV+D DER+NK+ D+VD++APLSRL LFSPCKINVF
Subjt:  MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVF

Query:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA
        LRIT KREDG+HDLASLFHVISLGDTIKFSLSPSK KDRLSTNV GVP+D RNLIIKALNLYRKKTG ++FFWIHLDKKVPTGAGLGGGSSNAATALWAA
Subjt:  LRITKKREDGYHDLASLFHVISLGDTIKFSLSPSK-KDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAA

Query:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC
        N+ +G L TE +LQ WS        F FS          + VQ++PPP PLD+PMVLIKP+EACSTAEVYKRLRLD+TS+++PL+LL+ +T NG+SQ +C
Subjt:  NQFSGCLATEKDLQQWSR------TFMFS--------LLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVC

Query:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERR
        +NDLEPPAF VLPSLKRLKQRI+++ RGE+DAVFMSGSGSTI+GIGSPDPP FIY+D E+++VFL+EANF+TREAN+WY+EPA+A+A ++ +E R
Subjt:  INDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGIGSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTGTAATATCTCTTTTAGTTCGCGGCTTCGATTTCCATCCGTTTCGCCTAGAAAGAATTTCGCCGTCGATTCATTTGGGCCTCGCGGTTCATTTAATTTTGC
GTCGAGGTCGAAACACCAGAAGAATCTACTCATTCAGAAGGCCATTAGATGTAATTCCACCGCTTCTAAACAACAAGTTGAGATTGTTTACGATGCTGATGAAAGGATAA
ACAAGTTAGCTGACGAAGTGGACCGAGATGCTCCTCTTTCGAGGCTTACCCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACTAAGAAGAGGGAAGATGGA
TATCATGATTTGGCATCTCTCTTTCACGTGATAAGCTTAGGGGATACAATTAAATTCTCTTTGTCGCCATCGAAGAAGGATCGCCTTTCGACCAATGTATCGGGGGTACC
CCTTGATGATAGAAATTTGATTATCAAGGCTCTTAACCTCTACAGGAAAAAGACTGGCTGTGACCAATTTTTCTGGATCCATCTCGACAAAAAGGTACCAACTGGAGCAG
GGCTTGGTGGAGGAAGCAGTAATGCTGCAACTGCACTGTGGGCAGCCAATCAGTTCAGTGGATGTCTTGCTACTGAAAAGGATCTTCAACAATGGTCAAGAACTTTTATG
TTTTCTCTTTTACAGTTCGTGCAGAATATTCCTCCTCCGGTACCCTTGGACGTTCCGATGGTTCTCATAAAGCCCCAGGAAGCATGCTCTACAGCAGAAGTTTATAAGCG
CTTACGGTTGGATCGAACGAGCGATGTCGATCCTTTATCATTGTTGGATAAAATCACAAAGAATGGAATATCACAAGATGTGTGTATCAATGATTTGGAACCTCCTGCTT
TTGAGGTCCTCCCGTCGCTTAAAAGATTGAAACAGCGAATAGTCTCTGCCAGCCGTGGCGAGTTTGATGCCGTTTTCATGTCCGGGAGTGGTAGCACAATAGTAGGAATT
GGGTCTCCAGATCCTCCAGGCTTCATCTATAACGACAACGAATTCCAGGACGTGTTTTTGGCAGAGGCCAACTTTCTCACCCGTGAAGCGAATCAATGGTATCGAGAACC
CGCTACAGCATCCGCGTGTAGTACACCGTCCGAGCGTCGCGAATCTGCTGCTGCATAA
mRNA sequenceShow/hide mRNA sequence
CACTCCAAAGTTTTCATATAAGGGAAATGACATGGACGCGATTTTCACGTTTTTCACAGATTTCTTACAATAATCTCTCGGCCCACAATGCATAAATCTTCATCTTCGTC
TTAATCCGCTTCGAAATTATTCGATTTCTCTTATTTGATCATCTGGGACTGTACCAGTGAAGAGTTTCTCAGTCATCAATGGCTTCCTGTAATATCTCTTTTAGTTCGCG
GCTTCGATTTCCATCCGTTTCGCCTAGAAAGAATTTCGCCGTCGATTCATTTGGGCCTCGCGGTTCATTTAATTTTGCGTCGAGGTCGAAACACCAGAAGAATCTACTCA
TTCAGAAGGCCATTAGATGTAATTCCACCGCTTCTAAACAACAAGTTGAGATTGTTTACGATGCTGATGAAAGGATAAACAAGTTAGCTGACGAAGTGGACCGAGATGCT
CCTCTTTCGAGGCTTACCCTGTTCTCACCTTGCAAGATTAATGTTTTCTTGAGAATAACTAAGAAGAGGGAAGATGGATATCATGATTTGGCATCTCTCTTTCACGTGAT
AAGCTTAGGGGATACAATTAAATTCTCTTTGTCGCCATCGAAGAAGGATCGCCTTTCGACCAATGTATCGGGGGTACCCCTTGATGATAGAAATTTGATTATCAAGGCTC
TTAACCTCTACAGGAAAAAGACTGGCTGTGACCAATTTTTCTGGATCCATCTCGACAAAAAGGTACCAACTGGAGCAGGGCTTGGTGGAGGAAGCAGTAATGCTGCAACT
GCACTGTGGGCAGCCAATCAGTTCAGTGGATGTCTTGCTACTGAAAAGGATCTTCAACAATGGTCAAGAACTTTTATGTTTTCTCTTTTACAGTTCGTGCAGAATATTCC
TCCTCCGGTACCCTTGGACGTTCCGATGGTTCTCATAAAGCCCCAGGAAGCATGCTCTACAGCAGAAGTTTATAAGCGCTTACGGTTGGATCGAACGAGCGATGTCGATC
CTTTATCATTGTTGGATAAAATCACAAAGAATGGAATATCACAAGATGTGTGTATCAATGATTTGGAACCTCCTGCTTTTGAGGTCCTCCCGTCGCTTAAAAGATTGAAA
CAGCGAATAGTCTCTGCCAGCCGTGGCGAGTTTGATGCCGTTTTCATGTCCGGGAGTGGTAGCACAATAGTAGGAATTGGGTCTCCAGATCCTCCAGGCTTCATCTATAA
CGACAACGAATTCCAGGACGTGTTTTTGGCAGAGGCCAACTTTCTCACCCGTGAAGCGAATCAATGGTATCGAGAACCCGCTACAGCATCCGCGTGTAGTACACCGTCCG
AGCGTCGCGAATCTGCTGCTGCATAACTGGCTCCTCCCTCCTAGGATGAAATAGAGGTTCAGCTTATGATTCCTGTAGACTGAGAATTCCTGTAGGACTTTAAACTACAT
CCATTGTTAACTTGCGAATTGATCTCGAGTTCTCATTTGTGACCTTAACTGGCTTTAATCTCGGTCTAGGAAAAAACGAGCGTGAATATAAACATTAATATTAATAATAA
TAATGCAAACGTTGAATTGGTATCGATCATCATTACTAGAACGCTTAGCTCAATGAGAAACATTCACCATGAAGAATCTAAAAGAAGGAACAAAAAAGCTATAAAGATTC
AATCAAACTCACTTT
Protein sequenceShow/hide protein sequence
MASCNISFSSRLRFPSVSPRKNFAVDSFGPRGSFNFASRSKHQKNLLIQKAIRCNSTASKQQVEIVYDADERINKLADEVDRDAPLSRLTLFSPCKINVFLRITKKREDG
YHDLASLFHVISLGDTIKFSLSPSKKDRLSTNVSGVPLDDRNLIIKALNLYRKKTGCDQFFWIHLDKKVPTGAGLGGGSSNAATALWAANQFSGCLATEKDLQQWSRTFM
FSLLQFVQNIPPPVPLDVPMVLIKPQEACSTAEVYKRLRLDRTSDVDPLSLLDKITKNGISQDVCINDLEPPAFEVLPSLKRLKQRIVSASRGEFDAVFMSGSGSTIVGI
GSPDPPGFIYNDNEFQDVFLAEANFLTREANQWYREPATASACSTPSERRESAAA