; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0005056 (gene) of Chayote v1 genome

Gene IDSed0005056
OrganismSechium edule (Chayote v1)
DescriptionCholine monooxygenase, chloroplastic
Genome locationLG08:2814217..2821365
RNA-Seq ExpressionSed0005056
SyntenySed0005056
Gene Ontology termsGO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139149.1 choline monooxygenase, chloroplastic isoform X1 [Cucumis sativus]1.2e-19579.71Show/hide
Query:  MAMIRKHILPHNFFQKP--CITRHSLNSRSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVE
        MAM+ KHI  H FFQ P      HS N RSP RISAAL  RN DS R +EA+KLV +FDPQIP EKA+TPPSSWY D SFFALEL+ VFYRGWQAVGYVE
Subjt:  MAMIRKHILPHNFFQKP--CITRHSLNSRSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVE

Query:  QLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILL
        QL+D HD FTGRLGNVE+VVC+DN+RKVRAFHNVCRHH SLLASG GKKSCFVCPYHGWT G+DG LLKATR +GIQNFD NDFGL+PLPVATWGPF+LL
Subjt:  QLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILL

Query:  NLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGE
        NLD KLSS+ DVDEDKV  EWLG CADVL LNGVDASLS+VCRREYTIECNWKVFCDN+LDGGYHVPYAHKGLASNLKL+SYSTELFETVSIQSCK  GE
Subjt:  NLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGE

Query:  SECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVP
        S+ D  GRLGSEALY F+YPNF+INRYGPWMDTNLVLPLGPRKCLVVF YFLEASFKND+S IQ SLEDSE VQNEDIILCE V      PAY  GRY P
Subjt:  SECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVP

Query:  SVEKAVHHFHRFLHLSLTK
        SVE A+HHFHR LH +LTK
Subjt:  SVEKAVHHFHRFLHLSLTK

XP_022965039.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita moschata]3.8e-19779.05Show/hide
Query:  MAMIRKHILPHNFFQKPCITRHSLNS---RSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYV
        MA + KHI  H  F +P     +  S   RSP+RIS+ L  RN+DSH F+EA KLV EFDP+IP EKAVTPPSSWYTD SFFALELDRVF+RGWQAVGYV
Subjt:  MAMIRKHILPHNFFQKPCITRHSLNS---RSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYV

Query:  EQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFIL
        EQL+DPHD FTGRLGNVE+VVC+DN++KVRAFHNVCRHH SLLASG GKKSCFVCPYHGWT G+DG LLKATR NGIQNFDVNDFGL+PLPVATWGPF+L
Subjt:  EQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFIL

Query:  LNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVG
        LNLDEKLSSEL VDED+V HEWLG CAD+LSLNGVDASLSFVCRREYTIECNWKVFCDN+LDGGYHVPYAHKGLASNLKL+SYSTE+FE VSIQSCK  G
Subjt:  LNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVG

Query:  ESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYV
        ES+ D YGRLG EALY F+YPNF+INRYGPWMDTNLVLPLGPRKCLVVF YFLE  FKND S IQ+SLEDSE VQ EDIILCE V      PAY  GRY 
Subjt:  ESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYV

Query:  PSVEKAVHHFHRFLHLSLTK
        PSVE A+HHFHR LHL+LTK
Subjt:  PSVEKAVHHFHRFLHLSLTK

XP_022987330.1 choline monooxygenase, chloroplastic isoform X2 [Cucurbita maxima]5.5e-19678.57Show/hide
Query:  MAMIRKHILPHNFFQKPCITRHSLNS---RSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYV
        MA + KHI  H  F +P     +  S   RSP+RIS  L  RN+DSH F+EA KLV EFDP+IP EKAVTPPSSWYTD SFFALELDRVF+RGWQAVGYV
Subjt:  MAMIRKHILPHNFFQKPCITRHSLNS---RSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYV

Query:  EQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFIL
        EQL+DPHD FTGRLGNVE+VVC+DN++KVRAFHNVCRHH SLLASG GKKSCFVCPYHGWT G+DG LLKATR NGIQNFDVNDFGL+PLPVATWGPF+L
Subjt:  EQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFIL

Query:  LNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVG
        LN+DEKLSSEL VDEDKV +EWLG CAD+LSLNGVDASLSFVCRREYTIECNWKVFCDN+LDGGYHVPYAHKGLASNLKL+SYSTE+FE VSIQSCK  G
Subjt:  LNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVG

Query:  ESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYV
        ES+ D YGRLG EALY F+YPNF+INRYGPWMDTNLVLPLGP+KCLVVF YFLE  FKND S IQ+SLEDSE VQ EDIILCE V      PAY  GRY 
Subjt:  ESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYV

Query:  PSVEKAVHHFHRFLHLSLTK
        PSVE A+HHFHR LHL+LTK
Subjt:  PSVEKAVHHFHRFLHLSLTK

XP_023515603.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]9.4e-19678.81Show/hide
Query:  MAMIRKHILPHNFFQKPCITRHSLNS---RSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYV
        MA + KHI  H  F +P     +  S   RSP+RIS  L  RN+DSH F+EA KLV EFDP+IP EKAVTPPSSWYTD SFF LELDRVF+RGWQAVGYV
Subjt:  MAMIRKHILPHNFFQKPCITRHSLNS---RSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYV

Query:  EQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFIL
        EQL+DPHD FTGRLGNVE+VVC+DN++KVRAFHNVCRHH SLLASG GKKSCFVCPYHGWT G+DG LLKATR NGIQNFDVNDFGL+PLPVATWGPF+L
Subjt:  EQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFIL

Query:  LNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVG
        LNLDEKLSSEL VDEDKV +EWLG CAD+LSLNGVDASLSFVCRREYTIECNWKVFCDN+LDGGYHVPYAHKGLASNLKL+SYSTE+FE VSIQSCK  G
Subjt:  LNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVG

Query:  ESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYV
        ES+ D YGRLG EALY F+YPNF+INRYGPWMDTNLVLPLGPRKCLVVF YFLE  FKND S IQ+SLEDSE VQ EDIILCE V      PAY  GRY 
Subjt:  ESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYV

Query:  PSVEKAVHHFHRFLHLSLTK
        PSVE A+HHFHR LHL+LTK
Subjt:  PSVEKAVHHFHRFLHLSLTK

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida]6.3e-20080.91Show/hide
Query:  MAMIRKHILPHNFFQKPCIT--RHSLNSRSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVE
        MA + KHI  H FFQ P I+   HS N RSPSRISAAL  RN+DSH F+EAQKLV EFDP+IP EKAVTPPSSWY D SF+ALELDRVFYRGWQAVGYVE
Subjt:  MAMIRKHILPHNFFQKPCIT--RHSLNSRSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVE

Query:  QLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILL
        QL+DPHD FTGRLGNVE+VVC+DN+RKVRAFHNVCRHH SLLASG GK+SCFVCPYHGWT G+DG LLKATR NGIQNFD N+FGLIPLPVATWGPF+LL
Subjt:  QLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILL

Query:  NLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGE
        NLD KLSSELDVDEDKV  EWLG CAD+LSLNGVDASLS+VCRREYTIECNWKVFCDN+LDGGYHVPYAHKGLASNLKLDSYSTE+FETVSIQSCK  GE
Subjt:  NLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGE

Query:  SECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVP
        ++ + YGRLG EALY FIYPNF+INRYGPWMDTNLVLPLG RKCLVVF YFLEASFKND   IQ+SLEDSE VQ EDIILCE V      PAY  GRY P
Subjt:  SECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVP

Query:  SVEKAVHHFHRFLHLSLTK
        SVE A+HHFHR LHL+LTK
Subjt:  SVEKAVHHFHRFLHLSLTK

TrEMBL top hitse value%identityAlignment
A0A0A0LXX3 Choline monooxygenase, chloroplastic5.9e-19679.71Show/hide
Query:  MAMIRKHILPHNFFQKP--CITRHSLNSRSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVE
        MAM+ KHI  H FFQ P      HS N RSP RISAAL  RN DS R +EA+KLV +FDPQIP EKA+TPPSSWY D SFFALEL+ VFYRGWQAVGYVE
Subjt:  MAMIRKHILPHNFFQKP--CITRHSLNSRSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVE

Query:  QLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILL
        QL+D HD FTGRLGNVE+VVC+DN+RKVRAFHNVCRHH SLLASG GKKSCFVCPYHGWT G+DG LLKATR +GIQNFD NDFGL+PLPVATWGPF+LL
Subjt:  QLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILL

Query:  NLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGE
        NLD KLSS+ DVDEDKV  EWLG CADVL LNGVDASLS+VCRREYTIECNWKVFCDN+LDGGYHVPYAHKGLASNLKL+SYSTELFETVSIQSCK  GE
Subjt:  NLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGE

Query:  SECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVP
        S+ D  GRLGSEALY F+YPNF+INRYGPWMDTNLVLPLGPRKCLVVF YFLEASFKND+S IQ SLEDSE VQNEDIILCE V      PAY  GRY P
Subjt:  SECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVP

Query:  SVEKAVHHFHRFLHLSLTK
        SVE A+HHFHR LH +LTK
Subjt:  SVEKAVHHFHRFLHLSLTK

A0A5A7UY48 Choline monooxygenase, chloroplastic5.0e-19579Show/hide
Query:  MAMIRKHILPHNFFQKPCIT--RHSLNSRSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVE
        MA + KHI  H FFQ P I+   H  N RSPSRISA+L  R+ DS RF+EA+KLV +FDP+IP EKA+TPPSSWY D SFFALELDRVFYRGWQAVGYVE
Subjt:  MAMIRKHILPHNFFQKPCIT--RHSLNSRSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVE

Query:  QLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILL
        QL+D HD FTGRLGNVE+VVC+DN+RKVRAFHNVCRHH SLLA+G GKKSCFVCPYHGWT G+DG LLKATR NGIQNF+ NDFGL+PLPVA WGPF+LL
Subjt:  QLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILL

Query:  NLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGE
        NLD KLSS+ DVDEDKV  EWLG CADVL LNGVDASLS+VCRREYTI+CNWKVFCDN+LDGGYHVPYAHKGLASNL L+SYSTELFETVSIQSCK  GE
Subjt:  NLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGE

Query:  SECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVP
        S+ D YGRLG EALY FIYPNF+INRYGPWMDTNLVLPLGPRKCLVVF YFLEASFKND+S IQ+SLEDSE VQNEDIILCE V      PAY  GRY P
Subjt:  SECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVP

Query:  SVEKAVHHFHRFLHLSLTK
        SVE A+HHFHR LH +LTK
Subjt:  SVEKAVHHFHRFLHLSLTK

A0A5D3BWX2 Choline monooxygenase, chloroplastic3.8e-19578.76Show/hide
Query:  MAMIRKHILPHNFFQKPCIT--RHSLNSRSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVE
        MA + KHI  H FFQ P I+   H  N RSPSRISA+L  R+ DS RF+EA+KLV +FDP+IP EKA+TPPSSWY D SFFALELDRVFYRGWQAVGYVE
Subjt:  MAMIRKHILPHNFFQKPCIT--RHSLNSRSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVE

Query:  QLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILL
        QL+D HD FTGRLGNVE+VVC+DN+RKVRAFHNVCRHH SLLA+G GKKSCFVCPYHGWT G+DG LLKATR NGIQNF+ NDFGL+PLPVA WGPF+LL
Subjt:  QLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILL

Query:  NLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGE
        NLD KLSS++DVDEDKV  EWLG CADVL LNGVDASLS+VCRREYTI+CNWKVFCDN+LDGGYHVPYAHKGLASNL L+SYSTELFETVSIQSCK  GE
Subjt:  NLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGE

Query:  SECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVP
        S+ D YGRLG EALY FIYPNF+INRYGPWMDTNLVLPLGPRKCLVVF YFLE+SFKND+S IQ+SLEDSE VQNEDIILCE V      PAY  GRY P
Subjt:  SECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVP

Query:  SVEKAVHHFHRFLHLSLTK
        SVE A+HHFHR LH +LTK
Subjt:  SVEKAVHHFHRFLHLSLTK

A0A6J1HPX3 Choline monooxygenase, chloroplastic1.8e-19779.05Show/hide
Query:  MAMIRKHILPHNFFQKPCITRHSLNS---RSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYV
        MA + KHI  H  F +P     +  S   RSP+RIS+ L  RN+DSH F+EA KLV EFDP+IP EKAVTPPSSWYTD SFFALELDRVF+RGWQAVGYV
Subjt:  MAMIRKHILPHNFFQKPCITRHSLNS---RSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYV

Query:  EQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFIL
        EQL+DPHD FTGRLGNVE+VVC+DN++KVRAFHNVCRHH SLLASG GKKSCFVCPYHGWT G+DG LLKATR NGIQNFDVNDFGL+PLPVATWGPF+L
Subjt:  EQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFIL

Query:  LNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVG
        LNLDEKLSSEL VDED+V HEWLG CAD+LSLNGVDASLSFVCRREYTIECNWKVFCDN+LDGGYHVPYAHKGLASNLKL+SYSTE+FE VSIQSCK  G
Subjt:  LNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVG

Query:  ESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYV
        ES+ D YGRLG EALY F+YPNF+INRYGPWMDTNLVLPLGPRKCLVVF YFLE  FKND S IQ+SLEDSE VQ EDIILCE V      PAY  GRY 
Subjt:  ESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYV

Query:  PSVEKAVHHFHRFLHLSLTK
        PSVE A+HHFHR LHL+LTK
Subjt:  PSVEKAVHHFHRFLHLSLTK

A0A6J1JGJ2 Choline monooxygenase, chloroplastic2.7e-19678.57Show/hide
Query:  MAMIRKHILPHNFFQKPCITRHSLNS---RSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYV
        MA + KHI  H  F +P     +  S   RSP+RIS  L  RN+DSH F+EA KLV EFDP+IP EKAVTPPSSWYTD SFFALELDRVF+RGWQAVGYV
Subjt:  MAMIRKHILPHNFFQKPCITRHSLNS---RSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYV

Query:  EQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFIL
        EQL+DPHD FTGRLGNVE+VVC+DN++KVRAFHNVCRHH SLLASG GKKSCFVCPYHGWT G+DG LLKATR NGIQNFDVNDFGL+PLPVATWGPF+L
Subjt:  EQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFIL

Query:  LNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVG
        LN+DEKLSSEL VDEDKV +EWLG CAD+LSLNGVDASLSFVCRREYTIECNWKVFCDN+LDGGYHVPYAHKGLASNLKL+SYSTE+FE VSIQSCK  G
Subjt:  LNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVG

Query:  ESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYV
        ES+ D YGRLG EALY F+YPNF+INRYGPWMDTNLVLPLGP+KCLVVF YFLE  FKND S IQ+SLEDSE VQ EDIILCE V      PAY  GRY 
Subjt:  ESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYV

Query:  PSVEKAVHHFHRFLHLSLTK
        PSVE A+HHFHR LHL+LTK
Subjt:  PSVEKAVHHFHRFLHLSLTK

SwissProt top hitse value%identityAlignment
O04121 Choline monooxygenase, chloroplastic2.0e-11649.5Show/hide
Query:  TRHSLNSRSPSRIS----AALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVE
        T  +L SR+P++I+    AA    +  +      Q LVHEFDPQIP E A TPPSSWYT+ +F++ EL+R+FY+GWQ  G  +Q+++P+  FTG LGNVE
Subjt:  TRHSLNSRSPSRIS----AALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVE

Query:  FVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDKV
        ++V RD + KV AFHNVC H  S+LA GSGKKSCFVCPYHGW  GMDG+L KA++    QN D  + GL+PL VA WGPF+L++LD  L    DV     
Subjt:  FVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDKV

Query:  THEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVF
          EWLG  A+ +  +  D SL F+ R E+ +E NWK+F DN+LD  YHVPYAHK  A+ L  D+Y T++ E V+IQ  +    ++ DG+ R+G +A Y F
Subjt:  THEWLGGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVF

Query:  IYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLHLSL
         YPNF + RYGPWM T  + PLGPRKC +V  Y++E S  +D   I+K +  ++ VQ ED++LCESV      PAY  GRYV  +EK +HHFH +L  +L
Subjt:  IYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLHLSL

O22553 Choline monooxygenase, chloroplastic4.9e-11551.91Show/hide
Query:  QKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSC
        + LVHEFDP+IP E A+TPPS+WYT+ +F++ EL+R+FY+GWQ  GY EQ+++ +  FTG LGNVE++V RD   ++ AFHNVC H  S+LA GSGKKSC
Subjt:  QKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSC

Query:  FVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECN
        FVCPYHGW  G+DG+L KA++    QN D  + GL PL VA WGPFIL++LD  L +  DV       EW+G  A+ +  +  D +L F  R E+ +ECN
Subjt:  FVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECN

Query:  WKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYF
        WKVFCDN+LD  YHVPYAHK  A+ L  D+Y+TE+ E   IQ   S   ++ DG+ RLG+EA Y FIYPNF + RYG WM T  V+P+G RKC +V  Y+
Subjt:  WKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYF

Query:  LEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLHLSL
        LE +  +D + I K +  ++ VQ ED +LCESV      PAY  GRYV  +EK +HHFH +LH +L
Subjt:  LEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLHLSL

Q93XE1 Choline monooxygenase, chloroplastic6.2e-11852.73Show/hide
Query:  QKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSC
        ++++HEFDP++P E   TPPS+WYTD S ++ ELDR+F +GWQ  GY +Q+++P+  FTG LGNVE++VCRD   KV AFHNVC H  S+LA G+GKKSC
Subjt:  QKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSC

Query:  FVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECN
        FVCPYHGW  G+DG+L+KAT+    Q FD  + GL+ L VA WGPF+L++LD   S       + V  EW+G CA+ +  +  D SL F+ R E+ +E N
Subjt:  FVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECN

Query:  WKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYF
        WKVFCDN+LD  YHVPYAHK  A+ L  D+Y T+L E V IQ   S   ++ +G+ RLGSEA Y FIYPNF + RYGPWM T  + PLGPRKC +V  Y+
Subjt:  WKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYF

Query:  LEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLHLSL
        LE +  ND   I+KS+  ++ VQ ED++LCESV      PAY  GRYV  +EK +HHFH +LH +L
Subjt:  LEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLHLSL

Q9LKN0 Choline monooxygenase, chloroplastic1.1e-11451.52Show/hide
Query:  QKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSC
        Q LV +FDP +P E A+TPPSSWYT+ +F+A ELDR+FY+GWQ  GY +Q+++ +  FTG LGNVE++VCRD + KV AFHNVC H  S+LA GSGKKSC
Subjt:  QKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSC

Query:  FVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECN
        FVCPYHGW  GM+G+L KA++    Q+ + ++ GL+PL VA WGPFIL++LD       DV       EWLG CA+ +  +  D +L F+ R E+ IE N
Subjt:  FVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDKVTHEWLGGCADVLSLNGVDASLSFVCRREYTIECN

Query:  WKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYF
        WK+F DN+LD  YHVPYAHK  A+ L  D+Y T++   V+IQ    V  +  +G+ RLG++A Y F YPNF + RYGPWM T  ++PLGPRKC +V  Y+
Subjt:  WKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVFYYF

Query:  LEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLH
        +E S  +D   I+K +  ++ VQ ED++LCESV      PAY  GRYV  +EK +HHFH +LH
Subjt:  LEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLH

Q9SZR0 Choline monooxygenase, chloroplastic5.8e-14062.6Show/hide
Query:  EAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKK
        +  KLV EFDP+IP E+A TPPSSWYTD  F++ ELDRVFY GWQAVGY +Q+++  D FTGRLG+V+FVVCRD + K+ AFHNVC HH S+LASG+G+K
Subjt:  EAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKK

Query:  SCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDK-VTHEWLGGCADVLSLNGVDASLSFVCRREYTI
        SCFVC YHGWT  + G+L+KATR +GIQNF +++ GL PL VA WGPF+LL +    S + +V+ D+ V  EWLG     LS  GVD+ LS++CRREYTI
Subjt:  SCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDK-VTHEWLGGCADVLSLNGVDASLSFVCRREYTI

Query:  ECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVF
        +CNWKVFCDN+LDGGYHVPYAHKGL S L L++YST +FE VSIQ C    +   DG+ RLGSEALY F+YPNF+INRYGPWMDTNLVLPLGPRKC VVF
Subjt:  ECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVF

Query:  YYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLHLSL
         YFL+ S K+D + I++SLE+S+ VQ ED++LCESV       AY+ GRY   VEK +HHFH  LH +L
Subjt:  YYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLHLSL

Arabidopsis top hitse value%identityAlignment
AT4G29890.1 choline monooxygenase, putative (CMO-like)4.1e-14162.6Show/hide
Query:  EAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKK
        +  KLV EFDP+IP E+A TPPSSWYTD  F++ ELDRVFY GWQAVGY +Q+++  D FTGRLG+V+FVVCRD + K+ AFHNVC HH S+LASG+G+K
Subjt:  EAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGRLGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKK

Query:  SCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDK-VTHEWLGGCADVLSLNGVDASLSFVCRREYTI
        SCFVC YHGWT  + G+L+KATR +GIQNF +++ GL PL VA WGPF+LL +    S + +V+ D+ V  EWLG     LS  GVD+ LS++CRREYTI
Subjt:  SCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDK-VTHEWLGGCADVLSLNGVDASLSFVCRREYTI

Query:  ECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVF
        +CNWKVFCDN+LDGGYHVPYAHKGL S L L++YST +FE VSIQ C    +   DG+ RLGSEALY F+YPNF+INRYGPWMDTNLVLPLGPRKC VVF
Subjt:  ECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVFIYPNFIINRYGPWMDTNLVLPLGPRKCLVVF

Query:  YYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLHLSL
         YFL+ S K+D + I++SLE+S+ VQ ED++LCESV       AY+ GRY   VEK +HHFH  LH +L
Subjt:  YYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESV------PAYNVGRYVPSVEKAVHHFHRFLHLSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATGATAAGGAAGCATATTCTTCCCCATAATTTCTTTCAAAAGCCTTGCATTACTCGGCATTCGCTTAACAGTCGTTCGCCCTCACGAATCTCGGCCGCTCTCGG
TCTTCGAAATGCCGATTCTCATCGCTTCGTTGAAGCTCAGAAACTCGTCCATGAATTCGACCCTCAAATTCCTTTTGAGAAGGCCGTCACTCCACCTAGCTCCTGGTACA
CCGACTCTTCGTTTTTCGCTCTCGAGCTCGATCGAGTGTTTTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTACAAGATCCCCATGACGTTTTCACTGGCAGG
TTGGGGAATGTGGAGTTTGTGGTGTGCAGAGATAACGACAGGAAGGTTCGTGCATTTCACAATGTCTGCCGCCATCACACGTCACTTCTCGCATCCGGAAGTGGGAAAAA
GTCGTGCTTTGTTTGCCCGTACCATGGATGGACAGATGGGATGGATGGAGCTCTGCTTAAGGCGACTAGAAAAAATGGGATACAAAACTTTGATGTAAATGATTTTGGGC
TCATACCATTACCAGTAGCTACATGGGGGCCTTTCATTCTTCTCAATTTGGATGAAAAATTATCATCTGAGCTGGATGTTGATGAAGATAAAGTAACACATGAATGGCTT
GGAGGCTGTGCAGATGTGCTGAGTTTGAATGGAGTTGATGCATCGTTAAGCTTTGTTTGTCGACGCGAATATACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTT
CTTAGATGGAGGGTATCATGTTCCCTATGCACATAAAGGGCTCGCTTCAAATCTCAAGCTTGATTCTTATTCTACAGAACTATTTGAAACTGTTAGCATTCAAAGTTGTA
AGAGTGTGGGAGAATCCGAATGCGATGGTTACGGTCGACTCGGATCAGAAGCTCTATATGTTTTTATATACCCAAACTTCATCATAAATAGGTATGGACCTTGGATGGAC
ACTAATCTAGTACTGCCACTCGGACCTCGAAAATGTCTTGTGGTTTTTTATTATTTTCTCGAAGCTTCTTTTAAGAATGACAACTCTGCTATACAAAAAAGTTTAGAAGA
TAGTGAATGTGTGCAGAATGAAGACATTATTTTGTGTGAAAGTGTTCCTGCTTACAATGTTGGTCGATATGTTCCTTCTGTTGAGAAAGCTGTGCACCATTTCCATCGTT
TTCTTCATCTTAGCCTCACCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGATGATAAGGAAGCATATTCTTCCCCATAATTTCTTTCAAAAGCCTTGCATTACTCGGCATTCGCTTAACAGTCGTTCGCCCTCACGAATCTCGGCCGCTCTCGG
TCTTCGAAATGCCGATTCTCATCGCTTCGTTGAAGCTCAGAAACTCGTCCATGAATTCGACCCTCAAATTCCTTTTGAGAAGGCCGTCACTCCACCTAGCTCCTGGTACA
CCGACTCTTCGTTTTTCGCTCTCGAGCTCGATCGAGTGTTTTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTACAAGATCCCCATGACGTTTTCACTGGCAGG
TTGGGGAATGTGGAGTTTGTGGTGTGCAGAGATAACGACAGGAAGGTTCGTGCATTTCACAATGTCTGCCGCCATCACACGTCACTTCTCGCATCCGGAAGTGGGAAAAA
GTCGTGCTTTGTTTGCCCGTACCATGGATGGACAGATGGGATGGATGGAGCTCTGCTTAAGGCGACTAGAAAAAATGGGATACAAAACTTTGATGTAAATGATTTTGGGC
TCATACCATTACCAGTAGCTACATGGGGGCCTTTCATTCTTCTCAATTTGGATGAAAAATTATCATCTGAGCTGGATGTTGATGAAGATAAAGTAACACATGAATGGCTT
GGAGGCTGTGCAGATGTGCTGAGTTTGAATGGAGTTGATGCATCGTTAAGCTTTGTTTGTCGACGCGAATATACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTT
CTTAGATGGAGGGTATCATGTTCCCTATGCACATAAAGGGCTCGCTTCAAATCTCAAGCTTGATTCTTATTCTACAGAACTATTTGAAACTGTTAGCATTCAAAGTTGTA
AGAGTGTGGGAGAATCCGAATGCGATGGTTACGGTCGACTCGGATCAGAAGCTCTATATGTTTTTATATACCCAAACTTCATCATAAATAGGTATGGACCTTGGATGGAC
ACTAATCTAGTACTGCCACTCGGACCTCGAAAATGTCTTGTGGTTTTTTATTATTTTCTCGAAGCTTCTTTTAAGAATGACAACTCTGCTATACAAAAAAGTTTAGAAGA
TAGTGAATGTGTGCAGAATGAAGACATTATTTTGTGTGAAAGTGTTCCTGCTTACAATGTTGGTCGATATGTTCCTTCTGTTGAGAAAGCTGTGCACCATTTCCATCGTT
TTCTTCATCTTAGCCTCACCAAATAA
Protein sequenceShow/hide protein sequence
MAMIRKHILPHNFFQKPCITRHSLNSRSPSRISAALGLRNADSHRFVEAQKLVHEFDPQIPFEKAVTPPSSWYTDSSFFALELDRVFYRGWQAVGYVEQLQDPHDVFTGR
LGNVEFVVCRDNDRKVRAFHNVCRHHTSLLASGSGKKSCFVCPYHGWTDGMDGALLKATRKNGIQNFDVNDFGLIPLPVATWGPFILLNLDEKLSSELDVDEDKVTHEWL
GGCADVLSLNGVDASLSFVCRREYTIECNWKVFCDNFLDGGYHVPYAHKGLASNLKLDSYSTELFETVSIQSCKSVGESECDGYGRLGSEALYVFIYPNFIINRYGPWMD
TNLVLPLGPRKCLVVFYYFLEASFKNDNSAIQKSLEDSECVQNEDIILCESVPAYNVGRYVPSVEKAVHHFHRFLHLSLTK