CuGenDBv2

Cucsat.G7371 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G7371
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Choline monooxygenase, chloroplastic
Genome location: ctg1528:308395..311655
RNA-Seq Expression: Cucsat.G7371
Synteny: Cucsat.G7371
Gene Ontology terms:
GO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016020 - membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domains:
IPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase
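
The genome location in this record is written as contig:start..end. The following is a minimal Python sketch for splitting that string and computing the span length. It assumes the coordinates are 1-based and inclusive, which the record itself does not state, and the function name `parse_location` is purely illustrative.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("ctg1528:308395..311655")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene spans 3261 positions on ctg1528. With 0-based, half-open coordinates the length would instead be end - start.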


Homology
GenBank top hits (e value, % identity, alignment)
TYK04233.1 choline monooxygenase [Cucumis melo var. makuwa] | e value: 1.20e-304 | % identity: 94.24
Query:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PSISFNFH CNHRSP RISA+LSFR+PDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSK DVDE+KVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

XP_004139149.1 choline monooxygenase, chloroplastic isoform X1 [Cucumis sativus] | e value: 0.0 | % identity: 99.28
Query:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MAMLTKHIQVHFFQLPS SFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDE+KVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDD GRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLHCNLTK
Subjt:  ENAMHHFHRLLHCNLTK

XP_008443631.1 PREDICTED: choline monooxygenase, chloroplastic isoform X2 [Cucumis melo] | e value: 1.77e-306 | % identity: 94.72
Query:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PSISFNFH CNHRSP RISA+LSFR+PDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDE+KVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

XP_016899776.1 PREDICTED: choline monooxygenase, chloroplastic isoform X1 [Cucumis melo] | e value: 1.94e-301 | % identity: 90.37
Query:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAV------
        MA LTKHIQ+HFFQ PSISFNFH CNHRSP RISA+LSFR+PDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQA+      
Subjt:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAV------

Query:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDEND
                     GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+END
Subjt:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDEND

Query:  FGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS
        FGLVPLPVA WGPFVLLNLDGKLSSKPDVDE+KVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS
Subjt:  FGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS

Query:  TELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG
        TELFETVSIQSCKGGGESKGDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEG
Subjt:  TELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG

Query:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNLTK
        VQKGLESPAYKFGRYAPSVENAMHHFHRLLH NLTK
Subjt:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNLTK

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida] | e value: 7.61e-292 | % identity: 89.93
Query:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHI  HFFQ PSISFN HSCNHRSP RISAALSFRN DS  IEA+KLVD+FDP+IPLEKA+TPPSSWYIDPSF+ALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGK+SCFVCPYHGWTYGLDG+LLKATRI+GIQNFDEN+FGL+PLPVATWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSS+ DVDE+KV  EWLG+CAD+L LNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTE+FETVSIQSCKGGGE+K
Subjt:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        G+DYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLG RKCLVVFDYFLEASFKND  FIQ SLEDSESVQ EDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LXX3 Choline monooxygenase, chloroplastic | e value: 0.0 | % identity: 99.28
Query:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MAMLTKHIQVHFFQLPS SFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDE+KVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDD GRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLHCNLTK
Subjt:  ENAMHHFHRLLHCNLTK

A0A1S3B9A0 Choline monooxygenase, chloroplastic | e value: 8.59e-307 | % identity: 94.72
Query:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PSISFNFH CNHRSP RISA+LSFR+PDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDE+KVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

A0A1S4DUW5 choline monooxygenase, chloroplastic isoform X1 | e value: 9.39e-302 | % identity: 90.37
Query:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAV------
        MA LTKHIQ+HFFQ PSISFNFH CNHRSP RISA+LSFR+PDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQA+      
Subjt:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAV------

Query:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDEND
                     GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+END
Subjt:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDEND

Query:  FGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS
        FGLVPLPVA WGPFVLLNLDGKLSSKPDVDE+KVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS
Subjt:  FGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS

Query:  TELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG
        TELFETVSIQSCKGGGESKGDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEG
Subjt:  TELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG

Query:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNLTK
        VQKGLESPAYKFGRYAPSVENAMHHFHRLLH NLTK
Subjt:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNLTK

A0A5A7UY48 Choline monooxygenase, chloroplastic | e value: 8.59e-307 | % identity: 94.72
Query:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PSISFNFH CNHRSP RISA+LSFR+PDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDE+KVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

A0A5D3BWX2 Choline monooxygenase, chloroplastic | e value: 5.80e-305 | % identity: 94.24
Query:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PSISFNFH CNHRSP RISA+LSFR+PDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSK DVDE+KVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

SwissProt top hits (e value, % identity, alignment)
O04121 Choline monooxygenase, chloroplastic | e value: 5.7e-119 | % identity: 52.65
Query:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        + LV +FDPQIP E A TPPSSWY +P+F++ EL  +FY+GWQ  G  +Q+K+ + +FTG LGNVEY+V +D   KV AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN
        FVCPYHGW YG+DG L KA++    QN D  + GLVPL VA WGPFVL++LD  L    D     V  EWLGT A+ ++ +  D SL ++ R E+ +E N
Subjt:  FVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WK+F DNYLD  YHVPYAHK  A+ L  ++Y T++ E V+IQ  + G  +K D + R+G +A YAF YPNF + RYGPWM T  + PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFH
        +E S  +D  +I+  +  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH
Subjt:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFH

O22553 Choline monooxygenase, chloroplastic | e value: 6.8e-120 | % identity: 52.34
Query:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        R LV +FDP+IP E ALTPPS+WY +P+F++ EL  +FY+GWQ  GY EQ+K+ + +FTG LGNVEY+V +D   ++ AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN
        FVCPYHGW YGLDG L KA++    QN D  + GL PL VA WGPF+L++LD  L +  D     V  EW+G  A+ ++ +  D +L +  R E+ +ECN
Subjt:  FVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLD  YHVPYAHK  A+ L  ++Y+TE+ E   IQ   G   +K D + RLG+EA YAF+YPNF + RYG WM T  V+P+G RKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH
        LE +  +D ++I   +  +++VQ ED +LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH
Subjt:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH

Q93XE1 Choline monooxygenase, chloroplastic | e value: 9.4e-122 | % identity: 50.65
Query:  PRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFH
        P I ++++  N  +     ++++ +FDP++P E   TPPS+WY DPS ++ EL+ +F +GWQ  GY +Q+K+ + +FTG LGNVEY+VC+D   KV AFH
Subjt:  PRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFH

Query:  NVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLN
        NVC H AS+LA G GKKSCFVCPYHGW +GLDG L+KAT+ +  Q FD  + GLV L VA WGPFVL++LD   S   +     V +EW+G+CA+ ++ +
Subjt:  NVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLN

Query:  GVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMD
          D SL ++ R E+ +E NWKVFCDNYLD  YHVPYAHK  A+ L  ++Y T+L E V IQ       +K + + RLGSEA YAF+YPNF + RYGPWM 
Subjt:  GVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMD

Query:  TNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL
        T  + PLGPRKC +V DY+LE +  ND  +I+ S+  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH  L
Subjt:  TNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL

Q9LKN0 Choline monooxygenase, chloroplastic | e value: 2.3e-120 | % identity: 51.79
Query:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        + LV DFDP +P E ALTPPSSWY +P+F+A EL+ +FY+GWQ  GY +Q+K+A+ +FTG LGNVEY+VC+D   KV AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN
        FVCPYHGW YG++G L KA++    Q+ + ++ GLVPL VA WGPF+L++LD     +   +   V  EWLG+CA+ ++ +  D +L ++ R E+ IE N
Subjt:  FVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WK+F DNYLD  YHVPYAHK  A+ L  ++Y T++   V+IQ   G   +  + + RLG++A YAF YPNF + RYGPWM T  ++PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH
        +E S  +D  +I+  +  +++VQ ED++LCE VQKGLE+PAY+ GRY   +E  +HHFH  LH
Subjt:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH

Q9SZR0 Choline monooxygenase, chloroplastic | e value: 2.7e-153 | % identity: 65.16
Query:  FNFHSCNHRSPPRISAALSFRNPDS--RLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYV
        FN HS    S  + S    F NP     + +  KLV +FDP+IPLE+A TPPSSWY DP F++ EL+ VFY GWQAVGY +Q+K++ DFFTGRLG+V++V
Subjt:  FNFHSCNHRSPPRISAALSFRNPDS--RLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYV

Query:  VCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENK-VA
        VC+D N K+ AFHNVC HHAS+LASG G+KSCFVC YHGWTY L G L+KATR+ GIQNF  ++ GL PL VA WGPFVLL +    S K +V+ ++ VA
Subjt:  VCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENK-VA

Query:  REWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFV
         EWLGT    L   GVD+ LSY+CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YST +FE VSIQ C GG +   D + RLGSEALYAFV
Subjt:  REWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFV

Query:  YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL
        YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+ S K+D++FI+ SLE+S+ VQ ED++LCE VQ+GLES AY  GRYA  VE  MHHFH LLH NL
Subjt:  YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL

Arabidopsis top hits (e value, % identity, alignment)
AT4G29890.1 choline monooxygenase, putative (CMO-like) | e value: 1.9e-154 | % identity: 65.16
Query:  FNFHSCNHRSPPRISAALSFRNPDS--RLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYV
        FN HS    S  + S    F NP     + +  KLV +FDP+IPLE+A TPPSSWY DP F++ EL+ VFY GWQAVGY +Q+K++ DFFTGRLG+V++V
Subjt:  FNFHSCNHRSPPRISAALSFRNPDS--RLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYV

Query:  VCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENK-VA
        VC+D N K+ AFHNVC HHAS+LASG G+KSCFVC YHGWTY L G L+KATR+ GIQNF  ++ GL PL VA WGPFVLL +    S K +V+ ++ VA
Subjt:  VCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENK-VA

Query:  REWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFV
         EWLGT    L   GVD+ LSY+CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YST +FE VSIQ C GG +   D + RLGSEALYAFV
Subjt:  REWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFV

Query:  YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL
        YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+ S K+D++FI+ SLE+S+ VQ ED++LCE VQ+GLES AY  GRYA  VE  MHHFH LLH NL
Subjt:  YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL
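
In the alignments above, the unlabeled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows one way to count identities from such a middle line. It is an illustration only, not the database's own calculation, and the helper name `midline_identity` is hypothetical. The example input is the final block of the first GenBank hit (TYK04233.1).

```python
def midline_identity(query, midline):
    """Return (identical_columns, aligned_columns) for one alignment block.

    Assumes BLAST-style midlines: letters are identities, '+' and spaces are not.
    """
    # Trailing spaces are often lost in copied text, so pad to the query length.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(query)


query = "ENAMHHFHRLLHCNLTK"
midline = "ENAMHHFHRLLH NLTK"
identical, aligned = midline_identity(query, midline)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.2f}%)")
```

The % identity reported for each hit covers the whole alignment, so reproducing it would require summing the counts over every block of that hit rather than using a single block.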


Sequences
CDS sequence
ATGGCGATGTTGACGAAGCATATCCAAGTTCACTTCTTCCAACTTCCTTCCATTTCCTTCAATTTCCATTCCTGTAATCATCGTTCTCCCCCACGGATCTCCGCAGCTTT
ATCCTTTCGAAACCCTGATTCTCGACTCATTGAAGCTCGGAAACTCGTTGATGATTTCGACCCTCAAATTCCATTGGAGAAGGCCCTCACTCCACCCTCCTCCTGGTATA
TAGACCCTTCATTTTTTGCTCTCGAGCTCAATCATGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATGCCCATGACTTTTTCACAGGCAGG
TTGGGAAATGTAGAGTATGTGGTATGCAAAGATAATAACAGAAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCGTCTGGATGTGGGAAGAA
GTCGTGCTTTGTATGCCCATATCATGGATGGACATACGGGTTGGATGGAATACTGCTTAAGGCGACTAGAATAGATGGGATACAGAACTTTGATGAAAATGATTTTGGGC
TTGTACCATTGCCAGTAGCTACGTGGGGGCCTTTCGTTCTTCTCAATTTGGATGGAAAATTATCATCTAAGCCAGATGTTGATGAAAATAAAGTGGCACGTGAATGGCTT
GGAACATGTGCAGATGTGCTGAGGCTAAACGGAGTTGATGCTTCCCTAAGTTATGTCTGTCGACGTGAATACACTATTGAATGTAACTGGAAGGTTTTTTGTGACAACTA
TTTAGATGGAGGATATCACGTTCCATATGCACATAAAGGGCTTGCATCTAATCTCAAGCTTGAGTCTTATTCTACAGAACTATTTGAAACTGTTAGCATTCAAAGTTGTA
AGGGTGGGGGAGAATCAAAAGGTGATGATTATGGTCGACTTGGATCAGAAGCACTCTATGCTTTTGTATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGAC
ACTAATCTAGTACTCCCACTTGGGCCTCGAAAATGTCTGGTGGTTTTTGATTATTTTCTTGAAGCTTCTTTTAAGAATGATGACTCCTTTATACAACTAAGTTTAGAAGA
CAGTGAAAGCGTGCAGAATGAAGACATTATTCTGTGTGAAGGAGTTCAAAAGGGTCTCGAGTCACCAGCTTACAAGTTTGGCCGATATGCACCTTCGGTCGAGAATGCCA
TGCACCATTTCCATCGTCTTCTTCATTGTAACCTCACAAAATAA
mRNA sequence
ATGGCGATGTTGACGAAGCATATCCAAGTTCACTTCTTCCAACTTCCTTCCATTTCCTTCAATTTCCATTCCTGTAATCATCGTTCTCCCCCACGGATCTCCGCAGCTTT
ATCCTTTCGAAACCCTGATTCTCGACTCATTGAAGCTCGGAAACTCGTTGATGATTTCGACCCTCAAATTCCATTGGAGAAGGCCCTCACTCCACCCTCCTCCTGGTATA
TAGACCCTTCATTTTTTGCTCTCGAGCTCAATCATGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATGCCCATGACTTTTTCACAGGCAGG
TTGGGAAATGTAGAGTATGTGGTATGCAAAGATAATAACAGAAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCGTCTGGATGTGGGAAGAA
GTCGTGCTTTGTATGCCCATATCATGGATGGACATACGGGTTGGATGGAATACTGCTTAAGGCGACTAGAATAGATGGGATACAGAACTTTGATGAAAATGATTTTGGGC
TTGTACCATTGCCAGTAGCTACGTGGGGGCCTTTCGTTCTTCTCAATTTGGATGGAAAATTATCATCTAAGCCAGATGTTGATGAAAATAAAGTGGCACGTGAATGGCTT
GGAACATGTGCAGATGTGCTGAGGCTAAACGGAGTTGATGCTTCCCTAAGTTATGTCTGTCGACGTGAATACACTATTGAATGTAACTGGAAGGTTTTTTGTGACAACTA
TTTAGATGGAGGATATCACGTTCCATATGCACATAAAGGGCTTGCATCTAATCTCAAGCTTGAGTCTTATTCTACAGAACTATTTGAAACTGTTAGCATTCAAAGTTGTA
AGGGTGGGGGAGAATCAAAAGGTGATGATTATGGTCGACTTGGATCAGAAGCACTCTATGCTTTTGTATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGAC
ACTAATCTAGTACTCCCACTTGGGCCTCGAAAATGTCTGGTGGTTTTTGATTATTTTCTTGAAGCTTCTTTTAAGAATGATGACTCCTTTATACAACTAAGTTTAGAAGA
CAGTGAAAGCGTGCAGAATGAAGACATTATTCTGTGTGAAGGAGTTCAAAAGGGTCTCGAGTCACCAGCTTACAAGTTTGGCCGATATGCACCTTCGGTCGAGAATGCCA
TGCACCATTTCCATCGTCTTCTTCATTGTAACCTCACAAAATAA
Protein sequence
MAMLTKHIQVHFFQLPSISFNFHSCNHRSPPRISAALSFRNPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGR
LGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDENKVAREWL
GTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMD
TNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNLTK
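
The protein sequence should correspond to a translation of the CDS. As a rough consistency check, the sketch below translates the first 30 and the last 18 nucleotides of the CDS listed above, assuming the standard genetic code (NCBI translation table 1). The function and variable names are illustrative and are not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCGATGTTGACGAAGCATATCCAAGTT"  # first 30 nt of the CDS
cds_end = "TGTAACCTCACAAAATAA"                # last 18 nt, ending in the TAA stop
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to MAMLTKHIQV and CNLTK, which match the start and end of the listed protein sequence. Passing the full CDS, with line breaks removed, through `translate` would check the whole sequence in the same way.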