; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G31640 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G31640
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionCholine monooxygenase, chloroplastic
Genome locationChr1:26303491..26306612
RNA-Seq ExpressionCSPI01G31640
SyntenyCSPI01G31640
Gene Ontology termsGO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK04233.1 choline monooxygenase [Cucumis melo var. makuwa]9.7e-24194.48Show/hide
Query:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PS SFNFH CNHRSP RISA+LSFRSPDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSK DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

XP_004139149.1 choline monooxygenase, chloroplastic isoform X1 [Cucumis sativus]2.2e-25399.52Show/hide
Query:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFR+PDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDD GRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLHCNLTK
Subjt:  ENAMHHFHRLLHCNLTK

XP_008443631.1 PREDICTED: choline monooxygenase, chloroplastic isoform X2 [Cucumis melo]3.9e-24294.96Show/hide
Query:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PS SFNFH CNHRSP RISA+LSFRSPDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

XP_016899776.1 PREDICTED: choline monooxygenase, chloroplastic isoform X1 [Cucumis melo]1.6e-23890.6Show/hide
Query:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAV------
        MA LTKHIQ+HFFQ PS SFNFH CNHRSP RISA+LSFRSPDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQA+      
Subjt:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAV------

Query:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDEND
                     GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+END
Subjt:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDEND

Query:  FGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS
        FGLVPLPVA WGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS
Subjt:  FGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS

Query:  TELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG
        TELFETVSIQSCKGGGESKGDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEG
Subjt:  TELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG

Query:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNLTK
        VQKGLESPAYKFGRYAPSVENAMHHFHRLLH NLTK
Subjt:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNLTK

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida]4.5e-23089.69Show/hide
Query:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHI  HFFQ PS SFN HSCNHRSP RISAALSFR+ DS  IEA+KLVD+FDP+IPLEKA+TPPSSWYIDPSF+ALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGK+SCFVCPYHGWTYGLDG+LLKATRI+GIQNFDEN+FGL+PLPVATWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSS+ DVDEDKV  EWLG+CAD+L LNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTE+FETVSIQSCKGGGE+K
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        G+DYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLG RKCLVVFDYFLEASFKND  FIQ SLEDSESVQ EDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

TrEMBL top hitse value%identityAlignment
A0A0A0LXX3 Choline monooxygenase, chloroplastic1.1e-25399.52Show/hide
Query:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFR+PDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDD GRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLHCNLTK
Subjt:  ENAMHHFHRLLHCNLTK

A0A1S3B9A0 Choline monooxygenase, chloroplastic1.9e-24294.96Show/hide
Query:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PS SFNFH CNHRSP RISA+LSFRSPDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

A0A1S4DUW5 choline monooxygenase, chloroplastic isoform X17.5e-23990.6Show/hide
Query:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAV------
        MA LTKHIQ+HFFQ PS SFNFH CNHRSP RISA+LSFRSPDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQA+      
Subjt:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAV------

Query:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDEND
                     GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+END
Subjt:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDEND

Query:  FGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS
        FGLVPLPVA WGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS
Subjt:  FGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS

Query:  TELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG
        TELFETVSIQSCKGGGESKGDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEG
Subjt:  TELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG

Query:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNLTK
        VQKGLESPAYKFGRYAPSVENAMHHFHRLLH NLTK
Subjt:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNLTK

A0A5A7UY48 Choline monooxygenase, chloroplastic1.9e-24294.96Show/hide
Query:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PS SFNFH CNHRSP RISA+LSFRSPDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

A0A5D3BWX2 Choline monooxygenase, chloroplastic4.7e-24194.48Show/hide
Query:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PS SFNFH CNHRSP RISA+LSFRSPDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK
        DGKLSSK DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

SwissProt top hitse value%identityAlignment
O04121 Choline monooxygenase, chloroplastic4.4e-11951.04Show/hide
Query:  RSPPRIS----AALSFRSPDSRLIEA-RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDN
        R+P +I+    AA SF S  +    + + LV +FDPQIP E A TPPSSWY +P+F++ EL  +FY+GWQ  G  +Q+K+ + +FTG LGNVEY+V +D 
Subjt:  RSPPRIS----AALSFRSPDSRLIEA-RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDN

Query:  NRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGT
          KV AFHNVC H AS+LA G GKKSCFVCPYHGW YG+DG L KA++    QN D  + GLVPL VA WGPFVL++LD  L    D     V  EWLGT
Subjt:  NRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGT

Query:  CADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMI
         A+ ++ +  D SL ++ R E+ +E NWK+F DNYLD  YHVPYAHK  A+ L  ++Y T++ E V+IQ  + G  +K D + R+G +A YAF YPNF +
Subjt:  CADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMI

Query:  NRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFH
         RYGPWM T  + PLGPRKC +V DY++E S  +D  +I+  +  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH
Subjt:  NRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFH

O22553 Choline monooxygenase, chloroplastic5.2e-12052.34Show/hide
Query:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        R LV +FDP+IP E ALTPPS+WY +P+F++ EL  +FY+GWQ  GY EQ+K+ + +FTG LGNVEY+V +D   ++ AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN
        FVCPYHGW YGLDG L KA++    QN D  + GL PL VA WGPF+L++LD  L +  D     V  EW+G  A+ ++ +  D +L +  R E+ +ECN
Subjt:  FVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLD  YHVPYAHK  A+ L  ++Y+TE+ E   IQ   G   +K D + RLG+EA YAF+YPNF + RYG WM T  V+P+G RKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH
        LE +  +D ++I   +  +++VQ ED +LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH
Subjt:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH

Q93XE1 Choline monooxygenase, chloroplastic1.6e-12150.91Show/hide
Query:  PRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFH
        P I ++++  +  +     ++++ +FDP++P E   TPPS+WY DPS ++ EL+ +F +GWQ  GY +Q+K+ + +FTG LGNVEY+VC+D   KV AFH
Subjt:  PRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFH

Query:  NVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLN
        NVC H AS+LA G GKKSCFVCPYHGW +GLDG L+KAT+ +  Q FD  + GLV L VA WGPFVL++LD    S  +  ED V +EW+G+CA+ ++ +
Subjt:  NVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLN

Query:  GVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMD
          D SL ++ R E+ +E NWKVFCDNYLD  YHVPYAHK  A+ L  ++Y T+L E V IQ       +K + + RLGSEA YAF+YPNF + RYGPWM 
Subjt:  GVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMD

Query:  TNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL
        T  + PLGPRKC +V DY+LE +  ND  +I+ S+  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH  L
Subjt:  TNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL

Q9LKN0 Choline monooxygenase, chloroplastic1.4e-12052.34Show/hide
Query:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        + LV DFDP +P E ALTPPSSWY +P+F+A EL+ +FY+GWQ  GY +Q+K+A+ +FTG LGNVEY+VC+D   KV AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN
        FVCPYHGW YG++G L KA++    Q+ + ++ GLVPL VA WGPF+L++LD    S  +V +  V  EWLG+CA+ ++ +  D +L ++ R E+ IE N
Subjt:  FVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WK+F DNYLD  YHVPYAHK  A+ L  ++Y T++   V+IQ   G   +  + + RLG++A YAF YPNF + RYGPWM T  ++PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH
        +E S  +D  +I+  +  +++VQ ED++LCE VQKGLE+PAY+ GRY   +E  +HHFH  LH
Subjt:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH

Q9SZR0 Choline monooxygenase, chloroplastic5.5e-15468.85Show/hide
Query:  KLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF
        KLV +FDP+IPLE+A TPPSSWY DP F++ EL+ VFY GWQAVGY +Q+K++ DFFTGRLG+V++VVC+D N K+ AFHNVC HHAS+LASG G+KSCF
Subjt:  KLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF

Query:  VCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIECN
        VC YHGWTY L G L+KATR+ GIQNF  ++ GL PL VA WGPFVLL +    S K +V+ D+ VA EWLGT    L   GVD+ LSY+CRREYTI+CN
Subjt:  VCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLDGGYHVPYAHKGL S L LE+YST +FE VSIQ C GG +   D + RLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKC VVFDYF
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL
        L+ S K+D++FI+ SLE+S+ VQ ED++LCE VQ+GLES AY  GRYA  VE  MHHFH LLH NL
Subjt:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL

Arabidopsis top hitse value%identityAlignment
AT4G29890.1 choline monooxygenase, putative (CMO-like)3.9e-15568.85Show/hide
Query:  KLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF
        KLV +FDP+IPLE+A TPPSSWY DP F++ EL+ VFY GWQAVGY +Q+K++ DFFTGRLG+V++VVC+D N K+ AFHNVC HHAS+LASG G+KSCF
Subjt:  KLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF

Query:  VCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIECN
        VC YHGWTY L G L+KATR+ GIQNF  ++ GL PL VA WGPFVLL +    S K +V+ D+ VA EWLGT    L   GVD+ LSY+CRREYTI+CN
Subjt:  VCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLDGGYHVPYAHKGL S L LE+YST +FE VSIQ C GG +   D + RLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKC VVFDYF
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL
        L+ S K+D++FI+ SLE+S+ VQ ED++LCE VQ+GLES AY  GRYA  VE  MHHFH LLH NL
Subjt:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATGTTGACGAAGCATATCCAAGTTCACTTCTTCCAACTTCCTTCCACTTCCTTCAATTTCCATTCCTGTAATCATCGTTCTCCCCCACGGATCTCCGCAGCTTT
ATCCTTTCGAAGCCCTGATTCTCGACTCATTGAAGCTCGGAAACTCGTTGATGATTTCGACCCTCAAATTCCATTGGAGAAGGCCCTCACTCCACCCTCCTCCTGGTATA
TAGACCCTTCATTTTTTGCTCTCGAGCTCAATCATGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATGCCCATGACTTTTTCACAGGCAGG
TTGGGAAATGTAGAGTATGTGGTATGCAAAGATAATAACAGAAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCGTCTGGATGTGGGAAGAA
GTCGTGCTTTGTATGCCCATATCATGGATGGACATACGGGTTGGATGGAATACTGCTTAAGGCGACTAGAATAGATGGGATACAGAACTTTGATGAAAATGATTTTGGGC
TTGTACCATTGCCAGTAGCTACGTGGGGGCCTTTCGTTCTTCTCAATTTGGATGGAAAATTATCATCTAAGCCAGATGTTGATGAAGATAAAGTGGCACGTGAATGGCTT
GGAACATGTGCAGATGTGCTGAGGCTAAACGGAGTTGATGCTTCCCTAAGTTATGTCTGTCGACGTGAATACACTATTGAATGTAACTGGAAGGTTTTTTGTGACAACTA
TTTAGATGGAGGATATCACGTTCCCTATGCACATAAAGGGCTTGCATCTAATCTCAAGCTTGAGTCTTATTCTACAGAACTATTTGAAACTGTTAGCATTCAAAGTTGTA
AGGGTGGGGGAGAATCAAAAGGTGACGATTATGGTCGACTTGGATCAGAAGCACTCTATGCTTTTGTATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGAC
ACTAATCTAGTACTCCCACTTGGACCTCGAAAATGTCTGGTGGTTTTCGATTATTTTCTTGAAGCTTCTTTTAAGAATGATGACTCCTTTATACAACTAAGTTTAGAAGA
CAGTGAAAGCGTGCAGAATGAAGACATTATTCTGTGTGAAGGAGTTCAAAAGGGTCTCGAGTCACCAGCTTACAAGTTTGGCCGATATGCACCTTCGGTCGAGAATGCCA
TGCACCATTTCCATCGTCTTCTTCATTGTAACCTCACAAAATAA
mRNA sequenceShow/hide mRNA sequence
AAAACAAGTTGTTATTTGAAGAAGAAGCTTGAGGACTGTTGCAATGGCGATGTTGACGAAGCATATCCAAGTTCACTTCTTCCAACTTCCTTCCACTTCCTTCAATTTCC
ATTCCTGTAATCATCGTTCTCCCCCACGGATCTCCGCAGCTTTATCCTTTCGAAGCCCTGATTCTCGACTCATTGAAGCTCGGAAACTCGTTGATGATTTCGACCCTCAA
ATTCCATTGGAGAAGGCCCTCACTCCACCCTCCTCCTGGTATATAGACCCTTCATTTTTTGCTCTCGAGCTCAATCATGTCTTCTACAGAGGATGGCAGGCTGTAGGATA
TGTTGAACAGTTAAAAGATGCCCATGACTTTTTCACAGGCAGGTTGGGAAATGTAGAGTATGTGGTATGCAAAGATAATAACAGAAAGGTTCGTGCATTTCACAATGTTT
GTCGCCATCATGCCTCACTTCTTGCGTCTGGATGTGGGAAGAAGTCGTGCTTTGTATGCCCATATCATGGATGGACATACGGGTTGGATGGAATACTGCTTAAGGCGACT
AGAATAGATGGGATACAGAACTTTGATGAAAATGATTTTGGGCTTGTACCATTGCCAGTAGCTACGTGGGGGCCTTTCGTTCTTCTCAATTTGGATGGAAAATTATCATC
TAAGCCAGATGTTGATGAAGATAAAGTGGCACGTGAATGGCTTGGAACATGTGCAGATGTGCTGAGGCTAAACGGAGTTGATGCTTCCCTAAGTTATGTCTGTCGACGTG
AATACACTATTGAATGTAACTGGAAGGTTTTTTGTGACAACTATTTAGATGGAGGATATCACGTTCCCTATGCACATAAAGGGCTTGCATCTAATCTCAAGCTTGAGTCT
TATTCTACAGAACTATTTGAAACTGTTAGCATTCAAAGTTGTAAGGGTGGGGGAGAATCAAAAGGTGACGATTATGGTCGACTTGGATCAGAAGCACTCTATGCTTTTGT
ATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGACACTAATCTAGTACTCCCACTTGGACCTCGAAAATGTCTGGTGGTTTTCGATTATTTTCTTGAAGCTT
CTTTTAAGAATGATGACTCCTTTATACAACTAAGTTTAGAAGACAGTGAAAGCGTGCAGAATGAAGACATTATTCTGTGTGAAGGAGTTCAAAAGGGTCTCGAGTCACCA
GCTTACAAGTTTGGCCGATATGCACCTTCGGTCGAGAATGCCATGCACCATTTCCATCGTCTTCTTCATTGTAACCTCACAAAATAAAAATAATTGAAAGAATGCCATGC
ACCATTTGCAAGATATTCTTTTAGGTTTTCATAGTTATCTTGGAAATGGTCTGAAATAACTTGCTTTTGTTACAGCAC
Protein sequenceShow/hide protein sequence
MAMLTKHIQVHFFQLPSTSFNFHSCNHRSPPRISAALSFRSPDSRLIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELNHVFYRGWQAVGYVEQLKDAHDFFTGR
LGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRIDGIQNFDENDFGLVPLPVATWGPFVLLNLDGKLSSKPDVDEDKVAREWL
GTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETVSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMD
TNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHCNLTK