; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G038700 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G038700
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionCholine monooxygenase, chloroplastic
Genome locationchrH02:14794102..14797342
RNA-Seq ExpressionChy2G038700
SyntenyChy2G038700
Gene Ontology termsGO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK04233.1 choline monooxygenase [Cucumis melo var. makuwa]1.02e-30394.24Show/hide
Query:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL
        MA LTK IQIHFFQ PS SFNFHFCN+RSPSRISA+LSFR+ DSRFIEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALELDR+FYRGWQAVGYVEQL
Subjt:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNF+ENDFGL PLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL

Query:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK
        DGKLSSK+DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFET SIQSCKGGGESK
Subjt:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRY PSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

XP_004139149.1 choline monooxygenase, chloroplastic isoform X1 [Cucumis sativus]1.36e-30896.16Show/hide
Query:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL
        MAMLTK IQ+HFFQLPSTSFNFH CN+RSP RISAALSFRN DSR IEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALEL+ +FYRGWQAVGYVEQL
Subjt:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNFDENDFGL PLPVATWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL

Query:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK
        DGKLSSK DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFET SIQSCKGGGESK
Subjt:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV
        GDD GRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRY PSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLHCNLTK
Subjt:  ENAMHHFHRLLHCNLTK

XP_008443631.1 PREDICTED: choline monooxygenase, chloroplastic isoform X2 [Cucumis melo]1.45e-30394.48Show/hide
Query:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL
        MA LTK IQIHFFQ PS SFNFHFCN+RSPSRISA+LSFR+ DSRFIEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALELDR+FYRGWQAVGYVEQL
Subjt:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNF+ENDFGL PLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL

Query:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK
        DGKLSSK DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFET SIQSCKGGGESK
Subjt:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRY PSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

XP_016899776.1 PREDICTED: choline monooxygenase, chloroplastic isoform X1 [Cucumis melo]1.59e-29890.14Show/hide
Query:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAV------
        MA LTK IQIHFFQ PS SFNFHFCN+RSPSRISA+LSFR+ DSRFIEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALELDR+FYRGWQA+      
Subjt:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAV------

Query:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDEND
                     GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNF+END
Subjt:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDEND

Query:  FGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS
        FGL PLPVA WGPFVLLNLDGKLSSK DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS
Subjt:  FGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS

Query:  TELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG
        TELFET SIQSCKGGGESKGDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEG
Subjt:  TELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG

Query:  VQKGLESPAYKFGRYTPSVENAMHHFHRLLHCNLTK
        VQKGLESPAYKFGRY PSVENAMHHFHRLLH NLTK
Subjt:  VQKGLESPAYKFGRYTPSVENAMHHFHRLLHCNLTK

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida]4.58e-29190.17Show/hide
Query:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL
        MA LTK I  HFFQ PS SFN H CN+RSPSRISAALSFRNSDS FIEA+KLVD+FDP+IPLEKA+TPPSSWYIDPSF+ALELDR+FYRGWQAVGYVEQL
Subjt:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL
        KD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGK+SCFVCPYHGWTYGLDG+LLKATRINGIQNFDEN+FGL PLPVATWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL

Query:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK
        DGKLSS+LDVDEDKV  EWLG+CAD+L LNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTE+FET SIQSCKGGGE+K
Subjt:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV
        G+DYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLG RKCLVVFDYFLEASFKND  FIQ SLEDSESVQ EDIILCEGVQKGLESPAYKFGRY PSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

TrEMBL top hitse value%identityAlignment
A0A0A0LXX3 Choline monooxygenase, chloroplastic5.9e-24496.16Show/hide
Query:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL
        MAMLTK IQ+HFFQLPSTSFNFH CN+RSP RISAALSFRN DSR IEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALEL+ +FYRGWQAVGYVEQL
Subjt:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNFDENDFGL PLPVATWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL

Query:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK
        DGKLSSK DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFET SIQSCKGGGESK
Subjt:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV
        GDD GRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRY PSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLHCNLTK
Subjt:  ENAMHHFHRLLHCNLTK

A0A1S3B9A0 Choline monooxygenase, chloroplastic4.0e-24094.48Show/hide
Query:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL
        MA LTK IQIHFFQ PS SFNFHFCN+RSPSRISA+LSFR+ DSRFIEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALELDR+FYRGWQAVGYVEQL
Subjt:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNF+ENDFGL PLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL

Query:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK
        DGKLSSK DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFET SIQSCKGGGESK
Subjt:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRY PSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

A0A1S4DUW5 choline monooxygenase, chloroplastic isoform X11.6e-23690.14Show/hide
Query:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAV------
        MA LTK IQIHFFQ PS SFNFHFCN+RSPSRISA+LSFR+ DSRFIEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALELDR+FYRGWQA+      
Subjt:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAV------

Query:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDEND
                     GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNF+END
Subjt:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDEND

Query:  FGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS
        FGL PLPVA WGPFVLLNLDGKLSSK DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS
Subjt:  FGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS

Query:  TELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG
        TELFET SIQSCKGGGESKGDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEG
Subjt:  TELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEG

Query:  VQKGLESPAYKFGRYTPSVENAMHHFHRLLHCNLTK
        VQKGLESPAYKFGRY PSVENAMHHFHRLLH NLTK
Subjt:  VQKGLESPAYKFGRYTPSVENAMHHFHRLLHCNLTK

A0A5A7UY48 Choline monooxygenase, chloroplastic4.0e-24094.48Show/hide
Query:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL
        MA LTK IQIHFFQ PS SFNFHFCN+RSPSRISA+LSFR+ DSRFIEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALELDR+FYRGWQAVGYVEQL
Subjt:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNF+ENDFGL PLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL

Query:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK
        DGKLSSK DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFET SIQSCKGGGESK
Subjt:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRY PSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

A0A5D3BWX2 Choline monooxygenase, chloroplastic3.1e-24094.24Show/hide
Query:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL
        MA LTK IQIHFFQ PS SFNFHFCN+RSPSRISA+LSFR+ DSRFIEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALELDR+FYRGWQAVGYVEQL
Subjt:  MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNF+ENDFGL PLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNL

Query:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK
        DGKLSSK+DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFET SIQSCKGGGESK
Subjt:  DGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESK

Query:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRY PSV
Subjt:  GDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSV

Query:  ENAMHHFHRLLHCNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHCNLTK

SwissProt top hitse value%identityAlignment
O04121 Choline monooxygenase, chloroplastic2.2e-11850.78Show/hide
Query:  RSPSRIS----AALSFRN-SDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDN
        R+P++I+    AA SF + + +     + LV +FDPQIP E A TPPSSWY +P+F++ EL+RIFY+GWQ  G  +Q+K+ + +FTG LGNVEY+V +D 
Subjt:  RSPSRIS----AALSFRN-SDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDN

Query:  NRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGT
          KV AFHNVC H AS+LA G GKKSCFVCPYHGW YG+DG L KA++    QN D  + GL PL VA WGPFVL++LD  L    D     V  EWLGT
Subjt:  NRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGT

Query:  CADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMI
         A+ ++ +  D SL ++ R E+ +E NWK+F DNYLD  YHVPYAHK  A+ L  ++Y T++ E  +IQ  + G  +K D + R+G +A YAF YPNF +
Subjt:  CADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMI

Query:  NRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFH
         RYGPWM T  + PLGPRKC +V DY++E S  +D  +I+  +  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH
Subjt:  NRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFH

O22553 Choline monooxygenase, chloroplastic1.4e-12052.89Show/hide
Query:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        R LV +FDP+IP E ALTPPS+WY +P+F++ EL+RIFY+GWQ  GY EQ+K+ + +FTG LGNVEY+V +D   ++ AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN
        FVCPYHGW YGLDG L KA++    QN D  + GL PL VA WGPF+L++LD  L +  D     V  EW+G  A+ ++ +  D +L +  R E+ +ECN
Subjt:  FVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLD  YHVPYAHK  A+ L  ++Y+TE+ E   IQ   G   +K D + RLG+EA YAF+YPNF + RYG WM T  V+P+G RKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFHRLLH
        LE +  +D ++I   +  +++VQ ED +LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH
Subjt:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFHRLLH

Q93XE1 Choline monooxygenase, chloroplastic6.1e-12150.91Show/hide
Query:  ISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNV
        I ++++  N  +     ++++ +FDP++P E   TPPS+WY DPS ++ ELDRIF +GWQ  GY +Q+K+ + +FTG LGNVEY+VC+D   KV AFHNV
Subjt:  ISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNV

Query:  CRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGTCADVLRLNGV
        C H AS+LA G GKKSCFVCPYHGW +GLDG L+KAT+    Q FD  + GL  L VA WGPFVL++LD   S       + V +EW+G+CA+ ++ +  
Subjt:  CRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGTCADVLRLNGV

Query:  DASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTN
        D SL ++ R E+ +E NWKVFCDNYLD  YHVPYAHK  A+ L  ++Y T+L E   IQ       +K + + RLGSEA YAF+YPNF + RYGPWM T 
Subjt:  DASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTN

Query:  LVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFHRLLHCNL
         + PLGPRKC +V DY+LE +  ND  +I+ S+  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH  L
Subjt:  LVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFHRLLHCNL

Q9LKN0 Choline monooxygenase, chloroplastic1.8e-12052.62Show/hide
Query:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        + LV DFDP +P E ALTPPSSWY +P+F+A ELDRIFY+GWQ  GY +Q+K+A+ +FTG LGNVEY+VC+D   KV AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN
        FVCPYHGW YG++G L KA++    Q+ + ++ GL PL VA WGPF+L++LD    S  +V +  V  EWLG+CA+ ++ +  D +L ++ R E+ IE N
Subjt:  FVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WK+F DNYLD  YHVPYAHK  A+ L  ++Y T++    +IQ   G   +  + + RLG++A YAF YPNF + RYGPWM T  ++PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFHRLLH
        +E S  +D  +I+  +  +++VQ ED++LCE VQKGLE+PAY+ GRY   +E  +HHFH  LH
Subjt:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFHRLLH

Q9SZR0 Choline monooxygenase, chloroplastic7.2e-15468.58Show/hide
Query:  KLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF
        KLV +FDP+IPLE+A TPPSSWY DP F++ ELDR+FY GWQAVGY +Q+K++ DFFTGRLG+V++VVC+D N K+ AFHNVC HHAS+LASG G+KSCF
Subjt:  KLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF

Query:  VCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIECN
        VC YHGWTY L G L+KATR++GIQNF  ++ GL+PL VA WGPFVLL +    S K +V+ D+ VA EWLGT    L   GVD+ LSY+CRREYTI+CN
Subjt:  VCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLDGGYHVPYAHKGL S L LE+YST +FE  SIQ C GG +   D + RLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKC VVFDYF
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFHRLLHCNL
        L+ S K+D++FI+ SLE+S+ VQ ED++LCE VQ+GLES AY  GRY   VE  MHHFH LLH NL
Subjt:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFHRLLHCNL

Arabidopsis top hitse value%identityAlignment
AT4G29890.1 choline monooxygenase, putative (CMO-like)5.1e-15568.58Show/hide
Query:  KLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF
        KLV +FDP+IPLE+A TPPSSWY DP F++ ELDR+FY GWQAVGY +Q+K++ DFFTGRLG+V++VVC+D N K+ AFHNVC HHAS+LASG G+KSCF
Subjt:  KLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF

Query:  VCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIECN
        VC YHGWTY L G L+KATR++GIQNF  ++ GL+PL VA WGPFVLL +    S K +V+ D+ VA EWLGT    L   GVD+ LSY+CRREYTI+CN
Subjt:  VCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLDGGYHVPYAHKGL S L LE+YST +FE  SIQ C GG +   D + RLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKC VVFDYF
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFHRLLHCNL
        L+ S K+D++FI+ SLE+S+ VQ ED++LCE VQ+GLES AY  GRY   VE  MHHFH LLH NL
Subjt:  LEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFHRLLHCNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATGTTGACGAAGCAGATCCAAATTCACTTCTTCCAACTTCCTTCCACTTCCTTCAATTTCCATTTCTGTAATTATCGTTCGCCCTCACGAATCTCCGCAGCTTT
ATCGTTTCGAAACTCTGATTCTCGATTCATTGAAGCTCGGAAACTCGTTGATGATTTCGACCCTCAAATTCCATTGGAGAAGGCCCTCACTCCACCCTCCTCCTGGTATA
TAGACCCTTCATTTTTCGCTCTCGAGCTCGATCGTATCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATGCCCATGACTTTTTCACAGGCAGG
TTGGGAAATGTAGAGTATGTGGTATGCAAAGATAATAACAGAAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCGTCTGGATGTGGGAAGAA
GTCGTGCTTTGTATGCCCATATCATGGATGGACATACGGGTTGGATGGAATACTGCTTAAGGCGACTAGAATAAATGGGATACAGAACTTCGATGAAAATGATTTTGGGC
TTGAACCATTACCAGTAGCTACGTGGGGGCCTTTCGTTCTTCTCAATTTGGATGGAAAATTATCATCTAAGCTGGATGTTGATGAAGATAAAGTGGCACGTGAATGGCTT
GGAACATGTGCAGATGTGCTGAGGTTAAACGGAGTTGATGCTTCCCTAAGTTATGTCTGTCGACGTGAATACACTATTGAATGTAACTGGAAGGTTTTTTGTGACAACTA
TTTAGATGGAGGATATCACGTTCCCTATGCACATAAAGGGCTTGCATCTAATCTCAAGCTTGAGTCTTATTCTACAGAACTATTTGAAACTGGTAGCATTCAAAGTTGTA
AGGGTGGGGGAGAATCAAAAGGTGATGATTATGGTCGACTTGGATCAGAAGCACTCTATGCTTTTGTATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGAC
ACTAATCTAGTACTCCCACTTGGACCGCGAAAATGTCTGGTGGTTTTCGATTATTTTCTTGAAGCTTCTTTTAAGAATGATGACTCCTTTATACAACTAAGTTTAGAAGA
CAGTGAAAGTGTGCAGAATGAAGACATTATTCTGTGTGAAGGAGTTCAAAAGGGTCTCGAGTCACCAGCTTACAAGTTTGGCCGATATACACCTTCGGTCGAGAATGCCA
TGCACCATTTCCATCGTCTTCTTCATTGTAACCTCACAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGATGTTGACGAAGCAGATCCAAATTCACTTCTTCCAACTTCCTTCCACTTCCTTCAATTTCCATTTCTGTAATTATCGTTCGCCCTCACGAATCTCCGCAGCTTT
ATCGTTTCGAAACTCTGATTCTCGATTCATTGAAGCTCGGAAACTCGTTGATGATTTCGACCCTCAAATTCCATTGGAGAAGGCCCTCACTCCACCCTCCTCCTGGTATA
TAGACCCTTCATTTTTCGCTCTCGAGCTCGATCGTATCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATGCCCATGACTTTTTCACAGGCAGG
TTGGGAAATGTAGAGTATGTGGTATGCAAAGATAATAACAGAAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCGTCTGGATGTGGGAAGAA
GTCGTGCTTTGTATGCCCATATCATGGATGGACATACGGGTTGGATGGAATACTGCTTAAGGCGACTAGAATAAATGGGATACAGAACTTCGATGAAAATGATTTTGGGC
TTGAACCATTACCAGTAGCTACGTGGGGGCCTTTCGTTCTTCTCAATTTGGATGGAAAATTATCATCTAAGCTGGATGTTGATGAAGATAAAGTGGCACGTGAATGGCTT
GGAACATGTGCAGATGTGCTGAGGTTAAACGGAGTTGATGCTTCCCTAAGTTATGTCTGTCGACGTGAATACACTATTGAATGTAACTGGAAGGTTTTTTGTGACAACTA
TTTAGATGGAGGATATCACGTTCCCTATGCACATAAAGGGCTTGCATCTAATCTCAAGCTTGAGTCTTATTCTACAGAACTATTTGAAACTGGTAGCATTCAAAGTTGTA
AGGGTGGGGGAGAATCAAAAGGTGATGATTATGGTCGACTTGGATCAGAAGCACTCTATGCTTTTGTATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGAC
ACTAATCTAGTACTCCCACTTGGACCGCGAAAATGTCTGGTGGTTTTCGATTATTTTCTTGAAGCTTCTTTTAAGAATGATGACTCCTTTATACAACTAAGTTTAGAAGA
CAGTGAAAGTGTGCAGAATGAAGACATTATTCTGTGTGAAGGAGTTCAAAAGGGTCTCGAGTCACCAGCTTACAAGTTTGGCCGATATACACCTTCGGTCGAGAATGCCA
TGCACCATTTCCATCGTCTTCTTCATTGTAACCTCACAAAATAA
Protein sequenceShow/hide protein sequence
MAMLTKQIQIHFFQLPSTSFNFHFCNYRSPSRISAALSFRNSDSRFIEARKLVDDFDPQIPLEKALTPPSSWYIDPSFFALELDRIFYRGWQAVGYVEQLKDAHDFFTGR
LGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFDENDFGLEPLPVATWGPFVLLNLDGKLSSKLDVDEDKVAREWL
GTCADVLRLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTELFETGSIQSCKGGGESKGDDYGRLGSEALYAFVYPNFMINRYGPWMD
TNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQLSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYTPSVENAMHHFHRLLHCNLTK