; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C009992 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C009992
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionCholine monooxygenase, chloroplastic
Genome locationchr02:11074210..11077789
RNA-Seq ExpressionMELO3C009992
SyntenyMELO3C009992
Gene Ontology termsGO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK04233.1 choline monooxygenase [Cucumis melo var. makuwa]3.8e-25399.52Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
        DGKLSSK DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHRNLTK
        ENAMHHFHRLLHRNLTK
Subjt:  ENAMHHFHRLLHRNLTK

XP_004139149.1 choline monooxygenase, chloroplastic isoform X1 [Cucumis sativus]2.2e-24094.48Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PS SFNFH CNHRSP RISA+LSFR+PDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDD GRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHRNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHRNLTK

XP_008443631.1 PREDICTED: choline monooxygenase, chloroplastic isoform X2 [Cucumis melo]1.5e-254100Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHRNLTK
        ENAMHHFHRLLHRNLTK
Subjt:  ENAMHHFHRLLHRNLTK

XP_016899776.1 PREDICTED: choline monooxygenase, chloroplastic isoform X1 [Cucumis melo]6.1e-25195.41Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV------
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQA+      
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV------

Query:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNEND
                     GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNEND
Subjt:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNEND

Query:  FGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS
        FGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS
Subjt:  FGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS

Query:  TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG
        TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG
Subjt:  TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG

Query:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNLTK
        VQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNLTK
Subjt:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNLTK

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida]5.7e-23390.65Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPSISFN H CNHRSPSRISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSF+ALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
        KD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGK+SCFVCPYHGWTYGLDG+LLKATRINGIQNF+EN+FGL+PLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
        DGKLSS+ DVDEDKV  EWLG+CAD+L LNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL L+SYSTE+FETVSIQSCKGGGE+K
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        G+DYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLG RKCLVVFDYFLEASFKND  FIQ+SLEDSESVQ EDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHRNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHRNLTK

TrEMBL top hitse value%identityAlignment
A0A0A0LXX3 Choline monooxygenase, chloroplastic1.0e-24094.48Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PS SFNFH CNHRSP RISA+LSFR+PDSR IEARKLVDDFDP+IPLEKALTPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDGILLKATRI+GIQNF+ENDFGLVPLPVA WGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDD GRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHRNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHRNLTK

A0A1S3B9A0 Choline monooxygenase, chloroplastic7.5e-255100Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHRNLTK
        ENAMHHFHRLLHRNLTK
Subjt:  ENAMHHFHRLLHRNLTK

A0A1S4DUW5 choline monooxygenase, chloroplastic isoform X12.9e-25195.41Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV------
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQA+      
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV------

Query:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNEND
                     GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNEND
Subjt:  -------------GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNEND

Query:  FGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS
        FGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS
Subjt:  FGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYS

Query:  TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG
        TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG
Subjt:  TELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG

Query:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNLTK
        VQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNLTK
Subjt:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNLTK

A0A5A7UY48 Choline monooxygenase, chloroplastic7.5e-255100Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
        DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHRNLTK
        ENAMHHFHRLLHRNLTK
Subjt:  ENAMHHFHRLLHRNLTK

A0A5D3BWX2 Choline monooxygenase, chloroplastic1.8e-25399.52Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
        KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL
Subjt:  KDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNL

Query:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
        DGKLSSK DVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK
Subjt:  DGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHRNLTK
        ENAMHHFHRLLHRNLTK
Subjt:  ENAMHHFHRLLHRNLTK

SwissProt top hitse value%identityAlignment
O04121 Choline monooxygenase, chloroplastic5.2e-12049.62Show/hide
Query:  RSPSRISA---------SLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVV
        R+P++I+          SL+  +P S     + LV +FDP+IP E A TPPSSWY +P+F++ EL+R+FY+GWQ  G  +Q+K+ + +FTG LGNVEY+V
Subjt:  RSPSRISA---------SLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVV

Query:  CKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVARE
         +D   KV AFHNVC H AS+LA G GKKSCFVCPYHGW YG+DG L KA++    QN +  + GLVPL VA+WGPFVL++LD  L    D     V  E
Subjt:  CKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVARE

Query:  WLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYP
        WLGT A+ ++ +  D SL ++ R E+ ++ NWK+F DNYLD  YHVPYAHK  A+ LN ++Y T++ E V+IQ  + G  +K D + R+G +A YAF YP
Subjt:  WLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYP

Query:  NFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNL
        NF + RYGPWM T  + PLGPRKC +V DY++E S  +D  +I++ +  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH  L + L
Subjt:  NFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNL

O22553 Choline monooxygenase, chloroplastic8.0e-12152.46Show/hide
Query:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSC
        R LV +FDPEIP E ALTPPS+WY +P+F++ EL+R+FY+GWQ  GY EQ+K+ + +FTG LGNVEY+V +D   ++ AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSC

Query:  FVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCN
        FVCPYHGW YGLDG L KA++    QN +  + GL PL VA WGPF+L++LD  L +  D     V  EW+G  A+ ++ +  D +L +  R E+ ++CN
Subjt:  FVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLD  YHVPYAHK  A+ L+ ++Y+TE+ E   IQ   G   +K D + RLG EA YAFIYPNF + RYG WM T  V+P+G RKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNL
        LE +  +D ++I + +  +++VQ ED +LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH  L
Subjt:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNL

Q93XE1 Choline monooxygenase, chloroplastic9.4e-12253.01Show/hide
Query:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSC
        ++++ +FDP++P E   TPPS+WY DPS ++ ELDR+F +GWQ  GY +Q+K+ + +FTG LGNVEY+VC+D   KV AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSC

Query:  FVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCN
        FVCPYHGW +GLDG L+KAT+    Q F+  + GLV L VA+WGPFVL++LD    S  +  ED V +EW+G+CA+ ++ +  D SL ++ R E+ ++ N
Subjt:  FVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLD  YHVPYAHK  A+ L+ ++Y T+L E V IQ       +K + + RLG EA YAFIYPNF + RYGPWM T  + PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNL
        LE +  ND  +I++S+  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH+ L
Subjt:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNL

Q9LKN0 Choline monooxygenase, chloroplastic3.2e-12252.73Show/hide
Query:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSC
        + LV DFDP +P E ALTPPSSWY +P+F+A ELDR+FY+GWQ  GY +Q+K+A+ +FTG LGNVEY+VC+D   KV AFHNVC H AS+LA G GKKSC
Subjt:  RKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSC

Query:  FVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCN
        FVCPYHGW YG++G L KA++    Q+ N ++ GLVPL VA+WGPF+L++LD    S  +V +  V  EWLG+CA+ ++ +  D +L ++ R E+ I+ N
Subjt:  FVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWLGTCADVLRLNGVDASLSYVCRREYTIDCN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WK+F DNYLD  YHVPYAHK  A+ L+ ++Y T++   V+IQ   G   +  + + RLG +A YAF YPNF + RYGPWM T  ++PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNL
        +E S  +D  +I++ +  +++VQ ED++LCE VQKGLE+PAY+ GRY   +E  +HHFH  LH+ L
Subjt:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNL

Q9SZR0 Choline monooxygenase, chloroplastic6.9e-15763.27Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISAS----LSFRSPDSRFI--EARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV
        M TLT    +  F PPS+     + N  S   +S S      F +P   F   +  KLV +FDP+IPLE+A TPPSSWY DP F++ ELDRVFY GWQAV
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISAS----LSFRSPDSRFI--EARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV

Query:  GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGP
        GY +Q+K++ DFFTGRLG+V++VVC+D N K+ AFHNVC HHAS+LA+G G+KSCFVC YHGWTY L G L+KATR++GIQNF+ ++ GL PL VA+WGP
Subjt:  GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGP

Query:  FVLLNLDGKLSSKPDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSC
        FVLL +    S K +V+ D+ VA EWLGT    L   GVD+ LSY+CRREYTIDCNWKVFCDNYLDGGYHVPYAHKGL S L+LE+YST +FE VSIQ C
Subjt:  FVLLNLDGKLSSKPDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSC

Query:  KGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKF
         GG +   D + RLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+ S K+D++FI++SLE+S+ VQ ED++LCE VQ+GLES AY  
Subjt:  KGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKF

Query:  GRYAPSVENAMHHFHRLLHRNL
        GRYA  VE  MHHFH LLH NL
Subjt:  GRYAPSVENAMHHFHRLLHRNL

Arabidopsis top hitse value%identityAlignment
AT4G29890.1 choline monooxygenase, putative (CMO-like)4.9e-15863.27Show/hide
Query:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISAS----LSFRSPDSRFI--EARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV
        M TLT    +  F PPS+     + N  S   +S S      F +P   F   +  KLV +FDP+IPLE+A TPPSSWY DP F++ ELDRVFY GWQAV
Subjt:  MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISAS----LSFRSPDSRFI--EARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAV

Query:  GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGP
        GY +Q+K++ DFFTGRLG+V++VVC+D N K+ AFHNVC HHAS+LA+G G+KSCFVC YHGWTY L G L+KATR++GIQNF+ ++ GL PL VA+WGP
Subjt:  GYVEQLKDAHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGP

Query:  FVLLNLDGKLSSKPDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSC
        FVLL +    S K +V+ D+ VA EWLGT    L   GVD+ LSY+CRREYTIDCNWKVFCDNYLDGGYHVPYAHKGL S L+LE+YST +FE VSIQ C
Subjt:  FVLLNLDGKLSSKPDVDEDK-VAREWLGTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSC

Query:  KGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKF
         GG +   D + RLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+ S K+D++FI++SLE+S+ VQ ED++LCE VQ+GLES AY  
Subjt:  KGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKF

Query:  GRYAPSVENAMHHFHRLLHRNL
        GRYA  VE  MHHFH LLH NL
Subjt:  GRYAPSVENAMHHFHRLLHRNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACTTTGACGAAGCATATCCAAATTCACTTCTTCCAACCTCCTTCCATTTCTTTCAATTTCCATTTCTGTAATCATCGTTCTCCCTCACGAATCTCCGCGTCTTT
ATCCTTTCGAAGCCCTGATTCTCGATTCATTGAAGCTCGGAAACTCGTTGATGATTTCGACCCTGAAATTCCATTGGAGAAGGCCCTCACTCCACCCTCCTCCTGGTATA
TAGACCCTTCATTTTTCGCTCTCGAGCTCGATCGTGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATGCCCATGACTTTTTCACAGGCAGG
TTGGGAAATGTAGAGTATGTGGTATGCAAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCGACTGGATGTGGGAAGAA
GTCGTGCTTTGTATGCCCCTATCATGGATGGACATACGGGTTGGATGGAATACTGCTTAAGGCGACTAGAATAAATGGGATACAGAACTTCAATGAAAATGATTTTGGGC
TTGTACCATTACCAGTAGCTATGTGGGGGCCTTTCGTTCTTCTCAATTTGGATGGAAAATTATCATCTAAGCCGGATGTTGATGAAGATAAAGTGGCACGTGAATGGCTT
GGAACATGTGCAGATGTGCTGAGATTAAACGGAGTTGATGCTTCCCTAAGTTATGTCTGTCGACGTGAATACACTATTGATTGTAACTGGAAGGTTTTTTGTGACAACTA
TTTAGATGGAGGATATCACGTTCCCTATGCACATAAAGGGCTTGCATCTAATCTCAACCTTGAGTCTTATTCTACAGAATTATTTGAAACTGTTAGCATTCAAAGTTGTA
AGGGTGGGGGAGAATCAAAAGGTGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTTATATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGAC
ACCAATCTAGTACTCCCACTTGGACCTCGAAAATGTTTGGTGGTTTTCGATTATTTTCTTGAAGCTTCTTTTAAGAATGATGACTCCTTTATACAACAAAGTTTAGAAGA
CAGTGAAAGTGTGCAGAATGAAGACATTATTCTGTGCGAAGGAGTTCAAAAGGGTCTCGAGTCACCAGCTTACAAGTTTGGCCGGTATGCTCCTTCGGTTGAGAATGCCA
TGCACCATTTCCATCGTCTTCTTCATCGTAACCTCACAAAATAA
mRNA sequenceShow/hide mRNA sequence
TTTTAAAAAAAGAAAATATTTGTGTGTTAGTTAGCCACCAAAACGGTAGAATAAAATAAGTTGTCCTCGGAGAAGAAGGTCGAGGACTGTTGCTGTTGCAATGGCGACTT
TGACGAAGCATATCCAAATTCACTTCTTCCAACCTCCTTCCATTTCTTTCAATTTCCATTTCTGTAATCATCGTTCTCCCTCACGAATCTCCGCGTCTTTATCCTTTCGA
AGCCCTGATTCTCGATTCATTGAAGCTCGGAAACTCGTTGATGATTTCGACCCTGAAATTCCATTGGAGAAGGCCCTCACTCCACCCTCCTCCTGGTATATAGACCCTTC
ATTTTTCGCTCTCGAGCTCGATCGTGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATGCCCATGACTTTTTCACAGGCAGGTTGGGAAATG
TAGAGTATGTGGTATGCAAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCGACTGGATGTGGGAAGAAGTCGTGCTTT
GTATGCCCCTATCATGGATGGACATACGGGTTGGATGGAATACTGCTTAAGGCGACTAGAATAAATGGGATACAGAACTTCAATGAAAATGATTTTGGGCTTGTACCATT
ACCAGTAGCTATGTGGGGGCCTTTCGTTCTTCTCAATTTGGATGGAAAATTATCATCTAAGCCGGATGTTGATGAAGATAAAGTGGCACGTGAATGGCTTGGAACATGTG
CAGATGTGCTGAGATTAAACGGAGTTGATGCTTCCCTAAGTTATGTCTGTCGACGTGAATACACTATTGATTGTAACTGGAAGGTTTTTTGTGACAACTATTTAGATGGA
GGATATCACGTTCCCTATGCACATAAAGGGCTTGCATCTAATCTCAACCTTGAGTCTTATTCTACAGAATTATTTGAAACTGTTAGCATTCAAAGTTGTAAGGGTGGGGG
AGAATCAAAAGGTGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTTATATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGACACCAATCTAG
TACTCCCACTTGGACCTCGAAAATGTTTGGTGGTTTTCGATTATTTTCTTGAAGCTTCTTTTAAGAATGATGACTCCTTTATACAACAAAGTTTAGAAGACAGTGAAAGT
GTGCAGAATGAAGACATTATTCTGTGCGAAGGAGTTCAAAAGGGTCTCGAGTCACCAGCTTACAAGTTTGGCCGGTATGCTCCTTCGGTTGAGAATGCCATGCACCATTT
CCATCGTCTTCTTCATCGTAACCTCACAAAATAAAAATAATTGAAAGAAATTTGTTCTTATAAAACAGAAGAAAGCAAGTTTGGTAAGATATTTCTTTTAGGTTTTCATA
GTTTATCTTGGAAACTATCAATAATAACTTGCTTTCGCTGCAGTGCCTTTGAGTTGTTGAAATGATGTGTACCAAATATGAGGATTTTATTAATGTTTTAAAAGTTGTGA
ATATAAGAACGTGAGTACTTTATTTATAATTTAAAAACAAACTAACGGTAATCCTAATTAA
Protein sequenceShow/hide protein sequence
MATLTKHIQIHFFQPPSISFNFHFCNHRSPSRISASLSFRSPDSRFIEARKLVDDFDPEIPLEKALTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDAHDFFTGR
LGNVEYVVCKDNNRKVRAFHNVCRHHASLLATGCGKKSCFVCPYHGWTYGLDGILLKATRINGIQNFNENDFGLVPLPVAMWGPFVLLNLDGKLSSKPDVDEDKVAREWL
GTCADVLRLNGVDASLSYVCRREYTIDCNWKVFCDNYLDGGYHVPYAHKGLASNLNLESYSTELFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMD
TNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHRNLTK