CuGenDBv2

PI0004779 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0004779
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Choline monooxygenase, chloroplastic
Genome location: chr02:7943669..7947217
RNA-Seq Expression: PI0004779
Synteny: PI0004779
Gene Ontology terms:
GO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domains:
IPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase
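
The genome location field above uses a `chrom:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the string can be split and the length of the genomic span computed as follows. The function name `parse_location` is hypothetical and used only for illustration.

```python
import re

# Location string copied from the record above.
LOCATION = "chr02:7943669..7947217"


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location(LOCATION)

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 3549 bp on chr02, which includes any introns and untranslated regions in addition to the coding sequence.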


Homology
GenBank top hits (e value, % identity, alignment)
TYK04233.1 choline monooxygenase [Cucumis melo var. makuwa] | e value: 2.4e-239 | % identity: 93.05
Query:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIQIHFFQPPS  F+FHFCNHRSPSRI   LSFR+PDSRFIEARKLV DFDPE+PLEKALTPPSSWY DPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL
        KD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+LLKATRINGIQNF+ENDFGL+ LPVA WGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL

Query:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK
        DGKLSS++DVDEDKVAREWLG CAD+L LNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTE+FETVSIQSCKGGGESK
Subjt:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHHNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHHNLTK

XP_004139149.1 choline monooxygenase, chloroplastic isoform X1 [Cucumis sativus] | e value: 3.0e-234 | % identity: 91.85
Query:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PST F+FH CNHRSP RI   LSFRNPDSR IEARKLV DFDP++PLEKALTPPSSWY DPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL
        KD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDG+LLKATRI+GIQNFDENDFGL+ LPVATWGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL

Query:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK
        DGKLSS+ DVDEDKVAREWLG CAD+L LNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTE+FETVSIQSCKGGGESK
Subjt:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDD GRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHHNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHHNLTK

XP_008443631.1 PREDICTED: choline monooxygenase, chloroplastic isoform X2 [Cucumis melo] | e value: 3.1e-239 | % identity: 93.29
Query:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIQIHFFQPPS  F+FHFCNHRSPSRI   LSFR+PDSRFIEARKLV DFDPE+PLEKALTPPSSWY DPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL
        KD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+LLKATRINGIQNF+ENDFGL+ LPVA WGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL

Query:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK
        DGKLSS+ DVDEDKVAREWLG CAD+L LNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTE+FETVSIQSCKGGGESK
Subjt:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHHNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHHNLTK

XP_016899776.1 PREDICTED: choline monooxygenase, chloroplastic isoform X1 [Cucumis melo] | e value: 1.2e-235 | % identity: 88.99
Query:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAV------
        MA LTKHIQIHFFQPPS  F+FHFCNHRSPSRI   LSFR+PDSRFIEARKLV DFDPE+PLEKALTPPSSWY DPSFFALELDRVFYRGWQA+      
Subjt:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAV------

Query:  -------------GYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDEND
                     GYVEQLKD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+LLKATRINGIQNF+END
Subjt:  -------------GYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDEND

Query:  FGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS
        FGL+ LPVA WGPF+LLNLDGKLSS+ DVDEDKVAREWLG CAD+L LNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS
Subjt:  FGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS

Query:  TEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG
        TE+FETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG
Subjt:  TEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG

Query:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNLTK
        VQKGLESPAYKFGRYAPSVENAMHHFHRLLH NLTK
Subjt:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNLTK

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida] | e value: 1.3e-232 | % identity: 91.37
Query:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHI  HFFQPPS  F+ H CNHRSPSRI   LSFRN DS FIEA+KLV +FDPE+PLEKA+TPPSSWY DPSF+ALELDRVFYRGWQAVGYVEQL
Subjt:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL
        KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGK+SCFVCPYHGWTYGLDGVLLKATRINGIQNFDEN+FGLI LPVATWGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL

Query:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK
        DGKLSSELDVDEDKV  EWLG CADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTEIFETVSIQSCKGGGE+K
Subjt:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        G+DYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLG RKCLVVFDYFLEASFKND  FIQ+SLEDSESVQ EDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHHNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHHNLTK
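
Each alignment above carries a middle line in which a letter marks an identical residue, `+` marks a conservative substitution and a blank marks a mismatch. A rough sketch of how a percent-identity figure can be derived from that middle line, assuming identity is counted as identical positions divided by the aligned length (the usual BLAST convention, not something this page states), is shown below. The helper name `midline_identity` is hypothetical, and the example uses only the short final segment of the alignments, not a whole hit.

```python
def midline_identity(midline, aligned_length):
    """Percent identity: letters in the midline over the aligned length."""
    # Letters in the midline mark identical residues; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Final segment shared by the GenBank alignments above.
segment_query = "ENAMHHFHRLLHHNLTK"
segment_midline = "ENAMHHFHRLLH NLTK"

pct = midline_identity(segment_midline, len(segment_query))
print(f"{pct:.2f}")
```

To reproduce the reported figure for a whole hit, the midlines of all segments would need to be concatenated, with gap columns counted in the aligned length.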

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LXX3 Choline monooxygenase, chloroplastic | e value: 1.5e-234 | % identity: 91.85
Query:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIQ+HFFQ PST F+FH CNHRSP RI   LSFRNPDSR IEARKLV DFDP++PLEKALTPPSSWY DPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL
        KD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDG+LLKATRI+GIQNFDENDFGL+ LPVATWGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL

Query:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK
        DGKLSS+ DVDEDKVAREWLG CAD+L LNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTE+FETVSIQSCKGGGESK
Subjt:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDD GRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQ SLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHHNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHHNLTK

A0A1S3B9A0 Choline monooxygenase, chloroplastic | e value: 1.5e-239 | % identity: 93.29
Query:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIQIHFFQPPS  F+FHFCNHRSPSRI   LSFR+PDSRFIEARKLV DFDPE+PLEKALTPPSSWY DPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL
        KD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+LLKATRINGIQNF+ENDFGL+ LPVA WGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL

Query:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK
        DGKLSS+ DVDEDKVAREWLG CAD+L LNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTE+FETVSIQSCKGGGESK
Subjt:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHHNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHHNLTK

A0A1S4DUW5 choline monooxygenase, chloroplastic isoform X1 | e value: 5.9e-236 | % identity: 88.99
Query:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAV------
        MA LTKHIQIHFFQPPS  F+FHFCNHRSPSRI   LSFR+PDSRFIEARKLV DFDPE+PLEKALTPPSSWY DPSFFALELDRVFYRGWQA+      
Subjt:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAV------

Query:  -------------GYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDEND
                     GYVEQLKD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+LLKATRINGIQNF+END
Subjt:  -------------GYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDEND

Query:  FGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS
        FGL+ LPVA WGPF+LLNLDGKLSS+ DVDEDKVAREWLG CAD+L LNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS
Subjt:  FGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS

Query:  TEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG
        TE+FETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG
Subjt:  TEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEG

Query:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNLTK
        VQKGLESPAYKFGRYAPSVENAMHHFHRLLH NLTK
Subjt:  VQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNLTK

A0A5A7UY48 Choline monooxygenase, chloroplastic | e value: 1.5e-239 | % identity: 93.29
Query:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIQIHFFQPPS  F+FHFCNHRSPSRI   LSFR+PDSRFIEARKLV DFDPE+PLEKALTPPSSWY DPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL
        KD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+LLKATRINGIQNF+ENDFGL+ LPVA WGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL

Query:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK
        DGKLSS+ DVDEDKVAREWLG CAD+L LNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTE+FETVSIQSCKGGGESK
Subjt:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHHNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHHNLTK

A0A5D3BWX2 Choline monooxygenase, chloroplastic | e value: 1.2e-239 | % identity: 93.05
Query:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIQIHFFQPPS  F+FHFCNHRSPSRI   LSFR+PDSRFIEARKLV DFDPE+PLEKALTPPSSWY DPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL
        KD HDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+LLKATRINGIQNF+ENDFGL+ LPVA WGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNL

Query:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK
        DGKLSS++DVDEDKVAREWLG CAD+L LNGVDASLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTE+FETVSIQSCKGGGESK
Subjt:  DGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHHNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHHNLTK

SwissProt top hits (e value, % identity, alignment)
O04121 Choline monooxygenase, chloroplastic | e value: 9.8e-119 | % identity: 51.37
Query:  RKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        + LV++FDP++P E A TPPSSWYT+P+F++ EL+R+FY+GWQ  G  +Q+K+P+ +FTG LGNVEY+V +D   KV AFHNVC H AS+LA G GKKSC
Subjt:  RKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECN
        FVCPYHGW YG+DG L KA++    QN D  + GL+ L VA WGPF+L++LD  L    D     V  EWLG  A+ +  +  D SL ++ R E+ +E N
Subjt:  FVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WK+F DNYLD  YHVPYAHK  A+ L  ++Y T++ E V+IQ  + G  +K D + R+G +A YAF YPNF + RYGPWM T  + PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNL
        +E S  +D  +I++ +  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH  L   L
Subjt:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNL

O22553 Choline monooxygenase, chloroplastic | e value: 3.6e-121 | % identity: 53.01
Query:  RKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        R LV++FDPE+P E ALTPPS+WYT+P+F++ EL+R+FY+GWQ  GY EQ+K+ + +FTG LGNVEY+V +D   ++ AFHNVC H AS+LA G GKKSC
Subjt:  RKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECN
        FVCPYHGW YGLDG L KA++    QN D  + GL  L VA WGPFIL++LD  L +  D     V  EW+G+ A+ +  +  D +L +  R E+ +ECN
Subjt:  FVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLD  YHVPYAHK  A+ L  ++Y+TE+ E   IQ   G   +K D + RLG EA YAFIYPNF + RYG WM T  V+P+G RKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNL
        LE +  +D ++I + +  +++VQ ED +LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH  L
Subjt:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNL

Q93XE1 Choline monooxygenase, chloroplastic | e value: 1.1e-122 | % identity: 53.01
Query:  RKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        ++++++FDP++P E   TPPS+WYTDPS ++ ELDR+F +GWQ  GY +Q+K+P+ +FTG LGNVEY+VC+D   KV AFHNVC H AS+LA G GKKSC
Subjt:  RKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECN
        FVCPYHGW +GLDG L+KAT+    Q FD  + GL++L VA WGPF+L++LD +  SE     + V +EW+G CA+ +  +  D SL ++ R E+ +E N
Subjt:  FVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLD  YHVPYAHK  A+ L  ++Y T++ E V IQ       +K + + RLG EA YAFIYPNF + RYGPWM T  + PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNL
        LE +  ND  +I++S+  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH  L
Subjt:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNL

Q9LKN0 Choline monooxygenase, chloroplastic | e value: 2.0e-119 | % identity: 52.46
Query:  RKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        + LV DFDP +P E ALTPPSSWYT+P+F+A ELDR+FY+GWQ  GY +Q+K+ + +FTG LGNVEY+VC+D   KV AFHNVC H AS+LA G GKKSC
Subjt:  RKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECN
        FVCPYHGW YG++G L KA++    Q+ + ++ GL+ L VA WGPFIL++LD    S  +V +  V  EWLG CA+ +  +  D +L ++ R E+ IE N
Subjt:  FVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWLGRCADLLSLNGVDASLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WK+F DNYLD  YHVPYAHK  A+ L  ++Y T++   V+IQ   G   +  + + RLG +A YAF YPNF + RYGPWM T  ++PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNL
        +E S  +D  +I++ +  +++VQ ED++LCE VQKGLE+PAY+ GRY   +E  +HHFH  LH  L
Subjt:  LEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNL

Q9SZR0 Choline monooxygenase, chloroplastic | e value: 6.5e-155 | % identity: 63.9
Query:  FQPPSTFFSFHFCNHRSPSRI----FVPLSFRNPDSRFI--EARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDF
        F PPS   +  + N  S   +    F    F NP   F   +  KLV +FDP++PLE+A TPPSSWYTDP F++ ELDRVFY GWQAVGY +Q+K+  DF
Subjt:  FQPPSTFFSFHFCNHRSPSRI----FVPLSFRNPDSRFI--EARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDF

Query:  FTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSS
        FTGRLG+V++VVC+D N K+ AFHNVC HHAS+LASG G+KSCFVC YHGWTY L G L+KATR++GIQNF  ++ GL  L VA WGPF+LL +    S 
Subjt:  FTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSS

Query:  ELDVDEDK-VAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYG
        + +V+ D+ VA EWLG     LS  GVD+ LSY+CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YST IFE VSIQ C GG +   D + 
Subjt:  ELDVDEDK-VAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYG

Query:  RLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMH
        RLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+ S K+D++FI++SLE+S+ VQ ED++LCE VQ+GLES AY  GRYA  VE  MH
Subjt:  RLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMH

Query:  HFHRLLHHNL
        HFH LLHHNL
Subjt:  HFHRLLHHNL

Arabidopsis top hits (e value, % identity, alignment)
AT4G29890.1 choline monooxygenase, putative (CMO-like) | e value: 4.6e-156 | % identity: 63.9
Query:  FQPPSTFFSFHFCNHRSPSRI----FVPLSFRNPDSRFI--EARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDF
        F PPS   +  + N  S   +    F    F NP   F   +  KLV +FDP++PLE+A TPPSSWYTDP F++ ELDRVFY GWQAVGY +Q+K+  DF
Subjt:  FQPPSTFFSFHFCNHRSPSRI----FVPLSFRNPDSRFI--EARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDF

Query:  FTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSS
        FTGRLG+V++VVC+D N K+ AFHNVC HHAS+LASG G+KSCFVC YHGWTY L G L+KATR++GIQNF  ++ GL  L VA WGPF+LL +    S 
Subjt:  FTGRLGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSS

Query:  ELDVDEDK-VAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYG
        + +V+ D+ VA EWLG     LS  GVD+ LSY+CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YST IFE VSIQ C GG +   D + 
Subjt:  ELDVDEDK-VAREWLGRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYG

Query:  RLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMH
        RLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+ S K+D++FI++SLE+S+ VQ ED++LCE VQ+GLES AY  GRYA  VE  MH
Subjt:  RLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMH

Query:  HFHRLLHHNL
        HFH LLHHNL
Subjt:  HFHRLLHHNL


Sequences
CDS sequence
ATGGCGGCGTTGACGAAGCATATCCAAATTCACTTCTTCCAACCTCCTTCCACTTTCTTCAGTTTCCATTTCTGTAACCATCGTTCTCCCTCAAGAATCTTCGTGCCTTT
ATCCTTTCGAAACCCTGATTCTCGATTCATTGAAGCTCGGAAACTCGTTTATGATTTCGACCCTGAACTTCCATTAGAGAAGGCCCTCACTCCACCCTCCTCCTGGTATA
CAGACCCTTCATTTTTCGCTCTCGAGCTCGATCGTGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATCCCCATGACTTTTTCACAGGCAGG
TTGGGAAATGTAGAGTATGTGGTATGCAAAGATAATAACAGGAAGGTTCGTGCATTTCATAATGTTTGTCGCCATCATGCCTCACTTCTTGCGTCTGGATGTGGGAAGAA
GTCGTGCTTTGTATGCCCATATCATGGATGGACATACGGGTTGGATGGAGTACTGCTTAAGGCGACTAGAATAAATGGGATACAGAACTTCGATGAAAATGATTTTGGGC
TCATATCATTACCAGTAGCTACGTGGGGGCCTTTCATTCTTCTCAATTTGGATGGAAAATTATCATCTGAGCTGGATGTTGATGAAGATAAAGTGGCACGTGAATGGCTT
GGAAGATGTGCAGATCTGCTGAGTTTAAACGGAGTTGATGCTTCCTTAAGTTATGTCTGTCGACGTGAATACACTATTGAATGTAACTGGAAGGTATTTTGTGACAACTA
CTTAGATGGAGGATATCACGTTCCCTATGCACATAAAGGGCTTGCATCTAATCTCAAGCTTGAGTCTTATTCTACAGAAATATTTGAAACTGTTAGCATTCAAAGTTGTA
AGGGTGGGGGAGAATCAAAAGGTGATGATTATGGTCGACTTGGACCAGAAGCACTCTATGCTTTTATATACCCAAACTTCATGATAAATAGGTATGGACCTTGGATGGAC
ACTAATCTAGTACTCCCACTTGGACCTCGAAAATGTCTGGTGGTTTTCGATTATTTTCTTGAAGCTTCTTTTAAGAATGATGACTCCTTTATACAACAAAGTTTGGAAGA
CAGTGAAAGTGTGCAGAATGAAGACATTATTCTATGTGAAGGGGTTCAAAAGGGTCTCGAGTCACCAGCTTACAAGTTTGGCCGGTATGCTCCTTCGGTCGAGAATGCCA
TGCACCATTTCCATCGTCTTCTTCATCATAACCTCACAAAATAA
mRNA sequence
GGAAGCTTGAGGACTGTTGCAATGGCGGCGTTGACGAAGCATATCCAAATTCACTTCTTCCAACCTCCTTCCACTTTCTTCAGTTTCCATTTCTGTAACCATCGTTCTCC
CTCAAGAATCTTCGTGCCTTTATCCTTTCGAAACCCTGATTCTCGATTCATTGAAGCTCGGAAACTCGTTTATGATTTCGACCCTGAACTTCCATTAGAGAAGGCCCTCA
CTCCACCCTCCTCCTGGTATACAGACCCTTCATTTTTCGCTCTCGAGCTCGATCGTGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATCCC
CATGACTTTTTCACAGGCAGGTTGGGAAATGTAGAGTATGTGGTATGCAAAGATAATAACAGGAAGGTTCGTGCATTTCATAATGTTTGTCGCCATCATGCCTCACTTCT
TGCGTCTGGATGTGGGAAGAAGTCGTGCTTTGTATGCCCATATCATGGATGGACATACGGGTTGGATGGAGTACTGCTTAAGGCGACTAGAATAAATGGGATACAGAACT
TCGATGAAAATGATTTTGGGCTCATATCATTACCAGTAGCTACGTGGGGGCCTTTCATTCTTCTCAATTTGGATGGAAAATTATCATCTGAGCTGGATGTTGATGAAGAT
AAAGTGGCACGTGAATGGCTTGGAAGATGTGCAGATCTGCTGAGTTTAAACGGAGTTGATGCTTCCTTAAGTTATGTCTGTCGACGTGAATACACTATTGAATGTAACTG
GAAGGTATTTTGTGACAACTACTTAGATGGAGGATATCACGTTCCCTATGCACATAAAGGGCTTGCATCTAATCTCAAGCTTGAGTCTTATTCTACAGAAATATTTGAAA
CTGTTAGCATTCAAAGTTGTAAGGGTGGGGGAGAATCAAAAGGTGATGATTATGGTCGACTTGGACCAGAAGCACTCTATGCTTTTATATACCCAAACTTCATGATAAAT
AGGTATGGACCTTGGATGGACACTAATCTAGTACTCCCACTTGGACCTCGAAAATGTCTGGTGGTTTTCGATTATTTTCTTGAAGCTTCTTTTAAGAATGATGACTCCTT
TATACAACAAAGTTTGGAAGACAGTGAAAGTGTGCAGAATGAAGACATTATTCTATGTGAAGGGGTTCAAAAGGGTCTCGAGTCACCAGCTTACAAGTTTGGCCGGTATG
CTCCTTCGGTCGAGAATGCCATGCACCATTTCCATCGTCTTCTTCATCATAACCTCACAAAATAAAAATAAAAAATAATTGAAAGAAGTTTGTTTCTTATAAAACAGAAG
AAAACAAGTTTGGCAAGATATTTCTTTTAGGTTTTCATAGTTTGTCTTGGAAATGGTTTATAATAACTTGCTTTTGCTGAGGTTTGTAACACATAACCTCTCTCCTAAAT
GGAAAATACGGGTTCTTAAAAGTTTTGTATATGTTGTTATTCGTGCAGAAGGGTTCTTCAAAGAATTTATTTCTTAGTCTC
Protein sequence
MAALTKHIQIHFFQPPSTFFSFHFCNHRSPSRIFVPLSFRNPDSRFIEARKLVYDFDPELPLEKALTPPSSWYTDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGR
LGNVEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDENDFGLISLPVATWGPFILLNLDGKLSSELDVDEDKVAREWL
GRCADLLSLNGVDASLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFETVSIQSCKGGGESKGDDYGRLGPEALYAFIYPNFMINRYGPWMD
TNLVLPLGPRKCLVVFDYFLEASFKNDDSFIQQSLEDSESVQNEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHHNLTK
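
The CDS and protein sequences above should correspond through translation. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; it translates the opening codons of the CDS so the result can be compared with the start of the protein sequence. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 33 nucleotides of the CDS sequence listed above.
cds_start = "ATGGCGGCGTTGACGAAGCATATCCAAATTCAC"
print(translate(cds_start))
```

The output can be checked against the first residues of the protein sequence; passing the full CDS (with its line breaks) to `translate` would allow the same comparison over the whole record.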