CuGenDBv2

CmoCh10G003970 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh10G003970
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Choline monooxygenase, chloroplastic
Genome location: Cmo_Chr10:1793947..1798117
RNA-Seq Expression: CmoCh10G003970
Synteny: CmoCh10G003970
Gene Ontology terms:
  GO:0019285 - glycine betaine biosynthetic process from choline (biological process)
  GO:0009570 - chloroplast stroma (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0031967 - organelle envelope (cellular component)
  GO:0005506 - iron ion binding (molecular function)
  GO:0019133 - choline monooxygenase activity (molecular function)
  GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domains:
  IPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
  IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
  IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
  IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
  IPR044637 - Aromatic-ring-hydroxylating dioxygenase
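
The genome location above follows a `chromosome:start..end` pattern. The following is a minimal Python sketch for splitting such a string and computing the span it covers. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the function name `parse_location` is hypothetical.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Cmo_Chr10:1793947..1798117")
# Span length under the 1-based, inclusive assumption stated above.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 4171 bp on Cmo_Chr10, introns and untranslated regions included.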


Homology
GenBank top hits | e value | %identity | Alignment
KAG6589681.1 Choline monooxygenase, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 9.7e-249 | 98.32
Query:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRS+RS TRIS TLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL

Query:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDED+VAHEWLGSCADLLSLNGVDASL FVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

XP_022965039.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita moschata] | 1.1e-252 | 100
Query:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL

Query:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

XP_022987329.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita maxima] | 1.9e-244 | 91.13
Query:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRS+RSPTRIS TLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL

Query:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDED+VA+EWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMI----------------------------------NRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLE
        GDDYGRLGPEALYAFVYPNFMI                                  NRYGPWMDTNLVLPLGP+KCLVVFDYFLETPFKNDESFIQQSLE
Subjt:  GDDYGRLGPEALYAFVYPNFMI----------------------------------NRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLE

Query:  DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK
        DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK
Subjt:  DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK

XP_022987330.1 choline monooxygenase, chloroplastic isoform X2 [Cucurbita maxima] | 1.1e-249 | 98.56
Query:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRS+RSPTRIS TLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL

Query:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDED+VA+EWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGP+KCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

XP_023515603.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] | 9.7e-249 | 98.56
Query:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRS RSPTRIS TLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL

Query:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDED+VA+EWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSE VQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

TrEMBL top hits | e value | %identity | Alignment
A0A5A7UY48 Choline monooxygenase, chloroplastic | 1.5e-223 | 88.25
Query:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
        MA LTKHI  HFFQP   SFNF   ++RSP+RIS++LSFR+ DS FIEA KLVD+FDPEIPLEKA+TPPSSWY DPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
        KD HDFFTGRLGNVEYVVCKDNN+KVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG LLKATRINGIQNF+ NDFGLVPLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL

Query:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        D KLSS+  VDED+VA EWLG+CAD+L LNGVDASLS+VCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTE+FE VSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE  FKND+SFIQQSLEDSESVQ EDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHLNLTK

A0A5D3BWX2 Choline monooxygenase, chloroplastic | 4.0e-224 | 88.25
Query:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
        MA LTKHI  HFFQP   SFNF   ++RSP+RIS++LSFR+ DS FIEA KLVD+FDPEIPLEKA+TPPSSWY DPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
        KD HDFFTGRLGNVEYVVCKDNN+KVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG LLKATRINGIQNF+ NDFGLVPLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL

Query:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        D KLSS++ VDED+VA EWLG+CAD+L LNGVDASLS+VCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTE+FE VSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+ FKND+SFIQQSLEDSESVQ EDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHLNLTK

A0A6J1HPX3 Choline monooxygenase, chloroplastic | 5.4e-253 | 100
Query:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL

Query:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

A0A6J1JGJ2 Choline monooxygenase, chloroplastic | 5.5e-250 | 98.56
Query:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRS+RSPTRIS TLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL

Query:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDED+VA+EWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGP+KCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

A0A6J1JIK0 Choline monooxygenase, chloroplastic | 9.1e-245 | 91.13
Query:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRS+RSPTRIS TLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNL

Query:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDED+VA+EWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMI----------------------------------NRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLE
        GDDYGRLGPEALYAFVYPNFMI                                  NRYGPWMDTNLVLPLGP+KCLVVFDYFLETPFKNDESFIQQSLE
Subjt:  GDDYGRLGPEALYAFVYPNFMI----------------------------------NRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLE

Query:  DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK
        DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK
Subjt:  DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK

SwissProt top hits | e value | %identity | Alignment
O04121 Choline monooxygenase, chloroplastic | 1.5e-119 | 52.47
Query:  LVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFV
        LV EFDP+IP E A TPPSSWYT+P+F++ EL+R+F++GWQ  G  +Q+K+P+ +FTG LGNVEY+V +D   KV AFHNVC H AS+LA G GKKSCFV
Subjt:  LVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFV

Query:  CPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWK
        CPYHGW YG+DG L KA++    QN D  + GLVPL VA WGPFVL++LD  L      +  +V  EWLG+ A+ +  +  D SL F+ R E+ +E NWK
Subjt:  CPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWK

Query:  VFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE
        +F DNYLD  YHVPYAHK  A+ L  ++Y T++ E V+IQ  + G  +K D + R+G +A YAF YPNF + RYGPWM T  + PLGPRKC +V DY++E
Subjt:  VFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE

Query:  TPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL
            +D+ +I++ +  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH  L   L
Subjt:  TPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL

O22553 Choline monooxygenase, chloroplastic | 4.0e-120 | 52.75
Query:  LVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFV
        LV EFDPEIP E A+TPPS+WYT+P+F++ EL+R+F++GWQ  GY EQ+K+ + +FTG LGNVEY+V +D   ++ AFHNVC H AS+LA G GKKSCFV
Subjt:  LVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFV

Query:  CPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWK
        CPYHGW YGLDG L KA++    QN D  + GL PL VA WGPF+L++LD  L +       +V  EW+G  A+ +  +  D +L F  R E+ +ECNWK
Subjt:  CPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWK

Query:  VFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE
        VFCDNYLD  YHVPYAHK  A+ L  ++Y+TE+ E   IQ   G   +K D + RLG EA YAF+YPNF + RYG WM T  V+P+G RKC +V DY+LE
Subjt:  VFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE

Query:  TPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL
            +D+++I + +  +++VQ ED +LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH  L
Subjt:  TPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL

Q93XE1 Choline monooxygenase, chloroplastic | 1.0e-123 | 52.22
Query:  ISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNV
        I S+++  N  +      +++ EFDP++P E   TPPS+WYTDPS ++ ELDR+F +GWQ  GY +Q+K+P+ +FTG LGNVEY+VC+D   KV AFHNV
Subjt:  ISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNV

Query:  CRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDEVAHEWLGSCADLLSLNGV
        C H AS+LA G GKKSCFVCPYHGW +GLDG L+KAT+    Q FD  + GLV L VA WGPFVL++LD   S       ++V  EW+GSCA+ +  +  
Subjt:  CRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDEVAHEWLGSCADLLSLNGV

Query:  DASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTN
        D SL F+ R E+ +E NWKVFCDNYLD  YHVPYAHK  A+ L  ++Y T++ E V IQ       +K + + RLG EA YAF+YPNF + RYGPWM T 
Subjt:  DASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTN

Query:  LVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL
         + PLGPRKC +V DY+LE    ND+ +I++S+  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH  L
Subjt:  LVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL

Q9LKN0 Choline monooxygenase, chloroplastic | 6.8e-120 | 53.19
Query:  LVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFV
        LV +FDP +P E A+TPPSSWYT+P+F+A ELDR+F++GWQ  GY +Q+K+ + +FTG LGNVEY+VC+D   KV AFHNVC H AS+LA G GKKSCFV
Subjt:  LVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFV

Query:  CPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWK
        CPYHGW YG++G L KA++    Q+ + ++ GLVPL VA WGPF+L++LD   SS  V D   V  EWLGSCA+ +  +  D +L F+ R E+ IE NWK
Subjt:  CPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDEVAHEWLGSCADLLSLNGVDASLSFVCRREYTIECNWK

Query:  VFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE
        +F DNYLD  YHVPYAHK  A+ L  ++Y T++   V+IQ   G   +  + + RLG +A YAF YPNF + RYGPWM T  ++PLGPRKC +V DY++E
Subjt:  VFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE

Query:  TPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH
            +D+ +I++ +  +++VQ ED++LCE VQKGLE+PAY+ GRY   +E  +HHFH  LH
Subjt:  TPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH

Q9SZR0 Choline monooxygenase, chloroplastic | 1.7e-155 | 66.33
Query:  FQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKD
        F  R + +PTR+     F  SD       KLV EFDP+IPLE+A TPPSSWYTDP F++ ELDRVF+ GWQAVGY +Q+K+  DFFTGRLG+V++VVC+D
Subjt:  FQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKD

Query:  NNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDE-VAHEWL
         N K+ AFHNVC HHAS+LASG G+KSCFVC YHGWTY L G L+KATR++GIQNF +++ GL PL VA WGPFVLL +    S +  V+ DE VA EWL
Subjt:  NNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDE-VAHEWL

Query:  GSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNF
        G+    LS  GVD+ LS++CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YST IFE VSIQ C GG +   D + RLG EALYAFVYPNF
Subjt:  GSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNF

Query:  MINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL
        MINRYGPWMDTNLVLPLGPRKC VVFDYFL+   K+DE+FI++SLE+S+ VQ+ED++LCE VQ+GLES AY  GRYA  VE  MHHFH LLH NL
Subjt:  MINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL

Arabidopsis top hits | e value | %identity | Alignment
AT4G29890.1 choline monooxygenase, putative (CMO-like) | 1.2e-156 | 66.33
Query:  FQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKD
        F  R + +PTR+     F  SD       KLV EFDP+IPLE+A TPPSSWYTDP F++ ELDRVF+ GWQAVGY +Q+K+  DFFTGRLG+V++VVC+D
Subjt:  FQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKD

Query:  NNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDE-VAHEWL
         N K+ AFHNVC HHAS+LASG G+KSCFVC YHGWTY L G L+KATR++GIQNF +++ GL PL VA WGPFVLL +    S +  V+ DE VA EWL
Subjt:  NNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDE-VAHEWL

Query:  GSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNF
        G+    LS  GVD+ LS++CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YST IFE VSIQ C GG +   D + RLG EALYAFVYPNF
Subjt:  GSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNF

Query:  MINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL
        MINRYGPWMDTNLVLPLGPRKC VVFDYFL+   K+DE+FI++SLE+S+ VQ+ED++LCE VQ+GLES AY  GRYA  VE  MHHFH LLH NL
Subjt:  MINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL
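
The %identity values listed with each hit can be cross-checked against the middle line of each alignment. In BLAST-style output that line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. The following is a minimal Python sketch, assuming percent identity is the number of identical positions divided by the alignment length; the function name `percent_identity` is hypothetical.

```python
def percent_identity(midline):
    """Percent identity from a BLAST-style alignment midline.

    Letters mark identical residues, '+' marks a conservative
    substitution, and spaces mark mismatches or gaps.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)

# Short segment copied from the midline of the first GenBank alignment.
segment = "DEKLSSELVVDED+VAHEWLG"
print(round(percent_identity(segment), 2))
```

To check a whole hit, concatenate the midline rows of its alignment after removing the eight-character left padding from each row, keeping any internal and trailing spaces. For the first GenBank hit this gives 410 identical positions out of 417, which rounds to the listed 98.32.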


Sequences
CDS sequence
ATGGCGGCGTTAACGAAGCATATCCATACTCACTTCTTCCAACCTCTTTTCGCTTCCTTCAATTTTCAGTCGCGCAGTTATCGGTCTCCCACACGAATCTCGTCGACTTT
ATCCTTTCGTAACTCTGATTCTCACTTTATTGAAGCTCTGAAACTCGTTGATGAATTCGACCCCGAAATTCCCTTGGAGAAGGCCGTCACTCCACCCAGCTCCTGGTACA
CCGATCCTTCATTTTTCGCTCTCGAGCTCGATCGTGTCTTCCACAGAGGATGGCAGGCCGTAGGATATGTTGAACAGTTGAAAGATCCCCATGATTTTTTCACAGGCAGG
TTGGGAAATGTAGAATACGTTGTCTGCAAAGATAATAACAAGAAGGTTCGTGCATTTCACAATGTCTGCCGCCATCATGCCTCACTTCTTGCATCTGGATGTGGGAAGAA
GTCCTGCTTTGTGTGCCCATATCATGGATGGACATATGGGTTGGATGGAGGGCTGCTTAAGGCGACTAGAATAAATGGAATTCAGAACTTTGATGTAAATGATTTTGGGC
TCGTACCGTTACCCGTAGCGACATGGGGGCCGTTCGTTCTTCTCAATCTGGATGAAAAATTATCATCTGAGCTGGTTGTTGATGAAGATGAAGTGGCACATGAATGGCTG
GGAAGCTGTGCAGATTTGCTGAGTTTAAATGGAGTTGATGCTTCTTTAAGTTTTGTCTGTCGACGTGAATACACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTA
CTTAGACGGAGGGTATCACGTCCCGTATGCACACAAAGGGCTTGCATCAAATCTCAAGCTCGAGTCATATTCTACCGAAATATTTGAAGCTGTTAGTATTCAAAGTTGTA
AGGGTGGGGGAGAATCAAAAGGCGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTCGTATACCCAAACTTCATGATAAATAGATATGGACCTTGGATGGAC
ACTAATCTAGTACTCCCACTCGGACCTCGAAAATGTCTAGTGGTTTTCGATTATTTTCTCGAAACACCTTTTAAGAATGACGAATCGTTTATACAACAAAGTTTAGAAGA
CAGTGAAAGTGTGCAGATTGAAGACATTATTCTGTGTGAAGGTGTTCAAAAGGGTCTCGAGTCGCCTGCTTACAAGTTCGGCCGATATGCTCCTTCGGTCGAGAATGCGA
TGCACCATTTCCATCGTCTTCTTCATCTTAACCTCACAAAATAA
mRNA sequence
TATTCTATTGAGACTTTTTTTTTTCGTCATTCTTTTGAAAGGCATCGTAAATAAAATCTACGAAATACCAACCGATAACTGTAGAATACAACTAGCCGGAGGGAAGCTTG
GGGGCTGCTGCAATGGCGGCGTTAACGAAGCATATCCATACTCACTTCTTCCAACCTCTTTTCGCTTCCTTCAATTTTCAGTCGCGCAGTTATCGGTCTCCCACACGAAT
CTCGTCGACTTTATCCTTTCGTAACTCTGATTCTCACTTTATTGAAGCTCTGAAACTCGTTGATGAATTCGACCCCGAAATTCCCTTGGAGAAGGCCGTCACTCCACCCA
GCTCCTGGTACACCGATCCTTCATTTTTCGCTCTCGAGCTCGATCGTGTCTTCCACAGAGGATGGCAGGCCGTAGGATATGTTGAACAGTTGAAAGATCCCCATGATTTT
TTCACAGGCAGGTTGGGAAATGTAGAATACGTTGTCTGCAAAGATAATAACAAGAAGGTTCGTGCATTTCACAATGTCTGCCGCCATCATGCCTCACTTCTTGCATCTGG
ATGTGGGAAGAAGTCCTGCTTTGTGTGCCCATATCATGGATGGACATATGGGTTGGATGGAGGGCTGCTTAAGGCGACTAGAATAAATGGAATTCAGAACTTTGATGTAA
ATGATTTTGGGCTCGTACCGTTACCCGTAGCGACATGGGGGCCGTTCGTTCTTCTCAATCTGGATGAAAAATTATCATCTGAGCTGGTTGTTGATGAAGATGAAGTGGCA
CATGAATGGCTGGGAAGCTGTGCAGATTTGCTGAGTTTAAATGGAGTTGATGCTTCTTTAAGTTTTGTCTGTCGACGTGAATACACCATTGAATGTAATTGGAAGGTTTT
TTGTGACAACTACTTAGACGGAGGGTATCACGTCCCGTATGCACACAAAGGGCTTGCATCAAATCTCAAGCTCGAGTCATATTCTACCGAAATATTTGAAGCTGTTAGTA
TTCAAAGTTGTAAGGGTGGGGGAGAATCAAAAGGCGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTCGTATACCCAAACTTCATGATAAATAGATATGGA
CCTTGGATGGACACTAATCTAGTACTCCCACTCGGACCTCGAAAATGTCTAGTGGTTTTCGATTATTTTCTCGAAACACCTTTTAAGAATGACGAATCGTTTATACAACA
AAGTTTAGAAGACAGTGAAAGTGTGCAGATTGAAGACATTATTCTGTGTGAAGGTGTTCAAAAGGGTCTCGAGTCGCCTGCTTACAAGTTCGGCCGATATGCTCCTTCGG
TCGAGAATGCGATGCACCATTTCCATCGTCTTCTTCATCTTAACCTCACAAAATAATTGTCCTAGTGTTACTAATCTGAACTCATAAGTAAGGTATCTTTTCTCATCTCA
TCGTTTTCATTTTAACTCTTTAACATATGATTACTCGACATCGGATGTTCTGATGGTAGTGTGAATAGAAAGTTATTTGATTCCAATTGTCTAACTCGGAATACACGTTG
ACTTATAATTGTTGTAACTTCAATTAAATTTGTGGCTAATTTAGACAATTATACAATTAATATGGCAAAATATTGTAATCAATCAGGATATTTAATTTTCAACCGTTTAC
AAATTTAATTATCGTTTATGTCAACCTAAGTTTTTAAATTTTAAAAGGCAT
Protein sequence
MAALTKHIHTHFFQPLFASFNFQSRSYRSPTRISSTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFALELDRVFHRGWQAVGYVEQLKDPHDFFTGR
LGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNLDEKLSSELVVDEDEVAHEWL
GSCADLLSLNGVDASLSFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMD
TNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK
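
The protein sequence should correspond to a translation of the CDS sequence listed earlier. The following is a minimal Python sketch of that check, assuming the standard genetic code (the page does not say which translation table was used); the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence on this page.
print(translate("ATGGCGGCGTTAACGAAGCATATCCATACT"))
```

The printed residues can be compared with the start of the protein sequence (MAALTKHIHT...). Passing the full CDS, with line breaks removed, allows the same comparison over the whole length. Codons containing characters other than A, C, G, or T raise a `KeyError` in this sketch.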