; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg15214 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg15214
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionCholine monooxygenase, chloroplastic
Genome locationCarg_Chr10:1715383..1719255
RNA-Seq ExpressionCarg15214
SyntenyCarg15214
Gene Ontology termsGO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589681.1 Choline monooxygenase, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.7e-253100Show/hide
Query:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM

Query:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

XP_022965039.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita moschata]7.4e-24998.32Show/hide
Query:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRS+RS TRIS TLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM

Query:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDED+VAHEWLGSCADLLSLNGVDASL FVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

XP_022987329.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita maxima]6.5e-24591.35Show/hide
Query:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRSHRS TRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM

Query:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDEDKVA+EWLGSCADLLSLNGVDASL FVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMI----------------------------------NRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLE
        GDDYGRLGPEALYAFVYPNFMI                                  NRYGPWMDTNLVLPLGP+KCLVVFDYFLETPFKNDESFIQQSLE
Subjt:  GDDYGRLGPEALYAFVYPNFMI----------------------------------NRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLE

Query:  DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK
        DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK
Subjt:  DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK

XP_022987330.1 choline monooxygenase, chloroplastic isoform X2 [Cucurbita maxima]3.9e-25098.8Show/hide
Query:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRSHRS TRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM

Query:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDEDKVA+EWLGSCADLLSLNGVDASL FVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGP+KCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

XP_023515603.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]2.5e-24998.56Show/hide
Query:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRS RS TRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM

Query:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDEDKVA+EWLGSCADLLSLNGVDASL FVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSE VQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

TrEMBL top hitse value%identityAlignment
A0A5A7UY48 Choline monooxygenase, chloroplastic2.9e-22287.77Show/hide
Query:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
        MA LTKHI  HFFQP   SFNF   +HRS +RIS +LSFR+ DS FIEA KLVD+FDPEIPLEKA+TPPSSWY DPSFF LELDRVF+RGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
        KD HDFFTGRLGNVEYVVCKDNN+KVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG LLKATRINGIQNF+ NDFGLVPLPVA WGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM

Query:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        D KLSS+  VDEDKVA EWLG+CAD+L LNGVDASL +VCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTE+FE VSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE  FKND+SFIQQSLEDSESVQ EDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHLNLTK

A0A5D3BWX2 Choline monooxygenase, chloroplastic7.5e-22387.77Show/hide
Query:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
        MA LTKHI  HFFQP   SFNF   +HRS +RIS +LSFR+ DS FIEA KLVD+FDPEIPLEKA+TPPSSWY DPSFF LELDRVF+RGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
        KD HDFFTGRLGNVEYVVCKDNN+KVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG LLKATRINGIQNF+ NDFGLVPLPVA WGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM

Query:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        D KLSS++ VDEDKVA EWLG+CAD+L LNGVDASL +VCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYSTE+FE VSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+ FKND+SFIQQSLEDSESVQ EDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLH NLTK
Subjt:  ENAMHHFHRLLHLNLTK

A0A6J1HPX3 Choline monooxygenase, chloroplastic3.6e-24998.32Show/hide
Query:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRS+RS TRIS TLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM

Query:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDED+VAHEWLGSCADLLSLNGVDASL FVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

A0A6J1JGJ2 Choline monooxygenase, chloroplastic1.9e-25098.8Show/hide
Query:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRSHRS TRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM

Query:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDEDKVA+EWLGSCADLLSLNGVDASL FVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
        GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGP+KCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV
Subjt:  GDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSV

Query:  ENAMHHFHRLLHLNLTK
        ENAMHHFHRLLHLNLTK
Subjt:  ENAMHHFHRLLHLNLTK

A0A6J1JIK0 Choline monooxygenase, chloroplastic3.1e-24591.35Show/hide
Query:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL
        MAALTKHIHTHFFQPLFASFNFQSRSHRS TRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVFHRGWQAVGYVEQL
Subjt:  MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
        KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM
Subjt:  KDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNM

Query:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
        DEKLSSELVVDEDKVA+EWLGSCADLLSLNGVDASL FVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK
Subjt:  DEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESK

Query:  GDDYGRLGPEALYAFVYPNFMI----------------------------------NRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLE
        GDDYGRLGPEALYAFVYPNFMI                                  NRYGPWMDTNLVLPLGP+KCLVVFDYFLETPFKNDESFIQQSLE
Subjt:  GDDYGRLGPEALYAFVYPNFMI----------------------------------NRYGPWMDTNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLE

Query:  DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK
        DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK
Subjt:  DSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK

SwissProt top hitse value%identityAlignment
O04121 Choline monooxygenase, chloroplastic5.7e-11952.2Show/hide
Query:  LVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFV
        LV EFDP+IP E A TPPSSWYT+P+F+  EL+R+F++GWQ  G  +Q+K+P+ +FTG LGNVEY+V +D   KV AFHNVC H AS+LA G GKKSCFV
Subjt:  LVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFV

Query:  CPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWK
        CPYHGW YG+DG L KA++    QN D  + GLVPL VA WGPFVL+++D  L      +   V  EWLG+ A+ +  +  D SL F+ R E+ +E NWK
Subjt:  CPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWK

Query:  VFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE
        +F DNYLD  YHVPYAHK  A+ L  ++Y T++ E V+IQ  + G  +K D + R+G +A YAF YPNF + RYGPWM T  + PLGPRKC +V DY++E
Subjt:  VFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE

Query:  TPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL
            +D+ +I++ +  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH  L   L
Subjt:  TPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL

O22553 Choline monooxygenase, chloroplastic1.5e-11950.91Show/hide
Query:  ISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNV
        + PTL   ++    I +  LV EFDPEIP E A+TPPS+WYT+P+F+  EL+R+F++GWQ  GY EQ+K+ + +FTG LGNVEY+V +D   ++ AFHNV
Subjt:  ISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNV

Query:  CRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDKVAHEWLGSCADLLSLNGV
        C H AS+LA G GKKSCFVCPYHGW YGLDG L KA++    QN D  + GL PL VA WGPF+L+++D  L +        V  EW+G  A+ +  +  
Subjt:  CRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDKVAHEWLGSCADLLSLNGV

Query:  DASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTN
        D +L F  R E+ +ECNWKVFCDNYLD  YHVPYAHK  A+ L  ++Y+TE+ E   IQ   G   +K D + RLG EA YAF+YPNF + RYG WM T 
Subjt:  DASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTN

Query:  LVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL
         V+P+G RKC +V DY+LE    +D+++I + +  +++VQ ED +LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH  L
Subjt:  LVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL

Q93XE1 Choline monooxygenase, chloroplastic5.0e-12353.7Show/hide
Query:  KLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCF
        +++ EFDP++P E   TPPS+WYTDPS +  ELDR+F +GWQ  GY +Q+K+P+ +FTG LGNVEY+VC+D   KV AFHNVC H AS+LA G GKKSCF
Subjt:  KLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCF

Query:  VCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNW
        VCPYHGW +GLDG L+KAT+    Q FD  + GLV L VA WGPFVL+++D   S       + V  EW+GSCA+ +  +  D SL F+ R E+ +E NW
Subjt:  VCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNW

Query:  KVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFL
        KVFCDNYLD  YHVPYAHK  A+ L  ++Y T++ E V IQ       +K + + RLG EA YAF+YPNF + RYGPWM T  + PLGPRKC +V DY+L
Subjt:  KVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFL

Query:  ETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL
        E    ND+ +I++S+  +++VQ ED++LCE VQ+GLE+PAY+ GRY   +E  +HHFH  LH  L
Subjt:  ETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL

Q9LKN0 Choline monooxygenase, chloroplastic4.4e-11952.63Show/hide
Query:  LVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFV
        LV +FDP +P E A+TPPSSWYT+P+F+  ELDR+F++GWQ  GY +Q+K+ + +FTG LGNVEY+VC+D   KV AFHNVC H AS+LA G GKKSCFV
Subjt:  LVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFV

Query:  CPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWK
        CPYHGW YG++G L KA++    Q+ + ++ GLVPL VA WGPF+L+++D   SS  V D   V  EWLGSCA+ +  +  D +L F+ R E+ IE NWK
Subjt:  CPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDKVAHEWLGSCADLLSLNGVDASLGFVCRREYTIECNWK

Query:  VFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE
        +F DNYLD  YHVPYAHK  A+ L  ++Y T++   V+IQ   G   +  + + RLG +A YAF YPNF + RYGPWM T  ++PLGPRKC +V DY++E
Subjt:  VFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE

Query:  TPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH
            +D+ +I++ +  +++VQ ED++LCE VQKGLE+PAY+ GRY   +E  +HHFH  LH
Subjt:  TPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLH

Q9SZR0 Choline monooxygenase, chloroplastic5.5e-15467.1Show/hide
Query:  SPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVC
        +PT  F  SD       KLV EFDP+IPLE+A TPPSSWYTDP F+  ELDRVF+ GWQAVGY +Q+K+  DFFTGRLG+V++VVC+D N K+ AFHNVC
Subjt:  SPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVC

Query:  RHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDK-VAHEWLGSCADLLSLNGV
         HHAS+LASG G+KSCFVC YHGWTY L G L+KATR++GIQNF +++ GL PL VA WGPFVLL +    S +  V+ D+ VA EWLG+    LS  GV
Subjt:  RHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDK-VAHEWLGSCADLLSLNGV

Query:  DASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTN
        D+ L ++CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YST IFE VSIQ C GG +   D + RLG EALYAFVYPNFMINRYGPWMDTN
Subjt:  DASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTN

Query:  LVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL
        LVLPLGPRKC VVFDYFL+   K+DE+FI++SLE+S+ VQ+ED++LCE VQ+GLES AY  GRYA  VE  MHHFH LLH NL
Subjt:  LVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL

Arabidopsis top hitse value%identityAlignment
AT4G29890.1 choline monooxygenase, putative (CMO-like)3.9e-15567.1Show/hide
Query:  SPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVC
        +PT  F  SD       KLV EFDP+IPLE+A TPPSSWYTDP F+  ELDRVF+ GWQAVGY +Q+K+  DFFTGRLG+V++VVC+D N K+ AFHNVC
Subjt:  SPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGRLGNVEYVVCKDNNKKVRAFHNVC

Query:  RHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDK-VAHEWLGSCADLLSLNGV
         HHAS+LASG G+KSCFVC YHGWTY L G L+KATR++GIQNF +++ GL PL VA WGPFVLL +    S +  V+ D+ VA EWLG+    LS  GV
Subjt:  RHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDK-VAHEWLGSCADLLSLNGV

Query:  DASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTN
        D+ L ++CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YST IFE VSIQ C GG +   D + RLG EALYAFVYPNFMINRYGPWMDTN
Subjt:  DASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMDTN

Query:  LVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL
        LVLPLGPRKC VVFDYFL+   K+DE+FI++SLE+S+ VQ+ED++LCE VQ+GLES AY  GRYA  VE  MHHFH LLH NL
Subjt:  LVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCGTTAACGAAGCATATCCATACTCACTTCTTCCAACCTCTTTTCGCTTCCTTCAATTTTCAGTCGCGCAGTCATCGTTCTTCCACTCGAATCTCGCCGACTTT
ATCCTTTCGTAACTCTGATTCTCACTTTATTGAAGCTCTGAAACTCGTTGATGAATTCGACCCCGAAATTCCCTTGGAGAAGGCCGTCACTCCACCCAGCTCCTGGTACA
CCGATCCTTCATTTTTCGATCTCGAGCTCGATCGTGTCTTCCACAGAGGATGGCAGGCCGTAGGATATGTTGAACAGTTGAAAGATCCCCATGATTTTTTCACAGGCAGG
TTGGGAAATGTAGAATACGTTGTCTGCAAAGATAATAACAAGAAGGTTCGTGCATTTCACAATGTCTGCCGCCACCATGCCTCACTTCTTGCATCTGGATGTGGGAAGAA
GTCCTGCTTTGTGTGCCCATATCATGGATGGACATATGGGTTGGATGGAGGGCTGCTTAAGGCGACTAGAATAAATGGAATTCAGAACTTTGATGTAAATGATTTTGGGC
TCGTACCGTTACCCGTAGCGACATGGGGGCCGTTCGTTCTTCTCAATATGGATGAAAAATTATCATCTGAGCTGGTTGTTGATGAAGATAAAGTGGCACATGAATGGCTG
GGAAGCTGTGCAGACTTGCTGAGTTTAAATGGAGTTGATGCTTCTTTAGGTTTTGTCTGTCGACGTGAATACACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTA
CTTAGACGGAGGGTATCACGTCCCGTATGCACACAAAGGGCTTGCATCAAATCTCAAGCTCGAGTCGTATTCTACCGAAATATTTGAAGCTGTTAGTATTCAAAGTTGTA
AGGGTGGGGGAGAATCAAAAGGCGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTCGTATACCCAAACTTCATGATAAATAGATATGGACCTTGGATGGAC
ACTAATCTAGTACTCCCACTAGGACCTCGAAAATGTCTAGTGGTTTTCGATTATTTTCTTGAAACACCTTTTAAGAATGACGAATCGTTTATACAACAAAGTTTAGAAGA
CAGTGAAAGTGTGCAGATTGAAGACATTATTCTGTGTGAAGGTGTTCAAAAGGGTCTCGAGTCGCCTGCTTACAAGTTCGGCCGATATGCTCCTTCGGTCGAGAATGCGA
TGCACCATTTTCATCGTCTTCTTCATCTCAACCTCACAAAATAA
mRNA sequenceShow/hide mRNA sequence
AGACTTTTTTATTTTTTATTTTGAAAGGCATCGTAAATAAAATCTACGAAATACCAACCGATAACTGTAGAATACAACTAGCCGGAGGGAAGCTTGGGGGCTGCTGCAAT
GGCGGCGTTAACGAAGCATATCCATACTCACTTCTTCCAACCTCTTTTCGCTTCCTTCAATTTTCAGTCGCGCAGTCATCGTTCTTCCACTCGAATCTCGCCGACTTTAT
CCTTTCGTAACTCTGATTCTCACTTTATTGAAGCTCTGAAACTCGTTGATGAATTCGACCCCGAAATTCCCTTGGAGAAGGCCGTCACTCCACCCAGCTCCTGGTACACC
GATCCTTCATTTTTCGATCTCGAGCTCGATCGTGTCTTCCACAGAGGATGGCAGGCCGTAGGATATGTTGAACAGTTGAAAGATCCCCATGATTTTTTCACAGGCAGGTT
GGGAAATGTAGAATACGTTGTCTGCAAAGATAATAACAAGAAGGTTCGTGCATTTCACAATGTCTGCCGCCACCATGCCTCACTTCTTGCATCTGGATGTGGGAAGAAGT
CCTGCTTTGTGTGCCCATATCATGGATGGACATATGGGTTGGATGGAGGGCTGCTTAAGGCGACTAGAATAAATGGAATTCAGAACTTTGATGTAAATGATTTTGGGCTC
GTACCGTTACCCGTAGCGACATGGGGGCCGTTCGTTCTTCTCAATATGGATGAAAAATTATCATCTGAGCTGGTTGTTGATGAAGATAAAGTGGCACATGAATGGCTGGG
AAGCTGTGCAGACTTGCTGAGTTTAAATGGAGTTGATGCTTCTTTAGGTTTTGTCTGTCGACGTGAATACACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTACT
TAGACGGAGGGTATCACGTCCCGTATGCACACAAAGGGCTTGCATCAAATCTCAAGCTCGAGTCGTATTCTACCGAAATATTTGAAGCTGTTAGTATTCAAAGTTGTAAG
GGTGGGGGAGAATCAAAAGGCGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTCGTATACCCAAACTTCATGATAAATAGATATGGACCTTGGATGGACAC
TAATCTAGTACTCCCACTAGGACCTCGAAAATGTCTAGTGGTTTTCGATTATTTTCTTGAAACACCTTTTAAGAATGACGAATCGTTTATACAACAAAGTTTAGAAGACA
GTGAAAGTGTGCAGATTGAAGACATTATTCTGTGTGAAGGTGTTCAAAAGGGTCTCGAGTCGCCTGCTTACAAGTTCGGCCGATATGCTCCTTCGGTCGAGAATGCGATG
CACCATTTTCATCGTCTTCTTCATCTCAACCTCACAAAATAA
Protein sequenceShow/hide protein sequence
MAALTKHIHTHFFQPLFASFNFQSRSHRSSTRISPTLSFRNSDSHFIEALKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFHRGWQAVGYVEQLKDPHDFFTGR
LGNVEYVVCKDNNKKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGGLLKATRINGIQNFDVNDFGLVPLPVATWGPFVLLNMDEKLSSELVVDEDKVAHEWL
GSCADLLSLNGVDASLGFVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSTEIFEAVSIQSCKGGGESKGDDYGRLGPEALYAFVYPNFMINRYGPWMD
TNLVLPLGPRKCLVVFDYFLETPFKNDESFIQQSLEDSESVQIEDIILCEGVQKGLESPAYKFGRYAPSVENAMHHFHRLLHLNLTK