CuGenDBv2

Sgr027039 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr027039
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Choline monooxygenase, chloroplastic
Genome location: tig00153047:3396511..3402705
RNA-Seq Expression: Sgr027039
Synteny: Sgr027039
Gene Ontology terms:
GO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domains:
IPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
IPR006594 - LIS1 homology motif
IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase
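The Genome location value in this record appears to follow a `contig:start..end` pattern. As a minimal sketch, and assuming 1-based inclusive coordinates (the page does not state its convention), the value could be parsed as follows. The names `Location` and `parse_location` are hypothetical and are not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval on a named contig."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(contig, start, end)


loc = parse_location("tig00153047:3396511..3402705")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 6195 bp on contig tig00153047.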


Homology
GenBank top hits | e value | % identity | Alignment
KAG6589681.1 Choline monooxygenase, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 1.5e-138 | 83.75
Query:  MAMLIMHIPTHFFQA--PCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL
        MA L  HI THFFQ         S +HR   RIS  +SFRN DS F+EA KLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVF+RGWQAVGYVE L
Subjt:  MAMLIMHIPTHFFQA--PCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL

Query:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVE+VVCKDNN KVRAF+NVCRHHASLL SG GKKSCFVCPYHGWTYGLDG LLKATRI GIQNFDVNDFGL+PLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        D++LSSEL VDEDKVAHEWLGSC D+LS+NGVD SL FVCR EYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTE
Subjt:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

XP_022135573.1 choline monooxygenase, chloroplastic isoform X1 [Momordica charantia] | 1.3e-147 | 75
Query:  MAMLIMHIPTHFFQAPCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKD
        MA +I  IPTHFFQ P I  PS NHRP   ISA  +FRNPDSRF+EAQKLV EFDPEIPLEKA+TPPSSWYTDPSFF LELDRVFYRGWQAVGYVE LKD
Subjt:  MAMLIMHIPTHFFQAPCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKD

Query:  PHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDK
        PHDFFTGRLGNVEFVVCKDNNGKVRAF+NVCRHHASLL SGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPL VATWGPFVLLNLDK
Subjt:  PHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDK

Query:  ELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEQHAFLPFAVLGS-------
        EL SE +VDEDKVA EWLGSCVDVLS+NGV+ SLS+VCR EYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SY TE    +      S       
Subjt:  ELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEQHAFLPFAVLGS-------

Query:  ---VYFSIISYCSVSPKVVPGYFDIDGSLSFELVCRGVCLINFGFYDR
              S   Y  + P  +   + +       ++   VCLIN GFYDR
Subjt:  ---VYFSIISYCSVSPKVVPGYFDIDGSLSFELVCRGVCLINFGFYDR

XP_022135574.1 choline monooxygenase, chloroplastic isoform X2 [Momordica charantia] | 1.6e-145 | 87.9
Query:  MAMLIMHIPTHFFQAPCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKD
        MA +I  IPTHFFQ P I  PS NHRP   ISA  +FRNPDSRF+EAQKLV EFDPEIPLEKA+TPPSSWYTDPSFF LELDRVFYRGWQAVGYVE LKD
Subjt:  MAMLIMHIPTHFFQAPCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKD

Query:  PHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDK
        PHDFFTGRLGNVEFVVCKDNNGKVRAF+NVCRHHASLL SGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPL VATWGPFVLLNLDK
Subjt:  PHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDK

Query:  ELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        EL SE +VDEDKVA EWLGSCVDVLS+NGV+ SLS+VCR EYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SY TE
Subjt:  ELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

XP_022987329.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita maxima] | 3.7e-137 | 83.39
Query:  MAMLIMHIPTHFFQA--PCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL
        MA L  HI THFFQ         S +HR   RIS  +SFRN DS F+EA KLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVF+RGWQAVGYVE L
Subjt:  MAMLIMHIPTHFFQA--PCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL

Query:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVE+VVCKDNN KVRAF+NVCRHHASLL SG GKKSCFVCPYHGWTYGLDG LLKATRI GIQNFDVNDFGL+PLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        D++LSSEL VDEDKVA+EWLGSC D+LS+NGVD SLSFVCR EYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTE
Subjt:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida] | 4.6e-140 | 85.16
Query:  MAMLIMHIPTHFFQAPCII--RPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL
        MA L  HI THFFQ P I     S NHR  +RISAA+SFRN DS F+EAQKLVDEFDPEIPLEKAVTPPSSWY DPSF+ LELDRVFYRGWQAVGYVE L
Subjt:  MAMLIMHIPTHFFQAPCII--RPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL

Query:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVE+VVCKDNN KVRAF+NVCRHHASLL SG GK+SCFVCPYHGWTYGLDGVLLKATRI GIQNFD N+FGLIPLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        D +LSSELDVDEDKV  EWLGSC D+LS+NGVD SLS+VCR EYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
Subjt:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

TrEMBL top hits | e value | % identity | Alignment
A0A6J1C1T8 Choline monooxygenase, chloroplastic | 7.9e-146 | 87.9
Query:  MAMLIMHIPTHFFQAPCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKD
        MA +I  IPTHFFQ P I  PS NHRP   ISA  +FRNPDSRF+EAQKLV EFDPEIPLEKA+TPPSSWYTDPSFF LELDRVFYRGWQAVGYVE LKD
Subjt:  MAMLIMHIPTHFFQAPCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKD

Query:  PHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDK
        PHDFFTGRLGNVEFVVCKDNNGKVRAF+NVCRHHASLL SGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPL VATWGPFVLLNLDK
Subjt:  PHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDK

Query:  ELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        EL SE +VDEDKVA EWLGSCVDVLS+NGV+ SLS+VCR EYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SY TE
Subjt:  ELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

A0A6J1C563 Choline monooxygenase, chloroplastic | 6.5e-148 | 75
Query:  MAMLIMHIPTHFFQAPCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKD
        MA +I  IPTHFFQ P I  PS NHRP   ISA  +FRNPDSRF+EAQKLV EFDPEIPLEKA+TPPSSWYTDPSFF LELDRVFYRGWQAVGYVE LKD
Subjt:  MAMLIMHIPTHFFQAPCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKD

Query:  PHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDK
        PHDFFTGRLGNVEFVVCKDNNGKVRAF+NVCRHHASLL SGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPL VATWGPFVLLNLDK
Subjt:  PHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDK

Query:  ELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEQHAFLPFAVLGS-------
        EL SE +VDEDKVA EWLGSCVDVLS+NGV+ SLS+VCR EYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SY TE    +      S       
Subjt:  ELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEQHAFLPFAVLGS-------

Query:  ---VYFSIISYCSVSPKVVPGYFDIDGSLSFELVCRGVCLINFGFYDR
              S   Y  + P  +   + +       ++   VCLIN GFYDR
Subjt:  ---VYFSIISYCSVSPKVVPGYFDIDGSLSFELVCRGVCLINFGFYDR

A0A6J1HPX3 Choline monooxygenase, chloroplastic | 1.8e-137 | 83.39
Query:  MAMLIMHIPTHFFQA--PCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL
        MA L  HI THFFQ         S ++R   RIS+ +SFRN DS F+EA KLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVF+RGWQAVGYVE L
Subjt:  MAMLIMHIPTHFFQA--PCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL

Query:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVE+VVCKDNN KVRAF+NVCRHHASLL SG GKKSCFVCPYHGWTYGLDG LLKATRI GIQNFDVNDFGL+PLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        D++LSSEL VDED+VAHEWLGSC D+LS+NGVD SLSFVCR EYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTE
Subjt:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

A0A6J1JGJ2 Choline monooxygenase, chloroplastic | 1.8e-137 | 83.39
Query:  MAMLIMHIPTHFFQA--PCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL
        MA L  HI THFFQ         S +HR   RIS  +SFRN DS F+EA KLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVF+RGWQAVGYVE L
Subjt:  MAMLIMHIPTHFFQA--PCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL

Query:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVE+VVCKDNN KVRAF+NVCRHHASLL SG GKKSCFVCPYHGWTYGLDG LLKATRI GIQNFDVNDFGL+PLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        D++LSSEL VDEDKVA+EWLGSC D+LS+NGVD SLSFVCR EYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTE
Subjt:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

A0A6J1JIK0 Choline monooxygenase, chloroplastic | 1.8e-137 | 83.39
Query:  MAMLIMHIPTHFFQA--PCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL
        MA L  HI THFFQ         S +HR   RIS  +SFRN DS F+EA KLVDEFDPEIPLEKAVTPPSSWYTDPSFF LELDRVF+RGWQAVGYVE L
Subjt:  MAMLIMHIPTHFFQA--PCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELL

Query:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGNVE+VVCKDNN KVRAF+NVCRHHASLL SG GKKSCFVCPYHGWTYGLDG LLKATRI GIQNFDVNDFGL+PLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        D++LSSEL VDEDKVA+EWLGSC D+LS+NGVD SLSFVCR EYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYSTE
Subjt:  DKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

SwissProt top hits | e value | % identity | Alignment
O04121 Choline monooxygenase, chloroplastic | 2.8e-79 | 55.56
Query:  QKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSC
        Q LV EFDP+IP E A TPPSSWYT+P+F+  EL+R+FY+GWQ  G  + +K+P+ +FTG LGNVE++V +D  GKV AF+NVC H AS+L  GSGKKSC
Subjt:  QKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSC

Query:  FVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECN
        FVCPYHGW YG+DG L KA++ K  QN D  + GL+PL VA WGPFVL++LD+ L    D     V  EWLG+  + +  +  D SL F+ R E+ +E N
Subjt:  FVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        WK+F DNYLD  YHVPYAHK  A+ L  D+Y T+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

O22553 Choline monooxygenase, chloroplastic | 3.3e-80 | 55.13
Query:  QKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSC
        + LV EFDPEIP E A+TPPS+WYT+P+F+  EL+R+FY+GWQ  GY E +K+ + +FTG LGNVE++V +D  G++ AF+NVC H AS+L  GSGKKSC
Subjt:  QKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSC

Query:  FVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECN
        FVCPYHGW YGLDG L KA++    QN D  + GL PL VA WGPF+L++LD+ L +  D     V  EW+G   + +  +  D +L F  R E+ +ECN
Subjt:  FVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        WKVFCDNYLD  YHVPYAHK  A+ L  D+Y+TE
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

Q93XE1 Choline monooxygenase, chloroplastic | 1.2e-79 | 51.79
Query:  ISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNV
        I ++I+  N  +     ++++ EFDP++P E   TPPS+WYTDPS +  ELDR+F +GWQ  GY + +K+P+ +FTG LGNVE++VC+D  GKV AF+NV
Subjt:  ISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNV

Query:  CRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDKVAHEWLGSCVDVLSVNGV
        C H AS+L  G+GKKSCFVCPYHGW +GLDG L+KAT+ +  Q FD  + GL+ L VA WGPFVL++LD+  S       + V  EW+GSC + +  +  
Subjt:  CRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDKVAHEWLGSCVDVLSVNGV

Query:  DVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        D SL F+ R E+ +E NWKVFCDNYLD  YHVPYAHK  A+ L  D+Y T+
Subjt:  DVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

Q9LKN0 Choline monooxygenase, chloroplastic | 1.8e-78 | 54.27
Query:  QKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSC
        Q LV +FDP +P E A+TPPSSWYT+P+F+  ELDR+FY+GWQ  GY + +K+ + +FTG LGNVE++VC+D  GKV AF+NVC H AS+L  GSGKKSC
Subjt:  QKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSC

Query:  FVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECN
        FVCPYHGW YG++G L KA++    Q+ + ++ GL+PL VA WGPF+L++LD+   S  +V +  V  EWLGSC + +  +  D +L F+ R E+ IE N
Subjt:  FVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDKVAHEWLGSCVDVLSVNGVDVSLSFVCRCEYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE
        WK+F DNYLD  YHVPYAHK  A+ L  D+Y T+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTE

Q9SZR0 Choline monooxygenase, chloroplastic | 8.6e-97 | 65.59
Query:  FRNPDSRFL--EAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHH
        F NP   F   +  KLV EFDP+IPLE+A TPPSSWYTDP F+  ELDRVFY GWQAVGY + +K+  DFFTGRLG+V+FVVC+D NGK+ AF+NVC HH
Subjt:  FRNPDSRFL--EAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHH

Query:  ASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDK-VAHEWLGSCVDVLSVNGVDVS
        AS+L SG+G+KSCFVC YHGWTY L G L+KATR+ GIQNF +++ GL PL VA WGPFVLL +    S + +V+ D+ VA EWLG+ V  LS  GVD  
Subjt:  ASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDK-VAHEWLGSCVDVLSVNGVDVS

Query:  LSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYST
        LS++CR EYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L L++YST
Subjt:  LSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYST

Arabidopsis top hits | e value | % identity | Alignment
AT1G44446.1 Pheophorbide a oxygenase family protein with Rieske [2Fe-2S] domain | 1.3e-04 | 35.09
Query:  FVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIK
        +V+ +  +GK     N C H A  L  G+  +    CPYHGW Y  DG   K    K
Subjt:  FVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIK

AT1G44446.2 Pheophorbide a oxygenase family protein with Rieske [2Fe-2S] domain | 1.3e-04 | 35.09
Query:  FVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIK
        +V+ +  +GK     N C H A  L  G+  +    CPYHGW Y  DG   K    K
Subjt:  FVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIK

AT1G44446.3 Pheophorbide a oxygenase family protein with Rieske [2Fe-2S] domain | 1.3e-04 | 35.09
Query:  FVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIK
        +V+ +  +GK     N C H A  L  G+  +    CPYHGW Y  DG   K    K
Subjt:  FVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIK

AT4G29890.1 choline monooxygenase, putative (CMO-like) | 6.1e-98 | 65.59
Query:  FRNPDSRFL--EAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHH
        F NP   F   +  KLV EFDP+IPLE+A TPPSSWYTDP F+  ELDRVFY GWQAVGY + +K+  DFFTGRLG+V+FVVC+D NGK+ AF+NVC HH
Subjt:  FRNPDSRFL--EAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLGNVEFVVCKDNNGKVRAFYNVCRHH

Query:  ASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDK-VAHEWLGSCVDVLSVNGVDVS
        AS+L SG+G+KSCFVC YHGWTY L G L+KATR+ GIQNF +++ GL PL VA WGPFVLL +    S + +V+ D+ VA EWLG+ V  LS  GVD  
Subjt:  ASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDK-VAHEWLGSCVDVLSVNGVDVS

Query:  LSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYST
        LS++CR EYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L L++YST
Subjt:  LSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYST

AT5G57120.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: nucleolus; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: LisH dimerisation motif (InterPro:IPR006594), SRP40, C-terminal (InterPro:IPR007718); Has 101969 Blast hits to 55488 proteins in 2506 species: Archae - 424; Bacteria - 13843; Metazoa - 37674; Fungi - 9726; Plants - 4941; Viruses - 569; Other Eukaryotes - 34792 (source: NCBI BLink). | 7.9e-05 | 31.21
Query:  GSADEAITLEPEQRTLLLHAVAFYLERNGFSKTLKRFRSEAQIEVG---------TQLFSSMSVK--IEGQIDGLCSVGYLSLLYAVKKDSSKDFLLSLE
        G+  +   LE EQ+ LLL +VA YLER GFSK  K+  SEA+IE            ++FS    K   E   +G      +  +  VKKD  K      +
Subjt:  GSADEAITLEPEQRTLLLHAVAFYLERNGFSKTLKRFRSEAQIEVG---------TQLFSSMSVK--IEGQIDGLCSVGYLSLLYAVKKDSSKDFLLSLE

Query:  EMCHKYLKKCKCRNSKEVADDSVPEAQEEPKKKHKDKRERK
        E   +  ++ K + +  V +D V E +++ + K K   E K
Subjt:  EMCHKYLKKCKCRNSKEVADDSVPEAQEEPKKKHKDKRERK
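The %identity values in the hit tables can be checked against the alignment midlines. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and takes identity as identical positions divided by alignment length. The function name `percent_identity` is hypothetical, and the input is the single-block AT1G44446.1 alignment from this page.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity for one alignment block, from its BLAST-style midline."""
    # Trailing spaces in the midline are often lost when a page is copied,
    # so pad it back out to the length of the query line.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    # Letters in the midline mark identical residues; '+' and ' ' do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and midline lines of the AT1G44446.1 hit (prefixes removed).
query = "FVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIK"
midline = "+V+ +  +GK     N C H A  L  G+  +    CPYHGW Y  DG   K    K"

identity = round(percent_identity(query, midline), 2)
print(identity)  # 20 identical positions out of 57 -> 35.09
```

For hits whose alignment spans several blocks, the Query lines and the midlines would each need to be concatenated before calling the function.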


Sequences
CDS sequence
ATGGCGATGTTAATCATGCATATCCCTACTCACTTCTTCCAAGCTCCTTGCATCATTAGGCCTTCGAGCAATCATCGTCCTTTCGCACGAATCTCGGCGGCTATCTCTTT
TCGAAACCCAGATTCACGCTTCCTTGAAGCTCAGAAACTCGTCGATGAATTCGACCCTGAAATTCCCTTAGAGAAGGCCGTCACTCCACCCAGCTCGTGGTACACCGACC
CTTCGTTTTTCGATCTCGAACTCGATCGTGTCTTCTACAGAGGATGGCAGGCCGTAGGATATGTTGAACTATTAAAAGATCCCCATGACTTTTTCACAGGCAGGTTGGGA
AATGTAGAGTTTGTGGTGTGCAAAGATAATAATGGGAAGGTTCGTGCATTTTACAATGTTTGCCGCCATCATGCCTCACTTCTTACATCTGGAAGTGGGAAAAAGTCATG
CTTTGTATGCCCATATCATGGATGGACATATGGGTTGGATGGAGTTCTGCTCAAGGCTACTAGAATAAAAGGGATACAAAACTTTGATGTAAATGATTTTGGGCTCATAC
CATTGCCAGTAGCTACATGGGGGCCTTTTGTTCTTCTCAATCTAGATAAAGAGTTATCATCTGAGCTGGATGTTGATGAAGATAAAGTAGCACATGAATGGCTTGGAAGC
TGTGTAGATGTGCTGAGTGTGAATGGAGTCGATGTTTCGTTAAGTTTTGTCTGTAGATGCGAATACACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTACTTAGA
TGGAGGATATCACGTTCCCTATGCACATAAAGGGCTCGCATCAAATCTCAAGCTCGACTCGTATTCTACAGAACAACATGCTTTTCTGCCATTTGCTGTTTTAGGATCTG
TTTACTTTAGTATTATAAGCTATTGTTCTGTTTCACCTAAGGTAGTTCCTGGCTACTTTGACATAGATGGTTCTTTATCTTTTGAGTTGGTCTGCCGAGGTGTTTGTCTC
ATCAATTTCGGGTTCTATGACAGGTGTTCAAAGGGGCCTCGAGTCGCCCGCTTACAAGTTTGGCCGATACGCTCCTACAGTCGAGAATGCCATGCACCATTTCCATCGTC
CCTTTTCATCGATTTCTTTTTTCCTCAATATGTGGTGATTCTCAAATCTGATTTGAGAGGGGCTTCTCTTGTGGATTGCAATTCTGGCTGGCGCATGTCCCGAACCCTAA
CCAGCTCTGTAAGTCCTGCTCTTCTGGCATTGAAGCCTCGTCAAGTTCTTCTCTCCCACCACAGTATGAAGCACCCCAAATCTTCGGCTGTTCTTAAACCCCAAACCGGC
TCTGCCGATGAAGCCATCACCCTAGAGCCGGAGCAGAGGACTCTTCTTCTTCACGCTGTAGCCTTCTATTTGGAGCGCAATGGCTTCTCCAAGACCCTCAAAAGGTTTCG
TTCGGAAGCGCAGATTGAGGTCGGTACTCAACTTTTCTCTTCCATGTCTGTGAAGATTGAAGGACAGATAGATGGCTTGTGTAGTGTAGGGTATTTAAGCTTATTATATG
CTGTAAAGAAGGATTCTTCAAAGGATTTCTTGCTTAGTCTGGAAGAGATGTGCCACAAGTATTTAAAGAAATGTAAATGTAGAAACAGCAAGGAGGTTGCTGATGATAGC
GTTCCTGAAGCACAAGAGGAGCCTAAAAAGAAGCACAAAGATAAAAGAGAACGAAAAAGAACGTGGTTCCTGAAACGGATGCTGACACTGTAG
mRNA sequence
ATGGCGATGTTAATCATGCATATCCCTACTCACTTCTTCCAAGCTCCTTGCATCATTAGGCCTTCGAGCAATCATCGTCCTTTCGCACGAATCTCGGCGGCTATCTCTTT
TCGAAACCCAGATTCACGCTTCCTTGAAGCTCAGAAACTCGTCGATGAATTCGACCCTGAAATTCCCTTAGAGAAGGCCGTCACTCCACCCAGCTCGTGGTACACCGACC
CTTCGTTTTTCGATCTCGAACTCGATCGTGTCTTCTACAGAGGATGGCAGGCCGTAGGATATGTTGAACTATTAAAAGATCCCCATGACTTTTTCACAGGCAGGTTGGGA
AATGTAGAGTTTGTGGTGTGCAAAGATAATAATGGGAAGGTTCGTGCATTTTACAATGTTTGCCGCCATCATGCCTCACTTCTTACATCTGGAAGTGGGAAAAAGTCATG
CTTTGTATGCCCATATCATGGATGGACATATGGGTTGGATGGAGTTCTGCTCAAGGCTACTAGAATAAAAGGGATACAAAACTTTGATGTAAATGATTTTGGGCTCATAC
CATTGCCAGTAGCTACATGGGGGCCTTTTGTTCTTCTCAATCTAGATAAAGAGTTATCATCTGAGCTGGATGTTGATGAAGATAAAGTAGCACATGAATGGCTTGGAAGC
TGTGTAGATGTGCTGAGTGTGAATGGAGTCGATGTTTCGTTAAGTTTTGTCTGTAGATGCGAATACACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTACTTAGA
TGGAGGATATCACGTTCCCTATGCACATAAAGGGCTCGCATCAAATCTCAAGCTCGACTCGTATTCTACAGAACAACATGCTTTTCTGCCATTTGCTGTTTTAGGATCTG
TTTACTTTAGTATTATAAGCTATTGTTCTGTTTCACCTAAGGTAGTTCCTGGCTACTTTGACATAGATGGTTCTTTATCTTTTGAGTTGGTCTGCCGAGGTGTTTGTCTC
ATCAATTTCGGGTTCTATGACAGGTGTTCAAAGGGGCCTCGAGTCGCCCGCTTACAAGTTTGGCCGATACGCTCCTACAGTCGAGAATGCCATGCACCATTTCCATCGTC
CCTTTTCATCGATTTCTTTTTTCCTCAATATGTGGTGATTCTCAAATCTGATTTGAGAGGGGCTTCTCTTGTGGATTGCAATTCTGGCTGGCGCATGTCCCGAACCCTAA
CCAGCTCTGTAAGTCCTGCTCTTCTGGCATTGAAGCCTCGTCAAGTTCTTCTCTCCCACCACAGTATGAAGCACCCCAAATCTTCGGCTGTTCTTAAACCCCAAACCGGC
TCTGCCGATGAAGCCATCACCCTAGAGCCGGAGCAGAGGACTCTTCTTCTTCACGCTGTAGCCTTCTATTTGGAGCGCAATGGCTTCTCCAAGACCCTCAAAAGGTTTCG
TTCGGAAGCGCAGATTGAGGTCGGTACTCAACTTTTCTCTTCCATGTCTGTGAAGATTGAAGGACAGATAGATGGCTTGTGTAGTGTAGGGTATTTAAGCTTATTATATG
CTGTAAAGAAGGATTCTTCAAAGGATTTCTTGCTTAGTCTGGAAGAGATGTGCCACAAGTATTTAAAGAAATGTAAATGTAGAAACAGCAAGGAGGTTGCTGATGATAGC
GTTCCTGAAGCACAAGAGGAGCCTAAAAAGAAGCACAAAGATAAAAGAGAACGAAAAAGAACGTGGTTCCTGAAACGGATGCTGACACTGTAG
Protein sequence
MAMLIMHIPTHFFQAPCIIRPSSNHRPFARISAAISFRNPDSRFLEAQKLVDEFDPEIPLEKAVTPPSSWYTDPSFFDLELDRVFYRGWQAVGYVELLKDPHDFFTGRLG
NVEFVVCKDNNGKVRAFYNVCRHHASLLTSGSGKKSCFVCPYHGWTYGLDGVLLKATRIKGIQNFDVNDFGLIPLPVATWGPFVLLNLDKELSSELDVDEDKVAHEWLGS
CVDVLSVNGVDVSLSFVCRCEYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLDSYSTEQHAFLPFAVLGSVYFSIISYCSVSPKVVPGYFDIDGSLSFELVCRGVCL
INFGFYDRCSKGPRVARLQVWPIRSYSRECHAPFPSSLFIDFFFPQYVVILKSDLRGASLVDCNSGWRMSRTLTSSVSPALLALKPRQVLLSHHSMKHPKSSAVLKPQTG
SADEAITLEPEQRTLLLHAVAFYLERNGFSKTLKRFRSEAQIEVGTQLFSSMSVKIEGQIDGLCSVGYLSLLYAVKKDSSKDFLLSLEEMCHKYLKKCKCRNSKEVADDS
VPEAQEEPKKKHKDKRERKRTWFLKRMLTL
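The CDS and protein sequences listed for this gene can be cross-checked by translating one into the other. The following is a minimal sketch that assumes the standard nuclear genetic code. The helper `translate` and the codon table are written here for illustration and do not come from the database. Only the first and last codons of the CDS are used as inputs.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, dropping a terminal stop codon if present."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First five and last four codons of the CDS shown above.
print(translate("ATGGCGATGTTAATC"))  # MAMLI
print(translate("CTGACACTGTAG"))     # LTL
```

Both fragments agree with the start (`MAMLI...`) and end (`...MLTL`) of the listed protein sequence. Ambiguous bases such as `N` are not handled and would raise a `KeyError`.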