; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy03g017790 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy03g017790
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionCholine monooxygenase, chloroplastic
Genome locationChr03:51533664..51538084
RNA-Seq ExpressionLcy03g017790
SyntenyLcy03g017790
Gene Ontology termsGO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589681.1 Choline monooxygenase, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]3.8e-17783.43Show/hide
Query:  MATLTKQIPTHFFQAPRPWITCPS----SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL
        MA LTK I THFFQ         S    SS+RI    SFR  DS+ IEA KLVDEFDP+IPLEKAVTPPSSWYTDPSFF LELDRVF RGWQAVGYVEQL
Subjt:  MATLTKQIPTHFFQAPRPWITCPS----SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL
        KDPHDFFTGRLGNVE+VVCKDNN+KVRAFHNVCRHHASL+ASG GKKSCFVCPYHGWTYGLDG LLKATRINGIQNFDVNDFGL+P+P ATWGPF+LLN+
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL

Query:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK
        D+KLSSEL VDEDKVAHEWLGS AD+LSLNGVDASL FVCRREYTIECNWKV+CDNYLDGGYHVPYAHKGLASNL ++SYSTE FE VSIQSCK    +K
Subjt:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE PFK+
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD

XP_022965039.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita moschata]1.0e-17783.71Show/hide
Query:  MATLTKQIPTHFFQAPRPWITCPS----SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL
        MA LTK I THFFQ         S    S +RI +  SFR  DS+ IEA KLVDEFDP+IPLEKAVTPPSSWYTDPSFFALELDRVF RGWQAVGYVEQL
Subjt:  MATLTKQIPTHFFQAPRPWITCPS----SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL
        KDPHDFFTGRLGNVE+VVCKDNN+KVRAFHNVCRHHASL+ASG GKKSCFVCPYHGWTYGLDG LLKATRINGIQNFDVNDFGL+P+P ATWGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL

Query:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK
        D+KLSSEL VDED+VAHEWLGS AD+LSLNGVDASLSFVCRREYTIECNWKV+CDNYLDGGYHVPYAHKGLASNL ++SYSTE FE VSIQSCK    +K
Subjt:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE PFK+
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD

XP_022987330.1 choline monooxygenase, chloroplastic isoform X2 [Cucurbita maxima]1.1e-17683.15Show/hide
Query:  MATLTKQIPTHFFQAPRPWITCPSSSSRIPA----AFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL
        MA LTK I THFFQ         S S R P       SFR  DS+ IEA KLVDEFDP+IPLEKAVTPPSSWYTDPSFFALELDRVF RGWQAVGYVEQL
Subjt:  MATLTKQIPTHFFQAPRPWITCPSSSSRIPA----AFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL
        KDPHDFFTGRLGNVE+VVCKDNN+KVRAFHNVCRHHASL+ASG GKKSCFVCPYHGWTYGLDG LLKATRINGIQNFDVNDFGL+P+P ATWGPF+LLN+
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL

Query:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK
        D+KLSSEL VDEDKVA+EWLGS AD+LSLNGVDASLSFVCRREYTIECNWKV+CDNYLDGGYHVPYAHKGLASNL ++SYSTE FE VSIQSCK    +K
Subjt:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGP+KCLVVFDYFLE PFK+
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD

XP_023515603.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]1.4e-17683.43Show/hide
Query:  MATLTKQIPTHFFQAPRPWITCPS----SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL
        MA LTK I THFFQ         S    S +RI    SFR  DS+ IEA KLVDEFDP+IPLEKAVTPPSSWYTDPSFF LELDRVF RGWQAVGYVEQL
Subjt:  MATLTKQIPTHFFQAPRPWITCPS----SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL
        KDPHDFFTGRLGNVE+VVCKDNN+KVRAFHNVCRHHASL+ASG GKKSCFVCPYHGWTYGLDG LLKATRINGIQNFDVNDFGL+P+P ATWGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL

Query:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK
        D+KLSSEL VDEDKVA+EWLGS AD+LSLNGVDASLSFVCRREYTIECNWKV+CDNYLDGGYHVPYAHKGLASNL ++SYSTE FE VSIQSCK    +K
Subjt:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE PFK+
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida]9.0e-17985.11Show/hide
Query:  MATLTKQIPTHFFQAPRPWITCPS----SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL
        MATLTK I THFFQ P       S    S SRI AA SFR  DS+ IEAQKLVDEFDP+IPLEKAVTPPSSWY DPSF+ALELDRVF RGWQAVGYVEQL
Subjt:  MATLTKQIPTHFFQAPRPWITCPS----SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL
        KDPHDFFTGRLGNVE+VVCKDNNRKVRAFHNVCRHHASL+ASG GK+SCFVCPYHGWTYGLDGVLLKATRINGIQNFD N+FGLIP+P ATWGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL

Query:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK
        D KLSSELDVDEDKV  EWLGS AD+LSLNGVDASLS+VCRREYTIECNWKV+CDNYLDGGYHVPYAHKGLASNL +DSYSTE FETVSIQSCK    AK
Subjt:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
        G+DYGRLG EALYAFIYPNFMINRYGPWMDTNLVLPLG RKCLVVFDYFLEA FK+
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD

TrEMBL top hitse value%identityAlignment
A0A5A7UY48 Choline monooxygenase, chloroplastic3.2e-17482.3Show/hide
Query:  MATLTKQIPTHFFQAPR---PWITC-PSSSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL
        MATLTK I  HFFQ P     +  C   S SRI A+ SFR  DS  IEA+KLVD+FDP+IPLEKA+TPPSSWY DPSFFALELDRVF RGWQAVGYVEQL
Subjt:  MATLTKQIPTHFFQAPR---PWITC-PSSSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL
        KD HDFFTGRLGNVE+VVCKDNNRKVRAFHNVCRHHASL+A+G GKKSCFVCPYHGWTYGLDG+LLKATRINGIQNF+ NDFGL+P+P A WGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL

Query:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK
        D KLSS+ DVDEDKVA EWLG+ ADVL LNGVDASLS+VCRREYTI+CNWKV+CDNYLDGGYHVPYAHKGLASNLN++SYSTE FETVSIQSCK    +K
Subjt:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
        GDDYGRLG EALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEA FK+
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD

A0A5D3BWX2 Choline monooxygenase, chloroplastic2.5e-17482.02Show/hide
Query:  MATLTKQIPTHFFQAPR---PWITC-PSSSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL
        MATLTK I  HFFQ P     +  C   S SRI A+ SFR  DS  IEA+KLVD+FDP+IPLEKA+TPPSSWY DPSFFALELDRVF RGWQAVGYVEQL
Subjt:  MATLTKQIPTHFFQAPR---PWITC-PSSSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL
        KD HDFFTGRLGNVE+VVCKDNNRKVRAFHNVCRHHASL+A+G GKKSCFVCPYHGWTYGLDG+LLKATRINGIQNF+ NDFGL+P+P A WGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL

Query:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK
        D KLSS++DVDEDKVA EWLG+ ADVL LNGVDASLS+VCRREYTI+CNWKV+CDNYLDGGYHVPYAHKGLASNLN++SYSTE FETVSIQSCK    +K
Subjt:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
        GDDYGRLG EALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+ FK+
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD

A0A6J1C1T8 Choline monooxygenase, chloroplastic5.0e-17583.15Show/hide
Query:  MATLTKQIPTHFFQAPRPWITCPSSSSR----IPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL
        MAT+ KQIPTHFFQ   P I  PS + R    I A  +FR  DS  +EAQKLV EFDP+IPLEKA+TPPSSWYTDPSFF LELDRVF RGWQAVGYVEQL
Subjt:  MATLTKQIPTHFFQAPRPWITCPSSSSR----IPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL
        KDPHDFFTGRLGNVEFVVCKDNN KVRAFHNVCRHHASL+ASGSGKKSCFVCPYHGWTYGLDGVLLKATRI GIQNFDVNDFGLIP+  ATWGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL

Query:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCKA----K
        DK+L SE +VDEDKVA EWLGS  DVLSLNGV+ SLS+VCRREYTIECNWKV+CDNYLDGGYHVPYAHKGLASNL ++SY TE FETVSIQSC +    K
Subjt:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCKA----K

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
         DDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFK+
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD

A0A6J1HPX3 Choline monooxygenase, chloroplastic4.8e-17883.71Show/hide
Query:  MATLTKQIPTHFFQAPRPWITCPS----SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL
        MA LTK I THFFQ         S    S +RI +  SFR  DS+ IEA KLVDEFDP+IPLEKAVTPPSSWYTDPSFFALELDRVF RGWQAVGYVEQL
Subjt:  MATLTKQIPTHFFQAPRPWITCPS----SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL
        KDPHDFFTGRLGNVE+VVCKDNN+KVRAFHNVCRHHASL+ASG GKKSCFVCPYHGWTYGLDG LLKATRINGIQNFDVNDFGL+P+P ATWGPF+LLNL
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL

Query:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK
        D+KLSSEL VDED+VAHEWLGS AD+LSLNGVDASLSFVCRREYTIECNWKV+CDNYLDGGYHVPYAHKGLASNL ++SYSTE FE VSIQSCK    +K
Subjt:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE PFK+
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD

A0A6J1JGJ2 Choline monooxygenase, chloroplastic5.4e-17783.15Show/hide
Query:  MATLTKQIPTHFFQAPRPWITCPSSSSRIPA----AFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL
        MA LTK I THFFQ         S S R P       SFR  DS+ IEA KLVDEFDP+IPLEKAVTPPSSWYTDPSFFALELDRVF RGWQAVGYVEQL
Subjt:  MATLTKQIPTHFFQAPRPWITCPSSSSRIPA----AFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL
        KDPHDFFTGRLGNVE+VVCKDNN+KVRAFHNVCRHHASL+ASG GKKSCFVCPYHGWTYGLDG LLKATRINGIQNFDVNDFGL+P+P ATWGPF+LLN+
Subjt:  KDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNL

Query:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK
        D+KLSSEL VDEDKVA+EWLGS AD+LSLNGVDASLSFVCRREYTIECNWKV+CDNYLDGGYHVPYAHKGLASNL ++SYSTE FE VSIQSCK    +K
Subjt:  DKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCK----AK

Query:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
        GDDYGRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGP+KCLVVFDYFLE PFK+
Subjt:  GDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD

SwissProt top hitse value%identityAlignment
O04121 Choline monooxygenase, chloroplastic4.6e-10152.94Show/hide
Query:  QKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSC
        Q LV EFDP+IP E A TPPSSWYT+P+F++ EL+R+F +GWQ  G  +Q+K+P+ +FTG LGNVE++V +D   KV AFHNVC H AS++A GSGKKSC
Subjt:  QKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSC

Query:  FVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNLDKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECN
        FVCPYHGW YG+DG L KA++    QN D  + GL+P+  A WGPF+L++LD+ L    D     V  EWLG+ A+ +  +  D SL F+ R E+ +E N
Subjt:  FVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNLDKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECN

Query:  WKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCKA---KGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFL
        WK++ DNYLD  YHVPYAHK  A+ LN D+Y T+  E V+IQ  +    K D + R+G +A YAF YPNF + RYGPWM T  + PLGPRKC +V DY++
Subjt:  WKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCKA---KGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFL

Query:  EAPFKD
        E    D
Subjt:  EAPFKD

O22553 Choline monooxygenase, chloroplastic4.2e-10252.68Show/hide
Query:  QKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSC
        + LV EFDP+IP E A+TPPS+WYT+P+F++ EL+R+F +GWQ  GY EQ+K+ + +FTG LGNVE++V +D   ++ AFHNVC H AS++A GSGKKSC
Subjt:  QKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSC

Query:  FVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNLDKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECN
        FVCPYHGW YGLDG L KA++    QN D  + GL P+  A WGPFIL++LD+ L +  D     V  EW+G  A+ +  +  D +L F  R E+ +ECN
Subjt:  FVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNLDKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECN

Query:  WKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQ---SCKAKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFL
        WKV+CDNYLD  YHVPYAHK  A+ L+ D+Y+TE  E   IQ   S   K D + RLG+EA YAFIYPNF + RYG WM T  V+P+G RKC +V DY+L
Subjt:  WKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQ---SCKAKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFL

Query:  EAPFKD--CSVDKPVHI
        E    D    +DK + I
Subjt:  EAPFKD--CSVDKPVHI

Q93XE1 Choline monooxygenase, chloroplastic2.2e-10351.05Show/hide
Query:  SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRA
        +S  IP++ +     + +   ++++ EFDPK+P E   TPPS+WYTDPS ++ ELDR+F +GWQ  GY +Q+K+P+ +FTG LGNVE++VC+D   KV A
Subjt:  SSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRA

Query:  FHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNLDKKLSSELDVDEDKVAHEWLGSRADVLS
        FHNVC H AS++A G+GKKSCFVCPYHGW +GLDG L+KAT+    Q FD  + GL+ +  A WGPF+L++LD+  S       + V  EW+GS A+ + 
Subjt:  FHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNLDKKLSSELDVDEDKVAHEWLGSRADVLS

Query:  LNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQ---SCKAKGDDYGRLGSEALYAFIYPNFMINRYGPWM
         +  D SL F+ R E+ +E NWKV+CDNYLD  YHVPYAHK  A+ L+ D+Y T+  E V IQ   S   K + + RLGSEA YAFIYPNF + RYGPWM
Subjt:  LNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQ---SCKAKGDDYGRLGSEALYAFIYPNFMINRYGPWM

Query:  DTNLVLPLGPRKCLVVFDYFLEAPFKDCSVDKP
         T  + PLGPRKC +V DY+LE    +   DKP
Subjt:  DTNLVLPLGPRKCLVVFDYFLEAPFKDCSVDKP

Q9LKN0 Choline monooxygenase, chloroplastic1.9e-9952.84Show/hide
Query:  QKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSC
        Q LV +FDP +P E A+TPPSSWYT+P+F+A ELDR+F +GWQ  GY +Q+K+ + +FTG LGNVE++VC+D   KV AFHNVC H AS++A GSGKKSC
Subjt:  QKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSC

Query:  FVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNLDKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECN
        FVCPYHGW YG++G L KA++    Q+ + ++ GL+P+  A WGPFIL++LD+   S  +V +  V  EWLGS A+ +  +  D +L F+ R E+ IE N
Subjt:  FVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNLDKKLSSELDVDEDKVAHEWLGSRADVLSLNGVDASLSFVCRREYTIECN

Query:  WKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCKAKGDD-YGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE
        WK++ DNYLD  YHVPYAHK  A+ L+ D+Y T+    V+IQ      ++ + RLG++A YAF YPNF + RYGPWM T  ++PLGPRKC +V DY++E
Subjt:  WKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCKAKGDD-YGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE

Q9SZR0 Choline monooxygenase, chloroplastic2.1e-13060.66Show/hide
Query:  MATLTKQIPTHF---FQAPRPWITCPSSSSRIPAAFSFRK-HDSYSI----EAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGY
        M TLT  +P       ++ R +    S      + FS R+ H+   +    +  KLV EFDPKIPLE+A TPPSSWYTDP F++ ELDRVF  GWQAVGY
Subjt:  MATLTKQIPTHF---FQAPRPWITCPSSSSRIPAAFSFRK-HDSYSI----EAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGY

Query:  VEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFI
         +Q+K+  DFFTGRLG+V+FVVC+D N K+ AFHNVC HHAS++ASG+G+KSCFVC YHGWTY L G L+KATR++GIQNF +++ GL P+  A WGPF+
Subjt:  VEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFI

Query:  LLNLDKKLSSELDVDEDK-VAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSC--
        LL +    S + +V+ D+ VA EWLG+    LS  GVD+ LS++CRREYTI+CNWKV+CDNYLDGGYHVPYAHKGL S L++++YST  FE VSIQ C  
Subjt:  LLNLDKKLSSELDVDEDK-VAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSC--

Query:  --KAKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
          K   D + RLGSEALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+   KD
Subjt:  --KAKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD

Arabidopsis top hitse value%identityAlignment
AT1G44446.1 Pheophorbide a oxygenase family protein with Rieske [2Fe-2S] domain9.2e-0429.73Show/hide
Query:  WQAVGYVEQLKDPHDFFTG-RLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDG
        W  V +   LK  HD           +V+ +  + K     N C H A  +  G+  +    CPYHGW Y  DG
Subjt:  WQAVGYVEQLKDPHDFFTG-RLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDG

AT1G44446.2 Pheophorbide a oxygenase family protein with Rieske [2Fe-2S] domain9.2e-0429.73Show/hide
Query:  WQAVGYVEQLKDPHDFFTG-RLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDG
        W  V +   LK  HD           +V+ +  + K     N C H A  +  G+  +    CPYHGW Y  DG
Subjt:  WQAVGYVEQLKDPHDFFTG-RLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDG

AT1G44446.3 Pheophorbide a oxygenase family protein with Rieske [2Fe-2S] domain9.2e-0429.73Show/hide
Query:  WQAVGYVEQLKDPHDFFTG-RLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDG
        W  V +   LK  HD           +V+ +  + K     N C H A  +  G+  +    CPYHGW Y  DG
Subjt:  WQAVGYVEQLKDPHDFFTG-RLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDG

AT4G29890.1 choline monooxygenase, putative (CMO-like)1.5e-13160.66Show/hide
Query:  MATLTKQIPTHF---FQAPRPWITCPSSSSRIPAAFSFRK-HDSYSI----EAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGY
        M TLT  +P       ++ R +    S      + FS R+ H+   +    +  KLV EFDPKIPLE+A TPPSSWYTDP F++ ELDRVF  GWQAVGY
Subjt:  MATLTKQIPTHF---FQAPRPWITCPSSSSRIPAAFSFRK-HDSYSI----EAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGY

Query:  VEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFI
         +Q+K+  DFFTGRLG+V+FVVC+D N K+ AFHNVC HHAS++ASG+G+KSCFVC YHGWTY L G L+KATR++GIQNF +++ GL P+  A WGPF+
Subjt:  VEQLKDPHDFFTGRLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFI

Query:  LLNLDKKLSSELDVDEDK-VAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSC--
        LL +    S + +V+ D+ VA EWLG+    LS  GVD+ LS++CRREYTI+CNWKV+CDNYLDGGYHVPYAHKGL S L++++YST  FE VSIQ C  
Subjt:  LLNLDKKLSSELDVDEDK-VAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSC--

Query:  --KAKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD
          K   D + RLGSEALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+   KD
Subjt:  --KAKGDDYGRLGSEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACGTTAACGAAGCAAATCCCTACTCACTTCTTCCAAGCTCCTCGTCCTTGGATCACTTGCCCTTCATCTTCCTCACGAATCCCGGCGGCTTTCTCCTTT
CGAAAACACGATTCTTACTCCATTGAAGCTCAGAAACTCGTCGATGAATTCGACCCTAAAATTCCCTTGGAGAAGGCTGTCACTCCACCCAGCTCGTGGTACACC
GATCCTTCGTTTTTCGCTCTCGAGCTCGATCGTGTCTTCTGCAGAGGATGGCAGGCCGTAGGATATGTTGAACAGTTAAAAGATCCCCATGACTTTTTCACAGGC
AGGTTGGGAAATGTAGAGTTTGTAGTGTGCAAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTCTGCCGCCATCATGCTTCACTTATTGCTTCTGGAAGT
GGGAAAAAGTCGTGCTTTGTATGCCCGTATCATGGATGGACTTATGGGTTGGATGGAGTTCTGCTTAAGGCGACTAGAATAAATGGGATACAAAACTTTGATGTA
AATGATTTTGGGCTCATACCAATACCGGCAGCTACATGGGGGCCTTTCATTCTTCTCAATCTGGATAAAAAATTATCATCTGAGCTGGATGTTGATGAAGATAAA
GTAGCACATGAATGGCTTGGAAGCCGTGCAGATGTGCTGAGTTTGAATGGAGTTGATGCTTCTTTAAGTTTTGTTTGTCGACGTGAATACACCATTGAATGTAAT
TGGAAGGTTTATTGTGACAACTACTTAGACGGAGGATATCACGTTCCCTATGCACATAAAGGGCTAGCATCAAATCTGAATATTGATTCGTATTCTACAGAAACG
TTTGAAACTGTTAGCATTCAAAGTTGTAAGGCAAAAGGCGATGATTATGGTCGACTTGGATCAGAAGCACTATATGCTTTTATATACCCAAACTTCATGATAAAT
AGGTATGGACCTTGGATGGACACTAATCTAGTACTGCCACTTGGACCTCGAAAATGTCTTGTGGTTTTTGATTATTTTCTTGAAGCTCCTTTCAAGGACTGCTCA
GTCGACAAACCAGTCCATATCTAA
mRNA sequenceShow/hide mRNA sequence
CAAGTTACAGAGAAGCCTGGTGATTGTTGCAATGGCGACGTTAACGAAGCAAATCCCTACTCACTTCTTCCAAGCTCCTCGTCCTTGGATCACTTGCCCTTCATC
TTCCTCACGAATCCCGGCGGCTTTCTCCTTTCGAAAACACGATTCTTACTCCATTGAAGCTCAGAAACTCGTCGATGAATTCGACCCTAAAATTCCCTTGGAGAA
GGCTGTCACTCCACCCAGCTCGTGGTACACCGATCCTTCGTTTTTCGCTCTCGAGCTCGATCGTGTCTTCTGCAGAGGATGGCAGGCCGTAGGATATGTTGAACA
GTTAAAAGATCCCCATGACTTTTTCACAGGCAGGTTGGGAAATGTAGAGTTTGTAGTGTGCAAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTCTGCCG
CCATCATGCTTCACTTATTGCTTCTGGAAGTGGGAAAAAGTCGTGCTTTGTATGCCCGTATCATGGATGGACTTATGGGTTGGATGGAGTTCTGCTTAAGGCGAC
TAGAATAAATGGGATACAAAACTTTGATGTAAATGATTTTGGGCTCATACCAATACCGGCAGCTACATGGGGGCCTTTCATTCTTCTCAATCTGGATAAAAAATT
ATCATCTGAGCTGGATGTTGATGAAGATAAAGTAGCACATGAATGGCTTGGAAGCCGTGCAGATGTGCTGAGTTTGAATGGAGTTGATGCTTCTTTAAGTTTTGT
TTGTCGACGTGAATACACCATTGAATGTAATTGGAAGGTTTATTGTGACAACTACTTAGACGGAGGATATCACGTTCCCTATGCACATAAAGGGCTAGCATCAAA
TCTGAATATTGATTCGTATTCTACAGAAACGTTTGAAACTGTTAGCATTCAAAGTTGTAAGGCAAAAGGCGATGATTATGGTCGACTTGGATCAGAAGCACTATA
TGCTTTTATATACCCAAACTTCATGATAAATAGGTATGGACCTTGGATGGACACTAATCTAGTACTGCCACTTGGACCTCGAAAATGTCTTGTGGTTTTTGATTA
TTTTCTTGAAGCTCCTTTCAAGGACTGCTCAGTCGACAAACCAGTCCATATCTAATACGAAAACTCTGCAAAAACTGGGCATTTGTATTGTAAAGGATTGTCCGT
TCATTCGGGTTTTTACCTTTCTCTTCCTTGATATTCAGAATGACGACTCTCTTATACAAAACAGTTTAGAAGACAGTGAAAGCGTGCAGCTTGAAGACATTATTC
TGTGTGAAGGTGTTCAAAAGGGTCTCGAGTCGCCTGCCTACAATGTTGGCCGATACGCTCCTTCTGTCGAGAATGCCATGCACCATTTCCATCGTCTTCTTCATC
TTAATCTCACCAAATAATTGTTCTGCATATCTTGCCTCTTAGAGATATGAATCTGTAATCATATTTCAGGTCCTTTTTCTCATTGTTTTCCATTTTCCCTCACAA
TATGTGATTACCCTGAAATCTGATCTGAGGGGGGCTTCTCCTGTAGATTGCCATTCTAGCTGTTCTGTGGACAAGAAGTTGCTCGATTTCAATTGTTTGACCTGT
AATGCACATGTCAGAGTTAGTTAATAAACTTGTTCAATTAATATCTTTGGTATCTGAGCTATGTTTAGATTGGCACAGATGTGAATAATGAATATTTAGCAACTT
GTTCGAATTGGCAAAATTTTCTATTGCTCGTCTTCAAATGA
Protein sequenceShow/hide protein sequence
MATLTKQIPTHFFQAPRPWITCPSSSSRIPAAFSFRKHDSYSIEAQKLVDEFDPKIPLEKAVTPPSSWYTDPSFFALELDRVFCRGWQAVGYVEQLKDPHDFFTG
RLGNVEFVVCKDNNRKVRAFHNVCRHHASLIASGSGKKSCFVCPYHGWTYGLDGVLLKATRINGIQNFDVNDFGLIPIPAATWGPFILLNLDKKLSSELDVDEDK
VAHEWLGSRADVLSLNGVDASLSFVCRREYTIECNWKVYCDNYLDGGYHVPYAHKGLASNLNIDSYSTETFETVSIQSCKAKGDDYGRLGSEALYAFIYPNFMIN
RYGPWMDTNLVLPLGPRKCLVVFDYFLEAPFKDCSVDKPVHI