CuGenDBv2

HG10007641 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10007641
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Choline monooxygenase, chloroplastic
Genome location: Chr10:9088497..9091601
RNA-Seq Expression: HG10007641
Synteny: HG10007641
Gene Ontology terms:
  GO:0019285 - glycine betaine biosynthetic process from choline (biological process)
  GO:0009570 - chloroplast stroma (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0031967 - organelle envelope (cellular component)
  GO:0005506 - iron ion binding (molecular function)
  GO:0019133 - choline monooxygenase activity (molecular function)
  GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domains:
  IPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
  IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
  IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
  IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
  IPR044637 - Aromatic-ring-hydroxylating dioxygenase
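
The genome location in this record follows a `chromosome:start..end` pattern. The sketch below shows one way to split it into its parts. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function name `parse_location` is hypothetical.

```python
import re

def parse_location(location):
    """Split a 'Chr10:9088497..9091601' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

chrom, start, end = parse_location("Chr10:9088497..9091601")
# Span length under the assumed 1-based inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```

If the coordinates turn out to be 0-based or half-open, only the `span` arithmetic needs to change.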


Homology
GenBank top hits (e value, % identity, alignment)
TYK04233.1 choline monooxygenase [Cucumis melo var. makuwa] | e value: 2.8e-224 | identity: 88.46%
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPS +FN H CN+RSP RISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCF     GWTYGLDG+L+KATRINGIQNF+ NDFGL+PLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS++DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV
         DDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDS+IQQSL DSESVQNEDIILCEGVQKGLES AY FGRYAPSV
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV

Query:  ENAMHHFHRLLHFNLT
        ENAMHHFHRLLH NLT
Subjt:  ENAMHHFHRLLHFNLT

XP_004139149.1 choline monooxygenase, chloroplastic isoform X1 [Cucumis sativus] | e value: 1.0e-221 | identity: 88.22%
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHI  HFFQ PST+FN HSCN+RSP RISAALSFRN DS  IEA+KLVD+FDP+IPLEKA+TPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF     GWTYGLDG+L+KATRI+GIQNFD NDFGL+PLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS+ DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV
         DD GRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDS+IQ SL DSESVQNEDIILCEGVQKGLES AY FGRYAPSV
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV

Query:  ENAMHHFHRLLHFNLT
        ENAMHHFHRLLH NLT
Subjt:  ENAMHHFHRLLHFNLT

XP_008443631.1 PREDICTED: choline monooxygenase, chloroplastic isoform X2 [Cucumis melo] | e value: 3.7e-224 | identity: 88.7%
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPS +FN H CN+RSP RISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCF     GWTYGLDG+L+KATRINGIQNF+ NDFGL+PLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS+ DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV
         DDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDS+IQQSL DSESVQNEDIILCEGVQKGLES AY FGRYAPSV
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV

Query:  ENAMHHFHRLLHFNLT
        ENAMHHFHRLLH NLT
Subjt:  ENAMHHFHRLLHFNLT

XP_022965039.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita moschata] | e value: 5.9e-222 | identity: 88.7%
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIHTHFFQP   +FN  S + RSP RIS+ LSFRNSDSHFIEA KLVDEFDPEIPLEKAVTPPSSWY DPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGN+EYVVCKDNN+KVRAFHNVCRHHASLLASGCGKKSCF     GWTYGLDG L+KATRINGIQNFDVNDFGL+PLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        D KLSSEL VDED+VAH+WLGSCAD+LSLNGVD SLS+VCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS EIFE VSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV
         DDYGRLGPEALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE  FKND+S+IQQSL DSESVQ EDIILCEGVQKGLES AY FGRYAPSV
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV

Query:  ENAMHHFHRLLHFNLT
        ENAMHHFHRLLH NLT
Subjt:  ENAMHHFHRLLHFNLT

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida] | e value: 7.7e-230 | identity: 91.35%
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIHTHFFQPPS +FNLHSCN+RSP RISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSF+ALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLASGCGK+SCF     GWTYGLDGVL+KATRINGIQNFD N+FGLIPLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSSELDVDEDKV  +WLGSCAD+LSLNGVD SLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYS EIFETVSIQSC GGGE+K
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV
         +DYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLG RKCLVVFDYFLEASFKND  +IQ+SL DSESVQ EDIILCEGVQKGLES AY FGRYAPSV
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV

Query:  ENAMHHFHRLLHFNLT
        ENAMHHFHRLLH NLT
Subjt:  ENAMHHFHRLLHFNLT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LXX3 Choline monooxygenase, chloroplastic | e value: 4.8e-222 | identity: 88.22%
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHI  HFFQ PST+FN HSCN+RSP RISAALSFRN DS  IEA+KLVD+FDP+IPLEKA+TPPSSWYIDPSFFALEL+ VFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF     GWTYGLDG+L+KATRI+GIQNFD NDFGL+PLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS+ DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV
         DD GRLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDS+IQ SL DSESVQNEDIILCEGVQKGLES AY FGRYAPSV
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV

Query:  ENAMHHFHRLLHFNLT
        ENAMHHFHRLLH NLT
Subjt:  ENAMHHFHRLLHFNLT

A0A1S3B9A0 Choline monooxygenase, chloroplastic | e value: 1.8e-224 | identity: 88.7%
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPS +FN H CN+RSP RISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCF     GWTYGLDG+L+KATRINGIQNF+ NDFGL+PLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS+ DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV
         DDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDS+IQQSL DSESVQNEDIILCEGVQKGLES AY FGRYAPSV
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV

Query:  ENAMHHFHRLLHFNLT
        ENAMHHFHRLLH NLT
Subjt:  ENAMHHFHRLLHFNLT

A0A5A7UY48 Choline monooxygenase, chloroplastic | e value: 1.8e-224 | identity: 88.7%
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPS +FN H CN+RSP RISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCF     GWTYGLDG+L+KATRINGIQNF+ NDFGL+PLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS+ DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV
         DDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDS+IQQSL DSESVQNEDIILCEGVQKGLES AY FGRYAPSV
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV

Query:  ENAMHHFHRLLHFNLT
        ENAMHHFHRLLH NLT
Subjt:  ENAMHHFHRLLHFNLT

A0A5D3BWX2 Choline monooxygenase, chloroplastic | e value: 1.4e-224 | identity: 88.46%
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPS +FN H CN+RSP RISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCF     GWTYGLDG+L+KATRINGIQNF+ NDFGL+PLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS++DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV
         DDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE+SFKNDDS+IQQSL DSESVQNEDIILCEGVQKGLES AY FGRYAPSV
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV

Query:  ENAMHHFHRLLHFNLT
        ENAMHHFHRLLH NLT
Subjt:  ENAMHHFHRLLHFNLT

A0A6J1HPX3 Choline monooxygenase, chloroplastic | e value: 2.8e-222 | identity: 88.7%
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIHTHFFQP   +FN  S + RSP RIS+ LSFRNSDSHFIEA KLVDEFDPEIPLEKAVTPPSSWY DPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGN+EYVVCKDNN+KVRAFHNVCRHHASLLASGCGKKSCF     GWTYGLDG L+KATRINGIQNFDVNDFGL+PLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        D KLSSEL VDED+VAH+WLGSCAD+LSLNGVD SLS+VCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS EIFE VSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV
         DDYGRLGPEALYAF+YPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLE  FKND+S+IQQSL DSESVQ EDIILCEGVQKGLES AY FGRYAPSV
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSV

Query:  ENAMHHFHRLLHFNLT
        ENAMHHFHRLLH NLT
Subjt:  ENAMHHFHRLLHFNLT

SwissProt top hits (e value, % identity, alignment)
O04121 Choline monooxygenase, chloroplastic | e value: 3.0e-112 | identity: 48.49%
Query:  PSTTFNLHSCNNRSPKRIS----AALSFRN-SDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGR
        P  T N  +  +R+P +I+    AA SF + + +     Q LV EFDP+IP E A TPPSSWY +P+F++ EL+R+FY+GWQ  G  +Q+K+P+ +FTG 
Subjt:  PSTTFNLHSCNNRSPKRIS----AALSFRN-SDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGR

Query:  LGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDV
        LGN+EY+V +D   KV AFHNVC H AS+LA G GKKSCF     GW YG+DG L KA++    QN D  + GL+PL VA WGPFVL++LD  L    D 
Subjt:  LGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDV

Query:  DEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPE
            V  +WLG+ A+ +  +  D SL ++ R E+ +E NWK+F DNYLD  YHVPYAHK  A+ L  ++Y  ++ E V+IQ    G  +K D + R+G +
Subjt:  DEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPE

Query:  ALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSVENAMHHFH
        A YAF YPNF + RYGPWM T  + PLGPRKC +V DY++E S  +D  YI++ +A +++VQ ED++LCE VQ+GLE+ AY  GRY   +E  +HHFH
Subjt:  ALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSVENAMHHFH

O22553 Choline monooxygenase, chloroplastic | e value: 4.7e-113 | identity: 50.82%
Query:  QKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        + LV EFDPEIP E A+TPPS+WY +P+F++ EL+R+FY+GWQ  GY EQ+K+ + +FTG LGN+EY+V +D   ++ AFHNVC H AS+LA G GKKSC
Subjt:  QKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  F-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECN
        F     GW YGLDG L KA++    QN D  + GL PL VA WGPF+L++LD  L +  D     V  +W+G  A+ +  +  D +L +  R E+ +ECN
Subjt:  F-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WKVFCDNYLD  YHVPYAHK  A+ L  ++Y+ E+ E   IQ   G   +K D + RLG EA YAFIYPNF + RYG WM T  V+P+G RKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLHFNL
        LE +  +D +YI + +A +++VQ ED +LCE VQ+GLE+ AY  GRY   +E  +HHFH  LH  L
Subjt:  LEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLHFNL

Q93XE1 Choline monooxygenase, chloroplastic | e value: 2.5e-114 | identity: 49.61%
Query:  ISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNV
        I ++++  N  +     ++++ EFDP++P E   TPPS+WY DPS ++ ELDR+F +GWQ  GY +Q+K+P+ +FTG LGN+EY+VC+D   KV AFHNV
Subjt:  ISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNV

Query:  CRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGV
        C H AS+LA G GKKSCF     GW +GLDG L+KAT+    Q FD  + GL+ L VA WGPFVL++LD +  SE     + V  +W+GSCA+ +  +  
Subjt:  CRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGV

Query:  DTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTN
        D SL ++ R E+ +E NWKVFCDNYLD  YHVPYAHK  A+ L  ++Y  ++ E V IQ       +K + + RLG EA YAFIYPNF + RYGPWM T 
Subjt:  DTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTN

Query:  LVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLHFNL
         + PLGPRKC +V DY+LE +  ND  YI++S+  +++VQ ED++LCE VQ+GLE+ AY  GRY   +E  +HHFH  LH  L
Subjt:  LVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLHFNL

Q9LKN0 Choline monooxygenase, chloroplastic | e value: 6.1e-113 | identity: 50.41%
Query:  QKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        Q LV +FDP +P E A+TPPSSWY +P+F+A ELDR+FY+GWQ  GY +Q+K+ + +FTG LGN+EY+VC+D   KV AFHNVC H AS+LA G GKKSC
Subjt:  QKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  F-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECN
        F     GW YG++G L KA++    Q+ + ++ GL+PL VA WGPF+L++LD    S  +V +  V  +WLGSCA+ +  +  D +L ++ R E+ IE N
Subjt:  F-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF
        WK+F DNYLD  YHVPYAHK  A+ L  ++Y  ++   V+IQ   G   +  + + RLG +A YAF YPNF + RYGPWM T  ++PLGPRKC +V DY+
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYF

Query:  LEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLH
        +E S  +D  YI++ +A +++VQ ED++LCE VQKGLE+ AY  GRY   +E  +HHFH  LH
Subjt:  LEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLH

Q9SZR0 Choline monooxygenase, chloroplastic | e value: 8.1e-150 | identity: 62.17%
Query:  MATLTKHIHTHFFQPPSTT-----FNLHSCNNRSPKRISAALSFRNSDSHFI--EAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQA
        M TLT  +    F PPS       FN HS    S  + S    F N    F   +  KLV EFDP+IPLE+A TPPSSWY DP F++ ELDRVFY GWQA
Subjt:  MATLTKHIHTHFFQPPSTT-----FNLHSCNNRSPKRISAALSFRNSDSHFI--EAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQA

Query:  VGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWG
        VGY +Q+K+  DFFTGRLG++++VVC+D N K+ AFHNVC HHAS+LASG G+KSCF     GWTY L G LVKATR++GIQNF +++ GL PL VA WG
Subjt:  VGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWG

Query:  PFVLLNLDGKLSSELDVDEDK-VAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQS
        PFVLL +    S + +V+ D+ VA +WLG+    LS  GVD+ LSY+CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YS  IFE VSIQ 
Subjt:  PFVLLNLDGKLSSELDVDEDK-VAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQS

Query:  CNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYT
        C GG +   D + RLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+ S K+D+++I++SL +S+ VQ ED++LCE VQ+GLESQAY 
Subjt:  CNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYT

Query:  FGRYAPSVENAMHHFHRLLHFNL
         GRYA  VE  MHHFH LLH NL
Subjt:  FGRYAPSVENAMHHFHRLLHFNL

Arabidopsis top hits (e value, % identity, alignment)
AT4G29890.1 choline monooxygenase, putative (CMO-like) | e value: 5.8e-151 | identity: 62.17%
Query:  MATLTKHIHTHFFQPPSTT-----FNLHSCNNRSPKRISAALSFRNSDSHFI--EAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQA
        M TLT  +    F PPS       FN HS    S  + S    F N    F   +  KLV EFDP+IPLE+A TPPSSWY DP F++ ELDRVFY GWQA
Subjt:  MATLTKHIHTHFFQPPSTT-----FNLHSCNNRSPKRISAALSFRNSDSHFI--EAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQA

Query:  VGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWG
        VGY +Q+K+  DFFTGRLG++++VVC+D N K+ AFHNVC HHAS+LASG G+KSCF     GWTY L G LVKATR++GIQNF +++ GL PL VA WG
Subjt:  VGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCF-----GWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWG

Query:  PFVLLNLDGKLSSELDVDEDK-VAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQS
        PFVLL +    S + +V+ D+ VA +WLG+    LS  GVD+ LSY+CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YS  IFE VSIQ 
Subjt:  PFVLLNLDGKLSSELDVDEDK-VAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQS

Query:  CNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYT
        C GG +   D + RLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRKC VVFDYFL+ S K+D+++I++SL +S+ VQ ED++LCE VQ+GLESQAY 
Subjt:  CNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYT

Query:  FGRYAPSVENAMHHFHRLLHFNL
         GRYA  VE  MHHFH LLH NL
Subjt:  FGRYAPSVENAMHHFHRLLHFNL
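
In the alignment blocks above, the unlabeled middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and leaves a space for a mismatch or gap. The sketch below counts these for one block. The helper name `midline_stats` is hypothetical, and dividing by the full block length (gap columns included) is an assumption about how the reported % identity is defined.

```python
def midline_stats(query, midline):
    """Count identities and positives in one BLAST-style alignment block."""
    # Pad the midline: trailing spaces are often stripped in copied text.
    midline = midline.ljust(len(query))
    # An identity is a column where the midline repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    # Positives are identities plus conservative substitutions marked '+'.
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Last block of the first GenBank hit (TYK04233.1) in this record.
query = "ENAMHHFHRLLHFNLT"
midline = "ENAMHHFHRLLH NLT"
ident, pos, length = midline_stats(query, midline)
print(f"{ident}/{length} identical ({100 * ident / length:.2f}%)")
```

Summing the three counts over every block of a hit gives a whole-alignment figure that can be compared with the % identity listed for that hit.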


Sequences
CDS sequence
ATGGCGACGTTAACGAAGCATATCCATACTCACTTCTTCCAACCTCCTTCAACTACCTTCAATTTGCATTCGTGCAATAATCGTTCTCCCAAACGAATCTCGGCGGCTTT
ATCCTTTCGAAACTCTGATTCTCACTTCATTGAAGCTCAGAAACTCGTTGATGAATTCGACCCTGAAATTCCCTTGGAGAAGGCCGTCACTCCACCCAGCTCCTGGTATA
TTGACCCTTCATTTTTTGCTCTCGAGCTCGATCGTGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATCCCCATGACTTTTTCACAGGCAGG
TTGGGAAATATAGAGTACGTGGTATGCAAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCGTCTGGATGTGGGAAGAA
GTCATGCTTTGGATGGACATATGGGTTGGATGGAGTACTGGTTAAGGCGACTAGAATAAATGGGATACAGAACTTTGATGTAAATGATTTTGGGCTCATACCATTACCAG
TAGCTACATGGGGGCCTTTTGTTCTTCTCAATCTAGATGGAAAATTATCATCTGAGCTGGATGTCGATGAAGATAAAGTAGCACATCAATGGCTTGGAAGCTGTGCAGAT
GTGCTGAGTTTAAATGGAGTTGATACTTCTTTAAGTTATGTTTGTCGACGTGAATACACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTACTTAGATGGAGGATA
TCACGTCCCCTATGCACATAAAGGGCTTGCATCAAATCTCAAGCTTGAGTCTTATTCTAGAGAAATATTTGAAACTGTTAGCATTCAAAGTTGTAATGGTGGGGGAGAAT
CAAAACGTGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTTATATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGACACTAATCTAGTACTT
CCACTTGGACCTCGAAAATGTCTAGTGGTTTTCGATTATTTTCTTGAAGCTTCTTTTAAGAATGACGACTCTTATATACAACAAAGTTTAGCAGACAGTGAAAGTGTGCA
GAATGAAGACATTATTCTGTGTGAAGGTGTTCAAAAGGGTCTCGAGTCACAGGCTTACACGTTTGGTCGATATGCTCCTTCGGTCGAGAATGCCATGCACCATTTCCATC
GTCTTCTTCATTTTAACCTCACAAATGAAAAATAG
mRNA sequence
ATGGCGACGTTAACGAAGCATATCCATACTCACTTCTTCCAACCTCCTTCAACTACCTTCAATTTGCATTCGTGCAATAATCGTTCTCCCAAACGAATCTCGGCGGCTTT
ATCCTTTCGAAACTCTGATTCTCACTTCATTGAAGCTCAGAAACTCGTTGATGAATTCGACCCTGAAATTCCCTTGGAGAAGGCCGTCACTCCACCCAGCTCCTGGTATA
TTGACCCTTCATTTTTTGCTCTCGAGCTCGATCGTGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATCCCCATGACTTTTTCACAGGCAGG
TTGGGAAATATAGAGTACGTGGTATGCAAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCGTCTGGATGTGGGAAGAA
GTCATGCTTTGGATGGACATATGGGTTGGATGGAGTACTGGTTAAGGCGACTAGAATAAATGGGATACAGAACTTTGATGTAAATGATTTTGGGCTCATACCATTACCAG
TAGCTACATGGGGGCCTTTTGTTCTTCTCAATCTAGATGGAAAATTATCATCTGAGCTGGATGTCGATGAAGATAAAGTAGCACATCAATGGCTTGGAAGCTGTGCAGAT
GTGCTGAGTTTAAATGGAGTTGATACTTCTTTAAGTTATGTTTGTCGACGTGAATACACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTACTTAGATGGAGGATA
TCACGTCCCCTATGCACATAAAGGGCTTGCATCAAATCTCAAGCTTGAGTCTTATTCTAGAGAAATATTTGAAACTGTTAGCATTCAAAGTTGTAATGGTGGGGGAGAAT
CAAAACGTGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTTATATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGACACTAATCTAGTACTT
CCACTTGGACCTCGAAAATGTCTAGTGGTTTTCGATTATTTTCTTGAAGCTTCTTTTAAGAATGACGACTCTTATATACAACAAAGTTTAGCAGACAGTGAAAGTGTGCA
GAATGAAGACATTATTCTGTGTGAAGGTGTTCAAAAGGGTCTCGAGTCACAGGCTTACACGTTTGGTCGATATGCTCCTTCGGTCGAGAATGCCATGCACCATTTCCATC
GTCTTCTTCATTTTAACCTCACAAATGAAAAATAG
Protein sequence
MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGR
LGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCAD
VLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVL
PLGPRKCLVVFDYFLEASFKNDDSYIQQSLADSESVQNEDIILCEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLHFNLTNEK
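
The protein sequence should correspond to a translation of the CDS. The sketch below translates a CDS string so the two can be compared. It assumes the standard genetic code (NCBI translation table 1), and the function name `translate` is hypothetical. The two calls use the first 33 and last 12 bases of the CDS listed above.

```python
BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with '*' marking stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS up to (not including) the first stop codon."""
    # Remove line breaks and spaces so a multi-line sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGGCGACGTTAACGAAGCATATCCATACTCAC"))  # start of the CDS
print(translate("AATGAAAAATAG"))                       # end of the CDS
```

Passing the full CDS (line breaks included) returns the complete translation, which can then be compared with the listed protein sequence.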