; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G006460 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G006460
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionCholine monooxygenase, chloroplastic
Genome locationchr06:9586275..9589755
RNA-Seq ExpressionLsi06G006460
SyntenyLsi06G006460
Gene Ontology termsGO:0019285 - glycine betaine biosynthetic process from choline (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031967 - organelle envelope (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0019133 - choline monooxygenase activity (molecular function)
GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domainsIPR001663 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit
IPR015879 - Aromatic-ring-hydroxylating dioxygenase, alpha subunit, C-terminal domain
IPR017941 - Rieske [2Fe-2S] iron-sulphur domain
IPR036922 - Rieske [2Fe-2S] iron-sulphur domain superfamily
IPR044637 - Aromatic-ring-hydroxylating dioxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK04233.1 choline monooxygenase [Cucumis melo var. makuwa]6.1e-21986.81Show/hide
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPS +FN H CN+RSP RISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+L+KATRINGIQNF+ NDFGL+PLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS++DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS
         DDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRK L  V D  ++ +  NDDS+IQQSL DSESVQNEDIIL EGVQKGLES AY FGRYAPS
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS

Query:  VENAMHHFHRLLHFNLT
        VENAMHHFHRLLH NLT
Subjt:  VENAMHHFHRLLHFNLT

XP_008443631.1 PREDICTED: choline monooxygenase, chloroplastic isoform X2 [Cucumis melo]2.3e-21886.81Show/hide
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPS +FN H CN+RSP RISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+L+KATRINGIQNF+ NDFGL+PLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS+ DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS
         DDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRK L  V D  ++ +  NDDS+IQQSL DSESVQNEDIIL EGVQKGLES AY FGRYAPS
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS

Query:  VENAMHHFHRLLHFNLT
        VENAMHHFHRLLH NLT
Subjt:  VENAMHHFHRLLHFNLT

XP_022965039.1 choline monooxygenase, chloroplastic isoform X1 [Cucurbita moschata]5.7e-21787.29Show/hide
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIHTHFFQP   +FN  S + RSP RIS+ LSFRNSDSHFIEA KLVDEFDPEIPLEKAVTPPSSWY DPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGN+EYVVCKDNN+KVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDG L+KATRINGIQNFDVNDFGL+PLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        D KLSSEL VDED+VAH+WLGSCAD+LSLNGVD SLS+VCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS EIFE VSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS
         DDYGRLGPEALYAF+YPNFMINRYGPWMDTNLVLPLGPRK L  V D  ++    ND+S+IQQSL DSESVQ EDIIL EGVQKGLES AY FGRYAPS
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS

Query:  VENAMHHFHRLLHFNLT
        VENAMHHFHRLLH NLT
Subjt:  VENAMHHFHRLLHFNLT

XP_022987330.1 choline monooxygenase, chloroplastic isoform X2 [Cucurbita maxima]3.7e-21686.81Show/hide
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIHTHFFQP   +FN  S ++RSP RIS  LSFRNSDSHFIEA KLVDEFDPEIPLEKAVTPPSSWY DPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGN+EYVVCKDNN+KVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDG L+KATRINGIQNFDVNDFGL+PLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        D KLSSEL VDEDKVA++WLGSCAD+LSLNGVD SLS+VCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS EIFE VSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS
         DDYGRLGPEALYAF+YPNFMINRYGPWMDTNLVLPLGP+K L  V D  ++    ND+S+IQQSL DSESVQ EDIIL EGVQKGLES AY FGRYAPS
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS

Query:  VENAMHHFHRLLHFNLT
        VENAMHHFHRLLH NLT
Subjt:  VENAMHHFHRLLHFNLT

XP_038880722.1 choline monooxygenase, chloroplastic [Benincasa hispida]4.9e-22489.45Show/hide
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHIHTHFFQPPS +FNLHSCN+RSP RISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSF+ALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLASGCGK+SCFVCPYHGWTYGLDGVL+KATRINGIQNFD N+FGLIPLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSSELDVDEDKV  +WLGSCAD+LSLNGVD SLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKL+SYS EIFETVSIQSC GGGE+K
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS
         +DYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLG RK L  V D  ++ +  ND  +IQ+SL DSESVQ EDIIL EGVQKGLES AY FGRYAPS
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS

Query:  VENAMHHFHRLLHFNLT
        VENAMHHFHRLLH NLT
Subjt:  VENAMHHFHRLLHFNLT

TrEMBL top hitse value%identityAlignment
A0A1S3B9A0 Choline monooxygenase, chloroplastic1.1e-21886.81Show/hide
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPS +FN H CN+RSP RISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+L+KATRINGIQNF+ NDFGL+PLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS+ DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS
         DDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRK L  V D  ++ +  NDDS+IQQSL DSESVQNEDIIL EGVQKGLES AY FGRYAPS
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS

Query:  VENAMHHFHRLLHFNLT
        VENAMHHFHRLLH NLT
Subjt:  VENAMHHFHRLLHFNLT

A0A5A7UY48 Choline monooxygenase, chloroplastic1.1e-21886.81Show/hide
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPS +FN H CN+RSP RISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+L+KATRINGIQNF+ NDFGL+PLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS+ DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS
         DDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRK L  V D  ++ +  NDDS+IQQSL DSESVQNEDIIL EGVQKGLES AY FGRYAPS
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS

Query:  VENAMHHFHRLLHFNLT
        VENAMHHFHRLLH NLT
Subjt:  VENAMHHFHRLLHFNLT

A0A5D3BWX2 Choline monooxygenase, chloroplastic3.0e-21986.81Show/hide
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MATLTKHI  HFFQPPS +FN H CN+RSP RISA+LSFR+ DS FIEA+KLVD+FDPEIPLEKA+TPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KD HDFFTGRLGN+EYVVCKDNNRKVRAFHNVCRHHASLLA+GCGKKSCFVCPYHGWTYGLDG+L+KATRINGIQNF+ NDFGL+PLPVA WGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        DGKLSS++DVDEDKVA +WLG+CADVL LNGVD SLSYVCRREYTI+CNWKVFCDNYLDGGYHVPYAHKGLASNL LESYS E+FETVSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS
         DDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRK L  V D  ++ +  NDDS+IQQSL DSESVQNEDIIL EGVQKGLES AY FGRYAPS
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS

Query:  VENAMHHFHRLLHFNLT
        VENAMHHFHRLLH NLT
Subjt:  VENAMHHFHRLLHFNLT

A0A6J1HPX3 Choline monooxygenase, chloroplastic2.8e-21787.29Show/hide
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIHTHFFQP   +FN  S + RSP RIS+ LSFRNSDSHFIEA KLVDEFDPEIPLEKAVTPPSSWY DPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGN+EYVVCKDNN+KVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDG L+KATRINGIQNFDVNDFGL+PLPVATWGPFVLLNL
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        D KLSSEL VDED+VAH+WLGSCAD+LSLNGVD SLS+VCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS EIFE VSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS
         DDYGRLGPEALYAF+YPNFMINRYGPWMDTNLVLPLGPRK L  V D  ++    ND+S+IQQSL DSESVQ EDIIL EGVQKGLES AY FGRYAPS
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS

Query:  VENAMHHFHRLLHFNLT
        VENAMHHFHRLLH NLT
Subjt:  VENAMHHFHRLLHFNLT

A0A6J1JGJ2 Choline monooxygenase, chloroplastic1.8e-21686.81Show/hide
Query:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL
        MA LTKHIHTHFFQP   +FN  S ++RSP RIS  LSFRNSDSHFIEA KLVDEFDPEIPLEKAVTPPSSWY DPSFFALELDRVF+RGWQAVGYVEQL
Subjt:  MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQL

Query:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL
        KDPHDFFTGRLGN+EYVVCKDNN+KVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDG L+KATRINGIQNFDVNDFGL+PLPVATWGPFVLLN+
Subjt:  KDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNL

Query:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK
        D KLSSEL VDEDKVA++WLGSCAD+LSLNGVD SLS+VCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYS EIFE VSIQSC GGGESK
Subjt:  DGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESK

Query:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS
         DDYGRLGPEALYAF+YPNFMINRYGPWMDTNLVLPLGP+K L  V D  ++    ND+S+IQQSL DSESVQ EDIIL EGVQKGLES AY FGRYAPS
Subjt:  RDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPS

Query:  VENAMHHFHRLLHFNLT
        VENAMHHFHRLLH NLT
Subjt:  VENAMHHFHRLLHFNLT

SwissProt top hitse value%identityAlignment
O04121 Choline monooxygenase, chloroplastic4.4e-11148.62Show/hide
Query:  PSTTFNLHSCNNRSPKRIS----AALSFRN-SDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGR
        P  T N  +  +R+P +I+    AA SF + + +     Q LV EFDP+IP E A TPPSSWY +P+F++ EL+R+FY+GWQ  G  +Q+K+P+ +FTG 
Subjt:  PSTTFNLHSCNNRSPKRIS----AALSFRN-SDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGR

Query:  LGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDV
        LGN+EY+V +D   KV AFHNVC H AS+LA G GKKSCFVCPYHGW YG+DG L KA++    QN D  + GL+PL VA WGPFVL++LD  L    D 
Subjt:  LGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDV

Query:  DEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPE
            V  +WLG+ A+ +  +  D SL ++ R E+ +E NWK+F DNYLD  YHVPYAHK  A+ L  ++Y  ++ E V+IQ    G  +K D + R+G +
Subjt:  DEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPE

Query:  ALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPSVENAMHHFH
        A YAF YPNF + RYGPWM T  + PLGPRK    V+D  I+ +  +D  YI++ +A +++VQ ED++L E VQ+GLE+ AY  GRY   +E  +HHFH
Subjt:  ALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPSVENAMHHFH

O22553 Choline monooxygenase, chloroplastic4.0e-11250.95Show/hide
Query:  QKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        + LV EFDPEIP E A+TPPS+WY +P+F++ EL+R+FY+GWQ  GY EQ+K+ + +FTG LGN+EY+V +D   ++ AFHNVC H AS+LA G GKKSC
Subjt:  QKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECN
        FVCPYHGW YGLDG L KA++    QN D  + GL PL VA WGPF+L++LD  L +  D     V  +W+G  A+ +  +  D +L +  R E+ +ECN
Subjt:  FVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD-
        WKVFCDNYLD  YHVPYAHK  A+ L  ++Y+ E+ E   IQ   G   +K D + RLG EA YAFIYPNF + RYG WM T  V+P+G RK    V+D 
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD-

Query:  -IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLHFNL
         ++K   +D +YI + +A +++VQ ED +L E VQ+GLE+ AY  GRY   +E  +HHFH  LH  L
Subjt:  -IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLHFNL

Q93XE1 Choline monooxygenase, chloroplastic6.1e-11349.48Show/hide
Query:  ISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNV
        I ++++  N  +     ++++ EFDP++P E   TPPS+WY DPS ++ ELDR+F +GWQ  GY +Q+K+P+ +FTG LGN+EY+VC+D   KV AFHNV
Subjt:  ISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNV

Query:  CRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGV
        C H AS+LA G GKKSCFVCPYHGW +GLDG L+KAT+    Q FD  + GL+ L VA WGPFVL++LD +  SE     + V  +W+GSCA+ +  +  
Subjt:  CRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGV

Query:  DTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTN
        D SL ++ R E+ +E NWKVFCDNYLD  YHVPYAHK  A+ L  ++Y  ++ E V IQ       +K + + RLG EA YAFIYPNF + RYGPWM T 
Subjt:  DTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTN

Query:  LVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLHFNL
         + PLGPRK    V+D  ++    ND  YI++S+  +++VQ ED++L E VQ+GLE+ AY  GRY   +E  +HHFH  LH  L
Subjt:  LVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLHFNL

Q9LKN0 Choline monooxygenase, chloroplastic1.4e-11250.82Show/hide
Query:  QKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC
        Q LV +FDP +P E A+TPPSSWY +P+F+A ELDR+FY+GWQ  GY +Q+K+ + +FTG LGN+EY+VC+D   KV AFHNVC H AS+LA G GKKSC
Subjt:  QKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSC

Query:  FVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECN
        FVCPYHGW YG++G L KA++    Q+ + ++ GL+PL VA WGPF+L++LD    S  +V +  V  +WLGSCA+ +  +  D +L ++ R E+ IE N
Subjt:  FVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECN

Query:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD-
        WK+F DNYLD  YHVPYAHK  A+ L  ++Y  ++   V+IQ   G   +  + + RLG +A YAF YPNF + RYGPWM T  ++PLGPRK    V+D 
Subjt:  WKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD-

Query:  -IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLH
         I+K+  +D  YI++ +A +++VQ ED++L E VQKGLE+ AY  GRY   +E  +HHFH  LH
Subjt:  -IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLH

Q9SZR0 Choline monooxygenase, chloroplastic2.3e-14460.85Show/hide
Query:  MATLTKHIHTHFFQPPSTT-----FNLHSCNNRSPKRISAALSFRNSDSHFI--EAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQA
        M TLT  +    F PPS       FN HS    S  + S    F N    F   +  KLV EFDP+IPLE+A TPPSSWY DP F++ ELDRVFY GWQA
Subjt:  MATLTKHIHTHFFQPPSTT-----FNLHSCNNRSPKRISAALSFRNSDSHFI--EAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQA

Query:  VGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWG
        VGY +Q+K+  DFFTGRLG++++VVC+D N K+ AFHNVC HHAS+LASG G+KSCFVC YHGWTY L G LVKATR++GIQNF +++ GL PL VA WG
Subjt:  VGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWG

Query:  PFVLLNLDGKLSSELDVDEDK-VAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQS
        PFVLL +    S + +V+ D+ VA +WLG+    LS  GVD+ LSY+CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YS  IFE VSIQ 
Subjt:  PFVLLNLDGKLSSELDVDEDK-VAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQS

Query:  CNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAY
        C GG +   D + RLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRK    V D  +  +  +D+++I++SL +S+ VQ ED++L E VQ+GLESQAY
Subjt:  CNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAY

Query:  TFGRYAPSVENAMHHFHRLLHFNL
          GRYA  VE  MHHFH LLH NL
Subjt:  TFGRYAPSVENAMHHFHRLLHFNL

Arabidopsis top hitse value%identityAlignment
AT4G29890.1 choline monooxygenase, putative (CMO-like)1.6e-14560.85Show/hide
Query:  MATLTKHIHTHFFQPPSTT-----FNLHSCNNRSPKRISAALSFRNSDSHFI--EAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQA
        M TLT  +    F PPS       FN HS    S  + S    F N    F   +  KLV EFDP+IPLE+A TPPSSWY DP F++ ELDRVFY GWQA
Subjt:  MATLTKHIHTHFFQPPSTT-----FNLHSCNNRSPKRISAALSFRNSDSHFI--EAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQA

Query:  VGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWG
        VGY +Q+K+  DFFTGRLG++++VVC+D N K+ AFHNVC HHAS+LASG G+KSCFVC YHGWTY L G LVKATR++GIQNF +++ GL PL VA WG
Subjt:  VGYVEQLKDPHDFFTGRLGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWG

Query:  PFVLLNLDGKLSSELDVDEDK-VAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQS
        PFVLL +    S + +V+ D+ VA +WLG+    LS  GVD+ LSY+CRREYTI+CNWKVFCDNYLDGGYHVPYAHKGL S L LE+YS  IFE VSIQ 
Subjt:  PFVLLNLDGKLSSELDVDEDK-VAHQWLGSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQS

Query:  CNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAY
        C GG +   D + RLG EALYAF+YPNFMINRYGPWMDTNLVLPLGPRK    V D  +  +  +D+++I++SL +S+ VQ ED++L E VQ+GLESQAY
Subjt:  CNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMDTNLVLPLGPRKFLTRVLD--IKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAY

Query:  TFGRYAPSVENAMHHFHRLLHFNL
          GRYA  VE  MHHFH LLH NL
Subjt:  TFGRYAPSVENAMHHFHRLLHFNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACGTTAACGAAGCATATCCATACTCACTTCTTCCAACCTCCTTCAACTACCTTCAATTTGCATTCGTGCAATAATCGTTCTCCCAAACGAATCTCGGCGGCTTT
ATCCTTTCGAAACTCTGATTCTCACTTCATTGAAGCTCAGAAACTCGTTGATGAATTCGACCCTGAAATTCCCTTGGAGAAGGCCGTCACTCCACCCAGCTCCTGGTATA
TTGACCCTTCATTTTTTGCTCTCGAGCTCGATCGTGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATCCCCATGACTTTTTCACAGGCAGG
TTGGGAAATATAGAGTACGTGGTATGCAAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCGTCTGGATGTGGGAAGAA
GTCATGCTTTGTATGTCCATATCATGGATGGACATATGGGTTGGATGGAGTACTGGTTAAGGCGACTAGAATAAATGGGATACAGAACTTTGATGTAAATGATTTTGGGC
TCATACCATTACCAGTAGCTACATGGGGGCCTTTTGTTCTTCTCAATCTAGATGGAAAATTATCATCTGAGCTGGATGTCGATGAAGATAAAGTAGCACATCAATGGCTT
GGAAGCTGTGCAGATGTGCTGAGTTTAAATGGAGTTGATACTTCTTTAAGTTATGTTTGTCGACGTGAATACACCATTGAATGTAATTGGAAGGTTTTTTGTGACAACTA
CTTAGATGGAGGATATCACGTCCCCTATGCACATAAAGGGCTTGCATCAAATCTCAAGCTTGAGTCTTATTCTAGAGAAATATTTGAAACTGTTAGCATTCAAAGTTGTA
ATGGTGGGGGAGAATCAAAACGTGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTTATATACCCAAATTTCATGATAAATAGGTATGGACCTTGGATGGAC
ACTAATCTAGTACTTCCACTTGGACCTCGAAAATTTCTCACTCGTGTTCTTGATATTAAAAAGACTGCTTGGAATGACGACTCTTATATACAACAAAGTTTAGCAGACAG
TGAAAGTGTGCAGAATGAAGACATTATTCTGTATGAAGGTGTTCAAAAGGGTCTCGAGTCACAGGCTTACACGTTTGGTCGATATGCTCCTTCGGTCGAGAATGCCATGC
ACCATTTCCATCGTCTTCTTCATTTTAACCTCACAAATGAAAAATAG
mRNA sequenceShow/hide mRNA sequence
CTCGAAATTATGATTAAAAATATTAGTTATAATCTTACTCGATATTTCCATCGTGGATATTTCGAACTAGTGCTGTATTGCTGATTGTTGTTTCAAAGAGAAGCAAGCTT
GAGGATTGCTGCAATGGCGACGTTAACGAAGCATATCCATACTCACTTCTTCCAACCTCCTTCAACTACCTTCAATTTGCATTCGTGCAATAATCGTTCTCCCAAACGAA
TCTCGGCGGCTTTATCCTTTCGAAACTCTGATTCTCACTTCATTGAAGCTCAGAAACTCGTTGATGAATTCGACCCTGAAATTCCCTTGGAGAAGGCCGTCACTCCACCC
AGCTCCTGGTATATTGACCCTTCATTTTTTGCTCTCGAGCTCGATCGTGTCTTCTACAGAGGATGGCAGGCTGTAGGATATGTTGAACAGTTAAAAGATCCCCATGACTT
TTTCACAGGCAGGTTGGGAAATATAGAGTACGTGGTATGCAAAGATAATAACAGGAAGGTTCGTGCATTTCACAATGTTTGTCGCCATCATGCCTCACTTCTTGCGTCTG
GATGTGGGAAGAAGTCATGCTTTGTATGTCCATATCATGGATGGACATATGGGTTGGATGGAGTACTGGTTAAGGCGACTAGAATAAATGGGATACAGAACTTTGATGTA
AATGATTTTGGGCTCATACCATTACCAGTAGCTACATGGGGGCCTTTTGTTCTTCTCAATCTAGATGGAAAATTATCATCTGAGCTGGATGTCGATGAAGATAAAGTAGC
ACATCAATGGCTTGGAAGCTGTGCAGATGTGCTGAGTTTAAATGGAGTTGATACTTCTTTAAGTTATGTTTGTCGACGTGAATACACCATTGAATGTAATTGGAAGGTTT
TTTGTGACAACTACTTAGATGGAGGATATCACGTCCCCTATGCACATAAAGGGCTTGCATCAAATCTCAAGCTTGAGTCTTATTCTAGAGAAATATTTGAAACTGTTAGC
ATTCAAAGTTGTAATGGTGGGGGAGAATCAAAACGTGATGATTATGGTCGACTTGGACCAGAAGCACTATATGCTTTTATATACCCAAATTTCATGATAAATAGGTATGG
ACCTTGGATGGACACTAATCTAGTACTTCCACTTGGACCTCGAAAATTTCTCACTCGTGTTCTTGATATTAAAAAGACTGCTTGGAATGACGACTCTTATATACAACAAA
GTTTAGCAGACAGTGAAAGTGTGCAGAATGAAGACATTATTCTGTATGAAGGTGTTCAAAAGGGTCTCGAGTCACAGGCTTACACGTTTGGTCGATATGCTCCTTCGGTC
GAGAATGCCATGCACCATTTCCATCGTCTTCTTCATTTTAACCTCACAAATGAAAAATAGTTAAAAGAAGTTTGTTTCTTATAAAAAAGTTTGGATAGAAGTGAACTTTA
GTATTTTGGTACAAGACATCGAAAGATGTTTCTTTTAGGTTTTAGGTTTTGGTAGTTTATCCTAGAAATGATTCGTAATAAGTTGCCATTGCTGAAATGTCTTAAGTTGT
AAAAGTTTCTAATTGTTATGTTTTTGTTGTGGATGGGCAAAAAAGTACCAAAAGAACATACCTATTACCAATCCTCCTCAAGGAGGGGAAACAC
Protein sequenceShow/hide protein sequence
MATLTKHIHTHFFQPPSTTFNLHSCNNRSPKRISAALSFRNSDSHFIEAQKLVDEFDPEIPLEKAVTPPSSWYIDPSFFALELDRVFYRGWQAVGYVEQLKDPHDFFTGR
LGNIEYVVCKDNNRKVRAFHNVCRHHASLLASGCGKKSCFVCPYHGWTYGLDGVLVKATRINGIQNFDVNDFGLIPLPVATWGPFVLLNLDGKLSSELDVDEDKVAHQWL
GSCADVLSLNGVDTSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAHKGLASNLKLESYSREIFETVSIQSCNGGGESKRDDYGRLGPEALYAFIYPNFMINRYGPWMD
TNLVLPLGPRKFLTRVLDIKKTAWNDDSYIQQSLADSESVQNEDIILYEGVQKGLESQAYTFGRYAPSVENAMHHFHRLLHFNLTNEK