; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10017024 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10017024
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptioncarotene epsilon-monooxygenase, chloroplastic
Genome locationChr03:10331869..10337781
RNA-Seq ExpressionHG10017024
SyntenyHG10017024
Gene Ontology termsGO:0016117 - carotenoid biosynthetic process (biological process)
GO:0005506 - iron ion binding (molecular function)
GO:0009974 - zeinoxanthin epsilon hydroxylase activity (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR001128 - Cytochrome P450
IPR002401 - Cytochrome P450, E-class, group I
IPR017972 - Cytochrome P450, conserved site
IPR036396 - Cytochrome P450 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK07354.1 carotene epsilon-monooxygenase [Cucumis melo var. makuwa]9.9e-27090.49Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG
        MASTLCF S+TFPS+ LHKRIPL   T  PY SIKSS+DE  NPSTP KLKN TN  KS SWVSPDWLTSLTR ITLGQGDDSGIPIATAKLDDVSDLLG
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG

Query:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM
        GALFLPLFKWMNDYGPIYRLAAGPRNFV+VSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLW VRRRAVVPSLHKKYLSVIVDRVFCKCAM
Subjt:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM

Query:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI
        RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP                        IKALCKIIPRQIKAEEAVTVIRRTVEELI
Subjt:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI

Query:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL
        AKCKEIVE EGERI+EEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVD+VLQGRPPSYEDTKEL
Subjt:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL

Query:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
        KYLTRCILESMRLYPHPPVLIRRAQVAD LPGNYKVNAGQDIMISVYNIHRS QVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
Subjt:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL

Query:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        LEAIVALAIFLQH+NFELVPNQTIGMTTGATIHTTN
Subjt:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

XP_004143287.1 carotene epsilon-monooxygenase, chloroplastic [Cucumis sativus]1.7e-26990.13Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLS-PYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLL
        MAS+LCFPS+TFPS+ LHKRIPL   T   P LSIKSSIDE RN STP K+KN TN  KS SWVSPDWLTSLTRYITLGQGDDSGIP+ATAKLDDVSDLL
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLS-PYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLL

Query:  GGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCA
        GGALFLPLFKWMNDYGPIYRLAAGPRNFV+VSDP IAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLW VRRRAVVPSLHKKYLSVIVDRVFCKCA
Subjt:  GGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCA

Query:  MRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEEL
        MRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLS DSP                        IKALCKIIPRQIKAEEAVTVIR+TVEEL
Subjt:  MRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEEL

Query:  IAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKE
        IAKCKEIVE EGERI+EEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKE
Subjt:  IAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKE

Query:  LKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFA
        LKYLTRCILESMRLYPHPPVLIRRAQVAD+LPG+YKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFA
Subjt:  LKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFA

Query:  LLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        LLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
Subjt:  LLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

XP_008462512.1 PREDICTED: carotene epsilon-monooxygenase, chloroplastic [Cucumis melo]3.4e-27090.67Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG
        MASTLCF S+TFPS+ LHKRIPL   T  PY SIKSS+DE  NPSTP KLKN TN  KS SWVSPDWLTSLTR ITLGQGDDSGIPIATAKLDDVSDLLG
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG

Query:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM
        GALFLPLFKWMNDYGPIYRLAAGPRNFV+VSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLW VRRRAVVPSLHKKYLSVIVDRVFCKCAM
Subjt:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM

Query:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI
        RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP                        IKALCKIIPRQIKAEEAVTVIRRTVEELI
Subjt:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI

Query:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL
        AKCKEIVE EGERI+EEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL
Subjt:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL

Query:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
        KYLTRCILESMRLYPHPPVLIRRAQVAD LPGNYKVNAGQDIMISVYNIHRS QVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
Subjt:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL

Query:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        LEAIVALAIFLQH+NFELVPNQTIGMTTGATIHTTN
Subjt:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

XP_022143881.1 carotene epsilon-monooxygenase, chloroplastic [Momordica charantia]3.1e-27190.49Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG
        M+S LCFPS++F SA LHKR PLRRR   P+LSIKSSIDEG NP TPTKLKNSTN AKSGSWVSPDWLTSLTRYITLGQGDDSGIPIA+AKLDDVSDLLG
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG

Query:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM
        GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLW VRRRAVVPSLHKKYLSVIVDRVFCKCAM
Subjt:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM

Query:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI
        RLVEKL+KDALNNNSVNMEEKFSQLTLD+IGLSVFNYSFDSLSADSP                        IKALCKIIPRQIKAEEAVTVIRRTVEELI
Subjt:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI

Query:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL
        AKCKEIVETEGERIDEEEYVND DPSILRFLLASR+EVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK SSSL+KAQNEVDRVLQGRPPSYEDTKEL
Subjt:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL

Query:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
        K+L RCILESMRLYPHPPVLIRRA+VAD+LPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
Subjt:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL

Query:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        LEA+VALAIFLQHMNFELVPNQTIGMTTGATIHTTN
Subjt:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

XP_038882587.1 carotene epsilon-monooxygenase, chloroplastic [Benincasa hispida]1.0e-27491.98Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG
        MASTLCFPSITFPS+PLH RIPLRRRT SP+  IKSSIDEG+NPS   KLKNST TAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG

Query:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM
        GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLW VRRRAVVPSLHKKYLSVIVD+VFCKCAM
Subjt:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM

Query:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI
        RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP                        IKALCKIIPRQIKAEEAVTVIRRTVEELI
Subjt:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI

Query:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL
        AKCKEIVETEGERIDEEEYVNDADPSILRFLLASRE+VSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSL+KA+NEVDRVLQGRPPSYEDTKEL
Subjt:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL

Query:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
        KYLTRCILESMRLYPHPPVLIRRAQVAD+LPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDL GPVPNESNTDFRFIPFSGGPRKCVGDQFAL
Subjt:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL

Query:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        LEAIVALAIFLQHMNFELVPNQ+IGMTTGATIHTTN
Subjt:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

TrEMBL top hitse value%identityAlignment
A0A0A0KIH3 Uncharacterized protein8.2e-27090.13Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLS-PYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLL
        MAS+LCFPS+TFPS+ LHKRIPL   T   P LSIKSSIDE RN STP K+KN TN  KS SWVSPDWLTSLTRYITLGQGDDSGIP+ATAKLDDVSDLL
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLS-PYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLL

Query:  GGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCA
        GGALFLPLFKWMNDYGPIYRLAAGPRNFV+VSDP IAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLW VRRRAVVPSLHKKYLSVIVDRVFCKCA
Subjt:  GGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCA

Query:  MRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEEL
        MRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLS DSP                        IKALCKIIPRQIKAEEAVTVIR+TVEEL
Subjt:  MRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEEL

Query:  IAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKE
        IAKCKEIVE EGERI+EEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKE
Subjt:  IAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKE

Query:  LKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFA
        LKYLTRCILESMRLYPHPPVLIRRAQVAD+LPG+YKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFA
Subjt:  LKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFA

Query:  LLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        LLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
Subjt:  LLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

A0A1S3CIN0 carotene epsilon-monooxygenase, chloroplastic1.7e-27090.67Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG
        MASTLCF S+TFPS+ LHKRIPL   T  PY SIKSS+DE  NPSTP KLKN TN  KS SWVSPDWLTSLTR ITLGQGDDSGIPIATAKLDDVSDLLG
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG

Query:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM
        GALFLPLFKWMNDYGPIYRLAAGPRNFV+VSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLW VRRRAVVPSLHKKYLSVIVDRVFCKCAM
Subjt:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM

Query:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI
        RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP                        IKALCKIIPRQIKAEEAVTVIRRTVEELI
Subjt:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI

Query:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL
        AKCKEIVE EGERI+EEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL
Subjt:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL

Query:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
        KYLTRCILESMRLYPHPPVLIRRAQVAD LPGNYKVNAGQDIMISVYNIHRS QVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
Subjt:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL

Query:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        LEAIVALAIFLQH+NFELVPNQTIGMTTGATIHTTN
Subjt:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

A0A5A7SKS2 Carotene epsilon-monooxygenase1.8e-26990.3Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG
        MASTLCF S+TFPS+ LHKRIPL   T  PY SIKSS+DE  NPSTP KLKN TN  KS SWVSPDWLTSLTR ITLGQGDDSGIPIATAKLDDVSDLLG
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG

Query:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM
        GALFLPLFKWMNDYGPIYRLAAGPRNFV+VSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLW VRRRAVVPSLHKKYLSVIVDRVFCKCAM
Subjt:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM

Query:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI
        RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP                        IKALCKIIPRQIKAEEAV VIRRTVEELI
Subjt:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI

Query:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL
        AKCKEIVE EGERI+EEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVD+VLQGRPPSYEDTKEL
Subjt:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL

Query:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
        KYLTRCILESMRLYPHPPVLIRRAQVAD LPGNYKVNAGQDIMISVYNIHRS QVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
Subjt:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL

Query:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        LEAIVALAIFLQH+NFELVPNQTIGMTTGATIHTTN
Subjt:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

A0A5D3C7X9 Carotene epsilon-monooxygenase4.8e-27090.49Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG
        MASTLCF S+TFPS+ LHKRIPL   T  PY SIKSS+DE  NPSTP KLKN TN  KS SWVSPDWLTSLTR ITLGQGDDSGIPIATAKLDDVSDLLG
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG

Query:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM
        GALFLPLFKWMNDYGPIYRLAAGPRNFV+VSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLW VRRRAVVPSLHKKYLSVIVDRVFCKCAM
Subjt:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM

Query:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI
        RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP                        IKALCKIIPRQIKAEEAVTVIRRTVEELI
Subjt:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI

Query:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL
        AKCKEIVE EGERI+EEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVD+VLQGRPPSYEDTKEL
Subjt:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL

Query:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
        KYLTRCILESMRLYPHPPVLIRRAQVAD LPGNYKVNAGQDIMISVYNIHRS QVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
Subjt:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL

Query:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        LEAIVALAIFLQH+NFELVPNQTIGMTTGATIHTTN
Subjt:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

A0A6J1CRM7 carotene epsilon-monooxygenase, chloroplastic1.5e-27190.49Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG
        M+S LCFPS++F SA LHKR PLRRR   P+LSIKSSIDEG NP TPTKLKNSTN AKSGSWVSPDWLTSLTRYITLGQGDDSGIPIA+AKLDDVSDLLG
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLG

Query:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM
        GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLW VRRRAVVPSLHKKYLSVIVDRVFCKCAM
Subjt:  GALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAM

Query:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI
        RLVEKL+KDALNNNSVNMEEKFSQLTLD+IGLSVFNYSFDSLSADSP                        IKALCKIIPRQIKAEEAVTVIRRTVEELI
Subjt:  RLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEELI

Query:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL
        AKCKEIVETEGERIDEEEYVND DPSILRFLLASR+EVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK SSSL+KAQNEVDRVLQGRPPSYEDTKEL
Subjt:  AKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKEL

Query:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
        K+L RCILESMRLYPHPPVLIRRA+VAD+LPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL
Subjt:  KYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFAL

Query:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        LEA+VALAIFLQHMNFELVPNQTIGMTTGATIHTTN
Subjt:  LEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

SwissProt top hitse value%identityAlignment
O23365 Cytochrome P450 97B3, chloroplastic7.8e-9237.91Show/hide
Query:  MASTLCFP-SITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLL
        M + + FP + T+P+      + L R     +     +I    +      +K  +   K+   +  +    LT +  L  G    +P A      VSDL 
Subjt:  MASTLCFP-SITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLL

Query:  GGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKC
        G  LFL L+ W  ++G IY+LA GP+ FVV+SDP IA+HVLR N  +Y KG+++E+ E + G G   A+   W +RRRA+ P+ HK YL  +V +VF  C
Subjt:  GGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKC

Query:  AMRLVEKLEK--------DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP-IKALCK-----------------------IIPRQIKAEEAVT
        + +++ K EK           +   +++E +FS L LD+IGLSVFNY F S++ +SP IKA+                         I+PRQ K +  + 
Subjt:  AMRLVEKLEK--------DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP-IKALCK-----------------------IIPRQIKAEEAVT

Query:  VIRRTVEELIAKCKEI-VETEGERIDEEEYVNDADPSILRFLLASR-EEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQ
        +I   ++ LI   KE   ET+ E++ E +Y N  D S+LRFL+  R  ++   QLRDDL++ML+AGHETT +VLTW ++LLS++   + KAQ E+D VL 
Subjt:  VIRRTVEELIAKCKEI-VETEGERIDEEEYVNDADPSILRFLLASR-EEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQ

Query:  GRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADVLPG-------NYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERF-------DLEG---
          PP+YE  K+L+Y+   ++E +RL+P PP+LIRR    + LPG        +KV  G DI ISVYN+HRS   W+   +F PERF        +EG   
Subjt:  GRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADVLPG-------NYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERF-------DLEG---

Query:  ---------PVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTTN
                   PNE   DF F+PF GGPRKC+GDQFAL+E+ VALA+  Q  + EL    +++ + +GATIH  N
Subjt:  ---------PVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTTN

O48921 Cytochrome P450 97B2, chloroplastic2.0e-9539.4Show/hide
Query:  TFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLK-NSTNTAKSGSWVSPDWLTSLTRYIT--LGQGDDSGIPIATAKLDDVSDLLGGALFLPL
        T   A LH R   R    + + S+         P   + ++  S NT K  S  S + L + +  +T  L  G    +PIA      VSDLLG  LF  L
Subjt:  TFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLK-NSTNTAKSGSWVSPDWLTSLTRYIT--LGQGDDSGIPIATAKLDDVSDLLGGALFLPL

Query:  FKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKL
        + W  ++G +Y+LA GP+ FVVVSDP +A+H+LR N  +Y KG+++++ E + G G   A+   W  RRR + P+ H  YL  +V ++F  C+ R + K 
Subjt:  FKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKL

Query:  EK-------DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP-IKALCK-----------------------IIPRQIKAEEAVTVIRRTVEEL
         K       D  ++  +++E +FS L LD+IGL VFNY F S++ +SP IKA+                         I+PRQ K ++ + VI   ++ L
Subjt:  EK-------DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP-IKALCK-----------------------IIPRQIKAEEAVTVIRRTVEEL

Query:  IAKCKEI-VETEGERIDEEEYVNDADPSILRFLLASR-EEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDT
        I   KE   ET+ E++ + +Y+N  D S+LRFL+  R  +V   QLRDDL++ML+AGHETT +VLTW ++LL+++ S + KAQ EVD VL    P++E  
Subjt:  IAKCKEI-VETEGERIDEEEYVNDADPSILRFLLASR-EEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDT

Query:  KELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYK-------VNAGQDIMISVYNIHRSSQVWEQAEEFIPERF-------DLEG------------
        KEL+Y+   ++E++RLYP PP+LIRR+  +DVLPG +K       + AG D+ ISVYN+HRS   W++ ++F PERF       ++EG            
Subjt:  KELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYK-------VNAGQDIMISVYNIHRSSQVWEQAEEFIPERF-------DLEG------------

Query:  PVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTTN
          PNE  +DF F+PF GGPRKCVGDQFAL+E+ VAL + LQ+ + EL    +++ + TGATIHT N
Subjt:  PVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTTN

Q43078 Cytochrome P450 97B1, chloroplastic5.4e-8540.55Show/hide
Query:  LTSLTRYITLGQGDDSGIPIATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIA
        LTSL     LG      +PIA      V+DL    LF  L+ W  ++G +Y+LA GP+ FVVVSDP +A+H+LR N  +Y KG+++++ E + G G   A
Subjt:  LTSLTRYITLGQGDDSGIPIATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIA

Query:  EGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLE-------KDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP-IKALCK-----
        +   W  RRR + P  H  YL  +V ++F  C+ R V K+        +D   +  +++E +FS L L++IGL VFNY F S++ +SP IKA+       
Subjt:  EGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLE-------KDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP-IKALCK-----

Query:  ------------------IIPRQIKAEEAVTVIRRTVEELIAKCKEI-VETEGERIDEEEYVNDADPSILRFLLASR-EEVSSVQLRDDLLSMLVAGHET
                          I+PRQ K ++ + VI   ++ LI   KE   ET+ E++ + +Y N  D S+LRFL+  R  +V   QLRDDL++ML+AGHET
Subjt:  ------------------IIPRQIKAEEAVTVIRRTVEELIAKCKEI-VETEGERIDEEEYVNDADPSILRFLLASR-EEVSSVQLRDDLLSMLVAGHET

Query:  TGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYK-------VNAGQDIMISVYNIH
        T +VLTW ++LL+++   + KAQ EVD VL    P++E  K+L+Y+   ++E++RLYP PP+LIRR+   DVLPG +K       + AG D+ ISVYN+H
Subjt:  TGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYK-------VNAGQDIMISVYNIH

Query:  RSSQVWEQAEEFIPERF-------DLEG------------PVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVAL
        RS   W++  +F PERF       ++EG              PNE  +DF F+PF GGPRKCVGDQFAL+E+ VAL
Subjt:  RSSQVWEQAEEFIPERF-------DLEG------------PVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVAL

Q6TBX7 Carotene epsilon-monooxygenase, chloroplastic4.4e-22073.79Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLSP--YLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDL
        M S+L  PS +  S+ L    P R  +  P    SI+SSI++        K K  TN++KS SWVSPDWLT+LTR ++ G+ D+SGIPIA AKLDDV+DL
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLSP--YLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDL

Query:  LGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKC
        LGGALFLPL+KWMN+YGPIYRLAAGPRNFV+VSDPAIAKHVLRNY  YAKGLV+EVSEFLFGSGFAIAEGPLW  RRRAVVPSLH++YLSVIV+RVFCKC
Subjt:  LGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKC

Query:  AMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEE
        A RLVEKL+  A + ++VNME KFSQ+TLDVIGLS+FNY+FDSL+ DSP                        I ALCKI+PRQ+KAE+AVT+IR TVE+
Subjt:  AMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEE

Query:  LIAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTK
        LIAKCKEIVE EGERI++EEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK+SS+L KAQ EVDRVL+GR P++ED K
Subjt:  LIAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTK

Query:  ELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQF
        ELKY+TRCI ESMRLYPHPPVLIRRAQV D+LPGNYKVN GQDIMISVYNIHRSS+VWE+AEEF+PERFD++G +PNE+NTDF+FIPFSGGPRKCVGDQF
Subjt:  ELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQF

Query:  ALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        AL+EAIVALA+FLQ +N ELVP+QTI MTTGATIHTTN
Subjt:  ALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

Q93VK5 Protein LUTEIN DEFICIENT 5, chloroplastic3.4e-11146.72Show/hide
Query:  GDDSGIPIATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAV
        G D   P        +  +   A F+PL++    YG I+RL  GP++F++VSDP+IAKH+L+ N   Y+KG+++E+ +F+ G G   A+G +W  RRRA+
Subjt:  GDDSGIPIATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAV

Query:  VPSLHKKYLSVIVDRVFCKCAMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADS-PIKALCKII----------------------
        VP+LH+KY++ ++  +F + + RL +KL+  AL    V ME  FS+LTLD+IG +VFNY FDSL+ D+  I+A+  ++                      
Subjt:  VPSLHKKYLSVIVDRVFCKCAMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADS-PIKALCKII----------------------

Query:  -PRQIKAEEAVTVIRRTVEELIAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVK
         PRQ K   ++ +I  T+++LIA CK +VE E E    EEY+N+ DPSIL FLLAS ++VSS QLRDDL++ML+AGHET+ +VLTWT YLL+   S + K
Subjt:  -PRQIKAEEAVTVIRRTVEELIAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVK

Query:  AQNEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESN
         Q EVD V+  R P+ +D K+LKY TR + ES+RLYP PPVLIRR+   D+L G Y +  G+DI ISV+N+HRS   W+ AE+F PER+ L+GP PNE+N
Subjt:  AQNEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESN

Query:  TDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTT
         +F ++PF GGPRKC+GD FA  E +VA+A+ ++  NF++ P    + MTTGATIHTT
Subjt:  TDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTT

Arabidopsis top hitse value%identityAlignment
AT1G31800.1 cytochrome P450, family 97, subfamily A, polypeptide 32.4e-11246.72Show/hide
Query:  GDDSGIPIATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAV
        G D   P        +  +   A F+PL++    YG I+RL  GP++F++VSDP+IAKH+L+ N   Y+KG+++E+ +F+ G G   A+G +W  RRRA+
Subjt:  GDDSGIPIATAKLDDVSDLLGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAV

Query:  VPSLHKKYLSVIVDRVFCKCAMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADS-PIKALCKII----------------------
        VP+LH+KY++ ++  +F + + RL +KL+  AL    V ME  FS+LTLD+IG +VFNY FDSL+ D+  I+A+  ++                      
Subjt:  VPSLHKKYLSVIVDRVFCKCAMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADS-PIKALCKII----------------------

Query:  -PRQIKAEEAVTVIRRTVEELIAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVK
         PRQ K   ++ +I  T+++LIA CK +VE E E    EEY+N+ DPSIL FLLAS ++VSS QLRDDL++ML+AGHET+ +VLTWT YLL+   S + K
Subjt:  -PRQIKAEEAVTVIRRTVEELIAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVK

Query:  AQNEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESN
         Q EVD V+  R P+ +D K+LKY TR + ES+RLYP PPVLIRR+   D+L G Y +  G+DI ISV+N+HRS   W+ AE+F PER+ L+GP PNE+N
Subjt:  AQNEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESN

Query:  TDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTT
         +F ++PF GGPRKC+GD FA  E +VA+A+ ++  NF++ P    + MTTGATIHTT
Subjt:  TDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTT

AT1G67110.1 cytochrome P450, family 735, subfamily A, polypeptide 23.2e-3226.41Show/hide
Query:  WMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYG--TYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLE
        W   YG  + +  G    + +++  + K +L  +   T    L  + ++   G G  +A G  W  +R    P+  +  L      +  +C   + E+L 
Subjt:  WMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYG--TYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLE

Query:  KDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS----LSADSPIKALCKIIPRQI----------KAEEAVTVIRRTVEELIAKCKEIVETEGERID--
        K+      V + E+  +LT D+I  + F  S D      S  + ++ LC    R +          K    +  ++  VE L+    EI+++  + ++  
Subjt:  KDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDS----LSADSPIKALCKIIPRQI----------KAEEAVTVIRRTVEELIAKCKEIVETEGERID--

Query:  -EEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVL-QGRPPSYEDTKELKYLTRCILESMRL
            Y +D    +L  + +++  ++   + D+  +    GHETT  +LTWTL LL+ + +     ++EV +V  Q   PS E    L  L + I ES+RL
Subjt:  -EEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVL-QGRPPSYEDTKELKYLTRCILESMRL

Query:  YPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVW-EQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQ
        YP P  L+ R    D+  G+  +  G  I I V  IH S+++W E A EF PERF       +       F+PF+ GPR C+G  FA++EA + LA+ + 
Subjt:  YPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVW-EQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQ

Query:  HMNFELVPN
          +F +  N
Subjt:  HMNFELVPN

AT2G26710.1 Cytochrome P450 superfamily protein2.1e-3629.02Show/hide
Query:  WMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLEKD
        W   YG  + +  GP   + V+DP + + +      Y K     + + L G G    +G  W   R+ + P+ H + L ++V  V  K    +V+K    
Subjt:  WMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLEKD

Query:  ALNNNSVNME--EKFSQLTLDVIGLSVFNYSFDSLSADSPIKA----LC------------KIIPRQ--IKAEEAVTVIRRTVEELIAKCKE-IVETEGE
           N  V ++  E F  LT DVI  + F  S++   A   ++A    LC            +  P +  +K+ +    IR+++ +LI + ++  ++ EGE
Subjt:  ALNNNSVNME--EKFSQLTLDVIGLSVFNYSFDSLSADSPIKA----LC------------KIIPRQ--IKAEEAVTVIRRTVEELIAKCKE-IVETEGE

Query:  RIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRP-PSYEDTKELKYLTRCILESM
           E    +      L  L+   + V+   + ++  S   AG +TT ++LTWT  LLS H     KA++EV RV   R  P+ +   +LK L+  + ES+
Subjt:  RIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRP-PSYEDTKELKYLTRCILESM

Query:  RLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVW-EQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIF
        RLYP     IRRA+ +DV  G YK+  G +++I +  +H    +W     EF P RF     VP  +     FIPF  G R C+G   A+L+A + LA+ 
Subjt:  RLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVW-EQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIF

Query:  LQHMNFELVP
        +Q   F L P
Subjt:  LQHMNFELVP

AT3G53130.1 Cytochrome P450 superfamily protein3.1e-22173.79Show/hide
Query:  MASTLCFPSITFPSAPLHKRIPLRRRTLSP--YLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDL
        M S+L  PS +  S+ L    P R  +  P    SI+SSI++        K K  TN++KS SWVSPDWLT+LTR ++ G+ D+SGIPIA AKLDDV+DL
Subjt:  MASTLCFPSITFPSAPLHKRIPLRRRTLSP--YLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDL

Query:  LGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKC
        LGGALFLPL+KWMN+YGPIYRLAAGPRNFV+VSDPAIAKHVLRNY  YAKGLV+EVSEFLFGSGFAIAEGPLW  RRRAVVPSLH++YLSVIV+RVFCKC
Subjt:  LGGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKC

Query:  AMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEE
        A RLVEKL+  A + ++VNME KFSQ+TLDVIGLS+FNY+FDSL+ DSP                        I ALCKI+PRQ+KAE+AVT+IR TVE+
Subjt:  AMRLVEKLEKDALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP------------------------IKALCKIIPRQIKAEEAVTVIRRTVEE

Query:  LIAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTK
        LIAKCKEIVE EGERI++EEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSK+SS+L KAQ EVDRVL+GR P++ED K
Subjt:  LIAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTK

Query:  ELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQF
        ELKY+TRCI ESMRLYPHPPVLIRRAQV D+LPGNYKVN GQDIMISVYNIHRSS+VWE+AEEF+PERFD++G +PNE+NTDF+FIPFSGGPRKCVGDQF
Subjt:  ELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQF

Query:  ALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN
        AL+EAIVALA+FLQ +N ELVP+QTI MTTGATIHTTN
Subjt:  ALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTN

AT4G15110.1 cytochrome P450, family 97, subfamily B, polypeptide 35.5e-9337.91Show/hide
Query:  MASTLCFP-SITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLL
        M + + FP + T+P+      + L R     +     +I    +      +K  +   K+   +  +    LT +  L  G    +P A      VSDL 
Subjt:  MASTLCFP-SITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLL

Query:  GGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKC
        G  LFL L+ W  ++G IY+LA GP+ FVV+SDP IA+HVLR N  +Y KG+++E+ E + G G   A+   W +RRRA+ P+ HK YL  +V +VF  C
Subjt:  GGALFLPLFKWMNDYGPIYRLAAGPRNFVVVSDPAIAKHVLR-NYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKC

Query:  AMRLVEKLEK--------DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP-IKALCK-----------------------IIPRQIKAEEAVT
        + +++ K EK           +   +++E +FS L LD+IGLSVFNY F S++ +SP IKA+                         I+PRQ K +  + 
Subjt:  AMRLVEKLEK--------DALNNNSVNMEEKFSQLTLDVIGLSVFNYSFDSLSADSP-IKALCK-----------------------IIPRQIKAEEAVT

Query:  VIRRTVEELIAKCKEI-VETEGERIDEEEYVNDADPSILRFLLASR-EEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQ
        +I   ++ LI   KE   ET+ E++ E +Y N  D S+LRFL+  R  ++   QLRDDL++ML+AGHETT +VLTW ++LLS++   + KAQ E+D VL 
Subjt:  VIRRTVEELIAKCKEI-VETEGERIDEEEYVNDADPSILRFLLASR-EEVSSVQLRDDLLSMLVAGHETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQ

Query:  GRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADVLPG-------NYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERF-------DLEG---
          PP+YE  K+L+Y+   ++E +RL+P PP+LIRR    + LPG        +KV  G DI ISVYN+HRS   W+   +F PERF        +EG   
Subjt:  GRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADVLPG-------NYKVNAGQDIMISVYNIHRSSQVWEQAEEFIPERF-------DLEG---

Query:  ---------PVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTTN
                   PNE   DF F+PF GGPRKC+GDQFAL+E+ VALA+  Q  + EL    +++ + +GATIH  N
Subjt:  ---------PVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPN-QTIGMTTGATIHTTN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACTCTCTGTTTTCCCTCTATCACTTTCCCCTCTGCTCCCCTCCACAAACGAATCCCTCTCAGACGAAGAACCCTATCTCCATATCTCTCCATTAAATCCTC
CATTGACGAAGGACGAAATCCCTCAACGCCCACAAAGCTTAAGAACTCCACCAACACTGCAAAATCCGGTTCCTGGGTCAGCCCTGATTGGCTCACCTCTCTCACTCGCT
ACATTACTCTAGGGCAGGGCGACGACTCCGGCATCCCCATTGCAACTGCCAAGCTCGATGACGTTTCTGATCTTCTTGGCGGTGCCCTTTTCCTTCCACTCTTCAAGTGG
ATGAATGACTATGGACCCATTTACAGGCTCGCTGCTGGCCCTAGAAATTTCGTCGTCGTTAGTGATCCCGCCATTGCTAAGCACGTTCTCAGGAATTATGGGACTTACGC
TAAAGGCCTTGTTTCTGAGGTTTCCGAGTTCTTGTTTGGGTCGGGTTTCGCCATTGCAGAAGGCCCTCTCTGGATGGTTCGCCGTAGGGCTGTGGTTCCATCTCTTCACA
AGAAGTACTTATCGGTTATTGTTGATCGAGTATTTTGTAAATGTGCCATGAGATTGGTGGAGAAGCTGGAAAAGGATGCATTAAATAATAATTCGGTTAACATGGAGGAA
AAGTTTTCTCAACTAACTCTTGATGTTATTGGTCTATCTGTATTCAACTACAGTTTTGATTCTCTCTCTGCTGACAGCCCTATTAAGGCTCTGTGTAAGATAATCCCAAG
ACAGATAAAAGCTGAAGAAGCAGTTACAGTGATCAGGAGAACTGTTGAAGAACTCATTGCCAAGTGCAAAGAAATTGTTGAAACTGAGGGTGAGCGTATTGATGAGGAGG
AATATGTGAACGATGCTGATCCAAGCATCCTCCGATTTCTGCTGGCCAGTAGAGAAGAGGTCTCAAGTGTACAATTACGAGATGATCTATTGTCCATGTTGGTTGCTGGA
CATGAAACTACTGGCTCTGTTCTGACTTGGACACTGTATCTTTTAAGTAAGCATTCCTCATCATTGGTCAAGGCACAAAATGAAGTTGATAGAGTCTTACAAGGAAGGCC
TCCTTCTTATGAAGATACGAAGGAACTTAAATATTTGACACGTTGTATCCTTGAGTCAATGCGTCTTTACCCACATCCACCTGTTTTGATAAGAAGAGCTCAAGTGGCTG
ACGTACTCCCTGGAAATTACAAGGTTAACGCTGGTCAAGATATCATGATTTCAGTATATAACATCCATCGCTCTTCCCAGGTCTGGGAACAAGCAGAAGAGTTTATACCA
GAAAGATTTGACTTGGAAGGCCCTGTGCCTAATGAAAGCAATACAGATTTCAGATTTATTCCGTTCAGCGGTGGGCCTCGAAAGTGTGTTGGTGATCAATTTGCCCTGCT
TGAAGCTATTGTTGCACTTGCCATATTTCTACAGCATATGAATTTCGAGCTGGTTCCGAATCAGACCATTGGGATGACTACTGGAGCAACTATACATACAACAAATCTCG
GATCCCCAGTAGTACCTGCAACTGTGATTTCTTCTGATTATTATTTCAACAATTCATTTCCGACTTGTGTTCTTGGTATTTCAGGCTGTGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCACTCTCTGTTTTCCCTCTATCACTTTCCCCTCTGCTCCCCTCCACAAACGAATCCCTCTCAGACGAAGAACCCTATCTCCATATCTCTCCATTAAATCCTC
CATTGACGAAGGACGAAATCCCTCAACGCCCACAAAGCTTAAGAACTCCACCAACACTGCAAAATCCGGTTCCTGGGTCAGCCCTGATTGGCTCACCTCTCTCACTCGCT
ACATTACTCTAGGGCAGGGCGACGACTCCGGCATCCCCATTGCAACTGCCAAGCTCGATGACGTTTCTGATCTTCTTGGCGGTGCCCTTTTCCTTCCACTCTTCAAGTGG
ATGAATGACTATGGACCCATTTACAGGCTCGCTGCTGGCCCTAGAAATTTCGTCGTCGTTAGTGATCCCGCCATTGCTAAGCACGTTCTCAGGAATTATGGGACTTACGC
TAAAGGCCTTGTTTCTGAGGTTTCCGAGTTCTTGTTTGGGTCGGGTTTCGCCATTGCAGAAGGCCCTCTCTGGATGGTTCGCCGTAGGGCTGTGGTTCCATCTCTTCACA
AGAAGTACTTATCGGTTATTGTTGATCGAGTATTTTGTAAATGTGCCATGAGATTGGTGGAGAAGCTGGAAAAGGATGCATTAAATAATAATTCGGTTAACATGGAGGAA
AAGTTTTCTCAACTAACTCTTGATGTTATTGGTCTATCTGTATTCAACTACAGTTTTGATTCTCTCTCTGCTGACAGCCCTATTAAGGCTCTGTGTAAGATAATCCCAAG
ACAGATAAAAGCTGAAGAAGCAGTTACAGTGATCAGGAGAACTGTTGAAGAACTCATTGCCAAGTGCAAAGAAATTGTTGAAACTGAGGGTGAGCGTATTGATGAGGAGG
AATATGTGAACGATGCTGATCCAAGCATCCTCCGATTTCTGCTGGCCAGTAGAGAAGAGGTCTCAAGTGTACAATTACGAGATGATCTATTGTCCATGTTGGTTGCTGGA
CATGAAACTACTGGCTCTGTTCTGACTTGGACACTGTATCTTTTAAGTAAGCATTCCTCATCATTGGTCAAGGCACAAAATGAAGTTGATAGAGTCTTACAAGGAAGGCC
TCCTTCTTATGAAGATACGAAGGAACTTAAATATTTGACACGTTGTATCCTTGAGTCAATGCGTCTTTACCCACATCCACCTGTTTTGATAAGAAGAGCTCAAGTGGCTG
ACGTACTCCCTGGAAATTACAAGGTTAACGCTGGTCAAGATATCATGATTTCAGTATATAACATCCATCGCTCTTCCCAGGTCTGGGAACAAGCAGAAGAGTTTATACCA
GAAAGATTTGACTTGGAAGGCCCTGTGCCTAATGAAAGCAATACAGATTTCAGATTTATTCCGTTCAGCGGTGGGCCTCGAAAGTGTGTTGGTGATCAATTTGCCCTGCT
TGAAGCTATTGTTGCACTTGCCATATTTCTACAGCATATGAATTTCGAGCTGGTTCCGAATCAGACCATTGGGATGACTACTGGAGCAACTATACATACAACAAATCTCG
GATCCCCAGTAGTACCTGCAACTGTGATTTCTTCTGATTATTATTTCAACAATTCATTTCCGACTTGTGTTCTTGGTATTTCAGGCTGTGCTTGA
Protein sequenceShow/hide protein sequence
MASTLCFPSITFPSAPLHKRIPLRRRTLSPYLSIKSSIDEGRNPSTPTKLKNSTNTAKSGSWVSPDWLTSLTRYITLGQGDDSGIPIATAKLDDVSDLLGGALFLPLFKW
MNDYGPIYRLAAGPRNFVVVSDPAIAKHVLRNYGTYAKGLVSEVSEFLFGSGFAIAEGPLWMVRRRAVVPSLHKKYLSVIVDRVFCKCAMRLVEKLEKDALNNNSVNMEE
KFSQLTLDVIGLSVFNYSFDSLSADSPIKALCKIIPRQIKAEEAVTVIRRTVEELIAKCKEIVETEGERIDEEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAG
HETTGSVLTWTLYLLSKHSSSLVKAQNEVDRVLQGRPPSYEDTKELKYLTRCILESMRLYPHPPVLIRRAQVADVLPGNYKVNAGQDIMISVYNIHRSSQVWEQAEEFIP
ERFDLEGPVPNESNTDFRFIPFSGGPRKCVGDQFALLEAIVALAIFLQHMNFELVPNQTIGMTTGATIHTTNLGSPVVPATVISSDYYFNNSFPTCVLGISGCA