; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029770 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029770
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCytochrome P450 97B2
Genome locationtig00153490:1069844..1089206
RNA-Seq ExpressionSgr029770
SyntenySgr029770
Gene Ontology termsGO:0004497 - monooxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR001128 - Cytochrome P450
IPR036396 - Cytochrome P450 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136416.1 cytochrome P450 97B2, chloroplastic isoform X1 [Momordica charantia]8.5e-16879.5Show/hide
Query:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD
        AVFHGNLQ NEFGFVGM RQP+VCS TTATAK+ KSNLRG VVRCQS RTDE KTKRNLLDNASNLLTN LSGGNLGSMPIAEGAVSDLFGRPLFFALYD
Subjt:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD

Query:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL
        WFLEHGSVYKLAFGPKAFVVVSDPIVARY+LRENAFGYDKGVLADILEPIMGKGLIPADL TWKQRRRVIAPGFHALYLEAMTKVF +CSERS+LKLEKL
Subjt:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL

Query:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------
        LGEGE  E+KTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK                                                    
Subjt:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------

Query:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
              E DVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAV+LLAQNPSKMKKAQAEIDLVLG G  TFES KAL
Subjt:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

XP_022136417.1 cytochrome P450 97B2, chloroplastic isoform X2 [Momordica charantia]8.5e-17692.98Show/hide
Query:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD
        AVFHGNLQ NEFGFVGM RQP+VCS TTATAK+ KSNLRG VVRCQS RTDE KTKRNLLDNASNLLTN LSGGNLGSMPIAEGAVSDLFGRPLFFALYD
Subjt:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD

Query:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL
        WFLEHGSVYKLAFGPKAFVVVSDPIVARY+LRENAFGYDKGVLADILEPIMGKGLIPADL TWKQRRRVIAPGFHALYLEAMTKVF +CSERS+LKLEKL
Subjt:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL

Query:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIKETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETT
        LGEGE  E+KTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIKE DVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETT
Subjt:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIKETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETT

Query:  AAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
        AAVLTWAV+LLAQNPSKMKKAQAEIDLVLG G  TFES KAL
Subjt:  AAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

XP_022980002.1 cytochrome P450 97B2, chloroplastic [Cucurbita maxima]5.5e-15976.25Show/hide
Query:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD
        AVF+GNLQ NEFGFVGMPR+P         AKYLKSN RG V+RCQST+ DE KTKRNLLDNASNLLTN L+GGNLGSMPIAEGAVSDLFGRPLFFALYD
Subjt:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD

Query:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL
        WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAF YDKGVLADILEPIMGKGLIPADL TWKQRRRVIAPGFHALYLEAM KVFA+CSERSI KLEKL
Subjt:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL

Query:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------
        LGEGE+ E+KTIELDMEAEFS+LALDIIGLGVFNYDFGSVTKESPVIK                                                    
Subjt:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------

Query:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
              E DVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQ E+DLVLG GRP FE  K L
Subjt:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

XP_023529541.1 cytochrome P450 97B2, chloroplastic [Cucurbita pepo subsp. pepo]2.8e-15876.25Show/hide
Query:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD
        AVF+GNLQ NEFGFVGMPR+P         AKYLKSN RG V+RCQST+ DE KTKRNLLDNASNLLTN L+GGNLGSMPIAEGAVSDLFGRPLFFALYD
Subjt:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD

Query:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL
        WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAF YDKGVLADILEPIMGKGLIPADL TWKQRRRVIAPGFHALYLEAM KVFA+CSERSI KLEKL
Subjt:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL

Query:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------
        LGEGE QE+KTIELDMEAEFS+LALDIIGLGVFNYDFGSVTKESPVIK                                                    
Subjt:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------

Query:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
              E DVEKLQQRDYLNLKD SLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMK+AQ EIDLVLG GRP FE  K L
Subjt:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

XP_038897993.1 cytochrome P450 97B2, chloroplastic [Benincasa hispida]1.2e-16177.25Show/hide
Query:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD
        A FHGNLQ NEFG++G+PRQP          K+LKSNLR P+ RCQST  DE KTKRNLLDNASNLLTN LSGGNLGSMPIAEGAVSDLFGRPLFFALYD
Subjt:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD

Query:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL
        WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADL TWKQRRRVIAPGFHALYLEAMTKVF +CSERSILKLEKL
Subjt:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL

Query:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------
        LGEGELQE KTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK                                                    
Subjt:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------

Query:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
              E DVEKLQQRDYLNLKDASLLRFLVDMRG DVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLG GRPTFE  K L
Subjt:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

TrEMBL top hitse value%identityAlignment
A0A0A0KVB5 Uncharacterized protein6.6e-15876Show/hide
Query:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD
        A+FHGN   NE GF+GM RQP          KYLK NLR PVVRCQST   E KTKRNLLDNASNLLTN LSGGNLGSMPIAEGAVSDLFGRPLFFALYD
Subjt:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD

Query:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL
        WFLEHGSVYKLAFGPKAFVVVSDPIVA+YILRENAF YDKGVLADILEPIMGKGLIPADL TWKQRRRVIAPGFH  YLEAMTKVFA+CSERSILKLEKL
Subjt:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL

Query:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------
        LGEGELQ++KTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK                                                    
Subjt:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------

Query:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
              E DVEKLQQRDYLNLKDASLLRFLVDMRG DVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLG G+PTFE FK L
Subjt:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

A0A6J1C3G0 cytochrome P450 97B2, chloroplastic isoform X14.1e-16879.5Show/hide
Query:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD
        AVFHGNLQ NEFGFVGM RQP+VCS TTATAK+ KSNLRG VVRCQS RTDE KTKRNLLDNASNLLTN LSGGNLGSMPIAEGAVSDLFGRPLFFALYD
Subjt:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD

Query:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL
        WFLEHGSVYKLAFGPKAFVVVSDPIVARY+LRENAFGYDKGVLADILEPIMGKGLIPADL TWKQRRRVIAPGFHALYLEAMTKVF +CSERS+LKLEKL
Subjt:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL

Query:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------
        LGEGE  E+KTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK                                                    
Subjt:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------

Query:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
              E DVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAV+LLAQNPSKMKKAQAEIDLVLG G  TFES KAL
Subjt:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

A0A6J1C5G4 cytochrome P450 97B2, chloroplastic isoform X24.1e-17692.98Show/hide
Query:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD
        AVFHGNLQ NEFGFVGM RQP+VCS TTATAK+ KSNLRG VVRCQS RTDE KTKRNLLDNASNLLTN LSGGNLGSMPIAEGAVSDLFGRPLFFALYD
Subjt:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD

Query:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL
        WFLEHGSVYKLAFGPKAFVVVSDPIVARY+LRENAFGYDKGVLADILEPIMGKGLIPADL TWKQRRRVIAPGFHALYLEAMTKVF +CSERS+LKLEKL
Subjt:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL

Query:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIKETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETT
        LGEGE  E+KTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIKE DVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETT
Subjt:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIKETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETT

Query:  AAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
        AAVLTWAV+LLAQNPSKMKKAQAEIDLVLG G  TFES KAL
Subjt:  AAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

A0A6J1FPU1 cytochrome P450 97B2, chloroplastic1.3e-15876Show/hide
Query:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD
        AVF+GNLQ NEFGFVGMPR+P         AKYLKSN R  V+RCQST+ DE KTKRNLLDNASNLLTN L+GGNLGSMPIAEGAVSDLFGRPLFFALYD
Subjt:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD

Query:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL
        WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAF YDKGVLADILEPIMGKGLIPADL TWKQRRRVIAPGFHALYLEAM KVFA+CSERSI KLEKL
Subjt:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL

Query:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------
        LGEGE+ E+KTIELDMEAEFS+LALDIIGLGVFNYDFGSVTKESPVIK                                                    
Subjt:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------

Query:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
              E DVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQ E+DLVLG GRP FE  K L
Subjt:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

A0A6J1IQ92 cytochrome P450 97B2, chloroplastic2.7e-15976.25Show/hide
Query:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD
        AVF+GNLQ NEFGFVGMPR+P         AKYLKSN RG V+RCQST+ DE KTKRNLLDNASNLLTN L+GGNLGSMPIAEGAVSDLFGRPLFFALYD
Subjt:  AVFHGNLQSNEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYD

Query:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL
        WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAF YDKGVLADILEPIMGKGLIPADL TWKQRRRVIAPGFHALYLEAM KVFA+CSERSI KLEKL
Subjt:  WFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKL

Query:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------
        LGEGE+ E+KTIELDMEAEFS+LALDIIGLGVFNYDFGSVTKESPVIK                                                    
Subjt:  LGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK----------------------------------------------------

Query:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
              E DVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQ E+DLVLG GRP FE  K L
Subjt:  ------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

SwissProt top hitse value%identityAlignment
O23365 Cytochrome P450 97B3, chloroplastic7.6e-12767.13Show/hide
Query:  VRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGV
        ++CQST   E KT  N+LDNASNLLTNFLSGG+LGSMP AEG+VSDLFG+PLF +LYDWFLEHG +YKLAFGPKAFVV+SDPI+AR++LRENAF YDKGV
Subjt:  VRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGV

Query:  LADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKLLGEGELQENK-TIELDMEAEFSSLALDIIGLGVFNYDFGSVT
        LA+ILEPIMGKGLIPADL TWK RRR I P FH LYLEAM KVF++CSE+ ILK EKL+ E E    + TIELD+EAEFSSLALDIIGL VFNYDFGSVT
Subjt:  LADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKLLGEGELQENK-TIELDMEAEFSSLALDIIGLGVFNYDFGSVT

Query:  KESPVIK----------------------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDR
        KESPVIK                                                          ETDVEKLQ+RDY NLKDASLLRFLVDMRG D+DDR
Subjt:  KESPVIK----------------------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDR

Query:  QLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
        QLRDDLMTMLIAGHETTAAVLTWAVFLL+QNP K++KAQAEID VLG G PT+ES K L
Subjt:  QLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

O48921 Cytochrome P450 97B2, chloroplastic9.9e-13572.42Show/hide
Query:  VRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGV
        +RCQS  TD+ K+ RNLL NASNLLT+ LSGG++GSMPIAEGAVSDL GRPLFF+LYDWFLEHG+VYKLAFGPKAFVVVSDPIVAR+ILRENAF YDKGV
Subjt:  VRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGV

Query:  LADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKLLGEGELQEN-KTIELDMEAEFSSLALDIIGLGVFNYDFGSVT
        LADILEPIMGKGLIPADL TWKQRRRVIAP FH  YLEAM K+F  CSER+ILK  KLL EGE  +   +IELD+EAEFSSLALDIIGLGVFNYDFGSVT
Subjt:  LADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKLLGEGELQEN-KTIELDMEAEFSSLALDIIGLGVFNYDFGSVT

Query:  KESPVIK----------------------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDR
        KESPVIK                                                          ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDR
Subjt:  KESPVIK----------------------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDR

Query:  QLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
        QLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAE+DLVLG GRPTFES K L
Subjt:  QLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

Q43078 Cytochrome P450 97B1, chloroplastic1.6e-12969.17Show/hide
Query:  VRCQSTRTDELK-TKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKG
        +RCQS   ++ K + RN+ DNASNLLT+ LSG NLGSMPIAEGAV+DLF RPLFF+LYDWFLEHGSVYKLAFGPKAFVVVSDPIVAR+ILRENAF YDKG
Subjt:  VRCQSTRTDELK-TKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKG

Query:  VLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKLL-GEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSV
        VLADILEPIMGKGLIPADL TWKQRRRVIAPGFH  YLEAM ++F +CSER++LK+ +LL GEG     K++ELD+EAEFS+LAL+IIGLGVFNYDFGSV
Subjt:  VLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKLL-GEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSV

Query:  TKESPVIK----------------------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDD
        T ESPVIK                                                          ETDVEKLQQRDY NLKDASLLRFLVDMRG DVDD
Subjt:  TKESPVIK----------------------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDD

Query:  RQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
        RQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNP KMKKAQAE+DLVLGMG+PTFE  K L
Subjt:  RQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

Q6TBX7 Carotene epsilon-monooxygenase, chloroplastic1.2e-5036.58Show/hide
Query:  LTNFLSGG--NLGSMPIAEGA---VSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLG
        LT  LS G  +   +PIA      V+DL G  LF  LY W  E+G +Y+LA GP+ FV+VSDP +A+++LR N   Y KG++A++ E + G G   A+  
Subjt:  LTNFLSGG--NLGSMPIAEGA---VSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLG

Query:  TWKQRRRVIAPGFHALYLEAMT-KVFANCSERSILKLEKLLGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK------------
         W  RRR + P  H  YL  +  +VF  C+ER + KL+    +G         ++MEA+FS + LD+IGL +FNY+F S+T +SPVI+            
Subjt:  TWKQRRRVIAPGFHALYLEAMT-KVFANCSERSILKLEKLLGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK------------

Query:  ---------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAV
                                                     E + E++   +Y+N  D S+LRFL+  R  +V   QLRDDL++ML+AGHETT +V
Subjt:  ---------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAV

Query:  LTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
        LTW ++LL++N S ++KAQ E+D VL    P FE  K L
Subjt:  LTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

Q93VK5 Protein LUTEIN DEFICIENT 5, chloroplastic8.3e-5739.69Show/hide
Query:  MPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALY
        +P A+G++  +     F  LY+ FL +G +++L FGPK+F++VSDP +A++IL++NA  Y KG+LA+IL+ +MGKGLIPAD   W++RRR I P  H  Y
Subjt:  MPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALY

Query:  LEAMTKVFANCSERSILKLEKLLGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVI-------------------------------
        + AM  +F   S+R   KL+    +GE       E++ME+ FS L LDIIG  VFNYDF S+T ++ VI                               
Subjt:  LEAMTKVFANCSERSILKLEKLLGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVI-------------------------------

Query:  ------------------------KETDVEKLQ-QRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQ
                                +  + E+LQ   +Y+N +D S+L FL+   G DV  +QLRDDLMTMLIAGHET+AAVLTW  +LL   PS + K Q
Subjt:  ------------------------KETDVEKLQ-QRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQ

Query:  AEIDLVLGMGRPTFESFKAL
         E+D V+G   PT +  K L
Subjt:  AEIDLVLGMGRPTFESFKAL

Arabidopsis top hitse value%identityAlignment
AT1G31800.1 cytochrome P450, family 97, subfamily A, polypeptide 35.9e-5839.69Show/hide
Query:  MPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALY
        +P A+G++  +     F  LY+ FL +G +++L FGPK+F++VSDP +A++IL++NA  Y KG+LA+IL+ +MGKGLIPAD   W++RRR I P  H  Y
Subjt:  MPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALY

Query:  LEAMTKVFANCSERSILKLEKLLGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVI-------------------------------
        + AM  +F   S+R   KL+    +GE       E++ME+ FS L LDIIG  VFNYDF S+T ++ VI                               
Subjt:  LEAMTKVFANCSERSILKLEKLLGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVI-------------------------------

Query:  ------------------------KETDVEKLQ-QRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQ
                                +  + E+LQ   +Y+N +D S+L FL+   G DV  +QLRDDLMTMLIAGHET+AAVLTW  +LL   PS + K Q
Subjt:  ------------------------KETDVEKLQ-QRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQ

Query:  AEIDLVLGMGRPTFESFKAL
         E+D V+G   PT +  K L
Subjt:  AEIDLVLGMGRPTFESFKAL

AT1G69500.1 cytochrome P450, family 704, subfamily B, polypeptide 17.0e-1122.04Show/hide
Query:  PIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVV---VSDPIVARYILRENAFGYDKG-VLADILEPIMGKGLIPADLGTWKQRRRVIAPGFH
        P+   A+  L     F  ++DW +E+    +    P  F     ++DPI   Y+L+ N   Y KG      +E ++G G+  +D   W+++R+  +  F 
Subjt:  PIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVV---VSDPIVARYILRENAFGYDKG-VLADILEPIMGKGLIPADLGTWKQRRRVIAPGFH

Query:  ALYL-EAMTKVFANCSERSILKLEKLLGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESP---VIKETDVEKL-----------QQRDY
        +  L +  T VF   S    LKL  +L +   +E    ++DM+     + LD I    F  + G++  E P     K  D   +           + + +
Subjt:  ALYL-EAMTKVFANCSERSILKLEKLLGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESP---VIKETDVEKL-----------QQRDY

Query:  LNLKDASLL----------------------------------------------RF--LVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLA
        LN+   +LL                                              RF  + D   +   ++ LRD ++  +IAG +TTA  LTWA++++ 
Subjt:  LNLKDASLL----------------------------------------------RF--LVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLA

Query:  QNPSKMKKAQAEI
         N +  +K  +E+
Subjt:  QNPSKMKKAQAEI

AT3G53130.1 Cytochrome P450 superfamily protein8.2e-5236.58Show/hide
Query:  LTNFLSGG--NLGSMPIAEGA---VSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLG
        LT  LS G  +   +PIA      V+DL G  LF  LY W  E+G +Y+LA GP+ FV+VSDP +A+++LR N   Y KG++A++ E + G G   A+  
Subjt:  LTNFLSGG--NLGSMPIAEGA---VSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLG

Query:  TWKQRRRVIAPGFHALYLEAMT-KVFANCSERSILKLEKLLGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK------------
         W  RRR + P  H  YL  +  +VF  C+ER + KL+    +G         ++MEA+FS + LD+IGL +FNY+F S+T +SPVI+            
Subjt:  TWKQRRRVIAPGFHALYLEAMT-KVFANCSERSILKLEKLLGEGELQENKTIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIK------------

Query:  ---------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAV
                                                     E + E++   +Y+N  D S+LRFL+  R  +V   QLRDDL++ML+AGHETT +V
Subjt:  ---------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAV

Query:  LTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
        LTW ++LL++N S ++KAQ E+D VL    P FE  K L
Subjt:  LTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

AT4G15110.1 cytochrome P450, family 97, subfamily B, polypeptide 35.4e-12867.13Show/hide
Query:  VRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGV
        ++CQST   E KT  N+LDNASNLLTNFLSGG+LGSMP AEG+VSDLFG+PLF +LYDWFLEHG +YKLAFGPKAFVV+SDPI+AR++LRENAF YDKGV
Subjt:  VRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFVVVSDPIVARYILRENAFGYDKGV

Query:  LADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKLLGEGELQENK-TIELDMEAEFSSLALDIIGLGVFNYDFGSVT
        LA+ILEPIMGKGLIPADL TWK RRR I P FH LYLEAM KVF++CSE+ ILK EKL+ E E    + TIELD+EAEFSSLALDIIGL VFNYDFGSVT
Subjt:  LADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKLLGEGELQENK-TIELDMEAEFSSLALDIIGLGVFNYDFGSVT

Query:  KESPVIK----------------------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDR
        KESPVIK                                                          ETDVEKLQ+RDY NLKDASLLRFLVDMRG D+DDR
Subjt:  KESPVIK----------------------------------------------------------ETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDR

Query:  QLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
        QLRDDLMTMLIAGHETTAAVLTWAVFLL+QNP K++KAQAEID VLG G PT+ES K L
Subjt:  QLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL

AT5G52400.1 cytochrome P450, family 715, subfamily A, polypeptide 13.7e-1223.81Show/hide
Query:  WFLEHGSVYKLAFGPKAFVVVSDP----IVARYIL-----RENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSE
        W  E+G V+    G + FV V+DP    ++++ +L     + N F  D+       EP+ G GL+  +   W + R +I P F  L L+ MT +      
Subjt:  WFLEHGSVYKLAFGPKAFVVVSDP----IVARYIL-----RENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSE

Query:  RSILKLEKLLGEGELQENK-TIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIKETDVEKLQQRDYLNLKDASLLRF--LVDMRG-----------
          +  +  +L    +Q N    E DME+E    A +II    F     + T+    ++           Y+ +  +++L +   V  +G           
Subjt:  RSILKLEKLLGEGELQENK-TIELDMEAEFSSLALDIIGLGVFNYDFGSVTKESPVIKETDVEKLQQRDYLNLKDASLLRF--LVDMRG-----------

Query:  ---------ADVDD--------------------RQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL
                 A+ DD                    ++L D+  T   AGHETTA  LTW   LLA +P      + EI  V+G  +  +     L
Subjt:  ---------ADVDD--------------------RQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESFKAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GTCAGTCATCGGCGCCGCCTAACTTCCTTCCCTCTGGTCAGAAGAGTAAGCCGCCATTGCCGCCATTGCTCTCGTTGGAGGAATCTGCAGAAGCTCCGCCGGAAGGAATT
TGTGAAGAAACTGCACCACCATCCATCAGAAATCCGTCTCCGGAGACTCCACCATTAGCCGGCCTCGTCCCGGATAGCCTCTGCAGAAAGCCGTCGCCGTGGGAGGCGGA
CGGAAACGCCAACGGCTGGGGGCGCGCCGGGGTGCGAAATCGACGAGGCGAAAGGCGGAGTCGGACCGCCGGTGAATTGCTGAACCATCGCTCTGAAATTCGTAGTGTCC
GTGTTGAGCAACGTCGTCGGAGTCCGCCGAGACGCCCTGGAGCGGCGGCGAACCGGCTTTCCGACGCGACCTTCCGGGTTCAAGCCAGTGGAAACGAAACTCGATTGACG
AACATCATTGCTGAGGTGGTTGATCAGAAGAAGGTGGCACCGTCGCGGCTGCCACCGTGTTTGAAAGGTTTTGATGGTAAAATTGAAGCCAGTCGCCAGGGCCAGACATG
GTTCAATGGCACCACCAAAGAGGCAGCAATTCAGGCCTTTGTGGGCTTTCGTACCTATAGATTAGAGAAAACAGACAAAGAAAAAGCTGTCTTCCATGGAAATCTGCAGA
GCAATGAATTTGGGTTTGTGGGTATGCCGAGACAACCGGCGGTTTGCTCTACCACCACCGCCACTGCCAAGTATCTCAAATCCAATCTCAGAGGTCCGGTTGTTAGATGC
CAATCAACTCGAACCGATGAACTTAAAACGAAAAGAAATCTACTTGACAATGCAAGCAATCTCCTTACCAATTTCTTAAGTGGTGGAAATCTGGGATCTATGCCGATCGC
AGAAGGTGCAGTCTCTGATTTGTTTGGTCGCCCACTCTTCTTTGCACTATATGATTGGTTCTTAGAGCATGGATCTGTTTATAAACTTGCCTTTGGACCAAAAGCCTTTG
TTGTTGTATCAGATCCCATTGTGGCAAGATATATTCTTCGAGAAAATGCATTTGGTTATGACAAGGGAGTGCTTGCTGATATTTTAGAACCGATAATGGGTAAAGGACTA
ATACCAGCTGACCTTGGCACTTGGAAGCAGAGGAGACGAGTTATTGCTCCAGGATTCCATGCCTTGTACTTGGAAGCTATGACCAAAGTATTTGCCAATTGTTCAGAACG
ATCAATATTGAAATTGGAGAAGCTTCTAGGAGAAGGTGAACTACAGGAGAATAAAACCATTGAGTTGGATATGGAAGCAGAGTTTTCAAGTTTGGCTCTTGATATCATTG
GACTCGGTGTTTTCAACTATGATTTTGGTTCTGTAACCAAAGAATCTCCGGTGATTAAGGAAACGGATGTTGAGAAATTGCAGCAAAGGGACTACTTAAATCTCAAGGAT
GCCAGTCTTTTGCGTTTCTTAGTTGATATGCGGGGAGCTGATGTTGATGATCGCCAGCTTAGGGACGATCTGATGACGATGCTTATTGCTGGCCATGAAACAACTGCTGC
TGTGCTTACATGGGCTGTTTTTTTGCTTGCACAAAATCCTTCAAAAATGAAAAAAGCGCAAGCAGAGATTGATTTGGTTCTTGGCATGGGGAGGCCAACTTTTGAATCAT
TTAAAGCATTGAA
mRNA sequenceShow/hide mRNA sequence
GTCAGTCATCGGCGCCGCCTAACTTCCTTCCCTCTGGTCAGAAGAGTAAGCCGCCATTGCCGCCATTGCTCTCGTTGGAGGAATCTGCAGAAGCTCCGCCGGAAGGAATT
TGTGAAGAAACTGCACCACCATCCATCAGAAATCCGTCTCCGGAGACTCCACCATTAGCCGGCCTCGTCCCGGATAGCCTCTGCAGAAAGCCGTCGCCGTGGGAGGCGGA
CGGAAACGCCAACGGCTGGGGGCGCGCCGGGGTGCGAAATCGACGAGGCGAAAGGCGGAGTCGGACCGCCGGTGAATTGCTGAACCATCGCTCTGAAATTCGTAGTGTCC
GTGTTGAGCAACGTCGTCGGAGTCCGCCGAGACGCCCTGGAGCGGCGGCGAACCGGCTTTCCGACGCGACCTTCCGGGTTCAAGCCAGTGGAAACGAAACTCGATTGACG
AACATCATTGCTGAGGTGGTTGATCAGAAGAAGGTGGCACCGTCGCGGCTGCCACCGTGTTTGAAAGGTTTTGATGGTAAAATTGAAGCCAGTCGCCAGGGCCAGACATG
GTTCAATGGCACCACCAAAGAGGCAGCAATTCAGGCCTTTGTGGGCTTTCGTACCTATAGATTAGAGAAAACAGACAAAGAAAAAGCTGTCTTCCATGGAAATCTGCAGA
GCAATGAATTTGGGTTTGTGGGTATGCCGAGACAACCGGCGGTTTGCTCTACCACCACCGCCACTGCCAAGTATCTCAAATCCAATCTCAGAGGTCCGGTTGTTAGATGC
CAATCAACTCGAACCGATGAACTTAAAACGAAAAGAAATCTACTTGACAATGCAAGCAATCTCCTTACCAATTTCTTAAGTGGTGGAAATCTGGGATCTATGCCGATCGC
AGAAGGTGCAGTCTCTGATTTGTTTGGTCGCCCACTCTTCTTTGCACTATATGATTGGTTCTTAGAGCATGGATCTGTTTATAAACTTGCCTTTGGACCAAAAGCCTTTG
TTGTTGTATCAGATCCCATTGTGGCAAGATATATTCTTCGAGAAAATGCATTTGGTTATGACAAGGGAGTGCTTGCTGATATTTTAGAACCGATAATGGGTAAAGGACTA
ATACCAGCTGACCTTGGCACTTGGAAGCAGAGGAGACGAGTTATTGCTCCAGGATTCCATGCCTTGTACTTGGAAGCTATGACCAAAGTATTTGCCAATTGTTCAGAACG
ATCAATATTGAAATTGGAGAAGCTTCTAGGAGAAGGTGAACTACAGGAGAATAAAACCATTGAGTTGGATATGGAAGCAGAGTTTTCAAGTTTGGCTCTTGATATCATTG
GACTCGGTGTTTTCAACTATGATTTTGGTTCTGTAACCAAAGAATCTCCGGTGATTAAGGAAACGGATGTTGAGAAATTGCAGCAAAGGGACTACTTAAATCTCAAGGAT
GCCAGTCTTTTGCGTTTCTTAGTTGATATGCGGGGAGCTGATGTTGATGATCGCCAGCTTAGGGACGATCTGATGACGATGCTTATTGCTGGCCATGAAACAACTGCTGC
TGTGCTTACATGGGCTGTTTTTTTGCTTGCACAAAATCCTTCAAAAATGAAAAAAGCGCAAGCAGAGATTGATTTGGTTCTTGGCATGGGGAGGCCAACTTTTGAATCAT
TTAAAGCATTGAA
Protein sequenceShow/hide protein sequence
QSSAPPNFLPSGQKSKPPLPPLLSLEESAEAPPEGICEETAPPSIRNPSPETPPLAGLVPDSLCRKPSPWEADGNANGWGRAGVRNRRGERRSRTAGELLNHRSEIRSVR
VEQRRRSPPRRPGAAANRLSDATFRVQASGNETRLTNIIAEVVDQKKVAPSRLPPCLKGFDGKIEASRQGQTWFNGTTKEAAIQAFVGFRTYRLEKTDKEKAVFHGNLQS
NEFGFVGMPRQPAVCSTTTATAKYLKSNLRGPVVRCQSTRTDELKTKRNLLDNASNLLTNFLSGGNLGSMPIAEGAVSDLFGRPLFFALYDWFLEHGSVYKLAFGPKAFV
VVSDPIVARYILRENAFGYDKGVLADILEPIMGKGLIPADLGTWKQRRRVIAPGFHALYLEAMTKVFANCSERSILKLEKLLGEGELQENKTIELDMEAEFSSLALDIIG
LGVFNYDFGSVTKESPVIKETDVEKLQQRDYLNLKDASLLRFLVDMRGADVDDRQLRDDLMTMLIAGHETTAAVLTWAVFLLAQNPSKMKKAQAEIDLVLGMGRPTFESF
KALX