; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027661 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027661
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCytochrome P450
Genome locationtig00153055:1555773..1566891
RNA-Seq ExpressionSgr027661
SyntenySgr027661
Gene Ontology termsGO:0004497 - monooxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR001128 - Cytochrome P450
IPR002401 - Cytochrome P450, E-class, group I
IPR017972 - Cytochrome P450, conserved site
IPR036396 - Cytochrome P450 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8494186.1 hypothetical protein CXB51_011432 [Gossypium anomalum]1.7e-23143.53Show/hide
Query:  AVAGIFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFAS
        AVA   + +F+   L R+ + AG++  KR+ PE  GAWP++GHLHLL GS+   + TL  MAD +GP FT+RLG  RA +VS WE+ +ECFT ND+  AS
Subjt:  AVAGIFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFAS

Query:  RPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKR
        RP  VA+KH+GY+YA+ G  PY P+WR +RKI+TLELL+N RLE L+H+RISEV+  I++LY L     +NG   ++VE+K WF D+TLN + R+V GKR
Subjt:  RPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKR

Query:  F---STAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGEL
        +   S   D G     +KA+  FF L G+ V +DA PFL WLD+ GHEKAMKKTAK LD + + WLKEHRQR  SG        DF+DVMLS +++ G L
Subjt:  F---STAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGEL

Query:  SG--YDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTI
        S   YDADT  K+TCL+ IL GSDTT  T+ W +SLLLNN+E L++AQ ELD  VG++R V ESD+KNL YLQAI+KETLRLYPAGPL  P E++ DCT+
Subjt:  SG--YDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTI

Query:  ASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESA-
        A YHIP GTRL+VN+ K+QRDP VW +P+ + PERFLT                           SFALQ++HLTLA  LH F++S P  + +DM ES  
Subjt:  ASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESA-

Query:  ----------------------DQAHGL--FTTSGHLHLVRR---------------------AGIFALLFFLYALFTISRRFGAHRKRLPPEAAGAWPL
                              D+   +  F T   + L+                       AG+FA+L   + + + SR+  + + R PPEAAG WPL
Subjt:  ----------------------DQAHGL--FTTSGHLHLVRR---------------------AGIFALLFFLYALFTISRRFGAHRKRLPPEAAGAWPL

Query:  IGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEIARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR-IRNNGGDE-----
        IGHLH+L G++  H+ L ++AD YGPIF++R+G++ A++V+S E+A++ FT  D V +SRPKL   KLLG++YA FG  PYG +WR +R     E     
Subjt:  IGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEIARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR-IRNNGGDE-----

Query:  --------------------------------KVLVEMKTWFGDVTLNTIFRMVVGKRF---STAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWL
                                         VLVEMK   G++ LN I RM+ GKRF     A D      YRKA+R+FF   G+F+  D+ PFL WL
Subjt:  --------------------------------KVLVEMKTWFGDVTLNTIFRMVVGKRF---STAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWL

Query:  DLGGHEKAMKKAAKVADETEE--------------------QDFMDVMLSAIEGSEELSSYDADTVIKA-------------------------TW----
        DLGGHEKAMKK AK  D   E                    QDF+DV+LS ++G  +LS YD DTVIKA                         TW    
Subjt:  DLGGHEKAMKKAAKVADETEE--------------------QDFMDVMLSAIEGSEELSSYDADTVIKA-------------------------TW----

Query:  --------------------------------------------------PLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERF
                                                          PLA   E  EDCTV  Y +P  TRLIVN+ KLQRDP VW DP EF+PERF
Subjt:  --------------------------------------------------PLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERF

Query:  LRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVGISCMKRAHLKL
        L S ++ +V+GQ  +LIPFG+GRR+CPGISF LQ+  + LA LLHGF I  P+   +DM  S G++ +K   L++
Subjt:  LRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVGISCMKRAHLKL

KJB63763.1 hypothetical protein B456_010G015000, partial [Gossypium raimondii]3.2e-24147.24Show/hide
Query:  AVAGIFAF-LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFAS
        + A I AF L FL++    S R    + K+  PEAGGAWP+IGHL LL GS+P HI+LA MAD YG IF+++LG+HRAL+VS WEIA+EC T ND+ FAS
Subjt:  AVAGIFAF-LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFAS

Query:  RPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKR
        RP L  S+ +GY+ AMIG  PYGPYWR VRKI T+ELL+NHRLE L+ +  SE+++S+++LY+LW  K++   +KV+VEMK WF ++TLN I R++VGKR
Subjt:  RPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKR

Query:  FSTAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELSGY
           + +GG N  ++K L DFFEL G  + +DA PFL WL++GG  K MKKTAK LD V   WL+EH+Q+    +    +  DFM VMLS + D  EL   
Subjt:  FSTAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELSGY

Query:  DADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQR-QVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYH
          DT  KA  L+ I+A  DTT+VT+ W LSLLLNN++ L + Q ELD +VG+ R  V ESD K+L YLQ+I+KETLRLYPA PL++ HE+IEDCT+  YH
Subjt:  DADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQR-QVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYH

Query:  IPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHG
        + AGT L++N+ K+ RDPLVW +P++++PERF+T                          +SFALQ + L LAN+LH F+      + +DM E    A G
Subjt:  IPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHG

Query:  LFTTSG---HLHLVRRAGIFAL----LFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSW
        L ++      +H+       A+    L FL++   ISRR   + K+  PEA GAWP+IGHL LL G++P HI LA MAD YG IF+++LG++RAL+VS W
Subjt:  LFTTSG---HLHLVRRAGIFAL----LFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSW

Query:  EIARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR-IR----------------------------NNGGDEKVLVEMKTWFGDVTLNTIFR
        EIA++C T ND+ FASRPKL +S++LG + AM G +PYGP+WR IR                             +   +KVLVEM+ WF +VTLN I  
Subjt:  EIARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR-IR----------------------------NNGGDEKVLVEMKTWFGDVTLNTIFR

Query:  MVVGKRFSTAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADE------------------TEEQDFMDVMLSAIEGSEE
        M+VGKR   + +GG N  +RK++ DFF L G FL SD+ PFL +LD+GG  K MKK AK  D+                    E+DFM VMLS +  +EE
Subjt:  MVVGKRFSTAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADE------------------TEEQDFMDVMLSAIEGSEE

Query:  --------LSSYDADT--------VIKAT------WPLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTP
                ++   +DT        +IK T       PL++ HE+ EDCTV  YH+ A T LI+NLHK+  DP++W +P EF+PERF+ + K+ +VRGQ  
Subjt:  --------LSSYDADT--------VIKAT------WPLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTP

Query:  QLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVGISCMKRAHLKL
        +L+PFGSGRRMCPG+SFALQ++ LTLAN+LH FE + P+   +DM E +GI   K   L++
Subjt:  QLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVGISCMKRAHLKL

OVA15437.1 Cytochrome P450 [Macleaya cordata]1.3e-23443.87Show/hide
Query:  VAGIFAF-LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASR
        + G+FA  L+F+    R S++       + PPE  GAWP+IGHL LL G +  HITL K+AD YGP FT+R+G+H+AL+++ WE+A+ECFTTND+ F+SR
Subjt:  VAGIFAF-LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASR

Query:  PKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRF
        P+ VA KH+GY++AM G  PYG YWR +RKI   E+L++ R+E L H+  +EV +S+K+LY+    K+NNGG  V+VEMK WF D+TLN    +V GKRF
Subjt:  PKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRF

Query:  STAVDGGGNEDYR--KALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQR----INSGQPAETNGHDFMDVMLSAVKDGG
        + + +   +E  R  +ALR+FF L G+ VP+DA PFL+WLD+GG+EK MKK A+ LD +   WL+EH+++       G+  +    DFMDVM++ ++D  
Subjt:  STAVDGGGNEDYR--KALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQR----INSGQPAETNGHDFMDVMLSAVKDGG

Query:  ELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTI
         LS +DADTV KATCL+ IL G+DT  V ++W L+LL+NNQ+ALK+A  ELD  VGR RQV+ESDVKNL YLQAI+KETLRLY +GPL+   ES EDCT+
Subjt:  ELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTI

Query:  ASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESAD
        A YH+PAGTRL++N  K+ RDP VW DP  ++P+RFLT                          +SFALQ++ L LA L+HGFD   P+   +DM ES  
Subjt:  ASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESAD

Query:  QAHGLFTTSGHLHLVRRA-GIFALLFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEI
          +   T    L   R +  ++  L+FL   ++ ++     + + PPE AGAWP++GHL LL G++  HI L K+AD YGP+FT+R+G++++L+VSSWE+
Subjt:  QAHGLFTTSGHLHLVRRA-GIFALLFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEI

Query:  ARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR----------------------------------------IRNNGGDEKVLVEMKTWFG
         ++CFTTND+V++SRP  VA K +G   AMFG SPYG +WR                                          NNG    VLVEMK WF 
Subjt:  ARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR----------------------------------------IRNNGGDEKVLVEMKTWFG

Query:  DVTLNTIFRMVVGKR--FSTAFDGGGNEP---YRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADET----------------------
        D+TLN    M  GKR  F  A    GN+     + ALR+FF L G F+PSD+ PFL WLD+GG+EK MKK AK  D+                       
Subjt:  DVTLNTIFRMVVGKR--FSTAFDGGGNEP---YRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADET----------------------

Query:  -EEQDFMDVMLSAIEGSEELSSYDADTVIKAT-----------------W--------------------------------------------------
         +EQDFMDVM++ ++  +    YDADT+ KAT                 W                                                  
Subjt:  -EEQDFMDVMLSAIEGSEELSSYDADTVIKAT-----------------W--------------------------------------------------

Query:  ----PLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLR-SQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLH
            PL  P ES EDCT+A YH+PA TRL+VN+ K+QRDP VW DP +F+P+RFL  S  N +VRGQ  +LIPFG GRR CPG + ALQ++ LTLA L+H
Subjt:  ----PLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLR-SQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLH

Query:  GFEIDKPTEEQLDMEESVGISCMKRAHL
        GFE   P+   +DM  S GI+  K   L
Subjt:  GFEIDKPTEEQLDMEESVGISCMKRAHL

XP_012445065.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105769156 [Gossypium raimondii]1.6e-23244.86Show/hide
Query:  AVAGIFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASR
        AVA   + +F+   L R+ + AG++  KR+ PE  GAWP++GHLHLL G +  + TL  MAD +GP FT+RLG  RA +VS WE+ +ECFT ND+  ASR
Subjt:  AVAGIFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASR

Query:  PKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRF
        P  VA+KH+GY+YA+ G  PY P+WR +RKI+TLELL+N RLE L+H+RISEV+  I++LY L     +NG   ++VE+K WF D+TLN + R+V GKR+
Subjt:  PKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRF

Query:  ---STAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELS
           +   D G     +KA+  FF L G+ V +DA PFL WLD+ GHEKAMKKTAK LD + + WLKEHRQR  SG        DF+DVML  +++ G LS
Subjt:  ---STAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELS

Query:  G--YDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIA
           YDADT  K+TCL+ IL GSDTT  T+ W +SLLLNN+E L++AQ ELD  VG++R V ESD+KNL YLQAI+KETLRLYPAGPL  P E++ DCT+A
Subjt:  G--YDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIA

Query:  SYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQ
         YH+PAGTRL+VN+ K+QRDP VW +P+ + PERFL                            SFALQ++HLTLA  LH F++S P  + +DM ES   
Subjt:  SYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQ

Query:  AHGLFTTSGHLHLVR-RAGIFALLFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEIA
             T    L   R  A ++A   + +      R  G  RK  PPEAAG WPLIGHLH+L G++  ++ L ++AD YGPIF++R+G++ A++V+S E+A
Subjt:  AHGLFTTSGHLHLVR-RAGIFALLFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEIA

Query:  RDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR--------------------------------------IRNNGGDEKVLVEMKTWFGDVT
        ++ FT  D V +SRPKL   KLLG++YA FG  PYG +WR                                       +       VLVEMK W G++ 
Subjt:  RDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR--------------------------------------IRNNGGDEKVLVEMKTWFGDVT

Query:  LNTIFRMVVGKRF---STAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADETEE--------------------QDFMD
        LN I RM+ GKRF     A D       RKA+R+FF   G+F+  D+ PFL WLDLGGHEKAMKK AK  D   E                    QDF+D
Subjt:  LNTIFRMVVGKRF---STAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADETEE--------------------QDFMD

Query:  VMLSAIEGSEELSSYDADTVIKA-----------------TW------------------------------------------------------PLAI
        V+LS ++G  +LS YD DTV KA                 TW                                                      PLA 
Subjt:  VMLSAIEGSEELSSYDADTVIKA-----------------TW------------------------------------------------------PLAI

Query:  PHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTE
          E  EDCTV  Y +P  TRLIVN+ KLQRDP VW DP EF+PERFL S ++ +V+GQ  +LIPFG+GRR+CPGISF LQ+  L LA  LHGF I  P+ 
Subjt:  PHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTE

Query:  EQLDMEESVGISCMKRAHLKL
        E +DM  S G++ +K   L++
Subjt:  EQLDMEESVGISCMKRAHLKL

XP_022975796.1 cytochrome P450 82A3-like [Cucurbita maxima]1.6e-30156.08Show/hide
Query:  MELPHVSSSGAVAGIFAFLFFLYALFRKSARAGAAHR--KRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARE
        ME+ H+     +  IFA L  LYALFR ++   AA+R  K LPPEA G  PLIGHLH LS +EP HITLAKMAD YGPIFTLRLGM+ AL+VS WEIARE
Subjt:  MELPHVSSSGAVAGIFAFLFFLYALFRKSARAGAAHR--KRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARE

Query:  CFTTNDRIFASRPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNG-GEKVMVEMKTWFGDIT
        CFTTND IFASRPK +A + LGY+ AM+ + PYGPYWRHVRK++TLELLTNHRLEQLQHIRISEVQSSIKKLYEL  +  + G G KV VEM+ WFGD++
Subjt:  CFTTNDRIFASRPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNG-GEKVMVEMKTWFGDIT

Query:  LNTIFRIVVGKRFSTAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQ-RINSGQ-PAETNGHDFMDV
        LNTIFR+VVGKRFS   +G G E YR+ALRD FELFG  VP+D+FP LSWLD+GG+++AM KTAKVLD +  +W++EHR+ + N G+   +T   DFMDV
Subjt:  LNTIFRIVVGKRFSTAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQ-RINSGQ-PAETNGHDFMDV

Query:  MLSAVKDGGELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIP
        MLSAV D G L G+DADTVTKATC + IL  SDTT +TM W LSLLLN+ EAL++ QLELDE+VGR RQVKESDVKNL YLQAI+KETLRLYPA PL IP
Subjt:  MLSAVKDGGELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIP

Query:  HESIEDCTIASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTRISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHGLFTTSGHLHLVR
        HES+EDCT+  YHIPA TRL+VN+QKL +DPL+WE+P E+RPERFLT  +                FD+   + +L+          G+      L L+R
Subjt:  HESIEDCTIASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTRISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHGLFTTSGHLHLVR

Query:  --RAGIFALLFFLYALFTI-SRRFGAHR--KRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEIARDCFTTNDRVF
             IFALL  LYALF I S R  A+R  K LPPEAAG  PLIGHLH L   EP HI LAKMAD YGPIFTLRLGMN AL+VSSWEIAR+CFTTND +F
Subjt:  --RAGIFALLFFLYALFTI-SRRFGAHR--KRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEIARDCFTTNDRVF

Query:  ASRPKLVASKLLGYDYAMFGASPYGPHWR---------------------IR-------------------NNGGDEKVLVEMKTWFGDVTLNTIFRMVV
        ASRPK +A +LLGY+ AM   SPYGP+WR                     IR                   + G   KV VEM+ WFGDV+LNTIFRMVV
Subjt:  ASRPKLVASKLLGYDYAMFGASPYGPHWR---------------------IR-------------------NNGGDEKVLVEMKTWFGDVTLNTIFRMVV

Query:  GKRFSTAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADE----------------------TEEQDFMDVMLSAIEGSE
        GKRFS  F+G G E YR+ALRD FELFG F+PSDSFP LSWLD+GG+++AM K AKV DE                      T+EQDFMDVMLSA+    
Subjt:  GKRFSTAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADE----------------------TEEQDFMDVMLSAIEGSE

Query:  ELSSYDADTVIKA-----------------TW------------------------------------------------------PLAIPHESFEDCTV
         L  +DADTV KA                 TW                                                      PL IPHES EDCTV
Subjt:  ELSSYDADTVIKA-----------------TW------------------------------------------------------PLAIPHESFEDCTV

Query:  ADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVG
          YHIPA+TRLIVN+ KL +DP++WE+PCEFRPERFL S   F++RGQ  QL+PFGSGRRMCPGI FAL+ MH  LANLL  FEI++P+E+ LDMEE  G
Subjt:  ADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVG

Query:  ISCMKRAHLKL
         +  ++  L++
Subjt:  ISCMKRAHLKL

TrEMBL top hitse value%identityAlignment
A0A0D2SKX9 Uncharacterized protein (Fragment)1.5e-24147.24Show/hide
Query:  AVAGIFAF-LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFAS
        + A I AF L FL++    S R    + K+  PEAGGAWP+IGHL LL GS+P HI+LA MAD YG IF+++LG+HRAL+VS WEIA+EC T ND+ FAS
Subjt:  AVAGIFAF-LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFAS

Query:  RPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKR
        RP L  S+ +GY+ AMIG  PYGPYWR VRKI T+ELL+NHRLE L+ +  SE+++S+++LY+LW  K++   +KV+VEMK WF ++TLN I R++VGKR
Subjt:  RPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKR

Query:  FSTAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELSGY
           + +GG N  ++K L DFFEL G  + +DA PFL WL++GG  K MKKTAK LD V   WL+EH+Q+    +    +  DFM VMLS + D  EL   
Subjt:  FSTAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELSGY

Query:  DADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQR-QVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYH
          DT  KA  L+ I+A  DTT+VT+ W LSLLLNN++ L + Q ELD +VG+ R  V ESD K+L YLQ+I+KETLRLYPA PL++ HE+IEDCT+  YH
Subjt:  DADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQR-QVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYH

Query:  IPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHG
        + AGT L++N+ K+ RDPLVW +P++++PERF+T                          +SFALQ + L LAN+LH F+      + +DM E    A G
Subjt:  IPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHG

Query:  LFTTSG---HLHLVRRAGIFAL----LFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSW
        L ++      +H+       A+    L FL++   ISRR   + K+  PEA GAWP+IGHL LL G++P HI LA MAD YG IF+++LG++RAL+VS W
Subjt:  LFTTSG---HLHLVRRAGIFAL----LFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSW

Query:  EIARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR-IR----------------------------NNGGDEKVLVEMKTWFGDVTLNTIFR
        EIA++C T ND+ FASRPKL +S++LG + AM G +PYGP+WR IR                             +   +KVLVEM+ WF +VTLN I  
Subjt:  EIARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR-IR----------------------------NNGGDEKVLVEMKTWFGDVTLNTIFR

Query:  MVVGKRFSTAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADE------------------TEEQDFMDVMLSAIEGSEE
        M+VGKR   + +GG N  +RK++ DFF L G FL SD+ PFL +LD+GG  K MKK AK  D+                    E+DFM VMLS +  +EE
Subjt:  MVVGKRFSTAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADE------------------TEEQDFMDVMLSAIEGSEE

Query:  --------LSSYDADT--------VIKAT------WPLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTP
                ++   +DT        +IK T       PL++ HE+ EDCTV  YH+ A T LI+NLHK+  DP++W +P EF+PERF+ + K+ +VRGQ  
Subjt:  --------LSSYDADT--------VIKAT------WPLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTP

Query:  QLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVGISCMKRAHLKL
        +L+PFGSGRRMCPG+SFALQ++ LTLAN+LH FE + P+   +DM E +GI   K   L++
Subjt:  QLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVGISCMKRAHLKL

A0A200QY67 Cytochrome P4506.2e-23543.87Show/hide
Query:  VAGIFAF-LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASR
        + G+FA  L+F+    R S++       + PPE  GAWP+IGHL LL G +  HITL K+AD YGP FT+R+G+H+AL+++ WE+A+ECFTTND+ F+SR
Subjt:  VAGIFAF-LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASR

Query:  PKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRF
        P+ VA KH+GY++AM G  PYG YWR +RKI   E+L++ R+E L H+  +EV +S+K+LY+    K+NNGG  V+VEMK WF D+TLN    +V GKRF
Subjt:  PKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRF

Query:  STAVDGGGNEDYR--KALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQR----INSGQPAETNGHDFMDVMLSAVKDGG
        + + +   +E  R  +ALR+FF L G+ VP+DA PFL+WLD+GG+EK MKK A+ LD +   WL+EH+++       G+  +    DFMDVM++ ++D  
Subjt:  STAVDGGGNEDYR--KALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQR----INSGQPAETNGHDFMDVMLSAVKDGG

Query:  ELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTI
         LS +DADTV KATCL+ IL G+DT  V ++W L+LL+NNQ+ALK+A  ELD  VGR RQV+ESDVKNL YLQAI+KETLRLY +GPL+   ES EDCT+
Subjt:  ELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTI

Query:  ASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESAD
        A YH+PAGTRL++N  K+ RDP VW DP  ++P+RFLT                          +SFALQ++ L LA L+HGFD   P+   +DM ES  
Subjt:  ASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESAD

Query:  QAHGLFTTSGHLHLVRRA-GIFALLFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEI
          +   T    L   R +  ++  L+FL   ++ ++     + + PPE AGAWP++GHL LL G++  HI L K+AD YGP+FT+R+G++++L+VSSWE+
Subjt:  QAHGLFTTSGHLHLVRRA-GIFALLFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEI

Query:  ARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR----------------------------------------IRNNGGDEKVLVEMKTWFG
         ++CFTTND+V++SRP  VA K +G   AMFG SPYG +WR                                          NNG    VLVEMK WF 
Subjt:  ARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR----------------------------------------IRNNGGDEKVLVEMKTWFG

Query:  DVTLNTIFRMVVGKR--FSTAFDGGGNEP---YRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADET----------------------
        D+TLN    M  GKR  F  A    GN+     + ALR+FF L G F+PSD+ PFL WLD+GG+EK MKK AK  D+                       
Subjt:  DVTLNTIFRMVVGKR--FSTAFDGGGNEP---YRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADET----------------------

Query:  -EEQDFMDVMLSAIEGSEELSSYDADTVIKAT-----------------W--------------------------------------------------
         +EQDFMDVM++ ++  +    YDADT+ KAT                 W                                                  
Subjt:  -EEQDFMDVMLSAIEGSEELSSYDADTVIKAT-----------------W--------------------------------------------------

Query:  ----PLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLR-SQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLH
            PL  P ES EDCT+A YH+PA TRL+VN+ K+QRDP VW DP +F+P+RFL  S  N +VRGQ  +LIPFG GRR CPG + ALQ++ LTLA L+H
Subjt:  ----PLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLR-SQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLH

Query:  GFEIDKPTEEQLDMEESVGISCMKRAHL
        GFE   P+   +DM  S GI+  K   L
Subjt:  GFEIDKPTEEQLDMEESVGISCMKRAHL

A0A6J1IF60 cytochrome P450 82A3-like7.7e-30256.08Show/hide
Query:  MELPHVSSSGAVAGIFAFLFFLYALFRKSARAGAAHR--KRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARE
        ME+ H+     +  IFA L  LYALFR ++   AA+R  K LPPEA G  PLIGHLH LS +EP HITLAKMAD YGPIFTLRLGM+ AL+VS WEIARE
Subjt:  MELPHVSSSGAVAGIFAFLFFLYALFRKSARAGAAHR--KRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARE

Query:  CFTTNDRIFASRPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNG-GEKVMVEMKTWFGDIT
        CFTTND IFASRPK +A + LGY+ AM+ + PYGPYWRHVRK++TLELLTNHRLEQLQHIRISEVQSSIKKLYEL  +  + G G KV VEM+ WFGD++
Subjt:  CFTTNDRIFASRPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNG-GEKVMVEMKTWFGDIT

Query:  LNTIFRIVVGKRFSTAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQ-RINSGQ-PAETNGHDFMDV
        LNTIFR+VVGKRFS   +G G E YR+ALRD FELFG  VP+D+FP LSWLD+GG+++AM KTAKVLD +  +W++EHR+ + N G+   +T   DFMDV
Subjt:  LNTIFRIVVGKRFSTAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQ-RINSGQ-PAETNGHDFMDV

Query:  MLSAVKDGGELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIP
        MLSAV D G L G+DADTVTKATC + IL  SDTT +TM W LSLLLN+ EAL++ QLELDE+VGR RQVKESDVKNL YLQAI+KETLRLYPA PL IP
Subjt:  MLSAVKDGGELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIP

Query:  HESIEDCTIASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTRISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHGLFTTSGHLHLVR
        HES+EDCT+  YHIPA TRL+VN+QKL +DPL+WE+P E+RPERFLT  +                FD+   + +L+          G+      L L+R
Subjt:  HESIEDCTIASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTRISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHGLFTTSGHLHLVR

Query:  --RAGIFALLFFLYALFTI-SRRFGAHR--KRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEIARDCFTTNDRVF
             IFALL  LYALF I S R  A+R  K LPPEAAG  PLIGHLH L   EP HI LAKMAD YGPIFTLRLGMN AL+VSSWEIAR+CFTTND +F
Subjt:  --RAGIFALLFFLYALFTI-SRRFGAHR--KRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEIARDCFTTNDRVF

Query:  ASRPKLVASKLLGYDYAMFGASPYGPHWR---------------------IR-------------------NNGGDEKVLVEMKTWFGDVTLNTIFRMVV
        ASRPK +A +LLGY+ AM   SPYGP+WR                     IR                   + G   KV VEM+ WFGDV+LNTIFRMVV
Subjt:  ASRPKLVASKLLGYDYAMFGASPYGPHWR---------------------IR-------------------NNGGDEKVLVEMKTWFGDVTLNTIFRMVV

Query:  GKRFSTAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADE----------------------TEEQDFMDVMLSAIEGSE
        GKRFS  F+G G E YR+ALRD FELFG F+PSDSFP LSWLD+GG+++AM K AKV DE                      T+EQDFMDVMLSA+    
Subjt:  GKRFSTAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADE----------------------TEEQDFMDVMLSAIEGSE

Query:  ELSSYDADTVIKA-----------------TW------------------------------------------------------PLAIPHESFEDCTV
         L  +DADTV KA                 TW                                                      PL IPHES EDCTV
Subjt:  ELSSYDADTVIKA-----------------TW------------------------------------------------------PLAIPHESFEDCTV

Query:  ADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVG
          YHIPA+TRLIVN+ KL +DP++WE+PCEFRPERFL S   F++RGQ  QL+PFGSGRRMCPGI FAL+ MH  LANLL  FEI++P+E+ LDMEE  G
Subjt:  ADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVG

Query:  ISCMKRAHLKL
         +  ++  L++
Subjt:  ISCMKRAHLKL

A0A6N2N9K7 Uncharacterized protein5.4e-23144.8Show/hide
Query:  AFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVAS
        A L F   L R+S     + + +  PEAGGAWP+IGHL LL+G+E  H+ L  +AD YGPIF +R+GM  A++VS WE+A+E FTTND   +SRPKL +S
Subjt:  AFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVAS

Query:  KHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTA-VD
        K LG + A  G  PYG +WR +RK+   ELL+N RLE L+H+R SEV+ S+K+LY+LW  +  N   +++V MK W GDI LN + R++ GKR+  A + 
Subjt:  KHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTA-VD

Query:  GGGNEDYR--KALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELSGYDADT
           NE  R  KA+R+F  L GV V  DA PFL WLD GGHEKAMK+ A+ LD +FD WL+EHR+   SG+ A     DFMD ML ++ DG +L+GYDADT
Subjt:  GGGNEDYR--KALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELSGYDADT

Query:  VTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYHIPAGT
        V KAT LS I+ G+DT TVT+IW LSLLLNN EALK AQ ELD  VG+ R V ESD++ L YLQA VKE  RLYPAGPL+   E   DCTI  Y +PAGT
Subjt:  VTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYHIPAGT

Query:  RLMVNIQKLQRDPLVWEDPNEYRPERFL-------------------------TRISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHGLFTTS
        RL++NI K+QRDP VW DP+E++PERFL                            +  LQ+  L LA++L  F+IS PS + +DM  +A     L T  
Subjt:  RLMVNIQKLQRDPLVWEDPNEYRPERFL-------------------------TRISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHGLFTTS

Query:  GHLHLVRRAGIFA----------LLFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEI
          L   R    F+          +   +Y LF I R+   + KR PPEAAGAWPLIGHLHLL G++P HI L  +A+ YGPIFT++LG++R LIVS+ E+
Subjt:  GHLHLVRRAGIFA----------LLFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEI

Query:  ARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR--------------------------------------IRNNGG-DEKVLVEMKTWFGD
        A++C TTND+ FA+RPK +A  +LGY+  MFG SPYG +WR                                      I N     +K+LVEMK WF D
Subjt:  ARDCFTTNDRVFASRPKLVASKLLGYDYAMFGASPYGPHWR--------------------------------------IRNNGG-DEKVLVEMKTWFGD

Query:  VTLNTIFRMVVGKRFSTAFDGGGNEP---YRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADET------------------EEQDFMD
        +TLN I ++VV KR+          P   +R +LR FFEL G+F+ SD+ PFL W+DLGG +KAM K AK  D                    +E+DFMD
Subjt:  VTLNTIFRMVVGKRFSTAFDGGGNEP---YRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADET------------------EEQDFMD

Query:  VMLSAIEGSEELSSYDADTVIKA-----------------TW------------------------------------------------------PLAI
        +MLS ++ ++E S+ + DT+ KA                 TW                                                      PL++
Subjt:  VMLSAIEGSEELSSYDADTVIKA-----------------TW------------------------------------------------------PLAI

Query:  PHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTE
         HES E+CTV  YHIP  TRL  NL K+ RDP VW DP EF+PERFL   K+ + RGQ  +LIPFGSGRRMCPG+SFA QI++L LA LLHGF+I+   +
Subjt:  PHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTE

Query:  EQLDMEESVGISCMKRAHLK
          +DM E+ GI+ +K   LK
Subjt:  EQLDMEESVGISCMKRAHLK

F6H021 Uncharacterized protein1.1e-23640.94Show/hide
Query:  MELPHVSS-----SGAVAGIFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEI
        M +PH SS     +    G+   LF  Y L  + +RAG   ++R+ PEA GAWP+IGHLHLL GS+  H+TL  MAD YGP+FT+RLG+HRAL+VS WE+
Subjt:  MELPHVSS-----SGAVAGIFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEI

Query:  ARECFTTNDRIFASRPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGD
        A+EC TTND+  +SRP+L+ASKHLGY+Y+M G  PYG YWR VRKI +LELL+N RLE L+ +R SEV +SIK+LYELW  K+N  G  V VEMK WFGD
Subjt:  ARECFTTNDRIFASRPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGD

Query:  ITLNTIFRIVVGKRFSTAVDGGGNED---YRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDF
        +TLN I R+V GKR+ +A D   N+     R+  R+F  L G+   ADA PFL WLDLG HEK +KKTAK +D++   WL+EHR+R +SG+   T   DF
Subjt:  ITLNTIFRIVVGKRFSTAVDGGGNED---YRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDF

Query:  MDVMLSAVKDGGELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPL
        MDVMLS V D   L  YDADTV KATCL+ I+ GSDTT VT+ W LSLLLNN++ LK+AQ ELD QVG++R V E D+  L YLQAIVKETLRLYP   L
Subjt:  MDVMLSAVKDGGELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPL

Query:  AIPHESIEDCTIASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRP
          P +  EDCT+  YH+  GTRL++N+ K+Q+DP +W  P E++PERFLT                          I+FALQ++HLTLAN L  FD S P
Subjt:  AIPHESIEDCTIASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRP

Query:  STKLLDMEES---------------------------------------------------------------------------------ADQAHGLFT
        S   +DM ES                                                                                 A + H    
Subjt:  STKLLDMEES---------------------------------------------------------------------------------ADQAHGLFT

Query:  TSGHLHLVRR---------------------------------------------------------------------------AGIFALLFFLYALFT
        T   L  + R                                                                            G+  +LF  Y  + 
Subjt:  TSGHLHLVRR---------------------------------------------------------------------------AGIFALLFFLYALFT

Query:  ISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEIARDCFTTNDRVFASRPKLVASKLLGYDYAMFGA
        + RR  A ++R+ PEAAGAWP+IGHLHLL G++  H+ L  MAD YGPIFT+RLG++RAL+VSS E+A++CFTTND   + RPKLVA + LGY+YAMF  
Subjt:  ISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEIARDCFTTNDRVFASRPKLVASKLLGYDYAMFGA

Query:  SPYGPHWR-IRNNGGDE-------------------------------------KVLVEMKTWFGDVTLNTIFRMVVGKRFSTAFDGGGNEPYR---KAL
        SPY  +WR +R     E                                      VLVEMK WFGD+++N I RMVVGKR+     GG  E  R   KA+
Subjt:  SPYGPHWR-IRNNGGDE-------------------------------------KVLVEMKTWFGDVTLNTIFRMVVGKRFSTAFDGGGNEPYR---KAL

Query:  RDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADETEEQ------------------DFMDVMLSAIEGSEELSSYDADTVIKA-----------
        R+FF L G+F+  D  P L WLDLGGHEKAMKK AK  D   ++                  DFMDV+LS ++G + L  YDADT+ KA           
Subjt:  RDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADETEEQ------------------DFMDVMLSAIEGSEELSSYDADTVIKA-----------

Query:  ------TW------------------------------------------------------PLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVV
              TW                                                      PL+ P +  EDC +  YH+P  TRL++NL KL RDP V
Subjt:  ------TW------------------------------------------------------PLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVV

Query:  WEDPCEFRPERFLRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVGISCMKRAHLKL
        W DP EF+PERFL + ++ + RGQ  QL+PFG+GRR CPGI+FALQ++HL LA+ LHGFE+  P+   +DM E  G++ +K   L++
Subjt:  WEDPCEFRPERFLRSQKNFNVRGQTPQLIPFGSGRRMCPGISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVGISCMKRAHLKL

SwissProt top hitse value%identityAlignment
A0A0N7F297 Demethylepipodophyllotoxin synthase2.1e-12648.11Show/hide
Query:  GIFAFL-FFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPK
        G F  L  F Y +++K       ++ + PP+  GAWP+IGHLHLL+  +  H  L+  AD  GP+F ++LG+H+AL+V+  EIA+ECFTTNDR F +RP 
Subjt:  GIFAFL-FFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPK

Query:  LVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRF--
         VA+K +GY+Y M+G+ PYGPYWR +RKI  LE L+N RL+ L+H+  SE+  S K+LY+LW        +  +V+MK W  D+TLN   ++VVGKRF  
Subjt:  LVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRF--

Query:  ---STAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEH-RQRINSGQPAETNGHDFMDVMLSAVKDGGEL
           ++A +   + +  K LR+ F L G  V +D  P+L WLDLGGHEK MK+T K LD +F  WL EH R+R++ G+  E +  DFMDVMLS +++    
Subjt:  ---STAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEH-RQRINSGQPAETNGHDFMDVMLSAVKDGGEL

Query:  SGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIAS
         G D DT+ K  CL+ IL G+DTT  T+ W LSLLLNN  ALK+AQ ELD  VGR R V ESD+  L Y+ AI+KETLRLYP GPL  P    EDCTIA 
Subjt:  SGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIAS

Query:  YHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-----------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHG
        YH+ AGTRL+VN  K+QRDPLVW  P+EY+PERFL R                       IS ALQ++ LTLA++LHGF++  P+   +DM E+    H 
Subjt:  YHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-----------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHG

Query:  LFT
          T
Subjt:  LFT

O49394 Xanthotoxin 5-hydroxylase CYP82C25.8e-12146.37Show/hide
Query:  LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVASK
        +F   ALF+KS +    H K   P   GAWP+IGHLHLLSG E   + TL KMAD YGP  +LRLG     +VS +E+A++CFT ND+  ASRP   A+K
Subjt:  LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVASK

Query:  HLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDGG
        H+GYD A+ G  PY  +WR +RKI+TLELL+N RL+ L+H+R+SE+   ++ LY LWV K   G E VMV++K+W  D++LN + R+V GKR+     GG
Subjt:  HLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDGG

Query:  GN---------EDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGG-ELS
        G+            RK + +FF L G+   +DAFP L W D  GHEK MK+T + LD + +RW++ HRQ+       + N  DF+DVMLS  + G     
Subjt:  GN---------EDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGG-ELS

Query:  GYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASY
         +DA T  K+TCL+ IL GS+T+  T+ W +SLLLNN++ LK+AQ E+D  VGR R V++SD++NL Y+QAI+KETLRLYPAGPL    E+IEDCT+A Y
Subjt:  GYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASY

Query:  HIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT--------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES
        ++  GTR++VN+ K+QRDP V+ +PNE+RPERF+T                            S A+Q++HL LA  L  FD+       +DM ES
Subjt:  HIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT--------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES

O49858 Cytochrome P450 82A31.9e-12746.76Show/hide
Query:  AVAGIFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASR
        A+A I + +F    L+RK++R   A      P   GAWP++GHL LL+GS+  H  L  +AD YGP+FT++LGM  AL++S WE+++E FTTND   +SR
Subjt:  AVAGIFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASR

Query:  PKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRF
        PKLVA + + Y+ A +GL PYGPYWR +RKI T E L+N R+EQ  HIR+SEV++SIK+L+++W N   N     +V++K W   +T N + R+VVGKR+
Subjt:  PKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRF

Query:  STAVDGGGNED---YRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELS
           +   G +    + K +R+F  L G    AD  P L WLDLGGHEKAMK  AK +D +   WL+EHRQ+   G+  E++  DFMDVM+SA+ +G ++ 
Subjt:  STAVDGGGNED---YRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELS

Query:  GYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASY
         +DADT+ KAT L  IL G+D+T VT+ W LSLLL N  AL +A+ E+D Q+G+   ++ESD+  L YLQAIVKETLRLYP  P + P E  E+C +  Y
Subjt:  GYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASY

Query:  HIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEE
        HI  GTRL+ N+ K+ RDP VW DP E++PERFLT                          +S  L ++H TLANLLH FDI  PS + +DM E
Subjt:  HIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEE

O49859 Cytochrome P450 82A41.9e-11946Show/hide
Query:  HVSSSGAVAGIFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTND
        + ++ G V+ IF    FLY   +K          +  P  GGAWP+ GHL LL GS+  H  L  +A+ +GP+FT++LG  +AL+VS WE+ARECFTTND
Subjt:  HVSSSGAVAGIFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTND

Query:  RIFASRPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRI
           ++RPKL+ ++ + Y+ AM+ + PYGPYWR +RKI   E+L++ R+EQLQ +R+SEVQ+SI +LY++W +++N   +   VE+K WF     N + R+
Subjt:  RIFASRPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRI

Query:  VVGKRF--STAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKD
        VVGKRF  +TA D    E   KA+ +F  L GV    DA P+L WLD GG+EKAMK+TAK LD +   WL+EHRQ+   G+  +    DFM+VMLS++ D
Subjt:  VVGKRF--STAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKD

Query:  GGELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDC
        G  + G DADT+ K+T L+ I AG++ +  T+IW + L+L N   L+  + ELD QVG+ R + ESD+ NL YLQA+VKETLRLY  GPL+ P E  EDC
Subjt:  GGELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDC

Query:  TIASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEES
        T+  YH+  GTRL+ NI K+  DP VW DP E++P+RFLT                          ISF LQ +HL LA+ LH F+I  PST+ LDM E+
Subjt:  TIASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTR-------------------------ISFALQIMHLTLANLLHGFDISRPSTKLLDMEES

Q9SZ46 Xanthotoxin 5-hydroxylase CYP82C41.6e-12647.37Show/hide
Query:  LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVASK
        +F   ALF+KS +     +    P   GAWP+IGHLHLL G E   + TL KMAD YGP  +L+LG + A +VS +E+A++CFT ND+  ASRP   A+K
Subjt:  LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVASK

Query:  HLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDGG
        H+GY++A+ G  PY  +WR +RKI+T+ELL+N RL+ L+H+R+SE+   +K LY LW   +N G + VMV++K+W  D+TLN I R+V GKR+       
Subjt:  HLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDGG

Query:  GNED------YRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELS--GY
         +ED       +KA+  FF L G+   +DAFP LS+ DL GHEK MK+T   LD + +RW++ HRQ+       E N  DF+DVM+S + + G+LS   Y
Subjt:  GNED------YRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELS--GY

Query:  DADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYHI
        DA+T  K+TCL+ IL GSDT+  T+ W +SLLLNN+E LK+AQ E+D  VGR R V++SD++NL YLQAI+KETLRLYPAGPL  P E++EDCT+A Y++
Subjt:  DADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYHI

Query:  PAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT--------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES
        P GTRL+VN+ K+QRDP V+ +PNE+RPERF+T                            S A+Q++HL LA  LH FD+       +DM E+
Subjt:  PAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT--------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES

Arabidopsis top hitse value%identityAlignment
AT2G25160.1 cytochrome P450, family 82, subfamily F, polypeptide 17.1e-9037.32Show/hide
Query:  IFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLV
        + + LF    L    +R    ++K   P   GAWPL+GHLHL     P H+T   MAD YGP+F  +LG  + +I++  E+A+E +T +D++   RP+L 
Subjt:  IFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLV

Query:  ASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAV
        ASK LGY+ + +   PYG YWR +RKI+  EL +   ++     R  E   + + LY  W  KR    E V+V+MK  F D+T N    +V GKR+    
Subjt:  ASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAV

Query:  DGGGNEDYR---KALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELSGYDA
             ++ R   K +R+F + F + + +D  P L +LD    ++ MK+TAK LD V + W++EH+ + +    +E   +D++D+++  +         D 
Subjt:  DGGGNEDYR---KALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELSGYDA

Query:  DTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIA--SYHI
         T  KA CL+ +LAGS+T  V ++W +SLLLNN   L++AQ ELD ++G++R V+E D+K+L YLQAIVKET RLYP  PL      +ED  IA    H+
Subjt:  DTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIA--SYHI

Query:  PAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT-------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES
        PAGT+LMV+  K+ RDP VW +P ++ PERFLT                          I   ++++H  L   LH FD++RPS++ +DM ES
Subjt:  PAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT-------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES

AT3G25180.1 cytochrome P450, family 82, subfamily G, polypeptide 11.4e-9339.54Show/hide
Query:  AFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVAS
        A + F Y   RK           + PE  GA PL GHLHLL G +     LA M+  +GPIF+L+LG +R ++ S  +  ++CFTTND   A+RP +   
Subjt:  AFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVAS

Query:  KHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDG
        +++GY+ A + L PYG YWR +RKI T+ L +NH +E L HIR SEV + IK LY      + NGG  + V++   F  +T N I R +VGKR       
Subjt:  KHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDG

Query:  GGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELSGYDADTVTK
             Y++AL+    L  + +  D  P+L WLD   + + MK+  K LD+V  +WL EH ++ +  +  +      MD++L  + +   +SG+  D + K
Subjt:  GGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELSGYDADTVTK

Query:  ATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYHIPAGTRLM
        AT L+  L GSD+T++T+ W +SLLLNN  AL+ AQ E+D  VG+ R ++ESD++NLKYLQAIVKET RLYP  PL    E+ EDC +  Y +  GTRL+
Subjt:  ATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYHIPAGTRLM

Query:  VNIQKLQRDPLVWEDPNEYRPERFL-----------------------TRISFALQIMHLTLANLLHGFDISRPSTKLLDMEE
        VNI KL RDP +W DP  ++PERF+                         ++  L+++H  LA LL GF++ + S + LDM E
Subjt:  VNIQKLQRDPLVWEDPNEYRPERFL-----------------------TRISFALQIMHLTLANLLHGFDISRPSTKLLDMEE

AT4G31940.1 cytochrome P450, family 82, subfamily C, polypeptide 41.1e-12747.37Show/hide
Query:  LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVASK
        +F   ALF+KS +     +    P   GAWP+IGHLHLL G E   + TL KMAD YGP  +L+LG + A +VS +E+A++CFT ND+  ASRP   A+K
Subjt:  LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVASK

Query:  HLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDGG
        H+GY++A+ G  PY  +WR +RKI+T+ELL+N RL+ L+H+R+SE+   +K LY LW   +N G + VMV++K+W  D+TLN I R+V GKR+       
Subjt:  HLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDGG

Query:  GNED------YRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELS--GY
         +ED       +KA+  FF L G+   +DAFP LS+ DL GHEK MK+T   LD + +RW++ HRQ+       E N  DF+DVM+S + + G+LS   Y
Subjt:  GNED------YRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELS--GY

Query:  DADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYHI
        DA+T  K+TCL+ IL GSDT+  T+ W +SLLLNN+E LK+AQ E+D  VGR R V++SD++NL YLQAI+KETLRLYPAGPL  P E++EDCT+A Y++
Subjt:  DADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASYHI

Query:  PAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT--------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES
        P GTRL+VN+ K+QRDP V+ +PNE+RPERF+T                            S A+Q++HL LA  LH FD+       +DM E+
Subjt:  PAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT--------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES

AT4G31950.1 cytochrome P450, family 82, subfamily C, polypeptide 33.5e-11344.87Show/hide
Query:  LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVASK
        +F   ALF+KS +     +    P   GAWP+IGHLHLL G E   + TL KMAD YGP  +LRLG     + S +E+A++CFT ND+  AS     A+K
Subjt:  LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVASK

Query:  HLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDGG
        H+GY            +W  +RKI+ +ELL+N RL+ L ++R+SE+   +K LY LWV K   G E VMV++K+W  D+  N I R+V GKR+     GG
Subjt:  HLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDGG

Query:  GNE---------DYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELS-
        G E          +RK +  FF L G+   +DAFP L WLDL GHEK MK+T + LD + +RW++ HRQ+       + N  DF+DVMLS + + G+LS 
Subjt:  GNE---------DYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELS-

Query:  -GYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIAS
          YDA+T  K TCL+ IL GS+T+  T+ W +SLLLNN++ LK+ Q E+D  VGR R V++SD+KNL YLQAI+KETLRLYPA PL    E++EDCT+A 
Subjt:  -GYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIAS

Query:  YHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT--------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES
        Y++P GTRL+VN+ K+QRDP V+ +PNE+RPERF+T                            S A+Q++HL LA  LH F++     + +DM ES
Subjt:  YHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT--------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES

AT4G31970.1 cytochrome P450, family 82, subfamily C, polypeptide 24.1e-12246.37Show/hide
Query:  LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVASK
        +F   ALF+KS +    H K   P   GAWP+IGHLHLLSG E   + TL KMAD YGP  +LRLG     +VS +E+A++CFT ND+  ASRP   A+K
Subjt:  LFFLYALFRKSARAGAAHRKRLPPEAGGAWPLIGHLHLLSGSEP-AHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVASK

Query:  HLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDGG
        H+GYD A+ G  PY  +WR +RKI+TLELL+N RL+ L+H+R+SE+   ++ LY LWV K   G E VMV++K+W  D++LN + R+V GKR+     GG
Subjt:  HLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRISEVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDGG

Query:  GN---------EDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGG-ELS
        G+            RK + +FF L G+   +DAFP L W D  GHEK MK+T + LD + +RW++ HRQ+       + N  DF+DVMLS  + G     
Subjt:  GN---------EDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRWLKEHRQRINSGQPAETNGHDFMDVMLSAVKDGG-ELS

Query:  GYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASY
         +DA T  K+TCL+ IL GS+T+  T+ W +SLLLNN++ LK+AQ E+D  VGR R V++SD++NL Y+QAI+KETLRLYPAGPL    E+IEDCT+A Y
Subjt:  GYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVKETLRLYPAGPLAIPHESIEDCTIASY

Query:  HIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT--------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES
        ++  GTR++VN+ K+QRDP V+ +PNE+RPERF+T                            S A+Q++HL LA  L  FD+       +DM ES
Subjt:  HIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLT--------------------------RISFALQIMHLTLANLLHGFDISRPSTKLLDMEES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACTCTCCCACGTCTCCGTCGCCGGAATCTTCGCCCTTCTCTTCTTTCTCTATGCTCTATTTAGGAGGTCTGCAAGCTTCGGTGCTCTCCAGAAGAGAATCCCACC
AGAAGCCGGCTGTGCTTGGCCGGTGATCGGCCACCTCCATCTGCTAACTGGAAAAGAACCAGCCCACATAACCTTGCCGAAAATGGCGGACACCAGAGCCGTGGTCAATT
GGGTTCAGCCCTTATACGGTCCACACTGGCGCCATGTGAGCAGAATAGCCTCGCTCGAACTCCTCAAAAACCAACGCCTCGAGCAACTCCAACCCATCACAATATCGGAG
GTTCAGAGTTCGATTAAGAAACTGTACGAGTTATGGGTCAATAACGGAGGCGGCGATGAGAAAGTGTTTGGTGGAGATGAAGACGTGTGGTCGGAGAGCGATCCTCCACT
GCGGTGGTCGGCAATGGAAATGAAGAGCGTTCCGGAAGACGTTGAGGATTTCTTTGAATTGTTCGGAGTATTCGTTCCGACATGGGAACGCGATGAAGAAGACGCAGAAA
AGTGCTGGACGAGGTGTTTGATAAAATGGCTCAAAGAGCACCAGCGGAGGAGAAATTTCGGCCAGCCGGCGGAGACTGACCACCATGACTTCATGGACGTGATGGTGATG
CTTTCAACGGTTCAAGATGGCGAAGAACTTTTCGGCTACGATGCCGATACACTCACCAAAGCTACCTGTTTCAATATCCATTCGAAGAGCAGGGGCTCCTTCCACCACAA
ACCAATAATCTCATACAAATCTTTCCTATTGCTCATTTTCCCAACTCTAAGCATCCGCCCACCTGAAATGGAACTTCCTCACGTCTCATCATCCGGTGCCGTTGCCGGAA
TCTTCGCCTTCCTCTTCTTTCTCTATGCTCTCTTCAGGAAATCTGCAAGAGCCGGTGCTGCTCACCGGAAGAGACTACCACCCGAAGCAGGCGGTGCTTGGCCGCTGATC
GGCCACCTCCATCTACTAAGTGGAAGTGAACCAGCCCACATAACCTTGGCGAAAATGGCGGACGCCTACGGACCCATCTTTACGTTAAGATTGGGTATGCACAGAGCCTT
GATTGTGAGCGGCTGGGAAATAGCGAGAGAGTGTTTTACTACCAACGACAGAATCTTTGCCTCCCGTCCCAAGCTTGTGGCCTCAAAGCATCTTGGCTATGACTACGCCA
TGATTGGGTTGTGCCCTTACGGTCCATACTGGCGCCATGTGCGCAAAATATCCACGCTTGAACTCCTCACAAACCACCGTCTCGAGCAGCTCCAACACATCAGAATATCG
GAGGTTCAAAGTTCGATTAAGAAACTATACGAGTTATGGGTCAACAAACGAAATAATGGAGGCGAGAAAGTGATGGTAGAGATGAAGACGTGGTTTGGAGATATAACTCT
GAACACGATATTCAGGATAGTGGTCGGAAAGCGATTCTCGACTGCCGTGGACGGCGGTGGTAATGAAGATTATCGGAAGGCGTTGAGGGATTTCTTTGAATTGTTCGGAG
TATGCGTTCCGGCAGATGCGTTTCCATTTCTAAGTTGGTTGGACTTGGGAGGACATGAGAAGGCGATGAAAAAGACGGCGAAAGTGCTGGACGCAGTGTTTGATAGATGG
CTTAAAGAACATAGGCAGAGGATAAATTCCGGCCAGCCGGCAGAGACGAATGGGCATGACTTCATGGACGTGATGCTTTCTGCTGTTAAAGATGGTGGAGAACTTTCCGG
TTACGACGCCGATACAGTCACCAAAGCTACATGTTTGTCTTTTATATTGGCCGGATCTGACACTACAACGGTAACAATGATTTGGACTCTCTCTTTACTTCTCAACAATC
AAGAAGCACTCAAAAGAGCCCAACTTGAATTAGATGAACAAGTTGGAAGACAAAGACAAGTGAAAGAGTCAGATGTCAAAAATCTAAAGTATCTACAAGCAATTGTGAAG
GAAACGCTGCGTTTGTACCCTGCAGGACCTCTTGCAATCCCTCATGAATCCATAGAAGATTGCACAATTGCTAGCTACCACATTCCAGCAGGAACCCGACTTATGGTGAA
TATTCAAAAGCTTCAAAGAGACCCACTTGTATGGGAGGATCCTAATGAATATCGACCAGAAAGATTTCTAACAAGAATCTCGTTTGCCCTCCAAATTATGCATTTAACGC
TTGCGAATCTGCTTCATGGGTTTGATATTAGTAGGCCATCTACAAAATTACTTGATATGGAGGAAAGCGCTGATCAAGCCCATGGCCTCTTCACTACCTCCGGCCATCTC
CACCTCGTACGCCGCGCCGGAATATTCGCTCTTCTCTTCTTTCTCTATGCTCTCTTCACTATCTCCCGCAGATTCGGTGCTCATCGGAAGAGACTCCCACCTGAAGCCGC
GGGCGCCTGGCCGCTGATCGGCCACCTTCATCTGCTAATTGGAGCAGAACCAGCTCACATAAAATTGGCGAAAATGGCCGACGCCTACGGACCAATCTTTACGTTAAGGT
TAGGTATGAACAGAGCCTTGATTGTGAGCAGTTGGGAAATAGCGAGAGATTGCTTTACCACAAACGACAGAGTGTTTGCCTCCCGCCCCAAACTTGTAGCCTCGAAGCTG
CTGGGATATGACTATGCCATGTTTGGGGCGAGCCCTTACGGTCCACACTGGCGCATCAGAAACAACGGAGGCGATGAGAAAGTGTTGGTGGAGATGAAGACGTGGTTCGG
AGATGTAACCCTCAACACCATATTCAGGATGGTGGTCGGAAAACGGTTCTCGACTGCCTTTGACGGCGGTGGTAATGAACCGTACCGGAAGGCGTTGAGGGATTTCTTTG
AATTGTTTGGAATATTCCTTCCGTCGGATTCCTTTCCATTTCTAAGTTGGTTGGACTTGGGAGGACATGAGAAGGCGATGAAGAAGGCGGCGAAAGTGGCGGACGAGACG
GAGGAGCAGGACTTCATGGACGTGATGCTTTCTGCCATTGAAGGTAGCGAAGAACTTTCTAGCTATGATGCCGATACAGTCATTAAGGCTACATGGCCGCTTGCTATCCC
TCACGAATCTTTTGAAGATTGCACAGTTGCCGACTATCACATCCCTGCAAAGACACGCCTCATAGTGAATCTTCATAAACTTCAAAGAGATCCAGTTGTATGGGAAGATC
CTTGTGAATTTCGACCAGAAAGATTTCTAAGGAGTCAAAAAAATTTCAATGTTAGAGGACAAACTCCCCAGCTGATACCATTTGGAAGTGGGCGAAGGATGTGTCCAGGA
ATCTCGTTTGCCCTCCAAATTATGCATTTAACGCTTGCAAATCTGCTTCATGGATTTGAAATTGATAAACCAACCGAAGAACAACTCGACATGGAGGAGAGTGTTGGTAT
AAGTTGTATGAAAAGAGCCCACTTGAAGTTGTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAACTCTCCCACGTCTCCGTCGCCGGAATCTTCGCCCTTCTCTTCTTTCTCTATGCTCTATTTAGGAGGTCTGCAAGCTTCGGTGCTCTCCAGAAGAGAATCCCACC
AGAAGCCGGCTGTGCTTGGCCGGTGATCGGCCACCTCCATCTGCTAACTGGAAAAGAACCAGCCCACATAACCTTGCCGAAAATGGCGGACACCAGAGCCGTGGTCAATT
GGGTTCAGCCCTTATACGGTCCACACTGGCGCCATGTGAGCAGAATAGCCTCGCTCGAACTCCTCAAAAACCAACGCCTCGAGCAACTCCAACCCATCACAATATCGGAG
GTTCAGAGTTCGATTAAGAAACTGTACGAGTTATGGGTCAATAACGGAGGCGGCGATGAGAAAGTGTTTGGTGGAGATGAAGACGTGTGGTCGGAGAGCGATCCTCCACT
GCGGTGGTCGGCAATGGAAATGAAGAGCGTTCCGGAAGACGTTGAGGATTTCTTTGAATTGTTCGGAGTATTCGTTCCGACATGGGAACGCGATGAAGAAGACGCAGAAA
AGTGCTGGACGAGGTGTTTGATAAAATGGCTCAAAGAGCACCAGCGGAGGAGAAATTTCGGCCAGCCGGCGGAGACTGACCACCATGACTTCATGGACGTGATGGTGATG
CTTTCAACGGTTCAAGATGGCGAAGAACTTTTCGGCTACGATGCCGATACACTCACCAAAGCTACCTGTTTCAATATCCATTCGAAGAGCAGGGGCTCCTTCCACCACAA
ACCAATAATCTCATACAAATCTTTCCTATTGCTCATTTTCCCAACTCTAAGCATCCGCCCACCTGAAATGGAACTTCCTCACGTCTCATCATCCGGTGCCGTTGCCGGAA
TCTTCGCCTTCCTCTTCTTTCTCTATGCTCTCTTCAGGAAATCTGCAAGAGCCGGTGCTGCTCACCGGAAGAGACTACCACCCGAAGCAGGCGGTGCTTGGCCGCTGATC
GGCCACCTCCATCTACTAAGTGGAAGTGAACCAGCCCACATAACCTTGGCGAAAATGGCGGACGCCTACGGACCCATCTTTACGTTAAGATTGGGTATGCACAGAGCCTT
GATTGTGAGCGGCTGGGAAATAGCGAGAGAGTGTTTTACTACCAACGACAGAATCTTTGCCTCCCGTCCCAAGCTTGTGGCCTCAAAGCATCTTGGCTATGACTACGCCA
TGATTGGGTTGTGCCCTTACGGTCCATACTGGCGCCATGTGCGCAAAATATCCACGCTTGAACTCCTCACAAACCACCGTCTCGAGCAGCTCCAACACATCAGAATATCG
GAGGTTCAAAGTTCGATTAAGAAACTATACGAGTTATGGGTCAACAAACGAAATAATGGAGGCGAGAAAGTGATGGTAGAGATGAAGACGTGGTTTGGAGATATAACTCT
GAACACGATATTCAGGATAGTGGTCGGAAAGCGATTCTCGACTGCCGTGGACGGCGGTGGTAATGAAGATTATCGGAAGGCGTTGAGGGATTTCTTTGAATTGTTCGGAG
TATGCGTTCCGGCAGATGCGTTTCCATTTCTAAGTTGGTTGGACTTGGGAGGACATGAGAAGGCGATGAAAAAGACGGCGAAAGTGCTGGACGCAGTGTTTGATAGATGG
CTTAAAGAACATAGGCAGAGGATAAATTCCGGCCAGCCGGCAGAGACGAATGGGCATGACTTCATGGACGTGATGCTTTCTGCTGTTAAAGATGGTGGAGAACTTTCCGG
TTACGACGCCGATACAGTCACCAAAGCTACATGTTTGTCTTTTATATTGGCCGGATCTGACACTACAACGGTAACAATGATTTGGACTCTCTCTTTACTTCTCAACAATC
AAGAAGCACTCAAAAGAGCCCAACTTGAATTAGATGAACAAGTTGGAAGACAAAGACAAGTGAAAGAGTCAGATGTCAAAAATCTAAAGTATCTACAAGCAATTGTGAAG
GAAACGCTGCGTTTGTACCCTGCAGGACCTCTTGCAATCCCTCATGAATCCATAGAAGATTGCACAATTGCTAGCTACCACATTCCAGCAGGAACCCGACTTATGGTGAA
TATTCAAAAGCTTCAAAGAGACCCACTTGTATGGGAGGATCCTAATGAATATCGACCAGAAAGATTTCTAACAAGAATCTCGTTTGCCCTCCAAATTATGCATTTAACGC
TTGCGAATCTGCTTCATGGGTTTGATATTAGTAGGCCATCTACAAAATTACTTGATATGGAGGAAAGCGCTGATCAAGCCCATGGCCTCTTCACTACCTCCGGCCATCTC
CACCTCGTACGCCGCGCCGGAATATTCGCTCTTCTCTTCTTTCTCTATGCTCTCTTCACTATCTCCCGCAGATTCGGTGCTCATCGGAAGAGACTCCCACCTGAAGCCGC
GGGCGCCTGGCCGCTGATCGGCCACCTTCATCTGCTAATTGGAGCAGAACCAGCTCACATAAAATTGGCGAAAATGGCCGACGCCTACGGACCAATCTTTACGTTAAGGT
TAGGTATGAACAGAGCCTTGATTGTGAGCAGTTGGGAAATAGCGAGAGATTGCTTTACCACAAACGACAGAGTGTTTGCCTCCCGCCCCAAACTTGTAGCCTCGAAGCTG
CTGGGATATGACTATGCCATGTTTGGGGCGAGCCCTTACGGTCCACACTGGCGCATCAGAAACAACGGAGGCGATGAGAAAGTGTTGGTGGAGATGAAGACGTGGTTCGG
AGATGTAACCCTCAACACCATATTCAGGATGGTGGTCGGAAAACGGTTCTCGACTGCCTTTGACGGCGGTGGTAATGAACCGTACCGGAAGGCGTTGAGGGATTTCTTTG
AATTGTTTGGAATATTCCTTCCGTCGGATTCCTTTCCATTTCTAAGTTGGTTGGACTTGGGAGGACATGAGAAGGCGATGAAGAAGGCGGCGAAAGTGGCGGACGAGACG
GAGGAGCAGGACTTCATGGACGTGATGCTTTCTGCCATTGAAGGTAGCGAAGAACTTTCTAGCTATGATGCCGATACAGTCATTAAGGCTACATGGCCGCTTGCTATCCC
TCACGAATCTTTTGAAGATTGCACAGTTGCCGACTATCACATCCCTGCAAAGACACGCCTCATAGTGAATCTTCATAAACTTCAAAGAGATCCAGTTGTATGGGAAGATC
CTTGTGAATTTCGACCAGAAAGATTTCTAAGGAGTCAAAAAAATTTCAATGTTAGAGGACAAACTCCCCAGCTGATACCATTTGGAAGTGGGCGAAGGATGTGTCCAGGA
ATCTCGTTTGCCCTCCAAATTATGCATTTAACGCTTGCAAATCTGCTTCATGGATTTGAAATTGATAAACCAACCGAAGAACAACTCGACATGGAGGAGAGTGTTGGTAT
AAGTTGTATGAAAAGAGCCCACTTGAAGTTGTTTTAA
Protein sequenceShow/hide protein sequence
MELSHVSVAGIFALLFFLYALFRRSASFGALQKRIPPEAGCAWPVIGHLHLLTGKEPAHITLPKMADTRAVVNWVQPLYGPHWRHVSRIASLELLKNQRLEQLQPITISE
VQSSIKKLYELWVNNGGGDEKVFGGDEDVWSESDPPLRWSAMEMKSVPEDVEDFFELFGVFVPTWERDEEDAEKCWTRCLIKWLKEHQRRRNFGQPAETDHHDFMDVMVM
LSTVQDGEELFGYDADTLTKATCFNIHSKSRGSFHHKPIISYKSFLLLIFPTLSIRPPEMELPHVSSSGAVAGIFAFLFFLYALFRKSARAGAAHRKRLPPEAGGAWPLI
GHLHLLSGSEPAHITLAKMADAYGPIFTLRLGMHRALIVSGWEIARECFTTNDRIFASRPKLVASKHLGYDYAMIGLCPYGPYWRHVRKISTLELLTNHRLEQLQHIRIS
EVQSSIKKLYELWVNKRNNGGEKVMVEMKTWFGDITLNTIFRIVVGKRFSTAVDGGGNEDYRKALRDFFELFGVCVPADAFPFLSWLDLGGHEKAMKKTAKVLDAVFDRW
LKEHRQRINSGQPAETNGHDFMDVMLSAVKDGGELSGYDADTVTKATCLSFILAGSDTTTVTMIWTLSLLLNNQEALKRAQLELDEQVGRQRQVKESDVKNLKYLQAIVK
ETLRLYPAGPLAIPHESIEDCTIASYHIPAGTRLMVNIQKLQRDPLVWEDPNEYRPERFLTRISFALQIMHLTLANLLHGFDISRPSTKLLDMEESADQAHGLFTTSGHL
HLVRRAGIFALLFFLYALFTISRRFGAHRKRLPPEAAGAWPLIGHLHLLIGAEPAHIKLAKMADAYGPIFTLRLGMNRALIVSSWEIARDCFTTNDRVFASRPKLVASKL
LGYDYAMFGASPYGPHWRIRNNGGDEKVLVEMKTWFGDVTLNTIFRMVVGKRFSTAFDGGGNEPYRKALRDFFELFGIFLPSDSFPFLSWLDLGGHEKAMKKAAKVADET
EEQDFMDVMLSAIEGSEELSSYDADTVIKATWPLAIPHESFEDCTVADYHIPAKTRLIVNLHKLQRDPVVWEDPCEFRPERFLRSQKNFNVRGQTPQLIPFGSGRRMCPG
ISFALQIMHLTLANLLHGFEIDKPTEEQLDMEESVGISCMKRAHLKLF