; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008218 (gene) of Snake gourd v1 genome

Gene IDTan0008218
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptioncytochrome P450 CYP72A219-like
Genome locationLG01:107115288..107121372
RNA-Seq ExpressionTan0008218
SyntenyTan0008218
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0004497 - monooxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR001128 - Cytochrome P450
IPR002401 - Cytochrome P450, E-class, group I
IPR036396 - Cytochrome P450 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022936239.1 cytochrome P450 CYP72A219-like [Cucurbita moschata]8.8e-21271.7Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        MEYSWVG  I+V+ SL+  W SW  LNWVWIRP+KLEK LREQGLAGNPYRILYGDLKE  AL +EANSKPM FSHDI  R+ PSI   IQ +GKNSY+W
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH
        LGPYPRV+IMDPEQL +TFSLI DIQKP++NP  KFLL+GII  EG KWAKHRKII+PAFH+DKLK+MVP F +S NEIVSEWE+++PE+G  ELDVMP+
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH

Query:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL
        LQ+L  DAISR AFGSSYKEGQMIFQLL+QLT+ VVKVA GIYIPGWRFLPTKSN K+KE N +IK LVLGIINKRQKAM +      E VQNDLLGILL
Subjt:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL

Query:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA
        +SN KEI A     DVGMSIEDVI+ECK FYIGGQETTA+LL W+MILLS +TEWQ++ARAEV +VFG   P+ DGLSRLKVV+ I       YP ASM 
Subjt:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA

Query:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS
         RVV+KET +VGK+++P GVML+VP+ LIHRD E+WGEDA EFKP+RFS GVS ASK QPAF PFG    +C+G NF++ EAK+ALSLILQRFSF LSPS
Subjt:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS

Query:  YTHAPFVLMTTQPQHGAHIILRK
        YTHAP V+M+ +PQHGAHIILRK
Subjt:  YTHAPFVLMTTQPQHGAHIILRK

XP_022936863.1 uncharacterized protein LOC111443322 isoform X1 [Cucurbita moschata]1.8e-21271.51Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        MEYSWVG  I  + SLL  W SW VLNWVWIRP+KL+K L++QGLAGNPYRIL+GDLKERS L EEAN+KP+ FSHDI  R+ PSIYKTIQ YGK+SYMW
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH
        LGPYPRVHIMDP+QL++TFS I DIQKP++NPLI +LL+GII  EG KWAKHRKII+PAF +DKLK MVPAF ES  EI+SEWE+++PEEG  EL+VMP+
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH

Query:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL
        LQ++  DAISRTAFGSSYKEGQMIFQLL+QL +LVVKVA G+YIPGWRFLPTKSNKK+KEIN +IK LVL IINKRQKAM +      E VQNDLLGILL
Subjt:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL

Query:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA
        +SN KEI A     D GMSIEDVI+ECK FYIGGQETTA+LL WTMILLS+HTEWQ++ARAEV +VFG   P+ DGL+RLKVV+ I       YP ASM 
Subjt:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA

Query:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS
         R V+KET  VGK+++P GVML+ P+ LIHRD E+WGEDA EFKP+RFS GVSKASK QPAFFPFG    +CMG NF+++EAK+A+SLILQRFSF LSPS
Subjt:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS

Query:  YTHAPFVLMTTQPQHGAHIILRK
        Y HAP V+M+ QPQHGAHIILRK
Subjt:  YTHAPFVLMTTQPQHGAHIILRK

XP_022936865.1 cytochrome P450 CYP72A219-like [Cucurbita moschata]4.8e-21070.94Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        MEYSWVG  I+V+ SL+  W SW  LNWVWIRP+KLEK LREQGLAGNPYRILYGDLKE  AL +EANSKPM FSHDI  R+ PSI   IQ +GKNSY+W
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH
        LGPYPRV+IMDPEQL +TFSLI DIQKP++NP  KFLL+GII  EG KWAKHRKII+PAFH+DKLK+MVP F +S NEIVSEWE+++PE+G  ELDVMP+
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH

Query:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL
        LQ+L  DAISR AFGSSYKEGQMIFQLL+QLT+ VVKVA GIYIPGWRFLPTKSNKKMKEIN +IK +VL IINKR++AM +      E VQNDLLGILL
Subjt:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL

Query:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA
        +SN K+I A     DVGMSIED+I ECK FYI GQETT +LL WTMILLS+HTEWQ++ARAEV +VFG   PD DGL RLKVV+ I       YP AS  
Subjt:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA

Query:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS
         R+V+KET +VG +S+P GVML+ P+ LIHRD E+WGEDA EFKP+RFS GVSKASK QPAFFPFG     CMG + +++EAK+A+SLILQRFSF LSPS
Subjt:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS

Query:  YTHAPFVLMTTQPQHGAHIILRK
        Y HAP +LMTTQPQHGAHIILRK
Subjt:  YTHAPFVLMTTQPQHGAHIILRK

XP_022976684.1 uncharacterized protein LOC111477005 [Cucurbita maxima]1.7e-21070.94Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        MEYSWVG  IS++ SLL  W SW VLNWVWIRP+KL+K L++QGLAGNPYRIL+GDLKERSAL EEANSKP+  SHDI PR+ PSIY TIQ YGK+SYMW
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH
        LGPYPRVHIMDPEQL++TFS I DIQKP +NPLI +LL+GII  EG KW KHRKII+PAF  DKLK MVP F +S  EI+SEWEK+IPEEG  ELDVMP+
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH

Query:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL
        LQ++  DAISRTAFGSSYK GQMIFQLL+QL +LVVKVA G+YIPGWRFLPTKSNKK+KEIN +IK LVLGIINKRQKAM +      E VQNDLLGILL
Subjt:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL

Query:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA
        +SN KEI A     DV MSIEDVI+ECK FYIGGQETTA+LL WTMILLS+HTEWQ++ARAEV +VFG   P+ DGL+RLKVV+ I       YP ASM 
Subjt:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA

Query:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS
         R ++KET  VGK+++P G+ML+VP+ LIHRD E+WGEDA EF P+RFS GVSKASK QPAFFPFG    +CMG NF+++EAK+A+S+ILQRFSF LSPS
Subjt:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS

Query:  YTHAPFVLMTTQPQHGAHIILRK
        Y  AP V+M+ +PQHGAHIIL K
Subjt:  YTHAPFVLMTTQPQHGAHIILRK

XP_023536193.1 cytochrome P450 CYP72A219-like [Cucurbita pepo subsp. pepo]6.8e-21271.51Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        MEYSWVG  I+V+ SLL    SW VLNWVWIRP+KLEK L++QGLAGNPYRIL+GDLKER+AL EEAN+KP+ FSHDI  R+ PSIYKTIQ YGK SYMW
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH
        LGPYPRVHIMDPEQL++TFS I DIQKP++NPLI +LL+GII  EG KWAKHRKII+PAFH+DKLK MVPAF ES  EIVSEWE+++PEEG  EL+VMP+
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH

Query:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL
        LQ++  DAISRTAFGSSYKEGQMIFQLL+QL +LVVKVA G+YIPGWRFLPTKSNKK+KEIN +IK  VLGIINKRQKAM +      E VQNDLLGILL
Subjt:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL

Query:  ESNLKEIGAD-----VGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA
        +SN KEI A       GMSIEDVI+ECK FYIGGQETTA+LL WTM+LLS+HTEWQ++ARAEV +VFG   P+ DGL+RLKVV+ I       YP ASM 
Subjt:  ESNLKEIGAD-----VGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA

Query:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS
         R V+KET  VGK+++P GVML+ P+ LIHRD E+WGEDA EFKP+RFS GVSKASK QPAFFPFG    +CMG NF+++EAK+A+S+ILQRFSF LSPS
Subjt:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS

Query:  YTHAPFVLMTTQPQHGAHIILRK
        Y HAP V+M+  PQHGAHIILRK
Subjt:  YTHAPFVLMTTQPQHGAHIILRK

TrEMBL top hitse value%identityAlignment
A0A6J1F7R4 cytochrome P450 CYP72A219-like4.3e-21271.7Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        MEYSWVG  I+V+ SL+  W SW  LNWVWIRP+KLEK LREQGLAGNPYRILYGDLKE  AL +EANSKPM FSHDI  R+ PSI   IQ +GKNSY+W
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH
        LGPYPRV+IMDPEQL +TFSLI DIQKP++NP  KFLL+GII  EG KWAKHRKII+PAFH+DKLK+MVP F +S NEIVSEWE+++PE+G  ELDVMP+
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH

Query:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL
        LQ+L  DAISR AFGSSYKEGQMIFQLL+QLT+ VVKVA GIYIPGWRFLPTKSN K+KE N +IK LVLGIINKRQKAM +      E VQNDLLGILL
Subjt:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL

Query:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA
        +SN KEI A     DVGMSIEDVI+ECK FYIGGQETTA+LL W+MILLS +TEWQ++ARAEV +VFG   P+ DGLSRLKVV+ I       YP ASM 
Subjt:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA

Query:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS
         RVV+KET +VGK+++P GVML+VP+ LIHRD E+WGEDA EFKP+RFS GVS ASK QPAF PFG    +C+G NF++ EAK+ALSLILQRFSF LSPS
Subjt:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS

Query:  YTHAPFVLMTTQPQHGAHIILRK
        YTHAP V+M+ +PQHGAHIILRK
Subjt:  YTHAPFVLMTTQPQHGAHIILRK

A0A6J1F9H9 uncharacterized protein LOC111443322 isoform X26.0e-20671.29Show/hide
Query:  VVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLGPYPRVHIMD
        ++ SL+  W SW  LNWVWIRP+KLEK LREQGLAGNPYRIL+GD KE  A+  EA SKPM FSHDI  R+ PSIYKTIQ YGKNSYMW+GP PRVHIMD
Subjt:  VVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLGPYPRVHIMD

Query:  PEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPHLQNLAADAISR
        PEQL++TFSL+ DIQKP+ NPLI +LL+GII  EG KWAKHRK+ISPAFHMDKLK+MVPAF +S NEI+SEWE+++PEEG  ELDVMP+LQNL+ DAISR
Subjt:  PEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPHLQNLAADAISR

Query:  TAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLESNLKEIGA--
        TAFGSSYKEGQMIFQL++QL +LV+KV+ G YIPGWRFLPTKSN K+KE N +IK LVLGIINKRQKAM +      E VQNDLLGILL+SN KEI A  
Subjt:  TAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLESNLKEIGA--

Query:  ---DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVVEKETIKV
           DVGMSIEDVI+ECK FYIGGQETTA+LL W+MILLS +TEWQ++ARAEV +VFG   P+ DGLSRLKVV+ I       YP ASM  RVV+KET +V
Subjt:  ---DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVVEKETIKV

Query:  GKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHAPFVLMTT
        GK+++P GVML+VP+ LIHRD E+WGEDA EFKP+RFS GVS ASK QPAF PFG    +C+G NF++ EAK+ALSLILQRFSF LSPSYTHAP V+M+ 
Subjt:  GKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHAPFVLMTT

Query:  QPQHGAHIILRK
        +PQHGAHIILRK
Subjt:  QPQHGAHIILRK

A0A6J1FEF2 uncharacterized protein LOC111443322 isoform X18.6e-21371.51Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        MEYSWVG  I  + SLL  W SW VLNWVWIRP+KL+K L++QGLAGNPYRIL+GDLKERS L EEAN+KP+ FSHDI  R+ PSIYKTIQ YGK+SYMW
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH
        LGPYPRVHIMDP+QL++TFS I DIQKP++NPLI +LL+GII  EG KWAKHRKII+PAF +DKLK MVPAF ES  EI+SEWE+++PEEG  EL+VMP+
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH

Query:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL
        LQ++  DAISRTAFGSSYKEGQMIFQLL+QL +LVVKVA G+YIPGWRFLPTKSNKK+KEIN +IK LVL IINKRQKAM +      E VQNDLLGILL
Subjt:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL

Query:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA
        +SN KEI A     D GMSIEDVI+ECK FYIGGQETTA+LL WTMILLS+HTEWQ++ARAEV +VFG   P+ DGL+RLKVV+ I       YP ASM 
Subjt:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA

Query:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS
         R V+KET  VGK+++P GVML+ P+ LIHRD E+WGEDA EFKP+RFS GVSKASK QPAFFPFG    +CMG NF+++EAK+A+SLILQRFSF LSPS
Subjt:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS

Query:  YTHAPFVLMTTQPQHGAHIILRK
        Y HAP V+M+ QPQHGAHIILRK
Subjt:  YTHAPFVLMTTQPQHGAHIILRK

A0A6J1FEX2 cytochrome P450 CYP72A219-like2.3e-21070.94Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        MEYSWVG  I+V+ SL+  W SW  LNWVWIRP+KLEK LREQGLAGNPYRILYGDLKE  AL +EANSKPM FSHDI  R+ PSI   IQ +GKNSY+W
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH
        LGPYPRV+IMDPEQL +TFSLI DIQKP++NP  KFLL+GII  EG KWAKHRKII+PAFH+DKLK+MVP F +S NEIVSEWE+++PE+G  ELDVMP+
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH

Query:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL
        LQ+L  DAISR AFGSSYKEGQMIFQLL+QLT+ VVKVA GIYIPGWRFLPTKSNKKMKEIN +IK +VL IINKR++AM +      E VQNDLLGILL
Subjt:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL

Query:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA
        +SN K+I A     DVGMSIED+I ECK FYI GQETT +LL WTMILLS+HTEWQ++ARAEV +VFG   PD DGL RLKVV+ I       YP AS  
Subjt:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA

Query:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS
         R+V+KET +VG +S+P GVML+ P+ LIHRD E+WGEDA EFKP+RFS GVSKASK QPAFFPFG     CMG + +++EAK+A+SLILQRFSF LSPS
Subjt:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS

Query:  YTHAPFVLMTTQPQHGAHIILRK
        Y HAP +LMTTQPQHGAHIILRK
Subjt:  YTHAPFVLMTTQPQHGAHIILRK

A0A6J1IP99 uncharacterized protein LOC1114770058.1e-21170.94Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        MEYSWVG  IS++ SLL  W SW VLNWVWIRP+KL+K L++QGLAGNPYRIL+GDLKERSAL EEANSKP+  SHDI PR+ PSIY TIQ YGK+SYMW
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH
        LGPYPRVHIMDPEQL++TFS I DIQKP +NPLI +LL+GII  EG KW KHRKII+PAF  DKLK MVP F +S  EI+SEWEK+IPEEG  ELDVMP+
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPH

Query:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL
        LQ++  DAISRTAFGSSYK GQMIFQLL+QL +LVVKVA G+YIPGWRFLPTKSNKK+KEIN +IK LVLGIINKRQKAM +      E VQNDLLGILL
Subjt:  LQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILL

Query:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA
        +SN KEI A     DV MSIEDVI+ECK FYIGGQETTA+LL WTMILLS+HTEWQ++ARAEV +VFG   P+ DGL+RLKVV+ I       YP ASM 
Subjt:  ESNLKEIGA-----DVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMA

Query:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS
         R ++KET  VGK+++P G+ML+VP+ LIHRD E+WGEDA EF P+RFS GVSKASK QPAFFPFG    +CMG NF+++EAK+A+S+ILQRFSF LSPS
Subjt:  TRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPS

Query:  YTHAPFVLMTTQPQHGAHIILRK
        Y  AP V+M+ +PQHGAHIIL K
Subjt:  YTHAPFVLMTTQPQHGAHIILRK

SwissProt top hitse value%identityAlignment
A0A0S2IHL2 Cytochrome P450 72A3971.0e-15752.31Show/hide
Query:  YSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLG
        Y+ +   ++V V ++VGW +W+VLNWVW+ P+KLE+ LR+QG  GN YR+ YGDLKE S ++ +A  KP+  S D   R+ P I++T++KYGK+S++W+G
Subjt:  YSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLG

Query:  PYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPHLQ
        P PRV IMDPE ++      Y   KP  NPL+K   DG+   EG  WAKHRK+++PAFH+++LK M+PA + SC E+VS+W+K+I ++GS ELDV P LQ
Subjt:  PYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPHLQ

Query:  NLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLES
         L +D IS TAFGSSY+EG ++F+L  +  ELV+K    +YIPGW +LPTK N+KMKEI+ K +S ++ IINK+ KAM        E   +D+LGILLES
Subjt:  NLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLES

Query:  NLKE-IG---ADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRV
        NLKE +G    +VGMSI++V+ ECK FY  GQETT+ LL+WTM+LLS H  WQ +AR EVLQ FG  +PDFD L+ LK+V+ I       YP      R 
Subjt:  NLKE-IG---ADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRV

Query:  VEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTH
        V++ET  +G +++P GV + +PI ++H D  +WG+DA EF P+RFS+GVSKA+K+Q  FFPFG    +C+G NF+++EAK+AL++ILQRFSF LSPSYTH
Subjt:  VEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTH

Query:  APFVLMTTQPQHGAHIILRK
        AP  ++T QPQHGA++IL K
Subjt:  APFVLMTTQPQHGAHIILRK

A0A481NR20 Cytochrome P450 72A5525.0e-14950.67Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        ME S     +SVV++ +V W  WR L WVW +PK LE  LR QGL+G PY  L GDLK  S +  EA SKP+  + DI  R+ P   + ++ YG+  + W
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEE-GSYELDVMP
        LGP P + IMDPE ++  F+ +YD QK  L PL + +  G++  +G KWAKHRKII+PAFH++KLK+MVPAF + C+E+V  W+KL+ ++  S E+DV P
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEE-GSYELDVMP

Query:  HLQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGIL
         L ++ AD ISRTAFGSSYKEGQ IF+L +++ EL+++     +IPG+ +LPTK N++MK  + +IK ++ GI+NKR +A    +E G E    DLLGIL
Subjt:  HLQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGIL

Query:  LESNLKEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVV
        LESNL +   + GMSIEDV+EECK FY+ GQETT+ LL+WTM++LS H +WQ +AR EV QVFG K+P+ +GL++LKV++ I       YP  +   R +
Subjt:  LESNLKEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVV

Query:  EKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHA
         KE +K+G M++P GV + +PI L+ RD ELWG DA EFKP+RF  G+SKA+K+Q +FF F     +C+G NF+++EAKMA++LILQRFS  LSPSY HA
Subjt:  EKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHA

Query:  PFVLMTTQPQHGAHIILRK
        P+ ++T  PQ GAH+IL K
Subjt:  PFVLMTTQPQHGAHIILRK

H2DH21 Cytochrome P450 CYP72A2191.4e-15953.33Show/hide
Query:  SVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLGPYPRVHIM
        ++VV +L+G   WR+ NWVW+RP+KLEK LR QG  GN YR+ +GD+KE   + +EA SKP+    DI PRI P   K I  YGKNS++WLGP P VHIM
Subjt:  SVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLGPYPRVHIM

Query:  DPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPHLQNLAADAIS
        +P+ ++   S  Y  QKP  NPL K L  G+   EG +WAKHRK+I+PAFH++KLK+M+PA + S +EIV++WE+++  +G +ELDV+P+L+ L +D IS
Subjt:  DPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPHLQNLAADAIS

Query:  RTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLESNLKEI---
        RTAFGSSY+EG+ IFQL ++  EL+++ +  IY+PG RFLPTK NK+MKEI  ++K  +  IINKR KAM    E GE    +DLLGILLESN KEI   
Subjt:  RTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLESNLKEI---

Query:  -GADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVVEKETIKV
           + G+++++VIEECK F+  GQETT+ LL+WTMILLS H +WQ +A+ EVL+ FG  +PDFDGL+ LKVV+ I       YP      R + +E IK+
Subjt:  -GADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVVEKETIKV

Query:  GKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHAPFVLMTT
        G++S+P GV+L++PI L+H D E+WG+DA EF P+RFS+GV KA+K +  +FPF     +C+G NF+M+EAKMA+++ILQRFSF LSPSY HAP  ++T 
Subjt:  GKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHAPFVLMTT

Query:  QPQHGAHIIL
        QPQ+GAH+IL
Subjt:  QPQHGAHIIL

Q9LUC5 Cytochrome P450 72A155.3e-15150.67Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        ME S     ISVV++ +V W  WR L WVW +PK LE  LR QGLAG PY  L GDLK+   +  EA SKP+  + DI+PR+ P   +  + YG+  + W
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEG-SYELDVMP
         GP P + IMDPEQ++  F+ +YD QKP+  PL   +  G+   +G KWAKHR+II+PAFH++K+K+MVPAF +SC E+V EW++L+ ++G S E+DV P
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEG-SYELDVMP

Query:  HLQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGIL
         L ++ AD ISRTAFGSSYKEGQ IF+L  +L +L+++     +IPG+ +LPTKSN++MK    +I+ ++ GI+NKR +A    +E G E   +DLLGIL
Subjt:  HLQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGIL

Query:  LESNLKEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVV
        LESNL++   + GMS ED++EECK FY  GQETT+ LL+WTM+LLS H +WQ +AR EV QVFG K+PD +GL++LKV++ I       YP  +  TR +
Subjt:  LESNLKEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVV

Query:  EKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHA
         KE +K+G +++PGGV + +PI L+  D ELWG DA EF P RF  G+SKA+KSQ +FFPF     +C+G NF+++EAKMA++LIL+RFSF +SPSY HA
Subjt:  EKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHA

Query:  PFVLMTTQPQHGAHIILRK
        P+ ++T  PQ GA +I+ K
Subjt:  PFVLMTTQPQHGAHIILRK

W8JWW3 Cytochrome P450 72A2251.0e-14950.67Show/hide
Query:  GISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLGPYPRVH
        GISV+ +      +W++LNW WI+PKKLEK LR+ GL GN YR+L GDL E S + +EA SKP+ F++ I PRI P ++K IQ +GKN ++W GP P V 
Subjt:  GISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLGPYPRVH

Query:  IMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKL--IPEEGSYELDVMPHLQNLAA
        I DPE ++   +  Y  QKP  + L K +  G+   +  KWAKHR++++PAFH++KLK M+PAF  SC+E++ +WEKL     EGS+ELDV P+LQ++ +
Subjt:  IMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKL--IPEEGSYELDVMPHLQNLAA

Query:  DAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLESNLKE
        D ISRTAFGS+ +EG+ IF+L +++ E ++++    Y PG  +LPTK  ++MK+I+ ++ S VL II KR +AM        E   +DLLGILL+SN KE
Subjt:  DAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLESNLKE

Query:  IGAD--------VGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRV
        I  D         GMS E+VIEECK FY  GQETTA LL+WT++LLS H EWQ +AR EVLQ+FG  +PDFD L+ LKV++ +       YP   M +RV
Subjt:  IGAD--------VGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRV

Query:  VEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTH
        + ++T K+G +S+PGG+ + +P  ++H D ELWG+DA EFKP+RFS+G+SKA+K Q  +FPF     +C+G NF+M+EAKMAL+LILQRF+F +SPSYTH
Subjt:  VEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTH

Query:  APFVLMTTQPQHGAHIILRKR
        AP  L+T QPQ+GA +IL KR
Subjt:  APFVLMTTQPQHGAHIILRKR

Arabidopsis top hitse value%identityAlignment
AT3G14610.1 cytochrome P450, family 72, subfamily A, polypeptide 71.5e-14849.9Show/hide
Query:  VGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLGPYP
        V   + V+V+++V W +WR++ WVWI+PK LE  L+ QGL G PY  L GD+K    +  EA SKP+  + DI PR+ P   K +  +GK  ++W+GP P
Subjt:  VGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLGPYP

Query:  RVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPE-EGSYELDVMPHLQNL
         + I +PEQ++  F+ + D +K +  PLI+ L  G+   +G KWA HR+II+PAFH++K+K+M+PAF+  C+E+V +WEKL  + E   E+DV P L N+
Subjt:  RVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPE-EGSYELDVMPHLQNL

Query:  AADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLESNL
         AD IS TAFGSSYKEGQ IFQL  +L EL+ +     YIPG RF PTKSN++MK I+ ++  ++ GI++KR+KA    +E GE    +DLLGILLESN 
Subjt:  AADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLESNL

Query:  KEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGK-KQPDFDGLSRLKVVSYI-------YPAASMATRVVEKET
        +E   + GMS+EDV++ECK FY  GQETT+ LL+WTM+LLS+H +WQ +AR EV+QV G+  +PD + L+ LKV++ I       YP  +   RVV KE 
Subjt:  KEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGK-KQPDFDGLSRLKVVSYI-------YPAASMATRVVEKET

Query:  IKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHAPFVL
        +K+G++++P G+ + +P  L+ RD ELWG+DA +FKP+RF  G+SKA+K+Q +FFPFG    +C+G NF+M+EAKMA++LILQ+FSF LSPSY HAP  +
Subjt:  IKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHAPFVL

Query:  MTTQPQHGAHIILRK
        MTT+PQ GAH+IL K
Subjt:  MTTQPQHGAHIILRK

AT3G14640.1 cytochrome P450, family 72, subfamily A, polypeptide 107.1e-15151.15Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        ME S     +SVVV ++V W  WR L WVW +PK LE  LR QGLAG PY  L GDLK    +  EA SKP+  + DI PR+ P  ++ ++ +G+  + W
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEG--SYELDVM
        LGP P + IMDPE ++  F+ +YD  K     L + +  GII  +G KWAKHR+II+PAFH++K+K+MVPAF +SC+++V EW KL+ ++G  S E+DV 
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEG--SYELDVM

Query:  PHLQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGI
        P L ++  D ISRTAFGSSYKEGQ IF+L  +L  L+++    +YIPG+R+LPTKSN++MK    +I+ ++ GI+NKR +A    +E G+    +DLLGI
Subjt:  PHLQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGI

Query:  LLESNLKEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRV
        LLESNL +   + GMS EDV+EECK FY  GQETT+ LL+W M+LLS+H +WQ +AR EV QVFG K+PD + LS+LKV++ I       YP  +  TR 
Subjt:  LLESNLKEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRV

Query:  VEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTH
        ++KE +K+G +++P GV + +PI L+ RDP LWG DA EFKP+RF  G+SKA+KSQ +FFPF     +C+G NF+M+EAKMA++LILQ F+F LSPSY H
Subjt:  VEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTH

Query:  APFVLMTTQPQHGAHIILRK
        AP  ++T  PQ GAH+ILRK
Subjt:  APFVLMTTQPQHGAHIILRK

AT3G14660.1 cytochrome P450, family 72, subfamily A, polypeptide 131.1e-14850.29Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        ME S     +SV V ++V W  WR L  VW++PK LE  LR QGLAG PY  L GDLK   ++  EA SKP+  + DI PRI P   + ++ +G+  + W
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPE-EGSYELDVMP
         GP P + IMDPEQ++  F+ +YD QK +  PL + +  G++  +G KW KHR+II+PAFH++K+K+MVPAF +SC+EIV EW+KL+ + + S E+D+ P
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPE-EGSYELDVMP

Query:  HLQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGIL
         L ++ AD ISRTAFGSSYKEGQ IF+L  +L +L+++      IPG+R+ PTK N++MK    +IK ++ GI+NKR +A    +E G E   +DLLGIL
Subjt:  HLQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGIL

Query:  LESNLKEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVV
        LESNL +   + GMS E+++EECK FY  GQETT  LL+WTM+LLS H +WQ +AR EV QVFG K+PD +GL++LKV++ I       YP     TR +
Subjt:  LESNLKEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVV

Query:  EKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHA
         KE +++G +++PGGV + +PI LI RD ELWG DA EFKP RF  G+SKA+K+Q +FFPF     +C+G NF+++EAKMA++LIL++FSF LSPSY HA
Subjt:  EKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHA

Query:  PFVLMTTQPQHGAHIILRK
        P+ ++TT PQ GA +IL K
Subjt:  PFVLMTTQPQHGAHIILRK

AT3G14680.1 cytochrome P450, family 72, subfamily A, polypeptide 141.3e-14950.99Show/hide
Query:  LLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLGPYPRVHIMDPEQL
        ++V W  WR L WVW  PK LE+ LR QGL+G  Y  L GD K+  ++  EA SKP+  + DI PR+ P   + ++ +G+ +  W GP P + IMDPEQ+
Subjt:  LLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLGPYPRVHIMDPEQL

Query:  RSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEG-SYELDVMPHLQNLAADAISRTAF
        +  F+ +YD QK +  PL K L  G++  +G KWA+HR+II+PAFH++K+K+MV  F ESC+E+V EW+KL+ ++G S E+DV P L ++ AD ISRTAF
Subjt:  RSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEG-SYELDVMPHLQNLAADAISRTAF

Query:  GSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLESNLKEIGADVGMS
        GSSY+EG  IF+L  +L +LV++     +IPG+ +LPTK N++MK    +I+ ++ GIINKR++A    +E G E    DLLGILLESNL +   + GMS
Subjt:  GSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLESNLKEIGADVGMS

Query:  IEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVVEKETIKVGKMSIPGG
         ED++EECK FY+ GQETT+ LL+WTM+LLS H +WQ +AR EV QVFG KQPD +GL++LKV++ I       YP     TR + KE +K+G +++PGG
Subjt:  IEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVVEKETIKVGKMSIPGG

Query:  VMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHAPFVLMTTQPQHGAHI
        V + +P+ L+HRD ELWG DA EFKP+RF  G+SKA+K+Q +FFPF     +C+G NF+++EAKMA+SLILQRFSF LSPSY HAP+ ++T  PQ GAH+
Subjt:  VMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHAPFVLMTTQPQHGAHI

Query:  ILRK
        +L K
Subjt:  ILRK

AT3G14690.1 cytochrome P450, family 72, subfamily A, polypeptide 153.8e-15250.67Show/hide
Query:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW
        ME S     ISVV++ +V W  WR L WVW +PK LE  LR QGLAG PY  L GDLK+   +  EA SKP+  + DI+PR+ P   +  + YG+  + W
Subjt:  MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMW

Query:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEG-SYELDVMP
         GP P + IMDPEQ++  F+ +YD QKP+  PL   +  G+   +G KWAKHR+II+PAFH++K+K+MVPAF +SC E+V EW++L+ ++G S E+DV P
Subjt:  LGPYPRVHIMDPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEG-SYELDVMP

Query:  HLQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGIL
         L ++ AD ISRTAFGSSYKEGQ IF+L  +L +L+++     +IPG+ +LPTKSN++MK    +I+ ++ GI+NKR +A    +E G E   +DLLGIL
Subjt:  HLQNLAADAISRTAFGSSYKEGQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGIL

Query:  LESNLKEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVV
        LESNL++   + GMS ED++EECK FY  GQETT+ LL+WTM+LLS H +WQ +AR EV QVFG K+PD +GL++LKV++ I       YP  +  TR +
Subjt:  LESNLKEIGADVGMSIEDVIEECKTFYIGGQETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYI-------YPAASMATRVV

Query:  EKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHA
         KE +K+G +++PGGV + +PI L+  D ELWG DA EF P RF  G+SKA+KSQ +FFPF     +C+G NF+++EAKMA++LIL+RFSF +SPSY HA
Subjt:  EKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGVSKASKSQPAFFPFG----LCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHA

Query:  PFVLMTTQPQHGAHIILRK
        P+ ++T  PQ GA +I+ K
Subjt:  PFVLMTTQPQHGAHIILRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTACTCGTGGGTTGGCGGTGGGATTTCAGTAGTGGTGAGCTTATTAGTGGGATGGTGTTCATGGAGAGTTTTGAACTGGGTTTGGATAAGGCCAAAGAAGCTAGA
GAAGTTGCTGAGAGAGCAAGGTTTGGCCGGAAACCCTTACCGGATTCTCTATGGTGACTTAAAGGAGAGATCGGCGTTGTCGGAGGAGGCCAACTCCAAGCCTATGACCT
TCTCCCATGATATTGCTCCAAGGATCTTCCCCTCCATCTATAAAACAATACAAAAATATGGTAAGAATTCATACATGTGGCTTGGCCCGTATCCAAGAGTGCATATCATG
GATCCAGAGCAACTTAGATCTACTTTTTCTTTAATCTATGATATTCAAAAGCCGAATTTGAATCCTCTTATCAAGTTTCTTTTGGATGGGATTATAATGCTTGAAGGACC
CAAATGGGCAAAACACAGAAAGATAATCAGCCCTGCATTTCATATGGATAAACTGAAGGATATGGTACCAGCATTCTTTGAGAGTTGTAATGAAATAGTGAGTGAATGGG
AAAAATTAATCCCAGAAGAGGGATCGTATGAGTTGGATGTAATGCCCCATCTACAAAACTTGGCAGCTGATGCAATTTCTCGAACAGCATTTGGAAGTAGCTACAAAGAA
GGACAAATGATCTTTCAACTTCTACAACAACTTACTGAATTGGTGGTCAAAGTTGCCTCTGGGATTTATATTCCTGGATGGAGGTTTCTACCAACGAAGTCAAACAAAAA
AATGAAAGAAATAAATGGGAAAATAAAAAGTTTGGTTTTGGGTATTATAAACAAAAGGCAAAAGGCTATGATGAAGATGAAAGAATTAGGTGAGGAAGTTGTACAAAATG
ATTTACTGGGCATTCTACTGGAATCAAATTTAAAAGAAATTGGAGCTGATGTTGGAATGAGCATAGAAGATGTAATTGAAGAATGCAAAACTTTCTATATTGGTGGCCAA
GAAACCACTGCTAGATTATTAATTTGGACTATGATTTTATTGAGCTACCACACAGAGTGGCAAGACAAAGCAAGAGCCGAGGTTCTGCAAGTTTTTGGCAAGAAGCAGCC
AGATTTTGATGGTTTAAGTCGTCTAAAAGTTGTAAGCTATATATATCCAGCAGCGAGTATGGCTACACGAGTTGTTGAAAAGGAAACAATAAAAGTTGGAAAAATGAGTA
TACCAGGTGGAGTAATGCTAATGGTACCGATAGCTCTTATTCATCGCGATCCTGAACTATGGGGTGAGGATGCATTTGAATTTAAACCACAAAGATTTTCTCAAGGAGTT
TCTAAAGCATCAAAATCCCAACCTGCTTTCTTCCCATTTGGATTGTGTATGGGCCTCAATTTTTCCATGATTGAGGCTAAAATGGCATTATCCTTAATTCTACAACGTTT
TTCATTTCACCTTTCTCCATCTTATACTCATGCTCCTTTCGTCCTTATGACTACTCAACCTCAACATGGAGCTCATATTATCCTACGCAAACGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTACTCGTGGGTTGGCGGTGGGATTTCAGTAGTGGTGAGCTTATTAGTGGGATGGTGTTCATGGAGAGTTTTGAACTGGGTTTGGATAAGGCCAAAGAAGCTAGA
GAAGTTGCTGAGAGAGCAAGGTTTGGCCGGAAACCCTTACCGGATTCTCTATGGTGACTTAAAGGAGAGATCGGCGTTGTCGGAGGAGGCCAACTCCAAGCCTATGACCT
TCTCCCATGATATTGCTCCAAGGATCTTCCCCTCCATCTATAAAACAATACAAAAATATGGTAAGAATTCATACATGTGGCTTGGCCCGTATCCAAGAGTGCATATCATG
GATCCAGAGCAACTTAGATCTACTTTTTCTTTAATCTATGATATTCAAAAGCCGAATTTGAATCCTCTTATCAAGTTTCTTTTGGATGGGATTATAATGCTTGAAGGACC
CAAATGGGCAAAACACAGAAAGATAATCAGCCCTGCATTTCATATGGATAAACTGAAGGATATGGTACCAGCATTCTTTGAGAGTTGTAATGAAATAGTGAGTGAATGGG
AAAAATTAATCCCAGAAGAGGGATCGTATGAGTTGGATGTAATGCCCCATCTACAAAACTTGGCAGCTGATGCAATTTCTCGAACAGCATTTGGAAGTAGCTACAAAGAA
GGACAAATGATCTTTCAACTTCTACAACAACTTACTGAATTGGTGGTCAAAGTTGCCTCTGGGATTTATATTCCTGGATGGAGGTTTCTACCAACGAAGTCAAACAAAAA
AATGAAAGAAATAAATGGGAAAATAAAAAGTTTGGTTTTGGGTATTATAAACAAAAGGCAAAAGGCTATGATGAAGATGAAAGAATTAGGTGAGGAAGTTGTACAAAATG
ATTTACTGGGCATTCTACTGGAATCAAATTTAAAAGAAATTGGAGCTGATGTTGGAATGAGCATAGAAGATGTAATTGAAGAATGCAAAACTTTCTATATTGGTGGCCAA
GAAACCACTGCTAGATTATTAATTTGGACTATGATTTTATTGAGCTACCACACAGAGTGGCAAGACAAAGCAAGAGCCGAGGTTCTGCAAGTTTTTGGCAAGAAGCAGCC
AGATTTTGATGGTTTAAGTCGTCTAAAAGTTGTAAGCTATATATATCCAGCAGCGAGTATGGCTACACGAGTTGTTGAAAAGGAAACAATAAAAGTTGGAAAAATGAGTA
TACCAGGTGGAGTAATGCTAATGGTACCGATAGCTCTTATTCATCGCGATCCTGAACTATGGGGTGAGGATGCATTTGAATTTAAACCACAAAGATTTTCTCAAGGAGTT
TCTAAAGCATCAAAATCCCAACCTGCTTTCTTCCCATTTGGATTGTGTATGGGCCTCAATTTTTCCATGATTGAGGCTAAAATGGCATTATCCTTAATTCTACAACGTTT
TTCATTTCACCTTTCTCCATCTTATACTCATGCTCCTTTCGTCCTTATGACTACTCAACCTCAACATGGAGCTCATATTATCCTACGCAAACGCTAG
Protein sequenceShow/hide protein sequence
MEYSWVGGGISVVVSLLVGWCSWRVLNWVWIRPKKLEKLLREQGLAGNPYRILYGDLKERSALSEEANSKPMTFSHDIAPRIFPSIYKTIQKYGKNSYMWLGPYPRVHIM
DPEQLRSTFSLIYDIQKPNLNPLIKFLLDGIIMLEGPKWAKHRKIISPAFHMDKLKDMVPAFFESCNEIVSEWEKLIPEEGSYELDVMPHLQNLAADAISRTAFGSSYKE
GQMIFQLLQQLTELVVKVASGIYIPGWRFLPTKSNKKMKEINGKIKSLVLGIINKRQKAMMKMKELGEEVVQNDLLGILLESNLKEIGADVGMSIEDVIEECKTFYIGGQ
ETTARLLIWTMILLSYHTEWQDKARAEVLQVFGKKQPDFDGLSRLKVVSYIYPAASMATRVVEKETIKVGKMSIPGGVMLMVPIALIHRDPELWGEDAFEFKPQRFSQGV
SKASKSQPAFFPFGLCMGLNFSMIEAKMALSLILQRFSFHLSPSYTHAPFVLMTTQPQHGAHIILRKR