; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021780 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021780
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionisoleucine N-monooxygenase 2-like
Genome locationscaffold2:13533749..13534968
RNA-Seq ExpressionSpg021780
SyntenySpg021780
Gene Ontology termsGO:0004497 - monooxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR001128 - Cytochrome P450
IPR002401 - Cytochrome P450, E-class, group I
IPR036396 - Cytochrome P450 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148949.1 isoleucine N-monooxygenase 2-like [Momordica charantia]1.4e-11461.23Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF
        MS+H I EAD LV  IF+LT  ++      +VN RS ITQH+CGNIIR+MIFN RYYG GRE DGGPTFEEEE  +ALFTI++HVYAFS+SD+MPCLR  
Subjt:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF

Query:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV
        DLDGH+ I+K A +V+RKYDE IIN R+++WKDG  KEV DILDILISL+D+NGK LLS           + ATI NPSNA+EWAMAE++NQPKIL KAV
Subjt:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV

Query:  EELDR-------------------------------------------------------VHVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE
        EELDR                                                         VLLSRSGLGRNPRIWEDPM+FDPERHL+D TVELGL E
Subjt:  EELDR-------------------------------------------------------VHVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE

Query:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMY
        P LRFITFTR RRGCIGSSLG++ITMMLFARMLQAFSWSL QGMTKIDFSES+E+SLP+PL L  +PRLS +MY
Subjt:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMY

XP_022148952.1 isoleucine N-monooxygenase 2-like [Momordica charantia]4.3e-11659.31Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF
        M  HR +EAD L+  IFSLTT N    G AVVN  S +TQHYCGNIIR+M+FNRRYYG+GRE DGGP+FEEEE N+AL TI+ HV AFS+SDFMPCL+PF
Subjt:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF

Query:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV
        DLDGHE I+K+A + L KYDEPIINER+QQWKDGK++   DILDILIS+KD NGK LL+           + ATI NPSNA+EW MAE++NQPKIL KA+
Subjt:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV

Query:  EELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE
        EELDRV                                                        V+LSRSGLGRNPRIWEDPM+FDPERHL+D TV LGL E
Subjt:  EELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE

Query:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYPT
        P LRFI+FTR RRGC+GSSLG++ITM L  R+LQ FSWSL  G TKI+FSE+DE+SLP+PL LHA+PRL   MYPT
Subjt:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYPT

XP_022148957.1 isoleucine N-monooxygenase 2-like [Momordica charantia]8.6e-11759.57Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF
        M  HR +EAD L+  IFSLTT N   +  AVVN RS +TQHYCGNIIR+M+FN+RYYG+GRE DGGP+FEEEE N+AL TI+ HV AFS+SDFMPCL+PF
Subjt:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF

Query:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV
        DLDGHE I+K+A + L KYDEPIINER+QQWKDGK++   DILDILIS KD NGK LL+           + ATI NPSNA+EW MAE++NQPKIL KA+
Subjt:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV

Query:  EELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE
        EELDRV                                                        V+LSRSGLGRNPRIWEDPM+FDPERHL+D TVELGL E
Subjt:  EELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE

Query:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYPT
        P LRFI+FTR RRGC+GSSLG++ITMML  R+LQ FSWSL  G TKI+FSE+DE+SLP+PL LHA+PRL   MYPT
Subjt:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYPT

XP_022154341.1 isoleucine N-monooxygenase 2-like [Momordica charantia]8.9e-11460.7Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF
        MS+H I EAD LV  IF+LT  ++      +VN RS ITQH+CGNIIR+MIFN RYYG GRE DGGPTFEEEE  +AL TI++H++AFS+SD+MPCLR  
Subjt:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF

Query:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV
        DLDGH+ I+K A +V+RKYDEPIINER+++WK G  KEV DILDILISL D+NGK LLS           + AT+ NPSNA+EWAMAE+INQPKIL KAV
Subjt:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV

Query:  EELDR-------------------------------------------------------VHVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE
        EELDR                                                         VLLSRSGLGRNPRIWEDPM+FDPERHL+D TVELGL E
Subjt:  EELDR-------------------------------------------------------VHVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE

Query:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMY
        P LRFITFTR RRGCIGSSLG++ITMMLFARMLQAFSWSL  GMT IDFSESDE+SLP+PL L A+ RLSS+MY
Subjt:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMY

XP_023516709.1 isoleucine N-monooxygenase 2-like [Cucurbita pepo subsp. pepo]5.8e-11358.59Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTAN--------DEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSD
        M  HR  EAD L+  IFS TT N           RG AVVN RS ITQHYCGNIIR+M+FN+RYYG+GRE DGGPTFEEEE N+AL TI+ HV AFS+SD
Subjt:  MSSHRIDEADRLVHQIFSLTTAN--------DEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSD

Query:  FMPCLRPFDLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEV-GDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMIN
        F+P L+PFDLDGH+ +LK A  V++KYDEPII+ER+Q+WKD K +EV  DILDILISLKD+N KPLLS           + ATI NPSNA+EWAMAE++N
Subjt:  FMPCLRPFDLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEV-GDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMIN

Query:  QPKILNKAVEELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRD
        QPKIL KA+EELD+V                                                        VLLSR GLGRNP+IWEDPMKF+PERH+ D
Subjt:  QPKILNKAVEELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRD

Query:  NTVELGLLEPGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYP
         T ELGLLEP LRF++F+R RRGC GSSLG++ITMMLFAR+LQ FSWSLL+G+TKIDFSE+DE+SLP+PL LHA+PRLS  MYP
Subjt:  NTVELGLLEPGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYP

TrEMBL top hitse value%identityAlignment
A0A6J1D5K2 isoleucine N-monooxygenase 2-like6.6e-11561.23Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF
        MS+H I EAD LV  IF+LT  ++      +VN RS ITQH+CGNIIR+MIFN RYYG GRE DGGPTFEEEE  +ALFTI++HVYAFS+SD+MPCLR  
Subjt:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF

Query:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV
        DLDGH+ I+K A +V+RKYDE IIN R+++WKDG  KEV DILDILISL+D+NGK LLS           + ATI NPSNA+EWAMAE++NQPKIL KAV
Subjt:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV

Query:  EELDR-------------------------------------------------------VHVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE
        EELDR                                                         VLLSRSGLGRNPRIWEDPM+FDPERHL+D TVELGL E
Subjt:  EELDR-------------------------------------------------------VHVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE

Query:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMY
        P LRFITFTR RRGCIGSSLG++ITMMLFARMLQAFSWSL QGMTKIDFSES+E+SLP+PL L  +PRLS +MY
Subjt:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMY

A0A6J1D6G3 isoleucine N-monooxygenase 2-like2.1e-11659.31Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF
        M  HR +EAD L+  IFSLTT N    G AVVN  S +TQHYCGNIIR+M+FNRRYYG+GRE DGGP+FEEEE N+AL TI+ HV AFS+SDFMPCL+PF
Subjt:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF

Query:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV
        DLDGHE I+K+A + L KYDEPIINER+QQWKDGK++   DILDILIS+KD NGK LL+           + ATI NPSNA+EW MAE++NQPKIL KA+
Subjt:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV

Query:  EELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE
        EELDRV                                                        V+LSRSGLGRNPRIWEDPM+FDPERHL+D TV LGL E
Subjt:  EELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE

Query:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYPT
        P LRFI+FTR RRGC+GSSLG++ITM L  R+LQ FSWSL  G TKI+FSE+DE+SLP+PL LHA+PRL   MYPT
Subjt:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYPT

A0A6J1D6G9 isoleucine N-monooxygenase 2-like4.2e-11759.57Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF
        M  HR +EAD L+  IFSLTT N   +  AVVN RS +TQHYCGNIIR+M+FN+RYYG+GRE DGGP+FEEEE N+AL TI+ HV AFS+SDFMPCL+PF
Subjt:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF

Query:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV
        DLDGHE I+K+A + L KYDEPIINER+QQWKDGK++   DILDILIS KD NGK LL+           + ATI NPSNA+EW MAE++NQPKIL KA+
Subjt:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV

Query:  EELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE
        EELDRV                                                        V+LSRSGLGRNPRIWEDPM+FDPERHL+D TVELGL E
Subjt:  EELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE

Query:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYPT
        P LRFI+FTR RRGC+GSSLG++ITMML  R+LQ FSWSL  G TKI+FSE+DE+SLP+PL LHA+PRL   MYPT
Subjt:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYPT

A0A6J1DNF2 isoleucine N-monooxygenase 2-like4.3e-11460.7Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF
        MS+H I EAD LV  IF+LT  ++      +VN RS ITQH+CGNIIR+MIFN RYYG GRE DGGPTFEEEE  +AL TI++H++AFS+SD+MPCLR  
Subjt:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF

Query:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV
        DLDGH+ I+K A +V+RKYDEPIINER+++WK G  KEV DILDILISL D+NGK LLS           + AT+ NPSNA+EWAMAE+INQPKIL KAV
Subjt:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV

Query:  EELDR-------------------------------------------------------VHVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE
        EELDR                                                         VLLSRSGLGRNPRIWEDPM+FDPERHL+D TVELGL E
Subjt:  EELDR-------------------------------------------------------VHVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE

Query:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMY
        P LRFITFTR RRGCIGSSLG++ITMMLFARMLQAFSWSL  GMT IDFSESDE+SLP+PL L A+ RLSS+MY
Subjt:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMY

A0A6J1H9V7 isoleucine N-monooxygenase 2-like4.5e-11159.04Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF
        M  HR  EAD L+  IFS T+     R  AVVN RS ITQHYCGNIIR+M+FN+RYYG+GRE DGGPTFEEEE N+AL TI+ HV AFS SDFMP L+PF
Subjt:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF

Query:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEV-GDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKA
        DLDGH+ I+K A  V++KYDEPII+ER+Q+WKDGK + V  DIL+ILISLKD+N KPLLS           + ATI NPSNA+EWAMAE++NQPKIL KA
Subjt:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEV-GDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKA

Query:  VEELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLL
        +EELD+V                                                        VLLSR  LGRNPRIWEDPM+F+PERHL D TVELGLL
Subjt:  VEELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLL

Query:  EPGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYP
        EP LRF++F+R RRGC GSSLG++ITMMLFAR+LQ FSWS + G+TKIDFSE+DE+SLP+PL LHAQPRLS  MYP
Subjt:  EPGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYP

SwissProt top hitse value%identityAlignment
A0A068Q605 Tryptophan N-monooxygenase CYP79A683.3e-7944.95Show/hide
Query:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG
        R +EAD LV  +++  +AN   +  AVVN R I  Q Y G+I+RKMIFNR Y+G+GRE DGGP  EEEE   AL T++++ YAF VSD++P LR FD+DG
Subjt:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG

Query:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAVEELD
        HE  ++KA  +++K+ EPI+NER+Q+W+DGK+ E  D+LD+ ISLKD NG+PLLS           + +T+ +P N  EWA+ EM+NQP++L KA EELD
Subjt:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAVEELD

Query:  RV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLR---DNTVELGLLEP
        RV                                                       +V+LSR  LGRNPR+WE+P++F+PERHL    D  V+L   E 
Subjt:  RV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLR---DNTVELGLLEP

Query:  GLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQ--PLCLHAQPRLSSSMYP
         LRF++F+  RRGC+G  LGS+I +ML AR+LQ FSWSL   + KIDF+E D+I L +  PL   A+PRL +S+YP
Subjt:  GLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQ--PLCLHAQPRLSSSMYP

Q43135 Tyrosine N-monooxygenase1.3e-7540.48Show/hide
Query:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG
        R DEAD L   +++L T      G   V+ R  + +HYCGN+IR+++FNRRY+G   +ADGGP   E     A+FT +  +YAF VSD++P LR  DLDG
Subjt:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG

Query:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLSYR----------FATIVNPSNAIEWAMAEMINQPKILNKAVEELD
        HE I+K+A+  + +  + +I++R +QWK G+++E+ D LD+LI+LKD  G PLL+            FA + NPSNA+EWA+AEM+N P+++ KA+EELD
Subjt:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLSYR----------FATIVNPSNAIEWAMAEMINQPKILNKAVEELD

Query:  RV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLR
        RV                                                       HV+LSR+GLGRNPR+W++P++F P+RHL     ++ L E  LR
Subjt:  RV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLR

Query:  FITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSES-DEISLPQPLCLHAQPRLSSSMYPT
        FI+F+  RRGCI +SLG+++++MLF R+LQ F+WS   G+  +D SES  +  +  PL LHA+PRL + +YP+
Subjt:  FITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSES-DEISLPQPLCLHAQPRLSSSMYPT

Q6J540 Isoleucine N-monooxygenase 22.4e-7743.16Show/hide
Query:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG
        R +EAD L+  I++     D   G  +VN R I  QHY GN+ RK+IFN RY+G+  E DGGP FEE E   A FTI+ +VYAFS+SDF+P LR  DLDG
Subjt:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG

Query:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLSYR----------FATIVNPSNAIEWAMAEMINQPKILNKAVEELD
        H + + KA  +++KY +PII++RI+QW DG K    D+LD+LI LKD + KPLL+ +             + NPSNA EWA+AEM+NQP++L +A EELD
Subjt:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLSYR----------FATIVNPSNAIEWAMAEMINQPKILNKAVEELD

Query:  RV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLR
         V                                                        VLLSR  LGRNP+ W DP+KF+PERHL++  +++ L EP LR
Subjt:  RV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLR

Query:  FITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSE-SDEISLPQPLCLHAQPRLSSSMYPT
        FI+FT  RR C G +LG+++T+MLFARML  FSWS    ++ ID ++ SD++ + +PLC+ A+PRL++ +Y T
Subjt:  FITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSE-SDEISLPQPLCLHAQPRLSSSMYPT

Q6J541 Isoleucine N-monooxygenase 14.1e-7744.24Show/hide
Query:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG
        R +EAD L+  I++     D   G  +VN R I  QHY GN+ RK+IFN RY+G+  E DGGP FEE E   A FTI+ +VYAFS+SDF+P LR  DLDG
Subjt:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG

Query:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLSYR----------FATIVNPSNAIEWAMAEMINQPKILNKAVEELD
        H + + KA +++RKY +PII++RI+QW DG K    D+LD+LI LKD N KPLL+ +             + NPSNA EWA+AEMINQP++L +A EELD
Subjt:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLSYR----------FATIVNPSNAIEWAMAEMINQPKILNKAVEELD

Query:  RV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLR
         V                                                        VLLSR  LGRNP+ W DP+KF+PERHL++  +++ L EP LR
Subjt:  RV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLR

Query:  FITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSES-DEISLPQPLCLHAQPRLSSSMYPT
        FI+FT  RR C G +LG+++T+MLFARML  FSWS    ++ ID   S D++ L +PL L A+PRL++ +Y T
Subjt:  FITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSES-DEISLPQPLCLHAQPRLSSSMYPT

Q9M7B8 Valine N-monooxygenase 16.5e-7543.43Show/hide
Query:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG
        R +EAD LV  I +   AN        VN R+  T+HY GN+IRKM+F++RY+G+G   DGGP  EE E   A+FT + ++Y F +SDF+P L   DLDG
Subjt:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG

Query:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAVEELD
         E  +  A++ +R Y  P+I+ERIQQWK G++KE+ D+LD+ I+LKD +G PLL+             AT+ NPSNAIEWAM EM+NQP+IL KA EELD
Subjt:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAVEELD

Query:  RV-------------------------------------HV------------------LLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLR
        RV                                     HV                  +LSR GLGRNP+ W DP+K+DPERH+  N  E+ L E  LR
Subjt:  RV-------------------------------------HV------------------LLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLR

Query:  FITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSES-DEISLPQPLCLHAQPRLSSSMYPT
        F+TF+  RRGC+ S LGS +T ML ARMLQ F+W+    ++KID +E+ DE++   P+   A+PRL+  +YPT
Subjt:  FITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSES-DEISLPQPLCLHAQPRLSSSMYPT

Arabidopsis top hitse value%identityAlignment
AT1G79370.1 cytochrome P450, family 79, subfamily C, polypeptide 11.1e-5033.51Show/hide
Query:  EADRLVHQIFSLTTANDEK-RGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDGHE
        E+D L+  + +L   ++ K +  AVVN R I+  H   N+  +++F RR++ +    DG     E+E   A+F  +   ++F V+D+ P LR ++L G E
Subjt:  EADRLVHQIFSLTTANDEK-RGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDGHE

Query:  NILKKASQVLRKYDEPIINERIQQW----KDGKKKE-------VGDILDILISLKDDNGKPLLSYRFAT----------IVNPSNAIEWAMAEMINQPKI
          L++A  V+ +Y++ II+E+I+ W    KD  + E       + D LDIL +LKD+NGKPLL+ +  T          I N  N IEW +AEM+NQ +I
Subjt:  NILKKASQVLRKYDEPIINERIQQW----KDGKKKE-------VGDILDILISLKDDNGKPLLSYRFAT----------IVNPSNAIEWAMAEMINQPKI

Query:  LNKAVEELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVE
        L KAVEE+D V                                                       H+L+SR G+GRNP+ W++P+ + PERH+  N  E
Subjt:  LNKAVEELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVE

Query:  LGLLEPGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMT-KIDFSESDE-ISLPQPLCLHAQPRLSSSMYP
        + L EP LR ++F   RRGC+G+ LG+S+ + L  R+LQ F W++  G T +++  ES E + +  PL    +PRL  +MYP
Subjt:  LGLLEPGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMT-KIDFSESDE-ISLPQPLCLHAQPRLSSSMYP

AT2G22330.1 cytochrome P450, family 79, subfamily B, polypeptide 31.9e-6937.77Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF
        +  +R +E D L   ++++       + S  V+ R  +T+HYCGN I++++F  R +    EADGGPT E+ E   A+F  +   +AF +SD++P L   
Subjt:  MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPF

Query:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV
        DL+GHE I++++S ++ KY +PII+ERI+ W++GK+ ++ D LDI IS+KD+ G+PLL+             A   NPSNA+EWA+AEMIN+P+IL+KA+
Subjt:  DLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAV

Query:  EELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE
        EE+DRV                                                        VLLSR GLGRNP++W DP+ F PERHL + + E+ L E
Subjt:  EELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLE

Query:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESD-EISLPQPLCLHAQPRLSSSMYP
          LRFI+F+  +RGC   +LG++IT M+ AR+LQ F W L    T+++  ES  ++ L +PL L  + RLS  +YP
Subjt:  PGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESD-EISLPQPLCLHAQPRLSSSMYP

AT4G39950.1 cytochrome P450, family 79, subfamily B, polypeptide 21.6e-6837.27Show/hide
Query:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG
        R +E D L   ++++       + S  V+ R  +T+HYCGN I+K++F  R + +    DGGPT E+ E  +A+F  +   +AF +SD++P L   DL+G
Subjt:  RIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDG

Query:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAVEELD
        HE I++++S ++ KY +PII+ERI+ W++GK+ ++ D LDI IS+KD+ G PLL+             A   NPSNA+EWAMAEM+N+P+IL KA+EE+D
Subjt:  HENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKAVEELD

Query:  RV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLR
        RV                                                        VLLSR GLGRNP++W DP+ F PERHL + + E+ L E  LR
Subjt:  RV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLR

Query:  FITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESD-EISLPQPLCLHAQPRLSSSMYPT
        FI+F+  +RGC   +LG+++T M+ AR+LQ F+W L +  T+++  ES  ++ L +PL +    RL   +YPT
Subjt:  FITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESD-EISLPQPLCLHAQPRLSSSMYPT

AT5G05260.1 cytochrome p450 79a22.8e-7340.48Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTANDEKRGSA-VVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRP
        M   R +EAD LV  I + +  N   RG+A VV    +  + Y GN+ RKM+F  R++G+G E   GP  EE E  ++LFT+++H+YAF++SD++P LR 
Subjt:  MSSHRIDEADRLVHQIFSLTTANDEKRGSA-VVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRP

Query:  FDLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKA
         DL+GHE ++  A + + KY++P ++ER+ QW++GK KE  D LD+ I  KD +GKP LS             AT+ NPSNA EW MAEMIN+P I+ KA
Subjt:  FDLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKA

Query:  VEELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLL
        VEE+DRV                                                       HVL+SR G+GRNP +W+ P KFDPERHL  NT  + L 
Subjt:  VEELDRV-------------------------------------------------------HVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLL

Query:  EPGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSES-DEISLPQPLCLHAQPRLSSSMYPT
        E  L  I+F+  RRGC+G  +GS++T ML AR++Q F+W  + G  KID SES +++ + +PL   A PRL+  +YPT
Subjt:  EPGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSES-DEISLPQPLCLHAQPRLSSSMYPT

AT5G35917.1 cytochrome P450, family 79, subfamily A, polypeptide 3 pseudogene9.6e-6639.12Show/hide
Query:  MSSHRIDEADRLVHQIFSLTTAN-DEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRP
        M   R +EAD LV  I +    N     G AV++ R ++ Q Y GN+ RKM+F  R++G+G E + GP  EE E  ++LFT+++H+YAF++SD++P LR 
Subjt:  MSSHRIDEADRLVHQIFSLTTAN-DEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRP

Query:  FDLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKA
         DL+GHE ++  A + + KY++P ++ER+ QW++GK KE  D LD+ I  KD NGKP LS             AT+ NPSNA EWAMAEMIN+P I+ KA
Subjt:  FDLDGHENILKKASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLS----------YRFATIVNPSNAIEWAMAEMINQPKILNKA

Query:  VEELDR----------------------------------------VHVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLRFITFTRSRRG
        VEE+DR                                         H+L+SR G+GRNP +W+ P+KF+PERHL +N                      
Subjt:  VEELDR----------------------------------------VHVLLSRSGLGRNPRIWEDPMKFDPERHLRDNTVELGLLEPGLRFITFTRSRRG

Query:  CIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSES-DEISLPQPLCLHAQPRLSSSMYPT
               SS+T ML AR++Q F+WS + G +KID SES  ++ + +PL   A PRL   +YPT
Subjt:  CIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSES-DEISLPQPLCLHAQPRLSSSMYPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGCCATAGGATAGACGAAGCTGACAGACTTGTACATCAAATTTTCAGTCTAACCACTGCTAACGATGAAAAACGAGGCAGTGCAGTGGTGAACTCAAGAAGCAT
TATTACACAACATTATTGTGGCAACATCATTAGAAAGATGATATTCAATAGAAGATACTATGGCCGAGGAAGGGAAGCTGATGGAGGGCCAACTTTTGAAGAAGAAGAGC
GTAATAAAGCTTTGTTCACAATCATTAGTCATGTTTACGCATTTTCTGTATCGGATTTCATGCCTTGTTTAAGACCATTTGATTTGGACGGACATGAGAATATTTTGAAA
AAGGCTTCTCAAGTGCTCAGAAAGTACGATGAACCCATTATAAATGAGAGAATACAACAATGGAAAGATGGTAAGAAGAAAGAGGTTGGAGACATTCTTGATATCCTTAT
CTCACTTAAAGATGATAATGGCAAACCACTCTTGAGCTACAGATTTGCAACAATAGTTAATCCTTCGAATGCAATTGAATGGGCGATGGCAGAAATGATCAATCAACCAA
AGATTCTCAACAAAGCAGTTGAAGAATTGGATAGAGTCCATGTCCTACTAAGTCGTTCAGGACTTGGACGAAACCCAAGGATCTGGGAGGACCCAATGAAGTTTGATCCA
GAACGTCATCTCAGAGATAATACGGTTGAATTAGGATTGTTAGAACCAGGTTTGCGATTTATTACCTTTACTAGAAGCAGGAGAGGCTGCATAGGCAGCTCATTGGGCAG
CAGCATTACCATGATGCTCTTTGCAAGGATGCTACAAGCATTTTCATGGAGCTTGCTACAAGGAATGACGAAGATAGACTTTTCTGAATCTGATGAAATTTCTCTTCCTC
AACCATTGTGCCTGCATGCACAACCACGTCTATCTTCTTCCATGTATCCAACTTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAGCCATAGGATAGACGAAGCTGACAGACTTGTACATCAAATTTTCAGTCTAACCACTGCTAACGATGAAAAACGAGGCAGTGCAGTGGTGAACTCAAGAAGCAT
TATTACACAACATTATTGTGGCAACATCATTAGAAAGATGATATTCAATAGAAGATACTATGGCCGAGGAAGGGAAGCTGATGGAGGGCCAACTTTTGAAGAAGAAGAGC
GTAATAAAGCTTTGTTCACAATCATTAGTCATGTTTACGCATTTTCTGTATCGGATTTCATGCCTTGTTTAAGACCATTTGATTTGGACGGACATGAGAATATTTTGAAA
AAGGCTTCTCAAGTGCTCAGAAAGTACGATGAACCCATTATAAATGAGAGAATACAACAATGGAAAGATGGTAAGAAGAAAGAGGTTGGAGACATTCTTGATATCCTTAT
CTCACTTAAAGATGATAATGGCAAACCACTCTTGAGCTACAGATTTGCAACAATAGTTAATCCTTCGAATGCAATTGAATGGGCGATGGCAGAAATGATCAATCAACCAA
AGATTCTCAACAAAGCAGTTGAAGAATTGGATAGAGTCCATGTCCTACTAAGTCGTTCAGGACTTGGACGAAACCCAAGGATCTGGGAGGACCCAATGAAGTTTGATCCA
GAACGTCATCTCAGAGATAATACGGTTGAATTAGGATTGTTAGAACCAGGTTTGCGATTTATTACCTTTACTAGAAGCAGGAGAGGCTGCATAGGCAGCTCATTGGGCAG
CAGCATTACCATGATGCTCTTTGCAAGGATGCTACAAGCATTTTCATGGAGCTTGCTACAAGGAATGACGAAGATAGACTTTTCTGAATCTGATGAAATTTCTCTTCCTC
AACCATTGTGCCTGCATGCACAACCACGTCTATCTTCTTCCATGTATCCAACTTATTGA
Protein sequenceShow/hide protein sequence
MSSHRIDEADRLVHQIFSLTTANDEKRGSAVVNSRSIITQHYCGNIIRKMIFNRRYYGRGREADGGPTFEEEERNKALFTIISHVYAFSVSDFMPCLRPFDLDGHENILK
KASQVLRKYDEPIINERIQQWKDGKKKEVGDILDILISLKDDNGKPLLSYRFATIVNPSNAIEWAMAEMINQPKILNKAVEELDRVHVLLSRSGLGRNPRIWEDPMKFDP
ERHLRDNTVELGLLEPGLRFITFTRSRRGCIGSSLGSSITMMLFARMLQAFSWSLLQGMTKIDFSESDEISLPQPLCLHAQPRLSSSMYPTY