CuGenDBv2

HG10015046 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10015046
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Cysteine proteinase inhibitor
Genome location: Chr02:23285164..23301151
RNA-Seq Expression: HG10015046
Synteny: HG10015046
Gene Ontology terms:
GO:0016125 - sterol metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0004497 - monooxygenase activity (molecular function)
GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domains:
IPR000010 - Cystatin domain
IPR001128 - Cytochrome P450
IPR002401 - Cytochrome P450, E-class, group I
IPR017972 - Cytochrome P450, conserved site
IPR018073 - Proteinase inhibitor I25, cystatin, conserved site
IPR027214 - Cystatin
IPR036396 - Cytochrome P450 superfamily
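
The genome location field uses a `chromosome:start..end` layout. A minimal Python sketch for splitting it into its parts follows. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not confirm.

```python
import re

def parse_location(location):
    """Split a 'Chr02:23285164..23301151' style string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr02:23285164..23301151")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 15988 bp on Chr02.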


Homology
GenBank top hits (e value, % identity, alignment)
XP_008445945.1 PREDICTED: 2-hydroxyisoflavanone synthase-like [Cucumis melo] (e value: 1.8e-201; % identity: 72.85)
Query:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY
        MSS+CDG TP NVIGILLR ESSRNCNSE+KSRILRDFVTREVNAFLWFFL AITAVLISKVVALF+LWSKAK LPGPPCPSFYGHS+VISRRNLTD+LY
Subjt:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY

Query:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD
        DSHKKYG V+KLWLGPMQLLVSVKEPALLKEILVKAEDKLP TGRAFRLAFGRSSLFASSFEKVQSRR WL EKLD ISFQ ANVIPAKAVDCSVGRIQD
Subjt:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD

Query:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR--------------------------------
        LM++ESIDC KVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR                                
Subjt:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR--------------------------------

Query:  ---------------PTPPQLMAKL--------PKRLLMKN----------------------GSLRRRLG------GKINSELNTVQKGSVKDPQKNVD
                         PP  +A++        P   +  N                       S+  RL        KINSELN   K SVKDPQ NVD
Subjt:  ---------------PTPPQLMAKL--------PKRLLMKN----------------------GSLRRRLG------GKINSELNTVQKGSVKDPQKNVD

Query:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF
        NMPLLLATIYESARLLP+G LLQRCSLKQDLVLKTGIT+PAGTLVVVPIKL+QMD+SSWGSDA+EFSPYRFLSMACNGTD  QRTS+AGEN G++G+SSF
Subjt:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF

Query:  VLNNPTGNVGFLPFGFGARSCVG
        VLN+PTGNV FLPFGFGARSCVG
Subjt:  VLNNPTGNVGFLPFGFGARSCVG

XP_011655532.1 tabersonine 16-hydroxylase 1 isoform X1 [Cucumis sativus] (e value: 3.7e-194; % identity: 70.17)
Query:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY
        MSS+CD  TP NVIGILLRS+SSRNC+S++KSRILRDFVTREVNAFLWFFL AITAVLISKVVALF+LWSKAK LPGP CPSFYGHS+VISRRNLTD+LY
Subjt:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY

Query:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD
        DSHKKYGPV+KLWLGPMQLLVSVKEPALLKEILVKAEDKLP TGRAFRLAFGRSSLFASSFEKVQSRR  L EKLD ISFQ  NVIPAKAVDCSVGRIQD
Subjt:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD

Query:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR--------------------------------
        LM++ESIDC KVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDAN+WASYRVTPFWK+GFWR                                
Subjt:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR--------------------------------

Query:  ---------------PTPPQLMAKLPKRLLMKN------------------------------GSLRRRLG------GKINSELNTVQKGSVKDPQKNVD
                         PP   A++                                       S+  RL        KIN ELN  QK SVKDPQ NVD
Subjt:  ---------------PTPPQLMAKLPKRLLMKN------------------------------GSLRRRLG------GKINSELNTVQKGSVKDPQKNVD

Query:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF
        NMPLLLATIYESARLLP+G LLQRCSLKQDLVLKTGIT+PAGTLVVVPIKL+QMD+SSWGSDA+EF+PYRFLSM CNGTDT Q+TS+AGEN  ++G++SF
Subjt:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF

Query:  VLNNPTGNVGFLPFGFGARSCVG
        VLN+PTGN  FLPFGFGARSCVG
Subjt:  VLNNPTGNVGFLPFGFGARSCVG

XP_022945455.1 cytochrome P450 714C3-like [Cucurbita moschata] (e value: 3.7e-194; % identity: 70.34)
Query:  MSSKCDG-LTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLL
        MSSKCDG  TP+NVIGILLR ESSRNCNS++ SRILRDFVTREVNAFLWF + AITAVLI+KVVALFRLWSKAK LPGPPCPSFYGHSEVISRRNLTDLL
Subjt:  MSSKCDG-LTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLL

Query:  YDSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQ
        YDSHK+YGPVVKLWLGPMQLLVSVKEPAL+KEIL+KAEDKLP TGR FR AFGRSSLFASSFEKVQ+RRQ L EKLD+ISF+  NV+PAKAVDCSVGR+Q
Subjt:  YDSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQ

Query:  DLMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWRPTPPQLMAKL---------------------
        DLMI+ESIDC KVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDA+ WASYRVTPFWKQGFWR    +L  KL                     
Subjt:  DLMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWRPTPPQLMAKL---------------------

Query:  -----------------------PKRLLMKNG-----------------------------------SLRRRLGG------KINSELNTVQKGSVKDPQK
                               P   +  NG                                   S+  RL        KINSELN+ +KGSVKD QK
Subjt:  -----------------------PKRLLMKNG-----------------------------------SLRRRLGG------KINSELNTVQKGSVKDPQK

Query:  NVDNMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGE
        NVDNMPLLLATIYESARLLPAG LLQRCSLKQDLVLKTGIT+PAGTLVVVP+KLVQMD +SWGS+ +EF+PYRFLS ACNGTDT+QRTSLAGENA +QGE
Subjt:  NVDNMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGE

Query:  SSFVLNNPTGNVGFLPFGFGARSCVG
        SSFVLN+PTG + FLPFGFGAR+CVG
Subjt:  SSFVLNNPTGNVGFLPFGFGARSCVG

XP_022957389.1 cytochrome P450 714C3-like [Cucurbita moschata] (e value: 4.1e-193; % identity: 70.17)
Query:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY
        MSSKCDG TPLNVI ILLRSESS+NC SEEKSRIL DFVTREVN FLWF L AIT VLI KVV LFRLWSKAKQLPGPP PSF GHS VISRRNLTDLLY
Subjt:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY

Query:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD
        DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKL  TGRAFRLAFGRSSLF SS EKVQ+RR+WL EKLDEI FQ ANV PAKAVDCSVGR+QD
Subjt:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD

Query:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR---------PTPPQLMAKLPKRLLMKNGSLRR
        +MI+ES+DC KVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMM+AKDA+ WASYRVTPFWKQGFWR              ++ +  K   + + S  +
Subjt:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR---------PTPPQLMAKLPKRLLMKNGSLRR

Query:  RLGG--------------------------------------------------------------------------KINSELNTVQKGSVKDPQKNVD
         L                                                                            KINSELN VQ+GSVKD QKNVD
Subjt:  RLGG--------------------------------------------------------------------------KINSELNTVQKGSVKDPQKNVD

Query:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF
        NMPLLLATIYESARLLPAG LLQRCSLKQDLVLKTGIT+PAGTLVVVP+KLVQMD+SSWGSDA +F+PYRFLS+ACNG  TSQRTSLAGENAG++GESSF
Subjt:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF

Query:  VLNNPTGNVGFLPFGFGARSCVG
        VLN+PTGNV FLPFGFGARSCVG
Subjt:  VLNNPTGNVGFLPFGFGARSCVG

XP_038892410.1 cytochrome P450 714C3-like [Benincasa hispida] (e value: 2.2e-202; % identity: 74.38)
Query:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY
        MSS C GLTPLNVIGILLRSESSRNCNS++KSRILRDFVTREVNAFLWFFL AITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHS+VISR NLTDLLY
Subjt:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY

Query:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD
        DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLP TGRAFRLAFGRSSLFASSFEKVQSRRQWL EKLDEISFQ + VIPAKAVD SV RIQD
Subjt:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD

Query:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR--------------------------------
        LMI+ESIDC KVSQHLAFTLLG TLFGDAFLGWSKATIYEELLMMIAKDAN+WASYRVTPFWKQGFWR                                
Subjt:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR--------------------------------

Query:  ---------------PTPPQLMAKLPKRL---------------------LMKNG---------SLRRRLG------GKINSELNTVQKGSVKDPQKNVD
                         PP   A++                         +M +G         S+  RL        KIN EL+T QKGSVKDPQKNVD
Subjt:  ---------------PTPPQLMAKLPKRL---------------------LMKNG---------SLRRRLG------GKINSELNTVQKGSVKDPQKNVD

Query:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF
        NMPLL ATIYESARLLPAG LLQRCSLKQDL LKTGIT+PAGTLVVVPIKLVQMD SSWGSDA+EFSPYRFLSMACNGT+ S+RTSLAGENAG+QGESSF
Subjt:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF

Query:  VLNNPTGNVGFLPFGFGARSCVG
        VLN+PTGNVGFLPFGFGARSCVG
Subjt:  VLNNPTGNVGFLPFGFGARSCVG

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BER0 2-hydroxyisoflavanone synthase-like (e value: 8.8e-202; % identity: 72.85)
Query:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY
        MSS+CDG TP NVIGILLR ESSRNCNSE+KSRILRDFVTREVNAFLWFFL AITAVLISKVVALF+LWSKAK LPGPPCPSFYGHS+VISRRNLTD+LY
Subjt:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY

Query:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD
        DSHKKYG V+KLWLGPMQLLVSVKEPALLKEILVKAEDKLP TGRAFRLAFGRSSLFASSFEKVQSRR WL EKLD ISFQ ANVIPAKAVDCSVGRIQD
Subjt:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD

Query:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR--------------------------------
        LM++ESIDC KVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR                                
Subjt:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR--------------------------------

Query:  ---------------PTPPQLMAKL--------PKRLLMKN----------------------GSLRRRLG------GKINSELNTVQKGSVKDPQKNVD
                         PP  +A++        P   +  N                       S+  RL        KINSELN   K SVKDPQ NVD
Subjt:  ---------------PTPPQLMAKL--------PKRLLMKN----------------------GSLRRRLG------GKINSELNTVQKGSVKDPQKNVD

Query:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF
        NMPLLLATIYESARLLP+G LLQRCSLKQDLVLKTGIT+PAGTLVVVPIKL+QMD+SSWGSDA+EFSPYRFLSMACNGTD  QRTS+AGEN G++G+SSF
Subjt:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF

Query:  VLNNPTGNVGFLPFGFGARSCVG
        VLN+PTGNV FLPFGFGARSCVG
Subjt:  VLNNPTGNVGFLPFGFGARSCVG

A0A5A7SU18 2-hydroxyisoflavanone synthase-like (e value: 8.8e-202; % identity: 72.85)
Query:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY
        MSS+CDG TP NVIGILLR ESSRNCNSE+KSRILRDFVTREVNAFLWFFL AITAVLISKVVALF+LWSKAK LPGPPCPSFYGHS+VISRRNLTD+LY
Subjt:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY

Query:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD
        DSHKKYG V+KLWLGPMQLLVSVKEPALLKEILVKAEDKLP TGRAFRLAFGRSSLFASSFEKVQSRR WL EKLD ISFQ ANVIPAKAVDCSVGRIQD
Subjt:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD

Query:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR--------------------------------
        LM++ESIDC KVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR                                
Subjt:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR--------------------------------

Query:  ---------------PTPPQLMAKL--------PKRLLMKN----------------------GSLRRRLG------GKINSELNTVQKGSVKDPQKNVD
                         PP  +A++        P   +  N                       S+  RL        KINSELN   K SVKDPQ NVD
Subjt:  ---------------PTPPQLMAKL--------PKRLLMKN----------------------GSLRRRLG------GKINSELNTVQKGSVKDPQKNVD

Query:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF
        NMPLLLATIYESARLLP+G LLQRCSLKQDLVLKTGIT+PAGTLVVVPIKL+QMD+SSWGSDA+EFSPYRFLSMACNGTD  QRTS+AGEN G++G+SSF
Subjt:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF

Query:  VLNNPTGNVGFLPFGFGARSCVG
        VLN+PTGNV FLPFGFGARSCVG
Subjt:  VLNNPTGNVGFLPFGFGARSCVG

A0A6J1G0Z6 cytochrome P450 714C3-like (e value: 1.8e-194; % identity: 70.34)
Query:  MSSKCDG-LTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLL
        MSSKCDG  TP+NVIGILLR ESSRNCNS++ SRILRDFVTREVNAFLWF + AITAVLI+KVVALFRLWSKAK LPGPPCPSFYGHSEVISRRNLTDLL
Subjt:  MSSKCDG-LTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLL

Query:  YDSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQ
        YDSHK+YGPVVKLWLGPMQLLVSVKEPAL+KEIL+KAEDKLP TGR FR AFGRSSLFASSFEKVQ+RRQ L EKLD+ISF+  NV+PAKAVDCSVGR+Q
Subjt:  YDSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQ

Query:  DLMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWRPTPPQLMAKL---------------------
        DLMI+ESIDC KVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDA+ WASYRVTPFWKQGFWR    +L  KL                     
Subjt:  DLMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWRPTPPQLMAKL---------------------

Query:  -----------------------PKRLLMKNG-----------------------------------SLRRRLGG------KINSELNTVQKGSVKDPQK
                               P   +  NG                                   S+  RL        KINSELN+ +KGSVKD QK
Subjt:  -----------------------PKRLLMKNG-----------------------------------SLRRRLGG------KINSELNTVQKGSVKDPQK

Query:  NVDNMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGE
        NVDNMPLLLATIYESARLLPAG LLQRCSLKQDLVLKTGIT+PAGTLVVVP+KLVQMD +SWGS+ +EF+PYRFLS ACNGTDT+QRTSLAGENA +QGE
Subjt:  NVDNMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGE

Query:  SSFVLNNPTGNVGFLPFGFGARSCVG
        SSFVLN+PTG + FLPFGFGAR+CVG
Subjt:  SSFVLNNPTGNVGFLPFGFGARSCVG

A0A6J1H1T3 cytochrome P450 714C3-like (e value: 2.0e-193; % identity: 70.17)
Query:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY
        MSSKCDG TPLNVI ILLRSESS+NC SEEKSRIL DFVTREVN FLWF L AIT VLI KVV LFRLWSKAKQLPGPP PSF GHS VISRRNLTDLLY
Subjt:  MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLY

Query:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD
        DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKL  TGRAFRLAFGRSSLF SS EKVQ+RR+WL EKLDEI FQ ANV PAKAVDCSVGR+QD
Subjt:  DSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQD

Query:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR---------PTPPQLMAKLPKRLLMKNGSLRR
        +MI+ES+DC KVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMM+AKDA+ WASYRVTPFWKQGFWR              ++ +  K   + + S  +
Subjt:  LMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWR---------PTPPQLMAKLPKRLLMKNGSLRR

Query:  RLGG--------------------------------------------------------------------------KINSELNTVQKGSVKDPQKNVD
         L                                                                            KINSELN VQ+GSVKD QKNVD
Subjt:  RLGG--------------------------------------------------------------------------KINSELNTVQKGSVKDPQKNVD

Query:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF
        NMPLLLATIYESARLLPAG LLQRCSLKQDLVLKTGIT+PAGTLVVVP+KLVQMD+SSWGSDA +F+PYRFLS+ACNG  TSQRTSLAGENAG++GESSF
Subjt:  NMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSF

Query:  VLNNPTGNVGFLPFGFGARSCVG
        VLN+PTGNV FLPFGFGARSCVG
Subjt:  VLNNPTGNVGFLPFGFGARSCVG

A0A6J1HSZ9 cytochrome P450 83A1-like (e value: 8.3e-192; % identity: 69.77)
Query:  MSSKCDG-LTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLL
        MSSKCDG  TP+NVIGILLR ESSRNCNS++ SRILRDFVTREVNAFLWF + AITAVLI+KVVALFRLWSKAK LPGPPCPSFYGHSEVISRRNLTDLL
Subjt:  MSSKCDG-LTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLL

Query:  YDSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQ
        YDSHKKYGPVVKLWLGPMQLLVSVKEPAL+KEIL+KAEDKLP TGR FR AFGRSSLFASSFEKVQ+RRQ L EKLD+ISF+ ANVIPAKAVD SVGR+Q
Subjt:  YDSHKKYGPVVKLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQ

Query:  DLMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWRPTPPQLMAKL---------------------
        DLMI+ESIDC KVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDA+ WASY+VTPFWKQ FWR    +L  KL                     
Subjt:  DLMIDESIDCIKVSQHLAFTLLGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWRPTPPQLMAKL---------------------

Query:  -----------------------PKRLLMKNG-----------------------------------SLRRRLGG------KINSELNTVQKGSVKDPQK
                               P   +  NG                                   S+  RL        KINSELN+ +KGSVKD QK
Subjt:  -----------------------PKRLLMKNG-----------------------------------SLRRRLGG------KINSELNTVQKGSVKDPQK

Query:  NVDNMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGE
        NVDNMPLLLATIYES RLLPAG LLQRCSLKQDLVLKTGIT+PAGTLVVVP+KLVQMD +SWGS+ +EF+PYRFLS ACNGTDT+Q+ SLAGENA +QGE
Subjt:  NVDNMPLLLATIYESARLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGE

Query:  SSFVLNNPTGNVGFLPFGFGARSCVG
        SSFVLN+PTG + FLPFGFGAR+CVG
Subjt:  SSFVLNNPTGNVGFLPFGFGARSCVG

SwissProt top hits (e value, % identity, alignment)
Q06445 Cysteine proteinase inhibitor (e value: 6.6e-29; % identity: 62.5)
Query:  STLGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPS
        + LGG  D  G  NS+++D LARFAV++HNKK+N+LLE+ RVV A++QVV+GTL+ +TLE  D G+KK+YEAKVW K W+NFKELQEFKH GD P+
Subjt:  STLGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPS
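
The % identity column can be related to the alignment rows. In BLAST-style output the middle line shows a letter where the two residues are identical, `+` for a conservative substitution, and a space otherwise. The Python sketch below counts the letters on the middle line of the Q06445 alignment above and divides by the row length. The function name `percent_identity` is hypothetical, and how the database itself derives the column is not stated on this page, so this is one plausible reading, not the site's documented method.

```python
def percent_identity(query_line, midline):
    """Return (identities, aligned_length, percent) for one alignment row."""
    # Letters on the midline mark identical residues; '+' marks a
    # conservative substitution and a space marks a mismatch or gap.
    identities = sum(1 for ch in midline if ch.isalpha())
    aligned_length = len(query_line)
    return identities, aligned_length, 100.0 * identities / aligned_length

# Query row and middle row copied from the Q06445 alignment.
query = ("STLGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLE"
         "VVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPS")
midline = ("+ LGG  D  G  NS+++D LARFAV++HNKK+N+LLE+ RVV A++QVV+GTL+ +TLE"
           "  D G+KK+YEAKVW K W+NFKELQEFKH GD P+")

identities, length, pct = percent_identity(query, midline)
print(identities, length, round(pct, 2))
```

For this hit the count comes to 60 identical positions out of 96, which agrees with the 62.5 listed. For alignments that wrap over several rows, the identities and lengths would be summed across rows before dividing.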

Q0JNR2 Cysteine proteinase inhibitor 12 (e value: 3.7e-80; % identity: 68.37)
Query:  ASAHFCAQQDDPLLMASTLGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKE
        A+A F        +    LGG HD+  A+NSV+ D LARFAVD+HNK+EN+LLE+VRVV+AKEQVVAGTLHHLTLE ++AG+KK+YEAKVWVK W++FKE
Subjt:  ASAHFCAQQDDPLLMASTLGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKE

Query:  LQEFKHAGDVPSITPSDLGAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHK
        LQEF++ GD  + T +DLGAKKG H  GWR+VP HDP V+DAA HAV++IQQRSNSL PYELLEI+ AKAEV+ED AKFD+L+KLKRGNKEEKFK EVHK
Subjt:  LQEFKHAGDVPSITPSDLGAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHK

Query:  NNEGNFLLNQMVQDH
        N EG F+LNQM Q+H
Subjt:  NNEGNFLLNQMVQDH

Q41906 Cysteine proteinase inhibitor 3 (e value: 4.0e-26; % identity: 50.46)
Query:  GNASAHFCAQQDDPLLMASTLGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNF
        G      C  ++        LGGVHD +G  NS +++ LARFA+ +HNK++N +LE+ ++VKA+EQVVAGT++HLTLE  +  + K +EAKVWVK WMNF
Subjt:  GNASAHFCAQQDDPLLMASTLGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNF

Query:  KELQEFKHA
        K+LQEFK +
Subjt:  KELQEFKHA

Q8H0X6 Cysteine proteinase inhibitor 6 (e value: 5.2e-74; % identity: 70.56)
Query:  LGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPSITPSDL
        +GGV D     NS +V+ LARFAVD+HNKKEN+LLE+ RVVKAKEQVVAGTLHHLTLE+++AG+KKLYEAKVWVK W+NFKELQEFK A D P+IT SDL
Subjt:  LGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPSITPSDL

Query:  GAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHKNNEGNFLLNQMVQDH
        G K+G+H  GWREVP  DP V+  A+ AV+TIQQRSNSL PYELLE++HAKAEV  +AAK+++LLKLKRG KEEKFKVEVHKN+EG   LN   Q H
Subjt:  GAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHKNNEGNFLLNQMVQDH

Q8LC76 Cysteine proteinase inhibitor 7 (e value: 1.5e-52; % identity: 56.61)
Query:  LGGVHDSQGASN-SVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPSITPSD
        LGG  DS+   N   ++D++A FAV +HN++EN++LE  RV+KA EQVVAG L+ LTLEV++AG+KK+YEAKVWVK WMNFK+LQEFK+   +PS T SD
Subjt:  LGGVHDSQGASN-SVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPSITPSD

Query:  LGAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHKNNEGNF
        LG K   +   WR V  ++P VQ+AA+HA++++QQ+SNSL PY+L++II A+A+V+E+  KF+LLLKL+RGNK EKF VEV K+  G +
Subjt:  LGAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHKNNEGNF

Arabidopsis top hits (e value, % identity, alignment)
AT2G40880.1 cystatin A (e value: 2.9e-27; % identity: 50.46)
Query:  GNASAHFCAQQDDPLLMASTLGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNF
        G      C  ++        LGGVHD +G  NS +++ LARFA+ +HNK++N +LE+ ++VKA+EQVVAGT++HLTLE  +  + K +EAKVWVK WMNF
Subjt:  GNASAHFCAQQDDPLLMASTLGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNF

Query:  KELQEFKHA
        K+LQEFK +
Subjt:  KELQEFKHA

AT3G12490.1 cystatin B (e value: 3.7e-75; % identity: 70.56)
Query:  LGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPSITPSDL
        +GGV D     NS +V+ LARFAVD+HNKKEN+LLE+ RVVKAKEQVVAGTLHHLTLE+++AG+KKLYEAKVWVK W+NFKELQEFK A D P+IT SDL
Subjt:  LGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPSITPSDL

Query:  GAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHKNNEGNFLLNQMVQDH
        G K+G+H  GWREVP  DP V+  A+ AV+TIQQRSNSL PYELLE++HAKAEV  +AAK+++LLKLKRG KEEKFKVEVHKN+EG   LN   Q H
Subjt:  GAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHKNNEGNFLLNQMVQDH

AT3G12490.2 cystatin B (e value: 3.7e-75; % identity: 70.56)
Query:  LGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPSITPSDL
        +GGV D     NS +V+ LARFAVD+HNKKEN+LLE+ RVVKAKEQVVAGTLHHLTLE+++AG+KKLYEAKVWVK W+NFKELQEFK A D P+IT SDL
Subjt:  LGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPSITPSDL

Query:  GAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHKNNEGNFLLNQMVQDH
        G K+G+H  GWREVP  DP V+  A+ AV+TIQQRSNSL PYELLE++HAKAEV  +AAK+++LLKLKRG KEEKFKVEVHKN+EG   LN   Q H
Subjt:  GAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHKNNEGNFLLNQMVQDH

AT5G05110.1 Cystatin/monellin family protein (e value: 1.0e-53; % identity: 56.61)
Query:  LGGVHDSQGASN-SVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPSITPSD
        LGG  DS+   N   ++D++A FAV +HN++EN++LE  RV+KA EQVVAG L+ LTLEV++AG+KK+YEAKVWVK WMNFK+LQEFK+   +PS T SD
Subjt:  LGGVHDSQGASN-SVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDVPSITPSD

Query:  LGAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHKNNEGNF
        LG K   +   WR V  ++P VQ+AA+HA++++QQ+SNSL PY+L++II A+A+V+E+  KF+LLLKL+RGNK EKF VEV K+  G +
Subjt:  LGAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHKNNEGNF

AT5G12140.1 cystatin-1 (e value: 5.6e-23; % identity: 56.52)
Query:  LGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDV
        +GGV D    +N + V+ LARFAVD+HNK EN  LEY R++ AK QVVAGT+HHLT+EV D    K+YEAKV  K+W N K+L+ F H  DV
Subjt:  LGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGDV


Sequences
CDS sequence
ATGAGCTCCAAATGCGACGGCCTTACTCCTCTCAACGTCATCGGAATTCTTCTCCGATCGGAGTCTTCCCGGAACTGTAATTCCGAAGAGAAGTCCCGGATTCTTAGAGA
TTTCGTTACTCGCGAAGTCAATGCGTTCCTCTGGTTCTTTCTCTTCGCTATCACTGCGGTTCTGATCAGCAAGGTTGTTGCTCTCTTCAGACTGTGGTCGAAGGCGAAGC
AACTCCCTGGACCTCCTTGTCCGTCTTTCTACGGTCACTCCGAGGTTATTTCGCGCCGGAATCTCACTGATCTGTTATATGACTCTCATAAAAAATATGGACCAGTCGTC
AAGCTATGGTTGGGTCCCATGCAGCTCTTAGTGTCTGTGAAGGAGCCAGCTCTTCTCAAAGAAATTCTGGTAAAGGCCGAGGATAAATTGCCTTTTACAGGAAGGGCATT
CAGATTGGCATTTGGGCGTTCAAGTCTCTTTGCTTCCTCTTTTGAGAAGGTGCAAAGCAGAAGACAATGGTTAGAAGAAAAATTAGATGAAATATCGTTTCAGAGTGCTA
ATGTTATTCCTGCAAAGGCTGTGGATTGTTCCGTAGGGAGAATACAAGATCTTATGATTGATGAAAGTATAGATTGTATTAAGGTTTCTCAACATTTGGCTTTTACATTG
TTAGGGTGCACACTTTTCGGGGATGCCTTTTTGGGTTGGTCTAAGGCAACCATCTATGAGGAACTTCTGATGATGATTGCAAAAGATGCCAATATTTGGGCCTCCTACAG
AGTTACTCCTTTCTGGAAACAAGGATTCTGGCGACCTACACCTCCACAGTTAATGGCCAAGCTCCCCAAACGCCTGCTTATGAAGAATGGATCACTAAGGAGAAGGCTTG
GGGGTAAGATCAACTCAGAACTAAATACAGTACAAAAGGGCTCGGTGAAAGATCCCCAGAAAAATGTTGATAACATGCCACTTCTGTTGGCAACTATATATGAATCTGCT
CGTCTTCTGCCAGCAGGGTCTCTGTTACAAAGATGTTCACTCAAACAAGATTTGGTCCTAAAGACTGGGATAACTGTACCAGCCGGAACATTAGTTGTTGTACCTATAAA
ATTGGTACAGATGGATACTTCAAGTTGGGGAAGTGATGCCAGTGAGTTTAGTCCCTACCGTTTTCTATCAATGGCCTGTAATGGGACTGATACGAGTCAGCGGACATCAC
TTGCAGGTGAAAATGCTGGGAATCAAGGAGAGAGCTCATTTGTTTTGAACAATCCAACCGGCAATGTTGGGTTTCTTCCCTTTGGCTTTGGTGCACGTTCTTGTGTTGGT
AACGCATCAGCTCACTTTTGCGCTCAACAGGACGATCCACTGCTCATGGCGTCCACACTTGGAGGCGTTCACGATTCTCAGGGAGCTTCCAACTCCGTCGATGTCGATGA
ACTCGCTCGTTTCGCCGTCGATAAACACAATAAGAAAGAGAATTCACTTCTTGAGTATGTGAGAGTCGTGAAGGCGAAAGAGCAGGTAGTAGCTGGTACACTGCACCATC
TTACTCTTGAAGTTGTTGATGCTGGTAAAAAGAAGCTGTATGAAGCTAAGGTCTGGGTGAAGTCATGGATGAATTTTAAGGAATTGCAAGAGTTCAAGCATGCAGGCGAT
GTCCCCTCAATTACTCCTTCAGATCTTGGTGCAAAGAAAGGTGATCATCCCCAAGGATGGCGAGAAGTGCCACCACATGATCCTCATGTTCAGGATGCAGCACAGCATGC
TGTTCGAACCATCCAGCAGAGATCTAATTCTCTACTCCCATATGAACTGCTGGAGATCATACATGCCAAGGCAGAGGTGATTGAAGATGCTGCAAAGTTTGATTTGCTCC
TAAAGCTGAAGAGAGGGAATAAAGAAGAGAAGTTCAAAGTGGAGGTCCACAAGAACAATGAAGGTAACTTCCTTCTGAATCAGATGGTGCAAGATCATTCCTAA
mRNA sequence
ATGAGCTCCAAATGCGACGGCCTTACTCCTCTCAACGTCATCGGAATTCTTCTCCGATCGGAGTCTTCCCGGAACTGTAATTCCGAAGAGAAGTCCCGGATTCTTAGAGA
TTTCGTTACTCGCGAAGTCAATGCGTTCCTCTGGTTCTTTCTCTTCGCTATCACTGCGGTTCTGATCAGCAAGGTTGTTGCTCTCTTCAGACTGTGGTCGAAGGCGAAGC
AACTCCCTGGACCTCCTTGTCCGTCTTTCTACGGTCACTCCGAGGTTATTTCGCGCCGGAATCTCACTGATCTGTTATATGACTCTCATAAAAAATATGGACCAGTCGTC
AAGCTATGGTTGGGTCCCATGCAGCTCTTAGTGTCTGTGAAGGAGCCAGCTCTTCTCAAAGAAATTCTGGTAAAGGCCGAGGATAAATTGCCTTTTACAGGAAGGGCATT
CAGATTGGCATTTGGGCGTTCAAGTCTCTTTGCTTCCTCTTTTGAGAAGGTGCAAAGCAGAAGACAATGGTTAGAAGAAAAATTAGATGAAATATCGTTTCAGAGTGCTA
ATGTTATTCCTGCAAAGGCTGTGGATTGTTCCGTAGGGAGAATACAAGATCTTATGATTGATGAAAGTATAGATTGTATTAAGGTTTCTCAACATTTGGCTTTTACATTG
TTAGGGTGCACACTTTTCGGGGATGCCTTTTTGGGTTGGTCTAAGGCAACCATCTATGAGGAACTTCTGATGATGATTGCAAAAGATGCCAATATTTGGGCCTCCTACAG
AGTTACTCCTTTCTGGAAACAAGGATTCTGGCGACCTACACCTCCACAGTTAATGGCCAAGCTCCCCAAACGCCTGCTTATGAAGAATGGATCACTAAGGAGAAGGCTTG
GGGGTAAGATCAACTCAGAACTAAATACAGTACAAAAGGGCTCGGTGAAAGATCCCCAGAAAAATGTTGATAACATGCCACTTCTGTTGGCAACTATATATGAATCTGCT
CGTCTTCTGCCAGCAGGGTCTCTGTTACAAAGATGTTCACTCAAACAAGATTTGGTCCTAAAGACTGGGATAACTGTACCAGCCGGAACATTAGTTGTTGTACCTATAAA
ATTGGTACAGATGGATACTTCAAGTTGGGGAAGTGATGCCAGTGAGTTTAGTCCCTACCGTTTTCTATCAATGGCCTGTAATGGGACTGATACGAGTCAGCGGACATCAC
TTGCAGGTGAAAATGCTGGGAATCAAGGAGAGAGCTCATTTGTTTTGAACAATCCAACCGGCAATGTTGGGTTTCTTCCCTTTGGCTTTGGTGCACGTTCTTGTGTTGGT
AACGCATCAGCTCACTTTTGCGCTCAACAGGACGATCCACTGCTCATGGCGTCCACACTTGGAGGCGTTCACGATTCTCAGGGAGCTTCCAACTCCGTCGATGTCGATGA
ACTCGCTCGTTTCGCCGTCGATAAACACAATAAGAAAGAGAATTCACTTCTTGAGTATGTGAGAGTCGTGAAGGCGAAAGAGCAGGTAGTAGCTGGTACACTGCACCATC
TTACTCTTGAAGTTGTTGATGCTGGTAAAAAGAAGCTGTATGAAGCTAAGGTCTGGGTGAAGTCATGGATGAATTTTAAGGAATTGCAAGAGTTCAAGCATGCAGGCGAT
GTCCCCTCAATTACTCCTTCAGATCTTGGTGCAAAGAAAGGTGATCATCCCCAAGGATGGCGAGAAGTGCCACCACATGATCCTCATGTTCAGGATGCAGCACAGCATGC
TGTTCGAACCATCCAGCAGAGATCTAATTCTCTACTCCCATATGAACTGCTGGAGATCATACATGCCAAGGCAGAGGTGATTGAAGATGCTGCAAAGTTTGATTTGCTCC
TAAAGCTGAAGAGAGGGAATAAAGAAGAGAAGTTCAAAGTGGAGGTCCACAAGAACAATGAAGGTAACTTCCTTCTGAATCAGATGGTGCAAGATCATTCCTAA
Protein sequence
MSSKCDGLTPLNVIGILLRSESSRNCNSEEKSRILRDFVTREVNAFLWFFLFAITAVLISKVVALFRLWSKAKQLPGPPCPSFYGHSEVISRRNLTDLLYDSHKKYGPVV
KLWLGPMQLLVSVKEPALLKEILVKAEDKLPFTGRAFRLAFGRSSLFASSFEKVQSRRQWLEEKLDEISFQSANVIPAKAVDCSVGRIQDLMIDESIDCIKVSQHLAFTL
LGCTLFGDAFLGWSKATIYEELLMMIAKDANIWASYRVTPFWKQGFWRPTPPQLMAKLPKRLLMKNGSLRRRLGGKINSELNTVQKGSVKDPQKNVDNMPLLLATIYESA
RLLPAGSLLQRCSLKQDLVLKTGITVPAGTLVVVPIKLVQMDTSSWGSDASEFSPYRFLSMACNGTDTSQRTSLAGENAGNQGESSFVLNNPTGNVGFLPFGFGARSCVG
NASAHFCAQQDDPLLMASTLGGVHDSQGASNSVDVDELARFAVDKHNKKENSLLEYVRVVKAKEQVVAGTLHHLTLEVVDAGKKKLYEAKVWVKSWMNFKELQEFKHAGD
VPSITPSDLGAKKGDHPQGWREVPPHDPHVQDAAQHAVRTIQQRSNSLLPYELLEIIHAKAEVIEDAAKFDLLLKLKRGNKEEKFKVEVHKNNEGNFLLNQMVQDHS
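
As a consistency check, the ends of the CDS can be translated and compared with the ends of the protein sequence. The Python sketch below builds the standard genetic code table and translates the first six and last five codons of the CDS listed above. The names `CODON_TABLE` and `translate` are hypothetical, and use of the standard nuclear code is an assumption, although it is the usual choice for a plant nuclear gene.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGAGCTCCAAATGCGAC"))  # first six codons of the CDS
print(translate("CAAGATCATTCCTAA"))     # last five codons, ending in the stop codon
```

The two calls give `MSSKCD` and `QDHS`, matching the start and end of the protein sequence. Passing the full CDS, with its lines joined into one string, would extend the same check to the whole record.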