CuGenDBv2

HG10016546 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10016546
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Glycosyltransferase
Genome location: Chr03:5821866..5827789
RNA-Seq Expression: HG10016546
Synteny: HG10016546
Gene Ontology terms:
GO:0016787 - hydrolase activity (molecular function)
GO:0080043 - quercetin 3-O-glucosyltransferase activity (molecular function)
GO:0080044 - quercetin 7-O-glucosyltransferase activity (molecular function)
InterPro domains:
IPR006439 - HAD hydrolase, subfamily IA
IPR023198 - Phosphoglycolate phosphatase-like, domain 2
IPR023214 - HAD superfamily
IPR036412 - HAD-like superfamily
IPR041492 - Haloacid dehalogenase-like hydrolase
IPR044999 - Protein CbbY-like
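
The genome location listed above is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the span covered by the gene model could be computed as follows. The names `parse_location` and `span_length` are hypothetical helpers for illustration and are not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'Chr03:5821866..5827789' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(location):
    """Length of the interval, ASSUMING 1-based inclusive coordinates (hence the +1)."""
    _, start, end = parse_location(location)
    return end - start + 1

print(span_length("Chr03:5821866..5827789"))  # 5924 under the inclusive assumption
```

If the coordinates were instead 0-based and half-open, the `+ 1` would have to be dropped.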


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAF5960478.1 hypothetical protein HYC85_001687 [Camellia sinensis] (e value: 9.4e-203; % identity: 57.47)
Query:  ITTMLVRPNLIPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSL-----SVSAALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTS---
        + TML+ P   P SS      L   L  + +R  T+  +T   +  S+     S S+ALQALIFDCDGVILESEHLHRQAYNDAF+HF+VRCP+S S   
Subjt:  ITTMLVRPNLIPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSL-----SVSAALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTS---

Query:  QPLNWSIEFYDELQNRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVL
        QPLNW   FYD LQNRIGGGKPKMRWYFKE+GWPSSTI E  PEDD DRA LID LQ                                           
Subjt:  QPLNWSIEFYDELQNRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVL

Query:  RLMDEAKSAGRKLAVCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTS
                 G+KLAVCSAATKSSV+LCLENLIGI+RF++LDCFLAGDDVKEKKPDPSIY+TA+KKLG+S K+CLVVEDS+IGLQAAT AGM CVITYT+S
Subjt:  RLMDEAKSAGRKLAVCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTS

Query:  TANQDFKEAIATYPDLSD---ISFSSSFIEMSKSVAD------------------KPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHIT
        T+NQDFK+AIA YPDLS+   +  +  +++++K +                    KPH VC+PYP QGH  PL+ LA+LL+S GFH+TFV  EF H  + 
Subjt:  TANQDFKEAIATYPDLSD---ISFSSSFIEMSKSVAD------------------KPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHIT

Query:  QSHG--TNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACG
        +S G  ++ V+GL  F+F TIPDGLPPS+  A+ DVP LCDS R+N L PF  L+  LNS+ EVP V+CI++DGV+SFAI+AA+E+GIPE+Q WTASAC 
Subjt:  QSHG--TNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACG

Query:  FMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQI
        FMGYLH+ ELI+RGI PFKDE F+SDGTLDT +DWIPGM+NIRL+DLPS +RTTN   IMFDFMG EA+NC+++ AIIFNTFD  E +VLEAI +KFP+I
Subjt:  FMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQI

Query:  YTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVYSKL
        YT+GPL LL +   +S + +L  S+WKED +CLEWLD +   SVVY +L
Subjt:  YTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVYSKL

XP_022950861.1 linamarin synthase 2-like [Cucurbita moschata] (e value: 1.9e-155; % identity: 90.54)
Query:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP
        MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHI  S G +V++GL DFQFR IPDGLPPSERKASPDVPTLCDSTRRNFLNP
Subjt:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP

Query:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF
        FKQLVAGLNSSVEVPSV+CIIADGVLSFAIKAA ELGIPE+QFWTASACGFMGYL+FDELIQRGILPFKDETFLSDGTLDT ++WIPGMKNIRLRDLPSF
Subjt:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF

Query:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
        IRTTN +D+MFDFMGSE RNC+ SSAIIFNTFDELEHD LEAIS KFPQIYTVGPLSLLSRE TESHLK LRLSVWKEDQ+CLEWLDTQAP+SVVY
Subjt:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

XP_023545186.1 linamarin synthase 2-like [Cucurbita pepo subsp. pepo] (e value: 1.1e-155; % identity: 90.54)
Query:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP
        MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHI  S G  V++GL DFQFR IPDGLPPSERKASPDVPTLCDSTRRNF NP
Subjt:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP

Query:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF
        FKQLVAGLNSSVEVPSV+CIIADGVLSFAIKAAEELGIPE+QFWTAS CGFMGYL+FDELIQRGILPFKDETFLSDGTLDT ++WIPGMKNIRLRDLPSF
Subjt:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF

Query:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
        IRTTN +D+MFDFMGSE RNC+ SSAIIFNTFDELEHDVLEAIS KFPQIYTVGPLSLLSRE TESHLK LRLSVWKEDQ+CLEWLDTQAP+SVVY
Subjt:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

XP_038881808.1 linamarin synthase 2-like [Benincasa hispida] (e value: 2.8e-162; % identity: 94.93)
Query:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP
        MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHI QSHG N V+GL DFQFRTIPDGLPPSERKASPDVPTLCDSTRRN LNP
Subjt:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP

Query:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF
        FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDEL++RGILPFKDETFLSDG+LDTSVDWIPGMKNIRLRDLPSF
Subjt:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF

Query:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
        IRTTN +D MFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYT+GPLSLLSREATESHLK LRLSVWKEDQQCLEWLDTQAP+SVVY
Subjt:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

XP_038881810.1 haloacid dehalogenase-like hydrolase domain-containing protein At4g39970 [Benincasa hispida] (e value: 2.5e-155; % identity: 91.08)
Query:  MASITTMLVRPNLIPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSLSVSAALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTSQPLNW
        MASITTMLVRPNL+PSSSDLVSPPL R LPPAAIRFRTSRKST THKRF LSVSAALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTS+PLNW
Subjt:  MASITTMLVRPNLIPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSLSVSAALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTSQPLNW

Query:  SIEFYDELQNRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVLRLMDE
        SIEFYDELQNRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQ                    DWKTERYKEIIKSGTVNPRPGVLRLMDE
Subjt:  SIEFYDELQNRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVLRLMDE

Query:  AKSAGRKLAVCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTSTANQD
        AKSAGRKLAVCSAATKSSVILCLEN IGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTSTANQD
Subjt:  AKSAGRKLAVCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTSTANQD

Query:  FKEAIATYPDLSDI
        FKEAIATYPDLSD+
Subjt:  FKEAIATYPDLSDI
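
The middle line of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how the % identity column relates to that line, the columns of a midline can be tallied as below. The helper `midline_stats` is hypothetical, and the input must be exactly the aligned columns (without the leading indentation shown on the page); the figure reported for a hit is computed over the whole alignment, not over a single block.

```python
def midline_stats(midline):
    """Return (identical, positive, length) for one BLAST-style midline string.

    identical: columns holding a letter (same residue in query and subject)
    positive:  identical columns plus '+' columns (conservative substitutions)
    length:    total number of alignment columns passed in
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = identical + midline.count("+")
    return identical, positive, len(midline)

# Midline of the last block of the XP_038881810.1 alignment shown above.
identical, positive, length = midline_stats("FKEAIATYPDLSD+")
print(f"{identical}/{length} identical, {positive}/{length} positive")
```

Summing the three counts over every block of a hit and dividing identical by length gives a percentage that can be compared with the listed value.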

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KG87 Glycosyltransferase (e value: 3.0e-154; % identity: 89.86)
Query:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP
        MSK+VA+KPHAVCIPYPEQGHTLPLLQLAKLLHSTG HITFVI EFYHDHI QSHG NVV+ L+DFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFL+P
Subjt:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP

Query:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF
        FK+LVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASAC FMGYLHFDELI+R ILPFKDETFL DG LDTSVDWIPGM+NIRLRDLPSF
Subjt:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF

Query:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
        IRTTN +D MFDFMGSEARNCMRSS IIFNTFDELEHDVLEAISAKFPQIY +GPLS+ SREA+E+HLK LRLSVWKEDQQCL WLDTQAP+SVVY
Subjt:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

A0A1S3B1H7 haloacid dehalogenase-like hydrolase domain-containing protein At4g39970 (e value: 5.3e-151; % identity: 89.49)
Query:  MASITTMLVRPNLIPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSLSVSAALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTSQPLNW
        MASI T LVRPNLIPSSSDLVSP L   LPP+AIRFRT RKST THKRFSLSVSAALQALIFDCDGVILESEHLHRQAYNDAF HFDVRCPNSTSQPLNW
Subjt:  MASITTMLVRPNLIPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSLSVSAALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTSQPLNW

Query:  SIEFYDELQNRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVLRLMDE
        SIEFYDELQNRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDE+RAKLIDILQ                    DWKTERYKEIIKSGTV+PRPGVLRLMDE
Subjt:  SIEFYDELQNRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVLRLMDE

Query:  AKSAGRKLAVCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTSTANQD
        AKSAGRKLAVCSAATKSSVILCLENLIG DRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTSTANQD
Subjt:  AKSAGRKLAVCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTSTANQD

Query:  FKEAIATYPDLSDI
        FKEAIATYPDLSDI
Subjt:  FKEAIATYPDLSDI

A0A6J1GFZ4 Glycosyltransferase (e value: 9.3e-156; % identity: 90.54)
Query:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP
        MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHI  S G +V++GL DFQFR IPDGLPPSERKASPDVPTLCDSTRRNFLNP
Subjt:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP

Query:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF
        FKQLVAGLNSSVEVPSV+CIIADGVLSFAIKAA ELGIPE+QFWTASACGFMGYL+FDELIQRGILPFKDETFLSDGTLDT ++WIPGMKNIRLRDLPSF
Subjt:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF

Query:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
        IRTTN +D+MFDFMGSE RNC+ SSAIIFNTFDELEHD LEAIS KFPQIYTVGPLSLLSRE TESHLK LRLSVWKEDQ+CLEWLDTQAP+SVVY
Subjt:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

A0A6J1IMT4 Glycosyltransferase (e value: 7.9e-155; % identity: 89.86)
Query:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP
        MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHS GFHITFVIPEFYHDHI  S G +V++GL DFQFR IPDGLPPSERKASPDVPTLCDSTRRNFLNP
Subjt:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP

Query:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF
        FKQLVAGLNSSVEVPS++CIIADGVLSFAIKAAEELGIPE+QFWTASACGFMGYL+FDELIQRGILPFKDETFLSDGTLDT ++WIPGMKNIRLRDLPSF
Subjt:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF

Query:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
        IRTTN +D+MFDFMGSE RNC  SSAIIFNTFDELEHDVLEAIS KFPQIYTVGPLSLLS+E TESHLK LRLSVWKEDQ+C EWLDTQAP+SVVY
Subjt:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

A0A7J7I8I5 Uncharacterized protein (e value: 4.6e-203; % identity: 57.47)
Query:  ITTMLVRPNLIPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSL-----SVSAALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTS---
        + TML+ P   P SS      L   L  + +R  T+  +T   +  S+     S S+ALQALIFDCDGVILESEHLHRQAYNDAF+HF+VRCP+S S   
Subjt:  ITTMLVRPNLIPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSL-----SVSAALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTS---

Query:  QPLNWSIEFYDELQNRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVL
        QPLNW   FYD LQNRIGGGKPKMRWYFKE+GWPSSTI E  PEDD DRA LID LQ                                           
Subjt:  QPLNWSIEFYDELQNRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVL

Query:  RLMDEAKSAGRKLAVCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTS
                 G+KLAVCSAATKSSV+LCLENLIGI+RF++LDCFLAGDDVKEKKPDPSIY+TA+KKLG+S K+CLVVEDS+IGLQAAT AGM CVITYT+S
Subjt:  RLMDEAKSAGRKLAVCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTS

Query:  TANQDFKEAIATYPDLSD---ISFSSSFIEMSKSVAD------------------KPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHIT
        T+NQDFK+AIA YPDLS+   +  +  +++++K +                    KPH VC+PYP QGH  PL+ LA+LL+S GFH+TFV  EF H  + 
Subjt:  TANQDFKEAIATYPDLSD---ISFSSSFIEMSKSVAD------------------KPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHIT

Query:  QSHG--TNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACG
        +S G  ++ V+GL  F+F TIPDGLPPS+  A+ DVP LCDS R+N L PF  L+  LNS+ EVP V+CI++DGV+SFAI+AA+E+GIPE+Q WTASAC 
Subjt:  QSHG--TNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACG

Query:  FMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQI
        FMGYLH+ ELI+RGI PFKDE F+SDGTLDT +DWIPGM+NIRL+DLPS +RTTN   IMFDFMG EA+NC+++ AIIFNTFD  E +VLEAI +KFP+I
Subjt:  FMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQI

Query:  YTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVYSKL
        YT+GPL LL +   +S + +L  S+WKED +CLEWLD +   SVVY +L
Subjt:  YTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVYSKL

SwissProt top hits (columns: e value, % identity, alignment)
F8WKW1 7-deoxyloganetin glucosyltransferase (e value: 1.3e-98; % identity: 55.89)
Query:  SKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPF
        S S+ +K HAVCIPYP QGH  P+L+LAK+LH  GFHITFV  EF H  + +S G + + GL DFQF+TIPDGLPPS+  A+ D+P+LC+ST    L+PF
Subjt:  SKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPF

Query:  KQLVAGLN--SSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPS
        + L+A LN  SS +VP V+CI++DGV+SF ++AA ELG+PEI FWT SACGF+GY+H+ +LI++G+ P KD ++LS+G L+ S+DWIPGMK+IRL+DLPS
Subjt:  KQLVAGLN--SSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPS

Query:  FIRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
        F+RTTN +D M  F+  E     ++SAII NTF ELE DV+ A+SA  P IYT+GPL  L +E  +  L  L  ++WKE+ +CL+WLD++ P SVVY
Subjt:  FIRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

F8WLS6 7-deoxyloganetin glucosyltransferase (e value: 1.2e-94; % identity: 53.54)
Query:  SKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPF
        S   + KPHAVCIPYP QGH  P+L+LAKLLH  GFHITFV  EF H  + +S G++ ++GL  FQF+TIPDGLPPS+  A+ D+P+LC+ST  + L PF
Subjt:  SKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPF

Query:  KQLVAGLN--SSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPS
        KQL+  LN  SS EVP V+C+++D V+SF I AA+EL IPE+ FWT SACG +GY+H+ +LI +G+ P KD ++ S+G LD  +DWIPGM+ IRLRDLP+
Subjt:  KQLVAGLN--SSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPS

Query:  FIRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
        F+RTTN ++ M  F+  E     ++SAI+ NTF ELE +V++++S   P IY +GPL +L  +  +  LK L  ++WKE+ +CLEWLDT+ P SVVY
Subjt:  FIRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

G3FIN8 Linamarin synthase 1 (e value: 1.9e-97; % identity: 59.17)
Query:  PHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLVAGL
        PHA+ +PYP QGH  PL+QL KLLH+ GF+ITFV  E  H  + +S G   ++GL DF+F  IPDGLP ++R A+  VP+L DSTR++ L PF  L+A L
Subjt:  PHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLVAGL

Query:  NSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTNTED
         +S +VP +TCII+DGV++FAI AA   GI EIQFWT SACGFM YLH  EL++RGI+PFKDE+FL DGTLD  VD+IPGM N++LRD+PSFIR T+  D
Subjt:  NSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTNTED

Query:  IMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQ-IYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
        IMFDF+GSEA   +++ AII NTFDELE +VL+AI+A++ + IYTVGP  LL +   E   KA R S+WKED  CLEWLD + P SVVY
Subjt:  IMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQ-IYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

G3FIN9 Linamarin synthase 2 (e value: 7.7e-99; % identity: 59.86)
Query:  PHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLVAGL
        PHAV +PYP QGH  PL+QL KLLHS GF+ITFV  E  H  + +S G   ++GL DF+F  IPDGLP ++R A+  VP+L DSTR++ L PF  L+A L
Subjt:  PHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLVAGL

Query:  NSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTNTED
         +S +VP +TCII+DGV++FAI AA   GIPEIQFWT SACGFM YLH  EL++RGI+PFKDE+FL DGTLD  VD+IPGM N++LRD+PSFIR T+  D
Subjt:  NSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTNTED

Query:  IMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQ-IYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
        IMFDFMGSEA   +++ AII NT+DELE +VL+AI+A++ + IYTVGP  LL +   E   KA R S+WKED  C+EWLD + P SVVY
Subjt:  IMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQ-IYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

Q680K2 Haloacid dehalogenase-like hydrolase domain-containing protein At4g39970 (e value: 1.9e-113; % identity: 69.18)
Query:  IPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSLSVSA----ALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTSQPLNWSIEFYDELQ
        + SSS L+  P  RF     +RF++  +S  +  R S  VSA    +L+ALIFDCDGVILESE+LHRQAYNDAF+HFDVRCP S+S+ L+WS+EFYD+ Q
Subjt:  IPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSLSVSA----ALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTSQPLNWSIEFYDELQ

Query:  NRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVLRLMDEAKSAGRKLA
        N +GGGKPKMRWYFKENGWP+STIF+  P++D+DRAKLID LQ                    DWKTERYKEIIKSG+V PRPGV+RLMDEAK+AG+KLA
Subjt:  NRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVLRLMDEAKSAGRKLA

Query:  VCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTSTANQDFKEAIATYP
        VCSAATKSSVILCLENLI I+RFQ LDCFLAGDDVKEKKPDPSIYITA++KLGVS KDCLVVEDSVIGLQAATKAGM CVITYT+ST++Q+F +AIA YP
Subjt:  VCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTSTANQDFKEAIATYP

Query:  DLSDI
        DLS++
Subjt:  DLSDI

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G22340.1 UDP-glucosyl transferase 85A7 (e value: 3.0e-90; % identity: 49.49)
Query:  ADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLV
        A KPH VC+PYP QGH  P+L++AKLL++ GFH+TFV   + H+ + +S G N ++G   F+F +IPDGLP ++   +   PT+C S  +N L PFK+++
Subjt:  ADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLV

Query:  AGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTN
          +N   +VP V+CI++DGV+SF + AAEELG+PE+ FWT SACGFM  LHF   I++G+ PFKDE+++S   LDT +DWIP MKN+RL+D+PS+IRTTN
Subjt:  AGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTN

Query:  TEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATE--SHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
         ++IM +F+  E     R+SAII NTFDELEHDV++++ +  P +Y++GPL LL +E     S +  + L++W+E+ +CL+WLDT+ P SV++
Subjt:  TEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATE--SHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

AT1G22360.1 UDP-glucosyl transferase 85A2 (e value: 5.5e-92; % identity: 49.66)
Query:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP
        M   VA K H VC+PYP QGH  P++++AKLL++ GFHITFV   + H+ + +S G N V+GL  F+F +IPDGLP ++   + D+PTLC+ST ++ L P
Subjt:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP

Query:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF
        FK+L+  +N+  +VP V+CI++DG +SF + AAEELG+PE+ FWT SACGF+ YL++   I++G+ P KDE++L+   LDT +DWIP MKN+RL+D+PSF
Subjt:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF

Query:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRL--SVWKEDQQCLEWLDTQAPQSVVY
        IRTTN +DIM +F+  EA    R+SAII NTFD+LEHDV++++ +  P +Y++GPL LL ++ +  + +  R   ++W+E+ +CL+WL+T+A  SVVY
Subjt:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRL--SVWKEDQQCLEWLDTQAPQSVVY

AT1G22360.2 UDP-glucosyl transferase 85A2 (e value: 5.5e-92; % identity: 49.66)
Query:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP
        M   VA K H VC+PYP QGH  P++++AKLL++ GFHITFV   + H+ + +S G N V+GL  F+F +IPDGLP ++   + D+PTLC+ST ++ L P
Subjt:  MSKSVADKPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNP

Query:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF
        FK+L+  +N+  +VP V+CI++DG +SF + AAEELG+PE+ FWT SACGF+ YL++   I++G+ P KDE++L+   LDT +DWIP MKN+RL+D+PSF
Subjt:  FKQLVAGLNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSF

Query:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRL--SVWKEDQQCLEWLDTQAPQSVVY
        IRTTN +DIM +F+  EA    R+SAII NTFD+LEHDV++++ +  P +Y++GPL LL ++ +  + +  R   ++W+E+ +CL+WL+T+A  SVVY
Subjt:  IRTTNTEDIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRL--SVWKEDQQCLEWLDTQAPQSVVY

AT1G22370.2 UDP-glucosyl transferase 85A5 (e value: 4.2e-92; % identity: 52.23)
Query:  KPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLVAG
        KPH VCIP+P QGH  P+L++AKLL++ GFH+TFV   + H+ + +S G N ++GL  F+F +IPDGLP   +    DVPTLC+ST +N L PFK+L+  
Subjt:  KPHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLVAG

Query:  LNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTNTE
        +N++ +VP V+CI++DGV+SF + AAEELG+P++ FWT SACGF+ YLHF   I++G+ P KDE+     +LDT ++WIP MKN+ L+D+PSFIR TNTE
Subjt:  LNSSVEVPSVTCIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTNTE

Query:  DIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSL-LSREA-TESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY
        DIM +F   EA    R+SAII NTFD LEHDV+ +I +  PQ+YT+GPL L ++R+   ES +  +  ++W+E+ +CL+WLDT++P SVVY
Subjt:  DIMFDFMGSEARNCMRSSAIIFNTFDELEHDVLEAISAKFPQIYTVGPLSL-LSREA-TESHLKALRLSVWKEDQQCLEWLDTQAPQSVVY

AT4G39970.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein (e value: 1.3e-114; % identity: 69.18)
Query:  IPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSLSVSA----ALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTSQPLNWSIEFYDELQ
        + SSS L+  P  RF     +RF++  +S  +  R S  VSA    +L+ALIFDCDGVILESE+LHRQAYNDAF+HFDVRCP S+S+ L+WS+EFYD+ Q
Subjt:  IPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSLSVSA----ALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTSQPLNWSIEFYDELQ

Query:  NRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVLRLMDEAKSAGRKLA
        N +GGGKPKMRWYFKENGWP+STIF+  P++D+DRAKLID LQ                    DWKTERYKEIIKSG+V PRPGV+RLMDEAK+AG+KLA
Subjt:  NRIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVLRLMDEAKSAGRKLA

Query:  VCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTSTANQDFKEAIATYP
        VCSAATKSSVILCLENLI I+RFQ LDCFLAGDDVKEKKPDPSIYITA++KLGVS KDCLVVEDSVIGLQAATKAGM CVITYT+ST++Q+F +AIA YP
Subjt:  VCSAATKSSVILCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTSTANQDFKEAIATYP

Query:  DLSDI
        DLS++
Subjt:  DLSDI


Sequences
CDS sequence
ATGGCGAGCATCACTACAATGCTCGTTCGTCCCAATCTCATACCCTCTTCTTCCGATCTGGTTTCCCCTCCTCTCTCTCGATTTCTTCCTCCCGCAGCCATCCGCTTCCG
GACCTCCAGAAAATCCACCGCCACTCACAAACGTTTCTCCCTTTCAGTCTCTGCTGCTTTACAAGCTCTAATATTCGACTGCGACGGCGTGATCCTTGAGTCCGAGCACT
TGCACCGTCAAGCATATAACGATGCGTTTGCTCATTTTGACGTTCGTTGCCCTAATTCGACGTCGCAGCCTCTCAATTGGAGCATCGAGTTTTACGACGAGCTCCAGAAC
CGCATTGGTGGCGGTAAACCTAAAATGCGATGGTACTTCAAGGAGAATGGATGGCCATCTTCAACGATCTTCGAGAAGGCTCCGGAAGATGATGAGGACCGAGCAAAGTT
GATTGATATTCTTCAGGCATACTTATTTAATTTCCTTACTTACATCGATACACATCGATACTGTTGTTATCTTAATGATTGGAAAACGGAAAGGTACAAGGAAATAATCA
AATCTGGAACTGTGAATCCCAGGCCTGGAGTACTAAGATTGATGGATGAGGCAAAGTCTGCTGGCAGGAAACTTGCTGTATGCTCTGCGGCTACAAAAAGTTCAGTTATT
CTTTGCCTTGAAAATCTTATTGGAATTGATCGGTTTCAAAATCTTGATTGCTTCCTTGCTGGTGATGATGTGAAGGAAAAGAAGCCTGATCCATCCATTTATATAACAGC
TTCAAAAAAGCTGGGCGTGTCAGAAAAGGATTGTCTGGTGGTTGAGGACAGTGTCATTGGCTTGCAGGCTGCTACAAAGGCTGGAATGCAATGTGTGATAACCTACACAA
CTTCCACAGCTAACCAGGATTTTAAAGAAGCGATAGCAACCTATCCAGACTTGAGTGACATAAGTTTTTCATCTTCGTTTATCGAGATGAGCAAGTCAGTGGCCGATAAA
CCTCATGCAGTGTGCATCCCATATCCAGAGCAAGGGCACACGTTGCCTCTTCTGCAATTAGCCAAGCTTCTCCATTCAACTGGTTTCCACATAACCTTTGTCATCCCCGA
GTTCTATCACGATCACATAACACAGTCCCACGGAACTAATGTCGTAGAAGGCTTGTTCGATTTCCAGTTCCGGACCATACCAGACGGGTTGCCTCCATCTGAACGCAAAG
CCTCCCCAGATGTTCCGACGCTCTGTGACTCAACTAGGAGAAATTTTTTGAACCCATTCAAACAGTTGGTGGCTGGGCTGAATTCTTCGGTGGAGGTGCCTTCAGTAACC
TGTATAATTGCTGATGGGGTCTTGAGCTTTGCTATAAAGGCTGCTGAGGAGCTGGGGATTCCGGAGATTCAGTTTTGGACAGCTTCTGCTTGTGGTTTCATGGGGTATCT
GCATTTTGATGAACTTATTCAAAGAGGAATCTTGCCGTTCAAAGATGAAACATTCTTGAGTGATGGTACGCTTGACACCTCCGTGGATTGGATCCCAGGAATGAAAAATA
TCCGCCTGAGAGACCTCCCAAGCTTCATCAGAACTACAAACACAGAGGATATAATGTTTGATTTCATGGGGTCGGAAGCCCGAAACTGCATGAGATCTTCTGCTATCATC
TTCAACACATTCGACGAGCTTGAACATGATGTGTTAGAAGCAATTTCAGCAAAGTTTCCCCAAATATACACAGTTGGTCCCTTATCCTTACTAAGCAGAGAAGCAACTGA
AAGCCATTTGAAGGCATTAAGGCTGAGTGTGTGGAAGGAAGACCAACAATGCCTCGAGTGGCTCGATACACAGGCCCCCCAATCTGTTGTATATTCAAAGCTAGACCTGC
CATTACAAGGAGTGGTTGTCTCTTCAGTTGCACAACATGATGCGTAA
mRNA sequence
ATGGCGAGCATCACTACAATGCTCGTTCGTCCCAATCTCATACCCTCTTCTTCCGATCTGGTTTCCCCTCCTCTCTCTCGATTTCTTCCTCCCGCAGCCATCCGCTTCCG
GACCTCCAGAAAATCCACCGCCACTCACAAACGTTTCTCCCTTTCAGTCTCTGCTGCTTTACAAGCTCTAATATTCGACTGCGACGGCGTGATCCTTGAGTCCGAGCACT
TGCACCGTCAAGCATATAACGATGCGTTTGCTCATTTTGACGTTCGTTGCCCTAATTCGACGTCGCAGCCTCTCAATTGGAGCATCGAGTTTTACGACGAGCTCCAGAAC
CGCATTGGTGGCGGTAAACCTAAAATGCGATGGTACTTCAAGGAGAATGGATGGCCATCTTCAACGATCTTCGAGAAGGCTCCGGAAGATGATGAGGACCGAGCAAAGTT
GATTGATATTCTTCAGGCATACTTATTTAATTTCCTTACTTACATCGATACACATCGATACTGTTGTTATCTTAATGATTGGAAAACGGAAAGGTACAAGGAAATAATCA
AATCTGGAACTGTGAATCCCAGGCCTGGAGTACTAAGATTGATGGATGAGGCAAAGTCTGCTGGCAGGAAACTTGCTGTATGCTCTGCGGCTACAAAAAGTTCAGTTATT
CTTTGCCTTGAAAATCTTATTGGAATTGATCGGTTTCAAAATCTTGATTGCTTCCTTGCTGGTGATGATGTGAAGGAAAAGAAGCCTGATCCATCCATTTATATAACAGC
TTCAAAAAAGCTGGGCGTGTCAGAAAAGGATTGTCTGGTGGTTGAGGACAGTGTCATTGGCTTGCAGGCTGCTACAAAGGCTGGAATGCAATGTGTGATAACCTACACAA
CTTCCACAGCTAACCAGGATTTTAAAGAAGCGATAGCAACCTATCCAGACTTGAGTGACATAAGTTTTTCATCTTCGTTTATCGAGATGAGCAAGTCAGTGGCCGATAAA
CCTCATGCAGTGTGCATCCCATATCCAGAGCAAGGGCACACGTTGCCTCTTCTGCAATTAGCCAAGCTTCTCCATTCAACTGGTTTCCACATAACCTTTGTCATCCCCGA
GTTCTATCACGATCACATAACACAGTCCCACGGAACTAATGTCGTAGAAGGCTTGTTCGATTTCCAGTTCCGGACCATACCAGACGGGTTGCCTCCATCTGAACGCAAAG
CCTCCCCAGATGTTCCGACGCTCTGTGACTCAACTAGGAGAAATTTTTTGAACCCATTCAAACAGTTGGTGGCTGGGCTGAATTCTTCGGTGGAGGTGCCTTCAGTAACC
TGTATAATTGCTGATGGGGTCTTGAGCTTTGCTATAAAGGCTGCTGAGGAGCTGGGGATTCCGGAGATTCAGTTTTGGACAGCTTCTGCTTGTGGTTTCATGGGGTATCT
GCATTTTGATGAACTTATTCAAAGAGGAATCTTGCCGTTCAAAGATGAAACATTCTTGAGTGATGGTACGCTTGACACCTCCGTGGATTGGATCCCAGGAATGAAAAATA
TCCGCCTGAGAGACCTCCCAAGCTTCATCAGAACTACAAACACAGAGGATATAATGTTTGATTTCATGGGGTCGGAAGCCCGAAACTGCATGAGATCTTCTGCTATCATC
TTCAACACATTCGACGAGCTTGAACATGATGTGTTAGAAGCAATTTCAGCAAAGTTTCCCCAAATATACACAGTTGGTCCCTTATCCTTACTAAGCAGAGAAGCAACTGA
AAGCCATTTGAAGGCATTAAGGCTGAGTGTGTGGAAGGAAGACCAACAATGCCTCGAGTGGCTCGATACACAGGCCCCCCAATCTGTTGTATATTCAAAGCTAGACCTGC
CATTACAAGGAGTGGTTGTCTCTTCAGTTGCACAACATGATGCGTAA
Protein sequence
MASITTMLVRPNLIPSSSDLVSPPLSRFLPPAAIRFRTSRKSTATHKRFSLSVSAALQALIFDCDGVILESEHLHRQAYNDAFAHFDVRCPNSTSQPLNWSIEFYDELQN
RIGGGKPKMRWYFKENGWPSSTIFEKAPEDDEDRAKLIDILQAYLFNFLTYIDTHRYCCYLNDWKTERYKEIIKSGTVNPRPGVLRLMDEAKSAGRKLAVCSAATKSSVI
LCLENLIGIDRFQNLDCFLAGDDVKEKKPDPSIYITASKKLGVSEKDCLVVEDSVIGLQAATKAGMQCVITYTTSTANQDFKEAIATYPDLSDISFSSSFIEMSKSVADK
PHAVCIPYPEQGHTLPLLQLAKLLHSTGFHITFVIPEFYHDHITQSHGTNVVEGLFDFQFRTIPDGLPPSERKASPDVPTLCDSTRRNFLNPFKQLVAGLNSSVEVPSVT
CIIADGVLSFAIKAAEELGIPEIQFWTASACGFMGYLHFDELIQRGILPFKDETFLSDGTLDTSVDWIPGMKNIRLRDLPSFIRTTNTEDIMFDFMGSEARNCMRSSAII
FNTFDELEHDVLEAISAKFPQIYTVGPLSLLSREATESHLKALRLSVWKEDQQCLEWLDTQAPQSVVYSKLDLPLQGVVVSSVAQHDA
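
The CDS and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, not a CuGenDB tool. It translates the first ten codons of the CDS, which should reproduce the start of the protein sequence.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGGCGAGCATCACTACAATGCTCGTTCGT"))  # MASITTMLVR
```

Passing the full CDS, with line breaks removed, should yield the complete protein sequence if the two records are consistent.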