; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001056 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001056
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionGlycosyltransferase
Genome locationChr09:13605619..13612195
RNA-Seq ExpressionHG10001056
SyntenyHG10001056
Gene Ontology termsGO:0008194 - UDP-glycosyltransferase activity (molecular function)
InterPro domainsIPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase
IPR035595 - UDP-glycosyltransferase family, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040082.1 beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like [Cucumis melo var. makuwa]8.3e-18676.33Show/hide
Query:  MDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTL
        MDA QA     T TTILMF WLGYGHLSAYLEL+KALSSR NF IYFCSTPVNLDSIKPKLI S   SIQFVELHLPSSPEFP HLHTTNAL +HLTP L
Subjt:  MDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTL

Query:  HQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTS-----AHSVKEAISN
        HQAF AAAP FE IL+TLSPHLLIYDCFQSWAPRLASSLNIPAINFNTS   ++ + FHS++ P SKFP+SDFVLHNHW +K+ S     AH VKEA   
Subjt:  HQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTS-----AHSVKEAISN

Query:  SFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNE-EEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIW
          N S DVIL NSF+EV+G+YMDYLS LLKKKVI VGPLVYEPNE +EEDEDY RIKNWLDKKEALSTVL S GSESYA EEEKEE+  GL ES  NFIW
Subjt:  SFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNE-EEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIW

Query:  VERVSLKEDQEQE--RRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKI
        VER++ KED+EQ+  RR  +E+ GERA+V+KGWAPQGKILKHGSIGGFVSHCGWNSVLES VSGVPIIGVP+ GDQPFN GVVEEAG+GVEAKRDP+GKI
Subjt:  VERVSLKEDQEQE--RRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKI

Query:  QRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN
        QRQEVAKLIK+VVVEK+REE+R++VREMSE ++R+ D+KI+E+L QIS L N
Subjt:  QRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN

KAF7133092.1 hypothetical protein RHSIM_Rhsim09G0143900 [Rhododendron simsii]8.9e-19645.27Show/hide
Query:  NTSTPTTILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKL--IPSSSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAA
        +T+T   +LM PW+  GH+  +LELAK LS + NFHIY CSTP+ L SI+ ++    S SIE V FHL P+PELP H HTTNG+PPH+   L  A    +
Subjt:  NTSTPTTILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKL--IPSSSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAA

Query:  PRFESILQTLSPHLLIYDCFQPWAPRI-ASTLNIPAINFSTTGASIVSHEFHSIHYPDSKFPFSNFVLHNYWKAKLKSVTSEGACIIEGFFNCFNASCDV
        P   + LQTLSP L+IYD F+P  P   AS   IPA+   T+GA++VS  FH    PD +FPF    L++      +                   SC +
Subjt:  PRFESILQTLSPHLLIYDCFQPWAPRI-ASTLNIPAINFSTTGASIVSHEFHSIHYPDSKFPFSNFVLHNYWKAKLKSVTSEGACIIEGFFNCFNASCDV

Query:  ILMNSFREIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFIWVERSNSKGD
         L+NSFRE+EG+Y++Y+ +LT+KK +P GPLV     + + +  S I  WL  K+  STV VS GSE   S EE  E+  GLE S VNFIW+ R  + G+
Subjt:  ILMNSFREIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFIWVERSNSKGD

Query:  E----EQKRREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDSDGKIQRKEIAR
             E    EF+E +GER M+V+GWAPQ +IL H S GGFVSHCGWNS+LES+ FGVPI+ VPM  DQP NA +VE AG G E KRD + ++ R+ IA+
Subjt:  E----EQKRREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDSDGKIQRKEIAR

Query:  LIKEVVVEKTREEIR--------------------MKPIICIAMDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSI
        +IK+VVV++   ++R                    ++P     MD++Q+ +     T +LM  WL  GH++ +LELAK LS++ +F IY CSTP+ L SI
Subjt:  LIKEVVVEKTREEIR--------------------MKPIICIAMDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSI

Query:  KPKLISSSFSSIQFVELHL-PSSPEFPSHLHTTNALLIHLTPTLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVC-
        + ++  +   SIQ V LHL P+SPE P H HTTN L  HL PTL  AF +A+P F  +LQTL P L+IYD  Q WAP +AS  NIPA+ F T+ A VV  
Subjt:  KPKLISSSFSSIQFVELHL-PSSPEFPSHLHTTNALLIHLTPTLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVC-

Query:  HGFHSIYYPNSKFPVSDF--VLHNHWKAK-----FTSAHSVKEAISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYW
          F +   P+ + P  +   +L +   A+               +      S  +IL N+FRE++GKY+DYL   ++KK++ +GPLV+E +  EE++D  
Subjt:  HGFHSIYYPNSKFPVSDF--VLHNHWKAK-----FTSAHSVKEAISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYW

Query:  RIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIWVERVSLKEDQEQERR---GFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCG
         I  WL  K+ LSTV VSFGSES+  +EE+EE+ NGLE S  NFIWV R  ++E  E E     GF+ER G+R LV+ GWAPQ +IL H S  GFVSHCG
Subjt:  RIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIWVERVSLKEDQEQERR---GFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCG

Query:  WNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN
        WNSVLES+  GVPII +P+  DQP +  VVE+ GVGVE KRD N K+ R+ +A++I++VV +++  ++R K RE+SE +R K +K ID ++ +   L N
Subjt:  WNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN

KAG5574888.1 hypothetical protein H5410_055022 [Solanum commersonii]1.6e-20547.7Show/hide
Query:  TILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAAPRFESI
        ++LM PW+ +GH+  +LELAK L+ R NFHIY CSTP+ L SIK  +    S SIE VE HL   P LPPH HTTNG+PP +  TL  A   A+P F  I
Subjt:  TILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAAPRFESI

Query:  LQTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIVSHEFHSIHYPDSKFPFSNFVLHNYWKAKLKSVTSEGACIIEGFFNCFNASCDVILMNSFR
        LQTL P L+IYD  QPWA   AS++NIPA+ F T GA++VS   H     + KFPF    LH Y    LK    E       F      S D+IL+ + R
Subjt:  LQTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIVSHEFHSIHYPDSKFPFSNFVLHNYWKAKLKSVTSEGACIIEGFFNCFNASCDVILMNSFR

Query:  EIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFIWVER---SNSKGDEEQK
        + EG+Y++Y+S L  KK++PVG LV E     +D++   I  WLDKKE  STV VS GSE   S+E+I  + +GLE S+VNFIWV R     S   ++  
Subjt:  EIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFIWVER---SNSKGDEEQK

Query:  RREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDSDGKIQRKEIARLIKEVVVE
           ++E VGER MV+KGWAPQ  IL+H SIGGFVSHCGW+S +ES+ FGVPII +PM  DQP NA +VE   +GVEA RD DGK+Q +EIA  I++V+VE
Subjt:  RREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDSDGKIQRKEIARLIKEVVVE

Query:  KTREEIRMK-PIICIAMDA-----------QQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVE
        ++ E++R K   +   M+A           + A  +T+   ++LM  W  +GH++ +LELAK L+S+ NF IY CSTPVNL+SIK ++      SI+ +E
Subjt:  KTREEIRMK-PIICIAMDA-----------QQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVE

Query:  LHLPSSPEFPSHLHTTNALLIHLTPTLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDF
        LHLPS P+ P H HTTN L  HL  TL  AF  A+P F  ILQTL P L+I+D  Q W    ASS+NIPA+ F T SA VV    H     + KFP  + 
Subjt:  LHLPSSPEFPSHLHTTNALLIHLTPTLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDF

Query:  VLHNH----WKAKFTSAHSVKEAISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYWRIKNWLDKKEALSTVLVSFGS
         L  H     K       S K     +   S D+IL  + R+ +GKY+DYLS L  KK++ VG LV    E  + +DY  I  WLDKK+  STV VSFGS
Subjt:  VLHNH----WKAKFTSAHSVKEAISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYWRIKNWLDKKEALSTVLVSFGS

Query:  ESYALEEEKEEVGNGLEESETNFIWVERVSLKED---QEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISG
        E +  +EE   V  GLE S+ NFIWV R S  E    Q+    G++ER GER +V+KGWAPQ  IL+H SIGGFVSHCGW+S +ES+  GVPII +P+  
Subjt:  ESYALEEEKEEVGNGLEESETNFIWVERVSLKED---QEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISG

Query:  DQPFNVGVVEEAGVGVEAKRDPNGKIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEML
        DQP N  +VE  GVGVEA +D +GK+Q +E+AK I++V+VE+  E++R K +E+SE +  K D++ID ++
Subjt:  DQPFNVGVVEEAGVGVEAKRDPNGKIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEML

XP_038901884.1 UDP-glucosyltransferase 29-like [Benincasa hispida]3.4e-20381.6Show/hide
Query:  KPIICIAMDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALS--SRNNFQIYFCSTPVNLDSIKPKLISSSFSS-IQFVELHLPSSPEFPSHLHTTN
        +PIICIAM+AQQAGSNTL PTTILMF WLGYGHLS YLELAKAL+    N F IYFCSTPVNL+SIKPKLI SS SS I+FVELHLPSSPEFP +L+TTN
Subjt:  KPIICIAMDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALS--SRNNFQIYFCSTPVNLDSIKPKLISSSFSS-IQFVELHLPSSPEFPSHLHTTN

Query:  ALLIHLTPTLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTSA----
        AL  HLTPTLHQAF AAAPRFEAILQTLSPHLLIYD FQ WAPRLASSLNIPAINFNTS   ++ H FHSI+YP+SK+PVSDFVLH HWKAK   A    
Subjt:  ALLIHLTPTLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTSA----

Query:  --HSVKEAISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNEE-EEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGN
           SV+EA    FNASCDVILTNSFREVDG++MDYLS LLKKKVI VGPLVYEPNEE +EDEDYWRIKNWLDKKEALSTVLVS GSESYA EEEKEE+G 
Subjt:  --HSVKEAISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNEE-EEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGN

Query:  GLEESETNFIWVERV-SLKEDQEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGV
        GLEESE NFIWVER+ + K D+EQ+RR FVERAGERA+V+KGW PQGKILKHG+IGGFVSHCGWNSVLESIVSGVPIIGVPI+GDQPFN GVV+EAGVGV
Subjt:  GLEESETNFIWVERV-SLKEDQEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGV

Query:  EAKRDPNGKIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN
        E KRDP+GKIQRQEVAKLIK VVVEKTRE+LR+KVREMSE LRRKR++KIDEMLAQISLLRN
Subjt:  EAKRDPNGKIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN

XP_038902166.1 UDP-glucosyltransferase 29-like [Benincasa hispida]3.0e-19178.84Show/hide
Query:  MDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTL
        MD QQ GS       ILMF WLGYGHL AYLELAKALS+RNNF IYFCSTPVNLDSIKPKLI S  SSI+FVELHLPSSPEFP HLH  NAL  HL  TL
Subjt:  MDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTL

Query:  HQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTS-----AHSVKEAISN
        HQAF AAAPRFEAILQTLSPHLLIYD  QSWAP++ASSL IPAINFNTS AF++ HGFHSI+YPNSK P SDFVLHNHWKAK        A SV+EA   
Subjt:  HQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTS-----AHSVKEAISN

Query:  SFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIWV
         FNASCDVIL NSFREV+G+YMDY+S L KKKVI VGPLVYEPN+EEEDEDY RIKNWLDKKEALSTVLVSFGSE YA EEE EE+G GLEESE NFIWV
Subjt:  SFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIWV

Query:  ERVSLKEDQEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKIQRQ
        ER++ KED+E++RRGFVERAGE+A+V+KGWAPQGKILKHGSIGGFVSHCGWNSVLESIV GVPIIGVP+S DQPFN  VVEEAGVGVEAKRDP+GKIQRQ
Subjt:  ERVSLKEDQEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKIQRQ

Query:  EVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN
        E+AKLIK+VVVEKTREELR+KVREMSE L +  D+KI+ M+AQIS L N
Subjt:  EVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN

TrEMBL top hitse value%identityAlignment
A0A0A0KX18 UDP-glucose:sesaminol 2'-O-glucoside-O-glucosyltransferase2.7e-17472.47Show/hide
Query:  IAMDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTP
        +AMD  QA  +T T TTILMF WLGYGHLS YLELAKALS+R NF IYFCSTPVNLDSIKPKLI S   SIQ VELHLPSSP+ P HLHTTNAL  HLTP
Subjt:  IAMDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTP

Query:  TLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTSAHS-----VKEAI
         L+QAF AAAP FE IL+TLSPHLLIYDCFQ WAPRLASSLNIPAI+FNTSSA ++   FH+ + P SKFP SDFVLHNHWK+K  S  S     V E+ 
Subjt:  TLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTSAHS-----VKEAI

Query:  SNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNE-EEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNF
            N S DVIL NSF+EV+G++MDY+  L KKKVI VGPLVYEP+E +EEDEDY RIKNWLDKKEALSTVL S GSESYA EEEKEE+  GL ESE NF
Subjt:  SNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNE-EEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNF

Query:  IWVERVSLKEDQEQE--RRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNG
        IWVER++ K D+EQ+  RR  +E++GERA+V++GWAPQGKI KHGSIGGFVSHCGWNSVLESIVSGVPIIGVP+ GDQP N GVVEEAG+GVEAKRDP+G
Subjt:  IWVERVSLKEDQEQE--RRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNG

Query:  KIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN
        KIQR+E+A+LIK+VV+EK+REELR+KVREMSE ++RK D+KI+E+L QIS   N
Subjt:  KIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN

A0A0A0L0D8 Glycosyltransferase1.2e-18275.55Show/hide
Query:  IAMDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTP
        + MDA QA  +T T TTILMF WLGYGHLS YLEL+KALS+R NF IYFCSTPVNLDSIKPKLI S   SIQFVELHLPSSP+FP HLHTTNAL  HLTP
Subjt:  IAMDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTP

Query:  TLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTS-----AHSVKEAI
         LHQAF AAAP FE IL+TLSPHLLIYDCFQSWAPRLASSLNIPAINF+TS   ++ +GFHSI++P+SKFP SDFVLHN W++K+ S     A SV+EA 
Subjt:  TLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTS-----AHSVKEAI

Query:  SNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNE-EEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNF
            N S DVIL NSF+EV+G+YMDYLS LLKKKVI VGPLVYEPNE +EEDEDY RIKNWLDKKEALSTVLVS GSESYA EEEKEE+  GL ESE NF
Subjt:  SNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNE-EEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNF

Query:  IWVERVSLKEDQEQE--RRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNG
        IWVER++ K D+EQ+  RR  +E++GERA+V+KGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVP+ GDQPFN GVVE AG+GVEAKRDP+G
Subjt:  IWVERVSLKEDQEQE--RRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNG

Query:  KIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN
        KIQR+EVAKLIK+VV+EK REELR+KVREMSE ++R+ D  I+EMLAQIS   N
Subjt:  KIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN

A0A1S4DXN6 beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like1.4e-18375.44Show/hide
Query:  MDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTL
        MDA QA ++  T TTILMF WLGYGHL+ YLEL+KALS R NF IYFCSTPVNLDSIKPKLI S   SIQFVELHLPSSPEFP HLHTT AL +HLTP L
Subjt:  MDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTL

Query:  HQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTS-----AHSVKEAISN
        HQAF AAAP FE IL+TLSPHLLIYDCFQSWAPRLASSLNIPAINFNTS A ++ + FHSI+ P SKFP+SDFVLHNHW +K+ S     AH VKEA   
Subjt:  HQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTS-----AHSVKEAISN

Query:  SFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNE-EEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIW
          N S DVILTNSF+EV+G+YMDY+S L KKKVI VGPLVYEPNE +EEDEDY RIKNWLDKKEALSTVLVS GSESYA EEEKEE+  GL ES  NFIW
Subjt:  SFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNE-EEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIW

Query:  VERVSLKEDQEQE--RRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKI
        VER++ K D+EQ+  RR  +E+ GERA+V+KGWAPQGKILKHGSIGGFVSHCGWNSVLES VSGVPIIGVP+ GDQPFN GVVEEAG+GVEAKRD +GKI
Subjt:  VERVSLKEDQEQE--RRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKI

Query:  QRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN
        QRQEVAKLIK+VVVEK+REE+R++VREMSE ++R+ D+KI+E+L QIS L N
Subjt:  QRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN

A0A3Q7IQZ7 Uncharacterized protein6.9e-20247.36Show/hide
Query:  TILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIPS--SSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAAPRFESI
        +ILM PW+ +GH+  +LELAK L+ R NFHIY CSTP+ L SIK  +      SIE VEFHL   P LPPH HTTNG+PPH+  TL  A   A+P F  I
Subjt:  TILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIPS--SSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAAPRFESI

Query:  LQTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIVSHEFHSIHYPDSKFPFSNFVLHNYWKAKLKSVTSEGACIIEGFFNCFNASCDVILMNSFR
        LQTL+P L+IYD  QPWA   AS++NIPA+ F T GA++VS   H     + KFPF    LH Y    LK    E       F      S D++L+ + R
Subjt:  LQTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIVSHEFHSIHYPDSKFPFSNFVLHNYWKAKLKSVTSEGACIIEGFFNCFNASCDVILMNSFR

Query:  EIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFIWVERSNSKGD----EEQ
        + EG+Y++Y+S L  KK++PVG LV E      D+N   I  WLDKKE   TV VS GSE   S+E+I  + +GLE S+VNFIWV R  S+G+    ++ 
Subjt:  EIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFIWVERSNSKGD----EEQ

Query:  KRREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDSDGKIQRKEIARLIKEVVV
            ++E VGER MV++GWAPQ  IL+H SIGGFVSHCGW+S +ES+ FGVPII +PM  DQP NA +VE   +GVEA RD +GK+Q +EIA  I++V+V
Subjt:  KRREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDSDGKIQRKEIARLIKEVVV

Query:  EKTREEIRMK-PIICIAMDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFP
        E++ E++R K   +   M+A+  G   +            +GH++ +LELAK L+S+ NF IY CST VNL SIK ++      SI+ +ELHLPS P+ P
Subjt:  EKTREEIRMK-PIICIAMDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFP

Query:  SHLHTTNALLIHLTPTLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNH----W
         H HTTN L  HL  TL  AF  A+P F  ILQTL P L+I+D  Q W    ASS+NIPA+ F T SA VV    H       KFP  +  L  H     
Subjt:  SHLHTTNALLIHLTPTLHQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNH----W

Query:  KAKFTSAHSVKEAISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKE
        K       S K     +   S D+IL  + R+ +GKY+DYLS L  KKV+ VG LV E  ++   +DY  I  WLDKKE  STV VSFGSE +  +EE  
Subjt:  KAKFTSAHSVKEAISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKE

Query:  EVGNGLEESETNFIWV------ERVSLKEDQEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVG
         V  GLE S+ NFIWV      ER+++++   +E   ++ER GER +V++GWAPQ  IL+H SIGGFVSHCGW+S +ES+  GVPII +P+  DQP N  
Subjt:  EVGNGLEESETNFIWV------ERVSLKEDQEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVG

Query:  VVEEAGVGVEAKRDPNGKIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRNN
        +VE  GVGVEA +D +GK+Q +E+AK I++VV E++ E++R KV+E+SE +  K D++ID +  ++  LR N
Subjt:  VVEEAGVGVEAKRDPNGKIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRNN

A0A5D3DDR2 Beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like4.0e-18676.33Show/hide
Query:  MDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTL
        MDA QA     T TTILMF WLGYGHLSAYLEL+KALSSR NF IYFCSTPVNLDSIKPKLI S   SIQFVELHLPSSPEFP HLHTTNAL +HLTP L
Subjt:  MDAQQAGSNTLTPTTILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTL

Query:  HQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTS-----AHSVKEAISN
        HQAF AAAP FE IL+TLSPHLLIYDCFQSWAPRLASSLNIPAINFNTS   ++ + FHS++ P SKFP+SDFVLHNHW +K+ S     AH VKEA   
Subjt:  HQAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTS-----AHSVKEAISN

Query:  SFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNE-EEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIW
          N S DVIL NSF+EV+G+YMDYLS LLKKKVI VGPLVYEPNE +EEDEDY RIKNWLDKKEALSTVL S GSESYA EEEKEE+  GL ES  NFIW
Subjt:  SFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYEPNE-EEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIW

Query:  VERVSLKEDQEQE--RRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKI
        VER++ KED+EQ+  RR  +E+ GERA+V+KGWAPQGKILKHGSIGGFVSHCGWNSVLES VSGVPIIGVP+ GDQPFN GVVEEAG+GVEAKRDP+GKI
Subjt:  VERVSLKEDQEQE--RRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKI

Query:  QRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN
        QRQEVAKLIK+VVVEK+REE+R++VREMSE ++R+ D+KI+E+L QIS L N
Subjt:  QRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRN

SwissProt top hitse value%identityAlignment
A0A0A6ZFY4 UDP-glucosyltransferase 292.6e-9747.31Show/hide
Query:  TILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTLHQAFVAAAPRFEAI
        +I +  +L +GH+S + ELAK L+ R N  ++ CSTP+NL SIK K    S +SI+ VELHLPSSP+ P H HTTN L  HL   L  AF  A P F  I
Subjt:  TILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTLHQAFVAAAPRFEAI

Query:  LQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTSAHSVK--EAISNSFNASCDVILTNSFREV
        L+TL+P LLIYD   SWAP +ASS NIPA+ F T++A     G H+   P  K+P  DF  +++   +  SA ++K        F  SCD+IL  SFRE+
Subjt:  LQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTSAHSVK--EAISNSFNASCDVILTNSFREV

Query:  DGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIWVERVSLKEDQEQERRGFV
        +GKY+D LS L  K ++ VGPLV +P    ED    +I NWLDK+   + V V FGSE +   EE EEV  GLE S  NFIW  R+   E +     GFV
Subjt:  DGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIWVERVSLKEDQEQERRGFV

Query:  ERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKIQRQEVAKLIKQVVVEKTREE
        +R G+R LV++GWAPQ +IL H S GGFVSHCGW+S+ ES+  GVP+I +    DQP N  +  E GVG+E  RD NGK +R+ +A++I++VVVEK+ E 
Subjt:  ERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKIQRQEVAKLIKQVVVEKTREE

Query:  LRIKVREMSENLRRKRDKKIDEMLAQI
        +R K RE+SE ++ K +++ID  L ++
Subjt:  LRIKVREMSENLRRKRDKKIDEMLAQI

F8WKW8 Beta-D-glucosyl crocetin beta-1,6-glucosyltransferase4.0e-9043.12Show/hide
Query:  MFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTLHQAFVAAAPRFEAILQT
        MF WL YGH+S YLELAK L+ R  F IY CSTP+NL  IK ++      +I+ VELHLP +PE P H HTTN L  HL  TL +A   A P    IL+T
Subjt:  MFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTLHQAFVAAAPRFEAILQT

Query:  LSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKF-TSAHSVK------EAISNSFNASCD-VILTNSF
        L P  +IYD  Q+W   L  + NIPA+ F TSS  ++ +  H    P  +FP     L +  +AK  T+A   +      +  +   N  CD + L  S 
Subjt:  LSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKF-TSAHSVK------EAISNSFNASCD-VILTNSF

Query:  REVDGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIWVERVSLKE---DQEQ
        R ++GKY+DYL  L+K K++ VG LV EP ++++ ++   +  WL  K   STVLVSFG+E +  +EE EE+ +GLE SE NFIWV R ++ +     E 
Subjt:  REVDGKYMDYLSFLLKKKVISVGPLVYEPNEEEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIWVERVSLKE---DQEQ

Query:  ERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKIQRQEVAKLIKQVVV
           GF+ER G+R  +++GWAPQ ++L H S GGF+ HCGWNSV+ESI  GVP+I +P+  DQP N  +V E G G+E  RD  GK  R+E+A+ IK  +V
Subjt:  ERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKIQRQEVAKLIKQVVV

Query:  EKTREELRIKVREMSENLRRKRDKKIDEMLAQISLL
        EKT E  R K+ ++   +  K  +++DE+   ++ L
Subjt:  EKTREELRIKVREMSENLRRKRDKKIDEMLAQISLL

Q5NTH0 Cyanidin-3-O-glucoside 2-O-glucuronosyltransferase2.3e-8542.05Show/hide
Query:  ILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAAPRFESIL
        ++M PW+ Y H+  +L  AK L+  +NFHIY CS+   +  +K  L    S SI+ +E +L  S ELP   HTT+G+PPH+  TL      + P FE+IL
Subjt:  ILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAAPRFESIL

Query:  QTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIVSHEFHSIHYP----DSKFPFSNFVLHNYWKAKLKSVTSEGACIIEGFFNCFNASCDVILMN
          L+PHL+IYD  Q WAP +ASTL+IP+I   +   ++ + + H    P     +KFPF      N      + +   G+  IE F +C   SC++IL+ 
Subjt:  QTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIVSHEFHSIHYP----DSKFPFSNFVLHNYWKAKLKSVTSEGACIIEGFFNCFNASCDVILMN

Query:  SFREIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFIWVERSNSKGDEEQK
        S  E+EG+Y++Y+S    KKV+PVGPLV E +  ++D  +  I  WLDKKE  S V V  GSE   S+ EI +I  GLE S+V+F+W  R+ +       
Subjt:  SFREIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFIWVERSNSKGDEEQK

Query:  RREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDSDGKIQRKEIARLIKEVVVE
           F++ VG++ +V+  W PQ  IL H S GGF+SHCGW+S +ESI +GVPII +PM  DQP+NA ++E  G G+E  RD +G+++R+EIA ++++VVVE
Subjt:  RREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDSDGKIQRKEIARLIKEVVVE

Query:  KTREEIRMK
         + E IR K
Subjt:  KTREEIRMK

Q8GVE3 Flavanone 7-O-glucoside 2''-O-beta-L-rhamnosyltransferase3.4e-8143.44Show/hide
Query:  TILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHLFPS-PELPPHLHTTNGVPPHIALTLHQAATAAAPRFES
        +ILM PW+ +GH+  +LELAK LS++ NFHIYFCSTP  L S    +    SSSI+ +E  L  + PELP    TT  +PPH+  TL  A   A P F +
Subjt:  TILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHLFPS-PELPPHLHTTNGVPPHIALTLHQAATAAAPRFES

Query:  ILQTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIVSHEFHSIHYPDSKFPFSNFVLHNYWKAKLKSV------TSEGACIIEGFFNCFNASCDV
        IL+TL P L++YD FQPWA   A   +I AI F    A   S   H+I  P  K+PF      +Y   + K++      T+ G    + F   F  SC  
Subjt:  ILQTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIVSHEFHSIHYPDSKFPFSNFVLHNYWKAKLKSV------TSEGACIIEGFFNCFNASCDV

Query:  ILMNSFREIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFIWVER---SNS
        + + + REIE +Y++Y   L   ++IPVGPL+ EP  +E+D   ++I +WL +KE  S V  S GSE   S++EI+EI  GL  SEVNFIW  R      
Subjt:  ILMNSFREIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFIWVER---SNS

Query:  KGDEEQKRREFVEMV--GERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDS-DGKIQRKEI
           EE   + F E +    + M+V+GW PQ KIL+HGSIGGF+SHCGW SV+E + FGVPIIGVPM  +QP NA VV + G+G+   RD  + ++  +E+
Subjt:  KGDEEQKRREFVEMV--GERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDS-DGKIQRKEI

Query:  ARLIKEVVVEKTREEIRMK
        AR+IK VV+++  ++IR K
Subjt:  ARLIKEVVVEKTREEIRMK

Q9LTA3 UDP-glycosyltransferase 91C12.4e-5035.61Show/hide
Query:  ILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAAPRFESIL
        + MFPW+  GHL  +L L+K L+++ +  I F STP  ++ + PKL    +SSI FV F L P   LPP   ++  VP +   +L  A     P  +  L
Subjt:  ILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAAPRFESIL

Query:  QTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIV------SHEFHSIHYPDSKF-------PF-SNFVLHNYWKAKLKSVTSEGACIIEGF--FN
        +  SP  +IYD    W P IA+ L I    FS   A+ +      S     I      F       PF SN V   +   +    T E    +     F 
Subjt:  QTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIV------SHEFHSIHYPDSKF-------PF-SNFVLHNYWKAKLKSVTSEGACIIEGF--FN

Query:  CFNASCDVILMNSFREIEGEYMNYVSLLTKKKVIPVG--PLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFI
              D + + S  E E E+   +  L +K V P+G  P V E +++  D  + RIK WLDK+   S V VSLG+E +   EE+ E+  GLE+SE  F 
Subjt:  CFNASCDVILMNSFREIEGEYMNYVSLLTKKKVIPVG--PLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFI

Query:  WVERSNSKGDEEQKRREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDS-DGKI
        WV R+  K  +  K R     V  R MV  GW PQ KIL H S+GGF++HCGWNSV+E + FG   I  P+  +Q  N  ++   GLGVE  RD  DG  
Subjt:  WVERSNSKGDEEQKRREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDS-DGKI

Query:  QRKEIARLIKEVVVEKTREEIRMK
            +A  I+ V+++   EEIR K
Subjt:  QRKEIARLIKEVVVEKTREEIRMK

Arabidopsis top hitse value%identityAlignment
AT2G15490.1 UDP-glycosyltransferase 73B46.2e-3828.76Show/hide
Query:  ILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKL-----------ISSSFSSIQFVELHLPSSPE---FPSHLHTTNALLIHLTPTLH
        IL F ++ +GH+   L++AK L +R   +    +TP+N   ++  +           I     +   VEL LP   E   F +    +++  + L     
Subjt:  ILMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKL-----------ISSSFSSIQFVELHLPSSPE---FPSHLHTTNALLIHLTPTLH

Query:  QAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFH-SIYYPNSKFPVS-----------DFVLHNHWKAKFTSAHS-
          ++    + E+ ++T  P  L+ D F  WA   A  + +P + F+ +S+F +C  ++  I+ P+ K   S           D V+    +A  T+  + 
Subjt:  QAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGFH-SIYYPNSKFPVS-----------DFVLHNHWKAKFTSAHS-

Query:  -------VKEAISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYE--------PNEEEEDEDYWRIKNWLDKKEALSTVLVSFGSESYA
               V+E+ ++SF      +L NSF E++  Y D+    + KK   +GPL              ++ + D      WLD K   S V +SFGS +  
Subjt:  -------VKEAISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVYE--------PNEEEEDEDYWRIKNWLDKKEALSTVLVSFGSESYA

Query:  LEEEKEEVGNGLEESETNFIWV-----ERVSLKEDQEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQ
          E+  E+  GLE S  NFIWV      +V   E+++   +GF ER   + L+++GWAPQ  IL H +IGGFV+HCGWNS LE I +G+P++  P+  +Q
Subjt:  LEEEKEEVGNGLEESETNFIWV-----ERVSLKEDQEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQ

Query:  PFNVGVVEEA---GVGVEA-KRDPNGK-IQRQEVAKLIKQVVVEKTREELRIKVREMSE
         +N  ++ +    GV V A +    GK I R +V K +++V+  +  EE R++ +E+ E
Subjt:  PFNVGVVEEA---GVGVEA-KRDPNGK-IQRQEVAKLIKQVVVEKTREELRIKVREMSE

AT2G22590.1 UDP-Glycosyltransferase superfamily protein2.3e-4029.96Show/hide
Query:  TPTTILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHL-FPSPELPPHLHTTNGVPPHIALTLHQAATAAAPR
        T   ++MFPW+ +GH+  YLEL+K ++++ +  + F STP  +D + P+L    SS I FV+  L     +LP     T  VP  +   L  A       
Subjt:  TPTTILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHL-FPSPELPPHLHTTNGVPPHIALTLHQAATAAAPR

Query:  FESILQTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIVS-------HEFHS----IHYPDSKFPFSNFVLHNYWKAK-----LKSVTSEG----
            L++  P  ++ D    W P I+  L I    FS    + +         E+ +       P    PF   V    ++ +       + T+EG    
Subjt:  FESILQTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIVS-------HEFHS----IHYPDSKFPFSNFVLHNYWKAK-----LKSVTSEG----

Query:  ----ACIIEGFFNCFNASCDVILMNSFREIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEED-ENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEI
              +I+G        CDVI + S  E E E++     L +K VIPVG L  +P+E+ ED + +  +K WLD +++ S V V+ GSE   S+ E+NEI
Subjt:  ----ACIIEGFFNCFNASCDVILMNSFREIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEED-ENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEI

Query:  GKGLEESEVNFIWVERSNSKGDEEQKRRE----FVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEE
          GLE S + F WV ++  +G  + +  E    F E   +R MV +GW  Q + L H SIG  ++H GW +++E+I F  P+  +    DQ  NA V+EE
Subjt:  GKGLEESEVNFIWVERSNSKGDEEQKRRE----FVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEE

Query:  AGLGVEAKRD-SDGKIQRKEIARLIKEVVVEKT----REEIRMKPIICIAMDAQ
          +G    RD ++G   ++ +A  ++ V+VE+     RE ++    +   MD Q
Subjt:  AGLGVEAKRD-SDGKIQRKEIARLIKEVVVEKT----REEIRMKPIICIAMDAQ

AT4G34135.1 UDP-glucosyltransferase 73B29.0e-3728.89Show/hide
Query:  ILMFSWLGYGHLSAYLELAKALSSR--------NNFQIYFCSTPVN-LDSIKPKL-ISSSFSSIQFVELHLPSSPE----FPSHLHTTNALLIHLTPTLH
        ++ F ++ YGH+   L++AK  SSR         +        P++   ++ P L I     +   VEL LP   E    F S+ +     +I       
Subjt:  ILMFSWLGYGHLSAYLELAKALSSR--------NNFQIYFCSTPVN-LDSIKPKL-ISSSFSSIQFVELHLPSSPE----FPSHLHTTNALLIHLTPTLH

Query:  QAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGF-HSIYYPNSKFPVSD--FVLHNHWKAKFTSAHSVKEA------
        + F     + E +L T  P  LI D F  WA   A   N+P + F+ +  F +C G+   ++ P  +   S   FV+         +   + +       
Subjt:  QAFVAAAPRFEAILQTLSPHLLIYDCFQSWAPRLASSLNIPAINFNTSSAFVVCHGF-HSIYYPNSKFPVSD--FVLHNHWKAKFTSAHSVKEA------

Query:  ------ISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPL-VYEPNEEEEDE-------DYWRIKNWLDKKEALSTVLVSFGSESYALEEEK
              +  S   S  V+L NSF E++  Y D+    ++K+   +GPL VY    EE+ E       D      WLD K+  S + VSFGS ++   E+ 
Subjt:  ------ISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPL-VYEPNEEEEDE-------DYWRIKNWLDKKEALSTVLVSFGSESYALEEEK

Query:  EEVGNGLEESETNFIWVERVSLKEDQEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEE-
         E+  GLE S T+FIWV R +  + +E    GF ER   + ++++GWAPQ  IL H + GGFV+HCGWNS+LE + +G+P++  P+  +Q +N  +V + 
Subjt:  EEVGNGLEESETNFIWVERVSLKEDQEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGWNSVLESIVSGVPIIGVPISGDQPFNVGVVEE-

Query:  --AGVGVEAKRD----PNGKIQRQEVAKLIKQVVVEKTREELRIKVREMS
           GV V A +         I R++V K +++V+  +  EE R + ++++
Subjt:  --AGVGVEAKRD----PNGKIQRQEVAKLIKQVVVEKTREELRIKVREMS

AT5G49690.1 UDP-Glycosyltransferase superfamily protein1.7e-5135.61Show/hide
Query:  ILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAAPRFESIL
        + MFPW+  GHL  +L L+K L+++ +  I F STP  ++ + PKL    +SSI FV F L P   LPP   ++  VP +   +L  A     P  +  L
Subjt:  ILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIP--SSSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAAPRFESIL

Query:  QTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIV------SHEFHSIHYPDSKF-------PF-SNFVLHNYWKAKLKSVTSEGACIIEGF--FN
        +  SP  +IYD    W P IA+ L I    FS   A+ +      S     I      F       PF SN V   +   +    T E    +     F 
Subjt:  QTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIV------SHEFHSIHYPDSKF-------PF-SNFVLHNYWKAKLKSVTSEGACIIEGF--FN

Query:  CFNASCDVILMNSFREIEGEYMNYVSLLTKKKVIPVG--PLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFI
              D + + S  E E E+   +  L +K V P+G  P V E +++  D  + RIK WLDK+   S V VSLG+E +   EE+ E+  GLE+SE  F 
Subjt:  CFNASCDVILMNSFREIEGEYMNYVSLLTKKKVIPVG--PLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFI

Query:  WVERSNSKGDEEQKRREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDS-DGKI
        WV R+  K  +  K R     V  R MV  GW PQ KIL H S+GGF++HCGWNSV+E + FG   I  P+  +Q  N  ++   GLGVE  RD  DG  
Subjt:  WVERSNSKGDEEQKRREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDS-DGKI

Query:  QRKEIARLIKEVVVEKTREEIRMK
            +A  I+ V+++   EEIR K
Subjt:  QRKEIARLIKEVVVEKTREEIRMK

AT5G65550.1 UDP-Glycosyltransferase superfamily protein5.1e-4831.5Show/hide
Query:  ILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIPSSSIEFVEFHLFPSPE-LPPHLHTTNGVPPHIALTLHQAATAAAPRFESILQ
        + +FPW+  GH+  YL+L+K ++R+ +  + F ST   +  + P +    S+ FV   L  + + LP +   T  VP      L +A    +  F   L+
Subjt:  ILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIPSSSIEFVEFHLFPSPE-LPPHLHTTNGVPPHIALTLHQAATAAAPRFESILQ

Query:  TLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGAS------------IVSHE----FHSIHYPDSKFPFSNFVLHNYWKAK-LKSVTSEGACIIEGFFN
           P+ ++YD    W P IA  L +    F T  A+            I  H+       +  P    PF   +++  ++AK +    + G   +E   N
Subjt:  TLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGAS------------IVSHE----FHSIHYPDSKFPFSNFVLHNYWKAK-LKSVTSEGACIIEGFFN

Query:  C----FNASCDVILMNSFREIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDE-NYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEV
        C         +VI++ S  E+E E++  +S L  K VIP+G L   P ++ +DE  +  I+ WLD+ +  S V V+LG+E T S EEI  +  GLE   +
Subjt:  C----FNASCDVILMNSFREIEGEYMNYVSLLTKKKVIPVGPLVYEPNEEEEDE-NYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEV

Query:  NFIWVERSNSKGDEEQKRREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKR-DSD
         F W  R  ++         F E V ER ++   W PQ KIL HGS+GGFV+HCGW S +E ++FGVP+I  P   DQP  A ++    +G+E  R + D
Subjt:  NFIWVERSNSKGDEEQKRREFVEMVGERVMVVKGWAPQGKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKR-DSD

Query:  GKIQRKEIARLIKEVVVEK
        G      +A  I+ VVVE+
Subjt:  GKIQRKEIARLIKEVVVEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGCCAGCAAGGTGGAAGCAACACCTCCACTCCCACAACCATTTTGATGTTTCCATGGATTGGCTATGGCCATCTCTGTGCTTACTTAGAGTTAGCTAAAGCTCT
CTCTAGAAGAAATAATTTCCACATCTACTTCTGTTCAACCCCGGTTTGTCTTGATTCCATTAAACCAAAGCTAATTCCTTCTTCTTCCATCGAATTTGTGGAGTTTCATC
TTTTTCCTTCTCCTGAACTTCCTCCACATCTTCACACAACTAACGGCGTCCCCCCTCACATTGCGCTCACTCTCCACCAAGCTGCCACTGCCGCTGCCCCACGCTTTGAG
TCAATTTTACAAACACTTTCTCCACATCTCCTCATTTATGACTGTTTTCAACCATGGGCTCCTCGAATAGCTTCCACTCTCAACATCCCCGCCATCAACTTCAGCACCAC
TGGCGCTTCCATTGTTTCTCATGAATTTCACTCCATTCACTACCCTGATTCTAAATTCCCATTCTCAAATTTTGTTCTTCACAATTATTGGAAAGCCAAGCTTAAATCTG
TCACTTCAGAAGGCGCCTGCATCATAGAAGGCTTTTTCAATTGCTTCAATGCTTCTTGCGATGTAATTCTGATGAATAGCTTCAGGGAGATAGAAGGGGAATATATGAAT
TATGTTTCCCTTTTGACGAAGAAGAAAGTAATCCCAGTTGGCCCTTTGGTGTACGAACCGAACGAGGAGGAGGAAGATGAAAATTATTCAAGAATCAAGAATTGGTTGGA
CAAAAAGGAAACTTTGTCAACAGTTCTGGTGTCATTAGGAAGCGAAAGGACCGCGTCGGAGGAAGAAATAAACGAAATAGGGAAAGGATTAGAGGAAAGTGAGGTGAATT
TCATATGGGTGGAAAGGAGTAATTCAAAAGGAGATGAAGAACAAAAGAGAAGGGAATTTGTGGAGATGGTTGGAGAAAGGGTGATGGTAGTGAAAGGATGGGCTCCACAG
GGGAAGATACTGAAGCATGGGAGCATCGGGGGATTTGTGAGCCATTGTGGATGGAATTCTGTGTTAGAGAGCATAACATTTGGGGTACCAATAATTGGAGTTCCGATGTT
TGGAGACCAACCCTTTAACGCCATAGTGGTGGAAGAAGCAGGGCTAGGAGTGGAGGCCAAGAGAGATTCCGACGGCAAAATTCAAAGAAAAGAAATTGCAAGGCTGATCA
AAGAAGTAGTAGTTGAGAAAACCAGAGAAGAGATAAGGATGAAGCCTATAATTTGCATAGCCATGGATGCCCAGCAAGCTGGAAGCAACACCCTCACCCCAACAACCATT
TTGATGTTTTCGTGGCTTGGCTATGGTCATCTCTCTGCTTACTTGGAGCTGGCCAAAGCTCTCTCTAGTAGGAATAATTTTCAAATCTACTTCTGTTCAACCCCTGTTAA
TCTTGACTCTATTAAACCAAAGCTAATTTCTTCTTCTTTTTCTTCCATCCAATTTGTGGAGCTTCATCTCCCTTCTTCTCCTGAATTTCCTTCTCATCTTCACACAACCA
ATGCCCTCCTCATTCACCTTACGCCCACTCTCCACCAAGCCTTCGTTGCGGCTGCCCCACGCTTTGAGGCAATTTTACAAACACTTTCTCCGCATCTCCTCATTTACGAC
TGTTTCCAGTCTTGGGCTCCTCGCTTAGCTTCCTCCCTCAATATCCCTGCCATCAACTTCAACACCTCTAGCGCTTTTGTCGTTTGTCATGGCTTTCACTCCATTTACTA
CCCTAATTCTAAATTCCCAGTCTCAGATTTTGTTCTTCACAATCATTGGAAAGCTAAGTTCACCTCCGCCCACAGCGTCAAAGAAGCCATTTCCAATAGCTTCAATGCTT
CTTGCGATGTAATTTTGACGAATAGTTTCAGAGAGGTGGATGGAAAATATATGGACTATCTCTCCTTTCTATTGAAGAAGAAAGTAATCTCGGTAGGACCTTTGGTGTAC
GAACCGAATGAGGAAGAGGAAGATGAAGATTATTGGAGGATCAAGAATTGGCTAGACAAAAAGGAAGCTCTATCGACGGTTCTGGTGTCATTTGGAAGTGAAAGCTACGC
CTTAGAGGAAGAAAAGGAGGAGGTTGGGAATGGGTTAGAGGAGAGTGAGACTAATTTCATATGGGTGGAAAGGGTTAGTCTCAAAGAAGATCAAGAACAAGAGAGAAGGG
GGTTTGTGGAGAGGGCTGGAGAAAGGGCGTTGGTTTTGAAAGGATGGGCTCCACAGGGGAAGATACTAAAGCATGGGAGCATTGGGGGATTTGTGAGTCATTGTGGATGG
AACTCTGTGTTGGAGAGCATCGTGTCTGGAGTACCCATAATTGGGGTTCCGATATCTGGGGACCAACCCTTTAATGTTGGAGTGGTGGAAGAAGCAGGGGTGGGGGTGGA
GGCCAAGAGAGATCCCAATGGCAAAATTCAAAGACAAGAAGTCGCAAAGCTGATCAAACAAGTAGTGGTTGAGAAAACCAGAGAAGAGTTGAGGATAAAAGTAAGAGAAA
TGAGTGAGAATTTGAGGAGAAAACGAGACAAGAAGATTGATGAGATGTTGGCTCAAATTTCTCTCTTGCGTAACAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGGCCAGCAAGGTGGAAGCAACACCTCCACTCCCACAACCATTTTGATGTTTCCATGGATTGGCTATGGCCATCTCTGTGCTTACTTAGAGTTAGCTAAAGCTCT
CTCTAGAAGAAATAATTTCCACATCTACTTCTGTTCAACCCCGGTTTGTCTTGATTCCATTAAACCAAAGCTAATTCCTTCTTCTTCCATCGAATTTGTGGAGTTTCATC
TTTTTCCTTCTCCTGAACTTCCTCCACATCTTCACACAACTAACGGCGTCCCCCCTCACATTGCGCTCACTCTCCACCAAGCTGCCACTGCCGCTGCCCCACGCTTTGAG
TCAATTTTACAAACACTTTCTCCACATCTCCTCATTTATGACTGTTTTCAACCATGGGCTCCTCGAATAGCTTCCACTCTCAACATCCCCGCCATCAACTTCAGCACCAC
TGGCGCTTCCATTGTTTCTCATGAATTTCACTCCATTCACTACCCTGATTCTAAATTCCCATTCTCAAATTTTGTTCTTCACAATTATTGGAAAGCCAAGCTTAAATCTG
TCACTTCAGAAGGCGCCTGCATCATAGAAGGCTTTTTCAATTGCTTCAATGCTTCTTGCGATGTAATTCTGATGAATAGCTTCAGGGAGATAGAAGGGGAATATATGAAT
TATGTTTCCCTTTTGACGAAGAAGAAAGTAATCCCAGTTGGCCCTTTGGTGTACGAACCGAACGAGGAGGAGGAAGATGAAAATTATTCAAGAATCAAGAATTGGTTGGA
CAAAAAGGAAACTTTGTCAACAGTTCTGGTGTCATTAGGAAGCGAAAGGACCGCGTCGGAGGAAGAAATAAACGAAATAGGGAAAGGATTAGAGGAAAGTGAGGTGAATT
TCATATGGGTGGAAAGGAGTAATTCAAAAGGAGATGAAGAACAAAAGAGAAGGGAATTTGTGGAGATGGTTGGAGAAAGGGTGATGGTAGTGAAAGGATGGGCTCCACAG
GGGAAGATACTGAAGCATGGGAGCATCGGGGGATTTGTGAGCCATTGTGGATGGAATTCTGTGTTAGAGAGCATAACATTTGGGGTACCAATAATTGGAGTTCCGATGTT
TGGAGACCAACCCTTTAACGCCATAGTGGTGGAAGAAGCAGGGCTAGGAGTGGAGGCCAAGAGAGATTCCGACGGCAAAATTCAAAGAAAAGAAATTGCAAGGCTGATCA
AAGAAGTAGTAGTTGAGAAAACCAGAGAAGAGATAAGGATGAAGCCTATAATTTGCATAGCCATGGATGCCCAGCAAGCTGGAAGCAACACCCTCACCCCAACAACCATT
TTGATGTTTTCGTGGCTTGGCTATGGTCATCTCTCTGCTTACTTGGAGCTGGCCAAAGCTCTCTCTAGTAGGAATAATTTTCAAATCTACTTCTGTTCAACCCCTGTTAA
TCTTGACTCTATTAAACCAAAGCTAATTTCTTCTTCTTTTTCTTCCATCCAATTTGTGGAGCTTCATCTCCCTTCTTCTCCTGAATTTCCTTCTCATCTTCACACAACCA
ATGCCCTCCTCATTCACCTTACGCCCACTCTCCACCAAGCCTTCGTTGCGGCTGCCCCACGCTTTGAGGCAATTTTACAAACACTTTCTCCGCATCTCCTCATTTACGAC
TGTTTCCAGTCTTGGGCTCCTCGCTTAGCTTCCTCCCTCAATATCCCTGCCATCAACTTCAACACCTCTAGCGCTTTTGTCGTTTGTCATGGCTTTCACTCCATTTACTA
CCCTAATTCTAAATTCCCAGTCTCAGATTTTGTTCTTCACAATCATTGGAAAGCTAAGTTCACCTCCGCCCACAGCGTCAAAGAAGCCATTTCCAATAGCTTCAATGCTT
CTTGCGATGTAATTTTGACGAATAGTTTCAGAGAGGTGGATGGAAAATATATGGACTATCTCTCCTTTCTATTGAAGAAGAAAGTAATCTCGGTAGGACCTTTGGTGTAC
GAACCGAATGAGGAAGAGGAAGATGAAGATTATTGGAGGATCAAGAATTGGCTAGACAAAAAGGAAGCTCTATCGACGGTTCTGGTGTCATTTGGAAGTGAAAGCTACGC
CTTAGAGGAAGAAAAGGAGGAGGTTGGGAATGGGTTAGAGGAGAGTGAGACTAATTTCATATGGGTGGAAAGGGTTAGTCTCAAAGAAGATCAAGAACAAGAGAGAAGGG
GGTTTGTGGAGAGGGCTGGAGAAAGGGCGTTGGTTTTGAAAGGATGGGCTCCACAGGGGAAGATACTAAAGCATGGGAGCATTGGGGGATTTGTGAGTCATTGTGGATGG
AACTCTGTGTTGGAGAGCATCGTGTCTGGAGTACCCATAATTGGGGTTCCGATATCTGGGGACCAACCCTTTAATGTTGGAGTGGTGGAAGAAGCAGGGGTGGGGGTGGA
GGCCAAGAGAGATCCCAATGGCAAAATTCAAAGACAAGAAGTCGCAAAGCTGATCAAACAAGTAGTGGTTGAGAAAACCAGAGAAGAGTTGAGGATAAAAGTAAGAGAAA
TGAGTGAGAATTTGAGGAGAAAACGAGACAAGAAGATTGATGAGATGTTGGCTCAAATTTCTCTCTTGCGTAACAATTGA
Protein sequenceShow/hide protein sequence
MDGQQGGSNTSTPTTILMFPWIGYGHLCAYLELAKALSRRNNFHIYFCSTPVCLDSIKPKLIPSSSIEFVEFHLFPSPELPPHLHTTNGVPPHIALTLHQAATAAAPRFE
SILQTLSPHLLIYDCFQPWAPRIASTLNIPAINFSTTGASIVSHEFHSIHYPDSKFPFSNFVLHNYWKAKLKSVTSEGACIIEGFFNCFNASCDVILMNSFREIEGEYMN
YVSLLTKKKVIPVGPLVYEPNEEEEDENYSRIKNWLDKKETLSTVLVSLGSERTASEEEINEIGKGLEESEVNFIWVERSNSKGDEEQKRREFVEMVGERVMVVKGWAPQ
GKILKHGSIGGFVSHCGWNSVLESITFGVPIIGVPMFGDQPFNAIVVEEAGLGVEAKRDSDGKIQRKEIARLIKEVVVEKTREEIRMKPIICIAMDAQQAGSNTLTPTTI
LMFSWLGYGHLSAYLELAKALSSRNNFQIYFCSTPVNLDSIKPKLISSSFSSIQFVELHLPSSPEFPSHLHTTNALLIHLTPTLHQAFVAAAPRFEAILQTLSPHLLIYD
CFQSWAPRLASSLNIPAINFNTSSAFVVCHGFHSIYYPNSKFPVSDFVLHNHWKAKFTSAHSVKEAISNSFNASCDVILTNSFREVDGKYMDYLSFLLKKKVISVGPLVY
EPNEEEEDEDYWRIKNWLDKKEALSTVLVSFGSESYALEEEKEEVGNGLEESETNFIWVERVSLKEDQEQERRGFVERAGERALVLKGWAPQGKILKHGSIGGFVSHCGW
NSVLESIVSGVPIIGVPISGDQPFNVGVVEEAGVGVEAKRDPNGKIQRQEVAKLIKQVVVEKTREELRIKVREMSENLRRKRDKKIDEMLAQISLLRNN