; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G18230 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G18230
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGlycosyltransferase
Genome locationClcChr11:28868222..28877969
RNA-Seq ExpressionClc11G18230
SyntenyClc11G18230
Gene Ontology termsGO:0071577 - zinc ion transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0005385 - zinc ion transmembrane transporter activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
InterPro domainsIPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649804.1 hypothetical protein Csa_012084 [Cucumis sativus]1.3e-17570.42Show/hide
Query:  MFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFT
        MFELIFIPAPG+GHLASTVEMANVL TRDHRLS+T+LAMKLPYDVKVAECIESLS+SFAGK+IQFNVLPEPPLPEESKKDFIVLVESYKPYVRE V+N T
Subjt:  MFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFT

Query:  TSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKA
         SA TS+DSPRLVGLVIDMFCTTMIDV NEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDN ELTLPNFVNPIPSKLIP+LFSNK KA
Subjt:  TSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKA

Query:  VWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGD-ILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQ
        VWFH+HIKRFRLEIKGILINTF EMESHV KSYSQVLPPLYFVGPVLHLKN G+AGSSEAQN  D I+KWLDDQPPSSVVLVCFGTMVSFDE QV  IA 
Subjt:  VWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGD-ILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQ

Query:  ALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD-------------------------------------------------------------------
        ALEESGVRFIWSLRQPPPKGKFEAP+NY DI++                                                                   
Subjt:  ALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD-------------------------------------------------------------------

Query:  --------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM
                                              KLMGEES+E+RKKVKAKSEESRKSVMEGGSSF+SLGKFIDD+
Subjt:  --------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM

RXH90235.1 hypothetical protein DVH24_032592 [Malus domestica]7.3e-20634.56Show/hide
Query:  GTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKD-GNHFLLDVVLESYKPHVKQALISSLTTTT
        G GHLAS+ E+A +LVARD RL + +L +KLP      EY   L      ++   IRFI LPE    +K       L + +E++K HV+ A ++ LT+  
Subjt:  GTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKD-GNHFLLDVVLESYKPHVKQALISSLTTTT

Query:  ------NHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWL
                L GFV+DMFCTTM+DVANEF VP+YV++TS A +L    HL+ L+ + N       +LK+SD +L +  LVN +P++V+PG++    +A   
Subjt:  ------NHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWL

Query:  HEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMD---HVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRAL
            +R R E KG+L+NTF E+ESH + +L   +  + PPLY VGP+L+LK + E  +     ++L+WLDDQPP SVVFLCFGS G    DQV+EI  AL
Subjt:  HEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMD---HVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRAL

Query:  EQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ-----------------------------------------------
        E  G+RF+WSLR+P P G    A DY +   +LP+GFL+RT   GKVIGWAPQ                                               
Subjt:  EQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ-----------------------------------------------

Query:  --------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKF-------IDDKFELDFIPMPGTGHLASTVE
                       ++   +V AEEIERGIR++M +  +E RKKV+  SE+ R+++++GGSS  SL          +    +L FIP  GTGHL S VE
Subjt:  --------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKF-------IDDKFELDFIPMPGTGHLASTVE

Query:  MANILVARDPRLTVAILAIKLSFEVKANEC-IQSLSNN---TSIRFIDLPELPP--PPKGGNHFFLKVVLESYKPHVKQAL--ISSLTTTTNHLAGFVLD
        MA +LV+RD +L + +L +KL FE K  E  I SL ++   + + FI LP+       +   + F    +ES+KP+VK A+  ++   + +  +AGFV+D
Subjt:  MANILVARDPRLTVAILAIKLSFEVKANEC-IQSLSNN---TSIRFIDLPELPP--PPKGGNHFFLKVVLESYKPHVKQAL--ISSLTTTTNHLAGFVLD

Query:  MFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSCNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIF-----------FIIR-SEVKGILI
        MFCTTM+DVAN+F VP+Y+++TS A +L    +++   T  +   + + + K SD  L++ + V+ VP+ V+P +            +  R  E+KGILI
Subjt:  MFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSCNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIF-----------FIIR-SEVKGILI

Query:  NTFEELEAHVICSLSTDSSLQLPPFG-------------------------------SSSVVFLCFGSRGSFEKDQVEEIGRALERSGVRFIWSLRRPSP
        NTF ELE+H + SLS   +  + P G                                SSVVFLCFGS GSF +DQV EI   LE+SG RF+WSLR+P P
Subjt:  NTFEELEAHVICSLSTDSSLQLPPFG-------------------------------SSSVVFLCFGSRGSFEKDQVEEIGRALERSGVRFIWSLRRPSP

Query:  GGAFEVAIDYSNFEDILPEGFLNRTVDVGRVIGWAPQMFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHI
         G      DYS+ + +LPEGFL RT  +G+VIGWAPQ      +  P +G   S     + L                          ESL         
Subjt:  GGAFEVAIDYSNFEDILPEGFLNRTVDVGRVIGWAPQMFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHI

Query:  QFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFTTSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNE
                         F V V ++  Y  + +  F        +  R +GL +++      D+  E+                       + + +    
Subjt:  QFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFTTSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNE

Query:  VVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKAVWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNY
         +++L+  D+D             K +  +     KA+      +  RLE                                   L+      S   +++
Subjt:  VVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKAVWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNY

Query:  GDIL-KWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKDKLMGEESDEIRKKVKAKSEESRKSVMEGGS
        GD    WL     +S + V  G    + +D                     + P     E  R     + K M E+  ++RK++K     S+K++M   S
Subjt:  GDIL-KWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKDKLMGEESDEIRKKVKAKSEESRKSVMEGGS

Query:  SFISLGKFIDDMKKFELVFIPGPGIGHLASTVEMANMLVTRDHLLVVTVLVMQLSYDNKAMDHIQLLSASFAGKSIHLILLPEFPLPEECQNGMPQP---
        S+ S G+F+   +  +LVF+P P IGH+ STVEMA  LV RD  L +TVLVM+L +     +     + S     I+ + LPE     + Q+ +P P   
Subjt:  SFISLGKFIDDMKKFELVFIPGPGIGHLASTVEMANMLVTRDHLLVVTVLVMQLSYDNKAMDHIQLLSASFAGKSIHLILLPEFPLPEECQNGMPQP---

Query:  ---LIEIYKPHVREAMAK---HANSQTSPDLPQLVGFVLDMFCMTMIDVAKEFEVPCYVFYTSSAAFLALNFHLQELYDQSNSNKVVEQLKNSDSEALTI
           L+E +  HVR+A+      ++   S   P+L GFVLDMFC +++DVA EF+VP Y+F+TS+A+ LAL FH Q L D+   +  + +L +S +E L +
Subjt:  ---LIEIYKPHVREAMAK---HANSQTSPDLPQLVGFVLDMFCMTMIDVAKEFEVPCYVFYTSSAAFLALNFHLQELYDQSNSNKVVEQLKNSDSEALTI

Query:  PSFVKPIPGKVIPSIFVYNDIAVWLYENTRKFRTEIKGVLINTCAEIESHWL-ILTS-----PLYFVGPILNLKNTVNQ--VNILKWLDDQPQASVVFLC
        PSF  P P  V+P  F+  +    +  +  +F+ + KG+L+NT  E+ESH L  L S     P+Y VGP+LNLK++      ++L+WLDDQP  SVVFLC
Subjt:  PSFVKPIPGKVIPSIFVYNDIAVWLYENTRKFRTEIKGVLINTCAEIESHWL-ILTS-----PLYFVGPILNLKNTVNQ--VNILKWLDDQPQASVVFLC

Query:  FGSMGSFDEDQVKEIARGLEHCGVRFLWSLRQPPPKGKWVAPSDYADIKDV
        FGSMGSF E QVKEIA  LEH G RFLWSLR+PPPKGK   PSDY D+K V
Subjt:  FGSMGSFDEDQVKEIARGLEHCGVRFLWSLRQPPPKGKWVAPSDYADIKDV

XP_004146065.2 anthocyanidin 3-O-glucosyltransferase 2 [Cucumis sativus]7.9e-17670.27Show/hide
Query:  QMFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNF
        +MFELIFIPAPG+GHLASTVEMANVL TRDHRLS+T+LAMKLPYDVKVAECIESLS+SFAGK+IQFNVLPEPPLPEESKKDFIVLVESYKPYVRE V+N 
Subjt:  QMFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNF

Query:  TTSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVK
        T SA TS+DSPRLVGLVIDMFCTTMIDV NEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDN ELTLPNFVNPIPSKLIP+LFSNK K
Subjt:  TTSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVK

Query:  AVWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGD-ILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIA
        AVWFH+HIKRFRLEIKGILINTF EMESHV KSYSQVLPPLYFVGPVLHLKN G+AGSSEAQN  D I+KWLDDQPPSSVVLVCFGTMVSFDE QV  IA
Subjt:  AVWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGD-ILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIA

Query:  QALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD------------------------------------------------------------------
         ALEESGVRFIWSLRQPPPKGKFEAP+NY DI++                                                                  
Subjt:  QALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD------------------------------------------------------------------

Query:  ---------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM
                                               KLMGEES+E+RKKVKAKSEESRKSVMEGGSSF+SLGKFIDD+
Subjt:  ---------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM

XP_008464636.1 PREDICTED: anthocyanidin 3-O-glucosyltransferase 2-like [Cucumis melo]2.7e-17670.83Show/hide
Query:  MFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFT
        MFELIFIPAPG+GHLASTVEMANVL TRDHRLS+T+LAMKLPYDVKVAECIESLS+SFAGK+IQFNVLPEPPLPEESKKDFIVLVESYKPYVREAV+NFT
Subjt:  MFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFT

Query:  TSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKA
         SA TSLDSPRLVGLVIDMFCTTMIDV NEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDN ELTLPNF NPIPSKLIP+LFSNK KA
Subjt:  TSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKA

Query:  VWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGD-ILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQ
        VWFH+HIKRFRLEIKGILINTF EMESH  KSYSQVLPPLYFVGPVLHLKN G+AGSSEAQ+  D I+KWLDDQPPSSVVLVCFGTMVSFDE QV  IA 
Subjt:  VWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGD-ILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQ

Query:  ALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD-------------------------------------------------------------------
        ALEESGVRFIWSLRQPPPKGKFEAPRNY D+K+                                                                   
Subjt:  ALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD-------------------------------------------------------------------

Query:  --------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM
                                              KLMGEESDE+RKKVKAKSEES+KSVMEGGSSFISLGKFIDD+
Subjt:  --------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM

XP_038899112.1 anthocyanidin 3-O-glucosyltransferase 2-like [Benincasa hispida]3.6e-18172.29Show/hide
Query:  QMFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNF
        +MFELIF+PAPG+GHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLS+SFAGK+IQ NVLPEPPLPEESKKDFIVLVESYKPYVREAVT+F
Subjt:  QMFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNF

Query:  TTSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVK
        T SAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQEL+EENGSNEVVE+LLNSDN E+TLPNFVNPIP+K+IPSLFSNK K
Subjt:  TTSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVK

Query:  AVWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGDILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQ
        AVWFH+HIKRFRLEIKGILINTF+EMESHVVKSYSQVLPPLYFVGPVLHLKNTG+AGSSEAQN  DIL+WLDDQPPSSVVLVCFGTMVSFDEDQVV IA 
Subjt:  AVWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGDILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQ

Query:  ALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD-------------------------------------------------------------------
        ALEESGVRFIWSLRQPPPKGKFEAPRNYT IKD                                                                   
Subjt:  ALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD-------------------------------------------------------------------

Query:  --------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM
                                              KLMGE SD++RKKVKAKSEESRKSVMEGGSSFISLGKFIDD+
Subjt:  --------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM

TrEMBL top hitse value%identityAlignment
A0A1S3CLY5 Glycosyltransferase1.3e-17670.83Show/hide
Query:  MFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFT
        MFELIFIPAPG+GHLASTVEMANVL TRDHRLS+T+LAMKLPYDVKVAECIESLS+SFAGK+IQFNVLPEPPLPEESKKDFIVLVESYKPYVREAV+NFT
Subjt:  MFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFT

Query:  TSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKA
         SA TSLDSPRLVGLVIDMFCTTMIDV NEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDN ELTLPNF NPIPSKLIP+LFSNK KA
Subjt:  TSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKA

Query:  VWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGD-ILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQ
        VWFH+HIKRFRLEIKGILINTF EMESH  KSYSQVLPPLYFVGPVLHLKN G+AGSSEAQ+  D I+KWLDDQPPSSVVLVCFGTMVSFDE QV  IA 
Subjt:  VWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGD-ILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQ

Query:  ALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD-------------------------------------------------------------------
        ALEESGVRFIWSLRQPPPKGKFEAPRNY D+K+                                                                   
Subjt:  ALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD-------------------------------------------------------------------

Query:  --------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM
                                              KLMGEESDE+RKKVKAKSEES+KSVMEGGSSFISLGKFIDD+
Subjt:  --------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM

A0A1S3CLY9 Glycosyltransferase7.7e-16969.15Show/hide
Query:  MPGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSLTTT
        +PG+GHLAS  EMAN L+ARD RL V ++AIKLP DAK NEYIQSL AQSLTNN  SI+FI+LPELPPP  D N    +VVLESYKPHVKQALIS LTT+
Subjt:  MPGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSLTTT

Query:  TNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLHEQVK
        TNHL GFVLD FC TMVDVANEF+VPSYVYYTSSAAYLAFS HLEQLYTQ+NSSNEVIQQ KDS+VN SVSSLVNQVPSKVIP +FFIN  AVW HEQ K
Subjt:  TNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLHEQVK

Query:  RIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALEQSGVRFI
        RIRF+VKGVLINTF+E+ESHVISSLSTDSSLQLPPLY VGP+LHL KN ETMD   VLKWLDDQP  SVVFLCFGSRG+ +KDQVEEI RALE+S VRFI
Subjt:  RIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALEQSGVRFI

Query:  WSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ-------------------------------------------------------
        WSLRRPS G  F+ +IDYTNFEDILPEGFL+RT ++G+VI WAPQ                                                       
Subjt:  WSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ-------------------------------------------------------

Query:  -----------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDD
                   E DKP+I+SAEEIE+GIRKLMDDNNNEIRKKVKTKSEECRKSV+EGGSSFISL KFIDD
Subjt:  -----------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDD

A0A498J662 Uncharacterized protein3.5e-20634.56Show/hide
Query:  GTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKD-GNHFLLDVVLESYKPHVKQALISSLTTTT
        G GHLAS+ E+A +LVARD RL + +L +KLP      EY   L      ++   IRFI LPE    +K       L + +E++K HV+ A ++ LT+  
Subjt:  GTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKD-GNHFLLDVVLESYKPHVKQALISSLTTTT

Query:  ------NHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWL
                L GFV+DMFCTTM+DVANEF VP+YV++TS A +L    HL+ L+ + N       +LK+SD +L +  LVN +P++V+PG++    +A   
Subjt:  ------NHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWL

Query:  HEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMD---HVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRAL
            +R R E KG+L+NTF E+ESH + +L   +  + PPLY VGP+L+LK + E  +     ++L+WLDDQPP SVVFLCFGS G    DQV+EI  AL
Subjt:  HEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMD---HVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRAL

Query:  EQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ-----------------------------------------------
        E  G+RF+WSLR+P P G    A DY +   +LP+GFL+RT   GKVIGWAPQ                                               
Subjt:  EQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ-----------------------------------------------

Query:  --------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKF-------IDDKFELDFIPMPGTGHLASTVE
                       ++   +V AEEIERGIR++M +  +E RKKV+  SE+ R+++++GGSS  SL          +    +L FIP  GTGHL S VE
Subjt:  --------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKF-------IDDKFELDFIPMPGTGHLASTVE

Query:  MANILVARDPRLTVAILAIKLSFEVKANEC-IQSLSNN---TSIRFIDLPELPP--PPKGGNHFFLKVVLESYKPHVKQAL--ISSLTTTTNHLAGFVLD
        MA +LV+RD +L + +L +KL FE K  E  I SL ++   + + FI LP+       +   + F    +ES+KP+VK A+  ++   + +  +AGFV+D
Subjt:  MANILVARDPRLTVAILAIKLSFEVKANEC-IQSLSNN---TSIRFIDLPELPP--PPKGGNHFFLKVVLESYKPHVKQAL--ISSLTTTTNHLAGFVLD

Query:  MFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSCNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIF-----------FIIR-SEVKGILI
        MFCTTM+DVAN+F VP+Y+++TS A +L    +++   T  +   + + + K SD  L++ + V+ VP+ V+P +            +  R  E+KGILI
Subjt:  MFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSCNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIF-----------FIIR-SEVKGILI

Query:  NTFEELEAHVICSLSTDSSLQLPPFG-------------------------------SSSVVFLCFGSRGSFEKDQVEEIGRALERSGVRFIWSLRRPSP
        NTF ELE+H + SLS   +  + P G                                SSVVFLCFGS GSF +DQV EI   LE+SG RF+WSLR+P P
Subjt:  NTFEELEAHVICSLSTDSSLQLPPFG-------------------------------SSSVVFLCFGSRGSFEKDQVEEIGRALERSGVRFIWSLRRPSP

Query:  GGAFEVAIDYSNFEDILPEGFLNRTVDVGRVIGWAPQMFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHI
         G      DYS+ + +LPEGFL RT  +G+VIGWAPQ      +  P +G   S     + L                          ESL         
Subjt:  GGAFEVAIDYSNFEDILPEGFLNRTVDVGRVIGWAPQMFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHI

Query:  QFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFTTSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNE
                         F V V ++  Y  + +  F        +  R +GL +++      D+  E+                       + + +    
Subjt:  QFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFTTSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNE

Query:  VVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKAVWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNY
         +++L+  D+D             K +  +     KA+      +  RLE                                   L+      S   +++
Subjt:  VVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKAVWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNY

Query:  GDIL-KWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKDKLMGEESDEIRKKVKAKSEESRKSVMEGGS
        GD    WL     +S + V  G    + +D                     + P     E  R     + K M E+  ++RK++K     S+K++M   S
Subjt:  GDIL-KWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKDKLMGEESDEIRKKVKAKSEESRKSVMEGGS

Query:  SFISLGKFIDDMKKFELVFIPGPGIGHLASTVEMANMLVTRDHLLVVTVLVMQLSYDNKAMDHIQLLSASFAGKSIHLILLPEFPLPEECQNGMPQP---
        S+ S G+F+   +  +LVF+P P IGH+ STVEMA  LV RD  L +TVLVM+L +     +     + S     I+ + LPE     + Q+ +P P   
Subjt:  SFISLGKFIDDMKKFELVFIPGPGIGHLASTVEMANMLVTRDHLLVVTVLVMQLSYDNKAMDHIQLLSASFAGKSIHLILLPEFPLPEECQNGMPQP---

Query:  ---LIEIYKPHVREAMAK---HANSQTSPDLPQLVGFVLDMFCMTMIDVAKEFEVPCYVFYTSSAAFLALNFHLQELYDQSNSNKVVEQLKNSDSEALTI
           L+E +  HVR+A+      ++   S   P+L GFVLDMFC +++DVA EF+VP Y+F+TS+A+ LAL FH Q L D+   +  + +L +S +E L +
Subjt:  ---LIEIYKPHVREAMAK---HANSQTSPDLPQLVGFVLDMFCMTMIDVAKEFEVPCYVFYTSSAAFLALNFHLQELYDQSNSNKVVEQLKNSDSEALTI

Query:  PSFVKPIPGKVIPSIFVYNDIAVWLYENTRKFRTEIKGVLINTCAEIESHWL-ILTS-----PLYFVGPILNLKNTVNQ--VNILKWLDDQPQASVVFLC
        PSF  P P  V+P  F+  +    +  +  +F+ + KG+L+NT  E+ESH L  L S     P+Y VGP+LNLK++      ++L+WLDDQP  SVVFLC
Subjt:  PSFVKPIPGKVIPSIFVYNDIAVWLYENTRKFRTEIKGVLINTCAEIESHWL-ILTS-----PLYFVGPILNLKNTVNQ--VNILKWLDDQPQASVVFLC

Query:  FGSMGSFDEDQVKEIARGLEHCGVRFLWSLRQPPPKGKWVAPSDYADIKDV
        FGSMGSF E QVKEIA  LEH G RFLWSLR+PPPKGK   PSDY D+K V
Subjt:  FGSMGSFDEDQVKEIARGLEHCGVRFLWSLRQPPPKGKWVAPSDYADIKDV

A0A5D3DW35 Glycosyltransferase7.7e-16969.15Show/hide
Query:  MPGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSLTTT
        +PG+GHLAS  EMAN L+ARD RL V ++AIKLP DAK NEYIQSL AQSLTNN  SI+FI+LPELPPP  D N    +VVLESYKPHVKQALIS LTT+
Subjt:  MPGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSLTTT

Query:  TNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLHEQVK
        TNHL GFVLD FC TMVDVANEF+VPSYVYYTSSAAYLAFS HLEQLYTQ+NSSNEVIQQ KDS+VN SVSSLVNQVPSKVIP +FFIN  AVW HEQ K
Subjt:  TNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLHEQVK

Query:  RIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALEQSGVRFI
        RIRF+VKGVLINTF+E+ESHVISSLSTDSSLQLPPLY VGP+LHL KN ETMD   VLKWLDDQP  SVVFLCFGSRG+ +KDQVEEI RALE+S VRFI
Subjt:  RIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALEQSGVRFI

Query:  WSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ-------------------------------------------------------
        WSLRRPS G  F+ +IDYTNFEDILPEGFL+RT ++G+VI WAPQ                                                       
Subjt:  WSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ-------------------------------------------------------

Query:  -----------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDD
                   E DKP+I+SAEEIE+GIRKLMDDNNNEIRKKVKTKSEECRKSV+EGGSSFISL KFIDD
Subjt:  -----------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDD

A0A5D3DX75 Glycosyltransferase1.3e-17670.83Show/hide
Query:  MFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFT
        MFELIFIPAPG+GHLASTVEMANVL TRDHRLS+T+LAMKLPYDVKVAECIESLS+SFAGK+IQFNVLPEPPLPEESKKDFIVLVESYKPYVREAV+NFT
Subjt:  MFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAGKHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFT

Query:  TSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKA
         SA TSLDSPRLVGLVIDMFCTTMIDV NEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDN ELTLPNF NPIPSKLIP+LFSNK KA
Subjt:  TSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLNSDNDELTLPNFVNPIPSKLIPSLFSNKVKA

Query:  VWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGD-ILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQ
        VWFH+HIKRFRLEIKGILINTF EMESH  KSYSQVLPPLYFVGPVLHLKN G+AGSSEAQ+  D I+KWLDDQPPSSVVLVCFGTMVSFDE QV  IA 
Subjt:  VWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGD-ILKWLDDQPPSSVVLVCFGTMVSFDEDQVVAIAQ

Query:  ALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD-------------------------------------------------------------------
        ALEESGVRFIWSLRQPPPKGKFEAPRNY D+K+                                                                   
Subjt:  ALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKD-------------------------------------------------------------------

Query:  --------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM
                                              KLMGEESDE+RKKVKAKSEES+KSVMEGGSSFISLGKFIDD+
Subjt:  --------------------------------------KLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDM

SwissProt top hitse value%identityAlignment
D3THI6 UDP-glycosyltransferase 71A153.8e-8039.87Show/hide
Query:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTS----IRFIVLPELPPPSKD-----GNHFLLDVVLESYKPHVKQA
        PG GH+ STVEMA  L ARD +L + +L +KLP+            AQ  TN ++S    I F+ LPE  P  +D     G+ F + V  E++K HV+ A
Subjt:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTS----IRFIVLPELPPPSKD-----GNHFLLDVVLESYKPHVKQA

Query:  LISSL-------TTTTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGI
        +I+ L       +T+   LAGFVLDMF  +++DVANEF+VPSY+++TS+A+ LA   H + L  +       I +L  S   L+V S +N  P+ V+PG 
Subjt:  LISSL-------TTTTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGI

Query:  FFINKAAVWLHEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQV
            ++       V + + + KG+L+NTF E+ESH +  L  DS  ++PP+Y VGP+L+LK + E     D+L+WLDDQPP SVVFLCFGS GS  + QV
Subjt:  FFINKAAVWLHEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQV

Query:  EEIGRALEQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ----------------------------------------
        +EI  ALE SG RF+WSLRRP P G   +  DY + + +LPEGFL+RT  VGKVIGWAPQ                                        
Subjt:  EEIGRALEQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ----------------------------------------

Query:  ---------------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFID
                              +D   +VSAE+IERGIR++M + ++++RK+VK  SE+ +K++V+GGSS+ SL +FID
Subjt:  ---------------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFID

D3UAG1 UDP-glycosyltransferase 71A161.6e-7839.46Show/hide
Query:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTS----IRFIVLPELPPPSKD-----GNHFLLDVVLESYKPHVKQA
        PG GH+ STVEMA  LVARD +L + +L +KLP+D            Q  TN ++S    I F+ LPE     +D     G+ F + V  E++K HV+ A
Subjt:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTS----IRFIVLPELPPPSKD-----GNHFLLDVVLESYKPHVKQA

Query:  LISSL-------TTTTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGI
        +I+ L       +T+   LAGFVLDMF  +++DVANEF+VPSYV++TS+++ LA   H + L  +       I +L  S   L+V S +N  P  V+PG 
Subjt:  LISSL-------TTTTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGI

Query:  FFINKAAVWLHEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQV
        F   ++       V R + + KG+L+NTF E+ESH +  L  DS +++PP+Y VGP+L+LK + E     D+L+WLDDQPP SVVFLCFGS GS    QV
Subjt:  FFINKAAVWLHEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQV

Query:  EEIGRALEQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ----------------------------------------
        +EI   LE SG RF+WSLR+P   G   +  DY + + +LPEGFL+RT  VG+VIGWAPQ                                        
Subjt:  EEIGRALEQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ----------------------------------------

Query:  ---------------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFID
                              +D   +VSAE+IERGIR++M + ++++RK+VK  SE+ +K++V+GGSS+ SL +FID
Subjt:  ---------------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFID

Q2V6K0 UDP-glucose flavonoid 3-O-glucosyltransferase 69.0e-8240.3Show/hide
Query:  MPGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANE-YIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSLTT
        +PG GH+ STVE+A +L+ RD  L + IL +K PF A  ++ YI+SL+    +     IRF+ LP+          F     ++S+K HVK A+   + T
Subjt:  MPGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANE-YIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSLTT

Query:  --TTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVP-SKVIPGIFFINKAAVWLH
           T  +AGFV+DMFCT M+D+ANEF +PSYV+YTS AA L   FHL+ L    +  N+   + KDSD  L VSS VN +P ++V+P + F  +   +  
Subjt:  --TTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVP-SKVIPGIFFINKAAVWLH

Query:  EQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLK------KNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGR
           KR R E KG+L+NTF E+E H I SLS+D   ++ P+Y VGP+L++K       + ++    D+L+WLDDQPPSSVVFLCFGS G   +DQV+EI  
Subjt:  EQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLK------KNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGR

Query:  ALEQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ---------------------------------------------
        ALEQ G+RF+WSLR+PS         DYT+++ +LPEGFL+RT D+GKVIGWAPQ                                             
Subjt:  ALEQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ---------------------------------------------

Query:  ----------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFID
                         +D   IVS E IE+GI+++M +  +E+RK+VK  S+  RK++ E GSS+ SL +F+D
Subjt:  ----------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFID

Q66PF3 Putative UDP-glucose flavonoid 3-O-glucosyltransferase 39.3e-8741.19Show/hide
Query:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANE-YIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGN-HFLLDVVLESYKPHVKQALISSLTT
        PG GHL ST+E+A +LV+RD +L + +L +  P  +K  + Y+QSL A S +  +  I FI LP       +G+    L   +ES +PHVK A+ +   +
Subjt:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANE-YIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGN-HFLLDVVLESYKPHVKQALISSLTT

Query:  TTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLHEQV
         T  LAGFV+DMFCTTM++VAN+  VPSYV++TS AA L   FHL++L  Q N       + KDSD  L + S  N +P+KV+PG   +  +A      +
Subjt:  TTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLHEQV

Query:  KRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMD------HVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALE
        KR R E KG+L+NTF ++ESH + +LS+D+  ++PP+Y VGP+L+L  N   +D        D+LKWLDDQPP SVVFLCFGS GS ++ QV EI  ALE
Subjt:  KRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMD------HVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALE

Query:  QSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ------------------------------------------------
         +G RF+WSLRR  P G      DY +   +LPEGFL+RT  +GKVIGWAPQ                                                
Subjt:  QSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ------------------------------------------------

Query:  -------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFID
                         P +VSA+EIERGIR++M+ ++++IRK+VK  SE+ +K++++GGSS+ SL  FID
Subjt:  -------------EQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFID

Q6VAB2 UDP-glycosyltransferase 71E14.9e-7238.35Show/hide
Query:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLP--ELPPPSKDGNHFLLDVVLESYKPHVKQALISSLTT
        PG GHL  TVE+A +L+ RD RL+V I+ + L    K N   +            S+RF+ +P  E        N F +   +E +KP V+  +   + +
Subjt:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLP--ELPPPSKDGNHFLLDVVLESYKPHVKQALISSLTT

Query:  TTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLH-EQ
         +  LAGFVLDMFC  M DVANEF VPSY Y+TS AA L   FHL+    + +       +LK+SD  LSV S VN VP+KV+P +    +    +  + 
Subjt:  TTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLH-EQ

Query:  VKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALEQSGVR
         +RIR E KG+++N+ + +E H +  LS++++  +PP++ VGP+L+L+   +     ++++WL++QP SSVVFLCFGS GS  + QV+EI  A+E+SG R
Subjt:  VKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALEQSGVR

Query:  FIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ--------------------------------------EQ-------------
        F+WSLRRP+P    E   +Y N E++LPEGFL RT  +GKVIGWAPQ                                      EQ             
Subjt:  FIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ--------------------------------------EQ-------------

Query:  ----------------DKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFID
                        D    V+ EEIE GIRKLM D   EIR KVK   E+ R +VVEGGSS+ S+ KFI+
Subjt:  ----------------DKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFID

Arabidopsis top hitse value%identityAlignment
AT3G21750.1 UDP-glucosyl transferase 71B18.4e-6733.05Show/hide
Query:  PGTGHLASTVEMANILVARDPRLTVAILAI--KLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSL--
        PG GH+ +T  +A +LVA D RL+V ++ I  ++  DA ++ Y         TN+   +R+I+L     P++D    L+  + +S KP V+ A++S +  
Subjt:  PGTGHLASTVEMANILVARDPRLTVAILAI--KLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSL--

Query:  ---TTTTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVW
           T + + LAG V+DMFCT+M+D+A+EF + +Y++YTS+A+YL   FH++ LY +       + + KD+++   V +L    P+K +P +    K   +
Subjt:  ---TTTTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVW

Query:  LHEQVKRIRFEVKGVLINTFEEMESHVISSLS-TDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALE
        +  + +  R   KG+L+N+  +ME   +S  S  + +  +PP+Y+VGP++ L+ + +     ++L WL +QP  SVVFLCFGS G   ++Q  EI  ALE
Subjt:  LHEQVKRIRFEVKGVLINTFEEMESHVISSLS-TDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALE

Query:  QSGVRFIWSLRRPSPGG--AFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ--------------------------------------EQ------
        +SG RF+WSLRR SP G  +     ++TN E+ILP+GFL+RTV++GK+I WAPQ                                      EQ      
Subjt:  QSGVRFIWSLRRPSPGG--AFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ--------------------------------------EQ------

Query:  ---------------------DKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDD
                             ++P+IV+A+EIERGI+  M + ++++RK+V    ++   ++V+GGSS  +L+KF+ D
Subjt:  ---------------------DKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDD

AT3G21760.1 UDP-Glycosyltransferase superfamily protein4.3e-7136.67Show/hide
Query:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPF---DAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSLT
        PG GHL   VE+A + V RD  L++ I+ I        + ++ YI SLS+ S      S   + +P+ P       HF     ++++KP VK A +  LT
Subjt:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPF---DAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSLT

Query:  -----TTTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSD-VNLSVSSLVNQVPSKVIPGIFFINKAA
              + + LAGFV+DMFC  M+DVANEF VPSY++YTS+A +L    H+E LY   N     +  LKDSD   L V  L   +P K  P +    +  
Subjt:  -----TTTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSD-VNLSVSSLVNQVPSKVIPGIFFINKAA

Query:  VWLHEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLK---KNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIG
          +  Q +R R E KG+L+NTF E+E   +   S   S  LP +Y+VGPV++LK    N       ++L+WLD+QP  SVVFLCFGS G   + Q +EI 
Subjt:  VWLHEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLK---KNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIG

Query:  RALEQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ--------------------------------------EQD---
         ALE+SG RF+WSLRR  P G+     ++TN E+ILPEGFL RT ++GK++GWAPQ                                      EQ    
Subjt:  RALEQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ--------------------------------------EQD---

Query:  ------------------------KPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDD
                                  ++++AEEIERGIR LM + ++++R +VK  SE+   ++++GGSS ++L KFI D
Subjt:  ------------------------KPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDD

AT3G21780.1 UDP-glucosyl transferase 71B63.4e-6836.48Show/hide
Query:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQA---LISSLT
        P   HL +TVEMA  LV ++  L++ ++ I   F +K    I SL+    +NN      I   +  P          D  ++S KP V+ A   L+ S  
Subjt:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQA---LISSLT

Query:  TTTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLHEQ
             LAGFV+DM+CT+M+DVANEF VPSY++YTS+A +L    H++ +Y   +  +  + +L+DSDV L V SL +  P K +P IF   +   +   Q
Subjt:  TTTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLHEQ

Query:  VKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKK-NIETMD--HVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALEQS
         +R R E KG+L+NT  ++E   ++ LS  +   +P  Y VGP+LHLK  N + +D    ++L+WLD+QPP SVVFLCFGS G   ++QV E   AL++S
Subjt:  VKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKK-NIETMD--HVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALEQS

Query:  GVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ--------------------------------------EQ----------
        G RF+WSLRR SP    E   ++TN E+ILPEGF +RT + GKVIGWA Q                                      EQ          
Subjt:  GVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ--------------------------------------EQ----------

Query:  -----------------DKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDDKFE
                          + +IV+AEEIE+GI  LM + ++++RK+V   SE+C  ++++GGSS  +L++FI D  E
Subjt:  -----------------DKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDDKFE

AT3G21790.1 UDP-Glycosyltransferase superfamily protein4.6e-7337.19Show/hide
Query:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPF----DAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSL
        PG GHL STVEMA +LV R+ RL++++  I LPF    +  A++YI +LSA S    N  +R+ V+  +  P+ +     +++ +++ +P V+  +   L
Subjt:  PGTGHLASTVEMANILVARDPRLTVAILAIKLPF----DAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSL

Query:  TTTTN-----HLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNN---SSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFIN
           ++      +AGFVLDMFCT+MVDVANEF  PSY++YTSSA  L+ ++H++ L  +N    S N+      DS+  L+  SL    P K +P     N
Subjt:  TTTTN-----HLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNN---SSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFIN

Query:  KAAVWLHEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHL---KKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVE
                Q ++ R E+KG+L+NT  E+E +V+  LS+  +   PP+Y VGP+LHL   + + +    +++++WLD QPPSSVVFLCFGS G   ++QV 
Subjt:  KAAVWLHEQVKRIRFEVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHL---KKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVE

Query:  EIGRALEQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ--------------------------------------EQD
        EI  ALE+SG RF+WSLRR SP    E+  ++TN E++LPEGF +RT D+GKVIGWAPQ                                      EQ 
Subjt:  EIGRALEQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQ--------------------------------------EQD

Query:  ----------------------------KPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDD
                                        V+AEEIE+ I  LM + ++++RK+VK  SE+C  ++++GGSS  +LQKFI++
Subjt:  ----------------------------KPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDD

AT4G15280.1 UDP-glucosyl transferase 71B51.4e-6935.5Show/hide
Query:  MPGTGHLASTVEMANILVARDPRLTVAILAIKLPFDA-KANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSLTT
        +PG GHL  TV++A  L+  + RL++ I+ I   FDA  A+  I SL+  S  +        V  +  PP+ D +     V +E  K  V+ A+ + +  
Subjt:  MPGTGHLASTVEMANILVARDPRLTVAILAIKLPFDA-KANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSLTT

Query:  TTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLHEQV
         T  LAGFV+DMFC++M+DVANEF VP Y+ YTS+A +L    H++Q+Y Q       + +L++S   L   SL    P K +P I     +  WL   +
Subjt:  TTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLHEQV

Query:  KRIRF--EVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKK-NIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALEQSG
         + R   ++KG+L+NT  E+E H +   + +    LP +Y VGPVLHL+  N +     ++L+WLD+QP  SVVFLCFGS G   ++Q  E   AL++SG
Subjt:  KRIRF--EVKGVLINTFEEMESHVISSLSTDSSLQLPPLYSVGPVLHLKK-NIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALEQSG

Query:  VRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQEQ--DKPKI------------------------------------------
         RF+W LR  SP    +   DYTN E++LPEGFL RT+D GKVIGWAPQ    +KP I                                          
Subjt:  VRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFLNRTVDVGKVIGWAPQEQ--DKPKI------------------------------------------

Query:  ---------------------VSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDDKFE
                             V+AE+IER IR++M + ++++R  VK  +E+C  ++++GGSS  +L+KFI D  E
Subjt:  ---------------------VSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDDKFE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGGGACGGGTCACCTCGCATCCACGGTGGAGATGGCCAATATTCTTGTTGCTCGAGATCCTCGTCTCACCGTCGCAATTCTTGCCATTAAGCTACCCTTTGATGC
CAAAGCTAATGAATATATTCAATCCCTTTCTGCACAGTCTCTTACCAACAATAACACATCCATACGCTTCATTGTGCTTCCTGAATTACCACCTCCCTCAAAAGATGGGA
ACCATTTCTTGCTAGACGTAGTGCTTGAAAGCTACAAACCCCATGTCAAACAAGCTCTTATCTCCTCTCTAACTACCACCACCAACCACCTTGCTGGATTTGTCTTGGAC
ATGTTCTGCACAACCATGGTTGATGTTGCTAATGAATTTCAGGTTCCTTCTTATGTGTATTACACTTCTAGTGCCGCCTATCTTGCTTTTAGTTTTCATCTTGAACAACT
TTACACACAAAATAATAGCAGTAATGAGGTAATCCAACAGTTGAAGGATTCAGATGTTAATTTGAGTGTATCAAGCTTAGTGAATCAAGTTCCAAGTAAAGTCATTCCTG
GTATCTTCTTTATTAACAAAGCTGCTGTTTGGTTGCATGAACAAGTTAAAAGAATTAGATTCGAAGTAAAAGGTGTTCTCATCAATACATTTGAGGAGATGGAATCACAT
GTGATATCTTCTTTGTCAACTGACTCCTCTTTGCAACTCCCACCATTGTATTCTGTTGGGCCTGTTTTGCACTTGAAGAAGAATATTGAGACTATGGATCATGTAGATGT
ATTGAAGTGGCTTGATGATCAACCTCCATCATCGGTGGTGTTTTTGTGCTTTGGAAGCAGGGGAAGCATTGAGAAGGATCAAGTGGAGGAGATTGGACGAGCGCTTGAGC
AAAGCGGGGTTCGTTTCATTTGGTCCCTCCGACGACCATCACCAGGGGGTGCATTTGAAGTGGCAATTGACTATACAAACTTCGAAGACATCTTACCTGAGGGATTTCTC
AATCGGACCGTAGATGTTGGGAAGGTCATTGGTTGGGCACCACAAGAACAAGACAAGCCAAAAATAGTGAGTGCAGAAGAGATAGAGAGAGGGATCAGAAAGTTGATGGA
CGACAACAATAATGAGATTCGAAAGAAAGTAAAAACAAAAAGTGAAGAATGTAGAAAAAGTGTAGTAGAAGGTGGATCCTCCTTCATCTCATTGCAAAAATTTATAGATG
ACAAGTTTGAGCTGGATTTCATACCGATGCCGGGGACGGGTCACCTCGCATCCACGGTGGAGATGGCCAATATTCTTGTCGCTCGAGATCCTCGTCTCACCGTCGCAATT
CTTGCCATTAAGCTATCCTTTGAGGTCAAAGCTAATGAATGTATCCAGTCTCTTTCCAACAACACATCCATACGCTTCATTGACCTTCCTGAATTACCACCTCCCCCAAA
AGGTGGGAACCATTTTTTCCTGAAAGTTGTTCTTGAAAGCTACAAACCCCATGTCAAACAAGCTCTTATCTCCTCTCTAACTACCACCACCAACCACCTTGCTGGATTTG
TCTTGGACATGTTCTGCACAACCATGGTTGATGTTGCTAATGAATTTCAGGTTCCTTCTTATGTGTATTATACTTCTAGTGCCGCCTATCTTGCTTTTAGTTTTCATCTT
GAACAACTTTACACACAAAATAATAGTTGCAATGAGGTAATACAACAATTGAAGGATTCAGATGTCAATTTGAGTGTGTCAAGCTTAGTGAATCAAGTTCCAAGTAAAGT
CATTCCTGGTATCTTCTTTATAATTAGATCTGAAGTAAAAGGTATTCTCATCAATACATTTGAGGAGCTGGAAGCACATGTGATATGTTCTTTGTCAACTGACTCCTCTT
TGCAACTCCCACCTTTTGGCTCATCATCGGTGGTGTTCTTGTGCTTTGGAAGCAGGGGAAGCTTTGAGAAGGATCAAGTGGAGGAGATTGGACGAGCGCTTGAGCGAAGC
GGGGTTCGTTTCATTTGGTCCCTCCGACGACCATCACCAGGGGGTGCGTTTGAAGTGGCAATCGACTATTCAAACTTCGAAGACATCTTACCTGAGGGATTTCTCAATCG
GACTGTAGATGTTGGGAGGGTCATCGGTTGGGCACCTCAAATGTTTGAGTTGATTTTCATACCTGCCCCCGGAATGGGTCACCTTGCATCCACCGTCGAAATGGCCAATG
TTCTTGCCACTCGGGATCATCGTCTCTCCATCACAGTGCTTGCCATGAAGCTTCCCTACGATGTCAAAGTTGCCGAATGTATCGAGTCACTTTCTTCGTCTTTCGCAGGC
AAACATATACAATTCAACGTCCTTCCGGAACCGCCACTTCCAGAAGAAAGTAAAAAGGACTTCATTGTGTTAGTTGAAAGCTACAAGCCTTATGTTAGAGAGGCTGTCAC
CAACTTTACTACCTCAGCAGAGACAAGTCTCGACTCGCCTCGGCTTGTCGGGTTAGTCATTGACATGTTTTGTACAACCATGATAGATGTGGCTAACGAATTCGGGGTTC
CTTGTTATGTGTTTTATACTTGTAGTGCTAGTTTTCTTGCTTTTAGTCTTTATCTTCAAGAGCTTTATGAGGAGAATGGTAGCAATGAAGTGGTGGAACAGTTGCTGAAC
TCAGATAATGATGAGTTAACTTTGCCAAATTTTGTTAATCCTATTCCTAGTAAACTCATTCCTAGTCTCTTCTCTAACAAGGTCAAGGCTGTTTGGTTTCATGACCATAT
TAAAAGATTTAGATTGGAAATCAAAGGGATTCTCATCAATACATTCTTGGAGATGGAATCACATGTGGTGAAGTCCTATTCTCAAGTCCTTCCACCCTTATATTTTGTTG
GACCTGTTTTGCACTTGAAAAATACAGGAATGGCTGGATCAAGTGAGGCTCAAAACTATGGAGATATATTAAAGTGGCTTGATGATCAACCTCCATCATCAGTGGTTCTA
GTGTGCTTTGGGACCATGGTGAGTTTTGATGAGGATCAGGTGGTGGCAATTGCACAAGCATTGGAGGAAAGCGGGGTTCGTTTCATATGGTCTCTCCGGCAACCCCCACC
GAAGGGTAAGTTCGAAGCACCGAGAAACTACACGGACATCAAAGACAAATTAATGGGGGAAGAAAGTGATGAGATAAGGAAGAAAGTGAAAGCAAAAAGTGAAGAAAGTA
GGAAAAGTGTAATGGAAGGTGGATCTTCCTTCATCTCGTTGGGAAAATTTATAGATGATATGAAGAAGTTTGAGCTGGTTTTCATACCAGGACCTGGAATAGGCCATCTT
GCATCTACTGTTGAGATGGCCAATATGCTTGTCACTCGAGATCACCTTCTCGTCGTCACGGTGCTTGTCATGCAATTGTCTTACGACAACAAAGCAATGGATCATATCCA
ACTACTTTCTGCATCTTTTGCTGGAAAATCTATACACTTGATCCTTCTTCCTGAATTCCCCCTTCCAGAGGAATGTCAAAATGGTATGCCACAACCCCTTATTGAAATCT
ATAAGCCCCATGTTAGAGAAGCTATGGCCAAACATGCCAACTCTCAAACTAGTCCTGACTTGCCCCAACTAGTTGGATTTGTTCTTGACATGTTTTGTATGACAATGATA
GATGTGGCTAAAGAATTTGAGGTACCTTGTTATGTGTTTTACACTTCCAGTGCTGCCTTTCTTGCTCTCAATTTTCACCTTCAAGAGCTTTATGATCAGAGTAATAGCAA
CAAGGTAGTTGAACAGTTAAAGAACTCAGATTCTGAAGCACTAACTATACCAAGTTTTGTCAAACCAATTCCTGGGAAAGTCATTCCTAGTATATTTGTTTATAATGACA
TTGCTGTTTGGTTGTATGAAAATACTAGGAAGTTTAGAACAGAAATCAAAGGTGTTTTGATCAATACTTGTGCAGAGATAGAATCTCATTGGCTCATCCTCACAAGTCCC
TTGTATTTTGTTGGACCTATTTTGAACTTGAAAAACACTGTCAACCAAGTAAATATACTTAAATGGCTTGATGATCAGCCTCAAGCATCAGTGGTTTTCTTGTGTTTTGG
GAGCATGGGAAGCTTTGATGAGGATCAAGTGAAGGAGATTGCACGAGGGCTAGAGCACTGTGGGGTTCGCTTCTTGTGGTCGCTTCGACAACCCCCACCAAAGGGGAAGT
GGGTAGCACCAAGCGATTATGCTGACATCAAAGACGTCAAAAATAGTAACTGCAAAAGAGATAAAGAGTGGAATAAGAAAATTGATGGACGACGGGGACCAAATTCGGAA
GAAAGTCAAAGCAAAAAGCGAAGAAATTCGAAAGGCTGTAATGGAAGTATTGGCTACTCGGAAGCCAGGTTAGTTGTGCCATGTTTTGTCAATCCAATTCCTAGCACAGA
GTCTTCCCTCGCTCTTTGTTTGATTATGAAACTGCTCTTTGGTTTTATGATAATGTCTAAAAGATTTAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGGGGACGGGTCACCTCGCATCCACGGTGGAGATGGCCAATATTCTTGTTGCTCGAGATCCTCGTCTCACCGTCGCAATTCTTGCCATTAAGCTACCCTTTGATGC
CAAAGCTAATGAATATATTCAATCCCTTTCTGCACAGTCTCTTACCAACAATAACACATCCATACGCTTCATTGTGCTTCCTGAATTACCACCTCCCTCAAAAGATGGGA
ACCATTTCTTGCTAGACGTAGTGCTTGAAAGCTACAAACCCCATGTCAAACAAGCTCTTATCTCCTCTCTAACTACCACCACCAACCACCTTGCTGGATTTGTCTTGGAC
ATGTTCTGCACAACCATGGTTGATGTTGCTAATGAATTTCAGGTTCCTTCTTATGTGTATTACACTTCTAGTGCCGCCTATCTTGCTTTTAGTTTTCATCTTGAACAACT
TTACACACAAAATAATAGCAGTAATGAGGTAATCCAACAGTTGAAGGATTCAGATGTTAATTTGAGTGTATCAAGCTTAGTGAATCAAGTTCCAAGTAAAGTCATTCCTG
GTATCTTCTTTATTAACAAAGCTGCTGTTTGGTTGCATGAACAAGTTAAAAGAATTAGATTCGAAGTAAAAGGTGTTCTCATCAATACATTTGAGGAGATGGAATCACAT
GTGATATCTTCTTTGTCAACTGACTCCTCTTTGCAACTCCCACCATTGTATTCTGTTGGGCCTGTTTTGCACTTGAAGAAGAATATTGAGACTATGGATCATGTAGATGT
ATTGAAGTGGCTTGATGATCAACCTCCATCATCGGTGGTGTTTTTGTGCTTTGGAAGCAGGGGAAGCATTGAGAAGGATCAAGTGGAGGAGATTGGACGAGCGCTTGAGC
AAAGCGGGGTTCGTTTCATTTGGTCCCTCCGACGACCATCACCAGGGGGTGCATTTGAAGTGGCAATTGACTATACAAACTTCGAAGACATCTTACCTGAGGGATTTCTC
AATCGGACCGTAGATGTTGGGAAGGTCATTGGTTGGGCACCACAAGAACAAGACAAGCCAAAAATAGTGAGTGCAGAAGAGATAGAGAGAGGGATCAGAAAGTTGATGGA
CGACAACAATAATGAGATTCGAAAGAAAGTAAAAACAAAAAGTGAAGAATGTAGAAAAAGTGTAGTAGAAGGTGGATCCTCCTTCATCTCATTGCAAAAATTTATAGATG
ACAAGTTTGAGCTGGATTTCATACCGATGCCGGGGACGGGTCACCTCGCATCCACGGTGGAGATGGCCAATATTCTTGTCGCTCGAGATCCTCGTCTCACCGTCGCAATT
CTTGCCATTAAGCTATCCTTTGAGGTCAAAGCTAATGAATGTATCCAGTCTCTTTCCAACAACACATCCATACGCTTCATTGACCTTCCTGAATTACCACCTCCCCCAAA
AGGTGGGAACCATTTTTTCCTGAAAGTTGTTCTTGAAAGCTACAAACCCCATGTCAAACAAGCTCTTATCTCCTCTCTAACTACCACCACCAACCACCTTGCTGGATTTG
TCTTGGACATGTTCTGCACAACCATGGTTGATGTTGCTAATGAATTTCAGGTTCCTTCTTATGTGTATTATACTTCTAGTGCCGCCTATCTTGCTTTTAGTTTTCATCTT
GAACAACTTTACACACAAAATAATAGTTGCAATGAGGTAATACAACAATTGAAGGATTCAGATGTCAATTTGAGTGTGTCAAGCTTAGTGAATCAAGTTCCAAGTAAAGT
CATTCCTGGTATCTTCTTTATAATTAGATCTGAAGTAAAAGGTATTCTCATCAATACATTTGAGGAGCTGGAAGCACATGTGATATGTTCTTTGTCAACTGACTCCTCTT
TGCAACTCCCACCTTTTGGCTCATCATCGGTGGTGTTCTTGTGCTTTGGAAGCAGGGGAAGCTTTGAGAAGGATCAAGTGGAGGAGATTGGACGAGCGCTTGAGCGAAGC
GGGGTTCGTTTCATTTGGTCCCTCCGACGACCATCACCAGGGGGTGCGTTTGAAGTGGCAATCGACTATTCAAACTTCGAAGACATCTTACCTGAGGGATTTCTCAATCG
GACTGTAGATGTTGGGAGGGTCATCGGTTGGGCACCTCAAATGTTTGAGTTGATTTTCATACCTGCCCCCGGAATGGGTCACCTTGCATCCACCGTCGAAATGGCCAATG
TTCTTGCCACTCGGGATCATCGTCTCTCCATCACAGTGCTTGCCATGAAGCTTCCCTACGATGTCAAAGTTGCCGAATGTATCGAGTCACTTTCTTCGTCTTTCGCAGGC
AAACATATACAATTCAACGTCCTTCCGGAACCGCCACTTCCAGAAGAAAGTAAAAAGGACTTCATTGTGTTAGTTGAAAGCTACAAGCCTTATGTTAGAGAGGCTGTCAC
CAACTTTACTACCTCAGCAGAGACAAGTCTCGACTCGCCTCGGCTTGTCGGGTTAGTCATTGACATGTTTTGTACAACCATGATAGATGTGGCTAACGAATTCGGGGTTC
CTTGTTATGTGTTTTATACTTGTAGTGCTAGTTTTCTTGCTTTTAGTCTTTATCTTCAAGAGCTTTATGAGGAGAATGGTAGCAATGAAGTGGTGGAACAGTTGCTGAAC
TCAGATAATGATGAGTTAACTTTGCCAAATTTTGTTAATCCTATTCCTAGTAAACTCATTCCTAGTCTCTTCTCTAACAAGGTCAAGGCTGTTTGGTTTCATGACCATAT
TAAAAGATTTAGATTGGAAATCAAAGGGATTCTCATCAATACATTCTTGGAGATGGAATCACATGTGGTGAAGTCCTATTCTCAAGTCCTTCCACCCTTATATTTTGTTG
GACCTGTTTTGCACTTGAAAAATACAGGAATGGCTGGATCAAGTGAGGCTCAAAACTATGGAGATATATTAAAGTGGCTTGATGATCAACCTCCATCATCAGTGGTTCTA
GTGTGCTTTGGGACCATGGTGAGTTTTGATGAGGATCAGGTGGTGGCAATTGCACAAGCATTGGAGGAAAGCGGGGTTCGTTTCATATGGTCTCTCCGGCAACCCCCACC
GAAGGGTAAGTTCGAAGCACCGAGAAACTACACGGACATCAAAGACAAATTAATGGGGGAAGAAAGTGATGAGATAAGGAAGAAAGTGAAAGCAAAAAGTGAAGAAAGTA
GGAAAAGTGTAATGGAAGGTGGATCTTCCTTCATCTCGTTGGGAAAATTTATAGATGATATGAAGAAGTTTGAGCTGGTTTTCATACCAGGACCTGGAATAGGCCATCTT
GCATCTACTGTTGAGATGGCCAATATGCTTGTCACTCGAGATCACCTTCTCGTCGTCACGGTGCTTGTCATGCAATTGTCTTACGACAACAAAGCAATGGATCATATCCA
ACTACTTTCTGCATCTTTTGCTGGAAAATCTATACACTTGATCCTTCTTCCTGAATTCCCCCTTCCAGAGGAATGTCAAAATGGTATGCCACAACCCCTTATTGAAATCT
ATAAGCCCCATGTTAGAGAAGCTATGGCCAAACATGCCAACTCTCAAACTAGTCCTGACTTGCCCCAACTAGTTGGATTTGTTCTTGACATGTTTTGTATGACAATGATA
GATGTGGCTAAAGAATTTGAGGTACCTTGTTATGTGTTTTACACTTCCAGTGCTGCCTTTCTTGCTCTCAATTTTCACCTTCAAGAGCTTTATGATCAGAGTAATAGCAA
CAAGGTAGTTGAACAGTTAAAGAACTCAGATTCTGAAGCACTAACTATACCAAGTTTTGTCAAACCAATTCCTGGGAAAGTCATTCCTAGTATATTTGTTTATAATGACA
TTGCTGTTTGGTTGTATGAAAATACTAGGAAGTTTAGAACAGAAATCAAAGGTGTTTTGATCAATACTTGTGCAGAGATAGAATCTCATTGGCTCATCCTCACAAGTCCC
TTGTATTTTGTTGGACCTATTTTGAACTTGAAAAACACTGTCAACCAAGTAAATATACTTAAATGGCTTGATGATCAGCCTCAAGCATCAGTGGTTTTCTTGTGTTTTGG
GAGCATGGGAAGCTTTGATGAGGATCAAGTGAAGGAGATTGCACGAGGGCTAGAGCACTGTGGGGTTCGCTTCTTGTGGTCGCTTCGACAACCCCCACCAAAGGGGAAGT
GGGTAGCACCAAGCGATTATGCTGACATCAAAGACGTCAAAAATAGTAACTGCAAAAGAGATAAAGAGTGGAATAAGAAAATTGATGGACGACGGGGACCAAATTCGGAA
GAAAGTCAAAGCAAAAAGCGAAGAAATTCGAAAGGCTGTAATGGAAGTATTGGCTACTCGGAAGCCAGGTTAGTTGTGCCATGTTTTGTCAATCCAATTCCTAGCACAGA
GTCTTCCCTCGCTCTTTGTTTGATTATGAAACTGCTCTTTGGTTTTATGATAATGTCTAAAAGATTTAGGTAA
Protein sequenceShow/hide protein sequence
MPGTGHLASTVEMANILVARDPRLTVAILAIKLPFDAKANEYIQSLSAQSLTNNNTSIRFIVLPELPPPSKDGNHFLLDVVLESYKPHVKQALISSLTTTTNHLAGFVLD
MFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHLEQLYTQNNSSNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFINKAAVWLHEQVKRIRFEVKGVLINTFEEMESH
VISSLSTDSSLQLPPLYSVGPVLHLKKNIETMDHVDVLKWLDDQPPSSVVFLCFGSRGSIEKDQVEEIGRALEQSGVRFIWSLRRPSPGGAFEVAIDYTNFEDILPEGFL
NRTVDVGKVIGWAPQEQDKPKIVSAEEIERGIRKLMDDNNNEIRKKVKTKSEECRKSVVEGGSSFISLQKFIDDKFELDFIPMPGTGHLASTVEMANILVARDPRLTVAI
LAIKLSFEVKANECIQSLSNNTSIRFIDLPELPPPPKGGNHFFLKVVLESYKPHVKQALISSLTTTTNHLAGFVLDMFCTTMVDVANEFQVPSYVYYTSSAAYLAFSFHL
EQLYTQNNSCNEVIQQLKDSDVNLSVSSLVNQVPSKVIPGIFFIIRSEVKGILINTFEELEAHVICSLSTDSSLQLPPFGSSSVVFLCFGSRGSFEKDQVEEIGRALERS
GVRFIWSLRRPSPGGAFEVAIDYSNFEDILPEGFLNRTVDVGRVIGWAPQMFELIFIPAPGMGHLASTVEMANVLATRDHRLSITVLAMKLPYDVKVAECIESLSSSFAG
KHIQFNVLPEPPLPEESKKDFIVLVESYKPYVREAVTNFTTSAETSLDSPRLVGLVIDMFCTTMIDVANEFGVPCYVFYTCSASFLAFSLYLQELYEENGSNEVVEQLLN
SDNDELTLPNFVNPIPSKLIPSLFSNKVKAVWFHDHIKRFRLEIKGILINTFLEMESHVVKSYSQVLPPLYFVGPVLHLKNTGMAGSSEAQNYGDILKWLDDQPPSSVVL
VCFGTMVSFDEDQVVAIAQALEESGVRFIWSLRQPPPKGKFEAPRNYTDIKDKLMGEESDEIRKKVKAKSEESRKSVMEGGSSFISLGKFIDDMKKFELVFIPGPGIGHL
ASTVEMANMLVTRDHLLVVTVLVMQLSYDNKAMDHIQLLSASFAGKSIHLILLPEFPLPEECQNGMPQPLIEIYKPHVREAMAKHANSQTSPDLPQLVGFVLDMFCMTMI
DVAKEFEVPCYVFYTSSAAFLALNFHLQELYDQSNSNKVVEQLKNSDSEALTIPSFVKPIPGKVIPSIFVYNDIAVWLYENTRKFRTEIKGVLINTCAEIESHWLILTSP
LYFVGPILNLKNTVNQVNILKWLDDQPQASVVFLCFGSMGSFDEDQVKEIARGLEHCGVRFLWSLRQPPPKGKWVAPSDYADIKDVKNSNCKRDKEWNKKIDGRRGPNSE
ESQSKKRRNSKGCNGSIGYSEARLVVPCFVNPIPSTESSLALCLIMKLLFGFMIMSKRFR