CuGenDBv2

HG10016480 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10016480
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: UDP-glycosyltransferase 72E1
Genome location: Chr03:5364569..5366002
RNA-Seq Expression: HG10016480
Synteny: HG10016480
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0008194 - UDP-glycosyltransferase activity (molecular function)
InterPro domains: IPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase
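
The genome location above follows a chromosome:start..end pattern. As a minimal sketch, the Python below splits such a string into its parts and reports the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state, and `Location` and `parse_location` are illustrative names, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr03:5364569..5366002' (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(chrom, start, end)


loc = parse_location("Chr03:5364569..5366002")
print(loc.chrom, loc.start, loc.end, loc.length)  # Chr03 5364569 5366002 1434
```

Under the inclusive-coordinate assumption the gene spans 1434 bp; with a half-open convention the figure would be 1433.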


Homology
GenBank top hits (e value, %identity, alignment)
KAA0036251.1 UDP-glycosyltransferase 72E1 [Cucumis melo var. makuwa]; e value 8.8e-222; %identity 81.88
Query:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA
        MP +ESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSS+AE  LLQKPS VNI+ LPH+SS+LD NA   +II +MMTASFP LRSSIA
Subjt:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA

Query:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG
        AA PRPAALIVDLFGT A+SIAHELGMLGFVF+T++AWFLSLSFF+PS D+  +DAHV NH+ L IPGCTPVRFE+T EVF+LNQ++VY GFGSFA ELG
Subjt:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG

Query:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV
        TADGILSNTWQDLEPTTLKAL+EAGTL KGKVN+VPI PIGPLT + +P LESEVLKWLD+QPDESVIYVSFGSGGTL  EQITELAWGLE+SQQRFVWV
Subjt:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV

Query:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE
        IRPPAGT+S+G FFTAG GS+ D+   E+LPEGFIKRTKEVGLV+PMWGPQAEILSHRSVRGF+THCGWNSSLE IVNGVAMVTWPLYAEQKMNA +LTE
Subjt:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE

Query:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC
        EVGVAVRVR E   +V R+EIE KVR IME +EG  IRERVKGLKISGEKA++KGGSSYNSLA +ASECDIFRRRRDGGC
Subjt:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC

TYK12645.1 UDP-glycosyltransferase 72E1 [Cucumis melo var. makuwa]; e value 9.8e-221; %identity 81.46
Query:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA
        MP +ESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSS+AE  LLQKPS VNI+ LPH+SS+LD NA   +II +MMTASFP LRSSIA
Subjt:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA

Query:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG
        AA PRPAALIVDLFGT A+ IAHELGMLGFVF+T++AWFLSLSFF+PS D+  +D HV NH+ L IPGCTPVRFE+T EVF+LNQ++VY GFGSFA ELG
Subjt:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG

Query:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV
        TADGILSNTWQDLEPTTLKAL+EAGTL KGKVN+VPI PIGPLT + +P LESEVLKWLD+QPDESVIYVSFGSGGTL  EQITELAWGLE+SQQRFVWV
Subjt:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV

Query:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE
        IRPPAGT+S+G FFTAG GS+ D+   E+LPEGFIKRTKEVGLV+PMWGPQAEILSHRSVRGF+THCGWNSSLE IVNGVAMVTWPLYAEQKMNA +LTE
Subjt:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE

Query:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC
        EVGVAVRVR E   +V R+EIE KVR IME +EG  IRERVKGLKISGEKA++KGGSSYNSLA +ASECDIFRRRRDGGC
Subjt:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC

XP_008441118.2 PREDICTED: UDP-glycosyltransferase 72E1 [Cucumis melo]; e value 2.8e-220; %identity 81.46
Query:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA
        MP +ESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSS+AE  LLQKPS VNI+ LPH+SS+LD NA   +II +MMTASFP LRSSIA
Subjt:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA

Query:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG
        AA PRPAALIVDLFGT A+SIAHELGMLGFVF+T++AWFLSL  F+PS D+  +DAHV NH+ L IPGCTPVRFE+T EVF+LNQ++VY GFGSFA ELG
Subjt:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG

Query:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV
        TADGILSNTWQDLEPTTLKAL+EAGTL KGKVN+VPI PIGPLT + +P LESEVLKWLD+QPDESVIYVSFGSGGTL  EQITELAWGLE+SQQRFVWV
Subjt:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV

Query:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE
        IRPPAGT+S+G FFTAG GS+ D+   E+LPEGFIKRTKEVGLV+PMWGPQAEILSHRSVRGF+THCGWNSSLE IVNGVAMVTWPLYAEQKMNA +LTE
Subjt:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE

Query:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC
        EVGVAVRVR E   +V R+EIE KVR IME +EG  IRERVKGLKISGEKA++KGGSSYNSLA +ASECDIFRRRRDGGC
Subjt:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC

XP_031742908.1 UDP-glycosyltransferase 72E2 [Cucumis sativus]; e value 1.2e-213; %identity 82.25
Query:  MGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIAAAKPRPAALIVDLFGTE
        MGHLIPFLELANRLVLHHNLQATLFVVGTGSS+AESTLLQKPS VNIVSLPHS S+LDPNA   DII +MMTASFP LRSSIAA  PRPAALIVDLFGT 
Subjt:  MGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIAAAKPRPAALIVDLFGTE

Query:  AISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTADGILSNTWQDLEPTT
        A+SIAHELGMLG VF+T+NAW+LS+S+ +PS ++  +DAHVYNH+ L IPGCTPVRFEDT EVF+LNQE+VY GFG +ARELGTADGILSNTWQDLEPTT
Subjt:  AISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTADGILSNTWQDLEPTT

Query:  LKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIRPPAGTDSVGAFFTAG
        LKAL+EAGTL  GKVN+VPI PIGPLTR+ EP LESEVLKWLD+QPDESVIYVSFGSGGTLC EQITELAWGLELSQQRFVWVIRPP GT+S GAFFTAG
Subjt:  LKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIRPPAGTDSVGAFFTAG

Query:  TGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVGVAVRVRTE--RVVG
         GS+ D    +YLPEGFIKRTKEVGLV+PMWGPQAEILSHRSVRGF+THCGWNSSLE IVNGVAMVTWPLYAEQKMNA LLTEE+GVAVR+R E   VV 
Subjt:  TGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVGVAVRVRTE--RVVG

Query:  REEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGG
        R+EIE+KVR IME +EG  IRERVK LKISG KA++KGGSSYNSLA +ASECDIFRRRRDGG
Subjt:  REEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGG

XP_038883108.1 UDP-glycosyltransferase 72E1-like [Benincasa hispida]; e value 2.3e-238; %identity 86.37
Query:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA
        MPV++SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGT SS AES LLQKPSA NIV LPHSSSTLDPNASY+DII AMMTASFP LRSSI 
Subjt:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA

Query:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG
           PRPAALIVDLFGT+A+SIAHELGMLGFVF+TS AWFLSLSFF+PSM +  IDAHVYNHE L+IPGCTPVRFEDT E+FQLN+E++YEGFG F RELG
Subjt:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG

Query:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV
        TADGIL NTWQDLEP TLKALTE G L+K KVNQVPI PIGPLTR++EPNL+SEVLKWLD+QPDESVIYVSFGSGGTLCA+QITELAWGLELSQQRFVWV
Subjt:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV

Query:  IRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEE
        IRPPAGTD +G FFT GTGS D+S PEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGF+THCGWNSSLEGIVNGVAMVTWPLYAEQKMNA +LTEE
Subjt:  IRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEE

Query:  VGVAVRVRTERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC
        +GVAVRVR E VVGREEIERKVR IMEDEEGREIRERVK LKI+GEKA+SKGGSSY+SLA +AS+CDIFRRRRDGGC
Subjt:  VGVAVRVRTERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC
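
The %identity column is not defined in this record. Under the usual BLAST convention (an assumption here), it is the number of identical columns divided by the alignment length, and the middle row of each alignment block marks an identical column by repeating the residue letter, a conservative substitution with `+`, and anything else with a space. A minimal sketch with a hypothetical helper `identity_from_midline`, using the first ten columns of the first GenBank alignment as input:

```python
from typing import Tuple


def identity_from_midline(query: str, midline: str) -> Tuple[int, int, float]:
    """Return (identical columns, aligned columns, percent identity).

    ASSUMES BLAST-style midline text: a letter equal to the query residue
    marks an identity, '+' a conservative substitution, ' ' a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("alignment is empty")
    aligned = len(query)
    # A column is identical when the midline repeats the query residue;
    # gap characters in the query never count as identities.
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return identical, aligned, round(100.0 * identical / aligned, 2)


# First ten columns of the KAA0036251.1 alignment shown in this record.
print(identity_from_midline("MPVEESKTHV", "MP +ESKTHV"))  # (8, 10, 80.0)
```

Applying the same count across every row of one hit and dividing by the full alignment length would give a figure comparable to the reported %identity, provided the database follows this convention.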

TrEMBL top hits (e value, %identity, alignment)
A0A0A0KGE4 Uncharacterized protein; e value 6.6e-223; %identity 82.46
Query:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA
        MP +ESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSS+AESTLLQKPS VNIVSLPHS S+LDPNA   DII +MMTASFP LRSSIA
Subjt:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA

Query:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG
        A  PRPAALIVDLFGT A+SIAHELGMLG VF+T+NAW+LS+S+ +PS ++  +DAHVYNH+ L IPGCTPVRFEDT EVF+LNQE+VY GFG +ARELG
Subjt:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG

Query:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV
        TADGILSNTWQDLEPTTLKAL+EAGTL  GKVN+VPI PIGPLTR+ EP LESEVLKWLD+QPDESVIYVSFGSGGTLC EQITELAWGLELSQQRFVWV
Subjt:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV

Query:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE
        IRPP GT+S GAFFTAG GS+ D    +YLPEGFIKRTKEVGLV+PMWGPQAEILSHRSVRGF+THCGWNSSLE IVNGVAMVTWPLYAEQKMNA LLTE
Subjt:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE

Query:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGG
        E+GVAVR+R E   VV R+EIE+KVR IME +EG  IRERVK LKISG KA++KGGSSYNSLA +ASECDIFRRRRDGG
Subjt:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGG

A0A1S3B288 UDP-glycosyltransferase 72E1; e value 1.4e-220; %identity 81.46
Query:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA
        MP +ESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSS+AE  LLQKPS VNI+ LPH+SS+LD NA   +II +MMTASFP LRSSIA
Subjt:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA

Query:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG
        AA PRPAALIVDLFGT A+SIAHELGMLGFVF+T++AWFLSL  F+PS D+  +DAHV NH+ L IPGCTPVRFE+T EVF+LNQ++VY GFGSFA ELG
Subjt:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG

Query:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV
        TADGILSNTWQDLEPTTLKAL+EAGTL KGKVN+VPI PIGPLT + +P LESEVLKWLD+QPDESVIYVSFGSGGTL  EQITELAWGLE+SQQRFVWV
Subjt:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV

Query:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE
        IRPPAGT+S+G FFTAG GS+ D+   E+LPEGFIKRTKEVGLV+PMWGPQAEILSHRSVRGF+THCGWNSSLE IVNGVAMVTWPLYAEQKMNA +LTE
Subjt:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE

Query:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC
        EVGVAVRVR E   +V R+EIE KVR IME +EG  IRERVKGLKISGEKA++KGGSSYNSLA +ASECDIFRRRRDGGC
Subjt:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC

A0A5A7T3W2 UDP-glycosyltransferase 72E1; e value 4.3e-222; %identity 81.88
Query:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA
        MP +ESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSS+AE  LLQKPS VNI+ LPH+SS+LD NA   +II +MMTASFP LRSSIA
Subjt:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA

Query:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG
        AA PRPAALIVDLFGT A+SIAHELGMLGFVF+T++AWFLSLSFF+PS D+  +DAHV NH+ L IPGCTPVRFE+T EVF+LNQ++VY GFGSFA ELG
Subjt:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG

Query:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV
        TADGILSNTWQDLEPTTLKAL+EAGTL KGKVN+VPI PIGPLT + +P LESEVLKWLD+QPDESVIYVSFGSGGTL  EQITELAWGLE+SQQRFVWV
Subjt:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV

Query:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE
        IRPPAGT+S+G FFTAG GS+ D+   E+LPEGFIKRTKEVGLV+PMWGPQAEILSHRSVRGF+THCGWNSSLE IVNGVAMVTWPLYAEQKMNA +LTE
Subjt:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE

Query:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC
        EVGVAVRVR E   +V R+EIE KVR IME +EG  IRERVKGLKISGEKA++KGGSSYNSLA +ASECDIFRRRRDGGC
Subjt:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC

A0A5D3CM79 UDP-glycosyltransferase 72E1; e value 4.7e-221; %identity 81.46
Query:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA
        MP +ESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSS+AE  LLQKPS VNI+ LPH+SS+LD NA   +II +MMTASFP LRSSIA
Subjt:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA

Query:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG
        AA PRPAALIVDLFGT A+ IAHELGMLGFVF+T++AWFLSLSFF+PS D+  +D HV NH+ L IPGCTPVRFE+T EVF+LNQ++VY GFGSFA ELG
Subjt:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG

Query:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV
        TADGILSNTWQDLEPTTLKAL+EAGTL KGKVN+VPI PIGPLT + +P LESEVLKWLD+QPDESVIYVSFGSGGTL  EQITELAWGLE+SQQRFVWV
Subjt:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV

Query:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE
        IRPPAGT+S+G FFTAG GS+ D+   E+LPEGFIKRTKEVGLV+PMWGPQAEILSHRSVRGF+THCGWNSSLE IVNGVAMVTWPLYAEQKMNA +LTE
Subjt:  IRPPAGTDSVGAFFTAGTGSA-DESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE

Query:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC
        EVGVAVRVR E   +V R+EIE KVR IME +EG  IRERVKGLKISGEKA++KGGSSYNSLA +ASECDIFRRRRDGGC
Subjt:  EVGVAVRVRTE--RVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC

A0A6J1GDG0 UDP-glycosyltransferase 72E2-like; e value 4.8e-205; %identity 76.47
Query:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGT-GSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSI
        M V E KTHVALLVSPGMGHLIPFLELA+RLVLHHNLQ TLFVV T  SSAA++ LLQK SAVNIV + HSS +LDP    +D IKAMMTAS+PH+RS I
Subjt:  MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGT-GSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSI

Query:  AAAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFAREL
        AA +PRP ALIVDLFGTEAI++AHELGMLGFVF+ ++AWFLSL FFFP MD+  +DAH YNHE L+IPGC+ VRFEDT +  +++ E+V+E     AREL
Subjt:  AAAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFAREL

Query:  GTADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVW
        G ADGIL+NTWQDLEPTTLKAL EAGTLS GKVN+VPI PIGPL R  EPNLESEVL WLD+QPDESVI++SFGSGGTLCAEQITELAWGLELSQQRFVW
Subjt:  GTADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVW

Query:  VIRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE
        VIRPPAGTD +GAFFT  T S +++P EYLPEGF+KRT+EVGLVVPMWGPQAEIL HRSVRGF+THCGWNSS+E +VNGVAMVTWPLYAEQKMNA +LTE
Subjt:  VIRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE

Query:  EVGVAVRVRTERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDG
        EVGVAVR R E VVGR EIER VR+IM DEEGR IRERVK +K SGEKA+ KGGSSYNSLAH+ASEC   RRR +G
Subjt:  EVGVAVRVRTERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASECDIFRRRRDG

SwissProt top hits (e value, %identity, alignment)
O81498 UDP-glycosyltransferase 72E3; e value 1.3e-119; %identity 46.38
Query:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHS--SSTLDPNASYIDIIKAMMTASFPHLRSSIAAAK
        +K H A+  SPGMGH++P +ELA RL  +H    T+FV+ T +++ +S LL   + V+IV+LP    S  +DPNA  +  I  +M  + P LRS I A  
Subjt:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHS--SSTLDPNASYIDIIKAMMTASFPHLRSSIAAAK

Query:  PRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTAD
          P ALI+DLFGT+A+ +A EL ML +VFI SNA +L +S ++P++D    + H    + L IPGC PVRFED  + + +  E VY            AD
Subjt:  PRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTAD

Query:  GILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPN-LESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIR
        GIL NTW+++EP +LK+L +   L  G+V +VP+ P+GPL R  + +  +  V  WL++QP+ESV+Y+SFGSGG+L A+Q+TELAWGLE SQQRF+WV+R
Subjt:  GILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPN-LESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIR

Query:  PPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVG
        PP    S   +F+A  G   ++ PEYLPEGF+ RT + G ++P W PQAEIL+H++V GF+THCGW+S+LE ++ GV M+ WPL+AEQ MNA LL++E+G
Subjt:  PPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVG

Query:  VAVRV-RTERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS--KGGSSYNSLAHMASECDIF
        ++VRV   +  + R +IE  VRK+M ++EG E+R +VK L+ + E ++S   GGS++ SL  +  EC  F
Subjt:  VAVRV-RTERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS--KGGSSYNSLAHMASECDIF

Q40287 Anthocyanidin 3-O-glucosyltransferase 5; e value 2.1e-125; %identity 51.6
Query:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQK---PSAVNIVSL--PHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA
        SK H+ LL SPG+GHLIP LEL  R+V   N   T+F+VG+ +SAAE  +L+    P    I+ L  P+ S  +DP A+    +  +M    P  R++++
Subjt:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQK---PSAVNIVSL--PHSSSTLDPNASYIDIIKAMMTASFPHLRSSIA

Query:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG
        A K RPAA+IVDLFGTE++ +A ELG+  +V+I SNAWFL+L+ + P +D+      V   E +KIPGC PVR E+  +         Y  +     E+ 
Subjt:  AAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELG

Query:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEP-NLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVW
        TADGIL NTW+ LEPTT  AL +   L  G+V +VP+ PIGPL R A P     E+L WLDQQP ESV+YVSFGSGGTL  EQ+ ELAWGLE SQQRF+W
Subjt:  TADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEP-NLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVW

Query:  VIRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE
        V+R P       AFFT G G+ D S   Y PEGF+ R + VGLVVP W PQ  I+SH SV  F++HCGWNS LE I  GV ++ WP+YAEQ+MNATLLTE
Subjt:  VIRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTE

Query:  EVGVAVRVR---TERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASE
        E+GVAVR +    + VV REEIER +R+IM DEEG EIR+RV+ LK SGEKA+++GGSS+N ++ + +E
Subjt:  EVGVAVRVR---TERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASE

Q94A84 UDP-glycosyltransferase 72E1; e value 1.2e-123; %identity 48.41
Query:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKP----SAVNIVSL--PHSSSTLDPNASYIDIIKAMMTASFPHLRSSI
        +K HVA+  SPGMGH+IP +EL  RL   H    T+FV+ T +++A+S  L  P    + V+IV L  P  S  +DP+A +   +  MM  + P +RS I
Subjt:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKP----SAVNIVSL--PHSSSTLDPNASYIDIIKAMMTASFPHLRSSI

Query:  AAAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFAREL
           + +P ALIVDLFG +AI +  E  ML ++FI SNA FL+++ FFP++D+   + H+   + + +PGC PVRFEDT E F      +Y  F  F    
Subjt:  AAAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFAREL

Query:  GTADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESE-VLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFV
         T DGI+ NTW D+EP TLK+L +   L  G++  VP+ PIGPL+R  +P+  +  VL WL++QPDESV+Y+SFGSGG+L A+Q+TELAWGLE+SQQRFV
Subjt:  GTADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESE-VLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFV

Query:  WVIRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLT
        WV+RPP    +  A+ +A +G   +  P+YLPEGF+ RT E G +V  W PQAEIL+H++V GF+THCGWNS LE +V GV M+ WPL+AEQ MNATLL 
Subjt:  WVIRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLT

Query:  EEVGVAVRVR---TERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS-KGGSSYNSLAHMASECD
        EE+GVAVR +   +E V+ R EIE  VRKIM +EEG E+R+++K LK +  +++S  GG ++ SL+ +A E +
Subjt:  EEVGVAVRVR---TERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS-KGGSSYNSLAHMASECD

Q9LVR1 UDP-glycosyltransferase 72E2; e value 3.5e-120; %identity 47.36
Query:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSS--STLDPNASYIDIIKAMMTASFPHLRSSIAAAK
        +K H A+  SPGMGH+IP +EL  RL  ++    T+FV+ T +++A+S  L   + V+IV LP       +DP+   +  I  +M A+ P LRS IAA  
Subjt:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSS--STLDPNASYIDIIKAMMTASFPHLRSSIAAAK

Query:  PRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTAD
         +P ALIVDLFGT+A+ +A E  ML +VFI +NA FL +S ++P++D+   + H      L IPGC PVRFEDT + + +  E VY  F         AD
Subjt:  PRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTAD

Query:  GILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPN-LESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIR
        GIL NTW+++EP +LK+L     L  G+V +VP+ PIGPL R  + +  +  VL WL++QP+ESV+Y+SFGSGG L A+Q+TELAWGLE SQQRFVWV+R
Subjt:  GILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPN-LESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIR

Query:  PPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVG
        PP        + +A  G  +++ PEYLPEGF+ RT + G VVP W PQAEILSHR+V GF+THCGW+S+LE +V GV M+ WPL+AEQ MNA LL++E+G
Subjt:  PPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVG

Query:  VAVRVRTERV-VGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS--KGGSSYNSLAHMASECDIFRRR
        +AVR+   +  + R +IE  VRK+M ++EG  +R +VK L+ S E ++S   GG ++ SL  +  EC  F  R
Subjt:  VAVRVRTERV-VGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS--KGGSSYNSLAHMASECDIFRRR

Q9ZU72 UDP-glycosyltransferase 72D1; e value 4.0e-100; %identity 43.95
Query:  HVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSA-AESTLLQKPSAVNIVSLPHSSST-----LDPNASYIDIIKAMMTASFPHLRSSIAAA
        H  L+ SPG+GHLIP LEL NRL    N+  T+  V +GSS+  E+  +   +A  I  +    S      ++P+A+    +   M A  P +R ++   
Subjt:  HVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSA-AESTLLQKPSAVNIVSLPHSSST-----LDPNASYIDIIKAMMTASFPHLRSSIAAA

Query:  KPRPAALIVDLFGTEAISIAHELGMLG-FVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGT
        K +P  +IVD  GTE +S+A ++GM   +V++ ++AWFL++  + P +D      +V   E LKIPGC PV  ++  E         Y+       E+  
Subjt:  KPRPAALIVDLFGTEAISIAHELGMLG-FVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGT

Query:  ADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTR-HAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV
        +DG+L NTW++L+  TL AL E   LS  +V +VP+ PIGP+ R +   +  + + +WLD+Q + SV++V  GSGGTL  EQ  ELA GLELS QRFVWV
Subjt:  ADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTR-HAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV

Query:  IRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEE
        +R PA        +     S DE     LPEGF+ RT+ VG+VV  W PQ EILSHRS+ GF++HCGW+S+LE +  GV ++ WPLYAEQ MNATLLTEE
Subjt:  IRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEE

Query:  VGVAVR---VRTERVVGREEIERKVRKIM--EDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASEC
        +GVAVR   + +ERV+GREE+   VRKIM  EDEEG++IR + + +++S E+A SK GSSYNSL   A  C
Subjt:  VGVAVR---VRTERVVGREEIERKVRKIM--EDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASEC

Arabidopsis top hits (e value, %identity, alignment)
AT2G18570.1 UDP-Glycosyltransferase superfamily protein; e value 2.8e-101; %identity 43.95
Query:  HVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSA-AESTLLQKPSAVNIVSLPHSSST-----LDPNASYIDIIKAMMTASFPHLRSSIAAA
        H  L+ SPG+GHLIP LEL NRL    N+  T+  V +GSS+  E+  +   +A  I  +    S      ++P+A+    +   M A  P +R ++   
Subjt:  HVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSA-AESTLLQKPSAVNIVSLPHSSST-----LDPNASYIDIIKAMMTASFPHLRSSIAAA

Query:  KPRPAALIVDLFGTEAISIAHELGMLG-FVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGT
        K +P  +IVD  GTE +S+A ++GM   +V++ ++AWFL++  + P +D      +V   E LKIPGC PV  ++  E         Y+       E+  
Subjt:  KPRPAALIVDLFGTEAISIAHELGMLG-FVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGT

Query:  ADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTR-HAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV
        +DG+L NTW++L+  TL AL E   LS  +V +VP+ PIGP+ R +   +  + + +WLD+Q + SV++V  GSGGTL  EQ  ELA GLELS QRFVWV
Subjt:  ADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTR-HAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWV

Query:  IRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEE
        +R PA        +     S DE     LPEGF+ RT+ VG+VV  W PQ EILSHRS+ GF++HCGW+S+LE +  GV ++ WPLYAEQ MNATLLTEE
Subjt:  IRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEE

Query:  VGVAVR---VRTERVVGREEIERKVRKIM--EDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASEC
        +GVAVR   + +ERV+GREE+   VRKIM  EDEEG++IR + + +++S E+A SK GSSYNSL   A  C
Subjt:  VGVAVR---VRTERVVGREEIERKVRKIM--EDEEGREIRERVKGLKISGEKAISKGGSSYNSLAHMASEC

AT3G50740.1 UDP-glucosyl transferase 72E1; e value 8.2e-125; %identity 48.41
Query:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKP----SAVNIVSL--PHSSSTLDPNASYIDIIKAMMTASFPHLRSSI
        +K HVA+  SPGMGH+IP +EL  RL   H    T+FV+ T +++A+S  L  P    + V+IV L  P  S  +DP+A +   +  MM  + P +RS I
Subjt:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKP----SAVNIVSL--PHSSSTLDPNASYIDIIKAMMTASFPHLRSSI

Query:  AAAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFAREL
           + +P ALIVDLFG +AI +  E  ML ++FI SNA FL+++ FFP++D+   + H+   + + +PGC PVRFEDT E F      +Y  F  F    
Subjt:  AAAKPRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFAREL

Query:  GTADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESE-VLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFV
         T DGI+ NTW D+EP TLK+L +   L  G++  VP+ PIGPL+R  +P+  +  VL WL++QPDESV+Y+SFGSGG+L A+Q+TELAWGLE+SQQRFV
Subjt:  GTADGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESE-VLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFV

Query:  WVIRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLT
        WV+RPP    +  A+ +A +G   +  P+YLPEGF+ RT E G +V  W PQAEIL+H++V GF+THCGWNS LE +V GV M+ WPL+AEQ MNATLL 
Subjt:  WVIRPPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLT

Query:  EEVGVAVRVR---TERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS-KGGSSYNSLAHMASECD
        EE+GVAVR +   +E V+ R EIE  VRKIM +EEG E+R+++K LK +  +++S  GG ++ SL+ +A E +
Subjt:  EEVGVAVRVR---TERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS-KGGSSYNSLAHMASECD

AT4G36770.1 UDP-Glycosyltransferase superfamily protein; e value 1.6e-99; %identity 43.43
Query:  HVALLVSPGMGHLIPFLELANRLVLHHNL-QATLFVVGTGSSAAES----TLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIAAAK
        H AL+ SPGMGH +P LEL   L+ HH   + T+F+V    S ++S    TL+++     I  +P   S  D + S +  +  MM  + P ++SS+   +
Subjt:  HVALLVSPGMGHLIPFLELANRLVLHHNL-QATLFVVGTGSSAAES----TLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIAAAK

Query:  PRPAALIVDLFGTEAISIAHELG-MLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTA
        PRP   +VDL GTEA+ +A ELG M   V +T++AWFL+ + +  S+D+  +   + +  AL IPGC+PV+FE   +  +  +E           E+ TA
Subjt:  PRPAALIVDLFGTEAISIAHELG-MLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTA

Query:  DGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIR
        DG+  NTW  LE  T+ +  +   L +  +  VP+ P+GPL R AEP L+  VL WLD QP ESV+YVSFGSGG L  EQ  ELA+GLEL+  RFVWV+R
Subjt:  DGILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIR

Query:  PPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVG
        PPA  D   + F       +  P ++LP GF+ RTK++GLVV  W PQ EIL+H+S  GF+THCGWNS LE IVNGV MV WPLY+EQKMNA +++ E+ 
Subjt:  PPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVG

Query:  VAVRVR-TERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS
        +A+++   + +V +E I   V+++M++EEG+E+R+ VK LK + E+A++
Subjt:  VAVRVR-TERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS

AT5G26310.1 UDP-Glycosyltransferase superfamily protein; e value 9.4e-121; %identity 46.38
Query:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHS--SSTLDPNASYIDIIKAMMTASFPHLRSSIAAAK
        +K H A+  SPGMGH++P +ELA RL  +H    T+FV+ T +++ +S LL   + V+IV+LP    S  +DPNA  +  I  +M  + P LRS I A  
Subjt:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHS--SSTLDPNASYIDIIKAMMTASFPHLRSSIAAAK

Query:  PRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTAD
          P ALI+DLFGT+A+ +A EL ML +VFI SNA +L +S ++P++D    + H    + L IPGC PVRFED  + + +  E VY            AD
Subjt:  PRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTAD

Query:  GILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPN-LESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIR
        GIL NTW+++EP +LK+L +   L  G+V +VP+ P+GPL R  + +  +  V  WL++QP+ESV+Y+SFGSGG+L A+Q+TELAWGLE SQQRF+WV+R
Subjt:  GILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPN-LESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIR

Query:  PPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVG
        PP    S   +F+A  G   ++ PEYLPEGF+ RT + G ++P W PQAEIL+H++V GF+THCGW+S+LE ++ GV M+ WPL+AEQ MNA LL++E+G
Subjt:  PPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVG

Query:  VAVRV-RTERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS--KGGSSYNSLAHMASECDIF
        ++VRV   +  + R +IE  VRK+M ++EG E+R +VK L+ + E ++S   GGS++ SL  +  EC  F
Subjt:  VAVRV-RTERVVGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS--KGGSSYNSLAHMASECDIF

AT5G66690.1 UDP-Glycosyltransferase superfamily protein; e value 2.5e-121; %identity 47.36
Query:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSS--STLDPNASYIDIIKAMMTASFPHLRSSIAAAK
        +K H A+  SPGMGH+IP +EL  RL  ++    T+FV+ T +++A+S  L   + V+IV LP       +DP+   +  I  +M A+ P LRS IAA  
Subjt:  SKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSS--STLDPNASYIDIIKAMMTASFPHLRSSIAAAK

Query:  PRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTAD
         +P ALIVDLFGT+A+ +A E  ML +VFI +NA FL +S ++P++D+   + H      L IPGC PVRFEDT + + +  E VY  F         AD
Subjt:  PRPAALIVDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTAD

Query:  GILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPN-LESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIR
        GIL NTW+++EP +LK+L     L  G+V +VP+ PIGPL R  + +  +  VL WL++QP+ESV+Y+SFGSGG L A+Q+TELAWGLE SQQRFVWV+R
Subjt:  GILSNTWQDLEPTTLKALTEAGTLSKGKVNQVPINPIGPLTRHAEPN-LESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIR

Query:  PPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVG
        PP        + +A  G  +++ PEYLPEGF+ RT + G VVP W PQAEILSHR+V GF+THCGW+S+LE +V GV M+ WPL+AEQ MNA LL++E+G
Subjt:  PPAGTDSVGAFFTAGTGSADESPPEYLPEGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVG

Query:  VAVRVRTERV-VGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS--KGGSSYNSLAHMASECDIFRRR
        +AVR+   +  + R +IE  VRK+M ++EG  +R +VK L+ S E ++S   GG ++ SL  +  EC  F  R
Subjt:  VAVRVRTERV-VGREEIERKVRKIMEDEEGREIRERVKGLKISGEKAIS--KGGSSYNSLAHMASECDIFRRR


Sequences
CDS sequence
ATGCCTGTGGAAGAATCCAAAACCCACGTGGCTTTGCTGGTCAGCCCCGGAATGGGACATCTCATTCCCTTCCTCGAGCTGGCCAACCGCCTCGTCCTCCATCACAACCT
CCAAGCTACCCTCTTTGTCGTCGGCACCGGCTCCTCCGCCGCCGAATCCACCCTTCTCCAAAAACCTTCCGCCGTAAACATCGTCTCACTTCCTCACTCCTCATCCACCC
TCGACCCAAATGCCTCCTACATCGACATAATCAAGGCCATGATGACCGCTTCTTTCCCCCACCTCCGCTCCTCCATCGCCGCTGCCAAGCCCCGTCCGGCGGCGTTGATC
GTTGACCTTTTCGGAACCGAAGCTATATCCATAGCTCACGAACTCGGCATGTTGGGATTCGTTTTCATAACCTCCAACGCCTGGTTCCTCTCTCTCTCATTCTTTTTCCC
TTCCATGGACAGACCAACCATTGACGCCCACGTGTACAACCACGAAGCCCTCAAAATCCCGGGTTGCACTCCGGTCCGATTCGAAGATACCACCGAGGTGTTCCAATTGA
ACCAGGAGGATGTTTACGAGGGATTCGGAAGTTTCGCACGGGAACTTGGAACGGCCGATGGGATCTTATCGAATACGTGGCAGGATCTGGAGCCCACAACACTAAAAGCA
CTGACTGAAGCTGGGACCCTCAGTAAAGGAAAAGTCAACCAAGTCCCGATCAATCCAATTGGGCCGTTGACTAGACATGCTGAGCCCAATTTGGAGAGTGAGGTGCTGAA
ATGGCTCGACCAGCAACCGGATGAGTCGGTGATCTACGTGTCGTTTGGGAGTGGGGGGACGTTGTGTGCAGAGCAAATCACGGAATTGGCGTGGGGGTTGGAGCTGAGTC
AGCAAAGGTTTGTTTGGGTGATACGGCCGCCGGCAGGGACCGATTCTGTGGGAGCATTTTTCACGGCGGGAACCGGATCTGCAGACGAGTCGCCGCCTGAATACCTGCCG
GAAGGGTTCATAAAAAGGACGAAAGAGGTGGGTTTAGTAGTTCCGATGTGGGGACCACAGGCGGAGATTTTGAGCCATAGATCGGTGAGGGGATTTATTACGCACTGTGG
ATGGAACTCGTCGTTAGAGGGCATAGTGAACGGAGTGGCGATGGTGACGTGGCCATTGTACGCTGAGCAGAAGATGAACGCGACGTTGCTAACGGAGGAGGTGGGGGTGG
CGGTGAGGGTCAGGACGGAGAGGGTGGTGGGGAGGGAAGAGATAGAGAGAAAGGTGAGGAAGATAATGGAGGATGAAGAAGGTCGGGAAATCAGAGAGAGAGTTAAAGGG
CTTAAAATTAGTGGTGAAAAGGCCATTTCTAAGGGTGGGTCCTCCTACAACTCTTTGGCTCATATGGCTTCAGAATGCGATATTTTCCGGCGCCGGAGAGACGGAGGGTG
TTAG
mRNA sequence
ATGCCTGTGGAAGAATCCAAAACCCACGTGGCTTTGCTGGTCAGCCCCGGAATGGGACATCTCATTCCCTTCCTCGAGCTGGCCAACCGCCTCGTCCTCCATCACAACCT
CCAAGCTACCCTCTTTGTCGTCGGCACCGGCTCCTCCGCCGCCGAATCCACCCTTCTCCAAAAACCTTCCGCCGTAAACATCGTCTCACTTCCTCACTCCTCATCCACCC
TCGACCCAAATGCCTCCTACATCGACATAATCAAGGCCATGATGACCGCTTCTTTCCCCCACCTCCGCTCCTCCATCGCCGCTGCCAAGCCCCGTCCGGCGGCGTTGATC
GTTGACCTTTTCGGAACCGAAGCTATATCCATAGCTCACGAACTCGGCATGTTGGGATTCGTTTTCATAACCTCCAACGCCTGGTTCCTCTCTCTCTCATTCTTTTTCCC
TTCCATGGACAGACCAACCATTGACGCCCACGTGTACAACCACGAAGCCCTCAAAATCCCGGGTTGCACTCCGGTCCGATTCGAAGATACCACCGAGGTGTTCCAATTGA
ACCAGGAGGATGTTTACGAGGGATTCGGAAGTTTCGCACGGGAACTTGGAACGGCCGATGGGATCTTATCGAATACGTGGCAGGATCTGGAGCCCACAACACTAAAAGCA
CTGACTGAAGCTGGGACCCTCAGTAAAGGAAAAGTCAACCAAGTCCCGATCAATCCAATTGGGCCGTTGACTAGACATGCTGAGCCCAATTTGGAGAGTGAGGTGCTGAA
ATGGCTCGACCAGCAACCGGATGAGTCGGTGATCTACGTGTCGTTTGGGAGTGGGGGGACGTTGTGTGCAGAGCAAATCACGGAATTGGCGTGGGGGTTGGAGCTGAGTC
AGCAAAGGTTTGTTTGGGTGATACGGCCGCCGGCAGGGACCGATTCTGTGGGAGCATTTTTCACGGCGGGAACCGGATCTGCAGACGAGTCGCCGCCTGAATACCTGCCG
GAAGGGTTCATAAAAAGGACGAAAGAGGTGGGTTTAGTAGTTCCGATGTGGGGACCACAGGCGGAGATTTTGAGCCATAGATCGGTGAGGGGATTTATTACGCACTGTGG
ATGGAACTCGTCGTTAGAGGGCATAGTGAACGGAGTGGCGATGGTGACGTGGCCATTGTACGCTGAGCAGAAGATGAACGCGACGTTGCTAACGGAGGAGGTGGGGGTGG
CGGTGAGGGTCAGGACGGAGAGGGTGGTGGGGAGGGAAGAGATAGAGAGAAAGGTGAGGAAGATAATGGAGGATGAAGAAGGTCGGGAAATCAGAGAGAGAGTTAAAGGG
CTTAAAATTAGTGGTGAAAAGGCCATTTCTAAGGGTGGGTCCTCCTACAACTCTTTGGCTCATATGGCTTCAGAATGCGATATTTTCCGGCGCCGGAGAGACGGAGGGTG
TTAG
Protein sequence
MPVEESKTHVALLVSPGMGHLIPFLELANRLVLHHNLQATLFVVGTGSSAAESTLLQKPSAVNIVSLPHSSSTLDPNASYIDIIKAMMTASFPHLRSSIAAAKPRPAALI
VDLFGTEAISIAHELGMLGFVFITSNAWFLSLSFFFPSMDRPTIDAHVYNHEALKIPGCTPVRFEDTTEVFQLNQEDVYEGFGSFARELGTADGILSNTWQDLEPTTLKA
LTEAGTLSKGKVNQVPINPIGPLTRHAEPNLESEVLKWLDQQPDESVIYVSFGSGGTLCAEQITELAWGLELSQQRFVWVIRPPAGTDSVGAFFTAGTGSADESPPEYLP
EGFIKRTKEVGLVVPMWGPQAEILSHRSVRGFITHCGWNSSLEGIVNGVAMVTWPLYAEQKMNATLLTEEVGVAVRVRTERVVGREEIERKVRKIMEDEEGREIRERVKG
LKISGEKAISKGGSSYNSLAHMASECDIFRRRRDGGC