; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G016180 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G016180
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionGlycosyltransferase
Genome locationchr09:24383717..24386566
RNA-Seq ExpressionLsi09G016180
SyntenyLsi09G016180
Gene Ontology termsGO:0080043 - quercetin 3-O-glucosyltransferase activity (molecular function)
GO:0080044 - quercetin 7-O-glucosyltransferase activity (molecular function)
InterPro domainsIPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase
IPR035595 - UDP-glycosyltransferase family, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140483.1 UDP-glycosyltransferase 75C1 [Cucumis sativus]4.0e-22783.73Show/hide
Query:  MNNTA--PYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNI
        MNNT   P PR VLLVT+CAQGHINPTLQ AKRLTRHGD+HVTFL SLSAYRRMG TP++PH++FASFSDGYDDGFKP +DI  ++SELE RGS+ALKNI
Subjt:  MNNTA--PYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNI

Query:  IEESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFAL
        I+ESRN+GQPF+CIVYSIL+PWVATVARSL+V SV LWIQPAVVFALYYYYNNGYYDEIQR+ S DDP+S SIKLPGLPLLSARDLPSFFG+SD Y+FAL
Subjt:  IEESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFAL

Query:  PMFRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQK
        PMFR+QFELLEEESNPK+LINTFEELEKDAV+AIKKF LMPIGPLIPS  +DG+ PSEAS GCDLF+ TSSY+EWLNSKPKA+V+YVS GSIST+SKQQK
Subjt:  PMFRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQK

Query:  EEIARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETG
        EEIARGL  TKRPFLWVIR+IEEEED LSFKEKLETQGKIV WC+QLEVLSSPATGCFLTHCGWNSCLESLACGVP VAFPQW+DQATN+KI++D SETG
Subjt:  EEIARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETG

Query:  VRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS
        VRLE  E+GVVKGEEIERCL+LVMGDS+KGEEIRRNA+KWKKLAKEAASEGGSS AN KAFVD V S
Subjt:  VRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS

XP_008458144.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis melo]7.8e-23186.27Show/hide
Query:  NNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEE
        N T P PRRVLL+TY AQGHINPTLQ AKRL RHGD+HVTFLTSLSAYRRMGQTP++PHLSFASFSDGYDDGFKPG+DIDH++SELE  GS+ALKNII+E
Subjt:  NNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEE

Query:  SRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDP-NSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM
        SRNQGQPF+CIVYSILLPWVATVARSL+V SVLLWIQPAVVFALYYYY NGYYDEIQR+IS DDP +SMSIKLPGLPLLSARDLPSFFG SDVY FAL +
Subjt:  SRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDP-NSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM

Query:  FRRQFELL-EEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKE
        FR+QFELL EEESNP +LINTFEELEKDAV+AIKKF LMPIGPLIPS F DG  PSEAS GCDL++ TSSYI+WLNSKPKA+V+YVSSGSI+ LS QQKE
Subjt:  FRRQFELL-EEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKE

Query:  EIARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGV
        E+ARGLLSTKRPFLWVIRD E EED LSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQW+DQATN+KI+QD SETGV
Subjt:  EIARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGV

Query:  RLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS
        RLEA EDGVVKGEEIERCL LVMGDS+KGE+IRRNA+KWKKLAKEAASEGGSS ANFKAFVDQV S
Subjt:  RLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS

XP_022964048.1 crocetin glucosyltransferase, chloroplastic-like [Cucurbita moschata]4.2e-19274.08Show/hide
Query:  MNNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE
        M+NTAP+  RVLL+TY AQGHINP L+FAKRLTR   + VTF+TSLSAYRRMG+TP++PH+SFASFSDGYDDGFK G+DI+HFMSELE RGS+A+K++I 
Subjt:  MNNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE

Query:  ESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM
            QGQPF+CIVYSILLPWVATVARSL++P++LLWIQPA+VFALYYYYN GY+D IQ   +S DP + +I+LPGLPLL+ARDLPSFFGSSD Y FALP+
Subjt:  ESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM

Query:  FRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEE
        FRRQFELLE+E+NP V+INTF+ELE DA+RAI KF L+P+GPLI         PSEAS  CDLFQ T+SYI+WLNSKPK +VIY+SSGS+STLSK QKEE
Subjt:  FRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEE

Query:  IARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVR
        IARGLLS  RPFLWVIRDI EE + LS +E+LE  GKIVPWCSQ+EVLS PATGCFLTHCGWNS LESL CGVP V FPQW+DQ TNAKI+QD SETGVR
Subjt:  IARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVR

Query:  LEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV
        LE   DGVVK EEI+RCL+LVMGDS+KGEEIRRN VKWK+LAK A + GGSS++NFKAFVDQV
Subjt:  LEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV

XP_023513863.1 crocetin glucosyltransferase, chloroplastic-like [Cucurbita pepo subsp. pepo]1.2e-19174.3Show/hide
Query:  MNNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE
        M+NTAP+  RVLL+TY AQGHINP L+FAKRLTR   + VTF+TSLSAYRRMG+TP++PH+SFASFSDGYDDGFK G+DI+HFMSELE RGS+A+K++I 
Subjt:  MNNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE

Query:  ESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM
            QGQPF+CIVYSILLPWVA VARSL++P++LLWIQPA+VFALYYYYN GY+D IQ V   DDP + +I+LPGLPLL+ARDLPSFFGSSD Y FALP+
Subjt:  ESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM

Query:  FRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEE
        FRRQFELLE+E+NP V+INTF+ELE DA+RAI KF L+PIGPLI         PSEAS  CDLFQ T+SYI+WLNSKPK +VIYVSSGSISTLSK QKEE
Subjt:  FRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEE

Query:  IARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVR
        IARGLLS  RPFLWVIRDI EE + LS +E+LE  GKIV WCSQ+EVLS PATGCFLTHCGWNS LESL CGVP V FPQW+DQ TNAKI+QD SETGVR
Subjt:  IARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVR

Query:  LEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV
        LE   DGVVK EEI+RCL+LVMGDS+KGEEIR+N VKWK+LAK A + GGSS++NFKAFVDQV
Subjt:  LEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV

XP_038876006.1 phloretin 4'-O-glucosyltransferase-like [Benincasa hispida]1.3e-24189.48Show/hide
Query:  MNNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE
        M  T P+P RVLLVTYCAQGHINPTLQFA+RLTRHGDVHVTFLTSLSAYRR+GQTP++PHLSF SFSDGYDDGFKPG+D++ FMSELE  GSEALKNIIE
Subjt:  MNNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE

Query:  ESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM
        ESRN+GQPF+CIVYSILLPWVATVARSL++PSVLLWIQPAVVFALYYYYNNGYYDEIQR+ISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVY+FALP+
Subjt:  ESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM

Query:  FRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEE
        FRRQFELLEEESNPKVLINTFEELEKDAVRAIKKF LMPIGPLIPSAFLDGH PSE S GCDLF+ TSSYI+WLNSKPKA+VIYVSSGSISTLSKQQKEE
Subjt:  FRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEE

Query:  IARGLLSTKRPFLWVIRDIEEE-EDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGV
        IARGLLSTKRPFLWVIRDIEEE ED LSFKEKLETQGKIV WCSQLEVLSSPATGCFLTHCGWNSCLESLACG+PTV  PQWTDQATNAKIVQD SETGV
Subjt:  IARGLLSTKRPFLWVIRDIEEE-EDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGV

Query:  RLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS
        RL+  EDGVVKGEEIERCL+LVMG+SEKGEEIRRNA+KWKKLA+EA SEGGSS AN KAFVDQV S
Subjt:  RLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS

TrEMBL top hitse value%identityAlignment
A0A0A0KA46 UDP-glucose:flavonoid 7-O-glucosyltransferase1.9e-22783.73Show/hide
Query:  MNNTA--PYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNI
        MNNT   P PR VLLVT+CAQGHINPTLQ AKRLTRHGD+HVTFL SLSAYRRMG TP++PH++FASFSDGYDDGFKP +DI  ++SELE RGS+ALKNI
Subjt:  MNNTA--PYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNI

Query:  IEESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFAL
        I+ESRN+GQPF+CIVYSIL+PWVATVARSL+V SV LWIQPAVVFALYYYYNNGYYDEIQR+ S DDP+S SIKLPGLPLLSARDLPSFFG+SD Y+FAL
Subjt:  IEESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFAL

Query:  PMFRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQK
        PMFR+QFELLEEESNPK+LINTFEELEKDAV+AIKKF LMPIGPLIPS  +DG+ PSEAS GCDLF+ TSSY+EWLNSKPKA+V+YVS GSIST+SKQQK
Subjt:  PMFRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQK

Query:  EEIARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETG
        EEIARGL  TKRPFLWVIR+IEEEED LSFKEKLETQGKIV WC+QLEVLSSPATGCFLTHCGWNSCLESLACGVP VAFPQW+DQATN+KI++D SETG
Subjt:  EEIARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETG

Query:  VRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS
        VRLE  E+GVVKGEEIERCL+LVMGDS+KGEEIRRNA+KWKKLAKEAASEGGSS AN KAFVD V S
Subjt:  VRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS

A0A1S3C8E8 Glycosyltransferase3.8e-23186.27Show/hide
Query:  NNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEE
        N T P PRRVLL+TY AQGHINPTLQ AKRL RHGD+HVTFLTSLSAYRRMGQTP++PHLSFASFSDGYDDGFKPG+DIDH++SELE  GS+ALKNII+E
Subjt:  NNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEE

Query:  SRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDP-NSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM
        SRNQGQPF+CIVYSILLPWVATVARSL+V SVLLWIQPAVVFALYYYY NGYYDEIQR+IS DDP +SMSIKLPGLPLLSARDLPSFFG SDVY FAL +
Subjt:  SRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDP-NSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM

Query:  FRRQFELL-EEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKE
        FR+QFELL EEESNP +LINTFEELEKDAV+AIKKF LMPIGPLIPS F DG  PSEAS GCDL++ TSSYI+WLNSKPKA+V+YVSSGSI+ LS QQKE
Subjt:  FRRQFELL-EEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKE

Query:  EIARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGV
        E+ARGLLSTKRPFLWVIRD E EED LSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQW+DQATN+KI+QD SETGV
Subjt:  EIARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGV

Query:  RLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS
        RLEA EDGVVKGEEIERCL LVMGDS+KGE+IRRNA+KWKKLAKEAASEGGSS ANFKAFVDQV S
Subjt:  RLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS

A0A5D3CQ80 Glycosyltransferase3.8e-23186.27Show/hide
Query:  NNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEE
        N T P PRRVLL+TY AQGHINPTLQ AKRL RHGD+HVTFLTSLSAYRRMGQTP++PHLSFASFSDGYDDGFKPG+DIDH++SELE  GS+ALKNII+E
Subjt:  NNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEE

Query:  SRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDP-NSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM
        SRNQGQPF+CIVYSILLPWVATVARSL+V SVLLWIQPAVVFALYYYY NGYYDEIQR+IS DDP +SMSIKLPGLPLLSARDLPSFFG SDVY FAL +
Subjt:  SRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDP-NSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM

Query:  FRRQFELL-EEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKE
        FR+QFELL EEESNP +LINTFEELEKDAV+AIKKF LMPIGPLIPS F DG  PSEAS GCDL++ TSSYI+WLNSKPKA+V+YVSSGSI+ LS QQKE
Subjt:  FRRQFELL-EEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKE

Query:  EIARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGV
        E+ARGLLSTKRPFLWVIRD E EED LSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQW+DQATN+KI+QD SETGV
Subjt:  EIARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGV

Query:  RLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS
        RLEA EDGVVKGEEIERCL LVMGDS+KGE+IRRNA+KWKKLAKEAASEGGSS ANFKAFVDQV S
Subjt:  RLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVSS

A0A6J1HJP1 Glycosyltransferase2.0e-19274.08Show/hide
Query:  MNNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE
        M+NTAP+  RVLL+TY AQGHINP L+FAKRLTR   + VTF+TSLSAYRRMG+TP++PH+SFASFSDGYDDGFK G+DI+HFMSELE RGS+A+K++I 
Subjt:  MNNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE

Query:  ESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM
            QGQPF+CIVYSILLPWVATVARSL++P++LLWIQPA+VFALYYYYN GY+D IQ   +S DP + +I+LPGLPLL+ARDLPSFFGSSD Y FALP+
Subjt:  ESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM

Query:  FRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEE
        FRRQFELLE+E+NP V+INTF+ELE DA+RAI KF L+P+GPLI         PSEAS  CDLFQ T+SYI+WLNSKPK +VIY+SSGS+STLSK QKEE
Subjt:  FRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEE

Query:  IARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVR
        IARGLLS  RPFLWVIRDI EE + LS +E+LE  GKIVPWCSQ+EVLS PATGCFLTHCGWNS LESL CGVP V FPQW+DQ TNAKI+QD SETGVR
Subjt:  IARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVR

Query:  LEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV
        LE   DGVVK EEI+RCL+LVMGDS+KGEEIRRN VKWK+LAK A + GGSS++NFKAFVDQV
Subjt:  LEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV

A0A6J1KN24 Glycosyltransferase6.5e-19173.87Show/hide
Query:  MNNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE
        M+NTAP+  R+LL+TY AQGHINP L+FAKRLTR   + VTF+TSLSAYRR+G+TP +PH+SFASFSDGYDDGFK G+DI+HFMSELE RGS+A+K++I 
Subjt:  MNNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE

Query:  ESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM
            QGQPF+CIVYSILLPWVATVARSL++P+VLLWIQPA+VFALYYYYN GY+D IQ V   DDP + +I+LPGLPLL+ARDLPSFFGSSD Y FALP+
Subjt:  ESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPM

Query:  FRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEE
        FRRQFELLE+E+NP ++INTF+ELE +A+RAI KF L+PIGPLI         PSEAS  CDLFQ T+SYI+WLNSKPK +VIYVSSGSISTLSK Q EE
Subjt:  FRRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEE

Query:  IARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVR
        IARGLLS  RPFLWVIRDI EE + LS +E+LE  GKIV WCSQ+EVLS PATGCFLTHCGWNS LESL CGVP V FPQW+DQ TNAKI+QD SETGVR
Subjt:  IARGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVR

Query:  LEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV
        LE   DGVVK EEI+RCL+LVMGDS+KGEEIRRN VKWK+LAK A + GGSS++NFKAFVDQV
Subjt:  LEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV

SwissProt top hitse value%identityAlignment
A7MAS5 Phloretin 4'-O-glucosyltransferase1.3e-15659.28Show/hide
Query:  RVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEESRNQGQPF
        R LLVT+ AQGHINP+LQFAKRL      HVT++TSLSA+RR+G       L++A FSDGYDDGFKPG+++D +MSEL  RG +A+ +++  S N+G P+
Subjt:  RVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEESRNQGQPF

Query:  SCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSM--SIKLPGLPL-LSARDLPSFFGSSDVYNFALPMFRRQFE
        +C+VYS+LLPW A +A  L++PSVLLWIQPA VF +YYYY NGY D I+   SS   N +  SI+LPGLPL  ++RDLPSF   ++ YNFALP+F+ Q E
Subjt:  SCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSM--SIKLPGLPL-LSARDLPSFFGSSDVYNFALPMFRRQFE

Query:  LLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCT--SSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEIARG
        LLE E+NP +L+NTF+ LE +A++AI K+ L+ +GPLIPSAFLDG  PS+ S G DLFQ +  SSY+EWLNSKP+ +VIYVS GSIS L K Q EEIA+G
Subjt:  LLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCT--SSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEIARG

Query:  LLSTKRPFLWVIRD----------IEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDF
        LL    PFLWVIRD           ++EE+ L  +E+LE  G IVPWCSQ+EVLSSP+ GCF+THCGWNS LESL  GVP VAFPQWTDQ TNAK+++D+
Subjt:  LLSTKRPFLWVIRD----------IEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDF

Query:  SETGVRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV
         +TGVR+   E+G+V GEE++RCL LV+G  E GE++RRNA KWK LA+EA SEG SS  N +AF+DQ+
Subjt:  SETGVRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV

F8WKW0 Crocetin glucosyltransferase, chloroplastic1.1e-14757.85Show/hide
Query:  RRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRM----GQTPSVPHLSFASFSDGYDDGFKP-GNDIDHFMSELEHRGSEALKNIIEESR
        R VLL+TY AQGHINP LQFA+RL R G + VT  TS+ A  RM    G TP    L+FA+FSDGYDDGF+P G D   +MS L  +GS  L+N+I  S 
Subjt:  RRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRM----GQTPSVPHLSFASFSDGYDDGFKP-GNDIDHFMSELEHRGSEALKNIIEESR

Query:  NQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFF--GSSDVYNFALPMF
        +QG P +C+VY++LLPW ATVAR  ++PS LLWIQP  V  +YYYY  GY D+++   +S+DP + SI+ PGLP + A+DLPSF    S ++Y+FALP F
Subjt:  NQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFF--GSSDVYNFALPMF

Query:  RRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEI
        ++Q E L+EE  PKVL+NTF+ LE  A++AI+ + L+ IGPL PSAFLDG  PSE S   DLFQ +  Y EWLNS+P  +V+YVS GS+ TL KQQ EEI
Subjt:  RRQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEI

Query:  ARGLLSTKRPFLWVIR-----DIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSE
        ARGLL + RPFLWVIR     + E+EED L   E+LE QG IVPWCSQ+EVL+ P+ GCF+THCGWNS LE+L CGVP VAFP WTDQ TNAK+++D  E
Subjt:  ARGLLSTKRPFLWVIR-----DIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSE

Query:  TGVRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVD
        TGVR+   EDG V+ +EI+RC++ VM D EKG E++RNA KWK+LA+EA  E GSS  N KAFV+
Subjt:  TGVRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVD

K4CWS6 UDP-glycosyltransferase 75C11.2e-14657.63Show/hide
Query:  VLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQ---TPSVPHLSFASFSDGYDDGFKPG-NDIDHFMSELEHRGSEALKNIIEESRNQG
        VLLVT+ AQGHINP+LQFAKRL   G + VTF TS+ A+RRM +   + +   L+ A+FSDG+DDGFK   +D   +MSE+  RGS+ L+++I +S ++G
Subjt:  VLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQ---TPSVPHLSFASFSDGYDDGFKPG-NDIDHFMSELEHRGSEALKNIIEESRNQG

Query:  QPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSS----DVYNFALPMFR
        +P + +VY++LLPW A VAR L++PS LLWIQPA V  +YYYY NGY DE+ +  SS+DPN  SI+LP LPLL ++DLPSF  SS    D Y+FALP F+
Subjt:  QPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSS----DVYNFALPMFR

Query:  RQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQ-CTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEI
         Q + L+ E NPKVL+NTF+ LE + ++AI+K+ L+ IGPLIPS+FL G    E+S G DLFQ     Y+EWLN+KPK++++Y+S GS+  LS+ QKEEI
Subjt:  RQFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQ-CTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEI

Query:  ARGLLSTKRPFLWVIRDIEE--EEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGV
        A+GL+  +RPFLWVIRD EE  EE+ LS   +LE QGKIVPWCSQLEVL+ P+ GCF++HCGWNS LESL+ GVP VAFP WTDQ TNAK+++D  +TGV
Subjt:  ARGLLSTKRPFLWVIRDIEE--EEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGV

Query:  RLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVS
        R+   EDGVV+ +EI+RC+++VM   EKGEE+R+NA KWK+LA+ A  EGGSS  N KAFV QVS
Subjt:  RLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVS

Q9ZR25 Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase4.7e-12251.52Show/hide
Query:  VLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPH--LSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEESR--NQG
        VLL T+ AQGHINP LQFAKRL  + D+ VTF TS+ A+RRM +T +  +  ++F SFSDGYDDG +PG+D  ++MSE++ RG +AL + +  +    + 
Subjt:  VLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPH--LSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEESR--NQG

Query:  QPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLP-GLPLLSARDLPSFFGSSDVYNFALPMFRRQF
           + +VYS L  W A VAR  ++ S LLWI+PA V  ++Y+Y NGY DEI       D  S +I LP GLP+L+ RDLPSF   S    F   + + + 
Subjt:  QPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLP-GLPLLSARDLPSFFGSSDVYNFALPMFRRQF

Query:  ELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSS---YIEWLNSKPKATVIYVSSGSISTLSKQQKEEIA
        E LE E  PKVL+N+F+ LE DA++AI K+ ++ IGPLIPSAFLDG  PS+ S G DLF+  S+    +EWL++ P+++V+YVS GS    +K Q EEIA
Subjt:  ELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSS---YIEWLNSKPKATVIYVSSGSISTLSKQQKEEIA

Query:  RGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVRLE
        RGLL   RPFLWV+R  E EE  +S  E+L+  GKIV WCSQLEVL+ P+ GCF+THCGWNS LES++ GVP VAFPQW DQ TNAK+++D   TGVR+ 
Subjt:  RGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVRLE

Query:  ATEDG-VVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV
        A E+G VV G+EI RC++ VM   EK  ++R +A KWK LA++A  E GSS  N K F+D+V
Subjt:  ATEDG-VVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV

Q9ZR27 Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 18.3e-12752.99Show/hide
Query:  RRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSV-----PHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEESR
        RRVLL T+ AQGHINP LQFAKRL + G   VTF TS+ A+RRM  T S      P L F +FSDGYDDG KP  D   +MSE++ RGSEAL+N++  + 
Subjt:  RRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSV-----PHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEESR

Query:  NQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPMFRR
        +     + +VYS L  W A VAR   VPS LLW++PA V  +YY+Y NGY DEI       D  S  I+LP LP L  R LP+F        F L M + 
Subjt:  NQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPMFRR

Query:  QFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCT--SSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEI
        + E L+ E   KVL+NTF+ LE DA+ AI ++ L+ IGPLIPSAFLDG  PSE S G DLF+ +  ++ +EWL++KPK++V+YVS GS+    K Q EEI
Subjt:  QFELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCT--SSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEI

Query:  ARGLLSTKRPFLWVIR-----DIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSE
         +GLL+  RPFLW+IR     D EEEE+ LS   +L+  GKIV WCSQLEVL+ PA GCF+THCGWNS +ESL+CGVP VA PQW DQ TNAK+++D   
Subjt:  ARGLLSTKRPFLWVIR-----DIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSE

Query:  TGVRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVS
        TGVR+   E G V G EIERC+++VM   EK + +R NA+KWK LA+EA  E GSS  N  AF+ QV+
Subjt:  TGVRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQVS

Arabidopsis top hitse value%identityAlignment
AT1G05530.1 UDP-glucosyl transferase 75B21.2e-10445.26Show/hide
Query:  LLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRR--MGQTPSVPHLSFASFSDGYDDG-FKPGNDIDHFMSELEHRGSEALKNIIEESRNQGQP
        LLVT+ AQGH+NP+L+FA+RL +     VTF T LS   R  +    +V +LSF +FSDG+DDG     +D+ + +   E  G +AL + IE ++N   P
Subjt:  LLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRR--MGQTPSVPHLSFASFSDGYDDG-FKPGNDIDHFMSELEHRGSEALKNIIEESRNQGQP

Query:  FSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPMFRRQFELL
         SC++Y+IL  WV  VAR  ++PSV LWIQPA  F +YY Y+ G              N+   + P LP L  RDLPSF   S+    A  +++   + L
Subjt:  FSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPMFRRQFELL

Query:  EEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEIARGLLST
        +EESNPK+L+NTF+ LE + + AI    ++ +GPL+P+    G   SE+         +SSY  WL+SK +++VIYVS G++  LSK+Q EE+AR L+  
Subjt:  EEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEIARGLLST

Query:  KRPFLWVIRD-------IEEEEDG-----LSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSE
         RPFLWVI D       IE EE+        F+ +LE  G IV WCSQ+EVL   A GCFLTHCGW+S LESL  GVP VAFP W+DQ  NAK++++  +
Subjt:  KRPFLWVIRD-------IEEEEDG-----LSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSE

Query:  TGVRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFV
        TGVR+    +G+V+  EI RCL+ VM    K  E+R NA KWK+LA EA  EGGSS  N +AFV
Subjt:  TGVRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFV

AT1G05560.1 UDP-glucosyltransferase 75B11.0e-10344.8Show/hide
Query:  PRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRR--MGQTPSVPHLSFASFSDGYDD-GFKPGNDIDHFMSELEHRGSEALKNIIEESRN
        P   LLVT+ AQGH+NP+L+FA+RL +     VTF+T +S +    +     V +LSF +FSDG+DD G     D       L+  G +AL + IE ++N
Subjt:  PRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRR--MGQTPSVPHLSFASFSDGYDD-GFKPGNDIDHFMSELEHRGSEALKNIIEESRN

Query:  QGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPMFRRQ
           P +C++Y+ILL W   VAR   +PS LLWIQPA+VF +YY +  G              N    +LP L  L  RDLPSF   S+    A   F+  
Subjt:  QGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPMFRRQ

Query:  FELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEIARG
         E L +E+ PK+LINTF+ LE +A+ A     ++ +GPL+P+    G      S    +   +SSY  WL+SK +++VIYVS G++  LSK+Q EE+AR 
Subjt:  FELLEEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEIARG

Query:  LLSTKRPFLWVIRDI---------EEE---EDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQ
        L+  KRPFLWVI D          EEE   E    F+ +LE  G IV WCSQ+EVLS  A GCF+THCGW+S LESL  GVP VAFP W+DQ TNAK+++
Subjt:  LLSTKRPFLWVIRDI---------EEE---EDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQ

Query:  DFSETGVRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV
        +  +TGVR+   +DG+V+  EI RCL+ VM   EK  E+R NA KWK+LA EA  EGGSS  N +AFV+ +
Subjt:  DFSETGVRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQV

AT1G05680.1 Uridine diphosphate glycosyltransferase 74E21.1e-7837.77Show/hide
Query:  VLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPH------LSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEESRN
        ++++ +  QGHI P  QF KRL   G      L S          PS P+      ++    S+G+ +G +P  D+D +M  +E      L  ++E+ + 
Subjt:  VLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPH------LSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEESRN

Query:  QGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPMFRRQ
         G P   IVY   +PW+  VA S  +   + + QP +V A+YY+   G +     V S+   +S     P  P+L+A DLPSF   S  Y   L +   Q
Subjt:  QGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPMFRRQ

Query:  FELLEEESNPKVLINTFEELEKDAVRAIKK-FRLMPIGPLIPSAFLDGHHPSEASPGCDLFQC-TSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEIA
           ++      VL NTF++LE+  ++ ++  + ++ IGP +PS +LD     + + G  LF    +  +EWLNSK   +V+Y+S GS+  L + Q  E+A
Subjt:  FELLEEESNPKVLINTFEELEKDAVRAIKK-FRLMPIGPLIPSAFLDGHHPSEASPGCDLFQC-TSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEIA

Query:  RGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVRLE
         GL  + R FLWV+R+ E  +   ++ E++  +G IV W  QL+VL+  + GCFLTHCGWNS LE L+ GVP +  P WTDQ TNAK +QD  + GVR++
Subjt:  RGLLSTKRPFLWVIRDIEEEEDGLSFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVRLE

Query:  ATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFV
        A  DG V+ EEI R ++ VM + EKG+EIR+NA KWK LA+EA SEGGSS  +   FV
Subjt:  ATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFV

AT4G14090.1 UDP-Glycosyltransferase superfamily protein8.6e-11147.49Show/hide
Query:  LLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE---ESRNQGQP
        LLVT+ AQGHINP LQ A RL  HG   VT+ T++SA+RRMG+ PS   LSFA F+DG+DDG K   D   +MSEL+  GS AL++II+   ++  + +P
Subjt:  LLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIE---ESRNQGQP

Query:  FSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPMFRRQFELL
         + ++YS+L+PWV+TVAR  ++P+ LLWI+PA V  +YYYY N  Y  +  V    +P    IKLP LPL++  DLPSF   S     AL   R   E L
Subjt:  FSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPMFRRQFELL

Query:  EEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCT-SSYIEWLNSKPKATVIYVSSGS-ISTLSKQQKEEIARGLL
        E ESNPK+L+NTF  LE DA+ +++K +++PIGPL+          S +    DLF+ +   Y +WL+SK + +VIY+S G+    L ++  E +  G+L
Subjt:  EEESNPKVLINTFEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCT-SSYIEWLNSKPKATVIYVSSGS-ISTLSKQQKEEIARGLL

Query:  STKRPFLWVIRDIEEEEDGLS-FKEKL--ETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVRLEA
        +T RPFLW++R+   EE   + F E +    +G +V WCSQ  VL+  A GCF+THCGWNS LESL  GVP VAFPQ+ DQ T AK+V+D    GV+++ 
Subjt:  STKRPFLWVIRDIEEEEDGLS-FKEKL--ETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVRLEA

Query:  TEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQ
         E+G V GEEI RCL+ VM   E+ EE+R NA KWK +A +AA+EGG S  N K FVD+
Subjt:  TEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQ

AT4G15550.1 indole-3-acetate beta-D-glucosyltransferase2.9e-11948.46Show/hide
Query:  NNTAPYPRRVLLVTYCAQGHINPTLQFAKRLT-RHGDVHVTFLTSLSAY-RRMGQTPSVPH-LSFASFSDGYDDGFKPGNDID--------HFMSELEHR
        N+ +P     L VT+ AQGHINP+L+ AKRL        VTF  S+SAY RRM  T +VP  L FA++SDG+DDGFK     D        +FMSE+  R
Subjt:  NNTAPYPRRVLLVTYCAQGHINPTLQFAKRLT-RHGDVHVTFLTSLSAY-RRMGQTPSVPH-LSFASFSDGYDDGFKPGNDID--------HFMSELEHR

Query:  GSEALKNIIEESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGS
        G E L  +IE++R Q +PF+C+VY+ILL WVA +AR  ++PS LLW+QP  VF+++Y+Y NGY D I  + ++    S SIKLP LPLL+ RD+PSF  S
Subjt:  GSEALKNIIEESRNQGQPFSCIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGS

Query:  SDVYNFALPMFRRQFELLEEESNPKVLINTFEELEKDAVRAI-KKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGS
        S+VY F LP FR Q + L+EE NPK+LINTF+ELE +A+ ++   F+++P+GPL+                   F     YIEWL++K  ++V+YVS G+
Subjt:  SDVYNFALPMFRRQFELLEEESNPKVLINTFEELEKDAVRAI-KKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGS

Query:  ISTLSKQQKEEIARGLLSTKRPFLWVIRD---------IEEEEDGL-SFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFP
        ++ LSK+Q  E+ + L+ ++RPFLWVI D          E+EED + SF+E+L+  G +V WC Q  VL+  + GCF+THCGWNS LESL  GVP VAFP
Subjt:  ISTLSKQQKEEIARGLLSTKRPFLWVIRD---------IEEEEDGL-SFKEKLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFP

Query:  QWTDQATNAKIVQDFSETGVRL--EATEDG--VVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQ
        QW DQ  NAK+++D  +TGVR+  +  E+G  VV  EEI RC++ VM D  K EE R NA +WK LA EA  EGGSS  + KAFVD+
Subjt:  QWTDQATNAKIVQDFSETGVRL--EATEDG--VVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKKLAKEAASEGGSSHANFKAFVDQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAACACAGCACCCTATCCCCGTCGTGTCCTATTAGTAACATATTGCGCTCAAGGGCACATTAACCCTACCCTCCAATTCGCCAAACGTCTCACCCGCCACGGTGA
CGTTCACGTCACCTTCCTCACTTCTCTCTCTGCTTACCGCCGCATGGGTCAAACCCCCTCGGTCCCGCATCTCTCGTTCGCCTCGTTCTCCGACGGCTACGACGACGGTT
TCAAACCTGGCAACGATATTGATCATTTCATGTCAGAGCTCGAGCATCGTGGATCTGAAGCTCTGAAGAATATAATCGAAGAGAGTAGAAACCAAGGTCAACCCTTCAGT
TGTATTGTATATTCCATACTCCTCCCTTGGGTCGCTACGGTGGCGCGTTCACTCAATGTTCCGTCGGTTCTTCTATGGATTCAACCGGCGGTTGTTTTCGCTTTGTATTA
CTATTACAACAATGGATATTACGATGAAATTCAAAGGGTTATTTCTAGTGATGATCCTAATTCCATGAGTATTAAATTACCTGGGCTTCCATTGTTGAGTGCTCGTGACC
TTCCATCATTTTTTGGTTCTTCAGATGTTTATAATTTTGCACTCCCAATGTTTAGGAGGCAATTTGAATTACTAGAGGAAGAGAGTAATCCAAAGGTTTTAATCAACACG
TTTGAGGAATTGGAGAAGGATGCAGTGAGAGCGATTAAGAAATTTCGTTTGATGCCTATTGGACCATTGATTCCATCTGCTTTTCTTGACGGACATCACCCATCAGAAGC
CTCTCCTGGATGTGATCTATTTCAATGTACTAGTAGTTATATCGAGTGGTTGAACTCGAAACCTAAAGCAACCGTCATTTACGTATCATCGGGAAGCATTTCGACATTAT
CAAAGCAACAAAAAGAGGAGATAGCAAGAGGATTATTAAGCACAAAACGACCATTTTTGTGGGTTATCCGAGACATTGAAGAAGAAGAAGATGGATTAAGCTTCAAAGAG
AAACTAGAAACTCAAGGAAAGATAGTTCCATGGTGTTCCCAACTTGAGGTTCTATCAAGCCCAGCCACAGGCTGTTTTCTCACACACTGTGGTTGGAATTCTTGTTTGGA
GAGTTTGGCTTGCGGCGTTCCGACCGTGGCATTTCCGCAATGGACCGATCAAGCGACCAACGCCAAGATCGTTCAGGACTTTTCGGAGACCGGAGTGAGGTTAGAGGCGA
CAGAGGACGGCGTGGTTAAGGGAGAAGAGATAGAAAGGTGCTTGAAGTTGGTTATGGGAGATTCAGAGAAAGGGGAAGAAATAAGGAGGAATGCTGTGAAATGGAAGAAG
TTGGCTAAGGAAGCTGCTAGTGAGGGTGGTTCATCACATGCCAACTTCAAGGCTTTTGTGGATCAAGTTTCTTCTTAA
mRNA sequenceShow/hide mRNA sequence
AGCAAATGTCATTCATAAACCTTCCAAATCTTCATCTTTCCTATAAAACCAACCTCACCCACCTTACATTTCTTATCCTATCAACATGAACAACACAGCACCCTATCCCC
GTCGTGTCCTATTAGTAACATATTGCGCTCAAGGGCACATTAACCCTACCCTCCAATTCGCCAAACGTCTCACCCGCCACGGTGACGTTCACGTCACCTTCCTCACTTCT
CTCTCTGCTTACCGCCGCATGGGTCAAACCCCCTCGGTCCCGCATCTCTCGTTCGCCTCGTTCTCCGACGGCTACGACGACGGTTTCAAACCTGGCAACGATATTGATCA
TTTCATGTCAGAGCTCGAGCATCGTGGATCTGAAGCTCTGAAGAATATAATCGAAGAGAGTAGAAACCAAGGTCAACCCTTCAGTTGTATTGTATATTCCATACTCCTCC
CTTGGGTCGCTACGGTGGCGCGTTCACTCAATGTTCCGTCGGTTCTTCTATGGATTCAACCGGCGGTTGTTTTCGCTTTGTATTACTATTACAACAATGGATATTACGAT
GAAATTCAAAGGGTTATTTCTAGTGATGATCCTAATTCCATGAGTATTAAATTACCTGGGCTTCCATTGTTGAGTGCTCGTGACCTTCCATCATTTTTTGGTTCTTCAGA
TGTTTATAATTTTGCACTCCCAATGTTTAGGAGGCAATTTGAATTACTAGAGGAAGAGAGTAATCCAAAGGTTTTAATCAACACGTTTGAGGAATTGGAGAAGGATGCAG
TGAGAGCGATTAAGAAATTTCGTTTGATGCCTATTGGACCATTGATTCCATCTGCTTTTCTTGACGGACATCACCCATCAGAAGCCTCTCCTGGATGTGATCTATTTCAA
TGTACTAGTAGTTATATCGAGTGGTTGAACTCGAAACCTAAAGCAACCGTCATTTACGTATCATCGGGAAGCATTTCGACATTATCAAAGCAACAAAAAGAGGAGATAGC
AAGAGGATTATTAAGCACAAAACGACCATTTTTGTGGGTTATCCGAGACATTGAAGAAGAAGAAGATGGATTAAGCTTCAAAGAGAAACTAGAAACTCAAGGAAAGATAG
TTCCATGGTGTTCCCAACTTGAGGTTCTATCAAGCCCAGCCACAGGCTGTTTTCTCACACACTGTGGTTGGAATTCTTGTTTGGAGAGTTTGGCTTGCGGCGTTCCGACC
GTGGCATTTCCGCAATGGACCGATCAAGCGACCAACGCCAAGATCGTTCAGGACTTTTCGGAGACCGGAGTGAGGTTAGAGGCGACAGAGGACGGCGTGGTTAAGGGAGA
AGAGATAGAAAGGTGCTTGAAGTTGGTTATGGGAGATTCAGAGAAAGGGGAAGAAATAAGGAGGAATGCTGTGAAATGGAAGAAGTTGGCTAAGGAAGCTGCTAGTGAGG
GTGGTTCATCACATGCCAACTTCAAGGCTTTTGTGGATCAAGTTTCTTCTTAAATTAAGGTGCAACCCATCATAGAATTCGAGAAAATTTCTAATACTACGGTTATCTTC
GTTTCAAAGTAGCGTGGGATTCTATTACATTAATGTCGAATCTGCTAATATTTATGGAGACGAGTTTCCCGTGTGAACAAAGAGAATATAGTATTCCATGTACCCATTTT
AACCATCCAGCGGAGAAACTAAAAAGCATATTAAACAAAAGAAAAAAAAGTCTGTCACGTGAGGCGTGAAAAATTATGATTGGACAAATAGATGTGTATTACTAAGAGAT
TAGTACAATTCAAGATGTACTTAAAGTATTTTTCAATTGTTGTTAAAAATTGAAAACAAAAGTTTTACTAAAAAAAACAATATTGAATTGAAACATTACCAAA
Protein sequenceShow/hide protein sequence
MNNTAPYPRRVLLVTYCAQGHINPTLQFAKRLTRHGDVHVTFLTSLSAYRRMGQTPSVPHLSFASFSDGYDDGFKPGNDIDHFMSELEHRGSEALKNIIEESRNQGQPFS
CIVYSILLPWVATVARSLNVPSVLLWIQPAVVFALYYYYNNGYYDEIQRVISSDDPNSMSIKLPGLPLLSARDLPSFFGSSDVYNFALPMFRRQFELLEEESNPKVLINT
FEELEKDAVRAIKKFRLMPIGPLIPSAFLDGHHPSEASPGCDLFQCTSSYIEWLNSKPKATVIYVSSGSISTLSKQQKEEIARGLLSTKRPFLWVIRDIEEEEDGLSFKE
KLETQGKIVPWCSQLEVLSSPATGCFLTHCGWNSCLESLACGVPTVAFPQWTDQATNAKIVQDFSETGVRLEATEDGVVKGEEIERCLKLVMGDSEKGEEIRRNAVKWKK
LAKEAASEGGSSHANFKAFVDQVSS