; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G08930 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G08930
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGlycosyltransferase
Genome locationChr6:7480903..7483168
RNA-Seq ExpressionCSPI06G08930
SyntenyCSPI06G08930
Gene Ontology termsGO:0080043 - quercetin 3-O-glucosyltransferase activity (molecular function)
GO:0080044 - quercetin 7-O-glucosyltransferase activity (molecular function)
InterPro domainsIPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140483.1 UDP-glycosyltransferase 75C1 [Cucumis sativus]4.6e-26999.79Show/hide
Query:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI
        MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI
Subjt:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI

Query:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFAL
        IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFAL
Subjt:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFAL

Query:  PMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK
        PMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK
Subjt:  PMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK

Query:  EEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETG
        EEIARGLS+TKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETG
Subjt:  EEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETG

Query:  VRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS
        VRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS
Subjt:  VRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS

XP_008458144.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis melo]2.9e-23187.63Show/hide
Query:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI
        MNNTTP PNPR VLL+T+ AQGHINPTLQLAKRL RHGDLHVTFL SLSAYRRMG TPTLPH++FASFSDGYDDGFKP DDI  Y+SELER GSDALKNI
Subjt:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI

Query:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDP-SSTSIKLPGLPLLSARDLPSFFGASDGYSFA
        IQESRN+GQPFTCIVYSIL+PWVATVARSLDVASV LWIQPAVVFALYYYY NGYYDEIQRI SGDDP SS SIKLPGLPLLSARDLPSFFG SD Y+FA
Subjt:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDP-SSTSIKLPGLPLLSARDLPSFFGASDGYSFA

Query:  LPMFRKQFELL-EEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQ
        L +FRKQFELL EEESNP ILINTFEELEKDAVKAIKKFHLMPIGPLIPSV  DG DPSEASSGCDL+RSTSSY++WLNSKPKASVVYVS GSI+ +S Q
Subjt:  LPMFRKQFELL-EEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQ

Query:  QKEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE
        QKEE+ARGL  TKRPFLWVIR+ E EED LSFKEKLETQGKIV WC+QLEVLSSPATGCFLTHCGWNSCLESLACGVP VAFPQWSDQATNSKII+DLSE
Subjt:  QKEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE

Query:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS
        TGVRLE  E+GVVKGEEIERCL LVMGDSKKGE+IRRNALKWKKLAKEAASEGGSSFAN KAFVD VCS
Subjt:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS

XP_023000593.1 crocetin glucosyltransferase, chloroplastic-like [Cucurbita maxima]1.5e-17970.24Show/hide
Query:  MNNTTPNPNPRH-VLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKN
        M+NT P+   RH +LL+T+ AQGHINP L+ AKRLTR   + VTF+ SLSAYRR+G TP LPH++FASFSDGYDDGFK  DDI  ++SELERRGS A+K+
Subjt:  MNNTTPNPNPRH-VLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKN

Query:  IIQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFA
        +I     +GQPFTCIVYSIL+PWVATVARSL + +V LWIQPA+VFALYYYYN GY+D IQ +   DDP +T I+LPGLPLL+ARDLPSFFG+SD Y FA
Subjt:  IIQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFA

Query:  LPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQ
        LP+FR+QFELLE+E+NP I+INTF+ELE +A++AI KFHL+PIGPLI         PSEASS CDLF+ST+SY++WLNSKPK SV+YVS GSIST+SK Q
Subjt:  LPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQ

Query:  KEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSET
         EEIARGL    RPFLWVIR+I EE + LS +E+LE  GKIVSWC+Q+EVLS PATGCFLTHCGWNS LESL CGVP V FPQWSDQ TN+KII+D+SET
Subjt:  KEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSET

Query:  GVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC
        GVRLEV  +GVVK EEI+RCLELVMGDSKKGEEIRRN +KWK+LAK A + GGSS++N KAFVD VC
Subjt:  GVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC

XP_023513863.1 crocetin glucosyltransferase, chloroplastic-like [Cucurbita pepo subsp. pepo]1.0e-18070.45Show/hide
Query:  MNNTTPNPNPRH-VLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKN
        M+NT P+   RH VLL+T+ AQGHINP L+ AKRLTR   + VTF+ SLSAYRRMG TPTLPH++FASFSDGYDDGFK  DDI  ++SELERRGS A+K+
Subjt:  MNNTTPNPNPRH-VLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKN

Query:  IIQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFA
        +I     +GQPFTCIVYSIL+PWVA VARSL + ++ LWIQPA+VFALYYYYN GY+D IQ +   DDP +T I+LPGLPLL+ARDLPSFFG+SD Y FA
Subjt:  IIQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFA

Query:  LPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQ
        LP+FR+QFELLE+E+NP ++INTF+ELE DA++AI KFHL+PIGPLI         PSEASS CDLF+ST+SY++WLNSKPK SV+YVS GSIST+SK Q
Subjt:  LPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQ

Query:  KEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSET
        KEEIARGL    RPFLWVIR+I EE + LS +E+LE  GKIVSWC+Q+EVLS PATGCFLTHCGWNS LESL CGVP V FPQWSDQ TN+KII+D+SET
Subjt:  KEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSET

Query:  GVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC
        GVRLEV  +GVVK EEI+RCLELVMGDSKKGEEIR+N +KWK+LAK A + GGSS++N KAFVD VC
Subjt:  GVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC

XP_038876006.1 phloretin 4'-O-glucosyltransferase-like [Benincasa hispida]1.3e-22884.27Show/hide
Query:  TPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQES
        T  P+P  VLLVT+CAQGHINPTLQ A+RLTRHGD+HVTFL SLSAYRR+G TPTLPH++F SFSDGYDDGFKP DD+  ++SELER GS+ALKNII+ES
Subjt:  TPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQES

Query:  RNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFALPMFR
        RNKGQPFTCIVYSIL+PWVATVARSLD+ SV LWIQPAVVFALYYYYNNGYYDEIQRI S DDP+S SIKLPGLPLLSARDLPSFFG+SD Y FALP+FR
Subjt:  RNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFALPMFR

Query:  KQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIA
        +QFELLEEESNPK+LINTFEELEKDAV+AIKKFHLMPIGPLIPS  +DG+DPSE SSGCDLFRSTSSY++WLNSKPKASV+YVS GSIST+SKQQKEEIA
Subjt:  KQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIA

Query:  RGLSITKRPFLWVIRNIEEE-EDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRL
        RGL  TKRPFLWVIR+IEEE ED LSFKEKLETQGKIVSWC+QLEVLSSPATGCFLTHCGWNSCLESLACG+P V  PQW+DQATN+KI++DLSETGVRL
Subjt:  RGLSITKRPFLWVIRNIEEE-EDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRL

Query:  EVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS
        +V E+GVVKGEEIERCLELVMG+S+KGEEIRRNA+KWKKLA+EA SEGGSS ANLKAFVD VCS
Subjt:  EVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS

TrEMBL top hitse value%identityAlignment
A0A0A0KA46 UDP-glucose:flavonoid 7-O-glucosyltransferase2.2e-26999.79Show/hide
Query:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI
        MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI
Subjt:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI

Query:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFAL
        IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFAL
Subjt:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFAL

Query:  PMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK
        PMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK
Subjt:  PMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK

Query:  EEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETG
        EEIARGLS+TKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETG
Subjt:  EEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETG

Query:  VRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS
        VRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS
Subjt:  VRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS

A0A1S3C8E8 Glycosyltransferase1.4e-23187.63Show/hide
Query:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI
        MNNTTP PNPR VLL+T+ AQGHINPTLQLAKRL RHGDLHVTFL SLSAYRRMG TPTLPH++FASFSDGYDDGFKP DDI  Y+SELER GSDALKNI
Subjt:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI

Query:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDP-SSTSIKLPGLPLLSARDLPSFFGASDGYSFA
        IQESRN+GQPFTCIVYSIL+PWVATVARSLDVASV LWIQPAVVFALYYYY NGYYDEIQRI SGDDP SS SIKLPGLPLLSARDLPSFFG SD Y+FA
Subjt:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDP-SSTSIKLPGLPLLSARDLPSFFGASDGYSFA

Query:  LPMFRKQFELL-EEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQ
        L +FRKQFELL EEESNP ILINTFEELEKDAVKAIKKFHLMPIGPLIPSV  DG DPSEASSGCDL+RSTSSY++WLNSKPKASVVYVS GSI+ +S Q
Subjt:  LPMFRKQFELL-EEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQ

Query:  QKEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE
        QKEE+ARGL  TKRPFLWVIR+ E EED LSFKEKLETQGKIV WC+QLEVLSSPATGCFLTHCGWNSCLESLACGVP VAFPQWSDQATNSKII+DLSE
Subjt:  QKEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE

Query:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS
        TGVRLE  E+GVVKGEEIERCL LVMGDSKKGE+IRRNALKWKKLAKEAASEGGSSFAN KAFVD VCS
Subjt:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS

A0A5D3CQ80 Glycosyltransferase1.4e-23187.63Show/hide
Query:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI
        MNNTTP PNPR VLL+T+ AQGHINPTLQLAKRL RHGDLHVTFL SLSAYRRMG TPTLPH++FASFSDGYDDGFKP DDI  Y+SELER GSDALKNI
Subjt:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI

Query:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDP-SSTSIKLPGLPLLSARDLPSFFGASDGYSFA
        IQESRN+GQPFTCIVYSIL+PWVATVARSLDVASV LWIQPAVVFALYYYY NGYYDEIQRI SGDDP SS SIKLPGLPLLSARDLPSFFG SD Y+FA
Subjt:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDP-SSTSIKLPGLPLLSARDLPSFFGASDGYSFA

Query:  LPMFRKQFELL-EEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQ
        L +FRKQFELL EEESNP ILINTFEELEKDAVKAIKKFHLMPIGPLIPSV  DG DPSEASSGCDL+RSTSSY++WLNSKPKASVVYVS GSI+ +S Q
Subjt:  LPMFRKQFELL-EEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQ

Query:  QKEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE
        QKEE+ARGL  TKRPFLWVIR+ E EED LSFKEKLETQGKIV WC+QLEVLSSPATGCFLTHCGWNSCLESLACGVP VAFPQWSDQATNSKII+DLSE
Subjt:  QKEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE

Query:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS
        TGVRLE  E+GVVKGEEIERCL LVMGDSKKGE+IRRNALKWKKLAKEAASEGGSSFAN KAFVD VCS
Subjt:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS

A0A6J1HJP1 Glycosyltransferase6.2e-17969.31Show/hide
Query:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI
        M+NT P+ +   VLL+T+ AQGHINP L+ AKRLTR   + VTF+ SLSAYRRMG TPTLPH++FASFSDGYDDGFK  DDI  ++SELERRGS A+K++
Subjt:  MNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNI

Query:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFAL
        I     +GQPFTCIVYSIL+PWVATVARSL + ++ LWIQPA+VFALYYYYN GY+D IQ  ++  DP +T I+LPGLPLL+ARDLPSFFG+SD Y FAL
Subjt:  IQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFAL

Query:  PMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK
        P+FR+QFELLE+E+NP ++INTF+ELE DA++AI KF+L+P+GPLI         PSEASS CDLF+ST+SY++WLNSKPK SV+Y+S GS+ST+SK QK
Subjt:  PMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQK

Query:  EEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETG
        EEIARGL    RPFLWVIR+I EE + LS +E+LE  GKIV WC+Q+EVLS PATGCFLTHCGWNS LESL CGVP V FPQWSDQ TN+KII+D+SETG
Subjt:  EEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETG

Query:  VRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC
        VRLEV  +GVVK EEI+RCLELVMGDSKKGEEIRRN +KWK+LAK A + GGSS++N KAFVD VC
Subjt:  VRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC

A0A6J1KN24 Glycosyltransferase7.3e-18070.24Show/hide
Query:  MNNTTPNPNPRH-VLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKN
        M+NT P+   RH +LL+T+ AQGHINP L+ AKRLTR   + VTF+ SLSAYRR+G TP LPH++FASFSDGYDDGFK  DDI  ++SELERRGS A+K+
Subjt:  MNNTTPNPNPRH-VLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKN

Query:  IIQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFA
        +I     +GQPFTCIVYSIL+PWVATVARSL + +V LWIQPA+VFALYYYYN GY+D IQ +   DDP +T I+LPGLPLL+ARDLPSFFG+SD Y FA
Subjt:  IIQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFA

Query:  LPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQ
        LP+FR+QFELLE+E+NP I+INTF+ELE +A++AI KFHL+PIGPLI         PSEASS CDLF+ST+SY++WLNSKPK SV+YVS GSIST+SK Q
Subjt:  LPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQ

Query:  KEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSET
         EEIARGL    RPFLWVIR+I EE + LS +E+LE  GKIVSWC+Q+EVLS PATGCFLTHCGWNS LESL CGVP V FPQWSDQ TN+KII+D+SET
Subjt:  KEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSET

Query:  GVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC
        GVRLEV  +GVVK EEI+RCLELVMGDSKKGEEIRRN +KWK+LAK A + GGSS++N KAFVD VC
Subjt:  GVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC

SwissProt top hitse value%identityAlignment
A7MAS5 Phloretin 4'-O-glucosyltransferase8.7e-14656.75Show/hide
Query:  LLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTC
        LLVT  AQGHINP+LQ AKRL      HVT++ SLSA+RR+G+      +T+A FSDGYDDGFKP D++  Y+SEL RRG  A+ +++  S N+G P+TC
Subjt:  LLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTC

Query:  IVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQ-RIASG-DDPSSTSIKLPGLPL-LSARDLPSFFGASDGYSFALPMFRKQFELL
        +VYS+L+PW A +A  L + SV LWIQPA VF +YYYY NGY D I+   +SG ++    SI+LPGLPL  ++RDLPSF   ++ Y+FALP+F++Q ELL
Subjt:  IVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQ-RIASG-DDPSSTSIKLPGLPL-LSARDLPSFFGASDGYSFALPMFRKQFELL

Query:  EEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRST--SSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARGLS
        E E+NP IL+NTF+ LE +A+KAI K++L+ +GPLIPS  +DG DPS+ S G DLF+ +  SSY+EWLNSKP+ SV+YVS GSIS + K Q EEIA+GL 
Subjt:  EEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRST--SSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARGLS

Query:  ITKRPFLWVIRN----------IEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE
            PFLWVIR+           ++EE+ L  +E+LE  G IV WC+Q+EVLSSP+ GCF+THCGWNS LESL  GVP VAFPQW+DQ TN+K+IED  +
Subjt:  ITKRPFLWVIRN----------IEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE

Query:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV
        TGVR+   EEG+V GEE++RCL+LV+G  + GE++RRNA KWK LA+EA SEG SS  NL+AF+D +
Subjt:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV

F8WKW0 Crocetin glucosyltransferase, chloroplastic2.7e-13955.48Show/hide
Query:  RHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRM----GHTPTLPHITFASFSDGYDDGFKPSD-DIKLYISELERRGSDALKNIIQESR
        RHVLL+T+ AQGHINP LQ A+RL R G + VT   S+ A  RM    G TP    +TFA+FSDGYDDGF+P   D   Y+S L ++GS+ L+N+I  S 
Subjt:  RHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRM----GHTPTLPHITFASFSDGYDDGFKPSD-DIKLYISELERRGSDALKNIIQESR

Query:  NKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFF--GASDGYSFALPMF
        ++G P TC+VY++L+PW ATVAR   + S  LWIQP  V  +YYYY  GY D+++   + +DP + SI+ PGLP + A+DLPSF    + + YSFALP F
Subjt:  NKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFF--GASDGYSFALPMF

Query:  RKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEI
        +KQ E L+EE  PK+L+NTF+ LE  A+KAI+ ++L+ IGPL PS  +DG DPSE S   DLF+ +  Y EWLNS+P  SVVYVS GS+ T+ KQQ EEI
Subjt:  RKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEI

Query:  ARGLSITKRPFLWVIR-----NIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE
        ARGL  + RPFLWVIR       E+EED L   E+LE QG IV WC+Q+EVL+ P+ GCF+THCGWNS LE+L CGVP VAFP W+DQ TN+K+IED+ E
Subjt:  ARGLSITKRPFLWVIR-----NIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE

Query:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVD
        TGVR+   E+G V+ +EI+RC+E VM D +KG E++RNA KWK+LA+EA  E GSS  NLKAFV+
Subjt:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVD

K4CWS6 UDP-glycosyltransferase 75C13.2e-14056.71Show/hide
Query:  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGH--TPTLPH-ITFASFSDGYDDGFKPS-DDIKLYISELERRGSDALKNIIQESRNK
        HVLLVT  AQGHINP+LQ AKRL   G + VTF  S+ A+RRM      T P  +  A+FSDG+DDGFK + DD K Y+SE+  RGS  L+++I +S ++
Subjt:  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGH--TPTLPH-ITFASFSDGYDDGFKPS-DDIKLYISELERRGSDALKNIIQESRNK

Query:  GQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGAS----DGYSFALPMF
        G+P T +VY++L+PW A VAR L + S  LWIQPA V  +YYYY NGY DE+ + +S +DP + SI+LP LPLL ++DLPSF  +S    D YSFALP F
Subjt:  GQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGAS----DGYSFALPMF

Query:  RKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLF-RSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEE
        ++Q + L+ E NPK+L+NTF+ LE + +KAI+K++L+ IGPLIPS  + G D  E+S G DLF +S   YMEWLN+KPK+S+VY+S GS+  +S+ QKEE
Subjt:  RKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLF-RSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEE

Query:  IARGLSITKRPFLWVIRNIEE--EEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETG
        IA+GL   +RPFLWVIR+ EE  EE+ LS   +LE QGKIV WC+QLEVL+ P+ GCF++HCGWNS LESL+ GVP VAFP W+DQ TN+K+IED+ +TG
Subjt:  IARGLSITKRPFLWVIRNIEE--EEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETG

Query:  VRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFV
        VR+ V E+GVV+ +EI+RC+E+VM   +KGEE+R+NA KWK+LA+ A  EGGSS  NLKAFV
Subjt:  VRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFV

Q9ZR25 Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase2.5e-12151.84Show/hide
Query:  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPH--ITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESR--NK
        HVLL T  AQGHINP LQ AKRL  + D+ VTF  S+ A+RRM  T    +  I F SFSDGYDDG +P DD K Y+SE++ RG  AL + +  +    K
Subjt:  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPH--ITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESR--NK

Query:  GQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLP-GLPLLSARDLPSFFGASDGYSFALPMFRKQ
            T +VYS L  W A VAR   + S  LWI+PA V  ++Y+Y NGY DEI       D  S +I LP GLP+L+ RDLPSF   S    F   + +++
Subjt:  GQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLP-GLPLLSARDLPSFFGASDGYSFALPMFRKQ

Query:  FELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSS---YMEWLNSKPKASVVYVSMGSISTVSKQQKEEI
         E LE E  PK+L+N+F+ LE DA+KAI K+ ++ IGPLIPS  +DG DPS+ S G DLF   S+    +EWL++ P++SVVYVS GS    +K Q EEI
Subjt:  FELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSS---YMEWLNSKPKASVVYVSMGSISTVSKQQKEEI

Query:  ARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRL
        ARGL    RPFLWV+R  E EE  +S  E+L+  GKIVSWC+QLEVL+ P+ GCF+THCGWNS LES++ GVP VAFPQW DQ TN+K++ED+  TGVR+
Subjt:  ARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRL

Query:  EVEEEG-VVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV
           EEG VV G+EI RC+E VM   +K  ++R +A KWK LA++A  E GSS  NLK F+D V
Subjt:  EVEEEG-VVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV

Q9ZR27 Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 13.3e-12151.39Show/hide
Query:  RHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTL-----PHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESR
        R VLL T  AQGHINP LQ AKRL + G   VTF  S+ A+RRM +T +      P + F +FSDGYDDG KP  D K Y+SE++ RGS+AL+N++  + 
Subjt:  RHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTL-----PHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESR

Query:  NKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFALPMFRK
        +     T +VYS L  W A VAR   V S  LW++PA V  +YY+Y NGY DEI       D  S  I+LP LP L  R LP+F        F L M ++
Subjt:  NKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFALPMFRK

Query:  QFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRST--SSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEI
        + E L+ E   K+L+NTF+ LE DA+ AI ++ L+ IGPLIPS  +DG DPSE S G DLF  +  ++ +EWL++KPK+SVVYVS GS+    K Q EEI
Subjt:  QFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRST--SSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEI

Query:  ARGLSITKRPFLWVIR-----NIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE
         +GL    RPFLW+IR     + EEEE+ LS   +L+  GKIVSWC+QLEVL+ PA GCF+THCGWNS +ESL+CGVP VA PQW DQ TN+K+IED   
Subjt:  ARGLSITKRPFLWVIR-----NIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSE

Query:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV
        TGVR+ + E G V G EIERC+E+VM   +K + +R NA+KWK LA+EA  E GSS  NL AF+  V
Subjt:  TGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHV

Arabidopsis top hitse value%identityAlignment
AT1G05530.1 UDP-glucosyl transferase 75B23.2e-10344.87Show/hide
Query:  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRR--MGHTPTLPHITFASFSDGYDDG-FKPSDDIKLYISELERRGSDALKNIIQESRNKG
        H LLVT  AQGH+NP+L+ A+RL +     VTF   LS   R  + +   + +++F +FSDG+DDG    +DD++  +   ER G  AL + I+ ++N  
Subjt:  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRR--MGHTPTLPHITFASFSDGYDDG-FKPSDDIKLYISELERRGSDALKNIIQESRNKG

Query:  QPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFALPMFRKQFE
         P +C++Y+IL  WV  VAR   + SVHLWIQPA  F +YY Y+ G              +++  + P LP L  RDLPSF   S+    A  ++++  +
Subjt:  QPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFALPMFRKQFE

Query:  LLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFR--STSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARG
         L+EESNPKIL+NTF+ LE + + AI    ++ +GPL+P+ +  G++     SG DL R   +SSY  WL+SK ++SV+YVS G++  +SK+Q EE+AR 
Subjt:  LLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFR--STSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARG

Query:  LSITKRPFLWVIRN-------IEEEED-----FLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIE
        L    RPFLWVI +       IE EE+        F+ +LE  G IVSWC+Q+EVL   A GCFLTHCGW+S LESL  GVP VAFP WSDQ  N+K++E
Subjt:  LSITKRPFLWVIRN-------IEEEED-----FLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIE

Query:  DLSETGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFV
        ++ +TGVR+    EG+V+  EI RCLE VM    K  E+R NA KWK+LA EA  EGGSS  N++AFV
Subjt:  DLSETGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFV

AT1G05560.1 UDP-glucosyltransferase 75B11.6e-9943.01Show/hide
Query:  PRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRR--MGHTPTLPHITFASFSDGYDD-GFKPSDDIKLYISELERRGSDALKNIIQESRN
        P H LLVT  AQGH+NP+L+ A+RL +     VTF+  +S +    + +   + +++F +FSDG+DD G    +D +     L+  G  AL + I+ ++N
Subjt:  PRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRR--MGHTPTLPHITFASFSDGYDD-GFKPSDDIKLYISELERRGSDALKNIIQESRN

Query:  KGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFALPMFRKQ
           P TC++Y+IL+ W   VAR   + S  LWIQPA+VF +YY +  G              + +  +LP L  L  RDLPSF   S+    A   F++ 
Subjt:  KGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFALPMFRKQ

Query:  FELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARG
         E L +E+ PKILINTF+ LE +A+ A     ++ +GPL+P+ +  G      S+   +   +SSY  WL+SK ++SV+YVS G++  +SK+Q EE+AR 
Subjt:  FELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEIARG

Query:  LSITKRPFLWVIRNI---------EEE---EDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIE
        L   KRPFLWVI +          EEE   E    F+ +LE  G IVSWC+Q+EVLS  A GCF+THCGW+S LESL  GVP VAFP WSDQ TN+K++E
Subjt:  LSITKRPFLWVIRNI---------EEE---EDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIE

Query:  DLSETGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC
        +  +TGVR+   ++G+V+  EI RCLE VM   +K  E+R NA KWK+LA EA  EGGSS  N++AFV+ +C
Subjt:  DLSETGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC

AT1G05675.1 UDP-Glycosyltransferase superfamily protein9.4e-7937.58Show/hide
Query:  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPH------ITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESR
        HV+++   AQGHI P  Q  KRL     L +T ++       +   P+ P+      IT    S+G+ +G + S+D+  Y+  +E    + L  +I++ +
Subjt:  HVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPH------ITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKNIIQESR

Query:  NKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFALPMFRK
          G P   +VY   +PW+  VA S  ++    + QP +V A+YY+   G +     + S     ST    P LP+L+A DLPSF   S  Y + L     
Subjt:  NKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGYSFALPMFRK

Query:  QFELLEEESNPKILINTFEELEKDAVKAIKK-FHLMPIGPLIPSVLVDGNDPSEASSGCDLF-RSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEI
        Q   ++      +L NTF++LE+  +K IK  + ++ IGP +PS+ +D     + + G  LF    +  MEWLNSK  +SVVYVS GS+  + K Q  E+
Subjt:  QFELLEEESNPKILINTFEELEKDAVKAIKK-FHLMPIGPLIPSVLVDGNDPSEASSGCDLF-RSTSSYMEWLNSKPKASVVYVSMGSISTVSKQQKEEI

Query:  ARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRL
        A GL  +   FLWV+R  E  +   ++ E++  +G  VSW  QLEVL+  + GCF+THCGWNS LE L+ GVP +  P W+DQ TN+K +ED+ + GVR+
Subjt:  ARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRL

Query:  EVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC
        + + +G V+ EE  R +E VM ++++G+EIR+NA KWK LA+EA SEGGSS  N+  FV   C
Subjt:  EVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVC

AT4G14090.1 UDP-Glycosyltransferase superfamily protein4.6e-11046.4Show/hide
Query:  SMNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKN
        S+N +   P   H LLVT  AQGHINP LQLA RL  HG   VT+  ++SA+RRMG  P+   ++FA F+DG+DDG K  +D K+Y+SEL+R GS+AL++
Subjt:  SMNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDDGFKPSDDIKLYISELERRGSDALKN

Query:  IIQ---ESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGY
        II+   ++  + +P T ++YS+L+PWV+TVAR   + +  LWI+PA V  +YYYY N  Y  +  +          IKLP LPL++  DLPSF   S   
Subjt:  IIQ---ESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFFGASDGY

Query:  SFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRST-SSYMEWLNSKPKASVVYVSMGS-IST
          AL   R+  E LE ESNPKIL+NTF  LE DA+ +++K  ++PIGPL+          S +    DLF+S+   Y +WL+SK + SV+Y+S+G+    
Subjt:  SFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRST-SSYMEWLNSKPKASVVYVSMGS-IST

Query:  VSKQQKEEIARGLSITKRPFLWVIRNIEEEEDFLS-FKEKL--ETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSK
        + ++  E +  G+  T RPFLW++R    EE   + F E +    +G +V WC+Q  VL+  A GCF+THCGWNS LESL  GVP VAFPQ++DQ T +K
Subjt:  VSKQQKEEIARGLSITKRPFLWVIRNIEEEEDFLS-FKEKL--ETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSK

Query:  IIEDLSETGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVD
        ++ED    GV+++V EEG V GEEI RCLE VM   ++ EE+R NA KWK +A +AA+EGG S  NLK FVD
Subjt:  IIEDLSETGVRLEVEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVD

AT4G15550.1 indole-3-acetate beta-D-glucosyltransferase1.6e-11848.77Show/hide
Query:  NNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLT-RHGDLHVTFLISLSAY-RRMGHTPTLPH-ITFASFSDGYDDGFKPS--------DDIKLYISELE
        NN + +P   H L VT  AQGHINP+L+LAKRL        VTF  S+SAY RRM  T  +P  + FA++SDG+DDGFK S        D    ++SE+ 
Subjt:  NNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLT-RHGDLHVTFLISLSAY-RRMGHTPTLPH-ITFASFSDGYDDGFKPS--------DDIKLYISELE

Query:  RRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFF
        RRG + L  +I+++R + +PFTC+VY+IL+ WVA +AR   + S  LW+QP  VF+++Y+Y NGY D I  +A   +  S+SIKLP LPLL+ RD+PSF 
Subjt:  RRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSARDLPSFF

Query:  GASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAI-KKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSM
         +S+ Y+F LP FR+Q + L+EE NPKILINTF+ELE +A+ ++   F ++P+GPL+ ++  D             F S   Y+EWL++K  +SV+YVS 
Subjt:  GASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAI-KKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSM

Query:  GSISTVSKQQKEEIARGLSITKRPFLWVI-----RNIEEEED-----FLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVA
        G+++ +SK+Q  E+ + L  ++RPFLWVI     RN E+E++       SF+E+L+  G +VSWC Q  VL+  + GCF+THCGWNS LESL  GVP VA
Subjt:  GSISTVSKQQKEEIARGLSITKRPFLWVI-----RNIEEEED-----FLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVA

Query:  FPQWSDQATNSKIIEDLSETGVRL--EVEEEG--VVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVD
        FPQW+DQ  N+K++ED  +TGVR+  + EEEG  VV  EEI RC+E VM D  K EE R NA +WK LA EA  EGGSSF +LKAFVD
Subjt:  FPQWSDQATNSKIIEDLSETGVRL--EVEEEG--VVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTCGTCATAGCAAATGTCATTCCTAATTCCCAAACTATCCAATCAACTTTCCTATTAAATACGAACCTCACACACCTTACATTTTCTATCCTAATCCACAGTAT
GAATAACACAACACCCAATCCCAATCCCCGTCATGTCCTTTTAGTTACACATTGTGCTCAAGGCCACATCAATCCCACCCTCCAACTCGCCAAGCGCCTCACCCGCCATG
GGGATCTCCATGTCACCTTCCTCATCTCTCTATCCGCCTACCGCCGTATGGGTCATACCCCAACTCTCCCACATATCACTTTTGCCTCCTTCTCCGATGGCTACGATGAT
GGTTTCAAACCCAGTGACGACATTAAGCTTTATATATCCGAGCTTGAGCGTCGTGGATCTGATGCTTTGAAGAATATAATCCAGGAGAGTAGAAACAAAGGTCAACCCTT
CACTTGTATCGTGTATTCCATACTCATCCCTTGGGTGGCTACGGTGGCACGTTCCCTCGATGTTGCGTCGGTTCATCTTTGGATTCAACCGGCTGTCGTTTTCGCATTGT
ATTACTATTACAACAACGGATATTACGACGAAATTCAAAGGATTGCCTCTGGGGATGATCCTAGTTCAACGAGTATTAAATTACCTGGGCTTCCATTGTTGAGTGCTCGT
GATCTTCCATCATTTTTTGGCGCTTCAGATGGTTATTCTTTTGCACTCCCAATGTTTAGGAAGCAATTTGAATTACTAGAGGAAGAGAGTAATCCAAAGATTTTAATCAA
CACATTTGAAGAGCTGGAGAAAGATGCCGTGAAAGCAATTAAGAAATTTCATTTGATGCCTATCGGACCATTGATTCCATCTGTTCTTGTTGATGGAAATGACCCATCAG
AAGCTTCTTCTGGATGTGACCTATTTCGATCTACGAGCAGTTATATGGAGTGGTTGAACTCGAAACCTAAAGCATCAGTTGTCTACGTATCGATGGGAAGCATTTCAACA
GTATCAAAGCAACAAAAAGAGGAGATAGCGAGAGGATTATCAATAACAAAACGACCATTTTTATGGGTTATCCGAAACATTGAAGAAGAAGAAGATTTTTTAAGCTTTAA
AGAAAAACTAGAAACTCAAGGGAAAATAGTTTCATGGTGTGCTCAACTTGAGGTTCTCTCAAGTCCAGCCACAGGCTGCTTTCTCACACATTGTGGTTGGAATTCTTGTT
TGGAGAGTCTAGCTTGCGGTGTCCCAAACGTGGCATTTCCACAATGGTCCGATCAAGCGACCAACAGTAAGATCATTGAGGACTTGTCAGAGACCGGGGTGAGGTTAGAG
GTGGAGGAAGAGGGCGTGGTTAAGGGAGAGGAGATAGAAAGGTGCTTGGAGTTGGTAATGGGAGATTCAAAGAAAGGAGAAGAAATAAGGAGGAATGCTTTGAAATGGAA
GAAATTGGCTAAGGAAGCTGCTAGTGAGGGTGGTTCATCGTTTGCCAATTTGAAGGCTTTTGTGGATCACGTATGTTCTTAG
mRNA sequenceShow/hide mRNA sequence
TATTTAAATACATAAATACTATAGGTTTGCTGTGGCCTATGAGGCCACGTCCTTGAAAATCATGACCACAATAGGACCTTTCAAATAGAAAAATTGAATCAAAACTACAA
CTCCCAAATGAAATGCGTCACATAAACATGTATGAATTTCGTCATAGCAAATGTCATTCCTAATTCCCAAACTATCCAATCAACTTTCCTATTAAATACGAACCTCACAC
ACCTTACATTTTCTATCCTAATCCACAGTATGAATAACACAACACCCAATCCCAATCCCCGTCATGTCCTTTTAGTTACACATTGTGCTCAAGGCCACATCAATCCCACC
CTCCAACTCGCCAAGCGCCTCACCCGCCATGGGGATCTCCATGTCACCTTCCTCATCTCTCTATCCGCCTACCGCCGTATGGGTCATACCCCAACTCTCCCACATATCAC
TTTTGCCTCCTTCTCCGATGGCTACGATGATGGTTTCAAACCCAGTGACGACATTAAGCTTTATATATCCGAGCTTGAGCGTCGTGGATCTGATGCTTTGAAGAATATAA
TCCAGGAGAGTAGAAACAAAGGTCAACCCTTCACTTGTATCGTGTATTCCATACTCATCCCTTGGGTGGCTACGGTGGCACGTTCCCTCGATGTTGCGTCGGTTCATCTT
TGGATTCAACCGGCTGTCGTTTTCGCATTGTATTACTATTACAACAACGGATATTACGACGAAATTCAAAGGATTGCCTCTGGGGATGATCCTAGTTCAACGAGTATTAA
ATTACCTGGGCTTCCATTGTTGAGTGCTCGTGATCTTCCATCATTTTTTGGCGCTTCAGATGGTTATTCTTTTGCACTCCCAATGTTTAGGAAGCAATTTGAATTACTAG
AGGAAGAGAGTAATCCAAAGATTTTAATCAACACATTTGAAGAGCTGGAGAAAGATGCCGTGAAAGCAATTAAGAAATTTCATTTGATGCCTATCGGACCATTGATTCCA
TCTGTTCTTGTTGATGGAAATGACCCATCAGAAGCTTCTTCTGGATGTGACCTATTTCGATCTACGAGCAGTTATATGGAGTGGTTGAACTCGAAACCTAAAGCATCAGT
TGTCTACGTATCGATGGGAAGCATTTCAACAGTATCAAAGCAACAAAAAGAGGAGATAGCGAGAGGATTATCAATAACAAAACGACCATTTTTATGGGTTATCCGAAACA
TTGAAGAAGAAGAAGATTTTTTAAGCTTTAAAGAAAAACTAGAAACTCAAGGGAAAATAGTTTCATGGTGTGCTCAACTTGAGGTTCTCTCAAGTCCAGCCACAGGCTGC
TTTCTCACACATTGTGGTTGGAATTCTTGTTTGGAGAGTCTAGCTTGCGGTGTCCCAAACGTGGCATTTCCACAATGGTCCGATCAAGCGACCAACAGTAAGATCATTGA
GGACTTGTCAGAGACCGGGGTGAGGTTAGAGGTGGAGGAAGAGGGCGTGGTTAAGGGAGAGGAGATAGAAAGGTGCTTGGAGTTGGTAATGGGAGATTCAAAGAAAGGAG
AAGAAATAAGGAGGAATGCTTTGAAATGGAAGAAATTGGCTAAGGAAGCTGCTAGTGAGGGTGGTTCATCGTTTGCCAATTTGAAGGCTTTTGTGGATCACGTATGTTCT
TAGTGGTTGAGTTCAAGATGCAACCCATCGTCAAAATTAATTACATACAATTTCTAGTACCGACGTTACTATTTCGTTTTTAAAATAGTGTCATGTGGGGTTCAACCACA
TTATTATCATGCATGTCTATTTAATATTTATGAGAACATATTTCTAAAAAAATATTTGACACGTGCATATATTACTTTACAAATTTAGTGTGGTAAATTCTTTTTCTTGG
TAGAAATAGATTTTGTCCGCAAAACAAATGTCCTAAAAAACCTATTTTTATCGAAGAGAGAATGATCGCATGTAAAAGATATGTGTGC
Protein sequenceShow/hide protein sequence
MNFVIANVIPNSQTIQSTFLLNTNLTHLTFSILIHSMNNTTPNPNPRHVLLVTHCAQGHINPTLQLAKRLTRHGDLHVTFLISLSAYRRMGHTPTLPHITFASFSDGYDD
GFKPSDDIKLYISELERRGSDALKNIIQESRNKGQPFTCIVYSILIPWVATVARSLDVASVHLWIQPAVVFALYYYYNNGYYDEIQRIASGDDPSSTSIKLPGLPLLSAR
DLPSFFGASDGYSFALPMFRKQFELLEEESNPKILINTFEELEKDAVKAIKKFHLMPIGPLIPSVLVDGNDPSEASSGCDLFRSTSSYMEWLNSKPKASVVYVSMGSIST
VSKQQKEEIARGLSITKRPFLWVIRNIEEEEDFLSFKEKLETQGKIVSWCAQLEVLSSPATGCFLTHCGWNSCLESLACGVPNVAFPQWSDQATNSKIIEDLSETGVRLE
VEEEGVVKGEEIERCLELVMGDSKKGEEIRRNALKWKKLAKEAASEGGSSFANLKAFVDHVCS