CuGenDBv2

CsGy6G021700 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy6G021700
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Glycosyltransferase
Genome location: Gy14Chr6:22342533..22344305
RNA-Seq Expression: CsGy6G021700
Synteny: CsGy6G021700
Gene Ontology terms:
  GO:0080043 - quercetin 3-O-glucosyltransferase activity (molecular function)
  GO:0080044 - quercetin 7-O-glucosyltransferase activity (molecular function)
InterPro domains:
  IPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase
  IPR035595 - UDP-glycosyltransferase family, conserved site
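
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below parses such a string and derives the span length. The helper names (`GenomeLocation`, `parse_location`) are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'chromosome:start..end' location."""
    chromosome: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'Gy14Chr6:22342533..22344305'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chromosome, start, end = match.groups()
    return GenomeLocation(chromosome, int(start), int(end))


if __name__ == "__main__":
    loc = parse_location("Gy14Chr6:22342533..22344305")
    print(loc.chromosome, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 1773 bp.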


Homology
GenBank top hits (e value, % identity, alignment)
KAE8647335.1 hypothetical protein Csa_002840 [Cucumis sativus] (e value: 7.19e-268; % identity: 77.1)
Query:  MGSSFEE------IRKENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHDL--------PNLTIQNVSLFPYQGTDPETHHA
        MG S EE       ++E+  EV V+++P P+QGHINPLLQFAK+L  HH  LK+T+  ILT +++ H          P+LTI ++ L PYQG D      
Subjt:  MGSSFEE------IRKENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHDL--------PNLTIQNVSLFPYQGTDPETHHA

Query:  SSERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFV
          ERRQA+IR +LT LLT      NP IAC+VYD+++PWVLDI KQFGV  AAFFTQS AVN IYYN +KGWL    L +  I L+GLP L  SD PSFV
Subjt:  SSERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFV

Query:  SEQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVS
        S+  KYP +L+ L+DQF  ++ A WIF NTFDSLEP+EVKWMEG+FAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVS
Subjt:  SEQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVS

Query:  FGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQW
        FGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQW
Subjt:  FGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQW

Query:  SDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
        SDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
Subjt:  SDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN

XP_004140240.3 UDP-glycosyltransferase 74E2 [Cucumis sativus] (e value: 0.0; % identity: 100)
Query:  MGSSFEEIRKENGNEVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQL
        MGSSFEEIRKENGNEVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQL
Subjt:  MGSSFEEIRKENGNEVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQL

Query:  LTRHRDHGNPIACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQF
        LTRHRDHGNPIACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQF
Subjt:  LTRHRDHGNPIACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQF

Query:  VAVNGAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELAC
        VAVNGAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELAC
Subjt:  VAVNGAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELAC

Query:  ALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVG
        ALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVG
Subjt:  ALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVG

Query:  KRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
        KRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
Subjt:  KRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN

XP_008449393.1 PREDICTED: UDP-glycosyltransferase 74E1-like [Cucumis melo] (e value: 6.51e-258; % identity: 75.05)
Query:  MGSSFEEIRK-----ENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHD----------LPNLTIQNVSLFPYQGTD-PETH
        MG S EE RK     E   EV V++IP P+QGHINPLLQFAK+L  HH  LK+T+  ILTN+SS ++           P+LTI ++ L PYQG D PE  
Subjt:  MGSSFEEIRK-----ENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHD----------LPNLTIQNVSLFPYQGTD-PETH

Query:  HASSERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPS
        HA  ERR A+IR HLT LLT      NP IAC+VYD++MP VLDI KQFG+  AAFFTQS AVN IY N HKGWLS   LK+  I L+GLP LC SD PS
Subjt:  HASSERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPS

Query:  FVSEQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVI
        FVS + KYP +LSFL+DQF  ++   WIF NTFDSLEP+EVKWM+GEFA MKNIGPMVPSMYLDGRLENDKDYGVS+FEPNKNKDLTMKWLDSKH+KSVI
Subjt:  FVSEQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVI

Query:  YVSFGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTM
        YVSFGS  ELEKEQMEELACALK TNKYFLWVVRESE+HKLP NF+EDHED AGDQKGLVVNWC QLQVLAHKS+GCFVTHCGWNSTLEALSLGVPLVTM
Subjt:  YVSFGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTM

Query:  AQWSDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
        AQWSDQPTNAKYVEDVW++GKRVRL EEDNG+CRREEIE CVNEVM EEGEVGEEIRK LRKWRELAKEAMDDGGTSHANI+HF+QQLLNKTN
Subjt:  AQWSDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN

XP_008449394.1 PREDICTED: UDP-glycosyltransferase 74E2-like [Cucumis melo] (e value: 0.0; % identity: 89.87)
Query:  MGSSFEEIRKENGNEVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQL
        MGS+FEE RKENGNEVHVV+IPYP+QGHINPLLQFAK+LHHEGLKVT+ TILTNSSSLHDLPNLTIQNVSLFPYQGTDPET H+ SERRQASIRLHLTQL
Subjt:  MGSSFEEIRKENGNEVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQL

Query:  LTRHRDHGNPIACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQF
        LTRHRD GNPIACLVYDSIMPWVLDIAKQFGVL AAFFTQSSAVN IYYNFHKGWLS+DALKE LICL+GLP LCSSDLPSFVSEQHKYPA+LSFLA+QF
Subjt:  LTRHRDHGNPIACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQF

Query:  VAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELA
        VA+N AHWIFANTFDSLEPKEVKWMEGE A MKNIGPMVPSMYLDGRLENDKDYGVS+FE N NKD TMKWLDSKH+KSVIYVSFGS  ELEKEQMEELA
Subjt:  VAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELA

Query:  CALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRV
        CALK TNKYFLWVVRESE+HKLP NF+EDHED AGDQKGLVVNWC QLQVLAHKS+GCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVW++
Subjt:  CALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRV

Query:  GKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
        GKRVRL EEDNG+CRREEIE CVNEVM EEGEV EEIRK LRKWRELAKEAMDDGGTSHANI+HF+QQLL KTN
Subjt:  GKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN

XP_031742564.1 UDP-glycosyltransferase 74E2-like [Cucumis sativus] (e value: 1.47e-269; % identity: 77.25)
Query:  MGSSFEEIRK-----ENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHDL--------PNLTIQNVSLFPYQGTDPETHHAS
        MG S EE RK     +    V V+++P+P+QGHINPLLQFAK+L  HH  LK+T+  ILT +++ H          P+LTI ++ L PYQG D       
Subjt:  MGSSFEEIRK-----ENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHDL--------PNLTIQNVSLFPYQGTDPETHHAS

Query:  SERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVS
         ERRQA+IR HLT LLT      NP IAC+VYD+  PWV+DI KQFGV  AAFFTQS AVN IYYN +KGWL    L++  I L+GLP LC SD PSFV 
Subjt:  SERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVS

Query:  EQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSF
        +  KYP +L+ L+DQF  ++ A WIF NTFDSLEP+EVKWMEG+FAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSF
Subjt:  EQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSF

Query:  GSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWS
        GSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWS
Subjt:  GSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWS

Query:  DQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
        DQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
Subjt:  DQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
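
The % identity values listed with each hit can be approximated from the alignment rows themselves. In BLAST-style output the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for mismatches and gaps. The sketch below counts letters in the middle line and divides by the aligned length. The function name `percent_identity` is hypothetical, and it assumes the `Query:` prefix and its padding have already been stripped from each row and that the middle line may have lost trailing spaces during extraction.

```python
from typing import Iterable, Tuple


def percent_identity(rows: Iterable[Tuple[str, str]]) -> float:
    """Estimate % identity from (query_segment, midline_segment) pairs.

    A letter in the midline marks an identical position; '+' and spaces
    mark non-identical positions (conservative changes, mismatches, gaps).
    """
    identical = 0
    aligned = 0
    for query, midline in rows:
        aligned += len(query)
        # Restore trailing spaces that may have been trimmed from the midline.
        padded = midline.ljust(len(query))[: len(query)]
        identical += sum(1 for symbol in padded if symbol.isalpha())
    return 100.0 * identical / aligned if aligned else 0.0


if __name__ == "__main__":
    # Toy rows for illustration only, not taken from a specific hit.
    rows = [
        ("MGSSFEE", "MG S EE"),
        ("ACLVY", "AC+VY"),
    ]
    print(round(percent_identity(rows), 2))
```

Values computed this way match the listed figures only if every row of the alignment is supplied; a truncated alignment gives an estimate for the visible region alone.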

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KJJ6 Glycosyltransferase (e value: 5.21e-256; % identity: 73.82)
Query:  MGSSFEE------IRKENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHDL--------PNLTIQNVSLFPYQGTDPETHHA
        MG S EE       ++E+  EV V+++P P+QGHINPLLQFAK+L  HH  LK+T+  ILT +++ H          P+LTI ++ L PYQG D      
Subjt:  MGSSFEE------IRKENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHDL--------PNLTIQNVSLFPYQGTDPETHHA

Query:  SSERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFV
          ERRQA+IR +LT LLT      NP IAC+VYD+++PWVLDI KQFGV  AAFFTQS AVN IYYN +KGWL    L +  I L+GLP L  SD PSFV
Subjt:  SSERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFV

Query:  SEQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVS
        S+  KYP +L+ L+DQF  ++ A WIF NTFDSLEP+EVKWMEGEFAMKNIGP VPSMYLDGRLEND DYGVSMFE  KNKDLTMKWLDSKHHKSVIYVS
Subjt:  SEQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVS

Query:  FGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQW
        FGS AELEKEQMEELACALK TN+YFLWVVRESE+HKLPQNFIEDHED AGDQKGLVVNWC QLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQW
Subjt:  FGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQW

Query:  SDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
        SDQPTNAKYVEDVW++GKRVRLREEDNG+CRREEIEKCVNEVMEEG+V EEIRK LRKWRELAKEAMDDGGTSHANIIHF+QQLLNKTN
Subjt:  SDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN

A0A1S3BMK1 Glycosyltransferase (e value: 3.15e-258; % identity: 75.05)
Query:  MGSSFEEIRK-----ENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHD----------LPNLTIQNVSLFPYQGTD-PETH
        MG S EE RK     E   EV V++IP P+QGHINPLLQFAK+L  HH  LK+T+  ILTN+SS ++           P+LTI ++ L PYQG D PE  
Subjt:  MGSSFEEIRK-----ENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHD----------LPNLTIQNVSLFPYQGTD-PETH

Query:  HASSERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPS
        HA  ERR A+IR HLT LLT      NP IAC+VYD++MP VLDI KQFG+  AAFFTQS AVN IY N HKGWLS   LK+  I L+GLP LC SD PS
Subjt:  HASSERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPS

Query:  FVSEQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVI
        FVS + KYP +LSFL+DQF  ++   WIF NTFDSLEP+EVKWM+GEFA MKNIGPMVPSMYLDGRLENDKDYGVS+FEPNKNKDLTMKWLDSKH+KSVI
Subjt:  FVSEQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVI

Query:  YVSFGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTM
        YVSFGS  ELEKEQMEELACALK TNKYFLWVVRESE+HKLP NF+EDHED AGDQKGLVVNWC QLQVLAHKS+GCFVTHCGWNSTLEALSLGVPLVTM
Subjt:  YVSFGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTM

Query:  AQWSDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
        AQWSDQPTNAKYVEDVW++GKRVRL EEDNG+CRREEIE CVNEVM EEGEVGEEIRK LRKWRELAKEAMDDGGTSHANI+HF+QQLLNKTN
Subjt:  AQWSDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN

A0A1S3BMV2 Glycosyltransferase (e value: 0.0; % identity: 89.87)
Query:  MGSSFEEIRKENGNEVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQL
        MGS+FEE RKENGNEVHVV+IPYP+QGHINPLLQFAK+LHHEGLKVT+ TILTNSSSLHDLPNLTIQNVSLFPYQGTDPET H+ SERRQASIRLHLTQL
Subjt:  MGSSFEEIRKENGNEVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQL

Query:  LTRHRDHGNPIACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQF
        LTRHRD GNPIACLVYDSIMPWVLDIAKQFGVL AAFFTQSSAVN IYYNFHKGWLS+DALKE LICL+GLP LCSSDLPSFVSEQHKYPA+LSFLA+QF
Subjt:  LTRHRDHGNPIACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQF

Query:  VAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELA
        VA+N AHWIFANTFDSLEPKEVKWMEGE A MKNIGPMVPSMYLDGRLENDKDYGVS+FE N NKD TMKWLDSKH+KSVIYVSFGS  ELEKEQMEELA
Subjt:  VAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELA

Query:  CALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRV
        CALK TNKYFLWVVRESE+HKLP NF+EDHED AGDQKGLVVNWC QLQVLAHKS+GCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVW++
Subjt:  CALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRV

Query:  GKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
        GKRVRL EEDNG+CRREEIE CVNEVM EEGEV EEIRK LRKWRELAKEAMDDGGTSHANI+HF+QQLL KTN
Subjt:  GKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN

A0A5A7UQI0 Glycosyltransferase (e value: 3.15e-258; % identity: 75.05)
Query:  MGSSFEEIRK-----ENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHD----------LPNLTIQNVSLFPYQGTD-PETH
        MG S EE RK     E   EV V++IP P+QGHINPLLQFAK+L  HH  LK+T+  ILTN+SS ++           P+LTI ++ L PYQG D PE  
Subjt:  MGSSFEEIRK-----ENGNEVHVVMIPYPSQGHINPLLQFAKYL--HHEGLKVTMLTILTNSSSLHD----------LPNLTIQNVSLFPYQGTD-PETH

Query:  HASSERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPS
        HA  ERR A+IR HLT LLT      NP IAC+VYD++MP VLDI KQFG+  AAFFTQS AVN IY N HKGWLS   LK+  I L+GLP LC SD PS
Subjt:  HASSERRQASIRLHLTQLLTRHRDHGNP-IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPS

Query:  FVSEQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVI
        FVS + KYP +LSFL+DQF  ++   WIF NTFDSLEP+EVKWM+GEFA MKNIGPMVPSMYLDGRLENDKDYGVS+FEPNKNKDLTMKWLDSKH+KSVI
Subjt:  FVSEQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVI

Query:  YVSFGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTM
        YVSFGS  ELEKEQMEELACALK TNKYFLWVVRESE+HKLP NF+EDHED AGDQKGLVVNWC QLQVLAHKS+GCFVTHCGWNSTLEALSLGVPLVTM
Subjt:  YVSFGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTM

Query:  AQWSDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
        AQWSDQPTNAKYVEDVW++GKRVRL EEDNG+CRREEIE CVNEVM EEGEVGEEIRK LRKWRELAKEAMDDGGTSHANI+HF+QQLLNKTN
Subjt:  AQWSDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN

A0A5A7UV08 Glycosyltransferase (e value: 0.0; % identity: 89.87)
Query:  MGSSFEEIRKENGNEVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQL
        MGS+FEE RKENGNEVHVV+IPYP+QGHINPLLQFAK+LHHEGLKVT+ TILTNSSSLHDLPNLTIQNVSLFPYQGTDPET H+ SERRQASIRLHLTQL
Subjt:  MGSSFEEIRKENGNEVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQL

Query:  LTRHRDHGNPIACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQF
        LTRHRD GNPIACLVYDSIMPWVLDIAKQFGVL AAFFTQSSAVN IYYNFHKGWLS+DALKE LICL+GLP LCSSDLPSFVSEQHKYPA+LSFLA+QF
Subjt:  LTRHRDHGNPIACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQF

Query:  VAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELA
        VA+N AHWIFANTFDSLEPKEVKWMEGE A MKNIGPMVPSMYLDGRLENDKDYGVS+FE N NKD TMKWLDSKH+KSVIYVSFGS  ELEKEQMEELA
Subjt:  VAVNGAHWIFANTFDSLEPKEVKWMEGEFA-MKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELA

Query:  CALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRV
        CALK TNKYFLWVVRESE+HKLP NF+EDHED AGDQKGLVVNWC QLQVLAHKS+GCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVW++
Subjt:  CALKRTNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRV

Query:  GKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN
        GKRVRL EEDNG+CRREEIE CVNEVM EEGEV EEIRK LRKWRELAKEAMDDGGTSHANI+HF+QQLL KTN
Subjt:  GKRVRLREEDNGMCRREEIEKCVNEVM-EEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNKTN

SwissProt top hits (e value, % identity, alignment)
K7NBW3 Mogroside IE synthase (e value: 5.7e-107; % identity: 44.98)
Query:  EVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLH----DLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQLLTRHRDHGNP
        + H+++ P+PSQGHINPLLQ +K L  +G+KV+++T L  S+ L        ++ I+ +S       + +T   + +R +  +  +L   L +     NP
Subjt:  EVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLH----DLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQLLTRHRDHGNP

Query:  IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWIF
           ++YDS MPWVL++AK+FG+  A F+TQS A+N I Y+   G L     +   I L  +P L  SDLP++  +      ++  L  Q+  +  A+ +F
Subjt:  IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWIF

Query:  ANTFDSLEPKEVKWMEG-EFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKYF
         NTFD LE + ++WME     +K +GP VPS YLD R+ENDK YG+S+F+P  N+D+ +KWLDSK   SV+YVS+GS  E+ +EQ++ELA  +K T K+F
Subjt:  ANTFDSLEPKEVKWMEG-EFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKYF

Query:  LWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREED
        LWVVR++E  KLP NF+E     +  +KGLVV+WC QL+VLAH SVGCF THCGWNSTLEAL LGVP+V   QW+DQ TNAK++EDVW+VGKRV+  E+ 
Subjt:  LWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREED

Query:  NGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQL
          +  +EE+  C+ EVM EGE   E +    +W++ AKEA+D+GG+S  NI  F+  L
Subjt:  NGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQL

P0C7P7 UDP-glycosyltransferase 74E1 (e value: 3.0e-108; % identity: 46.48)
Query:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSL---FPYQGTDPETHHASSERRQASIRLHLTQLLTRHRDHGNPIAC
        HV+++P+P+QGHI P+ QF K L  + LK+T++ +    S  +   + TI  V +   F       E      ER ++SI+  L +L+   +  GNP   
Subjt:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSL---FPYQGTDPETHHASSERRQASIRLHLTQLLTRHRDHGNPIAC

Query:  LVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALK---ESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWIF
        LVYDS MPW+LD+A  +G+  A FFTQ   V+ IYY+  KG  S  + K    +L     LP L ++DLPSF+ E   YP +L  + DQ   ++    + 
Subjt:  LVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALK---ESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWIF

Query:  ANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKYFL
         NTFD LE K +KW++  + + NIGP VPSMYLD RL  DK+YG S+F     +   M+WL+SK   SV+YVSFGS   L+K+Q+ ELA  LK++  +FL
Subjt:  ANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKYFL

Query:  WVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREEDN
        WVVRE+E  KLP+N+IE+       +KGL V+W  QL+VL HKS+GCFVTHCGWNSTLE LSLGVP++ M  W+DQPTNAK++EDVW+VG  VR++ + +
Subjt:  WVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREEDN

Query:  GMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFL
        G  RREE  + V EVM E E G+EIRK   KW+ LA+EA+ +GG+S  NI  F+
Subjt:  GMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFL

Q9SKC1 UDP-glycosyltransferase 74C1 (e value: 3.2e-102; % identity: 43.7)
Query:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLH--DLPNLTIQNV--SLFPYQGTDPETHHASSERRQASIRLHLTQLLTRHRDHGNPIA
        HV+  PYP QGHINP++Q AK L  +G+  T++    +    +  D  ++T+  +    FP++   P       +R   S    LT  ++  +   NP  
Subjt:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLH--DLPNLTIQNV--SLFPYQGTDPETHHASSERRQASIRLHLTQLLTRHRDHGNPIA

Query:  CLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKE---SLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWI
         L+YD  MP+ LDIAK   +   A+FTQ    +++YY+ ++G       +    +L    G P L   DLPSF  E+  YP L  F+  QF  +  A  I
Subjt:  CLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKE---SLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWI

Query:  FANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGV--SMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNK
          NTFD LEPK VKWM  ++ +KNIGP+VPS +LD RL  DKDY +  S  EP+++    +KWL ++  KSV+YV+FG+   L ++QM+E+A A+ +T  
Subjt:  FANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGV--SMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNK

Query:  YFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLRE
        +FLW VRESE  KLP  FIE+ E+      GLV  W  QL+VLAH+S+GCFV+HCGWNSTLEAL LGVP+V + QW+DQPTNAK++EDVW++G  VR+R 
Subjt:  YFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLRE

Query:  EDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQL
        +  G+  +EEI +C+ EVM EGE G+EIRK + K + LA+EA+ +GG+S   I  F+  L
Subjt:  EDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQL

Q9SKC5 UDP-glycosyltransferase 74D1 (e value: 1.3e-103; % identity: 45.16)
Query:  EVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSL---HDLPNLTIQNVSLFPYQGTDPETHHASS------ERRQASIRLHLTQLLTRHR
        + +V++  +P QGHINPLLQF+K L  + + VT LT  +  +S+         T   +S  P      E H ++        + Q ++   L++L++   
Subjt:  EVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSL---HDLPNLTIQNVSLFPYQGTDPETHHASS------ERRQASIRLHLTQLLTRHR

Query:  DHGNPIACLVYDSIMPWVLDIAKQF-GVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVN
           N +   VYDS +P+VLD+ ++  GV  A+FFTQSS VN  Y +F +G        ++ + L  +P L  +DLP F+ + +    L   ++ QFV V+
Subjt:  DHGNPIACLVYDSIMPWVLDIAKQF-GVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVN

Query:  GAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKR
           +   N+FD LE + ++WM+ ++ +KNIGPM+PSMYLD RL  DKDYG+++F    N+   + WLDSK   SVIYVSFGS A L+ +QM E+A  LK+
Subjt:  GAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKR

Query:  TNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVR
        T   FLWVVRE+E  KLP N+IED  D     KGL+VNW  QLQVLAHKS+GCF+THCGWNSTLEALSLGV L+ M  +SDQPTNAK++EDVW+VG  VR
Subjt:  TNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVR

Query:  LREEDNGMCRREEIEKCVNEVMEE-GEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLL
        ++ + NG   +EEI +CV EVME+  E G+EIRK  R+  E A+EA+ DGG S  NI  F+ +++
Subjt:  LREEDNGMCRREEIEKCVNEVMEE-GEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLL

Q9SYK9 UDP-glycosyltransferase 74E2 (e value: 1.5e-110; % identity: 46.27)
Query:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSS----SLHDLPNLTIQNVSLFPYQGTDP-ETHHASSERRQASIRLHLTQLLTRHRDHGNPI
        H++++P+P QGHI P+ QF K L  +GLK+T++ +    S    + HD  ++T+  +S    +G +P +      ER + SI+  L +L+   +  GNP 
Subjt:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSS----SLHDLPNLTIQNVSLFPYQGTDP-ETHHASSERRQASIRLHLTQLLTRHRDHGNPI

Query:  ACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALK---ESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHW
          +VYDS MPW+LD+A  +G+  A FFTQ   V  IYY+  KG  S  + K    +L      P L ++DLPSF+ E   YP +L  + DQ   ++    
Subjt:  ACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALK---ESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHW

Query:  IFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKY
        +  NTFD LE K +KW++  + + NIGP VPSMYLD RL  DK+YG S+F  N      M+WL+SK   SV+Y+SFGS   L+++QM ELA  LK++ ++
Subjt:  IFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKY

Query:  FLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREE
        FLWVVRE+E HKLP+N++E+       +KGL+V+W  QL VLAHKS+GCF+THCGWNSTLE LSLGVP++ M  W+DQPTNAK+++DVW+VG  VR++ E
Subjt:  FLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREE

Query:  DNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFL
         +G  RREEI + V EVM EGE G+EIRK   KW+ LA+EA+ +GG+S  +I  F+
Subjt:  DNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFL

Arabidopsis top hits (e value, % identity, alignment)
AT1G05675.1 UDP-Glycosyltransferase superfamily protein (e value: 2.1e-109; % identity: 46.48)
Query:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSL---FPYQGTDPETHHASSERRQASIRLHLTQLLTRHRDHGNPIAC
        HV+++P+P+QGHI P+ QF K L  + LK+T++ +    S  +   + TI  V +   F       E      ER ++SI+  L +L+   +  GNP   
Subjt:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSL---FPYQGTDPETHHASSERRQASIRLHLTQLLTRHRDHGNPIAC

Query:  LVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALK---ESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWIF
        LVYDS MPW+LD+A  +G+  A FFTQ   V+ IYY+  KG  S  + K    +L     LP L ++DLPSF+ E   YP +L  + DQ   ++    + 
Subjt:  LVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALK---ESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWIF

Query:  ANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKYFL
         NTFD LE K +KW++  + + NIGP VPSMYLD RL  DK+YG S+F     +   M+WL+SK   SV+YVSFGS   L+K+Q+ ELA  LK++  +FL
Subjt:  ANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKYFL

Query:  WVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREEDN
        WVVRE+E  KLP+N+IE+       +KGL V+W  QL+VL HKS+GCFVTHCGWNSTLE LSLGVP++ M  W+DQPTNAK++EDVW+VG  VR++ + +
Subjt:  WVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREEDN

Query:  GMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFL
        G  RREE  + V EVM E E G+EIRK   KW+ LA+EA+ +GG+S  NI  F+
Subjt:  GMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFL

AT1G05680.1 Uridine diphosphate glycosyltransferase 74E2 (e value: 1.0e-111; % identity: 46.27)
Query:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSS----SLHDLPNLTIQNVSLFPYQGTDP-ETHHASSERRQASIRLHLTQLLTRHRDHGNPI
        H++++P+P QGHI P+ QF K L  +GLK+T++ +    S    + HD  ++T+  +S    +G +P +      ER + SI+  L +L+   +  GNP 
Subjt:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSS----SLHDLPNLTIQNVSLFPYQGTDP-ETHHASSERRQASIRLHLTQLLTRHRDHGNPI

Query:  ACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALK---ESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHW
          +VYDS MPW+LD+A  +G+  A FFTQ   V  IYY+  KG  S  + K    +L      P L ++DLPSF+ E   YP +L  + DQ   ++    
Subjt:  ACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALK---ESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHW

Query:  IFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKY
        +  NTFD LE K +KW++  + + NIGP VPSMYLD RL  DK+YG S+F  N      M+WL+SK   SV+Y+SFGS   L+++QM ELA  LK++ ++
Subjt:  IFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKY

Query:  FLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREE
        FLWVVRE+E HKLP+N++E+       +KGL+V+W  QL VLAHKS+GCF+THCGWNSTLE LSLGVP++ M  W+DQPTNAK+++DVW+VG  VR++ E
Subjt:  FLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREE

Query:  DNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFL
         +G  RREEI + V EVM EGE G+EIRK   KW+ LA+EA+ +GG+S  +I  F+
Subjt:  DNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFL

AT2G31750.1 UDP-glucosyl transferase 74D1 (e value: 9.3e-105; % identity: 45.16)
Query:  EVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSL---HDLPNLTIQNVSLFPYQGTDPETHHASS------ERRQASIRLHLTQLLTRHR
        + +V++  +P QGHINPLLQF+K L  + + VT LT  +  +S+         T   +S  P      E H ++        + Q ++   L++L++   
Subjt:  EVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSL---HDLPNLTIQNVSLFPYQGTDPETHHASS------ERRQASIRLHLTQLLTRHR

Query:  DHGNPIACLVYDSIMPWVLDIAKQF-GVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVN
           N +   VYDS +P+VLD+ ++  GV  A+FFTQSS VN  Y +F +G        ++ + L  +P L  +DLP F+ + +    L   ++ QFV V+
Subjt:  DHGNPIACLVYDSIMPWVLDIAKQF-GVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVN

Query:  GAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKR
           +   N+FD LE + ++WM+ ++ +KNIGPM+PSMYLD RL  DKDYG+++F    N+   + WLDSK   SVIYVSFGS A L+ +QM E+A  LK+
Subjt:  GAHWIFANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKR

Query:  TNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVR
        T   FLWVVRE+E  KLP N+IED  D     KGL+VNW  QLQVLAHKS+GCF+THCGWNSTLEALSLGV L+ M  +SDQPTNAK++EDVW+VG  VR
Subjt:  TNKYFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVR

Query:  LREEDNGMCRREEIEKCVNEVMEE-GEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLL
        ++ + NG   +EEI +CV EVME+  E G+EIRK  R+  E A+EA+ DGG S  NI  F+ +++
Subjt:  LREEDNGMCRREEIEKCVNEVMEE-GEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLL

AT2G31790.1 UDP-Glycosyltransferase superfamily protein (e value: 2.3e-103; % identity: 43.7)
Query:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLH--DLPNLTIQNV--SLFPYQGTDPETHHASSERRQASIRLHLTQLLTRHRDHGNPIA
        HV+  PYP QGHINP++Q AK L  +G+  T++    +    +  D  ++T+  +    FP++   P       +R   S    LT  ++  +   NP  
Subjt:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLH--DLPNLTIQNV--SLFPYQGTDPETHHASSERRQASIRLHLTQLLTRHRDHGNPIA

Query:  CLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKE---SLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWI
         L+YD  MP+ LDIAK   +   A+FTQ    +++YY+ ++G       +    +L    G P L   DLPSF  E+  YP L  F+  QF  +  A  I
Subjt:  CLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKE---SLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWI

Query:  FANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGV--SMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNK
          NTFD LEPK VKWM  ++ +KNIGP+VPS +LD RL  DKDY +  S  EP+++    +KWL ++  KSV+YV+FG+   L ++QM+E+A A+ +T  
Subjt:  FANTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGV--SMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNK

Query:  YFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLRE
        +FLW VRESE  KLP  FIE+ E+      GLV  W  QL+VLAH+S+GCFV+HCGWNSTLEAL LGVP+V + QW+DQPTNAK++EDVW++G  VR+R 
Subjt:  YFLWVVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLRE

Query:  EDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQL
        +  G+  +EEI +C+ EVM EGE G+EIRK + K + LA+EA+ +GG+S   I  F+  L
Subjt:  EDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQL

AT2G43820.1 UDP-glucosyltransferase 74F2 (e value: 9.1e-100; % identity: 42.05)
Query:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTM-LTILTNSSSLHDLPN-LTIQNVSLFPYQGTDPETHHASSERR---QASIRLHLTQLLTRHRDHGNPI
        HV+ +PYP+QGHI P  QF K LH +GLK T+ LT    +S   DL   ++I  +S   Y     ET  +  +     + S    +  ++ +H+   NPI
Subjt:  HVVMIPYPSQGHINPLLQFAKYLHHEGLKVTM-LTILTNSSSLHDLPN-LTIQNVSLFPYQGTDPETHHASSERR---QASIRLHLTQLLTRHRDHGNPI

Query:  ACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWIFA
         C+VYD+ +PW LD+A++FG++   FFTQ  AVN +YY     +++N +L+   + +  LP L   DLPSF S    YPA    +  QF+    A ++  
Subjt:  ACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWIFA

Query:  NTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKYFLW
        N+F  LE  E +       +  IGP +PS+YLD R+++D  Y +++FE +K+    + WLD++   SV+YV+FGS A+L   QMEELA A+  +N  FLW
Subjt:  NTFDSLEPKEVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKYFLW

Query:  VVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREEDNG
        VVR SE  KLP  F+E        +K LV+ W  QLQVL++K++GCF+THCGWNST+EAL+ GVP+V M QW+DQP NAKY++DVW+ G RV+  E+++G
Subjt:  VVRESEVHKLPQNFIEDHEDAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREEDNG

Query:  MCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNK
        + +REEIE  + EVM EGE  +E++K ++KWR+LA +++++GG++  NI  F+ ++ +K
Subjt:  MCRREEIEKCVNEVMEEGEVGEEIRKRLRKWRELAKEAMDDGGTSHANIIHFLQQLLNK


Sequences
CDS sequence
ATGGGAAGCAGCTTTGAAGAGATCAGAAAAGAGAATGGAAATGAAGTTCATGTGGTGATGATTCCATATCCAAGCCAAGGTCACATTAACCCACTCCTCCAATTTGCCAA
ATACCTCCACCATGAAGGACTCAAAGTTACAATGCTTACCATTTTAACCAACTCTTCCTCCCTTCATGACCTCCCTAATCTCACTATTCAAAATGTATCTCTCTTTCCAT
ACCAAGGCACGGATCCCGAGACCCACCACGCTTCTTCGGAGCGTCGCCAGGCTTCCATTCGCTTGCATTTGACTCAACTTCTTACTCGTCATCGAGACCATGGCAATCCA
ATTGCTTGCCTTGTTTATGACTCTATCATGCCATGGGTTCTTGACATCGCCAAACAGTTTGGTGTTTTGTGTGCTGCGTTTTTCACTCAATCTTCCGCTGTTAATGTTAT
TTATTACAACTTTCATAAAGGGTGGCTTAGTAATGATGCATTAAAGGAGAGTTTGATTTGTTTGAATGGACTTCCGGGTCTTTGCTCTTCAGATCTGCCTTCTTTTGTTT
CTGAGCAACACAAGTACCCAGCTCTTCTTAGCTTCTTAGCTGACCAATTTGTAGCAGTGAATGGTGCCCATTGGATCTTTGCCAACACATTTGATAGCTTAGAACCAAAG
GAGGTGAAGTGGATGGAAGGGGAGTTTGCAATGAAGAACATTGGTCCAATGGTTCCTTCAATGTACTTAGATGGAAGGCTAGAAAACGACAAAGATTATGGGGTTAGCAT
GTTTGAACCAAACAAAAACAAAGATTTGACAATGAAATGGCTTGATTCTAAGCACCACAAATCCGTCATCTACGTCTCATTTGGAAGTGGCGCTGAATTAGAGAAGGAGC
AAATGGAAGAATTAGCATGTGCCCTGAAGAGAACCAACAAATACTTCTTATGGGTTGTTAGAGAATCGGAGGTCCATAAGCTTCCTCAGAATTTCATTGAGGATCATGAG
GACGCAGCAGGTGATCAGAAAGGGTTGGTGGTAAATTGGTGCTGTCAGCTCCAAGTGTTGGCTCATAAATCAGTTGGATGTTTCGTTACGCACTGCGGTTGGAACTCGAC
CCTGGAAGCGTTGAGTTTGGGAGTGCCGTTGGTGACGATGGCGCAGTGGTCGGACCAGCCAACGAATGCCAAGTACGTGGAGGATGTGTGGAGGGTTGGGAAGAGGGTGA
GATTAAGAGAGGAAGACAATGGAATGTGTAGAAGAGAAGAGATAGAGAAATGTGTGAATGAAGTTATGGAAGAAGGAGAAGTTGGAGAAGAAATAAGAAAGAGGTTGAGG
AAGTGGAGGGAATTGGCAAAAGAAGCTATGGATGATGGAGGAACCTCTCATGCAAACATTATTCACTTTCTACAACAACTTCTTAACAAAACTAATTAA
mRNA sequence
ATGGGAAGCAGCTTTGAAGAGATCAGAAAAGAGAATGGAAATGAAGTTCATGTGGTGATGATTCCATATCCAAGCCAAGGTCACATTAACCCACTCCTCCAATTTGCCAA
ATACCTCCACCATGAAGGACTCAAAGTTACAATGCTTACCATTTTAACCAACTCTTCCTCCCTTCATGACCTCCCTAATCTCACTATTCAAAATGTATCTCTCTTTCCAT
ACCAAGGCACGGATCCCGAGACCCACCACGCTTCTTCGGAGCGTCGCCAGGCTTCCATTCGCTTGCATTTGACTCAACTTCTTACTCGTCATCGAGACCATGGCAATCCA
ATTGCTTGCCTTGTTTATGACTCTATCATGCCATGGGTTCTTGACATCGCCAAACAGTTTGGTGTTTTGTGTGCTGCGTTTTTCACTCAATCTTCCGCTGTTAATGTTAT
TTATTACAACTTTCATAAAGGGTGGCTTAGTAATGATGCATTAAAGGAGAGTTTGATTTGTTTGAATGGACTTCCGGGTCTTTGCTCTTCAGATCTGCCTTCTTTTGTTT
CTGAGCAACACAAGTACCCAGCTCTTCTTAGCTTCTTAGCTGACCAATTTGTAGCAGTGAATGGTGCCCATTGGATCTTTGCCAACACATTTGATAGCTTAGAACCAAAG
GAGGTGAAGTGGATGGAAGGGGAGTTTGCAATGAAGAACATTGGTCCAATGGTTCCTTCAATGTACTTAGATGGAAGGCTAGAAAACGACAAAGATTATGGGGTTAGCAT
GTTTGAACCAAACAAAAACAAAGATTTGACAATGAAATGGCTTGATTCTAAGCACCACAAATCCGTCATCTACGTCTCATTTGGAAGTGGCGCTGAATTAGAGAAGGAGC
AAATGGAAGAATTAGCATGTGCCCTGAAGAGAACCAACAAATACTTCTTATGGGTTGTTAGAGAATCGGAGGTCCATAAGCTTCCTCAGAATTTCATTGAGGATCATGAG
GACGCAGCAGGTGATCAGAAAGGGTTGGTGGTAAATTGGTGCTGTCAGCTCCAAGTGTTGGCTCATAAATCAGTTGGATGTTTCGTTACGCACTGCGGTTGGAACTCGAC
CCTGGAAGCGTTGAGTTTGGGAGTGCCGTTGGTGACGATGGCGCAGTGGTCGGACCAGCCAACGAATGCCAAGTACGTGGAGGATGTGTGGAGGGTTGGGAAGAGGGTGA
GATTAAGAGAGGAAGACAATGGAATGTGTAGAAGAGAAGAGATAGAGAAATGTGTGAATGAAGTTATGGAAGAAGGAGAAGTTGGAGAAGAAATAAGAAAGAGGTTGAGG
AAGTGGAGGGAATTGGCAAAAGAAGCTATGGATGATGGAGGAACCTCTCATGCAAACATTATTCACTTTCTACAACAACTTCTTAACAAAACTAATTAATTTCCAT
Protein sequence
MGSSFEEIRKENGNEVHVVMIPYPSQGHINPLLQFAKYLHHEGLKVTMLTILTNSSSLHDLPNLTIQNVSLFPYQGTDPETHHASSERRQASIRLHLTQLLTRHRDHGNP
IACLVYDSIMPWVLDIAKQFGVLCAAFFTQSSAVNVIYYNFHKGWLSNDALKESLICLNGLPGLCSSDLPSFVSEQHKYPALLSFLADQFVAVNGAHWIFANTFDSLEPK
EVKWMEGEFAMKNIGPMVPSMYLDGRLENDKDYGVSMFEPNKNKDLTMKWLDSKHHKSVIYVSFGSGAELEKEQMEELACALKRTNKYFLWVVRESEVHKLPQNFIEDHE
DAAGDQKGLVVNWCCQLQVLAHKSVGCFVTHCGWNSTLEALSLGVPLVTMAQWSDQPTNAKYVEDVWRVGKRVRLREEDNGMCRREEIEKCVNEVMEEGEVGEEIRKRLR
KWRELAKEAMDDGGTSHANIIHFLQQLLNKTN