CuGenDBv2

CmoCh19G002000 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh19G002000
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Glycosyltransferase
Genome location: Cmo_Chr19:1308125..1310965
RNA-Seq Expression: CmoCh19G002000
Synteny: CmoCh19G002000
Gene Ontology terms:
  GO:0080043 - quercetin 3-O-glucosyltransferase activity (molecular function)
  GO:0080044 - quercetin 7-O-glucosyltransferase activity (molecular function)
InterPro domains:
  IPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase
  IPR035595 - UDP-glycosyltransferase family, conserved site
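
The genome location in this record follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the locus string can be split into its parts and the span computed in Python. The names `parse_location` and `span_length` are hypothetical helpers, not part of CuGenDB.

```python
import re

# Hypothetical helper: split a "seqid:start..end" locus string into its parts.
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")

def parse_location(location: str) -> tuple[str, int, int]:
    match = _LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])

def span_length(start: int, end: int) -> int:
    # Assumes 1-based, inclusive coordinates (not confirmed by the page).
    return abs(end - start) + 1

seqid, start, end = parse_location("Cmo_Chr19:1308125..1310965")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 2841 bp on Cmo_Chr19.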


Homology
GenBank top hits (e value, % identity, alignment)
KAG6571601.1 UDP-glycosyltransferase 74G1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.0e-259; % identity: 94.42)
Query:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
        MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPF VETISDGHDAGGF SAVSISDYHDR KHDGSQT
Subjt:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT

Query:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD
        LRDLIRRKEDAGRR DAVVYDG MPWV DVGKEFGLRTVAYFTQSC VNNIYYHVYR EIK PVV  A EEIRIAGMPPLTTADMPS +QV NPYP FSD
Subjt:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD

Query:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME
        VVINQFGNVEEADWLVCNSFY+LEHQVLEGM+NK NMKTIGPNIPSFYTDKRID+DRDYGFNLFKMNNEVCQKWLDARPKASVVF AFGS+AALSVEQME
Subjt:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME

Query:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
        EL  GLAQSNSFFLWVVRETEAAK+PAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
Subjt:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV

Query:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV
        RAPLDDAGIVRRTTVADCILKVMDD+GGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV
Subjt:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV

KAG6571602.1 UDP-glycosyltransferase 74F2, partial [Cucurbita argyrosperma subsp. sororia] (e value: 9.4e-269; % identity: 97.21)
Query:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
        MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
Subjt:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT

Query:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD
        LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYR EIKAPVVAAA EEIRIAGMPPLT ADMPSLVQVGNPYP FSD
Subjt:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD

Query:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME
        +VINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAAL+VEQME
Subjt:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME

Query:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
        EL WGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
Subjt:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV

Query:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV
        RAPL DAGIVRRTTVADCIL+VMDD+ GTEIRKNAAKWGELAK+AVD DGSSDRTVDEILAQLVSV
Subjt:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV

XP_022963519.1 UDP-glycosyltransferase 74G1-like [Cucurbita moschata] (e value: 1.9e-277; % identity: 100)
Query:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
        MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
Subjt:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT

Query:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD
        LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD
Subjt:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD

Query:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME
        VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME
Subjt:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME

Query:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
        ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
Subjt:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV

Query:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV
        RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV
Subjt:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV

XP_022967252.1 UDP-glycosyltransferase 74G1-like [Cucurbita maxima] (e value: 3.8e-262; % identity: 95.28)
Query:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
        MADGE K GNN KKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
Subjt:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT

Query:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD
        LRDLIRRKE+AGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYR EIKAPVV AA EEIRIAGMPPLT ADMPSLVQVGNPYP FSD
Subjt:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD

Query:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME
        +VINQFGNVEEADWL+CNSFYELEHQVLEGMENKWNMK IGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKA+VVFVAFGSYAALSVEQME
Subjt:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME

Query:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
        EL WGL Q+N FFLWVVRETEAAKIPAKFAEA ADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
Subjt:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV

Query:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV
        RAPLDDAGIVRRTTVADCILKVMDD+GGTEIRKNAAKW ELAK+AVD DGSSDRTVDEILAQLVSV
Subjt:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV

XP_023554034.1 UDP-glycosyltransferase 74G1-like isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.2e-263; % identity: 95.49)
Query:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
        MADGEGK GNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
Subjt:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT

Query:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD
        LRDLIRRKEDAGRRFDAVVYDGFMPW+LDVGKEFGLRTVAYFTQSCAVNNIYYHVYR EIKAPVVAAA EEIRIAGMP LT ADMPSLVQVGNPYP FSD
Subjt:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD

Query:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME
        +VINQFGNVEEADWLVCNSFYELEHQVLEGMENKW MK IGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLD  PKASVVFVAFGSYAALSVEQME
Subjt:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME

Query:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
        EL WGLAQSNSFFLWVVRETEAAKIPAKFAEATA RGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
Subjt:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV

Query:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV
        RAPLDDAGIVRRT+VADCILKVMDD+GGT+IRKNAAKW ELAK+AVD DGSSDRTVDEILAQLV V
Subjt:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV
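
In these alignments the middle line of each block shows the residue letter where the two sequences are identical, `+` for a conservative substitution, and a space elsewhere. This is the standard BLAST pairwise convention and is assumed to apply here. A minimal Python sketch of tallying those columns for one block follows; `midline_stats` is a hypothetical helper. The %identity values reported on this page are computed over a whole alignment, so a single block will not reproduce them.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, columns) for one alignment block."""
    columns = len(query)
    # Trailing spaces of the midline are often lost in copied text; restore them.
    midline = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, columns

# Last block of the KAG6571601.1 alignment, copied from this page.
query   = "RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV"
midline = "RAPLDDAGIVRRTTVADCILKVMDD+GGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV"

identical, positives, columns = midline_stats(query, midline)
print(f"{identical}/{columns} identical, {positives}/{columns} positives")
```

Summing the three counts over every block of a hit, then dividing identical by columns, gives a whole-alignment identity that can be compared with the reported figure.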

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LCG4 Glycosyltransferase (e value: 7.2e-198; % identity: 70.6)
Query:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
        MADG+ K+ +NGK+V++LVVTYPAQGHINPLLQFSKRLHHKGAAVTFV+TK+L+NN P +D+PPPFPVET SD HD GGFLSAVS+ DYH RL+  GS+T
Subjt:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT

Query:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD
        +RDLIRR E+ GRR DAV+YDGFMPWVL+V KE+GL+T  YFTQ C VNNIY+H+Y+ EIK P+     EEIR+ GMP L   +MPS V+     P F  
Subjt:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD

Query:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME
         V+NQF N+EEADWL+CNSFYE E QVLE ME +W MKT+GPNIPS Y D++I +DR+YGFN FK  +E C+KWLD R KASVVFVAFGS++ LS+EQME
Subjt:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME

Query:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
        EL WGLAQ+N FFLWVVR+ E AK+P KF EAT ++GL+VPWC QL+VLSHESIGCFVTH GWNSTLEALTIGVPMVAMPQWTDQT NAKFV D+WKTG+
Subjt:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV

Query:  RAPLDDAGIVRRTTVADCILKVMDDN-GGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVS
        RA  D  GIVRR T+A+CILK+MDDN GG EIRKNAAKWG LA+QAV+  GSSDR VDE L QL S
Subjt:  RAPLDDAGIVRRTTVADCILKVMDDN-GGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVS

A0A5D3DQW9 Glycosyltransferase (e value: 4.4e-195; % identity: 69.53)
Query:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
        MADGEGK  NNGKKV++LVVTYPAQGHINPLLQFSKRLHHKGAAVTFV+TK+L+NN P +D+PPPFPVET SD HD GGFLS+VS+ DYH+RL+  GS++
Subjt:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT

Query:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD
        LRDLIRR E  GRR DAV+YDGFMPWVL+V KE+GL+TV YFTQ C VNNIY+H+Y+ EIK  ++    EEIR+ GMP +   +MPS V+     P F  
Subjt:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD

Query:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME
         V+NQF N+EEAD+L+CNSFYE E QVLE ME KW MKT+GPNIPS Y D++I +DR+YGFN FK N+E C+KWL+ R K SVVFVAFGS++ LS EQME
Subjt:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME

Query:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
        EL WGLAQ+N FFLWVVR+ + A +P KF EAT ++GL+VPWC QL+VLSHESIGCFVTH GWNSTLEALTIGVPMVAMPQWTDQT NAKFV D+WKTG+
Subjt:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV

Query:  RAPLDDAGIVRRTTVADCILKVMDDN-GGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVS
        RA  D  GIVRR T+ADCILK+MDDN  G EIR+NAAKWG LA++AVD  GSSD+ ++E+L QL S
Subjt:  RAPLDDAGIVRRTTVADCILKVMDDN-GGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVS

A0A6J1DX00 Glycosyltransferase (e value: 1.9e-198; % identity: 72.96)
Query:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQ-PNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQ
        MADG     +NG+++++LVVTYPAQGHINPLLQFSKRLHHKGAAVTFV++KFLFNN    + HPPPFPVETISDGHD GGFLSA SI  YH+  +  GS+
Subjt:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQ-PNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQ

Query:  TLRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFS
        TLRDLIRR   AGRR  A+ YDGF+PWVLDV KEFGL+T  YFTQSCAVNNIYYHVYR EIK P    A EEIRIAGMPPLT ADMPS VQ  NPYP F 
Subjt:  TLRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFS

Query:  DVVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQM
        DVVINQF N+EEADW+VCNSFYELE QVLE M  KW MK IGPNIPS YTD+RID D +YGF+LF    EVC+KWLD R KASVVFV+FGS+AALSVE+M
Subjt:  DVVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQM

Query:  EELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTG
        EEL WGL Q+N +FLWVVR  E  K+P KFAE TA++GLLV WC QL++LSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAK + DIWKTG
Subjt:  EELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTG

Query:  VRAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVS
        +RAP D++GIVRR  VA+CI ++M+ +GG EIR+NAAKWG LA+QAV   GSSD  VD+ILA L S
Subjt:  VRAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVS

A0A6J1HKE5 Glycosyltransferase (e value: 9.2e-278; % identity: 100)
Query:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
        MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
Subjt:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT

Query:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD
        LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD
Subjt:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD

Query:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME
        VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME
Subjt:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME

Query:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
        ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
Subjt:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV

Query:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV
        RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV
Subjt:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV

A0A6J1HQB3 Glycosyltransferase (e value: 1.9e-262; % identity: 95.28)
Query:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
        MADGE K GNN KKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT
Subjt:  MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQT

Query:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD
        LRDLIRRKE+AGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYR EIKAPVV AA EEIRIAGMPPLT ADMPSLVQVGNPYP FSD
Subjt:  LRDLIRRKEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSD

Query:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME
        +VINQFGNVEEADWL+CNSFYELEHQVLEGMENKWNMK IGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKA+VVFVAFGSYAALSVEQME
Subjt:  VVINQFGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQME

Query:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
        EL WGL Q+N FFLWVVRETEAAKIPAKFAEA ADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV
Subjt:  ELTWGLAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGV

Query:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV
        RAPLDDAGIVRRTTVADCILKVMDD+GGTEIRKNAAKW ELAK+AVD DGSSDRTVDEILAQLVSV
Subjt:  RAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVSV

SwissProt top hits (e value, % identity, alignment)
K7NBW3 Mogroside IE synthase (e value: 4.2e-110; % identity: 42.67)
Query:  NLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNN-QPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKEDAGRR
        ++LV  +P+QGHINPLLQ SKRL  KG  V+ V T  + N+ Q    +     +E ISDG +    L   ++    DR +   ++ L D +++   +   
Subjt:  NLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNN-QPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKEDAGRR

Query:  FDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFGNVEEADW
           ++YD  MPWVL+V KEFGL    ++TQSCA+N+I YHV   ++K P        I +  MP L  +D+P+            D++ +Q+ N+++A+ 
Subjt:  FDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFGNVEEADW

Query:  LVCNSFYELEHQVLEGMENKWN-MKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLAQSNSFF
        L CN+F +LE ++++ ME     +KT+GP +PS Y DKR++ND+ YG +LFK N +VC KWLD++P  SV++V++GS   +  EQ++EL  G+ ++  FF
Subjt:  LVCNSFYELEHQVLEGMENKWN-MKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLAQSNSFF

Query:  LWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLDDAGIVRRT
        LWVVR+TEA K+P  F E+ A++GL+V WC QL+VL+H S+GCF THCGWNSTLEAL +GVP+VA PQW DQ TNAKF+ D+WK G R   ++  +  + 
Subjt:  LWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLDDAGIVRRT

Query:  TVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQL
         V  CI +VM+    +E + N+ +W + AK+AVD  GSSD+ ++E +A L
Subjt:  TVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQL

O22822 UDP-glycosyltransferase 74F2 (e value: 5.4e-110; % identity: 45.18)
Query:  KKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKEDAG
        K+ ++L V YP QGHI P  QF KRLH KG   T  +T F+FN+  N D   P  + TISDG+D GGF +A SI DY    K  GS+T+ D+I++ + + 
Subjt:  KKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKEDAG

Query:  RRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFGNVEEA
             +VYD F+PW LDV +EFGL    +FTQ CAVN +YY  Y        +   + ++ I  +P L   D+PS   V   YP + ++V+ QF N E+A
Subjt:  RRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFGNVEEA

Query:  DWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNE-VCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLAQSNS
        D+++ NSF ELE    E       + TIGP IPS Y D+RI +D  Y  NLF+  ++  C  WLD RP+ SVV+VAFGS A L+  QMEEL    A SN 
Subjt:  DWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNE-VCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLAQSNS

Query:  FFLWVVRETEAAKIPAKFAE-ATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD-DAGI
         FLWVVR +E  K+P+ F E    ++ L++ W  QL VLS+++IGCF+THCGWNST+EALT GVPMVAMPQWTDQ  NAK++ D+WK GVR   + ++GI
Subjt:  FFLWVVRETEAAKIPAKFAE-ATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD-DAGI

Query:  VRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVS
         +R  +   I +VM+     E++KN  KW +LA ++++  GS+D  +D  ++++ S
Subjt:  VRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVS

Q6VAA6 UDP-glycosyltransferase 74G1 (e value: 2.4e-118; % identity: 45.97)
Query:  KKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVE--TISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKED
        K  ++L++ +P QGHINP +QF KRL  KG   T V T    N+  N  +     +E   ISDG D GGF+SA     Y +  K  GS++L DLI++ + 
Subjt:  KKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVE--TISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKED

Query:  AGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLV----QVGNPYPEFSDVVINQF
         G   DA++YD    WVLDV  EFG+   ++FTQ+C VN++YYHV++  I  P+     E + + G P L   + P ++    Q+ +P+   S ++  QF
Subjt:  AGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLV----QVGNPYPEFSDVVINQF

Query:  GNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGL
         N+++A W+  NSFY+LE +V+E     WN+K IGP +PS Y DKR+D+D+D GFNL+K N+  C  WLD +PK SVV+VAFGS      EQ+EE+T  L
Subjt:  GNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGL

Query:  AQSNSFFLWVVRETEAAKIPAKFAEA-TADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD
          S+  FLWV++  E  K+P   +E     +GL+V WC+QLDVL+HES+GCFVTHCG+NSTLEA+++GVP+VAMPQ++DQTTNAK + +I   GVR   D
Subjt:  AQSNSFFLWVVRETEAAKIPAKFAEA-TADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD

Query:  DAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLV
        + GIVRR  +A CI  +M++  G  IRKNA KW +LAK AV   GSSD  + E +++L+
Subjt:  DAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLV

Q6X1C0 Crocetin glucosyltransferase 2 (e value: 1.4e-105; % identity: 43.76)
Query:  NGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKED
        NG K ++L++  PAQGHINP+LQF KRL       T V T+FL N+      P P  ++ ISDG D GG  +A S   Y DR +    Q    LI     
Subjt:  NGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKED

Query:  AGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPY-----PEFSDVVINQ
         GR            W ++V +  GLR+VA+FTQ CAV+ IY HV+   IK PV    AE +R+ G+PPL  +D+P    V N +     P+   + +NQ
Subjt:  AGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPY-----PEFSDVVINQ

Query:  FGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWG
          N+++AD +  NS YELE  +L+G      +K+IGP +PS Y D RI +D  YGFNL+  +      WLD++   SV++V+FGS ++LS +Q  E+  G
Subjt:  FGNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWG

Query:  LAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD
        L  +N  F+WVVR +E AK+PA F +  A RGL+V WC QLD+L+H + GCFVTHCGWNST+E + +GVPMV +PQW+DQ  NAK+V D+WK GVRA   
Subjt:  LAQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD

Query:  DAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQ
            VR      C+ +VMD     +IR+NAA+W +LAK +V   GSSD+ + E + Q
Subjt:  DAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQ

Q9SYK9 UDP-glycosyltransferase 74E2 (e value: 1.1e-105; % identity: 42.2)
Query:  NLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHP-PPFPVE-------TISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRR
        +L+V+ +P QGHI P+ QF KRL  KG  +T V+          SD P PP+  E        IS+G   G       + DY +R++     TL  L+  
Subjt:  NLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHP-PPFPVE-------TISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRR

Query:  KEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEI-RIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQF
         + +G    A+VYD  MPW+LDV   +GL    +FTQ   V  IYYHV++     P        +      P LT  D+PS +   + YP    +V++Q 
Subjt:  KEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEI-RIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQF

Query:  GNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGL
         N++  D ++CN+F +LE ++L+ +++ W +  IGP +PS Y DKR+  D++YGF+LF      C +WL+++   SVV+++FGS   L  +QM EL  GL
Subjt:  GNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGL

Query:  AQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLDD
         QS  FFLWVVRETE  K+P  + E   ++GL+V W  QLDVL+H+SIGCF+THCGWNSTLE L++GVPM+ MP WTDQ TNAKF+ D+WK GVR   + 
Subjt:  AQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLDD

Query:  AGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILA
         G VRR  +   + +VM+   G EIRKNA KW  LA++AV   GSSD++++E ++
Subjt:  AGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILA

Arabidopsis top hits (e value, % identity, alignment)
AT1G05680.1 Uridine diphosphate glycosyltransferase 74E2 (e value: 7.5e-107; % identity: 42.2)
Query:  NLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHP-PPFPVE-------TISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRR
        +L+V+ +P QGHI P+ QF KRL  KG  +T V+          SD P PP+  E        IS+G   G       + DY +R++     TL  L+  
Subjt:  NLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHP-PPFPVE-------TISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRR

Query:  KEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEI-RIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQF
         + +G    A+VYD  MPW+LDV   +GL    +FTQ   V  IYYHV++     P        +      P LT  D+PS +   + YP    +V++Q 
Subjt:  KEDAGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEI-RIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQF

Query:  GNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGL
         N++  D ++CN+F +LE ++L+ +++ W +  IGP +PS Y DKR+  D++YGF+LF      C +WL+++   SVV+++FGS   L  +QM EL  GL
Subjt:  GNVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGL

Query:  AQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLDD
         QS  FFLWVVRETE  K+P  + E   ++GL+V W  QLDVL+H+SIGCF+THCGWNSTLE L++GVPM+ MP WTDQ TNAKF+ D+WK GVR   + 
Subjt:  AQSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLDD

Query:  AGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILA
         G VRR  +   + +VM+   G EIRKNA KW  LA++AV   GSSD++++E ++
Subjt:  AGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILA

AT2G31750.1 UDP-glucosyl transferase 74D1 (e value: 6.0e-104; % identity: 43.36)
Query:  KVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNN-----QPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRK
        K N+LV ++P QGHINPLLQFSKRL  K   VTF+ T    N+             P     I DG +     S  +  DY  + + + S++L +LI   
Subjt:  KVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNN-----QPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRK

Query:  EDAGRRFDAVVYDGFMPWVLDV-GKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFG
        +    + +AVVYD  +P+VLDV  K  G+   ++FTQS  VN  Y H  R E K         ++ +  MPPL   D+P  +   N      +++ +QF 
Subjt:  EDAGRRFDAVVYDGFMPWVLDV-GKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFG

Query:  NVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLA
        NV++ D+ + NSF ELE +VL+ M+N+W +K IGP IPS Y DKR+  D+DYG NLF      C  WLD++P  SV++V+FGS A L  +QM E+  GL 
Subjt:  NVEEADWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLA

Query:  QSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLDDA
        Q+   FLWVVRETE  K+P+ + E   D+GL+V W  QL VL+H+SIGCF+THCGWNSTLEAL++GV ++ MP ++DQ TNAKF+ D+WK GVR   D  
Subjt:  QSNSFFLWVVRETEAAKIPAKFAEATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLDDA

Query:  GIVRRTTVADCILKVMDD--NGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLV
        G V +  +  C+ +VM+D    G EIRKNA +  E A++A+   G+SD+ +DE +A++V
Subjt:  GIVRRTTVADCILKVMDD--NGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLV

AT2G43820.1 UDP-glucosyltransferase 74F2 (e value: 3.9e-111; % identity: 45.18)
Query:  KKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKEDAG
        K+ ++L V YP QGHI P  QF KRLH KG   T  +T F+FN+  N D   P  + TISDG+D GGF +A SI DY    K  GS+T+ D+I++ + + 
Subjt:  KKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKEDAG

Query:  RRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFGNVEEA
             +VYD F+PW LDV +EFGL    +FTQ CAVN +YY  Y        +   + ++ I  +P L   D+PS   V   YP + ++V+ QF N E+A
Subjt:  RRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFGNVEEA

Query:  DWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNE-VCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLAQSNS
        D+++ NSF ELE    E       + TIGP IPS Y D+RI +D  Y  NLF+  ++  C  WLD RP+ SVV+VAFGS A L+  QMEEL    A SN 
Subjt:  DWLVCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNE-VCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLAQSNS

Query:  FFLWVVRETEAAKIPAKFAE-ATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD-DAGI
         FLWVVR +E  K+P+ F E    ++ L++ W  QL VLS+++IGCF+THCGWNST+EALT GVPMVAMPQWTDQ  NAK++ D+WK GVR   + ++GI
Subjt:  FFLWVVRETEAAKIPAKFAE-ATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD-DAGI

Query:  VRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVS
         +R  +   I +VM+     E++KN  KW +LA ++++  GS+D  +D  ++++ S
Subjt:  VRRTTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQLVS

AT2G43840.1 UDP-glycosyltransferase 74 F1 (e value: 2.2e-106; % identity: 42.79)
Query:  NLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKEDAGRRF
        ++L V +P+QGHI P+ QF KRLH KG   T  +T F+FN   + D   P  + TISDG+D GGF SA S+ +Y    K  GS+T+ D+IR+ +      
Subjt:  NLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKEDAGRRF

Query:  DAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFGNVEEADWL
          +VYD FMPW LD+  +FGL    +FTQSCAVN I Y  Y        +   +  + I  +P L   D+P+ V     +  + ++V+ QF N ++AD++
Subjt:  DAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFGNVEEADWL

Query:  VCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNN-EVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLAQSNSFFL
        + NSF++L+    E +     + TIGP +PS Y D++I +D DY  NLF +    +C  WLD RP+ SVV++AFGS A LS EQMEE+    A SN  +L
Subjt:  VCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNN-EVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLAQSNSFFL

Query:  WVVRETEAAKIPAKFAEAT-ADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD-DAGIVRR
        WVVR +E +K+P  F E    D+ L++ W  QL VLS+++IGCF+THCGWNST+E L++GVPMVAMPQWTDQ  NAK++ D+WK GVR   + ++GI +R
Subjt:  WVVRETEAAKIPAKFAEAT-ADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD-DAGIVRR

Query:  TTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQL
          +   I +VM+     E+++NA KW +LA +++   GS+D  ++E ++++
Subjt:  TTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQL

AT2G43840.2 UDP-glycosyltransferase 74 F1 (e value: 3.4e-107; % identity: 43.02)
Query:  NLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKEDAGRRF
        ++L V +P+QGHI P+ QF KRLH KG   T  +T F+FN   + D   P  + TISDG+D GGF SA S+ +Y    K  GS+T+ D+IR+ +      
Subjt:  NLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKEDAGRRF

Query:  DAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFGNVEEADWL
          +VYD FMPW LD+  +FGL    +FTQSCAVN I Y  Y        +   +  + I  +P L   D+P+ V     +  + ++V+ QF N ++AD++
Subjt:  DAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFGNVEEADWL

Query:  VCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNN-EVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLAQSNSFFL
        + NSF++L+  V E +     + TIGP +PS Y D++I +D DY  NLF +    +C  WLD RP+ SVV++AFGS A LS EQMEE+    A SN  +L
Subjt:  VCNSFYELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNN-EVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLAQSNSFFL

Query:  WVVRETEAAKIPAKFAEAT-ADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD-DAGIVRR
        WVVR +E +K+P  F E    D+ L++ W  QL VLS+++IGCF+THCGWNST+E L++GVPMVAMPQWTDQ  NAK++ D+WK GVR   + ++GI +R
Subjt:  WVVRETEAAKIPAKFAEAT-ADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLD-DAGIVRR

Query:  TTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQL
          +   I +VM+     E+++NA KW +LA +++   GS+D  ++E ++++
Subjt:  TTVADCILKVMDDNGGTEIRKNAAKWGELAKQAVDCDGSSDRTVDEILAQL


Sequences
CDS sequence
ATGGCGGACGGTGAGGGCAAAATCGGCAACAATGGGAAAAAAGTTAATCTTCTGGTGGTTACATACCCAGCTCAAGGTCACATAAATCCCCTTTTACAATTCTCCAAACG
TCTCCATCACAAGGGCGCCGCCGTGACCTTCGTCGTCACCAAATTCCTTTTCAATAACCAACCCAACTCCGACCACCCTCCGCCGTTTCCCGTCGAGACCATCTCCGACG
GCCACGACGCCGGCGGATTTCTCTCCGCCGTTTCCATCTCCGACTATCACGACCGCCTCAAGCACGACGGGTCACAGACTCTCCGGGATCTGATTCGGCGGAAGGAGGAC
GCCGGCCGCCGGTTCGACGCCGTGGTGTACGACGGATTTATGCCGTGGGTTTTGGATGTGGGGAAAGAATTTGGGCTGAGGACGGTGGCGTATTTTACACAGTCCTGCGC
TGTGAATAACATTTATTACCATGTTTACAGGGAAGAAATTAAGGCGCCGGTGGTGGCGGCGGCGGCGGAGGAGATTCGGATCGCCGGAATGCCGCCGCTGACGACGGCGG
ATATGCCGTCGCTTGTCCAGGTTGGGAACCCATATCCGGAATTCTCTGATGTGGTGATTAATCAGTTTGGAAACGTGGAGGAAGCCGATTGGTTAGTCTGCAACAGTTTC
TACGAACTAGAACACCAGGTGCTAGAAGGGATGGAGAATAAATGGAACATGAAGACGATCGGACCGAACATACCATCGTTCTACACCGACAAACGAATAGACAACGATAG
GGACTATGGGTTCAACCTTTTCAAAATGAACAATGAGGTTTGCCAAAAATGGCTAGACGCCCGTCCAAAAGCGTCAGTAGTTTTTGTAGCGTTCGGGAGCTATGCAGCTT
TGAGCGTCGAGCAAATGGAGGAATTGACTTGGGGTTTGGCACAAAGCAATTCCTTTTTCTTGTGGGTGGTAAGGGAGACAGAGGCGGCGAAGATTCCAGCAAAATTTGCA
GAAGCGACGGCGGACAGAGGGCTATTGGTCCCTTGGTGCCGTCAATTGGATGTTTTATCACACGAATCGATCGGTTGCTTTGTGACGCATTGCGGCTGGAACTCGACGCT
GGAGGCTTTGACCATCGGCGTTCCCATGGTGGCGATGCCGCAGTGGACAGATCAAACCACCAATGCCAAGTTTGTAACGGACATTTGGAAGACCGGAGTTAGGGCTCCGC
TCGACGATGCCGGAATAGTGCGGCGGACAACGGTTGCAGATTGCATCTTGAAGGTCATGGATGACAACGGAGGAACGGAGATTCGGAAAAACGCCGCTAAATGGGGAGAG
TTAGCGAAACAGGCGGTGGATTGCGACGGAAGTTCCGATCGTACTGTTGATGAGATTCTTGCCCAATTGGTTTCTGTTTGA
mRNA sequence
CAATTCTCATTGTATCCCATTCATAAAGACCTCAAGATTGTGAAAGAGGGAGAGAAAAGCTTCGGGCCCATTTCCATGGCGGACGGTGAGGGCAAAATCGGCAACAATGG
GAAAAAAGTTAATCTTCTGGTGGTTACATACCCAGCTCAAGGTCACATAAATCCCCTTTTACAATTCTCCAAACGTCTCCATCACAAGGGCGCCGCCGTGACCTTCGTCG
TCACCAAATTCCTTTTCAATAACCAACCCAACTCCGACCACCCTCCGCCGTTTCCCGTCGAGACCATCTCCGACGGCCACGACGCCGGCGGATTTCTCTCCGCCGTTTCC
ATCTCCGACTATCACGACCGCCTCAAGCACGACGGGTCACAGACTCTCCGGGATCTGATTCGGCGGAAGGAGGACGCCGGCCGCCGGTTCGACGCCGTGGTGTACGACGG
ATTTATGCCGTGGGTTTTGGATGTGGGGAAAGAATTTGGGCTGAGGACGGTGGCGTATTTTACACAGTCCTGCGCTGTGAATAACATTTATTACCATGTTTACAGGGAAG
AAATTAAGGCGCCGGTGGTGGCGGCGGCGGCGGAGGAGATTCGGATCGCCGGAATGCCGCCGCTGACGACGGCGGATATGCCGTCGCTTGTCCAGGTTGGGAACCCATAT
CCGGAATTCTCTGATGTGGTGATTAATCAGTTTGGAAACGTGGAGGAAGCCGATTGGTTAGTCTGCAACAGTTTCTACGAACTAGAACACCAGGTGCTAGAAGGGATGGA
GAATAAATGGAACATGAAGACGATCGGACCGAACATACCATCGTTCTACACCGACAAACGAATAGACAACGATAGGGACTATGGGTTCAACCTTTTCAAAATGAACAATG
AGGTTTGCCAAAAATGGCTAGACGCCCGTCCAAAAGCGTCAGTAGTTTTTGTAGCGTTCGGGAGCTATGCAGCTTTGAGCGTCGAGCAAATGGAGGAATTGACTTGGGGT
TTGGCACAAAGCAATTCCTTTTTCTTGTGGGTGGTAAGGGAGACAGAGGCGGCGAAGATTCCAGCAAAATTTGCAGAAGCGACGGCGGACAGAGGGCTATTGGTCCCTTG
GTGCCGTCAATTGGATGTTTTATCACACGAATCGATCGGTTGCTTTGTGACGCATTGCGGCTGGAACTCGACGCTGGAGGCTTTGACCATCGGCGTTCCCATGGTGGCGA
TGCCGCAGTGGACAGATCAAACCACCAATGCCAAGTTTGTAACGGACATTTGGAAGACCGGAGTTAGGGCTCCGCTCGACGATGCCGGAATAGTGCGGCGGACAACGGTT
GCAGATTGCATCTTGAAGGTCATGGATGACAACGGAGGAACGGAGATTCGGAAAAACGCCGCTAAATGGGGAGAGTTAGCGAAACAGGCGGTGGATTGCGACGGAAGTTC
CGATCGTACTGTTGATGAGATTCTTGCCCAATTGGTTTCTGTTTGAATATTTAAATTTAATTATTTTTATTTTTATTTTTATTTTCTTTTGGTCTATTTTCAAAATGTTT
GTTACCATTGAACCTCTGCCCGTCATGTGTTCTTGTGTATGATGACACCCGTGGCGTGGCCCTTCATGTGTCTTTGTGTATGATGACATGTGTCCACCCTTCATGTGTCC
TTGTGTATGATGACATGTGTCCCGTGGCCCTTCATGTGTTCTTGTGTATGATGA
Protein sequence
MADGEGKIGNNGKKVNLLVVTYPAQGHINPLLQFSKRLHHKGAAVTFVVTKFLFNNQPNSDHPPPFPVETISDGHDAGGFLSAVSISDYHDRLKHDGSQTLRDLIRRKED
AGRRFDAVVYDGFMPWVLDVGKEFGLRTVAYFTQSCAVNNIYYHVYREEIKAPVVAAAAEEIRIAGMPPLTTADMPSLVQVGNPYPEFSDVVINQFGNVEEADWLVCNSF
YELEHQVLEGMENKWNMKTIGPNIPSFYTDKRIDNDRDYGFNLFKMNNEVCQKWLDARPKASVVFVAFGSYAALSVEQMEELTWGLAQSNSFFLWVVRETEAAKIPAKFA
EATADRGLLVPWCRQLDVLSHESIGCFVTHCGWNSTLEALTIGVPMVAMPQWTDQTTNAKFVTDIWKTGVRAPLDDAGIVRRTTVADCILKVMDDNGGTEIRKNAAKWGE
LAKQAVDCDGSSDRTVDEILAQLVSV
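
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear codon table applies to this gene; the function name `translate` and the table construction are illustrative and not part of CuGenDB. Only the first and last codons of the CDS shown above are used as inputs.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    as wrapped lines can be passed in directly. Codons containing
    characters outside A/C/G/T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGGCGGACGGTGAGGGCAAAATCGGC"))
# Last six codons of the CDS listed above, including the TGA stop.
print(translate("CAATTGGTTTCTGTTTGA"))
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence, after removing line breaks from both, is one way to confirm that the two records agree.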