; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G080470 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G080470
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionUDP-glycosyltransferase 91A1-like
Genome locationCiama_Chr05:1452304..1457047
RNA-Seq ExpressionCaUC05G080470
SyntenyCaUC05G080470
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0004650 - polygalacturonase activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
InterPro domainsIPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5009163.1 hypothetical protein JHK87_017678 [Glycine soja]4.5e-25452.56Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAEN  + V + PWSAFGHLIP F+LSIALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEATVDIPF K  +LK ALD  
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND
        + + ++F+A+    PDW I DFN  W+ DI++EF++ ++ F +LS     F     GT       E+L +PP      S+VA+R +EA     GF + N 
Subjt:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND

Query:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIA
        SG+SD +R  KI  AS+A+  RSC E + +YL  Y     K +IP+GLLP E+        D     IF+WLD+Q  +SVVFVGFGSE KLSKDQ+ EIA
Subjt:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIA

Query:  RGVELSELPFLWALRKPEW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGV
         G+E S+LPFLWALRKP W + D   LPVGF +RT+ RG V  GW PQ+EIL H +IGGSLFH GWGS IE LQFG+ LVLLPF ++QPLNAR LV+K +
Subjt:  RGVELSELPFLWALRKPEW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGV

Query:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLIPYFQLAI
        A+EV+R  EDGSF    IA +LR+AMV EEG+KIR   REAAAI G+ KLHQ +                        + V + PWSAFGHLIP+F+L+I
Subjt:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLIPYFQLAI

Query:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIF
        ALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEAT+DIPF K   LK A D  +   ++F+A+    PDW I DFN  W+ 
Subjt:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIF

Query:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFD
        DI++EF++  + F +IS+     +        PLS   SL +PP      S+VA+R+HEA    AG ++++ SG+SD +R+ K+  AS+A+  RSC E +
Subjt:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFD

Query:  VDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AEDSDPLP
         +Y   +     K VIP+GLLP E+        D    + FEWLD+Q  +SVVFVGFGSE K +KDQ+ EIA G+E S+LPFLWALRKP W + D   LP
Subjt:  VDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AEDSDPLP

Query:  VGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMIS
        VGF +RT+ RG V  GW PQ+EIL H +IGGSLFH G GS IE LQFGH LV+LPF +DQPL AR LV+KG+AIEV+R  EDGSF+   I  +LR+ M+ 
Subjt:  VGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMIS

Query:  EEGEKIRKKAKEIGAIFGDTKLHQ-QYIEEFVEFLKKG
        EEG+KIR   +E  AI G+ KLHQ  Y+  FV+FLK G
Subjt:  EEGEKIRKKAKEIGAIFGDTKLHQ-QYIEEFVEFLKKG

KAG5036958.1 hypothetical protein JHK86_017798 [Glycine max]4.5e-25452.56Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAEN  + V + PWSAFGHLIP F+LSIALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEATVDIPF K  +LK ALD  
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND
        + + ++F+A+    PDW I DFN  W+ DI++EF++ ++ F +LS     F     GT       E+L +PP      S+VA+R +EA     GF + N 
Subjt:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND

Query:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIA
        SG+SD +R  KI  AS+A+  RSC E + +YL  Y     K +IP+GLLP E+        D     IF+WLD+Q  +SVVFVGFGSE KLSKDQ+ EIA
Subjt:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIA

Query:  RGVELSELPFLWALRKPEW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGV
         G+E S+LPFLWALRKP W + D   LPVGF +RT+ RG V  GW PQ+EIL H +IGGSLFH GWGS IE LQFG+ LVLLPF ++QPLNAR LV+K +
Subjt:  RGVELSELPFLWALRKPEW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGV

Query:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLIPYFQLAI
        A+EV+R  EDGSF    IA +LR+AMV EEG+KIR   REAAAI G+ KLHQ +                        + V + PWSAFGHLIP+F+L+I
Subjt:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLIPYFQLAI

Query:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIF
        ALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEAT+DIPF K   LK A D  +   ++F+A+    PDW I DFN  W+ 
Subjt:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIF

Query:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFD
        DI++EF++  + F +IS+     +        PLS   SL +PP      S+VA+R+HEA    AG ++++ SG+SD +R+ K+  AS+A+  RSC E +
Subjt:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFD

Query:  VDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AEDSDPLP
         +Y   +     K VIP+GLLP E+        D    + FEWLD+Q  +SVVFVGFGSE K +KDQ+ EIA G+E S+LPFLWALRKP W + D   LP
Subjt:  VDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AEDSDPLP

Query:  VGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMIS
        VGF +RT+ RG V  GW PQ+EIL H +IGGSLFH G GS IE LQFGH LV+LPF +DQPL AR LV+KG+AIEV+R  EDGSF+   I  +LR+ M+ 
Subjt:  VGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMIS

Query:  EEGEKIRKKAKEIGAIFGDTKLHQ-QYIEEFVEFLKKG
        EEG+KIR   +E  AI G+ KLHQ  Y+  FV+FLK G
Subjt:  EEGEKIRKKAKEIGAIFGDTKLHQ-QYIEEFVEFLKKG

RDX84482.1 putative UDP-rhamnose:rhamnosyltransferase 1, partial [Mucuna pruriens]7.9e-26754.08Show/hide
Query:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFRK
        +H+V+ PWSAFGHLIP F+LSIALAKAGV VSF+STPKN+QRLP +P +L+  +  V  PLP L  + LPEGAEAT+DIPF KI +LKLA D  + + +K
Subjt:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFRK

Query:  FIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKNDSGMSDR
         +      P+W I DF+  W+ DI+ EF++ ++F+ V S   + FF        PLS  E+L  PP      S+VAY+R+EA     G    N SG+SD 
Subjt:  FIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKNDSGMSDR

Query:  DRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIARGVELS
        +R TK++ AS+A+  RSC E + +YL  +    GK VIP+GLLP ++ ++     D     IF+WLD+Q  +SVVFVGFGSECKL+KDQ+ EIA G+E S
Subjt:  DRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIARGVELS

Query:  ELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGVAVEVER
        ELPFLWALRKP WA  D D +PVGF +RT  RGIV MGW PQ EIL HP+IGGSLFH GWGSAIEALQFGH LVLLPFI+DQPLNAR LV+KG+A+EV+R
Subjt:  ELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGVAVEVER

Query:  KEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLIPYFQLAIALAKAG
          EDGSF    IA +LR+AMV EEG+ IR   REAA I G+ KLHQ +        +++  ++  KM   K +  ++ PWSAFGHLIP+F+L+IALAKAG
Subjt:  KEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLIPYFQLAIALAKAG

Query:  VHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIFDISREF
        VHVSF+STPKN+QRLP +P +L+  +  V  PL  L  + LPEGAEATVDIPF KI  LKLA D  +   +K + +    P+W I DF+  W+ DI++EF
Subjt:  VHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIFDISREF

Query:  RIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKF
        ++  +F+ V+S+  +  L  V     P+S   SL  PP      S+VAY+RHEA    AG   +N SG+SD +RVTKI+ AS+A+ VRSC E + +Y   
Subjt:  RIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKF

Query:  YSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDR
        +    GK VIP+GLLP ++  +     D    + FEWLD+Q  +SVVFVGFGSE K TKDQ+ EIA G+E SELPFLWALRKP WA  D D +PVGF +R
Subjt:  YSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDR

Query:  TAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMISEEGEKI
        T  RGIV MGW PQ EIL HP+IGGSLFH GWGS IE LQF H LV+LPFI+DQPLNAR LV+KG+AIEV +K EDGSF+   I  +LR+ M+ EEG+ I
Subjt:  TAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMISEEGEKI

Query:  RKKAKEIGAIFGDTKLHQ-QYIEEFVEFLKKG
        R  A E   I G+ KLHQ  YI  FV+FL+ G
Subjt:  RKKAKEIGAIFGDTKLHQ-QYIEEFVEFLKKG

RXI07822.1 hypothetical protein DVH24_009853 [Malus domestica]1.2e-24950.83Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        M ++  L VV+ PWSAFGH++P FQLSIALAKA VH SFISTPKN+QRLP I P L PF+  V IP P L  D LP+GAEATVD+PF     LK A D  
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV-----LGTGLPLSEIENLMSPPPI--DGSTVAYRRYEAAGIRGGF
        +   ++FI D    PDW I DF A W+ +I +E+ +P+V+F V S      F+         T   L  +E+L SPP +    STVA+R  EA  +  GF
Subjt:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV-----LGTGLPLSEIENLMSPPPI--DGSTVAYRRYEAAGIRGGF

Query:  FEKNDSGMSDRDRATKIISASRAI-----AVR---SCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGS
        +  NDSG+SD DR     + +R        +R   +C+EF+ +YL+ Y +  GK +IP GLLPPE P K   G  S    IF+WLD+Q            
Subjt:  FEKNDSGMSDRDRATKIISASRAI-----AVR---SCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGS

Query:  ECKLSKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVD
                                   RKP WA  ++D LP+GF DR +E+G+V  GW PQMEIL HP++GGSLFH GWGS IE LQFGH LV+LPFI+D
Subjt:  ECKLSKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVD

Query:  QPLNARLLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQ-RYIEEFHFHLFLIIILKLTKMADNKV-------L
        QPLNARLL +KG+AVEV+R+  DGSF  + IAK LR AMV EEGE++R  AR+AA +FGD KLHQ  YI +F       +  K TK +  K        L
Subjt:  QPLNARLLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQ-RYIEEFHFHLFLIIILKLTKMADNKV-------L

Query:  HVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKF
         V++ PWS+FGH+IPYFQL++ALAKA VHVSFISTPKN+Q+LP     L  FI  V IP P L    LPEGAEATVD+PF K   LK+A DL + P ++F
Subjt:  HVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKF

Query:  IADHPHPPDWFIVDFNATWIFDISREFRIPTVF---FCVISSGFLALLAHVLGS----GLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMND
        I +    PDW I D+ A W+ DI++E+ IP  +   FC+  S F     ++LG+     LP  E  SL SPP      STVA+R  EA  +  GFF  N 
Subjt:  IADHPHPPDWFIVDFNATWIFDISREFRIPTVF---FCVISSGFLALLAHVLGS----GLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMND

Query:  SGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIA
        SG+SD  R+ KIVS S+ +A+RSC+E + +Y + Y    GK V+  GLLPPEKP +      S     FEWLD+   +SVVFVGFGSECK +K+Q+ EIA
Subjt:  SGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIA

Query:  RGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGV
         G+ELSELPFLWALRKP WA  D+D LPVGF DRT+E+G+V +GW PQMEIL HP+IGGSLFH GWGS IE LQFG  LV LP + DQPLNARLL D+G+
Subjt:  RGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGV

Query:  AIEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQ-QYIEEFVEFLK
        A+EV+R   DGSFS + I K LR  M+ EEGEK+R  A++  A+FGD KLHQ  YI +FV FLK
Subjt:  AIEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQ-QYIEEFVEFLK

XP_022924286.1 uncharacterized protein LOC111431815 [Cucurbita moschata]0.0e+0071.52Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAENK LHVV+ PWSAFGHL+PHFQL+I+LAKAGVHVSFISTP+NL+RLP IPPSLSPFIT V IPLPKLPGDPLPEGAEATVDIPF KIPFLKLALDLA
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPPIDGSTVAYRRYEAAGIRGGFFEKNDSG
        EP FR+F+A+HPH PDW IVDFNATWIC+ISR+F+IPIVF  V SP  LAFFA ++G G    +I +LMSPP IDGS VAYRRYEA  I G  + KNDSG
Subjt:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPPIDGSTVAYRRYEAAGIRGGFFEKNDSG

Query:  MSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIARG
        +SD +R+ KIISA +A  +RSC EFDVDYLK Y+D+ G+KVIP+GLLPPEKPQK+EF ADSPWK  F+WLDQQNP+SVVFVGFGSECKL+KD+IH+IARG
Subjt:  MSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIARG

Query:  VELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGVAVE
        +ELSELPFLW+LRKP+WA DSD LP GF+DRTAERGIVSMGWAPQMEILGHPAIGG  FHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV+KGVAVE
Subjt:  VELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGVAVE

Query:  VERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF-----------------HFHLFLIIILKLTKMADNKVLHVLLFPW
        VERKEEDGSF GE IAKALREAM SEEGEKIR+RA E AAIFGD KLHQRYIEEF                  FH     ILK   MA  K + VL FP+
Subjt:  VERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF-----------------HFHLFLIIILKLTKMADNKVLHVLLFPW

Query:  SAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKL-PGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPH
         AFGH++P+FQLA+ALA +GVHV F+STPKNLQRLPP PPSLS  ITP+ +PLPKL  G  LPEGAEAT+D+P  K+  L++ALDLA+P FRK + D P+
Subjt:  SAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKL-PGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPH

Query:  PPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPPIDG--STVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIV
        PPDWFIVDF+ATWI +++R+ +IPT+FF VIS+GFLA + +V   G P  +I  L +P  +DG  S V++RR EAA + + F   N +GMS  DR+ KI+
Subjt:  PPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPPIDG--STVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIV

Query:  SASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWA
        +AS+AI +R+C E D  Y  FYS  CGKKV+P+G LPPEKP+KTEF  DSPWKS FEWLD+QNP+SVVFVGFGSEC+ TKDQ+H+IARG+ELS+LPFLW+
Subjt:  SASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWA

Query:  LRKPEWA--EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGS
        LRKP WA  +DSD +PVGF+DRTAERGIV MGWAPQMEILGHPAIGG  FHGGWGSAIEALQFGH LVLLPFI+DQPL ARLLV+KGV +EVER+E DG 
Subjt:  LRKPEWA--EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGS

Query:  FSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLKKGDS
        FSGEAI KALRK ++SEEGEKIR+ AKE  AIFG+ KLHQQYI  FVE  K   S
Subjt:  FSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLKKGDS

TrEMBL top hitse value%identityAlignment
A0A0R0JB13 Uncharacterized protein2.2e-25452.56Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAEN  + V + PWSAFGHLIP F+LSIALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEATVDIPF K  +LK ALD  
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND
        + + ++F+A+    PDW I DFN  W+ DI++EF++ ++ F +LS     F     GT       E+L +PP      S+VA+R +EA     GF + N 
Subjt:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND

Query:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIA
        SG+SD +R  KI  AS+A+  RSC E + +YL  Y     K +IP+GLLP E+        D     IF+WLD+Q  +SVVFVGFGSE KLSKDQ+ EIA
Subjt:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIA

Query:  RGVELSELPFLWALRKPEW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGV
         G+E S+LPFLWALRKP W + D   LPVGF +RT+ RG V  GW PQ+EIL H +IGGSLFH GWGS IE LQFG+ LVLLPF ++QPLNAR LV+K +
Subjt:  RGVELSELPFLWALRKPEW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGV

Query:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLIPYFQLAI
        A+EV+R  EDGSF    IA +LR+AMV EEG+KIR   REAAAI G+ KLHQ +                        + V + PWSAFGHLIP+F+L+I
Subjt:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLIPYFQLAI

Query:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIF
        ALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEAT+DIPF K   LK A D  +   ++F+A+    PDW I DFN  W+ 
Subjt:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIF

Query:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFD
        DI++EF++  + F +IS+     +        PLS   SL +PP      S+VA+R+HEA    AG ++++ SG+SD +R+ K+  AS+A+  RSC E +
Subjt:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFD

Query:  VDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AEDSDPLP
         +Y   +     K VIP+GLLP E+        D    + FEWLD+Q  +SVVFVGFGSE K +KDQ+ EIA G+E S+LPFLWALRKP W + D   LP
Subjt:  VDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AEDSDPLP

Query:  VGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMIS
        VGF +RT+ RG V  GW PQ+EIL H +IGGSLFH G GS IE LQFGH LV+LPF +DQPL AR LV+KG+AIEV+R  EDGSF+   I  +LR+ M+ 
Subjt:  VGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMIS

Query:  EEGEKIRKKAKEIGAIFGDTKLHQ-QYIEEFVEFLKKG
        EEG+KIR   +E  AI G+ KLHQ  Y+  FV+FLK G
Subjt:  EEGEKIRKKAKEIGAIFGDTKLHQ-QYIEEFVEFLKKG

A0A371G1Q6 Putative UDP-rhamnose:rhamnosyltransferase 1 (Fragment)3.8e-26754.08Show/hide
Query:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFRK
        +H+V+ PWSAFGHLIP F+LSIALAKAGV VSF+STPKN+QRLP +P +L+  +  V  PLP L  + LPEGAEAT+DIPF KI +LKLA D  + + +K
Subjt:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFRK

Query:  FIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKNDSGMSDR
         +      P+W I DF+  W+ DI+ EF++ ++F+ V S   + FF        PLS  E+L  PP      S+VAY+R+EA     G    N SG+SD 
Subjt:  FIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKNDSGMSDR

Query:  DRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIARGVELS
        +R TK++ AS+A+  RSC E + +YL  +    GK VIP+GLLP ++ ++     D     IF+WLD+Q  +SVVFVGFGSECKL+KDQ+ EIA G+E S
Subjt:  DRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIARGVELS

Query:  ELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGVAVEVER
        ELPFLWALRKP WA  D D +PVGF +RT  RGIV MGW PQ EIL HP+IGGSLFH GWGSAIEALQFGH LVLLPFI+DQPLNAR LV+KG+A+EV+R
Subjt:  ELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGVAVEVER

Query:  KEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLIPYFQLAIALAKAG
          EDGSF    IA +LR+AMV EEG+ IR   REAA I G+ KLHQ +        +++  ++  KM   K +  ++ PWSAFGHLIP+F+L+IALAKAG
Subjt:  KEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLIPYFQLAIALAKAG

Query:  VHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIFDISREF
        VHVSF+STPKN+QRLP +P +L+  +  V  PL  L  + LPEGAEATVDIPF KI  LKLA D  +   +K + +    P+W I DF+  W+ DI++EF
Subjt:  VHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIFDISREF

Query:  RIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKF
        ++  +F+ V+S+  +  L  V     P+S   SL  PP      S+VAY+RHEA    AG   +N SG+SD +RVTKI+ AS+A+ VRSC E + +Y   
Subjt:  RIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKF

Query:  YSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDR
        +    GK VIP+GLLP ++  +     D    + FEWLD+Q  +SVVFVGFGSE K TKDQ+ EIA G+E SELPFLWALRKP WA  D D +PVGF +R
Subjt:  YSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDR

Query:  TAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMISEEGEKI
        T  RGIV MGW PQ EIL HP+IGGSLFH GWGS IE LQF H LV+LPFI+DQPLNAR LV+KG+AIEV +K EDGSF+   I  +LR+ M+ EEG+ I
Subjt:  TAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMISEEGEKI

Query:  RKKAKEIGAIFGDTKLHQ-QYIEEFVEFLKKG
        R  A E   I G+ KLHQ  YI  FV+FL+ G
Subjt:  RKKAKEIGAIFGDTKLHQ-QYIEEFVEFLKKG

A0A498KQ11 Uncharacterized protein5.6e-25050.83Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        M ++  L VV+ PWSAFGH++P FQLSIALAKA VH SFISTPKN+QRLP I P L PF+  V IP P L  D LP+GAEATVD+PF     LK A D  
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV-----LGTGLPLSEIENLMSPPPI--DGSTVAYRRYEAAGIRGGF
        +   ++FI D    PDW I DF A W+ +I +E+ +P+V+F V S      F+         T   L  +E+L SPP +    STVA+R  EA  +  GF
Subjt:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV-----LGTGLPLSEIENLMSPPPI--DGSTVAYRRYEAAGIRGGF

Query:  FEKNDSGMSDRDRATKIISASRAI-----AVR---SCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGS
        +  NDSG+SD DR     + +R        +R   +C+EF+ +YL+ Y +  GK +IP GLLPPE P K   G  S    IF+WLD+Q            
Subjt:  FEKNDSGMSDRDRATKIISASRAI-----AVR---SCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGS

Query:  ECKLSKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVD
                                   RKP WA  ++D LP+GF DR +E+G+V  GW PQMEIL HP++GGSLFH GWGS IE LQFGH LV+LPFI+D
Subjt:  ECKLSKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVD

Query:  QPLNARLLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQ-RYIEEFHFHLFLIIILKLTKMADNKV-------L
        QPLNARLL +KG+AVEV+R+  DGSF  + IAK LR AMV EEGE++R  AR+AA +FGD KLHQ  YI +F       +  K TK +  K        L
Subjt:  QPLNARLLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQ-RYIEEFHFHLFLIIILKLTKMADNKV-------L

Query:  HVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKF
         V++ PWS+FGH+IPYFQL++ALAKA VHVSFISTPKN+Q+LP     L  FI  V IP P L    LPEGAEATVD+PF K   LK+A DL + P ++F
Subjt:  HVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKF

Query:  IADHPHPPDWFIVDFNATWIFDISREFRIPTVF---FCVISSGFLALLAHVLGS----GLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMND
        I +    PDW I D+ A W+ DI++E+ IP  +   FC+  S F     ++LG+     LP  E  SL SPP      STVA+R  EA  +  GFF  N 
Subjt:  IADHPHPPDWFIVDFNATWIFDISREFRIPTVF---FCVISSGFLALLAHVLGS----GLPLSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMND

Query:  SGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIA
        SG+SD  R+ KIVS S+ +A+RSC+E + +Y + Y    GK V+  GLLPPEKP +      S     FEWLD+   +SVVFVGFGSECK +K+Q+ EIA
Subjt:  SGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIA

Query:  RGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGV
         G+ELSELPFLWALRKP WA  D+D LPVGF DRT+E+G+V +GW PQMEIL HP+IGGSLFH GWGS IE LQFG  LV LP + DQPLNARLL D+G+
Subjt:  RGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGV

Query:  AIEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQ-QYIEEFVEFLK
        A+EV+R   DGSFS + I K LR  M+ EEGEK+R  A++  A+FGD KLHQ  YI +FV FLK
Subjt:  AIEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQ-QYIEEFVEFLK

A0A6J1E951 uncharacterized protein LOC1114318150.0e+0071.52Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAENK LHVV+ PWSAFGHL+PHFQL+I+LAKAGVHVSFISTP+NL+RLP IPPSLSPFIT V IPLPKLPGDPLPEGAEATVDIPF KIPFLKLALDLA
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPPIDGSTVAYRRYEAAGIRGGFFEKNDSG
        EP FR+F+A+HPH PDW IVDFNATWIC+ISR+F+IPIVF  V SP  LAFFA ++G G    +I +LMSPP IDGS VAYRRYEA  I G  + KNDSG
Subjt:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPPIDGSTVAYRRYEAAGIRGGFFEKNDSG

Query:  MSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIARG
        +SD +R+ KIISA +A  +RSC EFDVDYLK Y+D+ G+KVIP+GLLPPEKPQK+EF ADSPWK  F+WLDQQNP+SVVFVGFGSECKL+KD+IH+IARG
Subjt:  MSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIARG

Query:  VELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGVAVE
        +ELSELPFLW+LRKP+WA DSD LP GF+DRTAERGIVSMGWAPQMEILGHPAIGG  FHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV+KGVAVE
Subjt:  VELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGVAVE

Query:  VERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF-----------------HFHLFLIIILKLTKMADNKVLHVLLFPW
        VERKEEDGSF GE IAKALREAM SEEGEKIR+RA E AAIFGD KLHQRYIEEF                  FH     ILK   MA  K + VL FP+
Subjt:  VERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF-----------------HFHLFLIIILKLTKMADNKVLHVLLFPW

Query:  SAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKL-PGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPH
         AFGH++P+FQLA+ALA +GVHV F+STPKNLQRLPP PPSLS  ITP+ +PLPKL  G  LPEGAEAT+D+P  K+  L++ALDLA+P FRK + D P+
Subjt:  SAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKL-PGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPH

Query:  PPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPPIDG--STVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIV
        PPDWFIVDF+ATWI +++R+ +IPT+FF VIS+GFLA + +V   G P  +I  L +P  +DG  S V++RR EAA + + F   N +GMS  DR+ KI+
Subjt:  PPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPPIDG--STVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIV

Query:  SASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWA
        +AS+AI +R+C E D  Y  FYS  CGKKV+P+G LPPEKP+KTEF  DSPWKS FEWLD+QNP+SVVFVGFGSEC+ TKDQ+H+IARG+ELS+LPFLW+
Subjt:  SASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWA

Query:  LRKPEWA--EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGS
        LRKP WA  +DSD +PVGF+DRTAERGIV MGWAPQMEILGHPAIGG  FHGGWGSAIEALQFGH LVLLPFI+DQPL ARLLV+KGV +EVER+E DG 
Subjt:  LRKPEWA--EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGS

Query:  FSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLKKGDS
        FSGEAI KALRK ++SEEGEKIR+ AKE  AIFG+ KLHQQYI  FVE  K   S
Subjt:  FSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLKKGDS

A0A6N2M6C4 Uncharacterized protein7.0e-26146.81Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAE   LHVV+ PW AFGH+IP +QLSIALAKAG+ VSF+STP+N++RLP IPP L+  +  V  PLP L  D LPE  EATVDIP  KI +LK+A DL 
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAH---VLGTGLP-LSEIENLMSPPP-ID-GSTVAYRRYEAAGIRGGFF
        +   ++FIAD    PDW I+D    W+ DI+RE ++ ++ F VL      F  H   + G     L+E  ++ S P  +D  S+VAYR  EAAG   G +
Subjt:  EPSFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAH---VLGTGLP-LSEIENLMSPPP-ID-GSTVAYRRYEAAGIRGGFF

Query:  EKNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQI
         +N SG++D +R T+I +  +A+ VRSC EF+ DYL L+    GK VIPVGL+P EK ++ +F  D  W  IF+WLD Q P+S+VFVGFGSE KL+KDQ+
Subjt:  EKNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQI

Query:  HEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV
        +EIA GVELS LPFLWALRKP WA +D D LP+GF +RT++RGIV  GW PQ+EILGHP+IGGSLFH GWGS IE+LQFGH L+LLPFI DQPLNAR +V
Subjt:  HEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV

Query:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLH---------------------------------------------
        +KG+ VE+E K EDGSF  + + KAL+ AMVS EG+ +R +A EAAA+FG+ KLH                                             
Subjt:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLH---------------------------------------------

Query:  -------------------------------------------------------------------------------QRYIEEFHFHLFLI-------
                                                                                       QR I E  F LFL+       
Subjt:  -------------------------------------------------------------------------------QRYIEEFHFHLFLI-------

Query:  ----------IILKLTK----------------------MADNKVLHVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFIT
                  I  +LT                       MA+   LHV++ PW AFGH+IP+FQL+I LAKAG+ VSF+STP+N++RLP IPPSL+  + 
Subjt:  ----------IILKLTK----------------------MADNKVLHVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFIT

Query:  PVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAH---VLGS
         V  PLP L  D LPE  EATVDIP  KI  LK+A DL K P ++FIAD    PDW I+D    W+ DI+RE ++P + F V S      L H   ++G 
Subjt:  PVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAH---VLGS

Query:  G---LPLSEIGSLMSPPPID-GSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRK
        G   L  S       P  +D  S+VAYR  EA  +  G +  N SG++D +R+++I++  +A  VRSC EF+ DY   +    GK VIPVGLLP EKP +
Subjt:  G---LPLSEIGSLMSPPPID-GSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRK

Query:  TEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPA
         +F  D  W   F+WLD Q P+S+VFVGFGSE K TKDQ++EIA G+ELS LPFLWALRKP WA +D D LP GF +RT++RGIV  GWAPQ+EILGHP+
Subjt:  TEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPA

Query:  IGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQQ-YI
        IGGS  H GWGS  E+LQFGH L+LLPFI+DQPLNAR LV+KG+ +E+ER  ED SF+ + + KAL+  M+S EG+ +R+KA E G +FG+ KLHQ  YI
Subjt:  IGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQQ-YI

Query:  EEFVEFLKKGDSN
         +FV+FLKK   N
Subjt:  EEFVEFLKKGDSN

SwissProt top hitse value%identityAlignment
B3VI56 UDP-glycosyltransferase 91D23.8e-8639.83Show/hide
Query:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP
        + K LHV  FPW AFGH++P+ QLS  +A+ G  VSF+ST +N+QRL      +SP I  V + LP++    LPE AEAT D+    IP+LK A D  +P
Subjt:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP

Query:  SFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSE----IENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE
           +F+    H PDW I D+   W+  I+    I    F V +P  +A+        +  S+    +E+L +PP   P   + V +R+++ A +      
Subjt:  SFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSE----IENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE

Query:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIH
            G+SD  R   ++  S  +  +  +EF   +L L        V+PVGLLPPE P   +   D  W  I +WLD +   SVV+V  GSE  +S+ ++ 
Subjt:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIH

Query:  EIARGVELSELPFLWALRKPEWAEDSD--PLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV
        E+A G+ELS LPF+WA RKP+    SD   LP GF +RT +RG+V   WAPQ+ IL H ++ G L H G GS +E L FGH L++LP   DQPLNARLL 
Subjt:  EIARGVELSELPFLWALRKPEWAEDSD--PLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV

Query:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF
        DK V +E+ R EEDG    E++A++LR  +V +EGE  +  ARE + I+ DTK+ + Y+ +F
Subjt:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF

D4Q9Z5 Soyasaponin III rhamnosyltransferase5.8e-8739.45Show/hide
Query:  LTKMADNKVLHVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLAL
        L   +++K LHV + PW A GH+ PYF++A  LA+ G  V+FI++PKN+ R+P  P  L PFI  V +PLPK+  + LPEGAE+T+DIP  K   LK A 
Subjt:  LTKMADNKVLHVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLAL

Query:  DLAKPPFRKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPP---PIDGSTVAYRRHEAAEIRAGFF
        +  +    K +      PDW + DF A W+  I++ + IP   +  I+  F  +        +    + S+  PP   P   +T+  R +E      G  
Subjt:  DLAKPPFRKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPP---PIDGSTVAYRRHEAAEIRAGFF

Query:  EMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPE-KPRKTEFEADSP-WKSTFEWLDQQNPQSVVFVGFGSECKFTKD
        +      +  D + K  S+     +R+  E + D+  + +      V+PVGLLPP  + R  E E ++P W    +WLD Q   SVV++GFGSE K +++
Subjt:  EMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPE-KPRKTEFEADSP-WKSTFEWLDQQNPQSVVFVGFGSECKFTKD

Query:  QIHEIARGVELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLL
         + E+A G+ELS LPF WAL+  +  E    LP GF +RT ERGIV   WAPQ++IL H AIGG + H G GS IE + FGH LV LP+++DQ L +R+L
Subjt:  QIHEIARGVELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLL

Query:  VDKGVAIEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLKK
         +K VA+EV R E+DGSF+   + K LR  ++ EEG  +R+ AKE+G +F   +LH +YI++F++ L+K
Subjt:  VDKGVAIEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLKK

Q66PF2 Putative UDP-rhamnose:rhamnosyltransferase 11.6e-9241.81Show/hide
Query:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP
        + K LH+ +FPW AFGH+IP  +++  +A+ G  VSFISTP+N+QRLP IP +L+P I  V IPLP +  + LPE AEAT+D+P   IP+LK+A D  E 
Subjt:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP

Query:  SFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV----LGTGLPLSEIENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE
           +F+      PDW I DF   W+  I+ +  I    F + +   + FF       +    P  ++E   SPP   P   S + +R +EA  +  G   
Subjt:  SFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV----LGTGLPLSEIENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE

Query:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKK-VIPVGLLPPEKPQKTEFGA-DSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQ
         N SG++DR R    I   +   +RSC E + ++L L  D   K  V+P GLLPP  P+  E G  DS W  I  WLD+Q    VV+  FGSE  LS++ 
Subjt:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKK-VIPVGLLPPEKPQKTEFGA-DSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQ

Query:  IHEIARGVELSELPFLWALRKPEWAE---DSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNAR
         +E+A G+ELS LPF W LRKP       DS  LP GF DR   RG+V   WAPQ++IL H ++GG L H GW S IE+LQ+G  L++LPF+ DQ L AR
Subjt:  IHEIARGVELSELPFLWALRKPEWAE---DSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNAR

Query:  LLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEE
           D  +  EV R EE G F    +A +L+  +V EEG++ R  A E + +F D +LH RY++E
Subjt:  LLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEE

Q6VAA8 UDP-glycosyltransferase 91D16.4e-8639.61Show/hide
Query:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP
        + K LHV  FPW AFGH++P  QLS  +A+ G  VSF+ST +N+QRL      +SP I  V + LP++    LPE AEAT D+    I +LK A+D  +P
Subjt:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP

Query:  SFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFA----HVLGTGLPLSEIENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE
           +F+    H PDW I DF   W+  I+    I   +F V++P  +A+ A     ++      + +E+L +PP   P   + V +R+++ A +      
Subjt:  SFRKFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFA----HVLGTGLPLSEIENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE

Query:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIH
            G+SD  R   +   S  +  +  +EF   +L L        V+PVGLLPPE P   +   D  W  I +WLD +   SVV+V  GSE  +S+ ++ 
Subjt:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIH

Query:  EIARGVELSELPFLWALRKPEWAEDSD--PLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV
        E+A G+ELS LPF+WA RKP+    SD   LP GF +RT +RG+V   WAPQ+ IL H ++ G L H G GS +E L FGH L++LP   DQPLNARLL 
Subjt:  EIARGVELSELPFLWALRKPEWAEDSD--PLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV

Query:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF
        DK V +E+ R EEDG    E++A++LR  +V  EGE  +  AR  + I+ DTK+ + Y+ +F
Subjt:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF

Q940V3 UDP-glycosyltransferase 91A12.1e-8439.83Show/hide
Query:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQR-LPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFR
        LHVV+FPW AFGH++P+ +LS  +A+ G  VSFISTP+N+ R LP +P +LS  I  V + LP +  + LPE  EAT D+PF  IP+LK+A D  +    
Subjt:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQR-LPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFR

Query:  KFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIE------NLMSPP---PIDGSTVAYRRYEAAGIRGGFF-E
        +F+      PDW + DF   W+  ISR   I   FF        AF    LG   P    E      + M PP   P + ++VA++ +E   I  GF  E
Subjt:  KFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIE------NLMSPP---PIDGSTVAYRRYEAAGIRGGFF-E

Query:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIH
          +  + D  R   +I     I VRSC E++ ++L L  +   K VIPVG+LPP+  +K  F     W  + +WLD +  +S+V+V FGSE K S+ +++
Subjt:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIH

Query:  EIARGVELSELPFLWAL--RKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV
        EIA G+ELS LPF W L  R+  W  +   LP GF +RTA+RG+V  GW  Q+  L H +IG  L H GWG+ IEA++F   + +L F+ DQ LNAR++ 
Subjt:  EIARGVELSELPFLWAL--RKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV

Query:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF
        +K +   + R E +G F  E++A +LR  MV EEG+  R   +E   +FGD     RY++ F
Subjt:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF

Arabidopsis top hitse value%identityAlignment
AT2G22590.1 UDP-Glycosyltransferase superfamily protein1.5e-8539.83Show/hide
Query:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQR-LPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFR
        LHVV+FPW AFGH++P+ +LS  +A+ G  VSFISTP+N+ R LP +P +LS  I  V + LP +  + LPE  EAT D+PF  IP+LK+A D  +    
Subjt:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQR-LPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFR

Query:  KFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIE------NLMSPP---PIDGSTVAYRRYEAAGIRGGFF-E
        +F+      PDW + DF   W+  ISR   I   FF        AF    LG   P    E      + M PP   P + ++VA++ +E   I  GF  E
Subjt:  KFIADHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIE------NLMSPP---PIDGSTVAYRRYEAAGIRGGFF-E

Query:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIH
          +  + D  R   +I     I VRSC E++ ++L L  +   K VIPVG+LPP+  +K  F     W  + +WLD +  +S+V+V FGSE K S+ +++
Subjt:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIH

Query:  EIARGVELSELPFLWAL--RKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV
        EIA G+ELS LPF W L  R+  W  +   LP GF +RTA+RG+V  GW  Q+  L H +IG  L H GWG+ IEA++F   + +L F+ DQ LNAR++ 
Subjt:  EIARGVELSELPFLWAL--RKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLV

Query:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF
        +K +   + R E +G F  E++A +LR  MV EEG+  R   +E   +FGD     RY++ F
Subjt:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF

AT3G29630.1 UDP-Glycosyltransferase superfamily protein5.1e-5433.19Show/hide
Query:  HVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSP-FITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRK
        H  L+PW  FGH+IPY  LA  LA+ G  V+F++  K  ++L P+  +L P  I   ++ LP + G  LP GAE T D+P     +L  A+DL +     
Subjt:  HVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSP-FITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRK

Query:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPPIDGSTVAYRRHEA------AEIRAGFFEMNDSG
         +      PD    DF   WI  +++E  I +V + +IS+ F+A+         P +E+GS   PP    S VA R H+A      A  R   F+   +G
Subjt:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPPIDGSTVAYRRHEA------AEIRAGFFEMNDSG

Query:  MSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARG
        + +CD           IA+R+C E + +   F    C +KV+  G +  +   K+    +  W +   WL+   P SVV+  FG+   F  DQ  E+  G
Subjt:  MSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARG

Query:  VELSELPFLWALRKPEWAED-SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDK-GVA
        +EL+ LPFL A+  P  +    + LP GF +R   RGIV  GW  Q  IL HP+IG  + H G+GS  E+L     +V +P +VDQ L  RLL ++  V+
Subjt:  VELSELPFLWALRKPEWAED-SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDK-GVA

Query:  IEVERKEEDGSFSGEAIGKALRKTM--ISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLK
        ++V+R E  G FS E++   ++  M   SE G  +R+  K++        L   Y ++FV+ L+
Subjt:  IEVERKEEDGSFSGEAIGKALRKTM--ISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLK

AT4G27570.1 UDP-Glycosyltransferase superfamily protein1.3e-5234.42Show/hide
Query:  HVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPF-ITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRK
        HVL++PW A GH+ P+  LA  LA+ G  V+F+   K+L++L     +L P  I   S+ +P + G  LP G E   +IP     LL  A+DL +     
Subjt:  HVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPF-ITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRK

Query:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPPIDGSTVAYRRHEAAEI-RAGFFEMNDSGMSDCD
         +      PD    DF A WI +++R+F + TV + V+S+  +A +       +P  E+G  + PP    S V  R+ +A  + +       D G +  +
Subjt:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPPIDGSTVAYRRHEAAEI-RAGFFEMNDSGMSDCD

Query:  RVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSE
        RVT  +  S  IA+R+  E + ++  +   +C KKV+  G + PE P KT  E +  W    +WL    P SVVF   GS+    KDQ  E+  G+EL+ 
Subjt:  RVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSE

Query:  LPFLWALRKPEWAED-SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDK-GVAIEVER
         PFL A++ P  +    + LP GF +R   RG+V  GW  Q  IL HP++G  + H G+GS  E+L     +VL+P + DQ LN RLL D+  V++EV R
Subjt:  LPFLWALRKPEWAED-SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDK-GVAIEVER

Query:  KEEDGSFSGEAIGKALRKTM--ISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLK
         EE G FS E++  A+   M   SE G  +RK   +         L   Y++ FVE L+
Subjt:  KEEDGSFSGEAIGKALRKTM--ISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLK

AT5G49690.1 UDP-Glycosyltransferase superfamily protein5.6e-8537.39Show/hide
Query:  KVLHVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPF
        +V+HV +FPW A GHL+P+ +L+  LA+ G  +SFISTP+N++RLP +  +L+  IT VS PLP + G  LP  +E+++D+P++K   LK A DL +PP 
Subjt:  KVLHVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPF

Query:  RKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGS-----LMSPPPID-GSTVAYRRHEAAEIRAGFFEMND
        ++F+      PDW I D+ + W+  I+ E  I   FF + ++  L  +     S   + EI S      + PP +   S + +R HE         E + 
Subjt:  RKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGS-----LMSPPPID-GSTVAYRRHEAAEIRAGFFEMND

Query:  SGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIA
        +G+SD  R    +  S A+ VRSC EF+ ++F    +   K V P+G LPP    + +   D+ W    +WLD+Q   SVV+V  G+E     +++ E+A
Subjt:  SGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIA

Query:  RGVELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVA
         G+E SE PF W LR      +   +P GF+ R   RG+V +GW PQ++IL H ++GG L H GW S +E L FG   +  P + +Q LN RLL  KG+ 
Subjt:  RGVELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVA

Query:  IEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLK-KGDSN
        +EV R E DGSF  +++  ++R  MI + GE+IR KAK +  +FG+   + +Y++E V F++ KG S+
Subjt:  IEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLK-KGDSN

AT5G65550.1 UDP-Glycosyltransferase superfamily protein5.6e-7737.69Show/hide
Query:  LHVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRK
        LHV +FPW A GH+IPY QL+  +A+ G  VSFIST +N+ RLP I   LS  +  VS+PL +   D LPE AEAT D+P   I+ LK A D     F +
Subjt:  LHVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLAKPPFRK

Query:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALL---AHVLGSGL-PLSEIGSLMSPPP--IDGSTVAYRRHEAAEI----RAGF--F
        F+      P+W + D    W+  I+ +  +    FC  ++  + ++   A V+  G  P      L+ PPP     + + YR  EA  I     AG    
Subjt:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALL---AHVLGSGL-PLSEIGSLMSPPP--IDGSTVAYRRHEAAEI----RAGF--F

Query:  EMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQI
        E+ND    +C R+      S  I +RSC E + ++ +  S   GK VIP+GLLP       + + +  W    EWLD+   +SVV+V  G+E   + ++I
Subjt:  EMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQI

Query:  HEIARGVELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVD
          +A G+EL  LPF W LRK   A  S  LP GF++R  ERG++   W PQ +IL H ++GG + H GWGSA+E L FG  L++ P  +DQPL ARLL  
Subjt:  HEIARGVELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVD

Query:  KGVAIEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKK-AKEIGAIFGDTKLHQQYIEEFVEFLK
          + +E+ R E DG F+  ++ + +R  ++ EEG+  R   A +   IFG+ +L  QY + F+EFL+
Subjt:  KGVAIEVERKEEDGSFSGEAIGKALRKTMISEEGEKIRKK-AKEIGAIFGDTKLHQQYIEEFVEFLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAAACAAAGATCTTCACGTCGTTGTTTTTCCATGGTCGGCCTTCGGCCATTTAATACCTCATTTTCAACTCTCCATAGCCTTAGCCAAAGCCGGCGTCCATGT
CTCCTTCATCTCCACCCCCAAAAATCTACAGAGACTTCCCCCAATTCCTCCGTCTCTCTCTCCCTTCATAACTCCGGTTTCGATTCCACTCCCTAAACTCCCCGGCGACC
CCTTGCCGGAAGGTGCAGAGGCCACCGTCGATATTCCCTTTCATAAAATTCCCTTTCTCAAACTAGCCCTGGATCTCGCCGAGCCGTCGTTTCGGAAGTTCATCGCCGAT
CATCCCCATCCACCGGATTGGTTCATCGTCGATTTCAACGCTACTTGGATCTGCGACATTTCTAGAGAGTTTCGAATTCCGATCGTTTTCTTTCGCGTTCTCTCCCCTGG
ATTTCTCGCTTTCTTCGCCCATGTTCTTGGGACTGGTTTGCCTCTGTCGGAGATTGAAAACCTGATGTCGCCGCCTCCGATCGACGGCTCCACGGTGGCGTACCGGCGAT
ATGAAGCGGCCGGAATTCGTGGTGGATTTTTTGAGAAGAATGATTCCGGAATGAGCGATCGCGATAGGGCAACGAAGATTATTTCCGCGAGCCGAGCAATTGCAGTTCGT
AGTTGTAACGAATTTGATGTTGATTATTTGAAATTGTACTCGGATTATTGCGGAAAGAAAGTGATTCCTGTGGGGCTTCTTCCTCCAGAAAAGCCCCAAAAAACAGAGTT
CGGGGCGGATTCGCCATGGAAATGGATCTTCCAGTGGCTCGATCAACAAAACCCCCAATCAGTGGTTTTCGTCGGATTCGGAAGCGAATGCAAGCTCTCGAAGGATCAAA
TTCACGAGATCGCGCGTGGAGTGGAGCTTTCGGAGCTGCCATTTTTGTGGGCTCTGAGAAAACCGGAGTGGGCGGAGGATTCCGACCCGCTTCCGGTCGGTTTCCGGGAT
CGGACGGCGGAGAGAGGGATAGTGAGTATGGGGTGGGCGCCGCAGATGGAGATTTTGGGGCATCCGGCAATTGGAGGGAGTCTGTTTCACGGAGGGTGGGGATCCGCCAT
TGAAGCCCTGCAATTTGGGCATTGTCTTGTTCTGTTGCCGTTCATCGTGGATCAGCCGCTGAATGCGAGGCTTTTGGTGGACAAGGGCGTTGCAGTTGAAGTTGAAAGAA
AGGAAGAGGATGGGTCTTTCATTGGCGAGGCCATAGCCAAAGCTTTGAGAGAAGCCATGGTTTCAGAAGAAGGGGAGAAGATTAGGAGGCGAGCTAGAGAAGCTGCCGCC
ATTTTTGGGGACACCAAGCTTCATCAGCGATATATCGAGGAATTTCACTTTCATCTCTTCCTCATCATCATCCTGAAGCTCACAAAAATGGCGGACAACAAAGTTCTTCA
CGTCCTTCTATTTCCATGGTCGGCCTTTGGCCATTTAATACCTTATTTTCAACTCGCCATAGCCTTAGCCAAAGCCGGCGTCCATGTCTCCTTCATCTCCACTCCCAAAA
ATCTACAGAGACTTCCCCCAATTCCTCCGTCTCTCTCCCCCTTCATAACTCCGGTCTCGATTCCACTCCCCAAACTCCCCGGCGACCCCTTGCCGGAAGGAGCAGAGGCC
ACCGTCGATATTCCCTTTCACAAAATTTCCCTTCTCAAACTAGCCCTAGATCTCGCCAAGCCGCCGTTTCGGAAGTTCATCGCCGATCATCCCCATCCCCCCGATTGGTT
CATTGTCGATTTCAACGCGACTTGGATTTTCGATATTTCTCGAGAGTTTCGAATTCCGACCGTTTTCTTTTGCGTAATCTCGTCTGGATTTCTCGCTTTATTGGCTCATG
TTCTTGGGAGTGGTTTGCCTTTGTCGGAGATCGGAAGCCTGATGTCACCGCCGCCCATTGACGGCTCCACGGTGGCGTATCGACGGCATGAAGCGGCCGAAATTCGTGCT
GGATTTTTTGAAATGAACGATTCTGGAATGAGCGATTGCGACAGGGTAACGAAGATTGTTTCCGCTAGTCGAGCAATTGCAGTTCGTAGTTGTAACGAATTTGATGTTGA
TTATTTTAAATTTTACTCAAATTATTGCGGAAAGAAAGTGATTCCTGTGGGGCTTCTTCCTCCAGAAAAGCCCCGAAAAACAGAGTTCGAGGCCGATTCGCCATGGAAAT
CGACTTTCGAGTGGCTCGATCAACAAAACCCCCAATCGGTGGTGTTCGTCGGATTCGGAAGCGAATGCAAGTTCACGAAGGATCAAATTCACGAGATTGCGCGTGGAGTG
GAGCTTTCGGAGCTGCCATTTTTGTGGGCTCTGAGAAAACCGGAGTGGGCGGAGGATTCCGATCCGCTTCCGGTCGGTTTCCGGGATCGGACGGCGGAGAGAGGGATAGT
GAGTATGGGGTGGGCGCCGCAGATGGAGATTTTGGGGCATCCGGCAATCGGAGGGAGTCTGTTTCACGGAGGGTGGGGATCCGCCATTGAAGCTCTGCAATTTGGGCATG
GGCTTGTTCTGTTGCCGTTCATCGTGGATCAGCCGCTGAATGCAAGGCTTTTGGTGGATAAGGGCGTTGCAATTGAAGTTGAAAGGAAGGAAGAGGATGGGTCATTCAGT
GGGGAAGCCATAGGCAAAGCTTTGAGAAAAACTATGATTTCAGAAGAAGGGGAGAAGATCAGGAAGAAAGCCAAAGAAATTGGCGCCATTTTTGGGGACACCAAGCTTCA
TCAACAATATATTGAGGAATTTGTAGAATTCCTCAAAAAGGGGGATTCAAATGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAAAACAAAGATCTTCACGTCGTTGTTTTTCCATGGTCGGCCTTCGGCCATTTAATACCTCATTTTCAACTCTCCATAGCCTTAGCCAAAGCCGGCGTCCATGT
CTCCTTCATCTCCACCCCCAAAAATCTACAGAGACTTCCCCCAATTCCTCCGTCTCTCTCTCCCTTCATAACTCCGGTTTCGATTCCACTCCCTAAACTCCCCGGCGACC
CCTTGCCGGAAGGTGCAGAGGCCACCGTCGATATTCCCTTTCATAAAATTCCCTTTCTCAAACTAGCCCTGGATCTCGCCGAGCCGTCGTTTCGGAAGTTCATCGCCGAT
CATCCCCATCCACCGGATTGGTTCATCGTCGATTTCAACGCTACTTGGATCTGCGACATTTCTAGAGAGTTTCGAATTCCGATCGTTTTCTTTCGCGTTCTCTCCCCTGG
ATTTCTCGCTTTCTTCGCCCATGTTCTTGGGACTGGTTTGCCTCTGTCGGAGATTGAAAACCTGATGTCGCCGCCTCCGATCGACGGCTCCACGGTGGCGTACCGGCGAT
ATGAAGCGGCCGGAATTCGTGGTGGATTTTTTGAGAAGAATGATTCCGGAATGAGCGATCGCGATAGGGCAACGAAGATTATTTCCGCGAGCCGAGCAATTGCAGTTCGT
AGTTGTAACGAATTTGATGTTGATTATTTGAAATTGTACTCGGATTATTGCGGAAAGAAAGTGATTCCTGTGGGGCTTCTTCCTCCAGAAAAGCCCCAAAAAACAGAGTT
CGGGGCGGATTCGCCATGGAAATGGATCTTCCAGTGGCTCGATCAACAAAACCCCCAATCAGTGGTTTTCGTCGGATTCGGAAGCGAATGCAAGCTCTCGAAGGATCAAA
TTCACGAGATCGCGCGTGGAGTGGAGCTTTCGGAGCTGCCATTTTTGTGGGCTCTGAGAAAACCGGAGTGGGCGGAGGATTCCGACCCGCTTCCGGTCGGTTTCCGGGAT
CGGACGGCGGAGAGAGGGATAGTGAGTATGGGGTGGGCGCCGCAGATGGAGATTTTGGGGCATCCGGCAATTGGAGGGAGTCTGTTTCACGGAGGGTGGGGATCCGCCAT
TGAAGCCCTGCAATTTGGGCATTGTCTTGTTCTGTTGCCGTTCATCGTGGATCAGCCGCTGAATGCGAGGCTTTTGGTGGACAAGGGCGTTGCAGTTGAAGTTGAAAGAA
AGGAAGAGGATGGGTCTTTCATTGGCGAGGCCATAGCCAAAGCTTTGAGAGAAGCCATGGTTTCAGAAGAAGGGGAGAAGATTAGGAGGCGAGCTAGAGAAGCTGCCGCC
ATTTTTGGGGACACCAAGCTTCATCAGCGATATATCGAGGAATTTCACTTTCATCTCTTCCTCATCATCATCCTGAAGCTCACAAAAATGGCGGACAACAAAGTTCTTCA
CGTCCTTCTATTTCCATGGTCGGCCTTTGGCCATTTAATACCTTATTTTCAACTCGCCATAGCCTTAGCCAAAGCCGGCGTCCATGTCTCCTTCATCTCCACTCCCAAAA
ATCTACAGAGACTTCCCCCAATTCCTCCGTCTCTCTCCCCCTTCATAACTCCGGTCTCGATTCCACTCCCCAAACTCCCCGGCGACCCCTTGCCGGAAGGAGCAGAGGCC
ACCGTCGATATTCCCTTTCACAAAATTTCCCTTCTCAAACTAGCCCTAGATCTCGCCAAGCCGCCGTTTCGGAAGTTCATCGCCGATCATCCCCATCCCCCCGATTGGTT
CATTGTCGATTTCAACGCGACTTGGATTTTCGATATTTCTCGAGAGTTTCGAATTCCGACCGTTTTCTTTTGCGTAATCTCGTCTGGATTTCTCGCTTTATTGGCTCATG
TTCTTGGGAGTGGTTTGCCTTTGTCGGAGATCGGAAGCCTGATGTCACCGCCGCCCATTGACGGCTCCACGGTGGCGTATCGACGGCATGAAGCGGCCGAAATTCGTGCT
GGATTTTTTGAAATGAACGATTCTGGAATGAGCGATTGCGACAGGGTAACGAAGATTGTTTCCGCTAGTCGAGCAATTGCAGTTCGTAGTTGTAACGAATTTGATGTTGA
TTATTTTAAATTTTACTCAAATTATTGCGGAAAGAAAGTGATTCCTGTGGGGCTTCTTCCTCCAGAAAAGCCCCGAAAAACAGAGTTCGAGGCCGATTCGCCATGGAAAT
CGACTTTCGAGTGGCTCGATCAACAAAACCCCCAATCGGTGGTGTTCGTCGGATTCGGAAGCGAATGCAAGTTCACGAAGGATCAAATTCACGAGATTGCGCGTGGAGTG
GAGCTTTCGGAGCTGCCATTTTTGTGGGCTCTGAGAAAACCGGAGTGGGCGGAGGATTCCGATCCGCTTCCGGTCGGTTTCCGGGATCGGACGGCGGAGAGAGGGATAGT
GAGTATGGGGTGGGCGCCGCAGATGGAGATTTTGGGGCATCCGGCAATCGGAGGGAGTCTGTTTCACGGAGGGTGGGGATCCGCCATTGAAGCTCTGCAATTTGGGCATG
GGCTTGTTCTGTTGCCGTTCATCGTGGATCAGCCGCTGAATGCAAGGCTTTTGGTGGATAAGGGCGTTGCAATTGAAGTTGAAAGGAAGGAAGAGGATGGGTCATTCAGT
GGGGAAGCCATAGGCAAAGCTTTGAGAAAAACTATGATTTCAGAAGAAGGGGAGAAGATCAGGAAGAAAGCCAAAGAAATTGGCGCCATTTTTGGGGACACCAAGCTTCA
TCAACAATATATTGAGGAATTTGTAGAATTCCTCAAAAAGGGGGATTCAAATGAGTAG
Protein sequenceShow/hide protein sequence
MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFRKFIAD
HPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPPIDGSTVAYRRYEAAGIRGGFFEKNDSGMSDRDRATKIISASRAIAVR
SCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKWIFQWLDQQNPQSVVFVGFGSECKLSKDQIHEIARGVELSELPFLWALRKPEWAEDSDPLPVGFRD
RTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHCLVLLPFIVDQPLNARLLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAA
IFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLIPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEA
TVDIPFHKISLLKLALDLAKPPFRKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPLSEIGSLMSPPPIDGSTVAYRRHEAAEIRA
GFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPRKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGV
ELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFS
GEAIGKALRKTMISEEGEKIRKKAKEIGAIFGDTKLHQQYIEEFVEFLKKGDSNE