; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G01930 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G01930
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUDP-glycosyltransferase 91A1-like
Genome locationClcChr05:1333012..1338482
RNA-Seq ExpressionClc05G01930
SyntenyClc05G01930
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0004650 - polygalacturonase activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
InterPro domainsIPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5009163.1 hypothetical protein JHK87_017678 [Glycine soja]4.1e-25552.39Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAEN  + V + PWSAFGHLIP F+LSIALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEATVDIPF K  +LK ALD  
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND
        + + ++F+AN    PDW I DFN  W+ DI++EF++ ++ F +LS     F     GT       E+L +PP      S+VA+R +EA     GF + N 
Subjt:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND

Query:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIA
        SG+SD +R  KI  AS+A+  RSC E + +YL  Y     K +IP+GLLP E+        D    +IF WLD+Q  +SVVFVGFGSE KL+KDQ+ EIA
Subjt:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIA

Query:  RGVELSELPFLWALRKPDW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGV
         G+E S+LPFLWALRKP W + D   LPVGF +RT+ RG V  GW PQ+EIL H +IGGSLFH GWGS IE LQFG+ LVLLPF ++QPLNAR LV+K +
Subjt:  RGVELSELPFLWALRKPDW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGV

Query:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLMPYFQLAI
        A+EV+R  EDGSF    IA +LR+AMV EEG+KIR   REAAAI G+ KLHQ +                        + V + PWSAFGHL+P+F+L+I
Subjt:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLMPYFQLAI

Query:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIF
        ALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEAT+DIPF K   LK A D ++   ++F+A+    PDW I DFN  W+ 
Subjt:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIF

Query:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIG-----SLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRS
        DI++EF++  + F +IS+     +      G P +  G     SL +PP      S+VA+R+HEA    AG ++++ SG+SD +R+ K+  AS+A+  RS
Subjt:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIG-----SLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRS

Query:  CNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AED
        C E + +Y   +     K VIP+GLLP E+        D    + FEWLD+Q  +SVVFVGFGSE K +KDQ+ EIA G+E S+LPFLWALRKP W + D
Subjt:  CNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AED

Query:  SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALR
           LPVGF +RT+ RG V  GW PQ+EIL H +IGGSLFH G GS IE LQFGH LV+LPF +DQPL AR LV+KG+AIEV+R  EDGSF+   IA +LR
Subjt:  SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALR

Query:  KTMISEEGEKIRKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKG
        + M+ EEG+KIR   +E  AI G+ KLHQ  Y+  FVQFLK G
Subjt:  KTMISEEGEKIRKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKG

KAG5036958.1 hypothetical protein JHK86_017798 [Glycine max]4.1e-25552.39Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAEN  + V + PWSAFGHLIP F+LSIALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEATVDIPF K  +LK ALD  
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND
        + + ++F+AN    PDW I DFN  W+ DI++EF++ ++ F +LS     F     GT       E+L +PP      S+VA+R +EA     GF + N 
Subjt:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND

Query:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIA
        SG+SD +R  KI  AS+A+  RSC E + +YL  Y     K +IP+GLLP E+        D    +IF WLD+Q  +SVVFVGFGSE KL+KDQ+ EIA
Subjt:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIA

Query:  RGVELSELPFLWALRKPDW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGV
         G+E S+LPFLWALRKP W + D   LPVGF +RT+ RG V  GW PQ+EIL H +IGGSLFH GWGS IE LQFG+ LVLLPF ++QPLNAR LV+K +
Subjt:  RGVELSELPFLWALRKPDW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGV

Query:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLMPYFQLAI
        A+EV+R  EDGSF    IA +LR+AMV EEG+KIR   REAAAI G+ KLHQ +                        + V + PWSAFGHL+P+F+L+I
Subjt:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLMPYFQLAI

Query:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIF
        ALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEAT+DIPF K   LK A D ++   ++F+A+    PDW I DFN  W+ 
Subjt:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIF

Query:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIG-----SLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRS
        DI++EF++  + F +IS+     +      G P +  G     SL +PP      S+VA+R+HEA    AG ++++ SG+SD +R+ K+  AS+A+  RS
Subjt:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIG-----SLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRS

Query:  CNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AED
        C E + +Y   +     K VIP+GLLP E+        D    + FEWLD+Q  +SVVFVGFGSE K +KDQ+ EIA G+E S+LPFLWALRKP W + D
Subjt:  CNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AED

Query:  SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALR
           LPVGF +RT+ RG V  GW PQ+EIL H +IGGSLFH G GS IE LQFGH LV+LPF +DQPL AR LV+KG+AIEV+R  EDGSF+   IA +LR
Subjt:  SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALR

Query:  KTMISEEGEKIRKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKG
        + M+ EEG+KIR   +E  AI G+ KLHQ  Y+  FVQFLK G
Subjt:  KTMISEEGEKIRKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKG

RDX84482.1 putative UDP-rhamnose:rhamnosyltransferase 1, partial [Mucuna pruriens]1.9e-26854.08Show/hide
Query:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFRK
        +H+V+ PWSAFGHLIP F+LSIALAKAGV VSF+STPKN+QRLP +P +L+  +  V  PLP L  + LPEGAEAT+DIPF KI +LKLA D  + + +K
Subjt:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFRK

Query:  FIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKNDSGMSDR
         + +    P+W I DF+  W+ DI+ EF++ ++F+ V S   + FF        PLS  E+L  PP      S+VAY+R+EA     G    N SG+SD 
Subjt:  FIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKNDSGMSDR

Query:  DRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIARGVELS
        +R TK++ AS+A+  RSC E + +YL  +    GK VIP+GLLP ++ ++     D    +IF WLD+Q  +SVVFVGFGSECKLTKDQ+ EIA G+E S
Subjt:  DRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIARGVELS

Query:  ELPFLWALRKPDWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGVAVEVER
        ELPFLWALRKP WA  D D +PVGF +RT  RGIV MGW PQ EIL HP+IGGSLFH GWGSAIEALQFGH LVLLPFI+DQPLNAR LV+KG+A+EV+R
Subjt:  ELPFLWALRKPDWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGVAVEVER

Query:  KEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLMPYFQLAIALAKAG
          EDGSF    IA +LR+AMV EEG+ IR   REAA I G+ KLHQ +        +++  ++  KM   K +  ++ PWSAFGHL+P+F+L+IALAKAG
Subjt:  KEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLMPYFQLAIALAKAG

Query:  VHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIFDISREF
        VHVSF+STPKN+QRLP +P +L+  +  V  PL  L  + LPEGAEATVDIPF KI  LKLA D ++   +K + +    P+W I DF+  W+ DI++EF
Subjt:  VHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIFDISREF

Query:  RIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKF
        ++  +F+ V+S+  +  L   +G+        SL  PP      S+VAY+RHEA    AG   +N SG+SD +RVTKI+ AS+A+ VRSC E + +Y   
Subjt:  RIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKF

Query:  YSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDR
        +    GK VIP+GLLP ++ ++     D    + FEWLD+Q  +SVVFVGFGSE K TKDQ+ EIA G+E SELPFLWALRKP WA  D D +PVGF +R
Subjt:  YSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDR

Query:  TAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALRKTMISEEGEKI
        T  RGIV MGW PQ EIL HP+IGGSLFH GWGS IE LQF H LV+LPFI+DQPLNAR LV+KG+AIEV +K EDGSF+   IA +LR+ M+ EEG+ I
Subjt:  TAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALRKTMISEEGEKI

Query:  RKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKG
        R  A E   I G+ KLHQ  YI  FVQFL+ G
Subjt:  RKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKG

RXI07822.1 hypothetical protein DVH24_009853 [Malus domestica]3.0e-25050.52Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        M ++  L VV+ PWSAFGH++P FQLSIALAKA VH SFISTPKN+QRLP I P L PF+  VPIP P L  D LP+GAEATVD+PF     LK A D  
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV-----LGTGLPLSEIENLMSPPPI--DGSTVAYRRYEAAGIRGGF
        +   ++FI +    PDW I DF A W+ +I +E+ +P+V+F V S      F+         T   L  +E+L SPP +    STVA+R  EA  +  GF
Subjt:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV-----LGTGLPLSEIENLMSPPPI--DGSTVAYRRYEAAGIRGGF

Query:  FEKNDSGMSDRDRATKIISASRAI-----AVR---SCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGS
        +  NDSG+SD DR     + +R        +R   +C+EF+ +YL+ Y +  GK +IP GLLPPE P K   G  S    IF WLD+Q            
Subjt:  FEKNDSGMSDRDRATKIISASRAI-----AVR---SCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGS

Query:  ECKLTKDQIHEIARGVELSELPFLWALRKPDWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVD
                                   RKP+WA  ++D LP+GF DR +E+G+V  GW PQMEIL HP++GGSLFH GWGS IE LQFGH LV+LPFI+D
Subjt:  ECKLTKDQIHEIARGVELSELPFLWALRKPDWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVD

Query:  QPLNARLLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQ-RYIEEFHFHLFLIIILKLTKMADNKV-------L
        QPLNARLL +KG+AVEV+R+  DGSF  + IAK LR AMV EEGE++R  AR+AA +FGD KLHQ  YI +F       +  K TK +  K        L
Subjt:  QPLNARLLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQ-RYIEEFHFHLFLIIILKLTKMADNKV-------L

Query:  HVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKF
         V++ PWS+FGH++PYFQL++ALAKA VHVSFISTPKN+Q+LP     L  FI  V IP P L    LPEGAEATVD+PF K   LK+A DL++ P ++F
Subjt:  HVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKF

Query:  IADHPHPPDWFIVDFNATWIFDISREFRIPTVF---FCVISSGFLALLAHVLGS----GLPSSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMND
        I +    PDW I D+ A W+ DI++E+ IP  +   FC+  S F     ++LG+     LP+ E  SL SPP      STVA+R  EA  +  GFF  N 
Subjt:  IADHPHPPDWFIVDFNATWIFDISREFRIPTVF---FCVISSGFLALLAHVLGS----GLPSSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMND

Query:  SGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIA
        SG+SD  R+ KIVS S+ +A+RSC+E + +Y + Y    GK V+  GLLPPEKP +      S     FEWLD+   +SVVFVGFGSECK +K+Q+ EIA
Subjt:  SGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIA

Query:  RGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGV
         G+ELSELPFLWALRKP WA  D+D LPVGF DRT+E+G+V +GW PQMEIL HP+IGGSLFH GWGS IE LQFG  LV LP + DQPLNARLL D+G+
Subjt:  RGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGV

Query:  AIEVERKEEDGSFSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKGDSNR
        A+EV+R   DGSFS + I+K LR  M+ EEGEK+R  A++  A+FGD KLHQ  YI +FV FLK   + R
Subjt:  AIEVERKEEDGSFSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKGDSNR

XP_022924286.1 uncharacterized protein LOC111431815 [Cucurbita moschata]0.0e+0072.04Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAENK LHVV+ PWSAFGHL+PHFQL+I+LAKAGVHVSFISTP+NL+RLP IPPSLSPFIT VPIPLPKLPGDPLPEGAEATVDIPF KIPFLKLALDLA
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPPIDGSTVAYRRYEAAGIRGGFFEKNDSG
        EP FR+F+ANHPH PDW IVDFNATWIC+ISR+F+IPIVF  V SP  LAFFA ++G G    +I +LMSPP IDGS VAYRRYEA  I G  + KNDSG
Subjt:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPPIDGSTVAYRRYEAAGIRGGFFEKNDSG

Query:  MSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIARG
        +SD +R+ KIISA +A  +RSC EFDVDYLK Y+D+ G+KVIP+GLLPPEKPQK+EF ADSPWKS F WLDQQNP+SVVFVGFGSECKLTKD+IH+IARG
Subjt:  MSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIARG

Query:  VELSELPFLWALRKPDWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGVAVE
        +ELSELPFLW+LRKPDWA DSD LP GF+DRTAERGIVSMGWAPQMEILGHPAIGG  FHGGWGSAIEALQFGH LVLLPFIVDQPLNARLLV+KGVAVE
Subjt:  VELSELPFLWALRKPDWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGVAVE

Query:  VERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF-----------------HFHLFLIIILKLTKMADNKVLHVLLFPW
        VERKEEDGSF GE IAKALREAM SEEGEKIR+RA E AAIFGD KLHQRYIEEF                  FH     ILK   MA  K + VL FP+
Subjt:  VERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF-----------------HFHLFLIIILKLTKMADNKVLHVLLFPW

Query:  SAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKL-PGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPH
         AFGH+MP+FQLA+ALA +GVHV F+STPKNLQRLPP PPSLS  ITP+ +PLPKL  G  LPEGAEAT+D+P  K+  L++ALDL +P FRK + D P+
Subjt:  SAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKL-PGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPH

Query:  PPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPPIDG--STVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIV
        PPDWFIVDF+ATWI +++R+ +IPT+FF VIS+GFLA + +V   G P  +I  L +P  +DG  S V++RR EAA + + F   N +GMS  DR+ KI+
Subjt:  PPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPPIDG--STVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIV

Query:  SASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWA
        +AS+AI +R+C E D  Y  FYS  CGKKV+P+G LPPEKPQKTEF  DSPWKS FEWLD+QNP+SVVFVGFGSEC+ TKDQ+H+IARG+ELS+LPFLW+
Subjt:  SASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWA

Query:  LRKPEWA--EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGS
        LRKP WA  +DSD +PVGF+DRTAERGIV MGWAPQMEILGHPAIGG  FHGGWGSAIEALQFGH LVLLPFI+DQPL ARLLV+KGV +EVER+E DG 
Subjt:  LRKPEWA--EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGS

Query:  FSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLKKGDS
        FSGEAIAKALRK ++SEEGEKIR+ AKE  AIFG+ KLHQQYI  FV+  K   S
Subjt:  FSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLKKGDS

TrEMBL top hitse value%identityAlignment
A0A0R0JB13 Uncharacterized protein2.0e-25552.39Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAEN  + V + PWSAFGHLIP F+LSIALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEATVDIPF K  +LK ALD  
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND
        + + ++F+AN    PDW I DFN  W+ DI++EF++ ++ F +LS     F     GT       E+L +PP      S+VA+R +EA     GF + N 
Subjt:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKND

Query:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIA
        SG+SD +R  KI  AS+A+  RSC E + +YL  Y     K +IP+GLLP E+        D    +IF WLD+Q  +SVVFVGFGSE KL+KDQ+ EIA
Subjt:  SGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIA

Query:  RGVELSELPFLWALRKPDW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGV
         G+E S+LPFLWALRKP W + D   LPVGF +RT+ RG V  GW PQ+EIL H +IGGSLFH GWGS IE LQFG+ LVLLPF ++QPLNAR LV+K +
Subjt:  RGVELSELPFLWALRKPDW-AEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGV

Query:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLMPYFQLAI
        A+EV+R  EDGSF    IA +LR+AMV EEG+KIR   REAAAI G+ KLHQ +                        + V + PWSAFGHL+P+F+L+I
Subjt:  AVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLMPYFQLAI

Query:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIF
        ALAKAGVHVSFISTPKN+QRLP IP +LS  +  V +PLP L  D LPEGAEAT+DIPF K   LK A D ++   ++F+A+    PDW I DFN  W+ 
Subjt:  ALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIF

Query:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIG-----SLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRS
        DI++EF++  + F +IS+     +      G P +  G     SL +PP      S+VA+R+HEA    AG ++++ SG+SD +R+ K+  AS+A+  RS
Subjt:  DISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIG-----SLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRS

Query:  CNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AED
        C E + +Y   +     K VIP+GLLP E+        D    + FEWLD+Q  +SVVFVGFGSE K +KDQ+ EIA G+E S+LPFLWALRKP W + D
Subjt:  CNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEW-AED

Query:  SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALR
           LPVGF +RT+ RG V  GW PQ+EIL H +IGGSLFH G GS IE LQFGH LV+LPF +DQPL AR LV+KG+AIEV+R  EDGSF+   IA +LR
Subjt:  SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALR

Query:  KTMISEEGEKIRKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKG
        + M+ EEG+KIR   +E  AI G+ KLHQ  Y+  FVQFLK G
Subjt:  KTMISEEGEKIRKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKG

A0A371G1Q6 Putative UDP-rhamnose:rhamnosyltransferase 1 (Fragment)9.1e-26954.08Show/hide
Query:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFRK
        +H+V+ PWSAFGHLIP F+LSIALAKAGV VSF+STPKN+QRLP +P +L+  +  V  PLP L  + LPEGAEAT+DIPF KI +LKLA D  + + +K
Subjt:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFRK

Query:  FIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKNDSGMSDR
         + +    P+W I DF+  W+ DI+ EF++ ++F+ V S   + FF        PLS  E+L  PP      S+VAY+R+EA     G    N SG+SD 
Subjt:  FIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPP--IDGSTVAYRRYEAAGIRGGFFEKNDSGMSDR

Query:  DRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIARGVELS
        +R TK++ AS+A+  RSC E + +YL  +    GK VIP+GLLP ++ ++     D    +IF WLD+Q  +SVVFVGFGSECKLTKDQ+ EIA G+E S
Subjt:  DRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIARGVELS

Query:  ELPFLWALRKPDWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGVAVEVER
        ELPFLWALRKP WA  D D +PVGF +RT  RGIV MGW PQ EIL HP+IGGSLFH GWGSAIEALQFGH LVLLPFI+DQPLNAR LV+KG+A+EV+R
Subjt:  ELPFLWALRKPDWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGVAVEVER

Query:  KEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLMPYFQLAIALAKAG
          EDGSF    IA +LR+AMV EEG+ IR   REAA I G+ KLHQ +        +++  ++  KM   K +  ++ PWSAFGHL+P+F+L+IALAKAG
Subjt:  KEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLMPYFQLAIALAKAG

Query:  VHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIFDISREF
        VHVSF+STPKN+QRLP +P +L+  +  V  PL  L  + LPEGAEATVDIPF KI  LKLA D ++   +K + +    P+W I DF+  W+ DI++EF
Subjt:  VHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIFDISREF

Query:  RIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKF
        ++  +F+ V+S+  +  L   +G+        SL  PP      S+VAY+RHEA    AG   +N SG+SD +RVTKI+ AS+A+ VRSC E + +Y   
Subjt:  RIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKF

Query:  YSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDR
        +    GK VIP+GLLP ++ ++     D    + FEWLD+Q  +SVVFVGFGSE K TKDQ+ EIA G+E SELPFLWALRKP WA  D D +PVGF +R
Subjt:  YSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDR

Query:  TAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALRKTMISEEGEKI
        T  RGIV MGW PQ EIL HP+IGGSLFH GWGS IE LQF H LV+LPFI+DQPLNAR LV+KG+AIEV +K EDGSF+   IA +LR+ M+ EEG+ I
Subjt:  TAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALRKTMISEEGEKI

Query:  RKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKG
        R  A E   I G+ KLHQ  YI  FVQFL+ G
Subjt:  RKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKG

A0A498KQ11 Uncharacterized protein1.5e-25050.52Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        M ++  L VV+ PWSAFGH++P FQLSIALAKA VH SFISTPKN+QRLP I P L PF+  VPIP P L  D LP+GAEATVD+PF     LK A D  
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV-----LGTGLPLSEIENLMSPPPI--DGSTVAYRRYEAAGIRGGF
        +   ++FI +    PDW I DF A W+ +I +E+ +P+V+F V S      F+         T   L  +E+L SPP +    STVA+R  EA  +  GF
Subjt:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV-----LGTGLPLSEIENLMSPPPI--DGSTVAYRRYEAAGIRGGF

Query:  FEKNDSGMSDRDRATKIISASRAI-----AVR---SCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGS
        +  NDSG+SD DR     + +R        +R   +C+EF+ +YL+ Y +  GK +IP GLLPPE P K   G  S    IF WLD+Q            
Subjt:  FEKNDSGMSDRDRATKIISASRAI-----AVR---SCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGS

Query:  ECKLTKDQIHEIARGVELSELPFLWALRKPDWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVD
                                   RKP+WA  ++D LP+GF DR +E+G+V  GW PQMEIL HP++GGSLFH GWGS IE LQFGH LV+LPFI+D
Subjt:  ECKLTKDQIHEIARGVELSELPFLWALRKPDWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVD

Query:  QPLNARLLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQ-RYIEEFHFHLFLIIILKLTKMADNKV-------L
        QPLNARLL +KG+AVEV+R+  DGSF  + IAK LR AMV EEGE++R  AR+AA +FGD KLHQ  YI +F       +  K TK +  K        L
Subjt:  QPLNARLLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQ-RYIEEFHFHLFLIIILKLTKMADNKV-------L

Query:  HVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKF
         V++ PWS+FGH++PYFQL++ALAKA VHVSFISTPKN+Q+LP     L  FI  V IP P L    LPEGAEATVD+PF K   LK+A DL++ P ++F
Subjt:  HVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKF

Query:  IADHPHPPDWFIVDFNATWIFDISREFRIPTVF---FCVISSGFLALLAHVLGS----GLPSSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMND
        I +    PDW I D+ A W+ DI++E+ IP  +   FC+  S F     ++LG+     LP+ E  SL SPP      STVA+R  EA  +  GFF  N 
Subjt:  IADHPHPPDWFIVDFNATWIFDISREFRIPTVF---FCVISSGFLALLAHVLGS----GLPSSEIGSLMSPPP--IDGSTVAYRRHEAAEIRAGFFEMND

Query:  SGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIA
        SG+SD  R+ KIVS S+ +A+RSC+E + +Y + Y    GK V+  GLLPPEKP +      S     FEWLD+   +SVVFVGFGSECK +K+Q+ EIA
Subjt:  SGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIA

Query:  RGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGV
         G+ELSELPFLWALRKP WA  D+D LPVGF DRT+E+G+V +GW PQMEIL HP+IGGSLFH GWGS IE LQFG  LV LP + DQPLNARLL D+G+
Subjt:  RGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGV

Query:  AIEVERKEEDGSFSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKGDSNR
        A+EV+R   DGSFS + I+K LR  M+ EEGEK+R  A++  A+FGD KLHQ  YI +FV FLK   + R
Subjt:  AIEVERKEEDGSFSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQ-QYIEEFVQFLKKGDSNR

A0A6J1E951 uncharacterized protein LOC1114318150.0e+0072.04Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAENK LHVV+ PWSAFGHL+PHFQL+I+LAKAGVHVSFISTP+NL+RLP IPPSLSPFIT VPIPLPKLPGDPLPEGAEATVDIPF KIPFLKLALDLA
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPPIDGSTVAYRRYEAAGIRGGFFEKNDSG
        EP FR+F+ANHPH PDW IVDFNATWIC+ISR+F+IPIVF  V SP  LAFFA ++G G    +I +LMSPP IDGS VAYRRYEA  I G  + KNDSG
Subjt:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPPIDGSTVAYRRYEAAGIRGGFFEKNDSG

Query:  MSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIARG
        +SD +R+ KIISA +A  +RSC EFDVDYLK Y+D+ G+KVIP+GLLPPEKPQK+EF ADSPWKS F WLDQQNP+SVVFVGFGSECKLTKD+IH+IARG
Subjt:  MSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIARG

Query:  VELSELPFLWALRKPDWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGVAVE
        +ELSELPFLW+LRKPDWA DSD LP GF+DRTAERGIVSMGWAPQMEILGHPAIGG  FHGGWGSAIEALQFGH LVLLPFIVDQPLNARLLV+KGVAVE
Subjt:  VELSELPFLWALRKPDWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGVAVE

Query:  VERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF-----------------HFHLFLIIILKLTKMADNKVLHVLLFPW
        VERKEEDGSF GE IAKALREAM SEEGEKIR+RA E AAIFGD KLHQRYIEEF                  FH     ILK   MA  K + VL FP+
Subjt:  VERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF-----------------HFHLFLIIILKLTKMADNKVLHVLLFPW

Query:  SAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKL-PGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPH
         AFGH+MP+FQLA+ALA +GVHV F+STPKNLQRLPP PPSLS  ITP+ +PLPKL  G  LPEGAEAT+D+P  K+  L++ALDL +P FRK + D P+
Subjt:  SAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKL-PGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPH

Query:  PPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPPIDG--STVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIV
        PPDWFIVDF+ATWI +++R+ +IPT+FF VIS+GFLA + +V   G P  +I  L +P  +DG  S V++RR EAA + + F   N +GMS  DR+ KI+
Subjt:  PPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPPIDG--STVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIV

Query:  SASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWA
        +AS+AI +R+C E D  Y  FYS  CGKKV+P+G LPPEKPQKTEF  DSPWKS FEWLD+QNP+SVVFVGFGSEC+ TKDQ+H+IARG+ELS+LPFLW+
Subjt:  SASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWA

Query:  LRKPEWA--EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGS
        LRKP WA  +DSD +PVGF+DRTAERGIV MGWAPQMEILGHPAIGG  FHGGWGSAIEALQFGH LVLLPFI+DQPL ARLLV+KGV +EVER+E DG 
Subjt:  LRKPEWA--EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGS

Query:  FSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLKKGDS
        FSGEAIAKALRK ++SEEGEKIR+ AKE  AIFG+ KLHQQYI  FV+  K   S
Subjt:  FSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLKKGDS

A0A6N2M6C4 Uncharacterized protein1.2e-26046.63Show/hide
Query:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA
        MAE   LHVV+ PW AFGH+IP +QLSIALAKAG+ VSF+STP+N++RLP IPP L+  +  V  PLP L  D LPE  EATVDIP  KI +LK+A DL 
Subjt:  MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLA

Query:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAH---VLGTGLP-LSEIENLMSPPP-ID-GSTVAYRRYEAAGIRGGFF
        +   ++FIA+    PDW I+D    W+ DI+RE ++ ++ F VL      F  H   + G     L+E  ++ S P  +D  S+VAYR  EAAG   G +
Subjt:  EPSFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAH---VLGTGLP-LSEIENLMSPPP-ID-GSTVAYRRYEAAGIRGGFF

Query:  EKNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQI
         +N SG++D +R T+I +  +A+ VRSC EF+ DYL L+    GK VIPVGL+P EK ++ +F  D  W  IF+WLD Q P+S+VFVGFGSE KLTKDQ+
Subjt:  EKNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQI

Query:  HEIARGVELSELPFLWALRKPDWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLV
        +EIA GVELS LPFLWALRKP WA +D D LP+GF +RT++RGIV  GW PQ+EILGHP+IGGSLFH GWGS IE+LQFGH L+LLPFI DQPLNAR +V
Subjt:  HEIARGVELSELPFLWALRKPDWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLV

Query:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLH---------------------------------------------
        +KG+ VE+E K EDGSF  + + KAL+ AMVS EG+ +R +A EAAA+FG+ KLH                                             
Subjt:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLH---------------------------------------------

Query:  -------------------------------------------------------------------------------QRYIEEFHFHLFLI-------
                                                                                       QR I E  F LFL+       
Subjt:  -------------------------------------------------------------------------------QRYIEEFHFHLFLI-------

Query:  ----------IILKLTK----------------------MADNKVLHVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFIT
                  I  +LT                       MA+   LHV++ PW AFGH++P+FQL+I LAKAG+ VSF+STP+N++RLP IPPSL+  + 
Subjt:  ----------IILKLTK----------------------MADNKVLHVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFIT

Query:  PVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAH---VLGS
         V  PLP L  D LPE  EATVDIP  KI  LK+A DL+K P ++FIAD    PDW I+D    W+ DI+RE ++P + F V S      L H   ++G 
Subjt:  PVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAH---VLGS

Query:  G---LPSSEIGSLMSPPPID-GSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQK
        G   L  S       P  +D  S+VAYR  EA  +  G +  N SG++D +R+++I++  +A  VRSC EF+ DY   +    GK VIPVGLLP EKP++
Subjt:  G---LPSSEIGSLMSPPPID-GSTVAYRRHEAAEIRAGFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQK

Query:  TEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPA
         +F  D  W   F+WLD Q P+S+VFVGFGSE K TKDQ++EIA G+ELS LPFLWALRKP WA +D D LP GF +RT++RGIV  GWAPQ+EILGHP+
Subjt:  TEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSELPFLWALRKPEWA-EDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPA

Query:  IGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQQ-YI
        IGGS  H GWGS  E+LQFGH L+LLPFI+DQPLNAR LV+KG+ +E+ER  ED SF+ + + KAL+  M+S EG+ +R++A E G +FG+ KLHQ  YI
Subjt:  IGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQQ-YI

Query:  EEFVQFLKKGDSN
         +FV FLKK   N
Subjt:  EEFVQFLKKGDSN

SwissProt top hitse value%identityAlignment
B3VI56 UDP-glycosyltransferase 91D21.7e-8639.83Show/hide
Query:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP
        + K LHV  FPW AFGH++P+ QLS  +A+ G  VSF+ST +N+QRL      +SP I  V + LP++    LPE AEAT D+    IP+LK A D  +P
Subjt:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP

Query:  SFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSE----IENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE
           +F+    H PDW I D+   W+  I+    I    F V +P  +A+        +  S+    +E+L +PP   P   + V +R+++ A +      
Subjt:  SFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSE----IENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE

Query:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIH
            G+SD  R   ++  S  +  +  +EF   +L L        V+PVGLLPPE P   +   D  W SI +WLD +   SVV+V  GSE  +++ ++ 
Subjt:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIH

Query:  EIARGVELSELPFLWALRKPDWAEDSD--PLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLV
        E+A G+ELS LPF+WA RKP     SD   LP GF +RT +RG+V   WAPQ+ IL H ++ G L H G GS +E L FGH L++LP   DQPLNARLL 
Subjt:  EIARGVELSELPFLWALRKPDWAEDSD--PLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLV

Query:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF
        DK V +E+ R EEDG    E++A++LR  +V +EGE  +  ARE + I+ DTK+ + Y+ +F
Subjt:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF

D4Q9Z5 Soyasaponin III rhamnosyltransferase2.6e-8739.23Show/hide
Query:  LTKMADNKVLHVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLAL
        L   +++K LHV + PW A GH+ PYF++A  LA+ G  V+FI++PKN+ R+P  P  L PFI  V +PLPK+  + LPEGAE+T+DIP  K   LK A 
Subjt:  LTKMADNKVLHVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLAL

Query:  DLVKPPFRKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPP---PIDGSTVAYRRHEAAEIRAGFF
        + ++    K +      PDW + DF A W+  I++ + IP   +  I+  F  +        +    + S+  PP   P   +T+  R +E      G  
Subjt:  DLVKPPFRKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPP---PIDGSTVAYRRHEAAEIRAGFF

Query:  EMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADS--PWKSTFEWLDQQNPQSVVFVGFGSECKFTKD
        +      +  D + K  S+     +R+  E + D+  + +      V+PVGLLPP    +   E D+   W    +WLD Q   SVV++GFGSE K +++
Subjt:  EMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADS--PWKSTFEWLDQQNPQSVVFVGFGSECKFTKD

Query:  QIHEIARGVELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLL
         + E+A G+ELS LPF WAL+  +  E    LP GF +RT ERGIV   WAPQ++IL H AIGG + H G GS IE + FGH LV LP+++DQ L +R+L
Subjt:  QIHEIARGVELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLL

Query:  VDKGVAIEVERKEEDGSFSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLKK
         +K VA+EV R E+DGSF+   +AK LR  ++ EEG  +R+ AKE+G +F   +LH +YI++F+  L+K
Subjt:  VDKGVAIEVERKEEDGSFSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLKK

Q66PF2 Putative UDP-rhamnose:rhamnosyltransferase 12.7e-9241.59Show/hide
Query:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP
        + K LH+ +FPW AFGH+IP  +++  +A+ G  VSFISTP+N+QRLP IP +L+P I  V IPLP +  + LPE AEAT+D+P   IP+LK+A D  E 
Subjt:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP

Query:  SFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV----LGTGLPLSEIENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE
           +F+      PDW I DF   W+  I+ +  I    F + +   + FF       +    P  ++E   SPP   P   S + +R +EA  +  G   
Subjt:  SFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHV----LGTGLPLSEIENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE

Query:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKK-VIPVGLLPPEKPQKTEFGA-DSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQ
         N SG++DR R    I   +   +RSC E + ++L L  D   K  V+P GLLPP  P+  E G  DS W  I  WLD+Q    VV+  FGSE  L+++ 
Subjt:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKK-VIPVGLLPPEKPQKTEFGA-DSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQ

Query:  IHEIARGVELSELPFLWALRKPDWAE---DSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNAR
         +E+A G+ELS LPF W LRKP       DS  LP GF DR   RG+V   WAPQ++IL H ++GG L H GW S IE+LQ+G  L++LPF+ DQ L AR
Subjt:  IHEIARGVELSELPFLWALRKPDWAE---DSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNAR

Query:  LLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEE
           D  +  EV R EE G F    +A +L+  +V EEG++ R  A E + +F D +LH RY++E
Subjt:  LLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEE

Q6VAA8 UDP-glycosyltransferase 91D13.8e-8639.61Show/hide
Query:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP
        + K LHV  FPW AFGH++P  QLS  +A+ G  VSF+ST +N+QRL      +SP I  V + LP++    LPE AEAT D+    I +LK A+D  +P
Subjt:  ENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEP

Query:  SFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFA----HVLGTGLPLSEIENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE
           +F+    H PDW I DF   W+  I+    I   +F V++P  +A+ A     ++      + +E+L +PP   P   + V +R+++ A +      
Subjt:  SFRKFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFA----HVLGTGLPLSEIENLMSPP---PIDGSTVAYRRYEAAGIRGGFFE

Query:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIH
            G+SD  R   +   S  +  +  +EF   +L L        V+PVGLLPPE P   +   D  W SI +WLD +   SVV+V  GSE  +++ ++ 
Subjt:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIH

Query:  EIARGVELSELPFLWALRKPDWAEDSD--PLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLV
        E+A G+ELS LPF+WA RKP     SD   LP GF +RT +RG+V   WAPQ+ IL H ++ G L H G GS +E L FGH L++LP   DQPLNARLL 
Subjt:  EIARGVELSELPFLWALRKPDWAEDSD--PLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLV

Query:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF
        DK V +E+ R EEDG    E++A++LR  +V  EGE  +  AR  + I+ DTK+ + Y+ +F
Subjt:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF

Q940V3 UDP-glycosyltransferase 91A14.2e-8539.83Show/hide
Query:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQR-LPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFR
        LHVV+FPW AFGH++P+ +LS  +A+ G  VSFISTP+N+ R LP +P +LS  I  V + LP +  + LPE  EAT D+PF  IP+LK+A D  +    
Subjt:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQR-LPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFR

Query:  KFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIE------NLMSPP---PIDGSTVAYRRYEAAGIRGGFF-E
        +F+ +    PDW + DF   W+  ISR   I   FF        AF    LG   P    E      + M PP   P + ++VA++ +E   I  GF  E
Subjt:  KFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIE------NLMSPP---PIDGSTVAYRRYEAAGIRGGFF-E

Query:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIH
          +  + D  R   +I     I VRSC E++ ++L L  +   K VIPVG+LPP+  +K  F     W S+ +WLD +  +S+V+V FGSE K ++ +++
Subjt:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIH

Query:  EIARGVELSELPFLWAL--RKPDWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLV
        EIA G+ELS LPF W L  R+  W  +   LP GF +RTA+RG+V  GW  Q+  L H +IG  L H GWG+ IEA++F   + +L F+ DQ LNAR++ 
Subjt:  EIARGVELSELPFLWAL--RKPDWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLV

Query:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF
        +K +   + R E +G F  E++A +LR  MV EEG+  R   +E   +FGD     RY++ F
Subjt:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF

Arabidopsis top hitse value%identityAlignment
AT2G22590.1 UDP-Glycosyltransferase superfamily protein3.0e-8639.83Show/hide
Query:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQR-LPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFR
        LHVV+FPW AFGH++P+ +LS  +A+ G  VSFISTP+N+ R LP +P +LS  I  V + LP +  + LPE  EAT D+PF  IP+LK+A D  +    
Subjt:  LHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQR-LPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFR

Query:  KFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIE------NLMSPP---PIDGSTVAYRRYEAAGIRGGFF-E
        +F+ +    PDW + DF   W+  ISR   I   FF        AF    LG   P    E      + M PP   P + ++VA++ +E   I  GF  E
Subjt:  KFIANHPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIE------NLMSPP---PIDGSTVAYRRYEAAGIRGGFF-E

Query:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIH
          +  + D  R   +I     I VRSC E++ ++L L  +   K VIPVG+LPP+  +K  F     W S+ +WLD +  +S+V+V FGSE K ++ +++
Subjt:  KNDSGMSDRDRATKIISASRAIAVRSCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIH

Query:  EIARGVELSELPFLWAL--RKPDWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLV
        EIA G+ELS LPF W L  R+  W  +   LP GF +RTA+RG+V  GW  Q+  L H +IG  L H GWG+ IEA++F   + +L F+ DQ LNAR++ 
Subjt:  EIARGVELSELPFLWAL--RKPDWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLV

Query:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF
        +K +   + R E +G F  E++A +LR  MV EEG+  R   +E   +FGD     RY++ F
Subjt:  DKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAAIFGDTKLHQRYIEEF

AT3G29630.1 UDP-Glycosyltransferase superfamily protein1.1e-5332.97Show/hide
Query:  HVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSP-FITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRK
        H  L+PW  FGH++PY  LA  LA+ G  V+F++  K  ++L P+  +L P  I   ++ LP + G  LP GAE T D+P     +L  A+DL++     
Subjt:  HVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSP-FITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRK

Query:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPPIDGSTVAYRRHEA------AEIRAGFFEMNDSG
         +      PD    DF   WI  +++E  I +V + +IS+ F+A+         P +E+GS   PP    S VA R H+A      A  R   F+   +G
Subjt:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPPIDGSTVAYRRHEA------AEIRAGFFEMNDSG

Query:  MSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARG
        + +CD           IA+R+C E + +   F    C +KV+  G +  +   K+    +  W +   WL+   P SVV+  FG+   F  DQ  E+  G
Subjt:  MSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARG

Query:  VELSELPFLWALRKPEWAED-SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDK-GVA
        +EL+ LPFL A+  P  +    + LP GF +R   RGIV  GW  Q  IL HP+IG  + H G+GS  E+L     +V +P +VDQ L  RLL ++  V+
Subjt:  VELSELPFLWALRKPEWAED-SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDK-GVA

Query:  IEVERKEEDGSFSGEAIAKALRKTM--ISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLK
        ++V+R E  G FS E++   ++  M   SE G  +R+  K++        L   Y ++FV  L+
Subjt:  IEVERKEEDGSFSGEAIAKALRKTM--ISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLK

AT4G27570.1 UDP-Glycosyltransferase superfamily protein3.3e-5334.2Show/hide
Query:  HVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPF-ITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRK
        HVL++PW A GH+ P+  LA  LA+ G  V+F+   K+L++L     +L P  I   S+ +P + G  LP G E   +IP     LL  A+DL +     
Subjt:  HVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPF-ITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRK

Query:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPPIDGSTVAYRRHEAAEI-RAGFFEMNDSGMSDCD
         +      PD    DF A WI +++R+F + TV + V+S+  +A +       +P  E+G  + PP    S V  R+ +A  + +       D G +  +
Subjt:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPPIDGSTVAYRRHEAAEI-RAGFFEMNDSGMSDCD

Query:  RVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSE
        RVT  +  S  IA+R+  E + ++  +   +C KKV+  G + PE P KT  E +  W    +WL    P SVVF   GS+    KDQ  E+  G+EL+ 
Subjt:  RVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGVELSE

Query:  LPFLWALRKPEWAED-SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDK-GVAIEVER
         PFL A++ P  +    + LP GF +R   RG+V  GW  Q  IL HP++G  + H G+GS  E+L     +VL+P + DQ LN RLL D+  V++EV R
Subjt:  LPFLWALRKPEWAED-SDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDK-GVAIEVER

Query:  KEEDGSFSGEAIAKALRKTM--ISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLK
         EE G FS E++  A+   M   SE G  +RK   +         L   Y++ FV+ L+
Subjt:  KEEDGSFSGEAIAKALRKTM--ISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLK

AT5G49690.1 UDP-Glycosyltransferase superfamily protein1.9e-8537.2Show/hide
Query:  KVLHVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPF
        +V+HV +FPW A GHL+P+ +L+  LA+ G  +SFISTP+N++RLP +  +L+  IT VS PLP + G  LP  +E+++D+P++K   LK A DL++PP 
Subjt:  KVLHVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPF

Query:  RKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALL--AHVLGSGLPSSEIGSLMSPPPID-GSTVAYRRHEAAEIRAGFFEMNDSGM
        ++F+      PDW I D+ + W+  I+ E  I   FF + ++  L  +  +  L   + S+     + PP +   S + +R HE         E + +G+
Subjt:  RKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALL--AHVLGSGLPSSEIGSLMSPPPID-GSTVAYRRHEAAEIRAGFFEMNDSGM

Query:  SDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGV
        SD  R    +  S A+ VRSC EF+ ++F    +   K V P+G LPP    + +   D+ W    +WLD+Q   SVV+V  G+E     +++ E+A G+
Subjt:  SDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGV

Query:  ELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEV
        E SE PF W LR      +   +P GF+ R   RG+V +GW PQ++IL H ++GG L H GW S +E L FG   +  P + +Q LN RLL  KG+ +EV
Subjt:  ELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEV

Query:  ERKEEDGSFSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLK-KGDSN
         R E DGSF  +++A ++R  MI + GE+IR +AK +  +FG+   + +Y++E V+F++ KG S+
Subjt:  ERKEEDGSFSGEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLK-KGDSN

AT5G65550.1 UDP-Glycosyltransferase superfamily protein5.6e-7737.69Show/hide
Query:  LHVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRK
        LHV +FPW A GH++PY QL+  +A+ G  VSFIST +N+ RLP I   LS  +  VS+PL +   D LPE AEAT D+P   I+ LK A D +   F +
Subjt:  LHVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEATVDIPFHKISLLKLALDLVKPPFRK

Query:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALL---AHVLGSGL-PSSEIGSLMSPPP--IDGSTVAYRRHEAAEI----RAGF--F
        F+      P+W + D    W+  I+ +  +    FC  ++  + ++   A V+  G  P      L+ PPP     + + YR  EA  I     AG    
Subjt:  FIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALL---AHVLGSGL-PSSEIGSLMSPPP--IDGSTVAYRRHEAAEI----RAGF--F

Query:  EMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQI
        E+ND    +C R+      S  I +RSC E + ++ +  S   GK VIP+GLLP       + E    W    EWLD+   +SVV+V  G+E   + ++I
Subjt:  EMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQI

Query:  HEIARGVELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVD
          +A G+EL  LPF W LRK   A  S  LP GF++R  ERG++   W PQ +IL H ++GG + H GWGSA+E L FG  L++ P  +DQPL ARLL  
Subjt:  HEIARGVELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVD

Query:  KGVAIEVERKEEDGSFSGEAIAKALRKTMISEEGEKIRKR-AKEIGAIFGDTKLHQQYIEEFVQFLK
          + +E+ R E DG F+  ++A+ +R  ++ EEG+  R   A +   IFG+ +L  QY + F++FL+
Subjt:  KGVAIEVERKEEDGSFSGEAIAKALRKTMISEEGEKIRKR-AKEIGAIFGDTKLHQQYIEEFVQFLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAAACAAAGATCTTCACGTCGTTGTTTTTCCATGGTCGGCCTTCGGCCATTTAATACCTCATTTTCAACTCTCCATAGCCTTAGCCAAAGCCGGCGTCCATGT
CTCCTTCATCTCCACCCCCAAAAATCTACAGAGACTTCCCCCAATTCCTCCGTCTCTCTCTCCCTTCATAACTCCGGTTCCGATTCCACTCCCTAAACTCCCCGGCGATC
CCTTGCCGGAAGGTGCAGAGGCCACCGTCGATATTCCCTTTCATAAAATTCCCTTTCTCAAACTAGCCCTAGATCTCGCCGAGCCGTCGTTTCGGAAGTTCATCGCCAAT
CATCCCCATCCACCGGATTGGTTCATCGTCGATTTCAACGCTACTTGGATCTGCGACATTTCTAGAGAGTTTCGAATTCCGATCGTTTTCTTTCGCGTTCTCTCCCCTGG
ATTTCTCGCTTTCTTCGCCCATGTTCTTGGGACTGGTTTGCCTCTGTCGGAGATTGAAAACCTGATGTCGCCGCCTCCGATCGACGGCTCCACGGTGGCGTACCGGCGAT
ATGAAGCGGCCGGAATTCGTGGTGGATTTTTTGAGAAGAATGATTCCGGAATGAGCGATCGCGATAGGGCAACGAAGATTATTTCCGCGAGCCGAGCAATTGCAGTTCGT
AGTTGTAACGAATTTGATGTTGATTATTTGAAATTGTACTCGGATTATTGCGGAAAGAAAGTGATTCCTGTGGGGCTACTTCCTCCAGAAAAGCCCCAAAAAACAGAGTT
CGGGGCCGATTCGCCATGGAAATCGATCTTCCGGTGGCTCGATCAACAAAACCCCCAATCAGTGGTTTTCGTCGGATTCGGAAGCGAATGCAAGCTCACGAAGGATCAAA
TTCACGAGATCGCGCGTGGAGTGGAGCTTTCGGAGCTGCCATTTTTGTGGGCTCTGAGAAAACCGGACTGGGCGGAGGATTCCGACCCGCTTCCGGTCGGTTTCCGGGAT
CGGACGGCGGAGAGAGGGATAGTGAGTATGGGGTGGGCGCCGCAGATGGAGATTTTGGGGCATCCGGCAATTGGAGGGAGTCTGTTTCACGGAGGGTGGGGATCCGCCAT
TGAAGCCCTGCAATTTGGGCATCGTCTTGTTCTGTTGCCGTTCATCGTGGATCAGCCGCTGAATGCGAGGCTTTTGGTGGACAAGGGCGTTGCAGTTGAAGTTGAAAGAA
AGGAAGAGGATGGGTCTTTCATTGGCGAGGCCATAGCCAAAGCTTTGAGAGAAGCCATGGTTTCAGAAGAAGGGGAGAAGATTAGGAGGCGAGCTAGAGAAGCTGCCGCC
ATTTTTGGGGACACCAAGCTTCATCAGCGATATATCGAGGAATTTCACTTTCATCTCTTCCTCATCATCATCCTGAAGCTCACAAAAATGGCGGACAACAAAGTTCTTCA
CGTCCTTCTATTTCCATGGTCGGCCTTTGGCCATTTAATGCCTTATTTTCAACTCGCCATAGCCTTAGCCAAAGCCGGCGTCCATGTCTCCTTCATCTCCACTCCCAAAA
ATCTACAGAGACTTCCCCCAATTCCTCCGTCTCTCTCCCCCTTCATAACTCCGGTCTCGATTCCACTCCCCAAACTCCCCGGCGACCCCTTGCCGGAAGGAGCAGAGGCC
ACCGTCGATATTCCCTTTCACAAAATTTCCCTTCTCAAACTAGCCCTAGATCTCGTCAAGCCGCCGTTTCGGAAGTTCATCGCCGATCATCCCCATCCCCCCGATTGGTT
CATTGTCGATTTCAACGCTACTTGGATTTTCGATATTTCTCGAGAGTTTCGAATTCCGACCGTTTTCTTTTGCGTAATCTCGTCTGGATTTCTCGCTTTATTGGCTCATG
TTCTTGGGAGTGGTTTGCCTTCGTCGGAGATCGGAAGCCTGATGTCACCGCCGCCCATTGACGGCTCCACGGTGGCGTATCGACGGCATGAAGCGGCCGAAATTCGTGCT
GGATTTTTTGAAATGAACGATTCTGGAATGAGCGATTGCGACAGGGTAACGAAGATTGTTTCCGCTAGTCGAGCAATTGCAGTTCGTAGTTGTAACGAATTTGATGTTGA
TTATTTTAAATTTTACTCAAATTATTGCGGAAAGAAAGTGATTCCTGTGGGGCTTCTTCCTCCAGAAAAGCCCCAAAAAACAGAGTTCGAGGCCGATTCGCCATGGAAAT
CGACTTTCGAGTGGCTCGATCAACAAAACCCCCAATCGGTGGTGTTCGTCGGATTCGGAAGCGAATGCAAGTTCACGAAGGATCAAATTCACGAGATTGCGCGTGGAGTG
GAGCTTTCGGAGCTGCCATTTTTGTGGGCTCTGAGAAAACCGGAGTGGGCGGAGGATTCCGATCCGCTTCCGGTCGGTTTCCGGGATCGGACGGCGGAGAGAGGGATAGT
GAGTATGGGGTGGGCGCCGCAGATGGAGATTTTGGGGCATCCGGCAATCGGAGGGAGTCTGTTTCACGGAGGGTGGGGATCCGCCATTGAAGCTCTGCAATTTGGGCATG
GGCTTGTTCTGTTGCCGTTCATCGTGGATCAGCCGCTGAATGCAAGGCTTTTGGTGGATAAGGGCGTTGCAATTGAAGTTGAAAGGAAGGAAGAGGATGGGTCATTCAGT
GGGGAAGCCATAGCCAAAGCTTTGAGAAAAACTATGATTTCAGAAGAAGGGGAGAAGATCAGGAAGCGAGCCAAAGAAATTGGCGCCATTTTTGGGGACACCAAGCTTCA
TCAACAATATATTGAGGAATTTGTACAATTCCTCAAAAAGGGGGATTCAAATCGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAAAACAAAGATCTTCACGTCGTTGTTTTTCCATGGTCGGCCTTCGGCCATTTAATACCTCATTTTCAACTCTCCATAGCCTTAGCCAAAGCCGGCGTCCATGT
CTCCTTCATCTCCACCCCCAAAAATCTACAGAGACTTCCCCCAATTCCTCCGTCTCTCTCTCCCTTCATAACTCCGGTTCCGATTCCACTCCCTAAACTCCCCGGCGATC
CCTTGCCGGAAGGTGCAGAGGCCACCGTCGATATTCCCTTTCATAAAATTCCCTTTCTCAAACTAGCCCTAGATCTCGCCGAGCCGTCGTTTCGGAAGTTCATCGCCAAT
CATCCCCATCCACCGGATTGGTTCATCGTCGATTTCAACGCTACTTGGATCTGCGACATTTCTAGAGAGTTTCGAATTCCGATCGTTTTCTTTCGCGTTCTCTCCCCTGG
ATTTCTCGCTTTCTTCGCCCATGTTCTTGGGACTGGTTTGCCTCTGTCGGAGATTGAAAACCTGATGTCGCCGCCTCCGATCGACGGCTCCACGGTGGCGTACCGGCGAT
ATGAAGCGGCCGGAATTCGTGGTGGATTTTTTGAGAAGAATGATTCCGGAATGAGCGATCGCGATAGGGCAACGAAGATTATTTCCGCGAGCCGAGCAATTGCAGTTCGT
AGTTGTAACGAATTTGATGTTGATTATTTGAAATTGTACTCGGATTATTGCGGAAAGAAAGTGATTCCTGTGGGGCTACTTCCTCCAGAAAAGCCCCAAAAAACAGAGTT
CGGGGCCGATTCGCCATGGAAATCGATCTTCCGGTGGCTCGATCAACAAAACCCCCAATCAGTGGTTTTCGTCGGATTCGGAAGCGAATGCAAGCTCACGAAGGATCAAA
TTCACGAGATCGCGCGTGGAGTGGAGCTTTCGGAGCTGCCATTTTTGTGGGCTCTGAGAAAACCGGACTGGGCGGAGGATTCCGACCCGCTTCCGGTCGGTTTCCGGGAT
CGGACGGCGGAGAGAGGGATAGTGAGTATGGGGTGGGCGCCGCAGATGGAGATTTTGGGGCATCCGGCAATTGGAGGGAGTCTGTTTCACGGAGGGTGGGGATCCGCCAT
TGAAGCCCTGCAATTTGGGCATCGTCTTGTTCTGTTGCCGTTCATCGTGGATCAGCCGCTGAATGCGAGGCTTTTGGTGGACAAGGGCGTTGCAGTTGAAGTTGAAAGAA
AGGAAGAGGATGGGTCTTTCATTGGCGAGGCCATAGCCAAAGCTTTGAGAGAAGCCATGGTTTCAGAAGAAGGGGAGAAGATTAGGAGGCGAGCTAGAGAAGCTGCCGCC
ATTTTTGGGGACACCAAGCTTCATCAGCGATATATCGAGGAATTTCACTTTCATCTCTTCCTCATCATCATCCTGAAGCTCACAAAAATGGCGGACAACAAAGTTCTTCA
CGTCCTTCTATTTCCATGGTCGGCCTTTGGCCATTTAATGCCTTATTTTCAACTCGCCATAGCCTTAGCCAAAGCCGGCGTCCATGTCTCCTTCATCTCCACTCCCAAAA
ATCTACAGAGACTTCCCCCAATTCCTCCGTCTCTCTCCCCCTTCATAACTCCGGTCTCGATTCCACTCCCCAAACTCCCCGGCGACCCCTTGCCGGAAGGAGCAGAGGCC
ACCGTCGATATTCCCTTTCACAAAATTTCCCTTCTCAAACTAGCCCTAGATCTCGTCAAGCCGCCGTTTCGGAAGTTCATCGCCGATCATCCCCATCCCCCCGATTGGTT
CATTGTCGATTTCAACGCTACTTGGATTTTCGATATTTCTCGAGAGTTTCGAATTCCGACCGTTTTCTTTTGCGTAATCTCGTCTGGATTTCTCGCTTTATTGGCTCATG
TTCTTGGGAGTGGTTTGCCTTCGTCGGAGATCGGAAGCCTGATGTCACCGCCGCCCATTGACGGCTCCACGGTGGCGTATCGACGGCATGAAGCGGCCGAAATTCGTGCT
GGATTTTTTGAAATGAACGATTCTGGAATGAGCGATTGCGACAGGGTAACGAAGATTGTTTCCGCTAGTCGAGCAATTGCAGTTCGTAGTTGTAACGAATTTGATGTTGA
TTATTTTAAATTTTACTCAAATTATTGCGGAAAGAAAGTGATTCCTGTGGGGCTTCTTCCTCCAGAAAAGCCCCAAAAAACAGAGTTCGAGGCCGATTCGCCATGGAAAT
CGACTTTCGAGTGGCTCGATCAACAAAACCCCCAATCGGTGGTGTTCGTCGGATTCGGAAGCGAATGCAAGTTCACGAAGGATCAAATTCACGAGATTGCGCGTGGAGTG
GAGCTTTCGGAGCTGCCATTTTTGTGGGCTCTGAGAAAACCGGAGTGGGCGGAGGATTCCGATCCGCTTCCGGTCGGTTTCCGGGATCGGACGGCGGAGAGAGGGATAGT
GAGTATGGGGTGGGCGCCGCAGATGGAGATTTTGGGGCATCCGGCAATCGGAGGGAGTCTGTTTCACGGAGGGTGGGGATCCGCCATTGAAGCTCTGCAATTTGGGCATG
GGCTTGTTCTGTTGCCGTTCATCGTGGATCAGCCGCTGAATGCAAGGCTTTTGGTGGATAAGGGCGTTGCAATTGAAGTTGAAAGGAAGGAAGAGGATGGGTCATTCAGT
GGGGAAGCCATAGCCAAAGCTTTGAGAAAAACTATGATTTCAGAAGAAGGGGAGAAGATCAGGAAGCGAGCCAAAGAAATTGGCGCCATTTTTGGGGACACCAAGCTTCA
TCAACAATATATTGAGGAATTTGTACAATTCCTCAAAAAGGGGGATTCAAATCGGTAG
Protein sequenceShow/hide protein sequence
MAENKDLHVVVFPWSAFGHLIPHFQLSIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVPIPLPKLPGDPLPEGAEATVDIPFHKIPFLKLALDLAEPSFRKFIAN
HPHPPDWFIVDFNATWICDISREFRIPIVFFRVLSPGFLAFFAHVLGTGLPLSEIENLMSPPPIDGSTVAYRRYEAAGIRGGFFEKNDSGMSDRDRATKIISASRAIAVR
SCNEFDVDYLKLYSDYCGKKVIPVGLLPPEKPQKTEFGADSPWKSIFRWLDQQNPQSVVFVGFGSECKLTKDQIHEIARGVELSELPFLWALRKPDWAEDSDPLPVGFRD
RTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHRLVLLPFIVDQPLNARLLVDKGVAVEVERKEEDGSFIGEAIAKALREAMVSEEGEKIRRRAREAAA
IFGDTKLHQRYIEEFHFHLFLIIILKLTKMADNKVLHVLLFPWSAFGHLMPYFQLAIALAKAGVHVSFISTPKNLQRLPPIPPSLSPFITPVSIPLPKLPGDPLPEGAEA
TVDIPFHKISLLKLALDLVKPPFRKFIADHPHPPDWFIVDFNATWIFDISREFRIPTVFFCVISSGFLALLAHVLGSGLPSSEIGSLMSPPPIDGSTVAYRRHEAAEIRA
GFFEMNDSGMSDCDRVTKIVSASRAIAVRSCNEFDVDYFKFYSNYCGKKVIPVGLLPPEKPQKTEFEADSPWKSTFEWLDQQNPQSVVFVGFGSECKFTKDQIHEIARGV
ELSELPFLWALRKPEWAEDSDPLPVGFRDRTAERGIVSMGWAPQMEILGHPAIGGSLFHGGWGSAIEALQFGHGLVLLPFIVDQPLNARLLVDKGVAIEVERKEEDGSFS
GEAIAKALRKTMISEEGEKIRKRAKEIGAIFGDTKLHQQYIEEFVQFLKKGDSNR