CuGenDBv2

Moc07g20860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g20860
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Glycosyltransferase
Genome location: chr7:15167055..15168110
RNA-Seq Expression: Moc07g20860
Synteny: Moc07g20860
Gene Ontology terms: GO:0080043 - quercetin 3-O-glucosyltransferase activity (molecular function)
    GO:0080044 - quercetin 7-O-glucosyltransferase activity (molecular function)
InterPro domains: IPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase
    IPR035595 - UDP-glycosyltransferase family, conserved site
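
The genome location above is given as a `chrom:start..end` string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, since the record does not state its convention), the span can be parsed and measured as follows. The helper names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:15167055..15168110")
print(chrom, span_length(start, end))  # chr7 1056
```

Under that assumption the gene spans 1056 bp, which is a whole number of codons (352).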


Homology
GenBank top hits | e value | % identity
KAG6593628.1 UDP-glycosyltransferase 75C1, partial [Cucurbita argyrosperma subsp. sororia] | 6.7e-110 | 64.13
Query:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE
        MA++  SF+LPSALFWNQSA+VF+IY+HFFN ++D I   F+ P  +I L GLPLL+S +LPSLC+PANSNS V K +E HF V+ ++PHLKILIN+F++
Subjt:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE

Query:  LERDSLRAIPNVDLVTVGPVIQREI---PSDS-STKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRA
        LE D LRAI +V+L+ +GPV+       P++S   KS++ WLNSKPKSSVVYVSFGSI A+S  QLEEIA  LL+S  PFLWVMR    G E + +SCR 
Subjt:  LERDSLRAIPNVDLVTVGPVIQREI---PSDS-STKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRA

Query:  ELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAV
        ELE +GKIV WCSQ+E+L +P+ GCF+THCGWNS LESIACGV VVAFPQWTDQ TNA+IIEE  +SGVRLR N DGIVERGEIK+CL+LVMGDG +G  
Subjt:  ELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAV

Query:  LRKNVRKWKELARKA
        LR+N  KWK+LA  A
Subjt:  LRKNVRKWKELARKA

XP_008458151.1 PREDICTED: crocetin glucosyltransferase, chloroplastic-like [Cucumis melo] | 3.0e-110 | 60.36
Query:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE
        MA++  SF+LP+ALFWNQSA+VF+IY+HFFN +R+ I+  FS P   I L GLP L+S +LPSLC+PANSNS V K FE HF+VL ++PHLKILINSFEE
Subjt:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE

Query:  LERDSLRAIPNVDLVTVGPVIQREI----------------------PSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLW
        LE D  RA    +L+ +GPV+  +                       P+      +  WLNSKPKSSVVYVSFGSI A+S+ QLEEI  ALL+ G  FLW
Subjt:  LERDSLRAIPNVDLVTVGPVIQREI----------------------PSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLW

Query:  VMRSTVSGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERG
        VMR    G E + LSC  ELEA+GK+V WCSQ+E+LSNPA GCF+THCGWNS +ES+ CGVPVVAFPQWTDQGTNAKIIE+L +SGV+LR N +GIVERG
Subjt:  VMRSTVSGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERG

Query:  EIKRCLELVMGDGTKGAVLRKNVRKWKELARKA
        EIK+CLE+VMG+G KG   R+N +KWKELA KA
Subjt:  EIKRCLELVMGDGTKGAVLRKNVRKWKELARKA

XP_022147213.1 crocetin glucosyltransferase, chloroplastic-like [Momordica charantia] | 9.6e-157 | 87.9
Query:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE
        MADL LSFNLPSALFWNQSASVFSIYHHFF+AHRDPIRKLFSIP+GRIQL GLPLLSSDELPSLC+PAN NSLVF+ F+ HFRVLNRDPHLKILINSFEE
Subjt:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE

Query:  LERDSLRAIPNVDLVTVGPVIQREIPSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRAELEA
        LERD+LRAIPNVDLVTVGPVIQREIPSD+STK+H+AWLNSKPKSSVVYVSFGSI ALS+PQLEEIAGALL+SG PFLWVMR TV+G EDEA+SCRAELEA
Subjt:  LERDSLRAIPNVDLVTVGPVIQREIPSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRAELEA

Query:  RGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAVLRKN
        RGKIV WCSQVEILSNPATGCFVTHCGWNS LESIACGVPVVAFPQWTDQGTNAKIIEEL ESGVRLRANGD IVERGEIKRCLELVM    +G +LR+N
Subjt:  RGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAVLRKN

Query:  VRKWKELARKAQPE
        V KWKELA+KA  E
Subjt:  VRKWKELARKAQPE

XP_023000592.1 crocetin glucosyltransferase, chloroplastic-like [Cucurbita maxima] | 4.7e-111 | 64.13
Query:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE
        MAD+  SF+LPSALFWNQSA+VF+IY+HFFN ++D I   FS P  +I L GLPLL+S +LPSLC+PANSN+ V K +E HF V+ ++PHLKILIN+F++
Subjt:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE

Query:  LERDSLRAIPNVDLVTVGPVIQREI---PSDS-STKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRA
        LE D+LRAI +VDL+ +GPV+       P++S   K ++ WLNSKPKSSVVYVSFGSI A+S  QLEEIA  LL+S  PFLWVMR T  G E + +SCR 
Subjt:  LERDSLRAIPNVDLVTVGPVIQREI---PSDS-STKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRA

Query:  ELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAV
        EL A+GKIV WCSQ+E+L +P+ GCF+THCGWNS LES+ACGV VVAFPQWTDQ TNA+IIEE  ++GVRLR N DGIVERGEIK+CL+LVMGDG +G  
Subjt:  ELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAV

Query:  LRKNVRKWKELARKA
        LR+N  KWK+LA  A
Subjt:  LRKNVRKWKELARKA

XP_023514625.1 crocetin glucosyltransferase, chloroplastic-like [Cucurbita pepo subsp. pepo] | 1.1e-109 | 64.13
Query:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE
        MA++  SF+LPSALFWNQSA+VF+IY+HFFN ++D I   FS P  +I L GLPLL S +LPSLC+PANSNS V K +E HF V+ ++PHLKILIN+F++
Subjt:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE

Query:  LERDSLRAIPNVDLVTVGPVIQREI---PSDS-STKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRA
        LE D LRAI +V+L+ +GPV+       P++S   K ++ WLNSKPKSSVVYVSFGSI A+S  QLEEIA  LL+S  PFLWVMR    G E + +SCR 
Subjt:  LERDSLRAIPNVDLVTVGPVIQREI---PSDS-STKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRA

Query:  ELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAV
        EL A+GKIV WCSQ+E+L +P+ GCF+THCGWNS LESIACGV VVAFPQWTDQ TNA+IIEE  +SGVRLR N DGIVERGEIK+CL+LVMGDG +G  
Subjt:  ELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAV

Query:  LRKNVRKWKELARKA
        LR+N  KWK+LA  A
Subjt:  LRKNVRKWKELARKA

TrEMBL top hits | e value | % identity
A0A1S3C7B0 Glycosyltransferase | 1.5e-110 | 60.36
Query:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE
        MA++  SF+LP+ALFWNQSA+VF+IY+HFFN +R+ I+  FS P   I L GLP L+S +LPSLC+PANSNS V K FE HF+VL ++PHLKILINSFEE
Subjt:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE

Query:  LERDSLRAIPNVDLVTVGPVIQREI----------------------PSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLW
        LE D  RA    +L+ +GPV+  +                       P+      +  WLNSKPKSSVVYVSFGSI A+S+ QLEEI  ALL+ G  FLW
Subjt:  LERDSLRAIPNVDLVTVGPVIQREI----------------------PSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLW

Query:  VMRSTVSGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERG
        VMR    G E + LSC  ELEA+GK+V WCSQ+E+LSNPA GCF+THCGWNS +ES+ CGVPVVAFPQWTDQGTNAKIIE+L +SGV+LR N +GIVERG
Subjt:  VMRSTVSGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERG

Query:  EIKRCLELVMGDGTKGAVLRKNVRKWKELARKA
        EIK+CLE+VMG+G KG   R+N +KWKELA KA
Subjt:  EIKRCLELVMGDGTKGAVLRKNVRKWKELARKA

A0A5D3CUV8 Glycosyltransferase | 1.5e-110 | 60.36
Query:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE
        MA++  SF+LP+ALFWNQSA+VF+IY+HFFN +R+ I+  FS P   I L GLP L+S +LPSLC+PANSNS V K FE HF+VL ++PHLKILINSFEE
Subjt:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE

Query:  LERDSLRAIPNVDLVTVGPVIQREI----------------------PSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLW
        LE D  RA    +L+ +GPV+  +                       P+      +  WLNSKPKSSVVYVSFGSI A+S+ QLEEI  ALL+ G  FLW
Subjt:  LERDSLRAIPNVDLVTVGPVIQREI----------------------PSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLW

Query:  VMRSTVSGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERG
        VMR    G E + LSC  ELEA+GK+V WCSQ+E+LSNPA GCF+THCGWNS +ES+ CGVPVVAFPQWTDQGTNAKIIE+L +SGV+LR N +GIVERG
Subjt:  VMRSTVSGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERG

Query:  EIKRCLELVMGDGTKGAVLRKNVRKWKELARKA
        EIK+CLE+VMG+G KG   R+N +KWKELA KA
Subjt:  EIKRCLELVMGDGTKGAVLRKNVRKWKELARKA

A0A6J1D0N5 crocetin glucosyltransferase, chloroplastic-like | 4.6e-157 | 87.9
Query:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE
        MADL LSFNLPSALFWNQSASVFSIYHHFF+AHRDPIRKLFSIP+GRIQL GLPLLSSDELPSLC+PAN NSLVF+ F+ HFRVLNRDPHLKILINSFEE
Subjt:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE

Query:  LERDSLRAIPNVDLVTVGPVIQREIPSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRAELEA
        LERD+LRAIPNVDLVTVGPVIQREIPSD+STK+H+AWLNSKPKSSVVYVSFGSI ALS+PQLEEIAGALL+SG PFLWVMR TV+G EDEA+SCRAELEA
Subjt:  LERDSLRAIPNVDLVTVGPVIQREIPSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRAELEA

Query:  RGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAVLRKN
        RGKIV WCSQVEILSNPATGCFVTHCGWNS LESIACGVPVVAFPQWTDQGTNAKIIEEL ESGVRLRANGD IVERGEIKRCLELVM    +G +LR+N
Subjt:  RGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAVLRKN

Query:  VRKWKELARKAQPE
        V KWKELA+KA  E
Subjt:  VRKWKELARKAQPE

A0A6J1HHQ4 crocetin glucosyltransferase, chloroplastic-like | 7.2e-110 | 64.13
Query:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE
        MA++  SF+LPSALFWNQSA+VF+IY+HFFN ++D I   F+ P  +I L GLPLL S +LPSLC+PANSNS V K +E HF V+ ++PHLKILIN+F++
Subjt:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE

Query:  LERDSLRAIPNVDLVTVGPVIQREI---PSDS-STKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRA
        LE D LRAI +V+L+ +GPV+       P++S   K ++ WLNSKPKSSVVYVSFGSI A+S  QLEEIA  LL+S  PFLWVMR    G E + +SCR 
Subjt:  LERDSLRAIPNVDLVTVGPVIQREI---PSDS-STKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRA

Query:  ELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAV
        ELE +GKIV WCSQ+E+L +P+ GCF+THCGWNS LESIACGV VVAFPQWTDQ TNA+IIEE  +SGVRLR N DGIVERGEIK+CL+LVMGDG +G  
Subjt:  ELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAV

Query:  LRKNVRKWKELARKA
        LR+N  KWKELA  A
Subjt:  LRKNVRKWKELARKA

A0A6J1KE23 crocetin glucosyltransferase, chloroplastic-like | 2.3e-111 | 64.13
Query:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE
        MAD+  SF+LPSALFWNQSA+VF+IY+HFFN ++D I   FS P  +I L GLPLL+S +LPSLC+PANSN+ V K +E HF V+ ++PHLKILIN+F++
Subjt:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE

Query:  LERDSLRAIPNVDLVTVGPVIQREI---PSDS-STKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRA
        LE D+LRAI +VDL+ +GPV+       P++S   K ++ WLNSKPKSSVVYVSFGSI A+S  QLEEIA  LL+S  PFLWVMR T  G E + +SCR 
Subjt:  LERDSLRAIPNVDLVTVGPVIQREI---PSDS-STKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRA

Query:  ELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAV
        EL A+GKIV WCSQ+E+L +P+ GCF+THCGWNS LES+ACGV VVAFPQWTDQ TNA+IIEE  ++GVRLR N DGIVERGEIK+CL+LVMGDG +G  
Subjt:  ELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAV

Query:  LRKNVRKWKELARKA
        LR+N  KWK+LA  A
Subjt:  LRKNVRKWKELARKA

SwissProt top hits | e value | % identity
A7MAS5 Phloretin 4'-O-glucosyltransferase | 1.2e-82 | 47.95
Query:  ADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFS-----IPDGRIQLSGLPL-LSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILI
        A +    +LPS L W Q A+VF IY+++FN ++D IR   S     +    I+L GLPL  +S +LPS     N  +     F+    +L R+ +  IL+
Subjt:  ADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFS-----IPDGRIQLSGLPL-LSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILI

Query:  NSFEELERDSLRAIPNVDLVTVGPVIQREI-----PSD----------SSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVM
        N+F+ LE ++L+AI   +L+ VGP+I         PSD          S   S++ WLNSKP+ SV+YVSFGSI+ L + Q+EEIA  LL+ G PFLWV+
Subjt:  NSFEELERDSLRAIPNVDLVTVGPVIQREI-----PSD----------SSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVM

Query:  RSTV--------SGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGD
        R  V        +  E+E L CR ELE  G IVPWCSQVE+LS+P+ GCFVTHCGWNS LES+  GVPVVAFPQWTDQGTNAK+IE+  ++GVR+  N +
Subjt:  RSTV--------SGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGD

Query:  GIVERGEIKRCLELVMGDGTKGAVLRKNVRKWKELARKAQPE
        GIV   E+KRCL+LV+G G  G  +R+N +KWK+LAR+A  E
Subjt:  GIVERGEIKRCLELVMGDGTKGAVLRKNVRKWKELARKAQPE

F8WKW0 Crocetin glucosyltransferase, chloroplastic | 2.7e-85 | 47.53
Query:  NLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSN--SLVFKQFEVHFRVLNRDPHLKILINSFEELERDSL
        ++PSAL W Q  +V  IY+++F  + D ++   + P   IQ  GLP + + +LPS   P++ N  S     F+     L+ +   K+L+N+F+ LE  +L
Subjt:  NLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSN--SLVFKQFEVHFRVLNRDPHLKILINSFEELERDSL

Query:  RAIPNVDLVTVGPV-----IQREIPSDSS--------TKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSG---AEDE
        +AI + +L+ +GP+     +  + PS++S        +K +  WLNS+P  SVVYVSFGS+  L + Q+EEIA  LL+SG PFLWV+R+  +G    E++
Subjt:  RAIPNVDLVTVGPV-----IQREIPSDSS--------TKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSG---AEDE

Query:  ALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGD
         L C  ELE +G IVPWCSQ+E+L++P+ GCFVTHCGWNS LE++ CGVPVVAFP WTDQGTNAK+IE++ E+GVR+  N DG VE  EIKRC+E VM D
Subjt:  ALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGD

Query:  GTKGAVLRKNVRKWKELARKAQPE
        G KG  L++N +KWKELAR+A  E
Subjt:  GTKGAVLRKNVRKWKELARKAQPE

K4CWS6 UDP-glycosyltransferase 75C1 | 4.6e-85 | 47.89
Query:  ADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIR-KLFSIPDGRIQLSGLPLLSSDELPSLCDPANSN----SLVFKQFEVHFRVLNRDPHLKILIN
        A++    ++PSAL W Q A+V  IY+++FN + D ++    + P+  IQL  LPLL S +LPS    ++S     S     F+     L+ + + K+L+N
Subjt:  ADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIR-KLFSIPDGRIQLSGLPLLSSDELPSLCDPANSN----SLVFKQFEVHFRVLNRDPHLKILIN

Query:  SFEELERDSLRAIPNVDLVTVGPVIQREIPS--------------DSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRS
        +F+ LE + L+AI   +L+ +GP+I                      S   +M WLN+KPKSS+VY+SFGS+  LSR Q EEIA  L+E   PFLWV+R 
Subjt:  SFEELERDSLRAIPNVDLVTVGPVIQREIPS--------------DSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRS

Query:  TVSGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKR
             E+E LSC  ELE +GKIVPWCSQ+E+L++P+ GCFV+HCGWNS LES++ GVPVVAFP WTDQGTNAK+IE++ ++GVR+R N DG+VE  EIKR
Subjt:  TVSGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKR

Query:  CLELVMGDGTKGAVLRKNVRKWKELARKAQPE
        C+E+VM  G KG  +RKN +KWKELAR A  E
Subjt:  CLELVMGDGTKGAVLRKNVRKWKELARKAQPE

Q9LR44 UDP-glycosyltransferase 75B1 | 1.6e-74 | 47.53
Query:  FNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEELERDSLR
        F LPSAL W Q A VF+IY+  F  +    + +F +P+       L  L   +LPS   P+N+N   +  F+     L ++   KILIN+F+ LE ++L 
Subjt:  FNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEELERDSLR

Query:  AIPNVDLVTVGPVIQREIPSDSSTK-------SHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVM-----RSTVSGAEDEA----
        A PN+D+V VGP++  EI S S+ K       S+  WL+SK +SSV+YVSFG++  LS+ Q+EE+A AL+E   PFLWV+     R T +  E+E     
Subjt:  AIPNVDLVTVGPVIQREIPSDSSTK-------SHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVM-----RSTVSGAEDEA----

Query:  -LSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGD
            R ELE  G IV WCSQ+E+LS+ A GCFVTHCGW+S LES+  GVPVVAFP W+DQ TNAK++EE  ++GVR+R N DG+VERGEI+RCLE VM +
Subjt:  -LSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGD

Query:  GTKGAVLRKNVRKWKELARKAQPE
          K   LR+N +KWK LA +A  E
Subjt:  GTKGAVLRKNVRKWKELARKAQPE

Q9ZR25 Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase | 1.3e-71 | 44.91
Query:  ADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQL-SGLPLLSSDELPSLCDPANS---NSLVFKQFEVHFRVLNRDPHLKILINS
        A +   F+L SAL W + A+V  I++ +FN + D I          I L  GLP+L+  +LPS   P+      SL+ ++ E     L  +   K+L+NS
Subjt:  ADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQL-SGLPLLSSDELPSLCDPANS---NSLVFKQFEVHFRVLNRDPHLKILINS

Query:  FEELERDSLRAIPNVDLVTVGPVIQREI-----PSD-----------SSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMR
        F+ LE D+L+AI   +++ +GP+I         PSD           S+    + WL++ P+SSVVYVSFGS    ++ Q+EEIA  LL+ G PFLWV+R
Subjt:  FEELERDSLRAIPNVDLVTVGPVIQREI-----PSD-----------SSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMR

Query:  STVSGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDG-IVERGEI
          V+  E+  +SC  EL+  GKIV WCSQ+E+L++P+ GCFVTHCGWNS LESI+ GVP+VAFPQW DQGTNAK++E++  +GVR+RAN +G +V+  EI
Subjt:  STVSGAEDEALSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDG-IVERGEI

Query:  KRCLELVMGDGTKGAVLRKNVRKWKELARKAQPE
        +RC+E VM  G K   LR++  KWK+LARKA  E
Subjt:  KRCLELVMGDGTKGAVLRKNVRKWKELARKAQPE

Arabidopsis top hits | e value | % identity
AT1G05530.1 UDP-glucosyl transferase 75B2 | 1.1e-70 | 43.43
Query:  FNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEELERDSLR
        F+LPS   W Q A  F IY+++   +      +F  P+       LP L   +LPS   P+N+N      ++     L  + + KIL+N+F+ LE + L 
Subjt:  FNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEELERDSLR

Query:  AIPNVDLVTVGPVIQREI----------PSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVM------RSTVSGAEDEA
        AIPN+++V VGP++  EI            D  + S+  WL+SK +SSV+YVSFG++  LS+ Q+EE+A AL+E G PFLWV+       + + G E+  
Subjt:  AIPNVDLVTVGPVIQREI----------PSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVM------RSTVSGAEDEA

Query:  L----SCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELV
        +      R ELE  G IV WCSQ+E+L + A GCF+THCGW+S LES+  GVPVVAFP W+DQ  NAK++EE+ ++GVR+R N +G+VERGEI RCLE V
Subjt:  L----SCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELV

Query:  MGDGTKGAVLRKNVRKWKELARKAQPE
        M    K   LR+N  KWK LA +A  E
Subjt:  MGDGTKGAVLRKNVRKWKELARKAQPE

AT1G05560.1 UDP-glucosyltransferase 75B1 | 1.2e-75 | 47.53
Query:  FNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEELERDSLR
        F LPSAL W Q A VF+IY+  F  +    + +F +P+       L  L   +LPS   P+N+N   +  F+     L ++   KILIN+F+ LE ++L 
Subjt:  FNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEELERDSLR

Query:  AIPNVDLVTVGPVIQREIPSDSSTK-------SHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVM-----RSTVSGAEDEA----
        A PN+D+V VGP++  EI S S+ K       S+  WL+SK +SSV+YVSFG++  LS+ Q+EE+A AL+E   PFLWV+     R T +  E+E     
Subjt:  AIPNVDLVTVGPVIQREIPSDSSTK-------SHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVM-----RSTVSGAEDEA----

Query:  -LSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGD
            R ELE  G IV WCSQ+E+LS+ A GCFVTHCGW+S LES+  GVPVVAFP W+DQ TNAK++EE  ++GVR+R N DG+VERGEI+RCLE VM +
Subjt:  -LSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGD

Query:  GTKGAVLRKNVRKWKELARKAQPE
          K   LR+N +KWK LA +A  E
Subjt:  GTKGAVLRKNVRKWKELARKAQPE

AT4G14090.1 UDP-Glycosyltransferase superfamily protein | 7.0e-57 | 37.46
Query:  FNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEELERDSLR
        F+LP+ L W + A+V  IY+++FN      + LF +    I+L  LPL+++ +LPS   P+ +          H   L  + + KIL+N+F  LE D+L 
Subjt:  FNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEELERDSLR

Query:  AIPNVDLVTVGPVI-QREIPSD---SSTKSHMAWLNSKPKSSVVYVSFGS-ITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRAEL---E
        ++  + ++ +GP++   E  +D   SS + +  WL+SK + SV+Y+S G+    L    +E +   +L +  PFLW++R      E++  +   EL    
Subjt:  AIPNVDLVTVGPVI-QREIPSD---SSTKSHMAWLNSKPKSSVVYVSFGS-ITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRAEL---E

Query:  ARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAVLRK
         RG +V WCSQ  +L++ A GCFVTHCGWNS LES+  GVPVVAFPQ+ DQ T AK++E+    GV+++   +G V+  EI+RCLE VM  G +   +R+
Subjt:  ARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAVLRK

Query:  NVRKWKELARKAQPE
        N  KWK +A  A  E
Subjt:  NVRKWKELARKAQPE

AT4G15480.1 UDP-Glycosyltransferase superfamily protein | 1.2e-51 | 38.32
Query:  FNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEELERDSLR
        FN+P A+ W QS + FS Y+H+ +       +  + P+  ++L  +P+L +DE+PS   P +S    F+Q  +  +  N      +LI+SF+ LE++ + 
Subjt:  FNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEELERDSLR

Query:  AIPNV-DLVTVGPV--IQREIPSD------SSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRAEL
         + ++  + TVGP+  + R + SD       ST   + WL+S+PKSSVVY+SFG++  L + Q+EEIA  +L+SG  FLWV+R      + E      EL
Subjt:  AIPNV-DLVTVGPV--IQREIPSD------SSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRAEL

Query:  -----EARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRL--RANGDGIVERGEI-KRCLELVMGD
             + +G IV WC Q ++LS+P+  CFVTHCGWNS +ES++ GVPVV  PQW DQ T+A  + ++ ++GVRL   A  + +V R E+ ++ LE  +G+
Subjt:  -----EARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRL--RANGDGIVERGEI-KRCLELVMGD

Query:  GTKGAVLRKNVRKWKELARKA
          K   LRKN  KWK  A  A
Subjt:  GTKGAVLRKNVRKWKELARKA

AT4G15550.1 indole-3-acetate beta-D-glucosyltransferase | 4.4e-67 | 41.9
Query:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE
        +A+L   F+LPSAL W Q  +VFSI++H+FN + D I ++ + P   I+L  LPLL+  ++PS    +N  + +   F      L  + + KILIN+F+E
Subjt:  MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEE

Query:  LERDSLRAIP-NVDLVTVGPVIQREIPSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMR-STVSGAEDE-------A
        LE +++ ++P N  +V VGP++       SS   ++ WL++K  SSV+YVSFG++  LS+ QL E+  AL++S  PFLWV+   +    EDE        
Subjt:  LERDSLRAIP-NVDLVTVGPVIQREIPSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMR-STVSGAEDE-------A

Query:  LSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRL----RANGDGIVERGEIKRCLELV
         S R EL+  G +V WC Q  +L++ + GCFVTHCGWNS LES+  GVPVVAFPQW DQ  NAK++E+  ++GVR+       G  +V+  EI+RC+E V
Subjt:  LSCRAELEARGKIVPWCSQVEILSNPATGCFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRL----RANGDGIVERGEIKRCLELV

Query:  MGDGTKGAVLRKNVRKWKELARKAQPE
        M D  K    R N  +WK+LA +A  E
Subjt:  MGDGTKGAVLRKNVRKWKELARKAQPE


Sequences
CDS sequence
ATGGCCGACCTCACCCTCTCCTTCAACCTCCCCTCTGCTCTCTTCTGGAACCAGTCCGCCTCTGTCTTCTCCATTTACCACCACTTCTTCAACGCCCACCGAGACCCAAT
TCGGAAACTCTTCTCAATTCCCGATGGGCGAATTCAGCTGTCGGGCCTCCCTTTGCTGTCGTCTGACGAGCTCCCGTCCCTGTGCGATCCGGCCAATTCCAACTCTCTGG
TCTTCAAACAGTTCGAGGTCCATTTCCGCGTCCTCAATCGAGACCCCCACTTGAAGATTTTGATCAACTCGTTCGAGGAATTGGAGCGCGACTCCTTGAGAGCGATCCCC
AATGTGGATTTGGTCACAGTTGGGCCGGTGATTCAGCGCGAGATTCCGTCTGATTCCTCGACGAAATCCCACATGGCGTGGCTGAACTCGAAGCCGAAGTCGTCGGTGGT
GTACGTGTCGTTCGGGAGCATCACGGCGCTGTCGAGGCCCCAATTGGAGGAGATCGCCGGAGCGCTGCTGGAGTCGGGGAGCCCGTTTCTGTGGGTGATGAGGAGCACCG
TGAGCGGGGCAGAGGACGAGGCACTGAGCTGTAGGGCGGAGCTTGAGGCGCGTGGGAAGATAGTGCCGTGGTGCTCGCAGGTGGAGATTCTGTCCAATCCGGCGACCGGG
TGTTTCGTGACGCATTGCGGGTGGAATTCGTTGCTCGAGAGCATTGCATGTGGGGTGCCGGTGGTGGCGTTTCCGCAATGGACGGATCAGGGAACCAATGCGAAGATCAT
CGAGGAGTTGTTGGAGAGTGGCGTGAGGTTAAGGGCGAATGGTGATGGCATTGTGGAGCGAGGGGAGATCAAGAGATGCCTGGAGCTTGTGATGGGCGACGGAACAAAGG
GGGCGGTTTTGAGAAAGAACGTCAGAAAATGGAAGGAGCTGGCCAGGAAAGCACAGCCGGAGAGGGAGGTTCTTCCCACATCAATATTGGAGCTTTCGTTGATGAGGTTT
GCAATGCGACCCCTCTCTAGTAACCACTTTATGGAAGTTCTAAGCATAGTTGAATTTGTTATTTGA
mRNA sequence
ATGGCCGACCTCACCCTCTCCTTCAACCTCCCCTCTGCTCTCTTCTGGAACCAGTCCGCCTCTGTCTTCTCCATTTACCACCACTTCTTCAACGCCCACCGAGACCCAAT
TCGGAAACTCTTCTCAATTCCCGATGGGCGAATTCAGCTGTCGGGCCTCCCTTTGCTGTCGTCTGACGAGCTCCCGTCCCTGTGCGATCCGGCCAATTCCAACTCTCTGG
TCTTCAAACAGTTCGAGGTCCATTTCCGCGTCCTCAATCGAGACCCCCACTTGAAGATTTTGATCAACTCGTTCGAGGAATTGGAGCGCGACTCCTTGAGAGCGATCCCC
AATGTGGATTTGGTCACAGTTGGGCCGGTGATTCAGCGCGAGATTCCGTCTGATTCCTCGACGAAATCCCACATGGCGTGGCTGAACTCGAAGCCGAAGTCGTCGGTGGT
GTACGTGTCGTTCGGGAGCATCACGGCGCTGTCGAGGCCCCAATTGGAGGAGATCGCCGGAGCGCTGCTGGAGTCGGGGAGCCCGTTTCTGTGGGTGATGAGGAGCACCG
TGAGCGGGGCAGAGGACGAGGCACTGAGCTGTAGGGCGGAGCTTGAGGCGCGTGGGAAGATAGTGCCGTGGTGCTCGCAGGTGGAGATTCTGTCCAATCCGGCGACCGGG
TGTTTCGTGACGCATTGCGGGTGGAATTCGTTGCTCGAGAGCATTGCATGTGGGGTGCCGGTGGTGGCGTTTCCGCAATGGACGGATCAGGGAACCAATGCGAAGATCAT
CGAGGAGTTGTTGGAGAGTGGCGTGAGGTTAAGGGCGAATGGTGATGGCATTGTGGAGCGAGGGGAGATCAAGAGATGCCTGGAGCTTGTGATGGGCGACGGAACAAAGG
GGGCGGTTTTGAGAAAGAACGTCAGAAAATGGAAGGAGCTGGCCAGGAAAGCACAGCCGGAGAGGGAGGTTCTTCCCACATCAATATTGGAGCTTTCGTTGATGAGGTTT
GCAATGCGACCCCTCTCTAGTAACCACTTTATGGAAGTTCTAAGCATAGTTGAATTTGTTATTTGA
Protein sequence
MADLTLSFNLPSALFWNQSASVFSIYHHFFNAHRDPIRKLFSIPDGRIQLSGLPLLSSDELPSLCDPANSNSLVFKQFEVHFRVLNRDPHLKILINSFEELERDSLRAIP
NVDLVTVGPVIQREIPSDSSTKSHMAWLNSKPKSSVVYVSFGSITALSRPQLEEIAGALLESGSPFLWVMRSTVSGAEDEALSCRAELEARGKIVPWCSQVEILSNPATG
CFVTHCGWNSLLESIACGVPVVAFPQWTDQGTNAKIIEELLESGVRLRANGDGIVERGEIKRCLELVMGDGTKGAVLRKNVRKWKELARKAQPEREVLPTSILELSLMRF
AMRPLSSNHFMEVLSIVEFVI
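
The CDS and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code and a CDS that starts in frame; `translate` is an illustrative helper, not a function provided by the database. It is demonstrated on the first six and last five codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCCGACCTCACCCTC"))  # MADLTL
print(translate("GAATTTGTTATTTGA"))     # EFVI
```

Both fragments agree with the listed protein, which begins with MADLTL and ends with EFVI. Passing the full CDS (with line breaks removed) to the same function would allow a complete comparison.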