; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g20800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g20800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGlycosyltransferase
Genome locationchr7:15127745..15128752
RNA-Seq ExpressionMoc07g20800
SyntenyMoc07g20800
Gene Ontology termsGO:0080043 - quercetin 3-O-glucosyltransferase activity (molecular function)
GO:0080044 - quercetin 7-O-glucosyltransferase activity (molecular function)
InterPro domainsIPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase
IPR035595 - UDP-glycosyltransferase family, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593628.1 UDP-glycosyltransferase 75C1, partial [Cucurbita argyrosperma subsp. sororia]6.4e-11863.88Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        MA++A SF+LPSALFWNQSA+VF+IY+HFF+ ++D I   F+ P  +I LPGLPLL+S +LPSLC PAN NS V +L++ HF V+ ++PHLKILIN+F++
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA
        LE D LRAI +V+L+ +GPV+         +    K+++ WLNSKPKSSVVYVSFGSIAA+S  QLEEIA  LLDS  PFLWVMRK  +G E + VSCR 
Subjt:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA

Query:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI
        ELE +GKIV+WCSQ+E+L +P+ GCF+THCGWNSSLESIACGV VVAFPQWTDQ TNA+IIEE S+SGVRLR N D IVERGEIK+CL+LVM    EGE 
Subjt:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI

Query:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN
        LRRN  KWK+LA  AT +GGSS+VN+ +F+D V N
Subjt:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN

XP_022147213.1 crocetin glucosyltransferase, chloroplastic-like [Momordica charantia]2.2e-190100Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVIQREIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRAELEA
        LERDALRAIPNVDLVTVGPVIQREIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRAELEA
Subjt:  LERDALRAIPNVDLVTVGPVIQREIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRAELEA

Query:  RGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEILRRN
        RGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEILRRN
Subjt:  RGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEILRRN

Query:  VGKWKELAKKATGEGGSSHVNIGAFVDEVSNAAAL
        VGKWKELAKKATGEGGSSHVNIGAFVDEVSNAAAL
Subjt:  VGKWKELAKKATGEGGSSHVNIGAFVDEVSNAAAL

XP_022964031.1 crocetin glucosyltransferase, chloroplastic-like [Cucurbita moschata]6.4e-11864.18Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        MA++A SF+LPSALFWNQSA+VF+IY+HFF+ ++D I   F+ P  +I LPGLPLL S +LPSLC PAN NS V +L++ HF V+ ++PHLKILIN+F++
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA
        LE D LRAI +V+L+ +GPV+         +    K ++ WLNSKPKSSVVYVSFGSIAA+S  QLEEIA  LLDS  PFLWVMRK  +G E + VSCR 
Subjt:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA

Query:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI
        ELE +GKIV+WCSQ+E+L +P+ GCF+THCGWNSSLESIACGV VVAFPQWTDQ TNA+IIEE S+SGVRLR N D IVERGEIK+CL+LVM    EGE 
Subjt:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI

Query:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN
        LRRN  KWKELA  AT +GGSS+VN+ +F+D V N
Subjt:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN

XP_023000592.1 crocetin glucosyltransferase, chloroplastic-like [Cucurbita maxima]3.1e-12064.78Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        MAD+A SF+LPSALFWNQSA+VF+IY+HFF+ ++D I   FS P+ +I LPGLPLL+S +LPSLC PAN N+ V +L++ HF V+ ++PHLKILIN+F++
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA
        LE DALRAI +VDL+ +GPV+         +    K ++ WLNSKPKSSVVYVSFGSIAA+S  QLEEIA  LLDS RPFLWVMRKT +G E + VSCR 
Subjt:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA

Query:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI
        EL A+GKIV+WCSQ+E+L +P+ GCF+THCGWNSSLES+ACGV VVAFPQWTDQ TNA+IIEE S++GVRLR N D IVERGEIK+CL+LVM    EGE 
Subjt:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI

Query:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN
        LRRN  KWK+LA  AT +GGSS+VN+  F+D V N
Subjt:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN

XP_023514625.1 crocetin glucosyltransferase, chloroplastic-like [Cucurbita pepo subsp. pepo]1.1e-11763.88Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        MA++A SF+LPSALFWNQSA+VF+IY+HFF+ ++D I   FS P+ +I LPGLPLL S +LPSLC PAN NS V +L++ HF V+ ++PHLKILIN+F++
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA
        LE D LRAI +V+L+ +GPV+         +    K ++ WLNSKPKSSVVYVSFGSIAA+S  QLEEIA  LLDS  PFLWVMRK  +G E + VSCR 
Subjt:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA

Query:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI
        EL A+GKIV+WCSQ+E+L +P+ GCF+THCGWNSSLESIACGV VVAFPQWTDQ TNA+IIEE S+SGVRLR N D IVERGEIK+CL+LVM    EGE 
Subjt:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI

Query:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN
        LRRN  KWK+LA  AT +GGSS+ N+ +F+D V N
Subjt:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN

TrEMBL top hitse value%identityAlignment
A0A1S3C7B0 Glycosyltransferase4.5e-11760.68Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        MA++A SF+LP+ALFWNQSA+VF+IY+HFF+ +R+ I+  FS P   I LPGLP L+S +LPSLC PAN NS V +LF++HF+VL ++PHLKILINSFEE
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVIQREI----------------------PSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLW
        LE D  RA    +L+ +GPV+  +                       P+      +  WLNSKPKSSVVYVSFGSIAA+SK QLEEI  ALLD G  FLW
Subjt:  LERDALRAIPNVDLVTVGPVIQREI----------------------PSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLW

Query:  VMRKTVNGVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERG
        VMRK  +G E + +SC  ELEA+GK+V+WCSQ+E+LSNPA GCF+THCGWNSS+ES+ CGVPVVAFPQWTDQGTNAKIIE+LS+SGV+LR N + IVERG
Subjt:  VMRKTVNGVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERG

Query:  EIKRCLELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV
        EIK+CLE+VM    +GE  RRN  KWKELA KAT +GGSS+VNI  F+D +
Subjt:  EIKRCLELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV

A0A5D3CUV8 Glycosyltransferase4.5e-11760.68Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        MA++A SF+LP+ALFWNQSA+VF+IY+HFF+ +R+ I+  FS P   I LPGLP L+S +LPSLC PAN NS V +LF++HF+VL ++PHLKILINSFEE
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVIQREI----------------------PSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLW
        LE D  RA    +L+ +GPV+  +                       P+      +  WLNSKPKSSVVYVSFGSIAA+SK QLEEI  ALLD G  FLW
Subjt:  LERDALRAIPNVDLVTVGPVIQREI----------------------PSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLW

Query:  VMRKTVNGVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERG
        VMRK  +G E + +SC  ELEA+GK+V+WCSQ+E+LSNPA GCF+THCGWNSS+ES+ CGVPVVAFPQWTDQGTNAKIIE+LS+SGV+LR N + IVERG
Subjt:  VMRKTVNGVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERG

Query:  EIKRCLELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV
        EIK+CLE+VM    +GE  RRN  KWKELA KAT +GGSS+VNI  F+D +
Subjt:  EIKRCLELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV

A0A6J1D0N5 crocetin glucosyltransferase, chloroplastic-like1.0e-190100Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVIQREIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRAELEA
        LERDALRAIPNVDLVTVGPVIQREIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRAELEA
Subjt:  LERDALRAIPNVDLVTVGPVIQREIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRAELEA

Query:  RGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEILRRN
        RGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEILRRN
Subjt:  RGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEILRRN

Query:  VGKWKELAKKATGEGGSSHVNIGAFVDEVSNAAAL
        VGKWKELAKKATGEGGSSHVNIGAFVDEVSNAAAL
Subjt:  VGKWKELAKKATGEGGSSHVNIGAFVDEVSNAAAL

A0A6J1HHQ4 crocetin glucosyltransferase, chloroplastic-like3.1e-11864.18Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        MA++A SF+LPSALFWNQSA+VF+IY+HFF+ ++D I   F+ P  +I LPGLPLL S +LPSLC PAN NS V +L++ HF V+ ++PHLKILIN+F++
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA
        LE D LRAI +V+L+ +GPV+         +    K ++ WLNSKPKSSVVYVSFGSIAA+S  QLEEIA  LLDS  PFLWVMRK  +G E + VSCR 
Subjt:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA

Query:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI
        ELE +GKIV+WCSQ+E+L +P+ GCF+THCGWNSSLESIACGV VVAFPQWTDQ TNA+IIEE S+SGVRLR N D IVERGEIK+CL+LVM    EGE 
Subjt:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI

Query:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN
        LRRN  KWKELA  AT +GGSS+VN+ +F+D V N
Subjt:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN

A0A6J1KE23 crocetin glucosyltransferase, chloroplastic-like1.5e-12064.78Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        MAD+A SF+LPSALFWNQSA+VF+IY+HFF+ ++D I   FS P+ +I LPGLPLL+S +LPSLC PAN N+ V +L++ HF V+ ++PHLKILIN+F++
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA
        LE DALRAI +VDL+ +GPV+         +    K ++ WLNSKPKSSVVYVSFGSIAA+S  QLEEIA  LLDS RPFLWVMRKT +G E + VSCR 
Subjt:  LERDALRAIPNVDLVTVGPVIQR----EIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRA

Query:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI
        EL A+GKIV+WCSQ+E+L +P+ GCF+THCGWNSSLES+ACGV VVAFPQWTDQ TNA+IIEE S++GVRLR N D IVERGEIK+CL+LVM    EGE 
Subjt:  ELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEI

Query:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN
        LRRN  KWK+LA  AT +GGSS+VN+  F+D V N
Subjt:  LRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSN

SwissProt top hitse value%identityAlignment
A7MAS5 Phloretin 4'-O-glucosyltransferase4.6e-8748.46Show/hide
Query:  ADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGR-----IQLPGLPL-LSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILI
        A +A   +LPS L W Q A+VF IY+++F+ ++D IR   S          I+LPGLPL  +S +LPS     NP +    LF     +L R+ +  IL+
Subjt:  ADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGR-----IQLPGLPL-LSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILI

Query:  NSFEELERDALRAIPNVDLVTVGPVIQREI-----PSDAS----------TKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVM
        N+F+ LE +AL+AI   +L+ VGP+I         PSD S            +++ WLNSKP+ SV+YVSFGSI+ L K Q+EEIA  LLD G PFLWV+
Subjt:  NSFEELERDALRAIPNVDLVTVGPVIQREI-----PSDAS----------TKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVM

Query:  RKTVN--------GVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGD
        R  V           E+E + CR ELE  G IV WCSQVE+LS+P+ GCFVTHCGWNSSLES+  GVPVVAFPQWTDQGTNAK+IE+  ++GVR+  N +
Subjt:  RKTVN--------GVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGD

Query:  DIVERGEIKRCLELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV
         IV   E+KRCL+LV+     GE +RRN  KWK+LA++A  EG SS  N+ AF+D++
Subjt:  DIVERGEIKRCLELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV

F8WKW0 Crocetin glucosyltransferase, chloroplastic2.7e-8747.54Show/hide
Query:  ADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQL--FDAHFRVLNRDPHLKILINSFE
        A +A   ++PSAL W Q  +V  IY+++F  + D ++   + P   IQ PGLP + + +LPS   P++ N   F L  F      L+ +   K+L+N+F+
Subjt:  ADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQL--FDAHFRVLNRDPHLKILINSFE

Query:  ELERDALRAIPNVDLVTVGPV-----IQREIPSDAS--------TKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNG
         LE  AL+AI + +L+ +GP+     +  + PS+ S        +K +  WLNS+P  SVVYVSFGS+  L K Q+EEIA  LL SGRPFLWV+R   NG
Subjt:  ELERDALRAIPNVDLVTVGPV-----IQREIPSDAS--------TKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNG

Query:  ---VEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRC
            E++ + C  ELE +G IV WCSQ+E+L++P+ GCFVTHCGWNS+LE++ CGVPVVAFP WTDQGTNAK+IE++ E+GVR+  N D  VE  EIKRC
Subjt:  ---VEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRC

Query:  LELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDE
        +E VMD   +G  L+RN  KWKELA++A  E GSS  N+ AFV++
Subjt:  LELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDE

K4CWS6 UDP-glycosyltransferase 75C19.4e-8847.14Show/hide
Query:  ADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIR-KLFSIPNGRIQLPGLPLLSSDELPSLCEPANPN----SLVFQLFDAHFRVLNRDPHLKILIN
        A++A   ++PSAL W Q A+V  IY+++F+ + D ++    + PN  IQLP LPLL S +LPS    ++      S     F      L+ + + K+L+N
Subjt:  ADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIR-KLFSIPNGRIQLPGLPLLSSDELPSLCEPANPN----SLVFQLFDAHFRVLNRDPHLKILIN

Query:  SFEELERDALRAIPNVDLVTVGPVI-----------QREIPSDASTKA---HVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRK
        +F+ LE + L+AI   +L+ +GP+I           +     D   K+   ++ WLN+KPKSS+VY+SFGS+  LS+ Q EEIA  L++  RPFLWV+R 
Subjt:  SFEELERDALRAIPNVDLVTVGPVI-----------QREIPSDASTKA---HVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRK

Query:  TVNGVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKR
             E+E +SC  ELE +GKIV WCSQ+E+L++P+ GCFV+HCGWNS+LES++ GVPVVAFP WTDQGTNAK+IE++ ++GVR+R N D +VE  EIKR
Subjt:  TVNGVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKR

Query:  CLELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSNA
        C+E+VMD   +GE +R+N  KWKELA+ A  EGGSS VN+ AFV +VS +
Subjt:  CLELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVSNA

Q9LR44 UDP-glycosyltransferase 75B13.6e-7947.23Show/hide
Query:  LALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEELER
        +A  F LPSAL W Q A VF+IY+  F  ++              +LP L  L   +LPS   P+N N   +  F      L ++   KILIN+F+ LE 
Subjt:  LALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEELER

Query:  DALRAIPNVDLVTVGPVIQREIPSDASTK-------AHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVM-----RKTVNGVEDEA
        +AL A PN+D+V VGP++  EI S ++ K       ++  WL+SK +SSV+YVSFG++  LSK Q+EE+A AL++  RPFLWV+     R+T    E+E 
Subjt:  DALRAIPNVDLVTVGPVIQREIPSDASTK-------AHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVM-----RKTVNGVEDEA

Query:  -----VSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLEL
                R ELE  G IVSWCSQ+E+LS+ A GCFVTHCGW+S+LES+  GVPVVAFP W+DQ TNAK++EE  ++GVR+R N D +VERGEI+RCLE 
Subjt:  -----VSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLEL

Query:  VMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV
        VM+ ++    LR N  KWK LA +A  EGGSS  N+ AFV+++
Subjt:  VMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV

Q9ZR25 Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase3.9e-7846.53Show/hide
Query:  ADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLP-GLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        A +A  F+L SAL W + A+V  I++ +F+ + D I       +  I LP GLP+L+  +LPS   P+  +     L       L  +   K+L+NSF+ 
Subjt:  ADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLP-GLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVIQREI-----PSDASTKAH-----------VAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTV
        LE DAL+AI   +++ +GP+I         PSD S               + WL++ P+SSVVYVSFGS    +K Q+EEIA  LLD GRPFLWV+R  V
Subjt:  LERDALRAIPNVDLVTVGPVIQREI-----PSDASTKAH-----------VAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTV

Query:  NGVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGD-DIVERGEIKRC
        N  E+  +SC  EL+  GKIVSWCSQ+E+L++P+ GCFVTHCGWNS+LESI+ GVP+VAFPQW DQGTNAK++E++  +GVR+RAN +  +V+  EI+RC
Subjt:  NGVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGD-DIVERGEIKRC

Query:  LELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV
        +E VMD   +   LR + GKWK+LA+KA  E GSS  N+  F+DEV
Subjt:  LELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV

Arabidopsis top hitse value%identityAlignment
AT1G05530.1 UDP-glucosyl transferase 75B21.7e-7644.61Show/hide
Query:  LALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEELER
        +A  F+LPS   W Q A  F IY+++   +           N   + P LP L   +LPS   P+N N     ++      L  + + KIL+N+F+ LE 
Subjt:  LALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEELER

Query:  DALRAIPNVDLVTVGPVIQREI----------PSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVN------GV
        + L AIPN+++V VGP++  EI            D  + ++  WL+SK +SSV+YVSFG++  LSK Q+EE+A AL++ GRPFLWV+   +N      G 
Subjt:  DALRAIPNVDLVTVGPVIQREI----------PSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVN------GV

Query:  EDEAV----SCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRC
        E+  +      R ELE  G IVSWCSQ+E+L + A GCF+THCGW+SSLES+  GVPVVAFP W+DQ  NAK++EE+ ++GVR+R N + +VERGEI RC
Subjt:  EDEAV----SCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRC

Query:  LELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFV
        LE VM+  A+   LR N  KWK LA +A  EGGSS  N+ AFV
Subjt:  LELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFV

AT1G05560.1 UDP-glucosyltransferase 75B12.5e-8047.23Show/hide
Query:  LALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEELER
        +A  F LPSAL W Q A VF+IY+  F  ++              +LP L  L   +LPS   P+N N   +  F      L ++   KILIN+F+ LE 
Subjt:  LALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEELER

Query:  DALRAIPNVDLVTVGPVIQREIPSDASTK-------AHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVM-----RKTVNGVEDEA
        +AL A PN+D+V VGP++  EI S ++ K       ++  WL+SK +SSV+YVSFG++  LSK Q+EE+A AL++  RPFLWV+     R+T    E+E 
Subjt:  DALRAIPNVDLVTVGPVIQREIPSDASTK-------AHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVM-----RKTVNGVEDEA

Query:  -----VSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLEL
                R ELE  G IVSWCSQ+E+LS+ A GCFVTHCGW+S+LES+  GVPVVAFP W+DQ TNAK++EE  ++GVR+R N D +VERGEI+RCLE 
Subjt:  -----VSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLEL

Query:  VMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV
        VM+ ++    LR N  KWK LA +A  EGGSS  N+ AFV+++
Subjt:  VMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEV

AT1G05675.1 UDP-Glycosyltransferase superfamily protein1.4e-5738.51Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGR------IQLPGLPLLSSDELPS-LCEPANPNSLVFQLFDAHFRVLNRDPHLKI
        + D+A S+ L  A+F+ Q   V +IY+H F       +  FS+P+ +         P LP+L++++LPS LCE ++   ++  + D   ++ N D    +
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGR------IQLPGLPLLSSDELPS-LCEPANPNSLVFQLFDAHFRVLNRDPHLKI

Query:  LINSFEELERDALRAIPNV-DLVTVGPVI-----QREIPSD---------ASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLW
        L N+F++LE   L+ I +V  ++ +GP +      + +  D         A     + WLNSK  SSVVYVSFGS+  L K QL E+A  L  SG  FLW
Subjt:  LINSFEELERDALRAIPNV-DLVTVGPVI-----QREIPSD---------ASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLW

Query:  VMRKTVNGVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERG
        V+R+T      E  +   E+  +G  VSW  Q+E+L++ + GCFVTHCGWNS+LE ++ GVP++  P W DQ TNAK +E++ + GVR++A+ D  V R 
Subjt:  VMRKTVNGVEDEAVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERG

Query:  EIKRCLELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFV
        E  R +E VM+   +G+ +R+N  KWK LA++A  EGGSS  NI  FV
Subjt:  EIKRCLELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFV

AT4G14090.1 UDP-Glycosyltransferase superfamily protein1.1e-6238.39Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        ++ +A  F+LP+ L W + A+V  IY+++F+      + LF +    I+LP LPL+++ +LPS  +P+            H   L  + + KIL+N+F  
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIPNVDLVTVGPVI-QREIPSD---ASTKAHVAWLNSKPKSSVVYVSFGSIA-ALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCR
        LE DAL ++  + ++ +GP++   E  +D   +S + +  WL+SK + SV+Y+S G+ A  L +  +E +   +L + RPFLW++R+     E++  +  
Subjt:  LERDALRAIPNVDLVTVGPVI-QREIPSD---ASTKAHVAWLNSKPKSSVVYVSFGSIA-ALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCR

Query:  AEL---EARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRA
         EL     RG +V WCSQ  +L++ A GCFVTHCGWNS+LES+  GVPVVAFPQ+ DQ T AK++E+    GV+++   +  V+  EI+RCLE VM    
Subjt:  AEL---EARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRA

Query:  EGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDE
        E E +R N  KWK +A  A  EGG S +N+  FVDE
Subjt:  EGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDE

AT4G15550.1 indole-3-acetate beta-D-glucosyltransferase1.3e-7644.44Show/hide
Query:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE
        +A+LA  F+LPSAL W Q  +VFSI++H+F+ + D I ++ + P+  I+LP LPLL+  ++PS    +N  + +   F      L  + + KILIN+F+E
Subjt:  MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEE

Query:  LERDALRAIP-NVDLVTVGPVIQREIPSDASTKA-HVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMR-KTVNGVEDE-------
        LE +A+ ++P N  +V VGP++   + +D S++  ++ WL++K  SSV+YVSFG++A LSK QL E+  AL+ S RPFLWV+  K+    EDE       
Subjt:  LERDALRAIP-NVDLVTVGPVIQREIPSDASTKA-HVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMR-KTVNGVEDE-------

Query:  AVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDD----IVERGEIKRCLEL
          S R EL+  G +VSWC Q  +L++ + GCFVTHCGWNS+LES+  GVPVVAFPQW DQ  NAK++E+  ++GVR+    ++    +V+  EI+RC+E 
Subjt:  AVSCRAELEARGKIVSWCSQVEILSNPATGCFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDD----IVERGEIKRCLEL

Query:  VMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDE
        VM+ +AE    R N  +WK+LA +A  EGGSS  ++ AFVDE
Subjt:  VMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGACCTCGCCCTCTCCTTCAACCTCCCCTCCGCTCTCTTCTGGAACCAGTCCGCCTCTGTCTTCTCCATTTACCACCACTTCTTCCACGCCCACCGAGACCCAAT
TCGGAAACTCTTCTCAATTCCCAATGGCCGAATTCAGCTGCCGGGCCTCCCTCTGCTGTCCTCCGACGAGCTCCCCTCCCTGTGCGAGCCGGCCAATCCCAACTCTCTGG
TCTTCCAACTCTTCGACGCCCATTTCCGCGTCCTCAATCGAGACCCCCATTTGAAGATTTTGATCAACTCGTTCGAGGAATTGGAGCGCGACGCCTTGAGAGCGATCCCC
AATGTGGATTTGGTCACAGTTGGGCCGGTGATTCAGCGCGAGATTCCGTCGGATGCCTCGACAAAAGCCCACGTGGCCTGGCTGAACTCGAAGCCGAAGTCGTCGGTGGT
GTACGTGTCATTTGGGAGCATCGCGGCGCTGTCGAAGCCCCAATTGGAGGAGATCGCCGGAGCGCTGCTGGACTCGGGGCGGCCATTTCTCTGGGTGATGAGGAAAACCG
TAAATGGGGTGGAGGACGAGGCAGTCAGCTGCAGAGCTGAGCTGGAGGCGCGCGGGAAGATAGTATCGTGGTGCTCGCAGGTGGAGATTTTGTCGAATCCGGCGACTGGG
TGTTTCGTGACGCACTGCGGGTGGAATTCTTCGCTCGAGAGCATTGCGTGTGGGGTGCCGGTGGTGGCGTTTCCGCAGTGGACAGATCAGGGGACCAATGCGAAGATCAT
CGAGGAGTTGTCGGAGAGTGGCGTGAGGTTGAGGGCAAATGGTGATGACATTGTGGAGCGAGGGGAGATCAAGAGATGCCTGGAGCTTGTGATGGATCCCAGAGCGGAGG
GGGAGATTTTGAGGAGGAATGTCGGAAAATGGAAGGAGCTGGCCAAGAAAGCGACTGGAGAAGGAGGTTCTTCCCACGTCAATATTGGGGCTTTTGTTGATGAGGTCTCC
AATGCGGCTGCCCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGACCTCGCCCTCTCCTTCAACCTCCCCTCCGCTCTCTTCTGGAACCAGTCCGCCTCTGTCTTCTCCATTTACCACCACTTCTTCCACGCCCACCGAGACCCAAT
TCGGAAACTCTTCTCAATTCCCAATGGCCGAATTCAGCTGCCGGGCCTCCCTCTGCTGTCCTCCGACGAGCTCCCCTCCCTGTGCGAGCCGGCCAATCCCAACTCTCTGG
TCTTCCAACTCTTCGACGCCCATTTCCGCGTCCTCAATCGAGACCCCCATTTGAAGATTTTGATCAACTCGTTCGAGGAATTGGAGCGCGACGCCTTGAGAGCGATCCCC
AATGTGGATTTGGTCACAGTTGGGCCGGTGATTCAGCGCGAGATTCCGTCGGATGCCTCGACAAAAGCCCACGTGGCCTGGCTGAACTCGAAGCCGAAGTCGTCGGTGGT
GTACGTGTCATTTGGGAGCATCGCGGCGCTGTCGAAGCCCCAATTGGAGGAGATCGCCGGAGCGCTGCTGGACTCGGGGCGGCCATTTCTCTGGGTGATGAGGAAAACCG
TAAATGGGGTGGAGGACGAGGCAGTCAGCTGCAGAGCTGAGCTGGAGGCGCGCGGGAAGATAGTATCGTGGTGCTCGCAGGTGGAGATTTTGTCGAATCCGGCGACTGGG
TGTTTCGTGACGCACTGCGGGTGGAATTCTTCGCTCGAGAGCATTGCGTGTGGGGTGCCGGTGGTGGCGTTTCCGCAGTGGACAGATCAGGGGACCAATGCGAAGATCAT
CGAGGAGTTGTCGGAGAGTGGCGTGAGGTTGAGGGCAAATGGTGATGACATTGTGGAGCGAGGGGAGATCAAGAGATGCCTGGAGCTTGTGATGGATCCCAGAGCGGAGG
GGGAGATTTTGAGGAGGAATGTCGGAAAATGGAAGGAGCTGGCCAAGAAAGCGACTGGAGAAGGAGGTTCTTCCCACGTCAATATTGGGGCTTTTGTTGATGAGGTCTCC
AATGCGGCTGCCCTCTAG
Protein sequenceShow/hide protein sequence
MADLALSFNLPSALFWNQSASVFSIYHHFFHAHRDPIRKLFSIPNGRIQLPGLPLLSSDELPSLCEPANPNSLVFQLFDAHFRVLNRDPHLKILINSFEELERDALRAIP
NVDLVTVGPVIQREIPSDASTKAHVAWLNSKPKSSVVYVSFGSIAALSKPQLEEIAGALLDSGRPFLWVMRKTVNGVEDEAVSCRAELEARGKIVSWCSQVEILSNPATG
CFVTHCGWNSSLESIACGVPVVAFPQWTDQGTNAKIIEELSESGVRLRANGDDIVERGEIKRCLELVMDPRAEGEILRRNVGKWKELAKKATGEGGSSHVNIGAFVDEVS
NAAAL