CuGenDBv2

Spg020707 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg020707
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Glycosyltransferase
Genome location: scaffold10:26711550..26724030
RNA-Seq Expression: Spg020707
Synteny: Spg020707
Gene Ontology terms:
  GO:0080043 - quercetin 3-O-glucosyltransferase activity (molecular function)
  GO:0080044 - quercetin 7-O-glucosyltransferase activity (molecular function)
InterPro domains:
  IPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase
  IPR035595 - UDP-glycosyltransferase family, conserved site
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAG2728265.1 hypothetical protein I3760_01G197000 [Carya illinoinensis] | e value: 2.5e-11 | % identity: 42.59
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPILDAGDLPI------SFPGVGLGFVKSFKASQFGNLEEAKWMLCNTF
        MPW + VA   GL GA FFTQSCAV  IY     G +     EG  VS+ SLP+L   DLP       S+P        +   +QF N++EA W+LCNTF
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPILDAGDLPI------SFPGVGLGFVKSFKASQFGNLEEAKWMLCNTF

Query:  YELESAVL
          LE  ++
Subjt:  YELESAVL

KAG2728265.1 hypothetical protein I3760_01G197000 [Carya illinoinensis] | e value: 4.3e-83 | % identity: 62.82
Query:  KWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSE
        +W +K++GP +PS YLD RLE D+ YG+ LF  DA DA  +WL++K   SV+Y SFGS+V LGE Q+ E+A  L++ N +F+WV+ ES ++KLP+NF+ E
Subjt:  KWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSE

Query:  TSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIR
        T+EKGLV+SWC QL+VLAH A GCF++HCGWNSTLEALS GVP+V +P+W+DQTTN+KFI DVW VGVR K+D   I T EEI  CI+EVMEGERGKE++
Subjt:  TSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIR

Query:  KNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS
        KNS++WKELA+EAVDEGGSSDKNI +FV +L +S
Subjt:  KNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS

XP_022156538.1 UDP-glycosyltransferase 74E2-like [Momordica charantia] | e value: 8.0e-106 | % identity: 79.32
Query:  VGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNF
        VG +WRM+S+GP VPSAYLD RLE D+SYG+ LFSSD SD T EWLNSK ATSVVYVSFGS+V+L ENQVREIASSLRD N+ F+WV+GES  QKLPTNF
Subjt:  VGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNF

Query:  VSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGK
        +SETSEKGLVLSWC QLQVLAH+A+GCFV+HCGWNSTLEALS GVPMV VPKWSDQTTN+KFI+DVWGVGVRAKV+ + IFT  EI KCI+E+MEGERGK
Subjt:  VSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGK

Query:  EIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS
         IR+NSLKWKELAREA+DEGG+SD NIDDFVK+L NS
Subjt:  EIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS

XP_022156538.1 UDP-glycosyltransferase 74E2-like [Momordica charantia] | e value: 2.1e-29 | % identity: 66.67
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPI-LDAGDLPISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTFYELES
        MPW+LPVA R GL GAPFFTQS AVN I+DLI RG LE+PV +G R+S+ SL + LDA DLPISFP  G+  VK FKASQF NL+E KWM CNTF++LE 
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPI-LDAGDLPISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTFYELES

Query:  AV
         V
Subjt:  AV

XP_022156538.1 UDP-glycosyltransferase 74E2-like [Momordica charantia] | e value: 9.1e-102 | % identity: 77.73
Query:  VGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDAS-DATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTN
        VG +W MKS+GP VPSAYLD RLE DESYG+ LFSSDAS DAT EWLNSKPATSVVYVSFGS+VNL E QVREI+SSLRD N+ FIWV+ ES KQKLP +
Subjt:  VGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDAS-DATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTN

Query:  FVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERG
        F SETS KGLVL+WC+Q+QVLAH+A+GCFV+HCGWNS LEALS GVPMV VPKWSDQTTN+KF+ADVWGVGVRAKV+ + IFT++EI KCIREVME ERG
Subjt:  FVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERG

Query:  KEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS
        K+IR+NSLKWKELAREAVDE G+SD NI+DFVK+L +S
Subjt:  KEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS

XP_022156543.1 UDP-glycosyltransferase 74E2-like [Momordica charantia] | e value: 9.0e-25 | % identity: 59.22
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAE-GGRVSVASLP-ILDAGDLPISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTFYELE
        MPW+L +ANR GL GAPFFTQS AVNHIYDLI RG LE+PV +    +S+ SLP +L A DLPI+FP  G+  VK  K  QF NL++A+W+ CNTF++LE
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAE-GGRVSVASLP-ILDAGDLPISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTFYELE

Query:  SAV
        + V
Subjt:  SAV

XP_022156543.1 UDP-glycosyltransferase 74E2-like [Momordica charantia] | e value: 4.3e-83 | % identity: 62.82
Query:  KWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSE
        +W +K++GP +PS YLD RLE D+ YG+ LF  DA DA  +WL++K   SV+Y SFGS+V LGE Q+ E+A  L++ N +F+WV+ ES ++KLP+NF+ E
Subjt:  KWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSE

Query:  TSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIR
        T+EKGLV+SWC QL+VLAH A GCF++HCGWNSTLEALS GVP+V +P+W+DQTTN+KFI DVW VGVR K+D   I T EEI  CI+EVMEGERGKE++
Subjt:  TSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIR

Query:  KNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS
        KNS++WKELA+EAVDEGGSSDKNI +FV +L +S
Subjt:  KNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS

XP_042944679.1 mogroside IE synthase-like isoform X1 [Carya illinoinensis] | e value: 2.1e-13 | % identity: 45.63
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPILDAGDLPISFPGV-GLGFVKSFKASQFGNLEEAKWMLCNTFYELES
        +PW L VA +LG+DGAPFFTQSCAVN IY    RG      AEG  +S+ SLP L   D+P    G      +      QF NL+E  W+L NTF +LE 
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPILDAGDLPISFPGV-GLGFVKSFKASQFGNLEEAKWMLCNTFYELES

Query:  AVL
         ++
Subjt:  AVL

XP_042944679.1 mogroside IE synthase-like isoform X1 [Carya illinoinensis] | e value: 4.3e-83 | % identity: 62.82
Query:  KWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSE
        +W +K++GP +PS YLD RLE D+ YG+ LF  DA DA  +WL++K   SV+Y SFGS+V LGE Q+ E+A  L++ N +F+WV+ ES ++KLP+NF+ E
Subjt:  KWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSE

Query:  TSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIR
        T+EKGLV+SWC QL+VLAH A GCF++HCGWNSTLEALS GVP+V +P+W+DQTTN+KFI DVW VGVR K+D   I T EEI  CI+EVMEGERGKE++
Subjt:  TSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIR

Query:  KNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS
        KNS++WKELA+EAVDEGGSSDKNI +FV +L +S
Subjt:  KNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS

XP_042944685.1 mogroside IE synthase-like isoform X2 [Carya illinoinensis] | e value: 2.5e-11 | % identity: 42.59
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPILDAGDLPI------SFPGVGLGFVKSFKASQFGNLEEAKWMLCNTF
        MPW + VA   GL GA FFTQSCAV  IY     G +     EG  VS+ SLP+L   DLP       S+P        +   +QF N++EA W+LCNTF
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPILDAGDLPI------SFPGVGLGFVKSFKASQFGNLEEAKWMLCNTF

Query:  YELESAVL
          LE  ++
Subjt:  YELESAVL

TrEMBL top hits | e value | % identity | Alignment
A0A2I4H5B8 Glycosyltransferase | e value: 2.0e-14 | % identity: 45.87
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDL-EIPVAEGGRVSVASLPILDAGDLPI------SFPGVGLGFVKSFKASQFGNLEEAKWMLCNT
        +PW L +A +LG+DGAPFFTQSCAVN IY    RG   +IPV EG  +S+ S+P L   D+P       S+P      +     +QF N +EA W+ CNT
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDL-EIPVAEGGRVSVASLPILDAGDLPI------SFPGVGLGFVKSFKASQFGNLEEAKWMLCNT

Query:  FYELESAVL
        F +LE  VL
Subjt:  FYELESAVL

A0A6J1DQW5 UDP-glycosyltransferase 74E2-like | e value: 3.9e-106 | % identity: 79.32
Query:  VGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNF
        VG +WRM+S+GP VPSAYLD RLE D+SYG+ LFSSD SD T EWLNSK ATSVVYVSFGS+V+L ENQVREIASSLRD N+ F+WV+GES  QKLPTNF
Subjt:  VGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNF

Query:  VSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGK
        +SETSEKGLVLSWC QLQVLAH+A+GCFV+HCGWNSTLEALS GVPMV VPKWSDQTTN+KFI+DVWGVGVRAKV+ + IFT  EI KCI+E+MEGERGK
Subjt:  VSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGK

Query:  EIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS
         IR+NSLKWKELAREA+DEGG+SD NIDDFVK+L NS
Subjt:  EIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS

A0A6J1DQW5 UDP-glycosyltransferase 74E2-like | e value: 1.0e-29 | % identity: 66.67
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPI-LDAGDLPISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTFYELES
        MPW+LPVA R GL GAPFFTQS AVN I+DLI RG LE+PV +G R+S+ SL + LDA DLPISFP  G+  VK FKASQF NL+E KWM CNTF++LE 
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPI-LDAGDLPISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTFYELES

Query:  AV
         V
Subjt:  AV

A0A6J1DQW5 UDP-glycosyltransferase 74E2-like | e value: 4.4e-102 | % identity: 77.73
Query:  VGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDAS-DATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTN
        VG +W MKS+GP VPSAYLD RLE DESYG+ LFSSDAS DAT EWLNSKPATSVVYVSFGS+VNL E QVREI+SSLRD N+ FIWV+ ES KQKLP +
Subjt:  VGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDAS-DATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTN

Query:  FVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERG
        F SETS KGLVL+WC+Q+QVLAH+A+GCFV+HCGWNS LEALS GVPMV VPKWSDQTTN+KF+ADVWGVGVRAKV+ + IFT++EI KCIREVME ERG
Subjt:  FVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERG

Query:  KEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS
        K+IR+NSLKWKELAREAVDE G+SD NI+DFVK+L +S
Subjt:  KEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS

A0A6J1DQX0 Glycosyltransferase | e value: 4.4e-25 | % identity: 59.22
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAE-GGRVSVASLP-ILDAGDLPISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTFYELE
        MPW+L +ANR GL GAPFFTQS AVNHIYDLI RG LE+PV +    +S+ SLP +L A DLPI+FP  G+  VK  K  QF NL++A+W+ CNTF++LE
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAE-GGRVSVASLP-ILDAGDLPISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTFYELE

Query:  SAV
        + V
Subjt:  SAV

A0A6J1DQX0 Glycosyltransferase | e value: 2.3e-82 | % identity: 61.97
Query:  KWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSE
        +W +K++GP +PS YLD RLE D+ YG+ LF  D  DA   WL++K   SVVY SFGS+ +LGE Q+ E+   L+D N  F+WV+ E+ ++KLP NF+ E
Subjt:  KWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSE

Query:  TSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIR
        T+EKG+V+SWC QL+VLAH A+GCF++HCGWNSTLEALS GVPMV +P+W+DQTTN+KFI DVW VGVR K+D   I T EEIG CIREVMEGERGKE++
Subjt:  TSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIR

Query:  KNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS
         NS++WKELA+EAVDE GSSDKNI++FV +L +S
Subjt:  KNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS

A0A7N2RCP5 Glycosyltransferase | e value: 8.7e-82 | % identity: 62.34
Query:  KWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSE
        KW +K++GP +PS YLD RLE D+ YG+ LF  D  DA  +WL++K   SV+Y SFGS+ +LGE Q++E+   L++ N +F+WV+ E+ ++KLPTNF+ E
Subjt:  KWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSE

Query:  TSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIR
        T EKGLV+SWC+QL+VLAH A+GCF++HCGWNSTLEALS GVPMV +P+W+DQ TN+KFIADVW VGVR K+D   I T EEI  CIREV+E ERGKE+R
Subjt:  TSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIR

Query:  KNSLKWKELAREAVDEGGSSDKNIDDFVKQL
         NS+KWKELA+EA+DEGGSSDKNI++FV +L
Subjt:  KNSLKWKELAREAVDEGGSSDKNIDDFVKQL

A0A7N2RCP5 Glycosyltransferase | e value: 1.0e-10 | % identity: 41.67
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPILDAGDL------PISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTF
        +PW L VA + G+DGAPF+TQSCAVN +Y    +G + +P+ E   VS+ S+P L   DL      P S+P +   F+      QF N  EAKW+  ++F
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPILDAGDL------PISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTF

Query:  YELESAVL
         ELE  V+
Subjt:  YELESAVL

A0A7N2RCP5 Glycosyltransferase | e value: 2.5e-81 | % identity: 60.17
Query:  VVEEVGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKL
        V+  +  +W +K++GP +PS YLD RLE D+ YG+ LF  D  DA  +WL++K   SV+Y SFGS+  LGE Q+ E+A  L++ N +F+WV+ ES ++KL
Subjt:  VVEEVGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKL

Query:  PTNFVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEG
        P+NF+ ET+EKGLV+SW  QL VLAH A GCF++HCGWNSTLEALS GVP+V +P+W+DQTTN+KFI DVW VGVR K+D   I T EEI  CI+EVMEG
Subjt:  PTNFVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEG

Query:  ERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS
        ERGKE++KNS++WKELA+EAVDEGGSSDKNI +FV +L +S
Subjt:  ERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNS

SwissProt top hits | e value | % identity | Alignment
K7NBW3 Mogroside IE synthase | e value: 9.9e-75 | % identity: 56.14
Query:  MKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSETSE
        +K+VGP VPSAYLD R+E D+ YG+ LF  +  D   +WL+SKP+ SV+YVS+GS+V +GE Q++E+A  +++   FF+WV+ ++  +KLP NFV   +E
Subjt:  MKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSETSE

Query:  KGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRKNS
        KGLV+SWC+QL+VLAH +VGCF +HCGWNSTLEAL  GVP+V  P+W+DQ TN+KF+ DVW VG R K +   + + EE+  CI EVMEGER  E + NS
Subjt:  KGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRKNS

Query:  LKWKELAREAVDEGGSSDKNIDDFVKQL
        ++WK+ A+EAVDEGGSSDKNI++FV  L
Subjt:  LKWKELAREAVDEGGSSDKNIDDFVKQL

K7NBW3 Mogroside IE synthase | e value: 1.5e-14 | % identity: 39.81
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPILDAGDLP-ISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTFYELES
        MPW+L VA   GLD APF+TQSCA+N I   +  G L++P  E   +S+ S+P+L   DLP   F       +     SQ+ N+++A  + CNTF +LE 
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPILDAGDLP-ISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTFYELES

Query:  AVL
         ++
Subjt:  AVL

P0C7P7 UDP-glycosyltransferase 74E1 | e value: 1.6e-72 | % identity: 56.39
Query:  WRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSET
        W + ++GP VPS YLD RL +D++YG  LF +  ++   EWLNSK  +SVVYVSFGS+V L ++Q+ E+A+ L+    FF+WV+ E+ ++KLP N++ E 
Subjt:  WRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSET

Query:  SEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRK
         EKGL +SW  QL+VL H ++GCFV+HCGWNSTLE LS GVPM+ +P W+DQ TN+KF+ DVW VGVR K D D     EE  + + EVME E+GKEIRK
Subjt:  SEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRK

Query:  NSLKWKELAREAVDEGGSSDKNIDDFV
        N+ KWK LA+EAV EGGSSDKNI++FV
Subjt:  NSLKWKELAREAVDEGGSSDKNIDDFV

P0C7P7 UDP-glycosyltransferase 74E1 | e value: 5.9e-11 | % identity: 39.25
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVA---SLPILDAGDLPISF--PGVGLGFVKSFKASQFGNLEEAKWMLCNTFY
        MPW+L VA+  GL GA FFTQ   V+ IY  + +G   +P  + G  ++A   SLPIL+A DLP SF        ++      Q  N++    +LCNTF 
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVA---SLPILDAGDLPISF--PGVGLGFVKSFKASQFGNLEEAKWMLCNTFY

Query:  ELESAVL
        +LE  +L
Subjt:  ELESAVL

Q9SKC1 UDP-glycosyltransferase 74C1 | e value: 1.1e-73 | % identity: 49.32
Query:  LLDFIGFPIFN-------LYHSQSNP-LSPLLVRQCYGIL----IVLDIFSAV--SVVEEVGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASD
        L  F GFP+ +            S P L   +VRQ   +L    I+ + F  +   VV+ +  +W +K++GP VPS +LDNRL +D+ Y +    ++  +
Subjt:  LLDFIGFPIFN-------LYHSQSNP-LSPLLVRQCYGIL----IVLDIFSAV--SVVEEVGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASD

Query:  ATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSETSEK--GLVLSWCNQLQVLAHAAVGCFVSHCGWNSTL
        +  +WL ++PA SVVYV+FG++V L E Q++EIA ++      F+W + ES + KLP+ F+ E  EK  GLV  W  QL+VLAH ++GCFVSHCGWNSTL
Subjt:  ATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSETSEK--GLVLSWCNQLQVLAHAAVGCFVSHCGWNSTL

Query:  EALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQL
        EAL  GVPMV VP+W+DQ TN+KFI DVW +GVR + DG+ + + EEI +CI EVMEGERGKEIRKN  K K LAREA+ EGGSSDK ID+FV  L
Subjt:  EALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQL

Q9SKC5 UDP-glycosyltransferase 74D1 | e value: 3.6e-69 | % identity: 45.55
Query:  PIFNLYHSQSNPLSPLLVRQCYGI----LIVLDIFS--AVSVVEEVGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVV
        P+F   ++   PL  L+  Q   +      +++ F    V V++ + ++W +K++GP +PS YLD RL  D+ YG+ LF++  ++   +WL+SKP  SV+
Subjt:  PIFNLYHSQSNPLSPLLVRQCYGI----LIVLDIFS--AVSVVEEVGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVV

Query:  YVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSD
        YVSFGS+  L ++Q+ E+A+ L+     F+WV+ E+  +KLP+N++ +  +KGL+++W  QLQVLAH ++GCF++HCGWNSTLEALS GV ++ +P +SD
Subjt:  YVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSD

Query:  QTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVME--GERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQL
        Q TN+KFI DVW VGVR K D +     EEI +C+ EVME   E+GKEIRKN+ +  E AREA+ +GG+SDKNID+FV ++
Subjt:  QTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVME--GERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQL

Q9SYK9 UDP-glycosyltransferase 74E2 | e value: 4.9e-74 | % identity: 57.27
Query:  WRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSET
        W + ++GP VPS YLD RL +D++YG  LF++  ++   EWLNSK   SVVY+SFGS+V L E+Q+ E+A+ L+    FF+WV+ E+   KLP N+V E 
Subjt:  WRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSET

Query:  SEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRK
         EKGL++SW  QL VLAH ++GCF++HCGWNSTLE LS GVPM+ +P W+DQ TN+KF+ DVW VGVR K +GD     EEI + + EVMEGE+GKEIRK
Subjt:  SEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRK

Query:  NSLKWKELAREAVDEGGSSDKNIDDFV
        N+ KWK LA+EAV EGGSSDK+I++FV
Subjt:  NSLKWKELAREAVDEGGSSDKNIDDFV

Q9SYK9 UDP-glycosyltransferase 74E2 | e value: 6.5e-10 | % identity: 36.04
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVA---SLPILDAGDLP------ISFPGVGLGFVKSFKASQFGNLEEAKWMLC
        MPW+L VA+  GL GA FFTQ   V  IY  + +G   +P  + G  ++A   S P+L A DLP       S+P      +      Q  N++    +LC
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVA---SLPILDAGDLP------ISFPGVGLGFVKSFKASQFGNLEEAKWMLC

Query:  NTFYELESAVL
        NTF +LE  +L
Subjt:  NTFYELESAVL

Arabidopsis top hits | e value | % identity | Alignment
AT1G05675.1 UDP-Glycosyltransferase superfamily protein | e value: 1.1e-73 | % identity: 56.39
Query:  WRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSET
        W + ++GP VPS YLD RL +D++YG  LF +  ++   EWLNSK  +SVVYVSFGS+V L ++Q+ E+A+ L+    FF+WV+ E+ ++KLP N++ E 
Subjt:  WRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSET

Query:  SEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRK
         EKGL +SW  QL+VL H ++GCFV+HCGWNSTLE LS GVPM+ +P W+DQ TN+KF+ DVW VGVR K D D     EE  + + EVME E+GKEIRK
Subjt:  SEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRK

Query:  NSLKWKELAREAVDEGGSSDKNIDDFV
        N+ KWK LA+EAV EGGSSDKNI++FV
Subjt:  NSLKWKELAREAVDEGGSSDKNIDDFV

AT1G05675.1 UDP-Glycosyltransferase superfamily protein | e value: 4.2e-12 | % identity: 39.25
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVA---SLPILDAGDLPISF--PGVGLGFVKSFKASQFGNLEEAKWMLCNTFY
        MPW+L VA+  GL GA FFTQ   V+ IY  + +G   +P  + G  ++A   SLPIL+A DLP SF        ++      Q  N++    +LCNTF 
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVA---SLPILDAGDLPISF--PGVGLGFVKSFKASQFGNLEEAKWMLCNTFY

Query:  ELESAVL
        +LE  +L
Subjt:  ELESAVL

AT1G05680.1 Uridine diphosphate glycosyltransferase 74E2 | e value: 3.5e-75 | % identity: 57.27
Query:  WRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSET
        W + ++GP VPS YLD RL +D++YG  LF++  ++   EWLNSK   SVVY+SFGS+V L E+Q+ E+A+ L+    FF+WV+ E+   KLP N+V E 
Subjt:  WRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSET

Query:  SEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRK
         EKGL++SW  QL VLAH ++GCF++HCGWNSTLE LS GVPM+ +P W+DQ TN+KF+ DVW VGVR K +GD     EEI + + EVMEGE+GKEIRK
Subjt:  SEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRK

Query:  NSLKWKELAREAVDEGGSSDKNIDDFV
        N+ KWK LA+EAV EGGSSDK+I++FV
Subjt:  NSLKWKELAREAVDEGGSSDKNIDDFV

AT1G05680.1 Uridine diphosphate glycosyltransferase 74E2 | e value: 4.6e-11 | % identity: 36.04
Query:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVA---SLPILDAGDLP------ISFPGVGLGFVKSFKASQFGNLEEAKWMLC
        MPW+L VA+  GL GA FFTQ   V  IY  + +G   +P  + G  ++A   S P+L A DLP       S+P      +      Q  N++    +LC
Subjt:  MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVA---SLPILDAGDLP------ISFPGVGLGFVKSFKASQFGNLEEAKWMLC

Query:  NTFYELESAVL
        NTF +LE  +L
Subjt:  NTFYELESAVL

AT1G24100.1 UDP-glucosyl transferase 74B1 | e value: 1.9e-65 | % identity: 50.42
Query:  EVGHKWRMKS--VGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLP
        E G    MK+  +GP +PSAYLD+R+E D+ YG  L     S    EWL +K A SV +VSFGS   L E Q+ E+A +L++ +  F+WV+ E+   KLP
Subjt:  EVGHKWRMKS--VGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLP

Query:  TNFVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVD-GDAIFTAEEIGKCIREVMEG
          FV  T ++ L++SWCNQL+VLAH ++GCF++HCGWNSTLE LS GVPMV VP+WSDQ  ++KF+ +VW VG RAK + G+ I  +EE+ +C++ VMEG
Subjt:  TNFVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVD-GDAIFTAEEIGKCIREVMEG

Query:  ERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQL
        E   +IR++S KWK+LA +A+ EGGSSD++I++F++ L
Subjt:  ERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQL

AT2G31750.1 UDP-glucosyl transferase 74D1 | e value: 2.6e-70 | % identity: 45.55
Query:  PIFNLYHSQSNPLSPLLVRQCYGI----LIVLDIFS--AVSVVEEVGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVV
        P+F   ++   PL  L+  Q   +      +++ F    V V++ + ++W +K++GP +PS YLD RL  D+ YG+ LF++  ++   +WL+SKP  SV+
Subjt:  PIFNLYHSQSNPLSPLLVRQCYGI----LIVLDIFS--AVSVVEEVGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVV

Query:  YVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSD
        YVSFGS+  L ++Q+ E+A+ L+     F+WV+ E+  +KLP+N++ +  +KGL+++W  QLQVLAH ++GCF++HCGWNSTLEALS GV ++ +P +SD
Subjt:  YVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSD

Query:  QTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVME--GERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQL
        Q TN+KFI DVW VGVR K D +     EEI +C+ EVME   E+GKEIRKN+ +  E AREA+ +GG+SDKNID+FV ++
Subjt:  QTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVME--GERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQL

AT2G31790.1 UDP-Glycosyltransferase superfamily protein | e value: 7.8e-75 | % identity: 49.32
Query:  LLDFIGFPIFN-------LYHSQSNP-LSPLLVRQCYGIL----IVLDIFSAV--SVVEEVGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASD
        L  F GFP+ +            S P L   +VRQ   +L    I+ + F  +   VV+ +  +W +K++GP VPS +LDNRL +D+ Y +    ++  +
Subjt:  LLDFIGFPIFN-------LYHSQSNP-LSPLLVRQCYGIL----IVLDIFSAV--SVVEEVGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASD

Query:  ATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSETSEK--GLVLSWCNQLQVLAHAAVGCFVSHCGWNSTL
        +  +WL ++PA SVVYV+FG++V L E Q++EIA ++      F+W + ES + KLP+ F+ E  EK  GLV  W  QL+VLAH ++GCFVSHCGWNSTL
Subjt:  ATAEWLNSKPATSVVYVSFGSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSETSEK--GLVLSWCNQLQVLAHAAVGCFVSHCGWNSTL

Query:  EALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQL
        EAL  GVPMV VP+W+DQ TN+KFI DVW +GVR + DG+ + + EEI +CI EVMEGERGKEIRKN  K K LAREA+ EGGSSDK ID+FV  L
Subjt:  EALSCGVPMVVVPKWSDQTTNSKFIADVWGVGVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQL


Sequences
CDS sequence
ATGCCCTGGATTCTGCCCGTCGCGAACCGCCTTGGGCTCGACGGAGCGCCGTTCTTCACTCAGTCTTGCGCTGTCAACCATATTTATGATCTTATTGGTCGGGGCGATCT
GGAGATTCCTGTTGCAGAGGGTGGTCGCGTTTCGGTTGCTTCTCTGCCGATTCTCGACGCCGGCGACTTGCCGATTTCGTTCCCCGGCGTCGGCCTTGGCTTCGTGAAGT
CTTTTAAAGCAAGTCAGTTCGGGAATTTAGAGGAGGCCAAATGGATGTTGTGTAATACCTTCTACGAATTGGAGAGTGCGGTATTGTATTGTTTGTTCGAAGGAACAGTG
GTGCTTGTGAAACAGAGTACTGAAGAACCCATGGTTTCGAGTTTCAAACTGGAGCAAGAGGAATACATGAGTTGTGGCAGAGATAATTCAGCATCCGGCTGCTATTTCAG
TGGAGCAAATGTTGGAGTAGAGTATCGCTTCTCTCAGTGCCTCATGGATCCGAGTTTGCTGGTCGAGGACTGGTCACGAATGAGTCTTACCTCAGCTAAGGATGAAATCT
CGATGCAAGTTGATCAATCGGCTGTGGAACGCACTGGTCTATCTTTGGGTTGTTGTTTACTTGGGAAACTATTGTCCCATCGGATCTTAGCAGCGGAGTGCCAGTGGCTT
TTCGATAAATTCTTGTTGGTTCTAGAATTTCCCGTTCGCGCCTACAGGCCTTTAGATTATTCCTTCACAATCGTGGCTTTTTTGGTCCATTTTTATGATATTCCTCTTGA
TTGGTACAATCATGAAATGGCAGAGAGATTGGGCAACGCAATCAGTGTGTATGTGGAGGTTGATAGCCTTTATCAATACGGGGAGTGGCTTCGTTTTGATGGGAAGAGTA
AAATCTTAGCTCGTTCCTCGAATACGGATAATCGTATCGTTGCTATCCCGTTTGCAACTCCTTCCCAACCGACTTTGGTTGGATCCTCGATGGATAAGGAGGTTGTCACG
ACCATGGGTCTTTCGGAGATTCGCCCATCAATGGGTATTCGGATTATTGAGCAGCATCAGGACAACTCAGGGGGCGATGGAGTGAGTTACGCGACAGCTTGGTATTCTCT
TTTCGGCGGCTCTGATTTTCAGAGTAAAGGCAAAGGCAAAGTTGGTGATCGATACTCCTTGGCCAGCGGAGGAAGTCTTTTTCCCGAGGACAGGAAATGGCGACCGAAAT
CATCCAATTCGCAGCAGTTCAGAGTGAAGGGAGGGGCATTTTCGGTGAAGCCATGGGACCCGACGGTTACGAGAATTGATGTTGATATTGCGGAATATTTAAGAGAGATG
GATGAATTTAACCGAGATCAGATCATCTATTTGGAGGGAGAAAATTCAAATTCAACCTCGGCAGTCCAAACAGGGAAGAAGTACGTGAAAAGGTCCAATTGGAAGAATCG
AGCTCATGCGGGATTTGTTCCCAAAGGGATGACCATGAGAGTCCTCAAGGAGTCCCAAAAGCGTAAGGATGGTTCAATCTTGTTCTCTCTAGATAACATCAAACGTCCTA
AAGTTGACGTTAAGCATTATGATTCTGCATGGGTTATTGGAGGCGATTTCAATGAAATCCTTTGGGATTATGAGAAGTCGGGAGGACCTACTCGAGAGCAACGTCTAACT
CACAATTTCGAAGCTACTTTGGATGATTATAACTTGTTGGATTTGGGATTTTTCGACGGGACCTATACTTGGTGTAATAGAAGAGAAATGGGTGACCAAGTAAGCTTAAG
ACTGGATCGTGTTTTGGCCAATCCCAAGCTTCACTTCCTTATTCCATCGCTATCAGCAGCAAATAACTCCTTTCTTTGCTGCAAATTAGCCTTGAACACCACCACGGAGG
TTGCACCGGCGAAATGTAAATCCATTGTAGCCAGTTTTGGCACTTGGGATGGAAAGGGTGAGGTTAGCAATTTGTTACATCAGGATCTGAAGAAATGTGCGAGGAAATGT
ATTGGAAACAGAGATTGCGAGAGAATTGGCTCTAGTGGGGGGGATAGGAATACGAAATGGTTCCACCAGAAAGCTTCTAGTAGGAGGAAGAGGAATTTAATCAAAGGAGT
AGTGGATGCTACTGGTTCCTGGCAGACTAAGTTATCATCTATTCAAGAGACTTTTGAGAGGATCATATGGATGTTAAGGATAGATGGTAATTCAGGGAGGGGAAGAGATG
ATCGTGCAGGGTCAACTGCGGAGCATAATGGTCAACTCGACAGATACATTTTCCGATCCCTTATGTGTCGAGGCGTTGGCAGTCCAGGAAGGTCTCCGTTTTGCATATAT
ACCGAATTTGAGATGCTTCTTGATTTTATCGGATTCCCTATCTTTAATCTCTATCATTCACAGAGCAATCCCTTGTCCCCGCTACTTGTGCGACAGTGTTATGGGATATT
GATCGTGTTAGACATATTTTCTGCAGTGTCAGTGGTTGAAGAGGTGGGACACAAATGGCGAATGAAGAGTGTGGGGCCATGTGTCCCATCGGCATACTTAGACAATCGGT
TGGAGAAGGACGAAAGCTACGGCGTCGGTCTCTTCAGCTCTGACGCTTCGGACGCCACAGCCGAGTGGCTCAACTCAAAGCCCGCCACCTCCGTTGTGTATGTCTCTTTC
GGAAGCATTGTAAACTTGGGAGAAAACCAGGTAAGGGAAATAGCAAGCAGTCTGAGAGATGGCAATAGCTTCTTCATATGGGTTATGGGAGAATCAGTAAAGCAAAAGCT
TCCAACCAACTTTGTTTCAGAAACCTCAGAGAAAGGTCTTGTTCTCAGCTGGTGCAATCAGCTCCAAGTGTTGGCTCACGCCGCCGTCGGATGCTTCGTCAGTCACTGTG
GCTGGAACTCCACCCTGGAGGCTCTCAGCTGCGGCGTGCCCATGGTGGTCGTTCCCAAATGGTCCGATCAGACCACCAACTCCAAGTTCATAGCAGACGTTTGGGGAGTT
GGGGTCAGAGCCAAAGTCGACGGAGACGCCATTTTCACTGCGGAGGAAATTGGAAAGTGTATTAGAGAAGTGATGGAGGGAGAGAGAGGGAAGGAGATAAGAAAGAATTC
GTTGAAATGGAAGGAACTGGCTAGAGAAGCTGTGGATGAAGGTGGTAGCTCTGATAAGAATATTGATGACTTTGTAAAACAACTTTTCAATTCTGTTTGA
mRNA sequence
ATGCCCTGGATTCTGCCCGTCGCGAACCGCCTTGGGCTCGACGGAGCGCCGTTCTTCACTCAGTCTTGCGCTGTCAACCATATTTATGATCTTATTGGTCGGGGCGATCT
GGAGATTCCTGTTGCAGAGGGTGGTCGCGTTTCGGTTGCTTCTCTGCCGATTCTCGACGCCGGCGACTTGCCGATTTCGTTCCCCGGCGTCGGCCTTGGCTTCGTGAAGT
CTTTTAAAGCAAGTCAGTTCGGGAATTTAGAGGAGGCCAAATGGATGTTGTGTAATACCTTCTACGAATTGGAGAGTGCGGTATTGTATTGTTTGTTCGAAGGAACAGTG
GTGCTTGTGAAACAGAGTACTGAAGAACCCATGGTTTCGAGTTTCAAACTGGAGCAAGAGGAATACATGAGTTGTGGCAGAGATAATTCAGCATCCGGCTGCTATTTCAG
TGGAGCAAATGTTGGAGTAGAGTATCGCTTCTCTCAGTGCCTCATGGATCCGAGTTTGCTGGTCGAGGACTGGTCACGAATGAGTCTTACCTCAGCTAAGGATGAAATCT
CGATGCAAGTTGATCAATCGGCTGTGGAACGCACTGGTCTATCTTTGGGTTGTTGTTTACTTGGGAAACTATTGTCCCATCGGATCTTAGCAGCGGAGTGCCAGTGGCTT
TTCGATAAATTCTTGTTGGTTCTAGAATTTCCCGTTCGCGCCTACAGGCCTTTAGATTATTCCTTCACAATCGTGGCTTTTTTGGTCCATTTTTATGATATTCCTCTTGA
TTGGTACAATCATGAAATGGCAGAGAGATTGGGCAACGCAATCAGTGTGTATGTGGAGGTTGATAGCCTTTATCAATACGGGGAGTGGCTTCGTTTTGATGGGAAGAGTA
AAATCTTAGCTCGTTCCTCGAATACGGATAATCGTATCGTTGCTATCCCGTTTGCAACTCCTTCCCAACCGACTTTGGTTGGATCCTCGATGGATAAGGAGGTTGTCACG
ACCATGGGTCTTTCGGAGATTCGCCCATCAATGGGTATTCGGATTATTGAGCAGCATCAGGACAACTCAGGGGGCGATGGAGTGAGTTACGCGACAGCTTGGTATTCTCT
TTTCGGCGGCTCTGATTTTCAGAGTAAAGGCAAAGGCAAAGTTGGTGATCGATACTCCTTGGCCAGCGGAGGAAGTCTTTTTCCCGAGGACAGGAAATGGCGACCGAAAT
CATCCAATTCGCAGCAGTTCAGAGTGAAGGGAGGGGCATTTTCGGTGAAGCCATGGGACCCGACGGTTACGAGAATTGATGTTGATATTGCGGAATATTTAAGAGAGATG
GATGAATTTAACCGAGATCAGATCATCTATTTGGAGGGAGAAAATTCAAATTCAACCTCGGCAGTCCAAACAGGGAAGAAGTACGTGAAAAGGTCCAATTGGAAGAATCG
AGCTCATGCGGGATTTGTTCCCAAAGGGATGACCATGAGAGTCCTCAAGGAGTCCCAAAAGCGTAAGGATGGTTCAATCTTGTTCTCTCTAGATAACATCAAACGTCCTA
AAGTTGACGTTAAGCATTATGATTCTGCATGGGTTATTGGAGGCGATTTCAATGAAATCCTTTGGGATTATGAGAAGTCGGGAGGACCTACTCGAGAGCAACGTCTAACT
CACAATTTCGAAGCTACTTTGGATGATTATAACTTGTTGGATTTGGGATTTTTCGACGGGACCTATACTTGGTGTAATAGAAGAGAAATGGGTGACCAAGTAAGCTTAAG
ACTGGATCGTGTTTTGGCCAATCCCAAGCTTCACTTCCTTATTCCATCGCTATCAGCAGCAAATAACTCCTTTCTTTGCTGCAAATTAGCCTTGAACACCACCACGGAGG
TTGCACCGGCGAAATGTAAATCCATTGTAGCCAGTTTTGGCACTTGGGATGGAAAGGGTGAGGTTAGCAATTTGTTACATCAGGATCTGAAGAAATGTGCGAGGAAATGT
ATTGGAAACAGAGATTGCGAGAGAATTGGCTCTAGTGGGGGGGATAGGAATACGAAATGGTTCCACCAGAAAGCTTCTAGTAGGAGGAAGAGGAATTTAATCAAAGGAGT
AGTGGATGCTACTGGTTCCTGGCAGACTAAGTTATCATCTATTCAAGAGACTTTTGAGAGGATCATATGGATGTTAAGGATAGATGGTAATTCAGGGAGGGGAAGAGATG
ATCGTGCAGGGTCAACTGCGGAGCATAATGGTCAACTCGACAGATACATTTTCCGATCCCTTATGTGTCGAGGCGTTGGCAGTCCAGGAAGGTCTCCGTTTTGCATATAT
ACCGAATTTGAGATGCTTCTTGATTTTATCGGATTCCCTATCTTTAATCTCTATCATTCACAGAGCAATCCCTTGTCCCCGCTACTTGTGCGACAGTGTTATGGGATATT
GATCGTGTTAGACATATTTTCTGCAGTGTCAGTGGTTGAAGAGGTGGGACACAAATGGCGAATGAAGAGTGTGGGGCCATGTGTCCCATCGGCATACTTAGACAATCGGT
TGGAGAAGGACGAAAGCTACGGCGTCGGTCTCTTCAGCTCTGACGCTTCGGACGCCACAGCCGAGTGGCTCAACTCAAAGCCCGCCACCTCCGTTGTGTATGTCTCTTTC
GGAAGCATTGTAAACTTGGGAGAAAACCAGGTAAGGGAAATAGCAAGCAGTCTGAGAGATGGCAATAGCTTCTTCATATGGGTTATGGGAGAATCAGTAAAGCAAAAGCT
TCCAACCAACTTTGTTTCAGAAACCTCAGAGAAAGGTCTTGTTCTCAGCTGGTGCAATCAGCTCCAAGTGTTGGCTCACGCCGCCGTCGGATGCTTCGTCAGTCACTGTG
GCTGGAACTCCACCCTGGAGGCTCTCAGCTGCGGCGTGCCCATGGTGGTCGTTCCCAAATGGTCCGATCAGACCACCAACTCCAAGTTCATAGCAGACGTTTGGGGAGTT
GGGGTCAGAGCCAAAGTCGACGGAGACGCCATTTTCACTGCGGAGGAAATTGGAAAGTGTATTAGAGAAGTGATGGAGGGAGAGAGAGGGAAGGAGATAAGAAAGAATTC
GTTGAAATGGAAGGAACTGGCTAGAGAAGCTGTGGATGAAGGTGGTAGCTCTGATAAGAATATTGATGACTTTGTAAAACAACTTTTCAATTCTGTTTGA
Protein sequence
MPWILPVANRLGLDGAPFFTQSCAVNHIYDLIGRGDLEIPVAEGGRVSVASLPILDAGDLPISFPGVGLGFVKSFKASQFGNLEEAKWMLCNTFYELESAVLYCLFEGTV
VLVKQSTEEPMVSSFKLEQEEYMSCGRDNSASGCYFSGANVGVEYRFSQCLMDPSLLVEDWSRMSLTSAKDEISMQVDQSAVERTGLSLGCCLLGKLLSHRILAAECQWL
FDKFLLVLEFPVRAYRPLDYSFTIVAFLVHFYDIPLDWYNHEMAERLGNAISVYVEVDSLYQYGEWLRFDGKSKILARSSNTDNRIVAIPFATPSQPTLVGSSMDKEVVT
TMGLSEIRPSMGIRIIEQHQDNSGGDGVSYATAWYSLFGGSDFQSKGKGKVGDRYSLASGGSLFPEDRKWRPKSSNSQQFRVKGGAFSVKPWDPTVTRIDVDIAEYLREM
DEFNRDQIIYLEGENSNSTSAVQTGKKYVKRSNWKNRAHAGFVPKGMTMRVLKESQKRKDGSILFSLDNIKRPKVDVKHYDSAWVIGGDFNEILWDYEKSGGPTREQRLT
HNFEATLDDYNLLDLGFFDGTYTWCNRREMGDQVSLRLDRVLANPKLHFLIPSLSAANNSFLCCKLALNTTTEVAPAKCKSIVASFGTWDGKGEVSNLLHQDLKKCARKC
IGNRDCERIGSSGGDRNTKWFHQKASSRRKRNLIKGVVDATGSWQTKLSSIQETFERIIWMLRIDGNSGRGRDDRAGSTAEHNGQLDRYIFRSLMCRGVGSPGRSPFCIY
TEFEMLLDFIGFPIFNLYHSQSNPLSPLLVRQCYGILIVLDIFSAVSVVEEVGHKWRMKSVGPCVPSAYLDNRLEKDESYGVGLFSSDASDATAEWLNSKPATSVVYVSF
GSIVNLGENQVREIASSLRDGNSFFIWVMGESVKQKLPTNFVSETSEKGLVLSWCNQLQVLAHAAVGCFVSHCGWNSTLEALSCGVPMVVVPKWSDQTTNSKFIADVWGV
GVRAKVDGDAIFTAEEIGKCIREVMEGERGKEIRKNSLKWKELAREAVDEGGSSDKNIDDFVKQLFNSV