; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC06G113230 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC06G113230
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionO-glucosyltransferase rumi homolog
Genome locationCicolChr06:4355329..4361184
RNA-Seq ExpressionCcUC06G113230
SyntenyCcUC06G113230
Gene Ontology termsNA
InterPro domainsIPR006598 - Glycosyl transferase CAP10 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138510.2 protein O-glucosyltransferase 1 [Cucumis sativus]2.1e-14658.15Show/hide
Query:  MKRNSDKNLHQIFCYH------SSSLQPPRWRWRSSHNKPSFKTFAFFLF-FFLFFLFTLFIRLGWINAEIIYWRWRGE-GVEKSRSNREGSWKATCPAH
        MKR S KNLHQ+FCYH      SSS      RWRSS N PSFKT A FLF  FLF L   F+R GWI          GE GV+  RS   GS   TCPAH
Subjt:  MKRNSDKNLHQIFCYH------SSSLQPPRWRWRSSHNKPSFKTFAFFLF-FFLFFLFTLFIRLGWINAEIIYWRWRGE-GVEKSRSNREGSWKATCPAH

Query:  FRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPL
        FRWI EDL PWRE GITR MVE  R+TAHFRVVIV+GRVYVEKY+GSIQTRDVFTMWGILQL RWYP KLPDLELMFDCDDRPVVRSN F++  S PPPL
Subjt:  FRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPL

Query:  FRYCSDEFSLDIVFPDWSFWGWGEINIKPWRM------------------------------------------------------DWDKESKEGYKQSN
        FRYCSDE SLDIVFPDWSFWGWGEINIKPW+M                                                      DWDKE+KEGYKQSN
Subjt:  FRYCSDEFSLDIVFPDWSFWGWGEINIKPWRM------------------------------------------------------DWDKESKEGYKQSN

Query:  LEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVY
        LEDQCTHR+                                                                     AEAIGEEGSKYL ENLKMELVY
Subjt:  LEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVY

Query:  DYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN
        DYM+HLLNEYSKLLKFRP+VPPGAVEL PETMTGAA+GLHKKF+EDSLEKSPS TEPCDLPPHDP VL EF EKKLNAL +V+ WEKEYWE Q K N
Subjt:  DYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN

XP_008458253.1 PREDICTED: protein O-glucosyltransferase 1-like [Cucumis melo]2.9e-14858.17Show/hide
Query:  MKRNSDKNLHQIFCYHSSS------------LQPPRWRWRSSHNKPSFKTFAFFLF-FFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKA
        MKR S KNLHQIFCYHSSS            +Q P  RWRSS N PSFKT A FLF  FLF L   F+R GWI    IYW   GEGV   RSN  GS   
Subjt:  MKRNSDKNLHQIFCYHSSS------------LQPPRWRWRSSHNKPSFKTFAFFLF-FFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKA

Query:  TCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKS
        TCPAHFRWI EDL PWR  GITR MVE AR+TAHFR+VI++GRVYVEKYRGSIQTRDVFTMWGILQL RWYP KLPD+ELMFDCDDRPVVRSNDF +  S
Subjt:  TCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKS

Query:  SPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWRM------------------------------------------------------DWDKESKEG
         PPPL RYCSDE SLDIVFPDWSFWGWGEINIKPWRM                                                      DWDKE+KEG
Subjt:  SPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWRM------------------------------------------------------DWDKESKEG

Query:  YKQSNLEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLK
        YKQSNLEDQCTHR+                                                                     AEAIGEEGSKYL ENLK
Subjt:  YKQSNLEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLK

Query:  MELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRK
        MELVYDYM+HLLNEYSKLLKFRP+VPPGAVEL PETMTGA  GLHKKF+EDSLEKSPS+ EPCDLPP+D  VL E  EKKLNAL QV+ WEKEYWENQ K
Subjt:  MELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRK

Query:  GN
         N
Subjt:  GN

XP_023000115.1 protein O-glucosyltransferase 1-like isoform X1 [Cucurbita maxima]2.0e-14457.55Show/hide
Query:  MKRNSDKNLHQIFCYHSSS-LQPPRWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKATCPAHFRWIHED
        MKR  DK+L  I CY+SSS LQ  RWRWRSS  KP+FK  A FLFFF FF+  + +RL W+               KS  + +    ATCP HFRWIHED
Subjt:  MKRNSDKNLHQIFCYHSSS-LQPPRWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKATCPAHFRWIHED

Query:  LGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPLFRYCSDE
        L PWRETGITREMVEGAR+TAHFRVVI+DGRVYVEKYRGSIQTRD+FTMWG+LQLVRWYP KLPDLELMFDCDDRPVV+S DFLDPK+ PPPLFRYCSD+
Subjt:  LGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPLFRYCSDE

Query:  FSLDIVFPDWSFWGWGEINIKPWR------------------------------------------------------MDWDKESKEGYKQSNLEDQCTH
         SLDIVFPDWSFWGWGEINIKPWR                                                       DWDKESKEGYKQSNLEDQCTH
Subjt:  FSLDIVFPDWSFWGWGEINIKPWR------------------------------------------------------MDWDKESKEGYKQSNLEDQCTH

Query:  RW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVYDYMFHLL
        R+                                                                     A+AIGE+GSKYLHENLKMELVYDYMFHLL
Subjt:  RW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVYDYMFHLL

Query:  NEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN
        NEYSKLLKFRPSVP GAVEL PE M GAA GLHKKFME+SLE SP+  E C LPPHDP VL EF  KKLNALKQVE WEK+YWEN+RKGN
Subjt:  NEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN

XP_038906931.1 protein O-glucosyltransferase 1-like isoform X1 [Benincasa hispida]1.3e-16764.3Show/hide
Query:  MKRNSDKNLHQIFCY--HSSSLQPP--RWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKATCPAHFRWI
        MKR +DKNLHQIFCY   SSS+QPP  RWRWRSS  KPSFKT A FLFFFLFFL  LFIRL WI A I    WR +GVE +RS  +GS  ATCP HFRWI
Subjt:  MKRNSDKNLHQIFCY--HSSSLQPP--RWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKATCPAHFRWI

Query:  HEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPLFRYC
        HEDL PWRETGITREMVE ARK AHFRVVIVDGRVYVEKY+ SIQTRD+FTMWG+LQLVRWYPG+LPDLELMFDCDDRPVVRSNDFLDPKS PPPLFRYC
Subjt:  HEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPLFRYC

Query:  SDEFSLDIVFPDWSFWGWGEINIKPWRM------------------------------------------------------DWDKESKEGYKQSNLEDQ
        SDE SLDIVFPDWSFWGWGEINIKPWRM                                                      DWDKESKEGYKQSNLEDQ
Subjt:  SDEFSLDIVFPDWSFWGWGEINIKPWRM------------------------------------------------------DWDKESKEGYKQSNLEDQ

Query:  CTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVYDYMF
        CTHR+                                                                     A+AIGEEGSKYL ENLKMELVYDYMF
Subjt:  CTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVYDYMF

Query:  HLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN
        HLLNEYSKLLKFRPSVP GA+EL PETMTGA VGLHKKF+EDSLEKSPSDTEPCDLPPHDP VL+EF EKKLNALKQVEIWE EYWENQRKGN
Subjt:  HLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN

XP_038906932.1 O-glucosyltransferase rumi-like isoform X2 [Benincasa hispida]1.4e-16675.12Show/hide
Query:  MKRNSDKNLHQIFCY--HSSSLQPP--RWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKATCPAHFRWI
        MKR +DKNLHQIFCY   SSS+QPP  RWRWRSS  KPSFKT A FLFFFLFFL  LFIRL WI A I    WR +GVE +RS  +GS  ATCP HFRWI
Subjt:  MKRNSDKNLHQIFCY--HSSSLQPP--RWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKATCPAHFRWI

Query:  HEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPLFRYC
        HEDL PWRETGITREMVE ARK AHFRVVIVDGRVYVEKY+ SIQTRD+FTMWG+LQLVRWYPG+LPDLELMFDCDDRPVVRSNDFLDPKS PPPLFRYC
Subjt:  HEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPLFRYC

Query:  SDEFSLDIVFPDWSFWGWGEINIKPWRMDWDKESKEGYKQSNLEDQC-------------------------THRW-------AEAIGEEGSKYLHENLK
        SDE SLDIVFPDWSFWGWGEINIKPWRM  + + KEG K++  +D+                           H W       A+AIGEEGSKYL ENLK
Subjt:  SDEFSLDIVFPDWSFWGWGEINIKPWRMDWDKESKEGYKQSNLEDQC-------------------------THRW-------AEAIGEEGSKYLHENLK

Query:  MELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRK
        MELVYDYMFHLLNEYSKLLKFRPSVP GA+EL PETMTGA VGLHKKF+EDSLEKSPSDTEPCDLPPHDP VL+EF EKKLNALKQVEIWE EYWENQRK
Subjt:  MELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRK

Query:  GN
        GN
Subjt:  GN

TrEMBL top hitse value%identityAlignment
A0A0A0KB84 CAP10 domain-containing protein1.0e-14658.15Show/hide
Query:  MKRNSDKNLHQIFCYH------SSSLQPPRWRWRSSHNKPSFKTFAFFLF-FFLFFLFTLFIRLGWINAEIIYWRWRGE-GVEKSRSNREGSWKATCPAH
        MKR S KNLHQ+FCYH      SSS      RWRSS N PSFKT A FLF  FLF L   F+R GWI          GE GV+  RS   GS   TCPAH
Subjt:  MKRNSDKNLHQIFCYH------SSSLQPPRWRWRSSHNKPSFKTFAFFLF-FFLFFLFTLFIRLGWINAEIIYWRWRGE-GVEKSRSNREGSWKATCPAH

Query:  FRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPL
        FRWI EDL PWRE GITR MVE  R+TAHFRVVIV+GRVYVEKY+GSIQTRDVFTMWGILQL RWYP KLPDLELMFDCDDRPVVRSN F++  S PPPL
Subjt:  FRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPL

Query:  FRYCSDEFSLDIVFPDWSFWGWGEINIKPWRM------------------------------------------------------DWDKESKEGYKQSN
        FRYCSDE SLDIVFPDWSFWGWGEINIKPW+M                                                      DWDKE+KEGYKQSN
Subjt:  FRYCSDEFSLDIVFPDWSFWGWGEINIKPWRM------------------------------------------------------DWDKESKEGYKQSN

Query:  LEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVY
        LEDQCTHR+                                                                     AEAIGEEGSKYL ENLKMELVY
Subjt:  LEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVY

Query:  DYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN
        DYM+HLLNEYSKLLKFRP+VPPGAVEL PETMTGAA+GLHKKF+EDSLEKSPS TEPCDLPPHDP VL EF EKKLNAL +V+ WEKEYWE Q K N
Subjt:  DYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN

A0A1S3C7F0 protein O-glucosyltransferase 1-like1.4e-14858.17Show/hide
Query:  MKRNSDKNLHQIFCYHSSS------------LQPPRWRWRSSHNKPSFKTFAFFLF-FFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKA
        MKR S KNLHQIFCYHSSS            +Q P  RWRSS N PSFKT A FLF  FLF L   F+R GWI    IYW   GEGV   RSN  GS   
Subjt:  MKRNSDKNLHQIFCYHSSS------------LQPPRWRWRSSHNKPSFKTFAFFLF-FFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKA

Query:  TCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKS
        TCPAHFRWI EDL PWR  GITR MVE AR+TAHFR+VI++GRVYVEKYRGSIQTRDVFTMWGILQL RWYP KLPD+ELMFDCDDRPVVRSNDF +  S
Subjt:  TCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKS

Query:  SPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWRM------------------------------------------------------DWDKESKEG
         PPPL RYCSDE SLDIVFPDWSFWGWGEINIKPWRM                                                      DWDKE+KEG
Subjt:  SPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWRM------------------------------------------------------DWDKESKEG

Query:  YKQSNLEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLK
        YKQSNLEDQCTHR+                                                                     AEAIGEEGSKYL ENLK
Subjt:  YKQSNLEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLK

Query:  MELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRK
        MELVYDYM+HLLNEYSKLLKFRP+VPPGAVEL PETMTGA  GLHKKF+EDSLEKSPS+ EPCDLPP+D  VL E  EKKLNAL QV+ WEKEYWENQ K
Subjt:  MELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRK

Query:  GN
         N
Subjt:  GN

A0A6J1DPT6 O-glucosyltransferase rumi homolog1.2e-13956.36Show/hide
Query:  MKRNSDK-NLHQIFCYHSSSLQPPRWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWR-------GEGVEKSRSNREGSWKATCPAH
        MKR SD+ +LHQIFC  SSS      RWR    K +FK    FLFFFLFF+  LF+R    N   ++WR +           E  R N + +W ATCP H
Subjt:  MKRNSDK-NLHQIFCYHSSSLQPPRWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWR-------GEGVEKSRSNREGSWKATCPAH

Query:  FRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPL
        FRWIHEDL PWRETGITREMVE ARKTAHFRVVI+DGRVYVEKYRGSIQTRD+FTMWG LQL+RWYP KLPDLELMFDCDDRPVVRS DF   +  PPPL
Subjt:  FRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPL

Query:  FRYCSDEFSLDIVFPDWSFWGWGEINIKPWR------------------------------------------------------MDWDKESKEGYKQSN
        FRYCSDE SLDIVFPDWSFWGW EINIKPWR                                                       DWDKESK GYKQSN
Subjt:  FRYCSDEFSLDIVFPDWSFWGWGEINIKPWR------------------------------------------------------MDWDKESKEGYKQSN

Query:  LEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVY
        LEDQCTHR+                                                                     AE IGEEGS YL +NLKMELVY
Subjt:  LEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVY

Query:  DYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRK
        DYMFHLLNEYSKLLKFRP VP GAVEL PETMTGA  GLHKKFMEDSLE SPSD+EPCDLPPHDP VL E  EKKLNAL+QVEIWEKEYWE+QRK
Subjt:  DYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRK

A0A6J1HHQ2 protein O-glucosyltransferase 1-like1.2e-14457.35Show/hide
Query:  MKRNSDKNLHQIFCYHSSS-LQPPRWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKATCPAHFRWIHED
        MKR  DK L  I CY SSS LQ  RWRWRSS  K +FK  A     F FF   + +RL W     I+        ++ R+       ATCP HFRWIHED
Subjt:  MKRNSDKNLHQIFCYHSSS-LQPPRWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKATCPAHFRWIHED

Query:  LGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPLFRYCSDE
        L PWRETGITREMVEGAR+TAHFR+VI+DGRVYVEKYRGSIQTRD+FTMWG+LQLVRWYPGKLPDLELMFDCDDRPVV+S DFLDPK+ PPPLFRYCSDE
Subjt:  LGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPLFRYCSDE

Query:  FSLDIVFPDWSFWGWGEINIKPWR------------------------------------------------------MDWDKESKEGYKQSNLEDQCTH
         SLDIVFPDWSFWGWGEINIKPWR                                                       DWDKESKEGYKQSNLEDQCTH
Subjt:  FSLDIVFPDWSFWGWGEINIKPWR------------------------------------------------------MDWDKESKEGYKQSNLEDQCTH

Query:  RW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVYDYMFHLL
        ++                                                                     AEAIGEEGSKYL EN+KMELVYDYMFHLL
Subjt:  RW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVYDYMFHLL

Query:  NEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN
        NEYSKLLKFRPSVP GAVEL PET+ GAA GLHKKFMEDSLE SP+ +EPCDLPPHDP VL EF  KKLNALKQVE WEK+YWENQRKGN
Subjt:  NEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN

A0A6J1KHG0 protein O-glucosyltransferase 1-like isoform X19.4e-14557.55Show/hide
Query:  MKRNSDKNLHQIFCYHSSS-LQPPRWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKATCPAHFRWIHED
        MKR  DK+L  I CY+SSS LQ  RWRWRSS  KP+FK  A FLFFF FF+  + +RL W+               KS  + +    ATCP HFRWIHED
Subjt:  MKRNSDKNLHQIFCYHSSS-LQPPRWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKATCPAHFRWIHED

Query:  LGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPLFRYCSDE
        L PWRETGITREMVEGAR+TAHFRVVI+DGRVYVEKYRGSIQTRD+FTMWG+LQLVRWYP KLPDLELMFDCDDRPVV+S DFLDPK+ PPPLFRYCSD+
Subjt:  LGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPLFRYCSDE

Query:  FSLDIVFPDWSFWGWGEINIKPWR------------------------------------------------------MDWDKESKEGYKQSNLEDQCTH
         SLDIVFPDWSFWGWGEINIKPWR                                                       DWDKESKEGYKQSNLEDQCTH
Subjt:  FSLDIVFPDWSFWGWGEINIKPWR------------------------------------------------------MDWDKESKEGYKQSNLEDQCTH

Query:  RW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVYDYMFHLL
        R+                                                                     A+AIGE+GSKYLHENLKMELVYDYMFHLL
Subjt:  RW---------------------------------------------------------------------AEAIGEEGSKYLHENLKMELVYDYMFHLL

Query:  NEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN
        NEYSKLLKFRPSVP GAVEL PE M GAA GLHKKFME+SLE SP+  E C LPPHDP VL EF  KKLNALKQVE WEK+YWEN+RKGN
Subjt:  NEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN

SwissProt top hitse value%identityAlignment
B0X1Q4 O-glucosyltransferase rumi homolog1.2e-0627.89Show/hide
Query:  ATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVF---TMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFL
        + C  H   +  DL P+R +GIT++++E AR     +  I+  R++        + RD        G+   +R    KLPD+EL+ +C D P +  +   
Subjt:  ATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVF---TMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFL

Query:  DPKSSPPPLFRYCSDEFSLDIVFPDWSFW-GWGEINIKPWRMD-WDK
        +    P P+  +      LDI++P W FW G   I++ P  +  WD+
Subjt:  DPKSSPPPLFRYCSDEFSLDIVFPDWSFW-GWGEINIKPWRMD-WDK

Q29AU6 O-glucosyltransferase rumi5.7e-0627.27Show/hide
Query:  ATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEK---YRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFL
        A C  H   I  DL P++ TG++R+M+E + +    R  I + R+Y E+   +    Q  + F    +L LV      LPD++L+ +  D P +   +  
Subjt:  ATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEK---YRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFL

Query:  DPKSSPPPLFRYCSDEFSLDIVFPDWSFWGWG
            +  P+  +   +   DI++P W+FW  G
Subjt:  DPKSSPPPLFRYCSDEFSLDIVFPDWSFWGWG

Q8T045 O-glucosyltransferase rumi5.7e-0626.12Show/hide
Query:  CPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYR--GSIQTRDVFTMW-----GILQLVRWYPGKLPDLELMFDCDDRPVVRSND
        C  H   +  DL P++ TG+TR+M+E + +             Y  KY+  G    RD   M+     GI   +      LPD++L+ +  D P + +  
Subjt:  CPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYR--GSIQTRDVFTMW-----GILQLVRWYPGKLPDLELMFDCDDRPVVRSND

Query:  FLDPKSSPPPLFRYCSDEFSLDIVFPDWSFWGWG
             ++  P+F +   +   DI++P W+FW  G
Subjt:  FLDPKSSPPPLFRYCSDEFSLDIVFPDWSFWGWG

Arabidopsis top hitse value%identityAlignment
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)2.1e-8039.62Show/hide
Query:  GEGVEKSRSNREGSWKATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFD
        G    ++ SNR      +CP +F+WIHEDL PWRETGIT+EMVE  + TAHFR+VI++G+V+VE Y+ SIQTRD FT+WGILQL+R YPGKLPD++LMFD
Subjt:  GEGVEKSRSNREGSWKATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFD

Query:  CDDRPVVRSNDF----LDPKSSPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWR-------------------------------------------
        CDDRPV+RS+ +       +++PPPLFRYC D +++DIVFPDWSFWGW EINI+ W                                            
Subjt:  CDDRPVVRSNDF----LDPKSSPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWR-------------------------------------------

Query:  ------------MDWDKESKEGYKQSNLEDQCTHRW----------------------------------------------------------------
                     DW  E + G++ SN+ +QCT+R+                                                                
Subjt:  ------------MDWDKESKEGYKQSNLEDQCTHRW----------------------------------------------------------------

Query:  -----AEAIGEEGSKYLHENLKMELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMT----GAAV-GLHKKFMEDSLEKSPSDTEPCDL-PPHDPKV
             A+ IG E S+++  +L ME VYDYMFHLLNEYSKLLK++P VP  +VEL  E +     G  V G+ KKFM  SL   P  + PC L PP D   
Subjt:  -----AEAIGEEGSKYLHENLKMELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMT----GAAV-GLHKKFMEDSLEKSPSDTEPCDL-PPHDPKV

Query:  LSEFPEKKLNALKQVEIWEKEYWE
        L +F  KKLN ++QVE WE  YW+
Subjt:  LSEFPEKKLNALKQVEIWEKEYWE

AT2G45830.1 downstream target of AGL15 26.1e-9644.9Show/hide
Query:  EKSRSNREGSWKATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDR
        +K RS+   S  +TCP++FRWIHEDL PW+ETG+TR M+E AR+TAHFRVVI+DGRVYV+KYR SIQTRDVFT+WGI+QL+RWYPG+LPDLELMFD DDR
Subjt:  EKSRSNREGSWKATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDR

Query:  PVVRSNDFLDPK-SSPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWR--------------------------------------------------
        P VRS DF   +  +PPPLFRYCSD+ SLDIVFPDWSFWGW E+NIKPW                                                   
Subjt:  PVVRSNDFLDPK-SSPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWR--------------------------------------------------

Query:  ----MDWDKESKEGYKQSNLEDQCTHRW---------------------------------------------------------------------AEA
             DWD+ES+EG+K SNLE+QCTHR+                                                                     A  
Subjt:  ----MDWDKESKEGYKQSNLEDQCTHRW---------------------------------------------------------------------AEA

Query:  IGEEGSKYLHENLKMELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLP-PHDPKVLSEFPEKKLNALK
        IGEEGS+++ E +KME VYDYMFHL+NEY+KLLKF+P +P GA E+ P+ M  +A G  + FME+S+   PS+  PC++P P +P  L E  E+K N  +
Subjt:  IGEEGSKYLHENLKMELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLP-PHDPKVLSEFPEKKLNALK

Query:  QVEIWEKEYWEN
        QVE WE +Y+ +
Subjt:  QVEIWEKEYWEN

AT2G45830.2 downstream target of AGL15 21.5e-8143.47Show/hide
Query:  MVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPK-SSPPPLFRYCSDEFSLDIVFPDWS
        M+E AR+TAHFRVVI+DGRVYV+KYR SIQTRDVFT+WGI+QL+RWYPG+LPDLELMFD DDRP VRS DF   +  +PPPLFRYCSD+ SLDIVFPDWS
Subjt:  MVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPK-SSPPPLFRYCSDEFSLDIVFPDWS

Query:  FWGWGEINIKPWR------------------------------------------------------MDWDKESKEGYKQSNLEDQCTHRW---------
        FWGW E+NIKPW                                                        DWD+ES+EG+K SNLE+QCTHR+         
Subjt:  FWGWGEINIKPWR------------------------------------------------------MDWDKESKEGYKQSNLEDQCTHRW---------

Query:  ------------------------------------------------------------AEAIGEEGSKYLHENLKMELVYDYMFHLLNEYSKLLKFRP
                                                                    A  IGEEGS+++ E +KME VYDYMFHL+NEY+KLLKF+P
Subjt:  ------------------------------------------------------------AEAIGEEGSKYLHENLKMELVYDYMFHLLNEYSKLLKFRP

Query:  SVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLP-PHDPKVLSEFPEKKLNALKQVEIWEKEYWEN
         +P GA E+ P+ M  +A G  + FME+S+   PS+  PC++P P +P  L E  E+K N  +QVE WE +Y+ +
Subjt:  SVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDLP-PHDPKVLSEFPEKKLNALKQVEIWEKEYWEN

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821)1.4e-9243.07Show/hide
Query:  KSRSNREGSWK-ATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDR
        KSR N   S K +TCP++FRWIHEDL PW++TGITR M+E A +TAHFR+VI +G+ YV++Y+ SIQTRD FT+WGILQL+RWYPGKLPDLELMFD DDR
Subjt:  KSRSNREGSWK-ATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDR

Query:  PVVRSNDFLDPKSSPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWR---------------------------------------------------
        PVVRS DF+  +  PPP+FRYCSD+ SLDIVFPDWSFWGW E+N+KPW                                                    
Subjt:  PVVRSNDFLDPKSSPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWR---------------------------------------------------

Query:  ---MDWDKESKEGYKQSNLEDQCTHRW---------------------------------------------------------------------AEAI
            DWDKE+KEG+K SNLE+QCTHR+                                                                     A  I
Subjt:  ---MDWDKESKEGYKQSNLEDQCTHRW---------------------------------------------------------------------AEAI

Query:  GEEGSKYLHENLKMELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCD-LPPHDPKVLSEFPEKKLNALKQ
        GE GS+++ E + M+ VYDYMFHLL EY+ LLKF+P +P  A E+ P++M   A    + F  +S+  SPS+  PC+ LPP+DP  L E  E+K N  +Q
Subjt:  GEEGSKYLHENLKMELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCD-LPPHDPKVLSEFPEKKLNALKQ

Query:  VEIWEKEYWEN
        VE+WE +Y++N
Subjt:  VEIWEKEYWEN

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)6.6e-8240.69Show/hide
Query:  ATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPK
        ATCP +FRWIHEDL PW  TGITRE +E A+KTA FR+ IV G++YVEK++ + QTRDVFT+WG LQL+R YPGK+PDLELMFDC D PVVR+ +F    
Subjt:  ATCPAHFRWIHEDLGPWRETGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPK

Query:  S-SPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWR------------------------------------------------------MDWDKESK
        + SPPPLFRYC +E +LDIVFPDWSFWGW E+NIKPW                                                        DW KESK
Subjt:  S-SPPPLFRYCSDEFSLDIVFPDWSFWGWGEINIKPWR------------------------------------------------------MDWDKESK

Query:  EGYKQSNLEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHEN
        EGYKQS+L  QC HR+                                                                     A+ IG+  S ++ ++
Subjt:  EGYKQSNLEDQCTHRW---------------------------------------------------------------------AEAIGEEGSKYLHEN

Query:  LKMELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDL-PPHDPKVLSEFPEKKLNALKQVEIWEKEYWEN
        LKM+ VYDYM+HLL EYSKLL+F+P +P  AVE+  ETM     G  +KFM +SL K P+D+ PC + PP+DP    E  ++K +   ++  WE +YW  
Subjt:  LKMELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMEDSLEKSPSDTEPCDL-PPHDPKVLSEFPEKKLNALKQVEIWEKEYWEN

Query:  QRK
        Q +
Subjt:  QRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACGTAACAGTGATAAAAATCTCCACCAAATCTTCTGCTATCATTCTTCTTCCCTTCAACCTCCAAGATGGCGATGGCGTTCGTCCCACAACAAACCTTCC
TTCAAAACCTTTGCTTTCTTTCTCTTCTTCTTCCTCTTCTTTCTCTTCACCCTCTTCATTCGTCTCGGCTGGATTAACGCGGAAATAATATATTGGCGTTGGCGG
GGGGAAGGGGTTGAGAAGAGCAGGTCAAATCGGGAGGGTTCATGGAAGGCCACGTGTCCGGCGCACTTCCGTTGGATCCACGAAGATCTAGGGCCATGGAGAGAA
ACGGGGATAACGAGAGAAATGGTGGAAGGGGCCAGGAAAACGGCCCATTTCCGGGTAGTAATCGTAGATGGGAGGGTGTACGTAGAGAAATATAGAGGATCAATC
CAGACCAGAGATGTATTCACCATGTGGGGGATTCTGCAACTTGTGAGATGGTACCCGGGAAAATTGCCAGATCTTGAACTCATGTTTGACTGTGATGATCGCCCA
GTCGTCCGATCTAATGACTTTTTGGATCCAAAGTCGTCCCCACCTCCATTGTTCCGTTACTGCTCTGATGAGTTTAGTTTGGATATTGTTTTCCCTGATTGGTCC
TTTTGGGGATGGGGTGAAATAAACATAAAGCCTTGGAGAATGGATTGGGATAAAGAATCCAAAGAGGGGTACAAGCAATCAAATCTAGAAGATCAATGCACACAT
AGGTGGGCAGAAGCAATTGGAGAAGAAGGGAGCAAGTACTTACATGAGAATCTAAAGATGGAATTAGTCTATGATTACATGTTTCATTTACTAAACGAATACAGC
AAGCTTCTTAAATTCCGGCCATCGGTGCCGCCCGGCGCGGTGGAGCTAAATCCGGAGACGATGACCGGCGCCGCCGTGGGGTTGCATAAAAAATTTATGGAGGAT
TCGTTGGAGAAGTCTCCCAGCGACACAGAGCCGTGCGATTTGCCGCCTCATGATCCGAAGGTCCTCAGTGAGTTTCCGGAGAAAAAATTAAATGCCTTAAAGCAA
GTTGAAATTTGGGAAAAGGAATATTGGGAAAATCAGAGGAAAGGGAATTAA
mRNA sequenceShow/hide mRNA sequence
TTTGCCTTTAAATCACAATTTCAACCCTACCCCCATTTTTTCATTTCCTCAATTCTATTAATTTCATACCAAATTTATTTCTCTCTACAATTTTTAGTTCATAAT
TCAAATGAAACGTAACAGTGATAAAAATCTCCACCAAATCTTCTGCTATCATTCTTCTTCCCTTCAACCTCCAAGATGGCGATGGCGTTCGTCCCACAACAAACC
TTCCTTCAAAACCTTTGCTTTCTTTCTCTTCTTCTTCCTCTTCTTTCTCTTCACCCTCTTCATTCGTCTCGGCTGGATTAACGCGGAAATAATATATTGGCGTTG
GCGGGGGGAAGGGGTTGAGAAGAGCAGGTCAAATCGGGAGGGTTCATGGAAGGCCACGTGTCCGGCGCACTTCCGTTGGATCCACGAAGATCTAGGGCCATGGAG
AGAAACGGGGATAACGAGAGAAATGGTGGAAGGGGCCAGGAAAACGGCCCATTTCCGGGTAGTAATCGTAGATGGGAGGGTGTACGTAGAGAAATATAGAGGATC
AATCCAGACCAGAGATGTATTCACCATGTGGGGGATTCTGCAACTTGTGAGATGGTACCCGGGAAAATTGCCAGATCTTGAACTCATGTTTGACTGTGATGATCG
CCCAGTCGTCCGATCTAATGACTTTTTGGATCCAAAGTCGTCCCCACCTCCATTGTTCCGTTACTGCTCTGATGAGTTTAGTTTGGATATTGTTTTCCCTGATTG
GTCCTTTTGGGGATGGGGTGAAATAAACATAAAGCCTTGGAGAATGGATTGGGATAAAGAATCCAAAGAGGGGTACAAGCAATCAAATCTAGAAGATCAATGCAC
ACATAGGTGGGCAGAAGCAATTGGAGAAGAAGGGAGCAAGTACTTACATGAGAATCTAAAGATGGAATTAGTCTATGATTACATGTTTCATTTACTAAACGAATA
CAGCAAGCTTCTTAAATTCCGGCCATCGGTGCCGCCCGGCGCGGTGGAGCTAAATCCGGAGACGATGACCGGCGCCGCCGTGGGGTTGCATAAAAAATTTATGGA
GGATTCGTTGGAGAAGTCTCCCAGCGACACAGAGCCGTGCGATTTGCCGCCTCATGATCCGAAGGTCCTCAGTGAGTTTCCGGAGAAAAAATTAAATGCCTTAAA
GCAAGTTGAAATTTGGGAAAAGGAATATTGGGAAAATCAGAGGAAAGGGAATTAATTAGATTTGGGATTCCATTTGCTTCTGCCATATGAACTCTATTTTCTTTT
TCTTTTTTTAATGGAAAACTTTACTGTTGCAGTCACATTTTGAGATTATGAAATGTCTTTGTTCGATTTGGTTTAATAATAAATATAAGATTCATA
Protein sequenceShow/hide protein sequence
MKRNSDKNLHQIFCYHSSSLQPPRWRWRSSHNKPSFKTFAFFLFFFLFFLFTLFIRLGWINAEIIYWRWRGEGVEKSRSNREGSWKATCPAHFRWIHEDLGPWRE
TGITREMVEGARKTAHFRVVIVDGRVYVEKYRGSIQTRDVFTMWGILQLVRWYPGKLPDLELMFDCDDRPVVRSNDFLDPKSSPPPLFRYCSDEFSLDIVFPDWS
FWGWGEINIKPWRMDWDKESKEGYKQSNLEDQCTHRWAEAIGEEGSKYLHENLKMELVYDYMFHLLNEYSKLLKFRPSVPPGAVELNPETMTGAAVGLHKKFMED
SLEKSPSDTEPCDLPPHDPKVLSEFPEKKLNALKQVEIWEKEYWENQRKGN