; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G006540 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G006540
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionGlycosyltransferase
Genome locationchr08:14891710..14897246
RNA-Seq ExpressionLsi08G006540
SyntenyLsi08G006540
Gene Ontology termsGO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646630.1 hypothetical protein Csa_005349 [Cucumis sativus]1.3e-11260.58Show/hide
Query:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNL--SSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKR
        ++G H    KM+ILMLPWLAHGHVSPFLELSKLLAT+NF IFFCSTS+ILHSI+SK+PQ L  SSNIQLVELTLPTSADLP  RHTTAGLPSHLMFSLKR
Subjt:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNL--SSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKR

Query:  AFDSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRT
        AFDSAA+AFD +++++ PDL+IYDFLQPWAPAVALSA+IP VMFQCTGALMA MV   LKFPNSDF S FPEI LSE EIKQLKNLF  SVNDAKDKQR 
Subjt:  AFDSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRT

Query:  EECYERSCGNSRGSLSSRIRKRRRISRKLREMAKQKREKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEVSGEPRRREKEEHRRRTPKRVFRE
        EECY+RSCG                                        +LL+                                              +
Subjt:  EECYERSCGNSRGSLSSRIRKRRRISRKLREMAKQKREKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEVSGEPRRREKEEHRRRTPKRVFRE

Query:  SWRERNGGGRVGPTSPDLEAPEHWRVPQPLLAEHLGVGVVVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDE-EMEVVVEEIMKL
        S RE             +EA          L EHLGVGVVVERS GGRLCRREVARAVREVV EESGKRVREKVKE AKIMKEKGDE EMEVVVEEI KL
Subjt:  SWRERNGGGRVGPTSPDLEAPEHWRVPQPLLAEHLGVGVVVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDE-EMEVVVEEIMKL

Query:  CRRKKKGLQSN
        CRRK+KGLQSN
Subjt:  CRRKKKGLQSN

XP_004140986.1 UDP-glucosyltransferase 29 [Cucumis sativus]2.8e-11557.08Show/hide
Query:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNL--SSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKR
        ++G H    KM+ILMLPWLAHGHVSPFLELSKLLAT+NF IFFCSTS+ILHSI+SK+PQ L  SSNIQLVELTLPTSADLP  RHTTAGLPSHLMFSLKR
Subjt:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNL--SSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKR

Query:  AFDSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRT
        AFDSAA+AFD +++++ PDL+IYDFLQPWAPAVALSA+IP VMFQCTGALMA MV   LKFPNSDF S FPEI LSE EIKQLKNLF  SVNDAKDKQR 
Subjt:  AFDSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRT

Query:  EECYERSCG----NSRGSLSSRIRKRRRISRKLRE------MAKQKREKILHTRIFWK------RVLLIQIRNGRDRLWLRAEPPELHMGGEV-------
        EECY+RSCG     S   + ++       S +++       + +Q+ + ++    F K      +   I +  G +    + +  E+  G E+       
Subjt:  EECYERSCG----NSRGSLSSRIRKRRRISRKLRE------MAKQKREKILHTRIFWK------RVLLIQIRNGRDRLWLRAEPPELHMGGEV-------

Query:  ------SGEPRRREKEEH--RRRTPKRVFR---------ESW-------RERNGGG------------RVGPTSPDLEAPEHWRVPQPL---LAEHLGVG
              SGE   R+K+++      PK             E W       + R+ GG             +    P + AP   ++ QPL   L EHLGVG
Subjt:  ------SGEPRRREKEEH--RRRTPKRVFR---------ESW-------RERNGGG------------RVGPTSPDLEAPEHWRVPQPL---LAEHLGVG

Query:  VVVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDE-EMEVVVEEIMKLCRRKKKGLQSN
        VVVERS GGRLCRREVARAVREVV EESGKRVREKVKE AKIMKEKGDE EMEVVVEEI KLCRRK+KGLQSN
Subjt:  VVVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDE-EMEVVVEEIMKLCRRKKKGLQSN

XP_008456584.1 PREDICTED: beta-D-glucosyl crocetin beta-1,6-glucosyltransferase-like [Cucumis melo]6.9e-11457.29Show/hide
Query:  GHCRDEK--MRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNL--SSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRA
        GH R+E   M+ILMLPWLAHGHVSPFLELSKLLAT+NF IFFCSTS+ILHSIQSK+PQNL  SSNI+LVELTLPTSADLP  RHTTAGLP HLMFSLKRA
Subjt:  GHCRDEK--MRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNL--SSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRA

Query:  FDSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTE
        FDSAA+AFD++VR++ PDL+IYDFLQPWAPAVALSADIP VMFQCTGALMA +V   LKFPNSDF S+FPEIRLS  EIKQLKNLF  SVNDAKDKQR +
Subjt:  FDSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTE

Query:  ECYERSCG----NSRGSLSSRIRKRRRISRKLR---------------EMAKQKREKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEV-----
        ECYERSCG     S   + ++       S +++               E+  +  EK L+ +   ++   I +  G +    + +  E+  G E+     
Subjt:  ECYERSCG----NSRGSLSSRIRKRRRISRKLR---------------EMAKQKREKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEV-----

Query:  --------SGEPRRREKEEHRRRTPKRVFR---------ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVG
                SGE  R+++       PK             E W       + R+ GG     G +S         P + AP   ++ QPL   L EHLGVG
Subjt:  --------SGEPRRREKEEHRRRTPKRVFR---------ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVG

Query:  VVVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGD-EEMEVVVEEIMKLCRRKKKGLQSN
        VVVERS GGRLC  EVARAVREVV EESGK VREK+KEFAKIMKEKGD +EMEVV EEI KLCRRKKKGLQSN
Subjt:  VVVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGD-EEMEVVVEEIMKLCRRKKKGLQSN

XP_023543159.1 cyanidin-3-O-glucoside 2-O-glucuronosyltransferase-like, partial [Cucurbita pepo subsp. pepo]9.0e-9048.31Show/hide
Query:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAF
        MEG   R  K  +LMLPWLAHGHVSPF EL+K L  RNF I+FCST+ IL+SIQ  + ++LSS+I+LVEL LPTS+DLPPHRHTTAGLP HLMFSLKRAF
Subjt:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAF

Query:  DSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEE
        DSAAT F  ++R++ PDL+IYDFLQPWAP VA S+ IP VMFQ TGALMA MVKYEL++P SD SSIFPEIRL+E EIKQ+KNLF  SVNDA+D++R + 
Subjt:  DSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEE

Query:  CYERSCGNSRGSLSSRIRKRRRISRKLRE-MAKQKREKILHTRIFWKRVLLIQIRNGRDRLWL--------------------RAEPPELHMGGEVS---
        C ERSCG         ++  R I  K  + ++   R+K++      +      +   R   WL                    + +  E+  G E+S   
Subjt:  CYERSCGNSRGSLSSRIRKRRRISRKLRE-MAKQKREKILHTRIFWKRVLLIQIRNGRDRLWL--------------------RAEPPELHMGGEVS---

Query:  ------------GEPRRREKEEHRRRTPKRVFR-----ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVGV
                    GE ++  +EE  +   +RV       E W       + R  GG     G +S         P + AP   ++ QPL   L E L VGV
Subjt:  ------------GEPRRREKEEHRRRTPKRVFR-----ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVGV

Query:  VVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKKKG-LQSN
        VVER   GRL R+EVAR V+EV+VE+ G+RVR+KVKEFA+++K+KG+EEM++VVEE++KLC+R K+  LQS+
Subjt:  VVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKKKG-LQSN

XP_038885902.1 UDP-glucosyltransferase 29-like [Benincasa hispida]1.2e-12361.78Show/hide
Query:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAF
        MEGG+  +EK+RILMLPWLAHGHVSPFLELSKLLATRNF I FCSTSVILHSIQSK+PQNLSSNI+LVELTLPTSADLPPHRHTT GLPSHLMFSLKRAF
Subjt:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAF

Query:  DSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEE
        DSAA+AFDA+VR+V PDLLIYDFLQPWAPAVALSADIP VMFQCTGALMA MV Y LKF NSD  S FPEIR+SELEIKQL NLF CSVNDAKDKQR EE
Subjt:  DSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEE

Query:  CYERSCG------------NSRGSLSSRIRKRRRISRKLREMAKQ------KREKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEVS------
        C ERSCG                SLS+ ++K+      L E  +         EK L+ +   +R   I +  G +    + +  E+  G E+S      
Subjt:  CYERSCG------------NSRGSLSSRIRKRRRISRKLREMAKQ------KREKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEVS------

Query:  ---------GEPRRREKEEHRRRTPKRVFR-----ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVGVVVE
                 GE ++  +EE  +   +RV       E W       + R+ GG     G +S         P + AP   ++ QPL   L EHLGVGVVVE
Subjt:  ---------GEPRRREKEEHRRRTPKRVFR-----ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVGVVVE

Query:  RSGGGRLCRREVA---RAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKKKGLQSN
        RS GGRLCRREVA   RAVREVV EESGKRVREK KEFAKIMKEKGDEEMEVVVEEIMKLCRRKKKGLQSN
Subjt:  RSGGGRLCRREVA---RAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKKKGLQSN

TrEMBL top hitse value%identityAlignment
A0A0A0KE59 Glycosyltransferase1.4e-11557.08Show/hide
Query:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNL--SSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKR
        ++G H    KM+ILMLPWLAHGHVSPFLELSKLLAT+NF IFFCSTS+ILHSI+SK+PQ L  SSNIQLVELTLPTSADLP  RHTTAGLPSHLMFSLKR
Subjt:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNL--SSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKR

Query:  AFDSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRT
        AFDSAA+AFD +++++ PDL+IYDFLQPWAPAVALSA+IP VMFQCTGALMA MV   LKFPNSDF S FPEI LSE EIKQLKNLF  SVNDAKDKQR 
Subjt:  AFDSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRT

Query:  EECYERSCG----NSRGSLSSRIRKRRRISRKLRE------MAKQKREKILHTRIFWK------RVLLIQIRNGRDRLWLRAEPPELHMGGEV-------
        EECY+RSCG     S   + ++       S +++       + +Q+ + ++    F K      +   I +  G +    + +  E+  G E+       
Subjt:  EECYERSCG----NSRGSLSSRIRKRRRISRKLRE------MAKQKREKILHTRIFWK------RVLLIQIRNGRDRLWLRAEPPELHMGGEV-------

Query:  ------SGEPRRREKEEH--RRRTPKRVFR---------ESW-------RERNGGG------------RVGPTSPDLEAPEHWRVPQPL---LAEHLGVG
              SGE   R+K+++      PK             E W       + R+ GG             +    P + AP   ++ QPL   L EHLGVG
Subjt:  ------SGEPRRREKEEH--RRRTPKRVFR---------ESW-------RERNGGG------------RVGPTSPDLEAPEHWRVPQPL---LAEHLGVG

Query:  VVVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDE-EMEVVVEEIMKLCRRKKKGLQSN
        VVVERS GGRLCRREVARAVREVV EESGKRVREKVKE AKIMKEKGDE EMEVVVEEI KLCRRK+KGLQSN
Subjt:  VVVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDE-EMEVVVEEIMKLCRRKKKGLQSN

A0A1S3C496 Glycosyltransferase3.3e-11457.29Show/hide
Query:  GHCRDEK--MRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNL--SSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRA
        GH R+E   M+ILMLPWLAHGHVSPFLELSKLLAT+NF IFFCSTS+ILHSIQSK+PQNL  SSNI+LVELTLPTSADLP  RHTTAGLP HLMFSLKRA
Subjt:  GHCRDEK--MRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNL--SSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRA

Query:  FDSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTE
        FDSAA+AFD++VR++ PDL+IYDFLQPWAPAVALSADIP VMFQCTGALMA +V   LKFPNSDF S+FPEIRLS  EIKQLKNLF  SVNDAKDKQR +
Subjt:  FDSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTE

Query:  ECYERSCG----NSRGSLSSRIRKRRRISRKLR---------------EMAKQKREKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEV-----
        ECYERSCG     S   + ++       S +++               E+  +  EK L+ +   ++   I +  G +    + +  E+  G E+     
Subjt:  ECYERSCG----NSRGSLSSRIRKRRRISRKLR---------------EMAKQKREKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEV-----

Query:  --------SGEPRRREKEEHRRRTPKRVFR---------ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVG
                SGE  R+++       PK             E W       + R+ GG     G +S         P + AP   ++ QPL   L EHLGVG
Subjt:  --------SGEPRRREKEEHRRRTPKRVFR---------ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVG

Query:  VVVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGD-EEMEVVVEEIMKLCRRKKKGLQSN
        VVVERS GGRLC  EVARAVREVV EESGK VREK+KEFAKIMKEKGD +EMEVV EEI KLCRRKKKGLQSN
Subjt:  VVVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGD-EEMEVVVEEIMKLCRRKKKGLQSN

A0A6J1BWM7 Glycosyltransferase1.1e-7745.02Show/hide
Query:  GGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDS
        G H    +M ILMLPWLAHGHVSPF EL+KLLA +NF +FFCST+V L S+Q K    L+ N++ VEL LP S +LPP RHTTAGLP HLMFSLK AFD+
Subjt:  GGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDS

Query:  AATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEECY
        AA AF AV+R + PDLLIYDFLQPWAPA A +A IP VMF  T ALM   V + L+F +++  S+FPEIR SE EI+QLKN F  SVNDAKDK+R   C+
Subjt:  AATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEECY

Query:  ERSCGNSRGSLSSRIRKRRRISRKLREMAKQKREKILHTRIFWKRVLL------------------------IQIRNGRDRLWLRAEPPELHMGGEVS--
        ERSCG         ++  R I  K  +        +LH ++     L+                        I +  G +    + +  E+  G E+S  
Subjt:  ERSCGNSRGSLSSRIRKRRRISRKLREMAKQKREKILHTRIFWKRVLL------------------------IQIRNGRDRLWLRAEPPELHMGGEVS--

Query:  -----------GEPRRREKEEH------RRRTPKRVFRESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVGV
                   G  R++  EE        R   + +  E W       R R+ GG     G +S         P + AP H  + QPL   L E L V +
Subjt:  -----------GEPRRREKEEH------RRRTPKRVFRESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVGV

Query:  VVERSG--GGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKL
        +VER G  GG L R EVARA++EVVV++SG+R+R+K KE AK+MK+KG+EEMEVVVEE++KL
Subjt:  VVERSG--GGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKL

A0A6J1H6Y0 Glycosyltransferase7.5e-9048.6Show/hide
Query:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAF
        MEG   R  K  +LMLPWLAHGHVSPF EL+K L  RNF I+FCSTSVI++SIQS + ++LSS+I+LVEL LPTS+DLPP+RHTTAGLP HLMFSLKRAF
Subjt:  MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAF

Query:  DSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEE
        DSAA  F  ++ ++ PDL+IYDFLQPWAP VA S+ IP VMFQ TGALMA MVKYEL++P+SD SSIFP+IRL+E EIKQ+KNLF  SVNDA+D++R +E
Subjt:  DSAATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEE

Query:  CYERSCG--------NSRGS----LSSRIRKR-----RRISRKLREMAKQKR-EKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEVS------
        C ERSCG           G     LS  +RK+       +     ++  ++R EK L+ +   +    + +  G +    + +  E+  G E+S      
Subjt:  CYERSCG--------NSRGS----LSSRIRKR-----RRISRKLREMAKQKR-EKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEVS------

Query:  ---------GEPRRREKEEHRRRTPKRVFR-----ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVGVVVE
                 GE ++  +EE  +   +RV       E W       + R  GG     G +S         P + AP   ++ QPL   L E L VGVV+E
Subjt:  ---------GEPRRREKEEHRRRTPKRVFR-----ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVGVVVE

Query:  RSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKKK
        R   GRL R+EVAR V+EV+VE+ G+RVR+KVKEFA+++K+KGDEEM++VVEE++KLC+  K+
Subjt:  RSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKKK

A0A6J1JJU7 Glycosyltransferase1.7e-8647Show/hide
Query:  RDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATA
        R  K  +LMLPWLAHGHVSPF EL+K L  RNF I+FCSTS+IL+SIQ  + ++L S+I+LVEL LPTS+DLP + HTTAGLP HLMFSLK+AFDSAA+A
Subjt:  RDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATA

Query:  FDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEECYERSC
        F  ++ ++ PDL+IYDFLQPWAPAVA S+ IP VMFQ TGALMA MVKYEL++P S+ SSIFPEIRL+E EIKQ+KNLF  SVNDA+D++R +EC ERSC
Subjt:  FDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEECYERSC

Query:  GNSRGSLSSRIRKRRRISRKLRE-MAKQKREKILHTRIFWKRVLLIQIRNGRDRLWL--------------------RAEPPELHMGGEVS---------
        G         ++  R I  K  + ++   R+K++      +      +   R   WL                    + +  E+  G E+S         
Subjt:  GNSRGSLSSRIRKRRRISRKLRE-MAKQKREKILHTRIFWKRVLLIQIRNGRDRLWL--------------------RAEPPELHMGGEVS---------

Query:  ------GEPRRREKEEHRRRTPKRVFR-----ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVGVVVERSG
              GE ++  +EE  +   +RV       E W       + R  GG     G +S         P + AP   ++ QPL   L E L  GVV+ER  
Subjt:  ------GEPRRREKEEHRRRTPKRVFR-----ESW-------RERNGGG---RVGPTS---------PDLEAPEHWRVPQPL---LAEHLGVGVVVERSG

Query:  GGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKKK-GLQSN
         GRL  +EVAR V+EV+VE+ G+RVR+KVKEFA+++K+KGDEEM++VVEE++KLC+R K+  LQS+
Subjt:  GGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKKK-GLQSN

SwissProt top hitse value%identityAlignment
A0A0A6ZFY4 UDP-glucosyltransferase 294.7e-4132.45Show/hide
Query:  KMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATAFDA
        ++ I +LP+LAHGH+SPF EL+K LA RN  +F CST + L SI+ K   + S++I+LVEL LP+S DLPPH HTT GLPSHLM  L+ AF++A   F  
Subjt:  KMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATAFDA

Query:  VVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNS--------DFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEEC
        +++ + PDLLIYDF   WAP +A S +IP V F  T A  + +  +  K P          D S+I PE   ++  +K L +  +              C
Subjt:  VVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNS--------DFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEEC

Query:  YERSC------------GNSRGSLSSRIRKR-----RRISRKLREMAKQKREKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEVS--------
        +ERSC            G     LS+   K        +   +      K E+I++         ++ +  G +      E  E+ +G E+S        
Subjt:  YERSC------------GNSRGSLSSRIRKR-----RRISRKLREMAKQKREKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEVS--------

Query:  ----GEPRRREKEEHRRRTPKR-VFRESWRER-------NGGGRVGPTS------------PDLEAPEHWRVPQPL---LAEHLGVGVVVERSGGGRLCR
            GE +    E   +R   R +  E W  +       + GG V                P +    H  + QPL   LA  +GVG+ V R   G+  R
Subjt:  ----GEPRRREKEEHRRRTPKR-VFRESWRER-------NGGGRVGPTS------------PDLEAPEHWRVPQPL---LAEHLGVGVVVERSGGGRLCR

Query:  REVARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKK
          +A  +R+VVVE+SG+ +R K +E ++ MKEKG++E++  +EE++++C++KK
Subjt:  REVARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKK

F8WKW8 Beta-D-glucosyl crocetin beta-1,6-glucosyltransferase2.4e-2929.09Show/hide
Query:  MLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATAFDAVVRDV
        M PWLA+GH+SP+LEL+K L  R F I+ CST + L  I+ +I    S  I+LVEL LP + +LPPH HTT GLP HLM +LKRA + A      +++ +
Subjt:  MLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATAFDAVVRDV

Query:  GPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEECYERSCGNSRGSLS
         PD +IYD  Q W  A+ ++ +IP V F  +   M     +    P  +F   FP I LS+ E  + +     +  DA++     E   R C +     S
Subjt:  GPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEECYERSCGNSRGSLS

Query:  SRI--------------RKRRRISRKLREMAKQKREKILHTRIFW----KRVLLIQIRNGRDRLWLRAEPPELHMGGEVS-----GEPRRREKEEHRRRT
        SR                K   +   + E  K  +    +  I W     +   + +  G +    + E  E+  G E+S        R    ++ R   
Subjt:  SRI--------------RKRRRISRKLREMAKQKREKILHTRIFW----KRVLLIQIRNGRDRLWLRAEPPELHMGGEVS-----GEPRRREKEEHRRRT

Query:  PKRVFRESWRERNGG-GRV----GPTSPDLEAPE-----------------HWRVP---------QPL---LAEHLGVGVVVERSGGGRLCRREVARAVR
        P     E + ER G  GR+     P S  L  P                   + VP         QPL   L   +G G+ V R   G+  R+E+ARA++
Subjt:  PKRVFRESWRERNGG-GRV----GPTSPDLEAPE-----------------HWRVP---------QPL---LAEHLGVGVVVERSGGGRLCRREVARAVR

Query:  EVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKL
        + +VE++G+  R K+ +    ++ K  +E++ V E + +L
Subjt:  EVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKL

Q5NTH0 Cyanidin-3-O-glucoside 2-O-glucuronosyltransferase7.8e-2828.64Show/hide
Query:  RILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATAFDAVV
        R++MLPWLA+ H+S FL  +K L   NF I+ CS+   +  +++ +    S +IQL+EL LP+S++LP   HTT GLP HL  +L   +  +   F+ ++
Subjt:  RILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATAFDAVV

Query:  RDVGPDLLIYDFLQPWAPAVALSADIPPV--MFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEECYERSC---
          + P L+IYDF Q WAP VA +  IP +  +  C          Y      +     FPEI     +I +           +K  +R  +C  RSC   
Subjt:  RDVGPDLLIYDFLQPWAPAVALSADIPPV--MFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEECYERSC---

Query:  ---------GNSRGSLSSRIRKR-RRISRKLREMAKQKREKILHTRIFWKR--VLLIQIRNGRDRLWLRAEPPELHMGGEVSGEP-----RRREKEEH--
                 G     LS  + K+   +   ++E +  + + I   +   K+    ++ +  G + +    E  ++  G E+S        R +    +  
Subjt:  ---------GNSRGSLSSRIRKR-RRISRKLREMAKQKREKILHTRIFWKR--VLLIQIRNGRDRLWLRAEPPELHMGGEVSGEP-----RRREKEEH--

Query:  -RRRTPKRVFRESW--------RERNGG--GRVGPTS---------PDLEAPEHWRVP-QPLLAEHLGVGVVVERSGGGRLCRREVARAVREVVVEESGK
          R   K +  + W            GG     G +S         P +  P  +  P    L E +G G+ V R G GRL R E+A  VR+VVVE+SG+
Subjt:  -RRRTPKRVFRESW--------RERNGG--GRVGPTS---------PDLEAPEHWRVP-QPLLAEHLGVGVVVERSGGGRLCRREVARAVREVVVEESGK

Query:  RVREKVKEFAKIMKEKGDEEME-VVVEEIMKLC
         +REK KE  +IMK+  + E++ +V+E ++KLC
Subjt:  RVREKVKEFAKIMKEKGDEEME-VVVEEIMKLC

Q8GVE3 Flavanone 7-O-glucoside 2''-O-beta-L-rhamnosyltransferase3.4e-3128.6Show/hide
Query:  EKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLP-TSADLPPHRHTTAGLPSHLMFSLKRAFDSAATAF
        +K  ILMLPWLAHGH++P LEL+K L+ +NF I+FCST   L S    + +N SS+IQL+EL LP T  +LP    TT  LP HL+++L  AF+ A  AF
Subjt:  EKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLP-TSADLPPHRHTTAGLPSHLMFSLKRAFDSAATAF

Query:  DAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEECYERSC-
          ++  + P L++YD  QPWA   A   DI  ++F    A+    + + +  P+  +   F E    + E K +      + N   +K R  + +E SC 
Subjt:  DAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEECYERSC-

Query:  ----------------------GNSRGSLSSRIRKR--RRISRKLREMAKQKREKIL-----HTRIFWKRVLLIQIRNGR-----DRLWLRAEPPELHMG
                              GN    +   I++   +    K+ +   QK  + +      +  F  +  + +I +G      + +W     P+  M 
Subjt:  ----------------------GNSRGSLSSRIRKR--RRISRKLREMAKQKREKIL-----HTRIFWKRVLLIQIRNGR-----DRLWLRAEPPELHMG

Query:  GEVSGEPRRREKEEHRRRTPKRVFRESW-------RERNGGGRVGPTS------------PDLEAPEHWRVP-QPLLAEHLGVGVVVERSG-GGRLCRRE
         E   E   +   E   R  K +  + W       R  + GG +                P +  P  +  P    +    G+G+VV R     RL   E
Subjt:  GEVSGEPRRREKEEHRRRTPKRVFRESW-------RERNGGGRVGPTS------------PDLEAPEHWRVP-QPLLAEHLGVGVVVERSG-GGRLCRRE

Query:  VARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKK
        VAR ++ VV++E  K++R K  E ++ MK+ GD EM VVVE++++L ++ +
Subjt:  VARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKK

Q9LTA3 UDP-glycosyltransferase 91C11.5e-1837.59Show/hide
Query:  RDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATA
        R+E M + M PWLA GH+ PFL LSKLLA +  +I F ST   +  +  K+  NL+S+I  V   LP  + LPP   ++  +P +   SLK AFD     
Subjt:  RDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATA

Query:  FDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGA
            +R   PD +IYD+   W P++A    I    F    A
Subjt:  FDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGA

Arabidopsis top hitse value%identityAlignment
AT2G22590.1 UDP-Glycosyltransferase superfamily protein4.9e-1732.88Show/hide
Query:  KMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSAD-LPPHRHTTAGLPSHLMFSLKRAFDSAATAFD
        K+ ++M PWLA GH+ P+LELSKL+A +  ++ F ST   +  +  ++P+NLSS I  V+L+LP   + LP     T  +P  L+  LK A+D       
Subjt:  KMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSAD-LPPHRHTTAGLPSHLMFSLKRAFDSAATAFD

Query:  AVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVK
          +    PD ++ DF   W P ++    I    F        G++K
Subjt:  AVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVK

AT4G09500.1 UDP-Glycosyltransferase superfamily protein3.4e-1030.66Show/hide
Query:  DEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQN--LSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAAT
        + K    M PW A GH+ PFL L+  LA +  ++ F    ++    Q ++  +     +I    LT+P    LP    TT+ +P  L   L +A D    
Subjt:  DEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQN--LSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAAT

Query:  AFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMF
          +A VR + PDL+ +DF Q W P +A    I  V +
Subjt:  AFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMF

AT5G49690.1 UDP-Glycosyltransferase superfamily protein1.1e-1937.59Show/hide
Query:  RDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATA
        R+E M + M PWLA GH+ PFL LSKLLA +  +I F ST   +  +  K+  NL+S+I  V   LP  + LPP   ++  +P +   SLK AFD     
Subjt:  RDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATA

Query:  FDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGA
            +R   PD +IYD+   W P++A    I    F    A
Subjt:  FDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGA

AT5G54060.1 UDP-glucose:flavonoid 3-o-glucosyltransferase3.3e-1332.41Show/hide
Query:  GHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSN-IQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDS
        G      M I+M PWLA GH++PFL LS  LA +  +I F      L+ ++   P NL  N I    +++P    LPP   T + +P  L   L  A D 
Subjt:  GHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSN-IQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDS

Query:  AATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGA
             + + R + PDL+ YD    W P +A       V F    A
Subjt:  AATAFDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGA

AT5G65550.1 UDP-Glycosyltransferase superfamily protein2.3e-1433.09Show/hide
Query:  KMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSA---DLPPHRHTTAGLPSHLMFSLKRAFDSAATA
        K+ + + PWLA GH+ P+L+LSKL+A +   + F ST+  +    S++P N+SS++ +  ++LP S     LP +   T  +P   +  LK+AFD  + A
Subjt:  KMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSA---DLPPHRHTTAGLPSHLMFSLKRAFDSAATA

Query:  FDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCT
        F   +    P+ ++YD L  W P +A    +   +F CT
Subjt:  FDAVVRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGTGGCCATTGTAGAGATGAGAAAATGAGAATTTTGATGCTCCCATGGCTTGCTCATGGCCATGTTTCCCCTTTCTTAGAGCTCTCAAAGTTGCTTGCTACTAG
AAACTTTCAAATATTCTTCTGTTCCACTTCTGTAATTCTTCACTCCATTCAATCAAAAATCCCTCAAAATCTCTCCTCCAATATACAGCTTGTCGAGTTGACCTTGCCAA
CGTCGGCCGACCTCCCGCCGCACCGCCACACCACCGCCGGCCTCCCGTCCCATCTTATGTTCTCGCTCAAGCGAGCATTCGACTCGGCTGCCACCGCCTTCGATGCTGTC
GTTCGTGACGTGGGACCGGACTTGCTTATCTATGACTTCTTGCAGCCGTGGGCTCCGGCTGTGGCTCTCTCAGCTGATATTCCACCAGTCATGTTTCAATGCACAGGTGC
TCTTATGGCGGGCATGGTAAAATACGAGCTAAAGTTTCCAAATTCAGATTTTTCTTCGATATTTCCTGAAATTCGTCTCTCTGAGTTGGAGATTAAACAGCTGAAGAACT
TGTTTAGTTGTTCAGTGAATGATGCAAAAGACAAGCAAAGAACTGAGGAATGTTATGAGAGATCTTGCGGTAATTCCCGTGGGTCCCTTAGTTCAAGAATCAGAAAACGA
CGTCGTATTAGCAGGAAGCTTCGAGAAATGGCTAAACAAAAAAGAGAGAAAATCTTGCATACTCGTATCTTTTGGAAGCGAGTTTTACTTATCCAAATTAGAAATGGAAG
AGATCGCTTATGGCTTAGAGCTGAGCCGCCTGAACTTCATATGGGTGGTGAGGTTTCCGGCGAGCCGCGGAGGAGAGAGAAAGAAGAACATAGAAGAAGAACTCCCAAAA
GGGTTTTTAGAGAGAGTTGGAGAGAGAGGAATGGTGGTGGAAGGGTGGGTCCCACAAGCCCAGATCTTGAAGCACCGGAGCACTGGCGGGTTCCTCAGCCACTGTTAGCA
GAGCACCTTGGTGTCGGTGTCGTGGTGGAGAGAAGTGGTGGTGGTAGGCTATGCCGGAGAGAGGTGGCGAGAGCTGTGAGAGAGGTGGTGGTGGAGGAAAGTGGGAAGAG
AGTGAGGGAGAAGGTGAAGGAGTTTGCAAAGATTATGAAGGAGAAAGGTGATGAAGAAATGGAGGTTGTTGTGGAAGAGATAATGAAGCTATGTAGGAGGAAGAAGAAGG
GTTTACAAAGCAATATATTGGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGTGGCCATTGTAGAGATGAGAAAATGAGAATTTTGATGCTCCCATGGCTTGCTCATGGCCATGTTTCCCCTTTCTTAGAGCTCTCAAAGTTGCTTGCTACTAG
AAACTTTCAAATATTCTTCTGTTCCACTTCTGTAATTCTTCACTCCATTCAATCAAAAATCCCTCAAAATCTCTCCTCCAATATACAGCTTGTCGAGTTGACCTTGCCAA
CGTCGGCCGACCTCCCGCCGCACCGCCACACCACCGCCGGCCTCCCGTCCCATCTTATGTTCTCGCTCAAGCGAGCATTCGACTCGGCTGCCACCGCCTTCGATGCTGTC
GTTCGTGACGTGGGACCGGACTTGCTTATCTATGACTTCTTGCAGCCGTGGGCTCCGGCTGTGGCTCTCTCAGCTGATATTCCACCAGTCATGTTTCAATGCACAGGTGC
TCTTATGGCGGGCATGGTAAAATACGAGCTAAAGTTTCCAAATTCAGATTTTTCTTCGATATTTCCTGAAATTCGTCTCTCTGAGTTGGAGATTAAACAGCTGAAGAACT
TGTTTAGTTGTTCAGTGAATGATGCAAAAGACAAGCAAAGAACTGAGGAATGTTATGAGAGATCTTGCGGTAATTCCCGTGGGTCCCTTAGTTCAAGAATCAGAAAACGA
CGTCGTATTAGCAGGAAGCTTCGAGAAATGGCTAAACAAAAAAGAGAGAAAATCTTGCATACTCGTATCTTTTGGAAGCGAGTTTTACTTATCCAAATTAGAAATGGAAG
AGATCGCTTATGGCTTAGAGCTGAGCCGCCTGAACTTCATATGGGTGGTGAGGTTTCCGGCGAGCCGCGGAGGAGAGAGAAAGAAGAACATAGAAGAAGAACTCCCAAAA
GGGTTTTTAGAGAGAGTTGGAGAGAGAGGAATGGTGGTGGAAGGGTGGGTCCCACAAGCCCAGATCTTGAAGCACCGGAGCACTGGCGGGTTCCTCAGCCACTGTTAGCA
GAGCACCTTGGTGTCGGTGTCGTGGTGGAGAGAAGTGGTGGTGGTAGGCTATGCCGGAGAGAGGTGGCGAGAGCTGTGAGAGAGGTGGTGGTGGAGGAAAGTGGGAAGAG
AGTGAGGGAGAAGGTGAAGGAGTTTGCAAAGATTATGAAGGAGAAAGGTGATGAAGAAATGGAGGTTGTTGTGGAAGAGATAATGAAGCTATGTAGGAGGAAGAAGAAGG
GTTTACAAAGCAATATATTGGTGTAG
Protein sequenceShow/hide protein sequence
MEGGHCRDEKMRILMLPWLAHGHVSPFLELSKLLATRNFQIFFCSTSVILHSIQSKIPQNLSSNIQLVELTLPTSADLPPHRHTTAGLPSHLMFSLKRAFDSAATAFDAV
VRDVGPDLLIYDFLQPWAPAVALSADIPPVMFQCTGALMAGMVKYELKFPNSDFSSIFPEIRLSELEIKQLKNLFSCSVNDAKDKQRTEECYERSCGNSRGSLSSRIRKR
RRISRKLREMAKQKREKILHTRIFWKRVLLIQIRNGRDRLWLRAEPPELHMGGEVSGEPRRREKEEHRRRTPKRVFRESWRERNGGGRVGPTSPDLEAPEHWRVPQPLLA
EHLGVGVVVERSGGGRLCRREVARAVREVVVEESGKRVREKVKEFAKIMKEKGDEEMEVVVEEIMKLCRRKKKGLQSNILV