; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G002760 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G002760
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionCAP10 domain-containing protein
Genome locationchr09:2891718..2896780
RNA-Seq ExpressionLsi09G002760
SyntenyLsi09G002760
Gene Ontology termsNA
InterPro domainsIPR006598 - Glycosyl transferase CAP10 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB1222531.1 O-glucosyltransferase rumi [Morella rubra]6.4e-9843.33Show/hide
Query:  RPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES
        R  W+P L K    VA P A +F +  L   + ISS RL ++  S +   N T  +S K     K I  +PLNCS     NQTQ   C  NYPT F P+ 
Subjt:  RPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES

Query:  IDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSA
        +DPS  PVC  YFRWIH+DL PW A GITREMVEK K TAHFRL VV G+ Y+E YKKSIQTRD+FTIWGILQLLRRYPGRIPDLELMFDC+D PV++S 
Subjt:  IDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSA

Query:  DYRTAAVETAGPPPVFRYCGDEETLDIVFPDWI--GFESLN-----------KVISNRT----------WQAN---------------------------
        DYR     T GPPP+FRYCGD  T DIVFPDW   G+  +N           K  +NR           W+ N                           
Subjt:  DYRTAAVETAGPPPVFRYCGDEETLDIVFPDWI--GFESLN-----------KVISNRT----------WQAN---------------------------

Query:  ----------------------------------------------------------------------------------------------APIGKA
                                                                                                        IGKA
Subjt:  ----------------------------------------------------------------------------------------------APIGKA

Query:  ASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEK
        +SDFIQ+ELKM+ VYDYMFH+LN YAKLL+F P+IP GAVE+CSET+AC   G+EKKFM ES+VK PS+T PC+MPPPF+   L  LYRRN N+I+QV K
Subjt:  ASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEK

Query:  WENEFW------NKNKP
        WEN +W       K+KP
Subjt:  WENEFW------NKNKP

XP_004143920.1 O-glucosyltransferase rumi homolog isoform X1 [Cucumis sativus]6.7e-15659.39Show/hide
Query:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT
        MQRFPISTSI  P RRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKKSIKYYPLNCSSSSTTNQTQ+FT
Subjt:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT

Query:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL
        CRK+YPT +EPESI PS R VC EYFRWIHEDL+PWAAGGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWGILQLLRRYPG+IPDLEL
Subjt:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWIGF--------------ESLNKVISNRTW--------------------------
        MFDCDDRPVVKSADYR A V+T   PPVFRYCGDEETLDIVFPDW  +              + L K    R W                          
Subjt:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWIGF--------------ESLNKVISNRTW--------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----QANAPIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL
            Q    IGK AS+FIQQEL+MENVYDYMFHLLN YAKLLRF+PEIP GA+E+CSET+ACPRDG EKKFM+ESMVKTPSLTIPCSMPPPFDTPSLQRL
Subjt:  ----QANAPIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL

Query:  YRRNANLIRQVEKWENEFWNKN
        YRRNANLI QVEKWEN FW +N
Subjt:  YRRNANLIRQVEKWENEFWNKN

XP_031740971.1 O-glucosyltransferase rumi homolog isoform X2 [Cucumis sativus]6.5e-15962.5Show/hide
Query:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT
        MQRFPISTSI  P RRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKKSIKYYPLNCSSSSTTNQTQ+FT
Subjt:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT

Query:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL
        CRK+YPT +EPESI PS R VC EYFRWIHEDL+PWAAGGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWGILQLLRRYPG+IPDLEL
Subjt:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWIGF--------------ESLNKVISNRTW--------------------------
        MFDCDDRPVVKSADYR A V+T   PPVFRYCGDEETLDIVFPDW  +              + L K    R W                          
Subjt:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWIGF--------------ESLNKVISNRTW--------------------------

Query:  ------------------------------------------------------------------------------QANAPIGKAASDFIQQELKMEN
                                                                                      Q    IGK AS+FIQQEL+MEN
Subjt:  ------------------------------------------------------------------------------QANAPIGKAASDFIQQELKMEN

Query:  VYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNKN
        VYDYMFHLLN YAKLLRF+PEIP GA+E+CSET+ACPRDG EKKFM+ESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLI QVEKWEN FW +N
Subjt:  VYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNKN

XP_031740972.1 O-glucosyltransferase rumi homolog isoform X3 [Cucumis sativus]4.5e-16063.79Show/hide
Query:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT
        MQRFPISTSI  P RRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKKSIKYYPLNCSSSSTTNQTQ+FT
Subjt:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT

Query:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL
        CRK+YPT +EPESI PS R VC EYFRWIHEDL+PWAAGGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWGILQLLRRYPG+IPDLEL
Subjt:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWIGF--------------ESLNKVISNRTW--------------------------
        MFDCDDRPVVKSADYR A V+T   PPVFRYCGDEETLDIVFPDW  +              + L K    R W                          
Subjt:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWIGF--------------ESLNKVISNRTW--------------------------

Query:  --------------------------------------------------------------------QANAPIGKAASDFIQQELKMENVYDYMFHLLN
                                                                            Q    IGK AS+FIQQEL+MENVYDYMFHLLN
Subjt:  --------------------------------------------------------------------QANAPIGKAASDFIQQELKMENVYDYMFHLLN

Query:  QYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNKN
         YAKLLRF+PEIP GA+E+CSET+ACPRDG EKKFM+ESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLI QVEKWEN FW +N
Subjt:  QYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNKN

XP_038875037.1 protein O-glucosyltransferase 1-like [Benincasa hispida]5.8e-17663.55Show/hide
Query:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT
        MQRFPIST I RP RRPIWQPSLPKRSV VAVPAAA+FLLA LS AV+ISS RLQSTLFS NFLGNQTE ISPKIPKKSIKYYPL+CSSSSTTNQTQNF 
Subjt:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT

Query:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL
        CRKNYPT+FEPES D S RP+C EYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLR+YPGR+PDLEL
Subjt:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWI--GFESLN-----KVI-----SNRTW----------------------------
        MFDCDDRPVVKSADY+TA +ETAGPPP+FRYCGDE+TLDIVFPDW   G+E +N     K++      N  W                            
Subjt:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWI--GFESLN-----KVI-----SNRTW----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----QANAPIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL
            Q    IGKAASDFIQQ+LKMENVYDYMFHLLNQYAKLLRFRP IP GAVE+CSET+ACPRDGMEKKFM+ESMVKTPSLTIPCSMPPPFDTPSLQRL
Subjt:  ----QANAPIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL

Query:  YRRNANLIRQVEKWENEFWNKNKP
        YRRNANLIRQVEKWE+EFWN+NKP
Subjt:  YRRNANLIRQVEKWENEFWNKNKP

TrEMBL top hitse value%identityAlignment
A0A0A0KQX4 CAP10 domain-containing protein3.2e-15659.39Show/hide
Query:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT
        MQRFPISTSI  P RRPIW+ SL KRS  VAVPAAAIF L    AAVLISSTRLQ TLF  NFLGNQTE    KIPKKSIKYYPLNCSSSSTTNQTQ+FT
Subjt:  MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFT

Query:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL
        CRK+YPT +EPESI PS R VC EYFRWIHEDL+PWAAGGITREMVEKGKATAHFRLAVV G VYVEHYKKSIQTRDLFTIWGILQLLRRYPG+IPDLEL
Subjt:  CRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLEL

Query:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWIGF--------------ESLNKVISNRTW--------------------------
        MFDCDDRPVVKSADYR A V+T   PPVFRYCGDEETLDIVFPDW  +              + L K    R W                          
Subjt:  MFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWIGF--------------ESLNKVISNRTW--------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----QANAPIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL
            Q    IGK AS+FIQQEL+MENVYDYMFHLLN YAKLLRF+PEIP GA+E+CSET+ACPRDG EKKFM+ESMVKTPSLTIPCSMPPPFDTPSLQRL
Subjt:  ----QANAPIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRL

Query:  YRRNANLIRQVEKWENEFWNKN
        YRRNANLI QVEKWEN FW +N
Subjt:  YRRNANLIRQVEKWENEFWNKN

A0A2N9J2X5 CAP10 domain-containing protein8.7e-10143.65Show/hide
Query:  WQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES-ID
        WQP L K        A  I    TL     IS+ RL ++ F      N T TI  K  IP K I  +PLNCS     NQTQ  TC  NYP  F P + +D
Subjt:  WQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES-ID

Query:  PSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADY
        PS  PVC +YFRWIHEDLRPW   GITR+MVEK K++AHFRL +V G+ Y+E YKKSIQTRD+FTIWGILQLLRRYPGR+PD+ELMFDCDDRPV+KSADY
Subjt:  PSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADY

Query:  RTAAVETAGPPPVFRYCGDEETLDIVFPDWI--GFESLN---------------------------------------------KVISNRTWQAN-----
        R A +   GPPP+FRYCGD  T+DIVFPDW   G+  +N                                              V   + W A      
Subjt:  RTAAVETAGPPPVFRYCGDEETLDIVFPDWI--GFESLN---------------------------------------------KVISNRTWQAN-----

Query:  --------------------------------------------------------------------------------------------APIGKAAS
                                                                                                      IGKA+S
Subjt:  --------------------------------------------------------------------------------------------APIGKAAS

Query:  DFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWE
        DFIQ+ELKM+ VYDYMFHLLN+YAKLL+F P+IP GA+E+CSET AC  +G EKKFM ES+VK PSLT PC+MPPP++   L  L RRN N+IRQVEKWE
Subjt:  DFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWE

Query:  NEFW
        N++W
Subjt:  NEFW

A0A5J4ZS00 CAP10 domain-containing protein4.5e-9748.29Show/hide
Query:  QPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKY-YPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSD
        Q  L KR  T     A +  L  LSAAV IS++ +  + F +  +  +   IS +  + +++   PLNCS+    N TQ  TC  NYPT FE E  DPS 
Subjt:  QPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKY-YPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSD

Query:  RPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTA
           C +YFRWIHEDLRP+ + GITR+MVE+   TAHFR+ +V GRVYVE YKKSIQTRD+FT  GILQLLRRYPGR+PDLE+MFDCDDRPV++S DYR  
Subjt:  RPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTA

Query:  AVETAGPPPVFRYCGDEETLDIVFPDW-------IGFESLNKVI-------SNRTWQANAP--------------------------------------I
              PPP+FRYCGD   LDIVFPDW       I  +  N ++         + W    P                                      I
Subjt:  AVETAGPPPVFRYCGDEETLDIVFPDW-------IGFESLNKVI-------SNRTWQANAP--------------------------------------I

Query:  GKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQ
        GKAASDFIQ+ELKME VYDYMFHL N+YAKLL+   ++P GAVE CSET+ACP +G+EK+ M ES+VK  S+T PC+MPPP+D  +L    +R AN I+Q
Subjt:  GKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQ

Query:  VEKWENEFWN
        VE WE  +W+
Subjt:  VEKWENEFWN

A0A5J4ZTU4 CAP10 domain-containing protein2.2e-9649.63Show/hide
Query:  IWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPS
        IWQ  L KR  T     A++  L+ L AAVL  S+ +  + FS+    N TE    +   K I+  PL+CS+ + T      TC  NYP  FE E  DPS
Subjt:  IWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPS

Query:  DRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRT
            C +YFRWIHEDLRP+ + GITR+MVE+ K TAHFR+ +V GR+YVE YKKSIQTRD+FT+WGILQLLRRYPGR+PDLE+MFDC+DRPV++S DYR 
Subjt:  DRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRT

Query:  AAVETAGPPPVFRYCGDEETLDIVFPDWI----------GFESLNKVI----SNRTWQANAP--------------------------------------
               PPP+FRYCGD   LDIVFPDW            +ES+ K +    S + W A  P                                      
Subjt:  AAVETAGPPPVFRYCGDEETLDIVFPDWI----------GFESLNKVI----SNRTWQANAP--------------------------------------

Query:  IGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIR
        IGKAASDFIQQELKM+ VYDYMFHLLN+YAKLL+F+PE+P  AVE CSET+ACP DG+EK+FM ES+VK PS+T PC++PPP+D  +L  L R     + 
Subjt:  IGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIR

Query:  QVEKWEN
        Q  KW +
Subjt:  QVEKWEN

A0A6A1WB87 O-glucosyltransferase rumi3.1e-9843.33Show/hide
Query:  RPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES
        R  W+P L K    VA P A +F +  L   + ISS RL ++  S +   N T  +S K     K I  +PLNCS     NQTQ   C  NYPT F P+ 
Subjt:  RPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPK--IPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPES

Query:  IDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSA
        +DPS  PVC  YFRWIH+DL PW A GITREMVEK K TAHFRL VV G+ Y+E YKKSIQTRD+FTIWGILQLLRRYPGRIPDLELMFDC+D PV++S 
Subjt:  IDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSA

Query:  DYRTAAVETAGPPPVFRYCGDEETLDIVFPDWI--GFESLN-----------KVISNRT----------WQAN---------------------------
        DYR     T GPPP+FRYCGD  T DIVFPDW   G+  +N           K  +NR           W+ N                           
Subjt:  DYRTAAVETAGPPPVFRYCGDEETLDIVFPDWI--GFESLN-----------KVISNRT----------WQAN---------------------------

Query:  ----------------------------------------------------------------------------------------------APIGKA
                                                                                                        IGKA
Subjt:  ----------------------------------------------------------------------------------------------APIGKA

Query:  ASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEK
        +SDFIQ+ELKM+ VYDYMFH+LN YAKLL+F P+IP GAVE+CSET+AC   G+EKKFM ES+VK PS+T PC+MPPPF+   L  LYRRN N+I+QV K
Subjt:  ASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEK

Query:  WENEFW------NKNKP
        WEN +W       K+KP
Subjt:  WENEFW------NKNKP

SwissProt top hitse value%identityAlignment
A0NDG6 O-glucosyltransferase rumi homolog3.3e-0426.36Show/hide
Query:  CSEYFRWIHEDLRPWAAGGITREMVEKGKA-TAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAV
        C+ +   +  DL+P+ A GIT+EM+ + K    H++  V+G ++Y     +  +        G+   +R     +PD++L+ +C D P +    +R  + 
Subjt:  CSEYFRWIHEDLRPWAAGGITREMVEKGKA-TAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAV

Query:  ETAGPPPVFRYCGDEETLDIVFPDWIGFE
        E     PV  +    E LDI++P W  +E
Subjt:  ETAGPPPVFRYCGDEETLDIVFPDWIGFE

Arabidopsis top hitse value%identityAlignment
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)1.2e-7836.34Show/hide
Query:  KKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTR
        KKS +    +   SS  NQ ++ +C +   + +     + S+R  C +YF+WIHEDL+PW   GIT+EMVE+GK TAHFRL ++ G+V+VE+YKKSIQTR
Subjt:  KKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTR

Query:  DLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYR--TAAVETAGPPPVFRYCGDEETLDIVFPDWI----------------------------
        D FT+WGILQLLR+YPG++PD++LMFDCDDRPV++S  Y      VE A PPP+FRYCGD  T+DIVFPDW                             
Subjt:  DLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYR--TAAVETAGPPPVFRYCGDEETLDIVFPDWI----------------------------

Query:  -------------------------------------------------GFESLNKV-------------------------------------------
                                                         GFE+ N                                             
Subjt:  -------------------------------------------------GFESLNKV-------------------------------------------

Query:  --------------------------ISNRTWQANAPIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRD-----GM
                                  ++N T +A   IG+ AS+F+Q++L MENVYDYMFHLLN+Y+KLL+++P++P  +VE+C+E + CP +     G+
Subjt:  --------------------------ISNRTWQANAPIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRD-----GM

Query:  EKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK
        +KKFM  S+V  P  + PCS+PPPFD+  L++ +R+  NLIRQVEKWE+ +W K
Subjt:  EKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK

AT2G45830.1 downstream target of AGL15 21.3e-6733.74Show/hide
Query:  AVPAAAIFLLATL--SAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRW
        ++  A +FL+ +L  SA +L     L        F G +  T S +    + + +P  C      NQTQ F    +     +P S   S    C  YFRW
Subjt:  AVPAAAIFLLATL--SAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRW

Query:  IHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPV
        IHEDLRPW   G+TR M+EK + TAHFR+ ++ GRVYV+ Y+KSIQTRD+FT+WGI+QLLR YPGR+PDLELMFD DDRP V+S D++    +   PPP+
Subjt:  IHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPV

Query:  FRYCGDEETLDIVFPDW-----------------IGFESLNKV------ISNRTWQAN------------------------------------------
        FRYC D+ +LDIVFPDW                 +  E  NK+      ++   W+ N                                          
Subjt:  FRYCGDEETLDIVFPDW-----------------IGFESLNKV------ISNRTWQAN------------------------------------------

Query:  -------------------------------------------------------------------------------APIGKAASDFIQQELKMENVY
                                                                                       + IG+  S FI++E+KME VY
Subjt:  -------------------------------------------------------------------------------APIGKAASDFIQQELKMENVY

Query:  DYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWN
        DYMFHL+N+YAKLL+F+PEIP GA EI  + + C   G  + FM ESMV  PS   PC MP PF+   L+ +  R  NL RQVE WE+++++
Subjt:  DYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWN

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821)3.8e-8038.82Show/hide
Query:  SPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRK-NYPTRF-----EPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVY
        S K+  +  K + LNC++ S  +     TC K NYPT F     E ES D S    C +YFRWIHEDLRPW   GITRE +E+  ATA FRLA++ GR+Y
Subjt:  SPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRK-NYPTRF-----EPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVY

Query:  VEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWI----------GFESLNKVI
        VE ++++ QTRD+FTIWG +QLLRRYPG+IPDLELMFDC D PVVK+A++  A V+   PPP+FRYC ++ETLDIVFPDW            +ESL K +
Subjt:  VEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWI----------GFESLNKVI

Query:  ---SNRT----------WQAN-------------------------------------------------------------------------------
           + RT          W+ N                                                                               
Subjt:  ---SNRT----------WQAN-------------------------------------------------------------------------------

Query:  ------------------------------------------APIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRD
                                                    IGK AS+F+QQELKM+ VYDYMFHLL QY+KLLRF+PEIP  + E+CSE +ACPRD
Subjt:  ------------------------------------------APIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRD

Query:  GMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK
        G E+KFM ES+VK P+ T PC+MPPP+D  S   + +R  +   ++E+WE+++W K
Subjt:  GMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821)1.5e-6533.53Show/hide
Query:  TVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRW
        TV  P  +I + AT+   VL  S  +   L  ++F          K+  K+ +  P  C      NQ+      +N  +R  P +   S    C  YFRW
Subjt:  TVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFEPESIDPSDRPVCSEYFRW

Query:  IHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPV
        IHEDLRPW   GITR M+E+   TAHFRL +  G+ YV+ YKKSIQTRD FT+WGILQLLR YPG++PDLELMFD DDRPVV+S D+     E   PPPV
Subjt:  IHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAVETAGPPPV

Query:  FRYCGDEETLDIVFPDWI----------------------------------------------------------------------------GFESLN
        FRYC D+ +LDIVFPDW                                                                             GF++ N
Subjt:  FRYCGDEETLDIVFPDWI----------------------------------------------------------------------------GFESLN

Query:  K----------VISNRTWQAN----------------------------------------------------------APIGKAASDFIQQELKMENVY
                    I    W  +                                                            IG+  S FI++E+ M+ VY
Subjt:  K----------VISNRTWQAN----------------------------------------------------------APIGKAASDFIQQELKMENVY

Query:  DYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK--NKP
        DYMFHLL +YA LL+F+PEIP+ A EI  +++ CP     + F  ESM+ +PS   PC M PP+D  +L+ +  R ANL RQVE WEN+++    NKP
Subjt:  DYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNK--NKP

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)3.2e-7935.44Show/hide
Query:  IWQPSL-------PKRSVTVAVPAAAIFLLATLSAAVLISST-RLQSTLFSINFLGNQTETISPKIPKKSI-------KYYPLNCSSSSTTNQTQNFTCR
        IW P +       P RS  +      + + A +S  +L+ +T  L+    +      QT+TI+PK P+ +          + L+CS++ TT      +C 
Subjt:  IWQPSL-------PKRSVTVAVPAAAIFLLATLSAAVLISST-RLQSTLFSINFLGNQTETISPKIPKKSI-------KYYPLNCSSSSTTNQTQNFTCR

Query:  KN-YP--TRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLE
         N YP  T FE +  +      C +YFRWIHEDLRPW+  GITRE +E+ K TA FRLA+VGG++YVE ++ + QTRD+FTIWG LQLLR+YPG+IPDLE
Subjt:  KN-YP--TRFEPESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLE

Query:  LMFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWI----------GFESLNKVI---SNRT----------WQAN-------------
        LMFDC D PVV++ ++  A      PPP+FRYCG+EETLDIVFPDW            +ESL K +   + RT          W+ N             
Subjt:  LMFDCDDRPVVKSADYRTAAVETAGPPPVFRYCGDEETLDIVFPDWI----------GFESLNKVI---SNRT----------WQAN-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------APIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQR
                  IGKAASDFIQQ+LKM+ VYDYM+HLL +Y+KLL+F+PEIP  AVEICSET+AC R G E+KFM ES+VK P+ + PC+MPPP+D  +   
Subjt:  --------APIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRESMVKTPSLTIPCSMPPPFDTPSLQR

Query:  LYRRNANLIRQVEKWENEFWNK
        + +R  +   ++ +WE ++W+K
Subjt:  LYRRNANLIRQVEKWENEFWNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAGATTTCCAATTTCAACTTCAATTTTGCGACCATGCCGGAGACCCATTTGGCAACCGTCTCTTCCAAAACGCTCCGTCACCGTCGCAGTTCCGGCCGCCGCCAT
CTTCCTCCTCGCCACCCTCTCCGCCGCCGTCCTCATTTCCTCCACTCGCCTCCAATCCACTCTGTTTTCCATTAATTTTCTCGGAAACCAAACAGAAACAATCTCACCAA
AAATCCCCAAAAAATCAATCAAATATTACCCACTTAACTGTTCTTCATCTTCCACCACAAACCAAACCCAGAATTTCACCTGCCGGAAAAACTACCCGACCCGATTCGAA
CCCGAATCAATCGACCCATCGGATCGGCCCGTTTGCTCAGAGTATTTCCGATGGATCCACGAGGATCTGCGGCCGTGGGCGGCGGGCGGAATCACGAGGGAGATGGTGGA
GAAAGGGAAGGCGACGGCGCATTTCCGGCTGGCGGTGGTCGGCGGGAGGGTCTACGTGGAGCACTACAAGAAATCAATTCAAACGAGGGATTTGTTTACGATTTGGGGGA
TTTTGCAGCTTCTGAGAAGGTACCCAGGGCGAATCCCTGATTTGGAGCTGATGTTCGACTGTGATGACCGACCGGTTGTTAAATCGGCCGATTACCGGACTGCGGCCGTG
GAGACGGCGGGGCCGCCACCGGTGTTCCGGTACTGCGGCGATGAGGAGACATTGGATATTGTGTTCCCGGATTGGATTGGATTCGAGAGTCTCAACAAGGTTATAAGCAA
TCGAACTTGGCAAGCCAATGCACCCATAGGGAAAGCAGCAAGTGACTTCATCCAACAAGAGTTAAAGATGGAAAATGTGTATGACTACATGTTTCATCTCCTCAACCAAT
ACGCTAAGCTCCTTCGTTTTCGACCTGAAATCCCGATCGGCGCAGTGGAAATCTGCTCCGAGACGGTGGCTTGCCCGAGAGATGGGATGGAGAAGAAGTTCATGAGAGAA
TCCATGGTGAAAACTCCCTCTCTCACCATCCCTTGCTCCATGCCACCACCCTTTGACACCCCTTCTCTCCAAAGGCTTTATAGAAGAAATGCCAACCTAATAAGGCAAGT
GGAGAAGTGGGAAAATGAATTTTGGAATAAAAATAAACCTTAA
mRNA sequenceShow/hide mRNA sequence
AAAAAGTCAACGGAAATTACCTACACAAAGCACAAAAAGTCTTCCTCGAAAATTACTTTCAAAATTTTCTTTCAGATTCGAAAGATCCCACCAGATCCAATGCAGAGATT
TCCAATTTCAACTTCAATTTTGCGACCATGCCGGAGACCCATTTGGCAACCGTCTCTTCCAAAACGCTCCGTCACCGTCGCAGTTCCGGCCGCCGCCATCTTCCTCCTCG
CCACCCTCTCCGCCGCCGTCCTCATTTCCTCCACTCGCCTCCAATCCACTCTGTTTTCCATTAATTTTCTCGGAAACCAAACAGAAACAATCTCACCAAAAATCCCCAAA
AAATCAATCAAATATTACCCACTTAACTGTTCTTCATCTTCCACCACAAACCAAACCCAGAATTTCACCTGCCGGAAAAACTACCCGACCCGATTCGAACCCGAATCAAT
CGACCCATCGGATCGGCCCGTTTGCTCAGAGTATTTCCGATGGATCCACGAGGATCTGCGGCCGTGGGCGGCGGGCGGAATCACGAGGGAGATGGTGGAGAAAGGGAAGG
CGACGGCGCATTTCCGGCTGGCGGTGGTCGGCGGGAGGGTCTACGTGGAGCACTACAAGAAATCAATTCAAACGAGGGATTTGTTTACGATTTGGGGGATTTTGCAGCTT
CTGAGAAGGTACCCAGGGCGAATCCCTGATTTGGAGCTGATGTTCGACTGTGATGACCGACCGGTTGTTAAATCGGCCGATTACCGGACTGCGGCCGTGGAGACGGCGGG
GCCGCCACCGGTGTTCCGGTACTGCGGCGATGAGGAGACATTGGATATTGTGTTCCCGGATTGGATTGGATTCGAGAGTCTCAACAAGGTTATAAGCAATCGAACTTGGC
AAGCCAATGCACCCATAGGGAAAGCAGCAAGTGACTTCATCCAACAAGAGTTAAAGATGGAAAATGTGTATGACTACATGTTTCATCTCCTCAACCAATACGCTAAGCTC
CTTCGTTTTCGACCTGAAATCCCGATCGGCGCAGTGGAAATCTGCTCCGAGACGGTGGCTTGCCCGAGAGATGGGATGGAGAAGAAGTTCATGAGAGAATCCATGGTGAA
AACTCCCTCTCTCACCATCCCTTGCTCCATGCCACCACCCTTTGACACCCCTTCTCTCCAAAGGCTTTATAGAAGAAATGCCAACCTAATAAGGCAAGTGGAGAAGTGGG
AAAATGAATTTTGGAATAAAAATAAACCTTAA
Protein sequenceShow/hide protein sequence
MQRFPISTSILRPCRRPIWQPSLPKRSVTVAVPAAAIFLLATLSAAVLISSTRLQSTLFSINFLGNQTETISPKIPKKSIKYYPLNCSSSSTTNQTQNFTCRKNYPTRFE
PESIDPSDRPVCSEYFRWIHEDLRPWAAGGITREMVEKGKATAHFRLAVVGGRVYVEHYKKSIQTRDLFTIWGILQLLRRYPGRIPDLELMFDCDDRPVVKSADYRTAAV
ETAGPPPVFRYCGDEETLDIVFPDWIGFESLNKVISNRTWQANAPIGKAASDFIQQELKMENVYDYMFHLLNQYAKLLRFRPEIPIGAVEICSETVACPRDGMEKKFMRE
SMVKTPSLTIPCSMPPPFDTPSLQRLYRRNANLIRQVEKWENEFWNKNKP