; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy2G012750 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy2G012750
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Descriptionbeta-glucosidase BoGH3B-like
Genome locationGy14Chr2:12584250..12585555
RNA-Seq ExpressionCsGy2G012750
SyntenyCsGy2G012750
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
InterPro domainsIPR002772 - Glycoside hydrolase family 3 C-terminal domain
IPR036881 - Glycoside hydrolase family 3 C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8651882.1 hypothetical protein Csa_006396 [Cucumis sativus]5.02e-146100Show/hide
Query:  MRYQLEHSYQQAWIVTHSHARILSLYNQGCLYNNGFLLKLELSSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIY
        MRYQLEHSYQQAWIVTHSHARILSLYNQGCLYNNGFLLKLELSSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIY
Subjt:  MRYQLEHSYQQAWIVTHSHARILSLYNQGCLYNNGFLLKLELSSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIY

Query:  NINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVCIVVIVSGRPLTRQQYTSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLE
        NINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVCIVVIVSGRPLTRQQYTSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLE
Subjt:  NINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVCIVVIVSGRPLTRQQYTSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLE

Query:  RTRFKTQEKDG
        RTRFKTQEKDG
Subjt:  RTRFKTQEKDG

XP_016902614.1 PREDICTED: LOW QUALITY PROTEIN: lysosomal beta glucosidase-like [Cucumis melo]7.49e-5878.26Show/hide
Query:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA
        NLTTGTTILEAVKKTVDPNTEVIYN+NPTTDY KANNFS+ I  VGETP AE KGDNLNLTI EGGSDTIQ VC     IVVIVSGR    QQY SQLDA
Subjt:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA

Query:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        L  AWLPGTEGEGVTDVL GEYGFT KL RT  KT ++
Subjt:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

XP_022146225.1 uncharacterized protein LOC111015489 [Momordica charantia]6.97e-5875.36Show/hide
Query:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA
        N TTGTTIL+AVKKTVDPNTEV+YN++PTTDY KANNFS+ IV VGE P+AE  GDNLNLTI EGGSDTIQ VC     +VVIVSGRPLT   Y SQLDA
Subjt:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA

Query:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        L AAWLPGTEGEGVTDVL G+YGFT KL RT FKT ++
Subjt:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

XP_022943425.1 uncharacterized protein LOC111448193 [Cucurbita moschata]7.80e-5671.74Show/hide
Query:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA
        NLTTGTTILEAVKK+VDPNTEV+++++PT DY KANNF++ IV VGE P+AE  GDNLNLTIPEGG DTIQ VC     +VV+VSGRPLT   Y SQLDA
Subjt:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA

Query:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        L AAWLPGTEGEGV DVL G+YGFT KL RT FKT+++
Subjt:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

XP_038900909.1 beta-glucosidase BoGH3B-like [Benincasa hispida]1.71e-5875.36Show/hide
Query:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA
        NLTTGTTILEAVKKTVDPNTE+IYN+N TTDY KANNFS+ IV VGETP+AE  GDNLNLTI EGGSDTIQ VC     +VVIVSGRPLT + + SQLDA
Subjt:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA

Query:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        L  +WLPGTEGEGVTDVL G+YGFT KL RT FKT ++
Subjt:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

TrEMBL top hitse value%identityAlignment
A0A0A0LJ56 Glyco_hydro_3_C domain-containing protein4.38e-11397.69Show/hide
Query:  KLELSSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTI
        ++  SSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTI
Subjt:  KLELSSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTI

Query:  PEGGSDTIQKVCIVVIVSGRPLTRQQYTSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEKDG
        PEGGSDTIQKVCIVVIVSGRPLTRQQYTSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEKDG
Subjt:  PEGGSDTIQKVCIVVIVSGRPLTRQQYTSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEKDG

A0A1S4E304 LOW QUALITY PROTEIN: lysosomal beta glucosidase-like3.63e-5878.26Show/hide
Query:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA
        NLTTGTTILEAVKKTVDPNTEVIYN+NPTTDY KANNFS+ I  VGETP AE KGDNLNLTI EGGSDTIQ VC     IVVIVSGR    QQY SQLDA
Subjt:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA

Query:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        L  AWLPGTEGEGVTDVL GEYGFT KL RT  KT ++
Subjt:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

A0A6A3BU67 Detected protein of confused Function1.23e-4862.32Show/hide
Query:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA
        NLT+GTTIL A+K TVDPNT+V+YN NP+ DY K+N FS+ IV VGE  +AE  GDNLNLTIPE G  TI  VC     ++V++SGRP+  Q Y S++DA
Subjt:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA

Query:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        L AAWLPGTEG+GV DVL G+YGFT KL RT FKT ++
Subjt:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

A0A6J1CWQ0 uncharacterized protein LOC1110154893.37e-5875.36Show/hide
Query:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA
        N TTGTTIL+AVKKTVDPNTEV+YN++PTTDY KANNFS+ IV VGE P+AE  GDNLNLTI EGGSDTIQ VC     +VVIVSGRPLT   Y SQLDA
Subjt:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA

Query:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        L AAWLPGTEGEGVTDVL G+YGFT KL RT FKT ++
Subjt:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

A0A6J1FXT0 uncharacterized protein LOC1114481933.77e-5671.74Show/hide
Query:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA
        NLTTGTTILEAVKK+VDPNTEV+++++PT DY KANNF++ IV VGE P+AE  GDNLNLTIPEGG DTIQ VC     +VV+VSGRPLT   Y SQLDA
Subjt:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVC-----IVVIVSGRPLTRQQYTSQLDA

Query:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        L AAWLPGTEGEGV DVL G+YGFT KL RT FKT+++
Subjt:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G62710.1 Glycosyl hydrolase family protein2.4e-2850Show/hide
Query:  GTTILEAVKKTVDPNTEVIYNINPTTDYFKAN-NFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV------CIVVIVSGRPLTRQQYTSQLDALE
        GTTILEA++K VDP TEV+Y   P  D  K + + ++ IV VGETP+AE  GD+  L I + G DT+         C+V++V+GRPL  + Y   LDAL 
Subjt:  GTTILEAVKKTVDPNTEVIYNINPTTDYFKAN-NFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV------CIVVIVSGRPLTRQQYTSQLDALE

Query:  AAWLPGTEGEGVTDVLLGEYGFTEKLERTRFK
         AWLPGTEG+GV DVL G++ FT  L RT  K
Subjt:  AAWLPGTEGEGVTDVLLGEYGFTEKLERTRFK

AT5G04885.1 Glycosyl hydrolase family protein1.1e-3350.7Show/hide
Query:  SADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGG----SDTIQKV-CIVVIVSGRPLTRQQYTS
        S +KN T GTT+L AVK  VD +TEV++  NP  ++ K+NNF++ I+AVGE P+AE  GD+  LT+ + G    S T Q V C+VV++SGRPL  + Y +
Subjt:  SADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGG----SDTIQKV-CIVVIVSGRPLTRQQYTS

Query:  QLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
         +DAL AAWLPGTEG+G+TD L G++GF+ KL  T F+  E+
Subjt:  QLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

AT5G20940.1 Glycosyl hydrolase family protein8.0e-4062.32Show/hide
Query:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYTSQLDA
        NLT GTTIL AVKKTVDP T+VIYN NP T++ KA +F + IVAVGE P+AE  GD+ NLTI E G  TI  V     C+VV+VSGRP+  Q   S +DA
Subjt:  NLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYTSQLDA

Query:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK
        L AAWLPGTEG+GV DVL G+YGFT KL RT FKT ++
Subjt:  LEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEK

AT5G20950.1 Glycosyl hydrolase family protein6.0e-3555.64Show/hide
Query:  TTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYTSQLDALE
        T GTTIL AVK TV P T+V+Y+ NP  ++ K+  F + IV VGE P+AE  GD  NLTI + G   I  V     C+VV+VSGRP+  Q Y S +DAL 
Subjt:  TTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYTSQLDALE

Query:  AAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKT
        AAWLPGTEG+GV D L G+YGFT KL RT FK+
Subjt:  AAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKT

AT5G20950.2 Glycosyl hydrolase family protein6.0e-3555.64Show/hide
Query:  TTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYTSQLDALE
        T GTTIL AVK TV P T+V+Y+ NP  ++ K+  F + IV VGE P+AE  GD  NLTI + G   I  V     C+VV+VSGRP+  Q Y S +DAL 
Subjt:  TTGTTILEAVKKTVDPNTEVIYNINPTTDYFKANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKV-----CIVVIVSGRPLTRQQYTSQLDALE

Query:  AAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKT
        AAWLPGTEG+GV D L G+YGFT KL RT FK+
Subjt:  AAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGTATCAATTAGAACACAGTTATCAGCAAGCATGGATTGTTACGCATTCACACGCCAGGATACTATCACTCTACAATCAAGGGTGTCTCTACAATAATGGTTTCCT
ACTGAAGTTGGAACTTTCTTCCTCTGTCAAAGAAAGCACCATAGATCTTAGTAGCCGGAACTCACGCCAACAATCTGCGGCGGCCGGACAATCACCTGGCAAGGACTCAG
CGGACAAAAATCTAACAACCGGAACCACCATTCTCGAGGCAGTGAAGAAAACCGTTGATCCAAACACGGAGGTCATCTACAACATAAATCCAACAACCGATTACTTCAAG
GCGAACAACTTCTCGCACGTCATTGTGGCTGTAGGAGAAACGCCGCACGCCGAGCCCAAAGGCGACAACCTAAACCTAACTATCCCCGAAGGAGGCTCGGACACGATCCA
GAAGGTGTGCATCGTTGTCATCGTCTCCGGCCGGCCTCTGACGAGGCAGCAATACACGTCACAATTGGACGCGCTGGAGGCGGCGTGGCTGCCGGGAACGGAAGGGGAAG
GCGTGACGGACGTGCTGTTGGGAGAATATGGGTTCACCGAAAAGCTGGAGAGGACGAGGTTCAAGACTCAAGAAAAAGATGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGGTATCAATTAGAACACAGTTATCAGCAAGCATGGATTGTTACGCATTCACACGCCAGGATACTATCACTCTACAATCAAGGGTGTCTCTACAATAATGGTTTCCT
ACTGAAGTTGGAACTTTCTTCCTCTGTCAAAGAAAGCACCATAGATCTTAGTAGCCGGAACTCACGCCAACAATCTGCGGCGGCCGGACAATCACCTGGCAAGGACTCAG
CGGACAAAAATCTAACAACCGGAACCACCATTCTCGAGGCAGTGAAGAAAACCGTTGATCCAAACACGGAGGTCATCTACAACATAAATCCAACAACCGATTACTTCAAG
GCGAACAACTTCTCGCACGTCATTGTGGCTGTAGGAGAAACGCCGCACGCCGAGCCCAAAGGCGACAACCTAAACCTAACTATCCCCGAAGGAGGCTCGGACACGATCCA
GAAGGTGTGCATCGTTGTCATCGTCTCCGGCCGGCCTCTGACGAGGCAGCAATACACGTCACAATTGGACGCGCTGGAGGCGGCGTGGCTGCCGGGAACGGAAGGGGAAG
GCGTGACGGACGTGCTGTTGGGAGAATATGGGTTCACCGAAAAGCTGGAGAGGACGAGGTTCAAGACTCAAGAAAAAGATGGTTAG
Protein sequenceShow/hide protein sequence
MRYQLEHSYQQAWIVTHSHARILSLYNQGCLYNNGFLLKLELSSSVKESTIDLSSRNSRQQSAAAGQSPGKDSADKNLTTGTTILEAVKKTVDPNTEVIYNINPTTDYFK
ANNFSHVIVAVGETPHAEPKGDNLNLTIPEGGSDTIQKVCIVVIVSGRPLTRQQYTSQLDALEAAWLPGTEGEGVTDVLLGEYGFTEKLERTRFKTQEKDG