; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G21780 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G21780
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionEndoglucanase
Genome locationChr1:17348415..17348879
RNA-Seq ExpressionCSPI01G21780
SyntenyCSPI01G21780
Gene Ontology termsGO:0030245 - cellulose catabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008810 - cellulase activity (molecular function)
InterPro domainsIPR001701 - Glycoside hydrolase family 9
IPR008928 - Six-hairpin glycosidase superfamily
IPR012341 - Six-hairpin glycosidase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057069.1 endoglucanase 25 [Cucumis melo var. makuwa]1.3e-6395.08Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPD FDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDT DFN
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIPKASTAP
        GK+LGID+MSIF+RIPKAS AP
Subjt:  GKDLGIDKMSIFDRIPKASTAP

KAE8653204.1 hypothetical protein Csa_019838 [Cucumis sativus]3.2e-67100Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIPKASTAP
        GKDLGIDKMSIFDRIPKASTAP
Subjt:  GKDLGIDKMSIFDRIPKASTAP

KAG6573332.1 Endoglucanase 7, partial [Cucurbita argyrosperma subsp. sororia]2.4e-4682.86Show/hide
Query:  RAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFNGKDLGIDKMSIFDRIPK
        R ASIPWDGQFYSC EGDRWLLSK  NPN+L GAMVAGPD FDHFSDDREKPWFTEP+IASNAGLVAAL+AL+DYPGD S +NGK++GID+MSIFDRIP 
Subjt:  RAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFNGKDLGIDKMSIFDRIPK

Query:  ASTAP
        AS AP
Subjt:  ASTAP

XP_022140170.1 endoglucanase 25-like [Momordica charantia]2.5e-5990.16Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVG+G NFPTHVHHR ASIP DGQFYSCAEGDRWLLSKASNPNILSGA+V GPD FDHFSDDR KPWFTEPSIASNAGLVAALVAL+DYPGDTSDFN
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIPKASTAP
        GKDLGID+MSIFDRIP AS AP
Subjt:  GKDLGIDKMSIFDRIPKASTAP

XP_031745535.1 endoglucanase 9-like [Cucumis sativus]3.2e-67100Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIPKASTAP
        GKDLGIDKMSIFDRIPKASTAP
Subjt:  GKDLGIDKMSIFDRIPKASTAP

TrEMBL top hitse value%identityAlignment
A0A0A0LV18 Cellulase1.6e-67100Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIPKASTAP
        GKDLGIDKMSIFDRIPKASTAP
Subjt:  GKDLGIDKMSIFDRIPKASTAP

A0A5A7UU46 Endoglucanase6.1e-6495.08Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPD FDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDT DFN
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIPKASTAP
        GK+LGID+MSIF+RIPKAS AP
Subjt:  GKDLGIDKMSIFDRIPKASTAP

A0A5N5MQK3 Endoglucanase1.1e-4170.69Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPG-DTSDF
        MSY+VG+GN +PTHVHHRAASIPWD Q YSC EGDRWL SK  NPNIL GAMVAGPD FD+F DDR+KPWFTEP+IASNAGLVAA +AL+D P   +SD 
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPG-DTSDF

Query:  NGKDLGIDKMSIFDRI
        N  +LGID   IF+ +
Subjt:  NGKDLGIDKMSIFDRI

A0A6J1CED2 Endoglucanase1.2e-5990.16Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVG+G NFPTHVHHR ASIP DGQFYSCAEGDRWLLSKASNPNILSGA+V GPD FDHFSDDR KPWFTEPSIASNAGLVAALVAL+DYPGDTSDFN
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIPKASTAP
        GKDLGID+MSIFDRIP AS AP
Subjt:  GKDLGIDKMSIFDRIPKASTAP

A0A6N2LST2 Endoglucanase5.0e-4271.55Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPG-DTSDF
        MSY+VG+GN +PTHVHHRAASIPWD Q YSC EGDRWL SK  NPNIL GAMVAGPD FD+F DDR+KPWFTEP+IASNAGLVAAL+AL+D P   +SD 
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPG-DTSDF

Query:  NGKDLGIDKMSIFDRI
        N  +LGID   IF+ +
Subjt:  NGKDLGIDKMSIFDRI

SwissProt top hitse value%identityAlignment
P0C1U4 Endoglucanase 92.6e-2750Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVGYGN +P  VHHR ASIP +G  Y C  G +W  +K  NPNI+ GAMVAGPD  D F D R+   +TE ++A NAGLVAALVAL          +
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIPKASTAP
        G+  G+DK ++F  +P    +P
Subjt:  GKDLGIDKMSIFDRIPKASTAP

Q38890 Endoglucanase 253.2e-2550Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVG+G  +P HVHHR ASIP +   Y+C  G +W  SK  NPN + GAMVAGPD  D + D R    +TEP++A NAGLVAALVAL+       +  
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIPKASTAP
        GK   IDK +IF  +P     P
Subjt:  GKDLGIDKMSIFDRIPKASTAP

Q7XUK4 Endoglucanase 128.6e-2346.22Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVGYG  +P  +HHR AS P +G  YSC  G +W  +K ++PN+L GAMV GPD  D F D R      EP++  NAGLVAALVAL +        +
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLG---IDKMSIFDRIP
        G+  G   +DK ++F  +P
Subjt:  GKDLG---IDKMSIFDRIP

Q84R49 Endoglucanase 103.7e-2648.36Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVG+GN +P   HHR ASIP +G  Y C  G +W  +K  NPNIL GA+VAGPD  D F D R    +TEP++A+NAGLVAAL++L +    +    
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIPKASTAP
            GIDK +IF  +P     P
Subjt:  GKDLGIDKMSIFDRIPKASTAP

Q9STW8 Endoglucanase 212.3e-2350.86Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVGYG  +P  VHHR ASIP      +C  G +W  SK +NPN ++GAMVAGPD  D F D R    +TEP++A NAGLVAALVAL+   G+ +   
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIP
            GIDK ++F  +P
Subjt:  GKDLGIDKMSIFDRIP

Arabidopsis top hitse value%identityAlignment
AT1G64390.1 glycosyl hydrolase 9C22.9e-1845.1Show/hide
Query:  SYVVGYGNNFPTHVHHRAASI---PWDGQFYSCAEG-DRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTS
        SY+VGYGNNFP  VHHR +SI     D  F +C  G   W   K S+PN+L+GA+V GPD +D+F+D R+    TEP+  +NA L+  L  L+      S
Subjt:  SYVVGYGNNFPTHVHHRAASI---PWDGQFYSCAEG-DRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTS

Query:  DF
         F
Subjt:  DF

AT1G65610.1 Six-hairpin glycosidases superfamily protein5.2e-2346.55Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVG+G  FP  VHHR A+IP D +  SC EG ++  +K  NPN ++GAMV GP+ FD F D R     +EP+++ NAGLVAALV+L    G      
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIP
             IDK ++F+ +P
Subjt:  GKDLGIDKMSIFDRIP

AT4G11050.1 glycosyl hydrolase 9C31.5e-1745.65Show/hide
Query:  SYVVGYGNNFPTHVHHRAASI---PWDGQFYSCAEG-DRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVAL
        SY+VGYG N+P  VHHR +SI     D +F +C  G   W   K S+PN+L+GA+V GPD +D+F+D R+    TEP+  +NA L+  L  L
Subjt:  SYVVGYGNNFPTHVHHRAASI---PWDGQFYSCAEG-DRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVAL

AT4G24260.1 glycosyl hydrolase 9A31.6e-2450.86Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVGYG  +P  VHHR ASIP      +C  G +W  SK +NPN ++GAMVAGPD  D F D R    +TEP++A NAGLVAALVAL+   G+ +   
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIP
            GIDK ++F  +P
Subjt:  GKDLGIDKMSIFDRIP

AT5G49720.1 glycosyl hydrolase 9A12.2e-2650Show/hide
Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN
        MSYVVG+G  +P HVHHR ASIP +   Y+C  G +W  SK  NPN + GAMVAGPD  D + D R    +TEP++A NAGLVAALVAL+       +  
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFN

Query:  GKDLGIDKMSIFDRIPKASTAP
        GK   IDK +IF  +P     P
Subjt:  GKDLGIDKMSIFDRIPKASTAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTACGTAGTTGGCTATGGAAACAATTTCCCCACCCATGTCCACCACAGAGCTGCTTCAATTCCTTGGGATGGTCAGTTCTATTCATGTGCTGAAGGAGATAGATG
GCTGTTATCAAAGGCTTCAAATCCAAATATTCTTTCCGGAGCCATGGTGGCTGGTCCAGACATGTTTGACCATTTCTCAGATGATAGGGAAAAACCTTGGTTTACTGAAC
CAAGCATAGCAAGCAATGCAGGTTTGGTCGCAGCACTTGTTGCTCTAAATGATTATCCAGGCGACACCTCAGATTTTAATGGAAAGGATTTAGGCATAGATAAGATGTCA
ATCTTTGATAGAATCCCCAAGGCTTCTACAGCTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTACGTAGTTGGCTATGGAAACAATTTCCCCACCCATGTCCACCACAGAGCTGCTTCAATTCCTTGGGATGGTCAGTTCTATTCATGTGCTGAAGGAGATAGATG
GCTGTTATCAAAGGCTTCAAATCCAAATATTCTTTCCGGAGCCATGGTGGCTGGTCCAGACATGTTTGACCATTTCTCAGATGATAGGGAAAAACCTTGGTTTACTGAAC
CAAGCATAGCAAGCAATGCAGGTTTGGTCGCAGCACTTGTTGCTCTAAATGATTATCCAGGCGACACCTCAGATTTTAATGGAAAGGATTTAGGCATAGATAAGATGTCA
ATCTTTGATAGAATCCCCAAGGCTTCTACAGCTCCTTGATAGTCCTAAGATATATATTTTTTCTCACAAGAACAGAAATGTGAATTAAGAAATCTTCACTTTGTGATATC
CCCACCCCGAAATCTTTCTTACAAG
Protein sequenceShow/hide protein sequence
MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDMFDHFSDDREKPWFTEPSIASNAGLVAALVALNDYPGDTSDFNGKDLGIDKMS
IFDRIPKASTAP