; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025036 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025036
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionEndoglucanase
Genome locationchr10:7948661..7949035
RNA-Seq ExpressionLag0025036
SyntenyLag0025036
Gene Ontology termsGO:0030245 - cellulose catabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008810 - cellulase activity (molecular function)
InterPro domainsIPR001701 - Glycoside hydrolase family 9
IPR008928 - Six-hairpin glycosidase superfamily
IPR012341 - Six-hairpin glycosidase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057069.1 endoglucanase 25 [Cucumis melo var. makuwa]5.9e-6189.52Show/hide
Query:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG
        +KMSYVVGYGN++PTHVHHR ASIPWDGQFYSCAEGDRWLLSK SNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAAL+AL+DYPGDT  
Subjt:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG

Query:  LNGKNLGIDQMSIFDRIPMASVAP
         NGKNLGIDQMSIF+RIP AS AP
Subjt:  LNGKNLGIDQMSIFDRIPMASVAP

KAE8653204.1 hypothetical protein Csa_019838 [Cucumis sativus]1.0e-6088.71Show/hide
Query:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG
        +KMSYVVGYGN++PTHVHHR ASIPWDGQFYSCAEGDRWLLSK SNPNILSGAMVAGPD FDHFSDDREKPWFTEPSIASNAGLVAAL+AL+DYPGDTS 
Subjt:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG

Query:  LNGKNLGIDQMSIFDRIPMASVAP
         NGK+LGID+MSIFDRIP AS AP
Subjt:  LNGKNLGIDQMSIFDRIPMASVAP

KAG6573332.1 Endoglucanase 7, partial [Cucurbita argyrosperma subsp. sororia]1.2e-5090.48Show/hide
Query:  RGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGLNGKNLGIDQMSIFDRIPM
        RGASIPWDGQFYSC EGDRWLLSKG NPN+L GAMVAGPDKFDHFSDDREKPWFTEP+IASNAGLVAALIALHDYPGD S  NGKN+GIDQMSIFDRIPM
Subjt:  RGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGLNGKNLGIDQMSIFDRIPM

Query:  ASVAP
        AS+AP
Subjt:  ASVAP

XP_022140170.1 endoglucanase 25-like [Momordica charantia]1.7e-6089.52Show/hide
Query:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG
        MKMSYVVG+G ++PTHVHHRGASIP DGQFYSCAEGDRWLLSK SNPNILSGA+V GPDKFDHFSDDR KPWFTEPSIASNAGLVAAL+ALHDYPGDTS 
Subjt:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG

Query:  LNGKNLGIDQMSIFDRIPMASVAP
         NGK+LGIDQMSIFDRIPMASVAP
Subjt:  LNGKNLGIDQMSIFDRIPMASVAP

XP_031745535.1 endoglucanase 9-like [Cucumis sativus]1.0e-6088.71Show/hide
Query:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG
        +KMSYVVGYGN++PTHVHHR ASIPWDGQFYSCAEGDRWLLSK SNPNILSGAMVAGPD FDHFSDDREKPWFTEPSIASNAGLVAAL+AL+DYPGDTS 
Subjt:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG

Query:  LNGKNLGIDQMSIFDRIPMASVAP
         NGK+LGID+MSIFDRIP AS AP
Subjt:  LNGKNLGIDQMSIFDRIPMASVAP

TrEMBL top hitse value%identityAlignment
A0A0A0LV18 Cellulase3.2e-6089.34Show/hide
Query:  MSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGLN
        MSYVVGYGN++PTHVHHR ASIPWDGQFYSCAEGDRWLLSK SNPNILSGAMVAGPD FDHFSDDREKPWFTEPSIASNAGLVAAL+AL+DYPGDTS  N
Subjt:  MSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGLN

Query:  GKNLGIDQMSIFDRIPMASVAP
        GK+LGID+MSIFDRIP AS AP
Subjt:  GKNLGIDQMSIFDRIPMASVAP

A0A2P5WI96 Endoglucanase1.4e-4471.67Show/hide
Query:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL
        KMSY+VG+G+ YPTHVHHR ASIPWDGQ++SCAEGDRWL S+  NPN+L GAMVAGPD+FD FSD+REK WFTEPSIA NAGLVAALIA HD P  +SG 
Subjt:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL

Query:  NGKNLGIDQMSIFDRIPMAS
        NG NLG+D M IF+++ + S
Subjt:  NGKNLGIDQMSIFDRIPMAS

A0A5A7UU46 Endoglucanase2.9e-6189.52Show/hide
Query:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG
        +KMSYVVGYGN++PTHVHHR ASIPWDGQFYSCAEGDRWLLSK SNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAAL+AL+DYPGDT  
Subjt:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG

Query:  LNGKNLGIDQMSIFDRIPMASVAP
         NGKNLGIDQMSIF+RIP AS AP
Subjt:  LNGKNLGIDQMSIFDRIPMASVAP

A0A6J1CED2 Endoglucanase8.3e-6189.52Show/hide
Query:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG
        MKMSYVVG+G ++PTHVHHRGASIP DGQFYSCAEGDRWLLSK SNPNILSGA+V GPDKFDHFSDDR KPWFTEPSIASNAGLVAAL+ALHDYPGDTS 
Subjt:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG

Query:  LNGKNLGIDQMSIFDRIPMASVAP
         NGK+LGIDQMSIFDRIPMASVAP
Subjt:  LNGKNLGIDQMSIFDRIPMASVAP

A0A6N2LST2 Endoglucanase6.4e-4571.77Show/hide
Query:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPG-DTS
        MKMSY+VG+GN YPTHVHHR ASIPWD Q YSC EGDRWL SK  NPNIL GAMVAGPDKFD+F DDR+KPWFTEP+IASNAGLVAALIALHD P   +S
Subjt:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPG-DTS

Query:  GLNGKNLGIDQMSIFDRIPMASVA
          N  NLGID   IF+ + +   A
Subjt:  GLNGKNLGIDQMSIFDRIPMASVA

SwissProt top hitse value%identityAlignment
P0C1U4 Endoglucanase 91.8e-2850Show/hide
Query:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG
        +KMSYVVGYGN YP  VHHRGASIP +G  Y C  G +W  +K  NPNI+ GAMVAGPD+ D F D R+   +TE ++A NAGLVAAL+A          
Subjt:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG

Query:  LNGKNLGIDQMSIFDRIPMASVAP
        L+G+  G+D+ ++F  +P    +P
Subjt:  LNGKNLGIDQMSIFDRIPMASVAP

Q38890 Endoglucanase 253.8e-2652.99Show/hide
Query:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL
        KMSYVVG+G  YP HVHHRGASIP +   Y+C  G +W  SK  NPN + GAMVAGPDK D + D R    +TEP++A NAGLVAAL+AL       SG 
Subjt:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL

Query:  NGKNLGIDQMSIFDRIP
              ID+ +IF  +P
Subjt:  NGKNLGIDQMSIFDRIP

Q7XUK4 Endoglucanase 121.1e-2549.57Show/hide
Query:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL
        KMSYVVGYG  YP  +HHRGAS P +G  YSC  G +W  +KG++PN+L GAMV GPDK D F D R      EP++  NAGLVAAL+AL       SG 
Subjt:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL

Query:  NGKNLGIDQMSIFDRIP
              +D+ ++F  +P
Subjt:  NGKNLGIDQMSIFDRIP

Q84R49 Endoglucanase 101.5e-2752.54Show/hide
Query:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG
        +KMSYVVG+GN YP   HHRGASIP +G  Y C  G +W  +K  NPNIL GA+VAGPD+ D F D R    +TEP++A+NAGLVAALI+L       + 
Subjt:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG

Query:  LNGKNLGIDQMSIFDRIP
        ++ K+ GID+ +IF  +P
Subjt:  LNGKNLGIDQMSIFDRIP

Q9STW8 Endoglucanase 214.2e-2552.99Show/hide
Query:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL
        KMSYVVGYG  YP  VHHRGASIP      +C  G +W  SK +NPN ++GAMVAGPDK D F D R    +TEP++A NAGLVAAL+AL       SG 
Subjt:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL

Query:  NGKNLGIDQMSIFDRIP
             GID+ ++F  +P
Subjt:  NGKNLGIDQMSIFDRIP

Arabidopsis top hitse value%identityAlignment
AT1G64390.1 glycosyl hydrolase 9C25.1e-1847.83Show/hide
Query:  SYVVGYGNSYPTHVHHRGASI---PWDGQFYSCAEG-DRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIAL
        SY+VGYGN++P  VHHRG+SI     D  F +C  G   W   KGS+PN+L+GA+V GPD +D+F+D R+    TEP+  +NA L+  L  L
Subjt:  SYVVGYGNSYPTHVHHRGASI---PWDGQFYSCAEG-DRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIAL

AT1G65610.1 Six-hairpin glycosidases superfamily protein3.6e-2445.76Show/hide
Query:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG
        +KMSYVVG+G  +P  VHHRGA+IP D +  SC EG ++  +K  NPN ++GAMV GP+KFD F D R     +EP+++ NAGLVAAL++L    G    
Subjt:  MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSG

Query:  LNGKNLGIDQMSIFDRIP
               ID+ ++F+ +P
Subjt:  LNGKNLGIDQMSIFDRIP

AT4G11050.1 glycosyl hydrolase 9C32.3e-1847.83Show/hide
Query:  SYVVGYGNSYPTHVHHRGASI---PWDGQFYSCAEG-DRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIAL
        SY+VGYG +YP  VHHRG+SI     D +F +C  G   W   KGS+PN+L+GA+V GPD +D+F+D R+    TEP+  +NA L+  L  L
Subjt:  SYVVGYGNSYPTHVHHRGASI---PWDGQFYSCAEG-DRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIAL

AT4G24260.1 glycosyl hydrolase 9A33.0e-2652.99Show/hide
Query:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL
        KMSYVVGYG  YP  VHHRGASIP      +C  G +W  SK +NPN ++GAMVAGPDK D F D R    +TEP++A NAGLVAAL+AL       SG 
Subjt:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL

Query:  NGKNLGIDQMSIFDRIP
             GID+ ++F  +P
Subjt:  NGKNLGIDQMSIFDRIP

AT5G49720.1 glycosyl hydrolase 9A12.7e-2752.99Show/hide
Query:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL
        KMSYVVG+G  YP HVHHRGASIP +   Y+C  G +W  SK  NPN + GAMVAGPDK D + D R    +TEP++A NAGLVAAL+AL       SG 
Subjt:  KMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGL

Query:  NGKNLGIDQMSIFDRIP
              ID+ +IF  +P
Subjt:  NGKNLGIDQMSIFDRIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATGAGCTATGTAGTTGGTTACGGAAACAGTTATCCCACCCATGTCCACCACAGGGGTGCCTCAATTCCTTGGGATGGTCAATTCTATTCATGTGCCGAAGGAGA
TAGATGGTTGCTATCTAAGGGTTCAAATCCAAATATTCTTTCCGGAGCCATGGTGGCAGGACCAGACAAGTTTGATCACTTCTCAGACGATAGGGAAAAACCTTGGTTTA
CTGAACCAAGCATAGCAAGCAATGCAGGTTTAGTTGCAGCGCTCATTGCTCTACATGATTATCCAGGAGATACTTCAGGTTTGAATGGAAAAAATTTAGGCATAGATCAG
ATGTCAATCTTTGATAGAATCCCTATGGCTTCTGTAGCTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATGAGCTATGTAGTTGGTTACGGAAACAGTTATCCCACCCATGTCCACCACAGGGGTGCCTCAATTCCTTGGGATGGTCAATTCTATTCATGTGCCGAAGGAGA
TAGATGGTTGCTATCTAAGGGTTCAAATCCAAATATTCTTTCCGGAGCCATGGTGGCAGGACCAGACAAGTTTGATCACTTCTCAGACGATAGGGAAAAACCTTGGTTTA
CTGAACCAAGCATAGCAAGCAATGCAGGTTTAGTTGCAGCGCTCATTGCTCTACATGATTATCCAGGAGATACTTCAGGTTTGAATGGAAAAAATTTAGGCATAGATCAG
ATGTCAATCTTTGATAGAATCCCTATGGCTTCTGTAGCTCCTTGA
Protein sequenceShow/hide protein sequence
MKMSYVVGYGNSYPTHVHHRGASIPWDGQFYSCAEGDRWLLSKGSNPNILSGAMVAGPDKFDHFSDDREKPWFTEPSIASNAGLVAALIALHDYPGDTSGLNGKNLGIDQ
MSIFDRIPMASVAP