; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021306 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021306
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionEndoglucanase
Genome locationChr05:7536385..7536732
RNA-Seq ExpressionHG10021306
SyntenyHG10021306
Gene Ontology termsGO:0030245 - cellulose catabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008810 - cellulase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057069.1 endoglucanase 25 [Cucumis melo var. makuwa]2.2e-2578.82Show/hide
Query:  TRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVNQALKF
        TR NFNV SKSS +Y+SIPSPYSKS DFKIVISN++RFK  SYISA LLLLIIALTLLL FLPHKHNLHEASNNYTVA+  A ++
Subjt:  TRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVNQALKF

KAE8653204.1 hypothetical protein Csa_019838 [Cucumis sativus]5.5e-3779.05Show/hide
Query:  MMQPVRSIHTEHEADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVN
        MMQP RS+HTEHEADR LS+TRLNFN  S+SSI+Y+SIPSPYSKS DFKIVISN++RFKW SYISA LLLLI+ALTLLL FLPHKHNLHEASNNYTVA+ 
Subjt:  MMQPVRSIHTEHEADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVN

Query:  QALKF
         A ++
Subjt:  QALKF

KAG6573332.1 Endoglucanase 7, partial [Cucurbita argyrosperma subsp. sororia]3.0e-3572.41Show/hide
Query:  MMQPVRSIHTEH-EADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAV
        MMQP R +H EH EAD L+S+ R++FNVSS S++ YDSI SPY KSIDFKIVISNR RFKW SYISAF+LL+IIAL LLL+FLPHKH+ HEASNN+TVAV
Subjt:  MMQPVRSIHTEH-EADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAV

Query:  NQALKFFDAQKCNSPP
        +QAL+FFDAQK    P
Subjt:  NQALKFFDAQKCNSPP

KAG7012497.1 Endoglucanase 7, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-3573.5Show/hide
Query:  MMQPVRSIHTEH-EADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAV
        MMQP R +H EH EAD L+S+ R++FNVSS S++ YDSI SPY KSIDFKIVISNR RFKW SYISAF+LL+IIAL LLL+FLPHKH+ HEASNN+TVAV
Subjt:  MMQPVRSIHTEH-EADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAV

Query:  NQALKFFDAQKC--NSP
        +QAL+FFDAQK   NSP
Subjt:  NQALKFFDAQKC--NSP

XP_022140170.1 endoglucanase 25-like [Momordica charantia]2.4e-4077.39Show/hide
Query:  MMQPVRSIHTEHEADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVN
        MMQPVR +H EHEA+RLLSSTRL+ NVS ++SI YDSIPSPYSKS DFK+VISNR RF+W SYISA LLLLIIA++ LLHFLPHKHN HEASNN+TVA+N
Subjt:  MMQPVRSIHTEHEADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVN

Query:  QALKFFDAQKCNSPP
        QALKFFDAQK    P
Subjt:  QALKFFDAQKCNSPP

TrEMBL top hitse value%identityAlignment
A0A0A0M069 Uncharacterized protein1.1e-4684.35Show/hide
Query:  MMQPVRSIHTEHEADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVN
        MMQP RS+HTEHEADR LS+TRLNFN  S+SSI+Y+SIPSPYSKS DFKIVISN++RFKW SYISA LLLLI+ALTLLL FLPHKHNLHEASNNYTVAVN
Subjt:  MMQPVRSIHTEHEADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVN

Query:  QALKFFDAQKCNSPP
        QALKFFDAQKC +PP
Subjt:  QALKFFDAQKCNSPP

A0A1R3H9N2 Cellulase2.4e-2256.3Show/hide
Query:  QPVRSIHTEHEADRLL------SSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYT
        Q VR +HT  EA RLL      +S +LN +V  +SSI YDS+PS  SKS D+K+VI+++  +K F YIS  L LLI+AL LLLHFLPHK++ H AS N T
Subjt:  QPVRSIHTEHEADRLL------SSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYT

Query:  VAVNQALKFFDAQKCNSPP
        VAVNQA+ FFDAQK    P
Subjt:  VAVNQALKFFDAQKCNSPP

A0A5A7UU46 Endoglucanase1.1e-2578.82Show/hide
Query:  TRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVNQALKF
        TR NFNV SKSS +Y+SIPSPYSKS DFKIVISN++RFK  SYISA LLLLIIALTLLL FLPHKHNLHEASNNYTVA+  A ++
Subjt:  TRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVNQALKF

A0A6J1CED2 Endoglucanase1.2e-4077.39Show/hide
Query:  MMQPVRSIHTEHEADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVN
        MMQPVR +H EHEA+RLLSSTRL+ NVS ++SI YDSIPSPYSKS DFK+VISNR RF+W SYISA LLLLIIA++ LLHFLPHKHN HEASNN+TVA+N
Subjt:  MMQPVRSIHTEHEADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVN

Query:  QALKFFDAQKCNSPP
        QALKFFDAQK    P
Subjt:  QALKFFDAQKCNSPP

W9RWH0 Cellulase1.3e-2355.17Show/hide
Query:  QPVRSIHTEHEADRLL------SSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYT
        +PV  +HT  EA RLL      +S  L+F +  +SS  YDS+PS +SKS D+ +VI+N+   K F+YISA  +L+I+AL LLLHFLPHKH  H +S N T
Subjt:  QPVRSIHTEHEADRLL------SSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYT

Query:  VAVNQALKFFDAQKCN
        +AVNQAL FFDAQKCN
Subjt:  VAVNQALKFFDAQKCN

SwissProt top hitse value%identityAlignment
Q9STW8 Endoglucanase 211.7e-0436.11Show/hide
Query:  KSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVNQALKFFDAQKCNSPP
        K +D   ++ +RK F W         LL   +TL++  LPH H+     +NYT+A+  ALKFF+AQ+    P
Subjt:  KSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVNQALKFFDAQKCNSPP

Arabidopsis top hitse value%identityAlignment
AT4G24260.1 glycosyl hydrolase 9A31.2e-0536.11Show/hide
Query:  KSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVNQALKFFDAQKCNSPP
        K +D   ++ +RK F W         LL   +TL++  LPH H+     +NYT+A+  ALKFF+AQ+    P
Subjt:  KSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVNQALKFFDAQKCNSPP

AT5G49720.1 glycosyl hydrolase 9A17.8e-0536.11Show/hide
Query:  KSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVNQALKFFDAQKCNSPP
        K +D   +I +RK F W         LL   +TL++  +P  H      +NYT+A+++ALKFF+AQK    P
Subjt:  KSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVNQALKFFDAQKCNSPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCAACCGGTGAGGTCCATTCACACAGAACATGAAGCAGATCGTCTTCTTAGCTCGACACGACTTAACTTCAATGTCTCTTCTAAATCTTCCATTGATTATGATTC
GATTCCTTCTCCATACTCCAAGTCCATCGACTTCAAGATTGTTATATCGAATAGAAAACGTTTCAAATGGTTTTCTTACATTTCAGCTTTCCTGCTACTTCTAATCATAG
CACTTACACTTCTCCTTCACTTTTTGCCTCACAAACATAATCTCCATGAAGCCTCAAACAATTACACAGTTGCAGTAAATCAAGCACTAAAGTTTTTCGATGCTCAAAAA
TGTAATAGTCCCCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGCAACCGGTGAGGTCCATTCACACAGAACATGAAGCAGATCGTCTTCTTAGCTCGACACGACTTAACTTCAATGTCTCTTCTAAATCTTCCATTGATTATGATTC
GATTCCTTCTCCATACTCCAAGTCCATCGACTTCAAGATTGTTATATCGAATAGAAAACGTTTCAAATGGTTTTCTTACATTTCAGCTTTCCTGCTACTTCTAATCATAG
CACTTACACTTCTCCTTCACTTTTTGCCTCACAAACATAATCTCCATGAAGCCTCAAACAATTACACAGTTGCAGTAAATCAAGCACTAAAGTTTTTCGATGCTCAAAAA
TGTAATAGTCCCCCTTAA
Protein sequenceShow/hide protein sequence
MMQPVRSIHTEHEADRLLSSTRLNFNVSSKSSIDYDSIPSPYSKSIDFKIVISNRKRFKWFSYISAFLLLLIIALTLLLHFLPHKHNLHEASNNYTVAVNQALKFFDAQK
CNSPP