; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy1G021445 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy1G021445
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionEndoglucanase
Genome locationGy14Chr1:20171143..20171490
RNA-Seq ExpressionCsGy1G021445
SyntenyCsGy1G021445
Gene Ontology termsGO:0030245 - cellulose catabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008810 - cellulase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057069.1 endoglucanase 25 [Cucumis melo var. makuwa]5.68e-3785.88Show/hide
Query:  TRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVNQALKF
        TR NFN FS+SS EYNSIPSPYSKSFDFKIVISNQRRFK CSYISALLLLLI+ALTLLLQFLPHKHNLHEASNNYTVA+  A ++
Subjt:  TRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVNQALKF

KAE8653204.1 hypothetical protein Csa_019838 [Cucumis sativus]2.48e-5794.29Show/hide
Query:  MMQPERSVHTEHEADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVN
        MMQPERSVHTEHEADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVA+ 
Subjt:  MMQPERSVHTEHEADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVN

Query:  QALKF
         A ++
Subjt:  QALKF

KAG6573332.1 Endoglucanase 7, partial [Cucurbita argyrosperma subsp. sororia]7.77e-4268.97Show/hide
Query:  MMQPERSVHTEH-EADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAV
        MMQPER VH EH EAD  +ST R++FN  S S++ Y+SI SPY KS DFKIVISN+ RFKWCSYISA +LL+I+AL LLL FLPHKH+ HEASNN+TVAV
Subjt:  MMQPERSVHTEH-EADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAV

Query:  NQALKFFDAQKCRNPP
        +QAL+FFDAQK    P
Subjt:  NQALKFFDAQKCRNPP

KAG7012497.1 Endoglucanase 7, partial [Cucurbita argyrosperma subsp. argyrosperma]2.27e-4271.17Show/hide
Query:  MMQPERSVHTEH-EADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAV
        MMQPER VH EH EAD  +ST R++FN  S S++ Y+SI SPY KS DFKIVISN+ RFKWCSYISA +LL+I+AL LLL FLPHKH+ HEASNN+TVAV
Subjt:  MMQPERSVHTEH-EADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAV

Query:  NQALKFFDAQK
        +QAL+FFDAQK
Subjt:  NQALKFFDAQK

XP_022140170.1 endoglucanase 25-like [Momordica charantia]2.37e-4673.04Show/hide
Query:  MMQPERSVHTEHEADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVN
        MMQP R VH EHEA+R LS+TRL+ N    +SI Y+SIPSPYSKSFDFK+VISN+ RF+WCSYISALLLLLI+A++ LL FLPHKHN HEASNN+TVA+N
Subjt:  MMQPERSVHTEHEADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVN

Query:  QALKFFDAQKCRNPP
        QALKFFDAQK    P
Subjt:  QALKFFDAQKCRNPP

TrEMBL top hitse value%identityAlignment
A0A0A0M069 Uncharacterized protein4.75e-77100Show/hide
Query:  MMQPERSVHTEHEADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVN
        MMQPERSVHTEHEADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVN
Subjt:  MMQPERSVHTEHEADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVN

Query:  QALKFFDAQKCRNPP
        QALKFFDAQKCRNPP
Subjt:  QALKFFDAQKCRNPP

A0A3N7F0M3 Cellulase5.03e-2747.9Show/hide
Query:  QPERSVHTEHEADRFL------STTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYT
        QP R VHT  EA R L      ++  L+F+   +S+  Y+S+PS YSKSFD+++VI++++ FK   Y+S L++ +++A+ LL+QFLPHKH  H  S N T
Subjt:  QPERSVHTEHEADRFL------STTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYT

Query:  VAVNQALKFFDAQKCRNPP
        +A+NQAL FFDAQK  N P
Subjt:  VAVNQALKFFDAQKCRNPP

A0A5A7UU46 Endoglucanase2.75e-3785.88Show/hide
Query:  TRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVNQALKF
        TR NFN FS+SS EYNSIPSPYSKSFDFKIVISNQRRFK CSYISALLLLLI+ALTLLLQFLPHKHNLHEASNNYTVA+  A ++
Subjt:  TRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVNQALKF

A0A6J1CED2 Endoglucanase1.15e-4673.04Show/hide
Query:  MMQPERSVHTEHEADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVN
        MMQP R VH EHEA+R LS+TRL+ N    +SI Y+SIPSPYSKSFDFK+VISN+ RF+WCSYISALLLLLI+A++ LL FLPHKHN HEASNN+TVA+N
Subjt:  MMQPERSVHTEHEADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVN

Query:  QALKFFDAQKCRNPP
        QALKFFDAQK    P
Subjt:  QALKFFDAQKCRNPP

W9RWH0 Cellulase2.66e-2651.3Show/hide
Query:  QPERSVHTEHEADRFL------STTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYT
        +P   VHT  EA R L      ++  L+F    +SS  Y+S+PS +SKSFD+ +VI+N+   K  +YISA  +L+I+AL LLL FLPHKH  H +S N T
Subjt:  QPERSVHTEHEADRFL------STTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYT

Query:  VAVNQALKFFDAQKC
        +AVNQAL FFDAQKC
Subjt:  VAVNQALKFFDAQKC

SwissProt top hitse value%identityAlignment
Q9STW8 Endoglucanase 213.8e-0433.33Show/hide
Query:  KSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVNQALKFFDAQKCRNPP
        K  D   ++ +++ F W      +  LL   +TL+++ LPH H+     +NYT+A+  ALKFF+AQ+    P
Subjt:  KSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVNQALKFFDAQKCRNPP

Arabidopsis top hitse value%identityAlignment
AT4G24260.1 glycosyl hydrolase 9A32.7e-0533.33Show/hide
Query:  KSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVNQALKFFDAQKCRNPP
        K  D   ++ +++ F W      +  LL   +TL+++ LPH H+     +NYT+A+  ALKFF+AQ+    P
Subjt:  KSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVNQALKFFDAQKCRNPP

AT5G49720.1 glycosyl hydrolase 9A12.3e-0433.33Show/hide
Query:  KSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVNQALKFFDAQKCRNPP
        K  D   +I +++ F W         LL   +TL+++ +P  H      +NYT+A+++ALKFF+AQK    P
Subjt:  KSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVNQALKFFDAQKCRNPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCAACCAGAGAGATCTGTCCACACAGAACATGAAGCAGATCGTTTTCTTAGCACGACACGACTTAACTTCAATGCCTTTTCTGAATCTTCTATTGAATATAATTC
AATTCCTTCACCATACTCCAAGTCCTTTGACTTCAAGATCGTTATATCAAATCAAAGACGTTTCAAATGGTGTTCTTACATTTCAGCTTTGCTGCTACTGCTAATCATGG
CACTTACACTTCTCCTTCAGTTTTTGCCTCACAAACATAATCTTCACGAAGCATCAAACAATTACACAGTTGCAGTAAATCAAGCACTAAAGTTTTTTGATGCTCAAAAA
TGTAGAAATCCTCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGCAACCAGAGAGATCTGTCCACACAGAACATGAAGCAGATCGTTTTCTTAGCACGACACGACTTAACTTCAATGCCTTTTCTGAATCTTCTATTGAATATAATTC
AATTCCTTCACCATACTCCAAGTCCTTTGACTTCAAGATCGTTATATCAAATCAAAGACGTTTCAAATGGTGTTCTTACATTTCAGCTTTGCTGCTACTGCTAATCATGG
CACTTACACTTCTCCTTCAGTTTTTGCCTCACAAACATAATCTTCACGAAGCATCAAACAATTACACAGTTGCAGTAAATCAAGCACTAAAGTTTTTTGATGCTCAAAAA
TGTAGAAATCCTCCTTAA
Protein sequenceShow/hide protein sequence
MMQPERSVHTEHEADRFLSTTRLNFNAFSESSIEYNSIPSPYSKSFDFKIVISNQRRFKWCSYISALLLLLIMALTLLLQFLPHKHNLHEASNNYTVAVNQALKFFDAQK
CRNPP