CuGenDBv2

Moc03g26370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g26370
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Endoglucanase
Genome location: chr3:19006049..19006468
RNA-Seq Expression: Moc03g26370
Synteny: Moc03g26370
Gene Ontology terms:
  GO:0030245 - cellulose catabolic process (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0008810 - cellulase activity (molecular function)
InterPro domains:
  IPR001701 - Glycoside hydrolase family 9
  IPR008928 - Six-hairpin glycosidase superfamily
  IPR012341 - Six-hairpin glycosidase-like superfamily
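
The genome location above is written as `chrom:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the length of the span could be computed as follows. `parse_location` is a hypothetical helper introduced for illustration, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its parts (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr3:19006049..19006468")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(chrom, span_length)
```

Under that assumption the span is 420 bp, the same length as the CDS listed for this gene, which is consistent with a single-exon gene model.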


Homology
GenBank top hits | e value | % identity
KAA0057069.1 endoglucanase 25 [Cucumis melo var. makuwa] | 1.8e-50 | 85.09
Query:  LFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD
        LFSAGRYPK+SPVKFRGDSGL+DGV  NK DGL+GGFYDSGNN+KFTFPTAYTITLLSWSVIEYHPKYADMNELDHV+DIIRWGT+YLLKVFVAPN TSD
Subjt:  LFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD

Query:  QAIIYSQVSHMASQ
        Q IIYSQV   +++
Subjt:  QAIIYSQVSHMASQ

KAE8653204.1 hypothetical protein Csa_019838 [Cucumis sativus] | 4.8e-51 | 90.74
Query:  LFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD
        LFSAGRYPK+SPVKFRGDSGLEDGV  NK DGL+GGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHV+DIIRWGT+YLLK+FVAPN TSD
Subjt:  LFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD

Query:  QAIIYSQV
        Q IIYSQV
Subjt:  QAIIYSQV

KAG6573332.1 Endoglucanase 7, partial [Cucurbita argyrosperma subsp. sororia] | 4.2e-47 | 82.73
Query:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI
        +GRYP+NSPV FRGDSGL+DGV  +K DGLVGGFYDSGNNIKFTFPTAYTITLL WSVIEYHPKYADMNELDHV+DII+WGTDYLLKVFVAPN TSD+ I
Subjt:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI

Query:  IYSQVSHMAS
        IYSQV  +++
Subjt:  IYSQVSHMAS

XP_022140170.1 endoglucanase 25-like [Momordica charantia] | 3.5e-54 | 99.05
Query:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI
        +GRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI
Subjt:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI

Query:  IYSQV
        IYSQV
Subjt:  IYSQV

XP_031745535.1 endoglucanase 9-like [Cucumis sativus] | 8.7e-53 | 90.18
Query:  TDKMLFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPN
        TDK LFSAGRYPK+SPVKFRGDSGLEDGV  NK DGL+GGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHV+DIIRWGT+YLLK+FVAPN
Subjt:  TDKMLFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPN

Query:  GTSDQAIIYSQV
         TSDQ IIYSQV
Subjt:  GTSDQAIIYSQV

TrEMBL top hits | e value | % identity
A0A0A0LXP8 Cellulase | 1.3e-54 | 89.57
Query:  TDKMLFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPN
        TDK LFSAGRYPK+SPVKFRGDSGLEDGV  NK DGL+GGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHV+DIIRWGT+YLLK+FVAPN
Subjt:  TDKMLFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPN

Query:  GTSDQAIIYSQVSHM
         TSDQ IIYSQVSH+
Subjt:  GTSDQAIIYSQVSHM

A0A5A7UU46 Endoglucanase | 8.8e-51 | 85.09
Query:  LFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD
        LFSAGRYPK+SPVKFRGDSGL+DGV  NK DGL+GGFYDSGNN+KFTFPTAYTITLLSWSVIEYHPKYADMNELDHV+DIIRWGT+YLLKVFVAPN TSD
Subjt:  LFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD

Query:  QAIIYSQVSHMASQ
        Q IIYSQV   +++
Subjt:  QAIIYSQVSHMASQ

A0A6J1ACP6 Endoglucanase | 9.1e-40 | 75.24
Query:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI
        +G YP  SP+KFRG SGL DG   N    LVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYH KYAD+ EL+H++DIIRWG+DYLLKVFVAPN TS+  I
Subjt:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI

Query:  IYSQV
        +YSQV
Subjt:  IYSQV

A0A6J1CED2 Endoglucanase | 1.7e-54 | 99.05
Query:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI
        +GRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI
Subjt:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI

Query:  IYSQV
        IYSQV
Subjt:  IYSQV

A0A6P6ANB4 Endoglucanase | 7.0e-40 | 73.33
Query:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI
        +G YP NSP++FRG SGL+DG + N    LVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYH KYAD+ EL H++D+I+WG+DYLLKVF+APN TSD  I
Subjt:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI

Query:  IYSQV
        +YSQV
Subjt:  IYSQV

SwissProt top hits | e value | % identity
O04478 Endoglucanase 7 | 8.0e-25 | 50.48
Query:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI
        +G+ PK + V +RGDSG +DG+ D  + GLVGG+YD G+N+KF FP A+++T+LSWS+IEY  KY  ++E DH+ D+++WGTDYLL  F   N  +    
Subjt:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI

Query:  IYSQV
        IY+QV
Subjt:  IYSQV

P0C1U4 Endoglucanase 9 | 1.5e-23 | 43.33
Query:  EDLFMPLTDKMLF----SAGRYPKNSPVKFRGDSGLEDGVVDNKLD-GLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWG
        +D  + L   ++F     +G+ PKN+ V +RG+S ++DG+ D  +   LVGG+YD+G+ +KF FP A+++TLLSWSVIEY  KY  + EL H+ D I+WG
Subjt:  EDLFMPLTDKMLF----SAGRYPKNSPVKFRGDSGLEDGVVDNKLD-GLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWG

Query:  TDYLLKVFVAPNGTSDQAII
         DY LK F +   T D+ ++
Subjt:  TDYLLKVFVAPNGTSDQAII

Q38890 Endoglucanase 25 | 4.4e-23 | 52.53
Query:  AGRYPKNSPVKFRGDSGLED--GVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD
        +G+ PK++ V +RG+SGL+D  G   +    LVGG+YD+G+ IKF FP AY +T+LSWSVIEY  KY    EL HV+++I+WGTDY LK F   N T+D
Subjt:  AGRYPKNSPVKFRGDSGLED--GVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD

Q7XUK4 Endoglucanase 12 | 6.5e-27 | 56.07
Query:  AGRYPKNSPVKFRGDSGLEDG--VVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQ
        +GR PKN+ +K+RG+SGL DG  + D K  GLVGG+YD+G+NIKF FP A+++T+LSWSVIEY  KY  + E DHV ++I+WGTDYLL  F +   T D+
Subjt:  AGRYPKNSPVKFRGDSGLEDG--VVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQ

Query:  AIIYSQV
          +YSQV
Subjt:  AIIYSQV

Q84R49 Endoglucanase 10 | 3.0e-24 | 50
Query:  MLFSA---GRYPKNSPVKFRGDSGLEDGVVDNKL-DGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAP
        M F+A   G  PK++ V +RG+S ++DG+ D+ +   LVGGFYD+G+ IKF +P A+++T+LSWSVIEY  KY  + ELDHV+++I+WGTDYLLK F + 
Subjt:  MLFSA---GRYPKNSPVKFRGDSGLEDGVVDNKL-DGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAP

Query:  NGTSDQAI
          T D+ +
Subjt:  NGTSDQAI

Arabidopsis top hits | e value | % identity
AT1G65610.1 Six-hairpin glycosidases superfamily protein | 5.7e-26 | 50.48
Query:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI
        +G+ PK + V +RGDSG +DG+ D  + GLVGG+YD G+N+KF FP A+++T+LSWS+IEY  KY  ++E DH+ D+++WGTDYLL  F   N  +    
Subjt:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI

Query:  IYSQV
        IY+QV
Subjt:  IYSQV

AT2G32990.1 glycosyl hydrolase 9B8 | 2.1e-20 | 48.39
Query:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPN
        +GR P N  V +R  SGL DG ++  +D LVGG++D+G+++KF  P A+T+T+LSWSVIEY    A   EL H  + I+WGTDY +K   +PN
Subjt:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPN

AT4G23560.1 glycosyl hydrolase 9B15 | 1.9e-21 | 41.9
Query:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI
        +G+ P N  VK+R DS L DG + N    L+GG+YD+G+N+KF +P ++T TLLSW+ IEY  + + +N+L ++   I+WGTD++L+   +PN      +
Subjt:  AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI

Query:  IYSQV
        +Y+QV
Subjt:  IYSQV

AT4G24260.1 glycosyl hydrolase 9A3 | 2.2e-22 | 48.67
Query:  AGRYPKN-SPVKFRGDSGLEDGVVD--NKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD
        +G+ PKN   V +R DS L+DG  D       LVGG+YD+G++IKF FP +Y +T+LSWSVIEY  KY    EL+HV+++I+WGTDY LK F   N ++D
Subjt:  AGRYPKN-SPVKFRGDSGLEDGVVD--NKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD

Query:  QAIIYSQVSHMAS
           IY  V  + S
Subjt:  QAIIYSQVSHMAS

AT5G49720.1 glycosyl hydrolase 9A1 | 3.1e-24 | 52.53
Query:  AGRYPKNSPVKFRGDSGLED--GVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD
        +G+ PK++ V +RG+SGL+D  G   +    LVGG+YD+G+ IKF FP AY +T+LSWSVIEY  KY    EL HV+++I+WGTDY LK F   N T+D
Subjt:  AGRYPKNSPVKFRGDSGLED--GVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSD
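
The % identity values in the hit lists can be checked against the alignment text. The following is a minimal sketch, assuming the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and assuming identity is reported as identical positions divided by alignment length. `percent_identity` is a hypothetical helper; the rows are copied from the XP_022140170.1 hit above.

```python
def percent_identity(query_rows: list[str], midline_rows: list[str]) -> float:
    """Percent identity from aligned query rows and their midline rows."""
    # Alignment length counts every aligned column, including gap columns.
    alignment_length = sum(len(row) for row in query_rows)
    # On the midline, only letters mark identical residues ('+' and ' ' do not).
    identities = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identities / alignment_length


query_rows = [
    "AGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI",
    "IYSQV",
]
midline_rows = [
    "+GRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDIIRWGTDYLLKVFVAPNGTSDQAI",
    "IYSQV",
]
identity = percent_identity(query_rows, midline_rows)
print(f"{identity:.2f}")  # 99.05
```

Here 104 of 105 aligned positions are identical, which reproduces the 99.05 reported for that hit.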


Sequences
CDS sequence
ATGCTCAAAAATGTAAAAATCTCTCTTAACTCGAGAATTTCAGAAGATCTTTTCATGCCCCTGACTGATAAAATGCTTTTCTCAGCTGGTAGGTACCCAAAAAAT
AGTCCAGTGAAGTTTCGAGGAGATTCAGGCTTGGAAGATGGGGTTGTGGACAATAAACTGGATGGTCTCGTTGGTGGTTTCTATGATTCAGGAAACAATATTAAG
TTCACTTTCCCCACAGCTTATACCATTACTCTTTTAAGCTGGAGTGTGATTGAGTATCATCCAAAGTATGCAGACATGAATGAGCTTGATCATGTCGAGGACATC
ATCAGGTGGGGAACTGATTATTTGCTCAAAGTTTTTGTGGCCCCAAATGGCACTTCTGATCAAGCCATAATATATTCTCAGGTAAGTCATATGGCCTCCCAATGA
mRNA sequence
ATGCTCAAAAATGTAAAAATCTCTCTTAACTCGAGAATTTCAGAAGATCTTTTCATGCCCCTGACTGATAAAATGCTTTTCTCAGCTGGTAGGTACCCAAAAAAT
AGTCCAGTGAAGTTTCGAGGAGATTCAGGCTTGGAAGATGGGGTTGTGGACAATAAACTGGATGGTCTCGTTGGTGGTTTCTATGATTCAGGAAACAATATTAAG
TTCACTTTCCCCACAGCTTATACCATTACTCTTTTAAGCTGGAGTGTGATTGAGTATCATCCAAAGTATGCAGACATGAATGAGCTTGATCATGTCGAGGACATC
ATCAGGTGGGGAACTGATTATTTGCTCAAAGTTTTTGTGGCCCCAAATGGCACTTCTGATCAAGCCATAATATATTCTCAGGTAAGTCATATGGCCTCCCAATGA
Protein sequence
MLKNVKISLNSRISEDLFMPLTDKMLFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDI
IRWGTDYLLKVFVAPNGTSDQAIIYSQVSHMASQ
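
The CDS and protein sequences above can be cross-checked by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and reading frame 0; `translate` is a hypothetical helper written for illustration, not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# CDS and protein copied from the record above.
CDS = (
    "ATGCTCAAAAATGTAAAAATCTCTCTTAACTCGAGAATTTCAGAAGATCTTTTCATGCCCCTGACTGATAAAATGCTTTTCTCAGCTGGTAGGTACCCAAAAAAT"
    "AGTCCAGTGAAGTTTCGAGGAGATTCAGGCTTGGAAGATGGGGTTGTGGACAATAAACTGGATGGTCTCGTTGGTGGTTTCTATGATTCAGGAAACAATATTAAG"
    "TTCACTTTCCCCACAGCTTATACCATTACTCTTTTAAGCTGGAGTGTGATTGAGTATCATCCAAAGTATGCAGACATGAATGAGCTTGATCATGTCGAGGACATC"
    "ATCAGGTGGGGAACTGATTATTTGCTCAAAGTTTTTGTGGCCCCAAATGGCACTTCTGATCAAGCCATAATATATTCTCAGGTAAGTCATATGGCCTCCCAATGA"
)
PROTEIN = (
    "MLKNVKISLNSRISEDLFMPLTDKMLFSAGRYPKNSPVKFRGDSGLEDGVVDNKLDGLVGGFYDSGNNIKFTFPTAYTITLLSWSVIEYHPKYADMNELDHVEDI"
    "IRWGTDYLLKVFVAPNGTSDQAIIYSQVSHMASQ"
)

translated = translate(CDS)
print(len(CDS), len(translated), translated == PROTEIN)
```

The 420-nt CDS encodes 139 residues followed by a TGA stop codon, and the translation agrees with the listed protein sequence.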