CuGenDBv2

MC04g0438 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g0438
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Endoglucanase
Genome location: MC04:3513157..3514194
RNA-Seq Expression: MC04g0438
Synteny: MC04g0438
Gene Ontology terms:
  GO:0008152 - metabolic process (biological process)
  GO:0016787 - hydrolase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_004148538.1 uncharacterized protein LOC101213547 [Cucumis sativus] | e value: 5.45e-100 | % identity: 65.99
Query:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPW
        MVE SISS+LDS PESKE VD LTPEDIAWVDSCLI E PDISDGNWN++K ALLEI+DLYP+ F SS  LSDN PG +  DID  MLPSNN KE  F  
Subjt:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPW

Query:  GDDDD------------PMSEPGIASEDPR-DHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELL
         D DD            PM++ GIASEDP+  HDDIDTSL  +  KNPFLPTYKE+V G  EN Q G   +LSEIG + P+N+IF VWDLN PPVEDEL+
Subjt:  GDDDD------------PMSEPGIASEDPR-DHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELL

Query:  EQLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ
        EQLNKAL+ENS + +PSM  N  V KD KE+ LDDLINSISDLSLEQ
Subjt:  EQLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ

XP_008444927.1 PREDICTED: uncharacterized protein LOC103488123 [Cucumis melo] | e value: 1.68e-103 | % identity: 67.89
Query:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPW
        MVE SISS+LD  PESKE VD LT EDIAWVDSCLI E PDISDGNWN++K ALLEILDLYP+ F SS  LSDN PG +  DID  MLPSNN KE  F  
Subjt:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPW

Query:  GDDDD------------PMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLE
         D DD            PM++ GIASEDP+ HDDIDTSL L+  KNPFLPTYKE+V G  EN Q G   +LSEIG E P+NDIF VWDLN PPVEDEL+E
Subjt:  GDDDD------------PMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLE

Query:  QLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ
        QLNKAL+ENS +S+PSM  N  VLKD KE+ LDDLINSISDLSLEQ
Subjt:  QLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ

XP_022135637.1 uncharacterized protein LOC111007546 [Momordica charantia] | e value: 1.18e-166 | % identity: 100
Query:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDIDMLPSNNEKESIFPWGD
        MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDIDMLPSNNEKESIFPWGD
Subjt:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDIDMLPSNNEKESIFPWGD

Query:  DDDPMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDVGGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLP
        DDDPMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDVGGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLP
Subjt:  DDDPMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDVGGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLP

Query:  SMWDNRSVLKDFKEESLDDLINSISDLSLEQNN
        SMWDNRSVLKDFKEESLDDLINSISDLSLEQNN
Subjt:  SMWDNRSVLKDFKEESLDDLINSISDLSLEQNN

XP_022971265.1 uncharacterized protein LOC111470037 [Cucurbita maxima] | e value: 9.41e-89 | % identity: 53.31
Query:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID----------------
        M+E S SS L+S  ESKE +D LTPEDIAW DSCLI E PDI DGNWN++K ALLE+ DLYP+ F S   +SDN PGGT DD+D                
Subjt:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID----------------

Query:  ------------------------------------------MLPSNNEKESIFPWGDDDDPMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDV
                                                  ML SNN K+  F   D DD M+E G+ASED + HDDIDTSLSLSF+KNPFLPTYKE+V
Subjt:  ------------------------------------------MLPSNNEKESIFPWGDDDDPMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDV

Query:  GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSL
         GKE+ QT SS DL EIG+E P+NDIF+VWDLNLPP+E++L+EQLNKALSENS +S+     N SVLKDF ++ LD LI+SISDLSL
Subjt:  GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSL

XP_038887167.1 uncharacterized protein LOC120077354 [Benincasa hispida] | e value: 5.94e-100 | % identity: 71.82
Query:  ESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPWGDDDDPMSEPGIAS
        ESKE VDALTPEDIAWVDSCLI ETPDISDGNWN++K ALLEILDLYP+SF SS  +S NAPGG+  DID  ML  NN KE  F   D DDPM+E GIAS
Subjt:  ESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPWGDDDDPMSEPGIAS

Query:  EDPRDHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLPSMWDNRSVLKD
        EDP+ HDDIDTSL  +  KNPFLPTYKE+V G  EN Q G   +LSEIG E P+NDIF VWDLN PPVEDEL+EQLNKAL+EN  +S PSM  N  VLKD
Subjt:  EDPRDHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLPSMWDNRSVLKD

Query:  FKEESLDDLINSISDLSLEQ
         KE+ LDDLINSISDLSLEQ
Subjt:  FKEESLDDLINSISDLSLEQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K2C8 Uncharacterized protein | e value: 2.64e-100 | % identity: 65.99
Query:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPW
        MVE SISS+LDS PESKE VD LTPEDIAWVDSCLI E PDISDGNWN++K ALLEI+DLYP+ F SS  LSDN PG +  DID  MLPSNN KE  F  
Subjt:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPW

Query:  GDDDD------------PMSEPGIASEDPR-DHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELL
         D DD            PM++ GIASEDP+  HDDIDTSL  +  KNPFLPTYKE+V G  EN Q G   +LSEIG + P+N+IF VWDLN PPVEDEL+
Subjt:  GDDDD------------PMSEPGIASEDPR-DHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELL

Query:  EQLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ
        EQLNKAL+ENS + +PSM  N  V KD KE+ LDDLINSISDLSLEQ
Subjt:  EQLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ

A0A1S3BBI4 uncharacterized protein LOC103488123 | e value: 8.13e-104 | % identity: 67.89
Query:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPW
        MVE SISS+LD  PESKE VD LT EDIAWVDSCLI E PDISDGNWN++K ALLEILDLYP+ F SS  LSDN PG +  DID  MLPSNN KE  F  
Subjt:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPW

Query:  GDDDD------------PMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLE
         D DD            PM++ GIASEDP+ HDDIDTSL L+  KNPFLPTYKE+V G  EN Q G   +LSEIG E P+NDIF VWDLN PPVEDEL+E
Subjt:  GDDDD------------PMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLE

Query:  QLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ
        QLNKAL+ENS +S+PSM  N  VLKD KE+ LDDLINSISDLSLEQ
Subjt:  QLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ

A0A5D3BCZ2 Uncharacterized protein | e value: 8.13e-104 | % identity: 67.89
Query:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPW
        MVE SISS+LD  PESKE VD LT EDIAWVDSCLI E PDISDGNWN++K ALLEILDLYP+ F SS  LSDN PG +  DID  MLPSNN KE  F  
Subjt:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID--MLPSNNEKESIFPW

Query:  GDDDD------------PMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLE
         D DD            PM++ GIASEDP+ HDDIDTSL L+  KNPFLPTYKE+V G  EN Q G   +LSEIG E P+NDIF VWDLN PPVEDEL+E
Subjt:  GDDDD------------PMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDV-GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLE

Query:  QLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ
        QLNKAL+ENS +S+PSM  N  VLKD KE+ LDDLINSISDLSLEQ
Subjt:  QLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ

A0A6J1C3A0 uncharacterized protein LOC111007546 | e value: 5.71e-167 | % identity: 100
Query:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDIDMLPSNNEKESIFPWGD
        MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDIDMLPSNNEKESIFPWGD
Subjt:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDIDMLPSNNEKESIFPWGD

Query:  DDDPMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDVGGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLP
        DDDPMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDVGGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLP
Subjt:  DDDPMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDVGGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLP

Query:  SMWDNRSVLKDFKEESLDDLINSISDLSLEQNN
        SMWDNRSVLKDFKEESLDDLINSISDLSLEQNN
Subjt:  SMWDNRSVLKDFKEESLDDLINSISDLSLEQNN

A0A6J1I1I4 uncharacterized protein LOC111470037 | e value: 4.55e-89 | % identity: 53.31
Query:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID----------------
        M+E S SS L+S  ESKE +D LTPEDIAW DSCLI E PDI DGNWN++K ALLE+ DLYP+ F S   +SDN PGGT DD+D                
Subjt:  MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDID----------------

Query:  ------------------------------------------MLPSNNEKESIFPWGDDDDPMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDV
                                                  ML SNN K+  F   D DD M+E G+ASED + HDDIDTSLSLSF+KNPFLPTYKE+V
Subjt:  ------------------------------------------MLPSNNEKESIFPWGDDDDPMSEPGIASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDV

Query:  GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSL
         GKE+ QT SS DL EIG+E P+NDIF+VWDLNLPP+E++L+EQLNKALSENS +S+     N SVLKDF ++ LD LI+SISDLSL
Subjt:  GGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDLINSISDLSL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G38980.1 unknown protein | e value: 4.1e-13 | % identity: 27.57
Query:  LTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDIDMLPSNNEKESIFPWGDDDDPMSEPGIASEDPR---DHD
        L+PE +AW DSC+I+   D  + NW   + AL EI+D++PE F  S+        GT+  +       E E+I      D   SEP   S + R    ++
Subjt:  LTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDIDMLPSNNEKESIFPWGDDDDPMSEPGIASEDPR---DHD

Query:  DIDTSLS-LSFSKNP--------FLPT-------------YKEDVGGKENTQTGSSQDLSEIGYEFPVN-------------------------------
        ++   +S L+F  +P        + P               K D+GG E+ +   S    E   E P +                               
Subjt:  DIDTSLS-LSFSKNP--------FLPT-------------YKEDVGGKENTQTGSSQDLSEIGYEFPVN-------------------------------

Query:  ---DIFRVWDLNL---PPVEDELLEQLNKALSENS-FKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ
           +IF+VWDL +      ED L+ QL KAL E+S  + LP   ++  V+ +  + ++DDLI+ ISDLSL +
Subjt:  ---DIFRVWDLNL---PPVEDELLEQLNKALSENS-FKSLPSMWDNRSVLKDFKEESLDDLINSISDLSLEQ


Sequences
CDS sequence
ATGGTTGAAGAATCCATCTCTTCTTATCTCGACAGTAGACCGGAATCTAAAGAGCCAGTTGATGCTCTTACTCCTGAAGACATTGCTTGGGTTGATTCTTGTCTGATTAA
CGAGACACCCGATATTTCTGATGGGAATTGGAACAATATGAAGTCTGCCTTATTGGAAATCCTCGATCTGTATCCTGAAAGTTTTGCATCTTCTGCTGTATTAAGTGATA
ATGCTCCGGGAGGTACTAAAGATGACATTGACATGCTTCCCTCTAATAATGAAAAGGAATCCATATTTCCCTGGGGAGATGATGATGATCCTATGAGCGAACCAGGAATA
GCTTCGGAAGATCCACGAGACCATGACGATATCGATACATCTCTGTCGTTATCGTTTAGCAAGAATCCATTTTTACCCACTTACAAAGAAGATGTAGGAGGGAAGGAGAA
TACTCAAACTGGATCCAGCCAGGATTTATCAGAAATTGGATATGAGTTCCCAGTAAACGATATTTTCAGGGTCTGGGACTTGAACCTCCCTCCGGTCGAAGATGAGCTTC
TCGAGCAGCTTAACAAAGCCCTTTCCGAAAATTCCTTTAAATCACTTCCTTCAATGTGGGATAATCGTAGTGTCTTGAAAGACTTCAAGGAAGAATCACTTGATGACTTG
ATCAATAGCATTTCAGACCTGTCTTTAGAACAGAATAATTAG
mRNA sequence
ATGGTTGAAGAATCCATCTCTTCTTATCTCGACAGTAGACCGGAATCTAAAGAGCCAGTTGATGCTCTTACTCCTGAAGACATTGCTTGGGTTGATTCTTGTCTGATTAA
CGAGACACCCGATATTTCTGATGGGAATTGGAACAATATGAAGTCTGCCTTATTGGAAATCCTCGATCTGTATCCTGAAAGTTTTGCATCTTCTGCTGTATTAAGTGATA
ATGCTCCGGGAGGTACTAAAGATGACATTGACATGCTTCCCTCTAATAATGAAAAGGAATCCATATTTCCCTGGGGAGATGATGATGATCCTATGAGCGAACCAGGAATA
GCTTCGGAAGATCCACGAGACCATGACGATATCGATACATCTCTGTCGTTATCGTTTAGCAAGAATCCATTTTTACCCACTTACAAAGAAGATGTAGGAGGGAAGGAGAA
TACTCAAACTGGATCCAGCCAGGATTTATCAGAAATTGGATATGAGTTCCCAGTAAACGATATTTTCAGGGTCTGGGACTTGAACCTCCCTCCGGTCGAAGATGAGCTTC
TCGAGCAGCTTAACAAAGCCCTTTCCGAAAATTCCTTTAAATCACTTCCTTCAATGTGGGATAATCGTAGTGTCTTGAAAGACTTCAAGGAAGAATCACTTGATGACTTG
ATCAATAGCATTTCAGACCTGTCTTTAGAACAGAATAATTAGTAGGGACAGAGTTGTTCAACAGCCCTTGATTTCTGAAAGTTCATGCTGCTGACACGTGGCATTCCGTT
ATTTGTCAACTTTGGATGTCTCACATAGGAATAAGGGTGATTTTAGAGAAGCTACTAGGAGACTTAGATGATAGTTCTACAATCTGTTATTATTTTTGGCTCAGTGCAGT
CTAGTGGGATATGTAAGTACCTTTTTTCTCAGAATCTTATTGACCATTTTGGTTAATGACGACGTTATAGTAATTAATAGGTCTAATTGGCTGTAAAATTTGATTTCGTT
TTTGTGAAAGTAGGTGGTACTATTTATGTTGAATCAGATTTATACCAG
Protein sequence
MVEESISSYLDSRPESKEPVDALTPEDIAWVDSCLINETPDISDGNWNNMKSALLEILDLYPESFASSAVLSDNAPGGTKDDIDMLPSNNEKESIFPWGDDDDPMSEPGI
ASEDPRDHDDIDTSLSLSFSKNPFLPTYKEDVGGKENTQTGSSQDLSEIGYEFPVNDIFRVWDLNLPPVEDELLEQLNKALSENSFKSLPSMWDNRSVLKDFKEESLDDL
INSISDLSLEQNN