; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG02G004590 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG02G004590
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionEndoglucanase
Genome locationCG_Chr02:4774523..4780872
RNA-Seq ExpressionClCG02G004590
SyntenyClCG02G004590
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148538.1 uncharacterized protein LOC101213547 [Cucumis sativus]5.0e-10584.87Show/hide
Query:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED
        KEQVD LTPEDIAWVDSCLIKE+PDISDGNWN +KDALLEI+DLYPQ FESSLA+SDNVPGASNGDIDVDML  NNVKE TF SRDSDD MNET +  ED
Subjt:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED

Query:  SQNHDPMNETGIASEDPQ-NHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSV
             PMN+TGIASEDPQ +HDDIDTSL  T +KNPFLPTYKEEVEGNDENNQAG GHELSEIGS+ PIN+IFHVWDLNFPPVEDELVEQLNKALTENSV
Subjt:  SQNHDPMNETGIASEDPQ-NHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSV

Query:  ESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY
        E VPSMDSNL V KDLKEDLLDDLINSISDLSLEQT+Y
Subjt:  ESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY

XP_008444927.1 PREDICTED: uncharacterized protein LOC103488123 [Cucumis melo]5.2e-11089.03Show/hide
Query:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED
        KEQVD LT EDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQ FESSLA+SDNVPGASNGDIDVDML  NNVKEPTF SRDSDD MNET  A ED
Subjt:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED

Query:  SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVE
             PMN+TGIASEDPQ HDDIDTSL LT IKNPFLPTYKEEVEGNDEN+QAG GHELSEIGSE PINDIFHVWDLNFPPVEDEL+EQLNKALTENSVE
Subjt:  SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVE

Query:  SVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY
        SVPSMDSNL VLKDLKEDLLDDLINSISDLSLEQT+Y
Subjt:  SVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY

XP_022925089.1 uncharacterized protein LOC111432435 isoform X1 [Cucurbita moschata]4.4e-7759.43Show/hide
Query:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNET------
        KEQ+D LTPEDIAW DSCLIK+IPDI DGNWNH+KDALLE+LDLYPQ FES LAVSD VPG  N DIDVD+L   NVK+PT   RDSDDPMN+       
Subjt:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNET------

Query:  ------------------GIASED--------------------SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFG
                          G  S+D                      + D MNETG+ASED Q+HDDIDTSLSL+  KNPFLPTYKEEV+G  E+ Q    
Subjt:  ------------------GIASED--------------------SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFG

Query:  HELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY
        H+L EIG EPPINDIF VWDLN PP+E++LVEQLNKAL+ENS ESV   DSN  V+KD  +DLLD LI+SISDLSLE  +Y
Subjt:  HELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY

XP_022925097.1 uncharacterized protein LOC111432435 isoform X2 [Cucurbita moschata]4.4e-7759.43Show/hide
Query:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNET------
        KEQ+D LTPEDIAW DSCLIK+IPDI DGNWNH+KDALLE+LDLYPQ FES LAVSD VPG  N DIDVD+L   NVK+PT   RDSDDPMN+       
Subjt:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNET------

Query:  ------------------GIASED--------------------SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFG
                          G  S+D                      + D MNETG+ASED Q+HDDIDTSLSL+  KNPFLPTYKEEV+G  E+ Q    
Subjt:  ------------------GIASED--------------------SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFG

Query:  HELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY
        H+L EIG EPPINDIF VWDLN PP+E++LVEQLNKAL+ENS ESV   DSN  V+KD  +DLLD LI+SISDLSLE  +Y
Subjt:  HELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY

XP_038887167.1 uncharacterized protein LOC120077354 [Benincasa hispida]1.2e-10385.17Show/hide
Query:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED
        KEQVD+LTPEDIAWVDSCLIKE PDISDGNWNH+KDALLEILDLYPQSFESS AVS N PG SNGDIDVDMLSPNNVKEPTF SRDSD            
Subjt:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED

Query:  SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVE
            DPMNETGIASEDPQ HDDIDTSL  T IKNPFLPTYKEEVEGNDENNQAGF HELSEIGSE PINDIFHVWDLNFPPVEDELVEQLNKALTEN VE
Subjt:  SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVE

Query:  SVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTR
        S PSMD NL VLKDLKEDLLDDLINSISDLSLEQT+
Subjt:  SVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTR

TrEMBL top hitse value%identityAlignment
A0A0A0K2C8 Uncharacterized protein2.4e-10584.87Show/hide
Query:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED
        KEQVD LTPEDIAWVDSCLIKE+PDISDGNWN +KDALLEI+DLYPQ FESSLA+SDNVPGASNGDIDVDML  NNVKE TF SRDSDD MNET +  ED
Subjt:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED

Query:  SQNHDPMNETGIASEDPQ-NHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSV
             PMN+TGIASEDPQ +HDDIDTSL  T +KNPFLPTYKEEVEGNDENNQAG GHELSEIGS+ PIN+IFHVWDLNFPPVEDELVEQLNKALTENSV
Subjt:  SQNHDPMNETGIASEDPQ-NHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSV

Query:  ESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY
        E VPSMDSNL V KDLKEDLLDDLINSISDLSLEQT+Y
Subjt:  ESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY

A0A1S3BBI4 uncharacterized protein LOC1034881232.5e-11089.03Show/hide
Query:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED
        KEQVD LT EDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQ FESSLA+SDNVPGASNGDIDVDML  NNVKEPTF SRDSDD MNET  A ED
Subjt:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED

Query:  SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVE
             PMN+TGIASEDPQ HDDIDTSL LT IKNPFLPTYKEEVEGNDEN+QAG GHELSEIGSE PINDIFHVWDLNFPPVEDEL+EQLNKALTENSVE
Subjt:  SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVE

Query:  SVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY
        SVPSMDSNL VLKDLKEDLLDDLINSISDLSLEQT+Y
Subjt:  SVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY

A0A5D3BCZ2 Uncharacterized protein2.5e-11089.03Show/hide
Query:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED
        KEQVD LT EDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQ FESSLA+SDNVPGASNGDIDVDML  NNVKEPTF SRDSDD MNET  A ED
Subjt:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNETGIASED

Query:  SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVE
             PMN+TGIASEDPQ HDDIDTSL LT IKNPFLPTYKEEVEGNDEN+QAG GHELSEIGSE PINDIFHVWDLNFPPVEDEL+EQLNKALTENSVE
Subjt:  SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVE

Query:  SVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY
        SVPSMDSNL VLKDLKEDLLDDLINSISDLSLEQT+Y
Subjt:  SVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY

A0A6J1EAW3 uncharacterized protein LOC111432435 isoform X22.1e-7759.43Show/hide
Query:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNET------
        KEQ+D LTPEDIAW DSCLIK+IPDI DGNWNH+KDALLE+LDLYPQ FES LAVSD VPG  N DIDVD+L   NVK+PT   RDSDDPMN+       
Subjt:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNET------

Query:  ------------------GIASED--------------------SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFG
                          G  S+D                      + D MNETG+ASED Q+HDDIDTSLSL+  KNPFLPTYKEEV+G  E+ Q    
Subjt:  ------------------GIASED--------------------SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFG

Query:  HELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY
        H+L EIG EPPINDIF VWDLN PP+E++LVEQLNKAL+ENS ESV   DSN  V+KD  +DLLD LI+SISDLSLE  +Y
Subjt:  HELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY

A0A6J1EB39 uncharacterized protein LOC111432435 isoform X12.1e-7759.43Show/hide
Query:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNET------
        KEQ+D LTPEDIAW DSCLIK+IPDI DGNWNH+KDALLE+LDLYPQ FES LAVSD VPG  N DIDVD+L   NVK+PT   RDSDDPMN+       
Subjt:  KEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNVKEPTFPSRDSDDPMNET------

Query:  ------------------GIASED--------------------SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFG
                          G  S+D                      + D MNETG+ASED Q+HDDIDTSLSL+  KNPFLPTYKEEV+G  E+ Q    
Subjt:  ------------------GIASED--------------------SQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFG

Query:  HELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY
        H+L EIG EPPINDIF VWDLN PP+E++LVEQLNKAL+ENS ESV   DSN  V+KD  +DLLD LI+SISDLSLE  +Y
Subjt:  HELSEIGSEPPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G38980.1 unknown protein1.8e-1228.21Show/hide
Query:  DSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSF-----------------------------ESSLAVSDNVPGASNGDID--VDML-
        + L+PE +AW DSC+I  + D  + NW   +DAL EI+D++P+ F                             E +   +++   +SN ++   + ML 
Subjt:  DSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSF-----------------------------ESSLAVSDNVPGASNGDID--VDML-

Query:  -----SPNNVKEPTFPSRDSDDPMNETGIASE-DSQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEP
             S N +++  FP   +++   E    S+ D    + + E G  S + +  ++   S+S    K+ F+ TY   VE N E+        ++E   + 
Subjt:  -----SPNNVKEPTFPSRDSDDPMNETGIASE-DSQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEP

Query:  PINDIFHVWDLNF---PPVEDELVEQLNKALTENS-VESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQT
           +IF VWDL        ED LV QL KAL E+S V+ +P   ++  V+ +  +  +DDLI+ ISDLSL +T
Subjt:  PINDIFHVWDLNF---PPVEDELVEQLNKALTENS-VESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAAGATGCAAAATTACGAATGAAGCTGCAACGGGATGAGATTCGCGGGAGAACAGCCGCTGAAGAACGAAGATTTGTGAAGTGCAGACGAAGGAAAGAGCAAGT
CGATTCTCTTACTCCTGAAGACATTGCTTGGGTTGACTCTTGTCTGATTAAAGAGATACCAGATATTTCAGATGGCAATTGGAACCACGTAAAGGATGCCTTGTTAGAAA
TCCTTGACCTGTATCCTCAAAGTTTTGAATCTTCTCTTGCTGTGAGCGATAATGTTCCGGGAGCTAGTAATGGCGACATCGACGTTGACATGCTTTCCCCTAACAATGTG
AAGGAGCCTACATTTCCCTCGAGAGATAGTGACGATCCTATGAATGAAACAGGAATAGCTTCGGAAGATTCACAAAACCATGATCCTATGAATGAAACAGGAATAGCTTC
AGAAGATCCACAAAACCACGACGACATCGATACTTCTCTGTCATTAACTCCTATCAAGAATCCATTTTTACCCACTTACAAAGAGGAGGTAGAAGGGAATGATGAGAATA
ATCAAGCTGGATTTGGGCATGAATTATCAGAAATTGGATCTGAGCCCCCAATCAATGATATTTTCCATGTCTGGGATTTGAACTTCCCTCCAGTTGAAGATGAACTCGTC
GAGCAGCTGAACAAAGCCCTCACCGAAAATTCTGTTGAATCAGTCCCTTCAATGGACAGTAATCTTAATGTGTTGAAAGACTTAAAGGAAGATTTACTTGATGACTTGAT
CAATAGCATTTCTGACCTATCTTTGGAACAGACTAGATATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAAGATGCAAAATTACGAATGAAGCTGCAACGGGATGAGATTCGCGGGAGAACAGCCGCTGAAGAACGAAGATTTGTGAAGTGCAGACGAAGGAAAGAGCAAGT
CGATTCTCTTACTCCTGAAGACATTGCTTGGGTTGACTCTTGTCTGATTAAAGAGATACCAGATATTTCAGATGGCAATTGGAACCACGTAAAGGATGCCTTGTTAGAAA
TCCTTGACCTGTATCCTCAAAGTTTTGAATCTTCTCTTGCTGTGAGCGATAATGTTCCGGGAGCTAGTAATGGCGACATCGACGTTGACATGCTTTCCCCTAACAATGTG
AAGGAGCCTACATTTCCCTCGAGAGATAGTGACGATCCTATGAATGAAACAGGAATAGCTTCGGAAGATTCACAAAACCATGATCCTATGAATGAAACAGGAATAGCTTC
AGAAGATCCACAAAACCACGACGACATCGATACTTCTCTGTCATTAACTCCTATCAAGAATCCATTTTTACCCACTTACAAAGAGGAGGTAGAAGGGAATGATGAGAATA
ATCAAGCTGGATTTGGGCATGAATTATCAGAAATTGGATCTGAGCCCCCAATCAATGATATTTTCCATGTCTGGGATTTGAACTTCCCTCCAGTTGAAGATGAACTCGTC
GAGCAGCTGAACAAAGCCCTCACCGAAAATTCTGTTGAATCAGTCCCTTCAATGGACAGTAATCTTAATGTGTTGAAAGACTTAAAGGAAGATTTACTTGATGACTTGAT
CAATAGCATTTCTGACCTATCTTTGGAACAGACTAGATATTAG
Protein sequenceShow/hide protein sequence
MAEDAKLRMKLQRDEIRGRTAAEERRFVKCRRRKEQVDSLTPEDIAWVDSCLIKEIPDISDGNWNHVKDALLEILDLYPQSFESSLAVSDNVPGASNGDIDVDMLSPNNV
KEPTFPSRDSDDPMNETGIASEDSQNHDPMNETGIASEDPQNHDDIDTSLSLTPIKNPFLPTYKEEVEGNDENNQAGFGHELSEIGSEPPINDIFHVWDLNFPPVEDELV
EQLNKALTENSVESVPSMDSNLNVLKDLKEDLLDDLINSISDLSLEQTRY