; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G014100 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G014100
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionENDO3c domain-containing protein
Genome locationCG_Chr01:28366347..28369989
RNA-Seq ExpressionClCG01G014100
SyntenyClCG01G014100
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0019104 - DNA N-glycosylase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055207.1 putative DNA glycosylase [Cucumis melo var. makuwa]1.0e-4185.58Show/hide
Query:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA
        +PYP+HSSPTSDECLSVRDDLLNLHGFPREFLKYRK+RERLSECC  +DG   EH DNVESELV EKESVLDGLV+TVLSQNTTEANSERAF SLKSAF+
Subjt:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA

Query:  NWED
         WED
Subjt:  NWED

XP_008438070.1 PREDICTED: putative DNA glycosylase At3g47830 [Cucumis melo]9.4e-4387.5Show/hide
Query:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA
        +PYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRK+RERLSECCS +DG   EH DNVESELV EKESVLDGLV+TVLSQNTTEANSERAF SLKSAF+
Subjt:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA

Query:  NWED
         WED
Subjt:  NWED

XP_011651429.1 putative DNA glycosylase At3g47830 [Cucumis sativus]5.9e-4589.62Show/hide
Query:  SIDPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSA
        +IDPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRK+RERLSECCS +DG   EHRDNVESE V EKESVLDGLV+TVLSQNTTEANSERAFASLKSA
Subjt:  SIDPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSA

Query:  FANWED
        FA WED
Subjt:  FANWED

XP_022136993.1 putative DNA glycosylase At3g47830 [Momordica charantia]1.1e-3882.86Show/hide
Query:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERL-SECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAF
        DPYPAH+SPTSD+CLS+RDDLLNLHGFPREF+KYRK+R+R  SECCS + GG GE  D+V+SELV EKESVLDGLVRTVLSQNTTEANSERAFASLKSAF
Subjt:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERL-SECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAF

Query:  ANWED
        A WED
Subjt:  ANWED

XP_038894941.1 putative DNA glycosylase At3g47830 [Benincasa hispida]5.3e-4692.31Show/hide
Query:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA
        DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRK+RERLSECCS +DGG GEHRDNVESE V EKESVLDGLVRTVLSQNTTEANSERAF+SLKSAFA
Subjt:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA

Query:  NWED
         WED
Subjt:  NWED

TrEMBL top hitse value%identityAlignment
A0A0A0LTF1 ENDO3c domain-containing protein2.9e-4589.62Show/hide
Query:  SIDPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSA
        +IDPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRK+RERLSECCS +DG   EHRDNVESE V EKESVLDGLV+TVLSQNTTEANSERAFASLKSA
Subjt:  SIDPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSA

Query:  FANWED
        FA WED
Subjt:  FANWED

A0A1S3AW45 putative DNA glycosylase At3g478304.6e-4387.5Show/hide
Query:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA
        +PYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRK+RERLSECCS +DG   EH DNVESELV EKESVLDGLV+TVLSQNTTEANSERAF SLKSAF+
Subjt:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA

Query:  NWED
         WED
Subjt:  NWED

A0A5A7UKV1 Putative DNA glycosylase5.0e-4285.58Show/hide
Query:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA
        +PYP+HSSPTSDECLSVRDDLLNLHGFPREFLKYRK+RERLSECC  +DG   EH DNVESELV EKESVLDGLV+TVLSQNTTEANSERAF SLKSAF+
Subjt:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA

Query:  NWED
         WED
Subjt:  NWED

A0A5D3BJ43 Putative DNA glycosylase4.6e-4387.5Show/hide
Query:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA
        +PYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRK+RERLSECCS +DG   EH DNVESELV EKESVLDGLV+TVLSQNTTEANSERAF SLKSAF+
Subjt:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFA

Query:  NWED
         WED
Subjt:  NWED

A0A6J1C919 putative DNA glycosylase At3g478305.2e-3982.86Show/hide
Query:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERL-SECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAF
        DPYPAH+SPTSD+CLS+RDDLLNLHGFPREF+KYRK+R+R  SECCS + GG GE  D+V+SELV EKESVLDGLVRTVLSQNTTEANSERAFASLKSAF
Subjt:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERL-SECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAF

Query:  ANWED
        A WED
Subjt:  ANWED

SwissProt top hitse value%identityAlignment
F4JCQ3 Putative DNA glycosylase At3g478307.3e-2253.77Show/hide
Query:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVH--EKESVLDGLVRTVLSQNTTEANSERAFASLKSA
        +PYP    PT++EC  VRD LL+LHGFP EF  YR+QR R      D D      + N++SE ++  E+ESVLDGLV+ +LSQNTTE+NS+RAFASLK+ 
Subjt:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVH--EKESVLDGLVRTVLSQNTTEANSERAFASLKSA

Query:  FANWED
        F  W+D
Subjt:  FANWED

Arabidopsis top hitse value%identityAlignment
AT3G47830.1 DNA glycosylase superfamily protein5.2e-2353.77Show/hide
Query:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVH--EKESVLDGLVRTVLSQNTTEANSERAFASLKSA
        +PYP    PT++EC  VRD LL+LHGFP EF  YR+QR R      D D      + N++SE ++  E+ESVLDGLV+ +LSQNTTE+NS+RAFASLK+ 
Subjt:  DPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVH--EKESVLDGLVRTVLSQNTTEANSERAFASLKSA

Query:  FANWED
        F  W+D
Subjt:  FANWED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTATTGACCCGTATCCGGCTCATTCTTCGCCCACTTCGGACGAATGTCTGTCCGTGAGGGACGATCTGTTGAATCTTCATGGTTTCCCTCGAGAGTTTCTTAAGTA
TCGGAAGCAGCGAGAGAGACTGAGCGAGTGCTGCTCCGACATGGACGGCGGCAGCGGTGAGCACCGGGATAATGTGGAATCGGAGCTCGTCCATGAGAAGGAGAGCGTTT
TGGATGGATTGGTGAGGACTGTGCTCTCGCAGAACACTACTGAGGCTAATTCCGAGAGGGCTTTTGCTTCTCTCAAGTCTGCTTTTGCTAACTGGGAGGATAAAGATGAT
TTTCCAGTAGACACCCATGTGAGTAACTGGATTCTTGATTCTGATGAGTTTGTGATCTCTAAAGACCATTGGAAGGTTTTGATTTTCCAACATGTTTGCAGGTCTTTGAG
ATTGCGAAATTTGCCGGTTGGGTCCCGGATGAGGCAGACAGGAACAAAACATATCTTCATCTTAACTAAAGGATCCCAAATCATCTCAAATTTGATCTCAACTGTCTTCT
TTACACTCATGGCAAGGTCTATTCGAAATGTACGAAGAGAACAGGCGGGCGACAACGAAAGGGATCAGAAGATCAGTCTTGTCCCTTGTTCAAGTACTCCAAGAACCCGT
AAATTTGTGAATATAAATGTGGCTATTAGCTATGCGTCACGGGGCTTATGCGTTGAAGGTGGTTACCGTAGGTTGTTAAGGACGAATAAGCAAGAAGGTATAAAGTTAAG
TCATGGAGTGTTAGAAAGAGATCTAAGTGTCATGGATGGAAATAACGCCATTTCAAACCGCGACACAAACGCCCTTGACTCGCCTCGCATTGTCAAGGGAAGGTCGAGAG
TGAAGGGCCAAGGCGTTGACAAAACCGCTGCACGAGGATGCCATCGCATGCAGACGGGACGCAAGTGCCATCGGGCCACACCCATGCCTCAGGTGTATGCCTTCGACATA
GCCGGATCCCCACCCGAGTGCCTCAACAGACACCTATGTGCGTTGACACATGGCAGAGTGGCTGCGCAGCCAACCATAAGTATGCGGCATGCACGCTCAACTGCAAGGGT
GAGTAGCATGACGTGCCCGCATATGCGAAGTTACCAGGTGTGCAAGTTCGACGCATATCCACGATGCATTCAGAGTGGGAACAACGCATGCGCGCCTCGTAAGTGCCAGC
AGTGTACGCATATCGTCGACGCCTCATGTGTACACCGCCTAGAGACCATGTGTCCAACTGCATCCTTCGACGCATGGAACATCCCAGACGCCTTGAGTAAGTCCTGGAAG
ATTCTAGACGCATCGGGACAAGGCCAGAGACCTCGACAACCCTCTGGACGCGTCTGGAGAGGCGGGCAGGCCGCTGGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTATTGACCCGTATCCGGCTCATTCTTCGCCCACTTCGGACGAATGTCTGTCCGTGAGGGACGATCTGTTGAATCTTCATGGTTTCCCTCGAGAGTTTCTTAAGTA
TCGGAAGCAGCGAGAGAGACTGAGCGAGTGCTGCTCCGACATGGACGGCGGCAGCGGTGAGCACCGGGATAATGTGGAATCGGAGCTCGTCCATGAGAAGGAGAGCGTTT
TGGATGGATTGGTGAGGACTGTGCTCTCGCAGAACACTACTGAGGCTAATTCCGAGAGGGCTTTTGCTTCTCTCAAGTCTGCTTTTGCTAACTGGGAGGATAAAGATGAT
TTTCCAGTAGACACCCATGTGAGTAACTGGATTCTTGATTCTGATGAGTTTGTGATCTCTAAAGACCATTGGAAGGTTTTGATTTTCCAACATGTTTGCAGGTCTTTGAG
ATTGCGAAATTTGCCGGTTGGGTCCCGGATGAGGCAGACAGGAACAAAACATATCTTCATCTTAACTAAAGGATCCCAAATCATCTCAAATTTGATCTCAACTGTCTTCT
TTACACTCATGGCAAGGTCTATTCGAAATGTACGAAGAGAACAGGCGGGCGACAACGAAAGGGATCAGAAGATCAGTCTTGTCCCTTGTTCAAGTACTCCAAGAACCCGT
AAATTTGTGAATATAAATGTGGCTATTAGCTATGCGTCACGGGGCTTATGCGTTGAAGGTGGTTACCGTAGGTTGTTAAGGACGAATAAGCAAGAAGGTATAAAGTTAAG
TCATGGAGTGTTAGAAAGAGATCTAAGTGTCATGGATGGAAATAACGCCATTTCAAACCGCGACACAAACGCCCTTGACTCGCCTCGCATTGTCAAGGGAAGGTCGAGAG
TGAAGGGCCAAGGCGTTGACAAAACCGCTGCACGAGGATGCCATCGCATGCAGACGGGACGCAAGTGCCATCGGGCCACACCCATGCCTCAGGTGTATGCCTTCGACATA
GCCGGATCCCCACCCGAGTGCCTCAACAGACACCTATGTGCGTTGACACATGGCAGAGTGGCTGCGCAGCCAACCATAAGTATGCGGCATGCACGCTCAACTGCAAGGGT
GAGTAGCATGACGTGCCCGCATATGCGAAGTTACCAGGTGTGCAAGTTCGACGCATATCCACGATGCATTCAGAGTGGGAACAACGCATGCGCGCCTCGTAAGTGCCAGC
AGTGTACGCATATCGTCGACGCCTCATGTGTACACCGCCTAGAGACCATGTGTCCAACTGCATCCTTCGACGCATGGAACATCCCAGACGCCTTGAGTAAGTCCTGGAAG
ATTCTAGACGCATCGGGACAAGGCCAGAGACCTCGACAACCCTCTGGACGCGTCTGGAGAGGCGGGCAGGCCGCTGGGTAA
Protein sequenceShow/hide protein sequence
MSIDPYPAHSSPTSDECLSVRDDLLNLHGFPREFLKYRKQRERLSECCSDMDGGSGEHRDNVESELVHEKESVLDGLVRTVLSQNTTEANSERAFASLKSAFANWEDKDD
FPVDTHVSNWILDSDEFVISKDHWKVLIFQHVCRSLRLRNLPVGSRMRQTGTKHIFILTKGSQIISNLISTVFFTLMARSIRNVRREQAGDNERDQKISLVPCSSTPRTR
KFVNINVAISYASRGLCVEGGYRRLLRTNKQEGIKLSHGVLERDLSVMDGNNAISNRDTNALDSPRIVKGRSRVKGQGVDKTAARGCHRMQTGRKCHRATPMPQVYAFDI
AGSPPECLNRHLCALTHGRVAAQPTISMRHARSTARVSSMTCPHMRSYQVCKFDAYPRCIQSGNNACAPRKCQQCTHIVDASCVHRLETMCPTASFDAWNIPDALSKSWK
ILDASGQGQRPRQPSGRVWRGGQAAG