CuGenDBv2

ClCG05G018840 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G018840
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: DNA polymerase epsilon catalytic subunit A, putative
Genome location: CG_Chr05:31096271..31097181
RNA-Seq Expression: ClCG05G018840
Synteny: ClCG05G018840
Gene Ontology terms: NA
InterPro domains: NA
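
The genome location field can be split into a sequence ID and a coordinate pair. The following is a minimal sketch, assuming the `seqid:start..end` layout seen in this record and assuming 1-based, inclusive coordinates (the record itself does not state the convention); the variable names are illustrative only.

```python
import re

# Genome location string copied from the record above.
location = "CG_Chr05:31096271..31097181"

# Assumed layout: <sequence id>:<start>..<end>
match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
if match is None:
    raise ValueError(f"unrecognized location: {location!r}")

seqid = match.group("seqid")
start = int(match.group("start"))
end = int(match.group("end"))

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end - start + 1

print(seqid, start, end, span_bp)
```

Under that assumption the locus spans 911 bp.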


Homology
GenBank top hits
XP_004147861.1 uncharacterized protein LOC101222738 [Cucumis sativus] | e value: 1.6e-62 | %identity: 86.11
Query:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEK-KMEYGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA
        MGSLMAGWDSP TDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAIL P+E+ EEK K   G NLQRSASMPP NT+KGL  +M+SET LEKP+KN 
Subjt:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEK-KMEYGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA

Query:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA
        WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGG+SA
Subjt:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA

XP_008466545.1 PREDICTED: uncharacterized protein LOC103503930 [Cucumis melo] | e value: 4.3e-63 | %identity: 86.11
Query:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEKKME-YGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA
        MGSLMAGWDSP TDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAIL P+E+ EEK+ E  G NLQRSASMPP NT+KGLL +++SET LEKP+KN 
Subjt:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEKKME-YGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA

Query:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA
        WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGG+SA
Subjt:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA

XP_022940046.1 uncharacterized protein LOC111445795 [Cucurbita moschata] | e value: 2.1e-57 | %identity: 81.51
Query:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEE-KKMEYGG-NLQRSASMPPSNTKKGLLSDMESETKLE-KPKK
        MGSLMAGWDSPA+DP+EVSHRRNKSLTKEEIEAFWKTKKQ+HEEHLRAIL P++S EE KK+E+GG NLQRS+S+PP NT+KG L DMESE  L+ KPKK
Subjt:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEE-KKMEYGG-NLQRSASMPPSNTKKGLLSDMESETKLE-KPKK

Query:  NAWWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA
        N WWRRSNWAFLNEPP  EGSGNSYVSQFHVAN+AASRLGRGG+ A
Subjt:  NAWWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA

XP_022982140.1 uncharacterized protein LOC111481065 [Cucurbita maxima] | e value: 7.1e-58 | %identity: 82.19
Query:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEE-KKMEYGG-NLQRSASMPPSNTKKGLLSDMESETKLE-KPKK
        MGSLMAGWDSPA+DP+EVSHRRNKSLTKEEIEAFWKTKKQ+HEEHLRAIL P++S EE KK+E+GG NLQRS+S+PP NT+KGLL DMESE  L+ KPKK
Subjt:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEE-KKMEYGG-NLQRSASMPPSNTKKGLLSDMESETKLE-KPKK

Query:  NAWWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA
        N WWRRSNWAFLNEPP  EGSGNSYVSQFHVAN+AASRLGRGG+ A
Subjt:  NAWWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA

XP_038898962.1 uncharacterized protein LOC120086405 [Benincasa hispida] | e value: 3.0e-64 | %identity: 89.51
Query:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEKKMEYGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNAW
        MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAIL P+E+ EEKKME GGNLQRSAS+PP  T KGLL +MESE  LEKPKKNAW
Subjt:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEKKMEYGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNAW

Query:  WRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA
        WRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLG GG+SA
Subjt:  WRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA

TrEMBL top hits
A0A0A0LDC2 Uncharacterized protein | e value: 7.9e-63 | %identity: 86.11
Query:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEK-KMEYGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA
        MGSLMAGWDSP TDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAIL P+E+ EEK K   G NLQRSASMPP NT+KGL  +M+SET LEKP+KN 
Subjt:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEK-KMEYGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA

Query:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA
        WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGG+SA
Subjt:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA

A0A1S3CST6 uncharacterized protein LOC103503930 | e value: 2.1e-63 | %identity: 86.11
Query:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEKKME-YGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA
        MGSLMAGWDSP TDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAIL P+E+ EEK+ E  G NLQRSASMPP NT+KGLL +++SET LEKP+KN 
Subjt:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEKKME-YGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA

Query:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA
        WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGG+SA
Subjt:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA

A0A5D3D820 Putative DNA polymerase epsilon catalytic subunit A | e value: 2.1e-63 | %identity: 86.11
Query:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEKKME-YGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA
        MGSLMAGWDSP TDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAIL P+E+ EEK+ E  G NLQRSASMPP NT+KGLL +++SET LEKP+KN 
Subjt:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEKKME-YGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA

Query:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA
        WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGG+SA
Subjt:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA

A0A6J1FHE8 uncharacterized protein LOC111445795 | e value: 1.0e-57 | %identity: 81.51
Query:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEE-KKMEYGG-NLQRSASMPPSNTKKGLLSDMESETKLE-KPKK
        MGSLMAGWDSPA+DP+EVSHRRNKSLTKEEIEAFWKTKKQ+HEEHLRAIL P++S EE KK+E+GG NLQRS+S+PP NT+KG L DMESE  L+ KPKK
Subjt:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEE-KKMEYGG-NLQRSASMPPSNTKKGLLSDMESETKLE-KPKK

Query:  NAWWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA
        N WWRRSNWAFLNEPP  EGSGNSYVSQFHVAN+AASRLGRGG+ A
Subjt:  NAWWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA

A0A6J1J1T2 uncharacterized protein LOC111481065 | e value: 3.4e-58 | %identity: 82.19
Query:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEE-KKMEYGG-NLQRSASMPPSNTKKGLLSDMESETKLE-KPKK
        MGSLMAGWDSPA+DP+EVSHRRNKSLTKEEIEAFWKTKKQ+HEEHLRAIL P++S EE KK+E+GG NLQRS+S+PP NT+KGLL DMESE  L+ KPKK
Subjt:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEE-KKMEYGG-NLQRSASMPPSNTKKGLLSDMESETKLE-KPKK

Query:  NAWWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA
        N WWRRSNWAFLNEPP  EGSGNSYVSQFHVAN+AASRLGRGG+ A
Subjt:  NAWWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G19530.1 unknown protein | e value: 3.7e-20 | %identity: 42.86
Query:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQ-VHEEHLRAILRPYESYEEKKMEYGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA
        MGSLM+GWDS   DP+ V  RR KSLT+EEI+ FWKTKK+   EEH++A              +   + +  +   +  KK +    E+++     K + 
Subjt:  MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQ-VHEEHLRAILRPYESYEEKKMEYGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA

Query:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMA
        WWR++ WAFLNEP E EG  N+YVSQF VA++A
Subjt:  WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMA
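
The %identity values listed for each hit can be cross-checked against the alignment's middle row. The sketch below does this for the first GenBank hit (XP_004147861.1), assuming the usual BLAST convention that a letter in the middle row marks an identical residue pair, `+` marks a conservative substitution, and a space marks a mismatch or gap, and that percent identity is identities divided by the total number of alignment columns. The variable names are illustrative.

```python
# Query and middle ("midline") rows of the XP_004147861.1 alignment, copied
# from the record; the alignment is printed in two blocks.
query_blocks = [
    "MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEK-KMEYGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNA",
    "WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGGISA",
]
midline_blocks = [
    "MGSLMAGWDSP TDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAIL P+E+ EEK K   G NLQRSASMPP NT+KGL  +M+SET LEKP+KN ",
    "WWRRSNWAFLNEPPETEGSGNSYVSQFHVANMAASRLGRGG+SA",
]

# Assumption: only letters in the midline count as identities.
identities = sum(ch.isalpha() for block in midline_blocks for ch in block)

# Alignment length counts every column, including gap columns ('-').
alignment_length = sum(len(block) for block in query_blocks)

percent_identity = 100 * identities / alignment_length
print(f"{identities}/{alignment_length} identical = {percent_identity:.2f}%")
```

Counting from the middle row rather than comparing the Query and Subjt rows is deliberate: in this record the Subjt rows repeat the query sequence, so only the middle row carries the match information.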


Sequences
CDS sequence
ATGGGTTCTCTAATGGCTGGTTGGGACTCCCCGGCGACTGACCCCCAAGAAGTGAGTCACCGGAGGAATAAATCGTTGACAAAAGAAGAGATTGAAGCGTTCTGGAAAAC
GAAGAAACAAGTGCATGAAGAACATCTAAGAGCCATTTTAAGGCCATATGAGAGCTATGAGGAAAAGAAAATGGAATATGGGGGGAATCTTCAGAGATCAGCCTCTATGC
CACCATCCAATACAAAGAAGGGTTTATTATCGGACATGGAATCTGAAACTAAGCTAGAAAAGCCGAAGAAAAACGCCTGGTGGAGAAGAAGCAACTGGGCGTTTCTAAAC
GAACCGCCGGAGACGGAAGGATCTGGTAACAGCTACGTGTCGCAGTTCCACGTGGCAAACATGGCGGCTTCCAGACTTGGTCGTGGTGGCATCAGTGCTTGA
mRNA sequence
ATGGGTTCTCTAATGGCTGGTTGGGACTCCCCGGCGACTGACCCCCAAGAAGTGAGTCACCGGAGGAATAAATCGTTGACAAAAGAAGAGATTGAAGCGTTCTGGAAAAC
GAAGAAACAAGTGCATGAAGAACATCTAAGAGCCATTTTAAGGCCATATGAGAGCTATGAGGAAAAGAAAATGGAATATGGGGGGAATCTTCAGAGATCAGCCTCTATGC
CACCATCCAATACAAAGAAGGGTTTATTATCGGACATGGAATCTGAAACTAAGCTAGAAAAGCCGAAGAAAAACGCCTGGTGGAGAAGAAGCAACTGGGCGTTTCTAAAC
GAACCGCCGGAGACGGAAGGATCTGGTAACAGCTACGTGTCGCAGTTCCACGTGGCAAACATGGCGGCTTCCAGACTTGGTCGTGGTGGCATCAGTGCTTGA
Protein sequence
MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEKKMEYGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNAWWRRSNWAFLN
EPPETEGSGNSYVSQFHVANMAASRLGRGGISA
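
As a consistency check on the three sequences above, the CDS can be translated and compared with the listed protein. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for this example, not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G
# with the first base varying slowest and the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein sequences copied from the record (line wraps removed).
cds = (
    "ATGGGTTCTCTAATGGCTGGTTGGGACTCCCCGGCGACTGACCCCCAAGAAGTGAGTCACCGGAGGAATAAATCGTTGACAAAAGAAGAGATTGAAGCGTTCTGGAAAAC"
    "GAAGAAACAAGTGCATGAAGAACATCTAAGAGCCATTTTAAGGCCATATGAGAGCTATGAGGAAAAGAAAATGGAATATGGGGGGAATCTTCAGAGATCAGCCTCTATGC"
    "CACCATCCAATACAAAGAAGGGTTTATTATCGGACATGGAATCTGAAACTAAGCTAGAAAAGCCGAAGAAAAACGCCTGGTGGAGAAGAAGCAACTGGGCGTTTCTAAAC"
    "GAACCGCCGGAGACGGAAGGATCTGGTAACAGCTACGTGTCGCAGTTCCACGTGGCAAACATGGCGGCTTCCAGACTTGGTCGTGGTGGCATCAGTGCTTGA"
)
protein = (
    "MGSLMAGWDSPATDPQEVSHRRNKSLTKEEIEAFWKTKKQVHEEHLRAILRPYESYEEKKMEYGGNLQRSASMPPSNTKKGLLSDMESETKLEKPKKNAWWRRSNWAFLN"
    "EPPETEGSGNSYVSQFHVANMAASRLGRGGISA"
)

translated = translate(cds)

# Drop the trailing stop symbol before comparing with the listed protein.
matches = translated.rstrip("*") == protein
print(len(cds), len(protein), matches)
```

The same check applies to the mRNA sequence, which is identical to the CDS in this record.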