; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG02G008690 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG02G008690
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionDUF4228 domain protein
Genome locationCG_Chr02:11644817..11645257
RNA-Seq ExpressionClCG02G008690
SyntenyClCG02G008690
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570883.1 hypothetical protein SDJN03_29798, partial [Cucurbita argyrosperma subsp. sororia]9.2e-4573.2Show/hide
Query:  MGNNCFKSNKVMAQDEPDDLL---PPTEVKKVEEKSPAGSAMAKPKTAEARAGGAS-KKVVRFKLQEEEEKNSGGSGSD---AGVLRIKVVMSQKELKQM
        MGN+CFKSNKVMAQDE    L   PP E KKVEEK  AGSAMAKPKTAE R+G A+ KKVVRFKLQEE+E NSGGSG D   AGVLRIKVVMSQ+ELKQ+
Subjt:  MGNNCFKSNKVMAQDEPDDLL---PPTEVKKVEEKSPAGSAMAKPKTAEARAGGAS-KKVVRFKLQEEEEKNSGGSGSD---AGVLRIKVVMSQKELKQM

Query:  LTDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEGLH
        L + EN+S +LEELIAE KV+GRTT+SDA  D+VEDENGS +P LE IPEGLH
Subjt:  LTDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEGLH

KGN63254.1 hypothetical protein Csa_022493 [Cucumis sativus]1.1e-3262.91Show/hide
Query:  MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEE----KNSGGSGSDAGVLRIKVVMSQKELKQM
        MGN CFKSNKVMAQD+  D  PP    E KKV+++   GSAMAKPK      G A KKVVRF LQEEE+    +NSG SG   GVLRIKVV+SQKELKQ+
Subjt:  MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEE----KNSGGSGSDAGVLRIKVVMSQKELKQM

Query:  LTDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEG
        L  RENNSC+LEELI ELKV+GR T   A      DE GSWKP LE IPEG
Subjt:  LTDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEG

TYK24218.1 hypothetical protein E5676_scaffold27G00200 [Cucumis melo var. makuwa]8.9e-3261.59Show/hide
Query:  MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKS-PAGSAMAKPKTAEARAGGASKKVVRFKLQEEE---EKNSGGSGSDAGVLRIKVVMSQKELKQM
        MGN CF++NKVMAQD+  D LPP    E +KVEE+    GSAMAKPK      G A KKVVRF LQEEE   E  + G  S AGVLRIKVV+SQKELK++
Subjt:  MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKS-PAGSAMAKPKTAEARAGGASKKVVRFKLQEEE---EKNSGGSGSDAGVLRIKVVMSQKELKQM

Query:  LTDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEG
        L +RENNSC+LEELI ELKV+GR T        V DE GSWKP LE IPEG
Subjt:  LTDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEG

XP_022140639.1 uncharacterized protein LOC111011249 [Momordica charantia]2.5e-3463.51Show/hide
Query:  NCFKSNKVMAQDE-----PDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRE
        NC ++N+VMAQDE     P+  L  T   KVE+K  AGSA+A+PKT EAR     KKVVRF  Q+ E++ SGG G   GVLRIKVV+SQKELKQ+L DRE
Subjt:  NCFKSNKVMAQDE-----PDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRE

Query:  NNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEGLH
        +NS TLEEL+AELK++GR TISDAR D  EDENGSW+P LESIPE LH
Subjt:  NNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEGLH

XP_038902397.1 uncharacterized protein LOC120089037 [Benincasa hispida]2.5e-5886.81Show/hide
Query:  MGNNCFKSNKVMAQDEPDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENN
        MGNNCFKSNKVMAQDEP+DLLPP E KKVEEK   GSAMAKPKTAEAR GGASKKVVRFKLQEEEEKNSG    D GVLRIKVVMSQKELKQML DRENN
Subjt:  MGNNCFKSNKVMAQDEPDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENN

Query:  SCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEG
        SCTLEELI ELKV+GRTTISD RID VEDENG WKPDLE IPEG
Subjt:  SCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEG

TrEMBL top hitse value%identityAlignment
A0A0A0LQE9 Uncharacterized protein5.1e-3362.91Show/hide
Query:  MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEE----KNSGGSGSDAGVLRIKVVMSQKELKQM
        MGN CFKSNKVMAQD+  D  PP    E KKV+++   GSAMAKPK      G A KKVVRF LQEEE+    +NSG SG   GVLRIKVV+SQKELKQ+
Subjt:  MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEE----KNSGGSGSDAGVLRIKVVMSQKELKQM

Query:  LTDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEG
        L  RENNSC+LEELI ELKV+GR T   A      DE GSWKP LE IPEG
Subjt:  LTDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEG

A0A4V3WQ11 Uncharacterized protein3.2e-1137.58Show/hide
Query:  NCFKSNKVMAQDEPDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKL---QEEEEKNSGGSG------SDAGVLRIKVVMSQKELKQML
        NC  SNK++ QDE D+   P E + +E              +     G  KK VRFKL   +EEEE+   G+G      S  G +RI+VV++Q+EL ++L
Subjt:  NCFKSNKVMAQDEPDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKL---QEEEEKNSGGSG------SDAGVLRIKVVMSQKELKQML

Query:  TDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPE
          +   S ++E+++ E+K++ R  IS  R    E  NGSW+P LESIPE
Subjt:  TDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPE

A0A5D3DKZ8 Uncharacterized protein4.3e-3261.59Show/hide
Query:  MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKS-PAGSAMAKPKTAEARAGGASKKVVRFKLQEEE---EKNSGGSGSDAGVLRIKVVMSQKELKQM
        MGN CF++NKVMAQD+  D LPP    E +KVEE+    GSAMAKPK      G A KKVVRF LQEEE   E  + G  S AGVLRIKVV+SQKELK++
Subjt:  MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKS-PAGSAMAKPKTAEARAGGASKKVVRFKLQEEE---EKNSGGSGSDAGVLRIKVVMSQKELKQM

Query:  LTDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEG
        L +RENNSC+LEELI ELKV+GR T        V DE GSWKP LE IPEG
Subjt:  LTDRENNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEG

A0A6J1B1M6 uncharacterized protein LOC1104232931.9e-1139.86Show/hide
Query:  NCFKSNKVMAQ-DEPDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSG-SDAGVLRIKVVMSQKELKQMLTDREN-N
        NC  SNK++AQ D+P+      EV +   K  A         A+       KK+VRFKL EE + + G  G S  GV+RI++V++QKELKQ+L+ RE+  
Subjt:  NCFKSNKVMAQ-DEPDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSG-SDAGVLRIKVVMSQKELKQMLTDREN-N

Query:  SCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPE
          +LE LI  +K+RG       R +  +  +G W+P LESIPE
Subjt:  SCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPE

A0A6J1CG85 uncharacterized protein LOC1110112491.2e-3463.51Show/hide
Query:  NCFKSNKVMAQDE-----PDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRE
        NC ++N+VMAQDE     P+  L  T   KVE+K  AGSA+A+PKT EAR     KKVVRF  Q+ E++ SGG G   GVLRIKVV+SQKELKQ+L DRE
Subjt:  NCFKSNKVMAQDE-----PDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRE

Query:  NNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEGLH
        +NS TLEEL+AELK++GR TISDAR D  EDENGSW+P LESIPE LH
Subjt:  NNSCTLEELIAELKVRGRTTISDARIDQVEDENGSWKPDLESIPEGLH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G21680.1 unknown protein1.5e-0834.51Show/hide
Query:  NCFKSNKVMAQDEPDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCT
        NC + +  +A+ E DDL P   VK +EE           KT+             F+ +EE E++   +  ++ V+RIKVV+++KEL+Q+L   +N   +
Subjt:  NCFKSNKVMAQDEPDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCT

Query:  LEELIAELKVRGRTTISDARIDQVEDENG--SWKPDLESIPE
        +++L+  LK  GR  IS A  ++ E E G  +W+P LESIPE
Subjt:  LEELIAELKVRGRTTISDARIDQVEDENG--SWKPDLESIPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAATAATTGCTTCAAAAGCAACAAAGTGATGGCGCAAGACGAGCCTGATGATCTTTTGCCTCCTACCGAAGTTAAGAAAGTTGAGGAAAAATCGCCGGCTGGATC
GGCAATGGCGAAGCCGAAGACGGCAGAGGCGAGAGCCGGTGGTGCGAGTAAGAAGGTAGTGAGGTTTAAGCTACAAGAAGAAGAGGAGAAAAATTCCGGCGGAAGTGGCA
GCGATGCTGGAGTACTGAGGATTAAAGTGGTGATGTCTCAGAAAGAGTTGAAACAGATGTTGACGGATAGAGAGAACAATTCGTGTACATTGGAGGAATTGATTGCTGAA
TTGAAGGTGAGAGGCAGAACGACGATTTCAGATGCGAGAATCGATCAAGTTGAAGATGAAAATGGAAGCTGGAAGCCGGATCTGGAATCTATTCCTGAAGGTCTCCATTA
A
mRNA sequenceShow/hide mRNA sequence
ATGGGGAATAATTGCTTCAAAAGCAACAAAGTGATGGCGCAAGACGAGCCTGATGATCTTTTGCCTCCTACCGAAGTTAAGAAAGTTGAGGAAAAATCGCCGGCTGGATC
GGCAATGGCGAAGCCGAAGACGGCAGAGGCGAGAGCCGGTGGTGCGAGTAAGAAGGTAGTGAGGTTTAAGCTACAAGAAGAAGAGGAGAAAAATTCCGGCGGAAGTGGCA
GCGATGCTGGAGTACTGAGGATTAAAGTGGTGATGTCTCAGAAAGAGTTGAAACAGATGTTGACGGATAGAGAGAACAATTCGTGTACATTGGAGGAATTGATTGCTGAA
TTGAAGGTGAGAGGCAGAACGACGATTTCAGATGCGAGAATCGATCAAGTTGAAGATGAAAATGGAAGCTGGAAGCCGGATCTGGAATCTATTCCTGAAGGTCTCCATTA
A
Protein sequenceShow/hide protein sequence
MGNNCFKSNKVMAQDEPDDLLPPTEVKKVEEKSPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAE
LKVRGRTTISDARIDQVEDENGSWKPDLESIPEGLH