; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G008590 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G008590
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionDUF3143 domain-containing protein
Genome locationCG_Chr05:9276196..9278610
RNA-Seq ExpressionClCG05G008590
SyntenyClCG05G008590
Gene Ontology termsNA
InterPro domainsIPR021489 - Protein of unknown function DUF3143


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576682.1 hypothetical protein SDJN03_24256, partial [Cucurbita argyrosperma subsp. sororia]1.5e-7690.57Show/hide
Query:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY
        MSI FA QSLC NVFP P SNHPEISS WRP STKLPQL NPISQS RKS +IVSSKSSEAEELS PEDEWLNKLP+KKKPLYSHSLPC+EAWLKNLGFY
Subjt:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY

Query:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
        QS EDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
Subjt:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA

XP_008439322.1 PREDICTED: uncharacterized protein LOC103484146 [Cucumis melo]3.4e-7689.94Show/hide
Query:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY
        MSI FA  SLCNNVFP PTSNH EISS WRP STKLPQL+NPISQ  RK+F IVSSKSSEAEELS+PEDEWLNKLP+KKKPLYSHSLPC+EAWLKNLGFY
Subjt:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY

Query:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
        QSKEDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
Subjt:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA

XP_022922878.1 uncharacterized protein LOC111430721 [Cucurbita moschata]3.1e-7791.19Show/hide
Query:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY
        MSI FA QSLC NVFP P SNHPEISS WRP STKLPQL NPISQSFRKS +IVSSKSSEAEELS PEDEWLNKLP+KKKPLYSHSLPC+EAWLKNLGFY
Subjt:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY

Query:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
        QS EDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
Subjt:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA

XP_022984153.1 uncharacterized protein LOC111482568 [Cucurbita maxima]2.6e-7690.57Show/hide
Query:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY
        MSI FA QSLC NVFP P SNHPEISS WRP STKLPQL NPISQSFRKS +IVSSKSSEAEELS  EDEWLNKLP+KKKPLYSHSLPC+EAWLKNLGFY
Subjt:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY

Query:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
        QS EDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
Subjt:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA

XP_038880287.1 uncharacterized protein LOC120071904 [Benincasa hispida]4.3e-7993.08Show/hide
Query:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY
        MSI FA QSLCNNVFP PTSNH EISS WRP STKLPQL+NPISQSFRK+F IVSSKSSEAEELSTPEDEWLNKLP+KKKPLYSHSLPC+EAWLKNLGFY
Subjt:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY

Query:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
        QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
Subjt:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA

TrEMBL top hitse value%identityAlignment
A0A1S3AY50 uncharacterized protein LOC1034841461.6e-7689.94Show/hide
Query:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY
        MSI FA  SLCNNVFP PTSNH EISS WRP STKLPQL+NPISQ  RK+F IVSSKSSEAEELS+PEDEWLNKLP+KKKPLYSHSLPC+EAWLKNLGFY
Subjt:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY

Query:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
        QSKEDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
Subjt:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA

A0A5D3DHV1 Uncharacterized protein1.6e-7689.94Show/hide
Query:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY
        MSI FA  SLCNNVFP PTSNH EISS WRP STKLPQL+NPISQ  RK+F IVSSKSSEAEELS+PEDEWLNKLP+KKKPLYSHSLPC+EAWLKNLGFY
Subjt:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY

Query:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
        QSKEDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
Subjt:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA

A0A6J1CHI2 uncharacterized protein LOC1110116931.1e-6983.65Show/hide
Query:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY
        MSI FA  SLCNNVFP   SN+PEIS  WR  S KL Q  NPIS S RKSF +VSS+SSEAEELS PEDEWL+KLP+KKKPLYSHSLPC+EAWLK+LGFY
Subjt:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY

Query:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
        QSKEDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
Subjt:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA

A0A6J1E806 uncharacterized protein LOC1114307211.5e-7791.19Show/hide
Query:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY
        MSI FA QSLC NVFP P SNHPEISS WRP STKLPQL NPISQSFRKS +IVSSKSSEAEELS PEDEWLNKLP+KKKPLYSHSLPC+EAWLKNLGFY
Subjt:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY

Query:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
        QS EDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
Subjt:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA

A0A6J1J9Q7 uncharacterized protein LOC1114825681.3e-7690.57Show/hide
Query:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY
        MSI FA QSLC NVFP P SNHPEISS WRP STKLPQL NPISQSFRKS +IVSSKSSEAEELS  EDEWLNKLP+KKKPLYSHSLPC+EAWLKNLGFY
Subjt:  MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFY

Query:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
        QS EDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA
Subjt:  QSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G52960.1 unknown protein2.6e-5080.36Show/hide
Query:  RKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFYQSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRF
        RK   +VSSKSS+AEE+S  EDEWL KLP+K KPLYSHSLPC+EAWL+ LGFYQSK+DRAVWLI+KP+WHAQLSLDVTDL IRY+KSGPGNLE+D+ERRF
Subjt:  RKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFYQSKEDRAVWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRF

Query:  SYALSREDIENA
        SYALSRED ENA
Subjt:  SYALSREDIENA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCATCAATTTTGCTCTTCAGTCTCTCTGTAACAATGTTTTCCCATTTCCAACTTCAAATCATCCCGAAATTTCATCGAATTGGCGTCCCCATTCCACTAAACTCCC
ACAATTAACCAACCCCATCTCTCAATCCTTCAGAAAAAGCTTCGCCATTGTTTCTTCCAAGTCGTCCGAAGCAGAAGAGCTCTCTACTCCAGAGGACGAGTGGCTGAACA
AGCTTCCAGACAAGAAGAAGCCCTTGTACTCACACAGTTTGCCTTGCCTCGAGGCTTGGTTAAAGAACTTGGGATTTTACCAGAGTAAAGAGGACCGAGCTGTGTGGCTA
ATCGAGAAGCCTGAATGGCACGCTCAGCTCTCACTCGATGTCACCGACCTCTATATAAGATATCTAAAGAGTGGACCAGGAAATCTTGAGAAAGATGTGGAGAGGAGATT
TAGCTATGCACTAAGCAGAGAAGATATTGAGAATGCTACAGAGACATTGCTGTACAGATTTGTTCAACAGGGGAATAAAGGAGATGGAATATATACTTCTCACCAAAAAT
TTTACTGGTACAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCATCAATTTTGCTCTTCAGTCTCTCTGTAACAATGTTTTCCCATTTCCAACTTCAAATCATCCCGAAATTTCATCGAATTGGCGTCCCCATTCCACTAAACTCCC
ACAATTAACCAACCCCATCTCTCAATCCTTCAGAAAAAGCTTCGCCATTGTTTCTTCCAAGTCGTCCGAAGCAGAAGAGCTCTCTACTCCAGAGGACGAGTGGCTGAACA
AGCTTCCAGACAAGAAGAAGCCCTTGTACTCACACAGTTTGCCTTGCCTCGAGGCTTGGTTAAAGAACTTGGGATTTTACCAGAGTAAAGAGGACCGAGCTGTGTGGCTA
ATCGAGAAGCCTGAATGGCACGCTCAGCTCTCACTCGATGTCACCGACCTCTATATAAGATATCTAAAGAGTGGACCAGGAAATCTTGAGAAAGATGTGGAGAGGAGATT
TAGCTATGCACTAAGCAGAGAAGATATTGAGAATGCTACAGAGACATTGCTGTACAGATTTGTTCAACAGGGGAATAAAGGAGATGGAATATATACTTCTCACCAAAAAT
TTTACTGGTACAAATGA
Protein sequenceShow/hide protein sequence
MSINFALQSLCNNVFPFPTSNHPEISSNWRPHSTKLPQLTNPISQSFRKSFAIVSSKSSEAEELSTPEDEWLNKLPDKKKPLYSHSLPCLEAWLKNLGFYQSKEDRAVWL
IEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENATETLLYRFVQQGNKGDGIYTSHQKFYWYK