CuGenDBv2

ClCG06G014617 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG06G014617
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Protein of unknown function (DUF789)
Genome location: CG_Chr06:28077547..28079053
RNA-Seq Expression: ClCG06G014617
Synteny: ClCG06G014617
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789


Homology
GenBank top hits | e value | % identity
XP_008453426.1 PREDICTED: uncharacterized protein LOC103494138 isoform X1 [Cucumis melo] | 7.1e-34 | 85.19
Query:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
        N H LPPVM+YPKDIDDI K+SLPVFG+ASYKLKGSIWGQNG+N+HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTY R
Subjt:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR

XP_022134722.1 uncharacterized protein LOC111006925 [Momordica charantia] | 9.6e-31 | 76.67
Query:  NSFAEPL-SNGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
        +S + P+  NGHG  PVMIYP D+D + KVSLPVFGLASYKLKGSIW QNGV EHQ ANSLMQAAD WLR LQV QPDFQFFASHGTY R
Subjt:  NSFAEPL-SNGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR

XP_022921943.1 uncharacterized protein LOC111430050 [Cucurbita moschata] | 4.5e-28 | 74.44
Query:  NSFAEPL-SNGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
        +S + P+  NGHG  P MIYP D D I KVSLPVFGLASYKLKGSIW QN V EHQ ANSLMQAA+KWLR LQV QPDFQFFASH TY R
Subjt:  NSFAEPL-SNGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR

XP_031736215.1 uncharacterized protein LOC101215266 [Cucumis sativus] | 7.8e-33 | 81.48
Query:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
        N H LPP+M+YPKDIDDI K+SLPVFG+ASYK+KGSIWGQNG+++HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTY R
Subjt:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR

XP_038897708.1 uncharacterized protein LOC120085653 [Benincasa hispida] | 2.1e-38 | 96.3
Query:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
        NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNG+NEHQTANSLMQAADKWLRSLQV+QPDFQFFASHGTY R
Subjt:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR

TrEMBL top hits | e value | % identity
A0A0A0LS49 Uncharacterized protein | 3.8e-33 | 81.48
Query:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
        N H LPP+M+YPKDIDDI K+SLPVFG+ASYK+KGSIWGQNG+++HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTY R
Subjt:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR

A0A1S3BX10 uncharacterized protein LOC103494138 isoform X1 | 3.4e-34 | 85.19
Query:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
        N H LPPVM+YPKDIDDI K+SLPVFG+ASYKLKGSIWGQNG+N+HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTY R
Subjt:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR

A0A5A7USF1 Uncharacterized protein | 3.4e-34 | 85.19
Query:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
        N H LPPVM+YPKDIDDI K+SLPVFG+ASYKLKGSIWGQNG+N+HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTY R
Subjt:  NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR

A0A6J1C0E1 uncharacterized protein LOC111006925 | 4.6e-31 | 76.67
Query:  NSFAEPL-SNGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
        +S + P+  NGHG  PVMIYP D+D + KVSLPVFGLASYKLKGSIW QNGV EHQ ANSLMQAAD WLR LQV QPDFQFFASHGTY R
Subjt:  NSFAEPL-SNGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR

A0A6J1E577 uncharacterized protein LOC111430050 | 2.2e-28 | 74.44
Query:  NSFAEPL-SNGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
        +S + P+  NGHG  P MIYP D D I KVSLPVFGLASYKLKGSIW QN V EHQ ANSLMQAA+KWLR LQV QPDFQFFASH TY R
Subjt:  NSFAEPL-SNGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G15030.1 Protein of unknown function (DUF789) | 6.0e-15 | 62.07
Query:  DDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFF
        + + K+ LPVFGLASYKL+GS+W   G + HQ ANSL QAAD WLR  QV  PDF FF
Subjt:  DDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFF

AT1G73210.1 Protein of unknown function (DUF789) | 7.9e-07 | 38.89
Query:  KVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFF
        ++ LP FG+ +YK++G +WG+ G ++ +    L  AAD WL+ L V   D+ FF
Subjt:  KVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFF

AT2G01260.1 Protein of unknown function (DUF789) | 4.6e-15 | 66.67
Query:  KVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFF
        K+SLPVFGLASYK +GS+W   G +EHQ  NSL QAADKWL S  V  PDF FF
Subjt:  KVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFF

AT4G16100.1 Protein of unknown function (DUF789) | 4.0e-11 | 50.85
Query:  AKVSLPVFGLASYKLKGSIWG-QNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASH
        AK+ LP FGLASYK K S W  ++ V+E+Q   +L++ A++WLR L+V+ PDF+ F SH
Subjt:  AKVSLPVFGLASYKLKGSIWG-QNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASH

AT5G49220.1 Protein of unknown function (DUF789) | 1.2e-15 | 54.41
Query:  DIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
        D     K+ LP FGLASYKLK S+W QN + E Q   SL+QAADKWL+ LQV  PD++FF S+    R
Subjt:  DIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYRR
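
The % identity values listed for each hit can be cross-checked against the middle line of the corresponding alignment. The sketch below assumes the usual BLAST-style convention, which this page does not state explicitly: a letter in the middle line marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The helper name `percent_identity` is hypothetical.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style alignment middle line.

    Assumption: letters mark identical positions; '+' and blanks are
    counted as non-identical. aligned_length is the number of alignment
    columns (the length of the Query line, gap characters included).
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Middle line of the Benincasa hispida hit (XP_038897708.1), 81 columns.
# The page lists 96.3 for this hit.
midline = ("NGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNG+NEHQTANSLMQAADKW"
           "LRSLQV+QPDFQFFASHGTY R")
print(percent_identity(midline, aligned_length=81))
```

The aligned length is passed in separately because trailing blanks in a middle line are easily lost when text is copied, so the Query line is the safer source for the column count.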


Sequences
CDS sequence
ATGCCTCACCTCCTTGGGCCAAGACATACAATAGAAAGGCGTTCTTTGCCAATGTTGTCTAACTCCTTTGCAGAACCCCTTAGTAATGGACATGGCCTGCCACCA
GTAATGATATATCCAAAGGACATTGATGATATCGCAAAGGTCTCCTTGCCTGTTTTTGGATTGGCTTCTTATAAACTGAAGGGATCGATATGGGGGCAAAATGGC
GTCAATGAGCATCAAACAGCAAATTCTCTCATGCAGGCAGCAGATAAATGGCTGAGGAGCCTTCAGGTCGTTCAACCTGATTTTCAGTTCTTTGCATCACATGGG
ACATACCGGAGATGA
mRNA sequence
ATGCCTCACCTCCTTGGGCCAAGACATACAATAGAAAGGCGTTCTTTGCCAATGTTGTCTAACTCCTTTGCAGAACCCCTTAGTAATGGACATGGCCTGCCACCA
GTAATGATATATCCAAAGGACATTGATGATATCGCAAAGGTCTCCTTGCCTGTTTTTGGATTGGCTTCTTATAAACTGAAGGGATCGATATGGGGGCAAAATGGC
GTCAATGAGCATCAAACAGCAAATTCTCTCATGCAGGCAGCAGATAAATGGCTGAGGAGCCTTCAGGTCGTTCAACCTGATTTTCAGTTCTTTGCATCACATGGG
ACATACCGGAGATGA
Protein sequence
MPHLLGPRHTIERRSLPMLSNSFAEPLSNGHGLPPVMIYPKDIDDIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHG
TYRR
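
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below is a minimal illustration that assumes the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB.

```python
# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 42 and last 15 nucleotides of the CDS shown above.
print(translate("ATGCCTCACCTCCTTGGGCCAAGACATACAATAGAAAGGCGT"))
print(translate("ACATACCGGAGATGA"))
```

Passing the full CDS, with or without its line breaks, should give a sequence that can be compared directly with the protein sequence listed above.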