; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G004300 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G004300
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPlant protein 1589 of unknown function
Genome locationCG_Chr04:16531750..16535906
RNA-Seq ExpressionClCG04G004300
SyntenyClCG04G004300
Gene Ontology termsNA
InterPro domainsIPR006476 - Conserved hypothetical protein CHP01589, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595704.1 hypothetical protein SDJN03_12257, partial [Cucurbita argyrosperma subsp. sororia]3.5e-1887.27Show/hide
Query:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        MSTGT RR+SRQDIQLVRSLIERCLQLDMNRKEVVE LLNHEKIDP FTEH W++
Subjt:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

KAG7027666.1 hypothetical protein SDJN02_11681 [Cucurbita argyrosperma subsp. argyrosperma]3.5e-1887.27Show/hide
Query:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        MSTGT RR+SRQDIQLVRSLIERCLQLDMNRKEVVE LLNHEKIDP FTEH W++
Subjt:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

XP_022967824.1 uncharacterized protein LOC111467226 isoform X1 [Cucurbita maxima]1.3e-2076Show/hide
Query:  NKLCQLIVIQFTCFGL--GNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        NK CQL   QF  + L  GNMST TV R+SRQDIQ VRSLIERCLQLDMNRKEVVE LLNHEKIDPGFTEH W++
Subjt:  NKLCQLIVIQFTCFGL--GNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

XP_022967830.1 uncharacterized protein LOC111467226 isoform X2 [Cucurbita maxima]1.3e-2076Show/hide
Query:  NKLCQLIVIQFTCFGL--GNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        NK CQL   QF  + L  GNMST TV R+SRQDIQ VRSLIERCLQLDMNRKEVVE LLNHEKIDPGFTEH W++
Subjt:  NKLCQLIVIQFTCFGL--GNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

XP_038882081.1 uncharacterized protein LOC120073356 [Benincasa hispida]1.2e-1890.91Show/hide
Query:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        MSTGTVRRISRQDIQLVRSLIERCLQLDM+RKEVVEALLNHEKIDP FTEH W++
Subjt:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

TrEMBL top hitse value%identityAlignment
A0A1S3C1F9 uncharacterized protein LOC103495809 isoform X11.1e-1787.27Show/hide
Query:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        MSTGTVRRI RQDIQLVRSLIERCLQLDM+RKEVVE LLN EKIDPGFTEH W++
Subjt:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

A0A6J1EH54 uncharacterized protein LOC111432488 isoform X15.0e-1885.45Show/hide
Query:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        MS+GT RR+SRQDIQLVRSLIERCLQLDMNRKEVVE LLNHEKIDP FTEH W++
Subjt:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

A0A6J1HRZ6 uncharacterized protein LOC111466056 isoform X11.7e-1887.27Show/hide
Query:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        MSTGT RR+SRQDIQLVRSLIERCLQLDMNRKEVVE LLNHEKIDP FTEH W++
Subjt:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

A0A6J1HW89 uncharacterized protein LOC111467226 isoform X26.3e-2176Show/hide
Query:  NKLCQLIVIQFTCFGL--GNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        NK CQL   QF  + L  GNMST TV R+SRQDIQ VRSLIERCLQLDMNRKEVVE LLNHEKIDPGFTEH W++
Subjt:  NKLCQLIVIQFTCFGL--GNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

A0A6J1HXV0 uncharacterized protein LOC111467226 isoform X16.3e-2176Show/hide
Query:  NKLCQLIVIQFTCFGL--GNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        NK CQL   QF  + L  GNMST TV R+SRQDIQ VRSLIERCLQLDMNRKEVVE LLNHEKIDPGFTEH W++
Subjt:  NKLCQLIVIQFTCFGL--GNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G10250.1 Plant protein 1589 of unknown function3.5e-1670.91Show/hide
Query:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        MS+GTVRR+SRQDIQLV++LIERCLQL MN+KEVV+ LL   KI+PGFTE  W++
Subjt:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

AT3G10250.2 Plant protein 1589 of unknown function3.5e-1670.91Show/hide
Query:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        MS+GTVRR+SRQDIQLV++LIERCLQL MN+KEVV+ LL   KI+PGFTE  W++
Subjt:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

AT3G61700.1 Plant protein 1589 of unknown function4.1e-1255.93Show/hide
Query:  GLGNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        G  + S+   R++SRQDI+LV++LIERCLQL MNR EVV+ LL   +IDPGFT   W++
Subjt:  GLGNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

AT3G61700.2 Plant protein 1589 of unknown function4.1e-1255.93Show/hide
Query:  GLGNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        G  + S+   R++SRQDI+LV++LIERCLQL MNR EVV+ LL   +IDPGFT   W++
Subjt:  GLGNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR

AT5G04090.2 Plant protein 1589 of unknown function1.1e-1467.27Show/hide
Query:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR
        MS+ TVRR+SR+DIQLV++LIERCLQL MN+KEVV+ LL   KI+PGFTE  W++
Subjt:  MSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATGCTGGGGTGGAATAATCGAGGCGAATAAGCTCTGCCAGCTGATTGTCATTCAATTCACATGCTTTGGACTTGGAAATATGTCAACTGGAACTGTGAGAAGGAT
ATCACGTCAAGACATACAACTGGTGCGAAGTCTTATAGAGCGATGCCTTCAGCTTGATATGAACCGAAAAGAAGTTGTGGAAGCTCTTTTGAATCATGAAAAAATTGACC
CTGGTTTCACAGAACATGCATGGAAGAGGAAAAACATGTCCTATCGGAAGAGGAAAAACATGATTGTGGAGCCCCTATCTAGGCAAGGTCATAGCTTGGTTGATGACAAC
AATGTTGAGGCCGAATTTCTATACTTCTTTAAAGAGCTCTACACAAAACCAGGTAACCAGACTCTTCCTCTTATTGATAATTGGGACCCGATTAATGTTGATTCTGCCAT
TGCATTGGAAGCACCTTTCACTGAGGAGGAAATCTAG
mRNA sequenceShow/hide mRNA sequence
TGTTTTTTAAAATTGCATATCTATTTGGGATAGGGATTTTTGTTTGTTTAATAAAAAGGCTGCTTTCATTTTCAACTTAGGATACTGTTTAAGATTTGAGTAGGATGCAA
ACTTTGGTACAGTTTATTTCACTTCACTTCTTTGACCTAAGTTTTTTAACCCACGGATCAGTCTTTTTTTTGTACTTGAGGATGTTTTTTTACAACTAAATCAATAGCTA
TTATTGTGATTTAAAGGGGTTTTCTGAATTCACATGGTATCTCCATTTAAAGTATAGATGGTATGCTGGGGTGGAATAATCGAGGCGAATAAGCTCTGCCAGCTGATTGT
CATTCAATTCACATGCTTTGGACTTGGAAATATGTCAACTGGAACTGTGAGAAGGATATCACGTCAAGACATACAACTGGTGCGAAGTCTTATAGAGCGATGCCTTCAGC
TTGATATGAACCGAAAAGAAGTTGTGGAAGCTCTTTTGAATCATGAAAAAATTGACCCTGGTTTCACAGAACATGCATGGAAGAGGAAAAACATGTCCTATCGGAAGAGG
AAAAACATGATTGTGGAGCCCCTATCTAGGCAAGGTCATAGCTTGGTTGATGACAACAATGTTGAGGCCGAATTTCTATACTTCTTTAAAGAGCTCTACACAAAACCAGG
TAACCAGACTCTTCCTCTTATTGATAATTGGGACCCGATTAATGTTGATTCTGCCATTGCATTGGAAGCACCTTTCACTGAGGAGGAAATCTAG
Protein sequenceShow/hide protein sequence
MVCWGGIIEANKLCQLIVIQFTCFGLGNMSTGTVRRISRQDIQLVRSLIERCLQLDMNRKEVVEALLNHEKIDPGFTEHAWKRKNMSYRKRKNMIVEPLSRQGHSLVDDN
NVEAEFLYFFKELYTKPGNQTLPLIDNWDPINVDSAIALEAPFTEEEI