; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G011880 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G011880
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionconserved peptide upstream open reading frame 9
Genome locationCG_Chr05:14785565..14791355
RNA-Seq ExpressionClCG05G011880
SyntenyClCG05G011880
Gene Ontology termsNA
InterPro domainsIPR012511 - S-adenosyl-l-methionine decarboxylase leader peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650573.1 hypothetical protein Csa_010068 [Cucumis sativus]1.9e-22100Show/hide
Query:  ANDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
        ANDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
Subjt:  ANDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS

KAG6581173.1 hypothetical protein SDJN03_21175, partial [Cucurbita argyrosperma subsp. sororia]1.0e-2360.16Show/hide
Query:  GFLGFRSPPLTHLIGFNWRKGTKAKGFSVLTGSPIFTFRIYWVSSLFKIISFISVEASLISFWGATSICAANDLMESKGGKKKSSSSSSSKSLFYEAPLG
        GF  FRS PL HLI F+ R+  + KGF    G  +F   +++V        FI                 ANDLMESKGGKKK SSSSSSKSLFYEAPLG
Subjt:  GFLGFRSPPLTHLIGFNWRKGTKAKGFSVLTGSPIFTFRIYWVSSLFKIISFISVEASLISFWGATSICAANDLMESKGGKKKSSSSSSSKSLFYEAPLG

Query:  YSIEDVRPHGGIKKFRSAAYSNCVRKPS
        YSIEDVRPHGGIKKFRSAAYSNCVRKPS
Subjt:  YSIEDVRPHGGIKKFRSAAYSNCVRKPS

KAG6607040.1 Uridine kinase-like protein 2, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.4e-2098.28Show/hide
Query:  NDLMESKGGKKK-SSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
        NDLMESKGGKKK SSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
Subjt:  NDLMESKGGKKK-SSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS

KAG7017910.1 hypothetical protein SDJN02_19776, partial [Cucurbita argyrosperma subsp. argyrosperma]5.3e-2052.21Show/hide
Query:  GFNWRKGTKAKG-FSVLTGSPIFTFRIYWVSSLFKIISFISVEASLISFWGATS---------------------ICAANDLMESKGGKKKSSSSSSSKS
        GF+ +  TK++  F +L    +F+   + V  L  +   + +   L+SF+ A S                     +  ANDLMESKGGKKK SSSSSSKS
Subjt:  GFNWRKGTKAKG-FSVLTGSPIFTFRIYWVSSLFKIISFISVEASLISFWGATS---------------------ICAANDLMESKGGKKKSSSSSSSKS

Query:  LFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
        LFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
Subjt:  LFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS

QHO12318.1 uncharacterized protein DS421_15g505930 [Arachis hypogaea]4.0e-2094.74Show/hide
Query:  NDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
        NDLMESKGGKK SSSSSSSKSLFYEAPLGYSIEDVRP+GGIKKFRSAAYSNC RKPS
Subjt:  NDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS

TrEMBL top hitse value%identityAlignment
A0A0A0LAP2 Uncharacterized protein9.4e-23100Show/hide
Query:  ANDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
        ANDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
Subjt:  ANDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS

A0A444FD06 Uncharacterized protein1.3e-1960.19Show/hide
Query:  PIF-TFRIYWVSSLFKIISFISVE----------ASLISFWGATSICA--ANDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAY
        P+F  F +Y V  ++ + S I +            SLI   G  S+ A   N+LMESKGGKKKSSSSSSS SL YEAPLGYSIEDVRPHGGIKKF++AAY
Subjt:  PIF-TFRIYWVSSLFKIISFISVE----------ASLISFWGATSICA--ANDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAY

Query:  SNCVRKPS
        SNCVRKPS
Subjt:  SNCVRKPS

A0A5J5AL23 Uncharacterized protein5.7e-2063.27Show/hide
Query:  FTFRIY-----WVSSLFKIISFISVEASLISFWGATSICAANDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
        F FR +     W+   FK +  IS    +I  + +  +   NDLMESKGGKKK   SSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNC RKPS
Subjt:  FTFRIY-----WVSSLFKIISFISVEASLISFWGATSICAANDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS

A0A6A5NYN2 Uncharacterized protein1.7e-1991.23Show/hide
Query:  NDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
        NDLMESKGGKKKSSSSSS  SLFYEAPLGYSIEDVRP+GGI+KFRSAAYSNC RKPS
Subjt:  NDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS

A0A6A6MF36 Uncharacterized protein1.3e-1994.74Show/hide
Query:  NDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
        NDLMESKGGKKK   SSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
Subjt:  NDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G02468.1 conserved peptide upstream open reading frame 94.0e-1880Show/hide
Query:  LMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
        +MESKGGKKKSSSSS   SLFYEAPLGYSIEDVRP+GGIKKF+S+ YSNC ++PS
Subjt:  LMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS

AT3G25572.1 conserved peptide upstream open reading frame 111.5e-1781.48Show/hide
Query:  MESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
        ME+KGGKKKSS+SSS  SLF+EAPL YSIEDVRP+GGIKKFRSAAYSN   KPS
Subjt:  MESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS

AT5G15948.1 conserved peptide upstream open reading frame 102.4e-1574.07Show/hide
Query:  MESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS
        MESK G KKSSS+S   SL YEAPLGYSIEDVRP GGIKKF+S+ YSNC ++PS
Subjt:  MESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPHGGIKKFRSAAYSNCVRKPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTTTCTCGGGTTTCGTTCTCCGCCGCTAACGCATTTAATCGGATTTAATTGGAGAAAGGGTACTAAGGCAAAGGGTTTCTCGGTGTTAACTGGTTCACCCATTTT
TACATTTCGTATTTATTGGGTTTCCAGTCTTTTTAAAATTATTTCCTTCATTTCGGTTGAGGCTTCTCTGATTTCATTTTGGGGTGCTACCTCTATCTGTGCAGCGAATG
ATTTAATGGAGTCAAAAGGTGGTAAGAAGAAGTCTAGTAGTAGTAGTAGTAGTAAATCCCTTTTCTACGAAGCTCCCCTCGGATACAGCATTGAAGACGTGCGACCACAC
GGTGGAATCAAGAAGTTCAGATCTGCTGCTTACTCCAACTGCGTTCGAAAGCCATCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTTTCTCGGGTTTCGTTCTCCGCCGCTAACGCATTTAATCGGATTTAATTGGAGAAAGGGTACTAAGGCAAAGGGTTTCTCGGTGTTAACTGGTTCACCCATTTT
TACATTTCGTATTTATTGGGTTTCCAGTCTTTTTAAAATTATTTCCTTCATTTCGGTTGAGGCTTCTCTGATTTCATTTTGGGGTGCTACCTCTATCTGTGCAGCGAATG
ATTTAATGGAGTCAAAAGGTGGTAAGAAGAAGTCTAGTAGTAGTAGTAGTAGTAAATCCCTTTTCTACGAAGCTCCCCTCGGATACAGCATTGAAGACGTGCGACCACAC
GGTGGAATCAAGAAGTTCAGATCTGCTGCTTACTCCAACTGCGTTCGAAAGCCATCCTGAGTTCTGCTGAATTCCGTTTACCCTGCGCCCCGAGATCTTTAGTTTCTTTA
TAATTTTTAGTCTGTCTTTTTTCTTCGGTACTCTCTACTTTCCTCGTTCTCTCGTTCAGTCTCTCGACATTATTAGCTTCCTTTTAAGAAAAGATGACGTTTCCAACCTC
TGCTATCGGATTTGAAGGCTATGAAAAGAGGCTCGAAGTATCATTCTTCGAGCCAAGCGTTTTTGCTGACCCTAGGGGCATGGGTCTCCGTGCGCTGTCAAAAGCACAAC
TGGATGAAATTCTGACATTAGCTGAGTGCACCATTGTCGACTCTTTGTCAAATGACTATCTTGATTCGTATGTCCTTTCAGAGTCGAGCCTCTTTGTGTACCCATACAAG
TTCATCATCAAAACCTGCGGCACTACTAAGCTGCTTCTCTCTATCCCAGCTCTGTTGAAGTTGGCTGATTCTCTGTCTCTTACTGTGAAGTGTGTGAGGTACACTCGTGG
CAGCTTTATCTTTCCTGGTGCCCAGTCTTTTCCCCATCGCAGCTTCTCTGAGGAAGTTGCCGTCCTTGATGGCTATTTGGCCAAGCTTGGCCTCAAAGGCTCTGCTATTG
TGATGGGAAGTCCTGATGAAACAAGGAAATGGCACGTTTATTCTGCCTGTGCCAACATGGGGAGTCAAAGTAACAACCCTGTTTATACCCTGGAAATGTGCATGACTGGC
TTAGACAAGGAGAAGGCATCTGTCTTCTTCAAAACAGATGCAAGTTCTGCTGCTGCGATGACTGAAAATTCTGGTATTAGGAAAATCCTTCCAAAATCTGAAATATGTGA
TTTCGAGTTCGACCCTTGTGGTTATTCCATGAATGCCATTGAAGGAGATGCAGAGTCTACAATCCATGTTACTCCAGAAGATGGGTTTAGTTATGCAAGCTTTGAAGCAG
CTGGTTATGACTTCGACGACATAAATCTGTCTAAGCTGATTGTGAGGGTGCTGGCATGCTTCCAGCCTTCTGATTTTTCTGTTGCCCTCCATTCAGATCTCGTTGGTGAG
AATCTGGAAGATTTACTCTGCCTGGAATTGAAGGGTTACGAGGGTGGTGAGAAGAGCTGTGAAATGCTGGGGGAAAATGGAACCGTTATCTACCAGAGCTTTGTGAAGAC
CGGAGGAGATTATGCCTCATCTCCAAGGTCAACCCTGTTGAAATGTTGTTGGAGCGAGGACGAGAAGGACGAGGAAGTTGGGAAGTATTAGAACTTTTATCAACTTCTGC
TTGCTTTTTTTATCTTTACTAGTAATGAATAATAAACAAAGAAGTATCTTGGGGGTCATATTCTGAAAAAAAAAAAAAATTAAGCTGCCTTTGGTGTGTTTATGCAGCAT
ATTAGTTCTTTCTTAAGTGTCTTGTGTTTTCTTTTCGTCATTGTTGGAGTCAATTTTCATTCTGCGAATAATTCCCCGAGGGGGGAATCCTGTTGGATTCTGGAAGTCTG
TCTTCTGTTATGGTTTTGCATCATATTATTTTATGATGAAACGTATCGATCGTATTTTGTTATTTTAATCAAGACTTTGGTTTTTTGGAATGATGTACTTTTGATCTTGC
TTTATTTATGTAATCATGGCATCTGGATGATGTGCAGATTTCAGCTTTCTTTTTTCCTTCTGCTTTCTATATATAATAATTATGAATAGTACTGTGGAAGGGTGGAGAGT
ACAGAATGACAGGCCAACTATTGTGAAAAGAGGTTAATAATGAGGGGAAAAGGGAGGGAAGGAGATGTTGGAGATCTTTGGAGAAGAAGTGCTATTTGGCCGTCGAGTCT
GCCAAAAAGTTATAATTTCCAACATTTTGAAGAATGCTTGGTTTTCTAAATTAGTGAGAACTTGAGAAGTACATCAACTCTTGATTGTGGTACAAGTTTTAGCCCTGCGA
TATGGAAGCAGCGGTGCAAATGGCACTTGGAATTATTGAAGCTGCAGGTGGGGAGTTATATTGCAAATATATACTGTAAAACAATGGTGGAATGGAAAGGTGAAATCATC
ATTGCAGAAAGGGGACAAAATGTTGTGGAGTTGACTCATTCAGATGAATGGAAATACTTTTTCCTAGCCAAAAAGTGAATTCCTATCCAAAATGTGAATTTTTAGAGAGT
GCTTTTGACTTTCAGTACTCCTAATATACCATTGCTTAAATTACACATTTTGGAAGAATTAGCAAATAGTTAGGACCTTAGAAACAAAAGTGGAGTCCTTCTGCTGGCAA
GGTAGCGCATTTCAAAGGAGGACTTGAGCCGACTTTGCTTCTTTTCTAGCTCGGTTTCTTGGATTACCTTGAGGGAGGATGACACCTCGACCTCAATAGTAAAACATCTT
CCAAACTATATACAGTGAAAATGGTGTTGTCCTAATTGATGTGCAAACTGATGTTTGATGACCATGTAATACAAAAGGTAACATCTCATGCCAATTGTTATGATTGACAG
TTATATTCTTGATAATTTTCTTGATATTTTAGTTTGTCGTCCCACTACTCCATTCATTTTAGGGCGATCAGAATAGAGTTGTAGTGTCTAATTTTGGATTGAGTACATAA
TTATTATTCAAGTTCAGTGATGATACGTTTCGGCACATATAATCCTTATGAACTTGACAATACCTTGTTTGGGTGACACTCTTGTATGAGGCAGTTTCTTACCCATTTTG
TAAGATAGTTAATGGCTACTAAATTGAATTGGTGACCATTTGATGCCTTTTGCTCAATGGCTACTAAGCTGGAGCATGTACTTTATTTGTATATATTTTGCATTTGTGAC
ATTTTCTTAATTCGATCCGACACGTAAAGCTTGTCTTGGCATCATGTGTCCGCTTGTATGTGTCCCATAGACTCCTTCGTGTACTTTTTCTAGAATTCTCTTGGCTTCTG
AAGCTTCAACACATCTTAGAAGAGTCATATGTATAACGTTTCTTCGTTAAAAAAGTAACTCATGACCAACTTTCTGATAGTGCACTTACTATTTTCAAATGCTCCCTATA
GATATTCTCGATGTGTATGTAGTGTTTAATGTCATGATATGAGAGCTTTTCATTGGGTTCAAGAATGTAGTATGTTGGTGCTTCACGCTATGCAATTTTTATATACTAGA
TTGATTCATTGTAAGCCGCATTAAATATGGCCGACAAAGTGGTCCATGTATCTGCAATCTGGTTATTTTCATGTAGGACATATTTGTGTGTGATTCAAAGGTTTGGGCCA
ATTATCGGATGTACTTGATGTTAGTACTACCATAATATCTTCATCTAGAAATTCAAATTTCATTGGCTTACATTTTTCAACTAGTAATTAGTTAGTAAATCGACGATTGC
AGCCTCGTTTATGGTTTTTCTATTGACATAGACAATATTGTACTTAGATAGTAATACTTGCCATCTAGTTTCTCCTAGACGAAGATGACTTTTCGAAAATGTACTTTATA
AGATCCATTTTTTAGATAAACCATGTAGTACAGAATATACTGTATTAATCTTTAAGCTGCCCACGCTAAGGCACAACTTGCTTTTTTTCCAATAACGAGTATTTTTACTA
ATAATTTGTGAACTTTTTACTCAAATAAAAATCAACTTGCCCCTCCCTTCCTATAGAGTCGTTTTACCCCTATACATATTCCATTTAGATCTCCTTTATCATTAAGTATA
GGATTAACGGTTGTCTTGGAGTTGGTGGGACAAGTATTAGAGGGCTTCGTAAATGATCTTTGATTTTATCGAAAGCTCTTTGGCAATCTTTGTTCCAACAACATAGCGCA
CCCTTGCAAAGGAGTCTTAATATTGGCTTGCATGTTTGAGTGTGGTGTGAAATAAATCATGTGATGTAATTTAGTCTCCCCATGAAACTTCTAATTTCCATTTGGATTTT
TGATGGCTTTAAATTCACTATTGCCTTGATTTTGTTTGGGTTGACTTTGATTCTCCCTTGACTAACGATGAAACTCAACAATTCTAAACAATACTCTGAATATGCACTTT
GCTACTTTTGCAATCTCTTGAAAAGTATGCAAAGGGTAACTATATGCTTTTCTCCGGGTTTAGACTTTCCTATCATATCATCAAAATAACTTCAATTTTTTTTTTGCATC
AAGTCATGAAATAACGTAAGAATAGTTTTTTGGTAA
Protein sequenceShow/hide protein sequence
MGFLGFRSPPLTHLIGFNWRKGTKAKGFSVLTGSPIFTFRIYWVSSLFKIISFISVEASLISFWGATSICAANDLMESKGGKKKSSSSSSSKSLFYEAPLGYSIEDVRPH
GGIKKFRSAAYSNCVRKPS