; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G15550 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G15550
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionUPF0235 protein C15orf40 homolog
Genome locationChr2:15142649..15144342
RNA-Seq ExpressionCSPI02G15550
SyntenyCSPI02G15550
Gene Ontology termsNA
InterPro domainsIPR003746 - Protein of unknown function DUF167
IPR036591 - YggU-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151587.1 uncharacterized protein LOC101218498 [Cucumis sativus]7.5e-77100Show/hide
Query:  MTVNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKD
        MTVNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKD
Subjt:  MTVNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKD

Query:  GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTCE
        GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTCE
Subjt:  GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTCE

XP_008460904.1 PREDICTED: UPF0235 protein LHK_03181 [Cucumis melo]1.1e-7295.51Show/hide
Query:  MTVNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKD
        MTV+ LNGLKSKQNY S RKIPFDLGKKMPPAKKGKTKAPKATESIQSSVK+NNYPSCLRSVSPSSVAITIHAKPGSKIASITDF DDALGVQIDAPAKD
Subjt:  MTVNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKD

Query:  GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTCE
        GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVE+VSLQSVFDALNKALTCE
Subjt:  GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTCE

XP_022143592.1 uncharacterized protein LOC111013451 [Momordica charantia]7.5e-5390.62Show/hide
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        MPPAKKGKTKAPKATESIQ    SN +PSCLRSV+PSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+S+VLGVKRRQVSIGSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALTCE
        KSR KVVIVE+VSLQ+VFDALNKALTCE
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALTCE

XP_022971730.1 UPF0235 protein C15orf40 homolog [Cucurbita maxima]4.0e-5493.65Show/hide
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        M PAKKGKTKAPKATESIQSS+K NNYPSCLRSVS SS+AITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQ+SIGSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALT
        KSR KVVIVE+VSLQSVFDALNKALT
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALT

XP_038902101.1 UPF0235 protein C15orf40 homolog isoform X2 [Benincasa hispida]2.3e-5796.09Show/hide
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        MPPAKKGKTKA KATESIQSSVK+NNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQ+SIGSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALTCE
        KSR KVVIVE+VSLQSVFDALNKALTCE
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALTCE

TrEMBL top hitse value%identityAlignment
A0A0A0LMY2 Uncharacterized protein3.6e-77100Show/hide
Query:  MTVNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKD
        MTVNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKD
Subjt:  MTVNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKD

Query:  GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTCE
        GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTCE
Subjt:  GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTCE

A0A1S3CDI8 UPF0235 protein LHK_031815.4e-7395.51Show/hide
Query:  MTVNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKD
        MTV+ LNGLKSKQNY S RKIPFDLGKKMPPAKKGKTKAPKATESIQSSVK+NNYPSCLRSVSPSSVAITIHAKPGSKIASITDF DDALGVQIDAPAKD
Subjt:  MTVNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKD

Query:  GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTCE
        GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVE+VSLQSVFDALNKALTCE
Subjt:  GEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTCE

A0A6J1CR98 uncharacterized protein LOC1110134513.6e-5390.62Show/hide
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        MPPAKKGKTKAPKATESIQ    SN +PSCLRSV+PSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+S+VLGVKRRQVSIGSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALTCE
        KSR KVVIVE+VSLQ+VFDALNKALTCE
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALTCE

A0A6J1EMK4 uncharacterized protein LOC1114348896.9e-5290.48Show/hide
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        M PAKKGKTKA K TES +S +K+NNYPSCLRSVS SSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQ+SIGSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALT
        KSR KVVIVE+VSLQSVFDALNKALT
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALT

A0A6J1I6J5 UPF0235 protein C15orf40 homolog1.9e-5493.65Show/hide
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        M PAKKGKTKAPKATESIQSS+K NNYPSCLRSVS SS+AITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQ+SIGSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALT
        KSR KVVIVE+VSLQSVFDALNKALT
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALT

SwissProt top hitse value%identityAlignment
Q3ZBP8 UPF0235 protein C15orf40 homolog1.6e-1043.02Show/hide
Query:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVV-IVEDVSLQSVFDALNK
        V+I IHAKPGSK  ++TD   +A+ V I AP  +GEANA L  Y+S VL +++  V +  G KSR KVV ++     + + + L K
Subjt:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVV-IVEDVSLQSVFDALNK

Q505I4 UPF0235 protein C15orf40 homolog3.7e-1041.44Show/hide
Query:  KKMPPAKKGK--TKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSI
        KK     KGK  TK P+        V ++         S   V I IHAKPGSK  ++TD   +A+GV I AP  +GEANA L  Y+S VL +++  V +
Subjt:  KKMPPAKKGK--TKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSI

Query:  GSGSKSRGKVV
          G KSR KVV
Subjt:  GSGSKSRGKVV

Q54UW1 UPF0235 protein5.3e-0928.8Show/hide
Query:  KKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRG
        KKG + + K  +  Q  + +NN       V    + I ++  P SK +SI  F D  L ++I  P  DG+AN  +++++S  L +++  + +G GSKSR 
Subjt:  KKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRG

Query:  KVVIV----EDVSLQSVFDALNKAL
        K V +    E+++   +F+ +   L
Subjt:  KVVIV----EDVSLQSVFDALNKAL

Q8WUR7 UPF0235 protein C15orf408.2e-1034.31Show/hide
Query:  STRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSS-VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVL
        S R +  ++ KK     KGK+++ +    +         P    +V P   V I IHAKPGSK  ++TD   +A+ V I AP  +GEANA L  Y+S VL
Subjt:  STRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSS-VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVL

Query:  GVKRRQVSIGSGSKSRGKVV-IVEDVSLQSVFDALNK
         +++  V +  G KSR KVV ++   + + + + L K
Subjt:  GVKRRQVSIGSGSKSRGKVV-IVEDVSLQSVFDALNK

Q9CRC3 UPF0235 protein C15orf40 homolog1.4e-0939.64Show/hide
Query:  KKMPPAKKGK--TKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSI
        KK     KGK  TK P+        V ++             V I IHAKPGS+  ++TD   +A+GV I AP  +GEANA L  Y+S VL +++  V +
Subjt:  KKMPPAKKGK--TKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSI

Query:  GSGSKSRGKVV
          G KSR KVV
Subjt:  GSGSKSRGKVV

Arabidopsis top hitse value%identityAlignment
AT1G49170.1 Protein of unknown function (DUF167)7.8e-4071.77Show/hide
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        M P KKGK K   A ES  +  +S+++P+CLR ++PSSVAITIHAKPGSK ASITD  D+A+GVQIDAPA+DGEANAALL+YMSSVLGVKRRQVS+GSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKA
        KSR KVVIVED++ QSVF AL++A
Subjt:  KSRGKVVIVEDVSLQSVFDALNKA

AT5G63440.2 Protein of unknown function (DUF167)6.4e-1032.65Show/hide
Query:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKAL
        P C+  +    V + I  +  ++ ++IT    D + V + APA  GEAN  LL++M  VLG++  Q+++  G  S+ K+++VED+S + V++ L +A+
Subjt:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKAL

AT5G63440.3 Protein of unknown function (DUF167)6.4e-1032.65Show/hide
Query:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKAL
        P C+  +    V + I  +  ++ ++IT    D + V + APA  GEAN  LL++M  VLG++  Q+++  G  S+ K+++VED+S + V++ L +A+
Subjt:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGTTAATCCATTGAATGGGCTTAAATCTAAACAAAACTATGTCAGCACTCGCAAAATTCCGTTTGATTTGGGAAAGAAAATGCCTCCGGCGAAGAAGGGGAAAAC
TAAGGCGCCGAAAGCTACTGAATCCATTCAATCCTCCGTCAAATCCAACAATTACCCATCTTGTCTTCGCTCTGTTTCTCCTTCTTCCGTCGCCATTACCATCCACGCAA
AGCCTGGCTCCAAGATCGCCTCTATCACAGACTTTGGCGATGATGCGTTGGGAGTACAAATCGACGCACCGGCCAAAGATGGAGAAGCTAATGCTGCACTTCTTGATTAC
ATGAGCTCTGTTTTAGGTGTCAAAAGAAGACAAGTGTCTATAGGTTCTGGCTCCAAATCAAGAGGCAAGGTTGTGATCGTGGAGGATGTAAGCTTGCAAAGTGTTTTTGA
TGCTTTGAATAAAGCTTTAACATGTGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATAACAGACTTATTATGACGGTTAATCCATTGAATGGGCTTAAATCTAAACAAAACTATGTCAGCACTCGCAAAATTCCGTTTGATTTGGGAAAGAAAATGCCTCCGGCG
AAGAAGGGGAAAACTAAGGCGCCGAAAGCTACTGAATCCATTCAATCCTCCGTCAAATCCAACAATTACCCATCTTGTCTTCGCTCTGTTTCTCCTTCTTCCGTCGCCAT
TACCATCCACGCAAAGCCTGGCTCCAAGATCGCCTCTATCACAGACTTTGGCGATGATGCGTTGGGAGTACAAATCGACGCACCGGCCAAAGATGGAGAAGCTAATGCTG
CACTTCTTGATTACATGAGCTCTGTTTTAGGTGTCAAAAGAAGACAAGTGTCTATAGGTTCTGGCTCCAAATCAAGAGGCAAGGTTGTGATCGTGGAGGATGTAAGCTTG
CAAAGTGTTTTTGATGCTTTGAATAAAGCTTTAACATGTGAGTGATTCTATGGGCTCTTTGGAAGTTCACACCCTTTAACAGAATATCAGCAATACTGTAATAAGTATAT
AGGTGGATTACCTACTACCTTGCTGCCAGAACATTCAAAGTTATACAGCTAGATGTAACTCTTATATGTTTTTATTTATCTCAAACCTCAACAATAGTTTACGTGCTGTA
TATTTGTAGAGTTTCAACTTTATGGTAGTGAGCCGAAATCTCCTAACCTGACTGCAAGTGATTAATCTTGTTCAATTGGTTGATTCTTTACCA
Protein sequenceShow/hide protein sequence
MTVNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY
MSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTCE