CuGenDBv2

Cucsat.G16055 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G16055
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: UPF0235 protein C15orf40 homolog
Genome location: ctg2178:1162589..1164821
RNA-Seq Expression: Cucsat.G16055
Synteny: Cucsat.G16055
Gene Ontology terms: NA
InterPro domains: IPR003746 - Protein of unknown function DUF167
                  IPR036591 - YggU-like superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_004151587.1 uncharacterized protein LOC101218498 [Cucumis sativus] | e value: 2.61e-97 | identity: 100%
Query:  VNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGE
        VNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGE
Subjt:  VNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGE

Query:  ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALT
        ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALT
Subjt:  ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALT

XP_008460904.1 PREDICTED: UPF0235 protein LHK_03181 [Cucumis melo] | e value: 8.01e-92 | identity: 95.39%
Query:  VNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGE
        V+ LNGLKSKQNY S RKIPFDLGKKMPPAKKGKTKAPKATESIQSSVK+NNYPSCLRSVSPSSVAITIHAKPGSKIASITDF DDALGVQIDAPAKDGE
Subjt:  VNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGE

Query:  ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALT
        ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVE+VSLQSVFDALNKALT
Subjt:  ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALT

XP_022971730.1 UPF0235 protein C15orf40 homolog [Cucurbita maxima] | e value: 2.20e-71 | identity: 93.65%
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        M PAKKGKTKAPKATESIQSS+K NNYPSCLRSVS SS+AITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQ+SIGSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALT
        KSR KVVIVE+VSLQSVFDALNKALT
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALT

XP_023512400.1 UPF0235 protein C15orf40 homolog [Cucurbita pepo subsp. pepo] | e value: 2.98e-69 | identity: 91.27%
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        M PAKKGKTKAPK TESI+S +K+NNYPSCLRSVS SSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQ+SIG GS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALT
        KSR KVVIVE+VSLQSVFDALNKALT
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALT

XP_038902101.1 UPF0235 protein C15orf40 homolog isoform X2 [Benincasa hispida] | e value: 1.73e-73 | identity: 96.03%
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        MPPAKKGKTKA KATESIQSSVK+NNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQ+SIGSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALT
        KSR KVVIVE+VSLQSVFDALNKALT
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LMY2 Uncharacterized protein | e value: 1.26e-97 | identity: 100%
Query:  VNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGE
        VNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGE
Subjt:  VNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGE

Query:  ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALT
        ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALT
Subjt:  ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALT

A0A1S3CDI8 UPF0235 protein LHK_03181 | e value: 3.88e-92 | identity: 95.39%
Query:  VNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGE
        V+ LNGLKSKQNY S RKIPFDLGKKMPPAKKGKTKAPKATESIQSSVK+NNYPSCLRSVSPSSVAITIHAKPGSKIASITDF DDALGVQIDAPAKDGE
Subjt:  VNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGE

Query:  ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALT
        ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVE+VSLQSVFDALNKALT
Subjt:  ANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALT

A0A6J1CR98 uncharacterized protein LOC111013451 | e value: 4.51e-68 | identity: 90.48%
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        MPPAKKGKTKAPKATESIQS    N +PSCLRSV+PSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+S+VLGVKRRQVSIGSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALT
        KSR KVVIVE+VSLQ+VFDALNKALT
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALT

A0A6J1EMK4 uncharacterized protein LOC111434889 | e value: 2.39e-68 | identity: 90.48%
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        M PAKKGKTKA K TES +S +K+NNYPSCLRSVS SSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQ+SIGSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALT
        KSR KVVIVE+VSLQSVFDALNKALT
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALT

A0A6J1I6J5 UPF0235 protein C15orf40 homolog | e value: 1.06e-71 | identity: 93.65%
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        M PAKKGKTKAPKATESIQSS+K NNYPSCLRSVS SS+AITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQ+SIGSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKALT
        KSR KVVIVE+VSLQSVFDALNKALT
Subjt:  KSRGKVVIVEDVSLQSVFDALNKALT

SwissProt top hits (e value, % identity, alignment)
Q3ZBP8 UPF0235 protein C15orf40 homolog | e value: 2.3e-10 | identity: 43.02%
Query:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVV-IVEDVSLQSVFDALNK
        V+I IHAKPGSK  ++TD   +A+ V I AP  +GEANA L  Y+S VL +++  V +  G KSR KVV ++     + + + L K
Subjt:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVV-IVEDVSLQSVFDALNK

Q505I4 UPF0235 protein C15orf40 homolog | e value: 4.0e-10 | identity: 41.44%
Query:  KKMPPAKKGK--TKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSI
        KK     KGK  TK P+        V ++         S   V I IHAKPGSK  ++TD   +A+GV I AP  +GEANA L  Y+S VL +++  V +
Subjt:  KKMPPAKKGK--TKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSI

Query:  GSGSKSRGKVV
          G KSR KVV
Subjt:  GSGSKSRGKVV

Q54UW1 UPF0235 protein | e value: 7.5e-09 | identity: 28.8%
Query:  KKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRG
        KKG + + K  +  Q  + +NN       V    + I ++  P SK +SI  F D  L ++I  P  DG+AN  +++++S  L +++  + +G GSKSR 
Subjt:  KKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRG

Query:  KVVIV----EDVSLQSVFDALNKAL
        K V +    E+++   +F+ +   L
Subjt:  KVVIV----EDVSLQSVFDALNKAL

Q8WUR7 UPF0235 protein C15orf40 | e value: 1.2e-09 | identity: 34.31%
Query:  STRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSS-VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVL
        S R +  ++ KK     KGK+++ +    +         P    +V P   V I IHAKPGSK  ++TD   +A+ V I AP  +GEANA L  Y+S VL
Subjt:  STRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSS-VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVL

Query:  GVKRRQVSIGSGSKSRGKVV-IVEDVSLQSVFDALNK
         +++  V +  G KSR KVV ++   + + + + L K
Subjt:  GVKRRQVSIGSGSKSRGKVV-IVEDVSLQSVFDALNK

Q9CRC3 UPF0235 protein C15orf40 homolog | e value: 1.5e-09 | identity: 39.64%
Query:  KKMPPAKKGK--TKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSI
        KK     KGK  TK P+        V ++             V I IHAKPGS+  ++TD   +A+GV I AP  +GEANA L  Y+S VL +++  V +
Subjt:  KKMPPAKKGK--TKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSI

Query:  GSGSKSRGKVV
          G KSR KVV
Subjt:  GSGSKSRGKVV

Arabidopsis top hits (e value, % identity, alignment)
AT1G49170.1 Protein of unknown function (DUF167) | e value: 1.1e-39 | identity: 71.77%
Query:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS
        M P KKGK K   A ES  +  +S+++P+CLR ++PSSVAITIHAKPGSK ASITD  D+A+GVQIDAPA+DGEANAALL+YMSSVLGVKRRQVS+GSGS
Subjt:  MPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEDVSLQSVFDALNKA
        KSR KVVIVED++ QSVF AL++A
Subjt:  KSRGKVVIVEDVSLQSVFDALNKA

AT5G63440.2 Protein of unknown function (DUF167) | e value: 9.0e-10 | identity: 32.65%
Query:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKAL
        P C+  +    V + I  +  ++ ++IT    D + V + APA  GEAN  LL++M  VLG++  Q+++  G  S+ K+++VED+S + V++ L +A+
Subjt:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKAL

AT5G63440.3 Protein of unknown function (DUF167) | e value: 9.0e-10 | identity: 32.65%
Query:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKAL
        P C+  +    V + I  +  ++ ++IT    D + V + APA  GEAN  LL++M  VLG++  Q+++  G  S+ K+++VED+S + V++ L +A+
Subjt:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKAL
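
As a hedged illustration of how the % identity values in the hit lists relate to the alignments, the sketch below counts the letter positions in the middle (match) line of an alignment and divides by the alignment length, gaps included. It assumes the usual BLAST convention that a letter in the middle line marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The function name percent_identity is hypothetical and is not part of CuGenDB. The example input is the Q3ZBP8 alignment from the SwissProt hits.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions over alignment length (gaps included), as a percentage."""
    # Assumption: letters in the midline mark identical residues;
    # '+' and spaces are not counted as identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Gapped query line and match line of the Q3ZBP8 alignment.
query = "VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVV-IVEDVSLQSVFDALNK"
midline = "V+I IHAKPGSK  ++TD   +A+ V I AP  +GEANA L  Y+S VL +++  V +  G KSR KVV ++     + + + L K"

print(percent_identity(query, midline))  # 37 identities over 86 columns -> 43.02
```

The denominator is the length of the gapped query line rather than the match line, so trailing whitespace lost from the match line does not change the result.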


Sequences
CDS sequence
GTTAATCCATTGAATGGGCTTAAATCTAAACAAAACTATGTCAGCACTCGCAAAATTCCGTTTGATTTGGGAAAGAAAATGCCTCCGGCGAAGAAGGGGAAAACTAAGGC
GCCGAAAGCTACTGAATCCATTCAATCCTCCGTCAAATCCAACAATTACCCATCTTGTCTTCGCTCTGTTTCTCCTTCTTCCGTCGCCATTACCATCCACGCAAAGCCTG
GCTCCAAGATCGCCTCTATCACAGACTTTGGCGATGATGCACTGGGAGTACAAATCGACGCACCGGCCAAAGATGGAGAAGCTAATGCTGCACTTCTTGATTACATGAGC
TCTGTTTTAGGTGTCAAAAGAAGACAAGTGTCTATAGGTTCTGGCTCCAAATCAAGAGGCAAGGTTGTGATCGTGGAGGATGTAAGCTTGCAAAGTGTTTTTGATGCTTT
GAATAAAGCTTTAACATACTTGTGGAATATGGAGAATGCTTGCTGGAACTTAAGCTTAGAAGAAATCTGA
mRNA sequence
GTTAATCCATTGAATGGGCTTAAATCTAAACAAAACTATGTCAGCACTCGCAAAATTCCGTTTGATTTGGGAAAGAAAATGCCTCCGGCGAAGAAGGGGAAAACTAAGGC
GCCGAAAGCTACTGAATCCATTCAATCCTCCGTCAAATCCAACAATTACCCATCTTGTCTTCGCTCTGTTTCTCCTTCTTCCGTCGCCATTACCATCCACGCAAAGCCTG
GCTCCAAGATCGCCTCTATCACAGACTTTGGCGATGATGCACTGGGAGTACAAATCGACGCACCGGCCAAAGATGGAGAAGCTAATGCTGCACTTCTTGATTACATGAGC
TCTGTTTTAGGTGTCAAAAGAAGACAAGTGTCTATAGGTTCTGGCTCCAAATCAAGAGGCAAGGTTGTGATCGTGGAGGATGTAAGCTTGCAAAGTGTTTTTGATGCTTT
GAATAAAGCTTTAACATACTTGTGGAATATGGAGAATGCTTGCTGGAACTTAAGCTTAGAAGAAATCTGA
Protein sequence
VNPLNGLKSKQNYVSTRKIPFDLGKKMPPAKKGKTKAPKATESIQSSVKSNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYMS
SVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVFDALNKALTYLWNMENACWNLSLEEI
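
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 1. The names CODON_TABLE and translate are hypothetical helpers, not part of CuGenDB. Only the first ten and last four codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases cycling through T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS sequence listed above.
print(translate("GTTAATCCATTGAATGGGCTTAAATCTAAA"))  # VNPLNGLKSK
print(translate("GAAGAAATCTGA"))                    # EEI
```

Both fragments agree with the start (VNPLNGLKSK) and end (EEI) of the listed protein sequence. Passing the full five-line CDS to translate would extend the same check to every residue.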