; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC02G026360 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC02G026360
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionUPF0235 protein C15orf40 homolog
Genome locationCmU531Chr02:641696..645341
RNA-Seq ExpressionCmUC02G026360
SyntenyCmUC02G026360
Gene Ontology termsNA
InterPro domainsIPR003746 - Protein of unknown function DUF167
IPR036591 - YggU-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151587.1 uncharacterized protein LOC101218498 [Cucumis sativus]1.8e-4794.64Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKAPKATESIQSSVK+NNYPSCLRSVSPSSVAITIHAKPGSKIASITDFG DALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVE++
Subjt:  KSRGKVVIVEEI

XP_008460904.1 PREDICTED: UPF0235 protein LHK_03181 [Cucumis melo]1.8e-4795.54Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDF  DALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVEE+
Subjt:  KSRGKVVIVEEI

XP_038902100.1 UPF0235 protein C15orf40 homolog isoform X1 [Benincasa hispida]7.8e-4687.4Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFG DALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEE---ILRGN-NSKSSSM
        KSR KVVIVEE   + R + N+ SSSM
Subjt:  KSRGKVVIVEE---ILRGN-NSKSSSM

XP_038902101.1 UPF0235 protein C15orf40 homolog isoform X2 [Benincasa hispida]3.5e-4693.75Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFG DALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

XP_038902103.1 UPF0235 protein C15orf40 homolog isoform X3 [Benincasa hispida]7.8e-4694.59Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFG DALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEE
        KSR KVVIVEE
Subjt:  KSRGKVVIVEE

TrEMBL top hitse value%identityAlignment
A0A0A0LMY2 Uncharacterized protein9.0e-4894.64Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKAPKATESIQSSVK+NNYPSCLRSVSPSSVAITIHAKPGSKIASITDFG DALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVE++
Subjt:  KSRGKVVIVEEI

A0A1S3CDI8 UPF0235 protein LHK_031819.0e-4895.54Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDF  DALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVEE+
Subjt:  KSRGKVVIVEEI

A0A6J1CR98 uncharacterized protein LOC1110134511.1e-4289.29Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKAPKATESIQS    N +PSCLRSV+PSSVAITIHAKPGSKIASITDFG DALGVQIDAPAKDGEANAALLDYIS+VLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

A0A6J1EMK4 uncharacterized protein LOC1114348892.5e-4287.5Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        M P KKGKTKA K TES +S +KTNNYPSCLRSVS SSVAITIHAKPGSKIASITDFG DALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

A0A6J1I6J5 UPF0235 protein C15orf40 homolog2.1e-4490.18Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        M P KKGKTKAPKATESIQSS+K NNYPSCLRSVS SS+AITIHAKPGSKIASITDFG DALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

SwissProt top hitse value%identityAlignment
C1D6C4 UPF0235 protein LHK_031811.5e-0737.5Show/hide
Query:  SVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI
        + +   V +T+H +PG++   +     DAL +++ AP  DG+ANA LL +++  LGV R  V++ SG  SR KVV +  I
Subjt:  SVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI

Q3ZBP8 UPF0235 protein C15orf40 homolog5.5e-1050.72Show/hide
Query:  VAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV
        V+I IHAKPGSK  ++TD   +A+ V I AP  +GEANA L  Y+S VL +++  V +  G KSR KVV
Subjt:  VAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV

Q505I4 UPF0235 protein C15orf40 homolog1.1e-1042.16Show/hide
Query:  KGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGK
        K +TK P+        V T+         S   V I IHAKPGSK  ++TD   +A+GV I AP  +GEANA L  Y+S VL +++  V +  G KSR K
Subjt:  KGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGK

Query:  VV
        VV
Subjt:  VV

Q8WUR7 UPF0235 protein C15orf401.4e-1052.17Show/hide
Query:  VAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV
        V I IHAKPGSK  ++TD  A+A+ V I AP  +GEANA L  Y+S VL +++  V +  G KSR KVV
Subjt:  VAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV

Q9CRC3 UPF0235 protein C15orf40 homolog4.2e-1040.2Show/hide
Query:  KGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGK
        K +TK P+        V T+             V I IHAKPGS+  ++TD   +A+GV I AP  +GEANA L  Y+S VL +++  V +  G KSR K
Subjt:  KGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGK

Query:  VV
        VV
Subjt:  VV

Arabidopsis top hitse value%identityAlignment
AT1G49170.1 Protein of unknown function (DUF167)3.6e-3369.64Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        M P KKGK K   A ES  +  +++++P+CLR ++PSSVAITIHAKPGSK ASITD   +A+GVQIDAPA+DGEANAALL+Y+SSVLGVKRRQVS+GSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVE++
Subjt:  KSRGKVVIVEEI

AT5G63440.2 Protein of unknown function (DUF167)2.6e-0731.76Show/hide
Query:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI
        P C+  +    V + I  +  ++ ++IT   AD + V + APA  GEAN  LL+++  VLG++  Q+++  G  S+ K+++VE++
Subjt:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI

AT5G63440.3 Protein of unknown function (DUF167)2.6e-0731.76Show/hide
Query:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI
        P C+  +    V + I  +  ++ ++IT   AD + V + APA  GEAN  LL+++  VLG++  Q+++  G  S+ K+++VE++
Subjt:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCCGCCGAAGAAGGGAAAAACGAAAGCGCCGAAAGCTACTGAATCAATTCAATCCTCCGTCAAAACCAACAATTACCCATCTTGTCTTCGCTCTGTTTCT
CCTTCTTCCGTCGCCATTACCATCCACGCAAAGCCTGGCTCCAAGATCGCTTCTATAACAGACTTTGGCGCTGATGCGTTAGGAGTGCAAATCGATGCACCGGCC
AAAGATGGAGAAGCTAATGCTGCACTTCTTGATTACATTAGCTCTGTTTTAGGTGTCAAAAGAAGACAAGTGTCTATTGGTTCTGGCTCTAAATCAAGAGGCAAG
GTTGTGATCGTGGAGGAGATCCTCAGAGGTAATAACTCCAAATCTTCTAGCATGAAAAGGAGAGTATGCTTCAGGATTCTTGAGGAATTACGTATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCCGCCGAAGAAGGGAAAAACGAAAGCGCCGAAAGCTACTGAATCAATTCAATCCTCCGTCAAAACCAACAATTACCCATCTTGTCTTCGCTCTGTTTCT
CCTTCTTCCGTCGCCATTACCATCCACGCAAAGCCTGGCTCCAAGATCGCTTCTATAACAGACTTTGGCGCTGATGCGTTAGGAGTGCAAATCGATGCACCGGCC
AAAGATGGAGAAGCTAATGCTGCACTTCTTGATTACATTAGCTCTGTTTTAGGTGTCAAAAGAAGACAAGTGTCTATTGGTTCTGGCTCTAAATCAAGAGGCAAG
GTTGTGATCGTGGAGGAGATCCTCAGAGGTAATAACTCCAAATCTTCTAGCATGAAAAGGAGAGTATGCTTCAGGATTCTTGAGGAATTACGTATGTAG
Protein sequenceShow/hide protein sequence
MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGADALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGK
VVIVEEILRGNNSKSSSMKRRVCFRILEELRM