; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G017830 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G017830
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionUPF0235 protein C15orf40 homolog
Genome locationCicolChr02:650801..654654
RNA-Seq ExpressionCcUC02G017830
SyntenyCcUC02G017830
Gene Ontology termsNA
InterPro domainsIPR003746 - Protein of unknown function DUF167
IPR036591 - YggU-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151587.1 uncharacterized protein LOC101218498 [Cucumis sativus]1.8e-4793.75Show/hide
Query:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVK+NNYPSCLRSV PSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVE++
Subjt:  KSRGKVVIVEEI

XP_008460904.1 PREDICTED: UPF0235 protein LHK_03181 [Cucumis melo]1.8e-4794.64Show/hide
Query:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVKTNNYPSCLRSV PSSVAITIHAKPGSKIASITDF DDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVEE+
Subjt:  KSRGKVVIVEEI

XP_038902100.1 UPF0235 protein C15orf40 homolog isoform X1 [Benincasa hispida]5.4e-4787.4Show/hide
Query:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVKTNNYPSCLRSV PSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEE---ILRGN-NSKSSSM
        KSR KVVIVEE   + R + N+ SSSM
Subjt:  KSRGKVVIVEE---ILRGN-NSKSSSM

XP_038902101.1 UPF0235 protein C15orf40 homolog isoform X2 [Benincasa hispida]2.4e-4793.75Show/hide
Query:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVKTNNYPSCLRSV PSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

XP_038902103.1 UPF0235 protein C15orf40 homolog isoform X3 [Benincasa hispida]5.4e-4794.59Show/hide
Query:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVKTNNYPSCLRSV PSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEE
        KSR KVVIVEE
Subjt:  KSRGKVVIVEE

TrEMBL top hitse value%identityAlignment
A0A0A0LMY2 Uncharacterized protein9.0e-4893.75Show/hide
Query:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVK+NNYPSCLRSV PSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVE++
Subjt:  KSRGKVVIVEEI

A0A1S3CDI8 UPF0235 protein LHK_031819.0e-4894.64Show/hide
Query:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVKTNNYPSCLRSV PSSVAITIHAKPGSKIASITDF DDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVEE+
Subjt:  KSRGKVVIVEEI

A0A6J1CR98 uncharacterized protein LOC1110134516.6e-4389.29Show/hide
Query:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQS    N +PSCLRSV PSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYIS+VLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

A0A6J1EMK4 uncharacterized protein LOC1114348892.3e-4387.5Show/hide
Query:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        M P KKGKTKA K TES +S +KTNNYPSCLRSV  SSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

A0A6J1I6J5 UPF0235 protein C15orf40 homolog2.1e-4489.29Show/hide
Query:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        M P KKGKTKA KATESIQSS+K NNYPSCLRSV  SS+AITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

SwissProt top hitse value%identityAlignment
Q3ZBP8 UPF0235 protein C15orf40 homolog3.2e-1050.72Show/hide
Query:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV
        V+I IHAKPGSK  ++TD   +A+ V I AP  +GEANA L  Y+S VL +++  V +  G KSR KVV
Subjt:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV

Q505I4 UPF0235 protein C15orf40 homolog2.2e-1152.17Show/hide
Query:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV
        V I IHAKPGSK  ++TD   +A+GV I AP  +GEANA L  Y+S VL +++  V +  G KSR KVV
Subjt:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV

Q54UW1 UPF0235 protein2.3e-0831.13Show/hide
Query:  KKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRG
        KKG + + K  +  Q  +  NN       V    + I ++  P SK +SI  F D  L ++I  P  DG+AN  +++++S  L +++  + +G GSKSR 
Subjt:  KKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRG

Query:  KVVIVE
        K V ++
Subjt:  KVVIVE

Q8WUR7 UPF0235 protein C15orf403.2e-1050.72Show/hide
Query:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV
        V I IHAKPGSK  ++TD   +A+ V I AP  +GEANA L  Y+S VL +++  V +  G KSR KVV
Subjt:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV

Q9CRC3 UPF0235 protein C15orf40 homolog4.9e-1150.72Show/hide
Query:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV
        V I IHAKPGS+  ++TD   +A+GV I AP  +GEANA L  Y+S VL +++  V +  G KSR KVV
Subjt:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV

Arabidopsis top hitse value%identityAlignment
AT1G49170.1 Protein of unknown function (DUF167)1.5e-3470.54Show/hide
Query:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        M P KKGK K   A ES  +  +++++P+CLR + PSSVAITIHAKPGSK ASITD  D+A+GVQIDAPA+DGEANAALL+Y+SSVLGVKRRQVS+GSGS
Subjt:  MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVE++
Subjt:  KSRGKVVIVEEI

AT5G63440.2 Protein of unknown function (DUF167)9.9e-0730.59Show/hide
Query:  PSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI
        P C+  +    V + I  +  ++ ++IT    D + V + APA  GEAN  LL+++  VLG++  Q+++  G  S+ K+++VE++
Subjt:  PSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI

AT5G63440.3 Protein of unknown function (DUF167)9.9e-0730.59Show/hide
Query:  PSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI
        P C+  +    V + I  +  ++ ++IT    D + V + APA  GEAN  LL+++  VLG++  Q+++  G  S+ K+++VE++
Subjt:  PSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCCGCCGAAGAAGGGAAAAACGAAAGCGCTGAAAGCTACTGAATCAATTCAATCCTCCGTCAAAACCAACAATTACCCATCTTGTCTTCGCTCTGTTTTTCCTTC
TTCCGTCGCCATTACCATCCACGCAAAGCCTGGCTCCAAGATCGCTTCTATAACAGACTTTGGCGATGATGCGTTAGGAGTGCAAATCGATGCACCGGCCAAAGATGGAG
AAGCTAATGCTGCACTTCTTGATTACATTAGCTCTGTTTTAGGTGTCAAAAGAAGACAAGTGTCTATTGGTTCTGGCTCTAAATCAAGAGGCAAGGTTGTGATCGTGGAG
GAGATCCTCAGAGGTAATAACTCCAAATCTTCTAGCATGAAAAGGAGAGTATGCTTCAGGATTCTTGAGGAATTACGTATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCCGCCGAAGAAGGGAAAAACGAAAGCGCTGAAAGCTACTGAATCAATTCAATCCTCCGTCAAAACCAACAATTACCCATCTTGTCTTCGCTCTGTTTTTCCTTC
TTCCGTCGCCATTACCATCCACGCAAAGCCTGGCTCCAAGATCGCTTCTATAACAGACTTTGGCGATGATGCGTTAGGAGTGCAAATCGATGCACCGGCCAAAGATGGAG
AAGCTAATGCTGCACTTCTTGATTACATTAGCTCTGTTTTAGGTGTCAAAAGAAGACAAGTGTCTATTGGTTCTGGCTCTAAATCAAGAGGCAAGGTTGTGATCGTGGAG
GAGATCCTCAGAGGTAATAACTCCAAATCTTCTAGCATGAAAAGGAGAGTATGCTTCAGGATTCTTGAGGAATTACGTATGTAG
Protein sequenceShow/hide protein sequence
MPPPKKGKTKALKATESIQSSVKTNNYPSCLRSVFPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVE
EILRGNNSKSSSMKRRVCFRILEELRM