; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G00720 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G00720
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUPF0235 protein C15orf40 homolog
Genome locationClcChr02:653415..657053
RNA-Seq ExpressionClc02G00720
SyntenyClc02G00720
Gene Ontology termsNA
InterPro domainsIPR003746 - Protein of unknown function DUF167
IPR036591 - YggU-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151587.1 uncharacterized protein LOC101218498 [Cucumis sativus]1.3e-4895.54Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKAPKATESIQSSVK+NNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVE++
Subjt:  KSRGKVVIVEEI

XP_008460904.1 PREDICTED: UPF0235 protein LHK_03181 [Cucumis melo]1.3e-4896.43Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDF DDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVEE+
Subjt:  KSRGKVVIVEEI

XP_038902100.1 UPF0235 protein C15orf40 homolog isoform X1 [Benincasa hispida]5.4e-4788.19Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEE---ILRGN-NSKSSSM
        KSR KVVIVEE   + R + N+ SSSM
Subjt:  KSRGKVVIVEE---ILRGN-NSKSSSM

XP_038902101.1 UPF0235 protein C15orf40 homolog isoform X2 [Benincasa hispida]2.4e-4794.64Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

XP_038902103.1 UPF0235 protein C15orf40 homolog isoform X3 [Benincasa hispida]5.4e-4795.5Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKA KATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEE
        KSR KVVIVEE
Subjt:  KSRGKVVIVEE

TrEMBL top hitse value%identityAlignment
A0A0A0LMY2 Uncharacterized protein6.2e-4995.54Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKAPKATESIQSSVK+NNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVE++
Subjt:  KSRGKVVIVEEI

A0A1S3CDI8 UPF0235 protein LHK_031816.2e-4996.43Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDF DDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSRGKVVIVEE+
Subjt:  KSRGKVVIVEEI

A0A6J1CR98 uncharacterized protein LOC1110134511.0e-4390.18Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        MPP KKGKTKAPKATESIQS    N +PSCLRSV+PSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYIS+VLGVKRRQVSIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

A0A6J1EMK4 uncharacterized protein LOC1114348891.7e-4388.39Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        M P KKGKTKA K TES +S +KTNNYPSCLRSVS SSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

A0A6J1I6J5 UPF0235 protein C15orf40 homolog1.4e-4591.07Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        M P KKGKTKAPKATESIQSS+K NNYPSCLRSVS SS+AITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDY+SSVLGVKRRQ+SIGSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVEE+
Subjt:  KSRGKVVIVEEI

SwissProt top hitse value%identityAlignment
Q3ZBP8 UPF0235 protein C15orf40 homolog4.2e-1050.72Show/hide
Query:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV
        V+I IHAKPGSK  ++TD   +A+ V I AP  +GEANA L  Y+S VL +++  V +  G KSR KVV
Subjt:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV

Q505I4 UPF0235 protein C15orf40 homolog1.1e-1042.16Show/hide
Query:  KGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGK
        K +TK P+        V T+         S   V I IHAKPGSK  ++TD   +A+GV I AP  +GEANA L  Y+S VL +++  V +  G KSR K
Subjt:  KGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGK

Query:  VV
        VV
Subjt:  VV

Q54UW1 UPF0235 protein3.0e-0831.13Show/hide
Query:  KKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRG
        KKG + + K  +  Q  +  NN       V    + I ++  P SK +SI  F D  L ++I  P  DG+AN  +++++S  L +++  + +G GSKSR 
Subjt:  KKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRG

Query:  KVVIVE
        K V ++
Subjt:  KVVIVE

Q8WUR7 UPF0235 protein C15orf404.2e-1050.72Show/hide
Query:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV
        V I IHAKPGSK  ++TD   +A+ V I AP  +GEANA L  Y+S VL +++  V +  G KSR KVV
Subjt:  VAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVV

Q9CRC3 UPF0235 protein C15orf40 homolog4.2e-1040.2Show/hide
Query:  KGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGK
        K +TK P+        V T+             V I IHAKPGS+  ++TD   +A+GV I AP  +GEANA L  Y+S VL +++  V +  G KSR K
Subjt:  KGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGK

Query:  VV
        VV
Subjt:  VV

Arabidopsis top hitse value%identityAlignment
AT1G49170.1 Protein of unknown function (DUF167)2.5e-3470.54Show/hide
Query:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS
        M P KKGK K   A ES  +  +++++P+CLR ++PSSVAITIHAKPGSK ASITD  D+A+GVQIDAPA+DGEANAALL+Y+SSVLGVKRRQVS+GSGS
Subjt:  MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGS

Query:  KSRGKVVIVEEI
        KSR KVVIVE++
Subjt:  KSRGKVVIVEEI

AT5G63440.2 Protein of unknown function (DUF167)9.9e-0730.59Show/hide
Query:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI
        P C+  +    V + I  +  ++ ++IT    D + V + APA  GEAN  LL+++  VLG++  Q+++  G  S+ K+++VE++
Subjt:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI

AT5G63440.3 Protein of unknown function (DUF167)9.9e-0730.59Show/hide
Query:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI
        P C+  +    V + I  +  ++ ++IT    D + V + APA  GEAN  LL+++  VLG++  Q+++  G  S+ K+++VE++
Subjt:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVEEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCCGCCGAAGAAGGGAAAAACGAAAGCGCCGAAAGCTACTGAATCAATTCAATCCTCCGTCAAAACCAACAATTACCCATCTTGTCTTCGCTCTGTTTCTCCTTC
TTCCGTCGCCATTACCATCCACGCAAAGCCTGGCTCCAAGATCGCTTCTATAACAGACTTTGGCGATGATGCGTTAGGAGTGCAAATCGATGCACCGGCCAAAGATGGAG
AAGCTAATGCTGCACTTCTTGATTACATTAGCTCTGTTTTAGGTGTCAAAAGAAGACAAGTGTCTATTGGTTCTGGCTCTAAATCAAGAGGCAAGGTTGTGATCGTGGAG
GAGATCCTCAGAGGTAATAACTCCAAATCTTCTAGCATGAAAAGGAGAGTATGCTTCAGGATTCTTGAGGAATTACGTATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCCGCCGAAGAAGGGAAAAACGAAAGCGCCGAAAGCTACTGAATCAATTCAATCCTCCGTCAAAACCAACAATTACCCATCTTGTCTTCGCTCTGTTTCTCCTTC
TTCCGTCGCCATTACCATCCACGCAAAGCCTGGCTCCAAGATCGCTTCTATAACAGACTTTGGCGATGATGCGTTAGGAGTGCAAATCGATGCACCGGCCAAAGATGGAG
AAGCTAATGCTGCACTTCTTGATTACATTAGCTCTGTTTTAGGTGTCAAAAGAAGACAAGTGTCTATTGGTTCTGGCTCTAAATCAAGAGGCAAGGTTGTGATCGTGGAG
GAGATCCTCAGAGGTAATAACTCCAAATCTTCTAGCATGAAAAGGAGAGTATGCTTCAGGATTCTTGAGGAATTACGTATGTAG
Protein sequenceShow/hide protein sequence
MPPPKKGKTKAPKATESIQSSVKTNNYPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPAKDGEANAALLDYISSVLGVKRRQVSIGSGSKSRGKVVIVE
EILRGNNSKSSSMKRRVCFRILEELRM