; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G05940 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G05940
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function, DUF538
Genome locationClcChr09:4698788..4699252
RNA-Seq ExpressionClc09G05940
SyntenyClc09G05940
Gene Ontology termsNA
InterPro domainsIPR007493 - Protein of unknown function DUF538
IPR036758 - At5g01610-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646343.1 hypothetical protein Csa_016699 [Cucumis sativus]1.3e-7088.89Show/hide
Query:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
        MSPT   SPS +NG+  SVYDILREFNFPIGLLPEG+VGCKLDRTTGKLEAYLK SCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
Subjt:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV

Query:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRK
        EVVRNGDDL+FS+GMATASFPVDNFSECPPGGCGVDC+DGK ++
Subjt:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRK

XP_004136167.1 uncharacterized protein LOC101222381 [Cucumis sativus]9.6e-7791.56Show/hide
Query:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
        MSPT   SPS +NG+  SVYDILREFNFPIGLLPEG+VGCKLDRTTGKLEAYLK SCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
Subjt:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV

Query:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA
        EVVRNGDDL+FS+GMATASFPVDNFSECPPGGCGVDC+DGKVRKIKAKSLVSSA
Subjt:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA

XP_008451557.1 PREDICTED: uncharacterized protein LOC103492802 [Cucumis melo]1.5e-7793.51Show/hide
Query:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
        MSPT    PSTSNGE  SVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLK SCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
Subjt:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV

Query:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA
        EVVRNGDDL+FS+GMATASFPVDNFSECPPG CGVDCNDGKVRKIKAKSLVSSA
Subjt:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA

XP_022151830.1 uncharacterized protein LOC111019713 [Momordica charantia]3.8e-6581.7Show/hide
Query:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
        MSP+AA+ PS  NGE  SVYDILREFNFPIGLLPEG VGC+LDR TGKLEAYL G+C F PDE Y+LKYKSTISG ISRNRLT+LKGVSVKFMFFWVNIV
Subjt:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV

Query:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSS
        EVVR GDDL+FSVGMATASFPV+NFSECP  GCG+DC +GKVRKI AKSL+SS
Subjt:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSS

XP_038896189.1 uncharacterized protein LOC120084473 [Benincasa hispida]1.5e-7794.81Show/hide
Query:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
        MSPT   SPSTS+ E  SVYDILREFNFPIGLLPEGVVG KLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
Subjt:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV

Query:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA
        EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA
Subjt:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA

TrEMBL top hitse value%identityAlignment
A0A0A0K643 Uncharacterized protein4.6e-7791.56Show/hide
Query:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
        MSPT   SPS +NG+  SVYDILREFNFPIGLLPEG+VGCKLDRTTGKLEAYLK SCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
Subjt:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV

Query:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA
        EVVRNGDDL+FS+GMATASFPVDNFSECPPGGCGVDC+DGKVRKIKAKSLVSSA
Subjt:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA

A0A1S3BR55 uncharacterized protein LOC1034928027.1e-7893.51Show/hide
Query:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
        MSPT    PSTSNGE  SVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLK SCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
Subjt:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV

Query:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA
        EVVRNGDDL+FS+GMATASFPVDNFSECPPG CGVDCNDGKVRKIKAKSLVSSA
Subjt:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA

A0A5A7VQJ7 Uncharacterized protein7.1e-7893.51Show/hide
Query:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
        MSPT    PSTSNGE  SVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLK SCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
Subjt:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV

Query:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA
        EVVRNGDDL+FS+GMATASFPVDNFSECPPG CGVDCNDGKVRKIKAKSLVSSA
Subjt:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA

A0A6J1DCA2 uncharacterized protein LOC1110197131.8e-6581.7Show/hide
Query:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV
        MSP+AA+ PS  NGE  SVYDILREFNFPIGLLPEG VGC+LDR TGKLEAYL G+C F PDE Y+LKYKSTISG ISRNRLT+LKGVSVKFMFFWVNIV
Subjt:  MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIV

Query:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSS
        EVVR GDDL+FSVGMATASFPV+NFSECP  GCG+DC +GKVRKI AKSL+SS
Subjt:  EVVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSS

A0A6J1KX02 uncharacterized protein LOC1114979013.1e-6581.58Show/hide
Query:  TAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVEVV
        TAA SP+T+NGE  SVYDILREFNFPIGLLPEGVVGC LDRTTGKLEAYL  +C FSP++PYEL+YK+TISG IS+NRLT+LKGV+VKFMFFWVNIVEVV
Subjt:  TAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVEVV

Query:  RNGDDLQFSVGMATASFPVDNFSEC-PPGGCGVDCNDGKVRKIKAKSLVSSA
        RNGDDL FSVG+ATASFPVDNFSEC PPGGCG DC +GK  KIK KSLVS A
Subjt:  RNGDDLQFSVGMATASFPVDNFSEC-PPGGCGVDCNDGKVRKIKAKSLVSSA

SwissProt top hitse value%identityAlignment
Q9M015 Uncharacterized protein At5g016103.0e-0926.32Show/hide
Query:  DILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVEVVRNGDDLQFSVGM
        ++L+E++ PIG+ P      + D  T KL   +   C     +   LK+ +T++G++ + +LT+++G+  K M  WV +  +  +   + F+ GM
Subjt:  DILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVEVVRNGDDLQFSVGM

Arabidopsis top hitse value%identityAlignment
AT1G02813.1 Protein of unknown function, DUF5387.4e-2745.45Show/hide
Query:  AVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVEVVRN
        + S S S  +  SVY +L  +  P G+LPEGV    L+R TG  +     +C FS D  Y++KYK  ISG I+R R+  L GVSVK +FFW+NI EV R+
Subjt:  AVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVEVVRN

Query:  GDDLQFSVGMATASFPVDNFSECPPGGCGVDC
        GDD++F VG A+  F    F + P  GCG +C
Subjt:  GDDLQFSVGMATASFPVDNFSECPPGGCGVDC

AT1G02816.1 Protein of unknown function, DUF5386.0e-3747.02Show/hide
Query:  PTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVEV
        P+  ++ +  +  P + Y +L+ +NFP+G+LP+GVV   LD++TG+  AY   SC F+    Y+L YKSTISG IS N++T L GV VK +F W+NIVEV
Subjt:  PTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVEV

Query:  VRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSS
        +RNGD+L+FSVG+ +A+F +D F E P  GCG DC   K + +     VSS
Subjt:  VRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSS

AT4G02360.1 Protein of unknown function, DUF5382.2e-3453.12Show/hide
Query:  NGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVEVVRNGDDLQFS
        +G+  + YD ++ +N P G+LP+GVV  +L+  TG  + Y   +C F+  + Y+LKYKSTISG IS   + NLKGVSVK +FFWVNI EV  +G DL FS
Subjt:  NGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVEVVRNGDDLQFS

Query:  VGMATASFPVDNFSECPPGGCGVDCNDG
        VG+A+ASFP  NF E P  GCG DCN+G
Subjt:  VGMATASFPVDNFSECPPGGCGVDCNDG

AT4G02370.1 Protein of unknown function, DUF5383.2e-3849.02Show/hide
Query:  SPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVE
        S TAAV  +  +  P + Y +L+ +NFP+G+LP+GVV   LD TTGK  AY   SC F+    Y+L YKSTISG IS N+L  L GV VK +F W+NIVE
Subjt:  SPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVE

Query:  VVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA
        V+RNGD+++FSVG+ +A+F +  F E P  GCG +C D K+  I+    +SS+
Subjt:  VVRNGDDLQFSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA

AT5G19590.1 Protein of unknown function, DUF5383.2e-1431.75Show/hide
Query:  PTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFS-PDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVE
        P +   P+    +P   +  L    FPIGLLP  V    L++T+G    +L G+C  + P + Y   Y + ++G IS+ ++  L+G+ V+  F   +I  
Subjt:  PTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFS-PDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVE

Query:  VVRNGDDLQFSVGMATASFPVDNFSE
        +  +GD+L F V   TA +P  NF E
Subjt:  VVRNGDDLQFSVGMATASFPVDNFSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCCACCGCCGCGGTGTCGCCCTCCACCAGCAATGGCGAACCCGCGTCGGTGTACGACATTCTCCGAGAATTCAACTTCCCAATAGGTCTCCTTCCAGAAGGTGT
CGTGGGTTGCAAGTTAGATCGAACTACTGGAAAATTGGAGGCTTATTTGAAGGGATCATGTCATTTCTCACCGGATGAGCCATATGAATTGAAATACAAATCCACTATTA
GTGGAAACATTTCAAGGAATAGATTGACAAATCTAAAGGGGGTGAGTGTGAAGTTCATGTTCTTTTGGGTCAACATTGTGGAGGTGGTCAGAAATGGCGACGATCTACAA
TTCTCGGTGGGGATGGCTACGGCGTCGTTTCCTGTGGATAACTTCTCCGAGTGCCCCCCGGGTGGGTGTGGAGTCGATTGCAATGATGGGAAAGTCAGAAAAATTAAAGC
CAAATCTCTTGTTTCATCTGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCCCACCGCCGCGGTGTCGCCCTCCACCAGCAATGGCGAACCCGCGTCGGTGTACGACATTCTCCGAGAATTCAACTTCCCAATAGGTCTCCTTCCAGAAGGTGT
CGTGGGTTGCAAGTTAGATCGAACTACTGGAAAATTGGAGGCTTATTTGAAGGGATCATGTCATTTCTCACCGGATGAGCCATATGAATTGAAATACAAATCCACTATTA
GTGGAAACATTTCAAGGAATAGATTGACAAATCTAAAGGGGGTGAGTGTGAAGTTCATGTTCTTTTGGGTCAACATTGTGGAGGTGGTCAGAAATGGCGACGATCTACAA
TTCTCGGTGGGGATGGCTACGGCGTCGTTTCCTGTGGATAACTTCTCCGAGTGCCCCCCGGGTGGGTGTGGAGTCGATTGCAATGATGGGAAAGTCAGAAAAATTAAAGC
CAAATCTCTTGTTTCATCTGCTTGA
Protein sequenceShow/hide protein sequence
MSPTAAVSPSTSNGEPASVYDILREFNFPIGLLPEGVVGCKLDRTTGKLEAYLKGSCHFSPDEPYELKYKSTISGNISRNRLTNLKGVSVKFMFFWVNIVEVVRNGDDLQ
FSVGMATASFPVDNFSECPPGGCGVDCNDGKVRKIKAKSLVSSA