; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G25140 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G25140
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationChr4:22680712..22681515
RNA-Seq ExpressionCSPI04G25140
SyntenyCSPI04G25140
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR022357 - Major intrinsic protein, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN55312.1 hypothetical protein Csa_012508 [Cucumis sativus]4.7e-3876.36Show/hide
Query:  MEIDKLDWVYIGHGFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHT
        MEIDKLDWVYIGHGFVHTNPSVTLGAVLDLMSLTDGQQFSLSV ISQNPRSKNWWLRLKVQGRPVG                         SNLRKVPHT
Subjt:  MEIDKLDWVYIGHGFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHT

Query:  GMAMGCGDYA
        GMAMGCGDYA
Subjt:  GMAMGCGDYA

KGN55313.1 hypothetical protein Csa_012040 [Cucumis sativus]3.5e-3349.06Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA---
        GFV TNPSV +GAV++ +S T+GQQ+++S+ I Q+P S NWW  LK QG PVGYW  TLFGYLDHSATLVEWGGEVFSSN++ VPHTG  MG GDYA   
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA---

Query:  -----------------------------------------ENDTIEPVFYFGGPRRGR
                                                 ++ T EPVFYFGGP   R
Subjt:  -----------------------------------------ENDTIEPVFYFGGPRRGR

TYK20107.1 uncharacterized protein E5676_scaffold134G002320 [Cucumis melo var. makuwa]1.5e-3168.04Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA
        GFV TNPSV +GAV+D +S T+GQQ+++ + I Q+P+S NWW  LK Q +PVGYW  TLFGYLDHSATLVEWGGEVFSSN++ VPHTG  MG GDYA
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA

XP_004141419.2 uncharacterized protein LOC101213587 [Cucumis sativus]3.5e-3349.06Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA---
        GFV TNPSV +GAV++ +S T+GQQ+++S+ I Q+P S NWW  LK QG PVGYW  TLFGYLDHSATLVEWGGEVFSSN++ VPHTG  MG GDYA   
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA---

Query:  -----------------------------------------ENDTIEPVFYFGGPRRGR
                                                 ++ T EPVFYFGGP   R
Subjt:  -----------------------------------------ENDTIEPVFYFGGPRRGR

XP_038896670.1 uncharacterized protein LOC120084931 [Benincasa hispida]4.4e-3654.09Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA---
        GFV TNPSV LG+VL+ +S T GQQF +SV I Q+PRSK+WW  LKVQG+PVGYWAE LFGY+DHSATLVEWGGEVFS+N++KVPHT   MG GDYA   
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA---

Query:  -------------END----------------------------TIEPVFYFGGPRRGR
                     +N                             TIEPVFYFGGP RGR
Subjt:  -------------END----------------------------TIEPVFYFGGPRRGR

TrEMBL top hitse value%identityAlignment
A0A0A0L0M3 Uncharacterized protein1.7e-3349.06Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA---
        GFV TNPSV +GAV++ +S T+GQQ+++S+ I Q+P S NWW  LK QG PVGYW  TLFGYLDHSATLVEWGGEVFSSN++ VPHTG  MG GDYA   
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA---

Query:  -----------------------------------------ENDTIEPVFYFGGPRRGR
                                                 ++ T EPVFYFGGP   R
Subjt:  -----------------------------------------ENDTIEPVFYFGGPRRGR

A0A0A0L3X7 Neprosin domain-containing protein2.3e-3876.36Show/hide
Query:  MEIDKLDWVYIGHGFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHT
        MEIDKLDWVYIGHGFVHTNPSVTLGAVLDLMSLTDGQQFSLSV ISQNPRSKNWWLRLKVQGRPVG                         SNLRKVPHT
Subjt:  MEIDKLDWVYIGHGFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHT

Query:  GMAMGCGDYA
        GMAMGCGDYA
Subjt:  GMAMGCGDYA

A0A1S4DZ87 uncharacterized protein LOC1034938977.0e-3268.04Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA
        GFV TNPSV +GAV+D +S T+GQQ+++ + I Q+P+S NWW  LK Q +PVGYW  TLFGYLDHSATLVEWGGEVFSSN++ VPHTG  MG GDYA
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA

A0A5A7V8M6 Uncharacterized protein7.0e-3268.04Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA
        GFV TNPSV +GAV+D +S T+GQQ+++ + I Q+P+S NWW  LK Q +PVGYW  TLFGYLDHSATLVEWGGEVFSSN++ VPHTG  MG GDYA
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA

A0A5D3D964 Uncharacterized protein7.0e-3268.04Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA
        GFV TNPSV +GAV+D +S T+GQQ+++ + I Q+P+S NWW  LK Q +PVGYW  TLFGYLDHSATLVEWGGEVFSSN++ VPHTG  MG GDYA
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G44210.1 Protein of Unknown Function (DUF239)4.1e-1635.71Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYAE
        GFV  N  + +G  +  +S     Q+ +++ I ++P+  +WWL+   +   +GYW  +LF YL  SA+++EWGGEV +S   +  HT   MG G +AE
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYAE

AT2G44210.2 Protein of Unknown Function (DUF239)4.1e-1635.71Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYAE
        GFV  N  + +G  +  +S     Q+ +++ I ++P+  +WWL+   +   +GYW  +LF YL  SA+++EWGGEV +S   +  HT   MG G +AE
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYAE

AT5G19170.1 Protein of Unknown Function (DUF239)1.6e-1540.82Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYAE
        GFV T+ ++ +GA +   S   G QF +++ I ++  S +WWL L     P+GYW   +F  L   AT VEWGGEV   NL  V +T   MG G+YA+
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYAE

AT5G25950.1 Protein of Unknown Function (DUF239)1.6e-2044.44Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSN-LRKVPHTGMAMGCGDYAEN
        GFV T+    LGA ++ +S +   Q+ ++V I  +P S NWW  L  +   +GYW  TLF YL HSAT V+WGGEV S N + K PHT  AMG G +A  
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSN-LRKVPHTGMAMGCGDYAEN

Query:  DTIEPVFY
           E  F+
Subjt:  DTIEPVFY

AT5G25960.1 Protein of Unknown Function (DUF239)5.4e-1643.3Show/hide
Query:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA
        GFV T     LGA ++ +S T  +Q  ++     +  S NWW  L      +GYW  TLF YL HSAT V+ GGEV S N+ K PHT  +MG G +A
Subjt:  GFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATCGACAAACTCGACTGGGTGTATATTGGACATGGGTTTGTCCACACGAATCCTAGTGTGACACTTGGTGCAGTCTTGGATCTTATGTCACTTACTGATGGACA
ACAATTCAGTCTCTCTGTTTGTATCTCTCAGAATCCTAGGTCAAAGAATTGGTGGTTGAGGTTGAAGGTGCAAGGGAGACCAGTGGGGTATTGGGCGGAGACACTATTTG
GATACTTGGACCATAGTGCCACATTAGTGGAATGGGGTGGCGAAGTTTTTAGCTCCAATCTGAGGAAAGTGCCGCACACAGGTATGGCTATGGGGTGTGGAGATTATGCA
GAAAATGATACTATTGAACCTGTCTTCTACTTTGGCGGTCCTCGTCGTGGTCGTGGTCCTGGTCCTGGTCCTGATCGTTTCTATCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATCGACAAACTCGACTGGGTGTATATTGGACATGGGTTTGTCCACACGAATCCTAGTGTGACACTTGGTGCAGTCTTGGATCTTATGTCACTTACTGATGGACA
ACAATTCAGTCTCTCTGTTTGTATCTCTCAGAATCCTAGGTCAAAGAATTGGTGGTTGAGGTTGAAGGTGCAAGGGAGACCAGTGGGGTATTGGGCGGAGACACTATTTG
GATACTTGGACCATAGTGCCACATTAGTGGAATGGGGTGGCGAAGTTTTTAGCTCCAATCTGAGGAAAGTGCCGCACACAGGTATGGCTATGGGGTGTGGAGATTATGCA
GAAAATGATACTATTGAACCTGTCTTCTACTTTGGCGGTCCTCGTCGTGGTCGTGGTCCTGGTCCTGGTCCTGATCGTTTCTATCGTTGA
Protein sequenceShow/hide protein sequence
MEIDKLDWVYIGHGFVHTNPSVTLGAVLDLMSLTDGQQFSLSVCISQNPRSKNWWLRLKVQGRPVGYWAETLFGYLDHSATLVEWGGEVFSSNLRKVPHTGMAMGCGDYA
ENDTIEPVFYFGGPRRGRGPGPGPDRFYR