; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy5G028540 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy5G028540
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionProtein of unknown function (DUF1997)
Genome locationGy14Chr5:32145105..32146297
RNA-Seq ExpressionCsGy5G028540
SyntenyCsGy5G028540
Gene Ontology termsNA
InterPro domainsIPR018971 - Protein of unknown function DUF1997


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141578.1 uncharacterized protein LOC101212716 isoform X2 [Cucumis sativus]1.24e-10367.73Show/hide
Query:  MTMLCSNTKPMCLHYFQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKIN
        MTMLCSNTKPMCLHYFQRESSLKKQ+VKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY                       W+I 
Subjt:  MTMLCSNTKPMCLHYFQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKIN

Query:  Q------------------IGQSN---------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFV
                           I ++N                      +W+      EYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFV
Subjt:  Q------------------IGQSN---------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFV

Query:  PNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRMEKE--NIGKVNTSK
        PNDVLRGIIETVMKAMVEDLKHKTVHKLVEDY KFRMEKE  NIGKVNTSK
Subjt:  PNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRMEKE--NIGKVNTSK

XP_016902410.1 PREDICTED: uncharacterized protein LOC103498744 [Cucumis melo]6.74e-5753.47Show/hide
Query:  MCLHYFQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKINQ---------
        MCLH FQRESSLKKQ++K W+CFAI PRSQK IHH+NLLSVSF+SFSDL L+ESPGKASFDEY                       W+I           
Subjt:  MCLHYFQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKINQ---------

Query:  ---------IGQSN---------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIE
                 I ++N                      +W+      +YRPSSANVCSHGVIYR+KIGTRS LKF+LVIDLSFLVPDALHFVPNDVLRG+I 
Subjt:  ---------IGQSN---------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIE

Query:  TV
        TV
Subjt:  TV

XP_023517005.1 uncharacterized protein LOC111780797 isoform X2 [Cucurbita pepo subsp. pepo]1.98e-4242.13Show/hide
Query:  RESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEYW-------------KINQIGQ-----------------------
        ++S LK Q++  WKCFA+    QK     NLLSVS  SFSD+PLYE  GKASFD+Y              K  Q+ Q                       
Subjt:  RESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEYW-------------KINQIGQ-----------------------

Query:  -------------------SNISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAMVEDLKHKT
                           +  +W+      +YRPSSANVCS G IY +K G RSRLKFQL I+LSF +PDAL FVP DV + I+E  +KAMVED+K K 
Subjt:  -------------------SNISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAMVEDLKHKT

Query:  VHKLVEDYGKFRMEKE
        + +LVEDY  FR EK+
Subjt:  VHKLVEDYGKFRMEKE

XP_031741979.1 uncharacterized protein LOC101212716 isoform X1 [Cucumis sativus]1.82e-10367.46Show/hide
Query:  MTMLCSNTKPMCLHYFQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKIN
        MTMLCSNTKPMCLHYFQRESSLKKQ+VKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY                       W+I 
Subjt:  MTMLCSNTKPMCLHYFQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKIN

Query:  Q------------------IGQSN----------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHF
                           I ++N                       +W+      EYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHF
Subjt:  Q------------------IGQSN----------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHF

Query:  VPNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRMEKE--NIGKVNTSK
        VPNDVLRGIIETVMKAMVEDLKHKTVHKLVEDY KFRMEKE  NIGKVNTSK
Subjt:  VPNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRMEKE--NIGKVNTSK

XP_038891182.1 uncharacterized protein LOC120080556 [Benincasa hispida]2.71e-5243.95Show/hide
Query:  MLCSNTKPMCLHY------FQRESS-----LKKQEVKNWKCFAIDPRSQKIIHHN--NLLSVSFVSFSDLPLYESPGKASFDEYW-------------KI
        MLC  T  +C          QRESS     LKKQ++K WKCFA+  ++QK+ HH+  NLLSVS   FSDLPLY+SPGKASFDEY              KI
Subjt:  MLCSNTKPMCLHY------FQRESS-----LKKQEVKNWKCFAIDPRSQKIIHHN--NLLSVSFVSFSDLPLYESPGKASFDEYW-------------KI

Query:  NQIGQSN------------------------------------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLS
         Q+ Q                                                   +W+      +YRPS ANVCS G IY +KIGTRS LKF+L+I+LS
Subjt:  NQIGQSN------------------------------------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLS

Query:  FLVPDALHFVPNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRME
        FLVP  L+FV NDVL+ I++T +KAM+EDLKHK++HKLVEDY +FR E
Subjt:  FLVPDALHFVPNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRME

TrEMBL top hitse value%identityAlignment
A0A0A0KSD5 Uncharacterized protein6.01e-10467.73Show/hide
Query:  MTMLCSNTKPMCLHYFQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKIN
        MTMLCSNTKPMCLHYFQRESSLKKQ+VKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY                       W+I 
Subjt:  MTMLCSNTKPMCLHYFQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKIN

Query:  Q------------------IGQSN---------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFV
                           I ++N                      +W+      EYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFV
Subjt:  Q------------------IGQSN---------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFV

Query:  PNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRMEKE--NIGKVNTSK
        PNDVLRGIIETVMKAMVEDLKHKTVHKLVEDY KFRMEKE  NIGKVNTSK
Subjt:  PNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRMEKE--NIGKVNTSK

A0A1S4E357 uncharacterized protein LOC1034987443.26e-5753.47Show/hide
Query:  MCLHYFQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKINQ---------
        MCLH FQRESSLKKQ++K W+CFAI PRSQK IHH+NLLSVSF+SFSDL L+ESPGKASFDEY                       W+I           
Subjt:  MCLHYFQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKINQ---------

Query:  ---------IGQSN---------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIE
                 I ++N                      +W+      +YRPSSANVCSHGVIYR+KIGTRS LKF+LVIDLSFLVPDALHFVPNDVLRG+I 
Subjt:  ---------IGQSN---------------------ISWKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIE

Query:  TV
        TV
Subjt:  TV

A0A6J1C174 uncharacterized protein LOC111006493 isoform X19.59e-3336.16Show/hide
Query:  FQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKINQIGQSNIS-------
        F      K++ +++ K  A+    Q+   H NLLS S   FSD+PL ESPGKASFD+Y                       W+I       +S       
Subjt:  FQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEY-----------------------WKINQIGQSNIS-------

Query:  -------------------------------WKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAM
                                       W+       YRPSSANV S G IY +K GT SRLKFQ  ++ +F+VP AL F+P D+ R I ETV+K M
Subjt:  -------------------------------WKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAM

Query:  VEDLKHKTVHKLVEDYGKFRMEKE
        +EDL +K + KLVEDY KFR EK+
Subjt:  VEDLKHKTVHKLVEDYGKFRMEKE

A0A6J1HEU2 uncharacterized protein LOC1114623973.61e-4141.44Show/hide
Query:  RESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEYW-------------KINQIGQS----------------------
        ++S LK Q++  WKCFA+    QK     NLLSVS  SFSD+PLYE  GKASFD+Y              K  Q+ Q                       
Subjt:  RESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEYW-------------KINQIGQS----------------------

Query:  -----------------NIS---------WKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAMVE
                         NI+         W+      +YRPSSANVCS G IY +K G RSRLKFQL I+LSF +PDAL FVP DV + I+E  +K MVE
Subjt:  -----------------NIS---------WKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAMVE

Query:  DLKHKTVHKLVEDYGKFRMEKE
        D+K K + +LVEDY  FR EK+
Subjt:  DLKHKTVHKLVEDYGKFRMEKE

A0A6J1JUJ3 uncharacterized protein LOC1114876271.65e-3740.09Show/hide
Query:  RESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEYW-------------KINQIGQS----------------------
        ++S LK Q++  WKCFA+             LSVS  SFSD+PLYE  GKASFD+Y              K  Q+ Q                       
Subjt:  RESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEYW-------------KINQIGQS----------------------

Query:  -----------------NIS---------WKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAMVE
                         NI+         W+    Q +Y PSSANVCS G IY +K G RSRLKFQL I+LSF +PDAL F+P DV + I+ET +KAMVE
Subjt:  -----------------NIS---------WKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAMVE

Query:  DLKHKTVHKLVEDYGKFRMEKE
        D+K K + +LVEDY  FR EK+
Subjt:  DLKHKTVHKLVEDYGKFRMEKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G39520.1 Protein of unknown function (DUF1997)2.6e-1030.11Show/hide
Query:  WKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRMEKE
        W+         P+   +   G +Y  + G  +RLK +L   +SF++P  L  VP DV R +   ++  +V+++KH+ +  LV DY KF+ E++
Subjt:  WKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRMEKE

AT5G39530.1 Protein of unknown function (DUF1997)9.1e-1132.26Show/hide
Query:  WKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRMEKE
        WK         P+  ++   G +Y  + G  +RL+ QL +++SF++P  L  VP DV R +   V+  +VE++KHK    L+ DY +F+ E++
Subjt:  WKKPTAQPEYRPSSANVCSHGVIYRQKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRMEKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGATGCTGTGCTCAAATACAAAGCCAATGTGTTTGCATTATTTTCAAAGAGAAAGCAGTTTGAAGAAGCAGGAAGTTAAGAACTGGAAGTGCTTTGCTATTGATCC
CAGAAGTCAAAAAATCATTCATCATAATAACCTCTTATCTGTTTCTTTTGTATCTTTTAGTGACTTACCACTTTATGAATCTCCTGGGAAAGCTTCATTTGATGAATATT
GGAAGATAAACCAGATTGGTCAAAGCAACATTTCCTGGAAAAAACCAACAGCTCAACCAGAGTATAGGCCATCTTCAGCCAATGTTTGTTCTCATGGAGTTATCTATAGA
CAAAAAATTGGAACAAGAAGCCGCCTTAAGTTTCAACTTGTAATCGATCTCAGCTTTCTTGTACCGGACGCGCTCCATTTCGTTCCAAATGACGTTTTACGGGGCATTAT
CGAGACGGTTATGAAGGCAATGGTTGAGGACTTGAAGCATAAAACTGTACATAAATTGGTTGAGGATTATGGTAAGTTTAGGATGGAGAAAGAAAATATTGGAAAAGTAA
ACACATCAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGACGATGCTGTGCTCAAATACAAAGCCAATGTGTTTGCATTATTTTCAAAGAGAAAGCAGTTTGAAGAAGCAGGAAGTTAAGAACTGGAAGTGCTTTGCTATTGATCC
CAGAAGTCAAAAAATCATTCATCATAATAACCTCTTATCTGTTTCTTTTGTATCTTTTAGTGACTTACCACTTTATGAATCTCCTGGGAAAGCTTCATTTGATGAATATT
GGAAGATAAACCAGATTGGTCAAAGCAACATTTCCTGGAAAAAACCAACAGCTCAACCAGAGTATAGGCCATCTTCAGCCAATGTTTGTTCTCATGGAGTTATCTATAGA
CAAAAAATTGGAACAAGAAGCCGCCTTAAGTTTCAACTTGTAATCGATCTCAGCTTTCTTGTACCGGACGCGCTCCATTTCGTTCCAAATGACGTTTTACGGGGCATTAT
CGAGACGGTTATGAAGGCAATGGTTGAGGACTTGAAGCATAAAACTGTACATAAATTGGTTGAGGATTATGGTAAGTTTAGGATGGAGAAAGAAAATATTGGAAAAGTAA
ACACATCAAAATGA
Protein sequenceShow/hide protein sequence
MTMLCSNTKPMCLHYFQRESSLKKQEVKNWKCFAIDPRSQKIIHHNNLLSVSFVSFSDLPLYESPGKASFDEYWKINQIGQSNISWKKPTAQPEYRPSSANVCSHGVIYR
QKIGTRSRLKFQLVIDLSFLVPDALHFVPNDVLRGIIETVMKAMVEDLKHKTVHKLVEDYGKFRMEKENIGKVNTSK