CuGenDBv2

Cucsat.G8527 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G8527
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: glutamic acid-rich protein isoform X1
Genome location: ctg1557:3672226..3675297
RNA-Seq Expression: Cucsat.G8527
Synteny: Cucsat.G8527
Gene Ontology terms: NA
InterPro domains: IPR019351 - Protein of unknown function DUF2039


Homology
GenBank top hits (e value, % identity, alignment)
KAG7033212.1 hypothetical protein SDJN02_07266 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.02e-112; % identity: 84.88)
Query:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQNKYAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEP KCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDD--NESTDDTDGDNYENEDEHE
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA +EE+K GDSI S TE QA +GR EDD  NESTDDTD D  E+EDE  
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDD--NESTDDTDGDNYENEDEHE

Query:  CENEK
        CENE+
Subjt:  CENEK

XP_004141470.1 uncharacterized protein LOC101206376 [Cucumis sativus] (e value: 1.39e-144; % identity: 100)
Query:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE

Query:  NEKDEDGKE
        NEKDEDGKE
Subjt:  NEKDEDGKE

XP_008459394.1 PREDICTED: uncharacterized protein LOC103498541 [Cucumis melo] (e value: 4.27e-134; % identity: 96.06)
Query:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQN+YAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEP KCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAV EETK GDSIHSPTE QAEIGRNEDDNE TDDTD DNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE

Query:  NEK
        NE+
Subjt:  NEK

XP_022954477.1 glutamic acid-rich protein isoform X1 [Cucurbita moschata] (e value: 1.51e-112; % identity: 84.47)
Query:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQNKYAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEP KCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDD---NESTDDTDGDNYENEDEH
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA +EE+K GDSI S TE QA +GR EDD   NESTDDTD D  E+EDE 
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDD---NESTDDTDGDNYENEDEH

Query:  ECENEK
         CENE+
Subjt:  ECENEK

XP_023538167.1 glutamic acid-rich protein [Cucurbita pepo subsp. pepo] (e value: 1.51e-112; % identity: 84.47)
Query:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQNKYAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEP KCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNE---DDNESTDDTDGDNYENEDEH
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA +EE+K GDSI S TE QA +GR E   DDNESTDDTD D  E+EDE 
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNE---DDNESTDDTDGDNYENEDEH

Query:  ECENEK
         CENE+
Subjt:  ECENEK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KSJ8 Uncharacterized protein (e value: 6.71e-145; % identity: 100)
Query:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE

Query:  NEKDEDGKE
        NEKDEDGKE
Subjt:  NEKDEDGKE

A0A1S3CAL0 uncharacterized protein LOC103498541 (e value: 2.07e-134; % identity: 96.06)
Query:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQN+YAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEP KCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAV EETK GDSIHSPTE QAEIGRNEDDNE TDDTD DNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE

Query:  NEK
        NE+
Subjt:  NEK

A0A6J1BX54 uncharacterized protein LOC111006305 (e value: 8.52e-103; % identity: 80.71)
Query:  KQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCRVDQ
        K GPPKHQN+YAWKPNAG KINETEVGGRFRPLS ITGVCLRCKDQIDWKRRYGKYKPL+EP KCQLCSKR VRQAYHNLCPGCAKEQGVCAKCRCRVD 
Subjt:  KQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCRVDQ

Query:  TVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE
        TVGRD SEVEAEQKMLQEAI+NARERD+RTLLRAM KGK+K+S+K+KSAV+EETK GD   S  E  A++GR EDDN+ TD ++ D+ ENEDE E E
Subjt:  TVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECE

A0A6J1GR32 glutamic acid-rich protein isoform X1 (e value: 7.29e-113; % identity: 84.47)
Query:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQNKYAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEP KCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDD---NESTDDTDGDNYENEDEH
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA +EE+K GDSI S TE QA +GR EDD   NESTDDTD D  E+EDE 
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDD---NESTDDTDGDNYENEDEH

Query:  ECENEK
         CENE+
Subjt:  ECENEK

A0A6J1JP90 ribosome biogenesis protein BOP1 homolog isoform X1 (e value: 1.00e-112; % identity: 84.39)
Query:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        M++K G PKHQNKYAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEP KCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNE--DDNESTDDTDGDNYENEDEHE
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA +EE+K GDSI S TE QA +GR E  DDNESTD TD D YE+EDE  
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNE--DDNESTDDTDGDNYENEDEHE

Query:  CENEK
        CENE+
Subjt:  CENEK

SwissProt top hits (e value, % identity, alignment)
Q68FU5 Uncharacterized protein C9orf85 homolog (e value: 3.3e-14; % identity: 36.54)
Query:  MSNKQG------PPKHQNKYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGV
        MS+++G      P KHQN + +K +   + +   ++  +        GVC RCK+ ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA +  V
Subjt:  MSNKQG------PPKHQNKYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGV

Query:  CAKC
        CAKC
Subjt:  CAKC

Q96MD7 Uncharacterized protein C9orf85 (e value: 8.6e-15; % identity: 36.7)
Query:  MSNKQG------PPKHQNKYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGV
        MS+++G      P KHQN +++K +   + +   ++  +        GVC RCK+ ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA E  V
Subjt:  MSNKQG------PPKHQNKYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGV

Query:  CAKCRCRVD
        CAKC  + D
Subjt:  CAKCRCRVD

Q9CQ90 Uncharacterized protein C9orf85 homolog (e value: 1.1e-14; % identity: 37.5)
Query:  MSNKQG------PPKHQNKYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGV
        MS+++G      P KHQN + +K +   + +   ++  +        GVC RCK+ ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA E  V
Subjt:  MSNKQG------PPKHQNKYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGV

Query:  CAKC
        CAKC
Subjt:  CAKC

Arabidopsis top hits (e value, % identity, alignment)
AT3G02220.1 unknown protein (e value: 2.2e-58; % identity: 56.82)
Query:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        M ++QGPPKHQNK+AW P AG KINETEVGGRFRPLS+ITGVC RC++QI WKR+YGKYK L+E TKCQ C+KRNVRQAYH LCPGCAKEQ VCAKC   
Subjt:  MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPT---------EVQAEIGRNEDDNESTDDTDG---
        VDQ +GRD+ EVEAEQK+L E IKNARERDRRTLLRAM K    + +  +++  + +K GD   S +          V   IG     + + DD  G   
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPT---------EVQAEIGRNEDDNESTDDTDG---

Query:  --DNYENEDEHECENEKDED
          D+   +DEH+   + DE+
Subjt:  --DNYENEDEHECENEKDED
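
In the alignments above, the unlabeled middle line of each block marks identical residues with the residue letter, conservative substitutions with "+", and mismatches or gaps with a space. The following is a minimal Python sketch of how identities and positives could be tallied from one such block. The function name `midline_stats` is hypothetical, and the example uses only the short final block ("CENEK" / "CENE+") rather than a full alignment, so it does not reproduce the % identity values reported for any hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Count identities, positives, and aligned columns for one alignment block.

    Assumes BLAST-style midline conventions: a letter means an identical
    residue, '+' means a conservative substitution, a space means a
    mismatch or a gap column.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for m in midline if m.isalpha())
    # Positives include identities plus conservative substitutions.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


identities, positives, length = midline_stats("CENEK", "CENE+")
print(identities, positives, length)          # 4 5 5
print(round(100.0 * identities / length, 2))  # 80.0
```

Summing the counts over every block of a hit, then dividing identities by the total number of aligned columns, would give a whole-alignment percentage under the same assumptions.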


Sequences
CDS sequence
ATGAGCAACAAGCAGGGCCCTCCCAAGCACCAGAACAAATACGCTTGGAAACCCAACGCCGGCCGCAAAATCAACGAAACGGAGGTTGGAGGTAGGTTCCGCCCCTTATC
TGACATCACCGGAGTTTGTCTCCGTTGTAAGGACCAAATTGATTGGAAACGCCGTTACGGAAAGTACAAACCTCTTTCTGAACCTACTAAATGTCAATTGTGTTCAAAGC
GGAATGTTCGTCAAGCGTATCACAATCTCTGCCCCGGTTGTGCCAAGGAGCAAGGTGTATGTGCAAAATGTCGTTGTCGTGTAGATCAAACTGTTGGAAGGGATTTGTCT
GAAGTGGAAGCTGAGCAAAAGATGCTTCAAGAGGCAATAAAGAATGCTCGAGAAAGGGATCGTAGAACTCTGTTACGTGCTATGGAGAAAGGAAAAGCTAAGAGTTCAAA
TAAAAACAAATCAGCAGTTGAAGAAGAAACGAAGGATGGAGATTCAATTCATTCACCAACTGAAGTGCAAGCTGAAATAGGTCGAAATGAGGATGATAATGAAAGTACAG
ACGACACGGATGGAGATAACTACGAAAATGAAGATGAACATGAATGTGAAAATGAGAAAGATGAAGATGGGAAAGAGTAG
mRNA sequence
ATGAGCAACAAGCAGGGCCCTCCCAAGCACCAGAACAAATACGCTTGGAAACCCAACGCCGGCCGCAAAATCAACGAAACGGAGGTTGGAGGTAGGTTCCGCCCCTTATC
TGACATCACCGGAGTTTGTCTCCGTTGTAAGGACCAAATTGATTGGAAACGCCGTTACGGAAAGTACAAACCTCTTTCTGAACCTACTAAATGTCAATTGTGTTCAAAGC
GGAATGTTCGTCAAGCGTATCACAATCTCTGCCCCGGTTGTGCCAAGGAGCAAGGTGTATGTGCAAAATGTCGTTGTCGTGTAGATCAAACTGTTGGAAGGGATTTGTCT
GAAGTGGAAGCTGAGCAAAAGATGCTTCAAGAGGCAATAAAGAATGCTCGAGAAAGGGATCGTAGAACTCTGTTACGTGCTATGGAGAAAGGAAAAGCTAAGAGTTCAAA
TAAAAACAAATCAGCAGTTGAAGAAGAAACGAAGGATGGAGATTCAATTCATTCACCAACTGAAGTGCAAGCTGAAATAGGTCGAAATGAGGATGATAATGAAAGTACAG
ACGACACGGATGGAGATAACTACGAAAATGAAGATGAACATGAATGTGAAAATGAGAAAGATGAAGATGGGAAAGAGTAG
Protein sequence
MSNKQGPPKHQNKYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPTKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCRVDQTVGRDLS
EVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVEEETKDGDSIHSPTEVQAEIGRNEDDNESTDDTDGDNYENEDEHECENEKDEDGKE
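
The protein sequence above corresponds to the conceptual translation of the CDS sequence. As a minimal sketch, assuming the standard nuclear genetic code, the Python snippet below translates the first ten codons and the last four codons of the CDS and compares them with the start and end of the protein. The names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS sequence listed above.
print(translate("ATGAGCAACAAGCAGGGCCCTCCCAAGCAC"))  # MSNKQGPPKH
print(translate("GGGAAAGAGTAG"))                    # GKE
```

Passing the full CDS (with its line breaks joined) to `translate` would allow the same comparison against the complete protein sequence.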