; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0024033 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0024033
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Descriptionglutamic acid-rich protein isoform X1
Genome locationchr09:20235863..20238570
RNA-Seq ExpressionPI0024033
SyntenyPI0024033
Gene Ontology termsNA
InterPro domainsIPR019351 - Protein of unknown function DUF2039


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602535.1 Eukaryotic translation initiation factor 3 subunit M, partial [Cucurbita argyrosperma subsp. sororia]1.7e-8784.04Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQN+YAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNE--DDNEITDDTDDDNYENEDEHE
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA  EE+K GDSI S TEEQA +GR E  DDNE TDDTD+D  E+ED  E
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNE--DDNEITDDTDDDNYENEDEHE

Query:  CENEENDEDEDEK
        CENEE D+DE+E+
Subjt:  CENEENDEDEDEK

KAG7033212.1 hypothetical protein SDJN02_07266 [Cucurbita argyrosperma subsp. argyrosperma]1.7e-8784.04Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQN+YAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNE--DDNEITDDTDDDNYENEDEHE
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA  EE+K GDSI S TEEQA +GR E  DDNE TDDTD+D  E+ED  E
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNE--DDNEITDDTDDDNYENEDEHE

Query:  CENEENDEDEDEK
        CENEE D+DE+E+
Subjt:  CENEENDEDEDEK

XP_004141470.1 uncharacterized protein LOC101206376 [Cucumis sativus]5.3e-10294.34Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQN+YAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEP KCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNEDDNEITDDTDDDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAV+EETK GDSIHSPTE QAEIGRNEDDNE TDDTD DNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNEDDNEITDDTDDDNYENEDEHECE

Query:  NEENDEDEDEKE
        NE   +DED KE
Subjt:  NEENDEDEDEKE

XP_008459394.1 PREDICTED: uncharacterized protein LOC103498541 [Cucumis melo]7.2e-10798.07Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNEDDNEITDDTDDDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAV EETKVGDSIHSPTE+QAEIGRNEDDNEITDDTDDDNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNEDDNEITDDTDDDNYENEDEHECE

Query:  NEENDED
        NEEND++
Subjt:  NEENDED

XP_022954477.1 glutamic acid-rich protein isoform X1 [Cucurbita moschata]2.2e-8783.64Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQN+YAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNE---DDNEITDDTDDDNYENEDEH
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA  EE+K GDSI S TEEQA +GR E   DDNE TDDTD+D  E+ED  
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNE---DDNEITDDTDDDNYENEDEH

Query:  ECENEENDEDEDEK
        ECENEE D+DE+E+
Subjt:  ECENEENDEDEDEK

TrEMBL top hitse value%identityAlignment
A0A0A0KSJ8 Uncharacterized protein2.6e-10294.34Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQN+YAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEP KCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNEDDNEITDDTDDDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAV+EETK GDSIHSPTE QAEIGRNEDDNE TDDTD DNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNEDDNEITDDTDDDNYENEDEHECE

Query:  NEENDEDEDEKE
        NE   +DED KE
Subjt:  NEENDEDEDEKE

A0A1S3CAL0 uncharacterized protein LOC1034985413.5e-10798.07Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNEDDNEITDDTDDDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAV EETKVGDSIHSPTE+QAEIGRNEDDNEITDDTDDDNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNEDDNEITDDTDDDNYENEDEHECE

Query:  NEENDED
        NEEND++
Subjt:  NEENDED

A0A6J1BX54 uncharacterized protein LOC1110063057.8e-8383.33Show/hide
Query:  KQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCRVDQ
        K GPPKHQNRYAWKPNAG KINETEVGGRFRPLS ITGVCLRCKDQIDWKRRYGKYKPL+EPAKCQLCSKR VRQAYHNLCPGCAKEQGVCAKCRCRVD 
Subjt:  KQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCRVDQ

Query:  TVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNEDDNEITDDTDDDNYENEDEHECEN
        TVGRD SEVEAEQKMLQEAI+NARERD+RTLLRAM KGK+K+S+K+KSAVKEETKVGD   S  EE A++GR EDDN+ITD +++D+ ENEDE E E+
Subjt:  TVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNEDDNEITDDTDDDNYENEDEHECEN

A0A6J1GR32 glutamic acid-rich protein isoform X11.1e-8783.64Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQN+YAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNE---DDNEITDDTDDDNYENEDEH
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA  EE+K GDSI S TEEQA +GR E   DDNE TDDTD+D  E+ED  
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNE---DDNEITDDTDDDNYENEDEH

Query:  ECENEENDEDEDEK
        ECENEE D+DE+E+
Subjt:  ECENEENDEDEDEK

A0A6J1JP90 ribosome biogenesis protein BOP1 homolog isoform X11.4e-8783.57Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        M++K G PKHQN+YAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNE--DDNEITDDTDDDNYENEDEHE
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA  EE+K GDSI S TEEQA +GR E  DDNE TD TD+D YE+ED  E
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNE--DDNEITDDTDDDNYENEDEHE

Query:  CENEENDEDEDEK
        CENEE D+DE+E+
Subjt:  CENEENDEDEDEK

SwissProt top hitse value%identityAlignment
Q68FU5 Uncharacterized protein C9orf85 homolog3.3e-1436.54Show/hide
Query:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV
        MS+++G      P KHQN + +K +   + +   ++  +        GVC RCK+ ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA +  V
Subjt:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV

Query:  CAKC
        CAKC
Subjt:  CAKC

Q96MD7 Uncharacterized protein C9orf858.8e-1536.7Show/hide
Query:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV
        MS+++G      P KHQN +++K +   + +   ++  +        GVC RCK+ ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA E  V
Subjt:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV

Query:  CAKCRCRVD
        CAKC  + D
Subjt:  CAKCRCRVD

Q9CQ90 Uncharacterized protein C9orf85 homolog1.1e-1437.5Show/hide
Query:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV
        MS+++G      P KHQN + +K +   + +   ++  +        GVC RCK+ ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA E  V
Subjt:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV

Query:  CAKC
        CAKC
Subjt:  CAKC

Arabidopsis top hitse value%identityAlignment
AT3G02220.1 unknown protein1.2e-5656.89Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        M ++QGPPKHQN++AW P AG KINETEVGGRFRPLS+ITGVC RC++QI WKR+YGKYK L+E  KCQ C+KRNVRQAYH LCPGCAKEQ VCAKC   
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPT-EEQAE--------IGRNEDDNEITDD-----T
        VDQ +GRD+ EVEAEQK+L E IKNARERDRRTLLRAM K    + +  +++  + +KVGD   S + EE A         IG     +   DD     +
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPT-EEQAE--------IGRNEDDNEITDD-----T

Query:  DDDNYENEDEHECENEENDEDEDEK
        D+D+   +DEH+   E++DE+E  +
Subjt:  DDDNYENEDEHECENEENDEDEDEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAACAAGCAGGGCCCTCCCAAGCACCAAAACAGATACGCTTGGAAACCCAACGCCGGCCGGAAAATCAACGAAACGGAGGTTGGAGGTAGGTTCCGCCCCTTATC
TGACATCACCGGAGTTTGTCTCCGTTGCAAGGACCAAATTGATTGGAAACGCCGTTACGGCAAGTACAAACCTCTTTCTGAACCTGCTAAATGTCAATTGTGTTCAAAGC
GGAATGTTCGTCAAGCGTATCATAATCTCTGCCCCGGTTGTGCCAAGGAGCAAGGTGTATGTGCAAAATGTCGCTGTCGTGTAGATCAAACTGTTGGGAGGGATTTGTCT
GAAGTGGAGGCTGAGCAAAAGATGCTTCAAGAGGCCATAAAGAATGCTCGAGAAAGGGATCGTAGAACTCTATTACGTGCTATGGAGAAAGGAAAAGCTAAGAGTTCAAA
TAAAAACAAATCAGCAGTTAAAGAAGAAACGAAGGTTGGAGATTCAATTCATTCACCAACTGAAGAGCAAGCTGAAATAGGTCGAAATGAGGATGATAATGAAATTACAG
ACGACACGGATGACGATAACTACGAAAATGAAGATGAACATGAATGTGAAAATGAAGAAAATGATGAAGATGAAGACGAGAAAGAGATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAACAAGCAGGGCCCTCCCAAGCACCAAAACAGATACGCTTGGAAACCCAACGCCGGCCGGAAAATCAACGAAACGGAGGTTGGAGGTAGGTTCCGCCCCTTATC
TGACATCACCGGAGTTTGTCTCCGTTGCAAGGACCAAATTGATTGGAAACGCCGTTACGGCAAGTACAAACCTCTTTCTGAACCTGCTAAATGTCAATTGTGTTCAAAGC
GGAATGTTCGTCAAGCGTATCATAATCTCTGCCCCGGTTGTGCCAAGGAGCAAGGTGTATGTGCAAAATGTCGCTGTCGTGTAGATCAAACTGTTGGGAGGGATTTGTCT
GAAGTGGAGGCTGAGCAAAAGATGCTTCAAGAGGCCATAAAGAATGCTCGAGAAAGGGATCGTAGAACTCTATTACGTGCTATGGAGAAAGGAAAAGCTAAGAGTTCAAA
TAAAAACAAATCAGCAGTTAAAGAAGAAACGAAGGTTGGAGATTCAATTCATTCACCAACTGAAGAGCAAGCTGAAATAGGTCGAAATGAGGATGATAATGAAATTACAG
ACGACACGGATGACGATAACTACGAAAATGAAGATGAACATGAATGTGAAAATGAAGAAAATGATGAAGATGAAGACGAGAAAGAGATGTAA
Protein sequenceShow/hide protein sequence
MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCRVDQTVGRDLS
EVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVKEETKVGDSIHSPTEEQAEIGRNEDDNEITDDTDDDNYENEDEHECENEENDEDEDEKEM