; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0008901 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0008901
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Descriptionglutamic acid-rich protein isoform X1
Genome locationchr09:1659632..1662489
RNA-Seq ExpressionIVF0008901
SyntenyIVF0008901
Gene Ontology termsNA
InterPro domainsIPR019351 - Protein of unknown function DUF2039


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033212.1 hypothetical protein SDJN02_07266 [Cucurbita argyrosperma subsp. argyrosperma]1.34e-11284.13Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQN+YAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDD--NEITDDTDDDNYENEDEHE
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA  EE+K GDSI S TE+QA +GR EDD  NE TDDTD+D  E+EDE  
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDD--NEITDDTDDDNYENEDEHE

Query:  CENEENGK
        CENEE  K
Subjt:  CENEENGK

XP_004141470.1 uncharacterized protein LOC101206376 [Cucumis sativus]1.72e-13395.22Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQN+YAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEP KCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAV EETK GDSIHSPTE QAEIGRNEDDNE TDDTD DNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECE

Query:  NE--ENGKE
        NE  E+GKE
Subjt:  NE--ENGKE

XP_008459394.1 PREDICTED: uncharacterized protein LOC103498541 [Cucumis melo]1.33e-14199.52Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECE

Query:  NEENGKE
        NEEN KE
Subjt:  NEENGKE

XP_022954477.1 glutamic acid-rich protein isoform X1 [Cucurbita moschata]1.98e-11283.73Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQN+YAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDD---NEITDDTDDDNYENEDEH
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA  EE+K GDSI S TE+QA +GR EDD   NE TDDTD+D  E+EDE 
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDD---NEITDDTDDDNYENEDEH

Query:  ECENEENGK
         CENEE  K
Subjt:  ECENEENGK

XP_023538167.1 glutamic acid-rich protein [Cucurbita pepo subsp. pepo]1.98e-11283.73Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQN+YAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNE---DDNEITDDTDDDNYENEDEH
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA  EE+K GDSI S TE+QA +GR E   DDNE TDDTD+D  E+EDE 
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNE---DDNEITDDTDDDNYENEDEH

Query:  ECENEENGK
         CENEE  K
Subjt:  ECENEENGK

TrEMBL top hitse value%identityAlignment
A0A0A0KSJ8 Uncharacterized protein1.7e-10395.22Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQN+YAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEP KCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAV EETK GDSIHSPTE QAEIGRNEDDNE TDDTD DNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECE

Query:  NE--ENGKE
        NE  E+GKE
Subjt:  NE--ENGKE

A0A1S3CAL0 uncharacterized protein LOC1034985411.2e-10999.52Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECE
        VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECE
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECE

Query:  NEENGKE
        NEEN KE
Subjt:  NEENGKE

A0A6J1BX54 uncharacterized protein LOC1110063052.9e-8282.32Show/hide
Query:  KQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCRVDQ
        K GPPKHQNRYAWKPNAG KINETEVGGRFRPLS ITGVCLRCKDQIDWKRRYGKYKPL+EPAKCQLCSKR VRQAYHNLCPGCAKEQGVCAKCRCRVD 
Subjt:  KQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCRVDQ

Query:  TVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECEN
        TVGRD SEVEAEQKMLQEAI+NARERD+RTLLRAM KGK+K+S+K+KSAV EETKVGD   S  E+ A++GR EDDN+ITD +++D+ ENEDE E E+
Subjt:  TVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECEN

A0A6J1GR32 glutamic acid-rich protein isoform X11.7e-8783.33Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        MS+K G PKHQN+YAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNE---DDNEITDDTDDDNYENEDEH
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA  EE+K GDSI S TE+QA +GR E   DDNE TDDTD+D  E+ED  
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNE---DDNEITDDTDDDNYENEDEH

Query:  ECENEENGKE
        ECENEE  K+
Subjt:  ECENEENGKE

A0A6J1JP90 ribosome biogenesis protein BOP1 homolog isoform X12.3e-8783.25Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        M++K G PKHQN+YAWKPNAGRKINETEVGGRFRPLS+ITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKR VRQAYHNLCPGCAK+QGVCAKCRCR
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNE--DDNEITDDTDDDNYENEDEHE
        VDQT+GRD+SEVEAEQKMLQEAIKNARERD+RTLLRAMEKGK+K+SNKNKSA  EE+K GDSI S TE+QA +GR E  DDNE TD TD+D YE+ED  E
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNE--DDNEITDDTDDDNYENEDEHE

Query:  CENEENGKE
        CENEE  K+
Subjt:  CENEENGKE

SwissProt top hitse value%identityAlignment
Q68FU5 Uncharacterized protein C9orf85 homolog3.2e-1436.54Show/hide
Query:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV
        MS+++G      P KHQN + +K +   + +   ++  +        GVC RCK+ ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA +  V
Subjt:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV

Query:  CAKC
        CAKC
Subjt:  CAKC

Q96MD7 Uncharacterized protein C9orf858.5e-1536.7Show/hide
Query:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV
        MS+++G      P KHQN +++K +   + +   ++  +        GVC RCK+ ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA E  V
Subjt:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV

Query:  CAKCRCRVD
        CAKC  + D
Subjt:  CAKCRCRVD

Q9CQ90 Uncharacterized protein C9orf85 homolog1.1e-1437.5Show/hide
Query:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV
        MS+++G      P KHQN + +K +   + +   ++  +        GVC RCK+ ++W+ +Y KYKPLS+P KC  C ++ V+ +YH +C  CA E  V
Subjt:  MSNKQG------PPKHQNRYAWKPNA-GRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGV

Query:  CAKC
        CAKC
Subjt:  CAKC

Arabidopsis top hitse value%identityAlignment
AT3G02220.1 unknown protein1.9e-5756.11Show/hide
Query:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR
        M ++QGPPKHQN++AW P AG KINETEVGGRFRPLS+ITGVC RC++QI WKR+YGKYK L+E  KCQ C+KRNVRQAYH LCPGCAKEQ VCAKC   
Subjt:  MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCR

Query:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPT-EDQAE--------IGRNEDDNEITDD-----T
        VDQ +GRD+ EVEAEQK+L E IKNARERDRRTLLRAM K    + +  +++ ++ +KVGD   S + E+ A         IG     +   DD     +
Subjt:  VDQTVGRDLSEVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPT-EDQAE--------IGRNEDDNEITDD-----T

Query:  DDDNYENEDEHECENEENGKE
        D+D+   +DEH+   + +  E
Subjt:  DDDNYENEDEHECENEENGKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAACAAGCAGGGCCCTCCCAAGCACCAAAACAGATACGCTTGGAAACCCAACGCCGGCCGCAAGATCAACGAAACGGAAGTTGGAGGTAGGTTCCGCCCCTTATC
TGACATCACCGGAGTTTGTCTCCGTTGCAAGGACCAAATTGATTGGAAACGCCGTTACGGAAAGTACAAACCTCTTTCTGAACCTGCTAAATGTCAATTGTGTTCAAAGC
GGAATGTTCGTCAAGCGTATCACAATCTCTGTCCCGGTTGTGCCAAGGAGCAAGGTGTATGTGCAAAATGTCGTTGTCGTGTAGATCAAACTGTTGGAAGGGACTTGTCT
GAAGTGGAGGCTGAGCAAAAGATGCTTCAAGAGGCCATAAAGAATGCTCGAGAAAGGGATCGTAGAACTCTGTTACGAGCTATGGAGAAAGGAAAAGCTAAGAGTTCAAA
TAAAAACAAATCAGCAGTTACAGAAGAAACTAAGGTTGGAGATTCAATTCATTCACCAACTGAAGATCAAGCTGAAATAGGTCGAAATGAGGATGATAATGAAATTACAG
ACGACACGGATGACGATAACTACGAAAACGAAGATGAACATGAATGTGAAAATGAAGAAAATGGTAAAGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCAACAAGCAGGGCCCTCCCAAGCACCAAAACAGATACGCTTGGAAACCCAACGCCGGCCGCAAGATCAACGAAACGGAAGTTGGAGGTAGGTTCCGCCCCTTATC
TGACATCACCGGAGTTTGTCTCCGTTGCAAGGACCAAATTGATTGGAAACGCCGTTACGGAAAGTACAAACCTCTTTCTGAACCTGCTAAATGTCAATTGTGTTCAAAGC
GGAATGTTCGTCAAGCGTATCACAATCTCTGTCCCGGTTGTGCCAAGGAGCAAGGTGTATGTGCAAAATGTCGTTGTCGTGTAGATCAAACTGTTGGAAGGGACTTGTCT
GAAGTGGAGGCTGAGCAAAAGATGCTTCAAGAGGCCATAAAGAATGCTCGAGAAAGGGATCGTAGAACTCTGTTACGAGCTATGGAGAAAGGAAAAGCTAAGAGTTCAAA
TAAAAACAAATCAGCAGTTACAGAAGAAACTAAGGTTGGAGATTCAATTCATTCACCAACTGAAGATCAAGCTGAAATAGGTCGAAATGAGGATGATAATGAAATTACAG
ACGACACGGATGACGATAACTACGAAAACGAAGATGAACATGAATGTGAAAATGAAGAAAATGGTAAAGAGTAG
Protein sequenceShow/hide protein sequence
MSNKQGPPKHQNRYAWKPNAGRKINETEVGGRFRPLSDITGVCLRCKDQIDWKRRYGKYKPLSEPAKCQLCSKRNVRQAYHNLCPGCAKEQGVCAKCRCRVDQTVGRDLS
EVEAEQKMLQEAIKNARERDRRTLLRAMEKGKAKSSNKNKSAVTEETKVGDSIHSPTEDQAEIGRNEDDNEITDDTDDDNYENEDEHECENEENGKE