; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0003282 (gene) of Chayote v1 genome

Gene IDSed0003282
OrganismSechium edule (Chayote v1)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationLG01:63971315..63973669
RNA-Seq ExpressionSed0003282
SyntenySed0003282
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042546.1 uncharacterized protein E6C27_scaffold44G00400 [Cucumis melo var. makuwa]2.0e-3560.11Show/hide
Query:  MWPEPPLTADQ-TSAPTLALSWLPP-EPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPA-ELPPPPRLL-IEAKQS-ALLSPTTVLDGPE
        M  EPP T    TS PTL LSWLP  + PWMPTPPRL L SVPFLWEEAPGKPRPS  S++ W  P  A +LPPPPRLL  E KQS  L SPTTVLDG E
Subjt:  MWPEPPLTADQ-TSAPTLALSWLPP-EPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPA-ELPPPPRLL-IEAKQS-ALLSPTTVLDGPE

Query:  RGG-DGSKRWSSFRMCKELVGGGNDSFAAGFDGGGGGRRFGRGR-AQSLSVSSYG-RSHFLVKMYKSFKHILSRRGRS
        RGG  GSKRW SFRMC+              + GGG  RF RGR A SLS+SSY   SHFLV +Y+SFK +L  R R+
Subjt:  RGG-DGSKRWSSFRMCKELVGGGNDSFAAGFDGGGGGRRFGRGR-AQSLSVSSYG-RSHFLVKMYKSFKHILSRRGRS

KAG6605789.1 hypothetical protein SDJN03_03106, partial [Cucurbita argyrosperma subsp. sororia]1.4e-5571.76Show/hide
Query:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG
        MW EPP T +++S P+L  S LPPEPPWMPTPPRL LASVPFLWEEAPGKPRP AGSE+ W SP   +LPPPPRLL EA QS LLSPTTVLDGPER   G
Subjt:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG

Query:  SKRWSSFRMCKELVGGGNDSFAAGFDG-GGGGRRFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILSRRGR
        SKRW SFRMCKELVGGGNDS A   DG GGG  RFG+    SLSVSSYGRSHFLVK+Y+ FK +   R R
Subjt:  SKRWSSFRMCKELVGGGNDSFAAGFDG-GGGGRRFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILSRRGR

XP_022958693.1 uncharacterized protein LOC111459844 [Cucurbita moschata]2.1e-5672.35Show/hide
Query:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG
        MW EPP T +++S P+L  S LPPEPPWMPTPPRL LASVPFLWEEAPGKPRP AGSE+ W SP   +LPPPPRLL EA QS LLSPTTVLDGPER   G
Subjt:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG

Query:  SKRWSSFRMCKELVGGGNDSFAAGFDGGGGGR-RFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILSRRGR
        SKRW SFRMCKELVGGGNDS A   DGGGGG  RFG+    SLSVSSYGRSHFLVK+Y+ FK +   R R
Subjt:  SKRWSSFRMCKELVGGGNDSFAAGFDGGGGGR-RFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILSRRGR

XP_022995989.1 uncharacterized protein At4g00950-like [Cucurbita maxima]1.9e-5772.94Show/hide
Query:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG
        MW EPP T +++S P+L LS LPPEPPWMPTPPRL LASVPFLWEEAPGKPRP AGSE+ W SP   +LPPPPRLL EA QS LLSPTTVLDGPER   G
Subjt:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG

Query:  SKRWSSFRMCKELVGGGNDSFAAGFDGGGGGR-RFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILSRRGR
        SKRW SFRMCKELV GGNDS A G DGGGGG  RFG+    SLSVSSYGRSHFLVK+Y+ FK +   R R
Subjt:  SKRWSSFRMCKELVGGGNDSFAAGFDGGGGGR-RFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILSRRGR

XP_038875428.1 uncharacterized protein LOC120067888 [Benincasa hispida]1.9e-4971.79Show/hide
Query:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG
        M  EPP T  +TS PTL LSWLPPEPPWMPTPPRL LASVPFLWEEAPGKPR SA S++ WP PA  ELPPPP+L  EAKQS L SPTT+LDGPER GD 
Subjt:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG

Query:  SKRWSSFRMCKELVGG---GNDSFAAGFDGGGGGRRFGRGRAQSLSVSSYGRSHFL
        SKRW SFRM K+ VGG   GNDS AAG D GGG   F R R  S SVSSY RSHFL
Subjt:  SKRWSSFRMCKELVGG---GNDSFAAGFDGGGGGRRFGRGRAQSLSVSSYGRSHFL

TrEMBL top hitse value%identityAlignment
A0A1S3AU39 uncharacterized protein LOC1034829691.1e-3459.55Show/hide
Query:  MWPEPPLTADQ-TSAPTLALSWLPP-EPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPA-ELPPPPRLL-IEAKQS-ALLSPTTVLDGPE
        M  EPP T    TS PTL LSWLP  + PWMPTPPRL L SVPFLWEEAPGKPRPS  S++ W  P  A +LPPPPRLL  E KQS  L SPTTVLDG E
Subjt:  MWPEPPLTADQ-TSAPTLALSWLPP-EPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPA-ELPPPPRLL-IEAKQS-ALLSPTTVLDGPE

Query:  RGG-DGSKRWSSFRMCKELVGGGNDSFAAGFDGGGGGRRFGRGR-AQSLSVSSYGRSH-FLVKMYKSFKHILSRRGRS
        RGG  GSKRW SFRMC+              + GGG  RF RGR A SLS+SSY  S  FLV +Y+SFK +L  R R+
Subjt:  RGG-DGSKRWSSFRMCKELVGGGNDSFAAGFDGGGGGRRFGRGR-AQSLSVSSYGRSH-FLVKMYKSFKHILSRRGRS

A0A5A7TKS5 Uncharacterized protein9.9e-3660.11Show/hide
Query:  MWPEPPLTADQ-TSAPTLALSWLPP-EPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPA-ELPPPPRLL-IEAKQS-ALLSPTTVLDGPE
        M  EPP T    TS PTL LSWLP  + PWMPTPPRL L SVPFLWEEAPGKPRPS  S++ W  P  A +LPPPPRLL  E KQS  L SPTTVLDG E
Subjt:  MWPEPPLTADQ-TSAPTLALSWLPP-EPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPA-ELPPPPRLL-IEAKQS-ALLSPTTVLDGPE

Query:  RGG-DGSKRWSSFRMCKELVGGGNDSFAAGFDGGGGGRRFGRGR-AQSLSVSSYG-RSHFLVKMYKSFKHILSRRGRS
        RGG  GSKRW SFRMC+              + GGG  RF RGR A SLS+SSY   SHFLV +Y+SFK +L  R R+
Subjt:  RGG-DGSKRWSSFRMCKELVGGGNDSFAAGFDGGGGGRRFGRGR-AQSLSVSSYG-RSHFLVKMYKSFKHILSRRGRS

A0A6J1DV06 uncharacterized protein At4g00950-like2.9e-3555.56Show/hide
Query:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAP---AELPPPPRLLIEAKQSALLSPTTVLDGPE--
        MW EPP   + +S PTL+LS L PEPPWMPTPPRL LAS+PFLWEEAPGKPR  AGS+     P P     L  PPRL     Q+AL SPTTVLDGPE  
Subjt:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAP---AELPPPPRLLIEAKQSALLSPTTVLDGPE--

Query:  -RGGDGSKRWSSFRMCKELVGGGNDSFAAGFDGGGGGRRFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILS
          GG  SKRW SFR CKE+              GGGG R  R R   LSVSSY +SHFLV +Y+ FK +++
Subjt:  -RGGDGSKRWSSFRMCKELVGGGNDSFAAGFDGGGGGRRFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILS

A0A6J1H3S6 uncharacterized protein LOC1114598441.0e-5672.35Show/hide
Query:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG
        MW EPP T +++S P+L  S LPPEPPWMPTPPRL LASVPFLWEEAPGKPRP AGSE+ W SP   +LPPPPRLL EA QS LLSPTTVLDGPER   G
Subjt:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG

Query:  SKRWSSFRMCKELVGGGNDSFAAGFDGGGGGR-RFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILSRRGR
        SKRW SFRMCKELVGGGNDS A   DGGGGG  RFG+    SLSVSSYGRSHFLVK+Y+ FK +   R R
Subjt:  SKRWSSFRMCKELVGGGNDSFAAGFDGGGGGR-RFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILSRRGR

A0A6J1K7H0 uncharacterized protein At4g00950-like9.2e-5872.94Show/hide
Query:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG
        MW EPP T +++S P+L LS LPPEPPWMPTPPRL LASVPFLWEEAPGKPRP AGSE+ W SP   +LPPPPRLL EA QS LLSPTTVLDGPER   G
Subjt:  MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDG

Query:  SKRWSSFRMCKELVGGGNDSFAAGFDGGGGGR-RFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILSRRGR
        SKRW SFRMCKELV GGNDS A G DGGGGG  RFG+    SLSVSSYGRSHFLVK+Y+ FK +   R R
Subjt:  SKRWSSFRMCKELVGGGNDSFAAGFDGGGGGR-RFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILSRRGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G27810.1 unknown protein5.0e-0836.65Show/hide
Query:  EPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAE---------LPPPPRLLIEAKQSALLSPTTVLDGPERGGDGSKRWSSFRMCKELVG
        + P + TPP     SVPFLWEEAPGKPR S  ++         E         L  PPRL   A      SPTTVLDGP    D  +R  S     E   
Subjt:  EPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAE---------LPPPPRLLIEAKQSALLSPTTVLDGPERGGDGSKRWSSFRMCKELVG

Query:  GGNDSFAAG-----FDGGGGGR-RFGRGRAQ-SLSVSSYGRSHFLVKMYKSFKHILSRRGR
         G   F+        DGGGG   +  R R + SL   S+ +S FL ++Y+ FK ++  R R
Subjt:  GGNDSFAAG-----FDGGGGGR-RFGRGRAQ-SLSVSSYGRSHFLVKMYKSFKHILSRRGR

AT5G51680.1 hydroxyproline-rich glycoprotein family protein4.4e-0440Show/hide
Query:  MWPEPPLTADQTSAPTLA--LSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPP
        +W E P    +   P+LA  +   PP PP +P P +L + SVPF WEE PGKP P++ ++       P +LP PP
Subjt:  MWPEPPLTADQTSAPTLA--LSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPP

AT5G53030.1 unknown protein7.3e-0730.41Show/hide
Query:  PWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSAL--LSPTTVLDGP------------------------------
        P + TPP     SVPFLWEEAPGKPR                L  PPRL++  + + +   SPTTVLDGP                              
Subjt:  PWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSAL--LSPTTVLDGP------------------------------

Query:  -ERGGDGSKRWSSFRMCKELVGG----------GNDSFAAGFDGGGGGRRFGRGRAQSLSVSSYG---------RSHFLVKM----YKSFKHIL
         ER   GS RW SF  CKE+  G          G D       GGG G   G  + +   +   G         +S F +KM    Y+ FK ++
Subjt:  -ERGGDGSKRWSSFRMCKELVGG----------GNDSFAAGFDGGGGGRRFGRGRAQSLSVSSYG---------RSHFLVKM----YKSFKHIL

AT5G53030.2 unknown protein1.2e-0630.65Show/hide
Query:  PWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSAL--LSPTTVLDGP------------------------------
        P + TPP     SVPFLWEEAPGKPR                L  PPRL++  + + +   SPTTVLDGP                              
Subjt:  PWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSAL--LSPTTVLDGP------------------------------

Query:  -ERGGDGSKRWSSFRMCKELVGG----------GNDSFAAGFDGGGGGRRFGRGRAQSLSVSSYG---------RSHFLVKMYKSF
         ER   GS RW SF  CKE+  G          G D       GGG G   G  + +   +   G         +S F V  Y+ F
Subjt:  -ERGGDGSKRWSSFRMCKELVGG----------GNDSFAAGFDGGGGGRRFGRGRAQSLSVSSYG---------RSHFLVKMYKSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGCCGGAGCCGCCGCTCACGGCGGATCAGACCTCTGCCCCAACGCTGGCTCTGTCCTGGCTTCCGCCGGAGCCGCCGTGGATGCCCACGCCCCCTCGTTTGGCTCT
GGCCTCCGTTCCCTTTCTGTGGGAGGAGGCGCCCGGGAAGCCCCGCCCCTCCGCCGGATCGGAACTGTCGTGGCCGTCGCCGGCGCCGGCGGAGCTGCCTCCGCCGCCCC
GGTTGTTGATTGAGGCGAAACAGAGTGCTCTGCTTTCGCCGACGACGGTGCTCGACGGCCCGGAAAGGGGCGGCGATGGATCGAAGAGATGGAGTTCTTTCAGGATGTGT
AAGGAGCTTGTCGGCGGCGGCAATGACTCTTTCGCCGCCGGTTTCGACGGCGGCGGTGGAGGCAGGAGGTTTGGCCGGGGGAGGGCTCAATCATTGTCGGTTTCTTCTTA
TGGAAGGTCCCACTTTTTGGTGAAGATGTATAAGAGCTTTAAACATATTCTCTCAAGGAGGGGAAGATCAATATCATAA
mRNA sequenceShow/hide mRNA sequence
GACCAGTATTGCTCTCGTTTATTTTGATATTTATTCTACAAAACACATTTGTTTTTGGAAAAAAAAAACGATCTCCACCATATCCATGTAGTTTGACTTTTATTCAAAGC
CGCCGTCACTCACTCGCATCATTGTCCCTCCATTATTACCATTCATGAATTTCCCTCCCTCAAATTCAATCTCCCCATTAAAATCTCAACCACTCCACTCCTAAAATACC
ACTCTCCCTACACTAACATCCATTTTACCAACCAAATTTCCTTTCTTTTCCATCACCCATGTGGCCGGAGCCGCCGCTCACGGCGGATCAGACCTCTGCCCCAACGCTGG
CTCTGTCCTGGCTTCCGCCGGAGCCGCCGTGGATGCCCACGCCCCCTCGTTTGGCTCTGGCCTCCGTTCCCTTTCTGTGGGAGGAGGCGCCCGGGAAGCCCCGCCCCTCC
GCCGGATCGGAACTGTCGTGGCCGTCGCCGGCGCCGGCGGAGCTGCCTCCGCCGCCCCGGTTGTTGATTGAGGCGAAACAGAGTGCTCTGCTTTCGCCGACGACGGTGCT
CGACGGCCCGGAAAGGGGCGGCGATGGATCGAAGAGATGGAGTTCTTTCAGGATGTGTAAGGAGCTTGTCGGCGGCGGCAATGACTCTTTCGCCGCCGGTTTCGACGGCG
GCGGTGGAGGCAGGAGGTTTGGCCGGGGGAGGGCTCAATCATTGTCGGTTTCTTCTTATGGAAGGTCCCACTTTTTGGTGAAGATGTATAAGAGCTTTAAACATATTCTC
TCAAGGAGGGGAAGATCAATATCATAATTAACTTCAAAACTATGGCATGCAAGTAATGTTTATAAGCACGTGATAAATTTTAATTATCTTAGCTTACAAATTAGAACCAA
GTTGTAAGCTTAAACAAATGTATATGCGTATGTAATGTTTATGTATGTATTTGAGATTTACTTTCAAATAATTTAGTTTGTACTTGAGGAAAATTACATTTTTAGTTTTT
AGTTTTTAGGTTTAAA
Protein sequenceShow/hide protein sequence
MWPEPPLTADQTSAPTLALSWLPPEPPWMPTPPRLALASVPFLWEEAPGKPRPSAGSELSWPSPAPAELPPPPRLLIEAKQSALLSPTTVLDGPERGGDGSKRWSSFRMC
KELVGGGNDSFAAGFDGGGGGRRFGRGRAQSLSVSSYGRSHFLVKMYKSFKHILSRRGRSIS