; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0004265 (gene) of Chayote v1 genome

Gene IDSed0004265
OrganismSechium edule (Chayote v1)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationLG04:44739632..44740419
RNA-Seq ExpressionSed0004265
SyntenySed0004265
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB2087956.1 hypothetical protein ES319_A04G141000v1 [Gossypium barbadense]1.4e-3747.89Show/hide
Query:  PTPKP-PSPPPENTAESDAGDNPTLKPGPRRRTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRN
        P P P P P P             L+ G       PYPW+    A + +L +L S     ITGDV+C +C+++Y++G DL TKF  I+T+IAQNK    +
Subjt:  PTPKP-PSPPPENTAESDAGDNPTLKPGPRRRTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRN

Query:  RAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNP
        RAPS W  PVL  C FC Q N  KPVI  K  K+INWLFL LGQLLG  TL Q+KYFC H K   T   K+ +LY TYL LCKQL P+ P
Subjt:  RAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNP

PPD69681.1 hypothetical protein GOBAR_DD33434 [Gossypium barbadense]1.4e-3747.03Show/hide
Query:  ENTMESSAGEDPTPKPPSPPPENTAESDAGDNPT--LKPGPRRRTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRIS
        E T  ++ G+D T   PSP P  +       N T  L+ G       PYPW+    A + +L +L S     ITGDV+C +C+++Y++G DL TKF  I+
Subjt:  ENTMESSAGEDPTPKPPSPPPENTAESDAGDNPT--LKPGPRRRTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRIS

Query:  TFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPS
        T+IAQNK    +RAPS W  PVL  C FC Q N  KPVI  K  K+INWLFL LGQ+LG  TL Q+KYFC H K   T   K+ +LY TYL LCKQL P+
Subjt:  TFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPS

Query:  NP
         P
Subjt:  NP

XP_004246107.1 uncharacterized protein LOC101258478 [Solanum lycopersicum]1.1e-3740.53Show/hide
Query:  HTTVETAIGETPTPKPPPPPPENTMESSAGEDPTPKPPSPPPENTAESDAGDNPTLKPGPRRRTKR------PYPWSRKRGARIQSLTHLQSNHFDVITG
        H+ +  ++  +P+P   PPPP               PP PPP           P+ +P  R+RT +      PYPW+    A+I SL  L+ N    ITG
Subjt:  HTTVETAIGETPTPKPPPPPPENTMESSAGEDPTPKPPSPPPENTAESDAGDNPTLKPGPRRRTKR------PYPWSRKRGARIQSLTHLQSNHFDVITG

Query:  DVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKI
        +V+C +C+++Y++G DL  KF ++ +FI+ NK     RAP  W  P+ L+CNFC+Q N VKP+I  K  K+INW+FL LGQ +G  TL Q+KYFC HN+I
Subjt:  DVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKI

Query:  SLTSRPKNCLLYHTYLALCKQLQPSNP
          T   K+ +LY TYL LC+QL  + P
Subjt:  SLTSRPKNCLLYHTYLALCKQLQPSNP

XP_015084832.1 uncharacterized protein LOC107028319 [Solanum pennellii]1.1e-3740.53Show/hide
Query:  HTTVETAIGETPTPKPPPPPPENTMESSAGEDPTPKPPSPPPENTAESDAGDNPTLKPGPRRRTKR------PYPWSRKRGARIQSLTHLQSNHFDVITG
        H+ +  ++  +P+P   PPPP               PP PPP           P+ +P  R+RT +      PYPW+    A+I SL  L+ N    ITG
Subjt:  HTTVETAIGETPTPKPPPPPPENTMESSAGEDPTPKPPSPPPENTAESDAGDNPTLKPGPRRRTKR------PYPWSRKRGARIQSLTHLQSNHFDVITG

Query:  DVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKI
        +V+C +C+++Y++G DL  KF ++ +FI+ NK     RAP  W  P+ L+CNFC+Q N VKP+I  K  K+INW FL LGQ +G  TL Q+KYFC HN+I
Subjt:  DVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKI

Query:  SLTSRPKNCLLYHTYLALCKQLQPSNP
          T   K+ +LY TYL LC+QL  + P
Subjt:  SLTSRPKNCLLYHTYLALCKQLQPSNP

XP_022135937.1 uncharacterized protein LOC111007768 [Momordica charantia]3.4e-3950.31Show/hide
Query:  KRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHK
        K PYPWS +  A +  L +L+ N    ITGDV+C +C+++Y +  DL TKF+ I++FI +NK    +RAP SW  P  L C  C + NCV+P I   D+K
Subjt:  KRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHK

Query:  NINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNPLF
        NINWLFL LGQ++GRL L  +KYFC +     T   KN L+Y TYL LCKQLQPS  LF
Subjt:  NINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNPLF

TrEMBL top hitse value%identityAlignment
A0A1U8NI96 uncharacterized protein LOC1079474199.1e-3847.89Show/hide
Query:  PTPKP-PSPPPENTAESDAGDNPTLKPGPRRRTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRN
        P P P P P P             L+ G       PYPW+    A + +L +L S     ITGDV+C +C+++Y++G DL TKF  I+T+IAQNK    +
Subjt:  PTPKP-PSPPPENTAESDAGDNPTLKPGPRRRTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRN

Query:  RAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNP
        RAPS W  PVL  C FC Q N  KPVI  K  K+INWLFL LGQLLG  TL Q+KYFC H K   T   K+ +LY TYL LCKQL P+ P
Subjt:  RAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNP

A0A3Q7HWE7 Uncharacterized protein5.3e-3840.53Show/hide
Query:  HTTVETAIGETPTPKPPPPPPENTMESSAGEDPTPKPPSPPPENTAESDAGDNPTLKPGPRRRTKR------PYPWSRKRGARIQSLTHLQSNHFDVITG
        H+ +  ++  +P+P   PPPP               PP PPP           P+ +P  R+RT +      PYPW+    A+I SL  L+ N    ITG
Subjt:  HTTVETAIGETPTPKPPPPPPENTMESSAGEDPTPKPPSPPPENTAESDAGDNPTLKPGPRRRTKR------PYPWSRKRGARIQSLTHLQSNHFDVITG

Query:  DVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKI
        +V+C +C+++Y++G DL  KF ++ +FI+ NK     RAP  W  P+ L+CNFC+Q N VKP+I  K  K+INW+FL LGQ +G  TL Q+KYFC HN+I
Subjt:  DVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKI

Query:  SLTSRPKNCLLYHTYLALCKQLQPSNP
          T   K+ +LY TYL LC+QL  + P
Subjt:  SLTSRPKNCLLYHTYLALCKQLQPSNP

A0A5J5W9K1 Uncharacterized protein6.9e-3847.89Show/hide
Query:  PTPKP-PSPPPENTAESDAGDNPTLKPGPRRRTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRN
        P P P P P P             L+ G       PYPW+    A + +L +L S     ITGDV+C +C+++Y++G DL TKF  I+T+IAQNK    +
Subjt:  PTPKP-PSPPPENTAESDAGDNPTLKPGPRRRTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRN

Query:  RAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNP
        RAPS W  PVL  C FC Q N  KPVI  K  K+INWLFL LGQLLG  TL Q+KYFC H K   T   K+ +LY TYL LCKQL P+ P
Subjt:  RAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNP

A0A6J1C462 uncharacterized protein LOC1110077681.7e-3950.31Show/hide
Query:  KRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHK
        K PYPWS +  A +  L +L+ N    ITGDV+C +C+++Y +  DL TKF+ I++FI +NK    +RAP SW  P  L C  C + NCV+P I   D+K
Subjt:  KRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHK

Query:  NINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNPLF
        NINWLFL LGQ++GRL L  +KYFC +     T   KN L+Y TYL LCKQLQPS  LF
Subjt:  NINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNPLF

A0A6P4M137 uncharacterized protein LOC1084585459.1e-3847.89Show/hide
Query:  PTPKP-PSPPPENTAESDAGDNPTLKPGPRRRTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRN
        P P P P P P             L+ G       PYPW+    A + +L +L S     ITGDV+C +C+++Y++G DL TKF  I+T+IAQNK    +
Subjt:  PTPKP-PSPPPENTAESDAGDNPTLKPGPRRRTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRN

Query:  RAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNP
        RAPS W  PVL  C FC Q N  KPVI  K  K+INWLFL LGQLLG  TL Q+KYFC H K   T   K+ +LY TYL LCKQL P+ P
Subjt:  RAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein2.2e-2833.83Show/hide
Query:  PPPHITVEDAAGENPSTEYPIHPPHTTVETAIGETPTPKPPPPPPENTMESSAGEDPTPKPPSP-----PPENTAESDAGDNPTLKPGPRRRTKR-----
        PPP           P T +P     T  +     +  P PPPP  +  +  S     TP PP       PP   A   +   P     P   + R     
Subjt:  PPPHITVEDAAGENPSTEYPIHPPHTTVETAIGETPTPKPPPPPPENTMESSAGEDPTPKPPSP-----PPENTAESDAGDNPTLKPGPRRRTKR-----

Query:  ------------PYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCV
                    P+PW+  R   IQSL +L+SN    ITG+V+C  C++ Y++  +L  +F  +  F    K   R+RA   WA+P    C  C +   V
Subjt:  ------------PYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCV

Query:  KPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNPLF
        KPVI  +    INWLFL LGQ LG  TL Q+K FC H+K   T   K+ +LY TY+ LCK LQP + LF
Subjt:  KPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNPLF

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)8.1e-3136.29Show/hide
Query:  PPPHITVEDAAGEN------PSTEYPIHPPHTTVETAIGETPTPKPPPPPPENTMESSAGEDP-TPKPP-------SPPPENTAESDAGDNPTLKPGPRR
        P P+     A G N       + E  + PP+ +V T +   P+ +  PPP  N + + A   P   +PP       S  P    E + GD         R
Subjt:  PPPHITVEDAAGEN------PSTEYPIHPPHTTVETAIGETPTPKPPPPPPENTMESSAGEDP-TPKPP-------SPPPENTAESDAGDNPTLKPGPRR

Query:  RTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKD
            PYPW+ K+  +IQS   L SN+ +VI+G V C  CD+   +  +L  KF  +  +I  NK + R+RAP SW+ P L+ C  CK    +KPV+  + 
Subjt:  RTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKD

Query:  HKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNP
         + INWLFL LGQ+LG  TL Q++YFC  N    T   K+ ++Y TYL+LCKQL P  P
Subjt:  HKNINWLFLFLGQLLGRLTLGQIKYFCVHNKISLTSRPKNCLLYHTYLALCKQLQPSNP

AT2G16190.2 FUNCTIONS IN: molecular_function unknown1.2e-2134.53Show/hide
Query:  PPPHITVEDAAGEN------PSTEYPIHPPHTTVETAIGETPTPKPPPPPPENTMESSAGEDP-TPKPP-------SPPPENTAESDAGDNPTLKPGPRR
        P P+     A G N       + E  + PP+ +V T +   P+ +  PPP  N + + A   P   +PP       S  P    E + GD         R
Subjt:  PPPHITVEDAAGEN------PSTEYPIHPPHTTVETAIGETPTPKPPPPPPENTMESSAGEDP-TPKPP-------SPPPENTAESDAGDNPTLKPGPRR

Query:  RTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKD
            PYPW+ K+  +IQS   L SN+ +VI+G V C  CD+   +  +L  KF  +  +I  NK + R+RAP SW+ P L+ C  CK    +KPV+  + 
Subjt:  RTKRPYPWSRKRGARIQSLTHLQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKD

Query:  HKNINWLFLFLGQLLGRLTLGQI
         + INWLFL LGQ+LG  TL Q+
Subjt:  HKNINWLFLFLGQLLGRLTLGQI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAACCCTCCGCCGCATATCACCGTCGAGGACGCCGCCGGCGAAAATCCATCCACCGAATATCCCATCCATCCGCCGCACACCACCGTCGAGACCGCCATCGGCGA
AACTCCAACCCCCAAACCCCCACCCCCTCCACCGGAGAACACCATGGAGAGCAGCGCCGGCGAAGATCCAACTCCGAAACCCCCAAGCCCTCCTCCGGAAAACACCGCGG
AAAGCGACGCCGGCGACAACCCAACCCTGAAACCTGGGCCGCGACGACGGACCAAGCGGCCGTACCCGTGGTCCAGAAAACGAGGAGCCAGAATCCAGAGCCTAACCCAC
CTCCAATCGAACCACTTCGACGTCATAACCGGCGACGTGAAATGCACCAAATGCGACCAAGAGTACAAACTCGGGCTCGATCTGGCCACCAAATTCGACAGGATTTCAAC
CTTCATAGCACAGAACAAGCCCGATTGGCGAAACAGAGCTCCAAGTTCATGGGCGTTCCCTGTTTTGCTCCATTGCAATTTCTGCAAGCAACGAAACTGCGTCAAGCCGG
TGATCCACTGGAAGGATCACAAGAACATCAATTGGCTGTTCTTGTTTTTGGGGCAATTGCTTGGACGGTTGACTCTTGGACAGATCAAATATTTCTGTGTTCATAACAAG
ATTTCTCTAACTAGCCGCCCCAAGAATTGCCTTCTTTATCACACTTATCTTGCTTTGTGTAAGCAGCTTCAGCCCTCCAATCCACTCTTTGATCCTTGA
mRNA sequenceShow/hide mRNA sequence
TTCCAAGATCATCAAAGCACGCCGGCCAAATGGAAAACCCTCCGCCGCATATCACCGTCGAGGACGCCGCCGGCGAAAATCCATCCACCGAATATCCCATCCATCCGCCG
CACACCACCGTCGAGACCGCCATCGGCGAAACTCCAACCCCCAAACCCCCACCCCCTCCACCGGAGAACACCATGGAGAGCAGCGCCGGCGAAGATCCAACTCCGAAACC
CCCAAGCCCTCCTCCGGAAAACACCGCGGAAAGCGACGCCGGCGACAACCCAACCCTGAAACCTGGGCCGCGACGACGGACCAAGCGGCCGTACCCGTGGTCCAGAAAAC
GAGGAGCCAGAATCCAGAGCCTAACCCACCTCCAATCGAACCACTTCGACGTCATAACCGGCGACGTGAAATGCACCAAATGCGACCAAGAGTACAAACTCGGGCTCGAT
CTGGCCACCAAATTCGACAGGATTTCAACCTTCATAGCACAGAACAAGCCCGATTGGCGAAACAGAGCTCCAAGTTCATGGGCGTTCCCTGTTTTGCTCCATTGCAATTT
CTGCAAGCAACGAAACTGCGTCAAGCCGGTGATCCACTGGAAGGATCACAAGAACATCAATTGGCTGTTCTTGTTTTTGGGGCAATTGCTTGGACGGTTGACTCTTGGAC
AGATCAAATATTTCTGTGTTCATAACAAGATTTCTCTAACTAGCCGCCCCAAGAATTGCCTTCTTTATCACACTTATCTTGCTTTGTGTAAGCAGCTTCAGCCCTCCAAT
CCACTCTTTGATCCTTGA
Protein sequenceShow/hide protein sequence
MENPPPHITVEDAAGENPSTEYPIHPPHTTVETAIGETPTPKPPPPPPENTMESSAGEDPTPKPPSPPPENTAESDAGDNPTLKPGPRRRTKRPYPWSRKRGARIQSLTH
LQSNHFDVITGDVKCTKCDQEYKLGLDLATKFDRISTFIAQNKPDWRNRAPSSWAFPVLLHCNFCKQRNCVKPVIHWKDHKNINWLFLFLGQLLGRLTLGQIKYFCVHNK
ISLTSRPKNCLLYHTYLALCKQLQPSNPLFDP