; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0010000 (gene) of Chayote v1 genome

Gene IDSed0010000
OrganismSechium edule (Chayote v1)
Descriptionlysine-rich arabinogalactan protein 19
Genome locationLG05:26613831..26623451
RNA-Seq ExpressionSed0010000
SyntenySed0010000
Gene Ontology termsGO:0017053 - transcriptional repressor complex (cellular component)
InterPro domainsIPR028226 - Protein LIN37


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606861.1 hypothetical protein SDJN03_00203, partial [Cucurbita argyrosperma subsp. sororia]2.9e-14280.17Show/hide
Query:  MPVD---IPTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ--------
        MP D   I TNTT AAV   T TVAP S TLTTA ATTT      AA I RPLANQAPSRPIS IPQTHHLHYPPQA+Y AQ I +RTP+HQ        
Subjt:  MPVD---IPTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ--------

Query:  SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFPP
        SQA+LYPVASSGRGFVPRP+RPLPADQAVTVAN  G+PHRPVVTFPHRP+GSPHLD M+HP+HL RPP+LP QLI LSGSAISGSI GAPNSSDPKVFPP
Subjt:  SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFPP

Query:  STIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIEQLS
        STI ESNGCKEMRV+D+AL V+RDRKVRITDGASLYA+CRSWLRNGS EE QP YGNFLR LPRPLPIPVA A+ SQKKEVV +EVDEEDKD  SIEQLS
Subjt:  STIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIEQLS

Query:  TQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS
        +Q+LLKRHVKRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  TQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS

XP_008461764.1 PREDICTED: uncharacterized protein LOC103500291 isoform X1 [Cucumis melo]2.2e-14279.05Show/hide
Query:  MPVD---IPTNTTAAA-----VTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ---
        MPVD   IPTNTT+AA      TTTT TV PISFTLT A AT     AAA AAI RPLANQAPSRPIS IPQTHHLHYP QA+Y  Q I VRTP+ Q   
Subjt:  MPVD---IPTNTTAAA-----VTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ---

Query:  -----SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDP
             SQA+LYPVASSGRGFVPRP+RPLP DQAVT+AN  G+PHRPVVTFPHRP+GSPHLD M+HP+H+TRPP+L  QLIP SGS+ISGSI GAPNSSDP
Subjt:  -----SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDP

Query:  KVFPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLES
        K FPPSTI ESNGCKEMRV+D+ LCVVRDRKVRITDGASLYA+CRSWLRNGSQEESQP YG+FLRSLPRPLPI VA A PSQKKEVV+EEVDEEDKD  S
Subjt:  KVFPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLES

Query:  IEQLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS
        IE LSTQELLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPP+EQLRTDN+TGS
Subjt:  IEQLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS

XP_022934169.1 uncharacterized protein LOC111441417 [Cucurbita moschata]4.5e-14380.06Show/hide
Query:  MPVD---IPTNTTAAAV---TTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ-----
        MPVD   IPTNTT AAV    TTT TVAPISFTLTTA ATT      +AAAI RPLANQAPSRPIS IPQTHHLHYPPQA+Y AQ I VRTP+ Q     
Subjt:  MPVD---IPTNTTAAAV---TTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ-----

Query:  ---SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKV
           SQA+LYPVASSGRGFVPRP+RPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD M+HP+H+ RPP+L  QLIP SGS+ISGSI GAPNSSDPKV
Subjt:  ---SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKV

Query:  FPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIE
        FPPSTI E+NGCKEMRV+D+ALCVVRDRKV ITDGASLYA+CRSWLRNGSQEESQP YG+FLRSLPRPLPI V  A+PSQKKEVVEE VDE+DKD ESIE
Subjt:  FPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIE

Query:  QLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS
         LSTQELLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  QLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS

XP_022998467.1 uncharacterized protein LOC111493092 [Cucurbita maxima]3.4e-14380.17Show/hide
Query:  MPVD---IPTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ--------
        MP D   I TNTT AAV  TT TVAP S TLTTA ATTT       AAI RPLANQAPSRPIS IPQTHHLHYPPQA+Y AQ I +RTP+HQ        
Subjt:  MPVD---IPTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ--------

Query:  SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFPP
        SQA+LYPVASSGRGFVPRP+RPLP DQAVTVAN  G+PHRP VTFPHRP+GSPHLD M+HP+HL RPP+LP QLI +SGSAISGSI GAPNSSDPKVFPP
Subjt:  SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFPP

Query:  STIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIEQLS
        STI ESNGCKEMRV+D+AL V+RDRKVRITDGASLYA+CRSWLRNGSQEE QP YGNFLR LPRPLPIPVA A+PSQKKEVV +EVDEEDKD  SIEQLS
Subjt:  STIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIEQLS

Query:  TQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS
        +Q+LLKRHVKRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  TQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS

XP_023526543.1 uncharacterized protein LOC111790016 [Cucurbita pepo subsp. pepo]2.6e-14380.45Show/hide
Query:  MPVD---IPTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ--------
        MPVD   IPTNTT AA TTTT TVAPISFTLTTA ATT      AAAAI RPLANQAPSRPIS IPQTHHLHYPPQA+Y AQ I VRTP+ Q        
Subjt:  MPVD---IPTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ--------

Query:  SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFPP
        SQA+LYPVASSGRGFVPRP+RPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD M+HP+H+ RPP+L  QLIP SGS+ISGSI GAPNSSDPKVFPP
Subjt:  SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFPP

Query:  STIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIEQLS
        STI E+NGCKEMRV+D+ALCVVRDRKV ITDGASLYA+CRSWLRNGSQEESQP YG+FLRSLPRPLPI V  A+PSQKKEVVEE VDE+DKD ESIE L 
Subjt:  STIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIEQLS

Query:  TQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS
        TQELLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  TQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS

TrEMBL top hitse value%identityAlignment
A0A1S3CFG6 uncharacterized protein LOC103500291 isoform X11.1e-14279.05Show/hide
Query:  MPVD---IPTNTTAAA-----VTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ---
        MPVD   IPTNTT+AA      TTTT TV PISFTLT A AT     AAA AAI RPLANQAPSRPIS IPQTHHLHYP QA+Y  Q I VRTP+ Q   
Subjt:  MPVD---IPTNTTAAA-----VTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ---

Query:  -----SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDP
             SQA+LYPVASSGRGFVPRP+RPLP DQAVT+AN  G+PHRPVVTFPHRP+GSPHLD M+HP+H+TRPP+L  QLIP SGS+ISGSI GAPNSSDP
Subjt:  -----SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDP

Query:  KVFPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLES
        K FPPSTI ESNGCKEMRV+D+ LCVVRDRKVRITDGASLYA+CRSWLRNGSQEESQP YG+FLRSLPRPLPI VA A PSQKKEVV+EEVDEEDKD  S
Subjt:  KVFPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLES

Query:  IEQLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS
        IE LSTQELLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPP+EQLRTDN+TGS
Subjt:  IEQLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS

A0A5A7TZA2 Mucin-2 isoform X21.1e-14279.05Show/hide
Query:  MPVD---IPTNTTAAA-----VTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ---
        MPVD   IPTNTT+AA      TTTT TV PISFTLT A AT     AAA AAI RPLANQAPSRPIS IPQTHHLHYP QA+Y  Q I VRTP+ Q   
Subjt:  MPVD---IPTNTTAAA-----VTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ---

Query:  -----SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDP
             SQA+LYPVASSGRGFVPRP+RPLP DQAVT+AN  G+PHRPVVTFPHRP+GSPHLD M+HP+H+TRPP+L  QLIP SGS+ISGSI GAPNSSDP
Subjt:  -----SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDP

Query:  KVFPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLES
        K FPPSTI ESNGCKEMRV+D+ LCVVRDRKVRITDGASLYA+CRSWLRNGSQEESQP YG+FLRSLPRPLPI VA A PSQKKEVV+EEVDEEDKD  S
Subjt:  KVFPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLES

Query:  IEQLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS
        IE LSTQELLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPP+EQLRTDN+TGS
Subjt:  IEQLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1F1T6 uncharacterized protein LOC1114414172.2e-14380.06Show/hide
Query:  MPVD---IPTNTTAAAV---TTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ-----
        MPVD   IPTNTT AAV    TTT TVAPISFTLTTA ATT      +AAAI RPLANQAPSRPIS IPQTHHLHYPPQA+Y AQ I VRTP+ Q     
Subjt:  MPVD---IPTNTTAAAV---TTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ-----

Query:  ---SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKV
           SQA+LYPVASSGRGFVPRP+RPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD M+HP+H+ RPP+L  QLIP SGS+ISGSI GAPNSSDPKV
Subjt:  ---SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKV

Query:  FPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIE
        FPPSTI E+NGCKEMRV+D+ALCVVRDRKV ITDGASLYA+CRSWLRNGSQEESQP YG+FLRSLPRPLPI V  A+PSQKKEVVEE VDE+DKD ESIE
Subjt:  FPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIE

Query:  QLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS
         LSTQELLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  QLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1IZP9 uncharacterized protein LOC1114814112.4e-14279.49Show/hide
Query:  MPVD---IPTNTT---AAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ-----
        MPVD   IPTNTT   AAA  TTT TVAPISFTL+ A ATT      AAAAI RPLANQAPSRPIS IPQTHHLHYPPQA+Y AQ I VRTP+ Q     
Subjt:  MPVD---IPTNTT---AAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ-----

Query:  ---SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKV
           SQA+LYPVASSGRGFVPRP+RPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD M+HP+H+ RPP+L  QLIP SGS+ISGSI  APNSSDPKV
Subjt:  ---SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKV

Query:  FPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIE
        FPPSTI E+NGCKEMRV+D+ALCVVRDRKV ITDGASLYA+CRSWLRNGSQEESQP YG+FLRSLPRPLPI V  A+PSQKKEVVEEEVDE+DKD ESIE
Subjt:  FPPSTIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIE

Query:  QLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS
         LSTQELLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  QLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1K833 uncharacterized protein LOC1114930921.7e-14380.17Show/hide
Query:  MPVD---IPTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ--------
        MP D   I TNTT AAV  TT TVAP S TLTTA ATTT       AAI RPLANQAPSRPIS IPQTHHLHYPPQA+Y AQ I +RTP+HQ        
Subjt:  MPVD---IPTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQ--------

Query:  SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFPP
        SQA+LYPVASSGRGFVPRP+RPLP DQAVTVAN  G+PHRP VTFPHRP+GSPHLD M+HP+HL RPP+LP QLI +SGSAISGSI GAPNSSDPKVFPP
Subjt:  SQAVLYPVASSGRGFVPRPVRPLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFPP

Query:  STIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIEQLS
        STI ESNGCKEMRV+D+AL V+RDRKVRITDGASLYA+CRSWLRNGSQEE QP YGNFLR LPRPLPIPVA A+PSQKKEVV +EVDEEDKD  SIEQLS
Subjt:  STIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIEQLS

Query:  TQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS
        +Q+LLKRHVKRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  TQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G04930.1 hydroxyproline-rich glycoprotein family protein4.0e-4139.81Show/hide
Query:  SFTLTTAPATTTAA-TAAAAAAIPRPLANQAPSRPISVIPQTHH----LHYP----PQAIYP----AQPISVRTPSHQSQAVLYPVASSGRGFVPRPVRP
        S +LT +P+ +TA+ T        RP  +Q P  P  + P T+     L +P     Q+ Y     A  I VR       AVLYP A  GRGF  RPVR 
Subjt:  SFTLTTAPATTTAA-TAAAAAAIPRPLANQAPSRPISVIPQTHH----LHYP----PQAIYP----AQPISVRTPSHQSQAVLYPVASSGRGFVPRPVRP

Query:  LPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAIS-GSINGAPNSSDPKVFPPST-IYESNGCKEMRVKDEALC
          AD +VT  NL+G+P RP  T+   P     ++ +       R P + P      GS +  G I  +P    P+V PP T I +++  ++ R KD AL 
Subjt:  LPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAIS-GSINGAPNSSDPKVFPPST-IYESNGCKEMRVKDEALC

Query:  VVRDRKVRITDG-ASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIEQLSTQELLKRHVKRAKKVRSRL
        VVR RKVRIT+G +SLY++ RSWL+NG+    QP     ++ LP+PLP+ +     S   +  EE  DE+ +D E+++QLS ++LLKRH++RAKKVR++L
Subjt:  VVRDRKVRITDG-ASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIEQLSTQELLKRHVKRAKKVRSRL

Query:  REERLLRIERYKTRLALLL
        REER  RI RYK R+ L+L
Subjt:  REERLLRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein1.9e-3836.96Show/hide
Query:  SFTLTTAPATTTAA-TAAAAAAIPRPLANQAPSRPISVIPQTHH----LHYP----PQAIYP----AQPISVRTPSHQSQAVLYPVASSGRGFVPRPVRP
        S +LT +P+ +TA+ T        RP  +Q P  P  + P T+     L +P     Q+ Y     A  I VR       AVLYP A  GRGF  RPVR 
Subjt:  SFTLTTAPATTTAA-TAAAAAAIPRPLANQAPSRPISVIPQTHH----LHYP----PQAIYP----AQPISVRTPSHQSQAVLYPVASSGRGFVPRPVRP

Query:  LPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHL----------PPQLIPLSGS-------AISGSINGAPNSSDPKVF-------
          AD +VT  NL+G+P RP  T+   P     ++ +       R P +          P  L P+  S         SG I+G     DPK         
Subjt:  LPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHL----------PPQLIPLSGS-------AISGSINGAPNSSDPKVF-------

Query:  --------PPSTIYESNGCKEMRVKDEALCVVRDRKVRITDG-ASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEE
                PP++I +++  ++ R KD AL VVR RKVRIT+G +SLY++ RSWL+NG+    QP     ++ LP+PLP+ +     S   +  EE  DE+
Subjt:  --------PPSTIYESNGCKEMRVKDEALCVVRDRKVRITDG-ASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEE

Query:  DKDLESIEQLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLL
         +D E+++QLS ++LLKRH++RAKKVR++LREER  RI RYK R+ L+L
Subjt:  DKDLESIEQLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLL

AT2G32840.1 proline-rich family protein8.1e-4240.52Show/hide
Query:  PTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVR-----------TPSHQSQAVL
        P    + +V  +TP V     T + +P  T   T    ++ P+P    +  R I+ +    H H+P Q IY   P+ +R            P     +++
Subjt:  PTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVR-----------TPSHQSQAVL

Query:  YPVASSGRGFVPRPVRPLPADQAVTVANLA--GF-PHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFP-PS
        YP  SSGRGF  RPVR      A  V + +  G+ P  PV  + H    S +LD MN      R  H   Q  P  G   SG + G P+   P+  P P+
Subjt:  YPVASSGRGFVPRPVRPLPADQAVTVANLA--GF-PHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFP-PS

Query:  TIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDK-DLESIEQLS
        +I +++G K+ R +D+AL +VR RKVRIT+GASLY++CRSWLRNG+ E  +P   + +  LP+PLP+       S  K++VEE + EEDK D ES++ LS
Subjt:  TIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDK-DLESIEQLS

Query:  TQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTD
          +LLKRH+ RAKKVR+RLREERL RI RYK RLALLLPP  EQ R +
Subjt:  TQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTD

AT2G32840.2 proline-rich family protein2.0e-2434.93Show/hide
Query:  PTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVR-----------TPSHQSQAVL
        P    + +V  +TP V     T + +P  T   T    ++ P+P    +  R I+ +    H H+P Q IY   P+ +R            P     +++
Subjt:  PTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVR-----------TPSHQSQAVL

Query:  YPVASSGRGFVPRPVRPLPADQAVTVANLA--GF-PHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFP-PS
        YP  SSGRGF  RPVR      A  V + +  G+ P  PV  + H    S +LD MN      R  H   Q  P  G   SG + G P+   P+  P P+
Subjt:  YPVASSGRGFVPRPVRPLPADQAVTVANLA--GF-PHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFP-PS

Query:  TIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDL
        +I +++G K+ R +D+AL +VR RKVRIT+GASLY++CRSWLRNG+ E  +P   + +  LP+PLP+       S  K++VEE + EEDK++
Subjt:  TIYESNGCKEMRVKDEALCVVRDRKVRITDGASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCGTCGACATCCCCACCAACACGACGGCCGCCGCCGTCACCACCACCACCCCCACCGTCGCTCCGATTTCATTCACTCTAACCACAGCCCCAGCCACCACCACAGC
CGCCACCGCCGCCGCCGCCGCTGCCATTCCTCGTCCGCTTGCGAATCAAGCGCCGTCGAGACCAATTTCTGTAATTCCTCAGACCCATCATCTCCACTACCCTCCTCAAG
CCATCTATCCGGCGCAGCCGATCTCTGTTCGAACTCCGAGCCACCAATCTCAGGCAGTTCTTTACCCTGTTGCTTCCTCCGGCCGCGGTTTCGTTCCTCGCCCCGTTCGC
CCCCTTCCCGCCGATCAGGCCGTCACGGTGGCCAACCTTGCCGGCTTCCCACATCGCCCCGTCGTGACCTTCCCGCATCGGCCGGTTGGGTCGCCTCATTTGGACTTCAT
GAACCATCCAGTTCACTTGACCCGACCTCCCCACTTGCCGCCGCAGTTGATTCCCCTTTCTGGGTCCGCCATTTCGGGCTCGATTAATGGCGCCCCCAATTCCTCCGATC
CAAAGGTTTTTCCTCCATCAACAATCTATGAGTCAAATGGATGTAAAGAAATGAGAGTGAAGGATGAAGCTCTTTGTGTTGTTAGAGATCGAAAAGTCCGAATAACTGAT
GGAGCTTCTCTTTATGCAATTTGTCGATCATGGTTGAGGAATGGTTCTCAAGAAGAAAGCCAGCCACACTATGGAAATTTTTTGAGGTCTCTTCCGAGACCTTTGCCTAT
CCCTGTGGCTGCTGCTTTACCATCACAGAAGAAGGAAGTCGTTGAAGAAGAAGTTGACGAGGAAGATAAGGATCTGGAATCCATTGAGCAGTTGTCAACACAAGAGCTAT
TGAAAAGACACGTTAAACGAGCCAAAAAAGTTCGATCACGATTGAGGGAAGAACGGTTGCTACGAATTGAAAGATACAAAACCAGGCTCGCTCTTCTCCTTCCTCCACCA
GTCGAGCAGTTGAGAACGGATAATATTACTGGAAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATATTTTAAAAATTAATTTGCTCCTCCCTTTCAACGGCCACCACGAGCCCAAACCCCCAATCGTGTCAATTCGCAGAAATTCAAGTTTTATTTGCCCAAATTTTGATTCA
AACAATCAAAAACCACATACACCCAAATTCCCCGCCACCATGCCCGTCGACATCCCCACCAACACGACGGCCGCCGCCGTCACCACCACCACCCCCACCGTCGCTCCGAT
TTCATTCACTCTAACCACAGCCCCAGCCACCACCACAGCCGCCACCGCCGCCGCCGCCGCTGCCATTCCTCGTCCGCTTGCGAATCAAGCGCCGTCGAGACCAATTTCTG
TAATTCCTCAGACCCATCATCTCCACTACCCTCCTCAAGCCATCTATCCGGCGCAGCCGATCTCTGTTCGAACTCCGAGCCACCAATCTCAGGCAGTTCTTTACCCTGTT
GCTTCCTCCGGCCGCGGTTTCGTTCCTCGCCCCGTTCGCCCCCTTCCCGCCGATCAGGCCGTCACGGTGGCCAACCTTGCCGGCTTCCCACATCGCCCCGTCGTGACCTT
CCCGCATCGGCCGGTTGGGTCGCCTCATTTGGACTTCATGAACCATCCAGTTCACTTGACCCGACCTCCCCACTTGCCGCCGCAGTTGATTCCCCTTTCTGGGTCCGCCA
TTTCGGGCTCGATTAATGGCGCCCCCAATTCCTCCGATCCAAAGGTTTTTCCTCCATCAACAATCTATGAGTCAAATGGATGTAAAGAAATGAGAGTGAAGGATGAAGCT
CTTTGTGTTGTTAGAGATCGAAAAGTCCGAATAACTGATGGAGCTTCTCTTTATGCAATTTGTCGATCATGGTTGAGGAATGGTTCTCAAGAAGAAAGCCAGCCACACTA
TGGAAATTTTTTGAGGTCTCTTCCGAGACCTTTGCCTATCCCTGTGGCTGCTGCTTTACCATCACAGAAGAAGGAAGTCGTTGAAGAAGAAGTTGACGAGGAAGATAAGG
ATCTGGAATCCATTGAGCAGTTGTCAACACAAGAGCTATTGAAAAGACACGTTAAACGAGCCAAAAAAGTTCGATCACGATTGAGGGAAGAACGGTTGCTACGAATTGAA
AGATACAAAACCAGGCTCGCTCTTCTCCTTCCTCCACCAGTCGAGCAGTTGAGAACGGATAATATTACTGGAAGTTAAGTATGCATCTCGAAATCCTCATCAAATGTACC
GTTGACCATTGTCCAAATCATTCTTCTCAGCTACAGATGAACTCAAGAAACATTCACCAGAAGCGCGGGGGATTGATTGTATTTAAACAATACTAGAAAGGTATTTGATA
GATTTAGGTAAATTTTGTCTTCTATCAGTTCTTTTTTACAGTAGTTTGTATCATGGTGCTATCAATCAAGTTAAAGGGAAATGGTGTTATTTATTGAGCTTAACTTGTGT
TATTAAAATTTCATTTAACATCATCTCTGAGTTTTATTTTGAGATGTCTGAATCTCAAGGTGGTGTAAGCAAAGCCTTTTTTTAATCTTAT
Protein sequenceShow/hide protein sequence
MPVDIPTNTTAAAVTTTTPTVAPISFTLTTAPATTTAATAAAAAAIPRPLANQAPSRPISVIPQTHHLHYPPQAIYPAQPISVRTPSHQSQAVLYPVASSGRGFVPRPVR
PLPADQAVTVANLAGFPHRPVVTFPHRPVGSPHLDFMNHPVHLTRPPHLPPQLIPLSGSAISGSINGAPNSSDPKVFPPSTIYESNGCKEMRVKDEALCVVRDRKVRITD
GASLYAICRSWLRNGSQEESQPHYGNFLRSLPRPLPIPVAAALPSQKKEVVEEEVDEEDKDLESIEQLSTQELLKRHVKRAKKVRSRLREERLLRIERYKTRLALLLPPP
VEQLRTDNITGS