CuGenDBv2

Sed0001065 (gene) of Chayote v1 genome

Gene ID: Sed0001065
Organism: Sechium edule (Chayote v1)
Description: Hydroxyproline-rich glycoprotein family protein
Genome location: LG13:6398216..6401113
RNA-Seq Expression: Sed0001065
Synteny: Sed0001065
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like
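The Genome location field packs a sequence identifier and two coordinates into one string. A minimal sketch of how it could be unpacked follows; the helper names `parse_location` and `span_length` are hypothetical, and the 1-based, closed-interval convention is an assumption rather than something this record states.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG13:6398216..6401113")
print(seqid, start, end, span_length(start, end))  # LG13 6398216 6401113 2898
```

Under that assumption the gene spans 2898 bp on LG13.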


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004146564.1 uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus]; e-value 7.9e-157; identity 53.75%
Query:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP
        M RR D D   P+   NNTFQTITAAADAIAT DHRFPRAT VQKRRWGSC SIYWCFGS+KQRKRIG AVLVPEPS SS E HEN LQSPD VLPFAAP
Subjt:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP

Query:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPL------TTTPSSPEVPFAQVLQPTLQ
        P+SP  S LQSE  SA QSPTAL+ FTSLTANMYS PDGPSSIFAIGPFA+E QLVSPPLNFS LTTEPSTP         TTPSSPEVPFAQ +QPTL 
Subjt:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPL------TTTPSSPEVPFAQVLQPTLQ

Query:  NTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSGS
          ESDN Y+FPNDDFQSY FYPGSP+SHLISPRSVISRSGASSPLPD DFAS  GSQF NF LEVPP LLNLD HSI NWR+ Q +DS TQ+S+   S +
Subjt:  NTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSGS

Query:  GFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGLL
         F LNPQT ESMSDHHATNESQNIQILID   K+E                                            EEPGA NHRFSFELSD D LL
Subjt:  GFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGLL

Query:  RSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKPL
        +SV SKPLE N                                                                                         
Subjt:  RSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKPL

Query:  EPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHNHS
                                                         E A  SSPIHEPFE  KE+SP G H SN  EEK  A+G+E   HQ   +HS
Subjt:  EPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHNHS

Query:  ITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ
        +TL SVKEFNFDNGNGSD   PNINS+WW NAKD   E TATG+WSFFP+ QQ
Subjt:  ITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ

XP_008452033.1 PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo]; e-value 1.9e-155; identity 53.67%
Query:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP
        M RR D   TD  R VNNTFQTITAAADAIAT DHRFPRAT VQKRRWGSC SIYWCFGSLKQRKRIG AVLVPEPS SS E HEN LQSPD VLPFAAP
Subjt:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP

Query:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL
        P+SP  S LQSE  SA QSPTAL+ FTSLTANMYS PDGPSSIFAIGPFA+E QLVSPPLNFS LTTEPSTPP T       TTPSSPEVPFAQ + P+L
Subjt:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL

Query:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG
        Q  ESDN Y+FPNDDFQSY FYPGSP+SHLISPRSVISRSGASSPLPD DFAS  GSQF NF LEVPP L NLD HSI NWR+ Q +DS TQ+S+   S 
Subjt:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG

Query:  SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGL
        + F LNP T ESM DHHATNESQNIQILID   K E                                            EEPGA NHRFSFELSD D L
Subjt:  SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGL

Query:  LRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKP
         +SV SKPLE NE                                                                                       
Subjt:  LRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKP

Query:  LEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHNH
                                                     P+E       SSPIHEPFE  KE+SP G H SN  EEK  A+G+E  QHQ   +H
Subjt:  LEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHNH

Query:  SITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ
        S+ L SVKEFNFDN NGSD   P INSDWW NAKD   EGT TG+WSFFP  QQ
Subjt:  SITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ

XP_022136623.1 uncharacterized protein At1g76660-like [Momordica charantia]; e-value 2.9e-159; identity 53.28%
Query:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP
        M RR D DA   +  VNNTFQTITAAADAIAT DHRFPRAT VQKRRWGSC SIYWCFGSLKQRKRIG AVLVPEPS S+ E  EN LQSPD VLPFAAP
Subjt:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP

Query:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL
        P+SP  SFLQSE  SATQSPTA+L FTSLTANMYS PDGPSSIFA+GPFA+ETQLVSPPLNFS +TT+PST P T       TTPSSPEVPFAQ LQP+ 
Subjt:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL

Query:  QNTESDNHY-SFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMS
        Q  ESD+ Y  FPNDDFQSY FYPGSP+SHLISPRSVISRSGASSPLPD DF + SGS FSNF +EVPP LLNLD HSI +WR  Q SDS TQNSVG  S
Subjt:  QNTESDNHY-SFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMS

Query:  GSGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDG
         + F LNPQT ES+SD+HA+NE  NIQIL DGSQ++E  AANHRFSFELSDE+ LL+SVE+KPLESN                                 
Subjt:  GSGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDG

Query:  LLRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESK
                                                                                                            
Subjt:  LLRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESK

Query:  PLEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHN
                                                           E A ASSPIHEP E AKE S VG H SN TEE+E A+GEE   HQ   +
Subjt:  PLEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHN

Query:  HSITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ
        HS+TL +VKEFNFDNGNG D  KPNINS WWAN KD E EGT TG+WSFFP+ QQ
Subjt:  HSITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ

XP_023522163.1 uncharacterized protein At1g76660-like [Cucurbita pepo subsp. pepo]; e-value 7.4e-155; identity 54.35%
Query:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP
        M RRAD DA D +R +NNTFQTITAAADAIAT DHRFPRAT VQKRRWGSC SIYWCFGSLKQRKRIG AVLVPEPS  S EAH+N LQSPD VLPFAAP
Subjt:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP

Query:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL
        P+SP  SFLQSE  SATQSP+ +L FTSLTANMYS PDGPSSIFAIGPFA+ETQLVSPPLNFS LTTEPSTP  T       TTPSSPEVPFAQ LQPTL
Subjt:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL

Query:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG
        Q  ESD+ YS PNDDFQSY FYPGSP+S+LISPRS IS SGASSPLPDLDFAS S SQFSNF+L+VPPALLNLD       R+GQ SDS TQNSVG  S 
Subjt:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG

Query:  -SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDG
           FDLNP+T +SM      NESQNIQILIDGSQ EEP   NHRFSFELSDE+ LLR+VESKPLESN                                 
Subjt:  -SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDG

Query:  LLRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESK
                                                                                                            
Subjt:  LLRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESK

Query:  PLEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHN
                                                             A ASSP+HE FE AKE S  G H SNG EEK  A+GEE  QHQ  H+
Subjt:  PLEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHN

Query:  HSITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ
        HS TL SV EFNFDNGNGS+A KPNINSDWWANAKDVE +GT TG+WSFFP+AQQ
Subjt:  HSITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ

XP_038884079.1 uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida]; e-value 6.9e-161; identity 55.27%
Query:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP
        M RR D   TD  R VNNTFQTITAAADAIAT DHRFPRAT VQKRRWGSC SIYWCFGSLKQRKRIG AVLVPE S SS E+HEN LQSPD VLPFAAP
Subjt:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP

Query:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL
        P+SP  SFLQSE  SATQSPTAL+ FTSLTANMYS PDGPSSIFAIGPFA+ETQLVSPPLNFS LTTEPST P T       TTPSSPEVPFAQ LQPTL
Subjt:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL

Query:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG
        Q +ESD+ Y FPNDDFQSY FYPGSP+SHLISPRSVISRSGASSPLPD DFAS  GSQF NF LEVPP LLNLD  SI NWR+ Q +DS TQ+S+   S 
Subjt:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG

Query:  SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGL
        + F LNPQT ESMSDHHATNESQNIQILIDG+QKEE                                            E PGA NHRFSFELSD D L
Subjt:  SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGL

Query:  LRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKP
        L+SV SKPL+ N                                                                                        
Subjt:  LRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKP

Query:  LEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPV-GSHISNGTEEKEIAEGEETKQHQANHN
                                                          E A ASSPIHEPFE AKE+SPV   H SN TE K  AE EE  QHQ   +
Subjt:  LEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPV-GSHISNGTEEKEIAEGEETKQHQANHN

Query:  HSITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ
        HSITL SVKEFNFDNGNGSD  K N+NS+WW NAKDV+ EGT  G+WSFFP+ QQ
Subjt:  HSITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3BSB0 uncharacterized protein LOC103493162 isoform X1; e-value 2.3e-154; identity 53.59%
Query:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPV-QKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAA
        M RR D   TD  R VNNTFQTITAAADAIAT DHRFPRAT V QKRRWGSC SIYWCFGSLKQRKRIG AVLVPEPS SS E HEN LQSPD VLPFAA
Subjt:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPV-QKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAA

Query:  PPTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPT
        PP+SP  S LQSE  SA QSPTAL+ FTSLTANMYS PDGPSSIFAIGPFA+E QLVSPPLNFS LTTEPSTPP T       TTPSSPEVPFAQ + P+
Subjt:  PPTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPT

Query:  LQNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMS
        LQ  ESDN Y+FPNDDFQSY FYPGSP+SHLISPRSVISRSGASSPLPD DFAS  GSQF NF LEVPP L NLD HSI NWR+ Q +DS TQ+S+   S
Subjt:  LQNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMS

Query:  GSGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDG
         + F LNP T ESM DHHATNESQNIQILID   K E                                            EEPGA NHRFSFELSD D 
Subjt:  GSGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDG

Query:  LLRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESK
        L +SV SKPLE NE                                                                                      
Subjt:  LLRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESK

Query:  PLEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHN
                                                      P+E       SSPIHEPFE  KE+SP G H SN  EEK  A+G+E  QHQ   +
Subjt:  PLEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHN

Query:  HSITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ
        HS+ L SVKEFNFDN NGSD   P INSDWW NAKD   EGT TG+WSFFP  QQ
Subjt:  HSITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ

A0A1S3BSY8 uncharacterized protein LOC103493162 isoform X2; e-value 9.4e-156; identity 53.67%
Query:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP
        M RR D   TD  R VNNTFQTITAAADAIAT DHRFPRAT VQKRRWGSC SIYWCFGSLKQRKRIG AVLVPEPS SS E HEN LQSPD VLPFAAP
Subjt:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP

Query:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL
        P+SP  S LQSE  SA QSPTAL+ FTSLTANMYS PDGPSSIFAIGPFA+E QLVSPPLNFS LTTEPSTPP T       TTPSSPEVPFAQ + P+L
Subjt:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL

Query:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG
        Q  ESDN Y+FPNDDFQSY FYPGSP+SHLISPRSVISRSGASSPLPD DFAS  GSQF NF LEVPP L NLD HSI NWR+ Q +DS TQ+S+   S 
Subjt:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG

Query:  SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGL
        + F LNP T ESM DHHATNESQNIQILID   K E                                            EEPGA NHRFSFELSD D L
Subjt:  SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGL

Query:  LRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKP
         +SV SKPLE NE                                                                                       
Subjt:  LRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKP

Query:  LEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHNH
                                                     P+E       SSPIHEPFE  KE+SP G H SN  EEK  A+G+E  QHQ   +H
Subjt:  LEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHNH

Query:  SITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ
        S+ L SVKEFNFDN NGSD   P INSDWW NAKD   EGT TG+WSFFP  QQ
Subjt:  SITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ

A0A5A7TUB1 Mucin-2; e-value 3.0e-154; identity 53.21%
Query:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP
        M RR D   TD  R VNNTFQTITAAADAIAT DHRFPRAT VQKRRWGSC SIYWCFGSLKQRKRIG AVLVPEPS SS E HEN LQSPD VLPFAAP
Subjt:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP

Query:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL
        P+SP  S LQSE  SA QSPTAL+ FTSLTANMYS PDGPSSIFAIGPFA+E QLVSPPLNFS LTTEPSTPP T       TTPSSPEVPFAQ + P+ 
Subjt:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL

Query:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG
        Q  ESDN Y+FPNDDFQSY FYPGSP+SHLISPRSVISRSGASSPLPD DFAS  GSQF NF L+VPP L N+D HSI NWR+ Q +DS TQ+S+   S 
Subjt:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG

Query:  SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGL
        + F LNP T ESM DHHATNESQNIQILID   K E                                            EEPGA NHRFSFELSD D L
Subjt:  SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGL

Query:  LRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKP
         +SV SKPLE NE                                                                                       
Subjt:  LRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKP

Query:  LEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHNH
                                                     P+E       SSPIHEPFE  KE+SP G H SN  EEK  A+G+E  QHQ   +H
Subjt:  LEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHNH

Query:  SITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ
        S+ L SVKEFNFDN NGSD   P INSDWW NAKD   EGT TG+WSFFP  QQ
Subjt:  SITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ

A0A5D3CYQ2 Mucin-2; e-value 9.4e-156; identity 53.67%
Query:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP
        M RR D   TD  R VNNTFQTITAAADAIAT DHRFPRAT VQKRRWGSC SIYWCFGSLKQRKRIG AVLVPEPS SS E HEN LQSPD VLPFAAP
Subjt:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP

Query:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL
        P+SP  S LQSE  SA QSPTAL+ FTSLTANMYS PDGPSSIFAIGPFA+E QLVSPPLNFS LTTEPSTPP T       TTPSSPEVPFAQ + P+L
Subjt:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL

Query:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG
        Q  ESDN Y+FPNDDFQSY FYPGSP+SHLISPRSVISRSGASSPLPD DFAS  GSQF NF LEVPP L NLD HSI NWR+ Q +DS TQ+S+   S 
Subjt:  QNTESDNHYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSG

Query:  SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGL
        + F LNP T ESM DHHATNESQNIQILID   K E                                            EEPGA NHRFSFELSD D L
Subjt:  SGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGL

Query:  LRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKP
         +SV SKPLE NE                                                                                       
Subjt:  LRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKP

Query:  LEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHNH
                                                     P+E       SSPIHEPFE  KE+SP G H SN  EEK  A+G+E  QHQ   +H
Subjt:  LEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHNH

Query:  SITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ
        S+ L SVKEFNFDN NGSD   P INSDWW NAKD   EGT TG+WSFFP  QQ
Subjt:  SITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ

A0A6J1C828 uncharacterized protein At1g76660-like; e-value 1.4e-159; identity 53.28%
Query:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP
        M RR D DA   +  VNNTFQTITAAADAIAT DHRFPRAT VQKRRWGSC SIYWCFGSLKQRKRIG AVLVPEPS S+ E  EN LQSPD VLPFAAP
Subjt:  MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAP

Query:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL
        P+SP  SFLQSE  SATQSPTA+L FTSLTANMYS PDGPSSIFA+GPFA+ETQLVSPPLNFS +TT+PST P T       TTPSSPEVPFAQ LQP+ 
Subjt:  PTSPASSFLQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT-------TTPSSPEVPFAQVLQPTL

Query:  QNTESDNHY-SFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMS
        Q  ESD+ Y  FPNDDFQSY FYPGSP+SHLISPRSVISRSGASSPLPD DF + SGS FSNF +EVPP LLNLD HSI +WR  Q SDS TQNSVG  S
Subjt:  QNTESDNHY-SFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMS

Query:  GSGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDG
         + F LNPQT ES+SD+HA+NE  NIQIL DGSQ++E  AANHRFSFELSDE+ LL+SVE+KPLESN                                 
Subjt:  GSGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDG

Query:  LLRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESK
                                                                                                            
Subjt:  LLRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLLRSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESK

Query:  PLEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHN
                                                           E A ASSPIHEP E AKE S VG H SN TEE+E A+GEE   HQ   +
Subjt:  PLEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASSPIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHN

Query:  HSITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ
        HS+TL +VKEFNFDNGNG D  KPNINS WWAN KD E EGT TG+WSFFP+ QQ
Subjt:  HSITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ

SwissProt top hits (e-value, % identity, alignment)
Q9SRE5 Uncharacterized protein At1g76660; e-value 1.2e-27; identity 44.49%
Query:  QKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAE----AHE----NLLQSPDNVLPFAAPPTSPASSFLQSELSSATQSPTALLPFTSLTANMYS
        Q++RWG C  ++ CF S K  KRI PA  +PE  + SA     AH+    N   +    L   APP+SPA SF  S L S TQSP   L   SL AN   
Subjt:  QKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAE----AHE----NLLQSPDNVLPFAAPPTSPASSFLQSELSSATQSPTALLPFTSLTANMYS

Query:  DPDGP-SSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT--------TTPSSPEVPFAQVLQPTLQ-NTESDNHYSFPNDDFQSYHFYPGSPISHLIS
         P GP SS++A GP+A+ETQLVSPP+ FS  TTEPST P T        T PSSP+VP+A+ L  ++        HY   ND   +Y  YPGSP S L S
Subjt:  DPDGP-SSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT--------TTPSSPEVPFAQVLQPTLQ-NTESDNHYSFPNDDFQSYHFYPGSPISHLIS

Query:  PRSVISRSGASSPLPDLDFASASGSQF
        P S  S  G  SP       S SG+ F
Subjt:  PRSVISRSGASSPLPDLDFASASGSQF

Arabidopsis top hits (e-value, % identity, alignment)
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1); e-value 3.9e-45; identity 47.45%
Query:  NNTFQTITAAADAIATADHRFPRATPV-QKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEP---SDSSAEAHENLLQSPDNVLPFAAPPTSPASSFLQSE
        NN F TI AAA AIA++D R  +++P+ +KR+W +  S+  CFGS +QRKRIG +VLVPEP   S S++    +  +S    LPF APP+SPA SF QSE
Subjt:  NNTFQTITAAADAIATADHRFPRATPV-QKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEP---SDSSAEAHENLLQSPDNVLPFAAPPTSPASSFLQSE

Query:  LSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPS----TPPL--------TTTPSSPEVPFAQVLQPTLQNTESDN
          SATQSP  +L F+ L  N         SIFAIGP+A+ETQLVSPP+ FS  TTEPS    TPPL        TTTPSSPEVPFAQ+     Q      
Subjt:  LSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPS----TPPL--------TTTPSSPEVPFAQVLQPTLQNTESDN

Query:  HYSFP---NDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSI
         Y FP   + +FQ Y   PGSP+  LISP      SG +SP PD        S F +F +  PP LL+  T  +
Subjt:  HYSFP---NDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSI

AT1G76660.1 FUNCTIONS IN: molecular_function unknown; e-value 8.8e-29; identity 44.49%
Query:  QKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAE----AHE----NLLQSPDNVLPFAAPPTSPASSFLQSELSSATQSPTALLPFTSLTANMYS
        Q++RWG C  ++ CF S K  KRI PA  +PE  + SA     AH+    N   +    L   APP+SPA SF  S L S TQSP   L   SL AN   
Subjt:  QKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAE----AHE----NLLQSPDNVLPFAAPPTSPASSFLQSELSSATQSPTALLPFTSLTANMYS

Query:  DPDGP-SSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT--------TTPSSPEVPFAQVLQPTLQ-NTESDNHYSFPNDDFQSYHFYPGSPISHLIS
         P GP SS++A GP+A+ETQLVSPP+ FS  TTEPST P T        T PSSP+VP+A+ L  ++        HY   ND   +Y  YPGSP S L S
Subjt:  DPDGP-SSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT--------TTPSSPEVPFAQVLQPTLQ-NTESDNHYSFPNDDFQSYHFYPGSPISHLIS

Query:  PRSVISRSGASSPLPDLDFASASGSQF
        P S  S  G  SP       S SG+ F
Subjt:  PRSVISRSGASSPLPDLDFASASGSQF

AT4G25620.1 hydroxyproline-rich glycoprotein family protein; e-value 5.5e-39; identity 38.64%
Query:  MRSVNN-TFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSS---AEAHENLLQSPDNVLPFAAPPTSPASSF
        MRSVNN +  T+ AAA AI +A+ R  + + VQK+R GS  S+YWCFGS K  KRIG AVLVPEP+ S    A    +   S    +PF APP+SPA SF
Subjt:  MRSVNN-TFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSS---AEAHENLLQSPDNVLPFAAPPTSPASSF

Query:  LQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT---TTPSSPEVPFAQVLQPTLQNTESDN-----
        L S   SA+ +P   L   SLT N       P S F IGP+A+ETQ V+PP+ FS  TTEPST P T    +PSSPEVPFAQ+L  +L+    ++     
Subjt:  LQSELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT---TTPSSPEVPFAQVLQPTLQNTESDN-----

Query:  -HYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSGSGFDLN
          +S  + +F+S   YPGSP  +LISP      SG SSP P              F +  PP  L  +  +   W    GS S T    GS  GSG  L 
Subjt:  -HYSFPNDDFQSYHFYPGSPISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSGSGFDLN

Query:  PQTLESMSDHHATNESQNI------------QILIDGSQKEEPGAAN----------------HRFSFELSDEEGLLRSVESK
        P   +  S     N ++ +              L+D    E    AN                HR SFEL+ E+ + R + SK
Subjt:  PQTLESMSDHHATNESQNI------------QILIDGSQKEEPGAAN----------------HRFSFELSDEEGLLRSVESK

AT5G52430.1 hydroxyproline-rich glycoprotein family protein; e-value 3.4e-49; identity 39.31%
Query:  VNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAE--AHENLLQSPDNVLPFAAPPTSPASSFLQSEL
        VNN+ +T+ AAA AI TA+ R  + +  QK RWG C S+Y CFG+ K  KRIG AVLVPEP  S       +N   S   VLPF APP+SPA SFLQS+ 
Subjt:  VNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAE--AHENLLQSPDNVLPFAAPPTSPASSFLQSEL

Query:  SSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT---------TTPSSPEVPFAQVLQPTLQNTESDN----
        SS + SP   L   SLT+N +S P  P S+F +GP+A ETQ V+PP+ FS   TEPST P T         TTPSSPEVPFAQ+L  +L+ T  D+    
Subjt:  SSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLT---------TTPSSPEVPFAQVLQPTLQNTESDN----

Query:  --HYSFPNDDFQSYHFYPGSP-ISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSGSGFD
           +S  + +F+S    PGSP   +LISP SVIS SG SSP P         S    F +  PP  L  +  +   W    GS S T    GS   SG  
Subjt:  --HYSFPNDDFQSYHFYPGSP-ISHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSGSGFD

Query:  LNPQTLESMSDHHATNES--------QNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSD
        L P   E +S +   N +          +  L +     E   A+HR SFEL+ E+ + R + SK   S++  N                N R   E S 
Subjt:  LNPQTLESMSDHHATNES--------QNIQILIDGSQKEEPGAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSD

Query:  EDGLLRSVESKPLE-PNESQNIQILIH---GSQKE
           + R++E +  +  NE   IQ L     GS KE
Subjt:  EDGLLRSVESKPLE-PNESQNIQILIH---GSQKE
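In the alignments above, the unlabeled middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below shows how identities could be tallied from such midlines. The function names are hypothetical, and using the aligned length (gaps included) as the denominator is an assumption; this page does not say how its % identity column was computed.

```python
def count_matches(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST-style midline."""
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


def percent_identity(midlines: list[str], aligned_length: int) -> float:
    """Percent identity over all blocks of one alignment.

    aligned_length is the total number of alignment columns, supplied by
    the caller because trailing spaces in a midline may be lost in transit.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identities = sum(count_matches(m)[0] for m in midlines)
    return 100.0 * identities / aligned_length


# Toy midline, not taken from this record: 4 identities, 1 positive, 6 columns.
print(count_matches("M RR+D"))
print(round(percent_identity(["M RR+D"], 6), 2))
```

The aligned length is passed in explicitly rather than derived from the midline, because whitespace-only stretches (as in several blocks above) are easily truncated when the page is copied.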


Sequences
CDS sequence
ATGTTACGGCGGGCAGATGTTGATGCTACTGATCCGATGAGGTCTGTAAACAACACTTTTCAAACCATTACGGCGGCCGCCGATGCGATCGCCACCGCCGATCATCGATT
TCCTCGGGCTACTCCCGTCCAGAAAAGAAGATGGGGCAGCTGTTGTAGTATTTATTGGTGCTTTGGATCTCTTAAACAGAGGAAAAGAATTGGGCCTGCTGTCCTGGTCC
CAGAACCAAGTGATTCTTCAGCTGAGGCTCATGAAAATCTTTTGCAATCACCAGACAATGTGCTGCCTTTTGCTGCACCTCCCACTTCCCCTGCATCATCCTTCCTTCAA
TCAGAGCTATCTTCTGCTACACAATCACCTACAGCTCTACTCCCTTTCACTTCTCTCACGGCTAACATGTATTCTGATCCTGATGGGCCTTCCTCAATTTTTGCCATTGG
CCCATTTGCTTATGAAACTCAACTTGTGTCTCCACCTCTGAATTTCTCCAATCTCACCACTGAACCATCAACTCCTCCCTTGACTACTACTCCTTCTTCCCCTGAAGTTC
CTTTTGCTCAGGTTCTTCAGCCTACCCTTCAGAATACTGAGTCTGATAACCATTATTCATTTCCTAACGATGACTTTCAGTCTTATCATTTCTATCCCGGCAGCCCAATT
AGTCACCTCATATCGCCACGCTCTGTCATTTCTCGTTCTGGGGCATCGTCTCCTTTGCCAGACTTGGATTTCGCTTCCGCCTCTGGTTCTCAATTCTCTAATTTCACATT
GGAAGTTCCACCTGCGCTGTTGAACCTTGACACCCATTCCATTCTTAACTGGCGAAAAGGGCAAGGTTCTGATTCTTTCACTCAAAATTCTGTTGGGTCCATGTCGGGTA
GTGGTTTTGATTTGAATCCTCAAACTTTGGAATCTATGTCGGATCACCACGCAACAAATGAATCCCAAAATATTCAGATTCTCATTGATGGAAGCCAAAAGGAGGAGCCT
GGTGCTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGAGGGGTTATTAAGAAGTGTTGAAAGTAAGCCACTAGAATCAAATGAATCCAAAAATATTCAAATTCT
CATTGATGGAAGCCAAAAGGAGGAGCCTGGTGCTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATGGATTATTAAGAAGCGTAGAAAGTAAGCCACTGGAAC
CAAATGAATCCCAAAATATCCAAATTCTGATTCATGGAAGCCAAAAGGAGGATCCTAGTGCTGCTAATCATAGATTCTCATTTGAGTTATGTGATGAAGAGGGGTTATTA
AGAAGTGTTGAAAGTAAGCCACTGGAATCAAATGAATCCAAAAATATTCAAATTCTCATTGATGGAAGCCAAGAGGAGGAGCTTGCTGCTGCTAATCATAGATTCTCATT
TGAGTTATCTGATGAAGATGGATTATTAAGAAGTGTAGAAAGTAAGCCACTGGAACCAAATGAATCCCAAAATATCCAAATTCTGATTCATGGAAGCCAAAAGGAGGATC
CTAGTGCTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATGGTTTATTAAGAAGCGTAGAATGTAAGCCACTGGAACCAACTGAGTTCGCAGCGGCATCATCT
CCAATACACGAACCGTTTGAAATGGCTAAAGAAGATTCCCCTGTTGGTAGTCATATTTCAAATGGTACAGAAGAAAAGGAAATAGCAGAGGGAGAAGAAACAAAACAGCA
TCAAGCAAATCATAATCATTCCATTACTCTTCGGTCTGTAAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATGCATTTAAGCCTAATATCAACTCAGACTGGTGGG
CCAATGCTAAAGATGTCGAGAAAGAAGGTACAGCGACCGGGTCCTGGTCGTTCTTTCCATTGGCGCAACAATGA
mRNA sequence
ATGTTACGGCGGGCAGATGTTGATGCTACTGATCCGATGAGGTCTGTAAACAACACTTTTCAAACCATTACGGCGGCCGCCGATGCGATCGCCACCGCCGATCATCGATT
TCCTCGGGCTACTCCCGTCCAGAAAAGAAGATGGGGCAGCTGTTGTAGTATTTATTGGTGCTTTGGATCTCTTAAACAGAGGAAAAGAATTGGGCCTGCTGTCCTGGTCC
CAGAACCAAGTGATTCTTCAGCTGAGGCTCATGAAAATCTTTTGCAATCACCAGACAATGTGCTGCCTTTTGCTGCACCTCCCACTTCCCCTGCATCATCCTTCCTTCAA
TCAGAGCTATCTTCTGCTACACAATCACCTACAGCTCTACTCCCTTTCACTTCTCTCACGGCTAACATGTATTCTGATCCTGATGGGCCTTCCTCAATTTTTGCCATTGG
CCCATTTGCTTATGAAACTCAACTTGTGTCTCCACCTCTGAATTTCTCCAATCTCACCACTGAACCATCAACTCCTCCCTTGACTACTACTCCTTCTTCCCCTGAAGTTC
CTTTTGCTCAGGTTCTTCAGCCTACCCTTCAGAATACTGAGTCTGATAACCATTATTCATTTCCTAACGATGACTTTCAGTCTTATCATTTCTATCCCGGCAGCCCAATT
AGTCACCTCATATCGCCACGCTCTGTCATTTCTCGTTCTGGGGCATCGTCTCCTTTGCCAGACTTGGATTTCGCTTCCGCCTCTGGTTCTCAATTCTCTAATTTCACATT
GGAAGTTCCACCTGCGCTGTTGAACCTTGACACCCATTCCATTCTTAACTGGCGAAAAGGGCAAGGTTCTGATTCTTTCACTCAAAATTCTGTTGGGTCCATGTCGGGTA
GTGGTTTTGATTTGAATCCTCAAACTTTGGAATCTATGTCGGATCACCACGCAACAAATGAATCCCAAAATATTCAGATTCTCATTGATGGAAGCCAAAAGGAGGAGCCT
GGTGCTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGAGGGGTTATTAAGAAGTGTTGAAAGTAAGCCACTAGAATCAAATGAATCCAAAAATATTCAAATTCT
CATTGATGGAAGCCAAAAGGAGGAGCCTGGTGCTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATGGATTATTAAGAAGCGTAGAAAGTAAGCCACTGGAAC
CAAATGAATCCCAAAATATCCAAATTCTGATTCATGGAAGCCAAAAGGAGGATCCTAGTGCTGCTAATCATAGATTCTCATTTGAGTTATGTGATGAAGAGGGGTTATTA
AGAAGTGTTGAAAGTAAGCCACTGGAATCAAATGAATCCAAAAATATTCAAATTCTCATTGATGGAAGCCAAGAGGAGGAGCTTGCTGCTGCTAATCATAGATTCTCATT
TGAGTTATCTGATGAAGATGGATTATTAAGAAGTGTAGAAAGTAAGCCACTGGAACCAAATGAATCCCAAAATATCCAAATTCTGATTCATGGAAGCCAAAAGGAGGATC
CTAGTGCTGCTAATCATAGATTCTCATTTGAGTTATCTGATGAAGATGGTTTATTAAGAAGCGTAGAATGTAAGCCACTGGAACCAACTGAGTTCGCAGCGGCATCATCT
CCAATACACGAACCGTTTGAAATGGCTAAAGAAGATTCCCCTGTTGGTAGTCATATTTCAAATGGTACAGAAGAAAAGGAAATAGCAGAGGGAGAAGAAACAAAACAGCA
TCAAGCAAATCATAATCATTCCATTACTCTTCGGTCTGTAAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATGCATTTAAGCCTAATATCAACTCAGACTGGTGGG
CCAATGCTAAAGATGTCGAGAAAGAAGGTACAGCGACCGGGTCCTGGTCGTTCTTTCCATTGGCGCAACAATGA
Protein sequence
MLRRADVDATDPMRSVNNTFQTITAAADAIATADHRFPRATPVQKRRWGSCCSIYWCFGSLKQRKRIGPAVLVPEPSDSSAEAHENLLQSPDNVLPFAAPPTSPASSFLQ
SELSSATQSPTALLPFTSLTANMYSDPDGPSSIFAIGPFAYETQLVSPPLNFSNLTTEPSTPPLTTTPSSPEVPFAQVLQPTLQNTESDNHYSFPNDDFQSYHFYPGSPI
SHLISPRSVISRSGASSPLPDLDFASASGSQFSNFTLEVPPALLNLDTHSILNWRKGQGSDSFTQNSVGSMSGSGFDLNPQTLESMSDHHATNESQNIQILIDGSQKEEP
GAANHRFSFELSDEEGLLRSVESKPLESNESKNIQILIDGSQKEEPGAANHRFSFELSDEDGLLRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELCDEEGLL
RSVESKPLESNESKNIQILIDGSQEEELAAANHRFSFELSDEDGLLRSVESKPLEPNESQNIQILIHGSQKEDPSAANHRFSFELSDEDGLLRSVECKPLEPTEFAAASS
PIHEPFEMAKEDSPVGSHISNGTEEKEIAEGEETKQHQANHNHSITLRSVKEFNFDNGNGSDAFKPNINSDWWANAKDVEKEGTATGSWSFFPLAQQ
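
The protein sequence should follow from the CDS sequence by codon-wise translation. A minimal sketch of that check, assuming the standard nuclear genetic code (this record does not name a translation table); `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here:

```python
from itertools import product

# Standard genetic code in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS; line breaks and whitespace in the input are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing ambiguous bases are reported as 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First ten and last four codons of the CDS sequence above.
print(translate("ATGTTACGGCGGGCAGATGTTGATGCTACT"))  # MLRRADVDAT
print(translate("GCGCAACAATGA"))                    # AQQ*
```

Both fragments agree with the start (MLRRADVDAT) and end (AQQ, followed by the stop codon) of the protein sequence listed above.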