; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018642 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018642
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProline-rich protein 36-like
Genome locationChr04:6214188..6216477
RNA-Seq ExpressionHG10018642
SyntenyHG10018642
Gene Ontology termsNA
InterPro domainsIPR009646 - Root cap


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031326.1 proline-rich protein 36-like [Cucumis melo var. makuwa]6.8e-22571.56Show/hide
Query:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS
        M E TPPGIANNPSHATCKIKKYKHCYNL HVCPKFCP+QC+VECASCKPICGS  D  NPPPED       + P+PPS+TYYSPPPPVA       TPS
Subjt:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS

Query:  PPASDPTPSYSPPLPS--PTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPS-PTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPP
        PPAS+P+PS+SPPLPS  PTPVTPSPSPPPP + TPS P   PT S  PP+ S P P   +P   PP    PSP    P P+ + P  +PTP  P+ SPP
Subjt:  PPASDPTPSYSPPLPS--PTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPS-PTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPP

Query:  PPVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPS
        PP  VT +PP+ NP P  SPP      TPSP PPT + PP     +  TPP++SPPPP+TST P      NPNPPTSPP +H           PPSTPP+
Subjt:  PPVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPS

Query:  NPTPPENSTPPSTTPTNPNPPSTPPANPNPPT-------------------PSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRV
        NPTPPENS PPST PTNPN PSTPP+ PN P+                   PSTPN PS TPPSETPNSPP NTP S PQTPSPP S PPSSSAGA K V
Subjt:  NPTPPENSTPPSTTPTNPNPPSTPPANPNPPT-------------------PSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRV

Query:  KCKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSL
        +CKNVNYPQCYNMIH CPSACPNGC+VDCVTCKPVCHCDRPGAVCQDPR VGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSL
Subjt:  KCKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSL

Query:  AILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSF
        AILF+ HRL IAAQKTDVWDDSIDRL+I LD+HP+ALP SEGS++ HP ENPT+ I+RLAATNHVMVEAKGLFRITAKVVPIT+EDSRIH YGIEEGDSF
Subjt:  AILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSF

Query:  AHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI
        AHLDVGFKF+GLS++V+G LGQTYG GYVS +NVKAAMAVMGR EEFETSSLFAADCAVSRFG  GGVGGG+ETI
Subjt:  AHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI

TYK06779.1 proline-rich protein 36-like [Cucumis melo var. makuwa]1.7e-22070.14Show/hide
Query:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS
        M E TPPGIANNPSHATCKIKKYKHCYNL HVCPKFCP+QC+VECASCKPICGS  D  NPPPED       + P+PPS+TYYSPPPPVA       TPS
Subjt:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS

Query:  PPASDPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPV
        PPAS+P+PS+SPPLPSPT                                 PTPVTPSPSPPPP +  PS P                   P+ SPPPPV
Subjt:  PPASDPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPV

Query:  AVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPT
          TP P  +NP     P  P    TPSP PPT + PP     +  TPP++SPPPP+TST P      NPNPPTSPP +H           PPSTPP+NPT
Subjt:  AVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPT

Query:  PPENSTPPSTTPTNPNPPSTPPANPNPPTPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRVKCKNVNYPQCYNMIHTCPSACP
        PPENS PPST PTNPN PSTP        PSTPN PS TPPSETPNSPP NTP S PQTPSPP S PPSSSAGA K V+CKNVNYPQCYNMIH CPSACP
Subjt:  PPENSTPPSTTPTNPNPPSTPPANPNPPTPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRVKCKNVNYPQCYNMIHTCPSACP

Query:  NGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDS
        NGC+VDCVTCKPVCHCDRPGAVCQDPR VGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILF+ HRL IAAQKTDVWDDS
Subjt:  NGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDS

Query:  IDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQ
        IDRL+I LD+HP+ALP SEGS++ HP ENPT+ I+RLAATNHVMVEAKGLFRITAKVVPIT+EDSRIH YGIEEGDSFAHLDVGFKF+GLS++V+G LGQ
Subjt:  IDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQ

Query:  TYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI
        TYG GYVS +NVKAAMAVMGR EEFETSSLFAADCAVSRFG  GGVGGG+ETI
Subjt:  TYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI

XP_016901711.1 PREDICTED: LOW QUALITY PROTEIN: proline-rich protein 36-like [Cucumis melo]1.4e-21469.73Show/hide
Query:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS
        M E TPPGIANNPSHATCKIKKYKHCYNL HVCPKFCP+QC+VECASCKPICGS  D  NPPPED       + P+PPS+TYYSPPPPVA       TPS
Subjt:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS

Query:  PPASDPTPSYSPPLPS--PTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPP
        PPAS+P+PS+SPPLPS  PTPVTPSPSPPPP + TPS P   PT      L     +       P +  +      +       P    TP  P+ SPPP
Subjt:  PPASDPTPSYSPPLPS--PTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPP

Query:  PVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSN
        P  VT +PP+ NP P  SPP      TPSP PPT + PP     +  TPP++SPPPP+TST P      NPNPPTSPP +H           PPSTPP+N
Subjt:  PVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSN

Query:  PTPPENSTPPSTTPTNPNPPSTPPANPNPPT-------------------PSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRVK
        PTPPENS PPST PTNPN PSTPP+ PN P+                   PSTPN PS TPPSETPNSPP NTP S PQTPSPP S PPSSSAGA K V+
Subjt:  PTPPENSTPPSTTPTNPNPPSTPPANPNPPT-------------------PSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRVK

Query:  CKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLA
        CKNVNYPQCYNMIH CPSACPNGC+VDCVTCKPVCHCDRPGAVCQDPR VGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLA
Subjt:  CKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLA

Query:  ILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFA
        ILF+ HRL IAAQKTDVWDDSIDRL+I LD+HP+ALP SEGS++ HP ENPT+ I+RLAATNHVMVEAKGLFRITAKVVPIT+EDSRIH YGIEEGDSFA
Subjt:  ILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFA

Query:  HLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI
        HLDVGFKF+GLS++V+G LGQTYG GYVS +NVKAAMAVMGR EEFETSSLFAADCAVSRFG  GGVGGG+ETI
Subjt:  HLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI

XP_031744357.1 formin-like protein 20 [Cucumis sativus]1.9e-22772.15Show/hide
Query:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS
        M EATPPGIANNPSHATCKIKKYKHCYNL HVCPKFCP+QCYVECASCKPICGS  D  NPPPEDTPT        PPS+TYYSPPPPVA       TPS
Subjt:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS

Query:  PPASDPTPSYSPPLPS--PTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPP
        PPA +P+P +SPPLPS  PTPVTPSPSPPPPV  +PSP               P PVTPSPSPPPPV   PSP               P PVTPSPSPPP
Subjt:  PPASDPTPSYSPPLPS--PTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPP

Query:  PVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPP---TSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPS--
        PV  +PS              PSPPVTPSPSPP   T S  PPP + TPSTPP+ S PPP+TST P    PANPNPP SPP + P SPPPP +TNPPS  
Subjt:  PVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPP---TSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPS--

Query:  TPPS-NPTPPENSTPPSTTPTNP------NPPSTPPANPNPP--------TPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRV
        TPP+ +P PP  STPPS  P  P      +PPSTPP NP PP         P+ PN+P STPPSETPNSPP NTP  APQTPSPP S PPSSSAGA KRV
Subjt:  TPPS-NPTPPENSTPPSTTPTNP------NPPSTPPANPNPP--------TPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRV

Query:  KCKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSL
        +CKN  YPQCYNMIH CPSACPNGC+VDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTW++SL
Subjt:  KCKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSL

Query:  AILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSF
        AILF+ HRL IAAQKTDVWDDSIDRL+I LD+HP+ALP SEGS++ HP ENPTV+I+RLAATNHVMVEAKGLFRITAKVVPIT+EDSRIH YGIEEGDSF
Subjt:  AILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSF

Query:  AHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI
        AHLDVGFKF+ LSE+V+G LGQTYG GYVS +NVKAAMAVMGR +EFETSSLFAADCAVSRFG  G VGG +ETI
Subjt:  AHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI

XP_038887137.1 adhesive plaque matrix protein [Benincasa hispida]4.0e-23372.38Show/hide
Query:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGSDGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPSPP
        +AEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCP+QCYVECASCKPICGS   NPPPEDTPTPATPS  SPPS TYY                   
Subjt:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGSDGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPSPP

Query:  ASDPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAV
                              SPPPPVAVTPSPPASNPTPSYSPPLPSP+P TPSPSPP P                              SPPPPV  
Subjt:  ASDPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAV

Query:  TPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPP--PPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPT
         PSPP                      PPT+++PP  PP+    STPP+ S PPP+T + PPTTTP NPNPPTSP  ++P SPPPP NTNPPSTPPS+PT
Subjt:  TPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPP--PPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPT

Query:  PPENSTPPSTTPTNPNPPSTPPANPNPPTPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSP-------PTSPPSSSAGAAKRVKCKNVNYPQCYNMIHT
        PPEN  PPS    NPNPPSTPP NPNPPTP TPNSPS  PPS+TPNSP TNTPP APQTPSP       PT P SSSAGAAKRV+CKNVNYPQCYNMIHT
Subjt:  PPENSTPPSTTPTNPNPPSTPPANPNPPTPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSP-------PTSPPSSSAGAAKRVKCKNVNYPQCYNMIHT

Query:  CPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKT
        CPS CPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDG+TFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRL IAAQKT
Subjt:  CPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKT

Query:  DVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEV
        DVWDDSIDRL+I+LD+HP+ LPTSEGS+  HPAENPTVVIIRLAATNHV+VEAKGLFRITAKVVPIT EDSRIH YGIEEGDSFAHLDVGFKFYGLSEEV
Subjt:  DVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEV

Query:  SGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI
        +G LGQTYG  YVS VNVKAAMAVMGRAEEFETSSLFAADCAVS+F D G +G G+E I
Subjt:  SGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI

TrEMBL top hitse value%identityAlignment
A0A0A0K2J8 Uncharacterized protein1.5e-21762.67Show/hide
Query:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS
        M EATPPGIANNPSHATCKIKKYKHCYNL HVCPKFCP+QCYVECASCKPICGS  D  NPPPEDTPT        PPS+TYYSPPPPVA       TPS
Subjt:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS

Query:  PPASDPTPSYSPPLPSPT--------------------------------------------------------------PVTPSPSPPPPVAVTPSPP-
        PPA +P+P +SPPLPSPT                                                              PVTPSPSPPPPV  +PSPP 
Subjt:  PPASDPTPSYSPPLPSPT--------------------------------------------------------------PVTPSPSPPPPVAVTPSPP-

Query:  -----ASNP---TPSYSPPLP------SPTPVTPSPSPPPPVAVIPSPP------ASNP---TPSYSPPLP------SPTPVTPSPSPPPPVAVTPSPPA
              S P   TPS SPP P       P PVTPSPSPPPPV   PSPP       S P   TPS SPP P       P PVTPSPSPPPPV  +PSPP 
Subjt:  -----ASNP---TPSYSPPLP------SPTPVTPSPSPPPPVAVIPSPP------ASNP---TPSYSPPLP------SPTPVTPSPSPPPPVAVTPSPPA

Query:  S-NPTPSYSPPL---PSPPVTPSPSPP----------------------------------TSSSPPPPSTGTPSTPPSI----SPPPPI----------
           P+PS  PP+   PSPPVTPSPSPP                                   S SPPPP T +PS PP +    SPPPP+          
Subjt:  S-NPTPSYSPPL---PSPPVTPSPSPP----------------------------------TSSSPPPPSTGTPSTPPSI----SPPPPI----------

Query:  --------TSTSPPTTT---PANPNPPTSPPEDHPPSPPPPPNTNPPS--TPPS-NPTPPENSTPPSTTPTNP------NPPSTPPANPNPP--------
                TST PP T+   PANPNPP SPP + P SPPPP +TNPPS  TPP+ +P PP  STPPS  P  P      +PPSTPP NP PP        
Subjt:  --------TSTSPPTTT---PANPNPPTSPPEDHPPSPPPPPNTNPPS--TPPS-NPTPPENSTPPSTTPTNP------NPPSTPPANPNPP--------

Query:  TPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRVKCKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRF
         P+ PN+P STPPSETPNSPP NTP  APQTPSPP S PPSSSAGA KRV+CKN  YPQCYNMIH CPSACPNGC+VDCVTCKPVCHCDRPGAVCQDPRF
Subjt:  TPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRVKCKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRF

Query:  VGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAE
        VGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTW++SLAILF+ HRL IAAQKTDVWDDSIDRL+I LD+HP+ALP SEGS++ HP E
Subjt:  VGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAE

Query:  NPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETS
        NPTV+I+RLAATNHVMVEAKGLFRITAKVVPIT+EDSRIH YGIEEGDSFAHLDVGFKF+ LSE+V+G LGQTYG GYVS +NVKAAMAVMGR +EFETS
Subjt:  NPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETS

Query:  SLFAADCAVSRFGDRGGVGGGEETI
        SLFAADCAVSRFG  G VGG +ETI
Subjt:  SLFAADCAVSRFGDRGGVGGGEETI

A0A1S4E0F8 LOW QUALITY PROTEIN: proline-rich protein 36-like6.9e-21569.73Show/hide
Query:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS
        M E TPPGIANNPSHATCKIKKYKHCYNL HVCPKFCP+QC+VECASCKPICGS  D  NPPPED       + P+PPS+TYYSPPPPVA       TPS
Subjt:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS

Query:  PPASDPTPSYSPPLPS--PTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPP
        PPAS+P+PS+SPPLPS  PTPVTPSPSPPPP + TPS P   PT      L     +       P +  +      +       P    TP  P+ SPPP
Subjt:  PPASDPTPSYSPPLPS--PTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPP

Query:  PVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSN
        P  VT +PP+ NP P  SPP      TPSP PPT + PP     +  TPP++SPPPP+TST P      NPNPPTSPP +H           PPSTPP+N
Subjt:  PVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSN

Query:  PTPPENSTPPSTTPTNPNPPSTPPANPNPPT-------------------PSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRVK
        PTPPENS PPST PTNPN PSTPP+ PN P+                   PSTPN PS TPPSETPNSPP NTP S PQTPSPP S PPSSSAGA K V+
Subjt:  PTPPENSTPPSTTPTNPNPPSTPPANPNPPT-------------------PSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRVK

Query:  CKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLA
        CKNVNYPQCYNMIH CPSACPNGC+VDCVTCKPVCHCDRPGAVCQDPR VGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLA
Subjt:  CKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLA

Query:  ILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFA
        ILF+ HRL IAAQKTDVWDDSIDRL+I LD+HP+ALP SEGS++ HP ENPT+ I+RLAATNHVMVEAKGLFRITAKVVPIT+EDSRIH YGIEEGDSFA
Subjt:  ILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFA

Query:  HLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI
        HLDVGFKF+GLS++V+G LGQTYG GYVS +NVKAAMAVMGR EEFETSSLFAADCAVSRFG  GGVGGG+ETI
Subjt:  HLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI

A0A5A7SMM6 Proline-rich protein 36-like3.3e-22571.56Show/hide
Query:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS
        M E TPPGIANNPSHATCKIKKYKHCYNL HVCPKFCP+QC+VECASCKPICGS  D  NPPPED       + P+PPS+TYYSPPPPVA       TPS
Subjt:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS

Query:  PPASDPTPSYSPPLPS--PTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPS-PTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPP
        PPAS+P+PS+SPPLPS  PTPVTPSPSPPPP + TPS P   PT S  PP+ S P P   +P   PP    PSP    P P+ + P  +PTP  P+ SPP
Subjt:  PPASDPTPSYSPPLPS--PTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPS-PTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPP

Query:  PPVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPS
        PP  VT +PP+ NP P  SPP      TPSP PPT + PP     +  TPP++SPPPP+TST P      NPNPPTSPP +H           PPSTPP+
Subjt:  PPVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPS

Query:  NPTPPENSTPPSTTPTNPNPPSTPPANPNPPT-------------------PSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRV
        NPTPPENS PPST PTNPN PSTPP+ PN P+                   PSTPN PS TPPSETPNSPP NTP S PQTPSPP S PPSSSAGA K V
Subjt:  NPTPPENSTPPSTTPTNPNPPSTPPANPNPPT-------------------PSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRV

Query:  KCKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSL
        +CKNVNYPQCYNMIH CPSACPNGC+VDCVTCKPVCHCDRPGAVCQDPR VGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSL
Subjt:  KCKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSL

Query:  AILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSF
        AILF+ HRL IAAQKTDVWDDSIDRL+I LD+HP+ALP SEGS++ HP ENPT+ I+RLAATNHVMVEAKGLFRITAKVVPIT+EDSRIH YGIEEGDSF
Subjt:  AILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSF

Query:  AHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI
        AHLDVGFKF+GLS++V+G LGQTYG GYVS +NVKAAMAVMGR EEFETSSLFAADCAVSRFG  GGVGGG+ETI
Subjt:  AHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI

A0A5D3C8U8 Proline-rich protein 36-like8.4e-22170.14Show/hide
Query:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS
        M E TPPGIANNPSHATCKIKKYKHCYNL HVCPKFCP+QC+VECASCKPICGS  D  NPPPED       + P+PPS+TYYSPPPPVA       TPS
Subjt:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGS--DGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPS

Query:  PPASDPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPV
        PPAS+P+PS+SPPLPSPT                                 PTPVTPSPSPPPP +  PS P                   P+ SPPPPV
Subjt:  PPASDPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPV

Query:  AVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPT
          TP P  +NP     P  P    TPSP PPT + PP     +  TPP++SPPPP+TST P      NPNPPTSPP +H           PPSTPP+NPT
Subjt:  AVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPT

Query:  PPENSTPPSTTPTNPNPPSTPPANPNPPTPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRVKCKNVNYPQCYNMIHTCPSACP
        PPENS PPST PTNPN PSTP        PSTPN PS TPPSETPNSPP NTP S PQTPSPP S PPSSSAGA K V+CKNVNYPQCYNMIH CPSACP
Subjt:  PPENSTPPSTTPTNPNPPSTPPANPNPPTPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTS-PPSSSAGAAKRVKCKNVNYPQCYNMIHTCPSACP

Query:  NGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDS
        NGC+VDCVTCKPVCHCDRPGAVCQDPR VGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILF+ HRL IAAQKTDVWDDS
Subjt:  NGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDS

Query:  IDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQ
        IDRL+I LD+HP+ALP SEGS++ HP ENPT+ I+RLAATNHVMVEAKGLFRITAKVVPIT+EDSRIH YGIEEGDSFAHLDVGFKF+GLS++V+G LGQ
Subjt:  IDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQ

Query:  TYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI
        TYG GYVS +NVKAAMAVMGR EEFETSSLFAADCAVSRFG  GGVGGG+ETI
Subjt:  TYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI

A0A6J1I4T7 mucin-21.1e-19967.48Show/hide
Query:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGSDGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPSPP
        MA+ATPPGIA NPSHA+CKIKKYKHCYNL+HVCPKFCPDQC VECASCKPICG D  NPPPED PTPAT   PSPPS+ YYSPPPPV        TPSPP
Subjt:  MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGSDGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPSPP

Query:  ASDPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAV
         S+PTPSYSPPLPSPTPVTPSPSPP      P+PP++ PT SY P   +P                PS P ++P P        PTP TPS  P      
Subjt:  ASDPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAV

Query:  TPSPPASNPTPSYSPPLPSPPVTP--SPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPT
         P+PP+S PT SY P   +PP +P  SP+PPT S+P  P+   PSTPP+ S PP   + +PP+T P +PNPPT         P  P N NPPSTPP++  
Subjt:  TPSPPASNPTPSYSPPLPSPPVTP--SPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPT

Query:  PPENSTPPSTTPTNPNPP--STPPANPNPPT--PSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPP-TSPPSSSAGAAKRVKCKNVNYPQCYNMIHTCP
        PP N  PPS    NPNPP  S PP NPNPP+  PSTP +  S P   TP++PPT++ P  P  P+PP T PPSSS GAAKRV+CKN NYPQCYNMIHTCP
Subjt:  PPENSTPPSTTPTNPNPP--STPPANPNPPT--PSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPP-TSPPSSSAGAAKRVKCKNVNYPQCYNMIHTCP

Query:  SACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDV
        SACPNGC+VDCVTCKPVCHCDRPGAVCQDPRF+GGDGITFYFHG+KDKDFCLVSDPNLHINAHFIGKRNPSL RDFTWVQSL ILF+THRL I+AQKT V
Subjt:  SACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDV

Query:  WDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSG
        WDDSIDRL+IAL++ PVALP SEGS+  HP ENPTVVI+RL A NHVMVEAKGLFRITAKVVPIT+EDSR+H YGI+EGDSFAHLDVGFKF+ LS  V+G
Subjt:  WDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSG

Query:  FLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGG-GEETI
         LGQTYG GYVS VN+KAAM VMGR +EFETSSLFAADCAV++FG  GG GG G E +
Subjt:  FLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGG-GEETI

SwissProt top hitse value%identityAlignment
P13983 Extensin7.3e-0446Show/hide
Query:  PPEDTPTPATPSPPSP---PSETYYSPPPPVACALRVPPTPSPPASDPTPSYSPPLP----SPTPVTPSPSPPPPVAVT--PSPPASNPTPSYSPPLPS-
        PP   P+P    PPSP   P    YSPPPP   A    P PSP  S P P+YSPP P    SP P   SPSPPP    T  P PPA +P P+YSPP P+ 
Subjt:  PPEDTPTPATPSPPSP---PSETYYSPPPPVACALRVPPTPSPPASDPTPSYSPPLP----SPTPVTPSPSPPPPVAVT--PSPPASNPTPSYSPPLPS-

Query:  -PTPVTPSPSPPPPVAVIPSPPA-SNPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPS
         P P +P  SPPPPV   P PP+ S P P+Y PP P  +P  PS SPPPP     SPP   P P+YSPPLP+PP T SP PPT S PPP     P  PP+
Subjt:  -PTPVTPSPSPPPPVAVIPSPPA-SNPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPS

Query:  ISPPPPITSTSPPTTTPANPNPPT-SPP-----EDHPP----SPPPPPNTNPPSTPPSNPTPPE-NSTPPSTTPTNP---NPPSTPPANPNPPTPSTPNS
         SPPPP  S  PP T   +P PPT SPP     +  PP    SPPPP  + PP +P  +P PP+    PP+ +P  P   + P  P   P PPTP+    
Subjt:  ISPPPPITSTSPPTTTPANPNPPT-SPP-----EDHPP----SPPPPPNTNPPSTPPSNPTPPE-NSTPPSTTPTNP---NPPSTPPANPNPPTPSTPNS

Query:  PS----STPPSETPNSPPTNTPPSAPQTPSP----PTSPPSSSAGAAKRV
        PS    S PP    +SPP   P   P+TP+P    P SPP+ SA   +++
Subjt:  PS----STPPSETPNSPPTNTPPSAPQTPSP----PTSPPSSSAGAAKRV

Q9FPQ6 Vegetative cell wall protein gp11.6e-1146.24Show/hide
Query:  YVECASCKPICGSDGTNPPPEDTPTPATPSP----PSPPSETYYSPPPPVACALRVPPTPSPPASDPTPSYSPPLPS-PTPVTPSPSPPPPVAVTPSPPA
        +V  A+ + + G     PP    P+PA PSP    P+PPS    SP PP + A   PP+P+PP+  P PS +PP P+ P+P  PSP+PP P   +P+PP+
Subjt:  YVECASCKPICGSDGTNPPPEDTPTPATPSP----PSPPSETYYSPPPPVACALRVPPTPSPPASDPTPSYSPPLPS-PTPVTPSPSPPPPVAVTPSPPA

Query:  SNPTPSYSPPLPS-PTPVTPS-PSPPPPVAVIPSPPASNP--TPSYSPPLPSPTPVTPSPSPP---PPVAVTPSPPASNP--TPSYSPPLPSPPVTPSPS
            PS +PP PS P P +PS PSP PP+   P+PP+ +P   PS SPP+P P+P  PSP+PP   PPV  +P+PP+  P   PS +PP P+PPV PSP+
Subjt:  SNPTPSYSPPLPS-PTPVTPS-PSPPPPVAVIPSPPASNP--TPSYSPPLPSPTPVTPSPSPP---PPVAVTPSPPASNP--TPSYSPPLPSPPVTPSPS

Query:  PPTSSSPPPPSTGTPSTPPSISPPPPIT----STSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPTPPENSTPPST--TPTNPNPPSTPPA
        PP+  SP PPS  +P+ PPS SPP P +    S +PP+  P +P PP  PP   PPSPPPPP   PP  P + P PP   +PP +   PT P PPS  P 
Subjt:  PPTSSSPPPPSTGTPSTPPSISPPPPIT----STSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPTPPENSTPPST--TPTNPNPPSTPPA

Query:  NPNPPTPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTSPPSS
        +P PP+P+ P  PS  PPS  P+ PP+  PP+   +PSP  SP  S
Subjt:  NPNPPTPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTSPPSS

Arabidopsis top hitse value%identityAlignment
AT3G19430.1 late embryogenesis abundant protein-related / LEA protein-related1.4e-12750Show/hide
Query:  TPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGSDGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPSPPASDP
        TPPGIA NPSHATCKIKKYKHCYNLEHVCPKFCPD C+VECASCKPICG      PP  +P        S   +  Y+PP PV      PPTPS P+  P
Subjt:  TPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGSDGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPSPPASDP

Query:  TPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSP
        TP  SPP P+PTP  PSP+PP                  SPP P+PTP  PSP+PP                  SPP P+PTP  PSP+PP       SP
Subjt:  TPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSP

Query:  PASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPTPPENST
        P   PTPS   P P  P  P PSPP   SPPPP T TPS P            SPP  T   P PPT       PS P PP+  P  TPP+   P    +
Subjt:  PASNPTPSYSPPLPSPPVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPTPPENST

Query:  PPSTTPTNPNPPSTPPANPNPPTPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTSPPSSSAGAAKRVKCKNVNYPQCYNMIHTCPSACPNGCEVDC
        PP  TPT P PPS        PTPS               SPP   PPS               A  AKRV+CK    P CY + +TCP+ CP  C+VDC
Subjt:  PPSTTPTNPNPPSTPPANPNPPTPSTPNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTSPPSSSAGAAKRVKCKNVNYPQCYNMIHTCPSACPNGCEVDC

Query:  VTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDSIDRLSIA
        VTCKPVC+CD+PG+VCQDPRF+GGDG+TFYFHGKKD +FCL+SDPNLHINAHFIGKR   + RDFTWVQS+AILF THRL++ A KT  WDDS+DR++++
Subjt:  VTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDSIDRLSIA

Query:  LDNHPVALPTSEGSR-LHHPAENPTVVIIRL-AATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQTYGGG
         D + ++LP  +G+R    P   P V + R+   TN++ VE +GL +ITA+VVPIT EDSRIH Y ++E D  AHLD+GFKF  LS+ V G LGQTY   
Subjt:  LDNHPVALPTSEGSR-LHHPAENPTVVIIRL-AATNHVMVEAKGLFRITAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQTYGGG

Query:  YVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI
        YVS V +   M VMG   EF+T+ LFA DC+ +RF   G    G   +
Subjt:  YVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI

AT5G54370.1 Late embryogenesis abundant (LEA) protein-related3.1e-5841.55Show/hide
Query:  AKRVKCKNVNYPQCYNMIHTCPSACPNG---------CEVDC--VTCKPVC-----HCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINA
        A  V C N  Y +CY     CP  CP+          C  DC   TCK  C     +C+RPG+ C DPRF+GGDGI FYFHGK +++F LVSD +L IN 
Subjt:  AKRVKCKNVNYPQCYNMIHTCPSACPNG---------CEVDC--VTCKPVC-----HCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINA

Query:  HFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVV
         FIG R     RDFTW+Q+L  LF++++  + A KT  WD+ ID L  + D   +++P    S  + P  N  + I R++  N V+V  K    I   VV
Subjt:  HFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVV

Query:  PITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGD
        P+T+ED RIH Y +   D FAHL+V F+F+ LS +V G LG+TY   + +P     AM V+G  + F+TSSL + DC    F +
Subjt:  PITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGD

AT5G60520.1 Late embryogenesis abundant (LEA) protein-related2.3e-6142.71Show/hide
Query:  KRVKCKNVNYPQCYNMIHTCPSACP----------NGCEVDC-----VTCK-PVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAH
        +RV+C  +    C   I TCP  CP            C +DC     VTCK    +C+  G++C DPRFVGGDG+ FYFHG KD +F +VSD NL INAH
Subjt:  KRVKCKNVNYPQCYNMIHTCPSACP----------NGCEVDC-----VTCK-PVCHCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAH

Query:  FIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVP
        FIG R     RDFTWVQ+ +++F +H L IAA+K   WDDS+D L +  +   V +PT   +      +   V++ R    N+V V   G+ +I  +V P
Subjt:  FIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRITAKVVP

Query:  ITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVG
        I +E+ R+HKY + + D+FAHL+  FKF+ LS+ V G LG+TY  GYVSPV     M +MG  ++++T SLF+  C V RF  + G G
Subjt:  ITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVG

AT5G60530.1 late embryogenesis abundant protein-related / LEA protein-related1.2e-5440.37Show/hide
Query:  CYNMIHTCPSACP----------NGCEVDCVT-CKPVC-----HCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRD
        CY     CP  CP           GC +DC   C+  C     +C+  G++C DPRFVGGDG+ FYFHG K  +F +VSD NL INAHFIG R     RD
Subjt:  CYNMIHTCPSACP----------NGCEVDCVT-CKPVC-----HCDRPGAVCQDPRFVGGDGITFYFHGKKDKDFCLVSDPNLHINAHFIGKRNPSLKRD

Query:  FTWVQSLAILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHH-PAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKY
        FTWVQ+L ++F  H+L I A + + WD++ D  +I  D   + LP  E S       +   ++I R    N V V    L ++  +V PI +E++R+H Y
Subjt:  FTWVQSLAILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHH-PAENPTVVIIRLAATNHVMVEAKGLFRITAKVVPITQEDSRIHKY

Query:  GIEEGDSFAHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRF
         + + D+FAHL+  FKF  LSE V G LG+TY   YVS       M V+G  ++++T SLF+  C + RF
Subjt:  GIEEGDSFAHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAGGCGACACCGCCGGGGATTGCTAATAATCCGAGCCATGCAACGTGCAAGATTAAAAAGTATAAACATTGTTATAATTTGGAACATGTTTGTCCCAAGTTTTG
TCCTGATCAATGTTATGTTGAATGTGCCTCTTGTAAGCCTATTTGTGGTAGTGATGGTACCAATCCTCCACCGGAAGATACTCCCACTCCGGCCACCCCTTCTCCGCCGT
CACCTCCATCGGAAACTTATTACTCGCCTCCACCTCCGGTGGCTTGCGCTCTCAGAGTCCCTCCAACTCCGAGCCCTCCTGCTTCAGACCCTACTCCCTCATACTCTCCT
CCATTGCCTTCACCCACACCGGTAACTCCTTCTCCTTCGCCTCCACCACCGGTGGCTGTAACTCCGAGCCCTCCTGCTTCGAACCCTACTCCCTCATACTCTCCTCCATT
ACCTTCTCCAACACCAGTAACTCCTTCTCCTTCCCCTCCACCTCCAGTGGCTGTAATTCCGAGCCCTCCTGCTTCAAACCCTACTCCCTCATACTCTCCTCCATTGCCTT
CACCGACACCGGTAACTCCATCTCCTTCGCCTCCACCCCCGGTGGCTGTAACTCCGAGCCCTCCTGCTTCGAACCCTACTCCCTCATACTCTCCTCCATTGCCTTCACCA
CCGGTAACTCCTTCTCCTTCCCCTCCAACTTCGTCATCTCCACCACCTCCATCAACTGGAACTCCAAGCACCCCTCCATCTATTTCGCCACCTCCACCGATTACTTCAAC
ATCACCTCCAACAACCACTCCTGCAAATCCCAATCCTCCAACATCGCCTCCGGAGGACCACCCTCCATCACCACCGCCTCCACCAAATACCAACCCCCCCTCAACACCAC
CATCAAACCCCACCCCTCCAGAAAATTCCACTCCTCCATCAACAACTCCAACAAACCCTAACCCTCCATCAACACCTCCAGCAAATCCCAATCCTCCAACACCATCCACT
CCAAATTCTCCATCGTCGACACCACCGTCCGAAACTCCCAACTCCCCACCTACTAACACTCCTCCGTCAGCTCCTCAAACACCATCTCCTCCAACTTCTCCACCATCTTC
CTCGGCGGGTGCAGCAAAGAGAGTGAAATGCAAAAATGTGAATTATCCTCAATGTTACAACATGATTCACACTTGTCCTAGCGCTTGCCCTAATGGATGTGAAGTTGATT
GTGTTACCTGCAAACCCGTCTGTCATTGTGACAGACCAGGAGCAGTATGCCAAGACCCACGTTTCGTCGGCGGCGACGGCATAACCTTCTACTTCCACGGCAAAAAGGAC
AAGGATTTCTGTCTAGTTTCCGATCCCAACCTCCACATCAACGCCCATTTCATCGGAAAACGAAACCCTTCCTTAAAACGAGACTTCACTTGGGTCCAATCCCTAGCTAT
CCTCTTCCACACTCACCGCCTCTTCATTGCCGCGCAAAAGACCGACGTTTGGGATGATTCCATCGACCGCCTCTCTATTGCCCTCGACAACCACCCGGTGGCCCTTCCAA
CATCCGAAGGCAGTCGTTTGCACCACCCTGCCGAAAATCCCACCGTTGTCATCATCCGACTAGCCGCGACAAACCACGTGATGGTGGAAGCCAAAGGGCTGTTCAGAATC
ACAGCCAAGGTGGTGCCAATAACACAAGAGGACTCAAGGATTCACAAATATGGGATAGAGGAAGGGGATTCGTTCGCTCATTTGGATGTAGGGTTTAAGTTTTATGGTTT
GAGTGAGGAAGTTAGTGGGTTTTTGGGGCAGACTTATGGGGGTGGATATGTGAGTCCTGTCAATGTGAAGGCGGCTATGGCAGTCATGGGGAGGGCGGAGGAGTTTGAAA
CTTCTAGCTTGTTTGCGGCGGATTGTGCTGTCTCTAGATTTGGCGACCGCGGCGGCGTTGGCGGCGGAGAGGAGACTATATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAGGCGACACCGCCGGGGATTGCTAATAATCCGAGCCATGCAACGTGCAAGATTAAAAAGTATAAACATTGTTATAATTTGGAACATGTTTGTCCCAAGTTTTG
TCCTGATCAATGTTATGTTGAATGTGCCTCTTGTAAGCCTATTTGTGGTAGTGATGGTACCAATCCTCCACCGGAAGATACTCCCACTCCGGCCACCCCTTCTCCGCCGT
CACCTCCATCGGAAACTTATTACTCGCCTCCACCTCCGGTGGCTTGCGCTCTCAGAGTCCCTCCAACTCCGAGCCCTCCTGCTTCAGACCCTACTCCCTCATACTCTCCT
CCATTGCCTTCACCCACACCGGTAACTCCTTCTCCTTCGCCTCCACCACCGGTGGCTGTAACTCCGAGCCCTCCTGCTTCGAACCCTACTCCCTCATACTCTCCTCCATT
ACCTTCTCCAACACCAGTAACTCCTTCTCCTTCCCCTCCACCTCCAGTGGCTGTAATTCCGAGCCCTCCTGCTTCAAACCCTACTCCCTCATACTCTCCTCCATTGCCTT
CACCGACACCGGTAACTCCATCTCCTTCGCCTCCACCCCCGGTGGCTGTAACTCCGAGCCCTCCTGCTTCGAACCCTACTCCCTCATACTCTCCTCCATTGCCTTCACCA
CCGGTAACTCCTTCTCCTTCCCCTCCAACTTCGTCATCTCCACCACCTCCATCAACTGGAACTCCAAGCACCCCTCCATCTATTTCGCCACCTCCACCGATTACTTCAAC
ATCACCTCCAACAACCACTCCTGCAAATCCCAATCCTCCAACATCGCCTCCGGAGGACCACCCTCCATCACCACCGCCTCCACCAAATACCAACCCCCCCTCAACACCAC
CATCAAACCCCACCCCTCCAGAAAATTCCACTCCTCCATCAACAACTCCAACAAACCCTAACCCTCCATCAACACCTCCAGCAAATCCCAATCCTCCAACACCATCCACT
CCAAATTCTCCATCGTCGACACCACCGTCCGAAACTCCCAACTCCCCACCTACTAACACTCCTCCGTCAGCTCCTCAAACACCATCTCCTCCAACTTCTCCACCATCTTC
CTCGGCGGGTGCAGCAAAGAGAGTGAAATGCAAAAATGTGAATTATCCTCAATGTTACAACATGATTCACACTTGTCCTAGCGCTTGCCCTAATGGATGTGAAGTTGATT
GTGTTACCTGCAAACCCGTCTGTCATTGTGACAGACCAGGAGCAGTATGCCAAGACCCACGTTTCGTCGGCGGCGACGGCATAACCTTCTACTTCCACGGCAAAAAGGAC
AAGGATTTCTGTCTAGTTTCCGATCCCAACCTCCACATCAACGCCCATTTCATCGGAAAACGAAACCCTTCCTTAAAACGAGACTTCACTTGGGTCCAATCCCTAGCTAT
CCTCTTCCACACTCACCGCCTCTTCATTGCCGCGCAAAAGACCGACGTTTGGGATGATTCCATCGACCGCCTCTCTATTGCCCTCGACAACCACCCGGTGGCCCTTCCAA
CATCCGAAGGCAGTCGTTTGCACCACCCTGCCGAAAATCCCACCGTTGTCATCATCCGACTAGCCGCGACAAACCACGTGATGGTGGAAGCCAAAGGGCTGTTCAGAATC
ACAGCCAAGGTGGTGCCAATAACACAAGAGGACTCAAGGATTCACAAATATGGGATAGAGGAAGGGGATTCGTTCGCTCATTTGGATGTAGGGTTTAAGTTTTATGGTTT
GAGTGAGGAAGTTAGTGGGTTTTTGGGGCAGACTTATGGGGGTGGATATGTGAGTCCTGTCAATGTGAAGGCGGCTATGGCAGTCATGGGGAGGGCGGAGGAGTTTGAAA
CTTCTAGCTTGTTTGCGGCGGATTGTGCTGTCTCTAGATTTGGCGACCGCGGCGGCGTTGGCGGCGGAGAGGAGACTATATGA
Protein sequenceShow/hide protein sequence
MAEATPPGIANNPSHATCKIKKYKHCYNLEHVCPKFCPDQCYVECASCKPICGSDGTNPPPEDTPTPATPSPPSPPSETYYSPPPPVACALRVPPTPSPPASDPTPSYSP
PLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVIPSPPASNPTPSYSPPLPSPTPVTPSPSPPPPVAVTPSPPASNPTPSYSPPLPSP
PVTPSPSPPTSSSPPPPSTGTPSTPPSISPPPPITSTSPPTTTPANPNPPTSPPEDHPPSPPPPPNTNPPSTPPSNPTPPENSTPPSTTPTNPNPPSTPPANPNPPTPST
PNSPSSTPPSETPNSPPTNTPPSAPQTPSPPTSPPSSSAGAAKRVKCKNVNYPQCYNMIHTCPSACPNGCEVDCVTCKPVCHCDRPGAVCQDPRFVGGDGITFYFHGKKD
KDFCLVSDPNLHINAHFIGKRNPSLKRDFTWVQSLAILFHTHRLFIAAQKTDVWDDSIDRLSIALDNHPVALPTSEGSRLHHPAENPTVVIIRLAATNHVMVEAKGLFRI
TAKVVPITQEDSRIHKYGIEEGDSFAHLDVGFKFYGLSEEVSGFLGQTYGGGYVSPVNVKAAMAVMGRAEEFETSSLFAADCAVSRFGDRGGVGGGEETI