; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0022685 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0022685
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationchr04:30526571..30527558
RNA-Seq ExpressionPay0022685
SyntenyPay0022685
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043818.1 protein YLS9 [Cucumis melo var. makuwa]5.9e-16098.63Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHP YPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGT+SAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG

Query:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK
        FLRGIVAALILLVAIMTLSSIITWIILRPE+PVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK
Subjt:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK

Query:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV
        SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRV CEDMKLVFTGPAAGHAVYLADEHSKTCSVLV
Subjt:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV

XP_008442912.1 PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo]5.9e-16098.63Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHP YPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGT+SAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG

Query:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK
        FLRGIVAALILLVAIMTLSSIITWIILRPE+PVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK
Subjt:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK

Query:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV
        SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRV CEDMKLVFTGPAAGHAVYLADEHSKTCSVLV
Subjt:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV

XP_011652032.1 uncharacterized protein LOC105434983 [Cucumis sativus]5.0e-14388.01Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GH GYSPAMGYPP P PGYPPA GNYPPYN YYAQAPPAAYYNNPQNYRA TVSAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG

Query:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK
        FLRGIV ALILLVA+MTLSSIITWI+LRP+IPVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+YK+NTLAMSYADPFF+DVEK
Subjt:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK

Query:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVL
        S QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLRFFAWT FR+GSWWTRR+VM+VFCED+KL FTGPAA H VYLAD HSKTCSVL
Subjt:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVL

XP_022983003.1 uncharacterized protein LOC111481675 [Cucurbita maxima]3.2e-11374.84Show/hide
Query:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAY-YAQAPPAAYY-------NNP
        MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PP  H GY PAMGYPPAPHPGYPPA GNYPPYNAY Y QAPPAAYY       NNP
Subjt:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAY-YAQAPPAAYY-------NNP

Query:  QNYRAGTVSAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMS
        Q YR  T  AGFLRGI AAL+LLV IMT+SSIITWIILRPEIP FKVDSFSV+NFNISK NYSG WD  VTVQNPNHKLN++ ERI+SFVDY  NT+A S
Subjt:  QNYRAGTVSAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMS

Query:  YADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFR--TGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSK
        ++DPFFLD+EKS QM VK+TSSSPDDPGNW++TEEKL RERATGTVSF LR  AWTTFR  +GS WTRRV++RVFCED+KLVFTG      VY    H K
Subjt:  YADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFR--TGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSK

Query:  TCSVLV
        TC VLV
Subjt:  TCSVLV

XP_038905898.1 uncharacterized protein LOC120091828 [Benincasa hispida]7.5e-13182.59Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG
        MASSS+D QSQSKATDPPP  P SAGNNPPPVYPPPTLGYPPPQGH  Y PAMGYPPAPHPGYPPA GNYPPYN YYAQAPPAAYYNN QNYRA TV+ G
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG

Query:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK
        FLRGIV ALIL VAIMTLSSI+TWIILRPEIPVF++DSFSV NFNISK NYSGNWD ++TVQNPNH+LNVN+ER+QSFVDYK NTLAMSY DPFFLDVEK
Subjt:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK

Query:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV
        S QM+VKLTSSSPDDPG+W ETE+KLG+E+ATGTVSFNLRF AWTTFR GSWWTRRVV+RVFCED+KLVF GPAAG  VY  + + K CSVL+
Subjt:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV

TrEMBL top hitse value%identityAlignment
A0A0A0LGS8 Uncharacterized protein2.4e-14388.01Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GH GYSPAMGYPP P PGYPPA GNYPPYN YYAQAPPAAYYNNPQNYRA TVSAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG

Query:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK
        FLRGIV ALILLVA+MTLSSIITWI+LRP+IPVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+YK+NTLAMSYADPFF+DVEK
Subjt:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK

Query:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVL
        S QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLRFFAWT FR+GSWWTRR+VM+VFCED+KL FTGPAA H VYLAD HSKTCSVL
Subjt:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVL

A0A1S3B6W4 uncharacterized protein LOC1034866742.9e-16098.63Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHP YPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGT+SAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG

Query:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK
        FLRGIVAALILLVAIMTLSSIITWIILRPE+PVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK
Subjt:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK

Query:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV
        SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRV CEDMKLVFTGPAAGHAVYLADEHSKTCSVLV
Subjt:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV

A0A5A7TLT1 Protein YLS92.9e-16098.63Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHP YPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGT+SAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAG

Query:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK
        FLRGIVAALILLVAIMTLSSIITWIILRPE+PVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK
Subjt:  FLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEK

Query:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV
        SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRV CEDMKLVFTGPAAGHAVYLADEHSKTCSVLV
Subjt:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV

A0A6J1F415 uncharacterized protein LOC1114421887.6e-11374.75Show/hide
Query:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAY-YAQAPPAAYY------NNPQ
        MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PP  H GY PAMGYPPAPHPGYPPA GNYPPYNAY Y QAPPAAYY      NNPQ
Subjt:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAY-YAQAPPAAYY------NNPQ

Query:  NYRAGTVSAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSY
         YR  T  AGFLRGI AAL+LLV IMT+SSIITWIILRPEIP FKVDSFSV+NFNISK NYSG WD  VTVQNPNHKLN++ ERI+SFVDY  NT+A S+
Subjt:  NYRAGTVSAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSY

Query:  ADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFR--TGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKT
        +DPFFLD+EKS QM+VK+TSSSPDDPGNW +TEEKL RER TGTVSF LR  AWTTFR  +GS WTRRV++RVFCED+KLVFTG      VY     SKT
Subjt:  ADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFR--TGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKT

Query:  CSVLV
        C VLV
Subjt:  CSVLV

A0A6J1J6I9 uncharacterized protein LOC1114816751.5e-11374.84Show/hide
Query:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAY-YAQAPPAAYY-------NNP
        MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PP  H GY PAMGYPPAPHPGYPPA GNYPPYNAY Y QAPPAAYY       NNP
Subjt:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAY-YAQAPPAAYY-------NNP

Query:  QNYRAGTVSAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMS
        Q YR  T  AGFLRGI AAL+LLV IMT+SSIITWIILRPEIP FKVDSFSV+NFNISK NYSG WD  VTVQNPNHKLN++ ERI+SFVDY  NT+A S
Subjt:  QNYRAGTVSAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMS

Query:  YADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFR--TGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSK
        ++DPFFLD+EKS QM VK+TSSSPDDPGNW++TEEKL RERATGTVSF LR  AWTTFR  +GS WTRRV++RVFCED+KLVFTG      VY    H K
Subjt:  YADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFR--TGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSK

Query:  TCSVLV
        TC VLV
Subjt:  TCSVLV

SwissProt top hitse value%identityAlignment
Q9SJ52 NDR1/HIN1-like protein 102.5e-0425.59Show/hide
Query:  PPYNAYYAQA--PPA--AYYNNPQNYRAGTVSAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISK----LNYSGNWDASVTVQ
        P   A+Y  +  PPA   YY        G      L   V  +I L+ I+ ++++I W+I+RP    F V   S++ F+ +     L Y  N   +V V+
Subjt:  PPYNAYYAQA--PPA--AYYNNPQNYRAGTVSAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISK----LNYSGNWDASVTVQ

Query:  NPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWL-----ETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRV
        NPN ++ +  +RI++   Y+    +     PF+       Q     T  +P   G  L          L  ER +G  +  ++F     F+ G    RR+
Subjt:  NPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWL-----ETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRV

Query:  VMRVFCEDMKL
          +V C+D++L
Subjt:  VMRVFCEDMKL

Arabidopsis top hitse value%identityAlignment
AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.5e-1733.33Show/hide
Query:  PAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAY-YNNPQNYRAGTVS--AGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNIS
        PA GY P P+P YP      PP N Y   A   AY Y N   Y A   +  A  +R +       + ++ L   I ++I+RP++P   ++S SVSNFN+S
Subjt:  PAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAY-YNNPQNYRAGTVS--AGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNIS

Query:  KLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETE--EKLGRERAT-GTVSFNLRFFAW
            SG WD  +  +NPN K++++ E     + Y + +L+ +   PF  D  K  Q  V  T S     G +++    + +G+ER+  G V F+LR  ++
Subjt:  KLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWLETE--EKLGRERAT-GTVSFNLRFFAW

Query:  TTFRTGSWWTRRVVMRVFCEDM
         TFR G++  RR V  V+C+D+
Subjt:  TTFRTGSWWTRRVVMRVFCEDM

AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.8e-0525.59Show/hide
Query:  PPYNAYYAQA--PPA--AYYNNPQNYRAGTVSAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISK----LNYSGNWDASVTVQ
        P   A+Y  +  PPA   YY        G      L   V  +I L+ I+ ++++I W+I+RP    F V   S++ F+ +     L Y  N   +V V+
Subjt:  PPYNAYYAQA--PPA--AYYNNPQNYRAGTVSAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISK----LNYSGNWDASVTVQ

Query:  NPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWL-----ETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRV
        NPN ++ +  +RI++   Y+    +     PF+       Q     T  +P   G  L          L  ER +G  +  ++F     F+ G    RR+
Subjt:  NPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEKSGQMKVKLTSSSPDDPGNWL-----ETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRV

Query:  VMRVFCEDMKL
          +V C+D++L
Subjt:  VMRVFCEDMKL

AT3G52460.1 hydroxyproline-rich glycoprotein family protein2.9e-4039.93Show/hide
Query:  SSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQ-GHGGYSPAMGY-----PPAPHPGYPPATGNYPPYNAY-YAQAPPAAYYNNPQNYRAGT
        S  ++++Q K    P  +     N PPP  PPP    PPPQ     Y P MGY     PP P+P YP A     PY  Y YAQAPPA+YY +    +   
Subjt:  SSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQ-GHGGYSPAMGY-----PPAPHPGYPPATGNYPPYNAY-YAQAPPAAYYNNPQNYRAGT

Query:  V-----SAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNT------
        V     S+GF+RGI   LI+LV ++ +S+ ITW++LRP+IP+F V++FSVSNFN++   +S  W A++T++N N KL    +RIQ  V Y QN       
Subjt:  V-----SAGFLRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNT------

Query:  LAMSYADPFFLDVEKSGQMKVKLTSSSPDDP--GNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYL
        LA ++  P F++ +KS  +   LT+   + P   +W+  E K  +ER TGTV+F+LR   W TF+T  W  R   ++VFC  +K+ F G +   AV L
Subjt:  LAMSYADPFFLDVEKSGQMKVKLTSSSPDDP--GNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYL

AT5G22200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.3e-0825.97Show/hide
Query:  LRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNY-SGNWDASVTVQNPNHKLNVNMERIQSFVDYKQN--TLAMSYADPFFLDV
        +R I  A + L+  +     + W IL P  P F +   ++++FN+S+ N+ S N   +V+ +NPN K+ +  +R+  +V Y+    TLA      +   +
Subjt:  LRGIVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNY-SGNWDASVTVQNPNHKLNVNMERIQSFVDYKQN--TLAMSYADPFFLDV

Query:  EKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDM-----KLVFTGPA
        E +      + S+ P  P         L  +   G V  N++   W  ++ GSW +    + V C        KL  TGPA
Subjt:  EKSGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDM-----KLVFTGPA

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family7.8e-0922.54Show/hide
Query:  IVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNY-SGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFF---LDVEK
        I   ++ L+ +  +  +ITW+  +P+   + V++ SV NFN++  N+ S  +  ++   NPNH+++V    ++ FV +K  TLA    +PF    ++V++
Subjt:  IVAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNY-SGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFF---LDVEK

Query:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGP
          +  +    +     G  L ++  LG+      + F +   A   F+ G W +     ++ C  + +  + P
Subjt:  SGQMKVKLTSSSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTCATCGGAGGATCAACAATCTCAATCCAAAGCCACTGACCCACCTCCTCCGCACCCGTCCTCTGCCGGAAACAACCCTCCTCCTGTCTATCCACCG
CCCACATTGGGGTACCCTCCTCCTCAAGGCCATGGGGGGTACTCTCCGGCAATGGGGTACCCTCCAGCTCCACATCCAGGGTACCCACCGGCTACAGGGAATTAC
CCTCCTTACAATGCGTACTACGCTCAGGCTCCCCCGGCGGCGTATTACAATAACCCCCAAAATTACAGAGCGGGGACGGTAAGCGCGGGATTCCTCCGAGGGATT
GTGGCGGCGTTGATTTTATTGGTGGCTATAATGACTCTGTCCAGCATAATCACATGGATCATCCTCCGCCCTGAAATCCCAGTGTTTAAAGTCGATTCATTCTCC
GTTTCGAATTTCAATATCTCGAAATTGAATTACTCCGGAAATTGGGATGCGAGTGTGACGGTTCAAAATCCGAACCATAAACTGAATGTGAATATGGAGCGGATC
CAGAGCTTCGTGGACTACAAACAAAATACGTTGGCAATGTCCTACGCGGATCCATTTTTTCTAGATGTGGAGAAGAGCGGTCAAATGAAGGTGAAATTGACGTCG
AGTAGTCCCGATGATCCAGGAAATTGGTTGGAAACAGAAGAGAAGTTGGGGCGGGAGAGGGCGACCGGAACGGTGAGTTTCAATTTGAGATTCTTTGCTTGGACA
ACTTTCCGAACTGGTTCTTGGTGGACAAGGCGGGTTGTTATGAGAGTGTTTTGTGAAGATATGAAGCTGGTCTTCACCGGACCCGCCGCCGGTCATGCCGTTTAC
TTGGCCGACGAACACTCCAAGACTTGTTCTGTTCTCGTCTAG
mRNA sequenceShow/hide mRNA sequence
TTCTCCTTTACCATTCTTCTCTCTCTATCTCTCTCTCTCTCCAAATCCTCTCACAGGGAGAGAGAAAAGGCCAGAAACAGAGCATTCTTTCCAATGGCTTCCTCA
TCGGAGGATCAACAATCTCAATCCAAAGCCACTGACCCACCTCCTCCGCACCCGTCCTCTGCCGGAAACAACCCTCCTCCTGTCTATCCACCGCCCACATTGGGG
TACCCTCCTCCTCAAGGCCATGGGGGGTACTCTCCGGCAATGGGGTACCCTCCAGCTCCACATCCAGGGTACCCACCGGCTACAGGGAATTACCCTCCTTACAAT
GCGTACTACGCTCAGGCTCCCCCGGCGGCGTATTACAATAACCCCCAAAATTACAGAGCGGGGACGGTAAGCGCGGGATTCCTCCGAGGGATTGTGGCGGCGTTG
ATTTTATTGGTGGCTATAATGACTCTGTCCAGCATAATCACATGGATCATCCTCCGCCCTGAAATCCCAGTGTTTAAAGTCGATTCATTCTCCGTTTCGAATTTC
AATATCTCGAAATTGAATTACTCCGGAAATTGGGATGCGAGTGTGACGGTTCAAAATCCGAACCATAAACTGAATGTGAATATGGAGCGGATCCAGAGCTTCGTG
GACTACAAACAAAATACGTTGGCAATGTCCTACGCGGATCCATTTTTTCTAGATGTGGAGAAGAGCGGTCAAATGAAGGTGAAATTGACGTCGAGTAGTCCCGAT
GATCCAGGAAATTGGTTGGAAACAGAAGAGAAGTTGGGGCGGGAGAGGGCGACCGGAACGGTGAGTTTCAATTTGAGATTCTTTGCTTGGACAACTTTCCGAACT
GGTTCTTGGTGGACAAGGCGGGTTGTTATGAGAGTGTTTTGTGAAGATATGAAGCTGGTCTTCACCGGACCCGCCGCCGGTCATGCCGTTTACTTGGCCGACGAA
CACTCCAAGACTTGTTCTGTTCTCGTCTAGAAGAATTCTTCGG
Protein sequenceShow/hide protein sequence
MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPQGHGGYSPAMGYPPAPHPGYPPATGNYPPYNAYYAQAPPAAYYNNPQNYRAGTVSAGFLRGI
VAALILLVAIMTLSSIITWIILRPEIPVFKVDSFSVSNFNISKLNYSGNWDASVTVQNPNHKLNVNMERIQSFVDYKQNTLAMSYADPFFLDVEKSGQMKVKLTS
SSPDDPGNWLETEEKLGRERATGTVSFNLRFFAWTTFRTGSWWTRRVVMRVFCEDMKLVFTGPAAGHAVYLADEHSKTCSVLV