; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G35560 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G35560
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationChr3:30999758..31001581
RNA-Seq ExpressionCSPI03G35560
SyntenyCSPI03G35560
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043818.1 protein YLS9 [Cucumis melo var. makuwa]1.5e-13986.64Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GH GYSPAMGYPP P P YPPA GNYPPYN YYAQAPPAAYYNNPQNYRA T+SAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG

Query:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK
        FLRGIV ALILLVA+MTLSSIITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+YK+NTLAMSYADPFF+DVEK
Subjt:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK

Query:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL
        S QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLRFFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSVL
Subjt:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL

XP_008442912.1 PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo]1.5e-13986.64Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GH GYSPAMGYPP P P YPPA GNYPPYN YYAQAPPAAYYNNPQNYRA T+SAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG

Query:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK
        FLRGIV ALILLVA+MTLSSIITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+YK+NTLAMSYADPFF+DVEK
Subjt:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK

Query:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL
        S QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLRFFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSVL
Subjt:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL

XP_011652032.1 uncharacterized protein LOC105434983 [Cucumis sativus]1.6e-160100Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGF
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGF
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGF

Query:  LRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKS
        LRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKS
Subjt:  LRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKS

Query:  SQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF
        SQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF
Subjt:  SQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF

XP_022983003.1 uncharacterized protein LOC111481675 [Cucurbita maxima]1.8e-10870.39Show/hide
Query:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTY-YAQAPPAAYY-------NNPQ
        MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PPH HGY PAMGYPP P PGYPPAPGNYPPYN Y Y QAPPAAYY       NNPQ
Subjt:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTY-YAQAPPAAYY-------NNPQ

Query:  NYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSY
         YR +T  AGFLRGI  AL+LLV +MT+SSIITWI+LRP+IP FKVDSFSV+NFNISK NYSG W+  +TV+NPNHKL ++ ERI+SFV+Y +NT+A S+
Subjt:  NYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSY

Query:  ADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFR--SGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKT
        +DPFF+D+EKS QM VK+TSSSPDDPGNW++TEEK+ +E+A+GTVSF LR  AWT FR  SGS WTRR++++VFCEDLKL FTG   T GVY   AH KT
Subjt:  ADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFR--SGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKT

Query:  CSVL
        C VL
Subjt:  CSVL

XP_038905898.1 uncharacterized protein LOC120091828 [Benincasa hispida]5.0e-12779.38Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGF
        MASSS+D QSQSKATDPPP  P SAGNNPPPVYPPPTLGYPPP GH Y PAMGYPP P PGYPPAPGNYPPYN YYAQAPPAAYYNN QNYRA+TV+ GF
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGF

Query:  LRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKS
        LRGIVTALIL VA+MTLSSI+TWI+LRP+IPVF++DSFSV NFNISK NYSGNW+G++TV+NPNH+L VN+ER+QSFV+YK+NTLAMSY DPFF+DVEKS
Subjt:  LRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKS

Query:  SQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL
         QMRVKLTSSSPDDPG+W ETE+K+GQEKA+GTVSFNLRF AWT FR GSWWTRR+V++VFCEDLKL F GPAA   VY  + + K CSVL
Subjt:  SQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL

TrEMBL top hitse value%identityAlignment
A0A0A0LGS8 Uncharacterized protein7.5e-161100Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGF
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGF
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGF

Query:  LRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKS
        LRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKS
Subjt:  LRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKS

Query:  SQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF
        SQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF
Subjt:  SQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF

A0A1S3B6W4 uncharacterized protein LOC1034866747.3e-14086.64Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GH GYSPAMGYPP P P YPPA GNYPPYN YYAQAPPAAYYNNPQNYRA T+SAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG

Query:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK
        FLRGIV ALILLVA+MTLSSIITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+YK+NTLAMSYADPFF+DVEK
Subjt:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK

Query:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL
        S QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLRFFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSVL
Subjt:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL

A0A5A7TLT1 Protein YLS97.3e-14086.64Show/hide
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GH GYSPAMGYPP P P YPPA GNYPPYN YYAQAPPAAYYNNPQNYRA T+SAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGH-GYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG

Query:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK
        FLRGIV ALILLVA+MTLSSIITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+YK+NTLAMSYADPFF+DVEK
Subjt:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK

Query:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL
        S QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLRFFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSVL
Subjt:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL

A0A6J1F415 uncharacterized protein LOC1114421885.7e-10870.3Show/hide
Query:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTY-YAQAPPAAYY------NNPQN
        MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PPH HGY PAMGYPP P PGYPPAPGNYPPYN Y Y QAPPAAYY      NNPQ 
Subjt:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTY-YAQAPPAAYY------NNPQN

Query:  YRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYA
        YR +T  AGFLRGI  AL+LLV +MT+SSIITWI+LRP+IP FKVDSFSV+NFNISK NYSG W+  +TV+NPNHKL ++ ERI+SFV+Y +NT+A S++
Subjt:  YRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYA

Query:  DPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFR--SGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTC
        DPFF+D+EKS QM+VK+TSSSPDDPGNW +TEEK+ +E+ +GTVSF LR  AWT FR  SGS WTRR++++VFCEDLKL FTG   T GVY   A SKTC
Subjt:  DPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFR--SGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTC

Query:  SVL
         VL
Subjt:  SVL

A0A6J1J6I9 uncharacterized protein LOC1114816758.7e-10970.39Show/hide
Query:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTY-YAQAPPAAYY-------NNPQ
        MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PPH HGY PAMGYPP P PGYPPAPGNYPPYN Y Y QAPPAAYY       NNPQ
Subjt:  MASSSEDQ---QSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTY-YAQAPPAAYY-------NNPQ

Query:  NYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSY
         YR +T  AGFLRGI  AL+LLV +MT+SSIITWI+LRP+IP FKVDSFSV+NFNISK NYSG W+  +TV+NPNHKL ++ ERI+SFV+Y +NT+A S+
Subjt:  NYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSY

Query:  ADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFR--SGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKT
        +DPFF+D+EKS QM VK+TSSSPDDPGNW++TEEK+ +E+A+GTVSF LR  AWT FR  SGS WTRR++++VFCEDLKL FTG   T GVY   AH KT
Subjt:  ADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFR--SGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKT

Query:  CSVL
        C VL
Subjt:  CSVL

SwissProt top hitse value%identityAlignment
Q9SJ52 NDR1/HIN1-like protein 106.6e-0524Show/hide
Query:  YYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLT--VENPNHKLTVNIE
        Y    PP A     +    +      L   V  +I L+ ++ ++++I W+++RP+   F V   S++ F+ +  +    +N +LT  V NPN ++ +  +
Subjt:  YYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLT--VENPNHKLTVNIE

Query:  RIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWL-----ETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKL
        RI++   Y+    +     PF+       Q     T  +P   G  L          +  E+ SG  +  ++F     F+ G    RRI  KV C+DL+L
Subjt:  RIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWL-----ETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKL

Arabidopsis top hitse value%identityAlignment
AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family6.1e-1432.49Show/hide
Query:  PTLGYPPPHGHGYSPAMGYPPPPPPGYP-PAPGNYPPY---NTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIP
        P  GYP P+ +   P      PP  GYP PA G   PY   N YYA  P         N RA  +   F+  + T  +LL+ ++     I ++++RPQ+P
Subjt:  PTLGYPPPHGHGYSPAMGYPPPPPPGYP-PAPGNYPPY---NTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIP

Query:  VFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETE--EKVGQEK
           ++S SVSNFN+S    SG W+  L   NPN K++++ E     + Y   +L+ +   PF  D  K  Q  V  T S     G +++    + +G+E+
Subjt:  VFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETE--EKVGQEK

Query:  A-SGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDL
        +  G V F+LR  ++  FR G++  RR V  V+C+D+
Subjt:  A-SGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDL

AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.7e-0624Show/hide
Query:  YYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLT--VENPNHKLTVNIE
        Y    PP A     +    +      L   V  +I L+ ++ ++++I W+++RP+   F V   S++ F+ +  +    +N +LT  V NPN ++ +  +
Subjt:  YYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLT--VENPNHKLTVNIE

Query:  RIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWL-----ETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKL
        RI++   Y+    +     PF+       Q     T  +P   G  L          +  E+ SG  +  ++F     F+ G    RRI  KV C+DL+L
Subjt:  RIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWL-----ETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKL

AT3G52460.1 hydroxyproline-rich glycoprotein family protein4.2e-3940.27Show/hide
Query:  SSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP--HGHGYSPAMGYP--PPPPPGYPPAPGNYP--PYNTY-YAQAPPAAYYNNPQNYRAQ--
        S  ++++Q K    P  +     N PPP  PPP    PPP      Y P MGYP    PPP YP    NYP  PY  Y YAQAPPA+YY +  +Y AQ  
Subjt:  SSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP--HGHGYSPAMGYP--PPPPPGYPPAPGNYP--PYNTY-YAQAPPAAYYNNPQNYRAQ--

Query:  -----TVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNY-----KENT
               S+GF+RGI T LI+LV ++ +S+ ITW+VLRPQIP+F V++FSVSNFN++   +S  W  +LT+EN N KL    +RIQ  V +     ++  
Subjt:  -----TVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNY-----KENT

Query:  LAMSYADPFFIDVEKSSQMRVKLTSSSPDDP--GNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYL
        LA ++  P F++ +KS  +   LT+   + P   +W+  E K  +E+ +GTV+F+LR   W  F++  W  R   +KVFC  LK+ F G +    V L
Subjt:  LAMSYADPFFIDVEKSSQMRVKLTSSSPDDP--GNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYL

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.6e-0922.54Show/hide
Query:  IVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNY-SGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFF---IDVEK
        I   ++ L+ +  +  +ITW+  +P+   + V++ SV NFN++  N+ S  +  ++   NPNH+++V    ++ FV +K+ TLA    +PF    ++V++
Subjt:  IVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNY-SGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFF---IDVEK

Query:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGP
          +  +    +     G  L ++  +G+      + F +   A   F+ G W +     K+ C  + ++ + P
Subjt:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTCATCGGAGGATCAACAATCTCAATCCAAAGCCACTGACCCACCTCCTCCGCACCCCTCCTCTGCTGGAAACAACCCTCCTCCTGTCTATCCACCGCCCAC
ATTGGGGTACCCTCCTCCTCACGGCCATGGGTACTCTCCGGCGATGGGGTACCCTCCACCTCCACCTCCAGGGTACCCACCGGCTCCGGGGAATTACCCTCCTTACAATA
CGTACTACGCTCAGGCTCCCCCGGCGGCGTATTACAATAACCCTCAAAACTACAGAGCCCAGACCGTAAGCGCGGGATTCCTCCGAGGGATTGTGACGGCGTTGATTTTA
TTGGTGGCTGTAATGACTCTGTCCAGCATAATCACATGGATCGTCCTCCGCCCTCAAATCCCAGTGTTTAAAGTCGATTCATTCTCCGTTTCGAATTTCAATATCTCGAA
ATTGAATTACTCCGGAAATTGGAATGGGAGTCTGACGGTTGAAAATCCGAACCATAAACTGACTGTGAATATAGAGCGCATCCAGAGCTTCGTGAACTACAAAGAAAATA
CGTTGGCAATGTCTTACGCGGACCCATTTTTTATAGATGTGGAGAAGAGCAGTCAAATGAGGGTGAAATTGACGTCGAGTAGTCCCGATGATCCGGGAAATTGGTTAGAA
ACAGAGGAGAAGGTGGGGCAGGAGAAGGCGAGTGGAACGGTGAGTTTCAATTTGAGATTCTTTGCTTGGACGGCTTTCCGATCCGGTTCTTGGTGGACAAGGCGGATTGT
CATGAAAGTGTTTTGTGAAGATTTGAAGCTGGCCTTCACCGGACCCGCCGCCACTCATGGCGTTTACTTGGCCGACGCACACTCCAAGACTTGTTCTGTTCTCTTCTAG
mRNA sequenceShow/hide mRNA sequence
TAACCTTCCGTTTTTTTTCTAAGAAAAGCCTTAACGGATCTCCATATTCTAAATCCTCCTCCATTTCTGATTTCTCCTTTACCATTCTTCTCTCTCTCTCTCTCTCTCTC
TCTCTCCAAATCCTCTCACAGGGAGAGAGAAAAAGCCACAAACAGAGCATCCTTTCCAATGGCTTCCTCATCGGAGGATCAACAATCTCAATCCAAAGCCACTGACCCAC
CTCCTCCGCACCCCTCCTCTGCTGGAAACAACCCTCCTCCTGTCTATCCACCGCCCACATTGGGGTACCCTCCTCCTCACGGCCATGGGTACTCTCCGGCGATGGGGTAC
CCTCCACCTCCACCTCCAGGGTACCCACCGGCTCCGGGGAATTACCCTCCTTACAATACGTACTACGCTCAGGCTCCCCCGGCGGCGTATTACAATAACCCTCAAAACTA
CAGAGCCCAGACCGTAAGCGCGGGATTCCTCCGAGGGATTGTGACGGCGTTGATTTTATTGGTGGCTGTAATGACTCTGTCCAGCATAATCACATGGATCGTCCTCCGCC
CTCAAATCCCAGTGTTTAAAGTCGATTCATTCTCCGTTTCGAATTTCAATATCTCGAAATTGAATTACTCCGGAAATTGGAATGGGAGTCTGACGGTTGAAAATCCGAAC
CATAAACTGACTGTGAATATAGAGCGCATCCAGAGCTTCGTGAACTACAAAGAAAATACGTTGGCAATGTCTTACGCGGACCCATTTTTTATAGATGTGGAGAAGAGCAG
TCAAATGAGGGTGAAATTGACGTCGAGTAGTCCCGATGATCCGGGAAATTGGTTAGAAACAGAGGAGAAGGTGGGGCAGGAGAAGGCGAGTGGAACGGTGAGTTTCAATT
TGAGATTCTTTGCTTGGACGGCTTTCCGATCCGGTTCTTGGTGGACAAGGCGGATTGTCATGAAAGTGTTTTGTGAAGATTTGAAGCTGGCCTTCACCGGACCCGCCGCC
ACTCATGGCGTTTACTTGGCCGACGCACACTCCAAGACTTGTTCTGTTCTCTTCTAGAAGAATTCTTCGGAAAGTAGGCAAATTGTGTGTGGGGGCTTGAAGGAAAAGGG
GTATAGCAGAGAGATTTGCTTTCATGTTTGGAGATTATGAAACATTATATTCAACTTTAGGGATCTTTTTTTTTTTTTTT
Protein sequenceShow/hide protein sequence
MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALIL
LVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLE
TEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF