CuGenDBv2

Cucsat.G12590 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G12590
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Hydroxyproline-rich glycoprotein family protein
Genome location: ctg1838:557837..559607
RNA-Seq Expression: Cucsat.G12590
Synteny: Cucsat.G12590
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
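
The genome location field can be split into its parts programmatically. The following is a minimal sketch, assuming the `contig:start..end` notation uses 1-based inclusive coordinates (this page does not state the convention); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


seqid, start, end = parse_location("ctg1838:557837..559607")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus spans 1771 bp on ctg1838.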


Homology
GenBank top hits
KAA0043818.1 protein YLS9 [Cucumis melo var. makuwa] (e value: 1.52e-179; % identity: 86.38)
Query:  KATNRASFPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHG-YSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQN
        KA NRA FPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GHG YSPAMGYPP P P YPPA GNYPPYN YYAQAPPAAYYNNPQN
Subjt:  KATNRASFPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHG-YSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQN

Query:  YRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYA
        YRA T+SAGFLRGIV ALILLVA+MTLSSIITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+YK+NTLAMSYA
Subjt:  YRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYA

Query:  DPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSV
        DPFF+DVEKS QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLRFFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSV
Subjt:  DPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSV

Query:  L
        L
Subjt:  L

XP_008442912.1 PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo] (e value: 1.29e-179; % identity: 86.64)
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHG-YSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GHG YSPAMGYPP P P YPPA GNYPPYN YYAQAPPAAYYNNPQNYRA T+SAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHG-YSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG

Query:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK
        FLRGIV ALILLVA+MTLSSIITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+YK+NTLAMSYADPFF+DVEK
Subjt:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK

Query:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL
        S QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLRFFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSVL
Subjt:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL

XP_011652032.1 uncharacterized protein LOC105434983 [Cucumis sativus] (e value: 2.75e-222; % identity: 100)
Query:  LSLSLSLSKSSHREREKATNRASFPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTY
        LSLSLSLSKSSHREREKATNRASFPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTY
Subjt:  LSLSLSLSKSSHREREKATNRASFPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTY

Query:  YAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQ
        YAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQ
Subjt:  YAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQ

Query:  SFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAAT
        SFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAAT
Subjt:  SFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAAT

Query:  HGVYLADAHSKTCSVLF
        HGVYLADAHSKTCSVLF
Subjt:  HGVYLADAHSKTCSVLF

XP_022983003.1 uncharacterized protein LOC111481675 [Cucurbita maxima] (e value: 1.68e-141; % identity: 68.79)
Query:  LSLSLSLSKSSHR-EREKATNRASFPMASSSEDQQ---SQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPP
        LSLSLSLS      EREK   +  F MASSS DQQ   SQSK TDPPPP P SAGNNPPP+YPPPTLGYPP H HGY PAMGYPP P PGYPPAPGNYPP
Subjt:  LSLSLSLSKSSHR-EREKATNRASFPMASSSEDQQ---SQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPP

Query:  YNTY-YAQAPPAAYYN-------NPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENP
        YN Y Y QAPPAAYYN       NPQ YR +T  AGFLRGI  AL+LLV +MT+SSIITWI+LRP+IP FKVDSFSV+NFNISK NYSG W+  +TV+NP
Subjt:  YNTY-YAQAPPAAYYN-------NPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENP

Query:  NHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSW--WTRRIVMKVF
        NHKL ++ ERI+SFV+Y +NT+A S++DPFF+D+EKS QM VK+TSSSPDDPGNW++TEEK+ +E+A+GTVSF LR  AWT FRSGS   WTRR++++VF
Subjt:  NHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSW--WTRRIVMKVF

Query:  CEDLKLAFTGPAATHGVYLADAHSKTCSVL
        CEDLKL FTG   T GVY   AH KTC VL
Subjt:  CEDLKLAFTGPAATHGVYLADAHSKTCSVL

XP_038905898.1 uncharacterized protein LOC120091828 [Benincasa hispida] (e value: 1.54e-169; % identity: 77.85)
Query:  LSLSLSLSKSSHREREKATNRASFPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTY
        LSLSLSLSKS HRERE         MASSS+D QSQSKATDPPP  P SAGNNPPPVYPPPTLGYPPP GH Y PAMGYPP P PGYPPAPGNYPPYN Y
Subjt:  LSLSLSLSKSSHREREKATNRASFPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTY

Query:  YAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQ
        YAQAPPAAYYNN QNYRA+TV+ GFLRGIVTALIL VA+MTLSSI+TWI+LRP+IPVF++DSFSV NFNISK NYSGNW+G++TV+NPNH+L VN+ER+Q
Subjt:  YAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQ

Query:  SFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAAT
        SFV+YK+NTLAMSY DPFF+DVEKS QMRVKLTSSSPDDPG+W ETE+K+GQEKA+GTVSFNLRF AWT FR GSWWTRR+V++VFCEDLKL F GPAA 
Subjt:  SFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAAT

Query:  HGVYLADAHSKTCSVL
          VY  + + K CSVL
Subjt:  HGVYLADAHSKTCSVL

TrEMBL top hits
A0A0A0LGS8 Uncharacterized protein (e value: 1.59e-207; % identity: 100)
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGF
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGF
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAGF

Query:  LRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKS
        LRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKS
Subjt:  LRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKS

Query:  SQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF
        SQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF
Subjt:  SQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF

A0A1S3B6W4 uncharacterized protein LOC103486674 (e value: 6.25e-180; % identity: 86.64)
Query:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHG-YSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG
        MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GHG YSPAMGYPP P P YPPA GNYPPYN YYAQAPPAAYYNNPQNYRA T+SAG
Subjt:  MASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHG-YSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQNYRAQTVSAG

Query:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK
        FLRGIV ALILLVA+MTLSSIITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+YK+NTLAMSYADPFF+DVEK
Subjt:  FLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEK

Query:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL
        S QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLRFFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSVL
Subjt:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVL

A0A5A7TLT1 Protein YLS9 (e value: 7.36e-180; % identity: 86.38)
Query:  KATNRASFPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHG-YSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQN
        KA NRA FPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP GHG YSPAMGYPP P P YPPA GNYPPYN YYAQAPPAAYYNNPQN
Subjt:  KATNRASFPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHG-YSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAYYNNPQN

Query:  YRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYA
        YRA T+SAGFLRGIV ALILLVA+MTLSSIITWI+LRP++PVFKVDSFSVSNFNISKLNYSGNW+ S+TV+NPNHKL VN+ERIQSFV+YK+NTLAMSYA
Subjt:  YRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYA

Query:  DPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSV
        DPFF+DVEKS QM+VKLTSSSPDDPGNWLETEEK+G+E+A+GTVSFNLRFFAWT FR+GSWWTRR+VM+V CED+KL FTGPAA H VYLAD HSKTCSV
Subjt:  DPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSV

Query:  L
        L
Subjt:  L

A0A6J1F415 uncharacterized protein LOC111442188 (e value: 4.49e-141; % identity: 68.79)
Query:  SLSLSLSLSKSSHR-EREKATNRASFPMASSSEDQQ---SQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYP
        SLSLSLSLS      EREK   +  F MASSS DQQ   SQSK TDPPPP P SAGNNPPP+YPPPTLGYPP H HGY PAMGYPP P PGYPPAPGNYP
Subjt:  SLSLSLSLSKSSHR-EREKATNRASFPMASSSEDQQ---SQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYP

Query:  PYNTY-YAQAPPAAYYN------NPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENP
        PYN Y Y QAPPAAYYN      NPQ YR +T  AGFLRGI  AL+LLV +MT+SSIITWI+LRP+IP FKVDSFSV+NFNISK NYSG W+  +TV+NP
Subjt:  PYNTY-YAQAPPAAYYN------NPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENP

Query:  NHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSW--WTRRIVMKVF
        NHKL ++ ERI+SFV+Y +NT+A S++DPFF+D+EKS QM+VK+TSSSPDDPGNW +TEEK+ +E+ +GTVSF LR  AWT FRSGS   WTRR++++VF
Subjt:  NHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSW--WTRRIVMKVF

Query:  CEDLKLAFTGPAATHGVYLADAHSKTCSVL
        CEDLKL FTG   T GVY   A SKTC VL
Subjt:  CEDLKLAFTGPAATHGVYLADAHSKTCSVL

A0A6J1J6I9 uncharacterized protein LOC111481675 (e value: 8.13e-142; % identity: 68.79)
Query:  LSLSLSLSKSSHR-EREKATNRASFPMASSSEDQQ---SQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPP
        LSLSLSLS      EREK   +  F MASSS DQQ   SQSK TDPPPP P SAGNNPPP+YPPPTLGYPP H HGY PAMGYPP P PGYPPAPGNYPP
Subjt:  LSLSLSLSKSSHR-EREKATNRASFPMASSSEDQQ---SQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPP

Query:  YNTY-YAQAPPAAYYN-------NPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENP
        YN Y Y QAPPAAYYN       NPQ YR +T  AGFLRGI  AL+LLV +MT+SSIITWI+LRP+IP FKVDSFSV+NFNISK NYSG W+  +TV+NP
Subjt:  YNTY-YAQAPPAAYYN-------NPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENP

Query:  NHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSW--WTRRIVMKVF
        NHKL ++ ERI+SFV+Y +NT+A S++DPFF+D+EKS QM VK+TSSSPDDPGNW++TEEK+ +E+A+GTVSF LR  AWT FRSGS   WTRR++++VF
Subjt:  NHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSW--WTRRIVMKVF

Query:  CEDLKLAFTGPAATHGVYLADAHSKTCSVL
        CEDLKL FTG   T GVY   AH KTC VL
Subjt:  CEDLKLAFTGPAATHGVYLADAHSKTCSVL

SwissProt top hits
Q9SJ52 NDR1/HIN1-like protein 10 (e value: 7.2e-05; % identity: 24)
Query:  YYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLT--VENPNHKLTVNIE
        Y    PP A     +    +      L   V  +I L+ ++ ++++I W+++RP+   F V   S++ F+ +  +    +N +LT  V NPN ++ +  +
Subjt:  YYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLT--VENPNHKLTVNIE

Query:  RIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWL-----ETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKL
        RI++   Y+    +     PF+       Q     T  +P   G  L          +  E+ SG  +  ++F     F+ G    RRI  KV C+DL+L
Subjt:  RIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWL-----ETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKL

Arabidopsis top hits
AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 6.7e-14; % identity: 32.49)
Query:  PTLGYPPPHGHGYSPAMGYPPPPPPGYP-PAPGNYPPY---NTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIP
        P  GYP P+ +   P      PP  GYP PA G   PY   N YYA  P         N RA  +   F+  + T  +LL+ ++     I ++++RPQ+P
Subjt:  PTLGYPPPHGHGYSPAMGYPPPPPPGYP-PAPGNYPPY---NTYYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIP

Query:  VFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETE--EKVGQEK
           ++S SVSNFN+S    SG W+  L   NPN K++++ E     + Y   +L+ +   PF  D  K  Q  V  T S     G +++    + +G+E+
Subjt:  VFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWLETE--EKVGQEK

Query:  A-SGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDL
        +  G V F+LR  ++  FR G++  RR V  V+C+D+
Subjt:  A-SGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDL

AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 5.1e-06; % identity: 24)
Query:  YYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLT--VENPNHKLTVNIE
        Y    PP A     +    +      L   V  +I L+ ++ ++++I W+++RP+   F V   S++ F+ +  +    +N +LT  V NPN ++ +  +
Subjt:  YYAQAPPAAYYNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLT--VENPNHKLTVNIE

Query:  RIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWL-----ETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKL
        RI++   Y+    +     PF+       Q     T  +P   G  L          +  E+ SG  +  ++F     F+ G    RRI  KV C+DL+L
Subjt:  RIQSFVNYKENTLAMSYADPFFIDVEKSSQMRVKLTSSSPDDPGNWL-----ETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKL

AT3G52460.1 hydroxyproline-rich glycoprotein family protein (e value: 4.6e-39; % identity: 40.27)
Query:  SSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP--HGHGYSPAMGYP--PPPPPGYPPAPGNYP--PYNTY-YAQAPPAAYYNNPQNYRAQ--
        S  ++++Q K    P  +     N PPP  PPP    PPP      Y P MGYP    PPP YP    NYP  PY  Y YAQAPPA+YY +  +Y AQ  
Subjt:  SSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPP--HGHGYSPAMGYP--PPPPPGYPPAPGNYP--PYNTY-YAQAPPAAYYNNPQNYRAQ--

Query:  -----TVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNY-----KENT
               S+GF+RGI T LI+LV ++ +S+ ITW+VLRPQIP+F V++FSVSNFN++   +S  W  +LT+EN N KL    +RIQ  V +     ++  
Subjt:  -----TVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNY-----KENT

Query:  LAMSYADPFFIDVEKSSQMRVKLTSSSPDDP--GNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYL
        LA ++  P F++ +KS  +   LT+   + P   +W+  E K  +E+ +GTV+F+LR   W  F++  W  R   +KVFC  LK+ F G +    V L
Subjt:  LAMSYADPFFIDVEKSSQMRVKLTSSSPDDP--GNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYL

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 4.9e-09; % identity: 22.54)
Query:  IVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNY-SGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFF---IDVEK
        I   ++ L+ +  +  +ITW+  +P+   + V++ SV NFN++  N+ S  +  ++   NPNH+++V    ++ FV +K+ TLA    +PF    ++V++
Subjt:  IVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNY-SGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFF---IDVEK

Query:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGP
          +  +    +     G  L ++  +G+      + F +   A   F+ G W +     K+ C  + ++ + P
Subjt:  SSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGP


Sequences
CDS sequence
TCTCTCTCTCTCTCTCTCTCTCTCTCCAAATCCTCTCACAGGGAGAGAGAAAAAGCCACAAACAGAGCATCCTTTCCAATGGCTTCCTCATCGGAGGATCAACAATCTCA
ATCCAAAGCCACTGACCCACCTCCTCCGCACCCCTCCTCTGCTGGAAACAACCCTCCTCCTGTCTATCCACCGCCCACATTGGGGTACCCTCCTCCTCACGGCCATGGGT
ACTCTCCGGCGATGGGGTACCCTCCACCTCCACCTCCAGGGTACCCACCGGCTCCGGGGAATTACCCTCCTTACAATACGTACTACGCTCAGGCTCCCCCGGCGGCGTAT
TACAATAACCCTCAAAACTACAGAGCCCAGACCGTAAGCGCGGGATTCCTCCGAGGGATTGTGACGGCGTTGATTTTATTGGTGGCTGTAATGACTCTGTCCAGCATAAT
CACATGGATCGTCCTCCGCCCTCAAATCCCAGTGTTTAAAGTCGATTCATTCTCCGTTTCGAATTTCAATATCTCGAAATTGAATTACTCCGGAAATTGGAATGGGAGTC
TGACGGTTGAAAATCCGAACCATAAACTGACTGTGAATATAGAGCGCATCCAGAGCTTCGTGAACTACAAAGAAAATACGTTGGCAATGTCTTACGCGGACCCATTTTTT
ATAGATGTGGAGAAGAGCAGTCAAATGAGGGTGAAATTGACGTCGAGTAGTCCCGATGATCCGGGAAATTGGTTAGAAACAGAGGAGAAGGTGGGGCAGGAGAAGGCGAG
TGGAACGGTGAGTTTCAATTTGAGATTCTTTGCTTGGACGGCTTTCCGATCCGGTTCTTGGTGGACAAGGCGGATTGTCATGAAAGTGTTTTGTGAAGATTTGAAGCTGG
CCTTCACCGGACCCGCCGCCACTCATGGCGTTTACTTGGCCGACGCACACTCCAAGACTTGTTCTGTTCTCTTCTAG
mRNA sequence
TCTCTCTCTCTCTCTCTCTCTCTCTCCAAATCCTCTCACAGGGAGAGAGAAAAAGCCACAAACAGAGCATCCTTTCCAATGGCTTCCTCATCGGAGGATCAACAATCTCA
ATCCAAAGCCACTGACCCACCTCCTCCGCACCCCTCCTCTGCTGGAAACAACCCTCCTCCTGTCTATCCACCGCCCACATTGGGGTACCCTCCTCCTCACGGCCATGGGT
ACTCTCCGGCGATGGGGTACCCTCCACCTCCACCTCCAGGGTACCCACCGGCTCCGGGGAATTACCCTCCTTACAATACGTACTACGCTCAGGCTCCCCCGGCGGCGTAT
TACAATAACCCTCAAAACTACAGAGCCCAGACCGTAAGCGCGGGATTCCTCCGAGGGATTGTGACGGCGTTGATTTTATTGGTGGCTGTAATGACTCTGTCCAGCATAAT
CACATGGATCGTCCTCCGCCCTCAAATCCCAGTGTTTAAAGTCGATTCATTCTCCGTTTCGAATTTCAATATCTCGAAATTGAATTACTCCGGAAATTGGAATGGGAGTC
TGACGGTTGAAAATCCGAACCATAAACTGACTGTGAATATAGAGCGCATCCAGAGCTTCGTGAACTACAAAGAAAATACGTTGGCAATGTCTTACGCGGACCCATTTTTT
ATAGATGTGGAGAAGAGCAGTCAAATGAGGGTGAAATTGACGTCGAGTAGTCCCGATGATCCGGGAAATTGGTTAGAAACAGAGGAGAAGGTGGGGCAGGAGAAGGCGAG
TGGAACGGTGAGTTTCAATTTGAGATTCTTTGCTTGGACGGCTTTCCGATCCGGTTCTTGGTGGACAAGGCGGATTGTCATGAAAGTGTTTTGTGAAGATTTGAAGCTGG
CCTTCACCGGACCCGCCGCCACTCATGGCGTTTACTTGGCCGACGCACACTCCAAGACTTGTTCTGTTCTCTTCTAG
Protein sequence
SLSLSLSLSKSSHREREKATNRASFPMASSSEDQQSQSKATDPPPPHPSSAGNNPPPVYPPPTLGYPPPHGHGYSPAMGYPPPPPPGYPPAPGNYPPYNTYYAQAPPAAY
YNNPQNYRAQTVSAGFLRGIVTALILLVAVMTLSSIITWIVLRPQIPVFKVDSFSVSNFNISKLNYSGNWNGSLTVENPNHKLTVNIERIQSFVNYKENTLAMSYADPFF
IDVEKSSQMRVKLTSSSPDDPGNWLETEEKVGQEKASGTVSFNLRFFAWTAFRSGSWWTRRIVMKVFCEDLKLAFTGPAATHGVYLADAHSKTCSVLF
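
The CDS and protein sequences above can be cross-checked by translating codons in the listed reading frame. The sketch below assumes the standard nuclear genetic code and builds its own codon table instead of using a library. `translate` is a hypothetical helper, and the two fragments are short excerpts copied from the CDS above (the region around the first ATG and the final 18 nucleotides), not the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 0; trailing partial codons are ignored."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# Excerpt starting at the first ATG in the CDS listed above.
start_fragment = "ATGGCTTCCTCATCGGAGGATCAACAATCT"
# Last 18 nucleotides of the CDS listed above, including the stop codon.
end_fragment = "TGTTCTGTTCTCTTCTAG"

print(translate(start_fragment))  # MASSSEDQQS
print(translate(end_fragment))    # CSVLF*
```

Both translated fragments match the corresponding stretches of the protein sequence (`MASSSEDQQS` near the start and `CSVLF` at the end, with `*` marking the TAG stop codon).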