; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033218 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033218
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationscaffold5:5182074..5182943
RNA-Seq ExpressionSpg033218
SyntenySpg033218
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043818.1 protein YLS9 [Cucumis melo var. makuwa]1.0e-11173.47Show/hide
Query:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGH-GYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRA
        +SSS+D+QS SK  DPPP  P  AG NPPPVYPP  +GY  P GH GY PAMGYPPAPHP YPPA G YPPYNAY YAQAPPAAYY NNPQ +R   + A
Subjt:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGH-GYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRA

Query:  GFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVE
        GF+RGIV+ALILLV +MTLSSIITWI+LRPE+P F+VDS +V+NFNISK NYSGNW+++VTVQNPN KLN+N ERIQSFVD+K NTLAMS+ +PFFLDVE
Subjt:  GFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVE

Query:  KSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVLM
        KSGQM+VKLTSSSPDDPGNW +TEEK+GRERATG VSFNLRFF WTTFR+G+WWTRRV+MRV CED+KL F GPAA +AV   D   K C+VL+
Subjt:  KSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVLM

XP_008442912.1 PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo]1.0e-11173.47Show/hide
Query:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGH-GYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRA
        +SSS+D+QS SK  DPPP  P  AG NPPPVYPP  +GY  P GH GY PAMGYPPAPHP YPPA G YPPYNAY YAQAPPAAYY NNPQ +R   + A
Subjt:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGH-GYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRA

Query:  GFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVE
        GF+RGIV+ALILLV +MTLSSIITWI+LRPE+P F+VDS +V+NFNISK NYSGNW+++VTVQNPN KLN+N ERIQSFVD+K NTLAMS+ +PFFLDVE
Subjt:  GFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVE

Query:  KSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVLM
        KSGQM+VKLTSSSPDDPGNW +TEEK+GRERATG VSFNLRFF WTTFR+G+WWTRRV+MRV CED+KL F GPAA +AV   D   K C+VL+
Subjt:  KSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVLM

XP_011652032.1 uncharacterized protein LOC105434983 [Cucumis sativus]1.9e-11071.92Show/hide
Query:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRAG
        +SSS+D+QS SK  DPPP  P  AG NPPPVYPP  +GY  PHGHGY PAMGYPP P PGYPPAPG YPPYN Y YAQAPPAAYY NNPQ +R + V AG
Subjt:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRAG

Query:  FIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVEK
        F+RGIV+ALILLV +MTLSSIITWIVLRP+IP F+VDS +V+NFNISK NYSGNW  ++TV+NPN KL +N ERIQSFV++K+NTLAMS+ +PFF+DVEK
Subjt:  FIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVEK

Query:  SGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVL
        S QMRVKLTSSSPDDPGNW +TEEK+G+E+A+G VSFNLRFF WT FRSG+WWTRR++M+VFCEDLKLAF GPAA + V   D   K C+VL
Subjt:  SGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVL

XP_022983003.1 uncharacterized protein LOC111481675 [Cucurbita maxima]5.4e-10568.75Show/hide
Query:  MASSSSDDR----QSISKDPPP--PPVAGTNPPPVYPPQAMGY-PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYY------TNNPQP
        MASSS D +    QS   DPPP  PP AG NPPP+YPP  +GY PH HGYPPAMGYPPAPHPGYPPAPG YPPYNAYAY QAPPAAYY       NNPQ 
Subjt:  MASSSSDDR----QSISKDPPP--PPVAGTNPPPVYPPQAMGY-PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYY------TNNPQP

Query:  FRPEPVRAGFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFV
        +R E   AGF+RGI +AL+LLVV+MT+SSIITWI+LRPEIP F+VDS +VANFNISK+NYSG W+  VTVQNPN KLNL+FERI+SFVD+ DNT+A SF 
Subjt:  FRPEPVRAGFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFV

Query:  EPFFLDVEKSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFR--SGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNC
        +PFFLD+EKS QM VK+TSSSPDDPGNW QTEEK+ RERATG VSF LR   WTTFR  SG+ WTRRV++RVFCEDLKL F G    + V +    PK C
Subjt:  EPFFLDVEKSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFR--SGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNC

Query:  AVLM
         VL+
Subjt:  AVLM

XP_038905898.1 uncharacterized protein LOC120091828 [Benincasa hispida]5.9e-11273.04Show/hide
Query:  SSSSDDRQSISK--DPP--PPPVAGTNPPPVYPPQAMGY--PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRAG
        +SSSDD QS SK  DPP  PPP AG NPPPVYPP  +GY  P GH YPPAMGYPPAPHPGYPPAPG YPPYN Y YAQAPPAAYY NN Q +R E V  G
Subjt:  SSSSDDRQSISK--DPP--PPPVAGTNPPPVYPPQAMGY--PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRAG

Query:  FIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVEK
        F+RGIV+ALIL V +MTLSSI+TWI+LRPEIP FR+DS +V NFNISK+NYSGNW+ N+TVQNPN +LN+N ER+QSFVD+KDNTLAMS+ +PFFLDVEK
Subjt:  FIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVEK

Query:  SGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVLM
        S QMRVKLTSSSPDDPG+WA+TE+K+G+E+ATG VSFNLRF  WTTFR G+WWTRRV++RVFCEDLKL FAGPAA   V + +  PK C+VL+
Subjt:  SGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVLM

TrEMBL top hitse value%identityAlignment
A0A0A0LGS8 Uncharacterized protein9.2e-11171.92Show/hide
Query:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRAG
        +SSS+D+QS SK  DPPP  P  AG NPPPVYPP  +GY  PHGHGY PAMGYPP P PGYPPAPG YPPYN Y YAQAPPAAYY NNPQ +R + V AG
Subjt:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRAG

Query:  FIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVEK
        F+RGIV+ALILLV +MTLSSIITWIVLRP+IP F+VDS +V+NFNISK NYSGNW  ++TV+NPN KL +N ERIQSFV++K+NTLAMS+ +PFF+DVEK
Subjt:  FIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVEK

Query:  SGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVL
        S QMRVKLTSSSPDDPGNW +TEEK+G+E+A+G VSFNLRFF WT FRSG+WWTRR++M+VFCEDLKLAF GPAA + V   D   K C+VL
Subjt:  SGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVL

A0A1S3B6W4 uncharacterized protein LOC1034866744.9e-11273.47Show/hide
Query:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGH-GYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRA
        +SSS+D+QS SK  DPPP  P  AG NPPPVYPP  +GY  P GH GY PAMGYPPAPHP YPPA G YPPYNAY YAQAPPAAYY NNPQ +R   + A
Subjt:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGH-GYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRA

Query:  GFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVE
        GF+RGIV+ALILLV +MTLSSIITWI+LRPE+P F+VDS +V+NFNISK NYSGNW+++VTVQNPN KLN+N ERIQSFVD+K NTLAMS+ +PFFLDVE
Subjt:  GFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVE

Query:  KSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVLM
        KSGQM+VKLTSSSPDDPGNW +TEEK+GRERATG VSFNLRFF WTTFR+G+WWTRRV+MRV CED+KL F GPAA +AV   D   K C+VL+
Subjt:  KSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVLM

A0A5A7TLT1 Protein YLS94.9e-11273.47Show/hide
Query:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGH-GYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRA
        +SSS+D+QS SK  DPPP  P  AG NPPPVYPP  +GY  P GH GY PAMGYPPAPHP YPPA G YPPYNAY YAQAPPAAYY NNPQ +R   + A
Subjt:  SSSSDDRQSISK--DPPP--PPVAGTNPPPVYPPQAMGY--PHGH-GYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRA

Query:  GFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVE
        GF+RGIV+ALILLV +MTLSSIITWI+LRPE+P F+VDS +V+NFNISK NYSGNW+++VTVQNPN KLN+N ERIQSFVD+K NTLAMS+ +PFFLDVE
Subjt:  GFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVE

Query:  KSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVLM
        KSGQM+VKLTSSSPDDPGNW +TEEK+GRERATG VSFNLRFF WTTFR+G+WWTRRV+MRV CED+KL F GPAA +AV   D   K C+VL+
Subjt:  KSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVLM

A0A6J1F415 uncharacterized protein LOC1114421883.7e-10468.32Show/hide
Query:  MASSSSDDR----QSISKDPPP--PPVAGTNPPPVYPPQAMGY-PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYY-----TNNPQPF
        MASSS D +    QS   DPPP  PP AG NPPP+YPP  +GY PH HGYPPAMGYPPAPHPGYPPAPG YPPYNAYAY QAPPAAYY      NNPQ +
Subjt:  MASSSSDDR----QSISKDPPP--PPVAGTNPPPVYPPQAMGY-PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYY-----TNNPQPF

Query:  RPEPVRAGFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVE
        R E   AGF+RGI +AL+LLVV+MT+SSIITWI+LRPEIP F+VDS +V NFNISK+NYSG W+  VTVQNPN KLNL+FERI+SFVD+ DNT+A SF +
Subjt:  RPEPVRAGFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVE

Query:  PFFLDVEKSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFR--SGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCA
        PFFLD+EKS QM+VK+TSSSPDDPGNWAQTEEK+ RER TG VSF LR   WTTFR  SG+ WTRRV++RVFCEDLKL F G    + V +     K C 
Subjt:  PFFLDVEKSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFR--SGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCA

Query:  VLM
        VL+
Subjt:  VLM

A0A6J1J6I9 uncharacterized protein LOC1114816752.6e-10568.75Show/hide
Query:  MASSSSDDR----QSISKDPPP--PPVAGTNPPPVYPPQAMGY-PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYY------TNNPQP
        MASSS D +    QS   DPPP  PP AG NPPP+YPP  +GY PH HGYPPAMGYPPAPHPGYPPAPG YPPYNAYAY QAPPAAYY       NNPQ 
Subjt:  MASSSSDDR----QSISKDPPP--PPVAGTNPPPVYPPQAMGY-PHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYY------TNNPQP

Query:  FRPEPVRAGFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFV
        +R E   AGF+RGI +AL+LLVV+MT+SSIITWI+LRPEIP F+VDS +VANFNISK+NYSG W+  VTVQNPN KLNL+FERI+SFVD+ DNT+A SF 
Subjt:  FRPEPVRAGFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFV

Query:  EPFFLDVEKSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFR--SGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNC
        +PFFLD+EKS QM VK+TSSSPDDPGNW QTEEK+ RERATG VSF LR   WTTFR  SG+ WTRRV++RVFCEDLKL F G    + V +    PK C
Subjt:  EPFFLDVEKSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFR--SGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNC

Query:  AVLM
         VL+
Subjt:  AVLM

SwissProt top hitse value%identityAlignment
Q9SJ52 NDR1/HIN1-like protein 101.1e-0723.77Show/hide
Query:  AAYYTNNPQPFRPEPVRAGFIRG--------IVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKAN--YSGNWESNVTVQNPNQKLNLN
        A Y  + P P      R G  RG         V  +I L+V++ ++++I W+++RP    F V   ++  F+ +  +     N    V V+NPN+++ L 
Subjt:  AAYYTNNPQPFRPEPVRAGFIRG--------IVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKAN--YSGNWESNVTVQNPNQKLNLN

Query:  FERIQSFVDFKDNTLAMSFVEPFFLDVEKSGQMRVKLTSSSPDDPGN-----WAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDL
        ++RI++   ++    +   + PF+       Q     T  +P   G       A     +  ER +G  +  ++F +   F+ G    RR+  +V C+DL
Subjt:  FERIQSFVDFKDNTLAMSFVEPFFLDVEKSGQMRVKLTSSSPDDPGN-----WAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDL

Query:  KLAFAGPAANNAVNTRDGFPKNC
        +L  +   +N    T   FP  C
Subjt:  KLAFAGPAANNAVNTRDGFPKNC

Arabidopsis top hitse value%identityAlignment
AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.2e-1730.86Show/hide
Query:  PAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEP-VRAGFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNIS
        PA GY P P+P YP      PP N Y    A  A  Y N+   + P+P  RA  IR +       ++L+ L   I ++++RP++P   ++SL+V+NFN+S
Subjt:  PAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEP-VRAGFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNIS

Query:  KANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVEKSGQMRVKLTSSSPDDPGNWAQTEEKMGRERAT-GFVSFNLRFFVWTT
            SG W+  +  +NPN K++L++E     + +   +L+ + ++PF  D  K  Q  V  T S      +  +  + +G+ER+  G V F+LR   + T
Subjt:  KANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVEKSGQMRVKLTSSSPDDPGNWAQTEEKMGRERAT-GFVSFNLRFFVWTT

Query:  FRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNC
        FR G  + RR  + V+C+D+ +   G   ++      G  K C
Subjt:  FRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNC

AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family7.7e-0923.77Show/hide
Query:  AAYYTNNPQPFRPEPVRAGFIRG--------IVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKAN--YSGNWESNVTVQNPNQKLNLN
        A Y  + P P      R G  RG         V  +I L+V++ ++++I W+++RP    F V   ++  F+ +  +     N    V V+NPN+++ L 
Subjt:  AAYYTNNPQPFRPEPVRAGFIRG--------IVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKAN--YSGNWESNVTVQNPNQKLNLN

Query:  FERIQSFVDFKDNTLAMSFVEPFFLDVEKSGQMRVKLTSSSPDDPGN-----WAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDL
        ++RI++   ++    +   + PF+       Q     T  +P   G       A     +  ER +G  +  ++F +   F+ G    RR+  +V C+DL
Subjt:  FERIQSFVDFKDNTLAMSFVEPFFLDVEKSGQMRVKLTSSSPDDPGN-----WAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDL

Query:  KLAFAGPAANNAVNTRDGFPKNC
        +L  +   +N    T   FP  C
Subjt:  KLAFAGPAANNAVNTRDGFPKNC

AT3G52460.1 hydroxyproline-rich glycoprotein family protein2.3e-4540.88Show/hide
Query:  PPPPPVAGTNPPPVYPPQAMGYPHGHGYPPAMGY-----PPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNN----PQPFRPEPVRAGFIRGIVSALI
        PPPPP     PPP    Q         YPP MGY     PP P+P YP A     PY  Y YAQAPPA+YY ++      P    P  +GF+RGI + LI
Subjt:  PPPPPVAGTNPPPVYPPQAMGYPHGHGYPPAMGY-----PPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNN----PQPFRPEPVRAGFIRGIVSALI

Query:  LLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDF-----KDNTLAMSFVEPFFLDVEKSGQMR
        +LVVL+ +S+ ITW+VLRP+IP F V++ +V+NFN++   +S  W +N+T++N N KL   F+RIQ  V       +D  LA +F +P F++ +KS  + 
Subjt:  LLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDF-----KDNTLAMSFVEPFFLDVEKSGQMR

Query:  VKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAV
          LT+   + P   +   ++M +ER TG V+F+LR  VW TF++  W  R   ++VFC  LK+ F G + N AV
Subjt:  VKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAV

AT4G19200.1 proline-rich family protein3.0e-0561.54Show/hide
Query:  PPVAGTNPPPVYPPQAMGYPHGHGYPPAMGYPPAPHP----GYPPAPGTYPP
        PP  G  PP  YPPQ  GYP   GYPPA GYPP  +P    GYPPAPG YPP
Subjt:  PPVAGTNPPPVYPPQAMGYPHGHGYPPAMGYPPAPHP----GYPPAPGTYPP

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family6.3e-1123.96Show/hide
Query:  TNNPQPFRPEPVRAGFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANY-SGNWESNVTVQNPNQKLNLNFERIQSFVDFKDN
        T+  QP R    R   I  I   ++ L+ +  +  +ITW+  +P+   + V++ +V NFN++  N+ S  ++  +   NPN ++++ +  ++ FV FKD 
Subjt:  TNNPQPFRPEPVRAGFIRGIVSALILLVVLMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANY-SGNWESNVTVQNPNQKLNLNFERIQSFVDFKDN

Query:  TLAMSFVEPFF---LDVEKSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGP
        TLA   VEPF    ++V++  +  +    +     G   +++  +G+     FV   +RF V      G W +     ++ C  + ++ + P
Subjt:  TLAMSFVEPFF---LDVEKSGQMRVKLTSSSPDDPGNWAQTEEKMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTCCTCATCCGACGATCGACAATCCATATCCAAAGATCCACCTCCGCCCCCCGTCGCCGGCACCAACCCCCCTCCGGTCTACCCTCCCCAAGCAATGGGCTA
CCCCCACGGCCACGGCTACCCTCCGGCCATGGGGTACCCTCCGGCCCCACATCCAGGGTACCCTCCGGCGCCGGGGACCTACCCGCCCTACAACGCCTACGCCTACGCCC
AAGCCCCTCCCGCCGCTTATTACACCAACAACCCCCAACCTTTCCGGCCGGAGCCCGTGAGGGCCGGCTTCATCCGCGGCATCGTCTCGGCGTTAATCCTTCTGGTGGTG
TTGATGACGCTGTCGAGCATCATCACGTGGATCGTGCTCCGACCGGAAATCCCGACGTTCAGAGTGGATTCGTTGGCGGTGGCGAATTTCAACATCTCGAAGGCGAACTA
CTCCGGGAACTGGGAGTCGAACGTGACGGTGCAGAATCCGAACCAGAAGCTGAACCTGAATTTCGAGCGGATCCAGAGCTTCGTGGACTTCAAGGACAACACTCTGGCGA
TGTCGTTCGTGGAGCCGTTCTTTCTGGACGTGGAGAAGAGCGGGCAGATGCGGGTGAAGCTGACGTCGAGCAGCCCGGACGACCCCGGGAACTGGGCCCAGACGGAGGAG
AAGATGGGCCGGGAGCGGGCCACGGGATTCGTCAGTTTCAATTTGAGATTCTTTGTTTGGACCACTTTCCGATCGGGGACATGGTGGACCAGGCGCGTTCTCATGAGAGT
CTTCTGTGAGGATTTGAAGCTCGCCTTCGCCGGACCCGCCGCGAACAACGCCGTCAACACGCGCGACGGCTTCCCCAAGAACTGTGCGGTTCTCATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCTCCTCATCCGACGATCGACAATCCATATCCAAAGATCCACCTCCGCCCCCCGTCGCCGGCACCAACCCCCCTCCGGTCTACCCTCCCCAAGCAATGGGCTA
CCCCCACGGCCACGGCTACCCTCCGGCCATGGGGTACCCTCCGGCCCCACATCCAGGGTACCCTCCGGCGCCGGGGACCTACCCGCCCTACAACGCCTACGCCTACGCCC
AAGCCCCTCCCGCCGCTTATTACACCAACAACCCCCAACCTTTCCGGCCGGAGCCCGTGAGGGCCGGCTTCATCCGCGGCATCGTCTCGGCGTTAATCCTTCTGGTGGTG
TTGATGACGCTGTCGAGCATCATCACGTGGATCGTGCTCCGACCGGAAATCCCGACGTTCAGAGTGGATTCGTTGGCGGTGGCGAATTTCAACATCTCGAAGGCGAACTA
CTCCGGGAACTGGGAGTCGAACGTGACGGTGCAGAATCCGAACCAGAAGCTGAACCTGAATTTCGAGCGGATCCAGAGCTTCGTGGACTTCAAGGACAACACTCTGGCGA
TGTCGTTCGTGGAGCCGTTCTTTCTGGACGTGGAGAAGAGCGGGCAGATGCGGGTGAAGCTGACGTCGAGCAGCCCGGACGACCCCGGGAACTGGGCCCAGACGGAGGAG
AAGATGGGCCGGGAGCGGGCCACGGGATTCGTCAGTTTCAATTTGAGATTCTTTGTTTGGACCACTTTCCGATCGGGGACATGGTGGACCAGGCGCGTTCTCATGAGAGT
CTTCTGTGAGGATTTGAAGCTCGCCTTCGCCGGACCCGCCGCGAACAACGCCGTCAACACGCGCGACGGCTTCCCCAAGAACTGTGCGGTTCTCATGTAG
Protein sequenceShow/hide protein sequence
MASSSSDDRQSISKDPPPPPVAGTNPPPVYPPQAMGYPHGHGYPPAMGYPPAPHPGYPPAPGTYPPYNAYAYAQAPPAAYYTNNPQPFRPEPVRAGFIRGIVSALILLVV
LMTLSSIITWIVLRPEIPTFRVDSLAVANFNISKANYSGNWESNVTVQNPNQKLNLNFERIQSFVDFKDNTLAMSFVEPFFLDVEKSGQMRVKLTSSSPDDPGNWAQTEE
KMGRERATGFVSFNLRFFVWTTFRSGTWWTRRVLMRVFCEDLKLAFAGPAANNAVNTRDGFPKNCAVLM