; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0028292 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0028292
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationchr04:5578313..5580512
RNA-Seq ExpressionPI0028292
SyntenyPI0028292
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043818.1 protein YLS9 [Cucumis melo var. makuwa]8.5e-14388.74Show/hide
Query:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG
        MASSSEDQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNNPQNYRA T++AG
Subjt:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG

Query:  FLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEK
        FLRGIV ALI LVAIMTLSSIITWIILRPE+PVF+VDS SV+NFNISKLNYSGNWD SVTVQNPNHKLNV+++RIQSFVDYK+NTLAMSYA+PFFLDVEK
Subjt:  FLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEK

Query:  SGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVVV
        SGQM+VKLTS+SPDDPGNW+ETEEKLGRERATGTVSFNLRFFAWTTFR+GSWWTRRVVMRV CED+K+VFTGPAA +AVYLAD HSKTCSV+V
Subjt:  SGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVVV

XP_008442912.1 PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo]8.5e-14388.74Show/hide
Query:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG
        MASSSEDQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNNPQNYRA T++AG
Subjt:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG

Query:  FLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEK
        FLRGIV ALI LVAIMTLSSIITWIILRPE+PVF+VDS SV+NFNISKLNYSGNWD SVTVQNPNHKLNV+++RIQSFVDYK+NTLAMSYA+PFFLDVEK
Subjt:  FLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEK

Query:  SGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVVV
        SGQM+VKLTS+SPDDPGNW+ETEEKLGRERATGTVSFNLRFFAWTTFR+GSWWTRRVVMRV CED+K+VFTGPAA +AVYLAD HSKTCSV+V
Subjt:  SGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVVV

XP_011652032.1 uncharacterized protein LOC105434983 [Cucumis sativus]8.8e-14085.22Show/hide
Query:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF
        MASSSEDQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPPHGHGY PAMGYPP P PGYPPAPGNYPPYN YYAQAPPAAYYNNPQNYRA+TV+AGF
Subjt:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF

Query:  LRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKS
        LRGIVTALI LVA+MTLSSIITWI+LRP+IPVF+VDS SV+NFNISKLNYSGNW+ S+TV+NPNHKL V+++RIQSFV+YKENTLAMSYA+PFF+DVEKS
Subjt:  LRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKS

Query:  GQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVV
         QMRVKLTS+SPDDPGNW+ETEEK+G+E+A+GTVSFNLRFFAWT FRSGSWWTRR+VM+VFCEDLK+ FTGPAA + VYLAD HSKTCSV+
Subjt:  GQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVV

XP_022983003.1 uncharacterized protein LOC111481675 [Cucurbita maxima]1.8e-11675.41Show/hide
Query:  MASSSEDQ---QSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY-------NNPQ
        MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PPH HGYPPAMGYPPAPHPGYPPAPGNYPPYNAY Y QAPPAAYY       NNPQ
Subjt:  MASSSEDQ---QSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY-------NNPQ

Query:  NYRAETVNAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSY
         YR ET  AGFLRGI  AL+ LV IMT+SSIITWIILRPEIP F+VDS SVANFNISK NYSG WDV VTVQNPNHKLN+H +RI+SFVDY +NT+A S+
Subjt:  NYRAETVNAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSY

Query:  AEPFFLDVEKSGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKT
        ++PFFLD+EKS QM VK+TS+SPDDPGNWV+TEEKL RERATGTVSF LR  AWTTFR  SGS WTRRV++RVFCEDLK+VFTG    + VY    H KT
Subjt:  AEPFFLDVEKSGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKT

Query:  CSVVV
        C V+V
Subjt:  CSVVV

XP_038905898.1 uncharacterized protein LOC120091828 [Benincasa hispida]1.2e-13182.19Show/hide
Query:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF
        MASSS+D QSQSKATDPPP  P SAGNNPPPVYPPPTLGYPPP GH YPPAMGYPPAPHPGYPPAPGNYPPYN YYAQAPPAAYYNN QNYRAETVN GF
Subjt:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF

Query:  LRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKS
        LRGIVTALI  VAIMTLSSI+TWIILRPEIPVFR+DS SV NFNISK NYSGNWD ++TVQNPNH+LNV+++R+QSFVDYK+NTLAMSY +PFFLDVEKS
Subjt:  LRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKS

Query:  GQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVVV
         QMRVKLTS+SPDDPG+W ETE+KLG+E+ATGTVSFNLRF AWTTFR GSWWTRRVV+RVFCEDLK+VF GPAA   VY  + + K CSV++
Subjt:  GQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVVV

TrEMBL top hitse value%identityAlignment
A0A0A0LGS8 Uncharacterized protein4.3e-14085.22Show/hide
Query:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF
        MASSSEDQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPPHGHGY PAMGYPP P PGYPPAPGNYPPYN YYAQAPPAAYYNNPQNYRA+TV+AGF
Subjt:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF

Query:  LRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKS
        LRGIVTALI LVA+MTLSSIITWI+LRP+IPVF+VDS SV+NFNISKLNYSGNW+ S+TV+NPNHKL V+++RIQSFV+YKENTLAMSYA+PFF+DVEKS
Subjt:  LRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKS

Query:  GQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVV
         QMRVKLTS+SPDDPGNW+ETEEK+G+E+A+GTVSFNLRFFAWT FRSGSWWTRR+VM+VFCEDLK+ FTGPAA + VYLAD HSKTCSV+
Subjt:  GQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVV

A0A1S3B6W4 uncharacterized protein LOC1034866744.1e-14388.74Show/hide
Query:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG
        MASSSEDQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNNPQNYRA T++AG
Subjt:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG

Query:  FLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEK
        FLRGIV ALI LVAIMTLSSIITWIILRPE+PVF+VDS SV+NFNISKLNYSGNWD SVTVQNPNHKLNV+++RIQSFVDYK+NTLAMSYA+PFFLDVEK
Subjt:  FLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEK

Query:  SGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVVV
        SGQM+VKLTS+SPDDPGNW+ETEEKLGRERATGTVSFNLRFFAWTTFR+GSWWTRRVVMRV CED+K+VFTGPAA +AVYLAD HSKTCSV+V
Subjt:  SGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVVV

A0A5A7TLT1 Protein YLS94.1e-14388.74Show/hide
Query:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG
        MASSSEDQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNNPQNYRA T++AG
Subjt:  MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG

Query:  FLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEK
        FLRGIV ALI LVAIMTLSSIITWIILRPE+PVF+VDS SV+NFNISKLNYSGNWD SVTVQNPNHKLNV+++RIQSFVDYK+NTLAMSYA+PFFLDVEK
Subjt:  FLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEK

Query:  SGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVVV
        SGQM+VKLTS+SPDDPGNW+ETEEKLGRERATGTVSFNLRFFAWTTFR+GSWWTRRVVMRV CED+K+VFTGPAA +AVYLAD HSKTCSV+V
Subjt:  SGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVVV

A0A6J1F415 uncharacterized protein LOC1114421882.1e-11574.34Show/hide
Query:  MASSSEDQ---QSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY------NNPQN
        MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PPH HGYPPAMGYPPAPHPGYPPAPGNYPPYNAY Y QAPPAAYY      NNPQ 
Subjt:  MASSSEDQ---QSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY------NNPQN

Query:  YRAETVNAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYA
        YR ET  AGFLRGI  AL+ LV IMT+SSIITWIILRPEIP F+VDS SV NFNISK NYSG WD+ VTVQNPNHKLN+H +RI+SFVDY +NT+A S++
Subjt:  YRAETVNAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYA

Query:  EPFFLDVEKSGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTC
        +PFFLD+EKS QM+VK+TS+SPDDPGNW +TEEKL RER TGTVSF LR  AWTTFR  SGS WTRRV++RVFCEDLK+VFTG    + VY     SKTC
Subjt:  EPFFLDVEKSGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTC

Query:  SVVV
         V+V
Subjt:  SVVV

A0A6J1J6I9 uncharacterized protein LOC1114816758.7e-11775.41Show/hide
Query:  MASSSEDQ---QSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY-------NNPQ
        MASSS DQ   QSQSK TDPPPP P SAGNNPPP+YPPPTLGY PPH HGYPPAMGYPPAPHPGYPPAPGNYPPYNAY Y QAPPAAYY       NNPQ
Subjt:  MASSSEDQ---QSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY-------NNPQ

Query:  NYRAETVNAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSY
         YR ET  AGFLRGI  AL+ LV IMT+SSIITWIILRPEIP F+VDS SVANFNISK NYSG WDV VTVQNPNHKLN+H +RI+SFVDY +NT+A S+
Subjt:  NYRAETVNAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSY

Query:  AEPFFLDVEKSGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKT
        ++PFFLD+EKS QM VK+TS+SPDDPGNWV+TEEKL RERATGTVSF LR  AWTTFR  SGS WTRRV++RVFCEDLK+VFTG    + VY    H KT
Subjt:  AEPFFLDVEKSGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKT

Query:  CSVVV
        C V+V
Subjt:  CSVVV

SwissProt top hitse value%identityAlignment
Q9SJ52 NDR1/HIN1-like protein 101.3e-0525.48Show/hide
Query:  PPYNAYYA-QAPPAAYYNNPQNYRAETVNAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISK----LNYSGNWDVSVTVQNPN
        P   A+Y    PP A     +           L   V  +I L+ I+ ++++I W+I+RP    F V   S+  F+ +     L Y  N  ++V V+NPN
Subjt:  PPYNAYYA-QAPPAAYYNNPQNYRAETVNAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISK----LNYSGNWDVSVTVQNPN

Query:  HKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKSGQMRVKLTSTSPDDPG-NWV----ETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMR
         ++ ++ DRI++   Y+    +     PF+       Q     T  +P   G N V         L  ER +G  +  ++F     F+ G    RR+  +
Subjt:  HKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKSGQMRVKLTSTSPDDPG-NWV----ETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMR

Query:  VFCEDLKV
        V C+DL++
Subjt:  VFCEDLKV

Q9SRN0 NDR1/HIN1-like protein 13.3e-0426.32Show/hide
Query:  LRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNIS---KLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEP
        +R I  ++IF++ I+ L+ ++ W IL+P  P F +   +V  FN+S       + N+ ++++ +NPN+K+ ++ DR+  +  Y+   +    + P
Subjt:  LRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNIS---KLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEP

Arabidopsis top hitse value%identityAlignment
AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.7e-1933.93Show/hide
Query:  PAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAY-YNNPQNYRAETVN--AGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNIS
        PA GY P P+P YP      PP N Y   A   AY Y N   Y A   N  A  +R +       + ++ L   I ++I+RP++P   ++SLSV+NFN+S
Subjt:  PAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAY-YNNPQNYRAETVN--AGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNIS

Query:  KLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKSGQMRVKLTSTSPDDPGNWVETE--EKLGRERAT-GTVSFNLRFFAW
            SG WD+ +  +NPN K+++H +     + Y   +L+ +  +PF  D  K  Q  V  T +     G +V+    + +G+ER+  G V F+LR  ++
Subjt:  KLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKSGQMRVKLTSTSPDDPGNWVETE--EKLGRERAT-GTVSFNLRFFAW

Query:  TTFRSGSWWTRRVVMRVFCEDLKV
         TFR G++  RR V  V+C+D+ V
Subjt:  TTFRSGSWWTRRVVMRVFCEDLKV

AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family9.5e-0725.48Show/hide
Query:  PPYNAYYA-QAPPAAYYNNPQNYRAETVNAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISK----LNYSGNWDVSVTVQNPN
        P   A+Y    PP A     +           L   V  +I L+ I+ ++++I W+I+RP    F V   S+  F+ +     L Y  N  ++V V+NPN
Subjt:  PPYNAYYA-QAPPAAYYNNPQNYRAETVNAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISK----LNYSGNWDVSVTVQNPN

Query:  HKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKSGQMRVKLTSTSPDDPG-NWV----ETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMR
         ++ ++ DRI++   Y+    +     PF+       Q     T  +P   G N V         L  ER +G  +  ++F     F+ G    RR+  +
Subjt:  HKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKSGQMRVKLTSTSPDDPG-NWV----ETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMR

Query:  VFCEDLKV
        V C+DL++
Subjt:  VFCEDLKV

AT3G52460.1 hydroxyproline-rich glycoprotein family protein2.2e-4039.53Show/hide
Query:  SSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPP--HGHGYPPAMGY-----PPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYYNNPQNYRAET
        S  ++++Q K    P        N PPP  PPP    PPP      YPP MGY     PP P+P YP A     PY  Y YAQAPPA+YY +    +   
Subjt:  SSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPP--HGHGYPPAMGY-----PPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYYNNPQNYRAET

Query:  V-----NAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENT------
        V     ++GF+RGI T LI LV ++ +S+ ITW++LRP+IP+F V++ SV+NFN++   +S  W  ++T++N N KL  + DRIQ  V Y +N       
Subjt:  V-----NAGFLRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENT------

Query:  LAMSYAEPFFLDVEKSGQMRVKLTSTSPDDP--GNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLAD
        LA ++ +P F++ +KS  +   LT+   + P   +WV  E K  +ER TGTV+F+LR   W TF++  W  R   ++VFC  LKV F G +   AV L  
Subjt:  LAMSYAEPFFLDVEKSGQMRVKLTSTSPDDP--GNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLAD

Query:  P
        P
Subjt:  P

AT5G22200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.1e-1026.23Show/hide
Query:  LRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNY-SGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEK
        +R I  A + L+  +     + W IL P  P F +  +++ +FN+S+ N+ S N  V+V+ +NPN K+ +  DR+  +V Y+   + ++   P       
Subjt:  LRGIVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNY-SGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEK

Query:  SGQMRVKLTSTSPDDPGNWVET----EEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDL-----KVVFTGPA
        + Q  +++T  SP   G+ V         L  +   G V  N++   W  ++ GSW +    + V C        K+  TGPA
Subjt:  SGQMRVKLTSTSPDDPGNWVET----EEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDL-----KVVFTGPA

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family8.3e-1124.28Show/hide
Query:  IVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNY-SGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFF---LDVEK
        ++  LIF+ A+     +ITW+  +P+   + V++ SV NFN++  N+ S  +  ++   NPNH+++V+   ++ FV +K+ TLA    EPF    ++V++
Subjt:  IVTALIFLVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNY-SGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFF---LDVEK

Query:  SGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGP
          +  +          G  + ++  LG+      + F +   A   F+ G W +     ++ C  + V  + P
Subjt:  SGQMRVKLTSTSPDDPGNWVETEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTCATCGGAGGATCAACAATCTCAATCCAAAGCCACGGACCCACCTCCTCCGCAGCCACACTCTGCTGGAAACAACCCTCCTCCTGTCTACCCACCGCCCAC
ATTGGGGTACCCTCCTCCTCACGGCCATGGGTACCCTCCGGCAATGGGGTACCCTCCAGCTCCACATCCAGGGTACCCACCGGCTCCGGGGAATTACCCTCCTTACAATG
CGTACTACGCTCAGGCTCCCCCGGCGGCGTATTACAATAACCCACAAAATTACAGAGCGGAGACTGTAAACGCGGGATTCCTCCGAGGAATTGTGACAGCGTTGATTTTT
TTGGTGGCTATAATGACTCTGTCCAGCATAATCACATGGATCATCCTCCGCCCTGAAATCCCAGTGTTTAGAGTCGATTCATTATCCGTTGCGAATTTCAATATCTCGAA
ATTGAATTACTCCGGGAATTGGGATGTGAGTGTGACGGTTCAAAATCCGAATCATAAACTGAATGTGCATTTGGACCGGATCCAGAGCTTCGTGGACTACAAAGAAAATA
CGTTGGCAATGTCTTATGCGGAGCCATTTTTTCTAGATGTGGAGAAGAGCGGTCAAATGAGGGTGAAATTGACGTCGACTAGTCCCGATGATCCGGGAAATTGGGTGGAA
ACAGAGGAGAAGCTGGGGCGGGAGAGGGCGACCGGAACGGTGAGTTTCAATTTGAGATTCTTTGCTTGGACGACTTTCCGATCTGGGTCTTGGTGGACAAGGCGGGTTGT
TATGAGAGTGTTTTGTGAAGATTTGAAGGTGGTCTTCACCGGACCCGCCGCCGCTAATGCCGTTTACTTGGCCGACCCACACTCCAAGACTTGTTCTGTTGTCGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATTCTTCTCTCTCTCTCTCTCTCTCTCTCCAAATCCTTTCACAGGGAGAGAGAAAAAGCCAGAAACAGAGCATCCCTTTCTAATGGCTTCCTCATCGGAGGATCAACAAT
CTCAATCCAAAGCCACGGACCCACCTCCTCCGCAGCCACACTCTGCTGGAAACAACCCTCCTCCTGTCTACCCACCGCCCACATTGGGGTACCCTCCTCCTCACGGCCAT
GGGTACCCTCCGGCAATGGGGTACCCTCCAGCTCCACATCCAGGGTACCCACCGGCTCCGGGGAATTACCCTCCTTACAATGCGTACTACGCTCAGGCTCCCCCGGCGGC
GTATTACAATAACCCACAAAATTACAGAGCGGAGACTGTAAACGCGGGATTCCTCCGAGGAATTGTGACAGCGTTGATTTTTTTGGTGGCTATAATGACTCTGTCCAGCA
TAATCACATGGATCATCCTCCGCCCTGAAATCCCAGTGTTTAGAGTCGATTCATTATCCGTTGCGAATTTCAATATCTCGAAATTGAATTACTCCGGGAATTGGGATGTG
AGTGTGACGGTTCAAAATCCGAATCATAAACTGAATGTGCATTTGGACCGGATCCAGAGCTTCGTGGACTACAAAGAAAATACGTTGGCAATGTCTTATGCGGAGCCATT
TTTTCTAGATGTGGAGAAGAGCGGTCAAATGAGGGTGAAATTGACGTCGACTAGTCCCGATGATCCGGGAAATTGGGTGGAAACAGAGGAGAAGCTGGGGCGGGAGAGGG
CGACCGGAACGGTGAGTTTCAATTTGAGATTCTTTGCTTGGACGACTTTCCGATCTGGGTCTTGGTGGACAAGGCGGGTTGTTATGAGAGTGTTTTGTGAAGATTTGAAG
GTGGTCTTCACCGGACCCGCCGCCGCTAATGCCGTTTACTTGGCCGACCCACACTCCAAGACTTGTTCTGTTGTCGTCTAGAAGAATTCTTCGGATTATCTCGTGACGAA
GAACAATAAATAAATCGACGGTACATTGAATGAGGATGAAGATGAAGAAAAATCAAAACGTTAAGAGACTCCTAAATTAATATCTATCTCATTGACAAATAATCTATTAG
AGTCCATTCTCTATCCACACCTTTAACCCACACTTTAACGTATAGGTATAACTTAGCCACCACATATTTACCCTCCATTAGTATTTTTAAGTTATACAAACAAATTTCAA
ATTCCTTGAAAGAAATGAAAATAAGAATTTTTTTTCTTTTTATCAGGAAAGTAGGCAATATGTGTGTGGGCTTGGAGGACAAGGGGTATAGCTGAGAGATTTGCATTCAT
GTTTGGACATGATGAAACATTATATTCAATTTTAGGGAATTTTTTTTTTTTTTTTTTTTTATTTTTAGTATCATATCTCTTTCAAATAAGAATTTTATTTTATTTATTTT
TTATTTTTATTTATTTATTTATTTTGTAGATAGAGAGATATGAAAATTTGGAAATATAAAAGTTGTTTCTGAATGTGTTGGGGTTGTGTTTAGAGGAAAAAAGAGACATG
TAACTTTGTTTTGTTTTCATTTTCATTTTCATATGAACTTTTTTAAAAGTTTTTTTTATCCTCTGTGTTTGTATTTTGTGTATTAAAAAGTTATATACTTATTTTCGTGG
TCTTTTAA
Protein sequenceShow/hide protein sequence
MASSSEDQQSQSKATDPPPPQPHSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGFLRGIVTALIF
LVAIMTLSSIITWIILRPEIPVFRVDSLSVANFNISKLNYSGNWDVSVTVQNPNHKLNVHLDRIQSFVDYKENTLAMSYAEPFFLDVEKSGQMRVKLTSTSPDDPGNWVE
TEEKLGRERATGTVSFNLRFFAWTTFRSGSWWTRRVVMRVFCEDLKVVFTGPAAANAVYLADPHSKTCSVVV