CuGenDBv2

Cla97C10G201380 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C10G201380
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Hydroxyproline-rich glycoprotein family protein
Genome location: Cla97Chr10:31452286..31453164
RNA-Seq Expression: Cla97C10G201380
Synteny: Cla97C10G201380
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
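As a quick sanity check on the genome location, the span length can be computed from the coordinate string. This is a minimal sketch that assumes the `start..end` coordinates are 1-based and inclusive (an assumption, not something this page states); `parse_location` is a hypothetical helper name.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Cla97Chr10:31452286..31453164")
# Assumption: both endpoints belong to the feature (inclusive coordinates).
span_length = end - start + 1
print(chrom, span_length)
```

Under the inclusive-coordinate assumption the span length is a multiple of three, which is what one would expect for a gene whose CDS and mRNA sequences are identical.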


Homology
GenBank top hits | e value | % identity | Alignment
KAA0043818.1 protein YLS9 [Cucumis melo var. makuwa] | 5.0e-135 | 86.01
Query:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG
        MASSS+DQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNNPQNYRA T++AG
Subjt:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG

Query:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK
        F+RGIV ALILLVAIMTLSSIITWIILRPE+PVF+VDSFSV+NFNISK NYSGNWDA+VTVQNPNHKLNVN+ERIQSFVD+K NTLAMSYADPFFLDVEK
Subjt:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK

Query:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVLV
        S QM+VKLTSSSPDDPGNW  TEEKLG+ER TGTVSF+LRFFAWTTFR+GSWWTRRVVMRV CED+KLVF GPAA +AVY  D H   C+VLV
Subjt:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVLV

XP_008442912.1 PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo] | 5.0e-135 | 86.01
Query:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG
        MASSS+DQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNNPQNYRA T++AG
Subjt:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG

Query:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK
        F+RGIV ALILLVAIMTLSSIITWIILRPE+PVF+VDSFSV+NFNISK NYSGNWDA+VTVQNPNHKLNVN+ERIQSFVD+K NTLAMSYADPFFLDVEK
Subjt:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK

Query:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVLV
        S QM+VKLTSSSPDDPGNW  TEEKLG+ER TGTVSF+LRFFAWTTFR+GSWWTRRVVMRV CED+KLVF GPAA +AVY  D H   C+VLV
Subjt:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVLV

XP_011652032.1 uncharacterized protein LOC105434983 [Cucumis sativus] | 1.9e-134 | 83.51
Query:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF
        MASSS+DQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPPHGHGY PAMGYPP P PGYPPAPGNYPPYN YYAQAPPAAYYNNPQNYRA+TV+AGF
Subjt:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF

Query:  IRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKS
        +RGIVTALILLVA+MTLSSIITWI+LRP+IPVF+VDSFSV+NFNISK NYSGNW+ ++TV+NPNHKL VN+ERIQSFV++K+NTLAMSYADPFF+DVEKS
Subjt:  IRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKS

Query:  SQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVL
        SQMRVKLTSSSPDDPGNW  TEEK+GQE+ +GTVSF+LRFFAWT FRSGSWWTRR+VM+VFCEDLKL F GPAA + VY  DAH   C+VL
Subjt:  SQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVL

XP_022983003.1 uncharacterized protein LOC111481675 [Cucurbita maxima] | 8.1e-117 | 76.39
Query:  MASSSDDQ---QSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY-------NNPQ
        MASSS DQ   QSQSK TDPPPP PPSAGNNPPP+YPPPTLGY PPH HGYPPAMGYPPAPHPGYPPAPGNYPPYNAY Y QAPPAAYY       NNPQ
Subjt:  MASSSDDQ---QSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY-------NNPQ

Query:  NYRAETVNAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSY
         YR ET  AGF+RGI  AL+LLV IMT+SSIITWIILRPEIP F+VDSFSVANFNISK+NYSG WD  VTVQNPNHKLN++ ERI+SFVD+ DNT+A S+
Subjt:  NYRAETVNAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSY

Query:  ADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMI
        +DPFFLD+EKS QM VK+TSSSPDDPGNW  TEEKL +ER TGTVSF+LR  AWTTFR  SGS WTRRV++RVFCEDLKLVF G    + VYSP AHP  
Subjt:  ADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMI

Query:  CAVLV
        C VLV
Subjt:  CAVLV

XP_038905898.1 uncharacterized protein LOC120091828 [Benincasa hispida] | 7.5e-139 | 86.64
Query:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF
        MASSSDD QSQSKATDPPP PPPSAGNNPPPVYPPPTLGYPPP GH YPPAMGYPPAPHPGYPPAPGNYPPYN YYAQAPPAAYYNN QNYRAETVN GF
Subjt:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF

Query:  IRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKS
        +RGIVTALIL VAIMTLSSI+TWIILRPEIPVFR+DSFSV NFNISK+NYSGNWD N+TVQNPNH+LNVN+ER+QSFVD+KDNTLAMSY DPFFLDVEKS
Subjt:  IRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKS

Query:  SQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVLV
         QMRVKLTSSSPDDPG+WA TE+KLGQE+ TGTVSF+LRF AWTTFR GSWWTRRVV+RVFCEDLKLVFAGPAA   VYSP+ +P IC+VL+
Subjt:  SQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVLV

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LGS8 Uncharacterized protein | 9.2e-135 | 83.51
Query:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF
        MASSS+DQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPPHGHGY PAMGYPP P PGYPPAPGNYPPYN YYAQAPPAAYYNNPQNYRA+TV+AGF
Subjt:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGF

Query:  IRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKS
        +RGIVTALILLVA+MTLSSIITWI+LRP+IPVF+VDSFSV+NFNISK NYSGNW+ ++TV+NPNHKL VN+ERIQSFV++K+NTLAMSYADPFF+DVEKS
Subjt:  IRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKS

Query:  SQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVL
        SQMRVKLTSSSPDDPGNW  TEEK+GQE+ +GTVSF+LRFFAWT FRSGSWWTRR+VM+VFCEDLKL F GPAA + VY  DAH   C+VL
Subjt:  SQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVL

A0A1S3B6W4 uncharacterized protein LOC103486674 | 2.4e-135 | 86.01
Query:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG
        MASSS+DQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNNPQNYRA T++AG
Subjt:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG

Query:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK
        F+RGIV ALILLVAIMTLSSIITWIILRPE+PVF+VDSFSV+NFNISK NYSGNWDA+VTVQNPNHKLNVN+ERIQSFVD+K NTLAMSYADPFFLDVEK
Subjt:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK

Query:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVLV
        S QM+VKLTSSSPDDPGNW  TEEKLG+ER TGTVSF+LRFFAWTTFR+GSWWTRRVVMRV CED+KLVF GPAA +AVY  D H   C+VLV
Subjt:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVLV

A0A5A7TLT1 Protein YLS9 | 2.4e-135 | 86.01
Query:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG
        MASSS+DQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNNPQNYRA T++AG
Subjt:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAG

Query:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK
        F+RGIV ALILLVAIMTLSSIITWIILRPE+PVF+VDSFSV+NFNISK NYSGNWDA+VTVQNPNHKLNVN+ERIQSFVD+K NTLAMSYADPFFLDVEK
Subjt:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK

Query:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVLV
        S QM+VKLTSSSPDDPGNW  TEEKLG+ER TGTVSF+LRFFAWTTFR+GSWWTRRVVMRV CED+KLVF GPAA +AVY  D H   C+VLV
Subjt:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVLV

A0A6J1F415 uncharacterized protein LOC111442188 | 1.6e-115 | 75.99
Query:  MASSSDDQ---QSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY------NNPQN
        MASSS DQ   QSQSK TDPPPP PPSAGNNPPP+YPPPTLGY PPH HGYPPAMGYPPAPHPGYPPAPGNYPPYNAY Y QAPPAAYY      NNPQ 
Subjt:  MASSSDDQ---QSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY------NNPQN

Query:  YRAETVNAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYA
        YR ET  AGF+RGI  AL+LLV IMT+SSIITWIILRPEIP F+VDSFSV NFNISK+NYSG WD  VTVQNPNHKLN++ ERI+SFVD+ DNT+A S++
Subjt:  YRAETVNAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYA

Query:  DPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMIC
        DPFFLD+EKS QM+VK+TSSSPDDPGNWA TEEKL +ER TGTVSF+LR  AWTTFR  SGS WTRRV++RVFCEDLKLVF G    + VYSP A    C
Subjt:  DPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMIC

Query:  AVLV
         VLV
Subjt:  AVLV

A0A6J1J6I9 uncharacterized protein LOC111481675 | 3.9e-117 | 76.39
Query:  MASSSDDQ---QSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY-------NNPQ
        MASSS DQ   QSQSK TDPPPP PPSAGNNPPP+YPPPTLGY PPH HGYPPAMGYPPAPHPGYPPAPGNYPPYNAY Y QAPPAAYY       NNPQ
Subjt:  MASSSDDQ---QSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY-------NNPQ

Query:  NYRAETVNAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSY
         YR ET  AGF+RGI  AL+LLV IMT+SSIITWIILRPEIP F+VDSFSVANFNISK+NYSG WD  VTVQNPNHKLN++ ERI+SFVD+ DNT+A S+
Subjt:  NYRAETVNAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSY

Query:  ADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMI
        +DPFFLD+EKS QM VK+TSSSPDDPGNW  TEEKL +ER TGTVSF+LR  AWTTFR  SGS WTRRV++RVFCEDLKLVF G    + VYSP AHP  
Subjt:  ADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMI

Query:  CAVLV
        C VLV
Subjt:  CAVLV

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 1.1e-15 | 31.82
Query:  PAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAY-YNNPQNYRAETVN--AGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNIS
        PA GY P P+P YP      PP N Y   A   AY Y N   Y A   N  A  IR +       + ++ L   I ++I+RP++P   ++S SV+NFN+S
Subjt:  PAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAY-YNNPQNYRAETVN--AGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNIS

Query:  KANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQER-MTGTVSFSLRFFAWTT
            SG WD  +  +NPN K++++ E     + +   +L+ +   PF    +  + +   L+ S     G      + +G+ER + G V F LR  ++ T
Subjt:  KANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQER-MTGTVSFSLRFFAWTT

Query:  FRSGSWWTRRVVMRVFCEDL
        FR G++  RR V  V+C+D+
Subjt:  FRSGSWWTRRVVMRVFCEDL

AT3G52460.1 hydroxyproline-rich glycoprotein family protein | 2.0e-41 | 39.93
Query:  QQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGY-----PPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYYNNPQNYRAETV-----
        Q S+     PPPPPP S    PPP         P      YPP MGY     PP P+P YP A     PY  Y YAQAPPA+YY +    +   V     
Subjt:  QQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGY-----PPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYYNNPQNYRAETV-----

Query:  NAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDF-----KDNTLAMSYAD
        ++GF+RGI T LI+LV ++ +S+ ITW++LRP+IP+F V++FSV+NFN++   +S  W AN+T++N N KL    +RIQ  V       +D  LA ++  
Subjt:  NAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDF-----KDNTLAMSYAD

Query:  PFFLDVEKSSQMRVKLTSSSPDDP--GNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICA
        P F++ +KS  +   LT+   + P   +W   E K  +ER TGTV+FSLR   W TF++  W  R   ++VFC  LK+ F G +   AV  P   P+ C 
Subjt:  PFFLDVEKSSQMRVKLTSSSPDDP--GNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICA

Query:  VLV
        V V
Subjt:  VLV

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 5.9e-09 | 23.84
Query:  IVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANY-SGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQ
        I   ++ L+ +  +  +ITW+  +P+   + V++ SV NFN++  N+ S  +   +   NPNH+++V    ++ FV FKD TLA    +PF        +
Subjt:  IVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANY-SGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQ

Query:  MRVKLTSSS--PDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGP
        M VK    +   ++     +  + L  +   G + F +   A   F+ G W +     ++ C  + +  + P
Subjt:  MRVKLTSSS--PDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGP


Sequences
CDS sequence
ATGGCTTCCTCATCCGACGATCAACAATCCCAATCCAAGGCCACTGACCCACCTCCCCCGCCGCCGCCCTCTGCCGGAAACAACCCCCCTCCTGTCTACCCACCGCCCAC
ATTGGGGTACCCTCCTCCTCACGGTCATGGGTACCCTCCGGCGATGGGGTATCCTCCAGCCCCACATCCAGGGTACCCACCGGCGCCGGGGAATTACCCTCCTTATAACG
CGTACTATGCTCAGGCTCCCCCGGCGGCGTATTACAACAATCCCCAAAATTACAGGGCGGAGACGGTGAACGCGGGATTCATCCGGGGGATTGTGACGGCGTTGATTCTG
TTGGTGGCTATAATGACTCTGTCCAGCATAATAACGTGGATCATCCTCCGCCCTGAAATCCCAGTGTTCAGAGTGGATTCATTCTCCGTGGCGAATTTCAACATCTCGAA
AGCTAATTACTCCGGCAACTGGGACGCGAATGTGACGGTCCAAAATCCGAACCATAAACTGAACGTAAATTTGGAGCGGATCCAGAGCTTCGTGGACTTCAAAGACAACA
CATTGGCAATGTCTTATGCAGACCCATTTTTTCTGGACGTGGAGAAGAGCAGCCAAATGAGGGTGAAATTGACGTCGAGTAGCCCGGATGATCCGGGAAATTGGGCGGCA
ACAGAGGAGAAGCTAGGGCAGGAGAGGATGACCGGAACGGTGAGTTTCAGTTTGAGATTCTTTGCTTGGACAACTTTCCGATCTGGGTCTTGGTGGACGAGGCGAGTTGT
TATGAGAGTGTTCTGTGAGGATTTGAAGCTCGTCTTTGCCGGACCCGCCGCCGCTAATGCCGTTTATTCCCCCGACGCCCACCCCATGATTTGCGCGGTTCTCGTCTAG
mRNA sequence
ATGGCTTCCTCATCCGACGATCAACAATCCCAATCCAAGGCCACTGACCCACCTCCCCCGCCGCCGCCCTCTGCCGGAAACAACCCCCCTCCTGTCTACCCACCGCCCAC
ATTGGGGTACCCTCCTCCTCACGGTCATGGGTACCCTCCGGCGATGGGGTATCCTCCAGCCCCACATCCAGGGTACCCACCGGCGCCGGGGAATTACCCTCCTTATAACG
CGTACTATGCTCAGGCTCCCCCGGCGGCGTATTACAACAATCCCCAAAATTACAGGGCGGAGACGGTGAACGCGGGATTCATCCGGGGGATTGTGACGGCGTTGATTCTG
TTGGTGGCTATAATGACTCTGTCCAGCATAATAACGTGGATCATCCTCCGCCCTGAAATCCCAGTGTTCAGAGTGGATTCATTCTCCGTGGCGAATTTCAACATCTCGAA
AGCTAATTACTCCGGCAACTGGGACGCGAATGTGACGGTCCAAAATCCGAACCATAAACTGAACGTAAATTTGGAGCGGATCCAGAGCTTCGTGGACTTCAAAGACAACA
CATTGGCAATGTCTTATGCAGACCCATTTTTTCTGGACGTGGAGAAGAGCAGCCAAATGAGGGTGAAATTGACGTCGAGTAGCCCGGATGATCCGGGAAATTGGGCGGCA
ACAGAGGAGAAGCTAGGGCAGGAGAGGATGACCGGAACGGTGAGTTTCAGTTTGAGATTCTTTGCTTGGACAACTTTCCGATCTGGGTCTTGGTGGACGAGGCGAGTTGT
TATGAGAGTGTTCTGTGAGGATTTGAAGCTCGTCTTTGCCGGACCCGCCGCCGCTAATGCCGTTTATTCCCCCGACGCCCACCCCATGATTTGCGCGGTTCTCGTCTAG
Protein sequence
MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVNAGFIRGIVTALIL
LVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDANVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAA
TEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICAVLV
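
The protein sequence should correspond to a translation of the CDS sequence listed on this page. The following is a minimal sketch of that check, assuming the standard genetic code applies; `translate` is a hypothetical helper, and only the first 48 and the last 24 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 48 nucleotides and last 24 nucleotides of the CDS on this page.
cds_start = "ATGGCTTCCTCATCCGACGATCAACAATCCCAATCCAAGGCCACTGAC"
cds_end = "ATGATTTGCGCGGTTCTCGTCTAG"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Running the same helper on the full CDS, joined into one string without line breaks, would extend the comparison to the whole protein.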