; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC10G203680 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC10G203680
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationCicolChr10:31057425..31058973
RNA-Seq ExpressionCcUC10G203680
SyntenyCcUC10G203680
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043818.1 protein YLS9 [Cucumis melo var. makuwa]7.0e-13483.11Show/hide
Query:  REKNQKQSIPFQMASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNN
        RE   +    F MASSS+DQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNN
Subjt:  REKNQKQSIPFQMASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNN

Query:  PQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAM
        PQNYRA T+ AGF+RGIV ALILLVAIMTLSSIITWIILRPE+PVF+VDSFSV+NFNISK NYSGNWD +VTVQNPNHKLNVN+ERIQSFVD+K NTLAM
Subjt:  PQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAM

Query:  SYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMI
        SYADPFFLDVEKS QM+VKLTSSSPDDPGNW  TEEKLG+ER TGTVSF+LRFFAWTTFR+GSWWTRRVVMRV CED+KLVF GPAA +AVY  D H   
Subjt:  SYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMI

Query:  CA
        C+
Subjt:  CA

XP_008442912.1 PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo]3.5e-13385.52Show/hide
Query:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVKAG
        MASSS+DQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNNPQNYRA T+ AG
Subjt:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVKAG

Query:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK
        F+RGIV ALILLVAIMTLSSIITWIILRPE+PVF+VDSFSV+NFNISK NYSGNWD +VTVQNPNHKLNVN+ERIQSFVD+K NTLAMSYADPFFLDVEK
Subjt:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK

Query:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICA
        S QM+VKLTSSSPDDPGNW  TEEKLG+ER TGTVSF+LRFFAWTTFR+GSWWTRRVVMRV CED+KLVF GPAA +AVY  D H   C+
Subjt:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICA

XP_011652032.1 uncharacterized protein LOC105434983 [Cucumis sativus]3.1e-15080Show/hide
Query:  KTEVEDDNVRTFRHFSKKSFNGSPYSKSFSI--YPQTPLLSHSSLSLSPNPFTEREKNQKQSIPFQMASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYP
        KTEVEDDNVRTFR FSKKS NGSPYSKS SI  +  T LLS  SLSLS +   EREK   ++  F MASSS+DQQSQSKATDPPPP P SAGNNPPPVYP
Subjt:  KTEVEDDNVRTFRHFSKKSFNGSPYSKSFSI--YPQTPLLSHSSLSLSPNPFTEREKNQKQSIPFQMASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYP

Query:  PPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFR
        PPTLGYPPPHGHGY PAMGYPP P PGYPPAPGNYPPYN YYAQAPPAAYYNNPQNYRA+TV AGF+RGIVTALILLVA+MTLSSIITWI+LRP+IPVF+
Subjt:  PPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFR

Query:  VDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTV
        VDSFSV+NFNISK NYSGNW+G++TV+NPNHKL VN+ERIQSFV++K+NTLAMSYADPFF+DVEKSSQMRVKLTSSSPDDPGNW  TEEK+GQE+ +GTV
Subjt:  VDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTV

Query:  SFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICA
        SF+LRFFAWT FRSGSWWTRR+VM+VFCEDLKL F GPAA + VY  DAH   C+
Subjt:  SFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICA

XP_022935251.1 uncharacterized protein LOC111442188 [Cucurbita moschata]6.6e-12470.43Show/hide
Query:  KTEVEDDNVRTFRHFSKKSFNGSPYSKSFSI---YPQTPLLSHSSLSLSPN-----PFTEREKNQKQSIPFQMASSSDDQ---QSQSKATDPPPPPPPSA
        KTEVEDD VR  R  SKKSF G PYS S +        P+    SLSLS +        EREK  KQ   FQMASSS DQ   QSQSK TDPPPP PPSA
Subjt:  KTEVEDDNVRTFRHFSKKSFNGSPYSKSFSI---YPQTPLLSHSSLSLSPN-----PFTEREKNQKQSIPFQMASSSDDQ---QSQSKATDPPPPPPPSA

Query:  GNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY------NNPQNYRAETVKAGFIRGIVTALILLVAIMTLS
        GNNPPP+YPPPTLGY PPH HGYPPAMGYPPAPHPGYPPAPGNYPPYNAY Y QAPPAAYY      NNPQ YR ET  AGF+RGI  AL+LLV IMT+S
Subjt:  GNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY------NNPQNYRAETVKAGFIRGIVTALILLVAIMTLS

Query:  SIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNW
        SIITWIILRPEIP F+VDSFSV NFNISK+NYSG WD  VTVQNPNHKLN++ ERI+SFVD+ DNT+A S++DPFFLD+EKS QM+VK+TSSSPDDPGNW
Subjt:  SIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNW

Query:  AATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMIC
        A TEEKL +ER TGTVSF+LR  AWTTFR  SGS WTRRV++RVFCEDLKLVF G    + VYSP A    C
Subjt:  AATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMIC

XP_038905898.1 uncharacterized protein LOC120091828 [Benincasa hispida]4.4e-14485.58Show/hide
Query:  SLSLSPNPFTEREKNQKQSIPFQMASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYA
        SLSLS +   ERE NQKQSIP QMASSSDD QSQSKATDPPP PPPSAGNNPPPVYPPPTLGYPPP GH YPPAMGYPPAPHPGYPPAPGNYPPYN YYA
Subjt:  SLSLSPNPFTEREKNQKQSIPFQMASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYA

Query:  QAPPAAYYNNPQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSF
        QAPPAAYYNN QNYRAETV  GF+RGIVTALIL VAIMTLSSI+TWIILRPEIPVFR+DSFSV NFNISK+NYSGNWDGN+TVQNPNH+LNVN+ER+QSF
Subjt:  QAPPAAYYNNPQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSF

Query:  VDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANA
        VD+KDNTLAMSY DPFFLDVEKS QMRVKLTSSSPDDPG+WA TE+KLGQE+ TGTVSF+LRF AWTTFR GSWWTRRVV+RVFCEDLKLVFAGPAA   
Subjt:  VDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANA

Query:  VYSPDAHPMICA
        VYSP+ +P IC+
Subjt:  VYSPDAHPMICA

TrEMBL top hitse value%identityAlignment
A0A0A0LGS8 Uncharacterized protein1.5e-13483.74Show/hide
Query:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVKAGF
        MASSS+DQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPPHGHGY PAMGYPP P PGYPPAPGNYPPYN YYAQAPPAAYYNNPQNYRA+TV AGF
Subjt:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVKAGF

Query:  IRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKS
        +RGIVTALILLVA+MTLSSIITWI+LRP+IPVF+VDSFSV+NFNISK NYSGNW+G++TV+NPNHKL VN+ERIQSFV++K+NTLAMSYADPFF+DVEKS
Subjt:  IRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKS

Query:  SQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICA
        SQMRVKLTSSSPDDPGNW  TEEK+GQE+ +GTVSF+LRFFAWT FRSGSWWTRR+VM+VFCEDLKL F GPAA + VY  DAH   C+
Subjt:  SQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICA

A0A1S3B6W4 uncharacterized protein LOC1034866741.7e-13385.52Show/hide
Query:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVKAG
        MASSS+DQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNNPQNYRA T+ AG
Subjt:  MASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVKAG

Query:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK
        F+RGIV ALILLVAIMTLSSIITWIILRPE+PVF+VDSFSV+NFNISK NYSGNWD +VTVQNPNHKLNVN+ERIQSFVD+K NTLAMSYADPFFLDVEK
Subjt:  FIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEK

Query:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICA
        S QM+VKLTSSSPDDPGNW  TEEKLG+ER TGTVSF+LRFFAWTTFR+GSWWTRRVVMRV CED+KLVF GPAA +AVY  D H   C+
Subjt:  SSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICA

A0A5A7TLT1 Protein YLS93.4e-13483.11Show/hide
Query:  REKNQKQSIPFQMASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNN
        RE   +    F MASSS+DQQSQSKATDPPPP P SAGNNPPPVYPPPTLGYPPP GH GY PAMGYPPAPHP YPPA GNYPPYNAYYAQAPPAAYYNN
Subjt:  REKNQKQSIPFQMASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGH-GYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNN

Query:  PQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAM
        PQNYRA T+ AGF+RGIV ALILLVAIMTLSSIITWIILRPE+PVF+VDSFSV+NFNISK NYSGNWD +VTVQNPNHKLNVN+ERIQSFVD+K NTLAM
Subjt:  PQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAM

Query:  SYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMI
        SYADPFFLDVEKS QM+VKLTSSSPDDPGNW  TEEKLG+ER TGTVSF+LRFFAWTTFR+GSWWTRRVVMRV CED+KLVF GPAA +AVY  D H   
Subjt:  SYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMI

Query:  CA
        C+
Subjt:  CA

A0A6J1F415 uncharacterized protein LOC1114421883.2e-12470.43Show/hide
Query:  KTEVEDDNVRTFRHFSKKSFNGSPYSKSFSI---YPQTPLLSHSSLSLSPN-----PFTEREKNQKQSIPFQMASSSDDQ---QSQSKATDPPPPPPPSA
        KTEVEDD VR  R  SKKSF G PYS S +        P+    SLSLS +        EREK  KQ   FQMASSS DQ   QSQSK TDPPPP PPSA
Subjt:  KTEVEDDNVRTFRHFSKKSFNGSPYSKSFSI---YPQTPLLSHSSLSLSPN-----PFTEREKNQKQSIPFQMASSSDDQ---QSQSKATDPPPPPPPSA

Query:  GNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY------NNPQNYRAETVKAGFIRGIVTALILLVAIMTLS
        GNNPPP+YPPPTLGY PPH HGYPPAMGYPPAPHPGYPPAPGNYPPYNAY Y QAPPAAYY      NNPQ YR ET  AGF+RGI  AL+LLV IMT+S
Subjt:  GNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYY------NNPQNYRAETVKAGFIRGIVTALILLVAIMTLS

Query:  SIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNW
        SIITWIILRPEIP F+VDSFSV NFNISK+NYSG WD  VTVQNPNHKLN++ ERI+SFVD+ DNT+A S++DPFFLD+EKS QM+VK+TSSSPDDPGNW
Subjt:  SIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNW

Query:  AATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMIC
        A TEEKL +ER TGTVSF+LR  AWTTFR  SGS WTRRV++RVFCEDLKLVF G    + VYSP A    C
Subjt:  AATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMIC

A0A6J1J6I9 uncharacterized protein LOC1114816754.4e-11873.72Show/hide
Query:  PLLSHSSLSLSPN-PFTEREKNQKQSIPFQMASSSDDQ---QSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPG
        P+    SLSLS      EREK  KQ   FQMASSS DQ   QSQSK TDPPPP PPSAGNNPPP+YPPPTLGY PPH HGYPPAMGYPPAPHPGYPPAPG
Subjt:  PLLSHSSLSLSPN-PFTEREKNQKQSIPFQMASSSDDQ---QSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGYPPAPHPGYPPAPG

Query:  NYPPYNAY-YAQAPPAAYY-------NNPQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVT
        NYPPYNAY Y QAPPAAYY       NNPQ YR ET  AGF+RGI  AL+LLV IMT+SSIITWIILRPEIP F+VDSFSVANFNISK+NYSG WD  VT
Subjt:  NYPPYNAY-YAQAPPAAYY-------NNPQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVT

Query:  VQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVV
        VQNPNHKLN++ ERI+SFVD+ DNT+A S++DPFFLD+EKS QM VK+TSSSPDDPGNW  TEEKL +ER TGTVSF+LR  AWTTFR  SGS WTRRV+
Subjt:  VQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFR--SGSWWTRRVV

Query:  MRVFCEDLKLVFAGPAAANAVYSPDAHPMIC
        +RVFCEDLKLVF G    + VYSP AHP  C
Subjt:  MRVFCEDLKLVFAGPAAANAVYSPDAHPMIC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.5e-1430.8Show/hide
Query:  PTLGYPPPHGH-----GYPPAMGYP-PAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPE
        P  GYP P+ +       PP  GYP PA    YP     Y  +N YYA  P         N RA  ++  FI  + T  +LL+ ++     I ++I+RP+
Subjt:  PTLGYPPPHGH-----GYPPAMGYP-PAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPE

Query:  IPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQER
        +P   ++S SV+NFN+S    SG WD  +  +NPN K++++ E     + +   +L+ +   PF    +  + +   L+ S     G      + +G+ER
Subjt:  IPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQER

Query:  -MTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDL
         + G V F LR  ++ TFR G++  RR V  V+C+D+
Subjt:  -MTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDL

AT3G52460.1 hydroxyproline-rich glycoprotein family protein1.3e-4039.26Show/hide
Query:  QQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGY-----PPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYYNNPQNYRAETV-----
        Q S+     PPPPPP S    PPP         P      YPP MGY     PP P+P YP A     PY  Y YAQAPPA+YY +    +   V     
Subjt:  QQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYPPPHGHGYPPAMGY-----PPAPHPGYPPAPGNYPPYNAY-YAQAPPAAYYNNPQNYRAETV-----

Query:  KAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDF-----KDNTLAMSYAD
         +GF+RGI T LI+LV ++ +S+ ITW++LRP+IP+F V++FSV+NFN++   +S  W  N+T++N N KL    +RIQ  V       +D  LA ++  
Subjt:  KAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDF-----KDNTLAMSYAD

Query:  PFFLDVEKSSQMRVKLTSSSPDDP--GNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMI
        P F++ +KS  +   LT+   + P   +W   E K  +ER TGTV+FSLR   W TF++  W  R   ++VFC  LK+ F G +   AV  P   P +
Subjt:  PFFLDVEKSSQMRVKLTSSSPDDP--GNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMI

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family9.8e-0923.84Show/hide
Query:  IVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANY-SGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQ
        I   ++ L+ +  +  +ITW+  +P+   + V++ SV NFN++  N+ S  +   +   NPNH+++V    ++ FV FKD TLA    +PF        +
Subjt:  IVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNISKANY-SGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQ

Query:  MRVKLTSSS--PDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGP
        M VK    +   ++     +  + L  +   G + F +   A   F+ G W +     ++ C  + +  + P
Subjt:  MRVKLTSSS--PDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSWWTRRVVMRVFCEDLKLVFAGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AAAACAGAGGTTGAAGATGACAATGTCCGAACCTTCCGTCATTTTTCTAAGAAAAGCTTTAACGGATCTCCATATTCTAAATCCTTCTCAATTTACCCTCAAACG
CCTCTCCTTTCCCATTCCTCTCTCTCTCTCTCTCCAAATCCATTCACAGAGAGAGAGAAAAACCAAAAACAGAGCATACCCTTCCAAATGGCTTCCTCATCCGAC
GATCAACAATCCCAATCCAAGGCCACTGACCCACCTCCCCCGCCGCCGCCCTCTGCCGGAAACAACCCCCCTCCTGTCTACCCACCGCCCACATTGGGGTACCCT
CCTCCTCACGGTCATGGGTACCCTCCGGCGATGGGGTATCCTCCAGCCCCACATCCAGGGTACCCACCGGCGCCGGGGAATTACCCTCCTTATAACGCGTACTAT
GCTCAGGCTCCCCCGGCGGCGTATTACAACAATCCCCAAAATTACAGGGCGGAGACGGTGAAGGCAGGATTCATCCGGGGGATTGTGACGGCGTTGATTCTGTTG
GTGGCTATAATGACTCTGTCCAGCATAATAACGTGGATCATCCTCCGCCCTGAAATCCCAGTGTTCAGAGTGGATTCATTCTCCGTGGCGAATTTCAACATCTCG
AAAGCGAATTACTCCGGCAACTGGGACGGGAATGTGACGGTCCAAAATCCGAACCATAAACTGAACGTGAATTTGGAGCGGATCCAGAGCTTCGTGGACTTCAAA
GACAACACATTGGCAATGTCTTATGCAGATCCATTTTTTCTGGACGTGGAGAAGAGCAGCCAAATGAGGGTGAAATTGACGTCGAGTAGCCCGGATGATCCGGGA
AATTGGGCGGCAACAGAGGAGAAGCTAGGGCAGGAGAGGATGACCGGAACGGTGAGTTTCAGTTTGAGATTCTTTGCTTGGACAACTTTCCGATCTGGATCTTGG
TGGACGAGGCGAGTTGTTATGAGAGTGTTCTGTGAGGATTTGAAGCTCGTCTTTGCCGGACCCGCCGCCGCTAATGCCGTTTATTCCCCGGACGCCCACCCCATG
ATTTGCGCGTGTCGTATCTATTTGTCCTCTTCAGATAAGAATTTGATTTATTTG
mRNA sequenceShow/hide mRNA sequence
AAAACAGAGGTTGAAGATGACAATGTCCGAACCTTCCGTCATTTTTCTAAGAAAAGCTTTAACGGATCTCCATATTCTAAATCCTTCTCAATTTACCCTCAAACG
CCTCTCCTTTCCCATTCCTCTCTCTCTCTCTCTCCAAATCCATTCACAGAGAGAGAGAAAAACCAAAAACAGAGCATACCCTTCCAAATGGCTTCCTCATCCGAC
GATCAACAATCCCAATCCAAGGCCACTGACCCACCTCCCCCGCCGCCGCCCTCTGCCGGAAACAACCCCCCTCCTGTCTACCCACCGCCCACATTGGGGTACCCT
CCTCCTCACGGTCATGGGTACCCTCCGGCGATGGGGTATCCTCCAGCCCCACATCCAGGGTACCCACCGGCGCCGGGGAATTACCCTCCTTATAACGCGTACTAT
GCTCAGGCTCCCCCGGCGGCGTATTACAACAATCCCCAAAATTACAGGGCGGAGACGGTGAAGGCAGGATTCATCCGGGGGATTGTGACGGCGTTGATTCTGTTG
GTGGCTATAATGACTCTGTCCAGCATAATAACGTGGATCATCCTCCGCCCTGAAATCCCAGTGTTCAGAGTGGATTCATTCTCCGTGGCGAATTTCAACATCTCG
AAAGCGAATTACTCCGGCAACTGGGACGGGAATGTGACGGTCCAAAATCCGAACCATAAACTGAACGTGAATTTGGAGCGGATCCAGAGCTTCGTGGACTTCAAA
GACAACACATTGGCAATGTCTTATGCAGATCCATTTTTTCTGGACGTGGAGAAGAGCAGCCAAATGAGGGTGAAATTGACGTCGAGTAGCCCGGATGATCCGGGA
AATTGGGCGGCAACAGAGGAGAAGCTAGGGCAGGAGAGGATGACCGGAACGGTGAGTTTCAGTTTGAGATTCTTTGCTTGGACAACTTTCCGATCTGGATCTTGG
TGGACGAGGCGAGTTGTTATGAGAGTGTTCTGTGAGGATTTGAAGCTCGTCTTTGCCGGACCCGCCGCCGCTAATGCCGTTTATTCCCCGGACGCCCACCCCATG
ATTTGCGCGTGTCGTATCTATTTGTCCTCTTCAGATAAGAATTTGATTTATTTG
Protein sequenceShow/hide protein sequence
KTEVEDDNVRTFRHFSKKSFNGSPYSKSFSIYPQTPLLSHSSLSLSPNPFTEREKNQKQSIPFQMASSSDDQQSQSKATDPPPPPPPSAGNNPPPVYPPPTLGYP
PPHGHGYPPAMGYPPAPHPGYPPAPGNYPPYNAYYAQAPPAAYYNNPQNYRAETVKAGFIRGIVTALILLVAIMTLSSIITWIILRPEIPVFRVDSFSVANFNIS
KANYSGNWDGNVTVQNPNHKLNVNLERIQSFVDFKDNTLAMSYADPFFLDVEKSSQMRVKLTSSSPDDPGNWAATEEKLGQERMTGTVSFSLRFFAWTTFRSGSW
WTRRVVMRVFCEDLKLVFAGPAAANAVYSPDAHPMICACRIYLSSSDKNLIYL