; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g29570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g29570
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationchr5:21987511..21988557
RNA-Seq ExpressionMoc05g29570
SyntenyMoc05g29570
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043818.1 protein YLS9 [Cucumis melo var. makuwa]9.0e-9164.77Show/hide
Query:  PVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILV
        P  H    G  PPP  P P TL  P       H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++AGF+RGIV+ALIL+
Subjt:  PVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILV

Query:  VLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSS
        V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDVEKS +M+V+L SSS
Subjt:  VLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSS

Query:  PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        PDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

XP_008442912.1 PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo]9.0e-9164.77Show/hide
Query:  PVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILV
        P  H    G  PPP  P P TL  P       H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++AGF+RGIV+ALIL+
Subjt:  PVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILV

Query:  VLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSS
        V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDVEKS +M+V+L SSS
Subjt:  VLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSS

Query:  PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        PDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

XP_011652032.1 uncharacterized protein LOC105434983 [Cucumis sativus]2.2e-8955Show/hide
Query:  NVRTEHLLSFSKKSHSHFPF--------------LNLN-PLQFLTQRERKKKTKK-----------KTEIDRLIDPPWLLRRGINNLNPRPVSHRPLRGP
        NVRT     FSKKS +  P+              L+L+  L   + RER+K T +           +    +  DPP        N NP PV   P  G 
Subjt:  NVRTEHLLSFSKKSHSHFPF--------------LNLN-PLQFLTQRERKKKTKK-----------KTEIDRLIDPPWLLRRGINNLNPRPVSHRPLRGP

Query:  PPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEI
        PPP    +   +  P      P PGYPPA G  +Y   PPYN Y YAQAPPAAYY+  QNY A+ V+AGF+RGIV+ALIL+V ++TLSSIITWI+LRP+I
Subjt:  PPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEI

Query:  PIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERA
        P+FKV+SFSV NFNISK NYSG+W  ++ VENPN KL +N ERIQSFV++KE+TLAMS+ DPFF+DVEKS++MRV+L SSSPDDPGNW + E+K+G+E+A
Subjt:  PIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERA

Query:  AGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        +GTV FNLRF AWT FRSGSWWTRR++M++FCEDLKLAF GPAA    ++AD+H KTCSV
Subjt:  AGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

XP_023539989.1 uncharacterized protein LOC111800503 [Cucurbita pepo subsp. pepo]4.5e-8259.93Show/hide
Query:  LNPRPVSHRPLRGPP--PPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGIVSALILVVL
        L P   +H P   PP  PPPT  +P            PHPGYPPA      GA PPYNGYAYAQAPPAAYYH   QNY  EP +A FIRGIV+ALI++V+
Subjt:  LNPRPVSHRPLRGPP--PPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGIVSALILVVL

Query:  LLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPD
        L+ LSSIITWI+LRPEIP F+V++  V NFNISKSNYSG+W A + V+NPN+KLNL F+RIQ FV +K++TLAMSF DPFFL VE++  MRVR  SSSPD
Subjt:  LLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPD

Query:  DPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        DPGNW + E+K+G+E+A   VGFNLRF  WTTF+SGSWWTR VI+R+FC+DLK+ F  P + +  F A  H   C+V
Subjt:  DPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

XP_038905898.1 uncharacterized protein LOC120091828 [Benincasa hispida]2.4e-9160.32Show/hide
Query:  PLQFLTQRERKKKTKKKTEIDRLIDPPWLLRRGINNLNPRPVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAP
        PLQ  +  +  +   K T      DPP +      N NP PV   P  G PPP    +P  +  P A    PHPGYPPA G  +Y   PPYN Y YAQAP
Subjt:  PLQFLTQRERKKKTKKKTEIDRLIDPPWLLRRGINNLNPRPVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAP

Query:  PAAYYHG-QNYPAEPVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDF
        PAAYY+  QNY AE VN GF+RGIV+ALIL V ++TLSSI+TWI+LRPEIP+F+++SFSV NFNISKSNYSG+W+  + V+NPN +LN+N ER+QSFVD+
Subjt:  PAAYYHG-QNYPAEPVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDF

Query:  KEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFV
        K++TLAMS+GDPFFLDVEKS +MRV+L SSSPDDPG+WA+ EDK+G+E+A GTV FNLRF+AWTTFR GSWWTRRV++R+FCEDLKL FAGPAA    + 
Subjt:  KEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFV

Query:  ADSHPKTCSV
         + +PK CSV
Subjt:  ADSHPKTCSV

TrEMBL top hitse value%identityAlignment
A0A0A0LGS8 Uncharacterized protein1.8e-8964.47Show/hide
Query:  NPRPVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILVVLLLTL
        NP PV   P  G PPP    +   +  P      P PGYPPA G  +Y   PPYN Y YAQAPPAAYY+  QNY A+ V+AGF+RGIV+ALIL+V ++TL
Subjt:  NPRPVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILVVLLLTL

Query:  SSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGN
        SSIITWI+LRP+IP+FKV+SFSV NFNISK NYSG+W  ++ VENPN KL +N ERIQSFV++KE+TLAMS+ DPFF+DVEKS++MRV+L SSSPDDPGN
Subjt:  SSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGN

Query:  WADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        W + E+K+G+E+A+GTV FNLRF AWT FRSGSWWTRR++M++FCEDLKLAF GPAA    ++AD+H KTCSV
Subjt:  WADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

A0A1S3B6W4 uncharacterized protein LOC1034866744.4e-9164.77Show/hide
Query:  PVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILV
        P  H    G  PPP  P P TL  P       H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++AGF+RGIV+ALIL+
Subjt:  PVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILV

Query:  VLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSS
        V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDVEKS +M+V+L SSS
Subjt:  VLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSS

Query:  PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        PDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

A0A5A7TLT1 Protein YLS94.4e-9164.77Show/hide
Query:  PVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILV
        P  H    G  PPP  P P TL  P       H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++AGF+RGIV+ALIL+
Subjt:  PVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNAGFIRGIVSALILV

Query:  VLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSS
        V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDVEKS +M+V+L SSS
Subjt:  VLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSS

Query:  PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        PDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

A0A6J1FNP1 uncharacterized protein LOC1114471068.3e-8260.22Show/hide
Query:  PPPPPTTPH-------PSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGIVSALILVVLLLTLSSII
        P PPP+  H       P T+  P A    PHPGYPPA      GA PPYNGYAYAQAPPAAYYH   QNY  EP +A FIRGIV+ALI++V+L+ L+SII
Subjt:  PPPPPTTPH-------PSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGIVSALILVVLLLTLSSII

Query:  TWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADI
        TWI+LRPEIP F+V++  V NFNISKSNYSG+W A + V+NPN+KLNL F+RIQ FV +K++TLAMSF DPFFL VE++  MRVR  SSSPDDPGNW + 
Subjt:  TWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADI

Query:  EDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        E+K+G+E+A   V FNLRF  WTTF+SGSWWTR VI+R+FC+DLK+ F  P + +  F A  H   C+V
Subjt:  EDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

A0A6J1J6I9 uncharacterized protein LOC1114816757.0e-8158.59Show/hide
Query:  DPPWLLRRGINNLNPRPVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHGQN--------YPAEPVN
        DPP  L     N NP P+   P  G  PP    +P  +  P A    PHPGYPPA G  +Y   PPYN YAY QAPPAAYY+  N        Y  E   
Subjt:  DPPWLLRRGINNLNPRPVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHGQN--------YPAEPVN

Query:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV
        AGF+RGI +AL+L+V+++T+SSIITWI+LRPEIP FKV+SFSV NFNISKSNYSG W+  V V+NPN KLNL+FERI+SFVD+ ++T+A SF DPFFLD+
Subjt:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV

Query:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFR--SGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        EKS +M V++ SSSPDDPGNW   E+K+ RERA GTV F LR LAWTTFR  SGS WTRRVI+R+FCEDLKL F G    D  +   +HPKTC V V
Subjt:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFR--SGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G27080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family8.1e-0524.77Show/hide
Query:  YPHYGAPPPY-------NGYAYAQAPPAAYYHGQNYPAEPVNAGFIR----GIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNI-SKSNY
        +P   APPP            Y   PP   +  +    +  N    R      ++A+ ++++L  +S  + +++ RPE P + +E FSV   N+ S S  
Subjt:  YPHYGAPPPY-------NGYAYAQAPPAAYYHGQNYPAEPVNAGFIR----GIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNI-SKSNY

Query:  SGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFG-DPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSG
        S S+   V   N N K+ + +E+ +S VD   + + +S G  P F    K+  +   ++S S       + +  +M  E +  TV F L+  A    + G
Subjt:  SGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFG-DPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSG

Query:  SWWTRRVIMRIFCE
        S  T  +I+ + C+
Subjt:  SWWTRRVIMRIFCE

AT2G27080.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family8.1e-0524.77Show/hide
Query:  YPHYGAPPPY-------NGYAYAQAPPAAYYHGQNYPAEPVNAGFIR----GIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNI-SKSNY
        +P   APPP            Y   PP   +  +    +  N    R      ++A+ ++++L  +S  + +++ RPE P + +E FSV   N+ S S  
Subjt:  YPHYGAPPPY-------NGYAYAQAPPAAYYHGQNYPAEPVNAGFIR----GIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNI-SKSNY

Query:  SGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFG-DPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSG
        S S+   V   N N K+ + +E+ +S VD   + + +S G  P F    K+  +   ++S S       + +  +M  E +  TV F L+  A    + G
Subjt:  SGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFG-DPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSG

Query:  SWWTRRVIMRIFCE
        S  T  +I+ + C+
Subjt:  SWWTRRVIMRIFCE

AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.7e-1830.49Show/hide
Query:  PHPGYPPAMGYPH-YGAPPPYNGYAYAQAPPAAYYHGQN--YPAEP-VNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKS
        P  GYP    YP+     PP NGY    A  A  Y   N  Y  +P   A  IR +       +LLL L   I ++++RP++P   + S SV NFN+S +
Subjt:  PHPGYPPAMGYPH-YGAPPPYNGYAYAQAPPAAYYHGQN--YPAEP-VNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKS

Query:  NYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWAD--IEDKMGRERAA-GTVGFNLRFLAWTT
          SG W+  +   NPN K++L++E     + +   +L+ +   PF    +  T +   L  S     G + D  + D +G+ER+  G V F+LR +++ T
Subjt:  NYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWAD--IEDKMGRERAA-GTVGFNLRFLAWTT

Query:  FRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVY
        FR G++  RR +  ++C+D+ +     ++ + K V  S  K C  Y
Subjt:  FRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVY

AT3G52460.1 hydroxyproline-rich glycoprotein family protein9.4e-4638.78Show/hide
Query:  PRPVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPP---------PYNGYAYAQAPPAAYYHGQNYPAE-------PVNAGFIRG
        P   S R +  PPPPP    P   +    T       YPP MGYP Y  PP         PY  Y YAQAPPA+YY G +YPA+       P ++GF+RG
Subjt:  PRPVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPHPGYPPAMGYPHYGAPP---------PYNGYAYAQAPPAAYYHGQNYPAE-------PVNAGFIRG

Query:  IVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDF-----KEHTLAMSFGDPFFLDVE
        I + LI++V+LL +S+ ITW++LRP+IP+F V +FSV NFN++   +S  W A + +EN N KL   F+RIQ  V       ++  LA +F  P F++ +
Subjt:  IVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDF-----KEHTLAMSFGDPFFLDVE

Query:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        KS  +   L +   + P   + + D+M +ER  GTV F+LR   W TF++  W  R   +++FC  LK+ F G +   A  V    P  C VYV
Subjt:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.8e-1123.91Show/hide
Query:  PAEPV-NAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNY-SGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSF
        PA+P+     I  I   ++ ++ +  +  +ITW+  +P+   + VE+ SV NFN++  N+ S +++  +   NPN ++++ +  ++ FV FK+ TLA   
Subjt:  PAEPV-NAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNY-SGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSF

Query:  GDPFFLDVEKSTKMRVRLISSS-PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGP
         +PF        ++   LI+ +      N  D+      + + G +GF +   A   F+ G W +     +I C  + ++ + P
Subjt:  GDPFFLDVEKSTKMRVRLISSS-PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCCGAATGGAGGTTGAAGAAAAGAGAAGTCATTAATGTCCGAACAGAACACCTTCTGTCATTTTCTAAGAAAAGCCATTCCCATTTCCCATTTCTAAACCTAAA
CCCCCTCCAATTCCTCACACAGAGAGAGAGGAAAAAAAAAACAAAAAAAAAAACAGAGATCGATCGATTGATCGATCCGCCATGGCTTCTTCGTCGGGGGATCAACAATC
TCAATCCAAGGCCGGTGAGCCACCGTCCCCTCCGCGGTCCTCCTCCGCCGCCAACAACCCCCCACCCATCTACCCTCCGCCGTCCGTCGGCTACCCTCCGGGGGCCCCAC
CCGGGCTACCCGCCGGCAATGGGGTACCCCCATTACGGGGCCCCGCCGCCGTACAACGGCTACGCGTACGCCCAGGCCCCTCCGGCGGCGTACTACCACGGGCAGAATTA
CCCGGCGGAGCCGGTGAACGCGGGATTCATCCGCGGGATTGTGTCGGCCCTGATTCTGGTGGTGCTGTTGCTGACCCTGAGCAGCATAATCACGTGGATCATGCTCCGAC
CCGAGATCCCAATCTTCAAAGTGGAATCCTTCTCGGTGGGGAATTTCAACATCTCGAAATCGAATTACTCCGGCAGCTGGGAGGCGGCGGTGGGGGTGGAGAATCCGAAC
CGGAAACTGAATCTGAATTTCGAGCGGATCCAGAGCTTCGTGGATTTCAAAGAACACACGCTGGCGATGTCGTTTGGGGACCCGTTTTTCCTGGACGTGGAGAAGAGCAC
CAAAATGCGGGTGAGATTGATCTCGAGCAGCCCCGATGATCCCGGGAATTGGGCCGACATAGAGGACAAGATGGGCCGGGAGCGGGCCGCCGGAACTGTGGGCTTCAATT
TGAGATTCTTGGCCTGGACCACTTTCCGGTCTGGGTCATGGTGGACCAGGCGGGTCATTATGAGGATTTTCTGTGAGGATTTGAAGCTTGCCTTCGCCGGACCCGCCGCA
CGCGACGCCAAGTTCGTCGCCGATTCCCACCCCAAGACTTGTTCCGTTTATGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCCGAATGGAGGTTGAAGAAAAGAGAAGTCATTAATGTCCGAACAGAACACCTTCTGTCATTTTCTAAGAAAAGCCATTCCCATTTCCCATTTCTAAACCTAAA
CCCCCTCCAATTCCTCACACAGAGAGAGAGGAAAAAAAAAACAAAAAAAAAAACAGAGATCGATCGATTGATCGATCCGCCATGGCTTCTTCGTCGGGGGATCAACAATC
TCAATCCAAGGCCGGTGAGCCACCGTCCCCTCCGCGGTCCTCCTCCGCCGCCAACAACCCCCCACCCATCTACCCTCCGCCGTCCGTCGGCTACCCTCCGGGGGCCCCAC
CCGGGCTACCCGCCGGCAATGGGGTACCCCCATTACGGGGCCCCGCCGCCGTACAACGGCTACGCGTACGCCCAGGCCCCTCCGGCGGCGTACTACCACGGGCAGAATTA
CCCGGCGGAGCCGGTGAACGCGGGATTCATCCGCGGGATTGTGTCGGCCCTGATTCTGGTGGTGCTGTTGCTGACCCTGAGCAGCATAATCACGTGGATCATGCTCCGAC
CCGAGATCCCAATCTTCAAAGTGGAATCCTTCTCGGTGGGGAATTTCAACATCTCGAAATCGAATTACTCCGGCAGCTGGGAGGCGGCGGTGGGGGTGGAGAATCCGAAC
CGGAAACTGAATCTGAATTTCGAGCGGATCCAGAGCTTCGTGGATTTCAAAGAACACACGCTGGCGATGTCGTTTGGGGACCCGTTTTTCCTGGACGTGGAGAAGAGCAC
CAAAATGCGGGTGAGATTGATCTCGAGCAGCCCCGATGATCCCGGGAATTGGGCCGACATAGAGGACAAGATGGGCCGGGAGCGGGCCGCCGGAACTGTGGGCTTCAATT
TGAGATTCTTGGCCTGGACCACTTTCCGGTCTGGGTCATGGTGGACCAGGCGGGTCATTATGAGGATTTTCTGTGAGGATTTGAAGCTTGCCTTCGCCGGACCCGCCGCA
CGCGACGCCAAGTTCGTCGCCGATTCCCACCCCAAGACTTGTTCCGTTTATGTCTAA
Protein sequenceShow/hide protein sequence
MAPEWRLKKREVINVRTEHLLSFSKKSHSHFPFLNLNPLQFLTQRERKKKTKKKTEIDRLIDPPWLLRRGINNLNPRPVSHRPLRGPPPPPTTPHPSTLRRPSATLRGPH
PGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHGQNYPAEPVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPN
RKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAA
RDAKFVADSHPKTCSVYV