CuGenDBv2

MS019449 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS019449
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Hydroxyproline-rich glycoprotein family protein
Genome location: scaffold28:594154..595005
RNA-Seq Expression: MS019449
Synteny: MS019449
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
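
The genome location string in the record above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(loc):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(loc):
    """Length of the span in bp, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(loc)
    return end - start + 1


print(parse_location("scaffold28:594154..595005"))  # ('scaffold28', 594154, 595005)
print(span_length("scaffold28:594154..595005"))     # 852
```

If the coordinates were instead 0-based and half-open, the `+ 1` would have to be dropped.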


Homology
GenBank top hits (e value, % identity, alignment)
KAA0043818.1 protein YLS9 [Cucumis melo var. makuwa] (e value: 7.6e-104; identity: 69.15%)
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP   H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN

Query:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV
        AGF+RGIV+ALIL+V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDV
Subjt:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV

Query:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        EKS +M+V+L SSSPDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

XP_008442912.1 PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo] (e value: 7.6e-104; identity: 69.15%)
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP   H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN

Query:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV
        AGF+RGIV+ALIL+V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDV
Subjt:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV

Query:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        EKS +M+V+L SSSPDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

XP_011652032.1 uncharacterized protein LOC105434983 [Cucumis sativus] (e value: 2.4e-102; identity: 68.15%)
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNA
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP    GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A+ V+A
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNA

Query:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE
        GF+RGIV+ALIL+V ++TLSSIITWI+LRP+IP+FKV+SFSV NFNISK NYSG+W  ++ VENPN KL +N ERIQSFV++KE+TLAMS+ DPFF+DVE
Subjt:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE

Query:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        KS++MRV+L SSSPDDPGNW + E+K+G+E+A+GTV FNLRF AWT FRSGSWWTRR++M++FCEDLKLAF GPAA    ++AD+H KTCSV
Subjt:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

XP_023539989.1 uncharacterized protein LOC111800503 [Cucurbita pepo subsp. pepo] (e value: 6.0e-93; identity: 64.56%)
Query:  MASSSGDQQSQSKAGE-PPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGIV
        MASSS +QQS+SK+ +  P PP   SAA+NPPPIYPPP++GYPP PHPGYPPA      GA PPYNGYAYAQAPPAAYYH   QNY  EP +A FIRGIV
Subjt:  MASSSGDQQSQSKAGE-PPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGIV

Query:  SALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRV
        +ALI++V+L+ LSSIITWI+LRPEIP F+V++  V NFNISKSNYSG+W A + V+NPN+KLNL F+RIQ FV +K++TLAMSF DPFFL VE++  MRV
Subjt:  SALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRV

Query:  RLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        R  SSSPDDPGNW + E+K+G+E+A   VGFNLRF  WTTF+SGSWWTR VI+R+FC+DLK+ F  P + +  F A  H   C+V
Subjt:  RLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

XP_038905898.1 uncharacterized protein LOC120091828 [Benincasa hispida] (e value: 1.9e-99; identity: 67.47%)
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGY---PHYGAP------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNA
        MASSS D QSQSKA +PP  P   SA NNPPP+YPPP++GYPP     YPPAMGY   PH G P      PPYN Y YAQAPPAAYY+  QNY AE VN 
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGY---PHYGAP------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNA

Query:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE
        GF+RGIV+ALIL V ++TLSSI+TWI+LRPEIP+F+++SFSV NFNISKSNYSG+W+  + V+NPN +LN+N ER+QSFVD+K++TLAMS+GDPFFLDVE
Subjt:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE

Query:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        KS +MRV+L SSSPDDPG+WA+ EDK+G+E+A GTV FNLRF+AWTTFR GSWWTRRV++R+FCEDLKL FAGPAA    +  + +PK CSV
Subjt:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LGS8 Uncharacterized protein (e value: 1.2e-102; identity: 68.15%)
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNA
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP    GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A+ V+A
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNA

Query:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE
        GF+RGIV+ALIL+V ++TLSSIITWI+LRP+IP+FKV+SFSV NFNISK NYSG+W  ++ VENPN KL +N ERIQSFV++KE+TLAMS+ DPFF+DVE
Subjt:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE

Query:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        KS++MRV+L SSSPDDPGNW + E+K+G+E+A+GTV FNLRF AWT FRSGSWWTRR++M++FCEDLKLAF GPAA    ++AD+H KTCSV
Subjt:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

A0A1S3B6W4 uncharacterized protein LOC103486674 (e value: 3.7e-104; identity: 69.15%)
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP   H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN

Query:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV
        AGF+RGIV+ALIL+V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDV
Subjt:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV

Query:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        EKS +M+V+L SSSPDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

A0A5A7TLT1 Protein YLS9 (e value: 3.7e-104; identity: 69.15%)
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP   H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN

Query:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV
        AGF+RGIV+ALIL+V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDV
Subjt:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV

Query:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        EKS +M+V+L SSSPDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

A0A6J1FNP1 uncharacterized protein LOC111447106 (e value: 6.1e-91; identity: 63.16%)
Query:  MASSSGDQQSQSKAGEPPSPP-RSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGIV
        MASSS +QQS+SK+    S P    SAA+NP PIYPPP++GYPP PHPGYPPA      GA PPYNGYAYAQAPPAAYYH   QNY  EP +A FIRGIV
Subjt:  MASSSGDQQSQSKAGEPPSPP-RSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGIV

Query:  SALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRV
        +ALI++V+L+ L+SIITWI+LRPEIP F+V++  V NFNISKSNYSG+W A + V+NPN+KLNL F+RIQ FV +K++TLAMSF DPFFL VE++  MRV
Subjt:  SALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRV

Query:  RLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        R  SSSPDDPGNW + E+K+G+E+A   V FNLRF  WTTF+SGSWWTR VI+R+FC+DLK+ F  P + +  F A  H   C+V
Subjt:  RLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

A0A6J1J6I9 uncharacterized protein LOC111481675 (e value: 1.6e-91; identity: 63.4%)
Query:  MASSSGDQ---QSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGY---PHYGAP------PPYNGYAYAQAPPAAYYHGQN------
        MASSS DQ   QSQSK  +PP PP   SA NNPPPIYPPP++GYPP  H GYPPAMGY   PH G P      PPYN YAY QAPPAAYY+  N      
Subjt:  MASSSGDQ---QSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGY---PHYGAP------PPYNGYAYAQAPPAAYYHGQN------

Query:  --YPAEPVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMS
          Y  E   AGF+RGI +AL+L+V+++T+SSIITWI+LRPEIP FKV+SFSV NFNISKSNYSG W+  V V+NPN KLNL+FERI+SFVD+ ++T+A S
Subjt:  --YPAEPVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMS

Query:  FGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFR--SGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPK
        F DPFFLD+EKS +M V++ SSSPDDPGNW   E+K+ RERA GTV F LR LAWTTFR  SGS WTRRVI+R+FCEDLKL F G    D  +   +HPK
Subjt:  FGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFR--SGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPK

Query:  TCSVYV
        TC V V
Subjt:  TCSVYV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 3.1e-18; identity: 30.2%)
Query:  PSVGYP-----PGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHGQNYPAEPVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFS
        P+ GYP     P P    PP  GYP+     P  G AY       YY  Q  P     A  IR +       +LLL L   I ++++RP++P   + S S
Subjt:  PSVGYP-----PGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHGQNYPAEPVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFS

Query:  VGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWAD--IEDKMGRERAA-GTVGF
        V NFN+S +  SG W+  +   NPN K++L++E     + +   +L+ +   PF    +  T +   L  S     G + D  + D +G+ER+  G V F
Subjt:  VGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWAD--IEDKMGRERAA-GTVGF

Query:  NLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVY
        +LR +++ TFR G++  RR +  ++C+D+ +     ++ + K V  S  K C  Y
Subjt:  NLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVY

AT3G52460.1 hydroxyproline-rich glycoprotein family protein (e value: 2.3e-42; identity: 38.69%)
Query:  SSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP---HPGYPPAMGYPHYGAPP---------PYNGYAYAQAPPAAYYHGQNYPAE-----
        S  ++++Q K      P ++S    N PP  PPP    PP P      YPP MGYP Y  PP         PY  Y YAQAPPA+YY G +YPA+     
Subjt:  SSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP---HPGYPPAMGYPHYGAPP---------PYNGYAYAQAPPAAYYHGQNYPAE-----

Query:  --PVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDF-----KEHTLAM
          P ++GF+RGI + LI++V+LL +S+ ITW++LRP+IP+F V +FSV NFN++   +S  W A + +EN N KL   F+RIQ  V       ++  LA 
Subjt:  --PVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDF-----KEHTLAM

Query:  SFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKT
        +F  P F++ +KS  +   L +   + P   + + D+M +ER  GTV F+LR   W TF++  W  R   +++FC  LK+ F G +   A  V    P  
Subjt:  SFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKT

Query:  CSVYV
        C VYV
Subjt:  CSVYV

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 4.7e-11; identity: 23.91%)
Query:  PAEPV-NAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNY-SGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSF
        PA+P+     I  I   ++ ++ +  +  +ITW+  +P+   + VE+ SV NFN++  N+ S +++  +   NPN ++++ +  ++ FV FK+ TLA   
Subjt:  PAEPV-NAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNY-SGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSF

Query:  GDPFFLDVEKSTKMRVRLISSS-PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGP
         +PF        ++   LI+ +      N  D+      + + G +GF +   A   F+ G W +     +I C  + ++ + P
Subjt:  GDPFFLDVEKSTKMRVRLISSS-PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGP


Sequences
CDS sequence
ATGGCTTCTTCGTCGGGGGATCAACAATCTCAATCCAAGGCCGGTGAGCCACCGTCCCCTCCGCGGTCCTCCTCCGCCGCCAACAACCCCCCACCCATCTACCCTCCGCC
GTCCGTCGGCTACCCTCCGGGGCCCCACCCGGGCTACCCGCCGGCAATGGGGTACCCCCATTACGGGGCCCCGCCGCCGTACAACGGCTACGCGTACGCCCAGGCCCCTC
CGGCGGCGTACTACCACGGGCAGAATTACCCGGCGGAGCCGGTGAACGCGGGATTCATCCGCGGGATTGTGTCGGCCCTGATTCTGGTGGTGCTGTTGCTGACCCTGAGC
AGCATAATCACGTGGATCATGCTCCGACCCGAAATCCCAATCTTCAAAGTGGAATCCTTCTCGGTGGGGAATTTCAACATCTCGAAATCGAATTACTCCGGCAGCTGGGA
GGCGGCGGTGGGGGTGGAGAATCCGAACCGGAAACTGAATCTGAATTTCGAGCGGATCCAGAGCTTCGTGGATTTCAAAGAACACACGCTGGCGATGTCGTTTGGGGACC
CGTTTTTCCTGGACGTGGAGAAGAGCACCAAAATGCGGGTGAGATTGATCTCGAGCAGCCCCGATGATCCCGGGAATTGGGCCGACATCGAGGACAAGATGGGCCGGGAG
CGGGCCGCCGGAACTGTGGGCTTCAATTTGAGATTCTTGGCCTGGACCACATTCCGGTCTGGGTCATGGTGGACCAGGCGGGTCATTATGAGGATTTTCTGTGAGGATTT
GAAGCTTGCCTTCGCCGGACCCGCCGCACGCGACGCCAAGTTCGTCGCCGATTCCCACCCCAAGACTTGTTCCGTTTATGTC
mRNA sequence
ATGGCTTCTTCGTCGGGGGATCAACAATCTCAATCCAAGGCCGGTGAGCCACCGTCCCCTCCGCGGTCCTCCTCCGCCGCCAACAACCCCCCACCCATCTACCCTCCGCC
GTCCGTCGGCTACCCTCCGGGGCCCCACCCGGGCTACCCGCCGGCAATGGGGTACCCCCATTACGGGGCCCCGCCGCCGTACAACGGCTACGCGTACGCCCAGGCCCCTC
CGGCGGCGTACTACCACGGGCAGAATTACCCGGCGGAGCCGGTGAACGCGGGATTCATCCGCGGGATTGTGTCGGCCCTGATTCTGGTGGTGCTGTTGCTGACCCTGAGC
AGCATAATCACGTGGATCATGCTCCGACCCGAAATCCCAATCTTCAAAGTGGAATCCTTCTCGGTGGGGAATTTCAACATCTCGAAATCGAATTACTCCGGCAGCTGGGA
GGCGGCGGTGGGGGTGGAGAATCCGAACCGGAAACTGAATCTGAATTTCGAGCGGATCCAGAGCTTCGTGGATTTCAAAGAACACACGCTGGCGATGTCGTTTGGGGACC
CGTTTTTCCTGGACGTGGAGAAGAGCACCAAAATGCGGGTGAGATTGATCTCGAGCAGCCCCGATGATCCCGGGAATTGGGCCGACATCGAGGACAAGATGGGCCGGGAG
CGGGCCGCCGGAACTGTGGGCTTCAATTTGAGATTCTTGGCCTGGACCACATTCCGGTCTGGGTCATGGTGGACCAGGCGGGTCATTATGAGGATTTTCTGTGAGGATTT
GAAGCTTGCCTTCGCCGGACCCGCCGCACGCGACGCCAAGTTCGTCGCCGATTCCCACCCCAAGACTTGTTCCGTTTATGTC
Protein sequence
MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHGQNYPAEPVNAGFIRGIVSALILVVLLLTLS
SIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRE
RAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV