; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g1226 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g1226
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationMC05:16515187..16516038
RNA-Seq ExpressionMC05g1226
SyntenyMC05g1226
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043818.1 protein YLS9 [Cucumis melo var. makuwa]4.34e-12869.15Show/hide
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP   H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN

Query:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV
        AGF+RGIV+ALIL+V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDV
Subjt:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV

Query:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        EKS +M+V+L SSSPDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

XP_008442912.1 PREDICTED: uncharacterized protein LOC103486674 [Cucumis melo]2.13e-13269.15Show/hide
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP   H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN

Query:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV
        AGF+RGIV+ALIL+V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDV
Subjt:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV

Query:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        EKS +M+V+L SSSPDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

XP_011652032.1 uncharacterized protein LOC105434983 [Cucumis sativus]1.98e-12968.15Show/hide
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNA
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP    GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A+ V+A
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNA

Query:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE
        GF+RGIV+ALIL+V ++TLSSIITWI+LRP+IP+FKV+SFSV NFNISK NYSG+W  ++ VENPN KL +N ERIQSFV++KE+TLAMS+ DPFF+DVE
Subjt:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE

Query:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        KS++MRV+L SSSPDDPGNW + E+K+G+E+A+GTV FNLRF AWT FRSGSWWTRR++M++FCEDLKLAF GPAA    ++AD+H KTCSV
Subjt:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

XP_023539989.1 uncharacterized protein LOC111800503 [Cucurbita pepo subsp. pepo]1.87e-11864.34Show/hide
Query:  MASSSGDQQSQSKA--GEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGI
        MASSS +QQS+SK+   +P  PP   SAA+NPPPIYPPP++GYPP PHPGYPPA G     A PPYNGYAYAQAPPAAYYH   QNY  EP +A FIRGI
Subjt:  MASSSGDQQSQSKA--GEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGI

Query:  VSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMR
        V+ALI++V+L+ LSSIITWI+LRPEIP F+V++  V NFNISKSNYSG+W A + V+NPN+KLNL F+RIQ FV +K++TLAMSF DPFFL VE++  MR
Subjt:  VSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMR

Query:  VRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        VR  SSSPDDPGNW + E+K+G+E+A   VGFNLRF  WTTF+SGSWWTR VI+R+FC+DLK+ F  P + +  F A  H   C+V
Subjt:  VRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

XP_038905898.1 uncharacterized protein LOC120091828 [Benincasa hispida]1.01e-12567.47Show/hide
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYP---HYGAPP------PYNGYAYAQAPPAAYYHG-QNYPAEPVNA
        MASSS D QSQSKA +PP  P  S A NNPPP+YPPP++GYPP     YPPAMGYP   H G PP      PYN Y YAQAPPAAYY+  QNY AE VN 
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYP---HYGAPP------PYNGYAYAQAPPAAYYHG-QNYPAEPVNA

Query:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE
        GF+RGIV+ALIL V ++TLSSI+TWI+LRPEIP+F+++SFSV NFNISKSNYSG+W+  + V+NPN +LN+N ER+QSFVD+K++TLAMS+GDPFFLDVE
Subjt:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE

Query:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        KS +MRV+L SSSPDDPG+WA+ EDK+G+E+A GTV FNLRF+AWTTFR GSWWTRRV++R+FCEDLKL FAGPAA    +  + +PK CSV
Subjt:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

TrEMBL top hitse value%identityAlignment
A0A0A0LGS8 Uncharacterized protein9.47e-13168.15Show/hide
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNA
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP    GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A+ V+A
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVNA

Query:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE
        GF+RGIV+ALIL+V ++TLSSIITWI+LRP+IP+FKV+SFSV NFNISK NYSG+W  ++ VENPN KL +N ERIQSFV++KE+TLAMS+ DPFF+DVE
Subjt:  GFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVE

Query:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        KS++MRV+L SSSPDDPGNW + E+K+G+E+A+GTV FNLRF AWT FRSGSWWTRR++M++FCEDLKLAF GPAA    ++AD+H KTCSV
Subjt:  KSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

A0A1S3B6W4 uncharacterized protein LOC1034866741.03e-13269.15Show/hide
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP   H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN

Query:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV
        AGF+RGIV+ALIL+V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDV
Subjt:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV

Query:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        EKS +M+V+L SSSPDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

A0A5A7TLT1 Protein YLS92.10e-12869.15Show/hide
Query:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN
        MASSS DQQSQSKA +PP PP  SSA NNPPP+YPPP++GYPP   H GY PAMGYP    P         PPYN Y YAQAPPAAYY+  QNY A  ++
Subjt:  MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP-HPGYPPAMGYPHYGAP---------PPYNGYAYAQAPPAAYYHG-QNYPAEPVN

Query:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV
        AGF+RGIV+ALIL+V ++TLSSIITWI+LRPE+P+FKV+SFSV NFNISK NYSG+W+A+V V+NPN KLN+N ERIQSFVD+K++TLAMS+ DPFFLDV
Subjt:  AGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDV

Query:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV
        EKS +M+V+L SSSPDDPGNW + E+K+GRERA GTV FNLRF AWTTFR+GSWWTRRV+MR+ CED+KL F GPAA  A ++AD H KTCSV V
Subjt:  EKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV

A0A6J1FNP1 uncharacterized protein LOC1114471067.49e-11662.85Show/hide
Query:  MASSSGDQQSQSKA----GEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIR
        MASSS +QQS+SK+     +P  PP   SAA+NP PIYPPP++GYPP PHPGYPPA G     A PPYNGYAYAQAPPAAYYH   QNY  EP +A FIR
Subjt:  MASSSGDQQSQSKA----GEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIR

Query:  GIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTK
        GIV+ALI++V+L+ L+SIITWI+LRPEIP F+V++  V NFNISKSNYSG+W A + V+NPN+KLNL F+RIQ FV +K++TLAMSF DPFFL VE++  
Subjt:  GIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTK

Query:  MRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        MRVR  SSSPDDPGNW + E+K+G+E+A   V FNLRF  WTTF+SGSWWTR VI+R+FC+DLK+ F  P + +  F A  H   C+V
Subjt:  MRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

A0A6J1L2Y7 uncharacterized protein LOC1114986312.83e-11562.94Show/hide
Query:  MASSSGDQQSQSKA--GEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGI
        MASSS +QQS+SK+   +P  PP   SAA+NPPPIYPPP++GYPP PHPGYPPA G     A PPYNGYAYAQAPP AYYH   QNY  EP +A  IRGI
Subjt:  MASSSGDQQSQSKA--GEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHG--QNYPAEPVNAGFIRGI

Query:  VSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMR
        V+ALI++V+L+ LSSIITWI+LRPEIP F+V++  V NFNISKSNYSG+W A + V+NPN+KLNL F+RIQ FV +K++TLAMSF DPFFL VE++  MR
Subjt:  VSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMR

Query:  VRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV
        VR  SSSPDDPG+W + E+K+G+E+A   V FNLRF  WTTF+SGSWWTR VI+R+FC+DLK+ F  P + +  F A  H   C+V
Subjt:  VRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.1e-1830.2Show/hide
Query:  PSVGYP-----PGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHGQNYPAEPVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFS
        P+ GYP     P P    PP  GYP+     P  G AY       YY  Q  P     A  IR +       +LLL L   I ++++RP++P   + S S
Subjt:  PSVGYP-----PGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHGQNYPAEPVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFS

Query:  VGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWAD--IEDKMGRERAA-GTVGF
        V NFN+S +  SG W+  +   NPN K++L++E     + +   +L+ +   PF    +  T +   L  S     G + D  + D +G+ER+  G V F
Subjt:  VGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWAD--IEDKMGRERAA-GTVGF

Query:  NLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVY
        +LR +++ TFR G++  RR +  ++C+D+ +     ++ + K V  S  K C  Y
Subjt:  NLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVY

AT3G52460.1 hydroxyproline-rich glycoprotein family protein2.3e-4238.69Show/hide
Query:  SSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP---HPGYPPAMGYPHYGAPP---------PYNGYAYAQAPPAAYYHGQNYPAE-----
        S  ++++Q K      P ++S    N PP  PPP    PP P      YPP MGYP Y  PP         PY  Y YAQAPPA+YY G +YPA+     
Subjt:  SSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGP---HPGYPPAMGYPHYGAPP---------PYNGYAYAQAPPAAYYHGQNYPAE-----

Query:  --PVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDF-----KEHTLAM
          P ++GF+RGI + LI++V+LL +S+ ITW++LRP+IP+F V +FSV NFN++   +S  W A + +EN N KL   F+RIQ  V       ++  LA 
Subjt:  --PVNAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDF-----KEHTLAM

Query:  SFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKT
        +F  P F++ +KS  +   L +   + P   + + D+M +ER  GTV F+LR   W TF++  W  R   +++FC  LK+ F G +   A  V    P  
Subjt:  SFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKT

Query:  CSVYV
        C VYV
Subjt:  CSVYV

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.7e-1123.91Show/hide
Query:  PAEPV-NAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNY-SGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSF
        PA+P+     I  I   ++ ++ +  +  +ITW+  +P+   + VE+ SV NFN++  N+ S +++  +   NPN ++++ +  ++ FV FK+ TLA   
Subjt:  PAEPV-NAGFIRGIVSALILVVLLLTLSSIITWIMLRPEIPIFKVESFSVGNFNISKSNY-SGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSF

Query:  GDPFFLDVEKSTKMRVRLISSS-PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGP
         +PF        ++   LI+ +      N  D+      + + G +GF +   A   F+ G W +     +I C  + ++ + P
Subjt:  GDPFFLDVEKSTKMRVRLISSS-PDDPGNWADIEDKMGRERAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCGTCGGGGGATCAACAATCTCAATCCAAGGCCGGTGAGCCACCGTCCCCTCCGCGGTCCTCCTCCGCCGCCAACAACCCCCCACCCATCTACCCTCCGCC
GTCCGTCGGCTACCCTCCGGGGCCCCACCCGGGCTACCCGCCGGCAATGGGGTACCCCCATTACGGGGCCCCGCCGCCGTACAACGGCTACGCGTACGCCCAGGCCCCTC
CGGCGGCGTACTACCACGGGCAGAATTACCCGGCGGAGCCGGTGAACGCGGGATTCATCCGCGGGATTGTGTCGGCCCTGATTCTGGTGGTGCTGTTGCTGACCCTGAGC
AGCATAATCACGTGGATCATGCTCCGACCCGAGATCCCAATCTTCAAAGTGGAATCCTTCTCGGTGGGGAATTTCAACATCTCGAAATCGAATTACTCCGGCAGCTGGGA
GGCGGCGGTGGGGGTGGAGAATCCGAACCGGAAACTGAATCTGAATTTCGAGCGGATCCAGAGCTTCGTGGATTTCAAAGAACACACGCTGGCGATGTCGTTTGGGGACC
CGTTTTTCCTGGACGTGGAGAAGAGCACCAAAATGCGGGTGAGATTGATCTCGAGCAGCCCCGATGATCCCGGGAATTGGGCCGACATAGAGGACAAGATGGGCCGGGAG
CGGGCCGCCGGAACTGTGGGCTTCAATTTGAGATTCTTGGCCTGGACCACTTTCCGGTCTGGGTCATGGTGGACCAGGCGGGTCATTATGAGGATTTTCTGTGAGGATTT
GAAGCTTGCCTTCGCCGGACCCGCCGCACGCGACGCCAAGTTCGTCGCCGATTCCCACCCCAAGACTTGTTCCGTTTATGTC
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCGTCGGGGGATCAACAATCTCAATCCAAGGCCGGTGAGCCACCGTCCCCTCCGCGGTCCTCCTCCGCCGCCAACAACCCCCCACCCATCTACCCTCCGCC
GTCCGTCGGCTACCCTCCGGGGCCCCACCCGGGCTACCCGCCGGCAATGGGGTACCCCCATTACGGGGCCCCGCCGCCGTACAACGGCTACGCGTACGCCCAGGCCCCTC
CGGCGGCGTACTACCACGGGCAGAATTACCCGGCGGAGCCGGTGAACGCGGGATTCATCCGCGGGATTGTGTCGGCCCTGATTCTGGTGGTGCTGTTGCTGACCCTGAGC
AGCATAATCACGTGGATCATGCTCCGACCCGAGATCCCAATCTTCAAAGTGGAATCCTTCTCGGTGGGGAATTTCAACATCTCGAAATCGAATTACTCCGGCAGCTGGGA
GGCGGCGGTGGGGGTGGAGAATCCGAACCGGAAACTGAATCTGAATTTCGAGCGGATCCAGAGCTTCGTGGATTTCAAAGAACACACGCTGGCGATGTCGTTTGGGGACC
CGTTTTTCCTGGACGTGGAGAAGAGCACCAAAATGCGGGTGAGATTGATCTCGAGCAGCCCCGATGATCCCGGGAATTGGGCCGACATAGAGGACAAGATGGGCCGGGAG
CGGGCCGCCGGAACTGTGGGCTTCAATTTGAGATTCTTGGCCTGGACCACTTTCCGGTCTGGGTCATGGTGGACCAGGCGGGTCATTATGAGGATTTTCTGTGAGGATTT
GAAGCTTGCCTTCGCCGGACCCGCCGCACGCGACGCCAAGTTCGTCGCCGATTCCCACCCCAAGACTTGTTCCGTTTATGTC
Protein sequenceShow/hide protein sequence
MASSSGDQQSQSKAGEPPSPPRSSSAANNPPPIYPPPSVGYPPGPHPGYPPAMGYPHYGAPPPYNGYAYAQAPPAAYYHGQNYPAEPVNAGFIRGIVSALILVVLLLTLS
SIITWIMLRPEIPIFKVESFSVGNFNISKSNYSGSWEAAVGVENPNRKLNLNFERIQSFVDFKEHTLAMSFGDPFFLDVEKSTKMRVRLISSSPDDPGNWADIEDKMGRE
RAAGTVGFNLRFLAWTTFRSGSWWTRRVIMRIFCEDLKLAFAGPAARDAKFVADSHPKTCSVYV