; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g0850 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g0850
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionlysine-rich arabinogalactan protein 19
Genome locationMC08:6879135..6883091
RNA-Seq ExpressionMC08g0850
SyntenyMC08g0850
Gene Ontology termsGO:0017053 - transcriptional repressor complex (cellular component)
InterPro domainsIPR028226 - Protein LIN37


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152574.1 lysine-rich arabinogalactan protein 19 [Momordica charantia]2.22e-240100Show/hide
Query:  MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV
        MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV
Subjt:  MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV

Query:  ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN
        ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN
Subjt:  ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN

Query:  GCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL
        GCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL
Subjt:  GCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL

Query:  KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
Subjt:  KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

XP_022934169.1 uncharacterized protein LOC111441417 [Cucurbita moschata]6.42e-19282.54Show/hide
Query:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS
        MP DSPLIPT TTP A A       T AP+S+TL   AATT+  AAAIARPLANQAPSRPISSIP THHLHYPPQALYTAQ IPVRTPN QLPK+QQDAS
Subjt:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS

Query:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP
        Q ILYPVASSGRGFVPRPIRPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD+MSHPMHM RPPNLQQQ L+P  GS+ +GSIKGAPNSSDPKVFPP
Subjt:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP

Query:  STICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQ
        STI E+NGCKEMR  VRDD LCV+RDRKV ITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI V GA+PSQKKEV EE V E DKDE+SI  
Subjt:  STICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQ

Query:  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

XP_022982576.1 uncharacterized protein LOC111481411 [Cucurbita maxima]4.75e-19483.38Show/hide
Query:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS
        MP DSPLIPT TTP AAA       T AP+S+TL+P AATT  AAAAIARPLANQAPSRPISSIP THHLHYPPQALYTAQ IPVRTPN QLPK+QQDAS
Subjt:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS

Query:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP
        Q ILYPVASSGRGFVPRPIRPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD+MSHPMHM RPPNLQQQ L+P  GS+ +GSIK APNSSDPKVFPP
Subjt:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP

Query:  STICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQ
        STI E+NGCKEMR  VRDD LCV+RDRKV ITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI V GA+PSQKKEV EEEV E DKDE+SI  
Subjt:  STICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQ

Query:  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

XP_023526543.1 uncharacterized protein LOC111790016 [Cucurbita pepo subsp. pepo]1.12e-19183.19Show/hide
Query:  MPADSPLIPTTTTPPAAATA---APVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGIL
        MP DSPLIPT TTP A  T    AP+S+TL   AATT  AAAAIARPLANQAPSRPISSIP THHLHYPPQALYTAQ IPVRTPN QLPK+QQDASQ IL
Subjt:  MPADSPLIPTTTTPPAAATA---APVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGIL

Query:  YPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTIC
        YPVASSGRGFVPRPIRPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD+MSHPMHM RPPNLQQQ L+P  GS+ +GSIKGAPNSSDPKVFPPSTI 
Subjt:  YPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTIC

Query:  ESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQ
        E+NGCKEMR  VRDD LCV+RDRKV ITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI V GA+PSQKKEV EE V E DKDE+SI  L TQ
Subjt:  ESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQ

Query:  ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

XP_038903387.1 uncharacterized protein LOC120089997 [Benincasa hispida]3.30e-19283Show/hide
Query:  MPADSPLIPTTTTP--PAAATAAPVSYTLAPTAATTTAAAA--AIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGI
        MP DSPLIPT TT   P+  T  P+S+TL   AATTTAAAA  AIARPLANQAPSRPISSIP THHLHYPPQALY AQ IPVRTPN QLPK+ QDASQ I
Subjt:  MPADSPLIPTTTTP--PAAATAAPVSYTLAPTAATTTAAAA--AIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGI

Query:  LYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQ-LLPLPGSATAGSIKGAPNSSDPKVFPPST
        LYPVASSGRGFVPRPIRPLPADQ VT+AN  G+ +RPVVTFPHRP+GS HLD+MSHPMHM RPPNLQQQQ L+P  GS+ +GSIKG PNSSDPKVF PST
Subjt:  LYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQ-LLPLPGSATAGSIKGAPNSSDPKVFPPST

Query:  ICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLS
        ICESNGCKEMR  VRDD LCV+RDRKVRITDGASLYALCRSWLRNGSQEE+QPQYGNFLRSLPRPLPI  AGA+PSQKKEV EEEV E+DKDE SI  LS
Subjt:  ICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLS

Query:  TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

TrEMBL top hitse value%identityAlignment
A0A1S3CFG6 uncharacterized protein LOC103500291 isoform X11.55e-18981.49Show/hide
Query:  MPADSPLIPTTTTPPAAATAA-----------PVSYTL--APTAATTTAAA-AAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLP
        MP DSPLIPT TT  +AAT+A           P+S+TL  A TA TT AAA AAIARPLANQAPSRPISSIP THHLHYP QALY  Q IPVRTPN QLP
Subjt:  MPADSPLIPTTTTPPAAATAA-----------PVSYTL--APTAATTTAAA-AAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLP

Query:  KVQQDASQGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSS
        K+ QDASQ ILYPVASSGRGFVPRPIRPLP DQAVT+AN  G+PHRPVVTFPHRP+GSPHLD+MSHPMHM RPPNLQQQ L+P  GS+ +GSIKGAPNSS
Subjt:  KVQQDASQGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSS

Query:  DPKVFPPSTICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDK
        DPK FPPSTICESNGCKEMR  VRDDTLCV+RDRKVRITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI VAGA PSQKKEV +EEV E+DK
Subjt:  DPKVFPPSTICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDK

Query:  DEQSIAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        DE SI  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDN  GS
Subjt:  DEQSIAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

A0A5A7TZA2 Mucin-2 isoform X21.55e-18981.49Show/hide
Query:  MPADSPLIPTTTTPPAAATAA-----------PVSYTL--APTAATTTAAA-AAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLP
        MP DSPLIPT TT  +AAT+A           P+S+TL  A TA TT AAA AAIARPLANQAPSRPISSIP THHLHYP QALY  Q IPVRTPN QLP
Subjt:  MPADSPLIPTTTTPPAAATAA-----------PVSYTL--APTAATTTAAA-AAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLP

Query:  KVQQDASQGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSS
        K+ QDASQ ILYPVASSGRGFVPRPIRPLP DQAVT+AN  G+PHRPVVTFPHRP+GSPHLD+MSHPMHM RPPNLQQQ L+P  GS+ +GSIKGAPNSS
Subjt:  KVQQDASQGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSS

Query:  DPKVFPPSTICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDK
        DPK FPPSTICESNGCKEMR  VRDDTLCV+RDRKVRITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI VAGA PSQKKEV +EEV E+DK
Subjt:  DPKVFPPSTICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDK

Query:  DEQSIAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        DE SI  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDN  GS
Subjt:  DEQSIAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

A0A6J1DGE9 lysine-rich arabinogalactan protein 191.08e-240100Show/hide
Query:  MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV
        MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV
Subjt:  MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV

Query:  ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN
        ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN
Subjt:  ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN

Query:  GCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL
        GCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL
Subjt:  GCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL

Query:  KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
Subjt:  KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

A0A6J1F1T6 uncharacterized protein LOC1114414173.11e-19282.54Show/hide
Query:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS
        MP DSPLIPT TTP A A       T AP+S+TL   AATT+  AAAIARPLANQAPSRPISSIP THHLHYPPQALYTAQ IPVRTPN QLPK+QQDAS
Subjt:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS

Query:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP
        Q ILYPVASSGRGFVPRPIRPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD+MSHPMHM RPPNLQQQ L+P  GS+ +GSIKGAPNSSDPKVFPP
Subjt:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP

Query:  STICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQ
        STI E+NGCKEMR  VRDD LCV+RDRKV ITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI V GA+PSQKKEV EE V E DKDE+SI  
Subjt:  STICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQ

Query:  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

A0A6J1IZP9 uncharacterized protein LOC1114814112.30e-19483.38Show/hide
Query:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS
        MP DSPLIPT TTP AAA       T AP+S+TL+P AATT  AAAAIARPLANQAPSRPISSIP THHLHYPPQALYTAQ IPVRTPN QLPK+QQDAS
Subjt:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS

Query:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP
        Q ILYPVASSGRGFVPRPIRPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD+MSHPMHM RPPNLQQQ L+P  GS+ +GSIK APNSSDPKVFPP
Subjt:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP

Query:  STICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQ
        STI E+NGCKEMR  VRDD LCV+RDRKV ITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI V GA+PSQKKEV EEEV E DKDE+SI  
Subjt:  STICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQ

Query:  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G04930.1 hydroxyproline-rich glycoprotein family protein3.8e-4037.65Show/hide
Query:  PAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHH----LHYP----PQALYT----AQPIPVRTPNQQLPKVQQDASQGILYPVAS
        P ++ +  VS +L+ TA+ T        RP  +Q P  P    PPT+     L +P     Q+ Y+    A  IPVR       +  QD S  +LYP A 
Subjt:  PAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHH----LHYP----PQALYT----AQPIPVRTPNQQLPKVQQDASQGILYPVAS

Query:  SGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESNGC
         GRGF  RP+R   AD +VT  N +G+P RP  T+   P     ++++       R P ++    L L      G I+ +P    P+V PP T       
Subjt:  SGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESNGC

Query:  KEMRDRVRDDTLCVIRDRKVRITDG-ASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELLK
        +  + R +D  L V+R RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +     S   +  EE   ED +DE+++ QLS ++LLK
Subjt:  KEMRDRVRDDTLCVIRDRKVRITDG-ASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELLK

Query:  RHVRRAKKVRSRLREERLQRIERYKTRLALLL
        RH+ RAKKVR++LREER +RI RYK R+ L+L
Subjt:  RHVRRAKKVRSRLREERLQRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein3.4e-3634.71Show/hide
Query:  PAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHH----LHYP----PQALYT----AQPIPVRTPNQQLPKVQQDASQGILYPVAS
        P ++ +  VS +L+ TA+ T        RP  +Q P  P    PPT+     L +P     Q+ Y+    A  IPVR       +  QD S  +LYP A 
Subjt:  PAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHH----LHYP----PQALYT----AQPIPVRTPNQQLPKVQQDASQGILYPVAS

Query:  SGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPN----------------SS
         GRGF  RP+R   AD +VT  N +G+P RP  T+   P     ++++       R P ++    L L      G I+ +P                   
Subjt:  SGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPN----------------SS

Query:  DPKVF---------------PPSTICESNGCKEMRDRVRDDTLCVIRDRKVRITDG-ASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIP
        DPK                 PP++I +++  +  + R +D  L V+R RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +     
Subjt:  DPKVF---------------PPSTICESNGCKEMRDRVRDDTLCVIRDRKVRITDG-ASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIP

Query:  SQKKEVTEEEVGEDDKDEQSIAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLL
        S   +  EE   ED +DE+++ QLS ++LLKRH+ RAKKVR++LREER +RI RYK R+ L+L
Subjt:  SQKKEVTEEEVGEDDKDEQSIAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLL

AT2G32840.1 proline-rich family protein8.0e-4640.17Show/hide
Query:  PTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISS---IPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDA---SQGILYPVAS
        P     P  + +  VS  +   + +       +  P +   P  P SS   I P H  H+P Q +YT  P+P+R  N       Q        ++YP  S
Subjt:  PTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISS---IPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDA---SQGILYPVAS

Query:  SGRGFVPRPIRP---LPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFP-PSTICE
        SGRGF  RP+R      AD   + +     P  PV  + H    S +LD M+  M    P N Q  QL        +G +KG P+   P+  P P++I +
Subjt:  SGRGFVPRPIRP---LPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFP-PSTICE

Query:  SNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDK-DEQSIAQLSTQ
        ++G K+ R   RDD L ++R RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP+       S  K++ EE + E+DK DE+S+  LS  
Subjt:  SNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDK-DEQSIAQLSTQ

Query:  ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD
        +LLKRH+ RAKKVR+RLREERL+RI RYK RLALLLPP  EQ R +
Subjt:  ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD

AT2G32840.2 proline-rich family protein9.8e-2834.6Show/hide
Query:  PTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISS---IPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDA---SQGILYPVAS
        P     P  + +  VS  +   + +       +  P +   P  P SS   I P H  H+P Q +YT  P+P+R  N       Q        ++YP  S
Subjt:  PTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISS---IPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDA---SQGILYPVAS

Query:  SGRGFVPRPIRP---LPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFP-PSTICE
        SGRGF  RP+R      AD   + +     P  PV  + H    S +LD M+  M    P N Q  QL        +G +KG P+   P+  P P++I +
Subjt:  SGRGFVPRPIRP---LPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFP-PSTICE

Query:  SNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKD
        ++G K+ R   RDD L ++R RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP+       S  K++ EE + E+DK+
Subjt:  SNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CGAGTCGAAACCCCAATCCTGTCAAATTCGCAAAAAATTGAGTTTTTATTTGGCAATTTTGATATTCAAACAATCAAACCGCAGACTCCCAAATTCCCCGCCACCATGCC
TGCAGACTCCCCCCTCATCCCCACCACCACCACGCCGCCCGCCGCCGCCACCGCCGCTCCCGTTTCGTACACTCTCGCCCCAACTGCAGCCACCACCACCGCCGCCGCCG
CCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCGAGACCCATTTCTTCAATTCCTCCGACCCACCATCTCCACTACCCTCCTCAAGCCCTCTACACGGCGCAGCCC
ATCCCCGTTCGAACTCCTAACCAGCAATTGCCCAAGGTTCAGCAAGACGCGTCTCAGGGAATTCTTTACCCTGTCGCTTCCTCCGGCCGCGGCTTCGTTCCTCGGCCCAT
TCGGCCGCTTCCCGCCGATCAGGCCGTCACGGTAGCCAACGCTGCTGGGTTTCCACATCGCCCGGTCGTCACGTTCCCGCATCGGCCGGTCGGGTCGCCTCATTTGGACG
CCATGAGCCATCCAATGCACATGGGGCGACCTCCCAATTTGCAACAGCAGCAGCTTCTTCCCCTTCCTGGCTCCGCCACTGCCGGCTCGATTAAGGGTGCCCCAAATTCC
TCTGATCCAAAGGTTTTTCCCCCATCAACAATCTGCGAGTCAAATGGATGTAAAGAAATGAGGGATAGAGTCAGAGACGACACTCTGTGCGTGATAAGAGATCGAAAAGT
CAGAATAACTGATGGGGCCTCTCTTTATGCACTTTGTCGATCATGGTTGAGGAATGGTTCTCAAGAAGAAAACCAGCCACAATATGGAAATTTTTTGAGGTCTCTTCCGC
GACCTTTGCCCATCCCCGTGGCTGGTGCTATACCGTCACAGAAAAAGGAAGTAACCGAAGAAGAAGTTGGCGAGGATGATAAGGATGAGCAATCCATTGCGCAGTTGTCA
ACACAAGAGCTATTAAAAAGACATGTTAGACGTGCAAAGAAGGTTCGCTCACGATTGAGGGAGGAACGGTTGCAACGAATTGAAAGATACAAAACCAGGCTCGCTCTTCT
CCTTCCTCCACCAGTTGAGCAGTTGAGGACCGATAATGCTGCTGGAAGCTGA
mRNA sequenceShow/hide mRNA sequence
CACGAGTCGAAACCCCAATCCTGTCAAATTCGCAAAAAATTGAGTTTTTATTTGGCAATTTTGATATTCAAACAATCAAACCGCAGACTCCCAAATTCCCCGCCACCATG
CCTGCAGACTCCCCCCTCATCCCCACCACCACCACGCCGCCCGCCGCCGCCACCGCCGCTCCCGTTTCGTACACTCTCGCCCCAACTGCAGCCACCACCACCGCCGCCGC
CGCCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCGAGACCCATTTCTTCAATTCCTCCGACCCACCATCTCCACTACCCTCCTCAAGCCCTCTACACGGCGCAGC
CCATCCCCGTTCGAACTCCTAACCAGCAATTGCCCAAGGTTCAGCAAGACGCGTCTCAGGGAATTCTTTACCCTGTCGCTTCCTCCGGCCGCGGCTTCGTTCCTCGGCCC
ATTCGGCCGCTTCCCGCCGATCAGGCCGTCACGGTAGCCAACGCTGCTGGGTTTCCACATCGCCCGGTCGTCACGTTCCCGCATCGGCCGGTCGGGTCGCCTCATTTGGA
CGCCATGAGCCATCCAATGCACATGGGGCGACCTCCCAATTTGCAACAGCAGCAGCTTCTTCCCCTTCCTGGCTCCGCCACTGCCGGCTCGATTAAGGGTGCCCCAAATT
CCTCTGATCCAAAGGTTTTTCCCCCATCAACAATCTGCGAGTCAAATGGATGTAAAGAAATGAGGGATAGAGTCAGAGACGACACTCTGTGCGTGATAAGAGATCGAAAA
GTCAGAATAACTGATGGGGCCTCTCTTTATGCACTTTGTCGATCATGGTTGAGGAATGGTTCTCAAGAAGAAAACCAGCCACAATATGGAAATTTTTTGAGGTCTCTTCC
GCGACCTTTGCCCATCCCCGTGGCTGGTGCTATACCGTCACAGAAAAAGGAAGTAACCGAAGAAGAAGTTGGCGAGGATGATAAGGATGAGCAATCCATTGCGCAGTTGT
CAACACAAGAGCTATTAAAAAGACATGTTAGACGTGCAAAGAAGGTTCGCTCACGATTGAGGGAGGAACGGTTGCAACGAATTGAAAGATACAAAACCAGGCTCGCTCTT
CTCCTTCCTCCACCAGTTGAGCAGTTGAGGACCGATAATGCTGCTGGAAGCTGAGTCCATTAAATCCCGAAATCCTCATCCCTGTAATCAATGTGGACTGCTGTCCAAAT
CATTCTTCGCGATTGCAGATGGAGTCAGAGAAACATTGAACAAAAGCGCAGAGGATTATATGCAGGCAAGAAAAGATAGATATTCGATAGAAATTGGTAATTCTTGTCTT
CCTATCAGTTCCTTTTTGGCAGTTTGTACCATGTGTTTTTACCATTGTTGTGCTTTCAAGTAGGTTAAAAAACGATGTCATTTACTGAGGTTAGAAATGGAGGAAATTGA
GAAGTAAATCACATATGTTCCCCCCTTTTTTACCTAACTTCTGTTGCTAAAATTCCCATTTAACATCATCACTTAGTTATATTTTGAGATGTCTGAATCTGAATCTTCAC
AGGGGCTTAAAATAAGGT
Protein sequenceShow/hide protein sequence
RVETPILSNSQKIEFLFGNFDIQTIKPQTPKFPATMPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQP
IPVRTPNQQLPKVQQDASQGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNS
SDPKVFPPSTICESNGCKEMRDRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLS
TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS