CuGenDBv2

MS016135 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS016135
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: lysine-rich arabinogalactan protein 19
Genome location: scaffold9_2:590837..594280
RNA-Seq Expression: MS016135
Synteny: MS016135
Gene Ontology terms: GO:0017053 - transcriptional repressor complex (cellular component)
InterPro domains: IPR028226 - Protein LIN37
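
The genome location field above uses a `seqid:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The `Location` type and `parse_location` function are hypothetical names introduced here for illustration, and the length assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as the genome location field."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


loc = parse_location("scaffold9_2:590837..594280")
print(loc.seqid, loc.start, loc.end, loc.length)
# prints: scaffold9_2 590837 594280 3444
```

Under the inclusive-coordinate assumption the gene spans 3444 bp on scaffold9_2; with a 0-based, half-open convention the same string would describe 3443 bp.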


Homology
GenBank top hits | e value | % identity | Alignment
XP_022152574.1 lysine-rich arabinogalactan protein 19 [Momordica charantia] | 9.7e-186 | 99.43
Query:  MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV
        MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV
Subjt:  MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV

Query:  ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN
        ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN
Subjt:  ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN

Query:  GCKEM--RVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL
        GCKEM  RVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL
Subjt:  GCKEM--RVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL

Query:  KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
Subjt:  KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

XP_022934169.1 uncharacterized protein LOC111441417 [Cucurbita moschata] | 1.1e-152 | 82.91
Query:  MPADSPLIPTTTTPPAAATAAPVSYTLAP-----TAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQG
        MP DSPLIPT TTP A A AA  + T+AP     T A  T +AAAIARPLANQAPSRPISSIP THHLHYPPQALYTAQ IPVRTPN QLPK+QQDASQ 
Subjt:  MPADSPLIPTTTTPPAAATAAPVSYTLAP-----TAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQG

Query:  ILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPST
        ILYPVASSGRGFVPRPIRPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD+MSHPMHM RPPNL QQQL+P  GS+ +GSIKGAPNSSDPKVFPPST
Subjt:  ILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPST

Query:  ICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQ
        I E+NGCKEMRVRDD LCV+RDRKV ITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI V GA+PSQKKEV EE V E DKDE+SI  LSTQ
Subjt:  ICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQ

Query:  ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

XP_022982576.1 uncharacterized protein LOC111481411 [Cucurbita maxima] | 1.2e-154 | 83.85
Query:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS
        MP DSPLIPT TTP AAA       T AP+S+TL+P AATT  AAAAIARPLANQAPSRPISSIP THHLHYPPQALYTAQ IPVRTPN QLPK+QQDAS
Subjt:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS

Query:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP
        Q ILYPVASSGRGFVPRPIRPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD+MSHPMHM RPPNL QQQL+P  GS+ +GSIK APNSSDPKVFPP
Subjt:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP

Query:  STICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLS
        STI E+NGCKEMRVRDD LCV+RDRKV ITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI V GA+PSQKKEV EEEV E DKDE+SI  LS
Subjt:  STICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLS

Query:  TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

XP_023526543.1 uncharacterized protein LOC111790016 [Cucurbita pepo subsp. pepo] | 8.3e-153 | 83.67
Query:  MPADSPLIPTTTTPPA---AATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGIL
        MP DSPLIPT TTP A     T AP+S+TL   AATT  AAAAIARPLANQAPSRPISSIP THHLHYPPQALYTAQ IPVRTPN QLPK+QQDASQ IL
Subjt:  MPADSPLIPTTTTPPA---AATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGIL

Query:  YPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTIC
        YPVASSGRGFVPRPIRPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD+MSHPMHM RPPNL QQQL+P  GS+ +GSIKGAPNSSDPKVFPPSTI 
Subjt:  YPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTIC

Query:  ESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQEL
        E+NGCKEMRVRDD LCV+RDRKV ITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI V GA+PSQKKEV EE V E DKDE+SI  L TQEL
Subjt:  ESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQEL

Query:  LKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        LKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  LKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

XP_038903387.1 uncharacterized protein LOC120089997 [Benincasa hispida] | 2.2e-153 | 83.48
Query:  MPADSPLIPTTTT--PPAAATAAPVSYTLAPTAATTTAAA--AAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGI
        MP DSPLIPT TT   P+  T  P+S+TL   AATTTAAA  AAIARPLANQAPSRPISSIP THHLHYPPQALY AQ IPVRTPN QLPK+ QDASQ I
Subjt:  MPADSPLIPTTTT--PPAAATAAPVSYTLAPTAATTTAAA--AAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGI

Query:  LYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNL-QQQQLLPLPGSATAGSIKGAPNSSDPKVFPPST
        LYPVASSGRGFVPRPIRPLPADQ VT+AN  G+ +RPVVTFPHRP+GS HLD+MSHPMHM RPPNL QQQQL+P  GS+ +GSIKG PNSSDPKVF PST
Subjt:  LYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNL-QQQQLLPLPGSATAGSIKGAPNSSDPKVFPPST

Query:  ICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQ
        ICESNGCKEMRVRDD LCV+RDRKVRITDGASLYALCRSWLRNGSQEE+QPQYGNFLRSLPRPLPI  AGA+PSQKKEV EEEV E+DKDE SI  LSTQ
Subjt:  ICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQ

Query:  ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

TrEMBL top hits | e value | % identity | Alignment
A0A1S3CFG6 uncharacterized protein LOC103500291 isoform X1 | 1.7e-151 | 81.84
Query:  MPADSPLIPTTTTPPAAA---------TAAPVSYTL--APTAATTTAAA-AAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKV
        MP DSPLIPT TT  A +         T  P+S+TL  A TA TT AAA AAIARPLANQAPSRPISSIP THHLHYP QALY  Q IPVRTPN QLPK+
Subjt:  MPADSPLIPTTTTPPAAA---------TAAPVSYTL--APTAATTTAAA-AAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKV

Query:  QQDASQGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDP
         QDASQ ILYPVASSGRGFVPRPIRPLP DQAVT+AN  G+PHRPVVTFPHRP+GSPHLD+MSHPMHM RPPNL QQQL+P  GS+ +GSIKGAPNSSDP
Subjt:  QQDASQGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDP

Query:  KVFPPSTICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQS
        K FPPSTICESNGCKEMRVRDDTLCV+RDRKVRITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI VAGA PSQKKEV +EEV E+DKDE S
Subjt:  KVFPPSTICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQS

Query:  IAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        I  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDN  GS
Subjt:  IAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

A0A5A7TZA2 Mucin-2 isoform X2 | 1.7e-151 | 81.84
Query:  MPADSPLIPTTTTPPAAA---------TAAPVSYTL--APTAATTTAAA-AAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKV
        MP DSPLIPT TT  A +         T  P+S+TL  A TA TT AAA AAIARPLANQAPSRPISSIP THHLHYP QALY  Q IPVRTPN QLPK+
Subjt:  MPADSPLIPTTTTPPAAA---------TAAPVSYTL--APTAATTTAAA-AAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKV

Query:  QQDASQGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDP
         QDASQ ILYPVASSGRGFVPRPIRPLP DQAVT+AN  G+PHRPVVTFPHRP+GSPHLD+MSHPMHM RPPNL QQQL+P  GS+ +GSIKGAPNSSDP
Subjt:  QQDASQGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDP

Query:  KVFPPSTICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQS
        K FPPSTICESNGCKEMRVRDDTLCV+RDRKVRITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI VAGA PSQKKEV +EEV E+DKDE S
Subjt:  KVFPPSTICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQS

Query:  IAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        I  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDN  GS
Subjt:  IAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

A0A6J1DGE9 lysine-rich arabinogalactan protein 19 | 4.7e-186 | 99.43
Query:  MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV
        MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV
Subjt:  MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPV

Query:  ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN
        ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN
Subjt:  ASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESN

Query:  GCKEM--RVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL
        GCKEM  RVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL
Subjt:  GCKEM--RVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELL

Query:  KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
Subjt:  KRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

A0A6J1F1T6 uncharacterized protein LOC111441417 | 5.2e-153 | 82.91
Query:  MPADSPLIPTTTTPPAAATAAPVSYTLAP-----TAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQG
        MP DSPLIPT TTP A A AA  + T+AP     T A  T +AAAIARPLANQAPSRPISSIP THHLHYPPQALYTAQ IPVRTPN QLPK+QQDASQ 
Subjt:  MPADSPLIPTTTTPPAAATAAPVSYTLAP-----TAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQG

Query:  ILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPST
        ILYPVASSGRGFVPRPIRPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD+MSHPMHM RPPNL QQQL+P  GS+ +GSIKGAPNSSDPKVFPPST
Subjt:  ILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPST

Query:  ICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQ
        I E+NGCKEMRVRDD LCV+RDRKV ITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI V GA+PSQKKEV EE V E DKDE+SI  LSTQ
Subjt:  ICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQ

Query:  ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  ELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

A0A6J1IZP9 uncharacterized protein LOC111481411 | 5.6e-155 | 83.85
Query:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS
        MP DSPLIPT TTP AAA       T AP+S+TL+P AATT  AAAAIARPLANQAPSRPISSIP THHLHYPPQALYTAQ IPVRTPN QLPK+QQDAS
Subjt:  MPADSPLIPTTTTPPAAA-------TAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDAS

Query:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP
        Q ILYPVASSGRGFVPRPIRPLP DQ VTVAN  G+PHRPVV+FPHRP+GSPHLD+MSHPMHM RPPNL QQQL+P  GS+ +GSIK APNSSDPKVFPP
Subjt:  QGILYPVASSGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPP

Query:  STICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLS
        STI E+NGCKEMRVRDD LCV+RDRKV ITDGASLYALCRSWLRNGSQEE+QPQYG+FLRSLPRPLPI V GA+PSQKKEV EEEV E DKDE+SI  LS
Subjt:  STICESNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLS

Query:  TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS
        TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN  GS
Subjt:  TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNAAGS

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G04930.1 hydroxyproline-rich glycoprotein family protein | 1.4e-41 | 38.07
Query:  PAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHH----LHYP----PQALYT----AQPIPVRTPNQQLPKVQQDASQGILYPVAS
        P ++ +  VS +L+ TA+ T        RP  +Q P  P    PPT+     L +P     Q+ Y+    A  IPVR       +  QD S  +LYP A 
Subjt:  PAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHH----LHYP----PQALYT----AQPIPVRTPNQQLPKVQQDASQGILYPVAS

Query:  SGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPST-ICESNG
         GRGF  RP+R   AD +VT  N +G+P RP  T+   P     ++++       R P ++    L L      G I+ +P    P+V PP T I +++ 
Subjt:  SGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPST-ICESNG

Query:  CKEMRVRDDTLCVIRDRKVRITDG-ASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELLKR
         ++ R +D  L V+R RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +     S   +  EE   ED +DE+++ QLS ++LLKR
Subjt:  CKEMRVRDDTLCVIRDRKVRITDG-ASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELLKR

Query:  HVRRAKKVRSRLREERLQRIERYKTRLALLL
        H+ RAKKVR++LREER +RI RYK R+ L+L
Subjt:  HVRRAKKVRSRLREERLQRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein | 1.2e-37 | 34.9
Query:  PAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHH----LHYP----PQALYT----AQPIPVRTPNQQLPKVQQDASQGILYPVAS
        P ++ +  VS +L+ TA+ T        RP  +Q P  P    PPT+     L +P     Q+ Y+    A  IPVR       +  QD S  +LYP A 
Subjt:  PAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHH----LHYP----PQALYT----AQPIPVRTPNQQLPKVQQDASQGILYPVAS

Query:  SGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPN----------------SS
         GRGF  RP+R   AD +VT  N +G+P RP  T+   P     ++++       R P ++    L L      G I+ +P                   
Subjt:  SGRGFVPRPIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPN----------------SS

Query:  DPKVF---------------PPSTICESNGCKEMRVRDDTLCVIRDRKVRITDG-ASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQ
        DPK                 PP++I +++  ++ R +D  L V+R RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +     S 
Subjt:  DPKVF---------------PPSTICESNGCKEMRVRDDTLCVIRDRKVRITDG-ASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQ

Query:  KKEVTEEEVGEDDKDEQSIAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLL
          +  EE   ED +DE+++ QLS ++LLKRH+ RAKKVR++LREER +RI RYK R+ L+L
Subjt:  KKEVTEEEVGEDDKDEQSIAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLL

AT2G32840.1 proline-rich family protein | 2.2e-47 | 40.41
Query:  PTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISS---IPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDA---SQGILYPVAS
        P     P  + +  VS  +   + +       +  P +   P  P SS   I P H  H+P Q +YT  P+P+R  N       Q        ++YP  S
Subjt:  PTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISS---IPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDA---SQGILYPVAS

Query:  SGRGFVPRPIRP---LPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFP-PSTICE
        SGRGF  RP+R      AD   + +     P  PV  + H    S +LD M+  M    P N Q  QL        +G +KG P+   P+  P P++I +
Subjt:  SGRGFVPRPIRP---LPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFP-PSTICE

Query:  SNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDK-DEQSIAQLSTQEL
        ++G K+ R RDD L ++R RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP+       S  K++ EE + E+DK DE+S+  LS  +L
Subjt:  SNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDK-DEQSIAQLSTQEL

Query:  LKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD
        LKRH+ RAKKVR+RLREERL+RI RYK RLALLLPP  EQ R +
Subjt:  LKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD

AT2G32840.2 proline-rich family protein | 2.7e-29 | 34.84
Query:  PTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISS---IPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDA---SQGILYPVAS
        P     P  + +  VS  +   + +       +  P +   P  P SS   I P H  H+P Q +YT  P+P+R  N       Q        ++YP  S
Subjt:  PTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISS---IPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDA---SQGILYPVAS

Query:  SGRGFVPRPIRP---LPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFP-PSTICE
        SGRGF  RP+R      AD   + +     P  PV  + H    S +LD M+  M    P N Q  QL        +G +KG P+   P+  P P++I +
Subjt:  SGRGFVPRPIRP---LPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFP-PSTICE

Query:  SNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKD
        ++G K+ R RDD L ++R RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP+       S  K++ EE + E+DK+
Subjt:  SNGCKEMRVRDDTLCVIRDRKVRITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKD


Sequences
CDS sequence
ATGCCTGCAGACTCCCCCCTCATCCCCACCACCACCACGCCGCCCGCCGCCGCCACCGCCGCTCCCGTTTCGTACACTCTCGCCCCAACTGCAGCCACCACCACCGCCGC
CGCCGCCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCGAGACCCATTTCTTCAATTCCTCCGACCCACCATCTCCACTACCCTCCTCAAGCCCTCTACACGGCGC
AGCCCATCCCCGTTCGAACTCCTAACCAGCAATTGCCCAAGGTTCAGCAAGATGCGTCTCAGGGAATTCTTTACCCTGTCGCTTCCTCCGGCCGCGGCTTCGTTCCTCGG
CCCATTCGGCCGCTTCCCGCCGATCAGGCCGTCACGGTGGCCAACGCTGCTGGGTTTCCACATCGCCCGGTCGTCACGTTCCCGCATCGGCCGGTCGGGTCGCCTCATTT
GGACGCCATGAGCCATCCAATGCACATGGGGCGACCTCCCAATTTGCAACAGCAGCAGCTTCTTCCCCTTCCTGGCTCCGCCACTGCCGGCTCGATTAAGGGTGCCCCAA
ATTCCTCTGATCCAAAGGTTTTTCCCCCATCAACAATCTGCGAGTCAAATGGATGTAAAGAAATGAGAGTCAGAGACGACACTCTGTGCGTGATAAGAGATCGAAAAGTC
AGAATAACTGATGGGGCCTCTCTTTATGCACTTTGTCGATCATGGTTGAGGAATGGTTCTCAAGAAGAAAACCAGCCACAATATGGAAATTTTTTGAGGTCTCTTCCGCG
ACCTTTGCCCATCCCCGTGGCTGGTGCTATACCGTCACAGAAAAAGGAAGTAACCGAAGAAGAAGTTGGTGAGGATGATAAGGATGAGCAATCCATTGCGCAGTTGTCAA
CACAAGAGCTATTAAAAAGACATGTTAGACGTGCAAAGAAGGTTCGCTCACGATTGAGGGAGGAACGGTTGCAACGAATTGAAAGATACAAAACCAGGCTCGCTCTTCTC
CTTCCTCCACCAGTTGAGCAGTTGAGGACCGATAATGCTGCTGGAAGC
mRNA sequence
ATGCCTGCAGACTCCCCCCTCATCCCCACCACCACCACGCCGCCCGCCGCCGCCACCGCCGCTCCCGTTTCGTACACTCTCGCCCCAACTGCAGCCACCACCACCGCCGC
CGCCGCCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCGAGACCCATTTCTTCAATTCCTCCGACCCACCATCTCCACTACCCTCCTCAAGCCCTCTACACGGCGC
AGCCCATCCCCGTTCGAACTCCTAACCAGCAATTGCCCAAGGTTCAGCAAGATGCGTCTCAGGGAATTCTTTACCCTGTCGCTTCCTCCGGCCGCGGCTTCGTTCCTCGG
CCCATTCGGCCGCTTCCCGCCGATCAGGCCGTCACGGTGGCCAACGCTGCTGGGTTTCCACATCGCCCGGTCGTCACGTTCCCGCATCGGCCGGTCGGGTCGCCTCATTT
GGACGCCATGAGCCATCCAATGCACATGGGGCGACCTCCCAATTTGCAACAGCAGCAGCTTCTTCCCCTTCCTGGCTCCGCCACTGCCGGCTCGATTAAGGGTGCCCCAA
ATTCCTCTGATCCAAAGGTTTTTCCCCCATCAACAATCTGCGAGTCAAATGGATGTAAAGAAATGAGAGTCAGAGACGACACTCTGTGCGTGATAAGAGATCGAAAAGTC
AGAATAACTGATGGGGCCTCTCTTTATGCACTTTGTCGATCATGGTTGAGGAATGGTTCTCAAGAAGAAAACCAGCCACAATATGGAAATTTTTTGAGGTCTCTTCCGCG
ACCTTTGCCCATCCCCGTGGCTGGTGCTATACCGTCACAGAAAAAGGAAGTAACCGAAGAAGAAGTTGGTGAGGATGATAAGGATGAGCAATCCATTGCGCAGTTGTCAA
CACAAGAGCTATTAAAAAGACATGTTAGACGTGCAAAGAAGGTTCGCTCACGATTGAGGGAGGAACGGTTGCAACGAATTGAAAGATACAAAACCAGGCTCGCTCTTCTC
CTTCCTCCACCAGTTGAGCAGTTGAGGACCGATAATGCTGCTGGAAGC
Protein sequence
MPADSPLIPTTTTPPAAATAAPVSYTLAPTAATTTAAAAAIARPLANQAPSRPISSIPPTHHLHYPPQALYTAQPIPVRTPNQQLPKVQQDASQGILYPVASSGRGFVPR
PIRPLPADQAVTVANAAGFPHRPVVTFPHRPVGSPHLDAMSHPMHMGRPPNLQQQQLLPLPGSATAGSIKGAPNSSDPKVFPPSTICESNGCKEMRVRDDTLCVIRDRKV
RITDGASLYALCRSWLRNGSQEENQPQYGNFLRSLPRPLPIPVAGAIPSQKKEVTEEEVGEDDKDEQSIAQLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALL
LPPPVEQLRTDNAAGS
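
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below builds the standard genetic code (NCBI translation table 1) and translates a nucleotide string codon by codon. The `translate` function is a hypothetical helper written for this illustration, and the use of the standard code for this nuclear gene is an assumption. Only the first five codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by base in the sequence T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop, 'X' an unknown codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First five codons of the CDS listed above.
print(translate("ATGCCTGCAGACTCC"))
# prints: MPADS
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence, after joining its lines, gives a quick consistency check between the two fields.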