CuGenDBv2

Carg07306 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg07306
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: lysine-rich arabinogalactan protein 19
Genome location: Carg_Chr01:1015719..1019128
RNA-Seq Expression: Carg07306
Synteny: Carg07306
Gene Ontology terms: GO:0017053 - transcriptional repressor complex (cellular component)
InterPro domains: IPR028226 - Protein LIN37


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606861.1 hypothetical protein SDJN03_00203, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.4e-189; identity: 100%)
Query:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY
        MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

XP_022934169.1 uncharacterized protein LOC111441417 [Cucurbita moschata] (e value: 5.9e-159; identity: 86.29%)
Query:  MPDDSLPISTNTTPAAVVA---PTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQA
        MP DS  I TNTTPAAV A    TTTVAP S TLTTAAATT+AA IARPLANQAPSRPISSIPQTHHLHYPPQALY AQ IP+RTPN QLPK+QQDASQA
Subjt:  MPDDSLPISTNTTPAAVVA---PTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQA

Query:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI
        ILYPVASSGRGFVPRPIRPLP DQ VTVANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMH+ARPPNL QQLI  SGS+ISGSIKGAPNSSDPKVFPPSTI
Subjt:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI

Query:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQD
         E+NGCKEMRVRDDAL V+RDRKV ITDGASLYALCRSWLRNGS EE QPQYG+FLR LPRPLPI V GAV SQKKEVV + VDE+DKDE SIE LS+Q+
Subjt:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQD

Query:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        LLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

XP_022949214.1 uncharacterized protein LOC111452632 [Cucurbita moschata] (e value: 4.2e-189; identity: 99.71%)
Query:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY
        MPDDSLPISTNTTPAA VAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

XP_022998467.1 uncharacterized protein LOC111493092 [Cucurbita maxima] (e value: 3.4e-183; identity: 96.83%)
Query:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY
        MPDDSLPISTNTTPAAV A TTTVAPGSLTLTTAAATTT A IARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPK+QQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLP DQAVTVANPGGYPHRP VTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLIS+SGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGS EECQPQYGNFLRPLPRPLPIPVAGAV SQKKEVV DEVDEEDKDEGSIEQLSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

XP_023525420.1 uncharacterized protein LOC111789035 [Cucurbita pepo subsp. pepo] (e value: 1.1e-184; identity: 97.69%)
Query:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY
        MPDDSLPISTNTTPAAV A TTTVAPGS+TLTTAAATTTAA IARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLP DQAVTVANPGGYP RPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAV SQKKEVVGDEVDEEDKDEGSIE+LSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TZA2 Mucin-2 isoform X2 (e value: 2.1e-154; identity: 83.19%)
Query:  MPDDSLPISTNTTPAAVVA-----PTTTVAPGSLTLTTAA-ATTTA----AVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKI
        MP DS  I TNTT AA  A      TTTV P S TLT AA A TTA    A IARPLANQAPSRPISSIPQTHHLHYP QALY  Q IP+RTPN QLPK+
Subjt:  MPDDSLPISTNTTPAAVVA-----PTTTVAPGSLTLTTAA-ATTTA----AVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKI

Query:  QQDASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPK
         QDASQAILYPVASSGRGFVPRPIRPLP DQAVT+ANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMH+ RPPNL QQLI  SGS+ISGSIKGAPNSSDPK
Subjt:  QQDASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPK

Query:  VFPPSTICESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSI
         FPPSTICESNGCKEMRVRDD L V+RDRKVRITDGASLYALCRSWLRNGS EE QPQYG+FLR LPRPLPI VAGA  SQKKEVV +EVDEEDKDEGSI
Subjt:  VFPPSTICESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSI

Query:  EQLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        E LS+Q+LLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPP+EQLRTDN+TGS
Subjt:  EQLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1F1T6 uncharacterized protein LOC111441417 (e value: 2.9e-159; identity: 86.29%)
Query:  MPDDSLPISTNTTPAAVVA---PTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQA
        MP DS  I TNTTPAAV A    TTTVAP S TLTTAAATT+AA IARPLANQAPSRPISSIPQTHHLHYPPQALY AQ IP+RTPN QLPK+QQDASQA
Subjt:  MPDDSLPISTNTTPAAVVA---PTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQA

Query:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI
        ILYPVASSGRGFVPRPIRPLP DQ VTVANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMH+ARPPNL QQLI  SGS+ISGSIKGAPNSSDPKVFPPSTI
Subjt:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI

Query:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQD
         E+NGCKEMRVRDDAL V+RDRKV ITDGASLYALCRSWLRNGS EE QPQYG+FLR LPRPLPI V GAV SQKKEVV + VDE+DKDE SIE LS+Q+
Subjt:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQD

Query:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        LLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1GC68 uncharacterized protein LOC111452632 (e value: 2.0e-189; identity: 99.71%)
Query:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY
        MPDDSLPISTNTTPAA VAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1IZP9 uncharacterized protein LOC111481411 (e value: 3.5e-157; identity: 85.14%)
Query:  MPDDSLPISTNTTPAAVVA---PTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQA
        MP DS  I TNTTPAA  A    TTTVAP S TL+ AAATT AA IARPLANQAPSRPISSIPQTHHLHYPPQALY AQ IP+RTPN QLPK+QQDASQA
Subjt:  MPDDSLPISTNTTPAAVVA---PTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQA

Query:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI
        ILYPVASSGRGFVPRPIRPLP DQ VTVANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMH+ARPPNL QQLI  SGS+ISGSIK APNSSDPKVFPPSTI
Subjt:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI

Query:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQD
         E+NGCKEMRVRDDAL V+RDRKV ITDGASLYALCRSWLRNGS EE QPQYG+FLR LPRPLPI V GAV SQKKEVV +EVDE+DKDE SIE LS+Q+
Subjt:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQD

Query:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        LLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1K833 uncharacterized protein LOC111493092 (e value: 1.7e-183; identity: 96.83%)
Query:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY
        MPDDSLPISTNTTPAAV A TTTVAPGSLTLTTAAATTT A IARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPK+QQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLP DQAVTVANPGGYPHRP VTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLIS+SGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGS EECQPQYGNFLRPLPRPLPIPVAGAV SQKKEVV DEVDEEDKDEGSIEQLSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G04930.1 hydroxyproline-rich glycoprotein family protein (e value: 6.7e-44; identity: 38.35%)
Query:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYA----AQRIPIRTPNHQLPKIQQDASQ
        +PD +   S   +P+   A  T V P    + T     +    A P       RPI+  P  H   +  Q+ Y+    A  IP+R       +  QD S 
Subjt:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYA----AQRIPIRTPNHQLPKIQQDASQ

Query:  AILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNL-PQQLISLSGSAISGSIKGAPNSSDPKVFPPS
        A+LYP A  GRGF  RP+R   AD +VT  N  GYP RP  T+   P     ++S+       R P + P   + L      G I+ +P    P+V PP 
Subjt:  AILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNL-PQQLISLSGSAISGSIKGAPNSSDPKVFPPS

Query:  T-ICESNGCKEMRVRDDALAVIRDRKVRITDG-ASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQL
        T I +++  ++ R +D ALAV+R RKVRIT+G +SLY+L RSWL+NG+H   QPQ    ++PLP+PLP+ +    +S   +   +  DE+ +DE +++QL
Subjt:  T-ICESNGCKEMRVRDDALAVIRDRKVRITDG-ASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQL

Query:  SSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLL
        S +DLLKRH++RAKKVR++LREER  RI RYK R+ L+L
Subjt:  SSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein (e value: 3.5e-40; identity: 35.95%)
Query:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYA----AQRIPIRTPNHQLPKIQQDASQ
        +PD +   S   +P+   A  T V P    + T     +    A P       RPI+  P  H   +  Q+ Y+    A  IP+R       +  QD S 
Subjt:  MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYA----AQRIPIRTPNHQLPKIQQDASQ

Query:  AILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNL-PQQLISLSGSAI-----------------SG
        A+LYP A  GRGF  RP+R   AD +VT  N  GYP RP  T+   P     ++S+       R P + P   + L GS +                 SG
Subjt:  AILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNL-PQQLISLSGSAI-----------------SG

Query:  SIKGAPNSSDPKVF---------------PPSTICESNGCKEMRVRDDALAVIRDRKVRITDG-ASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPI
         I G     DPK                 PP++I +++  ++ R +D ALAV+R RKVRIT+G +SLY+L RSWL+NG+H   QPQ    ++PLP+PLP+
Subjt:  SIKGAPNSSDPKVF---------------PPSTICESNGCKEMRVRDDALAVIRDRKVRITDG-ASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPI

Query:  PVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLL
         +    +S   +   +  DE+ +DE +++QLS +DLLKRH++RAKKVR++LREER  RI RYK R+ L+L
Subjt:  PVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLL

AT2G32840.1 proline-rich family protein (e value: 6.7e-52; identity: 44.64%)
Query:  NTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQRIPIRTPN------HQLPKIQQDASQAILYPV
        N  P+  VA +T +   S +  T   T     +  P +   P  P SS       H H+P Q +Y    +PIR  N      HQ P    D S +++YP 
Subjt:  NTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQRIPIRTPN------HQLPKIQQDASQAILYPV

Query:  ASSGRGFVPRPIRPLPADQAVTVA--NPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFP-PSTIC
         SSGRGF  RP+R      A  V   +PGGY P  PV  + H    S +LD M+  M  A P N  QQ   L     SG +KG P+   P+  P P++I 
Subjt:  ASSGRGFVPRPIRPLPADQAVTVA--NPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFP-PSTIC

Query:  ESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDK-DEGSIEQLSSQD
        +++G K+ R RDDAL ++R RKVRIT+GASLY+LCRSWLRNG+HE  +PQ  + +  LP+PL  PV    +S  K++V + + EEDK DE S++ LS  D
Subjt:  ESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDK-DEGSIEQLSSQD

Query:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTD
        LLKRH+ RAKKVR+RLREERL RI RYK RLALLLPP  EQ R +
Subjt:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTD

AT2G32840.2 proline-rich family protein (e value: 4.1e-33; identity: 39.58%)
Query:  NTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQRIPIRTPN------HQLPKIQQDASQAILYPV
        N  P+  VA +T +   S +  T   T     +  P +   P  P SS       H H+P Q +Y    +PIR  N      HQ P    D S +++YP 
Subjt:  NTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQRIPIRTPN------HQLPKIQQDASQAILYPV

Query:  ASSGRGFVPRPIRPLPADQAVTVA--NPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFP-PSTIC
         SSGRGF  RP+R      A  V   +PGGY P  PV  + H    S +LD M+  M  A P N  QQ   L     SG +KG P+   P+  P P++I 
Subjt:  ASSGRGFVPRPIRPLPADQAVTVA--NPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFP-PSTIC

Query:  ESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKD
        +++G K+ R RDDAL ++R RKVRIT+GASLY+LCRSWLRNG+HE  +PQ  + +  LP+PL  PV    +S  K++V + + EEDK+
Subjt:  ESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKD
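
The % identity values reported for the hits above can be related to the alignment midlines (the unlabeled line between each Query and Subjt line). The following is a minimal sketch, assuming the usual BLAST convention that a letter in the midline marks an identical residue pair, "+" marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` and its arguments are hypothetical and are not part of CuGenDB or BLAST.

```python
def percent_identity(query_blocks, midline_blocks):
    """Estimate percent identity from BLAST-style alignment blocks.

    query_blocks:   the Query strings of each block, with the "Query:  "
                    prefix removed (gap characters '-' are kept, so the
                    total length equals the number of alignment columns).
    midline_blocks: the matching midline strings, with the same-width
                    leading padding removed.
    """
    # Number of alignment columns, including gap columns.
    aligned = sum(len(q) for q in query_blocks)
    # Letters in the midline are identities; '+' and ' ' are not counted.
    identical = sum(ch.isalpha() for m in midline_blocks for ch in m)
    return round(100.0 * identical / aligned, 2)


# First ten columns of the XP_022934169.1 alignment shown above.
print(percent_identity(["MPDDSLPIST"], ["MP DS  I T"]))
```

As a consistency check, the XP_022949214.1 alignment has a single mismatch across 347 columns, and 346/347 rounds to the reported 99.71%.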


Sequences
CDS sequence
ATGCCCGACGACTCCCTCCCCATCTCCACCAACACGACGCCCGCCGCCGTCGTTGCCCCCACCACCACTGTCGCTCCGGGTTCATTAACTCTAACTACTGCTGCAGCCAC
TACCACTGCCGCTGTTATTGCTCGTCCGCTTGCGAATCAAGCTCCGTCGAGACCCATTTCTTCAATTCCTCAAACCCACCATCTCCACTACCCTCCTCAAGCCCTCTACG
CGGCGCAGCGCATCCCTATTCGAACTCCTAACCACCAATTGCCGAAGATTCAGCAAGATGCATCTCAGGCAATTCTTTACCCTGTTGCTTCCTCCGGCCGCGGCTTCGTT
CCTCGCCCCATTCGACCTCTTCCCGCCGATCAGGCGGTCACGGTAGCCAACCCTGGCGGCTACCCACATCGCCCCGTCGTCACTTTCCCGCATCGGCCGATTGGGTCGCC
TCATTTGGACTCCATGAGCCATCCAATGCACTTGGCGCGACCTCCCAATTTGCCGCAGCAGTTGATTTCCCTTTCTGGGTCCGCCATTTCGGGCTCGATTAAGGGTGCCC
CCAATTCCTCTGATCCAAAGGTTTTTCCTCCATCAACAATCTGCGAGTCAAATGGATGTAAAGAAATGAGAGTCAGAGATGATGCTCTTGCTGTGATTAGAGATCGAAAA
GTCCGAATAACTGATGGGGCTTCTCTTTATGCACTTTGTCGATCATGGTTGAGGAATGGCTCTCATGAAGAATGCCAGCCACAATATGGGAACTTTTTGAGGCCTCTTCC
AAGACCTTTGCCCATACCCGTGGCTGGTGCTGTTTCATCGCAGAAGAAGGAAGTCGTCGGAGACGAAGTTGATGAGGAAGATAAGGATGAGGGATCCATTGAGCAGTTGT
CATCGCAAGATCTATTAAAACGGCATGTTAAACGAGCAAAGAAAGTCCGATCACGATTGAGGGAAGAACGGTTGCATCGAATTGAAAGATACAAAACCCGGCTCGCTCTT
CTCCTTCCTCCGCCAGTCGAACAGTTGAGAACTGATAATATTACTGGAAGCTAA
mRNA sequence
ATGCCCGACGACTCCCTCCCCATCTCCACCAACACGACGCCCGCCGCCGTCGTTGCCCCCACCACCACTGTCGCTCCGGGTTCATTAACTCTAACTACTGCTGCAGCCAC
TACCACTGCCGCTGTTATTGCTCGTCCGCTTGCGAATCAAGCTCCGTCGAGACCCATTTCTTCAATTCCTCAAACCCACCATCTCCACTACCCTCCTCAAGCCCTCTACG
CGGCGCAGCGCATCCCTATTCGAACTCCTAACCACCAATTGCCGAAGATTCAGCAAGATGCATCTCAGGCAATTCTTTACCCTGTTGCTTCCTCCGGCCGCGGCTTCGTT
CCTCGCCCCATTCGACCTCTTCCCGCCGATCAGGCGGTCACGGTAGCCAACCCTGGCGGCTACCCACATCGCCCCGTCGTCACTTTCCCGCATCGGCCGATTGGGTCGCC
TCATTTGGACTCCATGAGCCATCCAATGCACTTGGCGCGACCTCCCAATTTGCCGCAGCAGTTGATTTCCCTTTCTGGGTCCGCCATTTCGGGCTCGATTAAGGGTGCCC
CCAATTCCTCTGATCCAAAGGTTTTTCCTCCATCAACAATCTGCGAGTCAAATGGATGTAAAGAAATGAGAGTCAGAGATGATGCTCTTGCTGTGATTAGAGATCGAAAA
GTCCGAATAACTGATGGGGCTTCTCTTTATGCACTTTGTCGATCATGGTTGAGGAATGGCTCTCATGAAGAATGCCAGCCACAATATGGGAACTTTTTGAGGCCTCTTCC
AAGACCTTTGCCCATACCCGTGGCTGGTGCTGTTTCATCGCAGAAGAAGGAAGTCGTCGGAGACGAAGTTGATGAGGAAGATAAGGATGAGGGATCCATTGAGCAGTTGT
CATCGCAAGATCTATTAAAACGGCATGTTAAACGAGCAAAGAAAGTCCGATCACGATTGAGGGAAGAACGGTTGCATCGAATTGAAAGATACAAAACCCGGCTCGCTCTT
CTCCTTCCTCCGCCAGTCGAACAGTTGAGAACTGATAATATTACTGGAAGCTAAGTGTTGATCCCAAGAAACTTGCCGAATTCTCTGTGGATCTTTGTCCAAATCGTCTT
CCCAACGACAGATGAACTCGAGAAACATTCAACGGAAGCGCGGGGAGGATTACATGTAGGTAAGCATAGGAAGGTATTTGATATAGTTAGGTGATTCTTGTTCTGTCTTG
GCAGCAGTTTGTATCATGGTGCTTTCTATCAAGTTTTAAAAAAAGAAGACACAATGAGTAGGGCCATGTGAATAGCGAATAATTGTTCCTTACACTCTTGTGTGTGTGCG
CGCGCGCTTTGACAAATACCATTAGTATCCATTGATGATGATGATGAT
Protein sequence
MPDDSLPISTNTTPAAVVAPTTTVAPGSLTLTTAAATTTAAVIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKIQQDASQAILYPVASSGRGFV
PRPIRPLPADQAVTVANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTICESNGCKEMRVRDDALAVIRDRK
VRITDGASLYALCRSWLRNGSHEECQPQYGNFLRPLPRPLPIPVAGAVSSQKKEVVGDEVDEEDKDEGSIEQLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLAL
LLPPPVEQLRTDNITGS
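
The protein sequence above corresponds to the CDS sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame. The names `CODON_TABLE` and `translate` are hypothetical helpers written for illustration and are not taken from CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored. A codon containing a
    character other than A, C, G, or T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS sequence listed above.
print(translate("ATGCCCGACGACTCCCTCCCCATCTCCACC"))
print(translate("ATTACTGGAAGCTAA"))
```

The two fragments translate to the first ten residues and the last four residues of the protein sequence. Pasting the full CDS (line breaks included) into `translate` should reproduce the full protein sequence under the same assumptions.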