; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh01G002080 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh01G002080
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionMucin-2 isoform X2
Genome locationCma_Chr01:943836..947313
RNA-Seq ExpressionCmaCh01G002080
SyntenyCmaCh01G002080
Gene Ontology termsGO:0017053 - transcriptional repressor complex (cellular component)
InterPro domainsIPR028226 - Protein LIN37


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606861.1 hypothetical protein SDJN03_00203, partial [Cucurbita argyrosperma subsp. sororia]1.7e-18296.83Show/hide
Query:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY
        MPDDSLPISTNTTPAAV A TTTVAPGSLTLTTAAATTT A IARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPK+QQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLP DQAVTVANPGGYPHRP VTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLIS+SGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGS EECQPQYGNFLRPLPRPLPIPVAGAV SQKKEVV DEVDEEDKDEGSIEQLSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

XP_022934169.1 uncharacterized protein LOC111441417 [Cucurbita moschata]1.4e-16087.43Show/hide
Query:  MPDDSLPISTNTTPAAVAA---TTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQA
        MP DS  I TNTTPAAVAA   TTTTVAP S TLTTAAATT+ AAIARPLANQAPSRPISSIPQTHHLHYPPQALY AQ IP+RTPN QLPKLQQDASQA
Subjt:  MPDDSLPISTNTTPAAVAA---TTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQA

Query:  ILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTI
        ILYPVASSGRGFVPRPIRPLP DQ VTVANPGGYPHRP V+FPHRPIGSPHLDSMSHPMH+ARPPNL QQLI  SGS+ISGSIKGAPNSSDPKVFPPSTI
Subjt:  ILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTI

Query:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQD
         E+NGCKEMRVRDDAL V+RDRKV ITDGASLYALCRSWLRNGSQEE QPQYG+FLR LPRPLPI V GAVPSQKKEVV + VDE+DKDE SIE LS+Q+
Subjt:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQD

Query:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        LLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

XP_022949214.1 uncharacterized protein LOC111452632 [Cucurbita moschata]5.0e-18296.54Show/hide
Query:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY
        MPDDSLPISTNTTPAA  A TTTVAPGSLTLTTAAATTT A IARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPK+QQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLP DQAVTVANPGGYPHRP VTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLIS+SGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGS EECQPQYGNFLRPLPRPLPIPVAGAV SQKKEVV DEVDEEDKDEGSIEQLSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

XP_022998467.1 uncharacterized protein LOC111493092 [Cucurbita maxima]1.6e-188100Show/hide
Query:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY
        MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

XP_023525420.1 uncharacterized protein LOC111789035 [Cucurbita pepo subsp. pepo]1.2e-18397.41Show/hide
Query:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY
        MPDDSLPISTNTTPAAVAATTTTVAPGS+TLTTAAATTT AAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPK+QQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLPTDQAVTVANPGGYP RP VTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLIS+SGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGS EECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVV DEVDEEDKDEGSIE+LSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

TrEMBL top hitse value%identityAlignment
A0A5A7TZA2 Mucin-2 isoform X23.9e-15683.47Show/hide
Query:  MPDDSLPISTNTTPAAVAA-----TTTTVAPGSLTLTTAA-----ATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKL
        MP DS  I TNTT AA +A     TTTTV P S TLT AA     A    AAIARPLANQAPSRPISSIPQTHHLHYP QALY  Q IP+RTPN QLPKL
Subjt:  MPDDSLPISTNTTPAAVAA-----TTTTVAPGSLTLTTAA-----ATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKL

Query:  QQDASQAILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPK
         QDASQAILYPVASSGRGFVPRPIRPLP DQAVT+ANPGGYPHRP VTFPHRPIGSPHLDSMSHPMH+ RPPNL QQLI  SGS+ISGSIKGAPNSSDPK
Subjt:  QQDASQAILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPK

Query:  VFPPSTICESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSI
         FPPSTICESNGCKEMRVRDD L V+RDRKVRITDGASLYALCRSWLRNGSQEE QPQYG+FLR LPRPLPI VAGA PSQKKEVV +EVDEEDKDEGSI
Subjt:  VFPPSTICESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSI

Query:  EQLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        E LS+Q+LLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPP+EQLRTDN+TGS
Subjt:  EQLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1F1T6 uncharacterized protein LOC1114414176.8e-16187.43Show/hide
Query:  MPDDSLPISTNTTPAAVAA---TTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQA
        MP DS  I TNTTPAAVAA   TTTTVAP S TLTTAAATT+ AAIARPLANQAPSRPISSIPQTHHLHYPPQALY AQ IP+RTPN QLPKLQQDASQA
Subjt:  MPDDSLPISTNTTPAAVAA---TTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQA

Query:  ILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTI
        ILYPVASSGRGFVPRPIRPLP DQ VTVANPGGYPHRP V+FPHRPIGSPHLDSMSHPMH+ARPPNL QQLI  SGS+ISGSIKGAPNSSDPKVFPPSTI
Subjt:  ILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTI

Query:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQD
         E+NGCKEMRVRDDAL V+RDRKV ITDGASLYALCRSWLRNGSQEE QPQYG+FLR LPRPLPI V GAVPSQKKEVV + VDE+DKDE SIE LS+Q+
Subjt:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQD

Query:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        LLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1GC68 uncharacterized protein LOC1114526322.4e-18296.54Show/hide
Query:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY
        MPDDSLPISTNTTPAA  A TTTVAPGSLTLTTAAATTT A IARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPK+QQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLP DQAVTVANPGGYPHRP VTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLIS+SGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGS EECQPQYGNFLRPLPRPLPIPVAGAV SQKKEVV DEVDEEDKDEGSIEQLSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1IZP9 uncharacterized protein LOC1114814118.3e-15986.29Show/hide
Query:  MPDDSLPISTNTTPAAVAA---TTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQA
        MP DS  I TNTTPAA AA   TTTTVAP S TL+ AAATT  AAIARPLANQAPSRPISSIPQTHHLHYPPQALY AQ IP+RTPN QLPKLQQDASQA
Subjt:  MPDDSLPISTNTTPAAVAA---TTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQA

Query:  ILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTI
        ILYPVASSGRGFVPRPIRPLP DQ VTVANPGGYPHRP V+FPHRPIGSPHLDSMSHPMH+ARPPNL QQLI  SGS+ISGSIK APNSSDPKVFPPSTI
Subjt:  ILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTI

Query:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQD
         E+NGCKEMRVRDDAL V+RDRKV ITDGASLYALCRSWLRNGSQEE QPQYG+FLR LPRPLPI V GAVPSQKKEVV +EVDE+DKDE SIE LS+Q+
Subjt:  CESNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQD

Query:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        LLKRHV+RAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  LLKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1K833 uncharacterized protein LOC1114930927.7e-189100Show/hide
Query:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY
        MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY
Subjt:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILY

Query:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES
        PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES
Subjt:  PVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICES

Query:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK
        NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK
Subjt:  NGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLK

Query:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTDNITGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G04930.1 hydroxyproline-rich glycoprotein family protein1.3e-4237.83Show/hide
Query:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYA----AQRIPIRTPNHQLPKLQQDASQ
        +PD +   S   +P+   A+ T V P    + T     +    A P       RPI+  P  H   +  Q+ Y+    A  IP+R       +  QD S 
Subjt:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYA----AQRIPIRTPNHQLPKLQQDASQ

Query:  AILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNL-PQQLISVSGSAISGSIKGAPNSSDPKVFPPS
        A+LYP A  GRGF  RP+R    D +VT  N  GYP RP+ T+   P     ++S+       R P + P   + +      G I+ +P    P+V PP 
Subjt:  AILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNL-PQQLISVSGSAISGSIKGAPNSSDPKVFPPS

Query:  T-ICESNGCKEMRVRDDALAVIRDRKVRITDG-ASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVA--GAVPSQKKEVVVDEVDEEDKDEGSIE
        T I +++  ++ R +D ALAV+R RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++PLP+PLP+ +    +VP    E   +  DE+ +DE +++
Subjt:  T-ICESNGCKEMRVRDDALAVIRDRKVRITDG-ASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVA--GAVPSQKKEVVVDEVDEEDKDEGSIE

Query:  QLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLL
        QLS +DLLKRH++RAKKVR++LREER  RI RYK R+ L+L
Subjt:  QLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein5.0e-3935.58Show/hide
Query:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYA----AQRIPIRTPNHQLPKLQQDASQ
        +PD +   S   +P+   A+ T V P    + T     +    A P       RPI+  P  H   +  Q+ Y+    A  IP+R       +  QD S 
Subjt:  MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYA----AQRIPIRTPNHQLPKLQQDASQ

Query:  AILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNL----------PQQLISVSGS-------AISGS
        A+LYP A  GRGF  RP+R    D +VT  N  GYP RP+ T+   P     ++S+       R P +          P  L  +  S         SG 
Subjt:  AILYPVASSGRGFVPRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNL----------PQQLISVSGS-------AISGS

Query:  IKGAPNSSDPKVF---------------PPSTICESNGCKEMRVRDDALAVIRDRKVRITDG-ASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIP
        I G     DPK                 PP++I +++  ++ R +D ALAV+R RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++PLP+PLP+ 
Subjt:  IKGAPNSSDPKVF---------------PPSTICESNGCKEMRVRDDALAVIRDRKVRITDG-ASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIP

Query:  VA--GAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLL
        +    +VP    E   +  DE+ +DE +++QLS +DLLKRH++RAKKVR++LREER  RI RYK R+ L+L
Subjt:  VA--GAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLALLL

AT2G32840.1 proline-rich family protein3.1e-4943.31Show/hide
Query:  NTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQRIPIRTPN------HQLPKLQQDASQAILYPV
        N  P+   A +T +   S +  T   T     +  P +   P  P SS       H H+P Q +Y    +PIR  N      HQ P    D S +++YP 
Subjt:  NTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQRIPIRTPN------HQLPKLQQDASQAILYPV

Query:  ASSGRGFVPRPIRPLPTDQAVTVA--NPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFP-PSTICE
         SSGRGF  RP+R      A  V   +PGGY  R  V   H      +LD M+  M  A P N  QQ    S    SG +KG P+   P+  P P++I +
Subjt:  ASSGRGFVPRPIRPLPTDQAVTVA--NPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFP-PSTICE

Query:  SNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDK-DEGSIEQLSSQDL
        ++G K+ R RDDAL ++R RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP+       S  K++V + + EEDK DE S++ LS  DL
Subjt:  SNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDK-DEGSIEQLSSQDL

Query:  LKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTD
        LKRH+ RAKKVR+RLREERL RI RYK RLALLLPP  EQ R +
Subjt:  LKRHVKRAKKVRSRLREERLHRIERYKTRLALLLPPPVEQLRTD

AT2G32840.2 proline-rich family protein1.9e-3037.98Show/hide
Query:  NTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQRIPIRTPN------HQLPKLQQDASQAILYPV
        N  P+   A +T +   S +  T   T     +  P +   P  P SS       H H+P Q +Y    +PIR  N      HQ P    D S +++YP 
Subjt:  NTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQRIPIRTPN------HQLPKLQQDASQAILYPV

Query:  ASSGRGFVPRPIRPLPTDQAVTVA--NPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFP-PSTICE
         SSGRGF  RP+R      A  V   +PGGY  R  V   H      +LD M+  M  A P N  QQ    S    SG +KG P+   P+  P P++I +
Subjt:  ASSGRGFVPRPIRPLPTDQAVTVA--NPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFP-PSTICE

Query:  SNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKD
        ++G K+ R RDDAL ++R RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP+       S  K++V + + EEDK+
Subjt:  SNGCKEMRVRDDALAVIRDRKVRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCGACGACTCCCTCCCCATCTCCACCAACACGACGCCGGCCGCCGTCGCTGCCACCACCACCACTGTCGCTCCGGGTTCATTAACTCTAACTACTGCTGCAGCCAC
TACCACTGTCGCTGCTATTGCTCGTCCGCTTGCGAATCAAGCTCCGTCGAGACCCATTTCTTCAATTCCTCAAACCCACCATCTCCACTACCCTCCTCAAGCCCTCTACG
CGGCGCAGCGCATCCCTATTCGAACTCCTAACCACCAATTGCCGAAGCTTCAGCAAGATGCATCTCAGGCAATTCTTTACCCTGTTGCTTCCTCCGGCCGCGGCTTCGTT
CCTCGCCCCATTCGGCCTCTTCCCACCGATCAGGCGGTCACGGTGGCCAACCCTGGCGGCTACCCACATCGCCCCGCCGTCACTTTCCCGCATCGGCCGATTGGGTCGCC
TCATTTGGACTCCATGAGCCATCCAATGCACTTGGCGCGACCTCCCAATTTGCCGCAGCAGTTGATTTCCGTTTCTGGGTCCGCCATTTCGGGCTCGATTAAGGGTGCCC
CCAATTCCTCTGATCCAAAGGTTTTTCCCCCATCAACAATCTGCGAGTCAAATGGATGTAAAGAAATGAGAGTCAGAGATGATGCTCTTGCTGTGATTAGAGATCGAAAA
GTCCGAATAACTGATGGGGCTTCTCTTTATGCACTTTGTCGATCATGGTTGAGGAATGGTTCTCAAGAAGAATGCCAGCCACAATATGGGAACTTTTTGAGGCCTCTTCC
AAGACCTTTGCCCATACCCGTGGCTGGTGCTGTTCCATCGCAGAAGAAGGAAGTCGTCGTAGACGAAGTTGACGAGGAAGATAAGGATGAGGGATCCATTGAGCAGTTGT
CATCGCAAGATCTATTAAAACGGCATGTTAAACGAGCAAAGAAAGTCCGATCACGATTGAGGGAAGAACGGTTGCATCGAATTGAAAGATACAAAACCCGGCTCGCTCTT
CTCCTTCCTCCGCCAGTTGAGCAGTTGAGAACTGATAATATTACTGGAAGCTGA
mRNA sequenceShow/hide mRNA sequence
CTTTCTTTTGGCTAATTTGATATTCAAACAATCAAACCACATACACCCAAATTCCCCGCCACCATGCCCGACGACTCCCTCCCCATCTCCACCAACACGACGCCGGCCGC
CGTCGCTGCCACCACCACCACTGTCGCTCCGGGTTCATTAACTCTAACTACTGCTGCAGCCACTACCACTGTCGCTGCTATTGCTCGTCCGCTTGCGAATCAAGCTCCGT
CGAGACCCATTTCTTCAATTCCTCAAACCCACCATCTCCACTACCCTCCTCAAGCCCTCTACGCGGCGCAGCGCATCCCTATTCGAACTCCTAACCACCAATTGCCGAAG
CTTCAGCAAGATGCATCTCAGGCAATTCTTTACCCTGTTGCTTCCTCCGGCCGCGGCTTCGTTCCTCGCCCCATTCGGCCTCTTCCCACCGATCAGGCGGTCACGGTGGC
CAACCCTGGCGGCTACCCACATCGCCCCGCCGTCACTTTCCCGCATCGGCCGATTGGGTCGCCTCATTTGGACTCCATGAGCCATCCAATGCACTTGGCGCGACCTCCCA
ATTTGCCGCAGCAGTTGATTTCCGTTTCTGGGTCCGCCATTTCGGGCTCGATTAAGGGTGCCCCCAATTCCTCTGATCCAAAGGTTTTTCCCCCATCAACAATCTGCGAG
TCAAATGGATGTAAAGAAATGAGAGTCAGAGATGATGCTCTTGCTGTGATTAGAGATCGAAAAGTCCGAATAACTGATGGGGCTTCTCTTTATGCACTTTGTCGATCATG
GTTGAGGAATGGTTCTCAAGAAGAATGCCAGCCACAATATGGGAACTTTTTGAGGCCTCTTCCAAGACCTTTGCCCATACCCGTGGCTGGTGCTGTTCCATCGCAGAAGA
AGGAAGTCGTCGTAGACGAAGTTGACGAGGAAGATAAGGATGAGGGATCCATTGAGCAGTTGTCATCGCAAGATCTATTAAAACGGCATGTTAAACGAGCAAAGAAAGTC
CGATCACGATTGAGGGAAGAACGGTTGCATCGAATTGAAAGATACAAAACCCGGCTCGCTCTTCTCCTTCCTCCGCCAGTTGAGCAGTTGAGAACTGATAATATTACTGG
AAGCTGAGTGTTGATCCCAAAAAACTTGTCGAATTCGCTGTGGACCTTTGTCCAAATCGTTCTTCCCAACGACAGATGAACTCGAGAAACATTCGACGGAAGCGCGGGGA
GGATTATATGTAGGTAAGAGTAGGAGGGTATTTGATAGAGTTAGGTAATTCTTGTTCTTTCTTGGCAGCAGTTTGTATCATGGTGCTTTGTATCAAGTTTTAAAAAAAAG
AAGACACAATGAGTAGGACCATGTGAATAGTGAATAATTGTTCTTTACACCCTTGTGTGTGCGCGCGCGCGCTTTGTCAAATACCATTAGTATCCATTGATGATATGATG
ATACCTACACATTGCCAAATTGCATGCTTCCAAATCCCTCGTAATCATGAAACTTTGAAATCTTTCTTGGAAA
Protein sequenceShow/hide protein sequence
MPDDSLPISTNTTPAAVAATTTTVAPGSLTLTTAAATTTVAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQRIPIRTPNHQLPKLQQDASQAILYPVASSGRGFV
PRPIRPLPTDQAVTVANPGGYPHRPAVTFPHRPIGSPHLDSMSHPMHLARPPNLPQQLISVSGSAISGSIKGAPNSSDPKVFPPSTICESNGCKEMRVRDDALAVIRDRK
VRITDGASLYALCRSWLRNGSQEECQPQYGNFLRPLPRPLPIPVAGAVPSQKKEVVVDEVDEEDKDEGSIEQLSSQDLLKRHVKRAKKVRSRLREERLHRIERYKTRLAL
LLPPPVEQLRTDNITGS