CuGenDBv2

Chy4G080280 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy4G080280
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: Mucin-2 isoform X2
Genome location: chrH04:19573593..19578605
RNA-Seq Expression: Chy4G080280
Synteny: Chy4G080280
Gene Ontology terms: GO:0017053 - transcriptional repressor complex (cellular component)
InterPro domains: IPR028226 - Protein LIN37
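The genome location field uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the snippet below parses it and computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chrH04:19573593..19578605")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 5013 bp on chrH04.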


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004149622.3 uncharacterized protein LOC101211370 [Cucumis sativus] (e-value: 3.46e-219; identity: 97.01%)
Query:  TTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKLHQDASQGILYPVASSGRGFVPR
        TTTTVTPISFTLTTAAAATTTTTAAA TAAIARPLANQAPS+PISSIPQTHHLHYPSQ LYQPQSIPVRTPN QLPKLHQDASQ ILYPVASSGRGFVPR
Subjt:  TTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKLHQDASQGILYPVASSGRGFVPR

Query:  TIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPKAFPPSTICESNGCKEMRVRDDT
        TIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSS SGSIKCAPNSSDPKAFPP TICESNGCKEMRVRDDT
Subjt:  TIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPKAFPPSTICESNGCKEMRVRDDT

Query:  LCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSIEHLSTQELLKRHVRRAKKVRSR
        LCVVRDRKVRITDGASLYALCRSWLRNGS EESQPQYGSFFRSLPRPLPI VAGAAP QKKEVVKEEVDEKDKDEGSIEHLSTQELLKRHVRRAKKVRSR
Subjt:  LCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSIEHLSTQELLKRHVRRAKKVRSR

Query:  LREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
        LREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
Subjt:  LREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS

XP_008461764.1 PREDICTED: uncharacterized protein LOC103500291 isoform X1 [Cucumis melo] (e-value: 6.38e-226; identity: 94.99%)
Query:  MPVDSPLIPTNTTSAATSALTTTT----VTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLP
        MPVDSPLIPTNTTSAATSALTTTT    VTPISFTLT  AAAT  TTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQ LYQPQSIPVRTPNTQLP
Subjt:  MPVDSPLIPTNTTSAATSALTTTT----VTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLP

Query:  KLHQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSD
        KLHQDASQ ILYPVASSGRGFVPR IRPLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSS SGSIK APNSSD
Subjt:  KLHQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSD

Query:  PKAFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEG
        PKAFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGS EESQPQYGSF RSLPRPLPI VAGAAPSQKKEVVKEEVDE+DKDEG
Subjt:  PKAFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEG

Query:  SIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
        SIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
Subjt:  SIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS

XP_022982576.1 uncharacterized protein LOC111481411 [Cucurbita maxima] (e-value: 4.93e-203; identity: 86.83%)
Query:  MPVDSPLIPTNTT--SAATSALTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKL
        MPVDSPLIPTNTT  +AA +A TTTTV PISFTL+ AAA T       A AAIARPLANQAPSRPISSIPQTHHLHYP Q LY  Q IPVRTPNTQLPKL
Subjt:  MPVDSPLIPTNTT--SAATSALTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKL

Query:  HQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPK
         QDASQ ILYPVASSGRGFVPR IRPLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSS SGSIK APNSSDPK
Subjt:  HQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPK

Query:  AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSI
         FPPSTI E+NGCKEMRVRDD LCVVRDRKV ITDGASLYALCRSWLRNGS EESQPQYGSF RSLPRPLPI V GA PSQKKEVV+EEVDEKDKDE SI
Subjt:  AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSI

Query:  EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
        E LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS

XP_023526543.1 uncharacterized protein LOC111790016 [Cucurbita pepo subsp. pepo] (e-value: 1.22e-202; identity: 87.32%)
Query:  MPVDSPLIPTNTTSAATSALTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKLHQ
        MPVDSPLIPTNTT AAT+  TTTTV PISFTLTTAAA T       A AAIARPLANQAPSRPISSIPQTHHLHYP Q LY  Q IPVRTPNTQLPKL Q
Subjt:  MPVDSPLIPTNTTSAATSALTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKLHQ

Query:  DASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPKAF
        DASQ ILYPVASSGRGFVPR IRPLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSS SGSIK APNSSDPK F
Subjt:  DASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPKAF

Query:  PPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSIEH
        PPSTI E+NGCKEMRVRDD LCVVRDRKV ITDGASLYALCRSWLRNGS EESQPQYGSF RSLPRPLPI V GA PSQKKEVV+E VDEKDKDE SIE 
Subjt:  PPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSIEH

Query:  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
        L TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS

XP_038903387.1 uncharacterized protein LOC120089997 [Benincasa hispida] (e-value: 9.72e-208; identity: 89.36%)
Query:  MPVDSPLIPTNTTSAATSALTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKLHQ
        MPVDSPLIPTNTT+AA S   TTTVTPISFTLTTAAA   TTTAAAA AAIARPLANQAPSRPISSIPQTHHLHYP Q LY  QSIPVRTPN QLPKLHQ
Subjt:  MPVDSPLIPTNTTSAATSALTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKLHQ

Query:  DASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQ--LIPFSGSSTSGSIKCAPNSSDPK
        DASQ ILYPVASSGRGFVPR IRPLPADQ VTLANPGGY +RPVVTFPHRPIGS HLDSMSHPMHM RPPNLQQQ  LIPFSGSS SGSIK  PNSSDPK
Subjt:  DASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQ--LIPFSGSSTSGSIKCAPNSSDPK

Query:  AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSI
         F PSTICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGS EESQPQYG+F RSLPRPLPI  AGA PSQKKEVV+EEVDE+DKDEGSI
Subjt:  AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSI

Query:  EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
        EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LCG5 Uncharacterized protein (e-value: 2.1e-181; identity: 96.36%)
Query:  MPVDSPLIPTNTTSAATSA--LTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKL
        MPVDSPLIPTNTTSAATSA   TTTTVTPISFTLTTAAAATTTTT AAATAAIARPLANQAPS+PISSIPQTHHLHYPSQ LYQPQSIPVRTPN QLPKL
Subjt:  MPVDSPLIPTNTTSAATSA--LTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKL

Query:  HQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPK
        HQDASQ ILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSS SGSIKCAPNSSDPK
Subjt:  HQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPK

Query:  AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSI
        AFPP TICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGS EESQPQYGSFFRSLPRPLPI VAGAAP QKKEVVKEEVDEKDKDEGSI
Subjt:  AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSI

Query:  EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
        EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
Subjt:  EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS

A0A1S3CFG6 uncharacterized protein LOC103500291 isoform X1 (e-value: 2.2e-178; identity: 94.99%)
Query:  MPVDSPLIPTNTTSAATSAL----TTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLP
        MPVDSPLIPTNTTSAATSAL    TTTTVTPISFTLT  AAAT  TTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQ LYQPQSIPVRTPNTQLP
Subjt:  MPVDSPLIPTNTTSAATSAL----TTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLP

Query:  KLHQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSD
        KLHQDASQ ILYPVASSGRGFVPR IRPLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSS SGSIK APNSSD
Subjt:  KLHQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSD

Query:  PKAFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEG
        PKAFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGS EESQPQYGSF RSLPRPLPI VAGAAPSQKKEVVKEEVDE+DKDEG
Subjt:  PKAFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEG

Query:  SIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
        SIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
Subjt:  SIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS

A0A5A7TZA2 Mucin-2 isoform X2 (e-value: 2.2e-178; identity: 94.99%)
Query:  MPVDSPLIPTNTTSAATSAL----TTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLP
        MPVDSPLIPTNTTSAATSAL    TTTTVTPISFTLT  AAAT  TTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQ LYQPQSIPVRTPNTQLP
Subjt:  MPVDSPLIPTNTTSAATSAL----TTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLP

Query:  KLHQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSD
        KLHQDASQ ILYPVASSGRGFVPR IRPLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSS SGSIK APNSSD
Subjt:  KLHQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSD

Query:  PKAFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEG
        PKAFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGS EESQPQYGSF RSLPRPLPI VAGAAPSQKKEVVKEEVDE+DKDEG
Subjt:  PKAFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEG

Query:  SIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
        SIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
Subjt:  SIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS

A0A6J1F1T6 uncharacterized protein LOC111441417 (e-value: 2.7e-160; identity: 86.55%)
Query:  MPVDSPLIPTNTTSA--ATSALTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKL
        MPVDSPLIPTNTT A  A +A TTTTV PISFTLTTAAA T       + AAIARPLANQAPSRPISSIPQTHHLHYP Q LY  Q IPVRTPNTQLPKL
Subjt:  MPVDSPLIPTNTTSA--ATSALTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKL

Query:  HQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPK
         QDASQ ILYPVASSGRGFVPR IRPLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSS SGSIK APNSSDPK
Subjt:  HQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPK

Query:  AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSI
         FPPSTI E+NGCKEMRVRDD LCVVRDRKV ITDGASLYALCRSWLRNGS EESQPQYGSF RSLPRPLPI V GA PSQKKEVV+E VDEKDKDE SI
Subjt:  AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSI

Query:  EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
        E LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDN+TGS
Subjt:  EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS

A0A6J1IZP9 uncharacterized protein LOC111481411 (e-value: 7.0e-161; identity: 86.83%)
Query:  MPVDSPLIPTNTT--SAATSALTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKL
        MPVDSPLIPTNTT  +AA +A TTTTV PISFTL+ AAA T       A AAIARPLANQAPSRPISSIPQTHHLHYP Q LY  Q IPVRTPNTQLPKL
Subjt:  MPVDSPLIPTNTT--SAATSALTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKL

Query:  HQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPK
         QDASQ ILYPVASSGRGFVPR IRPLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSS SGSIK APNSSDPK
Subjt:  HQDASQGILYPVASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPK

Query:  AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSI
         FPPSTI E+NGCKEMRVRDD LCVVRDRKV ITDGASLYALCRSWLRNGS EESQPQYGSF RSLPRPLPI V GA PSQKKEVV+EEVDEKDKDE SI
Subjt:  AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSI

Query:  EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS
        E LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  EHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPIEQLRTDNVTGS

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G04930.1 hydroxyproline-rich glycoprotein family protein (e-value: 2.8e-37; identity: 36.97%)
Query:  SFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQP---------------QSIPVRTPNTQLPKLHQDASQGILYPVASS
        S +LT + + +T +          RP  +Q P  P    P T+    P  PL  P                SIPVR       +  QD S  +LYP A  
Subjt:  SFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQP---------------QSIPVRTPNTQLPKLHQDASQGILYPVASS

Query:  GRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTS-GSIKCAPNSSDPK-AFPPSTICESNGC
        GRGF  R +R   AD +VT  N  GYP RP  T+   P     ++S+       R P ++       GS    G I+ +P    P+ A PP++I +++  
Subjt:  GRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTS-GSIKCAPNSSDPK-AFPPSTICESNGC

Query:  KEMRVRDDTLCVVRDRKVRITDG-ASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSIEHLSTQELLKRH
        ++ R +D  L VVR RKVRIT+G +SLY+L RSWL+NG+H   QPQ     + LP+PLP+ +     S   +  +E  DE  +DE +++ LS ++LLKRH
Subjt:  KEMRVRDDTLCVVRDRKVRITDG-ASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSIEHLSTQELLKRH

Query:  VRRAKKVRSRLREERLQRIERYKTRLALLL
        + RAKKVR++LREER +RI RYK R+ L+L
Subjt:  VRRAKKVRSRLREERLQRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein (e-value: 3.8e-34; identity: 34.72%)
Query:  SFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQP---------------QSIPVRTPNTQLPKLHQDASQGILYPVASS
        S +LT + + +T +          RP  +Q P  P    P T+    P  PL  P                SIPVR       +  QD S  +LYP A  
Subjt:  SFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQP---------------QSIPVRTPNTQLPKLHQDASQGILYPVASS

Query:  GRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQ----------QLIPFSGSS-------TSGSIKCAPNSSD
        GRGF  R +R   AD +VT  N  GYP RP  T+   P     ++S+       R P ++            L P   S        +SG I       D
Subjt:  GRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQ----------QLIPFSGSS-------TSGSIKCAPNSSD

Query:  PK---------------AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDG-ASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQK
        PK               A PP++I +++  ++ R +D  L VVR RKVRIT+G +SLY+L RSWL+NG+H   QPQ     + LP+PLP+ +     S  
Subjt:  PK---------------AFPPSTICESNGCKEMRVRDDTLCVVRDRKVRITDG-ASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQK

Query:  KEVVKEEVDEKDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLL
         +  +E  DE  +DE +++ LS ++LLKRH+ RAKKVR++LREER +RI RYK R+ L+L
Subjt:  KEVVKEEVDEKDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLL

AT2G32840.1 proline-rich family protein (e-value: 3.5e-48; identity: 43.2%)
Query:  TAAAATTTTTAAAATA----AIARPLANQAPSRPISSIPQTH--HLHYPSQPLYQPQSIPVRTPNTQLPKLHQDA---SQGILYPVASSGRGFVPRTIRP
        + A +T   TA+ +       +  P +   P  P SS       H H+P Q +Y    +P+R  N+     HQ        ++YP  SSGRGF  R +R 
Subjt:  TAAAATTTTTAAAATA----AIARPLANQAPSRPISSIPQTH--HLHYPSQPLYQPQSIPVRTPNTQLPKLHQDA---SQGILYPVASSGRGFVPRTIRP

Query:  LPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPKAFP-PSTICESNGCKEMRVRDDT
             A  V   +PGGY P  PV  + H    S +LD M+  M    P N Q      S    SG +K  P+   P+A P P++I +++G K+ R RDD 
Subjt:  LPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPKAFP-PSTICESNGCKEMRVRDDT

Query:  LCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDK-DEGSIEHLSTQELLKRHVRRAKKVRS
        L +VR RKVRIT+GASLY+LCRSWLRNG+HE  +PQ       LP+PL  PV     S  K++V+E + E+DK DE S++HLS  +LLKRH+ RAKKVR+
Subjt:  LCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDK-DEGSIEHLSTQELLKRHVRRAKKVRS

Query:  RLREERLQRIERYKTRLALLLPPPIEQLRTD
        RLREERL+RI RYK RLALLLPP  EQ R +
Subjt:  RLREERLQRIERYKTRLALLLPPPIEQLRTD

AT2G32840.2 proline-rich family protein (e-value: 3.7e-29; identity: 37.59%)
Query:  TAAAATTTTTAAAATA----AIARPLANQAPSRPISSIPQTH--HLHYPSQPLYQPQSIPVRTPNTQLPKLHQDA---SQGILYPVASSGRGFVPRTIRP
        + A +T   TA+ +       +  P +   P  P SS       H H+P Q +Y    +P+R  N+     HQ        ++YP  SSGRGF  R +R 
Subjt:  TAAAATTTTTAAAATA----AIARPLANQAPSRPISSIPQTH--HLHYPSQPLYQPQSIPVRTPNTQLPKLHQDA---SQGILYPVASSGRGFVPRTIRP

Query:  LPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPKAFP-PSTICESNGCKEMRVRDDT
             A  V   +PGGY P  PV  + H    S +LD M+  M    P N Q      S    SG +K  P+   P+A P P++I +++G K+ R RDD 
Subjt:  LPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPKAFP-PSTICESNGCKEMRVRDDT

Query:  LCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKD
        L +VR RKVRIT+GASLY+LCRSWLRNG+HE  +PQ       LP+PL  PV     S  K++V+E + E+DK+
Subjt:  LCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKD


Sequences
CDS sequence
ATGCCCGTCGACTCCCCTCTCATCCCCACCAACACCACCTCCGCCGCCACTTCCGCCCTTACCACCACTACCGTCACTCCGATTTCCTTCACTCTAACTACAGCTGCTGC
AGCCACCACCACTACTACCGCCGCCGCTGCCACTGCTGCCATTGCTCGTCCGCTTGCGAATCAAGCACCATCCAGACCCATTTCTTCAATTCCTCAAACCCACCATCTCC
ACTACCCTTCTCAACCCCTCTACCAGCCTCAGTCCATCCCCGTTCGAACTCCCAACACCCAATTGCCCAAGCTTCATCAGGATGCGTCTCAGGGAATTCTTTACCCTGTC
GCCTCCTCTGGCCGCGGCTTCGTTCCTCGCACCATTCGTCCCCTTCCCGCCGATCAGGCCGTCACCCTGGCTAACCCTGGAGGCTACCCACATCGCCCCGTTGTCACTTT
CCCCCATCGCCCGATTGGGTCGCCTCATTTGGACTCCATGAGCCATCCAATGCATATGACTCGACCTCCCAATTTGCAGCAGCAGCTTATTCCCTTTTCTGGGTCTTCCA
CTTCGGGTTCGATTAAATGTGCCCCCAATTCCTCTGATCCCAAGGCTTTTCCTCCATCAACAATCTGCGAGTCAAATGGGTGTAAAGAAATGAGAGTTAGAGACGACACT
CTTTGTGTGGTTAGAGATCGAAAAGTTCGGATAACTGATGGGGCTTCTCTTTATGCGCTTTGTCGATCATGGCTGAGGAATGGTTCTCATGAAGAAAGCCAGCCACAATA
TGGAAGCTTTTTTAGGTCACTTCCGAGACCACTGCCCATCCCTGTGGCTGGTGCTGCTCCATCACAGAAAAAGGAAGTTGTCAAAGAAGAAGTTGATGAGAAAGATAAGG
ATGAGGGATCCATTGAGCACTTGTCAACGCAAGAGTTATTGAAAAGACATGTTAGACGTGCAAAGAAAGTCCGATCACGATTGAGAGAAGAACGGTTACAACGCATTGAA
AGATACAAAACCAGACTCGCTCTTCTCCTTCCTCCTCCAATTGAGCAGTTGAGGACGGATAACGTTACTGGAAGCTGA
mRNA sequence
ATGCCCGTCGACTCCCCTCTCATCCCCACCAACACCACCTCCGCCGCCACTTCCGCCCTTACCACCACTACCGTCACTCCGATTTCCTTCACTCTAACTACAGCTGCTGC
AGCCACCACCACTACTACCGCCGCCGCTGCCACTGCTGCCATTGCTCGTCCGCTTGCGAATCAAGCACCATCCAGACCCATTTCTTCAATTCCTCAAACCCACCATCTCC
ACTACCCTTCTCAACCCCTCTACCAGCCTCAGTCCATCCCCGTTCGAACTCCCAACACCCAATTGCCCAAGCTTCATCAGGATGCGTCTCAGGGAATTCTTTACCCTGTC
GCCTCCTCTGGCCGCGGCTTCGTTCCTCGCACCATTCGTCCCCTTCCCGCCGATCAGGCCGTCACCCTGGCTAACCCTGGAGGCTACCCACATCGCCCCGTTGTCACTTT
CCCCCATCGCCCGATTGGGTCGCCTCATTTGGACTCCATGAGCCATCCAATGCATATGACTCGACCTCCCAATTTGCAGCAGCAGCTTATTCCCTTTTCTGGGTCTTCCA
CTTCGGGTTCGATTAAATGTGCCCCCAATTCCTCTGATCCCAAGGCTTTTCCTCCATCAACAATCTGCGAGTCAAATGGGTGTAAAGAAATGAGAGTTAGAGACGACACT
CTTTGTGTGGTTAGAGATCGAAAAGTTCGGATAACTGATGGGGCTTCTCTTTATGCGCTTTGTCGATCATGGCTGAGGAATGGTTCTCATGAAGAAAGCCAGCCACAATA
TGGAAGCTTTTTTAGGTCACTTCCGAGACCACTGCCCATCCCTGTGGCTGGTGCTGCTCCATCACAGAAAAAGGAAGTTGTCAAAGAAGAAGTTGATGAGAAAGATAAGG
ATGAGGGATCCATTGAGCACTTGTCAACGCAAGAGTTATTGAAAAGACATGTTAGACGTGCAAAGAAAGTCCGATCACGATTGAGAGAAGAACGGTTACAACGCATTGAA
AGATACAAAACCAGACTCGCTCTTCTCCTTCCTCCTCCAATTGAGCAGTTGAGGACGGATAACGTTACTGGAAGCTGA
Protein sequence
MPVDSPLIPTNTTSAATSALTTTTVTPISFTLTTAAAATTTTTAAAATAAIARPLANQAPSRPISSIPQTHHLHYPSQPLYQPQSIPVRTPNTQLPKLHQDASQGILYPV
ASSGRGFVPRTIRPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMTRPPNLQQQLIPFSGSSTSGSIKCAPNSSDPKAFPPSTICESNGCKEMRVRDDT
LCVVRDRKVRITDGASLYALCRSWLRNGSHEESQPQYGSFFRSLPRPLPIPVAGAAPSQKKEVVKEEVDEKDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIE
RYKTRLALLLPPPIEQLRTDNVTGS
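
The CDS and protein sequences listed above should correspond to each other under the standard genetic code. The following is a minimal sketch of how that relationship could be checked. The `translate` helper is hypothetical and is not part of CuGenDB; it assumes the standard (nuclear) codon table and an in-frame CDS that begins at the first base.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First five codons and last five codons of the CDS listed above.
print(translate("ATGCCCGTCGACTCC"))
print(translate("GTTACTGGAAGCTGA"))
```

Passing the full CDS text, line breaks included, to `translate` and comparing the result with the protein sequence is one way to confirm that the two listings agree.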