; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019452 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019452
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionMucin-2 isoform X2
Genome locationChr04:21969904..21975064
RNA-Seq ExpressionHG10019452
SyntenyHG10019452
Gene Ontology termsGO:0017053 - transcriptional repressor complex (cellular component)
InterPro domainsIPR028226 - Protein LIN37


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461764.1 PREDICTED: uncharacterized protein LOC103500291 isoform X1 [Cucumis melo]7.2e-16589.08Show/hide
Query:  MPVDSPLIPPNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL
        MPVDSPLIP NTT+AA        TTTTVTPISFTLT  ATA TTAAAA AAIARPLANQAPSRPISSIPQT HLHYP QALY  QSIPVRTPNTQL KL
Subjt:  MPVDSPLIPPNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL

Query:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPK
        H    Q ILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGS ISGSIKGAPNSSDPK
Subjt:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPK

Query:  VFPPSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI
         FPPS++CES GCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSI
Subjt:  VFPPSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI

Query:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        EHLS QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_022934169.1 uncharacterized protein LOC111441417 [Cucurbita moschata]5.9e-15987.29Show/hide
Query:  MPVDSPLIPPNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--
        MPVDSPLIP NTT     AAA TTTTV PISFTLTTA ATT    +AAAIARPLANQAPSRPISSIPQT HLHYPPQALY AQ IPVRTPNTQL KL   
Subjt:  MPVDSPLIPPNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--

Query:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKVFP
          Q ILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGS ISGSIKGAPNSSDPKVFP
Subjt:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKVFP

Query:  PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL
        PS++ E+ GCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE L
Subjt:  PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL

Query:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        S QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_022982576.1 uncharacterized protein LOC111481411 [Cucurbita maxima]1.5e-15787.01Show/hide
Query:  MPVDSPLIPPNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--
        MPVDSPLIP NTT     A A TTTTV PISFTL+ A ATT    AAAAIARPLANQAPSRPISSIPQT HLHYPPQALY AQ IPVRTPNTQL KL   
Subjt:  MPVDSPLIPPNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--

Query:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKVFP
          Q ILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGS ISGSIK APNSSDPKVFP
Subjt:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKVFP

Query:  PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL
        PS++ E+ GCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEEEVDE+DKDE SIE L
Subjt:  PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL

Query:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        S QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_023526543.1 uncharacterized protein LOC111790016 [Cucurbita pepo subsp. pepo]7.7e-15988.29Show/hide
Query:  MPVDSPLIPPNTT-AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH----QT
        MPVDSPLIP NTT AA  TTTTV PISFTLTTA ATT    AAAAIARPLANQAPSRPISSIPQT HLHYPPQALY AQ IPVRTPNTQL KL     Q 
Subjt:  MPVDSPLIPPNTT-AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH----QT

Query:  ILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKVFPPSSM
        ILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGS ISGSIKGAPNSSDPKVFPPS++
Subjt:  ILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKVFPPSSM

Query:  CESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQE
         E+ GCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE L  QE
Subjt:  CESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQE

Query:  LLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        LLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  LLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_038903387.1 uncharacterized protein LOC120089997 [Benincasa hispida]3.1e-16892.02Show/hide
Query:  MPVDSPLIPPNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH----QTI
        MPVDSPLIP NTTAAAP+TTTVTPISFTLTTA ATTTAAAA AAIARPLANQAPSRPISSIPQT HLHYPPQALYAAQSIPVRTPN QL KLH    Q I
Subjt:  MPVDSPLIPPNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH----QTI

Query:  LYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNL--QQQLIPFSGSPISGSIKGAPNSSDPKVFPPSS
        LYPVASSGRGFVPRPI+PLPADQ VTLANPGGY +RPVVTFPHRPIGS HLDSMSHPMHMARPPNL  QQQLIPFSGS ISGSIKG PNSSDPKVF PS+
Subjt:  LYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNL--QQQLIPFSGSPISGSIKGAPNSSDPKVFPPSS

Query:  MCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQ
        +CES GCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIA AGAVPSQKKEVVEEEVDEEDKDEGSIEHLS Q
Subjt:  MCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQ

Query:  ELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        ELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  ELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

TrEMBL top hitse value%identityAlignment
A0A0A0LCG5 Uncharacterized protein5.2e-16187.36Show/hide
Query:  MPVDSPLIPPNTTAAA-----PTTTTVTPISFTLTTATA--TTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH
        MPVDSPLIP NTT+AA      TTTTVTPISFTLTTA A  TTT AAA AAIARPLANQAPS+PISSIPQT HLHYP QALY  QSIPVRTPN QL KLH
Subjt:  MPVDSPLIPPNTTAAA-----PTTTTVTPISFTLTTATA--TTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH

Query:  ----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKV
            Q ILYPVASSGRGFVPR I+PLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGS ISGSIK APNSSDPK 
Subjt:  ----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKV

Query:  FPPSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIE
        FPP ++CES GCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+F RSLPRPLPIAVAGA P QKKEVV+EEVDE+DKDEGSIE
Subjt:  FPPSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIE

Query:  HLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        HLS QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  HLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

A0A1S3CFG6 uncharacterized protein LOC103500291 isoform X13.5e-16589.08Show/hide
Query:  MPVDSPLIPPNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL
        MPVDSPLIP NTT+AA        TTTTVTPISFTLT  ATA TTAAAA AAIARPLANQAPSRPISSIPQT HLHYP QALY  QSIPVRTPNTQL KL
Subjt:  MPVDSPLIPPNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL

Query:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPK
        H    Q ILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGS ISGSIKGAPNSSDPK
Subjt:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPK

Query:  VFPPSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI
         FPPS++CES GCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSI
Subjt:  VFPPSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI

Query:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        EHLS QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

A0A5A7TZA2 Mucin-2 isoform X23.5e-16589.08Show/hide
Query:  MPVDSPLIPPNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL
        MPVDSPLIP NTT+AA        TTTTVTPISFTLT  ATA TTAAAA AAIARPLANQAPSRPISSIPQT HLHYP QALY  QSIPVRTPNTQL KL
Subjt:  MPVDSPLIPPNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL

Query:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPK
        H    Q ILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGS ISGSIKGAPNSSDPK
Subjt:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPK

Query:  VFPPSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI
         FPPS++CES GCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSI
Subjt:  VFPPSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI

Query:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        EHLS QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

A0A6J1F1T6 uncharacterized protein LOC1114414172.9e-15987.29Show/hide
Query:  MPVDSPLIPPNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--
        MPVDSPLIP NTT     AAA TTTTV PISFTLTTA ATT    +AAAIARPLANQAPSRPISSIPQT HLHYPPQALY AQ IPVRTPNTQL KL   
Subjt:  MPVDSPLIPPNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--

Query:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKVFP
          Q ILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGS ISGSIKGAPNSSDPKVFP
Subjt:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKVFP

Query:  PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL
        PS++ E+ GCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE L
Subjt:  PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL

Query:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        S QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

A0A6J1IZP9 uncharacterized protein LOC1114814117.0e-15887.01Show/hide
Query:  MPVDSPLIPPNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--
        MPVDSPLIP NTT     A A TTTTV PISFTL+ A ATT    AAAAIARPLANQAPSRPISSIPQT HLHYPPQALY AQ IPVRTPNTQL KL   
Subjt:  MPVDSPLIPPNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--

Query:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKVFP
          Q ILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGS ISGSIK APNSSDPKVFP
Subjt:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKVFP

Query:  PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL
        PS++ E+ GCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEEEVDE+DKDE SIE L
Subjt:  PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL

Query:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        S QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G04930.1 hydroxyproline-rich glycoprotein family protein1.9e-3837.8Show/hide
Query:  PNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYA----AQSIPVRTPNTQLQKLHQTILYPVASSGR
        PN++A+   + +++  S T  T   T     +    A P       RPI+  P      +  Q+ Y+    A SIPVR    Q+Q     +LYP A  GR
Subjt:  PNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYA----AQSIPVRTPNTQLQKLHQTILYPVASSGR

Query:  GFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPIS-GSIKGAPNSSDPKVF-PPSSMCESKGCKE
        GF  RP++   AD +VT  N  GYP RP  T+   P     ++S+       R P ++       GSP+  G I+ +P    P+V  PP+S+ ++   ++
Subjt:  GFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPIS-GSIKGAPNSSDPKVF-PPSSMCESKGCKE

Query:  MRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQELLKRHVK
         R +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +     S   +  EE  DE+ +DE +++ LS ++LLKRH++
Subjt:  MRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQELLKRHVK

Query:  RAKKVRSRLREERLQRIERYKTRLALLL
        RAKKVR++LREER +RI RYK R+ L+L
Subjt:  RAKKVRSRLREERLQRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein1.5e-3535.2Show/hide
Query:  PNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYA----AQSIPVRTPNTQLQKLHQTILYPVASSGR
        PN++A+   + +++  S T  T   T     +    A P       RPI+  P      +  Q+ Y+    A SIPVR    Q+Q     +LYP A  GR
Subjt:  PNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYA----AQSIPVRTPNTQLQKLHQTILYPVASSGR

Query:  GFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPI-----------------SGSIKGAPNSSDPK
        GF  RP++   AD +VT  N  GYP RP  T+   P     ++S+       R P ++       GSP+                 SG I G     DPK
Subjt:  GFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPI-----------------SGSIKGAPNSSDPK

Query:  VF---------------PPSSMCESKGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKE
                         PP+S+ ++   ++ R +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +     S   +
Subjt:  VF---------------PPSSMCESKGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKE

Query:  VVEEEVDEEDKDEGSIEHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLL
          EE  DE+ +DE +++ LS ++LLKRH++RAKKVR++LREER +RI RYK R+ L+L
Subjt:  VVEEEVDEEDKDEGSIEHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLL

AT2G32840.1 proline-rich family protein8.2e-5042.49Show/hide
Query:  PVDSPLIPPNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQ--HLHYPPQALYAAQSIPVRTPNTQLQKLHQ-----
        P  +P   PN + +   +T +          TA+ +       +  P +   P  P SS       H H+P Q +Y    +P+R  N+     HQ     
Subjt:  PVDSPLIPPNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQ--HLHYPPQALYAAQSIPVRTPNTQLQKLHQ-----

Query:  --TILYPVASSGRGFVPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKV
          +++YP  SSGRGF  RP++      A  V   +PGGY P  PV  + H    S +LD M+  M  A P N Q    P  G   SG +KG P+   P+ 
Subjt:  --TILYPVASSGRGFVPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKV

Query:  FP-PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDK-DEGS
         P P+S+ ++ G K+ R RDDAL +VR RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP  V     S  K++VEE + EEDK DE S
Subjt:  FP-PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDK-DEGS

Query:  IEHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD
        ++HLS  +LLKRH+ RAKKVR+RLREERL+RI RYK RLALLLPP  EQ R +
Subjt:  IEHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD

AT2G32840.2 proline-rich family protein8.5e-3137.16Show/hide
Query:  PVDSPLIPPNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQ--HLHYPPQALYAAQSIPVRTPNTQLQKLHQ-----
        P  +P   PN + +   +T +          TA+ +       +  P +   P  P SS       H H+P Q +Y    +P+R  N+     HQ     
Subjt:  PVDSPLIPPNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQ--HLHYPPQALYAAQSIPVRTPNTQLQKLHQ-----

Query:  --TILYPVASSGRGFVPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKV
          +++YP  SSGRGF  RP++      A  V   +PGGY P  PV  + H    S +LD M+  M  A P N Q    P  G   SG +KG P+   P+ 
Subjt:  --TILYPVASSGRGFVPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKV

Query:  FP-PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKD
         P P+S+ ++ G K+ R RDDAL +VR RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP  V     S  K++VEE + EEDK+
Subjt:  FP-PSSMCESKGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCGTCGACTCCCCTCTCATCCCCCCCAATACCACGGCCGCCGCCCCCACCACCACCACCGTCACTCCGATTTCCTTCACTTTAACTACAGCTACAGCCACCACTAC
CGCCGCCGCTGCCGCTGCCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCCAGACCCATTTCTTCAATTCCTCAAACCCAACATCTCCACTACCCTCCTCAAGCCC
TCTACGCGGCTCAGTCCATCCCCGTTCGAACTCCCAACACCCAATTGCAGAAGCTTCATCAGACAATTCTTTACCCTGTCGCCTCCTCTGGTCGCGGCTTCGTTCCTCGC
CCCATTCAGCCCCTTCCCGCCGATCAGGCCGTCACGCTGGCTAACCCTGGCGGCTACCCACATCGCCCCGTCGTCACTTTTCCCCATCGGCCGATTGGGTCGCCTCATTT
GGACTCCATGAGTCATCCAATGCACATGGCTCGACCTCCCAACTTGCAGCAGCAGCTTATTCCCTTTTCTGGGTCCCCCATTTCGGGCTCGATTAAAGGTGCCCCCAATT
CCTCTGATCCAAAGGTTTTTCCTCCATCATCAATGTGCGAGTCAAAGGGGTGTAAAGAAATGAGAGTTAGAGACGACGCTCTTTGTGTGGTTAGAGATCGAAAAGTTCGA
ATAACTGATGGGGCTTCTCTATATGCGCTTTGTCGATCCTGGCTAAGGAATGGTTCTCAAGAAGAAAGTCAGCCACAATATGGAAATTTTTTGAGGTCACTTCCGAGACC
ACTGCCCATTGCTGTGGCTGGTGCTGTTCCATCACAGAAGAAGGAAGTTGTCGAAGAAGAAGTTGACGAGGAAGATAAGGATGAGGGATCCATTGAGCACTTGTCCGCTC
AAGAGTTATTGAAAAGACATGTTAAACGTGCAAAGAAAGTCCGATCACGATTGAGAGAAGAACGGTTGCAACGAATTGAAAGATACAAAACCAGGCTCGCTCTTCTCCTT
CCTCCTCCAGTCGAGCAGTTGAGGACGGATAACGTTACTGGAAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCGTCGACTCCCCTCTCATCCCCCCCAATACCACGGCCGCCGCCCCCACCACCACCACCGTCACTCCGATTTCCTTCACTTTAACTACAGCTACAGCCACCACTAC
CGCCGCCGCTGCCGCTGCCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCCAGACCCATTTCTTCAATTCCTCAAACCCAACATCTCCACTACCCTCCTCAAGCCC
TCTACGCGGCTCAGTCCATCCCCGTTCGAACTCCCAACACCCAATTGCAGAAGCTTCATCAGACAATTCTTTACCCTGTCGCCTCCTCTGGTCGCGGCTTCGTTCCTCGC
CCCATTCAGCCCCTTCCCGCCGATCAGGCCGTCACGCTGGCTAACCCTGGCGGCTACCCACATCGCCCCGTCGTCACTTTTCCCCATCGGCCGATTGGGTCGCCTCATTT
GGACTCCATGAGTCATCCAATGCACATGGCTCGACCTCCCAACTTGCAGCAGCAGCTTATTCCCTTTTCTGGGTCCCCCATTTCGGGCTCGATTAAAGGTGCCCCCAATT
CCTCTGATCCAAAGGTTTTTCCTCCATCATCAATGTGCGAGTCAAAGGGGTGTAAAGAAATGAGAGTTAGAGACGACGCTCTTTGTGTGGTTAGAGATCGAAAAGTTCGA
ATAACTGATGGGGCTTCTCTATATGCGCTTTGTCGATCCTGGCTAAGGAATGGTTCTCAAGAAGAAAGTCAGCCACAATATGGAAATTTTTTGAGGTCACTTCCGAGACC
ACTGCCCATTGCTGTGGCTGGTGCTGTTCCATCACAGAAGAAGGAAGTTGTCGAAGAAGAAGTTGACGAGGAAGATAAGGATGAGGGATCCATTGAGCACTTGTCCGCTC
AAGAGTTATTGAAAAGACATGTTAAACGTGCAAAGAAAGTCCGATCACGATTGAGAGAAGAACGGTTGCAACGAATTGAAAGATACAAAACCAGGCTCGCTCTTCTCCTT
CCTCCTCCAGTCGAGCAGTTGAGGACGGATAACGTTACTGGAAGCTGA
Protein sequenceShow/hide protein sequence
MPVDSPLIPPNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLHQTILYPVASSGRGFVPR
PIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSPISGSIKGAPNSSDPKVFPPSSMCESKGCKEMRVRDDALCVVRDRKVR
ITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLL
PPPVEQLRTDNVTGS