CuGenDBv2

Lsi03G012340 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi03G012340
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Mucin-2 isoform X2
Genome location: chr03:22904841..22911008
RNA-Seq Expression: Lsi03G012340
Synteny: Lsi03G012340
Gene Ontology terms: GO:0017053 - transcriptional repressor complex (cellular component)
InterPro domains: IPR028226 - Protein LIN37
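
The genome location field above uses a `chromosome:start..end` layout. A minimal sketch of how such a string could be split into its parts is given below; the helper name `parse_location` and the `Locus` type are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Chromosome name, a colon, then two integers separated by '..'
_LOCATION = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Locus:
    """Parse a location such as 'chr03:22904841..22911008'."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(match["chrom"], int(match["start"]), int(match["end"]))


locus = parse_location("chr03:22904841..22911008")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the reported span covers the whole gene model, including untranslated regions and any introns, so it is expected to be longer than the CDS listed further down.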


Homology
GenBank top hits | e value | % identity (alignment follows each hit)
XP_008461764.1 PREDICTED: uncharacterized protein LOC103500291 isoform X1 [Cucumis melo] | 1.5e-167 | 90.2
Query:  MPVDSPLIPTNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL
        MPVDSPLIPTNTT+AA        TTTTVTPISFTLT  ATA TTAAAA AAIARPLANQAPSRPISSIPQT HLHYP QALY  QSIPVRTPNTQL KL
Subjt:  MPVDSPLIPTNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL

Query:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPK
        H    Q ILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIKGAPNSSDPK
Subjt:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPK

Query:  VFPPSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI
         FPPST+CESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSI
Subjt:  VFPPSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI

Query:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        EHLS QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_022934169.1 uncharacterized protein LOC111441417 [Cucurbita moschata] | 1.7e-161 | 88.42
Query:  MPVDSPLIPTNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--
        MPVDSPLIPTNTT     AAA TTTTV PISFTLTTA ATT    +AAAIARPLANQAPSRPISSIPQT HLHYPPQALY AQ IPVRTPNTQL KL   
Subjt:  MPVDSPLIPTNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--

Query:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP
          Q ILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP
Subjt:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP

Query:  PSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL
        PST+ E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE L
Subjt:  PSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL

Query:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        S QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_022982576.1 uncharacterized protein LOC111481411 [Cucurbita maxima] | 4.1e-160 | 88.14
Query:  MPVDSPLIPTNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--
        MPVDSPLIPTNTT     A A TTTTV PISFTL+ A ATT    AAAAIARPLANQAPSRPISSIPQT HLHYPPQALY AQ IPVRTPNTQL KL   
Subjt:  MPVDSPLIPTNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--

Query:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP
          Q ILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIK APNSSDPKVFP
Subjt:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP

Query:  PSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL
        PST+ E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEEEVDE+DKDE SIE L
Subjt:  PSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL

Query:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        S QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_023526543.1 uncharacterized protein LOC111790016 [Cucurbita pepo subsp. pepo] | 2.2e-161 | 89.43
Query:  MPVDSPLIPTNTT-AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH----QT
        MPVDSPLIPTNTT AA  TTTTV PISFTLTTA ATT    AAAAIARPLANQAPSRPISSIPQT HLHYPPQALY AQ IPVRTPNTQL KL     Q 
Subjt:  MPVDSPLIPTNTT-AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH----QT

Query:  ILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFPPSTM
        ILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFPPST+
Subjt:  ILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFPPSTM

Query:  CESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQE
         E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE L  QE
Subjt:  CESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQE

Query:  LLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        LLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  LLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_038903387.1 uncharacterized protein LOC120089997 [Benincasa hispida] | 6.7e-171 | 93.16
Query:  MPVDSPLIPTNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH----QTI
        MPVDSPLIPTNTTAAAP+TTTVTPISFTLTTA ATTTAAAA AAIARPLANQAPSRPISSIPQT HLHYPPQALYAAQSIPVRTPN QL KLH    Q I
Subjt:  MPVDSPLIPTNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH----QTI

Query:  LYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNL--QQQLIPFSGSSISGSIKGAPNSSDPKVFPPST
        LYPVASSGRGFVPRPI+PLPADQ VTLANPGGY +RPVVTFPHRPIGS HLDSMSHPMHMARPPNL  QQQLIPFSGSSISGSIKG PNSSDPKVF PST
Subjt:  LYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNL--QQQLIPFSGSSISGSIKGAPNSSDPKVFPPST

Query:  MCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQ
        +CESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIA AGAVPSQKKEVVEEEVDEEDKDEGSIEHLS Q
Subjt:  MCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQ

Query:  ELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        ELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  ELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

TrEMBL top hits | e value | % identity (alignment follows each hit)
A0A0A0LCG5 Uncharacterized protein | 1.1e-163 | 88.48
Query:  MPVDSPLIPTNTTAAA-----PTTTTVTPISFTLTTATA--TTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH
        MPVDSPLIPTNTT+AA      TTTTVTPISFTLTTA A  TTT AAA AAIARPLANQAPS+PISSIPQT HLHYP QALY  QSIPVRTPN QL KLH
Subjt:  MPVDSPLIPTNTTAAA-----PTTTTVTPISFTLTTATA--TTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH

Query:  ----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKV
            Q ILYPVASSGRGFVPR I+PLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIK APNSSDPK 
Subjt:  ----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKV

Query:  FPPSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIE
        FPP T+CESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+F RSLPRPLPIAVAGA P QKKEVV+EEVDE+DKDEGSIE
Subjt:  FPPSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIE

Query:  HLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        HLS QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  HLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

A0A1S3CFG6 uncharacterized protein LOC103500291 isoform X1 | 7.5e-168 | 90.2
Query:  MPVDSPLIPTNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL
        MPVDSPLIPTNTT+AA        TTTTVTPISFTLT  ATA TTAAAA AAIARPLANQAPSRPISSIPQT HLHYP QALY  QSIPVRTPNTQL KL
Subjt:  MPVDSPLIPTNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL

Query:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPK
        H    Q ILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIKGAPNSSDPK
Subjt:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPK

Query:  VFPPSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI
         FPPST+CESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSI
Subjt:  VFPPSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI

Query:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        EHLS QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

A0A5A7TZA2 Mucin-2 isoform X2 | 7.5e-168 | 90.2
Query:  MPVDSPLIPTNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL
        MPVDSPLIPTNTT+AA        TTTTVTPISFTLT  ATA TTAAAA AAIARPLANQAPSRPISSIPQT HLHYP QALY  QSIPVRTPNTQL KL
Subjt:  MPVDSPLIPTNTTAAA-------PTTTTVTPISFTLT-TATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKL

Query:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPK
        H    Q ILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIKGAPNSSDPK
Subjt:  H----QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPK

Query:  VFPPSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI
         FPPST+CESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSI
Subjt:  VFPPSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSI

Query:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        EHLS QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  EHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

A0A6J1F1T6 uncharacterized protein LOC111441417 | 8.0e-162 | 88.42
Query:  MPVDSPLIPTNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--
        MPVDSPLIPTNTT     AAA TTTTV PISFTLTTA ATT    +AAAIARPLANQAPSRPISSIPQT HLHYPPQALY AQ IPVRTPNTQL KL   
Subjt:  MPVDSPLIPTNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--

Query:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP
          Q ILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP
Subjt:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP

Query:  PSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL
        PST+ E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE L
Subjt:  PSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL

Query:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        S QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

A0A6J1IZP9 uncharacterized protein LOC111481411 | 2.0e-160 | 88.14
Query:  MPVDSPLIPTNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--
        MPVDSPLIPTNTT     A A TTTTV PISFTL+ A ATT    AAAAIARPLANQAPSRPISSIPQT HLHYPPQALY AQ IPVRTPNTQL KL   
Subjt:  MPVDSPLIPTNTT-----AAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLH--

Query:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP
          Q ILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIK APNSSDPKVFP
Subjt:  --QTILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP

Query:  PSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL
        PST+ E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEEEVDE+DKDE SIE L
Subjt:  PSTMCESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHL

Query:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        S QELLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  SAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

SwissProt top hits | e value | % identity (alignment follows each hit)
No hits found
Arabidopsis top hits | e value | % identity (alignment follows each hit)
AT1G04930.1 hydroxyproline-rich glycoprotein family protein | 2.5e-38 | 37.99
Query:  PTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQH----LHYP----PQALYA----AQSIPVRTPNTQLQKLHQTILYPVASSG
        P ++    +S +L+TA+ T           RP  +Q P  P    P T      L +P     Q+ Y+    A SIPVR    Q+Q     +LYP A  G
Subjt:  PTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQH----LHYP----PQALYA----AQSIPVRTPNTQLQKLHQTILYPVASSG

Query:  RGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSIS-GSIKGAPNSSDPKVFPPST-MCESNGCK
        RGF  RP++   AD +VT  N  GYP RP  T+   P     ++S+       R P ++       GS +  G I+ +P    P+V PP T + +++  +
Subjt:  RGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSIS-GSIKGAPNSSDPKVFPPST-MCESNGCK

Query:  EMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQELLKRHV
        + R +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +     S   +  EE  DE+ +DE +++ LS ++LLKRH+
Subjt:  EMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQELLKRHV

Query:  KRAKKVRSRLREERLQRIERYKTRLALLL
        +RAKKVR++LREER +RI RYK R+ L+L
Subjt:  KRAKKVRSRLREERLQRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein | 3.3e-35 | 35.1
Query:  PTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQH----LHYP----PQALYA----AQSIPVRTPNTQLQKLHQTILYPVASSG
        P ++    +S +L+TA+ T           RP  +Q P  P    P T      L +P     Q+ Y+    A SIPVR    Q+Q     +LYP A  G
Subjt:  PTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQH----LHYP----PQALYA----AQSIPVRTPNTQLQKLHQTILYPVASSG

Query:  RGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSI-----------------SGSIKGAPNSSDP
        RGF  RP++   AD +VT  N  GYP RP  T+   P     ++S+       R P ++       GS +                 SG I G     DP
Subjt:  RGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSI-----------------SGSIKGAPNSSDP

Query:  KVF---------------PPSTMCESNGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKK
        K                 PP+++ +++  ++ R +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +     S   
Subjt:  KVF---------------PPSTMCESNGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKK

Query:  EVVEEEVDEEDKDEGSIEHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLL
        +  EE  DE+ +DE +++ LS ++LLKRH++RAKKVR++LREER +RI RYK R+ L+L
Subjt:  EVVEEEVDEEDKDEGSIEHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLL

AT2G32840.1 proline-rich family protein | 8.2e-50 | 42.9
Query:  PTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQ--HLHYPPQALYAAQSIPVRTPNTQLQKLHQ-------TILYPVASSGRGF
        P       ++ +    TA+ +       +  P +   P  P SS       H H+P Q +Y    +P+R  N+     HQ       +++YP  SSGRGF
Subjt:  PTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQ--HLHYPPQALYAAQSIPVRTPNTQLQKLHQ-------TILYPVASSGRGF

Query:  VPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP-PSTMCESNGCKE
          RP++      A  V   +PGGY P  PV  + H    S +LD M+  M  A P N Q    P  G   SG +KG P+   P+  P P+++ +++G K+
Subjt:  VPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP-PSTMCESNGCKE

Query:  MRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDK-DEGSIEHLSAQELLKRHVK
         R RDDAL +VR RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP  V     S  K++VEE + EEDK DE S++HLS  +LLKRH+ 
Subjt:  MRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDK-DEGSIEHLSAQELLKRHVK

Query:  RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD
        RAKKVR+RLREERL+RI RYK RLALLLPP  EQ R +
Subjt:  RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD

AT2G32840.2 proline-rich family protein | 1.1e-30 | 37.37
Query:  PTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQ--HLHYPPQALYAAQSIPVRTPNTQLQKLHQ-------TILYPVASSGRGF
        P       ++ +    TA+ +       +  P +   P  P SS       H H+P Q +Y    +P+R  N+     HQ       +++YP  SSGRGF
Subjt:  PTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQ--HLHYPPQALYAAQSIPVRTPNTQLQKLHQ-------TILYPVASSGRGF

Query:  VPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP-PSTMCESNGCKE
          RP++      A  V   +PGGY P  PV  + H    S +LD M+  M  A P N Q    P  G   SG +KG P+   P+  P P+++ +++G K+
Subjt:  VPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP-PSTMCESNGCKE

Query:  MRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKD
         R RDDAL +VR RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP  V     S  K++VEE + EEDK+
Subjt:  MRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKD


Sequences
CDS sequence
ATGCCCGTCGACTCCCCTCTCATCCCCACCAATACCACGGCCGCCGCCCCCACCACCACCACCGTCACTCCGATTTCCTTCACTTTAACTACAGCTACAGCCACCACTAC
CGCCGCCGCTGCCGCTGCCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCCAGACCCATTTCTTCAATTCCTCAAACCCAACATCTCCACTACCCTCCTCAAGCCC
TCTACGCGGCTCAGTCCATCCCCGTTCGAACTCCCAACACCCAATTGCAGAAGCTTCATCAGACAATTCTTTACCCTGTCGCCTCCTCTGGTCGCGGCTTCGTTCCTCGC
CCCATTCAGCCCCTTCCCGCCGATCAGGCCGTCACGCTGGCTAACCCTGGCGGCTACCCACATCGCCCCGTCGTCACTTTTCCCCATCGGCCGATTGGGTCGCCTCATTT
GGACTCCATGAGTCATCCAATGCACATGGCTCGACCTCCCAACTTGCAGCAGCAGCTTATTCCCTTTTCTGGGTCCTCCATTTCGGGCTCGATTAAAGGTGCCCCCAATT
CCTCTGATCCAAAGGTTTTTCCTCCATCAACAATGTGCGAGTCAAATGGGTGTAAAGAAATGAGAGTTAGAGACGACGCTCTTTGTGTGGTTAGAGATCGAAAAGTTCGA
ATAACTGATGGGGCTTCTCTATATGCGCTTTGTCGATCCTGGCTAAGGAATGGTTCTCAAGAAGAAAGTCAGCCACAATATGGAAATTTTTTGAGGTCACTTCCGAGACC
ACTGCCCATTGCTGTGGCTGGTGCTGTTCCATCACAGAAGAAGGAAGTTGTCGAAGAAGAAGTTGACGAGGAAGATAAGGATGAGGGATCCATTGAGCACTTGTCCGCTC
AAGAGTTATTGAAAAGACATGTTAAACGTGCAAAGAAAGTCCGATCACGATTGAGAGAAGAACGGTTGCAACGAATTGAAAGATACAAAACCAGGCTCGCTCTTCTCCTT
CCTCCTCCAGTCGAGCAGTTGAGGACGGATAACGTTACTGGAAGCTGA
mRNA sequence
TGTCAATTCACAGAAATTCAAGTTTTATTTGCCAATTTGATATTCAAACAATCAAACCACAAACTCCCAAATTCCCCGCCACCATGCCCGTCGACTCCCCTCTCATCCCC
ACCAATACCACGGCCGCCGCCCCCACCACCACCACCGTCACTCCGATTTCCTTCACTTTAACTACAGCTACAGCCACCACTACCGCCGCCGCTGCCGCTGCCGCCATTGC
TCGTCCGCTTGCGAATCAAGCGCCATCCAGACCCATTTCTTCAATTCCTCAAACCCAACATCTCCACTACCCTCCTCAAGCCCTCTACGCGGCTCAGTCCATCCCCGTTC
GAACTCCCAACACCCAATTGCAGAAGCTTCATCAGACAATTCTTTACCCTGTCGCCTCCTCTGGTCGCGGCTTCGTTCCTCGCCCCATTCAGCCCCTTCCCGCCGATCAG
GCCGTCACGCTGGCTAACCCTGGCGGCTACCCACATCGCCCCGTCGTCACTTTTCCCCATCGGCCGATTGGGTCGCCTCATTTGGACTCCATGAGTCATCCAATGCACAT
GGCTCGACCTCCCAACTTGCAGCAGCAGCTTATTCCCTTTTCTGGGTCCTCCATTTCGGGCTCGATTAAAGGTGCCCCCAATTCCTCTGATCCAAAGGTTTTTCCTCCAT
CAACAATGTGCGAGTCAAATGGGTGTAAAGAAATGAGAGTTAGAGACGACGCTCTTTGTGTGGTTAGAGATCGAAAAGTTCGAATAACTGATGGGGCTTCTCTATATGCG
CTTTGTCGATCCTGGCTAAGGAATGGTTCTCAAGAAGAAAGTCAGCCACAATATGGAAATTTTTTGAGGTCACTTCCGAGACCACTGCCCATTGCTGTGGCTGGTGCTGT
TCCATCACAGAAGAAGGAAGTTGTCGAAGAAGAAGTTGACGAGGAAGATAAGGATGAGGGATCCATTGAGCACTTGTCCGCTCAAGAGTTATTGAAAAGACATGTTAAAC
GTGCAAAGAAAGTCCGATCACGATTGAGAGAAGAACGGTTGCAACGAATTGAAAGATACAAAACCAGGCTCGCTCTTCTCCTTCCTCCTCCAGTCGAGCAGTTGAGGACG
GATAACGTTACTGGAAGCTGAATATGCATCCCGGAATCCTCGCCCTCAAAATTCACTGTGGACTTTCGTCCAAATCATTCTTCCCCAACTACAGATGAACTCAAGAAACA
AACATTCAACGGAAGCGCCGAGGATTATATGTAGGTGCACCAAATAGATATTTGATTAGATATTGGTAATTATTGTCTTCCTATCAGTTCCTTTTTGACAGTTTGTAATT
GTATAATATGCTCTGAATCAGGTTAAAAAAATGGAGTACATTGAGAAGTAAATCACATATTTCCCCCTTTTTTTTGTACCTATCTCCATTCCTTTTTCTTGGGCTTCTTA
AAGTTCAACTTCTGTTGTTAAAAGTTCCATTTAACATC
Protein sequence
MPVDSPLIPTNTTAAAPTTTTVTPISFTLTTATATTTAAAAAAAIARPLANQAPSRPISSIPQTQHLHYPPQALYAAQSIPVRTPNTQLQKLHQTILYPVASSGRGFVPR
PIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFPPSTMCESNGCKEMRVRDDALCVVRDRKVR
ITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSAQELLKRHVKRAKKVRSRLREERLQRIERYKTRLALLL
PPPVEQLRTDNVTGS
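
The CDS and protein sequences above can be cross-checked against each other by translating the CDS with the standard genetic code. The sketch below is one way to do that without external libraries; the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used here for brevity (the full sequence, with line breaks removed, can be passed in the same way).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 12 codons and last 5 codons of the CDS listed above.
cds_start = "ATGCCCGTCGACTCCCCTCTCATCCCCACCAATACC"
cds_end = "GTTACTGGAAGCTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments should reproduce the beginning and the end of the protein sequence listed above; a mismatch anywhere in a full-length comparison would point to a frame shift or a copy error in one of the two sequences.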