; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G195520 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G195520
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionMucin-2 isoform X2
Genome locationCla97Chr10:25019405..25025563
RNA-Seq ExpressionCla97C10G195520
SyntenyCla97C10G195520
Gene Ontology termsGO:0017053 - transcriptional repressor complex (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461764.1 PREDICTED: uncharacterized protein LOC103500291 isoform X1 [Cucumis melo]2.2e-16381.59Show/hide
Query:  MPVDSPLIPTNTTAAAP---PTTTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL
        MPVDSPLIPTNTT+AA     TTTTTTTVTPISFTLT AA A TTAAAA+AAIARPLANQAPSRPISSIPQTHHLHYP Q LY PQSIPVRTPN QL KL
Subjt:  MPVDSPLIPTNTTAAAP---PTTTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL

Query:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK
        H    QAILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIKGA NSSDPK
Subjt:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK

Query:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN
         FPPSTICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ                                            
Subjt:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN

Query:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT
         PQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVT
Subjt:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT

Query:  GS
        GS
Subjt:  GS

XP_022934169.1 uncharacterized protein LOC111441417 [Cucurbita moschata]1.1e-15478.95Show/hide
Query:  MPVDSPLIPTNTT-AAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--
        MPVDSPLIPTNTT AA     TTTTTV PISFTLTTAAATT+A    AAIARPLANQAPSRPISSIPQTHHLHYPPQ LY  Q IPVRTPN QL KL   
Subjt:  MPVDSPLIPTNTT-AAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--

Query:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP
          QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGA NSSDPKVFP
Subjt:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP

Query:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ
        PSTI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQ                                             PQ
Subjt:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ

Query:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        YG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_022982576.1 uncharacterized protein LOC111481411 [Cucurbita maxima]1.4e-15479.2Show/hide
Query:  MPVDSPLIPTNTT-AAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--
        MPVDSPLIPTNTT AAA  T TTTTTV PISFTL+ AAATT    A+AAIARPLANQAPSRPISSIPQTHHLHYPPQ LY  Q IPVRTPN QL KL   
Subjt:  MPVDSPLIPTNTT-AAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--

Query:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP
          QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIK A NSSDPKVFP
Subjt:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP

Query:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ
        PSTI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQ                                             PQ
Subjt:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ

Query:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        YG+FLRSLPRPLPIAV GAVPSQKKEVVEEEVDE+DKDE SIE LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_023526543.1 uncharacterized protein LOC111790016 [Cucurbita pepo subsp. pepo]2.9e-15579.4Show/hide
Query:  MPVDSPLIPTNTTAAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH---
        MPVDSPLIPTNTT AA   TTTTTTV PISFTLTTAAATT    A+AAIARPLANQAPSRPISSIPQTHHLHYPPQ LY  Q IPVRTPN QL KL    
Subjt:  MPVDSPLIPTNTTAAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH---

Query:  -QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFPP
         QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGA NSSDPKVFPP
Subjt:  -QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFPP

Query:  STICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQY
        STI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQ                                             PQY
Subjt:  STICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQY

Query:  GNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        G+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE L TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  GNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_038903387.1 uncharacterized protein LOC120089997 [Benincasa hispida]6.5e-16382.5Show/hide
Query:  MPVDSPLIPTNTTAAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH---
        MPVDSPLIPTNTTAAAP    +TTTVTPISFTLTTAAATTTAAAA AAIARPLANQAPSRPISSIPQTHHLHYPPQ LYA QSIPVRTPN QL KLH   
Subjt:  MPVDSPLIPTNTTAAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH---

Query:  -QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNL--QQQLIPFSGSSISGSIKGALNSSDPKVF
         QAILYPVASSGRGFVPRPI+PLPADQ VTLANPGGY +RPVVTFPHRPIGS HLDSMSHPMHMARPPNL  QQQLIPFSGSSISGSIKG  NSSDPKVF
Subjt:  -QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNL--QQQLIPFSGSSISGSIKGALNSSDPKVF

Query:  PPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIP
         PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ                                             P
Subjt:  PPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIP

Query:  QYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        QYGNFLRSLPRPLPIA AGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  QYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

TrEMBL top hitse value%identityAlignment
A0A0A0LCG5 Uncharacterized protein3.8e-16180.8Show/hide
Query:  MPVDSPLIPTNTTAAA-PPTTTTTTTVTPISFTLTTAAA--TTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH
        MPVDSPLIPTNTT+AA   TTTTTTTVTPISFTLTTAAA  TTT AAA+AAIARPLANQAPS+PISSIPQTHHLHYP Q LY PQSIPVRTPNAQL KLH
Subjt:  MPVDSPLIPTNTTAAA-PPTTTTTTTVTPISFTLTTAAA--TTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH

Query:  ----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKV
            QAILYPVASSGRGFVPR I+PLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIK A NSSDPK 
Subjt:  ----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKV

Query:  FPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNI
        FPP TICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ                                             
Subjt:  FPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNI

Query:  PQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTG
        PQYG+F RSLPRPLPIAVAGA P QKKEVV+EEVDE+DKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTG
Subjt:  PQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTG

Query:  S
        S
Subjt:  S

A0A1S3CFG6 uncharacterized protein LOC103500291 isoform X11.1e-16381.59Show/hide
Query:  MPVDSPLIPTNTTAAAP---PTTTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL
        MPVDSPLIPTNTT+AA     TTTTTTTVTPISFTLT AA A TTAAAA+AAIARPLANQAPSRPISSIPQTHHLHYP Q LY PQSIPVRTPN QL KL
Subjt:  MPVDSPLIPTNTTAAAP---PTTTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL

Query:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK
        H    QAILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIKGA NSSDPK
Subjt:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK

Query:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN
         FPPSTICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ                                            
Subjt:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN

Query:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT
         PQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVT
Subjt:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT

Query:  GS
        GS
Subjt:  GS

A0A5A7TZA2 Mucin-2 isoform X21.1e-16381.59Show/hide
Query:  MPVDSPLIPTNTTAAAP---PTTTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL
        MPVDSPLIPTNTT+AA     TTTTTTTVTPISFTLT AA A TTAAAA+AAIARPLANQAPSRPISSIPQTHHLHYP Q LY PQSIPVRTPN QL KL
Subjt:  MPVDSPLIPTNTTAAAP---PTTTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL

Query:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK
        H    QAILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIKGA NSSDPK
Subjt:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK

Query:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN
         FPPSTICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ                                            
Subjt:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN

Query:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT
         PQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVT
Subjt:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT

Query:  GS
        GS
Subjt:  GS

A0A6J1F1T6 uncharacterized protein LOC1114414175.4e-15578.95Show/hide
Query:  MPVDSPLIPTNTT-AAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--
        MPVDSPLIPTNTT AA     TTTTTV PISFTLTTAAATT+A    AAIARPLANQAPSRPISSIPQTHHLHYPPQ LY  Q IPVRTPN QL KL   
Subjt:  MPVDSPLIPTNTT-AAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--

Query:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP
          QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGA NSSDPKVFP
Subjt:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP

Query:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ
        PSTI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQ                                             PQ
Subjt:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ

Query:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        YG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

A0A6J1IZP9 uncharacterized protein LOC1114814117.0e-15579.2Show/hide
Query:  MPVDSPLIPTNTT-AAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--
        MPVDSPLIPTNTT AAA  T TTTTTV PISFTL+ AAATT    A+AAIARPLANQAPSRPISSIPQTHHLHYPPQ LY  Q IPVRTPN QL KL   
Subjt:  MPVDSPLIPTNTT-AAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--

Query:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP
          QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIK A NSSDPKVFP
Subjt:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP

Query:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ
        PSTI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQ                                             PQ
Subjt:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ

Query:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        YG+FLRSLPRPLPIAV GAVPSQKKEVVEEEVDE+DKDE SIE LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G04930.1 hydroxyproline-rich glycoprotein family protein2.8e-3134.13Show/hide
Query:  PTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYP---PQP----------LYAPQSIPVRTPNAQLQKLHQAILYPV
        P ++ + TV+P   +L+TA+ T      +    RP  +Q P  P    P T+    P   P P          LYA  SIPVR    Q+Q    A+LYP 
Subjt:  PTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYP---PQP----------LYAPQSIPVRTPNAQLQKLHQAILYPV

Query:  ASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSIS-GSIKGALNSSDPKVFPPST-ICES
        A  GRGF  RP++   AD +VT  N  GYP RP  T+   P     ++S+       R P ++       GS +  G I+ +     P+V PP T I ++
Subjt:  ASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSIS-GSIKGALNSSDPKVFPPST-ICES

Query:  NGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNFLR
        +  ++ R +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    Q                                             PQ    ++
Subjt:  NGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNFLR

Query:  SLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLL
         LP+PLP+ +     S   +  EE  DE+ +DE +++ LS ++LLKRH+ RAKKVR++LREER +RI RYK R+ L+L
Subjt:  SLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein3.5e-2932.11Show/hide
Query:  PTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYP---PQP----------LYAPQSIPVRTPNAQLQKLHQAILYPV
        P ++ + TV+P   +L+TA+ T      +    RP  +Q P  P    P T+    P   P P          LYA  SIPVR    Q+Q    A+LYP 
Subjt:  PTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYP---PQP----------LYAPQSIPVRTPNAQLQKLHQAILYPV

Query:  ASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSI-----------------SGSIKGALN
        A  GRGF  RP++   AD +VT  N  GYP RP  T+   P     ++S+       R P ++       GS +                 SG I G   
Subjt:  ASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSI-----------------SGSIKGALN

Query:  SSDPKVF---------------PPSTICESNGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKC
          DPK                 PP++I +++  ++ R +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    Q                       
Subjt:  SSDPKVF---------------PPSTICESNGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKC

Query:  QHYSGLVFMLCCSFLIHILYNIPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERY
                              PQ    ++ LP+PLP+ +     S   +  EE  DE+ +DE +++ LS ++LLKRH+ RAKKVR++LREER +RI RY
Subjt:  QHYSGLVFMLCCSFLIHILYNIPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERY

Query:  KTRLALLL
        K R+ L+L
Subjt:  KTRLALLL

AT2G32840.1 proline-rich family protein1.8e-4138.57Show/hide
Query:  PKFPATMPVDSPLIPTNTTAAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQ
        P    ++ V +P++    TA+  P T   T +TP     ++     T A++  AIA PL                H H+P Q +Y    +P+R  N+   
Subjt:  PKFPATMPVDSPLIPTNTTAAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQ

Query:  KLHQ-------AILYPVASSGRGFVPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKG
          HQ       +++YP  SSGRGF  RP++      A  V   +PGGY P  PV  + H    S +LD M+  M  A P N Q    P  G   SG +KG
Subjt:  KLHQ-------AILYPVASSGRGFVPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKG

Query:  ALNSSDPKVFP-PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCC
          +   P+  P P++I +++G K+ R RDDAL +VR RKVRIT+GASLY+LCRSWLRNG+ E                P RI              M+ C
Subjt:  ALNSSDPKVFP-PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCC

Query:  SFLIHILYNIPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDK-DEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP
                           LP+PLP  V     S  K++VEE + EEDK DE S++HLS  +LLKRH+ RAKKVR+RLREERL+RI RYK RLALLLPP 
Subjt:  SFLIHILYNIPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDK-DEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP

Query:  VEQLRTD
         EQ R +
Subjt:  VEQLRTD

AT2G32840.2 proline-rich family protein1.8e-2233.43Show/hide
Query:  PKFPATMPVDSPLIPTNTTAAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQ
        P    ++ V +P++    TA+  P T   T +TP     ++     T A++  AIA PL                H H+P Q +Y    +P+R  N+   
Subjt:  PKFPATMPVDSPLIPTNTTAAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQ

Query:  KLHQ-------AILYPVASSGRGFVPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKG
          HQ       +++YP  SSGRGF  RP++      A  V   +PGGY P  PV  + H    S +LD M+  M  A P N Q    P  G   SG +KG
Subjt:  KLHQ-------AILYPVASSGRGFVPRPIQPLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKG

Query:  ALNSSDPKVFP-PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCC
          +   P+  P P++I +++G K+ R RDDAL +VR RKVRIT+GASLY+LCRSWLRNG+ E                P RI              M+ C
Subjt:  ALNSSDPKVFP-PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCC

Query:  SFLIHILYNIPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKD
                           LP+PLP  V     S  K++VEE + EEDK+
Subjt:  SFLIHILYNIPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCCCCCACGAGCGTCCAAACCCCCCAATCCTGTCAATTCATTCACAGAAATTCAAGTTTTATTTGGCAATTTGATATTCAAACAATCAAACCACAAACTCCCAA
ATTCCCCGCCACCATGCCCGTCGACTCCCCTCTCATCCCCACCAATACGACGGCCGCCGCCCCCCCCACCACCACCACCACCACCACCGTCACTCCGATTTCCTTCACTT
TAACTACAGCTGCAGCCACCACTACCGCCGCCGCTGCCTCTGCCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCCAGACCCATTTCTTCAATTCCTCAAACCCAT
CATCTCCACTACCCTCCTCAACCCCTCTACGCGCCTCAGTCCATCCCCGTTCGAACTCCCAACGCCCAATTGCAGAAGCTTCATCAGGCAATTCTTTACCCTGTCGCCTC
CTCTGGCCGCGGCTTCGTTCCTCGCCCCATTCAGCCCCTTCCCGCCGATCAGGCCGTCACGCTGGCTAACCCTGGCGGTTACCCACATCGCCCCGTTGTCACTTTTCCCC
ATCGGCCGATTGGGTCGCCTCATTTGGACTCCATGAGCCATCCAATGCACATGGCTCGACCTCCCAACTTGCAGCAGCAACTTATTCCCTTTTCTGGGTCCTCCATTTCG
GGCTCGATTAAAGGTGCCCTCAATTCCTCTGATCCAAAGGTTTTTCCTCCATCAACAATCTGCGAGTCAAATGGTTGTAAAGAAATGAGAGTTAGAGACGATGCTCTTTG
TGTGGTTAGAGATCGAAAAGTTCGAATAACTGATGGGGCTTCTCTTTATGCGCTTTGTCGATCATGGCTGAGGAATGGTTCTCAAGAAGAAAGCCAGGTCTGTGTCAGTT
CTGATTACAGCATCTTTCCTTTTGATCCTTATAGGATTAGTTTTATGATTAAGTGTCAACACTACAGCGGATTGGTCTTTATGTTATGTTGTAGCTTTCTCATTCATATT
TTGTACAATATTCCACAATATGGAAATTTTTTGAGGTCACTTCCGAGACCACTGCCCATTGCCGTGGCTGGTGCTGTTCCATCACAGAAGAAGGAAGTTGTCGAAGAAGA
AGTTGATGAGGAAGATAAGGATGAGGGATCCATTGAGCACTTGTCAACGCAAGAGTTATTGAAAAGACATGTTAGACGTGCAAAGAAAGTCCGATCACGATTGAGAGAAG
AACGGTTGCAACGAATTGAAAGATACAAAACCAGGCTTGCTCTTCTCCTTCCTCCTCCAGTCGAGCAGTTGAGAACGGATAATGTTACTGGAAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCCCCCACGAGCGTCCAAACCCCCCAATCCTGTCAATTCATTCACAGAAATTCAAGTTTTATTTGGCAATTTGATATTCAAACAATCAAACCACAAACTCCCAA
ATTCCCCGCCACCATGCCCGTCGACTCCCCTCTCATCCCCACCAATACGACGGCCGCCGCCCCCCCCACCACCACCACCACCACCACCGTCACTCCGATTTCCTTCACTT
TAACTACAGCTGCAGCCACCACTACCGCCGCCGCTGCCTCTGCCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCCAGACCCATTTCTTCAATTCCTCAAACCCAT
CATCTCCACTACCCTCCTCAACCCCTCTACGCGCCTCAGTCCATCCCCGTTCGAACTCCCAACGCCCAATTGCAGAAGCTTCATCAGGCAATTCTTTACCCTGTCGCCTC
CTCTGGCCGCGGCTTCGTTCCTCGCCCCATTCAGCCCCTTCCCGCCGATCAGGCCGTCACGCTGGCTAACCCTGGCGGTTACCCACATCGCCCCGTTGTCACTTTTCCCC
ATCGGCCGATTGGGTCGCCTCATTTGGACTCCATGAGCCATCCAATGCACATGGCTCGACCTCCCAACTTGCAGCAGCAACTTATTCCCTTTTCTGGGTCCTCCATTTCG
GGCTCGATTAAAGGTGCCCTCAATTCCTCTGATCCAAAGGTTTTTCCTCCATCAACAATCTGCGAGTCAAATGGTTGTAAAGAAATGAGAGTTAGAGACGATGCTCTTTG
TGTGGTTAGAGATCGAAAAGTTCGAATAACTGATGGGGCTTCTCTTTATGCGCTTTGTCGATCATGGCTGAGGAATGGTTCTCAAGAAGAAAGCCAGGTCTGTGTCAGTT
CTGATTACAGCATCTTTCCTTTTGATCCTTATAGGATTAGTTTTATGATTAAGTGTCAACACTACAGCGGATTGGTCTTTATGTTATGTTGTAGCTTTCTCATTCATATT
TTGTACAATATTCCACAATATGGAAATTTTTTGAGGTCACTTCCGAGACCACTGCCCATTGCCGTGGCTGGTGCTGTTCCATCACAGAAGAAGGAAGTTGTCGAAGAAGA
AGTTGATGAGGAAGATAAGGATGAGGGATCCATTGAGCACTTGTCAACGCAAGAGTTATTGAAAAGACATGTTAGACGTGCAAAGAAAGTCCGATCACGATTGAGAGAAG
AACGGTTGCAACGAATTGAAAGATACAAAACCAGGCTTGCTCTTCTCCTTCCTCCTCCAGTCGAGCAGTTGAGAACGGATAATGTTACTGGAAGCTGAATACGCATCCTG
GAATCCTCGCCCTCGAAATTCACTGTGGATTTTCGTCCAAAACATGCTTCCCCAAATACAGATGAACTCGAGAAACGTTCAACGGAAGCGCCAAGGATTATATGTAGGTG
CACCAGATAGATTTTTTATTAGATATTGGTAATTCTTGTCTTCCTATCAGTTCCTTTTTGACAGTTTGTAATTGTATAATATGCTCTGAATCGGGTTAAAAAAATGAAGT
CATTTACTGAGGTT
Protein sequenceShow/hide protein sequence
MAAPTSVQTPQSCQFIHRNSSFIWQFDIQTIKPQTPKFPATMPVDSPLIPTNTTAAAPPTTTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTH
HLHYPPQPLYAPQSIPVRTPNAQLQKLHQAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSIS
GSIKGALNSSDPKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHI
LYNIPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS