; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G192060 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G192060
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionMucin-2 isoform X2
Genome locationCiama_Chr10:26559449..26565616
RNA-Seq ExpressionCaUC10G192060
SyntenyCaUC10G192060
Gene Ontology termsGO:0017053 - transcriptional repressor complex (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149622.3 uncharacterized protein LOC101211370 [Cucumis sativus]3.2e-15480.79Show/hide
Query:  TTTTTVTPISFTLTTAAA--TTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH----QAILYPVASSGRGFVPR
        TTTTTVTPISFTLTTAAA  TTT AAA+AAIARPLANQAPS+PISSIPQTHHLHYP Q LY PQSIPVRTPNAQL KLH    QAILYPVASSGRGFVPR
Subjt:  TTTTTVTPISFTLTTAAA--TTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH----QAILYPVASSGRGFVPR

Query:  PIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFPPSTICESNGCKEMRVRDDA
         I+PLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIK A NSSDPK FPP TICESNGCKEMRVRDD 
Subjt:  PIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFPPSTICESNGCKEMRVRDDA

Query:  LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNFLRSLPRPLPIAVAGA
        LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ                                             PQYG+F RSLPRPLPIAVAGA
Subjt:  LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNFLRSLPRPLPIAVAGA

Query:  VPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
         P QKKEVV+EEVDE+DKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTGS
Subjt:  VPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_008461764.1 PREDICTED: uncharacterized protein LOC103500291 isoform X1 [Cucumis melo]8.4e-16381.09Show/hide
Query:  MPVDSPLIPTNTTAAA------PTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL
        MPVDSPLIPTNTT+AA       TTTTTVTPISFTLT AA A TTAAAA+AAIARPLANQAPSRPISSIPQTHHLHYP Q LY PQSIPVRTPN QL KL
Subjt:  MPVDSPLIPTNTTAAA------PTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL

Query:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK
        H    QAILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIKGA NSSDPK
Subjt:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK

Query:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN
         FPPSTICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ                                            
Subjt:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN

Query:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT
         PQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVT
Subjt:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT

Query:  GS
        GS
Subjt:  GS

XP_022934169.1 uncharacterized protein LOC111441417 [Cucurbita moschata]1.3e-15579.2Show/hide
Query:  MPVDSPLIPTNTT----AAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--
        MPVDSPLIPTNTT    AAA TTTTTV PISFTLTTAAATT+A    AAIARPLANQAPSRPISSIPQTHHLHYPPQ LY  Q IPVRTPN QL KL   
Subjt:  MPVDSPLIPTNTT----AAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--

Query:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP
          QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGA NSSDPKVFP
Subjt:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP

Query:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ
        PSTI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQ                                             PQ
Subjt:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ

Query:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        YG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_023526543.1 uncharacterized protein LOC111790016 [Cucurbita pepo subsp. pepo]2.0e-15679.75Show/hide
Query:  MPVDSPLIPTNTTAAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH----QA
        MPVDSPLIPTNTT AA TTTTTV PISFTLTTAAATT    A+AAIARPLANQAPSRPISSIPQTHHLHYPPQ LY  Q IPVRTPN QL KL     QA
Subjt:  MPVDSPLIPTNTTAAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH----QA

Query:  ILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFPPSTI
        ILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGA NSSDPKVFPPSTI
Subjt:  ILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFPPSTI

Query:  CESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNF
         E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQ                                             PQYG+F
Subjt:  CESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNF

Query:  LRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        LRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE L TQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  LRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

XP_038903387.1 uncharacterized protein LOC120089997 [Benincasa hispida]2.2e-16383.12Show/hide
Query:  MPVDSPLIPTNTTAAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH----QA
        MPVDSPLIPTNTTAAAP +TTTVTPISFTLTTAAATTTAAAA AAIARPLANQAPSRPISSIPQTHHLHYPPQ LYA QSIPVRTPN QL KLH    QA
Subjt:  MPVDSPLIPTNTTAAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH----QA

Query:  ILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNL--QQQLIPFSGSSISGSIKGALNSSDPKVFPPS
        ILYPVASSGRGFVPRPI+PLPADQ VTLANPGGY +RPVVTFPHRPIGS HLDSMSHPMHMARPPNL  QQQLIPFSGSSISGSIKG  NSSDPKVF PS
Subjt:  ILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNL--QQQLIPFSGSSISGSIKGALNSSDPKVFPPS

Query:  TICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYG
        TICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ                                             PQYG
Subjt:  TICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYG

Query:  NFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        NFLRSLPRPLPIA AGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  NFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

TrEMBL top hitse value%identityAlignment
A0A0A0LCG5 Uncharacterized protein1.4e-16080.3Show/hide
Query:  MPVDSPLIPTNTTAAA----PTTTTTVTPISFTLTTAAA--TTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH
        MPVDSPLIPTNTT+AA     TTTTTVTPISFTLTTAAA  TTT AAA+AAIARPLANQAPS+PISSIPQTHHLHYP Q LY PQSIPVRTPNAQL KLH
Subjt:  MPVDSPLIPTNTTAAA----PTTTTTVTPISFTLTTAAA--TTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH

Query:  ----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKV
            QAILYPVASSGRGFVPR I+PLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIK A NSSDPK 
Subjt:  ----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKV

Query:  FPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNI
        FPP TICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ                                             
Subjt:  FPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNI

Query:  PQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTG
        PQYG+F RSLPRPLPIAVAGA P QKKEVV+EEVDE+DKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVTG
Subjt:  PQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTG

Query:  S
        S
Subjt:  S

A0A1S3CFG6 uncharacterized protein LOC103500291 isoform X14.1e-16381.09Show/hide
Query:  MPVDSPLIPTNTTAAA------PTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL
        MPVDSPLIPTNTT+AA       TTTTTVTPISFTLT AA A TTAAAA+AAIARPLANQAPSRPISSIPQTHHLHYP Q LY PQSIPVRTPN QL KL
Subjt:  MPVDSPLIPTNTTAAA------PTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL

Query:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK
        H    QAILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIKGA NSSDPK
Subjt:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK

Query:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN
         FPPSTICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ                                            
Subjt:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN

Query:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT
         PQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVT
Subjt:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT

Query:  GS
        GS
Subjt:  GS

A0A5A7TZA2 Mucin-2 isoform X24.1e-16381.09Show/hide
Query:  MPVDSPLIPTNTTAAA------PTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL
        MPVDSPLIPTNTT+AA       TTTTTVTPISFTLT AA A TTAAAA+AAIARPLANQAPSRPISSIPQTHHLHYP Q LY PQSIPVRTPN QL KL
Subjt:  MPVDSPLIPTNTTAAA------PTTTTTVTPISFTLTTAA-ATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKL

Query:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK
        H    QAILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHM RPPNLQQQLIPFSGSSISGSIKGA NSSDPK
Subjt:  H----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPK

Query:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN
         FPPSTICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQ                                            
Subjt:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN

Query:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT
         PQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDNVT
Subjt:  IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVT

Query:  GS
        GS
Subjt:  GS

A0A6J1F1T6 uncharacterized protein LOC1114414176.3e-15679.2Show/hide
Query:  MPVDSPLIPTNTT----AAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--
        MPVDSPLIPTNTT    AAA TTTTTV PISFTLTTAAATT+A    AAIARPLANQAPSRPISSIPQTHHLHYPPQ LY  Q IPVRTPN QL KL   
Subjt:  MPVDSPLIPTNTT----AAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--

Query:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP
          QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGA NSSDPKVFP
Subjt:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP

Query:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ
        PSTI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQ                                             PQ
Subjt:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ

Query:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        YG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE SIE LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

A0A6J1IZP9 uncharacterized protein LOC1114814112.6e-15478.7Show/hide
Query:  MPVDSPLIPTNTT----AAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--
        MPVDSPLIPTNTT    AA  TTTTTV PISFTL+ AAATT    A+AAIARPLANQAPSRPISSIPQTHHLHYPPQ LY  Q IPVRTPN QL KL   
Subjt:  MPVDSPLIPTNTT----AAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYAPQSIPVRTPNAQLQKLH--

Query:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP
          QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIK A NSSDPKVFP
Subjt:  --QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP

Query:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ
        PSTI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQ                                             PQ
Subjt:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQ

Query:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
        YG+FLRSLPRPLPIAV GAVPSQKKEVVEEEVDE+DKDE SIE LSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS
Subjt:  YGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G04930.1 hydroxyproline-rich glycoprotein family protein3.1e-3033.24Show/hide
Query:  IPTNTTAAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYA----PQSIPVRTPNAQLQKLHQAILYPVAS
        IP   ++A+ T + +++  S T  T   T     +    A P       RPI+  P  H   +  Q  Y+      SIPVR    Q+Q    A+LYP A 
Subjt:  IPTNTTAAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYA----PQSIPVRTPNAQLQKLHQAILYPVAS

Query:  SGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSIS-GSIKGALNSSDPKVFPPST-ICESNG
         GRGF  RP++   AD +VT  N  GYP RP  T+   P     ++S+       R P ++       GS +  G I+ +     P+V PP T I +++ 
Subjt:  SGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSIS-GSIKGALNSSDPKVFPPST-ICESNG

Query:  CKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNFLRSL
         ++ R +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    Q                                             PQ    ++ L
Subjt:  CKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNFLRSL

Query:  PRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLL
        P+PLP+ +     S   +  EE  DE+ +DE +++ LS ++LLKRH+ RAKKVR++LREER +RI RYK R+ L+L
Subjt:  PRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein3.8e-2831.28Show/hide
Query:  IPTNTTAAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYA----PQSIPVRTPNAQLQKLHQAILYPVAS
        IP   ++A+ T + +++  S T  T   T     +    A P       RPI+  P  H   +  Q  Y+      SIPVR    Q+Q    A+LYP A 
Subjt:  IPTNTTAAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQPLYA----PQSIPVRTPNAQLQKLHQAILYPVAS

Query:  SGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSI-----------------SGSIKGALNSS
         GRGF  RP++   AD +VT  N  GYP RP  T+   P     ++S+       R P ++       GS +                 SG I G     
Subjt:  SGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSI-----------------SGSIKGALNSS

Query:  DPKVF---------------PPSTICESNGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQH
        DPK                 PP++I +++  ++ R +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    Q                         
Subjt:  DPKVF---------------PPSTICESNGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQH

Query:  YSGLVFMLCCSFLIHILYNIPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKT
                            PQ    ++ LP+PLP+ +     S   +  EE  DE+ +DE +++ LS ++LLKRH+ RAKKVR++LREER +RI RYK 
Subjt:  YSGLVFMLCCSFLIHILYNIPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKT

Query:  RLALLL
        R+ L+L
Subjt:  RLALLL

AT2G32840.1 proline-rich family protein7.1e-4339.26Show/hide
Query:  ISFTLTTAAATTTAAAASA--AIARPLANQAPSRPISSIPQTH--HLHYPPQPLYAPQSIPVRTPNAQLQKLHQ-------AILYPVASSGRGFVPRPIQ
        +S  ++T   T + +  +    +  P +   P  P SS       H H+P Q +Y    +P+R  N+     HQ       +++YP  SSGRGF  RP++
Subjt:  ISFTLTTAAATTTAAAASA--AIARPLANQAPSRPISSIPQTH--HLHYPPQPLYAPQSIPVRTPNAQLQKLHQ-------AILYPVASSGRGFVPRPIQ

Query:  PLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP-PSTICESNGCKEMRVRDD
              A  V   +PGGY P  PV  + H    S +LD M+  M  A P N Q    P  G   SG +KG  +   P+  P P++I +++G K+ R RDD
Subjt:  PLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP-PSTICESNGCKEMRVRDD

Query:  ALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNFLRSLPRPLPIAVAG
        AL +VR RKVRIT+GASLY+LCRSWLRNG+ E                P RI              M+ C                   LP+PLP  V  
Subjt:  ALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNFLRSLPRPLPIAVAG

Query:  AVPSQKKEVVEEEVDEEDK-DEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD
           S  K++VEE + EEDK DE S++HLS  +LLKRH+ RAKKVR+RLREERL+RI RYK RLALLLPP  EQ R +
Subjt:  AVPSQKKEVVEEEVDEEDK-DEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD

AT2G32840.2 proline-rich family protein5.7e-2433.75Show/hide
Query:  ISFTLTTAAATTTAAAASA--AIARPLANQAPSRPISSIPQTH--HLHYPPQPLYAPQSIPVRTPNAQLQKLHQ-------AILYPVASSGRGFVPRPIQ
        +S  ++T   T + +  +    +  P +   P  P SS       H H+P Q +Y    +P+R  N+     HQ       +++YP  SSGRGF  RP++
Subjt:  ISFTLTTAAATTTAAAASA--AIARPLANQAPSRPISSIPQTH--HLHYPPQPLYAPQSIPVRTPNAQLQKLHQ-------AILYPVASSGRGFVPRPIQ

Query:  PLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP-PSTICESNGCKEMRVRDD
              A  V   +PGGY P  PV  + H    S +LD M+  M  A P N Q    P  G   SG +KG  +   P+  P P++I +++G K+ R RDD
Subjt:  PLPADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSIKGALNSSDPKVFP-PSTICESNGCKEMRVRDD

Query:  ALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNFLRSLPRPLPIAVAG
        AL +VR RKVRIT+GASLY+LCRSWLRNG+ E                P RI              M+ C                   LP+PLP  V  
Subjt:  ALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYNIPQYGNFLRSLPRPLPIAVAG

Query:  AVPSQKKEVVEEEVDEEDKD
           S  K++VEE + EEDK+
Subjt:  AVPSQKKEVVEEEVDEEDKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCCCCCACGAGCGTCCAAACCCCCCAATCCTGTCAATTCATTCACAGAAATTCAAGTTTTATTTGGCAATTTGATATTCAAACAATCAAACCACAAACTCCCAA
ATTCCCCGCCACCATGCCCGTCGACTCCCCTCTCATCCCCACCAATACGACGGCCGCCGCCCCCACCACCACCACCACCGTCACTCCGATTTCCTTCACTTTAACTACAG
CTGCAGCCACCACTACCGCCGCCGCTGCCTCTGCCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCCAGACCCATTTCTTCAATTCCTCAAACCCATCATCTCCAC
TACCCTCCTCAACCCCTCTACGCGCCTCAGTCCATCCCCGTTCGAACTCCCAACGCCCAATTGCAGAAGCTTCATCAGGCAATTCTTTACCCTGTCGCCTCCTCTGGCCG
CGGCTTCGTTCCTCGCCCCATTCAGCCCCTTCCCGCCGATCAGGCCGTCACGCTGGCTAACCCTGGCGGTTACCCACATCGCCCCGTTGTCACTTTTCCCCATCGGCCGA
TTGGGTCGCCTCATTTGGACTCCATGAGCCATCCAATGCACATGGCTCGACCTCCCAACTTGCAGCAGCAACTTATTCCCTTTTCTGGGTCCTCCATTTCGGGCTCGATT
AAAGGTGCCCTCAATTCCTCTGATCCAAAGGTTTTTCCTCCATCAACAATCTGCGAGTCAAATGGGTGTAAAGAAATGAGAGTTAGAGACGATGCTCTTTGTGTGGTTAG
AGATCGAAAAGTTCGAATAACTGATGGGGCTTCTCTTTATGCGCTTTGTCGATCATGGCTGAGGAATGGTTCTCAAGAAGAAAGCCAGGTCTGTGTCAGTTCTGATTACA
GCATCTTTCCTTTTGATCCTTATAGGATTAGTTTTATGATTAAGTGTCAACACTACAGCGGATTGGTCTTTATGTTATGTTGTAGCTTTCTCATTCATATTTTGTACAAT
ATTCCACAATATGGAAATTTTTTGAGGTCGCTTCCGAGACCACTGCCCATTGCCGTGGCTGGTGCTGTTCCATCACAGAAGAAGGAAGTTGTCGAAGAAGAAGTTGATGA
GGAAGATAAGGATGAGGGATCCATTGAGCACTTGTCAACGCAAGAGTTATTGAAAAGACATGTTAGACGTGCAAAGAAAGTCCGATCACGATTGAGAGAAGAACGGTTGC
AACGAATTGAAAGATACAAAACCAGGCTTGCTCTTCTCCTTCCTCCTCCAGTCGAGCAGTTGAGAACGGATAATGTTACTGGAAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCCCCCACGAGCGTCCAAACCCCCCAATCCTGTCAATTCATTCACAGAAATTCAAGTTTTATTTGGCAATTTGATATTCAAACAATCAAACCACAAACTCCCAA
ATTCCCCGCCACCATGCCCGTCGACTCCCCTCTCATCCCCACCAATACGACGGCCGCCGCCCCCACCACCACCACCACCGTCACTCCGATTTCCTTCACTTTAACTACAG
CTGCAGCCACCACTACCGCCGCCGCTGCCTCTGCCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCCAGACCCATTTCTTCAATTCCTCAAACCCATCATCTCCAC
TACCCTCCTCAACCCCTCTACGCGCCTCAGTCCATCCCCGTTCGAACTCCCAACGCCCAATTGCAGAAGCTTCATCAGGCAATTCTTTACCCTGTCGCCTCCTCTGGCCG
CGGCTTCGTTCCTCGCCCCATTCAGCCCCTTCCCGCCGATCAGGCCGTCACGCTGGCTAACCCTGGCGGTTACCCACATCGCCCCGTTGTCACTTTTCCCCATCGGCCGA
TTGGGTCGCCTCATTTGGACTCCATGAGCCATCCAATGCACATGGCTCGACCTCCCAACTTGCAGCAGCAACTTATTCCCTTTTCTGGGTCCTCCATTTCGGGCTCGATT
AAAGGTGCCCTCAATTCCTCTGATCCAAAGGTTTTTCCTCCATCAACAATCTGCGAGTCAAATGGGTGTAAAGAAATGAGAGTTAGAGACGATGCTCTTTGTGTGGTTAG
AGATCGAAAAGTTCGAATAACTGATGGGGCTTCTCTTTATGCGCTTTGTCGATCATGGCTGAGGAATGGTTCTCAAGAAGAAAGCCAGGTCTGTGTCAGTTCTGATTACA
GCATCTTTCCTTTTGATCCTTATAGGATTAGTTTTATGATTAAGTGTCAACACTACAGCGGATTGGTCTTTATGTTATGTTGTAGCTTTCTCATTCATATTTTGTACAAT
ATTCCACAATATGGAAATTTTTTGAGGTCGCTTCCGAGACCACTGCCCATTGCCGTGGCTGGTGCTGTTCCATCACAGAAGAAGGAAGTTGTCGAAGAAGAAGTTGATGA
GGAAGATAAGGATGAGGGATCCATTGAGCACTTGTCAACGCAAGAGTTATTGAAAAGACATGTTAGACGTGCAAAGAAAGTCCGATCACGATTGAGAGAAGAACGGTTGC
AACGAATTGAAAGATACAAAACCAGGCTTGCTCTTCTCCTTCCTCCTCCAGTCGAGCAGTTGAGAACGGATAATGTTACTGGAAGCTGAATACGCATCCTGGAATCCTCG
CCCTCGAAATTCACTGTGGATGTTCGTCCAAAACATGCTTCCCCAACTACAGATGAACTCGAGAAACATTCAACGGAAGCGCCAAGGATTATATGTAGGTGCACCAGATA
GATTTTTTATTAGATATTGGTAATTCTTGTCTTCCTATCAGTTCCATTTTGACAGTTTGTAATTGTATAATATGATCTGAATCGGGTTAAAAAAATGAAGTCATTTACTG
AGGTC
Protein sequenceShow/hide protein sequence
MAAPTSVQTPQSCQFIHRNSSFIWQFDIQTIKPQTPKFPATMPVDSPLIPTNTTAAAPTTTTTVTPISFTLTTAAATTTAAAASAAIARPLANQAPSRPISSIPQTHHLH
YPPQPLYAPQSIPVRTPNAQLQKLHQAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSMSHPMHMARPPNLQQQLIPFSGSSISGSI
KGALNSSDPKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQVCVSSDYSIFPFDPYRISFMIKCQHYSGLVFMLCCSFLIHILYN
IPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHLSTQELLKRHVRRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNVTGS