CuGenDBv2

CcUC10G197770 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC10G197770
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Mucin-2 isoform X2
Genome location: CicolChr10:24415365..24420635
RNA-Seq Expression: CcUC10G197770
Synteny: CcUC10G197770
Gene Ontology terms: GO:0017053 - transcriptional repressor complex (cellular component)
InterPro domains: IPR028226 - Protein LIN37
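
The genome location field above packs the sequence name and both coordinates into one string. A minimal sketch of splitting it apart follows; the names `Location` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("CicolChr10:24415365..24420635")
print(loc.seqid, loc.start, loc.end, loc.length)
# CicolChr10 24415365 24420635 5271
```

Under that assumption the gene spans 5271 positions on CicolChr10, which is longer than the mRNA listed further down and would be consistent with introns in the genomic interval.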


Homology
GenBank top hits (e value, % identity, alignment)
XP_004149622.3 uncharacterized protein LOC101211370 [Cucumis sativus] (e value: 1.1e-157; % identity: 90.5)
Query:  TTTTTVTPISFTLTTAAA--TTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQKLH----QAILYPVASSGRGFVPR
        TTTTTVTPISFTLTTAAA  TTT AAA+AAIARPLANQAPS+PISSIPQTHHLHYP QALY PQSIPVRTPNAQL KLH    QAILYPVASSGRGFVPR
Subjt:  TTTTTVTPISFTLTTAAA--TTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQKLH----QAILYPVASSGRGFVPR

Query:  PIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFPPSTICESNGCKEMRVRDDA
         I+PLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDS SHPMHM RPPNLQQQLIPFSGSSISGSIK APNSSDPK FPP TICESNGCKEMRVRDD 
Subjt:  PIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFPPSTICESNGCKEMRVRDDA

Query:  LCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHSSTQELLKRHVRRAKKVRSR
        LCVVRDRKVRI DGASLYALCRSWLRNGSQEESQPQYG+F RSLPRPLPIAVAGA P QKKEVV+EEVDE+DKDEGSIEH STQELLKRHVRRAKKVRSR
Subjt:  LCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHSSTQELLKRHVRRAKKVRSR

Query:  CVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS
          LREERLQRIERYK RLALLLPPP+EQLRTDNVTGS
Subjt:  CVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS
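
In these BLAST-style blocks the middle line carries the comparison: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for the last block of the alignment above. The function name `midline_stats` is hypothetical, and the page's reported % identity is presumably computed over the whole alignment, so a single block will not reproduce it exactly.

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, block length) for one alignment block.

    The midline is padded to the query length because trailing spaces
    are often lost when alignments are copied as plain text.
    """
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "CVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS"
midline = "  LREERLQRIERYK RLALLLPPP+EQLRTDNVTGS"

identities, positives, length = midline_stats(query, midline)
print(identities, positives, length)
# 33 34 37
print(f"{100 * identities / length:.1f}% identity in this block")
```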

XP_008461764.1 PREDICTED: uncharacterized protein LOC103500291 isoform X1 [Cucumis melo] (e value: 3.9e-166; % identity: 90.58)
Query:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAA-ATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQ
        MPVDSPLIPTNTT+AA  T+  TTTTTTTTVTPISFTLT AA A TT AAA+AAIARPLANQAPSRPISSIPQTHHLHYP QALY PQSIPVRTPN QL 
Subjt:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAA-ATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQ

Query:  KLH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSD
        KLH    QAILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDS SHPMHM RPPNLQQQLIPFSGSSISGSIKGAPNSSD
Subjt:  KLH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSD

Query:  PKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEG
        PK FPPSTICESNGCKEMRVRDD LCVVRDRKVRI DGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEG
Subjt:  PKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEG

Query:  SIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS
        SIEH STQELLKRHVRRAKKVRSR  LREERLQRIERYK RLALLLPPP+EQLRTDNVTGS
Subjt:  SIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS

XP_022982576.1 uncharacterized protein LOC111481411 [Cucurbita maxima] (e value: 1.7e-156; % identity: 86.94)
Query:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQK
        MPVDSPLIPTNTT AA+      T TTTTTV PISFTL+ AAATT    A+AAIARPLANQAPSRPISSIPQTHHLHYPPQALY  Q IPVRTPN QL K
Subjt:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQK

Query:  LH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDP
        L     QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDS SHPMHMARPPNLQQQLIPFSGSSISGSIK APNSSDP
Subjt:  LH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDP

Query:  KVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGS
        KVFPPSTI E+NGCKEMRVRDDALCVVRDRKV I DGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEEEVDE+DKDE S
Subjt:  KVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGS

Query:  IEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS
        IE  STQELLKRHVRRAKKVRSR  LREERLQRIERYK RLALLLPPPVEQLRTDNVTGS
Subjt:  IEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS

XP_023526543.1 uncharacterized protein LOC111790016 [Cucurbita pepo subsp. pepo] (e value: 8.8e-158; % identity: 87.22)
Query:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQK
        MPVDSPLIPTNTT AA        TTTTTTV PISFTLTTAAATT    A+AAIARPLANQAPSRPISSIPQTHHLHYPPQALY  Q IPVRTPN QL K
Subjt:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQK

Query:  LH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDP
        L     QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDS SHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDP
Subjt:  LH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDP

Query:  KVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGS
        KVFPPSTI E+NGCKEMRVRDDALCVVRDRKV I DGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE S
Subjt:  KVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGS

Query:  IEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS
        IE   TQELLKRHVRRAKKVRSR  LREERLQRIERYK RLALLLPPPVEQLRTDNVTGS
Subjt:  IEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS

XP_038903387.1 uncharacterized protein LOC120089997 [Benincasa hispida] (e value: 6.3e-164; % identity: 90.06)
Query:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQK
        MPVDSPLIPTNTTAAA          +TTTVTPISFTLTTAAATTT AAA AAIARPLANQAPSRPISSIPQTHHLHYPPQALYA QSIPVRTPN QL K
Subjt:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQK

Query:  LH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNL--QQQLIPFSGSSISGSIKGAPNSS
        LH    QAILYPVASSGRGFVPRPI+PLPADQ VTLANPGGY +RPVVTFPHRPIGS HLDS SHPMHMARPPNL  QQQLIPFSGSSISGSIKG PNSS
Subjt:  LH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNL--QQQLIPFSGSSISGSIKGAPNSS

Query:  DPKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDE
        DPKVF PSTICESNGCKEMRVRDDALCVVRDRKVRI DGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIA AGAVPSQKKEVVEEEVDEEDKDE
Subjt:  DPKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDE

Query:  GSIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS
        GSIEH STQELLKRHVRRAKKVRSR  LREERLQRIERYK RLALLLPPPVEQLRTDNVTGS
Subjt:  GSIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LCG5 Uncharacterized protein (e value: 4.0e-164; % identity: 89.23)
Query:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAA--TTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQL
        MPVDSPLIPTNTT+AA    T+ TTTTTTTVTPISFTLTTAAA  TTT AAA+AAIARPLANQAPS+PISSIPQTHHLHYP QALY PQSIPVRTPNAQL
Subjt:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAA--TTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQL

Query:  QKLH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSS
         KLH    QAILYPVASSGRGFVPR I+PLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDS SHPMHM RPPNLQQQLIPFSGSSISGSIK APNSS
Subjt:  QKLH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSS

Query:  DPKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDE
        DPK FPP TICESNGCKEMRVRDD LCVVRDRKVRI DGASLYALCRSWLRNGSQEESQPQYG+F RSLPRPLPIAVAGA P QKKEVV+EEVDE+DKDE
Subjt:  DPKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDE

Query:  GSIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS
        GSIEH STQELLKRHVRRAKKVRSR  LREERLQRIERYK RLALLLPPP+EQLRTDNVTGS
Subjt:  GSIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS

A0A1S3CFG6 uncharacterized protein LOC103500291 isoform X1 (e value: 1.9e-166; % identity: 90.58)
Query:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAA-ATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQ
        MPVDSPLIPTNTT+AA  T+  TTTTTTTTVTPISFTLT AA A TT AAA+AAIARPLANQAPSRPISSIPQTHHLHYP QALY PQSIPVRTPN QL 
Subjt:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAA-ATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQ

Query:  KLH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSD
        KLH    QAILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDS SHPMHM RPPNLQQQLIPFSGSSISGSIKGAPNSSD
Subjt:  KLH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSD

Query:  PKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEG
        PK FPPSTICESNGCKEMRVRDD LCVVRDRKVRI DGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEG
Subjt:  PKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEG

Query:  SIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS
        SIEH STQELLKRHVRRAKKVRSR  LREERLQRIERYK RLALLLPPP+EQLRTDNVTGS
Subjt:  SIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS

A0A5A7TZA2 Mucin-2 isoform X2 (e value: 1.9e-166; % identity: 90.58)
Query:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAA-ATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQ
        MPVDSPLIPTNTT+AA  T+  TTTTTTTTVTPISFTLT AA A TT AAA+AAIARPLANQAPSRPISSIPQTHHLHYP QALY PQSIPVRTPN QL 
Subjt:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAA-ATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQ

Query:  KLH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSD
        KLH    QAILYPVASSGRGFVPRPI+PLP DQAVTLANPGGYPHRPVVTFPHRPIGSPHLDS SHPMHM RPPNLQQQLIPFSGSSISGSIKGAPNSSD
Subjt:  KLH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSD

Query:  PKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEG
        PK FPPSTICESNGCKEMRVRDD LCVVRDRKVRI DGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEVV+EEVDEEDKDEG
Subjt:  PKVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEG

Query:  SIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS
        SIEH STQELLKRHVRRAKKVRSR  LREERLQRIERYK RLALLLPPP+EQLRTDNVTGS
Subjt:  SIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS

A0A6J1F1T6 uncharacterized protein LOC111441417 (e value: 8.0e-157; % identity: 86.67)
Query:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQK
        MPVDSPLIPTNTT AA         TTTTTV PISFTLTTAAATT    ++AAIARPLANQAPSRPISSIPQTHHLHYPPQALY  Q IPVRTPN QL K
Subjt:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQK

Query:  LH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDP
        L     QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDS SHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDP
Subjt:  LH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDP

Query:  KVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGS
        KVFPPSTI E+NGCKEMRVRDDALCVVRDRKV I DGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEE VDE+DKDE S
Subjt:  KVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGS

Query:  IEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS
        IE  STQELLKRHVRRAKKVRSR  LREERLQRIERYK RLALLLPPPVEQLRTDN+TGS
Subjt:  IEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS

A0A6J1IZP9 uncharacterized protein LOC111481411 (e value: 8.0e-157; % identity: 86.94)
Query:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQK
        MPVDSPLIPTNTT AA+      T TTTTTV PISFTL+ AAATT    A+AAIARPLANQAPSRPISSIPQTHHLHYPPQALY  Q IPVRTPN QL K
Subjt:  MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQK

Query:  LH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDP
        L     QAILYPVASSGRGFVPRPI+PLP DQ VT+ANPGGYPHRPVV+FPHRPIGSPHLDS SHPMHMARPPNLQQQLIPFSGSSISGSIK APNSSDP
Subjt:  LH----QAILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDP

Query:  KVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGS
        KVFPPSTI E+NGCKEMRVRDDALCVVRDRKV I DGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GAVPSQKKEVVEEEVDE+DKDE S
Subjt:  KVFPPSTICESNGCKEMRVRDDALCVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGS

Query:  IEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS
        IE  STQELLKRHVRRAKKVRSR  LREERLQRIERYK RLALLLPPPVEQLRTDNVTGS
Subjt:  IEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLLPPPVEQLRTDNVTGS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G04930.1 hydroxyproline-rich glycoprotein family protein (e value: 2.9e-34; % identity: 37.24)
Query:  IPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYA----PQSIPVRTPNAQLQKLHQ
        IP   ++A+ T + + +T + T VTP++        T  P  +    A P       RPI+  P  H   +  Q+ Y+      SIPVR    Q+Q    
Subjt:  IPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYA----PQSIPVRTPNAQLQKLHQ

Query:  AILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSIS-GSIKGAPNSSDPKVFPPS
        A+LYP A  GRGF  RP++   AD +VT  N  GYP RP  T+   P     ++S        R P ++       GS +  G I+ +P    P+V PP 
Subjt:  AILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSIS-GSIKGAPNSSDPKVFPPS

Query:  T-ICESNGCKEMRVRDDALCVVRDRKVRIADG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHS
        T I +++  ++ R +D AL VVR RKVRI +G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +     S   +  EE  DE+ +DE +++  
Subjt:  T-ICESNGCKEMRVRDDALCVVRDRKVRIADG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHS

Query:  STQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLL
        S ++LLKRH+ RAKKVR++  LREER +RI RYK R+ L+L
Subjt:  STQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein (e value: 3.9e-31; % identity: 34.5)
Query:  IPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYA----PQSIPVRTPNAQLQKLHQ
        IP   ++A+ T + + +T + T VTP++        T  P  +    A P       RPI+  P  H   +  Q+ Y+      SIPVR    Q+Q    
Subjt:  IPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYA----PQSIPVRTPNAQLQKLHQ

Query:  AILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSI-----------------SGS
        A+LYP A  GRGF  RP++   AD +VT  N  GYP RP  T+   P     ++S        R P ++       GS +                 SG 
Subjt:  AILYPVASSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSI-----------------SGS

Query:  IKGAPNSSDPKVF---------------PPSTICESNGCKEMRVRDDALCVVRDRKVRIADG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIA
        I G     DPK                 PP++I +++  ++ R +D AL VVR RKVRI +G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ 
Subjt:  IKGAPNSSDPKVF---------------PPSTICESNGCKEMRVRDDALCVVRDRKVRIADG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIA

Query:  VAGAVPSQKKEVVEEEVDEEDKDEGSIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLL
        +     S   +  EE  DE+ +DE +++  S ++LLKRH+ RAKKVR++  LREER +RI RYK R+ L+L
Subjt:  VAGAVPSQKKEVVEEEVDEEDKDEGSIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRIERYKIRLALLL

AT2G32840.1 proline-rich family protein (e value: 4.6e-48; % identity: 42.77)
Query:  ISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTH--HLHYPPQALYAPQSIPVRTPNAQLQKLHQ-------AILYPVASSGRGFVPRPIQPL
        ++ +     A+ +P      +  P +   P  P SS       H H+P Q +Y    +P+R  N+     HQ       +++YP  SSGRGF  RP++  
Subjt:  ISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTH--HLHYPPQALYAPQSIPVRTPNAQLQKLHQ-------AILYPVASSGRGFVPRPIQPL

Query:  PADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP-PSTICESNGCKEMRVRDDAL
            A  V   +PGGY P  PV  + H    S +LD  +  M  A P N Q    P  G   SG +KG P+   P+  P P++I +++G K+ R RDDAL
Subjt:  PADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP-PSTICESNGCKEMRVRDDAL

Query:  CVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDK-DEGSIEHSSTQELLKRHVRRAKKVRSR
         +VR RKVRI +GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP  V     S  K++VEE + EEDK DE S++H S  +LLKRH+ RAKKVR+R
Subjt:  CVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDK-DEGSIEHSSTQELLKRHVRRAKKVRSR

Query:  CVLREERLQRIERYKIRLALLLPPPVEQLRTD
          LREERL+RI RYK RLALLLPP  EQ R +
Subjt:  CVLREERLQRIERYKIRLALLLPPPVEQLRTD

AT2G32840.2 proline-rich family protein (e value: 3.9e-31; % identity: 37.73)
Query:  ISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTH--HLHYPPQALYAPQSIPVRTPNAQLQKLHQ-------AILYPVASSGRGFVPRPIQPL
        ++ +     A+ +P      +  P +   P  P SS       H H+P Q +Y    +P+R  N+     HQ       +++YP  SSGRGF  RP++  
Subjt:  ISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTH--HLHYPPQALYAPQSIPVRTPNAQLQKLHQ-------AILYPVASSGRGFVPRPIQPL

Query:  PADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP-PSTICESNGCKEMRVRDDAL
            A  V   +PGGY P  PV  + H    S +LD  +  M  A P N Q    P  G   SG +KG P+   P+  P P++I +++G K+ R RDDAL
Subjt:  PADQA--VTLANPGGY-PHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFP-PSTICESNGCKEMRVRDDAL

Query:  CVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKD
         +VR RKVRI +GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP  V     S  K++VEE + EEDK+
Subjt:  CVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKD


Sequences
CDS sequence
ATGCCCGTCGACTCCCCTCTCATCCCCACCAATACGACGGCCGCCGCCTCCACCACCACCACCACCACCACCACCACCACCACCACCACCGTCACTCCGATTTCATTCAC
TTTAACTACAGCTGCAGCCACCACTACCCCCGCCGCTGCCTCTGCCGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCCAGACCCATTTCTTCAATTCCTCAAACCC
ATCATCTCCACTACCCTCCTCAAGCCCTCTACGCGCCTCAGTCCATCCCCGTTCGAACTCCCAACGCCCAATTGCAGAAGCTTCATCAGGCAATTCTTTACCCTGTCGCC
TCCTCTGGCCGCGGCTTCGTTCCTCGCCCCATTCAGCCCCTTCCCGCCGATCAGGCCGTCACGCTGGCTAACCCTGGCGGTTACCCACATCGCCCCGTTGTCACTTTTCC
CCATCGGCCGATTGGGTCGCCTCATTTGGACTCCACGAGCCATCCAATGCACATGGCTCGACCTCCCAACTTGCAGCAGCAACTTATTCCCTTTTCTGGGTCCTCCATTT
CGGGCTCGATTAAAGGTGCCCCCAATTCCTCTGATCCAAAGGTTTTTCCTCCATCAACAATCTGCGAGTCAAATGGGTGTAAAGAAATGAGAGTTAGAGACGATGCTCTT
TGTGTGGTTAGAGATCGAAAAGTTCGAATAGCTGATGGGGCTTCTCTTTATGCGCTTTGTCGATCATGGCTGAGGAATGGTTCTCAAGAAGAAAGCCAGCCACAATATGG
AAATTTTTTGAGGTCACTTCCGAGACCACTGCCCATTGCTGTGGCTGGTGCTGTTCCATCACAGAAGAAGGAAGTTGTCGAAGAAGAAGTTGATGAGGAAGATAAGGATG
AGGGATCCATTGAGCACTCGTCAACACAAGAGTTATTGAAAAGACATGTTAGACGTGCAAAGAAAGTCCGATCACGGTGTGTATTGAGAGAAGAACGGTTGCAACGAATT
GAAAGATACAAAATCAGGCTTGCTCTTCTCCTTCCTCCTCCAGTCGAGCAGTTGAGAACGGATAATGTTACTGGAAGCTGA
mRNA sequence
TTGATATTCAAACAATCAAACCACAAACTCCCAAATTCCCCGCCACGATGCCCGTCGACTCCCCTCTCATCCCCACCAATACGACGGCCGCCGCCTCCACCACCACCACC
ACCACCACCACCACCACCACCACCACCGTCACTCCGATTTCATTCACTTTAACTACAGCTGCAGCCACCACTACCCCCGCCGCTGCCTCTGCCGCCATTGCTCGTCCGCT
TGCGAATCAAGCGCCATCCAGACCCATTTCTTCAATTCCTCAAACCCATCATCTCCACTACCCTCCTCAAGCCCTCTACGCGCCTCAGTCCATCCCCGTTCGAACTCCCA
ACGCCCAATTGCAGAAGCTTCATCAGGCAATTCTTTACCCTGTCGCCTCCTCTGGCCGCGGCTTCGTTCCTCGCCCCATTCAGCCCCTTCCCGCCGATCAGGCCGTCACG
CTGGCTAACCCTGGCGGTTACCCACATCGCCCCGTTGTCACTTTTCCCCATCGGCCGATTGGGTCGCCTCATTTGGACTCCACGAGCCATCCAATGCACATGGCTCGACC
TCCCAACTTGCAGCAGCAACTTATTCCCTTTTCTGGGTCCTCCATTTCGGGCTCGATTAAAGGTGCCCCCAATTCCTCTGATCCAAAGGTTTTTCCTCCATCAACAATCT
GCGAGTCAAATGGGTGTAAAGAAATGAGAGTTAGAGACGATGCTCTTTGTGTGGTTAGAGATCGAAAAGTTCGAATAGCTGATGGGGCTTCTCTTTATGCGCTTTGTCGA
TCATGGCTGAGGAATGGTTCTCAAGAAGAAAGCCAGCCACAATATGGAAATTTTTTGAGGTCACTTCCGAGACCACTGCCCATTGCTGTGGCTGGTGCTGTTCCATCACA
GAAGAAGGAAGTTGTCGAAGAAGAAGTTGATGAGGAAGATAAGGATGAGGGATCCATTGAGCACTCGTCAACACAAGAGTTATTGAAAAGACATGTTAGACGTGCAAAGA
AAGTCCGATCACGGTGTGTATTGAGAGAAGAACGGTTGCAACGAATTGAAAGATACAAAATCAGGCTTGCTCTTCTCCTTCCTCCTCCAGTCGAGCAGTTGAGAACGGAT
AATGTTACTGGAAGCTGAATACACATCCTGGAATCCTCGCCCTCGAAATTCACTGTGGATTTTCGTCCAAAACATTCTTCCCCAACTACAGATGAACTCGAGAAACATTC
AACGGAGGCGCCAAGGATTATATGTAGGTGCACCATATAGATTTTTTATTAGATATTGGTAATTCTTGTCTTCCTATCAGTTCCTTTTTGACAGTATGTAATTGTATAAT
ATGGTCTGAATCAGGTTAAAAAAAAATGAAGTCATTTACTGAGGTTAGAAGTAAATCACATATTCCC
Protein sequence
MPVDSPLIPTNTTAAASTTTTTTTTTTTTTVTPISFTLTTAAATTTPAAASAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAPQSIPVRTPNAQLQKLHQAILYPVA
SSGRGFVPRPIQPLPADQAVTLANPGGYPHRPVVTFPHRPIGSPHLDSTSHPMHMARPPNLQQQLIPFSGSSISGSIKGAPNSSDPKVFPPSTICESNGCKEMRVRDDAL
CVVRDRKVRIADGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAVPSQKKEVVEEEVDEEDKDEGSIEHSSTQELLKRHVRRAKKVRSRCVLREERLQRI
ERYKIRLALLLPPPVEQLRTDNVTGS
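
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes but is not stated in the record. The helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used here for brevity.

```python
# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGCCCGTCGACTCCCCTCTC"))
# MPVDSPL

# Last three codons of the CDS, including the TGA stop.
print(translate("GGAAGCTGA"))
# GS
```

Both fragments agree with the start (`MPVDSPL...`) and end (`...GS`) of the protein sequence above. Passing the full CDS, with its line breaks left in, should reproduce the whole protein if the record is internally consistent.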