CuGenDBv2

Lag0031192 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031192
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Mucin-2 isoform X2
Genome location: chr11:5664217..5670358
RNA-Seq Expression: Lag0031192
Synteny: Lag0031192
Gene Ontology terms: GO:0017053 - transcriptional repressor complex (cellular component)
InterPro domains: IPR028226 - Protein LIN37
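
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the locus span can be derived as follows. The helper name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(loc: str):
    """Split 'chrom:start..end' into (chrom, start, end) as (str, int, int)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr11:5664217..5670358")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 6142 bp on chr11; with 0-based half-open coordinates the span would instead be `end - start`.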


Homology
GenBank top hits (e value, % identity, alignment)
XP_008461764.1 PREDICTED: uncharacterized protein LOC103500291 isoform X1 [Cucumis melo] (e value: 5.4e-168; % identity: 88.24)
Query:  MPVDSPLIPTNTTAAA-----ATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKL
        MPVDSPLIPTNTT+AA      T TTTTV PISFTLT AA   T AAAA AAIARPLANQAPSRPISSIPQTHHLHYP QALY  Q IPVRTPN QLPKL
Subjt:  MPVDSPLIPTNTTAAA-----ATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKL

Query:  QQDASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPK
         QDASQAILYPVASSGRGFVPRPIRPLP DQAVT+ANPGG+PHRPVVTF HRPIGSPHLDSMSHPMH+ RPPNLQQQLIP SGS+I GSIKGAPNSSDPK
Subjt:  QQDASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPK

Query:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSI
         FPPSTICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEV++EEVDEEDKDEGSI
Subjt:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSI

Query:  DHLSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT
        +HLSTQ+LLKRHVRRAKKVRSRLREERL RIERYKTRLALLLPPP+EQLRTDN+TG+
Subjt:  DHLSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT

XP_022934169.1 uncharacterized protein LOC111441417 [Cucurbita moschata] (e value: 1.1e-165; % identity: 89.01)
Query:  MPVDSPLIPTNTT---AAAATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQ
        MPVDSPLIPTNTT    AAA  TTTTVAPISFTLTTAAATT     +AAAIARPLANQAPSRPISSIPQTHHLHYPPQALY AQ IPVRTPN QLPKLQQ
Subjt:  MPVDSPLIPTNTT---AAAATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQ

Query:  DASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVF
        DASQAILYPVASSGRGFVPRPIRPLP DQ VTVANPGG+PHRPVV+F HRPIGSPHLDSMSHPMH+ARPPNLQQQLIP SGS+I GSIKGAPNSSDPKVF
Subjt:  DASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVF

Query:  PPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDH
        PPSTI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GA+PSQKKEV+EE VDE+DKDE SI+ 
Subjt:  PPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDH

Query:  LSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT
        LSTQ+LLKRHVRRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITG+
Subjt:  LSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT

XP_022982576.1 uncharacterized protein LOC111481411 [Cucurbita maxima] (e value: 7.3e-165; % identity: 88.73)
Query:  MPVDSPLIPTNTTAAAATA---TTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQ
        MPVDSPLIPTNTT AAA A   TTTTVAPISFTL+ AAATT     AAAAIARPLANQAPSRPISSIPQTHHLHYPPQALY AQ IPVRTPN QLPKLQQ
Subjt:  MPVDSPLIPTNTTAAAATA---TTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQ

Query:  DASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVF
        DASQAILYPVASSGRGFVPRPIRPLP DQ VTVANPGG+PHRPVV+F HRPIGSPHLDSMSHPMH+ARPPNLQQQLIP SGS+I GSIK APNSSDPKVF
Subjt:  DASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVF

Query:  PPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDH
        PPSTI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GA+PSQKKEV+EEEVDE+DKDE SI+ 
Subjt:  PPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDH

Query:  LSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT
        LSTQ+LLKRHVRRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDN+TG+
Subjt:  LSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT

XP_023526543.1 uncharacterized protein LOC111790016 [Cucurbita pepo subsp. pepo] (e value: 1.1e-165; % identity: 89.49)
Query:  MPVDSPLIPTNTTAAAATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDAS
        MPVDSPLIPTNTT  AAT TTTTVAPISFTLTTAAATT     AAAAIARPLANQAPSRPISSIPQTHHLHYPPQALY AQ IPVRTPN QLPKLQQDAS
Subjt:  MPVDSPLIPTNTTAAAATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDAS

Query:  QAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVFPPS
        QAILYPVASSGRGFVPRPIRPLP DQ VTVANPGG+PHRPVV+F HRPIGSPHLDSMSHPMH+ARPPNLQQQLIP SGS+I GSIKGAPNSSDPKVFPPS
Subjt:  QAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVFPPS

Query:  TICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDHLST
        TI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GA+PSQKKEV+EE VDE+DKDE SI+ L T
Subjt:  TICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDHLST

Query:  QDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT
        Q+LLKRHVRRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDN+TG+
Subjt:  QDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT

XP_038903387.1 uncharacterized protein LOC120089997 [Benincasa hispida] (e value: 3.2e-168; % identity: 90.68)
Query:  MPVDSPLIPTNTTAAAATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDAS
        MPVDSPLIPTNTTAAA   +TTTV PISFTLTTAAATTT AAAA AAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQ IPVRTPN+QLPKL QDAS
Subjt:  MPVDSPLIPTNTTAAAATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDAS

Query:  QAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNL--QQQLIPLSGSAIQGSIKGAPNSSDPKVFP
        QAILYPVASSGRGFVPRPIRPLPADQ VT+ANPGG+ +RPVVTF HRPIGS HLDSMSHPMH+ARPPNL  QQQLIP SGS+I GSIKG PNSSDPKVF 
Subjt:  QAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNL--QQQLIPLSGSAIQGSIKGAPNSSDPKVFP

Query:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDHL
        PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIA AGA+PSQKKEV+EEEVDEEDKDEGSI+HL
Subjt:  PSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDHL

Query:  STQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT
        STQ+LLKRHVRRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDN+TG+
Subjt:  STQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LCG5 Uncharacterized protein (e value: 4.6e-165; % identity: 87.64)
Query:  MPVDSPLIPTNTTAA---AATATTTTVAPISFTLTT-AAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQ
        MPVDSPLIPTNTT+A   A T TTTTV PISFTLTT AAATTT  AAA AAIARPLANQAPS+PISSIPQTHHLHYP QALY  Q IPVRTPN QLPKL 
Subjt:  MPVDSPLIPTNTTAA---AATATTTTVAPISFTLTT-AAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQ

Query:  QDASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKV
        QDASQAILYPVASSGRGFVPR IRPLPADQAVT+ANPGG+PHRPVVTF HRPIGSPHLDSMSHPMH+ RPPNLQQQLIP SGS+I GSIK APNSSDPK 
Subjt:  QDASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKV

Query:  FPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSID
        FPP TICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+F RSLPRPLPIAVAGA P QKKEV++EEVDE+DKDEGSI+
Subjt:  FPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSID

Query:  HLSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT
        HLSTQ+LLKRHVRRAKKVRSRLREERL RIERYKTRLALLLPPP+EQLRTDN+TG+
Subjt:  HLSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT

A0A1S3CFG6 uncharacterized protein LOC103500291 isoform X1 (e value: 2.6e-168; % identity: 88.24)
Query:  MPVDSPLIPTNTTAAA-----ATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKL
        MPVDSPLIPTNTT+AA      T TTTTV PISFTLT AA   T AAAA AAIARPLANQAPSRPISSIPQTHHLHYP QALY  Q IPVRTPN QLPKL
Subjt:  MPVDSPLIPTNTTAAA-----ATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKL

Query:  QQDASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPK
         QDASQAILYPVASSGRGFVPRPIRPLP DQAVT+ANPGG+PHRPVVTF HRPIGSPHLDSMSHPMH+ RPPNLQQQLIP SGS+I GSIKGAPNSSDPK
Subjt:  QQDASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPK

Query:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSI
         FPPSTICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEV++EEVDEEDKDEGSI
Subjt:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSI

Query:  DHLSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT
        +HLSTQ+LLKRHVRRAKKVRSRLREERL RIERYKTRLALLLPPP+EQLRTDN+TG+
Subjt:  DHLSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT

A0A5A7TZA2 Mucin-2 isoform X2 (e value: 2.6e-168; % identity: 88.24)
Query:  MPVDSPLIPTNTTAAA-----ATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKL
        MPVDSPLIPTNTT+AA      T TTTTV PISFTLT AA   T AAAA AAIARPLANQAPSRPISSIPQTHHLHYP QALY  Q IPVRTPN QLPKL
Subjt:  MPVDSPLIPTNTTAAA-----ATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKL

Query:  QQDASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPK
         QDASQAILYPVASSGRGFVPRPIRPLP DQAVT+ANPGG+PHRPVVTF HRPIGSPHLDSMSHPMH+ RPPNLQQQLIP SGS+I GSIKGAPNSSDPK
Subjt:  QQDASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPK

Query:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSI
         FPPSTICESNGCKEMRVRDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAVAGA PSQKKEV++EEVDEEDKDEGSI
Subjt:  VFPPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSI

Query:  DHLSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT
        +HLSTQ+LLKRHVRRAKKVRSRLREERL RIERYKTRLALLLPPP+EQLRTDN+TG+
Subjt:  DHLSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT

A0A6J1F1T6 uncharacterized protein LOC111441417 (e value: 5.5e-166; % identity: 89.01)
Query:  MPVDSPLIPTNTT---AAAATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQ
        MPVDSPLIPTNTT    AAA  TTTTVAPISFTLTTAAATT     +AAAIARPLANQAPSRPISSIPQTHHLHYPPQALY AQ IPVRTPN QLPKLQQ
Subjt:  MPVDSPLIPTNTT---AAAATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQ

Query:  DASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVF
        DASQAILYPVASSGRGFVPRPIRPLP DQ VTVANPGG+PHRPVV+F HRPIGSPHLDSMSHPMH+ARPPNLQQQLIP SGS+I GSIKGAPNSSDPKVF
Subjt:  DASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVF

Query:  PPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDH
        PPSTI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GA+PSQKKEV+EE VDE+DKDE SI+ 
Subjt:  PPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDH

Query:  LSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT
        LSTQ+LLKRHVRRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITG+
Subjt:  LSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT

A0A6J1IZP9 uncharacterized protein LOC111481411 (e value: 3.5e-165; % identity: 88.73)
Query:  MPVDSPLIPTNTTAAAATA---TTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQ
        MPVDSPLIPTNTT AAA A   TTTTVAPISFTL+ AAATT     AAAAIARPLANQAPSRPISSIPQTHHLHYPPQALY AQ IPVRTPN QLPKLQQ
Subjt:  MPVDSPLIPTNTTAAAATA---TTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQ

Query:  DASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVF
        DASQAILYPVASSGRGFVPRPIRPLP DQ VTVANPGG+PHRPVV+F HRPIGSPHLDSMSHPMH+ARPPNLQQQLIP SGS+I GSIK APNSSDPKVF
Subjt:  DASQAILYPVASSGRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVF

Query:  PPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDH
        PPSTI E+NGCKEMRVRDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPIAV GA+PSQKKEV+EEEVDE+DKDE SI+ 
Subjt:  PPSTICESNGCKEMRVRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDH

Query:  LSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT
        LSTQ+LLKRHVRRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDN+TG+
Subjt:  LSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLLPPPVEQLRTDNITGT

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G04930.1 hydroxyproline-rich glycoprotein family protein (e value: 6.0e-40; % identity: 38.15)
Query:  SFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHH----LHYP----PQALYA----AQPIPVRTPNHQLPKLQQDASQAILYPVASSGRGFV
        S T++ + +T +          RP  +Q P  P    P T+     L +P     Q+ Y+    A  IPVR       +  QD S A+LYP A  GRGF 
Subjt:  SFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHH----LHYP----PQALYA----AQPIPVRTPNHQLPKLQQDASQAILYPVASSGRGFV

Query:  PRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQ-QLIPLSGSAIQGSIKGAPNSSDPKVFPPST-ICESNGCKEMRV
         RP+R   AD +VT  N  G+P RP  T+   P     ++S+       R P ++    + L      G I+ +P    P+V PP T I +++  ++ R 
Subjt:  PRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQ-QLIPLSGSAIQGSIKGAPNSSDPKVFPPST-ICESNGCKEMRV

Query:  RDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDHLSTQDLLKRHVRRAK
        +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +     S   +  EE  DE+ +DE ++  LS +DLLKRH+ RAK
Subjt:  RDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDHLSTQDLLKRHVRRAK

Query:  KVRSRLREERLLRIERYKTRLALLL
        KVR++LREER  RI RYK R+ L+L
Subjt:  KVRSRLREERLLRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein (e value: 1.4e-36; % identity: 35.49)
Query:  SFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHH----LHYP----PQALYA----AQPIPVRTPNHQLPKLQQDASQAILYPVASSGRGFV
        S T++ + +T +          RP  +Q P  P    P T+     L +P     Q+ Y+    A  IPVR       +  QD S A+LYP A  GRGF 
Subjt:  SFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHH----LHYP----PQALYA----AQPIPVRTPNHQLPKLQQDASQAILYPVASSGRGFV

Query:  PRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQ----------QLIPLSGS-------AIQGSIKGAPNSSDPKVF-
         RP+R   AD +VT  N  G+P RP  T+   P     ++S+       R P ++            L P+  S          G I G     DPK   
Subjt:  PRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQ----------QLIPLSGS-------AIQGSIKGAPNSSDPKVF-

Query:  --------------PPSTICESNGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIE
                      PP++I +++  ++ R +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +     S   +  E
Subjt:  --------------PPSTICESNGCKEMRVRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIE

Query:  EEVDEEDKDEGSIDHLSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLL
        E  DE+ +DE ++  LS +DLLKRH+ RAKKVR++LREER  RI RYK R+ L+L
Subjt:  EEVDEEDKDEGSIDHLSTQDLLKRHVRRAKKVRSRLREERLLRIERYKTRLALLL

AT2G32840.1 proline-rich family protein (e value: 5.8e-51; % identity: 44.48)
Query:  ISFTLTTAAATTTAAAAAAA-AIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQPIPVRTPN------HQLPKLQQDASQAILYPVASSGRGFVPR
        +S  ++T   T + +       +  P +   P  P SS       H H+P Q +Y   P+P+R  N      HQ P    D S +++YP  SSGRGF  R
Subjt:  ISFTLTTAAATTTAAAAAAA-AIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQPIPVRTPN------HQLPKLQQDASQAILYPVASSGRGFVPR

Query:  PIRPLPADQAVTVA--NPGGF-PHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVFP-PSTICESNGCKEMRV
        P+R      A  V   +PGG+ P  PV  + H    S +LD M+  M  A P N Q    P  GS   G +KG P+   P+  P P++I +++G K+ R 
Subjt:  PIRPLPADQAVTVA--NPGGF-PHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVFP-PSTICESNGCKEMRV

Query:  RDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDK-DEGSIDHLSTQDLLKRHVRRAK
        RDDAL +VR RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP+       S  K+++EE + EEDK DE S+ HLS  DLLKRH+ RAK
Subjt:  RDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDK-DEGSIDHLSTQDLLKRHVRRAK

Query:  KVRSRLREERLLRIERYKTRLALLLPPPVEQLRTD
        KVR+RLREERL RI RYK RLALLLPP  EQ R +
Subjt:  KVRSRLREERLLRIERYKTRLALLLPPPVEQLRTD

AT2G32840.2 proline-rich family protein (e value: 7.9e-32; % identity: 38.85)
Query:  ISFTLTTAAATTTAAAAAAA-AIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQPIPVRTPN------HQLPKLQQDASQAILYPVASSGRGFVPR
        +S  ++T   T + +       +  P +   P  P SS       H H+P Q +Y   P+P+R  N      HQ P    D S +++YP  SSGRGF  R
Subjt:  ISFTLTTAAATTTAAAAAAA-AIARPLANQAPSRPISSIPQTH--HLHYPPQALYAAQPIPVRTPN------HQLPKLQQDASQAILYPVASSGRGFVPR

Query:  PIRPLPADQAVTVA--NPGGF-PHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVFP-PSTICESNGCKEMRV
        P+R      A  V   +PGG+ P  PV  + H    S +LD M+  M  A P N Q    P  GS   G +KG P+   P+  P P++I +++G K+ R 
Subjt:  PIRPLPADQAVTVA--NPGGF-PHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVFP-PSTICESNGCKEMRV

Query:  RDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKD
        RDDAL +VR RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP+       S  K+++EE + EEDK+
Subjt:  RDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKD


Sequences
CDS sequence
ATGCCCGTCGACTCCCCCCTCATCCCCACCAACACGACGGCCGCCGCCGCCACCGCCACCACCACCACCGTCGCTCCGATTTCCTTCACTCTAACCACAGCTGCAGCCAC
CACCACCGCCGCCGCCGCTGCTGCCGCTGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCGAGACCCATTTCTTCAATTCCTCAGACCCACCATCTCCACTACCCTC
CTCAAGCCCTCTACGCGGCGCAGCCCATCCCCGTTCGAACTCCGAACCACCAATTGCCGAAGCTTCAGCAAGATGCATCTCAGGCAATTCTTTACCCTGTCGCTTCCTCC
GGCCGCGGCTTCGTTCCTCGCCCTATTCGGCCCCTTCCCGCCGATCAGGCCGTCACGGTGGCCAACCCTGGCGGTTTCCCACATCGCCCCGTCGTCACTTTCACGCATCG
GCCGATTGGGTCGCCTCATTTGGACTCCATGAGCCATCCAATGCACTTGGCCCGACCTCCCAACTTGCAACAGCAGTTGATTCCCCTTTCTGGGTCCGCCATTCAGGGCT
CGATTAAGGGTGCCCCCAATTCCTCTGATCCAAAGGTTTTTCCTCCATCAACAATCTGCGAGTCAAATGGGTGTAAAGAAATGAGAGTCAGAGACGACGCTCTTTGTGTG
GTTAGAGATAGAAAAGTCCGAATAACTGATGGGGCTTCTCTTTATGCACTTTGTCGATCATGGTTGAGGAATGGTTCTCAAGAAGAAAGCCAGCCACAATATGGAAACTT
TTTGAGGTCTCTTCCAAGACCTTTGCCCATAGCCGTGGCTGGTGCTATACCGTCGCAGAAGAAGGAAGTCATCGAGGAAGAAGTTGATGAGGAAGATAAGGATGAGGGAT
CCATTGACCACTTGTCAACGCAAGATTTGTTGAAAAGACATGTTAGACGTGCAAAGAAAGTTCGATCACGATTGAGGGAAGAACGGTTGCTACGAATCGAAAGATACAAA
ACCAGGCTCGCTCTTCTCCTTCCTCCGCCAGTCGAACAATTGAGGACTGATAATATTACTGGAACCTGA
mRNA sequence
ATGCCCGTCGACTCCCCCCTCATCCCCACCAACACGACGGCCGCCGCCGCCACCGCCACCACCACCACCGTCGCTCCGATTTCCTTCACTCTAACCACAGCTGCAGCCAC
CACCACCGCCGCCGCCGCTGCTGCCGCTGCCATTGCTCGTCCGCTTGCGAATCAAGCGCCATCGAGACCCATTTCTTCAATTCCTCAGACCCACCATCTCCACTACCCTC
CTCAAGCCCTCTACGCGGCGCAGCCCATCCCCGTTCGAACTCCGAACCACCAATTGCCGAAGCTTCAGCAAGATGCATCTCAGGCAATTCTTTACCCTGTCGCTTCCTCC
GGCCGCGGCTTCGTTCCTCGCCCTATTCGGCCCCTTCCCGCCGATCAGGCCGTCACGGTGGCCAACCCTGGCGGTTTCCCACATCGCCCCGTCGTCACTTTCACGCATCG
GCCGATTGGGTCGCCTCATTTGGACTCCATGAGCCATCCAATGCACTTGGCCCGACCTCCCAACTTGCAACAGCAGTTGATTCCCCTTTCTGGGTCCGCCATTCAGGGCT
CGATTAAGGGTGCCCCCAATTCCTCTGATCCAAAGGTTTTTCCTCCATCAACAATCTGCGAGTCAAATGGGTGTAAAGAAATGAGAGTCAGAGACGACGCTCTTTGTGTG
GTTAGAGATAGAAAAGTCCGAATAACTGATGGGGCTTCTCTTTATGCACTTTGTCGATCATGGTTGAGGAATGGTTCTCAAGAAGAAAGCCAGCCACAATATGGAAACTT
TTTGAGGTCTCTTCCAAGACCTTTGCCCATAGCCGTGGCTGGTGCTATACCGTCGCAGAAGAAGGAAGTCATCGAGGAAGAAGTTGATGAGGAAGATAAGGATGAGGGAT
CCATTGACCACTTGTCAACGCAAGATTTGTTGAAAAGACATGTTAGACGTGCAAAGAAAGTTCGATCACGATTGAGGGAAGAACGGTTGCTACGAATCGAAAGATACAAA
ACCAGGCTCGCTCTTCTCCTTCCTCCGCCAGTCGAACAATTGAGGACTGATAATATTACTGGAACCTGA
Protein sequence
MPVDSPLIPTNTTAAAATATTTTVAPISFTLTTAAATTTAAAAAAAAIARPLANQAPSRPISSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQAILYPVASS
GRGFVPRPIRPLPADQAVTVANPGGFPHRPVVTFTHRPIGSPHLDSMSHPMHLARPPNLQQQLIPLSGSAIQGSIKGAPNSSDPKVFPPSTICESNGCKEMRVRDDALCV
VRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIAVAGAIPSQKKEVIEEEVDEEDKDEGSIDHLSTQDLLKRHVRRAKKVRSRLREERLLRIERYK
TRLALLLPPPVEQLRTDNITGT
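
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; `translate` is a hypothetical helper, not a CuGenDB function. For brevity it is shown on the first 30 and last 21 nucleotides of the CDS rather than the full sequence.

```python
import itertools

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon.

    Trailing bases that do not fill a codon are ignored. Characters other
    than A, C, G, T raise KeyError in this sketch.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 and last 21 nucleotides of the CDS shown in this record.
cds_start = "ATGCCCGTCGACTCCCCCCTCATCCCCACC"
cds_end = "GATAATATTACTGGAACCTGA"

print(translate(cds_start))  # MPVDSPLIPT
print(translate(cds_end))    # DNITGT*
```

Both fragments agree with the start (`MPVDSPLIPT`) and end (`DNITGT`) of the listed protein sequence, with the final `TGA` read as the stop codon. To check the whole record, pass the full CDS with its line breaks removed.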