; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0006000 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0006000
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionMucin-2
Genome locationchr07:20267915..20270594
RNA-Seq ExpressionIVF0006000
SyntenyIVF0006000
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044829.1 mucin-2 [Cucumis melo var. makuwa]0.099.35Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP

Query:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD
        VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPS QKVESD
Subjt:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTLSN+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

XP_004146564.1 uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus]0.095.05Show/hide
Query:  MRRRTDTDDFRPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPVNN TFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVES
        PVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQFV P+L KVES
Subjt:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        P TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        DEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

XP_008452032.1 PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo]0.099.78Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQ-KRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQ KRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQ-KRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVES
        PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVES
Subjt:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

XP_008452033.1 PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo]0.0100Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP

Query:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD
        VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD
Subjt:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

XP_031740284.1 uncharacterized protein LOC101220378 isoform X2 [Cucumis sativus]6.69e-29691.18Show/hide
Query:  MRRRTDTDDFRPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPVNN TFQTITAAADAIATVDHRFPRATAVQ                   RKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVES
        PVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQFV P+L KVES
Subjt:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        P TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        DEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

TrEMBL top hitse value%identityAlignment
A0A1S3BSB0 uncharacterized protein LOC103493162 isoform X11.0e-26599.78Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAV QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVES
        PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVES
Subjt:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

A0A1S3BSY8 uncharacterized protein LOC103493162 isoform X24.2e-267100Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP

Query:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD
        VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD
Subjt:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

A0A5A7TUB1 Mucin-21.4e-26599.35Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP

Query:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD
        VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPS QKVESD
Subjt:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTLSN+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

A0A5D3CYQ2 Mucin-24.2e-267100Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP

Query:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD
        VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD
Subjt:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

A0A6J1C828 uncharacterized protein At1g76660-like2.8e-20279.74Show/hide
Query:  MRRRTDTD---DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPP
        MRRR D D   D  PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS+EP ENTLQSPDIVLPFAAPP
Subjt:  MRRRTDTD---DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPP

Query:  SSPVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKV
        SSPVS LQSEPPSA QSPTA++SFTSLTANMYSPDGPSSIFA+GPFAHE QLVSPPLNFST+TT+PST PFTPPESIHLTTPSSPEVPFAQ++ PS QKV
Subjt:  SSPVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKV

Query:  ESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDF
        ESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DF   GS F NFP+EVPPTL NLD+HSI +WR +QS+DSCTQ+S+ +KSSNDF
Subjt:  ESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDF

Query:  VLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTK
        VLNP TSES+ D+HA+NE  NIQIL  DGS+R +E  A NHRFSFELSD D L +SV +KPLESNEL V SSPIHEP ET KE S  G HTSN  EE+ K
Subjt:  VLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTK

Query:  ADGDEAHQHQ--EHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQ
        ADG+E H HQ  EHHSV LG+VKEFNFDN NG DT  P INS WW N KD  TEGTTTGAWSFFP TQQ
Subjt:  ADGDEAHQHQ--EHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQ

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766607.7e-3248.1Show/hide
Query:  QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALISFTSLTANMYSP
        Q++RWG CL ++ CF S K  KRI  A  +PE     +S+P   H+    N   +  I L   APPSSP S   S  PS  QSP     + SL AN  SP
Subjt:  QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALISFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS
         GP SS++A GP+AHE QLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+F+  S+    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS

Query:  VISRSGASSP
          S  G  SP
Subjt:  VISRSGASSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)7.9e-4849.81Show/hide
Query:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENT----LQSPDIVLPFAAPPSSPVSLLQSEP
        NN F TI AAA AIA+ D R  +++ + +KR+W +  S+  CFGS +QRKRIG++VLVPEP   S  +  T     +S    LPF APPSSP S  QSEP
Subjt:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENT----LQSPDIVLPFAAPPSSPVSLLQSEP

Query:  PSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPP---ESIHL--TTPSSPEVPFAQFVPPSLQKVESDNQY
        PSA QSP  ++SF+ L  N        SIFAIGP+AHE QLVSPP+ FST TTEPS+ P TPP    SI+L  TTPSSPEVPFAQ    + Q       Y
Subjt:  PSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPP---ESIHL--TTPSSPEVPFAQFVPPSLQKVESDNQY

Query:  TFP---NDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL
         FP   + +FQ YQ  PGSP+  LISP      SG +SP PD +     S F +F +  PP L
Subjt:  TFP---NDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown5.5e-3348.1Show/hide
Query:  QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALISFTSLTANMYSP
        Q++RWG CL ++ CF S K  KRI  A  +PE     +S+P   H+    N   +  I L   APPSSP S   S  PS  QSP     + SL AN  SP
Subjt:  QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALISFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS
         GP SS++A GP+AHE QLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+F+  S+    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS

Query:  VISRSGASSP
          S  G  SP
Subjt:  VISRSGASSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein7.1e-4936.53Show/hide
Query:  RPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSS---EPHEN-TLQSPDIVLPFAAPPSSPVSLLQ
        R VNN +  T+ AAA AI + + R  + ++VQK+R GS  S+YWCFGS K  KRIGHAVLVPEP+ S     P +N +  S  I +PF APPSSP S L 
Subjt:  RPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSS---EPHEN-TLQSPDIVLPFAAPPSSPVSLLQ

Query:  SEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESDN----
        S PPSA  +P   +   SLT N      P S F IGP+AHE Q V+PP+ FS  TTEPST PFTPP      +PSSPEVPFAQ +  SL++   ++    
Subjt:  SEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESDN----

Query:  --QYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDS--------------CTQ
          +++  + +F+S Q YPGSP  +LISP      SG SSP P           + F +  PP     +  +   W  R  + S               T 
Subjt:  --QYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDS--------------CTQ

Query:  DSIEFKSSNDFVLNPHTSESMCDHHATN---------ESQNIQIL----IDDGSKR-EEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIH
        D  +  S    V+ P+ +E++      N         +SQ  ++      D GS R  +E     HR SFEL+  DV ++ + SK        +  S  H
Subjt:  DSIEFKSSNDFVLNPHTSESMCDHHATN---------ESQNIQIL----IDDGSKR-EEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIH

Query:  EPFETTKENSPHGDHTSNVIEEKTKADGD-EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAK-DGSTEGTTTGAWSFFP
        E           G+H   +     K  G+ E+ Q Q+  S + GS KEF FD+ N  +    KI S+WW N K  G  + +   +W+FFP
Subjt:  EPFETTKENSPHGDHTSNVIEEKTKADGD-EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAK-DGSTEGTTTGAWSFFP

AT5G52430.1 hydroxyproline-rich glycoprotein family protein8.4e-5837.63Show/hide
Query:  VNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEP---HENTLQSPDIVLPFAAPPSSPVSLLQSEPP
        VNN+ +T+ AAA AI T + R  + ++ QK RWG C S+Y CFG+ K  KRIG+AVLVPEP  S  P    +N+  S  +VLPF APPSSP S LQS+P 
Subjt:  VNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEP---HENTLQSPDIVLPFAAPPSSPVSLLQSEPP

Query:  SAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPP--ESIHLTTPSSPEVPFAQFVPPSLQKVESDN------
        S   SP   +   SLT+N +SP  P S+F +GP+A+E Q V+PP+ FS   TEPST P+TPP   S+H+TTPSSPEVPFAQ +  SL+    D+      
Subjt:  SAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPP--ESIHLTTPSSPEVPFAQFVPPSLQKVESDN------

Query:  QYTFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        +++  + +F+S Q  PGSP   +LISP SVIS SG SSP P        S  + F +  PP     +  +   W  R  + S T             L P
Subjt:  QYTFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATN------ESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEK
        +  E +  +   N      ++Q  ++     S    E    +HR SFEL+  DV ++ + SK   S++    +  I        E S   D   N IE++
Subjt:  HTSESMCDHHATN------ESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEK

Query:  TKADGDEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFP
        +    +E H+ Q+  S ++GS KEF FD                  N KD + E     +WSFFP
Subjt:  TKADGDEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACGACGTACGGATACTGATGATTTCAGGCCTGTTAACAATACTTTTCAAACAATTACTGCCGCCGCTGATGCGATCGCAACCGTCGATCATCGTTTCCCTAGGGC
TACTGCCGTCCAGAAAAGAAGATGGGGCAGTTGTTTGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAATTGGGCATGCTGTATTGGTACCAGAACCAA
GTCCTTCATCTGAGCCTCATGAAAATACATTACAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCTTCTTCCCCTGTTTCCTTACTTCAATCTGAACCACCTTCT
GCTATACAGTCGCCTACTGCTTTAATCTCTTTCACTTCTCTTACCGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAACC
ACAATTAGTGTCTCCACCTCTGAATTTCTCTACTCTTACTACTGAACCATCAACTCCTCCCTTCACTCCTCCCGAATCTATCCACTTGACTACACCTTCTTCCCCTGAAG
TTCCTTTTGCTCAGTTTGTTCCGCCTAGTCTTCAGAAAGTTGAGTCTGATAATCAATATACATTTCCTAATGATGATTTTCAATCTTACCAATTCTATCCAGGTAGTCCG
GTTAGTCACCTCATATCACCACGGTCGGTTATTTCTCGTTCTGGGGCTTCGTCGCCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCATT
AGAAGTTCCACCTACTTTGTCGAACCTTGACAAACATTCCATTCATAACTGGCGACAACGTCAAAGTACTGATTCTTGTACTCAAGATTCTATAGAATTCAAATCAAGTA
ATGACTTTGTTTTGAATCCCCATACTTCAGAATCTATGTGTGATCACCACGCAACAAATGAATCTCAAAATATCCAAATTCTCATCGATGATGGAAGCAAAAGGGAGGAA
GAGCCAGGAGCTACTAATCATAGATTCTCATTTGAGTTATCTGATGGAGATGTTTTATCACAAAGCGTAGGAAGTAAGCCATTGGAATCAAACGAACTTCCAGTTGAATC
ATCGCCAATACATGAACCATTTGAAACGACTAAAGAAAATTCTCCTCATGGTGACCATACTTCAAATGTTATAGAAGAAAAGACAAAAGCAGACGGTGATGAAGCACATC
AGCATCAAGAACATCATTCCGTTGCTCTTGGGTCTGTGAAGGAATTCAATTTTGATAATCGTAATGGAAGTGATACACATAACCCAAAAATAAATTCAGATTGGTGGACT
AATGCAAAGGATGGTAGCACAGAAGGCACAACCACCGGGGCCTGGTCATTCTTTCCAACGACGCAACAAAGATGA
mRNA sequenceShow/hide mRNA sequence
TTTTCCTTTCCGTCTTATTGCTTCTATCTCCCTAACGAACCTTCTCTTCACTTTCTTACTGCAAAATCCCCTGTTTCGATTTTGCCTAAGAATTTCATGATCTCTTCTTT
GTATGTGGAATTACGTTTCTTCTTTCTATGAAACCCATCGGCGATTCTCTGTTTCCAAGTAACGATAGCGATGAGACGACGTACGGATACTGATGATTTCAGGCCTGTTA
ACAATACTTTTCAAACAATTACTGCCGCCGCTGATGCGATCGCAACCGTCGATCATCGTTTCCCTAGGGCTACTGCCGTCCAGAAAAGAAGATGGGGCAGTTGTTTGAGT
ATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAATTGGGCATGCTGTATTGGTACCAGAACCAAGTCCTTCATCTGAGCCTCATGAAAATACATTACAATCACC
AGATATTGTGCTTCCTTTTGCTGCACCTCCTTCTTCCCCTGTTTCCTTACTTCAATCTGAACCACCTTCTGCTATACAGTCGCCTACTGCTTTAATCTCTTTCACTTCTC
TTACCGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAACCACAATTAGTGTCTCCACCTCTGAATTTCTCTACTCTTACT
ACTGAACCATCAACTCCTCCCTTCACTCCTCCCGAATCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTGTTCCGCCTAGTCTTCAGAAAGT
TGAGTCTGATAATCAATATACATTTCCTAATGATGATTTTCAATCTTACCAATTCTATCCAGGTAGTCCGGTTAGTCACCTCATATCACCACGGTCGGTTATTTCTCGTT
CTGGGGCTTCGTCGCCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCATTAGAAGTTCCACCTACTTTGTCGAACCTTGACAAACATTCC
ATTCATAACTGGCGACAACGTCAAAGTACTGATTCTTGTACTCAAGATTCTATAGAATTCAAATCAAGTAATGACTTTGTTTTGAATCCCCATACTTCAGAATCTATGTG
TGATCACCACGCAACAAATGAATCTCAAAATATCCAAATTCTCATCGATGATGGAAGCAAAAGGGAGGAAGAGCCAGGAGCTACTAATCATAGATTCTCATTTGAGTTAT
CTGATGGAGATGTTTTATCACAAAGCGTAGGAAGTAAGCCATTGGAATCAAACGAACTTCCAGTTGAATCATCGCCAATACATGAACCATTTGAAACGACTAAAGAAAAT
TCTCCTCATGGTGACCATACTTCAAATGTTATAGAAGAAAAGACAAAAGCAGACGGTGATGAAGCACATCAGCATCAAGAACATCATTCCGTTGCTCTTGGGTCTGTGAA
GGAATTCAATTTTGATAATCGTAATGGAAGTGATACACATAACCCAAAAATAAATTCAGATTGGTGGACTAATGCAAAGGATGGTAGCACAGAAGGCACAACCACCGGGG
CCTGGTCATTCTTTCCAACGACGCAACAAAGATGAGAAAACTGGGACAGTTGCAAATTGATAGGTAAGACAAACAGCAAGAGGAATGGTTAGTTTTGAAGGTTTTAAAGA
TGTCAAATTATGAAAGAGCCTGACCAGAAGCCTTTTTTTTTTCCAACAATATGGCCTAAAACAGACGAAGTCAGATATTATTAGATAGAACGATAGAGAGATTGTAGATT
CAATTGGACCTTATTAACAAACACTTGTGCCTTGTGACTCGTCACTTGAATTGTAATAGATATCAATAGTCTGATAGAGATTGAAAGCATGTAAATATGGTAATAAGAAG
TTTTTTTTAATCTTCATGATATTGTTGATTTTGAATTAGTTAGTAGAATACCAAAGTTGAGAACG
Protein sequenceShow/hide protein sequence
MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPS
AIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSLQKVESDNQYTFPNDDFQSYQFYPGSP
VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLSNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNPHTSESMCDHHATNESQNIQILIDDGSKREE
EPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWT
NAKDGSTEGTTTGAWSFFPTTQQR