; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0010571 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0010571
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionMucin-2
Genome locationchr07:24967807..24970719
RNA-Seq ExpressionPay0010571
SyntenyPay0010571
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044829.1 mucin-2 [Cucumis melo var. makuwa]5.1e-267100Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP

Query:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD
        VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD
Subjt:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

XP_004146564.1 uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus]5.3e-24894.41Show/hide
Query:  MRRRTDTDDFRPV-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPV-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVES
        PVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQFV P+  KVES
Subjt:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTL N+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        P TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        DEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

XP_008452032.1 PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo]1.2e-26399.14Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAV QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVES
        PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPS QKVES
Subjt:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTLSN+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

XP_008452033.1 PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo]4.8e-26599.35Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP

Query:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD
        VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPS QKVESD
Subjt:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTLSN+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

XP_031740284.1 uncharacterized protein LOC101220378 isoform X2 [Cucumis sativus]2.0e-23190.54Show/hide
Query:  MRRRTDTDDFRPV-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPV NNTFQTITAAADAIATVDHRFPRATAV                   QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPV-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVES
        PVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQFV P+  KVES
Subjt:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTL N+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        P TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        DEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

TrEMBL top hitse value%identityAlignment
A0A1S3BSB0 uncharacterized protein LOC103493162 isoform X15.7e-26499.14Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAV QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVES
        PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPS QKVES
Subjt:  PVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTLSN+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  DEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

A0A1S3BSY8 uncharacterized protein LOC103493162 isoform X22.3e-26599.35Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP

Query:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD
        VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPS QKVESD
Subjt:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTLSN+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

A0A5A7TUB1 Mucin-22.5e-267100Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP

Query:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD
        VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD
Subjt:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

A0A5D3CYQ2 Mucin-22.3e-26599.35Show/hide
Query:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
        MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSP

Query:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD
        VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPS QKVESD
Subjt:  VSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTLSN+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  HTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
        EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR
Subjt:  EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQR

A0A6J1C828 uncharacterized protein At1g76660-like9.6e-20379.53Show/hide
Query:  MRRRTDTD---DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPP
        MRRR D D   D  PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPS+EP ENTLQSPDIVLPFAAPP
Subjt:  MRRRTDTD---DFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPP

Query:  SSPVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKV
        SSPVS LQSEPPSA QSPTA++SFTSLTANMYSPDGPSSIFA+GPFAHE QLVSPPLNFST+TT+PST PFTPPESIHLTTPSSPEVPFAQ++ PSHQKV
Subjt:  SSPVSLLQSEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKV

Query:  ESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDF
        ESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DF   GS F NFP++VPPTL N+D+HSI +WR +QS+DSCTQ+S+ +KSSNDF
Subjt:  ESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDF

Query:  VLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTK
        VLNP TSES+ D+HA+NE  NIQIL  DGS+R +E  A NHRFSFELSD D L +SV +KPLESNEL V SSPIHEP ET KE S  G HTSN  EE+ K
Subjt:  VLNPHTSESMCDHHATNESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTK

Query:  ADGDEAHQHQ--EHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQ
        ADG+E H HQ  EHHSV LG+VKEFNFDN NG DT  P INS WW N KD  TEGTTTGAWSFFP TQQ
Subjt:  ADGDEAHQHQ--EHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFPTTQQ

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766603.8e-3148.1Show/hide
Query:  QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALISFTSLTANMYSP
        Q++RWG CL ++ CF S K  KRI  A  +PE     +S+P   H+    N   +  I L   APPSSP S   S  PS  QSP     + SL AN  SP
Subjt:  QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALISFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFVPPSHQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS
         GP SS++A GP+AHE QLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+F+  S     S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFVPPSHQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS

Query:  VISRSGASSP
          S  G  SP
Subjt:  VISRSGASSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)7.1e-4950.19Show/hide
Query:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENT----LQSPDIVLPFAAPPSSPVSLLQSEP
        NN F TI AAA AIA+ D R  +++ + +KR+W +  S+  CFGS +QRKRIG++VLVPEP   S  +  T     +S    LPF APPSSP S  QSEP
Subjt:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENT----LQSPDIVLPFAAPPSSPVSLLQSEP

Query:  PSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPP---ESIHL--TTPSSPEVPFAQFVPPSHQKVESDNQY
        PSA QSP  ++SF+ L  N        SIFAIGP+AHE QLVSPP+ FST TTEPS+ P TPP    SI+L  TTPSSPEVPFAQ    +HQ       Y
Subjt:  PSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPP---ESIHL--TTPSSPEVPFAQFVPPSHQKVESDNQY

Query:  TFP---NDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTL
         FP   + +FQ YQ  PGSP+  LISP      SG +SP PD +     S F +F +  PP L
Subjt:  TFP---NDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown2.7e-3248.1Show/hide
Query:  QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALISFTSLTANMYSP
        Q++RWG CL ++ CF S K  KRI  A  +PE     +S+P   H+    N   +  I L   APPSSP S   S  PS  QSP     + SL AN  SP
Subjt:  QKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAIQSPTALISFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFVPPSHQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS
         GP SS++A GP+AHE QLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+F+  S     S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFT-PPESIHLTTPSSPEVPFAQFVPPSHQKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS

Query:  VISRSGASSP
          S  G  SP
Subjt:  VISRSGASSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein6.0e-4836.33Show/hide
Query:  RPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSS---EPHEN-TLQSPDIVLPFAAPPSSPVSLLQ
        R VNN +  T+ AAA AI + + R  + ++VQK+R GS  S+YWCFGS K  KRIGHAVLVPEP+ S     P +N +  S  I +PF APPSSP S L 
Subjt:  RPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSS---EPHEN-TLQSPDIVLPFAAPPSSPVSLLQ

Query:  SEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESDN----
        S PPSA  +P   +   SLT N      P S F IGP+AHE Q V+PP+ FS  TTEPST PFTPP      +PSSPEVPFAQ +  S ++   ++    
Subjt:  SEPPSAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESDN----

Query:  --QYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDS--------------CTQ
          +++  + +F+S Q YPGSP  +LISP      SG SSP P           + F +  PP     +  +   W  R  + S               T 
Subjt:  --QYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDS--------------CTQ

Query:  DSIEFKSSNDFVLNPHTSESMCDHHATN---------ESQNIQIL----IDDGSKR-EEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIH
        D  +  S    V+ P+ +E++      N         +SQ  ++      D GS R  +E     HR SFEL+  DV ++ + SK        +  S  H
Subjt:  DSIEFKSSNDFVLNPHTSESMCDHHATN---------ESQNIQIL----IDDGSKR-EEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIH

Query:  EPFETTKENSPHGDHTSNVIEEKTKADGD-EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAK-DGSTEGTTTGAWSFFP
        E           G+H   +     K  G+ E+ Q Q+  S + GS KEF FD+ N  +    KI S+WW N K  G  + +   +W+FFP
Subjt:  EPFETTKENSPHGDHTSNVIEEKTKADGD-EAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAK-DGSTEGTTTGAWSFFP

AT5G52430.1 hydroxyproline-rich glycoprotein family protein7.1e-5737.42Show/hide
Query:  VNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEP---HENTLQSPDIVLPFAAPPSSPVSLLQSEPP
        VNN+ +T+ AAA AI T + R  + ++ QK RWG C S+Y CFG+ K  KRIG+AVLVPEP  S  P    +N+  S  +VLPF APPSSP S LQS+P 
Subjt:  VNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEP---HENTLQSPDIVLPFAAPPSSPVSLLQSEPP

Query:  SAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPP--ESIHLTTPSSPEVPFAQFVPPSHQKVESDN------
        S   SP   +   SLT+N +SP  P S+F +GP+A+E Q V+PP+ FS   TEPST P+TPP   S+H+TTPSSPEVPFAQ +  S +    D+      
Subjt:  SAIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPP--ESIHLTTPSSPEVPFAQFVPPSHQKVESDN------

Query:  QYTFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        +++  + +F+S Q  PGSP   +LISP SVIS SG SSP P        S  + F +  PP     +  +   W  R  + S T             L P
Subjt:  QYTFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  HTSESMCDHHATN------ESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEK
        +  E +  +   N      ++Q  ++     S    E    +HR SFEL+  DV ++ + SK   S++    +  I        E S   D   N IE++
Subjt:  HTSESMCDHHATN------ESQNIQILIDDGSKREEEPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEK

Query:  TKADGDEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFP
        +    +E H+ Q+  S ++GS KEF FD                  N KD + E     +WSFFP
Subjt:  TKADGDEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWTNAKDGSTEGTTTGAWSFFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACGACGTACGGATACTGATGATTTCAGGCCTGTTAACAATACTTTTCAAACAATTACTGCCGCCGCTGATGCGATCGCAACCGTCGATCATCGTTTCCCTAGGGC
TACTGCCGTCCAGAAAAGAAGATGGGGCAGTTGTTTGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAATTGGGCATGCTGTATTGGTACCAGAACCAA
GTCCTTCATCTGAGCCTCATGAAAATACATTACAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCTTCTTCCCCTGTTTCCTTACTTCAATCTGAACCACCTTCT
GCTATACAGTCGCCTACTGCTTTAATCTCTTTCACTTCTCTTACCGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAACC
ACAATTAGTGTCTCCACCTCTGAATTTCTCTACTCTTACTACTGAACCATCAACTCCTCCCTTCACTCCTCCCGAATCTATCCACTTGACTACACCTTCTTCCCCTGAAG
TTCCTTTTGCTCAGTTTGTTCCGCCTAGTCATCAGAAAGTTGAGTCTGATAATCAATATACATTTCCTAATGATGATTTTCAATCTTACCAATTCTATCCAGGTAGTCCG
GTTAGTCACCTCATATCACCACGGTCGGTTATTTCTCGTTCTGGGGCTTCGTCGCCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCATT
AAAAGTTCCACCTACTTTGTCGAACATTGACAAACATTCCATTCATAACTGGCGACAACGTCAAAGTACTGATTCTTGTACTCAAGATTCTATAGAATTCAAATCAAGTA
ATGACTTTGTTTTGAATCCCCATACTTCAGAATCTATGTGTGATCACCACGCAACAAATGAATCTCAAAATATCCAAATTCTCATCGATGATGGAAGCAAAAGGGAGGAA
GAGCCAGGTGCTACTAATCATAGATTCTCATTTGAGTTATCTGATGGAGATGTTTTATCACAAAGCGTAGGAAGTAAGCCATTGGAATCAAACGAACTTCCAGTTGAATC
ATCGCCAATACATGAACCATTTGAAACGACTAAAGAAAATTCTCCTCATGGTGACCATACTTCAAATGTTATAGAAGAAAAGACAAAAGCAGACGGTGATGAAGCACATC
AGCATCAAGAACATCATTCCGTTGCTCTTGGGTCTGTGAAGGAATTCAATTTTGATAATCGTAATGGAAGTGATACACATAACCCAAAAATAAATTCAGATTGGTGGACT
AATGCAAAGGATGGTAGCACAGAAGGCACAACCACCGGGGCCTGGTCATTCTTTCCAACGACGCAACAAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATTTTAATTTCATTACATTTTCCTTTCCGTCTTATTGCTTCTATCTCCCTAACGAACCTTCTCTTCACTTTCTTACTGCAAAATCCCCTGTTTCGATTTTGCCTAAGAAT
TTCATGATCTCTTCTTTGTATGTGGAATTACGTTTCTTCTTTCTATGAAACCCATCGGCGATTCTCTGTTTCCAAGTAACGATAGCGATGAGACGACGTACGGATACTGA
TGATTTCAGGCCTGTTAACAATACTTTTCAAACAATTACTGCCGCCGCTGATGCGATCGCAACCGTCGATCATCGTTTCCCTAGGGCTACTGCCGTCCAGAAAAGAAGAT
GGGGCAGTTGTTTGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAATTGGGCATGCTGTATTGGTACCAGAACCAAGTCCTTCATCTGAGCCTCATGAA
AATACATTACAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCTTCTTCCCCTGTTTCCTTACTTCAATCTGAACCACCTTCTGCTATACAGTCGCCTACTGCTTT
AATCTCTTTCACTTCTCTTACCGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAACCACAATTAGTGTCTCCACCTCTGA
ATTTCTCTACTCTTACTACTGAACCATCAACTCCTCCCTTCACTCCTCCCGAATCTATCCACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTGTTCCG
CCTAGTCATCAGAAAGTTGAGTCTGATAATCAATATACATTTCCTAATGATGATTTTCAATCTTACCAATTCTATCCAGGTAGTCCGGTTAGTCACCTCATATCACCACG
GTCGGTTATTTCTCGTTCTGGGGCTTCGTCGCCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCATTAAAAGTTCCACCTACTTTGTCGA
ACATTGACAAACATTCCATTCATAACTGGCGACAACGTCAAAGTACTGATTCTTGTACTCAAGATTCTATAGAATTCAAATCAAGTAATGACTTTGTTTTGAATCCCCAT
ACTTCAGAATCTATGTGTGATCACCACGCAACAAATGAATCTCAAAATATCCAAATTCTCATCGATGATGGAAGCAAAAGGGAGGAAGAGCCAGGTGCTACTAATCATAG
ATTCTCATTTGAGTTATCTGATGGAGATGTTTTATCACAAAGCGTAGGAAGTAAGCCATTGGAATCAAACGAACTTCCAGTTGAATCATCGCCAATACATGAACCATTTG
AAACGACTAAAGAAAATTCTCCTCATGGTGACCATACTTCAAATGTTATAGAAGAAAAGACAAAAGCAGACGGTGATGAAGCACATCAGCATCAAGAACATCATTCCGTT
GCTCTTGGGTCTGTGAAGGAATTCAATTTTGATAATCGTAATGGAAGTGATACACATAACCCAAAAATAAATTCAGATTGGTGGACTAATGCAAAGGATGGTAGCACAGA
AGGCACAACCACCGGGGCCTGGTCATTCTTTCCAACGACGCAACAAAGATGAGAAAACTGGGACAGTTGCAAATTGATAGGTAAGACAAACAGCAAGAGGAATGGTTAGT
TTTGAAGGTTTTAAAGATGTCAAATTATGAAAGAGCCTGACCAGAAGCCTTTTTTTTCTTTCAACAATATGGCCTAAAACAGACGAAGTCAGATATTATTAGATAGAACG
ATAGAGAGATTGTAGATTCAATTGGACCTTATTAACAAACACTTGTGCCTTGTGACTCGTCACTTGAATTGTAATAGATATCAATAGTCTGATAGAGATTGAAAGCACGT
AAATATGGTAATAAGAAGTTTTTTTTAATCTTCATGATTATTGATTTTGAATTAGTTAGTAGAATACACAAAGTTGAGAACGTATTGAGTTGGGCTACTCCTAGAGATGT
TTCATTCATAAGTTGCACATCACTAATGCCTACACTACATGTCCATGATGTTTTCTTTTATCATGTTCCTCATTTGAACAATGTTATGACCCATTCTTTGACATCCAAGC
TGTGGAAAGAGCAAACTTCTTTAAACTCTCTATTCTTCATACCTTGAGAGGGTTGAGTTACTTGTTTCTCTTGATGTTGG
Protein sequenceShow/hide protein sequence
MRRRTDTDDFRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSLKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPPS
AIQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPPFTPPESIHLTTPSSPEVPFAQFVPPSHQKVESDNQYTFPNDDFQSYQFYPGSP
VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLKVPPTLSNIDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNPHTSESMCDHHATNESQNIQILIDDGSKREE
EPGATNHRFSFELSDGDVLSQSVGSKPLESNELPVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQHQEHHSVALGSVKEFNFDNRNGSDTHNPKINSDWWT
NAKDGSTEGTTTGAWSFFPTTQQR