CuGenDBv2

CSPI04G05770 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G05770
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Hydroxyproline-rich glycoprotein family protein
Genome location: Chr4:3916227..3918923
RNA-Seq Expression: CSPI04G05770
Synteny: CSPI04G05770
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like
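The genome location above follows a chromosome:start..end pattern. As a minimal sketch, the snippet below splits that string and computes the length of the gene span. The function name `parse_location` is hypothetical, and the span formula assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr4:3916227..3918923")
# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Chr4 3916227 3918923 2697
```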


Homology
GenBank top hits | e value | % identity
KAA0044829.1 mucin-2 [Cucumis melo var. makuwa] | 1.1e-248 | 94.19
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES
        PVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQFV P+  KVES
Subjt:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTL N+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        P TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR
        DEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP I S+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR

XP_004146564.1 uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus] | 7.4e-266 | 99.78
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESD
        PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESD
Subjt:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  QTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        QTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  QTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR
        EAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNI SEWWINAKDGSTESTATGTWSFFPMTQQR
Subjt:  EAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR

XP_008452032.1 PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo] | 8.3e-249 | 94.64
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
        MRRRTDTDDFRPV NNTFQTITAAADAIATVDHRFPRATAV QKRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS

Query:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVE
        SPVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQFV P+L KVE
Subjt:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVE

Query:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
        SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
Subjt:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL

Query:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
        NP TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
Subjt:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD

Query:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR
        GDEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP I S+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR

XP_008452033.1 PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo] | 3.4e-250 | 94.84
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES
        PVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQFV P+L KVES
Subjt:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        P TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR
        DEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP I S+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR

XP_031740284.1 uncharacterized protein LOC101220378 isoform X2 [Cucumis sativus] | 4.8e-249 | 95.69
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAV                   QRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESD
        PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESD
Subjt:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESD

Query:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
        NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  NQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  QTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
        QTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD
Subjt:  QTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGD

Query:  EAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR
        EAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNI SEWWINAKDGSTESTATGTWSFFPMTQQR
Subjt:  EAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR

TrEMBL top hits | e value | % identity
A0A1S3BSB0 uncharacterized protein LOC103493162 isoform X1 | 4.0e-249 | 94.64
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
        MRRRTDTDDFRPV NNTFQTITAAADAIATVDHRFPRATAV QKRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS

Query:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVE
        SPVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQFV P+L KVE
Subjt:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVE

Query:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
        SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
Subjt:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL

Query:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
        NP TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
Subjt:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD

Query:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR
        GDEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP I S+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR

A0A1S3BSY8 uncharacterized protein LOC103493162 isoform X2 | 1.6e-250 | 94.84
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES
        PVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQFV P+L KVES
Subjt:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        P TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR
        DEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP I S+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR

A0A5A7TUB1 Mucin-2 | 5.2e-249 | 94.19
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES
        PVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQFV P+  KVES
Subjt:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTL N+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        P TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR
        DEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP I S+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR

A0A5D3CYQ2 Mucin-2 | 1.6e-250 | 94.84
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
        MRRRTDTDDFRPV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSS

Query:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES
        PVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQFV P+L KVES
Subjt:  PVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        P TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR
        DEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP I S+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQR

A0A6J1C828 uncharacterized protein At1g76660-like | 2.2e-202 | 79.57
Query:  MRRRTDTD---DFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAP
        MRRR D D   D  PV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGS+KQRKRIGHAVLVPEPSPS+EP ENTLQSPDIVLPFAAP
Subjt:  MRRRTDTD---DFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAP

Query:  PSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPK
        PSSPVS LQSEPPSA QSPTA++SFTSLTANMYSPDGPSSIFA+GPFAHE QLVSPPLNFST+TT+PST PFTPPESIHLTTPSSPEVPFAQ++QP+  K
Subjt:  PSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPK

Query:  VESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSND
        VESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DF   GS F NFP+EVPPTLLNLD+HSI +WR +QS+DSCTQ+S+ +KSSND
Subjt:  VESDNQY-TFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSND

Query:  FVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKT
        FVLNPQTSES+SD+HA+NE  NIQIL  DGS++ +E  A NHRFSFELSD D LL+SV +KPLESNELAV SSPIHEP ET KE S  G HTSN  EE+ 
Subjt:  FVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKT

Query:  KADGDE--AHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQ
        KADG+E   HQ  EHHSVTLG+VKEFNFDNGNG DT  PNI S WW N KD  TE T TG WSFFP+TQQ
Subjt:  KADGDE--AHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFPMTQQ

SwissProt top hits | e value | % identity
Q9SRE5 Uncharacterized protein At1g76660 | 1.9e-30 | 47.62
Query:  QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSP
        Q++RWG CL ++ CF S K  KRI  A  +PE     +S+P   H+    N   +  I L   APPSSP S   S  PS  QSP     + SL AN  SP
Subjt:  QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS
         GP SS++A GP+AHE QLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+F+  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS

Query:  VISRSGASSP
          S  G  SP
Subjt:  VISRSGASSP

Arabidopsis top hits | e value | % identity
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) | 3.9e-47 | 49.43
Query:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENT----LQSPDIVLPFAAPPSSPVSLLQSEP
        NN F TI AAA AIA+ D R  +++ + +KR+W +  S+  CFGS +QRKRIG++VLVPEP   S  +  T     +S    LPF APPSSP S  QSEP
Subjt:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENT----LQSPDIVLPFAAPPSSPVSLLQSEP

Query:  PSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP---ESIHL--TTPSSPEVPFAQFVQPTLPKVESDNQY
        PSA QSP  ++SF+ L  N        SIFAIGP+AHE QLVSPP+ FST TTEPS+ P TPP    SI+L  TTPSSPEVPFAQ              Y
Subjt:  PSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP---ESIHL--TTPSSPEVPFAQFVQPTLPKVESDNQY

Query:  TFP---NDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLN
         FP   + +FQ YQ  PGSP+  LISP      SG +SP PD +     S F +F +  PP LL+
Subjt:  TFP---NDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLN

AT1G76660.1 FUNCTIONS IN: molecular_function unknown | 1.4e-31 | 47.62
Query:  QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSP
        Q++RWG CL ++ CF S K  KRI  A  +PE     +S+P   H+    N   +  I L   APPSSP S   S  PS  QSP     + SL AN  SP
Subjt:  QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS
         GP SS++A GP+AHE QLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+F+  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS

Query:  VISRSGASSP
          S  G  SP
Subjt:  VISRSGASSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein | 1.4e-49 | 35.85
Query:  RPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSS---EPHEN-TLQSPDIVLPFAAPPSSPVSLLQ
        R VNN++  T+ AAA AI + + R  + ++VQK+R GS  S+YWCFGS K  KRIGHAVLVPEP+ S     P +N +  S  I +PF APPSSP S L 
Subjt:  RPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSS---EPHEN-TLQSPDIVLPFAAPPSSPVSLLQ

Query:  SEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESDN----
        S PPSA  +P   +   SLT N      P S F IGP+AHE Q V+PP+ FS  TTEPST PFTPP      +PSSPEVPFAQ +  +L +   ++    
Subjt:  SEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESDN----

Query:  --QYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDS--------------CTQ
          +++  + +F+S Q YPGSP  +LISP      SG SSP P           + F +  PP  L  +  +   W  R  + S               T 
Subjt:  --QYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDS--------------CTQ

Query:  DSIEFKSSNDFVLNPQTSESMSDHHATNESQNIQILIDD--------------GSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIH
        D  +  S    V+ P  +E++      N +     L+D                S+  +E     HR SFEL+  DV  + + SK        +  S  H
Subjt:  DSIEFKSSNDFVLNPQTSESMSDHHATNESQNIQILIDD--------------GSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIH

Query:  EPFETTKENSPHGDHTSNVIEEKTKADGD-EAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAK-DGSTESTATGTWSFFPM
        E           G+H   +     K  G+ E+ Q Q+  S + GS KEF FD+ N  +     I+SEWW N K  G  + +   +W+FFP+
Subjt:  EPFETTKENSPHGDHTSNVIEEKTKADGD-EAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAK-DGSTESTATGTWSFFPM

AT5G52430.1 hydroxyproline-rich glycoprotein family protein | 1.7e-58 | 38.03
Query:  RPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEP---HENTLQSPDIVLPFAAPPSSPVSLLQS
        R V NN+ +T+ AAA AI T + R  + ++ QK RWG C S+Y CFG+ K  KRIG+AVLVPEP  S  P    +N+  S  +VLPF APPSSP S LQS
Subjt:  RPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEP---HENTLQSPDIVLPFAAPPSSPVSLLQS

Query:  EPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP--ESIHLTTPSSPEVPFAQFVQPTLPKVESDN---
        +P S   SP   +   SLT+N +SP  P S+F +GP+A+E Q V+PP+ FS   TEPST P+TPP   S+H+TTPSSPEVPFAQ +  +L     D+   
Subjt:  EPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP--ESIHLTTPSSPEVPFAQFVQPTLPKVESDN---

Query:  ---QYTFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFV
           +++  + +F+S Q  PGSP   +LISP SVIS SG SSP P        S  + F +  PP  L  +  +   W  R  + S T             
Subjt:  ---QYTFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFV

Query:  LNPQTSESMSDHHATN------ESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVI
        L P   E +S +   N      ++Q  ++     S    E    +HR SFEL+  DV  + + SK   S++    +  I        E S   D   N I
Subjt:  LNPQTSESMSDHHATN------ESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVI

Query:  EEKTKADGDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFP
        E+++    +E H+ Q+  S ++GS KEF FD                  N KD + E  A  +WSFFP
Subjt:  EEKTKADGDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWINAKDGSTESTATGTWSFFP
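
In the alignments above, the unlabeled middle line marks each column: a letter is an identical residue, `+` is a conservative substitution, and a blank is a mismatch or gap. As a rough sketch, the snippet below counts these for a single alignment block. The function name `block_identity` is hypothetical, and the inputs are assumed to have the 8-character `Query:  ` prefix already removed. The % identity values reported for each hit cover the whole alignment, so a single block will not generally reproduce them.

```python
def block_identity(query, midline):
    """Return (identities, positives, length) for one alignment block."""
    # Pad the midline: trailing spaces are often stripped from copied text.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives include identical residues plus '+' (conservative) columns.
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Last block of the Q9SRE5 (SwissProt) alignment, prefix removed.
query = "VISRSGASSP"
midline = "  S  G  SP"
ident, pos, length = block_identity(query, midline)
print(f"{ident}/{length} identical ({100 * ident / length:.1f}%)")
```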


Sequences
CDS sequence
ATGAGACGACGTACGGATACTGATGATTTCAGGCCTGTTAACAATAATACTTTTCAAACAATTACTGCCGCCGCTGATGCGATCGCAACCGTCGATCATCGTTTCCCTCG
GGCTACTGCCGTCCAGAAAAGAAGATGGGGCAGTTGTTTGAGTATTTATTGGTGCTTTGGATCTATCAAACAGAGGAAAAGAATCGGGCATGCTGTATTGGTACCAGAAC
CAAGTCCTTCATCTGAGCCTCATGAAAATACATTACAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCTTCTTCCCCTGTTTCCTTACTTCAATCTGAACCACCT
TCTGCTATGCAGTCGCCTACTGCTTTAATCTCTTTCACTTCTCTTACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGA
ACCACAATTAGTGTCTCCACCTCTGAATTTCTCTACTCTTACTACTGAACCATCAACTCCCTTCACTCCTCCCGAATCTATCCACTTGACTACACCTTCTTCCCCTGAAG
TTCCTTTTGCTCAGTTTGTTCAGCCTACTCTTCCGAAAGTTGAGTCTGATAATCAATATACATTTCCTAATGATGATTTCCAATCTTACCAATTCTATCCAGGTAGTCCG
GTTAGTCACCTCATATCACCACGGTCGGTTATTTCTCGTTCTGGGGCTTCATCGCCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCACT
AGAAGTTCCACCTACTTTGTTGAACCTTGACAAACATTCCATTCATAACTGGCGACAACGTCAAAGTACTGATTCTTGTACTCAAGATTCTATAGAATTCAAATCAAGTA
ATGACTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCTGATCACCACGCAACAAATGAATCTCAAAATATTCAAATTCTCATCGATGATGGAAGTAAAAAGGAGGAG
GAGCCAGGTGCTACTAATCATAGATTCTCATTTGAGCTATCTGATGGGGATGTTTTATTACAAAGCGTAGGAAGTAAGCCATTGGAATCAAATGAACTTGCGGTTGAATC
ATCGCCAATACATGAACCATTTGAAACGACTAAAGAAAATTCTCCTCATGGTGACCATACTTCAAATGTTATAGAAGAAAAGACAAAAGCAGACGGTGATGAAGCACATC
AGCGTCAAGAACATCATTCCGTTACACTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGTAATGGAAGTGACACACATAACCCAAATATAAAATCAGAATGGTGGATT
AATGCAAAGGATGGTAGCACAGAAAGCACAGCCACCGGGACCTGGTCATTCTTTCCAATGACGCAACAAAGATGA
mRNA sequence
GAAAATTCGAATTAATTGAATTACATAGTACTAATGATATATGTTTTTAAAATGATTTATTTAATTAGAATTGGTTTCCGTCTTGTTGCTTCTATCTCGCTAACGAACCT
TCTCTTCACTTTCATACTGCAAAATCCCCTGTTTCGAATTTGCCTAAGAATTTCATGATCTCTTCTTTGTATGTTGAATTACGATTCTTCTTTCTATGAAACCCATCGGC
GATTCTCTCTTTCCGAGTAACGATAGCGATGAGACGACGTACGGATACTGATGATTTCAGGCCTGTTAACAATAATACTTTTCAAACAATTACTGCCGCCGCTGATGCGA
TCGCAACCGTCGATCATCGTTTCCCTCGGGCTACTGCCGTCCAGAAAAGAAGATGGGGCAGTTGTTTGAGTATTTATTGGTGCTTTGGATCTATCAAACAGAGGAAAAGA
ATCGGGCATGCTGTATTGGTACCAGAACCAAGTCCTTCATCTGAGCCTCATGAAAATACATTACAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCTTCTTCCCC
TGTTTCCTTACTTCAATCTGAACCACCTTCTGCTATGCAGTCGCCTACTGCTTTAATCTCTTTCACTTCTCTTACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCA
TTTTTGCCATTGGCCCATTTGCTCATGAACCACAATTAGTGTCTCCACCTCTGAATTTCTCTACTCTTACTACTGAACCATCAACTCCCTTCACTCCTCCCGAATCTATC
CACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTGTTCAGCCTACTCTTCCGAAAGTTGAGTCTGATAATCAATATACATTTCCTAATGATGATTTCCA
ATCTTACCAATTCTATCCAGGTAGTCCGGTTAGTCACCTCATATCACCACGGTCGGTTATTTCTCGTTCTGGGGCTTCATCGCCTTTGCCTGACTATGATTTTGCTTCCT
TTGGTTCTCAATTTTTGAATTTCCCACTAGAAGTTCCACCTACTTTGTTGAACCTTGACAAACATTCCATTCATAACTGGCGACAACGTCAAAGTACTGATTCTTGTACT
CAAGATTCTATAGAATTCAAATCAAGTAATGACTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCTGATCACCACGCAACAAATGAATCTCAAAATATTCAAATTCT
CATCGATGATGGAAGTAAAAAGGAGGAGGAGCCAGGTGCTACTAATCATAGATTCTCATTTGAGCTATCTGATGGGGATGTTTTATTACAAAGCGTAGGAAGTAAGCCAT
TGGAATCAAATGAACTTGCGGTTGAATCATCGCCAATACATGAACCATTTGAAACGACTAAAGAAAATTCTCCTCATGGTGACCATACTTCAAATGTTATAGAAGAAAAG
ACAAAAGCAGACGGTGATGAAGCACATCAGCGTCAAGAACATCATTCCGTTACACTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGTAATGGAAGTGACACACATAA
CCCAAATATAAAATCAGAATGGTGGATTAATGCAAAGGATGGTAGCACAGAAAGCACAGCCACCGGGACCTGGTCATTCTTTCCAATGACGCAACAAAGATGAGCAAACT
GGGGCAGTTGCAAATCGATAGGTAAGACGAACAGCAAGAGGAATTGTTAGTTTTGAAGGTTTTAAAAAACATGTCAAATTATGAAAGAGCCTGACCAGAAGCCTTTTTTT
CCAACAATATGACCTAAAACAAACAACGACAGATATTATTAGATAGAACGATAGAGAAATTGTAGATTCAATAGGACCTTATTAACAAACACTTGTGGCTTGTGACTCGT
CACTTGGATTGTAATAGATATCAAAGTCTTGATAGAAATTGAAAGCATGTAAATATGGTAATAAGAAGC
Protein sequence
MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSLLQSEPP
SAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSP
VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEE
EPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNIKSEWWI
NAKDGSTESTATGTWSFFPMTQQR
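
The protein sequence above should correspond to a translation of the CDS sequence. As a minimal sketch, the snippet below translates a CDS with the standard genetic code, an assumption that is reasonable for a nuclear plant gene but is not stated in this record. The names `translate` and `CODON_TABLE` are illustrative, and only the first 30 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAGACGACGTACGGATACTGATGATTTC"  # first 30 bases of the CDS
print(translate(cds_start))  # MRRRTDTDDF
```

Joining the wrapped CDS lines into one string and passing it to `translate` would allow a check of the full protein sequence against the listing.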