; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G11989 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G11989
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationctg1820:4642945..4645442
RNA-Seq ExpressionCucsat.G11989
SyntenyCucsat.G11989
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044829.1 mucin-2 [Cucumis melo var. makuwa]8.28e-31594.21Show/hide
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
        MRRRTDTDDFRPVNN TFQTITAAADAIATVDHRFPRATAVQ KRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS

Query:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE
        SPVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQFV P+  KVE
Subjt:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE

Query:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
        SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTL N+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
Subjt:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL

Query:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
        NP TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
Subjt:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD

Query:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
        GDEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR

XP_004146564.1 uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus]0.099.78Show/hide
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
        MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQ KRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS

Query:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES
        SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES
Subjt:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
        DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
Subjt:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR

XP_008452032.1 PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo]0.095.06Show/hide
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
        MRRRTDTDDFRPVNN TFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS

Query:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE
        SPVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQFV P+L KVE
Subjt:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE

Query:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
        SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
Subjt:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL

Query:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
        NP TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
Subjt:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD

Query:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
        GDEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR

XP_008452033.1 PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo]0.094.85Show/hide
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
        MRRRTDTDDFRPVNN TFQTITAAADAIATVDHRFPRATAVQ KRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS

Query:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE
        SPVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQFV P+L KVE
Subjt:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE

Query:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
        SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
Subjt:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL

Query:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
        NP TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
Subjt:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD

Query:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
        GDEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR

XP_031740284.1 uncharacterized protein LOC101220378 isoform X2 [Cucumis sativus]0.095.7Show/hide
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
        MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQ                    RKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS

Query:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES
        SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES
Subjt:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVES

Query:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
        DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
        PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG
Subjt:  PQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADG

Query:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
        DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
Subjt:  DEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR

TrEMBL top hitse value%identityAlignment
A0A1S3BSB0 uncharacterized protein LOC103493162 isoform X10.095.06Show/hide
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
        MRRRTDTDDFRPVNN TFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS

Query:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE
        SPVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQFV P+L KVE
Subjt:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE

Query:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
        SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
Subjt:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL

Query:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
        NP TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
Subjt:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD

Query:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
        GDEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR

A0A1S3BSY8 uncharacterized protein LOC103493162 isoform X20.094.85Show/hide
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
        MRRRTDTDDFRPVNN TFQTITAAADAIATVDHRFPRATAVQ KRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS

Query:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE
        SPVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQFV P+L KVE
Subjt:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE

Query:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
        SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
Subjt:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL

Query:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
        NP TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
Subjt:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD

Query:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
        GDEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR

A0A5A7TUB1 Mucin-24.01e-31594.21Show/hide
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
        MRRRTDTDDFRPVNN TFQTITAAADAIATVDHRFPRATAVQ KRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS

Query:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE
        SPVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQFV P+  KVE
Subjt:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE

Query:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
        SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTL N+DKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
Subjt:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL

Query:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
        NP TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
Subjt:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD

Query:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
        GDEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR

A0A5D3CYQ2 Mucin-20.094.85Show/hide
Query:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
        MRRRTDTDDFRPVNN TFQTITAAADAIATVDHRFPRATAVQ KRRWGSCLSIYWCFGS+KQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS
Subjt:  MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPS

Query:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE
        SPVSLLQSEPPSA+QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP FTPPESIHLTTPSSPEVPFAQFV P+L KVE
Subjt:  SPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTP-FTPPESIHLTTPSSPEVPFAQFVQPTLPKVE

Query:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
        SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL
Subjt:  SDNQYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVL

Query:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
        NP TSESM DHHATNESQNIQILIDDGSK+EEEPGATNHRFSFELSDGDVL QSVGSKPLESNEL VESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD
Subjt:  NPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKAD

Query:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR
        GDEAHQ QEHHSV LGSVKEFNFDN NGSDTHNP INS+WW NAKDGSTE T TG WSFFP TQQR
Subjt:  GDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR

A0A6J1C828 uncharacterized protein At1g76660-like4.36e-25479.62Show/hide
Query:  MRRRTDTD---DFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAA
        MRRR D D   D  PVNN TFQTITAAADAIATVDHRFPRATAVQ KRRWGSC SIYWCFGS+KQRKRIGHAVLVPEPSPS+EP ENTLQSPDIVLPFAA
Subjt:  MRRRTDTD---DFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAA

Query:  PPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLP
        PPSSPVS LQSEPPSA QSPTA++SFTSLTANMYSPDGPSSIFA+GPFAHE QLVSPPLNFST+TT+PST PFTPPESIHLTTPSSPEVPFAQ++QP+  
Subjt:  PPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLP

Query:  KVESDNQYT-FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSN
        KVESD+QY  FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPD DF   GS F NFP+EVPPTLLNLD+HSI +WR +QS+DSCTQ+S+ +KSSN
Subjt:  KVESDNQYT-FPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSN

Query:  DFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEK
        DFVLNPQTSES+SD+HA+NE  NIQIL D GS+++E   A NHRFSFELSD D LL+SV +KPLESNELAV SSPIHEP ET KE S  G HTSN  EE+
Subjt:  DFVLNPQTSESMSDHHATNESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEK

Query:  TKADGDEAHQRQE--HHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQ
         KADG+E H  QE  HHSVTLG+VKEFNFDNGNG DT  PNINS WW N KD  TE T TG WSFFP+TQQ
Subjt:  TKADGDEAHQRQE--HHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQ

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766602.5e-3047.62Show/hide
Query:  QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSP
        Q++RWG CL ++ CF S K  KRI  A  +PE     +S+P   H+    N   +  I L   APPSSP S   S  PS  QSP     + SL AN  SP
Subjt:  QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS
         GP SS++A GP+AHE QLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+F+  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS

Query:  VISRSGASSP
          S  G  SP
Subjt:  VISRSGASSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)1.6e-4849.43Show/hide
Query:  NNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENT----LQSPDIVLPFAAPPSSPVSLLQSEP
        NN F TI AAA AIA+ D R  +++ + +KR+W +  S+  CFGS +QRKRIG++VLVPEP   S  +  T     +S    LPF APPSSP S  QSEP
Subjt:  NNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENT----LQSPDIVLPFAAPPSSPVSLLQSEP

Query:  PSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP---ESIHL--TTPSSPEVPFAQFVQPTLPKVESDNQY
        PSA QSP  ++SF+ L  N        SIFAIGP+AHE QLVSPP+ FST TTEPS+ P TPP    SI+L  TTPSSPEVPFAQ              Y
Subjt:  PSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP---ESIHL--TTPSSPEVPFAQFVQPTLPKVESDNQY

Query:  TFP---NDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLN
         FP   + +FQ YQ  PGSP+  LISP      SG +SP PD +     S F +F +  PP LL+
Subjt:  TFP---NDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLN

AT1G76660.1 FUNCTIONS IN: molecular_function unknown1.8e-3147.62Show/hide
Query:  QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSP
        Q++RWG CL ++ CF S K  KRI  A  +PE     +S+P   H+    N   +  I L   APPSSP S   S  PS  QSP     + SL AN  SP
Subjt:  QKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPS--PSSEP---HE----NTLQSPDIVLPFAAPPSSPVSLLQSEPPSAMQSPTALISFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS
         GP SS++A GP+AHE QLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+F+  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDDFQSYQFYPGSPVSHLISPRS

Query:  VISRSGASSP
          S  G  SP
Subjt:  VISRSGASSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein1.2e-4835.77Show/hide
Query:  RPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSS---EPHEN-TLQSPDIVLPFAAPPSSPVSLL
        R VNN++  T+ AAA AI + + R  + ++VQ+KR  GS  S+YWCFGS K  KRIGHAVLVPEP+ S     P +N +  S  I +PF APPSSP S L
Subjt:  RPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSS---EPHEN-TLQSPDIVLPFAAPPSSPVSLL

Query:  QSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESDN---
         S PPSA  +P   +   SLT N      P S F IGP+AHE Q V+PP+ FS  TTEPST PFTPP      +PSSPEVPFAQ +  +L +   ++   
Subjt:  QSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESDN---

Query:  ---QYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDS--------------CT
           +++  + +F+S Q YPGSP  +LISP      SG SSP P           + F +  PP  L  +  +   W  R  + S               T
Subjt:  ---QYTFPNDDFQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDS--------------CT

Query:  QDSIEFKSSNDFVLNPQTSESMSDHHATNESQNIQILIDD--------------GSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPI
         D  +  S    V+ P  +E++      N +     L+D                S+  +E     HR SFEL+  DV  + + SK        +  S  
Subjt:  QDSIEFKSSNDFVLNPQTSESMSDHHATNESQNIQILIDD--------------GSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPI

Query:  HEPFETTKENSPHGDHTSNVIEEKTKADGD-EAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAK-DGSTESTATGTWSFFPM
        HE           G+H   +     K  G+ E+ Q Q+  S + GS KEF FD+ N  +     I SEWW N K  G  + +   +W+FFP+
Subjt:  HEPFETTKENSPHGDHTSNVIEEKTKADGD-EAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAK-DGSTESTATGTWSFFPM

AT5G52430.1 hydroxyproline-rich glycoprotein family protein3.8e-5837.95Show/hide
Query:  RPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEP---HENTLQSPDIVLPFAAPPSSPVSLLQ
        R V NN+ +T+ AAA AI T + R   +++  QK RWG C S+Y CFG+ K  KRIG+AVLVPEP  S  P    +N+  S  +VLPF APPSSP S LQ
Subjt:  RPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEP---HENTLQSPDIVLPFAAPPSSPVSLLQ

Query:  SEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP--ESIHLTTPSSPEVPFAQFVQPTLPKVESDN--
        S+P S   SP   +   SLT+N +SP  P S+F +GP+A+E Q V+PP+ FS   TEPST P+TPP   S+H+TTPSSPEVPFAQ +  +L     D+  
Subjt:  SEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPST-PFTPP--ESIHLTTPSSPEVPFAQFVQPTLPKVESDN--

Query:  ----QYTFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDF
            +++  + +F+S Q  PGSP   +LISP SVIS SG SSP P        S  + F +  PP  L  +  +   W  R  + S T            
Subjt:  ----QYTFPNDDFQSYQFYPGSP-VSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDF

Query:  VLNPQTSESMSDHHATN------ESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNV
         L P   E +S +   N      ++Q  ++     S    E    +HR SFEL+  DV  + + SK   S++    +  I        E S   D   N 
Subjt:  VLNPQTSESMSDHHATN------ESQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNV

Query:  IEEKTKADGDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFP
        IE+++    +E H+ Q+  S ++GS KEF FD                  N KD + E  A  +WSFFP
Subjt:  IEEKTKADGDEAHQRQEHHSVTLGSVKEFNFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACGACGTACGGATACTGATGATTTCAGGCCTGTTAACAATAATACTTTTCAAACAATTACTGCCGCCGCTGATGCGATCGCAACCGTCGATCATCGTTTC
CCTCGGGCTACTGCCGTCCAGCAGAAAAGAAGATGGGGCAGTTGTTTGAGTATTTATTGGTGCTTTGGATCTATCAAACAGAGGAAAAGAATCGGGCATGCTGTA
TTGGTACCAGAACCAAGTCCTTCATCTGAGCCTCATGAAAATACATTACAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCTTCTTCCCCTGTTTCCTTA
CTTCAATCTGAACCACCTTCTGCTATGCAGTCGCCTACTGCTTTAATCTCTTTCACTTCTCTTACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTT
GCCATTGGCCCATTTGCTCATGAACCACAATTAGTGTCTCCACCTCTGAATTTCTCTACTCTTACTACTGAACCATCAACTCCCTTCACTCCTCCCGAATCTATC
CACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTGTTCAACCTACTCTTCCGAAAGTTGAGTCTGATAATCAATATACATTTCCTAATGATGAT
TTCCAATCTTACCAATTCTATCCAGGTAGTCCGGTTAGTCACCTCATATCACCACGGTCGGTTATTTCTCGTTCTGGGGCTTCATCGCCTTTGCCTGACTATGAT
TTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCACTAGAAGTTCCACCTACTTTGTTGAACCTTGACAAACATTCCATTCATAACTGGCGACAACGTCAAAGT
ACTGATTCTTGTACTCAAGATTCTATAGAATTCAAATCAAGTAATGACTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCTGATCACCACGCAACAAATGAA
TCTCAAAATATTCAAATTCTCATCGATGATGGAAGTAAAAAGGAGGAGGAGCCAGGTGCTACTAATCATAGATTCTCATTTGAGCTATCTGATGGGGATGTTTTA
TTACAAAGCGTAGGAAGTAAGCCATTGGAATCAAATGAACTTGCGGTTGAATCATCGCCAATACATGAACCATTTGAAACGACTAAAGAAAATTCTCCTCATGGT
GACCATACTTCAAATGTTATAGAAGAAAAGACAAAAGCTGACGGTGATGAAGCACATCAACGTCAAGAACATCATTCCGTTACACTTGGGTCTGTGAAGGAATTC
AATTTTGATAATGGTAATGGAAGTGACACACATAACCCAAATATAAATTCAGAATGGTGGATTAATGCAAAGGATGGTAGCACAGAAAGCACAGCCACCGGGACC
TGGTCATTCTTTCCAATGACGCAACAAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGACGACGTACGGATACTGATGATTTCAGGCCTGTTAACAATAATACTTTTCAAACAATTACTGCCGCCGCTGATGCGATCGCAACCGTCGATCATCGTTTC
CCTCGGGCTACTGCCGTCCAGCAGAAAAGAAGATGGGGCAGTTGTTTGAGTATTTATTGGTGCTTTGGATCTATCAAACAGAGGAAAAGAATCGGGCATGCTGTA
TTGGTACCAGAACCAAGTCCTTCATCTGAGCCTCATGAAAATACATTACAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCTTCTTCCCCTGTTTCCTTA
CTTCAATCTGAACCACCTTCTGCTATGCAGTCGCCTACTGCTTTAATCTCTTTCACTTCTCTTACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTT
GCCATTGGCCCATTTGCTCATGAACCACAATTAGTGTCTCCACCTCTGAATTTCTCTACTCTTACTACTGAACCATCAACTCCCTTCACTCCTCCCGAATCTATC
CACTTGACTACACCTTCTTCCCCTGAAGTTCCTTTTGCTCAGTTTGTTCAACCTACTCTTCCGAAAGTTGAGTCTGATAATCAATATACATTTCCTAATGATGAT
TTCCAATCTTACCAATTCTATCCAGGTAGTCCGGTTAGTCACCTCATATCACCACGGTCGGTTATTTCTCGTTCTGGGGCTTCATCGCCTTTGCCTGACTATGAT
TTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCACTAGAAGTTCCACCTACTTTGTTGAACCTTGACAAACATTCCATTCATAACTGGCGACAACGTCAAAGT
ACTGATTCTTGTACTCAAGATTCTATAGAATTCAAATCAAGTAATGACTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCTGATCACCACGCAACAAATGAA
TCTCAAAATATTCAAATTCTCATCGATGATGGAAGTAAAAAGGAGGAGGAGCCAGGTGCTACTAATCATAGATTCTCATTTGAGCTATCTGATGGGGATGTTTTA
TTACAAAGCGTAGGAAGTAAGCCATTGGAATCAAATGAACTTGCGGTTGAATCATCGCCAATACATGAACCATTTGAAACGACTAAAGAAAATTCTCCTCATGGT
GACCATACTTCAAATGTTATAGAAGAAAAGACAAAAGCTGACGGTGATGAAGCACATCAACGTCAAGAACATCATTCCGTTACACTTGGGTCTGTGAAGGAATTC
AATTTTGATAATGGTAATGGAAGTGACACACATAACCCAAATATAAATTCAGAATGGTGGATTAATGCAAAGGATGGTAGCACAGAAAGCACAGCCACCGGGACC
TGGTCATTCTTTCCAATGACGCAACAAAGATGA
Protein sequenceShow/hide protein sequence
MRRRTDTDDFRPVNNNTFQTITAAADAIATVDHRFPRATAVQQKRRWGSCLSIYWCFGSIKQRKRIGHAVLVPEPSPSSEPHENTLQSPDIVLPFAAPPSSPVSL
LQSEPPSAMQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHEPQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFVQPTLPKVESDNQYTFPNDD
FQSYQFYPGSPVSHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKHSIHNWRQRQSTDSCTQDSIEFKSSNDFVLNPQTSESMSDHHATNE
SQNIQILIDDGSKKEEEPGATNHRFSFELSDGDVLLQSVGSKPLESNELAVESSPIHEPFETTKENSPHGDHTSNVIEEKTKADGDEAHQRQEHHSVTLGSVKEF
NFDNGNGSDTHNPNINSEWWINAKDGSTESTATGTWSFFPMTQQR