CuGenDBv2

HG10002324 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10002324
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: dentin sialophosphoprotein isoform X1
Genome location: Chr11:5566195..5580084
RNA-Seq Expression: HG10002324
Synteny: HG10002324
Gene Ontology terms: NA
InterPro domains: IPR010844 - Occludin homology domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146856.1 dentin sialophosphoprotein isoform X1 [Cucumis sativus] | e value: 0.0e+00 | % identity: 89.25
Query:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR
        MYGG SKLGRPG GAGRG  GKRPHSSFPLPPSHRPSGRLSLGGG AGSV+NPRNRTTTATTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKR
Subjt:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR

Query:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI
        VEAQGGTPRIKFDANA NSSGNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTTNHVKKLSEEAER+SKSRRAI
Subjt:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI

Query:  VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI
        VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ                   DRLSSSPIP PPEQ GAPVSQFGSANT+KTHVIAEDI
Subjt:  VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI

Query:  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLL
        RPR+PAKIN AAS+EKEIPT A KGVLETPGQEGNSG KPTDLQGMLYNLL ENPKGMSLKALEKAVGDKIPNAVKKIEPIIKK++        ++ +  
Subjt:  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLL

Query:  GVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNES
        GV LEGSKKPTSEGESSPLISHHQT VHEDLPDQ  APELQLEAR G++LEEKVETSQANKESNFLE NG+QQ  PD FAEKK SENSEGQAASSSDNES
Subjt:  GVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNES

Query:  DSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG
        DSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD PSNS+EGSD DVDIMTSDDDKESK KLQA VQGFSTSPAAWKSPDGG  QIIDDEKEDG
Subjt:  DSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG

Query:  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKS
        Q  DAIDIEKDSSDDEPDAKID  SLL  EEG RPVEEPRSFSPYPDEFQERQNFIGSLFEDREN ++DSARHEQSDSTGRISKGKSKRSSDLECLEEKS
Subjt:  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKS

Query:  DHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAAR
        DHTKRLKSESLAQQPVSGNWGVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDVSQAGWRPHDQS  GVRAVDTA R
Subjt:  DHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAAR

Query:  ADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA
        ADKHGDIGRGTKHTEK GHANENFHVFKDTFYGN +NEGTKEKKVSKNSRSGGPGDKQIQP DSHHSKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Subjt:  ADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA

Query:  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSE
        NRSPVNGKGR LQRE SDLELGELREPF EEARGKKKFERNNSLKQLENKENTTDIW SDLNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSE
Subjt:  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSE

Query:  HIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAERE
        HIVEDS R+N+RSL SH QYNSR+DH EVDKS D NVKPNQG GPEG  ESNRKASVGISQLND KREQ PSKKGSKR APNPITEVTD LKNPVSAERE
Subjt:  HIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAERE

Query:  NSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST
        NSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Subjt:  NSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST

XP_008447590.1 PREDICTED: dentin sialophosphoprotein isoform X1 [Cucumis melo] | e value: 0.0e+00 | % identity: 89.17
Query:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR
        MYGG SKLGRPG GAGRG GGKRPHSSFPLPPSHRPSGRLSLGGG AGS +NPRNRTTTATTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKR
Subjt:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR

Query:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI
        VEAQGGTPRIKFDA A NSSGNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTTNHVKKLSEEAER+SKSRRAI
Subjt:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI

Query:  VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI
        VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ                   DRLSSSPIP PPEQ G PVSQFGSANT KTHVIAEDI
Subjt:  VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI

Query:  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLL
        RPR+PAKIN AAS+EKEI T A KGVLETPGQEGNSGAKPTDLQGMLYNLL ENPKGMSLKALEKAVGDKIPNAVKKIEPIIKK++         + +  
Subjt:  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLL

Query:  GVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNES
        GVELEGSKKPTSEGESSPL+SHHQT VHEDLPDQI APELQLEA  GI+LEEKVETSQANKESNFLEKNG+QQ  PD FAEKKGSENSEGQAASSSDN S
Subjt:  GVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNES

Query:  DSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG
        DSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD PSNS+EGSDEDVDIMTSDDDKESKHKLQA VQGFSTSPAAWKSPDGG  QIIDDEKEDG
Subjt:  DSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG

Query:  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKS
        Q  DAIDIEKDSSDDEPDAK+D  SLL  EE GRPVEEPRSFSPYPDEFQERQNFIGSLFEDREN + DSARHEQSDSTGRISKGKSKRSSDLECLEEK+
Subjt:  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKS

Query:  DHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAAR
        DHTKRLKSESLAQQPVSGNWGVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDV QAGWRPHDQS GGVRAVDTA R
Subjt:  DHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAAR

Query:  ADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA
        ADKHGDIGRGTKH EK GHANENFHVFKDTFYGNA+NEGTKEKKVSKNSRSGGPGDK IQPFDSH SKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Subjt:  ADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA

Query:  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSE
        NRSPVNGKGR LQRE SDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIW SDLNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSE
Subjt:  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSE

Query:  HIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAERE
        H+VEDS RLN+RSL SH QYNSR+DH EVDKSVD NV+PNQG GPEG  ESNRKASVGISQLND KREQLPSKKGSKR APNPITEVTD LKNP+SAE E
Subjt:  HIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAERE

Query:  NSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST
        NSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Subjt:  NSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST

XP_023524752.1 dentin sialophosphoprotein-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 83.98
Query:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR
        MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+ A  SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKR
Subjt:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR

Query:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI
        VE+QGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTTNHVKKLSEEAERRSKSRRAI
Subjt:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI

Query:  VLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED
        VLEPGNPSMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQKNELSQ                   +RLSSSP+PSPPEQSGAP+SQFGSAN TKTH  AED
Subjt:  VLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED

Query:  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNL
        I+PR PAKIN+AASSEK+IPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMSLKALEKAVGDKIPN+VKKIEPIIKK++         + + 
Subjt:  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNL

Query:  LGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE
          VE+EGSKKP+SEGESSPL+SH QTPVHED  DQ   PE QLEAR  IELEEKVETSQANKESNFLEKNG+QQ+SPD FAEKKGSENSEGQAASSSDNE
Subjt:  LGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE

Query:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED
        SDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKE K+KLQA VQGFS SPAAWKSPDGGA   IDDEKED
Subjt:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED

Query:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK
        G  SDAIDIEKDSSDDEP+AKIDD SL    EGGRPVEE RS SPYPDEFQERQNFIGSLFEDRENT++DSARHEQSDST R+SKGKSKRSS+LEC EE 
Subjt:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK

Query:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA
        + HTKRLK ES +QQPVSGNWG QLQS  NLSPSKLNRDS RNPTSQVTNKGE+KGNSDFRPK GNKE V EKN SDVSQA WRPHDQS  GVRAVDTA 
Subjt:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA

Query:  RADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV
        R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENEG  EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ FSSSQMGYSPRDNNNR+
Subjt:  RADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV

Query:  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKN
        SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWSS+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK 
Subjt:  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKN

Query:  SEHIVEDSTRLNHRSLQSH---PQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPV
        SEH VED TR+NHR  QSH   PQY+SRVDHVEV+K VDANVKPNQGIGPE CGESNRKASVGISQL+DMKREQLPSKKGSKR APN ITEVTDALKNP+
Subjt:  SEHIVEDSTRLNHRSLQSH---PQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPV

Query:  SAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY
        SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDS+RGQDS++YFN+LGQLKESY
Subjt:  SAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY

Query:  RLCSTPKASSPTNLGKH
        RLCST      +NL +H
Subjt:  RLCSTPKASSPTNLGKH

XP_023524753.1 dentin sialophosphoprotein-like isoform X2 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 84.56
Query:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR
        MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+ A  SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKR
Subjt:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR

Query:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI
        VE+QGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTTNHVKKLSEEAERRSKSRRAI
Subjt:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI

Query:  VLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED
        VLEPGNPSMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQKNELSQ                   +RLSSSP+PSPPEQSGAP+SQFGSAN TKTH  AED
Subjt:  VLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED

Query:  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNL
        I+PR PAKIN+AASSEK+IPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMSLKALEKAVGDKIPN+VKKIEPIIKK++         + + 
Subjt:  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNL

Query:  LGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE
          VE+EGSKKP+SEGESSPL+SH QTPVHED  DQ   PE QLEAR  IELEEKVETSQANKESNFLEKNG+QQ+SPD FAEKKGSENSEGQAASSSDNE
Subjt:  LGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE

Query:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED
        SDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKE K+KLQA VQGFS SPAAWKSPDGGA   IDDEKED
Subjt:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED

Query:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK
        G  SDAIDIEKDSSDDEP+AKIDD SL    EGGRPVEE RS SPYPDEFQERQNFIGSLFEDRENT++DSARHEQSDST R+SKGKSKRSS+LEC EE 
Subjt:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK

Query:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA
        + HTKRLK ES +QQPVSGNWG QLQS  NLSPSKLNRDS RNPTSQVTNKGE+KGNSDFRPK GNKE V EKN SDVSQA WRPHDQS  GVRAVDTA 
Subjt:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA

Query:  RADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV
        R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENEG  EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ FSSSQMGYSPRDNNNR+
Subjt:  RADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV

Query:  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKN
        SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWSS+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK 
Subjt:  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKN

Query:  SEHIVEDSTRLNHRSLQSH---PQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPV
        SEH VED TR+NHR  QSH   PQY+SRVDHVEV+K VDANVKPNQGIGPE CGESNRKASVGISQL+DMKREQLPSKKGSKR APN ITEVTDALKNP+
Subjt:  SEHIVEDSTRLNHRSLQSH---PQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPV

Query:  SAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY
        SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDS+RGQDS++YFN+LGQLKESY
Subjt:  SAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY

Query:  RLCST
        RLCST
Subjt:  RLCST

XP_038883601.1 dentin sialophosphoprotein isoform X1 [Benincasa hispida] | e value: 0.0e+00 | % identity: 90.25
Query:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR
        MYGG+SKLGRPG GAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGG  SV+NPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR
Subjt:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR

Query:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI
        VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI
Subjt:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI

Query:  VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI
        VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ                   DRLSSSPIPSPPEQSGAPVS FGSANTTKTHVI EDI
Subjt:  VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI

Query:  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLL
        RPRLPAK+N+AASSEKEI TKAAKGVLETPGQEGNSGAK TDLQGMLYNLL ENPKGMSLKALEKAVGDKIPNAVKKIEPIIKK++         + +  
Subjt:  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLL

Query:  GVELEGSKKPTSEGESSPLISHHQT-PVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE
        GVELEGSKKP+SEGESSPLISHHQ  PVHEDLPDQITAPELQLEAR GIELEEKVETSQANK+SNFLEKNG+QQHSPDLFAEKKGSENSE QAASSSDNE
Subjt:  GVELEGSKKPTSEGESSPLISHHQT-PVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE

Query:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED
        SDSDSESDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSES+ PSNSKEGSDEDVDIMTSDDDKESKHKLQA  QGFSTSPAAWKSPDGGA QIIDDEKED
Subjt:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED

Query:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK
        GQ SDAIDIE DSSDDEPDAKIDD S L I EGGR VEEPRSFSPYPDEFQERQNFIGSLFEDR+NT++DS RHEQSDSTG+ISKGKSKRSSDLECLEEK
Subjt:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK

Query:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA
        SDHTKRLKSESLAQQPVS               SK  RDSVRNPTSQVTNKGE+KGNSDFRPKKG+KETV EKNSSDVSQAGWRPHDQS GGVRAVDTAA
Subjt:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA

Query:  RADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVS
        R DKHGDIGRGTKHTEK GHANENFH+FKDTF+GNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQ FSSSQMGYSPRDNNNR+S
Subjt:  RADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVS

Query:  ANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNS
        ANRSPVNGKGR LQRELSDLELGELR+PFPEE+RGKKKFERNNSLKQLENKE+TTDIW SDL++GKSNLK S+EYGKRS PHVSTKFPSNPEGSNKKK S
Subjt:  ANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNS

Query:  EHIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAER
        EHIVEDSTRLN RSLQSHPQYNSRVDHVEVDKS+ ANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVT+ALKNPVSAER
Subjt:  EHIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAER

Query:  ENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCS
        ENSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQY+EYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCS
Subjt:  ENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LCU6 Occludin_ELL domain-containing protein | e value: 0.0e+00 | % identity: 89.25
Query:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR
        MYGG SKLGRPG GAGRG  GKRPHSSFPLPPSHRPSGRLSLGGG AGSV+NPRNRTTTATTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKR
Subjt:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR

Query:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI
        VEAQGGTPRIKFDANA NSSGNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTTNHVKKLSEEAER+SKSRRAI
Subjt:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI

Query:  VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI
        VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ                   DRLSSSPIP PPEQ GAPVSQFGSANT+KTHVIAEDI
Subjt:  VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI

Query:  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLL
        RPR+PAKIN AAS+EKEIPT A KGVLETPGQEGNSG KPTDLQGMLYNLL ENPKGMSLKALEKAVGDKIPNAVKKIEPIIKK++        ++ +  
Subjt:  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLL

Query:  GVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNES
        GV LEGSKKPTSEGESSPLISHHQT VHEDLPDQ  APELQLEAR G++LEEKVETSQANKESNFLE NG+QQ  PD FAEKK SENSEGQAASSSDNES
Subjt:  GVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNES

Query:  DSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG
        DSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD PSNS+EGSD DVDIMTSDDDKESK KLQA VQGFSTSPAAWKSPDGG  QIIDDEKEDG
Subjt:  DSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG

Query:  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKS
        Q  DAIDIEKDSSDDEPDAKID  SLL  EEG RPVEEPRSFSPYPDEFQERQNFIGSLFEDREN ++DSARHEQSDSTGRISKGKSKRSSDLECLEEKS
Subjt:  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKS

Query:  DHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAAR
        DHTKRLKSESLAQQPVSGNWGVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDVSQAGWRPHDQS  GVRAVDTA R
Subjt:  DHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAAR

Query:  ADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA
        ADKHGDIGRGTKHTEK GHANENFHVFKDTFYGN +NEGTKEKKVSKNSRSGGPGDKQIQP DSHHSKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Subjt:  ADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA

Query:  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSE
        NRSPVNGKGR LQRE SDLELGELREPF EEARGKKKFERNNSLKQLENKENTTDIW SDLNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSE
Subjt:  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSE

Query:  HIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAERE
        HIVEDS R+N+RSL SH QYNSR+DH EVDKS D NVKPNQG GPEG  ESNRKASVGISQLND KREQ PSKKGSKR APNPITEVTD LKNPVSAERE
Subjt:  HIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAERE

Query:  NSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST
        NSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Subjt:  NSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST

A0A1S3BIQ1 dentin sialophosphoprotein isoform X1 | e value: 0.0e+00 | % identity: 89.17
Query:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR
        MYGG SKLGRPG GAGRG GGKRPHSSFPLPPSHRPSGRLSLGGG AGS +NPRNRTTTATTSEA QS EENFSLVTGNNPLAF MIIRLAPDLIDEIKR
Subjt:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR

Query:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI
        VEAQGGTPRIKFDA A NSSGNVIDVGGKEFRFTWSRE GD C+IYEERKSGEDGSGLLIESGN WRKLNV R+LDESTTNHVKKLSEEAER+SKSRRAI
Subjt:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI

Query:  VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI
        VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ                   DRLSSSPIP PPEQ G PVSQFGSANT KTHVIAEDI
Subjt:  VLEPGNPSMKNQIKQLAAAEANPWRHFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDI

Query:  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLL
        RPR+PAKIN AAS+EKEI T A KGVLETPGQEGNSGAKPTDLQGMLYNLL ENPKGMSLKALEKAVGDKIPNAVKKIEPIIKK++         + +  
Subjt:  RPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLL

Query:  GVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNES
        GVELEGSKKPTSEGESSPL+SHHQT VHEDLPDQI APELQLEA  GI+LEEKVETSQANKESNFLEKNG+QQ  PD FAEKKGSENSEGQAASSSDN S
Subjt:  GVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNES

Query:  DSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG
        DSDS+SDSSDSGSDSGNHSRSRS+SPVGSGSGSSSDSESD PSNS+EGSDEDVDIMTSDDDKESKHKLQA VQGFSTSPAAWKSPDGG  QIIDDEKEDG
Subjt:  DSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKEDG

Query:  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKS
        Q  DAIDIEKDSSDDEPDAK+D  SLL  EE GRPVEEPRSFSPYPDEFQERQNFIGSLFEDREN + DSARHEQSDSTGRISKGKSKRSSDLECLEEK+
Subjt:  QGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKS

Query:  DHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAAR
        DHTKRLKSESLAQQPVSGNWGVQLQSP NLSPSKLNRDSVRN TSQVTNKGEIKGNSDFRPKKGNKETV EKNSSDV QAGWRPHDQS GGVRAVDTA R
Subjt:  DHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAAR

Query:  ADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA
        ADKHGDIGRGTKH EK GHANENFHVFKDTFYGNA+NEGTKEKKVSKNSRSGGPGDK IQPFDSH SKPGEIVGKFKDGQ FSSSQMGYSPRDNNNRVSA
Subjt:  ADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSA

Query:  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSE
        NRSPVNGKGR LQRE SDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIW SDLNKGKSNLKAS+EYGKRSSPHVSTKFPSNPEGSNKKKNSE
Subjt:  NRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSE

Query:  HIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAERE
        H+VEDS RLN+RSL SH QYNSR+DH EVDKSVD NV+PNQG GPEG  ESNRKASVGISQLND KREQLPSKKGSKR APNPITEVTD LKNP+SAE E
Subjt:  HIVEDSTRLNHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAERE

Query:  NSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST
        NSDPKRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYR EFCKLGKELDSARGQDSE+YFNVLGQLKESYRLCST
Subjt:  NSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCST

A0A6J1K6B7 dentin sialophosphoprotein isoform X1 | e value: 0.0e+00 | % identity: 83.65
Query:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR
        MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA  SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKR
Subjt:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR

Query:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI
        VE+QGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTTNHVKKLSEEAERRSKSRRAI
Subjt:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI

Query:  VLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED
        VLEPGN SMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQKNELSQ                   +RLSSSP+PSPPEQ GAP+SQFGSAN TKTH IAED
Subjt:  VLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED

Query:  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNL
        I+PR PAKIN+AASSEKEIPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMSLKALEKAVGDKIPN+VKKIEPIIKK++         + + 
Subjt:  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNL

Query:  LGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE
          VELEGSKKP+SEGESSPL+SH QTPVHED  DQ   PE QLEAR  IELEEKVETSQANKESNFLEKNG+QQ+SPD FAEKKGSENSEG+AA+SSDNE
Subjt:  LGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE

Query:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED
        SDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKE KHKLQA VQGFS SPAAWKSPDGGA   IDDEKED
Subjt:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED

Query:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK
        G  SDAIDIEKDSSDDEP+AKIDD SL    EGGR VEE RS SPYPDEFQERQNFIGSLFEDRENT++DS RHEQSDST RISKGKSKRSS+LEC EE 
Subjt:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK

Query:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA
        + HTKRLK ES +QQPVSGNWGVQLQS  NLSPSKLNRDS RNPT+QVTNKGE+KGNSDFRPK GNKE V EKN SDVSQA WRPHDQS  GVRAVDTA 
Subjt:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA

Query:  RADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV
        R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE   EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ F SSQMGYSPRDNNNR+
Subjt:  RADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV

Query:  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKN
        SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWSS+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK 
Subjt:  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKN

Query:  SEHIVEDSTRLNHRSLQSHP---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPV
        SEH VED TR+NHR  QSHP   QY+SRVDHVEV+K VDANVK NQGIGPE CGESNRKASVG+SQL+DMKREQLPSKKGSKR APN I EVTDALKNP+
Subjt:  SEHIVEDSTRLNHRSLQSHP---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPV

Query:  SAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY
        SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDSARGQDS++YFN+LGQLKESY
Subjt:  SAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY

Query:  RLCSTPKASSPTNLGKH
        RLCST      +NL +H
Subjt:  RLCSTPKASSPTNLGKH

A0A6J1KCU5 dentin sialophosphoprotein isoform X3    0.0e+00    84.23
Query:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR
        MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA  SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKR
Subjt:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR

Query:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI
        VE+QGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTTNHVKKLSEEAERRSKSRRAI
Subjt:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI

Query:  VLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED
        VLEPGN SMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQKNELSQ                   +RLSSSP+PSPPEQ GAP+SQFGSAN TKTH IAED
Subjt:  VLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED

Query:  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNL
        I+PR PAKIN+AASSEKEIPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMSLKALEKAVGDKIPN+VKKIEPIIKK++         + + 
Subjt:  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNL

Query:  LGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE
          VELEGSKKP+SEGESSPL+SH QTPVHED  DQ   PE QLEAR  IELEEKVETSQANKESNFLEKNG+QQ+SPD FAEKKGSENSEG+AA+SSDNE
Subjt:  LGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE

Query:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED
        SDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKE KHKLQA VQGFS SPAAWKSPDGGA   IDDEKED
Subjt:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED

Query:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK
        G  SDAIDIEKDSSDDEP+AKIDD SL    EGGR VEE RS SPYPDEFQERQNFIGSLFEDRENT++DS RHEQSDST RISKGKSKRSS+LEC EE 
Subjt:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK

Query:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA
        + HTKRLK ES +QQPVSGNWGVQLQS  NLSPSKLNRDS RNPT+QVTNKGE+KGNSDFRPK GNKE V EKN SDVSQA WRPHDQS  GVRAVDTA 
Subjt:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA

Query:  RADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV
        R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE   EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ F SSQMGYSPRDNNNR+
Subjt:  RADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV

Query:  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKN
        SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWSS+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK 
Subjt:  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKN

Query:  SEHIVEDSTRLNHRSLQSHP---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPV
        SEH VED TR+NHR  QSHP   QY+SRVDHVEV+K VDANVK NQGIGPE CGESNRKASVG+SQL+DMKREQLPSKKGSKR APN I EVTDALKNP+
Subjt:  SEHIVEDSTRLNHRSLQSHP---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPV

Query:  SAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY
        SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDSARGQDS++YFN+LGQLKESY
Subjt:  SAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY

Query:  RLCST
        RLCST
Subjt:  RLCST

A0A6J1KF98 dentin sialophosphoprotein isoform X2    0.0e+00    83.65
Query:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR
        MYGG SKL R G GAGRGA GKRP SSFPLPP+HRPSGRLSLGGGGAGS ANPRNRT+TA  SEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLI+EIKR
Subjt:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKR

Query:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI
        VE+QGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLL+ESGNAWRK+NVQRILDESTTNHVKKLSEEAERRSKSRRAI
Subjt:  VEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAI

Query:  VLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED
        VLEPGN SMKNQIKQLAAAE+NPWR H+KNKKEPPFKKQKNELSQ                   +RLSSSP+PSPPEQ GAP+SQFGSAN TKTH IAED
Subjt:  VLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQKNELSQ-------------------DRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAED

Query:  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNL
        I+PR PAKIN+AASSEKEIPTKAAKGVLE PGQE N+GAKPTDLQGMLYNLL +NPKGMSLKALEKAVGDKIPN+VKKIEPIIKK++         + + 
Subjt:  IRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNL

Query:  LGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE
          VELEGSKKP+SEGESSPL+SH QTPVHED  DQ   PE QLEAR  IELEEKVETSQANKESNFLEKNG+QQ+SPD FAEKKGSENSEG+AA+SSDNE
Subjt:  LGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNE

Query:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED
        SDSDSESDSSDSGSDSGN SRSRS+SPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKE KHKLQA VQGFS SPAAWKSPDGGA   IDDEKED
Subjt:  SDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQGFSTSPAAWKSPDGGAAQIIDDEKED

Query:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK
        G  SDAIDIEKDSSDDEP+AKIDD SL    EGGR VEE RS SPYPDEFQERQNFIGSLFEDRENT++DS RHEQSDST RISKGKSKRSS+LEC EE 
Subjt:  GQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEK

Query:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA
        + HTKRLK ES +QQPVSGNWGVQLQS  NLSPSKLNRDS RNPT+QVTNKGE+KGNSDFRPK GNKE V EKN SDVSQA WRPHDQS  GVRAVDTA 
Subjt:  SDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAA

Query:  RADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV
        R DKHG+ IGRG KH+EKGGHANE+FH +KD FYGNAENE   EKKVS+NSRSGGPGDKQIQP DSH SKPG+IVGKFKDG+ F SSQMGYSPRDNNNR+
Subjt:  RADKHGD-IGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRV

Query:  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKN
        SA+RSPVNGKGR LQRE SDLELGELREPFPEEA GKKKFERNNS KQLENKE+T+DIWSS+LNKGKSNLKAS++ GKRSSPH+STKFPSNPEGSNKKK 
Subjt:  SANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKN

Query:  SEHIVEDSTRLNHRSLQSHP---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPV
        SEH VED TR+NHR  QSHP   QY+SRVDHVEV+K VDANVK NQGIGPE CGESNRKASVG+SQL+DMKREQLPSKKGSKR APN I EVTDALKNP+
Subjt:  SEHIVEDSTRLNHRSLQSHP---QYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPV

Query:  SAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY
        SAE ENSD KRRDSSSDENSCSYSKYEKDEPE KGAIKDFSQYKEYVQEY DKYE YLSLNKILESYRAEFCKLGKELDSARGQDS++YFN+LGQLKESY
Subjt:  SAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY

Query:  RLCSTPKASSPTNLGKH
        RLCST      +NL +H
Subjt:  RLCSTPKASSPTNLGKH

SwissProt top hits    e value    %identity    Alignment
No hits found
Arabidopsis top hits    e value    %identity    Alignment
AT3G21290.1 dentin sialophosphoprotein-related    3.6e-153    38.74
Query:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPS--GRLSLGGGGAGSVANPRNRTTT------ATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAP
        M+ GSSK G  G   G G+G  R  +SFP P +  PS  GR+S GGGG GS A PR R+ +      A+T+ + ++VEE F+LV   +  AF MIIRL+P
Subjt:  MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPS--GRLSLGGGGAGSVANPRNRTTT------ATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAP

Query:  DLIDEIKRVEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAER
        DL+DEIKRVEAQGG  +IKFDA   NS+ N+I+VGGKEF+FTWS E G+LCDIYEE +SGEDG+GLLIE+G AWRKLNV R LDESTT+H+K  S EAE+
Subjt:  DLIDEIKRVEAQGGTPRIKFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAER

Query:  RSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK--------------------NELSQDRLSSSPIPSPPEQSGAPVSQFGSANT
        R+KSR+AIVL+PGNPS+    KQLA AE +PWR   K KKEPP KK+K                        ++RLS+SP PSP  Q   P   +G  N 
Subjt:  RSKSRRAIVLEPGNPSMKNQIKQLAAAEANPWR-HFKNKKEPPFKKQK--------------------NELSQDRLSSSPIPSPPEQSGAPVSQFGSANT

Query:  TKTHVIAEDIRP-RLPAKINSAASSEKEIPTKAAKGVL-ETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDP
         KTH   E++ P +   ++N     EKE P+     VL +T G+E  +  K  DLQ +L ++L E P  MSLKALEKAVGDK+PN  KKIEPI+K++++ 
Subjt:  TKTHVIAEDIRP-RLPAKINSAASSEKEIPTKAAKGVL-ETPGQEGNSGAKPTDLQGMLYNLLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDP

Query:  NVKLFEIMHNLLGVELEGSKKPTSEGESSPLISHHQ-TPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNF-------------LEKNGVQQH
            + +       ELE  KK + +  SSP   H Q  PV E   DQ+  P        G    EK    + N E +               E   ++ H
Subjt:  NVKLFEIMHNLLGVELEGSKKPTSEGESSPLISHHQ-TPVHEDLPDQITAPELQLEARRGIELEEKVETSQANKESNF-------------LEKNGVQQH

Query:  SPDLFAEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQ-
        SP +F E+K SEN E QA SS    SDSDS+SD+SDSGSD        S+S  GS SGSSSDSE  A SNSK+GSDEDVDIM SD D+E     Q+  Q 
Subjt:  SPDLFAEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQAPVQ-

Query:  -----GFSTSPAAWKSPDGGAAQIIDDEKE----DGQGSDAIDIEKDSSDD----EPDAKIDDSSLLRIE---------EGGRPVEEPRSFSPYPDEFQE
             G  +S    +  +  A  I   + +    DG GSD +D+E +SSD+    + D K +  +  ++E          G   +     F+   D  +E
Subjt:  -----GFSTSPAAWKSPDGGAAQIIDDEKE----DGQGSDAIDIEKDSSDD----EPDAKIDDSSLLRIE---------EGGRPVEEPRSFSPYPDEFQE

Query:  RQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDS-VRNPTSQVTNK
        RQNFIG LF+D ENT  ++ ++++ D + R+ K +++++ D E   +KS H K  KS+S  Q        V   S H    S+L  D+ +RN ++  T  
Subjt:  RQNFIGSLFEDRENTLLDSARHEQSDSTGRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDS-VRNPTSQVTNK

Query:  GEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGH------ANENFHVFKDTFYGNAENEGTKEKK
                  P +G  ++  EK++                         +++KH D     + ++KG H      ++ +   F+D    N  ++   + K
Subjt:  GEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSGGGVRAVDTAARADKHGDIGRGTKHTEKGGH------ANENFHVFKDTFYGNAENEGTKEKK

Query:  VSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSL
          +N + G    +   P ++   KP E+ G  KD +  S   +G SP D+     A      G G  LQ+++S+LELGEL EP  E+    K  E   S 
Subjt:  VSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVSANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSL

Query:  KQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQYNSRVDHVEVD-----------KSV
        +Q   K +T++    D +K +S    S    K+++P      P    GSN     EH+VEDS R    +LQSH Q  +  D  E+            KS 
Subjt:  KQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRLNHRSLQSHPQYNSRVDHVEVD-----------KSV

Query:  DANVKPNQGIGPEGCGESNRKASV--GISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNP-VSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKG
          + +   G   EG GE+N+K  V    S+     R    SK+ S     N I    DA   P  S  RE    K+  S  +E+S SY KYEK  PE KG
Subjt:  DANVKPNQGIGPEGCGESNRKASV--GISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNP-VSAERENSDPKRRDSSSDENSCSYSKYEKDEPEFKG

Query:  AIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY
         I D  QYK Y+QEY+DKY+SY S+NKILES+R +F KLG++L  A+G+D ERY  ++ Q+KESY
Subjt:  AIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESY


Sequences
CDS sequence
ATGTATGGCGGCTCATCCAAGCTCGGTCGACCCGGCGCCGGCGCCGGCCGGGGAGCCGGAGGAAAGCGTCCGCACTCTTCTTTTCCTCTACCACCTTCTCACCGCCCCTC
CGGTCGTCTCTCTCTTGGCGGCGGTGGTGCTGGTTCTGTCGCCAATCCTCGGAATCGAACCACCACTGCAACCACATCCGAAGCCCCTCAATCCGTTGAGGAGAATTTCA
GTCTCGTTACTGGTAACAACCCGTTGGCTTTTGCCATGATAATTCGGTTGGCTCCGGATTTGATCGATGAGATCAAGCGGGTCGAGGCGCAGGGCGGGACTCCGAGGATT
AAGTTTGATGCGAACGCCAAGAATTCTAGTGGTAATGTCATTGATGTTGGTGGGAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATATGA
AGAACGCAAAAGTGGAGAAGATGGAAGTGGTTTGCTCATTGAATCAGGCAATGCTTGGAGAAAACTGAATGTGCAGCGTATCTTAGACGAATCAACTACAAACCATGTTA
AGAAGTTGTCTGAAGAAGCTGAACGTAGATCTAAGTCTCGTAGAGCTATTGTCTTAGAACCTGGGAATCCATCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAA
GCTAATCCATGGAGGCACTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGATAGGCTATCATCTTCACCTATTCCATCGCCTCCTGA
GCAATCCGGTGCTCCAGTATCTCAATTTGGATCTGCAAACACCACTAAGACTCATGTTATTGCAGAAGATATTAGACCTCGACTGCCTGCTAAGATTAATTCTGCTGCTA
GCAGCGAGAAGGAAATCCCGACCAAAGCTGCAAAAGGAGTACTTGAAACACCAGGACAAGAAGGGAATAGTGGAGCTAAACCAACAGACTTGCAAGGAATGTTGTATAAT
TTACTTTCGGAAAACCCCAAGGGGATGAGTTTGAAGGCCTTGGAGAAAGCTGTTGGCGATAAAATCCCAAATGCTGTAAAAAAGATCGAGCCAATCATTAAAAAAGTAAG
TGATCCAAATGTTAAATTATTTGAAATAATGCATAATCTACTGGGAGTTGAGTTGGAAGGCTCCAAAAAACCTACATCCGAAGGTGAAAGCTCTCCTTTGATCAGCCATC
ACCAAACCCCTGTACATGAAGACCTCCCTGATCAAATAACTGCTCCAGAATTGCAGTTAGAAGCAAGACGTGGCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCT
AACAAAGAATCAAATTTCTTGGAGAAAAATGGCGTCCAACAGCATTCACCAGATCTTTTTGCTGAGAAAAAAGGTTCTGAAAATAGTGAAGGCCAGGCAGCTAGTTCTTC
TGACAACGAAAGTGACAGTGATTCTGAAAGTGATAGCAGTGATAGTGGAAGTGATAGTGGGAATCATAGTAGGAGTAGAAGCCAAAGCCCCGTGGGTAGTGGGAGTGGGA
GTAGCAGTGATAGTGAAAGTGATGCACCTTCCAATAGCAAGGAGGGTTCTGATGAGGACGTGGATATTATGACCAGTGATGATGACAAAGAATCCAAGCATAAATTGCAA
GCTCCCGTGCAGGGTTTCTCTACATCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGCGCAGATTATTGATGATGAGAAGGAAGATGGACAAGGATCTGATGCAAT
TGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATTGATGATAGCAGTTTACTTCGTATTGAAGAAGGTGGAAGACCTGTGGAAGAACCAAGATCCTTTT
CACCATACCCTGATGAATTCCAAGAGCGCCAAAATTTTATAGGGAGTTTGTTTGAGGATAGGGAAAATACTCTTCTGGACAGTGCCAGGCATGAACAATCTGACAGCACA
GGTCGGATATCTAAAGGCAAGTCTAAAAGGAGTTCTGATCTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCAGAAAGCTTAGCCCAACAACCAGT
TTCTGGTAATTGGGGAGTCCAATTACAGAGTCCTCACAATTTATCTCCTAGTAAACTCAACAGAGATTCTGTCAGAAATCCTACCAGTCAAGTTACGAATAAAGGTGAAA
TTAAAGGCAATTCTGATTTTAGACCAAAAAAGGGAAACAAAGAAACAGTTCCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCTCATGACCAGAGTGGT
GGAGGAGTGAGGGCTGTGGATACAGCTGCCAGAGCCGACAAGCATGGTGATATTGGACGTGGCACTAAACACACCGAAAAGGGTGGTCATGCTAATGAAAATTTTCATGT
GTTTAAAGATACCTTTTATGGAAATGCTGAAAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGTGGTCCAGGGGACAAACAGATACAACCTTTTG
ACTCCCATCACAGTAAACCTGGTGAAATAGTTGGAAAATTCAAAGATGGCCAAATATTTTCAAGCTCGCAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTTAGT
GCCAACAGGTCCCCTGTTAATGGAAAAGGCCGAAGTCTCCAAAGAGAGCTTTCAGACCTGGAGTTAGGTGAACTTCGTGAGCCTTTCCCTGAGGAAGCACGGGGTAAAAA
GAAATTTGAAAGAAACAATTCGTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGAGTTCAGATTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTATAG
AATATGGAAAGCGATCCTCACCCCATGTAAGTACCAAGTTTCCCAGCAATCCGGAAGGCTCAAATAAAAAGAAGAATTCAGAACATATAGTTGAAGATTCGACCAGGCTT
AACCACCGGTCTCTGCAGTCTCATCCACAATATAATTCAAGGGTAGACCATGTTGAAGTCGATAAGTCAGTTGATGCAAATGTAAAACCTAATCAGGGGATTGGTCCAGA
AGGCTGTGGGGAAAGCAACAGAAAAGCATCCGTTGGCATTTCCCAGCTGAATGATATGAAAAGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAGACTAGCACCTAATC
CAATAACCGAAGTTACTGATGCACTGAAGAACCCAGTATCAGCTGAGCGTGAAAATAGTGATCCAAAGAGAAGAGATTCTTCTTCAGACGAAAACAGTTGTTCATATTCC
AAGTATGAAAAGGATGAGCCAGAGTTCAAGGGGGCAATCAAGGATTTTTCTCAGTACAAGGAATATGTACAAGAGTATCATGATAAATATGAATCATACCTATCCTTGAA
CAAAATCCTAGAAAGCTACAGGGCTGAGTTCTGCAAACTCGGAAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAGATACTTTAATGTCTTAGGACAGCTGAAAG
AATCCTATCGGCTGTGTTCAACGCCGAAGGCTTCATCTCCTACCAATCTTGGAAAGCATGATGGAGGATGGAGAGTTTGA
mRNA sequence
ATGTATGGCGGCTCATCCAAGCTCGGTCGACCCGGCGCCGGCGCCGGCCGGGGAGCCGGAGGAAAGCGTCCGCACTCTTCTTTTCCTCTACCACCTTCTCACCGCCCCTC
CGGTCGTCTCTCTCTTGGCGGCGGTGGTGCTGGTTCTGTCGCCAATCCTCGGAATCGAACCACCACTGCAACCACATCCGAAGCCCCTCAATCCGTTGAGGAGAATTTCA
GTCTCGTTACTGGTAACAACCCGTTGGCTTTTGCCATGATAATTCGGTTGGCTCCGGATTTGATCGATGAGATCAAGCGGGTCGAGGCGCAGGGCGGGACTCCGAGGATT
AAGTTTGATGCGAACGCCAAGAATTCTAGTGGTAATGTCATTGATGTTGGTGGGAAAGAGTTTAGGTTCACATGGTCACGTGAAGTTGGTGATCTTTGTGACATATATGA
AGAACGCAAAAGTGGAGAAGATGGAAGTGGTTTGCTCATTGAATCAGGCAATGCTTGGAGAAAACTGAATGTGCAGCGTATCTTAGACGAATCAACTACAAACCATGTTA
AGAAGTTGTCTGAAGAAGCTGAACGTAGATCTAAGTCTCGTAGAGCTATTGTCTTAGAACCTGGGAATCCATCTATGAAGAATCAAATAAAGCAATTGGCTGCTGCCGAA
GCTAATCCATGGAGGCACTTTAAGAATAAGAAAGAGCCTCCCTTTAAAAAGCAGAAAAACGAATTGTCTCAAGATAGGCTATCATCTTCACCTATTCCATCGCCTCCTGA
GCAATCCGGTGCTCCAGTATCTCAATTTGGATCTGCAAACACCACTAAGACTCATGTTATTGCAGAAGATATTAGACCTCGACTGCCTGCTAAGATTAATTCTGCTGCTA
GCAGCGAGAAGGAAATCCCGACCAAAGCTGCAAAAGGAGTACTTGAAACACCAGGACAAGAAGGGAATAGTGGAGCTAAACCAACAGACTTGCAAGGAATGTTGTATAAT
TTACTTTCGGAAAACCCCAAGGGGATGAGTTTGAAGGCCTTGGAGAAAGCTGTTGGCGATAAAATCCCAAATGCTGTAAAAAAGATCGAGCCAATCATTAAAAAAGTAAG
TGATCCAAATGTTAAATTATTTGAAATAATGCATAATCTACTGGGAGTTGAGTTGGAAGGCTCCAAAAAACCTACATCCGAAGGTGAAAGCTCTCCTTTGATCAGCCATC
ACCAAACCCCTGTACATGAAGACCTCCCTGATCAAATAACTGCTCCAGAATTGCAGTTAGAAGCAAGACGTGGCATTGAATTGGAGGAAAAGGTAGAAACCTCTCAAGCT
AACAAAGAATCAAATTTCTTGGAGAAAAATGGCGTCCAACAGCATTCACCAGATCTTTTTGCTGAGAAAAAAGGTTCTGAAAATAGTGAAGGCCAGGCAGCTAGTTCTTC
TGACAACGAAAGTGACAGTGATTCTGAAAGTGATAGCAGTGATAGTGGAAGTGATAGTGGGAATCATAGTAGGAGTAGAAGCCAAAGCCCCGTGGGTAGTGGGAGTGGGA
GTAGCAGTGATAGTGAAAGTGATGCACCTTCCAATAGCAAGGAGGGTTCTGATGAGGACGTGGATATTATGACCAGTGATGATGACAAAGAATCCAAGCATAAATTGCAA
GCTCCCGTGCAGGGTTTCTCTACATCTCCTGCTGCTTGGAAAAGTCCAGATGGTGGGGCTGCGCAGATTATTGATGATGAGAAGGAAGATGGACAAGGATCTGATGCAAT
TGATATTGAGAAAGATTCTTCTGATGATGAGCCAGATGCTAAAATTGATGATAGCAGTTTACTTCGTATTGAAGAAGGTGGAAGACCTGTGGAAGAACCAAGATCCTTTT
CACCATACCCTGATGAATTCCAAGAGCGCCAAAATTTTATAGGGAGTTTGTTTGAGGATAGGGAAAATACTCTTCTGGACAGTGCCAGGCATGAACAATCTGACAGCACA
GGTCGGATATCTAAAGGCAAGTCTAAAAGGAGTTCTGATCTGGAGTGCTTAGAAGAGAAATCTGATCATACCAAGAGATTAAAATCAGAAAGCTTAGCCCAACAACCAGT
TTCTGGTAATTGGGGAGTCCAATTACAGAGTCCTCACAATTTATCTCCTAGTAAACTCAACAGAGATTCTGTCAGAAATCCTACCAGTCAAGTTACGAATAAAGGTGAAA
TTAAAGGCAATTCTGATTTTAGACCAAAAAAGGGAAACAAAGAAACAGTTCCAGAAAAAAATAGTTCAGATGTTTCACAAGCAGGTTGGAGGCCTCATGACCAGAGTGGT
GGAGGAGTGAGGGCTGTGGATACAGCTGCCAGAGCCGACAAGCATGGTGATATTGGACGTGGCACTAAACACACCGAAAAGGGTGGTCATGCTAATGAAAATTTTCATGT
GTTTAAAGATACCTTTTATGGAAATGCTGAAAATGAAGGGACAAAGGAGAAAAAGGTGTCAAAAAATTCTAGATCAGGTGGTCCAGGGGACAAACAGATACAACCTTTTG
ACTCCCATCACAGTAAACCTGGTGAAATAGTTGGAAAATTCAAAGATGGCCAAATATTTTCAAGCTCGCAGATGGGGTATTCACCAAGGGATAATAATAATAGAGTTAGT
GCCAACAGGTCCCCTGTTAATGGAAAAGGCCGAAGTCTCCAAAGAGAGCTTTCAGACCTGGAGTTAGGTGAACTTCGTGAGCCTTTCCCTGAGGAAGCACGGGGTAAAAA
GAAATTTGAAAGAAACAATTCGTTGAAACAGTTGGAGAATAAAGAAAACACTACAGATATCTGGAGTTCAGATTTAAATAAAGGAAAATCTAATTTGAAGGCTAGTATAG
AATATGGAAAGCGATCCTCACCCCATGTAAGTACCAAGTTTCCCAGCAATCCGGAAGGCTCAAATAAAAAGAAGAATTCAGAACATATAGTTGAAGATTCGACCAGGCTT
AACCACCGGTCTCTGCAGTCTCATCCACAATATAATTCAAGGGTAGACCATGTTGAAGTCGATAAGTCAGTTGATGCAAATGTAAAACCTAATCAGGGGATTGGTCCAGA
AGGCTGTGGGGAAAGCAACAGAAAAGCATCCGTTGGCATTTCCCAGCTGAATGATATGAAAAGAGAACAGCTTCCCTCAAAAAAAGGAAGTAAAAGACTAGCACCTAATC
CAATAACCGAAGTTACTGATGCACTGAAGAACCCAGTATCAGCTGAGCGTGAAAATAGTGATCCAAAGAGAAGAGATTCTTCTTCAGACGAAAACAGTTGTTCATATTCC
AAGTATGAAAAGGATGAGCCAGAGTTCAAGGGGGCAATCAAGGATTTTTCTCAGTACAAGGAATATGTACAAGAGTATCATGATAAATATGAATCATACCTATCCTTGAA
CAAAATCCTAGAAAGCTACAGGGCTGAGTTCTGCAAACTCGGAAAGGAACTTGATTCTGCTAGAGGACAAGATTCGGAGAGATACTTTAATGTCTTAGGACAGCTGAAAG
AATCCTATCGGCTGTGTTCAACGCCGAAGGCTTCATCTCCTACCAATCTTGGAAAGCATGATGGAGGATGGAGAGTTTGA
Protein sequence
MYGGSSKLGRPGAGAGRGAGGKRPHSSFPLPPSHRPSGRLSLGGGGAGSVANPRNRTTTATTSEAPQSVEENFSLVTGNNPLAFAMIIRLAPDLIDEIKRVEAQGGTPRI
KFDANAKNSSGNVIDVGGKEFRFTWSREVGDLCDIYEERKSGEDGSGLLIESGNAWRKLNVQRILDESTTNHVKKLSEEAERRSKSRRAIVLEPGNPSMKNQIKQLAAAE
ANPWRHFKNKKEPPFKKQKNELSQDRLSSSPIPSPPEQSGAPVSQFGSANTTKTHVIAEDIRPRLPAKINSAASSEKEIPTKAAKGVLETPGQEGNSGAKPTDLQGMLYN
LLSENPKGMSLKALEKAVGDKIPNAVKKIEPIIKKVSDPNVKLFEIMHNLLGVELEGSKKPTSEGESSPLISHHQTPVHEDLPDQITAPELQLEARRGIELEEKVETSQA
NKESNFLEKNGVQQHSPDLFAEKKGSENSEGQAASSSDNESDSDSESDSSDSGSDSGNHSRSRSQSPVGSGSGSSSDSESDAPSNSKEGSDEDVDIMTSDDDKESKHKLQ
APVQGFSTSPAAWKSPDGGAAQIIDDEKEDGQGSDAIDIEKDSSDDEPDAKIDDSSLLRIEEGGRPVEEPRSFSPYPDEFQERQNFIGSLFEDRENTLLDSARHEQSDST
GRISKGKSKRSSDLECLEEKSDHTKRLKSESLAQQPVSGNWGVQLQSPHNLSPSKLNRDSVRNPTSQVTNKGEIKGNSDFRPKKGNKETVPEKNSSDVSQAGWRPHDQSG
GGVRAVDTAARADKHGDIGRGTKHTEKGGHANENFHVFKDTFYGNAENEGTKEKKVSKNSRSGGPGDKQIQPFDSHHSKPGEIVGKFKDGQIFSSSQMGYSPRDNNNRVS
ANRSPVNGKGRSLQRELSDLELGELREPFPEEARGKKKFERNNSLKQLENKENTTDIWSSDLNKGKSNLKASIEYGKRSSPHVSTKFPSNPEGSNKKKNSEHIVEDSTRL
NHRSLQSHPQYNSRVDHVEVDKSVDANVKPNQGIGPEGCGESNRKASVGISQLNDMKREQLPSKKGSKRLAPNPITEVTDALKNPVSAERENSDPKRRDSSSDENSCSYS
KYEKDEPEFKGAIKDFSQYKEYVQEYHDKYESYLSLNKILESYRAEFCKLGKELDSARGQDSERYFNVLGQLKESYRLCSTPKASSPTNLGKHDGGWRV