CuGenDBv2

Clc02G09120 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G09120
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Dentin sialophosphoprotein
Genome location: ClcChr02:11917900..11922048
RNA-Seq Expression: Clc02G09120
Synteny: Clc02G09120
Gene Ontology terms: NA
InterPro domains: NA
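
The genome location string above can be split into a sequence ID and a coordinate pair. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("ClcChr02:11917900..11922048")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 4149 bp on ClcChr02.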


Homology
GenBank top hits | e value | % identity
KAA0034793.1 dentin sialophosphoprotein [Cucumis melo var. makuwa] | 1.1e-227 | 83.73
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI DETPAPSKS+LNLAGVDL +YF+EEK DT SKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ETANYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDSADAWD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDF +SSQP SFA+AFP     SVEKA WPDASDL+RM EENG+S ENS+A + QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

XP_008455912.1 PREDICTED: uncharacterized protein LOC103495983 [Cucumis melo] | 4.2e-227 | 83.33
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKS+LNLAGVDL +YF+EEK DT SKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDSAD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDF +SSQP SFA+AFP     SVEKA WPDASDL+RM EENG+S ENS+A   QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

XP_011649988.1 uncharacterized protein LOC101209977 [Cucumis sativus] | 1.9e-227 | 83.33
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN A ISSYDPH PSLPNLPS +ETIA+LDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVG IDLKESNRGKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKS+LNLAGVDL  YF+EEK DT SKASD  PP +K+TVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PS  TA RTTKHE+DDSFSGWEASFQ ASSAT  DNSKS+DPF VSGVNISSSLETTFG+ +KS SGE+EDTKNPSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+TI MPDQ+EQTGILIDGR  ETANYSSSA+VDWFQDDQ QG SQKKPDDKSV KDD SADAWDDFTSSTGVQGP D+S+KDIVND VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FST T++DSDF +SSQP SFA+AFP     SVEKA WPDASDLSRM+EENG++ ENS+A++RQAASGPSSS+DD +MMM KMHDLSFMLES LS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

XP_022970990.1 uncharacterized protein LOC111469795 [Cucurbita maxima] | 2.3e-204 | 75.74
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA++IP DLI QLQISLRN AK+SSYDPHD SLPNLPSLHETIA+LDPSPPYLRCKHCKGRLLRDLKSF+CVFCG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQ-------TV
        SLDLDGSEMVG +DLKESNRGKS E+FPLT+LLDL+IRWPESEK+G+SD T APSKS+LNLA VDLD YFSEE KDT  K SDE  PLN+Q       T 
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQ-------TV

Query:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD
        +DNVDLSLF  V SS TA R  +HE+ DSFSGWEA+FQT +SATSH+NSKS+DPFA+SGV+IS SLE T G  +K RSGE E+TKNPSSSM +DWF QQD
Subjt:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD

Query:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD
        DLWSSSNHETI  P+Q++QTG   DG+   TA+YSSSASVDWFQDDQ QGGS KKPDD S  KDDDSADAWDDFTSSTG+QG  D+  KDIVN+IVPKVD
Subjt:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD

Query:  EISEVDFFSTTTSRDSDFGNSSQPNSFADAFPK-----SVEKATWPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES
        EISE+DFFSTTTS+D +FGN SQPN F +AFP      S EKAT PDASDLSRM+EENG+SGENS+A K  QA+S PSS+ DD+QMMMAKMHDLSFMLES
Subjt:  EISEVDFFSTTTSRDSDFGNSSQPNSFADAFPK-----SVEKATWPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES

Query:  NLSVPPK
        +LS+PPK
Subjt:  NLSVPPK

XP_038902680.1 uncharacterized protein LOC120089318 [Benincasa hispida] | 1.4e-238 | 86.72
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLI QLQISLRN AKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHC GRLLRDLKSF+CVFCGREQNTDVPPDPINFKNTIACRWLLE
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMV PI+LKESNRGKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKS+LNLA VDLD+YFSEEKKDT SKAS+EPPPLNKQTVEDNVDLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNH
        LFD VPSS TA RTTKHE+ DSFSGWEASFQ ASSAT HDNSKS+DPFAVS VNISSSLETTFGD +KSRSGE++DTKNPSSS+ NDWFQQ DLWSSSNH
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNH

Query:  ETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF
        ETIRMPDQ+EQTGI+IDGRAAETANYSSSASVDWFQ DQRQGGSQKKPDDKS  K D SADAWDDFTSSTGV GPSD+SRKDIVND+V KVDEISEVDFF
Subjt:  ETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF

Query:  STTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        STT   +SDF NSSQPNSFA+AFP     S+ KATW DASDLSRM+EE+GE+GENS+A++ Q+ASGPSSS+DD+QMMM KMHDLSFMLESNLS+PPK
Subjt:  STTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

TrEMBL top hits | e value | % identity
A0A0A0LMS7 Uncharacterized protein | 9.2e-228 | 83.33
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN A ISSYDPH PSLPNLPS +ETIA+LDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVG IDLKESNRGKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKS+LNLAGVDL  YF+EEK DT SKASD  PP +K+TVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PS  TA RTTKHE+DDSFSGWEASFQ ASSAT  DNSKS+DPF VSGVNISSSLETTFG+ +KS SGE+EDTKNPSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+TI MPDQ+EQTGILIDGR  ETANYSSSA+VDWFQDDQ QG SQKKPDDKSV KDD SADAWDDFTSSTGVQGP D+S+KDIVND VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FST T++DSDF +SSQP SFA+AFP     SVEKA WPDASDLSRM+EENG++ ENS+A++RQAASGPSSS+DD +MMM KMHDLSFMLES LS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A1S3C2P9 uncharacterized protein LOC103495983 | 2.0e-227 | 83.33
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKS+LNLAGVDL +YF+EEK DT SKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDSAD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDF +SSQP SFA+AFP     SVEKA WPDASDL+RM EENG+S ENS+A   QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A5A7SW96 Dentin sialophosphoprotein | 5.4e-228 | 83.73
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI DETPAPSKS+LNLAGVDL +YF+EEK DT SKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ETANYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDSADAWD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDF +SSQP SFA+AFP     SVEKA WPDASDL+RM EENG+S ENS+A + QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A5D3CEG4 Dentin sialophosphoprotein | 2.0e-227 | 83.33
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKS+LNLAGVDL +YF+EEK DT SKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDSAD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDF +SSQP SFA+AFP     SVEKA WPDASDL+RM EENG+S ENS+A   QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATWPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A6J1I4G5 uncharacterized protein LOC111469795 | 1.1e-204 | 75.74
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA++IP DLI QLQISLRN AK+SSYDPHD SLPNLPSLHETIA+LDPSPPYLRCKHCKGRLLRDLKSF+CVFCG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQ-------TV
        SLDLDGSEMVG +DLKESNRGKS E+FPLT+LLDL+IRWPESEK+G+SD T APSKS+LNLA VDLD YFSEE KDT  K SDE  PLN+Q       T 
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQ-------TV

Query:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD
        +DNVDLSLF  V SS TA R  +HE+ DSFSGWEA+FQT +SATSH+NSKS+DPFA+SGV+IS SLE T G  +K RSGE E+TKNPSSSM +DWF QQD
Subjt:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD

Query:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD
        DLWSSSNHETI  P+Q++QTG   DG+   TA+YSSSASVDWFQDDQ QGGS KKPDD S  KDDDSADAWDDFTSSTG+QG  D+  KDIVN+IVPKVD
Subjt:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD

Query:  EISEVDFFSTTTSRDSDFGNSSQPNSFADAFPK-----SVEKATWPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES
        EISE+DFFSTTTS+D +FGN SQPN F +AFP      S EKAT PDASDLSRM+EENG+SGENS+A K  QA+S PSS+ DD+QMMMAKMHDLSFMLES
Subjt:  EISEVDFFSTTTSRDSDFGNSSQPNSFADAFPK-----SVEKATWPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES

Query:  NLSVPPK
        +LS+PPK
Subjt:  NLSVPPK

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G05090.1 dentin sialophosphoprotein-related | 2.9e-48 | 27.32
Query:  MAYEIPRDLINQLQISLRNRAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLINQL++SLR  AK++S D   D S P+LP+  E IAELD S PYLRC++CKG+LLR ++S ICVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTV
        L SL+LDGSEMV P+ +   S+RG  K+P  +   L+  LDLEI+W   E+K   D      K+ LNL G++LD YF E + D +     E  P+     
Subjt:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTV

Query:  EDNVDLSLFDKVPSSAT-------------------------------------------------------------------------AARTTKHEND
        +D   LSLFD V S                                                                            A RT+  ++D
Subjt:  EDNVDLSLFDKVPSSAT-------------------------------------------------------------------------AARTTKHEND

Query:  DSF------------------------------------------------------------------SGWEASFQTASSATSHDNSKSIDPFAVSGVN
        +SF                                                                  S W++ FQ+A    S       DPF  S V+
Subjt:  DSF------------------------------------------------------------------SGWEASFQTASSATSHDNSKSIDPFAVSGVN

Query:  ISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMPDQL--EQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKS
        +++ +++ FG        +  D+     S A DW  QDDL+ +   E       +  +  G ++ G      N +SS  +DW  DD  Q   +K  +   
Subjt:  ISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMPDQL--EQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKS

Query:  VIKDDDSADAWDDFTSS-----------------------------TGVQGPSDDSRKDIVNDIVPKVDEISEVDFFST---------------------
           +DD  D W+DF SS                              GV+  S D +++    ++  + +  E D F T                     
Subjt:  VIKDDDSADAWDDFTSS-----------------------------TGVQGPSDDSRKDIVNDIVPKVDEISEVDFFST---------------------

Query:  ----------------TTSRDSDFGNSSQPNSFADAF---PKSVEKATWPD-ASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSF
                          +RD DF + S+ + F+++      S E    P   S L R ++ +G   +  + +     + P S SD  + +M++MHDLSF
Subjt:  ----------------TTSRDSDFGNSSQPNSFADAF---PKSVEKATWPD-ASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSF

Query:  MLESNLSVPP
        MLE+ LSVPP
Subjt:  MLESNLSVPP

AT4G20720.1 dentin sialophosphoprotein-related | 3.5e-46 | 42.05
Query:  MAYEIPRDLINQLQISLRNRAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLINQL++SLR  AK++S D   D S P+LP+  E IAELD S PYLRC++CKG+LLR ++S ICVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTV
        L SL+LDGSEMV P+ +   S+RG  K+P  +   L+  LDLEI+W   E+K   D      K+ LNL G++LD YF E + D +     E  P+     
Subjt:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTV

Query:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESED
        +D   LSLFD V S      + +H+N   F   +A     SS    + S      A   V+ ++     F +   +R+   ED
Subjt:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESED

AT4G20720.1 dentin sialophosphoprotein-related | 7.9e-06 | 26.65
Query:  AGVDLDFYFSEEKKDTASKASDEPP----PLNKQTVEDNV---DLSLFDKVPSSATAARTTKHEN---DDSFSGWEASFQTASSATSHDNSKSIDPFAVS
        A  D D  F    ++ + K  D  P    P++     D+V      L    P+ ++ A  +K  +   DD F       QT  SA  HD ++      + 
Subjt:  AGVDLDFYFSEEKKDTASKASDEPP----PLNKQTVEDNV---DLSLFDKVPSSATAARTTKHEN---DDSFSGWEASFQTASSATSHDNSKSIDPFAVS

Query:  GVNISSSLETTF-GDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMP--DQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKP
        G N +SS++  + GD     + +    K P+    +D    +D  SS+N +T   P    +E +   I    A+  N     SV     D++Q       
Subjt:  GVNISSSLETTF-GDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMP--DQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKP

Query:  DDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF-STTTSRDSDFGNSSQPNSFADAF---PKSVEKATWPD-ASDLSRMNE
         D    ++DD    WD FTSST +Q     S +       P  ++  E++ F     +RD DF + S+ + F+++      S E    P   S L R ++
Subjt:  DDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF-STTTSRDSDFGNSSQPNSFADAF---PKSVEKATWPD-ASDLSRMNE

Query:  ENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPP
         +G   +  + +     + P S SD  + +M++MHDLSFMLE+ LSVPP
Subjt:  ENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPP
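
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, identity over a stretch of alignment can be counted from that line. The helper name `identity_stats` is hypothetical, the fragment is a ten-residue excerpt from the first GenBank alignment (KAA0034793.1), and the database's own % identity figures are computed over the full alignment, so a short excerpt will not reproduce them.

```python
from typing import Tuple


def identity_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment stretch.

    The midline is assumed to follow the BLAST-style convention:
    a letter marks an identical residue, '+' a conservative substitution.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for column in midline if column.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Ten-column excerpt from the first GenBank alignment in this record.
query_fragment = "NLPSLHETIA"
midline_fragment = "NLPS ++TIA"

identities, positives, length = identity_stats(query_fragment, midline_fragment)
print(identities, positives, length, round(100 * identities / length, 2))
```

For this excerpt, 7 of 10 columns are identical and 9 of 10 are positive.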


Sequences
CDS sequence
ATGGCGTATGAAATCCCTCGCGATCTGATCAATCAACTTCAGATCTCTCTTCGAAATAGGGCCAAAATCTCCTCCTACGACCCTCACGATCCTTCACTTCCAAATCTACC
ATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCCCCGCCTTATCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCATTTATTTGCGTTTTCT
GCGGCAGGGAACAGAACACCGACGTCCCTCCGGACCCCATTAATTTCAAGAATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGATGGATCGGAGATGGTG
GGACCAATCGATTTGAAGGAATCAAACCGGGGAAAATCACCAGAGCAATTTCCCCTGACGAATCTTTTAGATTTAGAGATTAGATGGCCTGAATCTGAAAAGAAAGGGAT
CTCAGACGAAACCCCGGCTCCAAGTAAAAGTTCCTTGAATTTGGCTGGAGTTGATCTTGACTTCTACTTCTCTGAGGAAAAAAAAGACACTGCTTCAAAAGCATCTGATG
AGCCACCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAGTTTATTTGATAAGGTTCCATCTTCCGCGACGGCAGCAAGGACCACTAAACATGAGAATGAT
GATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGACTGCTAGTTCTGCAACTTCTCATGATAATTCTAAATCAATTGATCCTTTTGCTGTTTCTGGGGTCAATATATCTTC
CTCTTTGGAAACAACGTTTGGAGACCATAGCAAGTCCAGAAGTGGAGAATCAGAAGATACTAAAAATCCCTCTTCATCAATGGCCAATGACTGGTTTCAACAAGATGATT
TATGGAGTAGTTCTAATCACGAAACGATTCGCATGCCAGATCAGCTTGAACAAACTGGAATTTTAATTGATGGTAGAGCTGCAGAAACTGCTAATTATTCTTCATCAGCA
AGCGTTGATTGGTTTCAAGATGATCAGCGGCAAGGAGGGAGCCAAAAGAAACCTGATGATAAAAGTGTTATTAAAGATGATGATTCAGCTGATGCTTGGGATGATTTTAC
TAGCTCAACTGGTGTGCAAGGCCCCTCTGATGATTCTAGGAAAGACATTGTGAATGACATTGTGCCAAAGGTGGATGAGATATCAGAAGTAGATTTCTTCAGCACAACCA
CCTCAAGGGATAGTGATTTTGGAAACTCTTCTCAGCCAAATTCATTTGCAGATGCATTCCCCAAATCCGTAGAAAAAGCAACGTGGCCAGATGCTTCTGATTTAAGCAGG
ATGAATGAAGAGAATGGAGAAAGTGGAGAAAATTCTGAAGCTATGAAGCGTCAAGCTGCATCAGGTCCTAGTTCAAGTTCTGATGATATACAGATGATGATGGCGAAGAT
GCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAGTCCCCCCAAAGTGA
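
The CDS above corresponds to the protein sequence listed in this record. A minimal sketch of the relationship, assuming the standard genetic code (the record does not state which translation table was used): the helper `translate` is hypothetical, and only the first ten and last six codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


first_codons = "ATGGCGTATGAAATCCCTCGCGATCTGATC"  # start of the CDS above
last_codons = "TCAGTCCCCCCAAAGTGA"               # end of the CDS above

print(translate(first_codons))  # MAYEIPRDLI
print(translate(last_codons))   # SVPPK*
```

Both results agree with the start (MAYEIPRDLI) and end (SVPPK) of the protein sequence in this record, with the trailing `*` marking the TGA stop codon.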
mRNA sequence
GGAAAAGAGTGAAAGCATAAACTCCCATAAAGTTAAAAGTACTTGAAATGGCGAGTTCTTGGTTGAATGGAACGCAGCGTTTTGCGCATTTTCCCGTGTACCGGAGAAGC
TATCTTCCCCGTCTGGGAGAACGTGGATTTTTCATTTTGGAAATCCACACAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGGAGAACTCCGTCGCTC
ACACAACCTGAAGATACTGCGAACCTCTTCAAACTCAATCAATGGCGTATGAAATCCCTCGCGATCTGATCAATCAACTTCAGATCTCTCTTCGAAATAGGGCCAAAATC
TCCTCCTACGACCCTCACGATCCTTCACTTCCAAATCTACCATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCCCCGCCTTATCTTCGCTGCAAACACTGCAAAGG
AAGATTGCTTAGAGACTTGAAGTCATTTATTTGCGTTTTCTGCGGCAGGGAACAGAACACCGACGTCCCTCCGGACCCCATTAATTTCAAGAATACCATTGCTTGTCGTT
GGCTTCTCGAATCCTTGGACTTGGATGGATCGGAGATGGTGGGACCAATCGATTTGAAGGAATCAAACCGGGGAAAATCACCAGAGCAATTTCCCCTGACGAATCTTTTA
GATTTAGAGATTAGATGGCCTGAATCTGAAAAGAAAGGGATCTCAGACGAAACCCCGGCTCCAAGTAAAAGTTCCTTGAATTTGGCTGGAGTTGATCTTGACTTCTACTT
CTCTGAGGAAAAAAAAGACACTGCTTCAAAAGCATCTGATGAGCCACCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAGTTTATTTGATAAGGTTCCAT
CTTCCGCGACGGCAGCAAGGACCACTAAACATGAGAATGATGATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGACTGCTAGTTCTGCAACTTCTCATGATAATTCTAAA
TCAATTGATCCTTTTGCTGTTTCTGGGGTCAATATATCTTCCTCTTTGGAAACAACGTTTGGAGACCATAGCAAGTCCAGAAGTGGAGAATCAGAAGATACTAAAAATCC
CTCTTCATCAATGGCCAATGACTGGTTTCAACAAGATGATTTATGGAGTAGTTCTAATCACGAAACGATTCGCATGCCAGATCAGCTTGAACAAACTGGAATTTTAATTG
ATGGTAGAGCTGCAGAAACTGCTAATTATTCTTCATCAGCAAGCGTTGATTGGTTTCAAGATGATCAGCGGCAAGGAGGGAGCCAAAAGAAACCTGATGATAAAAGTGTT
ATTAAAGATGATGATTCAGCTGATGCTTGGGATGATTTTACTAGCTCAACTGGTGTGCAAGGCCCCTCTGATGATTCTAGGAAAGACATTGTGAATGACATTGTGCCAAA
GGTGGATGAGATATCAGAAGTAGATTTCTTCAGCACAACCACCTCAAGGGATAGTGATTTTGGAAACTCTTCTCAGCCAAATTCATTTGCAGATGCATTCCCCAAATCCG
TAGAAAAAGCAACGTGGCCAGATGCTTCTGATTTAAGCAGGATGAATGAAGAGAATGGAGAAAGTGGAGAAAATTCTGAAGCTATGAAGCGTCAAGCTGCATCAGGTCCT
AGTTCAAGTTCTGATGATATACAGATGATGATGGCGAAGATGCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAGTCCCCCCAAAGTGATGCATCTTTAATTCTTC
TTCTGAAGCACTCTGCCACTGAGCTTTTTTTGTATTTTTCTTTCCCACTTTCTTTTTAAATCTGTAGCAGTATAGTGTTAGTTTAGTTATTACGGAATGCATTCTTTGAT
TTTATAAAATGGCCATATGCCGTTGAAATCCATTGCAGGCATAACTAACATACTACACATTCACCTGACTGACATACTCTTTTCAAACTTACACCCATTGTTCAAAAAAA
ATTATGTTCTAGAGGACAATTATTCACTGATGACTGACCAAAACTGAAACTTTGACACAACTACAATCCAGGGTCACGCAGGTGACATTTCAAATAGTATTAAGGGGAGT
TCATTATTTGCCATTGTGAATCAACCCCTCCACAAAGATGAACGAAAGCATAAAACTATCCTAGCTTGAAGACAAAATCTAAGCAACTGACCAAATGTTAGTTATTTCAC
TTTGTTGTCTCTTGACACAAGTACACAAACAATAATGCCAAAGCACTGGATTTGAATAAGGTTGGCCATGGAAGCCCATAGGTTCCCTTTGCAAACATCCCCTTCAAAAA
TGGCCAGAAGCACAAAATCAACCACACACTGCATAGCACCTCTGCCACCCCAAACTCTTGAATGCTTGGCTGCGTTTGCAAGAAGCCGATCAAGAGTGCTGCCAGTTGAA
TCATCAAAATTGTTGTGACTGGCACAAACAATGGGGACTCATCAAATGTGAACTGACCCAAATCTCCATCCCTACTCTCTGTGTCATCACTAGAAGAAGATGACTCCTTT
TTTGTTATTTCAAACACAGTCTCTGAAAGTCCAAAAGTCTTGAGAATGACAGCCACCATCCCAAGCAAGGAAGTGGACATTGTCTGAATCTTCTCCATTCTTAGGTTATT
CCACCAAGCTCTTACAGATTGGCCCGTTTCAAAGTAGTCTAACAACCCTCGTAGCTTGAGAATGACGAACAGGAGAAGAGGTACGCATATTACTGGTTCTTGTACCTGCA
TTGAGAAATTCATAGCTGTTTAACAAGTTGAATATCTCTGGTTATTTAGATGATGTAGGTAGGGGTTGTATTCTATACTTGCCTTGGGCAAAAAATGAGAGTTGGAAATT
AGACAAAATGCTGGTAGAGTAGCGTAACATATCTCTGGAATAGCACGAAGGCCCACTAGATACGCCCACAGATAGATGAGACGTTGCCTGAATTGAAGCTTATCAGAGAA
AGCAGTGACAATGGGGGACTTTTTGCTGATTAGAATTTCAAGTAGTCCTGTCATTGCTCTCTTATGATGGTTTAATGGAATTGGGCCTCCTGAGGGTGCACATCCTAGGA
ATGCTGGTGGAGTTGGTGTAATGTATGCAGATTTCCATCCCTTTTTATGGATCTCCATCCCAGTCAACACGTCCTCCACTATGGATCCATATTGCCAACCCACCTAAACA
ACCCATCACCAATTTTGATTAAGGATTCCAACATTTGACCTTGCAAAATATTATA
Protein sequence
MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMV
GPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSSLNLAGVDLDFYFSEEKKDTASKASDEPPPLNKQTVEDNVDLSLFDKVPSSATAARTTKHEND
DSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSA
SVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGNSSQPNSFADAFPKSVEKATWPDASDLSR
MNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK