CuGenDBv2

CcUC02G026400 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC02G026400
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Dentin sialophosphoprotein
Genome location: CicolChr02:12574665..12577377
RNA-Seq Expression: CcUC02G026400
Synteny: CcUC02G026400
Gene Ontology terms: NA
InterPro domains: NA
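
The genome location field uses a `sequence:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the field can be split and the genomic span measured as follows. The helper name `parse_location` is hypothetical and is not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("CicolChr02:12574665..12577377")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the gene spans 2,713 bp of CicolChr02, which covers introns and untranslated regions as well as the coding sequence.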


Homology
GenBank top hits
KAA0034793.1 dentin sialophosphoprotein [Cucumis melo var. makuwa] | e value: 1.1e-227 | % identity: 84.14
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI DETPAPSKSTLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ETANYSSSA+VDWFQDDQ QGGSQKKPDDKSVFKDD SADAWD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDFR+SSQP SFA+AFP     SVEKA  PDASDL+RM EENG+S ENS+A + QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

XP_008455912.1 PREDICTED: uncharacterized protein LOC103495983 [Cucumis melo] | e value: 4.2e-227 | % identity: 83.73
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKSTLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSVFKDD SAD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDFR+SSQP SFA+AFP     SVEKA  PDASDL+RM EENG+S ENS+A   QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

XP_011649988.1 uncharacterized protein LOC101209977 [Cucumis sativus] | e value: 3.8e-228 | % identity: 83.94
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN A ISSYDPH PSLPNLPS +ETIA+LDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVG IDLKESNRGKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDL  YF+EEK DTTSKASD  PP +K+TVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PS  TA RTTKHE+DDSFSGWEASFQ ASSAT  DNSKS+DPF VSGVNISSSLETTFG+ +KS SGE+EDTKNPSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+TI MPDQ+EQTGILIDGR  ETANYSSSA+VDWFQDDQ QG SQKKPDDKSVFKDD SADAWDDFTSSTGVQGP D+S+KDIVND VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FST T++DSDFR+SSQP SFA+AFP     SVEKA  PDASDLSRM+EENG++ ENS+A++RQAASGPSSS+DD +MMM KMHDLSFMLES LS+PPK
Subjt:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

XP_022970990.1 uncharacterized protein LOC111469795 [Cucurbita maxima] | e value: 2.7e-205 | % identity: 76.13
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA++IP DLI QLQISLRN AK+SSYDPHD SLPNLPSLHETIA+LDPSPPYLRCKHCKGRLLRDLKSF+CVFCG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV
        SLDLDGSEMVG +DLKESNRGKS E+FPLT+LLDL+IRWPESEK+G+SD T APSKSTLNLA VDLD YFSEE KDTT K SDE  PLN+Q       T 
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV

Query:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD
        +DNVDLSLF  V SS TA R  +HE+ DSFSGWEA+FQT +SATSH+NSKS+DPFA+SGV+IS SLE T G  +K RSGE E+TKNPSSSM +DWF QQD
Subjt:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD

Query:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD
        DLWSSSNHETI  P+Q++QTG   DG+   TA+YSSSASVDWFQDDQ QGGS KKPDD S FKDD SADAWDDFTSSTG+QG  D+  KDIVN+IVPKVD
Subjt:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD

Query:  EISEVDFFSTTTSRDSDFRNSSQPNSFADAFPK-----SVEKATRPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES
        EISE+DFFSTTTS+D +F N SQPN F +AFP      S EKATRPDASDLSRM+EENG+SGENS+A K  QA+S PSS+ DD+QMMMAKMHDLSFMLES
Subjt:  EISEVDFFSTTTSRDSDFRNSSQPNSFADAFPK-----SVEKATRPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES

Query:  NLSVPPK
        +LS+PPK
Subjt:  NLSVPPK

XP_038902680.1 uncharacterized protein LOC120089318 [Benincasa hispida] | e value: 7.4e-240 | % identity: 87.32
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLI QLQISLRN AKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHC GRLLRDLKSF+CVFCGREQNTDVPPDPINFKNTIACRWLLE
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMV PI+LKESNRGKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKS LNLA VDLD+YFSEEKKDTTSKAS+EPPPLNKQTVEDNVDLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNH
        LFD VPSS TA RTTKHE+ DSFSGWEASFQ ASSAT HDNSKS+DPFAVS VNISSSLETTFGD +KSRSGE++DTKNPSSS+ NDWFQQ DLWSSSNH
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNH

Query:  ETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF
        ETIRMPDQ+EQTGI+IDGRAAETANYSSSASVDWFQ DQRQGGSQKKPDDKS FK DHSADAWDDFTSSTGV GPSD+SRKDIVND+V KVDEISEVDFF
Subjt:  ETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF

Query:  STTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        STT   +SDFRNSSQPNSFA+AFP     S+ KAT  DASDLSRM+EE+GE+GENS+A++ Q+ASGPSSS+DD+QMMM KMHDLSFMLESNLS+PPK
Subjt:  STTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

TrEMBL top hits
A0A0A0LMS7 Uncharacterized protein | e value: 1.9e-228 | % identity: 83.94
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN A ISSYDPH PSLPNLPS +ETIA+LDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVG IDLKESNRGKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDL  YF+EEK DTTSKASD  PP +K+TVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PS  TA RTTKHE+DDSFSGWEASFQ ASSAT  DNSKS+DPF VSGVNISSSLETTFG+ +KS SGE+EDTKNPSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+TI MPDQ+EQTGILIDGR  ETANYSSSA+VDWFQDDQ QG SQKKPDDKSVFKDD SADAWDDFTSSTGVQGP D+S+KDIVND VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FST T++DSDFR+SSQP SFA+AFP     SVEKA  PDASDLSRM+EENG++ ENS+A++RQAASGPSSS+DD +MMM KMHDLSFMLES LS+PPK
Subjt:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A1S3C2P9 uncharacterized protein LOC103495983 | e value: 2.0e-227 | % identity: 83.73
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKSTLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSVFKDD SAD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDFR+SSQP SFA+AFP     SVEKA  PDASDL+RM EENG+S ENS+A   QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A5A7SW96 Dentin sialophosphoprotein | e value: 5.4e-228 | % identity: 84.14
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI DETPAPSKSTLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ETANYSSSA+VDWFQDDQ QGGSQKKPDDKSVFKDD SADAWD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDFR+SSQP SFA+AFP     SVEKA  PDASDL+RM EENG+S ENS+A + QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A5D3CEG4 Dentin sialophosphoprotein | e value: 2.0e-227 | % identity: 83.73
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKSTLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSVFKDD SAD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDFR+SSQP SFA+AFP     SVEKA  PDASDL+RM EENG+S ENS+A   QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFRNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A6J1I4G5 uncharacterized protein LOC111469795 | e value: 1.3e-205 | % identity: 76.13
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA++IP DLI QLQISLRN AK+SSYDPHD SLPNLPSLHETIA+LDPSPPYLRCKHCKGRLLRDLKSF+CVFCG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV
        SLDLDGSEMVG +DLKESNRGKS E+FPLT+LLDL+IRWPESEK+G+SD T APSKSTLNLA VDLD YFSEE KDTT K SDE  PLN+Q       T 
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV

Query:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD
        +DNVDLSLF  V SS TA R  +HE+ DSFSGWEA+FQT +SATSH+NSKS+DPFA+SGV+IS SLE T G  +K RSGE E+TKNPSSSM +DWF QQD
Subjt:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD

Query:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD
        DLWSSSNHETI  P+Q++QTG   DG+   TA+YSSSASVDWFQDDQ QGGS KKPDD S FKDD SADAWDDFTSSTG+QG  D+  KDIVN+IVPKVD
Subjt:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD

Query:  EISEVDFFSTTTSRDSDFRNSSQPNSFADAFPK-----SVEKATRPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES
        EISE+DFFSTTTS+D +F N SQPN F +AFP      S EKATRPDASDLSRM+EENG+SGENS+A K  QA+S PSS+ DD+QMMMAKMHDLSFMLES
Subjt:  EISEVDFFSTTTSRDSDFRNSSQPNSFADAFPK-----SVEKATRPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES

Query:  NLSVPPK
        +LS+PPK
Subjt:  NLSVPPK
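
In the alignments above, the middle line of each block marks identical residues with a letter, conservative substitutions with `+`, and other mismatches or gaps with a space. As a rough sketch, assuming that BLAST-style convention applies to this page, identities and positives can be counted directly from a midline. The function name `midline_stats` is hypothetical, and the input is the final alignment block of the hit immediately above.

```python
def midline_stats(midline):
    """Count identities, positives, and columns in a BLAST-style midline."""
    # A letter on the midline means query and subject share that residue.
    identities = sum(ch.isalpha() for ch in midline)
    # A '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Midline of the last block of the alignment above (query segment NLSVPPK).
identities, positives, columns = midline_stats("+LS+PPK")
print(identities, positives, columns)
```

Summing these counts over every block of one hit and dividing identities by the total number of alignment columns gives a percent identity comparable to the reported value, provided the leading and trailing whitespace of each midline is preserved so that the column counts stay correct.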

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G05090.1 dentin sialophosphoprotein-related | e value: 2.4e-47 | % identity: 26.9
Query:  MAYEIPRDLINQLQISLRNRAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLINQL++SLR  AK++S D   D S P+LP+  E IAELD S PYLRC++CKG+LLR ++S ICVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV
        L SL+LDGSEMV P+ +   S+RG  K+P  +   L+  LDLEI+W   E+K   D      K+ LNL G++LD YF E + D +     E  P+     
Subjt:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV

Query:  EDNVDLSLFDKVPSSAT-------------------------------------------------------------------------AARTTKHEND
        +D   LSLFD V S                                                                            A RT+  ++D
Subjt:  EDNVDLSLFDKVPSSAT-------------------------------------------------------------------------AARTTKHEND

Query:  DSF------------------------------------------------------------------SGWEASFQTASSATSHDNSKSIDPFAVSGVN
        +SF                                                                  S W++ FQ+A    S       DPF  S V+
Subjt:  DSF------------------------------------------------------------------SGWEASFQTASSATSHDNSKSIDPFAVSGVN

Query:  ISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMPDQL--EQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKS
        +++ +++ FG        +  D+     S A DW  QDDL+ +   E       +  +  G ++ G      N +SS  +DW  DD  Q   +K  +   
Subjt:  ISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMPDQL--EQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKS

Query:  VFKDDHSADAWDDFTSS-----------------------------TGVQGPSDDSRKDIVNDIVPKVDEISEVDFFST---------------------
           +D   D W+DF SS                              GV+  S D +++    ++  + +  E D F T                     
Subjt:  VFKDDHSADAWDDFTSS-----------------------------TGVQGPSDDSRKDIVNDIVPKVDEISEVDFFST---------------------

Query:  ----------------TTSRDSDFRNSSQPNSFADAFPKSVE----KATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSF
                          +RD DF + S+ + F+++          K      S L R ++ +G   +  + +     + P S SD  + +M++MHDLSF
Subjt:  ----------------TTSRDSDFRNSSQPNSFADAFPKSVE----KATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSF

Query:  MLESNLSVPP
        MLE+ LSVPP
Subjt:  MLESNLSVPP

AT4G20720.1 dentin sialophosphoprotein-related | e value: 3.5e-46 | % identity: 42.05
Query:  MAYEIPRDLINQLQISLRNRAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLINQL++SLR  AK++S D   D S P+LP+  E IAELD S PYLRC++CKG+LLR ++S ICVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV
        L SL+LDGSEMV P+ +   S+RG  K+P  +   L+  LDLEI+W   E+K   D      K+ LNL G++LD YF E + D +     E  P+     
Subjt:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV

Query:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESED
        +D   LSLFD V S      + +H+N   F   +A     SS    + S      A   V+ ++     F +   +R+   ED
Subjt:  EDNVDLSLFDKVPSSATAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESED

AT4G20720.1 dentin sialophosphoprotein-related | e value: 1.1e-04 | % identity: 25.79
Query:  AGVDLDFYFSEEKKDTTSKASDEPP----PLNKQTVEDNV---DLSLFDKVPSSATAARTTKHEN---DDSFSGWEASFQTASSATSHDNSKSIDPFAVS
        A  D D  F    ++ + K  D  P    P++     D+V      L    P+ ++ A  +K  +   DD F       QT  SA  HD ++      + 
Subjt:  AGVDLDFYFSEEKKDTTSKASDEPP----PLNKQTVEDNV---DLSLFDKVPSSATAARTTKHEN---DDSFSGWEASFQTASSATSHDNSKSIDPFAVS

Query:  GVNISSSLETTF-GDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMP--DQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKP
        G N +SS++  + GD     + +    K P+    +D    +D  SS+N +T   P    +E +   I    A+  N     SV     D++Q       
Subjt:  GVNISSSLETTF-GDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMP--DQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKP

Query:  DDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF-STTTSRDSDFRNSSQPNSFADAFPKSVE----KATRPDASDLSRMNE
         D    ++D     WD FTSST +Q     S +       P  ++  E++ F     +RD DF + S+ + F+++          K      S L R ++
Subjt:  DDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF-STTTSRDSDFRNSSQPNSFADAFPKSVE----KATRPDASDLSRMNE

Query:  ENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPP
         +G   +  + +     + P S SD  + +M++MHDLSFMLE+ LSVPP
Subjt:  ENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPP


Sequences
CDS sequence
ATGGCGTATGAAATCCCTCGCGATCTGATCAATCAACTTCAGATCTCTCTTCGAAATAGGGCCAAAATCTCCTCCTACGACCCTCACGATCCTTCACTTCCAAAT
CTACCATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCCCCGCCTTATCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCATTTATT
TGCGTTTTCTGCGGCAGGGAACAGAACACGGACGTCCCTCCGGACCCCATTAATTTCAAGAATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGAT
GGATCGGAGATGGTGGGACCAATCGATTTGAAGGAATCAAACCGGGGAAAATCACCAGAGCAATTTCCCCTGACGAATCTTTTAGATTTAGAGATTAGATGGCCT
GAATCTGAAAAGAAAGGGATCTCAGACGAAACCCCGGCTCCAAGTAAAAGTACCTTGAATTTGGCTGGAGTTGATCTTGACTTCTACTTCTCTGAGGAAAAAAAA
GACACTACTTCAAAAGCATCTGATGAGCCACCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAGTTTATTTGATAAGGTTCCATCTTCCGCGACG
GCAGCAAGGACCACTAAACATGAGAATGATGATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGACTGCTAGTTCTGCAACTTCTCATGATAATTCTAAATCAATT
GATCCTTTTGCTGTTTCTGGGGTCAATATATCTTCCTCCTTGGAAACAACGTTTGGAGACCATAGCAAGTCCAGAAGTGGAGAATCAGAAGATACTAAAAATCCC
TCTTCATCAATGGCCAATGACTGGTTTCAACAAGATGATTTATGGAGTAGTTCTAATCACGAAACAATTCGCATGCCAGATCAGCTTGAACAAACTGGAATTTTA
ATTGATGGTAGAGCTGCAGAAACTGCTAATTATTCTTCATCAGCAAGCGTTGATTGGTTTCAAGATGATCAGCGGCAAGGAGGGAGCCAAAAGAAACCTGATGAT
AAAAGTGTTTTTAAAGATGATCACTCAGCTGATGCTTGGGATGATTTTACTAGCTCAACTGGTGTGCAAGGCCCCTCCGATGATTCTAGGAAAGACATCGTGAAT
GACATTGTGCCAAAGGTGGATGAGATATCAGAAGTAGATTTCTTCAGCACAACCACCTCAAGGGATAGTGATTTTAGAAACTCTTCTCAGCCAAATTCATTTGCA
GATGCATTCCCCAAATCCGTAGAAAAAGCAACGCGGCCAGATGCTTCTGATTTAAGCAGGATGAATGAAGAGAATGGAGAAAGTGGAGAAAATTCTGAAGCTATG
AAGCGTCAAGCTGCATCAGGTCCTAGTTCAAGTTCTGATGATATACAGATGATGATGGCGAAGATGCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAGTC
CCCCCAAAGTGA
mRNA sequence
GTTCTTAGTTGAATGGAACGCAGCGTTTTGCGCATTTTCCCGTGTACCGGAGAAGCCATCTTCCCCGTCTGGGAGAAAGTGGATTTTTCATTTTGGAAATCCACA
CAGAAGAAGAAGAAGAAGAAGAAGAAGGAGAACTCCGTTGCTCACACAACCTGAAGATACTGCGAACCTCTTCAAACTCAATCAATGGCGTATGAAATCCCTCGC
GATCTGATCAATCAACTTCAGATCTCTCTTCGAAATAGGGCCAAAATCTCCTCCTACGACCCTCACGATCCTTCACTTCCAAATCTACCATCGCTCCATGAAACA
ATTGCAGAGCTTGATCCCTCCCCGCCTTATCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCATTTATTTGCGTTTTCTGCGGCAGGGAA
CAGAACACGGACGTCCCTCCGGACCCCATTAATTTCAAGAATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGATGGATCGGAGATGGTGGGACCA
ATCGATTTGAAGGAATCAAACCGGGGAAAATCACCAGAGCAATTTCCCCTGACGAATCTTTTAGATTTAGAGATTAGATGGCCTGAATCTGAAAAGAAAGGGATC
TCAGACGAAACCCCGGCTCCAAGTAAAAGTACCTTGAATTTGGCTGGAGTTGATCTTGACTTCTACTTCTCTGAGGAAAAAAAAGACACTACTTCAAAAGCATCT
GATGAGCCACCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAGTTTATTTGATAAGGTTCCATCTTCCGCGACGGCAGCAAGGACCACTAAACAT
GAGAATGATGATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGACTGCTAGTTCTGCAACTTCTCATGATAATTCTAAATCAATTGATCCTTTTGCTGTTTCTGGG
GTCAATATATCTTCCTCCTTGGAAACAACGTTTGGAGACCATAGCAAGTCCAGAAGTGGAGAATCAGAAGATACTAAAAATCCCTCTTCATCAATGGCCAATGAC
TGGTTTCAACAAGATGATTTATGGAGTAGTTCTAATCACGAAACAATTCGCATGCCAGATCAGCTTGAACAAACTGGAATTTTAATTGATGGTAGAGCTGCAGAA
ACTGCTAATTATTCTTCATCAGCAAGCGTTGATTGGTTTCAAGATGATCAGCGGCAAGGAGGGAGCCAAAAGAAACCTGATGATAAAAGTGTTTTTAAAGATGAT
CACTCAGCTGATGCTTGGGATGATTTTACTAGCTCAACTGGTGTGCAAGGCCCCTCCGATGATTCTAGGAAAGACATCGTGAATGACATTGTGCCAAAGGTGGAT
GAGATATCAGAAGTAGATTTCTTCAGCACAACCACCTCAAGGGATAGTGATTTTAGAAACTCTTCTCAGCCAAATTCATTTGCAGATGCATTCCCCAAATCCGTA
GAAAAAGCAACGCGGCCAGATGCTTCTGATTTAAGCAGGATGAATGAAGAGAATGGAGAAAGTGGAGAAAATTCTGAAGCTATGAAGCGTCAAGCTGCATCAGGT
CCTAGTTCAAGTTCTGATGATATACAGATGATGATGGCGAAGATGCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAGTCCCCCCAAAGTGATGCATCTTT
AATTCTTCTGAAGCATTCTGCCACTGAGCTTTTCTTGTATTTTTCTTTCCCTCTTTCTTTTTAAATCTGTAGCAGTATAGTGTTAGTTTAGTTATTACGGAATGC
ATTCTTTGATTTTATAAAATGGCCATATGCC
Protein sequence
MAYEIPRDLINQLQISLRNRAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLD
GSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLSLFDKVPSSAT
AARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMPDQLEQTGIL
IDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDHSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFRNSSQPNSFA
DAFPKSVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
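
The protein sequence should correspond to the CDS read three bases at a time. A minimal sketch of that check follows, assuming the standard genetic code (the page does not say which translation table was used). The names `CODON_TABLE` and `translate` are illustrative, and only the first 21 and last 12 bases of the CDS listed on this page are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGCGTATGAAATCCCTCGC"  # first 21 bases of the CDS
cds_end = "CCCCCAAAGTGA"             # last 12 bases of the CDS
print(translate(cds_start))  # MAYEIPR
print(translate(cds_end))    # PPK*
```

Both fragments agree with the start (MAYEIPR) and end (PPK) of the protein sequence, with the final TGA codon acting as the stop.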