CuGenDBv2

CaUC02G035290 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC02G035290
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Dentin sialophosphoprotein
Genome location: Ciama_Chr02:14450323..14453096
RNA-Seq Expression: CaUC02G035290
Synteny: CaUC02G035290
Gene Ontology terms: NA
InterPro domains: NA
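The genome location above uses a `seqid:start..end` notation. As a minimal sketch, it can be split into its parts and turned into a span length as follows. The function names are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re


def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Number of bases covered by an inclusive start..end interval."""
    return end - start + 1


seqid, start, end = parse_location("Ciama_Chr02:14450323..14453096")
print(seqid, start, end, span_length(start, end))
# prints: Ciama_Chr02 14450323 14453096 2774
```

Under the inclusive-coordinate assumption the gene spans 2774 bases on Ciama_Chr02.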


Homology
GenBank top hits (e value, % identity, alignment)
KAA0034793.1 dentin sialophosphoprotein [Cucumis melo var. makuwa]; e value: 7.2e-227; % identity: 83.94
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH+PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI DETPAPSKSTLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ETANYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDSADAWD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDF +SSQP SFA+AFP     SVEKA  PDASDL+RM EENG+S ENS+A + QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

XP_008455912.1 PREDICTED: uncharacterized protein LOC103495983 [Cucumis melo]; e value: 2.7e-226; % identity: 83.53
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH+PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKSTLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDSAD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDF +SSQP SFA+AFP     SVEKA  PDASDL+RM EENG+S ENS+A   QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

XP_011649988.1 uncharacterized protein LOC101209977 [Cucumis sativus]; e value: 1.2e-226; % identity: 83.53
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN A ISSYDPH+PSLPNLPS +ETIA+LDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVG IDLKESNRGKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDL  YF+EEK DTTSKASD  PP +K+TVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PS  TA RTTKHE+DDSFSGWEASFQ ASSAT  DNSKS+DPF VSGVNISSSLETTFG+ +KS SGE+EDTKNPSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+TI MPDQ+EQTGILIDGR  ETANYSSSA+VDWFQDDQ QG SQKKPDDKSV KDD SADAWDDFTSSTGVQGP D+S+KDIVND VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FST T++DSDF +SSQP SFA+AFP     SVEKA  PDASDLSRM+EENG++ ENS+A++RQAASGPSSS+DD +MMM KMHDLSFMLES LS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

XP_022970990.1 uncharacterized protein LOC111469795 [Cucurbita maxima]; e value: 2.0e-205; % identity: 76.13
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA++IP DLI QLQISLRN AK+SSYDPH+ SLPNLPSLHETIA+LDPSPPYLRCKHCKGRLLRDLKSF+CVFCG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV
        SLDLDGSEMVG +DLKESNRGKS E+FPLT+LLDL+IRWPESEK+G+SD T APSKSTLNLA VDLD YFSEE KDTT K SDE  PLN+Q       T 
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV

Query:  EDNVDLSLFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD
        +DNVDLSLF  V SS TA R  +HE+ DSFSGWEA+FQT +SATSH+NSKS+DPFA+SGV+IS SLE T G  +K RSGE E+TKNPSSSM +DWF QQD
Subjt:  EDNVDLSLFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD

Query:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD
        DLWSSSNHETI  P+Q++QTG   DG+   TA+YSSSASVDWFQDDQ QGGS KKPDD S  KDDDSADAWDDFTSSTG+QG  D+  KDIVN+IVPKVD
Subjt:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD

Query:  EISEVDFFSTTTSRDSDFGNSSQPNSFADAFPK-----SVEKATRPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES
        EISE+DFFSTTTS+D +FGN SQPN F +AFP      S EKATRPDASDLSRM+EENG+SGENS+A K  QA+S PSS+ DD+QMMMAKMHDLSFMLES
Subjt:  EISEVDFFSTTTSRDSDFGNSSQPNSFADAFPK-----SVEKATRPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES

Query:  NLSVPPK
        +LS+PPK
Subjt:  NLSVPPK

XP_038902680.1 uncharacterized protein LOC120089318 [Benincasa hispida]; e value: 2.2e-236; % identity: 86.52
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLI QLQISLRN AKISSYDPH+PSLPNLPSLHETIAELDPSPPYLRCKHC GRLLRDLKSF+CVFCGREQNTDVPPDPINFKNTIACRWLLE
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMV PI+LKESNRGKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKS LNLA VDLD+YFSEEKKDTTSKAS+EPPPLNKQTVEDNVDLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNH
        LFD VPSS TA RTTKHE+ DSFSGWEASFQ ASSAT HDNSKS+DPFAVS VNISSSLETTFGD +KSRSGE++DTKNPSSS+ NDWFQQ DLWSSSNH
Subjt:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNH

Query:  ETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF
        ETIRMPDQ+EQTGI+IDGRAAETANYSSSASVDWFQ DQRQGGSQKKPDDKS  K D SADAWDDFTSSTGV GPSD+SRKDIVND+V KVDEISEVDFF
Subjt:  ETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF

Query:  STTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        STT   +SDF NSSQPNSFA+AFP     S+ KAT  DASDLSRM+EE+GE+GENS+A++ Q+ASGPSSS+DD+QMMM KMHDLSFMLESNLS+PPK
Subjt:  STTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LMS7 Uncharacterized protein; e value: 6.0e-227; % identity: 83.53
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN A ISSYDPH+PSLPNLPS +ETIA+LDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVG IDLKESNRGKSPEQFPLT+LLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDL  YF+EEK DTTSKASD  PP +K+TVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PS  TA RTTKHE+DDSFSGWEASFQ ASSAT  DNSKS+DPF VSGVNISSSLETTFG+ +KS SGE+EDTKNPSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+TI MPDQ+EQTGILIDGR  ETANYSSSA+VDWFQDDQ QG SQKKPDDKSV KDD SADAWDDFTSSTGVQGP D+S+KDIVND VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FST T++DSDF +SSQP SFA+AFP     SVEKA  PDASDLSRM+EENG++ ENS+A++RQAASGPSSS+DD +MMM KMHDLSFMLES LS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A1S3C2P9 uncharacterized protein LOC103495983; e value: 1.3e-226; % identity: 83.53
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH+PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKSTLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDSAD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDF +SSQP SFA+AFP     SVEKA  PDASDL+RM EENG+S ENS+A   QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A5A7SW96 Dentin sialophosphoprotein; e value: 3.5e-227; % identity: 83.94
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH+PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI DETPAPSKSTLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ETANYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDSADAWD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDF +SSQP SFA+AFP     SVEKA  PDASDL+RM EENG+S ENS+A + QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A5D3CEG4 Dentin sialophosphoprotein; e value: 1.3e-226; % identity: 83.53
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPRDLI QLQISLRN AKISSYDPH+PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS
        SLDLDGSEMVGPIDLKESNRGKSPEQFPLT+LLDLEIRWPES+K GI+DETPAPSKSTLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLS

Query:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN
        LFDK PSS +A RTTKHE+DDSFSGWEASFQTASSATS DNSKSIDPF VSGVN+SSS E TFGD +KSRSGE+EDTK+PSSS  NDWF QQDDLWSSSN
Subjt:  LFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQDDLWSSSN

Query:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF
        H+T+ MPDQ+EQTGILIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSV KDDDSAD WD+FTSSTGVQGPSD+SRKDIV D VPKVDEISEVDF
Subjt:  HETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDF

Query:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
        FSTTT++DSDF +SSQP SFA+AFP     SVEKA  PDASDL+RM EENG+S ENS+A   QAASG  SS+DD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSRDSDFGNSSQPNSFADAFPK----SVEKATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK

A0A6J1I4G5 uncharacterized protein LOC111469795; e value: 9.9e-206; % identity: 76.13
Query:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA++IP DLI QLQISLRN AK+SSYDPH+ SLPNLPSLHETIA+LDPSPPYLRCKHCKGRLLRDLKSF+CVFCG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV
        SLDLDGSEMVG +DLKESNRGKS E+FPLT+LLDL+IRWPESEK+G+SD T APSKSTLNLA VDLD YFSEE KDTT K SDE  PLN+Q       T 
Subjt:  SLDLDGSEMVGPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV

Query:  EDNVDLSLFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD
        +DNVDLSLF  V SS TA R  +HE+ DSFSGWEA+FQT +SATSH+NSKS+DPFA+SGV+IS SLE T G  +K RSGE E+TKNPSSSM +DWF QQD
Subjt:  EDNVDLSLFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWF-QQD

Query:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD
        DLWSSSNHETI  P+Q++QTG   DG+   TA+YSSSASVDWFQDDQ QGGS KKPDD S  KDDDSADAWDDFTSSTG+QG  D+  KDIVN+IVPKVD
Subjt:  DLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVD

Query:  EISEVDFFSTTTSRDSDFGNSSQPNSFADAFPK-----SVEKATRPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES
        EISE+DFFSTTTS+D +FGN SQPN F +AFP      S EKATRPDASDLSRM+EENG+SGENS+A K  QA+S PSS+ DD+QMMMAKMHDLSFMLES
Subjt:  EISEVDFFSTTTSRDSDFGNSSQPNSFADAFPK-----SVEKATRPDASDLSRMNEENGESGENSEAMKR-QAASGPSSSSDDIQMMMAKMHDLSFMLES

Query:  NLSVPPK
        +LS+PPK
Subjt:  NLSVPPK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G05090.1 dentin sialophosphoprotein-related; e value: 1.4e-47; % identity: 27.18
Query:  MAYEIPRDLINQLQISLRNRAKISSYDP-HNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLINQL++SLR  AK++S D   + S P+LP+  E IAELD S PYLRC++CKG+LLR ++S ICVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDP-HNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV
        L SL+LDGSEMV P+ +   S+RG  K+P  +   L+  LDLEI+W   E+K   D      K+ LNL G++LD YF E + D +     E  P+     
Subjt:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV

Query:  EDNVDLSLFDKV----------------------PSSVT---------------------------------------------------AARTTKHEND
        +D   LSLFD V                      P SV                                                    A RT+  ++D
Subjt:  EDNVDLSLFDKV----------------------PSSVT---------------------------------------------------AARTTKHEND

Query:  DSF------------------------------------------------------------------SGWEASFQTASSATSHDNSKSIDPFAVSGVN
        +SF                                                                  S W++ FQ+A    S       DPF  S V+
Subjt:  DSF------------------------------------------------------------------SGWEASFQTASSATSHDNSKSIDPFAVSGVN

Query:  ISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMPDQL--EQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKS
        +++ +++ FG        +  D+     S A DW  QDDL+ +   E       +  +  G ++ G      N +SS  +DW  DD  Q   +K  +   
Subjt:  ISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMPDQL--EQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKS

Query:  VIKDDDSADAWDDFTSS-----------------------------TGVQGPSDDSRKDIVNDIVPKVDEISEVDFFST---------------------
           +DD  D W+DF SS                              GV+  S D +++    ++  + +  E D F T                     
Subjt:  VIKDDDSADAWDDFTSS-----------------------------TGVQGPSDDSRKDIVNDIVPKVDEISEVDFFST---------------------

Query:  ----------------TTSRDSDFGNSSQPNSFADAFPKSVE----KATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSF
                          +RD DF + S+ + F+++          K      S L R ++ +G   +  + +     + P S SD  + +M++MHDLSF
Subjt:  ----------------TTSRDSDFGNSSQPNSFADAFPKSVE----KATRPDASDLSRMNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSF

Query:  MLESNLSVPP
        MLE+ LSVPP
Subjt:  MLESNLSVPP

AT4G20720.1 dentin sialophosphoprotein-related; e value: 3.0e-45; % identity: 41.7
Query:  MAYEIPRDLINQLQISLRNRAKISSYDP-HNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLINQL++SLR  AK++S D   + S P+LP+  E IAELD S PYLRC++CKG+LLR ++S ICVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAYEIPRDLINQLQISLRNRAKISSYDP-HNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV
        L SL+LDGSEMV P+ +   S+RG  K+P  +   L+  LDLEI+W   E+K   D      K+ LNL G++LD YF E + D +     E  P+     
Subjt:  LESLDLDGSEMVGPI-DLKESNRG--KSP--EQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV

Query:  EDNVDLSLFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESED
        +D   LSLFD V S      + +H+N   F   +A     SS    + S      A   V+ ++     F +   +R+   ED
Subjt:  EDNVDLSLFDKVPSSVTAARTTKHENDDSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESED

AT4G20720.1 dentin sialophosphoprotein-related; e value: 3.0e-05; % identity: 26.07
Query:  AGVDLDFYFSEEKKDTTSKASDEPP----PLNKQTVEDNV---DLSLFDKVPSSVTAARTTKHEN---DDSFSGWEASFQTASSATSHDNSKSIDPFAVS
        A  D D  F    ++ + K  D  P    P++     D+V      L    P+  + A  +K  +   DD F       QT  SA  HD ++      + 
Subjt:  AGVDLDFYFSEEKKDTTSKASDEPP----PLNKQTVEDNV---DLSLFDKVPSSVTAARTTKHEN---DDSFSGWEASFQTASSATSHDNSKSIDPFAVS

Query:  GVNISSSLETTF-GDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMP--DQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKP
        G N +SS++  + GD     + +    K P+    +D    +D  SS+N +T   P    +E +   I    A+  N     SV     D++Q       
Subjt:  GVNISSSLETTF-GDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMP--DQLEQTGILIDGRAAETANYSSSASVDWFQDDQRQGGSQKKP

Query:  DDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF-STTTSRDSDFGNSSQPNSFADAFPKSVE----KATRPDASDLSRMNE
         D    ++DD    WD FTSST +Q     S +       P  ++  E++ F     +RD DF + S+ + F+++          K      S L R ++
Subjt:  DDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFF-STTTSRDSDFGNSSQPNSFADAFPKSVE----KATRPDASDLSRMNE

Query:  ENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPP
         +G   +  + +     + P S SD  + +M++MHDLSFMLE+ LSVPP
Subjt:  ENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPP
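In the alignments above, the unlabeled middle line of each block marks every column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A rough sketch of how an identity percentage can be recomputed from those midlines follows. The function names are hypothetical, and the tool that produced the % identity values listed here may use a different denominator, so small differences from the listed values would not be surprising.

```python
def midline_stats(midline):
    """Return (identical, positive, columns) for one midline string.

    The caller is expected to strip the fixed-width left margin that lines up
    with the 'Query:  ' prefix, but to keep trailing spaces, since they are
    alignment columns too.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = midline.count("+")
    return identical, positive, len(midline)


def percent_identity(midlines):
    """Identity over all alignment columns of the given midline blocks."""
    stats = [midline_stats(m) for m in midlines]
    identical = sum(s[0] for s in stats)
    columns = sum(s[2] for s in stats)
    return 100.0 * identical / columns if columns else 0.0


# Two short slices taken from the first midline of the first GenBank hit.
slices = ["MAYEIPRDLI QLQISLRN AKISS", "YDPH+PSLPNLPS ++TIAELD"]
print(round(percent_identity(slices), 2))
# prints: 87.23
```

Working from the midline rather than comparing the `Query:` and `Subjt:` lines is deliberate: in this listing the two labeled lines of each block are printed identically, so only the midline carries the match information.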


Sequences
CDS sequence
ATGGCGTATGAAATCCCTCGCGATCTGATCAATCAACTTCAGATCTCTCTTCGAAATAGGGCCAAAATCTCCTCCTACGACCCTCACAATCCTTCACTTCCAAATCTACC
ATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCCCCGCCTTATCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCATTTATTTGCGTTTTCT
GCGGCAGGGAACAGAACACCGACGTCCCTCCGGACCCCATTAATTTCAAGAATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGATGGATCGGAGATGGTG
GGACCAATCGATTTGAAGGAATCAAACCGGGGAAAATCACCAGAGCAATTTCCCCTGACGAATCTTTTAGATTTAGAGATTAGATGGCCTGAATCTGAAAAGAAAGGGAT
CTCAGACGAAACCCCGGCTCCAAGTAAAAGTACCTTGAATTTGGCTGGAGTTGATCTTGACTTCTACTTCTCTGAGGAAAAAAAAGACACTACTTCAAAAGCATCTGATG
AGCCACCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAGTTTATTTGATAAGGTTCCATCTTCCGTGACGGCAGCAAGGACCACTAAACATGAGAATGAT
GATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGACTGCTAGTTCTGCAACTTCTCATGATAATTCTAAATCAATTGATCCTTTTGCTGTTTCTGGGGTCAATATATCTTC
CTCTTTGGAAACAACGTTTGGAGACCATAGCAAGTCCAGAAGTGGAGAATCAGAAGATACTAAAAATCCCTCTTCATCAATGGCCAATGACTGGTTTCAACAAGATGATT
TATGGAGTAGTTCTAATCACGAAACGATTCGCATGCCAGATCAGCTTGAACAAACTGGAATTTTAATTGATGGTAGAGCTGCAGAAACTGCTAATTATTCTTCATCAGCA
AGCGTTGATTGGTTTCAAGATGATCAGCGGCAAGGAGGGAGCCAAAAGAAACCTGATGATAAAAGTGTTATTAAAGATGATGATTCAGCTGATGCTTGGGATGATTTTAC
TAGCTCAACTGGTGTGCAAGGCCCCTCTGATGATTCTAGGAAAGACATTGTGAATGACATTGTGCCAAAGGTGGATGAGATATCAGAAGTAGATTTCTTCAGCACAACCA
CCTCAAGGGATAGTGATTTTGGAAACTCTTCTCAGCCAAATTCATTTGCAGATGCATTCCCCAAATCCGTAGAAAAAGCAACGCGGCCAGATGCTTCTGATTTAAGCAGG
ATGAATGAAGAGAATGGAGAAAGTGGAGAAAATTCTGAAGCTATGAAGCGTCAAGCTGCATCAGGTCCTAGTTCAAGTTCTGATGATATACAGATGATGATGGCGAAGAT
GCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAGTCCCCCCAAAGTGA
mRNA sequence
TAAACTCCCATAAAGTTAAAAGTACTTGAAATGGCGAGTTCTTGGTTGAATGGAACGCAGCGTTTTGCGCATTTTCCCGTGTACCGGAGAAGCTATCTTCCCCGTCTGGG
AGAAAGTGGATTTTTCATTTTGGAAATCCACACAGAAGAAGAAGAAGAAGAAGAAGAAGAAGGAGAACTCCGTCGCTCACACAACCTGAAGATACTGCGAACCTCTTCAA
ACTCAATCAATGGCGTATGAAATCCCTCGCGATCTGATCAATCAACTTCAGATCTCTCTTCGAAATAGGGCCAAAATCTCCTCCTACGACCCTCACAATCCTTCACTTCC
AAATCTACCATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCCCCGCCTTATCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCATTTATTT
GCGTTTTCTGCGGCAGGGAACAGAACACCGACGTCCCTCCGGACCCCATTAATTTCAAGAATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGATGGATCG
GAGATGGTGGGACCAATCGATTTGAAGGAATCAAACCGGGGAAAATCACCAGAGCAATTTCCCCTGACGAATCTTTTAGATTTAGAGATTAGATGGCCTGAATCTGAAAA
GAAAGGGATCTCAGACGAAACCCCGGCTCCAAGTAAAAGTACCTTGAATTTGGCTGGAGTTGATCTTGACTTCTACTTCTCTGAGGAAAAAAAAGACACTACTTCAAAAG
CATCTGATGAGCCACCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAGTTTATTTGATAAGGTTCCATCTTCCGTGACGGCAGCAAGGACCACTAAACAT
GAGAATGATGATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGACTGCTAGTTCTGCAACTTCTCATGATAATTCTAAATCAATTGATCCTTTTGCTGTTTCTGGGGTCAA
TATATCTTCCTCTTTGGAAACAACGTTTGGAGACCATAGCAAGTCCAGAAGTGGAGAATCAGAAGATACTAAAAATCCCTCTTCATCAATGGCCAATGACTGGTTTCAAC
AAGATGATTTATGGAGTAGTTCTAATCACGAAACGATTCGCATGCCAGATCAGCTTGAACAAACTGGAATTTTAATTGATGGTAGAGCTGCAGAAACTGCTAATTATTCT
TCATCAGCAAGCGTTGATTGGTTTCAAGATGATCAGCGGCAAGGAGGGAGCCAAAAGAAACCTGATGATAAAAGTGTTATTAAAGATGATGATTCAGCTGATGCTTGGGA
TGATTTTACTAGCTCAACTGGTGTGCAAGGCCCCTCTGATGATTCTAGGAAAGACATTGTGAATGACATTGTGCCAAAGGTGGATGAGATATCAGAAGTAGATTTCTTCA
GCACAACCACCTCAAGGGATAGTGATTTTGGAAACTCTTCTCAGCCAAATTCATTTGCAGATGCATTCCCCAAATCCGTAGAAAAAGCAACGCGGCCAGATGCTTCTGAT
TTAAGCAGGATGAATGAAGAGAATGGAGAAAGTGGAGAAAATTCTGAAGCTATGAAGCGTCAAGCTGCATCAGGTCCTAGTTCAAGTTCTGATGATATACAGATGATGAT
GGCGAAGATGCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAGTCCCCCCAAAGTGATGCATCTTTAATTCTTCTTCTGAAGCACTCTGCCACTGAGCTTTTCTTG
TATTTTTCTTTCCCTCTTTCTTTTTAAATCTGTAGCAGTATAGTGTTAGTTTAGTTATTACGGAATGCATTCTTTGATTTTATAAAATGGCCATATGCCGTTGAAATCCA
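The mRNA sequence contains the CDS plus untranslated regions on either side. A small sketch for measuring those regions is given below; it searches for the complete CDS instead of the first ATG, because the mRNA listed here has an ATG upstream of the annotated start (`...GAAATGGCGAGTTC...` in its first line). The function names are hypothetical, and the sequences in the example are toy strings, not the record's data.

```python
def join_wrapped(text):
    """Join a sequence that was wrapped over several lines into one string."""
    return "".join(text.split()).upper()


def locate_cds(mrna, cds):
    """Return the lengths of the 5' UTR, CDS and 3' UTR within an mRNA.

    Raises ValueError if the CDS is not a contiguous substring of the mRNA.
    """
    mrna, cds = join_wrapped(mrna), join_wrapped(cds)
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return {
        "five_prime_utr": start,
        "cds": len(cds),
        "three_prime_utr": len(mrna) - end,
    }


# Toy example: 2-base UTRs around a 9-base CDS.
print(locate_cds("GGATGAAA\nTGACC", "ATGAAATGA"))
# prints: {'five_prime_utr': 2, 'cds': 9, 'three_prime_utr': 2}
```

To apply it to this record, pass the full text of the mRNA and CDS blocks; `join_wrapped` removes the line breaks.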
Protein sequence
MAYEIPRDLINQLQISLRNRAKISSYDPHNPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMV
GPIDLKESNRGKSPEQFPLTNLLDLEIRWPESEKKGISDETPAPSKSTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLSLFDKVPSSVTAARTTKHEND
DSFSGWEASFQTASSATSHDNSKSIDPFAVSGVNISSSLETTFGDHSKSRSGESEDTKNPSSSMANDWFQQDDLWSSSNHETIRMPDQLEQTGILIDGRAAETANYSSSA
SVDWFQDDQRQGGSQKKPDDKSVIKDDDSADAWDDFTSSTGVQGPSDDSRKDIVNDIVPKVDEISEVDFFSTTTSRDSDFGNSSQPNSFADAFPKSVEKATRPDASDLSR
MNEENGESGENSEAMKRQAASGPSSSSDDIQMMMAKMHDLSFMLESNLSVPPK
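The protein sequence is the conceptual translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code applies, the translation can be reproduced without external libraries. The names `CODON_TABLE` and `translate` are hypothetical; codons containing ambiguous bases are rendered as `X`.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGGCGTATGAAATCCCTCGCGATCTGATC"))  # prints: MAYEIPRDLI
print(translate("GTCCCCCCAAAGTGA"))                 # prints: VPPK
```

Both fragments agree with the start (`MAYEIPRDLI`) and end (`VPPK`) of the listed protein sequence.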