; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi10G001060 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi10G001060
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDentin sialophosphoprotein
Genome locationchr10:1630552..1633347
RNA-Seq ExpressionLsi10G001060
SyntenyLsi10G001060
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034793.1 dentin sialophosphoprotein [Cucumis melo var. makuwa]2.6e-23284.74Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA+EIPRDLIKQLQISLRNEAKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKS ICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN
        SLDLDGSEMVGP+DLKESNRGKSPEQFPL +LLDLEIRWPES+K GI DETPAPSK+TLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DL+
Subjt:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN

Query:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN
        LFDK PS+E+AT TTKHESDDSFSGWEASFQTASSATS DNSKS+DPF VSGVN+SSS E TFGDQNKSRSG TEDT +PSSS TNDWFQQQDDLWSSSN
Subjt:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN

Query:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF
        H T+HMPDQVEQTG LIDGRA ETANYSSSA+VDWFQDDQ QGGSQKKPDDKSVFKDDDSADAWD+FTSSTGVQGPSDNSRKDIV D VPKVDEISEVDF
Subjt:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF

Query:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK
        FSTTT+KDSDFR+SSQP SFAEAFP     S+EK  WPDASDL+RM EENG++ ENS+A + QAASG  SSTDD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK

XP_008455912.1 PREDICTED: uncharacterized protein LOC103495983 [Cucumis melo]9.8e-23284.34Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA+EIPRDLIKQLQISLRNEAKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKS ICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN
        SLDLDGSEMVGP+DLKESNRGKSPEQFPL +LLDLEIRWPES+K GI+DETPAPSK+TLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DL+
Subjt:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN

Query:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN
        LFDK PS+E+AT TTKHESDDSFSGWEASFQTASSATS DNSKS+DPF VSGVN+SSS E TFGDQNKSRSG TEDT +PSSS TNDWFQQQDDLWSSSN
Subjt:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN

Query:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF
        H T+HMPDQVEQTG LIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSVFKDDDSAD WD+FTSSTGVQGPSDNSRKDIV D VPKVDEISEVDF
Subjt:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF

Query:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK
        FSTTT+KDSDFR+SSQP SFAEAFP     S+EK  WPDASDL+RM EENG++ ENS+A   QAASG  SSTDD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK

XP_011649988.1 uncharacterized protein LOC101209977 [Cucumis sativus]1.5e-23284.94Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA+EIPRDLIKQLQISLRNEA ISSYDPH PSLPNLPS +ETIA+LDPSPPYLRCKHCKGRLLRDLKS ICVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN
        SLDLDGSEMVG +DLKESNRGKSPEQFPL +LLDLEIRWPESEKKGISDETPAPSK+TLNLAGVDL  YF+EEK DTTSKASD  PP +K+TVEDN DL+
Subjt:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN

Query:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN
        LFDK PS ETAT TTKHESDDSFSGWEASFQ ASSAT  DNSKSVDPF VSGVN+SSSLETTFG+QNKS SG TEDT NPSSS TNDWFQQQDDLWSSSN
Subjt:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN

Query:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF
        H TIHMPDQVEQTG LIDGR  ETANYSSSA+VDWFQDDQ QG SQKKPDDKSVFKDD SADAWDDFTSSTGVQGP DNS+KDIVND VPKVDEISEVDF
Subjt:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF

Query:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK
        FST T+KDSDFR+SSQP SFAEAFP     S+EK  WPDASDLSRMSEENG+T ENS+A++ QAASGPSSSTDD +M+M KMHDLSFMLES LS+PPK
Subjt:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK

XP_022970990.1 uncharacterized protein LOC111469795 [Cucurbita maxima]9.2e-20676.33Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAF+IP DLIKQLQISLRNEAK+SSYDPHD SLPNLPSLHETIA+LDPSPPYLRCKHCKGRLLRDLKS +CVFCG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV
        SLDLDGSEMVG +DLKESNRGKS E+FPL +LLDL+IRWPESEK+G+SD T APSK+TLNLA VDLD YFSEE KDTT K SDE  PLN+Q       T 
Subjt:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV

Query:  EDNVDLNLFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQD
        +DNVDL+LF  V S+ETAT   +HES DSFSGWEA+FQT +SATSH+NSKSVDPFA+SGV++S SLE T G QNK RSG  E+T NPSSS+T+DWFQQQD
Subjt:  EDNVDLNLFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQD

Query:  DLWSSSNHGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVD
        DLWSSSNH TI  P+QV+QTG   DG+   TA+YSSSASVDWFQDDQ QGGS KKPDD S FKDDDSADAWDDFTSSTG+QG  DN  KDIVN++VPKVD
Subjt:  DLWSSSNHGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVD

Query:  EISEVDFFSTTTSKDSDFRNSSQPNSFAEAFPK-----SIEKVAWPDASDLSRMSEENGETGENSEAMKS-QAASGPSSSTDDVQMVMAKMHDLSFMLES
        EISE+DFFSTTTSKD +F N SQPN F EAFP      S EK   PDASDLSRMSEENG++GENS+A K  QA+S PSS+ DDVQM+MAKMHDLSFMLES
Subjt:  EISEVDFFSTTTSKDSDFRNSSQPNSFAEAFPK-----SIEKVAWPDASDLSRMSEENGETGENSEAMKS-QAASGPSSSTDDVQMVMAKMHDLSFMLES

Query:  NLSVPPK
        +LS+PPK
Subjt:  NLSVPPK

XP_038902680.1 uncharacterized protein LOC120089318 [Benincasa hispida]9.1e-23887.55Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA+EIP DLIKQLQISLRN AKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHC GRLLRDLKS +CVFCGREQNTDVPPDPINFKNTIACRWLLE
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN
        SLDLDGSEMV P++LKESNRGKSPEQFPL +LLDLEIRWPESEKKGISDETPAPSK+ LNLA VDLD+YFSEEKKDTTSKAS+EPPPLNKQTVEDNVDL+
Subjt:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN

Query:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN
        LFD VPS+ETAT TTKHES DSFSGWEASFQ ASSAT HDNSKSVDPFAVS VN+SSSLETTFGDQNKSRSG T+DT NPSSS+TNDWFQQQ DLWSSSN
Subjt:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN

Query:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF
        H TI MPDQVEQTG +IDGRAAETANYSSSASVDWFQ DQRQGGSQKKPDDKS FK D SADAWDDFTSSTGV GPSDNSRKDIVNDVV KVDEISEVDF
Subjt:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF

Query:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK
        FSTT   +SDFRNSSQPNSFAEAFP     SI K  W DASDLSRMSEE+GETGENS+A++ Q+ASGPSSSTDDVQM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK

TrEMBL top hitse value%identityAlignment
A0A0A0LMS7 Uncharacterized protein7.3e-23384.94Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA+EIPRDLIKQLQISLRNEA ISSYDPH PSLPNLPS +ETIA+LDPSPPYLRCKHCKGRLLRDLKS ICVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN
        SLDLDGSEMVG +DLKESNRGKSPEQFPL +LLDLEIRWPESEKKGISDETPAPSK+TLNLAGVDL  YF+EEK DTTSKASD  PP +K+TVEDN DL+
Subjt:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN

Query:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN
        LFDK PS ETAT TTKHESDDSFSGWEASFQ ASSAT  DNSKSVDPF VSGVN+SSSLETTFG+QNKS SG TEDT NPSSS TNDWFQQQDDLWSSSN
Subjt:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN

Query:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF
        H TIHMPDQVEQTG LIDGR  ETANYSSSA+VDWFQDDQ QG SQKKPDDKSVFKDD SADAWDDFTSSTGVQGP DNS+KDIVND VPKVDEISEVDF
Subjt:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF

Query:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK
        FST T+KDSDFR+SSQP SFAEAFP     S+EK  WPDASDLSRMSEENG+T ENS+A++ QAASGPSSSTDD +M+M KMHDLSFMLES LS+PPK
Subjt:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK

A0A1S3C2P9 uncharacterized protein LOC1034959834.7e-23284.34Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA+EIPRDLIKQLQISLRNEAKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKS ICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN
        SLDLDGSEMVGP+DLKESNRGKSPEQFPL +LLDLEIRWPES+K GI+DETPAPSK+TLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DL+
Subjt:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN

Query:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN
        LFDK PS+E+AT TTKHESDDSFSGWEASFQTASSATS DNSKS+DPF VSGVN+SSS E TFGDQNKSRSG TEDT +PSSS TNDWFQQQDDLWSSSN
Subjt:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN

Query:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF
        H T+HMPDQVEQTG LIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSVFKDDDSAD WD+FTSSTGVQGPSDNSRKDIV D VPKVDEISEVDF
Subjt:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF

Query:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK
        FSTTT+KDSDFR+SSQP SFAEAFP     S+EK  WPDASDL+RM EENG++ ENS+A   QAASG  SSTDD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK

A0A5A7SW96 Dentin sialophosphoprotein1.2e-23284.74Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA+EIPRDLIKQLQISLRNEAKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKS ICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN
        SLDLDGSEMVGP+DLKESNRGKSPEQFPL +LLDLEIRWPES+K GI DETPAPSK+TLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DL+
Subjt:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN

Query:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN
        LFDK PS+E+AT TTKHESDDSFSGWEASFQTASSATS DNSKS+DPF VSGVN+SSS E TFGDQNKSRSG TEDT +PSSS TNDWFQQQDDLWSSSN
Subjt:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN

Query:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF
        H T+HMPDQVEQTG LIDGRA ETANYSSSA+VDWFQDDQ QGGSQKKPDDKSVFKDDDSADAWD+FTSSTGVQGPSDNSRKDIV D VPKVDEISEVDF
Subjt:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF

Query:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK
        FSTTT+KDSDFR+SSQP SFAEAFP     S+EK  WPDASDL+RM EENG++ ENS+A + QAASG  SSTDD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK

A0A5D3CEG4 Dentin sialophosphoprotein4.7e-23284.34Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA+EIPRDLIKQLQISLRNEAKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHCKGRLLRDLKS ICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN
        SLDLDGSEMVGP+DLKESNRGKSPEQFPL +LLDLEIRWPES+K GI+DETPAPSK+TLNLAGVDL +YF+EEK DTTSKASD  PP +KQTVEDN DL+
Subjt:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLN

Query:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN
        LFDK PS+E+AT TTKHESDDSFSGWEASFQTASSATS DNSKS+DPF VSGVN+SSS E TFGDQNKSRSG TEDT +PSSS TNDWFQQQDDLWSSSN
Subjt:  LFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSN

Query:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF
        H T+HMPDQVEQTG LIDGRA ET NYSSSA+VDWFQDDQ QGGSQKKPDDKSVFKDDDSAD WD+FTSSTGVQGPSDNSRKDIV D VPKVDEISEVDF
Subjt:  HGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDF

Query:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK
        FSTTT+KDSDFR+SSQP SFAEAFP     S+EK  WPDASDL+RM EENG++ ENS+A   QAASG  SSTDD QM+M KMHDLSFMLESNLS+PPK
Subjt:  FSTTTSKDSDFRNSSQPNSFAEAFPK----SIEKVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK

A0A6J1I4G5 uncharacterized protein LOC1114697954.5e-20676.33Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAF+IP DLIKQLQISLRNEAK+SSYDPHD SLPNLPSLHETIA+LDPSPPYLRCKHCKGRLLRDLKS +CVFCG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV
        SLDLDGSEMVG +DLKESNRGKS E+FPL +LLDL+IRWPESEK+G+SD T APSK+TLNLA VDLD YFSEE KDTT K SDE  PLN+Q       T 
Subjt:  SLDLDGSEMVGPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQ-------TV

Query:  EDNVDLNLFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQD
        +DNVDL+LF  V S+ETAT   +HES DSFSGWEA+FQT +SATSH+NSKSVDPFA+SGV++S SLE T G QNK RSG  E+T NPSSS+T+DWFQQQD
Subjt:  EDNVDLNLFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQD

Query:  DLWSSSNHGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVD
        DLWSSSNH TI  P+QV+QTG   DG+   TA+YSSSASVDWFQDDQ QGGS KKPDD S FKDDDSADAWDDFTSSTG+QG  DN  KDIVN++VPKVD
Subjt:  DLWSSSNHGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVD

Query:  EISEVDFFSTTTSKDSDFRNSSQPNSFAEAFPK-----SIEKVAWPDASDLSRMSEENGETGENSEAMKS-QAASGPSSSTDDVQMVMAKMHDLSFMLES
        EISE+DFFSTTTSKD +F N SQPN F EAFP      S EK   PDASDLSRMSEENG++GENS+A K  QA+S PSS+ DDVQM+MAKMHDLSFMLES
Subjt:  EISEVDFFSTTTSKDSDFRNSSQPNSFAEAFPK-----SIEKVAWPDASDLSRMSEENGETGENSEAMKS-QAASGPSSSTDDVQMVMAKMHDLSFMLES

Query:  NLSVPPK
        +LS+PPK
Subjt:  NLSVPPK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05090.1 dentin sialophosphoprotein-related2.7e-4626.99Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLI QL++SLR EAK++S D   D S P+LP+  E IAELD S PYLRC++CKG+LLR ++S+ICVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVGPV-DLKESNRG--KSP--EQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV
        L SL+LDGSEMV P+ +   S+RG  K+P  +   L   LDLEI+W   E+K   D      KN LNL G++LD YF E + D +     E  P+     
Subjt:  LESLDLDGSEMVGPV-DLKESNRG--KSP--EQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV

Query:  EDNVDLNLFDKVPS-----------------------------------------------------TETATTTTKHESDDSF-----------------
        +D   L+LFD V S                                                      E A  T+  + D+SF                 
Subjt:  EDNVDLNLFDKVPS-----------------------------------------------------TETATTTTKHESDDSF-----------------

Query:  ---------------------------------------------------------------------SGWEASFQTASSATSHDNSKSVDPFAVSGVN
                                                                             S W++ FQ+A    S       DPF  S V+
Subjt:  ---------------------------------------------------------------------SGWEASFQTASSATSHDNSKSVDPFAVSGVN

Query:  MSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLW------SSSNHGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKK
        +++ +++ FG           D++    S   DW   QDDL+      + +N   +H  ++    G ++ G      N +SS  +DW  DD  Q   +K 
Subjt:  MSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLW------SSSNHGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKK

Query:  PDDKSVFKDDDSADAWDDFTSS-----------------------------TGVQGPSDNSRKDIVNDVVPKVDEISEVDFFST----------------
         +      +DD  D W+DF SS                              GV+  S + +++    V+  + +  E D F T                
Subjt:  PDDKSVFKDDDSADAWDDFTSS-----------------------------TGVQGPSDNSRKDIVNDVVPKVDEISEVDFFST----------------

Query:  ---------------------TTSKDSDFRNSSQPNSFAEAFPKSIE----KVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKM
                               ++D DF + S+ + F+E+          KV     S L R S+ +G   +  + +     + P S +D  + +M++M
Subjt:  ---------------------TTSKDSDFRNSSQPNSFAEAFPKSIE----KVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKM

Query:  HDLSFMLESNLSVPP
        HDLSFMLE+ LSVPP
Subjt:  HDLSFMLESNLSVPP

AT4G20720.1 dentin sialophosphoprotein-related1.1e-4233.26Show/hide
Query:  MAFEIPRDLIKQLQISLRNEAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLI QL++SLR EAK++S D   D S P+LP+  E IAELD S PYLRC++CKG+LLR ++S+ICVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAFEIPRDLIKQLQISLRNEAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVGPV-DLKESNRG--KSP--EQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV
        L SL+LDGSEMV P+ +   S+RG  K+P  +   L   LDLEI+W   E+K   D      KN LNL G++LD YF E + D +     E  P+     
Subjt:  LESLDLDGSEMVGPV-DLKESNRG--KSP--EQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTV

Query:  EDNVDLNLFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQD
        +D   L+LFD V S +    + +H++   F   +A     SS    + S      A   V+ ++     F ++  +R+   ED N          F+ ++
Subjt:  EDNVDLNLFDKVPSTETATTTTKHESDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQD

Query:  DLWSSSNHGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDK--SVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPK
        D   +S+        +V+++    +G+ A+  + S        +DD+  G  + K D +  S  K+D+S   ++    +       +N            
Subjt:  DLWSSSNHGTIHMPDQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDK--SVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPK

Query:  VDEISEVD--FFSTTTSKDSDFRNSSQ
          ++   D    + ++  DSDF+++ Q
Subjt:  VDEISEVD--FFSTTTSKDSDFRNSSQ

AT4G20720.1 dentin sialophosphoprotein-related2.0e-0424.44Show/hide
Query:  GPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPP----PLNKQTVEDNV-----DLNL
        G  D + ++  K  E F       L      ++ K   D+  A S         D D  F    ++ + K  D  P    P++     D+V     DL L
Subjt:  GPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPP----PLNKQTVEDNV-----DLNL

Query:  FDKVPSTETATTTTKHE--SDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSS
        + +   + TA  +   +   DD F       QT  SA  HD ++      + G N +SS++  +   +  ++   +      + + +D     +D  SS+
Subjt:  FDKVPSTETATTTTKHE--SDDSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSS

Query:  NHGTIHMP--DQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISE
        N  T + P    +E +   I    A+  N     SV     D++Q        D    ++DD    WD FTSST +Q     S +       P  ++  E
Subjt:  NHGTIHMP--DQVEQTGSLIDGRAAETANYSSSASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISE

Query:  VDFF-STTTSKDSDFRNSSQPNSFAEAFPKSIE----KVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVP
        ++ F     ++D DF + S+ + F+E+          KV     S L R S+ +G   +  + +     + P S +D  + +M++MHDLSFMLE+ LSVP
Subjt:  VDFF-STTTSKDSDFRNSSQPNSFAEAFPKSIE----KVAWPDASDLSRMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVP

Query:  P
        P
Subjt:  P


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTTTGAAATCCCTCGCGATCTGATCAAACAACTTCAGATCTCTCTTCGAAATGAGGCCAAAATCTCCTCCTACGACCCTCACGATCCTTCACTTCCAAATCTACC
ATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCGCCGCCTTATCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCAGTTATTTGCGTTTTCT
GCGGCAGGGAACAGAACACGGACGTTCCTCCGGACCCCATTAATTTCAAGAATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGATGGATCGGAGATGGTG
GGACCAGTCGATTTGAAGGAATCAAACCGGGGAAAATCACCAGAGCAATTTCCCCTGATTAATCTTTTAGATTTAGAGATTAGATGGCCTGAATCTGAAAAGAAAGGGAT
CTCAGACGAGACCCCGGCTCCAAGTAAAAATACCTTGAATTTGGCTGGAGTTGATCTTGACTTCTACTTCTCTGAGGAAAAAAAAGACACTACTTCAAAAGCATCTGATG
AGCCTCCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAATTTATTTGATAAGGTTCCATCTACTGAGACGGCAACAACGACCACTAAACATGAGAGTGAT
GATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGACTGCTAGTTCTGCAACTTCTCATGATAATTCTAAATCAGTTGATCCTTTTGCTGTTTCTGGGGTCAATATGTCTTC
CTCTTTGGAAACAACGTTTGGAGACCAAAACAAATCCAGAAGTGGAGGAACAGAAGATACTAATAATCCCTCTTCATCAATAACCAATGACTGGTTTCAACAACAAGATG
ATTTATGGAGTAGTTCTAATCACGGAACGATTCACATGCCAGACCAGGTTGAGCAAACTGGAAGTTTAATTGATGGTAGAGCTGCAGAAACTGCTAATTATTCTTCATCA
GCAAGCGTTGATTGGTTTCAAGATGATCAGCGGCAAGGAGGGAGCCAAAAGAAACCTGATGATAAAAGTGTTTTTAAAGATGACGATTCAGCTGATGCATGGGATGATTT
TACTAGTTCAACTGGTGTGCAAGGCCCCTCTGATAATTCTAGGAAAGATATTGTGAATGACGTTGTGCCCAAGGTGGATGAGATATCAGAAGTAGATTTCTTCAGCACAA
CCACCTCAAAGGATAGTGATTTTAGAAACTCTTCTCAGCCAAATTCATTTGCAGAAGCATTCCCCAAATCCATAGAAAAAGTAGCATGGCCAGATGCTTCTGATTTAAGC
AGGATGAGTGAAGAGAATGGAGAAACTGGAGAAAATTCCGAAGCTATGAAGAGTCAAGCTGCATCAGGTCCTAGTTCAAGCACGGATGATGTACAGATGGTGATGGCCAA
GATGCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAGTCCCCCCAAAGTGA
mRNA sequenceShow/hide mRNA sequence
CGGAACGCAGCGTTTTGGGCATCTTTCAATCAATTTTCCGTGTACAGGAAAAGCTATCTTCCCCGTCTGGGGGAAAGTCGATTTTCATTTTCGAAATCCACACAGAAGGA
GGTGAAGGAGAACTCTGTCGCTCACACAATCTGGAGATACTGTGGAAAACCTTTAAACTCAATCAATGGCGTTTGAAATCCCTCGCGATCTGATCAAACAACTTCAGATC
TCTCTTCGAAATGAGGCCAAAATCTCCTCCTACGACCCTCACGATCCTTCACTTCCAAATCTACCATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCGCCGCCTTA
TCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCAGTTATTTGCGTTTTCTGCGGCAGGGAACAGAACACGGACGTTCCTCCGGACCCCATTAATT
TCAAGAATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGATGGATCGGAGATGGTGGGACCAGTCGATTTGAAGGAATCAAACCGGGGAAAATCACCAGAG
CAATTTCCCCTGATTAATCTTTTAGATTTAGAGATTAGATGGCCTGAATCTGAAAAGAAAGGGATCTCAGACGAGACCCCGGCTCCAAGTAAAAATACCTTGAATTTGGC
TGGAGTTGATCTTGACTTCTACTTCTCTGAGGAAAAAAAAGACACTACTTCAAAAGCATCTGATGAGCCTCCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATC
TTAATTTATTTGATAAGGTTCCATCTACTGAGACGGCAACAACGACCACTAAACATGAGAGTGATGATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGACTGCTAGTTCT
GCAACTTCTCATGATAATTCTAAATCAGTTGATCCTTTTGCTGTTTCTGGGGTCAATATGTCTTCCTCTTTGGAAACAACGTTTGGAGACCAAAACAAATCCAGAAGTGG
AGGAACAGAAGATACTAATAATCCCTCTTCATCAATAACCAATGACTGGTTTCAACAACAAGATGATTTATGGAGTAGTTCTAATCACGGAACGATTCACATGCCAGACC
AGGTTGAGCAAACTGGAAGTTTAATTGATGGTAGAGCTGCAGAAACTGCTAATTATTCTTCATCAGCAAGCGTTGATTGGTTTCAAGATGATCAGCGGCAAGGAGGGAGC
CAAAAGAAACCTGATGATAAAAGTGTTTTTAAAGATGACGATTCAGCTGATGCATGGGATGATTTTACTAGTTCAACTGGTGTGCAAGGCCCCTCTGATAATTCTAGGAA
AGATATTGTGAATGACGTTGTGCCCAAGGTGGATGAGATATCAGAAGTAGATTTCTTCAGCACAACCACCTCAAAGGATAGTGATTTTAGAAACTCTTCTCAGCCAAATT
CATTTGCAGAAGCATTCCCCAAATCCATAGAAAAAGTAGCATGGCCAGATGCTTCTGATTTAAGCAGGATGAGTGAAGAGAATGGAGAAACTGGAGAAAATTCCGAAGCT
ATGAAGAGTCAAGCTGCATCAGGTCCTAGTTCAAGCACGGATGATGTACAGATGGTGATGGCCAAGATGCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAGTCCC
CCCAAAGTGATGCATCTTTAGTTCTTCTTCTGAAGCACTCTGCCACTGAGCTTTTCTTGTATTTTTCTTTCCCTCTTTCATTTTAAATCTGTAGCAGCTTAGTGTTAGTT
TAGTTATTACGGAATGCATTCTTTGATTTTATAAAATGGCCATATGCCGTTGAAATCCATTGCAAGCATAACTAACATGCTACCCATTCACATGACTGACATACTCTTTA
CAAACTTACACACCATTGAACAAAAAGTTTCTGTACTAGATGGCAAAATATTATTC
Protein sequenceShow/hide protein sequence
MAFEIPRDLIKQLQISLRNEAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCKGRLLRDLKSVICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMV
GPVDLKESNRGKSPEQFPLINLLDLEIRWPESEKKGISDETPAPSKNTLNLAGVDLDFYFSEEKKDTTSKASDEPPPLNKQTVEDNVDLNLFDKVPSTETATTTTKHESD
DSFSGWEASFQTASSATSHDNSKSVDPFAVSGVNMSSSLETTFGDQNKSRSGGTEDTNNPSSSITNDWFQQQDDLWSSSNHGTIHMPDQVEQTGSLIDGRAAETANYSSS
ASVDWFQDDQRQGGSQKKPDDKSVFKDDDSADAWDDFTSSTGVQGPSDNSRKDIVNDVVPKVDEISEVDFFSTTTSKDSDFRNSSQPNSFAEAFPKSIEKVAWPDASDLS
RMSEENGETGENSEAMKSQAASGPSSSTDDVQMVMAKMHDLSFMLESNLSVPPK