CuGenDBv2

Tan0013144 (gene) of Snake gourd v1 genome

Gene ID: Tan0013144
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Dentin sialophosphoprotein
Genome location: LG05:7300093..7302342
RNA-Seq Expression: Tan0013144
Synteny: Tan0013144
Gene Ontology terms: NA
InterPro domains: NA
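
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the string can be split into its parts and the span length computed. The helper name `parse_location` is hypothetical, and the length assumes 1-based, inclusive coordinates, which the page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # linkage group / chromosome label, e.g. "LG05"
    start: int   # first position of the feature
    end: int     # last position of the feature

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG05:7300093..7302342")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2250 positions on LG05.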


Homology
GenBank top hits | e value | % identity | Alignment
KAG6604870.1 hypothetical protein SDJN03_02187, partial [Cucurbita argyrosperma subsp. sororia] | e value: 5.0e-231 | % identity: 83
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK
        MAY+IP+DLIKQLQISLRN AK+SSY+P D SLPNLPSLHETIA+LDPSP YLRCKHCKGRLLRDLKSFICV CG+EQNTEV PDPINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK

Query:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF
        SLDLDGSEMVGH+DLKES+RGKS EEFPLT+LLDL+IRWPESE+RGLSD T A SKSTLNLA VDLD YFSEE KDT   VSDE  PLN+QIDGSESKTF
Subjt:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF

Query:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD
        +DNVDLSLFGNVQS ETATRI +HES DSFS WEANFQT NSAT+H+NSKSVDPFA+S VDIS  LE TSGHQNK RSGEIEETKNPSSSMT+DWFQQQD
Subjt:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD

Query:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD
        DLWSSSNHETI TPEQV QTG   DG+TVGTADYS SASVDWFQDDQWQGGSKKKPDD S F+DDDSADAWDDFTSST +QG LDN  KDIVN++VPKV 
Subjt:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD

Query:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH
        EISEIDFF TTTSKD NFGNFS PN  VE+FPN NGTSEEKATRPDASDLS MSEENGK+GE+SK  KE +++S PSSN DDVQ +MAKMHDLSFML+SH
Subjt:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH

Query:  LSIPPK
        LSIPPK
Subjt:  LSIPPK

XP_022947158.1 uncharacterized protein LOC111451112 [Cucurbita moschata] | e value: 1.7e-231 | % identity: 83.2
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK
        MAY+IP+DLIKQLQISLRN AK+SSY+P D SLPNLPSLHETIA+LDPSP YLRCKHCKGRLLRDLKSFICV CG+EQNTEVPPDPINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK

Query:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF
        SL LDGSEMVG++DLKES+RGKS EEFPLT+LLDL+IRWPESEKRGLSD T A SKSTLNLA VDLD YFSEE KDT  KVSDE  PLN+QIDGSESKTF
Subjt:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF

Query:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD
        +DNVDLSLFGNVQSS+TATRI +HES DSFS WEANFQT NSAT+H+NSKSVDPFA+S VDIS +LE TSGHQNK RSGEIEETKNPSSS+T+DWFQQQD
Subjt:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD

Query:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD
        DLWSSSNHETI TPEQV QTG   DG+ VGTADYSSSASVDWFQDDQWQGGSKKKPDD S FEDDDSADAWDDFTSST +QG  DN  KDIVN+IVPKV 
Subjt:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD

Query:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH
        EISEIDFF TTTSKD NFGNFS PN  VE+FPN NGTSEEKATRPDASDLS MSEENGK+GE+SK  KE +++S PSSN DDVQ +MAKMHDLSFML+SH
Subjt:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH

Query:  LSIPPK
        LSIPPK
Subjt:  LSIPPK

XP_022970990.1 uncharacterized protein LOC111469795 [Cucurbita maxima] | e value: 6.3e-234 | % identity: 84.22
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK
        MA++IP+DLIKQLQISLRN AK+SSY+P D SLPNLPSLHETIA+LDPSP YLRCKHCKGRLLRDLKSF+CVFCG+EQNTEVPPDPINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK

Query:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF
        SLDLDGSEMVGH+DLKES+RGKS EEFPLT+LLDL+IRWPESEKRGLSD T A SKSTLNLA VDLD YFSEE KDT  KVSDE  PLN+QIDGSE KTF
Subjt:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF

Query:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD
        +DNVDLSLFGNVQSSETATRI +HES DSFS WEANFQT NSAT+H+NSKSVDPFA+S VDIS +LE TSGHQNK RSGEIEETKNPSSSMT+DWFQQQD
Subjt:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD

Query:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD
        DLWSSSNHETI TPEQV+QTG   DG+TVGTADYSSSASVDWFQDDQWQGGS KKPDD S F+DDDSADAWDDFTSST +QG LDN  KDIVN+IVPKVD
Subjt:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD

Query:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRN-GTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDS
        EISEIDFFSTTTSKD NFGNFS PN  VE+FPN N GTSEEKATRPDASDLSRMSEENGK+GE+SK  KE +A+S PSSN DDVQ +MAKMHDLSFML+S
Subjt:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRN-GTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDS

Query:  HLSIPPK
        HLSIPPK
Subjt:  HLSIPPK

XP_023533243.1 uncharacterized protein LOC111795191 [Cucurbita pepo subsp. pepo] | e value: 6.7e-236 | % identity: 84.39
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK
        MAY+IP+DLIKQLQISLRN AK+SSY+P D SLPNLPSLHETIA+LDPSP YLRCKHCKGRLLRDLKSFICV CG+EQNTEVPPDPINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK

Query:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF
        SLDLDGSEMVGH+DLKES+RGKS EEFPLT+LLDL+IRWPESEKRGLSD T A SKSTLNLA VDLD YFSEE KD  +KVSDE  PLN+QIDGSESKTF
Subjt:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF

Query:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD
        +DNVDLSLFGNVQSSETATRI +HES DSFS WEANFQT NSAT+H+NSKSVDPFA+S VDIS +LE TSGHQNK RSGEIEETKNPSSSMT+DWFQQQD
Subjt:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD

Query:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD
        DLWSSSNHETI TPEQV QTG   DG+TVGTADYSSSASVDWFQDDQWQGGSKKKPDD S F+DDDSADAWDDFTSST +QG LDN  KDIVN+IVPKV 
Subjt:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD

Query:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH
        EISEIDFF TTTSKD NFGNFS PN  VE+FPN NGTSEEKATRPDASDLSRMSEENGK+GE+SK  KE +A+S PSSN DDVQ +MAKMHDLSFML+SH
Subjt:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH

Query:  LSIPPK
        LSIPPK
Subjt:  LSIPPK

XP_038902680.1 uncharacterized protein LOC120089318 [Benincasa hispida] | e value: 1.2e-216 | % identity: 79.84
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK
        MAYEIPHDLIKQLQISLRNGAKISSY+P DPSLPNLPSLHETIAELDPSP YLRCKHC GRLLRDLKSF+CVFCGREQNT+VPPDPINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK

Query:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF
        SLDLDGSEMV  I+LKES+RGKSPE+FPLT+LLDLEIRWPESEK+G+SDETPA SKS LNLA VDLDYYFSEEKKDT SK S+EP PLNKQ       T 
Subjt:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF

Query:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD
        EDNVDLSLF NV SSETATR TKHESGDSFS WEA+FQ A+SAT HDNSKSVDPFAVS V+ISS+LETT G QNKSRSGE ++TKNPSSS+TNDWFQQQ 
Subjt:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD

Query:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD
        DLWSSSNHETIR P+QVEQTGI+IDGR   TA+YSSSASVDWFQ DQ QGGS+KKPDD+S F+ D SADAWDDFTSST V GP DNSRKDIVND+V KVD
Subjt:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD

Query:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH
        EISE+DFFSTT   +S+F N S PNS  E+FPN NGTS  KAT  DASDLSRMSEE+G+TGE+SK + E ++ SGPSS++DDVQ +M KMHDLSFML+S+
Subjt:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH

Query:  LSIPPK
        LSIPPK
Subjt:  LSIPPK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LMS7 Uncharacterized protein | e value: 1.6e-206 | % identity: 76.28
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK
        MAYEIP DLIKQLQISLRN A ISSY+P  PSLPNLPS +ETIA+LDPSP YLRCKHCKGRLLRDLKSFICVFCGREQ ++VPPDPINF NTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK

Query:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF
        SLDLDGSEMVG IDLKES+RGKSPE+FPLT+LLDLEIRWPESEK+G+SDETPA SKSTLNLAGVDL  YF+EEK DT SK SD   P +K       +T 
Subjt:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF

Query:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD
        EDN DLSLF    S ETATR TKHES DSFS WEA+FQ A+SAT  DNSKSVDPF VS V+ISS+LETT G+QNKS SGE E+TKNPSSS TNDWFQQQD
Subjt:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD

Query:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD
        DLWSSSNH+TI  P+QVEQTGILIDGRT  TA+YSSSA+VDWFQDDQ QG S+KKPDD+S F+DD SADAWDDFTSST VQGP DNS+KDIVND VPKVD
Subjt:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD

Query:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH
        EISE+DFFST T+KDS+F + S P S  E+FPN NGTS EKA  PDASDLSRMSEENGKT E+S  + + +A SGPSS++DD + +M KMHDLSFML+S 
Subjt:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH

Query:  LSIPPK
        LSIPPK
Subjt:  LSIPPK

A0A5A7SW96 Dentin sialophosphoprotein | e value: 5.0e-205 | % identity: 75.1
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK
        MAYEIP DLIKQLQISLRN AKISSY+P  PSLPNLPS ++TIAELDPSP YLRCKHCKGRLLRDLKSFICVFCGREQ ++VPP+PINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK

Query:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF
        SLDLDGSEMVG IDLKES+RGKSPE+FPLT+LLDLEIRWPES+K G+ DETPA SKSTLNLAGVDL YYF+EEK DT SK SD   P +KQ       T 
Subjt:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF

Query:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD
        EDN DLSLF    SSE+ATR TKHES DSFS WEA+FQTA+SAT+ DNSKS+DPF VS V++SS+ E T G QNKSRSGE E+TK+PSSS TNDWFQQQD
Subjt:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD

Query:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD
        DLWSSSNH+T+  P+QVEQTGILIDGR   TA+YSSSA+VDWFQDDQWQGGS+KKPDD+S F+DDDSADAWD+FTSST VQGP DNSRKDIV D VPKVD
Subjt:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD

Query:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH
        EISE+DFFSTTT+KDS+F + S P S  E+FPN NGTS EKA  PDASDL+RM EENGK+ E+S   +   A+ G  S++DD Q +M KMHDLSFML+S+
Subjt:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH

Query:  LSIPPK
        LSIPPK
Subjt:  LSIPPK

A0A5D3CEG4 Dentin sialophosphoprotein | e value: 1.9e-204 | % identity: 74.7
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK
        MAYEIP DLIKQLQISLRN AKISSY+P  PSLPNLPS ++TIAELDPSP YLRCKHCKGRLLRDLKSFICVFCGREQ ++VPP+PINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK

Query:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF
        SLDLDGSEMVG IDLKES+RGKSPE+FPLT+LLDLEIRWPES+K G++DETPA SKSTLNLAGVDL YYF+EEK DT SK SD   P +KQ       T 
Subjt:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF

Query:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD
        EDN DLSLF    SSE+ATR TKHES DSFS WEA+FQTA+SAT+ DNSKS+DPF VS V++SS+ E T G QNKSRSGE E+TK+PSSS TNDWFQQQD
Subjt:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD

Query:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD
        DLWSSSNH+T+  P+QVEQTGILIDGR   T +YSSSA+VDWFQDDQWQGGS+KKPDD+S F+DDDSAD WD+FTSST VQGP DNSRKDIV D VPKVD
Subjt:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD

Query:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH
        EISE+DFFSTTT+KDS+F + S P S  E+FPN NGTS EKA  PDASDL+RM EENGK+ E+S       A+ G  S++DD Q +M KMHDLSFML+S+
Subjt:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH

Query:  LSIPPK
        LSIPPK
Subjt:  LSIPPK

A0A6J1G5U6 uncharacterized protein LOC111451112 | e value: 8.3e-232 | % identity: 83.2
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK
        MAY+IP+DLIKQLQISLRN AK+SSY+P D SLPNLPSLHETIA+LDPSP YLRCKHCKGRLLRDLKSFICV CG+EQNTEVPPDPINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK

Query:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF
        SL LDGSEMVG++DLKES+RGKS EEFPLT+LLDL+IRWPESEKRGLSD T A SKSTLNLA VDLD YFSEE KDT  KVSDE  PLN+QIDGSESKTF
Subjt:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF

Query:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD
        +DNVDLSLFGNVQSS+TATRI +HES DSFS WEANFQT NSAT+H+NSKSVDPFA+S VDIS +LE TSGHQNK RSGEIEETKNPSSS+T+DWFQQQD
Subjt:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD

Query:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD
        DLWSSSNHETI TPEQV QTG   DG+ VGTADYSSSASVDWFQDDQWQGGSKKKPDD S FEDDDSADAWDDFTSST +QG  DN  KDIVN+IVPKV 
Subjt:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD

Query:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH
        EISEIDFF TTTSKD NFGNFS PN  VE+FPN NGTSEEKATRPDASDLS MSEENGK+GE+SK  KE +++S PSSN DDVQ +MAKMHDLSFML+SH
Subjt:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSH

Query:  LSIPPK
        LSIPPK
Subjt:  LSIPPK

A0A6J1I4G5 uncharacterized protein LOC111469795 | e value: 3.0e-234 | % identity: 84.22
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK
        MA++IP+DLIKQLQISLRN AK+SSY+P D SLPNLPSLHETIA+LDPSP YLRCKHCKGRLLRDLKSF+CVFCG+EQNTEVPPDPINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLK

Query:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF
        SLDLDGSEMVGH+DLKES+RGKS EEFPLT+LLDL+IRWPESEKRGLSD T A SKSTLNLA VDLD YFSEE KDT  KVSDE  PLN+QIDGSE KTF
Subjt:  SLDLDGSEMVGHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTF

Query:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD
        +DNVDLSLFGNVQSSETATRI +HES DSFS WEANFQT NSAT+H+NSKSVDPFA+S VDIS +LE TSGHQNK RSGEIEETKNPSSSMT+DWFQQQD
Subjt:  EDNVDLSLFGNVQSSETATRITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQD

Query:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD
        DLWSSSNHETI TPEQV+QTG   DG+TVGTADYSSSASVDWFQDDQWQGGS KKPDD S F+DDDSADAWDDFTSST +QG LDN  KDIVN+IVPKVD
Subjt:  DLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVD

Query:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRN-GTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDS
        EISEIDFFSTTTSKD NFGNFS PN  VE+FPN N GTSEEKATRPDASDLSRMSEENGK+GE+SK  KE +A+S PSSN DDVQ +MAKMHDLSFML+S
Subjt:  EISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRN-GTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDS

Query:  HLSIPPK
        HLSIPPK
Subjt:  HLSIPPK

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G05090.1 dentin sialophosphoprotein-related | e value: 2.9e-43 | % identity: 26.34
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNP-QDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNT-EVPPDPINFKNTIACRWL
        MA EI  DLI QL++SLR  AK++S +   D S P+LP+  E IAELD S  YLRC++CKG+LLR ++S ICVFCG +Q T + PPDPI F +T A +W 
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNP-QDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNT-EVPPDPINFKNTIACRWL

Query:  LKSLDLDGSEMVGHI-DLKESSRG--KSP--EEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSE---------------------
        L SL+LDGSEMV  + +   SSRG  K+P  +   L+  LDLEI+W   E++   D      K+ LNL G++LD YF E                     
Subjt:  LKSLDLDGSEMVGHI-DLKESSRG--KSP--EEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSE---------------------

Query:  -----------------------------EKKDTASKV-------------------SDEPLPL-----------NKQID-------GSESKTFEDNVDL
                                     +KKD    V                    DE L L           + ++D       G +++    + D 
Subjt:  -----------------------------EKKDTASKV-------------------SDEPLPL-----------NKQID-------GSESKTFEDNVDL

Query:  SLFGNVQSSETATRITKHESGDSF---------------------------------------------SDWEANFQTANSATTHDNSKSVDPFAVSEVD
          FG  +  + A R +  +  +SF                                             SDW+++FQ+A+   +       DPF  S VD
Subjt:  SLFGNVQSSETATRITKHESGDSF---------------------------------------------SDWEANFQTANSATTHDNSKSVDPFAVSEVD

Query:  ISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQDDLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSG
        +++ +++  G        +  ++     S   DW   QDDL+ +   E       V       +G+ VG  + +SS  +DW  DD WQ   KK  +    
Subjt:  ISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQDDLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSASVDWFQDDQWQGGSKKKPDDQSG

Query:  FEDDDSADAWDDFTSSTSVQGP-------LDNSRKDIV----------------------NDIVPKVDEISEIDFFST----------------------
          +DD  D W+DF SS + + P       +++S+ +I                         ++  + +  E D F T                      
Subjt:  FEDDDSADAWDDFTSSTSVQGP-------LDNSRKDIV----------------------NDIVPKVDEISEIDFFST----------------------

Query:  ---------------TTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSF
                         ++D +F + S  +   ES   +  + E K      S L R S+ +G + + +  +     T+ P S SD  + +M++MHDLSF
Subjt:  ---------------TTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSF

Query:  MLDSHLSIPP
        ML++ LS+PP
Subjt:  MLDSHLSIPP

AT4G20720.1 dentin sialophosphoprotein-related | e value: 1.6e-41 | % identity: 25.92
Query:  MAYEIPHDLIKQLQISLRNGAKISSYNP-QDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNT-EVPPDPINFKNTIACRWL
        MA EI  DLI QL++SLR  AK++S +   D S P+LP+  E IAELD S  YLRC++CKG+LLR ++S ICVFCG +Q T + PPDPI F +T A +W 
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYNP-QDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNT-EVPPDPINFKNTIACRWL

Query:  LKSLDLDGSEMVGHI-DLKESSRG--KSP--EEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSE---------------------
        L SL+LDGSEMV  + +   SSRG  K+P  +   L+  LDLEI+W   E++   D      K+ LNL G++LD YF E                     
Subjt:  LKSLDLDGSEMVGHI-DLKESSRG--KSP--EEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSE---------------------

Query:  -----------------------------EKKDTASKV----SDEPLPLNKQIDGSES------------------KTFEDNVDLSL-------------
                                     +KKD    V      E L L    D  ES                   +F+++ +LSL             
Subjt:  -----------------------------EKKDTASKV----SDEPLPLNKQIDGSES------------------KTFEDNVDLSL-------------

Query:  ----------------------------------------------FGNVQSSETATRITKHESGDSF------------------------SDWEANFQ
                                                      FG  +  E A R +  +  ++F                        SDW+++FQ
Subjt:  ----------------------------------------------FGNVQSSETATRITKHESGDSF------------------------SDWEANFQ

Query:  TANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQDDLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSA
        +A+   +       DPF  S VD+++ +++  G        +  ++     S   DW   QDDL+ +   E       V       +G+ VG  + +SS 
Subjt:  TANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQDDLWSSSNHETIRTPEQVEQTGILIDGRTVGTADYSSSA

Query:  SVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGP-------LDNSRKDIV----------------------NDIVPKVDEISEIDFFS
         +DW  DD WQ   KK  +      +DD  D W+DF SS + + P       +++S+ +I                         ++  + +  E D F 
Subjt:  SVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGP-------LDNSRKDIV----------------------NDIVPKVDEISEIDFFS

Query:  T-------------------------------------TTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRA
        T                                       ++D +F + S  +   ES   +  + E K      S L R S+ +G + + +  +     
Subjt:  T-------------------------------------TTSKDSNFGNFSLPNSSVESFPNRNGTSEEKATRPDASDLSRMSEENGKTGEHSKVMKESRA

Query:  TSGPSSNSDDVQTVMAKMHDLSFMLDSHLSIPP
        T+ P S SD  + +M++MHDLSFML++ LS+PP
Subjt:  TSGPSSNSDDVQTVMAKMHDLSFMLDSHLSIPP
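
In the alignment blocks above, the unlabeled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of tallying one block from that middle line. The function name `midline_stats` is hypothetical, and the result for a single block will generally differ from the page's % identity, which covers the whole alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block (hypothetical helper).

    `query` is the residue string from the Query line; `midline` is the
    unlabeled line beneath it. Trailing spaces on the midline are often
    lost in copied text, so the column count is taken from the query.
    """
    columns = len(query)
    midline = midline.ljust(columns)[:columns]
    identical = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return {
        "columns": columns,
        "identical": identical,
        "positives": identical + conservative,
        "pct_identity": 100.0 * identical / columns if columns else 0.0,
    }


# First 12 columns of the first GenBank hit shown above.
stats = midline_stats("MAYEIPHDLIKQ", "MAY+IP+DLIKQ")
print(stats)
```

For these 12 columns the tally is 10 identical residues and 12 positives.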


Sequences
CDS sequence
ATGGCGTATGAAATTCCTCACGATCTCATCAAACAACTTCAGATCTCACTTCGAAATGGGGCCAAAATCTCCTCCTACAACCCTCAGGATCCTTCACTTCCAAATCTACC
GTCGCTCCATGAAACAATTGCAGAGCTCGATCCCTCGCCGGCTTATCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCATTTATTTGCGTTTTCT
GCGGCAGGGAACAGAACACGGAGGTCCCTCCGGACCCTATTAATTTCAAGAATACCATCGCTTGTCGTTGGCTGCTCAAGTCCTTGGACTTGGATGGATCGGAGATGGTG
GGACATATTGATTTGAAGGAATCGAGCCGGGGAAAATCACCTGAGGAGTTTCCCCTCACGAATCTTTTAGATTTAGAGATCAGGTGGCCTGAATCTGAAAAAAGGGGGCT
CTCAGACGAGACTCCGGCTTCAAGTAAGAGTACCTTGAATTTGGCTGGAGTTGATCTTGACTACTACTTCTCTGAGGAAAAAAAAGACACTGCCTCAAAAGTATCTGATG
AGCCACTACCACTGAATAAACAAATTGATGGTTCTGAAAGCAAAACTTTTGAGGACAATGTAGATCTTAGTTTGTTTGGAAATGTTCAATCTTCTGAGACAGCTACAAGG
ATCACTAAACATGAGAGTGGTGATTCTTTTTCTGATTGGGAGGCAAACTTTCAGACGGCTAATTCTGCAACTACTCACGATAATTCCAAATCTGTTGATCCTTTTGCTGT
TTCGGAGGTCGATATTTCTTCCACTTTAGAAACAACATCAGGGCACCAAAACAAGTCCAGAAGTGGAGAAATAGAAGAAACCAAAAACCCATCTTCATCAATGACCAATG
ACTGGTTTCAACAACAAGATGATTTATGGAGTAGTTCCAACCATGAAACCATTCGCACGCCTGAACAGGTCGAGCAAACTGGAATTTTAATTGATGGCAGAACTGTAGGA
ACTGCAGATTATTCTTCATCAGCAAGTGTTGATTGGTTTCAAGATGATCAGTGGCAAGGAGGAAGCAAAAAGAAACCTGATGATCAAAGTGGTTTCGAAGATGACGACTC
AGCTGATGCTTGGGATGATTTTACTAGCTCGACCAGTGTGCAAGGCCCTTTGGATAATTCTAGGAAAGATATTGTTAATGACATTGTGCCGAAGGTGGATGAAATATCAG
AAATAGATTTCTTCAGCACAACGACCTCCAAGGATAGTAATTTTGGAAACTTTTCTCTGCCAAATTCATCTGTGGAATCATTCCCCAATCGGAATGGTACATCAGAAGAA
AAAGCAACGCGGCCAGATGCTTCTGACTTAAGCAGGATGAGTGAAGAGAATGGAAAAACTGGAGAACATTCCAAAGTTATGAAGGAAAGTCGGGCTACATCAGGTCCAAG
TTCAAATTCTGATGATGTACAGACGGTGATGGCGAAGATGCACGATCTTTCTTTTATGCTCGATAGCCATCTTTCAATCCCCCCAAAGTGA
mRNA sequence
ATGGCGTATGAAATTCCTCACGATCTCATCAAACAACTTCAGATCTCACTTCGAAATGGGGCCAAAATCTCCTCCTACAACCCTCAGGATCCTTCACTTCCAAATCTACC
GTCGCTCCATGAAACAATTGCAGAGCTCGATCCCTCGCCGGCTTATCTTCGCTGCAAACACTGCAAAGGAAGATTGCTTAGAGACTTGAAGTCATTTATTTGCGTTTTCT
GCGGCAGGGAACAGAACACGGAGGTCCCTCCGGACCCTATTAATTTCAAGAATACCATCGCTTGTCGTTGGCTGCTCAAGTCCTTGGACTTGGATGGATCGGAGATGGTG
GGACATATTGATTTGAAGGAATCGAGCCGGGGAAAATCACCTGAGGAGTTTCCCCTCACGAATCTTTTAGATTTAGAGATCAGGTGGCCTGAATCTGAAAAAAGGGGGCT
CTCAGACGAGACTCCGGCTTCAAGTAAGAGTACCTTGAATTTGGCTGGAGTTGATCTTGACTACTACTTCTCTGAGGAAAAAAAAGACACTGCCTCAAAAGTATCTGATG
AGCCACTACCACTGAATAAACAAATTGATGGTTCTGAAAGCAAAACTTTTGAGGACAATGTAGATCTTAGTTTGTTTGGAAATGTTCAATCTTCTGAGACAGCTACAAGG
ATCACTAAACATGAGAGTGGTGATTCTTTTTCTGATTGGGAGGCAAACTTTCAGACGGCTAATTCTGCAACTACTCACGATAATTCCAAATCTGTTGATCCTTTTGCTGT
TTCGGAGGTCGATATTTCTTCCACTTTAGAAACAACATCAGGGCACCAAAACAAGTCCAGAAGTGGAGAAATAGAAGAAACCAAAAACCCATCTTCATCAATGACCAATG
ACTGGTTTCAACAACAAGATGATTTATGGAGTAGTTCCAACCATGAAACCATTCGCACGCCTGAACAGGTCGAGCAAACTGGAATTTTAATTGATGGCAGAACTGTAGGA
ACTGCAGATTATTCTTCATCAGCAAGTGTTGATTGGTTTCAAGATGATCAGTGGCAAGGAGGAAGCAAAAAGAAACCTGATGATCAAAGTGGTTTCGAAGATGACGACTC
AGCTGATGCTTGGGATGATTTTACTAGCTCGACCAGTGTGCAAGGCCCTTTGGATAATTCTAGGAAAGATATTGTTAATGACATTGTGCCGAAGGTGGATGAAATATCAG
AAATAGATTTCTTCAGCACAACGACCTCCAAGGATAGTAATTTTGGAAACTTTTCTCTGCCAAATTCATCTGTGGAATCATTCCCCAATCGGAATGGTACATCAGAAGAA
AAAGCAACGCGGCCAGATGCTTCTGACTTAAGCAGGATGAGTGAAGAGAATGGAAAAACTGGAGAACATTCCAAAGTTATGAAGGAAAGTCGGGCTACATCAGGTCCAAG
TTCAAATTCTGATGATGTACAGACGGTGATGGCGAAGATGCACGATCTTTCTTTTATGCTCGATAGCCATCTTTCAATCCCCCCAAAGTGA
Protein sequence
MAYEIPHDLIKQLQISLRNGAKISSYNPQDPSLPNLPSLHETIAELDPSPAYLRCKHCKGRLLRDLKSFICVFCGREQNTEVPPDPINFKNTIACRWLLKSLDLDGSEMV
GHIDLKESSRGKSPEEFPLTNLLDLEIRWPESEKRGLSDETPASSKSTLNLAGVDLDYYFSEEKKDTASKVSDEPLPLNKQIDGSESKTFEDNVDLSLFGNVQSSETATR
ITKHESGDSFSDWEANFQTANSATTHDNSKSVDPFAVSEVDISSTLETTSGHQNKSRSGEIEETKNPSSSMTNDWFQQQDDLWSSSNHETIRTPEQVEQTGILIDGRTVG
TADYSSSASVDWFQDDQWQGGSKKKPDDQSGFEDDDSADAWDDFTSSTSVQGPLDNSRKDIVNDIVPKVDEISEIDFFSTTTSKDSNFGNFSLPNSSVESFPNRNGTSEE
KATRPDASDLSRMSEENGKTGEHSKVMKESRATSGPSSNSDDVQTVMAKMHDLSFMLDSHLSIPPK
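
The CDS and protein sequences above can be cross-checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and an in-frame sequence containing only A, C, G, and T. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Remove whitespace and line breaks so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
print(translate("ATGGCGTATGAAATTCCTCACGATCTCATC"))
print(translate("ATCCCCCCAAAGTGA"))
```

The two fragments translate to `MAYEIPHDLI` and `IPPK`, matching the start and end of the listed protein sequence.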