CuGenDBv2

Bhi10G001090 (gene) of Wax gourd (B227) v1 genome

Gene ID: Bhi10G001090
Organism: Benincasa hispida cv. B227 (Wax gourd (B227) v1)
Description: Dentin sialophosphoprotein
Genome location: chr10:33183956..33188708
RNA-Seq Expression: Bhi10G001090
Synteny: Bhi10G001090
Gene Ontology terms: NA
InterPro domains: NA
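
The genome location above uses a `chrom:start..end` layout. A minimal sketch of how such a string could be split and the locus span computed is shown below. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function names `parse_locus` and `locus_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_locus(locus: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", locus)
    if match is None:
        raise ValueError(f"unrecognised locus string: {locus!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def locus_length(locus: str) -> int:
    """Span of the locus in bp, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_locus(locus)
    return end - start + 1


chrom, start, end = parse_locus("chr10:33183956..33188708")
print(chrom, start, end, locus_length("chr10:33183956..33188708"))
```

Under the inclusive-coordinate assumption the span of this gene model comes out at 4753 bp; with half-open coordinates it would be one base shorter.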


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0034793.1 dentin sialophosphoprotein [Cucumis melo var. makuwa]; e-value 1.1e-222; identity 82.93%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHC GRLLRDLKSF+CVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS
        SLDLDGSEMV PI+LKESNRGKSPEQFPLTDLLDLEIRWPES+K GI DETPAPSKS LNLA VDL YYF+EEK DTTSKAS+  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS

Query:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN
        LFD  PSSE+ATRTTKHES DSFSGWEASFQ ASSAT  DNSKS+DPF VS VN+SSS E TFGDQNKSRSGET+DTK+PSSS TNDWFQQQ DLWSSSN
Subjt:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN

Query:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF
        H+T+ MPDQVEQTGI+IDGRA ETANYSSSA+VDWFQ DQ QGGSQKKPDDKS FK D SADAWD+FTSSTGV GPSDNSRKDIV D V KVDEISEVDF
Subjt:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF

Query:  FSTT---NSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK
        FSTT   +SDFR+SSQP SFAEAFPNPNGTS+ KA W DASDL+RM EE+G++ ENS A +HQ+ASG  SSTDD QM+MEKMHDLSFMLESNLSIPPK
Subjt:  FSTT---NSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK

XP_008455912.1 PREDICTED: uncharacterized protein LOC103495983 [Cucumis melo]; e-value 5.5e-222; identity 82.53%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHC GRLLRDLKSF+CVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS
        SLDLDGSEMV PI+LKESNRGKSPEQFPLTDLLDLEIRWPES+K GI+DETPAPSKS LNLA VDL YYF+EEK DTTSKAS+  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS

Query:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN
        LFD  PSSE+ATRTTKHES DSFSGWEASFQ ASSAT  DNSKS+DPF VS VN+SSS E TFGDQNKSRSGET+DTK+PSSS TNDWFQQQ DLWSSSN
Subjt:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN

Query:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF
        H+T+ MPDQVEQTGI+IDGRA ET NYSSSA+VDWFQ DQ QGGSQKKPDDKS FK D SAD WD+FTSSTGV GPSDNSRKDIV D V KVDEISEVDF
Subjt:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF

Query:  FSTT---NSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK
        FSTT   +SDFR+SSQP SFAEAFPNPNGTS+ KA W DASDL+RM EE+G++ ENS A  HQ+ASG  SSTDD QM+MEKMHDLSFMLESNLSIPPK
Subjt:  FSTT---NSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK

XP_011649988.1 uncharacterized protein LOC101209977 [Cucumis sativus]; e-value 1.0e-223; identity 83.53%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN A ISSYDPH PSLPNLPS +ETIA+LDPSPPYLRCKHC GRLLRDLKSF+CVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS
        SLDLDGSEMV  I+LKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKS LNLA VDL  YF+EEK DTTSKAS+  PP +K+TVEDN DLS
Subjt:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS

Query:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN
        LFD  PS ETATRTTKHES DSFSGWEASFQ ASSAT  DNSKSVDPF VS VNISSSLETTFG+QNKS SGET+DTKNPSSS TNDWFQQQ DLWSSSN
Subjt:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN

Query:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFKDH-SADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF
        H+TI MPDQVEQTGI+IDGR  ETANYSSSA+VDWFQ DQ QG SQKKPDDKS FKD  SADAWDDFTSSTGV GP DNS+KDIVND V KVDEISEVDF
Subjt:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFKDH-SADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF

Query:  FS---TTNSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK
        FS   T +SDFR+SSQP SFAEAFPNPNGTS+ KA W DASDLSRMSEE+G+T ENS A++ Q+ASGPSSSTDD +MMMEKMHDLSFMLES LSIPPK
Subjt:  FS---TTNSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK

XP_023533243.1 uncharacterized protein LOC111795191 [Cucurbita pepo subsp. pepo]; e-value 4.5e-200; identity 75.49%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAY+IP+DLIKQLQISLRN AK+SSYDPHD SLPNLPSLHETIA+LDPSPPYLRCKHC GRLLRDLKSF+CV CG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQ-------TV
        SLDLDGSEMV  ++LKESNRGKS E+FPLTDLLDL+IRWPESEK+G+SD T APSKS LNLAEVDLD YFSEE KD T+K S+E  PLN+Q       T 
Subjt:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQ-------TV

Query:  EDNVDLSLFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-
        +DNVDLSLF NV SSETATR  +HES DSFSGWEA+FQ  +SAT H+NSKSVDPFA+S V+IS SLE T G QNK RSGE ++TKNPSSS+T+DWFQQQ 
Subjt:  EDNVDLSLFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-

Query:  DLWSSSNHETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVD
        DLWSSSNHETI  P+QV QTG   DG+   TA+YSSSASVDWFQ DQ QGGS+KKPDD SDFK D SADAWDDFTSSTG+ G  DN  KDIVN++V KV 
Subjt:  DLWSSSNHETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVD

Query:  EISEVDFFSTTNS---DFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKAL-EHQSASGPSSSTDDVQMMMEKMHDLSFMLESN
        EISE+DFF TT S   +F N SQPN F EAFPN NGTS  KAT  DASDLSRMSEE+G++GENSKA  E Q++S PSS+ DDVQMMM KMHDLSFMLES+
Subjt:  EISEVDFFSTTNS---DFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKAL-EHQSASGPSSSTDDVQMMMEKMHDLSFMLESN

Query:  LSIPPK
        LSIPPK
Subjt:  LSIPPK

XP_038902680.1 uncharacterized protein LOC120089318 [Benincasa hispida]; e-value 5.8e-280; identity 100%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS
        SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS
Subjt:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS

Query:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQDLWSSSNH
        LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQDLWSSSNH
Subjt:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQDLWSSSNH

Query:  ETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFKDHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDFFS
        ETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFKDHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDFFS
Subjt:  ETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFKDHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDFFS

Query:  TTNSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK
        TTNSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK
Subjt:  TTNSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LMS7 Uncharacterized protein; e-value 4.8e-224; identity 83.53%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN A ISSYDPH PSLPNLPS +ETIA+LDPSPPYLRCKHC GRLLRDLKSF+CVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS
        SLDLDGSEMV  I+LKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKS LNLA VDL  YF+EEK DTTSKAS+  PP +K+TVEDN DLS
Subjt:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS

Query:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN
        LFD  PS ETATRTTKHES DSFSGWEASFQ ASSAT  DNSKSVDPF VS VNISSSLETTFG+QNKS SGET+DTKNPSSS TNDWFQQQ DLWSSSN
Subjt:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN

Query:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFKDH-SADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF
        H+TI MPDQVEQTGI+IDGR  ETANYSSSA+VDWFQ DQ QG SQKKPDDKS FKD  SADAWDDFTSSTGV GP DNS+KDIVND V KVDEISEVDF
Subjt:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFKDH-SADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF

Query:  FS---TTNSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK
        FS   T +SDFR+SSQP SFAEAFPNPNGTS+ KA W DASDLSRMSEE+G+T ENS A++ Q+ASGPSSSTDD +MMMEKMHDLSFMLES LSIPPK
Subjt:  FS---TTNSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK

A0A1S3C2P9 uncharacterized protein LOC103495983; e-value 2.7e-222; identity 82.53%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHC GRLLRDLKSF+CVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS
        SLDLDGSEMV PI+LKESNRGKSPEQFPLTDLLDLEIRWPES+K GI+DETPAPSKS LNLA VDL YYF+EEK DTTSKAS+  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS

Query:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN
        LFD  PSSE+ATRTTKHES DSFSGWEASFQ ASSAT  DNSKS+DPF VS VN+SSS E TFGDQNKSRSGET+DTK+PSSS TNDWFQQQ DLWSSSN
Subjt:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN

Query:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF
        H+T+ MPDQVEQTGI+IDGRA ET NYSSSA+VDWFQ DQ QGGSQKKPDDKS FK D SAD WD+FTSSTGV GPSDNSRKDIV D V KVDEISEVDF
Subjt:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF

Query:  FSTT---NSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK
        FSTT   +SDFR+SSQP SFAEAFPNPNGTS+ KA W DASDL+RM EE+G++ ENS A  HQ+ASG  SSTDD QM+MEKMHDLSFMLESNLSIPPK
Subjt:  FSTT---NSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK

A0A5A7SW96 Dentin sialophosphoprotein; e-value 5.4e-223; identity 82.93%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHC GRLLRDLKSF+CVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS
        SLDLDGSEMV PI+LKESNRGKSPEQFPLTDLLDLEIRWPES+K GI DETPAPSKS LNLA VDL YYF+EEK DTTSKAS+  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS

Query:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN
        LFD  PSSE+ATRTTKHES DSFSGWEASFQ ASSAT  DNSKS+DPF VS VN+SSS E TFGDQNKSRSGET+DTK+PSSS TNDWFQQQ DLWSSSN
Subjt:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN

Query:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF
        H+T+ MPDQVEQTGI+IDGRA ETANYSSSA+VDWFQ DQ QGGSQKKPDDKS FK D SADAWD+FTSSTGV GPSDNSRKDIV D V KVDEISEVDF
Subjt:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF

Query:  FSTT---NSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK
        FSTT   +SDFR+SSQP SFAEAFPNPNGTS+ KA W DASDL+RM EE+G++ ENS A +HQ+ASG  SSTDD QM+MEKMHDLSFMLESNLSIPPK
Subjt:  FSTT---NSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK

A0A5D3CEG4 Dentin sialophosphoprotein; e-value 2.7e-222; identity 82.53%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN AKISSYDPH PSLPNLPS ++TIAELDPSPPYLRCKHC GRLLRDLKSF+CVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS
        SLDLDGSEMV PI+LKESNRGKSPEQFPLTDLLDLEIRWPES+K GI+DETPAPSKS LNLA VDL YYF+EEK DTTSKAS+  PP +KQTVEDN DLS
Subjt:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLS

Query:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN
        LFD  PSSE+ATRTTKHES DSFSGWEASFQ ASSAT  DNSKS+DPF VS VN+SSS E TFGDQNKSRSGET+DTK+PSSS TNDWFQQQ DLWSSSN
Subjt:  LFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-DLWSSSN

Query:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF
        H+T+ MPDQVEQTGI+IDGRA ET NYSSSA+VDWFQ DQ QGGSQKKPDDKS FK D SAD WD+FTSSTGV GPSDNSRKDIV D V KVDEISEVDF
Subjt:  HETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDF

Query:  FSTT---NSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK
        FSTT   +SDFR+SSQP SFAEAFPNPNGTS+ KA W DASDL+RM EE+G++ ENS A  HQ+ASG  SSTDD QM+MEKMHDLSFMLESNLSIPPK
Subjt:  FSTT---NSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPK

A0A6J1I4G5 uncharacterized protein LOC111469795; e-value 1.1e-199; identity 75.94%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA++IP+DLIKQLQISLRN AK+SSYDPHD SLPNLPSLHETIA+LDPSPPYLRCKHC GRLLRDLKSF+CVFCG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQ-------TV
        SLDLDGSEMV  ++LKESNRGKS E+FPLTDLLDL+IRWPESEK+G+SD T APSKS LNLAEVDLD YFSEE KDTT K S+E  PLN+Q       T 
Subjt:  SLDLDGSEMVEPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQ-------TV

Query:  EDNVDLSLFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-
        +DNVDLSLF NV SSETATR  +HES DSFSGWEA+FQ  +SAT H+NSKSVDPFA+S V+IS SLE T G QNK RSGE ++TKNPSSS+T+DWFQQQ 
Subjt:  EDNVDLSLFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ-

Query:  DLWSSSNHETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVD
        DLWSSSNHETI  P+QV+QTG   DG+   TA+YSSSASVDWFQ DQ QGGS KKPDD SDFK D SADAWDDFTSSTG+ G  DN  KDIVN++V KVD
Subjt:  DLWSSSNHETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFK-DHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVD

Query:  EISEVDFFSTTNS---DFRNSSQPNSFAEAFPNPN-GTSIGKATWSDASDLSRMSEESGETGENSKAL-EHQSASGPSSSTDDVQMMMEKMHDLSFMLES
        EISE+DFFSTT S   +F N SQPN F EAFPN N GTS  KAT  DASDLSRMSEE+G++GENSKA  E Q++S PSS+ DDVQMMM KMHDLSFMLES
Subjt:  EISEVDFFSTTNS---DFRNSSQPNSFAEAFPNPN-GTSIGKATWSDASDLSRMSEESGETGENSKAL-EHQSASGPSSSTDDVQMMMEKMHDLSFMLES

Query:  NLSIPPK
        +LSIPPK
Subjt:  NLSIPPK
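
The middle line of each alignment block marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. A minimal sketch of how a percent identity could be derived from a query line and its middle line follows. It assumes the BLAST-style convention of identical positions divided by alignment length, which the page does not state explicitly, and `percent_identity` is a hypothetical helper name. Applied to a single block it will not reproduce the per-hit value listed above, which covers the whole alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline repeats the query residue.

    Both strings must already have the 'Query:  ' prefix (or the equivalent
    leading padding on the middle line) removed so that columns line up.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    # A column counts as identical only when the midline shows the same
    # letter as the query; '+', spaces and gap characters do not count.
    identical = sum(
        1 for q, m in zip(query, midline) if q == m and q.isalpha()
    )
    return 100.0 * identical / len(query)


# Last block of the alignment above: one conservative substitution ('+').
print(round(percent_identity("NLSIPPK", "+LSIPPK"), 2))
```

For a full hit, the identical-column count and the column count would be summed over every block before dividing.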

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G05090.1 dentin sialophosphoprotein-related; e-value 5.8e-44; identity 40.4%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLI QL++SLR  AK++S D   D S P+LP+  E IAELD S PYLRC++C G+LLR ++S +CVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVEPINLKE-SNRG--KSP--EQFPLTDLLDLEIRWPESEKKGISDETPAPSKSN-LNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQT
        L SL+LDGSEMVEP+   + S+RG  K+P  +   L+  LDLEI+W   E+K  SD+  +  K N LNL  ++LD YF E + D +     E  P+    
Subjt:  LESLDLDGSEMVEPINLKE-SNRG--KSP--EQFPLTDLLDLEIRWPESEKKGISDETPAPSKSN-LNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQT

Query:  VEDNVDLSLFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWF
         +D   LSLFD+V  S+    + +H++   F   +A   + SS   H+N           +++ +  +    D+N S     +D +  SSS  ++ F
Subjt:  VEDNVDLSLFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWF

AT1G05090.1 dentin sialophosphoprotein-related; e-value 9.8e-44; identity 26.72%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLI QL++SLR  AK++S D   D S P+LP+  E IAELD S PYLRC++C G+LLR ++S +CVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVEPINLKE-SNRG--KSP--EQFPLTDLLDLEIRWPESEKKGISDETPAPSKSN-LNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQT
        L SL+LDGSEMVEP+   + S+RG  K+P  +   L+  LDLEI+W   E+K  SD+  +  K N LNL  ++LD YF E + D +     E  P+    
Subjt:  LESLDLDGSEMVEPINLKE-SNRG--KSP--EQFPLTDLLDLEIRWPESEKKGISDETPAPSKSN-LNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQT

Query:  VEDNVDLSLFDNVPS-----------------------------------------------------SETATRTTKHESGDSF----------------
         +D   LSLFD+V S                                                      E A RT+  +  +SF                
Subjt:  VEDNVDLSLFDNVPS-----------------------------------------------------SETATRTTKHESGDSF----------------

Query:  ----------------------------------------------------------------------SGWEASFQIASSATLHDNSKSVDPFAVSVV
                                                                              S W++ FQ A    L       DPF  S V
Subjt:  ----------------------------------------------------------------------SGWEASFQIASSATLHDNSKSVDPFAVSVV

Query:  NISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQDLWSSSNHETIRMPDQV--EQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPD-D
        ++++ +++ FG        +  D+     S   DW  Q DL+ +   E       V  +  G ++ G      N +SS  +DW   D  Q   +K  +  
Subjt:  NISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQDLWSSSNHETIRMPDQV--EQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPD-D

Query:  KSDFKDHSADAWDDFTSS-----------------------------TGVLGPSDNSRKDIVNDVVSKVDEISEVDFFSTTNS-----------------
         +D  D   D W+DF SS                              GV   S + +++    V+S + +  E D F T +S                 
Subjt:  KSDFKDHSADAWDDFTSS-----------------------------TGVLGPSDNSRKDIVNDVVSKVDEISEVDFFSTTNS-----------------

Query:  -----------------------DFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLS
                               DF + S+ + F+E+      +   K   S  S L R S+  G   +    +   + + P S +D  + +M +MHDLS
Subjt:  -----------------------DFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLS

Query:  FMLESNLSIPP
        FMLE+ LS+PP
Subjt:  FMLESNLSIPP

AT4G20720.1 dentin sialophosphoprotein-related; e-value 3.2e-42; identity 36.84%
Query:  MAYEIPHDLIKQLQISLRNGAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLI QL++SLR  AK++S D   D S P+LP+  E IAELD S PYLRC++C G+LLR ++S +CVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAYEIPHDLIKQLQISLRNGAKISSYDP-HDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVEPINLKE-SNRG--KSP--EQFPLTDLLDLEIRWPESEKKGISDETPAPSKSN-LNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQT
        L SL+LDGSEMVEP+   + S+RG  K+P  +   L+  LDLEI+W   E+K  SD+  +  K N LNL  ++LD YF E + D +     E  P+    
Subjt:  LESLDLDGSEMVEPINLKE-SNRG--KSP--EQFPLTDLLDLEIRWPESEKKGISDETPAPSKSN-LNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQT

Query:  VEDNVDLSLFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ
         +D   LSLFD+V  S+    + +H++   F   +A   + SS   H+N   +  FA       +    +F  Q      E KD +N   S   D  +  
Subjt:  VEDNVDLSLFDNVPSSETATRTTKHESGDSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQ

Query:  DLWSSSNHETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSD
         L+            +V+++    +G+ A+    SSS   + F + + +  +Q+    K D
Subjt:  DLWSSSNHETIRMPDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSD

AT4G20720.1 dentin sialophosphoprotein-related; e-value 4.9e-03; identity 22.87%
Query:  QTVEDNVDLSLFDNVPSSETATRTTKHESGDSFSGWEASF-QIASSATLHDNSKSVDPFAVSVV-NISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDW
        Q+ + N+     D  P   +      H      SG +  + Q A S+T +  SK+ D     +  N++   +T     +    G+     N +SS+  DW
Subjt:  QTVEDNVDLSLFDNVPSSETATRTTKHESGDSFSGWEASF-QIASSATLHDNSKSVDPFAVSVV-NISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDW

Query:  FQQQDLWSSSNHETIRM------PDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFKDHSAD----------------------
            DLW ++  ++I         D  +          ++T N   S +++  Q +   G +Q    DK+  K+ S D                      
Subjt:  FQQQDLWSSSNHETIRM------PDQVEQTGIVIDGRAAETANYSSSASVDWFQVDQRQGGSQKKPDDKSDFKDHSAD----------------------

Query:  -AWDDFTSST--------GVLGPSDNSRKDIVNDVVSKVDEISEVDFFSTTNSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGEN
          WD FTSST          +  + +  K+   ++  + +   ++DF S + SDF        F+E+      +   K   S  S L R S+  G   + 
Subjt:  -AWDDFTSST--------GVLGPSDNSRKDIVNDVVSKVDEISEVDFFSTTNSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSRMSEESGETGEN

Query:  SKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPP
           +   + + P S +D  + +M +MHDLSFMLE+ LS+PP
Subjt:  SKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPP


Sequences
CDS sequence
ATGGCGTATGAAATCCCTCACGATCTGATCAAACAACTTCAGATCTCTCTTCGAAATGGTGCCAAAATCTCCTCCTACGACCCTCACGATCCTTCACTTCCAAATCTACC
ATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCGCCGCCTTATCTTCGCTGCAAACACTGCAATGGAAGATTGCTTAGAGACTTGAAGTCGTTTATGTGCGTTTTCT
GCGGCAGGGAACAGAACACGGACGTCCCTCCGGACCCCATCAATTTCAAGAATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGATGGATCGGAGATGGTG
GAACCAATCAATTTGAAGGAATCTAACCGGGGAAAATCACCTGAGCAATTTCCCCTGACGGATCTTTTAGATTTAGAGATTAGATGGCCTGAATCTGAAAAGAAGGGGAT
CTCAGACGAGACTCCGGCTCCAAGCAAAAGTAACTTGAATTTGGCTGAAGTTGATCTTGACTACTACTTCTCCGAGGAAAAGAAAGACACTACCTCAAAAGCATCTAATG
AGCCACCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAGTTTGTTTGATAATGTTCCATCGTCCGAGACGGCTACAAGGACCACTAAACATGAGAGTGGT
GATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGATTGCTAGTTCTGCAACTCTTCATGATAATTCCAAATCAGTTGATCCTTTTGCTGTTTCTGTGGTCAATATATCTTC
CTCTTTGGAAACAACGTTTGGGGACCAAAACAAGTCCAGAAGTGGAGAAACAAAAGATACTAAAAATCCTTCTTCATCAGTGACCAATGACTGGTTTCAACAACAAGATT
TATGGAGTAGTTCTAATCATGAAACAATTCGCATGCCAGATCAGGTTGAGCAAACTGGAATTGTAATTGATGGTAGAGCTGCAGAAACTGCTAATTATTCTTCATCAGCA
AGCGTTGATTGGTTTCAAGTTGATCAGCGGCAAGGAGGGAGCCAAAAGAAACCTGATGATAAAAGTGATTTTAAAGATCATTCAGCTGATGCTTGGGATGATTTTACCAG
CTCAACTGGTGTGCTAGGCCCCTCTGATAATTCTAGGAAAGATATTGTGAATGACGTTGTGTCAAAGGTTGATGAGATATCAGAAGTAGATTTCTTCAGCACAACCAATA
GTGATTTTAGAAACTCTTCTCAGCCAAATTCATTTGCAGAAGCATTTCCCAATCCAAATGGTACATCCATAGGAAAAGCAACCTGGTCGGATGCTTCTGATTTAAGCAGG
ATGAGTGAAGAGAGCGGAGAAACTGGAGAAAATTCCAAAGCTCTGGAGCATCAGTCTGCATCAGGTCCTAGTTCAAGTACAGATGATGTACAGATGATGATGGAGAAGAT
GCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAATCCCCCCAAAGTTATTCCACCAAGCTCTTACAGATTGGCCCGTTTTGAAGTAG
mRNA sequence
CTTCCATGAAACGCAGCGTTTGGGAAAATTTTCCCTGTACCGGAAAAGCCATCTTCCCCGTGTGGGAGAAAATCGATTTTCATATTCGAAATCCACACAGAAGGAAGAGA
AGGAGAACTCTGTCGCTCACACAACCTGGAGAAACTACGAAACACCTTTAAACTCAATCAATGGCGTATGAAATCCCTCACGATCTGATCAAACAACTTCAGATCTCTCT
TCGAAATGGTGCCAAAATCTCCTCCTACGACCCTCACGATCCTTCACTTCCAAATCTACCATCGCTCCATGAAACAATTGCAGAGCTTGATCCCTCGCCGCCTTATCTTC
GCTGCAAACACTGCAATGGAAGATTGCTTAGAGACTTGAAGTCGTTTATGTGCGTTTTCTGCGGCAGGGAACAGAACACGGACGTCCCTCCGGACCCCATCAATTTCAAG
AATACCATTGCTTGTCGTTGGCTTCTCGAATCCTTGGACTTGGATGGATCGGAGATGGTGGAACCAATCAATTTGAAGGAATCTAACCGGGGAAAATCACCTGAGCAATT
TCCCCTGACGGATCTTTTAGATTTAGAGATTAGATGGCCTGAATCTGAAAAGAAGGGGATCTCAGACGAGACTCCGGCTCCAAGCAAAAGTAACTTGAATTTGGCTGAAG
TTGATCTTGACTACTACTTCTCCGAGGAAAAGAAAGACACTACCTCAAAAGCATCTAATGAGCCACCACCACTGAATAAACAAACTGTTGAGGATAATGTTGATCTTAGT
TTGTTTGATAATGTTCCATCGTCCGAGACGGCTACAAGGACCACTAAACATGAGAGTGGTGATTCCTTTTCTGGTTGGGAGGCAAGCTTTCAGATTGCTAGTTCTGCAAC
TCTTCATGATAATTCCAAATCAGTTGATCCTTTTGCTGTTTCTGTGGTCAATATATCTTCCTCTTTGGAAACAACGTTTGGGGACCAAAACAAGTCCAGAAGTGGAGAAA
CAAAAGATACTAAAAATCCTTCTTCATCAGTGACCAATGACTGGTTTCAACAACAAGATTTATGGAGTAGTTCTAATCATGAAACAATTCGCATGCCAGATCAGGTTGAG
CAAACTGGAATTGTAATTGATGGTAGAGCTGCAGAAACTGCTAATTATTCTTCATCAGCAAGCGTTGATTGGTTTCAAGTTGATCAGCGGCAAGGAGGGAGCCAAAAGAA
ACCTGATGATAAAAGTGATTTTAAAGATCATTCAGCTGATGCTTGGGATGATTTTACCAGCTCAACTGGTGTGCTAGGCCCCTCTGATAATTCTAGGAAAGATATTGTGA
ATGACGTTGTGTCAAAGGTTGATGAGATATCAGAAGTAGATTTCTTCAGCACAACCAATAGTGATTTTAGAAACTCTTCTCAGCCAAATTCATTTGCAGAAGCATTTCCC
AATCCAAATGGTACATCCATAGGAAAAGCAACCTGGTCGGATGCTTCTGATTTAAGCAGGATGAGTGAAGAGAGCGGAGAAACTGGAGAAAATTCCAAAGCTCTGGAGCA
TCAGTCTGCATCAGGTCCTAGTTCAAGTACAGATGATGTACAGATGATGATGGAGAAGATGCACGATCTATCTTTTATGCTCGAAAGCAATCTTTCAATCCCCCCAAAGT
TATTCCACCAAGCTCTTACAGATTGGCCCGTTTTGAAGTAGTCTAACAACCCTCGTAGCTTGTTAATAACAAAAAGGAGAAGAGATGATGTGTAGGTAGGGATTGTATTC
TATACTTGCCTTGGGCAAAAAATGAGAGTTGGAAATTAGACAGAATGCTGGGAGTGTAGCGTAACATATCTCTGGAATAGCACGAATGCTAGTCAGATATGCCCACAGAT
ACATGAGACGTTGCCTAAATTGGAGCTTATCAGAGAGAGCAGTGACAATAGGGGATCTTTTGCTGATTAGAATTTCAAGTAGTCCTGTTATTGCTCTCTTGTGATGGTTT
AATGGAATTGGGCCTCCTGAAGGAGCACATCCTAGGAATGCTGGTGGAGTTGGTGTAATGTATGCAGATTTCCATCCCCTTTTGTGGATCTCCATCCCAGTCAACACATC
CTCCACTAAAGATCCATATTGCCAACCCACCTAAAGGACCCATCGGTAATTTTGAGTTAACTAACTAGGTATAAGGATTGTAGTGGAG
Protein sequence
MAYEIPHDLIKQLQISLRNGAKISSYDPHDPSLPNLPSLHETIAELDPSPPYLRCKHCNGRLLRDLKSFMCVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMV
EPINLKESNRGKSPEQFPLTDLLDLEIRWPESEKKGISDETPAPSKSNLNLAEVDLDYYFSEEKKDTTSKASNEPPPLNKQTVEDNVDLSLFDNVPSSETATRTTKHESG
DSFSGWEASFQIASSATLHDNSKSVDPFAVSVVNISSSLETTFGDQNKSRSGETKDTKNPSSSVTNDWFQQQDLWSSSNHETIRMPDQVEQTGIVIDGRAAETANYSSSA
SVDWFQVDQRQGGSQKKPDDKSDFKDHSADAWDDFTSSTGVLGPSDNSRKDIVNDVVSKVDEISEVDFFSTTNSDFRNSSQPNSFAEAFPNPNGTSIGKATWSDASDLSR
MSEESGETGENSKALEHQSASGPSSSTDDVQMMMEKMHDLSFMLESNLSIPPKLFHQALTDWPVLK
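
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is one way to do that, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the `translate` function and the `CODON_TABLE` name are hypothetical and not taken from the database. Only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...), '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A/C/G/T.
    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above.
cds_start = "ATGGCGTATGAAATCCCTCACGATCTGATC"
print(translate(cds_start))  # MAYEIPHDLI
```

To check the whole record, the wrapped CDS lines would be joined into one string before calling `translate`, and the result compared with the joined protein lines.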