CuGenDBv2

MC02g0444 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC02g0444
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Dentin sialophosphoprotein
Genome location: MC02:3723270..3726622
RNA-Seq Expression: MC02g0444
Synteny: MC02g0444
Gene Ontology terms: NA
InterPro domains: NA
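
The genome location field above follows a `SEQID:START..END` pattern. As a minimal sketch of how such a string could be split for downstream use, the Python below parses it and reports the span length. The function names `parse_location` and `span_length` are hypothetical helpers introduced here for illustration, and the length calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("MC02:3723270..3726622")
print(seqid, start, end, span_length(start, end))
```

Under that coordinate assumption the span covers the whole gene model on the chromosome, so it need not equal the length of the CDS or mRNA sequences listed further down.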


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0034793.1 dentin sialophosphoprotein [Cucumis melo var. makuwa] (e-value: 5.96e-220; identity: 69.56%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN A ISSYDPH PSLPNLPS ++TIA+LDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQI--DGSE-
        SLDLDGSEMVGPID KE NRGKSPEQFPLTDLLDLEI+WPES+K G+ DE PAPSK STLNL GVDL YYF+EEK D TSK SD   P +KQ   D ++ 
Subjt:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQI--DGSE-

Query:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATR-DNSKSIDPFAAS-----SLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSN
        S F    +  +AT   K ES DS SGWEA FQ+AS+AT  DNSKSIDPF  S     S E TFG Q  SRSG  EDTK+ SSS TNDWFQQQDDLWSSSN
Subjt:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATR-DNSKSIDPFAAS-----SLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSN

Query:  HESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST---------------TKPKVDEISEIDFF
        H+++ MP QVEQTGI  DGR   TA++SSSA+ DWFQDDQ +GGS+KKPDDKSV KDDDSADAWD+FTSST                 PKVDEISE+DFF
Subjt:  HESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST---------------TKPKVDEISEIDFF

Query:  SSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSE---------GSNSDDVQKMMGKMHDLSFMLDSHLAVPPK
        S+TT++DS+ R+SSQP SFAEAFP P+GT  EKA  PDASDL+R+GEEN K+ ENS+         GS++DD Q +M KMHDLSFML+S+L++PPK
Subjt:  SSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSE---------GSNSDDVQKMMGKMHDLSFMLDSHLAVPPK

XP_011649988.1 uncharacterized protein LOC101209977 [Cucumis sativus] (e-value: 7.84e-221; identity: 70.08%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN ANISSYDPH PSLPNLPS +ETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDE-PIPLNKQIDGSE--
        SLDLDGSEMVG ID KE NRGKSPEQFPLTDLLDLEI+WPESEKKG++DE PAPSK STLNL GVDL  YF+EEK D TSK SD  P P  + ++ +   
Subjt:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDE-PIPLNKQIDGSE--

Query:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATR-DNSKSIDPFA------ASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSS
        S F    +F TAT   K ES DS SGWEA FQ AS+AT  DNSKS+DPF       +SSLETTFG Q  S SG  EDTKN SSS TNDWFQQQDDLWSSS
Subjt:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATR-DNSKSIDPFA------ASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSS

Query:  NHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST---------------TKPKVDEISEIDF
        NH++I MP QVEQTGI  DGRT  TA++SSSA+ DWFQDDQ +G S+KKPDDKSV KDD SADAWDDFTSST                 PKVDEISE+DF
Subjt:  NHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST---------------TKPKVDEISEIDF

Query:  FSSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSEG----------SNSDDVQKMMGKMHDLSFMLDSHLAVPPK
        FS+ T++DS+ R+SSQP SFAEAFP P+GT  EKA  PDASDLSR+ EEN KT ENS+           S++DD + MM KMHDLSFML+S L++PPK
Subjt:  FSSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSEG----------SNSDDVQKMMGKMHDLSFMLDSHLAVPPK

XP_022140549.1 uncharacterized protein LOC111011184 [Momordica charantia] (e-value: 0.0; identity: 100%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQIDGSESFF
        SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQIDGSESFF
Subjt:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQIDGSESFF

Query:  GDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATRDNSKSIDPFAASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSNHESIRMPGQ
        GDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATRDNSKSIDPFAASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSNHESIRMPGQ
Subjt:  GDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATRDNSKSIDPFAASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSNHESIRMPGQ

Query:  VEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSSTTKPKVDEISEIDFFSSTTSRDSNLRNSSQPNSFAEAFP
        VEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSSTTKPKVDEISEIDFFSSTTSRDSNLRNSSQPNSFAEAFP
Subjt:  VEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSSTTKPKVDEISEIDFFSSTTSRDSNLRNSSQPNSFAEAFP

Query:  PDGTVEEKATRPDASDLSRIGEENEKTGENSEGSNSDDVQKMMGKMHDLSFMLDSHLAVPPK
        PDGTVEEKATRPDASDLSRIGEENEKTGENSEGSNSDDVQKMMGKMHDLSFMLDSHLAVPPK
Subjt:  PDGTVEEKATRPDASDLSRIGEENEKTGENSEGSNSDDVQKMMGKMHDLSFMLDSHLAVPPK

XP_023533243.1 uncharacterized protein LOC111795191 [Cucurbita pepo subsp. pepo] (e-value: 1.33e-220; identity: 69.82%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAY+IP+DLIKQLQISLRN A +SSYDPHD SLPNLPSL ETIA LDPSPPYLRCKHCKGRLLRDLKSFICV CG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQIDGSES--
        SLDLDGSEMVG +D KE NRGKS E+FPLTDLLDL+I+WPESEK+GL+D   APSK STLNL  VDLD YFSEE KD T+KVSDEP  LN+QIDGSES  
Subjt:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQIDGSES--

Query:  --------FFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAAT-RDNSKSIDPFAAS------SLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQ
                 FG+VQ+  TAT I + ES DS SGWEA FQ+ ++AT  +NSKS+DPFA S      SLE T G Q   RSG IE+TKN SSS+T+DWFQQQ
Subjt:  --------FFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAAT-RDNSKSIDPFAAS------SLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQ

Query:  DDLWSSSNHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSSTTK----------------PKV
        DDLWSSSNHE+I  P QV QTG   DG+TVGTAD+SSSAS DWFQDDQ +GGSKKKPDD S  KDDDSADAWDDFTSST                  PKV
Subjt:  DDLWSSSNHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSSTTK----------------PKV

Query:  DEISEIDFFSSTTSRDSNLRNSSQPNSFAEAFPP-DGTVEEKATRPDASDLSRIGEENEKTGENSEG-----------SNSDDVQKMMGKMHDLSFMLDS
         EISEIDFF +TTS+D N  N SQPN F EAFP  +GT EEKATRPDASDLSR+ EEN K+GENS+            SN DDVQ MM KMHDLSFML+S
Subjt:  DEISEIDFFSSTTSRDSNLRNSSQPNSFAEAFPP-DGTVEEKATRPDASDLSRIGEENEKTGENSEG-----------SNSDDVQKMMGKMHDLSFMLDS

Query:  HLAVPPK
        HL++PPK
Subjt:  HLAVPPK

XP_038902680.1 uncharacterized protein LOC120089318 [Benincasa hispida] (e-value: 2.17e-224; identity: 72.34%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPHDLIKQLQISLRN A ISSYDPHDPSLPNLPSL ETIA+LDPSPPYLRCKHC GRLLRDLKSF+CVFCGREQNTDVPPDPINFKNTIACRWLLE
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQI--DGSE-
        SLDLDGSEMV PI+ KE NRGKSPEQFPLTDLLDLEI+WPESEKKG++DE PAPSK S LNL  VDLDYYFSEEKKD TSK S+EP PLNKQ   D  + 
Subjt:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQI--DGSE-

Query:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAAT-RDNSKSIDPFA------ASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSS
        S F +V +  TAT   K ESGDS SGWEA FQ AS+AT  DNSKS+DPFA      +SSLETTFG Q  SRSG  +DTKN SSSVTNDWFQQQD LWSSS
Subjt:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAAT-RDNSKSIDPFA------ASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSS

Query:  NHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST----------------TKPKVDEISEID
        NHE+IRMP QVEQTGI  DGR   TA++SSSAS DWFQ DQR+GGS+KKPDDKS  KD  SADAWDDFTSST                   KVDEISE+D
Subjt:  NHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST----------------TKPKVDEISEID

Query:  FFSSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSEG----------SNSDDVQKMMGKMHDLSFMLDSHLAVPPK
        FFS+T S   + RNSSQPNSFAEAFP P+GT   KAT  DASDLSR+ EE+ +TGENS+           S++DDVQ MM KMHDLSFML+S+L++PPK
Subjt:  FFSSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSEG----------SNSDDVQKMMGKMHDLSFMLDSHLAVPPK

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LMS7 Uncharacterized protein (e-value: 3.80e-221; identity: 70.08%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN ANISSYDPH PSLPNLPS +ETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPPDPINF NTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDE-PIPLNKQIDGSE--
        SLDLDGSEMVG ID KE NRGKSPEQFPLTDLLDLEI+WPESEKKG++DE PAPSK STLNL GVDL  YF+EEK D TSK SD  P P  + ++ +   
Subjt:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDE-PIPLNKQIDGSE--

Query:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATR-DNSKSIDPFA------ASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSS
        S F    +F TAT   K ES DS SGWEA FQ AS+AT  DNSKS+DPF       +SSLETTFG Q  S SG  EDTKN SSS TNDWFQQQDDLWSSS
Subjt:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATR-DNSKSIDPFA------ASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSS

Query:  NHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST---------------TKPKVDEISEIDF
        NH++I MP QVEQTGI  DGRT  TA++SSSA+ DWFQDDQ +G S+KKPDDKSV KDD SADAWDDFTSST                 PKVDEISE+DF
Subjt:  NHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST---------------TKPKVDEISEIDF

Query:  FSSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSEG----------SNSDDVQKMMGKMHDLSFMLDSHLAVPPK
        FS+ T++DS+ R+SSQP SFAEAFP P+GT  EKA  PDASDLSR+ EEN KT ENS+           S++DD + MM KMHDLSFML+S L++PPK
Subjt:  FSSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSEG----------SNSDDVQKMMGKMHDLSFMLDSHLAVPPK

A0A5A7SW96 Dentin sialophosphoprotein (e-value: 2.88e-220; identity: 69.56%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN A ISSYDPH PSLPNLPS ++TIA+LDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQI--DGSE-
        SLDLDGSEMVGPID KE NRGKSPEQFPLTDLLDLEI+WPES+K G+ DE PAPSK STLNL GVDL YYF+EEK D TSK SD   P +KQ   D ++ 
Subjt:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQI--DGSE-

Query:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATR-DNSKSIDPFAAS-----SLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSN
        S F    +  +AT   K ES DS SGWEA FQ+AS+AT  DNSKSIDPF  S     S E TFG Q  SRSG  EDTK+ SSS TNDWFQQQDDLWSSSN
Subjt:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATR-DNSKSIDPFAAS-----SLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSN

Query:  HESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST---------------TKPKVDEISEIDFF
        H+++ MP QVEQTGI  DGR   TA++SSSA+ DWFQDDQ +GGS+KKPDDKSV KDDDSADAWD+FTSST                 PKVDEISE+DFF
Subjt:  HESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST---------------TKPKVDEISEIDFF

Query:  SSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSE---------GSNSDDVQKMMGKMHDLSFMLDSHLAVPPK
        S+TT++DS+ R+SSQP SFAEAFP P+GT  EKA  PDASDL+R+GEEN K+ ENS+         GS++DD Q +M KMHDLSFML+S+L++PPK
Subjt:  SSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSE---------GSNSDDVQKMMGKMHDLSFMLDSHLAVPPK

A0A5D3CEG4 Dentin sialophosphoprotein (e-value: 1.66e-219; identity: 69.15%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIP DLIKQLQISLRN A ISSYDPH PSLPNLPS ++TIA+LDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQ +DVPP+PINFKNTIACRWLL+
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQI--DGSE-
        SLDLDGSEMVGPID KE NRGKSPEQFPLTDLLDLEI+WPES+K G+ DE PAPSK STLNL GVDL YYF+EEK D TSK SD   P +KQ   D ++ 
Subjt:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQI--DGSE-

Query:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATR-DNSKSIDPFAAS-----SLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSN
        S F    +  +AT   K ES DS SGWEA FQ+AS+AT  DNSKSIDPF  S     S E TFG Q  SRSG  EDTK+ SSS TNDWFQQQDDLWSSSN
Subjt:  SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATR-DNSKSIDPFAAS-----SLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSN

Query:  HESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST---------------TKPKVDEISEIDFF
        H+++ MP QVEQTGI  DGR   T ++SSSA+ DWFQDDQ +GGS+KKPDDKSV KDDDSAD WD+FTSST                 PKVDEISE+DFF
Subjt:  HESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSST---------------TKPKVDEISEIDFF

Query:  SSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSE---------GSNSDDVQKMMGKMHDLSFMLDSHLAVPPK
        S+TT++DS+ R+SSQP SFAEAFP P+GT  EKA  PDASDL+R+GEEN K+ ENS+         GS++DD Q +M KMHDLSFML+S+L++PPK
Subjt:  SSTTSRDSNLRNSSQPNSFAEAFP-PDGTVEEKATRPDASDLSRIGEENEKTGENSE---------GSNSDDVQKMMGKMHDLSFMLDSHLAVPPK

A0A6J1CG03 uncharacterized protein LOC111011184 (e-value: 0.0; identity: 100%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQIDGSESFF
        SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQIDGSESFF
Subjt:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQIDGSESFF

Query:  GDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATRDNSKSIDPFAASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSNHESIRMPGQ
        GDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATRDNSKSIDPFAASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSNHESIRMPGQ
Subjt:  GDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATRDNSKSIDPFAASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSNHESIRMPGQ

Query:  VEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSSTTKPKVDEISEIDFFSSTTSRDSNLRNSSQPNSFAEAFP
        VEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSSTTKPKVDEISEIDFFSSTTSRDSNLRNSSQPNSFAEAFP
Subjt:  VEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSSTTKPKVDEISEIDFFSSTTSRDSNLRNSSQPNSFAEAFP

Query:  PDGTVEEKATRPDASDLSRIGEENEKTGENSEGSNSDDVQKMMGKMHDLSFMLDSHLAVPPK
        PDGTVEEKATRPDASDLSRIGEENEKTGENSEGSNSDDVQKMMGKMHDLSFMLDSHLAVPPK
Subjt:  PDGTVEEKATRPDASDLSRIGEENEKTGENSEGSNSDDVQKMMGKMHDLSFMLDSHLAVPPK

A0A6J1I4G5 uncharacterized protein LOC111469795 (e-value: 5.26e-220; identity: 69.69%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE
        MA++IP+DLIKQLQISLRN A +SSYDPHD SLPNLPSL ETIA LDPSPPYLRCKHCKGRLLRDLKSF+CVFCG+EQNT+VPPDPINFKNTIACRWLLE
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLE

Query:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQIDGSE---
        SLDLDGSEMVG +D KE NRGKS E+FPLTDLLDL+I+WPESEK+GL+D   APSK STLNL  VDLD YFSEE KD T KVSDEP  LN+QIDGSE   
Subjt:  SLDLDGSEMVGPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQIDGSE---

Query:  -------SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAAT-RDNSKSIDPFAAS------SLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQ
               S FG+VQ+  TAT I + ES DS SGWEA FQ+ ++AT  +NSKS+DPFA S      SLE T G Q   RSG IE+TKN SSS+T+DWFQQQ
Subjt:  -------SFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAAT-RDNSKSIDPFAAS------SLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQ

Query:  DDLWSSSNHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSSTTK----------------PKV
        DDLWSSSNHE+I  P QV+QTG   DG+TVGTAD+SSSAS DWFQDDQ +GGSKK PDD S  KDDDSADAWDDFTSST                  PKV
Subjt:  DDLWSSSNHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSKKKPDDKSVSKDDDSADAWDDFTSSTTK----------------PKV

Query:  DEISEIDFFSSTTSRDSNLRNSSQPNSFAEAFPP--DGTVEEKATRPDASDLSRIGEENEKTGENSEG-----------SNSDDVQKMMGKMHDLSFMLD
        DEISEIDFFS+TTS+D N  N SQPN F EAFP    GT EEKATRPDASDLSR+ EEN K+GENS+            SN DDVQ MM KMHDLSFML+
Subjt:  DEISEIDFFSSTTSRDSNLRNSSQPNSFAEAFPP--DGTVEEKATRPDASDLSRIGEENEKTGENSEG-----------SNSDDVQKMMGKMHDLSFMLD

Query:  SHLAVPPK
        SHL++PPK
Subjt:  SHLAVPPK

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G05090.1 dentin sialophosphoprotein-related (e-value: 3.1e-36; identity: 33.9%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDP-HDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLI QL++SLR  A ++S D   D S P+LP+ +E IA+LD S PYLRC++CKG+LLR ++S ICVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDP-HDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVGPI---DSKELNRGKSP--EQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNK--
        L SL+LDGSEMV P+   D       K+P  +   L+  LDLEI+W   E+K  +D+  +  KK+ LNL G++LD YF E + D +     E  P+    
Subjt:  LESLDLDGSEMVGPI---DSKELNRGKSP--EQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNK--

Query:  -QIDGSESFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATRDNSKSIDPFAASSLETTFGRQKNSRSGAIEDTKNSSSSVTND---WFQQQDDLW
         +   S S F  V++ G    +   +  D+   ++ K    S  +    +++  FA    +     +  S     ED + +SSS  ++   +F+ +D   
Subjt:  -QIDGSESFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATRDNSKSIDPFAASSLETTFGRQKNSRSGAIEDTKNSSSSVTND---WFQQQDDLW

Query:  SSSNHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSK------------------KKPDDKSVSKDDDSADAWD-DFTSSTTKPKVD
        +SS+ +     G  E  G     R   + +  S    +  +D QR   SK                  K  DDK V+   D    WD DF S+       
Subjt:  SSSNHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRGGSK------------------KKPDDKSVSKDDDSADAWD-DFTSSTTKPKVD

Query:  EISEIDFFSS
        +I    F SS
Subjt:  EISEIDFFSS

AT4G20720.1 dentin sialophosphoprotein-related (e-value: 4.8e-37; identity: 32.34%)
Query:  MAYEIPHDLIKQLQISLRNAANISSYDP-HDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL
        MA EI  DLI QL++SLR  A ++S D   D S P+LP+ +E IA+LD S PYLRC++CKG+LLR ++S ICVFCG +Q T D PPDPI F +T A +W 
Subjt:  MAYEIPHDLIKQLQISLRNAANISSYDP-HDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNT-DVPPDPINFKNTIACRWL

Query:  LESLDLDGSEMVGPI---DSKELNRGKSP--EQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNK--
        L SL+LDGSEMV P+   D       K+P  +   L+  LDLEI+W   E+K  +D+  +  KK+ LNL G++LD YF E + D +     E  P+    
Subjt:  LESLDLDGSEMVGPI---DSKELNRGKSP--EQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNK--

Query:  -QIDGSESFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATRDNSKSIDPFAASSLE--TTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWS
         +   S S F  V++ G    +   +  D+   ++ K    S  +    +++  FA    +   +F  Q N      +D +NS        F++ ++L  
Subjt:  -QIDGSESFFGDVQAFGTATTIAKQESGDSSSGWEAKFQSASAATRDNSKSIDPFAASSLE--TTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWS

Query:  SSNHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRG--GSKKKPDDKSVSKDDDSADAWDDFTSSTTKPKVDEISEIDFFSSTTSRDSNL
            E        ++T  S    + G  +   +      +DD+  G    KK     S SK+D+S   ++    +       E     FF      +++L
Subjt:  SSNHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDDQRRG--GSKKKPDDKSVSKDDDSADAWDDFTSSTTKPKVDEISEIDFFSSTTSRDSNL

Query:  RN
        ++
Subjt:  RN
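
In the alignment blocks above, the unlabeled middle row is a BLAST-style midline: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how identity could be tallied from one such row, the Python below counts these symbols. The function names are hypothetical, and the figures it produces for a short excerpt are not expected to match the whole-alignment percentages listed for each hit, which are computed over every alignment column.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment row.

    The midline is padded to the query length because trailing spaces
    are easily lost when alignments are copied as plain text.
    """
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical
    conservative = midline.count("+")                   # '+' = similar residue
    return identities, identities + conservative, len(query)


def percent(part: int, whole: int) -> float:
    """Percentage rounded to two decimals."""
    return round(100 * part / whole, 2)


# Excerpt taken from the first GenBank alignment row (query vs. midline).
ident, pos, cols = midline_stats("LPSLDETIADLD", "LPS ++TIA+LD")
print(ident, pos, cols, percent(ident, cols))
```

To approximate a whole-alignment figure, the counts from every row of a hit would need to be summed before dividing.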


Sequences
CDS sequence
ATGGCGTATGAAATTCCACACGATCTGATCAAACAACTTCAGATCTCACTTCGAAATGCTGCCAATATCTCCTCCTACGACCCACACGATCCTTCGCTTCCGAATCTACC
ATCGCTCGATGAAACAATTGCAGACCTCGATCCCTCGCCGCCTTATCTTCGCTGCAAACATTGCAAAGGAAGATTGCTTAGAGACTTGAAATCATTTATTTGCGTCTTCT
GCGGCAGGGAACAGAACACGGACGTCCCTCCGGACCCTATTAATTTTAAGAATACCATCGCTTGTCGATGGTTGCTCGAGTCCTTGGACTTGGATGGATCGGAGATGGTG
GGGCCGATCGATTCAAAGGAATTGAACCGGGGAAAATCACCAGAGCAGTTTCCACTGACGGATCTTTTAGATTTAGAGATCAAGTGGCCTGAATCTGAAAAAAAGGGGCT
TGCAGACGAGATCCCAGCTCCAAGTAAAAAAAGTACCTTGAATTTGGTTGGAGTTGATCTTGACTACTACTTTTCTGAGGAAAAAAAAGACAATACTTCAAAAGTATCTG
ATGAGCCAATACCACTGAATAAACAAATAGATGGTTCTGAAAGTTTTTTTGGAGATGTTCAAGCTTTTGGGACAGCGACAACGATAGCTAAACAGGAGAGTGGCGATTCC
TCTTCTGGTTGGGAGGCAAAGTTTCAGTCTGCTAGTGCCGCAACTCGTGATAATTCTAAATCAATTGATCCTTTTGCTGCTTCCTCTTTGGAAACAACATTTGGACGCCA
AAAAAATTCCAGAAGTGGAGCAATAGAAGATACTAAAAACTCCTCTTCATCAGTAACCAATGACTGGTTTCAACAACAGGATGATTTATGGAGTAGTTCCAATCATGAAT
CCATTCGTATGCCTGGACAGGTCGAGCAAACTGGAATTTCAACTGATGGTAGAACTGTAGGAACTGCAGATTTTTCTTCATCAGCAAGTGGTGATTGGTTTCAAGATGAT
CAGCGGCGAGGAGGGAGCAAAAAGAAACCTGATGATAAAAGTGTTTCTAAAGATGATGACTCAGCTGATGCCTGGGATGATTTTACTAGCTCGACCACGAAGCCAAAGGT
GGATGAGATATCAGAAATAGATTTCTTCAGCTCAACGACCTCAAGGGATAGTAATCTTAGAAACTCTTCTCAGCCAAATTCATTTGCAGAAGCATTCCCTCCAGATGGTA
CAGTAGAAGAAAAAGCAACACGGCCAGATGCTTCTGATTTAAGCAGGATTGGTGAAGAGAACGAAAAAACTGGAGAAAATTCTGAAGGTTCAAATTCTGATGATGTACAG
AAGATGATGGGAAAGATGCACGATCTTTCTTTTATGCTCGATAGCCATCTTGCAGTCCCCCCAAAGTGA
mRNA sequence
AGAGAAAATCTGAGTCGAAAAGCGCGACTTTGAATAGCGAGTTCTTCCTTCCGTAATACGCTGCGTTTTAGGCATTCTTTCTGTATTCCGAAATGCCATCTTCTCCGAGG
TCGTCATTTTCGAAATTCACAGAGAAGGCACACGCTCTTGCTAACAACAGCCGAATACACTGCGAGCACCTTTAAATCCTTCTAATGGCGTATGAAATTCCACACGATCT
GATCAAACAACTTCAGATCTCACTTCGAAATGCTGCCAATATCTCCTCCTACGACCCACACGATCCTTCGCTTCCGAATCTACCATCGCTCGATGAAACAATTGCAGACC
TCGATCCCTCGCCGCCTTATCTTCGCTGCAAACATTGCAAAGGAAGATTGCTTAGAGACTTGAAATCATTTATTTGCGTCTTCTGCGGCAGGGAACAGAACACGGACGTC
CCTCCGGACCCTATTAATTTTAAGAATACCATCGCTTGTCGATGGTTGCTCGAGTCCTTGGACTTGGATGGATCGGAGATGGTGGGGCCGATCGATTCAAAGGAATTGAA
CCGGGGAAAATCACCAGAGCAGTTTCCACTGACGGATCTTTTAGATTTAGAGATCAAGTGGCCTGAATCTGAAAAAAAGGGGCTTGCAGACGAGATCCCAGCTCCAAGTA
AAAAAAGTACCTTGAATTTGGTTGGAGTTGATCTTGACTACTACTTTTCTGAGGAAAAAAAAGACAATACTTCAAAAGTATCTGATGAGCCAATACCACTGAATAAACAA
ATAGATGGTTCTGAAAGTTTTTTTGGAGATGTTCAAGCTTTTGGGACAGCGACAACGATAGCTAAACAGGAGAGTGGCGATTCCTCTTCTGGTTGGGAGGCAAAGTTTCA
GTCTGCTAGTGCCGCAACTCGTGATAATTCTAAATCAATTGATCCTTTTGCTGCTTCCTCTTTGGAAACAACATTTGGACGCCAAAAAAATTCCAGAAGTGGAGCAATAG
AAGATACTAAAAACTCCTCTTCATCAGTAACCAATGACTGGTTTCAACAACAGGATGATTTATGGAGTAGTTCCAATCATGAATCCATTCGTATGCCTGGACAGGTCGAG
CAAACTGGAATTTCAACTGATGGTAGAACTGTAGGAACTGCAGATTTTTCTTCATCAGCAAGTGGTGATTGGTTTCAAGATGATCAGCGGCGAGGAGGGAGCAAAAAGAA
ACCTGATGATAAAAGTGTTTCTAAAGATGATGACTCAGCTGATGCCTGGGATGATTTTACTAGCTCGACCACGAAGCCAAAGGTGGATGAGATATCAGAAATAGATTTCT
TCAGCTCAACGACCTCAAGGGATAGTAATCTTAGAAACTCTTCTCAGCCAAATTCATTTGCAGAAGCATTCCCTCCAGATGGTACAGTAGAAGAAAAAGCAACACGGCCA
GATGCTTCTGATTTAAGCAGGATTGGTGAAGAGAACGAAAAAACTGGAGAAAATTCTGAAGGTTCAAATTCTGATGATGTACAGAAGATGATGGGAAAGATGCACGATCT
TTCTTTTATGCTCGATAGCCATCTTGCAGTCCCCCCAAAGTGATGCATCTTTAGTTTTTTTTCTGAAGAAGAGCACACGCTGCCACTGAGCTTTTCCTGTAATTTTCTTC
CCTTTTTCTTTTTTAAATCTGTAGCAGCGTAGTGTAGTTATTACTGAAAATGCATTCTTTGATTTTATAAAATGGCCATATGCTGTTGATGTCCATTGCAGACATAACTA
ACATACCGTGGATTCACGCGACTGACAAACTCCTTTCAAACTAGCATTTTGCTGAGTATATTAGATTTATACACACCATCCAACCGAAATCTCGGTATTGTCTGCCAGAC
GACACTATATTTTACTGGCTAAAGCACTGGGTCATCACAGCTGACTTAGCAAATAGTGGATATGGGGAGTTTTATTATTATGAATCAACCCTTCATAAAGATGATAAAAA
ACATAAAACCATCCTAGCTTGAATACATCATTTTAGTAACTGACCAAAACCTATTGATATCACTTTATTAGTCATTTGACACAGTACACAACAAACAAGAATGCCAAAGC
ACTGGATTTGCATAAGGTGGGCCATGGAAGCCCATACCTTCCCTTGGAAAACAACCCCTTCAAAAATGGCCAGAAGCTCAAAATCAACCACACACTGCATATCACCTCCC
CCACCCCAAACTCTTGCACAATTGACTGCCTTCCTAACAAACCAATTGAAAGTGCCGCCAGTTGAATCATCAAAATGGTTGTGCCTGGCACAAAGAATGGGGACTCATCA
AATGTGAGCCGACCCAAATCTCCATCAGTACCCTGGGCGCCATTGGTAGAAGATGACTCCTTTTTTGTTACTTCGAACACAGTCTCCGATAGTCCTAAAATCTTCATAAT
GACAGCCACCATTCCAAGCAATGAAGAACACATTGTCTTAATCCTCTCCATTCGCTGGTTATTCCACCAGGCTCTTATGGATTGGCCCGTTTCCAAGTACTCCAACAGTT
GTTGTAAGTTGACAAGGACGAAAAGCAGAAGAGGTATCCATATTACT
Protein sequence
MAYEIPHDLIKQLQISLRNAANISSYDPHDPSLPNLPSLDETIADLDPSPPYLRCKHCKGRLLRDLKSFICVFCGREQNTDVPPDPINFKNTIACRWLLESLDLDGSEMV
GPIDSKELNRGKSPEQFPLTDLLDLEIKWPESEKKGLADEIPAPSKKSTLNLVGVDLDYYFSEEKKDNTSKVSDEPIPLNKQIDGSESFFGDVQAFGTATTIAKQESGDS
SSGWEAKFQSASAATRDNSKSIDPFAASSLETTFGRQKNSRSGAIEDTKNSSSSVTNDWFQQQDDLWSSSNHESIRMPGQVEQTGISTDGRTVGTADFSSSASGDWFQDD
QRRGGSKKKPDDKSVSKDDDSADAWDDFTSSTTKPKVDEISEIDFFSSTTSRDSNLRNSSQPNSFAEAFPPDGTVEEKATRPDASDLSRIGEENEKTGENSEGSNSDDVQ
KMMGKMHDLSFMLDSHLAVPPK
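
The CDS and protein sequences on this page can be cross-checked by translating the CDS with the standard genetic code. The Python below is a minimal sketch of that check using only the standard library. The `translate` function and `CODON_TABLE` name are introduced here for illustration, and the sketch assumes the standard nuclear code and a CDS that starts in frame at its first base.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    anything other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last eight codons of the CDS listed above.
print(translate("ATGGCGTATGAAATTCCACACGATCTG"))
print(translate("CATCTTGCAGTCCCCCCAAAGTGA"))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence, joined into one string, gives a simple consistency check between the two records.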