; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0080201 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0080201
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Descriptioncwf21 domain-containing protein
Genome locationCMiso1.1chr03:26152917..26157330
RNA-Seq ExpressionCmc03g0080201
SyntenyCmc03g0080201
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR013170 - mRNA splicing factor Cwf21 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064984.1 dentin sialophosphoprotein-like [Cucumis melo var. makuwa]0.0e+0099.23Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSL DGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD

Query:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK
        DKDVKKGASKELKGHQKDKKRRPKDD SD DSGEHKGTKKNLRDSRRIDSES+LDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK
Subjt:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK

Query:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG
        DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG
Subjt:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG

Query:  SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRRR
        SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKD RRR
Subjt:  SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRRR

Query:  RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS
        RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS
Subjt:  RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS

Query:  SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLD
        SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGK+ATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKN ADGLD
Subjt:  SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLD

Query:  KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK
        KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK
Subjt:  KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK

Query:  STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS
        STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS
Subjt:  STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS

Query:  SRHDNH
        SRHDNH
Subjt:  SRHDNH

XP_004138875.1 dentin sialophosphoprotein [Cucumis sativus]0.0e+0091.43Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLN+QGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSL D EQVK+EISDPSR+RREGQNAD+KRHEKSEHSFLDR+LNWK+RGTEDQ+D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD

Query:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK
        DKDVKKGASKE+K  QKDKKRR KDD SD DSGE KGTKKNLRDSRR DSESDLDIDVNNKYVASR SKKNRRHDSDDSS TDSGGEHKVTKKHSRNKRK
Subjt:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK

Query:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSH-QK
        D+ E+DSDSDLDQKYLTSRKHKKNRRHDSDDSSD+DS GEHKKTK+SVR+NQRGHGSD DSDVDKKHTSKKQKKSTRHDSD SDSFTDGDKIGMDSH +K
Subjt:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSH-QK

Query:  GSGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRR
        GSGRHES KVKKQRS+KQDSTDETNSDS +EDKHRQLKHK+QHGKRYGESDSSDHDSSDSDVGR KSTHR+HSK TGKSRV+SESD EKSRKYP KDDRR
Subjt:  GSGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRR

Query:  RRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKL
        RRHDIDDEKSGDN SSSDELVKRRRGRRH+ DDSSEEEGEYFGRSGKI TKGKIDAKRQ D SNNSD SLAV RKGDD+HK+AKKY SGDGFNLEKG KL
Subjt:  RRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKL

Query:  SSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGL
        SSGARERGKGNL+H EGRRHNTDDKSEEEGEYLGRSGK+ATKRK+D KRQHDDSENSDDSLAVKHKRAKKY SSDDSDLEKGVKSTDGARERGKN ADGL
Subjt:  SSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGL

Query:  DKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDR
         KFKKDSI+E NHASQRTDKMN KRKLDEG E EQEPESKSRNRNSDPKKD KHDSESSRRSRSGRYD+TRDGRYRED KIDSESNTRSRYSA  EDDDR
Subjt:  DKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDR

Query:  KSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY---ESTRDREDSRKRAKY
        KS RTGSRY+EETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY   ESTRDREDSRKR KY
Subjt:  KSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY---ESTRDREDSRKRAKY

Query:  ESRSSRHDNH
        ESRSSR DNH
Subjt:  ESRSSRHDNH

XP_008445109.1 PREDICTED: dentin sialophosphoprotein-like [Cucumis melo]0.0e+00100Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD

Query:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK
        DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK
Subjt:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK

Query:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG
        DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG
Subjt:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG

Query:  SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRRR
        SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRRR
Subjt:  SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRRR

Query:  RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS
        RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS
Subjt:  RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS

Query:  SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLD
        SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLD
Subjt:  SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLD

Query:  KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK
        KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK
Subjt:  KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK

Query:  STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS
        STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS
Subjt:  STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS

Query:  SRHDNH
        SRHDNH
Subjt:  SRHDNH

XP_022929608.1 dentin sialophosphoprotein-like [Cucurbita moschata]3.7e-28562.35Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYTE EIS+KL+EARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
        ASGSEEKDG SAIVLADK+VSDTQ+HQIAARKEEQMKTLRAALGL S  D EQV E ISDP+R+RREGQNADIKRHEKSEHSFLDRELNWK+ G+ED  D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD

Query:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADS-GE-HKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNK
        DK  KK  SKELKGH KD +RRPKDD SD DS GE HKGTKKNLRD+RR DSESD + D ++KY  SRKSKKNRRHDSD SS TDSGGE K TKKH R+ 
Subjt:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADS-GE-HKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNK

Query:  RKDDPESDSDSDLDQKYLTSRKHKKNRRHDSDD-------------------------------------------------------------------
        R+D P+ D DS+ DQKY TSRKHKKNRRHDSDD                                                                   
Subjt:  RKDDPESDSDSDLDQKYLTSRKHKKNRRHDSDD-------------------------------------------------------------------

Query:  ---------------------------------------SSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKH-TSKKQKKSTRHDSDDSDSFTDGDK
                                               SSD+DSGGEHK+TK+S+++N+R   SD DSD+DKK+ TSKKQ+K+    SDDSDS  D  +
Subjt:  ---------------------------------------SSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKH-TSKKQKKSTRHDSDDSDSFTDGDK

Query:  IGMDSHQKGSGRHESQKV-KKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYG-ESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKS
         GM SH+KGSGR +SQKV KKQR +KQ+STDE+NSDS ++DK RQLKHKNQHGKRYG +SDSSD DSSDSDVGR KS HR+ SKR GKSRVDSESD EK 
Subjt:  IGMDSHQKGSGRHESQKV-KKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYG-ESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKS

Query:  RKYPKKDDRRRRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGD
        RK+PKKD  RRRHD D+++SGDNSSSSDE+VK RR RRH++DD SEEEGEYFG+SGKI TKG I AKR+ D S+ SD S AVDRKG+D+ KRAKK+SSGD
Subjt:  RKYPKKDDRRRRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGD

Query:  GFNLEKGRKLSSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGAR
        G + +KG K S GARERGKG+ NH +G                                                             L++ V +     
Subjt:  GFNLEKGRKLSSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGAR

Query:  ERGKNCADGLDKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKID
         + +N          D + EFN A+Q+T  M SKRKLDEG E+EQ+PE+KSR+R        + DPKKDFK+DSESSRR+RSGRY+ETRDGRYRED KID
Subjt:  ERGKNCADGLDKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKID

Query:  SESNTRSRYSAHNEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEK
        SESN RSRYSAHNED+DRKSTRTGSRYTEETEHGSRH+ KANESHH  RTDQD EE KRH  SRYEE RGRKHERDEG+KSSRE ERGEYQPSSR RSEK
Subjt:  SESNTRSRYSAHNEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEK

Query:  DY---ESTRDREDSRKRAKYESRSSRHD
        DY   ESTRDR+D RKRAKY+SRSSR D
Subjt:  DY---ESTRDREDSRKRAKYESRSSRHD

XP_038884695.1 dentin sialophosphoprotein-like [Benincasa hispida]0.0e+0070.66Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT  EISEKLREARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
        ASGSEEKDG SAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGS GD EQVKEEISDPSR RREGQNADIKRHEKSEHSFLDRELNWK+ G EDQ+D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD

Query:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK
        DKD KK  SKELKGHQK +KRRPKDD SD DS       +NLRDSRR DSESDLD DV +KYVASR   KNRRHDSDDSS TDSGGE K TKKH R+KR+
Subjt:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK

Query:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHT-SKKQKKSTRHDSDDSDSFTDGDKIGM-DSHQ
        DDPESD DSD DQKY+TSRKHKKNRRHD D+SSD+DSGGEHKKTK+++R+N+RGHGSDP SD+DKK+T SKK +K+ RHDSDDSDS TDGD+ GM  SH+
Subjt:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHT-SKKQKKSTRHDSDDSDSFTDGDKIGM-DSHQ

Query:  KGSGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYG-ESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDD
        KGS RH+SQKVK QRS+KQ+STDE+NSDS +++K RQLKH+NQHGKRYG ESDSSDHDSSDSDVG KKS HR+ SKR GKSRVDSES+ EKSRK+ KKD 
Subjt:  KGSGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYG-ESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDD

Query:  RRRRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGR
         R RHDID+EKSGDNSSS  E+VKRRRGR ++ DD+SEEEGEY GRSGKI TKGKIDAKRQ D + NSD SLAV RKG+D+HKRAKK SSGD  +LEKG 
Subjt:  RRRRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGR

Query:  KLSSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCAD
        K S GARERGKG+LNH                                                                                  AD
Subjt:  KLSSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCAD

Query:  GLDKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNS-------------------------------------------------DPK
        GL+KFKKDSI+EFNHASQ+TD MNSKRK DEG +NEQ+ ESKSRNRNS                                                 +PK
Subjt:  GLDKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNS-------------------------------------------------DPK

Query:  KDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPR
        K F++DSESSRR+RSGRYDETRDGRYRED KIDSESN RSRYS  +EDDDRK+T+TGSR+TEETEHGSRHHRKANESHH  RT +DTEEEKRHSRYEEPR
Subjt:  KDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPR

Query:  GRKHERDEGLKSSREVERGEYQPSSRQRSEKDY---ESTRDREDSRKRAKYESRSSRHDNH
        GRKHER+EGLKS REVERGEYQPSSR RSEKDY   ESTRDR+DSRKRAKYESRSSR DNH
Subjt:  GRKHERDEGLKSSREVERGEYQPSSRQRSEKDY---ESTRDREDSRKRAKYESRSSRHDNH

TrEMBL top hitse value%identityAlignment
A0A0A0LQ00 cwf21 domain-containing protein0.0e+0091.43Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLN+QGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSL D EQVK+EISDPSR+RREGQNAD+KRHEKSEHSFLDR+LNWK+RGTEDQ+D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD

Query:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK
        DKDVKKGASKE+K  QKDKKRR KDD SD DSGE KGTKKNLRDSRR DSESDLDIDVNNKYVASR SKKNRRHDSDDSS TDSGGEHKVTKKHSRNKRK
Subjt:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK

Query:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSH-QK
        D+ E+DSDSDLDQKYLTSRKHKKNRRHDSDDSSD+DS GEHKKTK+SVR+NQRGHGSD DSDVDKKHTSKKQKKSTRHDSD SDSFTDGDKIGMDSH +K
Subjt:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSH-QK

Query:  GSGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRR
        GSGRHES KVKKQRS+KQDSTDETNSDS +EDKHRQLKHK+QHGKRYGESDSSDHDSSDSDVGR KSTHR+HSK TGKSRV+SESD EKSRKYP KDDRR
Subjt:  GSGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRR

Query:  RRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKL
        RRHDIDDEKSGDN SSSDELVKRRRGRRH+ DDSSEEEGEYFGRSGKI TKGKIDAKRQ D SNNSD SLAV RKGDD+HK+AKKY SGDGFNLEKG KL
Subjt:  RRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKL

Query:  SSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGL
        SSGARERGKGNL+H EGRRHNTDDKSEEEGEYLGRSGK+ATKRK+D KRQHDDSENSDDSLAVKHKRAKKY SSDDSDLEKGVKSTDGARERGKN ADGL
Subjt:  SSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGL

Query:  DKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDR
         KFKKDSI+E NHASQRTDKMN KRKLDEG E EQEPESKSRNRNSDPKKD KHDSESSRRSRSGRYD+TRDGRYRED KIDSESNTRSRYSA  EDDDR
Subjt:  DKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDR

Query:  KSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY---ESTRDREDSRKRAKY
        KS RTGSRY+EETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY   ESTRDREDSRKR KY
Subjt:  KSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY---ESTRDREDSRKRAKY

Query:  ESRSSRHDNH
        ESRSSR DNH
Subjt:  ESRSSRHDNH

A0A1S3BBX0 dentin sialophosphoprotein-like0.0e+00100Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD

Query:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK
        DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK
Subjt:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK

Query:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG
        DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG
Subjt:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG

Query:  SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRRR
        SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRRR
Subjt:  SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRRR

Query:  RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS
        RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS
Subjt:  RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS

Query:  SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLD
        SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLD
Subjt:  SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLD

Query:  KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK
        KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK
Subjt:  KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK

Query:  STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS
        STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS
Subjt:  STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS

Query:  SRHDNH
        SRHDNH
Subjt:  SRHDNH

A0A5A7VCH8 Dentin sialophosphoprotein-like0.0e+0099.23Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSL DGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD

Query:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK
        DKDVKKGASKELKGHQKDKKRRPKDD SD DSGEHKGTKKNLRDSRRIDSES+LDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK
Subjt:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRK

Query:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG
        DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG
Subjt:  DDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKG

Query:  SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRRR
        SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKD RRR
Subjt:  SGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRRR

Query:  RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS
        RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS
Subjt:  RHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLS

Query:  SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLD
        SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGK+ATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKN ADGLD
Subjt:  SGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLD

Query:  KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK
        KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK
Subjt:  KFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKIDSESNTRSRYSAHNEDDDRK

Query:  STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS
        STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS
Subjt:  STRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYESTRDREDSRKRAKYESRS

Query:  SRHDNH
        SRHDNH
Subjt:  SRHDNH

A0A6J1ESM6 dentin sialophosphoprotein-like1.8e-28562.35Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYTE EIS+KL+EARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
        ASGSEEKDG SAIVLADK+VSDTQ+HQIAARKEEQMKTLRAALGL S  D EQV E ISDP+R+RREGQNADIKRHEKSEHSFLDRELNWK+ G+ED  D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD

Query:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADS-GE-HKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNK
        DK  KK  SKELKGH KD +RRPKDD SD DS GE HKGTKKNLRD+RR DSESD + D ++KY  SRKSKKNRRHDSD SS TDSGGE K TKKH R+ 
Subjt:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADS-GE-HKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNK

Query:  RKDDPESDSDSDLDQKYLTSRKHKKNRRHDSDD-------------------------------------------------------------------
        R+D P+ D DS+ DQKY TSRKHKKNRRHDSDD                                                                   
Subjt:  RKDDPESDSDSDLDQKYLTSRKHKKNRRHDSDD-------------------------------------------------------------------

Query:  ---------------------------------------SSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKH-TSKKQKKSTRHDSDDSDSFTDGDK
                                               SSD+DSGGEHK+TK+S+++N+R   SD DSD+DKK+ TSKKQ+K+    SDDSDS  D  +
Subjt:  ---------------------------------------SSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKH-TSKKQKKSTRHDSDDSDSFTDGDK

Query:  IGMDSHQKGSGRHESQKV-KKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYG-ESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKS
         GM SH+KGSGR +SQKV KKQR +KQ+STDE+NSDS ++DK RQLKHKNQHGKRYG +SDSSD DSSDSDVGR KS HR+ SKR GKSRVDSESD EK 
Subjt:  IGMDSHQKGSGRHESQKV-KKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYG-ESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKS

Query:  RKYPKKDDRRRRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGD
        RK+PKKD  RRRHD D+++SGDNSSSSDE+VK RR RRH++DD SEEEGEYFG+SGKI TKG I AKR+ D S+ SD S AVDRKG+D+ KRAKK+SSGD
Subjt:  RKYPKKDDRRRRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGD

Query:  GFNLEKGRKLSSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGAR
        G + +KG K S GARERGKG+ NH +G                                                             L++ V +     
Subjt:  GFNLEKGRKLSSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGAR

Query:  ERGKNCADGLDKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKID
         + +N          D + EFN A+Q+T  M SKRKLDEG E+EQ+PE+KSR+R        + DPKKDFK+DSESSRR+RSGRY+ETRDGRYRED KID
Subjt:  ERGKNCADGLDKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKID

Query:  SESNTRSRYSAHNEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEK
        SESN RSRYSAHNED+DRKSTRTGSRYTEETEHGSRH+ KANESHH  RTDQD EE KRH  SRYEE RGRKHERDEG+KSSRE ERGEYQPSSR RSEK
Subjt:  SESNTRSRYSAHNEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEK

Query:  DY---ESTRDREDSRKRAKYESRSSRHD
        DY   ESTRDR+D RKRAKY+SRSSR D
Subjt:  DY---ESTRDREDSRKRAKYESRSSRHD

A0A6J1K7B6 dentin sialophosphoprotein-like2.7e-28162.06Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYTE EIS+KL+EARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
        ASGSEEKDG SAIVLADK+VSDTQ+HQIAARKEEQMKTLRAALGL S  D EQV E ISDP+R+RREGQNADIKR EKSEHSFLDRELNWKR G+ED  D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD

Query:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADS-GE-HKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNK
        DK  KK  SKELKGH KD +RRPKDD SD DS GE HKGTKKNLRD+RR DSESD + D ++KY  SRKSKKNRRHDSD SS TDSGGE K TKKH R+ 
Subjt:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADS-GE-HKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNK

Query:  RKDDPESDSDSDLDQKYLTSR-------------------------------------------------------------------------------
        R+D P+ D DS+ DQKY TSR                                                                               
Subjt:  RKDDPESDSDSDLDQKYLTSR-------------------------------------------------------------------------------

Query:  ---------------------------KHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKH-TSKKQKKSTRHDSDDSDSFTDGDK
                                   KHKKNRRHDSD SSD+DSGGEHK+TK+S+++N+R   SD DSD+DKK+ TSKKQ+K+   DSDDSDS  D  +
Subjt:  ---------------------------KHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKH-TSKKQKKSTRHDSDDSDSFTDGDK

Query:  IGMDSHQKGSGRHESQKV-KKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYG-ESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKS
         GM SH+KGSGR +SQKV KKQRS+KQ+STDE+NSDS ++DK RQLK+KNQHGKRYG +SDSSD DSSDSDVGR KS HR+HSKRTGKSRVDSESD EK 
Subjt:  IGMDSHQKGSGRHESQKV-KKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYG-ESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKS

Query:  RKYPKKDDRRRRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGD
        RK+PKKD  RRRHD D+++SGDNSSSSDE+VKRRR RRH++DD S EEGEYFG+SGKI TKG I AKR+ + S+ SD S AVDR+G+D+ KRAKK+S GD
Subjt:  RKYPKKDDRRRRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGD

Query:  GFNLEKGRKLSSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGAR
        G + +KG K S GARERGKG+ NH +G                                                             L++ V +     
Subjt:  GFNLEKGRKLSSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGAR

Query:  ERGKNCADGLDKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKID
         + +N          DS+ EFN A+Q+T  M SKRKLDEG E+EQ+PE+KS++R        + DPKKDFK+DSESSRR+RSGR+ ETRDGRYRED KID
Subjt:  ERGKNCADGLDKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDETRDGRYREDSKID

Query:  SESNTRSRYSAHNEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEK
        SESN RSRYSAHNEDDDRKS RTGSRYTEETEHGSRH+ KANESHH  RTDQD EE KR   SRYEE RGRKHERDEG+KSSRE ERGEYQPSSR RSEK
Subjt:  SESNTRSRYSAHNEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEK

Query:  DY---ESTRDREDSRKRAKYESRSSRHD
        DY   ESTRDR+D RKRAKY+SRSSR D
Subjt:  DY---ESTRDREDSRKRAKYESRSSRHD

SwissProt top hitse value%identityAlignment
Q7RYH7 Pre-mRNA-splicing factor cwc-216.4e-0625.61Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA
        M + +GL TPRGSGT+GY+Q N    RP+    +   + F+         ++P+K +LEHDRKR++E+K+  L DKL ++G  E EI  +  E R  L  
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD
        A     ++   A     K +   Q H++A  K ++ + LR AL +                SR  +EG +   K+ E+     L+RE N    G      
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFD

Query:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSES------DLDIDVNNKYVASRKSKKNRRHDSDD---SSGTDSGGEHKVT
              G S               D   D D G  +G  +  RD  R++S        D D     +    R  +  R  + D    ++G D        
Subjt:  DKDVKKGASKELKGHQKDKKRRPKDDFSDADSGEHKGTKKNLRDSRRIDSES------DLDIDVNNKYVASRKSKKNRRHDSDD---SSGTDSGGEHKVT

Query:  KKHSRNKRKDDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSD
        ++ SR +        S S + ++ L SR   ++R +    S       + +    S RS  R +   PD D
Subjt:  KKHSRNKRKDDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSD

Arabidopsis top hitse value%identityAlignment
AT3G49601.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: mRNA splicing factor, Cwf21 (InterPro:IPR013170); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).1.6e-5234.42Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKT-GKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLE
        MYNGIGLQT RGSGTNGY+QTNKFFVRP+  GK  +  +GFE+D+GTAG+SKKPNK ILEHDRKRQI LKL ILEDKL DQGY++ EI++KL EAR +LE
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKT-GKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLE

Query:  AASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQF
        AA+ + E++       +D +VS+TQTHQ+AARKE+QM+  RAALG   L D +QV EE        REG    +K  E+ EHSFLDR+    R+  ++  
Subjt:  AASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQF

Query:  DDKDVKKGASKELKG------------HQKDKKRRPKDDFSDAD--SGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSG
        D+KD K   SK+ +G             +K+ K+R  DD S++D    + +   K     R+ +SESD           S  S      DSDD       
Subjt:  DDKDVKKGASKELKG------------HQKDKKRRPKDDFSDAD--SGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSG

Query:  GEHKVTKKHSRNKRKDDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQ---KKSTRHDSDD
           K TKK SR KR    ES+     D K L  + HKK+    S+ S   +   +H +  R+ R       S+P+S+ +K+   KK+   +   +   DD
Subjt:  GEHKVTKKHSRNKRKDDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQ---KKSTRHDSDD

Query:  SDSFTDGDKIGMDSHQKGSGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDS
         D   D  K       K + R       + +++KQ  +      + +  K ++ +   +HGK    SDS                     K   +   DS
Subjt:  SDSFTDGDKIGMDSHQKGSGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKNQHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDS

Query:  ESDFE-----KSRKYPKKDDRRRRHDIDDEKSGDNSSSSDELVKR---------RRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGS
        E+++E     K+  Y +    +R  D D++  G +    D+ VKR         R   R   ++  ++ G Y  R   +    +     +D Y    DG 
Subjt:  ESDFE-----KSRKYPKKDDRRRRHDIDDEKSGDNSSSSDELVKR---------RRGRRHSTDDSSEEEGEYFGRSGKITTKGKIDAKRQDDYSNNSDGS

Query:  LAVDRKGDDEHK--RAKKYSSGDGFNLEKGRKLSSGARERGK
         A  ++ DD+ +  R ++YSS       +GR     +R  GK
Subjt:  LAVDRKGDDEHK--RAKKYSSGDGFNLEKGRKLSSGARERGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATAACGGTATTGGATTACAGACTCCTAGAGGGTCTGGCACTAATGGTTATATCCAGACGAACAAGTTTTTTGTGAGGCCAAAGACTGGAAAGGTTGCTGAAAGCAC
CAGAGGATTTGAAGAAGATCAGGGCACTGCTGGAGTTTCTAAGAAACCTAATAAAGACATTCTCGAGCATGACCGCAAGCGTCAGATTGAACTCAAACTAGTCATACTTG
AGGACAAGCTCAATGACCAAGGCTATACAGAGAAGGAAATTTCTGAAAAGTTGAGGGAAGCTCGCGAGAATTTGGAAGCTGCTTCTGGTTCTGAGGAAAAAGATGGATCC
TCTGCCATTGTTCTTGCTGATAAGAGGGTATCTGATACACAGACTCACCAGATTGCTGCAAGAAAGGAGGAGCAGATGAAAACATTGAGAGCTGCTCTTGGGTTGGGTTC
ATTGGGTGATGGTGAACAGGTTAAAGAAGAGATTTCTGATCCATCAAGGAGTAGAAGAGAGGGTCAGAATGCTGATATTAAACGTCATGAGAAGTCTGAACACTCTTTTT
TGGACAGAGAATTGAACTGGAAAAGGCGTGGCACTGAAGATCAGTTTGATGATAAGGATGTCAAAAAGGGGGCTTCGAAAGAGTTGAAAGGTCATCAAAAGGATAAAAAA
AGAAGACCCAAGGATGATTTTTCTGACGCAGATTCTGGTGAGCATAAGGGAACCAAGAAGAACTTGAGAGATAGTAGAAGGATTGATTCTGAAAGTGACCTTGACATTGA
TGTCAACAATAAATACGTCGCCTCAAGGAAGTCTAAAAAGAATAGAAGGCATGATAGCGACGATTCTTCTGGAACTGATTCTGGAGGCGAGCACAAGGTAACCAAGAAGC
ACTCGAGAAATAAACGGAAAGATGATCCTGAAAGTGACTCAGACAGCGATCTTGACCAGAAATATTTGACCTCGAGAAAGCATAAGAAAAACAGAAGGCATGATAGTGAT
GATTCTTCTGATAGTGATTCTGGTGGAGAGCACAAGAAAACCAAGAGGAGTGTGAGAAGTAATCAAAGAGGTCATGGAAGTGATCCCGACAGTGACGTTGACAAGAAACA
CACCTCAAAGAAGCAGAAGAAAAGCACAAGGCATGATAGCGATGATTCTGATTCCTTTACAGATGGTGATAAGATTGGGATGGACAGTCACCAGAAAGGATCTGGTCGAC
ATGAAAGTCAAAAGGTGAAGAAGCAAAGAAGCCAGAAACAGGATTCTACTGATGAAACCAATTCTGACAGTGTGGTTGAAGATAAACACAGGCAACTGAAGCACAAAAAC
CAGCATGGTAAAAGATATGGAGAAAGTGACAGCTCTGACCATGACAGTTCTGATTCTGATGTAGGTCGCAAGAAGAGTACGCATAGGTTTCACAGCAAACGTACAGGAAA
GAGCAGGGTAGATAGTGAATCCGATTTTGAAAAGTCGAGAAAGTATCCTAAGAAAGATGATCGTAGACGCAGACATGACATTGATGATGAAAAAAGTGGTGATAACAGCT
CTAGCAGTGATGAATTAGTGAAGAGGCGCAGAGGTAGGAGGCACAGTACTGATGATAGTTCTGAAGAAGAAGGTGAATATTTTGGTAGAAGTGGTAAGATAACCACAAAA
GGAAAAATAGATGCTAAAAGGCAAGATGATTATAGCAATAATTCTGATGGTAGCTTAGCAGTTGATAGAAAGGGCGATGATGAACACAAGAGAGCTAAGAAATATTCGTC
TGGTGACGGTTTTAATCTTGAGAAGGGAAGAAAATTGAGCAGTGGAGCTCGTGAAAGAGGAAAAGGGAACTTAAACCATCCAGAAGGTAGGAGACACAATACTGATGATA
AATCTGAAGAAGAAGGTGAATATCTTGGTAGAAGTGGTAAGATGGCTACAAAAAGAAAAATGGATGCTAAAAGGCAACATGATGACAGTGAGAATTCTGATGATAGCCTA
GCAGTTAAACACAAGAGAGCTAAGAAATATTCGTCAAGTGACGATTCTGATCTAGAGAAGGGAGTAAAATCAACTGATGGAGCTCGTGAAAGAGGGAAAAACTGTGCAGA
TGGTTTGGACAAGTTTAAGAAAGATTCTATCCATGAGTTCAATCATGCAAGTCAACGTACAGACAAAATGAACAGCAAGAGAAAGCTTGATGAAGGTCGTGAAAATGAGC
AAGAGCCAGAGTCAAAATCTAGAAATCGAAATTCTGACCCCAAGAAAGATTTCAAACATGATTCTGAATCAAGCAGAAGATCACGAAGTGGTAGGTACGATGAGACAAGG
GATGGACGGTACAGGGAAGACTCCAAAATTGACTCTGAATCAAACACTAGATCACGCTACAGTGCACACAATGAGGATGATGACAGAAAGTCGACTCGAACAGGAAGCAG
ATACACTGAAGAAACTGAGCATGGAAGTAGACATCATCGCAAGGCTAACGAGTCTCATCACCACCGCAGGACTGATCAAGATACTGAAGAGGAAAAAAGGCACAGCAGAT
ATGAGGAGCCTAGAGGGAGAAAACATGAAAGAGATGAGGGTCTAAAATCGAGCAGGGAAGTTGAAAGAGGGGAGTATCAACCAAGTAGCAGGCAGAGATCTGAGAAAGAT
TATGAATCTACAAGAGATAGGGAGGATTCCAGAAAGAGGGCCAAATATGAATCTCGATCGAGCAGACATGATAATCATTAG
mRNA sequenceShow/hide mRNA sequence
TTCACATAAATACCTTAAGGATCTTTTATATAATTCAATAAAATATCTAATAAGTTAGGGTTCTTCATCGCTTGACCGAAAATCTCAATTTCTCTCTCATCTCTCAACGC
TTCGATTTATCTCAAGCTTCTCCGTCTGATCTCAATTTCTCTCTCGTCTCTCAATGTTGTCGCCGTATTTGATCGTTTAAAATCTCTCCCTTCTGGTGCTCTCTTTGATT
TCGTTCTTAAGCCTCACTCTCGATTGTTCTTCAAGTTTGAGGTTGGCGAAGCAGGGCTATGTATAACGGTATTGGATTACAGACTCCTAGAGGGTCTGGCACTAATGGTT
ATATCCAGACGAACAAGTTTTTTGTGAGGCCAAAGACTGGAAAGGTTGCTGAAAGCACCAGAGGATTTGAAGAAGATCAGGGCACTGCTGGAGTTTCTAAGAAACCTAAT
AAAGACATTCTCGAGCATGACCGCAAGCGTCAGATTGAACTCAAACTAGTCATACTTGAGGACAAGCTCAATGACCAAGGCTATACAGAGAAGGAAATTTCTGAAAAGTT
GAGGGAAGCTCGCGAGAATTTGGAAGCTGCTTCTGGTTCTGAGGAAAAAGATGGATCCTCTGCCATTGTTCTTGCTGATAAGAGGGTATCTGATACACAGACTCACCAGA
TTGCTGCAAGAAAGGAGGAGCAGATGAAAACATTGAGAGCTGCTCTTGGGTTGGGTTCATTGGGTGATGGTGAACAGGTTAAAGAAGAGATTTCTGATCCATCAAGGAGT
AGAAGAGAGGGTCAGAATGCTGATATTAAACGTCATGAGAAGTCTGAACACTCTTTTTTGGACAGAGAATTGAACTGGAAAAGGCGTGGCACTGAAGATCAGTTTGATGA
TAAGGATGTCAAAAAGGGGGCTTCGAAAGAGTTGAAAGGTCATCAAAAGGATAAAAAAAGAAGACCCAAGGATGATTTTTCTGACGCAGATTCTGGTGAGCATAAGGGAA
CCAAGAAGAACTTGAGAGATAGTAGAAGGATTGATTCTGAAAGTGACCTTGACATTGATGTCAACAATAAATACGTCGCCTCAAGGAAGTCTAAAAAGAATAGAAGGCAT
GATAGCGACGATTCTTCTGGAACTGATTCTGGAGGCGAGCACAAGGTAACCAAGAAGCACTCGAGAAATAAACGGAAAGATGATCCTGAAAGTGACTCAGACAGCGATCT
TGACCAGAAATATTTGACCTCGAGAAAGCATAAGAAAAACAGAAGGCATGATAGTGATGATTCTTCTGATAGTGATTCTGGTGGAGAGCACAAGAAAACCAAGAGGAGTG
TGAGAAGTAATCAAAGAGGTCATGGAAGTGATCCCGACAGTGACGTTGACAAGAAACACACCTCAAAGAAGCAGAAGAAAAGCACAAGGCATGATAGCGATGATTCTGAT
TCCTTTACAGATGGTGATAAGATTGGGATGGACAGTCACCAGAAAGGATCTGGTCGACATGAAAGTCAAAAGGTGAAGAAGCAAAGAAGCCAGAAACAGGATTCTACTGA
TGAAACCAATTCTGACAGTGTGGTTGAAGATAAACACAGGCAACTGAAGCACAAAAACCAGCATGGTAAAAGATATGGAGAAAGTGACAGCTCTGACCATGACAGTTCTG
ATTCTGATGTAGGTCGCAAGAAGAGTACGCATAGGTTTCACAGCAAACGTACAGGAAAGAGCAGGGTAGATAGTGAATCCGATTTTGAAAAGTCGAGAAAGTATCCTAAG
AAAGATGATCGTAGACGCAGACATGACATTGATGATGAAAAAAGTGGTGATAACAGCTCTAGCAGTGATGAATTAGTGAAGAGGCGCAGAGGTAGGAGGCACAGTACTGA
TGATAGTTCTGAAGAAGAAGGTGAATATTTTGGTAGAAGTGGTAAGATAACCACAAAAGGAAAAATAGATGCTAAAAGGCAAGATGATTATAGCAATAATTCTGATGGTA
GCTTAGCAGTTGATAGAAAGGGCGATGATGAACACAAGAGAGCTAAGAAATATTCGTCTGGTGACGGTTTTAATCTTGAGAAGGGAAGAAAATTGAGCAGTGGAGCTCGT
GAAAGAGGAAAAGGGAACTTAAACCATCCAGAAGGTAGGAGACACAATACTGATGATAAATCTGAAGAAGAAGGTGAATATCTTGGTAGAAGTGGTAAGATGGCTACAAA
AAGAAAAATGGATGCTAAAAGGCAACATGATGACAGTGAGAATTCTGATGATAGCCTAGCAGTTAAACACAAGAGAGCTAAGAAATATTCGTCAAGTGACGATTCTGATC
TAGAGAAGGGAGTAAAATCAACTGATGGAGCTCGTGAAAGAGGGAAAAACTGTGCAGATGGTTTGGACAAGTTTAAGAAAGATTCTATCCATGAGTTCAATCATGCAAGT
CAACGTACAGACAAAATGAACAGCAAGAGAAAGCTTGATGAAGGTCGTGAAAATGAGCAAGAGCCAGAGTCAAAATCTAGAAATCGAAATTCTGACCCCAAGAAAGATTT
CAAACATGATTCTGAATCAAGCAGAAGATCACGAAGTGGTAGGTACGATGAGACAAGGGATGGACGGTACAGGGAAGACTCCAAAATTGACTCTGAATCAAACACTAGAT
CACGCTACAGTGCACACAATGAGGATGATGACAGAAAGTCGACTCGAACAGGAAGCAGATACACTGAAGAAACTGAGCATGGAAGTAGACATCATCGCAAGGCTAACGAG
TCTCATCACCACCGCAGGACTGATCAAGATACTGAAGAGGAAAAAAGGCACAGCAGATATGAGGAGCCTAGAGGGAGAAAACATGAAAGAGATGAGGGTCTAAAATCGAG
CAGGGAAGTTGAAAGAGGGGAGTATCAACCAAGTAGCAGGCAGAGATCTGAGAAAGATTATGAATCTACAAGAGATAGGGAGGATTCCAGAAAGAGGGCCAAATATGAAT
CTCGATCGAGCAGACATGATAATCATTAGGCTTGGGTTCAGTATTCGTATGTTTGCTTATTGTTTATTTCCTCTTTCCTTTAACGACAATTTATGTGAAATATTGTTTGC
TTTCAGCTTGCTTCATAGTTGCTACAGAAAAGCTTGTTACGGGCAGGGACAGAAATGGAATTGCCTTTTGTGTATTGTTCATTGTTGAAATGTATTCATCAGGTTTTGGA
AAGGCGATCCAATATTTGAATCCCATTTGTGCTAGCAAGTTTGTTTGTATGGGCAGAAGATTCTGAGTTCCGGAGTAAATTTTGAAGCTCTATTGAAGTTCTTATAATGT
CTCTATTTTGTTTTTTCTTTTCTTTTCACTATATAAGTCTAGTTTTAGAGATTCATTGACATATACTTTTATTTTTATTTTTTTTAATTGGAAGTTCAAATCTTAAGTGA
AAAGTATTTCTAATCTTCATGTTGTC
Protein sequenceShow/hide protein sequence
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEAASGSEEKDGS
SAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSEHSFLDRELNWKRRGTEDQFDDKDVKKGASKELKGHQKDKK
RRPKDDFSDADSGEHKGTKKNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKRKDDPESDSDSDLDQKYLTSRKHKKNRRHDSD
DSSDSDSGGEHKKTKRSVRSNQRGHGSDPDSDVDKKHTSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKGSGRHESQKVKKQRSQKQDSTDETNSDSVVEDKHRQLKHKN
QHGKRYGESDSSDHDSSDSDVGRKKSTHRFHSKRTGKSRVDSESDFEKSRKYPKKDDRRRRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSEEEGEYFGRSGKITTK
GKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLEKGRKLSSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSENSDDSL
AVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLDKFKKDSIHEFNHASQRTDKMNSKRKLDEGRENEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDETR
DGRYREDSKIDSESNTRSRYSAHNEDDDRKSTRTGSRYTEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKD
YESTRDREDSRKRAKYESRSSRHDNH