CuGenDBv2

CSPI02G22930 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI02G22930
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: cwf21 domain-containing protein
Genome location: Chr2:19885493..19889609
RNA-Seq Expression: CSPI02G22930
Synteny: CSPI02G22930
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains: IPR013170 - mRNA splicing factor Cwf21 domain


Homology
GenBank top hits | e value | % identity | Alignment
KAA0064984.1 dentin sialophosphoprotein-like [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 91.53
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLN+QGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDD EQVK+EISDPSR+RREGQNAD+KRHEKSEHSFLDR+LNWK+RGTEDQ+D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD

Query:  DKDVKKGASKEMK--QKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRK
        DKDVKKGASKE+K  QKDKKRR KDDSSDTDSGE KGTKKNLRDSRR DSES+LDIDVNNKYVASR SKKNRRHDSDDSS TDSGGEHKVTKKH RNKRK
Subjt:  DKDVKKGASKEMK--QKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRK

Query:  DNLETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSHKKG
        D+ E+DSDSDLDQKYLTSRKHKKNRRHDSDDSSD+DS GEHKKTK+SVR+NQRGHGSD DSDVDKKHTSKKQKKSTRHDSD SDSFTDGDKIG DSH+KG
Subjt:  DNLETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSHKKG

Query:  SGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRR
        SGRHES KVKKQRS+KQDSTDETNSDS +EDKHRQLKHK+QHGKRYGESDSSDHDSSDSDVGR KS HR+HSK TGKSRV+SESD EKSRKYPKKD RRR
Subjt:  SGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRR

Query:  RHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHD-SNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLS
        RHDIDDEKSGDN SSSDELVKRRRGRRH+ DDSSEEEGEYFGRSGKI TKGKIDAKRQ D SNNSD SLAV RKGDD+HK+AKKY SGDGFNLEKG KLS
Subjt:  RHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHD-SNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLS

Query:  SGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKY-SSDDSVLEKGVKSTDGARERGKNHADGLY
        SGARERGKGNL+H EGRRHNTDDKSEEEGEYLGRSGKIATKRK+D KRQHDDSENSDDSLAVKHKRAKKY SSDDS LEKGVKSTDGARERGKN ADGL 
Subjt:  SGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKY-SSDDSVLEKGVKSTDGARERGKNHADGLY

Query:  KFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRK
        KFKKDSI+E NHASQRTDKMN KRKLDEG E EQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYD+TRDGRYRED KIDSESNTRSRYSA  EDDDRK
Subjt:  KFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRK

Query:  SIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYE
        S RTGSRY+EETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY   ESTRDREDSRKR KYE
Subjt:  SIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYE

Query:  SRSSRRDNH
        SRSSR DNH
Subjt:  SRSSRRDNH

XP_004138875.1 dentin sialophosphoprotein [Cucumis sativus] | e value: 0.0e+00 | % identity: 99.23
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD

Query:  DKDVKKGASKEMKQKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRKDN
        DKDVKKGASKEMKQKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKH RNKRKDN
Subjt:  DKDVKKGASKEMKQKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRKDN

Query:  LETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSH-KKGS
        LETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIG DSH KKGS
Subjt:  LETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSH-KKGS

Query:  GRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRRR
        GRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKS HRYHSKHTGKSRVNSESDSEKSRKYP KDDRRRR
Subjt:  GRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRRR

Query:  HDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHDSNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLSSG
        HDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHDSNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLSSG
Subjt:  HDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHDSNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLSSG

Query:  ARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARERGKNHADGLYKFK
        ARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDS LEKGVKSTDGARERGKNHADGLYKFK
Subjt:  ARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARERGKNHADGLYKFK

Query:  KDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRKSIR
        KDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKD KHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRKSIR
Subjt:  KDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRKSIR

Query:  TGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYESRS
        TGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYESRS
Subjt:  TGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYESRS

Query:  SRRDNH
        SRRDNH
Subjt:  SRRDNH

XP_008445109.1 PREDICTED: dentin sialophosphoprotein-like [Cucumis melo] | e value: 0.0e+00 | % identity: 91.31
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLN+QGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSL D EQVK+EISDPSR+RREGQNAD+KRHEKSEHSFLDR+LNWK+RGTEDQ+D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD

Query:  DKDVKKGASKEMK--QKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRK
        DKDVKKGASKE+K  QKDKKRR KDD SD DSGE KGTKKNLRDSRR DSESDLDIDVNNKYVASR SKKNRRHDSDDSS TDSGGEHKVTKKH RNKRK
Subjt:  DKDVKKGASKEMK--QKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRK

Query:  DNLETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSHKKG
        D+ E+DSDSDLDQKYLTSRKHKKNRRHDSDDSSD+DS GEHKKTK+SVR+NQRGHGSD DSDVDKKHTSKKQKKSTRHDSD SDSFTDGDKIG DSH+KG
Subjt:  DNLETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSHKKG

Query:  SGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRR
        SGRHES KVKKQRS+KQDSTDETNSDS +EDKHRQLKHK+QHGKRYGESDSSDHDSSDSDVGR KS HR+HSK TGKSRV+SESD EKSRKYPKKDDRRR
Subjt:  SGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRR

Query:  RHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHD-SNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLS
        RHDIDDEKSGDN SSSDELVKRRRGRRH+ DDSSEEEGEYFGRSGKI TKGKIDAKRQ D SNNSD SLAV RKGDD+HK+AKKY SGDGFNLEKG KLS
Subjt:  RHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHD-SNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLS

Query:  SGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKY-SSDDSVLEKGVKSTDGARERGKNHADGLY
        SGARERGKGNL+H EGRRHNTDDKSEEEGEYLGRSGK+ATKRK+D KRQHDDSENSDDSLAVKHKRAKKY SSDDS LEKGVKSTDGARERGKN ADGL 
Subjt:  SGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKY-SSDDSVLEKGVKSTDGARERGKNHADGLY

Query:  KFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRK
        KFKKDSI+E NHASQRTDKMN KRKLDEG E EQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYD+TRDGRYRED KIDSESNTRSRYSA  EDDDRK
Subjt:  KFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRK

Query:  SIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYE
        S RTGSRY+EETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY   ESTRDREDSRKR KYE
Subjt:  SIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYE

Query:  SRSSRRDNH
        SRSSR DNH
Subjt:  SRSSRRDNH

XP_022929608.1 dentin sialophosphoprotein-like [Cucurbita moschata] | e value: 3.9e-279 | % identity: 61.89
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL +QGYTE EIS+KL+EARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
        ASGSEEKDG SAIVLADK+VSDTQ+HQIAARKEEQMKTLRAALGL S +DSEQV + ISDP+RNRREGQNAD+KRHEKSEHSFLDR+LNWKK G+ED  D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD

Query:  DKDVKKGASKEMKQKDK-KRRSKDDSSDTDS-GE-RKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKR
        DK  KK  SKE+K   K +RR KDDSSD DS GE  KGTKKNLRD+RRNDSESD + D ++KY  SR SKKNRRHDSD SS+TDSGGE K TKKHLR+ R
Subjt:  DKDVKKGASKEMKQKDK-KRRSKDDSSDTDS-GE-RKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKR

Query:  KDNLETDSDSDLDQKYLTSRKHKKNRRHDSDD--------------------------------------------------------------------
        +D  + D DS+ DQKY TSRKHKKNRRHDSDD                                                                    
Subjt:  KDNLETDSDSDLDQKYLTSRKHKKNRRHDSDD--------------------------------------------------------------------

Query:  --------------------------------------SSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKH-TSKKQKKSTRHDSDGSDSFTDGDKI
                                              SSDTDS GEHK+TKKS++NN+R   SD DSD+DKK+ TSKKQ+K+    SD SDS  D  + 
Subjt:  --------------------------------------SSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKH-TSKKQKKSTRHDSDGSDSFTDGDKI

Query:  GTDSHKKGSGRHESHKV-KKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYG-ESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSR
        G  SH+KGSGR +S KV KKQR RKQ+STDE+NSDSGI+DK RQLKHK+QHGKRYG +SDSSD DSSDSDVGR KS HRY SK  GKSRV+SESDSEK R
Subjt:  GTDSHKKGSGRHESHKV-KKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYG-ESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSR

Query:  KYPKKDDRRRRHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQH-DSNNSDDSLAVHRKGDDDHKKAKKYLSGDG
        K+PKKD  RRRHD D+++SGDN SSSDE+VK RR RRHN DD SEEEGEYFG+SGKIATKG I AKR+H DS+ SDDS AV RKG+D  K+AKK+ SGDG
Subjt:  KYPKKDDRRRRHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQH-DSNNSDDSLAVHRKGDDDHKKAKKYLSGDG

Query:  FNLEKGGKLSSGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARER
         + +KG K S GARERGKG+ +HA+G                                                        D+SV     K+T      
Subjt:  FNLEKGGKLSSGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARER

Query:  GKNHADGLYKFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSE
                YK + D ++E N A+Q+T  M  KRKLDEG E EQ+PE+KSR+R        + DPKKDFK+DSESSRR+RSGRY++TRDGRYRED KIDSE
Subjt:  GKNHADGLYKFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSE

Query:  SNTRSRYSAQIEDDDRKSIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY
        SN RSRYSA  ED+DRKS RTGSRY+EETEHGSRH+ KANESHH  RTDQD EE KRH  SRYEE RGRKHERDEG+KSSRE ERGEYQPSSR RSEKDY
Subjt:  SNTRSRYSAQIEDDDRKSIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY

Query:  ETRESTRDREDSRKRPKYESRSSRRD
        ET+ESTRDR+D RKR KY+SRSSRRD
Subjt:  ETRESTRDREDSRKRPKYESRSSRRD

XP_038884695.1 dentin sialophosphoprotein-like [Benincasa hispida] | e value: 0.0e+00 | % identity: 70.31
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL +QGYT  EISEKLREARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
        ASGSEEKDG SAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGS  D+EQVK+EISDPSR RREGQNAD+KRHEKSEHSFLDR+LNWKK G EDQYD
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD

Query:  DKDVKKGASKEMK--QKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRK
        DKD KK  SKE+K  QK +KRR KDDSSDTDS +R     NLRDSRRNDSESDLD DV +KYVASR   KNRRHDSDDSS+TDSGGE K TKKHLR+KR+
Subjt:  DKDVKKGASKEMK--QKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRK

Query:  DNLETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHT-SKKQKKSTRHDSDGSDSFTDGDKIGT-DSHK
        D+ E+D DSD DQKY+TSRKHKKNRRHD D+SSDTDS GEHKKTKK++RNN+RGHGSD  SD+DKK+T SKK +K+ RHDSD SDS TDGD+ G   SHK
Subjt:  DNLETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHT-SKKQKKSTRHDSDGSDSFTDGDKIGT-DSHK

Query:  KGSGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYG-ESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDD
        KGS RH+S KVK QRSRKQ+STDE+NSDSGI++K RQLKH++QHGKRYG ESDSSDHDSSDSDVG  KS HRY SK  GKSRV+SES+SEKSRK+ KKD 
Subjt:  KGSGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYG-ESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDD

Query:  RRRRHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHDSN-NSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGG
         R RHDID+EKSGDN SS  E+VKRRRGR +N DD+SEEEGEY GRSGKIATKGKIDAKRQHD N NSDDSLAV RKG+D HK+AKK  SGD  +LEKG 
Subjt:  RRRRHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHDSN-NSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGG

Query:  KLSSGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARERGKNHADG
        K S GARERGKG+L                                                                                 NHADG
Subjt:  KLSSGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARERGKNHADG

Query:  LYKFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNS-------------------------------------------------DPKK
        L KFKKDSINE NHASQ+TD MN KRK DEG + EQ+ ESKSRNRNS                                                 +PKK
Subjt:  LYKFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNS-------------------------------------------------DPKK

Query:  DFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRKSIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRG
         F++DSESSRR+RSGRYD+TRDGRYRED KIDSESN RSRYS Q EDDDRK+ +TGSR++EETEHGSRHHRKANESHH  RT +DTEEEKRHSRYEEPRG
Subjt:  DFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRKSIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRG

Query:  RKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYESRSSRRDNH
        RKHER+EGLKS REVERGEYQPSSR RSEKDYETRESTRDR+DSRKR KYESRSSRRDNH
Subjt:  RKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYESRSSRRDNH

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LQ00 cwf21 domain-containing protein | e value: 0.0e+00 | % identity: 99.23
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD

Query:  DKDVKKGASKEMKQKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRKDN
        DKDVKKGASKEMKQKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKH RNKRKDN
Subjt:  DKDVKKGASKEMKQKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRKDN

Query:  LETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSH-KKGS
        LETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIG DSH KKGS
Subjt:  LETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSH-KKGS

Query:  GRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRRR
        GRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKS HRYHSKHTGKSRVNSESDSEKSRKYP KDDRRRR
Subjt:  GRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRRR

Query:  HDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHDSNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLSSG
        HDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHDSNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLSSG
Subjt:  HDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHDSNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLSSG

Query:  ARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARERGKNHADGLYKFK
        ARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDS LEKGVKSTDGARERGKNHADGLYKFK
Subjt:  ARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARERGKNHADGLYKFK

Query:  KDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRKSIR
        KDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKD KHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRKSIR
Subjt:  KDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRKSIR

Query:  TGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYESRS
        TGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYESRS
Subjt:  TGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYESRS

Query:  SRRDNH
        SRRDNH
Subjt:  SRRDNH

A0A1S3BBX0 dentin sialophosphoprotein-like | e value: 0.0e+00 | % identity: 91.31
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLN+QGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSL D EQVK+EISDPSR+RREGQNAD+KRHEKSEHSFLDR+LNWK+RGTEDQ+D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD

Query:  DKDVKKGASKEMK--QKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRK
        DKDVKKGASKE+K  QKDKKRR KDD SD DSGE KGTKKNLRDSRR DSESDLDIDVNNKYVASR SKKNRRHDSDDSS TDSGGEHKVTKKH RNKRK
Subjt:  DKDVKKGASKEMK--QKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRK

Query:  DNLETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSHKKG
        D+ E+DSDSDLDQKYLTSRKHKKNRRHDSDDSSD+DS GEHKKTK+SVR+NQRGHGSD DSDVDKKHTSKKQKKSTRHDSD SDSFTDGDKIG DSH+KG
Subjt:  DNLETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSHKKG

Query:  SGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRR
        SGRHES KVKKQRS+KQDSTDETNSDS +EDKHRQLKHK+QHGKRYGESDSSDHDSSDSDVGR KS HR+HSK TGKSRV+SESD EKSRKYPKKDDRRR
Subjt:  SGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRR

Query:  RHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHD-SNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLS
        RHDIDDEKSGDN SSSDELVKRRRGRRH+ DDSSEEEGEYFGRSGKI TKGKIDAKRQ D SNNSD SLAV RKGDD+HK+AKKY SGDGFNLEKG KLS
Subjt:  RHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHD-SNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLS

Query:  SGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKY-SSDDSVLEKGVKSTDGARERGKNHADGLY
        SGARERGKGNL+H EGRRHNTDDKSEEEGEYLGRSGK+ATKRK+D KRQHDDSENSDDSLAVKHKRAKKY SSDDS LEKGVKSTDGARERGKN ADGL 
Subjt:  SGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKY-SSDDSVLEKGVKSTDGARERGKNHADGLY

Query:  KFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRK
        KFKKDSI+E NHASQRTDKMN KRKLDEG E EQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYD+TRDGRYRED KIDSESNTRSRYSA  EDDDRK
Subjt:  KFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRK

Query:  SIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYE
        S RTGSRY+EETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY   ESTRDREDSRKR KYE
Subjt:  SIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYE

Query:  SRSSRRDNH
        SRSSR DNH
Subjt:  SRSSRRDNH

A0A5A7VCH8 Dentin sialophosphoprotein-like | e value: 0.0e+00 | % identity: 91.53
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLN+QGYTEKEISEKLREARENLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
        ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDD EQVK+EISDPSR+RREGQNAD+KRHEKSEHSFLDR+LNWK+RGTEDQ+D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD

Query:  DKDVKKGASKEMK--QKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRK
        DKDVKKGASKE+K  QKDKKRR KDDSSDTDSGE KGTKKNLRDSRR DSES+LDIDVNNKYVASR SKKNRRHDSDDSS TDSGGEHKVTKKH RNKRK
Subjt:  DKDVKKGASKEMK--QKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRK

Query:  DNLETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSHKKG
        D+ E+DSDSDLDQKYLTSRKHKKNRRHDSDDSSD+DS GEHKKTK+SVR+NQRGHGSD DSDVDKKHTSKKQKKSTRHDSD SDSFTDGDKIG DSH+KG
Subjt:  DNLETDSDSDLDQKYLTSRKHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSHKKG

Query:  SGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRR
        SGRHES KVKKQRS+KQDSTDETNSDS +EDKHRQLKHK+QHGKRYGESDSSDHDSSDSDVGR KS HR+HSK TGKSRV+SESD EKSRKYPKKD RRR
Subjt:  SGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRR

Query:  RHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHD-SNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLS
        RHDIDDEKSGDN SSSDELVKRRRGRRH+ DDSSEEEGEYFGRSGKI TKGKIDAKRQ D SNNSD SLAV RKGDD+HK+AKKY SGDGFNLEKG KLS
Subjt:  RHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHD-SNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLS

Query:  SGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKY-SSDDSVLEKGVKSTDGARERGKNHADGLY
        SGARERGKGNL+H EGRRHNTDDKSEEEGEYLGRSGKIATKRK+D KRQHDDSENSDDSLAVKHKRAKKY SSDDS LEKGVKSTDGARERGKN ADGL 
Subjt:  SGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKY-SSDDSVLEKGVKSTDGARERGKNHADGLY

Query:  KFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRK
        KFKKDSI+E NHASQRTDKMN KRKLDEG E EQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYD+TRDGRYRED KIDSESNTRSRYSA  EDDDRK
Subjt:  KFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSESNTRSRYSAQIEDDDRK

Query:  SIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYE
        S RTGSRY+EETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY   ESTRDREDSRKR KYE
Subjt:  SIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDREDSRKRPKYE

Query:  SRSSRRDNH
        SRSSR DNH
Subjt:  SRSSRRDNH

A0A6J1ESM6 dentin sialophosphoprotein-like | e value: 1.9e-279 | % identity: 61.89
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL +QGYTE EIS+KL+EARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
        ASGSEEKDG SAIVLADK+VSDTQ+HQIAARKEEQMKTLRAALGL S +DSEQV + ISDP+RNRREGQNAD+KRHEKSEHSFLDR+LNWKK G+ED  D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD

Query:  DKDVKKGASKEMKQKDK-KRRSKDDSSDTDS-GE-RKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKR
        DK  KK  SKE+K   K +RR KDDSSD DS GE  KGTKKNLRD+RRNDSESD + D ++KY  SR SKKNRRHDSD SS+TDSGGE K TKKHLR+ R
Subjt:  DKDVKKGASKEMKQKDK-KRRSKDDSSDTDS-GE-RKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKR

Query:  KDNLETDSDSDLDQKYLTSRKHKKNRRHDSDD--------------------------------------------------------------------
        +D  + D DS+ DQKY TSRKHKKNRRHDSDD                                                                    
Subjt:  KDNLETDSDSDLDQKYLTSRKHKKNRRHDSDD--------------------------------------------------------------------

Query:  --------------------------------------SSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKH-TSKKQKKSTRHDSDGSDSFTDGDKI
                                              SSDTDS GEHK+TKKS++NN+R   SD DSD+DKK+ TSKKQ+K+    SD SDS  D  + 
Subjt:  --------------------------------------SSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKH-TSKKQKKSTRHDSDGSDSFTDGDKI

Query:  GTDSHKKGSGRHESHKV-KKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYG-ESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSR
        G  SH+KGSGR +S KV KKQR RKQ+STDE+NSDSGI+DK RQLKHK+QHGKRYG +SDSSD DSSDSDVGR KS HRY SK  GKSRV+SESDSEK R
Subjt:  GTDSHKKGSGRHESHKV-KKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYG-ESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSR

Query:  KYPKKDDRRRRHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQH-DSNNSDDSLAVHRKGDDDHKKAKKYLSGDG
        K+PKKD  RRRHD D+++SGDN SSSDE+VK RR RRHN DD SEEEGEYFG+SGKIATKG I AKR+H DS+ SDDS AV RKG+D  K+AKK+ SGDG
Subjt:  KYPKKDDRRRRHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQH-DSNNSDDSLAVHRKGDDDHKKAKKYLSGDG

Query:  FNLEKGGKLSSGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARER
         + +KG K S GARERGKG+ +HA+G                                                        D+SV     K+T      
Subjt:  FNLEKGGKLSSGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARER

Query:  GKNHADGLYKFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSE
                YK + D ++E N A+Q+T  M  KRKLDEG E EQ+PE+KSR+R        + DPKKDFK+DSESSRR+RSGRY++TRDGRYRED KIDSE
Subjt:  GKNHADGLYKFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSE

Query:  SNTRSRYSAQIEDDDRKSIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY
        SN RSRYSA  ED+DRKS RTGSRY+EETEHGSRH+ KANESHH  RTDQD EE KRH  SRYEE RGRKHERDEG+KSSRE ERGEYQPSSR RSEKDY
Subjt:  SNTRSRYSAQIEDDDRKSIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY

Query:  ETRESTRDREDSRKRPKYESRSSRRD
        ET+ESTRDR+D RKR KY+SRSSRRD
Subjt:  ETRESTRDREDSRKRPKYESRSSRRD

A0A6J1K7B6 dentin sialophosphoprotein-like | e value: 8.3e-275 | % identity: 61.5
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL +QGYTE EIS+KL+EARE LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEA

Query:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD
        ASGSEEKDG SAIVLADK+VSDTQ+HQIAARKEEQMKTLRAALGL S +DSEQV + ISDP+RNRREGQNAD+KR EKSEHSFLDR+LNWK+ G+ED  D
Subjt:  ASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYD

Query:  DKDVKKGASKEMKQKDK-KRRSKDDSSDTDS-GE-RKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKR
        DK  KK  SKE+K   K +RR KDDSSD DS GE  KGTKKNLRD+RR DSESD + D ++KY  SR SKKNRRHDSD SS+TDSGGE K TKKHLR+ R
Subjt:  DKDVKKGASKEMKQKDK-KRRSKDDSSDTDS-GE-RKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKR

Query:  KDNLETDSDSDLDQKYLTSR--------------------------------------------------------------------------------
        +D  + D DS+ DQKY TSR                                                                                
Subjt:  KDNLETDSDSDLDQKYLTSR--------------------------------------------------------------------------------

Query:  --------------------------KHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKH-TSKKQKKSTRHDSDGSDSFTDGDKI
                                  KHKKNRRHDSD SSDTDS GEHK+TKKS++NN+R   SD DSD+DKK+ TSKKQ+K+   DSD SDS  D  + 
Subjt:  --------------------------KHKKNRRHDSDDSSDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKH-TSKKQKKSTRHDSDGSDSFTDGDKI

Query:  GTDSHKKGSGRHESHKV-KKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYG-ESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSR
        G  SH+KGSGR +S KV KKQRSRKQ+STDE+NSDSGI+DK RQLK+K+QHGKRYG +SDSSD DSSDSDVGR KS HRYHSK TGKSRV+SESDSEK R
Subjt:  GTDSHKKGSGRHESHKV-KKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYG-ESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSR

Query:  KYPKKDDRRRRHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQH-DSNNSDDSLAVHRKGDDDHKKAKKYLSGDG
        K+PKKD  RRRHD D+++SGDN SSSDE+VKRRR RRHN DD S EEGEYFG+SGKIATKG I AKR+H DS+ SDDS AV R+G+D  K+AKK+  GDG
Subjt:  KYPKKDDRRRRHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQH-DSNNSDDSLAVHRKGDDDHKKAKKYLSGDG

Query:  FNLEKGGKLSSGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARER
         + +KG K S GARERGKG+ +HA+G                                                        D+SV     K+T      
Subjt:  FNLEKGGKLSSGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVKHKRAKKYSSDDSVLEKGVKSTDGARER

Query:  GKNHADGLYKFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSE
                YK + DS++E N A+Q+T  M  KRKLDEG E EQ+PE+KS++R        + DPKKDFK+DSESSRR+RSGR+ +TRDGRYRED KIDSE
Subjt:  GKNHADGLYKFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNR--------NSDPKKDFKHDSESSRRSRSGRYDDTRDGRYREDFKIDSE

Query:  SNTRSRYSAQIEDDDRKSIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY
        SN RSRYSA  EDDDRKSIRTGSRY+EETEHGSRH+ KANESHH  RTDQD EE KR   SRYEE RGRKHERDEG+KSSRE ERGEYQPSSR RSEKDY
Subjt:  SNTRSRYSAQIEDDDRKSIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY

Query:  ETRESTRDREDSRKRPKYESRSSRRD
        ET+ESTRDR+D RKR KY+SRSSR D
Subjt:  ETRESTRDREDSRKRPKYESRSSRRD

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT3G49601.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: mRNA splicing factor, Cwf21 (InterPro:IPR013170); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).    2.1e-52    33.48
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKT-GKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLE
        MYNGIGLQT RGSGTNGY+QTNKFFVRP+  GK  +  +GFE+D+GTAG+SKKPNK ILEHDRKRQI LKL ILEDKL +QGY++ EI++KL EAR +LE
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKT-GKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLE

Query:  AASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQY
        AA+ + E++       +D +VS+TQTHQ+AARKE+QM+  RAALG   L D +QV +E        REG    +K  E+ EHSFLDRD   KK   ++  
Subjt:  AASGSEEKDGSSAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQY

Query:  DDKDVKKGASKEMKQKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSD-DSSETDSGGEHKVTKKHLRNKRK
        D+KD K      +K+  K+R   DD  D     +K   K     RR+D  S+ D    ++   S+   K R+ +S+ DSS +DS                
Subjt:  DDKDVKKGASKEMKQKDKKRRSKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSD-DSSETDSGGEHKVTKKHLRNKRK

Query:  DNLETDSDSDLDQK------YLTSRKHKKNRRHDSDDSSDTDSDGE---HKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDK
           E+DSDSD  +K        T++K  + +R  S +S + +SD      K  KKS+ +N+ G       ++  KH  + +    RHDSD S+  ++ +K
Subjt:  DNLETDSDSDLDQK------YLTSRKHKKNRRHDSDDSSDTDSDGE---HKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDK

Query:  IGTDSHKKGSGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRK
                         ++K+    +    +   D  +E  H  LK +     +    DS D +    +  +++S    +S    + R   E  ++  + 
Subjt:  IGTDSHKKGSGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQHGKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRK

Query:  YPKKDDRRRR--HDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHDSNNSDD---SLAVHRKGDDD---HKKAKK
          + D R +    D DD ++        +    +RGR+H       EE E     G+   +G    KR       DD     A+  +GDDD   ++  ++
Subjt:  YPKKDDRRRR--HDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGKIDAKRQHDSNNSDD---SLAVHRKGDDD---HKKAKK

Query:  YLSGDGFNLEKGGKLSSGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENS
         +  D    + G       R+R +G+   A G+  + DD+   E EY  R           G+ ++DDS +S
Subjt:  YLSGDGFNLEKGGKLSSGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENS


Sequences
CDS sequence
ATGTATAACGGTATTGGATTACAGACTCCTAGAGGGTCTGGCACTAACGGTTATATCCAGACGAACAAGTTTTTTGTGAGGCCAAAGACTGGAAAGGTTGCTGAAAGCAC
CAGAGGATTTGAAGAAGATCAGGGCACTGCTGGAGTTTCTAAGAAACCTAATAAAGACATTCTCGAGCATGACCGCAAGCGTCAGATTGAACTCAAACTTGTCATACTTG
AGGACAAGCTCAACGAACAAGGCTATACAGAGAAGGAAATTTCTGAAAAGTTGAGGGAAGCTCGTGAGAATTTGGAAGCTGCTTCTGGTTCTGAGGAAAAAGATGGATCC
TCTGCCATTGTTCTTGCTGATAAGAGGGTATCAGATACACAGACTCACCAAATTGCTGCAAGAAAGGAGGAGCAGATGAAAACATTGAGAGCTGCTCTTGGGTTGGGTTC
ATTGGATGATAGTGAACAGGTTAAAGATGAGATTTCTGATCCATCAAGGAATAGGAGAGAGGGTCAGAATGCTGATATGAAACGTCATGAGAAGTCTGAACACTCTTTTC
TGGACAGAGATTTGAACTGGAAAAAGCGTGGCACTGAAGATCAGTATGATGATAAGGATGTCAAAAAGGGGGCTTCAAAAGAGATGAAACAAAAGGATAAAAAAAGAAGA
TCCAAGGATGATTCTTCTGACACAGATTCTGGTGAGCGGAAGGGAACCAAGAAGAACTTGAGAGATAGTAGAAGGAATGATTCTGAAAGTGACCTTGACATTGATGTCAA
CAATAAATATGTCGCCTCAAGGAACTCTAAAAAGAATAGAAGACATGATAGTGATGATTCTTCTGAAACTGATTCTGGTGGCGAGCACAAGGTAACCAAAAAGCACTTGA
GAAATAAACGGAAAGATAATCTTGAAACTGACTCAGACAGCGATCTTGACCAGAAATATTTGACCTCGAGGAAGCATAAGAAAAACAGAAGGCATGATAGTGATGATTCT
TCTGATACTGATTCTGATGGAGAGCACAAGAAAACCAAGAAGAGTGTGAGAAATAATCAAAGAGGTCATGGAAGTGATCTCGACAGTGATGTTGACAAGAAACACACCTC
AAAGAAGCAGAAGAAAAGCACAAGGCATGATAGCGATGGTTCTGATTCCTTTACGGATGGTGATAAGATTGGGACGGACAGCCACAAGAAAGGATCTGGTCGACATGAAA
GTCACAAGGTGAAGAAGCAAAGAAGCCGGAAACAGGATTCTACTGATGAAACCAATTCTGACAGTGGGATTGAAGATAAACACAGGCAACTGAAACACAAAAGCCAGCAT
GGTAAAAGATATGGAGAAAGTGACAGCTCTGACCATGACAGTTCTGATTCTGATGTAGGTCGCATGAAGAGTATGCATAGATATCACAGCAAACATACAGGAAAGAGCAG
GGTAAATAGTGAATCCGATTCTGAAAAGTCAAGAAAGTATCCAAAGAAAGATGATCGTAGACGCAGACATGACATTGATGATGAAAAAAGTGGCGATAACAGATCTAGCA
GTGATGAATTAGTGAAGAGGCGCAGAGGTAGGAGGCACAATATTGATGATAGTTCTGAAGAAGAAGGCGAATATTTTGGTAGAAGTGGTAAGATAGCCACAAAAGGAAAA
ATAGATGCTAAAAGGCAACATGATAGCAATAATTCTGATGATAGCTTAGCAGTTCATAGAAAGGGCGATGATGACCACAAGAAAGCTAAGAAATATTTGTCTGGTGACGG
TTTTAATCTAGAGAAGGGAGGAAAATTGAGCAGTGGAGCTCGTGAAAGAGGAAAAGGGAACTTAGACCATGCAGAAGGTAGGAGACACAACACTGATGATAAATCCGAAG
AAGAAGGTGAATATCTTGGTAGAAGTGGTAAGATAGCTACAAAAAGAAAAATAGATGGTAAAAGGCAACATGATGACAGTGAAAATTCTGATGATAGCCTTGCAGTTAAA
CACAAGAGAGCTAAGAAATATTCATCTGACGATTCTGTTCTAGAGAAGGGAGTAAAATCAACTGATGGAGCACGTGAAAGAGGGAAAAACCATGCAGATGGTTTGTACAA
GTTTAAGAAAGATTCTATCAATGAGTTGAACCATGCAAGTCAACGGACAGATAAAATGAACGGCAAGAGAAAGCTTGATGAAGGTCCTGAAATTGAGCAAGAGCCAGAGT
CAAAATCTAGAAATCGAAATTCTGACCCCAAGAAAGATTTCAAACATGACTCTGAATCAAGCAGAAGATCACGGAGTGGTAGGTACGATGACACAAGGGATGGACGGTAC
AGGGAAGACTTCAAAATTGACTCTGAATCAAACACTAGATCACGCTACAGTGCACAAATTGAGGATGACGACAGAAAGTCGATTCGAACAGGAAGCAGATATTCTGAAGA
AACTGAGCATGGAAGTAGACATCATCGCAAGGCTAACGAGTCTCATCACCATCGGAGGACTGATCAAGATACTGAAGAGGAAAAAAGGCACAGCAGATATGAGGAGCCTA
GAGGGAGAAAACATGAAAGAGATGAGGGTCTAAAATCAAGCAGGGAAGTTGAAAGAGGGGAGTATCAACCAAGTAGCAGGCAGAGATCTGAGAAAGATTATGAAACTAGA
GAATCTACAAGAGATAGGGAGGATTCCAGAAAGAGGCCCAAATATGAATCTCGATCAAGCAGACGTGATAATCATTAG
mRNA sequence
GGGTTCTTCATCGCGCCTATTTTTCTTTCACCTTCTGACCGAAAATCCCAATCTCTCTTTATAATCATTTCAACACTATTGATTTATCTCAAGCCTCTCCTTCTCTGATC
TCCATTTATCTCTCGTCTCTCAATGTTGTCGCCTTTGATCGTTTAAGATCTCTCCGTCTCTGGTGCTTTCTTTGATTTCGTTATTATCTCTCACTCTCGGTTGTTCTTCA
AGTTTGAGGTTGGGGAAGCTGCAGGACTATGTATAACGGTATTGGATTACAGACTCCTAGAGGGTCTGGCACTAACGGTTATATCCAGACGAACAAGTTTTTTGTGAGGC
CAAAGACTGGAAAGGTTGCTGAAAGCACCAGAGGATTTGAAGAAGATCAGGGCACTGCTGGAGTTTCTAAGAAACCTAATAAAGACATTCTCGAGCATGACCGCAAGCGT
CAGATTGAACTCAAACTTGTCATACTTGAGGACAAGCTCAACGAACAAGGCTATACAGAGAAGGAAATTTCTGAAAAGTTGAGGGAAGCTCGTGAGAATTTGGAAGCTGC
TTCTGGTTCTGAGGAAAAAGATGGATCCTCTGCCATTGTTCTTGCTGATAAGAGGGTATCAGATACACAGACTCACCAAATTGCTGCAAGAAAGGAGGAGCAGATGAAAA
CATTGAGAGCTGCTCTTGGGTTGGGTTCATTGGATGATAGTGAACAGGTTAAAGATGAGATTTCTGATCCATCAAGGAATAGGAGAGAGGGTCAGAATGCTGATATGAAA
CGTCATGAGAAGTCTGAACACTCTTTTCTGGACAGAGATTTGAACTGGAAAAAGCGTGGCACTGAAGATCAGTATGATGATAAGGATGTCAAAAAGGGGGCTTCAAAAGA
GATGAAACAAAAGGATAAAAAAAGAAGATCCAAGGATGATTCTTCTGACACAGATTCTGGTGAGCGGAAGGGAACCAAGAAGAACTTGAGAGATAGTAGAAGGAATGATT
CTGAAAGTGACCTTGACATTGATGTCAACAATAAATATGTCGCCTCAAGGAACTCTAAAAAGAATAGAAGACATGATAGTGATGATTCTTCTGAAACTGATTCTGGTGGC
GAGCACAAGGTAACCAAAAAGCACTTGAGAAATAAACGGAAAGATAATCTTGAAACTGACTCAGACAGCGATCTTGACCAGAAATATTTGACCTCGAGGAAGCATAAGAA
AAACAGAAGGCATGATAGTGATGATTCTTCTGATACTGATTCTGATGGAGAGCACAAGAAAACCAAGAAGAGTGTGAGAAATAATCAAAGAGGTCATGGAAGTGATCTCG
ACAGTGATGTTGACAAGAAACACACCTCAAAGAAGCAGAAGAAAAGCACAAGGCATGATAGCGATGGTTCTGATTCCTTTACGGATGGTGATAAGATTGGGACGGACAGC
CACAAGAAAGGATCTGGTCGACATGAAAGTCACAAGGTGAAGAAGCAAAGAAGCCGGAAACAGGATTCTACTGATGAAACCAATTCTGACAGTGGGATTGAAGATAAACA
CAGGCAACTGAAACACAAAAGCCAGCATGGTAAAAGATATGGAGAAAGTGACAGCTCTGACCATGACAGTTCTGATTCTGATGTAGGTCGCATGAAGAGTATGCATAGAT
ATCACAGCAAACATACAGGAAAGAGCAGGGTAAATAGTGAATCCGATTCTGAAAAGTCAAGAAAGTATCCAAAGAAAGATGATCGTAGACGCAGACATGACATTGATGAT
GAAAAAAGTGGCGATAACAGATCTAGCAGTGATGAATTAGTGAAGAGGCGCAGAGGTAGGAGGCACAATATTGATGATAGTTCTGAAGAAGAAGGCGAATATTTTGGTAG
AAGTGGTAAGATAGCCACAAAAGGAAAAATAGATGCTAAAAGGCAACATGATAGCAATAATTCTGATGATAGCTTAGCAGTTCATAGAAAGGGCGATGATGACCACAAGA
AAGCTAAGAAATATTTGTCTGGTGACGGTTTTAATCTAGAGAAGGGAGGAAAATTGAGCAGTGGAGCTCGTGAAAGAGGAAAAGGGAACTTAGACCATGCAGAAGGTAGG
AGACACAACACTGATGATAAATCCGAAGAAGAAGGTGAATATCTTGGTAGAAGTGGTAAGATAGCTACAAAAAGAAAAATAGATGGTAAAAGGCAACATGATGACAGTGA
AAATTCTGATGATAGCCTTGCAGTTAAACACAAGAGAGCTAAGAAATATTCATCTGACGATTCTGTTCTAGAGAAGGGAGTAAAATCAACTGATGGAGCACGTGAAAGAG
GGAAAAACCATGCAGATGGTTTGTACAAGTTTAAGAAAGATTCTATCAATGAGTTGAACCATGCAAGTCAACGGACAGATAAAATGAACGGCAAGAGAAAGCTTGATGAA
GGTCCTGAAATTGAGCAAGAGCCAGAGTCAAAATCTAGAAATCGAAATTCTGACCCCAAGAAAGATTTCAAACATGACTCTGAATCAAGCAGAAGATCACGGAGTGGTAG
GTACGATGACACAAGGGATGGACGGTACAGGGAAGACTTCAAAATTGACTCTGAATCAAACACTAGATCACGCTACAGTGCACAAATTGAGGATGACGACAGAAAGTCGA
TTCGAACAGGAAGCAGATATTCTGAAGAAACTGAGCATGGAAGTAGACATCATCGCAAGGCTAACGAGTCTCATCACCATCGGAGGACTGATCAAGATACTGAAGAGGAA
AAAAGGCACAGCAGATATGAGGAGCCTAGAGGGAGAAAACATGAAAGAGATGAGGGTCTAAAATCAAGCAGGGAAGTTGAAAGAGGGGAGTATCAACCAAGTAGCAGGCA
GAGATCTGAGAAAGATTATGAAACTAGAGAATCTACAAGAGATAGGGAGGATTCCAGAAAGAGGCCCAAATATGAATCTCGATCAAGCAGACGTGATAATCATTAGGCTT
GGGTTCAATATCCGTATGTTTGCTTATTGTTTCTATCAATTCTAGAGCTTGAATCATGATATGCCTTGTTTGAAAGGTTGTATCTTTCCTCTTTCCTCTTTCCTCTTTCC
TCTTTCCTGTAAAGACAATTTATTTGAAATGTTGTTTGCTTTCAACTTCCTTCATAGTTGCTACAGAAAAGATTGTTCCGGGCAGGGACAGAAATGGAATTGTCTTTTGT
GTATTGTTCATTGTTGTATTGTTTTCATCAGGTTCTGGAAAGGCGATCCAATATTTGAATCCCATTTGTGCTAGCAAGTTTGTATATGCAGAAGATTCGAGTTCCGGAGT
AAATTTTAAAGCTC
Protein sequence
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLNEQGYTEKEISEKLREARENLEAASGSEEKDGS
SAIVLADKRVSDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQVKDEISDPSRNRREGQNADMKRHEKSEHSFLDRDLNWKKRGTEDQYDDKDVKKGASKEMKQKDKKRR
SKDDSSDTDSGERKGTKKNLRDSRRNDSESDLDIDVNNKYVASRNSKKNRRHDSDDSSETDSGGEHKVTKKHLRNKRKDNLETDSDSDLDQKYLTSRKHKKNRRHDSDDS
SDTDSDGEHKKTKKSVRNNQRGHGSDLDSDVDKKHTSKKQKKSTRHDSDGSDSFTDGDKIGTDSHKKGSGRHESHKVKKQRSRKQDSTDETNSDSGIEDKHRQLKHKSQH
GKRYGESDSSDHDSSDSDVGRMKSMHRYHSKHTGKSRVNSESDSEKSRKYPKKDDRRRRHDIDDEKSGDNRSSSDELVKRRRGRRHNIDDSSEEEGEYFGRSGKIATKGK
IDAKRQHDSNNSDDSLAVHRKGDDDHKKAKKYLSGDGFNLEKGGKLSSGARERGKGNLDHAEGRRHNTDDKSEEEGEYLGRSGKIATKRKIDGKRQHDDSENSDDSLAVK
HKRAKKYSSDDSVLEKGVKSTDGARERGKNHADGLYKFKKDSINELNHASQRTDKMNGKRKLDEGPEIEQEPESKSRNRNSDPKKDFKHDSESSRRSRSGRYDDTRDGRY
REDFKIDSESNTRSRYSAQIEDDDRKSIRTGSRYSEETEHGSRHHRKANESHHHRRTDQDTEEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETR
ESTRDREDSRKRPKYESRSSRRDNH