CuGenDBv2

Tan0017402 (gene) of Snake gourd v1 genome

Gene ID: Tan0017402
Organism: Trichosanthes anguina (Snake gourd v1)
Description: cwf21 domain-containing protein
Genome location: LG02:94928669..94933108
RNA-Seq Expression: Tan0017402
Synteny: Tan0017402
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains: IPR013170 - mRNA splicing factor Cwf21 domain


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022131365.1 protein starmaker [Momordica charantia] (e-value: 0.0e+00; identity: 73.36%)
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKL+ILEDKL DQGYT +E+SEKLKEARKTLE 
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED

Query:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD
        AS  EEK GPSAI+L DKR+SDTQTHQIAARKEEQMKTLR A GLGS DD+EQLKEG+SD   + REG+N+DIK  EK EH+FLDRELNWKKHA E  +D
Subjt:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD

Query:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGEHKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNNR
        DK+ K RV+KE KGH+KDR+RRPKDDSSDTDSGGEHKGTKKNLRDNRR+DS SD+DSDVDKKY+TSR+ KKNRRHD DDSSD+DSGGE K  KK++R+NR
Subjt:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGEHKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNNR

Query:  RDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGVEFGMGSH
        RDDPESD D+DVD+KYITSRKHKKNRRHDSDDSS +DSG +HK TKKN+++ +RDDHESD DSDVDKKY +SKK  K++RHDSD+SDS+TD  +FG G H
Subjt:  RDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGVEFGMGSH

Query:  KKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESEKSRKHPK
        KKGSGRPKSQKVKKK   +KQESTDESNSD G  DDKGR  +HKN  GKR    SDSSDH  SDSDVGRNKSKHRYHS+S GK +VDSE ++EKSRKHPK
Subjt:  KKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESEKSRKHPK

Query:  EDDVGRHRHDTDD-ESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKHSSGNGSD
        E DVGRHRHDTDD ES D S SSDE V+RR  +R+DTDD+S   GE  DR+SGKIATK K+AAK+QYD SD+SDDS+AVDRKG +KH+RAKKH+ G+GS 
Subjt:  EDDVGRHRHDTDD-ESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKHSSGNGSD

Query:  LERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGENDQPEAKSRNRNYARESDFHRDPKKDIKTDPDSN
        LE+GFKSS G RE GKGNLNHADGLDE VTAD+NN + SRKD+I+EF+H     MK+KRKF EGGEN+Q EAKSRNRN  RE  F+ D KKD K D  SN
Subjt:  LERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGENDQPEAKSRNRNYARESDFHRDPKKDIKTDPDSN

Query:  RRAHSSRYDETRDGRYRENSRIDSESNTRSRY-SVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRYSRYEEHRGRKHERDDE
         RA ++RYDE RDG +RE+ ++DSESNTR+RY S+ DEDD  K  RTGS+Y+EETEH SRH+RKANESH       DIEEGKR+ RYEEHRGRKHER DE
Subjt:  RRAHSSRYDETRDGRYRENSRIDSESNTRSRY-SVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRYSRYEEHRGRKHERDDE

Query:  GLKSSREVERGEYQPSSRL---------------------------RSEKDYESRESRRDRD-DPRKRAKYDDSRSSRRDGY
         LKSSREVERGEYQPSS+L                           RSEKDYE+RESRRDR+ D RKRAKYDDSRSSRRD Y
Subjt:  GLKSSREVERGEYQPSSRL---------------------------RSEKDYESRESRRDRD-DPRKRAKYDDSRSSRRDGY

XP_022929608.1 dentin sialophosphoprotein-like [Cucurbita moschata] (e-value: 0.0e+00; identity: 69.93%)
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKL+ILEDKLTDQGYT DEIS+KLKEAR+TLE 
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED

Query:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD
        ASGSEEKDGPSAI+LADK+VSDTQ+HQIAARKEEQMKTLR A GL SS+D+EQ+ EG+SD +R+RREGQNADIK HEK EHSFLDRELNWKKH  ED +D
Subjt:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD

Query:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGE-HKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNN
        DK DKKRV+KELKGH KD RRRPKDDSSD DS GE HKGTKKNLRDNRRNDS SD +SD D KY TSRKSKKNRRHD D SSD+DSGGERKGTKKH+R+N
Subjt:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGE-HKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNN

Query:  RRDDPESDLDTDVDQKYITSRKHKKNRRHDSDD-------------------------------------------------------------------
        RRD P+ D D++ DQKY TSRKHKKNRRHDSDD                                                                   
Subjt:  RRDDPESDLDTDVDQKYITSRKHKKNRRHDSDD-------------------------------------------------------------------

Query:  ---------------------------------------SSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGV
                                               SSD+DSGGEHKETKK++K+NRR D ESD DSD+DKKYT+SKK EKN+   SD+SDS  D  
Subjt:  ---------------------------------------SSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGV

Query:  EFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESE
        EFGMGSH+KGSGR KSQKV KKQ+ +KQESTDESNSDSGI DDKGRQLKHKNQHGKRYGV SDSSD  SSDSDVGRNKSKHRY SK  GK RVDSES+SE
Subjt:  EFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESE

Query:  KSRKHPKEDDVGRHRHDTD-DESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKH
        K RKHPK+ DVGR RHDTD DES DNSSSSDEIVK R  RRH++DDKSEEEGE+   +SGKIATK  +AAKR++D SD SDDS+AVDRKG+DK KRAKKH
Subjt:  KSRKHPKEDDVGRHRHDTD-DESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKH

Query:  SSGNGSDLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGEND-QPEAKSRNRNYARESDFHRDPKKD
        SSG+GSD ++G KSS G RERGKG+ NHADGLDESVTA +N  + SR D ++EF+      MK+KRK  EGGE++ QPEAKSR+R   RESDFH DPKKD
Subjt:  SSGNGSDLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGEND-QPEAKSRNRNYARESDFHRDPKKD

Query:  IKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRY--SRYEEHR
         K D +S+RRA S RY+ETRDGRYRE+ +IDSESN RSRYS  +ED++RKSTRTGSRY+EETEH SRH+ KANESH R RTDQDIEEGKR+  SRYEEHR
Subjt:  IKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRY--SRYEEHR

Query:  GRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRD
        GRKHER DEG+KSSRE ERGEYQPSSRLRSEKDYE++ES RDRDDPRKRAKY DSRSSRRD
Subjt:  GRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRD

XP_022997381.1 dentin sialophosphoprotein-like [Cucurbita maxima] (e-value: 6.5e-310; identity: 69.51%)
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKL+ILEDKLTDQGYT DEIS+KLKEAR+TLE 
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED

Query:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD
        ASGSEEKDGPSAI+LADK+VSDTQ+HQIAARKEEQMKTLR A GL SS+D+EQ+ EG+SD +R+RREGQNADIK  EK EHSFLDRELNWK+H  ED +D
Subjt:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD

Query:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGE-HKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNN
        DK DKKRV+KELKGH KD RRRPKDDSSD DS GE HKGTKKNLRDNRR DS SD +SD D KY TSRKSKKNRRHD D SSD+DSGGERKGTKKH+R+N
Subjt:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGE-HKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNN

Query:  RRDDPESDLDTDVDQKY-----------------------------------------------------ITSRKHKKNRRHDSDD--------------
        RRD P+ D D++ DQKY                                                     ITSRKHKKNRRHDSDD              
Subjt:  RRDDPESDLDTDVDQKY-----------------------------------------------------ITSRKHKKNRRHDSDD--------------

Query:  ---------------------------------------SSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGV
                                               SSD+DSGGEHKETKK++K+NRR D ESD DSD+DKKYT+SKK EKN+  DSD+SDS  D  
Subjt:  ---------------------------------------SSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGV

Query:  EFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESE
        EFGMGSH+KGSGRPKSQKV KKQ+ +KQESTDESNSDSGI DDKGRQLK+KNQHGKRYGV SDSSD  SSDSDVGRNKSKHRYHSK TGK RVDSES+SE
Subjt:  EFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESE

Query:  KSRKHPKEDDVGRHRHDTD-DESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKH
        K RKHPK+ DVGR RHDTD DES DNSSSSDEIVKRR  RRH++DDKS EEGE+   +SGKIATK  +AAKR+++ SD SDDS+AVDR+G+DK KRAKKH
Subjt:  KSRKHPKEDDVGRHRHDTD-DESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKH

Query:  SSGNGSDLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGEND-QPEAKSRNRNYARESDFHRDPKKD
        S G+GSD ++G KSS G RERGKG+ NHADGLDESVTA +N  + SR DS++EF+      MK+KRK  EGGE++ QPEAKS++R   RESDFH DPKKD
Subjt:  SSGNGSDLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGEND-QPEAKSRNRNYARESDFHRDPKKD

Query:  IKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRY--SRYEEHR
         K D +S+RRA S R+ ETRDGRYRE+ +IDSESN RSRYS  +EDD+RKS RTGSRY+EETEH SRH+ KANESH R RTDQDIEEGKR   SRYEEHR
Subjt:  IKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRY--SRYEEHR

Query:  GRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRD
        GRKHER DEG+KSSRE ERGEYQPSSRLRSEKDYE++ES RDRDDPRKRAKY DSRSSR D
Subjt:  GRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRD

XP_023545728.1 protein starmaker-like [Cucurbita pepo subsp. pepo] (e-value: 0.0e+00; identity: 69.93%)
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKL+ILEDKLTDQGYT DEIS+KLKEAR+TLE 
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED

Query:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD
        ASGSEEKDGPSAI+LADK+VSDTQ+HQIAARKEEQMKTLR A GL SS+D+EQ+ EG+SD +R+RREGQNADIK  EK EHSFLDRELNWKKH  ED +D
Subjt:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD

Query:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGE-HKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNN
        DK DKKRV+KELKGH KD RRRPKDDSSD DS GE HKGTKKNLRDNRRNDS SD +SD D+KY TSRKSKKNRRHD D SSD+DSGGERKGTKKH+R+N
Subjt:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGE-HKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNN

Query:  RRDDPESDLDTDVDQKYITSRKHKKNRRHDSDD-------------------------------------------------------------------
        RRD P+ D D++ DQKY TSRKHKKNRRHDSDD                                                                   
Subjt:  RRDDPESDLDTDVDQKYITSRKHKKNRRHDSDD-------------------------------------------------------------------

Query:  ---------------------------------------SSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGV
                                               SSD+DSGGEHKETKK++K+NRR D ESD DSDVDKKYT+SKK EKN+  DSD+SDS  D  
Subjt:  ---------------------------------------SSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGV

Query:  EFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESE
        EFGMGSH+KGSGRPKSQKV KKQ+ +KQESTDESNSDSGI DDKGRQLKHKNQHGKRYGV SDSSD  SSDSDVGRNKSKHRYHSK  GK RVDSES+SE
Subjt:  EFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESE

Query:  KSRKHPKEDDVGRHRHDTD-DESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKH
        K RKHPK+ DVGR RHDTD DES DNSSSSDEIVKRR  RR+++DDKS EEGE+   +SGK ATK  +AAKR++D SD SDDS+A+DR+G+DK KRAKKH
Subjt:  KSRKHPKEDDVGRHRHDTD-DESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKH

Query:  SSGNGSDLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGEND-QPEAKSRNRNYARESDFHRDPKKD
        SSG+GSD ++G KSS G RERGKG+ NHADGLDESVTA +N  + SR DS++EF+      MK+KRK  EGGE++ QPEAKSR+R   RESDFH DPKKD
Subjt:  SSGNGSDLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGEND-QPEAKSRNRNYARESDFHRDPKKD

Query:  IKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRY--SRYEEHR
         K D +S+RRA S RY+E RDGRYRE  +IDSESNTRSRYS  +EDD+RKSTRTGSRY+EETEH SRH+ KANESH R RTDQDIEEGKR+  SRYEE R
Subjt:  IKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRY--SRYEEHR

Query:  GRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRD
        GRKHER DEG+KSSRE ERGEYQPSSRLRSEKDYE++ES RDRDDPRKRAKY DSRSSRRD
Subjt:  GRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRD

XP_038884695.1 dentin sialophosphoprotein-like [Benincasa hispida] (e-value: 1.7e-308; identity: 73.16%)
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKL+ILEDKL DQGYT+DEISEKL+EAR+TLE 
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED

Query:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD
        ASGSEEKDGPSAI+LADKRVSDTQTHQIAARKEEQMKTLR A GLGSS DTEQ+KE +SD SR RREGQNADIK HEK EHSFLDRELNWKKH  EDQ D
Subjt:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD

Query:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGEHKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNNR
        DK+DKKR++KELKGHQK R+RRPKDDSSDTDS        +NLRD+RRNDS SDLDSDV  KY+ SR   KNRRHD DDSSD+DSGGERKGTKKH+R+ R
Subjt:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGEHKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNNR

Query:  RDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGVEFGM-GS
        RDDPESD D+D DQKYITSRKHKKNRRHD D+SSD+DSGGEHK+TKKNM++NRR  H SD  SD+DKKYT SKK EKNRRHDSD+SDS+TDG EFGM GS
Subjt:  RDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGVEFGM-GS

Query:  HKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESEKSRKHP
        HKKGS R KSQKV K Q+ +KQESTDESNSDSGI D+K RQLKH+NQHGKRYGV SDSSDH SSDSDVG  KSKHRY SK  GK RVDSES SEKSRKH 
Subjt:  HKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESEKSRKHP

Query:  KEDDVGRHRHDTDDE-SDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKHSSGNGS
        K+D  GRHRHD D+E S DNSSS  EIVKRR GR ++ DD SEEEGE+L  RSGKIATK K+ AKRQ+D ++NSDDS AV RKG+DKHKRAKK SSG+ S
Subjt:  KEDDVGRHRHDTDDE-SDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKHSSGNGS

Query:  DLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-------MKTKRKF-EGGENDQP-EAKSRNRNYARESDFHRD--------
        DLE+G K+S G RERGKG+LNHADGL++            +KDSINEF+H       M +KRKF EGG+N+Q  E+KSRNRN  R SDFH D        
Subjt:  DLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-------MKTKRKF-EGGENDQP-EAKSRNRNYARESDFHRD--------

Query:  ---------------------------------PKKDIKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETE
                                         PKK  + D +S+RRA S RYDETRDGRYRE+ +IDSESN RSRYSV+DEDD+RK+T+TGSR++EETE
Subjt:  ---------------------------------PKKDIKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETE

Query:  HESRHHRKANESHGR-RTDQDIEEGKRYSRYEEHRGRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRDGY
        H SRHHRKANESH R RT +D EE KR+SRYEE RGRKHER +EGLKS REVERGEYQPSSRLRSEKDYE+RES RDRDD RKRAKY +SRSSRRD +
Subjt:  HESRHHRKANESHGR-RTDQDIEEGKRYSRYEEHRGRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRDGY

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3BBX0 dentin sialophosphoprotein-like (e-value: 4.4e-294; identity: 69.47%)
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKL+ILEDKL DQGYT  EISEKL+EAR+ LE 
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED

Query:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD
        ASGSEEKDG SAI+LADKRVSDTQTHQIAARKEEQMKTLR A GLGS  D EQ+KE +SD SRSRREGQNADIK HEK EHSFLDRELNWK+   EDQ D
Subjt:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD

Query:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGEHKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNNR
        DK+ KK  +KELKGHQKD++RRPKDD SD DS GEHKGTKKNLRD+RR DS SDLD DV+ KY+ SRKSKKNRRHD DDSS +DSGGE K TKKH RN R
Subjt:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGEHKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNNR

Query:  RDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGVEFGMGSH
        +DDPESD D+D+DQKY+TSRKHKKNRRHDSDDSSDSDSGGEHK+TK++++ N+R  H SD DSDVDKK+T SKK +K+ RHDSD+SDS TDG + GM SH
Subjt:  RDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGVEFGMGSH

Query:  KKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESEKSRKHPK
        +KGSGR +SQKV KKQ+ QKQ+STDE+NSDS + +DK RQLKHKNQHGKRYG  SDSSDH SSDSDVGR KS HR+HSK TGK RVDSES+ EKSRK+PK
Subjt:  KKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESEKSRKHPK

Query:  EDDVGRHRHDTDDE-SDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKHSSGNGSD
        +DD  R RHD DDE S DNSSSSDE+VKRR GRRH TDD SEEEGE+   RSGKI TK K+ AKRQ D S+NSD S AVDRKG D+HKRAKK+SSG+G +
Subjt:  EDDVGRHRHDTDDE-SDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKHSSGNGSD

Query:  LERGFKSSSGLRERGKGNLNHADGL---------------------------------DESVTADE----------------------------------
        LE+G K SSG RERGKGNLNH +G                                  D+S  +D+                                  
Subjt:  LERGFKSSSGLRERGKGNLNHADGL---------------------------------DESVTADE----------------------------------

Query:  ----NNLHMSRKDSINEFSH-------MKTKRKF-EGGENDQ-PEAKSRNRNYARESDFHRDPKKDIKTDPDSNRRAHSSRYDETRDGRYRENSRIDSES
            + L   +KDSI+EF+H       M +KRK  EG EN+Q PE+KSRNRN         DPKKD K D +S+RR+ S RYDETRDGRYRE+S+IDSES
Subjt:  ----NNLHMSRKDSINEFSH-------MKTKRKF-EGGENDQ-PEAKSRNRNYARESDFHRDPKKDIKTDPDSNRRAHSSRYDETRDGRYRENSRIDSES

Query:  NTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANES-HGRRTDQDIEEGKRYSRYEEHRGRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYES
        NTRSRYS  +EDD+RKSTRTGSRY+EETEH SRHHRKANES H RRTDQD EE KR+SRYEE RGRKHER DEGLKSSREVERGEYQPSSR RSEKDY  
Subjt:  NTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANES-HGRRTDQDIEEGKRYSRYEEHRGRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYES

Query:  RESRRDRDDPRKRAKYDDSRSSRRDGY
         ES RDR+D RKRAKY +SRSSR D +
Subjt:  RESRRDRDDPRKRAKYDDSRSSRRDGY

A0A5A7VCH8 Dentin sialophosphoprotein-like (e-value: 7.9e-296; identity: 69.69%)
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKL+ILEDKL DQGYT  EISEKL+EAR+ LE 
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED

Query:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD
        ASGSEEKDG SAI+LADKRVSDTQTHQIAARKEEQMKTLR A GLGS DD EQ+KE +SD SRSRREGQNADIK HEK EHSFLDRELNWK+   EDQ D
Subjt:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD

Query:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGEHKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNNR
        DK+ KK  +KELKGHQKD++RRPKDDSSDTDS GEHKGTKKNLRD+RR DS S+LD DV+ KY+ SRKSKKNRRHD DDSS +DSGGE K TKKH RN R
Subjt:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGEHKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNNR

Query:  RDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGVEFGMGSH
        +DDPESD D+D+DQKY+TSRKHKKNRRHDSDDSSDSDSGGEHK+TK++++ N+R  H SD DSDVDKK+T SKK +K+ RHDSD+SDS TDG + GM SH
Subjt:  RDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGVEFGMGSH

Query:  KKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESEKSRKHPK
        +KGSGR +SQKV KKQ+ QKQ+STDE+NSDS + +DK RQLKHKNQHGKRYG  SDSSDH SSDSDVGR KS HR+HSK TGK RVDSES+ EKSRK+PK
Subjt:  KKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESEKSRKHPK

Query:  EDDVGRHRHDTDDE-SDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKHSSGNGSD
        + DV R RHD DDE S DNSSSSDE+VKRR GRRH TDD SEEEGE+   RSGKI TK K+ AKRQ D S+NSD S AVDRKG D+HKRAKK+SSG+G +
Subjt:  EDDVGRHRHDTDDE-SDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKHSSGNGSD

Query:  LERGFKSSSGLRERGKGNLNHADGL---------------------------------DESVTADE----------------------------------
        LE+G K SSG RERGKGNLNH +G                                  D+S  +D+                                  
Subjt:  LERGFKSSSGLRERGKGNLNHADGL---------------------------------DESVTADE----------------------------------

Query:  ----NNLHMSRKDSINEFSH-------MKTKRKF-EGGENDQ-PEAKSRNRNYARESDFHRDPKKDIKTDPDSNRRAHSSRYDETRDGRYRENSRIDSES
            + L   +KDSI+EF+H       M +KRK  EG EN+Q PE+KSRNRN         DPKKD K D +S+RR+ S RYDETRDGRYRE+S+IDSES
Subjt:  ----NNLHMSRKDSINEFSH-------MKTKRKF-EGGENDQ-PEAKSRNRNYARESDFHRDPKKDIKTDPDSNRRAHSSRYDETRDGRYRENSRIDSES

Query:  NTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANES-HGRRTDQDIEEGKRYSRYEEHRGRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYES
        NTRSRYS  +EDD+RKSTRTGSRY+EETEH SRHHRKANES H RRTDQD EE KR+SRYEE RGRKHER DEGLKSSREVERGEYQPSSR RSEKDY  
Subjt:  NTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANES-HGRRTDQDIEEGKRYSRYEEHRGRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYES

Query:  RESRRDRDDPRKRAKYDDSRSSRRDGY
         ES RDR+D RKRAKY +SRSSR D +
Subjt:  RESRRDRDDPRKRAKYDDSRSSRRDGY

A0A6J1BPI2 protein starmaker (e-value: 0.0e+00; identity: 73.36%)
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKL+ILEDKL DQGYT +E+SEKLKEARKTLE 
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED

Query:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD
        AS  EEK GPSAI+L DKR+SDTQTHQIAARKEEQMKTLR A GLGS DD+EQLKEG+SD   + REG+N+DIK  EK EH+FLDRELNWKKHA E  +D
Subjt:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD

Query:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGEHKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNNR
        DK+ K RV+KE KGH+KDR+RRPKDDSSDTDSGGEHKGTKKNLRDNRR+DS SD+DSDVDKKY+TSR+ KKNRRHD DDSSD+DSGGE K  KK++R+NR
Subjt:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGEHKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNNR

Query:  RDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGVEFGMGSH
        RDDPESD D+DVD+KYITSRKHKKNRRHDSDDSS +DSG +HK TKKN+++ +RDDHESD DSDVDKKY +SKK  K++RHDSD+SDS+TD  +FG G H
Subjt:  RDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGVEFGMGSH

Query:  KKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESEKSRKHPK
        KKGSGRPKSQKVKKK   +KQESTDESNSD G  DDKGR  +HKN  GKR    SDSSDH  SDSDVGRNKSKHRYHS+S GK +VDSE ++EKSRKHPK
Subjt:  KKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESEKSRKHPK

Query:  EDDVGRHRHDTDD-ESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKHSSGNGSD
        E DVGRHRHDTDD ES D S SSDE V+RR  +R+DTDD+S   GE  DR+SGKIATK K+AAK+QYD SD+SDDS+AVDRKG +KH+RAKKH+ G+GS 
Subjt:  EDDVGRHRHDTDD-ESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKHSSGNGSD

Query:  LERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGENDQPEAKSRNRNYARESDFHRDPKKDIKTDPDSN
        LE+GFKSS G RE GKGNLNHADGLDE VTAD+NN + SRKD+I+EF+H     MK+KRKF EGGEN+Q EAKSRNRN  RE  F+ D KKD K D  SN
Subjt:  LERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGENDQPEAKSRNRNYARESDFHRDPKKDIKTDPDSN

Query:  RRAHSSRYDETRDGRYRENSRIDSESNTRSRY-SVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRYSRYEEHRGRKHERDDE
         RA ++RYDE RDG +RE+ ++DSESNTR+RY S+ DEDD  K  RTGS+Y+EETEH SRH+RKANESH       DIEEGKR+ RYEEHRGRKHER DE
Subjt:  RRAHSSRYDETRDGRYRENSRIDSESNTRSRY-SVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRYSRYEEHRGRKHERDDE

Query:  GLKSSREVERGEYQPSSRL---------------------------RSEKDYESRESRRDRD-DPRKRAKYDDSRSSRRDGY
         LKSSREVERGEYQPSS+L                           RSEKDYE+RESRRDR+ D RKRAKYDDSRSSRRD Y
Subjt:  GLKSSREVERGEYQPSSRL---------------------------RSEKDYESRESRRDRD-DPRKRAKYDDSRSSRRDGY

A0A6J1ESM6 dentin sialophosphoprotein-like (e-value: 0.0e+00; identity: 69.93%)
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKL+ILEDKLTDQGYT DEIS+KLKEAR+TLE 
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED

Query:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD
        ASGSEEKDGPSAI+LADK+VSDTQ+HQIAARKEEQMKTLR A GL SS+D+EQ+ EG+SD +R+RREGQNADIK HEK EHSFLDRELNWKKH  ED +D
Subjt:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD

Query:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGE-HKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNN
        DK DKKRV+KELKGH KD RRRPKDDSSD DS GE HKGTKKNLRDNRRNDS SD +SD D KY TSRKSKKNRRHD D SSD+DSGGERKGTKKH+R+N
Subjt:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGE-HKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNN

Query:  RRDDPESDLDTDVDQKYITSRKHKKNRRHDSDD-------------------------------------------------------------------
        RRD P+ D D++ DQKY TSRKHKKNRRHDSDD                                                                   
Subjt:  RRDDPESDLDTDVDQKYITSRKHKKNRRHDSDD-------------------------------------------------------------------

Query:  ---------------------------------------SSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGV
                                               SSD+DSGGEHKETKK++K+NRR D ESD DSD+DKKYT+SKK EKN+   SD+SDS  D  
Subjt:  ---------------------------------------SSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGV

Query:  EFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESE
        EFGMGSH+KGSGR KSQKV KKQ+ +KQESTDESNSDSGI DDKGRQLKHKNQHGKRYGV SDSSD  SSDSDVGRNKSKHRY SK  GK RVDSES+SE
Subjt:  EFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESE

Query:  KSRKHPKEDDVGRHRHDTD-DESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKH
        K RKHPK+ DVGR RHDTD DES DNSSSSDEIVK R  RRH++DDKSEEEGE+   +SGKIATK  +AAKR++D SD SDDS+AVDRKG+DK KRAKKH
Subjt:  KSRKHPKEDDVGRHRHDTD-DESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKH

Query:  SSGNGSDLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGEND-QPEAKSRNRNYARESDFHRDPKKD
        SSG+GSD ++G KSS G RERGKG+ NHADGLDESVTA +N  + SR D ++EF+      MK+KRK  EGGE++ QPEAKSR+R   RESDFH DPKKD
Subjt:  SSGNGSDLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGEND-QPEAKSRNRNYARESDFHRDPKKD

Query:  IKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRY--SRYEEHR
         K D +S+RRA S RY+ETRDGRYRE+ +IDSESN RSRYS  +ED++RKSTRTGSRY+EETEH SRH+ KANESH R RTDQDIEEGKR+  SRYEEHR
Subjt:  IKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRY--SRYEEHR

Query:  GRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRD
        GRKHER DEG+KSSRE ERGEYQPSSRLRSEKDYE++ES RDRDDPRKRAKY DSRSSRRD
Subjt:  GRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRD

A0A6J1K7B6 dentin sialophosphoprotein-like (e-value: 3.1e-310; identity: 69.51%)
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKL+ILEDKLTDQGYT DEIS+KLKEAR+TLE 
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLED

Query:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD
        ASGSEEKDGPSAI+LADK+VSDTQ+HQIAARKEEQMKTLR A GL SS+D+EQ+ EG+SD +R+RREGQNADIK  EK EHSFLDRELNWK+H  ED +D
Subjt:  ASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDD

Query:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGE-HKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNN
        DK DKKRV+KELKGH KD RRRPKDDSSD DS GE HKGTKKNLRDNRR DS SD +SD D KY TSRKSKKNRRHD D SSD+DSGGERKGTKKH+R+N
Subjt:  DKNDKKRVTKELKGHQKDRRRRPKDDSSDTDSGGE-HKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNN

Query:  RRDDPESDLDTDVDQKY-----------------------------------------------------ITSRKHKKNRRHDSDD--------------
        RRD P+ D D++ DQKY                                                     ITSRKHKKNRRHDSDD              
Subjt:  RRDDPESDLDTDVDQKY-----------------------------------------------------ITSRKHKKNRRHDSDD--------------

Query:  ---------------------------------------SSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGV
                                               SSD+DSGGEHKETKK++K+NRR D ESD DSD+DKKYT+SKK EKN+  DSD+SDS  D  
Subjt:  ---------------------------------------SSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGV

Query:  EFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESE
        EFGMGSH+KGSGRPKSQKV KKQ+ +KQESTDESNSDSGI DDKGRQLK+KNQHGKRYGV SDSSD  SSDSDVGRNKSKHRYHSK TGK RVDSES+SE
Subjt:  EFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQLKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESE

Query:  KSRKHPKEDDVGRHRHDTD-DESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKH
        K RKHPK+ DVGR RHDTD DES DNSSSSDEIVKRR  RRH++DDKS EEGE+   +SGKIATK  +AAKR+++ SD SDDS+AVDR+G+DK KRAKKH
Subjt:  KSRKHPKEDDVGRHRHDTD-DESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRRSGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKH

Query:  SSGNGSDLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGEND-QPEAKSRNRNYARESDFHRDPKKD
        S G+GSD ++G KSS G RERGKG+ NHADGLDESVTA +N  + SR DS++EF+      MK+KRK  EGGE++ QPEAKS++R   RESDFH DPKKD
Subjt:  SSGNGSDLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSH-----MKTKRKF-EGGEND-QPEAKSRNRNYARESDFHRDPKKD

Query:  IKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRY--SRYEEHR
         K D +S+RRA S R+ ETRDGRYRE+ +IDSESN RSRYS  +EDD+RKS RTGSRY+EETEH SRH+ KANESH R RTDQDIEEGKR   SRYEEHR
Subjt:  IKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGR-RTDQDIEEGKRY--SRYEEHR

Query:  GRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRD
        GRKHER DEG+KSSRE ERGEYQPSSRLRSEKDYE++ES RDRDDPRKRAKY DSRSSR D
Subjt:  GRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRD

SwissProt top hits (e-value, % identity, alignment)
P0CM94 Pre-mRNA-splicing factor CWC21 (e-value: 5.2e-10; identity: 27.8%)
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSK-----KPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEAR
        MY  +GL T RGSGTNGY+  N   +R + G       G   D     VSK      P++ ILEH+RKR++E+K++ L D+L ++G   D+I E+  + R
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSK-----KPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEAR

Query:  KTLEDASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAI
        + L   +   E+ G   +           TH +AA KE +M  L+ A G+  + +  +  +  ++  ++ R  +  + +  E+ E +      N K+   
Subjt:  KTLEDASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAI

Query:  EDQDDDKNDKKRVTKELKGHQKD
          Q+ ++ ++ R  +E K  ++D
Subjt:  EDQDDDKNDKKRVTKELKGHQKD

P0CM95 Pre-mRNA-splicing factor CWC21    5.2e-10    27.8
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSK-----KPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEAR
        MY  +GL T RGSGTNGY+  N   +R + G       G   D     VSK      P++ ILEH+RKR++E+K++ L D+L ++G   D+I E+  + R
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSK-----KPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEAR

Query:  KTLEDASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAI
        + L   +   E+ G   +           TH +AA KE +M  L+ A G+  + +  +  +  ++  ++ R  +  + +  E+ E +      N K+   
Subjt:  KTLEDASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAI

Query:  EDQDDDKNDKKRVTKELKGHQKD
          Q+ ++ ++ R  +E K  ++D
Subjt:  EDQDDDKNDKKRVTKELKGHQKD

Arabidopsis top hits    e value    %identity    Alignment
AT3G49601.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: mRNA splicing factor, Cwf21 (InterPro:IPR013170); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).    2.2e-48    34.17
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKT-GKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLE
        MYNGIGLQT RGSGTNGY+QTNKFFVRP+  GK  +  +GFE+D+GTAG+SKKPNK ILEHDRKRQI LKL ILEDKL DQGY+  EI++KL+EAR +LE
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKT-GKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLE

Query:  DASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQD
         A+ + E++       +D +VS+TQTHQ+AARKE+QM+  R A GL   D  +  +EG+ D     REG    +K  E+ EHSFLDR+   KK  +++  
Subjt:  DASGSEEKDGPSAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQD

Query:  DDKNDKKRVTKELKG------------HQKDRRRRPKDDSSDTDSGG---------EHKGTKKNLR-DNRRNDSVSDLDSDVDKK------YMTSRKSKK
        D+K+ K + +K+ +G             +K+ ++R  DDSS++D  G         + KG K+    D+  +DS SD DSD  KK        T++K  +
Subjt:  DDKNDKKRVTKELKG------------HQKDRRRRPKDDSSDTDSGG---------EHKGTKKNLR-DNRRNDSVSDLDSDVDKK------YMTSRKSKK

Query:  NRRHDIDDSSD---SDSGGERKGTKKHMRNNRRDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKK
         +R    +S +    DS   RK  KK + +NR    E  L    D++    RK     RHDS D S+ +S    +  +K  +  R    +   D DV+  
Subjt:  NRRHDIDDSSD---SDSGGERKGTKKHMRNNRRDDPESDLDTDVDQKYITSRKHKKNRRHDSDDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKK

Query:  YTSSKKLEKNRRHDSDNSDSLTDGVEFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQL---------KHKNQHGKRYGVYSDSSD
        +   +    +++   D+ DS     E    + K+   + +       QKR+++E   +   D    D +G+++         +++N+   +   Y     
Subjt:  YTSSKKLEKNRRHDSDNSDSLTDGVEFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQL---------KHKNQHGKRYGVYSDSSD

Query:  HGSSDSDVGRNKSKHRYHSKSTGKR----RVDSESESEKSRKHPKEDDVGRHRHDTDDESDDNSSSSDEIVKRRG------GRRHDTDDKSEEEGEFLDR
        H   + +   N  + RY      KR    + D +    ++ +   +DD GR+R   +   DD         + RG      G+  D DD+   E E+  R
Subjt:  HGSSDSDVGRNKSKHRYHSKSTGKR----RVDSESESEKSRKHPKEDDVGRHRHDTDDESDDNSSSSDEIVKRRG------GRRHDTDDKSEEEGEFLDR


Sequences
CDS sequence
ATGTATAACGGTATTGGATTACAGACGCCGAGAGGGTCTGGAACTAATGGGTACATACAGACGAACAAGTTCTTCGTGAGGCCTAAGACTGGAAAGGTTGCTGAAAGCAC
CAGAGGATTCGAAGAAGATCAGGGCACTGCTGGTGTTTCAAAAAAACCTAATAAAGACATTCTCGAACATGATCGCAAGCGTCAGATTGAGCTCAAGCTCCTCATACTCG
AGGATAAGCTCACTGACCAAGGTTATACGGCCGATGAAATTTCTGAAAAGTTGAAGGAGGCTCGCAAGACTTTGGAAGATGCTTCTGGTTCTGAGGAAAAAGATGGACCT
TCTGCCATCTTACTTGCTGATAAGAGGGTCTCGGATACACAGACTCACCAAATTGCTGCGAGAAAAGAGGAGCAGATGAAAACATTGAGAGTTGCTTTTGGGTTGGGCTC
ATCGGATGATACCGAACAGCTTAAAGAAGGAGTTTCTGATTCATCTAGAAGTAGAAGAGAGGGTCAAAATGCTGATATTAAGCATCATGAGAAGCCTGAACACTCTTTTT
TGGACAGAGAATTGAACTGGAAAAAGCATGCCATTGAAGATCAGGATGATGATAAGAACGACAAAAAAAGGGTTACTAAAGAGTTGAAAGGTCATCAGAAGGATAGGAGA
AGAAGGCCCAAGGATGATTCTTCTGATACTGATTCTGGTGGAGAGCATAAGGGAACCAAGAAGAACTTGAGAGATAATAGAAGGAATGATTCTGTAAGTGACCTTGACAG
TGATGTTGACAAGAAATACATGACCTCAAGAAAGTCTAAGAAAAATAGAAGACACGATATTGACGATTCTTCTGATTCTGATTCTGGTGGAGAGCGCAAGGGAACCAAGA
AGCACATGAGAAATAATCGAAGAGATGATCCTGAGAGTGACCTGGACACCGATGTTGACCAGAAATATATCACGTCAAGGAAGCATAAGAAAAACAGAAGGCATGATAGT
GATGATTCTTCTGATTCTGATTCTGGTGGAGAGCACAAGGAAACCAAGAAGAACATGAAAGATAATAGAAGAGATGATCATGAAAGTGATCGCGACAGTGACGTTGACAA
GAAATACACCAGCTCAAAGAAGCTGGAGAAAAATCGAAGGCATGATAGTGATAATTCTGATTCACTTACAGATGGTGTTGAGTTTGGGATGGGCAGCCACAAGAAAGGAT
CTGGTAGGCCTAAAAGTCAAAAGGTCAAGAAGAAGCAAAAAAGGCAGAAACAGGAGTCCACTGATGAATCCAATTCTGACAGTGGGATTGATGATGATAAAGGTAGGCAA
CTGAAGCACAAGAACCAGCATGGTAAAAGATATGGGGTTTATAGTGACAGCTCTGACCATGGCAGTTCTGATTCTGATGTAGGTCGCAACAAGAGCAAGCATAGATATCA
TAGCAAAAGTACAGGAAAACGCAGGGTAGATAGTGAATCCGAGTCTGAAAAGTCAAGAAAGCATCCTAAGGAAGATGATGTTGGGAGACACAGACATGATACTGATGATG
AAAGTGATGATAACAGCTCTAGCAGTGATGAAATAGTGAAGAGGCGTGGAGGAAGGAGGCACGATACTGATGATAAATCTGAAGAAGAAGGTGAATTTTTGGATAGAAGA
AGTGGTAAGATAGCCACAAAGGAAAAAATGGCTGCTAAAAGGCAATATGATGGCAGTGATAATTCTGATGATAGCAAAGCAGTTGATAGAAAGGGCCATGATAAACACAA
GAGAGCTAAGAAACATTCGTCTGGTAATGGTTCTGATCTAGAGAGGGGATTCAAATCAAGTAGTGGACTTCGTGAAAGAGGAAAAGGGAACTTAAATCATGCAGATGGTT
TGGATGAGTCAGTGACAGCAGATGAGAATAATTTGCACATGTCTAGGAAAGATTCTATCAATGAGTTCAGCCATATGAAAACCAAGAGAAAGTTCGAAGGTGGTGAAAAT
GACCAGCCAGAAGCAAAGTCTAGAAATCGAAATTATGCCAGAGAGTCGGATTTCCACAGGGACCCCAAGAAAGATATCAAAACTGATCCTGACTCAAACAGAAGAGCACA
CAGTAGTCGGTACGATGAGACAAGGGATGGAAGGTACAGGGAGAACTCCAGGATTGATTCTGAATCAAACACTAGATCACGCTATAGTGTGCGTGATGAGGATGACAACA
GAAAGTCGACTCGAACAGGAAGTAGATATAGTGAAGAAACAGAGCATGAAAGTAGACATCATCGTAAAGCTAACGAGTCTCACGGCCGCAGGACTGATCAAGATATTGAG
GAGGGAAAAAGGTACAGCAGATATGAGGAGCATAGAGGGAGAAAACATGAAAGAGATGATGAGGGTCTAAAATCCAGCAGGGAAGTTGAAAGGGGGGAGTATCAACCAAG
CAGCAGGCTGAGATCTGAGAAAGATTATGAAAGTAGAGAATCTAGGAGAGATAGGGATGATCCTAGAAAGAGGGCCAAATATGATGATTCTCGATCAAGCAGACGTGATG
GTTATTAA
mRNA sequence
GTGAAATAAGAATAAACCATCAATCCTCTTTTCGACCCCAACCGATCAATAAGTTATCACCGGTCGGCCGACTTCGATCAATCCTCTTTTCTTCCTTCAGATCCAGAATT
TTCTTCCATCAGATCCAGAAGGGGTGAGGACGACAGCTGGTAACCCTAATTTTCATCGGTCGACTTCGATCCATCCTCTTTTCTTCCATCAGATCCAGAACTTTTCTTAC
GCGACGGGCTACAAGGGTTGAGGTTGAGGACGACACTCAAACAGCTGGTTGGTGAAGCAGGGAGATGTATAACGGTATTGGATTACAGACGCCGAGAGGGTCTGGAACTA
ATGGGTACATACAGACGAACAAGTTCTTCGTGAGGCCTAAGACTGGAAAGGTTGCTGAAAGCACCAGAGGATTCGAAGAAGATCAGGGCACTGCTGGTGTTTCAAAAAAA
CCTAATAAAGACATTCTCGAACATGATCGCAAGCGTCAGATTGAGCTCAAGCTCCTCATACTCGAGGATAAGCTCACTGACCAAGGTTATACGGCCGATGAAATTTCTGA
AAAGTTGAAGGAGGCTCGCAAGACTTTGGAAGATGCTTCTGGTTCTGAGGAAAAAGATGGACCTTCTGCCATCTTACTTGCTGATAAGAGGGTCTCGGATACACAGACTC
ACCAAATTGCTGCGAGAAAAGAGGAGCAGATGAAAACATTGAGAGTTGCTTTTGGGTTGGGCTCATCGGATGATACCGAACAGCTTAAAGAAGGAGTTTCTGATTCATCT
AGAAGTAGAAGAGAGGGTCAAAATGCTGATATTAAGCATCATGAGAAGCCTGAACACTCTTTTTTGGACAGAGAATTGAACTGGAAAAAGCATGCCATTGAAGATCAGGA
TGATGATAAGAACGACAAAAAAAGGGTTACTAAAGAGTTGAAAGGTCATCAGAAGGATAGGAGAAGAAGGCCCAAGGATGATTCTTCTGATACTGATTCTGGTGGAGAGC
ATAAGGGAACCAAGAAGAACTTGAGAGATAATAGAAGGAATGATTCTGTAAGTGACCTTGACAGTGATGTTGACAAGAAATACATGACCTCAAGAAAGTCTAAGAAAAAT
AGAAGACACGATATTGACGATTCTTCTGATTCTGATTCTGGTGGAGAGCGCAAGGGAACCAAGAAGCACATGAGAAATAATCGAAGAGATGATCCTGAGAGTGACCTGGA
CACCGATGTTGACCAGAAATATATCACGTCAAGGAAGCATAAGAAAAACAGAAGGCATGATAGTGATGATTCTTCTGATTCTGATTCTGGTGGAGAGCACAAGGAAACCA
AGAAGAACATGAAAGATAATAGAAGAGATGATCATGAAAGTGATCGCGACAGTGACGTTGACAAGAAATACACCAGCTCAAAGAAGCTGGAGAAAAATCGAAGGCATGAT
AGTGATAATTCTGATTCACTTACAGATGGTGTTGAGTTTGGGATGGGCAGCCACAAGAAAGGATCTGGTAGGCCTAAAAGTCAAAAGGTCAAGAAGAAGCAAAAAAGGCA
GAAACAGGAGTCCACTGATGAATCCAATTCTGACAGTGGGATTGATGATGATAAAGGTAGGCAACTGAAGCACAAGAACCAGCATGGTAAAAGATATGGGGTTTATAGTG
ACAGCTCTGACCATGGCAGTTCTGATTCTGATGTAGGTCGCAACAAGAGCAAGCATAGATATCATAGCAAAAGTACAGGAAAACGCAGGGTAGATAGTGAATCCGAGTCT
GAAAAGTCAAGAAAGCATCCTAAGGAAGATGATGTTGGGAGACACAGACATGATACTGATGATGAAAGTGATGATAACAGCTCTAGCAGTGATGAAATAGTGAAGAGGCG
TGGAGGAAGGAGGCACGATACTGATGATAAATCTGAAGAAGAAGGTGAATTTTTGGATAGAAGAAGTGGTAAGATAGCCACAAAGGAAAAAATGGCTGCTAAAAGGCAAT
ATGATGGCAGTGATAATTCTGATGATAGCAAAGCAGTTGATAGAAAGGGCCATGATAAACACAAGAGAGCTAAGAAACATTCGTCTGGTAATGGTTCTGATCTAGAGAGG
GGATTCAAATCAAGTAGTGGACTTCGTGAAAGAGGAAAAGGGAACTTAAATCATGCAGATGGTTTGGATGAGTCAGTGACAGCAGATGAGAATAATTTGCACATGTCTAG
GAAAGATTCTATCAATGAGTTCAGCCATATGAAAACCAAGAGAAAGTTCGAAGGTGGTGAAAATGACCAGCCAGAAGCAAAGTCTAGAAATCGAAATTATGCCAGAGAGT
CGGATTTCCACAGGGACCCCAAGAAAGATATCAAAACTGATCCTGACTCAAACAGAAGAGCACACAGTAGTCGGTACGATGAGACAAGGGATGGAAGGTACAGGGAGAAC
TCCAGGATTGATTCTGAATCAAACACTAGATCACGCTATAGTGTGCGTGATGAGGATGACAACAGAAAGTCGACTCGAACAGGAAGTAGATATAGTGAAGAAACAGAGCA
TGAAAGTAGACATCATCGTAAAGCTAACGAGTCTCACGGCCGCAGGACTGATCAAGATATTGAGGAGGGAAAAAGGTACAGCAGATATGAGGAGCATAGAGGGAGAAAAC
ATGAAAGAGATGATGAGGGTCTAAAATCCAGCAGGGAAGTTGAAAGGGGGGAGTATCAACCAAGCAGCAGGCTGAGATCTGAGAAAGATTATGAAAGTAGAGAATCTAGG
AGAGATAGGGATGATCCTAGAAAGAGGGCCAAATATGATGATTCTCGATCAAGCAGACGTGATGGTTATTAAGGGTAAGGGTCTCATATTCGTATTTTAGCTTATTATCT
GCAACAACTCTAGATCTTGAATCCCGTTTAATCCTTTTTTCTGTAATGAAATAGCTCGTGAAATGGGCAGACATGTGAAATTAAGTTTGATTTCAGTCATCTTTTCTCTC
GCTGCATACTTGCTACAGAAAACCCTTACTATGGACAGATATGAAATTATCACTTCTTTAATTGTTAATTACTGAAATGGATTCCTCAGGTTTTAGAACGGTGATCCAAT
ATTAGACTTCAAAAGACATGGTGAGAGATTTGAATCTTTGACCTCTTGTTCGAGAGTATGTCTTAATTAGTTGAACTATGCTCACGTTGGTAATGGTAATCTAATGTTTG
TTTCACGTTGACAACGGTAATCTAATTTTTGATTCCATTTGCATAAGCAAATTTGTATAGACAGGAGGAGGCTTGACTTCTGGAGTAAATTTGAAACCTATACTGACATT
CTTATTATATCTCCATTTTTAAAAAGGGAATCTGTAAGACCCCAATTATGTGAGAAGGGGTAAAAGGGTAATTGATCCTAAACGGACGGGTGAGGTAAGGGAGTGAGTCT
TTTTTTGTAGGAGGGTTAATTTTTGGCTTTTGGGTAGTGGTACCTGGTCGGAGAGCTCCAGCCTCTCGAAAAGCTGTGAG
Protein sequence
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLLILEDKLTDQGYTADEISEKLKEARKTLEDASGSEEKDGP
SAILLADKRVSDTQTHQIAARKEEQMKTLRVAFGLGSSDDTEQLKEGVSDSSRSRREGQNADIKHHEKPEHSFLDRELNWKKHAIEDQDDDKNDKKRVTKELKGHQKDRR
RRPKDDSSDTDSGGEHKGTKKNLRDNRRNDSVSDLDSDVDKKYMTSRKSKKNRRHDIDDSSDSDSGGERKGTKKHMRNNRRDDPESDLDTDVDQKYITSRKHKKNRRHDS
DDSSDSDSGGEHKETKKNMKDNRRDDHESDRDSDVDKKYTSSKKLEKNRRHDSDNSDSLTDGVEFGMGSHKKGSGRPKSQKVKKKQKRQKQESTDESNSDSGIDDDKGRQ
LKHKNQHGKRYGVYSDSSDHGSSDSDVGRNKSKHRYHSKSTGKRRVDSESESEKSRKHPKEDDVGRHRHDTDDESDDNSSSSDEIVKRRGGRRHDTDDKSEEEGEFLDRR
SGKIATKEKMAAKRQYDGSDNSDDSKAVDRKGHDKHKRAKKHSSGNGSDLERGFKSSSGLRERGKGNLNHADGLDESVTADENNLHMSRKDSINEFSHMKTKRKFEGGEN
DQPEAKSRNRNYARESDFHRDPKKDIKTDPDSNRRAHSSRYDETRDGRYRENSRIDSESNTRSRYSVRDEDDNRKSTRTGSRYSEETEHESRHHRKANESHGRRTDQDIE
EGKRYSRYEEHRGRKHERDDEGLKSSREVERGEYQPSSRLRSEKDYESRESRRDRDDPRKRAKYDDSRSSRRDGY