CuGenDBv2

Clc08G13210 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G13210
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: cwf21 domain-containing protein
Genome location: ClcChr08:24402860..24411355
RNA-Seq Expression: Clc08G13210
Synteny: Clc08G13210
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains: IPR013170 - mRNA splicing factor Cwf21 domain


Homology
GenBank top hits (e value, % identity, alignment)
KAA0064984.1 dentin sialophosphoprotein-like [Cucumis melo var. makuwa]; e value: 0.0e+00; % identity: 81.37
Query:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA
        TGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT  EISEKL+EARE LEAA  SEEKDG+SAIVLADKRVSDTQTHQIA
Subjt:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA

Query:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD
        ARKEEQMKT RAALGLGSLDD EQ KEEISDPSR+RREGQNADIKRHEKSEHSFLDRELNWK+ GTEDQ+DDKD KK  SKELKGHQKD+KRR KDDSSD
Subjt:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD

Query:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS
        TDSGE KGTKKNLRDS + DSES+LD DV NKYVA+RKS KNRRHDSDDSS TDSGGE   TKKH R+KR+DDPE DSDSD DQKY+TSRKHKKNRRHDS
Subjt:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS

Query:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKVKKQRSWKQESTDESNSDSGI
        DDSS +DSGGEHKKTK+++R+ QR HGS PDSDVDKK+TSKKQ K+TRHDSDDSDSFTDGDK GM SH+KGSGRH+SQKVKKQRS KQ+STDE+NSDS +
Subjt:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKVKKQRSWKQESTDESNSDSGI

Query:  DDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRRH
        +DK R+LKHKNQHGKRYG ESDSSDHDSSDSDVGRKKSTHR  SKRTGK RVDSESD EKSRK+PKKDV RR HDIDDEKSGDNSSSSDE+VKRRRGRRH
Subjt:  DDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRRH

Query:  NTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSEE
        +TDD S EEGEYFGRS ++ TKG KI AKRQ D S+NSD SLAV RKG+D+HKRAKKYS+GDGF LEKG K S+GAR RGKGNLNH EGR H+TDDKS  
Subjt:  NTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSEE

Query:  EEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA--
         EEEGEY  RSGK+ATK K+DAKRQHDDS+NSDDSLAV      KHKRAKKYSS DDSDLEKGVKS+ GARE+GK   N ADGLDKFKKDSI+EF+HA  
Subjt:  EEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA--

Query:  HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGSR
         TD M SKRK DEG ENEQ+ E+K  NRNS        +PKKDFK+DSES RR+ SG YDETRD RYRED KIDSESN+RS YSA +E DDRKSTRTGSR
Subjt:  HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGSR

Query:  YNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRRD
        Y EETEHGSRHHRKANESHH  RTDQDT+EEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY   ESTRDR+DSRKRA+Y+SRSSR D
Subjt:  YNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRRD

Query:  NY
        N+
Subjt:  NY

KAG6598821.1 Endonuclease V, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.8e-296; % identity: 63.6
Query:  FKCEVGEVKRCITTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLA
        F+  VGE  RCITTGKVAE++RGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYT DEIS+KLKEARETLEAA  SEEKDG SAIVLA
Subjt:  FKCEVGEVKRCITTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLA

Query:  DKRVSDTQTHQIAARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQ
        DK+VSDTQ+HQIAARKEEQMKT RAALGL S +D+EQ  E ISDP+RNRREGQNADIKRHEKSEHSFLDRELNWKKHG+ED  DDK DKKRVSKELKGH 
Subjt:  DKRVSDTQTHQIAARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQ

Query:  KDRKRRSKDDSSDTDS-GE-RKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDS-------
        KDR RR KDDSSD DS GE  KGTKKNLRD+ +NDSESD +SD  +KY  +RKS KNRRHDSD SSDTDSGGER GTKKH+RD RRD P+ D        
Subjt:  KDRKRRSKDDSSDTDS-GE-RKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDS-------

Query:  ---------------------------------------------------------------------------------------------------D
                                                                                                           D
Subjt:  ---------------------------------------------------------------------------------------------------D

Query:  SDFDQKYITSRKHKKNRRHDSDDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKY-TSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQ
        S+FDQK+ITSRKHKKNRRHDSD SS TDSGGEHK+TKK+++N +RD  S  DSD+DKKY TSKKQ KN   DSDDSDS  D  +FGMGSH+KGSGR KSQ
Subjt:  SDFDQKYITSRKHKKNRRHDSDDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKY-TSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQ

Query:  KV-KKQRSWKQESTDESNSDSGIDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDID
        KV KKQRS KQESTDESNSDSGIDDK R+LKHKNQHGKRYGV+SDSSD DSSDSDVGR KS HR+ SKR GK RVDSESDSEK RKHPK DVGRR HD D
Subjt:  KV-KKQRSWKQESTDESNSDSGIDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDID

Query:  DEKSGDNSSSSDEIVKRRRGRRHNTDDESEEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARA
        +++SGDNSSSSDEIVKRRR RRHN+DD+SEEGEYFG                                                                
Subjt:  DEKSGDNSSSSDEIVKRRRGRRHNTDDESEEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARA

Query:  RGKGNLNHSEGRNHDTDDKSEEEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNF
                                       +SGK+ATKG I AKR+HDDSD SDDS AVDR+GNDK KRAKK+SSGD SD +KGVKSSGGARE+GKG+ 
Subjt:  RGKGNLNHSEGRNHDTDDKSEEEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNF

Query:  NHADGLD-----------KFKKDSINEFSHAHTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYRE
        NHADGLD           K + DS++EF+ A+  TMKSKRK DEGGE+EQQ EAK  +R STRESDFHG+PKKDFKNDSES RRA SG Y+E RD RYRE
Subjt:  NHADGLD-----------KFKKDSINEFSHAHTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYRE

Query:  DPKIDSESNSRSHYSARDEGDDRKSTRTGSRYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSR
        DPKIDSESN+RS YSA +E DDRKSTRTGSRY EETEHGSRH+ KANESHHRSRTDQD +E KRH  SRYEE RGRKHERDEG+KSSRE ERGEYQPSSR
Subjt:  DPKIDSESNSRSHYSARDEGDDRKSTRTGSRYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSR

Query:  QRSEKDYETRESTRDRDDSRKRARYDSRSSRRD
         RSEKDYET+ESTRDRDD RKRA+YDSRSSRRD
Subjt:  QRSEKDYETRESTRDRDDSRKRARYDSRSSRRD

XP_004138875.1 dentin sialophosphoprotein [Cucumis sativus]; e value: 0.0e+00; % identity: 80.18
Query:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA
        TGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL +QGYT  EISEKL+EARE LEAA  SEEKDG+SAIVLADKRVSDTQTHQIA
Subjt:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA

Query:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD
        ARKEEQMKT RAALGLGSLDD+EQ K+EISDPSRNRREGQNAD+KRHEKSEHSFLDR+LNWKK GTEDQYDDKD KK  SKE+K  QKD+KRRSKDDSSD
Subjt:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD

Query:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS
        TDSGERKGTKKNLRDS +NDSESDLD DV NKYVA+R S KNRRHDSDDSS+TDSGGE   TKKH R+KR+D+ E DSDSD DQKY+TSRKHKKNRRHDS
Subjt:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS

Query:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSH-KKGSGRHKSQKVKKQRSWKQESTDESNSDSG
        DDSS TDS GEHKKTKK++RN QR HGS  DSDVDKK+TSKKQ K+TRHDSD SDSFTDGDK GM SH KKGSGRH+S KVKKQRS KQ+STDE+NSDSG
Subjt:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSH-KKGSGRHKSQKVKKQRSWKQESTDESNSDSG

Query:  IDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRR
        I+DK R+LKHK+QHGKRYG ESDSSDHDSSDSDVGR KSTHR+ SK TGK RV+SESDSEKSRK+P KD  RR HDIDDEKSGDN SSSDE+VKRRRGRR
Subjt:  IDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRR

Query:  HNTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSE
        HN DD S EEGEYFGRS ++ TKG KI AKRQH DS+NSDDSLAV RKG+D HK+AKKY +GDGF LEKG K S+GAR RGKGNL+H+EGR H+TDDKS 
Subjt:  HNTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSE

Query:  EEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA-
          EEEGEY  RSGK+ATK KID KRQHDDS+NSDDSLAV      KHKRAKKYSS DDSDLEKGVKS+ GARE+GK   NHADGL KFKKDSINE +HA 
Subjt:  EEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA-

Query:  -HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGS
          TD M  KRK DEG E EQ+ E+K  NRNS        +PKKD K+DSES RR+ SG YD+TRD RYRED KIDSESN+RS YSA+ E DDRKS RTGS
Subjt:  -HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGS

Query:  RYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRR
        RY+EETEHGSRHHRKANESHH  RTDQDT+EEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDR+DSRKR +Y+SRSSRR
Subjt:  RYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRR

Query:  DNY
        DN+
Subjt:  DNY

XP_008445109.1 PREDICTED: dentin sialophosphoprotein-like [Cucumis melo]; e value: 0.0e+00; % identity: 81.04
Query:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA
        TGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT  EISEKL+EARE LEAA  SEEKDG+SAIVLADKRVSDTQTHQIA
Subjt:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA

Query:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD
        ARKEEQMKT RAALGLGSL D EQ KEEISDPSR+RREGQNADIKRHEKSEHSFLDRELNWK+ GTEDQ+DDKD KK  SKELKGHQKD+KRR KDD SD
Subjt:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD

Query:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS
         DSGE KGTKKNLRDS + DSESDLD DV NKYVA+RKS KNRRHDSDDSS TDSGGE   TKKH R+KR+DDPE DSDSD DQKY+TSRKHKKNRRHDS
Subjt:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS

Query:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKVKKQRSWKQESTDESNSDSGI
        DDSS +DSGGEHKKTK+++R+ QR HGS PDSDVDKK+TSKKQ K+TRHDSDDSDSFTDGDK GM SH+KGSGRH+SQKVKKQRS KQ+STDE+NSDS +
Subjt:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKVKKQRSWKQESTDESNSDSGI

Query:  DDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRRH
        +DK R+LKHKNQHGKRYG ESDSSDHDSSDSDVGRKKSTHR  SKRTGK RVDSESD EKSRK+PKKD  RR HDIDDEKSGDNSSSSDE+VKRRRGRRH
Subjt:  DDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRRH

Query:  NTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSEE
        +TDD S EEGEYFGRS ++ TKG KI AKRQ D S+NSD SLAV RKG+D+HKRAKKYS+GDGF LEKG K S+GAR RGKGNLNH EGR H+TDDKS  
Subjt:  NTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSEE

Query:  EEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA--
         EEEGEY  RSGK+ATK K+DAKRQHDDS+NSDDSLAV      KHKRAKKYSS DDSDLEKGVKS+ GARE+GK   N ADGLDKFKKDSI+EF+HA  
Subjt:  EEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA--

Query:  HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGSR
         TD M SKRK DEG ENEQ+ E+K  NRNS        +PKKDFK+DSES RR+ SG YDETRD RYRED KIDSESN+RS YSA +E DDRKSTRTGSR
Subjt:  HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGSR

Query:  YNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRRD
        Y EETEHGSRHHRKANESHH  RTDQDT+EEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY   ESTRDR+DSRKRA+Y+SRSSR D
Subjt:  YNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRRD

Query:  NY
        N+
Subjt:  NY

XP_038884695.1 dentin sialophosphoprotein-like [Benincasa hispida]; e value: 0.0e+00; % identity: 73.09
Query:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA
        TGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT DEISEKL+EARETLEAA  SEEKDG SAIVLADKRVSDTQTHQIA
Subjt:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA

Query:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD
        ARKEEQMKT RAALGLGS  DTEQ KEEISDPSR RREGQNADIKRHEKSEHSFLDRELNWKKHG EDQYDDKDDKKR+SKELKGHQK RKRR KDDSSD
Subjt:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD

Query:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS
        TDS +R     NLRDS +NDSESDLDSDVG+KYVA+R   KNRRHDSDDSSDTDSGGER GTKKH+RDKRRDDPE D DSDFDQKYITSRKHKKNRRHD 
Subjt:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS

Query:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYT-SKKQAKNTRHDSDDSDSFTDGDKFGM-GSHKKGSGRHKSQKVKKQRSWKQESTDESNSDS
        D+SS TDSGGEHKKTKKNMRN +R HGS P SD+DKKYT SKK  KN RHDSDDSDS TDGD+FGM GSHKKGS RHKSQKVK QRS KQESTDESNSDS
Subjt:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYT-SKKQAKNTRHDSDDSDSFTDGDKFGM-GSHKKGSGRHKSQKVKKQRSWKQESTDESNSDS

Query:  GIDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGR
        GID+KRR+LKH+NQHGKRYGVESDSSDHDSSDSDVG KKS HR+DSKR GK RVDSES+SEKSRKH KKD GR  HDID+EKSGDNSSS  EIVKRRRGR
Subjt:  GIDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGR

Query:  RHNTDDESEEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSE
         +N DD S                                                                                            
Subjt:  RHNTDDESEEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSE

Query:  EEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA-
          EEEGEY  RSGK+ATKGKIDAKRQHDD++NSDDSLAV RKGNDKHKRAKK SSGDDSDLEKGVK+SGGARE+GKG+ NHADGL+KFKKDSINEF+HA 
Subjt:  EEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA-

Query:  -HTDTMKSKRKFDEGGENEQQLEAKPINRNSTR-----------------------------------------ESDFHGNPKKDFKNDSESIRRAHSGW
          TDTM SKRKFDEGG+NEQQLE+K  NRNSTR                                         +SDFHGNPKK F+NDSES RRA SG 
Subjt:  -HTDTMKSKRKFDEGGENEQQLEAKPINRNSTR-----------------------------------------ESDFHGNPKKDFKNDSESIRRAHSGW

Query:  YDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGSRYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVE
        YDETRD RYREDPKIDSESN RS YS +DE DDRK+T+TGSR+ EETEHGSRHHRKANESHHRSRT +DT+EEKRHSRYEEPRGRKHER+EGLKS REVE
Subjt:  YDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGSRYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVE

Query:  RGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRRDNY
        RGEYQPSSR RSEKDYETRESTRDRDDSRKRA+Y+SRSSRRDN+
Subjt:  RGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRRDNY

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LQ00 cwf21 domain-containing protein; e value: 0.0e+00; % identity: 80.18
Query:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA
        TGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL +QGYT  EISEKL+EARE LEAA  SEEKDG+SAIVLADKRVSDTQTHQIA
Subjt:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA

Query:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD
        ARKEEQMKT RAALGLGSLDD+EQ K+EISDPSRNRREGQNAD+KRHEKSEHSFLDR+LNWKK GTEDQYDDKD KK  SKE+K  QKD+KRRSKDDSSD
Subjt:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD

Query:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS
        TDSGERKGTKKNLRDS +NDSESDLD DV NKYVA+R S KNRRHDSDDSS+TDSGGE   TKKH R+KR+D+ E DSDSD DQKY+TSRKHKKNRRHDS
Subjt:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS

Query:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSH-KKGSGRHKSQKVKKQRSWKQESTDESNSDSG
        DDSS TDS GEHKKTKK++RN QR HGS  DSDVDKK+TSKKQ K+TRHDSD SDSFTDGDK GM SH KKGSGRH+S KVKKQRS KQ+STDE+NSDSG
Subjt:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSH-KKGSGRHKSQKVKKQRSWKQESTDESNSDSG

Query:  IDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRR
        I+DK R+LKHK+QHGKRYG ESDSSDHDSSDSDVGR KSTHR+ SK TGK RV+SESDSEKSRK+P KD  RR HDIDDEKSGDN SSSDE+VKRRRGRR
Subjt:  IDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRR

Query:  HNTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSE
        HN DD S EEGEYFGRS ++ TKG KI AKRQH DS+NSDDSLAV RKG+D HK+AKKY +GDGF LEKG K S+GAR RGKGNL+H+EGR H+TDDKS 
Subjt:  HNTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSE

Query:  EEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA-
          EEEGEY  RSGK+ATK KID KRQHDDS+NSDDSLAV      KHKRAKKYSS DDSDLEKGVKS+ GARE+GK   NHADGL KFKKDSINE +HA 
Subjt:  EEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA-

Query:  -HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGS
          TD M  KRK DEG E EQ+ E+K  NRNS        +PKKD K+DSES RR+ SG YD+TRD RYRED KIDSESN+RS YSA+ E DDRKS RTGS
Subjt:  -HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGS

Query:  RYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRR
        RY+EETEHGSRHHRKANESHH  RTDQDT+EEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDR+DSRKR +Y+SRSSRR
Subjt:  RYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRR

Query:  DNY
        DN+
Subjt:  DNY

A0A1S3BBX0 dentin sialophosphoprotein-like; e value: 0.0e+00; % identity: 81.04
Query:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA
        TGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT  EISEKL+EARE LEAA  SEEKDG+SAIVLADKRVSDTQTHQIA
Subjt:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA

Query:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD
        ARKEEQMKT RAALGLGSL D EQ KEEISDPSR+RREGQNADIKRHEKSEHSFLDRELNWK+ GTEDQ+DDKD KK  SKELKGHQKD+KRR KDD SD
Subjt:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD

Query:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS
         DSGE KGTKKNLRDS + DSESDLD DV NKYVA+RKS KNRRHDSDDSS TDSGGE   TKKH R+KR+DDPE DSDSD DQKY+TSRKHKKNRRHDS
Subjt:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS

Query:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKVKKQRSWKQESTDESNSDSGI
        DDSS +DSGGEHKKTK+++R+ QR HGS PDSDVDKK+TSKKQ K+TRHDSDDSDSFTDGDK GM SH+KGSGRH+SQKVKKQRS KQ+STDE+NSDS +
Subjt:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKVKKQRSWKQESTDESNSDSGI

Query:  DDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRRH
        +DK R+LKHKNQHGKRYG ESDSSDHDSSDSDVGRKKSTHR  SKRTGK RVDSESD EKSRK+PKKD  RR HDIDDEKSGDNSSSSDE+VKRRRGRRH
Subjt:  DDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRRH

Query:  NTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSEE
        +TDD S EEGEYFGRS ++ TKG KI AKRQ D S+NSD SLAV RKG+D+HKRAKKYS+GDGF LEKG K S+GAR RGKGNLNH EGR H+TDDKS  
Subjt:  NTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSEE

Query:  EEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA--
         EEEGEY  RSGK+ATK K+DAKRQHDDS+NSDDSLAV      KHKRAKKYSS DDSDLEKGVKS+ GARE+GK   N ADGLDKFKKDSI+EF+HA  
Subjt:  EEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA--

Query:  HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGSR
         TD M SKRK DEG ENEQ+ E+K  NRNS        +PKKDFK+DSES RR+ SG YDETRD RYRED KIDSESN+RS YSA +E DDRKSTRTGSR
Subjt:  HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGSR

Query:  YNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRRD
        Y EETEHGSRHHRKANESHH  RTDQDT+EEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY   ESTRDR+DSRKRA+Y+SRSSR D
Subjt:  YNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRRD

Query:  NY
        N+
Subjt:  NY

A0A5A7VCH8 Dentin sialophosphoprotein-like; e value: 0.0e+00; % identity: 81.37
Query:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA
        TGKVAES+RGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT  EISEKL+EARE LEAA  SEEKDG+SAIVLADKRVSDTQTHQIA
Subjt:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA

Query:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD
        ARKEEQMKT RAALGLGSLDD EQ KEEISDPSR+RREGQNADIKRHEKSEHSFLDRELNWK+ GTEDQ+DDKD KK  SKELKGHQKD+KRR KDDSSD
Subjt:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD

Query:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS
        TDSGE KGTKKNLRDS + DSES+LD DV NKYVA+RKS KNRRHDSDDSS TDSGGE   TKKH R+KR+DDPE DSDSD DQKY+TSRKHKKNRRHDS
Subjt:  TDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDS

Query:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKVKKQRSWKQESTDESNSDSGI
        DDSS +DSGGEHKKTK+++R+ QR HGS PDSDVDKK+TSKKQ K+TRHDSDDSDSFTDGDK GM SH+KGSGRH+SQKVKKQRS KQ+STDE+NSDS +
Subjt:  DDSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKVKKQRSWKQESTDESNSDSGI

Query:  DDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRRH
        +DK R+LKHKNQHGKRYG ESDSSDHDSSDSDVGRKKSTHR  SKRTGK RVDSESD EKSRK+PKKDV RR HDIDDEKSGDNSSSSDE+VKRRRGRRH
Subjt:  DDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRRH

Query:  NTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSEE
        +TDD S EEGEYFGRS ++ TKG KI AKRQ D S+NSD SLAV RKG+D+HKRAKKYS+GDGF LEKG K S+GAR RGKGNLNH EGR H+TDDKS  
Subjt:  NTDDES-EEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSEE

Query:  EEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA--
         EEEGEY  RSGK+ATK K+DAKRQHDDS+NSDDSLAV      KHKRAKKYSS DDSDLEKGVKS+ GARE+GK   N ADGLDKFKKDSI+EF+HA  
Subjt:  EEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHA--

Query:  HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGSR
         TD M SKRK DEG ENEQ+ E+K  NRNS        +PKKDFK+DSES RR+ SG YDETRD RYRED KIDSESN+RS YSA +E DDRKSTRTGSR
Subjt:  HTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGSR

Query:  YNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRRD
        Y EETEHGSRHHRKANESHH  RTDQDT+EEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDY   ESTRDR+DSRKRA+Y+SRSSR D
Subjt:  YNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRRD

Query:  NY
        N+
Subjt:  NY

A0A6J1ESM6 dentin sialophosphoprotein-like; e value: 4.2e-291; % identity: 63.53
Query:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA
        TGKVAE++RGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYT DEIS+KLKEARETLEAA  SEEKDG SAIVLADK+VSDTQ+HQIA
Subjt:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA

Query:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD
        ARKEEQMKT RAALGL S +D+EQ  E ISDP+RNRREGQNADIKRHEKSEHSFLDRELNWKKHG+ED  DDK DKKRVSKELKGH KDR RR KDDSSD
Subjt:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD

Query:  TDS-GE-RKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRH
         DS GE  KGTKKNLRD+ +NDSESD +SD  +KY  +RKS KNRRHDSD SSDTDSGGER GTKKH+RD RRD P+ D DS+FDQKY TSRKHKKNRRH
Subjt:  TDS-GE-RKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRH

Query:  DSDD------------------------------------------------------------------------------------------------
        DSDD                                                                                                
Subjt:  DSDD------------------------------------------------------------------------------------------------

Query:  ----------SSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKY-TSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKV-KKQRSWKQES
                  SS TDSGGEHK+TKK+++N +RD  S  DSD+DKKY TSKKQ KN    SDDSDS  D  +FGMGSH+KGSGR KSQKV KKQR  KQES
Subjt:  ----------SSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKY-TSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKV-KKQRSWKQES

Query:  TDESNSDSGIDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDE
        TDESNSDSGIDDK R+LKHKNQHGKRYGV+SDSSD DSSDSDVGR KS HR+ SKR GK RVDSESDSEK RKHPKKDVGRR HD D+++SGDNSSSSDE
Subjt:  TDESNSDSGIDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDE

Query:  IVKRRRGRRHNTDDESEEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRN
        IVK RR RRHN+DD+S                                                                                    
Subjt:  IVKRRRGRRHNTDDESEEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRN

Query:  HDTDDKSEEEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLD------
                  EEEGEYF +SGK+ATKG I AKR+HDDSD SDDS AVDRKGNDK KRAKK+SSGD SD +KGVKSSGGARE+GKG+ NHADGLD      
Subjt:  HDTDDKSEEEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLD------

Query:  -----KFKKDSINEFSHAHTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSH
             K + D ++EF+ A+  TMKSKRK DEGGE+EQQ EAK  +R STRESDFHG+PKKDFKNDSES RRA SG Y+ETRD RYREDPKIDSESN+RS 
Subjt:  -----KFKKDSINEFSHAHTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSH

Query:  YSARDEGDDRKSTRTGSRYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETREST
        YSA +E +DRKSTRTGSRY EETEHGSRH+ KANESHHRSRTDQD +E KRH  SRYEE RGRKHERDEG+KSSRE ERGEYQPSSR RSEKDYET+EST
Subjt:  YSARDEGDDRKSTRTGSRYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETREST

Query:  RDRDDSRKRARYDSRSSRRD
        RDRDD RKRA+YDSRSSRRD
Subjt:  RDRDDSRKRARYDSRSSRRD

A0A6J1K7B6 dentin sialophosphoprotein-like; e value: 4.8e-287; % identity: 63.14
Query:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA
        TGKVAE++RGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYT DEIS+KLKEARETLEAA  SEEKDG SAIVLADK+VSDTQ+HQIA
Subjt:  TGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAA--SEEKDGASAIVLADKRVSDTQTHQIA

Query:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD
        ARKEEQMKT RAALGL S +D+EQ  E ISDP+RNRREGQNADIKR EKSEHSFLDRELNWK+HG+ED  DDK DKKRVSKELKGH KDR RR KDDSSD
Subjt:  ARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRSKDDSSD

Query:  TDS-GE-RKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKY------------
         DS GE  KGTKKNLRD+ + DSESD +SD  +KY  +RKS KNRRHDSD SSDTDSGGER GTKKH+RD RRD P+ D DS+FDQKY            
Subjt:  TDS-GE-RKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKY------------

Query:  -----------------------------------------ITSRKHKKNRRHDSDD-------------------------------------------
                                                 ITSRKHKKNRRHDSDD                                           
Subjt:  -----------------------------------------ITSRKHKKNRRHDSDD-------------------------------------------

Query:  ----------SSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKY-TSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKV-KKQRSWKQES
                  SS TDSGGEHK+TKK+++N +RD  S  DSD+DKKY TSKKQ KN   DSDDSDS  D  +FGMGSH+KGSGR KSQKV KKQRS KQES
Subjt:  ----------SSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKY-TSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKV-KKQRSWKQES

Query:  TDESNSDSGIDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDE
        TDESNSDSGIDDK R+LK+KNQHGKRYGV+SDSSD DSSDSDVGR KS HR+ SKRTGK RVDSESDSEK RKHPKKDVGRR HD D+++SGDNSSSSDE
Subjt:  TDESNSDSGIDDKRRKLKHKNQHGKRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDE

Query:  IVKRRRGRRHNTDDESEEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRN
        IVKRRR RRHN+DD+SEEGEYFG                                                                             
Subjt:  IVKRRRGRRHNTDDESEEGEYFGRSDRVDTKGKKIAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRN

Query:  HDTDDKSEEEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLD------
                          +SGK+ATKG I AKR+H+DSD SDDS AVDR+GNDK KRAKK+S GD SD +KGVKSSGGARE+GKG+ NHADGLD      
Subjt:  HDTDDKSEEEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDSLAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLD------

Query:  -----KFKKDSINEFSHAHTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSH
             K + DS++EF+ A+  TMKSKRK DEGGE+EQQ EAK  +R STRESDFHG+PKKDFKNDSES RRA SG + ETRD RYREDPKIDSESN+RS 
Subjt:  -----KFKKDSINEFSHAHTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKNDSESIRRAHSGWYDETRDRRYREDPKIDSESNSRSH

Query:  YSARDEGDDRKSTRTGSRYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETREST
        YSA +E DDRKS RTGSRY EETEHGSRH+ KANESHHRSRTDQD +E KR   SRYEE RGRKHERDEG+KSSRE ERGEYQPSSR RSEKDYET+EST
Subjt:  YSARDEGDDRKSTRTGSRYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRH--SRYEEPRGRKHERDEGLKSSREVERGEYQPSSRQRSEKDYETREST

Query:  RDRDDSRKRARYDSRSSRRD
        RDRDD RKRA+YDSRSSR D
Subjt:  RDRDDSRKRARYDSRSSRRD

SwissProt top hits    e value    %identity    Alignment
No hits found
Arabidopsis top hits    e value    %identity    Alignment
AT3G49601.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: mRNA splicing factor, Cwf21 (InterPro:IPR013170); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).    1.4e-36    33.39
Query:  GKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAASEEKDGASAIVLADKRVSDTQTHQIAARK
        GK  +  +GFE+D+GTAG+SKKPNK ILEHDRKRQI LKL ILEDKL DQGY+  EI++KL+EAR +LEAA+   +  S     D +VS+TQTHQ+AARK
Subjt:  GKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAASEEKDGASAIVLADKRVSDTQTHQIAARK

Query:  EEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKG------------HQKDRK
        E+QM+ FRAALG   L D +Q  EE        REG    +K  E+ EHSFLDR+   KK   ++  D+KD K + SK+ +G             +K+ K
Subjt:  EEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKG------------HQKDRK

Query:  RRSKDDSSDTDS---GERKGTKKNLR--------DSIKNDSESDLDSDVGNKYVAAR--KSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDS
        +R  DDSS++D      R+ +KK  +        DS  +DSESD DSD G K    +  K+ K R       S      E + +KK  +  ++  P   S
Subjt:  RRSKDDSSDTDS---GERKGTKKNLR--------DSIKNDSESDLDSDVGNKYVAAR--KSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDS

Query:  DSDFDQKYITSRKHKKNRRHDSD--DSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVD---KKYTSKKQAKNTRHDSDDSD----------SFTDGDKF
         S   +     +     +RHDSD  +    D+    +K ++  R  Q+      D + D    +YT  +  K    DSDDS+          S  +    
Subjt:  DSDFDQKYITSRKHKKNRRHDSD--DSSHTDSGGEHKKTKKNMRNYQRDHGSHPDSDVD---KKYTSKKQAKNTRHDSDDSD----------SFTDGDKF

Query:  GMGSHKKGS---GRHKSQKVKKQRSWKQESTDESNSDSGIDDKRRKLKHKNQHGKRYGVESD-SSDHDSSD----SDVGRKKSTHRHDSKRTGKRRVDSE
        GM   +K      +H   K +     K+ + D  +S++  +++++      Q G+++  E D  +D+   D     D  ++  T + D  R   R ++ E
Subjt:  GMGSHKKGS---GRHKSQKVKKQRSWKQESTDESNSDSGIDDKRRKLKHKNQHGKRYGVESD-SSDHDSSD----SDVGRKKSTHRHDSKRTGKRRVDSE

Query:  SDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRRHNTDDE-SEEGEYFGR
         D ++ R  P+++  +   D ++ K G +    D   +R  G+  + DD  S E EY  R
Subjt:  SDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRRHNTDDE-SEEGEYFGR


Sequences
CDS sequence
ATGGTACTACCGCTTTCCCTTGAGAGTTCCTTTCTTATCCAATTCCTAACATTTATTTACATGGTTGCCCTTTATGAACTTATCCTCTTCCAACCAAGGAAAGATAGGAG
ATCAATCTTGTGTGAGTCACCAAATGACCTTTGGAGCATTCACTTAAACTGCATTGATCTCAACCCGAGTAACCTTATGAGGACAACTCCCAAATCCCCGCCTCTACCCA
AGGCCTCATTAACAAAAAAAGGGTACTTCGTAATAGCCGACAACATCATTCAGCAAATTGATGACCTTCAAATCTCTCCGTCTCTTGCACTCTCTCCATCTCTGCTCTCT
CAAGCTCTCTTTGATTTCGTTGTCGTCTCTCACTCTCGGTTGTTCTTCAAGTGTGAGGTTGGTGAAGTGAAAAGATGTATAACGACCGGAAAGGTTGCTGAAAGCTCTAG
AGGATTCGAAGAAGATCAGGGCACTGCTGGAGTTTCAAAGAAACCGAATAAAGACATTCTCGAACACGATCGCAAGCGTCAGATTGAACTCAAACTTGTCATACTTGAGG
ACAAGCTCACTGACCAAGGTTATACCATGGATGAAATTTCCGAGAAGTTAAAGGAGGCTCGTGAGACTTTGGAAGCTGCTTCAGAGGAAAAAGATGGAGCTTCTGCCATC
GTACTTGCAGATAAGAGGGTATCAGATACACAGACTCACCAAATTGCTGCAAGAAAGGAGGAGCAGATGAAAACATTTAGAGCTGCTCTTGGGTTGGGCTCATTGGACGA
TACTGAACAGGCTAAAGAAGAGATTTCTGATCCATCAAGAAATAGAAGAGAGGGTCAGAATGCTGATATTAAGCGTCATGAGAAGTCTGAACATTCTTTCTTGGATAGAG
AATTGAACTGGAAAAAGCATGGCACTGAAGATCAGTATGATGATAAGGATGACAAAAAAAGGGTTTCAAAAGAGTTGAAAGGTCATCAGAAGGATAGAAAAAGAAGGTCC
AAGGATGATTCTTCTGACACCGATTCTGGTGAGCGTAAGGGAACCAAGAAGAACTTGAGAGATAGTATAAAGAATGATTCTGAAAGTGACCTTGACAGTGATGTTGGCAA
CAAATATGTCGCGGCAAGGAAGTCTATAAAAAATAGAAGGCACGATAGTGATGATTCTTCTGATACTGATTCTGGTGGTGAGCGCAACGGAACCAAGAAGCACATGAGAG
ATAAACGGAGAGATGATCCTGAAATTGACTCAGACAGCGATTTTGACCAGAAATATATCACCTCAAGGAAGCATAAGAAAAACAGAAGGCATGATAGTGATGATTCTTCT
CATACTGATTCTGGTGGAGAGCACAAGAAAACCAAGAAGAATATGAGAAATTATCAAAGAGATCATGGAAGTCATCCTGACAGTGACGTTGATAAGAAATACACCTCAAA
GAAGCAGGCAAAAAACACAAGGCATGATAGTGATGATTCTGATTCGTTTACAGATGGTGATAAGTTTGGGATGGGCAGCCACAAGAAAGGATCTGGTAGACATAAAAGTC
AAAAGGTGAAAAAGCAAAGAAGCTGGAAACAGGAGTCTACTGATGAATCCAATTCTGACAGTGGGATTGATGATAAACGCAGAAAACTGAAGCACAAAAACCAGCATGGT
AAAAGATATGGGGTAGAAAGTGACAGCTCTGACCATGACAGTTCTGATTCTGATGTAGGTCGCAAGAAGAGTACGCATAGGCATGACAGCAAGCGTACAGGAAAGAGGAG
GGTAGATAGTGAATCCGATTCTGAAAAGTCGAGAAAGCATCCTAAGAAAGATGTTGGCAGACGCGGACATGATATTGATGATGAAAAAAGTGGTGATAACAGCTCTAGCA
GTGATGAAATAGTGAAGAGGCGCAGAGGTAGGAGGCACAATACTGATGATGAATCTGAAGAAGGTGAATATTTTGGTAGAAGTGATAGGGTAGACACAAAGGGAAAGAAA
ATAGCTGCTAAAAGGCAACATGATGACAGTGATAATTCTGATGATAGCCTAGCAGTTGGTAGAAAGGGCAATGATAAACACAAGAGAGCTAAGAAATATTCGACGGGTGA
TGGCTTTGGTCTAGAGAAGGGAGTAAAATCAAGCAATGGAGCACGTGCAAGAGGAAAAGGGAACTTAAATCATTCAGAAGGTAGGAACCACGATACGGATGATAAATCTG
AAGAAGAAGAAGAAGAAGGTGAATATTTTGATAGAAGTGGAAAGCTAGCGACAAAAGGTAAAATAGATGCTAAAAGGCAACACGATGACAGTGATAATTCTGATGATAGC
CTTGCAGTTGATAGAAAGGGCAATGATAAACACAAGAGAGCTAAGAAATATTCGTCGGGTGATGATTCTGATCTAGAGAAGGGAGTAAAATCAAGTGGTGGAGCTCGTGA
AAAAGGAAAAGGGAACTTTAATCATGCAGATGGTTTGGACAAGTTTAAGAAAGATTCCATCAATGAGTTCAGCCATGCACATACAGATACAATGAAAAGCAAGAGAAAGT
TTGATGAAGGTGGTGAAAATGAGCAGCAGCTAGAGGCAAAGCCTATAAATCGCAATTCTACAAGAGAGTCAGATTTCCATGGCAACCCCAAGAAAGATTTCAAAAATGAT
TCTGAATCAATCCGAAGAGCACACAGTGGCTGGTATGATGAAACAAGGGATCGGCGGTACAGGGAAGATCCCAAAATTGACTCTGAATCAAACTCTAGATCACACTATAG
TGCACGTGATGAGGGTGACGACAGAAAGTCAACTCGAACAGGAAGCAGATATAATGAAGAAACAGAGCATGGAAGTAGACATCACCGCAAGGCTAACGAGTCTCATCATC
GCAGCAGGACTGATCAAGATACTGACGAGGAAAAAAGGCATAGCAGATATGAGGAGCCTAGAGGGAGAAAACACGAAAGAGACGAAGGTCTAAAATCGAGCAGGGAAGTT
GAAAGAGGGGAGTATCAACCAAGTAGCAGGCAGAGATCTGAGAAAGATTATGAAACTAGAGAATCTACAAGAGATCGGGATGATTCCAGAAAGAGGGCCAGATATGATTC
TCGATCAAGCAGACGTGATAATTATTAG
mRNA sequence
ATGGTACTACCGCTTTCCCTTGAGAGTTCCTTTCTTATCCAATTCCTAACATTTATTTACATGGTTGCCCTTTATGAACTTATCCTCTTCCAACCAAGGAAAGATAGGAG
ATCAATCTTGTGTGAGTCACCAAATGACCTTTGGAGCATTCACTTAAACTGCATTGATCTCAACCCGAGTAACCTTATGAGGACAACTCCCAAATCCCCGCCTCTACCCA
AGGCCTCATTAACAAAAAAAGGGTACTTCGTAATAGCCGACAACATCATTCAGCAAATTGATGACCTTCAAATCTCTCCGTCTCTTGCACTCTCTCCATCTCTGCTCTCT
CAAGCTCTCTTTGATTTCGTTGTCGTCTCTCACTCTCGGTTGTTCTTCAAGTGTGAGGTTGGTGAAGTGAAAAGATGTATAACGACCGGAAAGGTTGCTGAAAGCTCTAG
AGGATTCGAAGAAGATCAGGGCACTGCTGGAGTTTCAAAGAAACCGAATAAAGACATTCTCGAACACGATCGCAAGCGTCAGATTGAACTCAAACTTGTCATACTTGAGG
ACAAGCTCACTGACCAAGGTTATACCATGGATGAAATTTCCGAGAAGTTAAAGGAGGCTCGTGAGACTTTGGAAGCTGCTTCAGAGGAAAAAGATGGAGCTTCTGCCATC
GTACTTGCAGATAAGAGGGTATCAGATACACAGACTCACCAAATTGCTGCAAGAAAGGAGGAGCAGATGAAAACATTTAGAGCTGCTCTTGGGTTGGGCTCATTGGACGA
TACTGAACAGGCTAAAGAAGAGATTTCTGATCCATCAAGAAATAGAAGAGAGGGTCAGAATGCTGATATTAAGCGTCATGAGAAGTCTGAACATTCTTTCTTGGATAGAG
AATTGAACTGGAAAAAGCATGGCACTGAAGATCAGTATGATGATAAGGATGACAAAAAAAGGGTTTCAAAAGAGTTGAAAGGTCATCAGAAGGATAGAAAAAGAAGGTCC
AAGGATGATTCTTCTGACACCGATTCTGGTGAGCGTAAGGGAACCAAGAAGAACTTGAGAGATAGTATAAAGAATGATTCTGAAAGTGACCTTGACAGTGATGTTGGCAA
CAAATATGTCGCGGCAAGGAAGTCTATAAAAAATAGAAGGCACGATAGTGATGATTCTTCTGATACTGATTCTGGTGGTGAGCGCAACGGAACCAAGAAGCACATGAGAG
ATAAACGGAGAGATGATCCTGAAATTGACTCAGACAGCGATTTTGACCAGAAATATATCACCTCAAGGAAGCATAAGAAAAACAGAAGGCATGATAGTGATGATTCTTCT
CATACTGATTCTGGTGGAGAGCACAAGAAAACCAAGAAGAATATGAGAAATTATCAAAGAGATCATGGAAGTCATCCTGACAGTGACGTTGATAAGAAATACACCTCAAA
GAAGCAGGCAAAAAACACAAGGCATGATAGTGATGATTCTGATTCGTTTACAGATGGTGATAAGTTTGGGATGGGCAGCCACAAGAAAGGATCTGGTAGACATAAAAGTC
AAAAGGTGAAAAAGCAAAGAAGCTGGAAACAGGAGTCTACTGATGAATCCAATTCTGACAGTGGGATTGATGATAAACGCAGAAAACTGAAGCACAAAAACCAGCATGGT
AAAAGATATGGGGTAGAAAGTGACAGCTCTGACCATGACAGTTCTGATTCTGATGTAGGTCGCAAGAAGAGTACGCATAGGCATGACAGCAAGCGTACAGGAAAGAGGAG
GGTAGATAGTGAATCCGATTCTGAAAAGTCGAGAAAGCATCCTAAGAAAGATGTTGGCAGACGCGGACATGATATTGATGATGAAAAAAGTGGTGATAACAGCTCTAGCA
GTGATGAAATAGTGAAGAGGCGCAGAGGTAGGAGGCACAATACTGATGATGAATCTGAAGAAGGTGAATATTTTGGTAGAAGTGATAGGGTAGACACAAAGGGAAAGAAA
ATAGCTGCTAAAAGGCAACATGATGACAGTGATAATTCTGATGATAGCCTAGCAGTTGGTAGAAAGGGCAATGATAAACACAAGAGAGCTAAGAAATATTCGACGGGTGA
TGGCTTTGGTCTAGAGAAGGGAGTAAAATCAAGCAATGGAGCACGTGCAAGAGGAAAAGGGAACTTAAATCATTCAGAAGGTAGGAACCACGATACGGATGATAAATCTG
AAGAAGAAGAAGAAGAAGGTGAATATTTTGATAGAAGTGGAAAGCTAGCGACAAAAGGTAAAATAGATGCTAAAAGGCAACACGATGACAGTGATAATTCTGATGATAGC
CTTGCAGTTGATAGAAAGGGCAATGATAAACACAAGAGAGCTAAGAAATATTCGTCGGGTGATGATTCTGATCTAGAGAAGGGAGTAAAATCAAGTGGTGGAGCTCGTGA
AAAAGGAAAAGGGAACTTTAATCATGCAGATGGTTTGGACAAGTTTAAGAAAGATTCCATCAATGAGTTCAGCCATGCACATACAGATACAATGAAAAGCAAGAGAAAGT
TTGATGAAGGTGGTGAAAATGAGCAGCAGCTAGAGGCAAAGCCTATAAATCGCAATTCTACAAGAGAGTCAGATTTCCATGGCAACCCCAAGAAAGATTTCAAAAATGAT
TCTGAATCAATCCGAAGAGCACACAGTGGCTGGTATGATGAAACAAGGGATCGGCGGTACAGGGAAGATCCCAAAATTGACTCTGAATCAAACTCTAGATCACACTATAG
TGCACGTGATGAGGGTGACGACAGAAAGTCAACTCGAACAGGAAGCAGATATAATGAAGAAACAGAGCATGGAAGTAGACATCACCGCAAGGCTAACGAGTCTCATCATC
GCAGCAGGACTGATCAAGATACTGACGAGGAAAAAAGGCATAGCAGATATGAGGAGCCTAGAGGGAGAAAACACGAAAGAGACGAAGGTCTAAAATCGAGCAGGGAAGTT
GAAAGAGGGGAGTATCAACCAAGTAGCAGGCAGAGATCTGAGAAAGATTATGAAACTAGAGAATCTACAAGAGATCGGGATGATTCCAGAAAGAGGGCCAGATATGATTC
TCGATCAAGCAGACGTGATAATTATTAG
Protein sequence
MVLPLSLESSFLIQFLTFIYMVALYELILFQPRKDRRSILCESPNDLWSIHLNCIDLNPSNLMRTTPKSPPLPKASLTKKGYFVIADNIIQQIDDLQISPSLALSPSLLS
QALFDFVVVSHSRLFFKCEVGEVKRCITTGKVAESSRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLTDQGYTMDEISEKLKEARETLEAASEEKDGASAI
VLADKRVSDTQTHQIAARKEEQMKTFRAALGLGSLDDTEQAKEEISDPSRNRREGQNADIKRHEKSEHSFLDRELNWKKHGTEDQYDDKDDKKRVSKELKGHQKDRKRRS
KDDSSDTDSGERKGTKKNLRDSIKNDSESDLDSDVGNKYVAARKSIKNRRHDSDDSSDTDSGGERNGTKKHMRDKRRDDPEIDSDSDFDQKYITSRKHKKNRRHDSDDSS
HTDSGGEHKKTKKNMRNYQRDHGSHPDSDVDKKYTSKKQAKNTRHDSDDSDSFTDGDKFGMGSHKKGSGRHKSQKVKKQRSWKQESTDESNSDSGIDDKRRKLKHKNQHG
KRYGVESDSSDHDSSDSDVGRKKSTHRHDSKRTGKRRVDSESDSEKSRKHPKKDVGRRGHDIDDEKSGDNSSSSDEIVKRRRGRRHNTDDESEEGEYFGRSDRVDTKGKK
IAAKRQHDDSDNSDDSLAVGRKGNDKHKRAKKYSTGDGFGLEKGVKSSNGARARGKGNLNHSEGRNHDTDDKSEEEEEEGEYFDRSGKLATKGKIDAKRQHDDSDNSDDS
LAVDRKGNDKHKRAKKYSSGDDSDLEKGVKSSGGAREKGKGNFNHADGLDKFKKDSINEFSHAHTDTMKSKRKFDEGGENEQQLEAKPINRNSTRESDFHGNPKKDFKND
SESIRRAHSGWYDETRDRRYREDPKIDSESNSRSHYSARDEGDDRKSTRTGSRYNEETEHGSRHHRKANESHHRSRTDQDTDEEKRHSRYEEPRGRKHERDEGLKSSREV
ERGEYQPSSRQRSEKDYETRESTRDRDDSRKRARYDSRSSRRDNY