; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS001269 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS001269
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptioncwf21 domain-containing protein
Genome locationscaffold36:2799819..2802368
RNA-Seq ExpressionMS001269
SyntenyMS001269
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR013170 - mRNA splicing factor Cwf21 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131365.1 protein starmaker [Momordica charantia]0.0e+0098.6Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA

Query:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND
        ASDQEEKGGPSAIVL DKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGEN+DIKRREKSEHAFLDRELNWKKHAKE HND
Subjt:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND

Query:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDNR
        DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGE+K+AKKNLRDNR
Subjt:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDNR

Query:  RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDDFGKGRHK
        RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLR YQRDDHESDPDSDVDKKYFTSKKQGKSK HDSDDSDSVTDDDDFGKGRHK
Subjt:  RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDDFGKGRHK

Query:  KGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKED
        KGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKED
Subjt:  KGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKED

Query:  VGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK
        VGRHRHDTDDKESGDFS+SSDEKVERRKSKRYDTDDES+GGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK
Subjt:  VGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK

Query:  GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRA
        GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRA
Subjt:  GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRA

Query:  GNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRHIRYEE
        GNNRYDEMRDGWHREDPKVDS+SNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANE+HRHGSASPDIEEGKRHIRYEE
Subjt:  GNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRHIRYEE

XP_022929608.1 dentin sialophosphoprotein-like [Cucurbita moschata]1.8e-27264.84Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT++E+S+KLKEAR+TLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA

Query:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND
        AS  EEK GPSAIVL+DK++SDTQ+HQIAARKEEQMKTLRAALGL S +DSEQ+ EGISDP  N REG+NADIKR EKSEH+FLDRELNWKKH  E+HND
Subjt:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND

Query:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGE-HKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDN
        DK  K RVSKE KGH KDR RRPKDDSSD DS GE HKGTKKNLRDNRR+DSESD +SD D KY TSR+ KKNRRHDSD SSDTDSGGE K  KK+LRDN
Subjt:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGE-HKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDN

Query:  RRDDPESDPDSDVDKKYITSRKHKKNRRHDSDD------------------------------------------------------SSTDSGRDHKGTK
        RRD P+ DPDS+ D+KY TSRKHKKNRRHDSDD                                                      S TDSG + KGT 
Subjt:  RRDDPESDPDSDVDKKYITSRKHKKNRRHDSDD------------------------------------------------------SSTDSGRDHKGTK

Query:  KNLRAYQRD----------------------------------------------------DHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDD
        K+LR  +RD                                                    D ESD DSD+DKKY TSKKQ K+K   SDDSDS  D  +
Subjt:  KNLRAYQRD----------------------------------------------------DHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDD

Query:  FGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS
        FG G H+KGSGR KSQKV KK   RKQESTDESNSD G DDKGR  +HKN  GKR   DSDSSD D SDSDVGRNKSKHRY S+  GK +VDSE D+EK 
Subjt:  FGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS

Query:  RKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIG
        RKHPK+DVGR RHDTD+ ESGD S SSDE V+ R+ +R+++DD+S+  GE F  KSGKIATKG IAAK+++DDSD SDDS+AVDRKG +K +RAKKH+ G
Subjt:  RKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIG

Query:  DGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKI
        DGS  +KG KSSGGARE GKG+ NHADGLDE VTA  N SYKSR D +DEFN ANQ TMKSKRK DEGGE+EQ+ EAKSR+R STRE  F+GD KKD K 
Subjt:  DGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKI

Query:  DSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRH--IRYEE
        DS S+ RA + RY+E RDG +REDPK+DS+SN R+RYS+ H+ED+  K  RTGS+Y EETEHGSRH  KANE+H       DIEEGKRH   RYEE
Subjt:  DSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRH--IRYEE

XP_022997381.1 dentin sialophosphoprotein-like [Cucurbita maxima]1.2e-27164.73Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT++E+S+KLKEAR+TLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA

Query:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND
        AS  EEK GPSAIVL+DK++SDTQ+HQIAARKEEQMKTLRAALGL S +DSEQ+ EGISDP  N REG+NADIKR+EKSEH+FLDRELNWK+H  E+HND
Subjt:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND

Query:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGE-HKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDN
        DK  K RVSKE KGH KDR RRPKDDSSD DS GE HKGTKKNLRDNRR DSESD +SD D KY TSR+ KKNRRHDSD SSDTDSGGE K  KK+LRDN
Subjt:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGE-HKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDN

Query:  RRDDPESDPDSDVDKKY-----------------------------------------------------ITSRKHKKNRRHDSDDSS-TDSGRDHKGTK
        RRD P+ DPDS+ D+KY                                                     ITSRKHKKNRRHDSDDSS TDSG + KGT 
Subjt:  RRDDPESDPDSDVDKKY-----------------------------------------------------ITSRKHKKNRRHDSDDSS-TDSGRDHKGTK

Query:  KNLRAYQRD----------------------------------------------------DHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDD
        K+LR  +RD                                                    D ESD DSD+DKKY TSKKQ K+K  DSDDSDS  D  +
Subjt:  KNLRAYQRD----------------------------------------------------DHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDD

Query:  FGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS
        FG G H+KGSGRPKSQKV KK  SRKQESTDESNSD G DDKGR  ++KN  GKR   DSDSSD D SDSDVGRNKSKHRYHS+  GK +VDSE D+EK 
Subjt:  FGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS

Query:  RKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIG
        RKHPK+DVGR RHDTD+ ESGD S SSDE V+RR+ +R+++DD+S+ G   +  KSGKIATKG IAAK++++DSD SDDS+AVDR+G +K +RAKKH+ G
Subjt:  RKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIG

Query:  DGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKI
        DGS  +KG KSSGGARE GKG+ NHADGLDE VTA  N SYKSR D++DEFN ANQ TMKSKRK DEGGE+EQ+ EAKS++R STRE  F+GD KKD K 
Subjt:  DGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKI

Query:  DSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRH--IRYEE
        DS S+ RA + R+ E RDG +REDPK+DS+SN R+RYS+ H+EDD  K  RTGS+Y EETEHGSRH  KANE+H       DIEEGKR    RYEE
Subjt:  DSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRH--IRYEE

XP_023545728.1 protein starmaker-like [Cucurbita pepo subsp. pepo]2.7e-27665.18Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT++E+S+KLKEAR+TLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA

Query:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND
        AS  EEK GPSAIVL+DK++SDTQ+HQIAARKEEQMKTLRAALGL S +DSEQ+ EGISDP  N REG+NADIKR+EKSEH+FLDRELNWKKH  E+HND
Subjt:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND

Query:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGE-HKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDN
        DK  K RVSKE KGH KDR RRPKDDSSD DS GE HKGTKKNLRDNRR+DSESD +SD D+KY TSR+ KKNRRHDSD SSDTDSGGE K  KK+LRDN
Subjt:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGE-HKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDN

Query:  RRDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS------------------------------------------------------TDSGRDHKGTK
        RRD P+ DPDS+ D+KY TSRKHKKNRRHDSDDSS                                                       DSG + KGT 
Subjt:  RRDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS------------------------------------------------------TDSGRDHKGTK

Query:  KNLRAYQRD----------------------------------------------------DHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDD
        K+LR  +RD                                                    D ESD DSDVDKKY TSKKQ K+K  DSDDSDS  D  +
Subjt:  KNLRAYQRD----------------------------------------------------DHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDD

Query:  FGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS
        FG G H+KGSGRPKSQKV KK  SRKQESTDESNSD G DDKGR  +HKN  GKR   DSDSSD D SDSDVGRNKSKHRYHS+  GK +VDSE D+EK 
Subjt:  FGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS

Query:  RKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIG
        RKHPK+DVGR RHDTD+ ESGD S SSDE V+RR+ +RY++DD+S+ G   +  KSGK ATKG IAAK+++DDSD SDDS+A+DR+G +K +RAKKH+ G
Subjt:  RKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIG

Query:  DGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKI
        DGS  +KG KSSGGARE GKG+ NHADGLDE VTA  N SYKSR D++DEFN ANQ TMKSKRK DEGGE+EQ+ EAKSR+R STRE  F+GD KKD K 
Subjt:  DGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKI

Query:  DSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRH--IRYEE
        DS S+ RA + RY+E RDG +RE+PK+DS+SNTR+RYS+ H+EDD  K  RTGS+Y EETEHGSRH  KANE+H       DIEEGKRH   RYEE
Subjt:  DSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRH--IRYEE

XP_038884695.1 dentin sialophosphoprotein-like [Benincasa hispida]1.1e-26167.07Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYT +E+SEKL+EAR+TLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA

Query:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND
        AS  EEK GPSAIVL+DKR+SDTQTHQIAARKEEQMKTLRAALGLGS  D+EQ+KE ISDP    REG+NADIKR EKSEH+FLDRELNWKKH  E+  D
Subjt:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND

Query:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDNR
        DKD K R+SKE KGH+K RKRRPKDDSSDTDS        +NLRD+RR+DSESD+DSDV  KY+ SR   KNRRHDSDDSSDTDSGGE K  KK+LRD R
Subjt:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDNR

Query:  RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS-TDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDDFGK-GR
        RDDPESDPDSD D+KYITSRKHKKNRRHD D+SS TDSG +HK TKKN+R  +R  H SDP SD+DKKY  SKK  K++ HDSDDSDS+TD D+FG  G 
Subjt:  RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS-TDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDDFGK-GR

Query:  HKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPK
        HKKGS R KSQKVK +  SRKQESTDESNSD G D+K R  +H+N  GKR   +SDSSDHD SDSDVG  KSKHRY S+  GK +VDSE ++EKSRKH K
Subjt:  HKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPK

Query:  EDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGL
        +D GRHRHD D+++SGD S S  E V+RR+ + Y+ DD S+  GE   R SGKIATKGKI AK+Q+DD+++SDDS AV RKG +KH+RAKK + GD S L
Subjt:  EDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGL

Query:  EKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQH--TMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKK-------
        EKG K+SGGARE GKG+LNHADGL+           K +KD+I+EFNHA+Q   TM SKRKFDEGG+NEQ+ E+KSRNRNSTR   F+GD KK       
Subjt:  EKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQH--TMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKK-------

Query:  --------------------DSKIDSG--------------SNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEH
                            D KIDS               S+ RA + RYDE RDG +REDPK+DS+SN R+RY S+ DEDD  K  +TGS++ EETEH
Subjt:  --------------------DSKIDSG--------------SNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEH

Query:  GSRHYRKANEAHRHGSASPDIEEGKRHIRYEE
        GSRH+RKANE+H       D EE KRH RYEE
Subjt:  GSRHYRKANEAHRHGSASPDIEEGKRHIRYEE

TrEMBL top hitse value%identityAlignment
A0A1S3BBX0 dentin sialophosphoprotein-like2.9e-25264Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT++E+SEKL+EAR+ LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA

Query:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND
        AS  EEK G SAIVL+DKR+SDTQTHQIAARKEEQMKTLRAALGLGSL D EQ+KE ISDP  + REG+NADIKR EKSEH+FLDRELNWK+   E+  D
Subjt:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND

Query:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDNR
        DKD K   SKE KGH+KD+KRRPKDD SD DS GEHKGTKKNLRD+RR DSESD+D DV+ KY+ SR+ KKNRRHDSDDSS TDSGGEHK  KK+ R+ R
Subjt:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDNR

Query:  RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS-TDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDDFGKGRH
        +DDPESD DSD+D+KY+TSRKHKKNRRHDSDDSS +DSG +HK TK+++R+ QR  H SDPDSDVDKK+ TSKKQ KS  HDSDDSDS TD D  G   H
Subjt:  RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS-TDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDDFGKGRH

Query:  KKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKE
        +KGSGR +SQKVKK+  S+KQ+STDE+NSD   +DK R  +HKN  GK R  +SDSSDHD SDSDVGR KS HR+HS+  GK +VDSE D EKSRK+PK+
Subjt:  KKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKE

Query:  DVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLE
        D  R RHD DD++SGD S SSDE V+RR+ +R+ TDD S+  GE F R SGKI TKGKI AK+Q D S++SD S AVDRKG ++H+RAKK++ GDG  LE
Subjt:  DVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLE

Query:  KGFKSSGGAREGGKGNLNHADGL---------------------------------DEPVTADDNNSYKS------------------------------
        KG K S GARE GKGNLNH +G                                  D+   +DD+ + K                               
Subjt:  KGFKSSGGAREGGKGNLNHADGL---------------------------------DEPVTADDNNSYKS------------------------------

Query:  --------RKDAIDEFNHANQHT--MKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNT
                +KD+I EFNHA+Q T  M SKRK DEG ENEQ  E+KSRNRNS        D KKD K DS S+ R+ + RYDE RDG +RED K+DS+SNT
Subjt:  --------RKDAIDEFNHANQHT--MKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNT

Query:  RARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRHIRYEE
        R+RYS+ H+EDD  K  RTGS+Y EETEHGSRH+RKANE+H H     D EE KRH RYEE
Subjt:  RARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRHIRYEE

A0A5A7VCH8 Dentin sialophosphoprotein-like1.1e-25464.34Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT++E+SEKL+EAR+ LEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA

Query:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND
        AS  EEK G SAIVL+DKR+SDTQTHQIAARKEEQMKTLRAALGLGSLDD EQ+KE ISDP  + REG+NADIKR EKSEH+FLDRELNWK+   E+  D
Subjt:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND

Query:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDNR
        DKD K   SKE KGH+KD+KRRPKDDSSDTDS GEHKGTKKNLRD+RR DSES++D DV+ KY+ SR+ KKNRRHDSDDSS TDSGGEHK  KK+ R+ R
Subjt:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDNR

Query:  RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS-TDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDDFGKGRH
        +DDPESD DSD+D+KY+TSRKHKKNRRHDSDDSS +DSG +HK TK+++R+ QR  H SDPDSDVDKK+ TSKKQ KS  HDSDDSDS TD D  G   H
Subjt:  RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS-TDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDDFGKGRH

Query:  KKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKE
        +KGSGR +SQKVKK+  S+KQ+STDE+NSD   +DK R  +HKN  GK R  +SDSSDHD SDSDVGR KS HR+HS+  GK +VDSE D EKSRK+PK+
Subjt:  KKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKE

Query:  DVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLE
        DV R RHD DD++SGD S SSDE V+RR+ +R+ TDD S+  GE F R SGKI TKGKI AK+Q D S++SD S AVDRKG ++H+RAKK++ GDG  LE
Subjt:  DVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLE

Query:  KGFKSSGGAREGGKGNLNHADGL---------------------------------DEPVTADDNNSYKS------------------------------
        KG K S GARE GKGNLNH +G                                  D+   +DD+ + K                               
Subjt:  KGFKSSGGAREGGKGNLNHADGL---------------------------------DEPVTADDNNSYKS------------------------------

Query:  --------RKDAIDEFNHANQHT--MKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNT
                +KD+I EFNHA+Q T  M SKRK DEG ENEQ  E+KSRNRNS        D KKD K DS S+ R+ + RYDE RDG +RED K+DS+SNT
Subjt:  --------RKDAIDEFNHANQHT--MKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNT

Query:  RARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRHIRYEE
        R+RYS+ H+EDD  K  RTGS+Y EETEHGSRH+RKANE+H H     D EE KRH RYEE
Subjt:  RARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRHIRYEE

A0A6J1BPI2 protein starmaker0.0e+0098.6Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA

Query:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND
        ASDQEEKGGPSAIVL DKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGEN+DIKRREKSEHAFLDRELNWKKHAKE HND
Subjt:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND

Query:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDNR
        DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGE+K+AKKNLRDNR
Subjt:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDNR

Query:  RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDDFGKGRHK
        RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLR YQRDDHESDPDSDVDKKYFTSKKQGKSK HDSDDSDSVTDDDDFGKGRHK
Subjt:  RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDDFGKGRHK

Query:  KGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKED
        KGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKED
Subjt:  KGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKED

Query:  VGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK
        VGRHRHDTDDKESGDFS+SSDEKVERRKSKRYDTDDES+GGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK
Subjt:  VGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK

Query:  GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRA
        GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRA
Subjt:  GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRA

Query:  GNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRHIRYEE
        GNNRYDEMRDGWHREDPKVDS+SNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANE+HRHGSASPDIEEGKRHIRYEE
Subjt:  GNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRHIRYEE

A0A6J1ESM6 dentin sialophosphoprotein-like8.8e-27364.84Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT++E+S+KLKEAR+TLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA

Query:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND
        AS  EEK GPSAIVL+DK++SDTQ+HQIAARKEEQMKTLRAALGL S +DSEQ+ EGISDP  N REG+NADIKR EKSEH+FLDRELNWKKH  E+HND
Subjt:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND

Query:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGE-HKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDN
        DK  K RVSKE KGH KDR RRPKDDSSD DS GE HKGTKKNLRDNRR+DSESD +SD D KY TSR+ KKNRRHDSD SSDTDSGGE K  KK+LRDN
Subjt:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGE-HKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDN

Query:  RRDDPESDPDSDVDKKYITSRKHKKNRRHDSDD------------------------------------------------------SSTDSGRDHKGTK
        RRD P+ DPDS+ D+KY TSRKHKKNRRHDSDD                                                      S TDSG + KGT 
Subjt:  RRDDPESDPDSDVDKKYITSRKHKKNRRHDSDD------------------------------------------------------SSTDSGRDHKGTK

Query:  KNLRAYQRD----------------------------------------------------DHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDD
        K+LR  +RD                                                    D ESD DSD+DKKY TSKKQ K+K   SDDSDS  D  +
Subjt:  KNLRAYQRD----------------------------------------------------DHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDD

Query:  FGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS
        FG G H+KGSGR KSQKV KK   RKQESTDESNSD G DDKGR  +HKN  GKR   DSDSSD D SDSDVGRNKSKHRY S+  GK +VDSE D+EK 
Subjt:  FGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS

Query:  RKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIG
        RKHPK+DVGR RHDTD+ ESGD S SSDE V+ R+ +R+++DD+S+  GE F  KSGKIATKG IAAK+++DDSD SDDS+AVDRKG +K +RAKKH+ G
Subjt:  RKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIG

Query:  DGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKI
        DGS  +KG KSSGGARE GKG+ NHADGLDE VTA  N SYKSR D +DEFN ANQ TMKSKRK DEGGE+EQ+ EAKSR+R STRE  F+GD KKD K 
Subjt:  DGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKI

Query:  DSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRH--IRYEE
        DS S+ RA + RY+E RDG +REDPK+DS+SN R+RYS+ H+ED+  K  RTGS+Y EETEHGSRH  KANE+H       DIEEGKRH   RYEE
Subjt:  DSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRH--IRYEE

A0A6J1K7B6 dentin sialophosphoprotein-like5.7e-27264.73Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA
        MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKL DQGYT++E+S+KLKEAR+TLEA
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEA

Query:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND
        AS  EEK GPSAIVL+DK++SDTQ+HQIAARKEEQMKTLRAALGL S +DSEQ+ EGISDP  N REG+NADIKR+EKSEH+FLDRELNWK+H  E+HND
Subjt:  ASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHND

Query:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGE-HKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDN
        DK  K RVSKE KGH KDR RRPKDDSSD DS GE HKGTKKNLRDNRR DSESD +SD D KY TSR+ KKNRRHDSD SSDTDSGGE K  KK+LRDN
Subjt:  DKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGE-HKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDN

Query:  RRDDPESDPDSDVDKKY-----------------------------------------------------ITSRKHKKNRRHDSDDSS-TDSGRDHKGTK
        RRD P+ DPDS+ D+KY                                                     ITSRKHKKNRRHDSDDSS TDSG + KGT 
Subjt:  RRDDPESDPDSDVDKKY-----------------------------------------------------ITSRKHKKNRRHDSDDSS-TDSGRDHKGTK

Query:  KNLRAYQRD----------------------------------------------------DHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDD
        K+LR  +RD                                                    D ESD DSD+DKKY TSKKQ K+K  DSDDSDS  D  +
Subjt:  KNLRAYQRD----------------------------------------------------DHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDD

Query:  FGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS
        FG G H+KGSGRPKSQKV KK  SRKQESTDESNSD G DDKGR  ++KN  GKR   DSDSSD D SDSDVGRNKSKHRYHS+  GK +VDSE D+EK 
Subjt:  FGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS

Query:  RKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIG
        RKHPK+DVGR RHDTD+ ESGD S SSDE V+RR+ +R+++DD+S+ G   +  KSGKIATKG IAAK++++DSD SDDS+AVDR+G +K +RAKKH+ G
Subjt:  RKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIG

Query:  DGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKI
        DGS  +KG KSSGGARE GKG+ NHADGLDE VTA  N SYKSR D++DEFN ANQ TMKSKRK DEGGE+EQ+ EAKS++R STRE  F+GD KKD K 
Subjt:  DGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKI

Query:  DSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRH--IRYEE
        DS S+ RA + R+ E RDG +REDPK+DS+SN R+RYS+ H+EDD  K  RTGS+Y EETEHGSRH  KANE+H       DIEEGKR    RYEE
Subjt:  DSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGSASPDIEEGKRH--IRYEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G49601.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: mRNA splicing factor, Cwf21 (InterPro:IPR013170); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).6.2e-5336.01Show/hide
Query:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKT-GKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLE
        MYNGIGLQT RGSGTNGY+QTNKFFVRP+  GK  +  +GFE+D+GTAG+SKKPNK ILEHDRKRQI LKL ILEDKL DQGY+D E+++KL+EAR +LE
Subjt:  MYNGIGLQTPRGSGTNGYIQTNKFFVRPKT-GKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLE

Query:  AASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHN
        AA+   E+        SD ++S+TQTHQ+AARKE+QM+  RAALGL   D  +  +EGI D  E  REG    +K  E+ EH+FLDR+   KK   +E  
Subjt:  AASDQEEKGGPSAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHN

Query:  DDKDKKIRVSKESKG------------HKKDRKRRPKDDSSDTDSGG---------EHKGTKKNLR-DNRRDDSESDIDSDVDKK-------YITSRRYK
        D+KD K++ SK+ +G             KK+ K+R  DDSS++D  G         + KG K+    D+   DSESD DSD  KK         T +R +
Subjt:  DDKDKKIRVSKESKG------------HKKDRKRRPKDDSSDTDSGG---------EHKGTKKNLR-DNRRDDSESDIDSDVDKK-------YITSRRYK

Query:  KNR--RHDSDDSSDTDSGGEHKRAKKNLRDNRRDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKY
        + R    +S++    DS    K  KK+L  NR    E       DK    SR  +K  RHDSD S  +S  + +  +K   AY+    +   D DV+  +
Subjt:  KNR--RHDSDDSSDTDSGGEHKRAKKNLRDNRRDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKY

Query:  FTSKKQGKSKGHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRN
           +       +  DD  +  D DD       K   R K +     +  +++E  D +         G+     + +GK    DSD S+ +  +    +N
Subjt:  FTSKKQGKSKGHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRN

Query:  KS--KHRYHSRSVGKDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDD
        +S  + R H R   +D  +   D  +     K   G  + D D           D+   R + +R    D+ +      DR  G     G+ A  K+ DD
Subjt:  KS--KHRYHSRSVGKDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSGKIATKGKIAAKKQYDD

Query:  SDSSDDSRAVDRKGRNKHERAK
         D     R    +GR++++ ++
Subjt:  SDSSDDSRAVDRKGRNKHERAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATAACGGTATTGGGTTACAGACGCCAAGAGGGTCTGGTACAAATGGTTACATTCAGACCAACAAGTTCTTTGTGAGGCCGAAGACCGGAAAGGTTGCTGAAAGCAC
CAGAGGATTCGAAGAAGACCAAGGTACTGCAGGTGTTTCCAAAAAACCTAACAAGGATATCCTTGAACATGATCGCAAGCGTCAGATTGAACTCAAGCTCGTCATACTCG
AGGACAAGCTCATCGACCAAGGTTATACAGATGAGGAGGTTTCTGAAAAGTTGAAAGAGGCTCGAAAAACATTGGAAGCTGCATCAGATCAGGAAGAAAAAGGTGGACCT
TCGGCCATAGTACTTTCAGATAAGAGGATCTCAGATACGCAGACCCACCAAATTGCTGCGAGAAAGGAGGAGCAGATGAAAACATTGAGAGCTGCTCTTGGGTTGGGCTC
ATTGGATGATAGTGAGCAACTTAAAGAAGGGATTTCTGATCCATTAGAAAATAGCAGAGAAGGTGAAAATGCTGATATTAAGCGTCGTGAGAAGTCTGAACATGCTTTCT
TGGATAGAGAATTGAACTGGAAAAAGCATGCAAAGGAAGAACACAATGATGATAAGGATAAAAAAATAAGGGTTTCAAAGGAGTCCAAAGGTCATAAGAAAGATAGGAAA
AGAAGGCCCAAGGATGATTCTTCTGATACTGATTCTGGTGGAGAACATAAGGGAACCAAGAAGAACTTGAGAGATAATCGAAGAGATGATTCTGAAAGTGACATTGACAG
CGATGTTGACAAGAAATACATCACCTCACGGAGGTATAAGAAAAACAGGAGGCATGATAGTGATGATTCTTCTGATACTGATTCTGGCGGAGAGCATAAGAGAGCCAAGA
AGAACTTGAGAGATAATAGAAGAGATGATCCTGAAAGTGACCCTGACAGCGATGTTGACAAGAAATACATCACCTCAAGGAAGCATAAGAAAAACAGAAGGCATGATAGT
GATGATTCTTCTACTGATTCTGGTAGAGACCATAAAGGAACCAAGAAAAACTTGAGAGCATATCAAAGAGATGATCATGAAAGTGATCCTGACAGTGATGTTGACAAGAA
ATACTTCACCTCAAAGAAGCAGGGGAAAAGCAAAGGACATGATAGTGATGATTCTGATTCAGTTACAGATGATGATGATTTCGGGAAGGGCAGACATAAGAAAGGATCTG
GTAGACCTAAAAGTCAAAAGGTGAAGAAGAAGCTAGGAAGCCGGAAACAGGAGTCCACTGATGAATCCAATTCTGACGTTGGGACTGATGATAAAGGCAGGCTATCGAGG
CACAAGAACCATCAGGGTAAAAGACGCAGGGCAGATAGTGATAGCTCTGACCATGACGGTTCTGATTCAGATGTAGGTCGCAACAAGAGTAAGCATAGGTACCATAGCAG
AAGTGTGGGGAAGGACAAGGTAGATAGTGAATTTGATACCGAAAAGTCAAGAAAGCATCCTAAAGAAGATGTTGGGAGACACAGGCATGATACTGATGACAAAGAAAGTG
GTGATTTTAGCTATAGCAGTGATGAAAAAGTGGAGAGGCGCAAAAGTAAGAGGTATGATACTGATGATGAATCTGATGGAGGAGGTGAACGTTTTGATAGGAAAAGTGGT
AAGATAGCCACAAAGGGGAAAATAGCCGCTAAAAAGCAATATGATGACAGTGATAGTTCTGATGATAGTCGTGCAGTTGATAGAAAAGGCCGTAATAAGCACGAGAGAGC
TAAGAAACATACGATAGGTGATGGTTCTGGTCTAGAGAAGGGATTCAAATCAAGTGGTGGAGCTCGCGAAGGAGGAAAAGGGAATTTAAATCATGCCGATGGTTTGGATG
AGCCGGTGACTGCAGATGATAACAATTCGTACAAGTCTAGGAAAGATGCTATCGATGAGTTCAACCATGCAAATCAACATACAATGAAAAGCAAGAGAAAATTTGATGAG
GGTGGTGAAAATGAGCAGCGAGAAGCAAAATCTAGAAACCGAAATTCTACAAGAGAGTTGGGTTTCTATGGAGACCTCAAGAAGGATTCCAAAATTGATTCTGGATCAAA
CAGTAGAGCAGGCAATAATAGGTATGATGAGATGAGGGATGGATGGCACAGGGAGGACCCAAAAGTTGATTCCAAATCAAATACTAGAGCACGCTATAGTAGTATGCATG
ATGAGGATGACCGCAGCAAGTTGGATCGAACAGGAAGCAAATATAATGAAGAAACAGAGCATGGAAGTAGACATTATCGTAAGGCTAATGAGGCTCACCGTCACGGTAGT
GCCAGTCCAGATATTGAAGAGGGAAAAAGGCATATCAGATATGAGGAG
mRNA sequenceShow/hide mRNA sequence
ATGTATAACGGTATTGGGTTACAGACGCCAAGAGGGTCTGGTACAAATGGTTACATTCAGACCAACAAGTTCTTTGTGAGGCCGAAGACCGGAAAGGTTGCTGAAAGCAC
CAGAGGATTCGAAGAAGACCAAGGTACTGCAGGTGTTTCCAAAAAACCTAACAAGGATATCCTTGAACATGATCGCAAGCGTCAGATTGAACTCAAGCTCGTCATACTCG
AGGACAAGCTCATCGACCAAGGTTATACAGATGAGGAGGTTTCTGAAAAGTTGAAAGAGGCTCGAAAAACATTGGAAGCTGCATCAGATCAGGAAGAAAAAGGTGGACCT
TCGGCCATAGTACTTTCAGATAAGAGGATCTCAGATACGCAGACCCACCAAATTGCTGCGAGAAAGGAGGAGCAGATGAAAACATTGAGAGCTGCTCTTGGGTTGGGCTC
ATTGGATGATAGTGAGCAACTTAAAGAAGGGATTTCTGATCCATTAGAAAATAGCAGAGAAGGTGAAAATGCTGATATTAAGCGTCGTGAGAAGTCTGAACATGCTTTCT
TGGATAGAGAATTGAACTGGAAAAAGCATGCAAAGGAAGAACACAATGATGATAAGGATAAAAAAATAAGGGTTTCAAAGGAGTCCAAAGGTCATAAGAAAGATAGGAAA
AGAAGGCCCAAGGATGATTCTTCTGATACTGATTCTGGTGGAGAACATAAGGGAACCAAGAAGAACTTGAGAGATAATCGAAGAGATGATTCTGAAAGTGACATTGACAG
CGATGTTGACAAGAAATACATCACCTCACGGAGGTATAAGAAAAACAGGAGGCATGATAGTGATGATTCTTCTGATACTGATTCTGGCGGAGAGCATAAGAGAGCCAAGA
AGAACTTGAGAGATAATAGAAGAGATGATCCTGAAAGTGACCCTGACAGCGATGTTGACAAGAAATACATCACCTCAAGGAAGCATAAGAAAAACAGAAGGCATGATAGT
GATGATTCTTCTACTGATTCTGGTAGAGACCATAAAGGAACCAAGAAAAACTTGAGAGCATATCAAAGAGATGATCATGAAAGTGATCCTGACAGTGATGTTGACAAGAA
ATACTTCACCTCAAAGAAGCAGGGGAAAAGCAAAGGACATGATAGTGATGATTCTGATTCAGTTACAGATGATGATGATTTCGGGAAGGGCAGACATAAGAAAGGATCTG
GTAGACCTAAAAGTCAAAAGGTGAAGAAGAAGCTAGGAAGCCGGAAACAGGAGTCCACTGATGAATCCAATTCTGACGTTGGGACTGATGATAAAGGCAGGCTATCGAGG
CACAAGAACCATCAGGGTAAAAGACGCAGGGCAGATAGTGATAGCTCTGACCATGACGGTTCTGATTCAGATGTAGGTCGCAACAAGAGTAAGCATAGGTACCATAGCAG
AAGTGTGGGGAAGGACAAGGTAGATAGTGAATTTGATACCGAAAAGTCAAGAAAGCATCCTAAAGAAGATGTTGGGAGACACAGGCATGATACTGATGACAAAGAAAGTG
GTGATTTTAGCTATAGCAGTGATGAAAAAGTGGAGAGGCGCAAAAGTAAGAGGTATGATACTGATGATGAATCTGATGGAGGAGGTGAACGTTTTGATAGGAAAAGTGGT
AAGATAGCCACAAAGGGGAAAATAGCCGCTAAAAAGCAATATGATGACAGTGATAGTTCTGATGATAGTCGTGCAGTTGATAGAAAAGGCCGTAATAAGCACGAGAGAGC
TAAGAAACATACGATAGGTGATGGTTCTGGTCTAGAGAAGGGATTCAAATCAAGTGGTGGAGCTCGCGAAGGAGGAAAAGGGAATTTAAATCATGCCGATGGTTTGGATG
AGCCGGTGACTGCAGATGATAACAATTCGTACAAGTCTAGGAAAGATGCTATCGATGAGTTCAACCATGCAAATCAACATACAATGAAAAGCAAGAGAAAATTTGATGAG
GGTGGTGAAAATGAGCAGCGAGAAGCAAAATCTAGAAACCGAAATTCTACAAGAGAGTTGGGTTTCTATGGAGACCTCAAGAAGGATTCCAAAATTGATTCTGGATCAAA
CAGTAGAGCAGGCAATAATAGGTATGATGAGATGAGGGATGGATGGCACAGGGAGGACCCAAAAGTTGATTCCAAATCAAATACTAGAGCACGCTATAGTAGTATGCATG
ATGAGGATGACCGCAGCAAGTTGGATCGAACAGGAAGCAAATATAATGAAGAAACAGAGCATGGAAGTAGACATTATCGTAAGGCTAATGAGGCTCACCGTCACGGTAGT
GCCAGTCCAGATATTGAAGAGGGAAAAAGGCATATCAGATATGAGGAG
Protein sequenceShow/hide protein sequence
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGP
SAIVLSDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENADIKRREKSEHAFLDRELNWKKHAKEEHNDDKDKKIRVSKESKGHKKDRK
RRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEHKRAKKNLRDNRRDDPESDPDSDVDKKYITSRKHKKNRRHDS
DDSSTDSGRDHKGTKKNLRAYQRDDHESDPDSDVDKKYFTSKKQGKSKGHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSR
HKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSYSSDEKVERRKSKRYDTDDESDGGGERFDRKSG
KIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDE
GGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVDSKSNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANEAHRHGS
ASPDIEEGKRHIRYEE