; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh12G002330 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh12G002330
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionmRNA splicing factor Cwf21 domain containing protein
Genome locationCmo_Chr12:1532338..1535162
RNA-Seq ExpressionCmoCh12G002330
SyntenyCmoCh12G002330
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR013170 - mRNA splicing factor Cwf21 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585413.1 Serine/arginine repetitive matrix protein 2, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0095.82Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
        MYNGIGLQTPRGSGTNGHIQTNKFFVRPK GKVSE+TRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE

Query:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD----
         SGSEEK+GPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSE LKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD    
Subjt:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD----

Query:  DDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNR
        DDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGK KGTKKN RDNRRNDSESDL RDVDKKYTASRK KNRRHDSDDSFDADSGGERKGTRKHLRKNR
Subjt:  DDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNR

Query:  RYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSH
        RYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGG+ KETRMN RY RRDD ESDFDSDVEKKSTTSKKQ KNRRHDSDDSNLSTDGDEFGMGSH
Subjt:  RYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSH

Query:  KKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKE
        KKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGK KVDREP+SKSSRKHPKE
Subjt:  KKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKE

Query:  DIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRES
        DIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDS DRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKF+EGGEDNQREAKSRSRKSTR+S
Subjt:  DIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRES

Query:  DFHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR
        DFHGDPKKEPESNRRTGS R D+ARDGRFRDDSKMDRKLTRTGRRF EEEEHGSTRHRKANESRRGSRTDEDIEE KRQSRYEEHRGRKHERR
Subjt:  DFHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR

KAG7020332.1 Serine/arginine repetitive matrix protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0095.53Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
        MYNGIGLQTPRGSGTNGHIQTNKFFVRPK GKVSE+TRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE

Query:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD----
         SGSE K+GPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSE LKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD    
Subjt:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD----

Query:  DDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNR
        DDEDDKKMVSKELKGH KDRKRRAKDDSSDTDSGGK KGTKKN RDNRRNDSESDL RDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNR
Subjt:  DDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNR

Query:  RYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSH
        RYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGG+ KETRMN RY RRDD ESDFDSDVEKKSTTSKKQ KNRRHDSDDSNLSTDGDEFGMGSH
Subjt:  RYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSH

Query:  KKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKE
        KKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGK KVDREP+SKSSRKHPKE
Subjt:  KKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKE

Query:  DIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRES
        DIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDS DRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKF+EGGEDNQ+EAKSRSRKSTR+S
Subjt:  DIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRES

Query:  DFHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR
        DFHGDPKKEPESNRRTGS R D+ARDGRFRDDSKMDRKLTRTGRRF EEEEHGSTRHRKANESRRGSRTDEDIEE KRQSRYEEHRGRKHERR
Subjt:  DFHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR

XP_022951424.1 serine/arginine repetitive matrix protein 2-like isoform X1 [Cucurbita moschata]0.0e+00100Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
        MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE

Query:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDED
        ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDED
Subjt:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDED

Query:  DKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDL
        DKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDL
Subjt:  DKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDL

Query:  EGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGS
        EGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGS
Subjt:  EGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGS

Query:  GRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGR
        GRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGR
Subjt:  GRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGR

Query:  RRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHG
        RRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHG
Subjt:  RRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHG

Query:  DPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR
        DPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR
Subjt:  DPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR

XP_022951426.1 dentin sialophosphoprotein-like isoform X2 [Cucurbita moschata]5.3e-292100Show/hide
Query:  MKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKG
        MKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKG
Subjt:  MKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKG

Query:  TKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSG
        TKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSG
Subjt:  TKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSG

Query:  GKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQ
        GKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQ
Subjt:  GKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQ

Query:  LKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDD
        LKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDD
Subjt:  LKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDD

Query:  RKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKL
        RKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKL
Subjt:  RKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKL

Query:  TRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR
        TRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR
Subjt:  TRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR

XP_023538168.1 serine/arginine repetitive matrix protein 2-like isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0095.95Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
        MYNGIGLQTPRGSGTNGHIQTNKFFVRPK GKVSE+TRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE

Query:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD---D
        ASGSEEK GPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSE LKEGISDSSRSGREGQDADTKRREK EHSFLDRELNWKKHAVD   D
Subjt:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD---D

Query:  DEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRR
        DEDDKK VSKELKGHQKDRKRRAKDDSSDTDSGGK KGTKKN RDNRRNDSESDLDRDVDKKYT SRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRR
Subjt:  DEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRR

Query:  YDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHK
        YDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGG+ KET+MN RY RRDD ESDFDSDVEKKSTTSKKQ KNRRHDSDDSNLST GDEFGMGSHK
Subjt:  YDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHK

Query:  KGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKED
        K SGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGK KVDREP+SKSSRKHPKED
Subjt:  KGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKED

Query:  IGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESD
        IGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYK SQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESD
Subjt:  IGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESD

Query:  FHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR
        FHGDPKKEPESNRRTGSRR DEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANE RRGSRTDE IEE KRQSRYEEHRGRKHERR
Subjt:  FHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR

TrEMBL top hitse value%identityAlignment
A0A6J1BPI2 protein starmaker1.5e-22362.56Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
        MYNGIGLQTPRGSGTNG+IQTNKFFVRPK GKV+E TRGFEEDQGTAGVSKKPNKDILEHDRKRQI+LKL ILEDKLIDQGYT +E+SEKLKE RK LE 
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE

Query:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD---D
        AS  EEK GPSAIV+ DKR+S TQTHQIAARKEEQMKTLR+ALGLGS DDSE LKEGISD   + REG+++D KRREK+EH+FLDRELNWKKHA +   D
Subjt:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD---D

Query:  DEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRK-TKNRRHDSDDSFDADSGGERKGTRKHLRKNR
        D+D K  VSKE KGH+KDRKRR KDDSSDTDSGG+HKGTKKN+RDNRR+DSESD+D DVDKKY  SR+  KNRRHDSDDS D DSGGE K  +K+LR NR
Subjt:  DEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRK-TKNRRHDSDDSFDADSGGERKGTRKHLRKNR

Query:  RYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSH
        R D E D DSD D+K+IT RKHKKNR+H SDDSS TDSG  HK T+ N R  +RDD ESD DSDV+KK  TSKKQGK++RHDSDDS+  TD D+FG G H
Subjt:  RYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSH

Query:  KKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKE
        KKGSGRPKS+KVKKK  SRKQESTDESNSD G D K R  +HKN  GK    DSDSSDHD S SD GR+++KHRY S S GK KVD E  ++ SRKHPKE
Subjt:  KKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKE

Query:  DIGRRRHDTDDDESG-----------------------------------GKTVAKEKMAAAKRKYDDSDDSD-----DRKYHGKHKRAKKHSSSDDSDL
        D+GR RHDTDD ESG                                   GK   K K+ AAK++YDDSD SD     DRK   KH+RAKKH+  D S L
Subjt:  DIGRRRHDTDDDESG-----------------------------------GKTVAKEKMAAAKRKYDDSDDSD-----DRKYHGKHKRAKKHSSSDDSDL

Query:  E-------------------------------NNLYKS-----------SQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHGDPKKEPE----SNR
        E                               NN YKS           +QHTMKSKRKFDEGGE+ QREAKSR+R STRE  F+GD KK+ +    SN 
Subjt:  E-------------------------------NNLYKS-----------SQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHGDPKKEPE----SNR

Query:  RTGSRRCDEARDGRFRDDSKMD------------------RKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHER
        R G+ R DE RDG  R+D K+D                   KL RTG ++ EE EHGS  +RKANES R      DIEE KR  RYEEHRGRKHER
Subjt:  RTGSRRCDEARDGRFRDDSKMD------------------RKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHER

A0A6J1ESM6 dentin sialophosphoprotein-like1.4e-21356.59Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
        MYNGIGLQTPRGSGTNG+IQTNKFFVRPK GKV+E+TRGF+EDQGTAGVSKKPNKDILEHDRKRQI+LKL ILEDKL DQGYT DEIS+KLKE R+ LE 
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE

Query:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD---D
        ASGSEEKDGPSAIV+ADK+VS TQ+HQIAARKEEQMKTLR+ALGL SS+DSE + EGISD +R+ REGQ+AD KR EK+EHSFLDRELNWKKH  +   D
Subjt:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD---D

Query:  DEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGK-HKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKT-KNRRHDSDDSFDADSGGERKGTRKHLR--
        D+ DKK VSKELKGH KDR RR KDDSSD DS G+ HKGTKKN+RDNRRNDSESD + D D KY  SRK+ KNRRHDSD S D DSGGERKGT+KHLR  
Subjt:  DEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGK-HKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKT-KNRRHDSDDSFDADSGGERKGTRKHLR--

Query:  -----------------------------------------------------------------------------KNRRYDLEGDQDSDA--------
                                                                                     KNRR+D +   D+D+        
Subjt:  -----------------------------------------------------------------------------KNRRYDLEGDQDSDA--------

Query:  -------------------DQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGD
                           DQKHIT RKHKKNR+H SD SS TDSGG+HKET+ + +  RR D ESD DSD++KK TTSKKQ KN+   SDDS+  +D  
Subjt:  -------------------DQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGD

Query:  EFGMGSHKKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKS
        EFGMGSH+KGSGR KS+KV KKQR RKQESTDESNSDSGID K RQLKHKNQHGK YGVDSDSSD D+S SD GR+++KHRY+S   GK +VD E  S+ 
Subjt:  EFGMGSHKKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKS

Query:  SRKHPKEDIGRRRHDTDDDESG-------------------------------GKT--VAKEKMAAAKRKYDDSDDSD-----DRKYHGKHKRAKKHSSS
         RKHPK+D+GRRRHDTD+DESG                               GK+  +A +   AAKRK+DDSD SD     DRK + K KRAKKHSS 
Subjt:  SRKHPKEDIGRRRHDTDDDESG-------------------------------GKT--VAKEKMAAAKRKYDDSDDSD-----DRKYHGKHKRAKKHSSS

Query:  DDSDLE-------------------------------NNLYKS-----------SQHTMKSKRKFDEGGEDNQR-EAKSRSRKSTRESDFHGDPKK----
        D SD +                               N  YKS           +Q TMKSKRK DEGGED Q+ EAKSRSR STRESDFHGDPKK    
Subjt:  DDSDLE-------------------------------NNLYKS-----------SQHTMKSKRKFDEGGEDNQR-EAKSRSRKSTRESDFHGDPKK----

Query:  EPESNRRTGSRRCDEARDGRFRDDSKM-----------------DRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKR--QSRYEEHRGRK
        + ES+RR  S R +E RDGR+R+D K+                 DRK TRTG R+TEE EHGS  + KANES   SRTD+DIEE KR   SRYEEHRGRK
Subjt:  EPESNRRTGSRRCDEARDGRFRDDSKM-----------------DRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKR--QSRYEEHRGRK

Query:  HER
        HER
Subjt:  HER

A0A6J1GHK0 dentin sialophosphoprotein-like isoform X22.5e-292100Show/hide
Query:  MKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKG
        MKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKG
Subjt:  MKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKG

Query:  TKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSG
        TKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSG
Subjt:  TKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSG

Query:  GKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQ
        GKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQ
Subjt:  GKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQ

Query:  LKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDD
        LKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDD
Subjt:  LKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDD

Query:  RKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKL
        RKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKL
Subjt:  RKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKL

Query:  TRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR
        TRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR
Subjt:  TRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR

A0A6J1GIP9 serine/arginine repetitive matrix protein 2-like isoform X10.0e+00100Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
        MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE

Query:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDED
        ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDED
Subjt:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDED

Query:  DKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDL
        DKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDL
Subjt:  DKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDL

Query:  EGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGS
        EGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGS
Subjt:  EGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGS

Query:  GRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGR
        GRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGR
Subjt:  GRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGR

Query:  RRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHG
        RRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHG
Subjt:  RRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSSSDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHG

Query:  DPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR
        DPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR
Subjt:  DPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQSRYEEHRGRKHERR

A0A6J1K7B6 dentin sialophosphoprotein-like2.2e-21156.21Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
        MYNGIGLQTPRGSGTNG+IQTNKFFVRPK GKV+E+TRGF+EDQGTAGVSKKPNKDILEHDRKRQI+LKL ILEDKL DQGYT DEIS+KLKE R+ LE 
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE

Query:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD---D
        ASGSEEKDGPSAIV+ADK+VS TQ+HQIAARKEEQMKTLR+ALGL SS+DSE + EGISD +R+ REGQ+AD KR+EK+EHSFLDRELNWK+H  +   D
Subjt:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVD---D

Query:  DEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGK-HKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKT-KNRRHDSDDSFDADSGGERKGTRKHLR--
        D+ DKK VSKELKGH KDR RR KDDSSD DS G+ HKGTKKN+RDNRR DSESD + D D KY  SRK+ KNRRHDSD S D DSGGERKGT+KHLR  
Subjt:  DEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGK-HKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKT-KNRRHDSDDSFDADSGGERKGTRKHLR--

Query:  -----------------------------------------------------------------------------KNRRYDLEGDQDSDA--------
                                                                                     KNRR+D +   D+D+        
Subjt:  -----------------------------------------------------------------------------KNRRYDLEGDQDSDA--------

Query:  -------------------DQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGD
                           DQKHIT  KHKKNR+H SD SS TDSGG+HKET+ + +  RR D ESD DSD++KK TTSKKQ KN+  DSDDS+  +D  
Subjt:  -------------------DQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGD

Query:  EFGMGSHKKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKS
        EFGMGSH+KGSGRPKS+KV KKQRSRKQESTDESNSDSGID K RQLK+KNQHGK YGVDSDSSD D+S SD GR+++KHRY S  TGK +VD E  S+ 
Subjt:  EFGMGSHKKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKS

Query:  SRKHPKEDIGRRRHDTDDDESG------------------------------GKT--VAKEKMAAAKRKYDDSDDSD-----DRKYHGKHKRAKKHSSSD
         RKHPK+D+GRRRHDTD+DESG                              GK+  +A +   AAKRK++DSD SD     DR+ + K KRAKKHS  D
Subjt:  SRKHPKEDIGRRRHDTDDDESG------------------------------GKT--VAKEKMAAAKRKYDDSDDSD-----DRKYHGKHKRAKKHSSSD

Query:  DSDLE-------------------------------NNLYKS-----------SQHTMKSKRKFDEGGEDNQR-EAKSRSRKSTRESDFHGDPKK----E
         SD +                               N  YKS           +Q TMKSKRK DEGGED Q+ EAKS+SR STRESDFHGDPKK    +
Subjt:  DSDLE-------------------------------NNLYKS-----------SQHTMKSKRKFDEGGEDNQR-EAKSRSRKSTRESDFHGDPKK----E

Query:  PESNRRTGSRRCDEARDGRFRDDSKM-----------------DRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQ--SRYEEHRGRKH
         ES+RR  S R  E RDGR+R+D K+                 DRK  RTG R+TEE EHGS  + KANES   SRTD+DIEE KRQ  SRYEEHRGRKH
Subjt:  PESNRRTGSRRCDEARDGRFRDDSKM-----------------DRKLTRTGRRFTEEEEHGSTRHRKANESRRGSRTDEDIEEEKRQ--SRYEEHRGRKH

Query:  ER
        ER
Subjt:  ER

SwissProt top hitse value%identityAlignment
P0CM94 Pre-mRNA-splicing factor CWC214.2e-1029.07Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSK-----KPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVR
        MY  +GL T RGSGTNG++  N   +R + G       G   D     VSK      P++ ILEH+RKR++++K+  L D+L ++G   D+I E+  ++R
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSK-----KPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVR

Query:  KNLEEASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAV
        + L   +   E+ G   +           TH +AA KE +M  L+ ALG+     S   +EG +   R   E + A   +RE+ E   ++  +  ++   
Subjt:  KNLEEASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAV

Query:  DDDEDDKKMVSKELKGHQKDRKRRAKD
        ++++  ++   KE    +++ KRR +D
Subjt:  DDDEDDKKMVSKELKGHQKDRKRRAKD

P0CM95 Pre-mRNA-splicing factor CWC214.2e-1029.07Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSK-----KPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVR
        MY  +GL T RGSGTNG++  N   +R + G       G   D     VSK      P++ ILEH+RKR++++K+  L D+L ++G   D+I E+  ++R
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSK-----KPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVR

Query:  KNLEEASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAV
        + L   +   E+ G   +           TH +AA KE +M  L+ ALG+     S   +EG +   R   E + A   +RE+ E   ++  +  ++   
Subjt:  KNLEEASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAV

Query:  DDDEDDKKMVSKELKGHQKDRKRRAKD
        ++++  ++   KE    +++ KRR +D
Subjt:  DDDEDDKKMVSKELKGHQKDRKRRAKD

Q4IB70 Pre-mRNA-splicing factor CWC215.0e-1131.49Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPK--AGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVR-KN
        M + +GL TPRGSGT+G++Q N   ++P+       +D       Q      ++P+K ILEHDRKR++++K+  L DKL ++    DEI ++  E+R K 
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPK--AGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVR-KN

Query:  LEEASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREK
        L E +      GP       K     Q H++A  K ++ + LR AL + +  +     +   +  RS  E +D D + R K
Subjt:  LEEASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREK

Q6C0M9 Pre-mRNA-splicing factor CWC211.4e-0827.04Show/hide
Query:  YNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNK--------------DILEHDRKRQIDLKLAILEDKLIDQGYTA-DE
        YNGIGL TPRGS T+GHIQTN   +  +A + +     F   + T    K+ NK              ++LEH+RKR++++    L+DKL ++G    +E
Subjt:  YNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNK--------------DILEHDRKRQIDLKLAILEDKLIDQGYTA-DE

Query:  ISEKLKEVRKNLEEASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKR-REKTEHSFLD
        I E++  +R+ L     + E D  + IV AD+       H+ A  K+++M  +R A     + D        +   ++  +    DT+R R   E S   
Subjt:  ISEKLKEVRKNLEEASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKR-REKTEHSFLD

Query:  RELNWKKHAVDDDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRD
        R    ++++V +D    +++S+        R+  ++  S   + G + K  ++  R   R  SE    RD
Subjt:  RELNWKKHAVDDDEDDKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRD

Q7RYH7 Pre-mRNA-splicing factor cwc-215.2e-0825.14Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE
        M + +GL TPRGSGT+G++Q N    RP+    S   + F+         ++P+K +LEHDRKR++++K+  L DKL ++G   DEI  +  E+R+ L  
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEE

Query:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDED
        A     ++   A     K + + Q H++A  K ++ + LR AL +                SR  +EG  +  K++E+     L+RE N    ++     
Subjt:  ASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDED

Query:  DKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDL
                 +G  +DR R       D D G         +    R     D DR    +    R  +  R    DS+   +G +R  +R  +R+  R   
Subjt:  DKKMVSKELKGHQKDRKRRAKDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDL

Query:  ----EGDQDSDADQKHITLRKHKKNRKHGSDDS----SGTDSGGKHKETRMNRRYTRRDDRE
             G   S   ++ ++ R   ++R +    S       DS  +      +R Y+R  DR+
Subjt:  ----EGDQDSDADQKHITLRKHKKNRKHGSDDS----SGTDSGGKHKETRMNRRYTRRDDRE

Arabidopsis top hitse value%identityAlignment
AT3G49601.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: mRNA splicing factor, Cwf21 (InterPro:IPR013170); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).1.3e-5135.02Show/hide
Query:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPK-AGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLE
        MYNGIGLQT RGSGTNG++QTNKFFVRP+  GK  +  +GFE+D+GTAG+SKKPNK ILEHDRKRQI LKLAILEDKL DQGY+  EI++KL+E R +LE
Subjt:  MYNGIGLQTPRGSGTNGHIQTNKFFVRPK-AGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLE

Query:  EASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSS--RSGREGQDADTKRREKTEHSFLDRELNWKKHAVDD
         A+ + E++       +D +VS TQTHQ+AARKE+QM+  R+ALGL   D  +  +EGI D    R G EG     + +E+ EHSFLDR+   KK   D 
Subjt:  EASGSEEKDGPSAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSS--RSGREGQDADTKRREKTEHSFLDRELNWKKHAVDD

Query:  DEDDKKM-VSKELKG------------HQKDRKRRAKDDSSDTDSGG---------KHKGTKKNMR-DNRRNDSESDLDRDVDKK-------YTASRKTK
        DE D K+  SK+ +G             +K+ K+R  DDSS++D  G         K KG K+    D+  +DSESD D D  KK        T  ++++
Subjt:  DEDDKKM-VSKELKG------------HQKDRKRRAKDDSSDTDSGG---------KHKGTKKNMR-DNRRNDSESDLDRDVDKK-------YTASRKTK

Query:  NRRHDSDDSFDADSGGERKGTRKHLRKNRRYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTT
         +R  S +S + +S   +K     LRK+ +  L  ++    + +     + +  RK    D S  +S    +  R      R   ++   D DVE     
Subjt:  NRRHDSDDSFDADSGGERKGTRKHLRKNRRYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSSGTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTT

Query:  SK--KQGKNRRHDSDDS--------NLSTDGDEFGMG-SHKKGSGRPKSRKVKKKQRS---RKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSS
         +  +  K    DSDDS         L +  + +  G S K+      ++  K K RS    K+ + D  +S++  +++ +      Q G+ +  + D  
Subjt:  SK--KQGKNRRHDSDDS--------NLSTDGDEFGMG-SHKKGSGRPKSRKVKKKQRS---RKQESTDESNSDSGIDHKDRQLKHKNQHGKGYGVDSDSS

Query:  DHDNSGSDFGR-----------DENKHRYRSNSTGKPKVDREPKSKSSRKHPKED-------------IGRRRHDTDDDESGGKTVAKEKMAAAKRKYDD
        D+DN G D  R            E+  RYR  +  +   D   + +  R+  K+D              GRR    +DD+    +  +E  +  + +YDD
Subjt:  DHDNSGSDFGR-----------DENKHRYRSNSTGKPKVDREPKSKSSRKHPKED-------------IGRRRHDTDDDESGGKTVAKEKMAAAKRKYDD

Query:  SDDSDDRKYHG
        S  S  R  HG
Subjt:  SDDSDDRKYHG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATAACGGTATTGGATTACAGACGCCGAGAGGCTCTGGCACTAATGGCCACATTCAGACGAACAAGTTCTTCGTGAGGCCGAAGGCTGGAAAGGTTTCTGAAGACAC
CAGAGGATTCGAAGAAGATCAGGGCACTGCCGGTGTTTCCAAGAAACCTAATAAAGACATTCTCGAACATGATCGCAAGCGTCAGATTGATCTCAAGCTTGCCATACTTG
AGGACAAGCTCATTGATCAAGGTTATACGGCCGATGAAATTTCTGAAAAGTTGAAGGAGGTTCGCAAGAATCTGGAGGAGGCTTCAGGTTCTGAGGAAAAAGATGGGCCT
TCTGCCATCGTAATTGCAGATAAGAGGGTATCAGTTACACAGACTCACCAAATTGCTGCGAGAAAGGAGGAGCAGATGAAAACATTGAGATCTGCTCTTGGGTTGGGTTC
TTCGGACGATTCGGAAACGCTCAAGGAAGGGATTTCTGATTCATCTAGAAGTGGAAGAGAGGGTCAAGATGCTGATACTAAGCGTCGTGAGAAGACGGAACATTCTTTTT
TGGACAGAGAATTGAACTGGAAAAAGCATGCCGTTGACGACGATGAAGATGACAAAAAAATGGTTTCGAAAGAGTTGAAAGGTCATCAGAAGGATAGAAAACGAAGGGCT
AAGGATGATTCTTCTGATACTGATTCTGGTGGAAAGCATAAGGGAACCAAGAAGAACATGAGAGATAATAGAAGGAATGATTCTGAAAGCGACCTTGACAGAGATGTTGA
CAAGAAGTACACCGCCTCAAGAAAGACGAAAAATAGAAGGCATGATAGTGATGATTCTTTCGATGCCGATTCTGGTGGAGAACGCAAGGGAACCAGGAAGCACCTGAGAA
AAAACCGAAGATATGATCTCGAAGGTGACCAGGACAGTGATGCTGACCAGAAACATATCACTTTAAGGAAGCATAAGAAAAACAGAAAGCACGGTAGTGATGATTCTTCC
GGTACTGATTCCGGTGGAAAGCACAAGGAAACCCGGATGAACAGGAGATATACTCGAAGAGATGATCGTGAAAGTGATTTCGACAGTGATGTTGAGAAGAAATCCACCAC
CTCAAAGAAGCAGGGGAAAAACAGAAGGCATGATAGTGATGATTCTAATTTATCTACAGATGGTGATGAGTTTGGTATGGGTAGCCACAAGAAAGGCTCTGGTAGACCTA
AAAGTCGAAAGGTCAAGAAGAAGCAAAGAAGCCGAAAACAGGAGTCGACTGATGAATCCAATTCCGACAGTGGGATTGATCACAAAGACAGGCAACTAAAGCACAAGAAC
CAGCATGGTAAAGGATATGGAGTAGATAGTGACAGCTCTGACCACGACAATTCTGGTTCCGATTTTGGTCGTGACGAGAATAAGCATAGGTATCGTAGCAATAGTACAGG
AAAACCCAAGGTAGACAGGGAACCCAAATCCAAGAGTTCAAGAAAGCATCCTAAGGAAGACATTGGGAGACGCAGACACGATACCGATGACGATGAAAGTGGTGGTAAGA
CAGTCGCAAAGGAAAAAATGGCTGCGGCTAAAAGGAAATATGATGACAGTGATGATTCAGATGATAGAAAGTACCATGGTAAACACAAGAGAGCTAAGAAACATTCTTCC
AGTGATGATTCTGATCTAGAGAATAATTTGTACAAATCTAGTCAGCATACGATGAAAAGCAAGAGAAAGTTCGATGAAGGTGGTGAAGATAACCAGCGAGAAGCGAAGTC
TAGAAGTCGAAAATCTACACGAGAGTCGGATTTCCATGGGGACCCCAAGAAAGAACCTGAATCAAACAGAAGAACTGGCAGTCGTCGGTGCGACGAGGCAAGGGATGGAC
GGTTCAGGGACGACTCCAAAATGGATAGAAAGTTGACTCGAACAGGAAGGAGATTTACAGAAGAAGAAGAGCATGGAAGTACTCGTCATCGGAAAGCTAACGAGTCTCGC
CGGGGCAGTAGGACTGATGAAGATATTGAAGAGGAAAAAAGGCAGAGCAGATATGAGGAGCATAGAGGGAGAAAACATGAAAGAAGGTAA
mRNA sequenceShow/hide mRNA sequence
GCCTAAAACTGTCATCCTCATTGGAGTTGTCTATAAATACTCTCCCACCCGTTGGCTCTTGTAATCGAAACTGTAGCAAAGGAAACCAGACAGGACGGACCATCCGAACA
CGACCTGAGTCGGTTCCCAGTACCAAATCGACTTACGATTTTCAGGGTTTAGAGGCTTCGTCGTTCCGTTCGATCCTGATTTCTTCCATCATATGTGGAATGGGAGAGGA
CAGACGATAAGCACTCGAAACACCTGATTGGTGAAGCAGGGAAATGTATAACGGTATTGGATTACAGACGCCGAGAGGCTCTGGCACTAATGGCCACATTCAGACGAACA
AGTTCTTCGTGAGGCCGAAGGCTGGAAAGGTTTCTGAAGACACCAGAGGATTCGAAGAAGATCAGGGCACTGCCGGTGTTTCCAAGAAACCTAATAAAGACATTCTCGAA
CATGATCGCAAGCGTCAGATTGATCTCAAGCTTGCCATACTTGAGGACAAGCTCATTGATCAAGGTTATACGGCCGATGAAATTTCTGAAAAGTTGAAGGAGGTTCGCAA
GAATCTGGAGGAGGCTTCAGGTTCTGAGGAAAAAGATGGGCCTTCTGCCATCGTAATTGCAGATAAGAGGGTATCAGTTACACAGACTCACCAAATTGCTGCGAGAAAGG
AGGAGCAGATGAAAACATTGAGATCTGCTCTTGGGTTGGGTTCTTCGGACGATTCGGAAACGCTCAAGGAAGGGATTTCTGATTCATCTAGAAGTGGAAGAGAGGGTCAA
GATGCTGATACTAAGCGTCGTGAGAAGACGGAACATTCTTTTTTGGACAGAGAATTGAACTGGAAAAAGCATGCCGTTGACGACGATGAAGATGACAAAAAAATGGTTTC
GAAAGAGTTGAAAGGTCATCAGAAGGATAGAAAACGAAGGGCTAAGGATGATTCTTCTGATACTGATTCTGGTGGAAAGCATAAGGGAACCAAGAAGAACATGAGAGATA
ATAGAAGGAATGATTCTGAAAGCGACCTTGACAGAGATGTTGACAAGAAGTACACCGCCTCAAGAAAGACGAAAAATAGAAGGCATGATAGTGATGATTCTTTCGATGCC
GATTCTGGTGGAGAACGCAAGGGAACCAGGAAGCACCTGAGAAAAAACCGAAGATATGATCTCGAAGGTGACCAGGACAGTGATGCTGACCAGAAACATATCACTTTAAG
GAAGCATAAGAAAAACAGAAAGCACGGTAGTGATGATTCTTCCGGTACTGATTCCGGTGGAAAGCACAAGGAAACCCGGATGAACAGGAGATATACTCGAAGAGATGATC
GTGAAAGTGATTTCGACAGTGATGTTGAGAAGAAATCCACCACCTCAAAGAAGCAGGGGAAAAACAGAAGGCATGATAGTGATGATTCTAATTTATCTACAGATGGTGAT
GAGTTTGGTATGGGTAGCCACAAGAAAGGCTCTGGTAGACCTAAAAGTCGAAAGGTCAAGAAGAAGCAAAGAAGCCGAAAACAGGAGTCGACTGATGAATCCAATTCCGA
CAGTGGGATTGATCACAAAGACAGGCAACTAAAGCACAAGAACCAGCATGGTAAAGGATATGGAGTAGATAGTGACAGCTCTGACCACGACAATTCTGGTTCCGATTTTG
GTCGTGACGAGAATAAGCATAGGTATCGTAGCAATAGTACAGGAAAACCCAAGGTAGACAGGGAACCCAAATCCAAGAGTTCAAGAAAGCATCCTAAGGAAGACATTGGG
AGACGCAGACACGATACCGATGACGATGAAAGTGGTGGTAAGACAGTCGCAAAGGAAAAAATGGCTGCGGCTAAAAGGAAATATGATGACAGTGATGATTCAGATGATAG
AAAGTACCATGGTAAACACAAGAGAGCTAAGAAACATTCTTCCAGTGATGATTCTGATCTAGAGAATAATTTGTACAAATCTAGTCAGCATACGATGAAAAGCAAGAGAA
AGTTCGATGAAGGTGGTGAAGATAACCAGCGAGAAGCGAAGTCTAGAAGTCGAAAATCTACACGAGAGTCGGATTTCCATGGGGACCCCAAGAAAGAACCTGAATCAAAC
AGAAGAACTGGCAGTCGTCGGTGCGACGAGGCAAGGGATGGACGGTTCAGGGACGACTCCAAAATGGATAGAAAGTTGACTCGAACAGGAAGGAGATTTACAGAAGAAGA
AGAGCATGGAAGTACTCGTCATCGGAAAGCTAACGAGTCTCGCCGGGGCAGTAGGACTGATGAAGATATTGAAGAGGAAAAAAGGCAGAGCAGATATGAGGAGCATAGAG
GGAGAAAACATGAAAGAAGGTAACGGTTCCATGTTCTTCGTAGTTTTGGCTTATTGTCTGTATAAACTCTCTCTACAACTTGAATCATGATGTTTCTACTTTTATGTAAT
GAAACAACTTCATCTCTTAGGTTGAAATTATATGCTTTAACTAGT
Protein sequenceShow/hide protein sequence
MYNGIGLQTPRGSGTNGHIQTNKFFVRPKAGKVSEDTRGFEEDQGTAGVSKKPNKDILEHDRKRQIDLKLAILEDKLIDQGYTADEISEKLKEVRKNLEEASGSEEKDGP
SAIVIADKRVSVTQTHQIAARKEEQMKTLRSALGLGSSDDSETLKEGISDSSRSGREGQDADTKRREKTEHSFLDRELNWKKHAVDDDEDDKKMVSKELKGHQKDRKRRA
KDDSSDTDSGGKHKGTKKNMRDNRRNDSESDLDRDVDKKYTASRKTKNRRHDSDDSFDADSGGERKGTRKHLRKNRRYDLEGDQDSDADQKHITLRKHKKNRKHGSDDSS
GTDSGGKHKETRMNRRYTRRDDRESDFDSDVEKKSTTSKKQGKNRRHDSDDSNLSTDGDEFGMGSHKKGSGRPKSRKVKKKQRSRKQESTDESNSDSGIDHKDRQLKHKN
QHGKGYGVDSDSSDHDNSGSDFGRDENKHRYRSNSTGKPKVDREPKSKSSRKHPKEDIGRRRHDTDDDESGGKTVAKEKMAAAKRKYDDSDDSDDRKYHGKHKRAKKHSS
SDDSDLENNLYKSSQHTMKSKRKFDEGGEDNQREAKSRSRKSTRESDFHGDPKKEPESNRRTGSRRCDEARDGRFRDDSKMDRKLTRTGRRFTEEEEHGSTRHRKANESR
RGSRTDEDIEEEKRQSRYEEHRGRKHERR