CuGenDBv2

Sgr030458 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr030458
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: CLK4-associating serine/arginine rich protein isoform X1
Genome location: tig00153654:1483544..1491973
RNA-Seq Expression: Sgr030458
Synteny: Sgr030458
Gene Ontology terms: NA
InterPro domains: IPR019147 - Suppressor of white apricot, N-terminal domain
                  IPR040397 - Suppressor of white apricot


Homology
GenBank top hits | e value | % identity
KAG6605238.1 CLK4-associating serine/arginine rich protein, partial [Cucurbita argyrosperma subsp. sororia] | 9.7e-252 | 96.39
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQ++MIDRFDGRALLDFIREPGSRHIR Q
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
        EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEME+KVSAPFVSDRSQ PQPP AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED

Query:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY
        EDEE FNSDDSNDEGMEV+AKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAAR+TGTRVLHHDPYRESRRSPTY
Subjt:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY

Query:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS
        DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKP+TPKIEFITEFGG GESKEPRLEGLSPP SPPSQPDMLNRPSSGRILEALHIDPASG+TLD EKS
Subjt:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS

Query:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR
        SRAVKPS STSSALAKLTKASSSGGPLK GEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRE ERQRLEKLAETSRLNS RRRSRSRSYSRSPRR
Subjt:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR

KAG7035202.1 CLK4-associating serine/arginine rich protein [Cucurbita argyrosperma subsp. argyrosperma] | 1.4e-261 | 93.54
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQ++MIDRFDGRALLDFIREPGSRHIR Q
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
        EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEME+KVSAPFVSDRSQ PQPP AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED

Query:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY
        EDEE FNSDDSNDEGMEV+AKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAAR+TGTRVLHHDPYRESRRSPTY
Subjt:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY

Query:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS
        DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKP+TPKIEFITEFGG GESKEPRLEGLSPP SPPSQPDMLNRPSSGRILEALHIDPASG+TLD EKS
Subjt:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS

Query:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRRVG
        SRAVKPS STSSALAKLTKASSSGGPLK GEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRE ERQRLEKLAETSRLNS RRRSRSRSYS+    VG
Subjt:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRRVG

Query:  VEAEVHGDIILILGLALDLVLGLNLEAILAHIPVRLLTLAHQ
        VE EV GDI  ILGL L LVL L LEAILAHIPV LL   HQ
Subjt:  VEAEVHGDIILILGLALDLVLGLNLEAILAHIPVRLLTLAHQ

XP_022947250.1 CLK4-associating serine/arginine rich protein isoform X1 [Cucurbita moschata] | 9.7e-252 | 96.39
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQ++MIDRFDGRALLDFIREPGSRHIR Q
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
        EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEME+KVSAPFVSDRSQ PQPP AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED

Query:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY
        EDEE FNSDDSNDEGMEV+AKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAAR+TGTRVLHHDPYRESRRSPTY
Subjt:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY

Query:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS
        DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKP+TPKIEFITEFGG GESKEPRLEGLSPP SPPSQPDMLNRPSSGRILEALHIDPASG+TLD EKS
Subjt:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS

Query:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR
        SRAVKPS STSSALAKLTKASSSGGPLK GEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRE ERQRLEKLAETSRLNS RRRSRSRSYSRSPRR
Subjt:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR

XP_022947251.1 CLK4-associating serine/arginine rich protein isoform X2 [Cucurbita moschata] | 9.7e-252 | 96.39
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQ++MIDRFDGRALLDFIREPGSRHIR Q
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
        EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEME+KVSAPFVSDRSQ PQPP AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED

Query:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY
        EDEE FNSDDSNDEGMEV+AKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAAR+TGTRVLHHDPYRESRRSPTY
Subjt:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY

Query:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS
        DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKP+TPKIEFITEFGG GESKEPRLEGLSPP SPPSQPDMLNRPSSGRILEALHIDPASG+TLD EKS
Subjt:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS

Query:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR
        SRAVKPS STSSALAKLTKASSSGGPLK GEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRE ERQRLEKLAETSRLNS RRRSRSRSYSRSPRR
Subjt:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR

XP_023521265.1 CLK4-associating serine/arginine rich protein-like [Cucurbita pepo subsp. pepo] | 2.0e-252 | 96.59
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQ++MIDRFDGRALLDFIREPGSRHIR Q
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPPAAS-KGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
        EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEME+KVSAPFVSDRSQQPQPPAAS KGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPPAAS-KGSYSQVGFSYEADGKEESNSDADDNASDDEDDED

Query:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY
        EDEE FNSDDSNDEGMEV+AKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAAR+TGTRVLHHDPYRESRRSPTY
Subjt:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY

Query:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS
        DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKP+TPKIEFITEFGG GESKEPRLEGLSPP SPPSQPDMLNRPSSGRILEALHIDPASG+TLD EKS
Subjt:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS

Query:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR
        SRAVKPS STSSALAKLTKASSSGGPLK GEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRE ERQRLEKLAETSRLNS RRRSRSRSYSRSPRR
Subjt:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR

TrEMBL top hits | e value | % identity
A0A6J1G5W5 CLK4-associating serine/arginine rich protein isoform X2 | 4.7e-252 | 96.39
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQ++MIDRFDGRALLDFIREPGSRHIR Q
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
        EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEME+KVSAPFVSDRSQ PQPP AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED

Query:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY
        EDEE FNSDDSNDEGMEV+AKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAAR+TGTRVLHHDPYRESRRSPTY
Subjt:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY

Query:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS
        DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKP+TPKIEFITEFGG GESKEPRLEGLSPP SPPSQPDMLNRPSSGRILEALHIDPASG+TLD EKS
Subjt:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS

Query:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR
        SRAVKPS STSSALAKLTKASSSGGPLK GEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRE ERQRLEKLAETSRLNS RRRSRSRSYSRSPRR
Subjt:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR

A0A6J1G5Y2 CLK4-associating serine/arginine rich protein isoform X3 | 4.7e-252 | 96.39
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQ++MIDRFDGRALLDFIREPGSRHIR Q
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
        EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEME+KVSAPFVSDRSQ PQPP AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED

Query:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY
        EDEE FNSDDSNDEGMEV+AKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAAR+TGTRVLHHDPYRESRRSPTY
Subjt:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY

Query:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS
        DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKP+TPKIEFITEFGG GESKEPRLEGLSPP SPPSQPDMLNRPSSGRILEALHIDPASG+TLD EKS
Subjt:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS

Query:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR
        SRAVKPS STSSALAKLTKASSSGGPLK GEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRE ERQRLEKLAETSRLNS RRRSRSRSYSRSPRR
Subjt:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR

A0A6J1G6C5 CLK4-associating serine/arginine rich protein isoform X1 | 4.7e-252 | 96.39
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQ++MIDRFDGRALLDFIREPGSRHIR Q
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
        EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEME+KVSAPFVSDRSQ PQPP AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED

Query:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY
        EDEE FNSDDSNDEGMEV+AKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAAR+TGTRVLHHDPYRESRRSPTY
Subjt:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY

Query:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS
        DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKP+TPKIEFITEFGG GESKEPRLEGLSPP SPPSQPDMLNRPSSGRILEALHIDPASG+TLD EKS
Subjt:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS

Query:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR
        SRAVKPS STSSALAKLTKASSSGGPLK GEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRE ERQRLEKLAETSRLNS RRRSRSRSYSRSPRR
Subjt:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR

A0A6J1KZ57 CLK4-associating serine/arginine rich protein isoform X2 | 2.0e-250 | 95.58
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQ++MIDRFDGRALLDFIREPGSRHIR Q
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
        EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEME+KVSAPFVSDRSQQPQPP AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED

Query:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY
        EDEE FNSDDSNDEGMEV+AK+FGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMERERERE+AR+TG RVLHHDPYRESRRSPTY
Subjt:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY

Query:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS
        DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKP+TPKIEFITEFGG GESKEPRLEGLSPP SPPSQPDMLNRPSSGRILEALHIDPASG+TLD EKS
Subjt:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS

Query:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR
        SRAVKP  STSSALAKLTK SSSGGPLK GEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRE ERQRLEKLAETSRLNS RRRSRSRSYSRSPRR
Subjt:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR

A0A6J1L1K6 CLK4-associating serine/arginine rich protein isoform X1 | 2.0e-250 | 95.58
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQ++MIDRFDGRALLDFIREPGSRHIR Q
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
        EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEME+KVSAPFVSDRSQQPQPP AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPP-AASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDED

Query:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY
        EDEE FNSDDSNDEGMEV+AK+FGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMERERERE+AR+TG RVLHHDPYRESRRSPTY
Subjt:  EDEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTY

Query:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS
        DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKP+TPKIEFITEFGG GESKEPRLEGLSPP SPPSQPDMLNRPSSGRILEALHIDPASG+TLD EKS
Subjt:  DAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKS

Query:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR
        SRAVKP  STSSALAKLTK SSSGGPLK GEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRE ERQRLEKLAETSRLNS RRRSRSRSYSRSPRR
Subjt:  SRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRR

SwissProt top hits | e value | % identity
A0JNI5 CLK4-associating serine/arginine rich protein | 6.1e-15 | 27.67
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEAR+ ERK+  MM   +KRA+RR  +  K + DP Q +QV G  C+V+ D  +  A +    ++PW G    MIDRFD RA LD I  P        
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVN-QEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYE----------ADGKEESNSDADD
          S E+E  E   N++RYR L+++   G ++E  L  +   E+   +  P   ++ +  +  A+       +G++YE           +  EE  S A++
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVN-QEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYE----------ADGKEESNSDADD

Query:  NASDDEDDEDED-EEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHD
         ++ DED+   D + + + D+ N E +  + K+     YG       +   ++K + E IK   A+ +       + S+ +R   RE  R+ G ++    
Subjt:  NASDDEDDEDED-EEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHD

Query:  PYRESRRSPTYDAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLS-------PPPSPPSQPDMLNRPSSGRIL
        P    R SPTYD Y RS  S S S S S SR  + G  +          KI FIT FGG  E         +        PP+PP QP     P+ GR  
Subjt:  PYRESRRSPTYDAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLS-------PPPSPPSQPDMLNRPSSGRIL

Query:  EALHIDPASGVTLDKEKSSRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNS
         A      S        SS +   ++S+ S  +  +++   GG  + G    +      R  S+       +      +     +  R  +         
Subjt:  EALHIDPASGVTLDKEKSSRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNS

Query:  LRRRSRSRSYSRSPRRVGVEAEVH
         RRRSRSRS S    R G     H
Subjt:  LRRRSRSRSYSRSPRRVGVEAEVH

Q5HZB6 CLK4-associating serine/arginine rich protein | 3.3e-16 | 27.27
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEAR+ ERK+  MM   +KRA+RR  +  K + DP Q +QV G  C+V+ D  +  A +    ++PW G    MIDRFD RA LD I  P        
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVN-QEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYE----------ADGKEESNSDADD
          S E+E  E   N++RYR L+++   G ++E  L  +   E+   +  P   ++ +  +  A+       +G++YE          A+  EE  S A++
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVN-QEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYE----------ADGKEESNSDADD

Query:  NASDDEDDEDED-EEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHD
         ++ DED+   D + + + D+ N E +  + K+     YG       +   ++K + E IK   A+ +       + S+ +R   RE  R+ G ++    
Subjt:  NASDDEDDEDED-EEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHD

Query:  PYRESRRSPTYDAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDP
        P    R SPTYD Y RS  S S S S S SR  S G  +          KI FIT FGG  E         +   + P +P    +P            P
Subjt:  PYRESRRSPTYDAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDP

Query:  ASGVTLDKEKSSRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRS
        A G      + S +   S ++SS  +  + + S  G  + G    +      R  S+       +      +     +  R  +        + RRRSRS
Subjt:  ASGVTLDKEKSSRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRS

Query:  RSYSRSPRRVGVEAEVH
        RS S    + G     H
Subjt:  RSYSRSPRRVGVEAEVH

Q8CFC7 CLK4-associating serine/arginine rich protein | 2.8e-15 | 27.47
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEAR+ ERK+  MM   +KRA+RR  +  K + DP Q +QV G  C+V+ D  +  A +    ++PW G    MIDRFD RA LD I  P        
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVN-QEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYE----------ADGKEESNSDADD
          S E+E  E   N++RYR L+++   G ++E  L  +   E+   +  P   ++ +  +  A+       +G++YE          A+  EE  S A++
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVN-QEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYE----------ADGKEESNSDADD

Query:  NASDDEDDEDED-EEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHD
         ++ DED+   D + + + D+ N E +  + K+     YG       +   ++K + E IK   A+ +       + S+ +R   RE  R+ G ++    
Subjt:  NASDDEDDEDED-EEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHD

Query:  PYRESRRSPTYDAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDP
        P    R SPTYD Y RS  S S S S S SR  S G  +          KI FIT FGG  E         +   + P +P     P +G         P
Subjt:  PYRESRRSPTYDAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDP

Query:  ASGVTLDKEKSSRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRS
        A G      + S +   S ++SS  +  + + S  G  + G    +      R  S+       +      +     +  R  +          RRRSRS
Subjt:  ASGVTLDKEKSSRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRS

Query:  RSYSRSPRRVGVEAEVH
        RS S    + G     H
Subjt:  RSYSRSPRRVGVEAEVH

Q8N2M8 CLK4-associating serine/arginine rich protein | 7.3e-16 | 28.57
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEAR+ ERK+  MM   +KRA+RR  +  K + DP Q +QV G  C+V+ D  +  A +    ++PW G    MIDRFD RA LD I  P        
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVN-QEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYE----------ADGKEESNSDADD
          S E+E  E   N++RYR L+++   G ++E  L  +   E+   +  P   ++ +  +  A+       +G++YE          A+  EE  S A++
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVN-QEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYE----------ADGKEESNSDADD

Query:  NASDDEDDEDED-EEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHD
         ++ DED+   D + + + D+ N E +  + K+     YG       +   ++K + E IK   A+ +       + S+ +R   RE  R+ G ++    
Subjt:  NASDDEDDEDED-EEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHD

Query:  PYRESRRSPTYDAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLS-------PPPSPPSQPDMLNRPSSGRIL
        P    R SPTYD Y RS  S S S S S SR  + G  +          KI FIT FGG  E         +        PP+PP QP     P+ GR  
Subjt:  PYRESRRSPTYDAYPRSRRSRSRSHSPSHSRRHSRGHSDDVHRNKPKTPKIEFITEFGGCGESKEPRLEGLS-------PPPSPPSQPDMLNRPSSGRIL

Query:  EALHIDPASGVTLDKEKSSRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNS
         A     +S        SS A + S+S SS+ +  +++   GG  + G    +      R  S+       +      +     +  R  +         
Subjt:  EALHIDPASGVTLDKEKSSRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNS

Query:  LRRRSRSRSYSRSPRRVG
         RRRSRSRS+S    R G
Subjt:  LRRRSRSRSYSRSPRRVG

Arabidopsis top hits | e value | % identity
AT4G36980.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Splicing factor, suppressor of white apricot (InterPro:IPR019147); Has 7672 Blast hits to 5479 proteins in 321 species: Archae - 0; Bacteria - 89; Metazoa - 5155; Fungi - 712; Plants - 341; Viruses - 39; Other Eukaryotes - 1336 (source: NCBI BLink). | 1.1e-184 | 71.8
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSE+KVHDMMDAARKRAQRRA++LAKRRGDP QSIQ VG+R RV+RDDGLYQAT+DQQGLIPWNGKQ++MIDRFDGRALLDF+RE GSR +R  
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDEDE
        +K+EEEEELEEFVNF+RYRDLIKHRRRGF+DE+GLQHV+QE+E+K++APF+  R+Q  QPP A+KG+YSQVGFSY  +GK+ S    +D+  DDEDDEDE
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDEDE

Query:  DEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTYD
         EE+F+S+DS+DEGME IAK+FG+KRYGWLVYMDKKAKEEEKRQKE+IKGDP+I+KLSRKERRK S++ER+RERE +R  G +++HHDPYRESRRSPTY+
Subjt:  DEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTYD

Query:  AYPRSR--RSRSRSHSPSHSRRHSRG-HSDDVHRNKPKTPKIEFITEF-GGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDK
        AYPRSR  RSRSRS+SPS+SRR+ RG HSD++ +     PKIE+ITEF GG G+ + P+ EG SPP SPPSQ D+L+RPS GRILEALH+DPAS ++L+K
Subjt:  AYPRSR--RSRSRSHSPSHSRRHSRG-HSDDVHRNKPKTPKIEFITEF-GGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDK

Query:  EKSSRAVKPSASTSSALAKLTK-ASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSP
        +K ++  K + STS+ALAKL+K A +S     Q EKKETPQERLKRIM++QL KQIKKD+A E AKKRE ERQRLEKLAETSRL+  R+RSRSRS SRSP
Subjt:  EKSSRAVKPSASTSSALAKLTK-ASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSP

AT4G36980.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Splicing factor, suppressor of white apricot (InterPro:IPR019147); Has 5391 Blast hits to 4388 proteins in 280 species: Archae - 1; Bacteria - 114; Metazoa - 3014; Fungi - 666; Plants - 308; Viruses - 14; Other Eukaryotes - 1274 (source: NCBI BLink). | 1.1e-184 | 71.8
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSE+KVHDMMDAARKRAQRRA++LAKRRGDP QSIQ VG+R RV+RDDGLYQAT+DQQGLIPWNGKQ++MIDRFDGRALLDF+RE GSR +R  
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDEDE
        +K+EEEEELEEFVNF+RYRDLIKHRRRGF+DE+GLQHV+QE+E+K++APF+  R+Q  QPP A+KG+YSQVGFSY  +GK+ S    +D+  DDEDDEDE
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDEDE

Query:  DEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTYD
         EE+F+S+DS+DEGME IAK+FG+KRYGWLVYMDKKAKEEEKRQKE+IKGDP+I+KLSRKERRK S++ER+RERE +R  G +++HHDPYRESRRSPTY+
Subjt:  DEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTYD

Query:  AYPRSR--RSRSRSHSPSHSRRHSRG-HSDDVHRNKPKTPKIEFITEF-GGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDK
        AYPRSR  RSRSRS+SPS+SRR+ RG HSD++ +     PKIE+ITEF GG G+ + P+ EG SPP SPPSQ D+L+RPS GRILEALH+DPAS ++L+K
Subjt:  AYPRSR--RSRSRSHSPSHSRRHSRG-HSDDVHRNKPKTPKIEFITEF-GGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDK

Query:  EKSSRAVKPSASTSSALAKLTK-ASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSP
        +K ++  K + STS+ALAKL+K A +S     Q EKKETPQERLKRIM++QL KQIKKD+A E AKKRE ERQRLEKLAETSRL+  R+RSRSRS SRSP
Subjt:  EKSSRAVKPSASTSSALAKLTK-ASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSP

AT4G36980.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Splicing factor, suppressor of white apricot (InterPro:IPR019147). | 3.5e-183 | 71.8
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSE+KVHDMMDAARKRAQRRA++LAKRRGDP QSIQ VG+R RV+RDDGLYQAT+DQQGLIPWNGKQ++MIDRFDGRALLDF+RE GSR +R  
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDEDE
        +K+EEEEELEEFVNF+RYRDLIKHRRRGF+DE+GLQHV+QE+E+K++APF+  R+Q  QPP A+KG+YSQVGFSY  +GK+ S    +D+  DDEDDEDE
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDEDE

Query:  DEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTYD
         EE+F+S+DS+DEGME IAK+FG+KRYGWLVYMDKKAKEEEKRQKE+IKGDP+I KLSRKERRK S++ER+RERE +R  G +++HHDPYRESRRSPTY+
Subjt:  DEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTYD

Query:  AYPRSR--RSRSRSHSPSHSRRHSRG-HSDDVHRNKPKTPKIEFITEF-GGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDK
        AYPRSR  RSRSRS+SPS+SRR+ RG HSD++ +     PKIE+ITEF GG G+ + P+ EG SPP SPPSQ D+L+RPS GRILEALH+DPAS ++L+K
Subjt:  AYPRSR--RSRSRSHSPSHSRRHSRG-HSDDVHRNKPKTPKIEFITEF-GGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDK

Query:  EKSSRAVKPSASTSSALAKLTK-ASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSP
        +K ++  K + STS+ALAKL+K A +S     Q EKKETPQERLKRIM++QL KQIKKD+A E AKKRE ERQRLEKLAETSRL+  R+RSRSRS SRSP
Subjt:  EKSSRAVKPSASTSSALAKLTK-ASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSP

AT4G36980.4 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Splicing factor, suppressor of white apricot (InterPro:IPR019147). | 1.1e-184 | 71.8
Query:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ
        MWHEARRSE+KVHDMMDAARKRAQRRA++LAKRRGDP QSIQ VG+R RV+RDDGLYQAT+DQQGLIPWNGKQ++MIDRFDGRALLDF+RE GSR +R  
Subjt:  MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQ

Query:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDEDE
        +K+EEEEELEEFVNF+RYRDLIKHRRRGF+DE+GLQHV+QE+E+K++APF+  R+Q  QPP A+KG+YSQVGFSY  +GK+ S    +D+  DDEDDEDE
Subjt:  EKSEEEEELEEFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDEDE

Query:  DEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTYD
         EE+F+S+DS+DEGME IAK+FG+KRYGWLVYMDKKAKEEEKRQKE+IKGDP+I+KLSRKERRK S++ER+RERE +R  G +++HHDPYRESRRSPTY+
Subjt:  DEEDFNSDDSNDEGMEVIAKEFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTYD

Query:  AYPRSR--RSRSRSHSPSHSRRHSRG-HSDDVHRNKPKTPKIEFITEF-GGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDK
        AYPRSR  RSRSRS+SPS+SRR+ RG HSD++ +     PKIE+ITEF GG G+ + P+ EG SPP SPPSQ D+L+RPS GRILEALH+DPAS ++L+K
Subjt:  AYPRSR--RSRSRSHSPSHSRRHSRG-HSDDVHRNKPKTPKIEFITEF-GGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDK

Query:  EKSSRAVKPSASTSSALAKLTK-ASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSP
        +K ++  K + STS+ALAKL+K A +S     Q EKKETPQERLKRIM++QL KQIKKD+A E AKKRE ERQRLEKLAETSRL+  R+RSRSRS SRSP
Subjt:  EKSSRAVKPSASTSSALAKLTK-ASSSGGPLKQGEKKETPQERLKRIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSP


Sequences
CDS sequence
ATGTGGCACGAGGCAAGGAGATCCGAGAGGAAGGTCCACGACATGATGGACGCTGCTCGAAAGCGAGCTCAAAGACGCGCTGTCTTTCTCGCCAAGCGGCGTGGCGATCC
TCAGCAATCTATACAGGTCGTTGGAACTCGATGCCGCGTTTATCGTGACGATGGTCTCTATCAAGCGACTCAGGACCAACAAGGCTTGATTCCTTGGAATGGAAAGCAGG
AAATTATGATTGATAGATTTGATGGACGTGCCCTGCTTGATTTCATTCGAGAACCTGGATCCCGACATATTCGAGCACAAGAAAAATCTGAAGAAGAAGAGGAATTAGAA
GAGTTTGTTAATTTTCAGCGTTATCGAGATTTAATTAAGCATCGACGTCGAGGATTTAATGATGAGGATGGTTTACAACATGTGAATCAAGAAATGGAATCCAAGGTGTC
TGCTCCATTTGTATCGGACAGATCTCAGCAACCGCAGCCTCCTGCTGCAAGCAAGGGATCATATTCACAAGTGGGATTTTCTTATGAGGCGGATGGGAAAGAGGAATCAA
ATTCAGATGCAGATGATAATGCTAGTGATGATGAAGATGATGAGGATGAGGACGAGGAGGATTTTAACAGTGATGACAGTAATGATGAAGGAATGGAGGTAATAGCAAAA
GAGTTCGGAGTGAAAAGATATGGTTGGCTTGTTTACATGGATAAAAAAGCTAAGGAGGAAGAGAAGAGGCAAAAGGAAATCATCAAAGGTGATCCTGCAATAAGGAAGCT
AAGTCGCAAGGAAAGAAGGAAAGCATCTCAAATGGAACGGGAAAGGGAAAGGGAAGCTGCACGAATTACTGGTACCAGAGTCCTCCATCATGATCCTTATCGAGAATCTA
GACGGAGTCCAACTTATGATGCTTATCCACGTTCTAGAAGATCGAGGTCCAGATCGCATTCCCCATCACACTCAAGGCGTCATTCTCGTGGCCATTCTGATGATGTTCAT
CGAAACAAACCAAAGACTCCCAAAATTGAATTTATCACTGAATTTGGGGGCTGTGGCGAAAGCAAGGAACCAAGGCTAGAAGGACTATCTCCACCACCATCTCCTCCATC
TCAGCCTGATATGTTAAACCGGCCATCATCTGGTCGTATACTTGAGGCATTGCATATTGATCCAGCATCTGGCGTGACACTTGATAAAGAAAAGAGCAGTAGAGCAGTAA
AACCATCAGCAAGTACGTCGTCAGCACTTGCAAAGTTAACAAAGGCAAGTTCTTCTGGAGGGCCTTTAAAACAGGGAGAGAAGAAAGAAACTCCTCAAGAACGACTTAAG
AGGATCATGAGTCAACAGCTTAATAAACAAATTAAGAAAGATACAGCTGCAGAAATGGCTAAAAAGAGGGAGCTGGAGCGCCAGAGACTCGAAAAGCTGGCAGAAACTAG
CCGTTTAAATAGTCTTAGGCGTCGGAGCCGCAGCAGAAGCTACAGCCGTTCACCGCGAAGGGTAGGAGTCGAAGCAGAGGTTCACGGAGATATTATTCTCATTCTCGGTC
TCGCTCTCGATCTCGTTCTCGGTCTCAATCTCGAAGCCATTCTCGCTCACATTCCCGTTCGCCTTCTTACTCTCGCTCACCAAGATGAGATGTCTGACCATTTGGTGGGT
TGGAATAGTGCTGAAAAACAGGGAAATGTGACAGCTGAATTGAATGGAACTTAG
mRNA sequence
ATGTGGCACGAGGCAAGGAGATCCGAGAGGAAGGTCCACGACATGATGGACGCTGCTCGAAAGCGAGCTCAAAGACGCGCTGTCTTTCTCGCCAAGCGGCGTGGCGATCC
TCAGCAATCTATACAGGTCGTTGGAACTCGATGCCGCGTTTATCGTGACGATGGTCTCTATCAAGCGACTCAGGACCAACAAGGCTTGATTCCTTGGAATGGAAAGCAGG
AAATTATGATTGATAGATTTGATGGACGTGCCCTGCTTGATTTCATTCGAGAACCTGGATCCCGACATATTCGAGCACAAGAAAAATCTGAAGAAGAAGAGGAATTAGAA
GAGTTTGTTAATTTTCAGCGTTATCGAGATTTAATTAAGCATCGACGTCGAGGATTTAATGATGAGGATGGTTTACAACATGTGAATCAAGAAATGGAATCCAAGGTGTC
TGCTCCATTTGTATCGGACAGATCTCAGCAACCGCAGCCTCCTGCTGCAAGCAAGGGATCATATTCACAAGTGGGATTTTCTTATGAGGCGGATGGGAAAGAGGAATCAA
ATTCAGATGCAGATGATAATGCTAGTGATGATGAAGATGATGAGGATGAGGACGAGGAGGATTTTAACAGTGATGACAGTAATGATGAAGGAATGGAGGTAATAGCAAAA
GAGTTCGGAGTGAAAAGATATGGTTGGCTTGTTTACATGGATAAAAAAGCTAAGGAGGAAGAGAAGAGGCAAAAGGAAATCATCAAAGGTGATCCTGCAATAAGGAAGCT
AAGTCGCAAGGAAAGAAGGAAAGCATCTCAAATGGAACGGGAAAGGGAAAGGGAAGCTGCACGAATTACTGGTACCAGAGTCCTCCATCATGATCCTTATCGAGAATCTA
GACGGAGTCCAACTTATGATGCTTATCCACGTTCTAGAAGATCGAGGTCCAGATCGCATTCCCCATCACACTCAAGGCGTCATTCTCGTGGCCATTCTGATGATGTTCAT
CGAAACAAACCAAAGACTCCCAAAATTGAATTTATCACTGAATTTGGGGGCTGTGGCGAAAGCAAGGAACCAAGGCTAGAAGGACTATCTCCACCACCATCTCCTCCATC
TCAGCCTGATATGTTAAACCGGCCATCATCTGGTCGTATACTTGAGGCATTGCATATTGATCCAGCATCTGGCGTGACACTTGATAAAGAAAAGAGCAGTAGAGCAGTAA
AACCATCAGCAAGTACGTCGTCAGCACTTGCAAAGTTAACAAAGGCAAGTTCTTCTGGAGGGCCTTTAAAACAGGGAGAGAAGAAAGAAACTCCTCAAGAACGACTTAAG
AGGATCATGAGTCAACAGCTTAATAAACAAATTAAGAAAGATACAGCTGCAGAAATGGCTAAAAAGAGGGAGCTGGAGCGCCAGAGACTCGAAAAGCTGGCAGAAACTAG
CCGTTTAAATAGTCTTAGGCGTCGGAGCCGCAGCAGAAGCTACAGCCGTTCACCGCGAAGGGTAGGAGTCGAAGCAGAGGTTCACGGAGATATTATTCTCATTCTCGGTC
TCGCTCTCGATCTCGTTCTCGGTCTCAATCTCGAAGCCATTCTCGCTCACATTCCCGTTCGCCTTCTTACTCTCGCTCACCAAGATGAGATGTCTGACCATTTGGTGGGT
TGGAATAGTGCTGAAAAACAGGGAAATGTGACAGCTGAATTGAATGGAACTTAG
Protein sequence
MWHEARRSERKVHDMMDAARKRAQRRAVFLAKRRGDPQQSIQVVGTRCRVYRDDGLYQATQDQQGLIPWNGKQEIMIDRFDGRALLDFIREPGSRHIRAQEKSEEEEELE
EFVNFQRYRDLIKHRRRGFNDEDGLQHVNQEMESKVSAPFVSDRSQQPQPPAASKGSYSQVGFSYEADGKEESNSDADDNASDDEDDEDEDEEDFNSDDSNDEGMEVIAK
EFGVKRYGWLVYMDKKAKEEEKRQKEIIKGDPAIRKLSRKERRKASQMEREREREAARITGTRVLHHDPYRESRRSPTYDAYPRSRRSRSRSHSPSHSRRHSRGHSDDVH
RNKPKTPKIEFITEFGGCGESKEPRLEGLSPPPSPPSQPDMLNRPSSGRILEALHIDPASGVTLDKEKSSRAVKPSASTSSALAKLTKASSSGGPLKQGEKKETPQERLK
RIMSQQLNKQIKKDTAAEMAKKRELERQRLEKLAETSRLNSLRRRSRSRSYSRSPRRVGVEAEVHGDIILILGLALDLVLGLNLEAILAHIPVRLLTLAHQDEMSDHLVG
WNSAEKQGNVTAELNGT