; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0017399 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0017399
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description40S ribosomal protein S3
Genome locationchr01:28563359..28570706
RNA-Seq ExpressionPI0017399
SyntenyPI0017399
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0015935 - small ribosomal subunit (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR001351 - Ribosomal protein S3, C-terminal
IPR004044 - K Homology domain, type 2
IPR005703 - Ribosomal protein S3, eukaryotic/archaeal
IPR009019 - K homology domain superfamily, prokaryotic type
IPR009057 - Homeobox-like domain superfamily
IPR015946 - K homology domain-like, alpha/beta
IPR017930 - Myb domain
IPR018280 - Ribosomal protein S3, conserved site
IPR036419 - Ribosomal protein S3, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059406.1 uncharacterized protein E6C27_scaffold242G001320 [Cucumis melo var. makuwa]0.0e+0092.9Show/hide
Query:  LCILVACGGFSMETVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFG
        LCILVACGGFSMET VG VENERKIVESGAA+DGS+LSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPT+ TLFG
Subjt:  LCILVACGGFSMETVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFG

Query:  KPRVEVLNDTPGLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENH
        KP VEVLND PGLP+SD+FEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNY SAYV+GD KGSDEHG LPVIDEKLQSN+SLQ           ENH
Subjt:  KPRVEVLNDTPGLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENH

Query:  VNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEE
        VNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKF+EE
Subjt:  VNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEE

Query:  STSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSI
        S+ NVE MST+PTAETLNIECRVSPTTYSLENKDLHH EDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVV+LNKNTISD +SANSI
Subjt:  STSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSI

Query:  ARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHH
        ARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKD+NPSVTVTDE EK LEQKQTASDN SDDNTAVVPTTKGGMRRKHH
Subjt:  ARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHH

Query:  RAWTLVEVIKLVEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGK
        RAWTLVEVIKLVEGVSKCGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPS+HGQGK
Subjt:  RAWTLVEVIKLVEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGK

Query:  LGGGVSGGSMHEMSSSTVCS
        LGGG  G SMHEMSSSTVCS
Subjt:  LGGGVSGGSMHEMSSSTVCS

KAE8646506.1 hypothetical protein Csa_015876 [Cucumis sativus]0.0e+0090.04Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEDFIRLIFKC----MENLF--WPL----TSWAVAHIEEPKIEFTIY-------WQTKSD---ETISTLCILVACGGFSMETVVG
        PTPLPDLVTIH+PKEEEDFIR +       +E L   W +    T WAVAH+EEP  EFT         +   S      +  LCILVACGGFSMET VG
Subjt:  PTPLPDLVTIHTPKEEEDFIRLIFKC----MENLF--WPL----TSWAVAHIEEPKIEFTIY-------WQTKSD---ETISTLCILVACGGFSMETVVG

Query:  FVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTPGLPESD
         VENERKIVESGA QDGS+LSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGC P EGTLFGKP VEVLNDTPGL +SD
Subjt:  FVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTPGLPESD

Query:  SFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCLKHPDKS
        +FEAAADYNARLEYIEEVLQKVKQEERLRLTCGS NYASAYV+GDRKGSDEHG LPVIDEKLQSNISLQEITH ISPSLKENHVNENGSLGDCLKHPDKS
Subjt:  SFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCLKHPDKS

Query:  VESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTSPTAETL
        VESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWL+RRI MGLTNSCDIP SSFIIKEGKF+EE + NVE +ST+PTAETL
Subjt:  VESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTSPTAETL

Query:  NIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGGRTVIT
        NIECRVSP+TYSLENKDLHH EDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVV++NKNTISD VSANSIARPIKKVYSDGGRTVIT
Subjt:  NIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGGRTVIT

Query:  RLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKLVEGVSK
        RLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKD++PSVTVTDEAEKNLEQKQT SDN SDDNTAVV TTKGGMRRKHHRAWTLVEVIKLVEGVSK
Subjt:  RLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKLVEGVSK

Query:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMH
        CGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQ+LLRVRELAEMHAQIPPS+HGQGKLGGG   G+ H
Subjt:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMH

XP_008462318.2 PREDICTED: uncharacterized protein LOC103500701 isoform X1 [Cucumis melo]0.0e+0092.74Show/hide
Query:  METVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTP
        MET VG VENERKIVESGAA+DGS+LSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTE TLFGKP VEVLND P
Subjt:  METVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTP

Query:  GLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCL
        GLP+SD+FEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNY SAYV+GD KGSDEHG LPVIDEKLQSN+SLQ           ENHVNENGSLGDCL
Subjt:  GLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCL

Query:  KHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTS
        KHPDKSVESESSDALCTTSNPDFSLLKGD+CLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKF+EES+ NVE MST+
Subjt:  KHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTS

Query:  PTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDG
        PTAETLNIECRVSPTTYSLENKDLHH EDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVV+LNKNTISD +SANSIARPIKKVYSDG
Subjt:  PTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDG

Query:  GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKL
        GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKD+NPSVTVTDE EK LEQKQTASDN SDDNTAVVPTTKGGMRRKHHRAWTLVEVIKL
Subjt:  GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKL

Query:  VEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMH
        VEGVSKCGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPS+HGQGKLGGG  G SMH
Subjt:  VEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMH

Query:  EMSSST
        EMSSST
Subjt:  EMSSST

XP_031745224.1 uncharacterized protein LOC101203003 isoform X1 [Cucumis sativus]0.0e+0093.27Show/hide
Query:  METVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTP
        MET VG VENERKIVESGA QDGS+LSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGC P EGTLFGKP VEVLNDTP
Subjt:  METVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTP

Query:  GLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCL
        GL +SD+FEAAADYNARLEYIEEVLQKVKQEERLRLTCGS NYASAYV+GDRKGSDEHG LPVIDEKLQSNISLQEITH ISPSLKENHVNENGSLGDCL
Subjt:  GLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCL

Query:  KHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTS
        KHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWL+RRI MGLTNSCDIP SSFIIKEGKF+EE + NVE +ST+
Subjt:  KHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTS

Query:  PTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDG
        PTAETLNIECRVSP+TYSLENKDLHH EDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVV++NKNTISD VSANSIARPIKKVYSDG
Subjt:  PTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDG

Query:  GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKL
        GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKD++PSVTVTDEAEKNLEQKQT SDN SDDNTAVV TTKGGMRRKHHRAWTLVEVIKL
Subjt:  GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKL

Query:  VEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMH
        VEGVSKCGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQ+LLRVRELAEMHAQIPPS+HGQGKLGGG   GSMH
Subjt:  VEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMH

Query:  EMSSSTVCS
        EMSSST+CS
Subjt:  EMSSSTVCS

XP_038897567.1 uncharacterized protein LOC120085586 isoform X1 [Benincasa hispida]1.9e-30789.16Show/hide
Query:  METVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTP
        METVVGFVENE KIVESGAAQDGS+LSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNE+VEDAGQIVGC PTEGTLFGKPRVE+ ND P
Subjt:  METVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTP

Query:  GLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCL
        GLP+S++ EAAA+YNARLEYIEEVLQKVKQEERLRLTCGSP Y SA V+GDRK SDEHG LPV+DE LQSNI LQEITH ISP+LK++HVNENGSLG+C 
Subjt:  GLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCL

Query:  KHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTS
        KHPDKSVESESSDALCTT NPDFSLLKGDVCLDNLSIREL ECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSF+IKEGKF+EE + NV+ MST 
Subjt:  KHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTS

Query:  PTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDG
        P AE L IECR SPTTYSLENKD + FEDMELDHGSEGQHDERAAVKR+RKPTRRYIEELSEVESREYVQKV++LNKN ISDGVSANSIARPIKKV SDG
Subjt:  PTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDG

Query:  GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKL
        GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKD+V LVF+LP+KD+NPSVTVTDEAEKNLEQKQTAS NASDDNT+VV T+KGGMRRKHHRAWTLVEVIKL
Subjt:  GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKL

Query:  VEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMH
        VEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKAS  QTPVDEGISSRKHASISIPAQILL+VRELAEMHAQIPPS+HGQGKLGGGVS GSMH
Subjt:  VEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMH

Query:  EMSSSTVCS
        EMS+S +CS
Subjt:  EMSSSTVCS

TrEMBL top hitse value%identityAlignment
A0A0A0KCL9 Uncharacterized protein0.0e+0089.57Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEDFIRLIFKC----MENLF--WPL----TSWAVAHIEEPKIEFTIYWQTKSDET---------------ISTLCILVACGGFSM
        PTPLPDLVTIH+PKEEEDFIR +       +E L   W +    T WAVAH+EEP  EFT      S                  +  LCILVACGGFSM
Subjt:  PTPLPDLVTIHTPKEEEDFIRLIFKC----MENLF--WPL----TSWAVAHIEEPKIEFTIYWQTKSDET---------------ISTLCILVACGGFSM

Query:  ETVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTPG
        ET VG VENERKIVESGA QDGS+LSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGC P EGTLFGKP VEVLNDTPG
Subjt:  ETVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTPG

Query:  LPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCLK
        L +SD+FEAAADYNARLEYIEEVLQKVKQEERLRLTCGS NYASAYV+GDRKGSDEHG LPVIDEKLQSNISLQEITH ISPSLKENHVNENGSLGDCLK
Subjt:  LPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCLK

Query:  HPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTSP
        HPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWL+RRI MGLTNSCDIP SSFIIKEGKF+EE + NVE +ST+P
Subjt:  HPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTSP

Query:  TAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGG
        TAETLNIECRVSP+TYSLENKDLHH EDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVV++NKNTISD VSANSIARPIKKVYSDGG
Subjt:  TAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGG

Query:  RTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKLV
        RTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKD++PSVTVTDEAEKNLEQKQT SDN SDDNTAVV TTKGGMRRKHHRAWTLVEVIKLV
Subjt:  RTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKLV

Query:  EGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISS
        EGVSKCGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEG+ S
Subjt:  EGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISS

A0A1S3CGM9 uncharacterized protein LOC103500701 isoform X33.2e-28792.73Show/hide
Query:  MEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTPGLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSD
        MEVEDLLEDDKNEKVEDAGQIVGCKPTE TLFGKP VEVLND PGLP+SD+FEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNY SAYV+GD KGSD
Subjt:  MEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTPGLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSD

Query:  EHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKD
        EHG LPVIDEKLQSN+SLQ           ENHVNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGD+CLDNLSIRELRECFKATFGRDTTVKD
Subjt:  EHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKD

Query:  KSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRY
        KSWLKRRIAMGLTNSCDIPASSFIIKEGKF+EES+ NVE MST+PTAETLNIECRVSPTTYSLENKDLHH EDMELDHGSEGQHDERAAVKRVRKPTRRY
Subjt:  KSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRY

Query:  IEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAE
        IEELSEVESREYVQKVV+LNKNTISD +SANSIARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKD+NPSVTVTDE E
Subjt:  IEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAE

Query:  KNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKLVEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRK
        K LEQKQTASDN SDDNTAVVPTTKGGMRRKHHRAWTLVEVIKLVEGVSKCGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRK
Subjt:  KNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKLVEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRK

Query:  HASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMHEMSSST
        HASISIPAQILLRVRELAEMHAQIPPS+HGQGKLGGG  G SMHEMSSST
Subjt:  HASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMHEMSSST

A0A1S3CGR3 uncharacterized protein LOC103500701 isoform X29.8e-29792.59Show/hide
Query:  VRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTPGLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCG
        ++VDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTE TLFGKP VEVLND PGLP+SD+FEAAADYNARLEYIEEVLQKVKQEERLRLTCG
Subjt:  VRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTPGLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCG

Query:  SPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRE
        SPNY SAYV+GD KGSDEHG LPVIDEKLQSN+SLQ           ENHVNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGD+CLDNLSIRE
Subjt:  SPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRE

Query:  LRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQ
        LRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKF+EES+ NVE MST+PTAETLNIECRVSPTTYSLENKDLHH EDMELDHGSEGQ
Subjt:  LRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQ

Query:  HDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFA
        HDERAAVKRVRKPTRRYIEELSEVESREYVQKVV+LNKNTISD +SANSIARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFA
Subjt:  HDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFA

Query:  LPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKLVEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLL
        LPEKD+NPSVTVTDE EK LEQKQTASDN SDDNTAVVPTTKGGMRRKHHRAWTLVEVIKLVEGVSKCGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLL
Subjt:  LPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKLVEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLL

Query:  KASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMHEMSSST
        KASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPS+HGQGKLGGG  G SMHEMSSST
Subjt:  KASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMHEMSSST

A0A1S3CI77 uncharacterized protein LOC103500701 isoform X10.0e+0092.74Show/hide
Query:  METVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTP
        MET VG VENERKIVESGAA+DGS+LSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTE TLFGKP VEVLND P
Subjt:  METVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTP

Query:  GLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCL
        GLP+SD+FEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNY SAYV+GD KGSDEHG LPVIDEKLQSN+SLQ           ENHVNENGSLGDCL
Subjt:  GLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENHVNENGSLGDCL

Query:  KHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTS
        KHPDKSVESESSDALCTTSNPDFSLLKGD+CLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKF+EES+ NVE MST+
Subjt:  KHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEESTSNVERMSTS

Query:  PTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDG
        PTAETLNIECRVSPTTYSLENKDLHH EDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVV+LNKNTISD +SANSIARPIKKVYSDG
Subjt:  PTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSIARPIKKVYSDG

Query:  GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKL
        GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKD+NPSVTVTDE EK LEQKQTASDN SDDNTAVVPTTKGGMRRKHHRAWTLVEVIKL
Subjt:  GRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVEVIKL

Query:  VEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMH
        VEGVSKCGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPS+HGQGKLGGG  G SMH
Subjt:  VEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMH

Query:  EMSSST
        EMSSST
Subjt:  EMSSST

A0A5D3BW39 HTH myb-type domain-containing protein0.0e+0092.9Show/hide
Query:  LCILVACGGFSMETVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFG
        LCILVACGGFSMET VG VENERKIVESGAA+DGS+LSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPT+ TLFG
Subjt:  LCILVACGGFSMETVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFG

Query:  KPRVEVLNDTPGLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENH
        KP VEVLND PGLP+SD+FEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNY SAYV+GD KGSDEHG LPVIDEKLQSN+SLQ           ENH
Subjt:  KPRVEVLNDTPGLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQEITHLISPSLKENH

Query:  VNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEE
        VNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKF+EE
Subjt:  VNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFLEE

Query:  STSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSI
        S+ NVE MST+PTAETLNIECRVSPTTYSLENKDLHH EDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVV+LNKNTISD +SANSI
Subjt:  STSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSANSI

Query:  ARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHH
        ARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKD+NPSVTVTDE EK LEQKQTASDN SDDNTAVVPTTKGGMRRKHH
Subjt:  ARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHH

Query:  RAWTLVEVIKLVEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGK
        RAWTLVEVIKLVEGVSKCGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPS+HGQGK
Subjt:  RAWTLVEVIKLVEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGK

Query:  LGGGVSGGSMHEMSSSTVCS
        LGGG  G SMHEMSSSTVCS
Subjt:  LGGGVSGGSMHEMSSSTVCS

SwissProt top hitse value%identityAlignment
P02350 40S ribosomal protein S3-A2.2e-9684.72Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MA Q+SKKRKFVADG+F AELNE LTRELAEDGYSGVEVRVTP RTEIII ATRTQNVLGEKGRRIRELT+VVQKRF FPE SVELYAEKV  RGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRF+MESGAKGCEV+VSGKLR QRAKSMKF DG MI SG PV  Y+D+AVRHVLLRQGVLGIKVKIML WDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEE
          PLPD V+I  PK+E
Subjt:  PTPLPDLVTIHTPKEE

P47835 40S ribosomal protein S3-B2.2e-9684.72Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MA QMSKKRKFVADG+F AELNE LTRELAEDGYSGVEVRVTP +TEIII ATRTQNVLGEKGRRIRELT+VVQKRF FPE SVELYAEKV  RGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRF+MESGAKGCEV+VSGKLR QRAKSMKF DG MI SG PV  Y+D+AVRHVLLRQGVLGIKVKIML WDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEE
          PLPD V+I  PK+E
Subjt:  PTPLPDLVTIHTPKEE

Q9FJA6 40S ribosomal protein S3-31.6e-11091.82Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP++SVELYAEKV NRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYID+AVRHVLLRQGVLG+KVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEDFI
         TPLPD+V IHTPKE++ +I
Subjt:  PTPLPDLVTIHTPKEEEDFI

Q9M339 40S ribosomal protein S3-22.3e-10992.63Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        M TQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYIDSAVRHVLLRQGVLGIKVK+MLDWDPKG  GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEE
         TPLPD+V IH+PKEEE
Subjt:  PTPLPDLVTIHTPKEEE

Q9SIP7 40S ribosomal protein S3-11.9e-10892.13Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYID+AVRHVLLRQGVLGIKVKIMLDWDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEE
         TPLPD+V IH PK++
Subjt:  PTPLPDLVTIHTPKEE

Arabidopsis top hitse value%identityAlignment
AT1G72650.1 TRF-like 61.4e-9839.76Show/hide
Query:  METVVGFVENERKIVESGAAQDGSSL-SPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVED---------------LLEDDKNEKVE--------DAGQ
        M TVVG VE+ R + E  A    +   S NQI +PV YKLVRV GDG  VPATD+E++EV D               L  D++N +V+        DA Q
Subjt:  METVVGFVENERKIVESGAAQDGSSL-SPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVED---------------LLEDDKNEKVE--------DAGQ

Query:  IVGCKPTEGTLFGKPRVEVLND-TPGLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQ
         +G  P EG      ++E       GL  SD+ +   D     +Y EE+LQKV+QEERL    GS    S     + + S+E+      ++++     LQ
Subjt:  IVGCKPTEGTLFGKPRVEVLND-TPGLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISLQ

Query:  EITHLISPSLKENHVNENGSLGDCLKHPDKSVESESSDAL-CTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDI
        +          E  +NE+    D ++    +V S    AL      PDFS ++G++CLDNL I+ L+E F+ATFGRDTTVKDK+WLKRRIAMGL NSCD+
Subjt:  EITHLISPSLKENHVNENGSLGDCLKHPDKSVESESSDAL-CTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDI

Query:  PASSFIIKEGKFLEESTSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQH--------DERAAVKRVRKPTRRYIEELSEVESR
        P ++  +K+ K +     N E+ +    A    I   +     + + KD     D    H + G H         E+ A KRVRKPTRRYIEELSE + +
Subjt:  PASSFIIKEGKFLEESTSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQH--------DERAAVKRVRKPTRRYIEELSEVESR

Query:  EYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFA----LPEK-------------------
        +   K V  +K+     +S  S  R I    S G R  +TR+ SL GS  +VP VS VRRSRPR++++ L+      L +K                   
Subjt:  EYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFA----LPEK-------------------

Query:  ---------------------DENPSVTVTDEAEKNLEQKQ-TASDNASDDNTAVVPTTKGG-MRRKHHRAWTLVEVIKLVEGVSKCGAGRWSEIKKLSF
                             DEN    +  E ++ +E +   +S N+SD+N   VP  +GG +RRKHHRAWTL E+ KLVEGVSK GAG+WSEIKK  F
Subjt:  ---------------------DENPSVTVTDEAEKNLEQKQ-TASDNASDDNTAVVPTTKGG-MRRKHHRAWTLVEVIKLVEGVSKCGAGRWSEIKKLSF

Query:  SSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQ
        SS+SYRTSVDLKDKWRNLLK S  Q+P +   S +KH S+ IP QILLRVRELAE  +Q
Subjt:  SSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQ

AT1G72650.2 TRF-like 69.9e-10040Show/hide
Query:  METVVGFVENERKIVESGAAQDGSSL-SPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVED---------------LLEDDKNEKVE--------DAGQ
        M TVVG VE+ R + E  A    +   S NQI +PV YKLVRV GDG  VPATD+E++EV D               L  D++N +V+        DA Q
Subjt:  METVVGFVENERKIVESGAAQDGSSL-SPNQIADPVVYKLVRVDGDGRFVPATDDEVMEVED---------------LLEDDKNEKVE--------DAGQ

Query:  IVGCKPTEGTLFGKPRVEVLND-TPGLPESDSFEAAAD-YNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISL
         +G  P EG      ++E       GL  SD+ +   D   +R EY EE+LQKV+QEERL    GS    S     + + S+E+      ++++     L
Subjt:  IVGCKPTEGTLFGKPRVEVLND-TPGLPESDSFEAAAD-YNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDEKLQSNISL

Query:  QEITHLISPSLKENHVNENGSLGDCLKHPDKSVESESSDAL-CTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCD
        Q+          E  +NE+    D ++    +V S    AL      PDFS ++G++CLDNL I+ L+E F+ATFGRDTTVKDK+WLKRRIAMGL NSCD
Subjt:  QEITHLISPSLKENHVNENGSLGDCLKHPDKSVESESSDAL-CTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCD

Query:  IPASSFIIKEGKFLEESTSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQH--------DERAAVKRVRKPTRRYIEELSEVES
        +P ++  +K+ K +     N E+ +    A    I   +     + + KD     D    H + G H         E+ A KRVRKPTRRYIEELSE + 
Subjt:  IPASSFIIKEGKFLEESTSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQH--------DERAAVKRVRKPTRRYIEELSEVES

Query:  REYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFA----LPEK------------------
        ++   K V  +K+     +S  S  R I    S G R  +TR+ SL GS  +VP VS VRRSRPR++++ L+      L +K                  
Subjt:  REYVQKVVNLNKNTISDGVSANSIARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFA----LPEK------------------

Query:  ----------------------DENPSVTVTDEAEKNLEQKQ-TASDNASDDNTAVVPTTKGG-MRRKHHRAWTLVEVIKLVEGVSKCGAGRWSEIKKLS
                              DEN    +  E ++ +E +   +S N+SD+N   VP  +GG +RRKHHRAWTL E+ KLVEGVSK GAG+WSEIKK  
Subjt:  ----------------------DENPSVTVTDEAEKNLEQKQ-TASDNASDDNTAVVPTTKGG-MRRKHHRAWTLVEVIKLVEGVSKCGAGRWSEIKKLS

Query:  FSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQ
        FSS+SYRTSVDLKDKWRNLLK S  Q+P +   S +KH S+ IP QILLRVRELAE  +Q
Subjt:  FSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQ

AT2G31610.1 Ribosomal protein S3 family protein1.4e-10992.13Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYID+AVRHVLLRQGVLGIKVKIMLDWDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEE
         TPLPD+V IH PK++
Subjt:  PTPLPDLVTIHTPKEE

AT3G53870.1 Ribosomal protein S3 family protein1.6e-11092.63Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        M TQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYIDSAVRHVLLRQGVLGIKVK+MLDWDPKG  GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEE
         TPLPD+V IH+PKEEE
Subjt:  PTPLPDLVTIHTPKEEE

AT5G35530.1 Ribosomal protein S3 family protein1.1e-11191.82Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP++SVELYAEKV NRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYID+AVRHVLLRQGVLG+KVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEDFI
         TPLPD+V IHTPKE++ +I
Subjt:  PTPLPDLVTIHTPKEEEDFI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACTCAGATGAGTAAAAAGCGAAAGTTTGTGGCTGATGGAGTGTTCTTTGCTGAGCTTAATGAAGTTCTTACCAGAGAGCTTGCAGAGGATGGATACTCCGGAGT
AGAGGTTAGGGTTACACCTATGCGTACTGAGATTATCATCAGGGCAACTCGCACTCAGAATGTTCTTGGTGAGAAAGGAAGGAGAATCAGAGAATTGACATCTGTTGTTC
AGAAGCGTTTCAAGTTCCCTGAAAACAGTGTTGAGCTATATGCCGAGAAGGTCAACAACAGAGGACTCTGTGCCATTGCTCAAGCTGAGTCTCTGCGCTACAAGCTTCTA
GGAGGCCTTGCTGTGAGAAGGGCTTGCTATGGTGTTCTTAGATTTGTGATGGAGAGTGGAGCTAAAGGATGTGAGGTTATTGTTAGTGGTAAGCTGAGGGCTCAGCGTGC
AAAATCCATGAAATTCAAGGATGGTTACATGATTTCGTCTGGACAGCCGGTGAAAGAGTACATAGACTCTGCCGTGAGACACGTTCTCCTTAGACAGGGTGTTTTGGGTA
TCAAGGTGAAGATCATGCTTGATTGGGATCCTAAGGGTAAGCAAGGTCCACCAACGCCACTTCCGGATTTGGTTACCATCCATACTCCCAAAGAGGAAGAGGATTTCATT
AGGCTGATTTTCAAATGCATGGAAAATCTATTCTGGCCTCTCACATCTTGGGCCGTAGCCCATATTGAGGAGCCCAAAATTGAGTTTACAATATATTGGCAGACGAAGAG
TGACGAGACGATAAGCACTTTATGTATTTTAGTTGCATGTGGTGGATTTTCCATGGAAACAGTGGTTGGATTTGTGGAGAATGAAAGGAAAATTGTTGAAAGCGGGGCTG
CTCAGGATGGCTCCTCTCTGTCCCCAAATCAGATTGCTGACCCAGTTGTGTACAAACTTGTCAGGGTTGATGGTGATGGCAGATTTGTTCCTGCCACAGATGATGAGGTA
ATGGAGGTTGAGGATTTACTTGAAGACGACAAGAATGAAAAAGTGGAAGACGCAGGACAAATTGTAGGATGCAAACCCACAGAAGGCACTTTATTTGGGAAGCCTCGTGT
AGAAGTCTTGAATGATACGCCAGGTCTACCGGAATCTGATTCCTTTGAAGCTGCTGCAGATTATAATGCCCGGTTGGAGTACATTGAGGAGGTCTTACAAAAGGTGAAGC
AGGAAGAGAGGCTTCGCTTAACATGTGGATCACCGAACTATGCTTCTGCTTACGTGAGTGGAGACAGGAAGGGTTCTGATGAGCATGGTAGCTTGCCTGTTATTGATGAA
AAGCTCCAATCCAATATTTCTCTGCAGGAAATAACTCACTTAATTTCTCCAAGTTTAAAGGAGAATCACGTGAATGAAAATGGGAGTCTAGGCGATTGTTTAAAGCATCC
AGATAAATCAGTCGAGTCTGAATCCTCAGACGCCCTCTGCACTACGTCTAACCCTGATTTTTCCTTGTTGAAGGGCGACGTTTGCCTAGATAATCTTTCAATTAGAGAAC
TCCGTGAATGTTTCAAAGCAACTTTTGGGAGAGACACTACAGTTAAAGACAAGTCGTGGCTTAAGAGGAGAATTGCCATGGGATTGACCAACTCATGCGACATTCCAGCC
TCATCTTTTATAATTAAGGAAGGCAAGTTTCTTGAAGAAAGTACTTCAAACGTGGAGCGCATGTCCACTAGTCCAACTGCCGAAACTTTGAATATTGAATGCCGAGTTTC
GCCAACCACTTATAGTTTGGAAAACAAGGACCTTCATCACTTTGAGGATATGGAACTTGATCATGGAAGTGAAGGTCAACACGATGAGAGAGCTGCTGTTAAAAGAGTTC
GGAAGCCTACCAGGCGGTATATTGAAGAACTTTCTGAAGTGGAGTCAAGAGAGTATGTGCAAAAGGTGGTTAATTTGAATAAAAATACTATATCAGATGGTGTATCTGCG
AATTCCATTGCAAGACCTATTAAAAAGGTCTATTCAGATGGAGGAAGAACTGTAATCACAAGATTGGATTCTCTTGGTGGATCTGGCTTTCAAGTCCCATGTGTTTCAAG
AGTTAGAAGGAGCCGTCCGAGGAAAGACGTTGTAGGCCTTGTGTTTGCCCTTCCAGAGAAAGATGAGAATCCTTCAGTTACGGTCACAGATGAAGCAGAGAAGAATTTGG
AGCAGAAGCAAACAGCTTCTGATAATGCATCAGATGATAACACGGCAGTTGTTCCGACGACAAAAGGTGGAATGAGGAGGAAGCATCATCGTGCTTGGACTCTTGTTGAA
GTCATCAAATTAGTAGAAGGCGTGTCGAAATGTGGAGCTGGGAGGTGGTCTGAGATTAAAAAACTATCTTTCTCATCATACTCATACCGTACGTCAGTTGATCTCAAGGA
TAAATGGAGAAACCTGCTGAAAGCTAGCTTAGTGCAGACACCTGTTGATGAAGGGATAAGTTCTCGGAAACATGCGTCGATATCAATTCCTGCGCAGATCTTGTTACGGG
TAAGGGAGCTTGCTGAGATGCACGCCCAAATTCCTCCCTCAAATCATGGGCAAGGCAAGCTGGGTGGTGGAGTTAGTGGTGGGAGTATGCATGAGATGAGTTCTTCGACG
GTATGCTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTACTCAGATGAGTAAAAAGCGAAAGTTTGTGGCTGATGGAGTGTTCTTTGCTGAGCTTAATGAAGTTCTTACCAGAGAGCTTGCAGAGGATGGATACTCCGGAGT
AGAGGTTAGGGTTACACCTATGCGTACTGAGATTATCATCAGGGCAACTCGCACTCAGAATGTTCTTGGTGAGAAAGGAAGGAGAATCAGAGAATTGACATCTGTTGTTC
AGAAGCGTTTCAAGTTCCCTGAAAACAGTGTTGAGCTATATGCCGAGAAGGTCAACAACAGAGGACTCTGTGCCATTGCTCAAGCTGAGTCTCTGCGCTACAAGCTTCTA
GGAGGCCTTGCTGTGAGAAGGGCTTGCTATGGTGTTCTTAGATTTGTGATGGAGAGTGGAGCTAAAGGATGTGAGGTTATTGTTAGTGGTAAGCTGAGGGCTCAGCGTGC
AAAATCCATGAAATTCAAGGATGGTTACATGATTTCGTCTGGACAGCCGGTGAAAGAGTACATAGACTCTGCCGTGAGACACGTTCTCCTTAGACAGGGTGTTTTGGGTA
TCAAGGTGAAGATCATGCTTGATTGGGATCCTAAGGGTAAGCAAGGTCCACCAACGCCACTTCCGGATTTGGTTACCATCCATACTCCCAAAGAGGAAGAGGATTTCATT
AGGCTGATTTTCAAATGCATGGAAAATCTATTCTGGCCTCTCACATCTTGGGCCGTAGCCCATATTGAGGAGCCCAAAATTGAGTTTACAATATATTGGCAGACGAAGAG
TGACGAGACGATAAGCACTTTATGTATTTTAGTTGCATGTGGTGGATTTTCCATGGAAACAGTGGTTGGATTTGTGGAGAATGAAAGGAAAATTGTTGAAAGCGGGGCTG
CTCAGGATGGCTCCTCTCTGTCCCCAAATCAGATTGCTGACCCAGTTGTGTACAAACTTGTCAGGGTTGATGGTGATGGCAGATTTGTTCCTGCCACAGATGATGAGGTA
ATGGAGGTTGAGGATTTACTTGAAGACGACAAGAATGAAAAAGTGGAAGACGCAGGACAAATTGTAGGATGCAAACCCACAGAAGGCACTTTATTTGGGAAGCCTCGTGT
AGAAGTCTTGAATGATACGCCAGGTCTACCGGAATCTGATTCCTTTGAAGCTGCTGCAGATTATAATGCCCGGTTGGAGTACATTGAGGAGGTCTTACAAAAGGTGAAGC
AGGAAGAGAGGCTTCGCTTAACATGTGGATCACCGAACTATGCTTCTGCTTACGTGAGTGGAGACAGGAAGGGTTCTGATGAGCATGGTAGCTTGCCTGTTATTGATGAA
AAGCTCCAATCCAATATTTCTCTGCAGGAAATAACTCACTTAATTTCTCCAAGTTTAAAGGAGAATCACGTGAATGAAAATGGGAGTCTAGGCGATTGTTTAAAGCATCC
AGATAAATCAGTCGAGTCTGAATCCTCAGACGCCCTCTGCACTACGTCTAACCCTGATTTTTCCTTGTTGAAGGGCGACGTTTGCCTAGATAATCTTTCAATTAGAGAAC
TCCGTGAATGTTTCAAAGCAACTTTTGGGAGAGACACTACAGTTAAAGACAAGTCGTGGCTTAAGAGGAGAATTGCCATGGGATTGACCAACTCATGCGACATTCCAGCC
TCATCTTTTATAATTAAGGAAGGCAAGTTTCTTGAAGAAAGTACTTCAAACGTGGAGCGCATGTCCACTAGTCCAACTGCCGAAACTTTGAATATTGAATGCCGAGTTTC
GCCAACCACTTATAGTTTGGAAAACAAGGACCTTCATCACTTTGAGGATATGGAACTTGATCATGGAAGTGAAGGTCAACACGATGAGAGAGCTGCTGTTAAAAGAGTTC
GGAAGCCTACCAGGCGGTATATTGAAGAACTTTCTGAAGTGGAGTCAAGAGAGTATGTGCAAAAGGTGGTTAATTTGAATAAAAATACTATATCAGATGGTGTATCTGCG
AATTCCATTGCAAGACCTATTAAAAAGGTCTATTCAGATGGAGGAAGAACTGTAATCACAAGATTGGATTCTCTTGGTGGATCTGGCTTTCAAGTCCCATGTGTTTCAAG
AGTTAGAAGGAGCCGTCCGAGGAAAGACGTTGTAGGCCTTGTGTTTGCCCTTCCAGAGAAAGATGAGAATCCTTCAGTTACGGTCACAGATGAAGCAGAGAAGAATTTGG
AGCAGAAGCAAACAGCTTCTGATAATGCATCAGATGATAACACGGCAGTTGTTCCGACGACAAAAGGTGGAATGAGGAGGAAGCATCATCGTGCTTGGACTCTTGTTGAA
GTCATCAAATTAGTAGAAGGCGTGTCGAAATGTGGAGCTGGGAGGTGGTCTGAGATTAAAAAACTATCTTTCTCATCATACTCATACCGTACGTCAGTTGATCTCAAGGA
TAAATGGAGAAACCTGCTGAAAGCTAGCTTAGTGCAGACACCTGTTGATGAAGGGATAAGTTCTCGGAAACATGCGTCGATATCAATTCCTGCGCAGATCTTGTTACGGG
TAAGGGAGCTTGCTGAGATGCACGCCCAAATTCCTCCCTCAAATCATGGGCAAGGCAAGCTGGGTGGTGGAGTTAGTGGTGGGAGTATGCATGAGATGAGTTCTTCGACG
GTATGCTCGTGA
Protein sequenceShow/hide protein sequence
MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL
GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPPTPLPDLVTIHTPKEEEDFI
RLIFKCMENLFWPLTSWAVAHIEEPKIEFTIYWQTKSDETISTLCILVACGGFSMETVVGFVENERKIVESGAAQDGSSLSPNQIADPVVYKLVRVDGDGRFVPATDDEV
MEVEDLLEDDKNEKVEDAGQIVGCKPTEGTLFGKPRVEVLNDTPGLPESDSFEAAADYNARLEYIEEVLQKVKQEERLRLTCGSPNYASAYVSGDRKGSDEHGSLPVIDE
KLQSNISLQEITHLISPSLKENHVNENGSLGDCLKHPDKSVESESSDALCTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPA
SSFIIKEGKFLEESTSNVERMSTSPTAETLNIECRVSPTTYSLENKDLHHFEDMELDHGSEGQHDERAAVKRVRKPTRRYIEELSEVESREYVQKVVNLNKNTISDGVSA
NSIARPIKKVYSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDVVGLVFALPEKDENPSVTVTDEAEKNLEQKQTASDNASDDNTAVVPTTKGGMRRKHHRAWTLVE
VIKLVEGVSKCGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASLVQTPVDEGISSRKHASISIPAQILLRVRELAEMHAQIPPSNHGQGKLGGGVSGGSMHEMSSST
VCS