; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0000500 (gene) of Chayote v1 genome

Gene IDSed0000500
OrganismSechium edule (Chayote v1)
Description30S ribosomal protein S1
Genome locationLG10:33773193..33779908
RNA-Seq ExpressionSed0000500
SyntenySed0000500
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005840 - ribosome (cellular component)
GO:0016020 - membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0022627 - cytosolic small ribosomal subunit (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003729 - mRNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008438974.1 PREDICTED: 30S ribosomal protein S1, chloroplastic [Cucumis melo]6.8e-14368.41Show/hide
Query:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ
        MASMA Q T L+     P+   R SKP   +   + +   P+ AAVI  P P+ Q ++RF LK  FE+A +RCRN P+EG+SFTL DF +ALEKYDFDS+
Subjt:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ

Query:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA
        LG KVKGTV   + NGALV+I  KS AYLPLQEA IHR+K VEEAGI+PG REEFVIIGENE DDSLILSLR IQ DLAWERCRQL AEDVVVKGKVVDA
Subjt:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH
        NKGGV+ VVEGL+GFVPFS+IS  +TAEE+LNKELP+KF+ V+EE +R+VLSNRKAMAD+QAQL IGSVVTGTV+ L+ +GAF+D+GGI+GLLH+S+ SH
Subjt:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH

Query:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQ
        DRI++I  VL+PGD LKVMIL HDRE+G + LSTKKLEP PGDMIRNPKLV+EKAEEMAQ F++RI QA   ARAD+L FQ + GLT  +DGILGP TP+
Subjt:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQ

Query:  LP
        LP
Subjt:  LP

XP_016902972.1 PREDICTED: 30S ribosomal protein S1, chloroplastic-like [Cucumis melo]6.2e-14474.52Show/hide
Query:  MALQSTVLKRTAQFPIPYRRSKPLGLRRRQSMAS-PFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQLGA
        MA   T L+      I +  SKPLG  R+Q+ A+  F ++AAVI SP P+    +RF LK TF DAADRC N PMEGVSFTL  FL++LEKYDFD QLG+
Subjt:  MALQSTVLKRTAQFPIPYRRSKPLGLRRRQSMAS-PFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQLGA

Query:  KVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGG
        KVKGTVV +EANGALVEIA KSPAYLPLQEASIHR+KRVEEAGIYPGFREEFVIIG+NEDD L LSLRPIQ +LAWERCRQL A DVVVKGKVV ANKGG
Subjt:  KVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGG

Query:  VLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSHDRIA
        VLVVVEGLKGFVPFSEI M++T EE++NKELP+K L+V EE TR+VLSNRK MAD++AQLEIGSVVTGTV RL KFGAFVD+GG+HGLLHISE SHDRI 
Subjt:  VLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSHDRIA

Query:  NIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQA
        +I  VLKPGDILKVM+L  DREKGHIRLSTKKLEPN GDMIRNP LV+ KAEEMA+RF++R+ QA
Subjt:  NIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQA

XP_022138241.1 30S ribosomal protein S1, chloroplastic [Momordica charantia]2.8e-14468.91Show/hide
Query:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ
        MASMA Q T L+     P+   R S P   R  Q+ A   P+ AAVI SP P+ Q ++RF LK  FEDA +RCRN P+EG+SFTL DF +ALEKYDFDS+
Subjt:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ

Query:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA
        +G KVKGTV   +ANGALV+I  KS AYLP+QEA IHR+K VEEAGI+PG REEFVIIGENE DDSL+LSLR IQ DLAWERCRQL AEDVVVKGKVVDA
Subjt:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH
        NKGGV+ VVEGL+GFVPFS+IS  +TAEE+LNKELP+KF+ V+EE +R+VLSNRKAMAD+QAQL IGSVVTGTV+ L+ +GAF+D+GGI+GLLH+S+ SH
Subjt:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH

Query:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQ
        DRI++I  VL+PGD LKVMIL HDRE+G + LSTKKLEP PGDMIRNPKLV+EKAEEMAQ F++RI QA   ARAD+L FQ + GLT  +DGILGP TP+
Subjt:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQ

Query:  LP
        LP
Subjt:  LP

XP_038878013.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]6.2e-14468.91Show/hide
Query:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ
        MASMA Q T L+     P+   R SKP   R  QS A   P+ AAVI  P P+ Q ++RF LK  FE+A +RCRN P+EG++FTL DF +ALEKYDFDS+
Subjt:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ

Query:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA
        LG KVKGTV   + NGALV+I  KS AYLP+QEA IHR+K VEEAGI+PG REEFVIIGENE DDSLILSLR IQ DLAWERCRQL AEDVVVKGKVVDA
Subjt:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH
        NKGGV+ VVEGL+GFVPFS+IS  +TAEE+LNKELP+KF+ V+EE +R+VLSNRKAMAD+QAQL IGSVVTGTV+ L+ +GAF+D+GGI+GLLH+S+ SH
Subjt:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH

Query:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQ
        DRI++I  VL+PGD LKVMIL HDRE+G + LSTKKLEP PGDMIRNPKLV+EKAEEMAQ F++RI QA   ARAD+L FQ + GLT  +DGILGP TP+
Subjt:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQ

Query:  LP
        LP
Subjt:  LP

XP_038885297.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]1.2e-15274.56Show/hide
Query:  MALQSTVLKRTAQFPIPYRRSKPLGLRRRQSM-ASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQLGA
        MA Q T L+   +F I    SKPL     Q+M    FP+VAAVI  P PT Q  +RF LK TF DAADRCRN PMEGVSFTL DFL++LEKY FD QLGA
Subjt:  MALQSTVLKRTAQFPIPYRRSKPLGLRRRQSM-ASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQLGA

Query:  KVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGG
        KVKGTVV  EANGALVEIA KSPAYLPL EA IHR+KRVEEAGIYPGFREEFVIIGENEDDSL LSLR IQ +LAWERCRQL AEDV+VKGKVV AN GG
Subjt:  KVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGG

Query:  VLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSHDRIA
        VLVVVEGLKGFVP+SEI M++TAEE++NKELP+KFL+VNEE TRIVLSNRK MAD++AQL IG+VVTGTV RL KFGAFVD+GG+HGLLHISE SHDRI 
Subjt:  VLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSHDRIA

Query:  NIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQL
        +I AVLKPGDILKVMIL  + EKGHIRLSTKKLEPN GDMI NP LV+EKAEEMA RF++R+ QA   ARADLLSFQ +G L   SDGIL P TP+L
Subjt:  NIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQL

TrEMBL top hitse value%identityAlignment
A0A1S3AXL6 30S ribosomal protein S1, chloroplastic3.3e-14368.41Show/hide
Query:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ
        MASMA Q T L+     P+   R SKP   +   + +   P+ AAVI  P P+ Q ++RF LK  FE+A +RCRN P+EG+SFTL DF +ALEKYDFDS+
Subjt:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ

Query:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA
        LG KVKGTV   + NGALV+I  KS AYLPLQEA IHR+K VEEAGI+PG REEFVIIGENE DDSLILSLR IQ DLAWERCRQL AEDVVVKGKVVDA
Subjt:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH
        NKGGV+ VVEGL+GFVPFS+IS  +TAEE+LNKELP+KF+ V+EE +R+VLSNRKAMAD+QAQL IGSVVTGTV+ L+ +GAF+D+GGI+GLLH+S+ SH
Subjt:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH

Query:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQ
        DRI++I  VL+PGD LKVMIL HDRE+G + LSTKKLEP PGDMIRNPKLV+EKAEEMAQ F++RI QA   ARAD+L FQ + GLT  +DGILGP TP+
Subjt:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQ

Query:  LP
        LP
Subjt:  LP

A0A1S4E424 30S ribosomal protein S1, chloroplastic-like3.0e-14474.52Show/hide
Query:  MALQSTVLKRTAQFPIPYRRSKPLGLRRRQSMAS-PFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQLGA
        MA   T L+      I +  SKPLG  R+Q+ A+  F ++AAVI SP P+    +RF LK TF DAADRC N PMEGVSFTL  FL++LEKYDFD QLG+
Subjt:  MALQSTVLKRTAQFPIPYRRSKPLGLRRRQSMAS-PFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQLGA

Query:  KVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGG
        KVKGTVV +EANGALVEIA KSPAYLPLQEASIHR+KRVEEAGIYPGFREEFVIIG+NEDD L LSLRPIQ +LAWERCRQL A DVVVKGKVV ANKGG
Subjt:  KVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGG

Query:  VLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSHDRIA
        VLVVVEGLKGFVPFSEI M++T EE++NKELP+K L+V EE TR+VLSNRK MAD++AQLEIGSVVTGTV RL KFGAFVD+GG+HGLLHISE SHDRI 
Subjt:  VLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSHDRIA

Query:  NIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQA
        +I  VLKPGDILKVM+L  DREKGHIRLSTKKLEPN GDMIRNP LV+ KAEEMA+RF++R+ QA
Subjt:  NIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQA

A0A5A7SUN2 30S ribosomal protein S13.0e-14474.52Show/hide
Query:  MALQSTVLKRTAQFPIPYRRSKPLGLRRRQSMAS-PFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQLGA
        MA   T L+      I +  SKPLG  R+Q+ A+  F ++AAVI SP P+    +RF LK TF DAADRC N PMEGVSFTL  FL++LEKYDFD QLG+
Subjt:  MALQSTVLKRTAQFPIPYRRSKPLGLRRRQSMAS-PFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQLGA

Query:  KVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGG
        KVKGTVV +EANGALVEIA KSPAYLPLQEASIHR+KRVEEAGIYPGFREEFVIIG+NEDD L LSLRPIQ +LAWERCRQL A DVVVKGKVV ANKGG
Subjt:  KVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGG

Query:  VLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSHDRIA
        VLVVVEGLKGFVPFSEI M++T EE++NKELP+K L+V EE TR+VLSNRK MAD++AQLEIGSVVTGTV RL KFGAFVD+GG+HGLLHISE SHDRI 
Subjt:  VLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSHDRIA

Query:  NIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQA
        +I  VLKPGDILKVM+L  DREKGHIRLSTKKLEPN GDMIRNP LV+ KAEEMA+RF++R+ QA
Subjt:  NIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQA

A0A6J1C966 30S ribosomal protein S1, chloroplastic1.3e-14468.91Show/hide
Query:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ
        MASMA Q T L+     P+   R S P   R  Q+ A   P+ AAVI SP P+ Q ++RF LK  FEDA +RCRN P+EG+SFTL DF +ALEKYDFDS+
Subjt:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ

Query:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA
        +G KVKGTV   +ANGALV+I  KS AYLP+QEA IHR+K VEEAGI+PG REEFVIIGENE DDSL+LSLR IQ DLAWERCRQL AEDVVVKGKVVDA
Subjt:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH
        NKGGV+ VVEGL+GFVPFS+IS  +TAEE+LNKELP+KF+ V+EE +R+VLSNRKAMAD+QAQL IGSVVTGTV+ L+ +GAF+D+GGI+GLLH+S+ SH
Subjt:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH

Query:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQ
        DRI++I  VL+PGD LKVMIL HDRE+G + LSTKKLEP PGDMIRNPKLV+EKAEEMAQ F++RI QA   ARAD+L FQ + GLT  +DGILGP TP+
Subjt:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQ

Query:  LP
        LP
Subjt:  LP

A0A6J1F6H4 30S ribosomal protein S1, chloroplastic-like3.1e-14166.92Show/hide
Query:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ
        MASMA Q   L+     P+   R SKP   R  Q+ A   P+ AAVI SP P+ Q ++RF LK  FE+A +RCRN P+EG+SFT+ DF SA+EKYDF+S+
Subjt:  MASMALQSTVLKRTAQFPIPYRR-SKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQ

Query:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA
        +G KVKGTV   +ANGALV+I  KS AYLPLQEA IHR+K VEEAGIYPG R+EFVIIGENE DDSL+LSLR IQ DLAWERCRQL AEDVVVKGKVVDA
Subjt:  LGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH
        NKGGV+ VVEGL+GFVPFS+IS  +TAEE+L KE+P+KF+ V+EE +R+VLSNRKA+AD+QAQL IGSVV GTV+ L+ +GAF+D+GG++GLLH+S+ SH
Subjt:  NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSH

Query:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRI--TQAGARADLLSFQSQGGLTF-SDGILGPCTPQ
        DRI++I  VL+PGD LKVMIL HDRE+G + LSTKKLEP PGDMIRNPKLV+EKAEEMAQ F++RI   +A ARAD+L FQ + GLT  +DGILGP TP+
Subjt:  DRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRI--TQAGARADLLSFQSQGGLTF-SDGILGPCTPQ

Query:  LP
        LP
Subjt:  LP

SwissProt top hitse value%identityAlignment
O33698 30S ribosomal protein S15.4e-4236.3Show/hide
Query:  DFLSALEKYDFDSQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVII-GENEDDSLILSLRPIQSDLAWERCRQL
        DF  ALE    DSQ G  V+G V +   +GA ++I  K+PA+LP +EA++H +  + EA +      EF++I  +NED  + +SLR +  + AW R  +L
Subjt:  DFLSALEKYDFDSQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVII-GENEDDSLILSLRPIQSDLAWERCRQL

Query:  LAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQA-QLEIGSVVTGTVRRLEKFGAFVD
              V+ KV  +NKGGV   +EGL+ F+P S ++     + +  K L V FL VN    ++VLS R+A       ++E+G ++ G V  L+ FG FVD
Subjt:  LAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQA-QLEIGSVVTGTVRRLEKFGAFVD

Query:  LGGIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRI
        LGG   LL I++ S   +A++ A+ K GD ++ +++  D  KG I LSTK LE +PG+++ N   +   A + A+R ++++
Subjt:  LGGIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRI

P29344 30S ribosomal protein S1, chloroplastic3.5e-13468.78Show/hide
Query:  PIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLK
        PIV+AV  S    +Q R+R  LK  FEDA +RCRN PMEGVSFT+ DF +AL+KYDF+S++G++VKGTV   +ANGALV+I  KS AYLPL EA I+R+K
Subjt:  PIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLK

Query:  RVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFL
         VEEAGI PG REEFVIIGENE DDSLILSLR IQ +LAWERCRQL AEDVVVKGK+V ANKGGV+ +VEGL+GFVPFS+IS  ++AEE+L KE+P+KF+
Subjt:  RVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFL

Query:  IVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPN
         V+EE +R+V+SNRKAMAD+QAQL IGSVVTGTV+ L+ +GAF+D+GGI+GLLH+S+ SHDR+++I  VL+PGD LKVMIL HDRE+G + LSTKKLEP 
Subjt:  IVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPN

Query:  PGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQLP
        PGDMIRNPKLV+EKAEEMAQ F++RI QA   ARAD+L FQ + GLT  SDGILGP T  LP
Subjt:  PGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCTPQLP

P46228 30S ribosomal protein S15.0e-7247.95Show/hide
Query:  PMEGVSFTLHDFLSALEKYDFDSQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSD
        P   + FT  DF + L++YD+    G  V GTV  +E  GAL++I  K+ A+LP+QE SI+R++  EE       RE F++  ENED  L LS+R I+  
Subjt:  PMEGVSFTLHDFLSALEKYDFDSQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSD

Query:  LAWERCRQLLAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQA-QLEIGSVVTGTVRR
         AWER RQL  ED  V+ +V   N+GG LV +EGL+GF+P S IS     E+++ +ELP+KFL V+E+  R+VLS+R+A+ + +  +LE+G VV G VR 
Subjt:  LAWERCRQLLAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQA-QLEIGSVVTGTVRR

Query:  LEKFGAFVDLGGIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQ
        ++ +GAF+D+GG+ GLLHISE SHD I    +V    D +KVMI+  D E+G I LSTK+LEP PGDM+RNP++VYEKAEEMA +++ ++ Q
Subjt:  LEKFGAFVDLGGIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQ

P73530 30S ribosomal protein S1 homolog A4.8e-6747.37Show/hide
Query:  VSFTLHDFLSALEKYDFDSQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSDLAWE
        + FTL DF + L+KYD+    G  V GTV  +E+ GAL++I  K+ AY+P+QE SI+R+   EE       RE F++  ENED  L LS+R I+   AWE
Subjt:  VSFTLHDFLSALEKYDFDSQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSDLAWE

Query:  RCRQLLAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQ-LEIGSVVTGTVRRLEKF
        R RQL AED  V+  V   N+GG LV +EGL+GF+P S IS     E+++ ++LP+KFL V+EE  R+VLS+R+A+ + +   LE+  VV G+VR ++ +
Subjt:  RCRQLLAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQ-LEIGSVVTGTVRRLEKF

Query:  GAFVDLGGIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRR
        GAF+D+GG+ GLLHISE SHD I    +V    D +KVMI+  D E+G I LSTK+LEP PG M+++  LV E A+EMA+ F+++
Subjt:  GAFVDLGGIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRR

Q93VC7 30S ribosomal protein S1, chloroplastic1.9e-12760.64Show/hide
Query:  MASMALQSTVLK---RTAQFPIPYRRSKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFD
        MAS+A Q + L+    ++   +  R SK    + + +  SP  IVAAV  S   + Q ++R  LK  FEDA +RCR  PMEGV+FT+ DF +A+E+YDF+
Subjt:  MASMALQSTVLK---RTAQFPIPYRRSKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFD

Query:  SQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVV
        S++G +VKGTV + +ANGALV+I+ KS AYL +++A IHR+K VEEAGI PG  EEFVIIGENE DDSL+LSLR IQ +LAWERCRQL AEDV+VK KV+
Subjt:  SQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVV

Query:  DANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISET
         ANKGG++ +VEGL+GFVPFS+IS    AEE+L KE+P+KF+ V+EE T++VLSNRKA+AD+QAQL IGSVV G V+ L+ +GAF+D+GGI+GLLH+S+ 
Subjt:  DANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISET

Query:  SHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCT
        SHDR+++I  VL+PGD LKVMIL HDR++G + LSTKKLEP PGDMIRNPKLV+EKAEEMAQ F++RI QA   ARAD+L FQ + GLT  SDGILGP  
Subjt:  SHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCT

Query:  PQLP
         +LP
Subjt:  PQLP

Arabidopsis top hitse value%identityAlignment
AT1G71720.1 Nucleic acid-binding proteins superfamily1.3e-1930.23Show/hide
Query:  IIGENEDDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSE----ISMVTTAEEILNKELPVKFLIVNEELTRIVLSN
        ++G       +LS R     +AW R RQ+   +  ++ K+ + N GG+L  +EGL+ F+P  E    ++  T  +E + +   V+   +NE+   ++LS 
Subjt:  IIGENEDDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSE----ISMVTTAEEILNKELPVKFLIVNEELTRIVLSN

Query:  RKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLG--GIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLV
        +  +A  +  L  G+++ GTV ++  +GA V LG     GLLHIS  +  RI ++  VL+  + +KV+++K       I LS   LE  PG  I + + V
Subjt:  RKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLG--GIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLV

Query:  YEKAEEMAQRFKRRI
        + +AEEMA++++ ++
Subjt:  YEKAEEMAQRFKRRI

AT3G11964.1 RNA binding;RNA binding2.3e-1128.24Show/hide
Query:  LAEDVVVKGKVVDA-NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNR--------------KAMADNQAQLEIGSVVT
        L+ D+ V+G V +  +KG  +++   ++  V  S +      E    KE PV  L+    L    LS R              K+ + +  +L +G +++
Subjt:  LAEDVVVKGKVVDA-NKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNR--------------KAMADNQAQLEIGSVVT

Query:  GTVRRLEKFGAFVDLG--GIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQA
        G +RR+E FG F+D+   G+ GL HIS+ S DR+ N+ A  K G+ ++  ILK D EK  I L  K      GD  +   L  +             ++ 
Subjt:  GTVRRLEKFGAFVDLG--GIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQA

Query:  GARADLLSFQSQGGLT
         A  D   FQ   G T
Subjt:  GARADLLSFQSQGGLT

AT3G23700.1 Nucleic acid-binding proteins superfamily7.3e-1831.11Show/hide
Query:  WERCRQLLAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEISMVTTAEE-----------ILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQ-LEIG
        W+  +         +G+V   N GG+L+    L GF+P+ ++S   + +E           ++  +LPVK +  +EE  +++LS + A+    +Q + +G
Subjt:  WERCRQLLAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEISMVTTAEE-----------ILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQ-LEIG

Query:  SVVTGTVRRLEKFGAFVDL----GGIH--GLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNP
         V  G V  +E +GAF+ L    G  H  GL+H+SE S D + ++  VL+ GD ++V++   D+EK  I LS K+LE +P
Subjt:  SVVTGTVRRLEKFGAFVDL----GGIH--GLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNP

AT5G14580.1 polyribonucleotide nucleotidyltransferase, putative3.3e-1036.75Show/hide
Query:  ELPVKFLIVNEELTRIVLSNRKAMADNQAQLE--------IGSVVTGTVRRLEKFGAFVDL-GGIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHD
        E   +  I N  LT IV  N+  M   Q Q++        +G V  GTV  ++++GAFV+  GG  GLLH+SE SH+ ++ +  VL  G  +  M ++ D
Subjt:  ELPVKFLIVNEELTRIVLSNRKAMADNQAQLE--------IGSVVTGTVRRLEKFGAFVDL-GGIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHD

Query:  REKGHIRLSTKKLEPNP
          +G+I+LS K L P P
Subjt:  REKGHIRLSTKKLEPNP

AT5G30510.1 ribosomal protein S11.3e-12860.64Show/hide
Query:  MASMALQSTVLK---RTAQFPIPYRRSKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFD
        MAS+A Q + L+    ++   +  R SK    + + +  SP  IVAAV  S   + Q ++R  LK  FEDA +RCR  PMEGV+FT+ DF +A+E+YDF+
Subjt:  MASMALQSTVLK---RTAQFPIPYRRSKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFD

Query:  SQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVV
        S++G +VKGTV + +ANGALV+I+ KS AYL +++A IHR+K VEEAGI PG  EEFVIIGENE DDSL+LSLR IQ +LAWERCRQL AEDV+VK KV+
Subjt:  SQLGAKVKGTVVQVEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENE-DDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVV

Query:  DANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISET
         ANKGG++ +VEGL+GFVPFS+IS    AEE+L KE+P+KF+ V+EE T++VLSNRKA+AD+QAQL IGSVV G V+ L+ +GAF+D+GGI+GLLH+S+ 
Subjt:  DANKGGVLVVVEGLKGFVPFSEISMVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISET

Query:  SHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCT
        SHDR+++I  VL+PGD LKVMIL HDR++G + LSTKKLEP PGDMIRNPKLV+EKAEEMAQ F++RI QA   ARAD+L FQ + GLT  SDGILGP  
Subjt:  SHDRIANIFAVLKPGDILKVMILKHDREKGHIRLSTKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAG--ARADLLSFQSQGGLTF-SDGILGPCT

Query:  PQLP
         +LP
Subjt:  PQLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAATGGCTCTGCAAAGCACAGTGTTGAAACGCACGGCTCAGTTTCCGATTCCGTATCGTCGCTCGAAGCCATTAGGTCTCCGGCGTAGGCAGAGCATGGCGAG
TCCGTTCCCCATTGTCGCTGCAGTTATTTGTAGCCCTAATCCCACCTCTCAGGCCAGAGATCGGTTCAACCTCAAAGCCACCTTCGAGGACGCCGCCGATCGCTGCCGTA
ATGATCCCATGGAAGGCGTCTCCTTCACTCTCCATGATTTCCTTTCCGCTCTTGAGAAATACGACTTCGATTCTCAATTGGGAGCTAAGGTGAAAGGAACTGTGGTCCAA
GTAGAAGCTAATGGAGCGCTGGTTGAGATTGCTACCAAGTCGCCTGCATACTTGCCATTGCAGGAGGCTTCCATTCATAGACTTAAACGTGTAGAAGAAGCAGGAATATA
TCCTGGTTTTAGAGAGGAGTTTGTTATTATAGGTGAGAATGAAGATGATAGTTTGATTTTGAGCTTGAGGCCCATTCAATCTGACCTGGCTTGGGAGAGGTGTAGACAAC
TTCTAGCAGAGGATGTCGTTGTCAAGGGTAAGGTTGTTGATGCAAACAAAGGCGGAGTTTTGGTAGTCGTGGAAGGTCTAAAGGGGTTTGTGCCTTTCTCAGAGATATCA
ATGGTAACAACTGCTGAAGAGATTCTCAACAAGGAACTTCCTGTGAAATTTCTAATTGTAAATGAGGAACTTACGAGGATTGTTCTCAGTAACCGCAAGGCCATGGCTGA
CAACCAGGCACAGCTTGAAATTGGATCAGTGGTCACTGGAACAGTTCGAAGACTTGAAAAGTTTGGTGCCTTTGTTGACCTTGGTGGAATCCATGGTCTTCTCCACATCA
GTGAGACAAGTCATGATCGTATAGCAAACATTTTTGCAGTTCTAAAGCCTGGAGACATTCTCAAGGTCATGATTTTGAAACATGATCGTGAGAAAGGCCATATTCGTCTT
TCTACCAAGAAGTTAGAGCCCAATCCTGGGGACATGATTCGCAATCCAAAGCTTGTTTATGAGAAGGCTGAGGAAATGGCTCAAAGATTTAAGCGAAGAATAACTCAAGC
AGGGGCACGTGCAGACTTGCTTAGTTTTCAGTCTCAGGGTGGATTAACATTTAGTGATGGAATATTGGGACCATGTACCCCACAGTTGCCTTGA
mRNA sequenceShow/hide mRNA sequence
AATAAATGCAACAATAGATCGCAGGTCTTAGTCTGGGAAACGCCGCTGAAGGAAGGCGAGGATGGCTTCAATGGCTCTGCAAAGCACAGTGTTGAAACGCACGGCTCAGT
TTCCGATTCCGTATCGTCGCTCGAAGCCATTAGGTCTCCGGCGTAGGCAGAGCATGGCGAGTCCGTTCCCCATTGTCGCTGCAGTTATTTGTAGCCCTAATCCCACCTCT
CAGGCCAGAGATCGGTTCAACCTCAAAGCCACCTTCGAGGACGCCGCCGATCGCTGCCGTAATGATCCCATGGAAGGCGTCTCCTTCACTCTCCATGATTTCCTTTCCGC
TCTTGAGAAATACGACTTCGATTCTCAATTGGGAGCTAAGGTGAAAGGAACTGTGGTCCAAGTAGAAGCTAATGGAGCGCTGGTTGAGATTGCTACCAAGTCGCCTGCAT
ACTTGCCATTGCAGGAGGCTTCCATTCATAGACTTAAACGTGTAGAAGAAGCAGGAATATATCCTGGTTTTAGAGAGGAGTTTGTTATTATAGGTGAGAATGAAGATGAT
AGTTTGATTTTGAGCTTGAGGCCCATTCAATCTGACCTGGCTTGGGAGAGGTGTAGACAACTTCTAGCAGAGGATGTCGTTGTCAAGGGTAAGGTTGTTGATGCAAACAA
AGGCGGAGTTTTGGTAGTCGTGGAAGGTCTAAAGGGGTTTGTGCCTTTCTCAGAGATATCAATGGTAACAACTGCTGAAGAGATTCTCAACAAGGAACTTCCTGTGAAAT
TTCTAATTGTAAATGAGGAACTTACGAGGATTGTTCTCAGTAACCGCAAGGCCATGGCTGACAACCAGGCACAGCTTGAAATTGGATCAGTGGTCACTGGAACAGTTCGA
AGACTTGAAAAGTTTGGTGCCTTTGTTGACCTTGGTGGAATCCATGGTCTTCTCCACATCAGTGAGACAAGTCATGATCGTATAGCAAACATTTTTGCAGTTCTAAAGCC
TGGAGACATTCTCAAGGTCATGATTTTGAAACATGATCGTGAGAAAGGCCATATTCGTCTTTCTACCAAGAAGTTAGAGCCCAATCCTGGGGACATGATTCGCAATCCAA
AGCTTGTTTATGAGAAGGCTGAGGAAATGGCTCAAAGATTTAAGCGAAGAATAACTCAAGCAGGGGCACGTGCAGACTTGCTTAGTTTTCAGTCTCAGGGTGGATTAACA
TTTAGTGATGGAATATTGGGACCATGTACCCCACAGTTGCCTTGAGGAGAGGAGATAGTTCAGCTAAAGAATGAAGACTCAGCTGGCTTTTTGCACGTTAATATACACAT
ATTTGATGCATTATCCCAATACTTAACTAGAAAATAACAACAAAGATTAGGGTTTTTTTTCTTTTTTTTTGTTAAGTTCTGAAGTAGGGAGGCATTTTGAGAATTGAGGA
CTAAAATTATTTTCTTTAATTGATTTTTTTTTAATAAAGCATCCGGAAGTAAAATACAATGTTATCCAAGGTTAGAAACTAAAATAGAATAATGTTAAAAGTCTTAAGAT
TGTATCATCTTTTTTCCCTTTTTTTGAC
Protein sequenceShow/hide protein sequence
MASMALQSTVLKRTAQFPIPYRRSKPLGLRRRQSMASPFPIVAAVICSPNPTSQARDRFNLKATFEDAADRCRNDPMEGVSFTLHDFLSALEKYDFDSQLGAKVKGTVVQ
VEANGALVEIATKSPAYLPLQEASIHRLKRVEEAGIYPGFREEFVIIGENEDDSLILSLRPIQSDLAWERCRQLLAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEIS
MVTTAEEILNKELPVKFLIVNEELTRIVLSNRKAMADNQAQLEIGSVVTGTVRRLEKFGAFVDLGGIHGLLHISETSHDRIANIFAVLKPGDILKVMILKHDREKGHIRL
STKKLEPNPGDMIRNPKLVYEKAEEMAQRFKRRITQAGARADLLSFQSQGGLTFSDGILGPCTPQLP