CuGenDBv2

Spg025079 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg025079
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: 30S ribosomal protein S1, chloroplastic
Genome location: scaffold12:12693788..12698840
RNA-Seq Expression: Spg025079
Synteny: Spg025079
Gene Ontology terms: GO:0005840 - ribosome (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1
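The genome location in this record has the form `scaffold:start..end`. As a minimal sketch, the string can be split into its parts as shown here. The function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which the page does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'scaffold:start..end' into (scaffold, start, end, span length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    scaffold = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return scaffold, start, end, end - start + 1

print(parse_location("scaffold12:12693788..12698840"))
# ('scaffold12', 12693788, 12698840, 5053)
```

Under that assumption the gene spans 5053 bp on scaffold12.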


Homology
GenBank top hits | e value | % identity | Alignment
XP_022138847.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Momordica charantia] | e value: 1.6e-173 | % identity: 84.28
Query:  MSSLAHQLCWLSSSPLSSTRLSV--SSWKRFSAKESRTSK--ALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANY
        M+SLAH++      PLSS RLSV  SSW+RFS KES   K  ALPVVS+ AS TPISNAQTKERL+LKQLFKEAYERCC+TPMDGVSFTLEDFHAAL+NY
Subjt:  MSSLAHQLCWLSSSPLSSTRLSV--SSWKRFSAKESRTSK--ALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANY

Query:  DFVSQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKG
        DFVS+LGTKVKGTV+ +DA+GALVDTTAKGTA+LP +EACIL+I HVEEAGIYPGLEEEFVIIAE E D +LILSLR +QY LAWERCRQLQAEDVVIKG
Subjt:  DFVSQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKG

Query:  KVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVVGANKGGV VLVEGLRGFVPFSQISAKSTAEELL+KELRLKFVEVDE+LSRLILSNCKAIANS+ E RIGSVVTGTVQILK YGAFIDIGG+NGLLH+
Subjt:  KVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE
        SQIS NHISD+ TVL+PGD LKVMILSYDHERGRVSLSTK LEP PGDMIHNPKLVFEKADEMAQ FRQRIAQAE MARVDLLR QPE
Subjt:  SQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE

XP_022958818.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Cucurbita moschata] | e value: 1.9e-169 | % identity: 82.52
Query:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS
        MSSLAHQLC L SSPLSST   +S+ KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHAALANYDFVS
Subjt:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS

Query:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV
        +LGTKVKGTV+ +DANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQEDD +LILSLRSVQY LAWERCRQLQAED VIKGKV
Subjt:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV

Query:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGV+VLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKL RLILSN KAI +S+ E RIGSVVTG VQILKPYGAF+DIGG+NGLLHVSQ
Subjt:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPEVLM
        ISQNHI DI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MAR +LL FQPE+L+
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPEVLM

XP_023006590.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Cucurbita maxima] | e value: 3.8e-170 | % identity: 82.52
Query:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS
        MSSLAHQLC L SSPLSST   +S+ KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHA+LANYDFVS
Subjt:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS

Query:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV
        ++GTKVKGTV+ +DANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQED+ +LILSLRSVQY LAWERCRQLQAED VIKGKV
Subjt:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV

Query:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGV+VLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKLSRL+LSN KAIA+S+ E RIGSVVTG VQILKPYGAFIDIGG+NGLLHVSQ
Subjt:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPEVLM
        ISQNHISDI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MAR +LL FQPE+L+
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPEVLM

XP_023006591.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Cucurbita maxima] | e value: 1.9e-169 | % identity: 82.9
Query:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS
        MSSLAHQLC L SSPLSST   +S+ KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHA+LANYDFVS
Subjt:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS

Query:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV
        ++GTKVKGTV+ +DANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQED+ +LILSLRSVQY LAWERCRQLQAED VIKGKV
Subjt:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV

Query:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGV+VLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKLSRL+LSN KAIA+S+ E RIGSVVTG VQILKPYGAFIDIGG+NGLLHVSQ
Subjt:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE
        ISQNHISDI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MAR +LL FQPE
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE

XP_023547911.1 30S ribosomal protein S1, chloroplastic-like [Cucurbita pepo subsp. pepo] | e value: 7.1e-169 | % identity: 83.16
Query:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS
        MSSLAHQLC L SSPLSST   +S+ KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFH ALANYDFVS
Subjt:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS

Query:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV
        +LGTKVKGTV+ +DANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQEDD +LILSLRSVQY LAWERCRQLQAED VIKG V
Subjt:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV

Query:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGV+VLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKLSRLILSN KAIA+S+ E +IGSVVTG VQILKPYGAFIDIGG+NGLLHVSQ
Subjt:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE
        ISQNHISDI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MAR +LL FQPE
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CCC9 30S ribosomal protein S1, chloroplastic-like isoform X1 | e value: 7.9e-174 | % identity: 84.28
Query:  MSSLAHQLCWLSSSPLSSTRLSV--SSWKRFSAKESRTSK--ALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANY
        M+SLAH++      PLSS RLSV  SSW+RFS KES   K  ALPVVS+ AS TPISNAQTKERL+LKQLFKEAYERCC+TPMDGVSFTLEDFHAAL+NY
Subjt:  MSSLAHQLCWLSSSPLSSTRLSV--SSWKRFSAKESRTSK--ALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANY

Query:  DFVSQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKG
        DFVS+LGTKVKGTV+ +DA+GALVDTTAKGTA+LP +EACIL+I HVEEAGIYPGLEEEFVIIAE E D +LILSLR +QY LAWERCRQLQAEDVVIKG
Subjt:  DFVSQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKG

Query:  KVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVVGANKGGV VLVEGLRGFVPFSQISAKSTAEELL+KELRLKFVEVDE+LSRLILSNCKAIANS+ E RIGSVVTGTVQILK YGAFIDIGG+NGLLH+
Subjt:  KVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE
        SQIS NHISD+ TVL+PGD LKVMILSYDHERGRVSLSTK LEP PGDMIHNPKLVFEKADEMAQ FRQRIAQAE MARVDLLR QPE
Subjt:  SQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE

A0A6J1H453 30S ribosomal protein S1, chloroplastic-like isoform X1 | e value: 9.0e-170 | % identity: 82.52
Query:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS
        MSSLAHQLC L SSPLSST   +S+ KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHAALANYDFVS
Subjt:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS

Query:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV
        +LGTKVKGTV+ +DANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQEDD +LILSLRSVQY LAWERCRQLQAED VIKGKV
Subjt:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV

Query:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGV+VLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKL RLILSN KAI +S+ E RIGSVVTG VQILKPYGAF+DIGG+NGLLHVSQ
Subjt:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPEVLM
        ISQNHI DI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MAR +LL FQPE+L+
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPEVLM

A0A6J1H681 30S ribosomal protein S1, chloroplastic-like isoform X2 | e value: 4.5e-169 | % identity: 82.9
Query:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS
        MSSLAHQLC L SSPLSST   +S+ KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHAALANYDFVS
Subjt:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS

Query:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV
        +LGTKVKGTV+ +DANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQEDD +LILSLRSVQY LAWERCRQLQAED VIKGKV
Subjt:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV

Query:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGV+VLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKL RLILSN KAI +S+ E RIGSVVTG VQILKPYGAF+DIGG+NGLLHVSQ
Subjt:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE
        ISQNHI DI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MAR +LL FQPE
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE

A0A6J1KY62 30S ribosomal protein S1, chloroplastic-like isoform X2 | e value: 9.0e-170 | % identity: 82.9
Query:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS
        MSSLAHQLC L SSPLSST   +S+ KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHA+LANYDFVS
Subjt:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS

Query:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV
        ++GTKVKGTV+ +DANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQED+ +LILSLRSVQY LAWERCRQLQAED VIKGKV
Subjt:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV

Query:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGV+VLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKLSRL+LSN KAIA+S+ E RIGSVVTG VQILKPYGAFIDIGG+NGLLHVSQ
Subjt:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE
        ISQNHISDI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MAR +LL FQPE
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE

A0A6J1L0J3 30S ribosomal protein S1, chloroplastic-like isoform X1 | e value: 1.8e-170 | % identity: 82.52
Query:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS
        MSSLAHQLC L SSPLSST   +S+ KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHA+LANYDFVS
Subjt:  MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVS

Query:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV
        ++GTKVKGTV+ +DANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQED+ +LILSLRSVQY LAWERCRQLQAED VIKGKV
Subjt:  QLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKV

Query:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGV+VLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKLSRL+LSN KAIA+S+ E RIGSVVTG VQILKPYGAFIDIGG+NGLLHVSQ
Subjt:  VGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPEVLM
        ISQNHISDI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MAR +LL FQPE+L+
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPEVLM

SwissProt top hits | e value | % identity | Alignment
O33698 30S ribosomal protein S1 | e value: 8.7e-45 | % identity: 35.82
Query:  EDFHAALANYDFVSQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQ
        +DF  AL      SQ G  V+G V     +GA +D   K  AFLP +EA +  +  + EA +    E EF++I +Q +D  + +SLR++    AW R  +
Subjt:  EDFHAALANYDFVSQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQ

Query:  LQAEDVVIKGKVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKV-EQRIGSVVTGTVQILKPYGAFI
        LQ     ++ KV G+NKGGV   +EGLR F+P S ++ K   + L  K L + F+EV+    +L+LS  +A   + V E  +G ++ G V  LKP+G F+
Subjt:  LQAEDVVIKGKVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKV-EQRIGSVVTGTVQILKPYGAFI

Query:  DIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRI
        D+GG   LL ++QISQ  ++D+G + + GD ++ ++++ D+ +GR+SLSTK LE +PG+++ N   +   A + A+  R+++
Subjt:  DIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRI

P29344 30S ribosomal protein S1, chloroplastic | e value: 1.5e-145 | % identity: 71.17
Query:  MSSLAHQLC-WLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFV
        M+SLA QL   L   PLS++ LS    K FS K +   +  P+VSA A    +SNAQT+ER +LKQLF++AYERC + PM+GVSFT++DFH AL  YDF 
Subjt:  MSSLAHQLC-WLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFV

Query:  SQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKVV
        S++G++VKGTV+ +DANGALVD TAK +A+LP  EACI +I +VEEAGI PG+ EEFVII E E DD+LILSLR +QY LAWERCRQLQAEDVV+KGK+V
Subjt:  SQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKVV

Query:  GANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI
        GANKGGV+ LVEGLRGFVPFSQIS+KS+AEELL KE+ LKFVEVDE+ SRL++SN KA+A+S+ +  IGSVVTGTVQ LKPYGAFIDIGGINGLLHVSQI
Subjt:  GANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI

Query:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE
        S + +SDI TVLQPGD LKVMILS+D ERGRVSLSTKKLEP PGDMI NPKLVFEKA+EMAQ FRQRIAQAE MAR D+LRFQPE
Subjt:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE

P46228 30S ribosomal protein S1 | e value: 5.2e-74 | % identity: 47.97
Query:  PMDGVSFTLEDFHAALANYDFVSQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQY
        P   + FT EDF A L  YD+    G  V GTV+  +  GAL+D  AK  AFLP QE  I ++   EE  + P    EF I++++ +D  L LS+R ++Y
Subjt:  PMDGVSFTLEDFHAALANYDFVSQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQY

Query:  SLAWERCRQLQAEDVVIKGKVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQ-RIGSVVTGTVQ
          AWER RQLQ ED  ++ +V   N+GG +V +EGLRGF+P S IS +   E+L+ +EL LKF+EVDE  +RL+LS+ +A+   K+ +  +G VV G V+
Subjt:  SLAWERCRQLQAEDVVIKGKVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQ-RIGSVVTGTVQ

Query:  ILKPYGAFIDIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEE
         +KPYGAFIDIGG++GLLH+S+IS +HI    +V    D +KVMI+  D ERGR+SLSTK+LEP PGDM+ NP++V+EKA+EMA  +R+++ Q  E
Subjt:  ILKPYGAFIDIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEE

P73530 30S ribosomal protein S1 homolog A | e value: 1.4e-71 | % identity: 48.63
Query:  VSFTLEDFHAALANYDFVSQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAW
        + FTLEDF A L  YD+    G  V GTV+  ++ GAL+D  AK  A++P QE  I ++   EE  + P    EF I+ ++ +D  L LS+R ++Y  AW
Subjt:  VSFTLEDFHAALANYDFVSQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAW

Query:  ERCRQLQAEDVVIKGKVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVE-QRIGSVVTGTVQILKP
        ER RQLQAED  ++  V   N+GG +V +EGLRGF+P S ISA+   E+L+ ++L LKF+EVDE+ +RL+LS+ +A+   K+    +  VV G+V+ +KP
Subjt:  ERCRQLQAEDVVIKGKVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVE-QRIGSVVTGTVQILKP

Query:  YGAFIDIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQ-RIAQAE
        YGAFIDIGG++GLLH+S+IS +HI    +V    D +KVMI+  D ERGR+SLSTK+LEP PG M+ +  LV E ADEMA+IFRQ R+A+A+
Subjt:  YGAFIDIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQ-RIAQAE

Q93VC7 30S ribosomal protein S1, chloroplastic | e value: 1.6e-139 | % identity: 67.53
Query:  MSSLAHQLCWLSSSPL-SSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFV
        M+SLA Q   L  SPL SS+RLS  + K F   +S  S +  +V+A A    +S+ QTKERL LK++F++AYERC ++PM+GV+FT++DF AA+  YDF 
Subjt:  MSSLAHQLCWLSSSPL-SSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFV

Query:  SQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKVV
        S++GT+VKGTV+++DANGALVD +AK +A+L  ++ACI +I HVEEAGI PG+ EEFVII E E DD+L+LSLR++QY LAWERCRQLQAEDV++K KV+
Subjt:  SQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKVV

Query:  GANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI
        GANKGG++ LVEGLRGFVPFSQIS+K+ AEELL KE+ LKFVEVDE+ ++L+LSN KA+A+S+ +  IGSVV G VQ LKPYGAFIDIGGINGLLHVSQI
Subjt:  GANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI

Query:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE
        S + +SDI TVLQPGD LKVMILS+D +RGRVSLSTKKLEP PGDMI NPKLVFEKA+EMAQ FRQRIAQAE MAR D+LRFQPE
Subjt:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE

Arabidopsis top hits | e value | % identity | Alignment
AT1G71720.1 Nucleic acid-binding proteins superfamily | e value: 2.4e-21 | % identity: 31.71
Query:  ILSLRSVQYSLAWERCRQLQAEDVVIKGKVVGANKGGVIVLVEGLRGFVPFSQISAK----STAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVE
        +LS R     +AW R RQ++  +  I+ K+   N GG++  +EGLR F+P  ++  K    +  +E + +   ++   ++E  + LILS  + +A  K+ 
Subjt:  ILSLRSVQYSLAWERCRQLQAEDVVIKGKVVGANKGGVIVLVEGLRGFVPFSQISAK----STAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVE

Query:  QRIGSVVTGTVQILKPYGAFIDIG--GINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQI
         R G+++ GTV  + PYGA + +G    +GLLH+S I++  I  +  VLQ  + +KV+++       ++SLS   LE  PG  I + + VF +A+EMA+ 
Subjt:  QRIGSVVTGTVQILKPYGAFIDIG--GINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQI

Query:  FRQRI
        +R+++
Subjt:  FRQRI

AT3G23700.1 Nucleic acid-binding proteins superfamily | e value: 8.4e-19 | % identity: 30.56
Query:  WERCRQLQAEDVVIKGKVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEE-----------LLNKELRLKFVEVDEKLSRLILSNCKAI-ANSKVEQRIG
        W+  +         +G+V G N GG+++    L GF+P+ Q+S   + +E           L+  +L +K V+ DE+  +LILS   A+         +G
Subjt:  WERCRQLQAEDVVIKGKVVGANKGGVIVLVEGLRGFVPFSQISAKSTAEE-----------LLNKELRLKFVEVDEKLSRLILSNCKAI-ANSKVEQRIG

Query:  SVVTGTVQILKPYGAFIDIG------GINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP
         V  G V  ++ YGAFI +        + GL+HVS++S +++ D+  VL+ GD ++V++ + D E+ R++LS K+LE +P
Subjt:  SVVTGTVQILKPYGAFIDIG------GINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP

AT4G29060.1 elongation factor Ts family protein | e value: 4.2e-10 | % identity: 38.16
Query:  GSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP
        G+  TG V+ ++P+GAF+D G   +GL+HVSQ+S N + D+ +V+  G  +KV ++  D E  R+SL+ ++ +  P
Subjt:  GSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP

AT4G29060.2 elongation factor Ts family protein | e value: 4.2e-10 | % identity: 38.16
Query:  GSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP
        G+  TG V+ ++P+GAF+D G   +GL+HVSQ+S N + D+ +V+  G  +KV ++  D E  R+SL+ ++ +  P
Subjt:  GSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP

AT5G30510.1 ribosomal protein S1 | e value: 1.1e-140 | % identity: 67.53
Query:  MSSLAHQLCWLSSSPL-SSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFV
        M+SLA Q   L  SPL SS+RLS  + K F   +S  S +  +V+A A    +S+ QTKERL LK++F++AYERC ++PM+GV+FT++DF AA+  YDF 
Subjt:  MSSLAHQLCWLSSSPL-SSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFV

Query:  SQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKVV
        S++GT+VKGTV+++DANGALVD +AK +A+L  ++ACI +I HVEEAGI PG+ EEFVII E E DD+L+LSLR++QY LAWERCRQLQAEDV++K KV+
Subjt:  SQLGTKVKGTVYRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKVV

Query:  GANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI
        GANKGG++ LVEGLRGFVPFSQIS+K+ AEELL KE+ LKFVEVDE+ ++L+LSN KA+A+S+ +  IGSVV G VQ LKPYGAFIDIGGINGLLHVSQI
Subjt:  GANKGGVIVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI

Query:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE
        S + +SDI TVLQPGD LKVMILS+D +RGRVSLSTKKLEP PGDMI NPKLVFEKA+EMAQ FRQRIAQAE MAR D+LRFQPE
Subjt:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPE
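The % identity values listed with each hit can be cross-checked against the alignment midline, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below is illustrative only: the function name `percent_identity` is hypothetical, and it assumes identity is the number of midline letters divided by the number of alignment columns (the length of the `Query:` line, gaps included). The example uses the single-block AT4G29060.1 alignment from this page.

```python
def percent_identity(query_line: str, midline: str) -> float:
    """Estimate % identity from a BLAST-style pairwise alignment block.

    Assumption: identical positions are the alphabetic characters in the
    midline, and the denominator is the number of alignment columns,
    taken here as the length of the query line (gap characters included).
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query_line), 2)

# AT4G29060.1 alignment block, copied from the record (offsets removed).
query = "GSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP"
midline = "G+  TG V+ ++P+GAF+D G   +GL+HVSQ+S N + D+ +V+  G  +KV ++  D E  R+SL+ ++ +  P"

print(percent_identity(query, midline))  # 38.16
```

Here 29 identical positions over 76 columns give 38.16, which agrees with the value listed for that hit. For hits whose alignment is split over several blocks, the query and midline lines would need to be concatenated before calling the function.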


Sequences
CDS sequence
ATGAGCTCATTGGCGCATCAACTTTGTTGGTTGAGTAGTTCCCCTCTTTCTTCTACGCGGCTTTCGGTTTCGAGTTGGAAGCGCTTTTCTGCGAAGGAAAGCCGCACGTC
TAAAGCGCTTCCGGTAGTTTCAGCTACAGCTTCCCCTACTCCCATTTCCAATGCGCAGACCAAAGAGCGCCTTCGACTCAAGCAACTCTTCAAGGAAGCTTATGAACGCT
GCTGTTCTACTCCCATGGATGGCGTCTCCTTCACCCTTGAGGACTTCCATGCCGCTCTTGCAAACTATGACTTCGTTTCTCAACTCGGAACCAAGGTTAAAGGTACTGTA
TACCGTTCAGATGCTAATGGGGCATTAGTTGACACTACTGCAAAGGGAACTGCATTCTTGCCCACCCAAGAGGCATGCATTCTTCAAATAAGCCATGTAGAAGAAGCAGG
CATATATCCTGGTTTAGAAGAGGAGTTTGTAATTATTGCTGAACAGGAAGATGATGATACCTTAATTCTGAGCTTGAGAAGTGTCCAGTATAGCCTTGCTTGGGAGCGAT
GCAGACAACTCCAAGCTGAGGATGTTGTTATCAAGGGTAAGGTTGTTGGTGCAAACAAAGGGGGAGTAATTGTTCTTGTGGAAGGCCTTAGAGGCTTTGTTCCTTTCTCT
CAGATATCAGCAAAATCAACTGCAGAGGAGCTGCTTAATAAAGAGCTTCGTCTGAAGTTTGTGGAGGTCGATGAGAAACTATCTCGGCTAATTCTAAGTAACTGCAAGGC
AATTGCCAATAGCAAGGTAGAGCAAAGAATTGGATCAGTAGTTACTGGAACTGTGCAGATTCTGAAACCATATGGAGCCTTTATTGACATTGGTGGAATTAATGGCCTTC
TTCATGTTAGTCAAATCAGTCAAAATCACATATCAGATATTGGAACTGTTCTTCAACCAGGAGATATACTAAAGGTCATGATTTTGAGCTATGACCACGAGAGAGGCCGT
GTTAGTCTTTCTACCAAGAAATTGGAACCTAATCCTGGAGACATGATTCACAATCCAAAGCTTGTTTTTGAGAAGGCAGATGAAATGGCTCAGATATTCAGGCAAAGAAT
AGCTCAAGCAGAAGAAATGGCTCGTGTAGACCTTCTCAGATTTCAGCCTGAGGTACTTATGCATTTGCCTTAG
mRNA sequence
ATGAGCTCATTGGCGCATCAACTTTGTTGGTTGAGTAGTTCCCCTCTTTCTTCTACGCGGCTTTCGGTTTCGAGTTGGAAGCGCTTTTCTGCGAAGGAAAGCCGCACGTC
TAAAGCGCTTCCGGTAGTTTCAGCTACAGCTTCCCCTACTCCCATTTCCAATGCGCAGACCAAAGAGCGCCTTCGACTCAAGCAACTCTTCAAGGAAGCTTATGAACGCT
GCTGTTCTACTCCCATGGATGGCGTCTCCTTCACCCTTGAGGACTTCCATGCCGCTCTTGCAAACTATGACTTCGTTTCTCAACTCGGAACCAAGGTTAAAGGTACTGTA
TACCGTTCAGATGCTAATGGGGCATTAGTTGACACTACTGCAAAGGGAACTGCATTCTTGCCCACCCAAGAGGCATGCATTCTTCAAATAAGCCATGTAGAAGAAGCAGG
CATATATCCTGGTTTAGAAGAGGAGTTTGTAATTATTGCTGAACAGGAAGATGATGATACCTTAATTCTGAGCTTGAGAAGTGTCCAGTATAGCCTTGCTTGGGAGCGAT
GCAGACAACTCCAAGCTGAGGATGTTGTTATCAAGGGTAAGGTTGTTGGTGCAAACAAAGGGGGAGTAATTGTTCTTGTGGAAGGCCTTAGAGGCTTTGTTCCTTTCTCT
CAGATATCAGCAAAATCAACTGCAGAGGAGCTGCTTAATAAAGAGCTTCGTCTGAAGTTTGTGGAGGTCGATGAGAAACTATCTCGGCTAATTCTAAGTAACTGCAAGGC
AATTGCCAATAGCAAGGTAGAGCAAAGAATTGGATCAGTAGTTACTGGAACTGTGCAGATTCTGAAACCATATGGAGCCTTTATTGACATTGGTGGAATTAATGGCCTTC
TTCATGTTAGTCAAATCAGTCAAAATCACATATCAGATATTGGAACTGTTCTTCAACCAGGAGATATACTAAAGGTCATGATTTTGAGCTATGACCACGAGAGAGGCCGT
GTTAGTCTTTCTACCAAGAAATTGGAACCTAATCCTGGAGACATGATTCACAATCCAAAGCTTGTTTTTGAGAAGGCAGATGAAATGGCTCAGATATTCAGGCAAAGAAT
AGCTCAAGCAGAAGAAATGGCTCGTGTAGACCTTCTCAGATTTCAGCCTGAGGTACTTATGCATTTGCCTTAG
Protein sequence
MSSLAHQLCWLSSSPLSSTRLSVSSWKRFSAKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCSTPMDGVSFTLEDFHAALANYDFVSQLGTKVKGTV
YRSDANGALVDTTAKGTAFLPTQEACILQISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQLQAEDVVIKGKVVGANKGGVIVLVEGLRGFVPFS
QISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSKVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGR
VSLSTKKLEPNPGDMIHNPKLVFEKADEMAQIFRQRIAQAEEMARVDLLRFQPEVLMHLP
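As a quick consistency check between the CDS and protein sequences listed above, the CDS can be translated codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first 30 nucleotides of the CDS are used in the example, and the full sequence can be passed in the same way.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by first, second, then third base over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A, C, G, T are translated as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence listed in this record.
cds_start = "ATGAGCTCATTGGCGCATCAACTTTGTTGG"
print(translate(cds_start))  # MSSLAHQLCW
```

The output matches the first ten residues of the protein sequence in this record.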