; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031646 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031646
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description30S ribosomal protein S1, chloroplastic
Genome locationchr11:11327797..11330785
RNA-Seq ExpressionLag0031646
SyntenyLag0031646
Gene Ontology termsGO:0005840 - ribosome (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138847.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Momordica charantia]4.0e-17283.76Show/hide
Query:  MNSLAHQLCGLSSSPLSSTRLSV--SNWKRFSPKESRTSK--ALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANC
        MNSLAH++      PLSS RLSV  S+W+RFS KES   K  ALPVVS+ AS TPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHAAL+N 
Subjt:  MNSLAHQLCGLSSSPLSSTRLSV--SNWKRFSPKESRTSK--ALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANC

Query:  DFVSQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKG
        DFVS+LGTKVKGTV+ TDA+GALVDTTAKGTA+LP +EACILKI HVEEAGIYPGLEEEFVIIAE E D +LILSLR +QY LAWERCRQ QAEDVVIKG
Subjt:  DFVSQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKG

Query:  KVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVVGANKGGV VLVEGLRGFVPFSQISAKSTAEELL+KELRLKFVEVDE+LSRLILSNCKAIANS  E RIGSVVTGTVQILK YGAFIDIGG+NGLLH+
Subjt:  KVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE
        SQIS NHISD+ TVL+PGD LKVMILSYDHERGRVSLSTK LEP PGDMIHNPKLVFEKADE+AQ FRQRIAQAE MA VDLLR QPE
Subjt:  SQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE

XP_022958818.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Cucurbita moschata]1.2e-16882.26Show/hide
Query:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS
        M+SLAHQLCGL SSPLSST   +S  KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHAALAN DFVS
Subjt:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS

Query:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV
        +LGTKVKGTV+ TDANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQEDD +LILSLRSVQY LAWERCRQ QAED VIKGKV
Subjt:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV

Query:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKL RLILSN KAI +S  E RIGSVVTG VQILKPYGAF+DIGG+NGLLHVSQ
Subjt:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPEVLL
        ISQNHI DI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADE+AQ FRQRIAQAE MA  +LL FQPE+LL
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPEVLL

XP_023006590.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Cucurbita maxima]2.4e-16982.26Show/hide
Query:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS
        M+SLAHQLCGL SSPLSST   +S  KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHA+LAN DFVS
Subjt:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS

Query:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV
        ++GTKVKGTV+ TDANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQED+ +LILSLRSVQY LAWERCRQ QAED VIKGKV
Subjt:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV

Query:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKLSRL+LSN KAIA+S  E RIGSVVTG VQILKPYGAFIDIGG+NGLLHVSQ
Subjt:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPEVLL
        ISQNHISDI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADE+AQ FRQRIAQAE MA  +LL FQPE+LL
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPEVLL

XP_023006591.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Cucurbita maxima]1.6e-16882.38Show/hide
Query:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS
        M+SLAHQLCGL SSPLSST   +S  KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHA+LAN DFVS
Subjt:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS

Query:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV
        ++GTKVKGTV+ TDANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQED+ +LILSLRSVQY LAWERCRQ QAED VIKGKV
Subjt:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV

Query:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKLSRL+LSN KAIA+S  E RIGSVVTG VQILKPYGAFIDIGG+NGLLHVSQ
Subjt:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE
        ISQNHISDI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADE+AQ FRQRIAQAE MA  +LL FQPE
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE

XP_023547911.1 30S ribosomal protein S1, chloroplastic-like [Cucurbita pepo subsp. pepo]6.0e-16882.64Show/hide
Query:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS
        M+SLAHQLCGL SSPLSST   +S  KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFH ALAN DFVS
Subjt:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS

Query:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV
        +LGTKVKGTV+ TDANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQEDD +LILSLRSVQY LAWERCRQ QAED VIKG V
Subjt:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV

Query:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKLSRLILSN KAIA+S  E +IGSVVTG VQILKPYGAFIDIGG+NGLLHVSQ
Subjt:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE
        ISQNHISDI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADE+AQ FRQRIAQAE MA  +LL FQPE
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE

TrEMBL top hitse value%identityAlignment
A0A6J1CCC9 30S ribosomal protein S1, chloroplastic-like isoform X11.9e-17283.76Show/hide
Query:  MNSLAHQLCGLSSSPLSSTRLSV--SNWKRFSPKESRTSK--ALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANC
        MNSLAH++      PLSS RLSV  S+W+RFS KES   K  ALPVVS+ AS TPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHAAL+N 
Subjt:  MNSLAHQLCGLSSSPLSSTRLSV--SNWKRFSPKESRTSK--ALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANC

Query:  DFVSQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKG
        DFVS+LGTKVKGTV+ TDA+GALVDTTAKGTA+LP +EACILKI HVEEAGIYPGLEEEFVIIAE E D +LILSLR +QY LAWERCRQ QAEDVVIKG
Subjt:  DFVSQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKG

Query:  KVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVVGANKGGV VLVEGLRGFVPFSQISAKSTAEELL+KELRLKFVEVDE+LSRLILSNCKAIANS  E RIGSVVTGTVQILK YGAFIDIGG+NGLLH+
Subjt:  KVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE
        SQIS NHISD+ TVL+PGD LKVMILSYDHERGRVSLSTK LEP PGDMIHNPKLVFEKADE+AQ FRQRIAQAE MA VDLLR QPE
Subjt:  SQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE

A0A6J1H453 30S ribosomal protein S1, chloroplastic-like isoform X15.9e-16982.26Show/hide
Query:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS
        M+SLAHQLCGL SSPLSST   +S  KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHAALAN DFVS
Subjt:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS

Query:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV
        +LGTKVKGTV+ TDANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQEDD +LILSLRSVQY LAWERCRQ QAED VIKGKV
Subjt:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV

Query:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKL RLILSN KAI +S  E RIGSVVTG VQILKPYGAF+DIGG+NGLLHVSQ
Subjt:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPEVLL
        ISQNHI DI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADE+AQ FRQRIAQAE MA  +LL FQPE+LL
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPEVLL

A0A6J1H681 30S ribosomal protein S1, chloroplastic-like isoform X23.8e-16882.38Show/hide
Query:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS
        M+SLAHQLCGL SSPLSST   +S  KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHAALAN DFVS
Subjt:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS

Query:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV
        +LGTKVKGTV+ TDANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQEDD +LILSLRSVQY LAWERCRQ QAED VIKGKV
Subjt:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV

Query:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKL RLILSN KAI +S  E RIGSVVTG VQILKPYGAF+DIGG+NGLLHVSQ
Subjt:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE
        ISQNHI DI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADE+AQ FRQRIAQAE MA  +LL FQPE
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE

A0A6J1KY62 30S ribosomal protein S1, chloroplastic-like isoform X27.7e-16982.38Show/hide
Query:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS
        M+SLAHQLCGL SSPLSST   +S  KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHA+LAN DFVS
Subjt:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS

Query:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV
        ++GTKVKGTV+ TDANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQED+ +LILSLRSVQY LAWERCRQ QAED VIKGKV
Subjt:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV

Query:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKLSRL+LSN KAIA+S  E RIGSVVTG VQILKPYGAFIDIGG+NGLLHVSQ
Subjt:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE
        ISQNHISDI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADE+AQ FRQRIAQAE MA  +LL FQPE
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE

A0A6J1L0J3 30S ribosomal protein S1, chloroplastic-like isoform X11.2e-16982.26Show/hide
Query:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS
        M+SLAHQLCGL SSPLSST   +S  KRFS          PVVSA ASPTPISNAQTKERL+LKQLFKEAYERCC TPMDGVSFTLEDFHA+LAN DFVS
Subjt:  MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVS

Query:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV
        ++GTKVKGTV+ TDANGALVDT+AKGTA+LPTQEACI  I HVEEAGIYPGLEEEFVII   EQED+ +LILSLRSVQY LAWERCRQ QAED VIKGKV
Subjt:  QLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIA--EQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKV

Query:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        V  NKGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDEKLSRL+LSN KAIA+S  E RIGSVVTG VQILKPYGAFIDIGG+NGLLHVSQ
Subjt:  VGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPEVLL
        ISQNHISDI  VLQPGD+LKVMILSYD +RGR+SLSTKKLEP+PGDM+HNPKLVFEKADE+AQ FRQRIAQAE MA  +LL FQPE+LL
Subjt:  ISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPEVLL

SwissProt top hitse value%identityAlignment
O33698 30S ribosomal protein S11.1e-4435.46Show/hide
Query:  EDFHAALANCDFVSQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQ
        +DF  AL      SQ G  V+G V     +GA +D   K  AFLP +EA +  +  + EA +    E EF++I +Q +D  + +SLR++    AW R  +
Subjt:  EDFHAALANCDFVSQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQ

Query:  FQAEDVVIKGKVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMV-EQRIGSVVTGTVQILKPYGAFI
         Q     ++ KV G+NKGGV   +EGLR F+P S ++ K   + L  K L + F+EV+    +L+LS  +A   ++V E  +G ++ G V  LKP+G F+
Subjt:  FQAEDVVIKGKVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMV-EQRIGSVVTGTVQILKPYGAFI

Query:  DIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRI
        D+GG   LL ++QISQ  ++D+G + + GD ++ ++++ D+ +GR+SLSTK LE +PG+++ N   +   A + A+  R+++
Subjt:  DIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRI

P29344 30S ribosomal protein S1, chloroplastic2.6e-14571.17Show/hide
Query:  MNSLAHQLC-GLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFV
        M SLA QL  GL   PLS++ LS    K FSPK +   +  P+VSA A    +SNAQT+ER +LKQLF++AYERC   PM+GVSFT++DFH AL   DF 
Subjt:  MNSLAHQLC-GLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFV

Query:  SQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKVV
        S++G++VKGTV+ TDANGALVD TAK +A+LP  EACI +I +VEEAGI PG+ EEFVII E E DD+LILSLR +QY LAWERCRQ QAEDVV+KGK+V
Subjt:  SQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKVV

Query:  GANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI
        GANKGGVV LVEGLRGFVPFSQIS+KS+AEELL KE+ LKFVEVDE+ SRL++SN KA+A+S  +  IGSVVTGTVQ LKPYGAFIDIGGINGLLHVSQI
Subjt:  GANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI

Query:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE
        S + +SDI TVLQPGD LKVMILS+D ERGRVSLSTKKLEP PGDMI NPKLVFEKA+E+AQ FRQRIAQAE MA  D+LRFQPE
Subjt:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE

P46228 30S ribosomal protein S18.4e-7246.96Show/hide
Query:  PMDGVSFTLEDFHAALANCDFVSQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQY
        P   + FT EDF A L   D+    G  V GTV+  +  GAL+D  AK  AFLP QE  I ++   EE  + P    EF I++++ +D  L LS+R ++Y
Subjt:  PMDGVSFTLEDFHAALANCDFVSQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQY

Query:  SLAWERCRQFQAEDVVIKGKVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKA-IANSMVEQRIGSVVTGTVQ
          AWER RQ Q ED  ++ +V   N+GG +V +EGLRGF+P S IS +   E+L+ +EL LKF+EVDE  +RL+LS+ +A +   M    +G VV G V+
Subjt:  SLAWERCRQFQAEDVVIKGKVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKA-IANSMVEQRIGSVVTGTVQ

Query:  ILKPYGAFIDIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEE
         +KPYGAFIDIGG++GLLH+S+IS +HI    +V    D +KVMI+  D ERGR+SLSTK+LEP PGDM+ NP++V+EKA+E+A  +R+++ Q  E
Subjt:  ILKPYGAFIDIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEE

P73530 30S ribosomal protein S1 homolog A3.0e-6947.6Show/hide
Query:  VSFTLEDFHAALANCDFVSQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAW
        + FTLEDF A L   D+    G  V GTV+  ++ GAL+D  AK  A++P QE  I ++   EE  + P    EF I+ ++ +D  L LS+R ++Y  AW
Subjt:  VSFTLEDFHAALANCDFVSQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAW

Query:  ERCRQFQAEDVVIKGKVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKA-IANSMVEQRIGSVVTGTVQILKP
        ER RQ QAED  ++  V   N+GG +V +EGLRGF+P S ISA+   E+L+ ++L LKF+EVDE+ +RL+LS+ +A +   M    +  VV G+V+ +KP
Subjt:  ERCRQFQAEDVVIKGKVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKA-IANSMVEQRIGSVVTGTVQILKP

Query:  YGAFIDIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQ-RIAQAE
        YGAFIDIGG++GLLH+S+IS +HI    +V    D +KVMI+  D ERGR+SLSTK+LEP PG M+ +  LV E ADE+A+IFRQ R+A+A+
Subjt:  YGAFIDIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQ-RIAQAE

Q93VC7 30S ribosomal protein S1, chloroplastic6.1e-13967.27Show/hide
Query:  MNSLAHQLCGLSSSPL-SSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFV
        M SLA Q  GL  SPL SS+RLS    K F P+    S +  +V+A A    +S+ QTKERL LK++F++AYERC  +PM+GV+FT++DF AA+   DF 
Subjt:  MNSLAHQLCGLSSSPL-SSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFV

Query:  SQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKVV
        S++GT+VKGTV++TDANGALVD +AK +A+L  ++ACI +I HVEEAGI PG+ EEFVII E E DD+L+LSLR++QY LAWERCRQ QAEDV++K KV+
Subjt:  SQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKVV

Query:  GANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI
        GANKGG+V LVEGLRGFVPFSQIS+K+ AEELL KE+ LKFVEVDE+ ++L+LSN KA+A+S  +  IGSVV G VQ LKPYGAFIDIGGINGLLHVSQI
Subjt:  GANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI

Query:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE
        S + +SDI TVLQPGD LKVMILS+D +RGRVSLSTKKLEP PGDMI NPKLVFEKA+E+AQ FRQRIAQAE MA  D+LRFQPE
Subjt:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE

Arabidopsis top hitse value%identityAlignment
AT1G71720.1 Nucleic acid-binding proteins superfamily4.5e-2030.73Show/hide
Query:  ILSLRSVQYSLAWERCRQFQAEDVVIKGKVVGANKGGVVVLVEGLRGFVPFSQISAK----STAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVE
        +LS R     +AW R RQ +  +  I+ K+   N GG++  +EGLR F+P  ++  K    +  +E + +   ++   ++E  + LILS  + +A   + 
Subjt:  ILSLRSVQYSLAWERCRQFQAEDVVIKGKVVGANKGGVVVLVEGLRGFVPFSQISAK----STAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVE

Query:  QRIGSVVTGTVQILKPYGAFIDIG--GINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQI
         R G+++ GTV  + PYGA + +G    +GLLH+S I++  I  +  VLQ  + +KV+++       ++SLS   LE  PG  I + + VF +A+E+A+ 
Subjt:  QRIGSVVTGTVQILKPYGAFIDIG--GINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQI

Query:  FRQRI
        +R+++
Subjt:  FRQRI

AT3G23700.1 Nucleic acid-binding proteins superfamily2.2e-1930.56Show/hide
Query:  WERCRQFQAEDVVIKGKVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEE-----------LLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQ-RIG
        W+  + +       +G+V G N GG+++    L GF+P+ Q+S   + +E           L+  +L +K V+ DE+  +LILS   A+     +   +G
Subjt:  WERCRQFQAEDVVIKGKVVGANKGGVVVLVEGLRGFVPFSQISAKSTAEE-----------LLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQ-RIG

Query:  SVVTGTVQILKPYGAFIDIG------GINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP
         V  G V  ++ YGAFI +        + GL+HVS++S +++ D+  VL+ GD ++V++ + D E+ R++LS K+LE +P
Subjt:  SVVTGTVQILKPYGAFIDIG------GINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP

AT4G29060.1 elongation factor Ts family protein4.2e-1038.16Show/hide
Query:  GSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP
        G+  TG V+ ++P+GAF+D G   +GL+HVSQ+S N + D+ +V+  G  +KV ++  D E  R+SL+ ++ +  P
Subjt:  GSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP

AT4G29060.2 elongation factor Ts family protein4.2e-1038.16Show/hide
Query:  GSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP
        G+  TG V+ ++P+GAF+D G   +GL+HVSQ+S N + D+ +V+  G  +KV ++  D E  R+SL+ ++ +  P
Subjt:  GSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNP

AT5G30510.1 ribosomal protein S14.3e-14067.27Show/hide
Query:  MNSLAHQLCGLSSSPL-SSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFV
        M SLA Q  GL  SPL SS+RLS    K F P+    S +  +V+A A    +S+ QTKERL LK++F++AYERC  +PM+GV+FT++DF AA+   DF 
Subjt:  MNSLAHQLCGLSSSPL-SSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFV

Query:  SQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKVV
        S++GT+VKGTV++TDANGALVD +AK +A+L  ++ACI +I HVEEAGI PG+ EEFVII E E DD+L+LSLR++QY LAWERCRQ QAEDV++K KV+
Subjt:  SQLGTKVKGTVYRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKVV

Query:  GANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI
        GANKGG+V LVEGLRGFVPFSQIS+K+ AEELL KE+ LKFVEVDE+ ++L+LSN KA+A+S  +  IGSVV G VQ LKPYGAFIDIGGINGLLHVSQI
Subjt:  GANKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQI

Query:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE
        S + +SDI TVLQPGD LKVMILS+D +RGRVSLSTKKLEP PGDMI NPKLVFEKA+E+AQ FRQRIAQAE MA  D+LRFQPE
Subjt:  SQNHISDIGTVLQPGDILKVMILSYDHERGRVSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTCATTGGCTCATCAACTTTGTGGGTTGAGTAGTTCCCCTCTTTCTTCTACGCGGCTTTCGGTTTCGAATTGGAAGCGCTTTTCTCCGAAGGAAAGCCGCACGTC
TAAAGCGCTTCCGGTAGTTTCAGCTACAGCTTCCCCTACTCCCATTTCCAATGCGCAGACCAAAGAGCGCCTTCGACTCAAGCAACTCTTCAAGGAAGCTTATGAACGCT
GCTGTGCTACCCCCATGGATGGCGTCTCCTTCACCCTTGAGGACTTCCATGCCGCTCTTGCAAACTGTGACTTCGTTTCTCAACTCGGAACCAAGGTTAAAGGTACTGTA
TACCGTACAGATGCTAATGGGGCATTAGTTGACACTACTGCAAAGGGAACTGCATTCTTGCCCACCCAGGAGGCATGCATTCTTAAAATAAGCCATGTAGAAGAAGCAGG
CATATATCCTGGTTTAGAAGAGGAGTTTGTAATTATTGCTGAACAGGAAGATGATGATACCTTAATTCTGAGCTTGAGAAGTGTCCAGTATAGCCTTGCTTGGGAGCGAT
GCAGACAATTCCAAGCTGAGGATGTTGTTATCAAGGGTAAGGTTGTTGGTGCAAACAAAGGGGGAGTAGTTGTTCTTGTGGAAGGCCTTAGAGGCTTTGTTCCTTTCTCT
CAGATATCAGCAAAATCAACTGCAGAGGAGCTGCTTAATAAAGAGCTACGTCTGAAGTTTGTGGAGGTCGATGAGAAACTATCTCGGCTAATCCTAAGTAACTGCAAGGC
AATTGCCAATAGCATGGTAGAGCAAAGAATTGGGTCAGTAGTTACTGGAACTGTGCAGATTCTGAAACCATATGGAGCCTTTATTGACATTGGTGGAATTAATGGCCTTC
TTCATGTTAGTCAAATCAGTCAAAATCACATATCAGATATTGGAACTGTTCTTCAACCAGGAGATATACTAAAGGTCATGATTTTGAGCTATGACCACGAGAGAGGCCGT
GTTAGTCTTTCTACCAAGAAATTGGAACCTAATCCTGGAGACATGATTCACAATCCAAAGCTTGTTTTTGAAAAGGCAGATGAAATAGCTCAGATATTCAGGCAAAGAAT
AGCTCAAGCAGAAGAAATGGCTTGTGTAGACCTTCTCAGATTTCAGCCTGAGGTACTTTTGCATTTGCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTCATTGGCTCATCAACTTTGTGGGTTGAGTAGTTCCCCTCTTTCTTCTACGCGGCTTTCGGTTTCGAATTGGAAGCGCTTTTCTCCGAAGGAAAGCCGCACGTC
TAAAGCGCTTCCGGTAGTTTCAGCTACAGCTTCCCCTACTCCCATTTCCAATGCGCAGACCAAAGAGCGCCTTCGACTCAAGCAACTCTTCAAGGAAGCTTATGAACGCT
GCTGTGCTACCCCCATGGATGGCGTCTCCTTCACCCTTGAGGACTTCCATGCCGCTCTTGCAAACTGTGACTTCGTTTCTCAACTCGGAACCAAGGTTAAAGGTACTGTA
TACCGTACAGATGCTAATGGGGCATTAGTTGACACTACTGCAAAGGGAACTGCATTCTTGCCCACCCAGGAGGCATGCATTCTTAAAATAAGCCATGTAGAAGAAGCAGG
CATATATCCTGGTTTAGAAGAGGAGTTTGTAATTATTGCTGAACAGGAAGATGATGATACCTTAATTCTGAGCTTGAGAAGTGTCCAGTATAGCCTTGCTTGGGAGCGAT
GCAGACAATTCCAAGCTGAGGATGTTGTTATCAAGGGTAAGGTTGTTGGTGCAAACAAAGGGGGAGTAGTTGTTCTTGTGGAAGGCCTTAGAGGCTTTGTTCCTTTCTCT
CAGATATCAGCAAAATCAACTGCAGAGGAGCTGCTTAATAAAGAGCTACGTCTGAAGTTTGTGGAGGTCGATGAGAAACTATCTCGGCTAATCCTAAGTAACTGCAAGGC
AATTGCCAATAGCATGGTAGAGCAAAGAATTGGGTCAGTAGTTACTGGAACTGTGCAGATTCTGAAACCATATGGAGCCTTTATTGACATTGGTGGAATTAATGGCCTTC
TTCATGTTAGTCAAATCAGTCAAAATCACATATCAGATATTGGAACTGTTCTTCAACCAGGAGATATACTAAAGGTCATGATTTTGAGCTATGACCACGAGAGAGGCCGT
GTTAGTCTTTCTACCAAGAAATTGGAACCTAATCCTGGAGACATGATTCACAATCCAAAGCTTGTTTTTGAAAAGGCAGATGAAATAGCTCAGATATTCAGGCAAAGAAT
AGCTCAAGCAGAAGAAATGGCTTGTGTAGACCTTCTCAGATTTCAGCCTGAGGTACTTTTGCATTTGCCTTAG
Protein sequenceShow/hide protein sequence
MNSLAHQLCGLSSSPLSSTRLSVSNWKRFSPKESRTSKALPVVSATASPTPISNAQTKERLRLKQLFKEAYERCCATPMDGVSFTLEDFHAALANCDFVSQLGTKVKGTV
YRTDANGALVDTTAKGTAFLPTQEACILKISHVEEAGIYPGLEEEFVIIAEQEDDDTLILSLRSVQYSLAWERCRQFQAEDVVIKGKVVGANKGGVVVLVEGLRGFVPFS
QISAKSTAEELLNKELRLKFVEVDEKLSRLILSNCKAIANSMVEQRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQISQNHISDIGTVLQPGDILKVMILSYDHERGR
VSLSTKKLEPNPGDMIHNPKLVFEKADEIAQIFRQRIAQAEEMACVDLLRFQPEVLLHLP