; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc00G02535 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc00G02535
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description30S ribosomal protein S4, chloroplastic
Genome locationClcCtg023:35242..37966
RNA-Seq ExpressionClc00G02535
SyntenyClc00G02535
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0015935 - small ribosomal subunit (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0019843 - rRNA binding (molecular function)
InterPro domainsIPR001912 - Ribosomal protein S4/S9, N-terminal
IPR002942 - RNA-binding S4 domain
IPR018079 - Ribosomal protein S4, conserved site
IPR022801 - Ribosomal protein S4/S9
IPR036986 - RNA-binding S4 domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBN69936.1 Tetratricopeptide repeat-like superfamily protein, partial [Prunus dulcis]1.3e-8368.8Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSR-----------------------SERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSK+P+ G+DL+NQSR                       +E+QLLKYVRIAGKAKGSTGQVLLQLLEMRLDN LFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSR-----------------------SERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDNLNLTKKQGFGQFRSPLRRSLSKET
        QARQLVNHRHILVNG IVDIPSYRCKPRDIIT +DE+KSRALIQNYLDS P  ELPKHLTL P QY+GLVNQIID+  +  K    +       SLS   
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDNLNLTKKQGFGQFRSPLRRSLSKET

Query:  E-----GKMGL-------VPMPRSGGINGNFIDKTFSIVANILLRIIPTTSGEKEAFTYYRDERHS
        E     GK G+        PM RS GINGNFIDKTFSIVANILLRIIPTTSGEKEAFTYYRD  ++
Subjt:  E-----GKMGL-------VPMPRSGGINGNFIDKTFSIVANILLRIIPTTSGEKEAFTYYRDERHS

QJR52958.1 ribosomal protein S4 [Herpetospermum pedunculosum]2.2e-7285.23Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDN LFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
        QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAK+EKKSRALIQNYLDSSPP+ELPKHLTLQPLQYKGLVNQIIDN
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

QZL38683.1 ribosomal protein S4 [Citrullus ecirrhosus]1.0e-7285.8Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDN LFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
        QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIID+
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

RZC49689.1 hypothetical protein C5167_018115 [Papaver somniferum]1.7e-7265.06Show/hide
Query:  PRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIPQ
        PRFKKIRRLGALPGLTSKRPK GNDL+NQSRS                       ERQLLKYVR AGKAKGSTGQVLLQLLEMRLDN LFRLGMASTIP 
Subjt:  PRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIPQ

Query:  ARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDNLNLTKKQGFGQFRSPLRRSLSKETE
        ARQLVNHRHILVNG IVDIPSYRCKPRDIIT +DE+KS+ALIQNYLDSS   ELPKHLTL   QYKG                                 
Subjt:  ARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDNLNLTKKQGFGQFRSPLRRSLSKETE

Query:  GKMGLVPMPRSGGINGNFIDKTFSIVANILLRIIPTTSGEKEAFTYYRD
                     INGNFIDKT SIVANILLR+IPTTSGEKEAFTYYRD
Subjt:  GKMGLVPMPRSGGINGNFIDKTFSIVANILLRIIPTTSGEKEAFTYYRD

YP_009325991.1 ribosomal protein S4 [Citrullus lanatus]2.0e-7386.36Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
        QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIID+
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

TrEMBL top hitse value%identityAlignment
A0A1P8LDY9 Ribosomal protein S49.8e-7486.36Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
        QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIID+
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

A0A1P8LEX7 Ribosomal protein S49.8e-7486.36Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
        QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIID+
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

A0A249RX12 30S ribosomal protein S4, chloroplastic9.8e-7486.36Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
        QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIID+
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

A0A343A8B2 30S ribosomal protein S4, chloroplastic9.8e-7486.36Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
        QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIID+
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

A0A5H2Y7H7 Tetratricopeptide repeat-like superfamily protein (Fragment)6.1e-8468.8Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSR-----------------------SERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSK+P+ G+DL+NQSR                       +E+QLLKYVRIAGKAKGSTGQVLLQLLEMRLDN LFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSR-----------------------SERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDNLNLTKKQGFGQFRSPLRRSLSKET
        QARQLVNHRHILVNG IVDIPSYRCKPRDIIT +DE+KSRALIQNYLDS P  ELPKHLTL P QY+GLVNQIID+  +  K    +       SLS   
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDNLNLTKKQGFGQFRSPLRRSLSKET

Query:  E-----GKMGL-------VPMPRSGGINGNFIDKTFSIVANILLRIIPTTSGEKEAFTYYRDERHS
        E     GK G+        PM RS GINGNFIDKTFSIVANILLRIIPTTSGEKEAFTYYRD  ++
Subjt:  E-----GKMGL-------VPMPRSGGINGNFIDKTFSIVANILLRIIPTTSGEKEAFTYYRDERHS

SwissProt top hitse value%identityAlignment
A6MM38 30S ribosomal protein S4, chloroplastic2.7e-6878.98Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSKRP+ G+DL+NQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDN LFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
         ARQLVNHRHILVNG IVDIPSYRCKPRDIITAKDE+KSRALIQNYLDSSP  ELPKHLTL   QYKGLVNQIID+
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

Q09FV9 30S ribosomal protein S4, chloroplastic8.5e-6777.84Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSKRP  G+DL+NQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDN LFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
         ARQLVNHRHILVNG IVDIPSYRCKPRDIIT +DE+KSRALIQNYLDSSP  ELPKHLTL   QYKGLVNQIID+
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

Q1KXV7 30S ribosomal protein S4, chloroplastic8.5e-6776.7Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLT+KRP+ G+DL+NQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDN LFRLGMA TIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
         ARQLVNHRHILVNG IVDIPSYRCKPRD I A+DE+KSRALIQN LDSSPP ELP HLTLQP QYKGLVNQIID+
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

Q49KZ6 30S ribosomal protein S4, chloroplastic2.7e-6877.27Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSKRP+ G+DL+NQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDN LFRLGMASTIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
        QARQLVNHRHILVNG IVDIPSYRCKPRDIITA+D++KSRA+IQNY DSSP  E+PKHLTL P QYKGLVNQIID+
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

Q4VZH3 30S ribosomal protein S4, chloroplastic5.5e-7484.57Show/hide
Query:  PRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIPQ
        PRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS                       ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDN LFRLGMASTIPQ
Subjt:  PRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIPQ

Query:  ARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN
        ARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSR LIQNYLDSSPP+ELPKHLTLQPLQYKGLVNQIID+
Subjt:  ARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDN

Arabidopsis top hitse value%identityAlignment
AT5G39850.1 Ribosomal protein S48.1e-0447.62Show/hide
Query:  LEMRLDNTLFRLGMASTIPQARQLVNHRHILVNGSIVDIPSY
        LE RL   +F+ GMA +I  AR L+  RHI V   +V+IPS+
Subjt:  LEMRLDNTLFRLGMASTIPQARQLVNHRHILVNGSIVDIPSY

ATCG00380.1 chloroplast ribosomal protein S46.3e-6574.86Show/hide
Query:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP
        GPRFKKIRRLGALPGLTSKRPK G+DL+NQSRS                       E QLLKYVRIAGKAKGSTGQVLLQLLEMRLDN LFRLGMA TIP
Subjt:  GPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRS-----------------------ERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIP

Query:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIID
        QARQLVNH HILVNG IVDIPSYRCKPRDIIT KDE+ SR L+QN LDSS P ELP HLTL   QY+GLVNQIID
Subjt:  QARQLVNHRHILVNGSIVDIPSYRCKPRDIITAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIID


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGGCCTCGTTTCAAAAAAATACGCCGTCTGGGGGCTTTACCGGGACTAACTAGTAAAAGGCCCAAAACCGGAAACGATCTTAAAAACCAATCGCGTTCCGAACG
ACAATTACTTAAATACGTTCGTATCGCCGGAAAAGCCAAAGGATCAACGGGTCAAGTTTTACTACAATTACTTGAAATGCGTTTGGATAACACCCTTTTTCGATTGGGTA
TGGCTTCGACTATTCCTCAAGCCCGACAATTAGTTAATCATAGACATATTTTAGTTAATGGTTCTATAGTGGATATACCAAGTTATCGGTGCAAACCCCGAGATATTATT
ACAGCAAAGGATGAAAAAAAATCGAGAGCACTGATTCAAAATTATCTTGATTCATCCCCCCCTCGGGAATTGCCAAAACATTTGACTCTTCAGCCATTGCAATATAAAGG
ATTAGTCAATCAAATAATAGATAACTTAAACCTAACTAAAAAACAAGGGTTCGGACAATTTCGTTCTCCCCTTCGCCGAAGTCTAAGTAAAGAGACCGAAGGAAAAATGG
GGTTGGTACCGATGCCTAGATCTGGTGGGATAAATGGAAATTTTATCGATAAGACCTTTTCAATTGTAGCCAATATCTTATTACGAATAATTCCGACAACTTCCGGGGAA
AAAGAGGCATTCACCTATTACAGAGATGAAAGACACAGTATTCGGGATAGAAAGAAAACAATAATTGAAATAAATCTACGCTTCTTGGAGAGGGGAAGAAGCACTACACC
TAGGAATCAACAACACGAAAACTTTGTTATAAATTATCCCTTTTCCTTATCGGGATCGGAACTTAGAAAAATGGTTGGGACAACAAACATCCATCTCGGGGAGCCGTATG
AGATGAAAATCTCACGTACGGTTCTGGAACGGAGATTCTTTGAATGGAATAACGAACGACCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGGCCTCGTTTCAAAAAAATACGCCGTCTGGGGGCTTTACCGGGACTAACTAGTAAAAGGCCCAAAACCGGAAACGATCTTAAAAACCAATCGCGTTCCGAACG
ACAATTACTTAAATACGTTCGTATCGCCGGAAAAGCCAAAGGATCAACGGGTCAAGTTTTACTACAATTACTTGAAATGCGTTTGGATAACACCCTTTTTCGATTGGGTA
TGGCTTCGACTATTCCTCAAGCCCGACAATTAGTTAATCATAGACATATTTTAGTTAATGGTTCTATAGTGGATATACCAAGTTATCGGTGCAAACCCCGAGATATTATT
ACAGCAAAGGATGAAAAAAAATCGAGAGCACTGATTCAAAATTATCTTGATTCATCCCCCCCTCGGGAATTGCCAAAACATTTGACTCTTCAGCCATTGCAATATAAAGG
ATTAGTCAATCAAATAATAGATAACTTAAACCTAACTAAAAAACAAGGGTTCGGACAATTTCGTTCTCCCCTTCGCCGAAGTCTAAGTAAAGAGACCGAAGGAAAAATGG
GGTTGGTACCGATGCCTAGATCTGGTGGGATAAATGGAAATTTTATCGATAAGACCTTTTCAATTGTAGCCAATATCTTATTACGAATAATTCCGACAACTTCCGGGGAA
AAAGAGGCATTCACCTATTACAGAGATGAAAGACACAGTATTCGGGATAGAAAGAAAACAATAATTGAAATAAATCTACGCTTCTTGGAGAGGGGAAGAAGCACTACACC
TAGGAATCAACAACACGAAAACTTTGTTATAAATTATCCCTTTTCCTTATCGGGATCGGAACTTAGAAAAATGGTTGGGACAACAAACATCCATCTCGGGGAGCCGTATG
AGATGAAAATCTCACGTACGGTTCTGGAACGGAGATTCTTTGAATGGAATAACGAACGACCGTAA
Protein sequenceShow/hide protein sequence
MSGPRFKKIRRLGALPGLTSKRPKTGNDLKNQSRSERQLLKYVRIAGKAKGSTGQVLLQLLEMRLDNTLFRLGMASTIPQARQLVNHRHILVNGSIVDIPSYRCKPRDII
TAKDEKKSRALIQNYLDSSPPRELPKHLTLQPLQYKGLVNQIIDNLNLTKKQGFGQFRSPLRRSLSKETEGKMGLVPMPRSGGINGNFIDKTFSIVANILLRIIPTTSGE
KEAFTYYRDERHSIRDRKKTIIEINLRFLERGRSTTPRNQQHENFVINYPFSLSGSELRKMVGTTNIHLGEPYEMKISRTVLERRFFEWNNERP