; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G014050 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G014050
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description30S ribosomal protein S1
Genome locationchr05:21944978..21949412
RNA-Seq ExpressionLsi05G014050
SyntenyLsi05G014050
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0022627 - cytosolic small ribosomal subunit (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052175.1 30S ribosomal protein S1 [Cucumis melo var. makuwa]2.1e-19584.08Show/hide
Query:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT
        MASMAQQFTGLR VPLSSSRLSKPFSS+HL NK+RSLPVQAAVIS PIPSPQTKERFKLKEVFEEAYERCRNAPVEGI+FTLEDFHAALEKYDFDSE+GT
Subjt:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT

Query:  K-------------------------VKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDL
        K                         VKGTVFCTDNNGALVDITAKSSAYLP+QEACIHRIKHVEEAGIFPGLREEFVIIGENE+DDSLILSLRSIQYDL
Subjt:  K-------------------------VKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDL

Query:  AWERCRQLQAEDVVVKGK---------------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLK
        AWERCRQLQAEDVVVKGK                           KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLK
Subjt:  AWERCRQLQAEDVVVKGK---------------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLK

Query:  PYGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADML
        PYGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEA+ARADML
Subjt:  PYGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADML

Query:  RFQPEDIVLDIPASGLTLTTDGILGPITPELPVEGLYLNDVPPAEE
        RFQPE        SGLTLTTDGILGPITPELPVEGL LNDVPPAEE
Subjt:  RFQPEDIVLDIPASGLTLTTDGILGPITPELPVEGLYLNDVPPAEE

XP_004147619.1 30S ribosomal protein S1, chloroplastic [Cucumis sativus]1.5e-19687.41Show/hide
Query:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT
        MASMAQQFTGLR  PLSSSRLSKPFSS+H  NK+RSLPVQAAVIS PIPSPQT+ERFKLKEVFEEAYERCRNAPVEGI+FTLEDFHAALEKYDFDSE+GT
Subjt:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT

Query:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------
        KVKGTVFCTDNNGALVDITAKSSAYLP+QEACIHRIKHVEEAG+FPGLREEFVIIGENE+DDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK       
Subjt:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------

Query:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
                            KS AEELL+KELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
Subjt:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI

Query:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG
        SDIATVLQPGD+LKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEA+ARADMLRFQPE        SGLTLTTDGILG
Subjt:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG

Query:  PITPELPVEGLYLNDVPPAEE
        PITPELPVEGL LNDVPPAEE
Subjt:  PITPELPVEGLYLNDVPPAEE

XP_008438974.1 PREDICTED: 30S ribosomal protein S1, chloroplastic [Cucumis melo]1.4e-19989.07Show/hide
Query:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT
        MASMAQQFTGLR VPLSSSRLSKPFSS+HL NK+RSLPVQAAVIS PIPSPQTKERFKLKEVFEEAYERCRNAPVEGI+FTLEDFHAALEKYDFDSE+GT
Subjt:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT

Query:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------
        KVKGTVFCTDNNGALVDITAKSSAYLP+QEACIHRIKHVEEAGIFPGLREEFVIIGENE+DDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK       
Subjt:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------

Query:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
                            KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
Subjt:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI

Query:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG
        SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEA+ARADMLRFQPE        SGLTLTTDGILG
Subjt:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG

Query:  PITPELPVEGLYLNDVPPAEE
        PITPELPVEGL LNDVPPAEE
Subjt:  PITPELPVEGLYLNDVPPAEE

XP_022138241.1 30S ribosomal protein S1, chloroplastic [Momordica charantia]2.7e-19888.84Show/hide
Query:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT
        MASMAQQFTGLR  PLSSSRLS PFS RHLQNKARSLPV AAVISSPIPSPQTKERFKLKEVFE+AYERCRNAPVEGI+FTLEDFHAALEKYDFDSEMGT
Subjt:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT

Query:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------
        KVKGTVFCTD NGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPG+REEFVIIGENEADDSL+LSLRSIQYDLAWERCRQLQAEDVVVKGK       
Subjt:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------

Query:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
                            KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
Subjt:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI

Query:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG
        SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPE        SGLTLTTDGILG
Subjt:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG

Query:  PITPELPVEGLYLNDVPPAEE
        PITPELPVEGL L+DVPPAEE
Subjt:  PITPELPVEGLYLNDVPPAEE

XP_038878013.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]2.0e-20190.26Show/hide
Query:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT
        MASMAQQFTGLR VPLSSSRLSKPFSSRHLQ+KARSLPVQAAVIS PIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSE+GT
Subjt:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT

Query:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------
        KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK       
Subjt:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------

Query:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
                            KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
Subjt:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI

Query:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG
        SDI TVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPE        SGLTLTTDGILG
Subjt:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG

Query:  PITPELPVEGLYLNDVPPAEE
        PITPELPVEGL LNDVPPAEE
Subjt:  PITPELPVEGLYLNDVPPAEE

TrEMBL top hitse value%identityAlignment
A0A1S3AXL6 30S ribosomal protein S1, chloroplastic6.9e-20089.07Show/hide
Query:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT
        MASMAQQFTGLR VPLSSSRLSKPFSS+HL NK+RSLPVQAAVIS PIPSPQTKERFKLKEVFEEAYERCRNAPVEGI+FTLEDFHAALEKYDFDSE+GT
Subjt:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT

Query:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------
        KVKGTVFCTDNNGALVDITAKSSAYLP+QEACIHRIKHVEEAGIFPGLREEFVIIGENE+DDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK       
Subjt:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------

Query:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
                            KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
Subjt:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI

Query:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG
        SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEA+ARADMLRFQPE        SGLTLTTDGILG
Subjt:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG

Query:  PITPELPVEGLYLNDVPPAEE
        PITPELPVEGL LNDVPPAEE
Subjt:  PITPELPVEGLYLNDVPPAEE

A0A5A7UEP7 30S ribosomal protein S11.0e-19584.08Show/hide
Query:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT
        MASMAQQFTGLR VPLSSSRLSKPFSS+HL NK+RSLPVQAAVIS PIPSPQTKERFKLKEVFEEAYERCRNAPVEGI+FTLEDFHAALEKYDFDSE+GT
Subjt:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT

Query:  K-------------------------VKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDL
        K                         VKGTVFCTDNNGALVDITAKSSAYLP+QEACIHRIKHVEEAGIFPGLREEFVIIGENE+DDSLILSLRSIQYDL
Subjt:  K-------------------------VKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDL

Query:  AWERCRQLQAEDVVVKGK---------------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLK
        AWERCRQLQAEDVVVKGK                           KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLK
Subjt:  AWERCRQLQAEDVVVKGK---------------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLK

Query:  PYGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADML
        PYGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEA+ARADML
Subjt:  PYGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADML

Query:  RFQPEDIVLDIPASGLTLTTDGILGPITPELPVEGLYLNDVPPAEE
        RFQPE        SGLTLTTDGILGPITPELPVEGL LNDVPPAEE
Subjt:  RFQPEDIVLDIPASGLTLTTDGILGPITPELPVEGLYLNDVPPAEE

A0A6J1C966 30S ribosomal protein S1, chloroplastic1.3e-19888.84Show/hide
Query:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT
        MASMAQQFTGLR  PLSSSRLS PFS RHLQNKARSLPV AAVISSPIPSPQTKERFKLKEVFE+AYERCRNAPVEGI+FTLEDFHAALEKYDFDSEMGT
Subjt:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT

Query:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------
        KVKGTVFCTD NGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPG+REEFVIIGENEADDSL+LSLRSIQYDLAWERCRQLQAEDVVVKGK       
Subjt:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------

Query:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
                            KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
Subjt:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI

Query:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG
        SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPE        SGLTLTTDGILG
Subjt:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG

Query:  PITPELPVEGLYLNDVPPAEE
        PITPELPVEGL L+DVPPAEE
Subjt:  PITPELPVEGLYLNDVPPAEE

A0A6J1F6H4 30S ribosomal protein S1, chloroplastic-like6.3e-19385.27Show/hide
Query:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT
        MASMAQQF GLR  PLSSSRLSKPFSSRHLQN+ARSLPVQAAVI+SPIPSPQ KERFKLKE+FEEAYERCRNAPVEGI+FT+EDFH+A+EKYDF+SE+GT
Subjt:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT

Query:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------
        KVKGTVFCTD NGALVDITAKSSAYLP+QEACIHRIKHVEEAGI+PGLR+EFVIIGENEADDSL+LSLRSIQYDLAWERCRQLQAEDVVVKGK       
Subjt:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------

Query:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
                            KSTAEELL KE+PLKFVEVDEEQSRLVLSNRKA+ADSQAQLGIGSVV GTVQSLKPYGAFIDIGG+NGLLHVSQISHDRI
Subjt:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI

Query:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG
        SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIA AEAMARADML+FQPE        SGLTLTTDGILG
Subjt:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG

Query:  PITPELPVEGLYLNDVPPAEE
        P+TPELPVEGL LNDVPPAEE
Subjt:  PITPELPVEGLYLNDVPPAEE

A0A6J1IAT2 30S ribosomal protein S1, chloroplastic1.8e-19285.04Show/hide
Query:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT
        MASMAQQF GLR  PLSSSRLSKPFSSRHLQN+ARSLPVQAAVI+SPIPSP  KERFKLKE+FEEAYERCRNAPVEGI+FT+EDFH+A+EKYDF+SE+GT
Subjt:  MASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMGT

Query:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------
        KVKGTVFCTD+NGALVDITAKSSAYLP+QEACIHRIKHVEEAGI+PGLR+EFVIIGENEADDSL+LSLRSIQYDLAWERCRQLQAEDVVVKGK       
Subjt:  KVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-------

Query:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI
                            KSTAEELL KE+PLKFVEVDEEQSRLVLSNRKA+ADSQAQLGIGSVV GTVQSLKPYGAFIDIGG+NGLLHVSQISHDRI
Subjt:  --------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRI

Query:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG
        SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIA AEAMARADML+FQPE        SGLTLTTDGILG
Subjt:  SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILG

Query:  PITPELPVEGLYLNDVPPAEE
        P+TPELPVEGL LNDVPPAEE
Subjt:  PITPELPVEGLYLNDVPPAEE

SwissProt top hitse value%identityAlignment
P29344 30S ribosomal protein S1, chloroplastic4.7e-16976.07Show/hide
Query:  MASMAQQFT-GLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMG
        MAS+AQQ   GLR  PLS+S LSKPFS +H   K R  P+ +AV    + + QT+ER KLK++FE+AYERCRNAP+EG++FT++DFH AL+KYDF+SEMG
Subjt:  MASMAQQFT-GLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEMG

Query:  TKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK------
        ++VKGTVFCTD NGALVDITAKSSAYLP+ EACI+RIK+VEEAGI PG+REEFVIIGENEADDSLILSLR IQY+LAWERCRQLQAEDVVVKGK      
Subjt:  TKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK------

Query:  ---------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDR
                             KS+AEELL KE+PLKFVEVDEEQSRLV+SNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDR
Subjt:  ---------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDR

Query:  ISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGIL
        +SDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPE        SGLTL++DGIL
Subjt:  ISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGIL

Query:  GPITPELPVEGLYLNDVPPAEE
        GP+T +LP EGL L+ VPPA E
Subjt:  GPITPELPVEGLYLNDVPPAEE

P46228 30S ribosomal protein S11.8e-6746.62Show/hide
Query:  RNAPVEGIAFTLEDFHAALEKYDFDSEMGTKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRS
        ++ P   I FT EDF A L++YD+    G  V GTVF  +  GAL+DI AK++A+LPVQE  I+R++  EE  + P    EF I+ +   D  L LS+R 
Subjt:  RNAPVEGIAFTLEDFHAALEKYDFDSEMGTKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRS

Query:  IQYDLAWERCRQLQAEDVVVKGK---------------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQA-QLGIGSVVTG
        I+Y  AWER RQLQ ED  V+ +                           +   E+L+ +ELPLKF+EVDE+++RLVLS+R+A+ + +  +L +G VV G
Subjt:  IQYDLAWERCRQLQAEDVVVKGK---------------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQA-QLGIGSVVTG

Query:  TVQSLKPYGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQ
         V+ +KPYGAFIDIGG++GLLH+S+ISHD I    +V    D +KVMI+  D ERGR+SLSTK+LEP PGDM+RNP++V+EKAEEMA  +R+++ Q
Subjt:  TVQSLKPYGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQ

P73530 30S ribosomal protein S1 homolog A3.1e-6446.6Show/hide
Query:  IAFTLEDFHAALEKYDFDSEMGTKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAW
        I FTLEDF A L+KYD+    G  V GTVF  ++ GAL+DI AK++AY+P+QE  I+R+   EE  + P    EF I+ +   D  L LS+R I+Y  AW
Subjt:  IAFTLEDFHAALEKYDFDSEMGTKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAW

Query:  ERCRQLQAEDVVVKGK---------------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQ-LGIGSVVTGTVQSLKP
        ER RQLQAED  V+                             +   E+L+ ++LPLKF+EVDEE++RLVLS+R+A+ + +   L +  VV G+V+ +KP
Subjt:  ERCRQLQAEDVVVKGK---------------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQ-LGIGSVVTGTVQSLKP

Query:  YGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQ-RIAQAEAM
        YGAFIDIGG++GLLH+S+ISHD I    +V    D +KVMI+  D ERGR+SLSTK+LEP PG M+++  LV E A+EMA+ FRQ R+A+A+ +
Subjt:  YGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQ-RIAQAEAM

Q1XDE2 30S ribosomal protein S1, chloroplastic6.8e-3534.23Show/hide
Query:  FTLEDFHAALEKYDFDSEMGTKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLR----EEFVIIGENEADDSLILSLRSIQYDL
        FT  +F A L+KY +D  +G  V GT+F  + NG LVDI    SAYLP+QE     +   ++   F  L      EF ++  N     LILS+R ++Y  
Subjt:  FTLEDFHAALEKYDFDSEMGTKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLR----EEFVIIGENEADDSLILSLRSIQYDL

Query:  AWERCRQLQAED----VVVK-----------------------GKKSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAM-ADSQAQLGIGSVVTGTVQSL
        AW+R RQL AED    V++K                       G    +E+  NK + LK + V+E+ + L+LS+R+A+ + + + L +G+++ G +  +
Subjt:  AWERCRQLQAED----VVVK-----------------------GKKSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAM-ADSQAQLGIGSVVTGTVQSL

Query:  KPYGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKL
         PYG FI +G + GL+H+S+I+   +  I++  + GDT+K +I+  D+++GR+SLS K L
Subjt:  KPYGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKL

Q93VC7 30S ribosomal protein S1, chloroplastic8.3e-15871.87Show/hide
Query:  MASMAQQFTGLRSVPL-SSSRLSKPFSSRHLQNKARSL-PVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEM
        MAS+AQQF+GLR  PL SSSRLS+  S    QNK+ S+ P   A ++  + S QTKER +LK++FE+AYERCR +P+EG+AFT++DF AA+E+YDF+SE+
Subjt:  MASMAQQFTGLRSVPL-SSSRLSKPFSSRHLQNKARSL-PVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEM

Query:  GTKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-----
        GT+VKGTVF TD NGALVDI+AKSSAYL V++ACIHRIKHVEEAGI PG+ EEFVIIGENE+DDSL+LSLR+IQY+LAWERCRQLQAEDV+VK K     
Subjt:  GTKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-----

Query:  ----------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHD
                              K+ AEELL KE+PLKFVEVDEEQ++LVLSNRKA+ADSQAQLGIGSVV G VQSLKPYGAFIDIGGINGLLHVSQISHD
Subjt:  ----------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHD

Query:  RISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGI
        R+SDIATVLQPGDTLKVMILSHDR+RGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPE        SGLTL++DGI
Subjt:  RISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGI

Query:  LGPITPELPVEG--LYLNDVPPA
        LGP+  ELP +G  L ++D+P A
Subjt:  LGPITPELPVEG--LYLNDVPPA

Arabidopsis top hitse value%identityAlignment
AT1G71720.1 Nucleic acid-binding proteins superfamily1.3e-1231.91Show/hide
Query:  EELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIG--GINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDR
        +E + +   ++   ++E+++ L+LS +  +A  +  L  G+++ GTV  + PYGA + +G    +GLLH+S I+  RI  ++ VLQ  +++KV+++    
Subjt:  EELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIG--GINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDR

Query:  ERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRI
           ++SLS   LE  PG  I + + VF +AEEMA+ +R+++
Subjt:  ERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRI

AT3G23700.1 Nucleic acid-binding proteins superfamily3.8e-1737.7Show/hide
Query:  AEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQ-LGIGSVVTGTVQSLKPYGAFIDIG------GINGLLHVSQISHDRISDIATVLQPGDTLKVM
        A+ L+  +LP+K V+ DEE  +L+LS + A+    +Q + +G V  G V S++ YGAFI +        + GL+HVS++S D + D+  VL+ GD ++V+
Subjt:  AEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQ-LGIGSVVTGTVQSLKPYGAFIDIG------GINGLLHVSQISHDRISDIATVLQPGDTLKVM

Query:  ILSHDRERGRVSLSTKKLEPTP
        + + D+E+ R++LS K+LE  P
Subjt:  ILSHDRERGRVSLSTKKLEPTP

AT4G29060.1 elongation factor Ts family protein1.3e-0936.84Show/hide
Query:  GSVVTGTVQSLKPYGAFIDIGGI-NGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTP
        G+  TG V++++P+GAF+D G   +GL+HVSQ+S + + D+++V+  G  +KV ++  D E  R+SL+ ++ +  P
Subjt:  GSVVTGTVQSLKPYGAFIDIGGI-NGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTP

AT5G14580.1 polyribonucleotide nucleotidyltransferase, putative1.1e-1136.94Show/hide
Query:  VEVDEEQSRLVLSNRKAMADSQAQ--------LGIGSVVTGTVQSLKPYGAFIDI-GGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRV
        + +D     +V  N+  M  +Q Q        L +G V  GTV S+K YGAF++  GG  GLLH+S++SH+ +S ++ VL  G  +  M +  D  RG +
Subjt:  VEVDEEQSRLVLSNRKAMADSQAQ--------LGIGSVVTGTVQSLKPYGAFIDI-GGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRV

Query:  SLSTKKLEPTP
         LS K L P P
Subjt:  SLSTKKLEPTP

AT5G30510.1 ribosomal protein S15.9e-15971.87Show/hide
Query:  MASMAQQFTGLRSVPL-SSSRLSKPFSSRHLQNKARSL-PVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEM
        MAS+AQQF+GLR  PL SSSRLS+  S    QNK+ S+ P   A ++  + S QTKER +LK++FE+AYERCR +P+EG+AFT++DF AA+E+YDF+SE+
Subjt:  MASMAQQFTGLRSVPL-SSSRLSKPFSSRHLQNKARSL-PVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTLEDFHAALEKYDFDSEM

Query:  GTKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-----
        GT+VKGTVF TD NGALVDI+AKSSAYL V++ACIHRIKHVEEAGI PG+ EEFVIIGENE+DDSL+LSLR+IQY+LAWERCRQLQAEDV+VK K     
Subjt:  GTKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKGK-----

Query:  ----------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHD
                              K+ AEELL KE+PLKFVEVDEEQ++LVLSNRKA+ADSQAQLGIGSVV G VQSLKPYGAFIDIGGINGLLHVSQISHD
Subjt:  ----------------------KSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHD

Query:  RISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGI
        R+SDIATVLQPGDTLKVMILSHDR+RGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPE        SGLTL++DGI
Subjt:  RISDIATVLQPGDTLKVMILSHDRERGRVSLSTKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGI

Query:  LGPITPELPVEG--LYLNDVPPA
        LGP+  ELP +G  L ++D+P A
Subjt:  LGPITPELPVEG--LYLNDVPPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATCTTCAATACCTTATCCAACCCAAAACCACATCAACTGTGTACAGAGTAGTCTGAGAAACGCTGAGCGAAGAAGGAAAATGGCTTCCATGGCTCAGCAATTTAC
TGGGTTGCGAAGTGTGCCTCTTTCTTCATCGCGTCTCTCGAAGCCATTTTCTTCGAGGCATTTGCAGAACAAGGCCCGTTCGCTCCCCGTTCAAGCTGCAGTCATTTCGA
GCCCCATTCCCAGTCCTCAGACCAAGGAGCGTTTCAAGCTCAAGGAGGTCTTCGAGGAGGCCTACGAACGCTGCCGTAATGCCCCTGTGGAAGGCATAGCCTTCACTCTC
GAGGACTTCCATGCCGCTCTTGAAAAATACGACTTCGATTCTGAAATGGGAACTAAGGTGAAAGGTACTGTGTTTTGTACGGATAATAATGGAGCACTTGTTGACATTAC
TGCCAAGTCATCTGCATATTTGCCAGTGCAAGAGGCTTGCATTCACAGAATAAAACATGTAGAAGAAGCAGGAATATTTCCTGGTTTGAGAGAGGAGTTTGTTATTATTG
GTGAGAATGAAGCTGATGATAGCTTGATTTTGAGCTTGAGATCCATTCAATATGATCTGGCTTGGGAGAGGTGCAGACAGCTTCAAGCAGAGGACGTTGTTGTCAAGGGT
AAGAAATCAACTGCTGAAGAGCTTCTCAACAAGGAGCTTCCTCTAAAGTTTGTGGAGGTTGATGAAGAACAATCGAGGCTTGTCCTCAGTAACCGTAAGGCCATGGCTGA
CAGCCAGGCCCAACTTGGAATTGGATCAGTGGTCACTGGGACAGTTCAAAGCCTTAAACCGTATGGTGCATTCATTGACATTGGTGGAATCAATGGTCTTCTTCATGTCA
GTCAAATCAGTCATGATCGGATATCGGATATTGCAACAGTTCTTCAGCCTGGAGACACTCTCAAGGTCATGATATTGAGTCATGATCGTGAGAGAGGCCGAGTTAGTCTT
TCTACCAAGAAGTTAGAGCCCACTCCTGGGGACATGATTCGCAATCCAAAGCTTGTCTTTGAGAAGGCTGAGGAGATGGCACAAACGTTCAGGCAAAGAATAGCCCAAGC
AGAGGCAATGGCACGCGCAGACATGCTTAGGTTTCAGCCTGAGGATATTGTGTTGGATATTCCTGCGAGTGGTTTGACTTTGACTACTGATGGAATATTGGGACCAATTA
CCCCAGAGTTGCCTGTAGAGGGCTTATATTTGAATGATGTACCTCCAGCTGAAGAGTGA
mRNA sequenceShow/hide mRNA sequence
CGTGTATATATATTTGTAAAATGCAATCTTCAATACCTTATCCAACCCAAAACCACATCAACTGTGTACAGAGTAGTCTGAGAAACGCTGAGCGAAGAAGGAAAATGGCT
TCCATGGCTCAGCAATTTACTGGGTTGCGAAGTGTGCCTCTTTCTTCATCGCGTCTCTCGAAGCCATTTTCTTCGAGGCATTTGCAGAACAAGGCCCGTTCGCTCCCCGT
TCAAGCTGCAGTCATTTCGAGCCCCATTCCCAGTCCTCAGACCAAGGAGCGTTTCAAGCTCAAGGAGGTCTTCGAGGAGGCCTACGAACGCTGCCGTAATGCCCCTGTGG
AAGGCATAGCCTTCACTCTCGAGGACTTCCATGCCGCTCTTGAAAAATACGACTTCGATTCTGAAATGGGAACTAAGGTGAAAGGTACTGTGTTTTGTACGGATAATAAT
GGAGCACTTGTTGACATTACTGCCAAGTCATCTGCATATTTGCCAGTGCAAGAGGCTTGCATTCACAGAATAAAACATGTAGAAGAAGCAGGAATATTTCCTGGTTTGAG
AGAGGAGTTTGTTATTATTGGTGAGAATGAAGCTGATGATAGCTTGATTTTGAGCTTGAGATCCATTCAATATGATCTGGCTTGGGAGAGGTGCAGACAGCTTCAAGCAG
AGGACGTTGTTGTCAAGGGTAAGAAATCAACTGCTGAAGAGCTTCTCAACAAGGAGCTTCCTCTAAAGTTTGTGGAGGTTGATGAAGAACAATCGAGGCTTGTCCTCAGT
AACCGTAAGGCCATGGCTGACAGCCAGGCCCAACTTGGAATTGGATCAGTGGTCACTGGGACAGTTCAAAGCCTTAAACCGTATGGTGCATTCATTGACATTGGTGGAAT
CAATGGTCTTCTTCATGTCAGTCAAATCAGTCATGATCGGATATCGGATATTGCAACAGTTCTTCAGCCTGGAGACACTCTCAAGGTCATGATATTGAGTCATGATCGTG
AGAGAGGCCGAGTTAGTCTTTCTACCAAGAAGTTAGAGCCCACTCCTGGGGACATGATTCGCAATCCAAAGCTTGTCTTTGAGAAGGCTGAGGAGATGGCACAAACGTTC
AGGCAAAGAATAGCCCAAGCAGAGGCAATGGCACGCGCAGACATGCTTAGGTTTCAGCCTGAGGATATTGTGTTGGATATTCCTGCGAGTGGTTTGACTTTGACTACTGA
TGGAATATTGGGACCAATTACCCCAGAGTTGCCTGTAGAGGGCTTATATTTGAATGATGTACCTCCAGCTGAAGAGTGAAGATTCAAAGCTTTCCCACTGTTGTATTCTG
TACAATGTTTCACTTTGTATACAAACTTCTGCCATTGTAATTGGGTACGCAACTTCGCCCTTCATGTCATAGCTTAGTTTTGAATGCTTAATCAAAGCCCGCTTGCTAGT
TCATTTTCGTTCTCTATTATTTGTTTCTTTGGTTTCTGCATCGAAAATGAAAGTATGATCAACACTGCCACATTCCCAAGTTAAAAATATTCCATGCTAGTTGAAATGTA
TCTGAAAAATATGTTTGATCAACACTGCCATATTTTTTTTCTCT
Protein sequenceShow/hide protein sequence
MQSSIPYPTQNHINCVQSSLRNAERRRKMASMAQQFTGLRSVPLSSSRLSKPFSSRHLQNKARSLPVQAAVISSPIPSPQTKERFKLKEVFEEAYERCRNAPVEGIAFTL
EDFHAALEKYDFDSEMGTKVKGTVFCTDNNGALVDITAKSSAYLPVQEACIHRIKHVEEAGIFPGLREEFVIIGENEADDSLILSLRSIQYDLAWERCRQLQAEDVVVKG
KKSTAEELLNKELPLKFVEVDEEQSRLVLSNRKAMADSQAQLGIGSVVTGTVQSLKPYGAFIDIGGINGLLHVSQISHDRISDIATVLQPGDTLKVMILSHDRERGRVSL
STKKLEPTPGDMIRNPKLVFEKAEEMAQTFRQRIAQAEAMARADMLRFQPEDIVLDIPASGLTLTTDGILGPITPELPVEGLYLNDVPPAEE