; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003661 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003661
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description30S ribosomal protein S1
Genome locationChr08:5078169..5087913
RNA-Seq ExpressionHG10003661
SyntenyHG10003661
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0022627 - cytosolic small ribosomal subunit (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008438974.1 PREDICTED: 30S ribosomal protein S1, chloroplastic [Cucumis melo]3.7e-16073.12Show/hide
Query:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG
        M  MAQQ TGLRC PL S   SKP    H+ NK +RS PV AA+IS PIP+PQT ERFKLKE F+ A +RCRN P+EG+SFTL+DF A+LEKYDFD +LG
Subjt:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG

Query:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANK
        +KVKGTV   + NGALV+I AKS AYLPLQEACIHRIK +EEAGI+PGLREEFVIIGENE DDSL LSLR IQY+LAWERCRQLQAEDVVVKGKVVDANK
Subjt:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANK

Query:  GGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDR
        GGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS+A+L IGSVVT TV  L+ YGAF+DIGGI+GLLH+S+ISHDR
Subjt:  GGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDR

Query:  IRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA
        I D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEALARAD+L FQPE  LTL++DGI  P+TPEL 
Subjt:  IRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA

Query:  VKRGLDIEQYVPP
        V+ GLD+   VPP
Subjt:  VKRGLDIEQYVPP

XP_016902972.1 PREDICTED: 30S ribosomal protein S1, chloroplastic-like [Cucumis melo]1.4e-17086.89Show/hide
Query:  LMAQQCTGLRCEPLFSIS---SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG
        LMAQ  TGLR EPL SIS   SKPLGRS  QN  ARSF VLAA+ISSPIP+P TTERFKLK+TF +AADRC N PMEGVSFTLQ FLASLEKYDFDPQLG
Subjt:  LMAQQCTGLRCEPLFSIS---SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG

Query:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKG
        SKVKGTVVY EANGALVEIAAKSPAYLPLQEA IHRIKR+EEAGIYPG REEFVIIG+NEDD LTLSLRPIQYELAWERCRQLQA DVVVKGKVV ANKG
Subjt:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKG

Query:  GVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDRI
        GVLVVVEGLKGFVPFSEILMIST EELINKELPLK LVV EEQTR+VLSNRKVMADSKA+L IGSVVT TVL+L K+GAFVDIGG+HGLLHISEISHDRI
Subjt:  GVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDRI

Query:  RDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA
         D+A VLKPGDILKVM+LNIDREKGHIRLSTKKLEPN GDMI NPGLVF KAEEMA RFRQRLAQA
Subjt:  RDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA

XP_022138241.1 30S ribosomal protein S1, chloroplastic [Momordica charantia]1.3e-16072.88Show/hide
Query:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG
        M  MAQQ TGLRC PL S   S P    H+QNK ARS PV AA+ISSPIP+PQT ERFKLKE F++A +RCRN P+EG+SFTL+DF A+LEKYDFD ++G
Subjt:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG

Query:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANK
        +KVKGTV   +ANGALV+I AKS AYLP+QEACIHRIK +EEAGI+PG+REEFVIIGENE DDSL LSLR IQY+LAWERCRQLQAEDVVVKGKVVDANK
Subjt:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANK

Query:  GGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDR
        GGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS+A+L IGSVVT TV  L+ YGAF+DIGGI+GLLH+S+ISHDR
Subjt:  GGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDR

Query:  IRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA
        I D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTL++DGI  P+TPEL 
Subjt:  IRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA

Query:  VKRGLDIEQYVPP
        V+ GLD+   VPP
Subjt:  VKRGLDIEQYVPP

XP_038878013.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]3.2e-15972.4Show/hide
Query:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG
        M  MAQQ TGLRC PL S   SKP    H+Q+K ARS PV AA+IS PIP+PQT ERFKLKE F+ A +RCRN P+EG++FTL+DF A+LEKYDFD +LG
Subjt:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG

Query:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANK
        +KVKGTV   + NGALV+I AKS AYLP+QEACIHRIK +EEAGI+PGLREEFVIIGENE DDSL LSLR IQY+LAWERCRQLQAEDVVVKGKVVDANK
Subjt:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANK

Query:  GGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDR
        GGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS+A+L IGSVVT TV  L+ YGAF+DIGGI+GLLH+S+ISHDR
Subjt:  GGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDR

Query:  IRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA
        I D+  VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTL++DGI  P+TPEL 
Subjt:  IRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA

Query:  VKRGLDIEQYVPP
        V+ GLD+   VPP
Subjt:  VKRGLDIEQYVPP

XP_038885297.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]2.5e-19688.94Show/hide
Query:  MAQQCTGLRCEPLFSIS---SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLGS
        MAQQCTGLRCEP FSIS   SKPL  SHMQN V RSFPV+AA+IS PIPTPQTTERFKLK+TF +AADRCRN PMEGVSFTLQDFLASLEKY FDPQLG+
Subjt:  MAQQCTGLRCEPLFSIS---SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLGS

Query:  KVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKGG
        KVKGTVVY EANGALVEIAAKSPAYLPL EACIHRIKR+EEAGIYPG REEFVIIGENEDDSLTLSLR IQYELAWERCRQLQAEDV+VKGKVV AN GG
Subjt:  KVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKGG

Query:  VLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDRIR
        VLVVVEGLKGFVP+SEILMISTAEELINKELPLKFLVVNEE+TR+VLSNRK+MADSKA+LAIG+VVT TVL+L K+GAFVDIGG+HGLLHISEISHDRI 
Subjt:  VLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDRIR

Query:  DVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA
        D+AAVLKPGDILKVMILNI+ EKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRL LSSDGI  P+TPELA
Subjt:  DVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA

TrEMBL top hitse value%identityAlignment
A0A1S3AXL6 30S ribosomal protein S1, chloroplastic1.8e-16073.12Show/hide
Query:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG
        M  MAQQ TGLRC PL S   SKP    H+ NK +RS PV AA+IS PIP+PQT ERFKLKE F+ A +RCRN P+EG+SFTL+DF A+LEKYDFD +LG
Subjt:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG

Query:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANK
        +KVKGTV   + NGALV+I AKS AYLPLQEACIHRIK +EEAGI+PGLREEFVIIGENE DDSL LSLR IQY+LAWERCRQLQAEDVVVKGKVVDANK
Subjt:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANK

Query:  GGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDR
        GGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS+A+L IGSVVT TV  L+ YGAF+DIGGI+GLLH+S+ISHDR
Subjt:  GGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDR

Query:  IRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA
        I D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEALARAD+L FQPE  LTL++DGI  P+TPEL 
Subjt:  IRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA

Query:  VKRGLDIEQYVPP
        V+ GLD+   VPP
Subjt:  VKRGLDIEQYVPP

A0A1S4E424 30S ribosomal protein S1, chloroplastic-like6.6e-17186.89Show/hide
Query:  LMAQQCTGLRCEPLFSIS---SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG
        LMAQ  TGLR EPL SIS   SKPLGRS  QN  ARSF VLAA+ISSPIP+P TTERFKLK+TF +AADRC N PMEGVSFTLQ FLASLEKYDFDPQLG
Subjt:  LMAQQCTGLRCEPLFSIS---SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG

Query:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKG
        SKVKGTVVY EANGALVEIAAKSPAYLPLQEA IHRIKR+EEAGIYPG REEFVIIG+NEDD LTLSLRPIQYELAWERCRQLQA DVVVKGKVV ANKG
Subjt:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKG

Query:  GVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDRI
        GVLVVVEGLKGFVPFSEILMIST EELINKELPLK LVV EEQTR+VLSNRKVMADSKA+L IGSVVT TVL+L K+GAFVDIGG+HGLLHISEISHDRI
Subjt:  GVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDRI

Query:  RDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA
         D+A VLKPGDILKVM+LNIDREKGHIRLSTKKLEPN GDMI NPGLVF KAEEMA RFRQRLAQA
Subjt:  RDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA

A0A5A7SUN2 30S ribosomal protein S16.6e-17186.89Show/hide
Query:  LMAQQCTGLRCEPLFSIS---SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG
        LMAQ  TGLR EPL SIS   SKPLGRS  QN  ARSF VLAA+ISSPIP+P TTERFKLK+TF +AADRC N PMEGVSFTLQ FLASLEKYDFDPQLG
Subjt:  LMAQQCTGLRCEPLFSIS---SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG

Query:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKG
        SKVKGTVVY EANGALVEIAAKSPAYLPLQEA IHRIKR+EEAGIYPG REEFVIIG+NEDD LTLSLRPIQYELAWERCRQLQA DVVVKGKVV ANKG
Subjt:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKG

Query:  GVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDRI
        GVLVVVEGLKGFVPFSEILMIST EELINKELPLK LVV EEQTR+VLSNRKVMADSKA+L IGSVVT TVL+L K+GAFVDIGG+HGLLHISEISHDRI
Subjt:  GVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDRI

Query:  RDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA
         D+A VLKPGDILKVM+LNIDREKGHIRLSTKKLEPN GDMI NPGLVF KAEEMA RFRQRLAQA
Subjt:  RDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA

A0A5A7UEP7 30S ribosomal protein S12.7e-15668.95Show/hide
Query:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG
        M  MAQQ TGLRC PL S   SKP    H+ NK +RS PV AA+IS PIP+PQT ERFKLKE F+ A +RCRN P+EG+SFTL+DF A+LEKYDFD +LG
Subjt:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG

Query:  SK-------------------------VKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYE
        +K                         VKGTV   + NGALV+I AKS AYLPLQEACIHRIK +EEAGI+PGLREEFVIIGENE DDSL LSLR IQY+
Subjt:  SK-------------------------VKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYE

Query:  LAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKL
        LAWERCRQLQAEDVVVKGKVVDANKGGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS+A+L IGSVVT TV  L
Subjt:  LAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKL

Query:  QKYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADL
        + YGAF+DIGGI+GLLH+S+ISHDRI D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEALARAD+
Subjt:  QKYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADL

Query:  LSFQPEGRLTLSSDGIFVPVTPELAVKRGLDIEQYVPP
        L FQPE  LTL++DGI  P+TPEL V+ GLD+   VPP
Subjt:  LSFQPEGRLTLSSDGIFVPVTPELAVKRGLDIEQYVPP

A0A6J1C966 30S ribosomal protein S1, chloroplastic6.2e-16172.88Show/hide
Query:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG
        M  MAQQ TGLRC PL S   S P    H+QNK ARS PV AA+ISSPIP+PQT ERFKLKE F++A +RCRN P+EG+SFTL+DF A+LEKYDFD ++G
Subjt:  MPLMAQQCTGLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLG

Query:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANK
        +KVKGTV   +ANGALV+I AKS AYLP+QEACIHRIK +EEAGI+PG+REEFVIIGENE DDSL LSLR IQY+LAWERCRQLQAEDVVVKGKVVDANK
Subjt:  SKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANK

Query:  GGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDR
        GGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS+A+L IGSVVT TV  L+ YGAF+DIGGI+GLLH+S+ISHDR
Subjt:  GGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDR

Query:  IRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA
        I D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTL++DGI  P+TPEL 
Subjt:  IRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELA

Query:  VKRGLDIEQYVPP
        V+ GLD+   VPP
Subjt:  VKRGLDIEQYVPP

SwissProt top hitse value%identityAlignment
O33698 30S ribosomal protein S11.3e-4136.65Show/hide
Query:  DFLASLEKYDFDPQLGSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVII-GENEDDSLTLSLRPIQYELAWERCRQL
        DF  +LE    D Q G  V+G V     +GA ++I  K+PA+LP +EA +H +  + EA +      EF++I  +NED  +T+SLR +  E AW R  +L
Subjt:  DFLASLEKYDFDPQLGSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVII-GENEDDSLTLSLRPIQYELAWERCRQL

Query:  QAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKA-ELAIGSVVTETVLKLQKYGAFVD
        Q     V+ KV  +NKGGV   +EGL+ F+P S +      + L  K L + FL VN    ++VLS R+    +   E+ +G ++   V  L+ +G FVD
Subjt:  QAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKA-ELAIGSVVTETVLKLQKYGAFVD

Query:  IGGIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRL
        +GG   LL I++IS   + DV A+ K GD ++ +++ ID  KG I LSTK LE + G+++ N   +   A + A R R++L
Subjt:  IGGIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRL

P29344 30S ribosomal protein S1, chloroplastic1.9e-14365.7Show/hide
Query:  MPLMAQQCT-GLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQL
        M  +AQQ   GLRC PL + + SKP    H      R  P+++A+    +   QT ER KLK+ F++A +RCRN PMEGVSFT+ DF  +L+KYDF+ ++
Subjt:  MPLMAQQCT-GLRCEPLFSIS-SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQL

Query:  GSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDAN
        GS+VKGTV   +ANGALV+I AKS AYLPL EACI+RIK +EEAGI PG+REEFVIIGENE DDSL LSLR IQYELAWERCRQLQAEDVVVKGK+V AN
Subjt:  GSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDAN

Query:  KGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHD
        KGGV+ +VEGL+GFVPFS+I   S+AEEL+ KE+PLKF+ V+EEQ+R+V+SNRK MADS+A+L IGSVVT TV  L+ YGAF+DIGGI+GLLH+S+ISHD
Subjt:  KGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHD

Query:  RIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPEL
        R+ D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTLSSDGI  P+T +L
Subjt:  RIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPEL

Query:  AVKRGLDIEQYVPP
          + GLD+   VPP
Subjt:  AVKRGLDIEQYVPP

P46228 30S ribosomal protein S17.3e-7449Show/hide
Query:  RNVPMEGVSFTLQDFLASLEKYDFDPQLGSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPI
        +++P   + FT +DF A L++YD+    G  V GTV   E  GAL++I AK+ A+LP+QE  I+R++  EE      +RE F++  ENED  LTLS+R I
Subjt:  RNVPMEGVSFTLQDFLASLEKYDFDPQLGSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPI

Query:  QYELAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKA-ELAIGSVVTET
        +Y  AWER RQLQ ED  V+ +V   N+GG LV +EGL+GF+P S I      E+L+ +ELPLKFL V+E++ R+VLS+R+ + + K   L +G VV   
Subjt:  QYELAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKA-ELAIGSVVTET

Query:  VLKLQKYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRL-AQAEAL
        V  ++ YGAF+DIGG+ GLLHISEISHD I    +V    D +KVMI+++D E+G I LSTK+LEP  GDM+ NP +V+EKAEEMA ++R++L  QAE L
Subjt:  VLKLQKYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRL-AQAEAL

P73530 30S ribosomal protein S1 homolog A4.9e-7049.49Show/hide
Query:  VSFTLQDFLASLEKYDFDPQLGSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWE
        + FTL+DF A L+KYD+    G  V GTV   E+ GAL++I AK+ AY+P+QE  I+R+   EE       RE F++  ENED  LTLS+R I+Y  AWE
Subjt:  VSFTLQDFLASLEKYDFDPQLGSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWE

Query:  RCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAE-LAIGSVVTETVLKLQKY
        R RQLQAED  V+  V   N+GG LV +EGL+GF+P S I      E+L+ ++LPLKFL V+EE+ R+VLS+R+ + + K   L +  VV  +V  ++ Y
Subjt:  RCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAE-LAIGSVVTETVLKLQKY

Query:  GAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQ-RLAQAEAL
        GAF+DIGG+ GLLHISEISHD I    +V    D +KVMI+++D E+G I LSTK+LEP  G M+ +  LV E A+EMA  FRQ RLA+A+ +
Subjt:  GAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQ-RLAQAEAL

Q93VC7 30S ribosomal protein S1, chloroplastic1.4e-13863.5Show/hide
Query:  MPLMAQQCTGLRCEPLFSIS--SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQL
        M  +AQQ +GLRC PL S S  S+   ++  QNK A   P + A ++  + + QT ER +LK+ F++A +RCR  PMEGV+FT+ DF A++E+YDF+ ++
Subjt:  MPLMAQQCTGLRCEPLFSIS--SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQL

Query:  GSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDAN
        G++VKGTV   +ANGALV+I+AKS AYL +++ACIHRIK +EEAGI PG+ EEFVIIGENE DDSL LSLR IQYELAWERCRQLQAEDV+VK KV+ AN
Subjt:  GSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDAN

Query:  KGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHD
        KGG++ +VEGL+GFVPFS+I   + AEEL+ KE+PLKF+ V+EEQT++VLSNRK +ADS+A+L IGSVV   V  L+ YGAF+DIGGI+GLLH+S+ISHD
Subjt:  KGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHD

Query:  RIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPEL
        R+ D+A VL+PGD LKVMIL+ DR++G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTLSSDGI  P+  EL
Subjt:  RIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPEL

Arabidopsis top hitse value%identityAlignment
AT1G71720.1 Nucleic acid-binding proteins superfamily2.1e-2031.93Show/hide
Query:  IIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEIL-MISTAEELINKELPLKFLV----VNEEQTRVVLS
        ++G        LS R     +AW R RQ++  +  ++ K+ + N GG+L  +EGL+ F+P  E++  ++T  EL  + +  +FLV    +NE++  ++LS
Subjt:  IIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEIL-MISTAEELINKELPLKFLV----VNEEQTRVVLS

Query:  NRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIG--GIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGL
         +  +A  K  L  G+++  TV+K+  YGA V +G     GLLHIS I+  RI  V+ VL+  + +KV+++        I LS   LE   G  I +   
Subjt:  NRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIG--GIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGL

Query:  VFEKAEEMAHRFRQRL---AQAEALARADLLSFQPEGR
        VF +AEEMA ++R+++   A +    R  + S  P+G+
Subjt:  VFEKAEEMAHRFRQRL---AQAEALARADLLSFQPEGR

AT3G11964.1 RNA binding;RNA binding2.1e-0727.62Show/hide
Query:  WERCRQLQAEDVVVKGKVVDA-NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLV------VNEEQTRVVLSNRKVMADSK--------AEL
        +ER   L + D+ V+G V +  +KG  +++   ++  V  S +      E    KE P+  LV      V     R+ ++ + V A  +         +L
Subjt:  WERCRQLQAEDVVVKGKVVDA-NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLV------VNEEQTRVVLSNRKVMADSK--------AEL

Query:  AIGSVVTETVLKLQKYGAFVDIG--GIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGD
         +G +++  + +++ +G F+DI   G+ GL HIS++S DR+ +V A  K G+ ++  IL +D EK  I L  K      GD
Subjt:  AIGSVVTETVLKLQKYGAFVDIG--GIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGD

AT3G23700.1 Nucleic acid-binding proteins superfamily4.9e-1731.07Show/hide
Query:  WERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEE-----------LINKELPLKFLVVNEEQTRVVLSNRKVMADSKAE-LAIG
        W+  +         +G+V   N GG+L+    L GF+P+ ++    + +E           L+  +LP+K +  +EE  +++LS +  +    ++ + +G
Subjt:  WERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEE-----------LINKELPLKFLVVNEEQTRVVLSNRKVMADSKAE-LAIG

Query:  SVVTETVLKLQKYGAFV----DIGGIH--GLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLE
         V    V  ++ YGAF+    D G  H  GL+H+SE+S D ++DV  VL+ GD ++V++ NID+EK  I LS K+LE
Subjt:  SVVTETVLKLQKYGAFV----DIGGIH--GLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLE

AT5G14580.1 polyribonucleotide nucleotidyltransferase, putative2.9e-0935.78Show/hide
Query:  LVVNEEQTRVVLSNRKVMADSK--------AELAIGSVVTETVLKLQKYGAFVDI-GGIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHI
        L ++     +V  N+ VM  ++         EL +G V   TV  +++YGAFV+  GG  GLLH+SE+SH+ +  V+ VL  G  +  M +  D  +G+I
Subjt:  LVVNEEQTRVVLSNRKVMADSK--------AELAIGSVVTETVLKLQKYGAFVDI-GGIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHI

Query:  RLSTKKLEP
        +LS K L P
Subjt:  RLSTKKLEP

AT5G30510.1 ribosomal protein S11.0e-13963.5Show/hide
Query:  MPLMAQQCTGLRCEPLFSIS--SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQL
        M  +AQQ +GLRC PL S S  S+   ++  QNK A   P + A ++  + + QT ER +LK+ F++A +RCR  PMEGV+FT+ DF A++E+YDF+ ++
Subjt:  MPLMAQQCTGLRCEPLFSIS--SKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQL

Query:  GSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDAN
        G++VKGTV   +ANGALV+I+AKS AYL +++ACIHRIK +EEAGI PG+ EEFVIIGENE DDSL LSLR IQYELAWERCRQLQAEDV+VK KV+ AN
Subjt:  GSKVKGTVVYAEANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDAN

Query:  KGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHD
        KGG++ +VEGL+GFVPFS+I   + AEEL+ KE+PLKF+ V+EEQT++VLSNRK +ADS+A+L IGSVV   V  L+ YGAF+DIGGI+GLLH+S+ISHD
Subjt:  KGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHD

Query:  RIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPEL
        R+ D+A VL+PGD LKVMIL+ DR++G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTLSSDGI  P+  EL
Subjt:  RIRDVAAVLKPGDILKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTTGATGGCTCAGCAATGCACAGGGTTGAGATGTGAGCCTCTGTTTTCAATTTCCTCCAAGCCACTTGGTCGGAGCCATATGCAGAACAAGGTAGCCCGTTCATT
CCCAGTTTTGGCTGCACTAATATCGAGCCCTATTCCCACTCCTCAGACCACAGAGCGTTTCAAGCTCAAGGAGACCTTCAAGAATGCGGCCGATCGCTGCCGTAATGTTC
CCATGGAAGGTGTCTCCTTCACTCTCCAAGACTTCCTTGCCTCTCTTGAGAAATACGACTTTGATCCTCAATTGGGATCCAAGGTGAAAGGTACTGTGGTCTATGCAGAA
GCTAATGGAGCACTTGTGGAGATTGCTGCCAAGTCCCCTGCATACTTGCCCCTGCAGGAGGCTTGCATTCATAGAATAAAACGTATAGAAGAAGCAGGAATATATCCTGG
TTTAAGAGAGGAGTTTGTTATTATAGGTGAGAATGAAGATGATAGCTTGACTTTGAGCTTGAGGCCCATCCAATATGAACTTGCTTGGGAAAGGTGCAGACAGCTTCAAG
CAGAGGATGTTGTTGTCAAGGGTAAGGTGGTTGATGCGAACAAAGGGGGAGTTTTGGTAGTTGTGGAAGGCCTAAAAGGATTTGTTCCTTTCTCAGAGATATTAATGATA
TCAACTGCTGAAGAGCTTATCAACAAGGAGCTTCCTCTGAAATTTCTGGTGGTTAATGAGGAACAAACGAGGGTTGTCCTCAGTAACCGTAAGGTCATGGCTGACAGCAA
GGCAGAACTTGCAATTGGATCAGTGGTCACTGAAACAGTTCTAAAACTTCAAAAGTATGGTGCCTTTGTTGACATCGGTGGAATCCATGGTCTTCTTCACATCAGTGAGA
TAAGTCATGATCGCATAAGAGATGTTGCAGCAGTTCTTAAGCCTGGAGACATTCTCAAGGTCATGATATTGAACATTGATCGTGAAAAAGGCCATATTCGTCTTTCTACA
AAGAAGCTAGAGCCTAATACTGGGGACATGATTTGCAATCCAGGGCTTGTTTTTGAAAAGGCTGAGGAAATGGCACATAGATTTAGGCAAAGATTAGCTCAAGCAGAGGC
ATTGGCACGTGCAGACTTGCTTAGTTTTCAGCCTGAGGGCAGATTAACTTTGAGCAGTGATGGAATATTTGTTCCAGTTACCCCAGAATTGGCTGTAAAGAGAGGGTTAG
ATATTGAACAATATGTTCCTCCCTTCAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTTTGATGGCTCAGCAATGCACAGGGTTGAGATGTGAGCCTCTGTTTTCAATTTCCTCCAAGCCACTTGGTCGGAGCCATATGCAGAACAAGGTAGCCCGTTCATT
CCCAGTTTTGGCTGCACTAATATCGAGCCCTATTCCCACTCCTCAGACCACAGAGCGTTTCAAGCTCAAGGAGACCTTCAAGAATGCGGCCGATCGCTGCCGTAATGTTC
CCATGGAAGGTGTCTCCTTCACTCTCCAAGACTTCCTTGCCTCTCTTGAGAAATACGACTTTGATCCTCAATTGGGATCCAAGGTGAAAGGTACTGTGGTCTATGCAGAA
GCTAATGGAGCACTTGTGGAGATTGCTGCCAAGTCCCCTGCATACTTGCCCCTGCAGGAGGCTTGCATTCATAGAATAAAACGTATAGAAGAAGCAGGAATATATCCTGG
TTTAAGAGAGGAGTTTGTTATTATAGGTGAGAATGAAGATGATAGCTTGACTTTGAGCTTGAGGCCCATCCAATATGAACTTGCTTGGGAAAGGTGCAGACAGCTTCAAG
CAGAGGATGTTGTTGTCAAGGGTAAGGTGGTTGATGCGAACAAAGGGGGAGTTTTGGTAGTTGTGGAAGGCCTAAAAGGATTTGTTCCTTTCTCAGAGATATTAATGATA
TCAACTGCTGAAGAGCTTATCAACAAGGAGCTTCCTCTGAAATTTCTGGTGGTTAATGAGGAACAAACGAGGGTTGTCCTCAGTAACCGTAAGGTCATGGCTGACAGCAA
GGCAGAACTTGCAATTGGATCAGTGGTCACTGAAACAGTTCTAAAACTTCAAAAGTATGGTGCCTTTGTTGACATCGGTGGAATCCATGGTCTTCTTCACATCAGTGAGA
TAAGTCATGATCGCATAAGAGATGTTGCAGCAGTTCTTAAGCCTGGAGACATTCTCAAGGTCATGATATTGAACATTGATCGTGAAAAAGGCCATATTCGTCTTTCTACA
AAGAAGCTAGAGCCTAATACTGGGGACATGATTTGCAATCCAGGGCTTGTTTTTGAAAAGGCTGAGGAAATGGCACATAGATTTAGGCAAAGATTAGCTCAAGCAGAGGC
ATTGGCACGTGCAGACTTGCTTAGTTTTCAGCCTGAGGGCAGATTAACTTTGAGCAGTGATGGAATATTTGTTCCAGTTACCCCAGAATTGGCTGTAAAGAGAGGGTTAG
ATATTGAACAATATGTTCCTCCCTTCAGCTGA
Protein sequenceShow/hide protein sequence
MPLMAQQCTGLRCEPLFSISSKPLGRSHMQNKVARSFPVLAALISSPIPTPQTTERFKLKETFKNAADRCRNVPMEGVSFTLQDFLASLEKYDFDPQLGSKVKGTVVYAE
ANGALVEIAAKSPAYLPLQEACIHRIKRIEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMI
STAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSKAELAIGSVVTETVLKLQKYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDILKVMILNIDREKGHIRLST
KKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLSSDGIFVPVTPELAVKRGLDIEQYVPPFS