; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G04560 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G04560
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description30S ribosomal protein S1, chloroplastic
Genome locationClcChr06:4722360..4725705
RNA-Seq ExpressionClc06G04560
SyntenyClc06G04560
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0022627 - cytosolic small ribosomal subunit (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1
IPR035104 - Ribosomal protein S1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022958818.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Cucurbita moschata]1.9e-16480.2Show/hide
Query:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN
        MSS AHQ CGL S     PLSS+  S       RFSP         V +AA+SP+PISNAQTK RLKLKQLFKEAYERCC  PMDG+SFTLEDFHAALAN
Subjt:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN

Query:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV
        YDFV+ELGTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEE GIYPGLEEEFVII   EQED   LILSL+SVQYGLAWERCRQLQAED V
Subjt:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV

Query:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL
        IKGKVV   KGG+VVLVEGL+GFVPFSQISAKST EELLNKEL LKFVEVDEKL RL+LSNSKAIVSSQ+ELRIGSVVTG VQ LKPYGAF+DIGG+NGL
Subjt:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL

Query:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLEVLI
        LHVSQISQNHI DIA VLQPGDMLKVMILSYD ++GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MARA LL FQ E+L+
Subjt:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLEVLI

XP_022958819.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Cucurbita moschata]9.4e-16480.56Show/hide
Query:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN
        MSS AHQ CGL S     PLSS+  S       RFSP         V +AA+SP+PISNAQTK RLKLKQLFKEAYERCC  PMDG+SFTLEDFHAALAN
Subjt:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN

Query:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV
        YDFV+ELGTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEE GIYPGLEEEFVII   EQED   LILSL+SVQYGLAWERCRQLQAED V
Subjt:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV

Query:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL
        IKGKVV   KGG+VVLVEGL+GFVPFSQISAKST EELLNKEL LKFVEVDEKL RL+LSNSKAIVSSQ+ELRIGSVVTG VQ LKPYGAF+DIGG+NGL
Subjt:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL

Query:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE
        LHVSQISQNHI DIA VLQPGDMLKVMILSYD ++GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MARA LL FQ E
Subjt:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE

XP_023006590.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Cucurbita maxima]8.5e-16580.46Show/hide
Query:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN
        MSS AHQ CGL S     PLSS+  S       RFSP         V +AA+SP+PISNAQTK RLKLKQLFKEAYERCC  PMDG+SFTLEDFHA+LAN
Subjt:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN

Query:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV
        YDFV+E+GTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEE GIYPGLEEEFVII   EQED   LILSL+SVQYGLAWERCRQLQAED V
Subjt:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV

Query:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL
        IKGKVV   KGG+VVLVEGL+GFVPFSQISAKST EELLNKEL LKFVEVDEKLSRLVLSNSKAI SSQ+ELRIGSVVTG VQ LKPYGAFIDIGG+NGL
Subjt:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL

Query:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLEVLI
        LHVSQISQNHISDIA VLQPGDMLKVMILSYD ++GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MARA LL FQ E+L+
Subjt:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLEVLI

XP_023006591.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Cucurbita maxima]4.2e-16480.82Show/hide
Query:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN
        MSS AHQ CGL S     PLSS+  S       RFSP         V +AA+SP+PISNAQTK RLKLKQLFKEAYERCC  PMDG+SFTLEDFHA+LAN
Subjt:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN

Query:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV
        YDFV+E+GTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEE GIYPGLEEEFVII   EQED   LILSL+SVQYGLAWERCRQLQAED V
Subjt:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV

Query:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL
        IKGKVV   KGG+VVLVEGL+GFVPFSQISAKST EELLNKEL LKFVEVDEKLSRLVLSNSKAI SSQ+ELRIGSVVTG VQ LKPYGAFIDIGG+NGL
Subjt:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL

Query:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE
        LHVSQISQNHISDIA VLQPGDMLKVMILSYD ++GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MARA LL FQ E
Subjt:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE

XP_038906593.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]4.1e-19190.77Show/hide
Query:  MSSSAHQP-CGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALA
        MSSSAHQP CGL SRYSY PLSS R SASSWNWNRF P +  K+LPLVSAAASSPSPISNAQTK RLKLKQLFKEAYERCCT+PMDG+SFTLEDFHAALA
Subjt:  MSSSAHQP-CGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALA

Query:  NYDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAWERCRQLQAEDIVI
        +YDFV+ELGTKVKGTVFCT+ANGALVD T KGTAYLPTQEACILKI+HVEE GIYPGLEEEF+IIAEQEDGDGLILSL+SVQYGLAWERCRQLQAEDIVI
Subjt:  NYDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAWERCRQLQAEDIVI

Query:  KGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGLL
        KGKVVGATKGG+VVLVEGLRGFVPFSQISAKST EELLNKELRLKFVEVDE+LSRL+LSNSKAIV SQAELRIGSVVTGTVQ LKPYGAFIDIGGINGLL
Subjt:  KGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGLL

Query:  HVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE
        HVSQISQNHI DIATVLQPGD+LKVMILSYD NKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQ FRQRIAQAE MARAGLLG Q E
Subjt:  HVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE

TrEMBL top hitse value%identityAlignment
A0A6J1CCC9 30S ribosomal protein S1, chloroplastic-like isoform X16.0e-16480.42Show/hide
Query:  LNSRYSYPPLSSSRFSASSWNWNRF----SPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALANYDFVTE
        +NS     PLSS R S SS +W RF    S N + + LP+VS+AAS+ +PISNAQTK RLKLKQLFKEAYERCCT PMDG+SFTLEDFHAAL+NYDFV+E
Subjt:  LNSRYSYPPLSSSRFSASSWNWNRF----SPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALANYDFVTE

Query:  LGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAWERCRQLQAEDIVIKGKVVGA
        LGTKVKGTVF TDA+GALVDTTAKGTAYLP +EACILKIRHVEE GIYPGLEEEFVIIAE E    LILSL+ +QYGLAWERCRQLQAED+VIKGKVVGA
Subjt:  LGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAWERCRQLQAEDIVIKGKVVGA

Query:  TKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGLLHVSQISQ
         KGG+ VLVEGLRGFVPFSQISAKST EELL+KELRLKFVEVDE+LSRL+LSN KAI +SQAELRIGSVVTGTVQ LK YGAFIDIGG+NGLLH+SQIS 
Subjt:  TKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGLLHVSQISQ

Query:  NHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE
        NHISD+ATVL+PGD LKVMILSYD  +GRVSLSTK LEPTPGDMIHNPKLVFEKADEMAQ FRQRIAQAE MAR  LL  Q E
Subjt:  NHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE

A0A6J1H453 30S ribosomal protein S1, chloroplastic-like isoform X19.2e-16580.2Show/hide
Query:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN
        MSS AHQ CGL S     PLSS+  S       RFSP         V +AA+SP+PISNAQTK RLKLKQLFKEAYERCC  PMDG+SFTLEDFHAALAN
Subjt:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN

Query:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV
        YDFV+ELGTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEE GIYPGLEEEFVII   EQED   LILSL+SVQYGLAWERCRQLQAED V
Subjt:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV

Query:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL
        IKGKVV   KGG+VVLVEGL+GFVPFSQISAKST EELLNKEL LKFVEVDEKL RL+LSNSKAIVSSQ+ELRIGSVVTG VQ LKPYGAF+DIGG+NGL
Subjt:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL

Query:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLEVLI
        LHVSQISQNHI DIA VLQPGDMLKVMILSYD ++GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MARA LL FQ E+L+
Subjt:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLEVLI

A0A6J1H681 30S ribosomal protein S1, chloroplastic-like isoform X24.6e-16480.56Show/hide
Query:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN
        MSS AHQ CGL S     PLSS+  S       RFSP         V +AA+SP+PISNAQTK RLKLKQLFKEAYERCC  PMDG+SFTLEDFHAALAN
Subjt:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN

Query:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV
        YDFV+ELGTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEE GIYPGLEEEFVII   EQED   LILSL+SVQYGLAWERCRQLQAED V
Subjt:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV

Query:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL
        IKGKVV   KGG+VVLVEGL+GFVPFSQISAKST EELLNKEL LKFVEVDEKL RL+LSNSKAIVSSQ+ELRIGSVVTG VQ LKPYGAF+DIGG+NGL
Subjt:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL

Query:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE
        LHVSQISQNHI DIA VLQPGDMLKVMILSYD ++GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MARA LL FQ E
Subjt:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE

A0A6J1KY62 30S ribosomal protein S1, chloroplastic-like isoform X22.0e-16480.82Show/hide
Query:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN
        MSS AHQ CGL S     PLSS+  S       RFSP         V +AA+SP+PISNAQTK RLKLKQLFKEAYERCC  PMDG+SFTLEDFHA+LAN
Subjt:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN

Query:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV
        YDFV+E+GTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEE GIYPGLEEEFVII   EQED   LILSL+SVQYGLAWERCRQLQAED V
Subjt:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV

Query:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL
        IKGKVV   KGG+VVLVEGL+GFVPFSQISAKST EELLNKEL LKFVEVDEKLSRLVLSNSKAI SSQ+ELRIGSVVTG VQ LKPYGAFIDIGG+NGL
Subjt:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL

Query:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE
        LHVSQISQNHISDIA VLQPGDMLKVMILSYD ++GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MARA LL FQ E
Subjt:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE

A0A6J1L0J3 30S ribosomal protein S1, chloroplastic-like isoform X14.1e-16580.46Show/hide
Query:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN
        MSS AHQ CGL S     PLSS+  S       RFSP         V +AA+SP+PISNAQTK RLKLKQLFKEAYERCC  PMDG+SFTLEDFHA+LAN
Subjt:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN

Query:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV
        YDFV+E+GTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEE GIYPGLEEEFVII   EQED   LILSL+SVQYGLAWERCRQLQAED V
Subjt:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIA--EQEDGDGLILSLKSVQYGLAWERCRQLQAEDIV

Query:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL
        IKGKVV   KGG+VVLVEGL+GFVPFSQISAKST EELLNKEL LKFVEVDEKLSRLVLSNSKAI SSQ+ELRIGSVVTG VQ LKPYGAFIDIGG+NGL
Subjt:  IKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGL

Query:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLEVLI
        LHVSQISQNHISDIA VLQPGDMLKVMILSYD ++GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAE MARA LL FQ E+L+
Subjt:  LHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLEVLI

SwissProt top hitse value%identityAlignment
P29344 30S ribosomal protein S1, chloroplastic5.0e-14471.39Show/hide
Query:  PPLSSSRFSASSWNWNRFSPNQ--RPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALANYDFVTELGTKVKGTV
        PPLS+S  S        FSP    +P+  P+VSA A     +SNAQT+ R KLKQLF++AYERC  APM+G+SFT++DFH AL  YDF +E+G++VKGTV
Subjt:  PPLSSSRFSASSWNWNRFSPNQ--RPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALANYDFVTELGTKVKGTV

Query:  FCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAWERCRQLQAEDIVIKGKVVGATKGGLVVLV
        FCTDANGALVD TAK +AYLP  EACI +I++VEE GI PG+ EEFVII E E  D LILSL+ +QY LAWERCRQLQAED+V+KGK+VGA KGG+V LV
Subjt:  FCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAWERCRQLQAEDIVIKGKVVGATKGGLVVLV

Query:  EGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGLLHVSQISQNHISDIATV
        EGLRGFVPFSQIS+KS+ EELL KE+ LKFVEVDE+ SRLV+SN KA+  SQA+L IGSVVTGTVQ LKPYGAFIDIGGINGLLHVSQIS + +SDIATV
Subjt:  EGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGLLHVSQISQNHISDIATV

Query:  LQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE
        LQPGD LKVMILS+D  +GRVSLSTKKLEPTPGDMI NPKLVFEKA+EMAQ FRQRIAQAE MARA +L FQ E
Subjt:  LQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE

P46228 30S ribosomal protein S11.1e-7147.78Show/hide
Query:  PMDGISFTLEDFHAALANYDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQY
        P   I FT EDF A L  YD+    G  V GTVF  +  GAL+D  AK  A+LP QE  I ++   EEV + P    EF I++++ +   L LS++ ++Y
Subjt:  PMDGISFTLEDFHAALANYDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQY

Query:  GLAWERCRQLQAEDIVIKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQA-ELRIGSVVTGTVQ
          AWER RQLQ ED  ++ +V    +GG +V +EGLRGF+P S IS +   E+L+ +EL LKF+EVDE  +RLVLS+ +A+V  +   L +G VV G V+
Subjt:  GLAWERCRQLQAEDIVIKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQA-ELRIGSVVTGTVQ

Query:  FLKPYGAFIDIGGINGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQ
         +KPYGAFIDIGG++GLLH+S+IS +HI    +V    D +KVMI+  D  +GR+SLSTK+LEP PGDM+ NP++V+EKA+EMA  +R+++ Q
Subjt:  FLKPYGAFIDIGGINGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQ

P51345 30S ribosomal protein S1, chloroplastic5.2e-4035.5Show/hide
Query:  SFTLEDFHAALANYDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLE----EEFVIIAEQEDGDGLILSLKSVQYG
        SFT  +F A L  Y +   LG  V GT+F  + NG LVD     +AYLP QE     +   +E+  +  L      EF ++    +   LILS++ ++Y 
Subjt:  SFTLEDFHAALANYDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLE----EEFVIIAEQEDGDGLILSLKSVQYG

Query:  LAWERCRQLQAEDIVIKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVS-SQAELRIGSVVTGTVQF
         AW+R RQL AED ++  ++ G  KGG++V +EG+ GFVP S ++  S      NK ++LK + V+EK + L+LS+ +A+++ + + L +G+++ G +  
Subjt:  LAWERCRQLQAEDIVIKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVS-SQAELRIGSVVTGTVQF

Query:  LKPYGAFIDIGGINGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLE
        + PYG FI  G + GL+H+S+I+   +  I +  + GD +K +I+  D  +GR+SLS K L+
Subjt:  LKPYGAFIDIGGINGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLE

P73530 30S ribosomal protein S1 homolog A1.3e-7048.97Show/hide
Query:  ISFTLEDFHAALANYDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAW
        I FTLEDF A L  YD+    G  V GTVF  ++ GAL+D  AK  AY+P QE  I ++   EEV + P    EF I+ ++ +   L LS++ ++Y  AW
Subjt:  ISFTLEDFHAALANYDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAW

Query:  ERCRQLQAEDIVIKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAE-LRIGSVVTGTVQFLKP
        ER RQLQAED  ++  V    +GG +V +EGLRGF+P S ISA+   E+L+ ++L LKF+EVDE+ +RLVLS+ +A+V  +   L +  VV G+V+ +KP
Subjt:  ERCRQLQAEDIVIKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAE-LRIGSVVTGTVQFLKP

Query:  YGAFIDIGGINGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQ-RIAQAE
        YGAFIDIGG++GLLH+S+IS +HI    +V    D +KVMI+  D  +GR+SLSTK+LEP PG M+ +  LV E ADEMA+IFRQ R+A+A+
Subjt:  YGAFIDIGGINGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQ-RIAQAE

Q93VC7 30S ribosomal protein S1, chloroplastic3.9e-13666.07Show/hide
Query:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN
        M+S A Q  GL      P  SSSR S  +     F  N+   V P + AA +    +S+ QTK RL+LK++F++AYERC T+PM+G++FT++DF AA+  
Subjt:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN

Query:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAWERCRQLQAEDIVIK
        YDF +E+GT+VKGTVF TDANGALVD +AK +AYL  ++ACI +I+HVEE GI PG+ EEFVII E E  D L+LSL+++QY LAWERCRQLQAED+++K
Subjt:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAWERCRQLQAEDIVIK

Query:  GKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGLLH
         KV+GA KGGLV LVEGLRGFVPFSQIS+K+  EELL KE+ LKFVEVDE+ ++LVLSN KA+  SQA+L IGSVV G VQ LKPYGAFIDIGGINGLLH
Subjt:  GKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGLLH

Query:  VSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE
        VSQIS + +SDIATVLQPGD LKVMILS+D ++GRVSLSTKKLEPTPGDMI NPKLVFEKA+EMAQ FRQRIAQAE MARA +L FQ E
Subjt:  VSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE

Arabidopsis top hitse value%identityAlignment
AT1G71720.1 Nucleic acid-binding proteins superfamily2.1e-2030.95Show/hide
Query:  ILSLKSVQYGLAWERCRQLQAEDIVIKGKVVGATKGGLVVLVEGLRGFVPFSQISAK-STGEEL---LNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAE
        +LS +     +AW R RQ++  +  I+ K+     GGL+  +EGLR F+P  ++  K +T  EL   + +   ++   ++E  + L+L  S+ +   +  
Subjt:  ILSLKSVQYGLAWERCRQLQAEDIVIKGKVVGATKGGLVVLVEGLRGFVPFSQISAK-STGEEL---LNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAE

Query:  LRIGSVVTGTVQFLKPYGAFIDIG--GINGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQI
        LR G+++ GTV  + PYGA + +G    +GLLH+S I++  I  ++ VLQ  + +KV+++       ++SLS   LE  PG  I + + VF +A+EMA+ 
Subjt:  LRIGSVVTGTVQFLKPYGAFIDIG--GINGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQI

Query:  FRQRIAQAET
        +R+++    T
Subjt:  FRQRIAQAET

AT3G23700.1 Nucleic acid-binding proteins superfamily2.0e-1830Show/hide
Query:  WERCRQLQAEDIVIKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEE-----------LLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAE-LRIG
        W+  +         +G+V G   GGL++    L GF+P+ Q+S   + +E           L+  +L +K V+ DE+  +L+LS   A+    ++ + +G
Subjt:  WERCRQLQAEDIVIKGKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEE-----------LLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAE-LRIG

Query:  SVVTGTVQFLKPYGAFIDIG------GINGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTP
         V  G V  ++ YGAFI +        + GL+HVS++S +++ D+  VL+ GD ++V++ + D  K R++LS K+LE  P
Subjt:  SVVTGTVQFLKPYGAFIDIG------GINGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTP

AT4G29060.1 elongation factor Ts family protein3.4e-1037.5Show/hide
Query:  ELRIGSVVTGTVQFLKPYGAFIDIGGI-NGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTP
        EL  G+  TG V+ ++P+GAF+D G   +GL+HVSQ+S N + D+++V+  G  +KV ++  D    R+SL+ ++ +  P
Subjt:  ELRIGSVVTGTVQFLKPYGAFIDIGGI-NGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTP

AT4G29060.2 elongation factor Ts family protein3.4e-1037.5Show/hide
Query:  ELRIGSVVTGTVQFLKPYGAFIDIGGI-NGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTP
        EL  G+  TG V+ ++P+GAF+D G   +GL+HVSQ+S N + D+++V+  G  +KV ++  D    R+SL+ ++ +  P
Subjt:  ELRIGSVVTGTVQFLKPYGAFIDIGGI-NGLLHVSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTP

AT5G30510.1 ribosomal protein S12.8e-13766.07Show/hide
Query:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN
        M+S A Q  GL      P  SSSR S  +     F  N+   V P + AA +    +S+ QTK RL+LK++F++AYERC T+PM+G++FT++DF AA+  
Subjt:  MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALAN

Query:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAWERCRQLQAEDIVIK
        YDF +E+GT+VKGTVF TDANGALVD +AK +AYL  ++ACI +I+HVEE GI PG+ EEFVII E E  D L+LSL+++QY LAWERCRQLQAED+++K
Subjt:  YDFVTELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAWERCRQLQAEDIVIK

Query:  GKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGLLH
         KV+GA KGGLV LVEGLRGFVPFSQIS+K+  EELL KE+ LKFVEVDE+ ++LVLSN KA+  SQA+L IGSVV G VQ LKPYGAFIDIGGINGLLH
Subjt:  GKVVGATKGGLVVLVEGLRGFVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGLLH

Query:  VSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE
        VSQIS + +SDIATVLQPGD LKVMILS+D ++GRVSLSTKKLEPTPGDMI NPKLVFEKA+EMAQ FRQRIAQAE MARA +L FQ E
Subjt:  VSQISQNHISDIATVLQPGDMLKVMILSYDCNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCATCGGCGCATCAACCTTGTGGGTTGAACTCGAGGTATTCGTATCCTCCTCTTTCTTCATCGCGGTTTTCAGCTTCGAGTTGGAACTGGAACCGCTTTTCTCC
CAACCAACGGCCTAAAGTGCTTCCATTAGTTTCAGCTGCAGCTTCTTCCCCTTCTCCCATTTCCAATGCGCAGACCAAAGGGCGCCTTAAACTCAAGCAACTCTTCAAGG
AAGCTTATGAACGCTGCTGTACTGCCCCCATGGATGGCATCTCCTTCACTCTTGAAGACTTCCATGCCGCTCTTGCAAATTACGACTTTGTTACTGAACTCGGAACCAAG
GTTAAGGGTACTGTATTCTGTACCGATGCTAATGGGGCACTAGTTGATACTACTGCAAAGGGAACTGCATACTTGCCCACTCAAGAGGCATGCATTCTTAAAATAAGACA
TGTAGAAGAAGTAGGCATATATCCTGGTTTAGAAGAGGAGTTCGTAATTATTGCTGAACAGGAAGATGGCGATGGCTTAATTCTGAGCTTGAAAAGTGTCCAGTATGGCC
TTGCTTGGGAGCGATGCAGACAACTCCAAGCTGAGGATATTGTTATCAAGGGTAAGGTTGTTGGTGCAACCAAAGGGGGATTAGTTGTTCTTGTGGAAGGTCTTAGAGGC
TTTGTTCCTTTCTCTCAGATATCAGCAAAATCAACTGGAGAGGAGCTTCTTAATAAAGAGCTACGTCTGAAGTTTGTGGAGGTTGATGAGAAACTATCTCGGCTAGTCTT
AAGTAATTCCAAGGCCATTGTCAGTAGTCAGGCAGAGCTAAGAATTGGTTCAGTAGTTACTGGAACCGTGCAGTTTCTGAAACCATATGGAGCCTTTATTGACATCGGAG
GAATTAATGGGCTTCTTCATGTTAGTCAAATCAGTCAAAATCACATATCAGATATTGCAACCGTTCTTCAACCAGGAGATATGCTTAAGGTCATGATTTTGAGCTATGAC
TGCAACAAAGGCCGTGTTAGTCTTTCTACCAAGAAATTGGAACCTACTCCTGGAGACATGATTCACAATCCAAAGCTTGTCTTTGAGAAGGCGGACGAGATGGCTCAGAT
ATTCAGGCAAAGAATAGCTCAAGCAGAAACAATGGCTCGTGCAGGCCTTCTCGGATTTCAGCTTGAGGTACTTATTTATTTGGCTTGGATTTTTCAACTTAATTCATTGA
AAATGTTGGAGCGTGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCATCGGCGCATCAACCTTGTGGGTTGAACTCGAGGTATTCGTATCCTCCTCTTTCTTCATCGCGGTTTTCAGCTTCGAGTTGGAACTGGAACCGCTTTTCTCC
CAACCAACGGCCTAAAGTGCTTCCATTAGTTTCAGCTGCAGCTTCTTCCCCTTCTCCCATTTCCAATGCGCAGACCAAAGGGCGCCTTAAACTCAAGCAACTCTTCAAGG
AAGCTTATGAACGCTGCTGTACTGCCCCCATGGATGGCATCTCCTTCACTCTTGAAGACTTCCATGCCGCTCTTGCAAATTACGACTTTGTTACTGAACTCGGAACCAAG
GTTAAGGGTACTGTATTCTGTACCGATGCTAATGGGGCACTAGTTGATACTACTGCAAAGGGAACTGCATACTTGCCCACTCAAGAGGCATGCATTCTTAAAATAAGACA
TGTAGAAGAAGTAGGCATATATCCTGGTTTAGAAGAGGAGTTCGTAATTATTGCTGAACAGGAAGATGGCGATGGCTTAATTCTGAGCTTGAAAAGTGTCCAGTATGGCC
TTGCTTGGGAGCGATGCAGACAACTCCAAGCTGAGGATATTGTTATCAAGGGTAAGGTTGTTGGTGCAACCAAAGGGGGATTAGTTGTTCTTGTGGAAGGTCTTAGAGGC
TTTGTTCCTTTCTCTCAGATATCAGCAAAATCAACTGGAGAGGAGCTTCTTAATAAAGAGCTACGTCTGAAGTTTGTGGAGGTTGATGAGAAACTATCTCGGCTAGTCTT
AAGTAATTCCAAGGCCATTGTCAGTAGTCAGGCAGAGCTAAGAATTGGTTCAGTAGTTACTGGAACCGTGCAGTTTCTGAAACCATATGGAGCCTTTATTGACATCGGAG
GAATTAATGGGCTTCTTCATGTTAGTCAAATCAGTCAAAATCACATATCAGATATTGCAACCGTTCTTCAACCAGGAGATATGCTTAAGGTCATGATTTTGAGCTATGAC
TGCAACAAAGGCCGTGTTAGTCTTTCTACCAAGAAATTGGAACCTACTCCTGGAGACATGATTCACAATCCAAAGCTTGTCTTTGAGAAGGCGGACGAGATGGCTCAGAT
ATTCAGGCAAAGAATAGCTCAAGCAGAAACAATGGCTCGTGCAGGCCTTCTCGGATTTCAGCTTGAGGTACTTATTTATTTGGCTTGGATTTTTCAACTTAATTCATTGA
AAATGTTGGAGCGTGATTAA
Protein sequenceShow/hide protein sequence
MSSSAHQPCGLNSRYSYPPLSSSRFSASSWNWNRFSPNQRPKVLPLVSAAASSPSPISNAQTKGRLKLKQLFKEAYERCCTAPMDGISFTLEDFHAALANYDFVTELGTK
VKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEVGIYPGLEEEFVIIAEQEDGDGLILSLKSVQYGLAWERCRQLQAEDIVIKGKVVGATKGGLVVLVEGLRG
FVPFSQISAKSTGEELLNKELRLKFVEVDEKLSRLVLSNSKAIVSSQAELRIGSVVTGTVQFLKPYGAFIDIGGINGLLHVSQISQNHISDIATVLQPGDMLKVMILSYD
CNKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQIFRQRIAQAETMARAGLLGFQLEVLIYLAWIFQLNSLKMLERD