CuGenDBv2

HG10010439 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10010439
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: 30S ribosomal protein S1, chloroplastic
Genome location: Chr06:22223028..22231630
RNA-Seq Expression: HG10010439
Synteny: HG10010439
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0022627 - cytosolic small ribosomal subunit (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domains:
IPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1
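
The genome location field can be split into its parts programmatically. The following is a minimal sketch, not something the database page itself provides: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_location(location):
    """Split a string like 'Chr06:22223028..22231630' into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Chr06:22223028..22231630")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Chr06 22223028 22231630 8603
```

Under that assumption the gene spans 8603 bp of genomic sequence, which would include any introns and is therefore longer than the CDS.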


Homology
GenBank top hits (e value, % identity, alignment)
XP_022958818.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Cucurbita moschata] (e value: 2.3e-170; % identity: 83.8)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF
        MSS AH+ CG  LR SPLSS  +S+      RFS        PVVSAAASP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA LANYDF
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF

Query:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG
        VSELGTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEEAGIYPGLEEEFVII   EQED   LILSLRSVQYGLAWERCRQLQAED VIKG
Subjt:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG

Query:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVV  +KGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDE+L RL+LSNSKAIVSSQ+E+RIGSVVTG VQILKPYGAF+DIGG+NGLLHV
Subjt:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPEI
        SQISQNHI DIA VLQPGDMLKVMILSYDR+RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAEA+ARA LL FQPE+
Subjt:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPEI

XP_022958819.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Cucurbita moschata] (e value: 3.9e-170; % identity: 84.02)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF
        MSS AH+ CG  LR SPLSS  +S+      RFS        PVVSAAASP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA LANYDF
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF

Query:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG
        VSELGTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEEAGIYPGLEEEFVII   EQED   LILSLRSVQYGLAWERCRQLQAED VIKG
Subjt:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG

Query:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVV  +KGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDE+L RL+LSNSKAIVSSQ+E+RIGSVVTG VQILKPYGAF+DIGG+NGLLHV
Subjt:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE
        SQISQNHI DIA VLQPGDMLKVMILSYDR+RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAEA+ARA LL FQPE
Subjt:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE

XP_023006590.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Cucurbita maxima] (e value: 7.9e-171; % identity: 84.06)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF
        MSS AH+ CG  LR SPLSS  +S+      RFS        PVVSAAASP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA+LANYDF
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF

Query:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG
        VSE+GTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEEAGIYPGLEEEFVII   EQED   LILSLRSVQYGLAWERCRQLQAED VIKG
Subjt:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG

Query:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVV  +KGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDE+LSRLVLSNSKAI SSQ+E+RIGSVVTG VQILKPYGAFIDIGG+NGLLHV
Subjt:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPEI
        SQISQNHI+DIA VLQPGDMLKVMILSYDR+RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAEA+ARA LL FQPE+
Subjt:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPEI

XP_023006591.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Cucurbita maxima] (e value: 1.4e-170; % identity: 84.28)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF
        MSS AH+ CG  LR SPLSS  +S+      RFS        PVVSAAASP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA+LANYDF
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF

Query:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG
        VSE+GTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEEAGIYPGLEEEFVII   EQED   LILSLRSVQYGLAWERCRQLQAED VIKG
Subjt:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG

Query:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVV  +KGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDE+LSRLVLSNSKAI SSQ+E+RIGSVVTG VQILKPYGAFIDIGG+NGLLHV
Subjt:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE
        SQISQNHI+DIA VLQPGDMLKVMILSYDR+RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAEA+ARA LL FQPE
Subjt:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE

XP_038906593.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida] (e value: 1.6e-192; % identity: 91.54)
Query:  MSSSAHKP-CGLNLR--YSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAA-SPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLA
        MSSSAH+P CGL  R  YSPLSS RLS+SSWNWNRF PKEWRK+LP+VSAAA SPSPISNAQTKERLKLKQLFKEAYERCCT+PMDGVSFTLEDFHA LA
Subjt:  MSSSAHKP-CGLNLR--YSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAA-SPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLA

Query:  NYDFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVI
        +YDFVSELGTKVKGTVFCT+ANGALVD T KGTAYLPTQEACILKI+HVEEAGIYPGLEEEF+IIAEQEDGDGLILSLRSVQYGLAWERCRQLQAED+VI
Subjt:  NYDFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVI

Query:  KGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLL
        KGKVVGA+KGGVVVLVEGLRGFVPFSQISAKSTAEELLNKEL LKFVEVDEELSRL+LSNSKAIV SQAE+RIGSVVTGTVQILKPYGAFIDIGGINGLL
Subjt:  KGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLL

Query:  HVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE
        HVSQISQNHI DIATVLQPGD+LKVMILSYD+N+GRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQ FRQRIAQAEA+ARAGLLG QPE
Subjt:  HVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CCC9 30S ribosomal protein S1, chloroplastic-like isoform X1 (e value: 9.5e-170; % identity: 82.31)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWR----KVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLA
        M+S AH+         PLSS RLS SS +W RFS KE      + LPVVS+AAS +PISNAQTKERLKLKQLFKEAYERCCT PMDGVSFTLEDFHA L+
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWR----KVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLA

Query:  NYDFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVI
        NYDFVSELGTKVKGTVF TDA+GALVDTTAKGTAYLP +EACILKIRHVEEAGIYPGLEEEFVIIAE E    LILSLR +QYGLAWERCRQLQAEDVVI
Subjt:  NYDFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVI

Query:  KGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLL
        KGKVVGA+KGGV VLVEGLRGFVPFSQISAKSTAEELL+KEL LKFVEVDEELSRL+LSN KAI +SQAE+RIGSVVTGTVQILK YGAFIDIGG+NGLL
Subjt:  KGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLL

Query:  HVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE
        H+SQIS NHI+D+ATVL+PGD LKVMILSYD  RGRVSLSTK LEPTPGDMIHNPKLVFEKADEMAQ FRQRIAQAEA+AR  LL  QPE
Subjt:  HVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE

A0A6J1H453 30S ribosomal protein S1, chloroplastic-like isoform X1 (e value: 1.1e-170; % identity: 83.8)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF
        MSS AH+ CG  LR SPLSS  +S+      RFS        PVVSAAASP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA LANYDF
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF

Query:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG
        VSELGTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEEAGIYPGLEEEFVII   EQED   LILSLRSVQYGLAWERCRQLQAED VIKG
Subjt:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG

Query:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVV  +KGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDE+L RL+LSNSKAIVSSQ+E+RIGSVVTG VQILKPYGAF+DIGG+NGLLHV
Subjt:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPEI
        SQISQNHI DIA VLQPGDMLKVMILSYDR+RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAEA+ARA LL FQPE+
Subjt:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPEI

A0A6J1H681 30S ribosomal protein S1, chloroplastic-like isoform X2 (e value: 1.9e-170; % identity: 84.02)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF
        MSS AH+ CG  LR SPLSS  +S+      RFS        PVVSAAASP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA LANYDF
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF

Query:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG
        VSELGTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEEAGIYPGLEEEFVII   EQED   LILSLRSVQYGLAWERCRQLQAED VIKG
Subjt:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG

Query:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVV  +KGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDE+L RL+LSNSKAIVSSQ+E+RIGSVVTG VQILKPYGAF+DIGG+NGLLHV
Subjt:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE
        SQISQNHI DIA VLQPGDMLKVMILSYDR+RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAEA+ARA LL FQPE
Subjt:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE

A0A6J1KY62 30S ribosomal protein S1, chloroplastic-like isoform X2 (e value: 6.6e-171; % identity: 84.28)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF
        MSS AH+ CG  LR SPLSS  +S+      RFS        PVVSAAASP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA+LANYDF
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF

Query:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG
        VSE+GTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEEAGIYPGLEEEFVII   EQED   LILSLRSVQYGLAWERCRQLQAED VIKG
Subjt:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG

Query:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVV  +KGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDE+LSRLVLSNSKAI SSQ+E+RIGSVVTG VQILKPYGAFIDIGG+NGLLHV
Subjt:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE
        SQISQNHI+DIA VLQPGDMLKVMILSYDR+RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAEA+ARA LL FQPE
Subjt:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE

A0A6J1L0J3 30S ribosomal protein S1, chloroplastic-like isoform X1 (e value: 3.8e-171; % identity: 84.06)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF
        MSS AH+ CG  LR SPLSS  +S+      RFS        PVVSAAASP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA+LANYDF
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF

Query:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG
        VSE+GTKVKGTVF TDANGALVDT+AKGTAYLPTQEACI  IRHVEEAGIYPGLEEEFVII   EQED   LILSLRSVQYGLAWERCRQLQAED VIKG
Subjt:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIA--EQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG

Query:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        KVV  +KGGVVVLVEGL+GFVPFSQISAKSTAEELLNKEL LKFVEVDE+LSRLVLSNSKAI SSQ+E+RIGSVVTG VQILKPYGAFIDIGG+NGLLHV
Subjt:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPEI
        SQISQNHI+DIA VLQPGDMLKVMILSYDR+RGR+SLSTKKLEP+PGDM+HNPKLVFEKADEMAQ FRQRIAQAEA+ARA LL FQPE+
Subjt:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPEI

SwissProt top hits (e value, % identity, alignment)
O33698 30S ribosomal protein S1 (e value: 2.1e-41; % identity: 34.4)
Query:  EDFHATLANYDFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQ
        +DF   L      S+ G  V+G V     +GA +D   K  A+LP +EA +  +  + EA +    E EF++I +Q +   + +SLR++    AW R  +
Subjt:  EDFHATLANYDFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQ

Query:  LQAEDVVIKGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQA-EIRIGSVVTGTVQILKPYGAFI
        LQ     ++ KV G++KGGV   +EGLR F+P S ++ K   + L  K L + F+EV+    +LVLS  +A  ++   EI +G ++ G V  LKP+G F+
Subjt:  LQAEDVVIKGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQA-EIRIGSVVTGTVQILKPYGAFI

Query:  DIGGINGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRI
        D+GG   LL ++QISQ  +AD+  + + GD ++ ++++ D  +GR+SLSTK LE  PG+++ N   +   A + A+  R+++
Subjt:  DIGGINGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRI

P29344 30S ribosomal protein S1, chloroplastic (e value: 1.5e-148; % identity: 72.68)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRK--VLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANY
        M+S A +  G  LR  PLS+  LS        FSPK   K    P+VSA A    +SNAQT+ER KLKQLF++AYERC  APM+GVSFT++DFH  L  Y
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRK--VLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANY

Query:  DFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG
        DF SE+G++VKGTVFCTDANGALVD TAK +AYLP  EACI +I++VEEAGI PG+ EEFVII E E  D LILSLR +QY LAWERCRQLQAEDVV+KG
Subjt:  DFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKG

Query:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV
        K+VGA+KGGVV LVEGLRGFVPFSQIS+KS+AEELL KE+ LKFVEVDEE SRLV+SN KA+  SQA++ IGSVVTGTVQ LKPYGAFIDIGGINGLLHV
Subjt:  KVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHV

Query:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE
        SQIS + ++DIATVLQPGD LKVMILS+DR RGRVSLSTKKLEPTPGDMI NPKLVFEKA+EMAQ FRQRIAQAEA+ARA +L FQPE
Subjt:  SQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE

P46228 30S ribosomal protein S1 (e value: 8.8e-72; % identity: 47.44)
Query:  PMDGVSFTLEDFHATLANYDFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQY
        P   + FT EDF A L  YD+    G  V GTVF  +  GAL+D  AK  A+LP QE  I ++   EE  + P    EF I++++ +   L LS+R ++Y
Subjt:  PMDGVSFTLEDFHATLANYDFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQY

Query:  GLAWERCRQLQAEDVVIKGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQA-EIRIGSVVTGTVQ
          AWER RQLQ ED  ++ +V   ++GG +V +EGLRGF+P S IS +   E+L+ +EL LKF+EVDE+ +RLVLS+ +A+V  +   + +G VV G V+
Subjt:  GLAWERCRQLQAEDVVIKGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQA-EIRIGSVVTGTVQ

Query:  ILKPYGAFIDIGGINGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQ
         +KPYGAFIDIGG++GLLH+S+IS +HI    +V    D +KVMI+  D  RGR+SLSTK+LEP PGDM+ NP++V+EKA+EMA  +R+++ Q
Subjt:  ILKPYGAFIDIGGINGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQ

P73530 30S ribosomal protein S1 homolog A (e value: 2.2e-70; % identity: 48.3)
Query:  VSFTLEDFHATLANYDFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAW
        + FTLEDF A L  YD+    G  V GTVF  ++ GAL+D  AK  AY+P QE  I ++   EE  + P    EF I+ ++ +   L LS+R ++Y  AW
Subjt:  VSFTLEDFHATLANYDFVSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAW

Query:  ERCRQLQAEDVVIKGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAE-IRIGSVVTGTVQILKP
        ER RQLQAED  ++  V   ++GG +V +EGLRGF+P S ISA+   E+L+ ++L LKF+EVDEE +RLVLS+ +A+V  +   + +  VV G+V+ +KP
Subjt:  ERCRQLQAEDVVIKGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAE-IRIGSVVTGTVQILKP

Query:  YGAFIDIGGINGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQ-RIAQAEAV
        YGAFIDIGG++GLLH+S+IS +HI    +V    D +KVMI+  D  RGR+SLSTK+LEP PG M+ +  LV E ADEMA++FRQ R+A+A+ +
Subjt:  YGAFIDIGGINGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQ-RIAQAEAV

Q93VC7 30S ribosomal protein S1, chloroplastic (e value: 4.4e-140; % identity: 68.39)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF
        M+S A +  G  LR SPLSS    S   + N F   +   V P + AA +   +S+ QTKERL+LK++F++AYERC T+PM+GV+FT++DF A +  YDF
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF

Query:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKGKV
         SE+GT+VKGTVF TDANGALVD +AK +AYL  ++ACI +I+HVEEAGI PG+ EEFVII E E  D L+LSLR++QY LAWERCRQLQAEDV++K KV
Subjt:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKGKV

Query:  VGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        +GA+KGG+V LVEGLRGFVPFSQIS+K+ AEELL KE+ LKFVEVDEE ++LVLSN KA+  SQA++ IGSVV G VQ LKPYGAFIDIGGINGLLHVSQ
Subjt:  VGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE
        IS + ++DIATVLQPGD LKVMILS+DR+RGRVSLSTKKLEPTPGDMI NPKLVFEKA+EMAQ FRQRIAQAEA+ARA +L FQPE
Subjt:  ISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE

Arabidopsis top hits (e value, % identity, alignment)
AT1G71720.1 Nucleic acid-binding proteins superfamily (e value: 3.6e-20; % identity: 29.76)
Query:  ILSLRSVQYGLAWERCRQLQAEDVVIKGKVVGASKGGVVVLVEGLRGFVPFSQISAK----STAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAE
        +LS R     +AW R RQ++  +  I+ K+   + GG++  +EGLR F+P  ++  K    +  +E + +   ++   ++E+ + L+L  S+ +   +  
Subjt:  ILSLRSVQYGLAWERCRQLQAEDVVIKGKVVGASKGGVVVLVEGLRGFVPFSQISAK----STAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAE

Query:  IRIGSVVTGTVQILKPYGAFIDIG--GINGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQM
        +R G+++ GTV  + PYGA + +G    +GLLH+S I++  I  ++ VLQ  + +KV+++       ++SLS   LE  PG  I + + VF +A+EMA+ 
Subjt:  IRIGSVVTGTVQILKPYGAFIDIG--GINGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQM

Query:  FRQRI
        +R+++
Subjt:  FRQRI

AT3G23700.1 Nucleic acid-binding proteins superfamily (e value: 6.8e-19; % identity: 29.44)
Query:  WERCRQLQAEDVVIKGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEE-----------LLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAE-IRIG
        W+  +         +G+V G + GG+++    L GF+P+ Q+S   + +E           L+  +L +K V+ DEE  +L+LS   A+    ++ + +G
Subjt:  WERCRQLQAEDVVIKGKVVGASKGGVVVLVEGLRGFVPFSQISAKSTAEE-----------LLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAE-IRIG

Query:  SVVTGTVQILKPYGAFIDIG------GINGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTP
         V  G V  ++ YGAFI +        + GL+HVS++S +++ D+  VL+ GD ++V++ + D+ + R++LS K+LE  P
Subjt:  SVVTGTVQILKPYGAFIDIG------GINGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTP

AT4G29060.1 elongation factor Ts family protein (e value: 1.7e-09; % identity: 36.25)
Query:  EIRIGSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTP
        E+  G+  TG V+ ++P+GAF+D G   +GL+HVSQ+S N + D+++V+  G  +KV ++  D    R+SL+ ++ +  P
Subjt:  EIRIGSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTP

AT4G29060.2 elongation factor Ts family protein (e value: 1.7e-09; % identity: 36.25)
Query:  EIRIGSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTP
        E+  G+  TG V+ ++P+GAF+D G   +GL+HVSQ+S N + D+++V+  G  +KV ++  D    R+SL+ ++ +  P
Subjt:  EIRIGSVVTGTVQILKPYGAFIDIGGI-NGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTP

AT5G30510.1 ribosomal protein S1 (e value: 3.1e-141; % identity: 68.39)
Query:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF
        M+S A +  G  LR SPLSS    S   + N F   +   V P + AA +   +S+ QTKERL+LK++F++AYERC T+PM+GV+FT++DF A +  YDF
Subjt:  MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDF

Query:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKGKV
         SE+GT+VKGTVF TDANGALVD +AK +AYL  ++ACI +I+HVEEAGI PG+ EEFVII E E  D L+LSLR++QY LAWERCRQLQAEDV++K KV
Subjt:  VSELGTKVKGTVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKGKV

Query:  VGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ
        +GA+KGG+V LVEGLRGFVPFSQIS+K+ AEELL KE+ LKFVEVDEE ++LVLSN KA+  SQA++ IGSVV G VQ LKPYGAFIDIGGINGLLHVSQ
Subjt:  VGASKGGVVVLVEGLRGFVPFSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQ

Query:  ISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE
        IS + ++DIATVLQPGD LKVMILS+DR+RGRVSLSTKKLEPTPGDMI NPKLVFEKA+EMAQ FRQRIAQAEA+ARA +L FQPE
Subjt:  ISQNHIADIATVLQPGDMLKVMILSYDRNRGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPE
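
The % identity figures listed with each hit are conventionally the number of identical aligned positions divided by the alignment length. As a rough sketch of that calculation, the helper below counts letters on the middle (match) line of a BLAST-style alignment, where a letter marks an identity, `+` a conservative substitution and a space a mismatch or gap. The function name `percent_identity` is hypothetical, and the short fragment used here is only the first ten columns of the first GenBank alignment, so it does not reproduce the full-alignment value reported on the page.

```python
def percent_identity(query_line, match_line):
    """Percent identity from one BLAST-style query line and its match line.

    Letters on the match line mark identical residues; '+' and spaces do not.
    The query line (gaps included) defines the alignment length, because
    trailing spaces on the match line may have been stripped.
    """
    aligned_length = len(query_line)
    identities = sum(1 for ch in match_line[:aligned_length] if ch.isalpha())
    return 100.0 * identities / aligned_length

query = "MSSSAHKPCG"
match = "MSS AH+ CG"
print(percent_identity(query, match))  # 70.0
```

For a multi-block alignment, the identity and length counts would be summed over all blocks before dividing.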


Sequences
CDS sequence
ATGAGCTCATCGGCGCATAAACCTTGTGGGTTGAACTTGAGGTATTCCCCTCTTTCTTCAAAGCGACTTTCATCTTCGAGTTGGAACTGGAATCGCTTTTCTCCAAAGGA
ATGGCGTAAAGTGCTTCCAGTAGTTTCAGCTGCAGCTTCCCCTTCTCCCATTTCCAATGCGCAGACCAAAGAGCGCCTTAAACTCAAACAACTCTTCAAGGAAGCTTATG
AACGCTGCTGTACTGCCCCCATGGATGGCGTCTCCTTCACTCTTGAAGACTTCCATGCCACTCTTGCAAATTATGACTTTGTTTCTGAACTCGGAACCAAGGTTAAAGGT
ACTGTATTTTGTACCGATGCTAATGGGGCACTAGTTGATACTACTGCAAAGGGAACTGCATACTTGCCCACCCAAGAGGCATGCATTCTTAAAATAAGACATGTAGAAGA
AGCCGGCATATATCCTGGTTTAGAAGAGGAGTTCGTAATTATTGCTGAACAGGAAGATGGCGATGGCTTAATTCTGAGCTTGAGAAGTGTCCAGTATGGCCTTGCTTGGG
AGCGATGCAGACAACTCCAAGCTGAGGATGTTGTTATCAAGGGTAAGGTTGTTGGTGCAAGCAAAGGGGGAGTAGTTGTTCTTGTGGAAGGTCTTAGAGGCTTTGTTCCT
TTCTCTCAGATATCAGCAAAATCAACTGCAGAGGAGCTTCTTAATAAAGAGCTACATCTGAAGTTTGTGGAGGTCGATGAGGAACTATCTCGGCTAGTCTTAAGTAACTC
CAAGGCCATTGTCAGTAGCCAGGCAGAGATAAGAATTGGTTCAGTAGTTACTGGAACCGTGCAGATTCTGAAACCATATGGAGCCTTTATTGACATCGGTGGAATTAATG
GGCTTCTTCATGTTAGTCAGATCAGTCAAAATCACATAGCAGATATTGCAACCGTTCTTCAACCAGGAGATATGCTTAAGGTCATGATTTTGAGCTATGACCGCAACAGA
GGCCGTGTTAGTCTTTCTACCAAGAAACTGGAACCTACTCCTGGAGACATGATTCACAATCCAAAGCTTGTCTTTGAGAAGGCGGACGAGATGGCTCAAATGTTCAGGCA
AAGAATAGCTCAAGCAGAAGCAGTGGCTCGTGCAGGCCTTCTCGGATTTCAGCCTGAGATCAACAACAGGAAGAAGAAGAAGAAAAAGAAGAAGATGATGATGATGAAAT
TCATGGAGAAAAATTAG
mRNA sequence
ATGAGCTCATCGGCGCATAAACCTTGTGGGTTGAACTTGAGGTATTCCCCTCTTTCTTCAAAGCGACTTTCATCTTCGAGTTGGAACTGGAATCGCTTTTCTCCAAAGGA
ATGGCGTAAAGTGCTTCCAGTAGTTTCAGCTGCAGCTTCCCCTTCTCCCATTTCCAATGCGCAGACCAAAGAGCGCCTTAAACTCAAACAACTCTTCAAGGAAGCTTATG
AACGCTGCTGTACTGCCCCCATGGATGGCGTCTCCTTCACTCTTGAAGACTTCCATGCCACTCTTGCAAATTATGACTTTGTTTCTGAACTCGGAACCAAGGTTAAAGGT
ACTGTATTTTGTACCGATGCTAATGGGGCACTAGTTGATACTACTGCAAAGGGAACTGCATACTTGCCCACCCAAGAGGCATGCATTCTTAAAATAAGACATGTAGAAGA
AGCCGGCATATATCCTGGTTTAGAAGAGGAGTTCGTAATTATTGCTGAACAGGAAGATGGCGATGGCTTAATTCTGAGCTTGAGAAGTGTCCAGTATGGCCTTGCTTGGG
AGCGATGCAGACAACTCCAAGCTGAGGATGTTGTTATCAAGGGTAAGGTTGTTGGTGCAAGCAAAGGGGGAGTAGTTGTTCTTGTGGAAGGTCTTAGAGGCTTTGTTCCT
TTCTCTCAGATATCAGCAAAATCAACTGCAGAGGAGCTTCTTAATAAAGAGCTACATCTGAAGTTTGTGGAGGTCGATGAGGAACTATCTCGGCTAGTCTTAAGTAACTC
CAAGGCCATTGTCAGTAGCCAGGCAGAGATAAGAATTGGTTCAGTAGTTACTGGAACCGTGCAGATTCTGAAACCATATGGAGCCTTTATTGACATCGGTGGAATTAATG
GGCTTCTTCATGTTAGTCAGATCAGTCAAAATCACATAGCAGATATTGCAACCGTTCTTCAACCAGGAGATATGCTTAAGGTCATGATTTTGAGCTATGACCGCAACAGA
GGCCGTGTTAGTCTTTCTACCAAGAAACTGGAACCTACTCCTGGAGACATGATTCACAATCCAAAGCTTGTCTTTGAGAAGGCGGACGAGATGGCTCAAATGTTCAGGCA
AAGAATAGCTCAAGCAGAAGCAGTGGCTCGTGCAGGCCTTCTCGGATTTCAGCCTGAGATCAACAACAGGAAGAAGAAGAAGAAAAAGAAGAAGATGATGATGATGAAAT
TCATGGAGAAAAATTAG
Protein sequence
MSSSAHKPCGLNLRYSPLSSKRLSSSSWNWNRFSPKEWRKVLPVVSAAASPSPISNAQTKERLKLKQLFKEAYERCCTAPMDGVSFTLEDFHATLANYDFVSELGTKVKG
TVFCTDANGALVDTTAKGTAYLPTQEACILKIRHVEEAGIYPGLEEEFVIIAEQEDGDGLILSLRSVQYGLAWERCRQLQAEDVVIKGKVVGASKGGVVVLVEGLRGFVP
FSQISAKSTAEELLNKELHLKFVEVDEELSRLVLSNSKAIVSSQAEIRIGSVVTGTVQILKPYGAFIDIGGINGLLHVSQISQNHIADIATVLQPGDMLKVMILSYDRNR
GRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAQMFRQRIAQAEAVARAGLLGFQPEINNRKKKKKKKKMMMMKFMEKN
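
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below is illustrative only: it assumes the standard genetic code (reasonable for a gene located on a nuclear chromosome, but not stated by the record), and the function name `translate` is hypothetical. It needs no third-party packages.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; a codon containing anything
    other than A, C, G or T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First nine codons of the CDS listed above.
print(translate("ATGAGCTCATCGGCGCATAAACCTTGT"))  # MSSSAHKPC
# Last seven codons of the CDS, including the TAG stop.
print(translate("AAATTCATGGAGAAAAATTAG"))        # KFMEKN
```

Both fragments agree with the start (MSSSAHKPC) and end (KFMEKN) of the protein sequence shown above.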