; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC07g0857 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC07g0857
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description30S ribosomal protein S1, chloroplastic
Genome locationMC07:16097419..16101311
RNA-Seq ExpressionMC07g0857
SyntenyMC07g0857
Gene Ontology termsGO:0005840 - ribosome (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138847.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Momordica charantia]2.42e-285100Show/hide
Query:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
        MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
Subjt:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL

Query:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
        GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
Subjt:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN

Query:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
        KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
Subjt:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN

Query:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPEL
        HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPEL
Subjt:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPEL

Query:  PA
        PA
Subjt:  PA

XP_022138848.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Momordica charantia]3.91e-249100Show/hide
Query:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
        MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
Subjt:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL

Query:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
        GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
Subjt:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN

Query:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
        KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
Subjt:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN

Query:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK
        HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK
Subjt:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK

XP_023006590.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Cucurbita maxima]1.07e-20679.23Show/hide
Query:  MNSLAHRV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY
        M+SLAH++      PLSS  +S      +RFS            PVVS+AAS TPISNAQTKERLKLKQLFKEAYERCC TPMDGVSFTLEDFHA+L+NY
Subjt:  MNSLAHRV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY

Query:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI
        DFVSE+GTKVKGTVFSTDA+GALVDT+AKGTAYLP +EACI  IRHVEEAGIYPGLEEEFVII   E E +GSLILSLR +QYGLAWERCRQLQAED VI
Subjt:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI

Query:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL
        KGKVV  NKGGV VLVEGL+GFVPFSQISAKSTAEELL+KEL LKFVEVDE+LSRL+LSN KAIA+SQ+ELRIGSVVTG VQILK YGAFIDIGGVNGLL
Subjt:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL

Query:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        H+SQIS NHISD+A VL+PGD LKVMILSYD +RGR+SLSTK LEP+PGDM+HNPKLVFEKADEMAQTFRQRIAQAEAMAR +LL  QPE
Subjt:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

XP_023006591.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Cucurbita maxima]3.64e-20779.03Show/hide
Query:  MNSLAHRV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY
        M+SLAH++      PLSS  +S      +RFS            PVVS+AAS TPISNAQTKERLKLKQLFKEAYERCC TPMDGVSFTLEDFHA+L+NY
Subjt:  MNSLAHRV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY

Query:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI
        DFVSE+GTKVKGTVFSTDA+GALVDT+AKGTAYLP +EACI  IRHVEEAGIYPGLEEEFVII   E E +GSLILSLR +QYGLAWERCRQLQAED VI
Subjt:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI

Query:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL
        KGKVV  NKGGV VLVEGL+GFVPFSQISAKSTAEELL+KEL LKFVEVDE+LSRL+LSN KAIA+SQ+ELRIGSVVTG VQILK YGAFIDIGGVNGLL
Subjt:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL

Query:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPES
        H+SQIS NHISD+A VL+PGD LKVMILSYD +RGR+SLSTK LEP+PGDM+HNPKLVFEKADEMAQTFRQRIAQAEAMAR +LL  QPE+
Subjt:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPES

XP_038906593.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]4.72e-21183.55Show/hide
Query:  PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAAST-TPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSELGTKVKGT
        PLSS RLS SS +W RF  KE      + LP+VS+AAS+ +PISNAQTKERLKLKQLFKEAYERCCT+PMDGVSFTLEDFHAAL++YDFVSELGTKVKGT
Subjt:  PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAAST-TPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSELGTKVKGT

Query:  VFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGANKGGVFVL
        VF T+A+GALVDT  KGTAYLP +EACILKI+HVEEAGIYPGLEEEF+IIAE E    LILSLR +QYGLAWERCRQLQAED+VIKGKVVGA KGGV VL
Subjt:  VFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGANKGGVFVL

Query:  VEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHNHISDMAT
        VEGLRGFVPFSQISAKSTAEELL+KELRLKFVEVDEELSRLILSN KAI  SQAELRIGSVVTGTVQILK YGAFIDIGG+NGLLH+SQIS NHI D+AT
Subjt:  VEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHNHISDMAT

Query:  VLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESG
        VL+PGD LKVMILSYD  +GRVSLSTK LEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMAR  LL  QPESG
Subjt:  VLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESG

TrEMBL top hitse value%identityAlignment
A0A6J1CCC9 30S ribosomal protein S1, chloroplastic-like isoform X11.17e-285100Show/hide
Query:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
        MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
Subjt:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL

Query:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
        GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
Subjt:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN

Query:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
        KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
Subjt:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN

Query:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPEL
        HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPEL
Subjt:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPEL

Query:  PA
        PA
Subjt:  PA

A0A6J1CEA2 30S ribosomal protein S1, chloroplastic-like isoform X21.89e-249100Show/hide
Query:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
        MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
Subjt:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL

Query:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
        GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
Subjt:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN

Query:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
        KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
Subjt:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN

Query:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK
        HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK
Subjt:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK

A0A6J1H681 30S ribosomal protein S1, chloroplastic-like isoform X21.44e-20679.03Show/hide
Query:  MNSLAHRV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY
        M+SLAH++      PLSS  +S      +RFS            PVVS+AAS TPISNAQTKERLKLKQLFKEAYERCC TPMDGVSFTLEDFHAAL+NY
Subjt:  MNSLAHRV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY

Query:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHET--DGSLILSLRRLQYGLAWERCRQLQAEDVVI
        DFVSELGTKVKGTVFSTDA+GALVDT+AKGTAYLP +EACI  IRHVEEAGIYPGLEEEFVII E E   DGSLILSLR +QYGLAWERCRQLQAED VI
Subjt:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHET--DGSLILSLRRLQYGLAWERCRQLQAEDVVI

Query:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL
        KGKVV  NKGGV VLVEGL+GFVPFSQISAKSTAEELL+KEL LKFVEVDE+L RLILSN KAI +SQ+ELRIGSVVTG VQILK YGAF+DIGGVNGLL
Subjt:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL

Query:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPES
        H+SQIS NHI D+A VL+PGD LKVMILSYD +RGR+SLSTK LEP+PGDM+HNPKLVFEKADEMAQTFRQRIAQAEAMAR +LL  QPE+
Subjt:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPES

A0A6J1KY62 30S ribosomal protein S1, chloroplastic-like isoform X21.76e-20779.03Show/hide
Query:  MNSLAHRV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY
        M+SLAH++      PLSS  +S      +RFS            PVVS+AAS TPISNAQTKERLKLKQLFKEAYERCC TPMDGVSFTLEDFHA+L+NY
Subjt:  MNSLAHRV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY

Query:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI
        DFVSE+GTKVKGTVFSTDA+GALVDT+AKGTAYLP +EACI  IRHVEEAGIYPGLEEEFVII   E E +GSLILSLR +QYGLAWERCRQLQAED VI
Subjt:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI

Query:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL
        KGKVV  NKGGV VLVEGL+GFVPFSQISAKSTAEELL+KEL LKFVEVDE+LSRL+LSN KAIA+SQ+ELRIGSVVTG VQILK YGAFIDIGGVNGLL
Subjt:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL

Query:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPES
        H+SQIS NHISD+A VL+PGD LKVMILSYD +RGR+SLSTK LEP+PGDM+HNPKLVFEKADEMAQTFRQRIAQAEAMAR +LL  QPE+
Subjt:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPES

A0A6J1L0J3 30S ribosomal protein S1, chloroplastic-like isoform X15.19e-20779.23Show/hide
Query:  MNSLAHRV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY
        M+SLAH++      PLSS  +S      +RFS            PVVS+AAS TPISNAQTKERLKLKQLFKEAYERCC TPMDGVSFTLEDFHA+L+NY
Subjt:  MNSLAHRV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY

Query:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI
        DFVSE+GTKVKGTVFSTDA+GALVDT+AKGTAYLP +EACI  IRHVEEAGIYPGLEEEFVII   E E +GSLILSLR +QYGLAWERCRQLQAED VI
Subjt:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI

Query:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL
        KGKVV  NKGGV VLVEGL+GFVPFSQISAKSTAEELL+KEL LKFVEVDE+LSRL+LSN KAIA+SQ+ELRIGSVVTG VQILK YGAFIDIGGVNGLL
Subjt:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL

Query:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        H+SQIS NHISD+A VL+PGD LKVMILSYD +RGR+SLSTK LEP+PGDM+HNPKLVFEKADEMAQTFRQRIAQAEAMAR +LL  QPE
Subjt:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

SwissProt top hitse value%identityAlignment
O33698 30S ribosomal protein S12.1e-4134.4Show/hide
Query:  EDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQ
        +DF  AL      S+ G  V+G V      GA +D   K  A+LP  EA +  +  + EA +    E EF++I +   DG + +SLR L    AW R  +
Subjt:  EDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQ

Query:  LQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQA-ELRIGSVVTGTVQILKSYGAFI
        LQ     ++ KV G+NKGGV   +EGLR F+P S ++ K   + L  K L + F+EV+    +L+LS  +A   +   E+ +G ++ G V  LK +G F+
Subjt:  LQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQA-ELRIGSVVTGTVQILKSYGAFI

Query:  DIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRI
        D+GG   LL I+QIS   ++D+  + + GD ++ ++++ D+ +GR+SLSTK LE  PG+++ N   +   A + A+  R+++
Subjt:  DIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRI

P29344 30S ribosomal protein S1, chloroplastic2.1e-15573.1Show/hide
Query:  PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSELGTKVKGTV
        PLS+  LS      + FS K  H  K R  P+VS+ A    +SNAQT+ER KLKQLF++AYERC   PM+GVSFT++DFH AL  YDF SE+G++VKGTV
Subjt:  PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSELGTKVKGTV

Query:  FSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGANKGGVFVLV
        F TDA+GALVD TAK +AYLP+ EACI +I++VEEAGI PG+ EEFVII E+E D SLILSLR++QY LAWERCRQLQAEDVV+KGK+VGANKGGV  LV
Subjt:  FSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGANKGGVFVLV

Query:  EGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHNHISDMATV
        EGLRGFVPFSQIS+KS+AEELL+KE+ LKFVEVDEE SRL++SN KA+A+SQA+L IGSVVTGTVQ LK YGAFIDIGG+NGLLH+SQISH+ +SD+ATV
Subjt:  EGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHNHISDMATV

Query:  LRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPELPA
        L+PGDTLKVMILS+D ERGRVSLSTK LEPTPGDMI NPKLVFEKA+EMAQTFRQRIAQAEAMAR D+LR QPESG T S DGILG L  +LPA
Subjt:  LRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPELPA

P46228 30S ribosomal protein S14.9e-7549.15Show/hide
Query:  PMDGVSFTLEDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQY
        P   + FT EDF A L  YD+    G  V GTVF+ +  GAL+D  AK  A+LP++E  I ++   EE  + P    EF I+++   DG L LS+RR++Y
Subjt:  PMDGVSFTLEDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQY

Query:  GLAWERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQA-ELRIGSVVTGTVQ
          AWER RQLQ ED  ++ +V   N+GG  V +EGLRGF+P S IS +   E+L+ +EL LKF+EVDE+ +RL+LS+ +A+   +   L +G VV G V+
Subjt:  GLAWERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQA-ELRIGSVVTGTVQ

Query:  ILKSYGAFIDIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQ
         +K YGAFIDIGGV+GLLHIS+ISH+HI    +V    D +KVMI+  D ERGR+SLSTK LEP PGDM+ NP++V+EKA+EMA  +R+++ Q
Subjt:  ILKSYGAFIDIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQ

P73530 30S ribosomal protein S1 homolog A9.2e-7450.68Show/hide
Query:  VSFTLEDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAW
        + FTLEDF A L  YD+    G  V GTVFS ++ GAL+D  AK  AY+PI+E  I ++   EE  + P    EF I+ +   DG L LS+RR++Y  AW
Subjt:  VSFTLEDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAW

Query:  ERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAE-LRIGSVVTGTVQILKS
        ER RQLQAED  ++  V   N+GG  V +EGLRGF+P S ISA+   E+L+ ++L LKF+EVDEE +RL+LS+ +A+   +   L +  VV G+V+ +K 
Subjt:  ERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAE-LRIGSVVTGTVQILKS

Query:  YGAFIDIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQ-RIAQAEAM
        YGAFIDIGGV+GLLHIS+ISH+HI    +V    D +KVMI+  D ERGR+SLSTK LEP PG M+ +  LV E ADEMA+ FRQ R+A+A+ +
Subjt:  YGAFIDIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQ-RIAQAEAM

Q93VC7 30S ribosomal protein S1, chloroplastic1.0e-14968.91Show/hide
Query:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNH-KFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSE
        M SLA +   S LR S  SSS  R SR+ S N  + ++  V  +  +   +S+ QTKERL+LK++F++AYERC T+PM+GV+FT++DF AA+  YDF SE
Subjt:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNH-KFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSE

Query:  LGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGA
        +GT+VKGTVF TDA+GALVD +AK +AYL +E+ACI +I+HVEEAGI PG+ EEFVII E+E+D SL+LSLR +QY LAWERCRQLQAEDV++K KV+GA
Subjt:  LGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGA

Query:  NKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISH
        NKGG+  LVEGLRGFVPFSQIS+K+ AEELL+KE+ LKFVEVDEE ++L+LSN KA+A+SQA+L IGSVV G VQ LK YGAFIDIGG+NGLLH+SQISH
Subjt:  NKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISH

Query:  NHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPE
        + +SD+ATVL+PGDTLKVMILS+D +RGRVSLSTK LEPTPGDMI NPKLVFEKA+EMAQTFRQRIAQAEAMAR D+LR QPESG T S DGILG L  E
Subjt:  NHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPE

Query:  LP
        LP
Subjt:  LP

Arabidopsis top hitse value%identityAlignment
AT1G71720.1 Nucleic acid-binding proteins superfamily8.4e-2231.28Show/hide
Query:  PGLEEEFVIIAE---HETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAK----STAEELLDKELRLKFVE
        P +E   V+ AE       G  +LS RR    +AW R RQ++  +  I+ K+   N GG+   +EGLR F+P  ++  K    +  +E + +   ++   
Subjt:  PGLEEEFVIIAE---HETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAK----STAEELLDKELRLKFVE

Query:  VDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIG--GVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEP
        ++E+ + LILS  + +A  +  LR G+++ GTV  +  YGA + +G    +GLLHIS I+   I  ++ VL+  +++KV+++       ++SLS  +LE 
Subjt:  VDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIG--GVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEP

Query:  TPGDMIHNPKLVFEKADEMAQTFRQRI
         PG  I + + VF +A+EMA+ +R+++
Subjt:  TPGDMIHNPKLVFEKADEMAQTFRQRI

AT3G23700.1 Nucleic acid-binding proteins superfamily1.3e-1931.11Show/hide
Query:  WERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEE-----------LLDKELRLKFVEVDEELSRLILSNCKAIANSQAE-LRIG
        W+  +         +G+V G N GG+ +    L GF+P+ Q+S   + +E           L+  +L +K V+ DEE  +LILS   A+    ++ + +G
Subjt:  WERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEE-----------LLDKELRLKFVEVDEELSRLILSNCKAIANSQAE-LRIG

Query:  SVVTGTVQILKSYGAFIDIG------GVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP
         V  G V  ++ YGAFI +        + GL+H+S++S +++ D+  VLR GD ++V++ + D E+ R++LS K LE  P
Subjt:  SVVTGTVQILKSYGAFIDIG------GVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP

AT4G29060.1 elongation factor Ts family protein3.7e-0935.16Show/hide
Query:  SNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGG-VNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP
        S   A+ N   EL  G+  TG V+ ++ +GAF+D G   +GL+H+SQ+S N + D+++V+  G  +KV ++  D E  R+SL+ +  +  P
Subjt:  SNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGG-VNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP

AT5G14580.1 polyribonucleotide nucleotidyltransferase, putative3.3e-1043.75Show/hide
Query:  ELRIGSVVTGTVQILKSYGAFIDI-GGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP
        EL +G V  GTV  +K YGAF++  GG  GLLH+S++SH  +S ++ VL  G  +  M +  D  RG + LS K L P P
Subjt:  ELRIGSVVTGTVQILKSYGAFIDI-GGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP

AT5G30510.1 ribosomal protein S17.3e-15168.91Show/hide
Query:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNH-KFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSE
        M SLA +   S LR S  SSS  R SR+ S N  + ++  V  +  +   +S+ QTKERL+LK++F++AYERC T+PM+GV+FT++DF AA+  YDF SE
Subjt:  MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNH-KFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSE

Query:  LGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGA
        +GT+VKGTVF TDA+GALVD +AK +AYL +E+ACI +I+HVEEAGI PG+ EEFVII E+E+D SL+LSLR +QY LAWERCRQLQAEDV++K KV+GA
Subjt:  LGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGA

Query:  NKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISH
        NKGG+  LVEGLRGFVPFSQIS+K+ AEELL+KE+ LKFVEVDEE ++L+LSN KA+A+SQA+L IGSVV G VQ LK YGAFIDIGG+NGLLH+SQISH
Subjt:  NKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISH

Query:  NHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPE
        + +SD+ATVL+PGDTLKVMILS+D +RGRVSLSTK LEPTPGDMI NPKLVFEKA+EMAQTFRQRIAQAEAMAR D+LR QPESG T S DGILG L  E
Subjt:  NHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPE

Query:  LP
        LP
Subjt:  LP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCGTTGGCGCATCGAGTTCCTCTTTCTTCGTTGCGACTTTCGGTTTCGAGTTCGAGTTGGAGGCGCTTTTCCCGTAAGGAGAGCCATAACCATAAGTTTAGGGC
GCTTCCGGTAGTTTCATCTGCAGCTTCCACTACTCCCATTTCCAATGCGCAGACCAAAGAGCGCCTTAAACTCAAGCAGCTCTTCAAGGAAGCTTACGAACGCTGCTGTA
CTACTCCCATGGATGGCGTCTCCTTCACGCTTGAGGACTTCCATGCTGCTCTCTCCAACTATGACTTCGTTTCCGAACTCGGAACCAAGGTTAAAGGAACTGTATTCAGT
ACCGATGCTAGTGGGGCACTAGTTGACACTACTGCAAAGGGAACTGCATACTTGCCCATCGAGGAGGCATGCATTCTTAAAATAAGACATGTAGAAGAAGCAGGCATATA
TCCTGGTCTAGAAGAGGAGTTTGTAATTATAGCTGAACATGAAACTGATGGTAGCTTGATTCTTAGCTTGAGAAGACTTCAGTATGGCCTTGCCTGGGAGCGATGCAGAC
AACTCCAAGCTGAGGATGTTGTTATCAAGGGTAAGGTTGTTGGTGCAAACAAAGGGGGAGTATTTGTTCTCGTGGAAGGCCTTAGAGGCTTCGTTCCTTTCTCTCAGATA
TCAGCAAAATCAACTGCAGAGGAGCTGCTTGATAAAGAGCTACGCCTGAAGTTTGTGGAGGTCGATGAGGAACTATCTCGGCTAATACTAAGTAACTGCAAGGCAATTGC
CAATAGTCAGGCAGAGCTAAGAATTGGGTCAGTAGTTACTGGAACCGTGCAGATTTTGAAATCATATGGAGCCTTTATTGACATTGGTGGAGTTAATGGCCTTCTTCATA
TTAGTCAAATCAGTCATAATCACATATCAGATATGGCAACTGTTCTTAGACCAGGAGACACGCTAAAGGTCATGATTTTGAGCTATGATCATGAGAGAGGCCGTGTTAGT
CTTTCTACCAAGAATTTGGAACCTACTCCTGGAGACATGATTCACAATCCAAAGCTTGTTTTTGAGAAGGCAGATGAGATGGCTCAGACGTTCAGGCAAAGAATAGCTCA
AGCAGAAGCAATGGCTCGTGTAGACCTTCTCAGACTTCAGCCTGAGAGTGGATTTACTCAGAGCTTTGATGGGATATTGGGTGGGCTTGCACCTGAATTGCCTGCGTAG
mRNA sequenceShow/hide mRNA sequence
GTTTTTTTTATGAAAGGTTGTGTCCTACCCATTCATGTTTTTATTTCCTAACCATAAAATTCTTTGGAATTCGTTACTCCCACACAAAGAAAAGAAATAATAATAATAAA
TTCGCCCGAGCAGAAGCTGAAGCTGGAGTTTTTGGTACTCCCGGCTTGTGAAAGGCGCGAATGAATTCGTTGGCGCATCGAGTTCCTCTTTCTTCGTTGCGACTTTCGGT
TTCGAGTTCGAGTTGGAGGCGCTTTTCCCGTAAGGAGAGCCATAACCATAAGTTTAGGGCGCTTCCGGTAGTTTCATCTGCAGCTTCCACTACTCCCATTTCCAATGCGC
AGACCAAAGAGCGCCTTAAACTCAAGCAGCTCTTCAAGGAAGCTTACGAACGCTGCTGTACTACTCCCATGGATGGCGTCTCCTTCACGCTTGAGGACTTCCATGCTGCT
CTCTCCAACTATGACTTCGTTTCCGAACTCGGAACCAAGGTTAAAGGAACTGTATTCAGTACCGATGCTAGTGGGGCACTAGTTGACACTACTGCAAAGGGAACTGCATA
CTTGCCCATCGAGGAGGCATGCATTCTTAAAATAAGACATGTAGAAGAAGCAGGCATATATCCTGGTCTAGAAGAGGAGTTTGTAATTATAGCTGAACATGAAACTGATG
GTAGCTTGATTCTTAGCTTGAGAAGACTTCAGTATGGCCTTGCCTGGGAGCGATGCAGACAACTCCAAGCTGAGGATGTTGTTATCAAGGGTAAGGTTGTTGGTGCAAAC
AAAGGGGGAGTATTTGTTCTCGTGGAAGGCCTTAGAGGCTTCGTTCCTTTCTCTCAGATATCAGCAAAATCAACTGCAGAGGAGCTGCTTGATAAAGAGCTACGCCTGAA
GTTTGTGGAGGTCGATGAGGAACTATCTCGGCTAATACTAAGTAACTGCAAGGCAATTGCCAATAGTCAGGCAGAGCTAAGAATTGGGTCAGTAGTTACTGGAACCGTGC
AGATTTTGAAATCATATGGAGCCTTTATTGACATTGGTGGAGTTAATGGCCTTCTTCATATTAGTCAAATCAGTCATAATCACATATCAGATATGGCAACTGTTCTTAGA
CCAGGAGACACGCTAAAGGTCATGATTTTGAGCTATGATCATGAGAGAGGCCGTGTTAGTCTTTCTACCAAGAATTTGGAACCTACTCCTGGAGACATGATTCACAATCC
AAAGCTTGTTTTTGAGAAGGCAGATGAGATGGCTCAGACGTTCAGGCAAAGAATAGCTCAAGCAGAAGCAATGGCTCGTGTAGACCTTCTCAGACTTCAGCCTGAGAGTG
GATTTACTCAGAGCTTTGATGGGATATTGGGTGGGCTTGCACCTGAATTGCCTGCGTAGGTTGTAGATCTCACTGATGTTTCCCTCACAGAATAATAACGAAGAGTTATT
ACATTTATAGCTTGCTTCTCTCCTGTTAATAATTCCACATTTAATTTTTAGTATTGGACTTAATTTATAAAATGGCGCCCTTCCAAAGGTATAGGGTATATATAGGTCAG
ACTCTTTGGAAGTAATGCTATTTGTGGGTAACAACTTGTTGGGCTTTTATCTTGCCTGATTCACAATATTGTTTTAACGGCAAGATAGTTTTATGATCGCTATGCTTTTG
AAGTTGAATCAGTAATTGCTATTCAGTTGATGAATTTTTTAACTTTGGTTAGCAGCATCTATAGATGAAGTGCTATTCTGCTAACTTATGGGAATCTTTCATCGCGATTA
CACCAAATTCCTTGGTATCGGCATTCAGCTTGGAACAATTGTTTGTATTTATTCTCTCGTAGAATTTGTCGTCACTTGTATATTTTGAAACAGTAATCATGTAGCTGATT
TTTTACATTTTAAGAGTTCTAGTATTGTCTTCCTCGGT
Protein sequenceShow/hide protein sequence
MNSLAHRVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSELGTKVKGTVFS
TDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQI
SAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVS
LSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPESGFTQSFDGILGGLAPELPA