; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019369 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019369
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Description30S ribosomal protein S1, chloroplastic
Genome locationscaffold611:684179..687236
RNA-Seq ExpressionMS019369
SyntenyMS019369
Gene Ontology termsGO:0005840 - ribosome (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138847.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Momordica charantia]1.5e-20899.74Show/hide
Query:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
        MNSLAH+VPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
Subjt:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL

Query:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
        GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
Subjt:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN

Query:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
        KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
Subjt:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN

Query:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
Subjt:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

XP_022138848.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Momordica charantia]5.3e-19399.72Show/hide
Query:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
        MNSLAH+VPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
Subjt:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL

Query:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
        GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
Subjt:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN

Query:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
        KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
Subjt:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN

Query:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK
        HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK
Subjt:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK

XP_023006590.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Cucurbita maxima]6.3e-16279.49Show/hide
Query:  MNSLAHQV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY
        M+SLAHQ+      PLSS  +S      +RFS            PVVS+AAS TPISNAQTKERLKLKQLFKEAYERCC TPMDGVSFTLEDFHA+L+NY
Subjt:  MNSLAHQV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY

Query:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI
        DFVSE+GTKVKGTVFSTDA+GALVDT+AKGTAYLP +EACI  IRHVEEAGIYPGLEEEFVII   E E +GSLILSLR +QYGLAWERCRQLQAED VI
Subjt:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI

Query:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL
        KGKVV  NKGGV VLVEGL+GFVPFSQISAKSTAEELL+KEL LKFVEVDE+LSRL+LSN KAIA+SQ+ELRIGSVVTG VQILK YGAFIDIGGVNGLL
Subjt:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL

Query:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        H+SQIS NHISD+A VL+PGD LKVMILSYD +RGR+SLSTK LEP+PGDM+HNPKLVFEKADEMAQTFRQRIAQAEAMAR +LL  QPE
Subjt:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

XP_023006591.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Cucurbita maxima]6.3e-16279.49Show/hide
Query:  MNSLAHQV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY
        M+SLAHQ+      PLSS  +S      +RFS            PVVS+AAS TPISNAQTKERLKLKQLFKEAYERCC TPMDGVSFTLEDFHA+L+NY
Subjt:  MNSLAHQV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY

Query:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI
        DFVSE+GTKVKGTVFSTDA+GALVDT+AKGTAYLP +EACI  IRHVEEAGIYPGLEEEFVII   E E +GSLILSLR +QYGLAWERCRQLQAED VI
Subjt:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI

Query:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL
        KGKVV  NKGGV VLVEGL+GFVPFSQISAKSTAEELL+KEL LKFVEVDE+LSRL+LSN KAIA+SQ+ELRIGSVVTG VQILK YGAFIDIGGVNGLL
Subjt:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL

Query:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        H+SQIS NHISD+A VL+PGD LKVMILSYD +RGR+SLSTK LEP+PGDM+HNPKLVFEKADEMAQTFRQRIAQAEAMAR +LL  QPE
Subjt:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

XP_038906593.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]2.0e-16380.71Show/hide
Query:  MNSLAHQ-----------VPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAAST-TPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFH
        M+S AHQ            PLSS RLS SS +W RF  KE      + LP+VS+AAS+ +PISNAQTKERLKLKQLFKEAYERCCT+PMDGVSFTLEDFH
Subjt:  MNSLAHQ-----------VPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAAST-TPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFH

Query:  AALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAE
        AAL++YDFVSELGTKVKGTVF T+A+GALVD T KGTAYLP +EACILKI+HVEEAGIYPGLEEEF+IIAE E    LILSLR +QYGLAWERCRQLQAE
Subjt:  AALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAE

Query:  DVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGV
        D+VIKGKVVGA KGGV VLVEGLRGFVPFSQISAKSTAEELL+KELRLKFVEVDEELSRLILSN KAI  SQAELRIGSVVTGTVQILK YGAFIDIGG+
Subjt:  DVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGV

Query:  NGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        NGLLH+SQIS NHI D+ATVL+PGD LKVMILSYD  +GRVSLSTK LEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMAR  LL  QPE
Subjt:  NGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

TrEMBL top hitse value%identityAlignment
A0A6J1CCC9 30S ribosomal protein S1, chloroplastic-like isoform X17.4e-20999.74Show/hide
Query:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
        MNSLAH+VPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
Subjt:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL

Query:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
        GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
Subjt:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN

Query:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
        KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
Subjt:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN

Query:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
Subjt:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

A0A6J1CEA2 30S ribosomal protein S1, chloroplastic-like isoform X22.6e-19399.72Show/hide
Query:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
        MNSLAH+VPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL
Subjt:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSEL

Query:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
        GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN
Subjt:  GTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGAN

Query:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
        KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN
Subjt:  KGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHN

Query:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK
        HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK
Subjt:  HISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEK

A0A6J1H681 30S ribosomal protein S1, chloroplastic-like isoform X21.5e-16179.49Show/hide
Query:  MNSLAHQV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY
        M+SLAHQ+      PLSS  +S      +RFS            PVVS+AAS TPISNAQTKERLKLKQLFKEAYERCC TPMDGVSFTLEDFHAAL+NY
Subjt:  MNSLAHQV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY

Query:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI
        DFVSELGTKVKGTVFSTDA+GALVDT+AKGTAYLP +EACI  IRHVEEAGIYPGLEEEFVII   E E DGSLILSLR +QYGLAWERCRQLQAED VI
Subjt:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI

Query:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL
        KGKVV  NKGGV VLVEGL+GFVPFSQISAKSTAEELL+KEL LKFVEVDE+L RLILSN KAI +SQ+ELRIGSVVTG VQILK YGAF+DIGGVNGLL
Subjt:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL

Query:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        H+SQIS NHI D+A VL+PGD LKVMILSYD +RGR+SLSTK LEP+PGDM+HNPKLVFEKADEMAQTFRQRIAQAEAMAR +LL  QPE
Subjt:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

A0A6J1KY62 30S ribosomal protein S1, chloroplastic-like isoform X23.1e-16279.49Show/hide
Query:  MNSLAHQV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY
        M+SLAHQ+      PLSS  +S      +RFS            PVVS+AAS TPISNAQTKERLKLKQLFKEAYERCC TPMDGVSFTLEDFHA+L+NY
Subjt:  MNSLAHQV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY

Query:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI
        DFVSE+GTKVKGTVFSTDA+GALVDT+AKGTAYLP +EACI  IRHVEEAGIYPGLEEEFVII   E E +GSLILSLR +QYGLAWERCRQLQAED VI
Subjt:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI

Query:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL
        KGKVV  NKGGV VLVEGL+GFVPFSQISAKSTAEELL+KEL LKFVEVDE+LSRL+LSN KAIA+SQ+ELRIGSVVTG VQILK YGAFIDIGGVNGLL
Subjt:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL

Query:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        H+SQIS NHISD+A VL+PGD LKVMILSYD +RGR+SLSTK LEP+PGDM+HNPKLVFEKADEMAQTFRQRIAQAEAMAR +LL  QPE
Subjt:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

A0A6J1L0J3 30S ribosomal protein S1, chloroplastic-like isoform X13.1e-16279.49Show/hide
Query:  MNSLAHQV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY
        M+SLAHQ+      PLSS  +S      +RFS            PVVS+AAS TPISNAQTKERLKLKQLFKEAYERCC TPMDGVSFTLEDFHA+L+NY
Subjt:  MNSLAHQV------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNY

Query:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI
        DFVSE+GTKVKGTVFSTDA+GALVDT+AKGTAYLP +EACI  IRHVEEAGIYPGLEEEFVII   E E +GSLILSLR +QYGLAWERCRQLQAED VI
Subjt:  DFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIA--EHETDGSLILSLRRLQYGLAWERCRQLQAEDVVI

Query:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL
        KGKVV  NKGGV VLVEGL+GFVPFSQISAKSTAEELL+KEL LKFVEVDE+LSRL+LSN KAIA+SQ+ELRIGSVVTG VQILK YGAFIDIGGVNGLL
Subjt:  KGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLL

Query:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        H+SQIS NHISD+A VL+PGD LKVMILSYD +RGR+SLSTK LEP+PGDM+HNPKLVFEKADEMAQTFRQRIAQAEAMAR +LL  QPE
Subjt:  HISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

SwissProt top hitse value%identityAlignment
O33698 30S ribosomal protein S12.0e-4134.4Show/hide
Query:  EDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQ
        +DF  AL      S+ G  V+G V      GA +D   K  A+LP  EA +  +  + EA +    E EF++I +   DG + +SLR L    AW R  +
Subjt:  EDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQ

Query:  LQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQA-ELRIGSVVTGTVQILKSYGAFI
        LQ     ++ KV G+NKGGV   +EGLR F+P S ++ K   + L  K L + F+EV+    +L+LS  +A   +   E+ +G ++ G V  LK +G F+
Subjt:  LQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQA-ELRIGSVVTGTVQILKSYGAFI

Query:  DIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRI
        D+GG   LL I+QIS   ++D+  + + GD ++ ++++ D+ +GR+SLSTK LE  PG+++ N   +   A + A+  R+++
Subjt:  DIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRI

P29344 30S ribosomal protein S1, chloroplastic2.4e-14871.98Show/hide
Query:  MNSLAHQV-------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSN
        M SLA Q+       PLS+  LS      + FS K  H  K R  P+VS+ A    +SNAQT+ER KLKQLF++AYERC   PM+GVSFT++DFH AL  
Subjt:  MNSLAHQV-------PLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSN

Query:  YDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIK
        YDF SE+G++VKGTVF TDA+GALVD TAK +AYLP+ EACI +I++VEEAGI PG+ EEFVII E+E D SLILSLR++QY LAWERCRQLQAEDVV+K
Subjt:  YDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIK

Query:  GKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLH
        GK+VGANKGGV  LVEGLRGFVPFSQIS+KS+AEELL+KE+ LKFVEVDEE SRL++SN KA+A+SQA+L IGSVVTGTVQ LK YGAFIDIGG+NGLLH
Subjt:  GKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLH

Query:  ISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        +SQISH+ +SD+ATVL+PGDTLKVMILS+D ERGRVSLSTK LEPTPGDMI NPKLVFEKA+EMAQTFRQRIAQAEAMAR D+LR QPE
Subjt:  ISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

P46228 30S ribosomal protein S14.7e-7549.15Show/hide
Query:  PMDGVSFTLEDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQY
        P   + FT EDF A L  YD+    G  V GTVF+ +  GAL+D  AK  A+LP++E  I ++   EE  + P    EF I+++   DG L LS+RR++Y
Subjt:  PMDGVSFTLEDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQY

Query:  GLAWERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQA-ELRIGSVVTGTVQ
          AWER RQLQ ED  ++ +V   N+GG  V +EGLRGF+P S IS +   E+L+ +EL LKF+EVDE+ +RL+LS+ +A+   +   L +G VV G V+
Subjt:  GLAWERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQA-ELRIGSVVTGTVQ

Query:  ILKSYGAFIDIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQ
         +K YGAFIDIGGV+GLLHIS+ISH+HI    +V    D +KVMI+  D ERGR+SLSTK LEP PGDM+ NP++V+EKA+EMA  +R+++ Q
Subjt:  ILKSYGAFIDIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQ

P73530 30S ribosomal protein S1 homolog A1.1e-7350.68Show/hide
Query:  VSFTLEDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAW
        + FTLEDF A L  YD+    G  V GTVFS ++ GAL+D  AK  AY+PI+E  I ++   EE  + P    EF I+ +   DG L LS+RR++Y  AW
Subjt:  VSFTLEDFHAALSNYDFVSELGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAW

Query:  ERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAE-LRIGSVVTGTVQILKS
        ER RQLQAED  ++  V   N+GG  V +EGLRGF+P S ISA+   E+L+ ++L LKF+EVDEE +RL+LS+ +A+   +   L +  VV G+V+ +K 
Subjt:  ERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAE-LRIGSVVTGTVQILKS

Query:  YGAFIDIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQ-RIAQAEAM
        YGAFIDIGGV+GLLHIS+ISH+HI    +V    D +KVMI+  D ERGR+SLSTK LEP PG M+ +  LV E ADEMA+ FRQ R+A+A+ +
Subjt:  YGAFIDIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQ-RIAQAEAM

Q93VC7 30S ribosomal protein S1, chloroplastic5.2e-14369.19Show/hide
Query:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNH-KFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSE
        M SLA Q   S LR S  SSS  R SR+ S N  + ++  V  +  +   +S+ QTKERL+LK++F++AYERC T+PM+GV+FT++DF AA+  YDF SE
Subjt:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNH-KFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSE

Query:  LGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGA
        +GT+VKGTVF TDA+GALVD +AK +AYL +E+ACI +I+HVEEAGI PG+ EEFVII E+E+D SL+LSLR +QY LAWERCRQLQAEDV++K KV+GA
Subjt:  LGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGA

Query:  NKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISH
        NKGG+  LVEGLRGFVPFSQIS+K+ AEELL+KE+ LKFVEVDEE ++L+LSN KA+A+SQA+L IGSVV G VQ LK YGAFIDIGG+NGLLH+SQISH
Subjt:  NKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISH

Query:  NHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        + +SD+ATVL+PGDTLKVMILS+D +RGRVSLSTK LEPTPGDMI NPKLVFEKA+EMAQTFRQRIAQAEAMAR D+LR QPE
Subjt:  NHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE

Arabidopsis top hitse value%identityAlignment
AT1G71720.1 Nucleic acid-binding proteins superfamily8.0e-2231.28Show/hide
Query:  PGLEEEFVIIAE---HETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAK----STAEELLDKELRLKFVE
        P +E   V+ AE       G  +LS RR    +AW R RQ++  +  I+ K+   N GG+   +EGLR F+P  ++  K    +  +E + +   ++   
Subjt:  PGLEEEFVIIAE---HETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAK----STAEELLDKELRLKFVE

Query:  VDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIG--GVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEP
        ++E+ + LILS  + +A  +  LR G+++ GTV  +  YGA + +G    +GLLHIS I+   I  ++ VL+  +++KV+++       ++SLS  +LE 
Subjt:  VDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIG--GVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEP

Query:  TPGDMIHNPKLVFEKADEMAQTFRQRI
         PG  I + + VF +A+EMA+ +R+++
Subjt:  TPGDMIHNPKLVFEKADEMAQTFRQRI

AT3G23700.1 Nucleic acid-binding proteins superfamily1.3e-1931.11Show/hide
Query:  WERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEE-----------LLDKELRLKFVEVDEELSRLILSNCKAIANSQAE-LRIG
        W+  +         +G+V G N GG+ +    L GF+P+ Q+S   + +E           L+  +L +K V+ DEE  +LILS   A+    ++ + +G
Subjt:  WERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQISAKSTAEE-----------LLDKELRLKFVEVDEELSRLILSNCKAIANSQAE-LRIG

Query:  SVVTGTVQILKSYGAFIDIG------GVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP
         V  G V  ++ YGAFI +        + GL+H+S++S +++ D+  VLR GD ++V++ + D E+ R++LS K LE  P
Subjt:  SVVTGTVQILKSYGAFIDIG------GVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP

AT4G29060.1 elongation factor Ts family protein3.5e-0935.16Show/hide
Query:  SNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGG-VNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP
        S   A+ N   EL  G+  TG V+ ++ +GAF+D G   +GL+H+SQ+S N + D+++V+  G  +KV ++  D E  R+SL+ +  +  P
Subjt:  SNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGG-VNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP

AT5G14580.1 polyribonucleotide nucleotidyltransferase, putative3.2e-1043.75Show/hide
Query:  ELRIGSVVTGTVQILKSYGAFIDI-GGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP
        EL +G V  GTV  +K YGAF++  GG  GLLH+S++SH  +S ++ VL  G  +  M +  D  RG + LS K L P P
Subjt:  ELRIGSVVTGTVQILKSYGAFIDI-GGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTP

AT5G30510.1 ribosomal protein S13.7e-14469.19Show/hide
Query:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNH-KFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSE
        M SLA Q   S LR S  SSS  R SR+ S N  + ++  V  +  +   +S+ QTKERL+LK++F++AYERC T+PM+GV+FT++DF AA+  YDF SE
Subjt:  MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNH-KFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSE

Query:  LGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGA
        +GT+VKGTVF TDA+GALVD +AK +AYL +E+ACI +I+HVEEAGI PG+ EEFVII E+E+D SL+LSLR +QY LAWERCRQLQAEDV++K KV+GA
Subjt:  LGTKVKGTVFSTDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGA

Query:  NKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISH
        NKGG+  LVEGLRGFVPFSQIS+K+ AEELL+KE+ LKFVEVDEE ++L+LSN KA+A+SQA+L IGSVV G VQ LK YGAFIDIGG+NGLLH+SQISH
Subjt:  NKGGVFVLVEGLRGFVPFSQISAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISH

Query:  NHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE
        + +SD+ATVL+PGDTLKVMILS+D +RGRVSLSTK LEPTPGDMI NPKLVFEKA+EMAQTFRQRIAQAEAMAR D+LR QPE
Subjt:  NHISDMATVLRPGDTLKVMILSYDHERGRVSLSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTCGTTGGCGCATCAAGTTCCTCTTTCTTCGTTGCGACTTTCGGTTTCGAGTTCGAGTTGGAGGCGCTTTTCCCGTAAGGAGAGCCATAACCATAAGTTTAGGGC
GCTTCCGGTAGTTTCATCTGCAGCTTCCACTACTCCCATTTCCAATGCGCAGACCAAAGAGCGCCTTAAACTCAAGCAGCTCTTCAAGGAAGCTTACGAACGCTGCTGTA
CTACTCCCATGGATGGCGTCTCCTTCACGCTTGAGGACTTCCATGCTGCTCTCTCCAACTATGACTTCGTTTCCGAACTCGGAACCAAGGTTAAAGGAACTGTATTCAGT
ACCGATGCTAGTGGGGCACTAGTTGACACTACTGCAAAGGGAACTGCATACTTGCCCATCGAGGAGGCATGCATTCTTAAAATAAGACATGTAGAAGAAGCAGGCATATA
TCCTGGTCTAGAAGAGGAGTTTGTAATTATAGCTGAACATGAAACTGATGGTAGCTTGATTCTTAGCTTGAGAAGACTTCAGTATGGCCTTGCCTGGGAGCGATGCAGAC
AACTCCAAGCTGAGGATGTTGTTATCAAGGGTAAGGTTGTTGGTGCAAACAAAGGGGGAGTATTTGTTCTCGTGGAAGGCCTTAGAGGCTTCGTTCCTTTCTCTCAGATA
TCAGCAAAATCAACTGCAGAGGAGCTGCTTGATAAAGAGCTACGTCTGAAGTTTGTGGAGGTCGATGAGGAACTATCTCGGCTAATACTAAGTAACTGCAAGGCAATTGC
CAATAGTCAGGCAGAGCTAAGAATTGGGTCAGTAGTTACTGGAACCGTGCAGATTTTGAAATCATATGGAGCCTTTATTGACATTGGTGGAGTTAATGGCCTTCTTCATA
TTAGTCAAATCAGTCATAATCACATATCAGATATGGCAACTGTTCTTAGACCAGGAGACACGCTAAAGGTCATGATTTTGAGCTATGATCACGAGAGAGGCCGTGTTAGT
CTTTCTACCAAGAATTTGGAACCTACTCCTGGAGACATGATTCACAATCCAAAGCTTGTTTTTGAGAAGGCAGATGAGATGGCTCAGACGTTCAGGCAAAGAATAGCTCA
AGCAGAAGCAATGGCTCGTGTAGACCTTCTCAGACTTCAGCCTGAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTCGTTGGCGCATCAAGTTCCTCTTTCTTCGTTGCGACTTTCGGTTTCGAGTTCGAGTTGGAGGCGCTTTTCCCGTAAGGAGAGCCATAACCATAAGTTTAGGGC
GCTTCCGGTAGTTTCATCTGCAGCTTCCACTACTCCCATTTCCAATGCGCAGACCAAAGAGCGCCTTAAACTCAAGCAGCTCTTCAAGGAAGCTTACGAACGCTGCTGTA
CTACTCCCATGGATGGCGTCTCCTTCACGCTTGAGGACTTCCATGCTGCTCTCTCCAACTATGACTTCGTTTCCGAACTCGGAACCAAGGTTAAAGGAACTGTATTCAGT
ACCGATGCTAGTGGGGCACTAGTTGACACTACTGCAAAGGGAACTGCATACTTGCCCATCGAGGAGGCATGCATTCTTAAAATAAGACATGTAGAAGAAGCAGGCATATA
TCCTGGTCTAGAAGAGGAGTTTGTAATTATAGCTGAACATGAAACTGATGGTAGCTTGATTCTTAGCTTGAGAAGACTTCAGTATGGCCTTGCCTGGGAGCGATGCAGAC
AACTCCAAGCTGAGGATGTTGTTATCAAGGGTAAGGTTGTTGGTGCAAACAAAGGGGGAGTATTTGTTCTCGTGGAAGGCCTTAGAGGCTTCGTTCCTTTCTCTCAGATA
TCAGCAAAATCAACTGCAGAGGAGCTGCTTGATAAAGAGCTACGTCTGAAGTTTGTGGAGGTCGATGAGGAACTATCTCGGCTAATACTAAGTAACTGCAAGGCAATTGC
CAATAGTCAGGCAGAGCTAAGAATTGGGTCAGTAGTTACTGGAACCGTGCAGATTTTGAAATCATATGGAGCCTTTATTGACATTGGTGGAGTTAATGGCCTTCTTCATA
TTAGTCAAATCAGTCATAATCACATATCAGATATGGCAACTGTTCTTAGACCAGGAGACACGCTAAAGGTCATGATTTTGAGCTATGATCACGAGAGAGGCCGTGTTAGT
CTTTCTACCAAGAATTTGGAACCTACTCCTGGAGACATGATTCACAATCCAAAGCTTGTTTTTGAGAAGGCAGATGAGATGGCTCAGACGTTCAGGCAAAGAATAGCTCA
AGCAGAAGCAATGGCTCGTGTAGACCTTCTCAGACTTCAGCCTGAG
Protein sequenceShow/hide protein sequence
MNSLAHQVPLSSLRLSVSSSSWRRFSRKESHNHKFRALPVVSSAASTTPISNAQTKERLKLKQLFKEAYERCCTTPMDGVSFTLEDFHAALSNYDFVSELGTKVKGTVFS
TDASGALVDTTAKGTAYLPIEEACILKIRHVEEAGIYPGLEEEFVIIAEHETDGSLILSLRRLQYGLAWERCRQLQAEDVVIKGKVVGANKGGVFVLVEGLRGFVPFSQI
SAKSTAEELLDKELRLKFVEVDEELSRLILSNCKAIANSQAELRIGSVVTGTVQILKSYGAFIDIGGVNGLLHISQISHNHISDMATVLRPGDTLKVMILSYDHERGRVS
LSTKNLEPTPGDMIHNPKLVFEKADEMAQTFRQRIAQAEAMARVDLLRLQPE