CuGenDBv2

PI0004602 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0004602
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: 30S ribosomal protein S1, chloroplastic
Genome location: chr11:3512578..3517794
RNA-Seq Expression: PI0004602
Synteny: PI0004602
Gene Ontology terms:
    GO:0005840 - ribosome (cellular component)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR003029 - S1 domain
    IPR012340 - Nucleic acid-binding, OB-fold
    IPR022967 - RNA-binding domain, S1
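The genome location field above can be split into its parts with a short script. This is a minimal sketch, not part of CuGenDB: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the genomic span in base pairs.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


chrom, start, end = parse_location("chr11:3512578..3517794")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 5217 bp on chr11, which includes any introns and untranslated regions.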


Homology
GenBank top hits | e value | % identity | Alignment
XP_022138847.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Momordica charantia] | 2.2e-162 | 80.67
Query:  LNSRYSYSPLSSSRLSASSWNWKRFPQKEWR----KLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALANYDFV
        +NS     PLSS RLS SS +W+RF +KE      + LP+VS+AAS+   +PISNAQTKERLKLKQLFKEAYERCCT PMDGVSFTLEDFHAAL+NYDFV
Subjt:  LNSRYSYSPLSSSRLSASSWNWKRFPQKEWR----KLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALANYDFV

Query:  SQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFVIKGKVV
        S+LGTKVKGTVF TDA+GALVDTTAKGTAYLP +E+ ILKIRHVEEAGIYPGLEE+FVIIAE E    LILSLR +Q GLAWERCRQLQAED VIKGKVV
Subjt:  SQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFVIKGKVV

Query:  GATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGLLHVSQI
        GA KGGV VLVE LRGFVPFSQISA STAEELL KEL LKFVEVDE+LSRLILSN KAI +SQAELRIGSVVTGTVQ LK YGAFIDIGG+NGLLH+SQI
Subjt:  GATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGLLHVSQI

Query:  SQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG
        S NHISD+A VL+PGD LKVMILSYDH +GRVSLSTK LEPTPGDMIHNPKLVFEKADEMA+ FRQRIAQAEAMAR D LL  QPESG
Subjt:  SQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG

XP_022958819.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Cucurbita moschata] | 1.5e-161 | 80.71
Query:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA
        SS AHQ CGL S    SPLSS+ +S      KRF         P+VSAAA   SP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHAALA
Subjt:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA

Query:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF
        NYDFVS+LGTKVKGTVF TDANGALVDT+AKGTAYLPTQE+ I  IRHVEEAGIYPGLEE+FVII   EQED   LILSLRSVQ GLAWERCRQLQAEDF
Subjt:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF

Query:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING
        VIKGKVV   KGGVVVLVE L+GFVPFSQISA STAEELL+KELLLKFVEVDEKL RLILSNSKAIVSSQ+ELRIGSVVTG VQ LKPYGAF+DIGG+NG
Subjt:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING

Query:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPES
        LLHVSQISQNHI DIA VLQPGD+LKVMILSYD  +GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMA+ FRQRIAQAEAMAR + LL FQPE+
Subjt:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPES

XP_023006590.1 30S ribosomal protein S1, chloroplastic-like isoform X1 [Cucurbita maxima] | 1.5e-161 | 80.66
Query:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA
        SS AHQ CGL S    SPLSS+ +S      KRF         P+VSAAA   SP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA+LA
Subjt:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA

Query:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF
        NYDFVS++GTKVKGTVF TDANGALVDT+AKGTAYLPTQE+ I  IRHVEEAGIYPGLEE+FVII   EQED   LILSLRSVQ GLAWERCRQLQAEDF
Subjt:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF

Query:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING
        VIKGKVV   KGGVVVLVE L+GFVPFSQISA STAEELL+KELLLKFVEVDEKLSRL+LSNSKAI SSQ+ELRIGSVVTG VQ LKPYGAFIDIGG+NG
Subjt:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING

Query:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPE
        LLHVSQISQNHISDIA VLQPGD+LKVMILSYD  +GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMA+ FRQRIAQAEAMAR + LL FQPE
Subjt:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPE

XP_023006591.1 30S ribosomal protein S1, chloroplastic-like isoform X2 [Cucurbita maxima] | 1.1e-161 | 80.46
Query:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA
        SS AHQ CGL S    SPLSS+ +S      KRF         P+VSAAA   SP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA+LA
Subjt:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA

Query:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF
        NYDFVS++GTKVKGTVF TDANGALVDT+AKGTAYLPTQE+ I  IRHVEEAGIYPGLEE+FVII   EQED   LILSLRSVQ GLAWERCRQLQAEDF
Subjt:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF

Query:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING
        VIKGKVV   KGGVVVLVE L+GFVPFSQISA STAEELL+KELLLKFVEVDEKLSRL+LSNSKAI SSQ+ELRIGSVVTG VQ LKPYGAFIDIGG+NG
Subjt:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING

Query:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPES
        LLHVSQISQNHISDIA VLQPGD+LKVMILSYD  +GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMA+ FRQRIAQAEAMAR + LL FQPE+
Subjt:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPES

XP_038906593.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida] | 5.3e-188 | 90.36
Query:  SSAAHQP-CGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAAL
        SS+AHQP CGL SRYSYSPLSS RLSASSWNW RFP KEWRKLLPLVSAAA  SSPSPISNAQTKERLKLKQLFKEAYERCCT PMDGVSFTLEDFHAAL
Subjt:  SSAAHQP-CGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAAL

Query:  ANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFV
        A+YDFVS+LGTKVKGTVFCT+ANGALVD T KGTAYLPTQE+ ILKI+HVEEAGIYPGLEE+F+IIAEQEDGDGLILSLRSVQ GLAWERCRQLQAED V
Subjt:  ANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFV

Query:  IKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGL
        IKGKVVGATKGGVVVLVE LRGFVPFSQISA STAEELL+KEL LKFVEVDE+LSRLILSNSKAIV SQAELRIGSVVTGTVQ LKPYGAFIDIGGINGL
Subjt:  IKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGL

Query:  LHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG
        LHVSQISQNHI DIA VLQPGD+LKVMILSYD +KGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMA+ FRQRIAQAEAMAR  GLLG QPESG
Subjt:  LHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CCC9 30S ribosomal protein S1, chloroplastic-like isoform X1 | 1.1e-162 | 80.67
Query:  LNSRYSYSPLSSSRLSASSWNWKRFPQKEWR----KLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALANYDFV
        +NS     PLSS RLS SS +W+RF +KE      + LP+VS+AAS+   +PISNAQTKERLKLKQLFKEAYERCCT PMDGVSFTLEDFHAAL+NYDFV
Subjt:  LNSRYSYSPLSSSRLSASSWNWKRFPQKEWR----KLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALANYDFV

Query:  SQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFVIKGKVV
        S+LGTKVKGTVF TDA+GALVDTTAKGTAYLP +E+ ILKIRHVEEAGIYPGLEE+FVIIAE E    LILSLR +Q GLAWERCRQLQAED VIKGKVV
Subjt:  SQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFVIKGKVV

Query:  GATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGLLHVSQI
        GA KGGV VLVE LRGFVPFSQISA STAEELL KEL LKFVEVDE+LSRLILSN KAI +SQAELRIGSVVTGTVQ LK YGAFIDIGG+NGLLH+SQI
Subjt:  GATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGLLHVSQI

Query:  SQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG
        S NHISD+A VL+PGD LKVMILSYDH +GRVSLSTK LEPTPGDMIHNPKLVFEKADEMA+ FRQRIAQAEAMAR D LL  QPESG
Subjt:  SQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG

A0A6J1H453 30S ribosomal protein S1, chloroplastic-like isoform X1 | 9.2e-162 | 80.92
Query:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA
        SS AHQ CGL S    SPLSS+ +S      KRF         P+VSAAA   SP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHAALA
Subjt:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA

Query:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF
        NYDFVS+LGTKVKGTVF TDANGALVDT+AKGTAYLPTQE+ I  IRHVEEAGIYPGLEE+FVII   EQED   LILSLRSVQ GLAWERCRQLQAEDF
Subjt:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF

Query:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING
        VIKGKVV   KGGVVVLVE L+GFVPFSQISA STAEELL+KELLLKFVEVDEKL RLILSNSKAIVSSQ+ELRIGSVVTG VQ LKPYGAF+DIGG+NG
Subjt:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING

Query:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPE
        LLHVSQISQNHI DIA VLQPGD+LKVMILSYD  +GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMA+ FRQRIAQAEAMAR + LL FQPE
Subjt:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPE

A0A6J1H681 30S ribosomal protein S1, chloroplastic-like isoform X2 | 7.0e-162 | 80.71
Query:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA
        SS AHQ CGL S    SPLSS+ +S      KRF         P+VSAAA   SP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHAALA
Subjt:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA

Query:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF
        NYDFVS+LGTKVKGTVF TDANGALVDT+AKGTAYLPTQE+ I  IRHVEEAGIYPGLEE+FVII   EQED   LILSLRSVQ GLAWERCRQLQAEDF
Subjt:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF

Query:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING
        VIKGKVV   KGGVVVLVE L+GFVPFSQISA STAEELL+KELLLKFVEVDEKL RLILSNSKAIVSSQ+ELRIGSVVTG VQ LKPYGAF+DIGG+NG
Subjt:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING

Query:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPES
        LLHVSQISQNHI DIA VLQPGD+LKVMILSYD  +GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMA+ FRQRIAQAEAMAR + LL FQPE+
Subjt:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPES

A0A6J1KY62 30S ribosomal protein S1, chloroplastic-like isoform X2 | 5.4e-162 | 80.46
Query:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA
        SS AHQ CGL S    SPLSS+ +S      KRF         P+VSAAA   SP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA+LA
Subjt:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA

Query:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF
        NYDFVS++GTKVKGTVF TDANGALVDT+AKGTAYLPTQE+ I  IRHVEEAGIYPGLEE+FVII   EQED   LILSLRSVQ GLAWERCRQLQAEDF
Subjt:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF

Query:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING
        VIKGKVV   KGGVVVLVE L+GFVPFSQISA STAEELL+KELLLKFVEVDEKLSRL+LSNSKAI SSQ+ELRIGSVVTG VQ LKPYGAFIDIGG+NG
Subjt:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING

Query:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPES
        LLHVSQISQNHISDIA VLQPGD+LKVMILSYD  +GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMA+ FRQRIAQAEAMAR + LL FQPE+
Subjt:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPES

A0A6J1L0J3 30S ribosomal protein S1, chloroplastic-like isoform X1 | 7.0e-162 | 80.66
Query:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA
        SS AHQ CGL S    SPLSS+ +S      KRF         P+VSAAA   SP+PISNAQTKERLKLKQLFKEAYERCC  PMDGVSFTLEDFHA+LA
Subjt:  SSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALA

Query:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF
        NYDFVS++GTKVKGTVF TDANGALVDT+AKGTAYLPTQE+ I  IRHVEEAGIYPGLEE+FVII   EQED   LILSLRSVQ GLAWERCRQLQAEDF
Subjt:  NYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIA--EQEDGDGLILSLRSVQNGLAWERCRQLQAEDF

Query:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING
        VIKGKVV   KGGVVVLVE L+GFVPFSQISA STAEELL+KELLLKFVEVDEKLSRL+LSNSKAI SSQ+ELRIGSVVTG VQ LKPYGAFIDIGG+NG
Subjt:  VIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGING

Query:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPE
        LLHVSQISQNHISDIA VLQPGD+LKVMILSYD  +GR+SLSTKKLEP+PGDM+HNPKLVFEKADEMA+ FRQRIAQAEAMAR + LL FQPE
Subjt:  LLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPE

SwissProt top hits | e value | % identity | Alignment
O33698 30S ribosomal protein S1 | 1.1e-39 | 33.33
Query:  EDFHAALANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQ
        +DF  AL      SQ G  V+G V     +GA +D   K  A+LP +E+++  +  + EA +    E +F++I +Q +   + +SLR++    AW R  +
Subjt:  EDFHAALANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQ

Query:  LQAEDFVIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQA-ELRIGSVVTGTVQTLKPYGAFI
        LQ     ++ KV G+ KGGV   +E LR F+P S ++     + L  K L + F+EV+    +L+LS  +A  ++   E+ +G ++ G V  LKP+G F+
Subjt:  LQAEDFVIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQA-ELRIGSVVTGTVQTLKPYGAFI

Query:  DIGGINGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRI
        D+GG   LL ++QISQ  ++D+  + + GD ++ ++++ D+ KGR+SLSTK LE  PG+++ N   +   A + AE  R+++
Subjt:  DIGGINGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRI

P29344 30S ribosomal protein S1, chloroplastic | 7.3e-140 | 70.63
Query:  PLSSSRLSASSWNWKRFPQKEWRK--LLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALANYDFVSQLGTKVKGT
        PLS+S LS      K F  K   K    P+VSA A       +SNAQT+ER KLKQLF++AYERC   PM+GVSFT++DFH AL  YDF S++G++VKGT
Subjt:  PLSSSRLSASSWNWKRFPQKEWRK--LLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALANYDFVSQLGTKVKGT

Query:  VFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFVIKGKVVGATKGGVVVL
        VFCTDANGALVD TAK +AYLP  E+ I +I++VEEAGI PG+ E+FVII E E  D LILSLR +Q  LAWERCRQLQAED V+KGK+VGA KGGVV L
Subjt:  VFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFVIKGKVVGATKGGVVVL

Query:  VECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGLLHVSQISQNHISDIAN
        VE LRGFVPFSQIS+ S+AEELL KE+ LKFVEVDE+ SRL++SN KA+  SQA+L IGSVVTGTVQ+LKPYGAFIDIGGINGLLHVSQIS + +SDIA 
Subjt:  VECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGLLHVSQISQNHISDIAN

Query:  VLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG
        VLQPGD LKVMILS+D  +GRVSLSTKKLEPTPGDMI NPKLVFEKA+EMA+ FRQRIAQAEAMAR D +L FQPESG
Subjt:  VLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG

P46228 30S ribosomal protein S1 | 3.9e-69 | 47.44
Query:  PMDGVSFTLEDFHAALANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQN
        P   + FT EDF A L  YD+    G  V GTVF  +  GAL+D  AK  A+LP QE SI ++   EE      + E F++  E EDG  L LS+R ++ 
Subjt:  PMDGVSFTLEDFHAALANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQN

Query:  GLAWERCRQLQAEDFVIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQA-ELRIGSVVTGTVQ
          AWER RQLQ ED  ++ +V    +GG +V +E LRGF+P S IS     E+L+ +EL LKF+EVDE  +RL+LS+ +A+V  +   L +G VV G V+
Subjt:  GLAWERCRQLQAEDFVIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQA-ELRIGSVVTGTVQ

Query:  TLKPYGAFIDIGGINGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQ
         +KPYGAFIDIGG++GLLH+S+IS +HI    +V    D +KVMI+  D  +GR+SLSTK+LEP PGDM+ NP++V+EKA+EMA  +R+++ Q
Subjt:  TLKPYGAFIDIGGINGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQ

P73530 30S ribosomal protein S1 homolog A | 9.7e-68 | 47.28
Query:  VSFTLEDFHAALANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAW
        + FTLEDF A L  YD+    G  V GTVF  ++ GAL+D  AK  AY+P QE SI ++   EE  + P    +F I+ ++ +   L LS+R ++   AW
Subjt:  VSFTLEDFHAALANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAW

Query:  ERCRQLQAEDFVIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAE-LRIGSVVTGTVQTLKP
        ER RQLQAED  ++  V    +GG +V +E LRGF+P S ISA    E+L+ ++L LKF+EVDE+ +RL+LS+ +A+V  +   L +  VV G+V+ +KP
Subjt:  ERCRQLQAEDFVIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAE-LRIGSVVTGTVQTLKP

Query:  YGAFIDIGGINGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQ-RIAQAEAM
        YGAFIDIGG++GLLH+S+IS +HI    +V    D +KVMI+  D  +GR+SLSTK+LEP PG M+ +  LV E ADEMAE+FRQ R+A+A+ +
Subjt:  YGAFIDIGGINGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQ-RIAQAEAM

Q93VC7 30S ribosomal protein S1, chloroplastic | 4.6e-134 | 65.99
Query:  SSAAHQPCGLNSRYSYSPL-SSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAAL
        +S A Q  GL      SPL SSSRLS  +   K FPQ +   + P + AA + SS       QTKERL+LK++F++AYERC T PM+GV+FT++DF AA+
Subjt:  SSAAHQPCGLNSRYSYSPL-SSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAAL

Query:  ANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFV
          YDF S++GT+VKGTVF TDANGALVD +AK +AYL  +++ I +I+HVEEAGI PG+ E+FVII E E  D L+LSLR++Q  LAWERCRQLQAED +
Subjt:  ANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFV

Query:  IKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGL
        +K KV+GA KGG+V LVE LRGFVPFSQIS+ + AEELL KE+ LKFVEVDE+ ++L+LSN KA+  SQA+L IGSVV G VQ+LKPYGAFIDIGGINGL
Subjt:  IKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGL

Query:  LHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG
        LHVSQIS + +SDIA VLQPGD LKVMILS+D  +GRVSLSTKKLEPTPGDMI NPKLVFEKA+EMA+ FRQRIAQAEAMAR D +L FQPESG
Subjt:  LHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG

Arabidopsis top hits | e value | % identity | Alignment
AT1G71720.1 Nucleic acid-binding proteins superfamily | 1.6e-20 | 30.24
Query:  ILSLRSVQNGLAWERCRQLQAEDFVIKGKVVGATKGGVVVLVECLRGFVP----FSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAE
        +LS R     +AW R RQ++  +  I+ K+     GG++  +E LR F+P      +++  +  +E + +  L++   ++E  + LIL  S+ +   +  
Subjt:  ILSLRSVQNGLAWERCRQLQAEDFVIKGKVVGATKGGVVVLVECLRGFVP----FSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAE

Query:  LRIGSVVTGTVQTLKPYGAFIDIG--GINGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEM
        LR G+++ GTV  + PYGA + +G    +GLLH+S I++  I  +++VLQ  + +KV+++       ++SLS   LE  PG  I + + VF +A+EMA+ 
Subjt:  LRIGSVVTGTVQTLKPYGAFIDIG--GINGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEM

Query:  FRQRI
        +R+++
Subjt:  FRQRI

AT3G23700.1 Nucleic acid-binding proteins superfamily | 6.5e-19 | 30
Query:  WERCRQLQAEDFVIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEE-----------LLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAE-LRIG
        W+  +         +G+V G   GG+++    L GF+P+ Q+S + + +E           L+  +L +K V+ DE+  +LILS   A+    ++ + +G
Subjt:  WERCRQLQAEDFVIKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEE-----------LLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAE-LRIG

Query:  SVVTGTVQTLKPYGAFIDIG------GINGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTP
         V  G V +++ YGAFI +        + GL+HVS++S +++ D+ +VL+ GD ++V++ + D  K R++LS K+LE  P
Subjt:  SVVTGTVQTLKPYGAFIDIG------GINGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTP

AT4G29060.1 elongation factor Ts family protein | 4.3e-10 | 37.5
Query:  ELRIGSVVTGTVQTLKPYGAFIDIGGI-NGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTP
        EL  G+  TG V+ ++P+GAF+D G   +GL+HVSQ+S N + D+++V+  G  +KV ++  D    R+SL+ ++ +  P
Subjt:  ELRIGSVVTGTVQTLKPYGAFIDIGGI-NGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTP

AT4G29060.2 elongation factor Ts family protein | 4.3e-10 | 37.5
Query:  ELRIGSVVTGTVQTLKPYGAFIDIGGI-NGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTP
        EL  G+  TG V+ ++P+GAF+D G   +GL+HVSQ+S N + D+++V+  G  +KV ++  D    R+SL+ ++ +  P
Subjt:  ELRIGSVVTGTVQTLKPYGAFIDIGGI-NGLLHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTP

AT5G30510.1 ribosomal protein S1 | 3.2e-135 | 65.99
Query:  SSAAHQPCGLNSRYSYSPL-SSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAAL
        +S A Q  GL      SPL SSSRLS  +   K FPQ +   + P + AA + SS       QTKERL+LK++F++AYERC T PM+GV+FT++DF AA+
Subjt:  SSAAHQPCGLNSRYSYSPL-SSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAAL

Query:  ANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFV
          YDF S++GT+VKGTVF TDANGALVD +AK +AYL  +++ I +I+HVEEAGI PG+ E+FVII E E  D L+LSLR++Q  LAWERCRQLQAED +
Subjt:  ANYDFVSQLGTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFV

Query:  IKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGL
        +K KV+GA KGG+V LVE LRGFVPFSQIS+ + AEELL KE+ LKFVEVDE+ ++L+LSN KA+  SQA+L IGSVV G VQ+LKPYGAFIDIGGINGL
Subjt:  IKGKVVGATKGGVVVLVECLRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGL

Query:  LHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG
        LHVSQIS + +SDIA VLQPGD LKVMILS+D  +GRVSLSTKKLEPTPGDMI NPKLVFEKA+EMA+ FRQRIAQAEAMAR D +L FQPESG
Subjt:  LHVSQISQNHISDIANVLQPGDLLKVMILSYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG


Sequences
CDS sequence
ATGAGCTCATCGGCGGCGCATCAACCTTGTGGGTTGAATTCGAGGTATTCCTATTCCCCTCTTTCTTCATCGCGACTTTCAGCTTCGAGTTGGAACTGGAAGCGATTTCC
TCAAAAGGAATGGCGTAAACTGCTTCCATTAGTTTCAGCTGCAGCTTCTTCTTCTTCTCCTTCTCCCATTTCCAATGCACAGACTAAAGAGCGGCTTAAACTCAAGCAAC
TCTTCAAGGAAGCTTATGAACGCTGCTGTACTGACCCCATGGATGGCGTCTCCTTCACTCTTGAAGACTTCCATGCCGCTCTTGCAAATTACGACTTTGTTTCTCAACTC
GGAACCAAGGTTAAAGGTACTGTCTTCTGTACCGATGCTAATGGGGCGCTAGTTGATACTACTGCAAAGGGAACTGCATACTTGCCCACCCAAGAGTCTTCTATTCTTAA
AATAAGACATGTAGAAGAAGCAGGCATATATCCTGGTTTAGAAGAGCAGTTTGTAATTATTGCTGAACAGGAAGATGGTGATGGCTTAATTCTGAGCTTGAGAAGTGTCC
AGAATGGGCTTGCTTGGGAGCGATGCAGACAACTCCAAGCTGAGGATTTTGTTATCAAGGGTAAGGTTGTTGGTGCAACCAAAGGGGGAGTAGTTGTTCTTGTGGAATGT
CTTAGAGGCTTTGTTCCTTTCTCTCAGATATCAGCAAACTCAACTGCAGAGGAGCTTCTTAGTAAAGAGCTACTTCTGAAGTTTGTGGAGGTTGATGAGAAACTATCTCG
GCTAATCCTAAGTAACTCCAAGGCCATTGTCAGTAGCCAAGCAGAGCTAAGGATTGGTTCAGTAGTTACTGGAACCGTGCAGACTCTTAAACCATATGGAGCCTTTATTG
ACATTGGTGGAATTAATGGGCTTCTTCATGTTAGTCAAATCAGTCAAAATCACATATCAGATATTGCAAACGTTCTTCAGCCAGGAGATTTGCTTAAGGTCATGATTTTG
AGCTATGACCACCACAAAGGTCGTGTCAGTCTTTCTACCAAGAAATTGGAACCTACTCCTGGAGACATGATTCACAACCCAAAGCTTGTTTTTGAGAAGGCGGACGAGAT
GGCTGAGATGTTCAGGCAAAGAATAGCTCAAGCAGAAGCAATGGCTCGTGGAGATGGCCTTCTCGGATTTCAGCCTGAGAGTGGATAA
mRNA sequence
ATTTGAGCCGCTCGGGTCATTACCGGGAGAATTCTCTAAGCCTATTAAATAGCCGGAATACAAAGTCGCCGGAGTAGAGGGTGGAGAGAGGCGGAGATGAGCTCATCGGC
GGCGCATCAACCTTGTGGGTTGAATTCGAGGTATTCCTATTCCCCTCTTTCTTCATCGCGACTTTCAGCTTCGAGTTGGAACTGGAAGCGATTTCCTCAAAAGGAATGGC
GTAAACTGCTTCCATTAGTTTCAGCTGCAGCTTCTTCTTCTTCTCCTTCTCCCATTTCCAATGCACAGACTAAAGAGCGGCTTAAACTCAAGCAACTCTTCAAGGAAGCT
TATGAACGCTGCTGTACTGACCCCATGGATGGCGTCTCCTTCACTCTTGAAGACTTCCATGCCGCTCTTGCAAATTACGACTTTGTTTCTCAACTCGGAACCAAGGTTAA
AGGTACTGTCTTCTGTACCGATGCTAATGGGGCGCTAGTTGATACTACTGCAAAGGGAACTGCATACTTGCCCACCCAAGAGTCTTCTATTCTTAAAATAAGACATGTAG
AAGAAGCAGGCATATATCCTGGTTTAGAAGAGCAGTTTGTAATTATTGCTGAACAGGAAGATGGTGATGGCTTAATTCTGAGCTTGAGAAGTGTCCAGAATGGGCTTGCT
TGGGAGCGATGCAGACAACTCCAAGCTGAGGATTTTGTTATCAAGGGTAAGGTTGTTGGTGCAACCAAAGGGGGAGTAGTTGTTCTTGTGGAATGTCTTAGAGGCTTTGT
TCCTTTCTCTCAGATATCAGCAAACTCAACTGCAGAGGAGCTTCTTAGTAAAGAGCTACTTCTGAAGTTTGTGGAGGTTGATGAGAAACTATCTCGGCTAATCCTAAGTA
ACTCCAAGGCCATTGTCAGTAGCCAAGCAGAGCTAAGGATTGGTTCAGTAGTTACTGGAACCGTGCAGACTCTTAAACCATATGGAGCCTTTATTGACATTGGTGGAATT
AATGGGCTTCTTCATGTTAGTCAAATCAGTCAAAATCACATATCAGATATTGCAAACGTTCTTCAGCCAGGAGATTTGCTTAAGGTCATGATTTTGAGCTATGACCACCA
CAAAGGTCGTGTCAGTCTTTCTACCAAGAAATTGGAACCTACTCCTGGAGACATGATTCACAACCCAAAGCTTGTTTTTGAGAAGGCGGACGAGATGGCTGAGATGTTCA
GGCAAAGAATAGCTCAAGCAGAAGCAATGGCTCGTGGAGATGGCCTTCTCGGATTTCAGCCTGAGAGTGGATAATGCAATTGACTCAGCTTTGATGGGGTATCCGATTGG
ACCTACACCTGAGTTGTCTGCAGAAATTGTAAATCTCACAACCGAGAGGATTGAAGACGCTTGACACAATGTCAACTCAATTGAGATAAGTAAAGCTCTATTTGATTAAT
TAATTTTTGAAAAACTAAACCAATAAATATTATTTACATGTAAATAAGCCCAACAATTTTGAGAAACCCCAAAAATTAAATAATTATTAGTGTATTTATTATTGATTCTA
TTGAATGCACCATTTAGTAAAATATTTATCAGAACTATTACTTTTAACTTCTTTCCTAGAAAGTAATCATGGAAAGGTTCAATGACAAAAATAAAAAGAAAAAAAAGAAA
GTTGAAAGCACACAACAAAATGAAGATTAAGGTAATCACACAAATAGTAGGGGTGCAAGAAAAGAGTGTGAAGCCGCTTAAATCGATTGAAACTGCTTGTTGGTTCGAGT
TCAAACTGGTTGGTTTGGCAGTTTCGAGTTGAAGGTATACAGAACCGAAAATATTTGGTCAGGTTTGAGATTGGGTAAACACGAAATTTTGTTTTTGTTTCAAAAAATAA
ATTTGCAGTTTGCGGATTCCCTCTCCCTGGTCCTGTTCACAGGTGCACACAATTAACGGTCAACTGCAAACTGAGCTGACCAATTCGATAGTTCGG
Protein sequence
MSSSAAHQPCGLNSRYSYSPLSSSRLSASSWNWKRFPQKEWRKLLPLVSAAASSSSPSPISNAQTKERLKLKQLFKEAYERCCTDPMDGVSFTLEDFHAALANYDFVSQL
GTKVKGTVFCTDANGALVDTTAKGTAYLPTQESSILKIRHVEEAGIYPGLEEQFVIIAEQEDGDGLILSLRSVQNGLAWERCRQLQAEDFVIKGKVVGATKGGVVVLVEC
LRGFVPFSQISANSTAEELLSKELLLKFVEVDEKLSRLILSNSKAIVSSQAELRIGSVVTGTVQTLKPYGAFIDIGGINGLLHVSQISQNHISDIANVLQPGDLLKVMIL
SYDHHKGRVSLSTKKLEPTPGDMIHNPKLVFEKADEMAEMFRQRIAQAEAMARGDGLLGFQPESG
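
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is illustrative only and is not taken from CuGenDB: it assumes the standard nuclear genetic code applies to this gene, the helper name `translate` is hypothetical, and it is run here on the first and last codons of the CDS rather than the full sequence.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First nine codons and last five codons of the CDS listed above.
cds_start = "ATGAGCTCATCGGCGGCGCATCAACCT"
cds_end = "CCTGAGAGTGGATAA"

print(translate(cds_start))  # start of the protein sequence
print(translate(cds_end))    # end of the protein sequence plus the stop codon
```

The two fragments translate to MSSSAAHQP and PESG followed by a stop codon, matching the start and end of the protein sequence listed above. Pasting the full CDS into `translate` extends the same check to the whole protein.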