CuGenDBv2

Pay0012874 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0012874
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: 30S ribosomal protein S1
Genome location: chr03:18595435..18599809
RNA-Seq Expression: Pay0012874
Synteny: Pay0012874
Gene Ontology terms:
    GO:0005840 - ribosome (cellular component)
    GO:0016020 - membrane (cellular component)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR003029 - S1 domain
    IPR012340 - Nucleic acid-binding, OB-fold
    IPR022967 - RNA-binding domain, S1
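
The genome location above follows a `seqid:start..end` pattern. The following is a minimal Python sketch for parsing such a string and computing the span it covers. It assumes the coordinates are 1-based and inclusive, which the page itself does not state, and the names `Region` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED to be 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count towards the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' string such as 'chr03:18595435..18599809'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start exceeds end in {text!r}")
    return Region(match.group(1), start, end)


region = parse_location("chr03:18595435..18599809")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption the gene spans 4375 bp on chr03; this is longer than the mRNA sequence listed further down, which would be expected if the gene contains introns.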


Homology
GenBank top hits (e value, % identity, alignment)
XP_008438974.1 PREDICTED: 30S ribosomal protein S1, chloroplastic [Cucumis melo] | e value: 9.3e-134 | % identity: 71.31
Query:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS
        MAQ  TGLR  PLSS    LSKP       N  +RS  V AAVIS PIPSP T ERFKLK+ F +A +RC NAP+EG+SFTL+ F A+LEKYDFD +LG+
Subjt:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS

Query:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG
        KVKGTV  T+ NGALV+I AKS AYLPLQEA IHRIK VEEAGI+PG REEFVIIG+NE DD L LSLR IQY+LAWERCRQLQA DVVVKGKVV ANKG
Subjt:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG

Query:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI
        GV+ VVEGL+GFVPFS+I   ST EEL+NKELPLK + V+EEQ+RLVLSNRK MADS+AQL IGSVVTGTV  +  +GAF+DIGG++GLLH+S+ISHDRI
Subjt:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI

Query:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
         DIA VL+PGD LKVM+L+ DRE+G + LSTKKLEP  GDMIRNP LVF KAEEMA+ FRQR+AQA
Subjt:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

XP_016902972.1 PREDICTED: 30S ribosomal protein S1, chloroplastic-like [Cucumis melo] | e value: 2.7e-202 | % identity: 99.47
Query:  MPLSLPFSLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEK
        MPLSLPFSLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEK
Subjt:  MPLSLPFSLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEK

Query:  YDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKG
        YDFDPQLGSKVKGTVVY EANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKG
Subjt:  YDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKG

Query:  KVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHI
        KVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLR+VKFGAFVDIGGVHGLLHI
Subjt:  KVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHI

Query:  SEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
        SEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
Subjt:  SEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

XP_022138241.1 30S ribosomal protein S1, chloroplastic [Momordica charantia] | e value: 4.9e-135 | % identity: 71.86
Query:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS
        MAQ  TGLR  PLSS    LS P      QN  ARS  V AAVISSPIPSP T ERFKLK+ F DA +RC NAP+EG+SFTL+ F A+LEKYDFD ++G+
Subjt:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS

Query:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG
        KVKGTV  T+ANGALV+I AKS AYLP+QEA IHRIK VEEAGI+PG REEFVIIG+NE DD L LSLR IQY+LAWERCRQLQA DVVVKGKVV ANKG
Subjt:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG

Query:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI
        GV+ VVEGL+GFVPFS+I   ST EEL+NKELPLK + V+EEQ+RLVLSNRK MADS+AQL IGSVVTGTV  +  +GAF+DIGG++GLLH+S+ISHDRI
Subjt:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI

Query:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
         DIA VL+PGD LKVM+L+ DRE+G + LSTKKLEP  GDMIRNP LVF KAEEMA+ FRQR+AQA
Subjt:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

XP_038878013.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida] | e value: 1.2e-133 | % identity: 71.04
Query:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS
        MAQ  TGLR  PLSS    LSKP   SR   + ARS  V AAVIS PIPSP T ERFKLK+ F +A +RC NAP+EG++FTL+ F A+LEKYDFD +LG+
Subjt:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS

Query:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG
        KVKGTV  T+ NGALV+I AKS AYLP+QEA IHRIK VEEAGI+PG REEFVIIG+NE DD L LSLR IQY+LAWERCRQLQA DVVVKGKVV ANKG
Subjt:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG

Query:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI
        GV+ VVEGL+GFVPFS+I   ST EEL+NKELPLK + V+EEQ+RLVLSNRK MADS+AQL IGSVVTGTV  +  +GAF+DIGG++GLLH+S+ISHDRI
Subjt:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI

Query:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
         DI  VL+PGD LKVM+L+ DRE+G + LSTKKLEP  GDMIRNP LVF KAEEMA+ FRQR+AQA
Subjt:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

XP_038885297.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida] | e value: 4.0e-169 | % identity: 86.58
Query:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS
        MAQ  TGLR EP  SIS CLSKPL  S  QN   RSF V+AAVIS PIP+P TTERFKLKQTFNDAADRC NAPMEGVSFTLQ FLASLEKY FDPQLG+
Subjt:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS

Query:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKGG
        KVKGTVVYTEANGALVEIAAKSPAYLPL EA IHRIKRVEEAGIYPGFREEFVIIG+NEDD LTLSLR IQYELAWERCRQLQA DV+VKGKVV AN GG
Subjt:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKGG

Query:  VLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRIL
        VLVVVEGLKGFVP+SEILMIST EELINKELPLK LVV EE+TR+VLSNRK+MADSKAQL IG+VVTGTVLR+VKFGAFVDIGGVHGLLHISEISHDRIL
Subjt:  VLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRIL

Query:  DIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
        DIA VLKPGDILKVM+LNI+ EKGHIRLSTKKLEPN GDMI NPGLVF KAEEMA RFRQRLAQA
Subjt:  DIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

TrEMBL top hits (e value, % identity, alignment)
A0A1S3AXL6 30S ribosomal protein S1, chloroplastic | e value: 4.5e-134 | % identity: 71.31
Query:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS
        MAQ  TGLR  PLSS    LSKP       N  +RS  V AAVIS PIPSP T ERFKLK+ F +A +RC NAP+EG+SFTL+ F A+LEKYDFD +LG+
Subjt:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS

Query:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG
        KVKGTV  T+ NGALV+I AKS AYLPLQEA IHRIK VEEAGI+PG REEFVIIG+NE DD L LSLR IQY+LAWERCRQLQA DVVVKGKVV ANKG
Subjt:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG

Query:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI
        GV+ VVEGL+GFVPFS+I   ST EEL+NKELPLK + V+EEQ+RLVLSNRK MADS+AQL IGSVVTGTV  +  +GAF+DIGG++GLLH+S+ISHDRI
Subjt:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI

Query:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
         DIA VL+PGD LKVM+L+ DRE+G + LSTKKLEP  GDMIRNP LVF KAEEMA+ FRQR+AQA
Subjt:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

A0A1S4E424 30S ribosomal protein S1, chloroplastic-like | e value: 1.3e-202 | % identity: 99.47
Query:  MPLSLPFSLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEK
        MPLSLPFSLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEK
Subjt:  MPLSLPFSLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEK

Query:  YDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKG
        YDFDPQLGSKVKGTVVY EANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKG
Subjt:  YDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKG

Query:  KVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHI
        KVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLR+VKFGAFVDIGGVHGLLHI
Subjt:  KVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHI

Query:  SEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
        SEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
Subjt:  SEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

A0A5A7SUN2 30S ribosomal protein S1 | e value: 1.3e-202 | % identity: 99.47
Query:  MPLSLPFSLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEK
        MPLSLPFSLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEK
Subjt:  MPLSLPFSLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEK

Query:  YDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKG
        YDFDPQLGSKVKGTVVY EANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKG
Subjt:  YDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKG

Query:  KVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHI
        KVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLR+VKFGAFVDIGGVHGLLHI
Subjt:  KVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHI

Query:  SEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
        SEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
Subjt:  SEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

A0A6J1C966 30S ribosomal protein S1, chloroplastic | e value: 2.4e-135 | % identity: 71.86
Query:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS
        MAQ  TGLR  PLSS    LS P      QN  ARS  V AAVISSPIPSP T ERFKLK+ F DA +RC NAP+EG+SFTL+ F A+LEKYDFD ++G+
Subjt:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS

Query:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG
        KVKGTV  T+ANGALV+I AKS AYLP+QEA IHRIK VEEAGI+PG REEFVIIG+NE DD L LSLR IQY+LAWERCRQLQA DVVVKGKVV ANKG
Subjt:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG

Query:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI
        GV+ VVEGL+GFVPFS+I   ST EEL+NKELPLK + V+EEQ+RLVLSNRK MADS+AQL IGSVVTGTV  +  +GAF+DIGG++GLLH+S+ISHDRI
Subjt:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI

Query:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
         DIA VL+PGD LKVM+L+ DRE+G + LSTKKLEP  GDMIRNP LVF KAEEMA+ FRQR+AQA
Subjt:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

A0A6J1IAT2 30S ribosomal protein S1, chloroplastic | e value: 1.4e-130 | % identity: 69.13
Query:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS
        MAQ   GLR  PLSS    LSKP      QN  ARS  V AAVI+SPIPSPL  ERFKLK+ F +A +RC NAP+EG+SFT++ F +++EKYDF+ ++G+
Subjt:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS

Query:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG
        KVKGTV  T++NGALV+I AKS AYLPLQEA IHRIK VEEAGIYPG R+EFVIIG+NE DD L LSLR IQY+LAWERCRQLQA DVVVKGKVV ANKG
Subjt:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG

Query:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI
        GV+ VVEGL+GFVPFS+I   ST EEL+ KE+PLK + V+EEQ+RLVLSNRK +ADS+AQL IGSVV GTV  +  +GAF+DIGGV+GLLH+S+ISHDRI
Subjt:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI

Query:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
         DIA VL+PGD LKVM+L+ DRE+G + LSTKKLEP  GDMIRNP LVF KAEEMA+ FRQR+A A
Subjt:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

SwissProt top hits (e value, % identity, alignment)
P29344 30S ribosomal protein S1, chloroplastic | e value: 1.5e-123 | % identity: 65.49
Query:  SLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQL
        SL  Q   GLR  PLS+ +  LSKP   S K     R   +++AV    + +  T ER KLKQ F DA +RC NAPMEGVSFT+  F  +L+KYDF+ ++
Subjt:  SLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQL

Query:  GSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSAN
        GS+VKGTV  T+ANGALV+I AKS AYLPL EA I+RIK VEEAGI PG REEFVIIG+NE DD L LSLR IQYELAWERCRQLQA DVVVKGK+V AN
Subjt:  GSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSAN

Query:  KGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHD
        KGGV+ +VEGL+GFVPFS+I   S+ EEL+ KE+PLK + V+EEQ+RLV+SNRK MADS+AQL IGSVVTGTV  +  +GAF+DIGG++GLLH+S+ISHD
Subjt:  KGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHD

Query:  RILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
        R+ DIA VL+PGD LKVM+L+ DRE+G + LSTKKLEP  GDMIRNP LVF KAEEMA+ FRQR+AQA
Subjt:  RILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

P46228 30S ribosomal protein S1 | e value: 2.3e-71 | % identity: 49.32
Query:  PMEGVSFTLQRFLASLEKYDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYE
        P   + FT + F A L++YD+    G  V GTV   E  GAL++I AK+ A+LP+QE SI+R++  EE       RE F++  +NED  LTLS+R I+Y 
Subjt:  PMEGVSFTLQRFLASLEKYDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYE

Query:  LAWERCRQLQAADVVVKGKVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKA-QLEIGSVVTGTVLR
         AWER RQLQ  D  V+ +V + N+GG LV +EGL+GF+P S I      E+L+ +ELPLK L V+E++ RLVLS+R+ + + K  +LE+G VV G V  
Subjt:  LAWERCRQLQAADVVVKGKVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKA-QLEIGSVVTGTVLR

Query:  IVKFGAFVDIGGVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQ
        I  +GAF+DIGGV GLLHISEISHD I     V    D +KVM++++D E+G I LSTK+LEP  GDM+RNP +V+ KAEEMA ++R++L Q
Subjt:  IVKFGAFVDIGGVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQ

P51345 30S ribosomal protein S1, chloroplastic | e value: 2.8e-40 | % identity: 37.35
Query:  SFTLQRFLASLEKYDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWER
        SFT + F A L+KY +D  LG  V GT+   E NG LV+I     AYLP+QE S ++      +      RE F++  + E   L LS+R ++Y  AW+R
Subjt:  SFTLQRFLASLEKYDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWER

Query:  CRQLQAADVVVKGKVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRK-VMADSKAQLEIGSVVTGTVLRIVKFG
         RQL A D ++  ++   NKGG++V +EG+ GFVP S +   S      NK + LKLL VEE+   L+LS+R+ ++A + + L +G+++ G + +I  +G
Subjt:  CRQLQAADVVVKGKVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRK-VMADSKAQLEIGSVVTGTVLRIVKFG

Query:  AFVDIGGVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLE
         F+  G + GL+HISEI+  ++  I    K GD +K +++++D+++G + LS K L+
Subjt:  AFVDIGGVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLE

P73530 30S ribosomal protein S1 homolog A | e value: 2.7e-67 | % identity: 49.66
Query:  VSFTLQRFLASLEKYDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWE
        + FTL+ F A L+KYD+    G  V GTV   E+ GAL++I AK+ AY+P+QE SI+R+   EE       RE F++  +NED  LTLS+R I+Y  AWE
Subjt:  VSFTLQRFLASLEKYDFDPQLGSKVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWE

Query:  RCRQLQAADVVVKGKVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQ-LEIGSVVTGTVLRIVKF
        R RQLQA D  V+  V + N+GG LV +EGL+GF+P S I      E+L+ ++LPLK L V+EE+ RLVLS+R+ + + K   LE+  VV G+V  I  +
Subjt:  RCRQLQAADVVVKGKVVSANKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQ-LEIGSVVTGTVLRIVKF

Query:  GAFVDIGGVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQ-RLAQA
        GAF+DIGGV GLLHISEISHD I     V    D +KVM++++D E+G I LSTK+LEP  G M+++  LV   A+EMA  FRQ RLA+A
Subjt:  GAFVDIGGVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQ-RLAQA

Q93VC7 30S ribosomal protein S1, chloroplastic | e value: 2.5e-118 | % identity: 61.2
Query:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS
        +AQ  +GLR  PLSS S  LS+   ++  QN +A     + A ++  + S  T ER +LK+ F DA +RC  +PMEGV+FT+  F A++E+YDF+ ++G+
Subjt:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS

Query:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG
        +VKGTV  T+ANGALV+I+AKS AYL +++A IHRIK VEEAGI PG  EEFVIIG+NE DD L LSLR IQYELAWERCRQLQA DV+VK KV+ ANKG
Subjt:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG

Query:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI
        G++ +VEGL+GFVPFS+I   +  EEL+ KE+PLK + V+EEQT+LVLSNRK +ADS+AQL IGSVV G V  +  +GAF+DIGG++GLLH+S+ISHDR+
Subjt:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI

Query:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
         DIA VL+PGD LKVM+L+ DR++G + LSTKKLEP  GDMIRNP LVF KAEEMA+ FRQR+AQA
Subjt:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA

Arabidopsis top hits (e value, % identity, alignment)
AT1G71720.1 Nucleic acid-binding proteins superfamily | e value: 2.8e-19 | % identity: 32.84
Query:  LSLRPIQYELAWERCRQLQAADVVVKGKVVSANKGGVLVVVEGLKGFVPFSEIL-MISTGEEL---INKELPLKLLVVEEEQTRLVLSNRKVMADSKAQL
        LS R     +AW R RQ++  +  ++ K+   N GG+L  +EGL+ F+P  E++  ++T  EL   + +   +++  + E++  L+LS +  +A  K  L
Subjt:  LSLRPIQYELAWERCRQLQAADVVVKGKVVSANKGGVLVVVEGLKGFVPFSEIL-MISTGEEL---INKELPLKLLVVEEEQTRLVLSNRKVMADSKAQL

Query:  EIGSVVTGTVLRIVKFGAFVDIG--GVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRF
          G+++ GTV++I+ +GA V +G     GLLHIS I+  RI  ++ VL+  + +KV+V+        I LS   LE   G  I +   VF +AEEMA+++
Subjt:  EIGSVVTGTVLRIVKFGAFVDIG--GVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRF

Query:  RQRL
        R+++
Subjt:  RQRL

AT3G11964.1 RNA binding;RNA binding | e value: 2.9e-08 | % identity: 28.18
Query:  WERCRQLQAADVVVKGKVVSA-NKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPL------KLLVVEEEQTRLVLSNRKVMADSK--------AQL
        +ER   L + D+ V+G V +  +KG  +++   ++  V  S +      E    KE P+      ++L VE    R+ ++ + V A  +         +L
Subjt:  WERCRQLQAADVVVKGKVVSA-NKGGVLVVVEGLKGFVPFSEILMISTGEELINKELPL------KLLVVEEEQTRLVLSNRKVMADSK--------AQL

Query:  EIGSVVTGTVLRIVKFGAFVDIG--GVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGD
         +G +++G + R+  FG F+DI   G+ GL HIS++S DR+ ++    K G+ ++  +L +D EK  I L  K     NGD
Subjt:  EIGSVVTGTVLRIVKFGAFVDIG--GVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGD

AT3G23700.1 Nucleic acid-binding proteins superfamily | e value: 2.6e-17 | % identity: 32.2
Query:  WERCRQLQAADVVVKGKVVSANKGGVLVVVEGLKGFVPFSEILMISTGEE-----------LINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQ-LEIG
        W+  +    +    +G+V   N GG+L+    L GF+P+ ++    + +E           L+  +LP+K++  +EE  +L+LS +  +    +Q + +G
Subjt:  WERCRQLQAADVVVKGKVVSANKGGVLVVVEGLKGFVPFSEILMISTGEE-----------LINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQ-LEIG

Query:  SVVTGTVLRIVKFGAFV----DIGGVH--GLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLE
         V  G V  +  +GAF+    D G  H  GL+H+SE+S D + D+  VL+ GD ++V+V NID+EK  I LS K+LE
Subjt:  SVVTGTVLRIVKFGAFV----DIGGVH--GLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLE

AT5G14580.1 polyribonucleotide nucleotidyltransferase, putative | e value: 7.6e-09 | % identity: 34.86
Query:  LVVEEEQTRLVLSNRKVMADSKAQLE--------IGSVVTGTVLRIVKFGAFVDI-GGVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHI
        L ++     +V  N+ VM  ++ Q++        +G V  GTV  I ++GAFV+  GG  GLLH+SE+SH+ +  ++ VL  G  +  M +  D  +G+I
Subjt:  LVVEEEQTRLVLSNRKVMADSKAQLE--------IGSVVTGTVLRIVKFGAFVDI-GGVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDREKGHI

Query:  RLSTKKLEP
        +LS K L P
Subjt:  RLSTKKLEP

AT5G30510.1 ribosomal protein S1 | e value: 1.8e-119 | % identity: 61.2
Query:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS
        +AQ  +GLR  PLSS S  LS+   ++  QN +A     + A ++  + S  T ER +LK+ F DA +RC  +PMEGV+FT+  F A++E+YDF+ ++G+
Subjt:  MAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGS

Query:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG
        +VKGTV  T+ANGALV+I+AKS AYL +++A IHRIK VEEAGI PG  EEFVIIG+NE DD L LSLR IQYELAWERCRQLQA DV+VK KV+ ANKG
Subjt:  KVKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNE-DDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKG

Query:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI
        G++ +VEGL+GFVPFS+I   +  EEL+ KE+PLK + V+EEQT+LVLSNRK +ADS+AQL IGSVV G V  +  +GAF+DIGG++GLLH+S+ISHDR+
Subjt:  GVLVVVEGLKGFVPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRI

Query:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
         DIA VL+PGD LKVM+L+ DR++G + LSTKKLEP  GDMIRNP LVF KAEEMA+ FRQR+AQA
Subjt:  LDIAGVLKPGDILKVMVLNIDREKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
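
In the alignment blocks above, the unlabeled middle line follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows one way a percent identity could be recomputed from a query line and its midline. It assumes identity is defined as identical columns divided by total aligned columns, and that the leading `Query:` label and its padding have already been stripped; `percent_identity` is a hypothetical helper, not part of any CuGenDB tooling.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity over aligned columns.

    Letters on the midline are counted as identities; '+' and spaces are not.
    ASSUMPTION: identity = identical columns / aligned columns.
    """
    if not query:
        raise ValueError("empty alignment")
    # Trailing mismatches may have been lost as trimmed whitespace,
    # so pad the midline to the same width as the query line.
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(query)


# One mismatch in eleven aligned columns.
print(round(percent_identity("VVYTEANGALV", "VVY EANGALV"), 2))
```

For multi-line alignments the query and midline segments would be concatenated before calling the function. As a rough check, 372 identities over 374 columns gives 99.47 under this definition, which would match the value reported for the near-identical hits if their alignments span 374 columns with two non-identical positions.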


Sequences
CDS sequence
ATGCCATTGTCATTGCCATTTTCATTGATGGCTCAGCCACTTACAGGGTTGAGATCTGAGCCTCTATCCTCAATTTCCTTCTGTCTCTCTAAGCCACTTGGTCGAAGCCG
TAAGCAGAACACTGCAGCCCGTTCATTCCGCGTTTTGGCTGCAGTAATATCCAGCCCCATTCCTTCTCCTCTCACCACAGAGCGTTTCAAGCTCAAGCAAACCTTCAATG
ATGCGGCTGATCGCTGCCATAATGCTCCCATGGAAGGTGTTTCCTTCACTCTCCAGCGCTTCCTTGCGTCTCTTGAGAAATACGACTTCGATCCTCAGTTGGGATCCAAG
GTGAAAGGTACTGTCGTCTATACAGAAGCTAATGGAGCACTTGTGGAGATTGCTGCCAAGTCACCTGCATACTTGCCGTTGCAGGAGGCTTCCATTCATAGAATTAAACG
TGTAGAAGAAGCAGGAATATATCCTGGTTTTAGAGAGGAGTTTGTTATTATAGGTGATAATGAAGATGATTGCTTGACTTTGAGCTTGAGGCCCATCCAATATGAACTTG
CTTGGGAAAGGTGCAGACAGCTTCAGGCAGCGGATGTTGTTGTCAAGGGTAAGGTGGTTAGTGCGAACAAAGGGGGAGTTCTGGTAGTTGTGGAAGGCCTTAAAGGATTT
GTTCCCTTCTCAGAGATATTAATGATATCAACTGGTGAAGAGCTTATCAACAAGGAGCTTCCTCTGAAGTTGCTGGTGGTTGAAGAGGAACAAACAAGGCTTGTCCTCAG
TAACCGTAAGGTCATGGCTGACAGCAAAGCACAACTTGAAATTGGATCAGTGGTCACTGGAACAGTTCTAAGAATTGTAAAATTTGGTGCCTTTGTTGACATTGGTGGAG
TCCATGGTCTTCTTCACATCAGTGAGATAAGTCATGATCGCATACTAGATATTGCAGGAGTTCTTAAGCCTGGAGACATTCTCAAGGTCATGGTATTGAACATTGATCGT
GAAAAAGGCCATATTCGTCTTTCTACCAAGAAGCTAGAGCCTAATAATGGGGACATGATTCGCAATCCCGGGCTTGTTTTCAATAAGGCTGAGGAAATGGCACGTAGATT
TAGGCAAAGATTAGCTCAAGCATAG
mRNA sequence
GGTCAACACTGTGAATATATAAAGGAAGTGGATCTGTTAGGAACACTCTCTCTGGTAGAGCTTGGAAAACGCCGCGGCAGGAAGACGAAGATGCCATTGTCATTGCCATT
TTCATTGATGGCTCAGCCACTTACAGGGTTGAGATCTGAGCCTCTATCCTCAATTTCCTTCTGTCTCTCTAAGCCACTTGGTCGAAGCCGTAAGCAGAACACTGCAGCCC
GTTCATTCCGCGTTTTGGCTGCAGTAATATCCAGCCCCATTCCTTCTCCTCTCACCACAGAGCGTTTCAAGCTCAAGCAAACCTTCAATGATGCGGCTGATCGCTGCCAT
AATGCTCCCATGGAAGGTGTTTCCTTCACTCTCCAGCGCTTCCTTGCGTCTCTTGAGAAATACGACTTCGATCCTCAGTTGGGATCCAAGGTGAAAGGTACTGTCGTCTA
TACAGAAGCTAATGGAGCACTTGTGGAGATTGCTGCCAAGTCACCTGCATACTTGCCGTTGCAGGAGGCTTCCATTCATAGAATTAAACGTGTAGAAGAAGCAGGAATAT
ATCCTGGTTTTAGAGAGGAGTTTGTTATTATAGGTGATAATGAAGATGATTGCTTGACTTTGAGCTTGAGGCCCATCCAATATGAACTTGCTTGGGAAAGGTGCAGACAG
CTTCAGGCAGCGGATGTTGTTGTCAAGGGTAAGGTGGTTAGTGCGAACAAAGGGGGAGTTCTGGTAGTTGTGGAAGGCCTTAAAGGATTTGTTCCCTTCTCAGAGATATT
AATGATATCAACTGGTGAAGAGCTTATCAACAAGGAGCTTCCTCTGAAGTTGCTGGTGGTTGAAGAGGAACAAACAAGGCTTGTCCTCAGTAACCGTAAGGTCATGGCTG
ACAGCAAAGCACAACTTGAAATTGGATCAGTGGTCACTGGAACAGTTCTAAGAATTGTAAAATTTGGTGCCTTTGTTGACATTGGTGGAGTCCATGGTCTTCTTCACATC
AGTGAGATAAGTCATGATCGCATACTAGATATTGCAGGAGTTCTTAAGCCTGGAGACATTCTCAAGGTCATGGTATTGAACATTGATCGTGAAAAAGGCCATATTCGTCT
TTCTACCAAGAAGCTAGAGCCTAATAATGGGGACATGATTCGCAATCCCGGGCTTGTTTTCAATAAGGCTGAGGAAATGGCACGTAGATTTAGGCAAAGATTAGCTCAAG
CATAGGCATTGGGACGTGCATGCTTGCTTAGTTTTCAGCCTGAGGGTGGATTAACTTTGAGTATCGATGGAATATTGGGTCCAGTTACCCAAAGTTGCCTGTAAAGAGAG
GATTCAACAATATGTTCCTCCCTCAGCTAAAGAATGAAAACACAACTAAGGCTCTAAATGCATTGTCACATGCGCCTACTCGTTAATACAGATATTCGATTTTGCGTTAT
CCAAATAGTTAACAAAAATAAAGTTAATAACATCAAAGACTAAAATTACCTTTTTTCATTTTTTTTTCCTTTTTTAAAATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT
TCAAAAGATGATTTTTTTTTTATAAAAGTTTGGGACTATCTTGATGTGATAGAAGTTGAGGGATGAAATTGATATTGAAATATATTTAAATTGTTTCTTCAATTTAGACG
ACTTCAGGGTGGAATTCGAATGATGGATTGAGATGGTAATTTGTGTGTGAATTTTTAAATCTGGGCCAAATCATGGTGTAAATCATCTTTCTTTTTTTTCCTTTTTTTTT
TTTTTTGGAGTTTGGTGCAAATAGCATAATTCTTTTGATTTTTGTCTAAAACGAAATTGCAAGATTTTTCTTTCATATCAAATTTGTTTTGATTTAACTACTTTTAAAAA
TATTTAAGGGTCATTTGGGGAAGGGTAAAGTTATGGAATTCATTTTATGATAATTCTAGGGTTATGATAAAGCTATGTGATGATAATATATGTTTGAGGGAAGATTTATT
ATTGGTAGTGTTATAATAGTATGTGTTTGGGAGAGATTATGATAATAGTTTTATG
Protein sequence
MPLSLPFSLMAQPLTGLRSEPLSSISFCLSKPLGRSRKQNTAARSFRVLAAVISSPIPSPLTTERFKLKQTFNDAADRCHNAPMEGVSFTLQRFLASLEKYDFDPQLGSK
VKGTVVYTEANGALVEIAAKSPAYLPLQEASIHRIKRVEEAGIYPGFREEFVIIGDNEDDCLTLSLRPIQYELAWERCRQLQAADVVVKGKVVSANKGGVLVVVEGLKGF
VPFSEILMISTGEELINKELPLKLLVVEEEQTRLVLSNRKVMADSKAQLEIGSVVTGTVLRIVKFGAFVDIGGVHGLLHISEISHDRILDIAGVLKPGDILKVMVLNIDR
EKGHIRLSTKKLEPNNGDMIRNPGLVFNKAEEMARRFRQRLAQA
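
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal Python sketch of that translation, assuming the standard nuclear genetic code (translation table 1) and a CDS that starts in frame; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration. Only short excerpts copied from the start and end of the CDS are used here, not the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame DNA coding sequence into a protein string.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    With to_stop=True, translation ends at the first stop codon.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGCCATTGTCATTGCCATTTTCATTGATG"))
# Last four codons of the CDS, including the TAG stop codon.
print(translate("GCTCAAGCATAG"))
```

The two excerpts translate to `MPLSLPFSLM` and `AQA`, which agree with the first and last residues of the protein sequence shown on this page. For real work an established library such as Biopython would be the more robust choice.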