; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G02620 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G02620
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description30S ribosomal protein S1
Genome locationClcChr08:5208453..5214493
RNA-Seq ExpressionClc08G02620
SyntenyClc08G02620
Gene Ontology termsGO:0005840 - ribosome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008438974.1 PREDICTED: 30S ribosomal protein S1, chloroplastic [Cucumis melo]3.1e-15974.06Show/hide
Query:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ
        +A MA Q TGLRC PL   SSRLSKP    H+ NK +RS PV AA+IS PIP+PQT ERFKLK+ F +A +RCRN+P+EG+SFTL+ F A+LEKYDFD +
Subjt:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ

Query:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA
        LG KVKGTV  ++ NGA V+I AKS AYLPLQE CIHRIK VEEAGI+PGLREEFVIIGENE DDSL LSLR IQY+LAWERCRQLQAEDVVVKGKVVDA
Subjt:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH
        NKGGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS A+L IGSVVTGTV  LK YGAF+DIGGI+GLLH+S+ISH
Subjt:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH

Query:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE
        DRI D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEALARAD+L FQPE  LTL +DGIL P TPE
Subjt:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE

Query:  L
        L
Subjt:  L

XP_016902972.1 PREDICTED: 30S ribosomal protein S1, chloroplastic-like [Cucumis melo]9.3e-17285.68Show/hide
Query:  MPLALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFD
        +P +LMA   TGLR EPL SIS  LSKPL RS  QN AARSF VLAA+ISSPIP+P TTERFKLKQTF DAADRC N+PMEGVSFTLQ FLASLEKYDFD
Subjt:  MPLALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFD

Query:  PQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVD
        PQLG+KVKGTVVY EANGA VEIAAKSPAYLPLQE  IHRIKRVEEAGIYPG REEFVIIG+NEDD LTLSLRPIQYELAWERCRQLQA DVVVKGKVV 
Subjt:  PQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVD

Query:  ANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEIS
        ANKGGVLVVVEGLKGFVPFSEILMIST EELINKELPLK LVV EEQTR+VLSNRKVMADS A+L IGSVVTGTVL+L ++GAFVDIGG+HGLLHISEIS
Subjt:  ANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEIS

Query:  HDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA
        HDRI D+A VLKPGD+LKVM+LNIDREKGHIRLSTKKLEPN GDMI NPGLVF KAEEMA RFRQRLAQA
Subjt:  HDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA

XP_022138241.1 30S ribosomal protein S1, chloroplastic [Momordica charantia]4.8e-16074.06Show/hide
Query:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ
        +A MA Q TGLRC PL   SSRLS P    H+QNK ARS PV AA+ISSPIP+PQT ERFKLK+ F DA +RCRN+P+EG+SFTL+ F A+LEKYDFD +
Subjt:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ

Query:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA
        +G KVKGTV  ++ANGA V+I AKS AYLP+QE CIHRIK VEEAGI+PG+REEFVIIGENE DDSL LSLR IQY+LAWERCRQLQAEDVVVKGKVVDA
Subjt:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH
        NKGGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS A+L IGSVVTGTV  LK YGAF+DIGGI+GLLH+S+ISH
Subjt:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH

Query:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE
        DRI D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTL +DGIL P TPE
Subjt:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE

Query:  L
        L
Subjt:  L

XP_038878013.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]2.6e-15873.32Show/hide
Query:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ
        +A MA Q TGLRC PL   SSRLSKP    H+Q+K ARS PV AA+IS PIP+PQT ERFKLK+ F +A +RCRN+P+EG++FTL+ F A+LEKYDFD +
Subjt:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ

Query:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA
        LG KVKGTV  ++ NGA V+I AKS AYLP+QE CIHRIK VEEAGI+PGLREEFVIIGENE DDSL LSLR IQY+LAWERCRQLQAEDVVVKGKVVDA
Subjt:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH
        NKGGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS A+L IGSVVTGTV  LK YGAF+DIGGI+GLLH+S+ISH
Subjt:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH

Query:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE
        DRI D+  VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTL +DGIL P TPE
Subjt:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE

Query:  L
        L
Subjt:  L

XP_038885297.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]2.4e-19688.66Show/hide
Query:  MAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQLGA
        MA QCTGLRCEP FSISS LSKPL  SHMQN   RSFPV+AA+IS PIPTPQTTERFKLKQTF DAADRCRN+PMEGVSFTLQ FLASLEKY FDPQLGA
Subjt:  MAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQLGA

Query:  KVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKGG
        KVKGTVVY+EANGA VEIAAKSPAYLPL E CIHRIKRVEEAGIYPG REEFVIIGENEDDSLTLSLR IQYELAWERCRQLQAEDV+VKGKVV AN GG
Subjt:  KVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKGG

Query:  VLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISHDRIR
        VLVVVEGLKGFVP+SEILMISTAEELINKELPLKFLVVNEE+TR+VLSNRK+MADS A+LAIG+VVTGTVL+L ++GAFVDIGG+HGLLHISEISHDRI 
Subjt:  VLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISHDRIR

Query:  DVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPEL
        D+AAVLKPGD+LKVMILNI+ EKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRL L+SDGIL P TPEL
Subjt:  DVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPEL

TrEMBL top hitse value%identityAlignment
A0A1S3AXL6 30S ribosomal protein S1, chloroplastic1.5e-15974.06Show/hide
Query:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ
        +A MA Q TGLRC PL   SSRLSKP    H+ NK +RS PV AA+IS PIP+PQT ERFKLK+ F +A +RCRN+P+EG+SFTL+ F A+LEKYDFD +
Subjt:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ

Query:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA
        LG KVKGTV  ++ NGA V+I AKS AYLPLQE CIHRIK VEEAGI+PGLREEFVIIGENE DDSL LSLR IQY+LAWERCRQLQAEDVVVKGKVVDA
Subjt:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH
        NKGGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS A+L IGSVVTGTV  LK YGAF+DIGGI+GLLH+S+ISH
Subjt:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH

Query:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE
        DRI D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEALARAD+L FQPE  LTL +DGIL P TPE
Subjt:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE

Query:  L
        L
Subjt:  L

A0A1S4E424 30S ribosomal protein S1, chloroplastic-like4.5e-17285.68Show/hide
Query:  MPLALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFD
        +P +LMA   TGLR EPL SIS  LSKPL RS  QN AARSF VLAA+ISSPIP+P TTERFKLKQTF DAADRC N+PMEGVSFTLQ FLASLEKYDFD
Subjt:  MPLALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFD

Query:  PQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVD
        PQLG+KVKGTVVY EANGA VEIAAKSPAYLPLQE  IHRIKRVEEAGIYPG REEFVIIG+NEDD LTLSLRPIQYELAWERCRQLQA DVVVKGKVV 
Subjt:  PQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVD

Query:  ANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEIS
        ANKGGVLVVVEGLKGFVPFSEILMIST EELINKELPLK LVV EEQTR+VLSNRKVMADS A+L IGSVVTGTVL+L ++GAFVDIGG+HGLLHISEIS
Subjt:  ANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEIS

Query:  HDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA
        HDRI D+A VLKPGD+LKVM+LNIDREKGHIRLSTKKLEPN GDMI NPGLVF KAEEMA RFRQRLAQA
Subjt:  HDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA

A0A5A7SUN2 30S ribosomal protein S14.5e-17285.68Show/hide
Query:  MPLALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFD
        +P +LMA   TGLR EPL SIS  LSKPL RS  QN AARSF VLAA+ISSPIP+P TTERFKLKQTF DAADRC N+PMEGVSFTLQ FLASLEKYDFD
Subjt:  MPLALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFD

Query:  PQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVD
        PQLG+KVKGTVVY EANGA VEIAAKSPAYLPLQE  IHRIKRVEEAGIYPG REEFVIIG+NEDD LTLSLRPIQYELAWERCRQLQA DVVVKGKVV 
Subjt:  PQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVD

Query:  ANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEIS
        ANKGGVLVVVEGLKGFVPFSEILMIST EELINKELPLK LVV EEQTR+VLSNRKVMADS A+L IGSVVTGTVL+L ++GAFVDIGG+HGLLHISEIS
Subjt:  ANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEIS

Query:  HDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA
        HDRI D+A VLKPGD+LKVM+LNIDREKGHIRLSTKKLEPN GDMI NPGLVF KAEEMA RFRQRLAQA
Subjt:  HDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQA

A0A5A7UEP7 30S ribosomal protein S12.3e-15569.72Show/hide
Query:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ
        +A MA Q TGLRC PL   SSRLSKP    H+ NK +RS PV AA+IS PIP+PQT ERFKLK+ F +A +RCRN+P+EG+SFTL+ F A+LEKYDFD +
Subjt:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ

Query:  LGAK-------------------------VKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQ
        LG K                         VKGTV  ++ NGA V+I AKS AYLPLQE CIHRIK VEEAGI+PGLREEFVIIGENE DDSL LSLR IQ
Subjt:  LGAK-------------------------VKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQ

Query:  YELAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVL
        Y+LAWERCRQLQAEDVVVKGKVVDANKGGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS A+L IGSVVTGTV 
Subjt:  YELAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVL

Query:  KLKEYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARA
         LK YGAF+DIGGI+GLLH+S+ISHDRI D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEALARA
Subjt:  KLKEYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARA

Query:  DLLSFQPEGRLTLNSDGILDPATPEL
        D+L FQPE  LTL +DGIL P TPEL
Subjt:  DLLSFQPEGRLTLNSDGILDPATPEL

A0A6J1C966 30S ribosomal protein S1, chloroplastic2.3e-16074.06Show/hide
Query:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ
        +A MA Q TGLRC PL   SSRLS P    H+QNK ARS PV AA+ISSPIP+PQT ERFKLK+ F DA +RCRN+P+EG+SFTL+ F A+LEKYDFD +
Subjt:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ

Query:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA
        +G KVKGTV  ++ANGA V+I AKS AYLP+QE CIHRIK VEEAGI+PG+REEFVIIGENE DDSL LSLR IQY+LAWERCRQLQAEDVVVKGKVVDA
Subjt:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH
        NKGGV+ VVEGL+GFVPFS+I   STAEEL+NKELPLKF+ V+EEQ+R+VLSNRK MADS A+L IGSVVTGTV  LK YGAF+DIGGI+GLLH+S+ISH
Subjt:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH

Query:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE
        DRI D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTL +DGIL P TPE
Subjt:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE

Query:  L
        L
Subjt:  L

SwissProt top hitse value%identityAlignment
O33698 30S ribosomal protein S18.5e-4335.96Show/hide
Query:  SPMEGVSFTLQHFLASLEKYDFDPQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVII-GENEDDSLTLSLRPIQ
        SP    + +   F  +LE    D Q G  V+G V     +GA+++I  K+PA+LP +E  +H +  + EA +      EF++I  +NED  +T+SLR + 
Subjt:  SPMEGVSFTLQHFLASLEKYDFDPQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVII-GENEDDSLTLSLRPIQ

Query:  YELAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNA-ELAIGSVVTGTV
         E AW R  +LQ     V+ KV  +NKGGV   +EGL+ F+P S +      + L  K L + FL VN    ++VLS R+    +   E+ +G ++ G V
Subjt:  YELAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNA-ELAIGSVVTGTV

Query:  LKLKEYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRL
          LK +G FVD+GG   LL I++IS   + DV A+ K GD ++ +++ ID  KG I LSTK LE + G+++ N   +   A + A R R++L
Subjt:  LKLKEYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRL

P29344 30S ribosomal protein S1, chloroplastic1.9e-14366.42Show/hide
Query:  LALMAHQCT-GLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDP
        +A +A Q   GLRC PL   +S LSKP    H      R  P+++A+    +   QT ER KLKQ F DA +RCRN+PMEGVSFT+  F  +L+KYDF+ 
Subjt:  LALMAHQCT-GLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDP

Query:  QLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVD
        ++G++VKGTV  ++ANGA V+I AKS AYLPL E CI+RIK VEEAGI PG+REEFVIIGENE DDSL LSLR IQYELAWERCRQLQAEDVVVKGK+V 
Subjt:  QLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVD

Query:  ANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEIS
        ANKGGV+ +VEGL+GFVPFS+I   S+AEEL+ KE+PLKF+ V+EEQ+R+V+SNRK MADS A+L IGSVVTGTV  LK YGAF+DIGGI+GLLH+S+IS
Subjt:  ANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEIS

Query:  HDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATP
        HDR+ D+A VL+PGD LKVMIL+ DRE+G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTL+SDGIL P T 
Subjt:  HDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATP

Query:  ELGAK
        +L A+
Subjt:  ELGAK

P46228 30S ribosomal protein S16.0e-7349.16Show/hide
Query:  PMEGVSFTLQHFLASLEKYDFDPQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYE
        P   + FT + F A L++YD+    G  V GTV   E  GA ++I AK+ A+LP+QE  I+R++  EE      +RE F++  ENED  LTLS+R I+Y 
Subjt:  PMEGVSFTLQHFLASLEKYDFDPQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYE

Query:  LAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNA-ELAIGSVVTGTVLK
         AWER RQLQ ED  V+ +V   N+GG LV +EGL+GF+P S I      E+L+ +ELPLKFL V+E++ R+VLS+R+ + +     L +G VV G V  
Subjt:  LAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNA-ELAIGSVVTGTVLK

Query:  LKEYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRL-AQAEAL
        +K YGAF+DIGG+ GLLHISEISHD I    +V    D +KVMI+++D E+G I LSTK+LEP  GDM+ NP +V+EKAEEMA ++R++L  QAE L
Subjt:  LKEYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRL-AQAEAL

P73530 30S ribosomal protein S1 homolog A8.2e-7049.15Show/hide
Query:  VSFTLQHFLASLEKYDFDPQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWE
        + FTL+ F A L+KYD+    G  V GTV   E+ GA ++I AK+ AY+P+QE  I+R+   EE       RE F++  ENED  LTLS+R I+Y  AWE
Subjt:  VSFTLQHFLASLEKYDFDPQLGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWE

Query:  RCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAE-LAIGSVVTGTVLKLKEY
        R RQLQAED  V+  V   N+GG LV +EGL+GF+P S I      E+L+ ++LPLKFL V+EE+ R+VLS+R+ + +     L +  VV G+V  +K Y
Subjt:  RCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAE-LAIGSVVTGTVLKLKEY

Query:  GAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQ-RLAQAEAL
        GAF+DIGG+ GLLHISEISHD I    +V    D +KVMI+++D E+G I LSTK+LEP  G M+ +  LV E A+EMA  FRQ RLA+A+ +
Subjt:  GAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQ-RLAQAEAL

Q93VC7 30S ribosomal protein S1, chloroplastic4.0e-14164.34Show/hide
Query:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ
        +A +A Q +GLRC PL S SSRLS+   ++  QNK+A   P + A ++  + + QT ER +LK+ F DA +RCR SPMEGV+FT+  F A++E+YDF+ +
Subjt:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ

Query:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA
        +G +VKGTV  ++ANGA V+I+AKS AYL +++ CIHRIK VEEAGI PG+ EEFVIIGENE DDSL LSLR IQYELAWERCRQLQAEDV+VK KV+ A
Subjt:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH
        NKGG++ +VEGL+GFVPFS+I   + AEEL+ KE+PLKF+ V+EEQT++VLSNRK +ADS A+L IGSVV G V  LK YGAF+DIGGI+GLLH+S+ISH
Subjt:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH

Query:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE
        DR+ D+A VL+PGD LKVMIL+ DR++G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTL+SDGIL P   E
Subjt:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE

Query:  L
        L
Subjt:  L

Arabidopsis top hitse value%identityAlignment
AT1G71720.1 Nucleic acid-binding proteins superfamily1.2e-2031.93Show/hide
Query:  IIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEIL-MISTAEELINKELPLKFLV----VNEEQTRVVLS
        ++G        LS R     +AW R RQ++  +  ++ K+ + N GG+L  +EGL+ F+P  E++  ++T  EL  + +  +FLV    +NE++  ++LS
Subjt:  IIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEIL-MISTAEELINKELPLKFLV----VNEEQTRVVLS

Query:  NRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIG--GIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGL
         +  +A     L  G+++ GTV+K+  YGA V +G     GLLHIS I+  RI  V+ VL+  + +KV+++        I LS   LE   G  I +   
Subjt:  NRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIG--GIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGL

Query:  VFEKAEEMAHRFRQRL---AQAEALARADLLSFQPEGR
        VF +AEEMA ++R+++   A +    R  + S  P+G+
Subjt:  VFEKAEEMAHRFRQRL---AQAEALARADLLSFQPEGR

AT3G11964.1 RNA binding;RNA binding8.2e-0936.14Show/hide
Query:  ELAIGSVVTGTVLKLKEYGAFVDIG--GIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGD
        +L +G +++G + +++ +G F+DI   G+ GL HIS++S DR+ +V A  K G+ ++  IL +D EK  I L  K      GD
Subjt:  ELAIGSVVTGTVLKLKEYGAFVDIG--GIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGD

AT3G23700.1 Nucleic acid-binding proteins superfamily2.6e-1831.64Show/hide
Query:  WERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEE-----------LINKELPLKFLVVNEEQTRVVLSNRKVMADSNAE-LAIG
        W+  +         +G+V   N GG+L+    L GF+P+ ++    + +E           L+  +LP+K +  +EE  +++LS +  +    ++ + +G
Subjt:  WERCRQLQAEDVVVKGKVVDANKGGVLVVVEGLKGFVPFSEILMISTAEE-----------LINKELPLKFLVVNEEQTRVVLSNRKVMADSNAE-LAIG

Query:  SVVTGTVLKLKEYGAFV----DIGGIH--GLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLE
         V  G V  +++YGAF+    D G  H  GL+H+SE+S D ++DV  VL+ GD ++V++ NID+EK  I LS K+LE
Subjt:  SVVTGTVLKLKEYGAFV----DIGGIH--GLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLE

AT5G14580.1 polyribonucleotide nucleotidyltransferase, putative5.2e-1138.53Show/hide
Query:  LVVNEEQTRVVLSNRKVMADS--------NAELAIGSVVTGTVLKLKEYGAFVDI-GGIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHI
        L ++     +V  N+ VM  +          EL +G V  GTV  +KEYGAFV+  GG  GLLH+SE+SH+ +  V+ VL  G  +  M +  D  +G+I
Subjt:  LVVNEEQTRVVLSNRKVMADS--------NAELAIGSVVTGTVLKLKEYGAFVDI-GGIHGLLHISEISHDRIRDVAAVLKPGDVLKVMILNIDREKGHI

Query:  RLSTKKLEP
        +LS K L P
Subjt:  RLSTKKLEP

AT5G30510.1 ribosomal protein S12.8e-14264.34Show/hide
Query:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ
        +A +A Q +GLRC PL S SSRLS+   ++  QNK+A   P + A ++  + + QT ER +LK+ F DA +RCR SPMEGV+FT+  F A++E+YDF+ +
Subjt:  LALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQ

Query:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA
        +G +VKGTV  ++ANGA V+I+AKS AYL +++ CIHRIK VEEAGI PG+ EEFVIIGENE DDSL LSLR IQYELAWERCRQLQAEDV+VK KV+ A
Subjt:  LGAKVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENE-DDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDA

Query:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH
        NKGG++ +VEGL+GFVPFS+I   + AEEL+ KE+PLKF+ V+EEQT++VLSNRK +ADS A+L IGSVV G V  LK YGAF+DIGGI+GLLH+S+ISH
Subjt:  NKGGVLVVVEGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISH

Query:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE
        DR+ D+A VL+PGD LKVMIL+ DR++G + LSTKKLEP  GDMI NP LVFEKAEEMA  FRQR+AQAEA+ARAD+L FQPE  LTL+SDGIL P   E
Subjt:  DRIRDVAAVLKPGDVLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPE

Query:  L
        L
Subjt:  L


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTTGGCTTTGATGGCTCACCAATGCACAGGGTTGAGATGTGAGCCTCTCTTTTCAATTTCCTCGCGTCTCTCCAAGCCACTTGATCGGAGCCATATGCAG
AATAAGGCGGCCCGTTCATTCCCCGTTTTGGCTGCACTAATATCGAGCCCTATTCCCACTCCTCAGACCACAGAACGTTTCAAGCTCAAGCAGACCTTCATAGAT
GCGGCTGATCGCTGCCGTAACTCTCCCATGGAAGGTGTCTCCTTCACTCTCCAACACTTCCTTGCGTCTCTTGAGAAATACGACTTCGATCCTCAATTGGGAGCC
AAGGTGAAAGGTACTGTGGTCTATTCAGAAGCTAATGGAGCATTTGTGGAGATTGCTGCCAAGTCACCTGCATACTTGCCCTTGCAGGAGACTTGCATTCATAGA
ATAAAACGTGTAGAAGAAGCAGGAATATATCCTGGTTTAAGAGAGGAGTTTGTTATTATAGGTGAGAATGAAGATGATAGCTTGACTTTGAGCTTGAGGCCCATC
CAATATGAACTTGCTTGGGAAAGGTGCAGACAGCTTCAAGCAGAGGATGTTGTTGTCAAGGGTAAGGTGGTTGATGCGAACAAAGGGGGAGTTTTGGTAGTTGTG
GAAGGCCTAAAAGGATTTGTTCCTTTCTCAGAGATATTAATGATATCAACTGCTGAAGAGCTTATCAACAAGGAGCTTCCTCTGAAATTTCTGGTGGTTAATGAG
GAACAAACGAGGGTTGTTCTCAGTAACCGTAAGGTCATGGCTGACAGCAACGCAGAGCTTGCAATTGGATCGGTGGTCACTGGAACAGTTCTAAAACTTAAAGAG
TATGGTGCCTTTGTTGACATCGGTGGAATCCATGGTCTTCTTCACATCAGTGAGATAAGTCATGATCGCATACGAGATGTTGCAGCAGTTCTTAAGCCGGGAGAC
GTTCTCAAGGTCATGATATTGAACATTGATCGTGAAAAAGGCCATATTCGTCTTTCTACCAAGAAGCTAGAGCCTAATACCGGGGACATGATTTGCAATCCAGGG
CTTGTTTTTGAAAAGGCTGAGGAAATGGCACATAGATTTAGGCAAAGATTAGCTCAAGCAGAGGCATTGGCACGTGCAGACTTGCTTAGTTTTCAGCCTGAGGGC
AGATTAACTTTGAACAGTGATGGAATATTGGATCCGGCTACCCCAGAGTTGGGGGCTAAAGAATGA
mRNA sequenceShow/hide mRNA sequence
GAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGTATTTTTATAATATAGTCTAAAATCTATGAAAAAAGTGCATTCGGGGAGTGAAGTTTCAGTCAAAATCAAAA
CTATGAATATTTAAACGAAGTGGGTCTGTTAGCAAGTCTCTGGTACAGCCTGGGAAACGCCGCTGGAGGAAGACGAAGATGCCTTTGGCTTTGATGGCTCACCAA
TGCACAGGGTTGAGATGTGAGCCTCTCTTTTCAATTTCCTCGCGTCTCTCCAAGCCACTTGATCGGAGCCATATGCAGAATAAGGCGGCCCGTTCATTCCCCGTT
TTGGCTGCACTAATATCGAGCCCTATTCCCACTCCTCAGACCACAGAACGTTTCAAGCTCAAGCAGACCTTCATAGATGCGGCTGATCGCTGCCGTAACTCTCCC
ATGGAAGGTGTCTCCTTCACTCTCCAACACTTCCTTGCGTCTCTTGAGAAATACGACTTCGATCCTCAATTGGGAGCCAAGGTGAAAGGTACTGTGGTCTATTCA
GAAGCTAATGGAGCATTTGTGGAGATTGCTGCCAAGTCACCTGCATACTTGCCCTTGCAGGAGACTTGCATTCATAGAATAAAACGTGTAGAAGAAGCAGGAATA
TATCCTGGTTTAAGAGAGGAGTTTGTTATTATAGGTGAGAATGAAGATGATAGCTTGACTTTGAGCTTGAGGCCCATCCAATATGAACTTGCTTGGGAAAGGTGC
AGACAGCTTCAAGCAGAGGATGTTGTTGTCAAGGGTAAGGTGGTTGATGCGAACAAAGGGGGAGTTTTGGTAGTTGTGGAAGGCCTAAAAGGATTTGTTCCTTTC
TCAGAGATATTAATGATATCAACTGCTGAAGAGCTTATCAACAAGGAGCTTCCTCTGAAATTTCTGGTGGTTAATGAGGAACAAACGAGGGTTGTTCTCAGTAAC
CGTAAGGTCATGGCTGACAGCAACGCAGAGCTTGCAATTGGATCGGTGGTCACTGGAACAGTTCTAAAACTTAAAGAGTATGGTGCCTTTGTTGACATCGGTGGA
ATCCATGGTCTTCTTCACATCAGTGAGATAAGTCATGATCGCATACGAGATGTTGCAGCAGTTCTTAAGCCGGGAGACGTTCTCAAGGTCATGATATTGAACATT
GATCGTGAAAAAGGCCATATTCGTCTTTCTACCAAGAAGCTAGAGCCTAATACCGGGGACATGATTTGCAATCCAGGGCTTGTTTTTGAAAAGGCTGAGGAAATG
GCACATAGATTTAGGCAAAGATTAGCTCAAGCAGAGGCATTGGCACGTGCAGACTTGCTTAGTTTTCAGCCTGAGGGCAGATTAACTTTGAACAGTGATGGAATA
TTGGATCCGGCTACCCCAGAGTTGGGGGCTAAAGAATGAAGTCTCAACGAAGCTCTAAATGCATGGTCATGCCTACACGTTAATGCACGTATTCGATGTTGCATT
ATCCAAATAGTTAACAAGAAAATAACGTCAAAGACTGAAATTACATCATTTTTCTAATGTAGGTAGATATTTTGAAAATTTAGGGATAAAAATGATTTTCTTAAA
GAAGTTTGGGATTATGTAGATGTTATAGAAGTTCAGGGATCAAAATATAATAATGTTGAAAGTTGAGGGATGAAATTAAAATTTATATATGCTTATGTTGATTCA
TCTATTTAAGTTGTAACATTGAATGGGTTTTTATATATATATATATAATTAAGTTGTCCAAAATACGCATTTGGTCTTTGAGGTTTGGAGTTGGCATCTATATGG
TCTATGAGGTTTCAAAAAAGGATACTTATGGTTCTTGAGGTTTGGAAAAAAAAGATGTTGCAGACTAAATTCAACTTATGTGACAATTTTTAATGAATTGGCTTT
TTTATTTTATTTTGTTGGGTGAGTTTCTCCCTCCC
Protein sequenceShow/hide protein sequence
MPLALMAHQCTGLRCEPLFSISSRLSKPLDRSHMQNKAARSFPVLAALISSPIPTPQTTERFKLKQTFIDAADRCRNSPMEGVSFTLQHFLASLEKYDFDPQLGA
KVKGTVVYSEANGAFVEIAAKSPAYLPLQETCIHRIKRVEEAGIYPGLREEFVIIGENEDDSLTLSLRPIQYELAWERCRQLQAEDVVVKGKVVDANKGGVLVVV
EGLKGFVPFSEILMISTAEELINKELPLKFLVVNEEQTRVVLSNRKVMADSNAELAIGSVVTGTVLKLKEYGAFVDIGGIHGLLHISEISHDRIRDVAAVLKPGD
VLKVMILNIDREKGHIRLSTKKLEPNTGDMICNPGLVFEKAEEMAHRFRQRLAQAEALARADLLSFQPEGRLTLNSDGILDPATPELGAKE