; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039093 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039093
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description30S ribosomal protein S1
Genome locationchr2:35825855..35834353
RNA-Seq ExpressionLag0039093
SyntenyLag0039093
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0022627 - cytosolic small ribosomal subunit (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR004332 - Transposase, MuDR, plant
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147619.1 30S ribosomal protein S1, chloroplastic [Cucumis sativus]6.3e-15672.54Show/hide
Query:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV
        SMAQQ T LR  PL  S LSKP   +H  NK R   + AAV+S PIP+ QTRERFKLKE FE+A +RCRNAP+EG+ FTL++F AAL K+DFD +LG KV
Subjt:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV

Query:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV
        KGTV   + NGALV+I  KS AYLPL+E CIHRIKHVEEAG+F G REEFVIIG NE DDSLILSLRSI+YDLAWERCRQL+AED +VKGKVVD+NKGGV
Subjt:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV

Query:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD
        + +VEGL+GFVPFS+I   S AEELL+KEL LKF+ VDEEQ+R+VLSNRKA+AD + QL IGSVVTGTV+ L   GA +DIGGI G LH+S+ISHDRI D
Subjt:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD

Query:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP
        IA VL+PGD+LKVMIL HDRE GR+ LSTKKLEP PGDMIRN +LVFEKAEEMAQ FRQRIAQAEALA AD+L FQP+ GLTLTTDGILGP TP+LP
Subjt:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP

XP_008438974.1 PREDICTED: 30S ribosomal protein S1, chloroplastic [Cucumis melo]3.0e-15873.8Show/hide
Query:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV
        SMAQQ T LR VPL  S LSKP   +HL NK R   + AAV+S PIP+ QT+ERFKLKE FE+A +RCRNAP+EG+ FTL++F AAL K+DFD +LG KV
Subjt:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV

Query:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV
        KGTV   + NGALV+I  KS AYLPL+E CIHRIKHVEEAGIF G REEFVIIG NE DDSLILSLRSI+YDLAWERCRQL+AED +VKGKVVD+NKGGV
Subjt:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV

Query:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD
        + +VEGL+GFVPFS+I   STAEELLNKEL LKF+ VDEEQ+R+VLSNRKA+AD + QL IGSVVTGTV+ L   GA +DIGGI G LH+S+ISHDRI D
Subjt:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD

Query:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP
        IA VL+PGDTLKVMIL HDRE GR+ LSTKKLEP PGDMIRN +LVFEKAEEMAQ FRQRIAQAEALA AD+L FQP+ GLTLTTDGILGP TP+LP
Subjt:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP

XP_022138241.1 30S ribosomal protein S1, chloroplastic [Momordica charantia]1.0e-15873.55Show/hide
Query:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV
        SMAQQ T LR  PL  S LS P   RHLQNK R   + AAV+SSPIP+ QT+ERFKLKE FEDA +RCRNAP+EG+ FTL++F AAL K+DFD ++G KV
Subjt:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV

Query:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV
        KGTV   +ANGALV+I  KS AYLP++E CIHRIKHVEEAGIF G REEFVIIG NE DDSL+LSLRSI+YDLAWERCRQL+AED +VKGKVVD+NKGGV
Subjt:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV

Query:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD
        + +VEGL+GFVPFS+I   STAEELLNKEL LKF+ VDEEQ+R+VLSNRKA+AD + QL IGSVVTGTV+ L   GA +DIGGI G LH+S+ISHDRI D
Subjt:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD

Query:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP
        IA VL+PGDTLKVMIL HDRE GR+ LSTKKLEP PGDMIRN +LVFEKAEEMAQ FRQRIAQAEA+A AD+L FQP+ GLTLTTDGILGP TP+LP
Subjt:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP

XP_038878013.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]6.7e-15873.3Show/hide
Query:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV
        SMAQQ T LR VPL  S LSKP   RHLQ+K R   + AAV+S PIP+ QT+ERFKLKE FE+A +RCRNAP+EG+ FTL++F AAL K+DFD +LG KV
Subjt:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV

Query:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV
        KGTV   + NGALV+I  KS AYLP++E CIHRIKHVEEAGIF G REEFVIIG NE DDSLILSLRSI+YDLAWERCRQL+AED +VKGKVVD+NKGGV
Subjt:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV

Query:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD
        + +VEGL+GFVPFS+I   STAEELLNKEL LKF+ VDEEQ+R+VLSNRKA+AD + QL IGSVVTGTV+ L   GA +DIGGI G LH+S+ISHDRI D
Subjt:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD

Query:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP
        I  VL+PGDTLKVMIL HDRE GR+ LSTKKLEP PGDMIRN +LVFEKAEEMAQ FRQRIAQAEA+A AD+L FQP+ GLTLTTDGILGP TP+LP
Subjt:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP

XP_038885297.1 30S ribosomal protein S1, chloroplastic-like [Benincasa hispida]5.3e-16377.33Show/hide
Query:  MAQQCTALRSVPLF--PSCLSKPLGRRHLQNK-LRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGA
        MAQQCT LR  P F   SCLSKPL   H+QN  +R F +VAAV+S PIPT QT ERFKLK+TF DAADRCRNAPMEGV FTLQ+FLA+L K+ FDPQLGA
Subjt:  MAQQCTALRSVPLF--PSCLSKPLGRRHLQNK-LRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGA

Query:  KVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNEDDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGG
        KVKGTVV+ EANGALVEIA KSPAYLPL E CIHRIK VEEAGI+ GFREEFVIIG NEDDSL LSLRSI+Y+LAWERCRQL+AED IVKGKVV +N GG
Subjt:  KVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNEDDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGG

Query:  VLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIR
        VLV+VEGLKGFVP+SEILMISTAEEL+NKEL LKFLVV+EE+TR+VLSNRK +AD K QL IG+VVTGTV RL++ GA VDIGG+ G LHISEISHDRI 
Subjt:  VLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIR

Query:  DIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQL
        DIAAVLKPGD LKVMIL  + E+G IRLSTKKLEPN GDMI N  LVFEKAEEMA RFRQR+AQAEALA ADLLSFQP+G L L++DGIL P TP+L
Subjt:  DIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQL

TrEMBL top hitse value%identityAlignment
A0A1S3AXL6 30S ribosomal protein S1, chloroplastic1.5e-15873.8Show/hide
Query:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV
        SMAQQ T LR VPL  S LSKP   +HL NK R   + AAV+S PIP+ QT+ERFKLKE FE+A +RCRNAP+EG+ FTL++F AAL K+DFD +LG KV
Subjt:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV

Query:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV
        KGTV   + NGALV+I  KS AYLPL+E CIHRIKHVEEAGIF G REEFVIIG NE DDSLILSLRSI+YDLAWERCRQL+AED +VKGKVVD+NKGGV
Subjt:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV

Query:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD
        + +VEGL+GFVPFS+I   STAEELLNKEL LKF+ VDEEQ+R+VLSNRKA+AD + QL IGSVVTGTV+ L   GA +DIGGI G LH+S+ISHDRI D
Subjt:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD

Query:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP
        IA VL+PGDTLKVMIL HDRE GR+ LSTKKLEP PGDMIRN +LVFEKAEEMAQ FRQRIAQAEALA AD+L FQP+ GLTLTTDGILGP TP+LP
Subjt:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP

A0A5A7UEP7 30S ribosomal protein S12.2e-15469.43Show/hide
Query:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAK-
        SMAQQ T LR VPL  S LSKP   +HL NK R   + AAV+S PIP+ QT+ERFKLKE FE+A +RCRNAP+EG+ FTL++F AAL K+DFD +LG K 
Subjt:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAK-

Query:  ------------------------VKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAW
                                VKGTV   + NGALV+I  KS AYLPL+E CIHRIKHVEEAGIF G REEFVIIG NE DDSLILSLRSI+YDLAW
Subjt:  ------------------------VKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAW

Query:  ERCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQT
        ERCRQL+AED +VKGKVVD+NKGGV+ +VEGL+GFVPFS+I   STAEELLNKEL LKF+ VDEEQ+R+VLSNRKA+AD + QL IGSVVTGTV+ L   
Subjt:  ERCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQT

Query:  GALVDIGGICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSF
        GA +DIGGI G LH+S+ISHDRI DIA VL+PGDTLKVMIL HDRE GR+ LSTKKLEP PGDMIRN +LVFEKAEEMAQ FRQRIAQAEALA AD+L F
Subjt:  GALVDIGGICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSF

Query:  QPQGGLTLTTDGILGPFTPQLP
        QP+ GLTLTTDGILGP TP+LP
Subjt:  QPQGGLTLTTDGILGPFTPQLP

A0A6J1C328 uncharacterized protein LOC1110069943.2e-15347.12Show/hide
Query:  MTFTEFQHCIMQKLGSLGNLDSPNIFVSIGSRNFIKKDAQVSEDKDVRWLFGIVSNNVEQYCVVIVDSNNNLSVILDNIPPTNS----NELVGEGRFFHT
        M++      IM+ LG +G+ D P++F  +G+  FIKKD ++S+DKDV WL+ I+SN   Q C ++VD  N LS ILD +P   S    N +   G+F+  
Subjt:  MTFTEFQHCIMQKLGSLGNLDSPNIFVSIGSRNFIKKDAQVSEDKDVRWLFGIVSNNVEQYCVVIVDSNNNLSVILDNIPPTNS----NELVGEGRFFHT

Query:  IDVSKMSSNFEICVDDMFSSKIVLQNAIRSIAIRDNFQFKTVKSNRDFLVVQCVVEDCEWFLRASRFGDDGSATWVVKRFDNDHTCSIDVVLTDHKQATF
        IDV+ +S+ F I V+D F  K  LQNA+RS+AIR NF F+TVKSNR+ L+V C+  +C+WFL AS+FGD GS  W+VK+F ++HTCS+++VL DH+QATF
Subjt:  IDVSKMSSNFEICVDDMFSSKIVLQNAIRSIAIRDNFQFKTVKSNRDFLVVQCVVEDCEWFLRASRFGDDGSATWVVKRFDNDHTCSIDVVLTDHKQATF

Query:  TFIKDCIKRKISIAASELPTPKDIISFIRSEYGLHISYQKAWRAREAALNEIRGSPEDSYKMIPSFAHM----------------------------QCI
        + IK+ IK +I+   ++LP+ KD IS +  E  + I+YQKA  ARE A+ EIRGSPE SY +IP F HM                              I
Subjt:  TFIKDCIKRKISIAASELPTPKDIISFIRSEYGLHISYQKAWRAREAALNEIRGSPEDSYKMIPSFAHM----------------------------QCI

Query:  SGWKHCRPIISVDGTQMKNKFAGTLITASTPDANDQIFPLVFSVVDSEN-------------------------DRHKSIGKAIHDVLPDALHCICMVHL
        SGWK+C P+ISVDGT MKNK+AGTLI+A T DAN QIFPL FSV DSEN                         DRHKSIGK+   V   A HCIC  HL
Subjt:  SGWKHCRPIISVDGTQMKNKFAGTLITASTPDANDQIFPLVFSVVDSEN-------------------------DRHKSIGKAIHDVLPDALHCICMVHL

Query:  LRNLKLKYKEKLVDNIFYACAKAFNVVDFEFQMRQMEQDAR--------------------------------ESLNAAVKEARELPIASMLEVLRMMLQ
         +NLKLKYK+K+ DN+F+ CAKA+NV DFE  MR ++   R                                ESLNAA+K+ARELPI SMLEV+RMMLQ
Subjt:  LRNLKLKYKEKLVDNIFYACAKAFNVVDFEFQMRQMEQDAR--------------------------------ESLNAAVKEARELPIASMLEVLRMMLQ

Query:  RWFHDRRNETAFQVTDFTKNTEKHIRDQIAMGRSMQPEDAVVGCGTFLEIPCSHACAVLTWKHLSMKEYISNFYLNSTLSSIYSGIIHPLGNESSWHIPD
        RWF++R+N   FQ+T+FTK+ EK +R+QI+ GR+M             ++  +H       KHL  K Y+S +Y N+ L S YSG IHPLG++SSW+IP+
Subjt:  RWFHDRRNETAFQVTDFTKNTEKHIRDQIAMGRSMQPEDAVVGCGTFLEIPCSHACAVLTWKHLSMKEYISNFYLNSTLSSIYSGIIHPLGNESSWHIPD

Query:  DIKNISVLPPNVKRSVGRPKKTRI
        D+K I +LPPNVKR  GRPKK RI
Subjt:  DIKNISVLPPNVKRSVGRPKKTRI

A0A6J1C966 30S ribosomal protein S1, chloroplastic5.0e-15973.55Show/hide
Query:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV
        SMAQQ T LR  PL  S LS P   RHLQNK R   + AAV+SSPIP+ QT+ERFKLKE FEDA +RCRNAP+EG+ FTL++F AAL K+DFD ++G KV
Subjt:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV

Query:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV
        KGTV   +ANGALV+I  KS AYLP++E CIHRIKHVEEAGIF G REEFVIIG NE DDSL+LSLRSI+YDLAWERCRQL+AED +VKGKVVD+NKGGV
Subjt:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV

Query:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD
        + +VEGL+GFVPFS+I   STAEELLNKEL LKF+ VDEEQ+R+VLSNRKA+AD + QL IGSVVTGTV+ L   GA +DIGGI G LH+S+ISHDRI D
Subjt:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD

Query:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP
        IA VL+PGDTLKVMIL HDRE GR+ LSTKKLEP PGDMIRN +LVFEKAEEMAQ FRQRIAQAEA+A AD+L FQP+ GLTLTTDGILGP TP+LP
Subjt:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP

A0A6J1F6H4 30S ribosomal protein S1, chloroplastic-like2.4e-15370.03Show/hide
Query:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV
        SMAQQ   LR  PL  S LSKP   RHLQN+ R   + AAV++SPIP+ Q +ERFKLKE FE+A +RCRNAP+EG+ FT+++F +A+ K+DF+ ++G KV
Subjt:  SMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGAKV

Query:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV
        KGTV   +ANGALV+I  KS AYLPL+E CIHRIKHVEEAGI+ G R+EFVIIG NE DDSL+LSLRSI+YDLAWERCRQL+AED +VKGKVVD+NKGGV
Subjt:  KGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGV

Query:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD
        + +VEGL+GFVPFS+I   STAEELL KE+ LKF+ VDEEQ+R+VLSNRKA+AD + QL IGSVV GTV+ L   GA +DIGG+ G LH+S+ISHDRI D
Subjt:  LVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRD

Query:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP
        IA VL+PGDTLKVMIL HDRE GR+ LSTKKLEP PGDMIRN +LVFEKAEEMAQ FRQRIA AEA+A AD+L FQP+ GLTLTTDGILGP TP+LP
Subjt:  IAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP

SwissProt top hitse value%identityAlignment
P29344 30S ribosomal protein S1, chloroplastic3.5e-14166.92Show/hide
Query:  SMAQQCT-ALRSVPLFPSCLSKPLGRRHLQNKLRP-FSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGA
        S+AQQ    LR  PL  S LSKP   +H    L+P FS + + V+  +  AQTRER KLK+ FEDA +RCRNAPMEGV FT+ +F  AL K+DF+ ++G+
Subjt:  SMAQQCT-ALRSVPLFPSCLSKPLGRRHLQNKLRP-FSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLGA

Query:  KVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKG
        +VKGTV   +ANGALV+I  KS AYLPL E CI+RIK+VEEAGI  G REEFVIIG NE DDSLILSLR I+Y+LAWERCRQL+AED +VKGK+V +NKG
Subjt:  KVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKG

Query:  GVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRI
        GV+ +VEGL+GFVPFS+I   S+AEELL KE+ LKF+ VDEEQ+R+V+SNRKA+AD + QL IGSVVTGTV+ L   GA +DIGGI G LH+S+ISHDR+
Subjt:  GVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRI

Query:  RDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP
         DIA VL+PGDTLKVMIL HDRE GR+ LSTKKLEP PGDMIRN +LVFEKAEEMAQ FRQRIAQAEA+A AD+L FQP+ GLTL++DGILGP T  LP
Subjt:  RDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP

P46228 30S ribosomal protein S15.5e-7047.37Show/hide
Query:  RNAPMEGVPFTLQEFLAALVKHDFDPQLGAKVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNEDDSLILSLRSI
        ++ P   + FT ++F A L ++D+    G  V GTV ++E  GAL++I  K+ A+LP++E  I+R++  EE    +  RE F++   NED  L LS+R I
Subjt:  RNAPMEGVPFTLQEFLAALVKHDFDPQLGAKVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNEDDSLILSLRSI

Query:  EYDLAWERCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDK-PQLQIGSVVTGT
        EY  AWER RQL+ EDA V+ +V  +N+GG LV +EGL+GF+P S I      E+L+ +EL LKFL VDE++ R+VLS+R+A+ + K  +L++G VV G 
Subjt:  EYDLAWERCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDK-PQLQIGSVVTGT

Query:  VRRLLQTGALVDIGGICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRI-AQAEAL
        VR +   GA +DIGG+ G LHISEISHD I    +V    D +KVMI+  D E GRI LSTK+LEP PGDM+RN E+V+EKAEEMA ++R+++  QAE L
Subjt:  VRRLLQTGALVDIGGICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRI-AQAEAL

Query:  AYAD
           +
Subjt:  AYAD

P73530 30S ribosomal protein S1 homolog A2.0e-6748.14Show/hide
Query:  VPFTLQEFLAALVKHDFDPQLGAKVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNEDDSLILSLRSIEYDLAWE
        + FTL++F A L K+D+    G  V GTV  +E+ GAL++I  K+ AY+P++E  I+R+   EE       RE F++   NED  L LS+R IEY  AWE
Subjt:  VPFTLQEFLAALVKHDFDPQLGAKVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNEDDSLILSLRSIEYDLAWE

Query:  RCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQ-LQIGSVVTGTVRRLLQT
        R RQL+AEDA V+  V  +N+GG LV +EGL+GF+P S I      E+L+ ++L LKFL VDEE+ R+VLS+R+A+ + K   L++  VV G+VR +   
Subjt:  RCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQ-LQIGSVVTGTVRRLLQT

Query:  GALVDIGGICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQ-RIAQAEALAY
        GA +DIGG+ G LHISEISHD I    +V    D +KVMI+  D E GRI LSTK+LEP PG M+++ +LV E A+EMA+ FRQ R+A+A+ + Y
Subjt:  GALVDIGGICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQ-RIAQAEALAY

Q1XDE2 30S ribosomal protein S1, chloroplastic4.5e-4036.15Show/hide
Query:  FTLQEFLAALVKHDFDPQLGAKVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFR----EEFVIIGMN-EDDSLILSLRSIEYDL
        FT + F A L K+ +D  LG  V GT+   E NG LV+I T   AYLP++E     +   ++   FT        EF ++  N +   LILS+R +EY  
Subjt:  FTLQEFLAALVKHDFDPQLGAKVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFR----EEFVIIGMN-EDDSLILSLRSIEYDL

Query:  AWERCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKA-IADDKPQLQIGSVVTGTVRRL
        AW+R RQL AED+++   +   NKGG+++ +EG+ GFVP S +     +E+  NK ++LK L V+E+   ++LS+R+A I+     L +G+++ G + ++
Subjt:  AWERCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKA-IADDKPQLQIGSVVTGTVRRL

Query:  LQTGALVDIGGICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKL
           G  + +G + G +HISEI+   +  I++  K GDT+K +I+  D+++GR+ LS K L
Subjt:  LQTGALVDIGGICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKL

Q93VC7 30S ribosomal protein S1, chloroplastic1.3e-13563.37Show/hide
Query:  SMAQQCTALRSVPLFPSC-LSKPLGRRHLQNKLRPFS--IVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLG
        S+AQQ + LR  PL  S  LS+   +   QNK    S  IVAAV  S   + QT+ER +LK+ FEDA +RCR +PMEGV FT+ +F AA+ ++DF+ ++G
Subjt:  SMAQQCTALRSVPLFPSC-LSKPLGRRHLQNKLRPFS--IVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLG

Query:  AKVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNK
         +VKGTV   +ANGALV+I+ KS AYL + + CIHRIKHVEEAGI  G  EEFVIIG NE DDSL+LSLR+I+Y+LAWERCRQL+AED IVK KV+ +NK
Subjt:  AKVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNK

Query:  GGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDR
        GG++ +VEGL+GFVPFS+I   + AEELL KE+ LKF+ VDEEQT++VLSNRKA+AD + QL IGSVV G V+ L   GA +DIGGI G LH+S+ISHDR
Subjt:  GGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDR

Query:  IRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP
        + DIA VL+PGDTLKVMIL HDR+ GR+ LSTKKLEP PGDMIRN +LVFEKAEEMAQ FRQRIAQAEA+A AD+L FQP+ GLTL++DGILGP   +LP
Subjt:  IRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP

Query:  FSSV
           V
Subjt:  FSSV

Arabidopsis top hitse value%identityAlignment
AT1G64260.1 MuDR family transposase8.6e-1020.35Show/hide
Query:  FSSKIVLQNAIRSIAIRDNFQFKTVKSNRDFLVVQCVVEDCEWFLRASRFGDDGSATWVVKRFDNDHTCSIDVVLTDHKQATFTFIKDCIKRKISIAASE
        F  +  L+ A+    IR        ++ ++    +CV   C+W LRA+R  + G     + ++   HTCS +       +     I+  ++ + +++ +E
Subjt:  FSSKIVLQNAIRSIAIRDNFQFKTVKSNRDFLVVQCVVEDCEWFLRASRFGDDGSATWVVKRFDNDHTCSIDVVLTDHKQATFTFIKDCIKRKISIAASE

Query:  LPTPKDIISFIRSEYGLHISYQKAWRAREAALNEIRGSPEDSYKMIPS--------------------------------FAHMQCISGWKHCRPIISVD
        L        + + + G  +   K    +   +  + G  + S++++P                                 ++  Q I G++HCRP+I VD
Subjt:  LPTPKDIISFIRSEYGLHISYQKAWRAREAALNEIRGSPEDSYKMIPS--------------------------------FAHMQCISGWKHCRPIISVD

Query:  GTQMKNKFAGTLITASTPDANDQIFPLVFSV
           +  K+   L+ AS  DA ++ FPL F+V
Subjt:  GTQMKNKFAGTLITASTPDANDQIFPLVFSV

AT1G71720.1 Nucleic acid-binding proteins superfamily1.5e-2231.78Show/hide
Query:  IIGMNEDDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLV----VDEEQTRVVLSN
        ++G       +LS R     +AW R RQ+K  +  ++ K+ + N GG+L  +EGL+ F+P  E++        L + +  +FLV    ++E++  ++LS 
Subjt:  IIGMNEDDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLV----VDEEQTRVVLSN

Query:  RKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIG--GICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELV
        +  +A +K  L+ G+++ GTV ++L  GA V +G     G LHIS I+  RI  ++ VL+  +++KV+++K      +I LS   LE  PG  I + E V
Subjt:  RKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIG--GICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELV

Query:  FEKAEEMAQRFRQRIAQAEALAYAD---LLSFQPQG
        F +AEEMA+++R+++        +D   + S  PQG
Subjt:  FEKAEEMAQRFRQRIAQAEALAYAD---LLSFQPQG

AT3G11964.1 RNA binding;RNA binding1.6e-0830.37Show/hide
Query:  KAIADDKPQLQIGSVVTGTVRRLLQTGALVDIG--GICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVF
        K+ + D  +L +G +++G +RR+   G  +DI   G+ G  HIS++S DR+ ++ A  K G++++  ILK D E+ RI L  K      GD  +   L  
Subjt:  KAIADDKPQLQIGSVVTGTVRRLLQTGALVDIG--GICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVF

Query:  EKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLT
        +              ++E LA  D   FQ   G T
Subjt:  EKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLT

AT3G23700.1 Nucleic acid-binding proteins superfamily1.5e-1427.78Show/hide
Query:  WERCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEE-----------LLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQ-LQIG
        W+  +         +G+V   N GG+L+    L GF+P+ ++    + +E           L+  +L +K +  DEE  +++LS + A+     Q + +G
Subjt:  WERCRQLKAEDAIVKGKVVDSNKGGVLVIVEGLKGFVPFSEILMISTAEE-----------LLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQ-LQIG

Query:  SVVTGTVRRLLQTGALVDIG------GICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNP
         V  G V  +   GA + +        + G +H+SE+S D ++D+  VL+ GD ++V++   D+E+ RI LS K+LE +P
Subjt:  SVVTGTVRRLLQTGALVDIG------GICGFLHISEISHDRIRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNP

AT5G30510.1 ribosomal protein S19.1e-13763.37Show/hide
Query:  SMAQQCTALRSVPLFPSC-LSKPLGRRHLQNKLRPFS--IVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLG
        S+AQQ + LR  PL  S  LS+   +   QNK    S  IVAAV  S   + QT+ER +LK+ FEDA +RCR +PMEGV FT+ +F AA+ ++DF+ ++G
Subjt:  SMAQQCTALRSVPLFPSC-LSKPLGRRHLQNKLRPFS--IVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQLG

Query:  AKVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNK
         +VKGTV   +ANGALV+I+ KS AYL + + CIHRIKHVEEAGI  G  EEFVIIG NE DDSL+LSLR+I+Y+LAWERCRQL+AED IVK KV+ +NK
Subjt:  AKVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNE-DDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNK

Query:  GGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDR
        GG++ +VEGL+GFVPFS+I   + AEELL KE+ LKF+ VDEEQT++VLSNRKA+AD + QL IGSVV G V+ L   GA +DIGGI G LH+S+ISHDR
Subjt:  GGVLVIVEGLKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDR

Query:  IRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP
        + DIA VL+PGDTLKVMIL HDR+ GR+ LSTKKLEP PGDMIRN +LVFEKAEEMAQ FRQRIAQAEA+A AD+L FQP+ GLTL++DGILGP   +LP
Subjt:  IRDIAAVLKPGDTLKVMILKHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLP

Query:  FSSV
           V
Subjt:  FSSV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGAGCCTGGGAAACGCCGCGGGAGGAAGATGAAGATTCCTTCGATGGCTCAGCAATGCACAGCGTTGAGATCAGTGCCTCTGTTCCCATCGTGTCTTTCGAAGCC
ATTGGGTCGGAGGCATTTGCAGAACAAACTCCGTCCATTCTCCATTGTTGCTGCAGTTGTATCAAGCCCTATTCCCACTGCTCAGACCAGAGAGCGTTTCAAGCTCAAGG
AAACCTTCGAGGATGCGGCCGATCGCTGCCGTAATGCTCCCATGGAAGGCGTCCCCTTCACTCTCCAAGAATTCCTCGCCGCTCTCGTGAAACACGACTTCGATCCTCAA
CTGGGAGCCAAGGTGAAAGGTACTGTGGTCCATGTGGAAGCTAATGGAGCACTTGTTGAGATCGCTACCAAGTCACCTGCATATTTGCCATTGCGGGAGACTTGCATTCA
CAGAATAAAACATGTAGAAGAAGCAGGAATATTTACTGGTTTTAGAGAGGAGTTTGTTATTATAGGTATGAATGAAGATGATAGCTTGATTTTGAGCTTGAGGTCCATCG
AATATGACCTGGCTTGGGAGAGATGCAGACAGCTTAAAGCAGAGGATGCTATTGTCAAGGGTAAGGTGGTTGATTCGAACAAAGGCGGAGTTTTGGTAATTGTGGAAGGT
CTAAAAGGGTTTGTGCCTTTCTCAGAGATATTAATGATATCAACTGCTGAAGAGCTTCTCAACAAGGAACTTCGTCTGAAATTTCTGGTGGTTGATGAGGAACAAACGAG
GGTTGTCCTTAGTAACCGTAAGGCCATAGCGGATGACAAGCCACAACTTCAAATTGGATCAGTGGTCACTGGAACAGTTCGAAGACTTCTACAGACTGGTGCCCTTGTTG
ACATTGGTGGAATCTGTGGTTTTCTTCACATCAGTGAGATAAGCCATGATCGCATACGAGATATTGCAGCGGTTCTTAAGCCTGGAGACACTCTCAAGGTCATGATATTG
AAACATGATCGTGAGGAAGGCCGTATCCGTCTTTCTACCAAGAAGTTAGAGCCGAATCCTGGGGACATGATTCGCAATTCAGAGCTTGTTTTTGAGAAGGCTGAGGAAAT
GGCACAAAGATTTAGGCAAAGAATAGCTCAAGCAGAGGCATTGGCATATGCAGACTTGCTTAGTTTTCAGCCTCAGGGTGGATTAACTTTGACTACAGATGGAATACTGG
GTCCATTTACCCCACAGTTGCCATTTTCCAGTGTTTTCAGACTGCATCGAGTGAGCCTCTACGTTCCGTTTTCTAGTCGTTTGAAGATGGGTAGCGTAATTGTTCAAATG
CTGCACGATGGTCATTGGAACGATAGTGAGAATTATGTGGACTACAAAGTATCACAGCTGTTATTAAACGATGGGATGACTTTTACTGAGTTTCAGCATTGTATCATGCA
AAAACTAGGTAGTTTAGGTAATTTGGATTCCCCCAATATATTTGTTAGCATAGGATCCCGAAATTTTATAAAGAAGGATGCACAAGTATCTGAAGATAAGGACGTAAGAT
GGTTGTTTGGAATAGTCTCAAACAATGTTGAACAGTACTGTGTTGTAATTGTTGATTCAAATAACAACTTATCAGTTATTTTGGACAATATCCCCCCTACAAACTCTAAT
GAATTGGTCGGTGAAGGACGTTTTTTCCACACAATCGATGTTTCAAAGATGTCTTCCAATTTTGAGATTTGTGTGGACGACATGTTCTCTTCCAAGATTGTGTTACAGAA
TGCGATTCGATCAATTGCCATCCGAGACAACTTCCAATTCAAAACTGTGAAATCTAACCGTGATTTTTTGGTCGTGCAATGTGTTGTCGAGGATTGTGAGTGGTTCCTTA
GAGCATCTCGGTTCGGAGATGATGGTAGTGCCACTTGGGTTGTCAAGAGATTTGACAATGATCATACATGTTCAATTGATGTTGTGTTGACTGACCACAAGCAAGCGACA
TTTACATTCATTAAGGATTGTATTAAACGGAAGATTAGCATAGCGGCCAGTGAATTACCCACTCCTAAAGACATCATATCATTTATTCGATCAGAATATGGTTTGCATAT
TAGCTATCAGAAAGCTTGGCGTGCTCGTGAAGCTGCATTAAATGAGATTAGAGGATCTCCAGAAGACTCATACAAAATGATCCCATCGTTTGCCCATATGCAATGCATTT
CTGGTTGGAAGCATTGTCGTCCAATTATTTCTGTAGATGGTACACAAATGAAAAACAAGTTTGCCGGCACTCTGATAACAGCTTCAACTCCTGATGCGAACGATCAAATA
TTCCCTTTAGTATTTTCTGTTGTTGATTCAGAGAACGACAGGCATAAGAGCATTGGCAAAGCAATTCACGACGTATTGCCTGATGCGTTACATTGCATATGCATGGTTCA
TTTGTTGAGGAACCTGAAATTGAAGTATAAAGAAAAGCTTGTCGACAATATATTTTATGCTTGTGCAAAAGCTTTTAACGTCGTGGACTTTGAATTTCAAATGCGTCAAA
TGGAGCAAGATGCAAGGGAGAGCCTAAATGCTGCCGTTAAAGAAGCAAGGGAATTACCTATTGCATCCATGCTAGAGGTTTTAAGGATGATGTTGCAAAGGTGGTTTCAT
GATAGAAGGAACGAGACAGCTTTTCAAGTAACTGATTTTACCAAAAATACAGAAAAACATATAAGAGACCAAATTGCAATGGGTCGTTCGATGCAGCCAGAAGATGCAGT
TGTAGGATGTGGGACATTTTTGGAGATTCCATGTTCTCATGCCTGTGCCGTTTTGACTTGGAAGCACTTGTCTATGAAGGAGTATATATCAAATTTCTACTTGAATAGTA
CTCTATCTTCAATATACAGTGGTATAATCCATCCATTAGGCAATGAGTCTAGCTGGCATATCCCCGATGACATAAAAAATATATCAGTTCTCCCCCCAAACGTTAAGCGT
TCTGTTGGTAGGCCAAAGAAGACCAGAATCCCCTCACAGATGGAGTTCAAAAGACGGGTTAAATGTGGTCGTTGTGACAGGATTGGGCATAATAAGAAGACTTGCAGATT
TGCTCTAACTCAGGGTTATTTTTGTATTTCGGTTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGAGCCTGGGAAACGCCGCGGGAGGAAGATGAAGATTCCTTCGATGGCTCAGCAATGCACAGCGTTGAGATCAGTGCCTCTGTTCCCATCGTGTCTTTCGAAGCC
ATTGGGTCGGAGGCATTTGCAGAACAAACTCCGTCCATTCTCCATTGTTGCTGCAGTTGTATCAAGCCCTATTCCCACTGCTCAGACCAGAGAGCGTTTCAAGCTCAAGG
AAACCTTCGAGGATGCGGCCGATCGCTGCCGTAATGCTCCCATGGAAGGCGTCCCCTTCACTCTCCAAGAATTCCTCGCCGCTCTCGTGAAACACGACTTCGATCCTCAA
CTGGGAGCCAAGGTGAAAGGTACTGTGGTCCATGTGGAAGCTAATGGAGCACTTGTTGAGATCGCTACCAAGTCACCTGCATATTTGCCATTGCGGGAGACTTGCATTCA
CAGAATAAAACATGTAGAAGAAGCAGGAATATTTACTGGTTTTAGAGAGGAGTTTGTTATTATAGGTATGAATGAAGATGATAGCTTGATTTTGAGCTTGAGGTCCATCG
AATATGACCTGGCTTGGGAGAGATGCAGACAGCTTAAAGCAGAGGATGCTATTGTCAAGGGTAAGGTGGTTGATTCGAACAAAGGCGGAGTTTTGGTAATTGTGGAAGGT
CTAAAAGGGTTTGTGCCTTTCTCAGAGATATTAATGATATCAACTGCTGAAGAGCTTCTCAACAAGGAACTTCGTCTGAAATTTCTGGTGGTTGATGAGGAACAAACGAG
GGTTGTCCTTAGTAACCGTAAGGCCATAGCGGATGACAAGCCACAACTTCAAATTGGATCAGTGGTCACTGGAACAGTTCGAAGACTTCTACAGACTGGTGCCCTTGTTG
ACATTGGTGGAATCTGTGGTTTTCTTCACATCAGTGAGATAAGCCATGATCGCATACGAGATATTGCAGCGGTTCTTAAGCCTGGAGACACTCTCAAGGTCATGATATTG
AAACATGATCGTGAGGAAGGCCGTATCCGTCTTTCTACCAAGAAGTTAGAGCCGAATCCTGGGGACATGATTCGCAATTCAGAGCTTGTTTTTGAGAAGGCTGAGGAAAT
GGCACAAAGATTTAGGCAAAGAATAGCTCAAGCAGAGGCATTGGCATATGCAGACTTGCTTAGTTTTCAGCCTCAGGGTGGATTAACTTTGACTACAGATGGAATACTGG
GTCCATTTACCCCACAGTTGCCATTTTCCAGTGTTTTCAGACTGCATCGAGTGAGCCTCTACGTTCCGTTTTCTAGTCGTTTGAAGATGGGTAGCGTAATTGTTCAAATG
CTGCACGATGGTCATTGGAACGATAGTGAGAATTATGTGGACTACAAAGTATCACAGCTGTTATTAAACGATGGGATGACTTTTACTGAGTTTCAGCATTGTATCATGCA
AAAACTAGGTAGTTTAGGTAATTTGGATTCCCCCAATATATTTGTTAGCATAGGATCCCGAAATTTTATAAAGAAGGATGCACAAGTATCTGAAGATAAGGACGTAAGAT
GGTTGTTTGGAATAGTCTCAAACAATGTTGAACAGTACTGTGTTGTAATTGTTGATTCAAATAACAACTTATCAGTTATTTTGGACAATATCCCCCCTACAAACTCTAAT
GAATTGGTCGGTGAAGGACGTTTTTTCCACACAATCGATGTTTCAAAGATGTCTTCCAATTTTGAGATTTGTGTGGACGACATGTTCTCTTCCAAGATTGTGTTACAGAA
TGCGATTCGATCAATTGCCATCCGAGACAACTTCCAATTCAAAACTGTGAAATCTAACCGTGATTTTTTGGTCGTGCAATGTGTTGTCGAGGATTGTGAGTGGTTCCTTA
GAGCATCTCGGTTCGGAGATGATGGTAGTGCCACTTGGGTTGTCAAGAGATTTGACAATGATCATACATGTTCAATTGATGTTGTGTTGACTGACCACAAGCAAGCGACA
TTTACATTCATTAAGGATTGTATTAAACGGAAGATTAGCATAGCGGCCAGTGAATTACCCACTCCTAAAGACATCATATCATTTATTCGATCAGAATATGGTTTGCATAT
TAGCTATCAGAAAGCTTGGCGTGCTCGTGAAGCTGCATTAAATGAGATTAGAGGATCTCCAGAAGACTCATACAAAATGATCCCATCGTTTGCCCATATGCAATGCATTT
CTGGTTGGAAGCATTGTCGTCCAATTATTTCTGTAGATGGTACACAAATGAAAAACAAGTTTGCCGGCACTCTGATAACAGCTTCAACTCCTGATGCGAACGATCAAATA
TTCCCTTTAGTATTTTCTGTTGTTGATTCAGAGAACGACAGGCATAAGAGCATTGGCAAAGCAATTCACGACGTATTGCCTGATGCGTTACATTGCATATGCATGGTTCA
TTTGTTGAGGAACCTGAAATTGAAGTATAAAGAAAAGCTTGTCGACAATATATTTTATGCTTGTGCAAAAGCTTTTAACGTCGTGGACTTTGAATTTCAAATGCGTCAAA
TGGAGCAAGATGCAAGGGAGAGCCTAAATGCTGCCGTTAAAGAAGCAAGGGAATTACCTATTGCATCCATGCTAGAGGTTTTAAGGATGATGTTGCAAAGGTGGTTTCAT
GATAGAAGGAACGAGACAGCTTTTCAAGTAACTGATTTTACCAAAAATACAGAAAAACATATAAGAGACCAAATTGCAATGGGTCGTTCGATGCAGCCAGAAGATGCAGT
TGTAGGATGTGGGACATTTTTGGAGATTCCATGTTCTCATGCCTGTGCCGTTTTGACTTGGAAGCACTTGTCTATGAAGGAGTATATATCAAATTTCTACTTGAATAGTA
CTCTATCTTCAATATACAGTGGTATAATCCATCCATTAGGCAATGAGTCTAGCTGGCATATCCCCGATGACATAAAAAATATATCAGTTCTCCCCCCAAACGTTAAGCGT
TCTGTTGGTAGGCCAAAGAAGACCAGAATCCCCTCACAGATGGAGTTCAAAAGACGGGTTAAATGTGGTCGTTGTGACAGGATTGGGCATAATAAGAAGACTTGCAGATT
TGCTCTAACTCAGGGTTATTTTTGTATTTCGGTTTTTTAA
Protein sequenceShow/hide protein sequence
MVEPGKRRGRKMKIPSMAQQCTALRSVPLFPSCLSKPLGRRHLQNKLRPFSIVAAVVSSPIPTAQTRERFKLKETFEDAADRCRNAPMEGVPFTLQEFLAALVKHDFDPQ
LGAKVKGTVVHVEANGALVEIATKSPAYLPLRETCIHRIKHVEEAGIFTGFREEFVIIGMNEDDSLILSLRSIEYDLAWERCRQLKAEDAIVKGKVVDSNKGGVLVIVEG
LKGFVPFSEILMISTAEELLNKELRLKFLVVDEEQTRVVLSNRKAIADDKPQLQIGSVVTGTVRRLLQTGALVDIGGICGFLHISEISHDRIRDIAAVLKPGDTLKVMIL
KHDREEGRIRLSTKKLEPNPGDMIRNSELVFEKAEEMAQRFRQRIAQAEALAYADLLSFQPQGGLTLTTDGILGPFTPQLPFSSVFRLHRVSLYVPFSSRLKMGSVIVQM
LHDGHWNDSENYVDYKVSQLLLNDGMTFTEFQHCIMQKLGSLGNLDSPNIFVSIGSRNFIKKDAQVSEDKDVRWLFGIVSNNVEQYCVVIVDSNNNLSVILDNIPPTNSN
ELVGEGRFFHTIDVSKMSSNFEICVDDMFSSKIVLQNAIRSIAIRDNFQFKTVKSNRDFLVVQCVVEDCEWFLRASRFGDDGSATWVVKRFDNDHTCSIDVVLTDHKQAT
FTFIKDCIKRKISIAASELPTPKDIISFIRSEYGLHISYQKAWRAREAALNEIRGSPEDSYKMIPSFAHMQCISGWKHCRPIISVDGTQMKNKFAGTLITASTPDANDQI
FPLVFSVVDSENDRHKSIGKAIHDVLPDALHCICMVHLLRNLKLKYKEKLVDNIFYACAKAFNVVDFEFQMRQMEQDARESLNAAVKEARELPIASMLEVLRMMLQRWFH
DRRNETAFQVTDFTKNTEKHIRDQIAMGRSMQPEDAVVGCGTFLEIPCSHACAVLTWKHLSMKEYISNFYLNSTLSSIYSGIIHPLGNESSWHIPDDIKNISVLPPNVKR
SVGRPKKTRIPSQMEFKRRVKCGRCDRIGHNKKTCRFALTQGYFCISVF