; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G001830 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G001830
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description40S ribosomal protein S23
Genome locationCmo_Chr14:833291..842158
RNA-Seq ExpressionCmoCh14G001830
SyntenyCmoCh14G001830
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0015935 - small ribosomal subunit (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR005680 - Ribosomal protein S23, eukaryotic/archaeal
IPR006032 - Ribosomal protein S12/S23
IPR012340 - Nucleic acid-binding, OB-fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580587.1 hypothetical protein SDJN03_20589, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0096.27Show/hide
Query:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
        MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDS RIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
Subjt:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS

Query:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
        LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
Subjt:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP

Query:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
                        SASRKQTHGLQGPGRT+MQT SQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
Subjt:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI

Query:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
        PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVG KVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
Subjt:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ

Query:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN
        RSNSTR SLHSVSSKRISIDSDTSNDGGNHLVGPNT TTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN
Subjt:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN

Query:  VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT
        VSTNGGQSKTKPSRVQ ARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT
Subjt:  VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT

Query:  TANTNREPNNSSPNFQT
        TANTNREPNNSSPNFQT
Subjt:  TANTNREPNNSSPNFQT

KAG7017342.1 hypothetical protein SDJN02_19207, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0096.11Show/hide
Query:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
        MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKE+EEQNNIGQLGTVDS RIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
Subjt:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS

Query:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
        LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
Subjt:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP

Query:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
                        SASRKQTHGLQGPGRT+MQT SQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
Subjt:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI

Query:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
        PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVG KVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
Subjt:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ

Query:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN
        RSNSTR SLHSVSSKRISIDSDTSNDGGNHLVGPNT TTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN
Subjt:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN

Query:  VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT
        VSTNGGQSKTKPSRVQ ARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT
Subjt:  VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT

Query:  TANTNREPNNSSPNFQT
        TANTNREPNNSSPNFQT
Subjt:  TANTNREPNNSSPNFQT

XP_022934536.1 uncharacterized protein LOC111441682 [Cucurbita moschata]0.0e+0097.41Show/hide
Query:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
        MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
Subjt:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS

Query:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
        LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
Subjt:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP

Query:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
                        SASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
Subjt:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI

Query:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
        PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
Subjt:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ

Query:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN
        RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN
Subjt:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN

Query:  VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT
        VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT
Subjt:  VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT

Query:  TANTNREPNNSSPNFQT
        TANTNREPNNSSPNFQT
Subjt:  TANTNREPNNSSPNFQT

XP_022983416.1 uncharacterized protein LOC111482026 [Cucurbita maxima]3.9e-30194.34Show/hide
Query:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
        MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQ+NI QLG VDS RIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
Subjt:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS

Query:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
        LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
Subjt:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP

Query:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
                        SASRKQTHGLQGPGRT+MQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTT+DAGRAGRRDSVSLRSTAKLTRI
Subjt:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI

Query:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
        PTAAKIQQKTSSDVS +SSDKVG SFSKD  RKTEGKASPSSGCIPK+SSRVG KVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
Subjt:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ

Query:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKT-SQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAG
        RSNSTR SLHSVSSKRISIDSDTSNDGGNH VGPNTQTTGLRSQPVKKT SQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAG
Subjt:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKT-SQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAG

Query:  NVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKM
        NVSTNGGQSKTK SRVQ ARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKM
Subjt:  NVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKM

Query:  TTANTNREPNNSSPNFQT
        TTANTNREP NSSPNFQT
Subjt:  TTANTNREPNNSSPNFQT

XP_023526184.1 uncharacterized protein LOC111789741 [Cucurbita pepo subsp. pepo]1.1e-30895.3Show/hide
Query:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
        MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQ++I QLGTVDS RIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
Subjt:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS

Query:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
        LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSD+ISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRK TCKPP
Subjt:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP

Query:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
                        SASRKQTHGLQGPGRT+MQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
Subjt:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI

Query:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
        PTAAKIQQKTS DVS SSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
Subjt:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ

Query:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN
        RSNSTR SLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN
Subjt:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN

Query:  VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT
        VSTNGGQSKTKPSRVQ ARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNT SHNSGSCAESNRNSGALREEMSKENESCSYANKMT
Subjt:  VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT

Query:  TANTNREPNNSSPNFQT
        TANTNREPNNS PNFQT
Subjt:  TANTNREPNNSSPNFQT

TrEMBL top hitse value%identityAlignment
A0A5A7TL81 Nuclear pore complex protein NUP62 isoform X12.1e-20771.57Show/hide
Query:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
        MDA G D DRRF RLSLIDFASEDDFLL SPSCDLHDVNSLDIT ED+E + I Q   +D  RI+E TDAFEQRED+ Q L SSEPE IRRNGKYNLRKS
Subjt:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS

Query:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
        LAWDSAFFTSAGFLDPEELTSMIAPVGR EKR LP ISEDVQKSSDSIS+LESEIMPLESIEGNLFEDVRASIQKSSRI+G  NSR+K ESGR+   KPP
Subjt:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP

Query:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTA-KLTR
                        SAS K    LQGPGRT+ Q SSQPR GQQLKAV RLPS S S KRPSLGHN  AT KDGT S    A RRDSVSLR+TA + TR
Subjt:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTA-KLTR

Query:  IPTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLE
        I T+AK  QKTSSDVS SSSDKVG S SKD R+KTE KA PSSG +    SRV  KV +P   SRLSS      K +SGISPASSISEWSTESSSNSTLE
Subjt:  IPTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLE

Query:  QRSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKK-TSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGA
         RSNS R SL S+SSKRIS DS+ S+DG NH VGP+TQTTGL SQ VKK +SQSS LPP S KPSGLRLPSPKIGYFDG KTSS KSNLAVPGG TK GA
Subjt:  QRSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKK-TSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGA

Query:  GNVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTN--DQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYA
        GNVSTNGG+SK KPS++Q AR+LPK+ATRANI P MN KS+K   TKMSKTN  DQ++KEL REG NTD H S +CAES   SGA REE++KENES S A
Subjt:  GNVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTN--DQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYA

Query:  NKMTTANTNREPNNSSPNF
        N+    N+  EPNN+SP+F
Subjt:  NKMTTANTNREPNNSSPNF

A0A6J1CV96 uncharacterized protein LOC111015105 isoform X23.7e-21272.26Show/hide
Query:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
        MD    D+DRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSL ITKEDEEQNN+ QL TVDS RIE  TD+ EQ+EDEPQ L S +PER R+NGKYNLRKS
Subjt:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS

Query:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
        LAWDSAFFTSAGFLDPEELTSMI  VG+ EK  LP I EDVQKSSDSIS+LESEIMPLESIEGNLFEDVRASIQKSSR IG  NSRSKAESGR+ T KPP
Subjt:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP

Query:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTA-KLTR
                        S S KQT GLQGPGRT+ + SSQPRS QQL AVS+LP  S   KRPSLG + PA   + T+S AG A RRDSV+L++T  KLTR
Subjt:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTA-KLTR

Query:  IPTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKV-KTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTL
        I TAAK +Q+TSS+ S SSSDKV  S SKD ++KT+ KASPS+GCI K  SRV  +V +TP  NS ++SYLISQTKHSSGISPASSISEWSTESSSNSTL
Subjt:  IPTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKV-KTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTL

Query:  EQRSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKT-SQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTG
        EQRSNS+R SLHS+SSKRISIDSDTS++G NH VGP+TQTTGL SQ VKKT SQSS LPP SMKPSGLRLPSPKIG+FDGGKTSSMKSNLAVPGG+TK+G
Subjt:  EQRSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKT-SQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTG

Query:  AGNVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTN-DQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYA
        AGNVST+GGQ+KTKPS++Q  R+LPK  TRANI P+ NSKSNKP  TKMSKTN D++IKE  REG NTD H+S  CAESNRNSGALR    K    C   
Subjt:  AGNVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTN-DQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYA

Query:  NKMTTANTNREPNNSSPNFQ
        N+ TTA T+ EPN  SPN Q
Subjt:  NKMTTANTNREPNNSSPNFQ

A0A6J1CWQ3 uncharacterized protein LOC111015105 isoform X13.1e-21171.34Show/hide
Query:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
        MD    D+DRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSL ITKEDEEQNN+ QL TVDS RIE  TD+ EQ+EDEPQ L S +PER R+NGKYNLRKS
Subjt:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS

Query:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
        LAWDSAFFTSAGFLDPEELTSMI  VG+ EK  LP I EDVQKSSDSIS+LESEIMPLESIEGNLFEDVRASIQKSSR IG  NSRSKAESGR+ T KPP
Subjt:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP

Query:  C----------------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLR
                                S S KQT GLQGPGRT+ + SSQPRS QQL AVS+LP  S   KRPSLG + PA   + T+S AG A RRDSV+L+
Subjt:  C----------------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLR

Query:  STA-KLTRIPTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKV-KTPIENSRLSSYLISQTKHSSGISPASSISEWST
        +T  KLTRI TAAK +Q+TSS+ S SSSDKV  S SKD ++KT+ KASPS+GCI K  SRV  +V +TP  NS ++SYLISQTKHSSGISPASSISEWST
Subjt:  STA-KLTRIPTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKV-KTPIENSRLSSYLISQTKHSSGISPASSISEWST

Query:  ESSSNSTLEQRSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKT-SQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAV
        ESSSNSTLEQRSNS+R SLHS+SSKRISIDSDTS++G NH VGP+TQTTGL SQ VKKT SQSS LPP SMKPSGLRLPSPKIG+FDGGKTSSMKSNLAV
Subjt:  ESSSNSTLEQRSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKT-SQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAV

Query:  PGGMTKTGAGNVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTN-DQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSK
        PGG+TK+GAGNVST+GGQ+KTKPS++Q  R+LPK  TRANI P+ NSKSNKP  TKMSKTN D++IKE  REG NTD H+S  CAESNRNSGALR    K
Subjt:  PGGMTKTGAGNVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTN-DQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSK

Query:  ENESCSYANKMTTANTNREPNNSSPNFQ
            C   N+ TTA T+ EPN  SPN Q
Subjt:  ENESCSYANKMTTANTNREPNNSSPNFQ

A0A6J1F230 uncharacterized protein LOC1114416820.0e+0097.41Show/hide
Query:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
        MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
Subjt:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS

Query:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
        LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
Subjt:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP

Query:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
                        SASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
Subjt:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI

Query:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
        PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
Subjt:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ

Query:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN
        RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN
Subjt:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGN

Query:  VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT
        VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT
Subjt:  VSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMT

Query:  TANTNREPNNSSPNFQT
        TANTNREPNNSSPNFQT
Subjt:  TANTNREPNNSSPNFQT

A0A6J1J253 uncharacterized protein LOC1114820261.9e-30194.34Show/hide
Query:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
        MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQ+NI QLG VDS RIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS
Subjt:  MDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKS

Query:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
        LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP
Subjt:  LAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPP

Query:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI
                        SASRKQTHGLQGPGRT+MQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTT+DAGRAGRRDSVSLRSTAKLTRI
Subjt:  C--------------ESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRI

Query:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
        PTAAKIQQKTSSDVS +SSDKVG SFSKD  RKTEGKASPSSGCIPK+SSRVG KVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ
Subjt:  PTAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQ

Query:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKT-SQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAG
        RSNSTR SLHSVSSKRISIDSDTSNDGGNH VGPNTQTTGLRSQPVKKT SQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAG
Subjt:  RSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKT-SQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAG

Query:  NVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKM
        NVSTNGGQSKTK SRVQ ARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKM
Subjt:  NVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKM

Query:  TTANTNREPNNSSPNFQT
        TTANTNREP NSSPNFQT
Subjt:  TTANTNREPNNSSPNFQT

SwissProt top hitse value%identityAlignment
P46297 40S ribosomal protein S232.5e-7298.55Show/hide
Query:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
        KTRGMGA RKLK+HRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
Subjt:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA

Query:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
        GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
Subjt:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP

P49201 40S ribosomal protein S23-21.5e-6996.38Show/hide
Query:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
        KTRGMGAGRKLK  R  QRWADK YKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
Subjt:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA

Query:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
        GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
Subjt:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP

Q9GRJ3 40S ribosomal protein S231.4e-5980.58Show/hide
Query:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWK-KPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLI
        K RG+   RKLK+HRR QRW DK YKKSHLG  WK  PF G+SHAKGIVLEK+G+EAKQPNSAIRKC RVQLIKNGKKI AFVP DGCLN+IEENDEVL+
Subjt:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWK-KPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLI

Query:  AGFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
        AGFGRKGHAVGDIPGVRFKVVKV+ VSLLAL+K+KKE+P
Subjt:  AGFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP

Q9M5Z9 40S ribosomal protein S232.1e-7197.83Show/hide
Query:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
        KT GMGA RKLKSHRR QRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
Subjt:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA

Query:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
        GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
Subjt:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP

Q9SF35 40S ribosomal protein S23-16.9e-6794.2Show/hide
Query:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
        KTRGMGAGRKLK  R  QRWADK YKKS+ GNEWKKPFA SSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
Subjt:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA

Query:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
        GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
Subjt:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP

Arabidopsis top hitse value%identityAlignment
AT2G37070.1 unknown protein2.8e-1526.17Show/hide
Query:  LSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKSLAWDSAFFTSAGFL
        LSLIDF++EDD LL S      D  + D +  D+E   +   G   +C  EE   +    E EP  +            K NLRKSLAWD AFFT+AG L
Subjt:  LSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKSLAWDSAFFTSAGFL

Query:  DPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIE-GNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPPCESASRKQTHGLQ
        +P+EL+SM+       +++LP + ED+ +S++S+S+L+S+     ++E G  F    A+     + +G   S     S   +T   P      K     +
Subjt:  DPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIE-GNLFEDVRASIQKSSRIIGMGNSRSKAESGRKATCKPPCESASRKQTHGLQ

Query:  GPG---RTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGT---TSDAGRAGRRDSVS-------LRSTAKLTRIP-------------
         PG   + + + +  P + ++          +TSI RPS G N P++    T   + D  +A +  +         L S   ++R P             
Subjt:  GPG---RTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGT---TSDAGRAGRRDSVS-------LRSTAKLTRIP-------------

Query:  ----TAAKIQQKTSS--------DVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWS
            + A   + TSS         VS ++S+K      K  + ++   AS S    PK S+         I+  ++      +T +   +S  SS  +WS
Subjt:  ----TAAKIQQKTSS--------DVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWS

Query:  TESSSNSTLEQRSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTL-----PPTS-MKPSGLRLPSPKIGYFDGGKTSSM
        +ES    T  + +   + S+H                G N   GP T  T    +P+  +   S +     P  S MKP+GLR+PSPK+GYFDG + S  
Subjt:  TESSSNSTLEQRSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTL-----PPTS-MKPSGLRLPSPKIGYFDGGKTSSM

Query:  KSNLAVPGGMTKTGAGNVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCRE
        ++    P G        +S +G  S  + +  + A+  P T ++A ++P   S+S++ I +   K  ++   ++  E
Subjt:  KSNLAVPGGMTKTGAGNVSTNGGQSKTKPSRVQLARVLPKTATRANIQPNMNSKSNKPITTKMSKTNDQEIKELCRE

AT3G09680.1 Ribosomal protein S12/S23 family protein4.9e-6894.2Show/hide
Query:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
        KTRGMGAGRKLK  R  QRWADK YKKS+ GNEWKKPFA SSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
Subjt:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA

Query:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
        GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
Subjt:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP

AT3G53320.1 unknown protein1.8e-3331.56Show/hide
Query:  LSLIDFASEDDFLLSSPSCDLHDVNSLD-ITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKSLAWDSAFFTSAGF
        L LID A EDD LL S   +  + +  D   KED++ N    +     C  E    + E++E+  Q  +S EPE++ + GKYNLRKSLAWD+ FFTSAG 
Subjt:  LSLIDFASEDDFLLSSPSCDLHDVNSLD-ITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQREDEPQSLQSSEPERIRRNGKYNLRKSLAWDSAFFTSAGF

Query:  LDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGN-SRSKAESGRKATCKPPCESASRKQTHG-
        L+PEEL+SM+    +  K+ALP I ED+ +S++SIS+ +S+     S E  LFEDVRASIQ+S++   +    +S           P   +     T G 
Subjt:  LDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGN-SRSKAESGRKATCKPPCESASRKQTHG-

Query:  ------------LQGPGRTVMQ---------TSSQPRSGQQLKAVSRL-PSISTSIKRPSLGHNLPATEKDGTTSDAGR--AGRRDSVSLRSTAKLTRIP
                    +QGPG+   Q         + S+P +G     +S++ P  +TS  R SL  +    EK+ +   AG+   G R S+S R+   L + P
Subjt:  ------------LQGPGRTVMQ---------TSSQPRSGQQLKAVSRL-PSISTSIKRPSLGHNLPATEKDGTTSDAGR--AGRRDSVSLRSTAKLTRIP

Query:  TAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLI-------------SQTKHSSGISPASSISEW
                 SSD S +      +S    A   +     PS   I K +         P+ N   S  ++             S+ K SS +  A SIS++
Subjt:  TAAKIQQKTSSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLI-------------SQTKHSSGISPASSISEW

Query:  STESSSNSTLEQRSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGP--NTQTTGLRSQPVKKTSQ------SSTLPPTSMKPSGLRLPSPKIGYFDGGKT
        S+ESS  S   + +N         + K +S +   +ND     V P  N++ T +     K+ ++         +P  S KPSGLR+PSPKIG+FDG + 
Subjt:  STESSSNSTLEQRSNSTRTSLHSVSSKRISIDSDTSNDGGNHLVGP--NTQTTGLRSQPVKKTSQ------SSTLPPTSMKPSGLRLPSPKIGYFDGGKT

Query:  SSMKSNLAVPGGMTKTGAGNV--STNGGQSKTKPSRVQLARVLPK
         S  S     GG T+     +  STN   SK+K S   ++   PK
Subjt:  SSMKSNLAVPGGMTKTGAGNV--STNGGQSKTKPSRVQLARVLPK

AT5G02960.1 Ribosomal protein S12/S23 family protein1.1e-7096.38Show/hide
Query:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
        KTRGMGAGRKLK  R  QRWADK YKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA
Subjt:  KTRGMGAGRKLKSHRRRQRWADKSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIA

Query:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
        GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP
Subjt:  GFGRKGHAVGDIPGVRFKVVKVSGVSLLALFKEKKEKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGAGTATACTATTAATCCTGATCTCGTACCGAATGGGAACGTACATGATCGTTTCGTTTTTTGTAAGATTCGACATAAGATTTTGTACTATAGAAGGGAAGATCA
GGCACCGATATCAACCGAGGCCATTCAAGTATATGAGCAAGCTATTGCAAACCCTGAAGAAGAAACGTTTAAAATCAACGAACAATTTCATGGAGAGCTTATGAATCCCA
TATCAAACGAAACCATTCAAGTACATGAGCAAGCTATTGCATACCCTGAAGAAGAAACGTTTAAAATCAAAGAACAATTTCATGGAGAGCTTATGAATCCCATATCAACC
GAGACCATTCAAGTACATGAGCAAGCTATTGCAAACCTTGAAGAAGAAACGTTTCAAATCGACGAACAATCTCTTAGAGAACTTATGAATTCCATATCAACCGAGAGCAT
TAAAATACATGCACAAGCCGAAGATTCAAATGAAGAACAAATGTTTGAAATCAATGAACAATTTTTTGAAGAACTTGACGATCCATTGGAAGATATCGACTTCTCTTGCA
TGAACGACGACCTCTCATTACTTGATGATTTACCACTTCCATGGAAGACACGAGGTATGGGAGCTGGACGCAAGCTGAAGTCCCACCGTAGAAGGCAAAGGTGGGCTGAC
AAATCTTATAAGAAGTCTCACCTTGGCAATGAATGGAAGAAGCCATTTGCCGGGTCTTCTCATGCCAAGGGCATTGTGTTGGAAAAGATAGGTATTGAGGCTAAGCAGCC
CAATTCTGCCATTAGAAAATGTGCCAGGGTCCAACTCATCAAGAACGGGAAAAAGATTGCTGCATTCGTACCCAACGATGGGTGCCTGAATTATATAGAAGAAAATGATG
AAGTGTTGATTGCTGGATTTGGTCGTAAAGGACATGCGGTCGGAGATATTCCTGGTGTTCGATTCAAGGTTGTGAAGGTCTCTGGTGTGTCTCTCCTTGCTCTTTTCAAG
GAGAAGAAGGAGAAGCCTAGTTTGATTCCCAACATGATAATTGGTGTTCATGTTGGTAGTTTGATTCCCAAGCTAGTGAAGTACTCTTCTAGATCGATCTCCATGGACGC
CGCAGGCGCTGATCACGATCGGCGTTTTAGCCGTCTCAGCCTCATCGATTTTGCCTCCGAGGACGATTTCCTCCTCTCTTCTCCTTCCTGCGATCTCCACGACGTCAATT
CTTTAGACATTACGAAGGAGGACGAGGAACAGAACAATATCGGACAATTAGGGACCGTAGATTCTTGCAGAATAGAGGAGGGAACAGACGCCTTCGAACAGAGAGAGGAT
GAACCTCAATCACTCCAATCTTCGGAACCGGAAAGGATCAGAAGAAATGGAAAATATAACTTGCGTAAGAGTTTAGCATGGGATAGCGCTTTCTTCACTAGTGCAGGGTT
TTTGGATCCTGAGGAGTTAACGAGCATGATTGCACCAGTAGGCAGGATTGAAAAGCGTGCACTACCGAAAATTTCAGAAGATGTTCAGAAATCTTCAGATTCAATTTCTT
CGTTGGAAAGTGAAATTATGCCATTGGAAAGCATTGAGGGAAATTTATTCGAAGATGTAAGGGCTTCAATTCAGAAATCTAGTAGAATTATTGGGATGGGTAACTCGAGG
AGTAAAGCTGAATCCGGGAGAAAGGCAACATGCAAACCGCCATGTGAGTCTGCTTCCAGAAAGCAAACTCATGGGTTGCAAGGACCGGGTAGAACTGTTATGCAGACTTC
TTCTCAACCACGCAGTGGACAGCAATTGAAGGCAGTCAGCAGGCTTCCTTCGATCTCAACATCGATAAAGAGGCCTTCCCTTGGTCACAATCTTCCTGCCACTGAAAAGG
ATGGCACTACCAGTGATGCAGGACGTGCAGGCAGGCGGGATTCAGTTAGTCTTAGAAGTACTGCTAAGCTTACCCGAATTCCAACTGCAGCTAAGATTCAGCAAAAGACT
TCCTCTGACGTTTCTGGAAGTTCATCTGACAAGGTCGGTAACTCTTTCTCCAAAGATGCGAGGAGAAAAACAGAGGGTAAGGCTTCACCTTCCTCTGGCTGTATCCCAAA
AATGTCATCACGAGTTGGATTGAAGGTCAAAACTCCTATTGAGAATTCTCGTCTCTCGTCTTACTTGATATCCCAAACTAAGCATTCTTCAGGAATATCACCTGCTAGTT
CTATTAGTGAGTGGTCGACAGAGTCATCATCAAATTCTACTCTTGAACAACGATCAAATAGCACAAGAACCAGCCTTCACTCAGTTTCGAGCAAAAGAATCTCTATAGAC
AGTGACACATCTAATGATGGAGGAAACCATCTTGTTGGACCCAATACTCAGACTACTGGTTTGCGATCTCAGCCAGTAAAGAAAACTTCGCAGTCTTCTACACTTCCTCC
TACTTCAATGAAGCCTTCGGGCCTTCGATTGCCATCACCTAAAATTGGATACTTTGATGGGGGGAAAACATCTAGCATGAAATCTAATCTTGCTGTACCTGGTGGCATGA
CTAAGACTGGAGCTGGAAATGTTAGCACAAATGGAGGCCAAAGTAAGACCAAGCCTTCAAGGGTTCAACTCGCTAGAGTGTTACCCAAGACCGCAACTCGTGCTAATATT
CAGCCTAATATGAATTCGAAATCAAATAAACCAATTACAACGAAGATGTCCAAAACGAATGACCAAGAAATCAAGGAGCTTTGTCGTGAAGGACGGAATACCGATTCACA
CAATTCAGGTTCATGTGCAGAAAGTAATAGGAATTCAGGTGCTTTGAGGGAGGAGATGAGCAAAGAGAATGAATCTTGCAGTTATGCAAACAAAATGACAACAGCTAACA
CCAACAGAGAGCCTAACAACAGCTCACCGAACTTCCAGACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATGAGTATACTATTAATCCTGATCTCGTACCGAATGGGAACGTACATGATCGTTTCGTTTTTTGTAAGATTCGACATAAGATTTTGTACTATAGAAGGGAAGATCA
GGCACCGATATCAACCGAGGCCATTCAAGTATATGAGCAAGCTATTGCAAACCCTGAAGAAGAAACGTTTAAAATCAACGAACAATTTCATGGAGAGCTTATGAATCCCA
TATCAAACGAAACCATTCAAGTACATGAGCAAGCTATTGCATACCCTGAAGAAGAAACGTTTAAAATCAAAGAACAATTTCATGGAGAGCTTATGAATCCCATATCAACC
GAGACCATTCAAGTACATGAGCAAGCTATTGCAAACCTTGAAGAAGAAACGTTTCAAATCGACGAACAATCTCTTAGAGAACTTATGAATTCCATATCAACCGAGAGCAT
TAAAATACATGCACAAGCCGAAGATTCAAATGAAGAACAAATGTTTGAAATCAATGAACAATTTTTTGAAGAACTTGACGATCCATTGGAAGATATCGACTTCTCTTGCA
TGAACGACGACCTCTCATTACTTGATGATTTACCACTTCCATGGAAGACACGAGGTATGGGAGCTGGACGCAAGCTGAAGTCCCACCGTAGAAGGCAAAGGTGGGCTGAC
AAATCTTATAAGAAGTCTCACCTTGGCAATGAATGGAAGAAGCCATTTGCCGGGTCTTCTCATGCCAAGGGCATTGTGTTGGAAAAGATAGGTATTGAGGCTAAGCAGCC
CAATTCTGCCATTAGAAAATGTGCCAGGGTCCAACTCATCAAGAACGGGAAAAAGATTGCTGCATTCGTACCCAACGATGGGTGCCTGAATTATATAGAAGAAAATGATG
AAGTGTTGATTGCTGGATTTGGTCGTAAAGGACATGCGGTCGGAGATATTCCTGGTGTTCGATTCAAGGTTGTGAAGGTCTCTGGTGTGTCTCTCCTTGCTCTTTTCAAG
GAGAAGAAGGAGAAGCCTAGTTTGATTCCCAACATGATAATTGGTGTTCATGTTGGTAGTTTGATTCCCAAGCTAGTGAAGTACTCTTCTAGATCGATCTCCATGGACGC
CGCAGGCGCTGATCACGATCGGCGTTTTAGCCGTCTCAGCCTCATCGATTTTGCCTCCGAGGACGATTTCCTCCTCTCTTCTCCTTCCTGCGATCTCCACGACGTCAATT
CTTTAGACATTACGAAGGAGGACGAGGAACAGAACAATATCGGACAATTAGGGACCGTAGATTCTTGCAGAATAGAGGAGGGAACAGACGCCTTCGAACAGAGAGAGGAT
GAACCTCAATCACTCCAATCTTCGGAACCGGAAAGGATCAGAAGAAATGGAAAATATAACTTGCGTAAGAGTTTAGCATGGGATAGCGCTTTCTTCACTAGTGCAGGGTT
TTTGGATCCTGAGGAGTTAACGAGCATGATTGCACCAGTAGGCAGGATTGAAAAGCGTGCACTACCGAAAATTTCAGAAGATGTTCAGAAATCTTCAGATTCAATTTCTT
CGTTGGAAAGTGAAATTATGCCATTGGAAAGCATTGAGGGAAATTTATTCGAAGATGTAAGGGCTTCAATTCAGAAATCTAGTAGAATTATTGGGATGGGTAACTCGAGG
AGTAAAGCTGAATCCGGGAGAAAGGCAACATGCAAACCGCCATGTGAGTCTGCTTCCAGAAAGCAAACTCATGGGTTGCAAGGACCGGGTAGAACTGTTATGCAGACTTC
TTCTCAACCACGCAGTGGACAGCAATTGAAGGCAGTCAGCAGGCTTCCTTCGATCTCAACATCGATAAAGAGGCCTTCCCTTGGTCACAATCTTCCTGCCACTGAAAAGG
ATGGCACTACCAGTGATGCAGGACGTGCAGGCAGGCGGGATTCAGTTAGTCTTAGAAGTACTGCTAAGCTTACCCGAATTCCAACTGCAGCTAAGATTCAGCAAAAGACT
TCCTCTGACGTTTCTGGAAGTTCATCTGACAAGGTCGGTAACTCTTTCTCCAAAGATGCGAGGAGAAAAACAGAGGGTAAGGCTTCACCTTCCTCTGGCTGTATCCCAAA
AATGTCATCACGAGTTGGATTGAAGGTCAAAACTCCTATTGAGAATTCTCGTCTCTCGTCTTACTTGATATCCCAAACTAAGCATTCTTCAGGAATATCACCTGCTAGTT
CTATTAGTGAGTGGTCGACAGAGTCATCATCAAATTCTACTCTTGAACAACGATCAAATAGCACAAGAACCAGCCTTCACTCAGTTTCGAGCAAAAGAATCTCTATAGAC
AGTGACACATCTAATGATGGAGGAAACCATCTTGTTGGACCCAATACTCAGACTACTGGTTTGCGATCTCAGCCAGTAAAGAAAACTTCGCAGTCTTCTACACTTCCTCC
TACTTCAATGAAGCCTTCGGGCCTTCGATTGCCATCACCTAAAATTGGATACTTTGATGGGGGGAAAACATCTAGCATGAAATCTAATCTTGCTGTACCTGGTGGCATGA
CTAAGACTGGAGCTGGAAATGTTAGCACAAATGGAGGCCAAAGTAAGACCAAGCCTTCAAGGGTTCAACTCGCTAGAGTGTTACCCAAGACCGCAACTCGTGCTAATATT
CAGCCTAATATGAATTCGAAATCAAATAAACCAATTACAACGAAGATGTCCAAAACGAATGACCAAGAAATCAAGGAGCTTTGTCGTGAAGGACGGAATACCGATTCACA
CAATTCAGGTTCATGTGCAGAAAGTAATAGGAATTCAGGTGCTTTGAGGGAGGAGATGAGCAAAGAGAATGAATCTTGCAGTTATGCAAACAAAATGACAACAGCTAACA
CCAACAGAGAGCCTAACAACAGCTCACCGAACTTCCAGACCTAATGTTGATGGTATGCTTTTGCCAACTTTATGCTTGCAGTAAGGTAAGTCAGAATCATGAATTTGGGC
GGGTTTCGATACTCGAAAGAGTTAATTAACTGTTGATCAGGACTCTCGGCTTGTGAATATCTGATAATGTCCCAAAGGAAGCCTCCCACTTTCTTGTCAAAACTCAAAGG
TTCACTGATGCCTCTCACTGCTCATCTAACTTAGGTAACTGAAAATCATTCCATACAGGCCCAAAAGATCTTTTCTCAGCTGCGGCAGCCGCAGATGTGGATCAAGAATT
GCCTCTGTTGCCATCTTAGCAAGAACACAATACTAATGGCTGTGGCTTGTCCTGGTGTCTGTCTGGTCCAGTGTTTCATGATGTTGGTATTCCAAATCTGGGTTTTCTGG
TGAAATGTGAAGGTCGTTGTGCTGCAAGCTAAGTATCTGATACCAATCATATGGGGGCATTGCTGACTTGGTATCGCCGTTGTTGTCGTGTCTGTGTACTCGTGGTGCTT
GTGATCTCGGCAGAACAAGAATTTCGAACCTTTCAGAATCCAAAAAGTGGTTGTCGTGTTTTACCTTTTTAGTTCATGAACAGTGTTGTAGATGTTCCTCATTTAACCCT
TTCCCGTACTCCACATTTCTGGGTTTCAATGAAAACTTTATTGTAGGATTATGGATCCATCTGATTATGTTTTAACCTTTTAGATCGTATTTGAATGTGAAAAGAGAACA
ATGTAACAATACTATATTAATTATAGATTGACAAAGTCGACAGAG
Protein sequenceShow/hide protein sequence
MHEYTINPDLVPNGNVHDRFVFCKIRHKILYYRREDQAPISTEAIQVYEQAIANPEEETFKINEQFHGELMNPISNETIQVHEQAIAYPEEETFKIKEQFHGELMNPIST
ETIQVHEQAIANLEEETFQIDEQSLRELMNSISTESIKIHAQAEDSNEEQMFEINEQFFEELDDPLEDIDFSCMNDDLSLLDDLPLPWKTRGMGAGRKLKSHRRRQRWAD
KSYKKSHLGNEWKKPFAGSSHAKGIVLEKIGIEAKQPNSAIRKCARVQLIKNGKKIAAFVPNDGCLNYIEENDEVLIAGFGRKGHAVGDIPGVRFKVVKVSGVSLLALFK
EKKEKPSLIPNMIIGVHVGSLIPKLVKYSSRSISMDAAGADHDRRFSRLSLIDFASEDDFLLSSPSCDLHDVNSLDITKEDEEQNNIGQLGTVDSCRIEEGTDAFEQRED
EPQSLQSSEPERIRRNGKYNLRKSLAWDSAFFTSAGFLDPEELTSMIAPVGRIEKRALPKISEDVQKSSDSISSLESEIMPLESIEGNLFEDVRASIQKSSRIIGMGNSR
SKAESGRKATCKPPCESASRKQTHGLQGPGRTVMQTSSQPRSGQQLKAVSRLPSISTSIKRPSLGHNLPATEKDGTTSDAGRAGRRDSVSLRSTAKLTRIPTAAKIQQKT
SSDVSGSSSDKVGNSFSKDARRKTEGKASPSSGCIPKMSSRVGLKVKTPIENSRLSSYLISQTKHSSGISPASSISEWSTESSSNSTLEQRSNSTRTSLHSVSSKRISID
SDTSNDGGNHLVGPNTQTTGLRSQPVKKTSQSSTLPPTSMKPSGLRLPSPKIGYFDGGKTSSMKSNLAVPGGMTKTGAGNVSTNGGQSKTKPSRVQLARVLPKTATRANI
QPNMNSKSNKPITTKMSKTNDQEIKELCREGRNTDSHNSGSCAESNRNSGALREEMSKENESCSYANKMTTANTNREPNNSSPNFQT