; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G01700 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G01700
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description40S ribosomal protein S3-2-like
Genome locationClcChr11:1692288..1700992
RNA-Seq ExpressionClc11G01700
SyntenyClc11G01700
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0022627 - cytosolic small ribosomal subunit (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR001351 - Ribosomal protein S3, C-terminal
IPR004044 - K Homology domain, type 2
IPR005703 - Ribosomal protein S3, eukaryotic/archaeal
IPR009019 - K homology domain superfamily, prokaryotic type
IPR012870 - Protein of unknown function DUF1666
IPR015946 - K homology domain-like, alpha/beta
IPR018280 - Ribosomal protein S3, conserved site
IPR036419 - Ribosomal protein S3, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061472.1 40S ribosomal protein S3-3 [Cucumis melo var. makuwa]0.0e+0074.41Show/hide
Query:  QADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCFVQELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEA
        +  YGTVDSVL  +G E+E     DYNSSKYQLEPTTQIHGF++ESETT CFVQE +C     S    QT D  FEC S KYL ESD GVKLE+ ++EE 
Subjt:  QADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCFVQELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEA

Query:  LEIKDEHKGLEFNSDGPIHEEAEKFGFEIKD----LEDCSFLSSDSDSESSSFDEEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKEREND
        LE  DEH+GL  N D PI  + EKFG +  +    +ED SFL SDSDSES SFDEEY+EIELE          + VLPVNDWS +ENQD L E  E E D
Subjt:  LEIKDEHKGLEFNSDGPIHEEAEKFGFEIKD----LEDCSFLSSDSDSESSSFDEEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKEREND

Query:  ENAMEF---EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLD
        E  MEF   +QQ+EEE EEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   EA SMCPT VETLKPLK   NFELK+HFREIQKVYKTY+EKMRKLD
Subjt:  ENAMEF---EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLD

Query:  ISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRADVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNE
        ISN QTNYAIGLVKLKDP + MDGK    KSVF LKLR  R DVK C  L RDLKRDMEMVY+GH+CLSWE+LHWQ+RKA ELQQNDS+ +S+FTRV NE
Subjt:  ISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRADVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNE

Query:  FQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNA
        FQLFS+LIQRFIEDEQFCGPRIDNYA NRLFIRSLLQVPAIR DCVNDKK RGKE E+T+STAALVSIIEDSM+VFREFLRADK V+NSTIK AQV+LNA
Subjt:  FQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNA

Query:  QLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRLKTELLIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGKAYKVQSAAAAFSST
        Q MMMEIR GL+KKERRLKEIMRSGNCI K+F+RI EEEGR+K ELLIAEVELKL+SRV                                         
Subjt:  QLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRLKTELLIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGKAYKVQSAAAAFSST

Query:  RTRAAQGLPFSLFLKMATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVE
                                 FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVE
Subjt:  RTRAAQGLPFSLFLKMATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVE

Query:  LYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIK
        LYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIK
Subjt:  LYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIK

Query:  VKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRPAAVLTTDIEVPVAV
        VKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRP AVLT DIEVPVAV
Subjt:  VKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRPAAVLTTDIEVPVAV

KAG6599520.1 40S ribosomal protein S3-1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0070.9Show/hide
Query:  SLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETETI-------DY---------------NSSKYQLEPTTQIHGFMEESETTKCFVQELY
        S  YSLS+F+VSVF  II LFFRIQ DYGTV+SVLA+ GDE ET+       D+               NSSK+Q+E T QI+GF+EESETT CFV+ELY
Subjt:  SLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETETI-------DY---------------NSSKYQLEPTTQIHGFMEESETTKCFVQELY

Query:  C----SDSGNQTSDDYFECCSGKYL-------GESDDGVKLEMFSNEEALEIKDEHKGLEFNSDGPIHEEAEKFGFEIKDLEDCSFLSSDSDSESSSFDE
        C    S SGNQT DD F+C SGKYL        E  D V+LE+FS +EAL  +DEH+ L+ N DGPI +E          L DCS   SDSDSES SFDE
Subjt:  C----SDSGNQTSDDYFECCSGKYL-------GESDDGVKLEMFSNEEALEIKDEHKGLEFNSDGPIHEEAEKFGFEIKDLEDCSFLSSDSDSESSSFDE

Query:  EYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEFEQQEEEEEEE--------EFLQEHQDLINQLKIELRNSRTGGLPTVQEE
        E+IEIELE          + V PVNDWS +E++D L EP E E DE  MEFE+ EEEEEEE        EF QEHQDLI QLKIELRNSRTGGLPTVQEE
Subjt:  EYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEFEQQEEEEEEE--------EFLQEHQDLINQLKIELRNSRTGGLPTVQEE

Query:  EEA--ESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMD----GKKSVFSLKLRAGRADVKDCSRLM
        EEA  E M PT VE LKPLKNGGNFE +  F+EIQKVYKTY++KMRKLD+SNTQTNYA+ L+KLKDP + M+    G KSV S KLRAGRA VK    LM
Subjt:  EEA--ESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMD----GKKSVFSLKLRAGRADVKDCSRLM

Query:  RDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKL
        RDLKRDMEMVY+GH+CLSWEVLHWQ+RKAIELQQND++  SR+TRVVNEFQ F +L+QRFIEDE FCGPRI+NY +NRL +RSLLQVPAIREDCV+DKKL
Subjt:  RDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKL

Query:  RGKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIA-EEEGRLKTELLIAE
        RGKEGE+T+STAALVS+IE+SMRVFR+FLRADKDV ++TIK A+VE+NAQ MMMEIRT LRKKERRLKEI+R GNCIVK+ +R++ EEEGRLK ELLIAE
Subjt:  RGKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIA-EEEGRLKTELLIAE

Query:  VELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGKAYKVQSAAAAFSSTRTRAAQGLPFSL---------------FLKMATQMSKKRKFVADGVFFAE
        VELKL+SRVV                                  +A SST TRAAQ     L                LKMATQMSKKRKFVADGVFFAE
Subjt:  VELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGKAYKVQSAAAAFSSTRTRAAQGLPFSL---------------FLKMATQMSKKRKFVADGVFFAE

Query:  LNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY
        LNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY
Subjt:  LNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACY

Query:  GVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIV
        GVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVV IHSPKEEEEIV
Subjt:  GVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIV

Query:  HRP---AAVLTTDIEVPVAVA
        HRP   AAVLTTDIEVPV VA
Subjt:  HRP---AAVLTTDIEVPVAVA

XP_008459697.1 PREDICTED: uncharacterized protein LOC103498734 [Cucumis melo]3.1e-23172.15Show/hide
Query:  ILAFSLNKPFANEFVRHNYNMLSLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCFVQ
        +LAFS +  ++NEF+ H  NM  LIYS+ NFL+S+F  II  FFRIQ  YGTVDSVL  +G E+E     +YNSSKYQLEPTTQIHGF++ESETT CFVQ
Subjt:  ILAFSLNKPFANEFVRHNYNMLSLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCFVQ

Query:  ELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEALEIKDEHKGLEFNSDGPIHEEAEKFGFEIKD----LEDCSFLSSDSDSESSSFD
        E +C     S    QT D  FEC S KYL ESD GVKLE+ ++EE LE  DEH+GL  N D PI  + EKFG +  +    +ED SFL SDSDSES SFD
Subjt:  ELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEALEIKDEHKGLEFNSDGPIHEEAEKFGFEIKD----LEDCSFLSSDSDSESSSFD

Query:  EEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEF---EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---
        EEY+EIELE          + VLPVNDWS +ENQD L E  E E  E  MEF   +QQ+EEE EEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   
Subjt:  EEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEF---EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---

Query:  EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRADVKDCSRLMRDL
        EA SMCPT VETLKPLK   NFELK+HFREIQKVYKTY+EKMRKLDISN QTNYAIGLVKLKDP   MDGK    KSVF LKLR  R DVK C  L RDL
Subjt:  EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRADVKDCSRLMRDL

Query:  KRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGK
        KRDMEMVY+GH+CLSWE+LHWQ+RKA ELQQNDS+ +S+FTRV NEFQLFS+LIQRFIEDEQFCGPRIDNYA NRLFIRSLLQVPAIR DCVNDKK RGK
Subjt:  KRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGK

Query:  EGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRLKTELLIAEVELK
        E E+T+STAALVSIIEDSM+VFREFLRADK V+NSTIK AQV+LNAQ MMMEIR GL+KKERRLKEIMRSGNCI K+F+RI EEEGR+K ELLIAEVELK
Subjt:  EGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRLKTELLIAEVELK

Query:  LISRVVNMSKLTESQLIWCHKKLHQVNFVNGK
        L+SRVV+MS+LTESQLIWCHKKLHQ+NFVN K
Subjt:  LISRVVNMSKLTESQLIWCHKKLHQVNFVNGK

XP_011648587.1 uncharacterized protein LOC101214479 isoform X1 [Cucumis sativus]1.0e-23472.36Show/hide
Query:  PSILAFSLNKPFANEFVRHNYNMLSLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCF
        P +LAFS    ++NEF+    NML LIYS+ NFL+S+F  II  FFRIQ  YGTVDSVL I+G E E   T DYNSSKYQLEPTTQIHGF+++SETT CF
Subjt:  PSILAFSLNKPFANEFVRHNYNMLSLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCF

Query:  VQELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEALEIKDEHKGLE---------FNSDGPIHEEAEKFGFEIKD----LEDCSFLS
        VQE +C     S SGNQT D  FEC S KYL ESDDGVKLE+F+ EE LE  DEH+GLE          N D PI  E EK G +  +    +ED SFL 
Subjt:  VQELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEALEIKDEHKGLE---------FNSDGPIHEEAEKFGFEIKD----LEDCSFLS

Query:  SDSDSESSSFDEEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEF----EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTG
        SDSD ES  FDEEYIEIELE          + +LPVNDWS +E+QD L E  E E DE  MEF    +QQEEEEEEEEFLQEHQDLINQLKIELRNSRTG
Subjt:  SDSDSESSSFDEEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEF----EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTG

Query:  GLPTVQEEE---EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRA
        GLPTVQEEE   EA SMCPT VETLKPLK   NFELK+HFREIQKVYKTY+EKMRKLDISN QTNYAIGLVKLKDP   MDGK    KSVF LKLR GR 
Subjt:  GLPTVQEEE---EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRA

Query:  DVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIR
         VKDC RL RDLKRDMEMVY+GH+CLSWE+LHWQ+RKA ELQQNDS+ +SRFTRVVNEFQLFS+LIQRFIEDEQFCGPRIDNYA NRLFIRSLLQVPAIR
Subjt:  DVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIR

Query:  EDCVNDKKLRGKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRL
         DCVNDKK RGKE E+T+STAALVSIIEDSM+VFREFLRA+K VRNSTIK AQ +LNAQ MMMEIR+GL+KKERRLKEI+RSGNCI K+F+RI E+EGR+
Subjt:  EDCVNDKKLRGKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRL

Query:  KTELLIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGK
        K ELLIAEVELKL+SRVV+MS+LTESQLIWCHKKLHQ+NFVN K
Subjt:  KTELLIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGK

XP_031736637.1 uncharacterized protein LOC101214479 isoform X2 [Cucumis sativus]4.1e-22373.38Show/hide
Query:  LFFRIQADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCFVQELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMF
        +FF++   YGTVDSVL I+G E E   T DYNSSKYQLEPTTQIHGF+++SETT CFVQE +C     S SGNQT D  FEC S KYL ESDDGVKLE+F
Subjt:  LFFRIQADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCFVQELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMF

Query:  SNEEALEIKDEHKGLE---------FNSDGPIHEEAEKFGFEIKD----LEDCSFLSSDSDSESSSFDEEYIEIELE----------SLVLPVNDWSHDE
        + EE LE  DEH+GLE          N D PI  E EK G +  +    +ED SFL SDSD ES  FDEEYIEIELE          + +LPVNDWS +E
Subjt:  SNEEALEIKDEHKGLE---------FNSDGPIHEEAEKFGFEIKD----LEDCSFLSSDSDSESSSFDEEYIEIELE----------SLVLPVNDWSHDE

Query:  NQDRLEEPKERENDENAMEF----EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAESMCPTLVETLKPLKNGGNFELKEHFREI
        +QD L E  E E DE  MEF    +QQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   EA SMCPT VETLKPLK   NFELK+HFREI
Subjt:  NQDRLEEPKERENDENAMEF----EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAESMCPTLVETLKPLKNGGNFELKEHFREI

Query:  QKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRADVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQ
        QKVYKTY+EKMRKLDISN QTNYAIGLVKLKDP   MDGK    KSVF LKLR GR  VKDC RL RDLKRDMEMVY+GH+CLSWE+LHWQ+RKA ELQQ
Subjt:  QKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRADVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQ

Query:  NDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGKEGENTVSTAALVSIIEDSMRVFREFLRADKD
        NDS+ +SRFTRVVNEFQLFS+LIQRFIEDEQFCGPRIDNYA NRLFIRSLLQVPAIR DCVNDKK RGKE E+T+STAALVSIIEDSM+VFREFLRA+K 
Subjt:  NDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGKEGENTVSTAALVSIIEDSMRVFREFLRADKD

Query:  VRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRLKTELLIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNG
        VRNSTIK AQ +LNAQ MMMEIR+GL+KKERRLKEI+RSGNCI K+F+RI E+EGR+K ELLIAEVELKL+SRVV+MS+LTESQLIWCHKKLHQ+NFVN 
Subjt:  VRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRLKTELLIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNG

Query:  K
        K
Subjt:  K

TrEMBL top hitse value%identityAlignment
A0A0A0LFQ1 Uncharacterized protein5.0e-23572.36Show/hide
Query:  PSILAFSLNKPFANEFVRHNYNMLSLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCF
        P +LAFS    ++NEF+    NML LIYS+ NFL+S+F  II  FFRIQ  YGTVDSVL I+G E E   T DYNSSKYQLEPTTQIHGF+++SETT CF
Subjt:  PSILAFSLNKPFANEFVRHNYNMLSLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCF

Query:  VQELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEALEIKDEHKGLE---------FNSDGPIHEEAEKFGFEIKD----LEDCSFLS
        VQE +C     S SGNQT D  FEC S KYL ESDDGVKLE+F+ EE LE  DEH+GLE          N D PI  E EK G +  +    +ED SFL 
Subjt:  VQELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEALEIKDEHKGLE---------FNSDGPIHEEAEKFGFEIKD----LEDCSFLS

Query:  SDSDSESSSFDEEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEF----EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTG
        SDSD ES  FDEEYIEIELE          + +LPVNDWS +E+QD L E  E E DE  MEF    +QQEEEEEEEEFLQEHQDLINQLKIELRNSRTG
Subjt:  SDSDSESSSFDEEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEF----EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTG

Query:  GLPTVQEEE---EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRA
        GLPTVQEEE   EA SMCPT VETLKPLK   NFELK+HFREIQKVYKTY+EKMRKLDISN QTNYAIGLVKLKDP   MDGK    KSVF LKLR GR 
Subjt:  GLPTVQEEE---EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRA

Query:  DVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIR
         VKDC RL RDLKRDMEMVY+GH+CLSWE+LHWQ+RKA ELQQNDS+ +SRFTRVVNEFQLFS+LIQRFIEDEQFCGPRIDNYA NRLFIRSLLQVPAIR
Subjt:  DVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIR

Query:  EDCVNDKKLRGKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRL
         DCVNDKK RGKE E+T+STAALVSIIEDSM+VFREFLRA+K VRNSTIK AQ +LNAQ MMMEIR+GL+KKERRLKEI+RSGNCI K+F+RI E+EGR+
Subjt:  EDCVNDKKLRGKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRL

Query:  KTELLIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGK
        K ELLIAEVELKL+SRVV+MS+LTESQLIWCHKKLHQ+NFVN K
Subjt:  KTELLIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGK

A0A1S3CB94 uncharacterized protein LOC1034987341.5e-23172.15Show/hide
Query:  ILAFSLNKPFANEFVRHNYNMLSLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCFVQ
        +LAFS +  ++NEF+ H  NM  LIYS+ NFL+S+F  II  FFRIQ  YGTVDSVL  +G E+E     +YNSSKYQLEPTTQIHGF++ESETT CFVQ
Subjt:  ILAFSLNKPFANEFVRHNYNMLSLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCFVQ

Query:  ELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEALEIKDEHKGLEFNSDGPIHEEAEKFGFEIKD----LEDCSFLSSDSDSESSSFD
        E +C     S    QT D  FEC S KYL ESD GVKLE+ ++EE LE  DEH+GL  N D PI  + EKFG +  +    +ED SFL SDSDSES SFD
Subjt:  ELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEALEIKDEHKGLEFNSDGPIHEEAEKFGFEIKD----LEDCSFLSSDSDSESSSFD

Query:  EEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEF---EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---
        EEY+EIELE          + VLPVNDWS +ENQD L E  E E  E  MEF   +QQ+EEE EEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   
Subjt:  EEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEF---EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---

Query:  EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRADVKDCSRLMRDL
        EA SMCPT VETLKPLK   NFELK+HFREIQKVYKTY+EKMRKLDISN QTNYAIGLVKLKDP   MDGK    KSVF LKLR  R DVK C  L RDL
Subjt:  EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRADVKDCSRLMRDL

Query:  KRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGK
        KRDMEMVY+GH+CLSWE+LHWQ+RKA ELQQNDS+ +S+FTRV NEFQLFS+LIQRFIEDEQFCGPRIDNYA NRLFIRSLLQVPAIR DCVNDKK RGK
Subjt:  KRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGK

Query:  EGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRLKTELLIAEVELK
        E E+T+STAALVSIIEDSM+VFREFLRADK V+NSTIK AQV+LNAQ MMMEIR GL+KKERRLKEIMRSGNCI K+F+RI EEEGR+K ELLIAEVELK
Subjt:  EGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRLKTELLIAEVELK

Query:  LISRVVNMSKLTESQLIWCHKKLHQVNFVNGK
        L+SRVV+MS+LTESQLIWCHKKLHQ+NFVN K
Subjt:  LISRVVNMSKLTESQLIWCHKKLHQVNFVNGK

A0A5A7UZY3 40S ribosomal protein S3-2-like0.0e+0074.41Show/hide
Query:  QADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCFVQELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEA
        +  YGTVDSVL  +G E+E     DYNSSKYQLEPTTQIHGF++ESETT CFVQE +C     S    QT D  FEC S KYL ESD GVKLE+ ++EE 
Subjt:  QADYGTVDSVLAIQGDETE---TIDYNSSKYQLEPTTQIHGFMEESETTKCFVQELYC-----SDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEA

Query:  LEIKDEHKGLEFNSDGPIHEEAEKFGFEIKD----LEDCSFLSSDSDSESSSFDEEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKEREND
        LE  DEH+GL  N D PI  + EKFG +  +    +ED SFL SDSDSES SFDEEY+EIELE          + VLPVNDWS +ENQD L E  E E D
Subjt:  LEIKDEHKGLEFNSDGPIHEEAEKFGFEIKD----LEDCSFLSSDSDSESSSFDEEYIEIELE----------SLVLPVNDWSHDENQDRLEEPKEREND

Query:  ENAMEF---EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLD
        E  MEF   +QQ+EEE EEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   EA SMCPT VETLKPLK   NFELK+HFREIQKVYKTY+EKMRKLD
Subjt:  ENAMEF---EQQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLD

Query:  ISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRADVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNE
        ISN QTNYAIGLVKLKDP + MDGK    KSVF LKLR  R DVK C  L RDLKRDMEMVY+GH+CLSWE+LHWQ+RKA ELQQNDS+ +S+FTRV NE
Subjt:  ISNTQTNYAIGLVKLKDPITPMDGK----KSVFSLKLRAGRADVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNE

Query:  FQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNA
        FQLFS+LIQRFIEDEQFCGPRIDNYA NRLFIRSLLQVPAIR DCVNDKK RGKE E+T+STAALVSIIEDSM+VFREFLRADK V+NSTIK AQV+LNA
Subjt:  FQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNA

Query:  QLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRLKTELLIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGKAYKVQSAAAAFSST
        Q MMMEIR GL+KKERRLKEIMRSGNCI K+F+RI EEEGR+K ELLIAEVELKL+SRV                                         
Subjt:  QLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRLKTELLIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGKAYKVQSAAAAFSST

Query:  RTRAAQGLPFSLFLKMATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVE
                                 FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVE
Subjt:  RTRAAQGLPFSLFLKMATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVE

Query:  LYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIK
        LYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIK
Subjt:  LYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIK

Query:  VKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRPAAVLTTDIEVPVAV
        VKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRP AVLT DIEVPVAV
Subjt:  VKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRPAAVLTTDIEVPVAV

A0A6J1G4S1 uncharacterized protein LOC1114507411.8e-21367.99Show/hide
Query:  SLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETETI-------DY---------------NSSKYQLEPTTQIHGFMEESETTKCFVQELY
        S  YSLS+F+VSVF  II LFFRIQ DYGTV+SVLA+ GDE ET+       D+               NSSK+Q+E T QI+GF+EESETT CFV+ELY
Subjt:  SLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETETI-------DY---------------NSSKYQLEPTTQIHGFMEESETTKCFVQELY

Query:  C----SDSGNQTSDDYFECCSGKYL-------GESDDGVKLEMFSNEEALEIKDEHKGLEFNSDGPIHEEAEKFGFEIKDLEDCSFLSSDSDSESSSFDE
        C    S SGNQT DD F+C SGKYL        E  D VKLE+FS +EAL  +DEH+ L+ N DGPI +E  +       L DCSF  SDSDSES SFDE
Subjt:  C----SDSGNQTSDDYFECCSGKYL-------GESDDGVKLEMFSNEEALEIKDEHKGLEFNSDGPIHEEAEKFGFEIKDLEDCSFLSSDSDSESSSFDE

Query:  EYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEFEQQEEEEEEE---EFLQEHQDLINQLKIELRNSRTGGLPTVQEEEEA--
        E+IEIELE          + V PVNDWS +E++D L EP E E DE  MEFE+ EEEEEEE   EF QEHQDLI QLKIELRNSRTGGLPTVQEEEEA  
Subjt:  EYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEFEQQEEEEEEE---EFLQEHQDLINQLKIELRNSRTGGLPTVQEEEEA--

Query:  ESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMD----GKKSVFSLKLRAGRADVKDCSRLMRDLKR
        E M PT VE LKPLKNGGNFE +  F+EIQKVYKTY++KMRKLD+SNTQTNYAI L+KLKDP + M+    G KSV S KLRAGRA VK    LMRDLKR
Subjt:  ESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMD----GKKSVFSLKLRAGRADVKDCSRLMRDLKR

Query:  DMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGKEG
        DMEMVY+GH+CLSWEVLHWQ+RKAIELQQND++  SR+TRVVNEFQ F +L+QRFIEDE FCGPRI+NY +NRL +RSLLQVPAIREDCV+DKKLRGKEG
Subjt:  DMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGKEG

Query:  ENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIA-EEEGRLKTELLIAEVELKL
        E+T+STAALVS+IE+SMRVFR+FLRADKDV ++TIK A+VE+NAQ MMMEIRT LRKKERRLKEI+R GNCIVK+ +R++ EEEGRLK ELLIAEVELKL
Subjt:  ENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIA-EEEGRLKTELLIAEVELKL

Query:  ISRVVNMSKLTESQLIWCHKKLHQVNFVNGK
        +SRVV+MS+LTESQL+WCHKKLHQ+NFVN K
Subjt:  ISRVVNMSKLTESQLIWCHKKLHQVNFVNGK

A0A6J1KAQ8 uncharacterized protein LOC1114937711.0e-21167.24Show/hide
Query:  SLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETET----------------------IDYNSSKYQLEPTTQIHGFMEESETTKCFVQELY
        S  YSLS+F+VSVF  II LFFRIQ DYGTV+SVLA+ GDE ET                      +  NSSK+Q+E T QIHGF+EESETT CFV+ELY
Subjt:  SLIYSLSNFLVSVFDPIITLFFRIQADYGTVDSVLAIQGDETET----------------------IDYNSSKYQLEPTTQIHGFMEESETTKCFVQELY

Query:  C----SDSGNQTSDDYFECCSGKYL-------GESDDGVKLEMFSNEEALEIKDEHKGLEFNSDGPIHEEAEKFGFEIKDLEDCSFLSSDSDSESSSFDE
        C    S SGNQT DD FEC SGKYL        E  D VKLE+FS +EALE +DEH+ L+ N DGPI +E  +       L DCSF  SDSDSES SFDE
Subjt:  C----SDSGNQTSDDYFECCSGKYL-------GESDDGVKLEMFSNEEALEIKDEHKGLEFNSDGPIHEEAEKFGFEIKDLEDCSFLSSDSDSESSSFDE

Query:  EYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEFEQQEEEEEEE-------EFLQEHQDLINQLKIELRNSRTGGLPTVQEEE
        E+IEIELE          + V PVNDWS +E++D L E  E E DE  MEFE++EEEEEEE       EF QEHQDLI QLKIELRNSRTGGLPTVQEEE
Subjt:  EYIEIELE----------SLVLPVNDWSHDENQDRLEEPKERENDENAMEFEQQEEEEEEE-------EFLQEHQDLINQLKIELRNSRTGGLPTVQEEE

Query:  EA--ESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMD----GKKSVFSLKLRAGRADVKDCSRLMR
        EA  E M PT VE LKPLKNGGNFE +  F+EIQKVYKTY++KMRKLD+SNTQTNYAI  +KLKDP + MD    G KSV S KLRA RA VK    LMR
Subjt:  EA--ESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMD----GKKSVFSLKLRAGRADVKDCSRLMR

Query:  DLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLR
        DLKRDMEMVY+GH+CLSWEVLHWQ+RKAIELQQND++  SR+TRVVNEFQ F +LIQRF+EDE FCGPRI+NY +NRL +RSLLQVPAIREDCVNDKKLR
Subjt:  DLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLR

Query:  GKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAE-EEGRLKTELLIAEV
        GKEGE+T+STAALVS+IE+SM VFR+FLRADKDVR++ IK A+VE+NAQ MMMEIRT LRKKERRLKEI+R G+CIVK+ +R++E EEGRLK ELLIAEV
Subjt:  GKEGENTVSTAALVSIIEDSMRVFREFLRADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAE-EEGRLKTELLIAEV

Query:  ELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGK
        ELKL+SRVV+M +LTESQL+WCHKKLHQ+NFVN K
Subjt:  ELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGK

SwissProt top hitse value%identityAlignment
P02350 40S ribosomal protein S3-A3.0e-9684.72Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MA Q+SKKRKFVADG+F AELNE LTRELAEDGYSGVEVRVTP RTEIII ATRTQNVLGEKGRRIRELT+VVQKRF FPE SVELYAEKV  RGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRF+MESGAKGCEV+VSGKLR QRAKSMKF DG MI SG PV  Y+D+AVRHVLLRQGVLGIKVKIML WDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVTIHSPKEE
          PLPD V+I  PK+E
Subjt:  TTPLPDVVTIHSPKEE

P47835 40S ribosomal protein S3-B3.0e-9684.72Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MA QMSKKRKFVADG+F AELNE LTRELAEDGYSGVEVRVTP +TEIII ATRTQNVLGEKGRRIRELT+VVQKRF FPE SVELYAEKV  RGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRF+MESGAKGCEV+VSGKLR QRAKSMKF DG MI SG PV  Y+D+AVRHVLLRQGVLGIKVKIML WDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVTIHSPKEE
          PLPD V+I  PK+E
Subjt:  TTPLPDVVTIHSPKEE

Q9FJA6 40S ribosomal protein S3-34.8e-11087.29Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP++SVELYAEKV NRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLG+KVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVTIHSPKEEEEIVHRPAAVLTTDIEVPVA
         TPLPDVV IH+PKE++  +  PA V+T    VP A
Subjt:  TTPLPDVVTIHSPKEEEEIVHRPAAVLTTDIEVPVA

Q9M339 40S ribosomal protein S3-21.1e-10990.71Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        M TQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YIDSAVRHVLLRQGVLGIKVK+MLDWDPKG  GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVTIHSPKEEEEIVHRPAAV
         TPLPDVV IHSPKEEE I + PA V
Subjt:  TTPLPDVVTIHSPKEEEEIVHRPAAV

Q9SIP7 40S ribosomal protein S3-14.5e-10891.67Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLGIKVKIMLDWDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVTIHSPKEE
         TPLPDVV IH+PK++
Subjt:  TTPLPDVVTIHSPKEE

Arabidopsis top hitse value%identityAlignment
AT2G31610.1 Ribosomal protein S3 family protein3.2e-10991.67Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLGIKVKIMLDWDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVTIHSPKEE
         TPLPDVV IH+PK++
Subjt:  TTPLPDVVTIHSPKEE

AT3G53870.1 Ribosomal protein S3 family protein7.6e-11190.71Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        M TQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YIDSAVRHVLLRQGVLGIKVK+MLDWDPKG  GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVTIHSPKEEEEIVHRPAAV
         TPLPDVV IHSPKEEE I + PA V
Subjt:  TTPLPDVVTIHSPKEEEEIVHRPAAV

AT5G35530.1 Ribosomal protein S3 family protein3.4e-11187.29Show/hide
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP++SVELYAEKV NRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLG+KVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVTIHSPKEEEEIVHRPAAVLTTDIEVPVA
         TPLPDVV IH+PKE++  +  PA V+T    VP A
Subjt:  TTPLPDVVTIHSPKEEEEIVHRPAAVLTTDIEVPVA

AT5G39785.1 Protein of unknown function (DUF1666)3.5e-7137.42Show/hide
Query:  EIKDLEDCSFLSSDSDSESSSFDEEYIEIELESLVLPVNDWSHDENQDRLEEPKERENDENAMEFEQQEEEEEEE---EFLQEHQDLINQLKIELRNSRT
        +I+ L +  FL SDSD   SS   +      E   L  +D++    +  L++ + R++D +    + +EEEEE+    E L EHQDLI QLK+E++  + 
Subjt:  EIKDLEDCSFLSSDSDSESSSFDEEYIEIELESLVLPVNDWSHDENQDRLEEPKERENDENAMEFEQQEEEEEEE---EFLQEHQDLINQLKIELRNSRT

Query:  -GGLPTVQEEEEAESMCPTLVETLKP--LKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDP--ITPMDGK-------KSVFSLK
         GGL T+ EEEE +  CP ++E LKP  ++    F+  +   E+ K +++Y E+MRKLDI + Q +YA+GL++ K P   T   G         SVFS+ 
Subjt:  -GGLPTVQEEEEAESMCPTLVETLKP--LKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDP--ITPMDGK-------KSVFSLK

Query:  LR---AGRADVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIR
        +R   A +++++   + +++++ ++E VY+G +CLSWE+LHWQY KAIEL ++D     R+  V  EFQ F VL+QRF+E+E F  PR+ +Y + R  +R
Subjt:  LR---AGRADVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIR

Query:  SLLQVPAIREDCVNDKK---LRGKEGEN--TVSTAALVSIIEDSMRVFREFLRADK------DVRNSTIKRAQVELNAQ------LMMMEIRTGLRKKER
        +LLQ+P IRED   DKK    R  E  N   + +  LV I+E+++R+F  F+R DK      D ++ T  ++Q+E + +       M  E+++ L+ KE+
Subjt:  SLLQVPAIREDCVNDKK---LRGKEGEN--TVSTAALVSIIEDSMRVFREFLRADK------DVRNSTIKRAQVELNAQ------LMMMEIRTGLRKKER

Query:  RLKEIMRSGNCIVKRFQRIAEEEGRLKTEL-LIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGKAY
        RL+++++S  CI++RFQ+  EE+      L   ++V++KL++RV+NMSKLT   L+WCH KL ++NFVN + +
Subjt:  RLKEIMRSGNCIVKRFQRIAEEEGRLKTEL-LIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGKAY

AT5G39785.2 Protein of unknown function (DUF1666)3.2e-6937.13Show/hide
Query:  EIKDLEDCSFLSSDSDSESSSFDEEYIEIELESLVLPVNDWSHDENQDRLEEPKERENDENAMEFEQQEEEEEEE---EFLQEHQDLINQLKIELRNSRT
        +I+ L +  FL SDSD   SS   +      E   L  +D++    +  L++ + R++D +    + +EEEEE+    E L EHQDLI QLK+E++  + 
Subjt:  EIKDLEDCSFLSSDSDSESSSFDEEYIEIELESLVLPVNDWSHDENQDRLEEPKERENDENAMEFEQQEEEEEEE---EFLQEHQDLINQLKIELRNSRT

Query:  -GGLPTVQEEEEAESMCPTLVETLKP--LKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDP--ITPMDGK-------KSVFSLK
         GGL T+ EEEE +  CP ++E LKP  ++    F+  +   E+ K +++Y E+MRKLDI + Q +YA+GL++ K P   T   G         SVFS+ 
Subjt:  -GGLPTVQEEEEAESMCPTLVETLKP--LKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDP--ITPMDGK-------KSVFSLK

Query:  LR---AGRADVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIR
        +R   A +++++   + +++++ ++E VY+G +CLSWE+LHWQY KAIEL ++D     R+  V  EFQ F VL+QRF+E+E F  PR+ +Y + R  +R
Subjt:  LR---AGRADVKDCSRLMRDLKRDMEMVYIGHICLSWEVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIR

Query:  SLLQVPAIREDCVNDKK---LRGKEGEN--TVSTAALVSIIEDSMRVFREFLRADK------DVRNSTIKRAQVELNAQ------LMMMEIRTGLRK-KE
        +LLQ+P IRED   DKK    R  E  N   + +  LV I+E+++R+F  F+R DK      D ++ T  ++Q+E + +       M  E+++ L+   E
Subjt:  SLLQVPAIREDCVNDKK---LRGKEGEN--TVSTAALVSIIEDSMRVFREFLRADK------DVRNSTIKRAQVELNAQ------LMMMEIRTGLRK-KE

Query:  RRLKEIMRSGNCIVKRFQRIAEEEGRLKTEL-LIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGKAY
        +RL+++++S  CI++RFQ+  EE+      L   ++V++KL++RV+NMSKLT   L+WCH KL ++NFVN + +
Subjt:  RRLKEIMRSGNCIVKRFQRIAEEEGRLKTEL-LIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGKAY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTAGCCAACTCCAGCTACCACTTCCAGCAACGGCCGCTGGCCATCTTCGACCACCACTTTCGAAAACCGTCATCGACCGACCTTTGGGCACTACCTTCAACGAC
CGCTTCCGGCCACCACCACGGATGTGTTGATTTTGATTCCCCGGAACTTAGGCCTTCAATTCTTGCTTTTTCCCTCAATAAGCCATTCGCAAATGAATTTGTGCGCCATA
ATTATAATATGCTTTCTCTCATATATTCCCTCTCCAATTTTCTTGTCTCCGTGTTTGATCCAATAATCACACTCTTTTTCAGAATTCAAGCCGATTATGGAACTGTTGAT
TCTGTTCTTGCAATTCAAGGCGATGAAACTGAAACAATTGATTACAATTCAAGCAAATATCAATTGGAACCTACCACACAAATCCATGGGTTCATGGAAGAATCGGAAAC
TACCAAATGTTTTGTCCAAGAATTGTACTGTTCTGATTCAGGTAATCAAACTTCTGATGATTATTTTGAATGTTGCAGCGGAAAATATTTAGGGGAAAGTGATGATGGGG
TGAAACTAGAAATGTTTAGCAACGAAGAGGCTTTAGAAATCAAAGATGAACATAAGGGTTTAGAATTCAATAGCGATGGCCCAATTCATGAGGAAGCTGAGAAGTTTGGT
TTTGAAATTAAAGATTTGGAAGATTGTTCGTTTCTTTCTTCAGATTCCGACTCTGAATCTTCAAGTTTTGATGAAGAGTACATAGAAATAGAATTAGAATCCCTAGTTTT
GCCTGTAAATGATTGGAGTCACGATGAGAATCAAGATCGTTTGGAGGAACCAAAAGAAAGAGAAAATGATGAGAATGCGATGGAATTTGAGCAGCAAGAAGAAGAAGAAG
AGGAAGAGGAATTCTTGCAAGAACATCAAGATTTGATAAATCAACTCAAGATAGAGCTAAGAAACTCAAGAACAGGAGGACTTCCAACCGTACAAGAAGAAGAAGAGGCA
GAATCCATGTGTCCCACGTTGGTTGAAACTCTAAAACCTCTAAAGAATGGTGGAAATTTCGAACTCAAAGAACATTTCAGAGAGATCCAAAAGGTTTACAAGACTTATTC
AGAGAAAATGCGAAAGCTTGACATCTCCAATACTCAAACAAATTATGCAATTGGTTTAGTTAAGTTGAAAGATCCAATTACTCCAATGGATGGGAAGAAATCTGTGTTTT
CCCTCAAGTTAAGGGCAGGGAGAGCAGATGTTAAGGACTGTTCAAGATTGATGAGAGACTTAAAGAGGGATATGGAAATGGTGTATATTGGACATATTTGCCTTTCTTGG
GAAGTTTTACATTGGCAGTATAGGAAGGCCATTGAGTTGCAACAAAATGACTCTCAACGCATCTCTCGGTTCACTCGAGTTGTCAATGAATTTCAACTCTTCTCCGTCCT
CATTCAAAGATTCATTGAAGACGAACAGTTTTGTGGCCCTCGAATTGACAATTATGCCGAAAACCGACTTTTCATTCGTAGTCTCCTTCAAGTTCCTGCAATTAGAGAGG
ATTGTGTAAATGACAAAAAGCTTAGAGGCAAAGAAGGTGAAAATACCGTGTCAACTGCAGCTCTAGTATCGATCATTGAAGATTCAATGCGGGTTTTTCGGGAATTTCTA
CGTGCAGATAAAGACGTTAGGAATTCAACGATCAAACGCGCTCAAGTAGAACTTAATGCACAGCTTATGATGATGGAGATACGAACTGGTCTGCGAAAGAAGGAGAGGAG
GCTAAAAGAGATAATGAGAAGTGGGAATTGTATAGTAAAGAGGTTTCAAAGGATTGCTGAAGAAGAAGGCAGATTGAAGACTGAATTGTTGATTGCGGAAGTTGAATTGA
AATTGATATCAAGGGTTGTCAATATGTCAAAACTAACGGAGAGCCAATTGATTTGGTGTCATAAAAAACTACACCAGGTCAATTTTGTGAACGGGAAGGCGTATAAAGTT
CAATCTGCGGCAGCGGCTTTTTCTTCCACTAGAACGCGCGCAGCACAGGGGCTTCCCTTCTCCCTCTTTTTGAAAATGGCGACCCAGATGAGTAAGAAGAGAAAGTTCGT
AGCCGATGGAGTGTTTTTTGCCGAGCTAAACGAGGTTTTGACCCGAGAGTTGGCCGAGGATGGGTACTCCGGTGTCGAGGTTAGAGTCACTCCTATGCGCACTGAGATCA
TCATCCGAGCCACCCGCACCCAAAACGTTCTCGGTGAAAAGGGTAGGAGGATTAGGGAACTCACCTCGGTTGTGCAGAAGCGGTTCAAGTTTCCGGAGAATAGTGTTGAG
CTGTATGCCGAGAAAGTTAACAATAGAGGGCTTTGTGCCATTGCTCAAGCTGAGTCACTTCGTTATAAGCTACTTGGTGGCCTTGCTGTTAGGAGGGCATGCTATGGTGT
ATTGAGATTCGTCATGGAAAGTGGAGCCAAGGGTTGCGAGGTGATTGTTAGTGGAAAACTCAGGGCACAGCGTGCAAAGTCTATGAAGTTCAAAGACGGATATATGATCT
CTTCGGGTCAACCAGTCAGAGATTATATTGACTCAGCTGTTAGACACGTTCTTCTTAGACAGGGAGTTCTTGGTATCAAAGTTAAGATCATGCTTGATTGGGATCCTAAG
GGCAAGCAAGGCCCTACCACCCCTCTACCTGATGTTGTAACCATCCACTCTCCCAAGGAGGAAGAAGAAATCGTTCACAGACCAGCAGCAGTTTTGACCACTGATATTGA
GGTTCCAGTAGCAGTAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTAGCCAACTCCAGCTACCACTTCCAGCAACGGCCGCTGGCCATCTTCGACCACCACTTTCGAAAACCGTCATCGACCGACCTTTGGGCACTACCTTCAACGAC
CGCTTCCGGCCACCACCACGGATGTGTTGATTTTGATTCCCCGGAACTTAGGCCTTCAATTCTTGCTTTTTCCCTCAATAAGCCATTCGCAAATGAATTTGTGCGCCATA
ATTATAATATGCTTTCTCTCATATATTCCCTCTCCAATTTTCTTGTCTCCGTGTTTGATCCAATAATCACACTCTTTTTCAGAATTCAAGCCGATTATGGAACTGTTGAT
TCTGTTCTTGCAATTCAAGGCGATGAAACTGAAACAATTGATTACAATTCAAGCAAATATCAATTGGAACCTACCACACAAATCCATGGGTTCATGGAAGAATCGGAAAC
TACCAAATGTTTTGTCCAAGAATTGTACTGTTCTGATTCAGGTAATCAAACTTCTGATGATTATTTTGAATGTTGCAGCGGAAAATATTTAGGGGAAAGTGATGATGGGG
TGAAACTAGAAATGTTTAGCAACGAAGAGGCTTTAGAAATCAAAGATGAACATAAGGGTTTAGAATTCAATAGCGATGGCCCAATTCATGAGGAAGCTGAGAAGTTTGGT
TTTGAAATTAAAGATTTGGAAGATTGTTCGTTTCTTTCTTCAGATTCCGACTCTGAATCTTCAAGTTTTGATGAAGAGTACATAGAAATAGAATTAGAATCCCTAGTTTT
GCCTGTAAATGATTGGAGTCACGATGAGAATCAAGATCGTTTGGAGGAACCAAAAGAAAGAGAAAATGATGAGAATGCGATGGAATTTGAGCAGCAAGAAGAAGAAGAAG
AGGAAGAGGAATTCTTGCAAGAACATCAAGATTTGATAAATCAACTCAAGATAGAGCTAAGAAACTCAAGAACAGGAGGACTTCCAACCGTACAAGAAGAAGAAGAGGCA
GAATCCATGTGTCCCACGTTGGTTGAAACTCTAAAACCTCTAAAGAATGGTGGAAATTTCGAACTCAAAGAACATTTCAGAGAGATCCAAAAGGTTTACAAGACTTATTC
AGAGAAAATGCGAAAGCTTGACATCTCCAATACTCAAACAAATTATGCAATTGGTTTAGTTAAGTTGAAAGATCCAATTACTCCAATGGATGGGAAGAAATCTGTGTTTT
CCCTCAAGTTAAGGGCAGGGAGAGCAGATGTTAAGGACTGTTCAAGATTGATGAGAGACTTAAAGAGGGATATGGAAATGGTGTATATTGGACATATTTGCCTTTCTTGG
GAAGTTTTACATTGGCAGTATAGGAAGGCCATTGAGTTGCAACAAAATGACTCTCAACGCATCTCTCGGTTCACTCGAGTTGTCAATGAATTTCAACTCTTCTCCGTCCT
CATTCAAAGATTCATTGAAGACGAACAGTTTTGTGGCCCTCGAATTGACAATTATGCCGAAAACCGACTTTTCATTCGTAGTCTCCTTCAAGTTCCTGCAATTAGAGAGG
ATTGTGTAAATGACAAAAAGCTTAGAGGCAAAGAAGGTGAAAATACCGTGTCAACTGCAGCTCTAGTATCGATCATTGAAGATTCAATGCGGGTTTTTCGGGAATTTCTA
CGTGCAGATAAAGACGTTAGGAATTCAACGATCAAACGCGCTCAAGTAGAACTTAATGCACAGCTTATGATGATGGAGATACGAACTGGTCTGCGAAAGAAGGAGAGGAG
GCTAAAAGAGATAATGAGAAGTGGGAATTGTATAGTAAAGAGGTTTCAAAGGATTGCTGAAGAAGAAGGCAGATTGAAGACTGAATTGTTGATTGCGGAAGTTGAATTGA
AATTGATATCAAGGGTTGTCAATATGTCAAAACTAACGGAGAGCCAATTGATTTGGTGTCATAAAAAACTACACCAGGTCAATTTTGTGAACGGGAAGGCGTATAAAGTT
CAATCTGCGGCAGCGGCTTTTTCTTCCACTAGAACGCGCGCAGCACAGGGGCTTCCCTTCTCCCTCTTTTTGAAAATGGCGACCCAGATGAGTAAGAAGAGAAAGTTCGT
AGCCGATGGAGTGTTTTTTGCCGAGCTAAACGAGGTTTTGACCCGAGAGTTGGCCGAGGATGGGTACTCCGGTGTCGAGGTTAGAGTCACTCCTATGCGCACTGAGATCA
TCATCCGAGCCACCCGCACCCAAAACGTTCTCGGTGAAAAGGGTAGGAGGATTAGGGAACTCACCTCGGTTGTGCAGAAGCGGTTCAAGTTTCCGGAGAATAGTGTTGAG
CTGTATGCCGAGAAAGTTAACAATAGAGGGCTTTGTGCCATTGCTCAAGCTGAGTCACTTCGTTATAAGCTACTTGGTGGCCTTGCTGTTAGGAGGGCATGCTATGGTGT
ATTGAGATTCGTCATGGAAAGTGGAGCCAAGGGTTGCGAGGTGATTGTTAGTGGAAAACTCAGGGCACAGCGTGCAAAGTCTATGAAGTTCAAAGACGGATATATGATCT
CTTCGGGTCAACCAGTCAGAGATTATATTGACTCAGCTGTTAGACACGTTCTTCTTAGACAGGGAGTTCTTGGTATCAAAGTTAAGATCATGCTTGATTGGGATCCTAAG
GGCAAGCAAGGCCCTACCACCCCTCTACCTGATGTTGTAACCATCCACTCTCCCAAGGAGGAAGAAGAAATCGTTCACAGACCAGCAGCAGTTTTGACCACTGATATTGA
GGTTCCAGTAGCAGTAGCTTGA
Protein sequenceShow/hide protein sequence
MSLANSSYHFQQRPLAIFDHHFRKPSSTDLWALPSTTASGHHHGCVDFDSPELRPSILAFSLNKPFANEFVRHNYNMLSLIYSLSNFLVSVFDPIITLFFRIQADYGTVD
SVLAIQGDETETIDYNSSKYQLEPTTQIHGFMEESETTKCFVQELYCSDSGNQTSDDYFECCSGKYLGESDDGVKLEMFSNEEALEIKDEHKGLEFNSDGPIHEEAEKFG
FEIKDLEDCSFLSSDSDSESSSFDEEYIEIELESLVLPVNDWSHDENQDRLEEPKERENDENAMEFEQQEEEEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEEEA
ESMCPTLVETLKPLKNGGNFELKEHFREIQKVYKTYSEKMRKLDISNTQTNYAIGLVKLKDPITPMDGKKSVFSLKLRAGRADVKDCSRLMRDLKRDMEMVYIGHICLSW
EVLHWQYRKAIELQQNDSQRISRFTRVVNEFQLFSVLIQRFIEDEQFCGPRIDNYAENRLFIRSLLQVPAIREDCVNDKKLRGKEGENTVSTAALVSIIEDSMRVFREFL
RADKDVRNSTIKRAQVELNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKRFQRIAEEEGRLKTELLIAEVELKLISRVVNMSKLTESQLIWCHKKLHQVNFVNGKAYKV
QSAAAAFSSTRTRAAQGLPFSLFLKMATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVE
LYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPK
GKQGPTTPLPDVVTIHSPKEEEEIVHRPAAVLTTDIEVPVAVA