CuGenDBv2

CmoCh05G014060 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh05G014060
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: 40S ribosomal protein S3-2-like
Genome location: Cmo_Chr05:10827365..10834797
RNA-Seq Expression: CmoCh05G014060
Synteny: CmoCh05G014060
Gene Ontology terms:
  GO:0006412 - translation (biological process)
  GO:0005737 - cytoplasm (cellular component)
  GO:0015935 - small ribosomal subunit (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0003723 - RNA binding (molecular function)
  GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domains:
  IPR001351 - Ribosomal protein S3, C-terminal
  IPR004044 - K Homology domain, type 2
  IPR005703 - Ribosomal protein S3, eukaryotic/archaeal
  IPR009019 - K homology domain superfamily, prokaryotic type
  IPR012870 - Protein of unknown function DUF1666
  IPR015946 - K homology domain-like, alpha/beta
  IPR018280 - Ribosomal protein S3, conserved site
  IPR036419 - Ribosomal protein S3, C-terminal domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6599520.1 40S ribosomal protein S3-1, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 95.43
Query:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
        MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
Subjt:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC

Query:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
        FVEELYCDAS SSSGNQTPDDDFDCYSGKYLCEFS ETEEGFDDV+LELFSTDEALGKEDEHEDLKSNEDGPIFDELP VSTPLGDCS PFSDSDSESPS
Subjt:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS

Query:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEED-----EEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTV
        FDEEFIEIELELEPRL VSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEED     EEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTV
Subjt:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEED-----EEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTV

Query:  QEEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSL
        QEEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYA+DLIKLKDPFSSMNEKKSGLKSVVS+KLRAGRAVKGYPSL
Subjt:  QEEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSL

Query:  MRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKK
        MRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKK
Subjt:  MRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKK

Query:  LRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIA
        LRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIA
Subjt:  LRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIA

Query:  EVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSERLRCCLSNFSLKMATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTE
        EVELKLVSRVV       S      +   + + +  +IF ERLRCCLSNFSLKMATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTE
Subjt:  EVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSERLRCCLSNFSLKMATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTE

Query:  IIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRA
        IIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRA
Subjt:  IIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRA

Query:  KSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVVVAA
        KSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVVIHSPKEEEEIVHRP  AAAAAVLTTDIEVPVVVAA
Subjt:  KSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVVVAA

KAG7030497.1 hypothetical protein SDJN02_08844 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 96.6
Query:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
        MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
Subjt:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC

Query:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
        FVEELYCDAS SSSGNQTPDDDFDCYSGKYLCEFS ETEEGFDDV+LELFSTDEALGKEDEHEDLKSNEDGPIFDELP VSTPLGDCS PFSDSDSESPS
Subjt:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS

Query:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEED-----------EEEEEEEEDDEFSQEHQDLIAQLKIELRNSRT
        FDEEFIEIE+ELEPRL VSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEED           EEEEEEEEDDEFSQEHQDLIAQLKIELRNSRT
Subjt:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEED-----------EEEEEEEEDDEFSQEHQDLIAQLKIELRNSRT

Query:  GGLPTVQEEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAV
        GGLPTVQEEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVS+KLRAGRAV
Subjt:  GGLPTVQEEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAV

Query:  KGYPSLMRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIRED
        KGYPSLMRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIRED
Subjt:  KGYPSLMRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIRED

Query:  CVSDKKLRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLK
        CVSDKKLRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLK
Subjt:  CVSDKKLRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLK

Query:  NELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE
        NELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRK+  E
Subjt:  NELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE

XP_022946773.1 uncharacterized protein LOC111450741 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 99.53
Query:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
        MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
Subjt:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC

Query:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
        FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
Subjt:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS

Query:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEE
        FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEE
Subjt:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEE

Query:  AGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSLMRDLK
        AGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSLMRDLK
Subjt:  AGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSLMRDLK

Query:  RDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKLRGKE
        RDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKLRGKE
Subjt:  RDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKLRGKE

Query:  GESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAEVELK
        GESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAEVELK
Subjt:  GESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAEVELK

Query:  LVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE
        LVSRVVSMSRLTESQLVWCHKKLHQINFVNRK+  E
Subjt:  LVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE

XP_022999382.1 uncharacterized protein LOC111493771 [Cucurbita maxima] | e value: 0.0e+00 | % identity: 94.53
Query:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
        MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDD ELLGEEAEI+SVVASNSSKFQVEHTAQI+GFVEESETTNC
Subjt:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC

Query:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
        FVEELYCDAS SSSGNQTPDDDF+CYSGKYLCEFS ETEEGFDDVKLELFSTDEAL KEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
Subjt:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS

Query:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEF----EEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQ
        FDEEFIEIELELEPRL VSNNAQVCPVNDWSEEESKDCL E +ETERDEKGMEF    EE+EEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQ
Subjt:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEF----EEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQ

Query:  EEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSLM
        EEEEAGPEYMSPTSVE LKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAID IKLKDPFSSM+EKKSGLKSVVS+KLRA RAVKGYP+LM
Subjt:  EEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSLM

Query:  RDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKL
        RDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCIL+QRF+EDE FCGPRINNYVKNRLLVRSLLQVPAIREDCV+DKKL
Subjt:  RDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKL

Query:  RGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAE
        RGKEGESTISTAALVSMIEESM VFRDFLRADKDV ST IKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGG+CIVKKLKRVSE EEGRLKNELLIAE
Subjt:  RGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAE

Query:  VELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE
        VELKLVSRVVSM RLTESQLVWCHKKLHQINFVNRK+  E
Subjt:  VELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE

XP_023546981.1 uncharacterized protein LOC111805919 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 96.26
Query:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
        MLPLAGSFRYSLSSFVVSVFVA+IGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
Subjt:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC

Query:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
        FVEELYCDAS SSSGNQTPDDDFDC SGKYLCEFS ETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
Subjt:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS

Query:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEF-----EEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTV
        FDEEFIEIELELEPRL VSNNAQVCPVNDWSEEESKDCLGEP+ETERDEKGMEF     EE+EEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTV
Subjt:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEF-----EEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTV

Query:  QEEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSL
        QEEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSM+EKKSGLKSVVS+KLRAGRAVKGYPSL
Subjt:  QEEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSL

Query:  MRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKK
        MRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIEL QNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKK
Subjt:  MRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKK

Query:  LRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIA
        LRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVG+TTIKCAEVEVNAQAM+MEIRTELRKKERRLKEIVRGGNCIVKK KRVSEEEEGRLKNELL+A
Subjt:  LRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIA

Query:  EVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE
        EVE+KLVSRVVSMSRLTESQLVWCHKKLHQI FVNRK+  E
Subjt:  EVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LFQ1 Uncharacterized protein | e value: 8.9e-219 | % identity: 65.68
Query:  AFARFYSLSKALVNAFLFHNMLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKF
        AF+ F+  S    N FL  NMLPL     YS+ +F++S+F AII  FFRIQV YGTV+SVL                   ++ G+E E  +    NSSK+
Subjt:  AFARFYSLSKALVNAFLFHNMLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKF

Query:  QVEHTAQIYGFVEESETTNCFVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLK---------SNEDG
        Q+E T QI+GF+++SETTNCFV+E    AS SSSGNQTPD DF+CYS KYLCE         D VKLE+F+ +E L K DEHE L+         SN D 
Subjt:  QVEHTAQIYGFVEESETTNCFVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLK---------SNEDG

Query:  PIFDEL-------PEVSTPLGDCSFPFSDSDSESPSFDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGME-FEEDEEEEEE
        PI  E+        EV T + D SF FSDSD ESP FDEE+IEIELEL+P L V NNA++ PVNDWSEEES+DCL E  ETE+DEKGME FE+ +++EEE
Subjt:  PIFDEL-------PEVSTPLGDCSFPFSDSDSESPSFDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGME-FEEDEEEEEE

Query:  EEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEEAGPE-YMSPTSVEALKPLKNGGNFE-HRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKL
        EE++EF QEHQDLI QLKIELRNSRTGGLPTVQEEE+ G    M PTSVE LKPLK   NFE  + F+EIQKVYKTYA+KMRKLD+SN QTNYAI L+KL
Subjt:  EEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEEAGPE-YMSPTSVEALKPLKNGGNFE-HRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKL

Query:  KDPFSSMNEKKSGLKSVVSYKLRAGR-AVKGYPSLMRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDE
        KDP  SM+ KKSGLKSV   KLR GR  VK  P L RDLKRDMEMVYVGHLCLSWE+LHWQHRKA ELQQND+R  SR+TRVVNEFQ F IL+QRFIEDE
Subjt:  KDPFSSMNEKKSGLKSVVSYKLRAGR-AVKGYPSLMRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDE

Query:  PFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKLRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKE
         FCGPRI+NY +NRL +RSLLQVPAIR DCV+DKK RGKE ESTISTAALVS+IE+SM+VFR+FLRA+K V ++TIKCA+ ++NAQ MMMEIR+ L+KKE
Subjt:  PFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKLRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKE

Query:  RRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE
        RRLKEI+R GNCI KK KR+  E+EGR+KNELLIAEVELKLVSRVVSMSRLTESQL+WCHKKLHQINFVNRK+  E
Subjt:  RRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE

A0A1S3CB94 uncharacterized protein LOC103498734 | e value: 3.8e-217 | % identity: 66.11
Query:  YSLSKALVNAFLFHNMLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHT
        +S S    N FL HNM PL     YS+ +F++S+F AII  FFRIQV YGTV+SVL   G E E  A  E                    NSSK+Q+E T
Subjt:  YSLSKALVNAFLFHNMLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHT

Query:  AQIYGFVEESETTNCFVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDEL-------P
         QI+GF++ESETT CFV+E    AS SS   QTPD DF+CYS KYLC    E++ G   VKLE+ +++E L K DEHE L SN D PI  ++       P
Subjt:  AQIYGFVEESETTNCFVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDEL-------P

Query:  EVSTPLGDCSFPFSDSDSESPSFDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIA
        EV T + D SF FSDSDSESPSFDEE++EIELEL+PRL VSNNA+V PVNDWSEEE++D L E  ETE+ EKGMEF + ++++EEE ++EF QEHQDLI 
Subjt:  EVSTPLGDCSFPFSDSDSESPSFDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIA

Query:  QLKIELRNSRTGGLPTVQEEEEAGPE-YMSPTSVEALKPLKNGGNFE-HRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLK
        QLKIELRNSRTGGLPTVQEEE+ G    M PT+VE LKPLK   NFE  + F+EIQKVYKTY +KMRKLD+SN QTNYAI L+KLKDP  SM+ KKSGLK
Subjt:  QLKIELRNSRTGGLPTVQEEEEAGPE-YMSPTSVEALKPLKNGGNFE-HRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLK

Query:  SVVSYKLRAGRA-VKGYPSLMRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRL
        SV   KLR  R  VKG P L RDLKRDMEMVYVGHLCLSWE+LHWQHRKA ELQQND+R  S++TRV NEFQ F IL+QRFIEDE FCGPRI+NY +NRL
Subjt:  SVVSYKLRAGRA-VKGYPSLMRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRL

Query:  LVRSLLQVPAIREDCVSDKKLRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVK
         +RSLLQVPAIR DCV+DKK RGKE ESTISTAALVS+IE+SM+VFR+FLRADK V ++TIKCA+V++NAQ MMMEIR  L+KKERRLKEI+R GNCI K
Subjt:  LVRSLLQVPAIREDCVSDKKLRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVK

Query:  KLKRVSEEEEGRLKNELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE
        K KR+  EEEGR+KNELLIAEVELKLVSRVVSMSRLTESQL+WCHKKLHQINFVNRK+  E
Subjt:  KLKRVSEEEEGRLKNELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE

A0A5A7UZY3 40S ribosomal protein S3-2-like | e value: 0.0e+00 | % identity: 70.46
Query:  QVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNCFVEELYCDASSSSSGNQTPDDDFDCYSGKY
        +V YGTV+SVL   G E E     +A+ D+               NSSK+Q+E T QI+GF++ESETT CFV+E    AS SS   QTPD DF+CYS KY
Subjt:  QVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNCFVEELYCDASSSSSGNQTPDDDFDCYSGKY

Query:  LCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDEL-------PEVSTPLGDCSFPFSDSDSESPSFDEEFIEIELELEPRLQVSNNAQ
        LC    E++ G   VKLE+ +++E L K DEHE L SN D PI  ++       PEV T + D SF FSDSDSESPSFDEE++EIELEL+PRL VSNNA+
Subjt:  LCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDEL-------PEVSTPLGDCSFPFSDSDSESPSFDEEFIEIELELEPRLQVSNNAQ

Query:  VCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEEAGPE-YMSPTSVEALKPLKNGGN
        V PVNDWSEEE++D L E  ETE+DEKGMEF + ++++EEE ++EF QEHQDLI QLKIELRNSRTGGLPTVQEEE+ G    M PT+VE LKPLK   N
Subjt:  VCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEEAGPE-YMSPTSVEALKPLKNGGN

Query:  FE-HRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRA-VKGYPSLMRDLKRDMEMVYVGHLCLSWEVLHW
        FE  + F+EIQKVYKTYA+KMRKLD+SN QTNYAI L+KLKDP SSM+ KKSGLKSV   KLR  R  VKG P L RDLKRDMEMVYVGHLCLSWE+LHW
Subjt:  FE-HRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRA-VKGYPSLMRDLKRDMEMVYVGHLCLSWEVLHW

Query:  QHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKLRGKEGESTISTAALVSMIEESMRV
        QHRKA ELQQND+R  S++TRV NEFQ F IL+QRFIEDE FCGPRI+NY +NRL +RSLLQVPAIR DCV+DKK RGKE ESTISTAALVS+IE+SM+V
Subjt:  QHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKLRGKEGESTISTAALVSMIEESMRV

Query:  FRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAEVELKLVSRVVSMSRLTESQLVWCH
        FR+FLRADK V ++TIKCA+V++NAQ MMMEIR  L+KKERRLKEI+R GNCI KK KR+  EEEGR+KNELLIAEVELKLVSRV               
Subjt:  FRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAEVELKLVSRVVSMSRLTESQLVWCH

Query:  KKLHQINFVNRKIFSERLRCCLSNFSLKMATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSV
                                              FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSV
Subjt:  KKLHQINFVNRKIFSERLRCCLSNFSLKMATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSV

Query:  VQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAV
        VQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAV
Subjt:  VQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAV

Query:  RHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVVV
        RHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVV IHSPKEEEEIVHRP      AVLT DIEVPV V
Subjt:  RHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVVV

A0A6J1G4S1 uncharacterized protein LOC111450741 | e value: 0.0e+00 | % identity: 99.53
Query:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
        MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
Subjt:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC

Query:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
        FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
Subjt:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS

Query:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEE
        FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEE
Subjt:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEE

Query:  AGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSLMRDLK
        AGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSLMRDLK
Subjt:  AGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSLMRDLK

Query:  RDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKLRGKE
        RDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKLRGKE
Subjt:  RDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKLRGKE

Query:  GESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAEVELK
        GESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAEVELK
Subjt:  GESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAEVELK

Query:  LVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE
        LVSRVVSMSRLTESQLVWCHKKLHQINFVNRK+  E
Subjt:  LVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE

A0A6J1KAQ8 uncharacterized protein LOC111493771 | e value: 0.0e+00 | % identity: 94.53
Query:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC
        MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDD ELLGEEAEI+SVVASNSSKFQVEHTAQI+GFVEESETTNC
Subjt:  MLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVESVLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNC

Query:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
        FVEELYCDAS SSSGNQTPDDDF+CYSGKYLCEFS ETEEGFDDVKLELFSTDEAL KEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS
Subjt:  FVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLELFSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPS

Query:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEF----EEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQ
        FDEEFIEIELELEPRL VSNNAQVCPVNDWSEEESKDCL E +ETERDEKGMEF    EE+EEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQ
Subjt:  FDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEF----EEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQ

Query:  EEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSLM
        EEEEAGPEYMSPTSVE LKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAID IKLKDPFSSM+EKKSGLKSVVS+KLRA RAVKGYP+LM
Subjt:  EEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGRAVKGYPSLM

Query:  RDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKL
        RDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCIL+QRF+EDE FCGPRINNYVKNRLLVRSLLQVPAIREDCV+DKKL
Subjt:  RDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKL

Query:  RGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAE
        RGKEGESTISTAALVSMIEESM VFRDFLRADKDV ST IKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGG+CIVKKLKRVSE EEGRLKNELLIAE
Subjt:  RGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKNELLIAE

Query:  VELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE
        VELKLVSRVVSM RLTESQLVWCHKKLHQINFVNRK+  E
Subjt:  VELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE

SwissProt top hits (e value, % identity, alignment)
P02350 40S ribosomal protein S3-A | e value: 5.4e-96 | % identity: 84.72
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MA Q+SKKRKFVADG+F AELNE LTRELAEDGYSGVEVRVTP RTEIII ATRTQNVLGEKGRRIRELT+VVQKRF FPE SVELYAEKV  RGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRF+MESGAKGCEV+VSGKLR QRAKSMKF DG MI SG PV  Y+D+AVRHVLLRQGVLGIKVKIML WDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVVIHSPKEE
          PLPD V I  PK+E
Subjt:  TTPLPDVVVIHSPKEE

P47835 40S ribosomal protein S3-B | e value: 5.4e-96 | % identity: 84.72
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MA QMSKKRKFVADG+F AELNE LTRELAEDGYSGVEVRVTP +TEIII ATRTQNVLGEKGRRIRELT+VVQKRF FPE SVELYAEKV  RGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRF+MESGAKGCEV+VSGKLR QRAKSMKF DG MI SG PV  Y+D+AVRHVLLRQGVLGIKVKIML WDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVVIHSPKEE
          PLPD V I  PK+E
Subjt:  TTPLPDVVVIHSPKEE

Q9FJA6 40S ribosomal protein S3-3 | e value: 5.0e-110 | % identity: 85.06
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP++SVELYAEKV NRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLG+KVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVVIHSPKEEE------EIVHRPAAAAAAAVLTTD
         TPLPDVV+IH+PKE++      ++V + A    A + TTD
Subjt:  TTPLPDVVVIHSPKEEE------EIVHRPAAAAAAAVLTTD

Q9M339 40S ribosomal protein S3-2 | e value: 2.0e-111 | % identity: 87.14
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        M TQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YIDSAVRHVLLRQGVLGIKVK+MLDWDPKG  GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVV
         TPLPDVV+IHSPKEEE I + PA  AA A L  D  +  V
Subjt:  TTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVV

Q9SIP7 40S ribosomal protein S3-1 | e value: 5.6e-109 | % identity: 85.06
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLGIKVKIMLDWDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVV
         TPLPDVV+IH+PK ++ +   PA AAA   L  +  +  V
Subjt:  TTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVV

Arabidopsis top hits (e value, % identity, alignment)
AT1G69610.1 Protein of unknown function (DUF1666) | e value: 1.1e-70 | % identity: 42.38
Query:  LELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEE------EEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEEAGP
        +EL P LQ+SN A       + EEE ++ +        +E  M F+E EE  +E       +DDEF  EH D+I +LK ELR +RTGGL T+ EE E   
Subjt:  LELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEE------EEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEEAGP

Query:  EYMSPTSVEALKPLK-NGGNFEHR-TFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGR--AVKGYPS--LMR
             T ++ LKPLK      +H+    EI KVYK YA KMRKLDV ++QT ++I L+KLKD            KS +   +   +   ++  PS  L++
Subjt:  EYMSPTSVEALKPLK-NGGNFEHR-TFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMNEKKSGLKSVVSYKLRAGR--AVKGYPS--LMR

Query:  DLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPF-CGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKL
        +  RD E VYVG +CLSWE+L WQ+ K +E     T  T +Y  V  EFQ F +L+QRF+E+EPF    R+  Y+KNR   ++ LQ+P +R+D  S KK 
Subjt:  DLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPF-CGPRINNYVKNRLLVRSLLQVPAIREDCVSDKKL

Query:  RGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCA-EVEVNAQ-----AMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKN
        R  EGE  + T  L  +I ESM VF +FL ADKD  ++ +K + + +V+ Q      ++ +IRT L+KKE++LKEI R  +CIVKKLK+   +    +K+
Subjt:  RGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCA-EVEVNAQ-----AMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLKN

Query:  ELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE
        ELLIA++EL+LVSRV+ MS+LT  +L WC +KL +I+F  RKI  E
Subjt:  ELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSE

AT2G31610.1 Ribosomal protein S3 family protein | e value: 4.0e-110 | % identity: 85.06
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLGIKVKIMLDWDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVV
         TPLPDVV+IH+PK ++ +   PA AAA   L  +  +  V
Subjt:  TTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVV

AT3G53870.1 Ribosomal protein S3 family protein | e value: 1.5e-112 | % identity: 87.14
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        M TQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YIDSAVRHVLLRQGVLGIKVK+MLDWDPKG  GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVV
         TPLPDVV+IHSPKEEE I + PA  AA A L  D  +  V
Subjt:  TTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVV

AT5G35530.1 Ribosomal protein S3 family protein | e-value: 3.6e-111 | %identity: 85.06
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP++SVELYAEKV NRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLG+KVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  TTPLPDVVVIHSPKEEE------EIVHRPAAAAAAAVLTTD
         TPLPDVV+IH+PKE++      ++V + A    A + TTD
Subjt:  TTPLPDVVVIHSPKEEE------EIVHRPAAAAAAAVLTTD

AT5G39785.1 Protein of unknown function (DUF1666) | e-value: 9.2e-67 | %identity: 35.26
Query:  DEEFIEIELELEPRLQV-SNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRT-GGLPTVQEEE
        +E+F+E + +     Q  ++N +   ++D   + ++  L +    + D  G   + +EEEEE+    E   EHQDLI QLK+E++  +  GGL T+ EEE
Subjt:  DEEFIEIELELEPRLQV-SNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEEEEEEEEDDEFSQEHQDLIAQLKIELRNSRT-GGLPTVQEEE

Query:  EAGPEYMSPTSVEALKP--LKNGGNFEH-RTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSS-----MNEKKSGLKSVVSYKLRAGRAVKG
        E   +   P  +E LKP  ++    F+H  T  E+ K +++Y ++MRKLD+ + Q +YA+ L++ K P  +      N  ++   SV S  +R  +A K 
Subjt:  EAGPEYMSPTSVEALKP--LKNGGNFEH-RTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSS-----MNEKKSGLKSVVSYKLRAGRAVKG

Query:  ----YPSLMRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIR
                +++++ ++E VYVG +CLSWE+LHWQ+ KAIEL ++D  G+ RY  V  EFQ F +L+QRF+E+EPF  PR+ +Y+K R ++R+LLQ+P IR
Subjt:  ----YPSLMRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRSLLQVPAIR

Query:  EDCVSDKKLRGK-----EGESTISTAALVSMIEESMRVFRDFLRADK------DVGSTTIKCAEVEVNAQA----MMMEIRTELRKKERRLKEIVRGGNC
        ED   DKK   +       +  I +  LV ++EE++R+F  F+R DK      D  S T    E +    +    M  E++++L+ KE+RL+++++   C
Subjt:  EDCVSDKKLRGK-----EGESTISTAALVSMIEESMRVFRDFLRADK------DVGSTTIKCAEVEVNAQA----MMMEIRTELRKKERRLKEIVRGGNC

Query:  IVKKLKRVSEEEEGRLKNELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSERLRC
        I+++ ++  EE+    +     ++V++KLV+RV++MS+LT   LVWCH KL +INFVNR++  +   C
Subjt:  IVKKLKRVSEEEEGRLKNELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSERLRC
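
The %identity values listed for each hit can be related to the alignment blocks above: in BLAST-style output the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. The following is a minimal sketch of that calculation for a single alignment block; the function name `percent_identity` is hypothetical, and the values on this page are computed over each full alignment, not over one block.

```python
def percent_identity(query_line: str, midline: str) -> float:
    """Percent identity for one alignment block.

    query_line: the aligned query residues (gaps as '-').
    midline:    the match line between query and subject; a letter marks
                an identical position, '+' a conservative substitution,
                and a space a mismatch or gap.
    """
    length = len(query_line)
    if length == 0:
        raise ValueError("empty alignment block")
    # Trailing spaces of the midline are often lost in text extraction,
    # so only the first `length` columns are inspected.
    identities = sum(ch.isalpha() for ch in midline[:length])
    return 100.0 * identities / length


# First eight columns of the AT2G31610.1 alignment shown above.
print(percent_identity("MATQMSKK", "MATQ+SKK"))
```

Summing identities and aligned columns across all blocks of a hit, rather than averaging the per-block percentages, is what would correspond to the single figure reported per hit.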


Sequences
CDS sequence
ATGGGCGTTGGGGGCTTGGCAATGCAGTCTCAAACAGATCAGCTTCTGTGGATGCAGCAGATAAAGACTAAAGAGTTCTGTTGCTTGACAAAGACAAGGGTGATTTCTGT
TACTGTAAATGTTTACAAATCATATTTCAAAAGAGAATTGTCGGCTTTTGCTCGCTTTTATTCCCTCAGTAAGGCGTTGGTTAATGCGTTTTTGTTCCATAATATGCTTC
CTCTCGCGGGTTCTTTCAGATATTCCCTCTCCAGTTTTGTCGTCTCTGTGTTTGTAGCAATAATCGGACTCTTTTTCAGGATTCAGGTCGATTATGGAACTGTTGAGTCT
GTTCTTGCACTTGATGGCGACGAACGGGAAACAGTCGCGTCAATCGAAGCACAGGATGATCACGAACTCTTAGGGGAAGAAGCTGAGATCATTTCTGTTGTAGCTTCTAA
TTCGAGTAAATTTCAGGTAGAACACACGGCACAAATCTATGGATTCGTGGAAGAATCGGAAACTACCAACTGTTTTGTCGAAGAATTGTACTGTGATGCTTCGTCTTCTT
CTTCAGGTAATCAAACTCCTGATGATGATTTTGATTGTTATAGCGGAAAATATTTATGTGAATTCAGTTTCGAAACAGAAGAAGGTTTTGATGATGTGAAACTAGAATTA
TTTAGCACCGATGAGGCCTTGGGAAAGGAAGATGAACACGAGGATTTAAAATCCAATGAAGATGGCCCAATTTTTGATGAATTGCCCGAAGTTTCAACTCCGTTGGGAGA
TTGTTCCTTTCCTTTTTCAGATTCAGACTCTGAATCTCCCAGTTTTGATGAAGAGTTCATAGAAATAGAATTAGAATTAGAACCCCGTTTACAGGTTTCAAATAATGCCC
AAGTTTGTCCTGTAAATGATTGGAGCGAGGAGGAGAGTAAAGATTGTTTGGGGGAACCAATGGAAACAGAAAGGGACGAGAAGGGGATGGAATTTGAGGAGGATGAAGAA
GAAGAAGAAGAAGAAGAAGACGATGAATTCTCGCAAGAACACCAAGATTTGATAGCCCAACTCAAAATAGAGCTAAGAAACTCGAGAACAGGAGGACTTCCAACCGTACA
AGAAGAAGAAGAGGCAGGCCCTGAATATATGAGTCCTACATCAGTTGAAGCTCTTAAACCTCTAAAAAATGGTGGAAATTTCGAACACAGAACATTCAAAGAGATCCAAA
AGGTCTACAAAACTTATGCTCAAAAAATGCGAAAGCTAGACGTCTCTAATACCCAGACAAATTATGCAATTGATTTAATCAAGTTGAAAGATCCATTTAGTTCAATGAAT
GAAAAGAAATCTGGCCTTAAATCTGTTGTGTCCTACAAGTTAAGGGCAGGGAGAGCTGTTAAGGGCTATCCAAGTTTGATGAGAGACTTAAAGAGGGACATGGAAATGGT
GTATGTTGGACATCTTTGCCTCTCTTGGGAAGTTCTGCATTGGCAGCATAGGAAGGCCATTGAGCTCCAACAAAACGACACTCGAGGAACCTCTCGGTACACTCGTGTCG
TTAATGAATTTCAATTCTTCTGCATCCTCGTTCAAAGATTCATCGAAGACGAGCCCTTTTGCGGCCCTCGAATCAATAACTACGTGAAAAACAGACTTCTCGTTCGTAGT
CTCCTTCAAGTTCCAGCCATTAGAGAGGATTGTGTAAGTGACAAAAAGTTGAGAGGCAAAGAAGGTGAAAGTACCATCTCAACTGCAGCTCTGGTATCAATGATCGAAGA
ATCGATGCGGGTTTTTCGGGATTTTTTACGTGCTGATAAGGACGTCGGGAGTACGACGATCAAGTGTGCTGAAGTAGAAGTTAATGCACAAGCTATGATGATGGAAATAA
GAACTGAGCTGCGAAAGAAGGAGAGAAGGCTAAAGGAGATAGTGAGAGGTGGGAATTGTATAGTAAAGAAGTTGAAAAGGGTTAGTGAAGAAGAAGAAGGGAGATTGAAG
AATGAATTGTTGATAGCGGAGGTTGAGTTGAAATTGGTATCAAGAGTAGTGAGTATGTCCAGACTAACAGAGAGCCAATTGGTTTGGTGCCATAAAAAGCTACACCAAAT
CAATTTTGTTAATAGGAAGATCTTTTCTGAGCGGCTTCGTTGCTGTCTTTCCAATTTCTCTTTGAAAATGGCGACCCAGATGAGTAAGAAGAGAAAGTTTGTGGCCGATG
GAGTGTTTTTCGCCGAGCTAAACGAGGTTTTGACCAGGGAGTTGGCCGAGGATGGGTACTCCGGTGTCGAGGTTAGAGTCACTCCAATGCGCACTGAGATCATCATCCGA
GCCACCCGCACCCAAAACGTTCTCGGAGAAAAAGGTAGGAGGATTAGGGAACTCACCTCGGTTGTGCAGAAGCGGTTCAAGTTTCCGGAGAACAGTGTTGAGCTCTATGC
CGAGAAAGTTAACAATAGAGGGCTTTGTGCTATTGCCCAAGCCGAGTCTCTTCGTTATAAGCTTCTTGGTGGCCTTGCTGTGCGGAGGGCATGCTATGGTGTTCTGAGAT
TCGTCATGGAAAGTGGAGCAAAGGGTTGCGAGGTTATTGTCAGTGGAAAACTGAGAGCACAGCGTGCAAAGTCTATGAAGTTTAAAGATGGATATATGATATCTTCGGGT
CAACCAGTGAGAGATTATATTGACTCAGCTGTTAGGCACGTTCTTCTCAGACAGGGAGTTCTTGGCATCAAAGTTAAGATCATGCTTGATTGGGATCCTAAGGGCAAGCA
AGGGCCTACCACCCCTCTTCCTGATGTTGTCGTAATCCATTCTCCCAAGGAGGAAGAAGAAATCGTCCATAGACCAGCAGCAGCAGCAGCAGCAGCAGTTTTGACCACTG
ATATTGAGGTTCCAGTAGTAGTAGCAGCTTGA
mRNA sequence
ATGGGCGTTGGGGGCTTGGCAATGCAGTCTCAAACAGATCAGCTTCTGTGGATGCAGCAGATAAAGACTAAAGAGTTCTGTTGCTTGACAAAGACAAGGGTGATTTCTGT
TACTGTAAATGTTTACAAATCATATTTCAAAAGAGAATTGTCGGCTTTTGCTCGCTTTTATTCCCTCAGTAAGGCGTTGGTTAATGCGTTTTTGTTCCATAATATGCTTC
CTCTCGCGGGTTCTTTCAGATATTCCCTCTCCAGTTTTGTCGTCTCTGTGTTTGTAGCAATAATCGGACTCTTTTTCAGGATTCAGGTCGATTATGGAACTGTTGAGTCT
GTTCTTGCACTTGATGGCGACGAACGGGAAACAGTCGCGTCAATCGAAGCACAGGATGATCACGAACTCTTAGGGGAAGAAGCTGAGATCATTTCTGTTGTAGCTTCTAA
TTCGAGTAAATTTCAGGTAGAACACACGGCACAAATCTATGGATTCGTGGAAGAATCGGAAACTACCAACTGTTTTGTCGAAGAATTGTACTGTGATGCTTCGTCTTCTT
CTTCAGGTAATCAAACTCCTGATGATGATTTTGATTGTTATAGCGGAAAATATTTATGTGAATTCAGTTTCGAAACAGAAGAAGGTTTTGATGATGTGAAACTAGAATTA
TTTAGCACCGATGAGGCCTTGGGAAAGGAAGATGAACACGAGGATTTAAAATCCAATGAAGATGGCCCAATTTTTGATGAATTGCCCGAAGTTTCAACTCCGTTGGGAGA
TTGTTCCTTTCCTTTTTCAGATTCAGACTCTGAATCTCCCAGTTTTGATGAAGAGTTCATAGAAATAGAATTAGAATTAGAACCCCGTTTACAGGTTTCAAATAATGCCC
AAGTTTGTCCTGTAAATGATTGGAGCGAGGAGGAGAGTAAAGATTGTTTGGGGGAACCAATGGAAACAGAAAGGGACGAGAAGGGGATGGAATTTGAGGAGGATGAAGAA
GAAGAAGAAGAAGAAGAAGACGATGAATTCTCGCAAGAACACCAAGATTTGATAGCCCAACTCAAAATAGAGCTAAGAAACTCGAGAACAGGAGGACTTCCAACCGTACA
AGAAGAAGAAGAGGCAGGCCCTGAATATATGAGTCCTACATCAGTTGAAGCTCTTAAACCTCTAAAAAATGGTGGAAATTTCGAACACAGAACATTCAAAGAGATCCAAA
AGGTCTACAAAACTTATGCTCAAAAAATGCGAAAGCTAGACGTCTCTAATACCCAGACAAATTATGCAATTGATTTAATCAAGTTGAAAGATCCATTTAGTTCAATGAAT
GAAAAGAAATCTGGCCTTAAATCTGTTGTGTCCTACAAGTTAAGGGCAGGGAGAGCTGTTAAGGGCTATCCAAGTTTGATGAGAGACTTAAAGAGGGACATGGAAATGGT
GTATGTTGGACATCTTTGCCTCTCTTGGGAAGTTCTGCATTGGCAGCATAGGAAGGCCATTGAGCTCCAACAAAACGACACTCGAGGAACCTCTCGGTACACTCGTGTCG
TTAATGAATTTCAATTCTTCTGCATCCTCGTTCAAAGATTCATCGAAGACGAGCCCTTTTGCGGCCCTCGAATCAATAACTACGTGAAAAACAGACTTCTCGTTCGTAGT
CTCCTTCAAGTTCCAGCCATTAGAGAGGATTGTGTAAGTGACAAAAAGTTGAGAGGCAAAGAAGGTGAAAGTACCATCTCAACTGCAGCTCTGGTATCAATGATCGAAGA
ATCGATGCGGGTTTTTCGGGATTTTTTACGTGCTGATAAGGACGTCGGGAGTACGACGATCAAGTGTGCTGAAGTAGAAGTTAATGCACAAGCTATGATGATGGAAATAA
GAACTGAGCTGCGAAAGAAGGAGAGAAGGCTAAAGGAGATAGTGAGAGGTGGGAATTGTATAGTAAAGAAGTTGAAAAGGGTTAGTGAAGAAGAAGAAGGGAGATTGAAG
AATGAATTGTTGATAGCGGAGGTTGAGTTGAAATTGGTATCAAGAGTAGTGAGTATGTCCAGACTAACAGAGAGCCAATTGGTTTGGTGCCATAAAAAGCTACACCAAAT
CAATTTTGTTAATAGGAAGATCTTTTCTGAGCGGCTTCGTTGCTGTCTTTCCAATTTCTCTTTGAAAATGGCGACCCAGATGAGTAAGAAGAGAAAGTTTGTGGCCGATG
GAGTGTTTTTCGCCGAGCTAAACGAGGTTTTGACCAGGGAGTTGGCCGAGGATGGGTACTCCGGTGTCGAGGTTAGAGTCACTCCAATGCGCACTGAGATCATCATCCGA
GCCACCCGCACCCAAAACGTTCTCGGAGAAAAAGGTAGGAGGATTAGGGAACTCACCTCGGTTGTGCAGAAGCGGTTCAAGTTTCCGGAGAACAGTGTTGAGCTCTATGC
CGAGAAAGTTAACAATAGAGGGCTTTGTGCTATTGCCCAAGCCGAGTCTCTTCGTTATAAGCTTCTTGGTGGCCTTGCTGTGCGGAGGGCATGCTATGGTGTTCTGAGAT
TCGTCATGGAAAGTGGAGCAAAGGGTTGCGAGGTTATTGTCAGTGGAAAACTGAGAGCACAGCGTGCAAAGTCTATGAAGTTTAAAGATGGATATATGATATCTTCGGGT
CAACCAGTGAGAGATTATATTGACTCAGCTGTTAGGCACGTTCTTCTCAGACAGGGAGTTCTTGGCATCAAAGTTAAGATCATGCTTGATTGGGATCCTAAGGGCAAGCA
AGGGCCTACCACCCCTCTTCCTGATGTTGTCGTAATCCATTCTCCCAAGGAGGAAGAAGAAATCGTCCATAGACCAGCAGCAGCAGCAGCAGCAGCAGTTTTGACCACTG
ATATTGAGGTTCCAGTAGTAGTAGCAGCTTGAGAGGCTCTTTTTTTTTCTAGATATTTATTCATGGAATCAACCATCATCAATTTTGTATTTTCTGTCCCAAGATTTCGT
TTCATGTCATTGATCTGTTGGTTTGAGCCTCCTGACTGCCCAACTTTCTTTTGGATAATCAAACAAAATGCATATGGAGAACAGTTTTTTATGATCCCTTAAGCCTTTTT
ATATATTATTAGCTTTTGCTTCATAATCATA
Protein sequence
MGVGGLAMQSQTDQLLWMQQIKTKEFCCLTKTRVISVTVNVYKSYFKRELSAFARFYSLSKALVNAFLFHNMLPLAGSFRYSLSSFVVSVFVAIIGLFFRIQVDYGTVES
VLALDGDERETVASIEAQDDHELLGEEAEIISVVASNSSKFQVEHTAQIYGFVEESETTNCFVEELYCDASSSSSGNQTPDDDFDCYSGKYLCEFSFETEEGFDDVKLEL
FSTDEALGKEDEHEDLKSNEDGPIFDELPEVSTPLGDCSFPFSDSDSESPSFDEEFIEIELELEPRLQVSNNAQVCPVNDWSEEESKDCLGEPMETERDEKGMEFEEDEE
EEEEEEDDEFSQEHQDLIAQLKIELRNSRTGGLPTVQEEEEAGPEYMSPTSVEALKPLKNGGNFEHRTFKEIQKVYKTYAQKMRKLDVSNTQTNYAIDLIKLKDPFSSMN
EKKSGLKSVVSYKLRAGRAVKGYPSLMRDLKRDMEMVYVGHLCLSWEVLHWQHRKAIELQQNDTRGTSRYTRVVNEFQFFCILVQRFIEDEPFCGPRINNYVKNRLLVRS
LLQVPAIREDCVSDKKLRGKEGESTISTAALVSMIEESMRVFRDFLRADKDVGSTTIKCAEVEVNAQAMMMEIRTELRKKERRLKEIVRGGNCIVKKLKRVSEEEEGRLK
NELLIAEVELKLVSRVVSMSRLTESQLVWCHKKLHQINFVNRKIFSERLRCCLSNFSLKMATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIR
ATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSG
QPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVVIHSPKEEEEIVHRPAAAAAAAVLTTDIEVPVVVAA
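
The protein sequence above is the conceptual translation of the CDS. As a minimal sketch of how the two records can be cross-checked, the snippet below translates a nucleotide string with the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers, and the sketch assumes the CDS is in frame from its first base and contains only A, C, G and T.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the CDS listed above.
print(translate("ATGGGCGTTGGGGGCTTG"))
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein record is a quick consistency check on the two sequences.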