; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10006807 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10006807
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description40S ribosomal protein S3-2-like
Genome locationChr07:22181246..22188069
RNA-Seq ExpressionHG10006807
SyntenyHG10006807
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0022627 - cytosolic small ribosomal subunit (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domainsIPR001351 - Ribosomal protein S3, C-terminal
IPR004044 - K Homology domain, type 2
IPR005703 - Ribosomal protein S3, eukaryotic/archaeal
IPR009019 - K homology domain superfamily, prokaryotic type
IPR012870 - Protein of unknown function DUF1666
IPR015946 - K homology domain-like, alpha/beta
IPR018280 - Ribosomal protein S3, conserved site
IPR036419 - Ribosomal protein S3, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061472.1 40S ribosomal protein S3-3 [Cucumis melo var. makuwa]0.0e+0079.6Show/hide
Query:  YGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTAS----GNQTPDDDFECYSGKYLGESDDGVKLEIFSNEEA
        YGTVDSVL  +G E+E     QAKADYNSSKYQLE TTQIHGF++ESETT CFV+E   TAS      QTPD DFECYS KYL ESD GVKLEI ++EE 
Subjt:  YGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTAS----GNQTPDDDFECYSGKYLGESDDGVKLEIFSNEEA

Query:  LEMEDECKGFEFNNDGLIHEEAEKFGL---EIKNL-EDCSFLF--SDSESPSFDEEYIEIELE----------TLVLPVNDWSEEENQDCLGELTETERD
        LE  DE +G   N D  I  + EKFGL   E++ L ED SFLF  SDSESPSFDEEY+EIELE            VLPVNDWSEEENQD L ELTETE+D
Subjt:  LEMEDECKGFEFNNDGLIHEEAEKFGL---EIKNL-EDCSFLF--SDSESPSFDEEYIEIELE----------TLVLPVNDWSEEENQDCLGELTETERD

Query:  EKGMEF--------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCPTSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLD
        EKGMEF        EE EEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   EA  MCPT+VETLKPLK   NFELK+HFREIQKVYKTYAEKMRKLD
Subjt:  EKGMEF--------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCPTSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLD

Query:  ISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNE
        ISN QTNYAIGLVKLKDP + M+ K    KS+F LKLR  R DVK C  LTRDLKRDMEMVY+GHLCLSWE+LHWQHR+A ELQQNDSR  S+FTRV NE
Subjt:  ISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNE

Query:  FQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNA
        FQLFS+L+QRFIEDEQF GPRIDNYA+ RLF+RSLLQVPAIR DCVNDKK RG+E ESTISTAALVSIIEDSM+VFREFLRADK V+NSTIKCAQVQLNA
Subjt:  FQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNA

Query:  QLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINFFVADGVFFAELNEVLTRE
        Q MMMEIR GL+KKERRLKEIMRSGNCI KKF++I EEEGR+K ELLIAEVELKL+SRV                       FVADGVFFAELNEVLTRE
Subjt:  QLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINFFVADGVFFAELNEVLTRE

Query:  LAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVME
        LAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVME
Subjt:  LAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVME

Query:  SGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRPAAVLT
        SGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRP AVLT
Subjt:  SGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRPAAVLT

Query:  TDIEVPVAV
         DIEVPVAV
Subjt:  TDIEVPVAV

KAG6599520.1 40S ribosomal protein S3-1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0072.32Show/hide
Query:  MLPLI----YSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADY---------------NSSKYQLEVTTQIHGFMEESETTKC
        MLPL     YSLS+ +VS+F  II  FFRIQ DYGTV+SVLA++GDE ET+ SI+A+ D+               NSSK+Q+E T QI+GF+EESETT C
Subjt:  MLPLI----YSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADY---------------NSSKYQLEVTTQIHGFMEESETTKC

Query:  FVEELYCTA----SGNQTPDDDFECYSGKYL-------GESDDGVKLEIFSNEEALEMEDECKGFEFNNDGLIHEEAEKFGLEIKNLEDCSFLF--SDSE
        FVEELYC A    SGNQTPDDDF+CYSGKYL        E  D V+LE+FS +EAL  EDE +  + N DG I +E     L    L DCS  F  SDSE
Subjt:  FVEELYCTA----SGNQTPDDDFECYSGKYL-------GESDDGVKLEIFSNEEALEMEDECKGFEFNNDGLIHEEAEKFGLEIKNLEDCSFLF--SDSE

Query:  SPSFDEEYIEIELE----------TLVLPVNDWSEEENQDCLGELTETERDEKGMEFEEEEE-------------EFLQEHQDLINQLKIELRNSRTGGL
        SPSFDEE+IEIELE            V PVNDWSEEE++DCLGE  ETERDEKGMEFEE+EE             EF QEHQDLI QLKIELRNSRTGGL
Subjt:  SPSFDEEYIEIELE----------TLVLPVNDWSEEENQDCLGELTETERDEKGMEFEEEEE-------------EFLQEHQDLINQLKIELRNSRTGGL

Query:  PTVQEEEEA--ECMCPTSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVK
        PTVQEEEEA  E M PTSVE LKPLKNGGNFE +  F+EIQKVYKTYA+KMRKLD+SNTQTNYA+ L+KLKDP + MN K    KS+ S KLRAGRA VK
Subjt:  PTVQEEEEA--ECMCPTSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVK

Query:  DCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDC
           +L RDLKRDMEMVY+GHLCLSWEVLHWQHR+AIELQQND+RGTSR+TRVVNEFQ F +LVQRFIEDE F GPRI+NY K RL VRSLLQVPAIREDC
Subjt:  DCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDC

Query:  VNDKKPRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIA-EEEGRLKM
        V+DKK RG+EGESTISTAALVS+IE+SMRVFR+FLRADKDV ++TIKCA+V++NAQ MMMEIRT LRKKERRLKEI+R GNCIVKK ++++ EEEGRLK 
Subjt:  VNDKKPRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIA-EEEGRLKM

Query:  ELLIAEVELKLISRV-------------------VNMSKLTEGQLIWC--------HKKLHQINFFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMR
        ELLIAEVELKL+SRV                   V ++++   +L  C          ++ +   FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMR
Subjt:  ELLIAEVELKLISRV-------------------VNMSKLTEGQLIWC--------HKKLHQINFFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMR

Query:  TEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQ
        TEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQ
Subjt:  TEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQ

Query:  RAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRP---AAVLTTDIEVPVAVA
        RAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVV IHSPKEEEEIVHRP   AAVLTTDIEVPV VA
Subjt:  RAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRP---AAVLTTDIEVPVAVA

XP_008459697.1 PREDICTED: uncharacterized protein LOC103498734 [Cucumis melo]3.2e-23274.3Show/hide
Query:  MLPLIYSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTAS----GNQTP
        M PLIYS+ N L+SLF  II +FFRIQ  YGTVDSVL  +G E+E     QAKA+YNSSKYQLE TTQIHGF++ESETT CFV+E   TAS      QTP
Subjt:  MLPLIYSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTAS----GNQTP

Query:  DDDFECYSGKYLGESDDGVKLEIFSNEEALEMEDECKGFEFNNDGLIHEEAEKFGL---EIKNL-EDCSFLF--SDSESPSFDEEYIEIELE--------
        D DFECYS KYL ESD GVKLEI ++EE LE  DE +G   N D  I  + EKFGL   E++ L ED SFLF  SDSESPSFDEEY+EIELE        
Subjt:  DDDFECYSGKYLGESDDGVKLEIFSNEEALEMEDECKGFEFNNDGLIHEEAEKFGL---EIKNL-EDCSFLF--SDSESPSFDEEYIEIELE--------

Query:  --TLVLPVNDWSEEENQDCLGELTETERDEKGMEF--------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCPTSVETLKPLK
            VLPVNDWSEEENQD L ELTETE+ EKGMEF        EE EEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   EA  MCPT+VETLKPLK
Subjt:  --TLVLPVNDWSEEENQDCLGELTETERDEKGMEF--------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCPTSVETLKPLK

Query:  NGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWE
           NFELK+HFREIQKVYKTY EKMRKLDISN QTNYAIGLVKLKDP   M+ K    KS+F LKLR  R DVK C  LTRDLKRDMEMVY+GHLCLSWE
Subjt:  NGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWE

Query:  VLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTISTAALVSIIED
        +LHWQHR+A ELQQNDSR  S+FTRV NEFQLFS+L+QRFIEDEQF GPRIDNYA+ RLF+RSLLQVPAIR DCVNDKK RG+E ESTISTAALVSIIED
Subjt:  VLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTISTAALVSIIED

Query:  SMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVNMSKLTEGQLI
        SM+VFREFLRADK V+NSTIKCAQVQLNAQ MMMEIR GL+KKERRLKEIMRSGNCI KKF++I EEEGR+K ELLIAEVELKL+SRVV+MS+LTE QLI
Subjt:  SMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVNMSKLTEGQLI

Query:  WCHKKLHQINF
        WCHKKLHQINF
Subjt:  WCHKKLHQINF

XP_011648587.1 uncharacterized protein LOC101214479 isoform X1 [Cucumis sativus]4.0e-23574.4Show/hide
Query:  MLPLIYSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTA----SGNQTP
        MLPLIYS+ N L+SLF  II  FFRIQ  YGTVDSVL I+G E E     QA ADYNSSKYQLE TTQIHGF+++SETT CFV+E   TA    SGNQTP
Subjt:  MLPLIYSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTA----SGNQTP

Query:  DDDFECYSGKYLGESDDGVKLEIFSNEEALEMEDECKGFE---------FNNDGLIHEEAEKFG---LEIKNL-EDCSFLFSDS--ESPSFDEEYIEIEL
        D DFECYS KYL ESDDGVKLEIF+ EE LE  DE +G E          N D  I  E EK G    E++ L ED SFLFSDS  ESP FDEEYIEIEL
Subjt:  DDDFECYSGKYLGESDDGVKLEIFSNEEALEMEDECKGFE---------FNNDGLIHEEAEKFG---LEIKNL-EDCSFLFSDS--ESPSFDEEYIEIEL

Query:  E----------TLVLPVNDWSEEENQDCLGELTETERDEKGMEF---------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCP
        E            +LPVNDWSEEE+QDCL ELTETE+DEKGMEF         EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   EA  MCP
Subjt:  E----------TLVLPVNDWSEEENQDCLGELTETERDEKGMEF---------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCP

Query:  TSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMV
        TSVETLKPLK   NFELK+HFREIQKVYKTYAEKMRKLDISN QTNYAIGLVKLKDP   M+ K    KS+F LKLR GR  VKDC  LTRDLKRDMEMV
Subjt:  TSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMV

Query:  YIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTIS
        Y+GHLCLSWE+LHWQHR+A ELQQNDSR  SRFTRVVNEFQLFS+L+QRFIEDEQF GPRIDNYA+ RLF+RSLLQVPAIR DCVNDKK RG+E ESTIS
Subjt:  YIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTIS

Query:  TAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVN
        TAALVSIIEDSM+VFREFLRA+K VRNSTIKCAQ QLNAQ MMMEIR+GL+KKERRLKEI+RSGNCI KKF++I E+EGR+K ELLIAEVELKL+SRVV+
Subjt:  TAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVN

Query:  MSKLTEGQLIWCHKKLHQINF
        MS+LTE QLIWCHKKLHQINF
Subjt:  MSKLTEGQLIWCHKKLHQINF

XP_031736637.1 uncharacterized protein LOC101214479 isoform X2 [Cucumis sativus]1.2e-22674.29Show/hide
Query:  FFRIQPDYGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTA----SGNQTPDDDFECYSGKYLGESDDGVKLE
        FF++   YGTVDSVL I+G E E     QA ADYNSSKYQLE TTQIHGF+++SETT CFV+E   TA    SGNQTPD DFECYS KYL ESDDGVKLE
Subjt:  FFRIQPDYGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTA----SGNQTPDDDFECYSGKYLGESDDGVKLE

Query:  IFSNEEALEMEDECKGFE---------FNNDGLIHEEAEKFG---LEIKNL-EDCSFLFSDS--ESPSFDEEYIEIELE----------TLVLPVNDWSE
        IF+ EE LE  DE +G E          N D  I  E EK G    E++ L ED SFLFSDS  ESP FDEEYIEIELE            +LPVNDWSE
Subjt:  IFSNEEALEMEDECKGFE---------FNNDGLIHEEAEKFG---LEIKNL-EDCSFLFSDS--ESPSFDEEYIEIELE----------TLVLPVNDWSE

Query:  EENQDCLGELTETERDEKGMEF---------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCPTSVETLKPLKNGGNFELKEHFR
        EE+QDCL ELTETE+DEKGMEF         EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   EA  MCPTSVETLKPLK   NFELK+HFR
Subjt:  EENQDCLGELTETERDEKGMEF---------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCPTSVETLKPLKNGGNFELKEHFR

Query:  EIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIEL
        EIQKVYKTYAEKMRKLDISN QTNYAIGLVKLKDP   M+ K    KS+F LKLR GR  VKDC  LTRDLKRDMEMVY+GHLCLSWE+LHWQHR+A EL
Subjt:  EIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIEL

Query:  QQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTISTAALVSIIEDSMRVFREFLRAD
        QQNDSR  SRFTRVVNEFQLFS+L+QRFIEDEQF GPRIDNYA+ RLF+RSLLQVPAIR DCVNDKK RG+E ESTISTAALVSIIEDSM+VFREFLRA+
Subjt:  QQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTISTAALVSIIEDSMRVFREFLRAD

Query:  KDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINF
        K VRNSTIKCAQ QLNAQ MMMEIR+GL+KKERRLKEI+RSGNCI KKF++I E+EGR+K ELLIAEVELKL+SRVV+MS+LTE QLIWCHKKLHQINF
Subjt:  KDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINF

TrEMBL top hitse value%identityAlignment
A0A0A0LFQ1 Uncharacterized protein2.0e-23574.4Show/hide
Query:  MLPLIYSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTA----SGNQTP
        MLPLIYS+ N L+SLF  II  FFRIQ  YGTVDSVL I+G E E     QA ADYNSSKYQLE TTQIHGF+++SETT CFV+E   TA    SGNQTP
Subjt:  MLPLIYSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTA----SGNQTP

Query:  DDDFECYSGKYLGESDDGVKLEIFSNEEALEMEDECKGFE---------FNNDGLIHEEAEKFG---LEIKNL-EDCSFLFSDS--ESPSFDEEYIEIEL
        D DFECYS KYL ESDDGVKLEIF+ EE LE  DE +G E          N D  I  E EK G    E++ L ED SFLFSDS  ESP FDEEYIEIEL
Subjt:  DDDFECYSGKYLGESDDGVKLEIFSNEEALEMEDECKGFE---------FNNDGLIHEEAEKFG---LEIKNL-EDCSFLFSDS--ESPSFDEEYIEIEL

Query:  E----------TLVLPVNDWSEEENQDCLGELTETERDEKGMEF---------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCP
        E            +LPVNDWSEEE+QDCL ELTETE+DEKGMEF         EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   EA  MCP
Subjt:  E----------TLVLPVNDWSEEENQDCLGELTETERDEKGMEF---------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCP

Query:  TSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMV
        TSVETLKPLK   NFELK+HFREIQKVYKTYAEKMRKLDISN QTNYAIGLVKLKDP   M+ K    KS+F LKLR GR  VKDC  LTRDLKRDMEMV
Subjt:  TSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMV

Query:  YIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTIS
        Y+GHLCLSWE+LHWQHR+A ELQQNDSR  SRFTRVVNEFQLFS+L+QRFIEDEQF GPRIDNYA+ RLF+RSLLQVPAIR DCVNDKK RG+E ESTIS
Subjt:  YIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTIS

Query:  TAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVN
        TAALVSIIEDSM+VFREFLRA+K VRNSTIKCAQ QLNAQ MMMEIR+GL+KKERRLKEI+RSGNCI KKF++I E+EGR+K ELLIAEVELKL+SRVV+
Subjt:  TAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVN

Query:  MSKLTEGQLIWCHKKLHQINF
        MS+LTE QLIWCHKKLHQINF
Subjt:  MSKLTEGQLIWCHKKLHQINF

A0A1S3CB94 uncharacterized protein LOC1034987341.5e-23274.3Show/hide
Query:  MLPLIYSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTAS----GNQTP
        M PLIYS+ N L+SLF  II +FFRIQ  YGTVDSVL  +G E+E     QAKA+YNSSKYQLE TTQIHGF++ESETT CFV+E   TAS      QTP
Subjt:  MLPLIYSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTAS----GNQTP

Query:  DDDFECYSGKYLGESDDGVKLEIFSNEEALEMEDECKGFEFNNDGLIHEEAEKFGL---EIKNL-EDCSFLF--SDSESPSFDEEYIEIELE--------
        D DFECYS KYL ESD GVKLEI ++EE LE  DE +G   N D  I  + EKFGL   E++ L ED SFLF  SDSESPSFDEEY+EIELE        
Subjt:  DDDFECYSGKYLGESDDGVKLEIFSNEEALEMEDECKGFEFNNDGLIHEEAEKFGL---EIKNL-EDCSFLF--SDSESPSFDEEYIEIELE--------

Query:  --TLVLPVNDWSEEENQDCLGELTETERDEKGMEF--------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCPTSVETLKPLK
            VLPVNDWSEEENQD L ELTETE+ EKGMEF        EE EEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   EA  MCPT+VETLKPLK
Subjt:  --TLVLPVNDWSEEENQDCLGELTETERDEKGMEF--------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCPTSVETLKPLK

Query:  NGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWE
           NFELK+HFREIQKVYKTY EKMRKLDISN QTNYAIGLVKLKDP   M+ K    KS+F LKLR  R DVK C  LTRDLKRDMEMVY+GHLCLSWE
Subjt:  NGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWE

Query:  VLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTISTAALVSIIED
        +LHWQHR+A ELQQNDSR  S+FTRV NEFQLFS+L+QRFIEDEQF GPRIDNYA+ RLF+RSLLQVPAIR DCVNDKK RG+E ESTISTAALVSIIED
Subjt:  VLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTISTAALVSIIED

Query:  SMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVNMSKLTEGQLI
        SM+VFREFLRADK V+NSTIKCAQVQLNAQ MMMEIR GL+KKERRLKEIMRSGNCI KKF++I EEEGR+K ELLIAEVELKL+SRVV+MS+LTE QLI
Subjt:  SMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVNMSKLTEGQLI

Query:  WCHKKLHQINF
        WCHKKLHQINF
Subjt:  WCHKKLHQINF

A0A5A7UZY3 40S ribosomal protein S3-2-like0.0e+0079.6Show/hide
Query:  YGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTAS----GNQTPDDDFECYSGKYLGESDDGVKLEIFSNEEA
        YGTVDSVL  +G E+E     QAKADYNSSKYQLE TTQIHGF++ESETT CFV+E   TAS      QTPD DFECYS KYL ESD GVKLEI ++EE 
Subjt:  YGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTAS----GNQTPDDDFECYSGKYLGESDDGVKLEIFSNEEA

Query:  LEMEDECKGFEFNNDGLIHEEAEKFGL---EIKNL-EDCSFLF--SDSESPSFDEEYIEIELE----------TLVLPVNDWSEEENQDCLGELTETERD
        LE  DE +G   N D  I  + EKFGL   E++ L ED SFLF  SDSESPSFDEEY+EIELE            VLPVNDWSEEENQD L ELTETE+D
Subjt:  LEMEDECKGFEFNNDGLIHEEAEKFGL---EIKNL-EDCSFLF--SDSESPSFDEEYIEIELE----------TLVLPVNDWSEEENQDCLGELTETERD

Query:  EKGMEF--------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCPTSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLD
        EKGMEF        EE EEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE   EA  MCPT+VETLKPLK   NFELK+HFREIQKVYKTYAEKMRKLD
Subjt:  EKGMEF--------EEEEEEFLQEHQDLINQLKIELRNSRTGGLPTVQEEE---EAECMCPTSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLD

Query:  ISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNE
        ISN QTNYAIGLVKLKDP + M+ K    KS+F LKLR  R DVK C  LTRDLKRDMEMVY+GHLCLSWE+LHWQHR+A ELQQNDSR  S+FTRV NE
Subjt:  ISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNE

Query:  FQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNA
        FQLFS+L+QRFIEDEQF GPRIDNYA+ RLF+RSLLQVPAIR DCVNDKK RG+E ESTISTAALVSIIEDSM+VFREFLRADK V+NSTIKCAQVQLNA
Subjt:  FQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKKPRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNA

Query:  QLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINFFVADGVFFAELNEVLTRE
        Q MMMEIR GL+KKERRLKEIMRSGNCI KKF++I EEEGR+K ELLIAEVELKL+SRV                       FVADGVFFAELNEVLTRE
Subjt:  QLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINFFVADGVFFAELNEVLTRE

Query:  LAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVME
        LAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVME
Subjt:  LAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLLGGLAVRRACYGVLRFVME

Query:  SGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRPAAVLT
        SGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRP AVLT
Subjt:  SGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTIHSPKEEEEIVHRPAAVLT

Query:  TDIEVPVAV
         DIEVPVAV
Subjt:  TDIEVPVAV

A0A6J1G4S1 uncharacterized protein LOC1114507414.2e-22269.51Show/hide
Query:  MLPLI----YSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADY---------------NSSKYQLEVTTQIHGFMEESETTKC
        MLPL     YSLS+ +VS+F  II  FFRIQ DYGTV+SVLA++GDE ET+ SI+A+ D+               NSSK+Q+E T QI+GF+EESETT C
Subjt:  MLPLI----YSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADY---------------NSSKYQLEVTTQIHGFMEESETTKC

Query:  FVEELYCTA----SGNQTPDDDFECYSGKYL-------GESDDGVKLEIFSNEEALEMEDECKGFEFNNDGLIHEEAEKFGLEIKNLEDCSFLF--SDSE
        FVEELYC A    SGNQTPDDDF+CYSGKYL        E  D VKLE+FS +EAL  EDE +  + N DG I +E  +       L DCSF F  SDSE
Subjt:  FVEELYCTA----SGNQTPDDDFECYSGKYL-------GESDDGVKLEIFSNEEALEMEDECKGFEFNNDGLIHEEAEKFGLEIKNLEDCSFLF--SDSE

Query:  SPSFDEEYIEIELE----------TLVLPVNDWSEEENQDCLGELTETERDEKGMEFEEEEE--------EFLQEHQDLINQLKIELRNSRTGGLPTVQE
        SPSFDEE+IEIELE            V PVNDWSEEE++DCLGE  ETERDEKGMEFEE+EE        EF QEHQDLI QLKIELRNSRTGGLPTVQE
Subjt:  SPSFDEEYIEIELE----------TLVLPVNDWSEEENQDCLGELTETERDEKGMEFEEEEE--------EFLQEHQDLINQLKIELRNSRTGGLPTVQE

Query:  EEEA--ECMCPTSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTL
        EEEA  E M PTSVE LKPLKNGGNFE +  F+EIQKVYKTYA+KMRKLD+SNTQTNYAI L+KLKDP + MN K    KS+ S KLRAGRA VK   +L
Subjt:  EEEA--ECMCPTSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKDCSTL

Query:  TRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKK
         RDLKRDMEMVY+GHLCLSWEVLHWQHR+AIELQQND+RGTSR+TRVVNEFQ F +LVQRFIEDE F GPRI+NY K RL VRSLLQVPAIREDCV+DKK
Subjt:  TRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCVNDKK

Query:  PRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIA-EEEGRLKMELLIA
         RG+EGESTISTAALVS+IE+SMRVFR+FLRADKDV ++TIKCA+V++NAQ MMMEIRT LRKKERRLKEI+R GNCIVKK ++++ EEEGRLK ELLIA
Subjt:  PRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIA-EEEGRLKMELLIA

Query:  EVELKLISRVVNMSKLTEGQLIWCHKKLHQINF
        EVELKL+SRVV+MS+LTE QL+WCHKKLHQINF
Subjt:  EVELKLISRVVNMSKLTEGQLIWCHKKLHQINF

A0A6J1KAQ8 uncharacterized protein LOC1114937711.3e-21868.45Show/hide
Query:  MLPLI----YSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADY---------------NSSKYQLEVTTQIHGFMEESETTKC
        MLPL     YSLS+ +VS+F  II  FFRIQ DYGTV+SVLA++GDE ET+ SI+A+ D                NSSK+Q+E T QIHGF+EESETT C
Subjt:  MLPLI----YSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADY---------------NSSKYQLEVTTQIHGFMEESETTKC

Query:  FVEELYCTA----SGNQTPDDDFECYSGKYL-------GESDDGVKLEIFSNEEALEMEDECKGFEFNNDGLIHEEAEKFGLEIKNLEDCSFLF--SDSE
        FVEELYC A    SGNQTPDDDFECYSGKYL        E  D VKLE+FS +EALE EDE +  + N DG I +E  +       L DCSF F  SDSE
Subjt:  FVEELYCTA----SGNQTPDDDFECYSGKYL-------GESDDGVKLEIFSNEEALEMEDECKGFEFNNDGLIHEEAEKFGLEIKNLEDCSFLF--SDSE

Query:  SPSFDEEYIEIELE----------TLVLPVNDWSEEENQDCLGELTETERDEKGMEFEEEEE------------EFLQEHQDLINQLKIELRNSRTGGLP
        SPSFDEE+IEIELE            V PVNDWSEEE++DCL E  ETERDEKGMEFEEEEE            EF QEHQDLI QLKIELRNSRTGGLP
Subjt:  SPSFDEEYIEIELE----------TLVLPVNDWSEEENQDCLGELTETERDEKGMEFEEEEE------------EFLQEHQDLINQLKIELRNSRTGGLP

Query:  TVQEEEEA--ECMCPTSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKD
        TVQEEEEA  E M PTSVE LKPLKNGGNFE +  F+EIQKVYKTYA+KMRKLD+SNTQTNYAI  +KLKDP + M+ K    KS+ S KLRA RA VK 
Subjt:  TVQEEEEA--ECMCPTSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRK----KSLFSLKLRAGRADVKD

Query:  CSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCV
           L RDLKRDMEMVY+GHLCLSWEVLHWQHR+AIELQQND+RGTSR+TRVVNEFQ F +L+QRF+EDE F GPRI+NY K RL VRSLLQVPAIREDCV
Subjt:  CSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCV

Query:  NDKKPRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAE-EEGRLKME
        NDKK RG+EGESTISTAALVS+IE+SM VFR+FLRADKDVR++ IKCA+V++NAQ MMMEIRT LRKKERRLKEI+R G+CIVKK ++++E EEGRLK E
Subjt:  NDKKPRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAE-EEGRLKME

Query:  LLIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINF
        LLIAEVELKL+SRVV+M +LTE QL+WCHKKLHQINF
Subjt:  LLIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINF

SwissProt top hitse value%identityAlignment
E2RH47 40S ribosomal protein S36.7e-9284.95Show/hide
Query:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL
        FVADG+F AELNE LTRELAEDGYSGVEVRVTP RTEIII ATRTQNVLGEKGRRIRELT+VVQKRF FPE SVELYAEKV  RGLCAIAQAESLRYKLL
Subjt:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL

Query:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI
        GGLAVRRACYGVLRF+MESGAKGCEV+VSGKLR QRAKSMKF DG MI SG PV  Y+D+AVRHVLLRQGVLGIKVKIML WDP GK GP  PLPD V+I
Subjt:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI

Query:  HSPKEE
          PK+E
Subjt:  HSPKEE

P02350 40S ribosomal protein S3-A6.7e-9284.95Show/hide
Query:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL
        FVADG+F AELNE LTRELAEDGYSGVEVRVTP RTEIII ATRTQNVLGEKGRRIRELT+VVQKRF FPE SVELYAEKV  RGLCAIAQAESLRYKLL
Subjt:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL

Query:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI
        GGLAVRRACYGVLRF+MESGAKGCEV+VSGKLR QRAKSMKF DG MI SG PV  Y+D+AVRHVLLRQGVLGIKVKIML WDP GK GP  PLPD V+I
Subjt:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI

Query:  HSPKEE
          PK+E
Subjt:  HSPKEE

Q9FJA6 40S ribosomal protein S3-33.1e-10587.17Show/hide
Query:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL
        FVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP++SVELYAEKV NRGLCAIAQAESLRYKLL
Subjt:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL

Query:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI
        GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLG+KVKIMLDWDPKGKQGP TPLPDVV I
Subjt:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI

Query:  HSPKEEEEIVHRPAAVLTTDIEVPVA
        H+PKE++  +  PA V+T    VP A
Subjt:  HSPKEEEEIVHRPAAVLTTDIEVPVA

Q9M339 40S ribosomal protein S3-22.4e-10591.2Show/hide
Query:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL
        FVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIAQAESLRYKLL
Subjt:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL

Query:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI
        GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YIDSAVRHVLLRQGVLGIKVK+MLDWDPKG  GP TPLPDVV I
Subjt:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI

Query:  HSPKEEEEIVHRPAAV
        HSPKEEE I + PA V
Subjt:  HSPKEEEEIVHRPAAV

Q9SIP7 40S ribosomal protein S3-13.8e-10391.75Show/hide
Query:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL
        FVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIAQAESLRYKLL
Subjt:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL

Query:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI
        GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLGIKVKIMLDWDP GK GP TPLPDVV I
Subjt:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI

Query:  HSPKEE
        H+PK++
Subjt:  HSPKEE

Arabidopsis top hitse value%identityAlignment
AT2G31610.1 Ribosomal protein S3 family protein2.7e-10491.75Show/hide
Query:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL
        FVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIAQAESLRYKLL
Subjt:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL

Query:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI
        GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLGIKVKIMLDWDP GK GP TPLPDVV I
Subjt:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI

Query:  HSPKEE
        H+PK++
Subjt:  HSPKEE

AT3G53870.1 Ribosomal protein S3 family protein1.7e-10691.2Show/hide
Query:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL
        FVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIAQAESLRYKLL
Subjt:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL

Query:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI
        GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YIDSAVRHVLLRQGVLGIKVK+MLDWDPKG  GP TPLPDVV I
Subjt:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI

Query:  HSPKEEEEIVHRPAAV
        HSPKEEE I + PA V
Subjt:  HSPKEEEEIVHRPAAV

AT5G35530.1 Ribosomal protein S3 family protein2.2e-10687.17Show/hide
Query:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL
        FVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP++SVELYAEKV NRGLCAIAQAESLRYKLL
Subjt:  FVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL

Query:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI
        GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP ++YID+AVRHVLLRQGVLG+KVKIMLDWDPKGKQGP TPLPDVV I
Subjt:  GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPLPDVVTI

Query:  HSPKEEEEIVHRPAAVLTTDIEVPVA
        H+PKE++  +  PA V+T    VP A
Subjt:  HSPKEEEEIVHRPAAVLTTDIEVPVA

AT5G39785.1 Protein of unknown function (DUF1666)5.2e-7136.29Show/hide
Query:  EIKNLEDCSFLFSDSESPSFDEEYIEIELETLVLPVNDWSEEENQDCLGELTETERDEKGMEFEEEEE------EFLQEHQDLINQLKIELRNSRT-GGL
        +I+ L +  FL SDS+     + +   + E   L  +D++E   +   G+  +++    G + EEEEE      E L EHQDLI QLK+E++  +  GGL
Subjt:  EIKNLEDCSFLFSDSESPSFDEEYIEIELETLVLPVNDWSEEENQDCLGELTETERDEKGMEFEEEEE------EFLQEHQDLINQLKIELRNSRT-GGL

Query:  PTVQEEEEAECMCPTSVETLKP--LKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDP--------ITPMNRK-KSLFSLKLR--
         T+ EEEE +  CP  +E LKP  ++    F+  +   E+ K +++Y E+MRKLDI + Q +YA+GL++ K P          P      S+FS+ +R  
Subjt:  PTVQEEEEAECMCPTSVETLKP--LKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDP--------ITPMNRK-KSLFSLKLR--

Query:  -AGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQ
         A +++++      ++++ ++E VY+G +CLSWE+LHWQ+ +AIEL ++D  G+ R+  V  EFQ F VL+QRF+E+E F  PR+ +Y K R  +R+LLQ
Subjt:  -AGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQ

Query:  VPAIREDCVNDKK-----PRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQL------------MMMEIRTGLRKKERRLKE
        +P IRED   DKK        +  +  I +  LV I+E+++R+F  F+R DK    S+I   + +  +Q+            M  E+++ L+ KE+RL++
Subjt:  VPAIREDCVNDKK-----PRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQL------------MMMEIRTGLRKKERRLKE

Query:  IMRSGNCIVKKFQKIAEEEGRLKMEL-LIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINF
        +++S  CI+++FQK  EE+      L   ++V++KL++RV+NMSKLT   L+WCH KL +INF
Subjt:  IMRSGNCIVKKFQKIAEEEGRLKMEL-LIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINF

AT5G39785.2 Protein of unknown function (DUF1666)4.8e-6935.99Show/hide
Query:  EIKNLEDCSFLFSDSESPSFDEEYIEIELETLVLPVNDWSEEENQDCLGELTETERDEKGMEFEEEEE------EFLQEHQDLINQLKIELRNSRT-GGL
        +I+ L +  FL SDS+     + +   + E   L  +D++E   +   G+  +++    G + EEEEE      E L EHQDLI QLK+E++  +  GGL
Subjt:  EIKNLEDCSFLFSDSESPSFDEEYIEIELETLVLPVNDWSEEENQDCLGELTETERDEKGMEFEEEEE------EFLQEHQDLINQLKIELRNSRT-GGL

Query:  PTVQEEEEAECMCPTSVETLKP--LKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDP--------ITPMNRK-KSLFSLKLR--
         T+ EEEE +  CP  +E LKP  ++    F+  +   E+ K +++Y E+MRKLDI + Q +YA+GL++ K P          P      S+FS+ +R  
Subjt:  PTVQEEEEAECMCPTSVETLKP--LKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDP--------ITPMNRK-KSLFSLKLR--

Query:  -AGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQ
         A +++++      ++++ ++E VY+G +CLSWE+LHWQ+ +AIEL ++D  G+ R+  V  EFQ F VL+QRF+E+E F  PR+ +Y K R  +R+LLQ
Subjt:  -AGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQ

Query:  VPAIREDCVNDKK-----PRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQL------------MMMEIRTGLRK-KERRLK
        +P IRED   DKK        +  +  I +  LV I+E+++R+F  F+R DK    S+I   + +  +Q+            M  E+++ L+   E+RL+
Subjt:  VPAIREDCVNDKK-----PRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQL------------MMMEIRTGLRK-KERRLK

Query:  EIMRSGNCIVKKFQKIAEEEGRLKMEL-LIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINF
        ++++S  CI+++FQK  EE+      L   ++V++KL++RV+NMSKLT   L+WCH KL +INF
Subjt:  EIMRSGNCIVKKFQKIAEEEGRLKMEL-LIAEVELKLISRVVNMSKLTEGQLIWCHKKLHQINF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCCTCTCATATATTCCCTCTCCAATTCTCTCGTCTCTCTGTTTCAACCAATAATCCCACACTTTTTCAGAATTCAACCCGATTATGGAACTGTTGATTCTGTTCT
TGCAATTGAAGGCGATGAAACTGAAACAATCACGTCAATTCAAGCAAAGGCTGATTACAATTCAAGCAAATATCAGTTGGAAGTTACGACACAAATCCATGGATTCATGG
AAGAATCAGAAACTACTAAATGTTTTGTTGAAGAATTGTACTGTACTGCTTCAGGTAATCAAACTCCTGATGATGATTTTGAATGTTACAGCGGGAAATATTTAGGGGAA
AGTGATGATGGGGTGAAACTAGAAATATTTAGTAACGAAGAGGCTTTAGAAATGGAAGATGAATGTAAGGGTTTTGAATTCAATAACGATGGCCTAATTCATGAGGAAGC
TGAGAAGTTTGGCCTTGAAATTAAAAATTTGGAAGATTGTTCGTTTCTTTTTTCAGATTCAGAATCTCCAAGTTTTGATGAAGAGTACATAGAAATAGAATTAGAAACCC
TAGTTTTGCCTGTAAATGATTGGAGCGAGGAGGAGAATCAAGATTGTTTGGGGGAACTAACAGAAACAGAAAGGGATGAGAAGGGAATGGAATTTGAAGAAGAAGAAGAG
GAATTCTTGCAAGAGCATCAAGATTTGATAAATCAACTCAAGATAGAGCTAAGAAACTCAAGAACAGGAGGACTTCCAACGGTACAAGAAGAAGAAGAGGCAGAATGCAT
GTGTCCCACATCGGTTGAAACTCTAAAACCTCTAAAAAATGGTGGAAATTTCGAACTCAAAGAACATTTCAGAGAGATTCAAAAGGTTTACAAGACTTATGCAGAGAAAA
TGCGTAAGCTCGACATCTCCAATACCCAAACAAATTATGCAATCGGTTTAGTTAAGTTGAAGGATCCAATTACTCCAATGAATAGAAAGAAATCTCTGTTTTCCCTCAAG
TTAAGGGCAGGGAGAGCAGATGTTAAGGACTGTTCAACATTGACGAGAGACTTAAAGAGGGATATGGAAATGGTGTATATTGGACATCTTTGCCTTTCTTGGGAAGTTTT
ACATTGGCAGCATAGGAGGGCCATTGAGTTGCAACAAAATGACTCTCGAGGAACCTCTCGGTTCACTCGAGTTGTCAATGAATTTCAACTCTTCTCCGTCCTCGTTCAAA
GATTCATCGAAGACGAACAGTTTCGTGGCCCTCGAATCGACAACTATGCCAAAACCCGACTTTTTGTTCGTAGTCTCCTTCAAGTTCCTGCCATTAGAGAGGATTGTGTG
AATGACAAAAAGCCGAGAGGCCAAGAAGGTGAAAGTACTATCTCAACTGCAGCTTTAGTATCGATCATTGAAGATTCAATGCGGGTTTTTCGGGAATTTCTACGTGCAGA
TAAAGACGTTAGGAATTCGACGATCAAATGTGCTCAAGTACAACTTAATGCACAACTAATGATGATGGAAATACGAACTGGTCTAAGAAAGAAGGAGAGAAGGCTAAAAG
AGATAATGAGAAGTGGGAATTGTATAGTAAAGAAGTTTCAAAAGATTGCTGAAGAAGAAGGCAGACTGAAGATGGAATTGTTGATAGCGGAAGTTGAATTGAAATTGATA
TCAAGGGTTGTCAATATGTCAAAACTAACGGAGGGCCAATTGATTTGGTGTCATAAAAAACTACACCAAATCAATTTTTTCGTAGCTGATGGAGTGTTTTTTGCTGAGCT
AAACGAGGTTTTGACCCGAGAGTTGGCCGAGGATGGGTACTCCGGTGTCGAGGTTAGAGTAACTCCTATGCGCACTGAGATCATCATCCGAGCCACCCGTACCCAAAATG
TTCTCGGTGAAAAGGGTAGGAGAATTAGGGAACTCACCTCGGTTGTGCAGAAGCGGTTCAAGTTTCCGGAGAACAGTGTTGAGCTTTATGCCGAGAAAGTTAACAACAGA
GGGCTTTGTGCCATTGCCCAAGCCGAGTCACTTCGTTATAAGCTACTTGGTGGCCTTGCTGTTCGGAGGGCATGCTACGGTGTACTGAGATTCGTCATGGAAAGTGGAGC
AAAGGGTTGCGAGGTGATTGTTAGTGGAAAACTCAGGGCACAGCGTGCAAAGTCTATGAAGTTCAAAGATGGATATATGATCTCTTCAGGTCAACCAGTCAGAGACTATA
TTGACTCGGCTGTTAGACACGTTCTTCTCAGACAGGGAGTTCTTGGTATCAAAGTTAAGATCATGCTTGATTGGGATCCTAAGGGCAAGCAAGGCCCTACCACCCCTCTT
CCTGATGTTGTCACCATCCACTCTCCCAAGGAGGAGGAAGAAATCGTCCACAGACCCGCAGCAGTTTTGACCACCGATATTGAGGTTCCGGTAGCAGTAGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTCCCTCTCATATATTCCCTCTCCAATTCTCTCGTCTCTCTGTTTCAACCAATAATCCCACACTTTTTCAGAATTCAACCCGATTATGGAACTGTTGATTCTGTTCT
TGCAATTGAAGGCGATGAAACTGAAACAATCACGTCAATTCAAGCAAAGGCTGATTACAATTCAAGCAAATATCAGTTGGAAGTTACGACACAAATCCATGGATTCATGG
AAGAATCAGAAACTACTAAATGTTTTGTTGAAGAATTGTACTGTACTGCTTCAGGTAATCAAACTCCTGATGATGATTTTGAATGTTACAGCGGGAAATATTTAGGGGAA
AGTGATGATGGGGTGAAACTAGAAATATTTAGTAACGAAGAGGCTTTAGAAATGGAAGATGAATGTAAGGGTTTTGAATTCAATAACGATGGCCTAATTCATGAGGAAGC
TGAGAAGTTTGGCCTTGAAATTAAAAATTTGGAAGATTGTTCGTTTCTTTTTTCAGATTCAGAATCTCCAAGTTTTGATGAAGAGTACATAGAAATAGAATTAGAAACCC
TAGTTTTGCCTGTAAATGATTGGAGCGAGGAGGAGAATCAAGATTGTTTGGGGGAACTAACAGAAACAGAAAGGGATGAGAAGGGAATGGAATTTGAAGAAGAAGAAGAG
GAATTCTTGCAAGAGCATCAAGATTTGATAAATCAACTCAAGATAGAGCTAAGAAACTCAAGAACAGGAGGACTTCCAACGGTACAAGAAGAAGAAGAGGCAGAATGCAT
GTGTCCCACATCGGTTGAAACTCTAAAACCTCTAAAAAATGGTGGAAATTTCGAACTCAAAGAACATTTCAGAGAGATTCAAAAGGTTTACAAGACTTATGCAGAGAAAA
TGCGTAAGCTCGACATCTCCAATACCCAAACAAATTATGCAATCGGTTTAGTTAAGTTGAAGGATCCAATTACTCCAATGAATAGAAAGAAATCTCTGTTTTCCCTCAAG
TTAAGGGCAGGGAGAGCAGATGTTAAGGACTGTTCAACATTGACGAGAGACTTAAAGAGGGATATGGAAATGGTGTATATTGGACATCTTTGCCTTTCTTGGGAAGTTTT
ACATTGGCAGCATAGGAGGGCCATTGAGTTGCAACAAAATGACTCTCGAGGAACCTCTCGGTTCACTCGAGTTGTCAATGAATTTCAACTCTTCTCCGTCCTCGTTCAAA
GATTCATCGAAGACGAACAGTTTCGTGGCCCTCGAATCGACAACTATGCCAAAACCCGACTTTTTGTTCGTAGTCTCCTTCAAGTTCCTGCCATTAGAGAGGATTGTGTG
AATGACAAAAAGCCGAGAGGCCAAGAAGGTGAAAGTACTATCTCAACTGCAGCTTTAGTATCGATCATTGAAGATTCAATGCGGGTTTTTCGGGAATTTCTACGTGCAGA
TAAAGACGTTAGGAATTCGACGATCAAATGTGCTCAAGTACAACTTAATGCACAACTAATGATGATGGAAATACGAACTGGTCTAAGAAAGAAGGAGAGAAGGCTAAAAG
AGATAATGAGAAGTGGGAATTGTATAGTAAAGAAGTTTCAAAAGATTGCTGAAGAAGAAGGCAGACTGAAGATGGAATTGTTGATAGCGGAAGTTGAATTGAAATTGATA
TCAAGGGTTGTCAATATGTCAAAACTAACGGAGGGCCAATTGATTTGGTGTCATAAAAAACTACACCAAATCAATTTTTTCGTAGCTGATGGAGTGTTTTTTGCTGAGCT
AAACGAGGTTTTGACCCGAGAGTTGGCCGAGGATGGGTACTCCGGTGTCGAGGTTAGAGTAACTCCTATGCGCACTGAGATCATCATCCGAGCCACCCGTACCCAAAATG
TTCTCGGTGAAAAGGGTAGGAGAATTAGGGAACTCACCTCGGTTGTGCAGAAGCGGTTCAAGTTTCCGGAGAACAGTGTTGAGCTTTATGCCGAGAAAGTTAACAACAGA
GGGCTTTGTGCCATTGCCCAAGCCGAGTCACTTCGTTATAAGCTACTTGGTGGCCTTGCTGTTCGGAGGGCATGCTACGGTGTACTGAGATTCGTCATGGAAAGTGGAGC
AAAGGGTTGCGAGGTGATTGTTAGTGGAAAACTCAGGGCACAGCGTGCAAAGTCTATGAAGTTCAAAGATGGATATATGATCTCTTCAGGTCAACCAGTCAGAGACTATA
TTGACTCGGCTGTTAGACACGTTCTTCTCAGACAGGGAGTTCTTGGTATCAAAGTTAAGATCATGCTTGATTGGGATCCTAAGGGCAAGCAAGGCCCTACCACCCCTCTT
CCTGATGTTGTCACCATCCACTCTCCCAAGGAGGAGGAAGAAATCGTCCACAGACCCGCAGCAGTTTTGACCACCGATATTGAGGTTCCGGTAGCAGTAGCCTAA
Protein sequenceShow/hide protein sequence
MLPLIYSLSNSLVSLFQPIIPHFFRIQPDYGTVDSVLAIEGDETETITSIQAKADYNSSKYQLEVTTQIHGFMEESETTKCFVEELYCTASGNQTPDDDFECYSGKYLGE
SDDGVKLEIFSNEEALEMEDECKGFEFNNDGLIHEEAEKFGLEIKNLEDCSFLFSDSESPSFDEEYIEIELETLVLPVNDWSEEENQDCLGELTETERDEKGMEFEEEEE
EFLQEHQDLINQLKIELRNSRTGGLPTVQEEEEAECMCPTSVETLKPLKNGGNFELKEHFREIQKVYKTYAEKMRKLDISNTQTNYAIGLVKLKDPITPMNRKKSLFSLK
LRAGRADVKDCSTLTRDLKRDMEMVYIGHLCLSWEVLHWQHRRAIELQQNDSRGTSRFTRVVNEFQLFSVLVQRFIEDEQFRGPRIDNYAKTRLFVRSLLQVPAIREDCV
NDKKPRGQEGESTISTAALVSIIEDSMRVFREFLRADKDVRNSTIKCAQVQLNAQLMMMEIRTGLRKKERRLKEIMRSGNCIVKKFQKIAEEEGRLKMELLIAEVELKLI
SRVVNMSKLTEGQLIWCHKKLHQINFFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNR
GLCAIAQAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVRDYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPTTPL
PDVVTIHSPKEEEEIVHRPAAVLTTDIEVPVAVA