CuGenDBv2

Tan0008972 (gene) of Snake gourd v1 genome

Gene ID: Tan0008972
Organism: Trichosanthes anguina (Snake gourd v1)
Description: 40S ribosomal protein S3
Genome location: LG06:1259393..1266647
RNA-Seq Expression: Tan0008972
Synteny: Tan0008972
Gene Ontology terms:
  GO:0006412 - translation (biological process)
  GO:0015935 - small ribosomal subunit (cellular component)
  GO:0003723 - RNA binding (molecular function)
  GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domains:
  IPR001005 - SANT/Myb domain
  IPR001351 - Ribosomal protein S3, C-terminal
  IPR004044 - K Homology domain, type 2
  IPR005703 - Ribosomal protein S3, eukaryotic/archaeal
  IPR009019 - K homology domain superfamily, prokaryotic type
  IPR009057 - Homeobox-like domain superfamily
  IPR015946 - K homology domain-like, alpha/beta
  IPR017930 - Myb domain
  IPR018280 - Ribosomal protein S3, conserved site
  IPR036419 - Ribosomal protein S3, C-terminal domain superfamily
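
The genome location given above (LG06:1259393..1266647) follows a `sequence:start..end` pattern. As a minimal sketch, the span can be parsed and its length computed as below. It assumes the coordinates are 1-based and inclusive, which this record does not state. `Location` and `parse_location` are hypothetical helpers written for illustration and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated in the record).
        return self.end - self.start + 1


# Matches strings such as "LG06:1259393..1266647".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("LG06:1259393..1266647")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the length is simply `end - start + 1`; with half-open coordinates the `+ 1` would be dropped.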


Homology
GenBank top hits (e value, % identity, alignment)
KAE8646506.1 hypothetical protein Csa_015876 [Cucumis sativus] (e value: 0.0e+00; % identity: 82.33)
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEEFVRPVAVIPTTEIEWL---------------------------------------SQVGFHL--------------------
        PTPLPDLVTIH+PKEEE+F+RPVAVIPTTEIE L                                       SQ+G HL                    
Subjt:  PTPLPDLVTIHTPKEEEEFVRPVAVIPTTEIEWL---------------------------------------SQVGFHL--------------------

Query:  ---NEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---GLPQSE
           NE KIVESG  QDGSTLS NQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDD++EK+EDAGQI GC+P EGTLFGKPHVE+ N   GL QS+
Subjt:  ---NEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---GLPQSE

Query:  TFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKP
        TFEA A YNARLEYIEEVLQKVKQEERLRL CGS NYASAYVNGDRK SD+HGRLPVIDEKLQS ISLQEI HSISPSL ENH N++GSLGDCLK+PDK 
Subjt:  TFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKP

Query:  VESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAKAL
        VESESSDA+CTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWL+RRI MGLTNSCDIP SSFIIKEGKFVEE S NVEG+ T PTA+ L
Subjt:  VESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAKAL

Query:  NIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTVIT
        NIECR SP+ Y+LENKD HH E MELDHGSE QH+ERAAVKR+RKPTRRYIEELSEVESREYVQKV+S+NKN +SD +SANSIARPIKKV SDGGRTVIT
Subjt:  NIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTVIT

Query:  RLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGVSI
        RLDSLGGSGFQVPCVSRVRRSRPRKD+V LVF+LP+KDQ+PSV  TDE EK+LEQKQT S N  DDNTA+V T KGG RRKHHRAWTLVEVIKLVEGVS 
Subjt:  RLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGVSI

Query:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLH
        CGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKAS  QTPVDEGISSRKHAS+SIPAQ+LLRVRELAEMHAQIPP +HGQGKLGGG    + H
Subjt:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLH

KAG6593544.1 Telomere repeat-binding protein 4, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.7e-287; % identity: 84.88)
Query:  VGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISNGLPQSET
        VGF  NEGKIV+SG+ QD STLS NQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDD+ EK+EDAGQI  C+PTEGTLFGKPHVEISNGLPQS  
Subjt:  VGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISNGLPQSET

Query:  FEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKPV
         E DAGY ARLEYIEEVLQKVKQEERLRLACGS NY SAYV+GDRK SDQHGRL V DEK QS+ISLQEI+H  SPSLNENHE++HGSLG+ LK+PDK V
Subjt:  FEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKPV

Query:  ESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAKALN
        ESESSDAICTTS P+FS+LKGD+CLDNLSIREL ECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNV GV T+P A+ALN
Subjt:  ESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAKALN

Query:  IECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTVITR
        IEC GSPT Y    KDHHHVE +ELDHG +DQHEERAAVKRIRKPTRRYIEELSEVESRE+V KVISLNK+ VSDG+SANSI RP KKVCSD GRTVITR
Subjt:  IECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTVITR

Query:  LDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEV-EKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGVSI
        LDSLGGSG QVPCVSRVRRSRPRKDIVALVF+LPDKDQNPSV DT+EV EK+LE+K T SGNA DDN  IVPTPKGG+RRKHHRAWTLVEVIKLVEGVSI
Subjt:  LDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEV-EKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGVSI

Query:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEMTSAV
        CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPV EG+SSRKH SVSIP QILL+VRELAEMHAQIPP NHGQGKLGG   G+++HE++ AV
Subjt:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEMTSAV

Query:  CL
        CL
Subjt:  CL

XP_022999983.1 uncharacterized protein LOC111494307 isoform X1 [Cucurbita maxima] (e value: 2.3e-289; % identity: 85.38)
Query:  VGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISNGLPQSET
        VGF  NEGKIV+SG+ QD STLS NQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDD+ EK+EDAGQI  CVPTEGTLFGKP V+ISNGLPQS  
Subjt:  VGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISNGLPQSET

Query:  FEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKPV
         E DAGY ARLEYIEEVLQKVK+EERLRLACGS NY SAYV+GDRK SDQHGRLPV DEK QS+ISLQEI+H  SPSLNENHEN+HGSLG+ LK+PDK V
Subjt:  FEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKPV

Query:  ESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAKALN
        ESESSDAICTTS PDFS+LKGD+CLDNLSIREL ECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKF+EESSQNV GV T+PTA+ALN
Subjt:  ESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAKALN

Query:  IECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTVITR
        IEC GSPT Y LE KDHHH+E +ELDHG EDQHEERAAVKRIRKPTRRYIEELSEVESREYV KVISLNK+ VSDG+SANSI RP KKVCSD GRTVITR
Subjt:  IECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTVITR

Query:  LDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEV-EKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGVSI
        LDSLGGSG QVPCVSRVRRSRPRKDIVALVF+LPDKDQNPSV DT+EV EK+LE+K T SGNA DDN  IVPTPK G+RRKHHRAWTLVEVIKLVEGVSI
Subjt:  LDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEV-EKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGVSI

Query:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEMTSAV
        CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPV EG+SSRKH SVSIP QILL+VRELAEMHAQIPP NHGQGKLGG   G+++HE++ AV
Subjt:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEMTSAV

Query:  CL
        CL
Subjt:  CL

XP_031745224.1 uncharacterized protein LOC101203003 isoform X1 [Cucumis sativus] (e value: 8.6e-289; % identity: 84.08)
Query:  SQVGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---GL
        ++VG   NE KIVESG  QDGSTLS NQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDD++EK+EDAGQI GC+P EGTLFGKPHVE+ N   GL
Subjt:  SQVGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---GL

Query:  PQSETFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKY
         QS+TFEA A YNARLEYIEEVLQKVKQEERLRL CGS NYASAYVNGDRK SD+HGRLPVIDEKLQS ISLQEI HSISPSL ENH N++GSLGDCLK+
Subjt:  PQSETFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKY

Query:  PDKPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPT
        PDK VESESSDA+CTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWL+RRI MGLTNSCDIP SSFIIKEGKFVEE S NVEG+ T PT
Subjt:  PDKPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPT

Query:  AKALNIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGR
        A+ LNIECR SP+ Y+LENKD HH E MELDHGSE QH+ERAAVKR+RKPTRRYIEELSEVESREYVQKV+S+NKN +SD +SANSIARPIKKV SDGGR
Subjt:  AKALNIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGR

Query:  TVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVE
        TVITRLDSLGGSGFQVPCVSRVRRSRPRKD+V LVF+LP+KDQ+PSV  TDE EK+LEQKQT S N  DDNTA+V T KGG RRKHHRAWTLVEVIKLVE
Subjt:  TVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVE

Query:  GVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEM
        GVS CGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKAS  QTPVDEGISSRKHAS+SIPAQ+LLRVRELAEMHAQIPP +HGQGKLGGG    S+HEM
Subjt:  GVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEM

Query:  TSA
        +S+
Subjt:  TSA

XP_038897567.1 uncharacterized protein LOC120085586 isoform X1 [Benincasa hispida] (e value: 6.8e-294; % identity: 86.42)
Query:  VGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---GLPQ
        VGF  NEGKIVESG AQDGSTLS NQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDD++E++EDAGQI GC+PTEGTLFGKP VEISN   GLPQ
Subjt:  VGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---GLPQ

Query:  SETFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPD
        SET EA A YNARLEYIEEVLQKVKQEERLRL CGSP Y SA VNGDRKDSD+HGRLPV+DE LQS I LQEI HSISP+L ++H N++GSLG+C K+PD
Subjt:  SETFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPD

Query:  KPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAK
        K VESESSDA+CTT NPDFSLLKGDVCLDNLSIREL ECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSF+IKEGKFVEE SQNV+G+ TVP A+
Subjt:  KPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAK

Query:  ALNIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTV
        AL IECRGSPT Y+LENKD++  E MELDHGSE QH+ERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKN +SDG+SANSIARPIKKVCSDGGRTV
Subjt:  ALNIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTV

Query:  ITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGV
        ITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSV  TDE EK+LEQKQTASGNA DDNT++V T KGG RRKHHRAWTLVEVIKLVEGV
Subjt:  ITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGV

Query:  SICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEMT-
        S CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHAS+SIPAQILL+VRELAEMHAQIPP +HGQGKLGGGV G S+HEM+ 
Subjt:  SICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEMT-

Query:  SAVC
        SA+C
Subjt:  SAVC
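
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch, the identity of a single block can be estimated from the query line and its midline. The `percent_identity` helper is hypothetical. It works on one block at a time, so its result need not equal the % identity reported for the whole hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of aligned columns in which the midline shows a letter.

    Both strings must be the sequence part only, i.e. with the leading
    'Query:  ' label (or the equivalent padding of the midline) removed.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    # A letter on the midline means the residue is identical in both sequences;
    # '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    identical = sum(1 for symbol in midline if symbol.isalpha())
    return 100.0 * identical / len(query)


# Final block of the last alignment above: query "SAVC", midline "SA+C".
print(percent_identity("SAVC", "SA+C"))
```

Dividing by the number of alignment columns, gaps included, is one common convention. Other tools divide by the length of the shorter sequence, so figures from different sources are not always comparable.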

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KCL9 Uncharacterized protein (e value: 0.0e+00; % identity: 81.83)
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEEFVRPVAVIPTTEIEWL--------------------------------------------SQVGFHL---------------
        PTPLPDLVTIH+PKEEE+F+RPVAVIPTTEIE L                                            SQ+G HL               
Subjt:  PTPLPDLVTIHTPKEEEEFVRPVAVIPTTEIEWL--------------------------------------------SQVGFHL---------------

Query:  --------NEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---G
                NE KIVESG  QDGSTLS NQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDD++EK+EDAGQI GC+P EGTLFGKPHVE+ N   G
Subjt:  --------NEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---G

Query:  LPQSETFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLK
        L QS+TFEA A YNARLEYIEEVLQKVKQEERLRL CGS NYASAYVNGDRK SD+HGRLPVIDEKLQS ISLQEI HSISPSL ENH N++GSLGDCLK
Subjt:  LPQSETFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLK

Query:  YPDKPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVP
        +PDK VESESSDA+CTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWL+RRI MGLTNSCDIP SSFIIKEGKFVEE S NVEG+ T P
Subjt:  YPDKPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVP

Query:  TAKALNIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGG
        TA+ LNIECR SP+ Y+LENKD HH E MELDHGSE QH+ERAAVKR+RKPTRRYIEELSEVESREYVQKV+S+NKN +SD +SANSIARPIKKV SDGG
Subjt:  TAKALNIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGG

Query:  RTVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLV
        RTVITRLDSLGGSGFQVPCVSRVRRSRPRKD+V LVF+LP+KDQ+PSV  TDE EK+LEQKQT S N  DDNTA+V T KGG RRKHHRAWTLVEVIKLV
Subjt:  RTVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLV

Query:  EGVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISS
        EGVS CGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKAS  QTPVDEG+ S
Subjt:  EGVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISS

A0A1S3CI77 uncharacterized protein LOC103500701 isoform X1 (e value: 3.0e-287; % identity: 84.41)
Query:  SQVGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---GL
        ++VG   NE KIVESG A+DGSTLS NQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDD++EK+EDAGQI GC PTE TLFGKPHVE+ N   GL
Subjt:  SQVGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---GL

Query:  PQSETFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKY
        PQS+TFEA A YNARLEYIEEVLQKVKQEERLRL CGSPNY SAYVNGD K SD+HGRLPVIDEKLQS +SLQ           ENH N++GSLGDCLK+
Subjt:  PQSETFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKY

Query:  PDKPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPT
        PDK VESESSDA+CTTSNPDFSLLKGD+CLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESS NVEG+ T PT
Subjt:  PDKPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPT

Query:  AKALNIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGR
        A+ LNIECR SPT Y+LENKD HH E MELDHGSE QH+ERAAVKR+RKPTRRYIEELSEVESREYVQKV+SLNKN +SD ISANSIARPIKKV SDGGR
Subjt:  AKALNIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGR

Query:  TVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVE
        TVITRLDSLGGSGFQVPCVSRVRRSRPRKD+V LVF+LP+KDQNPSV  TDEVEK LEQKQTAS N  DDNTA+VPT KGG RRKHHRAWTLVEVIKLVE
Subjt:  TVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVE

Query:  GVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEM
        GVS CGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKAS  QTPVDEGISSRKHAS+SIPAQILLRVRELAEMHAQIPP +HGQGKLGGG  G S+HEM
Subjt:  GVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEM

Query:  TSA
        +S+
Subjt:  TSA

A0A5D3BW39 HTH myb-type domain-containing protein (e value: 3.0e-287; % identity: 84.32)
Query:  SQVGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---GL
        ++VG   NE KIVESG A+DGSTLS NQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDD++EK+EDAGQI GC PT+ TLFGKPHVE+ N   GL
Subjt:  SQVGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISN---GL

Query:  PQSETFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKY
        PQS+TFEA A YNARLEYIEEVLQKVKQEERLRL CGSPNY SAYVNGD K SD+HGRLPVIDEKLQS +SLQ           ENH N++GSLGDCLK+
Subjt:  PQSETFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKY

Query:  PDKPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPT
        PDK VESESSDA+CTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESS NVEG+ T PT
Subjt:  PDKPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPT

Query:  AKALNIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGR
        A+ LNIECR SPT Y+LENKD HH E MELDHGSE QH+ERAAVKR+RKPTRRYIEELSEVESREYVQKV+SLNKN +SD ISANSIARPIKKV SDGGR
Subjt:  AKALNIECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGR

Query:  TVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVE
        TVITRLDSLGGSGFQVPCVSRVRRSRPRKD+V LVF+LP+KDQNPSV  TDEVEK LEQKQTAS N  DDNTA+VPT KGG RRKHHRAWTLVEVIKLVE
Subjt:  TVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVE

Query:  GVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEM
        GVS CGAG+WSEIKKLSFSSYSYRTSVDLKDKWRNLLKAS  QTPVDEGISSRKHAS+SIPAQILLRVRELAEMHAQIPP +HGQGKLGGG  G S+HEM
Subjt:  GVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEM

Query:  TSA-VC
        +S+ VC
Subjt:  TSA-VC

A0A6J1HME2 uncharacterized protein LOC111464283 isoform X1 (e value: 3.0e-287; % identity: 85.05)
Query:  VGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISNGLPQSET
        VGF  NEGKIV+SG+ QD STLS NQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDD+ EK+EDAGQI  C+PTEGTLFGKPHVEISNGLPQS  
Subjt:  VGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISNGLPQSET

Query:  FEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKPV
         E DAGY ARLEYIEEVLQKVKQEERLRLACGS NY SAYV+GDRK SDQHG L V DEK QS+ISLQEI+H  SPSLNENHEN+HGSLG+ LK+PDK V
Subjt:  FEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKPV

Query:  ESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAKALN
        ESESSDAICTTS P+FS+LKGD+CLDNLSIREL ECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNV GV T+PTA+ALN
Subjt:  ESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAKALN

Query:  IECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTVITR
        IEC GSPT Y LE KDHHHVE +ELDHG +DQHEERAAVKRIRKPTRRYIEELSEVESRE+V KVISLNK+ VSD +SANSI RP KKVCSD GRTVITR
Subjt:  IECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTVITR

Query:  LDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEV-EKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGVSI
        LDSLGGSG QVPCVSRVRRSRPRK+IVALVF+LPDKDQNPSV DT+EV EK+LE+K T SGNA DDN  IVPTPKGG+RRKHHRAWTLVEVIKLVEGVSI
Subjt:  LDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEV-EKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGVSI

Query:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEMTSAV
        CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPV EG+SSRKH SVSIP QILL+VRELAEMHAQIPP NHGQGKLGG   G+++HE++ AV
Subjt:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEMTSAV

Query:  CL
        CL
Subjt:  CL

A0A6J1KL99 uncharacterized protein LOC111494307 isoform X1 (e value: 1.1e-289; % identity: 85.38)
Query:  VGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISNGLPQSET
        VGF  NEGKIV+SG+ QD STLS NQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDD+ EK+EDAGQI  CVPTEGTLFGKP V+ISNGLPQS  
Subjt:  VGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISNGLPQSET

Query:  FEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKPV
         E DAGY ARLEYIEEVLQKVK+EERLRLACGS NY SAYV+GDRK SDQHGRLPV DEK QS+ISLQEI+H  SPSLNENHEN+HGSLG+ LK+PDK V
Subjt:  FEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKPV

Query:  ESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAKALN
        ESESSDAICTTS PDFS+LKGD+CLDNLSIREL ECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKF+EESSQNV GV T+PTA+ALN
Subjt:  ESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAKALN

Query:  IECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTVITR
        IEC GSPT Y LE KDHHH+E +ELDHG EDQHEERAAVKRIRKPTRRYIEELSEVESREYV KVISLNK+ VSDG+SANSI RP KKVCSD GRTVITR
Subjt:  IECRGSPTPYTLENKDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTVITR

Query:  LDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEV-EKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGVSI
        LDSLGGSG QVPCVSRVRRSRPRKDIVALVF+LPDKDQNPSV DT+EV EK+LE+K T SGNA DDN  IVPTPK G+RRKHHRAWTLVEVIKLVEGVSI
Subjt:  LDSLGGSGFQVPCVSRVRRSRPRKDIVALVFSLPDKDQNPSVMDTDEV-EKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGVSI

Query:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEMTSAV
        CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPV EG+SSRKH SVSIP QILL+VRELAEMHAQIPP NHGQGKLGG   G+++HE++ AV
Subjt:  CGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEMTSAV

Query:  CL
        CL
Subjt:  CL

SwissProt top hits (e value, % identity, alignment)
P02350 40S ribosomal protein S3-A (e value: 9.5e-97; % identity: 80.95)
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MA Q+SKKRKFVADG+F AELNE LTRELAEDGYSGVEVRVTP RTEIII ATRTQNVLGEKGRRIRELT+VVQKRF FPE SVELYAEKV  RGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRF+MESGAKGCEV+VSGKLR QRAKSMKF DG MI SG PV  Y+D+AVRHVLLRQGVLGIKVKIML WDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEEFVRPVAVIPTTEI
          PLPD V+I  PK+E        ++PTT I
Subjt:  PTPLPDLVTIHTPKEEEEFVRPVAVIPTTEI

P47835 40S ribosomal protein S3-B (e value: 9.5e-97; % identity: 80.95)
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MA QMSKKRKFVADG+F AELNE LTRELAEDGYSGVEVRVTP +TEIII ATRTQNVLGEKGRRIRELT+VVQKRF FPE SVELYAEKV  RGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRF+MESGAKGCEV+VSGKLR QRAKSMKF DG MI SG PV  Y+D+AVRHVLLRQGVLGIKVKIML WDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEEFVRPVAVIPTTEI
          PLPD V+I  PK+E        ++PTT I
Subjt:  PTPLPDLVTIHTPKEEEEFVRPVAVIPTTEI

Q9FJA6 40S ribosomal protein S3-3 (e value: 1.4e-111; % identity: 89.82)
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP++SVELYAEKV NRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYID+AVRHVLLRQGVLG+KVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEEFVRPVAVI
         TPLPD+V IHTPKE++ ++ P  V+
Subjt:  PTPLPDLVTIHTPKEEEEFVRPVAVI

Q9M339 40S ribosomal protein S3-2 (e value: 7.5e-110; % identity: 90.22)
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        M TQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYIDSAVRHVLLRQGVLGIKVK+MLDWDPKG  GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEEFVRPVAV
         TPLPD+V IH+PKEEE    P  V
Subjt:  PTPLPDLVTIHTPKEEEEFVRPVAV

Q9SIP7 40S ribosomal protein S3-1 (e value: 6.3e-109; % identity: 87.12)
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYID+AVRHVLLRQGVLGIKVKIMLDWDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEEFVRPV-AVIPTTEIE
         TPLPD+V IH PK++  +  P  A  P T ++
Subjt:  PTPLPDLVTIHTPKEEEEFVRPV-AVIPTTEIE

Arabidopsis top hits (e value, % identity, alignment)
AT1G72650.1 TRF-like 6 (e value: 2.1e-99; % identity: 40.61)
Query:  STNQIADPVVYKLVRVDGDGRFVPATDDEVMEVED---------------LLEDDQDEKME--------DAGQIEGCVPTEGTLFGKPHVEISNGLPQSE
        STNQI +PV YKLVRV GDG  VPATD+E++EV D               L  D+++ +++        DA Q  G +P EG       +E S  +  S 
Subjt:  STNQIADPVVYKLVRVDGDGRFVPATDDEVMEVED---------------LLEDDQDEKME--------DAGQIEGCVPTEGTLFGKPHVEISNGLPQSE

Query:  TFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKP
           +D       +Y EE+LQKV+QEERL    GS    S   + + + S+++      ++++  E  LQ+          E   N+   +  C      P
Subjt:  TFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKP

Query:  VESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFV--EESSQNVEGVFTVPTAK
         E+  S A      PDFS ++G++CLDNL I+ L+E F+ATFGRDTTVKDK+WLKRRIAMGL NSCD+P ++  +K+ K +  +E S +V    T    K
Subjt:  VESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFV--EESSQNVEGVFTVPTAK

Query:  ALNIECRGSPTPYTLENKDH--HHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGR
         +  + R +       + DH   H  G    + SED   E+ A KR+RKPTRRYIEELSE + ++   K +  +K+     +S  S  R I    S G R
Subjt:  ALNIECRGSPTPYTLENKDH--HHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGR

Query:  TVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALV----FSLPDK--------DQNPSVMDTDEVEKDLEQKQT--------------------------
          +TR+ SL GS  +VP VS VRRSRPR++I+AL+      L DK        + +PS + ++ V +D  +K                            
Subjt:  TVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALV----FSLPDK--------DQNPSVMDTDEVEKDLEQKQT--------------------------

Query:  -------ASGNALDDNTAIVPTPKGGT-RRKHHRAWTLVEVIKLVEGVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRK
               +SGN+ D+N   VP  +GG  RRKHHRAWTL E+ KLVEGVS  GAG+WSEIKK  FSS+SYRTSVDLKDKWRNLLK SFAQ+P +   S +K
Subjt:  -------ASGNALDDNTAIVPTPKGGT-RRKHHRAWTLVEVIKLVEGVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEGISSRK

Query:  HASVSIPAQILLRVRELAEMHAQ
        H S+ IP QILLRVRELAE  +Q
Subjt:  HASVSIPAQILLRVRELAEMHAQ

AT1G72650.2 TRF-like 6 (e value: 1.9e-100; % identity: 40.76)
Query:  STNQIADPVVYKLVRVDGDGRFVPATDDEVMEVED---------------LLEDDQDEKME--------DAGQIEGCVPTEGTLFGKPHVE----ISNGL
        STNQI +PV YKLVRV GDG  VPATD+E++EV D               L  D+++ +++        DA Q  G +P EG       +E    I++GL
Subjt:  STNQIADPVVYKLVRVDGDGRFVPATDDEVMEVED---------------LLEDDQDEKME--------DAGQIEGCVPTEGTLFGKPHVE----ISNGL

Query:  PQSETFEADAG-YNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLK
          S+  +       +R EY EE+LQKV+QEERL    GS    S   + + + S+++      ++++  E  LQ+          E   N+   +  C  
Subjt:  PQSETFEADAG-YNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLK

Query:  YPDKPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFV--EESSQNVEGVFT
            P E+  S A      PDFS ++G++CLDNL I+ L+E F+ATFGRDTTVKDK+WLKRRIAMGL NSCD+P ++  +K+ K +  +E S +V    T
Subjt:  YPDKPVESESSDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFV--EESSQNVEGVFT

Query:  VPTAKALNIECRGSPTPYTLENKDH--HHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVC
            K +  + R +       + DH   H  G    + SED   E+ A KR+RKPTRRYIEELSE + ++   K +  +K+     +S  S  R I    
Subjt:  VPTAKALNIECRGSPTPYTLENKDH--HHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVC

Query:  SDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALV----FSLPDK--------DQNPSVMDTDEVEKDLEQKQT---------------------
        S G R  +TR+ SL GS  +VP VS VRRSRPR++I+AL+      L DK        + +PS + ++ V +D  +K                       
Subjt:  SDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRKDIVALV----FSLPDK--------DQNPSVMDTDEVEKDLEQKQT---------------------

Query:  ------------ASGNALDDNTAIVPTPKGGT-RRKHHRAWTLVEVIKLVEGVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEG
                    +SGN+ D+N   VP  +GG  RRKHHRAWTL E+ KLVEGVS  GAG+WSEIKK  FSS+SYRTSVDLKDKWRNLLK SFAQ+P +  
Subjt:  ------------ASGNALDDNTAIVPTPKGGT-RRKHHRAWTLVEVIKLVEGVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKASFAQTPVDEG

Query:  ISSRKHASVSIPAQILLRVRELAEMHAQ
         S +KH S+ IP QILLRVRELAE  +Q
Subjt:  ISSRKHASVSIPAQILLRVRELAEMHAQ

AT2G31610.1 Ribosomal protein S3 family protein (e value: 4.5e-110, %identity: 87.12)
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYID+AVRHVLLRQGVLGIKVKIMLDWDP GK GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEEFVRPV-AVIPTTEIE
         TPLPD+V IH PK++  +  P  A  P T ++
Subjt:  PTPLPDLVTIHTPKEEEEFVRPV-AVIPTTEIE

AT3G53870.1 Ribosomal protein S3 family protein (e value: 5.3e-111, %identity: 90.22)
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        M TQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP +SVELYAEKVNNRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYIDSAVRHVLLRQGVLGIKVK+MLDWDPKG  GP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEEFVRPVAV
         TPLPD+V IH+PKEEE    P  V
Subjt:  PTPLPDLVTIHTPKEEEEFVRPVAV

AT5G35530.1 Ribosomal protein S3 family protein (e value: 9.7e-113, %identity: 89.82)
Query:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA
        MATQ+SKKRKFVADGVF+AELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTS+VQKRFKFP++SVELYAEKV NRGLCAIA
Subjt:  MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIA

Query:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP
        QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRA RAKSMKFKDGYM+SSGQP KEYID+AVRHVLLRQGVLG+KVKIMLDWDPKGKQGP
Subjt:  QAESLRYKLLGGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGP

Query:  PTPLPDLVTIHTPKEEEEFVRPVAVI
         TPLPD+V IHTPKE++ ++ P  V+
Subjt:  PTPLPDLVTIHTPKEEEEFVRPVAVI


Sequences
CDS sequence
ATGGCAACTCAGATGAGCAAAAAGCGTAAGTTTGTGGCCGACGGAGTGTTCTTCGCCGAGCTTAACGAAGTTCTTACCAGAGAGCTTGCAGAGGATGGATACTCCGGAGT
TGAGGTTAGGGTTACTCCTATGCGGACTGAGATTATCATTAGGGCTACTCGCACTCAGAATGTTCTTGGCGAGAAAGGAAGGAGAATCAGAGAGTTGACATCCGTTGTTC
AGAAGCGATTCAAGTTTCCTGAAAACAGCGTTGAGCTGTATGCCGAGAAGGTCAACAACAGAGGACTCTGTGCCATTGCTCAAGCTGAGTCTCTTCGCTACAAGCTTCTT
GGAGGCCTTGCTGTGAGGAGGGCTTGCTATGGTGTCCTTAGATTTGTCATGGAGAGTGGAGCTAAGGGATGTGAGGTTATCGTTAGTGGGAAGCTGAGGGCTCAGCGTGC
AAAATCCATGAAATTCAAGGATGGCTACATGATCTCATCCGGACAGCCCGTGAAAGAGTACATAGACTCTGCCGTGAGACACGTTCTCCTTAGACAGGGTGTTCTAGGTA
TCAAGGTCAAGATTATGCTCGACTGGGATCCGAAGGGCAAGCAAGGTCCACCGACACCCCTTCCCGATTTGGTTACTATCCATACTCCCAAGGAGGAAGAGGAGTTTGTT
AGGCCCGTGGCTGTGATTCCGACAACCGAGATTGAGTGGCTGAGTCAAGTCGGCTTTCACTTGAATGAAGGGAAGATTGTGGAAAGCGGGGTCGCGCAGGATGGCTCCAC
TCTGTCTACAAATCAGATTGCCGACCCAGTTGTGTATAAACTTGTTCGGGTTGATGGTGATGGCAGATTCGTTCCAGCCACAGATGATGAAGTAATGGAGGTTGAAGATT
TACTTGAAGATGACCAGGACGAAAAAATGGAAGATGCAGGACAAATTGAAGGATGCGTACCCACCGAGGGCACTTTATTTGGGAAGCCACATGTAGAAATCTCAAATGGT
TTGCCACAATCTGAAACCTTTGAAGCTGATGCAGGGTATAATGCCCGATTGGAGTACATTGAAGAGGTATTGCAAAAGGTGAAACAGGAAGAGAGGCTTCGCTTGGCATG
TGGATCACCTAACTATGCTTCTGCTTATGTGAATGGAGACAGGAAGGATTCTGATCAGCATGGTAGATTGCCTGTAATAGATGAGAAGCTCCAATCTGAAATTTCACTGC
AGGAAATTGCTCATTCAATTTCTCCAAGTTTAAATGAGAATCATGAGAATGATCATGGGAGTCTGGGCGATTGTTTAAAGTATCCAGATAAACCAGTGGAATCCGAATCC
TCGGACGCCATTTGCACTACGTCTAACCCTGATTTTTCCTTGTTAAAGGGGGACGTATGCCTGGATAATCTGTCAATTAGAGAACTCCGTGAATGTTTCAAAGCAACTTT
TGGGAGAGACACTACAGTTAAAGACAAATCGTGGCTTAAGAGGAGAATTGCCATGGGATTGACCAACTCATGCGACATTCCAGCCTCGTCTTTTATAATTAAGGAAGGCA
AGTTTGTCGAAGAAAGTTCTCAAAATGTGGAGGGCGTGTTCACTGTTCCAACTGCTAAAGCTTTGAATATTGAATGCAGAGGTTCACCAACACCTTACACATTGGAAAAT
AAGGACCATCATCATGTGGAGGGTATGGAACTTGATCATGGAAGTGAGGATCAACACGAAGAGAGAGCTGCTGTTAAAAGAATTCGGAAGCCTACCAGGCGGTATATTGA
AGAACTTTCTGAAGTGGAGTCAAGAGAGTATGTCCAAAAGGTGATAAGTTTGAATAAAAATGGTGTATCAGATGGCATATCTGCAAATTCTATTGCAAGACCTATTAAGA
AAGTCTGTTCAGATGGGGGAAGAACTGTCATCACGAGATTGGATTCACTTGGTGGATCTGGATTTCAAGTTCCATGTGTTTCAAGAGTTCGAAGGAGCCGCCCTAGGAAA
GACATTGTGGCCCTTGTGTTTTCCCTTCCAGACAAAGATCAGAATCCTTCAGTTATGGACACAGATGAAGTGGAGAAGGATTTGGAGCAGAAGCAAACAGCTTCTGGTAA
TGCATTGGATGATAACACCGCAATTGTTCCGACACCAAAAGGTGGAACGAGGAGGAAGCATCATCGCGCTTGGACTCTTGTTGAGGTCATCAAATTAGTAGAGGGTGTGT
CGATATGTGGAGCTGGGAGATGGTCTGAGATCAAGAAACTTTCTTTTTCATCATACTCATACCGCACATCAGTTGATCTCAAGGATAAATGGAGAAACCTGCTCAAAGCT
AGTTTCGCACAGACACCTGTTGATGAAGGGATAAGTTCTCGGAAACATGCGTCGGTGTCGATTCCTGCACAGATCTTGTTACGGGTGAGGGAGCTTGCTGAGATGCATGC
TCAAATTCCTCCTCCAAATCATGGCCAAGGCAAGTTGGGGGGTGGAGTTGGTGGTAATAGTCTGCATGAGATGACTTCGGCAGTGTGCTTGTAA
mRNA sequence
ATGGCAACTCAGATGAGCAAAAAGCGTAAGTTTGTGGCCGACGGAGTGTTCTTCGCCGAGCTTAACGAAGTTCTTACCAGAGAGCTTGCAGAGGATGGATACTCCGGAGT
TGAGGTTAGGGTTACTCCTATGCGGACTGAGATTATCATTAGGGCTACTCGCACTCAGAATGTTCTTGGCGAGAAAGGAAGGAGAATCAGAGAGTTGACATCCGTTGTTC
AGAAGCGATTCAAGTTTCCTGAAAACAGCGTTGAGCTGTATGCCGAGAAGGTCAACAACAGAGGACTCTGTGCCATTGCTCAAGCTGAGTCTCTTCGCTACAAGCTTCTT
GGAGGCCTTGCTGTGAGGAGGGCTTGCTATGGTGTCCTTAGATTTGTCATGGAGAGTGGAGCTAAGGGATGTGAGGTTATCGTTAGTGGGAAGCTGAGGGCTCAGCGTGC
AAAATCCATGAAATTCAAGGATGGCTACATGATCTCATCCGGACAGCCCGTGAAAGAGTACATAGACTCTGCCGTGAGACACGTTCTCCTTAGACAGGGTGTTCTAGGTA
TCAAGGTCAAGATTATGCTCGACTGGGATCCGAAGGGCAAGCAAGGTCCACCGACACCCCTTCCCGATTTGGTTACTATCCATACTCCCAAGGAGGAAGAGGAGTTTGTT
AGGCCCGTGGCTGTGATTCCGACAACCGAGATTGAGTGGCTGAGTCAAGTCGGCTTTCACTTGAATGAAGGGAAGATTGTGGAAAGCGGGGTCGCGCAGGATGGCTCCAC
TCTGTCTACAAATCAGATTGCCGACCCAGTTGTGTATAAACTTGTTCGGGTTGATGGTGATGGCAGATTCGTTCCAGCCACAGATGATGAAGTAATGGAGGTTGAAGATT
TACTTGAAGATGACCAGGACGAAAAAATGGAAGATGCAGGACAAATTGAAGGATGCGTACCCACCGAGGGCACTTTATTTGGGAAGCCACATGTAGAAATCTCAAATGGT
TTGCCACAATCTGAAACCTTTGAAGCTGATGCAGGGTATAATGCCCGATTGGAGTACATTGAAGAGGTATTGCAAAAGGTGAAACAGGAAGAGAGGCTTCGCTTGGCATG
TGGATCACCTAACTATGCTTCTGCTTATGTGAATGGAGACAGGAAGGATTCTGATCAGCATGGTAGATTGCCTGTAATAGATGAGAAGCTCCAATCTGAAATTTCACTGC
AGGAAATTGCTCATTCAATTTCTCCAAGTTTAAATGAGAATCATGAGAATGATCATGGGAGTCTGGGCGATTGTTTAAAGTATCCAGATAAACCAGTGGAATCCGAATCC
TCGGACGCCATTTGCACTACGTCTAACCCTGATTTTTCCTTGTTAAAGGGGGACGTATGCCTGGATAATCTGTCAATTAGAGAACTCCGTGAATGTTTCAAAGCAACTTT
TGGGAGAGACACTACAGTTAAAGACAAATCGTGGCTTAAGAGGAGAATTGCCATGGGATTGACCAACTCATGCGACATTCCAGCCTCGTCTTTTATAATTAAGGAAGGCA
AGTTTGTCGAAGAAAGTTCTCAAAATGTGGAGGGCGTGTTCACTGTTCCAACTGCTAAAGCTTTGAATATTGAATGCAGAGGTTCACCAACACCTTACACATTGGAAAAT
AAGGACCATCATCATGTGGAGGGTATGGAACTTGATCATGGAAGTGAGGATCAACACGAAGAGAGAGCTGCTGTTAAAAGAATTCGGAAGCCTACCAGGCGGTATATTGA
AGAACTTTCTGAAGTGGAGTCAAGAGAGTATGTCCAAAAGGTGATAAGTTTGAATAAAAATGGTGTATCAGATGGCATATCTGCAAATTCTATTGCAAGACCTATTAAGA
AAGTCTGTTCAGATGGGGGAAGAACTGTCATCACGAGATTGGATTCACTTGGTGGATCTGGATTTCAAGTTCCATGTGTTTCAAGAGTTCGAAGGAGCCGCCCTAGGAAA
GACATTGTGGCCCTTGTGTTTTCCCTTCCAGACAAAGATCAGAATCCTTCAGTTATGGACACAGATGAAGTGGAGAAGGATTTGGAGCAGAAGCAAACAGCTTCTGGTAA
TGCATTGGATGATAACACCGCAATTGTTCCGACACCAAAAGGTGGAACGAGGAGGAAGCATCATCGCGCTTGGACTCTTGTTGAGGTCATCAAATTAGTAGAGGGTGTGT
CGATATGTGGAGCTGGGAGATGGTCTGAGATCAAGAAACTTTCTTTTTCATCATACTCATACCGCACATCAGTTGATCTCAAGGATAAATGGAGAAACCTGCTCAAAGCT
AGTTTCGCACAGACACCTGTTGATGAAGGGATAAGTTCTCGGAAACATGCGTCGGTGTCGATTCCTGCACAGATCTTGTTACGGGTGAGGGAGCTTGCTGAGATGCATGC
TCAAATTCCTCCTCCAAATCATGGCCAAGGCAAGTTGGGGGGTGGAGTTGGTGGTAATAGTCTGCATGAGATGACTTCGGCAGTGTGCTTGTAA
Protein sequence
MATQMSKKRKFVADGVFFAELNEVLTRELAEDGYSGVEVRVTPMRTEIIIRATRTQNVLGEKGRRIRELTSVVQKRFKFPENSVELYAEKVNNRGLCAIAQAESLRYKLL
GGLAVRRACYGVLRFVMESGAKGCEVIVSGKLRAQRAKSMKFKDGYMISSGQPVKEYIDSAVRHVLLRQGVLGIKVKIMLDWDPKGKQGPPTPLPDLVTIHTPKEEEEFV
RPVAVIPTTEIEWLSQVGFHLNEGKIVESGVAQDGSTLSTNQIADPVVYKLVRVDGDGRFVPATDDEVMEVEDLLEDDQDEKMEDAGQIEGCVPTEGTLFGKPHVEISNG
LPQSETFEADAGYNARLEYIEEVLQKVKQEERLRLACGSPNYASAYVNGDRKDSDQHGRLPVIDEKLQSEISLQEIAHSISPSLNENHENDHGSLGDCLKYPDKPVESES
SDAICTTSNPDFSLLKGDVCLDNLSIRELRECFKATFGRDTTVKDKSWLKRRIAMGLTNSCDIPASSFIIKEGKFVEESSQNVEGVFTVPTAKALNIECRGSPTPYTLEN
KDHHHVEGMELDHGSEDQHEERAAVKRIRKPTRRYIEELSEVESREYVQKVISLNKNGVSDGISANSIARPIKKVCSDGGRTVITRLDSLGGSGFQVPCVSRVRRSRPRK
DIVALVFSLPDKDQNPSVMDTDEVEKDLEQKQTASGNALDDNTAIVPTPKGGTRRKHHRAWTLVEVIKLVEGVSICGAGRWSEIKKLSFSSYSYRTSVDLKDKWRNLLKA
SFAQTPVDEGISSRKHASVSIPAQILLRVRELAEMHAQIPPPNHGQGKLGGGVGGNSLHEMTSAVCL