; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018936 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018936
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description30S ribosomal protein S1-like
Genome locationtig00153228:966828..980128
RNA-Seq ExpressionSgr018936
SyntenySgr018936
Gene Ontology termsGO:0000481 - maturation of 5S rRNA (biological process)
GO:0008285 - negative regulation of cell population proliferation (biological process)
GO:0009737 - response to abscisic acid (biological process)
GO:0032508 - DNA duplex unwinding (biological process)
GO:0034337 - RNA folding (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016020 - membrane (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR012552 - DVL
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582039.1 30S ribosomal protein S1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]8.5e-17676.87Show/hide
Query:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNCGGVSLR
        MPIFAATTLGS+SAHSFLSL  STDA   + + S+S + THKSPSKRPSNFAARVSLSGKPEPIAGVL+SSP SPESVRRAR                  
Subjt:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNCGGVSLR

Query:  LKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSI
                                                 RSADWK AREYLD GFI++GRIEGSNAGGLLVRF SLVGFLPFP LSP+HSCKEPYKSI
Subjt:  LKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSI

Query:  QDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEV
        QDIAKSLIGS IPVKVIQADE+NK LIFSEKEAAWSKFSE+V VGDVYEARVGSVEDYGAFVHL FSDGL+HLTGLVH+SEVSWDLVQDVRDILSEGDEV
Subjt:  QDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEV

Query:  RVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPP
        RVKVI+VDR+KSRITLSIKQLEEDPLLETLDKVIPQD S EPDSFGP++DSEI+PLPGL+TIFEELLQEEGIEDV +NRQGFEKRVVSQDLQLWLSNAPP
Subjt:  RVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPP

Query:  VEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP
        VE KFTLLARAGRQVQEIQLTTSLDQEGIK+ALQRVLERVP
Subjt:  VEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP

KAG7018465.1 30S ribosomal protein S1, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]1.1e-17576.87Show/hide
Query:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNCGGVSLR
        MPIFAATTLGS+SAHSFLSL  S DA   + + STS ++THKSPSKRPSNFAARVSLSGKPEPIAGVL+SSP SPESVRRAR                  
Subjt:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNCGGVSLR

Query:  LKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSI
                                                 RSADWK AREYLD GFI++GRIEGSNAGGLLVRF SLVGFLPFP LSP+HSCKEPYKSI
Subjt:  LKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSI

Query:  QDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEV
        QDIAKSLIGS IPVKVIQADE+NK LIFSEKEAAWSKFSE+V VGDVYEARVGSVEDYGAFVHL FSDGL+HLTGLVH+SEVSWDLVQDVRDILSEGDEV
Subjt:  QDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEV

Query:  RVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPP
        RVKVI+VDR+KSRITLSIKQLEEDPLLETLDKVIPQD S EPDSFGP++DSEI+PLPGL+TIFEELLQEEGIEDV +NRQGFEKRVVSQDLQLWLSNAPP
Subjt:  RVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPP

Query:  VEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP
        VE KFTLLARAGRQVQEIQLTTSLDQEGIK+ALQRVLERVP
Subjt:  VEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP

XP_022955545.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111457527 [Cucurbita moschata]1.3e-17676.29Show/hide
Query:  PLSTLNMPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNC
        PL   NMPIFAATTLGS+SAHSFLSL  S DA   + + STS ++THKSPSKRPSNFAARVSLSGKPEPIAGVL+SSP SPESVRRAR            
Subjt:  PLSTLNMPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNC

Query:  GGVSLRLKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCK
                                                       RSADWK AREYLD GFI++GRIEGSNAGGLLVRF SLVGFLPFP LSP+HSCK
Subjt:  GGVSLRLKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCK

Query:  EPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDIL
        EPYKSIQDIAKSLIGS IPVKVIQADE+NK LIFSEKEAAWSKFSE+V VGDVYEARVGS+EDYGAFVHL FSDGL+HLTGLVH+SEVSWDLVQDVRDIL
Subjt:  EPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDIL

Query:  SEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLW
        SEGDEVRVKVI+VDR+KSRITLSIKQLEEDPLLETLDKVIPQD S EPDSFGP++DSEI+PLPGL+TIFEELLQEEGIEDV +NRQGFEKRVVSQDLQLW
Subjt:  SEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLW

Query:  LSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP
        LSNAPPVE KFTLLARAGRQVQEIQLTTSLDQEGIK+ALQRVLERVP
Subjt:  LSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP

XP_022979820.1 uncharacterized protein LOC111479406 [Cucurbita maxima]6.5e-17676.64Show/hide
Query:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNCGGVSLR
        MPIFAATTLGS+SAHSFLSL  STDA   + + S+S ++THKSPSKRPSNFAARVSLSGKPEPIAGVL+SSP SPESVRRAR                  
Subjt:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNCGGVSLR

Query:  LKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSI
                                                 RSADWK AREYLD GFI++GRIEGSNAGGLLVRF SLVGFLPFP LSP+HSCKEPYKSI
Subjt:  LKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSI

Query:  QDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEV
        QDIAKSLIGS IPVKVIQADE+NK LIFSEKEAAWSKFSEQV VGDVYEARVGSVEDYGAFVHL FSDG +HLTGLVH+SEVSWDLVQDVRDILSEGDEV
Subjt:  QDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEV

Query:  RVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPP
        RVKV++VDR+KSRITLSIKQLEEDPLLETLDKVIPQD S EPDSFGP++DSEI+PLPGL+TIFEELLQEEGIEDV +NRQGFEKRVVSQDLQLWLSNAPP
Subjt:  RVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPP

Query:  VEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP
        VE KFTLLARAGRQVQEIQLTTSLDQEGIK+ALQRVLERVP
Subjt:  VEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP

XP_023526022.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111789621 [Cucurbita pepo subsp. pepo]4.5e-17776.51Show/hide
Query:  PLSTLNMPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNC
        PL   NMPIFAATTLGS+SAHSFL+L  STDA   + + STS  +THKSPSKRPSNFAARVSLSGKP+PIAGVL+SSP SPESVRRAR            
Subjt:  PLSTLNMPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNC

Query:  GGVSLRLKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCK
                                                       RSADWK AREYLD GFI++GRIEGSNAGGLLVRF SLVGFLPFP LSP+HSCK
Subjt:  GGVSLRLKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCK

Query:  EPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDIL
        EPYKSIQDIAKSLIGS IPVKVIQADE+NK LIFSEKEAAWSKFSEQV VGDVYEARVGSVEDYGAFVHL FSDGL+HLTGLVH+SEVSWDLVQDVRDIL
Subjt:  EPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDIL

Query:  SEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLW
        SEGDEVRVKVI+VDR+KSRITLSIKQLEEDPLLETLDKVIPQD S EPDSFGP++DSEI+PLPGL+TIFEELLQEEGIEDV +NRQGFEKRVVSQDLQLW
Subjt:  SEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLW

Query:  LSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP
        LSNAPPVE KFTLLARAGRQVQEIQLTTSLDQEGIK+ALQRVLERVP
Subjt:  LSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP

TrEMBL top hitse value%identityAlignment
A0A0A0KTX2 Uncharacterized protein2.2e-16173.24Show/hide
Query:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNCGGVSLR
        MPIF A T+ SVSAHSFLSL  ST   +   + S+S I+  KSPSKR S F +RVSLSGKP+PIAGVLD+   SPESVRRAR                  
Subjt:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNCGGVSLR

Query:  LKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSI
                                                 RSADWKAAREYLD GFIYEGRIEGSNAGGLLVRF SLVGFLPFPQLSPSHSCKEPYKSI
Subjt:  LKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSI

Query:  QDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEV
        QDIAKSLIGS I VKVIQADE+N+KLIFSEKEAA SKFS QV+VGDVYE +VGSVEDYGAFVHL  SDGL+HLTGLVHVSEVSWDLVQDVRDILSEGDEV
Subjt:  QDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEV

Query:  RVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPP
         VKVINV++ KSRITLSI+QLEEDPLLETLDKVIPQ+ S EPDSFGP+ DSEI+PLPGLETI EELLQEEGI DV VNRQGFEKRVVSQDLQLWLSNAPP
Subjt:  RVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPP

Query:  VEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP
        VE KFTLLARAGRQVQEIQLTTSLDQEGIK+ALQRVLERVP
Subjt:  VEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP

A0A1S3BXL4 30S ribosomal protein S1 isoform X11.8e-16372.12Show/hide
Query:  SLPFPPLSTLNMPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVR
        ++ FP    + MPIF A T+ SVS HSFLSL  ST   +   + S+SSI+  KSPSKRPS F +RVSLSGKP+PIAGVLD+   SPESVRRAR       
Subjt:  SLPFPPLSTLNMPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVR

Query:  SFLNCGGVSLRLKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSP
                                                            RSADWKAAREYLD GFIYEGRIEGSNAGGLLVRF SL+GFLPFPQLSP
Subjt:  SFLNCGGVSLRLKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSP

Query:  SHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQD
        SHSCKEP KSIQDIAKSL GS I VKVIQADERNKKLIFSEKEA WSKFS QV VGDVYEA+VGS+EDYGAFVHL FSDGL+HLTGLVHVSEVSWDLVQD
Subjt:  SHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQD

Query:  VRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQ
        VRDILSEGDEV VKVINVDR+KSRITLSI+QLEEDPLLETLDKVIPQD S EPDSFGP++DSEI+PLPGL TI EEL QEEGI DV VNRQGFEKRVVSQ
Subjt:  VRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQ

Query:  DLQLWLSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP
        DLQLWLSNAPP+E KFTLLARAGRQVQEIQLTTSLDQEGIK+ALQRVLERVP
Subjt:  DLQLWLSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP

A0A6J1CUJ6 uncharacterized protein LOC1110147143.3e-16574.1Show/hide
Query:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVT--HKSPSKRPSNFAARVSLSGKPEPIAGVLD-SSPPSPESVRRARVSTLFVRSFLNCGGV
        MPI  A TLGS+S +SFLS F STD+       S SSI+T  H SPSKR  NF  R+SL   P+PIAGVLD +SP SPES+ R+R               
Subjt:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVT--HKSPSKRPSNFAARVSLSGKPEPIAGVLD-SSPPSPESVRRARVSTLFVRSFLNCGGV

Query:  SLRLKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPY
                                                    RS DWKAAREYLD GFIYEGRIEGSNAGGLLVRF SLVGFLPFPQLSPSHSCKEPY
Subjt:  SLRLKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPY

Query:  KSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEG
        KSIQDIAKSL+GS +PVK+IQADERNKKLIFSEKEAAWSKFSEQVSVG+VY+ARVGSVEDYGAFVHL FSDGL+HLTGLVHVSEVSWDLVQDVRDILSEG
Subjt:  KSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEG

Query:  DEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSN
        DEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGS EPDSFGPR+DSEI+PLPGLETIFEELLQE+GIEDV VNRQGFEKRVVSQDLQLWLSN
Subjt:  DEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSN

Query:  APPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP
        APPVE KF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  APPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP

A0A6J1GVD7 LOW QUALITY PROTEIN: uncharacterized protein LOC1114575276.3e-17776.29Show/hide
Query:  PLSTLNMPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNC
        PL   NMPIFAATTLGS+SAHSFLSL  S DA   + + STS ++THKSPSKRPSNFAARVSLSGKPEPIAGVL+SSP SPESVRRAR            
Subjt:  PLSTLNMPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNC

Query:  GGVSLRLKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCK
                                                       RSADWK AREYLD GFI++GRIEGSNAGGLLVRF SLVGFLPFP LSP+HSCK
Subjt:  GGVSLRLKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCK

Query:  EPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDIL
        EPYKSIQDIAKSLIGS IPVKVIQADE+NK LIFSEKEAAWSKFSE+V VGDVYEARVGS+EDYGAFVHL FSDGL+HLTGLVH+SEVSWDLVQDVRDIL
Subjt:  EPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDIL

Query:  SEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLW
        SEGDEVRVKVI+VDR+KSRITLSIKQLEEDPLLETLDKVIPQD S EPDSFGP++DSEI+PLPGL+TIFEELLQEEGIEDV +NRQGFEKRVVSQDLQLW
Subjt:  SEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLW

Query:  LSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP
        LSNAPPVE KFTLLARAGRQVQEIQLTTSLDQEGIK+ALQRVLERVP
Subjt:  LSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP

A0A6J1IUF3 uncharacterized protein LOC1114794063.1e-17676.64Show/hide
Query:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNCGGVSLR
        MPIFAATTLGS+SAHSFLSL  STDA   + + S+S ++THKSPSKRPSNFAARVSLSGKPEPIAGVL+SSP SPESVRRAR                  
Subjt:  MPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNCGGVSLR

Query:  LKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSI
                                                 RSADWK AREYLD GFI++GRIEGSNAGGLLVRF SLVGFLPFP LSP+HSCKEPYKSI
Subjt:  LKNWVLLLFVRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSI

Query:  QDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEV
        QDIAKSLIGS IPVKVIQADE+NK LIFSEKEAAWSKFSEQV VGDVYEARVGSVEDYGAFVHL FSDG +HLTGLVH+SEVSWDLVQDVRDILSEGDEV
Subjt:  QDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEV

Query:  RVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPP
        RVKV++VDR+KSRITLSIKQLEEDPLLETLDKVIPQD S EPDSFGP++DSEI+PLPGL+TIFEELLQEEGIEDV +NRQGFEKRVVSQDLQLWLSNAPP
Subjt:  RVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPP

Query:  VEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP
        VE KFTLLARAGRQVQEIQLTTSLDQEGIK+ALQRVLERVP
Subjt:  VEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP

SwissProt top hitse value%identityAlignment
P29344 30S ribosomal protein S1, chloroplastic3.3e-2132.22Show/hide
Query:  WKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W+  R+      + +G+I G+N GG++     L GF+PF Q+S   S +E           L+   IP+K ++ DE   +L+ S ++ A +    Q+ +G
Subjt:  WKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  DVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
         V    V S++ YGAF+       +  + GL+HVS++S D V D+  +L  GD ++V +++ DRE+ R++LS K+LE  P
Subjt:  DVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP

P46228 30S ribosomal protein S18.9e-1930.56Show/hide
Query:  WKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W+  R+           +  +N GG LVR   L GF+P   +S +   KE           L+G  +P+K ++ DE   +L+ S + A   +   ++ VG
Subjt:  WKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  DVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
        +V    V  ++ YGAF+       +  ++GL+H+SE+S D ++    + +  DEV+V +I++D E+ RI+LS KQLE +P
Subjt:  DVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP

P73530 30S ribosomal protein S1 homolog A3.7e-1728.33Show/hide
Query:  WKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W+  R+           +  +N GG LVR   L GF+P   +           S ++  + L+G  +P+K ++ DE   +L+ S + A   +    + V 
Subjt:  WKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  DVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
         V    V  ++ YGAF+       +  ++GL+H+SE+S D +     + +  DE++V +I++D E+ RI+LS KQLE +P
Subjt:  DVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP

Q93VC7 30S ribosomal protein S1, chloroplastic1.1e-1931.67Show/hide
Query:  WKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W+  R+      I + ++ G+N GGL+     L GF+PF Q+S   + +E           L+   IP+K ++ DE   KL+ S ++A  +    Q+ +G
Subjt:  WKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  DVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
         V    V S++ YGAF+       +  + GL+HVS++S D V D+  +L  GD ++V +++ DR++ R++LS K+LE  P
Subjt:  DVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP

Q9JZ44 30S ribosomal protein S17.6e-1831.77Show/hide
Query:  ERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSE
        +R+ADW A  E ++ G I  G I G   GGL V   S+  FLP   +          + ++D      G  I  KVI+ D++   ++ S +    +   E
Subjt:  ERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSE

Query:  Q-------VSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
        +       +  G V +  V ++ DYGAFV L   D      GL+H+++++W  V+   ++L  G EV  KV+  D+EK R++L +KQL EDP
Subjt:  Q-------VSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP

Arabidopsis top hitse value%identityAlignment
AT1G12800.1 Nucleic acid-binding, OB-fold-like protein3.4e-2132.93Show/hide
Query:  QDIAKSLIGSFIPVKVIQADERNKKLIFS----EKEAAWSK---FSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDI
        Q    S +G  I V V+ A+  ++KLIFS    E E    K      ++ VGDV +  +  +  +G F        L  +  LVH SEVSWD   D    
Subjt:  QDIAKSLIGSFIPVKVIQADERNKKLIFS----EKEAAWSK---FSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDI

Query:  LSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPL--PGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDL
           G  V  KV  +D    RI LS+K++  DPL E L+ V+  D     D  G R  +  +    P +E++ +EL   EGI+ VS +R  F    ++   
Subjt:  LSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPDSFGPRNDSEIVPL--PGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDL

Query:  QLWLSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERV
        Q+++  AP  E ++ LLARAG +VQE+ +  SL +E +K  +     RV
Subjt:  QLWLSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERV

AT3G23700.1 Nucleic acid-binding proteins superfamily1.1e-12072.48Show/hide
Query:  ADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVS
        +DWK A+ Y   G  +EG ++G N GGLL+RF SLVGFLP+PQLSPS SCKEP KSI +IAK+L+GS +PVKV+QADE N+KLI SEK A W K+S+ V+
Subjt:  ADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVS

Query:  VGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPD
        VGDV+  RVGSVEDYGAF+HL F DGL+HLTGLVHVSEVSWD VQDVRD+L +GDEVRV V N+D+EKSRITLSIKQLE+DPLLETLDKVI +D ST   
Subjt:  VGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSTEPD

Query:  SFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP
        S    N   I PLPGLETI EELL+E+GIE V +NRQGFEKRVVSQDLQLWLSN PP + KF LLARAGRQVQEI LTTSL+Q GIKKALQ VLERVP
Subjt:  SFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLERVP

AT4G29060.1 elongation factor Ts family protein8.6e-0931.3Show/hide
Query:  SEQVSVGDVYEARVGSVEDYGAFVHL-CFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQD
        +E++  G  +  +V +++ +GAFV    F+D      GLVHVS++S + V+DV  +++ G EV+V+++  D E  RI+L++++ ++ P  ++        
Subjt:  SEQVSVGDVYEARVGSVEDYGAFVHL-CFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQD

Query:  GSTEPDSFGPRNDSE
        GS +P S G R+ S+
Subjt:  GSTEPDSFGPRNDSE

AT4G29060.2 elongation factor Ts family protein8.6e-0931.3Show/hide
Query:  SEQVSVGDVYEARVGSVEDYGAFVHL-CFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQD
        +E++  G  +  +V +++ +GAFV    F+D      GLVHVS++S + V+DV  +++ G EV+V+++  D E  RI+L++++ ++ P  ++        
Subjt:  SEQVSVGDVYEARVGSVEDYGAFVHL-CFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQD

Query:  GSTEPDSFGPRNDSE
        GS +P S G R+ S+
Subjt:  GSTEPDSFGPRNDSE

AT5G30510.1 ribosomal protein S17.5e-2131.67Show/hide
Query:  WKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W+  R+      I + ++ G+N GGL+     L GF+PF Q+S   + +E           L+   IP+K ++ DE   KL+ S ++A  +    Q+ +G
Subjt:  WKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  DVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
         V    V S++ YGAF+       +  + GL+HVS++S D V D+  +L  GD ++V +++ DR++ R++LS K+LE  P
Subjt:  DVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAGCCGCAGGTGCATTGTAAATGGGTTTCCAAAGCACAATCTGAGAAGATGAAGAGCCATAGCCATAGGCTTACCTCCAAGTGTGCTGCTTTGATAAAGGAGCA
GCGAGGCAGAATTTACCTCCTCCGCCGGTGCGCAACCATGTTGCTCTGCTGTTACCAAAGACGTGGGCATGGAGATCTGCAACCGCAAGCCATTGGTGACATATGGAGAA
GATGCAGTGCTTTTGTACCAGATTATCAGTTTGTGACTTTCCATTTCCAGCCCAATCCGAATGCTTTGCCACTGCGCTTCTCCCTGCCATTTCCTCCACTCTCAACTCTC
AACATGCCAATCTTTGCTGCAACAACGCTGGGATCTGTGTCCGCTCATTCCTTTCTCTCACTCTTTGGCTCCACTGATGCCCCCGCCCCCGCCCCCGCCCCTTCAACCTC
CTCCATTGTAACCCACAAGTCCCCCTCTAAACGGCCTTCCAACTTCGCCGCCAGAGTTTCCCTCTCCGGAAAACCGGAGCCCATTGCCGGAGTTCTAGACAGTTCCCCTC
CGTCGCCGGAATCAGTTCGACGTGCTCGGGTGAGTACATTGTTCGTCCGTTCGTTCTTGAATTGTGGAGGTGTTAGCTTACGATTGAAGAACTGGGTTCTTCTCTTGTTT
GTAAGGAGGGTAGATGTAGTAGGAGAAGCTATTGCGACGGGAGGAGGTGATGAGAGAGGTTGTGATGCTGGAGGAGAATTGCTCAACATGAGGGAGAGATCTGCTGATTG
GAAGGCGGCGAGGGAATACCTAGATGGTGGATTTATCTACGAAGGTAGGATTGAAGGTTCAAATGCAGGAGGTTTACTTGTTCGATTTTGTTCTCTTGTTGGGTTTCTTC
CATTCCCTCAATTGAGCCCGTCTCATTCATGTAAAGAACCATACAAAAGTATCCAAGATATTGCAAAGAGCTTAATTGGTTCGTTTATACCAGTGAAGGTTATCCAAGCA
GATGAGAGAAACAAGAAATTGATATTTTCAGAGAAGGAAGCTGCCTGGTCAAAGTTTTCTGAGCAAGTTAGTGTGGGAGATGTTTATGAAGCTAGGGTTGGTTCTGTGGA
GGACTATGGTGCCTTTGTACACTTATGTTTCTCCGATGGTCTTCATCATCTTACTGGTCTAGTACATGTCTCAGAAGTTTCATGGGATCTAGTTCAAGATGTAAGGGACA
TCTTAAGTGAGGGTGACGAAGTGAGGGTGAAAGTCATTAATGTTGACAGGGAAAAGTCTAGGATTACACTGTCAATTAAACAACTGGAGGAAGATCCACTTTTGGAAACA
TTGGACAAAGTAATACCGCAGGATGGTTCTACTGAACCTGATTCTTTCGGACCTAGAAATGACAGTGAAATTGTACCACTTCCTGGACTTGAAACAATATTTGAAGAGCT
ACTGCAGGAAGAAGGTATAGAAGATGTTAGTGTCAACCGACAAGGATTTGAGAAGCGGGTTGTTTCACAAGACCTACAGCTTTGGTTATCAAATGCGCCTCCCGTTGAAA
TGAAGTTCACTCTCCTTGCTCGTGCTGGTAGGCAGGTTCAGGAAATACAACTGACGACATCACTCGATCAGGAAGGTATAAAAAAGGCATTGCAGCGAGTATTGGAGCGT
GTCCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGAGCCGCAGGTGCATTGTAAATGGGTTTCCAAAGCACAATCTGAGAAGATGAAGAGCCATAGCCATAGGCTTACCTCCAAGTGTGCTGCTTTGATAAAGGAGCA
GCGAGGCAGAATTTACCTCCTCCGCCGGTGCGCAACCATGTTGCTCTGCTGTTACCAAAGACGTGGGCATGGAGATCTGCAACCGCAAGCCATTGGTGACATATGGAGAA
GATGCAGTGCTTTTGTACCAGATTATCAGTTTGTGACTTTCCATTTCCAGCCCAATCCGAATGCTTTGCCACTGCGCTTCTCCCTGCCATTTCCTCCACTCTCAACTCTC
AACATGCCAATCTTTGCTGCAACAACGCTGGGATCTGTGTCCGCTCATTCCTTTCTCTCACTCTTTGGCTCCACTGATGCCCCCGCCCCCGCCCCCGCCCCTTCAACCTC
CTCCATTGTAACCCACAAGTCCCCCTCTAAACGGCCTTCCAACTTCGCCGCCAGAGTTTCCCTCTCCGGAAAACCGGAGCCCATTGCCGGAGTTCTAGACAGTTCCCCTC
CGTCGCCGGAATCAGTTCGACGTGCTCGGGTGAGTACATTGTTCGTCCGTTCGTTCTTGAATTGTGGAGGTGTTAGCTTACGATTGAAGAACTGGGTTCTTCTCTTGTTT
GTAAGGAGGGTAGATGTAGTAGGAGAAGCTATTGCGACGGGAGGAGGTGATGAGAGAGGTTGTGATGCTGGAGGAGAATTGCTCAACATGAGGGAGAGATCTGCTGATTG
GAAGGCGGCGAGGGAATACCTAGATGGTGGATTTATCTACGAAGGTAGGATTGAAGGTTCAAATGCAGGAGGTTTACTTGTTCGATTTTGTTCTCTTGTTGGGTTTCTTC
CATTCCCTCAATTGAGCCCGTCTCATTCATGTAAAGAACCATACAAAAGTATCCAAGATATTGCAAAGAGCTTAATTGGTTCGTTTATACCAGTGAAGGTTATCCAAGCA
GATGAGAGAAACAAGAAATTGATATTTTCAGAGAAGGAAGCTGCCTGGTCAAAGTTTTCTGAGCAAGTTAGTGTGGGAGATGTTTATGAAGCTAGGGTTGGTTCTGTGGA
GGACTATGGTGCCTTTGTACACTTATGTTTCTCCGATGGTCTTCATCATCTTACTGGTCTAGTACATGTCTCAGAAGTTTCATGGGATCTAGTTCAAGATGTAAGGGACA
TCTTAAGTGAGGGTGACGAAGTGAGGGTGAAAGTCATTAATGTTGACAGGGAAAAGTCTAGGATTACACTGTCAATTAAACAACTGGAGGAAGATCCACTTTTGGAAACA
TTGGACAAAGTAATACCGCAGGATGGTTCTACTGAACCTGATTCTTTCGGACCTAGAAATGACAGTGAAATTGTACCACTTCCTGGACTTGAAACAATATTTGAAGAGCT
ACTGCAGGAAGAAGGTATAGAAGATGTTAGTGTCAACCGACAAGGATTTGAGAAGCGGGTTGTTTCACAAGACCTACAGCTTTGGTTATCAAATGCGCCTCCCGTTGAAA
TGAAGTTCACTCTCCTTGCTCGTGCTGGTAGGCAGGTTCAGGAAATACAACTGACGACATCACTCGATCAGGAAGGTATAAAAAAGGCATTGCAGCGAGTATTGGAGCGT
GTCCCATGA
Protein sequenceShow/hide protein sequence
MGEPQVHCKWVSKAQSEKMKSHSHRLTSKCAALIKEQRGRIYLLRRCATMLLCCYQRRGHGDLQPQAIGDIWRRCSAFVPDYQFVTFHFQPNPNALPLRFSLPFPPLSTL
NMPIFAATTLGSVSAHSFLSLFGSTDAPAPAPAPSTSSIVTHKSPSKRPSNFAARVSLSGKPEPIAGVLDSSPPSPESVRRARVSTLFVRSFLNCGGVSLRLKNWVLLLF
VRRVDVVGEAIATGGGDERGCDAGGELLNMRERSADWKAAREYLDGGFIYEGRIEGSNAGGLLVRFCSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSFIPVKVIQA
DERNKKLIFSEKEAAWSKFSEQVSVGDVYEARVGSVEDYGAFVHLCFSDGLHHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLET
LDKVIPQDGSTEPDSFGPRNDSEIVPLPGLETIFEELLQEEGIEDVSVNRQGFEKRVVSQDLQLWLSNAPPVEMKFTLLARAGRQVQEIQLTTSLDQEGIKKALQRVLER
VP