; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G16580 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G16580
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description30S ribosomal protein S1-like
Genome locationctg24:1054357..1060715
RNA-Seq ExpressionCucsat.G16580
SyntenyCucsat.G16580
Gene Ontology termsGO:0000481 - maturation of 5S rRNA (biological process)
GO:0009737 - response to abscisic acid (biological process)
GO:0032508 - DNA duplex unwinding (biological process)
GO:0034337 - RNA folding (biological process)
GO:0071840 - cellular component organization or biogenesis (biological process)
GO:0005840 - ribosome (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044578.1 30S ribosomal protein S1 isoform X1 [Cucumis melo var. makuwa]1.78e-23793.46Show/hide
Query:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
        MPIF+ATIASVS HSFLSLLASTSDASSTSSSSSSS ILPLKSPSKR SIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
Subjt:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR

Query:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKV-IQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAF
        IEGSNAGGLLVRFYSL+GFLPFPQLSPSHSCKEP KSIQDIAKSL GSLISVKV IQADE+N+KLIFSEKEA  SKFSGQV VGDVYE KVGS+EDYGAF
Subjt:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKV-IQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAF

Query:  VHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNK--NKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGL
        VHLR SDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINV++  +KSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPK DSEIIPLPGL
Subjt:  VHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNK--NKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGL

Query:  ETIIEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
         TIIEEL QEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  ETIIEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

XP_004152109.1 uncharacterized protein LOC101213559 isoform X2 [Cucumis sativus]4.10e-25699.47Show/hide
Query:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
        MPIFVATIASVSAHSFLSLLASTSDASST SSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
Subjt:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR

Query:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV
        IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV
Subjt:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV

Query:  HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETI
        HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQ+SSAEPDSFGPKGDSEIIPLPGLETI
Subjt:  HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETI

Query:  IEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        IEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  IEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

XP_008454000.1 PREDICTED: 30S ribosomal protein S1 isoform X1 [Cucumis melo]1.27e-24094.2Show/hide
Query:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
        MPIF+ATIASVS HSFLSLLASTSDASSTSSSSSSS ILPLKSPSKR SIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
Subjt:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR

Query:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV
        IEGSNAGGLLVRFYSL+GFLPFPQLSPSHSCKEP KSIQDIAKSL GSLISVKVIQADE+N+KLIFSEKEA  SKFSGQV VGDVYE KVGS+EDYGAFV
Subjt:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV

Query:  HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETI
        HLR SDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINV+++KSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPK DSEIIPLPGL TI
Subjt:  HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETI

Query:  IEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        IEEL QEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  IEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

XP_011653045.1 uncharacterized protein LOC101213559 isoform X1 [Cucumis sativus]9.21e-25498.43Show/hide
Query:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
        MPIFVATIASVSAHSFLSLLASTSDASST SSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
Subjt:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR

Query:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV
        IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV
Subjt:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV

Query:  HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETI
        HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQ+SSAEPDSFGPKGDSEIIPLPGLETI
Subjt:  HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETI

Query:  IEELLQEEG----IVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        IEELLQEEG    IVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  IEELLQEEG----IVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

XP_038897871.1 30S ribosomal protein S1 homolog B [Benincasa hispida]6.34e-23089.01Show/hide
Query:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSP---ESVRRARRSADWKAAREYLDSGFIY
        MPIF AT+ SV AHSFLSLLAST+D +     SS+SFILP KSPSKR S FP+RVSLSGKPDPIAGVLDTSP   ES+RRARRSADWKAAREYLD+GFIY
Subjt:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSP---ESVRRARRSADWKAAREYLDSGFIY

Query:  EGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYG
        EGRIEGSNAGGLLVRFYSL+GFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADE+N+KLIFSEKEAA SKFS QV VGDVYE +VGSVEDYG
Subjt:  EGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYG

Query:  AFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGL
        AFVHLR SDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINV+++KSRITLSI+QLEEDPLLETLDKVIPQ  SAEPDSFGPK DSEI+PLPGL
Subjt:  AFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGL

Query:  ETIIEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        ETIIEELLQE+GIVD+ VNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  ETIIEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

TrEMBL top hitse value%identityAlignment
A0A0A0KTX2 Uncharacterized protein1.99e-25699.47Show/hide
Query:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
        MPIFVATIASVSAHSFLSLLASTSDASST SSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
Subjt:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR

Query:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV
        IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV
Subjt:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV

Query:  HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETI
        HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQ+SSAEPDSFGPKGDSEIIPLPGLETI
Subjt:  HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETI

Query:  IEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        IEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  IEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

A0A1S3BXL4 30S ribosomal protein S1 isoform X16.14e-24194.2Show/hide
Query:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
        MPIF+ATIASVS HSFLSLLASTSDASSTSSSSSSS ILPLKSPSKR SIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
Subjt:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR

Query:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV
        IEGSNAGGLLVRFYSL+GFLPFPQLSPSHSCKEP KSIQDIAKSL GSLISVKVIQADE+N+KLIFSEKEA  SKFSGQV VGDVYE KVGS+EDYGAFV
Subjt:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFV

Query:  HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETI
        HLR SDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINV+++KSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPK DSEIIPLPGL TI
Subjt:  HLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETI

Query:  IEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        IEEL QEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  IEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

A0A5A7TNY0 30S ribosomal protein S1 isoform X18.62e-23893.46Show/hide
Query:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
        MPIF+ATIASVS HSFLSLLASTSDASSTSSSSSSS ILPLKSPSKR SIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR
Subjt:  MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGR

Query:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKV-IQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAF
        IEGSNAGGLLVRFYSL+GFLPFPQLSPSHSCKEP KSIQDIAKSL GSLISVKV IQADE+N+KLIFSEKEA  SKFSGQV VGDVYE KVGS+EDYGAF
Subjt:  IEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKV-IQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAF

Query:  VHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNK--NKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGL
        VHLR SDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINV++  +KSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPK DSEIIPLPGL
Subjt:  VHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNK--NKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGL

Query:  ETIIEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
         TIIEEL QEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  ETIIEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

A0A6J1GVD7 LOW QUALITY PROTEIN: uncharacterized protein LOC1114575271.82e-21585.12Show/hide
Query:  MPIFVAT-IASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSP---ESVRRARRSADWKAAREYLDSGFI
        MPIF AT + S+SAHSFLSL AS +DAS   +S S+SF+L  KSPSKR S F +RVSLSGKP+PIAGVL++SP   ESVRRARRSADWK AREYLDSGFI
Subjt:  MPIFVAT-IASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSP---ESVRRARRSADWKAAREYLDSGFI

Query:  YEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDY
        ++GRIEGSNAGGLLVRFYSLVGFLPFP LSP+HSCKEPYKSIQDIAKSLIGSLI VKVIQADEKN+ LIFSEKEAA SKFS +V VGDVYE +VGS+EDY
Subjt:  YEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDY

Query:  GAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPG
        GAFVHLR SDGLYHLTGLVH+SEVSWDLVQDVRDILSEGDEV VKVI+V+++KSRITLSI+QLEEDPLLETLDKVIPQD SAEPDSFGPK DSEIIPLPG
Subjt:  GAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPG

Query:  LETIIEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        L+TI EELLQEEGI DV +NRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  LETIIEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

A0A6J1IUF3 uncharacterized protein LOC1114794063.55e-21885.9Show/hide
Query:  MPIFVAT-IASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSP---ESVRRARRSADWKAAREYLDSGFI
        MPIF AT + S+SAHSFLSLLAST DAS    S SSSF+L  KSPSKR S F +RVSLSGKP+PIAGVL++SP   ESVRRARRSADWK AREYLDSGFI
Subjt:  MPIFVAT-IASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSP---ESVRRARRSADWKAAREYLDSGFI

Query:  YEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDY
        ++GRIEGSNAGGLLVRFYSLVGFLPFP LSP+HSCKEPYKSIQDIAKSLIGSLI VKVIQADEKN+ LIFSEKEAA SKFS QV VGDVYE +VGSVEDY
Subjt:  YEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDY

Query:  GAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPG
        GAFVHLR SDG YHLTGLVH+SEVSWDLVQDVRDILSEGDEV VKV++V+++KSRITLSI+QLEEDPLLETLDKVIPQD SAEPDSFGPK DSEIIPLPG
Subjt:  GAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPG

Query:  LETIIEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        L+TI EELLQEEGI DV +NRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  LETIIEELLQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

SwissProt top hitse value%identityAlignment
P29344 30S ribosomal protein S1, chloroplastic7.2e-2029.47Show/hide
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAAR
        S+R+ +    W+  R+      + +G+I G+N GG++     L GF+PF Q+S   S +E           L+   I +K ++ DE+  +L+ S ++A  
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAAR

Query:  SKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP
             Q+ +G V  G V S++ YGAF+ +        + GL+HVS++S D V D+  +L  GD + V +++ ++ + R++LS ++LE  P
Subjt:  SKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP

P46228 30S ribosomal protein S14.0e-1828.95Show/hide
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAAR
        S+RR      W+  R+           +  +N GG LVR   L GF+P   +S +   KE           L+G  + +K ++ DE   +L+ S + A  
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAAR

Query:  SKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP
         +   ++ VG+V  G V  ++ YGAF+ +        ++GL+H+SE+S D ++    + +  DEV V +I+++  + RI+LS +QLE +P
Subjt:  SKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP

P74142 30S ribosomal protein S1 homolog B3.6e-1929.35Show/hide
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAAR
        S R+ +    W+   E  +SG   E  + G+N GG++     L GF+P   L             +D   +L+G ++   +++A++ N KL+ +++   +
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAAR

Query:  SKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP--LLETLDKV
        ++  G++A G++YEGKV  ++ YG FV +        +TGL+HVS+VS   V  +  + + G  ++V V  +++ K+RI+LS R LE  P  L+E  D++
Subjt:  SKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP--LLETLDKV

Query:  I
        +
Subjt:  I

Q93VC7 30S ribosomal protein S1, chloroplastic4.7e-1929.47Show/hide
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAAR
        S+R  +    W+  R+      I + ++ G+N GGL+     L GF+PF Q+S   + +E           L+   I +K ++ DE+  KL+ S ++A  
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAAR

Query:  SKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP
             Q+ +G V  G V S++ YGAF+ +        + GL+HVS++S D V D+  +L  GD + V +++ ++++ R++LS ++LE  P
Subjt:  SKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP

Q9JZ44 30S ribosomal protein S11.5e-1731.98Show/hide
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEA--
        S  +A+R+ADW A  E +++G I  G I G   GGL V   S+  FLP   +          + ++D      G  I  KVI+ D+K   ++ S +    
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEA--

Query:  -----ARSKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP
              R      +  G V +G V ++ DYGAFV L   DGL H+T      +++W  V+   ++L  G EV  KV+  ++ K R++L ++QL EDP
Subjt:  -----ARSKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP

Arabidopsis top hitse value%identityAlignment
AT1G12800.1 Nucleic acid-binding, OB-fold-like protein4.4e-2031.73Show/hide
Query:  QDIAKSLIGSLISVKVIQADEKNRKLIFS------EKEAARSK-FSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDI
        Q    S +G  I V V+ A+  +RKLIFS      E+E  + +    ++ VGDV +  +  +  +G F  L        +  LVH SEVSWD   D    
Subjt:  QDIAKSLIGSLISVKVIQADEKNRKLIFS------EKEAARSK-FSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDI

Query:  LSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPL--PGLETIIEELLQEEGIVDVRVNRQGFEKRVVSQDL
           G  V  KV  ++    RI LS++++  DPL E L+ V+  D+    D  G +  +  +    P +E++I+EL   EGI  V  +R  F    ++   
Subjt:  LSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPL--PGLETIIEELLQEEGIVDVRVNRQGFEKRVVSQDL

Query:  QLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERV
        Q+++  AP  E ++ LLARAG +VQE+ +  SL +E +K  +     RV
Subjt:  QLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERV

AT3G23700.1 Nucleic acid-binding proteins superfamily3.2e-11662.16Show/hide
Query:  SFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAG-----VLDTSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGL
        +FLS       +SS+SSS+ +S +  +KS S  ++    R S       +       + DTS E+   A   +DWK A+ Y  SG  +EG ++G N GGL
Subjt:  SFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAG-----VLDTSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGL

Query:  LVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLY
        L+RF+SLVGFLP+PQLSPS SCKEP KSI +IAK+L+GS + VKV+QADE+NRKLI SEK A   K+S  V VGDV+ G+VGSVEDYGAF+HLR  DGLY
Subjt:  LVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLY

Query:  HLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETIIEELLQEEG
        HLTGLVHVSEVSWD VQDVRD+L +GDEV V V N++K KSRITLSI+QLE+DPLLETLDKVI +DSS    S        I PLPGLETI+EELL+E+G
Subjt:  HLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETIIEELLQEEG

Query:  IVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        I  V++NRQGFEKRVVSQDLQLWLSN PP + KF LLARAGRQVQEI LTTSL+Q GIK+ALQ VLERVP
Subjt:  IVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

AT4G29060.1 elongation factor Ts family protein5.5e-0725.62Show/hide
Query:  SEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLE
        SE  A +++   ++  G  + GKV +++ +GAFV     D      GLVHVS++S + V+DV  +++ G EV V+++  +    RI+L++R+ ++ P  +
Subjt:  SEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLE

Query:  TLDKVIPQDSSAEPDSFG---PKGD---SEIIPLPGLETIIEELLQEEGIVDVRVNRQGF
        +     P+       S G    KG+   S+      L+ +++ L +    + +    +GF
Subjt:  TLDKVIPQDSSAEPDSFG---PKGD---SEIIPLPGLETIIEELLQEEGIVDVRVNRQGF

AT4G29060.2 elongation factor Ts family protein5.5e-0725.62Show/hide
Query:  SEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLE
        SE  A +++   ++  G  + GKV +++ +GAFV     D      GLVHVS++S + V+DV  +++ G EV V+++  +    RI+L++R+ ++ P  +
Subjt:  SEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLE

Query:  TLDKVIPQDSSAEPDSFG---PKGD---SEIIPLPGLETIIEELLQEEGIVDVRVNRQGF
        +     P+       S G    KG+   S+      L+ +++ L +    + +    +GF
Subjt:  TLDKVIPQDSSAEPDSFG---PKGD---SEIIPLPGLETIIEELLQEEGIVDVRVNRQGF

AT5G30510.1 ribosomal protein S13.3e-2029.47Show/hide
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAAR
        S+R  +    W+  R+      I + ++ G+N GGL+     L GF+PF Q+S   + +E           L+   I +K ++ DE+  KL+ S ++A  
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAAR

Query:  SKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP
             Q+ +G V  G V S++ YGAF+ +        + GL+HVS++S D V D+  +L  GD + V +++ ++++ R++LS ++LE  P
Subjt:  SKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAATCTTTGTTGCAACCATCGCCTCTGTCTCTGCTCATTCCTTTCTCTCACTCCTTGCTTCCACTTCTGATGCTTCTTCAACCTCCTCCTCCTCCTCCTCCTCCTT
CATTTTACCACTCAAATCCCCCTCTAAACGCTCTTCCATTTTCCCCTCCAGAGTCTCCCTCTCCGGAAAACCTGACCCCATTGCCGGAGTTCTAGACACCTCCCCGGAAT
CCGTTCGACGTGCTCGGAGATCTGCTGATTGGAAGGCAGCGAGGGAATACCTTGATAGTGGATTTATCTATGAGGGTAGGATTGAAGGTTCAAATGCTGGAGGTTTACTT
GTACGATTTTATTCTCTTGTTGGGTTTCTTCCATTCCCTCAATTGAGCCCGTCTCATTCTTGTAAAGAACCATACAAGAGTATTCAAGATATTGCAAAAAGCTTAATTGG
TTCGCTTATATCAGTAAAGGTAATCCAAGCAGATGAGAAAAACAGGAAATTGATATTTTCAGAGAAGGAAGCTGCGCGGTCAAAGTTTTCTGGGCAAGTGGCTGTGGGGG
ATGTTTATGAAGGCAAAGTTGGATCTGTGGAGGATTATGGTGCTTTTGTACATCTACGTCTCTCTGATGGTCTTTATCATCTTACTGGGCTAGTACATGTATCAGAAGTT
TCATGGGATCTAGTTCAGGATGTACGAGACATATTAAGTGAGGGTGACGAAGTGACAGTGAAGGTCATTAATGTTAACAAGAATAAGTCTCGGATCACATTGTCGATCAG
ACAACTCGAGGAAGATCCACTTTTAGAAACATTGGACAAAGTAATACCGCAGGACAGTTCTGCTGAACCTGATTCTTTTGGACCTAAAGGCGACAGCGAAATAATACCCC
TCCCTGGACTTGAAACAATAATTGAAGAGCTACTGCAGGAAGAGGGTATAGTAGATGTTCGTGTCAATCGACAAGGATTTGAGAAACGGGTGGTTTCACAAGACCTACAG
CTTTGGCTATCAAATGCACCTCCCGTTGAAAAGAAGTTCACTCTCCTTGCTCGTGCCGGGAGGCAGGTTCAAGAAATTCAGCTGACAACATCACTCGATCAGGAAGGTAT
AAAAAGGGCATTGCAGCGTGTGTTGGAACGTGTCCCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAATCTTTGTTGCAACCATCGCCTCTGTCTCTGCTCATTCCTTTCTCTCACTCCTTGCTTCCACTTCTGATGCTTCTTCAACCTCCTCCTCCTCCTCCTCCTCCTT
CATTTTACCACTCAAATCCCCCTCTAAACGCTCTTCCATTTTCCCCTCCAGAGTCTCCCTCTCCGGAAAACCTGACCCCATTGCCGGAGTTCTAGACACCTCCCCGGAAT
CCGTTCGACGTGCTCGGAGATCTGCTGATTGGAAGGCAGCGAGGGAATACCTTGATAGTGGATTTATCTATGAGGGTAGGATTGAAGGTTCAAATGCTGGAGGTTTACTT
GTACGATTTTATTCTCTTGTTGGGTTTCTTCCATTCCCTCAATTGAGCCCGTCTCATTCTTGTAAAGAACCATACAAGAGTATTCAAGATATTGCAAAAAGCTTAATTGG
TTCGCTTATATCAGTAAAGGTAATCCAAGCAGATGAGAAAAACAGGAAATTGATATTTTCAGAGAAGGAAGCTGCGCGGTCAAAGTTTTCTGGGCAAGTGGCTGTGGGGG
ATGTTTATGAAGGCAAAGTTGGATCTGTGGAGGATTATGGTGCTTTTGTACATCTACGTCTCTCTGATGGTCTTTATCATCTTACTGGGCTAGTACATGTATCAGAAGTT
TCATGGGATCTAGTTCAGGATGTACGAGACATATTAAGTGAGGGTGACGAAGTGACAGTGAAGGTCATTAATGTTAACAAGAATAAGTCTCGGATCACATTGTCGATCAG
ACAACTCGAGGAAGATCCACTTTTAGAAACATTGGACAAAGTAATACCGCAGGACAGTTCTGCTGAACCTGATTCTTTTGGACCTAAAGGCGACAGCGAAATAATACCCC
TCCCTGGACTTGAAACAATAATTGAAGAGCTACTGCAGGAAGAGGGTATAGTAGATGTTCGTGTCAATCGACAAGGATTTGAGAAACGGGTGGTTTCACAAGACCTACAG
CTTTGGCTATCAAATGCACCTCCCGTTGAAAAGAAGTTCACTCTCCTTGCTCGTGCCGGGAGGCAGGTTCAAGAAATTCAGCTGACAACATCACTCGATCAGGAAGGTAT
AAAAAGGGCATTGCAGCGTGTGTTGGAACGTGTCCCGTGA
Protein sequenceShow/hide protein sequence
MPIFVATIASVSAHSFLSLLASTSDASSTSSSSSSSFILPLKSPSKRSSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLL
VRFYSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLISVKVIQADEKNRKLIFSEKEAARSKFSGQVAVGDVYEGKVGSVEDYGAFVHLRLSDGLYHLTGLVHVSEV
SWDLVQDVRDILSEGDEVTVKVINVNKNKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKGDSEIIPLPGLETIIEELLQEEGIVDVRVNRQGFEKRVVSQDLQ
LWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP