CuGenDBv2

Sgr026597 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr026597
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: 30S ribosomal protein S31, chloroplastic
Genome location: tig00153033:1646075..1648225
RNA-Seq Expression: Sgr026597
Synteny: Sgr026597
Gene Ontology terms:
GO:0032544 - plastid translation (biological process)
GO:0005840 - ribosome (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domains:
IPR030826 - 30S ribosomal protein
IPR044695 - 30S ribosomal protein S31, plant
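
The genome location field packs a contig name and a coordinate range into one string. As a minimal sketch of how it could be split apart, the helper below (`parse_location` and the `Location` type are hypothetical names, not part of CuGenDB) assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A contig name plus a coordinate range (assumed 1-based, inclusive)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Under the inclusive-coordinate assumption, both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string such as the Genome location field."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153033:1646075..1648225")
print(loc.contig, loc.start, loc.end, loc.span)  # span is 2151 under the inclusive assumption
```

The genomic span computed this way covers the whole gene model, so it can be longer than the CDS listed further down the page.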


Homology
GenBank top hits (e value, % identity, alignment)
TQD81049.1 hypothetical protein C1H46_033402 [Malus baccata] (e value: 3.7e-70; % identity: 55.62)
Query:  MASLPFGLLQASSQSPTLSPRFFSFSKSETLGTSLRASTASLATSATSSPSSVPAVYCGRGDKKTEKGKRFNHSYGNARPRDKTKGRGTPRVPIPPSPPR
        MASL  G   A SQS   S R  SFS SETL  SL +STASL+ S  SSP  VP+VYCGRGDKKTE+GKRFNHS+GNARPR+K KGRG PRVP+P +PP+
Subjt:  MASLPFGLLQASSQSPTLSPRFFSFSKSETLGTSLRASTASLATSATSSPSSVPAVYCGRGDKKTEKGKRFNHSYGNARPRDKTKGRGTPRVPIPPSPPR

Query:  KDKFDDNEKIKI-----------------EIDEFIDPEDQKERIRQILQYQKSVY---SSSSSFSSSSA------SSLRSSSLLDLMKAGNTSLRRLFDM
        KDKF+D+  +KI                 E + F+DPE QKERIR+I++YQKS+Y   SSSSS SSSSA      S+ RSSSLL LMK GNTSLRRLFDM
Subjt:  KDKFDDNEKIKI-----------------EIDEFIDPEDQKERIRQILQYQKSVY---SSSSSFSSSSA------SSLRSSSLLDLMKAGNTSLRRLFDM

Query:  QHTSLATFFYKYSGSPMMKPITLWGSDSD--AEICDAWASIK--FGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWR
        +HTSLA  F  +SGS ++KPI LWGSD+D   E  D W SIK    ++H       +  ++   +    G ++  +T+G  +LTRKKSF+RLPRFG LWR
Subjt:  QHTSLATFFYKYSGSPMMKPITLWGSDSD--AEICDAWASIK--FGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWR

Query:  CRGSRVRFRLRRLRIMICER
        CRG R R RLRR++I+ C R
Subjt:  CRGSRVRFRLRRLRIMICER

XP_022950112.1 uncharacterized protein LOC111453292 isoform X1 [Cucurbita moschata] (e value: 2.8e-62; % identity: 77.01)
Query:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSSSASSL-------RSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAE
        E+ +FIDPE QKERIRQIL+YQKSVYSSSSS SSSS+SS+       RS SLLDLMKAGNTSLRRLFDMQHTSLAT+F KYSGSP +KPI LWGSDSD E
Subjt:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSSSASSL-------RSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAE

Query:  ICDAWASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFR
        ICDAWASIK GLSHDS SRGTNCTSNG+FMDR  G +NST+ V KHKLTRKKSF++LP FGLLWR    RVR R
Subjt:  ICDAWASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFR

XP_022950113.1 uncharacterized protein LOC111453292 isoform X2 [Cucurbita moschata] (e value: 2.5e-66; % identity: 77.6)
Query:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSSSASSL-------RSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAE
        E+ +FIDPE QKERIRQIL+YQKSVYSSSSS SSSS+SS+       RS SLLDLMKAGNTSLRRLFDMQHTSLAT+F KYSGSP +KPI LWGSDSD E
Subjt:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSSSASSL-------RSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAE

Query:  ICDAWASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFRLRRLRIMIC
        ICDAWASIK GLSHDS SRGTNCTSNG+FMDR  G +NST+ V KHKLTRKKSF++LP FGLLWR    RVR R RRLRIMIC
Subjt:  ICDAWASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFRLRRLRIMIC

XP_022978287.1 uncharacterized protein LOC111478319 isoform X2 [Cucurbita maxima] (e value: 2.5e-63; % identity: 77.09)
Query:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSS---SASSLRSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAEICDA
        E+ +FIDPE Q+ERIRQIL+YQKSVYSSSSS SSS   SASS RS SLLDLMKAGN SLRRLFDMQHTSLAT+F KYSGSP +KPI LWGSDSD EICDA
Subjt:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSS---SASSLRSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAEICDA

Query:  WASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFRLRRLRIMIC
        WASIK G SHDS S GTNCTSNG+FMDR  G +NST+ V KHKLTRKKSF++LP FGLLWR    RVR R R LRIMIC
Subjt:  WASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFRLRRLRIMIC

XP_023543295.1 serine/threonine-protein phosphatase 4 regulatory subunit 3-like isoform X2 [Cucurbita pepo subsp. pepo] (e value: 2.1e-65; % identity: 76.47)
Query:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSS-----------SASSLRSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSD
        E+ +FIDPE QKERIRQIL+YQKSVYSSSSS SSS           SASS RS SLLDLMKAGNTSLRRLFDMQHTSLAT+F KYSGSP +KPI LWGSD
Subjt:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSS-----------SASSLRSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSD

Query:  SDAEICDAWASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFRLRRLRIMIC
        SD E CDAWASIK GLSHDS SRGTNCTSNGIFMDR  G +NST+ V KHKLTRKKSF++LP FGLLWR    RVR R RRLRIMIC
Subjt:  SDAEICDAWASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFRLRRLRIMIC

TrEMBL top hits (e value, % identity, alignment)
A0A540L3X8 Uncharacterized protein (e value: 1.8e-70; % identity: 55.62)
Query:  MASLPFGLLQASSQSPTLSPRFFSFSKSETLGTSLRASTASLATSATSSPSSVPAVYCGRGDKKTEKGKRFNHSYGNARPRDKTKGRGTPRVPIPPSPPR
        MASL  G   A SQS   S R  SFS SETL  SL +STASL+ S  SSP  VP+VYCGRGDKKTE+GKRFNHS+GNARPR+K KGRG PRVP+P +PP+
Subjt:  MASLPFGLLQASSQSPTLSPRFFSFSKSETLGTSLRASTASLATSATSSPSSVPAVYCGRGDKKTEKGKRFNHSYGNARPRDKTKGRGTPRVPIPPSPPR

Query:  KDKFDDNEKIKI-----------------EIDEFIDPEDQKERIRQILQYQKSVY---SSSSSFSSSSA------SSLRSSSLLDLMKAGNTSLRRLFDM
        KDKF+D+  +KI                 E + F+DPE QKERIR+I++YQKS+Y   SSSSS SSSSA      S+ RSSSLL LMK GNTSLRRLFDM
Subjt:  KDKFDDNEKIKI-----------------EIDEFIDPEDQKERIRQILQYQKSVY---SSSSSFSSSSA------SSLRSSSLLDLMKAGNTSLRRLFDM

Query:  QHTSLATFFYKYSGSPMMKPITLWGSDSD--AEICDAWASIK--FGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWR
        +HTSLA  F  +SGS ++KPI LWGSD+D   E  D W SIK    ++H       +  ++   +    G ++  +T+G  +LTRKKSF+RLPRFG LWR
Subjt:  QHTSLATFFYKYSGSPMMKPITLWGSDSD--AEICDAWASIK--FGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWR

Query:  CRGSRVRFRLRRLRIMICER
        CRG R R RLRR++I+ C R
Subjt:  CRGSRVRFRLRRLRIMICER

A0A6J1GDW9 uncharacterized protein LOC111453292 isoform X1 (e value: 1.4e-62; % identity: 77.01)
Query:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSSSASSL-------RSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAE
        E+ +FIDPE QKERIRQIL+YQKSVYSSSSS SSSS+SS+       RS SLLDLMKAGNTSLRRLFDMQHTSLAT+F KYSGSP +KPI LWGSDSD E
Subjt:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSSSASSL-------RSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAE

Query:  ICDAWASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFR
        ICDAWASIK GLSHDS SRGTNCTSNG+FMDR  G +NST+ V KHKLTRKKSF++LP FGLLWR    RVR R
Subjt:  ICDAWASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFR

A0A6J1GDZ9 uncharacterized protein LOC111453292 isoform X2 (e value: 1.2e-66; % identity: 77.6)
Query:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSSSASSL-------RSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAE
        E+ +FIDPE QKERIRQIL+YQKSVYSSSSS SSSS+SS+       RS SLLDLMKAGNTSLRRLFDMQHTSLAT+F KYSGSP +KPI LWGSDSD E
Subjt:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSSSASSL-------RSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAE

Query:  ICDAWASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFRLRRLRIMIC
        ICDAWASIK GLSHDS SRGTNCTSNG+FMDR  G +NST+ V KHKLTRKKSF++LP FGLLWR    RVR R RRLRIMIC
Subjt:  ICDAWASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFRLRRLRIMIC

A0A6J1IM99 uncharacterized protein LOC111478319 isoform X1 (e value: 2.2e-60; % identity: 77.06)
Query:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSS---SASSLRSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAEICDA
        E+ +FIDPE Q+ERIRQIL+YQKSVYSSSSS SSS   SASS RS SLLDLMKAGN SLRRLFDMQHTSLAT+F KYSGSP +KPI LWGSDSD EICDA
Subjt:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSS---SASSLRSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAEICDA

Query:  WASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFR
        WASIK G SHDS S GTNCTSNG+FMDR  G +NST+ V KHKLTRKKSF++LP FGLLWR    RVR R
Subjt:  WASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFR

A0A6J1ISL5 uncharacterized protein LOC111478319 isoform X2 (e value: 1.2e-63; % identity: 77.09)
Query:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSS---SASSLRSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAEICDA
        E+ +FIDPE Q+ERIRQIL+YQKSVYSSSSS SSS   SASS RS SLLDLMKAGN SLRRLFDMQHTSLAT+F KYSGSP +KPI LWGSDSD EICDA
Subjt:  EIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSS---SASSLRSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAEICDA

Query:  WASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFRLRRLRIMIC
        WASIK G SHDS S GTNCTSNG+FMDR  G +NST+ V KHKLTRKKSF++LP FGLLWR    RVR R R LRIMIC
Subjt:  WASIKFGLSHDSESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFRLRRLRIMIC

SwissProt top hits (e value, % identity, alignment)
O80439 30S ribosomal protein S31, chloroplastic (e value: 7.5e-26; % identity: 68.82)
Query:  SFSKSETLGTSLRASTASLATSATSSPSSVPAVYCGRGDKKTEKGKRFNHSYGNARPRDKTKGRGTPRVPIPPSPPRKDKFDDNEKIKIEIDE
        S S SET G SL   T   + S TSS SS+P VYCGRGD+KT KGKRFNHS+GNARPR+K+KGRG  RVP+PP+PPRKDKF+++EKIKI+IDE
Subjt:  SFSKSETLGTSLRASTASLATSATSSPSSVPAVYCGRGDKKTEKGKRFNHSYGNARPRDKTKGRGTPRVPIPPSPPRKDKFDDNEKIKIEIDE

P47909 30S ribosomal protein S31, mitochondrial (e value: 3.3e-05; % identity: 63.04)
Query:  AVYCGRGDKKTEKGKRFNHSYGNARP-RDKTKGRGTPRVPIPPSPP
        AV CGRGDKKT++GKRF  SYGNARP R+K   R   RV +P S P
Subjt:  AVYCGRGDKKTEKGKRFNHSYGNARP-RDKTKGRGTPRVPIPPSPP
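
The % identity figure for this hit can be reproduced from the middle line of the alignment. The sketch below assumes the usual BLAST-style convention, in which a letter on the midline marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap, and the denominator is the full alignment length including gap columns. The function name `percent_identity` is hypothetical.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    Assumes letters mark identities, while '+' and spaces are non-identical
    columns; every column (gaps included) counts toward the alignment length.
    """
    if not midline:
        raise ValueError("empty midline")
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(midline)


# Midline of the P47909 alignment, copied without its leading indentation.
midline = "AV CGRGDKKT++GKRF  SYGNARP R+K   R   RV +P S P"

print(len(midline))                         # 46 alignment columns
print(round(percent_identity(midline), 2))  # 63.04
```

Here 29 of 46 columns are identities, which agrees with the 63.04 reported for this hit.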

P47910 30S ribosomal protein S31, chloroplastic (e value: 8.6e-14; % identity: 49.53)
Query:  MASLPFGLLQASSQSPTLSPRFFSFSKSETL-GTSLRA-STASLATSATSSPSSVPAVYCGRGDKKTEKGKRFNHSYGNARPRDKTKGRGTPRVPIPPSP
        MASL  G    +  S +     FSFS S+++ G SL +  T SL++S ++SP ++P +YCGRGD+KT KGKRFNHS+GNARP++K KGRG P+ PI P  
Subjt:  MASLPFGLLQASSQSPTLSPRFFSFSKSETL-GTSLRA-STASLATSATSSPSSVPAVYCGRGDKKTEKGKRFNHSYGNARPRDKTKGRGTPRVPIPPSP

Query:  PRKDKFD
            K D
Subjt:  PRKDKFD

Arabidopsis top hits (e value, % identity, alignment)
AT2G38140.1 plastid-specific ribosomal protein 4 (e value: 5.3e-27; % identity: 68.82)
Query:  SFSKSETLGTSLRASTASLATSATSSPSSVPAVYCGRGDKKTEKGKRFNHSYGNARPRDKTKGRGTPRVPIPPSPPRKDKFDDNEKIKIEIDE
        S S SET G SL   T   + S TSS SS+P VYCGRGD+KT KGKRFNHS+GNARPR+K+KGRG  RVP+PP+PPRKDKF+++EKIKI+IDE
Subjt:  SFSKSETLGTSLRASTASLATSATSSPSSVPAVYCGRGDKKTEKGKRFNHSYGNARPRDKTKGRGTPRVPIPPSPPRKDKFDDNEKIKIEIDE


Sequences
CDS sequence
ATGGCTTCACTTCCTTTCGGACTTCTTCAGGCCTCATCTCAATCTCCTACTCTTTCTCCTCGCTTCTTCTCTTTCTCCAAATCCGAAACCCTAGGAACGTCTCTGCGTGC
CTCTACTGCTTCTCTTGCGACCTCTGCAACTTCGTCTCCTTCCTCAGTTCCTGCTGTGTATTGTGGCAGAGGTGATAAGAAAACGGAGAAAGGAAAGCGGTTCAATCACT
CATACGGAAATGCAAGGCCTCGGGACAAGACGAAAGGGAGAGGGACTCCGAGAGTGCCGATTCCTCCTTCTCCGCCTAGGAAAGATAAATTTGACGACAATGAGAAAATC
AAGATCGAGATTGACGAGTTTATTGACCCTGAAGACCAAAAGGAAAGAATCCGACAAATACTTCAGTATCAGAAATCTGTGTATTCTTCTTCTTCATCTTTTTCCTCTTC
CTCTGCATCATCTTTAAGAAGTAGTAGTTTACTAGACTTGATGAAAGCAGGAAACACCTCTCTCAGGAGATTATTTGACATGCAGCATACTAGTTTGGCGACCTTTTTTT
ATAAGTACAGTGGATCACCTATGATGAAGCCTATAACTTTGTGGGGCAGTGATTCTGATGCTGAAATCTGCGATGCTTGGGCGTCCATCAAGTTTGGACTATCACATGAT
TCTGAGTCTCGCGGAACTAATTGCACATCAAATGGTATTTTTATGGACAGAATGATAGGGTCTCAGAATAGCACAATTACGGTTGGCAAGCACAAGTTAACTAGGAAAAA
GTCATTTCAGAGGCTGCCTAGATTTGGCCTGTTATGGAGATGCAGGGGATCTAGAGTCAGGTTTAGGTTGAGACGACTTCGGATTATGATTTGTGAAAGGAGTATACTGA
ATAATCTTGAGATGCCACTCATAGCACCACCTCAGGCTAGTAGAAATGTTATTGTGTATTCCACACGGATGCCTGTTCAATTCGGCTGA
mRNA sequence
ATGGCTTCACTTCCTTTCGGACTTCTTCAGGCCTCATCTCAATCTCCTACTCTTTCTCCTCGCTTCTTCTCTTTCTCCAAATCCGAAACCCTAGGAACGTCTCTGCGTGC
CTCTACTGCTTCTCTTGCGACCTCTGCAACTTCGTCTCCTTCCTCAGTTCCTGCTGTGTATTGTGGCAGAGGTGATAAGAAAACGGAGAAAGGAAAGCGGTTCAATCACT
CATACGGAAATGCAAGGCCTCGGGACAAGACGAAAGGGAGAGGGACTCCGAGAGTGCCGATTCCTCCTTCTCCGCCTAGGAAAGATAAATTTGACGACAATGAGAAAATC
AAGATCGAGATTGACGAGTTTATTGACCCTGAAGACCAAAAGGAAAGAATCCGACAAATACTTCAGTATCAGAAATCTGTGTATTCTTCTTCTTCATCTTTTTCCTCTTC
CTCTGCATCATCTTTAAGAAGTAGTAGTTTACTAGACTTGATGAAAGCAGGAAACACCTCTCTCAGGAGATTATTTGACATGCAGCATACTAGTTTGGCGACCTTTTTTT
ATAAGTACAGTGGATCACCTATGATGAAGCCTATAACTTTGTGGGGCAGTGATTCTGATGCTGAAATCTGCGATGCTTGGGCGTCCATCAAGTTTGGACTATCACATGAT
TCTGAGTCTCGCGGAACTAATTGCACATCAAATGGTATTTTTATGGACAGAATGATAGGGTCTCAGAATAGCACAATTACGGTTGGCAAGCACAAGTTAACTAGGAAAAA
GTCATTTCAGAGGCTGCCTAGATTTGGCCTGTTATGGAGATGCAGGGGATCTAGAGTCAGGTTTAGGTTGAGACGACTTCGGATTATGATTTGTGAAAGGAGTATACTGA
ATAATCTTGAGATGCCACTCATAGCACCACCTCAGGCTAGTAGAAATGTTATTGTGTATTCCACACGGATGCCTGTTCAATTCGGCTGA
Protein sequence
MASLPFGLLQASSQSPTLSPRFFSFSKSETLGTSLRASTASLATSATSSPSSVPAVYCGRGDKKTEKGKRFNHSYGNARPRDKTKGRGTPRVPIPPSPPRKDKFDDNEKI
KIEIDEFIDPEDQKERIRQILQYQKSVYSSSSSFSSSSASSLRSSSLLDLMKAGNTSLRRLFDMQHTSLATFFYKYSGSPMMKPITLWGSDSDAEICDAWASIKFGLSHD
SESRGTNCTSNGIFMDRMIGSQNSTITVGKHKLTRKKSFQRLPRFGLLWRCRGSRVRFRLRRLRIMICERSILNNLEMPLIAPPQASRNVIVYSTRMPVQFG
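
The protein sequence should correspond to a codon-by-codon translation of the CDS. As a small sanity-check sketch, the code below translates the first 30 and last 12 nucleotides of the CDS listed above, assuming the standard nuclear genetic code (the record does not state which translation table was used). `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; trailing bases are ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGGCTTCACTTCCTTTCGGACTTCTTCAG"  # first 30 nt of the CDS
cds_end = "CAATTCGGCTGA"                      # last 12 nt of the CDS

print(translate(cds_start))  # MASLPFGLLQ
print(translate(cds_end))    # QFG
```

Both fragments match the corresponding ends of the protein sequence; passing the full CDS to `translate` would extend the same check to the whole record.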