; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018404 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018404
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSm domain-containing protein
Genome locationtig00153197:1402200..1403042
RNA-Seq ExpressionSgr018404
SyntenySgr018404
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0030490 - maturation of SSU-rRNA (biological process)
GO:0000932 - P-body (cellular component)
GO:0005681 - spliceosomal complex (cellular component)
GO:0005688 - U6 snRNP (cellular component)
GO:0005730 - nucleolus (cellular component)
GO:0005732 - small nucleolar ribonucleoprotein complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0046540 - U4/U6 x U5 tri-snRNP complex (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR016487 - Sm-like protein Lsm6/SmF


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600796.1 Eukaryotic translation initiation factor 2 subunit 3, Y-linked, partial [Cucurbita argyrosperma subsp. sororia]9.1e-3598.7Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        MSIGGEKGS STKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

XP_004140401.1 sm-like protein LSM36B [Cucumis sativus]3.1e-35100Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

XP_022942161.1 sm-like protein LSM36B isoform X2 [Cucurbita moschata]1.4e-3596.3Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNVRRRG
        MSIGGEKGSASTKTPADFLKSIRG PVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV +RG
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNVRRRG

XP_022942809.1 U6 snRNA-associated Sm-like protein LSm6 [Cucurbita moschata]9.1e-3598.7Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        MSIGGEKGS STKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

XP_022974953.1 U6 snRNA-associated Sm-like protein LSm6 [Cucurbita maxima]9.1e-3598.7Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        MSIGGEKGS STKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

TrEMBL top hitse value%identityAlignment
A0A0A0KSS9 Sm domain-containing protein1.5e-35100Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

A0A1S3CBJ6 sm-like protein LSM36B1.5e-35100Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

A0A6J1DNH0 sm-like protein LSM36B1.5e-35100Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

A0A6J1FN26 sm-like protein LSM36B isoform X26.8e-3696.3Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNVRRRG
        MSIGGEKGSASTKTPADFLKSIRG PVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV +RG
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNVRRRG

A0A6J1IHU4 U6 snRNA-associated Sm-like protein LSm64.4e-3598.7Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        MSIGGEKGS STKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

SwissProt top hitse value%identityAlignment
A8NHT8 U6 snRNA-associated Sm-like protein LSm61.1e-2275.76Show/hide
Query:  TKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        T +P DFLK + G+ VVV+L SGVDYRGIL+CLDGYMNIAMEQTEE VNG++ N+YGDAFIRGNNV
Subjt:  TKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

O22823 Sm-like protein LSM36B5.2e-3388.31Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        MS  GEK S +TKTPADFLKSIRG+PVVVKLNSGVDYRGIL CLDGYMNIAMEQTEEYVNGQLKN YGDAF+RGNNV
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

P62312 U6 snRNA-associated Sm-like protein LSm65.9e-2986.96Show/hide
Query:  SASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        S   +TP+DFLK I GRPVVVKLNSGVDYRG+LACLDGYMNIA+EQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  SASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

P62313 U6 snRNA-associated Sm-like protein LSm65.9e-2986.96Show/hide
Query:  SASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        S   +TP+DFLK I GRPVVVKLNSGVDYRG+LACLDGYMNIA+EQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  SASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

Q9M1Z3 Sm-like protein LSM6A2.6e-3293.06Show/hide
Query:  EKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        EK S +TKTPADFLKSIRGRPVVVKLNSGVDYRG L CLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  EKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

Arabidopsis top hitse value%identityAlignment
AT2G43810.1 Small nuclear ribonucleoprotein family protein3.7e-3488.31Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        MS  GEK S +TKTPADFLKSIRG+PVVVKLNSGVDYRGIL CLDGYMNIAMEQTEEYVNGQLKN YGDAF+RGNNV
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

AT2G43810.2 Small nuclear ribonucleoprotein family protein3.7e-3488.31Show/hide
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        MS  GEK S +TKTPADFLKSIRG+PVVVKLNSGVDYRGIL CLDGYMNIAMEQTEEYVNGQLKN YGDAF+RGNNV
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

AT3G59810.1 Small nuclear ribonucleoprotein family protein1.8e-3393.06Show/hide
Query:  EKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        EK S +TKTPADFLKSIRGRPVVVKLNSGVDYRG L CLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
Subjt:  EKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

AT4G30220.1 small nuclear ribonucleoprotein F1.4e-1247.62Show/hide
Query:  PADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        P  FL ++ G+ V+VKL  G++Y+G LA +D YMN+ +  TEEY++GQL    G+  IR NNV
Subjt:  PADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV

AT4G30220.2 small nuclear ribonucleoprotein F1.4e-1247.62Show/hide
Query:  PADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV
        P  FL ++ G+ V+VKL  G++Y+G LA +D YMN+ +  TEEY++GQL    G+  IR NNV
Subjt:  PADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATTGGGGGTGAAAAAGGGTCTGCATCAACAAAGACACCAGCGGACTTTCTCAAATCGATTCGGGGCCGTCCAGTAGTTGTGAAACTCAATTCTGGTGTTGACTA
CCGAGGTATTTTAGCTTGTCTCGACGGATACATGAATATAGCAATGGAGCAGACTGAGGAATACGTAAATGGGCAGTTGAAGAACAAATATGGCGATGCTTTCATCCGTG
GAAATAATGTACGTCGAAGAGGACATTATCAGAAGGGTCTTCTTAGCTTAGCTAAGAAAATTCTCCCAGGAAATGCACATCTTTTGCAGTTATGTTCCTTCAGAGTCTCT
CTCTCATGCTGTAAAATGAGAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTATTGGGGGTGAAAAAGGGTCTGCATCAACAAAGACACCAGCGGACTTTCTCAAATCGATTCGGGGCCGTCCAGTAGTTGTGAAACTCAATTCTGGTGTTGACTA
CCGAGGTATTTTAGCTTGTCTCGACGGATACATGAATATAGCAATGGAGCAGACTGAGGAATACGTAAATGGGCAGTTGAAGAACAAATATGGCGATGCTTTCATCCGTG
GAAATAATGTACGTCGAAGAGGACATTATCAGAAGGGTCTTCTTAGCTTAGCTAAGAAAATTCTCCCAGGAAATGCACATCTTTTGCAGTTATGTTCCTTCAGAGTCTCT
CTCTCATGCTGTAAAATGAGAGATTGA
Protein sequenceShow/hide protein sequence
MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLDGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNVRRRGHYQKGLLSLAKKILPGNAHLLQLCSFRVS
LSCCKMRD