CuGenDBv2

Sgr018403 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018403
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Sm domain-containing protein
Genome location: tig00153197:1389408..1392297
RNA-Seq Expression: Sgr018403
Synteny: Sgr018403
Gene Ontology terms:
GO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0030490 - maturation of SSU-rRNA (biological process)
GO:0000932 - P-body (cellular component)
GO:0005681 - spliceosomal complex (cellular component)
GO:0005688 - U6 snRNP (cellular component)
GO:0005730 - nucleolus (cellular component)
GO:0005732 - small nucleolar ribonucleoprotein complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0046540 - U4/U6 x U5 tri-snRNP complex (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domains:
IPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR016487 - Sm-like protein Lsm6/SmF
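
The genome location above follows a `contig:start..end` layout. As a minimal sketch, the string can be split into its parts and the spanned length computed as below. The names `Location` and `parse_location` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into contig, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153197:1389408..1392297")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 2890 positions on the contig.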


Homology
GenBank top hits | e value | % identity
KAG6599859.1 Sm-like protein LSM36B, partial [Cucurbita argyrosperma subsp. sororia] | 5.0e-34 | 90.36
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MSIGGEKGSASTKTPADFLKSIRG PVVVKLNSGVDYRGILACL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +S
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

KAG6600796.1 Eukaryotic translation initiation factor 2 subunit 3, Y-linked, partial [Cucurbita argyrosperma subsp. sororia] | 3.8e-34 | 90.36
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MSIGGEKGS STKTPADFLKSIRGRPVVVKLNSGVDYRGILACL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +S
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

XP_004140401.1 sm-like protein LSM36B [Cucumis sativus] | 1.3e-34 | 91.57
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +S
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

XP_022942809.1 U6 snRNA-associated Sm-like protein LSm6 [Cucurbita moschata] | 3.8e-34 | 90.36
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MSIGGEKGS STKTPADFLKSIRGRPVVVKLNSGVDYRGILACL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +S
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

XP_022974953.1 U6 snRNA-associated Sm-like protein LSm6 [Cucurbita maxima] | 3.8e-34 | 90.36
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MSIGGEKGS STKTPADFLKSIRGRPVVVKLNSGVDYRGILACL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +S
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

TrEMBL top hits | e value | % identity
A0A0A0KSS9 Sm domain-containing protein | 6.3e-35 | 91.57
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +S
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

A0A1S3CBJ6 sm-like protein LSM36B | 6.3e-35 | 91.57
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +S
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

A0A6J1DNH0 sm-like protein LSM36B | 6.3e-35 | 91.57
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +S
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

A0A6J1FPX7 U6 snRNA-associated Sm-like protein LSm6 | 1.8e-34 | 90.36
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MSIGGEKGS STKTPADFLKSIRGRPVVVKLNSGVDYRGILACL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +S
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

A0A6J1IHU4 U6 snRNA-associated Sm-like protein LSm6 | 1.8e-34 | 90.36
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MSIGGEKGS STKTPADFLKSIRGRPVVVKLNSGVDYRGILACL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +S
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

SwissProt top hits | e value | % identity
A8NHT8 U6 snRNA-associated Sm-like protein LSm6 | 1.3e-21 | 67.61
Query:  TKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGS
        T +P DFLK + G+ VVV+L SGVDYRGIL+CL+GYMNIAMEQTEE VNG++ N+YGDAFIRGNN + + +
Subjt:  TKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGS

O22823 Sm-like protein LSM36B | 4.8e-32 | 79.52
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MS  GEK S +TKTPADFLKSIRG+PVVVKLNSGVDYRGIL CL+GYMNIAMEQTEEYVNGQLKN YGDAF+RGNN + + ++
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

P62312 U6 snRNA-associated Sm-like protein LSm6 | 7.2e-28 | 78.38
Query:  SASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGS
        S   +TP+DFLK I GRPVVVKLNSGVDYRG+LACL+GYMNIA+EQTEEYVNGQLKNKYGDAFIRGNN + + +
Subjt:  SASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGS

P62313 U6 snRNA-associated Sm-like protein LSm6 | 7.2e-28 | 78.38
Query:  SASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGS
        S   +TP+DFLK I GRPVVVKLNSGVDYRG+LACL+GYMNIA+EQTEEYVNGQLKNKYGDAFIRGNN + + +
Subjt:  SASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGS

Q9M1Z3 Sm-like protein LSM6A | 3.1e-31 | 84.42
Query:  EKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGS
        EK S +TKTPADFLKSIRGRPVVVKLNSGVDYRG L CL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +
Subjt:  EKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGS

Arabidopsis top hits | e value | % identity
AT2G43810.1 Small nuclear ribonucleoprotein family protein | 3.4e-33 | 79.52
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MS  GEK S +TKTPADFLKSIRG+PVVVKLNSGVDYRGIL CL+GYMNIAMEQTEEYVNGQLKN YGDAF+RGNN + + ++
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

AT2G43810.2 Small nuclear ribonucleoprotein family protein | 3.4e-33 | 79.52
Query:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS
        MS  GEK S +TKTPADFLKSIRG+PVVVKLNSGVDYRGIL CL+GYMNIAMEQTEEYVNGQLKN YGDAF+RGNN + + ++
Subjt:  MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSS

AT3G59810.1 Small nuclear ribonucleoprotein family protein | 2.2e-32 | 84.42
Query:  EKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGS
        EK S +TKTPADFLKSIRGRPVVVKLNSGVDYRG L CL+GYMNIAMEQTEEYVNGQLKNKYGDAFIRGNN + + +
Subjt:  EKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGS

AT4G30220.1 small nuclear ribonucleoprotein F | 1.3e-11 | 43.94
Query:  PADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVV
        P  FL ++ G+ V+VKL  G++Y+G LA ++ YMN+ +  TEEY++GQL    G+  IR NN + V
Subjt:  PADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVV

AT4G30220.2 small nuclear ribonucleoprotein F | 1.3e-11 | 43.94
Query:  PADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVV
        P  FL ++ G+ V+VKL  G++Y+G LA ++ YMN+ +  TEEY++GQL    G+  IR NN + V
Subjt:  PADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVV
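
The % identity figures in the hit lists above can be cross-checked against the alignments. In each alignment, the middle line carries a letter where query and subject share a residue, a `+` for a conservative substitution, and a space otherwise. The sketch below assumes that convention and assumes the alignment has no gaps, so the query length equals the alignment length. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, match_line: str) -> float:
    """Percentage of alignment columns marked as identical in the match line.

    Assumes an ungapped alignment, so len(query) is the alignment length.
    The match line is padded on the right in case trailing spaces were lost.
    """
    if not query:
        raise ValueError("empty query line")
    padded = match_line.ljust(len(query))
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and match line copied from the AT4G30220.2 alignment.
QUERY = "PADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVV"
MATCH_LINE = "P  FL ++ G+ V+VKL  G++Y+G LA ++ YMN+ +  TEEY++GQL    G+  IR NN + V"

print(f"{percent_identity(QUERY, MATCH_LINE):.2f}")
```

For this alignment the match line holds 29 letters over 66 columns, which agrees with the listed 43.94.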


Sequences
CDS sequence
ATGAGTATTGGGGGTGAAAAAGGGTCTGCATCAACAAAGACACCAGCGGACTTTCTCAAATCGATTCGGGGCCGTCCAGTAGTTGTGAAGCTCAATTCTGGTGTTGACTA
TCGAGGTATTTTAGCTTGTCTCGAGGGATACATGAATATAGCAATGGAGCAGACGGAGGAATACGTAAATGGGCAGTTGAAGAACAAATATGGCGATGCTTTCATCCGTG
GAAATAATGGGGTAGTGGTCGGCAGTAGTTGGATTGAAAATCAAAAACTCACCTTCATGTCTATTGGGATCGTAGCTTTGGACTTCGGGAAAAGAAGAGATCGGAGGGAG
TTGGAGTTGGGACTTAGCGTGGAACTTGGTAATATTGGTGTTTGGGCCACTTGGGCAAACTGCTTCTCCATGAACGACAAACTGCTTGGAGTCGAACGTAATCGTTTCTG
TTATGGTGAAAAGAAAACTCTAAATTTAGCCCGCAGAATGGAGAACGTGAGCGACAAAGGGGAACAAGATTGA
mRNA sequence
ATGAGTATTGGGGGTGAAAAAGGGTCTGCATCAACAAAGACACCAGCGGACTTTCTCAAATCGATTCGGGGCCGTCCAGTAGTTGTGAAGCTCAATTCTGGTGTTGACTA
TCGAGGTATTTTAGCTTGTCTCGAGGGATACATGAATATAGCAATGGAGCAGACGGAGGAATACGTAAATGGGCAGTTGAAGAACAAATATGGCGATGCTTTCATCCGTG
GAAATAATGGGGTAGTGGTCGGCAGTAGTTGGATTGAAAATCAAAAACTCACCTTCATGTCTATTGGGATCGTAGCTTTGGACTTCGGGAAAAGAAGAGATCGGAGGGAG
TTGGAGTTGGGACTTAGCGTGGAACTTGGTAATATTGGTGTTTGGGCCACTTGGGCAAACTGCTTCTCCATGAACGACAAACTGCTTGGAGTCGAACGTAATCGTTTCTG
TTATGGTGAAAAGAAAACTCTAAATTTAGCCCGCAGAATGGAGAACGTGAGCGACAAAGGGGAACAAGATTGA
Protein sequence
MSIGGEKGSASTKTPADFLKSIRGRPVVVKLNSGVDYRGILACLEGYMNIAMEQTEEYVNGQLKNKYGDAFIRGNNGVVVGSSWIENQKLTFMSIGIVALDFGKRRDRRE
LELGLSVELGNIGVWATWANCFSMNDKLLGVERNRFCYGEKKTLNLARRMENVSDKGEQD
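
The protein sequence should correspond to the CDS read codon by codon. A minimal sketch of that check follows, assuming the standard genetic code (the record does not say which translation table was used). The names `translate` and `CDS_PREFIX` are hypothetical, and only the first 48 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 48 nucleotides of the CDS sequence listed above.
CDS_PREFIX = "ATGAGTATTGGGGGTGAAAAAGGGTCTGCATCAACAAAGACACCAGCG"
print(translate(CDS_PREFIX))  # MSIGGEKGSASTKTPA
```

The output matches the first 16 residues of the protein sequence. Passing the full CDS, with its line breaks, to `translate` would extend the same comparison to the whole record.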