; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC10G193830 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC10G193830
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionSm-like protein LSM7
Genome locationCmU531Chr10:24988116..24992915
RNA-Seq ExpressionCmUC10G193830
SyntenyCmUC10G193830
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0000956 - nuclear-transcribed mRNA catabolic process (biological process)
GO:0005688 - U6 snRNP (cellular component)
GO:0005689 - U12-type spliceosomal complex (cellular component)
GO:0071004 - U2-type prespliceosome (cellular component)
GO:0071013 - catalytic step 2 spliceosome (cellular component)
GO:0097526 - spliceosomal tri-snRNP complex (cellular component)
GO:0120115 - Lsm2-8 complex (cellular component)
GO:1990726 - Lsm1-7-Pat1 complex (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR017132 - Sm-like protein Lsm7
IPR044641 - Sm-like protein Lsm7/SmG


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ERN11727.1 hypothetical protein AMTR_s00022p00235920 [Amborella trichopoda]2.4e-3696.3Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSP DGTDEI+NPF+QPD
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

XP_004149601.1 sm-like protein LSM7 [Cucumis sativus]3.8e-3798.77Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

XP_011625467.1 sm-like protein LSM7 [Amborella trichopoda]2.4e-3696.3Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSP DGTDEI+NPF+QPD
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

XP_022152570.1 sm-like protein LSM7 [Momordica charantia]3.2e-3698.77Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTR LGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

XP_038903588.1 sm-like protein LSM7 [Benincasa hispida]2.9e-37100Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

TrEMBL top hitse value%identityAlignment
A0A0A0LD28 Sm domain-containing protein1.8e-3798.77Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

A0A1S3TGW6 sm-like protein LSM74.5e-3696.3Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EFLRDPDDPLKTTDQTR LGLIVCRGTAVMLVSP DGTDEIANPFIQPD
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

A0A5A7TXR7 Sm-like protein LSM71.4e-37100Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

A0A6J1DI46 sm-like protein LSM71.5e-3698.77Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTR LGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

W1PP03 Sm domain-containing protein1.2e-3696.3Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSP DGTDEI+NPF+QPD
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

SwissProt top hitse value%identityAlignment
O74499 U6 snRNA-associated Sm-like protein LSm76.3e-1955.7Show/hide
Query:  VQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        +Q   TGGRQ+TG LKG+DQL+NLVLD+  E LR+P+D  K T   R LGL+V RGT ++L++P+DG++EI NPF+Q +
Subjt:  VQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

Q54HF6 Probable U6 snRNA-associated Sm-like protein LSm72.6e-2058.44Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPF
        K + VK TGGR+V G LKGYDQL+N+ LD+  EF+RD +DPL TTD+ R LGL+VCRG++VM+V P +G + I NP+
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPF

Q9CQQ8 U6 snRNA-associated Sm-like protein LSm71.0e-2158.23Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQ
        K ++VK  GGR+ +G LKG+D LLNLVLD  +E++RDPDD  K T+ TR LGL+VCRGT+V+L+ P DG + I NPF+Q
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQ

Q9SI54 Sm-like protein LSM78.2e-3585.54Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDVI
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EF+RD DDPLKTTDQTR LGLIVCRGTAVMLVSP DGT+EIANPF+  + +
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDVI

Q9UK45 U6 snRNA-associated Sm-like protein LSm73.6e-2260.76Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQ
        K ++VK  GGR+ +G LKG+D LLNLVLD  IE++RDPDD  K T+ TR LGL+VCRGT+V+L+ P DG + I NPFIQ
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQ

Arabidopsis top hitse value%identityAlignment
AT2G03870.1 Small nuclear ribonucleoprotein family protein5.8e-3685.54Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDVI
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EF+RD DDPLKTTDQTR LGLIVCRGTAVMLVSP DGT+EIANPF+  + +
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDVI

AT2G03870.2 Small nuclear ribonucleoprotein family protein5.8e-3685.54Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDVI
        KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EF+RD DDPLKTTDQTR LGLIVCRGTAVMLVSP DGT+EIANPF+  + +
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDVI

AT2G23930.1 probable small nuclear ribonucleoprotein G9.7e-0736.99Show/hide
Query:  KKRNYKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD
        KK   K +Q+KL   R VTGTL+G+DQ +NLV+D  +E        +   D+T  +G++V RG +++ V  ++
Subjt:  KKRNYKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD

AT2G23930.2 probable small nuclear ribonucleoprotein G1.6e-0636.76Show/hide
Query:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD
        K +Q+KL   R VTGTL+G+DQ +NLV+D  +E        +   D+T  +G++V RG +++ V  ++
Subjt:  KGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD

AT3G11500.1 Small nuclear ribonucleoprotein family protein2.2e-0634.25Show/hide
Query:  KKRNYKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD
        KK   K +Q+KL   R V GTL+G+DQ +NLV+D  +E            D    +G++V RG +++ V  ++
Subjt:  KKRNYKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGGGCCGGAAGAGAAATTGGGCCGGAGTCCAACTGCCTATGGTTTGGAGGGTACTTTTTCCCGCCATCGACCATTTCTCTCAACGAGCAAATCTAGGGCACATTC
TTTCCGCTCCTCACCACTTCCATTGAAAGTTTCTGGTGCGTTTAATTTTACGGAGGAGCTGAGGAAACGATTGCGCGATGTCTGGAAGAAAAGAAACTACAAAGGCGTCC
AAGTGAAGCTTACCGGCGGCAGACAAGTTACTGGAACGCTCAAAGGATATGATCAATTGCTAAACCTTGTGCTGGATGAAGCTATAGAGTTTCTAAGAGATCCTGATGAT
CCATTGAAGACAACAGATCAAACCCGGCCTCTTGGCCTAATCGTCTGCAGGGGGACTGCTGTAATGCTTGTGTCTCCAGTTGATGGTACCGACGAGATTGCTAACCCCTT
CATCCAACCGGATGTGATATGGGACCCTGCTCCTTTGGCCGACAAAGGATTTCTTTTGTTGATTGCCCCAACTTGTTCTCTTTCGGCACATGTGAGACTGAGACTTTATG
CCAAGCCGAGCCAACTATGTGGACTCAGCTCTCCAACTTGTGCTGAGGGAAACTTAGAGGCAATTTGCAATGTTGCATGCAACCTTACCCCTAACTTTTTGCACTGCACT
GTTTTCTATGACCTTTTAGCTCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGATCGGGCCGGAAGAGAAATTGGGCCGGAGTCCAACTGCCTATGGTTTGGAGGGTACTTTTTCCCGCCATCGACCATTTCTCTCAACGAGCAAATCTAGGGCACATTC
TTTCCGCTCCTCACCACTTCCATTGAAAGTTTCTGGTGCGTTTAATTTTACGGAGGAGCTGAGGAAACGATTGCGCGATGTCTGGAAGAAAAGAAACTACAAAGGCGTCC
AAGTGAAGCTTACCGGCGGCAGACAAGTTACTGGAACGCTCAAAGGATATGATCAATTGCTAAACCTTGTGCTGGATGAAGCTATAGAGTTTCTAAGAGATCCTGATGAT
CCATTGAAGACAACAGATCAAACCCGGCCTCTTGGCCTAATCGTCTGCAGGGGGACTGCTGTAATGCTTGTGTCTCCAGTTGATGGTACCGACGAGATTGCTAACCCCTT
CATCCAACCGGATGTGATATGGGACCCTGCTCCTTTGGCCGACAAAGGATTTCTTTTGTTGATTGCCCCAACTTGTTCTCTTTCGGCACATGTGAGACTGAGACTTTATG
CCAAGCCGAGCCAACTATGTGGACTCAGCTCTCCAACTTGTGCTGAGGGAAACTTAGAGGCAATTTGCAATGTTGCATGCAACCTTACCCCTAACTTTTTGCACTGCACT
GTTTTCTATGACCTTTTAGCTCACTGA
Protein sequenceShow/hide protein sequence
MIGPEEKLGRSPTAYGLEGTFSRHRPFLSTSKSRAHSFRSSPLPLKVSGAFNFTEELRKRLRDVWKKRNYKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDD
PLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDVIWDPAPLADKGFLLLIAPTCSLSAHVRLRLYAKPSQLCGLSSPTCAEGNLEAICNVACNLTPNFLHCT
VFYDLLAH