; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012123 (gene) of Snake gourd v1 genome

Gene IDTan0012123
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionsm-like protein LSM7
Genome locationLG04:83166907..83174025
RNA-Seq ExpressionTan0012123
SyntenyTan0012123
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0000956 - nuclear-transcribed mRNA catabolic process (biological process)
GO:0005688 - U6 snRNP (cellular component)
GO:0005689 - U12-type spliceosomal complex (cellular component)
GO:0071004 - U2-type prespliceosome (cellular component)
GO:0071013 - catalytic step 2 spliceosome (cellular component)
GO:0097526 - spliceosomal tri-snRNP complex (cellular component)
GO:0120115 - Lsm2-8 complex (cellular component)
GO:1990726 - Lsm1-7-Pat1 complex (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_010036102.1 sm-like protein LSM7 [Eucalyptus grandis]1.7e-4796.97Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSP DGTDEIANPF+QPDGA
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

XP_010257259.1 PREDICTED: sm-like protein LSM7 [Nelumbo nucifera]1.3e-4797.98Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSP DGTDEIANPF+QPDGA
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

XP_017969315.1 PREDICTED: sm-like protein LSM7 [Theobroma cacao]1.3e-4797.98Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSP DGTDEIANPFIQPDGA
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

XP_022152570.1 sm-like protein LSM7 [Momordica charantia]3.5e-48100Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

XP_038903588.1 sm-like protein LSM7 [Benincasa hispida]2.3e-4798.99Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTR LGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

TrEMBL top hitse value%identityAlignment
A0A061DVC4 Small nuclear ribonucleoprotein family protein6.5e-4897.98Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSP DGTDEIANPFIQPDGA
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

A0A1U8A739 sm-like protein LSM76.5e-4897.98Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSP DGTDEIANPF+QPDGA
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

A0A2P2J3S8 Sm domain-containing protein6.5e-4897.98Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSP DGTDEIANPFIQPDGA
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

A0A6J1A2J0 sm-like protein LSM76.5e-4897.98Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSP DGTDEIANPFIQPDGA
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

A0A6J1DI46 sm-like protein LSM71.7e-48100Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

SwissProt top hitse value%identityAlignment
O74499 U6 snRNA-associated Sm-like protein LSm77.4e-2554.26Show/hide
Query:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        RKE++LDL+++ D+ +Q   TGGRQ+TG LKG+DQL+NLVLD+  E LR+P+D  K T   R+LGL+V RGT ++L++P+DG++EI NPF+Q +
Subjt:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

Q54HF6 Probable U6 snRNA-associated Sm-like protein LSm76.7e-2657.78Show/hide
Query:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPF
        +KE++LDL KF+ K + VK TGGR+V G LKGYDQL+N+ LD+  EF+RD +DPL TTD+ R LGL+VCRG++VM+V P +G + I NP+
Subjt:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPF

Q9CQQ8 U6 snRNA-associated Sm-like protein LSm72.5e-2857.61Show/hide
Query:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQ
        +KE++LDL+K++DK ++VK  GGR+ +G LKG+D LLNLVLD  +E++RDPDD  K T+ TR+LGL+VCRGT+V+L+ P DG + I NPF+Q
Subjt:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQ

Q9SI54 Sm-like protein LSM72.9e-4593.62Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFI
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EF+RD DDPLKTTDQTRRLGLIVCRGTAVMLVSP DGT+EIANPF+
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFI

Q9UK45 U6 snRNA-associated Sm-like protein LSm76.5e-2958.33Show/hide
Query:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        +KE++LDL+K++DK ++VK  GGR+ +G LKG+D LLNLVLD  IE++RDPDD  K T+ TR+LGL+VCRGT+V+L+ P DG + I NPFIQ   A
Subjt:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

Arabidopsis top hitse value%identityAlignment
AT2G03870.1 Small nuclear ribonucleoprotein family protein2.1e-4693.62Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFI
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EF+RD DDPLKTTDQTRRLGLIVCRGTAVMLVSP DGT+EIANPF+
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFI

AT2G03870.2 Small nuclear ribonucleoprotein family protein2.1e-4693.62Show/hide
Query:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFI
        MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EF+RD DDPLKTTDQTRRLGLIVCRGTAVMLVSP DGT+EIANPF+
Subjt:  MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFI

AT2G23930.1 probable small nuclear ribonucleoprotein G6.9e-1038.67Show/hide
Query:  DLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVD
        DL K++DK +Q+KL   R VTGTL+G+DQ +NLV+D  +E        +   D+T  +G++V RG +++ V  ++
Subjt:  DLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVD

AT2G23930.2 probable small nuclear ribonucleoprotein G8.4e-0837.14Show/hide
Query:  VDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVD
        +DK +Q+KL   R VTGTL+G+DQ +NLV+D  +E        +   D+T  +G++V RG +++ V  ++
Subjt:  VDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVD

AT3G11500.1 Small nuclear ribonucleoprotein family protein1.5e-0936Show/hide
Query:  DLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVD
        DL K++DK +Q+KL   R V GTL+G+DQ +NLV+D  +E            D    +G++V RG +++ V  ++
Subjt:  DLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGGAAGAAAAGAAACTGTTCTGGATTTGGCGAAGTTTGTGGACAAAGGCGTCCAAGTCAAGCTTACCGGCGGCAGACAAGTTACGGGAACGCTCAAAGGATACGA
TCAGTTGCTAAACCTTGTGCTGGATGAAGCTATAGAGTTTTTAAGAGATCCTGATGATCCATTGAAAACAACAGATCAAACCAGGCGTCTTGGCCTAATTGTTTGCAGGG
GGACTGCTGTAATGCTAGTGTCTCCAGTTGACGGTACAGACGAGATTGCTAACCCCTTCATCCAACCGGATGGTGCATAG
mRNA sequenceShow/hide mRNA sequence
GAGAAATTGGGCCGGAGTCCAACTGTGATTGGTTTGGAGGGTACATTTCCCGCCATCGACCATTTCTCTCAACGAGCAAATCTAGGGCACATTCTTTCCGCTTCTCACCA
TTTCCATTGAAGGTTTCTGGTTCGTTTAATTTTGCAGAGGAGCTGAGAAAACGCTAGTGCGATGTCTGGAAGAAAAGAAACTGTTCTGGATTTGGCGAAGTTTGTGGACA
AAGGCGTCCAAGTCAAGCTTACCGGCGGCAGACAAGTTACGGGAACGCTCAAAGGATACGATCAGTTGCTAAACCTTGTGCTGGATGAAGCTATAGAGTTTTTAAGAGAT
CCTGATGATCCATTGAAAACAACAGATCAAACCAGGCGTCTTGGCCTAATTGTTTGCAGGGGGACTGCTGTAATGCTAGTGTCTCCAGTTGACGGTACAGACGAGATTGC
TAACCCCTTCATCCAACCGGATGGTGCATAGATAAAATGACATGGAGATCCTGCTTGTTTGGCCGACAAAGGATTTCTTTTGTTGATAGCCCAACTTGTGTTCTCTTTCG
GCACATGTGACTTTATGCCAACTATGTGAGAGCTGCAACTTGTTGTGCCGGACAATTTTTCAAGAAAATCATGGGATGTATGAAAAGCGGATAGAGGTGGAGAAGTGTTT
GGATCCTTTGAAAATGACTGGGTATTGGTCCTCATATATACTTTGGAGCGTACTTTATTCCTTTAAGTTACAAATTTAGGTCCGTTTGATTACTATTAGATTTTTTATTT
TTGTTTTTTAAAAATTAAGCATATAAATACTACTTTCATCTATGGATTTTTTTGTTTTTTCAAAGACCAAAACATGTTTTAGAAACGAAAAGAAGTAGTTTTCAAAAACT
TGTTTTTGTTTTTGGATTTTGATTAAGAATTCAAATGTTTCATTAAGATAGATGAAACTCATGATAAAGAAATTTGGAGAAAAAGTTTATTTTTTTAAAAATTAGAAACA
AAAAACTAAGGCTTCGTTTGATAGCCATTTAGTTTTTAGTTTTTTGAAAATGAAGCCTATAAATATTATTTCCAC
Protein sequenceShow/hide protein sequence
MSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAIEFLRDPDDPLKTTDQTRRLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA