; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy4G080090 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy4G080090
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionSm domain-containing protein
Genome locationchrH04:19432535..19436232
RNA-Seq ExpressionChy4G080090
SyntenyChy4G080090
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0000956 - nuclear-transcribed mRNA catabolic process (biological process)
GO:0005688 - U6 snRNP (cellular component)
GO:0005689 - U12-type spliceosomal complex (cellular component)
GO:0071004 - U2-type prespliceosome (cellular component)
GO:0071013 - catalytic step 2 spliceosome (cellular component)
GO:0097526 - spliceosomal tri-snRNP complex (cellular component)
GO:0120115 - Lsm2-8 complex (cellular component)
GO:1990726 - Lsm1-7-Pat1 complex (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR017132 - Sm-like protein Lsm7
IPR044641 - Sm-like protein Lsm7/SmG


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CBI24863.3 unnamed protein product, partial [Vitis vinifera]1.20e-6196.04Show/hide
Query:  IFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDG
        +FQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDP KTTDQTR LGLIVCRGTAVMLVSP DGTDEIANPFIQPDG
Subjt:  IFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDG

Query:  A
        A
Subjt:  A

OMO69793.1 hypothetical protein CCACVL1_19268 [Corchorus capsularis]8.80e-6397.03Show/hide
Query:  IFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDG
        +FQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTR LGLIVCRGTAVMLVSP DGTDEIANPFIQPDG
Subjt:  IFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDG

Query:  A
        A
Subjt:  A

XP_004149601.1 sm-like protein LSM7 [Cucumis sativus]1.14e-62100Show/hide
Query:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
Subjt:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

XP_038903588.1 sm-like protein LSM7 [Benincasa hispida]1.61e-6298.98Show/hide
Query:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
Subjt:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

XP_038984132.1 sm-like protein LSM7 [Phoenix dactylifera]9.57e-6287.27Show/hide
Query:  LVTKRLTSRIFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEI
        L + +    +FQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EFLRDPDDPLKTTDQTR LGLIVCRGTAVMLVSP DGTDEI
Subjt:  LVTKRLTSRIFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEI

Query:  ANPFIQPDGA
        ANPF+QP+GA
Subjt:  ANPFIQPDGA

TrEMBL top hitse value%identityAlignment
A0A0A0LD28 Sm domain-containing protein8.2e-48100Show/hide
Query:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
Subjt:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

A0A1R3HHK1 Sm domain-containing protein6.2e-4897.03Show/hide
Query:  IFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDG
        +FQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTR LGLIVCRGTAVMLVSP DGTDEIANPFIQPDG
Subjt:  IFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDG

Query:  A
        A
Subjt:  A

A0A1R3IE97 Sm domain-containing protein6.2e-4897.03Show/hide
Query:  IFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDG
        +FQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTR LGLIVCRGTAVMLVSP DGTDEIANPFIQPDG
Subjt:  IFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDG

Query:  A
        A
Subjt:  A

A0A5A7TXR7 Sm-like protein LSM71.1e-4798.98Show/hide
Query:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEA+EFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
Subjt:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

D7T2Y2 Sm domain-containing protein4.1e-4796.04Show/hide
Query:  IFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDG
        +FQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDP KTTDQTR LGLIVCRGTAVMLVSP DGTDEIANPFIQPDG
Subjt:  IFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDG

Query:  A
        A
Subjt:  A

SwissProt top hitse value%identityAlignment
O74499 U6 snRNA-associated Sm-like protein LSm72.7e-2454.26Show/hide
Query:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
        RKE++LDL+++ D+ +Q   TGGRQ+TG LKG+DQL+NLVLD+  E LR+P+D  K T   R LGL+V RGT ++L++P+DG++EI NPF+Q +
Subjt:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD

Q54HF6 Probable U6 snRNA-associated Sm-like protein LSm71.9e-2557.78Show/hide
Query:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPF
        +KE++LDL KF+ K + VK TGGR+V G LKGYDQL+N+ LD+  EF+RD +DPL TTD+ R LGL+VCRG++VM+V P +G + I NP+
Subjt:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPF

Q9CQQ8 U6 snRNA-associated Sm-like protein LSm71.2e-2757.61Show/hide
Query:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQ
        +KE++LDL+K++DK ++VK  GGR+ +G LKG+D LLNLVLD  +E++RDPDD  K T+ TR LGL+VCRGT+V+L+ P DG + I NPF+Q
Subjt:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQ

Q9SI54 Sm-like protein LSM71.5e-4393.55Show/hide
Query:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFI
        SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEF+RD DDPLKTTDQTR LGLIVCRGTAVMLVSP DGT+EIANPF+
Subjt:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFI

Q9UK45 U6 snRNA-associated Sm-like protein LSm73.1e-2857.29Show/hide
Query:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA
        +KE++LDL+K++DK ++VK  GGR+ +G LKG+D LLNLVLD  +E++RDPDD  K T+ TR LGL+VCRGT+V+L+ P DG + I NPFIQ   A
Subjt:  RKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPDGA

Arabidopsis top hitse value%identityAlignment
AT2G03870.1 Small nuclear ribonucleoprotein family protein1.1e-4493.55Show/hide
Query:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFI
        SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEF+RD DDPLKTTDQTR LGLIVCRGTAVMLVSP DGT+EIANPF+
Subjt:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFI

AT2G03870.2 Small nuclear ribonucleoprotein family protein1.1e-4493.55Show/hide
Query:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFI
        SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEF+RD DDPLKTTDQTR LGLIVCRGTAVMLVSP DGT+EIANPF+
Subjt:  SGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFI

AT2G23930.1 probable small nuclear ribonucleoprotein G8.7e-1040Show/hide
Query:  DLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD
        DL K++DK +Q+KL   R VTGTL+G+DQ +NLV+D  VE        +   D+T  +G++V RG +++ V  ++
Subjt:  DLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD

AT2G23930.2 probable small nuclear ribonucleoprotein G1.1e-0738.57Show/hide
Query:  VDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD
        +DK +Q+KL   R VTGTL+G+DQ +NLV+D  VE        +   D+T  +G++V RG +++ V  ++
Subjt:  VDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD

AT3G11500.1 Small nuclear ribonucleoprotein family protein1.9e-0937.33Show/hide
Query:  DLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD
        DL K++DK +Q+KL   R V GTL+G+DQ +NLV+D  VE            D    +G++V RG +++ V  ++
Subjt:  DLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGGGCCGGACGAGAAATTGGGCCACAGTCCAACCGTCTATGGTTTGGAGGGTTACCCTTCCCGCCATCCACCATTTCTCTCAACGAGCAAATCTAGGGCA
CATTCTTTCCGCTCCTCATCACTTCCATTGAAAGTAAGCTTCAATTCACTCTCCTTCATTGCACAAATTTCGGTTTTCTGCACTCTTCTTATACTCAATGTACCT
TCCAGTTCTTCAATTTTTTTATTTCCAGACGCATATAATGCCCTAACTTCCCCGAGTTTTATTTTAATCGATCAGCACTTGGAATTTTCTTTTTCACTCGTAACG
AAACGTCTCACTTCTCGGATATTTCAGTCTGGAAGAAAAGAAACTGTTCTGGATTTGGCTAAGTTTGTAGACAAAGGCGTCCAAGTGAAGCTTACTGGCGGCAGA
CAAGTTACTGGAACGCTCAAAGGATATGATCAATTGCTAAACCTTGTGCTGGATGAAGCTGTAGAATTTTTAAGAGATCCTGATGATCCATTGAAGACAACAGAT
CAAACCAGGCCTCTTGGCCTAATTGTGTGCAGGGGGACTGCTGTAATGCTGGTGTCTCCAGTTGATGGAACAGACGAGATTGCTAACCCTTTCATCCAACCCGAT
GGTGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGATTGGGCCGGACGAGAAATTGGGCCACAGTCCAACCGTCTATGGTTTGGAGGGTTACCCTTCCCGCCATCCACCATTTCTCTCAACGAGCAAATCTAGGGCA
CATTCTTTCCGCTCCTCATCACTTCCATTGAAAGTAAGCTTCAATTCACTCTCCTTCATTGCACAAATTTCGGTTTTCTGCACTCTTCTTATACTCAATGTACCT
TCCAGTTCTTCAATTTTTTTATTTCCAGACGCATATAATGCCCTAACTTCCCCGAGTTTTATTTTAATCGATCAGCACTTGGAATTTTCTTTTTCACTCGTAACG
AAACGTCTCACTTCTCGGATATTTCAGTCTGGAAGAAAAGAAACTGTTCTGGATTTGGCTAAGTTTGTAGACAAAGGCGTCCAAGTGAAGCTTACTGGCGGCAGA
CAAGTTACTGGAACGCTCAAAGGATATGATCAATTGCTAAACCTTGTGCTGGATGAAGCTGTAGAATTTTTAAGAGATCCTGATGATCCATTGAAGACAACAGAT
CAAACCAGGCCTCTTGGCCTAATTGTGTGCAGGGGGACTGCTGTAATGCTGGTGTCTCCAGTTGATGGAACAGACGAGATTGCTAACCCTTTCATCCAACCCGAT
GGTGCATAG
Protein sequenceShow/hide protein sequence
MIGPDEKLGHSPTVYGLEGYPSRHPPFLSTSKSRAHSFRSSSLPLKVSFNSLSFIAQISVFCTLLILNVPSSSSIFLFPDAYNALTSPSFILIDQHLEFSFSLVT
KRLTSRIFQSGRKETVLDLAKFVDKGVQVKLTGGRQVTGTLKGYDQLLNLVLDEAVEFLRDPDDPLKTTDQTRPLGLIVCRGTAVMLVSPVDGTDEIANPFIQPD
GA