; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G021050 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G021050
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionSm protein F
Genome locationGy14Chr4:27746984..27748363
RNA-Seq ExpressionCsGy4G021050
SyntenyCsGy4G021050
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005685 - U1 snRNP (cellular component)
GO:0071013 - catalytic step 2 spliceosome (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR016487 - Sm-like protein Lsm6/SmF
IPR034100 - Small nuclear ribonucleoprotein F


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3452786.1 hypothetical protein FNV43_RR03219 [Rhamnella rubrinervis]1.74e-5597.67Show/hide
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        +IPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIEDAERD
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

XP_004146017.2 probable small nuclear ribonucleoprotein F isoform X1 [Cucumis sativus]3.53e-57100Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

XP_022144158.1 probable small nuclear ribonucleoprotein F [Momordica charantia]5.87e-5697.7Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIED ER+
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

XP_022951684.1 probable small nuclear ribonucleoprotein F [Cucurbita moschata]1.19e-5597.7Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQ TGSLGEILIRCNNVLYLRGVPEDEE+EDAERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

XP_022961705.1 probable small nuclear ribonucleoprotein F [Cucurbita moschata]5.87e-5698.85Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQ TGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

TrEMBL top hitse value%identityAlignment
A0A1S3CKH5 Sm protein F1.71e-57100Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

A0A5D3DWN8 Sm protein F1.71e-57100Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

A0A6J1CSK0 Sm protein F2.84e-5697.7Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIED ER+
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

A0A6J1HD26 Sm protein F2.84e-5698.85Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQ TGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

A0A6J1KQT2 Sm protein F2.84e-5698.85Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQ TGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

SwissProt top hitse value%identityAlignment
P62306 Small nuclear ribonucleoprotein F1.3e-3380Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

P62307 Small nuclear ribonucleoprotein F1.3e-3380Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

P62321 Small nuclear ribonucleoprotein F1.3e-3380Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

Q3T0Z8 Small nuclear ribonucleoprotein F1.3e-3380Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

Q9SUM2 Probable small nuclear ribonucleoprotein F8.6e-4188.37Show/hide
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        +IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA++D
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

Arabidopsis top hitse value%identityAlignment
AT2G14285.1 Small nuclear ribonucleoprotein family protein9.4e-2786.89Show/hide
Query:  MEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        MEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA++D
Subjt:  MEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

AT2G43810.1 Small nuclear ribonucleoprotein family protein3.1e-1446.97Show/hide
Query:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL
        P  FL ++ GK VVVKL  G++Y+G L  +D YMN+ +  TEEY++GQ   + G+  +R NNVLY+
Subjt:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL

AT2G43810.2 Small nuclear ribonucleoprotein family protein3.1e-1446.97Show/hide
Query:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL
        P  FL ++ GK VVVKL  G++Y+G L  +D YMN+ +  TEEY++GQ   + G+  +R NNVLY+
Subjt:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL

AT4G30220.1 small nuclear ribonucleoprotein F6.1e-4288.37Show/hide
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        +IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA++D
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD

AT4G30220.2 small nuclear ribonucleoprotein F6.1e-4288.37Show/hide
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD
        +IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA++D
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATACCAGTTAACCCGAAGCCATTTTTGAACAATTTGACTGGAAAGACTGTGGTTGTGAAACTCAAGTGGGGAATGGAGTACAAAGGATTTCTTGTCTCGGTAGA
CTCGTACATGAACTTGCAGCTTGCCAATACTGAGGAATATATCGACGGGCAATTTACTGGGAGTCTTGGAGAAATATTGATCAGGTGTAACAATGTTCTTTATCTTCGAG
GAGTGCCAGAGGATGAAGAGATTGAAGATGCGGAGCGTGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCATACCAGTTAACCCGAAGCCATTTTTGAACAATTTGACTGGAAAGACTGTGGTTGTGAAACTCAAGTGGGGAATGGAGTACAAAGGATTTCTTGTCTCGGTAGA
CTCGTACATGAACTTGCAGCTTGCCAATACTGAGGAATATATCGACGGGCAATTTACTGGGAGTCTTGGAGAAATATTGATCAGGTGTAACAATGTTCTTTATCTTCGAG
GAGTGCCAGAGGATGAAGAGATTGAAGATGCGGAGCGTGACTAG
Protein sequenceShow/hide protein sequence
MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERD