CuGenDBv2

MS016842 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS016842
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Sm protein F
Genome location: scaffold9_1:388223..389913
RNA-Seq Expression: MS016842
Synteny: MS016842
Gene Ontology terms:
  GO:0000387 - spliceosomal snRNP assembly (biological process)
  GO:0005685 - U1 snRNP (cellular component)
  GO:0071013 - catalytic step 2 spliceosome (cellular component)
  GO:0003723 - RNA binding (molecular function)
InterPro domains:
  IPR001163 - LSM domain, eukaryotic/archaea-type
  IPR010920 - LSM domain superfamily
  IPR016487 - Sm-like protein Lsm6/SmF
  IPR034100 - Small nuclear ribonucleoprotein F
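
The genome location field can be split into a sequence ID and a coordinate pair. The sketch below is a minimal illustration, not part of CuGenDB: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("scaffold9_1:388223..389913")

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 1691 bp on scaffold9_1.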


Homology
GenBank top hits (e value, % identity, alignment)
CAB4298371.1 unnamed protein product [Prunus armeniaca] (e value: 1.1e-40; identity: 95.4%)
Query:  QSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        Q+IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIEDAE+E
Subjt:  QSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

KAF4391041.1 hypothetical protein F8388_024873 [Cannabis sativa] (e value: 2.4e-40; identity: 97.67%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        SIPVNPKPFLNNLTGKTV VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIEDAERE
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

XP_004146017.2 probable small nuclear ribonucleoprotein F isoform X1 [Cucumis sativus] (e value: 6.2e-41; identity: 98.84%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAER+
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

XP_022144158.1 probable small nuclear ribonucleoprotein F [Momordica charantia] (e value: 8.2e-41; identity: 98.84%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIED ERE
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

XP_030480982.1 probable small nuclear ribonucleoprotein F [Cannabis sativa] (e value: 2.4e-40; identity: 97.67%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        SIPVNPKPFLNNLTGKTV VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIEDAERE
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CKH5 Sm protein F (e value: 3.0e-41; identity: 98.84%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAER+
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

A0A5D3DWN8 Sm protein F (e value: 3.0e-41; identity: 98.84%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAER+
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

A0A6J1CSK0 Sm protein F (e value: 3.9e-41; identity: 98.84%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIED ERE
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

A0A6J5WDD8 Sm protein F (e value: 5.2e-41; identity: 95.4%)
Query:  QSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        Q+IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIEDAE+E
Subjt:  QSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

A0A803QDU2 Uncharacterized protein (e value: 5.2e-41; identity: 96.55%)
Query:  QSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        +SIPVNPKPFLNNLTGKTV VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIEDAERE
Subjt:  QSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

SwissProt top hits (e value, % identity, alignment)
P62306 Small nuclear ribonucleoprotein F (e value: 6.5e-33; identity: 79.75%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        S+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

P62307 Small nuclear ribonucleoprotein F (e value: 6.5e-33; identity: 79.75%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        S+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

P62321 Small nuclear ribonucleoprotein F (e value: 6.5e-33; identity: 79.75%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        S+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

Q3T0Z8 Small nuclear ribonucleoprotein F (e value: 6.5e-33; identity: 79.75%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        S+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

Q9SUM2 Probable small nuclear ribonucleoprotein F (e value: 5.5e-40; identity: 87.21%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        +IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA+++
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

Arabidopsis top hits (e value, % identity, alignment)
AT2G14285.1 Small nuclear ribonucleoprotein family protein (e value: 4.6e-26; identity: 85.25%)
Query:  MEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA+++
Subjt:  MEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

AT2G43810.1 Small nuclear ribonucleoprotein family protein (e value: 4.1e-14; identity: 46.97%)
Query:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL
        P  FL ++ GK VVVKL  G++Y+G L  +D YMN+ +  TEEY++GQ   + G+  +R NNVLY+
Subjt:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL

AT2G43810.2 Small nuclear ribonucleoprotein family protein (e value: 4.1e-14; identity: 46.97%)
Query:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL
        P  FL ++ GK VVVKL  G++Y+G L  +D YMN+ +  TEEY++GQ   + G+  +R NNVLY+
Subjt:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL

AT4G30220.1 small nuclear ribonucleoprotein F (e value: 3.9e-41; identity: 87.21%)
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        +IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA+++
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

AT4G30220.2 small nuclear ribonucleoprotein F (e value: 7.9e-42; identity: 87.36%)
Query:  QSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        Q+IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA+++
Subjt:  QSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
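
The % identity values in the hit tables can be recomputed from the middle line of each alignment. The sketch below assumes the usual BLAST-style convention for that line: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `percent_identity` is hypothetical, and the caller is assumed to pass the middle line with its leading indentation removed but internal spaces kept.

```python
def percent_identity(midline):
    """Percent of alignment columns whose midline character is a letter."""
    identical = sum(ch.isalpha() for ch in midline)
    return round(100 * identical / len(midline), 2)

# Middle line of the CAB4298371.1 (Prunus armeniaca) alignment above.
prunus = ("Q+IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRC"
          "NNVLYLRGVPEDEEIEDAE+E")

# Middle line of the KAF4391041.1 (Cannabis sativa) alignment above.
cannabis = ("SIPVNPKPFLNNLTGKTV VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRC"
            "NNVLYLRGVPEDEEIEDAERE")

print(percent_identity(prunus))    # 83 identical columns out of 87
print(percent_identity(cannabis))  # 84 identical columns out of 86
```

Both results agree with the values listed for those hits (95.4 and 97.67). A trailing space in the middle line would be lost if the text were stripped on the right, so the length of the query line is a safer denominator when the two differ.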


Sequences
CDS sequence
CAGAGCATCCCAGTTAACCCGAAGCCATTCTTGAACAATTTGACCGGAAAGACTGTGGTGGTGAAACTCAAGTGGGGGATGGAGTACAAAGGCTTTCTCGTCTCAGTGGA
CTCGTACATGAACTTACAGCTTGCCAATACTGAGGAATATATTGACGGACAATTTACTGGGAGTCTAGGAGAAATATTGATCAGGTGTAATAATGTTCTTTACCTTCGTG
GAGTACCGGAGGATGAAGAAATTGAAGATGCGGAGCGTGAG
mRNA sequence
CAGAGCATCCCAGTTAACCCGAAGCCATTCTTGAACAATTTGACCGGAAAGACTGTGGTGGTGAAACTCAAGTGGGGGATGGAGTACAAAGGCTTTCTCGTCTCAGTGGA
CTCGTACATGAACTTACAGCTTGCCAATACTGAGGAATATATTGACGGACAATTTACTGGGAGTCTAGGAGAAATATTGATCAGGTGTAATAATGTTCTTTACCTTCGTG
GAGTACCGGAGGATGAAGAAATTGAAGATGCGGAGCGTGAG
Protein sequence
QSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
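
The listed protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) and reading frame 1; the names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB. Note that the CDS as listed does not begin with ATG and does not end with a stop codon, so the check only confirms that the two sequences correspond, not that the gene model is complete.

```python
from itertools import product

# Standard genetic code, amino acids listed in TCAG codon order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# CDS and protein exactly as listed in this record.
CDS = (
    "CAGAGCATCCCAGTTAACCCGAAGCCATTCTTGAACAATTTGACCGGAAAGACTGTGGTGGTGAAACTCAAGTGGGGGATGGAGTACAAAGGCTTTCTCGTCTCAGTGGA"
    "CTCGTACATGAACTTACAGCTTGCCAATACTGAGGAATATATTGACGGACAATTTACTGGGAGTCTAGGAGAAATATTGATCAGGTGTAATAATGTTCTTTACCTTCGTG"
    "GAGTACCGGAGGATGAAGAAATTGAAGATGCGGAGCGTGAG"
)
PROTEIN = "QSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE"

print(len(CDS), len(PROTEIN))
print(translate(CDS) == PROTEIN)
```

The 261-nucleotide CDS translates to the 87-residue protein shown above.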