; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004401 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004401
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSm protein F
Genome locationchr6:3525118..3526644
RNA-Seq ExpressionLag0004401
SyntenyLag0004401
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005685 - U1 snRNP (cellular component)
GO:0071013 - catalytic step 2 spliceosome (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR016487 - Sm-like protein Lsm6/SmF
IPR034100 - Small nuclear ribonucleoprotein F


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3452786.1 hypothetical protein FNV43_RR03219 [Rhamnella rubrinervis]3.1e-4096.51Show/hide
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        +IPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIE+AERD
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

XP_004146017.2 probable small nuclear ribonucleoprotein F isoform X1 [Cucumis sativus]1.6e-4198.85Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIE+AERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

XP_022144158.1 probable small nuclear ribonucleoprotein F [Momordica charantia]1.4e-4096.55Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIE+ ER+
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

XP_022951684.1 probable small nuclear ribonucleoprotein F [Cucurbita moschata]2.4e-4096.55Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQ TGSLGEILIRCNNVLYLRGVPEDEE+E+AERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

XP_022961705.1 probable small nuclear ribonucleoprotein F [Cucurbita moschata]1.4e-4097.7Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQ TGSLGEILIRCNNVLYLRGVPEDEEIE+AERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

TrEMBL top hitse value%identityAlignment
A0A1S3CKH5 Sm protein F8.0e-4298.85Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIE+AERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

A0A5D3DWN8 Sm protein F8.0e-4298.85Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIE+AERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

A0A6J1CSK0 Sm protein F6.7e-4196.55Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIE+ ER+
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

A0A6J1HD26 Sm protein F6.7e-4197.7Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQ TGSLGEILIRCNNVLYLRGVPEDEEIE+AERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

A0A6J1KQT2 Sm protein F6.7e-4197.7Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQ TGSLGEILIRCNNVLYLRGVPEDEEIE+AERD
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

SwissProt top hitse value%identityAlignment
P62306 Small nuclear ribonucleoprotein F1.7e-3380Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

P62307 Small nuclear ribonucleoprotein F1.7e-3380Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

P62321 Small nuclear ribonucleoprotein F1.7e-3380Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

Q3T0Z8 Small nuclear ribonucleoprotein F1.7e-3380Show/hide
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

Q9SUM2 Probable small nuclear ribonucleoprotein F5.5e-4087.21Show/hide
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        +IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+E+A++D
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

Arabidopsis top hitse value%identityAlignment
AT2G14285.1 Small nuclear ribonucleoprotein family protein4.6e-2685.25Show/hide
Query:  MEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        MEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+E+A++D
Subjt:  MEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

AT2G43810.1 Small nuclear ribonucleoprotein family protein4.1e-1446.97Show/hide
Query:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL
        P  FL ++ GK VVVKL  G++Y+G L  +D YMN+ +  TEEY++GQ   + G+  +R NNVLY+
Subjt:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL

AT2G43810.2 Small nuclear ribonucleoprotein family protein4.1e-1446.97Show/hide
Query:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL
        P  FL ++ GK VVVKL  G++Y+G L  +D YMN+ +  TEEY++GQ   + G+  +R NNVLY+
Subjt:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL

AT4G30220.1 small nuclear ribonucleoprotein F3.9e-4187.21Show/hide
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        +IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+E+A++D
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD

AT4G30220.2 small nuclear ribonucleoprotein F3.9e-4187.21Show/hide
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD
        +IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+E+A++D
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATCCCAGTTAACCCGAAACCGTTCTTGAACAATTTGACTGGAAAGACTGTGGTTGTGAAACTCAAGTGGGGGATGGAGTACAAAGGCTTTCTTGTCTCGGTGGA
CTCGTACATGAACTTACAGCTTGCCAATACCGAGGAATATATTGATGGACAATTTACTGGGAGTCTCGGAGAAATATTGATCAGGTGTAATAATGTTCTTTACCTTCGCG
GAGTACCAGAGGATGAAGAAATCGAAGAAGCCGAGCGTGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCATCCCAGTTAACCCGAAACCGTTCTTGAACAATTTGACTGGAAAGACTGTGGTTGTGAAACTCAAGTGGGGGATGGAGTACAAAGGCTTTCTTGTCTCGGTGGA
CTCGTACATGAACTTACAGCTTGCCAATACCGAGGAATATATTGATGGACAATTTACTGGGAGTCTCGGAGAAATATTGATCAGGTGTAATAATGTTCTTTACCTTCGCG
GAGTACCAGAGGATGAAGAAATCGAAGAAGCCGAGCGTGACTAG
Protein sequenceShow/hide protein sequence
MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEEAERD