CuGenDBv2

Tan0020370 (gene) of Snake gourd v1 genome

Gene ID: Tan0020370
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Sm protein F
Genome location: LG02:88796872..88799299
RNA-Seq Expression: Tan0020370
Synteny: Tan0020370
Gene Ontology terms:
  GO:0000387 - spliceosomal snRNP assembly (biological process)
  GO:0005685 - U1 snRNP (cellular component)
  GO:0071013 - catalytic step 2 spliceosome (cellular component)
  GO:0003723 - RNA binding (molecular function)
InterPro domains:
  IPR001163 - LSM domain, eukaryotic/archaea-type
  IPR010920 - LSM domain superfamily
  IPR016487 - Sm-like protein Lsm6/SmF
  IPR034100 - Small nuclear ribonucleoprotein F


Homology

GenBank top hits (e-value, % identity, alignment)
KAF4357808.1 hypothetical protein F8388_024419 [Cannabis sativa]; e-value 6.2e-41; identity 97.7%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MSIPVNPKPFLNNLTGKTV VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIEDAERE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

KAF4391041.1 hypothetical protein F8388_024873 [Cannabis sativa]; e-value 6.2e-41; identity 97.7%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MSIPVNPKPFLNNLTGKTV VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIEDAERE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

XP_004146017.2 probable small nuclear ribonucleoprotein F isoform X1 [Cucumis sativus]; e-value 1.6e-41; identity 98.85%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAER+
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

XP_022144158.1 probable small nuclear ribonucleoprotein F [Momordica charantia]; e-value 2.1e-41; identity 98.85%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIED ERE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

XP_030480982.1 probable small nuclear ribonucleoprotein F [Cannabis sativa]; e-value 6.2e-41; identity 97.7%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MSIPVNPKPFLNNLTGKTV VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIEDAERE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3CKH5 Sm protein F; e-value 8.0e-42; identity 98.85%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAER+
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

A0A5D3DWN8 Sm protein F; e-value 8.0e-42; identity 98.85%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAER+
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

A0A6J1CSK0 Sm protein F; e-value 1.0e-41; identity 98.85%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIED ERE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

A0A7J6EH77 Sm protein F; e-value 3.0e-41; identity 97.7%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MSIPVNPKPFLNNLTGKTV VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIEDAERE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

A0A7J6FQ66 Sm protein F; e-value 3.0e-41; identity 97.7%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MSIPVNPKPFLNNLTGKTV VKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTG+LGEILIRCNNVLYLRGVPEDEEIEDAERE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

SwissProt top hits (e-value, % identity, alignment)
P62306 Small nuclear ribonucleoprotein F; e-value 1.7e-33; identity 80%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

P62307 Small nuclear ribonucleoprotein F; e-value 1.7e-33; identity 80%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

P62321 Small nuclear ribonucleoprotein F; e-value 1.7e-33; identity 80%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

Q3T0Z8 Small nuclear ribonucleoprotein F; e-value 1.7e-33; identity 80%
Query:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE
        MS+P+NPKPFLN LTGK V+VKLKWGMEYKG+LVSVD YMN+QLANTEEYIDG  +G LGE+LIRCNNVLY+RGV E+EE
Subjt:  MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEE

Q9SUM2 Probable small nuclear ribonucleoprotein F; e-value 5.5e-40; identity 87.21%
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        +IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA+++
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

Arabidopsis top hits (e-value, % identity, alignment)
AT2G14285.1 Small nuclear ribonucleoprotein family protein; e-value 4.6e-26; identity 85.25%
Query:  MEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        MEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA+++
Subjt:  MEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

AT2G43810.1 Small nuclear ribonucleoprotein family protein; e-value 4.1e-14; identity 46.97%
Query:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL
        P  FL ++ GK VVVKL  G++Y+G L  +D YMN+ +  TEEY++GQ   + G+  +R NNVLY+
Subjt:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL

AT2G43810.2 Small nuclear ribonucleoprotein family protein; e-value 4.1e-14; identity 46.97%
Query:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL
        P  FL ++ GK VVVKL  G++Y+G L  +D YMN+ +  TEEY++GQ   + G+  +R NNVLY+
Subjt:  PKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYL

AT4G30220.1 small nuclear ribonucleoprotein F; e-value 3.9e-41; identity 87.21%
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        +IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA+++
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE

AT4G30220.2 small nuclear ribonucleoprotein F; e-value 3.9e-41; identity 87.21%
Query:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
        +IPVNPKPFLNNLTGKTV+VKLKWGMEYKGFL SVDSYMNLQL NTEEYIDGQ TG+LGEILIRCNNVLY+RGVPEDEE+EDA+++
Subjt:  SIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE


Sequences

CDS sequence
ATGAGTATCCCAGTTAACCCGAAACCGTTCTTGAACAATTTGACCGGAAAGACTGTGGTTGTGAAACTCAAGTGGGGGATGGAGTACAAAGGCTTTCTTGTCTCAGTGGA
CTCGTATATGAACTTACAGCTTGCCAATACTGAGGAATATATTGATGGACAATTTACTGGGAGTCTCGGAGAAATATTGATCAGGTGTAATAATGTTCTTTACCTTCGTG
GAGTACCAGAGGATGAAGAGATCGAAGATGCCGAGCGTGAATAG
mRNA sequence
ATTTGTTTCGCTCAGCTGCCGAAAACTCGCAGAAATTCAGCGAAAGGAAAAAAAGGAGGGAAAACAAACCCTAGAGAGAGAGCGCCAATTTCATCGTCTCCTTGTTCTTT
CGAGCCTCCACAGATTCCAGAGAAGCTCTTGCAGATCATTTCTGAACCATGAGTATCCCAGTTAACCCGAAACCGTTCTTGAACAATTTGACCGGAAAGACTGTGGTTGT
GAAACTCAAGTGGGGGATGGAGTACAAAGGCTTTCTTGTCTCAGTGGACTCGTATATGAACTTACAGCTTGCCAATACTGAGGAATATATTGATGGACAATTTACTGGGA
GTCTCGGAGAAATATTGATCAGGTGTAATAATGTTCTTTACCTTCGTGGAGTACCAGAGGATGAAGAGATCGAAGATGCCGAGCGTGAATAGCATGTAAATTTGTAGCAG
TTCTGCCATGCCATGGTAGCAGTAAAGTATTCAGCTTTCTAGTTTTCTAGTTTTTCTTTCACCTGCTGCTTAAGTTGTATCTTGTTATTGATATTATTCACAGGAACATT
CTCTGCTTTGGTCGATTTCAAACCTGCAACAATCATTCGGTTTCCATAGTTGTAGTAGGGAGATTGAATGGTAGAAAAAGAATTCTTGAATGTAACATCTATATTTCTTG
ATAGCATCGAGTTGCTTCAGTTTTTTCAATATAAGATGAAGTTCAGCCGTTTAAACTTTTGGGAAAAAAACATTTTTGATCCCGAACTTTGTAGGTTTGTATCAATTTTA
ATATTAAACTTTCAATTTCATCGATTTAAACCTCAAA
Protein sequence
MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE
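
The CDS and protein sequences of this record can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the CuGenDB record: the function name `translate` and the constants are illustrative, and it assumes the standard nuclear genetic code (NCBI translation table 1) applies to this gene.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1, assumed here),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS and protein sequences copied from the record for Tan0020370.
CDS = (
    "ATGAGTATCCCAGTTAACCCGAAACCGTTCTTGAACAATTTGACCGGAAAGACTGTGGTTGTGAAACTCAAGTGGGGGATGGAGTACAAAGGCTTTCTTGTCTCAGTGGA"
    "CTCGTATATGAACTTACAGCTTGCCAATACTGAGGAATATATTGATGGACAATTTACTGGGAGTCTCGGAGAAATATTGATCAGGTGTAATAATGTTCTTTACCTTCGTG"
    "GAGTACCAGAGGATGAAGAGATCGAAGATGCCGAGCGTGAATAG"
)
PROTEIN = (
    "MSIPVNPKPFLNNLTGKTVVVKLKWGMEYKGFLVSVDSYMNLQLANTEEYIDGQFTGSLGEILIRCNNVLYLRGVPEDEEIEDAERE"
)


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # Report whether the translated CDS agrees with the listed protein.
    print(translate(CDS) == PROTEIN)
```

The same function could be applied to the coding region of the mRNA sequence once the 5' and 3' untranslated regions are trimmed away.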