CuGenDBv2

HG10022541 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10022541
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: U6 snRNA-associated Sm-like protein LSm4
Genome location: Chr05:25262010..25264343
RNA-Seq Expression: HG10022541
Synteny: HG10022541
Gene Ontology terms:
GO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0000956 - nuclear-transcribed mRNA catabolic process (biological process)
GO:0033962 - cytoplasmic mRNA processing body assembly (biological process)
GO:0000932 - P-body (cellular component)
GO:0005681 - spliceosomal complex (cellular component)
GO:0005688 - U6 snRNP (cellular component)
GO:0097526 - spliceosomal tri-snRNP complex (cellular component)
GO:0017070 - U6 snRNA binding (molecular function)
InterPro domains:
IPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR027141 - Like-Sm (LSM) domain containing protein, LSm4/SmD1/SmD3
IPR034101 - Sm-like protein Lsm4


Homology
GenBank top hits | e value | % identity | Alignment
KAA0046492.1 putative U6 snRNA-associated Sm-like protein LSm4 [Cucumis melo var. makuwa] | 1.9e-63 | 96.09
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE KSRADRKP  +GRGRGRGREDGPGGRPAKGM
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRGKGGPGGKPGANR
         RGFDDGAKAASGGRGKGGPGGKPGANR
Subjt:  ARGFDDGAKAASGGRGKGGPGGKPGANR

XP_008454536.1 PREDICTED: probable U6 snRNA-associated Sm-like protein LSm4 [Cucumis melo] | 1.9e-63 | 96.09
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE KSRADRKP  +GRGRGRGREDGPGGRPAKGM
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRGKGGPGGKPGANR
         RGFDDGAKAASGGRGKGGPGGKPGANR
Subjt:  ARGFDDGAKAASGGRGKGGPGGKPGANR

XP_022926288.1 sm-like protein LSM4 isoform X1 [Cucurbita moschata] | 3.0e-64 | 96.09
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE K+RADRKPP +GRGRGRGREDGPGGRPAKGM
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRGKGGPGGKPGANR
         RGFDDGAKAASGGRGKGGPGGKPGANR
Subjt:  ARGFDDGAKAASGGRGKGGPGGKPGANR

XP_022975088.1 sm-like protein LSM4 isoform X1 [Cucurbita maxima] | 7.4e-63 | 94.53
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE KSR DRKPP +GRGRGRGREDGPGGRP+KGM
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRGKGGPGGKPGANR
         RGFDDGAKAASGGRGKGG GGKPGANR
Subjt:  ARGFDDGAKAASGGRGKGGPGGKPGANR

XP_038896461.1 uncharacterized protein LOC120084721 [Benincasa hispida] | 6.7e-64 | 96.09
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE KSRADRKPP +GRGRGRGREDGPGGRPAKGM
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRGKGGPGGKPGANR
         RGFDD AKAASGGRGKGGPGGKPGANR
Subjt:  ARGFDDGAKAASGGRGKGGPGGKPGANR

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BYD6 U6 snRNA-associated Sm-like protein LSm4 | 9.4e-64 | 96.09
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE KSRADRKP  +GRGRGRGREDGPGGRPAKGM
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRGKGGPGGKPGANR
         RGFDDGAKAASGGRGKGGPGGKPGANR
Subjt:  ARGFDDGAKAASGGRGKGGPGGKPGANR

A0A5D3BGF1 U6 snRNA-associated Sm-like protein LSm4 | 9.4e-64 | 96.09
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE KSRADRKP  +GRGRGRGREDGPGGRPAKGM
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRGKGGPGGKPGANR
         RGFDDGAKAASGGRGKGGPGGKPGANR
Subjt:  ARGFDDGAKAASGGRGKGGPGGKPGANR

A0A6J1EHL8 U6 snRNA-associated Sm-like protein LSm4 | 1.4e-64 | 96.09
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE K+RADRKPP +GRGRGRGREDGPGGRPAKGM
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRGKGGPGGKPGANR
         RGFDDGAKAASGGRGKGGPGGKPGANR
Subjt:  ARGFDDGAKAASGGRGKGGPGGKPGANR

A0A6J1ID83 U6 snRNA-associated Sm-like protein LSm4 | 3.6e-63 | 94.53
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE KSR DRKPP +GRGRGRGREDGPGGRP+KGM
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRGKGGPGGKPGANR
         RGFDDGAKAASGGRGKGG GGKPGANR
Subjt:  ARGFDDGAKAASGGRGKGGPGGKPGANR

A0A6J1J2N4 U6 snRNA-associated Sm-like protein LSm4 | 1.4e-64 | 96.09
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE K+RADRKPP +GRGRGRGREDGPGGRPAKGM
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRGKGGPGGKPGANR
         RGFDDGAKAASGGRGKGGPGGKPGANR
Subjt:  ARGFDDGAKAASGGRGKGGPGGKPGANR

SwissProt top hits | e value | % identity | Alignment
F4K4E3 Sm-like protein LSM4 | 1.2e-44 | 78.45
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE K+R DRKPP +GRGRGRG +DG      +G 
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRG
        + G   G + A  GRG
Subjt:  ARGFDDGAKAASGGRG

Q43582 Probable U6 snRNA-associated Sm-like protein LSm4 | 5.7e-50 | 80.16
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGR-GREDGPGGRPAKG
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECY+RGNTIKYLRVPDEVIDKVQEEAKSR DRKPP +GR R R GR+D   GR  KG
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGR-GREDGPGGRPAKG

Query:  MARGFDDGAKAASGGRGKGGPGGKPG
        + RG DDG    + GRGKGGP  K G
Subjt:  MARGFDDGAKAASGGRGKGGPGGKPG

Q9LGE6 Probable U6 snRNA-associated Sm-like protein LSm4 | 1.4e-48 | 79.53
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE-AKSRADRKPPVLGRGRGRGR-EDGPGGRPAK
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGD+FWRMPECYIRGNTIKYLRVPDEVIDKVQEE +KSR+DR+PP +GRGRGRG     PGGR   
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE-AKSRADRKPPVLGRGRGRGR-EDGPGGRPAK

Query:  GMARGFDDGAKAASGGRGKGGPGGKPG
        G+ RG DDG     GGRG+GG GGK G
Subjt:  GMARGFDDGAKAASGGRGKGGPGGKPG

Q9QXA5 U6 snRNA-associated Sm-like protein LSm4 | 2.0e-34 | 62.5
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLV+CD WMNI+LREVICTS+DGD+FWRMPECYIRG+TIKYLR+PDE+ID V+EEA            +GRGRG   GP  +  KG 
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRGKGGPGGKPGANR
         RG     +   GGRG+   GG PGA R
Subjt:  ARGFDDGAKAASGGRGKGGPGGKPGANR

Q9ZRU9 Probable U6 snRNA-associated Sm-like protein LSm4 | 1.5e-47 | 77.78
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELK+GETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMP+CYIRGNTIKYLRVPDEVIDKVQEE KSRADRKPP +GRGRGRGRE+G G R  +G 
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARG-FDDGAKAASGGRGKGGPGGKPG
         R      AKA    RG+G   GK G
Subjt:  ARG-FDDGAKAASGGRGKGGPGGKPG

Arabidopsis top hits | e value | % identity | Alignment
AT1G20580.1 Small nuclear ribonucleoprotein family protein | 9.4e-08 | 33.61
Query:  VELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVID-KVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGMA
        VELK+GE Y G ++ C+   N  L ++  T+KDG +  ++   +IRG+ ++++ +PD +    + +   +R   K   LG GRGRG      G+PA G  
Subjt:  VELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVID-KVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGMA

Query:  RGFDDGAKAASGGRGKGGP
        RG        +GGRG   P
Subjt:  RGFDDGAKAASGGRGKGGP

AT1G76300.1 snRNP core protein SMD3 | 8.0e-07 | 33.05
Query:  VELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGMAR
        VE+K+GE Y G ++ C+   N  L  +  T+KDG +  ++   +IRG+ +++L +PD ++         R   K   LG GRGRG         AKG  R
Subjt:  VELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGMAR

Query:  GFDDGAKAASGGRGKGGP
        G         GGRG   P
Subjt:  GFDDGAKAASGGRGKGGP

AT4G02840.1 Small nuclear ribonucleoprotein family protein | 9.4e-08 | 37.62
Query:  VELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEV-IDKVQEEAKSRADRKPPVL-------GRGRGRGREDGPGG
        +ELKNG   +G +   D  MN HL+ V  T K G     +    +RGN I+Y  +PD + ++ +  E   R   K P         GRGRGRGR  G GG
Subjt:  VELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEV-IDKVQEEAKSRADRKPPVL-------GRGRGRGREDGPGG

Query:  R
        R
Subjt:  R

AT4G02840.2 Small nuclear ribonucleoprotein family protein | 2.0e-05 | 34.23
Query:  VELKNGETYNGHLVN----------CDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEV-IDKVQEEAKSRADRKPPVL-------GRGR
        +ELKNG   +G + +           D  MN HL+ V  T K G     +    +RGN I+Y  +PD + ++ +  E   R   K P         GRGR
Subjt:  VELKNGETYNGHLVN----------CDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEV-IDKVQEEAKSRADRKPPVL-------GRGR

Query:  GRGREDGPGGR
        GRGR  G GGR
Subjt:  GRGREDGPGGR

AT5G27720.1 Small nuclear ribonucleoprotein family protein | 8.7e-46 | 78.45
Query:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM
        MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEE K+R DRKPP +GRGRGRG +DG      +G 
Subjt:  MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGM

Query:  ARGFDDGAKAASGGRG
        + G   G + A  GRG
Subjt:  ARGFDDGAKAASGGRG


Sequences
CDS sequence
ATGTTGGTGGAGTTGAAAAATGGTGAGACTTACAATGGCCATTTGGTTAATTGTGATACATGGATGAACATTCATCTTCGGGAAGTCATCTGCACTTCAAAAGATGGTGA
CCGGTTTTGGCGAATGCCTGAATGTTATATCCGTGGTAATACAATCAAGTATTTGCGGGTTCCAGATGAGGTTATTGATAAAGTTCAGGAAGAAGCCAAAAGCCGTGCAG
ATAGGAAACCTCCAGTGTTGGGGCGTGGAAGAGGAAGAGGGCGTGAGGATGGTCCTGGTGGAAGACCAGCTAAAGGAATGGCCCGAGGCTTTGATGATGGTGCTAAAGCT
GCTTCTGGAGGCCGTGGAAAAGGTGGCCCTGGTGGAAAACCTGGTGCCAACAGAGATAGTTTCATGTTTCCAGATGTCTTAGAGGGCTTGGGAGGCTATGTGTGA
mRNA sequence
ATGTTGGTGGAGTTGAAAAATGGTGAGACTTACAATGGCCATTTGGTTAATTGTGATACATGGATGAACATTCATCTTCGGGAAGTCATCTGCACTTCAAAAGATGGTGA
CCGGTTTTGGCGAATGCCTGAATGTTATATCCGTGGTAATACAATCAAGTATTTGCGGGTTCCAGATGAGGTTATTGATAAAGTTCAGGAAGAAGCCAAAAGCCGTGCAG
ATAGGAAACCTCCAGTGTTGGGGCGTGGAAGAGGAAGAGGGCGTGAGGATGGTCCTGGTGGAAGACCAGCTAAAGGAATGGCCCGAGGCTTTGATGATGGTGCTAAAGCT
GCTTCTGGAGGCCGTGGAAAAGGTGGCCCTGGTGGAAAACCTGGTGCCAACAGAGATAGTTTCATGTTTCCAGATGTCTTAGAGGGCTTGGGAGGCTATGTGTGA
Protein sequence
MLVELKNGETYNGHLVNCDTWMNIHLREVICTSKDGDRFWRMPECYIRGNTIKYLRVPDEVIDKVQEEAKSRADRKPPVLGRGRGRGREDGPGGRPAKGMARGFDDGAKA
ASGGRGKGGPGGKPGANRDSFMFPDVLEGLGGYV
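
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
print(translate("ATGTTGGTGGAGTTGAAAAATGGTGAGACT"))  # MLVELKNGET
```

Passing the four CDS lines joined into one string should reproduce the full protein sequence, ending at the terminal `TGA` stop codon.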