; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G19730 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G19730
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionSmall nuclear ribonucleoprotein Sm D1
Genome locationClcChr11:30227057..30234074
RNA-Seq ExpressionClc11G19730
SyntenyClc11G19730
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0046855 - inositol phosphate dephosphorylation (biological process)
GO:0046854 - phosphatidylinositol phosphorylation (biological process)
GO:0006790 - sulfur compound metabolic process (biological process)
GO:0034719 - SMN-Sm protein complex (cellular component)
GO:0097526 - spliceosomal tri-snRNP complex (cellular component)
GO:0071013 - catalytic step 2 spliceosome (cellular component)
GO:0071011 - precatalytic spliceosome (cellular component)
GO:0000243 - commitment complex (cellular component)
GO:0034715 - pICln-Sm protein complex (cellular component)
GO:0005689 - U12-type spliceosomal complex (cellular component)
GO:0005687 - U4 snRNP (cellular component)
GO:0005686 - U2 snRNP (cellular component)
GO:0005685 - U1 snRNP (cellular component)
GO:0005682 - U5 snRNP (cellular component)
GO:0008441 - 3'(2'),5'-bisphosphate nucleotidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR000760 - Inositol monophosphatase-like
IPR001163 - LSM domain, eukaryotic/archaea-type
IPR010920 - LSM domain superfamily
IPR027141 - Like-Sm (LSM) domain containing protein, LSm4/SmD1/SmD3
IPR034102 - Small nuclear ribonucleoprotein D1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036838.1 PAP-specific phosphatase HAL2-like [Cucumis melo var. makuwa]2.3e-4372.09Show/hide
Query:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY
        V+YA+K C+GAWMQPLVHGDKKL+WPNS SLIQVSS DD   ATFCEPV KRNSNHSFTAG+AH VGLR   +     + YAA+AR  AEIFM FART Y
Subjt:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY

Query:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL
        +EKIWDHAAG+II EAA GVVTD+ GR L
Subjt:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL

KAG7024860.1 PAP-specific phosphatase HAL2-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-4371.32Show/hide
Query:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY
        V+YARK C GAWMQPLVHGD+KL+WPNS SL++VSS DD  LATFCEPV KRNSNHSFTAG+AH VGLR   +     + YAA+AR  AEIFM FART Y
Subjt:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY

Query:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL
        +EKIWDHAAG+II EAA GVVTD+ GR L
Subjt:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL

MQL92745.1 hypothetical protein [Colocasia esculenta]3.5e-4475.97Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTADI-IVH
        MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLK VKLT+KGKNPVT+DHLSVRGNNIRYYILPDSLN+ETLLVEETPRVKPKKPTAD+ +V 
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTADI-IVH

Query:  NAQVKRRVLYARKRCSGAWMQPLVHGDKK
         A     V+    +C G  ++  + G+ K
Subjt:  NAQVKRRVLYARKRCSGAWMQPLVHGDKK

XP_008454658.1 PREDICTED: PAP-specific phosphatase HAL2-like [Cucumis melo]2.3e-4372.09Show/hide
Query:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY
        V+YA+K C+GAWMQPLVHGDKKL+WPNS SLIQVSS DD   ATFCEPV KRNSNHSFTAG+AH VGLR   +     + YAA+AR  AEIFM FART Y
Subjt:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY

Query:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL
        +EKIWDHAAG+II EAA GVVTD+ GR L
Subjt:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL

XP_038897109.1 PAP-specific phosphatase HAL2-like [Benincasa hispida]7.8e-4472.87Show/hide
Query:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY
        V+YA+K CSGAWMQPLVHGDKKL+WPNS SLIQVSS DD  LATFCEPV KRNSNHSFTAG+A+ VGLR   +     + YAA+AR  AEIFM FART Y
Subjt:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY

Query:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL
        +EKIWDHAAG+II EAA GVVTD+ GR L
Subjt:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL

TrEMBL top hitse value%identityAlignment
A0A199V0P2 Small nuclear ribonucleoprotein Sm D11.4e-4393Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTADIIVHN
        MKLVRFLMKLNNETV+IELKNGTVVHGTITGVDISMNTHLK VKLTLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTAD I+ +
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTADIIVHN

A0A1S3BZX2 3'(2'),5'-bisphosphate nucleotidase1.1e-4372.09Show/hide
Query:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY
        V+YA+K C+GAWMQPLVHGDKKL+WPNS SLIQVSS DD   ATFCEPV KRNSNHSFTAG+AH VGLR   +     + YAA+AR  AEIFM FART Y
Subjt:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY

Query:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL
        +EKIWDHAAG+II EAA GVVTD+ GR L
Subjt:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL

A0A3Q7I508 Small nuclear ribonucleoprotein Sm D11.9e-4394.85Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTADII
        MKLVRFLMKLNNETVSIELKNGTVVHGTITGVD+SMNTHLKAVK+TLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA I+
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTADII

A0A5A7T244 3'(2'),5'-bisphosphate nucleotidase1.1e-4372.09Show/hide
Query:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY
        V+YA+K C+GAWMQPLVHGDKKL+WPNS SLIQVSS DD   ATFCEPV KRNSNHSFTAG+AH VGLR   +     + YAA+AR  AEIFM FART Y
Subjt:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY

Query:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL
        +EKIWDHAAG+II EAA GVVTD+ GR L
Subjt:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL

A0A6J1J227 3'(2'),5'-bisphosphate nucleotidase1.4e-4371.32Show/hide
Query:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY
        V+YA++ CSGAWMQPLVHG+KKL+WPNS SLIQVSS DD  LATFCEPV K+NSNHSFTAG+AH VGLR   +     + YAA+AR  AEIFM FART Y
Subjt:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVI-----ITYAAVARVVAEIFMNFARTRY

Query:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL
        +EKIWDHAAG+II EAA GVVTD+ GR L
Subjt:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL

SwissProt top hitse value%identityAlignment
P62314 Small nuclear ribonucleoprotein Sm D13.2e-3274.47Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA
        MKLVRFLMKL++ETV+IELKNGT VHGTITGVD+SMNTHLKAVK+TLK + PV ++ LS+RGNNIRY+ILPDSL L+TLLV+  P+VK KK  A
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA

Q3ZC10 Small nuclear ribonucleoprotein Sm D13.2e-3274.47Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA
        MKLVRFLMKL++ETV+IELKNGT VHGTITGVD+SMNTHLKAVK+TLK + PV ++ LS+RGNNIRY+ILPDSL L+TLLV+  P+VK KK  A
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA

Q4R5F6 Small nuclear ribonucleoprotein Sm D13.2e-3274.47Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA
        MKLVRFLMKL++ETV+IELKNGT VHGTITGVD+SMNTHLKAVK+TLK + PV ++ LS+RGNNIRY+ILPDSL L+TLLV+  P+VK KK  A
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA

Q9SSF1 Small nuclear ribonucleoprotein SmD1a5.3e-4391.49Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA
        MKLVRFLMKLNNETVSIELKNGTVVHGTITGVD+SMNTHLK VK++LKGKNPVT+DHLS+RGNNIRYYILPDSLNLETLLVE+TPRVKPKKP A
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA

Q9SY09 Small nuclear ribonucleoprotein SmD1b7.3e-4594.68Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA
        MKLVRFLMKLNNETVSIELKNGT+VHGTITGVD+SMNTHLKAVKLTLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVE+TPR+KPKKPTA
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA

Arabidopsis top hitse value%identityAlignment
AT3G07590.1 Small nuclear ribonucleoprotein family protein3.7e-4491.49Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA
        MKLVRFLMKLNNETVSIELKNGTVVHGTITGVD+SMNTHLK VK++LKGKNPVT+DHLS+RGNNIRYYILPDSLNLETLLVE+TPRVKPKKP A
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA

AT3G07590.2 Small nuclear ribonucleoprotein family protein3.7e-4491.49Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA
        MKLVRFLMKLNNETVSIELKNGTVVHGTITGVD+SMNTHLK VK++LKGKNPVT+DHLS+RGNNIRYYILPDSLNLETLLVE+TPRVKPKKP A
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA

AT4G02840.1 Small nuclear ribonucleoprotein family protein5.2e-4694.68Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA
        MKLVRFLMKLNNETVSIELKNGT+VHGTITGVD+SMNTHLKAVKLTLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVE+TPR+KPKKPTA
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTA

AT4G02840.2 Small nuclear ribonucleoprotein family protein1.4e-4385.58Show/hide
Query:  MKLVRFLMKLNNETVSIELKNGTVVHGTIT----------GVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPK
        MKLVRFLMKLNNETVSIELKNGT+VHGTIT          GVD+SMNTHLKAVKLTLKGKNPVT+DHLSVRGNNIRYYILPDSLNLETLLVE+TPR+KPK
Subjt:  MKLVRFLMKLNNETVSIELKNGTVVHGTIT----------GVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPK

Query:  KPTA
        KPTA
Subjt:  KPTA

AT5G54390.1 HAL2-like2.4e-3052.71Show/hide
Query:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVG-----LRFDVIITYAAVARVVAEIFMNFARTRY
        V+YA++    AWMQPL+ G      P S +L++VSS DD  LAT CEPV + NSNH FTAG+A+ +G     +R   ++ YAA+AR  AE+FM FA++ Y
Subjt:  VLYARKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVG-----LRFDVIITYAAVARVVAEIFMNFARTRY

Query:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL
        +EKIWDHAAG++I E A GVVTD+ GR L
Subjt:  QEKIWDHAAGIIIAEAARGVVTDSRGRLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTTGTTAGGTTTTTGATGAAGCTCAACAATGAGACTGTCTCAATCGAGCTCAAAAATGGAACTGTTGTCCACGGCACTATCACAGGTGTGGATATCAGCATGAA
TACACATTTGAAGGCTGTGAAACTTACTCTAAAGGGGAAAAATCCAGTTACCATGGATCATTTAAGTGTGAGGGGTAACAACATCAGATATTATATTCTACCCGACAGCT
TGAATCTTGAGACTTTACTTGTTGAAGAGACACCCAGGGTCAAGCCCAAGAAACCAACTGCAGATATAATAGTTCATAATGCACAGGTTAAGAGGCGTGTATTATATGCA
AGGAAAAGGTGCAGTGGGGCGTGGATGCAGCCATTAGTCCATGGCGATAAGAAGCTTGATTGGCCAAATTCAGTTAGCCTTATTCAGGTTTCTTCCACTGATGACTCAGG
ACTTGCAACTTTCTGTGAACCTGTGGGGAAGCGCAACTCAAACCACTCGTTCACCGCAGGGATGGCTCACTGTGTAGGGCTTAGGTTTGATGTCATTATTACATATGCTG
CCGTTGCTCGTGTGGTTGCTGAGATTTTTATGAACTTTGCAAGGACCAGATACCAAGAGAAGATATGGGACCATGCTGCTGGTATTATCATTGCAGAAGCAGCGAGAGGT
GTGGTAACAGATTCCAGAGGTCGTCTGTTACTTAGAAGGTCTCCATCGGGGAATAATTGTTTGCTCGGGGTCAATCGTACAAGAGAAAATCATTAG
mRNA sequenceShow/hide mRNA sequence
TGTTCTTTAAAGCAAAACCCATCGGGACAAAACCGCGGCAATCTCCCAATTTCCTCTCCCTCGCTTCGTTGTTTGAAACCCTACATTCTTCATTTCCTCTGCAATCATGA
AGCTTGTTAGGTTTTTGATGAAGCTCAACAATGAGACTGTCTCAATCGAGCTCAAAAATGGAACTGTTGTCCACGGCACTATCACAGGTGTGGATATCAGCATGAATACA
CATTTGAAGGCTGTGAAACTTACTCTAAAGGGGAAAAATCCAGTTACCATGGATCATTTAAGTGTGAGGGGTAACAACATCAGATATTATATTCTACCCGACAGCTTGAA
TCTTGAGACTTTACTTGTTGAAGAGACACCCAGGGTCAAGCCCAAGAAACCAACTGCAGATATAATAGTTCATAATGCACAGGTTAAGAGGCGTGTATTATATGCAAGGA
AAAGGTGCAGTGGGGCGTGGATGCAGCCATTAGTCCATGGCGATAAGAAGCTTGATTGGCCAAATTCAGTTAGCCTTATTCAGGTTTCTTCCACTGATGACTCAGGACTT
GCAACTTTCTGTGAACCTGTGGGGAAGCGCAACTCAAACCACTCGTTCACCGCAGGGATGGCTCACTGTGTAGGGCTTAGGTTTGATGTCATTATTACATATGCTGCCGT
TGCTCGTGTGGTTGCTGAGATTTTTATGAACTTTGCAAGGACCAGATACCAAGAGAAGATATGGGACCATGCTGCTGGTATTATCATTGCAGAAGCAGCGAGAGGTGTGG
TAACAGATTCCAGAGGTCGTCTGTTACTTAGAAGGTCTCCATCGGGGAATAATTGTTTGCTCGGGGTCAATCGTACAAGAGAAAATCATTAG
Protein sequenceShow/hide protein sequence
MKLVRFLMKLNNETVSIELKNGTVVHGTITGVDISMNTHLKAVKLTLKGKNPVTMDHLSVRGNNIRYYILPDSLNLETLLVEETPRVKPKKPTADIIVHNAQVKRRVLYA
RKRCSGAWMQPLVHGDKKLDWPNSVSLIQVSSTDDSGLATFCEPVGKRNSNHSFTAGMAHCVGLRFDVIITYAAVARVVAEIFMNFARTRYQEKIWDHAAGIIIAEAARG
VVTDSRGRLLLRRSPSGNNCLLGVNRTRENH