; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G09860 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G09860
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionU11/U12 small nuclear ribonucleoprotein 65 kDa protein
Genome locationClcChr02:14780950..14787966
RNA-Seq ExpressionClc02G09860
SyntenyClc02G09860
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0005689 - U12-type spliceosomal complex (cellular component)
GO:0030626 - U12 snRNA binding (molecular function)
GO:0097157 - pre-mRNA intronic binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448178.1 PREDICTED: U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X2 [Cucumis melo]1.1e-2795.65Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
        GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPS+ELAQ ALNLVNGYVFKGKPMIIQFGRNPGA KGS
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS

XP_022152394.1 U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X1 [Momordica charantia]2.9e-2895.65Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
        GSLF GID+AKSALTVKLMQEGRMRGQAFVTFPS+ELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS

XP_022972081.1 U11/U12 small nuclear ribonucleoprotein 65 kDa protein [Cucurbita maxima]5.0e-2897.1Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
        GSLF GIDEAKSALTVKLMQEGRMRGQAFVT PSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS

XP_023540623.1 U11/U12 small nuclear ribonucleoprotein 65 kDa protein [Cucurbita pepo subsp. pepo]5.0e-2897.1Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
        GSLF GIDEAKSALTVKLMQEGRMRGQAFVT PSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS

XP_038903908.1 U11/U12 small nuclear ribonucleoprotein 65 kDa protein [Benincasa hispida]3.4e-2998.55Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
        GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKG+
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS

TrEMBL top hitse value%identityAlignment
A0A1S3BJY8 U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X25.4e-2895.65Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
        GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPS+ELAQ ALNLVNGYVFKGKPMIIQFGRNPGA KGS
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS

A0A6J1DFW4 U11/U12 small nuclear ribonucleoprotein 65 kDa protein isoform X11.4e-2895.65Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
        GSLF GID+AKSALTVKLMQEGRMRGQAFVTFPS+ELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS

A0A6J1EXB4 U11/U12 small nuclear ribonucleoprotein 65 kDa protein2.3e-2694.12Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKG
        GSLF GIDEAKSALTVKLMQEGRMRGQAFVT PSVELAQ ALNLVNGYVFKGKPMIIQFGR+PGAAKG
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKG

A0A6J1I4Z6 U11/U12 small nuclear ribonucleoprotein 65 kDa protein2.4e-2897.1Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
        GSLF GIDEAKSALTVKLMQEGRMRGQAFVT PSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS

A0A7I8JF77 Hypothetical protein1.6e-2482.61Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS
        GSLF G++E KSAL +KLMQEGRMRGQAFVTFPS +LAQHALNLVNG+VFKGKPM+IQFGRNP AAK S
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS

SwissProt top hitse value%identityAlignment
F1Q8J0 RNA-binding region-containing protein 35.4e-0949.09Show/hide
Query:  DEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRN
        +E ++   + LM+EGRM+GQAF+  PS   AQ AL   NGYV K KP+++QF R+
Subjt:  DEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRN

Q3UZ01 RNA-binding region-containing protein 33.5e-0855.32Show/hide
Query:  VKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRN
        ++LM+EGRM+GQAFV  P+ + A  AL   NGYV  GKPM++QF R+
Subjt:  VKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRN

Q5R6C7 RNA-binding region-containing protein 33.5e-0855.32Show/hide
Query:  VKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRN
        ++LM+EGRM+GQAFV  P+ + A  AL   NGYV  GKPM++QF R+
Subjt:  VKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRN

Q8RWV8 U11/U12 small nuclear ribonucleoprotein 65 kDa protein5.0e-2379.1Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAK
        GS FE  + AKS+L V+LMQEGRMRGQAF+TFPSVE+A  ALNLVNG+VFKGKPMIIQFGR PGAAK
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAK

Q96IZ5 RNA-binding protein 411.7e-1046.15Show/hide
Query:  ILMSKSEDVAYAEPDNGSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRN
        +L  K+      E D  SLF    E K       M  GRMRGQAF+TFP+ E+A  AL+LVNGY   GK ++I+FG+N
Subjt:  ILMSKSEDVAYAEPDNGSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRN

Arabidopsis top hitse value%identityAlignment
AT1G09230.1 RNA-binding (RRM/RBD/RNP motifs) family protein3.6e-2479.1Show/hide
Query:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAK
        GS FE  + AKS+L V+LMQEGRMRGQAF+TFPSVE+A  ALNLVNG+VFKGKPMIIQFGR PGAAK
Subjt:  GSLFEGIDEAKSALTVKLMQEGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAK

AT1G67770.1 terminal EAR1-like 23.1e-0451.16Show/hide
Query:  RGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAK
        R Q FV F  V  A  AL ++NG V  GKPM+IQF R  G  K
Subjt:  RGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATTGCAGAAGTTGTGTCCTTTGCTACAACAGTAGGCAGAAGCGCTGGCCGGGTAGGAAGAATAATTGGGAAAAGGCTTCATCACAACCTAGGAAAATTGCAAAA
GACCCTACCAAGAATGCGATTCAAGTGTGGCAAAATCAGGCATCTTTCCAACGTTGCCCTCAAAGAAAAAAATCTTAGAATTCAAAAAGGCAAGAAGACCAAGAAGAAAT
CAATCTTGATGAGCAAAAGTGAAGATGTTGCTTATGCAGAGCCCGATAATGGATCATTATTTGAAGGTATTGATGAAGCCAAGTCCGCTCTAACTGTGAAGCTAATGCAG
GAGGGAAGAATGAGGGGCCAAGCATTTGTAACATTTCCATCGGTTGAGCTTGCACAACATGCTTTGAATCTAGTAAATGGTTATGTGTTCAAAGGCAAGCCAATGATTAT
CCAGTTTGGACGCAATCCAGGAGCCGCCAAGGGGAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCATTGCAGAAGTTGTGTCCTTTGCTACAACAGTAGGCAGAAGCGCTGGCCGGGTAGGAAGAATAATTGGGAAAAGGCTTCATCACAACCTAGGAAAATTGCAAAA
GACCCTACCAAGAATGCGATTCAAGTGTGGCAAAATCAGGCATCTTTCCAACGTTGCCCTCAAAGAAAAAAATCTTAGAATTCAAAAAGGCAAGAAGACCAAGAAGAAAT
CAATCTTGATGAGCAAAAGTGAAGATGTTGCTTATGCAGAGCCCGATAATGGATCATTATTTGAAGGTATTGATGAAGCCAAGTCCGCTCTAACTGTGAAGCTAATGCAG
GAGGGAAGAATGAGGGGCCAAGCATTTGTAACATTTCCATCGGTTGAGCTTGCACAACATGCTTTGAATCTAGTAAATGGTTATGTGTTCAAAGGCAAGCCAATGATTAT
CCAGTTTGGACGCAATCCAGGAGCCGCCAAGGGGAGTTAA
Protein sequenceShow/hide protein sequence
MSIAEVVSFATTVGRSAGRVGRIIGKRLHHNLGKLQKTLPRMRFKCGKIRHLSNVALKEKNLRIQKGKKTKKKSILMSKSEDVAYAEPDNGSLFEGIDEAKSALTVKLMQ
EGRMRGQAFVTFPSVELAQHALNLVNGYVFKGKPMIIQFGRNPGAAKGS