; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G21160 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G21160
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTHO complex subunit 4D
Genome locationChr3:17274416..17276127
RNA-Seq ExpressionCSPI03G21160
SyntenyCSPI03G21160
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR025715 - Chromatin target of PRMT1 protein, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145851.1 THO complex subunit 4D [Cucumis sativus]2.5e-35100Show/hide
Query:  SESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        SESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
Subjt:  SESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

XP_008457020.1 PREDICTED: THO complex subunit 4D [Cucumis melo]1.7e-3195.29Show/hide
Query:  ESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        ESGRNAT NVVN FPGPSHRGGLRNARGRGRGAW+RGVGL GGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
Subjt:  ESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

XP_022982649.1 THO complex subunit 4D-like [Cucurbita maxima]1.3e-2370.48Show/hide
Query:  SESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSG---------GGRGRGR----------GRGRGQGRKKPVEKSSDELDKELENYHA
        SESGR  +SN VN FPGPSHRGGLR+ RGRGRG WSRG+G GGG G         GGRGRGR          GRGRGQGRKKPVEKSS ELDKELENYHA
Subjt:  SESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSG---------GGRGRGR----------GRGRGQGRKKPVEKSSDELDKELENYHA

Query:  EAMQT
        EAMQT
Subjt:  EAMQT

XP_023527122.1 THO complex subunit 4D [Cucurbita pepo subsp. pepo]1.3e-2377.89Show/hide
Query:  SESGRNATSNVVNSFPGPSHRGGLRN--ARGRGRGAWSRGVGLGGG---SGGGRGRGR----GRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        SESGR  +SN VN FPGPSHRGGLR+   RGRGRG WSRG+G GGG    GGGRGRGR    GRGRGQGRKKPVEKSS ELDKELENYHAEAMQT
Subjt:  SESGRNATSNVVNSFPGPSHRGGLRN--ARGRGRGAWSRGVGLGGG---SGGGRGRGR----GRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

XP_038896761.1 THO complex subunit 4D-like [Benincasa hispida]6.1e-2989.41Show/hide
Query:  ESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        ESGR A+SNVVN FPGPSHRGGLRN RGRGRG W+RG G+GGG GGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
Subjt:  ESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

TrEMBL top hitse value%identityAlignment
A0A0A0LCJ7 FoP_duplication domain-containing protein5.2e-50100Show/hide
Query:  MISNYFFWTGVPSIVNVELLSSGPSESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKEL
        MISNYFFWTGVPSIVNVELLSSGPSESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKEL
Subjt:  MISNYFFWTGVPSIVNVELLSSGPSESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKEL

Query:  ENYHAEAMQT
        ENYHAEAMQT
Subjt:  ENYHAEAMQT

A0A1S3C452 THO complex subunit 4D8.3e-3295.29Show/hide
Query:  ESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        ESGRNAT NVVN FPGPSHRGGLRNARGRGRGAW+RGVGL GGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
Subjt:  ESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

A0A6J1CP51 THO complex subunit 4D-like4.3e-2077.91Show/hide
Query:  SESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        SESGR  +S VVN FPGPS+RG LR  RGRGRG WSRG    G  GGGRGRGRGRGRG GRKK VEKSSDELDK+LENYHAEAMQT
Subjt:  SESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

A0A6J1FAB9 THO complex subunit 4D-like2.4e-2376.84Show/hide
Query:  SESGRNATSNVVNSFPGPSHRGGLRN--ARGRGRGAWSRGVGLGGG---SGGGRGRGR----GRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        SESGR  +S+ VN FPGPSHRGGLR+   RGRGRG WSRG+G GGG    GGGRGRGR    GRGRGQGRKKPVEKSS ELDKELENYHAEAMQT
Subjt:  SESGRNATSNVVNSFPGPSHRGGLRN--ARGRGRGAWSRGVGLGGG---SGGGRGRGR----GRGRGQGRKKPVEKSSDELDKELENYHAEAMQT

A0A6J1J564 THO complex subunit 4D-like6.4e-2470.48Show/hide
Query:  SESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSG---------GGRGRGR----------GRGRGQGRKKPVEKSSDELDKELENYHA
        SESGR  +SN VN FPGPSHRGGLR+ RGRGRG WSRG+G GGG G         GGRGRGR          GRGRGQGRKKPVEKSS ELDKELENYHA
Subjt:  SESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSG---------GGRGRGR----------GRGRGQGRKKPVEKSSDELDKELENYHA

Query:  EAMQT
        EAMQT
Subjt:  EAMQT

SwissProt top hitse value%identityAlignment
Q6NQ72 THO complex subunit 4D5.4e-0450Show/hide
Query:  GPSHRGGLRNAR-GRGRGAW---------SRGVGLGGGSGGGRGRGR---GRGRGQGR---KKPVEKSSDELDKELENYHAEAMQT
        G   RG +R  R GRG              +G G+ GG GG R RGR   GRGRG GR   KKPVEKS+ +LDK+LE+YHA+AM T
Subjt:  GPSHRGGLRNAR-GRGRGAW---------SRGVGLGGGSGGGRGRGR---GRGRGQGR---KKPVEKSSDELDKELENYHAEAMQT

Q94EH8 THO complex subunit 4C1.1e-0450.59Show/hide
Query:  FPGPSHRGGLRNARGRGRGAWSRGV--------GLGGGSGGGRGRGRGRGRGQGR-------KKPVEKSSDELDKELENYHAEAM
        F G   RGG R  RGRG G   R +        G+  G GG RGRGRG G G+G        KKPVEKS+ +LDK+LE+YHAEAM
Subjt:  FPGPSHRGGLRNARGRGRGAWSRGV--------GLGGGSGGGRGRGRGRGRGQGR-------KKPVEKSSDELDKELENYHAEAM

Arabidopsis top hitse value%identityAlignment
AT1G66260.1 RNA-binding (RRM/RBD/RNP motifs) family protein7.7e-0650.59Show/hide
Query:  FPGPSHRGGLRNARGRGRGAWSRGV--------GLGGGSGGGRGRGRGRGRGQGR-------KKPVEKSSDELDKELENYHAEAM
        F G   RGG R  RGRG G   R +        G+  G GG RGRGRG G G+G        KKPVEKS+ +LDK+LE+YHAEAM
Subjt:  FPGPSHRGGLRNARGRGRGAWSRGV--------GLGGGSGGGRGRGRGRGRGQGR-------KKPVEKSSDELDKELENYHAEAM

AT1G66260.2 RNA-binding (RRM/RBD/RNP motifs) family protein7.7e-0650.59Show/hide
Query:  FPGPSHRGGLRNARGRGRGAWSRGV--------GLGGGSGGGRGRGRGRGRGQGR-------KKPVEKSSDELDKELENYHAEAM
        F G   RGG R  RGRG G   R +        G+  G GG RGRGRG G G+G        KKPVEKS+ +LDK+LE+YHAEAM
Subjt:  FPGPSHRGGLRNARGRGRGAWSRGV--------GLGGGSGGGRGRGRGRGRGQGR-------KKPVEKSSDELDKELENYHAEAM

AT5G37720.1 ALWAYS EARLY 43.8e-0550Show/hide
Query:  GPSHRGGLRNAR-GRGRGAW---------SRGVGLGGGSGGGRGRGR---GRGRGQGR---KKPVEKSSDELDKELENYHAEAMQT
        G   RG +R  R GRG              +G G+ GG GG R RGR   GRGRG GR   KKPVEKS+ +LDK+LE+YHA+AM T
Subjt:  GPSHRGGLRNAR-GRGRGAW---------SRGVGLGGGSGGGRGRGR---GRGRGQGR---KKPVEKSSDELDKELENYHAEAMQT

AT5G37720.2 ALWAYS EARLY 42.9e-0550.7Show/hide
Query:  PGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT
        P  S R  + N +G G      G    G   GGRGRG GRG G   KKPVEKS+ +LDK+LE+YHA+AM T
Subjt:  PGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDELDKELENYHAEAMQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCAGTTATTAGGGTATCAGTTGCAAGACCATTTTGTAATTATGATATCAAACTATTTCTTTTGGACTGGAGTCCCTTCTATAGTTAATGTTGAGCTCTTGTCTTC
TGGGCCGTCTGAATCTGGTCGTAATGCTACTTCCAACGTGGTTAACTCTTTTCCTGGTCCAAGCCATCGTGGAGGCCTGAGGAATGCTCGTGGCCGTGGGCGAGGTGCTT
GGAGCCGTGGTGTAGGTCTAGGAGGAGGAAGTGGAGGAGGCCGTGGCCGAGGGCGTGGTCGTGGCCGTGGGCAAGGAAGGAAAAAACCTGTAGAGAAGTCCTCAGATGAA
CTTGACAAGGAGCTTGAAAACTACCATGCAGAAGCCATGCAAACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTCAGTTATTAGGGTATCAGTTGCAAGACCATTTTGTAATTATGATATCAAACTATTTCTTTTGGACTGGAGTCCCTTCTATAGTTAATGTTGAGCTCTTGTCTTC
TGGGCCGTCTGAATCTGGTCGTAATGCTACTTCCAACGTGGTTAACTCTTTTCCTGGTCCAAGCCATCGTGGAGGCCTGAGGAATGCTCGTGGCCGTGGGCGAGGTGCTT
GGAGCCGTGGTGTAGGTCTAGGAGGAGGAAGTGGAGGAGGCCGTGGCCGAGGGCGTGGTCGTGGCCGTGGGCAAGGAAGGAAAAAACCTGTAGAGAAGTCCTCAGATGAA
CTTGACAAGGAGCTTGAAAACTACCATGCAGAAGCCATGCAAACCTGA
Protein sequenceShow/hide protein sequence
MFQLLGYQLQDHFVIMISNYFFWTGVPSIVNVELLSSGPSESGRNATSNVVNSFPGPSHRGGLRNARGRGRGAWSRGVGLGGGSGGGRGRGRGRGRGQGRKKPVEKSSDE
LDKELENYHAEAMQT