; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G7734 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G7734
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionTHO complex subunit 1
Genome locationctg1546:508412..510088
RNA-Seq ExpressionCucsat.G7734
SyntenyCucsat.G7734
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0032784 - regulation of DNA-templated transcription, elongation (biological process)
GO:0000445 - THO complex part of transcription export complex (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062399.1 THO complex subunit 1 [Cucumis melo var. makuwa]4.68e-9499.35Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAK+EEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
        GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
Subjt:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG

KGN51016.1 hypothetical protein Csa_007827 [Cucumis sativus]2.89e-95100Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
        GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
Subjt:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG

XP_008460496.1 PREDICTED: THO complex subunit 1 [Cucumis melo]8.12e-9599.35Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAK+EEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
        GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
Subjt:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG

XP_011655214.1 THO complex subunit 1 [Cucumis sativus]8.47e-95100Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
        GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
Subjt:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG

XP_038892206.1 THO complex subunit 1 isoform X3 [Benincasa hispida]1.06e-9196.1Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAK AVQQV+ENQMATPASENDGEGTRSDPDGPSAGMDVDTA+AT
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
        GN+SQGG STPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDAEVDLDTAG
Subjt:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG

TrEMBL top hitse value%identityAlignment
A0A0A0KTL0 Uncharacterized protein1.40e-95100Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
        GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
Subjt:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG

A0A1S3CC63 THO complex subunit 13.93e-9599.35Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAK+EEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
        GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
Subjt:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG

A0A5A7V9M3 THO complex subunit 12.27e-9499.35Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAK+EEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
        GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
Subjt:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG

A0A6J1CC37 THO complex subunit 13.13e-8690.91Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEE KG+V+QV+ENQMATPASENDGEGTRSD DGPSA MDVDT +A 
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
        GNVSQGG  TP+ENK SSDTDIGQE+GQLEADAEVEPGMIDGETDAEVDLDTAG
Subjt:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG

A0A6J1G1S6 THO complex subunit 14.42e-8690.26Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPNERSKRAKKEE KGAV+Q +ENQMATPASENDGEGTRSDPDGPS  MD DT +A 
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
        G+VSQGG  TPEENK SSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG
Subjt:  GNVSQGGISTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG

SwissProt top hitse value%identityAlignment
Q93VM9 THO complex subunit 13.7e-3153.06Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQAKPNE++KRAKKEE KG   + + NQ+    SE + EG R D +   +    DT    
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKL--SSDTDIGQEAGQLEADAEVEPGMIDGETD
                 TPEE +    SDT+ GQEAGQ+E     E G++D + D
Subjt:  GNVSQGGISTPEENKL--SSDTDIGQEAGQLEADAEVEPGMIDGETD

Arabidopsis top hitse value%identityAlignment
AT5G09860.1 nuclear matrix protein-related2.6e-3253.06Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQAKPNE++KRAKKEE KG   + + NQ+    SE + EG R D +   +    DT    
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKL--SSDTDIGQEAGQLEADAEVEPGMIDGETD
                 TPEE +    SDT+ GQEAGQ+E     E G++D + D
Subjt:  GNVSQGGISTPEENKL--SSDTDIGQEAGQLEADAEVEPGMIDGETD

AT5G09860.2 nuclear matrix protein-related2.6e-3253.06Show/hide
Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT
        CWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQAKPNE++KRAKKEE KG   + + NQ+    SE + EG R D +   +    DT    
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIAT

Query:  GNVSQGGISTPEENKL--SSDTDIGQEAGQLEADAEVEPGMIDGETD
                 TPEE +    SDT+ GQEAGQ+E     E G++D + D
Subjt:  GNVSQGGISTPEENKL--SSDTDIGQEAGQLEADAEVEPGMIDGETD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TGCTGGAAAGGCCTTCGGTTTTCAGCTCGGCAGGATTTAGAAGGATTTTCTCGGTTCACCGACCATGGCATTGAAGGTGTTGTGCCGTTGGAACTCCTGCCACCTGATGT
ACGAGCCAAATACCAAGCCAAACCAAATGAGAGATCTAAACGTGCTAAAAAGGAAGAAGCAAAAGGGGCAGTCCAGCAAGTTGATGAAAATCAGATGGCAACTCCAGCCA
GTGAGAACGATGGCGAAGGAACCAGAAGTGATCCCGATGGGCCATCAGCAGGGATGGATGTCGATACAGCTATTGCCACGGGCAATGTATCTCAAGGTGGTATTTCAACT
CCAGAAGAAAATAAACTAAGCTCTGACACGGATATTGGTCAAGAGGCAGGCCAGTTGGAAGCCGATGCTGAGGTGGAGCCGGGCATGATTGATGGCGAGACAGATGCAGA
GGTTGATTTGGATACTGCAGGTTGA
mRNA sequenceShow/hide mRNA sequence
TGCTGGAAAGGCCTTCGGTTTTCAGCTCGGCAGGATTTAGAAGGATTTTCTCGGTTCACCGACCATGGCATTGAAGGTGTTGTGCCGTTGGAACTCCTGCCACCTGATGT
ACGAGCCAAATACCAAGCCAAACCAAATGAGAGATCTAAACGTGCTAAAAAGGAAGAAGCAAAAGGGGCAGTCCAGCAAGTTGATGAAAATCAGATGGCAACTCCAGCCA
GTGAGAACGATGGCGAAGGAACCAGAAGTGATCCCGATGGGCCATCAGCAGGGATGGATGTCGATACAGCTATTGCCACGGGCAATGTATCTCAAGGTGGTATTTCAACT
CCAGAAGAAAATAAACTAAGCTCTGACACGGATATTGGTCAAGAGGCAGGCCAGTTGGAAGCCGATGCTGAGGTGGAGCCGGGCATGATTGATGGCGAGACAGATGCAGA
GGTTGATTTGGATACTGCAGGTTGA
Protein sequenceShow/hide protein sequence
CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVDENQMATPASENDGEGTRSDPDGPSAGMDVDTAIATGNVSQGGIST
PEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG