CuGenDBv2

HG10000176 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10000176
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: THO complex subunit 1
Genome location: Chr09:1795154..1814417
RNA-Seq Expression: HG10000176
Synteny: HG10000176
Gene Ontology terms:
GO:0006406 - mRNA export from nucleus (biological process)
GO:0032784 - regulation of DNA-templated transcription, elongation (biological process)
GO:0000445 - THO complex part of transcription export complex (cellular component)
GO:0016020 - membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR021861 - THO complex, subunit THOC1
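
The genome location listed for this gene, Chr09:1795154..1814417, can be split into a chromosome name and two coordinates. The following is a minimal Python sketch of that parsing. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'Chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Chr09:1795154..1814417")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 19264 bp on Chr09. This is longer than the CDS listed under Sequences, as expected for a locus that contains introns.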


Homology
GenBank top hits | e value | % identity | Alignment
KAA0062399.1 THO complex subunit 1 [Cucumis melo var. makuwa] | 1.1e-140 | 69.57
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        +REEIKSCEERVKKLLEVTPPRGKDFL+KIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +DVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        +PAISEYWKPLAEDMDESAGIEAEYHHRNNR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAK+EEAKGAVQQ
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG
        V ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTA+ATGN+SQGG STPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDAE+DLDTAG
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG

XP_008460496.1 PREDICTED: THO complex subunit 1 [Cucumis melo] | 1.1e-140 | 69.57
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        +REEIKSCEERVKKLLEVTPPRGKDFL+KIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +DVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        +PAISEYWKPLAEDMDESAGIEAEYHHRNNR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAK+EEAKGAVQQ
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG
        V ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTA+ATGN+SQGG STPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDAE+DLDTAG
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG

XP_038892203.1 THO complex subunit 1 isoform X1 [Benincasa hispida] | 1.4e-143 | 71.36
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        +REEIKSCEERVK LLEVTPPRGKDFL+KIEHILERENNWVWWKRDGCPPFEKQPIEKKTT+DVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        TPAISEYWKPLAEDMDESAGIEAEYHHRNNR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAK AVQQ
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG
        V ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAE+DLDTAG
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG

XP_038892204.1 THO complex subunit 1 isoform X2 [Benincasa hispida] | 1.4e-143 | 71.36
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        +REEIKSCEERVK LLEVTPPRGKDFL+KIEHILERENNWVWWKRDGCPPFEKQPIEKKTT+DVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        TPAISEYWKPLAEDMDESAGIEAEYHHRNNR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAK AVQQ
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG
        V ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAE+DLDTAG
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG

XP_038892206.1 THO complex subunit 1 isoform X3 [Benincasa hispida] | 1.4e-143 | 71.36
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        +REEIKSCEERVK LLEVTPPRGKDFL+KIEHILERENNWVWWKRDGCPPFEKQPIEKKTT+DVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        TPAISEYWKPLAEDMDESAGIEAEYHHRNNR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAK AVQQ
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG
        V ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAE+DLDTAG
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KTL0 Uncharacterized protein | 1.2e-140 | 69.31
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        +REEIKSCEERVKKLLEVTPPRGKDFL+KIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +DVTKKRRPRWRLGNKELSQLWKW+DQNPNALTDPQRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        +PAIS+YWKPLAEDMDESAGIEAEYHHRNNR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG
        V ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTA+ATGN+SQGG STPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDAE+DLDTAG
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG

A0A1S3CC63 THO complex subunit 1 | 5.2e-141 | 69.57
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        +REEIKSCEERVKKLLEVTPPRGKDFL+KIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +DVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        +PAISEYWKPLAEDMDESAGIEAEYHHRNNR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAK+EEAKGAVQQ
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG
        V ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTA+ATGN+SQGG STPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDAE+DLDTAG
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG

A0A5A7V9M3 THO complex subunit 1 | 5.2e-141 | 69.57
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        +REEIKSCEERVKKLLEVTPPRGKDFL+KIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +DVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        +PAISEYWKPLAEDMDESAGIEAEYHHRNNR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAK+EEAKGAVQQ
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG
        V ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTA+ATGN+SQGG STPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDAE+DLDTAG
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG

A0A6J1CC37 THO complex subunit 1 | 1.1e-135 | 67.26
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        +REEIKSCEERVKKLLEVTPP+GK+FL+KIEHILERENNWVWWKRDGCPPFEKQP EKKT++D TKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        TPAISEYWKPLAEDMDESAGIEAEYHHRNNR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEE KG+V+Q
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG
        V ENQMATPA+ENDGEGTRSD DGPSA MDVDT VA GN+SQGGT TP+ENK SSDTDIGQE+GQLEADAEVE GMIDGETDAE+DLDTAG
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG

A0A6J1G1S6 THO complex subunit 1 | 2.3e-136 | 67.52
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        LREEIKSCEERVKKLLEVTPPRGK+FL+KIEHILERENNWVWWKRDGCPPFEKQPIEKKTT D TKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        TPAISEYWKPLAEDMDESAGIEAEYHH++NR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPNERSKRAKKEE KGAV+Q
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG
          ENQMATPA+ENDGEGTRSDPDGPS  MD DT VA G++SQGGT TPEENK SSDTDIGQEAGQLEADAEVE GMIDGETDAE+DLDTAG
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG

SwissProt top hits | e value | % identity | Alignment
F4HYJ7 DExH-box ATP-dependent RNA helicase DExH3 | 8.0e-14 | 40.78
Query:  MEPILTVVARLNVRDSFLTPLKKKDILRPVKAIDSLR---------------KVVEKNFGGYDIYWKNFLSVQSVKAIDSLKKEFFSLLKDIGLREEIKS
        ++P++TVVA L+VRD FL P  KKD+    ++  S R               K  E+   GYD  WKNFLS Q++KA+DS++K+FF+LLK+  L + I+ 
Subjt:  MEPILTVVARLNVRDSFLTPLKKKDILRPVKAIDSLR---------------KVVEKNFGGYDIYWKNFLSVQSVKAIDSLKKEFFSLLKDIGLREEIKS

Query:  CEE
        C +
Subjt:  CEE

F4IM84 DExH-box ATP-dependent RNA helicase DExH5, mitochondrial | 7.3e-15 | 51.06
Query:  MEPILTVVARLNVRDSFLTPLKKKDI---------------LRPVKAIDSLRKVVEKNFGGYDIYWKNFLSVQSVKAIDSLKKEFFSLLKDIGL
        ++PILTV A L+VRD FLTP  KKD+               L  V+A +  +K  E++   YD  WKNFLS+QS++AIDSL+KEFFSLLKD GL
Subjt:  MEPILTVVARLNVRDSFLTPLKKKDI---------------LRPVKAIDSLRKVVEKNFGGYDIYWKNFLSVQSVKAIDSLKKEFFSLLKDIGL

Q8R3N6 THO complex subunit 1 | 4.0e-13 | 31.39
Query:  IKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKR-----------RPRWRLGNKELSQLWKWADQNPNAL
        I+   + V +LL   PP G+ F + +EHIL  E NW  WK +GCP F K+       + V +KR             +  +GN+EL++LW     N  A 
Subjt:  IKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKR-----------RPRWRLGNKELSQLWKWADQNPNAL

Query:  TDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNN
            R   P + E+++   E  D    +E+EY   NN
Subjt:  TDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNN

Q93VM9 THO complex subunit 1 | 3.8e-88 | 47.92
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        ++EE+KSCE+RVKKLLE+TPP+GK+FLR +EHILERE NWVWWKRDGCPPFEKQPI+KK+ +   KKRR RWRLGNKELSQLW+WADQNPNALTD QRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        TP I++YWKPLAEDMD SAGIE EYHH+NNR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQAKPNE++KRAKKEE KG   +
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMIDGETD
           NQ+    +E + EG R D +   +    D            T TPEE +    SDT+ GQEAGQ+E     E G++D + D
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMIDGETD

Q96FV9 THO complex subunit 1 | 6.8e-13 | 32.12
Query:  IKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQ-PIEKKTTSDVTKKRRP----------RWRLGNKELSQLWKWADQNPNAL
        I+   + V +LL   PP G+ F + +EHIL  E NW  WK +GCP F K+   + K T  + K+  P          +  +GN+EL++LW     N  A 
Subjt:  IKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQ-PIEKKTTSDVTKKRRP----------RWRLGNKELSQLWKWADQNPNAL

Query:  TDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNN
            R   P + E+++   E  D    +E EY   NN
Subjt:  TDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNN

Arabidopsis top hits | e value | % identity | Alignment
AT1G48650.1 DEA(D/H)-box RNA helicase family protein | 5.7e-15 | 40.78
Query:  MEPILTVVARLNVRDSFLTPLKKKDILRPVKAIDSLR---------------KVVEKNFGGYDIYWKNFLSVQSVKAIDSLKKEFFSLLKDIGLREEIKS
        ++P++TVVA L+VRD FL P  KKD+    ++  S R               K  E+   GYD  WKNFLS Q++KA+DS++K+FF+LLK+  L + I+ 
Subjt:  MEPILTVVARLNVRDSFLTPLKKKDILRPVKAIDSLR---------------KVVEKNFGGYDIYWKNFLSVQSVKAIDSLKKEFFSLLKDIGLREEIKS

Query:  CEE
        C +
Subjt:  CEE

AT1G48650.2 DEA(D/H)-box RNA helicase family protein | 5.7e-15 | 40.78
Query:  MEPILTVVARLNVRDSFLTPLKKKDILRPVKAIDSLR---------------KVVEKNFGGYDIYWKNFLSVQSVKAIDSLKKEFFSLLKDIGLREEIKS
        ++P++TVVA L+VRD FL P  KKD+    ++  S R               K  E+   GYD  WKNFLS Q++KA+DS++K+FF+LLK+  L + I+ 
Subjt:  MEPILTVVARLNVRDSFLTPLKKKDILRPVKAIDSLR---------------KVVEKNFGGYDIYWKNFLSVQSVKAIDSLKKEFFSLLKDIGLREEIKS

Query:  CEE
        C +
Subjt:  CEE

AT2G01130.1 DEA(D/H)-box RNA helicase family protein | 5.2e-16 | 51.06
Query:  MEPILTVVARLNVRDSFLTPLKKKDI---------------LRPVKAIDSLRKVVEKNFGGYDIYWKNFLSVQSVKAIDSLKKEFFSLLKDIGL
        ++PILTV A L+VRD FLTP  KKD+               L  V+A +  +K  E++   YD  WKNFLS+QS++AIDSL+KEFFSLLKD GL
Subjt:  MEPILTVVARLNVRDSFLTPLKKKDI---------------LRPVKAIDSLRKVVEKNFGGYDIYWKNFLSVQSVKAIDSLKKEFFSLLKDIGL

AT5G09860.1 nuclear matrix protein-related | 2.7e-89 | 47.92
Query:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR
        ++EE+KSCE+RVKKLLE+TPP+GK+FLR +EHILERE NWVWWKRDGCPPFEKQPI+KK+ +   KKRR RWRLGNKELSQLW+WADQNPNALTD QRVR
Subjt:  LREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVR

Query:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA
        TP I++YWKPLAEDMD SAGIE EYHH+NNR                                                                     
Subjt:  TPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLA

Query:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ
                                           VYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQAKPNE++KRAKKEE KG   +
Subjt:  RRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQ

Query:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMIDGETD
           NQ+    +E + EG R D +   +    D            T TPEE +    SDT+ GQEAGQ+E     E G++D + D
Subjt:  VGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMIDGETD

AT5G09860.2 nuclear matrix protein-related | 7.8e-89 | 48.17
Query:  EEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVRTP
        EE+KSCE+RVKKLLE+TPP+GK+FLR +EHILERE NWVWWKRDGCPPFEKQPI+KK+ +   KKRR RWRLGNKELSQLW+WADQNPNALTD QRVRTP
Subjt:  EEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVRTP

Query:  AISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLARR
         I++YWKPLAEDMD SAGIE EYHH+NNR                                                                       
Subjt:  AISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGPSNREPLLARR

Query:  LVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVG
                                         VYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQAKPNE++KRAKKEE KG   +  
Subjt:  LVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVG

Query:  ENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMIDGETD
         NQ+    +E + EG R D +   +    D            T TPEE +    SDT+ GQEAGQ+E     E G++D + D
Subjt:  ENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMIDGETD


Sequences
CDS sequence
ATGGGCATTCAATCAAATGTTCAAGAACATGTTGCTGTCCTTCAAGAACATCAAGGCAACCCATATCGAGTCCAAAATCAAGTGGTCATGCTGCAAGAACATCAA
GATAGGAGTTGTTATTTGACTATGCTCCCAATGGAACCAATTTTGACCGTTGTTGCCCGTCTTAATGTGAGAGACTCTTTTCTAACACCACTTAAGAAGAAGGAT
ATTTTACGACCAGTGAAGGCTATTGATTCACTAAGGAAAGTTGTTGAGAAGAACTTTGGTGGATATGACATCTATTGGAAGAACTTTCTTTCTGTACAATCAGTG
AAAGCTATTGATTCACTAAAGAAGGAGTTCTTTTCTTTGCTTAAAGATATTGGCTTGAGAGAAGAAATTAAATCTTGTGAAGAGCGCGTGAAGAAGTTGCTTGAG
GTGACACCACCTAGAGGGAAAGATTTCCTTCGGAAGATTGAGCATATATTAGAACGTGAAAATAATTGGGTATGGTGGAAACGTGATGGTTGCCCTCCTTTTGAA
AAGCAGCCAATTGAAAAGAAAACCACCAGTGATGTAACTAAAAAACGGAGGCCAAGGTGGAGATTGGGGAATAAAGAACTCTCTCAATTGTGGAAATGGGCAGAC
CAGAATCCGAATGCCTTGACTGATCCTCAACGTGTTCGTACTCCTGCAATTTCTGAGTACTGGAAACCCTTGGCAGAAGATATGGACGAGTCTGCTGGGATTGAA
GCTGAATATCATCATAGAAACAACCGAAAAGAGGAGTCGATCAAGGTGGGTGCATTGGAGAGACCTCCAACCTTGTCAGGGAAGGTACGACTTAGCCTGTCCTCT
ACGCAACAAAGAAAACAGTTATTGCAGAGGAGAGGAGAGAAAGTCATGGATTTGCCAAGGGGAGAAGAAAGGCGTGTTCATGGGCCTAATTGTGCTATTGGGCCT
TCTAATAGGGAGCCCCTATTGGCAAGAAGGTTAGTAAAGTGGTTACTGGCCCAGTCGAATCAGTCATCTAAAAAGGGCCCTTTATTGGTTGTGTCAAATGATAAG
CTGAAGAAGGCAGACAATTTGGTGGTTTATTGCTGGAAAGGCCTTCGGTTTTCAGCTCGGCAGGACTTAGAAGGATTCTCTCGATTCACCGACCATGGCATTGAA
GGTGTTGTGCCATTGGAACTCCTGCCACCTGATGTACGAGCCAAATACCAAGCCAAACCAAATGAGAGATCTAAACGTGCTAAGAAGGAAGAAGCAAAAGGGGCA
GTCCAGCAAGTTGGAGAAAATCAGATGGCAACTCCAGCTACTGAGAATGATGGTGAAGGAACCAGAAGTGACCCTGATGGGCCATCAGCAGGGATGGATGTCGAT
ACAGCTGTTGCTACTGGCAATCTATCTCAAGGTGGTACTTCTACTCCAGAAGAAAATAAACTAAGCTCTGACACGGATATTGGTCAAGAGGCAGGCCAGTTGGAA
GCTGATGCTGAGGTAGAAACGGGCATGATTGATGGCGAGACAGATGCAGAGATTGATTTGGATACTGCAGGTTGA
mRNA sequence
ATGGGCATTCAATCAAATGTTCAAGAACATGTTGCTGTCCTTCAAGAACATCAAGGCAACCCATATCGAGTCCAAAATCAAGTGGTCATGCTGCAAGAACATCAA
GATAGGAGTTGTTATTTGACTATGCTCCCAATGGAACCAATTTTGACCGTTGTTGCCCGTCTTAATGTGAGAGACTCTTTTCTAACACCACTTAAGAAGAAGGAT
ATTTTACGACCAGTGAAGGCTATTGATTCACTAAGGAAAGTTGTTGAGAAGAACTTTGGTGGATATGACATCTATTGGAAGAACTTTCTTTCTGTACAATCAGTG
AAAGCTATTGATTCACTAAAGAAGGAGTTCTTTTCTTTGCTTAAAGATATTGGCTTGAGAGAAGAAATTAAATCTTGTGAAGAGCGCGTGAAGAAGTTGCTTGAG
GTGACACCACCTAGAGGGAAAGATTTCCTTCGGAAGATTGAGCATATATTAGAACGTGAAAATAATTGGGTATGGTGGAAACGTGATGGTTGCCCTCCTTTTGAA
AAGCAGCCAATTGAAAAGAAAACCACCAGTGATGTAACTAAAAAACGGAGGCCAAGGTGGAGATTGGGGAATAAAGAACTCTCTCAATTGTGGAAATGGGCAGAC
CAGAATCCGAATGCCTTGACTGATCCTCAACGTGTTCGTACTCCTGCAATTTCTGAGTACTGGAAACCCTTGGCAGAAGATATGGACGAGTCTGCTGGGATTGAA
GCTGAATATCATCATAGAAACAACCGAAAAGAGGAGTCGATCAAGGTGGGTGCATTGGAGAGACCTCCAACCTTGTCAGGGAAGGTACGACTTAGCCTGTCCTCT
ACGCAACAAAGAAAACAGTTATTGCAGAGGAGAGGAGAGAAAGTCATGGATTTGCCAAGGGGAGAAGAAAGGCGTGTTCATGGGCCTAATTGTGCTATTGGGCCT
TCTAATAGGGAGCCCCTATTGGCAAGAAGGTTAGTAAAGTGGTTACTGGCCCAGTCGAATCAGTCATCTAAAAAGGGCCCTTTATTGGTTGTGTCAAATGATAAG
CTGAAGAAGGCAGACAATTTGGTGGTTTATTGCTGGAAAGGCCTTCGGTTTTCAGCTCGGCAGGACTTAGAAGGATTCTCTCGATTCACCGACCATGGCATTGAA
GGTGTTGTGCCATTGGAACTCCTGCCACCTGATGTACGAGCCAAATACCAAGCCAAACCAAATGAGAGATCTAAACGTGCTAAGAAGGAAGAAGCAAAAGGGGCA
GTCCAGCAAGTTGGAGAAAATCAGATGGCAACTCCAGCTACTGAGAATGATGGTGAAGGAACCAGAAGTGACCCTGATGGGCCATCAGCAGGGATGGATGTCGAT
ACAGCTGTTGCTACTGGCAATCTATCTCAAGGTGGTACTTCTACTCCAGAAGAAAATAAACTAAGCTCTGACACGGATATTGGTCAAGAGGCAGGCCAGTTGGAA
GCTGATGCTGAGGTAGAAACGGGCATGATTGATGGCGAGACAGATGCAGAGATTGATTTGGATACTGCAGGTTGA
Protein sequence
MGIQSNVQEHVAVLQEHQGNPYRVQNQVVMLQEHQDRSCYLTMLPMEPILTVVARLNVRDSFLTPLKKKDILRPVKAIDSLRKVVEKNFGGYDIYWKNFLSVQSV
KAIDSLKKEFFSLLKDIGLREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWAD
QNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRKEESIKVGALERPPTLSGKVRLSLSSTQQRKQLLQRRGEKVMDLPRGEERRVHGPNCAIGP
SNREPLLARRLVKWLLAQSNQSSKKGPLLVVSNDKLKKADNLVVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGA
VQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG
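
The protein sequence above is expected to be the translation of the CDS sequence. The following is a minimal Python sketch of that translation, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical and not part of this record, and the short inputs are only the first ten and last three codons of the CDS.

```python
# Standard genetic code (NCBI translation table 1) with bases in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS string codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        # Codons containing anything other than A, C, G, T raise KeyError.
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGGCATTCAATCAAATGTTCAAGAACAT"))
# Last three codons of the CDS listed above, ending in the TGA stop codon.
print(translate("GCAGGTTGA"))
```

The first call prints MGIQSNVQEH and the second prints AG, matching the start and end of the protein sequence above. Passing the full CDS, with its lines joined, to `translate` allows the whole protein to be compared in the same way.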