; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G16140 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G16140
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTHO complex subunit 1
Genome locationClcChr05:22128536..22167536
RNA-Seq ExpressionClc05G16140
SyntenyClc05G16140
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0006506 - GPI anchor biosynthetic process (biological process)
GO:0032784 - regulation of DNA-templated transcription, elongation (biological process)
GO:0097502 - mannosylation (biological process)
GO:0000445 - THO complex part of transcription export complex (cellular component)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004376 - glycolipid mannosyltransferase activity (molecular function)
GO:0051751 - alpha-1,4-mannosyltransferase activity (molecular function)
InterPro domainsIPR021861 - THO complex, subunit THOC1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN51016.1 hypothetical protein Csa_007827 [Cucumis sativus]0.0e+0096.55Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPENFALQ VQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEP+MQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        +PPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHIL+RENNWVWWKRDGC PFEKQPIEKKT NDV KKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKW+DQNPNALTDPQRVR+PAIS+YWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEEAKGAVQQV+ENQMATPASENDGEGTRSDPDGPS GMDVDTA+A GNVSQGG STPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTTG
        EVDLDT G
Subjt:  EVDLDTTG

XP_008460496.1 PREDICTED: THO complex subunit 1 [Cucumis melo]0.0e+0096.22Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEP+MQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK
        MIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        + PDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKF  SLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHIL+RENNWVWWKRDGC PFEKQPIEKKT NDV KKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVR+PAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA++EEAKGAVQQV+ENQMATPASENDGEGTRSDPDGPS GMDVDTA+A GNVSQGG STPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTTG
        EVDLDT G
Subjt:  EVDLDTTG

XP_011655214.1 THO complex subunit 1 [Cucumis sativus]0.0e+0096.55Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPENFALQ VQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEP+MQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        +PPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHIL+RENNWVWWKRDGC PFEKQPIEKKT NDV KKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKW+DQNPNALTDPQRVR+PAIS+YWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEEAKGAVQQV+ENQMATPASENDGEGTRSDPDGPS GMDVDTA+A GNVSQGG STPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTTG
        EVDLDT G
Subjt:  EVDLDTTG

XP_038892203.1 THO complex subunit 1 isoform X1 [Benincasa hispida]0.0e+0097.04Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQTG PENFALQTVQDVI+PQKHTKLAQDENQLLENILRRLLQELVSS VQSTEPIMQYG+SIDEKETSQGHIPRLLD+VLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        EPPDGFSIDFNFYKTFWSLQE+FCNPASLALA TKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVK LLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDV KKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEEAK AVQQVEENQMATPASENDGEGTRSDPDGPS GMDVDTAVA GN+SQGGTSTPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDA
Subjt:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTTG
        EVDLDT G
Subjt:  EVDLDTTG

XP_038892204.1 THO complex subunit 1 isoform X2 [Benincasa hispida]0.0e+0095.89Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQTG PENFALQTVQDVI+PQKHTKLAQDENQLLENILRRLLQELVSS VQSTEPIMQYG+SIDEKETSQ       D+VLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        EPPDGFSIDFNFYKTFWSLQE+FCNPASLALA TKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVK LLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDV KKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEEAK AVQQVEENQMATPASENDGEGTRSDPDGPS GMDVDTAVA GN+SQGGTSTPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDA
Subjt:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTTG
        EVDLDT G
Subjt:  EVDLDTTG

TrEMBL top hitse value%identityAlignment
A0A0A0KTL0 Uncharacterized protein0.0e+0096.55Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPENFALQ VQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEP+MQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        +PPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHIL+RENNWVWWKRDGC PFEKQPIEKKT NDV KKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKW+DQNPNALTDPQRVR+PAIS+YWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEEAKGAVQQV+ENQMATPASENDGEGTRSDPDGPS GMDVDTA+A GNVSQGG STPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTTG
        EVDLDT G
Subjt:  EVDLDTTG

A0A1S3CC63 THO complex subunit 10.0e+0096.22Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEP+MQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK
        MIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        + PDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKF  SLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHIL+RENNWVWWKRDGC PFEKQPIEKKT NDV KKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVR+PAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA++EEAKGAVQQV+ENQMATPASENDGEGTRSDPDGPS GMDVDTA+A GNVSQGG STPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTTG
        EVDLDT G
Subjt:  EVDLDTTG

A0A6J1CC37 THO complex subunit 10.0e+0094.41Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQT PPENFALQTVQ+VIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQS EP MQYGMSID+KETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        EPPDGFSIDFNFYKTFWSLQE+FCNPASL+LASTKWQKFTSSLM+VLNTFDAQPLSDEEGDANILEEE+A+FSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSET+REEIKSCEERVKKLLEVTPP+GK+FLQKIEHILERENNWVWWKRDGCPPFEKQP EKKT+ND  KKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEE KG+V+QVEENQMATPASENDGEGTRSD DGPS  MDVDT VA GNVSQGGT TP+ENK SSDTDIGQE+GQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTTG
        EVDLDT G
Subjt:  EVDLDTTG

A0A6J1G1S6 THO complex subunit 10.0e+0094.41Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQTGPPENFALQTVQDVI+PQK TKLAQDENQLLENILRRLLQELVSSA QS EPIMQYGMSID+ ETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        EPPDGFSIDFNFYKTFWSLQE+FCNPASL LASTKWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSET+REEIKSCEERVKKLLEVTPPRGK+FLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTT+D  KKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEE KGAV+Q EENQMATPASENDGEGTRSDPDGPS  MD DT VA G+VSQGGT TPEENK SSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTTG
        EVDLDT G
Subjt:  EVDLDTTG

A0A6J1HPE8 THO complex subunit 10.0e+0094.08Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQT PPENFALQTVQDVI+PQK TKLAQDENQLLENILRRLLQELVSSA QS EPIMQYGMSID+ ET+QGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        EPPDGFSIDFNFYKTFWSLQE+FCNPASL LASTKWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  EPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGK+FLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTT+D  KKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEE KGAV+Q EENQMATPASENDGEGTRSDPDGPS  MD DT VA G+VSQGGT TPEENK SSDTDIGQEAGQLEADAEVEPGMIDGE DA
Subjt:  ERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTTG
        EVDLDT G
Subjt:  EVDLDTTG

SwissProt top hitse value%identityAlignment
Q8R3N6 THO complex subunit 13.1e-5734.31Show/hide
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNET------
        F LL D+ +   L  C  IF ++E           ++ GK  +LR CN LLRRLSK+ + VFCGRI +FLA  FPLSE+S +N++  FN  N T      
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNET------

Query:  -------KYEKEPPDGFS------------------IDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-
               K+ ++  +G                    ID+N Y+ FWSLQ++F NP         W+ F      VL  F +  L D +     +EE    
Subjt:  -------KYEKEPPDGFS------------------IDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-

Query:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKNEKDTPSETMREE--IKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWK
            +  K+LTS KLM L+L D +FRRH+L+Q LILF YLK   K +      T  +   I+   + V +LL   PP G+ F + +EHIL  E NW  WK
Subjt:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKNEKDTPSETMREE--IKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWK

Query:  RDGCPPFEKQ-PIEKKTTNDVPKKRRP----------RWRLGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVY
         +GCP F K+   + K T  V K+  P          +  +GN+EL++LW     N  A     R   P + E+++   E  D    +E+EY   NN  Y
Subjt:  RDGCPPFEKQ-PIEKKTTNDVPKKRRP----------RWRLGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVY

Query:  CWKGLRFSARQ
         W+ LR  AR+
Subjt:  CWKGLRFSARQ

Q93VM9 THO complex subunit 16.8e-26274.67Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV
        M+ FR AILQ  P E FAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  ID+ +      G IP LLD+VLYLCEKEHV
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETK
        EGGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETK
Subjt:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETK

Query:  YEKEPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK+PP G S+DFNFYKTFWSLQE+FCNPASL  ASTKWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKEPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD PSETM+EE+KSCE+RVKKLLE+TPP+GK+FL+ +EHILERE NWVWWKRDGCPPFEKQPI+KK+ N   KKRR RWRLGN
Subjt:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGN

Query:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA
        KELSQLW+WADQNPNALTD QRVRTP I++YWKPLAEDMD SAGIE EYHH+NNRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQA
Subjt:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA

Query:  KPNERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVEPGMID
        KPNE++KRA+KEE KG   + E NQ+    SE + EG R D            A  M + +   T TPEE +    SDT+ GQEAGQ+E     E G++D
Subjt:  KPNERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVEPGMID

Query:  GETD
         + D
Subjt:  GETD

Q96FV9 THO complex subunit 15.3e-5734.31Show/hide
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKY----
        F LL D+ +   L  C  IF ++E           ++ GK  +LR CN LLRRLSK+ + VFCGRI +FLA  FPLSE+S +N++  FN  N T +    
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKY----

Query:  -------------------------EKEPPDGFS--IDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-
                                 ++E P   S  ID+N Y+ FWSLQ++F NP         W+ F      VL  F +  L D +     +EE    
Subjt:  -------------------------EKEPPDGFS--IDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-

Query:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKNEKDTPSETMREE--IKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWK
            +  K+LTS KLM L+L D +FRRH+L+Q LILF YLK   K +      T  +   I+   + V +LL   PP G+ F + +EHIL  E NW  WK
Subjt:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKNEKDTPSETMREE--IKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWK

Query:  RDGCPPFEKQ-PIEKKTTNDVPKKRRP----------RWRLGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVY
         +GCP F K+   + K T  + K+  P          +  +GN+EL++LW     N  A     R   P + E+++   E  D    +E EY   NN  Y
Subjt:  RDGCPPFEKQ-PIEKKTTNDVPKKRRP----------RWRLGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVY

Query:  CWKGLRFSARQ
         W+ LR  AR+
Subjt:  CWKGLRFSARQ

Q9URT2 Uncharacterized protein P25A2.037.5e-2726.15Show/hide
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGK-LVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNE-TKYEK
        F +LE+L ++ T+  C  ++ Y E++  ++ K  +  RG+  V+LR  N+LLRRLS+  +  FCGRI + L+  FP  ERS  N++G +N+ +   K E 
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGK-LVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNE-TKYEK

Query:  EPPD---GFSIDFNFYK-------TFWSLQEFFCNPASLALASTKWQKFTSSL--------MVVLNTF---DAQPLSDEEGDANILEEESAT----FSIK
         PP        D +++K        +W LQ    NP  L LAS    KF  +          ++ NTF    + P  D    +++L E+  T    F  K
Subjt:  EPPD---GFSIDFNFYK-------TFWSLQEFFCNPASLALASTKWQKFTSSL--------MVVLNTF---DAQPLSDEEGDANILEEESAT----FSIK

Query:  YLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKN-------EKDTPSETMREEIKSCEERVKKLLEVT--------PPRGKDFLQKIEHILERENN
        Y+ S  L   +L D  FR   ++Q +I+FD+L    K        EK T    +   I S +E   KL E++          R     + I+ I+  E N
Subjt:  YLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKN-------EKDTPSETMREEIKSCEERVKKLLEVT--------PPRGKDFLQKIEHILERENN

Query:  WVWWKRDGCPPFEKQPIEK----------KTTNDVPKKRRPRWRLGNKELSQLWKWADQNP-NALTDPQRVRTPAISEYWKPLAEDMDESAGI----EAE
        W  WK  GCP  EK  ++K          K   + P K   R+ +GN  LS+LW+ A +N  + L   +R R P+   +   +  D  E        +  
Subjt:  WVWWKRDGCPPFEKQPIEK----------KTTNDVPKKRRPRWRLGNKELSQLWKWADQNP-NALTDPQRVRTPAISEYWKPLAEDMDESAGI----EAE

Query:  YHHRNNRVYCWKGLRFSARQDLEGFS-------RFTDHGIEG------------------VVPLELLPPDVRAKYQAKPNERS-KRARKEEAKGAVQQVE
        +H ++     W+  R +    L+ FS           + IEG                  ++  E L  +++ +   K N ++     + +A+G + Q E
Subjt:  YHHRNNRVYCWKGLRFSARQDLEGFS-------RFTDHGIEG------------------VVPLELLPPDVRAKYQAKPNERS-KRARKEEAKGAVQQVE

Query:  ENQMATPASE--NDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENK-LSSDTDIGQEAGQ
        E + +    E  +  +  +  P  P+ G+D +           G S  + NK L S+ D   EA +
Subjt:  ENQMATPASE--NDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENK-LSSDTDIGQEAGQ

Arabidopsis top hitse value%identityAlignment
AT5G09860.1 nuclear matrix protein-related4.8e-26374.67Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV
        M+ FR AILQ  P E FAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  ID+ +      G IP LLD+VLYLCEKEHV
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETK
        EGGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETK
Subjt:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETK

Query:  YEKEPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK+PP G S+DFNFYKTFWSLQE+FCNPASL  ASTKWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKEPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD PSETM+EE+KSCE+RVKKLLE+TPP+GK+FL+ +EHILERE NWVWWKRDGCPPFEKQPI+KK+ N   KKRR RWRLGN
Subjt:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGN

Query:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA
        KELSQLW+WADQNPNALTD QRVRTP I++YWKPLAEDMD SAGIE EYHH+NNRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQA
Subjt:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA

Query:  KPNERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVEPGMID
        KPNE++KRA+KEE KG   + E NQ+    SE + EG R D            A  M + +   T TPEE +    SDT+ GQEAGQ+E     E G++D
Subjt:  KPNERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVEPGMID

Query:  GETD
         + D
Subjt:  GETD

AT5G09860.2 nuclear matrix protein-related2.0e-26174.67Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV
        M+ FR AILQ  P E FAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  ID+ +      G IP LLD+VLYLCEKEHV
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETK
        EGGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFN+SNETK
Subjt:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETK

Query:  YEKEPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK+PP G S+DFNFYKTFWSLQE+FCNPASL  ASTKWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKEPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD PSETM EE+KSCE+RVKKLLE+TPP+GK+FL+ +EHILERE NWVWWKRDGCPPFEKQPI+KK+ N   KKRR RWRLGN
Subjt:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGN

Query:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA
        KELSQLW+WADQNPNALTD QRVRTP I++YWKPLAEDMD SAGIE EYHH+NNRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQA
Subjt:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA

Query:  KPNERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVEPGMID
        KPNE++KRA+KEE KG   + E NQ+    SE + EG R D            A  M + +   T TPEE +    SDT+ GQEAGQ+E     E G++D
Subjt:  KPNERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDTAVAMGNVSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVEPGMID

Query:  GETD
         + D
Subjt:  GETD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGTTCCGGAAGGCCATATTGCAAACGGGGCCACCAGAAAATTTTGCTCTGCAGACAGTTCAAGATGTTATTAGACCTCAGAAGCACACAAAATTGGCACAAGA
TGAAAATCAGTTGCTGGAAAATATTCTACGACGACTACTTCAAGAATTAGTGTCATCTGCAGTTCAATCAACGGAGCCAATAATGCAGTATGGGATGTCTATTGATGAAA
AGGAAACTTCACAGGGTCATATTCCACGTCTTCTTGACATTGTCCTATATCTTTGCGAGAAAGAGCATGTTGAAGGAGGCATGATATTCCAGCTGTTGGAGGACCTGACT
GAAATGTCAACATTGAGGAACTGCAAAGATATTTTTGGTTACATTGAGAGTAAACAAGACATATTGGGAAAGCAAGAGCTTTTTGCACGCGGAAAACTTGTCATGTTGAG
AACGTGCAATCAATTGCTCCGTCGTCTATCAAAGGCAAGTGATGTGGTGTTTTGTGGACGCATTCTGATGTTTCTAGCACATTTTTTCCCTCTATCTGAACGTTCTGCTG
TGAATATAAAAGGAGTGTTTAACTCCTCCAACGAAACAAAATATGAGAAAGAACCCCCTGATGGCTTTTCTATTGATTTCAACTTTTACAAGACCTTTTGGAGTTTACAG
GAATTCTTTTGTAATCCTGCCTCCCTTGCACTCGCTTCAACGAAGTGGCAAAAGTTCACATCCAGTTTAATGGTCGTATTGAATACCTTTGATGCACAACCTTTGTCTGA
TGAAGAGGGAGATGCTAACATCTTGGAGGAAGAGTCAGCAACCTTTAGCATAAAATACCTTACTAGTAGCAAACTGATGGGTTTAGAATTGAAGGACCCAAGTTTTCGGC
GTCACGTTCTTATGCAGTGCCTTATATTGTTTGATTATCTAAAGGCCCCCGGAAAAAATGAGAAAGATACTCCATCTGAAACCATGAGAGAAGAAATTAAATCTTGTGAA
GAACGTGTGAAGAAGCTGCTTGAGGTGACACCACCCAGAGGGAAAGATTTCCTTCAGAAGATTGAGCATATATTAGAACGTGAAAATAATTGGGTATGGTGGAAACGTGA
TGGTTGCCCTCCTTTTGAAAAACAACCAATTGAAAAGAAAACCACCAATGATGTACCTAAAAAACGTAGACCAAGGTGGAGATTGGGGAATAAAGAACTCTCTCAATTGT
GGAAATGGGCAGATCAGAATCCGAATGCCTTGACTGATCCTCAACGTGTTCGTACTCCTGCAATTTCTGAGTATTGGAAACCCTTGGCAGAAGATATGGATGAGTCTGCT
GGGATTGAAGCTGAATATCATCATAGAAATAACCGAGTTTATTGCTGGAAAGGCCTTCGGTTTTCAGCTCGGCAGGATTTAGAAGGATTTTCTCGATTCACCGACCATGG
CATTGAAGGTGTCGTGCCATTGGAACTCCTGCCACCTGATGTACGAGCCAAATACCAAGCCAAGCCAAATGAGAGATCTAAACGTGCTAGAAAGGAAGAAGCAAAAGGGG
CAGTCCAGCAAGTTGAAGAAAATCAGATGGCAACTCCAGCCAGTGAGAATGATGGCGAAGGAACCAGAAGTGACCCTGATGGGCCATCGACAGGGATGGATGTTGATACA
GCTGTAGCCATGGGCAATGTATCTCAAGGTGGTACTTCAACTCCAGAAGAAAATAAACTAAGCTCTGACACAGATATTGGTCAAGAGGCAGGCCAGTTGGAAGCTGATGC
TGAGGTAGAGCCAGGCATGATTGATGGCGAGACAGATGCAGAGGTTGATTTGGATACTACAGGTTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAAAAGAGTTGTGCAAGAAAAAGAAGCGCAAAAACAAAATTAAGAAAAAAAAAGTTTTTCTCCAATTTTTTTTTATCTATTACAAATATCATTTAAAATAAAT
TCACAAAACAGAAAGAGTATAATTCAATTAGAAGATTGAGAAAGTACCCTCTCGTGGCGTTGTCTTCGTTTACAGGACCTCCGAACTCGAAGCTTTGCTGAGACTGAGAG
TCGACAAGTTTTCTGGTAGCTCATACTTTGGTCACCAGAACGCATAGTACTCGGCAGTGCCAATTTCTGAACCATCGACATGGAAGAGTTCCGGAAGGCCATATTGCAAA
CGGGGCCACCAGAAAATTTTGCTCTGCAGACAGTTCAAGATGTTATTAGACCTCAGAAGCACACAAAATTGGCACAAGATGAAAATCAGTTGCTGGAAAATATTCTACGA
CGACTACTTCAAGAATTAGTGTCATCTGCAGTTCAATCAACGGAGCCAATAATGCAGTATGGGATGTCTATTGATGAAAAGGAAACTTCACAGGGTCATATTCCACGTCT
TCTTGACATTGTCCTATATCTTTGCGAGAAAGAGCATGTTGAAGGAGGCATGATATTCCAGCTGTTGGAGGACCTGACTGAAATGTCAACATTGAGGAACTGCAAAGATA
TTTTTGGTTACATTGAGAGTAAACAAGACATATTGGGAAAGCAAGAGCTTTTTGCACGCGGAAAACTTGTCATGTTGAGAACGTGCAATCAATTGCTCCGTCGTCTATCA
AAGGCAAGTGATGTGGTGTTTTGTGGACGCATTCTGATGTTTCTAGCACATTTTTTCCCTCTATCTGAACGTTCTGCTGTGAATATAAAAGGAGTGTTTAACTCCTCCAA
CGAAACAAAATATGAGAAAGAACCCCCTGATGGCTTTTCTATTGATTTCAACTTTTACAAGACCTTTTGGAGTTTACAGGAATTCTTTTGTAATCCTGCCTCCCTTGCAC
TCGCTTCAACGAAGTGGCAAAAGTTCACATCCAGTTTAATGGTCGTATTGAATACCTTTGATGCACAACCTTTGTCTGATGAAGAGGGAGATGCTAACATCTTGGAGGAA
GAGTCAGCAACCTTTAGCATAAAATACCTTACTAGTAGCAAACTGATGGGTTTAGAATTGAAGGACCCAAGTTTTCGGCGTCACGTTCTTATGCAGTGCCTTATATTGTT
TGATTATCTAAAGGCCCCCGGAAAAAATGAGAAAGATACTCCATCTGAAACCATGAGAGAAGAAATTAAATCTTGTGAAGAACGTGTGAAGAAGCTGCTTGAGGTGACAC
CACCCAGAGGGAAAGATTTCCTTCAGAAGATTGAGCATATATTAGAACGTGAAAATAATTGGGTATGGTGGAAACGTGATGGTTGCCCTCCTTTTGAAAAACAACCAATT
GAAAAGAAAACCACCAATGATGTACCTAAAAAACGTAGACCAAGGTGGAGATTGGGGAATAAAGAACTCTCTCAATTGTGGAAATGGGCAGATCAGAATCCGAATGCCTT
GACTGATCCTCAACGTGTTCGTACTCCTGCAATTTCTGAGTATTGGAAACCCTTGGCAGAAGATATGGATGAGTCTGCTGGGATTGAAGCTGAATATCATCATAGAAATA
ACCGAGTTTATTGCTGGAAAGGCCTTCGGTTTTCAGCTCGGCAGGATTTAGAAGGATTTTCTCGATTCACCGACCATGGCATTGAAGGTGTCGTGCCATTGGAACTCCTG
CCACCTGATGTACGAGCCAAATACCAAGCCAAGCCAAATGAGAGATCTAAACGTGCTAGAAAGGAAGAAGCAAAAGGGGCAGTCCAGCAAGTTGAAGAAAATCAGATGGC
AACTCCAGCCAGTGAGAATGATGGCGAAGGAACCAGAAGTGACCCTGATGGGCCATCGACAGGGATGGATGTTGATACAGCTGTAGCCATGGGCAATGTATCTCAAGGTG
GTACTTCAACTCCAGAAGAAAATAAACTAAGCTCTGACACAGATATTGGTCAAGAGGCAGGCCAGTTGGAAGCTGATGCTGAGGTAGAGCCAGGCATGATTGATGGCGAG
ACAGATGCAGAGGTTGATTTGGATACTACAGGTTGATTTGTGCAGAAGCCCTTTGATAATATCATGCAAGGAAATATGGTCATTTTCACTTCCATAAAGAAGCTGTATTG
TTTTTCCTGTTTCGAGTGCTACGAAACTGCTTCTACCATCAAGTTGAGGTACTATTTTGAGATTGATATTCGACGTGAATCACATGGAAGGAAATTTATATAGGATTACT
ATGACCAGAAGCGACTTTCTTTGTTATTTTCAAACAATATGAGGGGAGTTTTTGAGGCTTTTTGCAGACGGCAACATAATTTATTACAGTTGGAATTTTTCATTACCCCA
CTGTTGTATCATCTGTAAATATGTTTCTGTGAAAGGAAATGGAGGATTTGTTTTGACGATGGTACTGTCATAGATGAATTGAAGGATGCAAATGCTGTGACATGAATTAA
TATTTCATTGATCTATTAAATATAGTGACGAGTCTGATGGAGCAATGTAGAAAACACTTGATTAGGAGTTCGTCCCTTCGCTGTAGTTTGCCGTCCTAAATCACGGCAAA
CCAATTGGTTGGATTTCACATTCTTTTAGGGGCTAAATTTGGGCAGATAACATGTTTTGAACATAATTGCATGTTGCACAGGAAACCTATATGGCAGATTGGTTACTCCG
CTCCATATGATCAATTATGAGAAATCAATGTTGCAAATGGGGCAAGGCAGGGAAGACACCACTTCTGAAGACGAAGTATGTTTTGAAAGGCTCACTGAATGCTTTTAAGG
GCCATGTACACTTTTTAATCCTTCAGATTTAGAACTTCTATTTCGTGCTTGAAATTTTAAAGTGAATTTTTTAATCTTTGAGATTTGCTTTAAATTTTAAGACTCTTTAA
TCTTTGAGGTTTGGAAAATAATTTTTAATGCTCCTAAGTTAGTGAGTAATTGTATTTCGCCAACCTTTTTTGTTAATTTAATATTTAATTTGAGGCTATGTGGCTTCTCA
TTTCTCG
Protein sequenceShow/hide protein sequence
MEEFRKAILQTGPPENFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGGMIFQLLEDLT
EMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNSSNETKYEKEPPDGFSIDFNFYKTFWSLQ
EFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKNEKDTPSETMREEIKSCE
ERVKKLLEVTPPRGKDFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTNDVPKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESA
GIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRARKEEAKGAVQQVEENQMATPASENDGEGTRSDPDGPSTGMDVDT
AVAMGNVSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTTG