; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi10G001230 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi10G001230
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionTHO complex subunit 1
Genome locationchr10:1883382..1930808
RNA-Seq ExpressionLsi10G001230
SyntenyLsi10G001230
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0006506 - GPI anchor biosynthetic process (biological process)
GO:0032784 - regulation of DNA-templated transcription, elongation (biological process)
GO:0097502 - mannosylation (biological process)
GO:0000445 - THO complex part of transcription export complex (cellular component)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004376 - glycolipid mannosyltransferase activity (molecular function)
GO:0051751 - alpha-1,4-mannosyltransferase activity (molecular function)
InterPro domainsIPR021861 - THO complex, subunit THOC1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN51016.1 hypothetical protein Csa_007827 [Cucumis sativus]0.0e+0095.39Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPENFALQ VQDVI+PQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEP+MQYGMSIDEK TSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        +PPDGFSIDFNFYKTFWSLQE+FCNPASLALA TKWQKFTSSLMV+L+TFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGKDFL+KIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +DVTKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKW+DQNPNALTDPQRVR+PAIS+YWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
        ERSKRAKKEEAKGAVQQV ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTA+ATGN+SQGG STPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDA
Subjt:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA

Query:  EIDLDTAG
        E+DLDTAG
Subjt:  EIDLDTAG

XP_008460496.1 PREDICTED: THO complex subunit 1 [Cucumis melo]0.0e+0095.39Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPENFALQTVQDVI+PQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEP+MQYGMSIDEK TSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        + PDGFSIDFNFYKTFWSLQE+FCNPASLALA TKWQKF  SLMV+L+TFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGKDFL+KIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +DVTKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVR+PAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
        ERSKRAK+EEAKGAVQQV ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTA+ATGN+SQGG STPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDA
Subjt:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA

Query:  EIDLDTAG
        E+DLDTAG
Subjt:  EIDLDTAG

XP_011655214.1 THO complex subunit 1 [Cucumis sativus]0.0e+0095.39Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPENFALQ VQDVI+PQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEP+MQYGMSIDEK TSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        +PPDGFSIDFNFYKTFWSLQE+FCNPASLALA TKWQKFTSSLMV+L+TFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGKDFL+KIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +DVTKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKW+DQNPNALTDPQRVR+PAIS+YWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
        ERSKRAKKEEAKGAVQQV ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTA+ATGN+SQGG STPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDA
Subjt:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA

Query:  EIDLDTAG
        E+DLDTAG
Subjt:  EIDLDTAG

XP_038892203.1 THO complex subunit 1 isoform X1 [Benincasa hispida]0.0e+0097.2Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQTG PENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSS VQSTEPIMQYG+SIDEK TSQGHIPRLLD+VLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASLALA TKWQKFTSSLMV+L+TFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVK LLEVTPPRGKDFL+KIEHILERENNWVWWKRDGCPPFEKQPIEKKTT+DVTKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
        ERSKRAKKEEAK AVQQV ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
Subjt:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA

Query:  EIDLDTAG
        E+DLDTAG
Subjt:  EIDLDTAG

XP_038892204.1 THO complex subunit 1 isoform X2 [Benincasa hispida]0.0e+0096.05Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQTG PENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSS VQSTEPIMQYG+SIDEK TSQ       D+VLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASLALA TKWQKFTSSLMV+L+TFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVK LLEVTPPRGKDFL+KIEHILERENNWVWWKRDGCPPFEKQPIEKKTT+DVTKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
        ERSKRAKKEEAK AVQQV ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
Subjt:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA

Query:  EIDLDTAG
        E+DLDTAG
Subjt:  EIDLDTAG

TrEMBL top hitse value%identityAlignment
A0A0A0KTL0 Uncharacterized protein0.0e+0095.39Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPENFALQ VQDVI+PQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEP+MQYGMSIDEK TSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        +PPDGFSIDFNFYKTFWSLQE+FCNPASLALA TKWQKFTSSLMV+L+TFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGKDFL+KIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +DVTKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKW+DQNPNALTDPQRVR+PAIS+YWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
        ERSKRAKKEEAKGAVQQV ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTA+ATGN+SQGG STPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDA
Subjt:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA

Query:  EIDLDTAG
        E+DLDTAG
Subjt:  EIDLDTAG

A0A1S3CC63 THO complex subunit 10.0e+0095.39Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPENFALQTVQDVI+PQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEP+MQYGMSIDEK TSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        + PDGFSIDFNFYKTFWSLQE+FCNPASLALA TKWQKF  SLMV+L+TFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGKDFL+KIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +DVTKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVR+PAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
        ERSKRAK+EEAKGAVQQV ENQMATPA+ENDGEGTRSDPDGPSAGMDVDTA+ATGN+SQGG STPEENKLSSDTDIGQEAGQLEADAEVE GMIDGETDA
Subjt:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA

Query:  EIDLDTAG
        E+DLDTAG
Subjt:  EIDLDTAG

A0A6J1CC37 THO complex subunit 10.0e+0093.26Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQT PPENFALQTVQ+VI+PQKHTKLAQDENQLLENILRRLLQELVSSAVQS EP MQYGMSID+K TSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASL+LA TKWQKFTSSLM++L+TFDAQPLSDEEGDANILEEE+A+FSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSET+REEIKSCEERVKKLLEVTPP+GK+FL+KIEHILERENNWVWWKRDGCPPFEKQP EKKT++D TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
        ERSKRAKKEE KG+V+QV ENQMATPA+ENDGEGTRSD DGPSA MDVDT VA GN+SQGGT TP+ENK SSDTDIGQE+GQLEADAEVE GMIDGETDA
Subjt:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA

Query:  EIDLDTAG
        E+DLDTAG
Subjt:  EIDLDTAG

A0A6J1G1S6 THO complex subunit 10.0e+0093.59Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQTGPPENFALQTVQDVIKPQK TKLAQDENQLLENILRRLLQELVSSA QS EPIMQYGMSID+  TSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASL LA TKWQKFTSSL V+L+TFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSET+REEIKSCEERVKKLLEVTPPRGK+FL+KIEHILERENNWVWWKRDGCPPFEKQPIEKKTT D TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
        ERSKRAKKEE KGAV+Q  ENQMATPA+ENDGEGTRSDPDGPS  MD DT VA G++SQGGT TPEENK SSDTDIGQEAGQLEADAEVE GMIDGETDA
Subjt:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA

Query:  EIDLDTAG
        E+DLDTAG
Subjt:  EIDLDTAG

A0A6J1HPE8 THO complex subunit 10.0e+0093.26Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQT PPENFALQTVQDVIKPQK TKLAQDENQLLENILRRLLQELVSSA QS EPIMQYGMSID+  T+QGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASL LA TKWQKFTSSL V+L+TFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD PSETMREEIKSCEERVKKLLEVTPPRGK+FL+KIEHILERENNWVWWKRDGCPPFEKQPIEKKTT D TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA
        ERSKRAKKEE KGAV+Q  ENQMATPA+ENDGEGTRSDPDGPS  MD DT VA G++SQGGT TPEENK SSDTDIGQEAGQLEADAEVE GMIDGE DA
Subjt:  ERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDA

Query:  EIDLDTAG
        E+DLDTAG
Subjt:  EIDLDTAG

SwissProt top hitse value%identityAlignment
P59924 THO complex subunit 14.0e-0430.43Show/hide
Query:  KQPIEKKTTSDVTKKRRPRWR--LGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSA
        K+ + K++  +    +RP  +  +GN+EL++LW     N  A     R   P + E+++   E  D    +E+EY   NN  Y W  LRF A
Subjt:  KQPIEKKTTSDVTKKRRPRWR--LGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSA

Q8R3N6 THO complex subunit 18.2e-5833.82Show/hide
Query:  FQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET------
        F LL D+ +   L  C  IF ++E           ++ GK  +LR CN LLRRLSK+ + VFCGRI +FLA  FPLSE+S +N++  FN  N T      
Subjt:  FQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET------

Query:  -------KYEKEPPDGFS------------------IDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESA-
               K+ ++  +G                    ID+N Y+ FWSLQ+YF NP      ++ W+ F      +L  F +  L D +     +EE    
Subjt:  -------KYEKEPPDGFS------------------IDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESA-

Query:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKNEKDTPSETMREE--IKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWK
            +  K+LTS KLM L+L D +FRRH+L+Q LILF YLK   K +      T  +   I+   + V +LL   PP G+ F + +EHIL  E NW  WK
Subjt:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKNEKDTPSETMREE--IKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWK

Query:  RDGCPPFEKQPIEKKTTSDVTKKR-----------RPRWRLGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVY
         +GCP F K+       + V +KR             +  +GN+EL++LW     N  A     R   P + E+++   E  D    +E+EY   NN  Y
Subjt:  RDGCPPFEKQPIEKKTTSDVTKKR-----------RPRWRLGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVY

Query:  CWKGLRFSARQ
         W+ LR  AR+
Subjt:  CWKGLRFSARQ

Q93VM9 THO complex subunit 11.7e-26074.34Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTS---QGHIPRLLDIVLYLCEKEHV
        M+ FR AILQ  P E FAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  ID+        G IP LLD+VLYLCEKEHV
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
        EGGMIFQLLEDLTEMST+KNCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
Subjt:  EGGMIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK

Query:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK+PP G S+DFNFYKTFWSLQEYFCNPASL  A TKWQKF+SSL V+L+TFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD PSETM+EE+KSCE+RVKKLLE+TPP+GK+FLR +EHILERE NWVWWKRDGCPPFEKQPI+KK+ +   KKRR RWRLGN
Subjt:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGN

Query:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA
        KELSQLW+WADQNPNALTD QRVRTP I++YWKPLAEDMD SAGIE EYHH+NNRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQA
Subjt:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA

Query:  KPNERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMID
        KPNE++KRAKKEE KG   +   NQ+    +E + EG R D +   +    D            T TPEE +    SDT+ GQEAGQ+E     E G++D
Subjt:  KPNERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMID

Query:  GETD
         + D
Subjt:  GETD

Q96FV9 THO complex subunit 11.4e-5734.31Show/hide
Query:  FQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY----
        F LL D+ +   L  C  IF ++E           ++ GK  +LR CN LLRRLSK+ + VFCGRI +FLA  FPLSE+S +N++  FN  N T +    
Subjt:  FQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY----

Query:  -------------------------EKEPPDGFS--IDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESA-
                                 ++E P   S  ID+N Y+ FWSLQ+YF NP      ++ W+ F      +L  F +  L D +     +EE    
Subjt:  -------------------------EKEPPDGFS--IDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESA-

Query:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKNEKDTPSETMREE--IKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWK
            +  K+LTS KLM L+L D +FRRH+L+Q LILF YLK   K +      T  +   I+   + V +LL   PP G+ F + +EHIL  E NW  WK
Subjt:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKNEKDTPSETMREE--IKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWK

Query:  RDGCPPFEKQ-PIEKKTTSDVTKKRRP----------RWRLGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVY
         +GCP F K+   + K T  + K+  P          +  +GN+EL++LW     N  A     R   P + E+++   E  D    +E EY   NN  Y
Subjt:  RDGCPPFEKQ-PIEKKTTSDVTKKRRP----------RWRLGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVY

Query:  CWKGLRFSARQ
         W+ LR  AR+
Subjt:  CWKGLRFSARQ

Q9URT2 Uncharacterized protein P25A2.034.0e-2825.25Show/hide
Query:  FQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGK-LVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNE-TKYEK
        F +LE+L ++ T+  C  ++ Y E++  ++ K  +  RG+  V+LR  N+LLRRLS+  +  FCGRI + L+  FP  ERS  N++G +NT +   K E 
Subjt:  FQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGK-LVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNE-TKYEK

Query:  EPPD---GFSIDFNFYK-------TFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFD-----------AQPLSDEEGDANILEEESAT----FSIK
         PP        D +++K        +W LQ    NP  L LA     KF  +    +  F+           + P  D    +++L E+  T    F  K
Subjt:  EPPD---GFSIDFNFYK-------TFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFD-----------AQPLSDEEGDANILEEESAT----FSIK

Query:  YLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKN-------EKDTPSETMREEIKSCEERVKKLLEVT--------PPRGKDFLRKIEHILERENN
        Y+ S  L   +L D  FR   ++Q +I+FD+L    K        EK T    +   I S +E   KL E++          R     R I+ I+  E N
Subjt:  YLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKN-------EKDTPSETMREEIKSCEERVKKLLEVT--------PPRGKDFLRKIEHILERENN

Query:  WVWWKRDGCPPFEKQPIEKKTTSDVTKKRRP--------RWRLGNKELSQLWKWADQNP-NALTDPQRVRTPAISEYWKPLAEDMDESAGI----EAEYH
        W  WK  GCP  EK  ++K    +  +  +         R+ +GN  LS+LW+ A +N  + L   +R R P+   +   +  D  E        +  +H
Subjt:  WVWWKRDGCPPFEKQPIEKKTTSDVTKKRRP--------RWRLGNKELSQLWKWADQNP-NALTDPQRVRTPAISEYWKPLAEDMDESAGI----EAEYH

Query:  HRNNRVYCWKGLRFSARQDLEGFS-------RFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPD
         ++     W+  R +    L+ FS           + IEG      + P +   +     E  +  ++ + +  V+   +N  +   T+ +G+  +++ +
Subjt:  HRNNRVYCWKGLRFSARQDLEGFS-------RFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPD

Query:  GPSAGMD
          S  ++
Subjt:  GPSAGMD

Arabidopsis top hitse value%identityAlignment
AT5G09860.1 nuclear matrix protein-related1.2e-26174.34Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTS---QGHIPRLLDIVLYLCEKEHV
        M+ FR AILQ  P E FAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  ID+        G IP LLD+VLYLCEKEHV
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
        EGGMIFQLLEDLTEMST+KNCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
Subjt:  EGGMIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK

Query:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK+PP G S+DFNFYKTFWSLQEYFCNPASL  A TKWQKF+SSL V+L+TFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD PSETM+EE+KSCE+RVKKLLE+TPP+GK+FLR +EHILERE NWVWWKRDGCPPFEKQPI+KK+ +   KKRR RWRLGN
Subjt:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGN

Query:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA
        KELSQLW+WADQNPNALTD QRVRTP I++YWKPLAEDMD SAGIE EYHH+NNRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQA
Subjt:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA

Query:  KPNERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMID
        KPNE++KRAKKEE KG   +   NQ+    +E + EG R D +   +    D            T TPEE +    SDT+ GQEAGQ+E     E G++D
Subjt:  KPNERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMID

Query:  GETD
         + D
Subjt:  GETD

AT5G09860.2 nuclear matrix protein-related5.0e-26074.34Show/hide
Query:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTS---QGHIPRLLDIVLYLCEKEHV
        M+ FR AILQ  P E FAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  ID+        G IP LLD+VLYLCEKEHV
Subjt:  MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
        EGGMIFQLLEDLTEMST+KNCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
Subjt:  EGGMIFQLLEDLTEMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK

Query:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK+PP G S+DFNFYKTFWSLQEYFCNPASL  A TKWQKF+SSL V+L+TFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD PSETM EE+KSCE+RVKKLLE+TPP+GK+FLR +EHILERE NWVWWKRDGCPPFEKQPI+KK+ +   KKRR RWRLGN
Subjt:  MQCLILFDYLKAPGKNEKDTPSETMREEIKSCEERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGN

Query:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA
        KELSQLW+WADQNPNALTD QRVRTP I++YWKPLAEDMD SAGIE EYHH+NNRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQA
Subjt:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA

Query:  KPNERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMID
        KPNE++KRAKKEE KG   +   NQ+    +E + EG R D +   +    D            T TPEE +    SDT+ GQEAGQ+E     E G++D
Subjt:  KPNERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDTAVATGNLSQGGTSTPEENKL--SSDTDIGQEAGQLEADAEVETGMID

Query:  GETD
         + D
Subjt:  GETD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGTTCCGGAAGGCCATATTGCAAACGGGGCCACCGGAAAATTTTGCTCTGCAGACAGTTCAAGATGTTATCAAACCTCAGAAGCACACAAAATTGGCACAAGA
TGAAAATCAGTTGCTGGAAAATATTCTACGGCGACTGCTTCAAGAATTAGTGTCATCTGCAGTTCAATCAACGGAGCCAATAATGCAGTATGGGATGTCTATTGATGAAA
AGGGAACTTCACAGGGTCATATTCCGCGTCTTCTTGACATTGTCCTATATCTTTGTGAGAAAGAGCATGTTGAAGGAGGCATGATATTCCAGCTGTTGGAGGACCTGACT
GAAATGTCAACATTGAAGAACTGCAAAGATATTTTTGGTTACATTGAGAGTAAACAAGACATATTGGGAAAGCAAGAGCTTTTTGCACGTGGAAAACTTGTCATGTTGAG
AACTTGCAATCAATTGCTCCGCCGTCTATCAAAGGCAAGTGATGTGGTGTTTTGTGGACGCATTCTGATGTTTCTAGCACATTTTTTCCCTTTATCTGAACGTTCTGCTG
TGAATATAAAAGGAGTGTTTAACACCTCCAACGAAACAAAATATGAGAAAGAACCCCCTGATGGCTTTTCTATTGATTTCAACTTTTACAAGACCTTTTGGAGTTTACAG
GAATACTTTTGTAATCCTGCCTCCCTTGCACTTGCTTTAACGAAGTGGCAAAAGTTCACATCCAGTTTAATGGTCATATTGGATACCTTTGATGCACAACCTTTGTCTGA
TGAAGAGGGAGATGCTAACATCTTGGAGGAAGAGTCGGCAACCTTTAGCATAAAATATCTTACTAGTAGTAAACTTATGGGTTTAGAATTGAAGGATCCAAGTTTTCGGC
GTCACGTTCTTATGCAGTGCCTTATATTGTTTGATTATCTAAAGGCCCCAGGAAAGAATGAGAAAGATACTCCATCTGAAACCATGAGAGAAGAAATTAAATCTTGTGAA
GAGCGCGTGAAGAAGTTGCTTGAGGTGACACCACCTAGAGGGAAAGATTTCCTTCGGAAGATTGAGCATATATTAGAACGTGAAAATAATTGGGTATGGTGGAAACGTGA
TGGTTGCCCTCCTTTTGAAAAGCAGCCAATTGAAAAGAAAACCACCAGTGATGTAACTAAAAAACGGAGGCCAAGGTGGAGATTGGGGAATAAAGAACTCTCTCAATTGT
GGAAATGGGCAGACCAGAATCCGAATGCCTTGACTGATCCTCAACGTGTTCGTACTCCTGCAATTTCTGAGTACTGGAAACCCTTGGCAGAAGATATGGACGAGTCTGCT
GGGATTGAAGCTGAATATCATCATAGAAACAACCGAGTTTATTGCTGGAAAGGCCTTCGGTTTTCAGCTCGGCAGGACTTAGAAGGATTCTCTCGATTCACCGACCATGG
CATTGAAGGTGTTGTGCCATTGGAACTCCTGCCACCTGATGTACGAGCCAAATACCAAGCCAAACCAAATGAGAGATCTAAACGTGCTAAGAAGGAAGAAGCAAAAGGGG
CAGTCCAGCAAGTTGGAGAAAATCAGATGGCAACTCCAGCTACTGAGAATGATGGTGAAGGAACCAGAAGTGACCCTGATGGGCCATCAGCAGGGATGGATGTCGATACA
GCTGTTGCTACTGGCAATCTATCTCAAGGTGGTACTTCTACTCCAGAAGAAAATAAACTAAGCTCTGACACGGATATTGGTCAAGAGGCAGGCCAGTTGGAAGCTGATGC
TGAGGTAGAAACGGGCATGATTGATGGCGAGACAGATGCAGAGATTGATTTGGATACTGCAGGTTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAGAAAAAAGAAAATTGCGAAAGTACCCTCTCATCGCGTTCTCTTCGTCTATCGGACTTCGAACTCGAAGCAATTCTGAGACAGAGAGTCGACAAGTTTTCTTG
TAGCTCATGCTTCGGTCTTCAGAACGAATAGGAAATACCCGGCAGTGCCAATTTCTGAATCATTGACATGGAAGAGTTCCGGAAGGCCATATTGCAAACGGGGCCACCGG
AAAATTTTGCTCTGCAGACAGTTCAAGATGTTATCAAACCTCAGAAGCACACAAAATTGGCACAAGATGAAAATCAGTTGCTGGAAAATATTCTACGGCGACTGCTTCAA
GAATTAGTGTCATCTGCAGTTCAATCAACGGAGCCAATAATGCAGTATGGGATGTCTATTGATGAAAAGGGAACTTCACAGGGTCATATTCCGCGTCTTCTTGACATTGT
CCTATATCTTTGTGAGAAAGAGCATGTTGAAGGAGGCATGATATTCCAGCTGTTGGAGGACCTGACTGAAATGTCAACATTGAAGAACTGCAAAGATATTTTTGGTTACA
TTGAGAGTAAACAAGACATATTGGGAAAGCAAGAGCTTTTTGCACGTGGAAAACTTGTCATGTTGAGAACTTGCAATCAATTGCTCCGCCGTCTATCAAAGGCAAGTGAT
GTGGTGTTTTGTGGACGCATTCTGATGTTTCTAGCACATTTTTTCCCTTTATCTGAACGTTCTGCTGTGAATATAAAAGGAGTGTTTAACACCTCCAACGAAACAAAATA
TGAGAAAGAACCCCCTGATGGCTTTTCTATTGATTTCAACTTTTACAAGACCTTTTGGAGTTTACAGGAATACTTTTGTAATCCTGCCTCCCTTGCACTTGCTTTAACGA
AGTGGCAAAAGTTCACATCCAGTTTAATGGTCATATTGGATACCTTTGATGCACAACCTTTGTCTGATGAAGAGGGAGATGCTAACATCTTGGAGGAAGAGTCGGCAACC
TTTAGCATAAAATATCTTACTAGTAGTAAACTTATGGGTTTAGAATTGAAGGATCCAAGTTTTCGGCGTCACGTTCTTATGCAGTGCCTTATATTGTTTGATTATCTAAA
GGCCCCAGGAAAGAATGAGAAAGATACTCCATCTGAAACCATGAGAGAAGAAATTAAATCTTGTGAAGAGCGCGTGAAGAAGTTGCTTGAGGTGACACCACCTAGAGGGA
AAGATTTCCTTCGGAAGATTGAGCATATATTAGAACGTGAAAATAATTGGGTATGGTGGAAACGTGATGGTTGCCCTCCTTTTGAAAAGCAGCCAATTGAAAAGAAAACC
ACCAGTGATGTAACTAAAAAACGGAGGCCAAGGTGGAGATTGGGGAATAAAGAACTCTCTCAATTGTGGAAATGGGCAGACCAGAATCCGAATGCCTTGACTGATCCTCA
ACGTGTTCGTACTCCTGCAATTTCTGAGTACTGGAAACCCTTGGCAGAAGATATGGACGAGTCTGCTGGGATTGAAGCTGAATATCATCATAGAAACAACCGAGTTTATT
GCTGGAAAGGCCTTCGGTTTTCAGCTCGGCAGGACTTAGAAGGATTCTCTCGATTCACCGACCATGGCATTGAAGGTGTTGTGCCATTGGAACTCCTGCCACCTGATGTA
CGAGCCAAATACCAAGCCAAACCAAATGAGAGATCTAAACGTGCTAAGAAGGAAGAAGCAAAAGGGGCAGTCCAGCAAGTTGGAGAAAATCAGATGGCAACTCCAGCTAC
TGAGAATGATGGTGAAGGAACCAGAAGTGACCCTGATGGGCCATCAGCAGGGATGGATGTCGATACAGCTGTTGCTACTGGCAATCTATCTCAAGGTGGTACTTCTACTC
CAGAAGAAAATAAACTAAGCTCTGACACGGATATTGGTCAAGAGGCAGGCCAGTTGGAAGCTGATGCTGAGGTAGAAACGGGCATGATTGATGGCGAGACAGATGCAGAG
ATTGATTTGGATACTGCAGGTTGATTTGTGCAGAAGCCTTTTAATAATATCTTGCAAGGTGCAAATGCGGCATCCCTTCCATTATATCGTTCTACTTCCTGGGGGTTCTG
AGCTATTGAAGTTTTGAATTGCACTCGTTTGTATAGGATATCAATGAATTTTATCTACTGTTTATAGGGGAAAAGAAGTAGAAGAATATCGAAGTTTTGATCCAACCTGT
ATTAATTCACGTTTTGCTTTCTGTGATATTTACAATACAGGGCAGTAACTGCCCCAGTTGGTTAGACATTATTTGCTACTAAAATTTAATCCGTCTTTGTTGTTCATTGG
TGATGTAAGAGCAAAGTTTGTAGTTCAATTTCACTATTCACTACATTTTTTACTATGGCTGACCTTTATATGTTGGAACTGTAGGAAATTCATTTTCACTTCCATAAAGT
AGTTGTATTATTTTTCTTGTT
Protein sequenceShow/hide protein sequence
MEEFRKAILQTGPPENFALQTVQDVIKPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPIMQYGMSIDEKGTSQGHIPRLLDIVLYLCEKEHVEGGMIFQLLEDLT
EMSTLKNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKEPPDGFSIDFNFYKTFWSLQ
EYFCNPASLALALTKWQKFTSSLMVILDTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKNEKDTPSETMREEIKSCE
ERVKKLLEVTPPRGKDFLRKIEHILERENNWVWWKRDGCPPFEKQPIEKKTTSDVTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESA
GIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEEAKGAVQQVGENQMATPATENDGEGTRSDPDGPSAGMDVDT
AVATGNLSQGGTSTPEENKLSSDTDIGQEAGQLEADAEVETGMIDGETDAEIDLDTAG