CuGenDBv2

Tan0008179 (gene) of Snake gourd v1 genome

Gene ID: Tan0008179
Organism: Trichosanthes anguina (Snake gourd v1)
Description: THO complex subunit 1
Genome location: LG04:4171243..4186796
RNA-Seq Expression: Tan0008179
Synteny: Tan0008179
Gene Ontology terms:
GO:0006406 - mRNA export from nucleus (biological process)
GO:0006506 - GPI anchor biosynthetic process (biological process)
GO:0032784 - regulation of DNA-templated transcription, elongation (biological process)
GO:0097502 - mannosylation (biological process)
GO:0000445 - THO complex part of transcription export complex (cellular component)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004376 - glycolipid mannosyltransferase activity (molecular function)
GO:0051751 - alpha-1,4-mannosyltransferase activity (molecular function)
InterPro domains: IPR021861 - THO complex, subunit THOC1


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573787.1 THO complex subunit 1, partial [Cucurbita argyrosperma subsp. sororia]; e value: 0.0e+00; % identity: 97.03
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQTGPPE+FALQTVQDVI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD ETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLAS KWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC

Query:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKDVPSET+REEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEK++THD TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEETKGAVRQ EENQMATPASENDGEGTRSDPDGPSVAMDADT VA G+VSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTA
        EVDLDTA
Subjt:  EVDLDTA

XP_022139144.1 THO complex subunit 1 [Momordica charantia]; e value: 0.0e+00; % identity: 95.39
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQT PPE+FALQTVQ+VIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEP MQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASL+LAS KWQKFTSSLM+VLNTFDAQPLSDEEGDANILEEE+A+FSIKYLTSSKLMGLELKDPSFRRHVLVQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC

Query:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKDVPSET+REEIKSCEERVKKLLEVTPP+GKEFLQKIEHILERENNWVWWKRDGCPPFEKQP EK++++DGTKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEETKG+VRQ+EENQMATPASENDGEGTRSD DGPS AMD DT VA GNVSQGGTPTP+ENKQSSDTDIGQE+GQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTAG
        EVDLDTAG
Subjt:  EVDLDTAG

XP_022945746.1 THO complex subunit 1 [Cucurbita moschata]; e value: 0.0e+00; % identity: 97.04
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQTGPPE+FALQTVQDVI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD ETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLAS KWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC

Query:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKDVPSET+REEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEK++THD TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEETKGAVRQ EENQMATPASENDGEGTRSDPDGPSVAMDADT VA G+VSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTAG
        EVDLDTAG
Subjt:  EVDLDTAG

XP_022966961.1 THO complex subunit 1 [Cucurbita maxima]; e value: 0.0e+00; % identity: 96.71
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQT PPE+FALQTVQDVI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD ET+QGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLAS KWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC

Query:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEK++THD TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEETKGAVRQ EENQMATPASENDGEGTRSDPDGPSVAMDADT VA G+VSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGE DA
Subjt:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTAG
        EVDLDTAG
Subjt:  EVDLDTAG

XP_023542342.1 THO complex subunit 1 [Cucurbita pepo subsp. pepo]; e value: 0.0e+00; % identity: 96.88
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQTGPPE+FALQTVQDVI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD ETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLAS KWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC

Query:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKDVPSET+REEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEK++THD TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEETKGA+RQ EENQMATPASENDGEGTRSDPDGPSVAMDADT VA G+VSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTAG
        EVDLDTAG
Subjt:  EVDLDTAG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KTL0 Uncharacterized protein; e value: 0.0e+00; % identity: 93.59
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPE+FALQ VQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQS EP+MQYGMSID+KETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
        +PPDGFSIDFNFYKTFWSLQE+FCNPASL LAS KWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC

Query:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD+PSETMREEIKSCEERVKKLLEVTPPRGK+FLQKIEHIL+RENNWVWWKRDGC PFEKQPIEK++ +D TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKW+DQNPNALTDPQRVR+PAIS+YWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEE KGAV+Q++ENQMATPASENDGEGTRSDPDGPS  MD DT +ATGNVSQGG  TPEENK SSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTAG
        EVDLDTAG
Subjt:  EVDLDTAG

A0A1S3CC63 THO complex subunit 1; e value: 0.0e+00; % identity: 93.26
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPE+FALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQS EP+MQYGMSID+KETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
        + PDGFSIDFNFYKTFWSLQE+FCNPASL LAS KWQKF  SLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC

Query:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD+PSETMREEIKSCEERVKKLLEVTPPRGK+FLQKIEHIL+RENNWVWWKRDGC PFEKQPIEK++ +D TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVR+PAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA++EE KGAV+Q++ENQMATPASENDGEGTRSDPDGPS  MD DT +ATGNVSQGG  TPEENK SSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTAG
        EVDLDTAG
Subjt:  EVDLDTAG

A0A6J1CC37 THO complex subunit 1; e value: 0.0e+00; % identity: 95.39
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQT PPE+FALQTVQ+VIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEP MQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASL+LAS KWQKFTSSLM+VLNTFDAQPLSDEEGDANILEEE+A+FSIKYLTSSKLMGLELKDPSFRRHVLVQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC

Query:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKDVPSET+REEIKSCEERVKKLLEVTPP+GKEFLQKIEHILERENNWVWWKRDGCPPFEKQP EK++++DGTKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEETKG+VRQ+EENQMATPASENDGEGTRSD DGPS AMD DT VA GNVSQGGTPTP+ENKQSSDTDIGQE+GQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTAG
        EVDLDTAG
Subjt:  EVDLDTAG

A0A6J1G1S6 THO complex subunit 1; e value: 0.0e+00; % identity: 97.04
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQTGPPE+FALQTVQDVI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD ETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLAS KWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC

Query:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKDVPSET+REEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEK++THD TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEETKGAVRQ EENQMATPASENDGEGTRSDPDGPSVAMDADT VA G+VSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
Subjt:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTAG
        EVDLDTAG
Subjt:  EVDLDTAG

A0A6J1HPE8 THO complex subunit 1; e value: 0.0e+00; % identity: 96.71
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG
        MEEFRKAILQT PPE+FALQTVQDVI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD ET+QGHIPRLLDIVLYLCEKEHVEGG
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
        EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLAS KWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC
Subjt:  EPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQC

Query:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEK++THD TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKEL

Query:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN
        SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPN
Subjt:  SQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPN

Query:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA
        ERSKRA+KEETKGAVRQ EENQMATPASENDGEGTRSDPDGPSVAMDADT VA G+VSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGE DA
Subjt:  ERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDA

Query:  EVDLDTAG
        EVDLDTAG
Subjt:  EVDLDTAG

SwissProt top hits (e value, % identity, alignment)
P59924 THO complex subunit 1; e value: 2.3e-04; % identity: 31.52
Query:  KQPIEKRSTHDGTKKRRPRWR--LGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSA
        K+ + KRS  +    +RP  +  +GN+EL++LW     N  A     R   P + E+++   E  D    +E+EY   +N  Y W  LRF A
Subjt:  KQPIEKRSTHDGTKKRRPRWR--LGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSA

Q8R3N6 THO complex subunit 1; e value: 5.9e-56; % identity: 32.05
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET------
        F LL D+ +   L  C  IF ++E           ++ GK  +LR CN LLRRLSK+ + VFCGRI +FLA  FPLSE+S +N++  FN  N T      
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET------

Query:  -------KYEKEPPDGFS------------------IDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-
               K+ ++  +G                    ID+N Y+ FWSLQ+YF NP         W+ F      VL  F +  L D +     +EE    
Subjt:  -------KYEKEPPDGFS------------------IDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-

Query:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLVQCLILFDYLK--APGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWK
            +  K+LTS KLM L+L D +FRRH+L+Q LILF YLK     K+   V ++     I+   + V +LL   PP G+ F + +EHIL  E NW  WK
Subjt:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLVQCLILFDYLK--APGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWK

Query:  RDGCPPFEKQ---------PIEKRSTHDGTKKRRPRWR--LGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVY
         +GCP F K+          + KR+  +    + P  +  +GN+EL++LW     N  A     R   P + E+++   E  D    +E+EY   +N  Y
Subjt:  RDGCPPFEKQ---------PIEKRSTHDGTKKRRPRWR--LGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVY

Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRARKEETKGAVRQIEENQ
         W+ LR  AR+    F             LE +   + AK    P+E  K    E+ +     ++EN+
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRARKEETKGAVRQIEENQ

Q93VM9 THO complex subunit 1; e value: 1.6e-266; % identity: 75.66
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETS---QGHIPRLLDIVLYLCEKEHV
        M+ FR AILQ  P E+FAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  IDD +      G IP LLD+VLYLCEKEHV
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
        EGGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
Subjt:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK

Query:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK+PP G S+DFNFYKTFWSLQEYFCNPASLT AS KWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  VQCLILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD+PSETM+EE+KSCE+RVKKLLE+TPP+GKEFL+ +EHILERE NWVWWKRDGCPPFEKQPI+K+S + G KKRR RWRLGN
Subjt:  VQCLILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGN

Query:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA
        KELSQLW+WADQNPNALTD QRVRTP I++YWKPLAEDMD SAGIE EYHHK+NRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQA
Subjt:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA

Query:  KPNERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQ--SSDTDIGQEAGQLEADAEVEPGMID
        KPNE++KRA+KEETKG   + E NQ+    SE + EG R D +     M++D +          TPTPEE ++   SDT+ GQEAGQ+E     E G++D
Subjt:  KPNERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQ--SSDTDIGQEAGQLEADAEVEPGMID

Query:  GETD
         + D
Subjt:  GETD

Q96FV9 THO complex subunit 1; e value: 7.7e-56; % identity: 32.48
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY----
        F LL D+ +   L  C  IF ++E           ++ GK  +LR CN LLRRLSK+ + VFCGRI +FLA  FPLSE+S +N++  FN  N T +    
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY----

Query:  -------------------------EKEPPDGFS--IDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-
                                 ++E P   S  ID+N Y+ FWSLQ+YF NP         W+ F      VL  F +  L D +     +EE    
Subjt:  -------------------------EKEPPDGFS--IDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-

Query:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLVQCLILFDYLK--APGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWK
            +  K+LTS KLM L+L D +FRRH+L+Q LILF YLK     K+   V ++     I+   + V +LL   PP G+ F + +EHIL  E NW  WK
Subjt:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLVQCLILFDYLK--APGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWK

Query:  RDGCPPFEKQP---------IEKRSTHDGTKKRRPRWR--LGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVY
         +GCP F K+          I KR+  +    + P  +  +GN+EL++LW     N  A     R   P + E+++   E  D    +E EY   +N  Y
Subjt:  RDGCPPFEKQP---------IEKRSTHDGTKKRRPRWR--LGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVY

Query:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRARKEETKGAVRQIEENQ
         W+ LR  AR+    F             LE +   + AK    P+E  K    E+ +     ++EN+
Subjt:  CWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRARKEETKGAVRQIEENQ

Q9URT2 Uncharacterized protein P25A2.03; e value: 2.8e-29; % identity: 25.69
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGK-LVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNE-TKYEK
        F +LE+L ++ T+  C  ++ Y E++  ++ K  +  RG+  V+LR  N+LLRRLS+  +  FCGRI + L+  FP  ERS  N++G +NT +   K E 
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGK-LVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNE-TKYEK

Query:  EPPD---GFSIDFNFYK-------TFWSLQEYFCNPASLTLASPKWQKFTSSL--------MVVLNTF---DAQPLSDEEGDANILEEESAT----FSIK
         PP        D +++K        +W LQ    NP  L LAS    KF  +          ++ NTF    + P  D    +++L E+  T    F  K
Subjt:  EPPD---GFSIDFNFYK-------TFWSLQEYFCNPASLTLASPKWQKFTSSL--------MVVLNTF---DAQPLSDEEGDANILEEESAT----FSIK

Query:  YLTSSKLMGLELKDPSFRRHVLVQCLILFDYLKAPGK------------NEKDVPSETMREEIKSCEERVKK--LLEVTPPRGKEFLQKIEHILERENNW
        Y+ S  L   +L D  FR   ++Q +I+FD+L    K            N+  +P   + +E  S    + K     +   R     + I+ I+  E NW
Subjt:  YLTSSKLMGLELKDPSFRRHVLVQCLILFDYLKAPGK------------NEKDVPSETMREEIKSCEERVKK--LLEVTPPRGKEFLQKIEHILERENNW

Query:  VWWKRDGCPPFEKQPIEKRSTH---DGTKKR-----RPRWRLGNKELSQLWKWADQNP-NALTDPQRVRTPAISEYWKPLAEDMDESAGI----EAEYHH
          WK  GCP  EK  ++K +     +G KK      + R+ +GN  LS+LW+ A +N  + L   +R R P+   +   +  D  E        +  +H 
Subjt:  VWWKRDGCPPFEKQPIEKRSTH---DGTKKR-----RPRWRLGNKELSQLWKWADQNP-NALTDPQRVRTPAISEYWKPLAEDMDESAGI----EAEYHH

Query:  KSNRVYCWKGLRFSARQDLEGFS-------RFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDG
        +S     W+  R +    L+ FS           + IEG      + P +   +     E  +   + + +  V+   +N  +   ++ +G+  +++ + 
Subjt:  KSNRVYCWKGLRFSARQDLEGFS-------RFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDG

Query:  PSVAMD
         SV ++
Subjt:  PSVAMD

Arabidopsis top hits (e value, % identity, alignment)
AT5G09860.1 nuclear matrix protein-related; e value: 1.1e-267; % identity: 75.66
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETS---QGHIPRLLDIVLYLCEKEHV
        M+ FR AILQ  P E+FAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  IDD +      G IP LLD+VLYLCEKEHV
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
        EGGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
Subjt:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK

Query:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK+PP G S+DFNFYKTFWSLQEYFCNPASLT AS KWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  VQCLILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD+PSETM+EE+KSCE+RVKKLLE+TPP+GKEFL+ +EHILERE NWVWWKRDGCPPFEKQPI+K+S + G KKRR RWRLGN
Subjt:  VQCLILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGN

Query:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA
        KELSQLW+WADQNPNALTD QRVRTP I++YWKPLAEDMD SAGIE EYHHK+NRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQA
Subjt:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA

Query:  KPNERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQ--SSDTDIGQEAGQLEADAEVEPGMID
        KPNE++KRA+KEETKG   + E NQ+    SE + EG R D +     M++D +          TPTPEE ++   SDT+ GQEAGQ+E     E G++D
Subjt:  KPNERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQ--SSDTDIGQEAGQLEADAEVEPGMID

Query:  GETD
         + D
Subjt:  GETD

AT5G09860.2 nuclear matrix protein-related; e value: 3.6e-266; % identity: 75.66
Query:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETS---QGHIPRLLDIVLYLCEKEHV
        M+ FR AILQ  P E+FAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  IDD +      G IP LLD+VLYLCEKEHV
Subjt:  MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
        EGGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
Subjt:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK

Query:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK+PP G S+DFNFYKTFWSLQEYFCNPASLT AS KWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  VQCLILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD+PSETM EE+KSCE+RVKKLLE+TPP+GKEFL+ +EHILERE NWVWWKRDGCPPFEKQPI+K+S + G KKRR RWRLGN
Subjt:  VQCLILFDYLKAPGKNEKDVPSETMREEIKSCEERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGN

Query:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA
        KELSQLW+WADQNPNALTD QRVRTP I++YWKPLAEDMD SAGIE EYHHK+NRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQA
Subjt:  KELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESAGIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQA

Query:  KPNERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQ--SSDTDIGQEAGQLEADAEVEPGMID
        KPNE++KRA+KEETKG   + E NQ+    SE + EG R D +     M++D +          TPTPEE ++   SDT+ GQEAGQ+E     E G++D
Subjt:  KPNERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADTVVATGNVSQGGTPTPEENKQ--SSDTDIGQEAGQLEADAEVEPGMID

Query:  GETD
         + D
Subjt:  GETD


Sequences
CDS sequence
ATGGAAGAGTTCCGGAAGGCCATATTGCAAACGGGGCCACCAGAAAGTTTTGCTCTGCAGACAGTTCAAGATGTTATCAGACCTCAGAAGCACACAAAATTGGCCCAAGA
TGAAAATCAGTTGCTGGAAAATATTCTACGACGACTACTTCAAGAATTAGTGTCATCTGCAGTTCAATCAGCAGAGCCAATAATGCAGTATGGGATGTCTATTGATGACA
AGGAAACTTCACAGGGTCATATTCCACGTCTTCTTGACATTGTTCTGTATCTTTGTGAGAAAGAGCACGTCGAAGGAGGCATGATATTCCAGCTGTTGGAGGACCTGACT
GAAATGTCAACATTGAGGAACTGCAAAGATATTTTTGGTTACATTGAGAGTAAACAAGATATATTGGGAAAGCAAGAGCTTTTTGCTCGTGGAAAACTTGTCATGTTGAG
AACGTGCAATCAATTGCTTCGCCGTCTATCAAAGGCAAGTGATGTGGTGTTTTGTGGACGCATTCTGATGTTTCTAGCACATTTTTTCCCTTTATCTGAACGTTCGGCTG
TGAATATAAAAGGAGTGTTCAACACCTCCAATGAAACAAAATATGAGAAGGAGCCCCCTGATGGTTTTTCTATTGATTTCAACTTTTACAAGACCTTTTGGAGTTTACAG
GAATACTTTTGTAATCCTGCCTCCCTTACTCTTGCTTCACCGAAGTGGCAAAAGTTCACATCCAGTTTAATGGTCGTGTTGAATACCTTTGATGCACAACCTTTGTCCGA
TGAAGAGGGAGATGCTAACATTTTGGAGGAAGAGTCAGCAACCTTCAGCATAAAATATCTTACTAGTAGTAAACTGATGGGTTTAGAATTAAAGGATCCAAGTTTTCGAC
GCCACGTTCTTGTGCAGTGCCTTATATTGTTTGATTATCTAAAGGCCCCAGGAAAGAATGAGAAAGATGTTCCATCTGAAACCATGAGAGAAGAAATTAAATCTTGTGAG
GAGCGTGTGAAGAAGTTGCTAGAGGTGACACCGCCAAGAGGGAAAGAATTCCTTCAGAAGATTGAGCATATATTAGAACGTGAAAATAATTGGGTATGGTGGAAACGTGA
TGGTTGCCCTCCTTTTGAAAAACAACCAATTGAAAAGAGAAGCACCCATGATGGAACTAAAAAACGTAGGCCAAGGTGGAGATTGGGGAATAAAGAACTCTCTCAATTGT
GGAAGTGGGCTGATCAGAATCCGAATGCCTTGACTGATCCTCAACGTGTCCGTACTCCTGCAATTTCTGAGTATTGGAAACCCTTGGCAGAAGACATGGATGAGTCTGCT
GGAATTGAAGCTGAATATCATCATAAAAGTAACCGAGTTTATTGCTGGAAAGGCCTTCGGTTTTCAGCTCGGCAGGACTTAGAAGGATTTTCTCGATTTACCGACCATGG
CATTGAAGGAGTTGTGCCTTTGGAACTCTTGCCACCTGATGTACGAGCCAAATACCAAGCTAAACCAAATGAGAGATCTAAACGTGCTAGGAAGGAAGAAACAAAAGGGG
CTGTGCGTCAAATTGAAGAAAATCAGATGGCAACACCTGCTAGTGAGAATGATGGTGAAGGAACCAGAAGTGACCCTGATGGGCCATCAGTGGCAATGGATGCCGATACA
GTTGTTGCCACTGGAAATGTATCTCAAGGTGGTACTCCAACTCCAGAAGAAAATAAGCAAAGCTCTGACACAGATATTGGTCAAGAGGCAGGCCAGTTGGAAGCTGATGC
TGAGGTAGAGCCGGGTATGATTGACGGCGAGACAGATGCCGAGGTTGATTTGGATACTGCAGGATGA
mRNA sequence
ACTAAACCCAACTCAATACCTCAAAAACTTTGTTTTTTTAAACCAAATCAACCCCAGCCGCATAAAGGAAGTGACATTAGAAATTTGCGACATTACGCTTTCATTGTGCT
GCCTTCGTTCTCTCCCCGCGACATCTCTAAACAAGAAGAAGATGTAACATTAGAAATTTGCGATATTGCCCATTCATAGCGCTCTCTTGGATTTCGAAACTCTGCCAAAA
GTCCGAATTTCTGGGTAGTGCCAGTTTCTGAATCATCGACATGGAAGAGTTCCGGAAGGCCATATTGCAAACGGGGCCACCAGAAAGTTTTGCTCTGCAGACAGTTCAAG
ATGTTATCAGACCTCAGAAGCACACAAAATTGGCCCAAGATGAAAATCAGTTGCTGGAAAATATTCTACGACGACTACTTCAAGAATTAGTGTCATCTGCAGTTCAATCA
GCAGAGCCAATAATGCAGTATGGGATGTCTATTGATGACAAGGAAACTTCACAGGGTCATATTCCACGTCTTCTTGACATTGTTCTGTATCTTTGTGAGAAAGAGCACGT
CGAAGGAGGCATGATATTCCAGCTGTTGGAGGACCTGACTGAAATGTCAACATTGAGGAACTGCAAAGATATTTTTGGTTACATTGAGAGTAAACAAGATATATTGGGAA
AGCAAGAGCTTTTTGCTCGTGGAAAACTTGTCATGTTGAGAACGTGCAATCAATTGCTTCGCCGTCTATCAAAGGCAAGTGATGTGGTGTTTTGTGGACGCATTCTGATG
TTTCTAGCACATTTTTTCCCTTTATCTGAACGTTCGGCTGTGAATATAAAAGGAGTGTTCAACACCTCCAATGAAACAAAATATGAGAAGGAGCCCCCTGATGGTTTTTC
TATTGATTTCAACTTTTACAAGACCTTTTGGAGTTTACAGGAATACTTTTGTAATCCTGCCTCCCTTACTCTTGCTTCACCGAAGTGGCAAAAGTTCACATCCAGTTTAA
TGGTCGTGTTGAATACCTTTGATGCACAACCTTTGTCCGATGAAGAGGGAGATGCTAACATTTTGGAGGAAGAGTCAGCAACCTTCAGCATAAAATATCTTACTAGTAGT
AAACTGATGGGTTTAGAATTAAAGGATCCAAGTTTTCGACGCCACGTTCTTGTGCAGTGCCTTATATTGTTTGATTATCTAAAGGCCCCAGGAAAGAATGAGAAAGATGT
TCCATCTGAAACCATGAGAGAAGAAATTAAATCTTGTGAGGAGCGTGTGAAGAAGTTGCTAGAGGTGACACCGCCAAGAGGGAAAGAATTCCTTCAGAAGATTGAGCATA
TATTAGAACGTGAAAATAATTGGGTATGGTGGAAACGTGATGGTTGCCCTCCTTTTGAAAAACAACCAATTGAAAAGAGAAGCACCCATGATGGAACTAAAAAACGTAGG
CCAAGGTGGAGATTGGGGAATAAAGAACTCTCTCAATTGTGGAAGTGGGCTGATCAGAATCCGAATGCCTTGACTGATCCTCAACGTGTCCGTACTCCTGCAATTTCTGA
GTATTGGAAACCCTTGGCAGAAGACATGGATGAGTCTGCTGGAATTGAAGCTGAATATCATCATAAAAGTAACCGAGTTTATTGCTGGAAAGGCCTTCGGTTTTCAGCTC
GGCAGGACTTAGAAGGATTTTCTCGATTTACCGACCATGGCATTGAAGGAGTTGTGCCTTTGGAACTCTTGCCACCTGATGTACGAGCCAAATACCAAGCTAAACCAAAT
GAGAGATCTAAACGTGCTAGGAAGGAAGAAACAAAAGGGGCTGTGCGTCAAATTGAAGAAAATCAGATGGCAACACCTGCTAGTGAGAATGATGGTGAAGGAACCAGAAG
TGACCCTGATGGGCCATCAGTGGCAATGGATGCCGATACAGTTGTTGCCACTGGAAATGTATCTCAAGGTGGTACTCCAACTCCAGAAGAAAATAAGCAAAGCTCTGACA
CAGATATTGGTCAAGAGGCAGGCCAGTTGGAAGCTGATGCTGAGGTAGAGCCGGGTATGATTGACGGCGAGACAGATGCCGAGGTTGATTTGGATACTGCAGGATGATTT
GTGCGGCAGCCCTTCCATAATATCCAGCAAGGAAATATGATCAGTTTCCCTCTCCATAAAGTAGCTTGTTTTGCTGTTTCGGCTGCTACGAGACTGCTTATGCAATCAAG
TTGAGGATAATTAATGACCAGGAAGAGTTGCAGGTTCTTTTGTTATTTAGTCGAACAATATGAGAGGCTTTGAGGCCCTTTGTAGCAGGAACATAATTTATTACAGCTAA
AGTTTTCATAAAGTGATTCTTACATGTATTACTGCTGTATCATTTGTAGATATTACGTTTTACTCTTATTTATGTTAGTACATCACAACGAGATCAAGTATTTTTTTAAA
AAAATTTTACGTTGATGCAGTCAGATGAATCGAGTGATCCAAATGCTGCGACATAAATTAATATTTGATGGGG
Protein sequence
MEEFRKAILQTGPPESFALQTVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKETSQGHIPRLLDIVLYLCEKEHVEGGMIFQLLEDLT
EMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKEPPDGFSIDFNFYKTFWSLQ
EYFCNPASLTLASPKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLVQCLILFDYLKAPGKNEKDVPSETMREEIKSCE
ERVKKLLEVTPPRGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKRSTHDGTKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRVRTPAISEYWKPLAEDMDESA
GIEAEYHHKSNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRARKEETKGAVRQIEENQMATPASENDGEGTRSDPDGPSVAMDADT
VVATGNVSQGGTPTPEENKQSSDTDIGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG