; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G7735 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G7735
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionTHO complex subunit 1
Genome locationctg1546:523480..539402
RNA-Seq ExpressionCucsat.G7735
SyntenyCucsat.G7735
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0032784 - regulation of DNA-templated transcription, elongation (biological process)
GO:0000445 - THO complex part of transcription export complex (cellular component)
InterPro domainsIPR021861 - THO complex, subunit THOC1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN51016.1 hypothetical protein Csa_007827 [Cucumis sativus]1.21e-298100Show/hide
Query:  MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK
        MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK
Subjt:  MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK

Query:  EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN
        EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN
Subjt:  EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN

Query:  ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR
        ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR
Subjt:  ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR

Query:  HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR
        HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR
Subjt:  HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR

Query:  LGNKELSQLWKW
        LGNKELSQLWKW
Subjt:  LGNKELSQLWKW

XP_008460496.1 PREDICTED: THO complex subunit 1 [Cucumis melo]8.95e-29498.54Show/hide
Query:  MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK
        MDLSLYLEEFRKAILQ+GPPENFALQ VQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK
Subjt:  MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK

Query:  EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN
        EHVEGGMIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN
Subjt:  EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN

Query:  ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR
        ETKYEKQ PDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKF  SLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR
Subjt:  ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR

Query:  HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR
        HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR
Subjt:  HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR

Query:  LGNKELSQLWKW
        LGNKELSQLWKW
Subjt:  LGNKELSQLWKW

XP_011655214.1 THO complex subunit 1 [Cucumis sativus]5.98e-298100Show/hide
Query:  MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK
        MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK
Subjt:  MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK

Query:  EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN
        EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN
Subjt:  EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN

Query:  ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR
        ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR
Subjt:  ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR

Query:  HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR
        HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR
Subjt:  HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR

Query:  LGNKELSQLWKW
        LGNKELSQLWKW
Subjt:  LGNKELSQLWKW

XP_022139144.1 THO complex subunit 1 [Momordica charantia]2.06e-27994.09Show/hide
Query:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ  PPENFALQ VQ+VIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQS EP MQYGMSID+KETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  QPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        +PPDGFSIDFNFYKTFWSLQE+FCNPASL+LASTKWQKFTSSLM+VLNTFDAQPLSDEEGDANILEEE+A+FSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  QPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD+PSET+REEIKSCEERVKKLLEVTPP+GK+FLQKIEHIL+RENNWVWWKRDGC PFEKQP EKKT ND TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKEL

Query:  SQLWKW
        SQLWKW
Subjt:  SQLWKW

XP_038892203.1 THO complex subunit 1 isoform X1 [Benincasa hispida]3.96e-28496.06Show/hide
Query:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ G PENFALQ VQDVI+PQKHTKLAQDENQLLENILRRLLQELVSS VQSTEP+MQYG+SIDEKETSQGHIPRLLD+VLYLCEKEHVEGG
Subjt:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  QPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        +PPDGFSIDFNFYKTFWSLQE+FCNPASLALA TKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
Subjt:  QPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKDIPSETMREEIKSCEERVK LLEVTPPRGKDFLQKIEHIL+RENNWVWWKRDGC PFEKQPIEKKT NDVTKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKEL

Query:  SQLWKW
        SQLWKW
Subjt:  SQLWKW

TrEMBL top hitse value%identityAlignment
A0A0A0KTL0 Uncharacterized protein5.84e-299100Show/hide
Query:  MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK
        MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK
Subjt:  MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK

Query:  EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN
        EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN
Subjt:  EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN

Query:  ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR
        ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR
Subjt:  ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR

Query:  HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR
        HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR
Subjt:  HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR

Query:  LGNKELSQLWKW
        LGNKELSQLWKW
Subjt:  LGNKELSQLWKW

A0A1S3CC63 THO complex subunit 14.34e-29498.54Show/hide
Query:  MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK
        MDLSLYLEEFRKAILQ+GPPENFALQ VQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK
Subjt:  MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEK

Query:  EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN
        EHVEGGMIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN
Subjt:  EHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN

Query:  ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR
        ETKYEKQ PDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKF  SLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR
Subjt:  ETKYEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRR

Query:  HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR
        HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR
Subjt:  HVLMQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWR

Query:  LGNKELSQLWKW
        LGNKELSQLWKW
Subjt:  LGNKELSQLWKW

A0A6J1CC37 THO complex subunit 19.99e-28094.09Show/hide
Query:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ  PPENFALQ VQ+VIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQS EP MQYGMSID+KETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  QPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        +PPDGFSIDFNFYKTFWSLQE+FCNPASL+LASTKWQKFTSSLM+VLNTFDAQPLSDEEGDANILEEE+A+FSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  QPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD+PSET+REEIKSCEERVKKLLEVTPP+GK+FLQKIEHIL+RENNWVWWKRDGC PFEKQP EKKT ND TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKEL

Query:  SQLWKW
        SQLWKW
Subjt:  SQLWKW

A0A6J1G1S6 THO complex subunit 11.42e-27994.09Show/hide
Query:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ GPPENFALQ VQDVI+PQK TKLAQDENQLLENILRRLLQELVSSA QS EP+MQYGMSID+ ETSQGHIPRLLDIVLYLCEKEHVEGG
Subjt:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  QPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        +PPDGFSIDFNFYKTFWSLQE+FCNPASL LASTKWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  QPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD+PSET+REEIKSCEERVKKLLEVTPPRGK+FLQKIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +D TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKEL

Query:  SQLWKW
        SQLWKW
Subjt:  SQLWKW

A0A6J1HPE8 THO complex subunit 11.16e-27893.84Show/hide
Query:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG
        +EEFRKAILQ  PPENFALQ VQDVI+PQK TKLAQDENQLLENILRRLLQELVSSA QS EP+MQYGMSID+ ET+QGHIPRLLDIVLYLCEKEHVEGG
Subjt:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGG

Query:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
        MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK
Subjt:  MIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK

Query:  QPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC
        +PPDGFSIDFNFYKTFWSLQE+FCNPASL LASTKWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEESATFSIKYLTSSKLMGLELKDPSFRRHVL+QC
Subjt:  QPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQC

Query:  LILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKEL
        LILFDYLKAPGKNEKD+PSETMREEIKSCEERVKKLLEVTPPRGK+FLQKIEHIL+RENNWVWWKRDGC PFEKQPIEKKT +D TKKRRPRWRLGNKEL
Subjt:  LILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKEL

Query:  SQLWKW
        SQLWKW
Subjt:  SQLWKW

SwissProt top hitse value%identityAlignment
Q8R3N6 THO complex subunit 14.4e-4734Show/hide
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET------
        F LL D+ +   L  C  IF ++E           ++ GK  +LR CN LLRRLSK+ + VFCGRI +FLA  FPLSE+S +N++  FN  N T      
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET------

Query:  -------KYEKQPPDGFS------------------IDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-
               K+ +   +G                    ID+N Y+ FWSLQ++F NP         W+ F      VL  F +  L D +     +EE    
Subjt:  -------KYEKQPPDGFS------------------IDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-

Query:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLK--APGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWK
            +  K+LTS KLM L+L D +FRRH+L+Q LILF YLK     K+   + ++     I+   + V +LL   PP G+ F + +EHIL  E NW  WK
Subjt:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLK--APGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWK

Query:  RDGCAPFEKQPIEKKTINDVTKKR-----------RPRWRLGNKELSQLW
         +GC  F K+         V +KR             +  +GN+EL++LW
Subjt:  RDGCAPFEKQPIEKKTINDVTKKR-----------RPRWRLGNKELSQLW

Q93VM9 THO complex subunit 14.1e-19479.95Show/hide
Query:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV
        ++ FR AILQ  P E FAL+ VQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E +MQYG  ID+ +      G IP LLD+VLYLCEKEHV
Subjt:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
        EGGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
Subjt:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK

Query:  YEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK PP G S+DFNFYKTFWSLQE+FCNPASL  ASTKWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  MQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD+PSETM+EE+KSCE+RVKKLLE+TPP+GK+FL+ +EHIL+RE NWVWWKRDGC PFEKQPI+KK+ N   KKRR RWRLGN
Subjt:  MQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGN

Query:  KELSQLWKW
        KELSQLW+W
Subjt:  KELSQLWKW

Q96FV9 THO complex subunit 19.9e-4733.71Show/hide
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET------
        F LL D+ +   L  C  IF ++E           ++ GK  +LR CN LLRRLSK+ + VFCGRI +FLA  FPLSE+S +N++  FN  N T      
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET------

Query:  -------KYEKQPPDGFS------------------IDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-
               K+ +   +G                    ID+N Y+ FWSLQ++F NP         W+ F      VL  F +  L D +     +EE    
Subjt:  -------KYEKQPPDGFS------------------IDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESA-

Query:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLK--APGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWK
            +  K+LTS KLM L+L D +FRRH+L+Q LILF YLK     K+   + ++     I+   + V +LL   PP G+ F + +EHIL  E NW  WK
Subjt:  ---TFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLK--APGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWK

Query:  RDGCAPFEKQPIEKKTINDVTKKR-----------RPRWRLGNKELSQLW
         +GC  F K+         + +KR             +  +GN+EL++LW
Subjt:  RDGCAPFEKQPIEKKTINDVTKKR-----------RPRWRLGNKELSQLW

Q9URT2 Uncharacterized protein P25A2.035.6e-2629.26Show/hide
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGK-LVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNE-TKYEK
        F +LE+L ++ T+  C  ++ Y E++  ++ K  +  RG+  V+LR  N+LLRRLS+  +  FCGRI + L+  FP  ERS  N++G +NT +   K E 
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGK-LVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNE-TKYEK

Query:  QPPD---GFSIDFNFYK-------TFWSLQEFFCNPASLALASTKWQKFTSSL--------MVVLNTF---DAQPLSDEEGDANILEEESAT----FSIK
         PP        D +++K        +W LQ    NP  L LAS    KF  +          ++ NTF    + P  D    +++L E+  T    F  K
Subjt:  QPPD---GFSIDFNFYK-------TFWSLQEFFCNPASLALASTKWQKFTSSL--------MVVLNTF---DAQPLSDEEGDANILEEESAT----FSIK

Query:  YLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGK------------NEKDIPSETMREEIKSCEERVKK--LLEVTPPRGKDFLQKIEHILQRENNW
        Y+ S  L   +L D  FR   ++Q +I+FD+L    K            N+  IP   + +E  S    + K     +   R     + I+ I+  E NW
Subjt:  YLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGK------------NEKDIPSETMREEIKSCEERVKK--LLEVTPPRGKDFLQKIEHILQRENNW

Query:  VWWKRDGCAPFEKQPIEKKTINDVTKKRRP--------RWRLGNKELSQLWK
          WK  GC   EK  ++K  I++  +  +         R+ +GN  LS+LW+
Subjt:  VWWKRDGCAPFEKQPIEKKTINDVTKKRRP--------RWRLGNKELSQLWK

Arabidopsis top hitse value%identityAlignment
AT5G09860.1 nuclear matrix protein-related2.9e-19579.95Show/hide
Query:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV
        ++ FR AILQ  P E FAL+ VQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E +MQYG  ID+ +      G IP LLD+VLYLCEKEHV
Subjt:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
        EGGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
Subjt:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK

Query:  YEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK PP G S+DFNFYKTFWSLQE+FCNPASL  ASTKWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  MQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD+PSETM+EE+KSCE+RVKKLLE+TPP+GK+FL+ +EHIL+RE NWVWWKRDGC PFEKQPI+KK+ N   KKRR RWRLGN
Subjt:  MQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGN

Query:  KELSQLWKW
        KELSQLW+W
Subjt:  KELSQLWKW

AT5G09860.2 nuclear matrix protein-related9.4e-19479.95Show/hide
Query:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV
        ++ FR AILQ  P E FAL+ VQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E +MQYG  ID+ +      G IP LLD+VLYLCEKEHV
Subjt:  LEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETS---QGHIPRLLDIVLYLCEKEHV

Query:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
        EGGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK
Subjt:  EGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETK

Query:  YEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL
        YEK PP G S+DFNFYKTFWSLQE+FCNPASL  ASTKWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEE+ATF+IKYLTSSKLMGLELKD SFRRH+L
Subjt:  YEKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVL

Query:  MQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGN
        +QCLI+FDYL+APGKN+KD+PSETM EE+KSCE+RVKKLLE+TPP+GK+FL+ +EHIL+RE NWVWWKRDGC PFEKQPI+KK+ N   KKRR RWRLGN
Subjt:  MQCLILFDYLKAPGKNEKDIPSETMREEIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGN

Query:  KELSQLWKW
        KELSQLW+W
Subjt:  KELSQLWKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTATCACTTTACTTGGAAGAGTTCCGAAAGGCCATACTGCAAATGGGGCCACCAGAAAATTTTGCTCTGCAGATAGTTCAAGATGTTATTAGACCTCAGAAGCA
CACAAAATTGGCACAAGATGAAAATCAGTTGCTGGAAAATATTCTACGGCGACTACTTCAAGAATTAGTGTCATCTGCCGTTCAATCAACAGAGCCAGTAATGCAGTATG
GGATGTCTATTGATGAAAAGGAAACTTCACAGGGTCATATTCCACGTCTTCTTGACATTGTCCTATATCTTTGTGAGAAAGAGCATGTTGAAGGAGGCATGATATTCCAA
CTGTTGGAGGACCTGACTGAAATGTCAACATTGAGGAACTGCAAAGATATTTTTGGTTACATTGAGAGTAAACAAGACATATTGGGAAAGCAAGAGCTTTTTGCTCGCGG
AAAACTTGTCATGTTGAGAACGTGCAATCAATTGCTCCGTCGTCTATCAAAGGCAAGTGATGTGGTGTTTTGTGGACGCATTCTGATGTTTCTAGCACATTTTTTCCCTC
TATCTGAACGCTCCGCTGTGAATATAAAAGGAGTGTTTAACACCTCCAATGAAACAAAATATGAGAAACAACCCCCTGATGGCTTTTCTATTGATTTCAACTTTTACAAG
ACCTTTTGGAGTTTACAGGAATTCTTTTGTAATCCTGCCTCCCTTGCACTAGCTTCGACGAAGTGGCAAAAGTTCACATCTAGTTTAATGGTCGTATTGAATACCTTTGA
TGCACAACCTTTGTCTGATGAAGAGGGAGATGCTAACATCTTGGAGGAAGAGTCAGCAACCTTCAGCATAAAATATCTTACTAGTAGTAAACTGATGGGTTTAGAATTGA
AGGACCCAAGTTTTCGGCGTCACGTTCTTATGCAGTGCCTTATATTGTTTGATTATCTAAAGGCCCCAGGAAAAAATGAGAAAGATATTCCATCTGAAACCATGAGAGAG
GAAATTAAATCTTGTGAAGAGCGTGTGAAGAAGTTGCTTGAGGTGACACCGCCCAGAGGGAAAGACTTCCTTCAGAAGATTGAGCATATATTACAACGGGAAAATAATTG
GGTATGGTGGAAACGTGATGGTTGCGCTCCTTTTGAAAAGCAACCAATTGAAAAGAAAACCATCAATGATGTAACTAAAAAACGTAGGCCAAGATGGAGACTGGGGAATA
AAGAACTTTCTCAATTGTGGAAATGG
mRNA sequenceShow/hide mRNA sequence
ATGGATCTATCACTTTACTTGGAAGAGTTCCGAAAGGCCATACTGCAAATGGGGCCACCAGAAAATTTTGCTCTGCAGATAGTTCAAGATGTTATTAGACCTCAGAAGCA
CACAAAATTGGCACAAGATGAAAATCAGTTGCTGGAAAATATTCTACGGCGACTACTTCAAGAATTAGTGTCATCTGCCGTTCAATCAACAGAGCCAGTAATGCAGTATG
GGATGTCTATTGATGAAAAGGAAACTTCACAGGGTCATATTCCACGTCTTCTTGACATTGTCCTATATCTTTGTGAGAAAGAGCATGTTGAAGGAGGCATGATATTCCAA
CTGTTGGAGGACCTGACTGAAATGTCAACATTGAGGAACTGCAAAGATATTTTTGGTTACATTGAGAGTAAACAAGACATATTGGGAAAGCAAGAGCTTTTTGCTCGCGG
AAAACTTGTCATGTTGAGAACGTGCAATCAATTGCTCCGTCGTCTATCAAAGGCAAGTGATGTGGTGTTTTGTGGACGCATTCTGATGTTTCTAGCACATTTTTTCCCTC
TATCTGAACGCTCCGCTGTGAATATAAAAGGAGTGTTTAACACCTCCAATGAAACAAAATATGAGAAACAACCCCCTGATGGCTTTTCTATTGATTTCAACTTTTACAAG
ACCTTTTGGAGTTTACAGGAATTCTTTTGTAATCCTGCCTCCCTTGCACTAGCTTCGACGAAGTGGCAAAAGTTCACATCTAGTTTAATGGTCGTATTGAATACCTTTGA
TGCACAACCTTTGTCTGATGAAGAGGGAGATGCTAACATCTTGGAGGAAGAGTCAGCAACCTTCAGCATAAAATATCTTACTAGTAGTAAACTGATGGGTTTAGAATTGA
AGGACCCAAGTTTTCGGCGTCACGTTCTTATGCAGTGCCTTATATTGTTTGATTATCTAAAGGCCCCAGGAAAAAATGAGAAAGATATTCCATCTGAAACCATGAGAGAG
GAAATTAAATCTTGTGAAGAGCGTGTGAAGAAGTTGCTTGAGGTGACACCGCCCAGAGGGAAAGACTTCCTTCAGAAGATTGAGCATATATTACAACGGGAAAATAATTG
GGTATGGTGGAAACGTGATGGTTGCGCTCCTTTTGAAAAGCAACCAATTGAAAAGAAAACCATCAATGATGTAACTAAAAAACGTAGGCCAAGATGGAGACTGGGGAATA
AAGAACTTTCTCAATTGTGGAAATGG
Protein sequenceShow/hide protein sequence
MDLSLYLEEFRKAILQMGPPENFALQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQGHIPRLLDIVLYLCEKEHVEGGMIFQ
LLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKQPPDGFSIDFNFYK
TFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKAPGKNEKDIPSETMRE
EIKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRDGCAPFEKQPIEKKTINDVTKKRRPRWRLGNKELSQLWKW