; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012086 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012086
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTHO complex subunit 1
Genome locationtig00153209:43857..70025
RNA-Seq ExpressionSgr012086
SyntenySgr012086
Gene Ontology termsGO:0006406 - mRNA export from nucleus (biological process)
GO:0006506 - GPI anchor biosynthetic process (biological process)
GO:0032784 - regulation of DNA-templated transcription, elongation (biological process)
GO:0097502 - mannosylation (biological process)
GO:0000445 - THO complex part of transcription export complex (cellular component)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004376 - glycolipid mannosyltransferase activity (molecular function)
GO:0051751 - alpha-1,4-mannosyltransferase activity (molecular function)
InterPro domainsIPR021861 - THO complex, subunit THOC1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573787.1 THO complex subunit 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0094.22Show/hide
Query:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM
        EEFRKAILQTGPPE FAL+TVQ+VI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD +T+QGHIPRLLDIVLYLCEKEHVEGGM
Subjt:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM

Query:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
        IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
Subjt:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE

Query:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL
        PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEE+ATFSIKYLTSSKLMGLELKDPSFRRH+LVQCL
Subjt:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL

Query:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS
        ILFDYLKAPGKNEKD PSE +REEIKSCEERVKKLLE+TPP+GKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKK THD +KKRRPRWRLGNKELS
Subjt:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS

Query:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE
        QLWKWADQNPNALTDPQR RTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPNE
Subjt:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE

Query:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE
        RSKRAKKEETKG+VRQ EENQMATP SENDGEGTRSD DGPS AMD DTTVA G+VSQGGTPTP+ENKQSSDTD GQEAGQLEADAEVEPGMIDGETDAE
Subjt:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE

Query:  VDLDTA
        VDLDTA
Subjt:  VDLDTA

XP_022139144.1 THO complex subunit 1 [Momordica charantia]0.0e+0096.05Show/hide
Query:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM
        EEFRKAILQT PPE FAL+TVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEP MQYGMSIDDK+T+QGHIPRLLDIVLYLCEKEHVEGGM
Subjt:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM

Query:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
        IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
Subjt:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE

Query:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL
        PPDGFSIDFNFYKTFWSLQEYFCNPASL+LASTKWQKFTSSLM+VLNTFDAQPLSDEEGDANILEEEAA+FSIKYLTSSKLMGLELKDPSFRRH+LVQCL
Subjt:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL

Query:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS
        ILFDYLKAPGKNEKD PSE +REEIKSCEERVKKLLE+TPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQP EKK ++DG+KKRRPRWRLGNKELS
Subjt:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS

Query:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE
        QLWKWADQNPNALTDPQR RTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE
Subjt:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE

Query:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE
        RSKRAKKEETKGSVRQVEENQMATP SENDGEGTRSDLDGPSAAMDVDTTVA GNVSQGGTPTPDENKQSSDTD GQE+GQLEADAEVEPGMIDGETDAE
Subjt:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE

Query:  VDLDTAG
        VDLDTAG
Subjt:  VDLDTAG

XP_022945746.1 THO complex subunit 1 [Cucurbita moschata]0.0e+0094.23Show/hide
Query:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM
        EEFRKAILQTGPPE FAL+TVQ+VI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD +T+QGHIPRLLDIVLYLCEKEHVEGGM
Subjt:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM

Query:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
        IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
Subjt:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE

Query:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL
        PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEE+ATFSIKYLTSSKLMGLELKDPSFRRH+LVQCL
Subjt:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL

Query:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS
        ILFDYLKAPGKNEKD PSE +REEIKSCEERVKKLLE+TPP+GKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKK THD +KKRRPRWRLGNKELS
Subjt:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS

Query:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE
        QLWKWADQNPNALTDPQR RTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPNE
Subjt:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE

Query:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE
        RSKRAKKEETKG+VRQ EENQMATP SENDGEGTRSD DGPS AMD DTTVA G+VSQGGTPTP+ENKQSSDTD GQEAGQLEADAEVEPGMIDGETDAE
Subjt:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE

Query:  VDLDTAG
        VDLDTAG
Subjt:  VDLDTAG

XP_022966961.1 THO complex subunit 1 [Cucurbita maxima]0.0e+0094.23Show/hide
Query:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM
        EEFRKAILQT PPE FAL+TVQ+VI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD +TAQGHIPRLLDIVLYLCEKEHVEGGM
Subjt:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM

Query:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
        IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
Subjt:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE

Query:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL
        PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEE+ATFSIKYLTSSKLMGLELKDPSFRRH+LVQCL
Subjt:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL

Query:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS
        ILFDYLKAPGKNEKD PSE MREEIKSCEERVKKLLE+TPP+GKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKK THD +KKRRPRWRLGNKELS
Subjt:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS

Query:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE
        QLWKWADQNPNALTDPQR RTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPNE
Subjt:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE

Query:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE
        RSKRAKKEETKG+VRQ EENQMATP SENDGEGTRSD DGPS AMD DTTVA G+VSQGGTPTP+ENKQSSDTD GQEAGQLEADAEVEPGMIDGE DAE
Subjt:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE

Query:  VDLDTAG
        VDLDTAG
Subjt:  VDLDTAG

XP_023542342.1 THO complex subunit 1 [Cucurbita pepo subsp. pepo]0.0e+0094.07Show/hide
Query:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM
        EEFRKAILQTGPPE FAL+TVQ+VI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD +T+QGHIPRLLDIVLYLCEKEHVEGGM
Subjt:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM

Query:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
        IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
Subjt:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE

Query:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL
        PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEE+ATFSIKYLTSSKLMGLELKDPSFRRH+LVQCL
Subjt:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL

Query:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS
        ILFDYLKAPGKNEKD PSE +REEIKSCEERVKKLLE+TPP+GKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKK THD +KKRRPRWRLGNKELS
Subjt:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS

Query:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE
        QLWKWADQNPNALTDPQR RTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPNE
Subjt:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE

Query:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE
        RSKRAKKEETKG++RQ EENQMATP SENDGEGTRSD DGPS AMD DTTVA G+VSQGGTPTP+ENKQSSDTD GQEAGQLEADAEVEPGMIDGETDAE
Subjt:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE

Query:  VDLDTAG
        VDLDTAG
Subjt:  VDLDTAG

TrEMBL top hitse value%identityAlignment
A0A0A0KTL0 Uncharacterized protein0.0e+0091.83Show/hide
Query:  LNAWAEEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEH
        L+ + EEFRKAILQ GPPE FAL+ VQ+VIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQS EP+MQYGMSID+K+T+QGHIPRLLDIVLYLCEKEH
Subjt:  LNAWAEEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEH

Query:  VEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET
        VEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET
Subjt:  VEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET

Query:  KYEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHI
        KYEK+PPDGFSIDFNFYKTFWSLQE+FCNPASL LASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEE+ATFSIKYLTSSKLMGLELKDPSFRRH+
Subjt:  KYEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHI

Query:  LVQCLILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLG
        L+QCLILFDYLKAPGKNEKD PSE MREEIKSCEERVKKLLE+TPP+GK+FLQKIEHIL+RENNWVWWKRDGC PFEKQPIEKK  +D +KKRRPRWRLG
Subjt:  LVQCLILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLG

Query:  NKELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQ
        NKELSQLWKW+DQNPNALTDPQR R+PAIS+YWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQ
Subjt:  NKELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQ

Query:  AKPNERSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDG
        AKPNERSKRAKKEE KG+V+QV+ENQMATP SENDGEGTRSD DGPSA MDVDT +ATGNVSQGG  TP+ENK SSDTD GQEAGQLEADAEVEPGMIDG
Subjt:  AKPNERSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDG

Query:  ETDAEVDLDTAG
        ETDAEVDLDTAG
Subjt:  ETDAEVDLDTAG

A0A1S3CC63 THO complex subunit 10.0e+0091.5Show/hide
Query:  LNAWAEEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEH
        L+ + EEFRKAILQ GPPE FAL+TVQ+VIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQS EP+MQYGMSID+K+T+QGHIPRLLDIVLYLCEKEH
Subjt:  LNAWAEEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEH

Query:  VEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET
        VEGGMIFQLLEDLTEMSTL+NCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET
Subjt:  VEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET

Query:  KYEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHI
        KYEK+ PDGFSIDFNFYKTFWSLQE+FCNPASL LASTKWQKF  SLMVVLNTFDAQPLSDEEGDANILEEE+ATFSIKYLTSSKLMGLELKDPSFRRH+
Subjt:  KYEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHI

Query:  LVQCLILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLG
        L+QCLILFDYLKAPGKNEKD PSE MREEIKSCEERVKKLLE+TPP+GK+FLQKIEHIL+RENNWVWWKRDGC PFEKQPIEKK  +D +KKRRPRWRLG
Subjt:  LVQCLILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLG

Query:  NKELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQ
        NKELSQLWKWADQNPNALTDPQR R+PAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQ
Subjt:  NKELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQ

Query:  AKPNERSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDG
        AKPNERSKRAK+EE KG+V+QV+ENQMATP SENDGEGTRSD DGPSA MDVDT +ATGNVSQGG  TP+ENK SSDTD GQEAGQLEADAEVEPGMIDG
Subjt:  AKPNERSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDG

Query:  ETDAEVDLDTAG
        ETDAEVDLDTAG
Subjt:  ETDAEVDLDTAG

A0A6J1CC37 THO complex subunit 10.0e+0096.05Show/hide
Query:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM
        EEFRKAILQT PPE FAL+TVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEP MQYGMSIDDK+T+QGHIPRLLDIVLYLCEKEHVEGGM
Subjt:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM

Query:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
        IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
Subjt:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE

Query:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL
        PPDGFSIDFNFYKTFWSLQEYFCNPASL+LASTKWQKFTSSLM+VLNTFDAQPLSDEEGDANILEEEAA+FSIKYLTSSKLMGLELKDPSFRRH+LVQCL
Subjt:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL

Query:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS
        ILFDYLKAPGKNEKD PSE +REEIKSCEERVKKLLE+TPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQP EKK ++DG+KKRRPRWRLGNKELS
Subjt:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS

Query:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE
        QLWKWADQNPNALTDPQR RTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE
Subjt:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE

Query:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE
        RSKRAKKEETKGSVRQVEENQMATP SENDGEGTRSDLDGPSAAMDVDTTVA GNVSQGGTPTPDENKQSSDTD GQE+GQLEADAEVEPGMIDGETDAE
Subjt:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE

Query:  VDLDTAG
        VDLDTAG
Subjt:  VDLDTAG

A0A6J1G1S6 THO complex subunit 10.0e+0094.23Show/hide
Query:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM
        EEFRKAILQTGPPE FAL+TVQ+VI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD +T+QGHIPRLLDIVLYLCEKEHVEGGM
Subjt:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM

Query:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
        IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
Subjt:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE

Query:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL
        PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEE+ATFSIKYLTSSKLMGLELKDPSFRRH+LVQCL
Subjt:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL

Query:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS
        ILFDYLKAPGKNEKD PSE +REEIKSCEERVKKLLE+TPP+GKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKK THD +KKRRPRWRLGNKELS
Subjt:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS

Query:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE
        QLWKWADQNPNALTDPQR RTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPNE
Subjt:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE

Query:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE
        RSKRAKKEETKG+VRQ EENQMATP SENDGEGTRSD DGPS AMD DTTVA G+VSQGGTPTP+ENKQSSDTD GQEAGQLEADAEVEPGMIDGETDAE
Subjt:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE

Query:  VDLDTAG
        VDLDTAG
Subjt:  VDLDTAG

A0A6J1HPE8 THO complex subunit 10.0e+0094.23Show/hide
Query:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM
        EEFRKAILQT PPE FAL+TVQ+VI+PQK TKLAQDENQLLENILRRLLQELVSSA QSAEPIMQYGMSIDD +TAQGHIPRLLDIVLYLCEKEHVEGGM
Subjt:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGM

Query:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
        IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE
Subjt:  IFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKE

Query:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL
        PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSL VVLNTFDAQPLSDEEGDAN+LEEE+ATFSIKYLTSSKLMGLELKDPSFRRH+LVQCL
Subjt:  PPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL

Query:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS
        ILFDYLKAPGKNEKD PSE MREEIKSCEERVKKLLE+TPP+GKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKK THD +KKRRPRWRLGNKELS
Subjt:  ILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELS

Query:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE
        QLWKWADQNPNALTDPQR RTPAISEYWKPLAEDMDESAGIEAEYHH++NRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVR+KYQAKPNE
Subjt:  QLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNE

Query:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE
        RSKRAKKEETKG+VRQ EENQMATP SENDGEGTRSD DGPS AMD DTTVA G+VSQGGTPTP+ENKQSSDTD GQEAGQLEADAEVEPGMIDGE DAE
Subjt:  RSKRAKKEETKGSVRQVEENQMATP-SENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSDTDNGQEAGQLEADAEVEPGMIDGETDAE

Query:  VDLDTAG
        VDLDTAG
Subjt:  VDLDTAG

SwissProt top hitse value%identityAlignment
Q8R3N6 THO complex subunit 11.2e-5532.37Show/hide
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET------
        F LL D+ +   L  C  IF ++E           ++ GK  +LR CN LLRRLSK+ + VFCGRI +FLA  FPLSE+S +N++  FN  N T      
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNET------

Query:  -------KYEKEPPDGFS------------------IDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAA-
               K+ ++  +G                    ID+N Y+ FWSLQ+YF NP         W+ F      VL  F +  L D +     +EE    
Subjt:  -------KYEKEPPDGFS------------------IDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAA-

Query:  ---TFSIKYLTSSKLMGLELKDPSFRRHILVQCLILFDYLKAPGKNEKDFPSEAMREE----IKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVW
            +  K+LTS KLM L+L D +FRRHIL+Q LILF YLK   K +    +  + +E    I+   + V +LL   PP G+ F + +EHIL  E NW  
Subjt:  ---TFSIKYLTSSKLMGLELKDPSFRRHILVQCLILFDYLKAPGKNEKDFPSEAMREE----IKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVW

Query:  WKRDGCPPFEKQ---------PIEKKATHDGSKKRRPRWR--LGNKELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNR
        WK +GCP F K+          + K+A  +    + P  +  +GN+EL++LW     N  A     R   P + E+++   E  D    +E+EY   NN 
Subjt:  WKRDGCPPFEKQ---------PIEKKATHDGSKKRRPRWR--LGNKELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNR

Query:  VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEETKGSVRQVEENQMATPSENDGEGTRSD
         Y W+ LR  AR+    F             LE +   + AK    P+E  K  + E+        EE+  A   EN+    R D
Subjt:  VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEETKGSVRQVEENQMATPSENDGEGTRSD

Q93VM9 THO complex subunit 12.6e-26576.45Show/hide
Query:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSI---DDKDTAQGHIPRLLDIVLYLCEKEHVE
        + FR AILQ  P ETFAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  I   DD D   G IP LLD+VLYLCEKEHVE
Subjt:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSI---DDKDTAQGHIPRLLDIVLYLCEKEHVE

Query:  GGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY
        GGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY
Subjt:  GGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY

Query:  EKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILV
        EK+PP G S+DFNFYKTFWSLQEYFCNPASLT ASTKWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEEAATF+IKYLTSSKLMGLELKD SFRRHIL+
Subjt:  EKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILV

Query:  QCLILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNK
        QCLI+FDYL+APGKN+KD PSE M+EE+KSCE+RVKKLLE+TPPKGKEFL+ +EHILERE NWVWWKRDGCPPFEKQPI+KK+ + G KKRR RWRLGNK
Subjt:  QCLILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNK

Query:  ELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAK
        ELSQLW+WADQNPNALTD QR RTP I++YWKPLAEDMD SAGIE EYHH+NNRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQAK
Subjt:  ELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAK

Query:  PNERSKRAKKEETKGSVRQVEENQM-ATPSENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQ--SSDTDNGQEAGQLEADAEVEPGMIDG
        PNE++KRAKKEETKG   + E NQ+  + SE + EG R D +     M+ D            TPTP+E ++   SDT+NGQEAGQ+E     E G++D 
Subjt:  PNERSKRAKKEETKGSVRQVEENQM-ATPSENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQ--SSDTDNGQEAGQLEADAEVEPGMIDG

Query:  ETD
        + D
Subjt:  ETD

Q96FV9 THO complex subunit 13.4e-5532.78Show/hide
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY----
        F LL D+ +   L  C  IF ++E           ++ GK  +LR CN LLRRLSK+ + VFCGRI +FLA  FPLSE+S +N++  FN  N T +    
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY----

Query:  -------------------------EKEPPDGFS--IDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAA-
                                 ++E P   S  ID+N Y+ FWSLQ+YF NP         W+ F      VL  F +  L D +     +EE    
Subjt:  -------------------------EKEPPDGFS--IDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAA-

Query:  ---TFSIKYLTSSKLMGLELKDPSFRRHILVQCLILFDYLKAPGKNEKDFPSEAMREE----IKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVW
            +  K+LTS KLM L+L D +FRRHIL+Q LILF YLK   K +    +  + +E    I+   + V +LL   PP G+ F + +EHIL  E NW  
Subjt:  ---TFSIKYLTSSKLMGLELKDPSFRRHILVQCLILFDYLKAPGKNEKDFPSEAMREE----IKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVW

Query:  WKRDGCPPFEKQ-PIEKKATHDGSKKRRP----------RWRLGNKELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNR
        WK +GCP F K+   + K T    K+  P          +  +GN+EL++LW     N  A     R   P + E+++   E  D    +E EY   NN 
Subjt:  WKRDGCPPFEKQ-PIEKKATHDGSKKRRP----------RWRLGNKELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNR

Query:  VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEETKGSVRQVEENQMATPSENDGEGTRSD
         Y W+ LR  AR+    F             LE +   + AK    P+E  K  + E+        EE+  A   EN+    R D
Subjt:  VYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEETKGSVRQVEENQMATPSENDGEGTRSD

Q9URT2 Uncharacterized protein P25A2.039.9e-3126.98Show/hide
Query:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGK-LVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNE-TKYEK
        F +LE+L ++ T+  C  ++ Y E++  ++ K  +  RG+  V+LR  N+LLRRLS+  +  FCGRI + L+  FP  ERS  N++G +NT +   K E 
Subjt:  FQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGK-LVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNE-TKYEK

Query:  EPPD---GFSIDFNFYK-------TFWSLQEYFCNPASLTLASTKWQKFTSSL--------MVVLNTF---DAQPLSDEEGDANILEEEAAT----FSIK
         PP        D +++K        +W LQ    NP  L LAS    KF  +          ++ NTF    + P  D    +++L E+  T    F  K
Subjt:  EPPD---GFSIDFNFYK-------TFWSLQEYFCNPASLTLASTKWQKFTSSL--------MVVLNTF---DAQPLSDEEGDANILEEEAAT----FSIK

Query:  YLTSSKLMGLELKDPSFRRHILVQCLILFDYLKAPGKNE------KDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFL---------QKIEHILERENN
        Y+ S  L   +L D  FR   ++Q +I+FD+L    K        + + ++A+   +   +E   KL E++  +   FL         + I+ I+  E N
Subjt:  YLTSSKLMGLELKDPSFRRHILVQCLILFDYLKAPGKNE------KDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFL---------QKIEHILERENN

Query:  WVWWKRDGCPPFEKQPIEKKATH---DGSKKR-----RPRWRLGNKELSQLWKWADQNP-NALTDPQRARTPAISEYWKPLAEDMDESAGI----EAEYH
        W  WK  GCP  EK  ++K A     +G KK      + R+ +GN  LS+LW+ A +N  + L   +R R P+   +   +  D  E        +  +H
Subjt:  WVWWKRDGCPPFEKQPIEKKATH---DGSKKR-----RPRWRLGNKELSQLWKWADQNP-NALTDPQRARTPAISEYWKPLAEDMDESAGI----EAEYH

Query:  HRNNRVYCWKGLRFSARQDLEGFS-------RFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEETKGSVRQVEENQMATPSENDGEG
         ++     W+  R +    L+ FS           + IEG      + P +   +     E  +  ++ + + +V+   +N  A+P + D EG
Subjt:  HRNNRVYCWKGLRFSARQDLEGFS-------RFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEETKGSVRQVEENQMATPSENDGEG

Arabidopsis top hitse value%identityAlignment
AT5G09860.1 nuclear matrix protein-related1.8e-26676.45Show/hide
Query:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSI---DDKDTAQGHIPRLLDIVLYLCEKEHVE
        + FR AILQ  P ETFAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  I   DD D   G IP LLD+VLYLCEKEHVE
Subjt:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSI---DDKDTAQGHIPRLLDIVLYLCEKEHVE

Query:  GGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY
        GGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY
Subjt:  GGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY

Query:  EKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILV
        EK+PP G S+DFNFYKTFWSLQEYFCNPASLT ASTKWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEEAATF+IKYLTSSKLMGLELKD SFRRHIL+
Subjt:  EKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILV

Query:  QCLILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNK
        QCLI+FDYL+APGKN+KD PSE M+EE+KSCE+RVKKLLE+TPPKGKEFL+ +EHILERE NWVWWKRDGCPPFEKQPI+KK+ + G KKRR RWRLGNK
Subjt:  QCLILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNK

Query:  ELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAK
        ELSQLW+WADQNPNALTD QR RTP I++YWKPLAEDMD SAGIE EYHH+NNRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQAK
Subjt:  ELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAK

Query:  PNERSKRAKKEETKGSVRQVEENQM-ATPSENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQ--SSDTDNGQEAGQLEADAEVEPGMIDG
        PNE++KRAKKEETKG   + E NQ+  + SE + EG R D +     M+ D            TPTP+E ++   SDT+NGQEAGQ+E     E G++D 
Subjt:  PNERSKRAKKEETKGSVRQVEENQM-ATPSENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQ--SSDTDNGQEAGQLEADAEVEPGMIDG

Query:  ETD
        + D
Subjt:  ETD

AT5G09860.2 nuclear matrix protein-related7.7e-26576.45Show/hide
Query:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSI---DDKDTAQGHIPRLLDIVLYLCEKEHVE
        + FR AILQ  P ETFAL+TVQ  I+PQK TKLAQDENQ+LEN+LR LLQELV++A QS E IMQYG  I   DD D   G IP LLD+VLYLCEKEHVE
Subjt:  EEFRKAILQTGPPETFALRTVQEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSI---DDKDTAQGHIPRLLDIVLYLCEKEHVE

Query:  GGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY
        GGMIFQLLEDLTEMST++NCKD+FGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKA+DVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY
Subjt:  GGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY

Query:  EKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILV
        EK+PP G S+DFNFYKTFWSLQEYFCNPASLT ASTKWQKF+SSL VVLNTFDAQPLS+EEG+AN LEEEAATF+IKYLTSSKLMGLELKD SFRRHIL+
Subjt:  EKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSSLMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILV

Query:  QCLILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNK
        QCLI+FDYL+APGKN+KD PSE M EE+KSCE+RVKKLLE+TPPKGKEFL+ +EHILERE NWVWWKRDGCPPFEKQPI+KK+ + G KKRR RWRLGNK
Subjt:  QCLILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIEHILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNK

Query:  ELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAK
        ELSQLW+WADQNPNALTD QR RTP I++YWKPLAEDMD SAGIE EYHH+NNRVYCWKGLRF+ARQDLEGFSRFT+ GIEGVVP+ELLPP+VR+KYQAK
Subjt:  ELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAK

Query:  PNERSKRAKKEETKGSVRQVEENQM-ATPSENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQ--SSDTDNGQEAGQLEADAEVEPGMIDG
        PNE++KRAKKEETKG   + E NQ+  + SE + EG R D +     M+ D            TPTP+E ++   SDT+NGQEAGQ+E     E G++D 
Subjt:  PNERSKRAKKEETKGSVRQVEENQM-ATPSENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQ--SSDTDNGQEAGQLEADAEVEPGMIDG

Query:  ETD
        + D
Subjt:  ETD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCAGCCAATAAACCCAGACGATTTATTTCCGAAGCGAGGGCAGGTTTAGCTACAATCAGGGAGTTATACATTGATGATCCTTTTTCCCATACCATGCTCTCTAG
TAGCCGTTCTTCCAAGGAATTAAGATGTCGCCTCAATTCTCTAGGAAGTGATTCTTCATGTCTAATGAAACTCTCTATCACCTTACTAAGGTCAAAAGCTGTAATGCCAG
TTCTCTCCAATACTGCAGGAAACAACATTGTAACAGTTTTATGTGTTGCCAAAACCAATTCGGCAGAACAAGCCAGCATACATCTATGGAACCTCTCATTAGTCAGCAAA
GAGATGTTATCCATGAGGTTTGCACTCTGTAAATTTCCAGCTATACAGCGCTCTCCTAGAGCACTATTAGGAAATATAGCCTCTAATATTATATGTGCCCTGCGAACTAC
ATCATTAGTTACATCTCTATCACATGAAGCCAAAAAGCGCTCCATCTCTGCCGAAGGTTTTGCTGGAAGTGGAGAAATAACAGTTCTAAGCCACTTAGCCGTTGTCATGG
CTGTGCTAACAGGAGAGCGAAGTGGAGACATTGGACTTGTTATTGTCCTTGCAGGCGGTGATGATTTTGAAAATGGGGTTGTTTTGTTCGAACGATTCACCGGCAGCGCG
GCCAAGCATGAGGCACATGGTGCGGTTTATGGATGGCAAATCAGTGTTCCCGACAATATTATGGCCCCCAAAGTGGTGGAAGGAGGGAAGAATCACGGCGAAGGAATACA
TCAGCCTCAATTTGTCTTTGTGATTCTTGTTGATTACAGAACTGGAGCGTTTGTTTCAACTGATGATTTCGAGTGGAAGTTTTTTTCTGACGAAGTTCAAGATTCATTTT
CCTTTCAAATTGAGCAGAGTTTGAAGAGGAAATTGAATGCCTGGGCTGAAGAGTTCCGGAAGGCCATACTACAAACGGGGCCACCGGAAACTTTTGCTCTGCGGACAGTT
CAAGAAGTTATCAGACCTCAGAAGCACACGAAATTGGCACAAGATGAGAATCAGTTGCTGGAAAATATTCTACGTCGACTACTTCAAGAATTGGTGTCATCTGCAGTTCA
ATCAGCGGAGCCAATAATGCAGTATGGGATGTCCATTGATGACAAGGATACTGCACAGGGTCATATTCCGCGTCTTCTCGACATTGTCCTATATCTTTGTGAGAAAGAGC
ATGTTGAAGGAGGCATGATATTCCAGCTGTTGGAGGACCTGACTGAAATGTCAACATTGAGGAACTGCAAAGATATTTTTGGTTACATAGAGAGTAAACAAGATATATTG
GGAAAGCAAGAGCTTTTTGCACGTGGAAAACTTGTCATGCTGAGAACGTGCAATCAATTGCTCCGTCGTCTATCAAAGGCAAGTGATGTGGTGTTTTGTGGACGCATTCT
GATGTTTCTGGCACATTTTTTCCCTTTATCTGAACGTTCTGCTGTGAATATAAAAGGAGTGTTTAACACCTCCAATGAAACAAAATATGAGAAAGAACCCCCTGATGGCT
TTTCTATCGATTTCAACTTTTACAAGACCTTTTGGAGTTTACAGGAATACTTCTGTAATCCTGCCTCCCTTACTCTTGCTTCAACGAAGTGGCAAAAGTTCACATCCAGT
CTAATGGTTGTGTTGAATACCTTTGATGCACAACCTTTGTCTGATGAAGAGGGAGATGCTAACATCTTGGAGGAAGAGGCGGCAACCTTCAGCATAAAATATCTTACTAG
TAGTAAACTGATGGGTTTAGAATTAAAGGATCCAAGTTTTCGACGCCACATTCTTGTGCAGTGCCTTATATTGTTTGATTATCTGAAGGCCCCAGGAAAGAATGAGAAAG
ACTTTCCATCTGAAGCCATGAGAGAAGAAATTAAATCTTGTGAGGAGCGTGTGAAGAAGTTGCTAGAGATGACACCGCCTAAAGGGAAAGAATTCCTCCAGAAAATTGAG
CATATATTAGAACGTGAAAATAATTGGGTATGGTGGAAACGTGATGGTTGCCCTCCTTTTGAAAAGCAACCAATTGAAAAGAAAGCCACCCATGATGGAAGTAAAAAACG
TAGGCCAAGGTGGAGACTGGGGAATAAAGAACTTTCTCAATTGTGGAAGTGGGCAGATCAGAATCCGAATGCCTTGACTGATCCTCAACGTGCCCGTACACCTGCAATTT
CGGAGTATTGGAAACCCTTGGCAGAAGATATGGACGAGTCAGCTGGAATTGAAGCTGAATATCATCATAGAAATAACCGAGTTTATTGCTGGAAAGGTCTCCGGTTCTCT
GCTCGACAGGACTTGGAAGGATTTTCTAGATTCACTGACCATGGCATTGAAGGAGTTGTGCCTTTGGAACTTCTGCCACCAGATGTACGAGCCAAATACCAAGCTAAACC
AAATGAGAGATCTAAACGTGCTAAGAAGGAAGAAACAAAAGGGTCTGTGCGTCAAGTTGAAGAAAATCAGATGGCAACACCAAGTGAGAATGATGGTGAAGGAACCAGAA
GCGACCTTGATGGTCCATCAGCAGCAATGGATGTTGATACAACTGTTGCTACTGGCAATGTATCTCAAGGTGGTACTCCAACTCCAGATGAAAATAAACAAAGCTCTGAC
ACAGATAACGGTCAAGAGGCAGGACAGTTGGAAGCTGATGCTGAGGTGGAGCCAGGCATGATTGATGGCGAGACAGATGCAGAGGTTGATTTGGACACTGCTGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCAGCCAATAAACCCAGACGATTTATTTCCGAAGCGAGGGCAGGTTTAGCTACAATCAGGGAGTTATACATTGATGATCCTTTTTCCCATACCATGCTCTCTAG
TAGCCGTTCTTCCAAGGAATTAAGATGTCGCCTCAATTCTCTAGGAAGTGATTCTTCATGTCTAATGAAACTCTCTATCACCTTACTAAGGTCAAAAGCTGTAATGCCAG
TTCTCTCCAATACTGCAGGAAACAACATTGTAACAGTTTTATGTGTTGCCAAAACCAATTCGGCAGAACAAGCCAGCATACATCTATGGAACCTCTCATTAGTCAGCAAA
GAGATGTTATCCATGAGGTTTGCACTCTGTAAATTTCCAGCTATACAGCGCTCTCCTAGAGCACTATTAGGAAATATAGCCTCTAATATTATATGTGCCCTGCGAACTAC
ATCATTAGTTACATCTCTATCACATGAAGCCAAAAAGCGCTCCATCTCTGCCGAAGGTTTTGCTGGAAGTGGAGAAATAACAGTTCTAAGCCACTTAGCCGTTGTCATGG
CTGTGCTAACAGGAGAGCGAAGTGGAGACATTGGACTTGTTATTGTCCTTGCAGGCGGTGATGATTTTGAAAATGGGGTTGTTTTGTTCGAACGATTCACCGGCAGCGCG
GCCAAGCATGAGGCACATGGTGCGGTTTATGGATGGCAAATCAGTGTTCCCGACAATATTATGGCCCCCAAAGTGGTGGAAGGAGGGAAGAATCACGGCGAAGGAATACA
TCAGCCTCAATTTGTCTTTGTGATTCTTGTTGATTACAGAACTGGAGCGTTTGTTTCAACTGATGATTTCGAGTGGAAGTTTTTTTCTGACGAAGTTCAAGATTCATTTT
CCTTTCAAATTGAGCAGAGTTTGAAGAGGAAATTGAATGCCTGGGCTGAAGAGTTCCGGAAGGCCATACTACAAACGGGGCCACCGGAAACTTTTGCTCTGCGGACAGTT
CAAGAAGTTATCAGACCTCAGAAGCACACGAAATTGGCACAAGATGAGAATCAGTTGCTGGAAAATATTCTACGTCGACTACTTCAAGAATTGGTGTCATCTGCAGTTCA
ATCAGCGGAGCCAATAATGCAGTATGGGATGTCCATTGATGACAAGGATACTGCACAGGGTCATATTCCGCGTCTTCTCGACATTGTCCTATATCTTTGTGAGAAAGAGC
ATGTTGAAGGAGGCATGATATTCCAGCTGTTGGAGGACCTGACTGAAATGTCAACATTGAGGAACTGCAAAGATATTTTTGGTTACATAGAGAGTAAACAAGATATATTG
GGAAAGCAAGAGCTTTTTGCACGTGGAAAACTTGTCATGCTGAGAACGTGCAATCAATTGCTCCGTCGTCTATCAAAGGCAAGTGATGTGGTGTTTTGTGGACGCATTCT
GATGTTTCTGGCACATTTTTTCCCTTTATCTGAACGTTCTGCTGTGAATATAAAAGGAGTGTTTAACACCTCCAATGAAACAAAATATGAGAAAGAACCCCCTGATGGCT
TTTCTATCGATTTCAACTTTTACAAGACCTTTTGGAGTTTACAGGAATACTTCTGTAATCCTGCCTCCCTTACTCTTGCTTCAACGAAGTGGCAAAAGTTCACATCCAGT
CTAATGGTTGTGTTGAATACCTTTGATGCACAACCTTTGTCTGATGAAGAGGGAGATGCTAACATCTTGGAGGAAGAGGCGGCAACCTTCAGCATAAAATATCTTACTAG
TAGTAAACTGATGGGTTTAGAATTAAAGGATCCAAGTTTTCGACGCCACATTCTTGTGCAGTGCCTTATATTGTTTGATTATCTGAAGGCCCCAGGAAAGAATGAGAAAG
ACTTTCCATCTGAAGCCATGAGAGAAGAAATTAAATCTTGTGAGGAGCGTGTGAAGAAGTTGCTAGAGATGACACCGCCTAAAGGGAAAGAATTCCTCCAGAAAATTGAG
CATATATTAGAACGTGAAAATAATTGGGTATGGTGGAAACGTGATGGTTGCCCTCCTTTTGAAAAGCAACCAATTGAAAAGAAAGCCACCCATGATGGAAGTAAAAAACG
TAGGCCAAGGTGGAGACTGGGGAATAAAGAACTTTCTCAATTGTGGAAGTGGGCAGATCAGAATCCGAATGCCTTGACTGATCCTCAACGTGCCCGTACACCTGCAATTT
CGGAGTATTGGAAACCCTTGGCAGAAGATATGGACGAGTCAGCTGGAATTGAAGCTGAATATCATCATAGAAATAACCGAGTTTATTGCTGGAAAGGTCTCCGGTTCTCT
GCTCGACAGGACTTGGAAGGATTTTCTAGATTCACTGACCATGGCATTGAAGGAGTTGTGCCTTTGGAACTTCTGCCACCAGATGTACGAGCCAAATACCAAGCTAAACC
AAATGAGAGATCTAAACGTGCTAAGAAGGAAGAAACAAAAGGGTCTGTGCGTCAAGTTGAAGAAAATCAGATGGCAACACCAAGTGAGAATGATGGTGAAGGAACCAGAA
GCGACCTTGATGGTCCATCAGCAGCAATGGATGTTGATACAACTGTTGCTACTGGCAATGTATCTCAAGGTGGTACTCCAACTCCAGATGAAAATAAACAAAGCTCTGAC
ACAGATAACGGTCAAGAGGCAGGACAGTTGGAAGCTGATGCTGAGGTGGAGCCAGGCATGATTGATGGCGAGACAGATGCAGAGGTTGATTTGGACACTGCTGGTTGA
Protein sequenceShow/hide protein sequence
MGSANKPRRFISEARAGLATIRELYIDDPFSHTMLSSSRSSKELRCRLNSLGSDSSCLMKLSITLLRSKAVMPVLSNTAGNNIVTVLCVAKTNSAEQASIHLWNLSLVSK
EMLSMRFALCKFPAIQRSPRALLGNIASNIICALRTTSLVTSLSHEAKKRSISAEGFAGSGEITVLSHLAVVMAVLTGERSGDIGLVIVLAGGDDFENGVVLFERFTGSA
AKHEAHGAVYGWQISVPDNIMAPKVVEGGKNHGEGIHQPQFVFVILVDYRTGAFVSTDDFEWKFFSDEVQDSFSFQIEQSLKRKLNAWAEEFRKAILQTGPPETFALRTV
QEVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSAEPIMQYGMSIDDKDTAQGHIPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDIL
GKQELFARGKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEKEPPDGFSIDFNFYKTFWSLQEYFCNPASLTLASTKWQKFTSS
LMVVLNTFDAQPLSDEEGDANILEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCLILFDYLKAPGKNEKDFPSEAMREEIKSCEERVKKLLEMTPPKGKEFLQKIE
HILERENNWVWWKRDGCPPFEKQPIEKKATHDGSKKRRPRWRLGNKELSQLWKWADQNPNALTDPQRARTPAISEYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFS
ARQDLEGFSRFTDHGIEGVVPLELLPPDVRAKYQAKPNERSKRAKKEETKGSVRQVEENQMATPSENDGEGTRSDLDGPSAAMDVDTTVATGNVSQGGTPTPDENKQSSD
TDNGQEAGQLEADAEVEPGMIDGETDAEVDLDTAG