; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0025467 (gene) of Chayote v1 genome

Gene IDSed0025467
OrganismSechium edule (Chayote v1)
DescriptionDNA mismatch repair protein MLH1
Genome locationLG07:43612607..43626869
RNA-Seq ExpressionSed0025467
SyntenySed0025467
Gene Ontology termsGO:0006298 - mismatch repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032300 - mismatch repair complex (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
InterPro domainsIPR002099 - DNA mismatch repair protein family, N-terminal
IPR013507 - DNA mismatch repair protein, S5 domain 2-like
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR014762 - DNA mismatch repair, conserved site
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR032189 - DNA mismatch repair protein Mlh1, C-terminal
IPR036890 - Histidine kinase/HSP90-like ATPase superfamily
IPR038973 - DNA mismatch repair protein MutL/Mlh/Pms


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022932102.1 DNA mismatch repair protein MLH1 isoform X3 [Cucurbita moschata]0.0e+0088.33Show/hide
Query:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH
        ME HADDE         EEV  +EPP+I+RL  SVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNV VKDGGLKLIQVSDDGHGIRYEDLPILCERH
Subjt:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH

Query:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL
        TTSKLSKFEDLQSI SMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQN SDDY+KIVDLL
Subjt:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL

Query:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE
        SRFAIHHINI FSCRKHGAARADVHSVGSTSRLDAI TVYGAS++RNLMKIEVS++ +ACS FKMDGLISNSNY AKKITMVLFIN RMV+CSALKRAIE
Subjt:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE

Query:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVP
        IVYAATLPKASKPFIYMSI+LPPEHVDVNVHPTKKEVSLLNQEVIIE+IQS VESKLRSSND RTFQEQDVESSAAS+MVI ++  QNSSQSGSKSQKVP
Subjt:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVP

Query:  MHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANV
        + KMVRTDSTDPAGRLHAYVQM PPGLPESSL  VRSFVR RRNP+EAANLTS+QDL +EID+NCHAGLLN VRHC+YIGM DDVFALLQHDTHLYLANV
Subjt:  MHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANV

Query:  VNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDM
        VNLSKELMYQQVL RFAHFNA+QLSNPAPLYELISLALREED NSESNENDDFN+KV+E STKLLKLKAEMLEEFFCI+ID +GNLARLPVVLDQYTPDM
Subjt:  VNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDM

Query:  DRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEE-VAMENEIDHGLLSEAETVWAQREWSIHHV
        DRVPEFVLSLANDIDWE+EK+C+Q+ISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNP+DENIG+++ + MENEI+HGLLSEAET+WAQREWSI HV
Subjt:  DRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEE-VAMENEIDHGLLSEAETVWAQREWSIHHV

Query:  LFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
        L PS+KLFFKP HSLA+NG+FIRVASLE+LYKIFERC
Subjt:  LFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

XP_022972873.1 DNA mismatch repair protein MLH1 isoform X1 [Cucurbita maxima]0.0e+0088.5Show/hide
Query:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH
        ME HADDE         EEV  +EPP+I+RL  SVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNV VKDGGLKLIQVSDDGHGIRYEDLPILCERH
Subjt:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH

Query:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL
        TTSKLSKFEDLQSI SMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQN SDDY+KIVDLL
Subjt:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL

Query:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE
        SRFAIHHINI FSCRKHGAARADVHSVGSTSRLDAI TVYGAS++RNLMKIEVS++ +ACS FKMDGLISNSNYIAKKITMVLFIN RMV+CSALKRAIE
Subjt:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE

Query:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQS---GSKSQ
        IVYAATLPKASKPFIYMSI+LPPEHVDVNVHPTKKEVSLLNQEVIIE+IQS VESKLRSSND RTFQEQDVESSAAS+MVIS++  QNSSQS   GSKSQ
Subjt:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQS---GSKSQ

Query:  KVPMHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYL
        K+P+ KMVRTDSTDPAGRLHAYVQMKPPGLPESSL  VRSFVR RRNPKEAANLTS+QDL +EID+NCHAGLLN VRHC+YIGM DDVFALLQHDTHLYL
Subjt:  KVPMHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYL

Query:  ANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYT
        A+VV+LSKELMYQQVL RFAHFNA+QLSNPAPLYELISLALREED NSESNENDDFN+KV+E STKLLKLKAEMLEEFFCI+ID +GNLARLPVVLDQYT
Subjt:  ANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYT

Query:  PDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIH
        PDMDRVPEFVLSLANDIDWE+EK+C+Q+ISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNP+DENIG + + MENEIDHGLLSEAET+WAQREWSI 
Subjt:  PDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIH

Query:  HVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
        HVL PS+KLFFKP HSLA+NG+FIRVASLEKLYKIFERC
Subjt:  HVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

XP_022972874.1 DNA mismatch repair protein MLH1 isoform X2 [Cucurbita maxima]0.0e+0088.86Show/hide
Query:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH
        ME HADDE         EEV  +EPP+I+RL  SVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNV VKDGGLKLIQVSDDGHGIRYEDLPILCERH
Subjt:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH

Query:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL
        TTSKLSKFEDLQSI SMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQN SDDY+KIVDLL
Subjt:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL

Query:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE
        SRFAIHHINI FSCRKHGAARADVHSVGSTSRLDAI TVYGAS++RNLMKIEVS++ +ACS FKMDGLISNSNYIAKKITMVLFIN RMV+CSALKRAIE
Subjt:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE

Query:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVP
        IVYAATLPKASKPFIYMSI+LPPEHVDVNVHPTKKEVSLLNQEVIIE+IQS VESKLRSSND RTFQEQDVESSAAS+MVIS++  QNSSQSGSKSQK+P
Subjt:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVP

Query:  MHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANV
        + KMVRTDSTDPAGRLHAYVQMKPPGLPESSL  VRSFVR RRNPKEAANLTS+QDL +EID+NCHAGLLN VRHC+YIGM DDVFALLQHDTHLYLA+V
Subjt:  MHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANV

Query:  VNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDM
        V+LSKELMYQQVL RFAHFNA+QLSNPAPLYELISLALREED NSESNENDDFN+KV+E STKLLKLKAEMLEEFFCI+ID +GNLARLPVVLDQYTPDM
Subjt:  VNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDM

Query:  DRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVL
        DRVPEFVLSLANDIDWE+EK+C+Q+ISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNP+DENIG + + MENEIDHGLLSEAET+WAQREWSI HVL
Subjt:  DRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVL

Query:  FPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
         PS+KLFFKP HSLA+NG+FIRVASLEKLYKIFERC
Subjt:  FPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

XP_023524450.1 DNA mismatch repair protein MLH1 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0088.09Show/hide
Query:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH
        ME HADDE         EEV  +EPP+I+RL  SVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNV VKDGGLKLIQVSDDGHGIRYEDLPILCERH
Subjt:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH

Query:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL
        TTSKLSKFEDLQSI SMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQN SDDY+KIVDLL
Subjt:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL

Query:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE
        SRF+IHHINI FSCRKHGAARADVHSVGSTSRLDAI TVYGAS++RNLMKIEVS++ +ACS FKMDGLISNSNYIAKKITMVLFIN RMV+CSALKRAIE
Subjt:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE

Query:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQS---GSKSQ
        IVYAATLPKASKPFIYMSI+LPPEHVDVNVHPTKKEVSLLNQEVIIE+IQS VESKLRSSND RTFQEQDVESSAAS+MVIS++  QNSSQS   G+KSQ
Subjt:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQS---GSKSQ

Query:  KVPMHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYL
        KVP+ KMVRTDSTDPAGRLHAY+QMKPPGLPESSL  VRSFVR RRNPKEAANLTS+QDL +EID+NCHAGLLN VRHC+YIGM DDVFALLQHDTHLYL
Subjt:  KVPMHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYL

Query:  ANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYT
        A+VV+LSKELMYQQVL RFAHF+A+QLSNPAPLYELISLALREED NSESNENDDFN+KV+E STKLLKLKAEMLEEFFCI+ID +GNLARLPVVLDQYT
Subjt:  ANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYT

Query:  PDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIH
        PDMDRVPEFVLSLANDIDWE+EK+C+Q+ISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNP+DENIG + + MENEIDHGLLSEAET+WAQREWSI 
Subjt:  PDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIH

Query:  HVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
        HVL PS+KLFFKP HSLA+NG+FIRVASLEKLYKIFERC
Subjt:  HVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

XP_023524451.1 DNA mismatch repair protein MLH1 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0088.45Show/hide
Query:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH
        ME HADDE         EEV  +EPP+I+RL  SVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNV VKDGGLKLIQVSDDGHGIRYEDLPILCERH
Subjt:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH

Query:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL
        TTSKLSKFEDLQSI SMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQN SDDY+KIVDLL
Subjt:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL

Query:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE
        SRF+IHHINI FSCRKHGAARADVHSVGSTSRLDAI TVYGAS++RNLMKIEVS++ +ACS FKMDGLISNSNYIAKKITMVLFIN RMV+CSALKRAIE
Subjt:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE

Query:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVP
        IVYAATLPKASKPFIYMSI+LPPEHVDVNVHPTKKEVSLLNQEVIIE+IQS VESKLRSSND RTFQEQDVESSAAS+MVIS++  QNSSQSG+KSQKVP
Subjt:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVP

Query:  MHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANV
        + KMVRTDSTDPAGRLHAY+QMKPPGLPESSL  VRSFVR RRNPKEAANLTS+QDL +EID+NCHAGLLN VRHC+YIGM DDVFALLQHDTHLYLA+V
Subjt:  MHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANV

Query:  VNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDM
        V+LSKELMYQQVL RFAHF+A+QLSNPAPLYELISLALREED NSESNENDDFN+KV+E STKLLKLKAEMLEEFFCI+ID +GNLARLPVVLDQYTPDM
Subjt:  VNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDM

Query:  DRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVL
        DRVPEFVLSLANDIDWE+EK+C+Q+ISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNP+DENIG + + MENEIDHGLLSEAET+WAQREWSI HVL
Subjt:  DRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVL

Query:  FPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
         PS+KLFFKP HSLA+NG+FIRVASLEKLYKIFERC
Subjt:  FPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

TrEMBL top hitse value%identityAlignment
A0A6J1EVF2 DNA mismatch repair protein MLH1 isoform X20.0e+0088.09Show/hide
Query:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH
        ME HADDE         EEV  +EPP+I+RL  SVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNV VKDGGLKLIQVSDDGHGIRYEDLPILCERH
Subjt:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH

Query:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL
        TTSKLSKFEDLQSI SMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQN SDDY+KIVDLL
Subjt:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL

Query:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE
        SRFAIHHINI FSCRKHGAARADVHSVGSTSRLDAI TVYGAS++RNLMKIEVS++ +ACS FKMDGLISNSNY AKKITMVLFIN RMV+CSALKRAIE
Subjt:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE

Query:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQS---GSKSQ
        IVYAATLPKASKPFIYMSI+LPPEHVDVNVHPTKKEVSLLNQEVIIE+IQS VESKLRSSND RTFQEQDVESSAAS+MVI ++  QNSSQS   GSKSQ
Subjt:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQS---GSKSQ

Query:  KVPMHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYL
        KVP+ KMVRTDSTDPAGRLHAYVQM PPGLPESSL  VRSFVR RRNP+EAANLTS+QDL +EID+NCHAGLLN VRHC+YIGM DDVFALLQHDTHLYL
Subjt:  KVPMHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYL

Query:  ANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYT
        ANVVNLSKELMYQQVL RFAHFNA+QLSNPAPLYELISLALREED NSESNENDDFN+KV+E STKLLKLKAEMLEEFFCI+ID +GNLARLPVVLDQYT
Subjt:  ANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYT

Query:  PDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIH
        PDMDRVPEFVLSLANDIDWE+EK+C+Q+ISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNP+DENIG + + MENEI+HGLLSEAET+WAQREWSI 
Subjt:  PDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIH

Query:  HVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
        HVL PS+KLFFKP HSLA+NG+FIRVASLE+LYKIFERC
Subjt:  HVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

A0A6J1EVP8 DNA mismatch repair protein MLH1 isoform X30.0e+0088.33Show/hide
Query:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH
        ME HADDE         EEV  +EPP+I+RL  SVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNV VKDGGLKLIQVSDDGHGIRYEDLPILCERH
Subjt:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH

Query:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL
        TTSKLSKFEDLQSI SMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQN SDDY+KIVDLL
Subjt:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL

Query:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE
        SRFAIHHINI FSCRKHGAARADVHSVGSTSRLDAI TVYGAS++RNLMKIEVS++ +ACS FKMDGLISNSNY AKKITMVLFIN RMV+CSALKRAIE
Subjt:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE

Query:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVP
        IVYAATLPKASKPFIYMSI+LPPEHVDVNVHPTKKEVSLLNQEVIIE+IQS VESKLRSSND RTFQEQDVESSAAS+MVI ++  QNSSQSGSKSQKVP
Subjt:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVP

Query:  MHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANV
        + KMVRTDSTDPAGRLHAYVQM PPGLPESSL  VRSFVR RRNP+EAANLTS+QDL +EID+NCHAGLLN VRHC+YIGM DDVFALLQHDTHLYLANV
Subjt:  MHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANV

Query:  VNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDM
        VNLSKELMYQQVL RFAHFNA+QLSNPAPLYELISLALREED NSESNENDDFN+KV+E STKLLKLKAEMLEEFFCI+ID +GNLARLPVVLDQYTPDM
Subjt:  VNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDM

Query:  DRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEE-VAMENEIDHGLLSEAETVWAQREWSIHHV
        DRVPEFVLSLANDIDWE+EK+C+Q+ISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNP+DENIG+++ + MENEI+HGLLSEAET+WAQREWSI HV
Subjt:  DRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEE-VAMENEIDHGLLSEAETVWAQREWSIHHV

Query:  LFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
        L PS+KLFFKP HSLA+NG+FIRVASLE+LYKIFERC
Subjt:  LFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

A0A6J1F196 DNA mismatch repair protein MLH1 isoform X10.0e+0087.97Show/hide
Query:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH
        ME HADDE         EEV  +EPP+I+RL  SVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNV VKDGGLKLIQVSDDGHGIRYEDLPILCERH
Subjt:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH

Query:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL
        TTSKLSKFEDLQSI SMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQN SDDY+KIVDLL
Subjt:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL

Query:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE
        SRFAIHHINI FSCRKHGAARADVHSVGSTSRLDAI TVYGAS++RNLMKIEVS++ +ACS FKMDGLISNSNY AKKITMVLFIN RMV+CSALKRAIE
Subjt:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE

Query:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQS---GSKSQ
        IVYAATLPKASKPFIYMSI+LPPEHVDVNVHPTKKEVSLLNQEVIIE+IQS VESKLRSSND RTFQEQDVESSAAS+MVI ++  QNSSQS   GSKSQ
Subjt:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQS---GSKSQ

Query:  KVPMHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYL
        KVP+ KMVRTDSTDPAGRLHAYVQM PPGLPESSL  VRSFVR RRNP+EAANLTS+QDL +EID+NCHAGLLN VRHC+YIGM DDVFALLQHDTHLYL
Subjt:  KVPMHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYL

Query:  ANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYT
        ANVVNLSKELMYQQVL RFAHFNA+QLSNPAPLYELISLALREED NSESNENDDFN+KV+E STKLLKLKAEMLEEFFCI+ID +GNLARLPVVLDQYT
Subjt:  ANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYT

Query:  PDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEE-VAMENEIDHGLLSEAETVWAQREWSI
        PDMDRVPEFVLSLANDIDWE+EK+C+Q+ISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNP+DENIG+++ + MENEI+HGLLSEAET+WAQREWSI
Subjt:  PDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEE-VAMENEIDHGLLSEAETVWAQREWSI

Query:  HHVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
         HVL PS+KLFFKP HSLA+NG+FIRVASLE+LYKIFERC
Subjt:  HHVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

A0A6J1IBD7 DNA mismatch repair protein MLH1 isoform X10.0e+0088.5Show/hide
Query:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH
        ME HADDE         EEV  +EPP+I+RL  SVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNV VKDGGLKLIQVSDDGHGIRYEDLPILCERH
Subjt:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH

Query:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL
        TTSKLSKFEDLQSI SMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQN SDDY+KIVDLL
Subjt:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL

Query:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE
        SRFAIHHINI FSCRKHGAARADVHSVGSTSRLDAI TVYGAS++RNLMKIEVS++ +ACS FKMDGLISNSNYIAKKITMVLFIN RMV+CSALKRAIE
Subjt:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE

Query:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQS---GSKSQ
        IVYAATLPKASKPFIYMSI+LPPEHVDVNVHPTKKEVSLLNQEVIIE+IQS VESKLRSSND RTFQEQDVESSAAS+MVIS++  QNSSQS   GSKSQ
Subjt:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQS---GSKSQ

Query:  KVPMHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYL
        K+P+ KMVRTDSTDPAGRLHAYVQMKPPGLPESSL  VRSFVR RRNPKEAANLTS+QDL +EID+NCHAGLLN VRHC+YIGM DDVFALLQHDTHLYL
Subjt:  KVPMHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYL

Query:  ANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYT
        A+VV+LSKELMYQQVL RFAHFNA+QLSNPAPLYELISLALREED NSESNENDDFN+KV+E STKLLKLKAEMLEEFFCI+ID +GNLARLPVVLDQYT
Subjt:  ANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYT

Query:  PDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIH
        PDMDRVPEFVLSLANDIDWE+EK+C+Q+ISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNP+DENIG + + MENEIDHGLLSEAET+WAQREWSI 
Subjt:  PDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIH

Query:  HVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
        HVL PS+KLFFKP HSLA+NG+FIRVASLEKLYKIFERC
Subjt:  HVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

A0A6J1ICV0 DNA mismatch repair protein MLH1 isoform X20.0e+0088.86Show/hide
Query:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH
        ME HADDE         EEV  +EPP+I+RL  SVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNV VKDGGLKLIQVSDDGHGIRYEDLPILCERH
Subjt:  MEAHADDE-------SGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERH

Query:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL
        TTSKLSKFEDLQSI SMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQN SDDY+KIVDLL
Subjt:  TTSKLSKFEDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLL

Query:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE
        SRFAIHHINI FSCRKHGAARADVHSVGSTSRLDAI TVYGAS++RNLMKIEVS++ +ACS FKMDGLISNSNYIAKKITMVLFIN RMV+CSALKRAIE
Subjt:  SRFAIHHINIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIE

Query:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVP
        IVYAATLPKASKPFIYMSI+LPPEHVDVNVHPTKKEVSLLNQEVIIE+IQS VESKLRSSND RTFQEQDVESSAAS+MVIS++  QNSSQSGSKSQK+P
Subjt:  IVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVP

Query:  MHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANV
        + KMVRTDSTDPAGRLHAYVQMKPPGLPESSL  VRSFVR RRNPKEAANLTS+QDL +EID+NCHAGLLN VRHC+YIGM DDVFALLQHDTHLYLA+V
Subjt:  MHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANV

Query:  VNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDM
        V+LSKELMYQQVL RFAHFNA+QLSNPAPLYELISLALREED NSESNENDDFN+KV+E STKLLKLKAEMLEEFFCI+ID +GNLARLPVVLDQYTPDM
Subjt:  VNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDM

Query:  DRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVL
        DRVPEFVLSLANDIDWE+EK+C+Q+ISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNP+DENIG + + MENEIDHGLLSEAET+WAQREWSI HVL
Subjt:  DRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVL

Query:  FPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
         PS+KLFFKP HSLA+NG+FIRVASLEKLYKIFERC
Subjt:  FPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

SwissProt top hitse value%identityAlignment
P40692 DNA mismatch repair protein Mlh16.9e-14439.52Show/hide
Query:  IIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSISSMGFRGEALASM
        I RL E+VVNRIAAGEVIQRP +AIKE++EN LDA+STS+ V VK+GGLKLIQ+ D+G GIR EDL I+CER TTSKL  FEDL SIS+ GFRGEALAS+
Subjt:  IIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSISSMGFRGEALASM

Query:  TYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHINIRFSCRKHGAARADVHSV
        ++V HVT+TT T      YR SY DG ++  PKPCA  +GTQITVE+LFYN++ RRK L+N S++Y KI++++ R+++H+  I FS +K G   ADV ++
Subjt:  TYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHINIRFSCRKHGAARADVHSV

Query:  GSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIVLPPEHVD
         + S +D I +++G ++SR L++I   D   A   FKM+G ISN+NY  KK   +LFIN R+VE ++L++AIE VYAA LPK + PF+Y+S+ + P++VD
Subjt:  GSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIVLPPEHVD

Query:  VNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAA---SKMVISNNNAQNSSQSGSKSQKVPMHKMVRTDSTDPAGRLHAYVQ--M
        VNVHPTK EV  L++E I+E++Q  +ESKL  SN  R +  Q +    A    +MV S  +  +SS SGS S KV  H+MVRTDS +   +L A++Q   
Subjt:  VNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAA---SKMVISNNNAQNSSQSGSKSQKVPMHKMVRTDSTDPAGRLHAYVQ--M

Query:  KPPG----------------------------LP------------ESSLTAVRSFVRQRRNP---------------------------------KEAA
        KP                              LP            E   T   S + ++R P                                 +   
Subjt:  KPPG----------------------------LP------------ESSLTAVRSFVRQRRNP---------------------------------KEAA

Query:  NLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNE
        NLTS+  L  EI++  H  L  ++ +  ++G  +  +AL QH T LYL N   LS+EL YQ ++  FA+F  ++LS PAPL++L  LAL  +   S   E
Subjt:  NLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNE

Query:  NDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGL
         D   E ++E   + LK KAEML ++F + ID+ GNL  LP+++D Y P ++ +P F+L LA +++W+EEK C +++S     FY+              
Subjt:  NDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGL

Query:  QFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
             I+     ++  +  ++  +   I +         W   +W++ H+++ +L+    P     ++G  +++A+L  LYK+FERC
Subjt:  QFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

P97679 DNA mismatch repair protein Mlh11.0e-13437.87Show/hide
Query:  IIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSISSMGFRGEALASM
        I RL E+VVNRIAAGEVIQRP +AIKE+ EN LDA+ST++ V V++GGLKLIQ+ D+G GIR EDL I+CER TTSKL  FEDL  IS+ GFRGEALAS+
Subjt:  IIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSISSMGFRGEALASM

Query:  TYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHINIRFSCRKHGAARADVHSV
        ++V HVT+TT T      YR SY DG ++  PKPCA  +GT ITVE+LFYN+  R+K L+N S++Y KI++++ R++IH+  I FS +K G   +DV ++
Subjt:  TYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHINIRFSCRKHGAARADVHSV

Query:  GSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIVLPPEHVD
         + + +D I +++G ++SR L+++   D   A   FKM+G ISN+NY  KK   +LFIN R+VE +ALK+AIE VYAA LPK + PF+Y+ + + P++VD
Subjt:  GSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIVLPPEHVD

Query:  VNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAA---SKMVISNNNAQNSSQSGSKSQKVPMHKMVRTDSTDP---------AGR
        VNVHPTK EV  L++E I+E++Q  +ESKL  SN  R +  Q +    A    + V S     +SS SGS   KV  ++MVRTDS D          + R
Subjt:  VNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAA---SKMVISNNNAQNSSQSGSKSQKVPMHKMVRTDSTDP---------AGR

Query:  LHAYVQMKPPG--------------------------------LPESSLTAVRSFVRQRRNP----------------------------------KEAA
        L +  Q   PG                                L   S+      V  +R+P                                  +   
Subjt:  LHAYVQMKPPG--------------------------------LPESSLTAVRSFVRQRRNP----------------------------------KEAA

Query:  NLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNE
        NLTS+  L  EI+   H  L  ++R+  ++G  +  +AL QH T LYL N   LS+EL YQ ++  FA+F  ++L  PAPL++   LAL  +   S   E
Subjt:  NLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNE

Query:  NDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGL
         D   E ++E   + LK KA+ML ++F + ID+ GNL  LP+++D Y P ++ +P F+L LA +++W+EE+ C +++S     FY++    +   S    
Subjt:  NDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGL

Query:  QFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
                SG   D                 +       W   +W++ H+++ + +    P     ++G  +++A+L  L K+FERC
Subjt:  QFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

Q54KD8 DNA mismatch repair protein Mlh15.7e-12233.96Show/hide
Query:  VLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSISSMGF
        ++++   +I RL + VVN+I+AGEVIQRP +A+KEL+EN LDA+ST++ VTVKDGG+K +Q+ D+G GIR ED+ I+CER TTSKL+KFEDL+SI S GF
Subjt:  VLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSISSMGF

Query:  RGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVM--------EHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHINIR
        RGEAL+S+++V H+ + T T      YR  Y +G +          +PKPCA V GTQITVE+LF+N  +R+  L+N  D++S+IV L+ ++AI++  + 
Subjt:  RGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVM--------EHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHINIR

Query:  FSCRKHGAARADVHSVGSTSRL--DAICTVYGASISRNLMKIEV--------SDDGEACSG------------FKMDGLISNSNYIAKKITMVLFINGRM
        F  +K G    +VH+ G  + L  D I ++YG  +S+ L  I +        +DD +  SG            F M G  S++NY +KKI  +LFINGR+
Subjt:  FSCRKHGAARADVHSVGSTSRL--DAICTVYGASISRNLMKIEV--------SDDGEACSG------------FKMDGLISNSNYIAKKITMVLFINGRM

Query:  VECSALKRAIEIVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTF--------------------QEQ
        V+   LK  +E VY+  LPK + PF+++ +++PP+++DVN+HPTK EV +L++E IIE IQ +V+ +L  S++ +TF                    Q+ 
Subjt:  VECSALKRAIEIVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTF--------------------QEQ

Query:  DVESSAASKMVISNNN----------AQNSSQSGSKSQKV-----PMHKMVRTDST----------------DPAGRLHAYVQMK-------------PP
           SS  +K   +NNN          A++  +S SKSQ +     PM      DS+                  AG   A  +               P 
Subjt:  DVESSAASKMVISNNN----------AQNSSQSGSKSQKV-----PMHKMVRTDST----------------DPAGRLHAYVQMK-------------PP

Query:  GL--PESSLTAVRS----------------------------------------FVRQRRNPK-EAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMT
         +  P SS+T ++S                                        F+  R+  K +   LTSI+ L SE   N H GL      C+++G  
Subjt:  GL--PESSLTAVRS----------------------------------------FVRQRRNPK-EAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMT

Query:  DDVFALLQHDTHLYLANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDK
        D  +AL+Q    LYL N+ N++KEL YQ  L RF+ F++++ S    +Y L+ ++L  +   S   E+D   +K+++  TKLL  K E+L E+F I I++
Subjt:  DDVFALLQHDTHLYLANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDK

Query:  SGNLARLPVVLDQYTPDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLL
         G L  +P VLD Y P  D +P F+L LA +++WE EK C   I   I +F+ + P  L                    +D  + +++        + + 
Subjt:  SGNLARLPVVLDQYTPDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLL

Query:  SEAETVWAQREWSIHHVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
           +     +EW I H++FP+ +    P    A +G+ I++ +L+ LYK+FERC
Subjt:  SEAETVWAQREWSIHHVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

Q9JK91 DNA mismatch repair protein Mlh13.5e-14038.53Show/hide
Query:  IIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSISSMGFRGEALASM
        I RL E+VVNRIAAGEVIQRP +AIKE++EN LDA+ST++ V VK+GGLKLIQ+ D+G GIR EDL I+CER TTSKL  FEDL SIS+ GFRGEALAS+
Subjt:  IIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSISSMGFRGEALASM

Query:  TYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHINIRFSCRKHGAARADVHSV
        ++V HVT+TT T      YR SY DG ++  PKPCA  +GT ITVE+LFYN+  RRK L+N S++Y KI++++ R++IH+  I FS +K G   +DV ++
Subjt:  TYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHINIRFSCRKHGAARADVHSV

Query:  GSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIVLPPEHVD
         + + +D I +++G ++SR L+++   D   A   FKM+G ISN+NY  KK   +LFIN R+VE +AL++AIE VYAA LPK + PF+Y+S+ + P++VD
Subjt:  GSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIVLPPEHVD

Query:  VNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAA---SKMVISNNNAQNSSQSGSKSQKVPMHKMVRTDS---------------
        VNVHPTK EV  L++E I++++Q  +ESKL  SN  R +  Q +    A    +         +SS SGS   KV  ++MVRTDS               
Subjt:  VNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAA---SKMVISNNNAQNSSQSGSKSQKVPMHKMVRTDS---------------

Query:  -----TDPAGRLHAYVQMKPP----------GLPESSLTAVRSFVRQRRNPKEAA---------------------------------------------
              DPA    A  +  P            LP  +  A  S   +R +  E +                                             
Subjt:  -----TDPAGRLHAYVQMKPP----------GLPESSLTAVRSFVRQRRNPKEAA---------------------------------------------

Query:  --NLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSES
          NLTS+  L  EI + CH  L  ++R+  ++G  +  +AL QH T LYL N   LS+EL YQ ++  FA+F  ++LS PAPL++L  LAL  +   S  
Subjt:  --NLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSES

Query:  NENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGD
         E+D   E ++E   + LK KAEML ++F + ID+ GNL  LP+++D Y P ++ +P F+L LA +++W+EEK C +++S     FY++    +   S  
Subjt:  NENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGD

Query:  GLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC
                  SG   D                 +       W   +W++ H+++ + +    P     ++G  +++A+L  LYK+FERC
Subjt:  GLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC

Q9ZRV4 DNA mismatch repair protein MLH11.3e-29671Show/hide
Query:  EAHADDESGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKF
        E   ++     ++ +EPP+I RL ESVVNRIAAGEVIQRPVSA+KELVENSLDA S+S++V VKDGGLKLIQVSDDGHGIR EDLPILCERHTTSKL+KF
Subjt:  EAHADDESGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKF

Query:  EDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHI
        EDL S+SSMGFRGEALASMTYV HVTVTTITKGQ+HGYRVSYRDGVMEHEPK CAAVKGTQI VENLFYNM ARRKTLQN++DDY KIVDLLSR AIH+ 
Subjt:  EDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHI

Query:  NIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSD-DGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATL
        N+ FSCRKHGA +ADVHSV S SRLD+I +VYG S+++NLMK+EVS  D   C+ F M+G ISNSNY+AKK  +VLFIN R+VECSALKRAIEIVYAATL
Subjt:  NIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSD-DGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATL

Query:  PKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVPMHKMVRT
        PKASKPF+YMSI LP EHVD+N+HPTKKEVSLLNQE+IIE IQSEVE KLR++ND RTFQEQ VE   ++     +++  +   SG K+QKVP++KMVRT
Subjt:  PKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVPMHKMVRT

Query:  DSTDPAGRLHAYVQMKPPGLPE--SSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANVVNLSK
        DS+DPAGRLHA++Q KP  LP+  SSL+ VRS VRQRRNPKE A+L+S+Q+L + +D  CH G+L  VR+C Y+GM DDVFAL+Q++THLYLANVVNLSK
Subjt:  DSTDPAGRLHAYVQMKPPGLPE--SSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANVVNLSK

Query:  ELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDMDRVPE
        ELMYQQ LRRFAHFNA+QLS+PAPL ELI LAL+EED +  ++  DD  E+++E +T+LLK KAEMLEE+F ++ID S NL+RLPV+LDQYTPDMDRVPE
Subjt:  ELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDMDRVPE

Query:  FVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVLFPSLK
        F+L L ND++WE+EKSC Q +SAAIGNFYAMHPPLLPNPSGDG+QFY +   S   K +  G+  V ME+ +D  LLS+AE  WAQREWSI HVLFPS++
Subjt:  FVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVLFPSLK

Query:  LFFKPSHSLAQNGTFIRVASLEKLYKIFERC
        LF KP  S+A NGTF++VASLEKLYKIFERC
Subjt:  LFFKPSHSLAQNGTFIRVASLEKLYKIFERC

Arabidopsis top hitse value%identityAlignment
AT4G02460.1 DNA mismatch repair protein, putative2.2e-3631.02Show/hide
Query:  PRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSISSMGFRGEALA
        P I  ++ +V++RI +G+VI    SA+KELVENSLDA +TS+ + ++D G    QV D+G GI   +  +L  +H TSKL  F DL ++++ GFRGEAL+
Subjt:  PRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSISSMGFRGEALA

Query:  SMTYVGHVTVTTITKGQLHGYRVSY-RDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTL-QNASDDYSKIVDLLSRFAIHHINIRFSCRK---HGAA
        S+  +G++TV T TK +     +++   G++  E K    + GT +TV  LF N+  R K   +N   +Y K+V LL+ +A+    +RF C         
Subjt:  SMTYVGHVTVTTITKGQLHGYRVSY-RDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTL-QNASDDYSKIVDLLSRFAIHHINIRFSCRK---HGAA

Query:  RADVHSVGSTSRLDAICTVYGASISRNLMKIE--VSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMS
           +++ G  S  D I TV+G S   +L  +   VS+D          G  +  N   ++     FINGR V+   + + +  +Y  T  +   P   + 
Subjt:  RADVHSVGSTSRLDAICTVYGASISRNLMKIE--VSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMS

Query:  IVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQ---SEVESKLRSSNDMRTFQEQDVESSAA
         ++P    D+NV P K++V   ++  +I  ++   +E+ S   +S  +  F+E   +   A
Subjt:  IVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQ---SEVESKLRSSNDMRTFQEQDVESSAA

AT4G09140.1 MUTL-homologue 19.4e-29871Show/hide
Query:  EAHADDESGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKF
        E   ++     ++ +EPP+I RL ESVVNRIAAGEVIQRPVSA+KELVENSLDA S+S++V VKDGGLKLIQVSDDGHGIR EDLPILCERHTTSKL+KF
Subjt:  EAHADDESGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKF

Query:  EDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHI
        EDL S+SSMGFRGEALASMTYV HVTVTTITKGQ+HGYRVSYRDGVMEHEPK CAAVKGTQI VENLFYNM ARRKTLQN++DDY KIVDLLSR AIH+ 
Subjt:  EDLQSISSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHI

Query:  NIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSD-DGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATL
        N+ FSCRKHGA +ADVHSV S SRLD+I +VYG S+++NLMK+EVS  D   C+ F M+G ISNSNY+AKK  +VLFIN R+VECSALKRAIEIVYAATL
Subjt:  NIRFSCRKHGAARADVHSVGSTSRLDAICTVYGASISRNLMKIEVSD-DGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATL

Query:  PKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVPMHKMVRT
        PKASKPF+YMSI LP EHVD+N+HPTKKEVSLLNQE+IIE IQSEVE KLR++ND RTFQEQ VE   ++     +++  +   SG K+QKVP++KMVRT
Subjt:  PKASKPFIYMSIVLPPEHVDVNVHPTKKEVSLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVPMHKMVRT

Query:  DSTDPAGRLHAYVQMKPPGLPE--SSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANVVNLSK
        DS+DPAGRLHA++Q KP  LP+  SSL+ VRS VRQRRNPKE A+L+S+Q+L + +D  CH G+L  VR+C Y+GM DDVFAL+Q++THLYLANVVNLSK
Subjt:  DSTDPAGRLHAYVQMKPPGLPE--SSLTAVRSFVRQRRNPKEAANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANVVNLSK

Query:  ELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDMDRVPE
        ELMYQQ LRRFAHFNA+QLS+PAPL ELI LAL+EED +  ++  DD  E+++E +T+LLK KAEMLEE+F ++ID S NL+RLPV+LDQYTPDMDRVPE
Subjt:  ELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKVSEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDMDRVPE

Query:  FVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVLFPSLK
        F+L L ND++WE+EKSC Q +SAAIGNFYAMHPPLLPNPSGDG+QFY +   S   K +  G+  V ME+ +D  LLS+AE  WAQREWSI HVLFPS++
Subjt:  FVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIGSEEVAMENEIDHGLLSEAETVWAQREWSIHHVLFPSLK

Query:  LFFKPSHSLAQNGTFIRVASLEKLYKIFERC
        LF KP  S+A NGTF++VASLEKLYKIFERC
Subjt:  LFFKPSHSLAQNGTFIRVASLEKLYKIFERC

AT4G35520.1 MUTL protein homolog 38.4e-2031.22Show/hide
Query:  LHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSIS-SMGFRGEALASMTY
        L E V + + +G ++      ++ELV NSLDA +T V++ V       ++V DDG G+  +DL +L ER+ TSK   F ++++ S + GFRGEALAS++ 
Subjt:  LHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSIS-SMGFRGEALASMTY

Query:  VGHVTVTTITKGQLHGYRVSYRDGVMEH-EPKPCAAVKGTQITVENLFYNMSARRKTLQNASDD-YSKIVDLLSRFAIHHINIRFSCRKHGAARADVHSV
        +  + V T   G+ +GYR   +     H          GT +TV +LFY+   RRK +Q++       I   + R A+ H N+ FS     +      + 
Subjt:  VGHVTVTTITKGQLHGYRVSYRDGVMEH-EPKPCAAVKGTQITVENLFYNMSARRKTLQNASDD-YSKIVDLLSRFAIHHINIRFSCRKHGAARADVHSV

Query:  GSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFK
         S+S    +    G     +L K+ V+D     SGF+
Subjt:  GSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCCCATGCGGACGACGAGAGTGGGGAAGAAGTTCTTCACAAAGAGCCCCCCAGAATCATCCGACTTCACGAGTCCGTCGTTAATCGAATCGCCGCCGGAGAGGT
CATTCAAAGGCCAGTCTCCGCCATTAAAGAACTCGTCGAGAACAGCCTCGACGCCCAATCTACCTCCGTCAACGTCACTGTCAAAGACGGCGGCCTCAAACTCATCCAAG
TTTCCGACGACGGCCATGGCATACGCTATGAAGATTTGCCAATTCTGTGCGAGAGGCACACGACGTCCAAGTTGTCGAAATTTGAGGATCTGCAGTCCATAAGCTCGATG
GGGTTTCGAGGAGAGGCGCTGGCTAGTATGACCTATGTGGGTCATGTTACGGTGACCACCATTACTAAAGGACAACTCCACGGTTACAGAGTATCCTATAGAGATGGAGT
GATGGAGCATGAACCCAAGCCATGTGCTGCTGTAAAAGGAACTCAGATAACGGTTGAGAATCTGTTCTATAACATGTCTGCTAGGAGGAAGACACTACAAAATGCGTCTG
ATGATTACTCAAAGATTGTGGATCTCCTAAGTCGATTTGCCATTCATCATATAAACATCAGATTTTCTTGCAGAAAGCATGGAGCTGCCAGAGCAGATGTTCACTCAGTT
GGGTCAACTTCAAGGTTGGATGCCATTTGTACAGTTTATGGTGCTTCAATTTCTCGCAATCTAATGAAAATCGAAGTTTCAGACGATGGAGAAGCCTGTTCAGGTTTCAA
GATGGATGGTCTAATTTCCAACTCAAATTATATTGCAAAGAAGATCACAATGGTGCTCTTTATCAATGGAAGAATGGTAGAATGCAGTGCTTTAAAAAGAGCTATTGAAA
TTGTTTATGCTGCAACCTTGCCCAAAGCATCAAAACCTTTCATATATATGTCAATTGTATTGCCACCTGAGCATGTTGATGTGAATGTTCATCCGACCAAAAAAGAGGTA
AGCCTTCTGAACCAAGAAGTTATTATTGAGAAGATACAATCAGAGGTTGAATCAAAACTGAGAAGTTCAAACGACATGAGGACATTTCAAGAACAGGATGTAGAATCTTC
TGCGGCTAGTAAAATGGTGATTAGCAATAACAATGCTCAGAATTCCTCGCAATCTGGTTCAAAATCACAAAAGGTTCCAATGCATAAAATGGTTAGGACAGATTCAACGG
ATCCAGCTGGCAGGTTGCACGCATATGTGCAAATGAAGCCTCCTGGCCTCCCTGAATCTAGCTTGACTGCTGTGAGGTCTTTTGTTAGACAGAGAAGGAATCCCAAGGAA
GCTGCTAATCTTACTAGCATTCAAGATCTTAATTCAGAAATTGATAAGAATTGTCATGCTGGTCTCCTTAACATTGTAAGACATTGTATATATATTGGAATGACAGATGA
TGTCTTCGCACTGCTTCAACATGATACTCATCTCTATCTAGCGAATGTTGTGAATTTGAGCAAAGAACTCATGTACCAGCAAGTTTTACGTAGATTTGCTCATTTTAATG
CTATGCAATTGAGCAACCCGGCCCCTCTATACGAGTTAATTAGTTTAGCACTGAGGGAGGAAGATGGGAACTCAGAGTCTAATGAGAATGATGATTTTAATGAGAAGGTT
TCCGAGGCGAGTACAAAACTGCTCAAGTTGAAAGCTGAAATGCTGGAGGAATTTTTCTGCATTTATATCGACAAAAGTGGAAATTTGGCTAGACTTCCAGTCGTACTTGA
CCAATACACACCTGATATGGACCGTGTTCCTGAATTTGTGCTTTCCTTGGCTAATGATATTGATTGGGAAGAAGAGAAAAGTTGTCTCCAGGCAATTTCAGCTGCCATTG
GAAACTTCTATGCTATGCATCCTCCCTTGTTGCCCAATCCATCAGGCGATGGCTTGCAGTTCTACAAAAGGATAAAATCGTCTGGGAATCCTAAGGATGAAAATATAGGT
AGTGAAGAGGTAGCAATGGAGAATGAAATAGACCATGGCTTACTATCGGAAGCAGAGACCGTATGGGCTCAACGAGAATGGTCTATACATCATGTACTCTTCCCATCATT
GAAGCTGTTCTTCAAGCCTTCACATTCTCTGGCCCAAAATGGAACTTTCATTCGGGTTGCATCATTGGAGAAACTTTACAAGATCTTTGAGAGATGTTGA
mRNA sequenceShow/hide mRNA sequence
GTAACATAGGTTCAAATACAATATCGTCTGGAAATTTGGATTGAACGAAAAAGCTACTCTAAGCCATTTTTATGTATGTATTTTTTTAAGTAGTTAATAAATACTCATTT
TAAGTACTCTTTCATTCCACGAATAAAGAGGAAAAGTGGGATTTGTTGGCGGGCTTTTGTCGCAATAGAGAGGAGTGAAAGGTGGTAGGTTTGCCATGGAAGCCCATGCG
GACGACGAGAGTGGGGAAGAAGTTCTTCACAAAGAGCCCCCCAGAATCATCCGACTTCACGAGTCCGTCGTTAATCGAATCGCCGCCGGAGAGGTCATTCAAAGGCCAGT
CTCCGCCATTAAAGAACTCGTCGAGAACAGCCTCGACGCCCAATCTACCTCCGTCAACGTCACTGTCAAAGACGGCGGCCTCAAACTCATCCAAGTTTCCGACGACGGCC
ATGGCATACGCTATGAAGATTTGCCAATTCTGTGCGAGAGGCACACGACGTCCAAGTTGTCGAAATTTGAGGATCTGCAGTCCATAAGCTCGATGGGGTTTCGAGGAGAG
GCGCTGGCTAGTATGACCTATGTGGGTCATGTTACGGTGACCACCATTACTAAAGGACAACTCCACGGTTACAGAGTATCCTATAGAGATGGAGTGATGGAGCATGAACC
CAAGCCATGTGCTGCTGTAAAAGGAACTCAGATAACGGTTGAGAATCTGTTCTATAACATGTCTGCTAGGAGGAAGACACTACAAAATGCGTCTGATGATTACTCAAAGA
TTGTGGATCTCCTAAGTCGATTTGCCATTCATCATATAAACATCAGATTTTCTTGCAGAAAGCATGGAGCTGCCAGAGCAGATGTTCACTCAGTTGGGTCAACTTCAAGG
TTGGATGCCATTTGTACAGTTTATGGTGCTTCAATTTCTCGCAATCTAATGAAAATCGAAGTTTCAGACGATGGAGAAGCCTGTTCAGGTTTCAAGATGGATGGTCTAAT
TTCCAACTCAAATTATATTGCAAAGAAGATCACAATGGTGCTCTTTATCAATGGAAGAATGGTAGAATGCAGTGCTTTAAAAAGAGCTATTGAAATTGTTTATGCTGCAA
CCTTGCCCAAAGCATCAAAACCTTTCATATATATGTCAATTGTATTGCCACCTGAGCATGTTGATGTGAATGTTCATCCGACCAAAAAAGAGGTAAGCCTTCTGAACCAA
GAAGTTATTATTGAGAAGATACAATCAGAGGTTGAATCAAAACTGAGAAGTTCAAACGACATGAGGACATTTCAAGAACAGGATGTAGAATCTTCTGCGGCTAGTAAAAT
GGTGATTAGCAATAACAATGCTCAGAATTCCTCGCAATCTGGTTCAAAATCACAAAAGGTTCCAATGCATAAAATGGTTAGGACAGATTCAACGGATCCAGCTGGCAGGT
TGCACGCATATGTGCAAATGAAGCCTCCTGGCCTCCCTGAATCTAGCTTGACTGCTGTGAGGTCTTTTGTTAGACAGAGAAGGAATCCCAAGGAAGCTGCTAATCTTACT
AGCATTCAAGATCTTAATTCAGAAATTGATAAGAATTGTCATGCTGGTCTCCTTAACATTGTAAGACATTGTATATATATTGGAATGACAGATGATGTCTTCGCACTGCT
TCAACATGATACTCATCTCTATCTAGCGAATGTTGTGAATTTGAGCAAAGAACTCATGTACCAGCAAGTTTTACGTAGATTTGCTCATTTTAATGCTATGCAATTGAGCA
ACCCGGCCCCTCTATACGAGTTAATTAGTTTAGCACTGAGGGAGGAAGATGGGAACTCAGAGTCTAATGAGAATGATGATTTTAATGAGAAGGTTTCCGAGGCGAGTACA
AAACTGCTCAAGTTGAAAGCTGAAATGCTGGAGGAATTTTTCTGCATTTATATCGACAAAAGTGGAAATTTGGCTAGACTTCCAGTCGTACTTGACCAATACACACCTGA
TATGGACCGTGTTCCTGAATTTGTGCTTTCCTTGGCTAATGATATTGATTGGGAAGAAGAGAAAAGTTGTCTCCAGGCAATTTCAGCTGCCATTGGAAACTTCTATGCTA
TGCATCCTCCCTTGTTGCCCAATCCATCAGGCGATGGCTTGCAGTTCTACAAAAGGATAAAATCGTCTGGGAATCCTAAGGATGAAAATATAGGTAGTGAAGAGGTAGCA
ATGGAGAATGAAATAGACCATGGCTTACTATCGGAAGCAGAGACCGTATGGGCTCAACGAGAATGGTCTATACATCATGTACTCTTCCCATCATTGAAGCTGTTCTTCAA
GCCTTCACATTCTCTGGCCCAAAATGGAACTTTCATTCGGGTTGCATCATTGGAGAAACTTTACAAGATCTTTGAGAGATGTTGAGCAACAGTTATTTTCACTGATCTGA
GGCTGCTTATAGAATCTGGGATAGCCTATTGCATTTTGTATCTTTTTTTTTCCTCCTGTAATTTTATGGTGCACATTTTGCTTGTCCAGTAGCAGACATTGATGTTCTTG
TAAATCCTAAAGTACCTTTAAATCATCCAGTTTTGCAAAGCTTTTGTTGGGATGATTTTTTGGGTGACAGTGTACGGTCCAAAAAACTTGCAATTTATTTTCACGATAGA
ATCCATGGGAAGGACCAAATCAATGAAATTAAGAGAAGGTCAAACCCGTTAAAATTATTT
Protein sequenceShow/hide protein sequence
MEAHADDESGEEVLHKEPPRIIRLHESVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVTVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSISSM
GFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNASDDYSKIVDLLSRFAIHHINIRFSCRKHGAARADVHSV
GSTSRLDAICTVYGASISRNLMKIEVSDDGEACSGFKMDGLISNSNYIAKKITMVLFINGRMVECSALKRAIEIVYAATLPKASKPFIYMSIVLPPEHVDVNVHPTKKEV
SLLNQEVIIEKIQSEVESKLRSSNDMRTFQEQDVESSAASKMVISNNNAQNSSQSGSKSQKVPMHKMVRTDSTDPAGRLHAYVQMKPPGLPESSLTAVRSFVRQRRNPKE
AANLTSIQDLNSEIDKNCHAGLLNIVRHCIYIGMTDDVFALLQHDTHLYLANVVNLSKELMYQQVLRRFAHFNAMQLSNPAPLYELISLALREEDGNSESNENDDFNEKV
SEASTKLLKLKAEMLEEFFCIYIDKSGNLARLPVVLDQYTPDMDRVPEFVLSLANDIDWEEEKSCLQAISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPKDENIG
SEEVAMENEIDHGLLSEAETVWAQREWSIHHVLFPSLKLFFKPSHSLAQNGTFIRVASLEKLYKIFERC