; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg15137 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg15137
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionDNA mismatch repair protein
Genome locationCarg_Chr16:688390..701393
RNA-Seq ExpressionCarg15137
SyntenyCarg15137
Gene Ontology termsGO:0006298 - mismatch repair (biological process)
GO:0006312 - mitotic recombination (biological process)
GO:0007131 - reciprocal meiotic recombination (biological process)
GO:0043570 - maintenance of DNA repeat elements (biological process)
GO:0051096 - positive regulation of helicase activity (biological process)
GO:0032302 - MutSbeta complex (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0032135 - DNA insertion or deletion binding (molecular function)
GO:0008094 - DNA-dependent ATPase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR000432 - DNA mismatch repair protein MutS, C-terminal
IPR045076 - DNA mismatch repair MutS family
IPR036678 - MutS, connector domain superfamily
IPR036187 - DNA mismatch repair protein MutS, core domain superfamily
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR016177 - DNA-binding domain superfamily
IPR016151 - DNA mismatch repair protein MutS, N-terminal
IPR007696 - DNA mismatch repair protein MutS, core
IPR007695 - DNA mismatch repair protein MutS-like, N-terminal
IPR006598 - Glycosyl transferase CAP10 domain
IPR001739 - Methyl-CpG DNA binding


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576729.1 DNA mismatch repair protein MSH3, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0098.44Show/hide
Query:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK
        MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK
Subjt:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK

Query:  FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK
        FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK
Subjt:  FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK

Query:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD
        VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD
Subjt:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD

Query:  NFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI
        NFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI
Subjt:  NFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI

Query:  DNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM
        DNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM
Subjt:  DNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM

Query:  NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSED
        NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLG APDIQRGITRIFHRTAAPSE 
Subjt:  NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSED

Query:  DGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV
                       FIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV
Subjt:  DGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV

Query:  ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF
        ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF
Subjt:  ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF

Query:  SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA
        SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA
Subjt:  SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA

Query:  LMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK
        LMAQVGSYVPAFS KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK
Subjt:  LMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK

Query:  VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK
        VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK
Subjt:  VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK

Query:  GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
        GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
Subjt:  GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG

KAG7014774.1 DNA mismatch repair protein MSH3, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MCGTVRKPVAQLRFVIFSVSVSLSIAACLIISSRLLRHVPTTNGRDAELHIYPHRSEVQLPSVNCTAFSWSGKCRTRSSVIVNEEGDRDRQNFDTTCPEY
        MCGTVRKPVAQLRFVIFSVSVSLSIAACLIISSRLLRHVPTTNGRDAELHIYPHRSEVQLPSVNCTAFSWSGKCRTRSSVIVNEEGDRDRQNFDTTCPEY
Subjt:  MCGTVRKPVAQLRFVIFSVSVSLSIAACLIISSRLLRHVPTTNGRDAELHIYPHRSEVQLPSVNCTAFSWSGKCRTRSSVIVNEEGDRDRQNFDTTCPEY

Query:  FRWIHEDLRPWAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGILQLLRLYPGKIPDLDLMFNCEDRPNIFIGDYSGPGPNSTAP
        FRWIHEDLRPWAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGILQLLRLYPGKIPDLDLMFNCEDRPNIFIGDYSGPGPNSTAP
Subjt:  FRWIHEDLRPWAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGILQLLRLYPGKIPDLDLMFNCEDRPNIFIGDYSGPGPNSTAP

Query:  PPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYAYWKGNIKVSMVRYKLLECNLSREHDWKARVFMQDWDKEQKERFK
        PPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYAYWKGNIKVSMVRYKLLECNLSREHDWKARVFMQDWDKEQKERFK
Subjt:  PPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYAYWKGNIKVSMVRYKLLECNLSREHDWKARVFMQDWDKEQKERFK

Query:  NSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHHYWPIKDDDDMCNSIKFAVYWGNTHQQKVRAIGKAASKFTEEELRM
        NSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHHYWPIKDDDDMCNSIKFAVYWGNTHQQKVRAIGKAASKFTEEELRM
Subjt:  NSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHHYWPIKDDDDMCNSIKFAVYWGNTHQQKVRAIGKAASKFTEEELRM

Query:  EKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPAQDLATKFMIDTLVKQNEKARQSGGHRVCINASVCVRTETKFWSANFMGKQKQQVISRFFA
        EKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPAQDLATKFMIDTLVKQNEKARQSGGHRVCINASVCVRTETKFWSANFMGKQKQQVISRFFA
Subjt:  EKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPAQDLATKFMIDTLVKQNEKARQSGGHRVCINASVCVRTETKFWSANFMGKQKQQVISRFFA

Query:  PKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEKFLEPIENSFESSNQ
        PKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEKFLEPIENSFESSNQ
Subjt:  PKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEKFLEPIENSFESSNQ

Query:  NPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKA
        NPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKA
Subjt:  NPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKA

Query:  HGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDDNFMRSGLEAMLLSL
        HGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDDNFMRSGLEAMLLSL
Subjt:  HGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDDNFMRSGLEAMLLSL

Query:  SPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDNDNLAADHHDPDT
        SPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDNDNLAADHHDPDT
Subjt:  SPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDNDNLAADHHDPDT

Query:  VLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTLTIFGSRLLRQ
        VLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTLTIFGSRLLRQ
Subjt:  VLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTLTIFGSRLLRQ

Query:  WITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRF
        WITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRF
Subjt:  WITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRF

Query:  AFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKL
        AFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKL
Subjt:  AFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKL

Query:  DSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQAL
        DSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQAL
Subjt:  DSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQAL

Query:  ASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPAFSV
        ASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPAFSV
Subjt:  ASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPAFSV

Query:  KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPASAGV
        KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPASAGV
Subjt:  KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPASAGV

Query:  YHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGKGLELESSECFLEDR
        YHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGKGLELESSECFLEDR
Subjt:  YHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGKGLELESSECFLEDR

Query:  IDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLGSRRSRNLLRFQSSASLAKSYRSRPLCTVELGEPEERKEKGLLRLATGFHRKVRNSENWLP
        IDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLGSRRSRNLLRFQSSASLAKSYRSRPLCTVELGEPEERKEKGLLRLATGFHRKVRNSENWLP
Subjt:  IDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLGSRRSRNLLRFQSSASLAKSYRSRPLCTVELGEPEERKEKGLLRLATGFHRKVRNSENWLP

Query:  PGWTVKVKVRKSGKKDKYYFEPSSQMRFNSRAEVFRYLKTAAINHPESEGSRTVEQPENNVEVKKTIAKGLPPGWIREIRVTRTANRIRRDSFYINPVNG
        PGWTVKVKVRKSGKKDKYYFEPSSQMRFNSRAEVFRYLKTAAINHPESEGSRTVEQPENNVEVKKTIAKGLPPGWIREIRVTRTANRIRRDSFYINPVNG
Subjt:  PGWTVKVKVRKSGKKDKYYFEPSSQMRFNSRAEVFRYLKTAAINHPESEGSRTVEQPENNVEVKKTIAKGLPPGWIREIRVTRTANRIRRDSFYINPVNG

Query:  NALRSLRDVHRYLTSGKVSRLAYKSRSQSNSNSNIEFQHDDISSPSVSEKEMLAIGKAKRQIIWSDNSPEPSEIADDEAIFPKAASNTCTVLRRVFIFED
        NALRSLRDVHRYLTSGKVSRLAYKSRSQSNSNSNIEFQHDDISSPSVSEKEMLAIGKAKRQIIWSDNSPEPSEIADDEAIFPKAASNTCTVLRRVFIFED
Subjt:  NALRSLRDVHRYLTSGKVSRLAYKSRSQSNSNSNIEFQHDDISSPSVSEKEMLAIGKAKRQIIWSDNSPEPSEIADDEAIFPKAASNTCTVLRRVFIFED

Query:  SVSDVNLETTAGRIGAKLHCSTPEEAKEREQSEGKNNFSDFVPTPDKSFQHKCTSENGATKIESRKRQRKSKDINLRCRASKRLAGLQAEPVLEAKTGHQ
        SVSDVNLETTAGRIGAKLHCSTPEEAKEREQSEGKNNFSDFVPTPDKSFQHKCTSENGATKIESRKRQRKSKDINLRCRASKRLAGLQAEPVLEAKTGHQ
Subjt:  SVSDVNLETTAGRIGAKLHCSTPEEAKEREQSEGKNNFSDFVPTPDKSFQHKCTSENGATKIESRKRQRKSKDINLRCRASKRLAGLQAEPVLEAKTGHQ

Query:  ACSIACEESDKQVASTTKWPSQSPENPDVKHETKSIIDPPKIDTNLDSSRKEHSAVDLSILPEKVKKMKSEDAYELQQPKCNSFLPSEDVLEKHVGQVES
        ACSIACEESDKQVASTTKWPSQSPENPDVKHETKSIIDPPKIDTNLDSSRKEHSAVDLSILPEKVKKMKSEDAYELQQPKCNSFLPSEDVLEKHVGQVES
Subjt:  ACSIACEESDKQVASTTKWPSQSPENPDVKHETKSIIDPPKIDTNLDSSRKEHSAVDLSILPEKVKKMKSEDAYELQQPKCNSFLPSEDVLEKHVGQVES

Query:  DDKADVKKQGTLLKLPMEDLLTDPCIAFAVKTLTGDVFDASISSEASLMPNNIDRPFVAFTHLESENKLNEKLPSPELPVTRVGKVENNNIVEKLDSTLK
        DDKADVKKQGTLLKLPMEDLLTDPCIAFAVKTLTGDVFDASISSEASLMPNNIDRPFVAFTHLESENKLNEKLPSPELPVTRVGKVENNNIVEKLDSTLK
Subjt:  DDKADVKKQGTLLKLPMEDLLTDPCIAFAVKTLTGDVFDASISSEASLMPNNIDRPFVAFTHLESENKLNEKLPSPELPVTRVGKVENNNIVEKLDSTLK

Query:  LPVGEIWADPCIEFAIKTLTDEIPLVNNPDIEDYFHQFAPSKTEGTNANASNHVVKLDNRKPDCADALRQEDVKRIRGL
        LPVGEIWADPCIEFAIKTLTDEIPLVNNPDIEDYFHQFAPSKTEGTNANASNHVVKLDNRKPDCADALRQEDVKRIRGL
Subjt:  LPVGEIWADPCIEFAIKTLTDEIPLVNNPDIEDYFHQFAPSKTEGTNANASNHVVKLDNRKPDCADALRQEDVKRIRGL

XP_022923129.1 DNA mismatch repair protein MSH3 [Cucurbita moschata]0.0e+0094.8Show/hide
Query:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK
        MGKQKQQVISRFFAPKPKSPSV   SSSSSSALAPPTQ FSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK
Subjt:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK

Query:  FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK
        FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFG+DAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK
Subjt:  FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK

Query:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD
        VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEA+QNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD
Subjt:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD

Query:  NFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI
        NFMRSGLEAMLLSLSPAELLLGDPISKATEK                               LLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI
Subjt:  NFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI

Query:  DNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM
        D DNLAADHHDPDTVLA RKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM
Subjt:  DNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM

Query:  NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSED
        NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKV QNIRESDEEDSDVMVIEPELNYILSSVLTTLG APDIQRGITRIFHRTAAPSE 
Subjt:  NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSED

Query:  DGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV
                       FIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV
Subjt:  DGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV

Query:  ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF
        ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF
Subjt:  ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF

Query:  SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA
        SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVH DEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA
Subjt:  SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA

Query:  LMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK
        LMAQVGSYVPAFS KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTS SLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK
Subjt:  LMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK

Query:  VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK
        VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK
Subjt:  VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK

Query:  GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
        GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
Subjt:  GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG

XP_022984606.1 DNA mismatch repair protein MSH3 [Cucurbita maxima]0.0e+0093.85Show/hide
Query:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK
        MGKQKQQVISRFFAPKPKSPSV   SSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNP+LHKRFLEK
Subjt:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK

Query:  FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK
        FL+PIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFG+DAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK
Subjt:  FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK

Query:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD
        VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLD RIEYGVDVKIGMVAMEISTGDVIYREYDD
Subjt:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD

Query:  NFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI
        NFMRSGLEAMLLSLSPAELLLGDPISKATEK                               LLLGYAGPASNVRVEHVSRDCFKDGSALAEV SLYENI
Subjt:  NFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI

Query:  DNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM
        D DNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM
Subjt:  DNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM

Query:  NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSED
        NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVS+KV QNIRES+EEDSDVMVIEPELNYILSSVLTTLG APDIQRGITRIFHRTAAPSE 
Subjt:  NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSED

Query:  DGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV
                       FIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV
Subjt:  DGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV

Query:  ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF
        ARARKEAQSAREKLDSLIT+YRKQLGMRKLEF SVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASR+AWDGFLSGF
Subjt:  ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF

Query:  SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA
        SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLEST+QGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA
Subjt:  SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA

Query:  LMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK
        LMAQVGSYVPAFS KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTS SLVIIDELGRGTSTHDGVAIAYAAL NLLHHKKCLVLFVTHYPK
Subjt:  LMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK

Query:  VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK
        VADITKEFPASAGVYHVSYLTSHE PSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPL CIARATEMGVWLEEIVTRRAQRKTRE+HLQEASGK
Subjt:  VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK

Query:  GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
        GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGK FLQFSQARSMAMDLLG
Subjt:  GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG

XP_023552828.1 LOW QUALITY PROTEIN: DNA mismatch repair protein MSH3 [Cucurbita pepo subsp. pepo]0.0e+0093.73Show/hide
Query:  SVSNPSLHKRFLEKFLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFR
        ++SN SLHKRFLEKFLEPIENSFESSNQNPKAPTGG AKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFG+DAEIAARVLGIYAHLDRNFMTASIPTFR
Subjt:  SVSNPSLHKRFLEKFLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFR

Query:  LNVHVRRLVSAGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAM
        LNVHVRRLVSAGYKVGVVKQTETAA+KAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSML+DNLDRRIEYGVDVKIGMVAM
Subjt:  LNVHVRRLVSAGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAM

Query:  EISTGDVIYREYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKD
        EISTGDVIYREYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEK                               LLLGYAGPASNVRVEHVSRDCFKD
Subjt:  EISTGDVIYREYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKD

Query:  GSALAEVMSLYENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKN
        GSALAEVMSLYENID DNLAADHHDPDT+LAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKN
Subjt:  GSALAEVMSLYENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKN

Query:  NDDGSETGSLLQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRG
        NDDGSETGSLLQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKV QNIRESDEEDSDVMVIEPELNYILSSVLTTLG APDIQRG
Subjt:  NDDGSETGSLLQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRG

Query:  ITRIFHRTAAPSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDL
        ITRIFHRTAAPS +               FIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDL
Subjt:  ITRIFHRTAAPSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDL

Query:  PNLMIINNDQFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELM
        PNLMIINNDQFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELM
Subjt:  PNLMIINNDQFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELM

Query:  VASRDAWDGFLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNM
        VASRDAWDGFLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNM
Subjt:  VASRDAWDGFLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNM

Query:  GGKSCYIRQVALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLH
        GGKSCYIRQVALIALMAQVGSYVPAFS KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTS SLVIIDELGRGTSTHDGVAIAYAAL NLLH
Subjt:  GGKSCYIRQVALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLH

Query:  HKKCLVLFVTHYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQ
        HKKCLVLFVTHYPKVADITKEFP SAGVYHVSYLTSH+TPSFS PKS+QDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQ
Subjt:  HKKCLVLFVTHYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQ

Query:  RKTREIHLQEASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
        RKTREIHLQ+ASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
Subjt:  RKTREIHLQEASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG

TrEMBL top hitse value%identityAlignment
A0A0A0L5V6 DNA mismatch repair protein0.0e+0083.61Show/hide
Query:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALA---PPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRF
        MGKQKQQVISRFFAPKPKSPS+SSSSSSSS+  A   PPTQPFSP K+SATVTFSPSKRLISSA+ASQLTPPKSSKRPKLSPHTHNPLPS+ NPSLH+RF
Subjt:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALA---PPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRF

Query:  LEKFLEPIENSFESSNQNPKAPTGGDA--KYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLV
        L+K LEP ++SF+ SNQNP+   G D   KYTPLE QVVDLKKRYPDVLLMVEVGYRYRFFG+DA IAARVLGIYAHLD NFMTASIPTFRLNVHVRRLV
Subjt:  LEKFLEPIENSFESSNQNPKAPTGGDA--KYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLV

Query:  SAGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIY
        SAGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEA Q+LGGAEEGC GESNYLFC+VE SM V N+D RIE GVDVKIGMVAMEISTGDVIY
Subjt:  SAGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIY

Query:  REYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMS
         EYDDNFMR+GLEAMLLSLSPAELLLGDPISK TEK                               LLLGYAGPA NVRVE VS DCFKDGSALAEVMS
Subjt:  REYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMS

Query:  LYENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGS
        LYENID +N   D ++P+ VL G+KS+R AIKEI+NMPNLALQALALTIR+LKQFGLERIVSL SSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGS
Subjt:  LYENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGS

Query:  LLQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTA
        LL CMNHTLTIFGSRLLRQWITHPLC+R+MIIARQEAVSEIAA+MVSSKV  N R  DEE SDV+VIEPELNYILSSVLTTLG APDIQRGITRIFHRTA
Subjt:  LLQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTA

Query:  APSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINND
        APSE                FIAVIQAIL+AGKQLQQFHIDEEDDN SSES+IGSKLLRKLILSASSSGLI+IAAKLLSTISKEAADQGD PNLMII +D
Subjt:  APSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINND

Query:  QFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDG
        QFPKVARARKEAQSAREKLD+LIT YRKQLGMRKLEF SVSGTTHLIELA+DVKVPS WVKINSTKKT+RYHPPEVLAALDELSLANEELMVASRDAWD 
Subjt:  QFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDG

Query:  FLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQ
        FL GFSRYYAEFQ+AVQALASIDCLYSLAILSR+KNY RPEFV DDEPAQI +CSGRHPVLE TLQGNFVPNDTNLD NGEHCQIVTGPNMGGKSCYIRQ
Subjt:  FLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQ

Query:  VALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFV
        VALIALM+QVGS+VPAFS KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHIL HS+S SLVIIDELGRGTSTHDGVAIAYAALH+LL  KKCLVLFV
Subjt:  VALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFV

Query:  THYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQ
        THYPKVA+I KEFPASAGVYHVSYLTSH++PS SGPKS +DV YLYKL+ G+AESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRK+ E HL 
Subjt:  THYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQ

Query:  EASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
        EAS KGLE +S + FLE RID YEEFFLFLKAT+ S  D+G    Q  QARSMAMDLLG
Subjt:  EASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG

A0A1S3AXV2 DNA mismatch repair protein MSH30.0e+0082.73Show/hide
Query:  QNEKARQSGGHRVCINASVCV---RTETKFWSANFMGKQKQQVISRFFAPKPKSPSVSSSSSSSSSA--LAPPTQPFSPPKISATVTFSPSKRLISSALA
        QNEKARQ  G  VC+N +  V   R+   F S NFMGKQKQQVISRFFAPKPK PS+SSSSSSS++A  +  PTQPFSP K+SATVTFSPSKRLISSA+A
Subjt:  QNEKARQSGGHRVCINASVCV---RTETKFWSANFMGKQKQQVISRFFAPKPKSPSVSSSSSSSSSA--LAPPTQPFSPPKISATVTFSPSKRLISSALA

Query:  SQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEKFLEPIENSFESSNQNPKAPTGGDA--KYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAE
        SQLTPPKSSKRPKLSPHTHNPLPS+ NPSLH+RFL+KFLEP ++SF+ SNQNP+   GGD   KYTPLE QVVDLKKR+PDVLLMVEVGYRYRFFG+DAE
Subjt:  SQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEKFLEPIENSFESSNQNPKAPTGGDA--KYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAE

Query:  IAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVE
        IAARVLGIYAHLD NFMTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEA Q+LGGAEEGC GESNYLFC+VE
Subjt:  IAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVE

Query:  KSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCD
         SMLVDNLD RIE GVDVKIG+VAMEISTGDVIYREYDDNFMRSGLEAMLLSL+PAELLLGDPISK TEK                              
Subjt:  KSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCD

Query:  QLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSS
         LLLGYAGPASNVRVE VSRDCFK+GSALAEVMSLYENID DNL  D ++P+TVL G+KSD  AIKEI+NMPNLALQA ALTIR+LKQFGLER+VSL SS
Subjt:  QLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSS

Query:  FRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMV
        FRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLL CMNHTLTIFGSRLLRQWITHPLC+R+MIIARQEAVSEIAA+MVSSKV  N    DEEDSDV+V
Subjt:  FRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMV

Query:  IEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSAS
        IEPELNYILSSVLTTLG APDIQRGITRIFHRTAAPSE                FIAVIQAIL+AGKQLQQFHIDEEDDN SSES+IGSKLLRKLILSAS
Subjt:  IEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSAS

Query:  SSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTK
        SSGLI+IAAKLLSTISKEAADQGD PNLMII +DQ+PKVARARKEAQSAREKLD+LIT YRKQLGMRKLEF SVSGTTHLIELA+DVKVPS WVKINSTK
Subjt:  SSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTK

Query:  KTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQ
        KT+RYHPPEVLAALDELSLANEELMVASRDAWD FL GFSRYYAEFQ+AVQALASIDCLYSLAILSR+KNY RPEFVHDDEPAQI ICSGRHPVLE TLQ
Subjt:  KTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQ

Query:  GNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVII
        GNFVPNDTNLD NGEHCQIVTGPNMGGKSCYIRQVALIALM+QVGS+VPAFS KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHIL HS+S SLVII
Subjt:  GNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVII

Query:  DELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQ
        DELGRGTSTHDGVAIAYAALHNLL  KKCLVLFVTHYPKVA+I KEFPA A  YHVSYLTSH+ PS SG KS +DV YLYKLVPG+AESSFGFKVAQLAQ
Subjt:  DELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQ

Query:  IPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
        IPLSCIARATEMG+WLEEIV RRAQ K+RE HL E S KGLE +S + FLE RID YEEFFLFLKAT+ S D MG    Q  QAR MAMDLLG
Subjt:  IPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG

A0A5A7SWU6 DNA mismatch repair protein0.0e+0081Show/hide
Query:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSA--LAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFL
        MGKQKQQVISRFFAPKPK PS+SSSSSSS++A  +  PTQPFSP K+SATVTFSPSKRLISSA+ASQLTPPKSSKRPKLSPHTHNPLPS+ NPSLH+RFL
Subjt:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSA--LAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFL

Query:  EKFLEPIENSFESSNQNPKAPTGGDA--KYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVS
        +KFLEP ++SF+ SNQNP+   G D   KYTPLE QVVDLKKRYPDVLLMVEVGYRYRFFG+DAEIAARVLGIYAHLD NFMTASIPTFRLNVHVRRLVS
Subjt:  EKFLEPIENSFESSNQNPKAPTGGDA--KYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVS

Query:  AGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYR
        AGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEA Q+LGGAEEGC GESNYLFC+VE SMLVDNLD RIE GVDVKIG+VAMEISTGDVIYR
Subjt:  AGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYR

Query:  EYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVAL---------------------NAVACCTDCVVSIV----------------
        EYDDNFMRSGLEAMLLSL+PAELLLGDPISK TEKV   V S+   V+L                       +A   DC+ +I+                
Subjt:  EYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVAL---------------------NAVACCTDCVVSIV----------------

Query:  ----------CDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQ
                   + LLLGYAGPASNVRVE  SRDCFK+GSALAEVMSLYENID DNL  D ++P+TVL G+KSD  AIKEI+NMPNLALQA ALTIR+LKQ
Subjt:  ----------CDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQ

Query:  FGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNI
        FGLER+VSL SSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLL CMNHTLTIFGSRLLRQWITHPLC+R+MIIARQEAVSEIAA+MVSSKV  N 
Subjt:  FGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNI

Query:  RESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIG
           DEEDSDV+VIEPELNYILSSVLTTLG APDIQRGITRIFHRTAAPSE                FIAVIQAIL+AGKQLQQFHIDEEDDN SSES+IG
Subjt:  RESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIG

Query:  SKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVK
        SKLLRKLILSASSSGLI+IAAKLLSTISKEAADQGD PNLMII +DQ+PKVARARKEAQSAREKLD+LIT YRKQLGMR LEF SVSGTTHLIELA+DVK
Subjt:  SKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVK

Query:  VPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVIC
        VPS WVKINSTKKT+RYHPPEVLAALDELSLANEELMVASRDAWD FL GFSRYYAEFQ+AVQALASIDCLYSLAILSR+KNY RPEFVHDDEPAQI IC
Subjt:  VPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVIC

Query:  SGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHI
        SGRHPVLE TLQGNFVPNDTNLD NGEHCQIVTGPNMGGKSCYIRQVALIALM+QVGS+VPAFS KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHI
Subjt:  SGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHI

Query:  LHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAE
        L HS+S SLVIIDELGRGTSTHDGVAIAYAALHNLL  KKCLVLFVTHYPKVA+I KEFPA A  YHVSYLTSH+ PS SG KS +DV YLYKLVPG+AE
Subjt:  LHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAE

Query:  SSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMA
        SSFGFKVAQLAQIPLSCIARATEMG+WLEEIV RRAQ K+RE HL E S KGLE +S + FLE RID YEEFF+FLKA + S D MG    Q  QARSMA
Subjt:  SSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMA

Query:  MDLLG
        MDLLG
Subjt:  MDLLG

A0A6J1E5H0 DNA mismatch repair protein0.0e+0094.8Show/hide
Query:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK
        MGKQKQQVISRFFAPKPKSPSV   SSSSSSALAPPTQ FSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK
Subjt:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK

Query:  FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK
        FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFG+DAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK
Subjt:  FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK

Query:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD
        VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEA+QNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD
Subjt:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD

Query:  NFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI
        NFMRSGLEAMLLSLSPAELLLGDPISKATEK                               LLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI
Subjt:  NFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI

Query:  DNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM
        D DNLAADHHDPDTVLA RKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM
Subjt:  DNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM

Query:  NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSED
        NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKV QNIRESDEEDSDVMVIEPELNYILSSVLTTLG APDIQRGITRIFHRTAAPSE 
Subjt:  NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSED

Query:  DGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV
                       FIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV
Subjt:  DGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV

Query:  ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF
        ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF
Subjt:  ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF

Query:  SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA
        SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVH DEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA
Subjt:  SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA

Query:  LMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK
        LMAQVGSYVPAFS KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTS SLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK
Subjt:  LMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK

Query:  VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK
        VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK
Subjt:  VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK

Query:  GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
        GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
Subjt:  GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG

A0A6J1J5R3 DNA mismatch repair protein0.0e+0093.85Show/hide
Query:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK
        MGKQKQQVISRFFAPKPKSPSV   SSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNP+LHKRFLEK
Subjt:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEK

Query:  FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK
        FL+PIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFG+DAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK
Subjt:  FLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK

Query:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD
        VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLD RIEYGVDVKIGMVAMEISTGDVIYREYDD
Subjt:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDD

Query:  NFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI
        NFMRSGLEAMLLSLSPAELLLGDPISKATEK                               LLLGYAGPASNVRVEHVSRDCFKDGSALAEV SLYENI
Subjt:  NFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENI

Query:  DNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM
        D DNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM
Subjt:  DNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCM

Query:  NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSED
        NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVS+KV QNIRES+EEDSDVMVIEPELNYILSSVLTTLG APDIQRGITRIFHRTAAPSE 
Subjt:  NHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSED

Query:  DGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV
                       FIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV
Subjt:  DGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKV

Query:  ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF
        ARARKEAQSAREKLDSLIT+YRKQLGMRKLEF SVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASR+AWDGFLSGF
Subjt:  ARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGF

Query:  SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA
        SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLEST+QGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA
Subjt:  SRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIA

Query:  LMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK
        LMAQVGSYVPAFS KLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTS SLVIIDELGRGTSTHDGVAIAYAAL NLLHHKKCLVLFVTHYPK
Subjt:  LMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPK

Query:  VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK
        VADITKEFPASAGVYHVSYLTSHE PSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPL CIARATEMGVWLEEIVTRRAQRKTRE+HLQEASGK
Subjt:  VADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGK

Query:  GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG
        GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGK FLQFSQARSMAMDLLG
Subjt:  GLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLG

SwissProt top hitse value%identityAlignment
A1DCB2 DNA mismatch repair protein msh31.8e-14534.68Show/hide
Query:  GDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIY--------------AHLDRNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQT
        G +K TPLE QV+++K+++ D +L++EVGY++RFFG+DA IAA+ L I               AHLDR F +ASIP  RL+VHV+RLVSAGYKVGVV+Q 
Subjt:  GDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIY--------------AHLDRNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQT

Query:  ETAAIKAHGSNKLGPFCRGLSALYTKAT-LEAVQNLGGAEEGCGG----ESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDDNF
        ETAA+KA G N+  PF R L+ LYTK T ++ V+ L GA     G     + Y+ C+ E +      D +      V +G+VA++ +TGD+IY +++D F
Subjt:  ETAAIKAHGSNKLGPFCRGLSALYTKAT-LEAVQNLGGAEEGCGG----ESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDDNF

Query:  MRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDN
        MRS +EA LL ++P EL++   +SKATEK+   +        LN                           VRV+ V+    K  +A+AE  S   N   
Subjt:  MRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDN

Query:  DNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCMNH
          L A +   D   +        +++++N+P      L+  I +L ++GLE I  L   F+ FS +  M L+ NTL  LE+ +N  D S  GSL   ++ 
Subjt:  DNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCMNH

Query:  TLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDG
        T T FG RLLR+W+  PL +++ +  R  AV E+                  +  D  V    L  +L  + +      D+++ + RI++          
Subjt:  TLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDG

Query:  SDISVSGLSFRFAFIAVIQAILYAGKQLQQFHID---EEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPK
              G   R   + V+Q +    + + Q ++D    ED  ++S  ++G  + R          ++    K L+ I+  AA   D        +++   
Subjt:  SDISVSGLSFRFAFIAVIQAILYAGKQLQQFHID---EEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPK

Query:  VARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIEL----ALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDG
        ++  +    S   +L+   +V    L   K+ +++ SG  +LIE+    A   +VP++WVK++ TKK  R+H PEV+  L +     E L  A   A+  
Subjt:  VARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIEL----ALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDG

Query:  FLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQ
         L+  +  Y  F+ +VQ+LA++DCL SLA ++    YV+PE+    +   I +  GRHP++E  L  ++VPND +LD +     +VTGPNMGGKS Y+RQ
Subjt:  FLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQ

Query:  VALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFV
        +ALIA+MAQ+GSYVPA S KL +LD ++TRMGA D++  G STF+ E++ET+ IL  +T  SLVI+DELGRGTSTHDGVAIA A L  ++   + L LF+
Subjt:  VALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFV

Query:  THYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRR
        THY  ++++ + FP      +      H   + SG    +++ +LY++  G+A  S+G  VA+LA +P   +  A +    LEE + RR
Subjt:  THYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRR

O65607 DNA mismatch repair protein MSH30.0e+0057.17Show/hide
Query:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKR-LISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLE
        MGKQKQQ ISRFFAPKPKSP+   +  + SS          PPKISATV+FSPSKR L+S  LA+      S K+PKLSPHT NP+P   +P+LH+RFL+
Subjt:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKR-LISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLE

Query:  KFLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGY
        +FLEP       S +     T    KYTPLE QVV+LK +YPDV+LMVEVGYRYRFFG+DAEIAARVLGIYAH+D NFMTAS+PTFRLN HVRRLV+AGY
Subjt:  KFLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGY

Query:  KVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLG---GAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYR
        K+GVVKQTETAAIK+HG+N+ GPF RGLSALYTKATLEA +++    G EEG G +SN+L CVV++ +  + L   IE   DV++G+V +EISTG+V+Y 
Subjt:  KVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLG---GAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYR

Query:  EYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSL
        E++DNFMRSGLEA++LSLSPAELLLG P+S+ TEK                                L+ +AGP SNVRVE  S DCF +G+A+ EV+SL
Subjt:  EYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSL

Query:  YENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSL
         E I   NL  D  +     A +    + +  IMNMP+L +QALALT  +LKQFG ERI+  G+SFR  S   EMTLS NTL QLEV+KNN DGSE+GSL
Subjt:  YENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSL

Query:  LQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAA
           MNHTLT++GSRLLR W+THPLC+R++I AR +AVSEI+A M S    Q   E  EE S+  ++ PE   +LSSVLT +  + DIQRGITRIFHRTA 
Subjt:  LQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAA

Query:  PSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSES-MIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINND
         +E                FIAV++AIL AGKQ+Q+  I ++ +  S +S  + S LLRKLI   SS  ++  A KLLS ++KEAA +GDL +++I ++D
Subjt:  PSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSES-MIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINND

Query:  QFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDG
        QFP++A AR+     REKLDS I  +RK+L +R LEF+ VSG THLIEL +D KVP NWVK+NSTKKTIRYHPPE++A LDEL+LA E L + +R +WD 
Subjt:  QFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDG

Query:  FLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQ
        FL  FSRYY +F++AVQALA++DCL+SL+ LSR+KNYVRPEFV D EP +I I SGRHPVLE+ LQ NFVPNDT L   GE+CQI+TGPNMGGKSCYIRQ
Subjt:  FLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQ

Query:  VALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFV
        VALI++MAQVGS+VPA   KLHVLDG++TRMGASDSIQ GRSTFLEE++E SHI+   +S SLVI+DELGRGTSTHDGVAIAYA L +LL  K+CLVLFV
Subjt:  VALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFV

Query:  THYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQ
        THYP++A+I+  FP S G YHVSYLT  +     G     DV YLYKLV G+   SFGFKVAQLAQIP SCI RA  M   LE      A+ + RE + +
Subjt:  THYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQ

Query:  EASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRF
            +G E        E+ I A  + F  LK  + S +D  K F
Subjt:  EASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRF

P13705 DNA mismatch repair protein Msh39.3e-16336.67Show/hide
Query:  YTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKAHGSNKLGPFCR
        YTPLELQ +D+K+++ D +L VE GY+YRFFG+DAEIAAR L IY HLD NFMTASIPT RL VHVRRLV+ GYKVGVVKQTETAA+KA G NK   F R
Subjt:  YTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKAHGSNKLGPFCR

Query:  GLSALYTKATL--EAVQNLGGAEEGCG-------GESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDDNFMRSGLEAMLLSLSP
         L+ALYTK+TL  E V  L   ++            +NYL C+ E+   + +  +      ++ +G+V ++ +TG+V++  + D+  R  LE  + SL P
Subjt:  GLSALYTKATL--EAVQNLGGAEEGCG-------GESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDDNFMRSGLEAMLLSLSP

Query:  AELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDNDNLAADHHDPDTVL
         ELLL   +S+ TE +  R  ++S                                 +RVE ++   F+   A   V   Y     D+  +         
Subjt:  AELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDNDNLAADHHDPDTVL

Query:  AGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKME-MTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTLTIFGSRLLRQW
                ++  ++N+    + ALA  IRYLK+F LE+++S   SF+  S  ME M ++G TL  LE+++N  D    GSLL  ++HT T FG R L+ W
Subjt:  AGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKME-MTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTLTIFGSRLLRQW

Query:  ITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRFA
        +T PL +   I AR +AVS++  +   S V++ I                           L   PD++RG+  I+H+  +  E                
Subjt:  ITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRFA

Query:  FIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKLD
        F  +++++     +LQ           +  S + S LLR LI+      L+S     L  ++  AA  GD   L    +D FP + + + E Q     + 
Subjt:  FIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKLD

Query:  SLITVYRKQLGMRKLEFISVSGTTHLIELALDVK--VPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQA
          +  +RK L +  L++++VSG   +IE+       +P++WVK+ STK   R+HPP ++ +   L+   E+L++     W GFL  F  +Y     AV  
Subjt:  SLITVYRKQLGMRKLEFISVSGTTHLIELALDVK--VPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQA

Query:  LASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTL--QGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPA
        LA++DC++SLA +++  NY RP      E  +I+I +GRHP+++  L  Q  FVPN T+L  + E   I+TGPNMGGKS YI+QV L+ +MAQ+GSYVPA
Subjt:  LASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTL--QGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPA

Query:  FSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPAS
            + ++DGI+TRMGA+D+I +GRSTF+E++T+T+ I+  ++  SLVI+DELGRGTSTHDG+AIAYA L   +   K L LFVTHYP V ++ K +P  
Subjt:  FSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPAS

Query:  AGVYHVSYLTSH-ETPSFSG--PKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQR
         G YH+ +L +  E+   SG   +    V +LY++  GIA  S+G  VA+LA +P   + +A      LE +V+ R +R
Subjt:  AGVYHVSYLTSH-ETPSFSG--PKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQR

P20585 DNA mismatch repair protein Msh34.6e-16237.76Show/hide
Query:  YTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKAHGSNKLGPFCR
        YTPLELQ +++K+++ D +L VE GY+YRFFG+DAEIAAR L IY HLD NFMTASIPT RL VHVRRLV+ GYKVGVVKQTETAA+KA G N+   F R
Subjt:  YTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKAHGSNKLGPFCR

Query:  GLSALYTKATL--EAVQNLGGAEEGCGGE-------SNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDDNFMRSGLEAMLLSLSP
         L+ALYTK+TL  E V  L   ++    +       ++YL C+ E    V     R +   ++ IG+V ++ +TG+V++  + D+  RS LE  + SL P
Subjt:  GLSALYTKATL--EAVQNLGGAEEGCGGE-------SNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYREYDDNFMRSGLEAMLLSLSP

Query:  AELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDNDNLAADHHDPDTVL
         ELLL   +S+ TE +  R  S                 VS+  D+           +RVE +    F+   A   V   Y               DTV 
Subjt:  AELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSLYENIDNDNLAADHHDPDTVL

Query:  AGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKME-MTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTLTIFGSRLLRQW
           K  +I I  I+N+    + +LA  I+YLK+F LE+++S   +F+  S KME MT++G TL  LE+L+N  D    GSLL  ++HT T FG R L++W
Subjt:  AGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKME-MTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTLTIFGSRLLRQW

Query:  ITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRFA
        +T PL +   I AR +AVSE+  +  S                          +   +   L   PDI+RG+  I+H+  +  E                
Subjt:  ITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRFA

Query:  FIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKLD
        F  +++ + +   + Q           +  S I S LLR +IL      L+S     L  ++++AA  GD   L    +D FP + + + E Q   +++ 
Subjt:  FIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKLD

Query:  SLITVYRKQLGMRKLEFISVSGTTHLIELALDVK--VPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQA
          +   RK L     ++++VSG   +IE+       +P++WVK+ STK   R+H P ++     L+   E+L++     W  FL  FS +Y     AV  
Subjt:  SLITVYRKQLGMRKLEFISVSGTTHLIELALDVK--VPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQA

Query:  LASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTL--QGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPA
        LA++DC++SLA +++  +Y RP      E  +IVI +GRHPV++  L  Q  +VPN+T+L  + E   I+TGPNMGGKS YI+QVALI +MAQ+GSYVPA
Subjt:  LASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTL--QGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPA

Query:  FSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPAS
            + ++DGI+TRMGA+D+I +G+STF+EE+T+T+ I+  +TS SLVI+DELGRGTSTHDG+AIAYA L   +   K L LFVTHYP V ++ K +   
Subjt:  FSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPAS

Query:  AGVYHVSYLTSHETPSFSGPKSAQD----VIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQR
         G YH+ +L S E  S   P +A+     V +LY++  GIA  S+G  VA+LA +P   + +A      LE ++  + +R
Subjt:  AGVYHVSYLTSHETPSFSGPKSAQD----VIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQR

Q1ZXH0 DNA mismatch repair protein Msh38.4e-14834.44Show/hide
Query:  ESSNQNPKAPTGGDAK---------YTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK
        +  ++  K PT G +K         YTPLE Q + +KK  PD +LMVE GY+Y+FFG+DAE+A +VL IY+++ +NF+  SIPT RL  H+RRLV AGYK
Subjt:  ESSNQNPKAPTGGDAK---------YTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGYK

Query:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATL-----------EAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEIS
        VG+V+QTETAA+KA  S+K  PF R L+ +YT +T             + Q L    E      N      ++    + +D   E      I  VA+ + 
Subjt:  VGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATL-----------EAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEIS

Query:  TGDVIYREYDDNFMRSGLEAMLLSLSPAELLLGDPISKAT-EKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGS
        TG++IY  + DN MRS LE +L  + P+E+L+    +    +KV + +G+  +  + N  + C                   ++NVR + +    +    
Subjt:  TGDVIYREYDDNFMRSGLEAMLLSLSPAELLLGDPISKAT-EKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGS

Query:  ALAEVMSLYENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQF-GLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNN
        +L +++  YE+  N+N   ++++ + VL         +K  +N   +    L + + YL +F     I+ + S+F+ F     + L  +T+  LE+L N 
Subjt:  ALAEVMSLYENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQF-GLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNN

Query:  DDGSETGSLLQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGI
         D  E GSL+  MN T T  GSR+   WI  PL + ++I  RQ+AV E+                    + +    P +  I+S   + +   PD+QR +
Subjt:  DDGSETGSLLQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGI

Query:  TRIFHRT-AAPSE------------DDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLS
        +RI+++    P E            +   +I+ +  S++F        +L +  +LQ  + D + D++  + + G   L K I               LS
Subjt:  TRIFHRT-AAPSE------------DDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLS

Query:  TISKEAADQ-----GDLPNLMIINNDQFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVS--GTTHLIELALDVK-VPSNWVKINSTKKTIRY
         I+KE A +      D  NL  ++ +++ K+   +++ +   ++  +++   RK+L    LE+  +   G  +L+EL    K VP +W+K+NST+K  RY
Subjt:  TISKEAADQ-----GDLPNLMIINNDQFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVS--GTTHLIELALDVK-VPSNWVKINSTKKTIRY

Query:  HPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTL---QGN
        H PEVL  L  LS + E L + S+++W  FL  FS  Y+ F + V  ++++DCL+SLA +S  + Y+RP+FV + +   I I +GRHPV+E+ L    G+
Subjt:  HPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTL---QGN

Query:  FVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDE
        +VPN   L  +     I+TGPNMGGKS  +RQ ALI +MAQVG +VPA S  L V D IYTRMGA DSI  G+STF  E+ ETS IL +ST ++LVI+DE
Subjt:  FVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDE

Query:  LGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIP
        LGRGTST+DGVAIAY+ L  ++   KC  LFVTHYP +A +  ++P   G +H+ YL   +            VI+LYKLV G A++S+G  +A+LA +P
Subjt:  LGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIP

Query:  LSCIARATEMGVWLEEIVTRRA
        +  IA A +    ++E +TRRA
Subjt:  LSCIARATEMGVWLEEIVTRRA

Arabidopsis top hitse value%identityAlignment
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)6.4e-13556.79Show/hide
Query:  SVNCTAF---SWSGKC-RTRSSVIVNEEGDRDRQNFDTTCPEYFRWIHEDLRPWAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLW
        SV+C++F   + SG C RT  S     + + +R     +CP+YF+WIHEDL+PW  TGIT+EMVE G+  A FRLVI++G+ +VE +  + Q+RD FTLW
Subjt:  SVNCTAF---SWSGKC-RTRSSVIVNEEGDRDRQNFDTTCPEYFRWIHEDLRPWAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLW

Query:  GILQLLRLYPGKIPDLDLMFNCEDRPNIFIGDYSGPGPN-STAPPPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYA
        GILQLLR YPGK+PD+DLMF+C+DRP I    Y+        APPPLFRYCGD  T+DIVFPDWSFWGW EINI+ W  +++++++G K+ K+  R+AYA
Subjt:  GILQLLRLYPGKIPDLDLMFNCEDRPNIFIGDYSGPGPN-STAPPPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYA

Query:  YWKGN-IKVSMVRYKLLECNLSREHDWKARVFMQDWDKEQKERFKNSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHH
        YWKGN    S  R  LL CNLS  HDW AR+F+QDW  E +  F+NSN+A+QC +RYKIY+EG  WSVS KYILACDSVTLMV PYYYDFF+R+L P+ H
Subjt:  YWKGN-IKVSMVRYKLLECNLSREHDWKARVFMQDWDKEQKERFKNSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHH

Query:  YWPIKDDDDMCNSIKFAVYWGNTHQQKVRAIGKAASKFTEEELRMEKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPAQ-----DLATKFMID
        YWPI+ D D C SIKFAV W N H QK + IG+ AS+F + +L ME VYDYMFH LNEYSKLL +KP +P N+ ELC E L CP++      +  KFMI 
Subjt:  YWPIKDDDDMCNSIKFAVYWGNTHQQKVRAIGKAASKFTEEELRMEKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPAQ-----DLATKFMID

Query:  TLVKQ
        +LV +
Subjt:  TLVKQ

AT2G45830.1 downstream target of AGL15 21.9e-13156.06Show/hide
Query:  NEEGDRDRQNFDTTCPEYFRWIHEDLRPWAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGILQLLRLYPGKIPDLDLMFNCEDR
        N +  R   +  +TCP YFRWIHEDLRPW  TG+TR M+E  R  A FR+VI+DGR YV+K+  + Q+RD FTLWGI+QLLR YPG++PDL+LMF+ +DR
Subjt:  NEEGDRDRQNFDTTCPEYFRWIHEDLRPWAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGILQLLRLYPGKIPDLDLMFNCEDR

Query:  PNIFIGDYSGPGPNSTAPPPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYAYWKGNIKVSMVRYKLLECNLSREHDW
        P +   D+   G    APPPLFRYC DD +LDIVFPDWSFWGW E+NIKPW   +  +++GNK T+W +R AYAYW+GN  V+  R  LL CN+S + DW
Subjt:  PNIFIGDYSGPGPNSTAPPPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYAYWKGNIKVSMVRYKLLECNLSREHDW

Query:  KARVFMQDWDKEQKERFKNSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHHYWPIKDDDDMCNSIKFAVYWGNTHQQK
          R+++QDWD+E +E FKNSNL +QC HRYKIY+EG  WSVS KYI+ACDS+TL V P +YDF+ R ++P+ HYWPI+ D   C S+KFAV+WGNTH  +
Subjt:  KARVFMQDWDKEQKERFKNSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHHYWPIKDDDDMCNSIKFAVYWGNTHQQK

Query:  VRAIGKAASKFTEEELRMEKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPAQDLATKFMIDTLV
           IG+  S+F  EE++ME VYDYMFH +NEY+KLL FKP IP  ATE+  + + C A      FM +++V
Subjt:  VRAIGKAASKFTEEELRMEKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPAQDLATKFMIDTLV

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821)4.0e-14554.84Show/hide
Query:  FVIFSVSVSLSIAACLIISSRLL---------RHVPTTNGRDAELHIYPHRSEV--QLP---SVNCTAFSW--SGKC-------RTRSSVIVNEEGDRDR
        +  FS+ + L + A   +S+RLL           V  T     +   YP  +++  + P   ++NC AFS   +G C         RSS     EG+ DR
Subjt:  FVIFSVSVSLSIAACLIISSRLL---------RHVPTTNGRDAELHIYPHRSEV--QLP---SVNCTAFSW--SGKC-------RTRSSVIVNEEGDRDR

Query:  QNFDTTCPEYFRWIHEDLRPWAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGILQLLRLYPGKIPDLDLMFNCEDRPNIFIGDY
             TCP+YFRWIHEDLRPW  TGITRE +E     A FRL II+GR YVEKF +A+Q+RD FT+WG +QLLR YPGKIPDL+LMF+C D P +   ++
Subjt:  QNFDTTCPEYFRWIHEDLRPWAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGILQLLRLYPGKIPDLDLMFNCEDRPNIFIGDY

Query:  SGPGPNSTAPPPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYAYWKGNIKVSMVRYKLLECNLSREHDWKARVFMQD
        +  G +   PPPLFRYC +D+TLDIVFPDWS+WGW E+NIKPW  L+++L++GN+RTKW +RE YAYWKGN  V+  R  L++CNLS  +DWKAR++ QD
Subjt:  SGPGPNSTAPPPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYAYWKGNIKVSMVRYKLLECNLSREHDWKARVFMQD

Query:  WDKEQKERFKNSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHHYWPIKDDDDMCNSIKFAVYWGNTHQQKVRAIGKAA
        W KE KE +K S+LA QC HRYKIY+EG  WSVS KYILACDSVTLMV P+YYDFFTR + P HHYWP+K +DD C SIKFAV WGN H +K + IGK A
Subjt:  WDKEQKERFKNSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHHYWPIKDDDDMCNSIKFAVYWGNTHQQKVRAIGKAA

Query:  SKFTEEELRMEKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPAQDLATKFMIDTLVKQ
        S+F ++EL+M+ VYDYMFH L +YSKLL FKP IP N+TELC E +ACP      KFM+++LVK+
Subjt:  SKFTEEELRMEKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPAQDLATKFMIDTLVKQ

AT4G25540.1 homolog of DNA mismatch repair protein MSH30.0e+0057.17Show/hide
Query:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKR-LISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLE
        MGKQKQQ ISRFFAPKPKSP+   +  + SS          PPKISATV+FSPSKR L+S  LA+      S K+PKLSPHT NP+P   +P+LH+RFL+
Subjt:  MGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKR-LISSALASQLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLE

Query:  KFLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGY
        +FLEP       S +     T    KYTPLE QVV+LK +YPDV+LMVEVGYRYRFFG+DAEIAARVLGIYAH+D NFMTAS+PTFRLN HVRRLV+AGY
Subjt:  KFLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLDRNFMTASIPTFRLNVHVRRLVSAGY

Query:  KVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLG---GAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYR
        K+GVVKQTETAAIK+HG+N+ GPF RGLSALYTKATLEA +++    G EEG G +SN+L CVV++ +  + L   IE   DV++G+V +EISTG+V+Y 
Subjt:  KVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLG---GAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMVAMEISTGDVIYR

Query:  EYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSL
        E++DNFMRSGLEA++LSLSPAELLLG P+S+ TEK                                L+ +AGP SNVRVE  S DCF +G+A+ EV+SL
Subjt:  EYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVMSL

Query:  YENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSL
         E I   NL  D  +     A +    + +  IMNMP+L +QALALT  +LKQFG ERI+  G+SFR  S   EMTLS NTL QLEV+KNN DGSE+GSL
Subjt:  YENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSL

Query:  LQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAA
           MNHTLT++GSRLLR W+THPLC+R++I AR +AVSEI+A M S    Q   E  EE S+  ++ PE   +LSSVLT +  + DIQRGITRIFHRTA 
Subjt:  LQCMNHTLTIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAA

Query:  PSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSES-MIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINND
         +E                FIAV++AIL AGKQ+Q+  I ++ +  S +S  + S LLRKLI   SS  ++  A KLLS ++KEAA +GDL +++I ++D
Subjt:  PSEDDGSDISVSGLSFRFAFIAVIQAILYAGKQLQQFHIDEEDDNYSSES-MIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINND

Query:  QFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDG
        QFP++A AR+     REKLDS I  +RK+L +R LEF+ VSG THLIEL +D KVP NWVK+NSTKKTIRYHPPE++A LDEL+LA E L + +R +WD 
Subjt:  QFPKVARARKEAQSAREKLDSLITVYRKQLGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDG

Query:  FLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQ
        FL  FSRYY +F++AVQALA++DCL+SL+ LSR+KNYVRPEFV D EP +I I SGRHPVLE+ LQ NFVPNDT L   GE+CQI+TGPNMGGKSCYIRQ
Subjt:  FLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVRPEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQ

Query:  VALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFV
        VALI++MAQVGS+VPA   KLHVLDG++TRMGASDSIQ GRSTFLEE++E SHI+   +S SLVI+DELGRGTSTHDGVAIAYA L +LL  K+CLVLFV
Subjt:  VALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMTETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFV

Query:  THYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQ
        THYP++A+I+  FP S G YHVSYLT  +     G     DV YLYKLV G+   SFGFKVAQLAQIP SCI RA  M   LE      A+ + RE + +
Subjt:  THYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGFKVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQ

Query:  EASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRF
            +G E        E+ I A  + F  LK  + S +D  K F
Subjt:  EASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRF

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)8.1e-14659.24Show/hide
Query:  SVNCTAFSWSGKCRTRSSVIVNEEGDRDRQNFDT-TCPEYFRWIHEDLRPWAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGIL
        +++C+A   +  C +          D D  +  T TCP+YFRWIHEDLRPW+ TGITRE +E  +  A FRL I+ G+ YVEKF DA+Q+RD FT+WG L
Subjt:  SVNCTAFSWSGKCRTRSSVIVNEEGDRDRQNFDT-TCPEYFRWIHEDLRPWAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGIL

Query:  QLLRLYPGKIPDLDLMFNCEDRPNIFIGDYSGPGPNSTAPPPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYAYWKG
        QLLR YPGKIPDL+LMF+C D P +   +++  G N+ +PPPLFRYCG+++TLDIVFPDWSFWGW E+NIKPW  L+++L++GN+RTKW NRE YAYWKG
Subjt:  QLLRLYPGKIPDLDLMFNCEDRPNIFIGDYSGPGPNSTAPPPLFRYCGDDDTLDIVFPDWSFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYAYWKG

Query:  NIKVSMVRYKLLECNLSREHDWKARVFMQDWDKEQKERFKNSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHHYWPIK
        N  V+  R  L++CN+S EH+W AR++ QDW KE KE +K S+LA QC HRYKIY+EG  WSVS KYILACDSVTL+V P+YYDFFTR L+P HHYWP++
Subjt:  NIKVSMVRYKLLECNLSREHDWKARVFMQDWDKEQKERFKNSNLADQCVHRYKIYVEGVGWSVSLKYILACDSVTLMVNPYYYDFFTRSLVPMHHYWPIK

Query:  DDDDMCNSIKFAVYWGNTHQQKVRAIGKAASKFTEEELRMEKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPAQDLATKFMIDTLVKQ
         + D C SIKFAV WGN+H QK + IGKAAS F +++L+M+ VYDYM+H L EYSKLL FKP IP NA E+C E +AC       KFM ++LVKQ
Subjt:  DDDDMCNSIKFAVYWGNTHQQKVRAIGKAASKFTEEELRMEKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPAQDLATKFMIDTLVKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGGCACCGTTAGAAAGCCTGTAGCTCAACTCCGTTTCGTCATTTTTTCCGTTTCCGTTTCGCTCTCTATCGCTGCTTGTCTCATAATCTCTTCACGTCTTCTTCG
TCATGTCCCGACGACCAATGGGAGGGATGCGGAATTACATATTTACCCTCACCGGAGTGAAGTTCAATTGCCATCTGTAAATTGTACGGCATTTTCATGGAGCGGGAAGT
GCCGTACGAGGAGTTCTGTAATTGTGAATGAAGAAGGAGATCGAGACCGTCAAAACTTCGACACGACGTGTCCGGAATACTTCCGTTGGATTCACGAGGATCTAAGGCCG
TGGGCGGGAACGGGGATCACGAGGGAGATGGTGGAGAGCGGGCGGCCAAAGGCGGGTTTCCGGCTGGTGATAATCGACGGTAGGGCTTACGTGGAGAAATTTATGGATGC
GTATCAAAGTAGGGATAAGTTTACGCTGTGGGGGATCCTACAGTTGTTACGGTTGTACCCTGGTAAAATTCCCGATTTAGACCTCATGTTCAATTGCGAGGACCGGCCCA
ACATTTTCATTGGTGACTATAGTGGGCCCGGGCCCAATTCGACGGCCCCACCTCCTTTGTTCCGATACTGTGGAGATGATGACACGTTGGACATTGTCTTTCCTGATTGG
TCCTTCTGGGGATGGCCTGAGATAAATATAAAGCCATGGGTACCCCTGATGGAAGATCTAAAACAAGGCAACAAAAGGACAAAATGGTCAAATAGAGAAGCTTATGCTTA
TTGGAAGGGGAATATTAAAGTTTCTATGGTTCGATATAAACTTCTAGAATGCAATCTCTCTCGTGAGCACGATTGGAAGGCTCGTGTGTTCATGCAGGATTGGGATAAAG
AACAGAAAGAACGATTCAAAAACTCCAATCTAGCTGATCAATGTGTTCATAGGTATAAAATATACGTGGAGGGGGTTGGTTGGTCGGTGAGTCTTAAATACATTCTTGCT
TGTGATTCGGTGACATTGATGGTAAATCCCTATTACTATGATTTCTTCACTAGAAGTTTAGTGCCAATGCATCACTATTGGCCAATCAAAGATGACGACGACATGTGCAA
CTCTATCAAGTTTGCCGTCTACTGGGGCAACACCCATCAACAAAAGGTACGGGCAATTGGGAAGGCAGCGAGCAAGTTTACCGAGGAGGAGCTAAGGATGGAGAAGGTGT
ATGACTACATGTTTCACAGTCTAAATGAATACTCCAAACTCTTAACCTTCAAGCCAACCATCCCACCAAATGCTACTGAACTCTGCTTGGAGGAATTGGCTTGCCCTGCC
CAAGACTTAGCCACCAAGTTTATGATCGATACTCTCGTAAAACAAAACGAAAAAGCGCGCCAAAGCGGTGGCCACCGTGTCTGTATAAATGCGTCTGTGTGCGTGAGAAC
TGAGACGAAATTTTGGAGTGCAAATTTCATGGGGAAACAGAAGCAACAAGTGATTTCTCGATTCTTTGCTCCGAAACCCAAGTCCCCATCTGTTTCTTCTTCTTCCTCCT
CCTCCTCCTCCGCCTTAGCTCCGCCGACCCAACCGTTTTCACCGCCTAAGATATCAGCAACAGTCACTTTCTCCCCTTCCAAGCGTCTCATTTCTTCTGCACTCGCTTCT
CAGTTAACGCCACCCAAATCCTCTAAACGCCCGAAACTCTCTCCTCATACCCATAACCCTCTGCCTTCCGTCTCCAATCCCTCGCTTCACAAAAGATTCCTTGAGAAGTT
CCTTGAACCCATTGAAAACTCCTTTGAATCTTCTAATCAAAACCCCAAAGCCCCAACGGGCGGTGATGCCAAATACACCCCTTTGGAGCTACAAGTTGTGGACCTTAAGA
AGAGGTACCCTGATGTTCTTCTCATGGTGGAGGTTGGTTACAGGTACAGATTTTTTGGCAAAGATGCTGAAATAGCAGCTCGTGTTTTGGGTATTTATGCCCATTTGGAT
CGTAATTTTATGACTGCGAGTATACCCACGTTTAGATTGAATGTGCATGTGAGGAGGCTAGTTAGTGCAGGATATAAGGTGGGTGTTGTCAAGCAAACTGAAACCGCTGC
GATTAAGGCTCATGGGTCGAATAAGTTAGGCCCCTTTTGCAGGGGACTATCGGCTTTGTATACTAAAGCAACATTGGAGGCAGTCCAGAATTTAGGGGGAGCTGAGGAGG
GATGTGGGGGAGAGAGTAATTACTTGTTCTGTGTAGTTGAAAAGAGCATGTTGGTGGATAATTTGGACCGTAGAATCGAATATGGGGTTGATGTGAAAATTGGGATGGTT
GCCATGGAGATATCAACTGGGGATGTTATTTATAGGGAATATGATGATAACTTCATGAGAAGTGGGCTTGAGGCAATGCTCTTGAGCTTGTCTCCTGCAGAGCTTCTTCT
TGGGGATCCCATATCAAAGGCAACAGAGAAGGTGTTCGATCGAGTAGGAAGCCTTTCGTTTACTGTAGCATTGAATGCTGTTGCTTGCTGTACCGATTGTGTAGTTTCGA
TTGTTTGTGACCAGTTATTACTAGGTTATGCTGGACCTGCTTCAAATGTCCGTGTGGAGCATGTTTCACGAGATTGCTTTAAAGATGGCAGTGCACTTGCTGAAGTGATG
TCTTTGTATGAAAACATTGACAACGACAACTTAGCAGCTGATCATCACGACCCTGATACAGTGTTGGCTGGACGAAAAAGTGATCGCATAGCAATTAAGGAAATAATGAA
CATGCCAAATTTAGCTCTTCAAGCATTGGCCTTGACCATTCGTTACTTAAAGCAATTTGGTTTAGAAAGAATTGTGTCCTTGGGATCTTCTTTTAGGCCATTCTCATGCA
AAATGGAGATGACCCTCTCAGGCAATACACTCACACAACTTGAGGTTTTGAAGAATAATGATGACGGTTCTGAAACTGGCTCTCTATTACAGTGCATGAATCACACTCTT
ACAATATTCGGTTCGAGGCTTCTTCGCCAATGGATAACACATCCTTTATGTGAAAGAGACATGATAATTGCTCGTCAAGAGGCCGTTTCCGAGATTGCTGCAGCTATGGT
ATCTTCAAAAGTATATCAAAATATCAGAGAGTCAGATGAAGAAGATTCTGATGTAATGGTCATTGAACCAGAATTGAATTATATACTTTCTTCAGTGTTAACAACCTTGG
GAGGGGCACCAGATATTCAGCGTGGAATAACACGAATCTTTCACCGAACTGCAGCCCCGTCAGAGGATGATGGTTCCGATATTTCGGTCTCTGGCCTGAGTTTTCGATTT
GCCTTCATTGCAGTTATTCAAGCTATTTTATATGCGGGAAAACAGCTTCAGCAGTTTCACATTGATGAAGAGGATGACAATTATTCCAGTGAAAGTATGATTGGCTCCAA
GCTCCTAAGAAAGCTGATTTTATCTGCTTCCTCCTCTGGTTTGATTAGCATTGCTGCAAAACTTTTGTCAACGATCAGCAAAGAAGCTGCAGATCAAGGGGACCTTCCTA
ACCTGATGATCATCAATAATGACCAATTTCCGAAGGTTGCTAGAGCTCGGAAGGAGGCTCAATCTGCAAGAGAGAAATTGGATTCTCTGATCACCGTATACCGTAAGCAG
CTTGGAATGCGGAAGTTGGAGTTCATTAGTGTGTCTGGAACTACGCATTTGATTGAGTTGGCCTTAGATGTAAAGGTACCCTCTAATTGGGTGAAGATCAATAGTACAAA
GAAAACTATAAGATATCACCCACCGGAAGTACTGGCTGCCTTAGACGAGCTATCATTGGCAAATGAGGAGCTCATGGTAGCATCTCGGGATGCTTGGGATGGCTTTCTAA
GTGGATTCAGCAGATATTATGCAGAGTTTCAATCTGCTGTTCAAGCACTGGCTTCCATTGACTGTCTGTATTCGTTAGCTATTCTCTCAAGACATAAGAACTATGTTCGT
CCAGAGTTTGTACATGATGATGAACCTGCTCAGATAGTTATATGTTCTGGACGCCATCCGGTTTTGGAGAGTACATTACAAGGCAATTTCGTCCCAAATGACACGAATTT
GGACGTGAATGGAGAGCATTGTCAAATTGTAACAGGACCAAACATGGGAGGGAAAAGTTGCTACATTCGACAAGTTGCTCTCATTGCTCTAATGGCTCAGGTCGGTTCCT
ATGTACCAGCATTCTCCGTGAAGCTTCACGTCCTGGACGGTATATACACTCGAATGGGTGCTTCTGACAGTATCCAACAAGGTAGAAGCACCTTCTTAGAAGAAATGACT
GAGACTTCACATATACTCCATCATTCCACATCACATTCCTTGGTCATAATTGACGAGCTTGGGAGAGGAACAAGCACCCACGATGGCGTGGCCATTGCATATGCAGCTTT
GCATAATCTTCTCCACCACAAGAAATGTTTGGTCCTCTTCGTTACCCATTATCCTAAAGTTGCTGATATTACAAAGGAATTCCCAGCATCTGCAGGGGTATACCATGTCT
CATATCTGACTTCACACGAAACGCCAAGCTTCTCAGGCCCAAAATCCGCTCAGGATGTGATTTACCTATATAAGCTCGTTCCTGGAATTGCGGAGAGCAGTTTTGGTTTC
AAGGTTGCCCAACTTGCACAGATACCTTTATCATGTATTGCACGAGCCACAGAAATGGGGGTATGGTTAGAAGAAATAGTAACCAGAAGAGCACAACGCAAAACCAGAGA
AATACATTTACAAGAAGCATCAGGCAAGGGATTGGAATTGGAAAGCTCCGAGTGCTTCCTTGAAGATAGAATTGATGCTTACGAGGAATTCTTTCTGTTCTTGAAAGCTA
CAATATGTTCTCCTGATGATATGGGGAAACGTTTCCTTCAATTTTCCCAAGCTAGAAGCATGGCCATGGACTTGTTAGGAAGTAGACGTTCCCGGAACCTCTTGCGGTTT
CAATCCTCTGCTTCACTCGCAAAAAGTTATCGTAGTCGTCCACTTTGCACGGTGGAACTTGGCGAGCCGGAAGAGAGAAAAGAGAAAGGGTTATTGAGATTGGCAACTGG
GTTTCACCGGAAAGTCCGAAATTCAGAAAACTGGCTACCACCCGGCTGGACTGTGAAGGTCAAAGTGAGGAAGAGTGGCAAGAAGGATAAGTATTATTTTGAACCCTCAA
GTCAAATGAGATTTAATTCTAGGGCAGAGGTGTTTAGATACCTCAAAACTGCTGCAATCAATCATCCTGAATCTGAAGGGAGCAGAACCGTGGAGCAACCAGAGAACAAT
GTGGAAGTTAAGAAGACTATAGCAAAAGGGTTACCTCCTGGATGGATCAGAGAAATCAGAGTAACGAGGACTGCTAACAGAATAAGAAGAGATTCATTTTACATTAATCC
TGTAAATGGAAATGCGCTTCGCTCGTTGAGGGATGTACATCGGTATCTAACAAGTGGAAAGGTGAGCCGATTAGCTTATAAATCAAGGAGCCAGAGCAACAGCAACAGCA
ACATTGAGTTTCAGCACGATGATATCTCTTCACCTTCTGTCTCCGAGAAAGAGATGCTAGCGATTGGCAAAGCAAAGAGACAGATTATATGGAGCGACAACTCACCGGAA
CCGAGTGAAATAGCAGATGATGAAGCTATTTTTCCAAAGGCTGCAAGTAACACATGTACAGTGCTTAGAAGGGTTTTCATTTTTGAAGATTCTGTATCTGATGTGAATTT
AGAAACTACTGCAGGGAGGATAGGCGCTAAGTTGCACTGCTCAACTCCCGAAGAGGCAAAAGAACGTGAGCAATCAGAAGGAAAGAATAATTTTTCCGACTTTGTGCCAA
CTCCTGATAAATCCTTCCAACACAAGTGCACTAGTGAGAATGGGGCAACAAAGATTGAGAGTAGAAAGAGACAAAGGAAAAGTAAAGACATAAACTTGCGATGCCGTGCT
TCAAAGCGACTTGCAGGGCTCCAAGCTGAGCCAGTGCTTGAAGCTAAAACTGGTCACCAAGCGTGTTCGATTGCATGTGAAGAGTCTGATAAGCAAGTAGCCAGTACAAC
TAAGTGGCCATCTCAAAGTCCTGAAAATCCTGATGTCAAGCATGAAACAAAGAGCATAATTGACCCTCCTAAGATTGACACAAATCTAGATTCAAGTAGAAAAGAACATT
CTGCTGTTGACTTGTCCATTCTGCCTGAGAAGGTTAAGAAGATGAAATCTGAAGATGCATATGAGCTGCAGCAGCCGAAATGCAATTCCTTTCTGCCTTCAGAGGACGTT
CTGGAAAAGCATGTAGGGCAGGTGGAAAGTGATGATAAGGCTGATGTGAAAAAACAAGGCACTCTTCTTAAACTGCCTATGGAGGACTTACTGACTGATCCTTGCATTGC
ATTTGCAGTTAAAACGCTTACTGGAGACGTTTTTGACGCATCCATAAGCTCAGAAGCATCACTGATGCCAAACAATATCGATCGGCCTTTCGTGGCCTTTACTCACTTGG
AATCAGAAAACAAGCTTAATGAGAAGTTGCCATCTCCAGAACTTCCTGTTACTCGAGTTGGGAAAGTCGAAAATAACAACATTGTTGAAAAGCTAGACTCGACTCTGAAG
CTGCCAGTTGGGGAAATATGGGCGGACCCTTGTATAGAATTTGCTATTAAAACTCTTACTGATGAGATCCCACTTGTTAACAATCCAGATATTGAGGATTACTTTCACCA
ATTTGCCCCCTCAAAAACCGAAGGAACTAACGCAAATGCCTCGAATCATGTTGTAAAGCTAGATAACAGGAAGCCAGATTGTGCTGACGCTCTCAGACAGGAGGATGTAA
AACGAATTAGGGGCCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGTGGCACCGTTAGAAAGCCTGTAGCTCAACTCCGTTTCGTCATTTTTTCCGTTTCCGTTTCGCTCTCTATCGCTGCTTGTCTCATAATCTCTTCACGTCTTCTTCG
TCATGTCCCGACGACCAATGGGAGGGATGCGGAATTACATATTTACCCTCACCGGAGTGAAGTTCAATTGCCATCTGTAAATTGTACGGCATTTTCATGGAGCGGGAAGT
GCCGTACGAGGAGTTCTGTAATTGTGAATGAAGAAGGAGATCGAGACCGTCAAAACTTCGACACGACGTGTCCGGAATACTTCCGTTGGATTCACGAGGATCTAAGGCCG
TGGGCGGGAACGGGGATCACGAGGGAGATGGTGGAGAGCGGGCGGCCAAAGGCGGGTTTCCGGCTGGTGATAATCGACGGTAGGGCTTACGTGGAGAAATTTATGGATGC
GTATCAAAGTAGGGATAAGTTTACGCTGTGGGGGATCCTACAGTTGTTACGGTTGTACCCTGGTAAAATTCCCGATTTAGACCTCATGTTCAATTGCGAGGACCGGCCCA
ACATTTTCATTGGTGACTATAGTGGGCCCGGGCCCAATTCGACGGCCCCACCTCCTTTGTTCCGATACTGTGGAGATGATGACACGTTGGACATTGTCTTTCCTGATTGG
TCCTTCTGGGGATGGCCTGAGATAAATATAAAGCCATGGGTACCCCTGATGGAAGATCTAAAACAAGGCAACAAAAGGACAAAATGGTCAAATAGAGAAGCTTATGCTTA
TTGGAAGGGGAATATTAAAGTTTCTATGGTTCGATATAAACTTCTAGAATGCAATCTCTCTCGTGAGCACGATTGGAAGGCTCGTGTGTTCATGCAGGATTGGGATAAAG
AACAGAAAGAACGATTCAAAAACTCCAATCTAGCTGATCAATGTGTTCATAGGTATAAAATATACGTGGAGGGGGTTGGTTGGTCGGTGAGTCTTAAATACATTCTTGCT
TGTGATTCGGTGACATTGATGGTAAATCCCTATTACTATGATTTCTTCACTAGAAGTTTAGTGCCAATGCATCACTATTGGCCAATCAAAGATGACGACGACATGTGCAA
CTCTATCAAGTTTGCCGTCTACTGGGGCAACACCCATCAACAAAAGGTACGGGCAATTGGGAAGGCAGCGAGCAAGTTTACCGAGGAGGAGCTAAGGATGGAGAAGGTGT
ATGACTACATGTTTCACAGTCTAAATGAATACTCCAAACTCTTAACCTTCAAGCCAACCATCCCACCAAATGCTACTGAACTCTGCTTGGAGGAATTGGCTTGCCCTGCC
CAAGACTTAGCCACCAAGTTTATGATCGATACTCTCGTAAAACAAAACGAAAAAGCGCGCCAAAGCGGTGGCCACCGTGTCTGTATAAATGCGTCTGTGTGCGTGAGAAC
TGAGACGAAATTTTGGAGTGCAAATTTCATGGGGAAACAGAAGCAACAAGTGATTTCTCGATTCTTTGCTCCGAAACCCAAGTCCCCATCTGTTTCTTCTTCTTCCTCCT
CCTCCTCCTCCGCCTTAGCTCCGCCGACCCAACCGTTTTCACCGCCTAAGATATCAGCAACAGTCACTTTCTCCCCTTCCAAGCGTCTCATTTCTTCTGCACTCGCTTCT
CAGTTAACGCCACCCAAATCCTCTAAACGCCCGAAACTCTCTCCTCATACCCATAACCCTCTGCCTTCCGTCTCCAATCCCTCGCTTCACAAAAGATTCCTTGAGAAGTT
CCTTGAACCCATTGAAAACTCCTTTGAATCTTCTAATCAAAACCCCAAAGCCCCAACGGGCGGTGATGCCAAATACACCCCTTTGGAGCTACAAGTTGTGGACCTTAAGA
AGAGGTACCCTGATGTTCTTCTCATGGTGGAGGTTGGTTACAGGTACAGATTTTTTGGCAAAGATGCTGAAATAGCAGCTCGTGTTTTGGGTATTTATGCCCATTTGGAT
CGTAATTTTATGACTGCGAGTATACCCACGTTTAGATTGAATGTGCATGTGAGGAGGCTAGTTAGTGCAGGATATAAGGTGGGTGTTGTCAAGCAAACTGAAACCGCTGC
GATTAAGGCTCATGGGTCGAATAAGTTAGGCCCCTTTTGCAGGGGACTATCGGCTTTGTATACTAAAGCAACATTGGAGGCAGTCCAGAATTTAGGGGGAGCTGAGGAGG
GATGTGGGGGAGAGAGTAATTACTTGTTCTGTGTAGTTGAAAAGAGCATGTTGGTGGATAATTTGGACCGTAGAATCGAATATGGGGTTGATGTGAAAATTGGGATGGTT
GCCATGGAGATATCAACTGGGGATGTTATTTATAGGGAATATGATGATAACTTCATGAGAAGTGGGCTTGAGGCAATGCTCTTGAGCTTGTCTCCTGCAGAGCTTCTTCT
TGGGGATCCCATATCAAAGGCAACAGAGAAGGTGTTCGATCGAGTAGGAAGCCTTTCGTTTACTGTAGCATTGAATGCTGTTGCTTGCTGTACCGATTGTGTAGTTTCGA
TTGTTTGTGACCAGTTATTACTAGGTTATGCTGGACCTGCTTCAAATGTCCGTGTGGAGCATGTTTCACGAGATTGCTTTAAAGATGGCAGTGCACTTGCTGAAGTGATG
TCTTTGTATGAAAACATTGACAACGACAACTTAGCAGCTGATCATCACGACCCTGATACAGTGTTGGCTGGACGAAAAAGTGATCGCATAGCAATTAAGGAAATAATGAA
CATGCCAAATTTAGCTCTTCAAGCATTGGCCTTGACCATTCGTTACTTAAAGCAATTTGGTTTAGAAAGAATTGTGTCCTTGGGATCTTCTTTTAGGCCATTCTCATGCA
AAATGGAGATGACCCTCTCAGGCAATACACTCACACAACTTGAGGTTTTGAAGAATAATGATGACGGTTCTGAAACTGGCTCTCTATTACAGTGCATGAATCACACTCTT
ACAATATTCGGTTCGAGGCTTCTTCGCCAATGGATAACACATCCTTTATGTGAAAGAGACATGATAATTGCTCGTCAAGAGGCCGTTTCCGAGATTGCTGCAGCTATGGT
ATCTTCAAAAGTATATCAAAATATCAGAGAGTCAGATGAAGAAGATTCTGATGTAATGGTCATTGAACCAGAATTGAATTATATACTTTCTTCAGTGTTAACAACCTTGG
GAGGGGCACCAGATATTCAGCGTGGAATAACACGAATCTTTCACCGAACTGCAGCCCCGTCAGAGGATGATGGTTCCGATATTTCGGTCTCTGGCCTGAGTTTTCGATTT
GCCTTCATTGCAGTTATTCAAGCTATTTTATATGCGGGAAAACAGCTTCAGCAGTTTCACATTGATGAAGAGGATGACAATTATTCCAGTGAAAGTATGATTGGCTCCAA
GCTCCTAAGAAAGCTGATTTTATCTGCTTCCTCCTCTGGTTTGATTAGCATTGCTGCAAAACTTTTGTCAACGATCAGCAAAGAAGCTGCAGATCAAGGGGACCTTCCTA
ACCTGATGATCATCAATAATGACCAATTTCCGAAGGTTGCTAGAGCTCGGAAGGAGGCTCAATCTGCAAGAGAGAAATTGGATTCTCTGATCACCGTATACCGTAAGCAG
CTTGGAATGCGGAAGTTGGAGTTCATTAGTGTGTCTGGAACTACGCATTTGATTGAGTTGGCCTTAGATGTAAAGGTACCCTCTAATTGGGTGAAGATCAATAGTACAAA
GAAAACTATAAGATATCACCCACCGGAAGTACTGGCTGCCTTAGACGAGCTATCATTGGCAAATGAGGAGCTCATGGTAGCATCTCGGGATGCTTGGGATGGCTTTCTAA
GTGGATTCAGCAGATATTATGCAGAGTTTCAATCTGCTGTTCAAGCACTGGCTTCCATTGACTGTCTGTATTCGTTAGCTATTCTCTCAAGACATAAGAACTATGTTCGT
CCAGAGTTTGTACATGATGATGAACCTGCTCAGATAGTTATATGTTCTGGACGCCATCCGGTTTTGGAGAGTACATTACAAGGCAATTTCGTCCCAAATGACACGAATTT
GGACGTGAATGGAGAGCATTGTCAAATTGTAACAGGACCAAACATGGGAGGGAAAAGTTGCTACATTCGACAAGTTGCTCTCATTGCTCTAATGGCTCAGGTCGGTTCCT
ATGTACCAGCATTCTCCGTGAAGCTTCACGTCCTGGACGGTATATACACTCGAATGGGTGCTTCTGACAGTATCCAACAAGGTAGAAGCACCTTCTTAGAAGAAATGACT
GAGACTTCACATATACTCCATCATTCCACATCACATTCCTTGGTCATAATTGACGAGCTTGGGAGAGGAACAAGCACCCACGATGGCGTGGCCATTGCATATGCAGCTTT
GCATAATCTTCTCCACCACAAGAAATGTTTGGTCCTCTTCGTTACCCATTATCCTAAAGTTGCTGATATTACAAAGGAATTCCCAGCATCTGCAGGGGTATACCATGTCT
CATATCTGACTTCACACGAAACGCCAAGCTTCTCAGGCCCAAAATCCGCTCAGGATGTGATTTACCTATATAAGCTCGTTCCTGGAATTGCGGAGAGCAGTTTTGGTTTC
AAGGTTGCCCAACTTGCACAGATACCTTTATCATGTATTGCACGAGCCACAGAAATGGGGGTATGGTTAGAAGAAATAGTAACCAGAAGAGCACAACGCAAAACCAGAGA
AATACATTTACAAGAAGCATCAGGCAAGGGATTGGAATTGGAAAGCTCCGAGTGCTTCCTTGAAGATAGAATTGATGCTTACGAGGAATTCTTTCTGTTCTTGAAAGCTA
CAATATGTTCTCCTGATGATATGGGGAAACGTTTCCTTCAATTTTCCCAAGCTAGAAGCATGGCCATGGACTTGTTAGGAAGTAGACGTTCCCGGAACCTCTTGCGGTTT
CAATCCTCTGCTTCACTCGCAAAAAGTTATCGTAGTCGTCCACTTTGCACGGTGGAACTTGGCGAGCCGGAAGAGAGAAAAGAGAAAGGGTTATTGAGATTGGCAACTGG
GTTTCACCGGAAAGTCCGAAATTCAGAAAACTGGCTACCACCCGGCTGGACTGTGAAGGTCAAAGTGAGGAAGAGTGGCAAGAAGGATAAGTATTATTTTGAACCCTCAA
GTCAAATGAGATTTAATTCTAGGGCAGAGGTGTTTAGATACCTCAAAACTGCTGCAATCAATCATCCTGAATCTGAAGGGAGCAGAACCGTGGAGCAACCAGAGAACAAT
GTGGAAGTTAAGAAGACTATAGCAAAAGGGTTACCTCCTGGATGGATCAGAGAAATCAGAGTAACGAGGACTGCTAACAGAATAAGAAGAGATTCATTTTACATTAATCC
TGTAAATGGAAATGCGCTTCGCTCGTTGAGGGATGTACATCGGTATCTAACAAGTGGAAAGGTGAGCCGATTAGCTTATAAATCAAGGAGCCAGAGCAACAGCAACAGCA
ACATTGAGTTTCAGCACGATGATATCTCTTCACCTTCTGTCTCCGAGAAAGAGATGCTAGCGATTGGCAAAGCAAAGAGACAGATTATATGGAGCGACAACTCACCGGAA
CCGAGTGAAATAGCAGATGATGAAGCTATTTTTCCAAAGGCTGCAAGTAACACATGTACAGTGCTTAGAAGGGTTTTCATTTTTGAAGATTCTGTATCTGATGTGAATTT
AGAAACTACTGCAGGGAGGATAGGCGCTAAGTTGCACTGCTCAACTCCCGAAGAGGCAAAAGAACGTGAGCAATCAGAAGGAAAGAATAATTTTTCCGACTTTGTGCCAA
CTCCTGATAAATCCTTCCAACACAAGTGCACTAGTGAGAATGGGGCAACAAAGATTGAGAGTAGAAAGAGACAAAGGAAAAGTAAAGACATAAACTTGCGATGCCGTGCT
TCAAAGCGACTTGCAGGGCTCCAAGCTGAGCCAGTGCTTGAAGCTAAAACTGGTCACCAAGCGTGTTCGATTGCATGTGAAGAGTCTGATAAGCAAGTAGCCAGTACAAC
TAAGTGGCCATCTCAAAGTCCTGAAAATCCTGATGTCAAGCATGAAACAAAGAGCATAATTGACCCTCCTAAGATTGACACAAATCTAGATTCAAGTAGAAAAGAACATT
CTGCTGTTGACTTGTCCATTCTGCCTGAGAAGGTTAAGAAGATGAAATCTGAAGATGCATATGAGCTGCAGCAGCCGAAATGCAATTCCTTTCTGCCTTCAGAGGACGTT
CTGGAAAAGCATGTAGGGCAGGTGGAAAGTGATGATAAGGCTGATGTGAAAAAACAAGGCACTCTTCTTAAACTGCCTATGGAGGACTTACTGACTGATCCTTGCATTGC
ATTTGCAGTTAAAACGCTTACTGGAGACGTTTTTGACGCATCCATAAGCTCAGAAGCATCACTGATGCCAAACAATATCGATCGGCCTTTCGTGGCCTTTACTCACTTGG
AATCAGAAAACAAGCTTAATGAGAAGTTGCCATCTCCAGAACTTCCTGTTACTCGAGTTGGGAAAGTCGAAAATAACAACATTGTTGAAAAGCTAGACTCGACTCTGAAG
CTGCCAGTTGGGGAAATATGGGCGGACCCTTGTATAGAATTTGCTATTAAAACTCTTACTGATGAGATCCCACTTGTTAACAATCCAGATATTGAGGATTACTTTCACCA
ATTTGCCCCCTCAAAAACCGAAGGAACTAACGCAAATGCCTCGAATCATGTTGTAAAGCTAGATAACAGGAAGCCAGATTGTGCTGACGCTCTCAGACAGGAGGATGTAA
AACGAATTAGGGGCCTCTAA
Protein sequenceShow/hide protein sequence
MCGTVRKPVAQLRFVIFSVSVSLSIAACLIISSRLLRHVPTTNGRDAELHIYPHRSEVQLPSVNCTAFSWSGKCRTRSSVIVNEEGDRDRQNFDTTCPEYFRWIHEDLRP
WAGTGITREMVESGRPKAGFRLVIIDGRAYVEKFMDAYQSRDKFTLWGILQLLRLYPGKIPDLDLMFNCEDRPNIFIGDYSGPGPNSTAPPPLFRYCGDDDTLDIVFPDW
SFWGWPEINIKPWVPLMEDLKQGNKRTKWSNREAYAYWKGNIKVSMVRYKLLECNLSREHDWKARVFMQDWDKEQKERFKNSNLADQCVHRYKIYVEGVGWSVSLKYILA
CDSVTLMVNPYYYDFFTRSLVPMHHYWPIKDDDDMCNSIKFAVYWGNTHQQKVRAIGKAASKFTEEELRMEKVYDYMFHSLNEYSKLLTFKPTIPPNATELCLEELACPA
QDLATKFMIDTLVKQNEKARQSGGHRVCINASVCVRTETKFWSANFMGKQKQQVISRFFAPKPKSPSVSSSSSSSSSALAPPTQPFSPPKISATVTFSPSKRLISSALAS
QLTPPKSSKRPKLSPHTHNPLPSVSNPSLHKRFLEKFLEPIENSFESSNQNPKAPTGGDAKYTPLELQVVDLKKRYPDVLLMVEVGYRYRFFGKDAEIAARVLGIYAHLD
RNFMTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKAHGSNKLGPFCRGLSALYTKATLEAVQNLGGAEEGCGGESNYLFCVVEKSMLVDNLDRRIEYGVDVKIGMV
AMEISTGDVIYREYDDNFMRSGLEAMLLSLSPAELLLGDPISKATEKVFDRVGSLSFTVALNAVACCTDCVVSIVCDQLLLGYAGPASNVRVEHVSRDCFKDGSALAEVM
SLYENIDNDNLAADHHDPDTVLAGRKSDRIAIKEIMNMPNLALQALALTIRYLKQFGLERIVSLGSSFRPFSCKMEMTLSGNTLTQLEVLKNNDDGSETGSLLQCMNHTL
TIFGSRLLRQWITHPLCERDMIIARQEAVSEIAAAMVSSKVYQNIRESDEEDSDVMVIEPELNYILSSVLTTLGGAPDIQRGITRIFHRTAAPSEDDGSDISVSGLSFRF
AFIAVIQAILYAGKQLQQFHIDEEDDNYSSESMIGSKLLRKLILSASSSGLISIAAKLLSTISKEAADQGDLPNLMIINNDQFPKVARARKEAQSAREKLDSLITVYRKQ
LGMRKLEFISVSGTTHLIELALDVKVPSNWVKINSTKKTIRYHPPEVLAALDELSLANEELMVASRDAWDGFLSGFSRYYAEFQSAVQALASIDCLYSLAILSRHKNYVR
PEFVHDDEPAQIVICSGRHPVLESTLQGNFVPNDTNLDVNGEHCQIVTGPNMGGKSCYIRQVALIALMAQVGSYVPAFSVKLHVLDGIYTRMGASDSIQQGRSTFLEEMT
ETSHILHHSTSHSLVIIDELGRGTSTHDGVAIAYAALHNLLHHKKCLVLFVTHYPKVADITKEFPASAGVYHVSYLTSHETPSFSGPKSAQDVIYLYKLVPGIAESSFGF
KVAQLAQIPLSCIARATEMGVWLEEIVTRRAQRKTREIHLQEASGKGLELESSECFLEDRIDAYEEFFLFLKATICSPDDMGKRFLQFSQARSMAMDLLGSRRSRNLLRF
QSSASLAKSYRSRPLCTVELGEPEERKEKGLLRLATGFHRKVRNSENWLPPGWTVKVKVRKSGKKDKYYFEPSSQMRFNSRAEVFRYLKTAAINHPESEGSRTVEQPENN
VEVKKTIAKGLPPGWIREIRVTRTANRIRRDSFYINPVNGNALRSLRDVHRYLTSGKVSRLAYKSRSQSNSNSNIEFQHDDISSPSVSEKEMLAIGKAKRQIIWSDNSPE
PSEIADDEAIFPKAASNTCTVLRRVFIFEDSVSDVNLETTAGRIGAKLHCSTPEEAKEREQSEGKNNFSDFVPTPDKSFQHKCTSENGATKIESRKRQRKSKDINLRCRA
SKRLAGLQAEPVLEAKTGHQACSIACEESDKQVASTTKWPSQSPENPDVKHETKSIIDPPKIDTNLDSSRKEHSAVDLSILPEKVKKMKSEDAYELQQPKCNSFLPSEDV
LEKHVGQVESDDKADVKKQGTLLKLPMEDLLTDPCIAFAVKTLTGDVFDASISSEASLMPNNIDRPFVAFTHLESENKLNEKLPSPELPVTRVGKVENNNIVEKLDSTLK
LPVGEIWADPCIEFAIKTLTDEIPLVNNPDIEDYFHQFAPSKTEGTNANASNHVVKLDNRKPDCADALRQEDVKRIRGL