; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10002870 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10002870
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDNA mismatch repair protein MLH3 isoform X2
Genome locationChr11:14899594..14923813
RNA-Seq ExpressionHG10002870
SyntenyHG10002870
Gene Ontology termsGO:0006298 - mismatch repair (biological process)
GO:0009734 - auxin-activated signaling pathway (biological process)
GO:0055085 - transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0032300 - mismatch repair complex (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
InterPro domainsIPR013507 - DNA mismatch repair protein, S5 domain 2-like
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR014762 - DNA mismatch repair, conserved site
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR028830 - DNA mismatch repair protein Mlh3
IPR036890 - Histidine kinase/HSP90-like ATPase superfamily
IPR038973 - DNA mismatch repair protein MutL/Mlh/Pms


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448461.1 PREDICTED: DNA mismatch repair protein MLH3 isoform X2 [Cucumis melo]0.0e+0080.26Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        MGTIKPLPKSVR+SVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGI+RDGLVLLGERYVTSKFHDLID D KGGTFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISD+SLVEIITRACGRANGYRKVLKGCKCLYLGI DDMED GTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE
        LLCTDPSPSPLSLLRSGFGSEVSRSL ELKIG GDLKLSGYI SPFD+FSIK                                  TD VFH RKRSRSE
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE

Query:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID
        ANPAYVLNLECP SFYDLTFESSKT VQFKDWTPILTFIEEAIQQFWKEKYNCGKS+VH  PIVGD+LWKDEDN IS KS +ILSVKK+RM+SCQASLID
Subjt:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID

Query:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEG
        +FSPSV+ T+HDDILS+R  DKKA ESS TSSIE DDGD   A+MQ S+QA HF KSWDTPLAKCSTTAV++ND YQ VPE   + E SFLDRRLNSP+G
Subjt:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEG

Query:  CDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIED
        CDDIVE+NIFC + KGQSSKMHI+ ITGSA+STPS YFHEFSYDD IF GNKPSLTGCSS SSF    PY+QNDVI RTQMQGM DDE DI++LDAYI+ 
Subjt:  CDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIED

Query:  SDFCAGTSLHAEKFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTLIID
        SDFCAG+SLHAE FLSSYQTRNSPN H+TS SILATE DVDCFSVRDEVERSWRSRD+TPFK LVDDDEKGC FDYDIMLSSS K NY SSY DS  I+D
Subjt:  SDFCAGTSLHAEKFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTLIID

Query:  DVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQRKAERPN
        DVFDTRE L  FLKKSNNF+HSSP SPDMHS QKYF NWRLP RDCEKAY SSE KF HQA +QKY SVERPRRGKSAPPFYKRKTSFYCLDQ+KAERPN
Subjt:  DVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQRKAERPN

Query:  ATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEKQGEISK
        A SFYC+NE KAD+ SA++ Y MDQGKVE LKASVFLDSPPHLE  ELRDS+H SGTSN+YVKPFPVDDLL+ TRSSRTDTIKM AIMG+++EKQGEISK
Subjt:  ATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEKQGEISK

Query:  QSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGILAV
        QSQSDVK                              RNEDS+AFDDEVSILDISSGFLSLASNSLVPD IDKNF ++AKVLLQLDKKFIPVVSGGILAV
Subjt:  QSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGILAV

Query:  IDQ
        IDQ
Subjt:  IDQ

XP_038903642.1 DNA mismatch repair protein MLH3 isoform X1 [Benincasa hispida]0.0e+0081.82Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        MGTIKPLPKSVRSSVRAG+ILYDVTKVVEELVYNSLDAGASKISIF+GIGTSYVKVVDDGSGI+RDGLVLLGERYVTSKFHDL+DMD+K GTFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCV+RTAL+HSKVSFK+VDSESESI
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE
        LLCTDPSPSPLSLLRSGFGSEVSRSL ELKIGDGDLKLSGYI SPFDSFSIK                                  TDHV H RKRSRSE
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE

Query:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID
        ANPAYVLNL+CPGSFYDLTFESSKT+VQFKDWTPILTFIEEA+QQFWKEKYNCGKSLVH TPIVGDQLWKDEDNMIS KSKNI SVKKSRM+SCQASL D
Subjt:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID

Query:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGC
        LFSPSV+LTEHDDILSHRL DKKA ESS TSSIELDDGDQ ARMQ S QADHFSKSWDTPLAKCSTTAVQ+ND+YQ VPEN  + EDSFLDRRLNSP+GC
Subjt:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGC

Query:  DDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRL-------DEPYVQNDVIKRTQMQGMPDDEDDILRL
        DDIVEDNIFC +LKGQSSKM  NMITGSA STPS YF EFSYD+YI TGNKPSL GCSS SSF+L       D+ YVQNDVIKRTQMQ +PDDE DIL+L
Subjt:  DDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRL-------DEPYVQNDVIKRTQMQGMPDDEDDILRL

Query:  DAYIEDSDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISS
        DAY + SD CAGTS HAE   KFLSSYQTR+SPNG VTS+ ILA+E DVDCFSVRDE ERSWRSRD+TP KDLVDDDEKGC+FD DI LSSSNKKNYI+S
Subjt:  DAYIEDSDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISS

Query:  YIDSTLIIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCL
         IDSTLIIDDVFD RE LSTFLKKSN+ KHSSP+SPDMHS QKYFFNWRLPGRDCEKA ESSEL F HQ L++KYFSVERPRRGKSAPPFYKRKTSFYCL
Subjt:  YIDSTLIIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCL

Query:  DQRKAERPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSN
        DQ KAERP+ATSFYC+NERKA+K SATN Y  DQGKVE L+A VFLD PPHLELGELRDSKHF GTSNRYVKPFPVDD LMGTRS+RT TIKMPAIMG++
Subjt:  DQRKAERPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSN

Query:  DEKQGEISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIP
        +EKQGEISKQSQ DVK                               NEDS+AFDDEVSILDI+SGFLSLASNSLVP+SIDKNF EDAKVLLQLDKKFIP
Subjt:  DEKQGEISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIP

Query:  VVSGGILAVIDQ
        VVSGGILAVIDQ
Subjt:  VVSGGILAVIDQ

XP_038903643.1 DNA mismatch repair protein MLH3 isoform X2 [Benincasa hispida]0.0e+0082.06Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        MGTIKPLPKSVRSSVRAG+ILYDVTKVVEELVYNSLDAGASKISIF+GIGTSYVKVVDDGSGI+RDGLVLLGERYVTSKFHDL+DMD+K GTFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCV+RTAL+HSKVSFK+VDSESESI
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE
        LLCTDPSPSPLSLLRSGFGSEVSRSL ELKIGDGDLKLSGYI SPFDSFSIK                                  TDHV H RKRSRSE
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE

Query:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID
        ANPAYVLNL+CPGSFYDLTFESSKT+VQFKDWTPILTFIEEA+QQFWKEKYNCGKSLVH TPIVGDQLWKDEDNMIS KSKNI SVKKSRM+SCQASL D
Subjt:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID

Query:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGC
        LFSPSV+LTEHDDILSHRL DKKA ESS TSSIELDDGDQ ARMQ S QADHFSKSWDTPLAKCSTTAVQ+ND+YQ VPEN  + EDSFLDRRLNSP+GC
Subjt:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGC

Query:  DDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRL-------DEPYVQNDVIKRTQMQGMPDDEDDILRL
        DDIVEDNIFC +LKGQSSKM  NMITGSA STPS YF EFSYD+YI TGNKPSL GCSS SSF+L       D+ YVQNDVIKRTQMQ +PDDE DIL+L
Subjt:  DDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRL-------DEPYVQNDVIKRTQMQGMPDDEDDILRL

Query:  DAYIEDSDFCAGTSLHAEKFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYID
        DAY + SD CAGTS HAEKFLSSYQTR+SPNG VTS+ ILA+E DVDCFSVRDE ERSWRSRD+TP KDLVDDDEKGC+FD DI LSSSNKKNYI+S ID
Subjt:  DAYIEDSDFCAGTSLHAEKFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYID

Query:  STLIIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQR
        STLIIDDVFD RE LSTFLKKSN+ KHSSP+SPDMHS QKYFFNWRLPGRDCEKA ESSEL F HQ L++KYFSVERPRRGKSAPPFYKRKTSFYCLDQ 
Subjt:  STLIIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQR

Query:  KAERPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEK
        KAERP+ATSFYC+NERKA+K SATN Y  DQGKVE L+A VFLD PPHLELGELRDSKHF GTSNRYVKPFPVDD LMGTRS+RT TIKMPAIMG+++EK
Subjt:  KAERPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEK

Query:  QGEISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVS
        QGEISKQSQ DVK                               NEDS+AFDDEVSILDI+SGFLSLASNSLVP+SIDKNF EDAKVLLQLDKKFIPVVS
Subjt:  QGEISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVS

Query:  GGILAVIDQ
        GGILAVIDQ
Subjt:  GGILAVIDQ

XP_038903644.1 DNA mismatch repair protein MLH3 isoform X3 [Benincasa hispida]0.0e+0081.82Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        MGTIKPLPKSVRSSVRAG+ILYDVTKVVEELVYNSLDAGASKISIF+GIGTSYVKVVDDGSGI+RDGLVLLGERYVTSKFHDL+DMD+K GTFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCV+RTAL+HSKVSFK+VDSESESI
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE
        LLCTDPSPSPLSLLRSGFGSEVSRSL ELKIGDGDLKLSGYI SPFDSFSIK                                  TDHV H RKRSRSE
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE

Query:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID
        ANPAYVLNL+CPGSFYDLTFESSKT+VQFKDWTPILTFIEEA+QQFWKEKYNCGKSLVH TPIVGDQLWKDEDNMIS KSKNI SVKKSRM+SCQASL D
Subjt:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID

Query:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGC
        LFSPSV+LTEHDDILSHRL DKKA ESS TSSIELDDGDQ ARMQ S QADHFSKSWDTPLAKCSTTAVQ+ND+YQ VPEN  + EDSFLDRRLNSP+GC
Subjt:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGC

Query:  DDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRL-------DEPYVQNDVIKRTQMQGMPDDEDDILRL
        DDIVEDNIFC +LKGQSSKM  NMITGSA STPS YF EFSYD+YI TGNKPSL GCSS SSF+L       D+ YVQNDVIKRTQMQ +PDDE DIL+L
Subjt:  DDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRL-------DEPYVQNDVIKRTQMQGMPDDEDDILRL

Query:  DAYIEDSDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISS
        DAY + SD CAGTS HAE   KFLSSYQTR+SPNG VTS+ ILA+E DVDCFSVRDE ERSWRSRD+TP KDLVDDDEKGC+FD DI LSSSNKKNYI+S
Subjt:  DAYIEDSDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISS

Query:  YIDSTLIIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCL
         IDSTLIIDDVFD RE LSTFLKKSN+ KHSSP+SPDMHS QKYFFNWRLPGRDCEKA ESSEL F HQ L++KYFSVERPRRGKSAPPFYKRKTSFYCL
Subjt:  YIDSTLIIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCL

Query:  DQRKAERPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSN
        DQ KAERP+ATSFYC+NERKA+K SATN Y  DQGKVE L+A VFLD PPHLELGELRDSKHF GTSNRYVKPFPVDD LMGTRS+RT TIKMPAIMG++
Subjt:  DQRKAERPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSN

Query:  DEKQGEISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIP
        +EKQGEISKQSQ DVK                               NEDS+AFDDEVSILDI+SGFLSLASNSLVP+SIDKNF EDAKVLLQLDKKFIP
Subjt:  DEKQGEISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIP

Query:  VVSGGILAVIDQ
        VVSGGILAVIDQ
Subjt:  VVSGGILAVIDQ

XP_038903645.1 DNA mismatch repair protein MLH3 isoform X4 [Benincasa hispida]0.0e+0081.82Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        MGTIKPLPKSVRSSVRAG+ILYDVTKVVEELVYNSLDAGASKISIF+GIGTSYVKVVDDGSGI+RDGLVLLGERYVTSKFHDL+DMD+K GTFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCV+RTAL+HSKVSFK+VDSESESI
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE
        LLCTDPSPSPLSLLRSGFGSEVSRSL ELKIGDGDLKLSGYI SPFDSFSIK                                  TDHV H RKRSRSE
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE

Query:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID
        ANPAYVLNL+CPGSFYDLTFESSKT+VQFKDWTPILTFIEEA+QQFWKEKYNCGKSLVH TPIVGDQLWKDEDNMIS KSKNI SVKKSRM+SCQASL D
Subjt:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID

Query:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGC
        LFSPSV+LTEHDDILSHRL DKKA ESS TSSIELDDGDQ ARMQ S QADHFSKSWDTPLAKCSTTAVQ+ND+YQ VPEN  + EDSFLDRRLNSP+GC
Subjt:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGC

Query:  DDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRL-------DEPYVQNDVIKRTQMQGMPDDEDDILRL
        DDIVEDNIFC +LKGQSSKM  NMITGSA STPS YF EFSYD+YI TGNKPSL GCSS SSF+L       D+ YVQNDVIKRTQMQ +PDDE DIL+L
Subjt:  DDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRL-------DEPYVQNDVIKRTQMQGMPDDEDDILRL

Query:  DAYIEDSDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISS
        DAY + SD CAGTS HAE   KFLSSYQTR+SPNG VTS+ ILA+E DVDCFSVRDE ERSWRSRD+TP KDLVDDDEKGC+FD DI LSSSNKKNYI+S
Subjt:  DAYIEDSDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISS

Query:  YIDSTLIIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCL
         IDSTLIIDDVFD RE LSTFLKKSN+ KHSSP+SPDMHS QKYFFNWRLPGRDCEKA ESSEL F HQ L++KYFSVERPRRGKSAPPFYKRKTSFYCL
Subjt:  YIDSTLIIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCL

Query:  DQRKAERPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSN
        DQ KAERP+ATSFYC+NERKA+K SATN Y  DQGKVE L+A VFLD PPHLELGELRDSKHF GTSNRYVKPFPVDD LMGTRS+RT TIKMPAIMG++
Subjt:  DQRKAERPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSN

Query:  DEKQGEISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIP
        +EKQGEISKQSQ DVK                               NEDS+AFDDEVSILDI+SGFLSLASNSLVP+SIDKNF EDAKVLLQLDKKFIP
Subjt:  DEKQGEISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIP

Query:  VVSGGILAVIDQ
        VVSGGILAVIDQ
Subjt:  VVSGGILAVIDQ

TrEMBL top hitse value%identityAlignment
A0A1S3BJQ0 DNA mismatch repair protein MLH3 isoform X10.0e+0080.02Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        MGTIKPLPKSVR+SVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGI+RDGLVLLGERYVTSKFHDLID D KGGTFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISD+SLVEIITRACGRANGYRKVLKGCKCLYLGI DDMED GTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE
        LLCTDPSPSPLSLLRSGFGSEVSRSL ELKIG GDLKLSGYI SPFD+FSIK                                  TD VFH RKRSRSE
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE

Query:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID
        ANPAYVLNLECP SFYDLTFESSKT VQFKDWTPILTFIEEAIQQFWKEKYNCGKS+VH  PIVGD+LWKDEDN IS KS +ILSVKK+RM+SCQASLID
Subjt:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID

Query:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEG
        +FSPSV+ T+HDDILS+R  DKKA ESS TSSIE DDGD   A+MQ S+QA HF KSWDTPLAKCSTTAV++ND YQ VPE   + E SFLDRRLNSP+G
Subjt:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEG

Query:  CDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIED
        CDDIVE+NIFC + KGQSSKMHI+ ITGSA+STPS YFHEFSYDD IF GNKPSLTGCSS SSF    PY+QNDVI RTQMQGM DDE DI++LDAYI+ 
Subjt:  CDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIED

Query:  SDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTL
        SDFCAG+SLHAE    FLSSYQTRNSPN H+TS SILATE DVDCFSVRDEVERSWRSRD+TPFK LVDDDEKGC FDYDIMLSSS K NY SSY DS  
Subjt:  SDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTL

Query:  IIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQRKAE
        I+DDVFDTRE L  FLKKSNNF+HSSP SPDMHS QKYF NWRLP RDCEKAY SSE KF HQA +QKY SVERPRRGKSAPPFYKRKTSFYCLDQ+KAE
Subjt:  IIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQRKAE

Query:  RPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEKQGE
        RPNA SFYC+NE KAD+ SA++ Y MDQGKVE LKASVFLDSPPHLE  ELRDS+H SGTSN+YVKPFPVDDLL+ TRSSRTDTIKM AIMG+++EKQGE
Subjt:  RPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEKQGE

Query:  ISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGI
        ISKQSQSDVK                              RNEDS+AFDDEVSILDISSGFLSLASNSLVPD IDKNF ++AKVLLQLDKKFIPVVSGGI
Subjt:  ISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGI

Query:  LAVIDQ
        LAVIDQ
Subjt:  LAVIDQ

A0A1S3BKD1 DNA mismatch repair protein MLH3 isoform X70.0e+0080.02Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        MGTIKPLPKSVR+SVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGI+RDGLVLLGERYVTSKFHDLID D KGGTFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISD+SLVEIITRACGRANGYRKVLKGCKCLYLGI DDMED GTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE
        LLCTDPSPSPLSLLRSGFGSEVSRSL ELKIG GDLKLSGYI SPFD+FSIK                                  TD VFH RKRSRSE
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE

Query:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID
        ANPAYVLNLECP SFYDLTFESSKT VQFKDWTPILTFIEEAIQQFWKEKYNCGKS+VH  PIVGD+LWKDEDN IS KS +ILSVKK+RM+SCQASLID
Subjt:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID

Query:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEG
        +FSPSV+ T+HDDILS+R  DKKA ESS TSSIE DDGD   A+MQ S+QA HF KSWDTPLAKCSTTAV++ND YQ VPE   + E SFLDRRLNSP+G
Subjt:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEG

Query:  CDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIED
        CDDIVE+NIFC + KGQSSKMHI+ ITGSA+STPS YFHEFSYDD IF GNKPSLTGCSS SSF    PY+QNDVI RTQMQGM DDE DI++LDAYI+ 
Subjt:  CDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIED

Query:  SDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTL
        SDFCAG+SLHAE    FLSSYQTRNSPN H+TS SILATE DVDCFSVRDEVERSWRSRD+TPFK LVDDDEKGC FDYDIMLSSS K NY SSY DS  
Subjt:  SDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTL

Query:  IIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQRKAE
        I+DDVFDTRE L  FLKKSNNF+HSSP SPDMHS QKYF NWRLP RDCEKAY SSE KF HQA +QKY SVERPRRGKSAPPFYKRKTSFYCLDQ+KAE
Subjt:  IIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQRKAE

Query:  RPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEKQGE
        RPNA SFYC+NE KAD+ SA++ Y MDQGKVE LKASVFLDSPPHLE  ELRDS+H SGTSN+YVKPFPVDDLL+ TRSSRTDTIKM AIMG+++EKQGE
Subjt:  RPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEKQGE

Query:  ISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGI
        ISKQSQSDVK                              RNEDS+AFDDEVSILDISSGFLSLASNSLVPD IDKNF ++AKVLLQLDKKFIPVVSGGI
Subjt:  ISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGI

Query:  LAVIDQ
        LAVIDQ
Subjt:  LAVIDQ

A0A1S3BKL4 DNA mismatch repair protein MLH3 isoform X20.0e+0080.26Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        MGTIKPLPKSVR+SVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGI+RDGLVLLGERYVTSKFHDLID D KGGTFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISD+SLVEIITRACGRANGYRKVLKGCKCLYLGI DDMED GTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE
        LLCTDPSPSPLSLLRSGFGSEVSRSL ELKIG GDLKLSGYI SPFD+FSIK                                  TD VFH RKRSRSE
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE

Query:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID
        ANPAYVLNLECP SFYDLTFESSKT VQFKDWTPILTFIEEAIQQFWKEKYNCGKS+VH  PIVGD+LWKDEDN IS KS +ILSVKK+RM+SCQASLID
Subjt:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID

Query:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEG
        +FSPSV+ T+HDDILS+R  DKKA ESS TSSIE DDGD   A+MQ S+QA HF KSWDTPLAKCSTTAV++ND YQ VPE   + E SFLDRRLNSP+G
Subjt:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEG

Query:  CDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIED
        CDDIVE+NIFC + KGQSSKMHI+ ITGSA+STPS YFHEFSYDD IF GNKPSLTGCSS SSF    PY+QNDVI RTQMQGM DDE DI++LDAYI+ 
Subjt:  CDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIED

Query:  SDFCAGTSLHAEKFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTLIID
        SDFCAG+SLHAE FLSSYQTRNSPN H+TS SILATE DVDCFSVRDEVERSWRSRD+TPFK LVDDDEKGC FDYDIMLSSS K NY SSY DS  I+D
Subjt:  SDFCAGTSLHAEKFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTLIID

Query:  DVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQRKAERPN
        DVFDTRE L  FLKKSNNF+HSSP SPDMHS QKYF NWRLP RDCEKAY SSE KF HQA +QKY SVERPRRGKSAPPFYKRKTSFYCLDQ+KAERPN
Subjt:  DVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQRKAERPN

Query:  ATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEKQGEISK
        A SFYC+NE KAD+ SA++ Y MDQGKVE LKASVFLDSPPHLE  ELRDS+H SGTSN+YVKPFPVDDLL+ TRSSRTDTIKM AIMG+++EKQGEISK
Subjt:  ATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEKQGEISK

Query:  QSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGILAV
        QSQSDVK                              RNEDS+AFDDEVSILDISSGFLSLASNSLVPD IDKNF ++AKVLLQLDKKFIPVVSGGILAV
Subjt:  QSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGILAV

Query:  IDQ
        IDQ
Subjt:  IDQ

A0A1S4DY79 DNA mismatch repair protein MLH3 isoform X50.0e+0080.02Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        MGTIKPLPKSVR+SVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGI+RDGLVLLGERYVTSKFHDLID D KGGTFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISD+SLVEIITRACGRANGYRKVLKGCKCLYLGI DDMED GTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE
        LLCTDPSPSPLSLLRSGFGSEVSRSL ELKIG GDLKLSGYI SPFD+FSIK                                  TD VFH RKRSRSE
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK----------------------------------TDHVFHIRKRSRSE

Query:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID
        ANPAYVLNLECP SFYDLTFESSKT VQFKDWTPILTFIEEAIQQFWKEKYNCGKS+VH  PIVGD+LWKDEDN IS KS +ILSVKK+RM+SCQASLID
Subjt:  ANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLID

Query:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEG
        +FSPSV+ T+HDDILS+R  DKKA ESS TSSIE DDGD   A+MQ S+QA HF KSWDTPLAKCSTTAV++ND YQ VPE   + E SFLDRRLNSP+G
Subjt:  LFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEG

Query:  CDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIED
        CDDIVE+NIFC + KGQSSKMHI+ ITGSA+STPS YFHEFSYDD IF GNKPSLTGCSS SSF    PY+QNDVI RTQMQGM DDE DI++LDAYI+ 
Subjt:  CDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIED

Query:  SDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTL
        SDFCAG+SLHAE    FLSSYQTRNSPN H+TS SILATE DVDCFSVRDEVERSWRSRD+TPFK LVDDDEKGC FDYDIMLSSS K NY SSY DS  
Subjt:  SDFCAGTSLHAE---KFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTL

Query:  IIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQRKAE
        I+DDVFDTRE L  FLKKSNNF+HSSP SPDMHS QKYF NWRLP RDCEKAY SSE KF HQA +QKY SVERPRRGKSAPPFYKRKTSFYCLDQ+KAE
Subjt:  IIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQRKAE

Query:  RPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEKQGE
        RPNA SFYC+NE KAD+ SA++ Y MDQGKVE LKASVFLDSPPHLE  ELRDS+H SGTSN+YVKPFPVDDLL+ TRSSRTDTIKM AIMG+++EKQGE
Subjt:  RPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEKQGE

Query:  ISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGI
        ISKQSQSDVK                              RNEDS+AFDDEVSILDISSGFLSLASNSLVPD IDKNF ++AKVLLQLDKKFIPVVSGGI
Subjt:  ISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGI

Query:  LAVIDQ
        LAVIDQ
Subjt:  LAVIDQ

A0A5A7UH95 DNA mismatch repair protein MLH3 isoform X20.0e+0079.55Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        MGTIKPLPKSVR+SVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGI+RDGLVLLGERYVTSKFHDLID D KGGTFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISD+SLVEIITRACGRANGYRKVLKGCKCLYLGI DDMED GTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK-------------------------------------------TDHVF
        LLCTDPSPSPLSLLRSGFGSEVSRSL ELKIG GDLKLSGYI SPFD+FSIK                                           TD VF
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIK-------------------------------------------TDHVF

Query:  HIRKRSRSEANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRM
        H RKRSRSEANPAYVLNLECP SFYDLTFESSKT VQFKDWTPILTFIEEAIQQFWKEKYNCGKS+VH  PIVGD+LWKDEDN IS KS +ILSVKK+RM
Subjt:  HIRKRSRSEANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRM

Query:  RSCQASLIDLFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFL
        +SCQASLID+FSPSV+ T+HDDILS+R  DKKA ESS TSSIE DDGD   A+MQ S+QA HF KSWDTPLAKCSTTAV++ND YQ VPE   + E SFL
Subjt:  RSCQASLIDLFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQ-ARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFL

Query:  DRRLNSPEGCDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDI
        DRRLNSP+GCDDIVE+NIFC + KGQSSKMHI+ ITGSA+STPS YFHEFSYDD IF GNKPSLTGCSS SSF    PY+QNDVI RTQMQGM DDE DI
Subjt:  DRRLNSPEGCDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDI

Query:  LRLDAYIEDSDFCAGTSLHAEKFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISS
        ++LDAYI+ SDFCAG+SLHAE FLSSYQTRNSPN H+TS SILATE DVDCFSVRDEVERSWRSRD+TPFK LVDDDEKGC FDYDIMLSSS K NY SS
Subjt:  LRLDAYIEDSDFCAGTSLHAEKFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISS

Query:  YIDSTLIIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCL
        Y DS  I+DDVFDTRE L  FLKKSNNF+HSSP SPDMHS QKYF NWRLP RDCEKAY SSE KF HQA +QKY SVERPRRGKSAPPFYKRKTSFYCL
Subjt:  YIDSTLIIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCL

Query:  DQRKAERPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSN
        DQ+KAERPNA SFYC+NE KAD+ SA++ Y MDQGKVE LKASVFLDSPPHLE  ELRDS+H SGTSN+YVKPFPVDDLL+ TRSSRTDTIKM AIMG++
Subjt:  DQRKAERPNATSFYCMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSN

Query:  DEKQGEISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIP
        +EKQGEISKQSQSDVK                              RNEDS+AFDDEVSILDISSGFLSLASNSLVPD IDKNF ++AKVLLQLDKKFIP
Subjt:  DEKQGEISKQSQSDVK------------------------------RNEDSNAFDDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIP

Query:  VVSGGILAVIDQ
        VVSGGILAVIDQ
Subjt:  VVSGGILAVIDQ

SwissProt top hitse value%identityAlignment
A0KSR5 DNA mismatch repair protein MutL3.7e-3231.45Show/hide
Query:  IKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTS-YVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEALAS
        I+ LP  + + + AG ++     VV+ELV NSLDAGA++I I I  G S  +K+ D+GSGI +D L L   R+ TSK H L D++A   +FGFRGEALAS
Subjt:  IKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTS-YVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEALAS

Query:  ISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESILL
        IS VS + + +R   +   ++   +G   + + +      VG+T+ V DLF+N P RR+ ++S   +  H + + + R ALV   + F +  +  + +  
Subjt:  ISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESILL

Query:  CTDP--SPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIKTDHVFHIRKR----------------SRSEA-NPAYVLNLECPGSFYD
        C      P  L  L    G + +   L ++    DL+LSGY+ SP+ +    T H F++  R                 ++E   P YVL L+      D
Subjt:  CTDP--SPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIKTDHVFHIRKR----------------SRSEA-NPAYVLNLECPGSFYD

Query:  LTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCG
        +    +K  V+F     +  +I +A+Q   +E    G
Subjt:  LTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCG

B8CX97 DNA mismatch repair protein MutL7.4e-3326.92Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFI-GIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEA
        M  IK LP+SV + + AG ++     VV+ELV NSLDAG++KI I I   G   ++V D+G GI  D + +  +RY TSK  D+ D+ +   + GFRGEA
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFI-GIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEA

Query:  LASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESES
        LASI+ VS+++II+R   +    +  LKG K   +  +     VGT +IV+DLF+N P R K+++++  +  H +   + R AL +  V+F ++   +  
Subjt:  LASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESES

Query:  ILLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIKTDHVFHIRKRSRSE------------------ANPAYVLNLECPGSF
        I+L T  +   L  + + +G E+++SL+++   D  +K+SGYI  P      ++  +F + KR+                     A P   LNL+     
Subjt:  ILLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIKTDHVFHIRKRSRSE------------------ANPAYVLNLECPGSF

Query:  YDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLIDLFSPSVLLTEHDDIL
         D+    +K  V+F     I   I+  I     +     +   ++ P+  D   KD+     +K K     ++   +S  A ++   +P   L   D IL
Subjt:  YDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLIDLFSPSVLLTEHDDIL

Query:  ----SHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVP
             +    KK  + S+   +++++  ++  +  + + D   K + T        +++ N +   +P
Subjt:  ----SHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVP

F4JN26 DNA mismatch repair protein MLH31.2e-11032.33Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        M TIKPLP+ VR S+R+G+I++D+ +VVEELV+NSLDAGA+K+SIF+G+ +  VKVVDDGSG+SRD LVLLGERY TSKFHD  +++    TFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISD+SL+E+ T+A GR NGYRKV+KG KCL+LGIDDD +D GTTV VRDLFY+QPVRRK+MQSSPKKVL ++KKCV R ALVHS VSF ++D ES+  
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIKTDHVFHIRKRSRSEANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTP
        L  T+PS S  SLL    G+E   SL ++ + DG L +SG+  +       K        +R+R ++NP Y+L + CP   Y+ +FE SKT V+FK W P
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIKTDHVFHIRKRSRSEANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTP

Query:  ILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWK-DEDNMISRKSKNILSVKKSRMRSCQASLIDLFSPSVLLTEHDDILSHRLRDKKAHESSRTSSI
        +L FIE      WK+       ++ +     D L K D  ++I  K +       S +    A   +   P+            + + K++++ +  SS+
Subjt:  ILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWK-DEDNMISRKSKNILSVKKSRMRSCQASLIDLFSPSVLLTEHDDILSHRLRDKKAHESSRTSSI

Query:  ELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGCDDIVEDNIFCPNLKGQSSKMHINMITGSADSTP
             D           D+FS   D    +C       N   Q          DS L  R    +  +D  +            SK     +T    +TP
Subjt:  ELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGCDDIVEDNIFCPNLKGQSSKMHINMITGSADSTP

Query:  SFYFHEFSYD----DYIFTG-----------NKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIEDSDFCAGTSLHAEK-----F
            H+F  D    ++ F G            K  L GCSSR S    EP + +     + +  +P+++    R+    E   +C    ++++K      
Subjt:  SFYFHEFSYD----DYIFTG-----------NKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIEDSDFCAGTSLHAEK-----F

Query:  LSSYQTR-------NSPNGHV-TSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTLIIDDV--FD
         SS+Q         +S  G V        T  D   F   DE   S +          V         ++  M S+ +   + S Y     I++      
Subjt:  LSSYQTR-------NSPNGHV-TSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTLIIDDV--FD

Query:  TREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNW-------RLPGRDCEKAY-ESSELKFEHQALQQKYFSV----------ERPRRGKSAPPFYKRKT
                   +NN K    + P+M  C+    ++       +L  + C+ ++  + +++ +  +++++ FS           +R +R +SAPPFY+ K 
Subjt:  TREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNW-------RLPGRDCEKAY-ESSELKFEHQALQQKYFSV----------ERPRRGKSAPPFYKRKT

Query:  SFYCLDQRKAERPNATSFYCMNERKADKFSATNLYGMDQ---GKVENLKASVFLD-SPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTI
         F  L  +   +P          + +D     +L  + Q       +LK S+  D S  H++  E    K  S  S+             G R+  ++T 
Subjt:  SFYCLDQRKAERPNATSFYCMNERKADKFSATNLYGMDQ---GKVENLKASVFLD-SPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTI

Query:  KMPAIMGSNDEKQGEISKQSQSDVKRN-------EDSNAFDDEVSILDISSGFLSLASN-SLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGILAVIDQ
                + E+  +  K S +  + N       ++S+    +  + DISSG L L S+ SLVP+SI+++  EDAKVL Q+DKK+IP+V+ G +A++DQ
Subjt:  KMPAIMGSNDEKQGEISKQSQSDVKRN-------EDSNAFDDEVSILDISSGFLSLASN-SLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGILAVIDQ

Q0HR40 DNA mismatch repair protein MutL6.3e-3228.85Show/hide
Query:  IKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTS-YVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEALAS
        I+ LP  + + + AG ++     VV+ELV NSLDAGA++I I I  G S  +K+ D+GSGI +D L L   R+ TSK H L D++A   +FGFRGEALAS
Subjt:  IKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTS-YVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEALAS

Query:  ISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIV-DSESESIL
        IS VS + + +R   +   ++   +G   + + +      VG+T+ V DLF+N P RR+ ++S   +  H + + + R ALV   + F +  + ++    
Subjt:  ISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIV-DSESESIL

Query:  LCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIKTDHVFHIRKR----------------SRSEA-NPAYVLNLECPGSFYDL
              P  L  L    G + +   L ++    DL+LSGY+ SP+ +    T H F++  R                 ++E   P YVL L+      D+
Subjt:  LCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIKTDHVFHIRKR----------------SRSEA-NPAYVLNLECPGSFYDL

Query:  TFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCG--KSLVHMTPIVGDQLWKDEDNMISRKSKNILSVK----KSRMRSCQASLIDLFSPSVLLTEHD
            +K  V+F     +  +I +A+Q   +E    G  +     +P V D++   E    ++   +   ++    K+     +AS +D     +     D
Subjt:  TFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCG--KSLVHMTPIVGDQLWKDEDNMISRKSKNILSVK----KSRMRSCQASLIDLFSPSVLLTEHD

Query:  DILSHRLRD
          L  R RD
Subjt:  DILSHRLRD

Q9UHC1 DNA mismatch repair protein Mlh39.7e-3326.71Show/hide
Query:  IKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEALASI
        IK L   V++ +R+G+ +  + + VEEL  NS+DA A  +++ + + T  V+V+D+G G+  D +  +G RY TSK H + D++     +GFRGEALA+I
Subjt:  IKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEALASI

Query:  SDV-SLVEIITRACGRANGYRKVLKGCKCL-YLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESIL
        +D+ S VEI ++       + K+ +  K L     D      GTTV V +LFY  PVRRK M   P+     V++ +   +L+H  +SF + +  S S++
Subjt:  SDV-SLVEIITRACGRANGYRKVLKGCKCL-YLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESIL

Query:  LCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYS------------------------PFDSFSIKTDHVF--------------HIRKRS
        L    +    S     +G   S+ L E+     + +LSGYI S                            F ++ + +                +R RS
Subjt:  LCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYS------------------------PFDSFSIKTDHVF--------------HIRKRS

Query:  RSEANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWK-DEDNMIS----RKSKNILSVKKSRMR
          E    YV+N++C    YD+  E +KT+++F++W  +L  I+E ++ F K++    K  V ++   G+ + +  EDN  S       K + S ++S  +
Subjt:  RSEANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWK-DEDNMIS----RKSKNILSVKKSRMR

Query:  SCQASLIDLFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQAD-HFSKSWDTPLAKCSTTAVQSNDS
            +++D +       E  ++ S  ++ K   E+  T S      D +A  + +N A  +  +S     +K +  ++Q+ DS
Subjt:  SCQASLIDLFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQAD-HFSKSWDTPLAKCSTTAVQSNDS

Arabidopsis top hitse value%identityAlignment
AT4G02460.1 DNA mismatch repair protein, putative7.9e-2227.25Show/hide
Query:  IKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFI-GIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEALAS
        I+P+ ++V   + +G ++ D++  V+ELV NSLDAGA+ I I +   G  Y +V+D+G GIS     +L  ++ TSK  D  D+     T+GFRGEAL+S
Subjt:  IKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFI-GIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEALAS

Query:  ISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSF---KIVDSESES
        +  +  + + TR                 L          +GTTV VR LF N PVR K  + + +K    +   +   AL+   V F          +S
Subjt:  ISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSF---KIVDSESES

Query:  ILLCTDPSPSPLSLLRSGFGSEVSRSLLELKI-GDGDLKLSGYIYSPFDS----------FSIK---------TDHVFHIRKRSRSEANPAYVLNLECPG
        ++L T    S    + + FG     SL  + I    D ++ G++  P             F I          +  V  + K + S   P  +L+   PG
Subjt:  ILLCTDPSPSPLSLLRSGFGSEVSRSLLELKI-GDGDLKLSGYIYSPFDS----------FSIK---------TDHVFHIRKRSRSEANPAYVLNLECPG

Query:  SFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFW
           DL     K  V F D T ++  + E + + +
Subjt:  SFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFW

AT4G09140.1 MUTL-homologue 14.2e-2333.61Show/hide
Query:  IKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFI-GIGTSYVKVVDDGSGISRDGLVLLGERYVTS---KFHDLIDMDAKGGTFGFRGEA
        I+ L +SV + + AG ++      V+ELV NSLDA +S IS+ +   G   ++V DDG GI R+ L +L ER+ TS   KF DL  +     + GFRGEA
Subjt:  IKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFI-GIGTSYVKVVDDGSGISRDGLVLLGERYVTS---KFHDLIDMDAKGGTFGFRGEA

Query:  LASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMED--------VGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFK
        LAS++ V+ V + T   G+ +GYR   +         D  ME          GT ++V +LFYN   RRK +Q+S       +   + R A+ ++ VSF 
Subjt:  LASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMED--------VGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFK

Query:  IVDSESESILLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGD
             +    + +  SPS L  +RS +G  V+++L+++++   D
Subjt:  IVDSESESILLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGD

AT4G35520.1 MUTL protein homolog 38.2e-11232.33Show/hide
Query:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL
        M TIKPLP+ VR S+R+G+I++D+ +VVEELV+NSLDAGA+K+SIF+G+ +  VKVVDDGSG+SRD LVLLGERY TSKFHD  +++    TFGFRGEAL
Subjt:  MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEAL

Query:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI
        ASISD+SL+E+ T+A GR NGYRKV+KG KCL+LGIDDD +D GTTV VRDLFY+QPVRRK+MQSSPKKVL ++KKCV R ALVHS VSF ++D ES+  
Subjt:  ASISDVSLVEIITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESI

Query:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIKTDHVFHIRKRSRSEANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTP
        L  T+PS S  SLL    G+E   SL ++ + DG L +SG+  +       K        +R+R ++NP Y+L + CP   Y+ +FE SKT V+FK W P
Subjt:  LLCTDPSPSPLSLLRSGFGSEVSRSLLELKIGDGDLKLSGYIYSPFDSFSIKTDHVFHIRKRSRSEANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTP

Query:  ILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWK-DEDNMISRKSKNILSVKKSRMRSCQASLIDLFSPSVLLTEHDDILSHRLRDKKAHESSRTSSI
        +L FIE      WK+       ++ +     D L K D  ++I  K +       S +    A   +   P+            + + K++++ +  SS+
Subjt:  ILTFIEEAIQQFWKEKYNCGKSLVHMTPIVGDQLWK-DEDNMISRKSKNILSVKKSRMRSCQASLIDLFSPSVLLTEHDDILSHRLRDKKAHESSRTSSI

Query:  ELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGCDDIVEDNIFCPNLKGQSSKMHINMITGSADSTP
             D           D+FS   D    +C       N   Q          DS L  R    +  +D  +            SK     +T    +TP
Subjt:  ELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDSYQWVPENHLIYEDSFLDRRLNSPEGCDDIVEDNIFCPNLKGQSSKMHINMITGSADSTP

Query:  SFYFHEFSYD----DYIFTG-----------NKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIEDSDFCAGTSLHAEK-----F
            H+F  D    ++ F G            K  L GCSSR S    EP + +     + +  +P+++    R+    E   +C    ++++K      
Subjt:  SFYFHEFSYD----DYIFTG-----------NKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMPDDEDDILRLDAYIEDSDFCAGTSLHAEK-----F

Query:  LSSYQTR-------NSPNGHV-TSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTLIIDDV--FD
         SS+Q         +S  G V        T  D   F   DE   S +          V         ++  M S+ +   + S Y     I++      
Subjt:  LSSYQTR-------NSPNGHV-TSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDSTLIIDDV--FD

Query:  TREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNW-------RLPGRDCEKAY-ESSELKFEHQALQQKYFSV----------ERPRRGKSAPPFYKRKT
                   +NN K    + P+M  C+    ++       +L  + C+ ++  + +++ +  +++++ FS           +R +R +SAPPFY+ K 
Subjt:  TREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNW-------RLPGRDCEKAY-ESSELKFEHQALQQKYFSV----------ERPRRGKSAPPFYKRKT

Query:  SFYCLDQRKAERPNATSFYCMNERKADKFSATNLYGMDQ---GKVENLKASVFLD-SPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTI
         F  L  +   +P          + +D     +L  + Q       +LK S+  D S  H++  E    K  S  S+             G R+  ++T 
Subjt:  SFYCLDQRKAERPNATSFYCMNERKADKFSATNLYGMDQ---GKVENLKASVFLD-SPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTI

Query:  KMPAIMGSNDEKQGEISKQSQSDVKRN-------EDSNAFDDEVSILDISSGFLSLASN-SLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGILAVIDQ
                + E+  +  K S +  + N       ++S+    +  + DISSG L L S+ SLVP+SI+++  EDAKVL Q+DKK+IP+V+ G +A++DQ
Subjt:  KMPAIMGSNDEKQGEISKQSQSDVKRN-------EDSNAFDDEVSILDISSGFLSLASN-SLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGILAVIDQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGACTATCAAGCCCCTTCCGAAGTCTGTTCGTAGTTCTGTGCGTGCTGGCGTTATTCTCTATGATGTCACGAAGGTTGTGGAAGAGCTTGTTTATAATAGCTTGGA
TGCTGGTGCGTCAAAGATTTCAATTTTCATTGGCATTGGGACATCCTATGTTAAAGTAGTGGACGATGGATCTGGTATTAGTCGGGATGGGTTGGTCTTGCTAGGAGAAA
GATATGTGACATCAAAATTTCATGATCTCATCGACATGGATGCCAAAGGTGGAACGTTTGGCTTTAGAGGTGAGGCATTGGCTTCCATTTCAGATGTATCGTTGGTGGAG
ATCATAACTAGAGCATGTGGGAGGGCAAATGGATATCGTAAAGTCTTAAAGGGTTGCAAATGCTTGTATCTTGGAATTGACGATGATATGGAAGATGTTGGTACTACTGT
TATTGTTCGAGATCTGTTTTACAACCAACCAGTTCGAAGGAAACATATGCAATCCAGCCCCAAGAAGGTCTTGCATGCAGTCAAGAAATGTGTAGTTCGAACTGCCCTTG
TACATTCTAAAGTTTCCTTCAAAATTGTAGATAGTGAAAGTGAGAGTATCCTTCTTTGCACGGATCCCTCTCCTTCTCCTTTATCACTTTTGAGAAGTGGCTTTGGTAGT
GAGGTTTCCAGGTCTCTCCTTGAATTAAAAATCGGTGATGGGGACTTAAAGCTTTCTGGCTATATATACAGTCCGTTTGATAGTTTCAGTATCAAGACTGACCATGTCTT
CCACATAAGGAAGAGAAGCAGGTCTGAAGCAAATCCTGCTTATGTTTTAAATTTAGAGTGCCCTGGATCTTTTTACGACCTAACATTTGAATCATCCAAGACCGTTGTTC
AGTTTAAGGACTGGACCCCAATACTTACCTTCATTGAAGAGGCCATTCAACAATTTTGGAAAGAAAAATATAATTGTGGAAAATCTTTGGTCCATATGACCCCCATAGTT
GGAGATCAGCTGTGGAAGGATGAAGACAACATGATTTCAAGAAAATCAAAGAATATTCTATCTGTGAAGAAGAGCAGAATGCGAAGCTGTCAGGCCTCCCTTATTGATTT
GTTTTCACCATCAGTCTTGCTCACGGAACATGATGACATCTTGTCCCATAGGTTGCGTGACAAAAAGGCACATGAGAGTTCACGCACAAGTTCAATTGAATTGGATGACG
GTGACCAACAAGCTAGGATGCAACTTAGTAACCAAGCCGATCATTTCTCAAAATCATGGGATACTCCCTTGGCAAAATGCTCAACTACAGCTGTCCAAAGCAATGACAGT
TATCAATGGGTACCTGAAAACCATTTAATATATGAAGACTCCTTTTTGGATAGAAGACTGAATTCTCCCGAAGGATGTGATGACATTGTGGAGGACAATATCTTCTGTCC
AAATTTGAAAGGTCAATCATCTAAAATGCATATCAATATGATCACTGGGTCTGCGGACAGTACACCATCTTTTTACTTTCATGAATTTAGCTACGATGACTATATCTTCA
CGGGTAACAAACCCTCACTTACGGGATGCTCCTCAAGGAGCAGTTTTCGACTTGATGAACCGTACGTTCAAAACGATGTCATCAAAAGAACCCAAATGCAAGGAATGCCT
GATGATGAAGATGATATTCTAAGACTTGATGCTTACATCGAGGATTCTGATTTTTGTGCCGGAACCTCATTGCATGCTGAGAAGTTTTTGTCAAGTTATCAGACCAGAAA
TTCCCCAAACGGTCACGTGACTTCAAGTTCCATATTAGCCACAGAACGGGATGTTGATTGCTTCAGTGTTAGGGATGAGGTTGAAAGGAGCTGGAGATCTAGAGATAAGA
CGCCCTTCAAGGATTTAGTGGATGATGATGAAAAGGGCTGCGAATTTGATTATGATATCATGTTGAGTAGTTCCAACAAAAAGAATTACATATCAAGCTATATCGATAGT
ACACTGATAATTGATGATGTTTTCGATACAAGAGAAGGCCTTAGTACCTTCCTTAAAAAATCTAATAATTTTAAACATTCTTCTCCTATGAGTCCAGATATGCACTCCTG
TCAGAAGTATTTTTTCAATTGGAGATTGCCTGGAAGAGATTGTGAAAAGGCTTATGAAAGCTCAGAGCTTAAGTTTGAACATCAAGCTTTGCAACAAAAGTATTTTTCTG
TTGAAAGGCCTAGAAGAGGCAAATCGGCTCCACCTTTCTACAAAAGAAAGACTAGTTTCTATTGCCTGGACCAAAGAAAGGCAGAAAGGCCTAATGCCACTAGTTTCTAT
TGCATGAACGAAAGAAAAGCTGATAAGTTTAGTGCCACTAATCTCTATGGCATGGACCAAGGGAAAGTTGAAAATCTTAAGGCATCGGTCTTCCTTGACAGCCCACCTCA
TTTAGAACTAGGTGAGCTGAGAGATTCCAAACATTTCTCTGGTACTAGTAATCGATATGTTAAGCCATTTCCTGTTGATGACTTATTGATGGGAACCAGATCTTCCAGAA
CAGATACGATAAAGATGCCTGCTATTATGGGAAGTAATGACGAGAAACAAGGAGAGATTTCCAAGCAGTCCCAATCCGATGTTAAGAGAAATGAGGATTCAAATGCTTTT
GACGATGAAGTTAGTATACTTGATATCTCTTCAGGGTTTCTATCTCTTGCTAGCAATTCCTTAGTTCCCGATTCCATCGATAAAAATTTCTTTGAAGATGCAAAAGTTCT
TCTACAGCTGGATAAGAAATTCATTCCAGTTGTTTCCGGTGGAATACTGGCTGTTATTGATCAGCCCACTTCAAAGCTCACCAGGCTCACCTACACAGGCTCATATGTTG
GAGCACATGCCAGACCCTTGGAACAACTACTACCCTCCCTTTTTTCATCCAAAACCCATCCTTTTAACCCTATGCCTCTTATGATTCCTCTGGCCCTATGGTTAGATGAA
TGTGGTTTATGCATCGTGTCTATACGAGCCAGGATAAACAATAAAGGAGCTGGAAATACTGGGCAAAAGTGGTTCAAATGGAACATGGAATGGTCTGGGACAACACGAGC
TGAGCATAGAAAGGATGTTGCAGCGCTTGAGTTCGGCCGAAAGTTTATAGTACTCCTGGGAAATGCTGCCTTTTCATCGAGGAGACTAATTGCCCAGGGGCGGAAATACC
AGTCTGCAGAAGATAATGGATTGAGTAGCATTGTTAGATCATGTATAGAACCAGAAGTTGGCGATGATGATTTGTCAAAAAAGAGCAGTGTTTCTGTAGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGACTATCAAGCCCCTTCCGAAGTCTGTTCGTAGTTCTGTGCGTGCTGGCGTTATTCTCTATGATGTCACGAAGGTTGTGGAAGAGCTTGTTTATAATAGCTTGGA
TGCTGGTGCGTCAAAGATTTCAATTTTCATTGGCATTGGGACATCCTATGTTAAAGTAGTGGACGATGGATCTGGTATTAGTCGGGATGGGTTGGTCTTGCTAGGAGAAA
GATATGTGACATCAAAATTTCATGATCTCATCGACATGGATGCCAAAGGTGGAACGTTTGGCTTTAGAGGTGAGGCATTGGCTTCCATTTCAGATGTATCGTTGGTGGAG
ATCATAACTAGAGCATGTGGGAGGGCAAATGGATATCGTAAAGTCTTAAAGGGTTGCAAATGCTTGTATCTTGGAATTGACGATGATATGGAAGATGTTGGTACTACTGT
TATTGTTCGAGATCTGTTTTACAACCAACCAGTTCGAAGGAAACATATGCAATCCAGCCCCAAGAAGGTCTTGCATGCAGTCAAGAAATGTGTAGTTCGAACTGCCCTTG
TACATTCTAAAGTTTCCTTCAAAATTGTAGATAGTGAAAGTGAGAGTATCCTTCTTTGCACGGATCCCTCTCCTTCTCCTTTATCACTTTTGAGAAGTGGCTTTGGTAGT
GAGGTTTCCAGGTCTCTCCTTGAATTAAAAATCGGTGATGGGGACTTAAAGCTTTCTGGCTATATATACAGTCCGTTTGATAGTTTCAGTATCAAGACTGACCATGTCTT
CCACATAAGGAAGAGAAGCAGGTCTGAAGCAAATCCTGCTTATGTTTTAAATTTAGAGTGCCCTGGATCTTTTTACGACCTAACATTTGAATCATCCAAGACCGTTGTTC
AGTTTAAGGACTGGACCCCAATACTTACCTTCATTGAAGAGGCCATTCAACAATTTTGGAAAGAAAAATATAATTGTGGAAAATCTTTGGTCCATATGACCCCCATAGTT
GGAGATCAGCTGTGGAAGGATGAAGACAACATGATTTCAAGAAAATCAAAGAATATTCTATCTGTGAAGAAGAGCAGAATGCGAAGCTGTCAGGCCTCCCTTATTGATTT
GTTTTCACCATCAGTCTTGCTCACGGAACATGATGACATCTTGTCCCATAGGTTGCGTGACAAAAAGGCACATGAGAGTTCACGCACAAGTTCAATTGAATTGGATGACG
GTGACCAACAAGCTAGGATGCAACTTAGTAACCAAGCCGATCATTTCTCAAAATCATGGGATACTCCCTTGGCAAAATGCTCAACTACAGCTGTCCAAAGCAATGACAGT
TATCAATGGGTACCTGAAAACCATTTAATATATGAAGACTCCTTTTTGGATAGAAGACTGAATTCTCCCGAAGGATGTGATGACATTGTGGAGGACAATATCTTCTGTCC
AAATTTGAAAGGTCAATCATCTAAAATGCATATCAATATGATCACTGGGTCTGCGGACAGTACACCATCTTTTTACTTTCATGAATTTAGCTACGATGACTATATCTTCA
CGGGTAACAAACCCTCACTTACGGGATGCTCCTCAAGGAGCAGTTTTCGACTTGATGAACCGTACGTTCAAAACGATGTCATCAAAAGAACCCAAATGCAAGGAATGCCT
GATGATGAAGATGATATTCTAAGACTTGATGCTTACATCGAGGATTCTGATTTTTGTGCCGGAACCTCATTGCATGCTGAGAAGTTTTTGTCAAGTTATCAGACCAGAAA
TTCCCCAAACGGTCACGTGACTTCAAGTTCCATATTAGCCACAGAACGGGATGTTGATTGCTTCAGTGTTAGGGATGAGGTTGAAAGGAGCTGGAGATCTAGAGATAAGA
CGCCCTTCAAGGATTTAGTGGATGATGATGAAAAGGGCTGCGAATTTGATTATGATATCATGTTGAGTAGTTCCAACAAAAAGAATTACATATCAAGCTATATCGATAGT
ACACTGATAATTGATGATGTTTTCGATACAAGAGAAGGCCTTAGTACCTTCCTTAAAAAATCTAATAATTTTAAACATTCTTCTCCTATGAGTCCAGATATGCACTCCTG
TCAGAAGTATTTTTTCAATTGGAGATTGCCTGGAAGAGATTGTGAAAAGGCTTATGAAAGCTCAGAGCTTAAGTTTGAACATCAAGCTTTGCAACAAAAGTATTTTTCTG
TTGAAAGGCCTAGAAGAGGCAAATCGGCTCCACCTTTCTACAAAAGAAAGACTAGTTTCTATTGCCTGGACCAAAGAAAGGCAGAAAGGCCTAATGCCACTAGTTTCTAT
TGCATGAACGAAAGAAAAGCTGATAAGTTTAGTGCCACTAATCTCTATGGCATGGACCAAGGGAAAGTTGAAAATCTTAAGGCATCGGTCTTCCTTGACAGCCCACCTCA
TTTAGAACTAGGTGAGCTGAGAGATTCCAAACATTTCTCTGGTACTAGTAATCGATATGTTAAGCCATTTCCTGTTGATGACTTATTGATGGGAACCAGATCTTCCAGAA
CAGATACGATAAAGATGCCTGCTATTATGGGAAGTAATGACGAGAAACAAGGAGAGATTTCCAAGCAGTCCCAATCCGATGTTAAGAGAAATGAGGATTCAAATGCTTTT
GACGATGAAGTTAGTATACTTGATATCTCTTCAGGGTTTCTATCTCTTGCTAGCAATTCCTTAGTTCCCGATTCCATCGATAAAAATTTCTTTGAAGATGCAAAAGTTCT
TCTACAGCTGGATAAGAAATTCATTCCAGTTGTTTCCGGTGGAATACTGGCTGTTATTGATCAGCCCACTTCAAAGCTCACCAGGCTCACCTACACAGGCTCATATGTTG
GAGCACATGCCAGACCCTTGGAACAACTACTACCCTCCCTTTTTTCATCCAAAACCCATCCTTTTAACCCTATGCCTCTTATGATTCCTCTGGCCCTATGGTTAGATGAA
TGTGGTTTATGCATCGTGTCTATACGAGCCAGGATAAACAATAAAGGAGCTGGAAATACTGGGCAAAAGTGGTTCAAATGGAACATGGAATGGTCTGGGACAACACGAGC
TGAGCATAGAAAGGATGTTGCAGCGCTTGAGTTCGGCCGAAAGTTTATAGTACTCCTGGGAAATGCTGCCTTTTCATCGAGGAGACTAATTGCCCAGGGGCGGAAATACC
AGTCTGCAGAAGATAATGGATTGAGTAGCATTGTTAGATCATGTATAGAACCAGAAGTTGGCGATGATGATTTGTCAAAAAAGAGCAGTGTTTCTGTAGTTTAG
Protein sequenceShow/hide protein sequence
MGTIKPLPKSVRSSVRAGVILYDVTKVVEELVYNSLDAGASKISIFIGIGTSYVKVVDDGSGISRDGLVLLGERYVTSKFHDLIDMDAKGGTFGFRGEALASISDVSLVE
IITRACGRANGYRKVLKGCKCLYLGIDDDMEDVGTTVIVRDLFYNQPVRRKHMQSSPKKVLHAVKKCVVRTALVHSKVSFKIVDSESESILLCTDPSPSPLSLLRSGFGS
EVSRSLLELKIGDGDLKLSGYIYSPFDSFSIKTDHVFHIRKRSRSEANPAYVLNLECPGSFYDLTFESSKTVVQFKDWTPILTFIEEAIQQFWKEKYNCGKSLVHMTPIV
GDQLWKDEDNMISRKSKNILSVKKSRMRSCQASLIDLFSPSVLLTEHDDILSHRLRDKKAHESSRTSSIELDDGDQQARMQLSNQADHFSKSWDTPLAKCSTTAVQSNDS
YQWVPENHLIYEDSFLDRRLNSPEGCDDIVEDNIFCPNLKGQSSKMHINMITGSADSTPSFYFHEFSYDDYIFTGNKPSLTGCSSRSSFRLDEPYVQNDVIKRTQMQGMP
DDEDDILRLDAYIEDSDFCAGTSLHAEKFLSSYQTRNSPNGHVTSSSILATERDVDCFSVRDEVERSWRSRDKTPFKDLVDDDEKGCEFDYDIMLSSSNKKNYISSYIDS
TLIIDDVFDTREGLSTFLKKSNNFKHSSPMSPDMHSCQKYFFNWRLPGRDCEKAYESSELKFEHQALQQKYFSVERPRRGKSAPPFYKRKTSFYCLDQRKAERPNATSFY
CMNERKADKFSATNLYGMDQGKVENLKASVFLDSPPHLELGELRDSKHFSGTSNRYVKPFPVDDLLMGTRSSRTDTIKMPAIMGSNDEKQGEISKQSQSDVKRNEDSNAF
DDEVSILDISSGFLSLASNSLVPDSIDKNFFEDAKVLLQLDKKFIPVVSGGILAVIDQPTSKLTRLTYTGSYVGAHARPLEQLLPSLFSSKTHPFNPMPLMIPLALWLDE
CGLCIVSIRARINNKGAGNTGQKWFKWNMEWSGTTRAEHRKDVAALEFGRKFIVLLGNAAFSSRRLIAQGRKYQSAEDNGLSSIVRSCIEPEVGDDDLSKKSSVSVV