; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G06330 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G06330
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionDNA mismatch repair protein PMS1
Genome locationClcChr09:4903097..4913102
RNA-Seq ExpressionClc09G06330
SyntenyClc09G06330
Gene Ontology termsGO:0006298 - mismatch repair (biological process)
GO:0006772 - thiamine metabolic process (biological process)
GO:0009229 - thiamine diphosphate biosynthetic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0032389 - MutLalpha complex (cellular component)
GO:0004788 - thiamine diphosphokinase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
GO:0030975 - thiamine binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR006282 - Thiamin pyrophosphokinase
IPR038973 - DNA mismatch repair protein MutL/Mlh/Pms
IPR036890 - Histidine kinase/HSP90-like ATPase superfamily
IPR036759 - Thiamin pyrophosphokinase, catalytic domain superfamily
IPR036371 - Thiamin pyrophosphokinase, thiamin-binding domain superfamily
IPR020568 - Ribosomal protein S5 domain 2-type fold
IPR014790 - MutL, C-terminal, dimerisation
IPR014762 - DNA mismatch repair, conserved site
IPR014721 - Ribosomal protein S5 domain 2-type fold, subgroup
IPR013507 - DNA mismatch repair protein, S5 domain 2-like
IPR007373 - Thiamin pyrophosphokinase, thiamin-binding domain
IPR007371 - Thiamin pyrophosphokinase, catalytic domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0068005.1 DNA mismatch repair protein PMS1 [Cucumis melo var. makuwa]0.0e+0080.69Show/hide
Query:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP
        + +  QGTKIFDESEDQDTTDLHKCVAYILQS+PNQ ES+LCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCL+HLLPRTHHHEI + SSVEGP
Subjt:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP

Query:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGAL--------------------------------------------------GQVILDLSSAVKELVEN
        HCGLIPIGMPSGSTTTTGLQWDL+DTEMKFG L                                                  GQVILDLSSAVKELVEN
Subjt:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGAL--------------------------------------------------GQVILDLSSAVKELVEN

Query:  SLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKNESVATHLTFDHSGLLVA
        SLDAGATSIEI+LKDYGEEWFQVIDNGSGISPTNFR                  SLTTYGFRGEALSSLCSLGTLTVET+TKNESVATHLTFDHSGLLVA
Subjt:  SLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKNESVATHLTFDHSGLLVA

Query:  EKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNIITVFGMNTFICLESVSIL
        EKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRF+CTNSAGKNAKSVVFKTQGSGSIKDNIITVFGMNTF CLESVSIL
Subjt:  EKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNIITVFGMNTFICLESVSIL

Query:  LSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKI
        LSDDCKVEGFVSKSGQGSGRN+GDRQFFFVNNRPVDMPKVSKLVNELYK ANSRQYPIAI+NFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKI
Subjt:  LSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKI

Query:  YSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGEENITRKDFALRMHVMKK
        YSPTNACYSVNKVEEPT QVD LELCSDNGKL M LEH S DG              DDDSF KI+ VEQSSH+TEVLNSD EEN+TRKDFALRMH  KK
Subjt:  YSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGEENITRKDFALRMHVMKK

Query:  ADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCADKPSKDVKCTNGNCQVF
        ADALL DHDQHK++YLS+KK V VT SSP +  +GT+TSRVQSSLDKFVTINKRK ETSSAPLSEVPV RNQFLNNQ KKSC D  SKD+KCTNGNC+VF
Subjt:  ADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCADKPSKDVKCTNGNCQVF

Query:  NDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPSGSIKESSSPQLKLCSTL
        +DFSVGNDEDSSIQIKTDRV SK+ LP SSADHSDDGEATEE TGE  AKVH SVIESTA+PTK+LEI+SEDLPL  CS+ PSG +KESSSPQLKLCST 
Subjt:  NDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPSGSIKESSSPQLKLCSTL

Query:  HFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGFIIGKLDQDLLL------
        HFDFHELKKRR+QRQLRYKLNGYTCER KLK HYAAAT+KLSQPDNEDRKARALEAAARELD+LFRKKDF RMKVIGQFNLGFIIGKLDQDL +      
Subjt:  HFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGFIIGKLDQDLLL------

Query:  -------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDLISTLADSEGECSIIGSY
                           L+RPL LELSAEEEVVVSIHMDV RKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDLISTLADSEGECSIIG Y
Subjt:  -------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDLISTLADSEGECSIIGSY

Query:  RMDTADSICP
        RMDTADS+CP
Subjt:  RMDTADSICP

KAG6575837.1 ATPase family AAA domain-containing protein 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0079.82Show/hide
Query:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP
        D +  QGTKIFDESED+DTTDLHKCVAYILQSMPN EES+LCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHV SSVEGP
Subjt:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP

Query:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGAL--------------------------------------------------GQVILDLSSAVKELVEN
        HCGLIPIGMPSGSTTT GLQWDL DTEMKFG L                                                  GQVILDLSSAVKELVEN
Subjt:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGAL--------------------------------------------------GQVILDLSSAVKELVEN

Query:  SLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKNESVATHLTFDHSGLLVA
        SLDAGATS+EI+LKDYGEEWFQVIDNGSGISPTNFR                  SLTTYGFRGEALSSLCSLG LTVETRTKNESVATHLTFDHSGLLVA
Subjt:  SLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKNESVATHLTFDHSGLLVA

Query:  EKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNIITVFGMNTFICLESVSIL
        EKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNIITVFGM TF CLESVSIL
Subjt:  EKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNIITVFGMNTFICLESVSIL

Query:  LSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKI
        LSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFFSDETHILQ LREELLKI
Subjt:  LSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKI

Query:  YSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPD-------------GADDDSFKKIENVEQSSHTTEVLNSDGEENITRKDFALRMHVMKK
        YSP NACYSVNKVEEPT+QVD +ELCSD  KLSM++EH SPD               +DDS K IENVEQSS T EV NSDGEENITRKDFALR+H MKK
Subjt:  YSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPD-------------GADDDSFKKIENVEQSSHTTEVLNSDGEENITRKDFALRMHVMKK

Query:  ADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCADKPSKDVKCTNGNCQVF
        ADA LKD+DQH+K+  SSKK  Q+T SSPCVT +GT TSRVQSSLDKFVT NKRKYET SAPLSEVP+ RNQFLNNQWKK  AD PSKDVKC +G+CQVF
Subjt:  ADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCADKPSKDVKCTNGNCQVF

Query:  NDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEAT-EECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPSGSIKESSSPQLKLCST
        +DFS GNDEDSSIQ  TDRV ++L +PLSSADHSDDGE T EECT E  AKV  SVIEST TP K+LE MSEDLPLS  SVLPSG +KESSSPQLKLCST
Subjt:  NDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEAT-EECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPSGSIKESSSPQLKLCST

Query:  LHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGFIIGKLDQDLLL-----
         HFDFHELKKRR+QRQLRYKLNGY CER KL  HYAAATLKLSQPDNEDRKARALEAAA+ELDRLFRK+DFSRMKVIGQFNLGFIIGKLDQDL +     
Subjt:  LHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGFIIGKLDQDLLL-----

Query:  --------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDLISTLADSEGECSIIGS
                            L+RPLRLELSAEEEVVVSIHMDVIRKNGF +EEDPHALPGNRFRLKAVPFSKNITFGVEDVKDLISTLADSEGECSIIGS
Subjt:  --------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDLISTLADSEGECSIIGS

Query:  YRMDTADSICP
        YRMDTA+S+CP
Subjt:  YRMDTADSICP

XP_008451591.1 PREDICTED: DNA mismatch repair protein PMS1 [Cucumis melo]0.0e+0084.16Show/hide
Query:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN
        GQVILDLSSAVKELVENSLDAGATSIEI+LKDYGEEWFQVIDNGSGISPTNFR                  SLTTYGFRGEALSSLCSLGTLTVET+TKN
Subjt:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN

Query:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
        ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRF+CTNSAGKNAKSVVFKTQGSGSIKDNII
Subjt:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII

Query:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF
        TVFGMNTF CLESVSILLSDDCKVEGFVSKSGQGSGRN+GDRQFFFVNNRPVDMPKVSKLVNELYK ANSRQYPIAI+NFTLPSKACDVNVTPDKRKIFF
Subjt:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF

Query:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGE
        SDETHILQTLREELLKIYSPTNACYSVNKVEEPT QVD LELCSDNGKL M LEH S DG              DDDSF KI+ VEQSSH+TEVLNSD E
Subjt:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGE

Query:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA
        EN+TRKDFALRMH  KKADALL DHDQHK++YLS+KK V VT SSP +  +GT+TSRVQSSLDKFVTINKRK ETSSAPLSEVPV RNQFLNNQ KKSC 
Subjt:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA

Query:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS
        D  SKD+KCTNGNC+VF+DFSVGNDEDSSIQIKTDRV SK+ LP SSADHSDDGEATEE TGE  AKVH SVIESTA+PTK+LEI+SEDLPL  CS+ PS
Subjt:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS

Query:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF
        G +KESSSPQLKLCST HFDFHELKKRR+QRQLRYKLNGYTCER KLK HYAAAT+KLSQPDNEDRKARALEAAARELD+LFRKKDF RMKVIGQFNLGF
Subjt:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF

Query:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL
        IIGKLDQDL +                         L+RPL LELSAEEEVVVSIHMDV RKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL
Subjt:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL

Query:  ISTLADSEGECSIIGSYRMDTADSICP
        ISTLADSEGECSIIG YRMDTADS+CP
Subjt:  ISTLADSEGECSIIGSYRMDTADSICP

XP_011659356.1 DNA mismatch repair protein PMS1 [Cucumis sativus]0.0e+0083.31Show/hide
Query:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN
        GQVILDLSSAVKELVENSLDAGATSIEI+LKDYGEEWFQVIDNGSGISPTNFR                  SLTTYGFRGEALSSLCSLGTLTVET+TKN
Subjt:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN

Query:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
        ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRF+CTNSAGKNAKSVVFKTQGSGSIKDNII
Subjt:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII

Query:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF
        TVFGMNTF CLESV ILLSDDCKV+GFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAI+NFTLPSKACDVNVTPDKRKIFF
Subjt:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF

Query:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGE
        SDETHILQTLREELLKIYSPTNACYSVNKVEEPT QVD LELCSDNGKLSM+LEH S DG              DDDSF KI+NVEQS H+TE+LNSD E
Subjt:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGE

Query:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA
        EN TRKDFALR H  KKAD  L DHDQHK++YLS+KKGV VT  SP ++ +GT+TSRVQSSLDKFVTINKRK ET SAPLSEVPV RNQFLNNQWKK+C 
Subjt:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA

Query:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS
        D  SKD++CTNGN QVF+DF VGNDED SIQ KTDRV+SK+ LP SSADHSDDGEATEECTGE  AKVH SVIESTA+PTK+L +MSEDLPL GCS+ PS
Subjt:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS

Query:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF
        G +KESSSPQLKLCST HFDFHELKKRR QRQLR+KLNGYTCER KLK HYAAATLKLSQ DNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF
Subjt:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF

Query:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL
        IIGKLDQDL +                         L+RPL LELSAEEEVVVSIHMDV RKNGFTIEEDP +LPGNRFRLKAVPFSKNITFGVEDVKDL
Subjt:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL

Query:  ISTLADSEGECSIIGSYRMDTADSICP
        ISTLADSEGECSIIGSYRMDTADS+CP
Subjt:  ISTLADSEGECSIIGSYRMDTADSICP

XP_038899185.1 DNA mismatch repair protein PMS1 isoform X1 [Benincasa hispida]0.0e+0086.46Show/hide
Query:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN
        GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR                  SLTTYGFRGEALSSLCSLGTLTVETRT N
Subjt:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN

Query:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
        ESVATHLTFDHSGLLVAEKKTARQVGTT+MVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
Subjt:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII

Query:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF
        TVFGMNTF CLESVSILLSDDCK+EGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKA DVNVTPDKRKIFF
Subjt:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF

Query:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGE
        SDETHILQTLREELLKIYSPTNACYSVNKVEEPT+QVD LELCSDNGKLSM+LEH SPDG              DDDSFK+I+N EQSSHTTEVLNSDGE
Subjt:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGE

Query:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA
        ENITRKDF LRMH MK ADAL+KD DQHKK+YLSSKKGVQVT SSP VT  GT+TSRVQSSLDKFVTINKRKYE  SAPL+EVP+ RNQFLNNQWKKSCA
Subjt:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA

Query:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS
        DKPSKDV+CT+GNCQ F+DFSVGNDEDSSIQIKTDRV SKL LPLSSADHSDDGEATE+CTGE  AKVHPSVIES+ATPTK+LEIMSEDLPLSGCSVLPS
Subjt:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS

Query:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF
        G +KESSSP LKLCST HFDFHELKKRR QR+LRYKLNGYTCER KLK HYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF
Subjt:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF

Query:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL
        IIGKLDQDL +                         L+RPLRLELSAEEEVVVSIHMDV RKNGFTIEEDP AL  NRFRLKAVPFSKNITFGVEDVKDL
Subjt:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL

Query:  ISTLADSEGECSIIGSYRMDTADSICP
        ISTLADSEGECSIIGSYRMDTADS+CP
Subjt:  ISTLADSEGECSIIGSYRMDTADSICP

TrEMBL top hitse value%identityAlignment
A0A0A0K976 Uncharacterized protein0.0e+0083.31Show/hide
Query:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN
        GQVILDLSSAVKELVENSLDAGATSIEI+LKDYGEEWFQVIDNGSGISPTNFR                  SLTTYGFRGEALSSLCSLGTLTVET+TKN
Subjt:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN

Query:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
        ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRF+CTNSAGKNAKSVVFKTQGSGSIKDNII
Subjt:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII

Query:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF
        TVFGMNTF CLESV ILLSDDCKV+GFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAI+NFTLPSKACDVNVTPDKRKIFF
Subjt:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF

Query:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGE
        SDETHILQTLREELLKIYSPTNACYSVNKVEEPT QVD LELCSDNGKLSM+LEH S DG              DDDSF KI+NVEQS H+TE+LNSD E
Subjt:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGE

Query:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA
        EN TRKDFALR H  KKAD  L DHDQHK++YLS+KKGV VT  SP ++ +GT+TSRVQSSLDKFVTINKRK ET SAPLSEVPV RNQFLNNQWKK+C 
Subjt:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA

Query:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS
        D  SKD++CTNGN QVF+DF VGNDED SIQ KTDRV+SK+ LP SSADHSDDGEATEECTGE  AKVH SVIESTA+PTK+L +MSEDLPL GCS+ PS
Subjt:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS

Query:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF
        G +KESSSPQLKLCST HFDFHELKKRR QRQLR+KLNGYTCER KLK HYAAATLKLSQ DNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF
Subjt:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF

Query:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL
        IIGKLDQDL +                         L+RPL LELSAEEEVVVSIHMDV RKNGFTIEEDP +LPGNRFRLKAVPFSKNITFGVEDVKDL
Subjt:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL

Query:  ISTLADSEGECSIIGSYRMDTADSICP
        ISTLADSEGECSIIGSYRMDTADS+CP
Subjt:  ISTLADSEGECSIIGSYRMDTADSICP

A0A1S3BSM9 DNA mismatch repair protein PMS10.0e+0084.16Show/hide
Query:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN
        GQVILDLSSAVKELVENSLDAGATSIEI+LKDYGEEWFQVIDNGSGISPTNFR                  SLTTYGFRGEALSSLCSLGTLTVET+TKN
Subjt:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN

Query:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
        ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRF+CTNSAGKNAKSVVFKTQGSGSIKDNII
Subjt:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII

Query:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF
        TVFGMNTF CLESVSILLSDDCKVEGFVSKSGQGSGRN+GDRQFFFVNNRPVDMPKVSKLVNELYK ANSRQYPIAI+NFTLPSKACDVNVTPDKRKIFF
Subjt:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF

Query:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGE
        SDETHILQTLREELLKIYSPTNACYSVNKVEEPT QVD LELCSDNGKL M LEH S DG              DDDSF KI+ VEQSSH+TEVLNSD E
Subjt:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGE

Query:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA
        EN+TRKDFALRMH  KKADALL DHDQHK++YLS+KK V VT SSP +  +GT+TSRVQSSLDKFVTINKRK ETSSAPLSEVPV RNQFLNNQ KKSC 
Subjt:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA

Query:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS
        D  SKD+KCTNGNC+VF+DFSVGNDEDSSIQIKTDRV SK+ LP SSADHSDDGEATEE TGE  AKVH SVIESTA+PTK+LEI+SEDLPL  CS+ PS
Subjt:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS

Query:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF
        G +KESSSPQLKLCST HFDFHELKKRR+QRQLRYKLNGYTCER KLK HYAAAT+KLSQPDNEDRKARALEAAARELD+LFRKKDF RMKVIGQFNLGF
Subjt:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF

Query:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL
        IIGKLDQDL +                         L+RPL LELSAEEEVVVSIHMDV RKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL
Subjt:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL

Query:  ISTLADSEGECSIIGSYRMDTADSICP
        ISTLADSEGECSIIG YRMDTADS+CP
Subjt:  ISTLADSEGECSIIGSYRMDTADSICP

A0A5D3D2J3 Thiamine diphosphokinase0.0e+0080.69Show/hide
Query:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP
        + +  QGTKIFDESEDQDTTDLHKCVAYILQS+PNQ ES+LCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCL+HLLPRTHHHEI + SSVEGP
Subjt:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP

Query:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGAL--------------------------------------------------GQVILDLSSAVKELVEN
        HCGLIPIGMPSGSTTTTGLQWDL+DTEMKFG L                                                  GQVILDLSSAVKELVEN
Subjt:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGAL--------------------------------------------------GQVILDLSSAVKELVEN

Query:  SLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKNESVATHLTFDHSGLLVA
        SLDAGATSIEI+LKDYGEEWFQVIDNGSGISPTNFR                  SLTTYGFRGEALSSLCSLGTLTVET+TKNESVATHLTFDHSGLLVA
Subjt:  SLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKNESVATHLTFDHSGLLVA

Query:  EKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNIITVFGMNTFICLESVSIL
        EKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRF+CTNSAGKNAKSVVFKTQGSGSIKDNIITVFGMNTF CLESVSIL
Subjt:  EKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNIITVFGMNTFICLESVSIL

Query:  LSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKI
        LSDDCKVEGFVSKSGQGSGRN+GDRQFFFVNNRPVDMPKVSKLVNELYK ANSRQYPIAI+NFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKI
Subjt:  LSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKI

Query:  YSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGEENITRKDFALRMHVMKK
        YSPTNACYSVNKVEEPT QVD LELCSDNGKL M LEH S DG              DDDSF KI+ VEQSSH+TEVLNSD EEN+TRKDFALRMH  KK
Subjt:  YSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGA-------------DDDSFKKIENVEQSSHTTEVLNSDGEENITRKDFALRMHVMKK

Query:  ADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCADKPSKDVKCTNGNCQVF
        ADALL DHDQHK++YLS+KK V VT SSP +  +GT+TSRVQSSLDKFVTINKRK ETSSAPLSEVPV RNQFLNNQ KKSC D  SKD+KCTNGNC+VF
Subjt:  ADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCADKPSKDVKCTNGNCQVF

Query:  NDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPSGSIKESSSPQLKLCSTL
        +DFSVGNDEDSSIQIKTDRV SK+ LP SSADHSDDGEATEE TGE  AKVH SVIESTA+PTK+LEI+SEDLPL  CS+ PSG +KESSSPQLKLCST 
Subjt:  NDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPSGSIKESSSPQLKLCSTL

Query:  HFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGFIIGKLDQDLLL------
        HFDFHELKKRR+QRQLRYKLNGYTCER KLK HYAAAT+KLSQPDNEDRKARALEAAARELD+LFRKKDF RMKVIGQFNLGFIIGKLDQDL +      
Subjt:  HFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGFIIGKLDQDLLL------

Query:  -------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDLISTLADSEGECSIIGSY
                           L+RPL LELSAEEEVVVSIHMDV RKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDLISTLADSEGECSIIG Y
Subjt:  -------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDLISTLADSEGECSIIGSY

Query:  RMDTADSICP
        RMDTADS+CP
Subjt:  RMDTADSICP

A0A6J1GQS5 DNA mismatch repair protein PMS1 isoform X10.0e+0082.97Show/hide
Query:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN
        GQVILDLSSAVKELVENSLDAGATS+EI+LKDYGEEWFQVIDNGSGISPTNFR                  SLTTYGFRGEALSSLCSLG LTVETRTKN
Subjt:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN

Query:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
        ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
Subjt:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII

Query:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF
        TVFGM TF CLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF
Subjt:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF

Query:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPD-------------GADDDSFKKIENVEQSSHTTEVLNSDGE
        SDETHILQ LREELLKIYSP NAC+SVNKVEEPT+QVD +ELCSD  KLSM++EH SPD               +DDS K IENVEQSS T EV++SDGE
Subjt:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPD-------------GADDDSFKKIENVEQSSHTTEVLNSDGE

Query:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA
        ENITRKDFALR+H MKKADA LKD+DQHKK+  SSKKG Q+T SSPCVT +GTNTSRVQSSLDKFVT NKRKYET SAPLSEVP+ RNQFLNNQWKK  A
Subjt:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA

Query:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEAT-EECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLP
        D PSKDVKCT+GNCQVF+DFS GNDEDSSIQ  TDRV ++L +PLSSADHSDDGE T EECT E  AKV  SVIEST TP K+LE MSEDLPLS  SVLP
Subjt:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEAT-EECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLP

Query:  SGSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLG
        SG +KESSSPQLKLCST HFDFHELKKRR QRQLR KLNGY CER KLK HYAAATLKLSQPDNEDRKARALEAAA+ELDRLFRK+DFSRMKV+GQFNLG
Subjt:  SGSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLG

Query:  FIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKD
        FIIGKLDQDL +                         L+RPLRLELSAEEEVVVSIHMDVIRKNGF +EEDPHALPGNRFRLKAVPFSKNITFGVEDVKD
Subjt:  FIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKD

Query:  LISTLADSEGECSIIGSYRMDTADSICP
        LISTLADSEGECSIIGSYRMDTA+S+CP
Subjt:  LISTLADSEGECSIIGSYRMDTADSICP

A0A6J1GQV6 DNA mismatch repair protein PMS1 isoform X20.0e+0083.07Show/hide
Query:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN
        GQVILDLSSAVKELVENSLDAGATS+EI+LKDYGEEWFQVIDNGSGISPTNFR                  SLTTYGFRGEALSSLCSLG LTVETRTKN
Subjt:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTYGFRGEALSSLCSLGTLTVETRTKN

Query:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
        ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
Subjt:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII

Query:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF
        TVFGM TF CLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF
Subjt:  TVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFF

Query:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPD-------------GADDDSFKKIENVEQSSHTTEVLNSDGE
        SDETHILQ LREELLKIYSP NAC+SVNKVEEPT+QVD +ELCSD  KLSM++EH SPD               +DDS K IENVEQSS T EV++SDGE
Subjt:  SDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPD-------------GADDDSFKKIENVEQSSHTTEVLNSDGE

Query:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA
        ENITRKDFALR+H MKKADA LKD+DQHKK+  SSKKG Q+T SSPCVT +GTNTSRVQSSLDKFVT NKRKYET SAPLSEVP+ RNQFLNNQWKK  A
Subjt:  ENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCA

Query:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS
        D PSKDVKCT+GNCQVF+DFS GNDEDSSIQ  TDRV ++L +PLSSADHSDDGE TEECT E  AKV  SVIEST TP K+LE MSEDLPLS  SVLPS
Subjt:  DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGE--AKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPS

Query:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF
        G +KESSSPQLKLCST HFDFHELKKRR QRQLR KLNGY CER KLK HYAAATLKLSQPDNEDRKARALEAAA+ELDRLFRK+DFSRMKV+GQFNLGF
Subjt:  GSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCER-KLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGF

Query:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL
        IIGKLDQDL +                         L+RPLRLELSAEEEVVVSIHMDVIRKNGF +EEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL
Subjt:  IIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKDL

Query:  ISTLADSEGECSIIGSYRMDTADSICP
        ISTLADSEGECSIIGSYRMDTA+S+CP
Subjt:  ISTLADSEGECSIIGSYRMDTADSICP

SwissProt top hitse value%identityAlignment
F1NQJ3 Mismatch repair endonuclease PMS23.9e-8330.88Show/hide
Query:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFRSLT------------------TYGFRGEALSSLCSLGTLTVETRTKN
        GQV+L L +AVKELVENSLDAGAT+I++ LKD+G E  +V DNG G+   NF  LT                  T+GFRGEALSSLC+L  +T+ T  K+
Subjt:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFRSLT------------------TYGFRGEALSSLCSLGTLTVETRTKN

Query:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
          V T L FDH+G +  +    RQ GTTV +++LF  LPVR KEF RNI+KEY K++ +L AY +I++GVR  CTN  G+  KS V  T G  ++K+NI 
Subjt:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII

Query:  TVFGM--------------NTFICLE---SVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLP
         VFG               N  +C E     + L      + GF+S+   G GR+  DRQFFF+N RP D  KV KLVNE+Y   N  QYP  ++N  + 
Subjt:  TVFGM--------------NTFICLE---SVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLP

Query:  SKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGADDDSFKKIENVEQSSHT-TEVL
        S+  D+NVTPDKR+I   +E  +L  L+  L++++        VNK            L  +   L +V    +P G  +  +     VE S H+ TE  
Subjt:  SKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGADDDSFKKIENVEQSSHT-TEVL

Query:  NSDGEENIT----RKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQV--TSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQ
        +S+G+  +T    R+ F+L                   +SY  S K V+   +SS     D+  +T R Q +      +     E+     S++PV R  
Subjt:  NSDGEENIT----RKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQV--TSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQ

Query:  FLNNQWKKSCADKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGEAKVHPSVIESTATPTKELEI----MS
                       K     +G C      +  N  ++   + ++ V + +     S++     E  +      K     I+   T  K   +      
Subjt:  FLNNQWKKSCADKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGEAKVHPSVIESTATPTKELEI----MS

Query:  EDLPLSGCSVLP--------SGSIKESSSPQLKLCSTLHF-----DFHELKKRRLQRQLRYKLNGYTCERKLK--------RHYAAATLKLSQPDNEDRK
         +LP    + LP        +   K    P+++  +T ++        E+KK+ +  +   K       + ++        ++Y     K+S  +N    
Subjt:  EDLPLSGCSVLP--------SGSIKESSSPQLKLCSTLHF-----DFHELKKRRLQRQLRYKLNGYTCERKLK--------RHYAAATLKLSQPDNEDRK

Query:  ARALEAAARELDRLFRKKDFSRMKVIGQFNLGFIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTI
            + A  EL +   K+ F++M++IGQFNLGFII KL+ DL +                         LI P  L L+A  E V+  ++++ RKNGF  
Subjt:  ARALEAAARELDRLFRKKDFSRMKVIGQFNLGFIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTI

Query:  EEDPHALPGNRFRLKAVPFSKNITFGVEDVKDLISTLADSEG
          + +A    R +L ++P SKN TFG +D+ +LI  L+D  G
Subjt:  EEDPHALPGNRFRLKAVPFSKNITFGVEDVKDLISTLADSEG

P54278 Mismatch repair endonuclease PMS23.6e-8933.01Show/hide
Query:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFRSLT------------------TYGFRGEALSSLCSLGTLTVETRTKN
        GQV+L LS+AVKELVENSLDAGAT+I++ LKDYG +  +V DNG G+   NF  LT                  T+GFRGEALSSLC+L  +T+ T   +
Subjt:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFRSLT------------------TYGFRGEALSSLCSLGTLTVETRTKN

Query:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
          V T L FDH+G ++ +    R  GTTV V++LFS LPVR KEF RNI+KEY K++ +L+AY +I+ G+R  CTN  G+  +  V  T GS SIK+NI 
Subjt:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII

Query:  TVFGMNTF--------------ICLE---SVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLP
        +VFG                  +C E   S S  L +   + GF+S+   G GR+  DRQFFF+N RP D  KV +LVNE+Y   N  QYP  ++N ++ 
Subjt:  TVFGMNTF--------------ICLE---SVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLP

Query:  SKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGADDDSFKKIENVEQS-SHTTEVL
        S+  D+NVTPDKR+I   +E  +L  L+  L+ ++        VNK+    + + D+E             +L    A D     +E  +QS S  T   
Subjt:  SKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGADDDSFKKIENVEQS-SHTTEVL

Query:  NSDGEENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQW
          D   +  R+ F+LR     K  +      + ++S L  K+G+  +S+S  ++D G               +  +K   SS+     P  R +   +  
Subjt:  NSDGEENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQW

Query:  KKSCA-DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSD-DGEATEECTG-EAKVHPSVIESTATPT-----KELEIMSED
          S + D     +  T  +C   ++++  +  D   Q   + V S+ + P +    SD D  + +E TG + +V P    + ATP      KE  + S D
Subjt:  KKSCA-DKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSD-DGEATEECTG-EAKVHPSVIESTATPT-----KELEIMSED

Query:  LPLSGCSVLPSGSIKESSSPQL-----KLCSTLHFDFHELKKRRLQRQLRYKLNGYTCERKLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKK
        +    C  L +     +S   +     K    L F    L KR   +QL ++      + + +++Y     K+   +N        +AA  EL +   K 
Subjt:  LPLSGCSVLPSGSIKESSSPQL-----KLCSTLHFDFHELKKRRLQRQLRYKLNGYTCERKLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLFRKK

Query:  DFSRMKVIGQFNLGFIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVP
         F+ M++IGQFNLGFII KL++D+ +                         LI P  L L+A  E V+  ++++ RKNGF    D +A    R +L ++P
Subjt:  DFSRMKVIGQFNLGFIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVP

Query:  FSKNITFGVEDVKDLISTLADSEG
         SKN TFG +DV +LI  L+DS G
Subjt:  FSKNITFGVEDVKDLISTLADSEG

P54279 Mismatch repair endonuclease PMS25.2e-7229.72Show/hide
Query:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFRSLT------------------TYGFRGEALSSLCSLGTLTVETRTKN
        GQVIL LS+AVKEL+ENS+DAGAT+I++ LKDYG +  +V DNG G+   NF  L                   T+GFRGEALSSLC+L  +T+ T   +
Subjt:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFRSLT------------------TYGFRGEALSSLCSLGTLTVETRTKN

Query:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
         SV T L FDH+G +  +    R  GTTV V+ LF  LPVR KEF RNI+KEY K++ +L AY +I+ GVR  CTN  G+  +  V  T G+  +K+NI 
Subjt:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII

Query:  TVFGMNTFICLESVSILLSDDCKVEGF-VSKSGQ------------------GSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFT
        +VFG      L     L   D   E + +S SG+                  G  +  G   F      PV   +   L    Y   N  QYP  ++N +
Subjt:  TVFGMNTFICLESVSILLSDDCKVEGF-VSKSGQ------------------GSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFT

Query:  LPSKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGADDDSFKKIENVEQSSHTTEV
        + S+  D+NVTPDKR+I   +E  +L  L+  L+ ++         NK+    + + D+E   +  KL          G  D+S     +++ ++    V
Subjt:  LPSKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGADDDSFKKIENVEQSSHTTEV

Query:  LNSDGEENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVT--------INKRKYETSSAPLS-----
               +I+R   A  +H  K+  +   +  +  +S+ S K+GV  +  S  ++  G     ++ S DK V+        +++ K E  S   S     
Subjt:  LNSDGEENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDSGTNTSRVQSSLDKFVT--------INKRKYETSSAPLS-----

Query:  ----EVPVFRNQFLNNQWKKSCADKPSKD-VKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGE-ATEECTGEAKVHPSVIESTA
              P   + F ++    S  D+PS++ + C + +C+          ED   Q K         LPL+    ++     TEE      +   +    +
Subjt:  ----EVPVFRNQFLNNQWKKSCADKPSKD-VKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGE-ATEECTGEAKVHPSVIESTA

Query:  TPTKELEIMSEDLPLSGCSVLPSGSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCERKLKRHYAAATLKLSQPDNEDRKARALEAAAREL
        T   E+++                +IK +     K    L F    L KR  Q      L     + K +  Y     K+   +N        +AA  EL
Subjt:  TPTKELEIMSEDLPLSGCSVLPSGSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCERKLKRHYAAATLKLSQPDNEDRKARALEAAAREL

Query:  DRLFRKKDFSRMKVIGQFNLGFIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNR
         +   K  F+ M+++GQFNLGFI+ KL +DL L                         LI P  L L+A  E V+  ++++ RKNGF    D  A    R
Subjt:  DRLFRKKDFSRMKVIGQFNLGFIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNR

Query:  FRLKAVPFSKNITFGVEDVKDLISTLADSEG
         +L ++P SKN TFG +D+ +LI  L+DS G
Subjt:  FRLKAVPFSKNITFGVEDVKDLISTLADSEG

Q54QA0 Mismatch repair endonuclease pms19.8e-7928.31Show/hide
Query:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFRSLT------------------TYGFRGEALSSLCSLGTLTVETRTKN
        GQVI DLS AVKEL+ENS+DAGAT++EI LK+YGEE+ +VIDNGSG+ P+NF +LT                  TYGFRGEALSSLCSL    + TRTKN
Subjt:  GQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFRSLT------------------TYGFRGEALSSLCSLGTLTVETRTKN

Query:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII
        +  A  L FD  G +  +   AR+VGTTV +  LF  LPVR +EF RNI+KEY KL+++L AYA+I+   R  C N AGK+ +S V  T    +I+DN+I
Subjt:  ESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNII

Query:  TVFGMNTFICLESVSILLSDDC-KVEGFVSKSGQGSG----------------------------------------RNLGDRQFFFVNNRPVDMPKVSK
         VFG      L+  +   SD   KV G +SK G GSG                                        R+  DRQFFFVN+RP +  K++K
Subjt:  TVFGMNTFICLESVSILLSDDC-KVEGFVSKSGQGSG----------------------------------------RNLGDRQFFFVNNRPVDMPKVSK

Query:  LVNELYKSANSR-QYPIAIMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPT---EQVDDLELCSDNGKLSMVLEH
         +N LY+S + R  YP+ I N  +P+   DVNVTPDKR IF   E  +L  + + L  ++    + +  N++ + T   E  +D    +   K+S     
Subjt:  LVNELYKSANSR-QYPIAIMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPT---EQVDDLELCSDNGKLSMVLEH

Query:  LSPDGADDDSFKKIENVEQSSHTTEVLNSDGEENITRKDFALRMHVMKKADALLKDHDQHKKSYL--SSKKGVQVTSSSP---CVTDSGTNTSRVQSSLD
         +    +D++  KI    +   TT   +S    +  +K             +    ++++ ++ L         +T   P      D   N S  +  L 
Subjt:  LSPDGADDDSFKKIENVEQSSHTTEVLNSDGEENITRKDFALRMHVMKKADALLKDHDQHKKSYL--SSKKGVQVTSSSP---CVTDSGTNTSRVQSSLD

Query:  KFVTINKRKY-----ETSSAPLSEVPVFRNQFLNNQWKKSCADKPSKD-------VKCTNGNCQVF-NDF-------------------SVGNDEDSSIQ
        K    NK+K      E         PVF N   +     S +     D       +  +NGN   F +DF                   ++ N+ +S+  
Subjt:  KFVTINKRKY-----ETSSAPLSEVPVFRNQFLNNQWKKSCADKPSKD-------VKCTNGNCQVF-NDF-------------------SVGNDEDSSIQ

Query:  IKTDRVLSKL----------RLPLSSADHSDDGEATE--------------------ECTGEAKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPSGSI
          +++++  +          + P      +DDG+  E                    +   E +      +   + T ++ I ++   +S   ++ +G+ 
Subjt:  IKTDRVLSKL----------RLPLSSADHSDDGEATE--------------------ECTGEAKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPSGSI

Query:  KESSSPQLKLCSTL--------------HFDFHELKKRRLQR---------QLRYKLNGYTCERKLKRHYAAATLKLSQPDNEDRKARALEAAARELDRL
         + ++P +   + +               FD + +     ++         QL  K +        K+   AAT   SQ   +  +    + A  EL + 
Subjt:  KESSSPQLKLCSTL--------------HFDFHELKKRRLQR---------QLRYKLNGYTCERKLKRHYAAATLKLSQPDNEDRKARALEAAARELDRL

Query:  FRKKDFSRMKVIGQFNLGFIIGKLDQDLLL---------------------------LIRPLRL-ELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNR
        F+K+ F +M VIGQFNLGFII KL  DL +                           L++P  L +L++EEE+++  ++D+ +KNGF    D  A    +
Subjt:  FRKKDFSRMKVIGQFNLGFIIGKLDQDLLL---------------------------LIRPLRL-ELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNR

Query:  FRLKAVPFSKNITFGVEDVKDLISTLADS
         +L A P     +FG++D+ + I  + +S
Subjt:  FRLKAVPFSKNITFGVEDVKDLISTLADS

Q941I6 DNA mismatch repair protein PMS11.0e-20851.24Show/hide
Query:  PSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTY
        PS +TT++ L   +N   +     GQVILDLSSAVKELVENSLDAGATSIEI L+DYGE++FQVIDNG GISPTNF+                  +LTTY
Subjt:  PSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTY

Query:  GFRGEALSSLCSLGTLTVETRTKNESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTN
        GFRGEALSSLC+LG LTVETRTKNE VAT LTFDHSGLL AEKKTARQ+GTTV V+KLFSNLPVRSKEF RNIRKEYGKL+SLLNAYA+IA+GVRFVC+N
Subjt:  GFRGEALSSLCSLGTLTVETRTKNESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTN

Query:  SAGKNAKSVVFKTQGSGSIKDNIITVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIA
        + GKN KSVV  TQG GS+KDNIITVFG++TF  L+ VSI +S+DC+VEGF+SK GQG+GRNL DRQ+FF+N RPVDMPKVSKLVNELYK  +SR+YP+ 
Subjt:  SAGKNAKSVVFKTQGSGSIKDNIITVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIA

Query:  IMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGADDDSFKKIENVEQSS
        I++F +P  ACD+NVTPDKRK+FFSDET ++ +LRE L +IYS +NA Y VN+ EE +EQ D   + S   K +++ E +  D +        E +E+ +
Subjt:  IMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGADDDSFKKIENVEQSS

Query:  HTTEVLNSDGEENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDS----GTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPV
         +   +  D    + +  F ++    KK +  L  HD        S KG+   + +  VTD+     + +S  QS+L+ FVT+ KRK+E  S  LSE PV
Subjt:  HTTEVLNSDGEENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDS----GTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPV

Query:  FRNQFLNNQWKKSCADKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGEAKVHPSVIESTATPTKELEIMS
         RNQ  + + +KS  +  +   +C     Q+  D  V + ED +   + D  L     P + AD+ +  E   E     +      ++T T   ++E +S
Subjt:  FRNQFLNNQWKKSCADKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGEAKVHPSVIESTATPTKELEIMS

Query:  EDLPLSGCSVLPSGSIKES--SSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCE----RKLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLF
        ED P     +    ++ +S   S   K+ STL F F  L+ RRL+R  R +  GY  +     + K+ +AAATL+LSQPD+E+RKARAL AA  EL+RLF
Subjt:  EDLPLSGCSVLPSGSIKES--SSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCE----RKLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLF

Query:  RKKDFSRMKVIGQFNLGFIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLK
        RK+DF RM+V+GQFNLGFII KL++DL +                         L++PL LELS EEEV V +HMD+IR+NGF +EE+P A PG  FRL+
Subjt:  RKKDFSRMKVIGQFNLGFIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLK

Query:  AVPFSKNITFGVEDVKDLISTLADSEGECSIIGSYRMDTADSICP
        A+P+SKNITFGVED+KDLISTL D+ GECS+  SY+    DSICP
Subjt:  AVPFSKNITFGVEDVKDLISTLADSEGECSIIGSYRMDTADSICP

Arabidopsis top hitse value%identityAlignment
AT1G02880.1 thiamin pyrophosphokinase16.8e-5165.07Show/hide
Query:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP
        D ++  GTK+ DES DQDTTDL KC+ YI  S  NQE S L IL  GALGGRFDHE GN+NVL R+  TRI+LLSDDCL+ LLP+TH HEIH++SS+EGP
Subjt:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP

Query:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKE
        HCGLIPIG PS  TTT+GLQWDL++TEM+FG L    +  S+ VKE
Subjt:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKE

AT1G02880.2 thiamin pyrophosphokinase16.8e-5165.07Show/hide
Query:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP
        D ++  GTK+ DES DQDTTDL KC+ YI  S  NQE S L IL  GALGGRFDHE GN+NVL R+  TRI+LLSDDCL+ LLP+TH HEIH++SS+EGP
Subjt:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP

Query:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKE
        HCGLIPIG PS  TTT+GLQWDL++TEM+FG L    +  S+ VKE
Subjt:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKE

AT1G02880.3 thiamin pyrophosphokinase16.8e-5165.07Show/hide
Query:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP
        D ++  GTK+ DES DQDTTDL KC+ YI  S  NQE S L IL  GALGGRFDHE GN+NVL R+  TRI+LLSDDCL+ LLP+TH HEIH++SS+EGP
Subjt:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP

Query:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKE
        HCGLIPIG PS  TTT+GLQWDL++TEM+FG L    +  S+ VKE
Subjt:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKE

AT1G02880.4 thiamin pyrophosphokinase16.8e-5165.07Show/hide
Query:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP
        D ++  GTK+ DES DQDTTDL KC+ YI  S  NQE S L IL  GALGGRFDHE GN+NVL R+  TRI+LLSDDCL+ LLP+TH HEIH++SS+EGP
Subjt:  DIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGP

Query:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKE
        HCGLIPIG PS  TTT+GLQWDL++TEM+FG L    +  S+ VKE
Subjt:  HCGLIPIGMPSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKE

AT4G02460.1 DNA mismatch repair protein, putative7.2e-21051.24Show/hide
Query:  PSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTY
        PS +TT++ L   +N   +     GQVILDLSSAVKELVENSLDAGATSIEI L+DYGE++FQVIDNG GISPTNF+                  +LTTY
Subjt:  PSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFR------------------SLTTY

Query:  GFRGEALSSLCSLGTLTVETRTKNESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTN
        GFRGEALSSLC+LG LTVETRTKNE VAT LTFDHSGLL AEKKTARQ+GTTV V+KLFSNLPVRSKEF RNIRKEYGKL+SLLNAYA+IA+GVRFVC+N
Subjt:  GFRGEALSSLCSLGTLTVETRTKNESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTN

Query:  SAGKNAKSVVFKTQGSGSIKDNIITVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIA
        + GKN KSVV  TQG GS+KDNIITVFG++TF  L+ VSI +S+DC+VEGF+SK GQG+GRNL DRQ+FF+N RPVDMPKVSKLVNELYK  +SR+YP+ 
Subjt:  SAGKNAKSVVFKTQGSGSIKDNIITVFGMNTFICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIA

Query:  IMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGADDDSFKKIENVEQSS
        I++F +P  ACD+NVTPDKRK+FFSDET ++ +LRE L +IYS +NA Y VN+ EE +EQ D   + S   K +++ E +  D +        E +E+ +
Subjt:  IMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKIYSPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGADDDSFKKIENVEQSS

Query:  HTTEVLNSDGEENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDS----GTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPV
         +   +  D    + +  F ++    KK +  L  HD        S KG+   + +  VTD+     + +S  QS+L+ FVT+ KRK+E  S  LSE PV
Subjt:  HTTEVLNSDGEENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQVTSSSPCVTDS----GTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPV

Query:  FRNQFLNNQWKKSCADKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGEAKVHPSVIESTATPTKELEIMS
         RNQ  + + +KS  +  +   +C     Q+  D  V + ED +   + D  L     P + AD+ +  E   E     +      ++T T   ++E +S
Subjt:  FRNQFLNNQWKKSCADKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHSDDGEATEECTGEAKVHPSVIESTATPTKELEIMS

Query:  EDLPLSGCSVLPSGSIKES--SSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCE----RKLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLF
        ED P     +    ++ +S   S   K+ STL F F  L+ RRL+R  R +  GY  +     + K+ +AAATL+LSQPD+E+RKARAL AA  EL+RLF
Subjt:  EDLPLSGCSVLPSGSIKES--SSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCE----RKLKRHYAAATLKLSQPDNEDRKARALEAAARELDRLF

Query:  RKKDFSRMKVIGQFNLGFIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLK
        RK+DF RM+V+GQFNLGFII KL++DL +                         L++PL LELS EEEV V +HMD+IR+NGF +EE+P A PG  FRL+
Subjt:  RKKDFSRMKVIGQFNLGFIIGKLDQDLLL-------------------------LIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLK

Query:  AVPFSKNITFGVEDVKDLISTLADSEGECSIIGSYRMDTADSICP
        A+P+SKNITFGVED+KDLISTL D+ GECS+  SY+    DSICP
Subjt:  AVPFSKNITFGVEDVKDLISTLADSEGECSIIGSYRMDTADSICP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
atgtatacatttctggctcaagatattcacatgatgcagggaaccaagatatttgatgaatctgaagatcaggataccacagacctacataaatgtgttgcatatattct
tcagtctatgccaaatcaggaggagtctgatctatgtatacttgttgctggagcattaggtggacggtttgatcatgagataggaaacatcaatgttctatgccgttttt
caaccacgcgcataatccttctgtctgatgattgtcttgttcatcttctcccaaggactcatcaccatgaaatccatgttcgctcatctgttgagggtccacattgtggt
ctcattccaattgggatgccttctggaagcactacaaccactggtctccaatgggatttgaatgacacagagatgaagtttggagcattaggccaagtgattcttgacct
ttcttccgccgtcaaggagttagtcgaaaacagtttggacgccggagccactagcatcgagattgccttgaaagattatggagaagagtggttccaggtcattgataatg
ggtccggcatttctcccaccaatttcaggtctctaaccacgtatggtttcagaggagaagcactaagttctctttgttctcttgggactttgactgttgaaactaggaca
aagaatgagtctgttgcaacgcacttaacttttgatcattcaggactactagttgctgagaagaaaactgcacgccaagttggaaccactgtcatggttaagaagttatt
ctccaatttacctgtgcgtagtaaagagttcagtcgtaacattcgaaaggaatatggcaagctcatttcattattgaatgcctatgctgtcatagcaagaggagttagat
ttgtatgcactaattctgcagggaaaaatgcaaagtctgtggtattcaaaactcaaggaagtggttccattaaggacaacatcataacagtgtttggtatgaataccttc
atctgcttggagtctgtatctatattattatcggatgattgcaaagttgaaggatttgtatcgaagagtggacagggaagtgggaggaatttgggagatcgacaattctt
ttttgtaaataatcgacctgtggatatgcctaaagtgagcaagcttgtaaatgagttatataaaagtgcaaactcccgacaatatccgattgcaatcatgaatttcactc
ttccaagtaaagcttgtgatgtcaacgttactcctgataagagaaaaatattcttctctgatgaaactcacattttgcaaacattgagagaggaattactgaagatctat
tcaccaactaatgcctgttattctgtgaataaagttgaagaacctaccgaacaagtagatgaccttgagctgtgttctgataatgggaaattgagcatggtattggaaca
tttgtcacctgatggagctgatgatgattctttcaagaaaattgaaaatgtggaacaatcctcacatactactgaggtattaaatagtgatggtgaggaaaatataacca
gaaaggacttcgctctaagaatgcatgtcatgaagaaggctgatgctcttttaaaggatcatgatcagcataagaaatcttatctcagtagcaagaaaggtgtacaagtt
acttcttcctctccatgtgtgaccgatagtggaaccaatacaagccgtgttcaatcttcacttgacaagttcgtaactataaataagagaaaatatgaaacttcatctgc
accgctgtctgaagtacccgtcttcagaaaccaatttcttaataatcaatggaagaaaagctgtgccgataaaccttccaaggatgtaaaatgcacaaatggaaactgcc
aggtgtttaatgatttttcagtagggaatgatgaagacagctcgatacaaattaaaacagatagagtcttgagtaaacttcgtcttccactatcctctgcggaccatagt
gatgatggagaagcaacagaggaatgtacaggggaagccaaggtgcatccttctgtgatagaatctactgctacacccaccaaggaacttgagattatgtctgaggatct
tccgctgtccggctgttcagtacttccttctggttctataaaagaaagctccagtcctcaattgaagttatgttcaacgttacattttgattttcatgagctgaagaaaa
ggaggttgcagaggcagttgagatacaaattgaatggctatacatgtgaacgaaagttgaaacgccactatgctgctgctactctgaagctttcccaacctgataatgaa
gacaggaaagcaagagctttagaagcagctgctagggagctggacaggctcttcagaaagaaagattttagtagaatgaaggtgattggacaattcaacctcggatttat
cattgggaagttagatcaagatctattattgttgatcaggccgttgaggttggaattatctgctgaagaagaagtcgttgtttcaattcacatggatgtaatcaggaaaa
atggatttactatagaggaagatccacatgctctacctggaaatcgattcagattaaaagctgtcccttttagtaaaaatataacattcggagttgaagatgttaaggac
ctgatttctacacttgctgatagcgaaggtgaatgttctataatcggctcttatcggatggacactgctgattcaatctgccctcccgagttcgagcaatgttggcatct
cgggcatgtagatcttctgtga
mRNA sequenceShow/hide mRNA sequence
atgtatacatttctggctcaagatattcacatgatgcagggaaccaagatatttgatgaatctgaagatcaggataccacagacctacataaatgtgttgcatatattct
tcagtctatgccaaatcaggaggagtctgatctatgtatacttgttgctggagcattaggtggacggtttgatcatgagataggaaacatcaatgttctatgccgttttt
caaccacgcgcataatccttctgtctgatgattgtcttgttcatcttctcccaaggactcatcaccatgaaatccatgttcgctcatctgttgagggtccacattgtggt
ctcattccaattgggatgccttctggaagcactacaaccactggtctccaatgggatttgaatgacacagagatgaagtttggagcattaggccaagtgattcttgacct
ttcttccgccgtcaaggagttagtcgaaaacagtttggacgccggagccactagcatcgagattgccttgaaagattatggagaagagtggttccaggtcattgataatg
ggtccggcatttctcccaccaatttcaggtctctaaccacgtatggtttcagaggagaagcactaagttctctttgttctcttgggactttgactgttgaaactaggaca
aagaatgagtctgttgcaacgcacttaacttttgatcattcaggactactagttgctgagaagaaaactgcacgccaagttggaaccactgtcatggttaagaagttatt
ctccaatttacctgtgcgtagtaaagagttcagtcgtaacattcgaaaggaatatggcaagctcatttcattattgaatgcctatgctgtcatagcaagaggagttagat
ttgtatgcactaattctgcagggaaaaatgcaaagtctgtggtattcaaaactcaaggaagtggttccattaaggacaacatcataacagtgtttggtatgaataccttc
atctgcttggagtctgtatctatattattatcggatgattgcaaagttgaaggatttgtatcgaagagtggacagggaagtgggaggaatttgggagatcgacaattctt
ttttgtaaataatcgacctgtggatatgcctaaagtgagcaagcttgtaaatgagttatataaaagtgcaaactcccgacaatatccgattgcaatcatgaatttcactc
ttccaagtaaagcttgtgatgtcaacgttactcctgataagagaaaaatattcttctctgatgaaactcacattttgcaaacattgagagaggaattactgaagatctat
tcaccaactaatgcctgttattctgtgaataaagttgaagaacctaccgaacaagtagatgaccttgagctgtgttctgataatgggaaattgagcatggtattggaaca
tttgtcacctgatggagctgatgatgattctttcaagaaaattgaaaatgtggaacaatcctcacatactactgaggtattaaatagtgatggtgaggaaaatataacca
gaaaggacttcgctctaagaatgcatgtcatgaagaaggctgatgctcttttaaaggatcatgatcagcataagaaatcttatctcagtagcaagaaaggtgtacaagtt
acttcttcctctccatgtgtgaccgatagtggaaccaatacaagccgtgttcaatcttcacttgacaagttcgtaactataaataagagaaaatatgaaacttcatctgc
accgctgtctgaagtacccgtcttcagaaaccaatttcttaataatcaatggaagaaaagctgtgccgataaaccttccaaggatgtaaaatgcacaaatggaaactgcc
aggtgtttaatgatttttcagtagggaatgatgaagacagctcgatacaaattaaaacagatagagtcttgagtaaacttcgtcttccactatcctctgcggaccatagt
gatgatggagaagcaacagaggaatgtacaggggaagccaaggtgcatccttctgtgatagaatctactgctacacccaccaaggaacttgagattatgtctgaggatct
tccgctgtccggctgttcagtacttccttctggttctataaaagaaagctccagtcctcaattgaagttatgttcaacgttacattttgattttcatgagctgaagaaaa
ggaggttgcagaggcagttgagatacaaattgaatggctatacatgtgaacgaaagttgaaacgccactatgctgctgctactctgaagctttcccaacctgataatgaa
gacaggaaagcaagagctttagaagcagctgctagggagctggacaggctcttcagaaagaaagattttagtagaatgaaggtgattggacaattcaacctcggatttat
cattgggaagttagatcaagatctattattgttgatcaggccgttgaggttggaattatctgctgaagaagaagtcgttgtttcaattcacatggatgtaatcaggaaaa
atggatttactatagaggaagatccacatgctctacctggaaatcgattcagattaaaagctgtcccttttagtaaaaatataacattcggagttgaagatgttaaggac
ctgatttctacacttgctgatagcgaaggtgaatgttctataatcggctcttatcggatggacactgctgattcaatctgccctcccgagttcgagcaatgttggcatct
cgggcatgtagatcttctgtgatgattggggatcctctagggagaaatgaaatgcagaagatactggagcatttggcagagctgaaatctccttggaactgtcctcatgg
aaggccaaccatgagacatttggtggatcttacaacaattaaaaggtcagaagaaagtgacgctgaatgttgaacgtttcttcttcatc
Protein sequenceShow/hide protein sequence
MYTFLAQDIHMMQGTKIFDESEDQDTTDLHKCVAYILQSMPNQEESDLCILVAGALGGRFDHEIGNINVLCRFSTTRIILLSDDCLVHLLPRTHHHEIHVRSSVEGPHCG
LIPIGMPSGSTTTTGLQWDLNDTEMKFGALGQVILDLSSAVKELVENSLDAGATSIEIALKDYGEEWFQVIDNGSGISPTNFRSLTTYGFRGEALSSLCSLGTLTVETRT
KNESVATHLTFDHSGLLVAEKKTARQVGTTVMVKKLFSNLPVRSKEFSRNIRKEYGKLISLLNAYAVIARGVRFVCTNSAGKNAKSVVFKTQGSGSIKDNIITVFGMNTF
ICLESVSILLSDDCKVEGFVSKSGQGSGRNLGDRQFFFVNNRPVDMPKVSKLVNELYKSANSRQYPIAIMNFTLPSKACDVNVTPDKRKIFFSDETHILQTLREELLKIY
SPTNACYSVNKVEEPTEQVDDLELCSDNGKLSMVLEHLSPDGADDDSFKKIENVEQSSHTTEVLNSDGEENITRKDFALRMHVMKKADALLKDHDQHKKSYLSSKKGVQV
TSSSPCVTDSGTNTSRVQSSLDKFVTINKRKYETSSAPLSEVPVFRNQFLNNQWKKSCADKPSKDVKCTNGNCQVFNDFSVGNDEDSSIQIKTDRVLSKLRLPLSSADHS
DDGEATEECTGEAKVHPSVIESTATPTKELEIMSEDLPLSGCSVLPSGSIKESSSPQLKLCSTLHFDFHELKKRRLQRQLRYKLNGYTCERKLKRHYAAATLKLSQPDNE
DRKARALEAAARELDRLFRKKDFSRMKVIGQFNLGFIIGKLDQDLLLLIRPLRLELSAEEEVVVSIHMDVIRKNGFTIEEDPHALPGNRFRLKAVPFSKNITFGVEDVKD
LISTLADSEGECSIIGSYRMDTADSICPPEFEQCWHLGHVDLL