; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015581 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015581
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionendonuclease MutS2 isoform X1
Genome locationtig00004835:332496..356754
RNA-Seq ExpressionSgr015581
SyntenySgr015581
Gene Ontology termsGO:0006298 - mismatch repair (biological process)
GO:0045910 - negative regulation of DNA recombination (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
InterPro domainsIPR000432 - DNA mismatch repair protein MutS, C-terminal
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR045076 - DNA mismatch repair MutS family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141291.1 uncharacterized protein LOC111011726 isoform X1 [Momordica charantia]1.7e-23179.93Show/hide
Query:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP
        S+  GIGTILEPLSAVPLNDELQQARA+V KAEEDVLFML+EKV                   VNARASYGLS GGTCP++ILPEGC S IAN C SGD 
Subjt:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP

Query:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN
         SEASC KK+EWVLYLPNA HPLLLQQYRENL+ AKRDVRNAF +IGRKLPGG+ S KEKK+VDISFLKMKVEELEQAH VP+DFSISQRIRVLV+TGPN
Subjt:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN

Query:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
        TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
Subjt:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF

Query:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------
        AKSGASLTIATTHHGELKTLKYSNEVFENA MEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE               
Subjt:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------

Query:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS
                                IIKH REQRL KVQEVS+AA   RSNLHKKVRELRASAIESSPPTAIRSRQ  G+SSNKLAT GKKN  AL +RIS
Subjt:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS

Query:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        STG ISQPRS + EFP  GDTVYVSSLGKEATVLSV+PSK E+VVQVGS+KLKLKFTD
Subjt:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

XP_022141292.1 uncharacterized protein LOC111011726 isoform X2 [Momordica charantia]8.4e-23180.11Show/hide
Query:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP
        S+  GIGTILEPLSAVPLNDELQQARA+V KAEEDVLFML+EKV                   VNARASYGLS GGTCP++ILPEGC S IAN C SGD 
Subjt:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP

Query:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN
         SEASC KK+EWVLYLPNA HPLLLQQYRENL+ AKRDVRNAF DIGRKLPGG+ S KEKK+VDISFLKMKVEELEQAH VP+DFSISQRIRVLV+TGPN
Subjt:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN

Query:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
        TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
Subjt:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF

Query:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------
        AKSGASLTIATTHHGELKTLKYSNEVFENA MEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE               
Subjt:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------

Query:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS
                                IIKH REQRL KVQEVS+AA   RSNLHKKVRELRASAIESSPPTAIRSRQ  G+SSNKLAT GKKN  AL +RIS
Subjt:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS

Query:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        STG ISQPRS + EFP  GDTVYVSSLGKEATVLSV+PSK E+VVQVGS+KLKLKFTD
Subjt:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

XP_022141293.1 uncharacterized protein LOC111011726 isoform X3 [Momordica charantia]1.7e-23179.93Show/hide
Query:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP
        S+  GIGTILEPLSAVPLNDELQQARA+V KAEEDVLFML+EKV                   VNARASYGLS GGTCP++ILPEGC S IAN C SGD 
Subjt:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP

Query:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN
         SEASC KK+EWVLYLPNA HPLLLQQYRENL+ AKRDVRNAF +IGRKLPGG+ S KEKK+VDISFLKMKVEELEQAH VP+DFSISQRIRVLV+TGPN
Subjt:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN

Query:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
        TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
Subjt:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF

Query:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------
        AKSGASLTIATTHHGELKTLKYSNEVFENA MEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE               
Subjt:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------

Query:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS
                                IIKH REQRL KVQEVS+AA   RSNLHKKVRELRASAIESSPPTAIRSRQ  G+SSNKLAT GKKN  AL +RIS
Subjt:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS

Query:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        STG ISQPRS + EFP  GDTVYVSSLGKEATVLSV+PSK E+VVQVGS+KLKLKFTD
Subjt:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

XP_022141294.1 uncharacterized protein LOC111011726 isoform X4 [Momordica charantia]1.7e-23179.93Show/hide
Query:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP
        S+  GIGTILEPLSAVPLNDELQQARA+V KAEEDVLFML+EKV                   VNARASYGLS GGTCP++ILPEGC S IAN C SGD 
Subjt:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP

Query:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN
         SEASC KK+EWVLYLPNA HPLLLQQYRENL+ AKRDVRNAF +IGRKLPGG+ S KEKK+VDISFLKMKVEELEQAH VP+DFSISQRIRVLV+TGPN
Subjt:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN

Query:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
        TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
Subjt:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF

Query:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------
        AKSGASLTIATTHHGELKTLKYSNEVFENA MEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE               
Subjt:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------

Query:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS
                                IIKH REQRL KVQEVS+AA   RSNLHKKVRELRASAIESSPPTAIRSRQ  G+SSNKLAT GKKN  AL +RIS
Subjt:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS

Query:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        STG ISQPRS + EFP  GDTVYVSSLGKEATVLSV+PSK E+VVQVGS+KLKLKFTD
Subjt:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

XP_022141295.1 uncharacterized protein LOC111011726 isoform X5 [Momordica charantia]1.7e-23179.93Show/hide
Query:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP
        S+  GIGTILEPLSAVPLNDELQQARA+V KAEEDVLFML+EKV                   VNARASYGLS GGTCP++ILPEGC S IAN C SGD 
Subjt:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP

Query:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN
         SEASC KK+EWVLYLPNA HPLLLQQYRENL+ AKRDVRNAF +IGRKLPGG+ S KEKK+VDISFLKMKVEELEQAH VP+DFSISQRIRVLV+TGPN
Subjt:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN

Query:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
        TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
Subjt:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF

Query:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------
        AKSGASLTIATTHHGELKTLKYSNEVFENA MEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE               
Subjt:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------

Query:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS
                                IIKH REQRL KVQEVS+AA   RSNLHKKVRELRASAIESSPPTAIRSRQ  G+SSNKLAT GKKN  AL +RIS
Subjt:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS

Query:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        STG ISQPRS + EFP  GDTVYVSSLGKEATVLSV+PSK E+VVQVGS+KLKLKFTD
Subjt:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

TrEMBL top hitse value%identityAlignment
A0A6J1CHN4 uncharacterized protein LOC111011726 isoform X58.2e-23279.93Show/hide
Query:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP
        S+  GIGTILEPLSAVPLNDELQQARA+V KAEEDVLFML+EKV                   VNARASYGLS GGTCP++ILPEGC S IAN C SGD 
Subjt:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP

Query:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN
         SEASC KK+EWVLYLPNA HPLLLQQYRENL+ AKRDVRNAF +IGRKLPGG+ S KEKK+VDISFLKMKVEELEQAH VP+DFSISQRIRVLV+TGPN
Subjt:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN

Query:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
        TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
Subjt:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF

Query:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------
        AKSGASLTIATTHHGELKTLKYSNEVFENA MEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE               
Subjt:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------

Query:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS
                                IIKH REQRL KVQEVS+AA   RSNLHKKVRELRASAIESSPPTAIRSRQ  G+SSNKLAT GKKN  AL +RIS
Subjt:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS

Query:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        STG ISQPRS + EFP  GDTVYVSSLGKEATVLSV+PSK E+VVQVGS+KLKLKFTD
Subjt:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

A0A6J1CI69 uncharacterized protein LOC111011726 isoform X48.2e-23279.93Show/hide
Query:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP
        S+  GIGTILEPLSAVPLNDELQQARA+V KAEEDVLFML+EKV                   VNARASYGLS GGTCP++ILPEGC S IAN C SGD 
Subjt:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP

Query:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN
         SEASC KK+EWVLYLPNA HPLLLQQYRENL+ AKRDVRNAF +IGRKLPGG+ S KEKK+VDISFLKMKVEELEQAH VP+DFSISQRIRVLV+TGPN
Subjt:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN

Query:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
        TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
Subjt:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF

Query:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------
        AKSGASLTIATTHHGELKTLKYSNEVFENA MEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE               
Subjt:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------

Query:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS
                                IIKH REQRL KVQEVS+AA   RSNLHKKVRELRASAIESSPPTAIRSRQ  G+SSNKLAT GKKN  AL +RIS
Subjt:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS

Query:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        STG ISQPRS + EFP  GDTVYVSSLGKEATVLSV+PSK E+VVQVGS+KLKLKFTD
Subjt:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

A0A6J1CIQ4 uncharacterized protein LOC111011726 isoform X18.2e-23279.93Show/hide
Query:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP
        S+  GIGTILEPLSAVPLNDELQQARA+V KAEEDVLFML+EKV                   VNARASYGLS GGTCP++ILPEGC S IAN C SGD 
Subjt:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP

Query:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN
         SEASC KK+EWVLYLPNA HPLLLQQYRENL+ AKRDVRNAF +IGRKLPGG+ S KEKK+VDISFLKMKVEELEQAH VP+DFSISQRIRVLV+TGPN
Subjt:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN

Query:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
        TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
Subjt:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF

Query:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------
        AKSGASLTIATTHHGELKTLKYSNEVFENA MEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE               
Subjt:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------

Query:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS
                                IIKH REQRL KVQEVS+AA   RSNLHKKVRELRASAIESSPPTAIRSRQ  G+SSNKLAT GKKN  AL +RIS
Subjt:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS

Query:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        STG ISQPRS + EFP  GDTVYVSSLGKEATVLSV+PSK E+VVQVGS+KLKLKFTD
Subjt:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

A0A6J1CJG8 uncharacterized protein LOC111011726 isoform X24.1e-23180.11Show/hide
Query:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP
        S+  GIGTILEPLSAVPLNDELQQARA+V KAEEDVLFML+EKV                   VNARASYGLS GGTCP++ILPEGC S IAN C SGD 
Subjt:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP

Query:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN
         SEASC KK+EWVLYLPNA HPLLLQQYRENL+ AKRDVRNAF DIGRKLPGG+ S KEKK+VDISFLKMKVEELEQAH VP+DFSISQRIRVLV+TGPN
Subjt:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN

Query:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
        TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
Subjt:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF

Query:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------
        AKSGASLTIATTHHGELKTLKYSNEVFENA MEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE               
Subjt:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------

Query:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS
                                IIKH REQRL KVQEVS+AA   RSNLHKKVRELRASAIESSPPTAIRSRQ  G+SSNKLAT GKKN  AL +RIS
Subjt:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS

Query:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        STG ISQPRS + EFP  GDTVYVSSLGKEATVLSV+PSK E+VVQVGS+KLKLKFTD
Subjt:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

A0A6J1CK29 uncharacterized protein LOC111011726 isoform X38.2e-23279.93Show/hide
Query:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP
        S+  GIGTILEPLSAVPLNDELQQARA+V KAEEDVLFML+EKV                   VNARASYGLS GGTCP++ILPEGC S IAN C SGD 
Subjt:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKV------------------WVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP

Query:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN
         SEASC KK+EWVLYLPNA HPLLLQQYRENL+ AKRDVRNAF +IGRKLPGG+ S KEKK+VDISFLKMKVEELEQAH VP+DFSISQRIRVLV+TGPN
Subjt:  TSEASCPKKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPN

Query:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
        TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF
Subjt:  TGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESF

Query:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------
        AKSGASLTIATTHHGELKTLKYSNEVFENA MEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE               
Subjt:  AKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDE---------------

Query:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS
                                IIKH REQRL KVQEVS+AA   RSNLHKKVRELRASAIESSPPTAIRSRQ  G+SSNKLAT GKKN  AL +RIS
Subjt:  ------------------------IIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMALCTRIS

Query:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        STG ISQPRS + EFP  GDTVYVSSLGKEATVLSV+PSK E+VVQVGS+KLKLKFTD
Subjt:  STGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

SwissProt top hitse value%identityAlignment
A7FY72 Endonuclease MutS21.5e-5739.32Show/hide
Query:  VEELEQAHP-------VPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISE
        V+ +E  HP       VP+   + +    L++TGPNTGGKTV LKT+GL  +MA SGL + A E+  I +F++VFADIGDEQS+ QSLSTFS H+K I E
Subjt:  VEELEQAHP-------VPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISE

Query:  IQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGL
        I   +   SLVL DE+GAGT+P EGAAL +S+LE+  K G  + IATTH+ ELK      E  ENAS+EFD   L+PTY++L G+PG+SNA  I++RLGL
Subjt:  IQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGL

Query:  PGTVVDDAREHYGAASAQIDEII--------------------KHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTA------IRSRQHA
        P  ++D ARE+    + + +E+I                    K  R++  +K +E  E    VR N     R    + I+ +   A      IR  +  
Subjt:  PGTVVDDAREHYGAASAQIDEII--------------------KHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTA------IRSRQHA

Query:  GISSN---KLATAGKKNPMALCTRISSTGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        G SS+   KL    KK    L   I      +  + E  +    GD V ++S+ ++  VLS   +KG+++VQ G MK+     D
Subjt:  GISSN---KLATAGKKNPMALCTRISSTGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

B1IMK5 Endonuclease MutS21.2e-5739.32Show/hide
Query:  VEELEQAHP-------VPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISE
        V+ +E  HP       VP+   + +    L++TGPNTGGKTV LKT+GL  +MA SGL + A E+  I +F++VFADIGDEQS+ QSLSTFS H+K I E
Subjt:  VEELEQAHP-------VPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISE

Query:  IQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGL
        I   +   SLVL DE+GAGT+P EGAAL +S+LE+  K G  + IATTH+ ELK      E  ENAS+EFD   L+PTY++L G+PG+SNA  I++RLGL
Subjt:  IQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGL

Query:  PGTVVDDAREHYGAASAQIDEII--------------------KHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTA------IRSRQHA
        P  ++D ARE+    + + +E+I                    K  R++  +K +E  E    VR N     R    + I+ +   A      IR  +  
Subjt:  PGTVVDDAREHYGAASAQIDEII--------------------KHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTA------IRSRQHA

Query:  GISSN---KLATAGKKNPMALCTRISSTGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        G SS+   KL    KK    L   I      +  + E  +    GD V ++S+ ++  VLS   +KG+++VQ G MK+     D
Subjt:  GISSN---KLATAGKKNPMALCTRISSTGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

B1L0S3 Endonuclease MutS25.2e-5839.58Show/hide
Query:  VEELEQAHP-------VPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISE
        V+ +E  HP       VP+   + +    L++TGPNTGGKTV LKT+GL  +MA SGL + A E+  I +F++VFADIGDEQS+ QSLSTFS H+K I E
Subjt:  VEELEQAHP-------VPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISE

Query:  IQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGL
        I   +   SLVL DE+GAGT+P EGAAL +S+LE+  K GA + IATTH+ ELK      E  ENAS+EFD   L+PTY++L G+PG+SNA  I++RLGL
Subjt:  IQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGL

Query:  PGTVVDDAREHYGAASAQIDEII--------------------KHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTA------IRSRQHA
        P  ++D ARE+    + + +E+I                    K  R++  +K +E  E    VR N     R    + I+ +   A      IR  +  
Subjt:  PGTVVDDAREHYGAASAQIDEII--------------------KHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTA------IRSRQHA

Query:  GISSN---KLATAGKKNPMALCTRISSTGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        G SS+   KL    KK    L   I      +  + E  +    GD V ++S+ ++  VLS   +KG+++VQ G MK+     D
Subjt:  GISSN---KLATAGKKNPMALCTRISSTGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

C3KTI4 Endonuclease MutS25.2e-5839.58Show/hide
Query:  VEELEQAHP-------VPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISE
        V+ +E  HP       VP+   + +    L++TGPNTGGKTV LKT+GL  +MA SGL + A E+  I +F++VFADIGDEQS+ QSLSTFS H+K I E
Subjt:  VEELEQAHP-------VPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISE

Query:  IQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGL
        I   +   SLVL DE+GAGT+P EGAAL +S+LE+  K GA + IATTH+ ELK      E  ENAS+EFD   L+PTY++L G+PG+SNA  I++RLGL
Subjt:  IQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGL

Query:  PGTVVDDAREHYGAASAQIDEII--------------------KHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTA------IRSRQHA
        P  ++D ARE+    + + +E+I                    K  R++  +K +E  E    VR N     R    + I+ +   A      IR  +  
Subjt:  PGTVVDDAREHYGAASAQIDEII--------------------KHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTA------IRSRQHA

Query:  GISSN---KLATAGKKNPMALCTRISSTGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
        G SS+   KL    KK    L   I      +  + E  +    GD V ++S+ ++  VLS   +KG+++VQ G MK+     D
Subjt:  GISSN---KLATAGKKNPMALCTRISSTGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

P73625 Endonuclease MutS22.3e-6139.58Show/hide
Query:  VPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI--------SEIQSVSTSQ
        VP+  +I  +IRV+ +TGPNTGGKTV LKT+GL A+MAK GL++ A E+V++PWF  + ADIGDEQSL Q+LSTFSGH+ +I        S +Q V   +
Subjt:  VPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI--------SEIQSVSTSQ

Query:  ----------SLVLLDEVGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERL
                  SLVLLDEVGAGT+P EG+AL ++LL   A     LT+ATTH+GELK LKY +  FENAS+EFD+ +L PTY++LWG+PGRSNA+ IA+RL
Subjt:  ----------SLVLLDEVGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERL

Query:  GLPGTVVDDAREHYGAASAQIDEIIKHGREQRLRKVQEVSEAATMVR-------------SNLHKKVRELRA----------SAIESSPPTAIRSRQHAG
        GLP  +V+ A++  G  S  I+++I     QR  + Q+ + A  +++             ++L  + REL++          +A +      IR  Q   
Subjt:  GLPGTVVDDAREHYGAASAQIDEIIKHGREQRLRKVQEVSEAATMVR-------------SNLHKKVRELRA----------SAIESSPPTAIRSRQHAG

Query:  ISSNKLATAGKKNPMALCTRISSTGDISQPRSEEPE----FPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
         S+ K   A         T I       Q     P+     P  G+ + + S G+ A V  V  +   + V +G MK+ +   D
Subjt:  ISSNKLATAGKKNPMALCTRISSTGDISQPRSEEPE----FPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD

Arabidopsis top hitse value%identityAlignment
AT1G65070.1 DNA mismatch repair protein MutS, type 21.0e-4845.64Show/hide
Query:  PVPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDE
        PVP+D  +    +V+V++GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD + ADIGD QSL QSLSTFSGH+ +I +I  +++  SLVLLDE
Subjt:  PVPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDE

Query:  VGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDARE
        + +GT+P EG AL  S+L+ + K+  ++ + +TH+G+L  LK +   F+NA+MEF    L+PT+++LWG  G SNA+ +A+ +G    ++++A +
Subjt:  VGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDARE

AT1G65070.2 DNA mismatch repair protein MutS, type 21.0e-4845.64Show/hide
Query:  PVPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDE
        PVP+D  +    +V+V++GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD + ADIGD QSL QSLSTFSGH+ +I +I  +++  SLVLLDE
Subjt:  PVPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDE

Query:  VGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDARE
        + +GT+P EG AL  S+L+ + K+  ++ + +TH+G+L  LK +   F+NA+MEF    L+PT+++LWG  G SNA+ +A+ +G    ++++A +
Subjt:  VGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDARE

AT3G18524.1 MUTS homolog 22.1e-1427.42Show/hide
Query:  VLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGM
        ++TGPN GGK+  ++ +G+  +MA+ G  V   +   I   D +FA +G      + +STF   + + + I   ++ +SL+++DE+G GT+  +G  L  
Subjt:  VLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGM

Query:  SLLESFAKSGASLTIATTHHGELKTLKYSN-EVFEN----------ASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAR------EHYG
        ++ E   +   + T+  TH  EL  L  +N EV  N          A ++ +   L   YK+  G   +S  I++AE    P +VV  AR      E + 
Subjt:  SLLESFAKSGASLTIATTHHGELKTLKYSN-EVFEN----------ASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAR------EHYG

Query:  AASAQIDEIIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIE
         +S  I+      R+ R     EVS  A       HK ++E  A  ++
Subjt:  AASAQIDEIIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIE

AT3G24320.1 MUTL protein homolog 17.6e-2035.35Show/hide
Query:  VLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAAL
        + +LTGPN GGK+  L++I  AA++  SGL V A ES  IP FDS+   +    S     S+F   + +I  I S +TS+SLVL+DE+  GT   +G  +
Subjt:  VLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAAL

Query:  GMSLLESFAKSGASLTIATTHHGELK-TLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGA-----ASAQI---
          S++ES   SG    ++T  HG     L   N  ++    E  E   KPT+K+  GV   S A   A+R G+P +V+  A   Y +     ASA++   
Subjt:  GMSLLESFAKSGASLTIATTHHGELK-TLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGA-----ASAQI---

Query:  DEII-KHGREQRLRK
        D+II     +Q+++K
Subjt:  DEII-KHGREQRLRK

AT5G54090.1 DNA mismatch repair protein MutS, type 23.2e-12747.07Show/hide
Query:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKVW------------------VNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP
        SS +G GT  EP++AV +ND+LQ ARA+V KAE ++L MLTEK+                   +NARA+Y  ++GG  P++ LP            +G+ 
Subjt:  SSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKVW------------------VNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDP

Query:  TSEASCP-----KKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLV
        + + + P      K EW+LYLP  +HPLLL Q+++ + K +  V+                          F K     L  A P+P DF IS+  RVLV
Subjt:  TSEASCP-----KKNEWVLYLPNAHHPLLLQQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLV

Query:  LTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMS
        +TGPNTGGKT+CLK++GLAAMMAKSGL+VLA+ES +IPWFD+++ADIGDEQSL QSLSTFSGHLK+ISEI S STS+SLVLLDEVGAGTNP+EGAALGM+
Subjt:  LTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMS

Query:  LLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDEII--------
        +LESFA+SG+ LT+ATTHHGELKTLKYSN  FENA MEFD++NLKPTYKILWGVPGRSNAINIA+RLGLP  +++ ARE YG+ASA+I+E+I        
Subjt:  LLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKILWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDEII--------

Query:  -------------------------------KHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMAL
                                        H  ++R +  QE+++A +M RS L + +++ R+SA +SS  + + ++    + + K    G ++   +
Subjt:  -------------------------------KHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMAL

Query:  CTRISSTGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD
          R         P +   + P  G +V+VSSLGK+ATVL V+ SK EI+VQVG MK+K+K TD
Subjt:  CTRISSTGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACTGTTCTAGTACTGCAGGGATTGGTACCATCCTAGAGCCACTCTCTGCGGTTCCTTTAAACGATGAGTTGCAACAAGCAAGGGCAGCAGTGTTAAAAGCTGAGGA
AGACGTTCTCTTTATGCTAACTGAAAAAGTATGGGTCAATGCTCGAGCATCTTATGGTCTTTCATTTGGGGGGACATGTCCCAATGTAATTCTGCCAGAAGGGTGCAACT
CTTCTATTGCTAATGTCTGCTTGTCGGGGGACCCAACATCTGAGGCATCATGCCCAAAGAAGAACGAATGGGTGCTCTATTTACCTAATGCTCATCACCCTTTACTACTC
CAGCAATATAGAGAAAATTTGGAGAAAGCCAAGAGGGATGTCAGAAATGCTTTTGCTGATATAGGGAGAAAACTTCCTGGGGGACATATGTCATGGAAAGAAAAAAAAAA
TGTAGATATTTCATTCTTAAAAATGAAGGTTGAAGAATTGGAGCAAGCTCATCCAGTTCCGCTTGATTTTTCAATATCTCAAAGAATTCGAGTTTTGGTTCTAACTGGCC
CTAATACTGGGGGTAAGACAGTTTGCTTGAAGACCATTGGATTGGCTGCCATGATGGCGAAATCAGGGCTTCATGTTTTAGCTTCAGAATCTGTACAAATCCCTTGGTTT
GACTCTGTTTTTGCTGATATCGGCGATGAACAGTCCCTAACCCAATCTTTGTCCACCTTTTCTGGCCATTTGAAAAAAATAAGTGAGATTCAGTCAGTTTCAACTAGTCA
GTCGTTGGTACTACTGGATGAAGTTGGTGCAGGAACCAATCCTATGGAAGGAGCCGCACTTGGGATGTCACTCCTGGAATCTTTTGCTAAATCTGGTGCTTCATTGACAA
TCGCGACTACACATCATGGAGAACTTAAAACCCTAAAGTATAGCAATGAGGTCTTTGAAAATGCGAGTATGGAATTTGATGAGGTGAACTTAAAGCCAACTTACAAGATT
CTCTGGGGAGTACCAGGGCGTTCAAATGCTATTAATATAGCTGAAAGGTTAGGGTTGCCTGGTACTGTTGTAGATGACGCTCGGGAACATTATGGTGCAGCAAGTGCACA
GATAGATGAGATTATTAAACATGGCAGAGAGCAGAGGCTTAGAAAAGTGCAAGAGGTATCTGAGGCTGCAACCATGGTTCGTTCTAACCTTCACAAAAAAGTACGAGAAC
TGCGTGCATCTGCCATTGAATCCTCCCCGCCCACCGCCATTCGTAGTAGGCAACATGCAGGAATAAGCTCTAATAAGCTAGCTACAGCAGGCAAAAAGAATCCGATGGCA
TTATGTACGCGTATCTCTTCAACTGGTGACATCAGCCAACCACGATCAGAGGAGCCTGAGTTTCCCGTTGCTGGCGATACTGTGTACGTTTCTTCCCTTGGAAAAGAAGC
GACAGTTTTAAGTGTAAAGCCATCAAAAGGCGAAATAGTTGTTCAAGTTGGTAGCATGAAGTTGAAGCTGAAGTTCACTGAC
mRNA sequenceShow/hide mRNA sequence
ATGTACTGTTCTAGTACTGCAGGGATTGGTACCATCCTAGAGCCACTCTCTGCGGTTCCTTTAAACGATGAGTTGCAACAAGCAAGGGCAGCAGTGTTAAAAGCTGAGGA
AGACGTTCTCTTTATGCTAACTGAAAAAGTATGGGTCAATGCTCGAGCATCTTATGGTCTTTCATTTGGGGGGACATGTCCCAATGTAATTCTGCCAGAAGGGTGCAACT
CTTCTATTGCTAATGTCTGCTTGTCGGGGGACCCAACATCTGAGGCATCATGCCCAAAGAAGAACGAATGGGTGCTCTATTTACCTAATGCTCATCACCCTTTACTACTC
CAGCAATATAGAGAAAATTTGGAGAAAGCCAAGAGGGATGTCAGAAATGCTTTTGCTGATATAGGGAGAAAACTTCCTGGGGGACATATGTCATGGAAAGAAAAAAAAAA
TGTAGATATTTCATTCTTAAAAATGAAGGTTGAAGAATTGGAGCAAGCTCATCCAGTTCCGCTTGATTTTTCAATATCTCAAAGAATTCGAGTTTTGGTTCTAACTGGCC
CTAATACTGGGGGTAAGACAGTTTGCTTGAAGACCATTGGATTGGCTGCCATGATGGCGAAATCAGGGCTTCATGTTTTAGCTTCAGAATCTGTACAAATCCCTTGGTTT
GACTCTGTTTTTGCTGATATCGGCGATGAACAGTCCCTAACCCAATCTTTGTCCACCTTTTCTGGCCATTTGAAAAAAATAAGTGAGATTCAGTCAGTTTCAACTAGTCA
GTCGTTGGTACTACTGGATGAAGTTGGTGCAGGAACCAATCCTATGGAAGGAGCCGCACTTGGGATGTCACTCCTGGAATCTTTTGCTAAATCTGGTGCTTCATTGACAA
TCGCGACTACACATCATGGAGAACTTAAAACCCTAAAGTATAGCAATGAGGTCTTTGAAAATGCGAGTATGGAATTTGATGAGGTGAACTTAAAGCCAACTTACAAGATT
CTCTGGGGAGTACCAGGGCGTTCAAATGCTATTAATATAGCTGAAAGGTTAGGGTTGCCTGGTACTGTTGTAGATGACGCTCGGGAACATTATGGTGCAGCAAGTGCACA
GATAGATGAGATTATTAAACATGGCAGAGAGCAGAGGCTTAGAAAAGTGCAAGAGGTATCTGAGGCTGCAACCATGGTTCGTTCTAACCTTCACAAAAAAGTACGAGAAC
TGCGTGCATCTGCCATTGAATCCTCCCCGCCCACCGCCATTCGTAGTAGGCAACATGCAGGAATAAGCTCTAATAAGCTAGCTACAGCAGGCAAAAAGAATCCGATGGCA
TTATGTACGCGTATCTCTTCAACTGGTGACATCAGCCAACCACGATCAGAGGAGCCTGAGTTTCCCGTTGCTGGCGATACTGTGTACGTTTCTTCCCTTGGAAAAGAAGC
GACAGTTTTAAGTGTAAAGCCATCAAAAGGCGAAATAGTTGTTCAAGTTGGTAGCATGAAGTTGAAGCTGAAGTTCACTGAC
Protein sequenceShow/hide protein sequence
MYCSSTAGIGTILEPLSAVPLNDELQQARAAVLKAEEDVLFMLTEKVWVNARASYGLSFGGTCPNVILPEGCNSSIANVCLSGDPTSEASCPKKNEWVLYLPNAHHPLLL
QQYRENLEKAKRDVRNAFADIGRKLPGGHMSWKEKKNVDISFLKMKVEELEQAHPVPLDFSISQRIRVLVLTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWF
DSVFADIGDEQSLTQSLSTFSGHLKKISEIQSVSTSQSLVLLDEVGAGTNPMEGAALGMSLLESFAKSGASLTIATTHHGELKTLKYSNEVFENASMEFDEVNLKPTYKI
LWGVPGRSNAINIAERLGLPGTVVDDAREHYGAASAQIDEIIKHGREQRLRKVQEVSEAATMVRSNLHKKVRELRASAIESSPPTAIRSRQHAGISSNKLATAGKKNPMA
LCTRISSTGDISQPRSEEPEFPVAGDTVYVSSLGKEATVLSVKPSKGEIVVQVGSMKLKLKFTD