; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001692 (gene) of Snake gourd v1 genome

Gene IDTan0001692
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionendonuclease MutS2 isoform X1
Genome locationLG01:308927..344180
RNA-Seq ExpressionTan0001692
SyntenyTan0001692
Gene Ontology termsGO:0006298 - mismatch repair (biological process)
GO:0045910 - negative regulation of DNA recombination (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
InterPro domainsIPR000432 - DNA mismatch repair protein MutS, C-terminal
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR045076 - DNA mismatch repair MutS family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022922843.1 uncharacterized protein LOC111430703 isoform X3 [Cucurbita moschata]8.7e-12482.04Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEVNLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYG
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA
        AASAQIDEVI+DMEC KKKYG+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAATARSN+H++VREL  SAIE S PSA  S QRA
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA

Query:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
          N N L   G+KN MALDT ISSTG+I+QPRSE+PEFPTVGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Subjt:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

XP_022922844.1 uncharacterized protein LOC111430703 isoform X4 [Cucurbita moschata]8.7e-12482.04Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEVNLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYG
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA
        AASAQIDEVI+DMEC KKKYG+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAATARSN+H++VREL  SAIE S PSA  S QRA
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA

Query:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
          N N L   G+KN MALDT ISSTG+I+QPRSE+PEFPTVGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Subjt:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

XP_022985052.1 uncharacterized protein LOC111483140 isoform X3 [Cucurbita maxima]6.3e-12280.99Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEVNLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYG
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA
        AASAQIDEVI+DMEC KKK+G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR Q LRK QEVS+AAATARSN+H++VREL  SAIE S PSAI S QRA
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA

Query:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
          N N L   G+KN MALDT +SSTG+I+QPRSE+P FPTVGD VY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Subjt:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

XP_022985053.1 uncharacterized protein LOC111483140 isoform X4 [Cucurbita maxima]6.3e-12280.99Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEVNLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYG
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA
        AASAQIDEVI+DMEC KKK+G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR Q LRK QEVS+AAATARSN+H++VREL  SAIE S PSAI S QRA
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA

Query:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
          N N L   G+KN MALDT +SSTG+I+QPRSE+P FPTVGD VY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Subjt:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

XP_023552491.1 uncharacterized protein LOC111810138 [Cucurbita pepo subsp. pepo]3.9e-12482.04Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEVNLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYG
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA
        AASAQIDEVI+DMEC KKKYG+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAATARSN+H++VREL  SAIE S PSAI S QRA
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA

Query:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
          N N L   G+KN MALDT +SSTG+I+QPRSE+PEFPTVGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Subjt:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

TrEMBL top hitse value%identityAlignment
A0A6J1E4M8 uncharacterized protein LOC111430703 isoform X44.2e-12482.04Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEVNLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYG
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA
        AASAQIDEVI+DMEC KKKYG+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAATARSN+H++VREL  SAIE S PSA  S QRA
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA

Query:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
          N N L   G+KN MALDT ISSTG+I+QPRSE+PEFPTVGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Subjt:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

A0A6J1E7X9 uncharacterized protein LOC111430703 isoform X34.2e-12482.04Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEVNLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYG
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA
        AASAQIDEVI+DMEC KKKYG+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAATARSN+H++VREL  SAIE S PSA  S QRA
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA

Query:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
          N N L   G+KN MALDT ISSTG+I+QPRSE+PEFPTVGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Subjt:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

A0A6J1E9Y4 uncharacterized protein LOC111430703 isoform X12.0e-12178.72Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLK------------YSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLP
        D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLK            YS EVFENACMEFDEVNLKPTYK+LWGVPGRSNAINIAERLG+P
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLK------------YSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLP

Query:  SSIVDDAREHYGAASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIES
        SS+VDDAREHYGAASAQIDEVI+DMEC KKKYG+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR+Q LRK QEVS+AAATARSN+H++VREL  SAIE 
Subjt:  SSIVDDAREHYGAASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIES

Query:  SSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
        S PSA  S QRA  N N L   G+KN MALDT ISSTG+I+QPRSE+PEFPTVGDTVY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Subjt:  SSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

A0A6J1J3U0 uncharacterized protein LOC111483140 isoform X33.0e-12280.99Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEVNLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYG
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA
        AASAQIDEVI+DMEC KKK+G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR Q LRK QEVS+AAATARSN+H++VREL  SAIE S PSAI S QRA
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA

Query:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
          N N L   G+KN MALDT +SSTG+I+QPRSE+P FPTVGD VY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Subjt:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

A0A6J1JCF6 uncharacterized protein LOC111483140 isoform X43.0e-12280.99Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D VGAGTNPLEGAALGMSLLES AK GA+LTIATTHHGELKTLKYS EVFENACMEFDEVNLKPTYK+LWGVPGRSNAINIAERLG+PSS+VDDAREHYG
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA
        AASAQIDEVI+DMEC KKK+G+LLQEAQN+L DSKNLYEK+LL RRNIIEHGR Q LRK QEVS+AAATARSN+H++VREL  SAIE S PSAI S QRA
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA

Query:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
          N N L   G+KN MALDT +SSTG+I+QPRSE+P FPTVGD VY+SSFGK+ATVL +EP K E+ V+VGSIKLKLKFTDIMR
Subjt:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

SwissProt top hitse value%identityAlignment
A0PZP4 Endonuclease MutS22.6e-3032.76Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D +GAGT+P EGAAL +S+LE+L K    + IATTH+ ELK      +  ENA +EFD   L+PTY+LL GVPG+SNA  I++RLGLP  I++DARE   
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDME---CIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSS
          + + +++I  ++      +++    + A+   +  K  YE  L   ++I E       ++A+E+ K A      I K +REL                
Subjt:  AASAQIDEVIIDME---CIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSS

Query:  QRAGVNSNKLARAGEKNPMALDTRISST-GNISQPRSEKPEFPTV--GDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
        +R G +S+ + +  E+N   L  ++  T   ++QP+        V  GD +Y+  F  +  VL+    KG++ VQ G +K+K+   D+ +
Subjt:  QRAGVNSNKLARAGEKNPMALDTRISST-GNISQPRSEKPEFPTV--GDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

A5D0W6 Endonuclease MutS21.2e-3034.74Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D +GAGT+P EGAAL  S+LE L  +GA  T+ATTH+GELK    ++E  ENA +EFD + L+PTY+LL G PGRSNA  IA RLGLP  +V  AR    
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKA-AATARSNIHKRVRELHVSAIESSSPSAIGSSQR
        A   Q +E++  +E  +++     + A     +++ L E+   +  ++    RE  L KA E ++A    AR      VREL     E          + 
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKA-AATARSNIHKRVRELHVSAIESSSPSAIGSSQR

Query:  AGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
        A    N +  A EK    L  R+           E P     G+ V+++ + ++  VL      GE++VQVG IK+ +   ++ R
Subjt:  AGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

A6LS00 Endonuclease MutS21.5e-3032.4Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D +GAGT+P EGAAL +S+LE+L   GA L IATTH+ ELK      +  ENA +EFD   L+PTY+LL GVPG+SNA  I++RLGL   ++  A+E+  
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDME---CIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSS
          + Q + +I +++    I KK     +  ++   D K  YE+ L    N  E       R+A+E+   A     +I K +REL    I           
Subjt:  AASAQIDEVIIDME---CIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSS

Query:  QRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR
          AG    +L    +K   +L+ R      + +   E     T+G   Y+ S  ++  ++S+   +GE+ V+ G +K+ +K  D+ +
Subjt:  QRAGVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR

C3KTI4 Endonuclease MutS25.8e-3034.38Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D +GAGT+P EGAAL +S+LE+L K GA + IATTH+ ELK     KE  ENA +EFD   L+PTY+LL G+PG+SNA  I++RLGLP  I+D ARE+  
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMD---SKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSS
          + + +E+I +++    K  E  + A+N  ++    K  YE+ L   + + ++      R+A+ + K A      I K +R+L                
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMD---SKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSS

Query:  QRAGVNSN---KLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDI
        +R G +S+   KL    +K    LD+ I      +  + E  +    GD V ++S  ++  VLS    KG+++VQ G +K+     D+
Subjt:  QRAGVNSN---KLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDI

P73625 Endonuclease MutS26.4e-3736.88Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D VGAGT+P EG+AL ++LL  LA     LT+ATTH+GELK LKY    FENA +EFD+ +L PTY+LLWG+PGRSNA+ IA+RLGLP +IV+ A++  G
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA
          S  I++VI  +E  +++  +    AQ  L +++  Y+++     ++    RE    + QEV +A A A+  I K +R+L     +   PSA  + Q  
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA

Query:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDI
         +      +A +K  +A            +P   +   PTVG+ + I SFG+ A V  +      + V +G +K+ +   DI
Subjt:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDI

Arabidopsis top hitse value%identityAlignment
AT1G65070.1 DNA mismatch repair protein MutS, type 21.3e-1625.08Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D + +GT+P EG AL  S+L+ + K+  ++ + +TH+G+L  LK ++  F+NA MEF    L+PT+++LWG  G SNA+ +A+ +G    I+++A +   
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQ---EAQNHL-----------MDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAI
          + + D        +++K G L Q   E +N L            D  NLY ++     ++ +  R    ++ Q+V +   +A+S + + V E      
Subjt:  AASAQIDEVIIDMECIKKKYGELLQ---EAQNHL-----------MDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAI

Query:  ESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNIS-----QPRSEKPEF----PTVGDTVYISSFGKEATVLSIEPLKGE--IVVQVGSIKL
                        +  ++ +A + N + L T  +    I       P S + E+    P  G+ V ++  G +   +  EP   +  ++VQ G I++
Subjt:  ESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNIS-----QPRSEKPEF----PTVGDTVYISSFGKEATVLSIEPLKGE--IVVQVGSIKL

Query:  KLKFTDI
        ++K  DI
Subjt:  KLKFTDI

AT1G65070.2 DNA mismatch repair protein MutS, type 21.3e-1625.08Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D + +GT+P EG AL  S+L+ + K+  ++ + +TH+G+L  LK ++  F+NA MEF    L+PT+++LWG  G SNA+ +A+ +G    I+++A +   
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQ---EAQNHL-----------MDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAI
          + + D        +++K G L Q   E +N L            D  NLY ++     ++ +  R    ++ Q+V +   +A+S + + V E      
Subjt:  AASAQIDEVIIDMECIKKKYGELLQ---EAQNHL-----------MDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAI

Query:  ESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNIS-----QPRSEKPEF----PTVGDTVYISSFGKEATVLSIEPLKGE--IVVQVGSIKL
                        +  ++ +A + N + L T  +    I       P S + E+    P  G+ V ++  G +   +  EP   +  ++VQ G I++
Subjt:  ESSSPSAIGSSQRAGVNSNKLARAGEKNPMALDTRISSTGNIS-----QPRSEKPEF----PTVGDTVYISSFGKEATVLSIEPLKGE--IVVQVGSIKL

Query:  KLKFTDI
        ++K  DI
Subjt:  KLKFTDI

AT3G24320.1 MUTL protein homolog 16.7e-0533Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENAC-MEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHY
        D +  GT   +G  +  S++ESL  SG    ++T  HG       +K +   A   E  E   KPT+KL  GV   S A   A+R G+P S++  A   Y
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENAC-MEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHY

AT5G54090.1 DNA mismatch repair protein MutS, type 25.4e-7149.82Show/hide
Query:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG
        D VGAGTNPLEGAALGM++LES A+SG+ LT+ATTHHGELKTLKYS   FENACMEFD++NLKPTYK+LWGVPGRSNAINIA+RLGLP  I++ ARE YG
Subjt:  DTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYG

Query:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA
        +ASA+I+EVI+DME  K++Y  LL E++ ++  S+ L+E +L  ++NI +H  ++  +  QE+++A +  RS + + +++   SA   SS S + +  + 
Subjt:  AASAQIDEVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRA

Query:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIM
         V + K    G ++   ++ R         P +   + P VG +V++SS GK+ATVL +E  K EI+VQVG +K+K+K TD++
Subjt:  GVNSNKLARAGEKNPMALDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGGAGATACAGTTGGTGCAGGAACCAATCCTCTGGAGGGAGCCGCACTTGGGATGTCACTCCTGGAATCTTTGGCTAAATCTGGTGCTTCATTGACAATCGCGAC
TACACATCATGGAGAACTTAAAACCCTAAAATATAGCAAGGAGGTCTTTGAAAATGCGTGTATGGAATTTGATGAGGTGAACTTAAAGCCAACTTACAAGCTTCTCTGGG
GAGTACCAGGGCGTTCAAATGCTATTAATATAGCTGAAAGGTTGGGGTTGCCTAGTTCTATTGTAGATGATGCTCGTGAACATTATGGTGCGGCAAGTGCACAGATAGAT
GAGGTCATAATCGATATGGAATGTATAAAGAAAAAGTATGGGGAGCTCTTACAAGAAGCTCAAAATCACTTGATGGACTCAAAGAATCTTTATGAAAAGATGTTGCTTGT
TAGGAGAAACATTATTGAACATGGCAGAGAACAGACGCTTAGAAAAGCGCAAGAGGTATCCAAGGCTGCAGCAACAGCTCGTTCTAACATTCACAAAAGAGTACGAGAAC
TGCATGTGTCTGCTATTGAATCCTCCTCACCCTCTGCCATTGGTAGTAGTCAGCGTGCAGGAGTGAACTCTAATAAGCTAGCTAGAGCAGGCGAAAAGAATCCGATGGCA
TTGGATACTCGCATCTCTTCAACCGGTAACATCAGTCAACCACGATCAGAGAAGCCTGAGTTTCCCACTGTTGGCGATACTGTGTACATTTCTTCGTTTGGAAAAGAAGC
GACTGTTTTAAGTATAGAGCCATTAAAAGGCGAAATAGTTGTTCAAGTTGGTAGCATAAAGTTGAAGCTGAAGTTCACCGACATAATGAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGACAGGAGATACAGTTGGTGCAGGAACCAATCCTCTGGAGGGAGCCGCACTTGGGATGTCACTCCTGGAATCTTTGGCTAAATCTGGTGCTTCATTGACAATCGCGAC
TACACATCATGGAGAACTTAAAACCCTAAAATATAGCAAGGAGGTCTTTGAAAATGCGTGTATGGAATTTGATGAGGTGAACTTAAAGCCAACTTACAAGCTTCTCTGGG
GAGTACCAGGGCGTTCAAATGCTATTAATATAGCTGAAAGGTTGGGGTTGCCTAGTTCTATTGTAGATGATGCTCGTGAACATTATGGTGCGGCAAGTGCACAGATAGAT
GAGGTCATAATCGATATGGAATGTATAAAGAAAAAGTATGGGGAGCTCTTACAAGAAGCTCAAAATCACTTGATGGACTCAAAGAATCTTTATGAAAAGATGTTGCTTGT
TAGGAGAAACATTATTGAACATGGCAGAGAACAGACGCTTAGAAAAGCGCAAGAGGTATCCAAGGCTGCAGCAACAGCTCGTTCTAACATTCACAAAAGAGTACGAGAAC
TGCATGTGTCTGCTATTGAATCCTCCTCACCCTCTGCCATTGGTAGTAGTCAGCGTGCAGGAGTGAACTCTAATAAGCTAGCTAGAGCAGGCGAAAAGAATCCGATGGCA
TTGGATACTCGCATCTCTTCAACCGGTAACATCAGTCAACCACGATCAGAGAAGCCTGAGTTTCCCACTGTTGGCGATACTGTGTACATTTCTTCGTTTGGAAAAGAAGC
GACTGTTTTAAGTATAGAGCCATTAAAAGGCGAAATAGTTGTTCAAGTTGGTAGCATAAAGTTGAAGCTGAAGTTCACCGACATAATGAGATAA
Protein sequenceShow/hide protein sequence
MTGDTVGAGTNPLEGAALGMSLLESLAKSGASLTIATTHHGELKTLKYSKEVFENACMEFDEVNLKPTYKLLWGVPGRSNAINIAERLGLPSSIVDDAREHYGAASAQID
EVIIDMECIKKKYGELLQEAQNHLMDSKNLYEKMLLVRRNIIEHGREQTLRKAQEVSKAAATARSNIHKRVRELHVSAIESSSPSAIGSSQRAGVNSNKLARAGEKNPMA
LDTRISSTGNISQPRSEKPEFPTVGDTVYISSFGKEATVLSIEPLKGEIVVQVGSIKLKLKFTDIMR