; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020838 (gene) of Snake gourd v1 genome

Gene IDTan0020838
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionendonuclease MutS2 isoform X1
Genome locationLG01:360238..413172
RNA-Seq ExpressionTan0020838
SyntenyTan0020838
Gene Ontology termsGO:0006298 - mismatch repair (biological process)
GO:0045910 - negative regulation of DNA recombination (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
InterPro domainsIPR000432 - DNA mismatch repair protein MutS, C-terminal
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR045076 - DNA mismatch repair MutS family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022922845.1 uncharacterized protein LOC111430703 isoform X5 [Cucurbita moschata]3.2e-10985.36Show/hide
Query:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL
        G+GT+LEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEKVNARASY LSFGG CPNL+LP G NSSIANV  SGDQ S+ASHP +N+ VLYLPNAHHPLL
Subjt:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL

Query:  LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV
         QQYRE+LENAKRDVRNA TEIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVDF+IS R+RVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV
Subjt:  LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV

Query:  LASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS
        LASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHLRKIS
Subjt:  LASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS

XP_022985054.1 uncharacterized protein LOC111483140 isoform X5 [Cucurbita maxima]1.1e-10986.19Show/hide
Query:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL
        G+GTILEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEKVNARASY LSFGG CPNL+LP G NSSIANV  SGDQ SEASHP +N+ VLYLPNAHHPLL
Subjt:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL

Query:  LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV
         QQYRE+LENAKRDVRNA TEIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVDF+IS R+RVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV
Subjt:  LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV

Query:  LASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS
        LASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHLRKIS
Subjt:  LASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS

XP_038894013.1 endonuclease MutS2 isoform X7 [Benincasa hispida]1.1e-10981.23Show/hide
Query:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK----------------------VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTS
        G G ILEPLSAVPLNDELQQA+ASVAKAE+DVLFMLTEK                      VNARASY LSFGGTCPNL+LPEG NSSIANVC SGDQTS
Subjt:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK----------------------VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTS

Query:  EASHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTG
        EASH  KNE VLYL NAHHPLLLQQYRENLENAKRDV+NAFTE+GRKLPGG M WKEK VVDIS L+MKVE+LEKA P+SVDFSIS+R++VLVITGPNTG
Subjt:  EASHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTG

Query:  GKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS
        GKTVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADIGDEQSLTQSLSTFSGHLRKIS
Subjt:  GKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS

XP_038894019.1 endonuclease MutS2 isoform X8 [Benincasa hispida]6.5e-11081.85Show/hide
Query:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEA
        G G ILEPLSAVPLNDELQQA+ASVAKAE+DVLFMLTEK                    VNARASY LSFGGTCPNL+LPEG NSSIANVC SGDQTSEA
Subjt:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEA

Query:  SHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGK
        SH  KNE VLYL NAHHPLLLQQYRENLENAKRDV+NAFTE+GRKLPGG M WKEK VVDIS L+MKVE+LEKA P+SVDFSIS+R++VLVITGPNTGGK
Subjt:  SHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGK

Query:  TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS
        TVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADIGDEQSLTQSLSTFSGHLRKIS
Subjt:  TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS

XP_038894074.1 endonuclease MutS2 isoform X11 [Benincasa hispida]1.6e-11388.7Show/hide
Query:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL
        G G ILEPLSAVPLNDELQQA+ASVAKAE+DVLFMLTEKVNARASY LSFGGTCPNL+LPEG NSSIANVC SGDQTSEASH  KNE VLYL NAHHPLL
Subjt:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL

Query:  LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV
        LQQYRENLENAKRDV+NAFTE+GRKLPGG M WKEK VVDIS L+MKVE+LEKA P+SVDFSIS+R++VLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV
Subjt:  LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV

Query:  LASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS
        LASES QIPWFDS+FADIGDEQSLTQSLSTFSGHLRKIS
Subjt:  LASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS

TrEMBL top hitse value%identityAlignment
A0A6J1E9Z0 uncharacterized protein LOC111430703 isoform X51.6e-10985.36Show/hide
Query:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL
        G+GT+LEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEKVNARASY LSFGG CPNL+LP G NSSIANV  SGDQ S+ASHP +N+ VLYLPNAHHPLL
Subjt:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL

Query:  LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV
         QQYRE+LENAKRDVRNA TEIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVDF+IS R+RVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV
Subjt:  LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV

Query:  LASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS
        LASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHLRKIS
Subjt:  LASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS

A0A6J1J3U0 uncharacterized protein LOC111483140 isoform X33.6e-10678.93Show/hide
Query:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK----------------------VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTS
        G+GTILEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEK                      VNARASY LSFGG CPNL+LP G NSSIANV  SGDQ S
Subjt:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK----------------------VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTS

Query:  EASHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTG
        EASHP +N+ VLYLPNAHHPLL QQYRE+LENAKRDVRNA TEIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVDF+IS R+RVLVITGPNTG
Subjt:  EASHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTG

Query:  GKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS
        GKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHLRKIS
Subjt:  GKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS

A0A6J1J709 uncharacterized protein LOC111483140 isoform X22.1e-10679.54Show/hide
Query:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEA
        G+GTILEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEK                    VNARASY LSFGG CPNL+LP G NSSIANV  SGDQ SEA
Subjt:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEA

Query:  SHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGK
        SHP +N+ VLYLPNAHHPLL QQYRE+LENAKRDVRNA TEIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVDF+IS R+RVLVITGPNTGGK
Subjt:  SHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGK

Query:  TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS
        TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHLRKIS
Subjt:  TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS

A0A6J1JC76 uncharacterized protein LOC111483140 isoform X55.3e-11086.19Show/hide
Query:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL
        G+GTILEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEKVNARASY LSFGG CPNL+LP G NSSIANV  SGDQ SEASHP +N+ VLYLPNAHHPLL
Subjt:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLL

Query:  LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV
         QQYRE+LENAKRDVRNA TEIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVDF+IS R+RVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV
Subjt:  LQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHV

Query:  LASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS
        LASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHLRKIS
Subjt:  LASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS

A0A6J1JCF6 uncharacterized protein LOC111483140 isoform X42.1e-10679.54Show/hide
Query:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEA
        G+GTILEPLSAVPLNDELQQA+A+VAKAE+DVLFMLTEK                    VNARASY LSFGG CPNL+LP G NSSIANV  SGDQ SEA
Subjt:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------VNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEA

Query:  SHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGK
        SHP +N+ VLYLPNAHHPLL QQYRE+LENAKRDVRNA TEIGRKLPGG M WKEK V DIS L+MKVE+LE+A P+SVDF+IS R+RVLVITGPNTGGK
Subjt:  SHPNKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGK

Query:  TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS
        TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHLRKIS
Subjt:  TVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS

SwissProt top hitse value%identityAlignment
B8D298 Endonuclease MutS21.7e-2047.17Show/hide
Query:  EKNVVDISFLRMK--VEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFS
        E  + D  F+ ++     L K  P+ +D ++    + LVITGPNTGGKTV LKT+GL  +M ++GLH+ A E   I  F+ V+ADIGDEQS+ Q+LSTFS
Subjt:  EKNVVDISFLRMK--VEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFS

Query:  GHLRKI
         H+ +I
Subjt:  GHLRKI

B9KYW4 Endonuclease MutS22.2e-2048.51Show/hide
Query:  FLRMKVEE-----LEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRK
        FLR+++       L++   + +D  + +R R+LVITGPNTGGKTV LKT+GL A+MA++GL + A+    +  F ++F DIGDEQS+ Q+LSTFS H+R+
Subjt:  FLRMKVEE-----LEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRK

Query:  I
        I
Subjt:  I

C0Z9F1 Endonuclease MutS21.3e-2037.22Show/hide
Query:  ERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEK----NVVDISFLRMKVEE---LEKACPISVDFSISQRVRVLVITGPNTGG
        ER+LY+        ++   EN E        A TE+        + W  K     + D  ++ M+      + +   + VD  +    + +V+TGPNTGG
Subjt:  ERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEK----NVVDISFLRMKVEE---LEKACPISVDFSISQRVRVLVITGPNTGG

Query:  KTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRK-ISFLATHVMWGLVPFCELIRG
        KTV LKTIGL ++M  +GLH+ A E  ++  F S+FADIGDEQS+ QSLSTFS H+   I  LA      LV F EL  G
Subjt:  KTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRK-ISFLATHVMWGLVPFCELIRG

P73625 Endonuclease MutS24.8e-2345.19Show/hide
Query:  LENAKRDVRNAFTEIGRKLPGGTMPWKEKNV----VDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLAS
        L+ A   VR +F  +G   P    P  EK +    +    L  + E+      + +  +I  ++RV+ ITGPNTGGKTV LKT+GL A+MAK GL++ A 
Subjt:  LENAKRDVRNAFTEIGRKLPGGTMPWKEKNV----VDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLAS

Query:  ESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI
        E+V++PWF  + ADIGDEQSL Q+LSTFSGH+ +I
Subjt:  ESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI

Q5WEK0 Endonuclease MutS21.0e-2051.92Show/hide
Query:  EKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGH
        ++  +D+   R  +   +K  P   D +I  +VR LVITGPNTGGKTV LKTIGL  +MA+SGL V A+E  ++  F+ +FADIGDEQS+ QSLSTFS H
Subjt:  EKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGH

Query:  LRKI
        ++ I
Subjt:  LRKI

Arabidopsis top hitse value%identityAlignment
AT1G65070.1 DNA mismatch repair protein MutS, type 23.2e-2255.42Show/hide
Query:  PISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI
        P+ VD  +    +V+VI+GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD + ADIGD QSL QSLSTFSGH+ +I
Subjt:  PISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI

AT1G65070.2 DNA mismatch repair protein MutS, type 23.2e-2255.42Show/hide
Query:  PISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI
        P+ VD  +    +V+VI+GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD + ADIGD QSL QSLSTFSGH+ +I
Subjt:  PISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI

AT3G24320.1 MUTL protein homolog 14.6e-0536.26Show/hide
Query:  VLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI-SFLATHVMWGLVPFCELIRG
        + ++TGPN GGK+  L++I  AA++  SGL V A ES  IP FDS+   +    S     S+F   + +I S ++      LV   E+ RG
Subjt:  VLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI-SFLATHVMWGLVPFCELIRG

AT4G25540.1 homolog of DNA mismatch repair protein MSH31.2e-0533.06Show/hide
Query:  VITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS-FLATHVMWGLVPFCELIRGFGSNSIFVIIY
        +ITGPN GGK+  ++ + L ++MA+ G  V AS   ++   D VF  +G   S+    STF   L + S  + T     LV   EL RG  ++    I Y
Subjt:  VITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKIS-FLATHVMWGLVPFCELIRGFGSNSIFVIIY

Query:  FTWSLFVFPSSCLDL-ITDFP
         T    +    CL L +T +P
Subjt:  FTWSLFVFPSSCLDL-ITDFP

AT5G54090.1 DNA mismatch repair protein MutS, type 24.1e-5446.24Show/hide
Query:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------VNARASYSLSFGGTCPNLVL-PEGRNSSIANVCSSGDQTSE
        G GT  EP++AV +ND+LQ A+ASVAKAE ++L MLTEK                    +NARA+YS ++GG  P++ L PE    S++   +S D    
Subjt:  GVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEK--------------------VNARASYSLSFGGTCPNLVL-PEGRNSSIANVCSSGDQTSE

Query:  ASHP-NKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTG
        +  P +K E +LYLP  +HPLLL Q+++ +   +  V+                          F +     L  A PI  DF IS+  RVLVITGPNTG
Subjt:  ASHP-NKNERVLYLPNAHHPLLLQQYRENLENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTG

Query:  GKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKISFLATH
        GKT+CLK++GLAAMMAKSGL+VLA+ES +IPWFD+++ADIGDEQSL QSLSTFSGHL++IS + +H
Subjt:  GKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKISFLATH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGAGGGGTTGGTACCATCCTAGAGCCACTCTCTGCCGTTCCTTTAAACGATGAGTTGCAACAAGCAAAGGCATCAGTGGCAAAAGCTGAGAAAGATGTTCTCTT
TATGCTAACTGAAAAAGTCAATGCTCGAGCATCTTATAGTCTTTCATTTGGAGGGACATGTCCCAATTTAGTTCTACCAGAAGGGCGCAACTCTTCTATTGCTAATGTCT
GCTCATCAGGAGACCAAACATCTGAGGCATCGCACCCAAATAAGAATGAACGGGTTCTCTATTTACCAAATGCCCATCACCCTTTACTACTTCAGCAATACAGAGAAAAT
TTGGAGAATGCCAAGCGAGATGTCAGAAATGCTTTTACTGAGATAGGGAGAAAACTTCCTGGGGGGACTATGCCATGGAAAGAAAAAAATGTTGTTGATATTTCATTCTT
AAGAATGAAGGTTGAAGAATTGGAGAAAGCTTGTCCGATTTCGGTTGATTTTTCAATATCTCAAAGAGTTCGAGTTTTAGTTATAACTGGCCCTAATACTGGGGGTAAGA
CAGTTTGTTTGAAGACCATTGGATTGGCAGCCATGATGGCGAAATCAGGTCTTCATGTTTTAGCTTCAGAGTCTGTACAAATCCCTTGGTTTGATTCTGTTTTTGCTGAT
ATCGGTGATGAACAGTCCCTAACCCAATCTTTGTCTACTTTTTCTGGACATTTGAGAAAAATAAGCTTCTTAGCTACTCATGTCATGTGGGGGTTAGTACCTTTTTGTGA
GTTGATAAGGGGCTTTGGAAGTAATTCTATTTTTGTAATTATCTATTTTACTTGGAGCCTCTTTGTCTTTCCATCTAGCTGCCTCGATCTGATTACTGATTTTCCTTGTT
CTCTAAAACTCTTTGATTGCAGTATAGTAGAGAAGGCCCAAAAAGTAGATGATGTTACGGTGAAACAAGATATGAGAATTCTTATTCCATTCTATACAATAGAGTCGAAT
ATATATAGGCATACATGGAAGCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGGAGGGGTTGGTACCATCCTAGAGCCACTCTCTGCCGTTCCTTTAAACGATGAGTTGCAACAAGCAAAGGCATCAGTGGCAAAAGCTGAGAAAGATGTTCTCTT
TATGCTAACTGAAAAAGTCAATGCTCGAGCATCTTATAGTCTTTCATTTGGAGGGACATGTCCCAATTTAGTTCTACCAGAAGGGCGCAACTCTTCTATTGCTAATGTCT
GCTCATCAGGAGACCAAACATCTGAGGCATCGCACCCAAATAAGAATGAACGGGTTCTCTATTTACCAAATGCCCATCACCCTTTACTACTTCAGCAATACAGAGAAAAT
TTGGAGAATGCCAAGCGAGATGTCAGAAATGCTTTTACTGAGATAGGGAGAAAACTTCCTGGGGGGACTATGCCATGGAAAGAAAAAAATGTTGTTGATATTTCATTCTT
AAGAATGAAGGTTGAAGAATTGGAGAAAGCTTGTCCGATTTCGGTTGATTTTTCAATATCTCAAAGAGTTCGAGTTTTAGTTATAACTGGCCCTAATACTGGGGGTAAGA
CAGTTTGTTTGAAGACCATTGGATTGGCAGCCATGATGGCGAAATCAGGTCTTCATGTTTTAGCTTCAGAGTCTGTACAAATCCCTTGGTTTGATTCTGTTTTTGCTGAT
ATCGGTGATGAACAGTCCCTAACCCAATCTTTGTCTACTTTTTCTGGACATTTGAGAAAAATAAGCTTCTTAGCTACTCATGTCATGTGGGGGTTAGTACCTTTTTGTGA
GTTGATAAGGGGCTTTGGAAGTAATTCTATTTTTGTAATTATCTATTTTACTTGGAGCCTCTTTGTCTTTCCATCTAGCTGCCTCGATCTGATTACTGATTTTCCTTGTT
CTCTAAAACTCTTTGATTGCAGTATAGTAGAGAAGGCCCAAAAAGTAGATGATGTTACGGTGAAACAAGATATGAGAATTCTTATTCCATTCTATACAATAGAGTCGAAT
ATATATAGGCATACATGGAAGCTCTAA
Protein sequenceShow/hide protein sequence
MKGGVGTILEPLSAVPLNDELQQAKASVAKAEKDVLFMLTEKVNARASYSLSFGGTCPNLVLPEGRNSSIANVCSSGDQTSEASHPNKNERVLYLPNAHHPLLLQQYREN
LENAKRDVRNAFTEIGRKLPGGTMPWKEKNVVDISFLRMKVEELEKACPISVDFSISQRVRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFAD
IGDEQSLTQSLSTFSGHLRKISFLATHVMWGLVPFCELIRGFGSNSIFVIIYFTWSLFVFPSSCLDLITDFPCSLKLFDCSIVEKAQKVDDVTVKQDMRILIPFYTIESN
IYRHTWKL