; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G09230 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G09230
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionEndonuclease MutS2
Genome locationClcChr07:23598647..23611452
RNA-Seq ExpressionClc07G09230
SyntenyClc07G09230
Gene Ontology termsGO:0006298 - mismatch repair (biological process)
GO:0045910 - negative regulation of DNA recombination (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
InterPro domainsIPR000432 - DNA mismatch repair protein MutS, C-terminal
IPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR043502 - DNA/RNA polymerase superfamily
IPR045076 - DNA mismatch repair MutS family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008462896.1 PREDICTED: endonuclease MutS2 isoform X2 [Cucumis melo]1.0e-15186.29Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
        +L QLMD+LVRNAKSGTS L VE+VDGRWC+K+EGD+LMDVKGLLLSS TGIGSI+EP+SAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL

Query:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNM
        IGCIIELDVVNARASYGLSFGGTCPNLIL EGCNSSIANVCLSGDQ SEASH KKNEWVLYLQN HHPLLLQQYR+NLENAKRDV+NAF++GRKLPGGNM
Subjt:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNM

Query:  SWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTF
        SWKEKE VD+SL K KVEQLEQA PVSVDFSIS R++VL++TGPNTGGKTVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADIGDEQSLTQSLSTF
Subjt:  SWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTF

Query:  SGHLRKISAVGTFPRKKKSIL
        SGHLRKIS + +    +  +L
Subjt:  SGHLRKISAVGTFPRKKKSIL

XP_011653396.1 uncharacterized protein LOC101220812 isoform X2 [Cucumis sativus]2.1e-15286.29Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
        +L QLMDSLVR+AKSGTSFLEVE+VDGRWCIK+EGD+LMDVKGLLLSS  GIGS +EP+SAVPLNDELQQARASVAKAEEDVLF+LTEKVKMDFEDISKL
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL

Query:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNM
        IGCIIELDVVNARASYGLSFGGTCPNL+L EGCNSSIANVCLSGDQ SEASHLKKNEWVLYLQN HHPLLLQQYR+NL+NAKRDV+NAF+MGRK PGGNM
Subjt:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNM

Query:  SWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTF
        SWKEKE +D+SL K KV+QLEQARPVSVDFSIS RI+VL++TGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDS+FADIGDEQSLTQSLSTF
Subjt:  SWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTF

Query:  SGHLRKISAVGTFPRKKKSIL
        SGHLRKIS + +    +  +L
Subjt:  SGHLRKISAVGTFPRKKKSIL

XP_038894006.1 endonuclease MutS2 isoform X6 [Benincasa hispida]5.5e-15384.16Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
        +L QLMDSLVRNAKSGTSFLEVE+VDGRWCIK+EGD+LMDVKGLLLSS TG G ILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDI+KL
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL

Query:  IGCIIELDV-------------------VNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENA
        IGCIIELDV                   VNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASH KKNEWVLYLQNAHHPLLLQQYR+NLENA
Subjt:  IGCIIELDV-------------------VNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENA

Query:  KRDVQNAFS-MGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWF
        KRDVQNAF+ MGRKLPGGNMSWKEKE VD+SLLK KVEQLE+ARPVSVDFSIS RI+VL++TGPNTGGKTVCLKTIGLAAMMAKSGLHVLASES QIPWF
Subjt:  KRDVQNAFS-MGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWF

Query:  DSVFADIGDEQSLTQSLSTFSGHLRKISAVGTFPRKKKSIL
        DS+FADIGDEQSLTQSLSTFSGHLRKIS + +    +  +L
Subjt:  DSVFADIGDEQSLTQSLSTFSGHLRKISAVGTFPRKKKSIL

XP_038894013.1 endonuclease MutS2 isoform X7 [Benincasa hispida]5.9e-15588.58Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEK--VKMDFEDIS
        +L QLMDSLVRNAKSGTSFLEVE+VDGRWCIK+EGD+LMDVKGLLLSS TG G ILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEK  VKMDFEDI+
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEK--VKMDFEDIS

Query:  KLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPG
        KLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASH KKNEWVLYLQNAHHPLLLQQYR+NLENAKRDVQNAF+ MGRKLPG
Subjt:  KLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPG

Query:  GNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSL
        GNMSWKEKE VD+SLLK KVEQLE+ARPVSVDFSIS RI+VL++TGPNTGGKTVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADIGDEQSLTQSL
Subjt:  GNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSL

Query:  STFSGHLRKISAVGTFPRKKKSIL
        STFSGHLRKIS + +    +  +L
Subjt:  STFSGHLRKISAVGTFPRKKKSIL

XP_038894019.1 endonuclease MutS2 isoform X8 [Benincasa hispida]1.8e-15689.13Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
        +L QLMDSLVRNAKSGTSFLEVE+VDGRWCIK+EGD+LMDVKGLLLSS TG G ILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDI+KL
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL

Query:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPGGN
        IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASH KKNEWVLYLQNAHHPLLLQQYR+NLENAKRDVQNAF+ MGRKLPGGN
Subjt:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPGGN

Query:  MSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLST
        MSWKEKE VD+SLLK KVEQLE+ARPVSVDFSIS RI+VL++TGPNTGGKTVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADIGDEQSLTQSLST
Subjt:  MSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLST

Query:  FSGHLRKISAVGTFPRKKKSIL
        FSGHLRKIS + +    +  +L
Subjt:  FSGHLRKISAVGTFPRKKKSIL

TrEMBL top hitse value%identityAlignment
A0A0A0KVU2 Uncharacterized protein2.5e-15189.07Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
        +L QLMDSLVR+AKSGTSFLEVE+VDGRWCIK+EGD+LMDVKGLLLSS  GIGS +EP+SAVPLNDELQQARASVAKAEEDVLF+LTEKVKMDFEDISKL
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL

Query:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPGGN
        IGCIIELDVVNARASYGLSFGGTCPNL+L EGCNSSIANVCLSGDQ SEASHLKKNEWVLYLQN HHPLLLQQYR+NL+NAKRDV+NAF+ MGRK PGGN
Subjt:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPGGN

Query:  MSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLST
        MSWKEKE +D+SL K KV+QLEQARPVSVDFSIS RI+VL++TGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDS+FADIGDEQSLTQSLST
Subjt:  MSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLST

Query:  FSGHLRKISAV
        FSGHLRKIS V
Subjt:  FSGHLRKISAV

A0A1S3CHX9 endonuclease MutS2 isoform X25.1e-15286.29Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
        +L QLMD+LVRNAKSGTS L VE+VDGRWC+K+EGD+LMDVKGLLLSS TGIGSI+EP+SAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL

Query:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNM
        IGCIIELDVVNARASYGLSFGGTCPNLIL EGCNSSIANVCLSGDQ SEASH KKNEWVLYLQN HHPLLLQQYR+NLENAKRDV+NAF++GRKLPGGNM
Subjt:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNM

Query:  SWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTF
        SWKEKE VD+SL K KVEQLEQA PVSVDFSIS R++VL++TGPNTGGKTVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADIGDEQSLTQSLSTF
Subjt:  SWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTF

Query:  SGHLRKISAVGTFPRKKKSIL
        SGHLRKIS + +    +  +L
Subjt:  SGHLRKISAVGTFPRKKKSIL

A0A1S3CHZ5 endonuclease MutS2 isoform X11.2e-15086.02Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
        +L QLMD+LVRNAKSGTS L VE+VDGRWC+K+EGD+LMDVKGLLLSS TGIGSI+EP+SAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL

Query:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPGGN
        IGCIIELDVVNARASYGLSFGGTCPNLIL EGCNSSIANVCLSGDQ SEASH KKNEWVLYLQN HHPLLLQQYR+NLENAKRDV+NAF+ +GRKLPGGN
Subjt:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPGGN

Query:  MSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLST
        MSWKEKE VD+SL K KVEQLEQA PVSVDFSIS R++VL++TGPNTGGKTVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADIGDEQSLTQSLST
Subjt:  MSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLST

Query:  FSGHLRKISAVGTFPRKKKSIL
        FSGHLRKIS + +    +  +L
Subjt:  FSGHLRKISAVGTFPRKKKSIL

A0A1S3CJJ8 endonuclease MutS2 isoform X41.2e-15086.02Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
        +L QLMD+LVRNAKSGTS L VE+VDGRWC+K+EGD+LMDVKGLLLSS TGIGSI+EP+SAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL

Query:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPGGN
        IGCIIELDVVNARASYGLSFGGTCPNLIL EGCNSSIANVCLSGDQ SEASH KKNEWVLYLQN HHPLLLQQYR+NLENAKRDV+NAF+ +GRKLPGGN
Subjt:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPGGN

Query:  MSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLST
        MSWKEKE VD+SL K KVEQLEQA PVSVDFSIS R++VL++TGPNTGGKTVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADIGDEQSLTQSLST
Subjt:  MSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLST

Query:  FSGHLRKISAVGTFPRKKKSIL
        FSGHLRKIS + +    +  +L
Subjt:  FSGHLRKISAVGTFPRKKKSIL

A0A5D3BZK6 Endonuclease MutS2 isoform X11.2e-15086.02Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
        +L QLMD+LVRNAKSGTS L VE+VDGRWC+K+EGD+LMDVKGLLLSS TGIGSI+EP+SAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL

Query:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPGGN
        IGCIIELDVVNARASYGLSFGGTCPNLIL EGCNSSIANVCLSGDQ SEASH KKNEWVLYLQN HHPLLLQQYR+NLENAKRDV+NAF+ +GRKLPGGN
Subjt:  IGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFS-MGRKLPGGN

Query:  MSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLST
        MSWKEKE VD+SL K KVEQLEQA PVSVDFSIS R++VL++TGPNTGGKTVCLKTIGLAAMMAKSGLHVLASES QIPWFDS+FADIGDEQSLTQSLST
Subjt:  MSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLST

Query:  FSGHLRKISAVGTFPRKKKSIL
        FSGHLRKIS + +    +  +L
Subjt:  FSGHLRKISAVGTFPRKKKSIL

SwissProt top hitse value%identityAlignment
A7GHZ0 Endonuclease MutS26.2e-2228.38Show/hide
Query:  MDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLL-SSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKLIGCI
        ++SLVR+  S        V   R+ +  + +    V GL+   S+TG    +EP+S V LN+E+++         E +L +L+ K+  +   +      +
Subjt:  MDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLL-SSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKLIGCI

Query:  IELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKE
         ELD + A+A +   +  TCP +                            NE ++ +    HPL+                                  
Subjt:  IELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKE

Query:  KEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHL
        +E V                P+SV   +      L++TGPNTGGKTV LKT+GL  +MA SGL + A E+  I +F++VFADIGDEQS+ QSLSTFS H+
Subjt:  KEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHL

Query:  RKI
        + I
Subjt:  RKI

C4ZI07 Endonuclease MutS26.6e-2426.77Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVV--DGRWCIKAEGDRLMDVKGLLL-SSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDI
        R+   +  L+ N+ + T   +  V   DGR+C+  + +   +V G++   S+TG    +EP++ V LN+EL++      +  E +L  L++KV M+   +
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVV--DGRWCIKAEGDRLMDVKGLLL-SSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDI

Query:  SKLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPG
         +    + ELD + A+A+   S+ G  P+                              +  + ++   HPL                            
Subjt:  SKLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPG

Query:  GNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSL
                              L+  + V +D  +    + LI+TGPNTGGKTV LKT+GL  +M ++GLH+ A++  ++  F+ VFADIGDEQS+ QSL
Subjt:  GNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSL

Query:  STFSGHLRKI
        STFS H+  I
Subjt:  STFSGHLRKI

P73625 Endonuclease MutS26.4e-2733.21Show/hide
Query:  SSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQ
        SSA+G    +EP + V L ++L+QAR      EE +L  L+++V     D+  L+     LD+  AR  Y    G   P  + P             GD 
Subjt:  SSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQ

Query:  TSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNT
                  E  + L+   HPLL  Q  K                    GG                           V +  +I  +IRV+ +TGPNT
Subjt:  TSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNT

Query:  GGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHL-RKISAVGTFPRKKKSILKPSL
        GGKTV LKT+GL A+MAK GL++ A E+V++PWF  + ADIGDEQSL Q+LSTFSGH+ R I  +   P   + +L P +
Subjt:  GGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHL-RKISAVGTFPRKKKSILKPSL

Q8R9D0 Endonuclease MutS21.0e-2429.67Show/hide
Query:  GRWCIKAEGDRLMDVKGLLL-SSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKLIGCIIELDVVNARASYGLSFGGTCP
        GR+ +  + +     KG++   S+TG    +EP+  V LN+EL++      K  E +LF L+++VK + E I K +  + ELD + A+A Y +    + P
Subjt:  GRWCIKAEGDRLMDVKGLLL-SSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKLIGCIIELDVVNARASYGLSFGGTCP

Query:  NLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARP
         L                   TS   +LKK         A HPL                                                  ++  + 
Subjt:  NLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARP

Query:  VSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKISAVGTFPRKKKSILKPSL
        V +D  I      L++TGPNTGGKTV LKT+GL  +MA +G+++ A E  QI  F+ VF DIGDEQS+ QSLSTFS H+  I ++     K   +L   L
Subjt:  VSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKISAVGTFPRKKKSILKPSL

Q9K8A0 Endonuclease MutS24.3e-2331.15Show/hide
Query:  SATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQT
        SA+G    +EP + V +N++L++A+A   +  E +L  L+ +V    +D+   +  + ELD + ARA YG +   T P L                    
Subjt:  SATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQT

Query:  SEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTG
                N   L ++   HPL                         +P   +                         V +D  + H    L++TGPNTG
Subjt:  SEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTG

Query:  GKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI
        GKTV LKTIGL  +MA+SGLHV A E  ++  F  VFADIGDEQS+ QSLSTFS H+  I
Subjt:  GKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKI

Arabidopsis top hitse value%identityAlignment
AT1G65070.1 DNA mismatch repair protein MutS, type 26.5e-2734.07Show/hide
Query:  GLLLS-SATGIGSILEPLSAVPLNDELQQARASVAKAEE-DVLFMLTEKVKMDFEDISKLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANV
        G++LS S++     +EP  AV LN+ ++   A+  KAEE  +L +LT +V M   +I  L+  I+ELD+  ARAS+     G  PN+             
Subjt:  GLLLS-SATGIGSILEPLSAVPLNDELQQARASVAKAEE-DVLFMLTEKVKMDFEDISKLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANV

Query:  CLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLI
              TSE  H K     + + +A HPLLL                  S+     GG++                        PV VD  +    +V++
Subjt:  CLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLI

Query:  LTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKISAV
        ++GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD + ADIGD QSL QSLSTFSGH+ +I  +
Subjt:  LTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKISAV

AT1G65070.2 DNA mismatch repair protein MutS, type 26.5e-2734.07Show/hide
Query:  GLLLS-SATGIGSILEPLSAVPLNDELQQARASVAKAEE-DVLFMLTEKVKMDFEDISKLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANV
        G++LS S++     +EP  AV LN+ ++   A+  KAEE  +L +LT +V M   +I  L+  I+ELD+  ARAS+     G  PN+             
Subjt:  GLLLS-SATGIGSILEPLSAVPLNDELQQARASVAKAEE-DVLFMLTEKVKMDFEDISKLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANV

Query:  CLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLI
              TSE  H K     + + +A HPLLL                  S+     GG++                        PV VD  +    +V++
Subjt:  CLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLI

Query:  LTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKISAV
        ++GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD + ADIGD QSL QSLSTFSGH+ +I  +
Subjt:  LTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLRKISAV

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.3e-1427.16Show/hide
Query:  PIMYLGLPLGGHPKKTAFWQPVIDEVQGKLDKWRRYNLSRGGRVTLCKSVLSNLPNYYMSTFLMPKKVAVSLERTIRNFFWEGRKGDKLNHLVKWEQTIE
        P+ YLGLPL      T+ + P++++++ ++ KW   +LS  GR+ L  SV+ +L N++MS F +P      ++    +F W G + +     V W     
Subjt:  PIMYLGLPLGGHPKKTAFWQPVIDEVQGKLDKWRRYNLSRGGRVTLCKSVLSNLPNYYMSTFLMPKKVAVSLERTIRNFFWEGRKGDKLNHLVKWEQTIE

Query:  GYQDGGLGYGCLKTRNL---------ALLAKWGWRYLKGEPSLWQQVIM-SIHGASSWDISF
           +GGLG   LK  N            L  W W+ +    +L    +   IH  S+    F
Subjt:  GYQDGGLGYGCLKTRNL---------ALLAKWGWRYLKGEPSLWQQVIM-SIHGASSWDISF

AT5G54090.1 DNA mismatch repair protein MutS, type 27.4e-7144.89Show/hide
Query:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL
        +L QL+D+++R+ K   S +    +DGRWCI+   ++L  V GLLLSS +G G+  EP++AV +ND+LQ ARASVAKAE ++L MLTEK++     I  +
Subjt:  RLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQARASVAKAEEDVLFMLTEKVKMDFEDISKL

Query:  IGCIIELDVVNARASYGLSFGGTCPNLILP--EGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGG
        +   I+LDV+NARA+Y  ++GG  P++ LP  +   S  A              L K EW+LYL   +HPLLL Q++K +   +  V+            
Subjt:  IGCIIELDVVNARASYGLSFGGTCPNLILP--EGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLENAKRDVQNAFSMGRKLPGG

Query:  NMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLS
                       KT    L  A P+  DF IS   RVL++TGPNTGGKT+CLK++GLAAMMAKSGL+VLA+ES +IPWFD+++ADIGDEQSL QSLS
Subjt:  NMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLS

Query:  TFSGHLRKISAVGTFPRKKKSIL
        TFSGHL++IS + +    +  +L
Subjt:  TFSGHLRKISAVGTFPRKKKSIL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)9.5e-1046.27Show/hide
Query:  INGRPRGRIQASSGIRQGDPLSPFIFLLISEVLSCLISRLHWKEKFEGFGVGKENIHILIFQFADDT
        ING P+G +  S G+RQGDPLSP++F+L +EVLS L  R   + +  G  V   +  I    FADDT
Subjt:  INGRPRGRIQASSGIRQGDPLSPFIFLLISEVLSCLISRLHWKEKFEGFGVGKENIHILIFQFADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAACAAACTCTTACCGCGAATTTGAAGCTTTGATCTTGGAATTTTATAAGAGCCTTTACACAAAAACACCAAGTGCAGGTCACTTCTCGAACAACCTTGACTGGCA
AACAGTTTCTGCAGCTCAAAACAGGTGGCTGGTTTCAAGTTTCACCTTAGAAGAAATTAGAAAAGCAGTAAAAGGAATGGGGAAAAATAAAGCACCCGGTCCAGATGGCT
TTACAGTGGAATTTCTCATCAAATTTTGGGAGAAGATCAAAGAAAATTTTATGACTCTATTCAATGAATTTTATGATAATGGAAAACTAAATTCTTGCGTGAAGCAAAAC
TTTATTTGTTTAATCAAGAAGGAAGATGCAGTCATGGTAAAAGGCTTCCGGCCCATTAGTCTTACAATATTAGTATATAAAGTAATCCCCAAAGTCTTTGCTGAAAGATT
GAAAAAGATCATGCCAAGCATCATAGCCCCTACCCAAAGTGCATTCATAGAAGGGCGCCAAATCCTAGACCCCATTCTTATTGCAAATGAAATTGTTCTTCAAGGAAAAA
ACTTTGATTCGAAATGGATATCATGGATCATGGGTTGCATCAAAAACCCAAAATTTTCAATATTCATCAATGGAAGGCCAAGAGGAAGGATCCAAGCTTCAAGCGGAATA
AGGCAAGGTGACCCTCTCTCACCTTTCATTTTTCTCCTAATAAGTGAAGTTCTAAGTTGTCTTATTTCGAGACTTCACTGGAAAGAGAAATTTGAAGGATTCGGAGTTGG
AAAAGAAAATATTCATATTCTGATATTCCAATTTGCGGACGACACACTATTATTTTGTAAACATGATGACGAGATGTTGGACAACTTGCGAAAAACTATTGAACTCTTCA
AATGGTGCTCCGGTCAAAAAATCAATTGGGAGAAATCAGCCATATGTGGGCTAAATATTGATGAATCAGAGGTCTATTCAGTTGCTGCCAGATTAAATTGCAAAGCTGAG
AAACTCCCAATGATGCATCTTGGTCTACCCTTAGGGGGTACCCAAAGAAGGAGTCTTTTTGGCAGCCCATCCTTGACAAAATCCAAGGAAATACAATCTGTCAAGGGGAG
AACGGGCTTTTGGATCGATCCATGGCTTGATAATTTAACTCTAAATTCAAGATTCCCAAGACTTTTCAAGTTGGCACTCAAACCTAATGGTTCAGTGGCTGATCATTGGG
ACTTTAGAACCTCTTCTTGGGATCTAACTTTCAGAAGGTTGTTGAAAGAGGAAGAGATGGGTGATTTTCAGAACCTTTTATGTCTTGTCGCAAACAAAAAAGTGGTTGAT
CAGCCGGATAAAAGAGTATGGGCCTTGGAAGCTAATGGAATTTTCTCCACAAAATCGCTAATTAAACACCTCTCTTTGGCCTCCCCAATTGACCAAGAGCTGAAAAAGAA
CCTCTGGAAGTCCAAAAGTCCTAGGAGAGTGAATATATTGATTTGGCTAATGATCTTTGGATCACTGAACTGTGCTGCCACTTTACAAAGAAAGCTTCCCTCACATTGCT
TGTCTCCAGATATGTGCCCATTATGCCTACGGAATCAGGAAGAATTACAACATCTGTTCTTTGATTGTAGTTATGCCTTAAACTGCTGGTCTCGGCTTTTTGGCATCTTC
AATATTAGTTGGGTTGTTGAAAGAGATTTCAGCAGCAATCTACTACAAGTCTTGATTGGTCCAACCTTGCGAAAGAAGCCGAAGCTACTATGGATTAATGTGGTCAAAGC
ACTTTTATCAGAGTTATGGTTTGAAAGAAATCAGCGTGTTTTTAATAACATAGCCTCCTCGTGTACTGGCCGAACGACTAAAGAAAATCATGCCAAGTATCCTGTCCCCA
CTCAAAGCACCTTCATAGAAGAAAGACAAATCCTTGATCCCATTCTCATAGCCAATAAAGTAGTGGAAGACTATCGAGCCAAAAAGAAAAAGTGCTCGGGTCAGAAGGTT
AATTGGGAGAAATCGGCCATATATGGAGTCAATATTGATGAAGGAAAGGTGCTTTCTGTTGCAAATATACTAAGCTGTAAAGTAGAGGTTTTTCCTATCATGTACCTTGG
CTTGCCCTTGGGTGGTCACCCTAAAAAGACAGCATTTTGGCAGCCGGTAATAGATGAAGTACAAGGCAAATTGGATAAATGGAGGAGGTACAATTTATCAAGAGGAGGAA
GGGTCACGTTATGTAAATCAGTATTATCCAATCTTCCGAATTACTACATGTCCACATTCCTAATGCCTAAAAAGGTGGCTGTTTCTCTAGAGAGAACTATCAGAAACTTC
TTTTGGGAAGGGCGTAAGGGAGATAAATTAAATCATTTAGTCAAATGGGAGCAAACAATCGAAGGATATCAAGATGGGGGCCTCGGATATGGCTGTCTAAAAACCAGAAA
CTTAGCTCTTTTAGCAAAATGGGGCTGGCGTTATTTGAAAGGTGAACCCTCACTTTGGCAGCAAGTGATTATGAGCATACACGGAGCAAGCTCTTGGGACATTTCCTTCC
GTAGACTTCTAAAGGAGGAAGAGGCTGAAGATTTTCAAGCTCTCATGGGTATTCTACATGACAAGAGAACAACCTCTTTTCAGGACAAAAGAGTATGGTCTTTAGAACCA
AATGGAATTTTCTTGGTGAAGTCTCTTGTCAAGCACCATTCACCGGCTTCACCAATCGACAAACACTTGGAGAAAGCACTTTGGAAATCAAAAAGCCCTCGAAGAGTCAA
TCAATATCATGGTTTGGATCATGATTTTTGGAAGTTTAAACTGCTCATCCATACTACAAAGGAAGCTACAATCATCTTTGCAAAGAAGTGCTGGCAGCGGCTGTTTCTTT
TTTTCAACCTGTCTTGGGTTTTTGGGAATGATTTCAGGGATAACACAATCCAACTTCTAACTGGTCCAGCCCTCCGAAAGAAGCCCAGATTGCTATGGCTCACTGGTAGA
GGAATAGATCTTTGTTCGGAGGGAGGAGATCAAGGGTGCGACAGGTGGTCAACACCAAGTGTATTCCACAAAAATGATTTGTGGTTGTGCTCTATCTGGCTTTGCAACTG
GGCCTTGATAAGGTTATGCCAGTTAATGGACAGCCTAGTTAGGAACGCAAAGAGTGGAACTTCCTTTTTGGAAGTGGAAGTTGTTGATGGAAGGTGGTGTATAAAAGCAG
AGGGTGATCGATTAATGGATGTTAAGGGTCTCCTGTTGTCCAGTGCTACAGGGATTGGTTCCATCTTAGAGCCACTCTCTGCCGTTCCTTTAAATGATGAGTTGCAGCAA
GCAAGGGCTTCAGTGGCAAAAGCTGAGGAAGATGTTCTCTTTATGCTAACTGAAAAAGTGAAAATGGATTTTGAAGATATTAGCAAACTCATTGGCTGCATAATTGAATT
AGATGTGGTCAATGCTCGAGCATCTTATGGTCTTTCATTTGGGGGGACGTGTCCCAATTTAATCCTACCAGAAGGCTGCAACTCTTCTATTGCTAATGTCTGCTTATCAG
GGGACCAAACATCCGAGGCATCGCACTTGAAGAAGAATGAATGGGTCCTCTATTTACAAAATGCCCATCACCCTTTACTACTTCAGCAATATAGAAAAAATTTGGAAAAT
GCCAAGAGGGATGTCCAAAATGCTTTTTCTATGGGGAGAAAACTTCCCGGGGGGAATATGTCATGGAAAGAAAAAGAAGCTGTAGATCTTTCATTATTAAAAACGAAGGT
TGAACAATTGGAGCAAGCTCGTCCTGTTTCGGTTGATTTTTCAATATCTCACAGAATCCGAGTTTTAATTCTAACGGGCCCTAATACTGGGGGTAAGACTGTTTGTTTGA
AGACCATTGGATTGGCAGCCATGATGGCGAAATCAGGGCTTCATGTTTTAGCTTCAGAGTCTGTACAAATCCCTTGGTTTGATTCTGTTTTCGCTGATATCGGAGATGAA
CAGTCCCTTACCCAATCTCTGTCCACTTTTTCTGGCCATTTGAGAAAAATAAGTGCTGTTGGAACCTTTCCCAGAAAAAAGAAATCAATCCTTAAACCTAGTTTAATGGA
TTTCCCCCCTGATTTATTGGAAGTTTACTGCACCCAAGCCTTAGTCCCATCTCTTTCCTCGGAGCCAGCTCATCATTCCTCCCCTTCTCACCCCTTCAAGTATTCAATAT
TCCACATCCCCAATCACAATACCAAATTTATTCGAGGCTCTAGGCAATCCTCCCCAATTCGTCATCAGGAAAAGGATTACGATTCAGACATTGATTCAATAGTAAGTGCT
AGTAGTGAAGAGTTAGAAAACCTTGAGGAAGAAAATATCCAAGTTTTCTTAGATCAAGTTGATAATTTCGCGGAGGAGCTTAATTCTTTATTCCAGGCTAACGAAAGAAA
TCAATCAGAGGAAGAAGTTTGGAATTCTCATTTTAAGCCACTGCCTCAATCGGAGATTCCGACACATTTAAAGTCTATAATAGCAGAATGTGGACTCGTTCTGGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCAACAAACTCTTACCGCGAATTTGAAGCTTTGATCTTGGAATTTTATAAGAGCCTTTACACAAAAACACCAAGTGCAGGTCACTTCTCGAACAACCTTGACTGGCA
AACAGTTTCTGCAGCTCAAAACAGGTGGCTGGTTTCAAGTTTCACCTTAGAAGAAATTAGAAAAGCAGTAAAAGGAATGGGGAAAAATAAAGCACCCGGTCCAGATGGCT
TTACAGTGGAATTTCTCATCAAATTTTGGGAGAAGATCAAAGAAAATTTTATGACTCTATTCAATGAATTTTATGATAATGGAAAACTAAATTCTTGCGTGAAGCAAAAC
TTTATTTGTTTAATCAAGAAGGAAGATGCAGTCATGGTAAAAGGCTTCCGGCCCATTAGTCTTACAATATTAGTATATAAAGTAATCCCCAAAGTCTTTGCTGAAAGATT
GAAAAAGATCATGCCAAGCATCATAGCCCCTACCCAAAGTGCATTCATAGAAGGGCGCCAAATCCTAGACCCCATTCTTATTGCAAATGAAATTGTTCTTCAAGGAAAAA
ACTTTGATTCGAAATGGATATCATGGATCATGGGTTGCATCAAAAACCCAAAATTTTCAATATTCATCAATGGAAGGCCAAGAGGAAGGATCCAAGCTTCAAGCGGAATA
AGGCAAGGTGACCCTCTCTCACCTTTCATTTTTCTCCTAATAAGTGAAGTTCTAAGTTGTCTTATTTCGAGACTTCACTGGAAAGAGAAATTTGAAGGATTCGGAGTTGG
AAAAGAAAATATTCATATTCTGATATTCCAATTTGCGGACGACACACTATTATTTTGTAAACATGATGACGAGATGTTGGACAACTTGCGAAAAACTATTGAACTCTTCA
AATGGTGCTCCGGTCAAAAAATCAATTGGGAGAAATCAGCCATATGTGGGCTAAATATTGATGAATCAGAGGTCTATTCAGTTGCTGCCAGATTAAATTGCAAAGCTGAG
AAACTCCCAATGATGCATCTTGGTCTACCCTTAGGGGGTACCCAAAGAAGGAGTCTTTTTGGCAGCCCATCCTTGACAAAATCCAAGGAAATACAATCTGTCAAGGGGAG
AACGGGCTTTTGGATCGATCCATGGCTTGATAATTTAACTCTAAATTCAAGATTCCCAAGACTTTTCAAGTTGGCACTCAAACCTAATGGTTCAGTGGCTGATCATTGGG
ACTTTAGAACCTCTTCTTGGGATCTAACTTTCAGAAGGTTGTTGAAAGAGGAAGAGATGGGTGATTTTCAGAACCTTTTATGTCTTGTCGCAAACAAAAAAGTGGTTGAT
CAGCCGGATAAAAGAGTATGGGCCTTGGAAGCTAATGGAATTTTCTCCACAAAATCGCTAATTAAACACCTCTCTTTGGCCTCCCCAATTGACCAAGAGCTGAAAAAGAA
CCTCTGGAAGTCCAAAAGTCCTAGGAGAGTGAATATATTGATTTGGCTAATGATCTTTGGATCACTGAACTGTGCTGCCACTTTACAAAGAAAGCTTCCCTCACATTGCT
TGTCTCCAGATATGTGCCCATTATGCCTACGGAATCAGGAAGAATTACAACATCTGTTCTTTGATTGTAGTTATGCCTTAAACTGCTGGTCTCGGCTTTTTGGCATCTTC
AATATTAGTTGGGTTGTTGAAAGAGATTTCAGCAGCAATCTACTACAAGTCTTGATTGGTCCAACCTTGCGAAAGAAGCCGAAGCTACTATGGATTAATGTGGTCAAAGC
ACTTTTATCAGAGTTATGGTTTGAAAGAAATCAGCGTGTTTTTAATAACATAGCCTCCTCGTGTACTGGCCGAACGACTAAAGAAAATCATGCCAAGTATCCTGTCCCCA
CTCAAAGCACCTTCATAGAAGAAAGACAAATCCTTGATCCCATTCTCATAGCCAATAAAGTAGTGGAAGACTATCGAGCCAAAAAGAAAAAGTGCTCGGGTCAGAAGGTT
AATTGGGAGAAATCGGCCATATATGGAGTCAATATTGATGAAGGAAAGGTGCTTTCTGTTGCAAATATACTAAGCTGTAAAGTAGAGGTTTTTCCTATCATGTACCTTGG
CTTGCCCTTGGGTGGTCACCCTAAAAAGACAGCATTTTGGCAGCCGGTAATAGATGAAGTACAAGGCAAATTGGATAAATGGAGGAGGTACAATTTATCAAGAGGAGGAA
GGGTCACGTTATGTAAATCAGTATTATCCAATCTTCCGAATTACTACATGTCCACATTCCTAATGCCTAAAAAGGTGGCTGTTTCTCTAGAGAGAACTATCAGAAACTTC
TTTTGGGAAGGGCGTAAGGGAGATAAATTAAATCATTTAGTCAAATGGGAGCAAACAATCGAAGGATATCAAGATGGGGGCCTCGGATATGGCTGTCTAAAAACCAGAAA
CTTAGCTCTTTTAGCAAAATGGGGCTGGCGTTATTTGAAAGGTGAACCCTCACTTTGGCAGCAAGTGATTATGAGCATACACGGAGCAAGCTCTTGGGACATTTCCTTCC
GTAGACTTCTAAAGGAGGAAGAGGCTGAAGATTTTCAAGCTCTCATGGGTATTCTACATGACAAGAGAACAACCTCTTTTCAGGACAAAAGAGTATGGTCTTTAGAACCA
AATGGAATTTTCTTGGTGAAGTCTCTTGTCAAGCACCATTCACCGGCTTCACCAATCGACAAACACTTGGAGAAAGCACTTTGGAAATCAAAAAGCCCTCGAAGAGTCAA
TCAATATCATGGTTTGGATCATGATTTTTGGAAGTTTAAACTGCTCATCCATACTACAAAGGAAGCTACAATCATCTTTGCAAAGAAGTGCTGGCAGCGGCTGTTTCTTT
TTTTCAACCTGTCTTGGGTTTTTGGGAATGATTTCAGGGATAACACAATCCAACTTCTAACTGGTCCAGCCCTCCGAAAGAAGCCCAGATTGCTATGGCTCACTGGTAGA
GGAATAGATCTTTGTTCGGAGGGAGGAGATCAAGGGTGCGACAGGTGGTCAACACCAAGTGTATTCCACAAAAATGATTTGTGGTTGTGCTCTATCTGGCTTTGCAACTG
GGCCTTGATAAGGTTATGCCAGTTAATGGACAGCCTAGTTAGGAACGCAAAGAGTGGAACTTCCTTTTTGGAAGTGGAAGTTGTTGATGGAAGGTGGTGTATAAAAGCAG
AGGGTGATCGATTAATGGATGTTAAGGGTCTCCTGTTGTCCAGTGCTACAGGGATTGGTTCCATCTTAGAGCCACTCTCTGCCGTTCCTTTAAATGATGAGTTGCAGCAA
GCAAGGGCTTCAGTGGCAAAAGCTGAGGAAGATGTTCTCTTTATGCTAACTGAAAAAGTGAAAATGGATTTTGAAGATATTAGCAAACTCATTGGCTGCATAATTGAATT
AGATGTGGTCAATGCTCGAGCATCTTATGGTCTTTCATTTGGGGGGACGTGTCCCAATTTAATCCTACCAGAAGGCTGCAACTCTTCTATTGCTAATGTCTGCTTATCAG
GGGACCAAACATCCGAGGCATCGCACTTGAAGAAGAATGAATGGGTCCTCTATTTACAAAATGCCCATCACCCTTTACTACTTCAGCAATATAGAAAAAATTTGGAAAAT
GCCAAGAGGGATGTCCAAAATGCTTTTTCTATGGGGAGAAAACTTCCCGGGGGGAATATGTCATGGAAAGAAAAAGAAGCTGTAGATCTTTCATTATTAAAAACGAAGGT
TGAACAATTGGAGCAAGCTCGTCCTGTTTCGGTTGATTTTTCAATATCTCACAGAATCCGAGTTTTAATTCTAACGGGCCCTAATACTGGGGGTAAGACTGTTTGTTTGA
AGACCATTGGATTGGCAGCCATGATGGCGAAATCAGGGCTTCATGTTTTAGCTTCAGAGTCTGTACAAATCCCTTGGTTTGATTCTGTTTTCGCTGATATCGGAGATGAA
CAGTCCCTTACCCAATCTCTGTCCACTTTTTCTGGCCATTTGAGAAAAATAAGTGCTGTTGGAACCTTTCCCAGAAAAAAGAAATCAATCCTTAAACCTAGTTTAATGGA
TTTCCCCCCTGATTTATTGGAAGTTTACTGCACCCAAGCCTTAGTCCCATCTCTTTCCTCGGAGCCAGCTCATCATTCCTCCCCTTCTCACCCCTTCAAGTATTCAATAT
TCCACATCCCCAATCACAATACCAAATTTATTCGAGGCTCTAGGCAATCCTCCCCAATTCGTCATCAGGAAAAGGATTACGATTCAGACATTGATTCAATAGTAAGTGCT
AGTAGTGAAGAGTTAGAAAACCTTGAGGAAGAAAATATCCAAGTTTTCTTAGATCAAGTTGATAATTTCGCGGAGGAGCTTAATTCTTTATTCCAGGCTAACGAAAGAAA
TCAATCAGAGGAAGAAGTTTGGAATTCTCATTTTAAGCCACTGCCTCAATCGGAGATTCCGACACATTTAAAGTCTATAATAGCAGAATGTGGACTCGTTCTGGGTTAA
Protein sequenceShow/hide protein sequence
MPTNSYREFEALILEFYKSLYTKTPSAGHFSNNLDWQTVSAAQNRWLVSSFTLEEIRKAVKGMGKNKAPGPDGFTVEFLIKFWEKIKENFMTLFNEFYDNGKLNSCVKQN
FICLIKKEDAVMVKGFRPISLTILVYKVIPKVFAERLKKIMPSIIAPTQSAFIEGRQILDPILIANEIVLQGKNFDSKWISWIMGCIKNPKFSIFINGRPRGRIQASSGI
RQGDPLSPFIFLLISEVLSCLISRLHWKEKFEGFGVGKENIHILIFQFADDTLLFCKHDDEMLDNLRKTIELFKWCSGQKINWEKSAICGLNIDESEVYSVAARLNCKAE
KLPMMHLGLPLGGTQRRSLFGSPSLTKSKEIQSVKGRTGFWIDPWLDNLTLNSRFPRLFKLALKPNGSVADHWDFRTSSWDLTFRRLLKEEEMGDFQNLLCLVANKKVVD
QPDKRVWALEANGIFSTKSLIKHLSLASPIDQELKKNLWKSKSPRRVNILIWLMIFGSLNCAATLQRKLPSHCLSPDMCPLCLRNQEELQHLFFDCSYALNCWSRLFGIF
NISWVVERDFSSNLLQVLIGPTLRKKPKLLWINVVKALLSELWFERNQRVFNNIASSCTGRTTKENHAKYPVPTQSTFIEERQILDPILIANKVVEDYRAKKKKCSGQKV
NWEKSAIYGVNIDEGKVLSVANILSCKVEVFPIMYLGLPLGGHPKKTAFWQPVIDEVQGKLDKWRRYNLSRGGRVTLCKSVLSNLPNYYMSTFLMPKKVAVSLERTIRNF
FWEGRKGDKLNHLVKWEQTIEGYQDGGLGYGCLKTRNLALLAKWGWRYLKGEPSLWQQVIMSIHGASSWDISFRRLLKEEEAEDFQALMGILHDKRTTSFQDKRVWSLEP
NGIFLVKSLVKHHSPASPIDKHLEKALWKSKSPRRVNQYHGLDHDFWKFKLLIHTTKEATIIFAKKCWQRLFLFFNLSWVFGNDFRDNTIQLLTGPALRKKPRLLWLTGR
GIDLCSEGGDQGCDRWSTPSVFHKNDLWLCSIWLCNWALIRLCQLMDSLVRNAKSGTSFLEVEVVDGRWCIKAEGDRLMDVKGLLLSSATGIGSILEPLSAVPLNDELQQ
ARASVAKAEEDVLFMLTEKVKMDFEDISKLIGCIIELDVVNARASYGLSFGGTCPNLILPEGCNSSIANVCLSGDQTSEASHLKKNEWVLYLQNAHHPLLLQQYRKNLEN
AKRDVQNAFSMGRKLPGGNMSWKEKEAVDLSLLKTKVEQLEQARPVSVDFSISHRIRVLILTGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDE
QSLTQSLSTFSGHLRKISAVGTFPRKKKSILKPSLMDFPPDLLEVYCTQALVPSLSSEPAHHSSPSHPFKYSIFHIPNHNTKFIRGSRQSSPIRHQEKDYDSDIDSIVSA
SSEELENLEEENIQVFLDQVDNFAEELNSLFQANERNQSEEEVWNSHFKPLPQSEIPTHLKSIIAECGLVLG