; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009709 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009709
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionendonuclease MutS2
Genome locationscaffold411:57163..74013
RNA-Seq ExpressionMS009709
SyntenyMS009709
Gene Ontology termsGO:0006298 - mismatch repair (biological process)
GO:0045910 - negative regulation of DNA recombination (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0030983 - mismatched DNA binding (molecular function)
InterPro domainsIPR000432 - DNA mismatch repair protein MutS, C-terminal
IPR007696 - DNA mismatch repair protein MutS, core
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR036187 - DNA mismatch repair protein MutS, core domain superfamily
IPR045076 - DNA mismatch repair MutS family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141291.1 uncharacterized protein LOC111011726 isoform X1 [Momordica charantia]1.9e-30099.82Show/hide
Query:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
        SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
Subjt:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY

Query:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
        EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
Subjt:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI

Query:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
        KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
Subjt:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND

Query:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
        ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
Subjt:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH

Query:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
        PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSIS RIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
Subjt:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG

Query:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
        LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
Subjt:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS

XP_022141292.1 uncharacterized protein LOC111011726 isoform X2 [Momordica charantia]1.3e-29899.63Show/hide
Query:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
        SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
Subjt:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY

Query:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
        EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
Subjt:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI

Query:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
        KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
Subjt:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND

Query:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
        ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
Subjt:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH

Query:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
        PLLLQQYRENLKNAKRDVRNAFD IGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSIS RIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
Subjt:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG

Query:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
        LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
Subjt:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS

XP_022141293.1 uncharacterized protein LOC111011726 isoform X3 [Momordica charantia]4.7e-29698.89Show/hide
Query:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
        SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
Subjt:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY

Query:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
        EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
Subjt:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI

Query:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
        KLILKVVDEDGS     SYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
Subjt:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND

Query:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
        ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
Subjt:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH

Query:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
        PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSIS RIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
Subjt:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG

Query:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
        LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
Subjt:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS

XP_022141294.1 uncharacterized protein LOC111011726 isoform X4 [Momordica charantia]6.4e-26991.88Show/hide
Query:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
        SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIK          
Subjt:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY

Query:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
                                         VKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
Subjt:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI

Query:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
        KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
Subjt:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND

Query:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
        ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
Subjt:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH

Query:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
        PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSIS RIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
Subjt:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG

Query:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
        LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
Subjt:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS

XP_023552491.1 uncharacterized protein LOC111810138 [Cucurbita pepo subsp. pepo]3.6e-24384.35Show/hide
Query:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
        SAAVFG  LT I SA LPV N++S RFQNR V   + FSLSA  SV NDI  DRN+HSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
Subjt:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY

Query:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
        EESLRLL+ETNAAVEM KHGGCSLDLSG+DL LVKSA+EHAQRSLPMDGNEA A+AALLQF+DMLQFNLKTAI+EDADW TRFMPLT+V+MGMVVNQSLI
Subjt:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI

Query:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
        KLIL VVDEDGSVKDSAS ALRQSRDQVR LEKKL QLMDSLVRNAKSGTSF+EVG VDGRWCIKSEG QLMD KGLLLSSAA GIGTILEPLSAVPLND
Subjt:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND

Query:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIAN-NCSGDSISEASCLKKSEWVLYLPNAL
        ELQQARA+VAKAEEDVLFML+EKVKMD EDI KLI CII+LD+VNARASYGLS GG CP++ILP GC S IAN   SGD ISEAS  K+++WVLYLPNA 
Subjt:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIAN-NCSGDSISEASCLKKSEWVLYLPNAL

Query:  HPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKS
        HPLL QQYRE+L+NAKRDVRNA  EIGRKLPGGN S KEK+  DIS LKMKVE+LEQA  V VDF+IS RIRVLVITGPNTGGKTVCLKTIGLAAMMAKS
Subjt:  HPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKS

Query:  GLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
        GLHVLASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHL+KIS
Subjt:  GLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS

TrEMBL top hitse value%identityAlignment
A0A6J1CI69 uncharacterized protein LOC111011726 isoform X43.1e-26991.88Show/hide
Query:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
        SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIK          
Subjt:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY

Query:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
                                         VKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
Subjt:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI

Query:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
        KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
Subjt:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND

Query:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
        ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
Subjt:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH

Query:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
        PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSIS RIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
Subjt:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG

Query:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
        LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
Subjt:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS

A0A6J1CIQ4 uncharacterized protein LOC111011726 isoform X19.0e-30199.82Show/hide
Query:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
        SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
Subjt:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY

Query:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
        EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
Subjt:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI

Query:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
        KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
Subjt:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND

Query:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
        ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
Subjt:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH

Query:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
        PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSIS RIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
Subjt:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG

Query:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
        LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
Subjt:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS

A0A6J1CJG8 uncharacterized protein LOC111011726 isoform X26.4e-29999.63Show/hide
Query:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
        SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
Subjt:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY

Query:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
        EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
Subjt:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI

Query:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
        KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
Subjt:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND

Query:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
        ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
Subjt:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH

Query:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
        PLLLQQYRENLKNAKRDVRNAFD IGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSIS RIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
Subjt:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG

Query:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
        LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
Subjt:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS

A0A6J1CK29 uncharacterized protein LOC111011726 isoform X32.3e-29698.89Show/hide
Query:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
        SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
Subjt:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY

Query:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
        EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
Subjt:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI

Query:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
        KLILKVVDEDGS     SYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
Subjt:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND

Query:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
        ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH
Subjt:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALH

Query:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
        PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSIS RIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG
Subjt:  PLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSG

Query:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
        LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
Subjt:  LHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS

A0A6J1E4M8 uncharacterized protein LOC111430703 isoform X44.2e-24283.79Show/hide
Query:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
        SAAVFG  LT I SA LPV +++S RFQNR V   + FSLSA  SV NDI  DRN+HSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY
Subjt:  SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTY

Query:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI
        EESLRLL+ETNAAVEM KHGGCSLDLSG+DL LVKSA+EHAQRSLPMDGNEA A+AALLQF+DMLQFNLKTAI+EDADW TRFMPLT+V+MGMVVNQSLI
Subjt:  EESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLI

Query:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND
        KLIL VVDEDGSVKDSAS ALRQSRDQVR LEKKL QLMDSLVRNAKSGTSF+EVG VDGRWCIKSEG QLMD KGLLLSSAA GIGT+LEPLSAVPLND
Subjt:  KLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLND

Query:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIAN-NCSGDSISEASCLKKSEWVLYLPNAL
        ELQQARA+VAKAEEDVLFML+EKVKMD EDI KLI CII+LD+VNARASYGLS GG CP++ILP GC S IAN   SGD IS+AS  K+++WVLYLPNA 
Subjt:  ELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIAN-NCSGDSISEASCLKKSEWVLYLPNAL

Query:  HPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKS
        HPLL QQYRE+L+NAKRDVRNA  EIGRKLPGGN S KEK   DIS LKMKVE+LEQA  V VDF+IS RIRVLVITGPNTGGKTVCLKTIGLAAMMAKS
Subjt:  HPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKS

Query:  GLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS
        GLHVLASESVQIPWFDSV ADIGDEQSLTQSLSTFSGHL+KIS
Subjt:  GLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKIS

SwissProt top hitse value%identityAlignment
A0RJF2 Endonuclease MutS24.6e-3628.63Show/hide
Query:  SLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFS
        +LR LE++K+ + +     +SLGR  +K  + S +  +EE + + + T+ A ++ +  G S  L GI    ++S ++ A+    +  NE   +A  +  S
Subjt:  SLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFS

Query:  DMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVN-QSLIKLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVR--NAKSGTSFLEVGNVD
             N+K  IE+  D     +P+ E  +  +V+   L K I   + + G V DSAS  LR  R Q+RT E ++ + ++++ R  NA+   S   V   +
Subjt:  DMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVN-QSLIKLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVR--NAKSGTSFLEVGNVD

Query:  GRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCP
         R+ I  +      + G++   +A G    +EP   V LN+ LQ+AR    +  E +L ML+E+V ++ + +   ++ +  LD + A+A Y   I  T  
Subjt:  GRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCP

Query:  DIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHL
                  PI NN               E  + L  A HPL+                                     D +I              +
Subjt:  DIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHL

Query:  VPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI
        VP +  +      +VITGPNTGGKTV LKT+G+  +MA+SGLH+   +  +I  F ++FADIGDEQS+ QSLSTFS H+  I
Subjt:  VPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI

B7HF67 Endonuclease MutS21.6e-3628.84Show/hide
Query:  SLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFS
        +LR LE++K+ + +     +SLGR  +K  + S +  +EE + + + T+ A ++ +  G S  L GI    ++S ++ A+    +  NE   +A  +  S
Subjt:  SLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFS

Query:  DMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVN-QSLIKLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVR--NAKSGTSFLEVGNVD
             N+K  IE+ AD     +P+ E  +  +V+   L K I   + + G V DSAS  LR  R Q+RT E ++ + ++++ R  NA+   S   V   +
Subjt:  DMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVN-QSLIKLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVR--NAKSGTSFLEVGNVD

Query:  GRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCP
         R+ I  +      + G++   +A G    +EP   V LN+ LQ+AR    +  E +L ML+E+V ++ + +   ++ +  LD + A+A Y   I  T  
Subjt:  GRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCP

Query:  DIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHL
                  PI NN               E  + L  A HPL+                                     D +I              +
Subjt:  DIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHL

Query:  VPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI
        VP +  +      +VITGPNTGGKTV LKT+G+  +MA+SGLH+   +  +I  F ++FADIGDEQS+ QSLSTFS H+  I
Subjt:  VPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI

B7IJV1 Endonuclease MutS24.6e-3628.63Show/hide
Query:  SLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFS
        +LR LE++K+ + +     +SLGR  +K  + S +  +EE + + + T+ A ++ +  G S  L GI    ++S ++ A+    +  NE   +A  +  S
Subjt:  SLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFS

Query:  DMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVN-QSLIKLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVR--NAKSGTSFLEVGNVD
             N+K  IE+  D     +P+ E  +  +V+   L K I   + + G V DSAS  LR  R Q+RT E ++ + ++++ R  NA+   S   V   +
Subjt:  DMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVN-QSLIKLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVR--NAKSGTSFLEVGNVD

Query:  GRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCP
         R+ I  +      + G++   +A G    +EP   V LN+ LQ+AR    +  E +L ML+E+V ++ + +   ++ +  LD + A+A Y   I  T  
Subjt:  GRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCP

Query:  DIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHL
                  PI NN               E  + L  A HPL+                                     D +I              +
Subjt:  DIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHL

Query:  VPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI
        VP +  +      +VITGPNTGGKTV LKT+G+  +MA+SGLH+   +  +I  F ++FADIGDEQS+ QSLSTFS H+  I
Subjt:  VPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI

C1ETZ0 Endonuclease MutS24.6e-3628.63Show/hide
Query:  SLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFS
        +LR LE++K+ + +     +SLGR  +K  + S +  +EE + + + T+ A ++ +  G S  L GI    ++S ++ A+    +  NE   +A  +  S
Subjt:  SLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFS

Query:  DMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVN-QSLIKLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVR--NAKSGTSFLEVGNVD
             N+K  IE+  D     +P+ E  +  +V+   L K I   + + G V DSAS  LR  R Q+RT E ++ + ++++ R  NA+   S   V   +
Subjt:  DMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVN-QSLIKLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVR--NAKSGTSFLEVGNVD

Query:  GRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCP
         R+ I  +      + G++   +A G    +EP   V LN+ LQ+AR    +  E +L ML+E+V ++ + +   ++ +  LD + A+A Y   I  T  
Subjt:  GRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCP

Query:  DIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHL
                  PI NN               E  + L  A HPL+                                     D +I              +
Subjt:  DIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHL

Query:  VPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI
        VP +  +      +VITGPNTGGKTV LKT+G+  +MA+SGLH+   +  +I  F ++FADIGDEQS+ QSLSTFS H+  I
Subjt:  VPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI

P73625 Endonuclease MutS28.2e-4129.04Show/hide
Query:  DRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEA
        D    +I  ++L  LEW +LC  +++F +T LG  AI A+       +EES  LL +T A   ++     +    GI    +   +   +R   + G E 
Subjt:  DRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEA

Query:  RAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLIKLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSF
         A+A  L         L+  IEE  D       L  +V  +     L + I   + EDG V + AS  L + R +++ + +++ Q +  +++   +    
Subjt:  RAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLIKLILKVVDEDGSVKDSASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSF

Query:  LEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGL
          +     R+ +  +        G++  S+A G    +EP + V L ++L+QAR      EE +L  LS++V   L D+E L+    +LD+  AR  Y  
Subjt:  LEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDVLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGL

Query:  SIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVE
         +G   P  + P G + PI                       L    HPLL  Q  +                     GG                    
Subjt:  SIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVE

Query:  ELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI
              +VP+  +I  +IRV+ ITGPNTGGKTV LKT+GL A+MAK GL++ A E+V++PWF  + ADIGDEQSL Q+LSTFSGH+ +I
Subjt:  ELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI

Arabidopsis top hitse value%identityAlignment
AT1G65070.1 DNA mismatch repair protein MutS, type 22.3e-3829.47Show/hide
Query:  PVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETN---AAVEMQKHGGCSLDLS
        P SLR+   L    + S+D        S+   +L  LEW  LC+ ++ FA T++G  A K     +  + EES  LLNET+   AA+EM K  G  L   
Subjt:  PVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETN---AAVEMQKHGGCSLDLS

Query:  GIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLIKLILKVVDEDGS-VKDSASYALRQSRD
          ++  +   +E A     +   E   V + L  +      L+ A   D     R  PL +++ G     +L + I   +D + + + D AS    +  +
Subjt:  GIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLIKLILKVVDEDGS-VKDSASYALRQSRD

Query:  QVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVD--------GRWC--IKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEE-D
         +R+  ++  + +DSL++  K  T     G ++         R C  I++    L+   G++LS ++      +EP  AV LN+ ++   A+  KAEE  
Subjt:  QVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVD--------GRWC--IKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEE-D

Query:  VLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAK
        +L +L+ +V M   +I  L+D I++LD+  ARAS+   I G  P+ +  E  K+P      G ++              + +A HPLLL           
Subjt:  VLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAK

Query:  RDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFD
                  G  L   N                          VPVD  +    +V+VI+GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD
Subjt:  RDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFD

Query:  SVFADIGDEQSLTQSLSTFSGHLKKI
         + ADIGD QSL QSLSTFSGH+ +I
Subjt:  SVFADIGDEQSLTQSLSTFSGHLKKI

AT1G65070.2 DNA mismatch repair protein MutS, type 22.3e-3829.47Show/hide
Query:  PVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETN---AAVEMQKHGGCSLDLS
        P SLR+   L    + S+D        S+   +L  LEW  LC+ ++ FA T++G  A K     +  + EES  LLNET+   AA+EM K  G  L   
Subjt:  PVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETN---AAVEMQKHGGCSLDLS

Query:  GIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLIKLILKVVDEDGS-VKDSASYALRQSRD
          ++  +   +E A     +   E   V + L  +      L+ A   D     R  PL +++ G     +L + I   +D + + + D AS    +  +
Subjt:  GIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLIKLILKVVDEDGS-VKDSASYALRQSRD

Query:  QVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVD--------GRWC--IKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEE-D
         +R+  ++  + +DSL++  K  T     G ++         R C  I++    L+   G++LS ++      +EP  AV LN+ ++   A+  KAEE  
Subjt:  QVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVD--------GRWC--IKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEE-D

Query:  VLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAK
        +L +L+ +V M   +I  L+D I++LD+  ARAS+   I G  P+ +  E  K+P      G ++              + +A HPLLL           
Subjt:  VLFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAK

Query:  RDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFD
                  G  L   N                          VPVD  +    +V+VI+GPNTGGKT  LKT+GL ++M+KSG+++ A    ++PWFD
Subjt:  RDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFD

Query:  SVFADIGDEQSLTQSLSTFSGHLKKI
         + ADIGD QSL QSLSTFSGH+ +I
Subjt:  SVFADIGDEQSLTQSLSTFSGHLKKI

AT3G24320.1 MUTL protein homolog 14.8e-0438.57Show/hide
Query:  VLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI
        + ++TGPN GGK+  L++I  AA++  SGL V A ES  IP FDS+   +    S     S+F   + +I
Subjt:  VLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKI

AT3G24495.1 MUTS homolog 76.3e-0438.57Show/hide
Query:  SISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTF
        SI P  R L++TGPN GGK+  L+   LA + A+ G +V   ES +I   D++F  +G    +    STF
Subjt:  SISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTF

AT5G54090.1 DNA mismatch repair protein MutS, type 21.5e-13549.72Show/hide
Query:  SVRFQNRPVSLRLRFSLSATNSVSNDITDD-------RNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEM
        S+ F N  + +R R S+   N V   +           +K     DSLR LEWDKLCD VASFARTSLGR+A K +LWSL++++ ESL+LL+ET+AA++M
Subjt:  SVRFQNRPVSLRLRFSLSATNSVSNDITDD-------RNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNETNAAVEM

Query:  QKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLIKLILKVVDEDGSVKDS
         +HG   LDLS I + LV+S I HA+R L +  ++A  VA+LL+F + LQ +LK AI++D DW+ RFMPL+E+++  V+N+S +KL+ +V+D DG++KDS
Subjt:  QKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLIKLILKVVDEDGSVKDS

Query:  ASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDV
        AS ALRQSR++V+TLE+KL QL+D+++R+ K   S +    +DGRWCI+   NQL    GLLLSS +GG GT  EP++AV +ND+LQ ARASVAKAE ++
Subjt:  ASYALRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDV

Query:  LFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILP--EGCKSPIANNCSGD-SISEASCLKKSEWVLYLPNALHPLLLQQYRENLKN
        L ML+EK++  L  IE ++   I+LD++NARA+Y  + GG  PDI LP  +  +S  A   S D ++     L K EW+LYLP   HPLLL Q+++ ++ 
Subjt:  LFMLSEKVKMDLEDIEKLIDCIIKLDMVNARASYGLSIGGTCPDIILP--EGCKSPIANNCSGD-SISEASCLKKSEWVLYLPNALHPLLLQQYRENLKN

Query:  AKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPW
         +  V+                          F K     L  A  +P DF IS   RVLVITGPNTGGKT+CLK++GLAAMMAKSGL+VLA+ES +IPW
Subjt:  AKRDVRNAFDEIGRKLPGGNTSRKEKKDVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPW

Query:  FDSVFADIGDEQSLTQSLSTFSGHLKKIS
        FD+++ADIGDEQSL QSLSTFSGHLK+IS
Subjt:  FDSVFADIGDEQSLTQSLSTFSGHLKKIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCTGCTGCTGTTTTCGGCGATCCCCTGACCTCGATTATCTCTGCCATACTGCCGGTTAAAAACATCACTTCTGTTAGATTCCAGAATCGGCCCGTATCCCTACGCCTACG
ATTCTCCCTCTCTGCAACCAACTCCGTCAGCAATGACATTACAGACGACAGAAACAAACACTCGATTCACCTTGATAGCCTCAGAGCGCTGGAATGGGATAAACTCTGCG
ATTCCGTTGCTTCATTTGCCCGCACTTCTCTGGGCCGTCAAGCAATCAAGGCCCAACTTTGGTCTTTGAACCGGACATATGAAGAAAGTTTGAGACTTCTGAATGAGACT
AATGCGGCTGTAGAAATGCAGAAGCATGGTGGATGCAGTTTGGATTTAAGTGGCATCGACCTTCTTCTGGTGAAATCTGCAATAGAACATGCTCAAAGAAGTCTGCCAAT
GGATGGAAATGAAGCAAGGGCTGTTGCAGCTCTTCTACAGTTTTCTGATATGTTGCAATTTAATTTAAAAACTGCAATCGAAGAGGATGCAGACTGGTTCACCCGTTTTA
TGCCCCTAACGGAAGTGGTAATGGGAATGGTTGTAAATCAATCACTGATTAAACTGATACTGAAAGTTGTAGATGAAGATGGCTCAGTTAAAGATTCTGCGAGTTATGCC
TTGAGACAATCCCGAGATCAAGTTCGAACGCTTGAAAAAAAGTTATATCAGTTAATGGACAGCCTAGTTAGGAATGCAAAGAGTGGAACATCCTTTTTGGAAGTGGGAAA
TGTTGATGGAAGGTGGTGTATAAAATCAGAGGGTAATCAATTGATGGACTTTAAGGGTCTCCTGCTGTCCAGTGCTGCAGGAGGGATTGGTACCATCCTAGAGCCACTCT
CTGCTGTTCCTTTAAACGATGAGTTGCAACAGGCAAGGGCATCAGTGGCAAAAGCTGAGGAAGATGTTCTCTTTATGCTAAGTGAAAAAGTGAAAATGGATCTTGAAGAC
ATTGAGAAACTCATTGACTGTATAATCAAATTAGATATGGTCAATGCGCGAGCATCTTATGGTCTTTCAATTGGGGGGACATGTCCCGATATAATTCTACCAGAAGGGTG
CAAATCTCCCATTGCTAATAACTGCTCGGGGGACTCAATATCTGAGGCATCATGCCTAAAGAAGAGCGAATGGGTGCTCTATTTACCTAATGCTCTTCACCCTTTACTAC
TCCAGCAATATAGAGAAAATTTGAAGAATGCCAAAAGGGATGTCAGAAATGCTTTTGATGAGATAGGGAGAAAACTTCCTGGGGGGAATACGTCAAGGAAAGAAAAAAAA
GATGTAGATATTTCATTCTTAAAAATGAAGGTTGAGGAATTGGAGCAAGCTCATCTAGTTCCGGTTGATTTTTCAATATCTCCAAGAATTCGAGTTTTGGTTATAACTGG
TCCTAATACTGGGGGTAAGACAGTTTGCCTGAAGACCATTGGATTGGCTGCCATGATGGCGAAATCAGGGCTTCATGTTTTGGCTTCAGAATCTGTACAAATCCCTTGGT
TTGATTCTGTTTTTGCTGATATCGGTGATGAACAGTCTCTAACCCAATCTTTGTCTACCTTTTCTGGCCATTTGAAAAAAATAAGTGTA
mRNA sequenceShow/hide mRNA sequence
TCTGCTGCTGTTTTCGGCGATCCCCTGACCTCGATTATCTCTGCCATACTGCCGGTTAAAAACATCACTTCTGTTAGATTCCAGAATCGGCCCGTATCCCTACGCCTACG
ATTCTCCCTCTCTGCAACCAACTCCGTCAGCAATGACATTACAGACGACAGAAACAAACACTCGATTCACCTTGATAGCCTCAGAGCGCTGGAATGGGATAAACTCTGCG
ATTCCGTTGCTTCATTTGCCCGCACTTCTCTGGGCCGTCAAGCAATCAAGGCCCAACTTTGGTCTTTGAACCGGACATATGAAGAAAGTTTGAGACTTCTGAATGAGACT
AATGCGGCTGTAGAAATGCAGAAGCATGGTGGATGCAGTTTGGATTTAAGTGGCATCGACCTTCTTCTGGTGAAATCTGCAATAGAACATGCTCAAAGAAGTCTGCCAAT
GGATGGAAATGAAGCAAGGGCTGTTGCAGCTCTTCTACAGTTTTCTGATATGTTGCAATTTAATTTAAAAACTGCAATCGAAGAGGATGCAGACTGGTTCACCCGTTTTA
TGCCCCTAACGGAAGTGGTAATGGGAATGGTTGTAAATCAATCACTGATTAAACTGATACTGAAAGTTGTAGATGAAGATGGCTCAGTTAAAGATTCTGCGAGTTATGCC
TTGAGACAATCCCGAGATCAAGTTCGAACGCTTGAAAAAAAGTTATATCAGTTAATGGACAGCCTAGTTAGGAATGCAAAGAGTGGAACATCCTTTTTGGAAGTGGGAAA
TGTTGATGGAAGGTGGTGTATAAAATCAGAGGGTAATCAATTGATGGACTTTAAGGGTCTCCTGCTGTCCAGTGCTGCAGGAGGGATTGGTACCATCCTAGAGCCACTCT
CTGCTGTTCCTTTAAACGATGAGTTGCAACAGGCAAGGGCATCAGTGGCAAAAGCTGAGGAAGATGTTCTCTTTATGCTAAGTGAAAAAGTGAAAATGGATCTTGAAGAC
ATTGAGAAACTCATTGACTGTATAATCAAATTAGATATGGTCAATGCGCGAGCATCTTATGGTCTTTCAATTGGGGGGACATGTCCCGATATAATTCTACCAGAAGGGTG
CAAATCTCCCATTGCTAATAACTGCTCGGGGGACTCAATATCTGAGGCATCATGCCTAAAGAAGAGCGAATGGGTGCTCTATTTACCTAATGCTCTTCACCCTTTACTAC
TCCAGCAATATAGAGAAAATTTGAAGAATGCCAAAAGGGATGTCAGAAATGCTTTTGATGAGATAGGGAGAAAACTTCCTGGGGGGAATACGTCAAGGAAAGAAAAAAAA
GATGTAGATATTTCATTCTTAAAAATGAAGGTTGAGGAATTGGAGCAAGCTCATCTAGTTCCGGTTGATTTTTCAATATCTCCAAGAATTCGAGTTTTGGTTATAACTGG
TCCTAATACTGGGGGTAAGACAGTTTGCCTGAAGACCATTGGATTGGCTGCCATGATGGCGAAATCAGGGCTTCATGTTTTGGCTTCAGAATCTGTACAAATCCCTTGGT
TTGATTCTGTTTTTGCTGATATCGGTGATGAACAGTCTCTAACCCAATCTTTGTCTACCTTTTCTGGCCATTTGAAAAAAATAAGTGTA
Protein sequenceShow/hide protein sequence
SAAVFGDPLTSIISAILPVKNITSVRFQNRPVSLRLRFSLSATNSVSNDITDDRNKHSIHLDSLRALEWDKLCDSVASFARTSLGRQAIKAQLWSLNRTYEESLRLLNET
NAAVEMQKHGGCSLDLSGIDLLLVKSAIEHAQRSLPMDGNEARAVAALLQFSDMLQFNLKTAIEEDADWFTRFMPLTEVVMGMVVNQSLIKLILKVVDEDGSVKDSASYA
LRQSRDQVRTLEKKLYQLMDSLVRNAKSGTSFLEVGNVDGRWCIKSEGNQLMDFKGLLLSSAAGGIGTILEPLSAVPLNDELQQARASVAKAEEDVLFMLSEKVKMDLED
IEKLIDCIIKLDMVNARASYGLSIGGTCPDIILPEGCKSPIANNCSGDSISEASCLKKSEWVLYLPNALHPLLLQQYRENLKNAKRDVRNAFDEIGRKLPGGNTSRKEKK
DVDISFLKMKVEELEQAHLVPVDFSISPRIRVLVITGPNTGGKTVCLKTIGLAAMMAKSGLHVLASESVQIPWFDSVFADIGDEQSLTQSLSTFSGHLKKISV