; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0000604 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0000604
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionAdenine DNA glycosylase
Genome locationchr11:31675530..31680167
RNA-Seq ExpressionPay0000604
SyntenyPay0000604
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0000701 - purine-specific mismatch base pair DNA N-glycosylase activity (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
InterPro domainsIPR000445 - Helix-hairpin-helix motif
IPR003265 - HhH-GPD domain
IPR004036 - Endonuclease III-like, conserved site-2
IPR005760 - A/G-specific adenine glycosylase MutY
IPR011257 - DNA glycosylase
IPR015797 - NUDIX hydrolase-like domain superfamily
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
IPR029119 - MutY, C-terminal
IPR044298 - Adenine/Thymine-DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039872.1 adenine DNA glycosylase isoform X1 [Cucumis melo var. makuwa]4.7e-25293.66Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGE------------
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGE            
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGE------------

Query:  -------------VVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS
                     VVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS
Subjt:  -------------VVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS

Query:  VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNRED
        VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD +TRRESIDSLLSKNFGLEPKKNFEIVNRED
Subjt:  VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNRED

Query:  VGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS
        VGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLP KKQKS
Subjt:  VGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS

KGN46403.2 hypothetical protein Csa_005328 [Cucumis sativus]1.3e-24996.12Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNEN+E +KK TDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFP+TVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIK KQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGT ELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD STRRESI+SLLSKNFGLE KKNFEIVNREDVGDFIH+FTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS
        GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSS+  LP KKQKS
Subjt:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS

XP_004140565.2 adenine DNA glycosylase isoform X1 [Cucumis sativus]1.3e-24996.12Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNEN+E +KK TDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFP+TVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIK KQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGT ELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD STRRESI+SLLSKNFGLE KKNFEIVNREDVGDFIH+FTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS
        GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSS+  LP KKQKS
Subjt:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS

XP_008459934.1 PREDICTED: adenine DNA glycosylase isoform X1 [Cucumis melo]8.0e-260100Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS
        GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS
Subjt:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS

XP_031743605.1 adenine DNA glycosylase isoform X2 [Cucumis sativus]1.2e-24795.91Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNEN+E +KK TDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASL EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFP+TVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIK KQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGT ELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD STRRESI+SLLSKNFGLE KKNFEIVNREDVGDFIH+FTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS
        GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSS+  LP KKQKS
Subjt:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS

TrEMBL top hitse value%identityAlignment
A0A0A0KC27 Adenine DNA glycosylase6.2e-25096.12Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNEN+E +KK TDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFP+TVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIK KQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGT ELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD STRRESI+SLLSKNFGLE KKNFEIVNREDVGDFIH+FTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS
        GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSS+  LP KKQKS
Subjt:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS

A0A1S3CBT2 Adenine DNA glycosylase3.9e-260100Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS
        GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS
Subjt:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS

A0A1S4E2J7 Adenine DNA glycosylase9.2e-246100Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK
        GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK
Subjt:  GEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK

A0A5A7T8X3 Adenine DNA glycosylase2.3e-25293.66Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGE------------
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGE            
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGE------------

Query:  -------------VVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS
                     VVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS
Subjt:  -------------VVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS

Query:  VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNRED
        VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD +TRRESIDSLLSKNFGLEPKKNFEIVNRED
Subjt:  VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNRED

Query:  VGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS
        VGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLP KKQKS
Subjt:  VGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSSRVLPIKKQKS

E5GB45 Adenine DNA glycosylase1.1e-225100Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  G
        G
Subjt:  G

SwissProt top hitse value%identityAlignment
F4JRF4 Adenine DNA glycosylase2.3e-12955.12Show/hide
Query:  DGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ
        + E+ E  E  + + D    ++ + + +      +++E      DIED +FS +  Q IR  LLDWYD ++RDLPWR+   + E E RAY VWVSEIMLQ
Subjt:  DGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ

Query:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAG
        QTRVQTV+++Y RWM KWPT+  L +ASLE                   EVNEMWAGLGYYRRARFL EGAKM+V     FP   SSL K+ GIG+YTAG
Subjt:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAG

Query:  AIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVT
        AIASIAF E VPVVDGNVIRV+ARLKAIS NPKD    +  WK AAQLVD SRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+ + ++ VT
Subjt:  AIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVT

Query:  DYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSK--NFGLEPKKNFEIVNREDVG
        DYP K IK K RHD+  VCV+EI   +     G   RF+LVKRP++GLLAGLWEFPSV L+ EADS+TRR +I+  L +   F +E KK   IV+RE++G
Subjt:  DYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSK--NFGLEPKKNFEIVNREDVG

Query:  DFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK
        +F+H+FTHIR K+YVE LV+ L G    LF+ Q K ++ WKCV + V+ST+GLTS+VRK
Subjt:  DFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK

Q10159 Adenine DNA glycosylase2.6e-5635.66Show/hide
Query:  VQTIRASLLDWYDRSRRDLPWRSL------------DKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE-EVNEMWAGLGYYR
        V+  R SL+ +YD+++R LPWR              D  +P  R Y V VSEIMLQQTRV+TV ++Y +WM   PT++  + A    +V  +W+G+G+Y 
Subjt:  VQTIRASLLDWYDRSRRDLPWRSL------------DKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE-EVNEMWAGLGYYR

Query:  RARFLFEGAKMIVK-EGGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQA
        R + L +  + + K      P+T     K IPG+G YTAGA+ SIA+ +   +VDGNVIRV++R  AI  +    K    +WK A +LVD  RPGDFNQA
Subjt:  RARFLFEGAKMIVK-EGGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQA

Query:  LMELGATLCTPTNPSCSTCPVFDHCEA--------------LSISKRDSSVLVTD--------------YPAKGIKTKQRHDYSAVCVVEILESQGTSEL
        LMELGA  CTP +P CS CP+ + C+A                I     ++ +TD              YP    KTKQR + + V +      Q T   
Subjt:  LMELGATLCTPTNPSCSTCPVFDHCEA--------------LSISKRDSSVLVTD--------------YPAKGIKTKQRHDYSAVCVVEILESQGTSEL

Query:  GQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFG--LEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLV
         +   FL+ KRP  GLLAGLW+FP++    E    +  + +D+   K+    +       I   +  G ++H+F+HIR   +V + +
Subjt:  GQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFG--LEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLV

Q8R5G2 Adenine DNA glycosylase3.4e-8040.79Show/hide
Query:  IDNVQTIRASLLDWYDRSRRDLPWRSLDKGEP--ETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEG
        I +V   R +LL WYD+ +RDLPWR   K E   + RAY VWVSE+MLQQT+V TV+ +Y RWM KWPT+Q L+ ASLEEVN++W+GLGYY R R L EG
Subjt:  IDNVQTIRASLLDWYDRSRRDLPWRSLDKGEP--ETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEG

Query:  AKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATL
        A+ +V+E GG  P+T  +L++ +PG+G YTAGAIASIAF +V  VVDGNVIRV+ R++AI  +P    +   +W  A QLVD +RPGDFNQA MELGAT+
Subjt:  AKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATL

Query:  CTPTNPSCSTCPVFDHCEA--------------------------------LSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSS
        CTP  P C+ CPV   C A                                 S +  D ++ V ++P K  +   R +YSA CVVE   + G        
Subjt:  CTPTNPSCSTCPVFDHCEA--------------------------------LSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSS

Query:  RFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSI
          LLV+RP+ GLLAGLWEFPSV+L    + S + +    L        P     +   + +G+ IHVF+HI+L   V    L L+G+          + +
Subjt:  RFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSI

Query:  LWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSS--------SSRVLPIKKQK
         W+   N  +ST     +++K + + E+ + G    S        SSR  P + Q+
Subjt:  LWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSS--------SSRVLPIKKQK

Q99P21 Adenine DNA glycosylase1.9e-8339.56Show/hide
Query:  KKKPTTKRKRRSRSPSKSEAVVDIED-------------------IMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGE--PETRAYGVWVSEIMLQQT
        KK+P   ++RR+R+ S S+A     D                   +   + +V   R++LL WYD+ +RDLPWR+L K E   + RAY VWVSE+MLQQT
Subjt:  KKKPTTKRKRRSRSPSKSEAVVDIED-------------------IMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGE--PETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVI
        +V TV+ +Y RWM KWP +Q L+ ASLEEVN++W+GLGYY R R L EGA+ +V+E GG  P+T  +L++ +PG+G YTAGAIASIAF +V  VVDGNV+
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVI

Query:  RVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA--------------------------------
        RV+ R++AI  +P    +   +W  A QLVD +RPGDFNQA MELGAT+CTP  P CS CPV   C A                                
Subjt:  RVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA--------------------------------

Query:  LSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKK
         S S  D S+ V ++P K  +   R +YSA CVVE   + G          LLV+RPD GLLAGLWEFPSV+L  E     + +++   L +  G  P  
Subjt:  LSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKK

Query:  NFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAG-KTSSSSSRVLPIKKQK
            +  + +G+ IH+F+HI+L   V  L L    +          + + W+   N  +ST     +++K + M E  + G +  S  S+V P   +K
Subjt:  NFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAG-KTSSSSSRVLPIKKQK

Q9UIF7 Adenine DNA glycosylase5.8e-8042.82Show/hide
Query:  EAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPW--RSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGL
        +A V    +   +  V   R SLL WYD+ +RDLPW  R+ D+ + + RAY VWVSE+MLQQT+V TV+ +Y  WM KWPT+Q L+ ASLEEVN++WAGL
Subjt:  EAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPW--RSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGL

Query:  GYYRRARFLFEGAKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGD
        GYY R R L EGA+ +V+E GG  P+T  +L++ +PG+G YTAGAIASIAFG+   VVDGNV RV+ R++AI  +P    + +Q+W  A QLVD +RPGD
Subjt:  GYYRRARFLFEGAKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGD

Query:  FNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKR-----------------------------------DSSVLVTDYPAKGIKTKQRHDYSAVCVV
        FNQA MELGAT+CTP  P CS CPV   C A    ++                                   D ++ V ++P K  +   R + SA CV+
Subjt:  FNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKR-----------------------------------DSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVL
        E   + G       ++ LLV+RP+ GLLAGLWEFPSV+   E     +R+++   L +  G  P  +        +G+ +H F+HI+L   V  L L
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVL

Arabidopsis top hitse value%identityAlignment
AT1G05900.2 endonuclease III 22.9e-1025.89Show/hide
Query:  PETRAYGVWVSEIMLQQTRVQ----TVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYY-RRARFLFEGAKMIVKE-GGRFPKTVSSLRKIPGIGEY
        P+ R + V +  ++  QT+       V + +   +L   T + + +A    + E+   +G+Y R+A  + + AK+ + E  G  P+T+  L  +PG+G  
Subjt:  PETRAYGVWVSEIMLQQTRVQ----TVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYY-RRARFLFEGAKMIVKE-GGRFPKTVSSLRKIPGIGEY

Query:  TAGAIASIAFGEVVPV-VDGNVIRVIARL--------KAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHC-EA
         A  +  +A+ +V  + VD +V R+  RL        K  + +P++ ++  Q W    + V +      N  L+  G T+CTP  P C TC + + C  A
Subjt:  TAGAIASIAFGEVVPV-VDGNVIRVIARL--------KAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHC-EA

Query:  LSISKRDSSVLVTDYPAKGIKTKQ
           +   SS L      K IK+K+
Subjt:  LSISKRDSSVLVTDYPAKGIKTKQ

AT4G12740.1 HhH-GPD base excision DNA repair family protein1.7e-13055.12Show/hide
Query:  DGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ
        + E+ E  E  + + D    ++ + + +      +++E      DIED +FS +  Q IR  LLDWYD ++RDLPWR+   + E E RAY VWVSEIMLQ
Subjt:  DGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ

Query:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAG
        QTRVQTV+++Y RWM KWPT+  L +ASLE                   EVNEMWAGLGYYRRARFL EGAKM+V     FP   SSL K+ GIG+YTAG
Subjt:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAG

Query:  AIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVT
        AIASIAF E VPVVDGNVIRV+ARLKAIS NPKD    +  WK AAQLVD SRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+ + ++ VT
Subjt:  AIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVT

Query:  DYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSK--NFGLEPKKNFEIVNREDVG
        DYP K IK K RHD+  VCV+EI   +     G   RF+LVKRP++GLLAGLWEFPSV L+ EADS+TRR +I+  L +   F +E KK   IV+RE++G
Subjt:  DYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADSSTRRESIDSLLSK--NFGLEPKKNFEIVNREDVG

Query:  DFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK
        +F+H+FTHIR K+YVE LV+ L G    LF+ Q K ++ WKCV + V+ST+GLTS+VRK
Subjt:  DFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGACGGAGAAAAGAATGAGAACGAGGAGAATGTGAAGAAAAAGACTGACTTTCGTCGGAAAAAGAAACCCACGACGAAACGGAAACGCCGGAGCCGAAGTCCGTC
TAAAAGTGAAGCAGTTGTTGACATTGAAGATATTATGTTCAGCATAGACAATGTTCAGACAATCAGGGCCTCGCTATTGGATTGGTACGACCGTAGCCGCAGGGACCTTC
CATGGAGGAGCTTGGACAAAGGGGAACCTGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAGACCAGAGTTCAGACCGTCGTCCAATTTTACAAC
CGTTGGATGCTTAAATGGCCTACCGTTCAACATCTCTCTCGTGCTTCTCTTGAGGAGGTGAATGAAATGTGGGCAGGCTTGGGGTATTATAGACGAGCTCGTTTTCTTTT
TGAGGGTGCAAAGATGATAGTCAAAGAAGGCGGTAGATTTCCTAAAACAGTTTCTTCCCTTCGAAAAATTCCAGGAATTGGAGAATACACAGCAGGGGCTATTGCCTCTA
TAGCGTTCGGTGAAGTGGTGCCTGTAGTCGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCTATTTCAGGAAACCCAAAAGACCCGAAGTTGATCAAGCAAGTT
TGGAAGGCAGCTGCTCAATTAGTTGATCTTTCCAGGCCTGGGGATTTCAATCAGGCACTCATGGAACTTGGTGCAACTTTATGCACTCCAACAAACCCAAGCTGCTCAAC
GTGCCCCGTGTTTGATCACTGTGAGGCCCTTTCAATCTCAAAGCGTGATAGTTCAGTTCTTGTCACAGATTATCCCGCTAAGGGGATAAAGACCAAACAAAGACATGATT
ATTCTGCTGTATGTGTGGTTGAGATATTGGAAAGTCAGGGTACATCTGAGTTAGGGCAATCTAGTAGATTTCTTCTTGTCAAGAGGCCTGATGAAGGTTTGCTTGCTGGT
CTATGGGAGTTTCCATCTGTCTCGTTGGATGGAGAAGCTGATTCAAGCACAAGGAGAGAATCCATCGATAGCCTATTGAGTAAAAACTTTGGACTTGAACCAAAAAAGAA
TTTTGAAATAGTCAATAGAGAAGATGTTGGAGATTTTATCCATGTTTTCACGCACATCCGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTTTAAAAGGTGAAGGTA
GCAAGTTGTTTCGGAAACAGGAGAAGAAATCTATATTATGGAAATGTGTAGAGAACAAGGTTATGTCAACGATGGGCTTGACGTCCAGTGTGAGGAAGGCCTATGCCATG
GTCGAGAAATTTCAGGCAGGGAAGACATCTTCTAGTTCTAGCCGTGTACTACCCATAAAAAAACAGAAATCTTGA
mRNA sequenceShow/hide mRNA sequence
TTTTCTTCTAACCTTTCTGCATTTATTATGGACCAAGTCCTCGTACATCGCTACATCTTCCATCCATAGTCGTTTCTCATTTCTTCTTGCGCCTTCTCTTTGGTCAGCAA
CTTTCTCTTCATCTACAATTTTTCTGCTTTGCCAAATTTGAAGTTGTGGGCGATTACTGACGAGTAATCGGAGTAGTGGGTCGGGCTGTTGCAGTATGAGCGACGGAGAA
AAGAATGAGAACGAGGAGAATGTGAAGAAAAAGACTGACTTTCGTCGGAAAAAGAAACCCACGACGAAACGGAAACGCCGGAGCCGAAGTCCGTCTAAAAGTGAAGCAGT
TGTTGACATTGAAGATATTATGTTCAGCATAGACAATGTTCAGACAATCAGGGCCTCGCTATTGGATTGGTACGACCGTAGCCGCAGGGACCTTCCATGGAGGAGCTTGG
ACAAAGGGGAACCTGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAGACCAGAGTTCAGACCGTCGTCCAATTTTACAACCGTTGGATGCTTAAA
TGGCCTACCGTTCAACATCTCTCTCGTGCTTCTCTTGAGGAGGTGAATGAAATGTGGGCAGGCTTGGGGTATTATAGACGAGCTCGTTTTCTTTTTGAGGGTGCAAAGAT
GATAGTCAAAGAAGGCGGTAGATTTCCTAAAACAGTTTCTTCCCTTCGAAAAATTCCAGGAATTGGAGAATACACAGCAGGGGCTATTGCCTCTATAGCGTTCGGTGAAG
TGGTGCCTGTAGTCGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCTATTTCAGGAAACCCAAAAGACCCGAAGTTGATCAAGCAAGTTTGGAAGGCAGCTGCT
CAATTAGTTGATCTTTCCAGGCCTGGGGATTTCAATCAGGCACTCATGGAACTTGGTGCAACTTTATGCACTCCAACAAACCCAAGCTGCTCAACGTGCCCCGTGTTTGA
TCACTGTGAGGCCCTTTCAATCTCAAAGCGTGATAGTTCAGTTCTTGTCACAGATTATCCCGCTAAGGGGATAAAGACCAAACAAAGACATGATTATTCTGCTGTATGTG
TGGTTGAGATATTGGAAAGTCAGGGTACATCTGAGTTAGGGCAATCTAGTAGATTTCTTCTTGTCAAGAGGCCTGATGAAGGTTTGCTTGCTGGTCTATGGGAGTTTCCA
TCTGTCTCGTTGGATGGAGAAGCTGATTCAAGCACAAGGAGAGAATCCATCGATAGCCTATTGAGTAAAAACTTTGGACTTGAACCAAAAAAGAATTTTGAAATAGTCAA
TAGAGAAGATGTTGGAGATTTTATCCATGTTTTCACGCACATCCGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTTTAAAAGGTGAAGGTAGCAAGTTGTTTCGGA
AACAGGAGAAGAAATCTATATTATGGAAATGTGTAGAGAACAAGGTTATGTCAACGATGGGCTTGACGTCCAGTGTGAGGAAGGCCTATGCCATGGTCGAGAAATTTCAG
GCAGGGAAGACATCTTCTAGTTCTAGCCGTGTACTACCCATAAAAAAACAGAAATCTTGAACTGCAGGAGCTCTTGACATTTACACAAGTTTATTTGATGCCTTTTCCAT
TCATAGATTGTTTTTTGGGGGAACAACAACTATAGATTCGAGGATCAAACCTTTGGTCTGTAGAGAGGAAGTCCGGAGATTATCCCCATGCGAGTGGAAAAAGACAGAAG
GAAGAAGGGTATCAGAAGCAAATGTCTTTTATAGTCATTCAACAGAAAAACCCTACATGTATGAGAAAATTACAAATTCTCCACTGAACATGTAAGGATATTAAATTTTA
TATAGTTATTCAACTTTTATTTTCTCTATGGGGTTTGAGATTAAACCTCCGACCTCTAGGAAAAGGAGTTATTCAAATATCTTTGAGCCAAGTTTACTTTGGTCATCTTT
AAACTTATTTTCCTTAAAGATTGCAATTTACCTTTGTACCATTAAACCAAAGGATGTACCGAT
Protein sequenceShow/hide protein sequence
MSDGEKNENEENVKKKTDFRRKKKPTTKRKRRSRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYN
RWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQV
WKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAG
LWEFPSVSLDGEADSSTRRESIDSLLSKNFGLEPKKNFEIVNREDVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAM
VEKFQAGKTSSSSSRVLPIKKQKS