CuGenDBv2

CaUC06G107300 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC06G107300
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Adenine DNA glycosylase
Genome location: Ciama_Chr06:855588..864809
RNA-Seq Expression: CaUC06G107300
Synteny: CaUC06G107300
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0016020 - membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0019104 - DNA N-glycosylase activity (molecular function)
GO:0051536 - iron-sulfur cluster binding (molecular function)
InterPro domains:
IPR000445 - Helix-hairpin-helix motif
IPR003265 - HhH-GPD domain
IPR004036 - Endonuclease III-like, conserved site-2
IPR011257 - DNA glycosylase
IPR015797 - NUDIX hydrolase-like domain superfamily
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
IPR029119 - MutY, C-terminal
IPR044298 - Adenine/Thymine-DNA glycosylase
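The genome location string can be split into its sequence name and coordinates with a few lines of Python. This is a minimal sketch: `Location` and `parse_location` are hypothetical helper names, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as the one shown on this page."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("Ciama_Chr06:855588..864809")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 9222 bp on Ciama_Chr06.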


Homology
GenBank top hits (e value, % identity, alignment)
KGN46403.2 hypothetical protein Csa_005328 [Cucumis sativus] (e value: 5.1e-212; identity: 74.12%)
Query:  SGSGCCSMSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVS
        SGSGCCSMS GEKNEN+E++K+NTDFRR+KKPT ERKRRGRSPSK EAVVDIEDIMFSIDNVQ IRASLL+WYDRS RDLPWRSLDKG+PETRAYGVWVS
Subjt:  SGSGCCSMSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVS

Query:  EIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIM
        EIMLQQTRVQTVV FYNRWML+WPTVQHLSRASLEEVNEMWAGLGYYRRARFL E                                             
Subjt:  EIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIM

Query:  LHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDG
                                                             GAKMIVKEGGRFP+TVS+LRKIPGIGEYTAGAIASIAF EVVPVVDG
Subjt:  LHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDG

Query:  NVIRVIARLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYS
        NVIRVIARLKAI GNPKDPKL KQ   AAAQLVD SRPGDFNQALMELGATLC+PTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIK KQRHDYS
Subjt:  NVIRVIARLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYS

Query:  AVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHL
        AV VVEILE+QGT +L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEADLSTRRESINSLLSK FGLE KKNFEIV RE+VGDFIH+FTHIRLKIYVEHL
Subjt:  AVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHL

Query:  VLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK
        VLCLKGEGSKLF+KQEKKSILWKCV+++VMS+MGLTSSVRK
Subjt:  VLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK

XP_031743608.1 adenine DNA glycosylase isoform X5 [Cucumis sativus] (e value: 4.5e-208; identity: 73.56%)
Query:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT
        MS GEKNEN+E++K+NTDFRR+KKPT ERKRRGRSPSK EAVVDIEDIMFSIDNVQ IRASLL+WYDRS RDLPWRSLDKG+PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT

Query:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH
        RVQTVV FYNRWML+WPTVQHLSRASLEEVNEMWAGLGYYRRARFL E                                                    
Subjt:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH

Query:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIA
                                                      GAKMIVKEGGRFP+TVS+LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRVIA
Subjt:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIA

Query:  RLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEI
        RLKAI GNPKDPKL KQ   AAAQLVD SRPGDFNQALMELGATLC+PTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIK KQRHDYSAV VVEI
Subjt:  RLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEI

Query:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGE
        LE+QGT +L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEADLSTRRESINSLLSK FGLE KKNFEIV RE+VGDFIH+FTHIRLKIYVEHLVLCLKGE
Subjt:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGE

Query:  GSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRKVST
        GSKLF+KQEKKSILWKCV+++VMS+MGLTSSVRKV +
Subjt:  GSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRKVST

XP_038874808.1 adenine DNA glycosylase isoform X1 [Benincasa hispida] (e value: 1.1e-214; identity: 74.54%)
Query:  GSGSGCCSMSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWV
        GSGSGCCSMSGG KNE EE+ KQ T FRR+KKPTKERKRRG SPSKREAVVDIEDIMFS+DNVQIIRASLLEWYDRS RDLPWR LDKG+PETRAYGVWV
Subjt:  GSGSGCCSMSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWV

Query:  SEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKI
        SEIMLQQTRVQTVVH+YNRWM++WPTVQHLSRASLEEVNEMWAGLGYYRRAR+LLE                                            
Subjt:  SEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKI

Query:  MLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVD
                                                              GAKMIVKEGGRFPKTVS LRKI GIGEYTAGAIASIAFDEVVPVVD
Subjt:  MLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVD

Query:  GNVIRVIARLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDY
        GNVIRVIARLKAI GNPKDPKL KQ   AAAQLVDPSRPGDFNQALMELGATLC+PTNPSCSTCPVFDHCEALSISKHD+S+LVTDYPAKGIKTKQRHDY
Subjt:  GNVIRVIARLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDY

Query:  SAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEH
        SAV VVEIL+NQGTS+LEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADL+TRRESINS LSK FGLE KKNFEIVIRE+VGDFIHVFTHIRLKIYVEH
Subjt:  SAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEH

Query:  LVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK
        LVLCLKGEGS+LF KQEKKSILW+CVD++ MSSMGLTSSVRK
Subjt:  LVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK

XP_038874809.1 adenine DNA glycosylase isoform X2 [Benincasa hispida] (e value: 2.0e-208; identity: 73.1%)
Query:  GSGSGCCSMSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWV
        GSGSGCCSMSGG KNE EE+ KQ T FRR+KKPTKERKRRG SPSKREAVVDIEDIMFS+DNVQIIRASLLEWYDRS RDLPWR LDKG+PETRAYGVWV
Subjt:  GSGSGCCSMSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWV

Query:  SEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKI
        SEIMLQQTRVQTVVH+YNRWM++WPTVQHLSRASLEEVNEMWAGLGYYRRAR+LLE                                            
Subjt:  SEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKI

Query:  MLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVD
                                                              GAKMIVKEGGRFPKTVS LRKI GIGEYTAGAIASIAFDEVVPVVD
Subjt:  MLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVD

Query:  GNVIRVIARLKAILGNPKDPKLNKQAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAV
        GNVIRVIARLKAI GNPKDPKL KQ          PGDFNQALMELGATLC+PTNPSCSTCPVFDHCEALSISKHD+S+LVTDYPAKGIKTKQRHDYSAV
Subjt:  GNVIRVIARLKAILGNPKDPKLNKQAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAV

Query:  SVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVL
         VVEIL+NQGTS+LEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADL+TRRESINS LSK FGLE KKNFEIVIRE+VGDFIHVFTHIRLKIYVEHLVL
Subjt:  SVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVL

Query:  CLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK
        CLKGEGS+LF KQEKKSILW+CVD++ MSSMGLTSSVRK
Subjt:  CLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK

XP_038874810.1 adenine DNA glycosylase isoform X3 [Benincasa hispida] (e value: 1.1e-209; identity: 74.16%)
Query:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT
        MSGG KNE EE+ KQ T FRR+KKPTKERKRRG SPSKREAVVDIEDIMFS+DNVQIIRASLLEWYDRS RDLPWR LDKG+PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT

Query:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH
        RVQTVVH+YNRWM++WPTVQHLSRASLEEVNEMWAGLGYYRRAR+LLE                                                    
Subjt:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH

Query:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIA
                                                      GAKMIVKEGGRFPKTVS LRKI GIGEYTAGAIASIAFDEVVPVVDGNVIRVIA
Subjt:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIA

Query:  RLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEI
        RLKAI GNPKDPKL KQ   AAAQLVDPSRPGDFNQALMELGATLC+PTNPSCSTCPVFDHCEALSISKHD+S+LVTDYPAKGIKTKQRHDYSAV VVEI
Subjt:  RLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEI

Query:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGE
        L+NQGTS+LEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADL+TRRESINS LSK FGLE KKNFEIVIRE+VGDFIHVFTHIRLKIYVEHLVLCLKGE
Subjt:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGE

Query:  GSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK
        GS+LF KQEKKSILW+CVD++ MSSMGLTSSVRK
Subjt:  GSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KC27 Adenine DNA glycosylase (e value: 2.5e-212; identity: 74.12%)
Query:  SGSGCCSMSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVS
        SGSGCCSMS GEKNEN+E++K+NTDFRR+KKPT ERKRRGRSPSK EAVVDIEDIMFSIDNVQ IRASLL+WYDRS RDLPWRSLDKG+PETRAYGVWVS
Subjt:  SGSGCCSMSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVS

Query:  EIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIM
        EIMLQQTRVQTVV FYNRWML+WPTVQHLSRASLEEVNEMWAGLGYYRRARFL E                                             
Subjt:  EIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIM

Query:  LHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDG
                                                             GAKMIVKEGGRFP+TVS+LRKIPGIGEYTAGAIASIAF EVVPVVDG
Subjt:  LHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDG

Query:  NVIRVIARLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYS
        NVIRVIARLKAI GNPKDPKL KQ   AAAQLVD SRPGDFNQALMELGATLC+PTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIK KQRHDYS
Subjt:  NVIRVIARLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYS

Query:  AVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHL
        AV VVEILE+QGT +L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEADLSTRRESINSLLSK FGLE KKNFEIV RE+VGDFIH+FTHIRLKIYVEHL
Subjt:  AVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHL

Query:  VLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK
        VLCLKGEGSKLF+KQEKKSILWKCV+++VMS+MGLTSSVRK
Subjt:  VLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK

A0A1S3CBT2 Adenine DNA glycosylase (e value: 1.6e-206; identity: 73.97%)
Query:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT
        MS GEKNENEE VK+ TDFRR+KKPT +RKRR RSPSK EAVVDIEDIMFSIDNVQ IRASLL+WYDRS RDLPWRSLDKG+PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT

Query:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH
        RVQTVV FYNRWML+WPTVQHLSRASLEEVNEMWAGLGYYRRARFL E                                                    
Subjt:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH

Query:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIA
                                                      GAKMIVKEGGRFPKTVS+LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRVIA
Subjt:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIA

Query:  RLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEI
        RLKAI GNPKDPKL KQ   AAAQLVD SRPGDFNQALMELGATLC+PTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAV VVEI
Subjt:  RLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEI

Query:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGE
        LE+QGTS+L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEAD STRRESI+SLLSK FGLEPKKNFEIV RE+VGDFIHVFTHIRLKIYVEHLVLCLKGE
Subjt:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGE

Query:  GSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK
        GSKLF+KQEKKSILWKCV+++VMS+MGLTSSVRK
Subjt:  GSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK

A0A1S4E2J7 Adenine DNA glycosylase (e value: 1.2e-206; identity: 73.83%)
Query:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT
        MS GEKNENEE VK+ TDFRR+KKPT +RKRR RSPSK EAVVDIEDIMFSIDNVQ IRASLL+WYDRS RDLPWRSLDKG+PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT

Query:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH
        RVQTVV FYNRWML+WPTVQHLSRASLEEVNEMWAGLGYYRRARFL E                                                    
Subjt:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH

Query:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIA
                                                      GAKMIVKEGGRFPKTVS+LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRVIA
Subjt:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIA

Query:  RLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEI
        RLKAI GNPKDPKL KQ   AAAQLVD SRPGDFNQALMELGATLC+PTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAV VVEI
Subjt:  RLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEI

Query:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGE
        LE+QGTS+L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEAD STRRESI+SLLSK FGLEPKKNFEIV RE+VGDFIHVFTHIRLKIYVEHLVLCLKGE
Subjt:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGE

Query:  GSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRKV
        GSKLF+KQEKKSILWKCV+++VMS+MGLTSSVRK+
Subjt:  GSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRKV

A0A5A7T8X3 Adenine DNA glycosylase (e value: 5.0e-205; identity: 71.2%)
Query:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT
        MS GEKNENEE VK+ TDFRR+KKPT ERKRRGRSPSK EAVVDIEDIMFSIDNVQ IRASLL+WYDRS RDLPWRSLDKG+PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT

Query:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH
        RVQTVV FYNRWML+WPTVQHLSRASLEEVNEMWAGLGYYRRARFL E                                                    
Subjt:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH

Query:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDE--------------
                                                      GAKMIVKEGGRFPKTVS+LRKIPGIGEYTAGAIASIAF E              
Subjt:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDE--------------

Query:  -----------VVPVVDGNVIRVIARLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVL
                   VVPVVDGNVIRVIARLKAI GNPKDPKL KQ   AAAQLVDPSRPGDFNQALMELGATLC+PTNPSCSTCPVFDHCEALSISK DSSVL
Subjt:  -----------VVPVVDGNVIRVIARLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVL

Query:  VTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVG
        VTDYPAKGIKTKQRHDYSAV VVEILE+QGTS+L QSSRFLLVKRPDEGLLAGLWEFPSV LDGEADL+TRRESI+SLLSK FGLEPKKNFEIV RE+VG
Subjt:  VTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVG

Query:  DFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK
        DFIHVFTHIRLKIYVEHLVLCLKGEGSKLF+KQEKKSILWKCV+++VMS+MGLTSSVRK
Subjt:  DFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK

A0A6J1JU23 Adenine DNA glycosylase (e value: 2.5e-196; identity: 71.03%)
Query:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT
        MSGGEKNEN E VK        KKPTK  KRRGRSPSKRE +VDIEDIMFSID VQ +R+SLL+WYD S RDLPWR LDKGQPETR YGVWVSEIMLQQT
Subjt:  MSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQT

Query:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH
        RVQTVV +Y RWM RWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLE                                                    
Subjt:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH

Query:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIA
                                                      GAK+IVKEGG FPKTV  LRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIA
Subjt:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIA

Query:  RLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEI
        RLKAI GNPKD KL KQ   AAAQLVDPSRPGDFNQALMELGATLC+PT+PSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAV VVE+
Subjt:  RLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEI

Query:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGE
        LEN+GTS+L+Q SRFLLVKRPDEGLLAGLWEFPSVLL+GEAD STRRESINSLLSK FGLEPKKNFEIVIRE+VGDF+HVF+HIRLKIYVEHLVL LKGE
Subjt:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGE

Query:  GSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRKV
        GSKLF+KQEKKSI WKCVD++VMSSMGLTSSVRKV
Subjt:  GSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRKV

SwissProt top hits (e value, % identity, alignment)
F4JRF4 Adenine DNA glycosylase (e value: 2.2e-112; identity: 44.85%)
Query:  EKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRS-LDKGQPETRAYGVWVSEIMLQQTRVQ
        EK E  E      +   E +  +E +       +     DIED +FS +  Q IR  LL+WYD + RDLPWR+   + + E RAY VWVSEIMLQQTRVQ
Subjt:  EKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRS-LDKGQPETRAYGVWVSEIMLQQTRVQ

Query:  TVVHFYNRWMLRWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIER
        TV+ +Y RWM +WPT+  L +ASLE                   EVNEMWAGLGYYRRARFLLE                                    
Subjt:  TVVHFYNRWMLRWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIER

Query:  TRMHFFKIMLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAF
                                                                      GAKM+V     FP   S+L K+ GIG+YTAGAIASIAF
Subjt:  TRMHFFKIMLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAF

Query:  DEVVPVVDGNVIRVIARLKAILGNPKDPKLNK---QAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGI
        +E VPVVDGNVIRV+ARLKAI  NPKD    +   + AAQLVDPSRPGDFNQ+LMELGATLC+ + PSCS+CPV   C A S+S+ + ++ VTDYP K I
Subjt:  DEVVPVVDGNVIRVIARLKAILGNPKDPKLNK---QAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGI

Query:  KTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSK--YFGLEPKKNFEIVIREEVGDFIHVFT
        K K RHD+  V V+EI       + +   RF+LVKRP++GLLAGLWEFPSV+L+ EAD +TRR +IN  L +   F +E KK   IV REE+G+F+H+FT
Subjt:  KTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSK--YFGLEPKKNFEIVIREEVGDFIHVFT

Query:  HIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRKV
        HIR K+YVE LV+ L G    LF+ Q K ++ WKCV S+V+S++GLTS+VRKV
Subjt:  HIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRKV

Q10159 Adenine DNA glycosylase (e value: 1.2e-41; identity: 29.11%)
Query:  VQIIRASLLEWYDRSCRDLPWRSL------------DKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLE-EVNEMWAGLGYYR
        V+  R SL+++YD++ R LPWR              D  QP  R Y V VSEIMLQQTRV+TV  +Y +WM   PT++  + A    +V  +W+G+G+Y 
Subjt:  VQIIRASLLEWYDRSCRDLPWRSL------------DKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLE-EVNEMWAGLGYYR

Query:  RARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSS
        R +                                         R+H  +   H  K H                                         
Subjt:  RARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSS

Query:  AAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNK---QAAAQLVDPSRPGDFNQALMEL
                 I + G  + K       IPG+G YTAGA+ SIA+ +   +VDGNVIRV++R  AI  +    K N    + A +LVDP RPGDFNQALMEL
Subjt:  AAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNK---QAAAQLVDPSRPGDFNQALMEL

Query:  GATLCSPTNPSCSTCPVFDHCEAL---------SISKHD-----SSVLVTD--------------YPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSS
        GA  C+P +P CS CP+ + C+A          +  K+D      ++ +TD              YP    KTKQR + + V +      Q T    +  
Subjt:  GATLCSPTNPSCSTCPVFDHCEAL---------SISKHD-----SSVLVTD--------------YPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSS

Query:  RFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLV
         FL+ KRP  GLLAGLW+FP++    E+            ++++   + +    I   +  G ++H+F+HIR   +V + +
Subjt:  RFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLV

Q8R5G2 Adenine DNA glycosylase (e value: 1.7e-61; identity: 33.64%)
Query:  KQNTDFRREKKPTKERKRRGR----------------SPSKREAV----VDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQP--ETRAYGVW
        K     R  KK     KRRG+                +  KRE +    V    +   I +V   R +LL WYD+  RDLPWR   K +   + RAY VW
Subjt:  KQNTDFRREKKPTKERKRRGR----------------SPSKREAV----VDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQP--ETRAYGVW

Query:  VSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFK
        VSE+MLQQT+V TV+ +Y RWM +WPT+Q L+ ASLEEVN++W+GLGYY R R L E                                           
Subjt:  VSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFK

Query:  IMLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKE-GGRFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVP
                                                               GA+ +V+E GG  P+T   L++ +PG+G YTAGAIASIAFD+V  
Subjt:  IMLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKE-GGRFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVP

Query:  VVDGNVIRVIARLKAILGNPKDPKLNK---QAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEA-------------------------
        VVDGNVIRV+ R++AI  +P    ++      A QLVDP+RPGDFNQA MELGAT+C+P  P C+ CPV   C A                         
Subjt:  VVDGNVIRVIARLKAILGNPKDPKLNK---QAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEA-------------------------

Query:  -------LSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKY
                S +  D ++ V ++P K  +   R +YSA  VVE     G          LLV+RP+ GLLAGLWEFPSV L  E     + +++   L  +
Subjt:  -------LSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKY

Query:  FGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVL
            P         + +G+ IHVF+HI+L   V  L L
Subjt:  FGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVL

Q99P21 Adenine DNA glycosylase (e value: 7.5e-65; identity: 34.22%)
Query:  EKKPTKERKRRGRSPS---------------KRE----AVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSL--DKGQPETRAYGVWVSEIMLQQT
        +K+P   ++RR R+ S               KRE    A V    +   + +V   R++LL WYD+  RDLPWR+L  ++   + RAY VWVSE+MLQQT
Subjt:  EKKPTKERKRRGRSPS---------------KRE----AVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSL--DKGQPETRAYGVWVSEIMLQQT

Query:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH
        +V TV+ +Y RWM +WP +Q L+ ASLEEVN++W+GLGYY R R L E                                                    
Subjt:  RVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKH

Query:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKE-GGRFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRV
                                                      GA+ +V+E GG  P+T   L++ +PG+G YTAGAIASIAFD+V  VVDGNV+RV
Subjt:  MIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKE-GGRFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRV

Query:  IARLKAILGNPKDPKLNK---QAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEA--------------------------------LS
        + R++AI  +P    ++      A QLVDP+RPGDFNQA MELGAT+C+P  P CS CPV   C A                                 S
Subjt:  IARLKAILGNPKDPKLNK---QAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEA--------------------------------LS

Query:  ISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNF
         S  D S+ V ++P K  +   R +YSA  VVE     G          LLV+RPD GLLAGLWEFPSV L  E     + +++   L ++ G  P    
Subjt:  ISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNF

Query:  EIVIREEVGDFIHVFTHIRLKIYVEHLVL
          +  + +G+ IH+F+HI+L   V  L L
Subjt:  EIVIREEVGDFIHVFTHIRLKIYVEHLVL

Q9UIF7 Adenine DNA glycosylase (e value: 6.0e-62; identity: 34.34%)
Query:  EAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPW--RSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGL
        +A V    +   +  V   R SLL WYD+  RDLPW  R+ D+   + RAY VWVSE+MLQQT+V TV+++Y  WM +WPT+Q L+ ASLEEVN++WAGL
Subjt:  EAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPW--RSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGL

Query:  GYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYD
        GYY R R L E                                                                                         
Subjt:  GYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYD

Query:  LCSSAAMHTGAKMIVKE-GGRFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFN
                 GA+ +V+E GG  P+T   L++ +PG+G YTAGAIASIAF +   VVDGNV RV+ R++AI  +P    +++Q    A QLVDP+RPGDFN
Subjt:  LCSSAAMHTGAKMIVKE-GGRFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQ---AAAQLVDPSRPGDFN

Query:  QALMELGATLCSPTNPSCSTCPVFDHCEA--------------LSISKH---------------------DSSVLVTDYPAKGIKTKQRHDYSAVSVVEI
        QA MELGAT+C+P  P CS CPV   C A              LS S                       D ++ V ++P K  +   R + SA  V   
Subjt:  QALMELGATLCSPTNPSCSTCPVFDHCEA--------------LSISKH---------------------DSSVLVTDYPAKGIKTKQRHDYSAVSVVEI

Query:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVL
        LE  G       ++ LLV+RP+ GLLAGLWEFPSV  +    L  +R+++   L ++ G  P  +        +G+ +H F+HI+L   V  L L
Subjt:  LENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVL

Arabidopsis top hits (e value, % identity, alignment)
AT1G05900.2 endonuclease III 2 (e value: 9.0e-05; identity: 26.76%)
Query:  IVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPV-VDGNVIRVIARL-----KAILGNPKDPKLNKQAAAQLVDPSRPGDFNQALMELGATLCS
        +++  G  P+T+  L  +PG+G   A  +  +A+++V  + VD +V R+  RL              P+  + A  Q +        N  L+  G T+C+
Subjt:  IVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPV-VDGNVIRVIARL-----KAILGNPKDPKLNKQAAAQLVDPSRPGDFNQALMELGATLCS

Query:  PTNPSCSTCPVFDHC-EALSISKHDSSVLVTDYPAKGIKTKQ
        P  P C TC + + C  A   +   SS L      K IK+K+
Subjt:  PTNPSCSTCPVFDHC-EALSISKHDSSVLVTDYPAKGIKTKQ

AT4G12740.1 HhH-GPD base excision DNA repair family protein (e value: 1.5e-113; identity: 44.85%)
Query:  EKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRS-LDKGQPETRAYGVWVSEIMLQQTRVQ
        EK E  E      +   E +  +E +       +     DIED +FS +  Q IR  LL+WYD + RDLPWR+   + + E RAY VWVSEIMLQQTRVQ
Subjt:  EKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRS-LDKGQPETRAYGVWVSEIMLQQTRVQ

Query:  TVVHFYNRWMLRWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIER
        TV+ +Y RWM +WPT+  L +ASLE                   EVNEMWAGLGYYRRARFLLE                                    
Subjt:  TVVHFYNRWMLRWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIER

Query:  TRMHFFKIMLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAF
                                                                      GAKM+V     FP   S+L K+ GIG+YTAGAIASIAF
Subjt:  TRMHFFKIMLHFPKKHMIFSSLFSFLTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAF

Query:  DEVVPVVDGNVIRVIARLKAILGNPKDPKLNK---QAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGI
        +E VPVVDGNVIRV+ARLKAI  NPKD    +   + AAQLVDPSRPGDFNQ+LMELGATLC+ + PSCS+CPV   C A S+S+ + ++ VTDYP K I
Subjt:  DEVVPVVDGNVIRVIARLKAILGNPKDPKLNK---QAAAQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGI

Query:  KTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSK--YFGLEPKKNFEIVIREEVGDFIHVFT
        K K RHD+  V V+EI       + +   RF+LVKRP++GLLAGLWEFPSV+L+ EAD +TRR +IN  L +   F +E KK   IV REE+G+F+H+FT
Subjt:  KTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLSTRRESINSLLSK--YFGLEPKKNFEIVIREEVGDFIHVFT

Query:  HIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRKV
        HIR K+YVE LV+ L G    LF+ Q K ++ WKCV S+V+S++GLTS+VRKV
Subjt:  HIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRKV
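
In the alignment blocks above, the unlabeled middle line carries a letter where query and subject residues are identical, a `+` where the substitution is conservative, and a space otherwise. The sketch below counts these for a single block; `midline_stats` is a hypothetical helper, not part of any BLAST toolkit. The % identity reported for each hit is computed over the whole alignment, including the long unaligned stretch, so a per-block figure will generally differ from it.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    `query` is the residue string from a Query line and `midline` is the
    string directly beneath it, both without the 8-character label column.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())  # letters mark identical residues
    positives = identities + midline.count("+")            # '+' marks conservative changes
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100 * identities / len(query), 2),
    }


# Final block of the first GenBank hit (KGN46403.2), copied from this page.
query = "VLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRK"
midline = "VLCLKGEGSKLF+KQEKKSILWKCV+++VMS+MGLTSSVRK"
print(midline_stats(query, midline))
```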


Sequences
CDS sequence
ATGAGTTTGGGTAGTGGGTCGGGTTGTTGCAGTATGAGCGGCGGAGAAAAGAACGAGAACGAGGAGTTTGTGAAGCAAAATACTGATTTTCGTCGGGAAAAGAAACCAAC
GAAGGAACGAAAACGGCGGGGTCGAAGTCCGTCTAAAAGGGAAGCAGTTGTTGACATTGAAGATATTATGTTCAGCATAGACAATGTTCAGATAATCCGGGCATCGCTAT
TGGAATGGTACGACCGTAGCTGCAGGGACCTTCCATGGAGGAGCTTGGACAAAGGACAACCTGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAG
ACGAGAGTTCAGACCGTCGTTCACTTTTACAACCGTTGGATGCTTAGATGGCCCACCGTTCAACATCTCTCTCGTGCTTCTCTTGAGGAAGTTAATGAAATGTGGGCAGG
CTTGGGGTACTACAGACGAGCTCGTTTTCTTTTGGAGGTAATCGTTATTCATTTCAGTTACCTTGGACATGATAGGCTATCTAGGACGGTATGTGTTTACCATGAGATGA
GGTCGAAGATTACAAAGGAACGAATAATAGAGAGAACTCGTATGCATTTCTTCAAAATTATGCTTCACTTTCCCAAGAAACACATGATTTTTTCCTCTCTATTTTCTTTC
TTAACAAGTTTTGCATCCAGATATCCCCCATTTGCCAGAGCATATAAGAGAAATTACGGTTGCAAGTTTGAGGTGTCTTACGATTTGTGTTCTTCCGCTGCAATGCACAC
AGGTGCAAAGATGATAGTCAAAGAAGGTGGTAGATTTCCTAAAACGGTTTCTGCCCTTCGAAAAATTCCTGGAATTGGAGAATATACAGCAGGGGCTATTGCCTCCATAG
CATTCGATGAAGTGGTGCCTGTGGTCGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCCATTCTAGGAAATCCAAAAGACCCAAAGTTGAACAAGCAAGCAGCT
GCTCAATTAGTTGATCCTTCCAGGCCTGGGGACTTCAATCAGGCACTCATGGAACTTGGTGCTACTTTATGCAGTCCTACAAACCCAAGCTGCTCAACATGCCCTGTGTT
TGATCACTGCGAGGCCCTTTCAATCTCAAAGCATGATAGTTCAGTTCTTGTCACAGATTATCCTGCTAAGGGGATAAAGACCAAACAAAGACATGATTATTCTGCTGTAA
GTGTGGTTGAGATATTGGAAAACCAGGGTACATCTAAGTTAGAGCAATCTAGTAGATTTCTTCTTGTAAAGAGGCCTGATGAAGGTTTGCTTGCTGGTCTATGGGAGTTT
CCATCCGTCTTGTTGGACGGAGAAGCTGATTTAAGTACAAGGAGAGAATCCATTAATAGCCTCTTGAGTAAATACTTTGGACTTGAACCAAAAAAGAATTTTGAAATAGT
TATTAGAGAAGAGGTTGGAGATTTTATCCATGTTTTCACCCACATCCGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTTTAAAAGGTGAAGGTAGCAAGTTGTTTC
AAAAACAGGAGAAGAAATCTATATTATGGAAATGTGTAGACAGCGAGGTTATGTCAAGCATGGGATTGACGTCCAGTGTGAGGAAGGTAAGCACAGATGTCCCATGGGAA
CATAATTATATCAACCCTATAGTTCTAGTAAGTTCTGGCATCAGAACCAAATACTGTCAATTTCTCCACATTTTTCTCCTTCTCTCAGTCACATACACCCAAAGATCCAT
TTCTTCCCCCTGTTTTCTTGGGGGGTTTTTTTCTTCTGGGCACGCGGTTGAGATAAATTACACATTGGTTATAACCATGCTTCTGAAGTTTAAGAATAATGAAGATGTGA
CTTCTGCACGAGATTTGTTTTCTAATCATGCCTATGCCATGGTCGAGAAATTTCAGGCAGAGAAGACATCTTCTAGACGTGCAGTCCCCAGAAAAAAACAGAAAGCTTGA
mRNA sequence
ATGAGTTTGGGTAGTGGGTCGGGTTGTTGCAGTATGAGCGGCGGAGAAAAGAACGAGAACGAGGAGTTTGTGAAGCAAAATACTGATTTTCGTCGGGAAAAGAAACCAAC
GAAGGAACGAAAACGGCGGGGTCGAAGTCCGTCTAAAAGGGAAGCAGTTGTTGACATTGAAGATATTATGTTCAGCATAGACAATGTTCAGATAATCCGGGCATCGCTAT
TGGAATGGTACGACCGTAGCTGCAGGGACCTTCCATGGAGGAGCTTGGACAAAGGACAACCTGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAG
ACGAGAGTTCAGACCGTCGTTCACTTTTACAACCGTTGGATGCTTAGATGGCCCACCGTTCAACATCTCTCTCGTGCTTCTCTTGAGGAAGTTAATGAAATGTGGGCAGG
CTTGGGGTACTACAGACGAGCTCGTTTTCTTTTGGAGGTAATCGTTATTCATTTCAGTTACCTTGGACATGATAGGCTATCTAGGACGGTATGTGTTTACCATGAGATGA
GGTCGAAGATTACAAAGGAACGAATAATAGAGAGAACTCGTATGCATTTCTTCAAAATTATGCTTCACTTTCCCAAGAAACACATGATTTTTTCCTCTCTATTTTCTTTC
TTAACAAGTTTTGCATCCAGATATCCCCCATTTGCCAGAGCATATAAGAGAAATTACGGTTGCAAGTTTGAGGTGTCTTACGATTTGTGTTCTTCCGCTGCAATGCACAC
AGGTGCAAAGATGATAGTCAAAGAAGGTGGTAGATTTCCTAAAACGGTTTCTGCCCTTCGAAAAATTCCTGGAATTGGAGAATATACAGCAGGGGCTATTGCCTCCATAG
CATTCGATGAAGTGGTGCCTGTGGTCGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCCATTCTAGGAAATCCAAAAGACCCAAAGTTGAACAAGCAAGCAGCT
GCTCAATTAGTTGATCCTTCCAGGCCTGGGGACTTCAATCAGGCACTCATGGAACTTGGTGCTACTTTATGCAGTCCTACAAACCCAAGCTGCTCAACATGCCCTGTGTT
TGATCACTGCGAGGCCCTTTCAATCTCAAAGCATGATAGTTCAGTTCTTGTCACAGATTATCCTGCTAAGGGGATAAAGACCAAACAAAGACATGATTATTCTGCTGTAA
GTGTGGTTGAGATATTGGAAAACCAGGGTACATCTAAGTTAGAGCAATCTAGTAGATTTCTTCTTGTAAAGAGGCCTGATGAAGGTTTGCTTGCTGGTCTATGGGAGTTT
CCATCCGTCTTGTTGGACGGAGAAGCTGATTTAAGTACAAGGAGAGAATCCATTAATAGCCTCTTGAGTAAATACTTTGGACTTGAACCAAAAAAGAATTTTGAAATAGT
TATTAGAGAAGAGGTTGGAGATTTTATCCATGTTTTCACCCACATCCGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTTTAAAAGGTGAAGGTAGCAAGTTGTTTC
AAAAACAGGAGAAGAAATCTATATTATGGAAATGTGTAGACAGCGAGGTTATGTCAAGCATGGGATTGACGTCCAGTGTGAGGAAGGTAAGCACAGATGTCCCATGGGAA
CATAATTATATCAACCCTATAGTTCTAGTAAGTTCTGGCATCAGAACCAAATACTGTCAATTTCTCCACATTTTTCTCCTTCTCTCAGTCACATACACCCAAAGATCCAT
TTCTTCCCCCTGTTTTCTTGGGGGGTTTTTTTCTTCTGGGCACGCGGTTGAGATAAATTACACATTGGTTATAACCATGCTTCTGAAGTTTAAGAATAATGAAGATGTGA
CTTCTGCACGAGATTTGTTTTCTAATCATGCCTATGCCATGGTCGAGAAATTTCAGGCAGAGAAGACATCTTCTAGACGTGCAGTCCCCAGAAAAAAACAGAAAGCTTGA
ATTGCAGGAGCTGTTGACATTTACGATAAGTTTATTTGATGCTTTTCCCATTCGTCGATTGTTGTGTTTGTCCCACTAGTAATCGACAGCAGGACAACCTATTTTCATGA
AAGTTGTAGATGATGAGGGAAAATAGGAGGGGAGATTCATGGGTCAGTTTTGTGTGGATGGGGATCAAACATTCTATCTCTAGAGAGGAAGACCACGTCAGTTATCATTG
AACTAAGCTCACTTTGGTAGTTATGTTTTGGTTTTTCTTTGATAGGTAAGAATTAGATAAATAGATATTGTTGAATGACAAAAAGTTCAAGAATGTACCCATCCAGAATC
CTCATACACCAAAATATTTGAACTCACTAGACTTTGTGGTTGTAAAGAAATAGTAATTTGATAAAACTAATAAGCAAAGCATATCTAGGCTGTCAAATAGGCTTTGTAAA
TTGAGTTATGTACTCAAGTTAGTGGCTATACAAGGCG
Protein sequence
MSLGSGSGCCSMSGGEKNENEEFVKQNTDFRREKKPTKERKRRGRSPSKREAVVDIEDIMFSIDNVQIIRASLLEWYDRSCRDLPWRSLDKGQPETRAYGVWVSEIMLQQ
TRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEVIVIHFSYLGHDRLSRTVCVYHEMRSKITKERIIERTRMHFFKIMLHFPKKHMIFSSLFSF
LTSFASRYPPFARAYKRNYGCKFEVSYDLCSSAAMHTGAKMIVKEGGRFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAILGNPKDPKLNKQAA
AQLVDPSRPGDFNQALMELGATLCSPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVSVVEILENQGTSKLEQSSRFLLVKRPDEGLLAGLWEF
PSVLLDGEADLSTRRESINSLLSKYFGLEPKKNFEIVIREEVGDFIHVFTHIRLKIYVEHLVLCLKGEGSKLFQKQEKKSILWKCVDSEVMSSMGLTSSVRKVSTDVPWE
HNYINPIVLVSSGIRTKYCQFLHIFLLLSVTYTQRSISSPCFLGGFFSSGHAVEINYTLVITMLLKFKNNEDVTSARDLFSNHAYAMVEKFQAEKTSSRRAVPRKKQKA