CuGenDBv2

CSPI06G31010 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G31010
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: DNA-3-methyladenine glycosylase 1-like
Genome location: Chr6:26501971..26503040
RNA-Seq Expression: CSPI06G31010
Synteny: CSPI06G31010
Gene Ontology terms:
    GO:0006285 - base-excision repair, AP site formation (biological process)
    GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
    GO:0005634 - nucleus (cellular component)
    GO:0032993 - protein-DNA complex (cellular component)
    GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
    GO:0032131 - alkylated DNA binding (molecular function)
    GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domains:
    IPR003265 - HhH-GPD domain
    IPR011257 - DNA glycosylase
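
The genome location string can be split into chromosome, start and end to obtain the length of the gene span. The following is a minimal sketch, not part of the database itself; it assumes the `Chr:start..end` coordinates are 1-based and inclusive, and the names `Region` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type for this sketch)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both endpoints count towards the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'Chr6:26501971..26503040' style location string."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Region(chrom, int(start), int(end))


region = parse_location("Chr6:26501971..26503040")
print(region.chrom, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the span of CSPI06G31010 on Chr6 comes out at 1070 bp.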


Homology
GenBank top hits (e value, % identity, alignment)
XP_004143510.1 DNA-3-methyladenine glycosylase 1 [Cucumis sativus] | e value: 7.0e-158 | % identity: 99.3
Query:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
        MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTR+FPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
Subjt:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC

Query:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
        ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
Subjt:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQ+LYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
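
The middle line of each alignment block encodes the match state per column: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A rough sketch of how identities and positives could be tallied from a query line and its midline is shown below. The function name `alignment_stats` is hypothetical, the input is a short excerpt of the block above with the `Query:` prefix already stripped, and the database's own % identity may be computed over a different denominator.

```python
from typing import Tuple


def alignment_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment segment.

    Both strings must be the same length and must not include the
    'Query:' label or the leading indentation of the midline.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    # A column is an identity when the midline repeats the query residue.
    identities = sum(
        1 for q, m in zip(query, midline) if q.isalpha() and q == m
    )
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Excerpt from the third segment of the XP_004143510.1 alignment.
stats = alignment_stats("VRKGVQKLYGLKEL", "VRKGVQ+LYGLKEL")
print(stats)
```

Summing the three values over every segment of a hit and dividing identities by columns gives a percent identity for the aligned region.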

XP_008440714.1 PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Cucumis melo] | e value: 1.6e-141 | % identity: 89.79
Query:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
        MAKRIRRK LFQ ESP+ AVPLSPS+SSKIPF STKVRKISSNQEP KPQ SAP GYNPTR FPNLADPVKSLSS D+ISTAINHLRRSDPLLISLLDSC
Subjt:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC

Query:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
        E+P+FKSNPPFLALTKSILYQQLATKAAE+IYNRFASLCGGEA+VLPDTVLGLSPQQLRV+GVSGRKASYLHDLATKFIEG+LSNS ILEMDDETLL  L
Subjt:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQ+LYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRL++ K +VK G D
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD

XP_022949777.1 DNA-3-methyladenine glycosylase 1-like [Cucurbita moschata] | e value: 7.8e-125 | % identity: 82.04
Query:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
        MA+R RRK L Q ES +DA P      S I F +TK+RKISS Q+  KPQIS PGG + TR FPN   PVKSLSSSD I TAI+HLRRSDPLLI LLDSC
Subjt:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC

Query:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
        E+PNFKSNPPFLALTKSILYQQLATKAAE+IYNRFASLCGG+AAVLPD VLGLSPQQLRV+GVSGRKASYLHDLATKFIEGSLSNS ILEMDDETLL AL
Subjt:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQ+LYGLKELPKP EMEKLCEKWKPYRS+GAWYMWRL++ KEIVK+  D
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD

XP_023543059.1 DNA-3-methyladenine glycosylase 1 [Cucurbita pepo subsp. pepo] | e value: 1.2e-125 | % identity: 82.39
Query:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
        MA+R RRK L Q ES ++A P      SKI F +TK+RKISS Q+P KPQIS PGG + TR FPN   PVKSLSSSD I TAI+HLRRSDPLLI LLDSC
Subjt:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC

Query:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
        E+PNFKSNPPFLALTKSILYQQLATKAAE+IYNRFASLCGGEAAVLPD VLGLSPQQLRV+GVSGRKASYLHDLATKFIEGSLSNS ILEMDDETLL AL
Subjt:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQ+LYGLKELPKP EMEKLCE WKPYRS+GAWYMWRL++ KEIVK+  D
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD

XP_038881017.1 DNA-3-methyladenine glycosylase 1-like [Benincasa hispida] | e value: 9.6e-131 | % identity: 86.23
Query:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
        MAKRIRRK L Q +S  DA PL PS+SSKIPFPSTKVRKISS QEP KPQIS  GG +PTR F NLA P+KSLSSSD+I TAI+HLRRSDPLLIS+L+SC
Subjt:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC

Query:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
        E+PNFKSNPPFLALTKSILYQQLATKAAE+IYNRFASLCGGE +VLPD VLGLSPQQLRVIGVSGRKASYLHDLATKFIEG+LSNS ILEMDDETLL AL
Subjt:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAK
        TAVKGIG+WSVHMFMIFTLHRPDVLPVGDLGVRKGVQ+LYGLKELPKPAEMEKLCEKWKPYRS+GAWYMWRL++ K
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KM62 ENDO3c domain-containing protein | e value: 3.4e-158 | % identity: 99.3
Query:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
        MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTR+FPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
Subjt:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC

Query:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
        ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
Subjt:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQ+LYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD

A0A1S3B2D5 probable DNA-3-methyladenine glycosylase 2 | e value: 7.6e-142 | % identity: 89.79
Query:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
        MAKRIRRK LFQ ESP+ AVPLSPS+SSKIPF STKVRKISSNQEP KPQ SAP GYNPTR FPNLADPVKSLSS D+ISTAINHLRRSDPLLISLLDSC
Subjt:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC

Query:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
        E+P+FKSNPPFLALTKSILYQQLATKAAE+IYNRFASLCGGEA+VLPDTVLGLSPQQLRV+GVSGRKASYLHDLATKFIEG+LSNS ILEMDDETLL  L
Subjt:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQ+LYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRL++ K +VK G D
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD

A0A5A7T3R0 Putative DNA-3-methyladenine glycosylase 2 | e value: 7.6e-142 | % identity: 89.79
Query:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
        MAKRIRRK LFQ ESP+ AVPLSPS+SSKIPF STKVRKISSNQEP KPQ SAP GYNPTR FPNLADPVKSLSS D+ISTAINHLRRSDPLLISLLDSC
Subjt:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC

Query:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
        E+P+FKSNPPFLALTKSILYQQLATKAAE+IYNRFASLCGGEA+VLPDTVLGLSPQQLRV+GVSGRKASYLHDLATKFIEG+LSNS ILEMDDETLL  L
Subjt:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQ+LYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRL++ K +VK G D
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD

A0A6J1GD23 DNA-3-methyladenine glycosylase 1-like | e value: 3.8e-125 | % identity: 82.04
Query:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
        MA+R RRK L Q ES +DA P      S I F +TK+RKISS Q+  KPQIS PGG + TR FPN   PVKSLSSSD I TAI+HLRRSDPLLI LLDSC
Subjt:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC

Query:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
        E+PNFKSNPPFLALTKSILYQQLATKAAE+IYNRFASLCGG+AAVLPD VLGLSPQQLRV+GVSGRKASYLHDLATKFIEGSLSNS ILEMDDETLL AL
Subjt:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQ+LYGLKELPKP EMEKLCEKWKPYRS+GAWYMWRL++ KEIVK+  D
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD

A0A6J1IQD1 DNA-3-methyladenine glycosylase 1-like | e value: 7.9e-123 | % identity: 79.93
Query:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC
        MA+R RRK L Q ES ++A P      SKI F +T++RKISS ++P KPQIS  GG + TR FPN   PVKSLSSSD I TAI+HLRRSDPLLI LLDSC
Subjt:  MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSC

Query:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL
        E+PNFKSNPPFLA+TKSILYQQLATKAAE+IYNRFASLCGGEAAVLPD VLGLSPQQLRV+GVSGRKASYLHDLATKF+EG+LSNS ILEMDDETLL AL
Subjt:  ETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
        T VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQ+LYGLKELPKP EMEKLCEKWKPYRS+GAWYMWRL++ K I KN  D
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD

SwissProt top hits (e value, % identity, alignment)
O31544 Putative DNA-3-methyladenine glycosylase YfjP | e value: 1.1e-20 | % identity: 30.9
Query:  TPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVL-----PDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETL
        TP       +  + K I++QQL    A  +  RF    G +   +     P+T+  L  Q LR +  S RKA Y  D +    EG+LS S +  M DE +
Subjt:  TPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVL-----PDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETL

Query:  LRALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLID
        ++ L  ++GIG W+V   ++F L RP++ P+ D+G++  +++ + L + P    M  + ++W+PY S  + Y+WR I+
Subjt:  LRALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLID

O94468 Alkylbase DNA glycosidase-like protein mag2 | e value: 3.4e-22 | % identity: 28.29
Query:  LSSSDKISTAINHLRRSD---PLLISLLDSCETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCG-GEAAVLPDTVLGLSPQQLRVIGVSGRKA
        +S       A  HL   D     L+  +  C       + P+  + ++I  Q+L+  A  +I N+F + C   +    P  ++    + L   G S  K+
Subjt:  LSSSDKISTAINHLRRSD---PLLISLLDSCETPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCG-GEAAVLPDTVLGLSPQQLRVIGVSGRKA

Query:  SYLHDLATKFIEGSL-SNSFILEMDDETLLRALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAW
          +H +A   +   + S S I +M +E L+ +L+ +KG+  W++ M+ IFTL R D++P  D  ++   ++ +GL   P+  E+EKL +  KPYR+I AW
Subjt:  SYLHDLATKFIEGSL-SNSFILEMDDETLLRALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAW

Query:  YMWRL
        Y+W++
Subjt:  YMWRL

P22134 DNA-3-methyladenine glycosylase | e value: 1.8e-15 | % identity: 31.09
Query:  DKISTAINHLRRSDPLLISLLDSCE---------TPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGE----AAVLPDTVLGLSPQQLRVIGV
        +K + A  H+   DP L  +L + E          PN   +  F+ L  +IL QQ++ +AAE+I  R  SL GG       +  D        ++   G+
Subjt:  DKISTAINHLRRSDPLLISLLDSCE---------TPNFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGE----AAVLPDTVLGLSPQQLRVIGV

Query:  SGRKASYLHDLATKFIE--GSLSNSFILEMDDETLLRAL-TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYG-----LKELPKPAE-----
        S RK  YL  LA  F E    +   F  + +DE ++ +L T VKGIG WS  MF+I  L R DV    DLG+ +G  K         KEL +  +     
Subjt:  SGRKASYLHDLATKFIE--GSLSNSFILEMDDETLLRAL-TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYG-----LKELPKPAE-----

Query:  ----------------MEKLCEKWKPYRSIGAWYMWRL
                        MEK  E + PYRS+  + +WRL
Subjt:  ----------------MEKLCEKWKPYRSIGAWYMWRL

Q92383 DNA-3-methyladenine glycosylase 1 | e value: 6.2e-24 | % identity: 34.32
Query:  NFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILE-MDDETLLRALTA
        + +   P+  L +++  QQL +KAA AI+NRF S+        P+ +  +  + +R  G S RK   L  +A   I G +      E + +E L+  LT 
Subjt:  NFKSNPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILE-MDDETLLRALTA

Query:  VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWR
        +KGIG W+V M +IF+L+R DV+P  DL +R G + L+ L ++P    + K  E   P+R+  AWY+W+
Subjt:  VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWR

Arabidopsis top hits (e value, % identity, alignment)
AT1G19480.1 DNA glycosylase superfamily protein | e value: 1.8e-74 | % identity: 53.15
Query:  AKRIRRKCLFQLESPSDAVPLSPSASSKIPFPST------KVRKISSNQ--EPTKPQISAPGGYNPTRLFPNLADPVKS--LSSSDKISTAINHLRRSDP
        A+RI    L  + SP   +PL P    K+           K   ISS+Q   P      +PG    + L       +++  L+   ++ TAI++LR +DP
Subjt:  AKRIRRKCLFQLESPSDAVPLSPSASSKIPFPST------KVRKISSNQ--EPTKPQISAPGGYNPTRLFPNLADPVKS--LSSSDKISTAINHLRRSDP

Query:  LLISLLDSCETPNFKS-NPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILE
        LL +L+D    P F+S   PFLAL ++ILYQQLA KA  +IY RF SLCGGE  V+P+TVL L+PQQLR IGVSGRKASYLHDLA K+  G LS+S IL 
Subjt:  LLISLLDSCETPNFKS-NPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILE

Query:  MDDETLLRALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAK
        MD+++L   LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL +LP+P++ME+ C KW+PYRS+G+WYMWRLI+AK
Subjt:  MDDETLLRALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAK

AT1G19480.2 DNA glycosylase superfamily protein | e value: 1.8e-74 | % identity: 53.15
Query:  AKRIRRKCLFQLESPSDAVPLSPSASSKIPFPST------KVRKISSNQ--EPTKPQISAPGGYNPTRLFPNLADPVKS--LSSSDKISTAINHLRRSDP
        A+RI    L  + SP   +PL P    K+           K   ISS+Q   P      +PG    + L       +++  L+   ++ TAI++LR +DP
Subjt:  AKRIRRKCLFQLESPSDAVPLSPSASSKIPFPST------KVRKISSNQ--EPTKPQISAPGGYNPTRLFPNLADPVKS--LSSSDKISTAINHLRRSDP

Query:  LLISLLDSCETPNFKS-NPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILE
        LL +L+D    P F+S   PFLAL ++ILYQQLA KA  +IY RF SLCGGE  V+P+TVL L+PQQLR IGVSGRKASYLHDLA K+  G LS+S IL 
Subjt:  LLISLLDSCETPNFKS-NPPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILE

Query:  MDDETLLRALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAK
        MD+++L   LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL +LP+P++ME+ C KW+PYRS+G+WYMWRLI+AK
Subjt:  MDDETLLRALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAK

AT1G75230.1 DNA glycosylase superfamily protein | e value: 1.1e-73 | % identity: 52.38
Query:  SASSKIPFPSTKVRKIS---------------SNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSCETPNFKS-N
        S  +KIP    K+RK+S               S    TKP   +    + T   P +    +SL+   ++  A++HLR  DPLL SL+D    P F++  
Subjt:  SASSKIPFPSTKVRKIS---------------SNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSCETPNFKS-N

Query:  PPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRALTAVKGIGV
         PFLAL +SILYQQLA KA  +IY RF +LCGGE  V+P+ VL L+PQQLR IGVSGRKASYLHDLA K+  G LS+S I+ MD+++L   LT V GIG 
Subjt:  PPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRALTAVKGIGV

Query:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKN
        WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+P++ME+LCEKW+PYRS+ +WY+WRLI++K    N
Subjt:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKN

AT1G75230.2 DNA glycosylase superfamily protein | e value: 1.1e-73 | % identity: 52.38
Query:  SASSKIPFPSTKVRKIS---------------SNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSCETPNFKS-N
        S  +KIP    K+RK+S               S    TKP   +    + T   P +    +SL+   ++  A++HLR  DPLL SL+D    P F++  
Subjt:  SASSKIPFPSTKVRKIS---------------SNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSCETPNFKS-N

Query:  PPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRALTAVKGIGV
         PFLAL +SILYQQLA KA  +IY RF +LCGGE  V+P+ VL L+PQQLR IGVSGRKASYLHDLA K+  G LS+S I+ MD+++L   LT V GIG 
Subjt:  PPFLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRALTAVKGIGV

Query:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKN
        WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+P++ME+LCEKW+PYRS+ +WY+WRLI++K    N
Subjt:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKN

AT3G50880.1 DNA glycosylase superfamily protein | e value: 2.6e-73 | % identity: 56.47
Query:  ASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSCETPNF--KSNPPFLALTKSILYQQL
        +SS+I F   K+RK+SS+  P              R+    + P   LS+   +  A+ HL+ SD LL +L+ +   P     SN PFL+L +SILYQQL
Subjt:  ASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSCETPNF--KSNPPFLALTKSILYQQL

Query:  ATKAAEAIYNRFASLC-GGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRALTAVKGIGVWSVHMFMIFTLHRP
        ATKAA+ IY+RF SL  GGEA V+P++V+ LS   LR IGVSGRKASYLHDLA K+  G LS+  IL+M DE L+  LT VKGIGVW+VHMFMIF+LHRP
Subjt:  ATKAAEAIYNRFASLC-GGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRALTAVKGIGVWSVHMFMIFTLHRP

Query:  DVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKE
        DVLPVGDLGVRKGV+ LYGLK LP P +ME+LCEKW+PYRS+G+WYMWRLI++++
Subjt:  DVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKE


Sequences
CDS sequence
ATGGCCAAAAGAATCCGCCGGAAATGCCTCTTTCAGTTGGAGTCTCCATCCGACGCTGTTCCTCTCTCTCCCTCGGCTTCCTCCAAGATTCCCTTCCCATCCACAAAAGT
TCGGAAGATTTCCTCCAATCAAGAACCGACCAAACCACAAATTTCAGCTCCCGGCGGTTATAACCCGACCCGATTATTCCCGAACCTCGCCGATCCCGTCAAATCCTTGT
CGTCTTCGGATAAAATTTCCACAGCGATCAATCATTTACGCCGTTCGGATCCTCTTCTAATAAGTTTGTTAGATTCTTGCGAAACCCCCAATTTCAAGTCCAATCCACCG
TTCTTAGCACTAACGAAGAGCATCCTCTACCAACAACTCGCCACAAAGGCCGCCGAAGCGATCTACAATCGCTTCGCCTCGCTATGCGGCGGAGAGGCGGCGGTACTGCC
GGACACCGTGCTCGGACTCTCGCCGCAACAGCTGCGAGTAATCGGAGTTTCGGGGAGGAAGGCAAGTTACCTCCATGATCTAGCAACGAAATTCATAGAGGGGAGTTTAT
CAAATTCATTTATTCTGGAGATGGACGACGAGACTCTATTGAGGGCGTTAACGGCGGTGAAGGGAATCGGCGTTTGGTCGGTGCATATGTTCATGATATTTACCCTACAT
CGGCCGGATGTGCTGCCCGTGGGGGATTTGGGAGTGAGAAAAGGGGTGCAGAAATTGTACGGACTGAAGGAATTGCCGAAGCCGGCGGAGATGGAGAAACTTTGTGAGAA
ATGGAAGCCTTACAGATCGATCGGAGCTTGGTATATGTGGAGGCTAATCGACGCTAAGGAAATCGTGAAGAATGGTTGCGATTGA
mRNA sequence
TGACGGAGTAGACGATCTCATCTTAATCAATTACGACCCGACATCTGTACTTCATTCTTCCAAAATCAAAGATTCAAAACCTACCTTTTTACCCACAAATCATTCACCGT
CGCCCATGGCCAAAAGAATCCGCCGGAAATGCCTCTTTCAGTTGGAGTCTCCATCCGACGCTGTTCCTCTCTCTCCCTCGGCTTCCTCCAAGATTCCCTTCCCATCCACA
AAAGTTCGGAAGATTTCCTCCAATCAAGAACCGACCAAACCACAAATTTCAGCTCCCGGCGGTTATAACCCGACCCGATTATTCCCGAACCTCGCCGATCCCGTCAAATC
CTTGTCGTCTTCGGATAAAATTTCCACAGCGATCAATCATTTACGCCGTTCGGATCCTCTTCTAATAAGTTTGTTAGATTCTTGCGAAACCCCCAATTTCAAGTCCAATC
CACCGTTCTTAGCACTAACGAAGAGCATCCTCTACCAACAACTCGCCACAAAGGCCGCCGAAGCGATCTACAATCGCTTCGCCTCGCTATGCGGCGGAGAGGCGGCGGTA
CTGCCGGACACCGTGCTCGGACTCTCGCCGCAACAGCTGCGAGTAATCGGAGTTTCGGGGAGGAAGGCAAGTTACCTCCATGATCTAGCAACGAAATTCATAGAGGGGAG
TTTATCAAATTCATTTATTCTGGAGATGGACGACGAGACTCTATTGAGGGCGTTAACGGCGGTGAAGGGAATCGGCGTTTGGTCGGTGCATATGTTCATGATATTTACCC
TACATCGGCCGGATGTGCTGCCCGTGGGGGATTTGGGAGTGAGAAAAGGGGTGCAGAAATTGTACGGACTGAAGGAATTGCCGAAGCCGGCGGAGATGGAGAAACTTTGT
GAGAAATGGAAGCCTTACAGATCGATCGGAGCTTGGTATATGTGGAGGCTAATCGACGCTAAGGAAATCGTGAAGAATGGTTGCGATTGACCGGATAACATGGAATACAG
AGGTAGTTTTGCAGTGTGACTTTTGAATTCGTGATTTTCAAATTTGGCTAGTTTTTGTGTCTGTGTAATTTTAAGAACTC
Protein sequence
MAKRIRRKCLFQLESPSDAVPLSPSASSKIPFPSTKVRKISSNQEPTKPQISAPGGYNPTRLFPNLADPVKSLSSSDKISTAINHLRRSDPLLISLLDSCETPNFKSNPP
FLALTKSILYQQLATKAAEAIYNRFASLCGGEAAVLPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIEGSLSNSFILEMDDETLLRALTAVKGIGVWSVHMFMIFTLH
RPDVLPVGDLGVRKGVQKLYGLKELPKPAEMEKLCEKWKPYRSIGAWYMWRLIDAKEIVKNGCD
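
The protein sequence corresponds to the CDS read codon by codon. The sketch below illustrates that relationship on the first and last few codons of the CDS. It assumes the standard nuclear genetic code; `translate` is a hypothetical helper written for this example, not a function provided by the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0 using the standard code.

    Whitespace is removed first so a sequence wrapped over several
    lines can be pasted in directly. Trailing bases that do not form
    a full codon are ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*" and to_stop:
            break  # stop codon reached
        protein.append(residue)
    return "".join(protein)


# First nine codons and last five codons (including the stop) of the CDS.
n_terminus = translate("ATGGCCAAAAGAATCCGCCGGAAATGC")
c_terminus = translate("AATGGTTGCGATTGA")
print(n_terminus, c_terminus)
```

The two fragments translate to `MAKRIRRKC` and `NGCD`, matching the start and end of the protein sequence listed above; passing the full CDS to the same function would be one way to check the whole record.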