CuGenDBv2

Sed0008156 (gene) of Chayote v1 genome

Gene ID: Sed0008156
Organism: Sechium edule (Chayote v1)
Description: DNA-3-methyladenine glycosylase 1-like
Genome location: LG14:20687629..20688789
RNA-Seq Expression: Sed0008156
Synteny: Sed0008156
Gene Ontology terms:
  GO:0006285 - base-excision repair, AP site formation (biological process)
  GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0032993 - protein-DNA complex (cellular component)
  GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
  GO:0032131 - alkylated DNA binding (molecular function)
  GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domains:
  IPR003265 - HhH-GPD domain
  IPR011257 - DNA glycosylase
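
The genome location string can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG14:20687629..20688789")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 1161 bases on linkage group LG14.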


Homology
GenBank top hits (e value, % identity, alignment)
KAG6603971.1 Zinc-finger homeodomain protein 6, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.1e-106; identity: 73.21%)
Query:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS
        PPMA+RTRRK LL   +       S I FR++K+RKISST     + KPQI TP GG  T         VKSLSSS  I TAIDHLRRSDPLLI L+DS 
Subjt:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS

Query:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL
        ESPNFKSNPPFLALTKSILYQQLAT+AAESIYNRFA+LCGG+AAV P AVL LSP++LR +GVSGRK+SYLHDLATKF +G+LS+ SI+EM+DE+LL  L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL

Query:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK
        TAVKGIGVWSVHMFMIF LHRPDVLPVGDLGVRKGVQRLYGLK+LP+P EME+LCEKWKPYRS+GAWYMWRLME+K++ K
Subjt:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK

KAG7034142.1 mag1, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.1e-106; identity: 73.21%)
Query:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS
        PPMA+RTRRK LL   +       S I FR++K+RKISST     + KPQI TP GG  T         VKSLSSS  I TAIDHLRRSDPLLI L+DS 
Subjt:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS

Query:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL
        ESPNFKSNPPFLALTKSILYQQLAT+AAESIYNRFA+LCGG+AAV P AVL LSP++LR +GVSGRK+SYLHDLATKF +G+LS+ SI+EM+DE+LL  L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL

Query:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK
        TAVKGIGVWSVHMFMIF LHRPDVLPVGDLGVRKGVQRLYGLK+LP+P EME+LCEKWKPYRS+GAWYMWRLME+K++ K
Subjt:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK

XP_008440714.1 PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Cucumis melo] (e value: 7.3e-105; identity: 73.85%)
Query:  MAKRTRRKFLL-----TEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLI
        MAKR RRKFL      T A PL PSSSS+IPFRS+K+RKISS     +  KPQ   P G   T         VKSLSS  EISTAI+HLRRSDPLLI+L+
Subjt:  MAKRTRRKFLL-----TEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLI

Query:  DSSESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLL
        DS ESP+FKSNPPFLALTKSILYQQLAT+AAESIYNRFA+LCGGEA+V P  VL LSP++LR +GVSGRK+SYLHDLATKF +G LS+  I+EM+DE+LL
Subjt:  DSSESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLL

Query:  KELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK
         ELTAVKGIGVWSVHMFMIF LHRPDVLPVGDLGVRKGVQRLYGLK+LP+PAEME+LCEKWKPYRS+GAWYMWRLME K V K
Subjt:  KELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK

XP_022949777.1 DNA-3-methyladenine glycosylase 1-like [Cucurbita moschata] (e value: 5.1e-106; identity: 73.21%)
Query:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS
        PPMA+RTRRK LL   +       S I FR++K+RKISST     + KPQI TP GG  T         VKSLSSS  I TAIDHLRRSDPLLI L+DS 
Subjt:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS

Query:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL
        ESPNFKSNPPFLALTKSILYQQLAT+AAESIYNRFA+LCGG+AAV P AVL LSP++LR +GVSGRK+SYLHDLATKF +G+LS+ SI+EM+DE+LL  L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL

Query:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK
        TAVKGIGVWSVHMFMIF LHRPDVLPVGDLGVRKGVQRLYGLK+LP+P EME+LCEKWKPYRS+GAWYMWRLME+K++ K
Subjt:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK

XP_023543059.1 DNA-3-methyladenine glycosylase 1 [Cucurbita pepo subsp. pepo] (e value: 1.1e-105; identity: 73.21%)
Query:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS
        PPMA+RTRRK LL   +       S+I FR++K+RKISST       KPQI TP GG  T         VKSLSSS  I TAIDHLRRSDPLLI L+DS 
Subjt:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS

Query:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL
        ESPNFKSNPPFLALTKSILYQQLAT+AAESIYNRFA+LCGGEAAV P AVL LSP++LR +GVSGRK+SYLHDLATKF +G+LS+ SI+EM+DE+LL  L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL

Query:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK
        TAVKGIGVWSVHMFMIF LHRPDVLPVGDLGVRKGVQRLYGLK+LP+P EME+LCE WKPYRS+GAWYMWRLME+K++ K
Subjt:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KM62 ENDO3c domain-containing protein (e value: 1.1e-103; identity: 71.73%)
Query:  MAKRTRRKFLL-----TEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLI
        MAKR RRK L      ++A PL PS+SS+IPF S+K+RKISS     +  KPQI  P G   T         VKSLSSS +ISTAI+HLRRSDPLLI+L+
Subjt:  MAKRTRRKFLL-----TEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLI

Query:  DSSESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLL
        DS E+PNFKSNPPFLALTKSILYQQLAT+AAE+IYNRFA+LCGGEAAV P  VL LSP++LR IGVSGRK+SYLHDLATKF +G+LS+  I+EM+DE+LL
Subjt:  DSSESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLL

Query:  KELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK
        + LTAVKGIGVWSVHMFMIF LHRPDVLPVGDLGVRKGVQRLYGLK+LP+PAEME+LCEKWKPYRS+GAWYMWRL++ K++ K
Subjt:  KELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK

A0A1S3B2D5 probable DNA-3-methyladenine glycosylase 2 (e value: 3.6e-105; identity: 73.85%)
Query:  MAKRTRRKFLL-----TEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLI
        MAKR RRKFL      T A PL PSSSS+IPFRS+K+RKISS     +  KPQ   P G   T         VKSLSS  EISTAI+HLRRSDPLLI+L+
Subjt:  MAKRTRRKFLL-----TEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLI

Query:  DSSESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLL
        DS ESP+FKSNPPFLALTKSILYQQLAT+AAESIYNRFA+LCGGEA+V P  VL LSP++LR +GVSGRK+SYLHDLATKF +G LS+  I+EM+DE+LL
Subjt:  DSSESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLL

Query:  KELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK
         ELTAVKGIGVWSVHMFMIF LHRPDVLPVGDLGVRKGVQRLYGLK+LP+PAEME+LCEKWKPYRS+GAWYMWRLME K V K
Subjt:  KELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK

A0A5A7T3R0 Putative DNA-3-methyladenine glycosylase 2 (e value: 3.6e-105; identity: 73.85%)
Query:  MAKRTRRKFLL-----TEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLI
        MAKR RRKFL      T A PL PSSSS+IPFRS+K+RKISS     +  KPQ   P G   T         VKSLSS  EISTAI+HLRRSDPLLI+L+
Subjt:  MAKRTRRKFLL-----TEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLI

Query:  DSSESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLL
        DS ESP+FKSNPPFLALTKSILYQQLAT+AAESIYNRFA+LCGGEA+V P  VL LSP++LR +GVSGRK+SYLHDLATKF +G LS+  I+EM+DE+LL
Subjt:  DSSESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLL

Query:  KELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK
         ELTAVKGIGVWSVHMFMIF LHRPDVLPVGDLGVRKGVQRLYGLK+LP+PAEME+LCEKWKPYRS+GAWYMWRLME K V K
Subjt:  KELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK

A0A6J1GD23 DNA-3-methyladenine glycosylase 1-like (e value: 2.5e-106; identity: 73.21%)
Query:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS
        PPMA+RTRRK LL   +       S I FR++K+RKISST     + KPQI TP GG  T         VKSLSSS  I TAIDHLRRSDPLLI L+DS 
Subjt:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS

Query:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL
        ESPNFKSNPPFLALTKSILYQQLAT+AAESIYNRFA+LCGG+AAV P AVL LSP++LR +GVSGRK+SYLHDLATKF +G+LS+ SI+EM+DE+LL  L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL

Query:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK
        TAVKGIGVWSVHMFMIF LHRPDVLPVGDLGVRKGVQRLYGLK+LP+P EME+LCEKWKPYRS+GAWYMWRLME+K++ K
Subjt:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK

A0A6J1IQD1 DNA-3-methyladenine glycosylase 1-like (e value: 1.0e-104; identity: 72.5%)
Query:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS
        PPMA+RTRRK LL   +       S+I FR++++RKISST       KPQI T  GG  T         VKSLSSS  I TAIDHLRRSDPLLI L+DS 
Subjt:  PPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGET---------VKSLSSSGEISTAIDHLRRSDPLLITLIDSS

Query:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL
        ESPNFKSNPPFLA+TKSILYQQLAT+AAESIYNRFA+LCGGEAAV P AVL LSP++LR +GVSGRK+SYLHDLATKF +GTLS+ SI+EM+DE+LL  L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKEL

Query:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK
        T VKGIGVWSVHMFMIF LHRPDVLPVGDLGVRKGVQRLYGLK+LP+P EME+LCEKWKPYRS+GAWYMWRLME+K + K
Subjt:  TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHK

SwissProt top hits (e value, % identity, alignment)
O31544 Putative DNA-3-methyladenine glycosylase YfjP (e value: 1.7e-19; identity: 27.19%)
Query:  GETVKSLSS----SGEISTAIDHLRRSDPLLITLIDSSESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGE-----AAVAPAAVLALSPEK
        GE +K +         +   +DH  ++  L     + + +P       +  + K I++QQL    A ++  RF    G +         P  +  L  + 
Subjt:  GETVKSLSS----SGEISTAIDHLRRSDPLLITLIDSSESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGE-----AAVAPAAVLALSPEK

Query:  LRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEK
        LR +  S RK+ Y  D +   A+GTLS   +  M DE ++K+L  ++GIG W+V   ++F L RP++ P+ D+G++  ++R + L   P    M  + ++
Subjt:  LRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEK

Query:  WKPYRSLGAWYMWRLME
        W+PY S  + Y+WR +E
Subjt:  WKPYRSLGAWYMWRLME

O94468 Alkylbase DNA glycosidase-like protein mag2 (e value: 1.2e-20; identity: 27.62%)
Query:  LSSSGEISTAIDHLRRSDPLLITLIDSSESPNFKSNP---PFLALTKSILYQQLATRAAESIYNRFAALCG-GEAAVAPAAVLALSPEKLRGIGVSGRKS
        +S   +   A  HL   D    +L+          +P   P+  + ++I  Q+L+  A  SI N+F   C   +    P  ++    E L   G S  KS
Subjt:  LSSSGEISTAIDHLRRSDPLLITLIDSSESPNFKSNP---PFLALTKSILYQQLATRAAESIYNRFAALCG-GEAAVAPAAVLALSPEKLRGIGVSGRKS

Query:  SYLHDLATKFADGTL-SDCSIVEMNDESLLKELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAW
          +H +A    +  + S   I +M++E L++ L+ +KG+  W++ M+ IF L R D++P  D  ++   +  +GL   P+  E+E+L +  KPYR++ AW
Subjt:  SYLHDLATKFADGTL-SDCSIVEMNDESLLKELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAW

Query:  YMWRLMELKK
        Y+W++ +L +
Subjt:  YMWRLMELKK

P22134 DNA-3-methyladenine glycosylase (e value: 8.9e-13; identity: 29.83%)
Query:  EISTAIDHLRRSDPLLITLIDSSESPNF--KSNPP------FLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVL---ALSPEKLRGI---GV
        + + A +H+   DP L  ++ ++E   +  ++  P      F+ L  +IL QQ++ +AAESI  R  +L GG  A     +L      P K   I   G+
Subjt:  EISTAIDHLRRSDPLLITLIDSSESPNF--KSNPP------FLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVL---ALSPEKLRGI---GV

Query:  SGRKSSYLHDLATKFAD--GTLSDCSIVEMNDESLLKEL-TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLK-----------------
        S RK  YL  LA  F +    +      + NDE +++ L T VKGIG WS  MF+I  L R DV    DLG+ +G  +    K                 
Subjt:  SGRKSSYLHDLATKFAD--GTLSDCSIVEMNDESLLKEL-TAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLK-----------------

Query:  ---------KLPEPAEMEELCEKWKPYRSLGAWYMWRL
                 K+ +   ME+  E + PYRS+  + +WRL
Subjt:  ---------KLPEPAEMEELCEKWKPYRSLGAWYMWRL

P37878 DNA-3-methyladenine glycosylase (e value: 1.5e-12; identity: 27.33%)
Query:  FLALTKSILYQQLATRAAESIYNRFAALCGGEAA-------VAP--AAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKELT
        F AL   +L QQ+    A S+  +F    G           V P    +  L+P  L  I ++ +KS Y+  +A   A G LS   +++MN +   K L 
Subjt:  FLALTKSILYQQLATRAAESIYNRFAALCGGEAA-------VAP--AAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKELT

Query:  AVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLM
         ++GIG W+ +  ++  L  P   P+ D+G+   ++ L  + + P   E+ E+   WK ++S   +Y+WR++
Subjt:  AVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLM

Q92383 DNA-3-methyladenine glycosylase 1 (e value: 2.8e-22; identity: 32.37%)
Query:  NFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVE-MNDESLLKELTA
        + +   P+  L +++  QQL ++AA +I+NRF ++        P  +  +  E +R  G S RK   L  +A     G +      E +++E L++ LT 
Subjt:  NFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVE-MNDESLLKELTA

Query:  VKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMEL
        +KGIG W+V M +IF+L+R DV+P  DL +R G + L+ L K+P    + +  E   P+R+  AWY+W+  +L
Subjt:  VKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMEL

Arabidopsis top hits (e value, % identity, alignment)
AT1G19480.1 DNA glycosylase superfamily protein (e value: 2.8e-70; identity: 62.32%)
Query:  KSLSSSGEISTAIDHLRRSDPLLITLIDSSESPNFKS-NPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSS
        + L+  GE+ TAI +LR +DPLL  LID    P F+S   PFLAL ++ILYQQLA +A  SIY RF +LCGGE  V P  VL+L+P++LR IGVSGRK+S
Subjt:  KSLSSSGEISTAIDHLRRSDPLLITLIDSSESPNFKS-NPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSS

Query:  YLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYM
        YLHDLA K+ +G LSD +I+ M+++SL   LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL  LP P++ME+ C KW+PYRS+G+WYM
Subjt:  YLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYM

Query:  WRLMELK
        WRL+E K
Subjt:  WRLMELK

AT1G19480.2 DNA glycosylase superfamily protein (e value: 2.8e-70; identity: 62.32%)
Query:  KSLSSSGEISTAIDHLRRSDPLLITLIDSSESPNFKS-NPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSS
        + L+  GE+ TAI +LR +DPLL  LID    P F+S   PFLAL ++ILYQQLA +A  SIY RF +LCGGE  V P  VL+L+P++LR IGVSGRK+S
Subjt:  KSLSSSGEISTAIDHLRRSDPLLITLIDSSESPNFKS-NPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSS

Query:  YLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYM
        YLHDLA K+ +G LSD +I+ M+++SL   LT V GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL  LP P++ME+ C KW+PYRS+G+WYM
Subjt:  YLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYM

Query:  WRLMELK
        WRL+E K
Subjt:  WRLMELK

AT1G75230.1 DNA glycosylase superfamily protein (e value: 6.7e-72; identity: 53.76%)
Query:  SSSSEIPFRSSKLRKISSTPGGGD------------TVKPQIPTPCGGGETV-------KSLSSSGEISTAIDHLRRSDPLLITLIDSSESPNFKS-NPP
        S  ++IP R  K+RK+S      D            T KP   +      TV       +SL+  GE+  A+ HLR  DPLL +LID    P F++   P
Subjt:  SSSSEIPFRSSKLRKISSTPGGGD------------TVKPQIPTPCGGGETV-------KSLSSSGEISTAIDHLRRSDPLLITLIDSSESPNFKS-NPP

Query:  FLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWS
        FLAL +SILYQQLA +A  SIY RF ALCGGE  V P  VL L+P++LR IGVSGRK+SYLHDLA K+ +G LSD  IV M+++SL   LT V GIG WS
Subjt:  FLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWS

Query:  VHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELK
        VHMFMI +LHRPDVLPV DLGVRKGVQ L G++ LP P++ME+LCEKW+PYRS+ +WY+WRL+E K
Subjt:  VHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELK

AT1G75230.2 DNA glycosylase superfamily protein (e value: 6.7e-72; identity: 53.76%)
Query:  SSSSEIPFRSSKLRKISSTPGGGD------------TVKPQIPTPCGGGETV-------KSLSSSGEISTAIDHLRRSDPLLITLIDSSESPNFKS-NPP
        S  ++IP R  K+RK+S      D            T KP   +      TV       +SL+  GE+  A+ HLR  DPLL +LID    P F++   P
Subjt:  SSSSEIPFRSSKLRKISSTPGGGD------------TVKPQIPTPCGGGETV-------KSLSSSGEISTAIDHLRRSDPLLITLIDSSESPNFKS-NPP

Query:  FLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWS
        FLAL +SILYQQLA +A  SIY RF ALCGGE  V P  VL L+P++LR IGVSGRK+SYLHDLA K+ +G LSD  IV M+++SL   LT V GIG WS
Subjt:  FLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWS

Query:  VHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELK
        VHMFMI +LHRPDVLPV DLGVRKGVQ L G++ LP P++ME+LCEKW+PYRS+ +WY+WRL+E K
Subjt:  VHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELK

AT3G50880.1 DNA glycosylase superfamily protein (e value: 2.3e-72; identity: 57.83%)
Query:  SSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGETVKSLSSSGEISTAIDHLRRSDPLLITLIDSSESPNF--KSNPPFLALTKSILYQQLATRAAE
        SSS I FR  K+RK+SS P     +    P           LS+   +  A+ HL+ SD LL  LI +   P     SN PFL+L +SILYQQLAT+AA+
Subjt:  SSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGETVKSLSSSGEISTAIDHLRRSDPLLITLIDSSESPNF--KSNPPFLALTKSILYQQLATRAAE

Query:  SIYNRFAALC-GGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWSVHMFMIFALHRPDVLPVG
         IY+RF +L  GGEA V P +V++LS   LR IGVSGRK+SYLHDLA K+ +G LSD  I++M+DE L+  LT VKGIGVW+VHMFMIF+LHRPDVLPVG
Subjt:  SIYNRFAALC-GGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKGIGVWSVHMFMIFALHRPDVLPVG

Query:  DLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKK
        DLGVRKGV+ LYGLK LP P +ME+LCEKW+PYRS+G+WYMWRL+E +K
Subjt:  DLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKK


Sequences
CDS sequence
ATGATTCAAACCCTAGCTTTTACCCACAAATCATTCACCGCCGCCGTAGCAGCTTCCACCGCCACTCGATCGCCGCCGATGGCCAAAAGAACCCGCCGGAAGTTCCTGCT
AACCGAAGCCAATCCTCTCCCTCCGTCGTCATCCTCCGAGATCCCGTTCCGATCCTCCAAATTACGGAAGATTTCCTCCACTCCCGGCGGCGGTGACACCGTCAAACCAC
AAATCCCAACTCCCTGCGGCGGCGGCGAAACCGTAAAATCGCTATCGTCTTCCGGCGAAATCTCCACAGCGATCGATCACTTACGCCGATCGGATCCTCTCCTAATAACT
CTAATAGATTCATCCGAATCCCCAAATTTCAAATCGAATCCACCATTTCTAGCCCTAACGAAGAGCATCCTCTACCAGCAGCTCGCCACCAGGGCCGCCGAATCGATCTA
CAATCGCTTCGCCGCGCTTTGCGGCGGCGAAGCCGCGGTGGCGCCGGCGGCGGTGCTGGCGCTCTCGCCGGAGAAGCTTCGAGGAATCGGAGTTTCCGGTAGAAAATCAA
GCTACCTCCACGACCTAGCGACGAAATTCGCCGACGGAACGTTATCGGATTGTTCGATTGTGGAGATGAACGACGAGAGTTTGTTGAAGGAGCTGACGGCGGTGAAGGGG
ATCGGCGTTTGGTCGGTGCATATGTTCATGATATTCGCTCTTCACCGGCCGGATGTTTTGCCGGTGGGGGATTTGGGAGTGAGAAAAGGAGTGCAGAGATTGTACGGACT
GAAGAAGCTGCCGGAGCCGGCGGAGATGGAGGAGCTTTGTGAGAAATGGAAGCCATATAGGTCTTTGGGAGCTTGGTATATGTGGAGGCTAATGGAACTGAAGAAGGTTC
ACAAATTAGAGGTGGGTTCTATTGTAATTTGA
mRNA sequence
CTTATATTATATAGTCCAATCGTGCACCACAAGCTTTGAGTTGCTGCTGACGGAGGAGATGATCAAATCTAAGCCAATCACAACGCTACATCTGTCCTTCACTTTCCTGA
AACATGATTCAAACCCTAGCTTTTACCCACAAATCATTCACCGCCGCCGTAGCAGCTTCCACCGCCACTCGATCGCCGCCGATGGCCAAAAGAACCCGCCGGAAGTTCCT
GCTAACCGAAGCCAATCCTCTCCCTCCGTCGTCATCCTCCGAGATCCCGTTCCGATCCTCCAAATTACGGAAGATTTCCTCCACTCCCGGCGGCGGTGACACCGTCAAAC
CACAAATCCCAACTCCCTGCGGCGGCGGCGAAACCGTAAAATCGCTATCGTCTTCCGGCGAAATCTCCACAGCGATCGATCACTTACGCCGATCGGATCCTCTCCTAATA
ACTCTAATAGATTCATCCGAATCCCCAAATTTCAAATCGAATCCACCATTTCTAGCCCTAACGAAGAGCATCCTCTACCAGCAGCTCGCCACCAGGGCCGCCGAATCGAT
CTACAATCGCTTCGCCGCGCTTTGCGGCGGCGAAGCCGCGGTGGCGCCGGCGGCGGTGCTGGCGCTCTCGCCGGAGAAGCTTCGAGGAATCGGAGTTTCCGGTAGAAAAT
CAAGCTACCTCCACGACCTAGCGACGAAATTCGCCGACGGAACGTTATCGGATTGTTCGATTGTGGAGATGAACGACGAGAGTTTGTTGAAGGAGCTGACGGCGGTGAAG
GGGATCGGCGTTTGGTCGGTGCATATGTTCATGATATTCGCTCTTCACCGGCCGGATGTTTTGCCGGTGGGGGATTTGGGAGTGAGAAAAGGAGTGCAGAGATTGTACGG
ACTGAAGAAGCTGCCGGAGCCGGCGGAGATGGAGGAGCTTTGTGAGAAATGGAAGCCATATAGGTCTTTGGGAGCTTGGTATATGTGGAGGCTAATGGAACTGAAGAAGG
TTCACAAATTAGAGGTGGGTTCTATTGTAATTTGAATTGTAATTTGTTTTGTTTGTTTTGATTGGGACTCATCACTCATAGAACTTGCAATTGTTCTTTGTTTGAACTTT
GAATTGTTTTTCCATGATTGTGAACATTTCAATTGAGTTGTTTCGTATGATGGTTGACTAA
Protein sequence
MIQTLAFTHKSFTAAVAASTATRSPPMAKRTRRKFLLTEANPLPPSSSSEIPFRSSKLRKISSTPGGGDTVKPQIPTPCGGGETVKSLSSSGEISTAIDHLRRSDPLLIT
LIDSSESPNFKSNPPFLALTKSILYQQLATRAAESIYNRFAALCGGEAAVAPAAVLALSPEKLRGIGVSGRKSSYLHDLATKFADGTLSDCSIVEMNDESLLKELTAVKG
IGVWSVHMFMIFALHRPDVLPVGDLGVRKGVQRLYGLKKLPEPAEMEELCEKWKPYRSLGAWYMWRLMELKKVHKLEVGSIVI
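
The relationship between the CDS and the protein sequence above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first ten codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGATTCAAACCCTAGCTTTTACCCACAAA"
print(translate(cds_start))  # MIQTLAFTHK
```

The printed peptide matches the first ten residues of the protein sequence; passing the full CDS, with its lines joined into a single string, to `translate` would extend the same check to the whole protein.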