CuGenDBv2

Tan0004752 (gene) of Snake gourd v1 genome

Gene ID: Tan0004752
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DNA-3-methyladenine glycosylase 1-like
Genome location: LG03:74893708..74895312
RNA-Seq Expression: Tan0004752
Synteny: Tan0004752
Gene Ontology terms:
GO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domains:
IPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6603971.1 Zinc-finger homeodomain protein 6, partial [Cucurbita argyrosperma subsp. sororia]; e value 9.9e-123; identity 83.99%
Query:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS
        PMA+RTRRK LLQSESQT+A+P      S + FR+TKIRKISS Q+  KP+ISTPGGGD          PVKSLSSSD I TAIDHLR SDPLLI LLDS
Subjt:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS

Query:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA
         ESPNFKSNPPFLAL KSILYQQLATKAAESIYNRFA+LCGG+A VLP AVL LSPQQLRV+GVSGRK+SYLHDLATKF+EG LSNSSILEMDDETLLSA
Subjt:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA

Query:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK
        LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME+KEIVK
Subjt:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK

KAG7034142.1 mag1, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 4.9e-122; identity 83.63%
Query:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS
        PMA+RTRRK LLQSESQT+ +P      S + FR+TKIRKISS Q+  KP+ISTPGGGD          PVKSLSSSD I TAIDHLR SDPLLI LLDS
Subjt:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS

Query:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA
         ESPNFKSNPPFLAL KSILYQQLATKAAESIYNRFA+LCGG+A VLP AVL LSPQQLRV+GVSGRK+SYLHDLATKF+EG LSNSSILEMDDETLLSA
Subjt:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA

Query:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK
        LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME+KEIVK
Subjt:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK

XP_022949777.1 DNA-3-methyladenine glycosylase 1-like [Cucurbita moschata]; e value 9.9e-123; identity 83.99%
Query:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS
        PMA+RTRRK LLQSESQT+A+P      S + FR+TKIRKISS Q+  KP+ISTPGGGD          PVKSLSSSD I TAIDHLR SDPLLI LLDS
Subjt:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS

Query:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA
         ESPNFKSNPPFLAL KSILYQQLATKAAESIYNRFA+LCGG+A VLP AVL LSPQQLRV+GVSGRK+SYLHDLATKF+EG LSNSSILEMDDETLLSA
Subjt:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA

Query:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK
        LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME+KEIVK
Subjt:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK

XP_023543059.1 DNA-3-methyladenine glycosylase 1 [Cucurbita pepo subsp. pepo]; e value 3.1e-124; identity 85.05%
Query:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS
        PMA+RTRRK LLQSESQTEA+P      SK+ FR+TKIRKISS Q+P KP+ISTPGGGD          PVKSLSSSD I TAIDHLR SDPLLI LLDS
Subjt:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS

Query:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA
         ESPNFKSNPPFLAL KSILYQQLATKAAESIYNRFA+LCGGEA VLP AVL LSPQQLRV+GVSGRK+SYLHDLATKF+EG LSNSSILEMDDETLLSA
Subjt:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA

Query:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK
        LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCE WKPYRSMGAWYMWRLME+KEIVK
Subjt:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK

XP_038881017.1 DNA-3-methyladenine glycosylase 1-like [Benincasa hispida]; e value 6.9e-124; identity 84.42%
Query:  MAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDSS
        MAKR RRKFL+QS+S+ +A+PLPPSSSSK+PF STK+RKISS QEPAKP+IST GG D          P+KSLSSSDEIFTAIDHLR SDPLLIS+L+S 
Subjt:  MAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDSS

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSAL
        ESPNFKSNPPFLAL KSILYQQLATKAAESIYNRFA+LCGGE  VLP  VL LSPQQLRVIGVSGRK+SYLHDLATKF+EG LSNS ILEMDDETLL AL
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVK
        TAVKGIG+WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRSMGAWYMWRLMEVK
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVK

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KM62 ENDO3c domain-containing protein; e value 1.7e-120; identity 81.79%
Query:  MAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGG----------GDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSS
        MAKR RRK L Q ES ++A PL PS+SSK+PF STK+RKISS QEP KP+IS PGG           DPVKSLSSSD+I TAI+HLR SDPLLISLLDS 
Subjt:  MAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGG----------GDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSS

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSAL
        E+PNFKSNPPFLAL KSILYQQLATKAAE+IYNRFA+LCGGEA VLP  VL LSPQQLRVIGVSGRK+SYLHDLATKF+EG LSNS ILEMDDETLL AL
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRS+GAWYMWRL++ KEIVK
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK

A0A1S3B2D5 probable DNA-3-methyladenine glycosylase 2; e value 2.6e-121; identity 82.86%
Query:  MAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGG----------GDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSS
        MAKR RRKFL QSES T A PL PSSSSK+PFRSTK+RKISS QEPAKP+ S P G           DPVKSLSS DEI TAI+HLR SDPLLISLLDS 
Subjt:  MAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGG----------GDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSS

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSAL
        ESP+FKSNPPFLAL KSILYQQLATKAAESIYNRFA+LCGGEA VLP  VL LSPQQLRV+GVSGRK+SYLHDLATKF+EG LSNS ILEMDDETLL  L
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRS+GAWYMWRLME K +VK
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK

A0A5A7T3R0 Putative DNA-3-methyladenine glycosylase 2; e value 2.6e-121; identity 82.86%
Query:  MAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGG----------GDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSS
        MAKR RRKFL QSES T A PL PSSSSK+PFRSTK+RKISS QEPAKP+ S P G           DPVKSLSS DEI TAI+HLR SDPLLISLLDS 
Subjt:  MAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGG----------GDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSS

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSAL
        ESP+FKSNPPFLAL KSILYQQLATKAAESIYNRFA+LCGGEA VLP  VL LSPQQLRV+GVSGRK+SYLHDLATKF+EG LSNS ILEMDDETLL  L
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRS+GAWYMWRLME K +VK
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK

A0A6J1GD23 DNA-3-methyladenine glycosylase 1-like; e value 4.8e-123; identity 83.99%
Query:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS
        PMA+RTRRK LLQSESQT+A+P      S + FR+TKIRKISS Q+  KP+ISTPGGGD          PVKSLSSSD I TAIDHLR SDPLLI LLDS
Subjt:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS

Query:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA
         ESPNFKSNPPFLAL KSILYQQLATKAAESIYNRFA+LCGG+A VLP AVL LSPQQLRV+GVSGRK+SYLHDLATKF+EG LSNSSILEMDDETLLSA
Subjt:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA

Query:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK
        LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME+KEIVK
Subjt:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK

A0A6J1IQD1 DNA-3-methyladenine glycosylase 1-like; e value 1.2e-121; identity 83.27%
Query:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS
        PMA+RTRRK LLQSESQTEA+P      SK+ FR+T+IRKISS ++P KP+IST GGGD          PVKSLSSSD I TAIDHLR SDPLLI LLDS
Subjt:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGD----------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDS

Query:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA
         ESPNFKSNPPFLA+ KSILYQQLATKAAESIYNRFA+LCGGEA VLP AVL LSPQQLRV+GVSGRK+SYLHDLATKFVEG LSNSSILEMDDETLLSA
Subjt:  SESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSA

Query:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK
        LT VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME+K I K
Subjt:  LTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK

SwissProt top hits (columns: e value, % identity, alignment)
O31544 Putative DNA-3-methyladenine glycosylase YfjP; e value 4.8e-19; identity 28.03%
Query:  IRKISSPQEPAKPEISTPGGGDPVKSLSSSDEIFTAIDHLR------SSDPLLISLLDSSESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCG
        I K+ +     +PE    G  D  + +     IF   +HL+      S   L     + + +P       +  + K I++QQL    A ++  RF    G
Subjt:  IRKISSPQEPAKPEISTPGGGDPVKSLSSSDEIFTAIDHLR------SSDPLLISLLDSSESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCG

Query:  GEA-GV----LPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKG
         +  GV     P  +  L  Q LR +  S RK+ Y  D +    EG LS S +  M DE ++  L  ++GIG W+V   ++F L RP++ P+ D+G++  
Subjt:  GEA-GV----LPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKG

Query:  VQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME
        ++R + L + P    M  + ++W+PY S  + Y+WR +E
Subjt:  VQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME

O94468 Alkylbase DNA glycosidase-like protein mag2; e value 3.9e-21; identity 30.24%
Query:  LSSSDEIFTAIDHLRSSDPLLISLLDSSESPNFKSNP---PFLALAKSILYQQLATKAAESIYNRFAALCG-GEAGVLPAAVLALSPQQLRVIGVSGRKS
        +S   +   A  HL S D    SL+          +P   P+  + ++I  Q+L+  A  SI N+F   C   +    P  ++    + L   G S  KS
Subjt:  LSSSDEIFTAIDHLRSSDPLLISLLDSSESPNFKSNP---PFLALAKSILYQQLATKAAESIYNRFAALCG-GEAGVLPAAVLALSPQQLRVIGVSGRKS

Query:  SYLHDLATKFV-EGILSNSSILEMDDETLLSALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAW
          +H +A   + + I S S I +M +E L+ +L+ +KG+  W++ M+ IFTL R D++P  D  ++   +  +GL   P+  E+EKL +  KPYR++ AW
Subjt:  SYLHDLATKFV-EGILSNSSILEMDDETLLSALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAW

Query:  YMWRL
        Y+W++
Subjt:  YMWRL

P22134 DNA-3-methyladenine glycosylase; e value 4.2e-15; identity 30.52%
Query:  PVKSLSSSDEIFT-AIDHLRSSDPLLISLLDSSESPNF--KSNPP------FLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSP----
        P K ++  +E F  A +H+   DP L  +L ++E   +  ++  P      F+ LA +IL QQ++ +AAESI  R  +L G   G  P   +        
Subjt:  PVKSLSSSDEIFT-AIDHLRSSDPLLISLLDSSESPNF--KSNPP------FLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSP----

Query:  ---QQLRVIGVSGRKSSYLHDLATKFVEGILSNSSIL---EMDDETLLSALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK-ELPKP
            ++   G+S RK  YL  LA  F E       +    + D+E + S +T VKGIG WS  MF+I  L R DV    DLG+ +G  +    K EL K 
Subjt:  ---QQLRVIGVSGRKSSYLHDLATKFVEGILSNSSIL---EMDDETLLSALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK-ELPKP

Query:  VE-------------------------MEKLCEKWKPYRSMGAWYMWRL
        +                          MEK  E + PYRS+  + +WRL
Subjt:  VE-------------------------MEKLCEKWKPYRSMGAWYMWRL

Q92383 DNA-3-methyladenine glycosylase 1; e value 3.2e-23; identity 31.46%
Query:  NFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILE-MDDETLLSALTA
        + +   P+  L +++  QQL +KAA +I+NRF ++        P  +  +  + +R  G S RK   L  +A   + G++      E + +E L+  LT 
Subjt:  NFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILE-MDDETLLSALTA

Query:  VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK
        +KGIG W+V M +IF+L+R DV+P  DL +R G + L+ L ++P  + + K  E   P+R+  AWY+W+  ++ +  K
Subjt:  VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G19480.1 DNA glycosylase superfamily protein; e value 8.5e-72; identity 54.58%
Query:  SSSSKLPFRSTKIRK----------------ISSPQ--EPAKPEISTPGGGD------------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDSSESPN
        S  SK+P R  KIRK                ISS Q   P   +  +PG G               + L+   E+ TAI +LR++DPLL +L+D    P 
Subjt:  SSSSKLPFRSTKIRK----------------ISSPQ--EPAKPEISTPGGGD------------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDSSESPN

Query:  FKS-NPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAV
        F+S   PFLAL ++ILYQQLA KA  SIY RF +LCGGE  V+P  VL+L+PQQLR IGVSGRK+SYLHDLA K+  GILS+S+IL MD+++L + LT V
Subjt:  FKS-NPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAV

Query:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVK
         GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL +LP+P +ME+ C KW+PYRS+G+WYMWRL+E K
Subjt:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVK

AT1G19480.2 DNA glycosylase superfamily protein; e value 8.5e-72; identity 54.58%
Query:  SSSSKLPFRSTKIRK----------------ISSPQ--EPAKPEISTPGGGD------------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDSSESPN
        S  SK+P R  KIRK                ISS Q   P   +  +PG G               + L+   E+ TAI +LR++DPLL +L+D    P 
Subjt:  SSSSKLPFRSTKIRK----------------ISSPQ--EPAKPEISTPGGGD------------PVKSLSSSDEIFTAIDHLRSSDPLLISLLDSSESPN

Query:  FKS-NPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAV
        F+S   PFLAL ++ILYQQLA KA  SIY RF +LCGGE  V+P  VL+L+PQQLR IGVSGRK+SYLHDLA K+  GILS+S+IL MD+++L + LT V
Subjt:  FKS-NPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAV

Query:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVK
         GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL +LP+P +ME+ C KW+PYRS+G+WYMWRL+E K
Subjt:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVK

AT1G75230.1 DNA glycosylase superfamily protein; e value 2.5e-71; identity 54.48%
Query:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSSESPNFKS-N
        P+  R  RK     ++    NP    S       +TK  K+S  +    P I         +SL+   E+  A+ HLRS DPLL SL+D    P F++  
Subjt:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSSESPNFKS-N

Query:  PPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAVKGIGV
         PFLAL +SILYQQLA KA  SIY RF ALCGGE GV+P  VL L+PQQLR IGVSGRK+SYLHDLA K+  GILS+S I+ MD+++L + LT V GIG 
Subjt:  PPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAVKGIGV

Query:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVK
        WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+P +ME+LCEKW+PYRS+ +WY+WRL+E K
Subjt:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVK

AT1G75230.2 DNA glycosylase superfamily protein; e value 2.5e-71; identity 54.48%
Query:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSSESPNFKS-N
        P+  R  RK     ++    NP    S       +TK  K+S  +    P I         +SL+   E+  A+ HLRS DPLL SL+D    P F++  
Subjt:  PMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSSESPNFKS-N

Query:  PPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAVKGIGV
         PFLAL +SILYQQLA KA  SIY RF ALCGGE GV+P  VL L+PQQLR IGVSGRK+SYLHDLA K+  GILS+S I+ MD+++L + LT V GIG 
Subjt:  PPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAVKGIGV

Query:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVK
        WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+P +ME+LCEKW+PYRS+ +WY+WRL+E K
Subjt:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVK

AT3G50880.1 DNA glycosylase superfamily protein; e value 9.1e-74; identity 59.59%
Query:  SSSKLPFRSTKIRKISSPQEPAKPEISTPGGGDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSSESPNF--KSNPPFLALAKSILYQQLATKAAESIYN
        SSS++ FR  KIRK+SS   P     ++P        LS+   +  A+ HL+SSD LL +L+ +   P     SN PFL+LA+SILYQQLATKAA+ IY+
Subjt:  SSSKLPFRSTKIRKISSPQEPAKPEISTPGGGDPVKSLSSSDEIFTAIDHLRSSDPLLISLLDSSESPNF--KSNPPFLALAKSILYQQLATKAAESIYN

Query:  RFAALC-GGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGV
        RF +L  GGEAGV+P +V++LS   LR IGVSGRK+SYLHDLA K+  G+LS+  IL+M DE L+  LT VKGIGVW+VHMFMIF+LHRPDVLPVGDLGV
Subjt:  RFAALC-GGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGV

Query:  RKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKE
        RKGV+ LYGLK LP P++ME+LCEKW+PYRS+G+WYMWRL+E ++
Subjt:  RKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKE


Sequences
CDS sequence
ATGATTCAAAACCTAGCTTTTAACCACAAATCATTCACCGCCGCCGTGACCTTCCCTTTCACTCGATCGACGCCGATGGCCAAAAGAACCCGCCGGAAGTTTCTCTTACA
GTCCGAGTCTCAAACCGAGGCCAATCCTCTCCCTCCATCGTCATCCTCCAAGCTTCCATTCCGATCCACCAAAATACGGAAGATTTCCTCCCCTCAAGAACCGGCCAAAC
CAGAAATCTCAACTCCCGGCGGCGGCGACCCCGTCAAATCATTATCGTCCTCGGATGAAATTTTCACAGCGATCGATCATTTACGCAGTTCGGATCCTCTCCTAATAAGT
CTATTAGATTCAAGCGAATCACCTAATTTCAAATCGAATCCACCGTTTCTAGCCCTAGCAAAGAGCATTCTCTACCAGCAGCTCGCCACGAAGGCCGCCGAATCGATCTA
CAATCGCTTCGCCGCCCTATGCGGCGGTGAGGCAGGGGTGCTGCCGGCCGCCGTTCTGGCACTCTCGCCGCAACAGCTGCGAGTAATCGGAGTTTCAGGGAGGAAATCAA
GCTACCTCCATGACCTAGCGACGAAATTCGTAGAGGGAATTTTGTCGAATTCTTCGATTCTGGAGATGGACGACGAGACTCTATTGAGTGCGTTGACGGCGGTGAAGGGA
ATCGGCGTTTGGTCGGTGCATATGTTCATGATCTTCACTCTGCACCGGCCGGATGTGCTGCCGGTGGGGGATTTGGGGGTGAGAAAAGGGGTGCAGAGATTGTACGGACT
GAAGGAATTGCCGAAGCCAGTGGAGATGGAGAAACTGTGTGAGAAATGGAAGCCTTACAGATCGATGGGGGCTTGGTACATGTGGCGGCTAATGGAAGTGAAGGAAATCG
TGAAATAA
mRNA sequence
TGAGTTAATCATGAATGCATTTTTAAATTTGTCAATAAAAGAAGATTATTTAAAAAAAATAATGACATAAGTAAAAGAATGAAATAACATAAATCATTAAAAAGCAAATT
GTATTATATGCTCCCAGTCAAAGGTGATAAAAGATGGAGTTTGACGGAGGAGAAAACTTTGAGTTGCTGACGGAGATCCAATCGTGCACCAGAAGCTTTGAGTTGCTGAC
GGAGGAGACGATCTCATCTCAACCAATCACAACGCGACATCTGTACTTCACTCTTCCTTAATCATGATTCAAAACCTAGCTTTTAACCACAAATCATTCACCGCCGCCGT
GACCTTCCCTTTCACTCGATCGACGCCGATGGCCAAAAGAACCCGCCGGAAGTTTCTCTTACAGTCCGAGTCTCAAACCGAGGCCAATCCTCTCCCTCCATCGTCATCCT
CCAAGCTTCCATTCCGATCCACCAAAATACGGAAGATTTCCTCCCCTCAAGAACCGGCCAAACCAGAAATCTCAACTCCCGGCGGCGGCGACCCCGTCAAATCATTATCG
TCCTCGGATGAAATTTTCACAGCGATCGATCATTTACGCAGTTCGGATCCTCTCCTAATAAGTCTATTAGATTCAAGCGAATCACCTAATTTCAAATCGAATCCACCGTT
TCTAGCCCTAGCAAAGAGCATTCTCTACCAGCAGCTCGCCACGAAGGCCGCCGAATCGATCTACAATCGCTTCGCCGCCCTATGCGGCGGTGAGGCAGGGGTGCTGCCGG
CCGCCGTTCTGGCACTCTCGCCGCAACAGCTGCGAGTAATCGGAGTTTCAGGGAGGAAATCAAGCTACCTCCATGACCTAGCGACGAAATTCGTAGAGGGAATTTTGTCG
AATTCTTCGATTCTGGAGATGGACGACGAGACTCTATTGAGTGCGTTGACGGCGGTGAAGGGAATCGGCGTTTGGTCGGTGCATATGTTCATGATCTTCACTCTGCACCG
GCCGGATGTGCTGCCGGTGGGGGATTTGGGGGTGAGAAAAGGGGTGCAGAGATTGTACGGACTGAAGGAATTGCCGAAGCCAGTGGAGATGGAGAAACTGTGTGAGAAAT
GGAAGCCTTACAGATCGATGGGGGCTTGGTACATGTGGCGGCTAATGGAAGTGAAGGAAATCGTGAAATAAGGTCGCGATTTATCGGATAAGGTAGCGTTGTAATGTGAG
TTTCCTTTCCTTAGTTGTCCACCGCCGCAGGTGCAGAATCAATTGTCCTTTATTGAGTTGTTTTTCTATAATTAAGATTTAAGAACAACCCATTTTAGCTTCTTTCATGT
TTTTTCTCTTTAATGTGTTTATAATCATATGTATCTCAAAGTTGAGATGTTTGAATCTTTTATCTCCGCATTTGAATTAAAAAAAAATCAAAATATTGAACTTCTTTTAA
AGTTTATGTAAACATTCGATGATAATTAACAGCAGGTGTATATTTAGTTGAGACCTGGATAGGACTTGAGCGCCTCAAATTTTTTACTGTATATTATATACAGGAAAGAT
CAAATTATAACATAAAAAATCCTTATGGTTTAGTGTTGTTAGTATGCCCCAGGTTCAAGCTCTAG
Protein sequence
MIQNLAFNHKSFTAAVTFPFTRSTPMAKRTRRKFLLQSESQTEANPLPPSSSSKLPFRSTKIRKISSPQEPAKPEISTPGGGDPVKSLSSSDEIFTAIDHLRSSDPLLIS
LLDSSESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFAALCGGEAGVLPAAVLALSPQQLRVIGVSGRKSSYLHDLATKFVEGILSNSSILEMDDETLLSALTAVKG
IGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMEVKEIVK
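
The protein sequence above should be the conceptual translation of the CDS sequence. The following is a minimal Python sketch of that consistency check, assuming the standard nuclear genetic code (NCBI translation table 1); the record itself does not state which table was used. The function name `translate` and the two short CDS fragments (taken from the first 33 and last 15 nucleotides of the CDS) are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order so they line up with the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the start and end of the CDS listed above.
cds_start = "ATGATTCAAAACCTAGCTTTTAACCACAAATCA"
cds_end = "GAAATCGTGAAATAA"

print(translate(cds_start))  # MIQNLAFNHKS
print(translate(cds_end))    # EIVK
```

Both fragments agree with the start (MIQNLAFNHKS...) and end (...EIVK) of the listed protein. Passing the full CDS, with its line breaks, to `translate` would extend the same check to the whole sequence.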