; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg012237 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg012237
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDNA-3-methyladenine glycosylase 1-like
Genome locationscaffold1:6071766..6072641
RNA-Seq ExpressionSpg012237
SyntenySpg012237
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603971.1 Zinc-finger homeodomain protein 6, partial [Cucurbita argyrosperma subsp. sororia]2.2e-12284.53Show/hide
Query:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN
        MA+RTRRK L QSESQT+ +P S+ I FR+TKIRKISSTQ+  K QI+     D TR FPN    VKSLSSSD ICTAI+HLRRSDPLLI LLDSCESPN
Subjt:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN

Query:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK
        FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGG+A VLP AVL LSPQQLRV+GVSGRKASYLHDLATKF+EG LSNSSILEMDDETLL+ LTAVK
Subjt:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK

Query:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD
        GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME+K IVK D
Subjt:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD

KAG7034142.1 mag1, partial [Cucurbita argyrosperma subsp. argyrosperma]2.8e-12284.53Show/hide
Query:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN
        MA+RTRRK L QSESQT+ +P S+ I FR+TKIRKISSTQ+  K QI+     D TR FPN    VKSLSSSD ICTAI+HLRRSDPLLI LLDSCESPN
Subjt:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN

Query:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK
        FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGG+A VLP AVL LSPQQLRV+GVSGRKASYLHDLATKF+EG LSNSSILEMDDETLL+ LTAVK
Subjt:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK

Query:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD
        GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME+K IVK D
Subjt:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD

XP_022949777.1 DNA-3-methyladenine glycosylase 1-like [Cucurbita moschata]2.2e-12284.53Show/hide
Query:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN
        MA+RTRRK L QSESQT+ +P S+ I FR+TKIRKISSTQ+  K QI+     D TR FPN    VKSLSSSD ICTAI+HLRRSDPLLI LLDSCESPN
Subjt:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN

Query:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK
        FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGG+A VLP AVL LSPQQLRV+GVSGRKASYLHDLATKF+EG LSNSSILEMDDETLL+ LTAVK
Subjt:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK

Query:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD
        GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME+K IVK D
Subjt:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD

XP_022978525.1 DNA-3-methyladenine glycosylase 1-like [Cucurbita maxima]4.4e-12384.53Show/hide
Query:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN
        MA+RTRRK L QSESQTE +P S  I FR+T+IRKISST++P K QI+     D TR FPN    VKSLSSSD ICTAI+HLRRSDPLLI LLDSCESPN
Subjt:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN

Query:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK
        FKSNPPFLA+TKSILYQQLATKAAESIYNRFA+LCGGEA VLP AVL LSPQQLRV+GVSGRKASYLHDLATKFVEG LSNSSILEMDDETLL+ LT VK
Subjt:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK

Query:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD
        GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME+KGI K D
Subjt:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD

XP_023543059.1 DNA-3-methyladenine glycosylase 1 [Cucurbita pepo subsp. pepo]6.3e-12284.89Show/hide
Query:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN
        MA+RTRRK L QSESQTE +P S  I FR+TKIRKISSTQ+P K QI+     D TR FPN    VKSLSSSD I TAI+HLRRSDPLLI LLDSCESPN
Subjt:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN

Query:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK
        FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGGEA VLP AVL LSPQQLRV+GVSGRKASYLHDLATKF+EG LSNSSILEMDDETLL+ LTAVK
Subjt:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK

Query:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD
        GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCE WKPYRSMGAWYMWRLME+K IVK D
Subjt:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD

TrEMBL top hitse value%identityAlignment
A0A0A0KM62 ENDO3c domain-containing protein1.0e-11781.43Show/hide
Query:  MAKRTRRKSLSQSESQTETNP----SSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSC
        MAKR RRK L Q ES ++  P    +SS IPF STK+RKISS QEP K QI+     +PTR FPN  D VKSLSSSD+I TAI HLRRSDPLLISLLDSC
Subjt:  MAKRTRRKSLSQSESQTETNP----SSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTL
        E+PNFKSNPPFLALTKSILYQQLATKAAE+IYNRFA+LCGGEA VLP  VL LSPQQLRVIGVSGRKASYLHDLATKF+EG LSNS ILEMDDETLL  L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRS+GAWYMWRL++ K IVK
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVK

A0A1S3B2D5 probable DNA-3-methyladenine glycosylase 22.2e-12083.57Show/hide
Query:  MAKRTRRKSLSQSESQTETNP----SSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSC
        MAKR RRK L QSES T   P    SSS IPFRSTK+RKISS QEPAK Q +     +PTR FPN  D VKSLSS DEI TAI HLRRSDPLLISLLDSC
Subjt:  MAKRTRRKSLSQSESQTETNP----SSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTL
        ESP+FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGGEA+VLP  VL LSPQQLRV+GVSGRKASYLHDLATKF+EG LSNS ILEMDDETLL  L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRS+GAWYMWRLME KG+VK
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVK

A0A5A7T3R0 Putative DNA-3-methyladenine glycosylase 22.2e-12083.57Show/hide
Query:  MAKRTRRKSLSQSESQTETNP----SSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSC
        MAKR RRK L QSES T   P    SSS IPFRSTK+RKISS QEPAK Q +     +PTR FPN  D VKSLSS DEI TAI HLRRSDPLLISLLDSC
Subjt:  MAKRTRRKSLSQSESQTETNP----SSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTL
        ESP+FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGGEA+VLP  VL LSPQQLRV+GVSGRKASYLHDLATKF+EG LSNS ILEMDDETLL  L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRS+GAWYMWRLME KG+VK
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVK

A0A6J1GD23 DNA-3-methyladenine glycosylase 1-like1.1e-12284.53Show/hide
Query:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN
        MA+RTRRK L QSESQT+ +P S+ I FR+TKIRKISSTQ+  K QI+     D TR FPN    VKSLSSSD ICTAI+HLRRSDPLLI LLDSCESPN
Subjt:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN

Query:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK
        FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGG+A VLP AVL LSPQQLRV+GVSGRKASYLHDLATKF+EG LSNSSILEMDDETLL+ LTAVK
Subjt:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK

Query:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD
        GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME+K IVK D
Subjt:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD

A0A6J1IQD1 DNA-3-methyladenine glycosylase 1-like2.1e-12384.53Show/hide
Query:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN
        MA+RTRRK L QSESQTE +P S  I FR+T+IRKISST++P K QI+     D TR FPN    VKSLSSSD ICTAI+HLRRSDPLLI LLDSCESPN
Subjt:  MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPN

Query:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK
        FKSNPPFLA+TKSILYQQLATKAAESIYNRFA+LCGGEA VLP AVL LSPQQLRV+GVSGRKASYLHDLATKFVEG LSNSSILEMDDETLL+ LT VK
Subjt:  FKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVK

Query:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD
        GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME+KGI K D
Subjt:  GIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKAD

SwissProt top hitse value%identityAlignment
O31544 Putative DNA-3-methyladenine glycosylase YfjP6.1e-1931.93Show/hide
Query:  LTKSILYQQLATKAAESIYNRFAALCGGEATVL-----PAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGIGV
        + K I++QQL    A ++  RF    G +   +     P  +  L  Q LR +  S RKA Y  D +    EG LS S +  M DE ++  L  ++GIG 
Subjt:  LTKSILYQQLATKAAESIYNRFAALCGGEATVL-----PAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGIGV

Query:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME
        W+V   ++F L RP++ P+ D+G++  ++R + L + P    M  + ++W+PY S  + Y+WR +E
Subjt:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME

O94468 Alkylbase DNA glycosidase-like protein mag22.3e-2128.85Show/hide
Query:  LSSSDEICTAIEHLRRSD---PLLISLLDSCESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCG-GEATVLPAAVLALSPQQLRVIGVSGRKA
        +S   +   A +HL   D     L+  +  C       + P+  + ++I  Q+L+  A  SI N+F   C   +    P  ++    + L   G S  K+
Subjt:  LSSSDEICTAIEHLRRSD---PLLISLLDSCESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCG-GEATVLPAAVLALSPQQLRVIGVSGRKA

Query:  SYLHDLATKFV-EGILSNSSILEMDDETLLTTLTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAW
          +H +A   + + I S S I +M +E L+ +L+ +KG+  W++ M+ IFTL R D++P  D  ++   +  +GL   P+  E+EKL +  KPYR++ AW
Subjt:  SYLHDLATKFV-EGILSNSSILEMDDETLLTTLTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAW

Query:  YMWRLMEL
        Y+W++ +L
Subjt:  YMWRLMEL

P22134 DNA-3-methyladenine glycosylase1.0e-1330.21Show/hide
Query:  AIEHLRRSDPLLISLLDSCESPNF--KSNPP------FLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSP-------QQLRVIGVSGR
        A EH+   DP L  +L + E   +  ++  P      F+ L  +IL QQ++ +AAESI  R  +L GG     P   +            ++   G+S R
Subjt:  AIEHLRRSDPLLISLLDSCESPNF--KSNPP------FLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSP-------QQLRVIGVSGR

Query:  KASYLHDLATKFVEGILSNSSIL---EMDDETLLTTLTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK-ELPKPVE------------
        K  YL  LA  F E       +    + D+E + + +T VKGIG WS  MF+I  L R DV    DLG+ +G  +    K EL K +             
Subjt:  KASYLHDLATKFVEGILSNSSIL---EMDDETLLTTLTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK-ELPKPVE------------

Query:  -------------MEKLCEKWKPYRSMGAWYMWRL
                     MEK  E + PYRS+  + +WRL
Subjt:  -------------MEKLCEKWKPYRSMGAWYMWRL

Q92383 DNA-3-methyladenine glycosylase 12.4e-2332.02Show/hide
Query:  NFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILE-MDDETLLTTLTA
        + +   P+  L +++  QQL +KAA +I+NRF ++        P  +  +  + +R  G S RK   L  +A   + G++      E + +E L+  LT 
Subjt:  NFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILE-MDDETLLTTLTA

Query:  VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVK
        +KGIG W+V M +IF+L+R DV+P  DL +R G + L+ L ++P  + + K  E   P+R+  AWY+W+  +L    K
Subjt:  VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVK

Arabidopsis top hitse value%identityAlignment
AT1G19480.1 DNA glycosylase superfamily protein2.2e-7254.07Show/hide
Query:  STIPFRSTKIRKIS----------STQEPAKSQITNSADEDPTRPFPNPVDSVKS----------LSSSDEICTAIEHLRRSDPLLISLLDSCESPNFKS
        S IP R  KIRK++            ++ + SQ+ +    D   P    +  +++          L+   E+ TAI +LR +DPLL +L+D    P F+S
Subjt:  STIPFRSTKIRKIS----------STQEPAKSQITNSADEDPTRPFPNPVDSVKS----------LSSSDEICTAIEHLRRSDPLLISLLDSCESPNFKS

Query:  -NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGI
           PFLAL ++ILYQQLA KA  SIY RF +LCGGE  V+P  VL+L+PQQLR IGVSGRKASYLHDLA K+  GILS+S+IL MD+++L T LT V GI
Subjt:  -NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGI

Query:  GVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELK
        G WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL +LP+P +ME+ C KW+PYRS+G+WYMWRL+E K
Subjt:  GVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELK

AT1G19480.2 DNA glycosylase superfamily protein2.2e-7254.07Show/hide
Query:  STIPFRSTKIRKIS----------STQEPAKSQITNSADEDPTRPFPNPVDSVKS----------LSSSDEICTAIEHLRRSDPLLISLLDSCESPNFKS
        S IP R  KIRK++            ++ + SQ+ +    D   P    +  +++          L+   E+ TAI +LR +DPLL +L+D    P F+S
Subjt:  STIPFRSTKIRKIS----------STQEPAKSQITNSADEDPTRPFPNPVDSVKS----------LSSSDEICTAIEHLRRSDPLLISLLDSCESPNFKS

Query:  -NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGI
           PFLAL ++ILYQQLA KA  SIY RF +LCGGE  V+P  VL+L+PQQLR IGVSGRKASYLHDLA K+  GILS+S+IL MD+++L T LT V GI
Subjt:  -NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGI

Query:  GVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELK
        G WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL +LP+P +ME+ C KW+PYRS+G+WYMWRL+E K
Subjt:  GVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELK

AT1G75230.1 DNA glycosylase superfamily protein8.4e-7255.94Show/hide
Query:  IPFRSTKIRKISSTQEPAK--------SQITNS-----ADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPNFKS-NPPFLALT
        IP R  KIRK+S   + +         SQ+T +     +    +R    P    +SL+   E+  A+ HLR  DPLL SL+D    P F++   PFLAL 
Subjt:  IPFRSTKIRKISSTQEPAK--------SQITNS-----ADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPNFKS-NPPFLALT

Query:  KSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGIGVWSVHMFM
        +SILYQQLA KA  SIY RF ALCGGE  V+P  VL L+PQQLR IGVSGRKASYLHDLA K+  GILS+S I+ MD+++L T LT V GIG WSVHMFM
Subjt:  KSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGIGVWSVHMFM

Query:  IFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELK
        I +LHRPDVLPV DLGVRKGVQ L G+++LP+P +ME+LCEKW+PYRS+ +WY+WRL+E K
Subjt:  IFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELK

AT1G75230.2 DNA glycosylase superfamily protein8.4e-7255.94Show/hide
Query:  IPFRSTKIRKISSTQEPAK--------SQITNS-----ADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPNFKS-NPPFLALT
        IP R  KIRK+S   + +         SQ+T +     +    +R    P    +SL+   E+  A+ HLR  DPLL SL+D    P F++   PFLAL 
Subjt:  IPFRSTKIRKISSTQEPAK--------SQITNS-----ADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPNFKS-NPPFLALT

Query:  KSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGIGVWSVHMFM
        +SILYQQLA KA  SIY RF ALCGGE  V+P  VL L+PQQLR IGVSGRKASYLHDLA K+  GILS+S I+ MD+++L T LT V GIG WSVHMFM
Subjt:  KSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGIGVWSVHMFM

Query:  IFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELK
        I +LHRPDVLPV DLGVRKGVQ L G+++LP+P +ME+LCEKW+PYRS+ +WY+WRL+E K
Subjt:  IFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELK

AT3G50880.1 DNA glycosylase superfamily protein1.2e-7057.59Show/hide
Query:  TETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPNF--KSNPPFLALTKSI
        +E + SSS I FR  KIRK+SS   P +  IT S                  LS+   +  A+ HL+ SD LL +L+ +   P     SN PFL+L +SI
Subjt:  TETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPNF--KSNPPFLALTKSI

Query:  LYQQLATKAAESIYNRFAALC-GGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGIGVWSVHMFMIF
        LYQQLATKAA+ IY+RF +L  GGEA V+P +V++LS   LR IGVSGRKASYLHDLA K+  G+LS+  IL+M DE L+  LT VKGIGVW+VHMFMIF
Subjt:  LYQQLATKAAESIYNRFAALC-GGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGIGVWSVHMFMIF

Query:  TLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME
        +LHRPDVLPVGDLGVRKGV+ LYGLK LP P++ME+LCEKW+PYRS+G+WYMWRL+E
Subjt:  TLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAAAAGAACCCGCCGGAAGTCCCTGTCACAGTCTGAGTCTCAAACCGAGACCAATCCATCTTCCTCCACGATTCCGTTCCGATCCACAAAAATACGGAAGATTTC
CTCCACTCAAGAACCAGCCAAATCACAAATCACAAATTCCGCCGACGAAGACCCAACCCGACCGTTTCCGAACCCGGTTGATTCCGTCAAATCGTTATCTTCTTCGGATG
AAATTTGTACAGCGATCGAGCATTTACGCCGCTCGGATCCTCTCCTGATAAGTCTATTAGATTCATGCGAATCCCCCAATTTCAAATCGAATCCGCCATTTCTAGCCCTA
ACGAAGAGCATTCTCTACCAGCAGCTCGCGACGAAGGCCGCCGAATCGATCTACAATCGCTTCGCCGCACTATGCGGCGGCGAGGCGACGGTGCTGCCGGCCGCCGTGCT
GGCACTCTCGCCGCAACAGCTGCGTGTAATCGGAGTTTCGGGGAGGAAAGCAAGTTACCTCCATGACCTAGCGACAAAATTCGTCGAGGGCATTTTGTCGAATTCTTCGA
TTCTGGAGATGGACGATGAGACTCTGTTGACGACGTTGACGGCAGTGAAAGGAATCGGAGTTTGGTCGGTGCATATGTTCATGATCTTCACTCTACACCGGCCGGATGTT
TTGCCGGTGGGGGATTTGGGTGTCAGAAAAGGGGTGCAGAGATTGTACGGACTGAAGGAGTTGCCGAAGCCGGTGGAGATGGAGAAACTGTGTGAGAAATGGAAGCCTTA
CAGGTCGATGGGAGCTTGGTATATGTGGAGGCTAATGGAATTGAAGGGAATCGTGAAGGCAGATCCGATTTTTCGGATAAGGCAAAAAACAGAATTGACATTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCAAAAGAACCCGCCGGAAGTCCCTGTCACAGTCTGAGTCTCAAACCGAGACCAATCCATCTTCCTCCACGATTCCGTTCCGATCCACAAAAATACGGAAGATTTC
CTCCACTCAAGAACCAGCCAAATCACAAATCACAAATTCCGCCGACGAAGACCCAACCCGACCGTTTCCGAACCCGGTTGATTCCGTCAAATCGTTATCTTCTTCGGATG
AAATTTGTACAGCGATCGAGCATTTACGCCGCTCGGATCCTCTCCTGATAAGTCTATTAGATTCATGCGAATCCCCCAATTTCAAATCGAATCCGCCATTTCTAGCCCTA
ACGAAGAGCATTCTCTACCAGCAGCTCGCGACGAAGGCCGCCGAATCGATCTACAATCGCTTCGCCGCACTATGCGGCGGCGAGGCGACGGTGCTGCCGGCCGCCGTGCT
GGCACTCTCGCCGCAACAGCTGCGTGTAATCGGAGTTTCGGGGAGGAAAGCAAGTTACCTCCATGACCTAGCGACAAAATTCGTCGAGGGCATTTTGTCGAATTCTTCGA
TTCTGGAGATGGACGATGAGACTCTGTTGACGACGTTGACGGCAGTGAAAGGAATCGGAGTTTGGTCGGTGCATATGTTCATGATCTTCACTCTACACCGGCCGGATGTT
TTGCCGGTGGGGGATTTGGGTGTCAGAAAAGGGGTGCAGAGATTGTACGGACTGAAGGAGTTGCCGAAGCCGGTGGAGATGGAGAAACTGTGTGAGAAATGGAAGCCTTA
CAGGTCGATGGGAGCTTGGTATATGTGGAGGCTAATGGAATTGAAGGGAATCGTGAAGGCAGATCCGATTTTTCGGATAAGGCAAAAAACAGAATTGACATTGTAG
Protein sequenceShow/hide protein sequence
MAKRTRRKSLSQSESQTETNPSSSTIPFRSTKIRKISSTQEPAKSQITNSADEDPTRPFPNPVDSVKSLSSSDEICTAIEHLRRSDPLLISLLDSCESPNFKSNPPFLAL
TKSILYQQLATKAAESIYNRFAALCGGEATVLPAAVLALSPQQLRVIGVSGRKASYLHDLATKFVEGILSNSSILEMDDETLLTTLTAVKGIGVWSVHMFMIFTLHRPDV
LPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWYMWRLMELKGIVKADPIFRIRQKTELTL