; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10016455 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10016455
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDNA-3-methyladenine glycosylase 1-like
Genome locationChr03:5181322..5182155
RNA-Seq ExpressionHG10016455
SyntenyHG10016455
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143510.1 DNA-3-methyladenine glycosylase 1 [Cucumis sativus]8.1e-12785.87Show/hide
Query:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC
        MAKRIRRK L Q ES  DA PL PS+SSKIPF STKVRKIS+ QEP KPQ S   G +PTR FPNLA PVKSLSSSD++ TAI+HLRRSDPLLIS+LDSC
Subjt:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL
        E+PNFKSNPPFLALTKSILYQQLATKAAE+IYNRFASLCGGEAAV+PDTVLGLS QQLRVIGVSGRKASYLHDLATKFIEG LSNS ILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRS+GAW MWRL++ K
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK

XP_008440714.1 PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Cucumis melo]1.6e-13087.73Show/hide
Query:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC
        MAKRIRRKFL QSES   A PL PSSSSKIPFRSTKVRKIS+ QEPAKPQ S  +G +PTR FPNLA PVKSLSS DE+ TAI+HLRRSDPLLIS+LDSC
Subjt:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL
        ESP+FKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEA+V+PDTVLGLS QQLRV+GVSGRKASYLHDLATKFIEG LSNS ILEMDDE+LLG L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRS+GAW MWRLME KG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG

XP_022978525.1 DNA-3-methyladenine glycosylase 1-like [Cucurbita maxima]1.3e-12483.75Show/hide
Query:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC
        MA+R RRK L QSESQ +ADP      SKI FR+T++RKIS+ ++P KPQ ST  G D TRAFPN  GPVKSLSSSD + TAIDHLRRSDPLLI +LDSC
Subjt:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL
        ESPNFKSNPPFLA+TKSILYQQLATKAAESIYNRFASLCGGEAAV+PD VLGLS QQLRV+GVSGRKASYLHDLATKF+EG LSNSSILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG
        T VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAW MWRLME+KG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG

XP_023543059.1 DNA-3-methyladenine glycosylase 1 [Cucurbita pepo subsp. pepo]1.2e-12585.51Show/hide
Query:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC
        MA+R RRK L QSESQ +ADP      SKI FR+TK+RKIS+ Q+P KPQ ST  G D TRAFPN  GPVKSLSSSD +RTAIDHLRRSDPLLI +LDSC
Subjt:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL
        ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAV+PD VLGLS QQLRV+GVSGRKASYLHDLATKFIEG LSNSSILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCE WKPYRSMGAW MWRLME+K
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK

XP_038881017.1 DNA-3-methyladenine glycosylase 1-like [Benincasa hispida]2.5e-13690.97Show/hide
Query:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC
        MAKRIRRKFL QS+S+IDADPLPPSSSSKIPF STKVRKIS+KQEPAKPQ STS GNDPTRAF NLA P+KSLSSSDE+ TAIDHLRRSDPLLISIL+SC
Subjt:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL
        ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGE +V+PD VLGLS QQLRVIGVSGRKASYLHDLATKFIEG LSNS ILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG
        TAVKGIG+WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRSMGAW MWRLMEVKG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG

TrEMBL top hitse value%identityAlignment
A0A0A0KM62 ENDO3c domain-containing protein3.9e-12785.87Show/hide
Query:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC
        MAKRIRRK L Q ES  DA PL PS+SSKIPF STKVRKIS+ QEP KPQ S   G +PTR FPNLA PVKSLSSSD++ TAI+HLRRSDPLLIS+LDSC
Subjt:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL
        E+PNFKSNPPFLALTKSILYQQLATKAAE+IYNRFASLCGGEAAV+PDTVLGLS QQLRVIGVSGRKASYLHDLATKFIEG LSNS ILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRS+GAW MWRL++ K
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK

A0A1S3B2D5 probable DNA-3-methyladenine glycosylase 27.7e-13187.73Show/hide
Query:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC
        MAKRIRRKFL QSES   A PL PSSSSKIPFRSTKVRKIS+ QEPAKPQ S  +G +PTR FPNLA PVKSLSS DE+ TAI+HLRRSDPLLIS+LDSC
Subjt:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL
        ESP+FKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEA+V+PDTVLGLS QQLRV+GVSGRKASYLHDLATKFIEG LSNS ILEMDDE+LLG L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRS+GAW MWRLME KG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG

A0A5A7T3R0 Putative DNA-3-methyladenine glycosylase 27.7e-13187.73Show/hide
Query:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC
        MAKRIRRKFL QSES   A PL PSSSSKIPFRSTKVRKIS+ QEPAKPQ S  +G +PTR FPNLA PVKSLSS DE+ TAI+HLRRSDPLLIS+LDSC
Subjt:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL
        ESP+FKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEA+V+PDTVLGLS QQLRV+GVSGRKASYLHDLATKFIEG LSNS ILEMDDE+LLG L
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLCEKWKPYRS+GAW MWRLME KG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG

A0A6J1GD23 DNA-3-methyladenine glycosylase 1-like3.1e-12484.78Show/hide
Query:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC
        MA+R RRK L QSESQ DADP      S I FR+TK+RKIS+ Q+  KPQ ST  G D TRAFPN  GPVKSLSSSD + TAIDHLRRSDPLLI +LDSC
Subjt:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL
        ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGG+AAV+PD VLGLS QQLRV+GVSGRKASYLHDLATKFIEG LSNSSILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAW MWRLME+K
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK

A0A6J1IQD1 DNA-3-methyladenine glycosylase 1-like6.3e-12583.75Show/hide
Query:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC
        MA+R RRK L QSESQ +ADP      SKI FR+T++RKIS+ ++P KPQ ST  G D TRAFPN  GPVKSLSSSD + TAIDHLRRSDPLLI +LDSC
Subjt:  MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL
        ESPNFKSNPPFLA+TKSILYQQLATKAAESIYNRFASLCGGEAAV+PD VLGLS QQLRV+GVSGRKASYLHDLATKF+EG LSNSSILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG
        T VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAW MWRLME+KG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG

SwissProt top hitse value%identityAlignment
O31544 Putative DNA-3-methyladenine glycosylase YfjP1.5e-1932.53Show/hide
Query:  LTKSILYQQLATKAAESIYNRFASLCGGEAAVV-----PDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAVKGIGV
        + K I++QQL    A ++  RF    G +   V     P+T+  L  Q LR +  S RKA Y  D +    EG LS S +  M DE ++  L  ++GIG 
Subjt:  LTKSILYQQLATKAAESIYNRFASLCGGEAAVV-----PDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAVKGIGV

Query:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLME
        W+V   ++F L RP++ P+ D+G++  ++R + L + P    M  + ++W+PY S  +  +WR +E
Subjt:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLME

O94468 Alkylbase DNA glycosidase-like protein mag22.8e-2128.29Show/hide
Query:  LSSSDEVRTAIDHLRRSD---PLLISILDSCESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCG-GEAAVVPDTVLGLSAQQLRVIGVSGRKA
        +S   + + A  HL   D     L+  +  C       + P+  + ++I  Q+L+  A  SI N+F + C   +    P  ++    + L   G S  K+
Subjt:  LSSSDEVRTAIDHLRRSD---PLLISILDSCESPNFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCG-GEAAVVPDTVLGLSAQQLRVIGVSGRKA

Query:  SYLHDLATKFI-EGILSNSSILEMDDESLLGALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAW
          +H +A   + + I S S I +M +E L+ +L+ +KG+  W++ M+ IFTL R D++P  D  ++   +  +GL   P+  E+EKL +  KPYR++ AW
Subjt:  SYLHDLATKFI-EGILSNSSILEMDDESLLGALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAW

Query:  LMWRL
         +W++
Subjt:  LMWRL

P22134 DNA-3-methyladenine glycosylase2.3e-1530.52Show/hide
Query:  LAGPVKSLSSSDE-VRTAIDHLRRSDPLLISILDSCESPNF--KSNPP------FLALTKSILYQQLATKAAESIYNRFASLCGGE----AAVVPDTVLG
        +A P K ++  +E    A +H+   DP L  IL + E   +  ++  P      F+ L  +IL QQ++ +AAESI  R  SL GG       +  D    
Subjt:  LAGPVKSLSSSDE-VRTAIDHLRRSDPLLISILDSCESPNF--KSNPP------FLALTKSILYQQLATKAAESIYNRFASLCGGE----AAVVPDTVLG

Query:  LSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSIL---EMDDESLLGALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK-ELPKP
            ++   G+S RK  YL  LA  F E       +    + D+E +   +T VKGIG WS  MF+I  L R DV    DLG+ +G  +    K EL K 
Subjt:  LSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSIL---EMDDESLLGALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK-ELPKP

Query:  VE-------------------------MEKLCEKWKPYRSMGAWLMWRL
        +                          MEK  E + PYRS+  +++WRL
Subjt:  VE-------------------------MEKLCEKWKPYRSMGAWLMWRL

Q92383 DNA-3-methyladenine glycosylase 12.3e-2333.14Show/hide
Query:  NFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILE-MDDESLLGALTA
        + +   P+  L +++  QQL +KAA +I+NRF S+        P+ +  +  + +R  G S RK   L  +A   I G++      E + +E L+  LT 
Subjt:  NFKSNPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILE-MDDESLLGALTA

Query:  VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWR
        +KGIG W+V M +IF+L+R DV+P  DL +R G + L+ L ++P  + + K  E   P+R+  AW +W+
Subjt:  VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWR

Arabidopsis top hitse value%identityAlignment
AT1G19480.1 DNA glycosylase superfamily protein4.7e-7255.31Show/hide
Query:  SSSSKIPFRSTKVRK------ISTKQEPAKPQSST-------SNGNDP-------TRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSCESPN
        S  SKIP R  K+RK      +S +   A+  SS+       ++G  P        RA        + L+   E+ TAI +LR +DPLL +++D    P 
Subjt:  SSSSKIPFRSTKVRK------ISTKQEPAKPQSST-------SNGNDP-------TRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSCESPN

Query:  FKS-NPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAV
        F+S   PFLAL ++ILYQQLA KA  SIY RF SLCGGE  VVP+TVL L+ QQLR IGVSGRKASYLHDLA K+  GILS+S+IL MD++SL   LT V
Subjt:  FKS-NPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAV

Query:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK
         GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL +LP+P +ME+ C KW+PYRS+G+W MWRL+E K
Subjt:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK

AT1G19480.2 DNA glycosylase superfamily protein4.7e-7255.31Show/hide
Query:  SSSSKIPFRSTKVRK------ISTKQEPAKPQSST-------SNGNDP-------TRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSCESPN
        S  SKIP R  K+RK      +S +   A+  SS+       ++G  P        RA        + L+   E+ TAI +LR +DPLL +++D    P 
Subjt:  SSSSKIPFRSTKVRK------ISTKQEPAKPQSST-------SNGNDP-------TRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSCESPN

Query:  FKS-NPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAV
        F+S   PFLAL ++ILYQQLA KA  SIY RF SLCGGE  VVP+TVL L+ QQLR IGVSGRKASYLHDLA K+  GILS+S+IL MD++SL   LT V
Subjt:  FKS-NPPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAV

Query:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK
         GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL +LP+P +ME+ C KW+PYRS+G+W MWRL+E K
Subjt:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK

AT1G75230.1 DNA glycosylase superfamily protein1.8e-7153.36Show/hide
Query:  SSSSKIPFRSTKVRKIS---------------TKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSCESPNFKS-N
        S  +KIP R  K+RK+S               ++    KP + +      T   P +    +SL+   E+  A+ HLR  DPLL S++D    P F++  
Subjt:  SSSSKIPFRSTKVRKIS---------------TKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSCESPNFKS-N

Query:  PPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAVKGIGV
         PFLAL +SILYQQLA KA  SIY RF +LCGGE  VVP+ VL L+ QQLR IGVSGRKASYLHDLA K+  GILS+S I+ MD++SL   LT V GIG 
Subjt:  PPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAVKGIGV

Query:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK
        WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+P +ME+LCEKW+PYRS+ +W +WRL+E K
Subjt:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK

AT1G75230.2 DNA glycosylase superfamily protein1.8e-7153.36Show/hide
Query:  SSSSKIPFRSTKVRKIS---------------TKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSCESPNFKS-N
        S  +KIP R  K+RK+S               ++    KP + +      T   P +    +SL+   E+  A+ HLR  DPLL S++D    P F++  
Subjt:  SSSSKIPFRSTKVRKIS---------------TKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSCESPNFKS-N

Query:  PPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAVKGIGV
         PFLAL +SILYQQLA KA  SIY RF +LCGGE  VVP+ VL L+ QQLR IGVSGRKASYLHDLA K+  GILS+S I+ MD++SL   LT V GIG 
Subjt:  PPFLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAVKGIGV

Query:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK
        WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+P +ME+LCEKW+PYRS+ +W +WRL+E K
Subjt:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVK

AT3G50880.1 DNA glycosylase superfamily protein2.7e-7258.33Show/hide
Query:  SSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSCESPNF--KSNPPFLALTKSILYQQL
        SSS+I FR  K+RK+S               +DP+      A P   LS+   V  A+ HL+ SD LL +++ +   P     SN PFL+L +SILYQQL
Subjt:  SSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSCESPNF--KSNPPFLALTKSILYQQL

Query:  ATKAAESIYNRFASLC-GGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAVKGIGVWSVHMFMIFTLHRP
        ATKAA+ IY+RF SL  GGEA VVP++V+ LSA  LR IGVSGRKASYLHDLA K+  G+LS+  IL+M DE L+  LT VKGIGVW+VHMFMIF+LHRP
Subjt:  ATKAAESIYNRFASLC-GGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAVKGIGVWSVHMFMIFTLHRP

Query:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLME
        DVLPVGDLGVRKGV+ LYGLK LP P++ME+LCEKW+PYRS+G+W MWRL+E
Subjt:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAAAAGAATCCGCCGGAAGTTCCTCTCACAGTCCGAGTCTCAAATCGACGCCGATCCTCTCCCTCCGTCGTCTTCCTCCAAGATTCCGTTCCGATCCACTAAAGT
ACGGAAGATTTCCACCAAACAAGAACCAGCCAAACCACAAAGTTCAACTTCCAACGGAAATGACCCGACCCGAGCATTCCCGAACCTGGCCGGTCCGGTCAAATCGTTAT
CGTCTTCGGATGAAGTTCGTACAGCGATCGATCATTTACGCCGTTCGGATCCTCTCCTGATAAGTATATTAGATTCGTGCGAATCCCCCAATTTCAAGTCCAATCCACCG
TTTCTAGCACTAACGAAGAGCATTCTCTACCAACAGCTCGCTACGAAGGCCGCCGAATCGATCTACAATCGCTTCGCCTCACTATGCGGCGGCGAGGCAGCGGTGGTGCC
GGACACCGTGCTTGGACTCTCGGCGCAACAGCTGCGAGTAATTGGAGTTTCGGGGAGGAAAGCAAGTTACCTCCATGACCTAGCGACCAAATTCATAGAGGGGATTTTAT
CAAATTCATCGATTCTGGAGATGGACGACGAGAGTCTGTTGGGGGCCTTGACGGCGGTGAAGGGAATCGGCGTTTGGTCGGTGCATATGTTCATGATATTTACTCTGCAC
CGGCCGGATGTGCTGCCGGTGGGGGATTTGGGGGTAAGAAAAGGGGTGCAGAGATTGTACGGATTGAAGGAATTGCCGAAGCCGGTGGAGATGGAAAAACTGTGTGAGAA
ATGGAAGCCTTACAGGTCGATGGGGGCTTGGCTTATGTGGAGGCTAATGGAAGTGAAGGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCAAAAGAATCCGCCGGAAGTTCCTCTCACAGTCCGAGTCTCAAATCGACGCCGATCCTCTCCCTCCGTCGTCTTCCTCCAAGATTCCGTTCCGATCCACTAAAGT
ACGGAAGATTTCCACCAAACAAGAACCAGCCAAACCACAAAGTTCAACTTCCAACGGAAATGACCCGACCCGAGCATTCCCGAACCTGGCCGGTCCGGTCAAATCGTTAT
CGTCTTCGGATGAAGTTCGTACAGCGATCGATCATTTACGCCGTTCGGATCCTCTCCTGATAAGTATATTAGATTCGTGCGAATCCCCCAATTTCAAGTCCAATCCACCG
TTTCTAGCACTAACGAAGAGCATTCTCTACCAACAGCTCGCTACGAAGGCCGCCGAATCGATCTACAATCGCTTCGCCTCACTATGCGGCGGCGAGGCAGCGGTGGTGCC
GGACACCGTGCTTGGACTCTCGGCGCAACAGCTGCGAGTAATTGGAGTTTCGGGGAGGAAAGCAAGTTACCTCCATGACCTAGCGACCAAATTCATAGAGGGGATTTTAT
CAAATTCATCGATTCTGGAGATGGACGACGAGAGTCTGTTGGGGGCCTTGACGGCGGTGAAGGGAATCGGCGTTTGGTCGGTGCATATGTTCATGATATTTACTCTGCAC
CGGCCGGATGTGCTGCCGGTGGGGGATTTGGGGGTAAGAAAAGGGGTGCAGAGATTGTACGGATTGAAGGAATTGCCGAAGCCGGTGGAGATGGAAAAACTGTGTGAGAA
ATGGAAGCCTTACAGGTCGATGGGGGCTTGGCTTATGTGGAGGCTAATGGAAGTGAAGGGGTAG
Protein sequenceShow/hide protein sequence
MAKRIRRKFLSQSESQIDADPLPPSSSSKIPFRSTKVRKISTKQEPAKPQSSTSNGNDPTRAFPNLAGPVKSLSSSDEVRTAIDHLRRSDPLLISILDSCESPNFKSNPP
FLALTKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSAQQLRVIGVSGRKASYLHDLATKFIEGILSNSSILEMDDESLLGALTAVKGIGVWSVHMFMIFTLH
RPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCEKWKPYRSMGAWLMWRLMEVKG