CuGenDBv2

Clc01G19750 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G19750
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: DNA-3-methyladenine glycosylase 1-like
Genome location: ClcChr01:31901903..31902866
RNA-Seq Expression: Clc01G19750
Synteny: Clc01G19750
Gene Ontology terms:
GO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domains:
IPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004143510.1 DNA-3-methyladenine glycosylase 1 [Cucumis sativus] | e-value: 1.1e-126 | identity: 85.51%
Query:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC
        MAKRI RK L Q ES  DA PL PS+SSKIPF STKVRKISS QEP KPQI+  GG +PTR FPNLA  VKSLSSSD+I TAI+HLRRSDPLLIS+LDSC
Subjt:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL
        E+PNFKSNPPFLAL KSILYQQLATKAAE+IYNRFASLCGGEAAV+PDTVLGLSPQQLRVIGVSGRKASYLHDLATKFI+G LSNS ILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EM KLCEKWKPYRS+GAW MWRL++ K
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK

XP_008440714.1 PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Cucumis melo] | e-value: 6.0e-128 | identity: 86.28%
Query:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC
        MAKRI RKFL QSES   A PL PSSSSKIPFRSTKVRKISS QEPAKPQ +   G +PTR FPNLA  VKSLSS DEI TAI+HLRRSDPLLIS+LDSC
Subjt:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL
        ESP+FKSNPPFLAL KSILYQQLATKAAESIYNRFASLCGGEA+V+PDTVLGLSPQQLRV+GVSGRKASYLHDLATKFI+G LSNS ILEMDDE+LL  L
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EM KLCEKWKPYRS+GAW MWRLME KG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG

XP_022949777.1 DNA-3-methyladenine glycosylase 1-like [Cucurbita moschata] | e-value: 4.3e-126 | identity: 85.51%
Query:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC
        MA+R  RK LLQSESQ DADP      S I FR+TK+RKISS Q+  KPQI+T GG D TRAFPN  G VKSLSSSD ICTAIDHLRRSDPLLI +LDSC
Subjt:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL
        ESPNFKSNPPFLAL KSILYQQLATKAAESIYNRFASLCGG+AAV+PD VLGLSPQQLRV+GVSGRKASYLHDLATKFI+G LSNSSILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEM KLCEKWKPYRSMGAW MWRLMEMK
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK

XP_022978525.1 DNA-3-methyladenine glycosylase 1-like [Cucurbita maxima] | e-value: 1.1e-126 | identity: 84.48%
Query:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC
        MA+R  RK LLQSESQ +ADP      SKI FR+T++RKISS ++P KPQI+T GG D TRAFPN  G VKSLSSSD ICTAIDHLRRSDPLLI +LDSC
Subjt:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL
        ESPNFKSNPPFLA+ KSILYQQLATKAAESIYNRFASLCGGEAAV+PD VLGLSPQQLRV+GVSGRKASYLHDLATKF++G LSNSSILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG
        T VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEM KLCEKWKPYRSMGAW MWRLMEMKG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG

XP_038881017.1 DNA-3-methyladenine glycosylase 1-like [Benincasa hispida] | e-value: 9.2e-137 | identity: 90.61%
Query:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC
        MAKRI RKFL+QS+S+IDADPLPPSSSSKIPF STKVRKISSKQEPAKPQI+TSGGNDPTRAF NLA  +KSLSSSDEI TAIDHLRRSDPLLISIL+SC
Subjt:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL
        ESPNFKSNPPFLAL KSILYQQLATKAAESIYNRFASLCGGE +V+PD VLGLSPQQLRVIGVSGRKASYLHDLATKFI+G LSNS ILEMDDE+LLEAL
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG
        TAVKGIG+WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EM KLCEKWKPYRSMGAW MWRLME+KG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KM62 ENDO3c domain-containing protein | e-value: 5.5e-127 | identity: 85.51%
Query:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC
        MAKRI RK L Q ES  DA PL PS+SSKIPF STKVRKISS QEP KPQI+  GG +PTR FPNLA  VKSLSSSD+I TAI+HLRRSDPLLIS+LDSC
Subjt:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL
        E+PNFKSNPPFLAL KSILYQQLATKAAE+IYNRFASLCGGEAAV+PDTVLGLSPQQLRVIGVSGRKASYLHDLATKFI+G LSNS ILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EM KLCEKWKPYRS+GAW MWRL++ K
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK

A0A1S3B2D5 probable DNA-3-methyladenine glycosylase 2 | e-value: 2.9e-128 | identity: 86.28%
Query:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC
        MAKRI RKFL QSES   A PL PSSSSKIPFRSTKVRKISS QEPAKPQ +   G +PTR FPNLA  VKSLSS DEI TAI+HLRRSDPLLIS+LDSC
Subjt:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL
        ESP+FKSNPPFLAL KSILYQQLATKAAESIYNRFASLCGGEA+V+PDTVLGLSPQQLRV+GVSGRKASYLHDLATKFI+G LSNS ILEMDDE+LL  L
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EM KLCEKWKPYRS+GAW MWRLME KG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG

A0A5A7T3R0 Putative DNA-3-methyladenine glycosylase 2 | e-value: 2.9e-128 | identity: 86.28%
Query:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC
        MAKRI RKFL QSES   A PL PSSSSKIPFRSTKVRKISS QEPAKPQ +   G +PTR FPNLA  VKSLSS DEI TAI+HLRRSDPLLIS+LDSC
Subjt:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL
        ESP+FKSNPPFLAL KSILYQQLATKAAESIYNRFASLCGGEA+V+PDTVLGLSPQQLRV+GVSGRKASYLHDLATKFI+G LSNS ILEMDDE+LL  L
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKP EM KLCEKWKPYRS+GAW MWRLME KG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG

A0A6J1GD23 DNA-3-methyladenine glycosylase 1-like | e-value: 2.1e-126 | identity: 85.51%
Query:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC
        MA+R  RK LLQSESQ DADP      S I FR+TK+RKISS Q+  KPQI+T GG D TRAFPN  G VKSLSSSD ICTAIDHLRRSDPLLI +LDSC
Subjt:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL
        ESPNFKSNPPFLAL KSILYQQLATKAAESIYNRFASLCGG+AAV+PD VLGLSPQQLRV+GVSGRKASYLHDLATKFI+G LSNSSILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK
        TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEM KLCEKWKPYRSMGAW MWRLMEMK
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMK

A0A6J1IQD1 DNA-3-methyladenine glycosylase 1-like | e-value: 5.5e-127 | identity: 84.48%
Query:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC
        MA+R  RK LLQSESQ +ADP      SKI FR+T++RKISS ++P KPQI+T GG D TRAFPN  G VKSLSSSD ICTAIDHLRRSDPLLI +LDSC
Subjt:  MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSC

Query:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL
        ESPNFKSNPPFLA+ KSILYQQLATKAAESIYNRFASLCGGEAAV+PD VLGLSPQQLRV+GVSGRKASYLHDLATKF++G LSNSSILEMDDE+LL AL
Subjt:  ESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEAL

Query:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG
        T VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEM KLCEKWKPYRSMGAW MWRLMEMKG
Subjt:  TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKG

SwissProt top hits (e-value, % identity, alignment)
O31544 Putative DNA-3-methyladenine glycosylase YfjP | e-value: 8.1e-19 | identity: 25.1%
Query:  KIPFRSTK----VRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILYQQLA
        ++P R+      + K+ +     +P+   SG  D       +    +     + +   +DH  ++  L     +   +P       +  + K I++QQL 
Subjt:  KIPFRSTK----VRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPPFLALAKSILYQQLA

Query:  TKAAESIYNRFASLCGGEAAVV-----PDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTL
           A ++  RF    G +   V     P+T+  L  Q LR +  S RKA Y  D +    +G LS S +  M DE +++ L  ++GIG W+V   ++F L
Subjt:  TKAAESIYNRFASLCGGEAAVV-----PDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTL

Query:  HRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLME
         RP++ P+ D+G++  ++R + L + P    M  + ++W+PY S  +  +WR +E
Subjt:  HRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLME

O94468 Alkylbase DNA glycosidase-like protein mag2 | e-value: 1.1e-20 | identity: 28.65%
Query:  LISILDSCESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCG-GEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID-GILSNSSILE
        L+  +  C       + P+  + ++I  Q+L+  A  SI N+F + C   +    P  ++    + L   G S  K+  +H +A   ++  I S S I +
Subjt:  LISILDSCESPNFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCG-GEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFID-GILSNSSILE

Query:  MDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEM
        M +E L+E+L+ +KG+  W++ M+ IFTL R D++P  D  ++   +  +GL   P+  E+ KL +  KPYR++ AW +W++ ++
Subjt:  MDDESLLEALTAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEM

P22134 DNA-3-methyladenine glycosylase | e-value: 4.6e-14 | identity: 31.47%
Query:  AIDHLRRSDPLLISILDSCESPNF--KSNPP------FLALAKSILYQQLATKAAESIYNRFASLCGGE----AAVVPDTVLGLSPQQLRVIGVSGRKAS
        A +H+   DP L  IL + E   +  ++  P      F+ LA +IL QQ++ +AAESI  R  SL GG       +  D        ++   G+S RK  
Subjt:  AIDHLRRSDPLLISILDSCESPNF--KSNPP------FLALAKSILYQQLATKAAESIYNRFASLCGGE----AAVVPDTVLGLSPQQLRVIGVSGRKAS

Query:  YLHDLATKFIDGILSNSSIL--EMDDESLLEAL-TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK-ELPKPVE---------------
        YL  LA  F +       +   + +DE ++E+L T VKGIG WS  MF+I  L R DV    DLG+ +G  +    K EL K +                
Subjt:  YLHDLATKFIDGILSNSSIL--EMDDESLLEAL-TAVKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLK-ELPKPVE---------------

Query:  ----------MGKLCEKWKPYRSMGAWCMWRL
                  M K  E + PYRS+  + +WRL
Subjt:  ----------MGKLCEKWKPYRSMGAWCMWRL

Q92383 DNA-3-methyladenine glycosylase 1 | e-value: 4.9e-24 | identity: 32.95%
Query:  NFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILE-MDDESLLEALTA
        + +   P+  L +++  QQL +KAA +I+NRF S+        P+ +  +  + +R  G S RK   L  +A   I G++      E + +E L+E LT 
Subjt:  NFKSNPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILE-MDDESLLEALTA

Query:  VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEM
        +KGIG W+V M +IF+L+R DV+P  DL +R G + L+ L ++P  + + K  E   P+R+  AW +W+  ++
Subjt:  VKGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEM

Arabidopsis top hits (e-value, % identity, alignment)
AT1G19480.1 DNA glycosylase superfamily protein | e-value: 2.0e-73 | identity: 54.91%
Query:  SSSSKIPFRSTKVRKIS-----SKQEPAKPQITTSGGNDP---------------TRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPN
        S  SKIP R  K+RK++     S ++     I++S  N P                RA        + L+   E+ TAI +LR +DPLL +++D    P 
Subjt:  SSSSKIPFRSTKVRKIS-----SKQEPAKPQITTSGGNDP---------------TRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPN

Query:  FKS-NPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAV
        F+S   PFLAL ++ILYQQLA KA  SIY RF SLCGGE  VVP+TVL L+PQQLR IGVSGRKASYLHDLA K+ +GILS+S+IL MD++SL   LT V
Subjt:  FKS-NPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAV

Query:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT
         GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL +LP+P +M + C KW+PYRS+G+W MWRL+E K T
Subjt:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT

AT1G19480.2 DNA glycosylase superfamily protein | e-value: 2.0e-73 | identity: 54.91%
Query:  SSSSKIPFRSTKVRKIS-----SKQEPAKPQITTSGGNDP---------------TRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPN
        S  SKIP R  K+RK++     S ++     I++S  N P                RA        + L+   E+ TAI +LR +DPLL +++D    P 
Subjt:  SSSSKIPFRSTKVRKIS-----SKQEPAKPQITTSGGNDP---------------TRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPN

Query:  FKS-NPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAV
        F+S   PFLAL ++ILYQQLA KA  SIY RF SLCGGE  VVP+TVL L+PQQLR IGVSGRKASYLHDLA K+ +GILS+S+IL MD++SL   LT V
Subjt:  FKS-NPPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAV

Query:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT
         GIG WSVHMFMI +LHRPDVLPV DLGVRKGVQ LYGL +LP+P +M + C KW+PYRS+G+W MWRL+E K T
Subjt:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT

AT1G75230.1 DNA glycosylase superfamily protein | e-value: 1.7e-72 | identity: 53.7%
Query:  SSSSKIPFRSTKVRKIS---------------SKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKS-N
        S  +KIP R  K+RK+S               S+    KP   +      T   P +    +SL+   E+  A+ HLR  DPLL S++D    P F++  
Subjt:  SSSSKIPFRSTKVRKIS---------------SKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKS-N

Query:  PPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAVKGIGV
         PFLAL +SILYQQLA KA  SIY RF +LCGGE  VVP+ VL L+PQQLR IGVSGRKASYLHDLA K+ +GILS+S I+ MD++SL   LT V GIG 
Subjt:  PPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAVKGIGV

Query:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT
        WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+P +M +LCEKW+PYRS+ +W +WRL+E K T
Subjt:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT

AT1G75230.2 DNA glycosylase superfamily protein | e-value: 1.7e-72 | identity: 53.7%
Query:  SSSSKIPFRSTKVRKIS---------------SKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKS-N
        S  +KIP R  K+RK+S               S+    KP   +      T   P +    +SL+   E+  A+ HLR  DPLL S++D    P F++  
Subjt:  SSSSKIPFRSTKVRKIS---------------SKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKS-N

Query:  PPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAVKGIGV
         PFLAL +SILYQQLA KA  SIY RF +LCGGE  VVP+ VL L+PQQLR IGVSGRKASYLHDLA K+ +GILS+S I+ MD++SL   LT V GIG 
Subjt:  PPFLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAVKGIGV

Query:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT
        WSVHMFMI +LHRPDVLPV DLGVRKGVQ L G+++LP+P +M +LCEKW+PYRS+ +W +WRL+E K T
Subjt:  WSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT

AT3G50880.1 DNA glycosylase superfamily protein | e-value: 5.0e-72 | identity: 55.64%
Query:  LQSESQIDADPLPPS----SSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNF-
        L  +S I A  L  S    SSS+I FR  K+RK+SS   P +  IT S                  LS+   +  A+ HL+ SD LL +++ +   P   
Subjt:  LQSESQIDADPLPPS----SSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNF-

Query:  -KSNPPFLALAKSILYQQLATKAAESIYNRFASLC-GGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAV
          SN PFL+LA+SILYQQLATKAA+ IY+RF SL  GGEA VVP++V+ LS   LR IGVSGRKASYLHDLA K+ +G+LS+  IL+M DE L++ LT V
Subjt:  -KSNPPFLALAKSILYQQLATKAAESIYNRFASLC-GGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAV

Query:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT
        KGIGVW+VHMFMIF+LHRPDVLPVGDLGVRKGV+ LYGLK LP P++M +LCEKW+PYRS+G+W MWRL+E + T
Subjt:  KGIGVWSVHMFMIFTLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGT


Sequences
CDS sequence
ATGGCCAAAAGAATCTCCCGGAAGTTCCTCTTACAGTCCGAGTCACAAATCGACGCCGATCCTCTTCCTCCGTCGTCTTCCTCCAAGATTCCGTTCCGATCTACAAAAGT
ACGGAAGATTTCCTCCAAACAAGAACCGGCCAAACCACAAATTACAACTTCCGGCGGAAATGACCCGACACGAGCATTTCCGAACCTGGCCGGTACCGTCAAATCATTAT
CGTCTTCGGATGAAATTTGTACAGCGATCGATCATTTACGCCGTTCGGATCCTCTCCTGATAAGCATATTAGATTCATGCGAATCCCCCAATTTCAAGTCCAATCCACCG
TTTCTAGCACTAGCGAAGAGCATCCTCTACCAACAGCTCGCCACGAAGGCCGCCGAATCGATCTACAATCGTTTTGCCTCGCTATGCGGCGGCGAGGCGGCGGTGGTGCC
GGACACCGTGCTCGGACTCTCGCCGCAACAGCTGCGAGTAATCGGAGTTTCGGGGAGGAAAGCAAGTTACCTCCATGACCTTGCGACCAAATTCATAGATGGGATTTTAT
CAAATTCATCAATTCTAGAGATGGACGACGAGAGTCTGTTGGAGGCCTTGACGGCAGTGAAGGGAATCGGCGTCTGGTCGGTGCATATGTTCATGATATTTACTCTGCAC
CGGCCGGATGTGCTGCCGGTGGGGGATTTGGGGGTGAGAAAAGGGGTGCAGAGATTGTACGGATTGAAGGAACTGCCGAAGCCGGTGGAGATGGGGAAACTGTGTGAGAA
ATGGAAGCCTTACAGGTCGATGGGGGCTTGGTGTATGTGGAGGTTAATGGAAATGAAGGGAACTTGTTCTTGGACCATGGCTGGATCTCACCTTGCAATTGCAATTGCTG
ACTAA
mRNA sequence
ATGGCCAAAAGAATCTCCCGGAAGTTCCTCTTACAGTCCGAGTCACAAATCGACGCCGATCCTCTTCCTCCGTCGTCTTCCTCCAAGATTCCGTTCCGATCTACAAAAGT
ACGGAAGATTTCCTCCAAACAAGAACCGGCCAAACCACAAATTACAACTTCCGGCGGAAATGACCCGACACGAGCATTTCCGAACCTGGCCGGTACCGTCAAATCATTAT
CGTCTTCGGATGAAATTTGTACAGCGATCGATCATTTACGCCGTTCGGATCCTCTCCTGATAAGCATATTAGATTCATGCGAATCCCCCAATTTCAAGTCCAATCCACCG
TTTCTAGCACTAGCGAAGAGCATCCTCTACCAACAGCTCGCCACGAAGGCCGCCGAATCGATCTACAATCGTTTTGCCTCGCTATGCGGCGGCGAGGCGGCGGTGGTGCC
GGACACCGTGCTCGGACTCTCGCCGCAACAGCTGCGAGTAATCGGAGTTTCGGGGAGGAAAGCAAGTTACCTCCATGACCTTGCGACCAAATTCATAGATGGGATTTTAT
CAAATTCATCAATTCTAGAGATGGACGACGAGAGTCTGTTGGAGGCCTTGACGGCAGTGAAGGGAATCGGCGTCTGGTCGGTGCATATGTTCATGATATTTACTCTGCAC
CGGCCGGATGTGCTGCCGGTGGGGGATTTGGGGGTGAGAAAAGGGGTGCAGAGATTGTACGGATTGAAGGAACTGCCGAAGCCGGTGGAGATGGGGAAACTGTGTGAGAA
ATGGAAGCCTTACAGGTCGATGGGGGCTTGGTGTATGTGGAGGTTAATGGAAATGAAGGGAACTTGTTCTTGGACCATGGCTGGATCTCACCTTGCAATTGCAATTGCTG
ACTAA
Protein sequence
MAKRISRKFLLQSESQIDADPLPPSSSSKIPFRSTKVRKISSKQEPAKPQITTSGGNDPTRAFPNLAGTVKSLSSSDEICTAIDHLRRSDPLLISILDSCESPNFKSNPP
FLALAKSILYQQLATKAAESIYNRFASLCGGEAAVVPDTVLGLSPQQLRVIGVSGRKASYLHDLATKFIDGILSNSSILEMDDESLLEALTAVKGIGVWSVHMFMIFTLH
RPDVLPVGDLGVRKGVQRLYGLKELPKPVEMGKLCEKWKPYRSMGAWCMWRLMEMKGTCSWTMAGSHLAIAIAD