CuGenDBv2

MC01g1112 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g1112
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: DNA-3-methyladenine glycosylase 1-like
Genome location: MC01:16496776..16497567
RNA-Seq Expression: MC01g1112
Synteny: MC01g1112
Gene Ontology terms:
GO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domains:
IPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology
GenBank top hits | e value | % identity
KAG7034142.1 mag1, partial [Cucurbita argyrosperma subsp. argyrosperma] | 2.56e-138 | 76.09
Query:  SQSETLEIP-----FRSTKIRKMTV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKS
        S+S+T + P     FR+TKIRK++      KP I+  G  D TR+  N   PVKSLSS+D ICTAIDHLRRSDPLLI LLDSCE+P FKSNPPFLALTKS
Subjt:  SQSETLEIP-----FRSTKIRKMTV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKS

Query:  ILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIF
        ILYQQLATKAAESIYNRFA+LCGG+AAV+P AVLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ SSILEMDDETL+  LTAVKGIGVWSV MFMIF
Subjt:  ILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIF

Query:  SLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG
        +LHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLC+KWKPYRSMGAWYMWRLME+KEIV    DL   +   G
Subjt:  SLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG

XP_004143510.1 DNA-3-methyladenine glycosylase 1 [Cucumis sativus] | 3.92e-137 | 77.86
Query:  SQSETLEIPFRSTKIRKMT-----VKPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQ
        S S + +IPF STK+RK++      KP I+  G  +PTR   N A PVKSLSS+D+I TAI+HLRRSDPLLI+LLDSCETP FKSNPPFLALTKSILYQQ
Subjt:  SQSETLEIPFRSTKIRKMT-----VKPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQ

Query:  LATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRP
        LATKAAE+IYNRFA+LCGGEAAV+P  VLGLS QQLRVIGVSGRKASYLHDLA KF+EG L+ S ILEMDDETL+R LTAVKGIGVWSV MFMIF+LHRP
Subjt:  LATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRP

Query:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
        DVLPVGDLGVRKGVQRLYGLKELPKP EMEKLC+KWKPYRS+GAWYMWRL++ KEIV  G D
Subjt:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

XP_008440714.1 PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Cucumis melo] | 5.66e-139 | 76.01
Query:  SQSETLEIPFRSTKIRKMT-----VKPPIAGEN--DPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQ
        S S + +IPFRSTK+RK++      KP  +  +  +PTR+  N A PVKSLSS DEI TAI+HLRRSDPLLI+LLDSCE+P FKSNPPFLALTKSILYQQ
Subjt:  SQSETLEIPFRSTKIRKMT-----VKPPIAGEN--DPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQ

Query:  LATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRP
        LATKAAESIYNRFA+LCGGEA+V+P  VLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ S ILEMDDETL+  LTAVKGIGVWSV MFMIF+LHRP
Subjt:  LATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRP

Query:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG
        DVLPVGDLGVRKGVQRLYGLKELPKP EMEKLC+KWKPYRS+GAWYMWRLME K +V KG DL    E RG
Subjt:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG

XP_022132520.1 DNA-3-methyladenine glycosylase 1-like [Momordica charantia] | 1.84e-187 | 100
Query:  SQSETLEIPFRSTKIRKMTVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAE
        SQSETLEIPFRSTKIRKMTVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAE
Subjt:  SQSETLEIPFRSTKIRKMTVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAE

Query:  SIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGD
        SIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGD
Subjt:  SIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGD

Query:  LGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG
        LGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG
Subjt:  LGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG

XP_022949777.1 DNA-3-methyladenine glycosylase 1-like [Cucurbita moschata] | 2.56e-138 | 78.41
Query:  IPFRSTKIRKMTV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAE
        I FR+TKIRK++      KP I+  G  D TR+  N   PVKSLSS+D ICTAIDHLRRSDPLLI LLDSCE+P FKSNPPFLALTKSILYQQLATKAAE
Subjt:  IPFRSTKIRKMTV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAE

Query:  SIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGD
        SIYNRFA+LCGG+AAV+P AVLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ SSILEMDDETL+  LTAVKGIGVWSV MFMIF+LHRPDVLPVGD
Subjt:  SIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGD

Query:  LGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG
        LGVRKGVQRLYGLKELPKPVEMEKLC+KWKPYRSMGAWYMWRLME+KEIV    DL   +   G
Subjt:  LGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG

TrEMBL top hits | e value | % identity
A0A0A0KM62 ENDO3c domain-containing protein | 1.90e-137 | 77.86
Query:  SQSETLEIPFRSTKIRKMT-----VKPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQ
        S S + +IPF STK+RK++      KP I+  G  +PTR   N A PVKSLSS+D+I TAI+HLRRSDPLLI+LLDSCETP FKSNPPFLALTKSILYQQ
Subjt:  SQSETLEIPFRSTKIRKMT-----VKPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQ

Query:  LATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRP
        LATKAAE+IYNRFA+LCGGEAAV+P  VLGLS QQLRVIGVSGRKASYLHDLA KF+EG L+ S ILEMDDETL+R LTAVKGIGVWSV MFMIF+LHRP
Subjt:  LATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRP

Query:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
        DVLPVGDLGVRKGVQRLYGLKELPKP EMEKLC+KWKPYRS+GAWYMWRL++ KEIV  G D
Subjt:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

A0A1S3B2D5 probable DNA-3-methyladenine glycosylase 2 | 2.74e-139 | 76.01
Query:  SQSETLEIPFRSTKIRKMT-----VKPPIAGEN--DPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQ
        S S + +IPFRSTK+RK++      KP  +  +  +PTR+  N A PVKSLSS DEI TAI+HLRRSDPLLI+LLDSCE+P FKSNPPFLALTKSILYQQ
Subjt:  SQSETLEIPFRSTKIRKMT-----VKPPIAGEN--DPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQ

Query:  LATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRP
        LATKAAESIYNRFA+LCGGEA+V+P  VLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ S ILEMDDETL+  LTAVKGIGVWSV MFMIF+LHRP
Subjt:  LATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRP

Query:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG
        DVLPVGDLGVRKGVQRLYGLKELPKP EMEKLC+KWKPYRS+GAWYMWRLME K +V KG DL    E RG
Subjt:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG

A0A5A7T3R0 Putative DNA-3-methyladenine glycosylase 2 | 2.74e-139 | 76.01
Query:  SQSETLEIPFRSTKIRKMT-----VKPPIAGEN--DPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQ
        S S + +IPFRSTK+RK++      KP  +  +  +PTR+  N A PVKSLSS DEI TAI+HLRRSDPLLI+LLDSCE+P FKSNPPFLALTKSILYQQ
Subjt:  SQSETLEIPFRSTKIRKMT-----VKPPIAGEN--DPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQ

Query:  LATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRP
        LATKAAESIYNRFA+LCGGEA+V+P  VLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ S ILEMDDETL+  LTAVKGIGVWSV MFMIF+LHRP
Subjt:  LATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRP

Query:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG
        DVLPVGDLGVRKGVQRLYGLKELPKP EMEKLC+KWKPYRS+GAWYMWRLME K +V KG DL    E RG
Subjt:  DVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG

A0A6J1BSP4 DNA-3-methyladenine glycosylase 1-like | 8.90e-188 | 100
Query:  SQSETLEIPFRSTKIRKMTVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAE
        SQSETLEIPFRSTKIRKMTVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAE
Subjt:  SQSETLEIPFRSTKIRKMTVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAE

Query:  SIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGD
        SIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGD
Subjt:  SIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGD

Query:  LGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG
        LGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG
Subjt:  LGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG

A0A6J1GD23 DNA-3-methyladenine glycosylase 1-like | 1.24e-138 | 78.41
Query:  IPFRSTKIRKMTV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAE
        I FR+TKIRK++      KP I+  G  D TR+  N   PVKSLSS+D ICTAIDHLRRSDPLLI LLDSCE+P FKSNPPFLALTKSILYQQLATKAAE
Subjt:  IPFRSTKIRKMTV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAE

Query:  SIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGD
        SIYNRFA+LCGG+AAV+P AVLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ SSILEMDDETL+  LTAVKGIGVWSV MFMIF+LHRPDVLPVGD
Subjt:  SIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGD

Query:  LGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG
        LGVRKGVQRLYGLKELPKPVEMEKLC+KWKPYRSMGAWYMWRLME+KEIV    DL   +   G
Subjt:  LGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG

SwissProt top hits | e value | % identity
O31544 Putative DNA-3-methyladenine glycosylase YfjP | 6.6e-20 | 31.46
Query:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVV-----PGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETL
        TP       +  + K I++QQL    A ++  RF    G +   V     P  +  L  Q LR +  S RKA Y  D +    EG L+ S +  M DE +
Subjt:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVV-----PGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETL

Query:  VRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLME
        ++ L  ++GIG W+VQ  ++F L RP++ P+ D+G++  ++R + L + P    M  +  +W+PY S  + Y+WR +E
Subjt:  VRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLME

O94468 Alkylbase DNA glycosidase-like protein mag2 | 3.5e-21 | 28.65
Query:  LINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCG-GEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLA-GKFVEGILTESSILE
        L+  +  C       + P+  + ++I  Q+L+  A  SI N+F   C   +    P  ++    + L   G S  K+  +H +A     + I ++S I +
Subjt:  LINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCG-GEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLA-GKFVEGILTESSILE

Query:  MDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMEL
        M +E L+ +L+ +KG+  W+++M+ IF+L R D++P  D  ++   +  +GL   P+  E+EKL    KPYR++ AWY+W++ +L
Subjt:  MDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMEL

P22134 DNA-3-methyladenine glycosylase | 7.0e-14 | 29.64
Query:  LNPAYPVKSLSSTDE-ICTAIDHLRRSDPLLINLLDSCETPKF--KSNPP------FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVL---
        L  A P K ++  +E    A +H+   DP L  +L + E   +  ++  P      F+ L  +IL QQ++ +AAESI  R  +L GG  A     +L   
Subjt:  LNPAYPVKSLSSTDE-ICTAIDHLRRSDPLLINLLDSCETPKF--KSNPP------FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVL---

Query:  ---GLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSIL---EMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLK-E
                ++   G+S RK  YL  LA  F E       +    + D+E +   +T VKGIG WS +MF+I  L R DV    DLG+ +G  +    K E
Subjt:  ---GLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSIL---EMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLK-E

Query:  LPKPVE-------------------------MEKLCDKWKPYRSMGAWYMWRL
        L K +                          MEK  + + PYRS+  + +WRL
Subjt:  LPKPVE-------------------------MEKLCDKWKPYRSMGAWYMWRL

Q92383 DNA-3-methyladenine glycosylase 1 | 2.6e-24 | 33.14
Query:  PFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGIL-TESSILEMDDETLVRTLTAVKGIGV
        P+  L +++  QQL +KAA +I+NRF ++        P  +  +  + +R  G S RK   L  +A   + G++ T+     + +E L+  LT +KGIG 
Subjt:  PFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGIL-TESSILEMDDETLVRTLTAVKGIGV

Query:  WSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKE
        W+V+M +IFSL+R DV+P  DL +R G + L+ L ++P  + + K  +   P+R+  AWY+W+  +L +
Subjt:  WSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKE

Arabidopsis top hits | e value | % identity
AT1G19480.1 DNA glycosylase superfamily protein | 2.0e-72 | 52.48
Query:  EIPFRSTKIRKMTVKPPIAGE------------NDP---------------TRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-
        +IP R  KIRK+T+   ++GE            N P                R++  P    + L+   E+ TAI +LR +DPLL  L+D    P F+S 
Subjt:  EIPFRSTKIRKMTVKPPIAGE------------NDP---------------TRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-

Query:  NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIG
          PFLAL ++ILYQQLA KA  SIY RF +LCGGE  VVP  VL L+ QQLR IGVSGRKASYLHDLA K+  GIL++S+IL MD+++L   LT V GIG
Subjt:  NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIG

Query:  VWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKS
         WSV MFMI SLHRPDVLPV DLGVRKGVQ LYGL +LP+P +ME+ C KW+PYRS+G+WYMWRL+E K   +     +G S
Subjt:  VWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKS

AT1G19480.2 DNA glycosylase superfamily protein | 2.0e-72 | 52.48
Query:  EIPFRSTKIRKMTVKPPIAGE------------NDP---------------TRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-
        +IP R  KIRK+T+   ++GE            N P                R++  P    + L+   E+ TAI +LR +DPLL  L+D    P F+S 
Subjt:  EIPFRSTKIRKMTVKPPIAGE------------NDP---------------TRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-

Query:  NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIG
          PFLAL ++ILYQQLA KA  SIY RF +LCGGE  VVP  VL L+ QQLR IGVSGRKASYLHDLA K+  GIL++S+IL MD+++L   LT V GIG
Subjt:  NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIG

Query:  VWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKS
         WSV MFMI SLHRPDVLPV DLGVRKGVQ LYGL +LP+P +ME+ C KW+PYRS+G+WYMWRL+E K   +     +G S
Subjt:  VWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKS

AT1G75230.1 DNA glycosylase superfamily protein | 2.2e-71 | 53.76
Query:  SETLEIPFRSTKIRKM--------------------TVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-NPP
        S   +IP R  KIRK+                    T KP    +   +R+V  P    +SL+   E+  A+ HLR  DPLL +L+D    P F++   P
Subjt:  SETLEIPFRSTKIRKM--------------------TVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-NPP

Query:  FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWS
        FLAL +SILYQQLA KA  SIY RF ALCGGE  VVP  VL L+ QQLR IGVSGRKASYLHDLA K+  GIL++S I+ MD+++L   LT V GIG WS
Subjt:  FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWS

Query:  VQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK
        V MFMI SLHRPDVLPV DLGVRKGVQ L G+++LP+P +ME+LC+KW+PYRS+ +WY+WRL+E K
Subjt:  VQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK

AT1G75230.2 DNA glycosylase superfamily protein | 2.2e-71 | 53.76
Query:  SETLEIPFRSTKIRKM--------------------TVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-NPP
        S   +IP R  KIRK+                    T KP    +   +R+V  P    +SL+   E+  A+ HLR  DPLL +L+D    P F++   P
Subjt:  SETLEIPFRSTKIRKM--------------------TVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-NPP

Query:  FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWS
        FLAL +SILYQQLA KA  SIY RF ALCGGE  VVP  VL L+ QQLR IGVSGRKASYLHDLA K+  GIL++S I+ MD+++L   LT V GIG WS
Subjt:  FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWS

Query:  VQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK
        V MFMI SLHRPDVLPV DLGVRKGVQ L G+++LP+P +ME+LC+KW+PYRS+ +WY+WRL+E K
Subjt:  VQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK

AT3G50880.1 DNA glycosylase superfamily protein | 1.7e-71 | 59.02
Query:  IPFRSTKIRKMTVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKF--KSNPPFLALTKSILYQQLATKAAESIYNR
        I FR  KIRK++        +DP+  ++  A P  S  ST +I  A+ HL+ SD LL  L+ +   P     SN PFL+L +SILYQQLATKAA+ IY+R
Subjt:  IPFRSTKIRKMTVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKF--KSNPPFLALTKSILYQQLATKAAESIYNR

Query:  FAALC-GGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVR
        F +L  GGEA VVP +V+ LSA  LR IGVSGRKASYLHDLA K+  G+L++  IL+M DE L+  LT VKGIGVW+V MFMIFSLHRPDVLPVGDLGVR
Subjt:  FAALC-GGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVR

Query:  KGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKE
        KGV+ LYGLK LP P++ME+LC+KW+PYRS+G+WYMWRL+E ++
Subjt:  KGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKE


Sequences
CDS sequence
TCCCAGTCTGAAACGCTGGAGATTCCGTTTCGATCCACAAAAATACGGAAGATGACGGTCAAACCACCAATCGCCGGCGAGAATGACCCAACCCGATCAGTTCTGAACCC
GGCCTATCCCGTCAAATCGTTATCGTCTACGGATGAAATCTGTACGGCGATCGATCACTTACGCCGATCGGACCCTCTCCTGATAAATCTGTTAGATTCGTGCGAAACCC
CCAAATTCAAATCGAATCCCCCATTCCTAGCCCTAACGAAGAGCATTCTGTACCAACAGCTCGCTACAAAGGCCGCCGAATCAATCTACAATCGGTTCGCCGCGCTGTGC
GGCGGCGAGGCGGCGGTGGTCCCGGGCGCCGTGCTGGGGCTGTCGGCGCAGCAGCTGCGGGTAATTGGAGTTTCGGGGCGGAAAGCGAGCTACCTCCATGACCTAGCGGG
GAAATTCGTGGAGGGGATTTTGACGGAATCTTCGATTCTGGAGATGGACGACGAGACTCTGGTGAGGACGTTGACGGCGGTGAAGGGAATCGGCGTTTGGTCGGTGCAAA
TGTTCATGATTTTCAGTCTGCATCGGCCGGACGTGCTGCCGGTGGGCGATCTCGGCGTCAGAAAAGGCGTGCAGCGGCTGTACGGACTGAAGGAGTTGCCGAAGCCAGTG
GAGATGGAGAAACTGTGCGACAAATGGAAGCCGTATAGGTCGATGGGCGCTTGGTACATGTGGAGGCTAATGGAACTGAAGGAGATCGTGAGTAAAGGTCGCGATTTATC
AGGTAAATCGGAAACCAGAGGT
mRNA sequence
TCCCAGTCTGAAACGCTGGAGATTCCGTTTCGATCCACAAAAATACGGAAGATGACGGTCAAACCACCAATCGCCGGCGAGAATGACCCAACCCGATCAGTTCTGAACCC
GGCCTATCCCGTCAAATCGTTATCGTCTACGGATGAAATCTGTACGGCGATCGATCACTTACGCCGATCGGACCCTCTCCTGATAAATCTGTTAGATTCGTGCGAAACCC
CCAAATTCAAATCGAATCCCCCATTCCTAGCCCTAACGAAGAGCATTCTGTACCAACAGCTCGCTACAAAGGCCGCCGAATCAATCTACAATCGGTTCGCCGCGCTGTGC
GGCGGCGAGGCGGCGGTGGTCCCGGGCGCCGTGCTGGGGCTGTCGGCGCAGCAGCTGCGGGTAATTGGAGTTTCGGGGCGGAAAGCGAGCTACCTCCATGACCTAGCGGG
GAAATTCGTGGAGGGGATTTTGACGGAATCTTCGATTCTGGAGATGGACGACGAGACTCTGGTGAGGACGTTGACGGCGGTGAAGGGAATCGGCGTTTGGTCGGTGCAAA
TGTTCATGATTTTCAGTCTGCATCGGCCGGACGTGCTGCCGGTGGGCGATCTCGGCGTCAGAAAAGGCGTGCAGCGGCTGTACGGACTGAAGGAGTTGCCGAAGCCAGTG
GAGATGGAGAAACTGTGCGACAAATGGAAGCCGTATAGGTCGATGGGCGCTTGGTACATGTGGAGGCTAATGGAACTGAAGGAGATCGTGAGTAAAGGTCGCGATTTATC
AGGTAAATCGGAAACCAGAGGT
Protein sequence
SQSETLEIPFRSTKIRKMTVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALC
GGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPV
EMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRDLSGKSETRG
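
The genome location given for this gene (MC01:16496776..16497567) can be cross-checked against the protein sequence listed here. Assuming the coordinates are 1-based and inclusive (an assumption, not something the record states), the span is 792 bp, which equals three times the 264 residues of the protein sequence. A minimal Python sketch of that check follows; the helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("MC01:16496776..16497567")
length = span_length(start, end)

# Number of residues in the protein sequence listed in this record.
protein_length = 264

# True when the genomic span is exactly three nucleotides per residue.
print(chrom, length, length == 3 * protein_length)
```

If the coordinates were 0-based or half-open instead, the computed span would be 791 bp and would no longer be a multiple of three, so the match is some evidence for the inclusive reading.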