; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010535 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010535
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDNA-3-methyladenine glycosylase 1-like
Genome locationscaffold35:380662..381474
RNA-Seq ExpressionMS010535
SyntenyMS010535
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603971.1 Zinc-finger homeodomain protein 6, partial [Cucurbita argyrosperma subsp. sororia]5.2e-11077.03Show/hide
Query:  MAKRTRLKSKSQSLLQSQSET-----LEIPFRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE
        MA+RTR K     LLQS+S+T       I FR+TKIRKI+      KP I+  G  D TR+  N   PVKSLSS+D ICTAIDHLRRSDPLLI LLDSCE
Subjt:  MAKRTRLKSKSQSLLQSQSET-----LEIPFRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE

Query:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT
        +P FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGG+AAV+P AVLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ SSILEMDDETL+  LT
Subjt:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT

Query:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
        AVKGIGVWSV MFMIF+LHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLC+KWKPYRSMGAWYMWRLME+KEIV    D
Subjt:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

KAG7034142.1 mag1, partial [Cucurbita argyrosperma subsp. argyrosperma]4.0e-11077.03Show/hide
Query:  MAKRTRLKSKSQSLLQSQSETLEIP-----FRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE
        MA+RTR K     LLQS+S+T + P     FR+TKIRKI+      KP I+  G  D TR+  N   PVKSLSS+D ICTAIDHLRRSDPLLI LLDSCE
Subjt:  MAKRTRLKSKSQSLLQSQSETLEIP-----FRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE

Query:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT
        +P FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGG+AAV+P AVLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ SSILEMDDETL+  LT
Subjt:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT

Query:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
        AVKGIGVWSV MFMIF+LHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLC+KWKPYRSMGAWYMWRLME+KEIV    D
Subjt:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

XP_022132520.1 DNA-3-methyladenine glycosylase 1-like [Momordica charantia]6.9e-14799.26Show/hide
Query:  MAKRTRLKSKSQSLLQSQSETLEIPFRSTKIRKITVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLA
        M+KRTRLKSKSQSLLQSQSETLEIPFRSTKIRK+TVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLA
Subjt:  MAKRTRLKSKSQSLLQSQSETLEIPFRSTKIRKITVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLA

Query:  LTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQM
        LTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQM
Subjt:  LTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQM

Query:  FMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
        FMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
Subjt:  FMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

XP_022949777.1 DNA-3-methyladenine glycosylase 1-like [Cucurbita moschata]5.2e-11077.03Show/hide
Query:  MAKRTRLKSKSQSLLQSQSET-----LEIPFRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE
        MA+RTR K     LLQS+S+T       I FR+TKIRKI+      KP I+  G  D TR+  N   PVKSLSS+D ICTAIDHLRRSDPLLI LLDSCE
Subjt:  MAKRTRLKSKSQSLLQSQSET-----LEIPFRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE

Query:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT
        +P FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGG+AAV+P AVLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ SSILEMDDETL+  LT
Subjt:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT

Query:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
        AVKGIGVWSV MFMIF+LHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLC+KWKPYRSMGAWYMWRLME+KEIV    D
Subjt:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

XP_023543059.1 DNA-3-methyladenine glycosylase 1 [Cucurbita pepo subsp. pepo]1.7e-10876.68Show/hide
Query:  MAKRTRLKSKSQSLLQSQSET-----LEIPFRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE
        MA+RTR K     LLQS+S+T      +I FR+TKIRKI+      KP I+  G  D TR+  N   PVKSLSS+D I TAIDHLRRSDPLLI LLDSCE
Subjt:  MAKRTRLKSKSQSLLQSQSET-----LEIPFRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE

Query:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT
        +P FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGGEAAV+P AVLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ SSILEMDDETL+  LT
Subjt:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT

Query:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
        AVKGIGVWSV MFMIF+LHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLC+ WKPYRSMGAWYMWRLME+KEIV    D
Subjt:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

TrEMBL top hitse value%identityAlignment
A0A0A0KM62 ENDO3c domain-containing protein4.0e-10875Show/hide
Query:  MAKRTR------LKSKSQSLLQSQSETLEIPFRSTKIRKIT-----VKPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSC
        MAKR R      L+S S ++  S S + +IPF STK+RKI+      KP I+  G  +PTR   N A PVKSLSS+D+I TAI+HLRRSDPLLI+LLDSC
Subjt:  MAKRTR------LKSKSQSLLQSQSETLEIPFRSTKIRKIT-----VKPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSC

Query:  ETPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTL
        ETP FKSNPPFLALTKSILYQQLATKAAE+IYNRFA+LCGGEAAV+P  VLGLS QQLRVIGVSGRKASYLHDLA KF+EG L+ S ILEMDDETL+R L
Subjt:  ETPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTL

Query:  TAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
        TAVKGIGVWSV MFMIF+LHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLC+KWKPYRS+GAWYMWRL++ KEIV  G D
Subjt:  TAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

A0A5A7T3R0 Putative DNA-3-methyladenine glycosylase 27.6e-10773.94Show/hide
Query:  MAKRTRLK------SKSQSLLQSQSETLEIPFRSTKIRKIT-----VKPPIAGEN--DPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSC
        MAKR R K      S + ++  S S + +IPFRSTK+RKI+      KP  +  +  +PTR+  N A PVKSLSS DEI TAI+HLRRSDPLLI+LLDSC
Subjt:  MAKRTRLK------SKSQSLLQSQSETLEIPFRSTKIRKIT-----VKPPIAGEN--DPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSC

Query:  ETPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTL
        E+P FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGGEA+V+P  VLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ S ILEMDDETL+  L
Subjt:  ETPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTL

Query:  TAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
        TAVKGIGVWSV MFMIF+LHRPDVLPVGDLGVRKGVQRLYGLKELPKP EMEKLC+KWKPYRS+GAWYMWRLME K +V KG D
Subjt:  TAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

A0A6J1BSP4 DNA-3-methyladenine glycosylase 1-like3.4e-14799.26Show/hide
Query:  MAKRTRLKSKSQSLLQSQSETLEIPFRSTKIRKITVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLA
        M+KRTRLKSKSQSLLQSQSETLEIPFRSTKIRK+TVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLA
Subjt:  MAKRTRLKSKSQSLLQSQSETLEIPFRSTKIRKITVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLA

Query:  LTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQM
        LTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQM
Subjt:  LTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQM

Query:  FMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
        FMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
Subjt:  FMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

A0A6J1GD23 DNA-3-methyladenine glycosylase 1-like2.5e-11077.03Show/hide
Query:  MAKRTRLKSKSQSLLQSQSET-----LEIPFRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE
        MA+RTR K     LLQS+S+T       I FR+TKIRKI+      KP I+  G  D TR+  N   PVKSLSS+D ICTAIDHLRRSDPLLI LLDSCE
Subjt:  MAKRTRLKSKSQSLLQSQSET-----LEIPFRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE

Query:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT
        +P FKSNPPFLALTKSILYQQLATKAAESIYNRFA+LCGG+AAV+P AVLGLS QQLRV+GVSGRKASYLHDLA KF+EG L+ SSILEMDDETL+  LT
Subjt:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT

Query:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
        AVKGIGVWSV MFMIF+LHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLC+KWKPYRSMGAWYMWRLME+KEIV    D
Subjt:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

A0A6J1IQD1 DNA-3-methyladenine glycosylase 1-like1.1e-10875.97Show/hide
Query:  MAKRTRLKSKSQSLLQSQSET-----LEIPFRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE
        MA+RTR K     LLQS+S+T      +I FR+T+IRKI+      KP I+  G  D TR+  N   PVKSLSS+D ICTAIDHLRRSDPLLI LLDSCE
Subjt:  MAKRTRLKSKSQSLLQSQSET-----LEIPFRSTKIRKITV-----KPPIA--GENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCE

Query:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT
        +P FKSNPPFLA+TKSILYQQLATKAAESIYNRFA+LCGGEAAV+P AVLGLS QQLRV+GVSGRKASYLHDLA KFVEG L+ SSILEMDDETL+  LT
Subjt:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLT

Query:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD
         VKGIGVWSV MFMIF+LHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLC+KWKPYRSMGAWYMWRLME+K I     D
Subjt:  AVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD

SwissProt top hitse value%identityAlignment
O31544 Putative DNA-3-methyladenine glycosylase YfjP6.8e-2031.46Show/hide
Query:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVV-----PGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETL
        TP       +  + K I++QQL    A ++  RF    G +   V     P  +  L  Q LR +  S RKA Y  D +    EG L+ S +  M DE +
Subjt:  TPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVV-----PGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETL

Query:  VRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLME
        ++ L  ++GIG W+VQ  ++F L RP++ P+ D+G++  ++R + L + P    M  +  +W+PY S  + Y+WR +E
Subjt:  VRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLME

O94468 Alkylbase DNA glycosidase-like protein mag23.6e-2128.65Show/hide
Query:  LINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCG-GEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLA-GKFVEGILTESSILE
        L+  +  C       + P+  + ++I  Q+L+  A  SI N+F   C   +    P  ++    + L   G S  K+  +H +A     + I ++S I +
Subjt:  LINLLDSCETPKFKSNPPFLALTKSILYQQLATKAAESIYNRFAALCG-GEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLA-GKFVEGILTESSILE

Query:  MDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMEL
        M +E L+ +L+ +KG+  W+++M+ IF+L R D++P  D  ++   +  +GL   P+  E+EKL    KPYR++ AWY+W++ +L
Subjt:  MDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMEL

P22134 DNA-3-methyladenine glycosylase7.2e-1429.64Show/hide
Query:  LNPAYPVKSLSSTDE-ICTAIDHLRRSDPLLINLLDSCETPKF--KSNPP------FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVL---
        L  A P K ++  +E    A +H+   DP L  +L + E   +  ++  P      F+ L  +IL QQ++ +AAESI  R  +L GG  A     +L   
Subjt:  LNPAYPVKSLSSTDE-ICTAIDHLRRSDPLLINLLDSCETPKF--KSNPP------FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVL---

Query:  ---GLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSIL---EMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLK-E
                ++   G+S RK  YL  LA  F E       +    + D+E +   +T VKGIG WS +MF+I  L R DV    DLG+ +G  +    K E
Subjt:  ---GLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSIL---EMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLK-E

Query:  LPKPVE-------------------------MEKLCDKWKPYRSMGAWYMWRL
        L K +                          MEK  + + PYRS+  + +WRL
Subjt:  LPKPVE-------------------------MEKLCDKWKPYRSMGAWYMWRL

Q92383 DNA-3-methyladenine glycosylase 12.7e-2433.14Show/hide
Query:  PFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGIL-TESSILEMDDETLVRTLTAVKGIGV
        P+  L +++  QQL +KAA +I+NRF ++        P  +  +  + +R  G S RK   L  +A   + G++ T+     + +E L+  LT +KGIG 
Subjt:  PFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGIL-TESSILEMDDETLVRTLTAVKGIGV

Query:  WSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKE
        W+V+M +IFSL+R DV+P  DL +R G + L+ L ++P  + + K  +   P+R+  AWY+W+  +L +
Subjt:  WSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKE

Arabidopsis top hitse value%identityAlignment
AT1G19480.1 DNA glycosylase superfamily protein6.0e-7254.28Show/hide
Query:  EIPFRSTKIRKITVKPPIAGE------------NDP---------------TRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-
        +IP R  KIRK+T+   ++GE            N P                R++  P    + L+   E+ TAI +LR +DPLL  L+D    P F+S 
Subjt:  EIPFRSTKIRKITVKPPIAGE------------NDP---------------TRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-

Query:  NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIG
          PFLAL ++ILYQQLA KA  SIY RF +LCGGE  VVP  VL L+ QQLR IGVSGRKASYLHDLA K+  GIL++S+IL MD+++L   LT V GIG
Subjt:  NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIG

Query:  VWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK
         WSV MFMI SLHRPDVLPV DLGVRKGVQ LYGL +LP+P +ME+ C KW+PYRS+G+WYMWRL+E K
Subjt:  VWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK

AT1G19480.2 DNA glycosylase superfamily protein6.0e-7254.28Show/hide
Query:  EIPFRSTKIRKITVKPPIAGE------------NDP---------------TRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-
        +IP R  KIRK+T+   ++GE            N P                R++  P    + L+   E+ TAI +LR +DPLL  L+D    P F+S 
Subjt:  EIPFRSTKIRKITVKPPIAGE------------NDP---------------TRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-

Query:  NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIG
          PFLAL ++ILYQQLA KA  SIY RF +LCGGE  VVP  VL L+ QQLR IGVSGRKASYLHDLA K+  GIL++S+IL MD+++L   LT V GIG
Subjt:  NPPFLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIG

Query:  VWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK
         WSV MFMI SLHRPDVLPV DLGVRKGVQ LYGL +LP+P +ME+ C KW+PYRS+G+WYMWRL+E K
Subjt:  VWSVQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK

AT1G75230.1 DNA glycosylase superfamily protein3.0e-7153.76Show/hide
Query:  SETLEIPFRSTKIRKI--------------------TVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-NPP
        S   +IP R  KIRK+                    T KP    +   +R+V  P    +SL+   E+  A+ HLR  DPLL +L+D    P F++   P
Subjt:  SETLEIPFRSTKIRKI--------------------TVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-NPP

Query:  FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWS
        FLAL +SILYQQLA KA  SIY RF ALCGGE  VVP  VL L+ QQLR IGVSGRKASYLHDLA K+  GIL++S I+ MD+++L   LT V GIG WS
Subjt:  FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWS

Query:  VQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK
        V MFMI SLHRPDVLPV DLGVRKGVQ L G+++LP+P +ME+LC+KW+PYRS+ +WY+WRL+E K
Subjt:  VQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK

AT1G75230.2 DNA glycosylase superfamily protein3.0e-7153.76Show/hide
Query:  SETLEIPFRSTKIRKI--------------------TVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-NPP
        S   +IP R  KIRK+                    T KP    +   +R+V  P    +SL+   E+  A+ HLR  DPLL +L+D    P F++   P
Subjt:  SETLEIPFRSTKIRKI--------------------TVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKS-NPP

Query:  FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWS
        FLAL +SILYQQLA KA  SIY RF ALCGGE  VVP  VL L+ QQLR IGVSGRKASYLHDLA K+  GIL++S I+ MD+++L   LT V GIG WS
Subjt:  FLALTKSILYQQLATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWS

Query:  VQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK
        V MFMI SLHRPDVLPV DLGVRKGVQ L G+++LP+P +ME+LC+KW+PYRS+ +WY+WRL+E K
Subjt:  VQMFMIFSLHRPDVLPVGDLGVRKGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELK

AT3G50880.1 DNA glycosylase superfamily protein1.3e-7159.02Show/hide
Query:  IPFRSTKIRKITVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKF--KSNPPFLALTKSILYQQLATKAAESIYNR
        I FR  KIRK++        +DP+  ++  A P  S  ST +I  A+ HL+ SD LL  L+ +   P     SN PFL+L +SILYQQLATKAA+ IY+R
Subjt:  IPFRSTKIRKITVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKF--KSNPPFLALTKSILYQQLATKAAESIYNR

Query:  FAALC-GGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVR
        F +L  GGEA VVP +V+ LSA  LR IGVSGRKASYLHDLA K+  G+L++  IL+M DE L+  LT VKGIGVW+V MFMIFSLHRPDVLPVGDLGVR
Subjt:  FAALC-GGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVR

Query:  KGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKE
        KGV+ LYGLK LP P++ME+LC+KW+PYRS+G+WYMWRL+E ++
Subjt:  KGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAAAAGAACCCGCCTGAAATCGAAATCTCAATCTCTATTACAGTCCCAGTCTGAAACGCTGGAGATTCCGTTTCGATCCACAAAAATACGGAAGATAACGGTCAA
ACCACCAATCGCCGGCGAGAATGACCCAACCCGATCAGTTCTGAACCCGGCCTATCCCGTCAAATCGTTATCGTCTACGGATGAAATCTGTACGGCGATCGATCACTTAC
GCCGATCGGACCCTCTCCTGATAAATCTGTTAGATTCGTGCGAAACCCCCAAATTCAAATCGAATCCCCCATTCCTAGCCCTAACGAAGAGCATTCTGTACCAACAGCTC
GCTACGAAGGCCGCCGAATCAATCTACAATCGGTTCGCCGCGCTGTGCGGCGGCGAGGCGGCGGTGGTCCCGGGCGCCGTGCTGGGGCTGTCGGCGCAGCAGCTGCGGGT
AATTGGAGTTTCGGGGCGGAAAGCGAGCTACCTCCATGACCTAGCGGGTAAATTCGTGGAGGGGATTTTGACGGAATCTTCGATTCTGGAGATGGACGACGAGACTCTGG
TGAGGACGTTGACGGCGGTGAAGGGAATCGGCGTTTGGTCGGTGCAAATGTTCATGATTTTCAGTCTGCATCGGCCGGACGTGCTGCCGGTGGGCGATCTCGGCGTCAGA
AAAGGCGTGCAGCGGCTGTACGGACTGAAGGAGTTGCCGAAGCCAGTGGAGATGGAGAAACTGTGCGACAAATGGAAGCCGTATAGGTCGATGGGCGCTTGGTACATGTG
GAGGCTAATGGAACTGAAGGAGATCGTGAGTAAAGGTCGCGAT
mRNA sequenceShow/hide mRNA sequence
ATGGCCAAAAGAACCCGCCTGAAATCGAAATCTCAATCTCTATTACAGTCCCAGTCTGAAACGCTGGAGATTCCGTTTCGATCCACAAAAATACGGAAGATAACGGTCAA
ACCACCAATCGCCGGCGAGAATGACCCAACCCGATCAGTTCTGAACCCGGCCTATCCCGTCAAATCGTTATCGTCTACGGATGAAATCTGTACGGCGATCGATCACTTAC
GCCGATCGGACCCTCTCCTGATAAATCTGTTAGATTCGTGCGAAACCCCCAAATTCAAATCGAATCCCCCATTCCTAGCCCTAACGAAGAGCATTCTGTACCAACAGCTC
GCTACGAAGGCCGCCGAATCAATCTACAATCGGTTCGCCGCGCTGTGCGGCGGCGAGGCGGCGGTGGTCCCGGGCGCCGTGCTGGGGCTGTCGGCGCAGCAGCTGCGGGT
AATTGGAGTTTCGGGGCGGAAAGCGAGCTACCTCCATGACCTAGCGGGTAAATTCGTGGAGGGGATTTTGACGGAATCTTCGATTCTGGAGATGGACGACGAGACTCTGG
TGAGGACGTTGACGGCGGTGAAGGGAATCGGCGTTTGGTCGGTGCAAATGTTCATGATTTTCAGTCTGCATCGGCCGGACGTGCTGCCGGTGGGCGATCTCGGCGTCAGA
AAAGGCGTGCAGCGGCTGTACGGACTGAAGGAGTTGCCGAAGCCAGTGGAGATGGAGAAACTGTGCGACAAATGGAAGCCGTATAGGTCGATGGGCGCTTGGTACATGTG
GAGGCTAATGGAACTGAAGGAGATCGTGAGTAAAGGTCGCGAT
Protein sequenceShow/hide protein sequence
MAKRTRLKSKSQSLLQSQSETLEIPFRSTKIRKITVKPPIAGENDPTRSVLNPAYPVKSLSSTDEICTAIDHLRRSDPLLINLLDSCETPKFKSNPPFLALTKSILYQQL
ATKAAESIYNRFAALCGGEAAVVPGAVLGLSAQQLRVIGVSGRKASYLHDLAGKFVEGILTESSILEMDDETLVRTLTAVKGIGVWSVQMFMIFSLHRPDVLPVGDLGVR
KGVQRLYGLKELPKPVEMEKLCDKWKPYRSMGAWYMWRLMELKEIVSKGRD