CuGenDBv2

Lsi07G004210 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi07G004210
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: formamidopyrimidine-DNA glycosylase isoform X1
Genome location: chr07:4479044..4484784
RNA-Seq Expression: Lsi07G004210
Synteny: Lsi07G004210
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008534 - oxidized purine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0016829 - lyase activity (molecular function)
InterPro domains:
IPR010979 - Ribosomal protein S13-like, H2TH
IPR012319 - Formamidopyrimidine-DNA glycosylase, catalytic domain
IPR015886 - DNA glycosylase/AP lyase, H2TH DNA-binding
IPR035937 - MutM-like, N-terminal
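
The genome location field uses a `chromosome:start..end` notation. As a minimal sketch, the Python snippet below parses that string and computes the length of the gene span. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr07:4479044..4484784")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 5741 bp on chr07. That is considerably longer than the mRNA sequence listed on this page, which would be consistent with the presence of introns.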


Homology
GenBank top hits | e value | % identity
KAA0044473.1 formamidopyrimidine-DNA glycosylase isoform X2 [Cucumis melo var. makuwa] | 1.4e-160 | 74.54
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLY------
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLY      
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLY------

Query:  ---------------------------------------------------QVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTS
                                                           QVIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  ---------------------------------------------------QVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVG
        AFVPELQKLTGAEPKNQ SKRKGND+K+MNDEG G+LVSKT+KTAD K+KPKPKGRSKKPS KRKSKS++  +DGSDEEAENDDASDDDNGR  G KK+G
Subjt:  AFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVG

Query:  KKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK
        KKTNIGQ  DAASE EKSLKQTVQSS+NG  RKKAK
Subjt:  KKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK

XP_004152179.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucumis sativus] | 1.9e-160 | 79.26
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQ     
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----

Query:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND
                             VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MND
Subjt:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND

Query:  EGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMG
        E  G+LVSKT KTAD K+KPKPKGRSKKPS KRKSKS++  DDGSDEEAENDDASDDDNGR  G KKVG KTNIGQ  DAASE +KSLKQTV+SS+ G  
Subjt:  EGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMG

Query:  RKKAK
        RKKAK
Subjt:  RKKAK

XP_008454182.1 PREDICTED: formamidopyrimidine-DNA glycosylase isoform X1 [Cucumis melo] | 4.8e-164 | 80
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQ     
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----

Query:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND
                             VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MND
Subjt:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND

Query:  EGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMG
        EG G+LVSKT+KTAD K+KPKPKGRSKKPS KRKSKS++  +DGSDEEAENDDASDDDNGR  G KK+GKKTNIGQ  DAASE EKSLKQTVQSS+NG  
Subjt:  EGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMG

Query:  RKKAK
        RKKAK
Subjt:  RKKAK

XP_008454183.1 PREDICTED: formamidopyrimidine-DNA glycosylase isoform X2 [Cucumis melo] | 8.5e-153 | 76.11
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQ     
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----

Query:  ---------------------VIEKALEVGADSSRFPSNWLFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMN
                             V+++A+EV A+S+ FP  WLFH R  K+PG+  V+GKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MN
Subjt:  ---------------------VIEKALEVGADSSRFPSNWLFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMN

Query:  DEGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGM
        DEG G+LVSKT+KTAD K+KPKPKGRSKKPS KRKSKS++  +DGSDEEAENDDASDDDNGR  G KK+GKKTNIGQ  DAASE EKSLKQTVQSS+NG 
Subjt:  DEGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGM

Query:  GRKKAK
         RKKAK
Subjt:  GRKKAK

XP_038877199.1 formamidopyrimidine-DNA glycosylase isoform X3 [Benincasa hispida] | 2.9e-161 | 78.97
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSP+DFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ     
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----

Query:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND
                             VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE KNQ SKRKGN+SKEMND
Subjt:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND

Query:  EGAGKLVSKTDKTADT----KKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSK
        EGA +LVSKT+KTADT    KK+PKPKGR KKPSTKRKSKSD+   DGS+EEAENDDASDDD+G  VG KKVGK TN G+M++AASE EKSLKQTV SS+
Subjt:  EGAGKLVSKTDKTADT----KKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSK

Query:  NGMGRKKAK
        +G  RKKAK
Subjt:  NGMGRKKAK

TrEMBL top hits | e value | % identity
A0A0A0KWY6 FPG_CAT domain-containing protein | 9.1e-161 | 79.26
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQ     
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----

Query:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND
                             VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MND
Subjt:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND

Query:  EGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMG
        E  G+LVSKT KTAD K+KPKPKGRSKKPS KRKSKS++  DDGSDEEAENDDASDDDNGR  G KKVG KTNIGQ  DAASE +KSLKQTV+SS+ G  
Subjt:  EGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMG

Query:  RKKAK
        RKKAK
Subjt:  RKKAK

A0A1S3BY09 formamidopyrimidine-DNA glycosylase isoform X2 | 4.1e-153 | 76.11
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQ     
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----

Query:  ---------------------VIEKALEVGADSSRFPSNWLFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMN
                             V+++A+EV A+S+ FP  WLFH R  K+PG+  V+GKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MN
Subjt:  ---------------------VIEKALEVGADSSRFPSNWLFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMN

Query:  DEGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGM
        DEG G+LVSKT+KTAD K+KPKPKGRSKKPS KRKSKS++  +DGSDEEAENDDASDDDNGR  G KK+GKKTNIGQ  DAASE EKSLKQTVQSS+NG 
Subjt:  DEGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGM

Query:  GRKKAK
         RKKAK
Subjt:  GRKKAK

A0A1S3BY51 formamidopyrimidine-DNA glycosylase isoform X1 | 2.3e-164 | 80
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQ     
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----

Query:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND
                             VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MND
Subjt:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND

Query:  EGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMG
        EG G+LVSKT+KTAD K+KPKPKGRSKKPS KRKSKS++  +DGSDEEAENDDASDDDNGR  G KK+GKKTNIGQ  DAASE EKSLKQTVQSS+NG  
Subjt:  EGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMG

Query:  RKKAK
        RKKAK
Subjt:  RKKAK

A0A5A7TLT5 Formamidopyrimidine-DNA glycosylase isoform X2 | 7.0e-161 | 74.54
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLY------
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLY      
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLY------

Query:  ---------------------------------------------------QVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTS
                                                           QVIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  ---------------------------------------------------QVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVG
        AFVPELQKLTGAEPKNQ SKRKGND+K+MNDEG G+LVSKT+KTAD K+KPKPKGRSKKPS KRKSKS++  +DGSDEEAENDDASDDDNGR  G KK+G
Subjt:  AFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVG

Query:  KKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK
        KKTNIGQ  DAASE EKSLKQTVQSS+NG  RKKAK
Subjt:  KKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK

A0A5D3E227 Formamidopyrimidine-DNA glycosylase isoform X1 | 2.3e-164 | 80
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQ     
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----

Query:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND
                             VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MND
Subjt:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND

Query:  EGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMG
        EG G+LVSKT+KTAD K+KPKPKGRSKKPS KRKSKS++  +DGSDEEAENDDASDDDNGR  G KK+GKKTNIGQ  DAASE EKSLKQTVQSS+NG  
Subjt:  EGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMG

Query:  RKKAK
        RKKAK
Subjt:  RKKAK

SwissProt top hits | e value | % identity
A0PQ49 Formamidopyrimidine-DNA glycosylase | 1.3e-15 | 32.86
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKV-IDGVSPSDFENSILGKTILSAHRKGKHLWLRLDS--------PPFPAF--HFGMAGAIYIKGVAV
        MPELPEVE  RR +++H VGK +    +     V      P+D    +LG  I    R+GK+LWL LD+         P  A   H GM+G + + GV  
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKV-IDGVSPSDFENSILGKTILSAHRKGKHLWLRLDS--------PPFPAF--HFGMAGAIYIKGVAV

Query:  TNYKR-SMVNDDDEWPSKYSK-----------FFVEPASVP-PISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQVIEK
          + R S V DD    S   +             V+ + VP P++ L  D L     +D  I+ +  K   +K  LLDQ  +SGIGN  ADE L++    
Subjt:  TNYKR-SMVNDDDEWPSKYSK-----------FFVEPASVP-PISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQVIEK

Query:  ALEVGADSSR
           V A  +R
Subjt:  ALEVGADSSR

A4FMJ7 Formamidopyrimidine-DNA glycosylase | 5.4e-17 | 30.61
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGV-SPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPA-------FHFGMAGAIYIKGVAVTN-
        MPELPEVE  RR +  H VG+ + +  +     V   V  P DF   + G+ + +A R+GK++WL L   P           H GM+G + ++     + 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGV-SPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPA-------FHFGMAGAIYIKGVAVTN-

Query:  -YKRSMVNDDDEWP-------------SKYSKFFVEPASVP-PISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ
         + R     DD  P             S      V+  +VP P++ + PD L     L+  +  M K++  +K  LLDQ+ +SGIGN  ADE L++
Subjt:  -YKRSMVNDDDEWP-------------SKYSKFFVEPASVP-PISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ

A9B0X2 Formamidopyrimidine-DNA glycosylase | 3.2e-17 | 31.89
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKG------------VA
        MPELPEVE  RR++E+  VG+           K++D  SP  F  +I  + I    R+ K+L + LD+      H  M G + +              VA
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKG------------VA

Query:  VTNYKRSMVNDDDEWPSKYSKF-FVEPASVPPIS-KLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLY
        + N +    +D    P K+ ++  V+ + V  ++ +LGP+ L +   LDDF + + +K   IK  LLDQS ++G+GN  ADE L+
Subjt:  VTNYKRSMVNDDDEWPSKYSKF-FVEPASVPPIS-KLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLY

B2HJJ6 Formamidopyrimidine-DNA glycosylase | 1.0e-15 | 33.33
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKV-IDGVSPSDFENSILGKTILSAHRKGKHLWLRLDS--------PPFPAF--HFGMAGAIYIKGVAV
        MPELPEVE  RR +++H VGK +    +     V      P+D    +LG  I    R+GK+LWL LD+         P  A   H GM+G + + GV  
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKV-IDGVSPSDFENSILGKTILSAHRKGKHLWLRLDS--------PPFPAF--HFGMAGAIYIKGVAV

Query:  TNYKR-SMVNDDDEWPSKYSK-----------FFVEPASVP-PISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQVIEK
          + R S V DD    S   +             V+ + VP P++ L  D L     +D  I+ +  K   IK  LLDQ  +SGIGN  ADE L++    
Subjt:  TNYKR-SMVNDDDEWPSKYSK-----------FFVEPASVP-PISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQVIEK

Query:  ALEVGADSSR
           V A  +R
Subjt:  ALEVGADSSR

O80358 Formamidopyrimidine-DNA glycosylase | 1.1e-99 | 54.17
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ SILGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----
        +EWPSKYSKFFVE                       P SV PIS+LGPDALLEPM +D+F ES+ KKK+ IK LLLDQ YISGIGNW+ADEVLYQ     
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----

Query:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDS-KEMN
                             VIEKA+EV ADSS+FPS W+FH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G + +     R      K   
Subjt:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDS-KEMN

Query:  DEGAG----KLVSKTDKTADTKKKPKPK-GRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQS
        D+G G    +   K D++A +KK  KP+ GR KKP++  K+K++E +DDG D EAE +       G    +K+               +SE+  K T Q+
Subjt:  DEGAG----KLVSKTDKTADTKKKPKPK-GRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQS

Query:  SKNGMGRK
         K   GRK
Subjt:  SKNGMGRK

Arabidopsis top hits | e value | % identity
AT1G52500.1 MUTM homolog-1 | 3.5e-80 | 59.85
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ SILGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----
        +EWPSKYSKFFVE                       P SV PIS+LGPDALLEPM +D+F ES+ KKK+ IK LLLDQ YISGIGNW+ADEVLYQ     
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----

Query:  ---------------------VIEKALEVGADSSRFPSNWLFHSR-EKKPGKAFVDGKEIHFIT
                             VI+ A++V ADS  FP  WLFH R  KK GK  V+GK  H ++
Subjt:  ---------------------VIEKALEVGADSSRFPSNWLFHSR-EKKPGKAFVDGKEIHFIT

AT1G52500.2 MUTM homolog-1 | 8.0e-101 | 54.17
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ SILGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----
        +EWPSKYSKFFVE                       P SV PIS+LGPDALLEPM +D+F ES+ KKK+ IK LLLDQ YISGIGNW+ADEVLYQ     
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQ-----

Query:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDS-KEMN
                             VIEKA+EV ADSS+FPS W+FH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G + +     R      K   
Subjt:  ---------------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDS-KEMN

Query:  DEGAG----KLVSKTDKTADTKKKPKPK-GRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQS
        D+G G    +   K D++A +KK  KP+ GR KKP++  K+K++E +DDG D EAE +       G    +K+               +SE+  K T Q+
Subjt:  DEGAG----KLVSKTDKTADTKKKPKPK-GRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQS

Query:  SKNGMGRK
         K   GRK
Subjt:  SKNGMGRK
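
In the alignments above, the unlabeled middle line of each block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below counts identities and positives for a single alignment segment using that convention. The helper name `midline_stats` is hypothetical, and the result covers only the one segment shown, so it is not expected to reproduce the % identity column, which summarises the whole alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment segment.

    Assumes BLAST-style midline symbols: a letter for an identical residue,
    '+' for a conservative substitution, a space for anything else.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for symbol in midline if symbol.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Last segment of the first GenBank hit (KAA0044473.1), copied from this page.
query   = "KKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK"
midline = "KKTNIGQ  DAASE EKSLKQTVQSS+NG  RKKAK"

identities, positives, columns = midline_stats(query, midline)
print(f"{identities}/{columns} identical ({100 * identities / columns:.2f}%), "
      f"{positives}/{columns} positive")
```

For this 36-column segment the count is 30 identities and 31 positives, roughly 83% identity within the segment.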


Sequences
CDS sequence
ATGCCGGAGCTACCGGAGGTGGAGGCGGCGAGGAGAGCAATAGAAGAGCATTGCGTTGGGAAAGTGATTAAGAAGGCGGTGATAGCCGACGATACGAAGGTCATCGACGG
CGTATCGCCTTCCGATTTCGAGAATTCCATCTTAGGCAAAACCATTCTCTCTGCCCATCGTAAGGGCAAGCACCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTCGGGATGGCGGGTGCCATATATATTAAAGGTGTAGCTGTCACAAACTATAAAAGGTCTATGGTTAATGATGATGACGAGTGGCCTTCCAAGTACTCTAAGTTC
TTTGTTGAGCCAGCTTCAGTGCCCCCAATATCTAAGCTTGGCCCAGATGCTCTTCTAGAGCCTATGGCATTGGATGATTTTATTGAATCCATGGGCAAGAAGAAACTGGC
GATTAAGACTCTATTGCTTGATCAGAGCTACATTTCCGGTATTGGCAATTGGGTTGCAGATGAAGTGCTATATCAAGTAATTGAAAAAGCGCTTGAAGTTGGAGCAGATA
GTAGTCGGTTCCCTAGTAATTGGCTTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAAGGAAATCCACTTTATTACTACAGGCGGCAGGACGTCG
GCCTTTGTACCCGAGTTGCAAAAGCTTACTGGAGCTGAACCAAAAAATCAAACTTCAAAGAGAAAAGGCAATGATAGCAAAGAAATGAATGATGAGGGTGCTGGTAAATT
AGTGAGCAAGACAGATAAAACTGCTGATACTAAGAAAAAGCCAAAGCCTAAAGGTCGCTCTAAGAAACCTTCAACAAAAAGAAAATCCAAAAGCGATGAAGACAACGACG
ATGGCTCTGATGAGGAAGCTGAAAATGATGATGCTAGTGATGATGACAATGGTCGAGGTGTTGGAATGAAGAAAGTGGGAAAGAAAACGAACATTGGGCAAATGGTTGAT
GCTGCTTCTGAATCAGAGAAGTCTTTGAAACAAACGGTTCAGAGCAGTAAAAATGGTATGGGGAGGAAGAAAGCAAAGTAA
mRNA sequence
GAAAGGTTAAAATGTGAAATTTGAAAGAATTCATTCACGAGTCTCGGTCAGGGCAGCGCTAACTCGTCTGCCAGTATCATTCTACTAGTCCTAGCATCGTTCACCACAAG
CAGCGCCAAAACTTCGTTCCATTTTCCGACCACCGCAAGATGCCGGAGCTACCGGAGGTGGAGGCGGCGAGGAGAGCAATAGAAGAGCATTGCGTTGGGAAAGTGATTAA
GAAGGCGGTGATAGCCGACGATACGAAGGTCATCGACGGCGTATCGCCTTCCGATTTCGAGAATTCCATCTTAGGCAAAACCATTCTCTCTGCCCATCGTAAGGGCAAGC
ACCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCATTTCACTTCGGGATGGCGGGTGCCATATATATTAAAGGTGTAGCTGTCACAAACTATAAAAGGTCTATGGTT
AATGATGATGACGAGTGGCCTTCCAAGTACTCTAAGTTCTTTGTTGAGCCAGCTTCAGTGCCCCCAATATCTAAGCTTGGCCCAGATGCTCTTCTAGAGCCTATGGCATT
GGATGATTTTATTGAATCCATGGGCAAGAAGAAACTGGCGATTAAGACTCTATTGCTTGATCAGAGCTACATTTCCGGTATTGGCAATTGGGTTGCAGATGAAGTGCTAT
ATCAAGTAATTGAAAAAGCGCTTGAAGTTGGAGCAGATAGTAGTCGGTTCCCTAGTAATTGGCTTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGT
AAGGAAATCCACTTTATTACTACAGGCGGCAGGACGTCGGCCTTTGTACCCGAGTTGCAAAAGCTTACTGGAGCTGAACCAAAAAATCAAACTTCAAAGAGAAAAGGCAA
TGATAGCAAAGAAATGAATGATGAGGGTGCTGGTAAATTAGTGAGCAAGACAGATAAAACTGCTGATACTAAGAAAAAGCCAAAGCCTAAAGGTCGCTCTAAGAAACCTT
CAACAAAAAGAAAATCCAAAAGCGATGAAGACAACGACGATGGCTCTGATGAGGAAGCTGAAAATGATGATGCTAGTGATGATGACAATGGTCGAGGTGTTGGAATGAAG
AAAGTGGGAAAGAAAACGAACATTGGGCAAATGGTTGATGCTGCTTCTGAATCAGAGAAGTCTTTGAAACAAACGGTTCAGAGCAGTAAAAATGGTATGGGGAGGAAGAA
AGCAAAGTAAGTTTATGTCCCAACACATCATATATAGTTTTTTTTTTTGGGTTAGTAATGAGAACATCTATCAGTAGTAGAACTTGCCCTTTTGTCATTACTGTTAATCC
TGACCGTTGATGTAGGCATGCCTTTTGTATCATCCGTTCATTTTGAAGAAATGTTGATAAGAAGAATGAAAATGTAGGAGCATTTTAAATTCGGATTTAATGTTAGATTT
ACATATTTTCTTAAATATTAAATT
Protein sequence
MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDDDEWPSKYSKF
FVEPASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTS
AFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTKKKPKPKGRSKKPSTKRKSKSDEDNDDGSDEEAENDDASDDDNGRGVGMKKVGKKTNIGQMVD
AASESEKSLKQTVQSSKNGMGRKKAK
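
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The Python sketch below does this for the first 60 and the last 18 nucleotides of the CDS as given on this page. The `translate` helper is a hypothetical illustration written from scratch rather than a CuGenDB or library function, and it assumes the standard nuclear code applies.

```python
from itertools import product

# Standard genetic code, built in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 60 nt and last 18 nt of the CDS sequence shown on this page.
cds_start = "ATGCCGGAGCTACCGGAGGTGGAGGCGGCGAGGAGAGCAATAGAAGAGCATTGCGTTGGG"
cds_end = "AGGAAGAAAGCAAAGTAA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

The two translated fragments, `MPELPEVEAARRAIEEHCVG` and `RKKAK`, agree with the first 20 and last 5 residues of the protein sequence listed above.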