CuGenDBv2

Sed0019924 (gene) of Chayote v1 genome

Gene ID: Sed0019924
Organism: Sechium edule (Chayote v1)
Description: DNA glycosylase superfamily protein
Genome location: LG01:19927166..19930418
RNA-Seq Expression: Sed0019924
Synteny: Sed0019924
Gene Ontology terms:
  GO:0006284 - base-excision repair (biological process)
  GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
  IPR005019 - Methyladenine glycosylase
  IPR011257 - DNA glycosylase
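The genome location field packs a sequence name and a coordinate range into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the field can be split and the gene span computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a location string such as 'LG01:19927166..19930418'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG01:19927166..19930418")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3253 bases on LG01.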


Homology
GenBank top hits (e value, % identity, alignment)
KAG6608520.1 hypothetical protein SDJN03_01862, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.1e-149 | % identity: 83.13
Query:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG
        MSVATKLQSHA+PV+ESRAILGPGGNRDRAP+KPK KQEI K+T KQNKA+PVV+ES VRDNVSVGSSCSSDSLSSNYSAK LN + KP  KPVK ++AG
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG

Query:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA
        G+ANAT T P +S+ GKRCDWITPYSDPLYIAFHDEEWGVP +DDKKLFELLVLSQALAELTWPLIL+KR IFRKVFNDFDP SIA+F+E EFTTL  NA
Subjt:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA

Query:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ
         QLLS+QKLRAIVENANQVLKIQQEFGSFSNYCW+FVNKKPIR R+R+ RQVPVKTPKAEFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFR Q
Subjt:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ

Query:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS
        ECDA+VK + KLR E RRSELL   LEKS L+
Subjt:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS

XP_022940560.1 uncharacterized protein LOC111446125 [Cucurbita moschata] | e value: 3.1e-149 | % identity: 83.13
Query:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG
        MSVATKLQSHA+PV+ESRAILGPGGNRDRAP+KPK KQEI K+T KQNKA+PVV+ES VRDNVSVGSSCSSDSLSSNYSAK LN + KP  KPVK ++AG
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG

Query:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA
        G+ANAT T P +S+ GKRCDWITPYSDPLYIAFHDEEWGVP +DDKKLFELLVLSQALAELTWPLIL+KR IFRKVFNDFDP SIA F+E EFTTL  NA
Subjt:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA

Query:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ
         QLLS+QKLRAIVENANQVLKIQQEFGSFSNYCW+FVNKKPIR R+R+ RQVPVKTPKAEFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFR Q
Subjt:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ

Query:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS
        ECDA+VK++ KLR E RRSELL   LEKS L+
Subjt:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS

XP_022981194.1 uncharacterized protein LOC111480412 [Cucurbita maxima] | e value: 4.5e-148 | % identity: 82.23
Query:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG
        MSVATKLQSHA+PV+ESRAILGPGGNRDRAP+KPK KQEI K+T KQNKA+PVV+ES VRDN+SVGSSCSSDSLSSNYSAK LN + KP  KPVK ++AG
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG

Query:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA
        G+ANAT T P + + GKRCDWITPYSDPLYIAFHDEEWGVP +DDKKLFELLVLSQALAELTWPLIL+KR IFR VFNDFDP SIA+F+E EFTTL  NA
Subjt:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA

Query:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ
         QLLS+QKLRAIVENANQVLKIQQEFGSFSNYCW+FVNKKPIR R+R+ RQVPVKTPKAEFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFR Q
Subjt:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ

Query:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS
        ECDA+VK++ KLR E RRSELL   LEKS L+
Subjt:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS

XP_023523621.1 uncharacterized protein LOC111787800 [Cucurbita pepo subsp. pepo] | e value: 1.8e-149 | % identity: 83.13
Query:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG
        MSVATKLQSHA+PV+ESRAILGPGGNRDRAP+KPK KQEI K+T KQNKA+PVV+ES VRDNVSVGSSCSSDSLSSNYSAK LN + KP  KPVK ++AG
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG

Query:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA
        G+ANAT T P +S+ GKRCDWITPYSDPLYIAFHDEEWGVP +DDKKLFELLVLSQALAELTWPLIL+KR IFRKVFNDFDP SIA+F+E EFTTL  NA
Subjt:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA

Query:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ
         QLLS+QKLRAIVENANQVLKIQQEFGSFSNYCW+FVNKKPIR R+R+ RQVPVKTPKAEFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFR Q
Subjt:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ

Query:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS
        ECDA+VK++ KLR E RRSELL   LEKS L+
Subjt:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida] | e value: 1.4e-146 | % identity: 79.34
Query:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPK-LKPVKPLSA
        MSVATKLQSHAKPV+ESR ILGPGGNRDRAP+KPK KQ+  KKT KQN+A+P+++ES +RDNVSVGSSCSSDS+SSNYSAK L P+VKP  +KPVK ++A
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPK-LKPVKPLSA

Query:  GGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--N
        GG+ NAT+  P +S+ GKRCDWIT +SDPLYIAFHDEEWGVP +DDKKLFELLVLSQALAELTWPLIL+KRDIFRKV NDFDP +IA+F+ENEFTTL  N
Subjt:  GGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--N

Query:  AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRN
        AIQLLSE KLRAIVENANQVLKIQQEFGSFSNYCW+FVNKKPIR  FR+ RQVPVKTPKAEFMSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLV+CFR 
Subjt:  AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRN

Query:  QECDANVKEETKLRTEVRRSELLTGVLEKSCLSR
        QECDA +K++ KLR E +RSE LTG LEK CL+R
Subjt:  QECDANVKEETKLRTEVRRSELLTGVLEKSCLSR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CHU1 uncharacterized protein LOC111011623 | e value: 9.8e-141 | % identity: 77.98
Query:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEIS-KKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKP-KLKPVKPLS
        M VA K +SHAKPV+ESRAILGPGGNRDR P+KP+ K E +  KT KQNKA+P V +S +RDNVSVGSSCSSDSLSSNYSAK LNP+VKP  +KPVK ++
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEIS-KKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKP-KLKPVKPLS

Query:  AGGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--
        AG EA+AT T P  S+  KRCDWITPYSDPLYIAFHDEEWGVP +DDKKLFELLVLSQALAELTWP IL+KRD+FRKVFNDFDP SIAKF+ENEF TL  
Subjt:  AGGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--

Query:  NAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR
        N IQ+L+E KLRAIVENANQVLKIQQEFGSFSNYCW+FVNKKPIR RFR+ARQVPVKTPKAE MSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLV+CFR
Subjt:  NAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR

Query:  NQECDANVKEETKLRTEVRRSELLTGVLEKSCLSRS
         QECDANVK++ K R E  R EL  G  EK CLSRS
Subjt:  NQECDANVKEETKLRTEVRRSELLTGVLEKSCLSRS

A0A6J1F6S0 uncharacterized protein LOC111442674 | e value: 2.0e-141 | % identity: 78.96
Query:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKP-KLKPVKPLSA
        MSVATKL SHAKPV+ESRAILGPGGNRDRAP+KPK KQE  K + KQNKA+P + ES VRDN+S+GSSCSSDSLSSNYS K LNP+VKP  +KPVK ++A
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKP-KLKPVKPLSA

Query:  GGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--N
        GG+ N T T P +S+ GKRC WITPYSDPLYIAFHDEEWGVP +DD+KLFELLVLSQALAELTWPLIL KRDIFRKVFNDFDP +IA+F++NEFTTL  N
Subjt:  GGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--N

Query:  AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRN
         IQLLSE KLRAIVENANQVLKIQQEFG+FSNYCW+FVNKKPI  RFR+ARQVPVKTPKAEFMSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLV+CFR 
Subjt:  AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRN

Query:  QECDANVKEETKLRTEVRRSELLTGVLE
        QECD       KLR E +RSELLTG LE
Subjt:  QECDANVKEETKLRTEVRRSELLTGVLE

A0A6J1FIT9 uncharacterized protein LOC111446125 | e value: 1.5e-149 | % identity: 83.13
Query:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG
        MSVATKLQSHA+PV+ESRAILGPGGNRDRAP+KPK KQEI K+T KQNKA+PVV+ES VRDNVSVGSSCSSDSLSSNYSAK LN + KP  KPVK ++AG
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG

Query:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA
        G+ANAT T P +S+ GKRCDWITPYSDPLYIAFHDEEWGVP +DDKKLFELLVLSQALAELTWPLIL+KR IFRKVFNDFDP SIA F+E EFTTL  NA
Subjt:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA

Query:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ
         QLLS+QKLRAIVENANQVLKIQQEFGSFSNYCW+FVNKKPIR R+R+ RQVPVKTPKAEFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFR Q
Subjt:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ

Query:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS
        ECDA+VK++ KLR E RRSELL   LEKS L+
Subjt:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS

A0A6J1IHE9 uncharacterized protein LOC111476975 | e value: 1.2e-138 | % identity: 77.98
Query:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG
        MSVATKL SHAKPV+ESRAILGPGGNRDRAP+KPK KQE  K + KQNKA+P + ES +RDN+S+GSSCSSDSLSSN SAK LN    PK+KPVK ++AG
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG

Query:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA
        G+ N T T P +S+ GKRC WITPYSDPLYIAFHDEEWGVP +DD+KLFELLVLSQALAELTWPLIL KRDIFRKVFNDFDP +IA+F++NEFTTL  N 
Subjt:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA

Query:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ
        IQLLSE KLRAIVENANQVLKIQQEFG+FSNYCW+FVNKKPI  RFR+ARQ+PVKTPKAEFMSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLVDCFR Q
Subjt:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ

Query:  ECDANVKEETKLRTEVRRSELLTGVLE
        ECD       KLR E + SELLTG LE
Subjt:  ECDANVKEETKLRTEVRRSELLTGVLE

A0A6J1J188 uncharacterized protein LOC111480412 | e value: 2.2e-148 | % identity: 82.23
Query:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG
        MSVATKLQSHA+PV+ESRAILGPGGNRDRAP+KPK KQEI K+T KQNKA+PVV+ES VRDN+SVGSSCSSDSLSSNYSAK LN + KP  KPVK ++AG
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAG

Query:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA
        G+ANAT T P + + GKRCDWITPYSDPLYIAFHDEEWGVP +DDKKLFELLVLSQALAELTWPLIL+KR IFR VFNDFDP SIA+F+E EFTTL  NA
Subjt:  GEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NA

Query:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ
         QLLS+QKLRAIVENANQVLKIQQEFGSFSNYCW+FVNKKPIR R+R+ RQVPVKTPKAEFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFR Q
Subjt:  IQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQ

Query:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS
        ECDA+VK++ KLR E RRSELL   LEKS L+
Subjt:  ECDANVKEETKLRTEVRRSELLTGVLEKSCLS

SwissProt top hits (e value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 | e value: 2.2e-36 | % identity: 41.11
Query:  KRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NAIQLLSEQKLRAIVENA
        +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W  +L KR+ +R  F+ FDP+ +A   E +   L  +A  +    K++AI+ NA
Subjt:  KRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NAIQLLSEQKLRAIVENA

Query:  NQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDC
           L+++Q    F ++ W+FVN +P  T+     ++P  T  ++ +SK L +RGF+ VG T+ YSF+Q  G+VNDH+V C
Subjt:  NQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDC

P44321 DNA-3-methyladenine glycosylase | e value: 1.1e-32 | % identity: 40.78
Query:  RCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NAIQLLSEQKLRAIVENAN
        RC W+   S  +YI +HD+EWG P +D +KLFE + L    A L+W  +L KR+ +R+ F+ FDP  IAK +  +      N+  +    KL AIV+NA 
Subjt:  RCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NAIQLLSEQKLRAIVENAN

Query:  QVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDC
          L +++   +FS++ W+FVN KPI       R VP KT  ++ +SK L +RGF  +G T  Y+F+Q  G+V+DHL DC
Subjt:  QVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] | e value: 1.5e-37 | % identity: 41.75
Query:  RCDWITPYSD---PLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NAIQLLSEQKLRAIVE
        RC W T   +    LY  +HD EWG P ++DKKLFE LVL    A L+W  IL KR+ FR  F+DFDP  +A + E++   L  N   + +  K+ A + 
Subjt:  RCDWITPYSD---PLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTL--NAIQLLSEQKLRAIVE

Query:  NANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR-----NQECD
        NA   + +Q+EFGSF  Y W FV  KPI   F     +P  TP ++ ++KDL +RGF+ VG T +Y+ +Q  G+VNDHL  CF+       +CD
Subjt:  NANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR-----NQECD

Arabidopsis top hits (e value, % identity, alignment)
AT1G15970.1 DNA glycosylase superfamily protein | e value: 1.4e-67 | % identity: 44.57
Query:  MSVATKLQSHAKPVMESRAILGPGGNR-DRAPKKPKSKQEISKKT---WKQNKA---------------VPVVAESFVRDNVS---------VGSSCSSD
        MSV  + +S      E R++LGP GN+  R P   K ++ + +KT    K  KA                  +  S +R N +           SSC S 
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNR-DRAPKKPKSKQEISKKT---WKQNKA---------------VPVVAESFVRDNVS---------VGSSCSSD

Query:  SLS--SNYSAKSLNPRVKPKLKPVKPLSAGGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKR
         LS  S+ S K +  R    +   + LS G E    ++  C +   KRC WITP +DP Y+AFHDEEWGVP +DDKKLFELL LS ALAEL+W  IL++R
Subjt:  SLS--SNYSAKSLNPRVKPKLKPVKPLSAGGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKR

Query:  DIFRKVFNDFDPLSIAKFSENEFTT--LNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRG
         I R+VF DFDP+++A+ ++ + T     AI LLSE K+R+I++N+  V KI  E GS   Y WNFVN KP +++FR+ RQVPVKT KAEF+SKDL+RRG
Subjt:  DIFRKVFNDFDPLSIAKFSENEFTT--LNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRG

Query:  FRCVGPTVVYSFLQVSGIVNDHLVDCFRNQEC--DANVKEETKLRTEVRR
        FR V PTV+YSF+Q +G+ NDHL+ CFR Q+C  DA     TK + +  R
Subjt:  FRCVGPTVVYSFLQVSGIVNDHLVDCFRNQEC--DANVKEETKLRTEVRR

AT1G75090.1 DNA glycosylase superfamily protein | e value: 3.7e-84 | % identity: 53.14
Query:  MSVATKLQSHAKPVMESRAILGPGGNRDRA-----PKKPKSKQEISKK--TWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKP
        MS+ +KL+S  KP+ ESRAIL   GNR +       KKP+    ++K   T K +    V  +    D+ S  SS    S+++  S K   P  +  ++ 
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNRDRA-----PKKPKSKQEISKK--TWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKP

Query:  VKPLSAGGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEF
        +  + A       ++P  I    KRC WITP SDP+Y+ FHDEEWGVP  DDKKLFELLV SQALAE +WP IL +RD FRK+F +FDP +IA+F+E   
Subjt:  VKPLSAGGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEF

Query:  TTL--NAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHL
         +L  N   +LSEQKLRAIVENA  VLK++QEFGSFSNYCW FVN KP+R  +R+ RQVPVK+PKAE++SKD+M+RGFRCVGPTV+YSFLQ SGIVNDHL
Subjt:  TTL--NAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHL

Query:  VDCFRNQECDANVKEETK
          CFR QEC+   + ETK
Subjt:  VDCFRNQECDANVKEETK

AT1G80850.1 DNA glycosylase superfamily protein | e value: 1.5e-69 | % identity: 46.79
Query:  MSVATKLQSHAKPVMESRAILGPGGNR-DRAPKKPKSKQEISKK------TWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSS------NYSAKSLNPRV
        MS   +++S      E R++LGP GN+  + P     K+ +++K      T K  +  P+      R+ +S+ +S SSD+ SS      + ++ S   RV
Subjt:  MSVATKLQSHAKPVMESRAILGPGGNR-DRAPKKPKSKQEISKK------TWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSS------NYSAKSLNPRV

Query:  KPKLKPVKPLSA----GGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPL
          +   V   S+      E        C     KRC WITP SD  YIAFHDEEWGVP +DDK+LFELL LS ALAEL+W  IL+KR +FR+VF DFDP+
Subjt:  KPKLKPVKPLSA----GGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPL

Query:  SIAKFSENEFTT--LNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFL
        +I++ +  + T+  + A  LLSEQKLR+I+ENANQV KI   FGSF  Y WNFVN+KP +++FR+ RQVPVKT KAE +SKDL+RRGFR V PTV+YSF+
Subjt:  SIAKFSENEFTT--LNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFL

Query:  QVSGIVNDHLVDCFRNQECDANVKEET
        Q +G+ NDHL  CFR+ +C    K+ET
Subjt:  QVSGIVNDHLVDCFRNQECDANVKEET

AT5G57970.1 DNA glycosylase superfamily protein | e value: 6.9e-70 | % identity: 53.73
Query:  ESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRV--------KPKLKPVKPLSAGGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKK
        E  +  N+S+ +S SSD+   ++ +++   R+        + K  P KP S   E  A  +PP  S T KRC W+TP SDP YI FHDEEWGVP +DDK+
Subjt:  ESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRV--------KPKLKPVKPLSAGGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKK

Query:  LFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTLN--AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFR
        LFELLVLS ALAE TWP IL+KR  FR+VF DFDP +I K +E +       A  LLS+ KLRA++ENA Q+LK+ +E+GSF  Y W+FV  K I ++FR
Subjt:  LFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTLN--AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFR

Query:  HARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQEC
        + RQVP KTPKAE +SKDL+RRGFR VGPTVVYSF+Q +GI NDHL  CFR   C
Subjt:  HARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQEC

AT5G57970.2 DNA glycosylase superfamily protein | e value: 6.9e-70 | % identity: 53.73
Query:  ESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRV--------KPKLKPVKPLSAGGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKK
        E  +  N+S+ +S SSD+   ++ +++   R+        + K  P KP S   E  A  +PP  S T KRC W+TP SDP YI FHDEEWGVP +DDK+
Subjt:  ESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRV--------KPKLKPVKPLSAGGEANATMTPPCISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKK

Query:  LFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTLN--AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFR
        LFELLVLS ALAE TWP IL+KR  FR+VF DFDP +I K +E +       A  LLS+ KLRA++ENA Q+LK+ +E+GSF  Y W+FV  K I ++FR
Subjt:  LFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTLN--AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWNFVNKKPIRTRFR

Query:  HARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQEC
        + RQVP KTPKAE +SKDL+RRGFR VGPTVVYSF+Q +GI NDHL  CFR   C
Subjt:  HARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQEC
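In the alignments above, the unlabeled middle line appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of how such a middle line could be tallied into identity and positive counts. The function name `midline_stats` is hypothetical, and because the alignments here are shown in chunks and whitespace may not have survived extraction, the sketch is not expected to reproduce the listed % identity values exactly.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment chunk.

    Assumes BLAST-style midline symbols: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    # Trailing spaces of the midline are often trimmed; pad them back.
    padded = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in padded)
    positives = identities + padded.count("+")
    return identities, positives, len(query)


def percent_identity(query: str, midline: str) -> float:
    """Identities as a percentage of alignment columns (gap columns included)."""
    identities, _, columns = midline_stats(query, midline)
    return 100.0 * identities / columns


# Toy example using the first 15 columns of the first GenBank alignment.
print(midline_stats("MSVATKLQSHAKPVM", "MSVATKLQSHA+PV+"))
```

For the 15-column toy chunk, 13 columns are identical and all 15 are positives; a full-alignment figure would need every chunk of one hit concatenated first.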


Sequences
CDS sequence
ATGTCTGTGGCTACGAAGCTTCAATCGCACGCTAAACCGGTTATGGAGTCCCGAGCGATTCTTGGACCCGGCGGCAACAGAGATAGGGCTCCTAAGAAACCGAAATCCAA
ACAGGAGATCTCGAAGAAGACATGGAAGCAGAATAAGGCCGTTCCGGTGGTTGCCGAGTCGTTTGTTCGCGACAATGTCTCCGTTGGGAGTTCCTGCTCTTCCGATTCTT
TATCGAGCAATTATTCGGCCAAATCGTTGAATCCGAGAGTGAAGCCTAAGCTTAAGCCTGTGAAGCCTCTTTCTGCGGGCGGTGAAGCTAACGCTACCATGACGCCGCCT
TGCATTTCGATTACGGGGAAACGCTGTGATTGGATTACGCCTTATTCCGATCCACTCTACATCGCTTTTCATGATGAAGAATGGGGAGTCCCAGCTTATGACGACAAGAA
GCTGTTTGAATTACTTGTATTATCACAAGCCTTGGCAGAACTTACTTGGCCCTTGATTCTTACCAAGAGAGACATATTTAGGAAAGTCTTTAATGATTTTGACCCCTTAT
CCATTGCAAAGTTCTCAGAGAATGAATTTACAACACTGAATGCCATCCAGCTCCTGTCTGAACAAAAGCTTCGTGCAATTGTGGAGAATGCTAATCAAGTACTCAAGATT
CAACAGGAGTTTGGTTCCTTCAGCAACTATTGTTGGAACTTTGTTAACAAGAAGCCAATAAGAACCCGATTTCGACATGCTCGCCAAGTACCGGTGAAGACTCCAAAAGC
GGAGTTCATGAGCAAGGATTTGATGAGGAGAGGATTCCGTTGCGTTGGACCAACTGTGGTTTATTCCTTCTTGCAAGTTAGCGGAATTGTTAACGATCACTTGGTCGATT
GCTTCAGAAATCAAGAGTGCGATGCTAACGTAAAGGAGGAGACAAAATTAAGAACAGAAGTTCGGAGATCGGAGTTACTTACCGGAGTTTTGGAGAAGTCATGCTTGTCT
AGATCTTGA
mRNA sequence
CTCCCACGGAAGCTCAGCTCCTTCTCTCGCCACGTTCTTCTGTTCATCACTGCCAAACCCTAGGGTTTTCTCTGCAACTCTTTTGCTCTTTCACTATCAGAAATCACTTC
TTTTTTCTACATTTCGCCTTCAAATCGTACAATCATTTTGTAACTCTGGAGAATTTGACGGTTCAAGAGCTTGCTCCGTTTTGATCCTGGTCGAATTCGATTCTGGAAGG
AATCCTTAGCGATGTCTGTGGCTACGAAGCTTCAATCGCACGCTAAACCGGTTATGGAGTCCCGAGCGATTCTTGGACCCGGCGGCAACAGAGATAGGGCTCCTAAGAAA
CCGAAATCCAAACAGGAGATCTCGAAGAAGACATGGAAGCAGAATAAGGCCGTTCCGGTGGTTGCCGAGTCGTTTGTTCGCGACAATGTCTCCGTTGGGAGTTCCTGCTC
TTCCGATTCTTTATCGAGCAATTATTCGGCCAAATCGTTGAATCCGAGAGTGAAGCCTAAGCTTAAGCCTGTGAAGCCTCTTTCTGCGGGCGGTGAAGCTAACGCTACCA
TGACGCCGCCTTGCATTTCGATTACGGGGAAACGCTGTGATTGGATTACGCCTTATTCCGATCCACTCTACATCGCTTTTCATGATGAAGAATGGGGAGTCCCAGCTTAT
GACGACAAGAAGCTGTTTGAATTACTTGTATTATCACAAGCCTTGGCAGAACTTACTTGGCCCTTGATTCTTACCAAGAGAGACATATTTAGGAAAGTCTTTAATGATTT
TGACCCCTTATCCATTGCAAAGTTCTCAGAGAATGAATTTACAACACTGAATGCCATCCAGCTCCTGTCTGAACAAAAGCTTCGTGCAATTGTGGAGAATGCTAATCAAG
TACTCAAGATTCAACAGGAGTTTGGTTCCTTCAGCAACTATTGTTGGAACTTTGTTAACAAGAAGCCAATAAGAACCCGATTTCGACATGCTCGCCAAGTACCGGTGAAG
ACTCCAAAAGCGGAGTTCATGAGCAAGGATTTGATGAGGAGAGGATTCCGTTGCGTTGGACCAACTGTGGTTTATTCCTTCTTGCAAGTTAGCGGAATTGTTAACGATCA
CTTGGTCGATTGCTTCAGAAATCAAGAGTGCGATGCTAACGTAAAGGAGGAGACAAAATTAAGAACAGAAGTTCGGAGATCGGAGTTACTTACCGGAGTTTTGGAGAAGT
CATGCTTGTCTAGATCTTGACATGTTTAAAAGGTTCTTCTTCTCTCTAACTAATCTTACAATTACTGCAATAAAATGATTCTGATGATCTGATGCTGGGTAGTTTGCTTT
TGTTTGTTTGTTCAGATTTTTTATAATTGCTTCTTTGTAGATTTAAGGTGGTGATGTTAACATTTATGTTATTGTAATTTGTATTAACAGAAGACAGGCATGTTGGTTTC
TTGTAAGTATTTCTGGAACTGCAGAACTTGCTTCCATATTAGAGCAGAAGAATAGGGTATGCAAATATGGG
Protein sequence
MSVATKLQSHAKPVMESRAILGPGGNRDRAPKKPKSKQEISKKTWKQNKAVPVVAESFVRDNVSVGSSCSSDSLSSNYSAKSLNPRVKPKLKPVKPLSAGGEANATMTPP
CISITGKRCDWITPYSDPLYIAFHDEEWGVPAYDDKKLFELLVLSQALAELTWPLILTKRDIFRKVFNDFDPLSIAKFSENEFTTLNAIQLLSEQKLRAIVENANQVLKI
QQEFGSFSNYCWNFVNKKPIRTRFRHARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRNQECDANVKEETKLRTEVRRSELLTGVLEKSCLS
RS
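The CDS and protein sequences above should correspond codon by codon. As a minimal sketch, assuming the standard genetic code (the record does not name a translation table), the relationship can be checked with a small translator. The function `translate` and the table construction are illustrative helpers, not part of the database.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped CDS lines can be
    pasted in as one multi-line string. Codons containing characters other
    than A, C, G, T raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons and last three codons of the CDS listed above.
print(translate("ATGTCTGTGGCTACGAAGCTTCAATCGCAC"))
print(translate("AGATCTTGA"))
```

The first ten codons give MSVATKLQSH and the final codons AGA TCT TGA give RS followed by a stop, which agrees with the start and end of the listed protein sequence.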