; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G006770 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G006770
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationCmo_Chr02:4299163..4300620
RNA-Seq ExpressionCmoCh02G006770
SyntenyCmoCh02G006770
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605362.1 hypothetical protein SDJN03_02679, partial [Cucurbita argyrosperma subsp. sororia]4.3e-18696.05Show/hide
Query:  MCRSDQALE---------STSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPV-AANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAV
        MCRSDQALE         STS+RLLHRRNSLNKHPSP+PNLTSTSD+ILLPV AANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAV
Subjt:  MCRSDQALE---------STSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPV-AANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAV

Query:  RREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDF
        RREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHD+EWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDF
Subjt:  RREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDF

Query:  RNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG
        RNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG
Subjt:  RNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG

Query:  PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
        PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
Subjt:  PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV

XP_022947742.1 uncharacterized protein LOC111451515 isoform X1 [Cucurbita moschata]1.1e-18998.57Show/hide
Query:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
        MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
Subjt:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA

Query:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS
        QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN     SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS
Subjt:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS

Query:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
        DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
Subjt:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH

Query:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
        SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
Subjt:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV

XP_022947743.1 uncharacterized protein LOC111451515 isoform X2 [Cucurbita moschata]1.5e-191100Show/hide
Query:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
        MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
Subjt:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA

Query:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAE
        QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAE
Subjt:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAE

Query:  VVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQA
        VVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQA
Subjt:  VVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQA

Query:  AGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
        AGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
Subjt:  AGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV

XP_023007196.1 uncharacterized protein LOC111499758 isoform X3 [Cucurbita maxima]2.1e-18093.41Show/hide
Query:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
        MCRSDQALESTSNRLLHRRNSLN+HPSP+PN+TSTSD ILLP+AANGGSLSRPRPALD KKSKSFK GGNGNV SDN AEVASPGSIAAVRREQVALQQA
Subjt:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA

Query:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS
        QRKMRIAHYGRSKSARFEKIVPFDSKIK VVE+RRCSFITPN     SDPIYVAYHD+EWGVPVHDDQ LFELLVLSVAQVGSDWTSILKKRQDFRNAFS
Subjt:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS

Query:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
        DFDA+VVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKS HKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
Subjt:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH

Query:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
        SFMQAAGLTNDHLTTCHRHLHCTLIAAGR APP EVEET TGAAGSEAV
Subjt:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV

XP_023007199.1 uncharacterized protein LOC111499759 isoform X3 [Cucurbita maxima]1.0e-17993.41Show/hide
Query:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
        MCRSDQALESTSNRLLHRRNSLNKHPSP+PN+TSTSD ILLP+AANG SLSRPRPALD KKSKSFK GGNGNV  DN AEVASPGSIAAVRREQVALQQA
Subjt:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA

Query:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS
        QRKMRIAHYGRSKSARFEKIVPFDSKIK VVE+RRCSFITPN     SDPIYVAYHD+EWGVPVHDDQ LFELLVLSVAQVGSDWTSILKKRQDFRNAFS
Subjt:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS

Query:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
        DFDA+VVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKS HKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
Subjt:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH

Query:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
        SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPP EVEET TGAAGSEAV
Subjt:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV

TrEMBL top hitse value%identityAlignment
A0A6J1G7A4 uncharacterized protein LOC111451515 isoform X15.3e-19098.57Show/hide
Query:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
        MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
Subjt:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA

Query:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS
        QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN     SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS
Subjt:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS

Query:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
        DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
Subjt:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH

Query:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
        SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
Subjt:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV

A0A6J1G8B3 uncharacterized protein LOC111451515 isoform X27.4e-192100Show/hide
Query:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
        MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
Subjt:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA

Query:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAE
        QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAE
Subjt:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAE

Query:  VVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQA
        VVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQA
Subjt:  VVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQA

Query:  AGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
        AGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
Subjt:  AGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV

A0A6J1KZV5 uncharacterized protein LOC111499758 isoform X31.0e-18093.41Show/hide
Query:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
        MCRSDQALESTSNRLLHRRNSLN+HPSP+PN+TSTSD ILLP+AANGGSLSRPRPALD KKSKSFK GGNGNV SDN AEVASPGSIAAVRREQVALQQA
Subjt:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA

Query:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS
        QRKMRIAHYGRSKSARFEKIVPFDSKIK VVE+RRCSFITPN     SDPIYVAYHD+EWGVPVHDDQ LFELLVLSVAQVGSDWTSILKKRQDFRNAFS
Subjt:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS

Query:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
        DFDA+VVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKS HKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
Subjt:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH

Query:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
        SFMQAAGLTNDHLTTCHRHLHCTLIAAGR APP EVEET TGAAGSEAV
Subjt:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV

A0A6J1L2B3 uncharacterized protein LOC111499758 isoform X22.5e-17990.3Show/hide
Query:  MCRSDQALES-----------------TSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVAS
        MCRSDQALES                 TSNRLLHRRNSLN+HPSP+PN+TSTSD ILLP+AANGGSLSRPRPALD KKSKSFK GGNGNV SDN AEVAS
Subjt:  MCRSDQALES-----------------TSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVAS

Query:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSI
        PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIK VVE+RRCSFITPNSDPIYVAYHD+EWGVPVHDDQ LFELLVLSVAQVGSDWTSI
Subjt:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSI

Query:  LKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIR
        LKKRQDFRNAFSDFDA+VVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKS HKIPVKTSKSDTISKDMIR
Subjt:  LKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIR

Query:  RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
        RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR APP EVEET TGAAGSEAV
Subjt:  RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV

A0A6J1L4A7 uncharacterized protein LOC111499759 isoform X35.0e-18093.41Show/hide
Query:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA
        MCRSDQALESTSNRLLHRRNSLNKHPSP+PN+TSTSD ILLP+AANG SLSRPRPALD KKSKSFK GGNGNV  DN AEVASPGSIAAVRREQVALQQA
Subjt:  MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQA

Query:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS
        QRKMRIAHYGRSKSARFEKIVPFDSKIK VVE+RRCSFITPN     SDPIYVAYHD+EWGVPVHDDQ LFELLVLSVAQVGSDWTSILKKRQDFRNAFS
Subjt:  QRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPN-----SDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFS

Query:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
        DFDA+VVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKS HKIPVKTSKSDTISKDMIRRGFRSVGPTVVH
Subjt:  DFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVH

Query:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
        SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPP EVEET TGAAGSEAV
Subjt:  SFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 12.6e-3236.87Show/hide
Query:  RCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINR--VRGVVDNAI
        RC ++  + DP+Y+AYHD EWGVP  D + LFE++ L   Q G  W ++LKKR+++R  F  FD   VA   +  +  +  + G+  +R  ++ ++ NA 
Subjt:  RCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINR--VRGVVDNAI

Query:  RILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTC
          L++++       ++W F+N+ P      +  +IP  TS SD +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Subjt:  RILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTC

P44321 DNA-3-methyladenine glycosylase1.9e-2735.2Show/hide
Query:  RCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVR--GVVDNAI
        RC ++   S  IY+ YHD+EWG P  D Q LFE + L   Q G  W ++LKKR+ +R AF  FD + +A  +   + +     G+  +R +   +V NA 
Subjt:  RCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVR--GVVDNAI

Query:  RILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTC
          L ++K   +   +IW F+N+ P          +P KT  S  +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Subjt:  RILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]1.7e-3941.36Show/hide
Query:  KGVVEDRRCSFITPNSD---PIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINR-
        +GV E  RC++ T   +    +Y  YHD EWG P+H+D+ LFE LVL   Q G  W +ILKKR+ FR AF DFD  +VAN+ + ++  +    G+  NR 
Subjt:  KGVVEDRRCSFITPNSD---PIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINR-

Query:  -VRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHR
         +   + NA   + +++EFGS +KYIWGF+   P    ++S   +P  T  SD I+KD+ +RGF+ VG T +++ MQ+ G+ NDHLT+C +
Subjt:  -VRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHR

Arabidopsis top hitse value%identityAlignment
AT1G75090.1 DNA glycosylase superfamily protein4.2e-5440.3Show/hide
Query:  TKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVP
        TK   + K   N +V +D+++  +S    ++V            K        +  A    +     KI G V  +RC +ITPNSDPIYV +HD+EWGVP
Subjt:  TKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVP

Query:  VHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN--RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNP
        V DD+ LFELLV S A     W SIL++R DFR  F +FD   +A F++++++S+     + ++  ++R +V+NA  +L++K+EFGS   Y W F+N+ P
Subjt:  VHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN--RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNP

Query:  FSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTL
            Y+ G ++PVK+ K++ ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT C R+  C +
Subjt:  FSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTL

AT3G12710.1 DNA glycosylase superfamily protein2.1e-9360.54Show/hide
Query:  PSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA---RFEKIVP
        PS   +L   S+S L   +  G   ++ R +L+ KKSKSFK G +      +     +PGSIAAVRREQVA QQA RK++IAHYGRSKS       K+VP
Subjt:  PSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA---RFEKIVP

Query:  FDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN
          +        +RCSF+TP SDPIYVAYHD+EWGVPVHDD+ LFELL LS AQVGSDWTS L+KR D+R AF +F+AEVVA  ++++M +IS EY ++++
Subjt:  FDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN

Query:  RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIA
        +VRGVV+NA +I+EIKK F SLEKY+WGF+N+ P S +YK GHKIPVKTSKS++ISKDM+RRGFR VGPTVVHSFMQAAGLTNDHL TC RH  CTL+A
Subjt:  RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIA

AT5G44680.1 DNA glycosylase superfamily protein2.3e-8452.54Show/hide
Query:  LESTSNRL--LHRRNSLNKH-PSPTPNLTSTSDS------ILLPVAANGGSLSRP----RPALDTKKSKSFKL------GGNGNVVSDNAAEVASPGSIA
        L+  SN++  L RRNSL K  P P   + S   S      I  P++ N  SL +P    +  L +  +KS  +       G    V         PGSIA
Subjt:  LESTSNRL--LHRRNSLNKH-PSPTPNLTSTSDS------ILLPVAANGGSLSRP----RPALDTKKSKSFKL------GGNGNVVSDNAAEVASPGSIA

Query:  AVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKR
        A RRE+VA++Q +RK +I+HYGR KS +  EK +  + + K     +RCSFIT +SDPIYVAYHD+EWGVPVHDD  LFELLVL+ AQVGSDWTS+LK+R
Subjt:  AVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKR

Query:  QDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFR
          FR AFS F+AE+VA+F+++++ SI ++YG+++++V  VVDNA +IL++K++ GS  KYIWGFM + P +  Y S  KIPVKTSKS+TISKDM+RRGFR
Subjt:  QDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFR

Query:  SVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA
         VGPTV+HS MQAAGLTNDHL TC RHL CT +AA
Subjt:  SVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA

AT5G57970.1 DNA glycosylase superfamily protein4.5e-5649.49Show/hide
Query:  DSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN-
        DS   G    +RC+++TPNSDP Y+ +HD+EWGVPVHDD+ LFELLVLS A     W +IL KRQ FR  F+DFD   +   ++++++   S     ++ 
Subjt:  DSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN-

Query:  -RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC
         ++R V++NA +IL++ +E+GS +KYIW F+ N      ++   ++P KT K++ ISKD++RRGFRSVGPTVV+SFMQAAG+TNDHLT+C R  HC
Subjt:  -RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC

AT5G57970.2 DNA glycosylase superfamily protein4.5e-5649.49Show/hide
Query:  DSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN-
        DS   G    +RC+++TPNSDP Y+ +HD+EWGVPVHDD+ LFELLVLS A     W +IL KRQ FR  F+DFD   +   ++++++   S     ++ 
Subjt:  DSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN-

Query:  -RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC
         ++R V++NA +IL++ +E+GS +KYIW F+ N      ++   ++P KT K++ ISKD++RRGFRSVGPTVV+SFMQAAG+TNDHLT+C R  HC
Subjt:  -RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCGTTCAGACCAAGCCTTGGAATCCACTTCCAATCGCCTCCTCCACCGCCGTAATTCCCTTAACAAACACCCTTCCCCCACCCCCAACCTCACCTCCACCTCTGA
CAGCATTCTCCTTCCGGTCGCCGCTAACGGCGGCTCTCTGTCTCGCCCCCGCCCTGCCTTGGATACGAAGAAATCCAAAAGCTTCAAGCTTGGGGGAAATGGGAATGTGG
TTTCTGATAATGCTGCTGAAGTCGCGTCGCCGGGGAGCATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAGAATTGCCCATTATGGA
CGGTCTAAATCCGCCCGGTTTGAGAAAATTGTTCCCTTTGATTCTAAAATTAAAGGCGTTGTTGAAGATAGAAGATGCAGCTTCATCACTCCCAACTCAGATCCCATTTA
TGTGGCCTATCATGACCAAGAATGGGGCGTCCCTGTTCATGATGACCAAGCACTGTTTGAACTGCTGGTTCTGAGTGTGGCCCAAGTGGGTTCGGATTGGACTTCAATTT
TGAAGAAACGACAAGATTTCAGAAATGCATTTTCAGATTTCGATGCAGAAGTGGTGGCGAATTTTTCCGACAGACAGATGGTTTCAATCAGCTCAGAGTATGGAATGGAC
ATAAACAGAGTCCGAGGAGTGGTCGACAACGCAATCCGGATCCTGGAGATTAAGAAGGAATTTGGGTCACTGGAGAAATACATTTGGGGGTTTATGAACAACAACCCATT
CTCACCGCACTACAAATCCGGCCACAAAATCCCGGTCAAGACATCAAAATCAGACACCATAAGCAAAGACATGATCCGGCGAGGATTCCGGTCTGTCGGTCCGACCGTGG
TCCATTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTCACCACCTGCCACAGGCACCTGCACTGCACATTAATCGCCGCCGGCCGCCGCGCTCCACCGGCGGAA
GTGGAGGAGACGGCGACAGGTGCGGCAGGCTCGGAAGCTGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGTCGTTCAGACCAAGCCTTGGAATCCACTTCCAATCGCCTCCTCCACCGCCGTAATTCCCTTAACAAACACCCTTCCCCCACCCCCAACCTCACCTCCACCTCTGA
CAGCATTCTCCTTCCGGTCGCCGCTAACGGCGGCTCTCTGTCTCGCCCCCGCCCTGCCTTGGATACGAAGAAATCCAAAAGCTTCAAGCTTGGGGGAAATGGGAATGTGG
TTTCTGATAATGCTGCTGAAGTCGCGTCGCCGGGGAGCATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAGAATTGCCCATTATGGA
CGGTCTAAATCCGCCCGGTTTGAGAAAATTGTTCCCTTTGATTCTAAAATTAAAGGCGTTGTTGAAGATAGAAGATGCAGCTTCATCACTCCCAACTCAGATCCCATTTA
TGTGGCCTATCATGACCAAGAATGGGGCGTCCCTGTTCATGATGACCAAGCACTGTTTGAACTGCTGGTTCTGAGTGTGGCCCAAGTGGGTTCGGATTGGACTTCAATTT
TGAAGAAACGACAAGATTTCAGAAATGCATTTTCAGATTTCGATGCAGAAGTGGTGGCGAATTTTTCCGACAGACAGATGGTTTCAATCAGCTCAGAGTATGGAATGGAC
ATAAACAGAGTCCGAGGAGTGGTCGACAACGCAATCCGGATCCTGGAGATTAAGAAGGAATTTGGGTCACTGGAGAAATACATTTGGGGGTTTATGAACAACAACCCATT
CTCACCGCACTACAAATCCGGCCACAAAATCCCGGTCAAGACATCAAAATCAGACACCATAAGCAAAGACATGATCCGGCGAGGATTCCGGTCTGTCGGTCCGACCGTGG
TCCATTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTCACCACCTGCCACAGGCACCTGCACTGCACATTAATCGCCGCCGGCCGCCGCGCTCCACCGGCGGAA
GTGGAGGAGACGGCGACAGGTGCGGCAGGCTCGGAAGCTGTGTAG
Protein sequenceShow/hide protein sequence
MCRSDQALESTSNRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG
RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMD
INRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAE
VEETATGAAGSEAV