; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007925 (gene) of Snake gourd v1 genome

Gene IDTan0007925
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA glycosylase superfamily protein
Genome locationLG01:114835885..114842141
RNA-Seq ExpressionTan0007925
SyntenyTan0007925
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608520.1 hypothetical protein SDJN03_01862, partial [Cucurbita argyrosperma subsp. sororia]2.1e-16188.79Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+  KQ KALPVV ESVVRDNVSVGSSCSSDSLSSNYSAK  N K KPKPVK VAA GD
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD

Query:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ
        ANATTTSP LSV GKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILS+R IFRKVFNDFDPSSIAQF E EFTTLKVNA Q
Subjt:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ

Query:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLS+QKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPK+EFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  GTSVKDETKLRVEDRRSALLTGALEKPCLS
          SVK++ KLRVE+RRS LL  ALEK  L+
Subjt:  GTSVKDETKLRVEDRRSALLTGALEKPCLS

XP_022940560.1 uncharacterized protein LOC111446125 [Cucurbita moschata]2.1e-16188.79Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+  KQ KALPVV ESVVRDNVSVGSSCSSDSLSSNYSAK  N K KPKPVK VAA GD
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD

Query:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ
        ANATTTSP LSV GKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILS+R IFRKVFNDFDPSSIA F E EFTTLKVNA Q
Subjt:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ

Query:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLS+QKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPK+EFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  GTSVKDETKLRVEDRRSALLTGALEKPCLS
          SVKD+ KLRVE+RRS LL  ALEK  L+
Subjt:  GTSVKDETKLRVEDRRSALLTGALEKPCLS

XP_022981194.1 uncharacterized protein LOC111480412 [Cucurbita maxima]1.3e-16088.18Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+  KQ KALPVV ESVVRDN+SVGSSCSSDSLSSNYSAK  N K KPKPVK VAA GD
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD

Query:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ
        ANATTTSP L V GKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILS+R IFR VFNDFDPSSIAQF E EFTTLKVNA Q
Subjt:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ

Query:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLS+QKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPK+EFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  GTSVKDETKLRVEDRRSALLTGALEKPCLS
          SVKD+ KLRVE+RRS LL  ALEK  L+
Subjt:  GTSVKDETKLRVEDRRSALLTGALEKPCLS

XP_023523621.1 uncharacterized protein LOC111787800 [Cucurbita pepo subsp. pepo]5.5e-16289.09Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+  KQ KALPVV ESVVRDNVSVGSSCSSDSLSSNYSAK  N K KPKPVK VAA GD
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD

Query:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ
        ANATTTSP LSV GKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILS+R IFRKVFNDFDPSSIAQF E EFTTLKVNA Q
Subjt:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ

Query:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLS+QKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPK+EFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  GTSVKDETKLRVEDRRSALLTGALEKPCLS
          SVKD+ KLRVE+RRS LL  ALEK  L+
Subjt:  GTSVKDETKLRVEDRRSALLTGALEKPCLS

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida]3.3e-15985.33Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKP---KPVKAVAA
        MSVATKLQSHAKPVLESR ILGPGGNRDRAPEKPKCKQ+TLKK  KQ +ALP++ ESV+RDNVSVGSSCSSDS+SSNYSAK   PK KP   KPVKAVAA
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKP---KPVKAVAA

Query:  DGDANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVN
         GD NAT  SP LS+PGKRCDWIT +SDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILS+RDIFRKV NDFDPS+IAQF ENEFTTLKVN
Subjt:  DGDANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVN

Query:  AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
        AIQLLSE KLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRN FRY RQVPVKTPK+EFMSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECGTSVKDETKLRVEDRRSALLTGALEKPCLSK
        QEC   +KD+ KLRVED+RS  LTGALEKPCL++
Subjt:  QECGTSVKDETKLRVEDRRSALLTGALEKPCLSK

TrEMBL top hitse value%identityAlignment
A0A6J1CHU1 uncharacterized protein LOC1110116237.7e-15483.04Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKP---KPVKAVA
        M VA K +SHAKPVLESRAILGPGGNRDR PEKP+CK E TL K  KQ KALP VP+SV+RDNVSVGSSCSSDSLSSNYSAK  NPK KP   KPVKAVA
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKP---KPVKAVA

Query:  ADGDANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKV
        A  +A+ATTTSPR SVP KRCDWITPYSDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWP ILS+RD+FRKVFNDFDPSSIA+F ENEF TLKV
Subjt:  ADGDANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKV

Query:  NAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR
        N IQ+L+E KLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPK+E MSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLV+CFR
Subjt:  NAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR

Query:  YQECGTSVKDETKLRVEDRRSALLTGALEKPCLSKS
        YQEC  +VKD+ K RVED R  L  GA EKPCLS+S
Subjt:  YQECGTSVKDETKLRVEDRRSALLTGALEKPCLSKS

A0A6J1F6S0 uncharacterized protein LOC1114426744.8e-15685.98Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKP---KPVKAVAA
        MSVATKL SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK   KQ KALP +PESVVRDN+S+GSSCSSDSLSSNYS K  NPK KP   KPVKAVAA
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKP---KPVKAVAA

Query:  DGDANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVN
         GD N TTT+PRLSVPGKRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWPLIL +RDIFRKVFNDFDPS+IAQF +NEFTTLK N
Subjt:  DGDANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVN

Query:  AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
         IQLLSE KLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPVKTPK+EFMSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  AIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECGTSVKDETKLRVEDRRSALLTGALE
        QEC     D  KLRVED+RS LLTGALE
Subjt:  QECGTSVKDETKLRVEDRRSALLTGALE

A0A6J1FIT9 uncharacterized protein LOC1114461251.0e-16188.79Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+  KQ KALPVV ESVVRDNVSVGSSCSSDSLSSNYSAK  N K KPKPVK VAA GD
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD

Query:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ
        ANATTTSP LSV GKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILS+R IFRKVFNDFDPSSIA F E EFTTLKVNA Q
Subjt:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ

Query:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLS+QKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPK+EFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  GTSVKDETKLRVEDRRSALLTGALEKPCLS
          SVKD+ KLRVE+RRS LL  ALEK  L+
Subjt:  GTSVKDETKLRVEDRRSALLTGALEKPCLS

A0A6J1IHE9 uncharacterized protein LOC1114769755.9e-15485.54Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD
        MSVATKL SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK   KQ KALP +PESV+RDN+S+GSSCSSDSLSSN SAK  NP  K KPVKAVAA GD
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD

Query:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ
         N TTT+PRLSVPGKRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWPLIL +RDIFRKVFNDFDPS+IAQF +NEFTTLK N IQ
Subjt:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ

Query:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLSE KLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQ+PVKTPK+EFMSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLVDCFRYQEC
Subjt:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  GTSVKDETKLRVEDRRSALLTGALE
             D  KLRVED+ S LLTGALE
Subjt:  GTSVKDETKLRVEDRRSALLTGALE

A0A6J1J188 uncharacterized protein LOC1114804126.5e-16188.18Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+  KQ KALPVV ESVVRDN+SVGSSCSSDSLSSNYSAK  N K KPKPVK VAA GD
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD

Query:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ
        ANATTTSP L V GKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILS+R IFR VFNDFDPSSIAQF E EFTTLKVNA Q
Subjt:  ANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQ

Query:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLS+QKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPK+EFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  GTSVKDETKLRVEDRRSALLTGALEKPCLS
          SVKD+ KLRVE+RRS LL  ALEK  L+
Subjt:  GTSVKDETKLRVEDRRSALLTGALEKPCLS

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 14.4e-3740.98Show/hide
Query:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENA
        +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W  +L +R+ +R  F+ FDP  +A  +E +   L  +A  +    K++AI+ NA
Subjt:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENA

Query:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
           L+++Q    F ++ WSFVN +P   +     ++P  T  S+ +SK L +RGF+ VG T+ YSF+Q  G+VNDH+V C  Y
Subjt:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

P44321 DNA-3-methyladenine glycosylase1.0e-3341.34Show/hide
Query:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENAN
        RC W+   S  +YI +HD+EWG P  D +KLFE + L    A L+W  +L +R+ +R+ F+ FDP  IA+    +      N+  +    KL AIV+NA 
Subjt:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENAN

Query:  QVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDC
          L +++   +FS++ WSFVN KPI N     R VP KT  S+ +SK L +RGF  +G T  Y+F+Q  G+V+DHL DC
Subjt:  QVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]7.2e-4039.13Show/hide
Query:  SAKSFNPKGKPKPVKAVAADGDANATTTSPRLSVPGK-RCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKV
        SA +F+P+  P  +   +               V  K RC W T   +    LY  +HD EWG P+H+DKKLFE LVL    A L+W  IL +R+ FR  
Subjt:  SAKSFNPKGKPKPVKAVAADGDANATTTSPRLSVPGK-RCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKV

Query:  FNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGP
        F+DFDP  +A + E++   L  N   + +  K+ A + NA   + +Q+EFGSF  Y W FV  KPI N F     +P  TP S+ ++KDL +RGF+ VG 
Subjt:  FNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGP

Query:  TVVYSFLQVSGIVNDHLVDCFRYQECGTSV
        T +Y+ +Q  G+VNDHL  CF+   C +S+
Subjt:  TVVYSFLQVSGIVNDHLVDCFRYQECGTSV

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein4.0e-7044.54Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNR-DRAPEKPKCKQETLKKI---AKQTKA----LPVVPESVVRDNVSVGSSC---SSDSLSSNYSA-KSFNPKGK
        MSV  + +S      E R++LGP GN+  R P   K ++  ++K    +K  KA     P  P + ++   S+ SS    +S S++++YS+  S + +  
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNR-DRAPEKPKCKQETLKKI---AKQTKA----LPVVPESVVRDNVSVGSSC---SSDSLSSNYSA-KSFNPKGK

Query:  PKPV-------KAVAADGDANATTTSPRLSV--------------PGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQ
        P  V       K V   G  ++T    +LSV                KRC WITP +DP Y+AFHDEEWGVPVHDDKKLFELL LS ALAEL+W  ILS+
Subjt:  PKPV-------KAVAADGDANATTTSPRLSV--------------PGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQ

Query:  RDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRR
        R I R+VF DFDP ++A+  + + T     AI LLSE K+R+I++N+  V KI  E GS   Y W+FVN KP +++FRY RQVPVKT K+EF+SKDL+RR
Subjt:  RDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRR

Query:  GFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECGTSVKDETKLRVEDR
        GFR V PTV+YSF+Q +G+ NDHL+ CFRYQ+C    +  T  + + +
Subjt:  GFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECGTSVKDETKLRVEDR

AT1G75090.1 DNA glycosylase superfamily protein8.3e-9256.01Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD
        MS+ +KL+S  KP+ ESRAIL   GNR +  +    K+  L    + TK+ P   +     +VS   S SS S S   S  + N      P K    +  
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGD

Query:  ANATTT-------SPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTT
         N   +       SP++  P KRC WITP SDP+Y+ FHDEEWGVPV DDKKLFELLV SQALAE +WP IL +RD FRK+F +FDPS+IAQF E    +
Subjt:  ANATTT-------SPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTT

Query:  LKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVD
        L+VN   +LSEQKLRAIVENA  VLK++QEFGSFSNYCW FVN KP+RN +RY RQVPVK+PK+E++SKD+M+RGFRCVGPTV+YSFLQ SGIVNDHL  
Subjt:  LKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVD

Query:  CFRYQECGTSVKDETK
        CFRYQEC    + ETK
Subjt:  CFRYQECGTSVKDETK

AT1G80850.1 DNA glycosylase superfamily protein6.6e-7348.48Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKAL----------PVVPESVVRDNVSVGSSCSSDSLSS-NYSAKSFNPKGKP
        MS   +++S      E R++LGP GN  +  +KP  K    K +A++TK L          P+ P  + R+ +S+ +S SSD+ SS   S  S       
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKAL----------PVVPESVVRDNVSVGSSCSSDSLSS-NYSAKSFNPKGKP

Query:  KPV----KAVAADGDANATTTSPRLSVPG-------KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDF
        K V     +V++        T  R            KRC WITP SD  YIAFHDEEWGVPVHDDK+LFELL LS ALAEL+W  ILS+R +FR+VF DF
Subjt:  KPV----KAVAADGDANATTTSPRLSVPG-------KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDF

Query:  DPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVY
        DP +I++    + T+ ++ A  LLSEQKLR+I+ENANQV KI   FGSF  Y W+FVN+KP +++FRY RQVPVKT K+E +SKDL+RRGFR V PTV+Y
Subjt:  DPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVY

Query:  SFLQVSGIVNDHLVDCFRYQECGTSVKDET
        SF+Q +G+ NDHL  CFR+ +C T  KDET
Subjt:  SFLQVSGIVNDHLVDCFRYQECGTSVKDET

AT5G57970.1 DNA glycosylase superfamily protein1.8e-7052.14Show/hide
Query:  ESVVRDNVSVGSSCSSDSLSSNYSAKS------------FNPKGKPKPVKAVAADGDANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDD
        E  +  N+S+ +S SSD+   ++ +++               K  P   ++V ++G   A  + P  S   KRC W+TP SDP YI FHDEEWGVPVHDD
Subjt:  ESVVRDNVSVGSSCSSDSLSSNYSAKS------------FNPKGKPKPVKAVAADGDANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDD

Query:  KKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR
        K+LFELLVLS ALAE TWP ILS+R  FR+VF DFDP++I +  E +       A  LLS+ KLRA++ENA Q+LK+ +E+GSF  Y WSFV  K I ++
Subjt:  KKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR

Query:  FRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        FRY RQVP KTPK+E +SKDL+RRGFR VGPTVVYSF+Q +GI NDHL  CFR+  C
Subjt:  FRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

AT5G57970.2 DNA glycosylase superfamily protein1.8e-7052.14Show/hide
Query:  ESVVRDNVSVGSSCSSDSLSSNYSAKS------------FNPKGKPKPVKAVAADGDANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDD
        E  +  N+S+ +S SSD+   ++ +++               K  P   ++V ++G   A  + P  S   KRC W+TP SDP YI FHDEEWGVPVHDD
Subjt:  ESVVRDNVSVGSSCSSDSLSSNYSAKS------------FNPKGKPKPVKAVAADGDANATTTSPRLSVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDD

Query:  KKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR
        K+LFELLVLS ALAE TWP ILS+R  FR+VF DFDP++I +  E +       A  LLS+ KLRA++ENA Q+LK+ +E+GSF  Y WSFV  K I ++
Subjt:  KKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR

Query:  FRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        FRY RQVP KTPK+E +SKDL+RRGFR VGPTVVYSF+Q +GI NDHL  CFR+  C
Subjt:  FRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGTGGCTACGAAGCTCCAATCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCTGAGAAGCCAAAATGCAA
ACAGGAGACCTTGAAGAAGATAGCGAAGCAGACCAAGGCGCTTCCGGTGGTTCCTGAATCGGTTGTTCGGGACAATGTCTCCGTCGGGAGCTCCTGCTCTTCCGATTCTT
TATCAAGCAACTACTCGGCCAAATCGTTTAATCCGAAAGGGAAGCCCAAGCCTGTGAAGGCTGTTGCTGCTGACGGTGACGCTAACGCAACCACAACGTCGCCTAGGCTC
TCGGTTCCGGGGAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTTCACGACGAAGAATGGGGAGTACCAGTTCATGACGACAAGAAGCTGTT
TGAGTTACTAGTGTTATCACAAGCCTTAGCAGAACTTACTTGGCCCTTGATTCTTAGCCAGAGAGACATATTTAGGAAAGTCTTTAATGATTTTGACCCATCTTCCATCG
CACAGTTCAGAGAGAATGAGTTTACGACACTAAAAGTAAATGCCATCCAGCTCCTATCTGAACAAAAGCTTCGTGCAATCGTGGAGAACGCTAATCAAGTACTCAAGATT
CAACAGGAATTTGGTTCCTTCAGCAACTATTGTTGGAGCTTTGTTAACAAGAAGCCTATAAGAAACCGATTTCGATACGCTCGTCAAGTACCGGTAAAGACGCCAAAATC
AGAGTTCATGAGCAAGGATTTGATGAGAAGAGGATTCCGTTGTGTCGGACCAACTGTGGTTTATTCCTTCTTGCAAGTTAGTGGAATTGTTAACGATCACTTGGTTGATT
GCTTCAGATACCAAGAGTGTGGCACAAGCGTAAAAGATGAGACAAAACTAAGAGTAGAAGATCGGAGATCGGCATTACTTACCGGAGCTTTGGAGAAGCCTTGCTTGTCT
AAATCTTGA
mRNA sequenceShow/hide mRNA sequence
GAAATGAGGTGCACACAAAGAGGTATCCCGTCCAAATCCAAACGATCCTTCCGCTTAAAGAACCTCTCAACAATCGACCCATTTCTTCGTCGTTCTCCGCCCACGGAGTC
ACTGCTCTTTCTCTCGCCACGCCCTTCTGCTCTTCACTGCCAAACCCTAGGGCTTTGTCTGCAACTCTTTTGCCCTCTCACTATCAATAATCACTTCATTTTCCTCTCTG
TCTCGCCTTGAAATCGTACTGTTATTTTGTCTCGGGAAATTTTGACAGTTGCCGAGCTTGCTCCGTTTTGATCCACTCCGAATTCGATTCTGGAAGGAATTTTCGCTGAC
GGATATCTTTCTTGGGCAATGTCTGTGGCTACGAAGCTCCAATCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCC
TGAGAAGCCAAAATGCAAACAGGAGACCTTGAAGAAGATAGCGAAGCAGACCAAGGCGCTTCCGGTGGTTCCTGAATCGGTTGTTCGGGACAATGTCTCCGTCGGGAGCT
CCTGCTCTTCCGATTCTTTATCAAGCAACTACTCGGCCAAATCGTTTAATCCGAAAGGGAAGCCCAAGCCTGTGAAGGCTGTTGCTGCTGACGGTGACGCTAACGCAACC
ACAACGTCGCCTAGGCTCTCGGTTCCGGGGAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTTCACGACGAAGAATGGGGAGTACCAGTTCA
TGACGACAAGAAGCTGTTTGAGTTACTAGTGTTATCACAAGCCTTAGCAGAACTTACTTGGCCCTTGATTCTTAGCCAGAGAGACATATTTAGGAAAGTCTTTAATGATT
TTGACCCATCTTCCATCGCACAGTTCAGAGAGAATGAGTTTACGACACTAAAAGTAAATGCCATCCAGCTCCTATCTGAACAAAAGCTTCGTGCAATCGTGGAGAACGCT
AATCAAGTACTCAAGATTCAACAGGAATTTGGTTCCTTCAGCAACTATTGTTGGAGCTTTGTTAACAAGAAGCCTATAAGAAACCGATTTCGATACGCTCGTCAAGTACC
GGTAAAGACGCCAAAATCAGAGTTCATGAGCAAGGATTTGATGAGAAGAGGATTCCGTTGTGTCGGACCAACTGTGGTTTATTCCTTCTTGCAAGTTAGTGGAATTGTTA
ACGATCACTTGGTTGATTGCTTCAGATACCAAGAGTGTGGCACAAGCGTAAAAGATGAGACAAAACTAAGAGTAGAAGATCGGAGATCGGCATTACTTACCGGAGCTTTG
GAGAAGCCTTGCTTGTCTAAATCTTGACATGTTCGAAGAAACAGAATATGCTTCCATGGTAGAGCAGAAGAAAGAAGAGCATGCTTCCCCCAACATTTATGATGTTCAAG
TTTTCGTAAGGTTTTAGTTTTATTATAGTTTCTTAACTTTAACGTTCATTCTGTATTTTAATCTCAATTTTTCTTAAATGCTCATTTTAGTGTCCTGATGTATTCTATAA
ACATGGCTTAGAATAAAATACTGTATCGAAC
Protein sequenceShow/hide protein sequence
MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKIAKQTKALPVVPESVVRDNVSVGSSCSSDSLSSNYSAKSFNPKGKPKPVKAVAADGDANATTTSPRL
SVPGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSQRDIFRKVFNDFDPSSIAQFRENEFTTLKVNAIQLLSEQKLRAIVENANQVLKI
QQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKSEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECGTSVKDETKLRVEDRRSALLTGALEKPCLS
KS