CuGenDBv2

Lag0036866 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036866
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DNA glycosylase superfamily protein
Genome location: chr2:1755980..1758467
RNA-Seq Expression: Lag0036866
Synteny: Lag0036866
Gene Ontology terms: GO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains: IPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology
GenBank top hits (e value, % identity, alignment)
KAG6608520.1 hypothetical protein SDJN03_01862, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.2e-162; % identity: 87.99)
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+T K NKALPVVSESVVRDNVSVGSSCSSDS SSNYSAKLLN K KP   KPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN
        GGDANATTTSP LS+  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IFRKVFNDFDPS+IAQFTE EF+TLKVN
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN

Query:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
          QLLS+ KLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKPIRN++RY RQVPVKTPKAEFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
Subjt:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLT
        QECD  VK++MKLRVE+RRSELL  ALEK  LT
Subjt:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLT

XP_022141169.1 uncharacterized protein LOC111011623 [Momordica charantia] (e value: 1.6e-164; % identity: 86.01)
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVA
        M VA K +SHAKPVLESRAILGPGGNRDR PEKP+CK E TL KTEK NKALP V +SV+RDNVSVGSSCSSDS SSNYSAKLLN KVKPYAVKPVK VA
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVA

Query:  VGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKV
         G +A+ATTTSPR S+PRKRCDWITPYSDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWPSILSKRD+FRKVFNDFDPS+IA+FTENEF+TLKV
Subjt:  VGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKV

Query:  NGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR
        NGIQ+L+EPKLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKPIRN+FRYARQVPVKTPKAE MSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLV+CFR
Subjt:  NGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR

Query:  YQECDTDVKDEMKLRVEDRRSELLTGALEKPCLTRS
        YQECD +VKD+MK RVED R EL  GA EKPCL+RS
Subjt:  YQECDTDVKDEMKLRVEDRRSELLTGALEKPCLTRS

XP_022940560.1 uncharacterized protein LOC111446125 [Cucurbita moschata] (e value: 4.2e-162; % identity: 87.99)
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+T K NKALPVVSESVVRDNVSVGSSCSSDS SSNYSAKLLN K KP   KPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN
        GGDANATTTSP LS+  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IFRKVFNDFDPS+IA FTE EF+TLKVN
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN

Query:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
          QLLS+ KLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKPIRN++RY RQVPVKTPKAEFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
Subjt:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLT
        QECD  VKD+MKLRVE+RRSELL  ALEK  LT
Subjt:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLT

XP_023523621.1 uncharacterized protein LOC111787800 [Cucurbita pepo subsp. pepo] (e value: 1.1e-162; % identity: 88.29)
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+T K NKALPVVSESVVRDNVSVGSSCSSDS SSNYSAKLLN K KP   KPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN
        GGDANATTTSP LS+  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IFRKVFNDFDPS+IAQFTE EF+TLKVN
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN

Query:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
          QLLS+ KLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKPIRN++RY RQVPVKTPKAEFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
Subjt:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLT
        QECD  VKD+MKLRVE+RRSELL  ALEK  LT
Subjt:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLT

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida] (e value: 7.5e-167; % identity: 88.02)
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHAKPVLESR ILGPGGNRDRAPEKPKCKQ+TLKKTEK N+ALP++SESV+RDNVSVGSSCSSDS SSNYSAKLL  KVKP AVKPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN
        GGD NAT  SP LS+P KRCDWIT +SDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKRDIFRKV NDFDPS IAQFTENEF+TLKVN
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN

Query:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
         IQLLSEPKLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKPIRN FRY RQVPVKTPKAEFMSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLTR
        QECD  +KD+ KLRVED+RSE LTGALEKPCLTR
Subjt:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLTR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CHU1 uncharacterized protein LOC111011623 (e value: 7.5e-165; % identity: 86.01)
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVA
        M VA K +SHAKPVLESRAILGPGGNRDR PEKP+CK E TL KTEK NKALP V +SV+RDNVSVGSSCSSDS SSNYSAKLLN KVKPYAVKPVK VA
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVA

Query:  VGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKV
         G +A+ATTTSPR S+PRKRCDWITPYSDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWPSILSKRD+FRKVFNDFDPS+IA+FTENEF+TLKV
Subjt:  VGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKV

Query:  NGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR
        NGIQ+L+EPKLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKPIRN+FRYARQVPVKTPKAE MSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLV+CFR
Subjt:  NGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR

Query:  YQECDTDVKDEMKLRVEDRRSELLTGALEKPCLTRS
        YQECD +VKD+MK RVED R EL  GA EKPCL+RS
Subjt:  YQECDTDVKDEMKLRVEDRRSELLTGALEKPCLTRS

A0A6J1F6S0 uncharacterized protein LOC111442674 (e value: 2.3e-161; % identity: 86.89)
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKL SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK +EK NKALP + ESVVRDN+S+GSSCSSDS SSNYS KLLN KVKP  VKPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN
        GGD N TTT+PRLS+P KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIFRKVFNDFDPSTIAQFT+NEF+TLK N
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN

Query:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
        GIQLLSEPKLRA+VENANQVLKIQQEFG+FSNYCWSFVNKKPI N+FRYARQVPVKTPKAEFMSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECDTDVKDEMKLRVEDRRSELLTGALE
        QEC     D MKLRVED+RSELLTGALE
Subjt:  QECDTDVKDEMKLRVEDRRSELLTGALE

A0A6J1FIT9 uncharacterized protein LOC111446125 (e value: 2.0e-162; % identity: 87.99)
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+T K NKALPVVSESVVRDNVSVGSSCSSDS SSNYSAKLLN K KP   KPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN
        GGDANATTTSP LS+  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IFRKVFNDFDPS+IA FTE EF+TLKVN
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN

Query:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
          QLLS+ KLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKPIRN++RY RQVPVKTPKAEFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
Subjt:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLT
        QECD  VKD+MKLRVE+RRSELL  ALEK  LT
Subjt:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLT

A0A6J1IHE9 uncharacterized protein LOC111476975 (e value: 4.4e-157; % identity: 85.37)
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKL SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK +EK NKALP + ESV+RDN+S+GSSCSSDS SSN SAKLLN K     VKPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN
        GGD N TTT+PRLS+P KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIFRKVFNDFDPSTIAQFT+NEF+TLK N
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN

Query:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
        GIQLLSEPKLRA+VENANQVLKIQQEFG+FSNYCWSFVNKKPI N+FRYARQ+PVKTPKAEFMSKDL+RRGFRCVGPTVVYSF+QV+GIVNDHLVDCFRY
Subjt:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECDTDVKDEMKLRVEDRRSELLTGALE
        QEC     D MKLRVED+ SELLTGALE
Subjt:  QECDTDVKDEMKLRVEDRRSELLTGALE

A0A6J1J188 uncharacterized protein LOC111480412 (e value: 1.3e-161; % identity: 87.39)
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+T K NKALPVVSESVVRDN+SVGSSCSSDS SSNYSAKLLN K KP   KPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN
        GGDANATTTSP L +  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IFR VFNDFDPS+IAQFTE EF+TLKVN
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVN

Query:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
          QLLS+ KLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKPIRN++RY RQVPVKTPKAEFMSKDLM+RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
Subjt:  GIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLT
        QECD  VKD+MKLRVE+RRSELL  ALEK  LT
Subjt:  QECDTDVKDEMKLRVEDRRSELLTGALEKPCLT

SwissProt top hits (e value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 (e value: 3.4e-37; % identity: 39.89)
Query:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENA
        +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W ++L KR+ +R  F+ FDP  +A   E +   L  +   +    K++A++ NA
Subjt:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENA

Query:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
           L+++Q    F ++ WSFVN +P   +     ++P  T  ++ +SK L +RGF+ VG T+ YSF+Q  G+VNDH+V C  Y
Subjt:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

P44321 DNA-3-methyladenine glycosylase (e value: 4.1e-35; % identity: 41.34)
Query:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENAN
        RC W+   S  +YI +HD+EWG P  D +KLFE + L    A L+W ++L KR+ +R+ F+ FDP  IA+ T  +      N   +    KL A+V+NA 
Subjt:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENAN

Query:  QVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDC
          L +++   +FS++ WSFVN KPI N     R VP KT  ++ +SK L +RGF  +G T  Y+F+Q  G+V+DHL DC
Subjt:  QVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] (e value: 2.3e-41; % identity: 44.02)
Query:  RCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVE
        RC W T   +    LY  +HD EWG P+H+DKKLFE LVL    A L+W +IL KR+ FR  F+DFDP  +A + E++   L  N   + +  K+ A + 
Subjt:  RCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVE

Query:  NANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR
        NA   + +Q+EFGSF  Y W FV  KPI N F     +P  TP ++ ++KDL +RGF+ VG T +Y+ +Q  G+VNDHL  CF+
Subjt:  NANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR

Arabidopsis top hits (e value, % identity, alignment)
AT1G15970.1 DNA glycosylase superfamily protein (e value: 1.3e-71; % identity: 46.02)
Query:  MSVATKLQSHAKPVLESRAILGPGGNR-DRAPEKPKCKQETLKKT------EKYNKALPVVS------------ESVVRDN-VSVGSSCSSDSFSSNYSA
        MSV  + +S      E R++LGP GN+  R P   K ++  ++KT      EK  K     S             S++R N  S+ +S SSD+ SS  S+
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNR-DRAPEKPKCKQETLKKT------EKYNKALPVVS------------ESVVRDN-VSVGSSCSSDSFSSNYSA

Query:  KLLNSKVKPYAVKPVKVVAVGGDANAT-----------TTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILS
         L  S     + K  KVV   G  ++T            +    +  RKRC WITP +DP Y+AFHDEEWGVPVHDDKKLFELL LS ALAEL+W  ILS
Subjt:  KLLNSKVKPYAVKPVKVVAVGGDANAT-----------TTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILS

Query:  KRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMR
        +R I R+VF DFDP  +A+  + + +      I LLSE K+R++++N+  V KI  E GS   Y W+FVN KP +++FRY RQVPVKT KAEF+SKDL+R
Subjt:  KRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMR

Query:  RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDTDVK
        RGFR V PTV+YSF+Q +G+ NDHL+ CFRYQ+C  D +
Subjt:  RGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDTDVK

AT1G75090.1 DNA glycosylase superfamily protein (e value: 9.0e-94; % identity: 55.25)
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETL--KKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVV
        MS+ +KL+S  KP+ ESRAIL   GNR +  +    K+  L  + T+      P  + SV  D+ S  SS S  S  +  ++  + +  K   V+ +  V
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETL--KKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVV

Query:  AVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLK
         V   A     SP++  P KRC WITP SDP+Y+ FHDEEWGVPV DDKKLFELLV SQALAE +WPSIL +RD FRK+F +FDPS IAQFTE    +L+
Subjt:  AVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLK

Query:  VNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCF
        VNG  +LSE KLRA+VENA  VLK++QEFGSFSNYCW FVN KP+RN +RY RQVPVK+PKAE++SKD+M+RGFRCVGPTV+YSFLQ SGIVNDHL  CF
Subjt:  VNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCF

Query:  RYQECDTDVKDEMKLRVEDRRSEL
        RYQEC+ + + E K    + + +L
Subjt:  RYQECDTDVKDEMKLRVEDRRSEL

AT1G80850.1 DNA glycosylase superfamily protein (e value: 1.3e-73; % identity: 46.97)
Query:  MSVATKLQSHAKPVLESRAILGPGGNR------DRAPEKPKC-KQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVK
        MS   +++S      E R++LGP GN+       +  +KP   K + L  TEK  +  P+    + R+ +S+ +S SSD+ SS  S+ L  +        
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNR------DRAPEKPKC-KQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVK

Query:  PVKVVAVGGDANATTTSPR-------------LSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFND
          +V+   G  +++++  R                 RKRC WITP SD  YIAFHDEEWGVPVHDDK+LFELL LS ALAEL+W  ILSKR +FR+VF D
Subjt:  PVKVVAVGGDANATTTSPR-------------LSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFND

Query:  FDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVV
        FDP  I++ T  + ++ ++    LLSE KLR+++ENANQV KI   FGSF  Y W+FVN+KP +++FRY RQVPVKT KAE +SKDL+RRGFR V PTV+
Subjt:  FDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVV

Query:  YSFLQVSGIVNDHLVDCFRYQECDTDVKDE
        YSF+Q +G+ NDHL  CFR+ +C T  KDE
Subjt:  YSFLQVSGIVNDHLVDCFRYQECDTDVKDE

AT5G57970.1 DNA glycosylase superfamily protein (e value: 1.2e-74; % identity: 58.06)
Query:  NVSVGSSCSSDSFSSNYSAKLL------NSKVKPYAVKPVKVVAVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVL
        N S  S  S DSF S  S   L       S+ K Y  KP  VV+ G    A  + P  S  +KRC W+TP SDP YI FHDEEWGVPVHDDK+LFELLVL
Subjt:  NVSVGSSCSSDSFSSNYSAKLL------NSKVKPYAVKPVKVVAVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVL

Query:  SQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPV
        S ALAE TWP+ILSKR  FR+VF DFDP+ I +  E +          LLS+ KLRAV+ENA Q+LK+ +E+GSF  Y WSFV  K I +KFRY RQVP 
Subjt:  SQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPV

Query:  KTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        KTPKAE +SKDL+RRGFR VGPTVVYSF+Q +GI NDHL  CFR+  C
Subjt:  KTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

AT5G57970.2 DNA glycosylase superfamily protein (e value: 1.2e-74; % identity: 58.06)
Query:  NVSVGSSCSSDSFSSNYSAKLL------NSKVKPYAVKPVKVVAVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVL
        N S  S  S DSF S  S   L       S+ K Y  KP  VV+ G    A  + P  S  +KRC W+TP SDP YI FHDEEWGVPVHDDK+LFELLVL
Subjt:  NVSVGSSCSSDSFSSNYSAKLL------NSKVKPYAVKPVKVVAVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVL

Query:  SQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPV
        S ALAE TWP+ILSKR  FR+VF DFDP+ I +  E +          LLS+ KLRAV+ENA Q+LK+ +E+GSF  Y WSFV  K I +KFRY RQVP 
Subjt:  SQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPV

Query:  KTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        KTPKAE +SKDL+RRGFR VGPTVVYSF+Q +GI NDHL  CFR+  C
Subjt:  KTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC


Sequences
CDS sequence
ATGTCTGTGGCTACGAAGCTCCAATCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCTGAGAAGCCGAAATGCAA
ACAGGAGACCTTGAAGAAGACAGAGAAATATAATAAAGCGCTTCCGGTGGTTTCTGAATCGGTTGTTCGGGATAATGTCTCCGTCGGAAGTTCCTGCTCTTCCGATTCTT
TTTCAAGCAACTATTCGGCCAAATTGTTGAATTCTAAGGTGAAGCCCTACGCCGTGAAGCCTGTGAAGGTTGTTGCTGTCGGCGGAGACGCAAACGCAACTACAACGTCG
CCTAGGCTCTCGATTCCACGCAAACGCTGTGATTGGATAACGCCGTATTCTGACCCACTTTACATCGCTTTTCATGACGAAGAATGGGGTGTCCCAGTTCATGACGACAA
GAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAACTTACCTGGCCCTCGATTCTTAGCAAGAGAGATATATTTAGGAAAGTCTTTAATGATTTTGACCCAT
CTACCATCGCACAGTTCACAGAGAATGAGTTTTCAACACTAAAAGTAAATGGCATCCAGCTCCTCTCTGAACCAAAGCTTCGAGCAGTTGTGGAGAACGCTAATCAAGTA
CTCAAGATTCAACAGGAATTTGGTTCCTTTAGCAACTATTGTTGGAGCTTCGTTAACAAGAAGCCTATAAGAAACAAATTTCGATACGCCCGTCAAGTACCAGTAAAGAC
GCCAAAAGCAGAATTCATGAGCAAAGATTTGATGAGGAGAGGGTTCCGTTGCGTCGGGCCAACTGTGGTATATTCCTTCTTGCAAGTTAGCGGAATTGTTAACGATCACT
TGGTCGATTGCTTCAGATATCAAGAGTGCGACACAGACGTAAAAGATGAGATGAAACTAAGAGTAGAAGATCGGAGATCGGAGTTGCTTACCGGAGCTTTGGAGAAGCCT
TGCTTGACTAGATCTTGA
mRNA sequence
ATGTCTGTGGCTACGAAGCTCCAATCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCTGAGAAGCCGAAATGCAA
ACAGGAGACCTTGAAGAAGACAGAGAAATATAATAAAGCGCTTCCGGTGGTTTCTGAATCGGTTGTTCGGGATAATGTCTCCGTCGGAAGTTCCTGCTCTTCCGATTCTT
TTTCAAGCAACTATTCGGCCAAATTGTTGAATTCTAAGGTGAAGCCCTACGCCGTGAAGCCTGTGAAGGTTGTTGCTGTCGGCGGAGACGCAAACGCAACTACAACGTCG
CCTAGGCTCTCGATTCCACGCAAACGCTGTGATTGGATAACGCCGTATTCTGACCCACTTTACATCGCTTTTCATGACGAAGAATGGGGTGTCCCAGTTCATGACGACAA
GAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAACTTACCTGGCCCTCGATTCTTAGCAAGAGAGATATATTTAGGAAAGTCTTTAATGATTTTGACCCAT
CTACCATCGCACAGTTCACAGAGAATGAGTTTTCAACACTAAAAGTAAATGGCATCCAGCTCCTCTCTGAACCAAAGCTTCGAGCAGTTGTGGAGAACGCTAATCAAGTA
CTCAAGATTCAACAGGAATTTGGTTCCTTTAGCAACTATTGTTGGAGCTTCGTTAACAAGAAGCCTATAAGAAACAAATTTCGATACGCCCGTCAAGTACCAGTAAAGAC
GCCAAAAGCAGAATTCATGAGCAAAGATTTGATGAGGAGAGGGTTCCGTTGCGTCGGGCCAACTGTGGTATATTCCTTCTTGCAAGTTAGCGGAATTGTTAACGATCACT
TGGTCGATTGCTTCAGATATCAAGAGTGCGACACAGACGTAAAAGATGAGATGAAACTAAGAGTAGAAGATCGGAGATCGGAGTTGCTTACCGGAGCTTTGGAGAAGCCT
TGCTTGACTAGATCTTGA
Protein sequence
MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKTEKYNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAVGGDANATTTS
PRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQV
LKIQQEFGSFSNYCWSFVNKKPIRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDTDVKDEMKLRVEDRRSELLTGALEKP
CLTRS