CuGenDBv2

Carg17129 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg17129
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: DNA glycosylase superfamily protein
Genome location: Carg_Chr01:12597550..12601183
RNA-Seq Expression: Carg17129
Synteny: Carg17129
Gene Ontology terms:
    GO:0006284 - base-excision repair (biological process)
    GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
    IPR005019 - Methyladenine glycosylase
    IPR011257 - DNA glycosylase
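
The genome location field uses a "sequence:start..end" layout. A minimal sketch of splitting it into its parts follows; the function name parse_location is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(loc: str):
    """Split a 'seqid:start..end' string into (seqid, start, end, length)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the span
    # covers end - start + 1 bases.
    return seqid, start, end, end - start + 1


seqid, start, end, length = parse_location("Carg_Chr01:12597550..12601183")
print(seqid, start, end, length)
```

Under that assumption the gene span is 3634 bp, which includes untranslated regions and any introns as well as the coding sequence.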


Homology
GenBank top hits (e value, % identity, alignment)
KAG6608520.1 hypothetical protein SDJN03_01862, partial [Cucurbita argyrosperma subsp. sororia] | e value: 7.3e-183 | % identity: 99.4
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ
        ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIA FTEAEFTTLKVNATQ
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ

Query:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  DASVKDDMKLRVENRRSELLIRALEKSSLTT
        DASVK+DMKLRVENRRSELLIRALEKSSLTT
Subjt:  DASVKDDMKLRVENRRSELLIRALEKSSLTT

XP_022940560.1 uncharacterized protein LOC111446125 [Cucurbita moschata] | e value: 2.3e-184 | % identity: 100
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ
        ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ

Query:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  DASVKDDMKLRVENRRSELLIRALEKSSLTT
        DASVKDDMKLRVENRRSELLIRALEKSSLTT
Subjt:  DASVKDDMKLRVENRRSELLIRALEKSSLTT

XP_022981194.1 uncharacterized protein LOC111480412 [Cucurbita maxima] | e value: 4.7e-182 | % identity: 98.79
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDN+SVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ
        ANATTTSPGL VAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFR VFNDFDPSSIA FTEAEFTTLKVNATQ
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ

Query:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  DASVKDDMKLRVENRRSELLIRALEKSSLTT
        DASVKDDMKLRVENRRSELLIRALEKSSLTT
Subjt:  DASVKDDMKLRVENRRSELLIRALEKSSLTT

XP_023523621.1 uncharacterized protein LOC111787800 [Cucurbita pepo subsp. pepo] | e value: 1.9e-183 | % identity: 99.7
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ
        ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIA FTEAEFTTLKVNATQ
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ

Query:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  DASVKDDMKLRVENRRSELLIRALEKSSLTT
        DASVKDDMKLRVENRRSELLIRALEKSSLTT
Subjt:  DASVKDDMKLRVENRRSELLIRALEKSSLTT

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida] | e value: 2.9e-155 | % identity: 84.38
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKP---KPVKTVAA
        MSVATKLQSHA+PVLESR ILGPGGNRDRAPEKPKCKQ+ LK+T KQN+ALP++SESV+RDNVSVGSSCSSDS+SSNYSAKLL  K KP   KPVK VAA
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKP---KPVKTVAA

Query:  GGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVN
        GGD NAT  SP LS+ GKRCDWIT +SDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKR IFRKV NDFDPS+IA FTE EFTTLKVN
Subjt:  GGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVN

Query:  ATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
        A QLLS+ KLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRN +RY RQVPVKTPKAEFMSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  ATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECDASVKDDMKLRVENRRSELLIRALEKSSLT
        QECDA +KDD KLRVE++RSE L  ALEK  LT
Subjt:  QECDASVKDDMKLRVENRRSELLIRALEKSSLT
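
In the alignments above, the unlabeled middle line follows the usual BLAST convention: it repeats the residue where query and subject are identical, shows "+" for a conservative substitution, and is blank for other mismatches or gaps. The sketch below counts identical columns from a query line and its middle line, using the final segment of the KAG6608520.1 hit as input. The function name identity_counts is hypothetical, and this is only an illustration of how a % identity figure can be derived, not necessarily how the database computed the values it reports.

```python
def identity_counts(query: str, midline: str):
    """Return (identical_columns, aligned_columns) for one alignment segment."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is identical when the midline repeats the query residue.
    # '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return identical, len(query)


# Final segment of the KAG6608520.1 alignment, copied from the record.
query = "DASVKDDMKLRVENRRSELLIRALEKSSLTT"
midline = "DASVK+DMKLRVENRRSELLIRALEKSSLTT"

identical, aligned = identity_counts(query, midline)
print(identical, aligned)  # 30 identical out of 31 aligned columns
```

Applied to all four segments of that hit, the same count appears to give 329 identical columns out of 331, which is consistent with the reported 99.4.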

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LG22 Uncharacterized protein | e value: 3.1e-147 | % identity: 82.91
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSH +P LE RAILGPGGNRDRAP+ PKCK E LK+T KQ+KALP +SESV+RDNVSVGSSCSSDSLSSNYSAKLL    KP  VK V+AGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ
        +NATTTSP LS+ GKRCDWIT +SDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWPLILSKR +FRKV NDFDPSSIA FTE EFTTLKVN  Q
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ

Query:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLS+ KLRAIV+NANQVLKIQ+EFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPKAEFMSKD+++RGFRCVGPTVVYSF+QV+GIVNDHLV CFRY+EC
Subjt:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  DASVKDDMKLRVENRR
        D  VKDD KLRVE++R
Subjt:  DASVKDDMKLRVENRR

A0A6J1F6S0 uncharacterized protein LOC111442674 | e value: 1.8e-150 | % identity: 82.78
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKP---KPVKTVAA
        MSVATKL SHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK + KQNKALP + ESVVRDN+S+GSSCSSDSLSSNYS KLLN K KP   KPVK VAA
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKP---KPVKTVAA

Query:  GGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVN
        GGD N TTT+P LSV GKRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWPLIL KR IFRKVFNDFDPS+IA FT+ EFTTLK N
Subjt:  GGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVN

Query:  ATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
          QLLS+ KLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NR+RY RQVPVKTPKAEFMSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  ATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

Query:  QECDASVKDDMKLRVENRRSELLIRALEKSS
        QECD      MKLRVE++RSELL  ALE  S
Subjt:  QECDASVKDDMKLRVENRRSELLIRALEKSS

A0A6J1FIT9 uncharacterized protein LOC111446125 | e value: 1.1e-184 | % identity: 100
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ
        ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ

Query:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  DASVKDDMKLRVENRRSELLIRALEKSSLTT
        DASVKDDMKLRVENRRSELLIRALEKSSLTT
Subjt:  DASVKDDMKLRVENRRSELLIRALEKSSLTT

A0A6J1IHE9 uncharacterized protein LOC111476975 | e value: 3.7e-148 | % identity: 82.32
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKL SHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK + KQNKALP + ESV+RDN+S+GSSCSSDSLSSN SAKLLN   K KPVK VAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ
         N TTT+P LSV GKRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWPLIL KR IFRKVFNDFDPS+IA FT+ EFTTLK N  Q
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ

Query:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLS+ KLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NR+RY RQ+PVKTPKAEFMSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLVDCFRYQEC
Subjt:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  DASVKDDMKLRVENRRSELLIRALEKSS
        D      MKLRVE++ SELL  ALE  S
Subjt:  DASVKDDMKLRVENRRSELLIRALEKSS

A0A6J1J188 uncharacterized protein LOC111480412 | e value: 2.3e-182 | % identity: 98.79
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
        MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDN+SVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGD

Query:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ
        ANATTTSPGL VAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFR VFNDFDPSSIA FTEAEFTTLKVNATQ
Subjt:  ANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQ

Query:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
Subjt:  LLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

Query:  DASVKDDMKLRVENRRSELLIRALEKSSLTT
        DASVKDDMKLRVENRRSELLIRALEKSSLTT
Subjt:  DASVKDDMKLRVENRRSELLIRALEKSSLTT

SwissProt top hits (e value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 | e value: 9.7e-37 | % identity: 41.53
Query:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENA
        +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W  +L KR  +R  F+ FDP  +A   E +   L  +A  +    K++AI+ NA
Subjt:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENA

Query:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY
           L+++Q    F ++ WSFVN +P   +     ++P  T  ++ +SK L KRGF+ VG T+ YSF+Q  G+VNDH+V C  Y
Subjt:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRY

P44321 DNA-3-methyladenine glycosylase | e value: 1.5e-34 | % identity: 42.46
Query:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENAN
        RC W+   S  +YI +HD+EWG P  D +KLFE + L    A L+W  +L KR  +R+ F+ FDP  IA  T  +      N+  +    KL AIV+NA 
Subjt:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENAN

Query:  QVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDC
          L +++   +FS++ WSFVN KPI N     R VP KT  ++ +SK L KRGF  +G T  Y+F+Q  G+V+DHL DC
Subjt:  QVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] | e value: 4.2e-40 | % identity: 43.23
Query:  RCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVE
        RC W T   +    LY  +HD EWG P+H+DKKLFE LVL    A L+W  IL KR  FR  F+DFDP  +A++ E +   L  N   + +  K+ A + 
Subjt:  RCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVE

Query:  NANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASV
        NA   + +Q+EFGSF  Y W FV  KPI N +     +P  TP ++ ++KDL KRGF+ VG T +Y+ +Q  G+VNDHL  CF+   C++S+
Subjt:  NANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASV

Arabidopsis top hits (e value, % identity, alignment)
AT1G15970.1 DNA glycosylase superfamily protein | e value: 6.2e-71 | % identity: 43.84
Query:  MSVATKLQSHAEPVLESRAILGPGGNR-DRAPEKPKCKQEILKRTV---KQNKA---------------LPVVSESVVRDN-VSVGSSCSSDSLSSNYSA
        MSV  + +S      E R++LGP GN+  R P   K ++ ++++T+   K  KA                  +  S++R N  S+ +S SSD+ SS+  +
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNR-DRAPEKPKCKQEILKRTV---KQNKA---------------LPVVSESVVRDN-VSVGSSCSSDSLSSNYSA

Query:  KLLNLKAKPKPVKTVAAGGDANAT-----------TTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRH
          L++ +     K V   G  ++T            +    +   KRC WITP +DP Y+AFHDEEWGVPVHDDKKLFELL LS ALAEL+W  ILS+RH
Subjt:  KLLNLKAKPKPVKTVAAGGDANAT-----------TTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRH

Query:  IFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGF
        I R+VF DFDP ++A   + + T     A  LLS+ K+R+I++N+  V KI  E GS   Y W+FVN KP ++++RY RQVPVKT KAEF+SKDL++RGF
Subjt:  IFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGF

Query:  RCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC--DASVKDDMKLRVENRR
        R V PTV+YSF+Q +G+ NDHL+ CFRYQ+C  DA      K + +N R
Subjt:  RCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC--DASVKDDMKLRVENRR

AT1G75090.1 DNA glycosylase superfamily protein | e value: 1.0e-89 | % identity: 55.27
Query:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKAL--PVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTV--A
        MS+ +KL+S  +P+ ESRAIL   GNR +  +    K+  L   V ++ A   P  + SV  D+ S  SS S  S  +  ++  +   +K   V+ +   
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKAL--PVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTV--A

Query:  AGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKV
            A     SP +    KRC WITP SDP+Y+ FHDEEWGVPV DDKKLFELLV SQALAE +WP IL +R  FRK+F +FDPS+IA FTE    +L+V
Subjt:  AGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKV

Query:  NATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR
        N   +LS+QKLRAIVENA  VLK++QEFGSFSNYCW FVN KP+RN YRYGRQVPVK+PKAE++SKD+M+RGFRCVGPTV+YSFLQ SGIVNDHL  CFR
Subjt:  NATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFR

Query:  YQECDASVKDDMK
        YQEC+   + + K
Subjt:  YQECDASVKDDMK

AT1G80850.1 DNA glycosylase superfamily protein | e value: 5.6e-72 | % identity: 46.56
Query:  MSVATKLQSHAEPVLESRAILGPGGNR------DRAPEKPKC-KQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVK
        MS   +++S      E R++LGP GN+       +  +KP   K + L  T K  +  P+    + R+ +S+ +S SSD+ SS+  +  L++ +     +
Subjt:  MSVATKLQSHAEPVLESRAILGPGGNR------DRAPEKPKC-KQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVK

Query:  TVAAGGDANATTT-------------SPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDP
         +   G  +++++             S       KRC WITP SD  YIAFHDEEWGVPVHDDK+LFELL LS ALAEL+W  ILSKR +FR+VF DFDP
Subjt:  TVAAGGDANATTT-------------SPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDP

Query:  SSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSF
         +I+  T  + T+ ++ AT LLS+QKLR+I+ENANQV KI   FGSF  Y W+FVN+KP ++++RY RQVPVKT KAE +SKDL++RGFR V PTV+YSF
Subjt:  SSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSF

Query:  LQVSGIVNDHLVDCFRYQEC
        +Q +G+ NDHL  CFR+ +C
Subjt:  LQVSGIVNDHLVDCFRYQEC

AT5G57970.1 DNA glycosylase superfamily protein | e value: 8.1e-71 | % identity: 55.47
Query:  NVSVGSSCSSDSLSSNYSAKLL--------NLKAKPKPVKTVAAGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLS
        N S  S  S DS  S  S   L          K+ P   ++V + G   A  + P  S   KRC W+TP SDP YI FHDEEWGVPVHDDK+LFELLVLS
Subjt:  NVSVGSSCSSDSLSSNYSAKLL--------NLKAKPKPVKTVAAGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLS

Query:  QALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVK
         ALAE TWP ILSKR  FR+VF DFDP++I    E +       A+ LLSD KLRA++ENA Q+LK+ +E+GSF  Y WSFV  K I +++RY RQVP K
Subjt:  QALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVK

Query:  TPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        TPKAE +SKDL++RGFR VGPTVVYSF+Q +GI NDHL  CFR+  C
Subjt:  TPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

AT5G57970.2 DNA glycosylase superfamily protein | e value: 8.1e-71 | % identity: 55.47
Query:  NVSVGSSCSSDSLSSNYSAKLL--------NLKAKPKPVKTVAAGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLS
        N S  S  S DS  S  S   L          K+ P   ++V + G   A  + P  S   KRC W+TP SDP YI FHDEEWGVPVHDDK+LFELLVLS
Subjt:  NVSVGSSCSSDSLSSNYSAKLL--------NLKAKPKPVKTVAAGGDANATTTSPGLSVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLS

Query:  QALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVK
         ALAE TWP ILSKR  FR+VF DFDP++I    E +       A+ LLSD KLRA++ENA Q+LK+ +E+GSF  Y WSFV  K I +++RY RQVP K
Subjt:  QALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVK

Query:  TPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        TPKAE +SKDL++RGFR VGPTVVYSF+Q +GI NDHL  CFR+  C
Subjt:  TPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC


Sequences
CDS sequence
ATGTCTGTGGCTACGAAGCTCCAATCGCATGCTGAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCGGGCGGGAACAGAGATAGGGCGCCTGAGAAGCCAAAATGCAA
ACAGGAGATCTTGAAGAGGACAGTGAAGCAGAATAAGGCGCTTCCAGTGGTTTCTGAATCGGTTGTTCGGGACAATGTCTCCGTCGGGAGCTCCTGCTCTTCCGATTCTT
TATCAAGCAACTATTCGGCCAAATTGTTGAATCTGAAAGCGAAGCCCAAGCCTGTGAAGACTGTCGCTGCCGGCGGTGACGCTAACGCAACCACAACGTCGCCTGGGCTC
TCGGTTGCGGGGAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATTGCTTTTCACGACGAAGAATGGGGAGTCCCAGTTCATGACGACAAGAAGCTGTT
TGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTTGATCCTCAGCAAGAGACACATTTTCAGGAAAGTGTTCAATGATTTTGACCCATCTTCCATCG
CACATTTCACAGAAGCTGAGTTTACGACACTAAAAGTAAATGCCACGCAGCTCCTGTCTGATCAAAAGCTTCGTGCAATCGTGGAGAACGCTAACCAAGTACTCAAGATT
CAACAGGAATTTGGTTCCTTCAGCAACTATTGTTGGAGCTTTGTTAACAAGAAGCCTATCCGAAACCGATATCGATATGGTCGTCAAGTACCGGTAAAGACTCCTAAAGC
GGAGTTCATGAGCAAGGATTTGATGAAGAGAGGATTCCGTTGTGTCGGGCCAACTGTGGTTTATTCCTTCTTGCAAGTTAGCGGAATTGTTAACGATCACTTGGTCGACT
GCTTCAGGTATCAAGAGTGCGACGCAAGCGTCAAAGATGACATGAAATTAAGAGTAGAAAATCGGAGATCGGAGTTGCTTATTCGAGCTTTGGAGAAGTCTTCCTTGACG
ACCTGA
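
The CDS above can be checked against the protein sequence listed in this record by translating it codon by codon. The sketch below is a minimal illustration that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names CODON_TABLE and translate are hypothetical. To keep the example short it translates only the first 33 and last 18 nucleotides of the CDS, copied from the record.

```python
# Standard genetic code, written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Codons containing characters other than A, C, G, T raise KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 33 nt and last 18 nt of the CDS shown in this record.
cds_head = "ATGTCTGTGGCTACGAAGCTCCAATCGCATGCT"
cds_tail = "TCTTCCTTGACGACCTGA"

print(translate(cds_head))  # MSVATKLQSHA
print(translate(cds_tail))  # SSLTT*
```

Both fragments agree with the start (MSVATKLQSHA) and end (SSLTT) of the protein sequence in this record, with the final TGA codon read as the stop.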
mRNA sequence
TTAAATGTTGGATTTCCACTATTTAAATAATAAGTTGCGTCGTTCATTCGATTTTTATTAGACAATGGTGCAAGTGTAAATATAGATGGTAGGTAGTGGCAAACCGGTAA
AAGTTCGTCGCCACGCGTGCACTGTTACCGGCTGTTGTAGGTATCCCGTCCAAATCCACGCTTAAATAACCTTTCAATAGTCGACCCGTTTCTTCGTCGTTCTCCGCCCA
CGGAGTCTCTGCTCTTTCTCTCGCCACGCCCTTCTGATCTTCACTGACAAACCCTAGGGCTTTCTTTTCCCTCTCAGTTTCAACAATCGCTGCATTTTCCTCTCTGTCTC
GCCTTCAGATCGTACGGTTATTTTGTGTCTCGGGATAATTTAACGGTTGTCGAGCTTGTTCCGTTTTGATCCACCGAATTCGATTCTGGAAGGAATTTGAGTTGACGGAT
ATCTTTCTTGAGCAATGTCTGTGGCTACGAAGCTCCAATCGCATGCTGAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCGGGCGGGAACAGAGATAGGGCGCCTGAG
AAGCCAAAATGCAAACAGGAGATCTTGAAGAGGACAGTGAAGCAGAATAAGGCGCTTCCAGTGGTTTCTGAATCGGTTGTTCGGGACAATGTCTCCGTCGGGAGCTCCTG
CTCTTCCGATTCTTTATCAAGCAACTATTCGGCCAAATTGTTGAATCTGAAAGCGAAGCCCAAGCCTGTGAAGACTGTCGCTGCCGGCGGTGACGCTAACGCAACCACAA
CGTCGCCTGGGCTCTCGGTTGCGGGGAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATTGCTTTTCACGACGAAGAATGGGGAGTCCCAGTTCATGAC
GACAAGAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTTGATCCTCAGCAAGAGACACATTTTCAGGAAAGTGTTCAATGATTTTGA
CCCATCTTCCATCGCACATTTCACAGAAGCTGAGTTTACGACACTAAAAGTAAATGCCACGCAGCTCCTGTCTGATCAAAAGCTTCGTGCAATCGTGGAGAACGCTAACC
AAGTACTCAAGATTCAACAGGAATTTGGTTCCTTCAGCAACTATTGTTGGAGCTTTGTTAACAAGAAGCCTATCCGAAACCGATATCGATATGGTCGTCAAGTACCGGTA
AAGACTCCTAAAGCGGAGTTCATGAGCAAGGATTTGATGAAGAGAGGATTCCGTTGTGTCGGGCCAACTGTGGTTTATTCCTTCTTGCAAGTTAGCGGAATTGTTAACGA
TCACTTGGTCGACTGCTTCAGGTATCAAGAGTGCGACGCAAGCGTCAAAGATGACATGAAATTAAGAGTAGAAAATCGGAGATCGGAGTTGCTTATTCGAGCTTTGGAGA
AGTCTTCCTTGACGACCTGATCATAACGTGTTCAAAGACAACGGTTGGTTATCTCTAACTATAATCTTACAATTCACTGTTAATAAAATGGCTCTGAGTAGTTTGAGTTT
AACTGCCTGTTTTGAATTTTTAATTGCTTCTTTGTAGATGATGATGATGTATTAACGGACGACAAACATGTTGGTTTCTTGTAATTATATTTATCGTACGCCAATTAGGT
ATTGGGCCTTGTGGCTAAAGTTAGCCCAGCAGAACTGCGGCCTACTCTCTTGGTGGTGGCCCAACGATTTTTGGTTGCCACCACCCACACTTTATCGAAACCATTAATAT
TTTGTTTTAAAAATATTCTCCGTCGA
Protein sequence
MSVATKLQSHAEPVLESRAILGPGGNRDRAPEKPKCKQEILKRTVKQNKALPVVSESVVRDNVSVGSSCSSDSLSSNYSAKLLNLKAKPKPVKTVAAGGDANATTTSPGL
SVAGKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRHIFRKVFNDFDPSSIAHFTEAEFTTLKVNATQLLSDQKLRAIVENANQVLKI
QQEFGSFSNYCWSFVNKKPIRNRYRYGRQVPVKTPKAEFMSKDLMKRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDASVKDDMKLRVENRRSELLIRALEKSSLT
T