CuGenDBv2

Sed0014052 (gene) of Chayote v1 genome

Gene ID: Sed0014052
Organism: Sechium edule (Chayote v1)
Description: DNA glycosylase superfamily protein
Genome location: LG01:5800186..5802354
RNA-Seq Expression: Sed0014052
Synteny: Sed0014052
Gene Ontology terms: GO:0006284 - base-excision repair (biological process)
                     GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains: IPR005019 - Methyladenine glycosylase
                  IPR011257 - DNA glycosylase
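The genome location in this record is written as `sequence:start..end`. The following is a minimal sketch of how such a string could be parsed and its span computed. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `parse_location` and `span_length` are illustrative and not part of CuGenDB.

```python
import re

# Locations are assumed to look like "<sequence>:<start>..<end>",
# for example "LG01:5800186..5802354".
LOCATION_RE = re.compile(r"^(?P<seq>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location):
    """Split a location string into (sequence name, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start = int(match["start"])
    end = int(match["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return match["seq"], start, end


def span_length(location):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(parse_location("LG01:5800186..5802354"))
print(span_length("LG01:5800186..5802354"))
```

Under the inclusive-coordinate assumption, the span for this gene is 2,169 bp; with half-open coordinates it would be one base shorter.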


Homology
GenBank top hits (e value, % identity, alignment)
KAA0054725.1 putative GMP synthase [Cucumis melo var. makuwa] (e value: 8.3e-167; % identity: 80.71)
Query:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKK--------PPSAAVSPISPKSKSPHPPATKRPNDGNS-MTSCSDKILIPAAAVPARSSL
        MCRS++ALEATSVVVDSKFN+RP LQPT NR+LDRRNSLKK         P+AAVSP SPKSKSP PPATKR NDGN+ M S S+KILIPAAA   R++L
Subjt:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKK--------PPSAAVSPISPKSKSPHPPATKRPNDGNS-MTSCSDKILIPAAAVPARSSL

Query:  DRKKSKSFKLSGNGNVISD-------------IDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV
        DRKKSKSFKL GNGNVI D              +SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSA F+KIVP+DS IKP  E +RCSFITPNSDPIYV
Subjt:  DRKKSKSFKLSGNGNVISD-------------IDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV

Query:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI
        AYHD+EWGVPVHDDK+LFELLVLSVAQVGSDW SILKKRQ FRNAFS+FDSEIVANFS+KQMVSISTEYGIDINRVRGVVDN+IRILQ+KKEFGS DKYI
Subjt:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI

Query:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRR--APPAEAAEVEVTTEA
        WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAA RR  AP     EVE  T A
Subjt:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRR--APPAEAAEVEVTTEA

KAG6570606.1 hypothetical protein SDJN03_29521, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.6e-165; % identity: 79.85)
Query:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP-ARSSLDRKKSKSF
        MCRS+QALEAT+VVVDSKF ARP LQPT NR+LDRRNSLKKPPSAAVSP SPKSKSP PPATKR ND N M S SDKILIPAAA+   +++LDRKKSKSF
Subjt:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP-ARSSLDRKKSKSF

Query:  KLSGNGNVI-------------------SDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV
        KL+GNGNV+                   S +  DSPGSIAAVRREQVALQQAQRKM+IAHYGRSKSA FDK+VP+DS IKP  E +RCSFITPNSDPIYV
Subjt:  KLSGNGNVI-------------------SDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV

Query:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI
        AYHD+EWGVPVHDDK+LFELLVLSVAQVGSDW SILKKRQ FRNAFS+F +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRIL++KKEFGS DKYI
Subjt:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI

Query:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTEA
        WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC++ AADRRAP   A  VE TT A
Subjt:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTEA

XP_004139917.2 uncharacterized protein LOC101218536 [Cucumis sativus] (e value: 5.8e-168; % identity: 81.54)
Query:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKK-------PPSAAVSPISPKSKSPHPPATKRPNDGNS-MTSCSDKILIPAAAVPARSSLD
        MCRS++ LEATSVVVDSKFN+RP LQPTGNR+LDRRNSLKK       P +AAVSP SPKSKSP PPATKR NDGN+ M S S+KILIPAA    R++LD
Subjt:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKK-------PPSAAVSPISPKSKSPHPPATKRPNDGNS-MTSCSDKILIPAAAVPARSSLD

Query:  RKKSKSFKLSGNGNVISD-------------IDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYVA
        RKKSKSFKL GNGNVI D              +SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSA F+KIVP+DS IKP  E +RCSFITPNSDPIYVA
Subjt:  RKKSKSFKLSGNGNVISD-------------IDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYVA

Query:  YHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYIW
        YHD+EWGVPVHDDK+LFELLVLSVAQVGSDW SILKKRQ FRNAFS+FDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQ+KKEFGS DKYIW
Subjt:  YHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYIW

Query:  GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRR--APPAEAAEVEVT
        GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAA RR  AP     EVE T
Subjt:  GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRR--APPAEAAEVEVT

XP_022943791.1 uncharacterized protein LOC111448434 [Cucurbita moschata] (e value: 7.0e-166; % identity: 80.1)
Query:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP-ARSSLDRKKSKSF
        MCRS+QALEATSVVVDSKF ARP LQPT NR+LDRRNSLKKPPSAAVSP SPKSKSP PPATKR ND N M S SDKILIPAAA+   +++LDRKKSKSF
Subjt:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP-ARSSLDRKKSKSF

Query:  KLSGNGNVI-------------------SDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV
        KL+GNGNV+                   S +  DSPGSIAAVRREQVALQQAQRKM+IAHYGRSKSA FDK+VP+DS IKP  E +RCSFITPNSDPIYV
Subjt:  KLSGNGNVI-------------------SDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV

Query:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI
        AYHD+EWGVPVHDDK+LFELLVLSVAQVGSDW SILKKRQ FRNAFS+F +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRIL++KKEFGS DKYI
Subjt:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI

Query:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTEA
        WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC++ AADRRAP   A  VE TT A
Subjt:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTEA

XP_023511876.1 uncharacterized protein LOC111776761 [Cucurbita pepo subsp. pepo] (e value: 7.0e-166; % identity: 80.1)
Query:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP-ARSSLDRKKSKSF
        MCRS+QALEATSVVVDSKF ARP LQPTGNR+LDRRNSLKKPPSAAVSP SPKSKSP PPATKR ND N M S SDKILIPAAA+   +++LDRKKSKSF
Subjt:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP-ARSSLDRKKSKSF

Query:  KLSGNGNVI-------------------SDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV
        KL+GNGNV+                   S +  DSPGSIAAVRREQVALQQAQRKM+IAHYGRSKSA FDK+VPIDS IKP  E +RCSFITPNSDPIYV
Subjt:  KLSGNGNVI-------------------SDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV

Query:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI
        AYHD+EWGVPVHDDK+LFELLVLSVAQVGSDW SILKKRQ FRNAFS+F +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRIL++KKEF S DKYI
Subjt:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI

Query:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTEA
        WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTV+HSFMQAAGLTNDHLTSCHRHLHC++ AADRRAP   A  VE TT A
Subjt:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTEA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KED6 Uncharacterized protein (e value: 2.8e-168; % identity: 81.54)
Query:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKK-------PPSAAVSPISPKSKSPHPPATKRPNDGNS-MTSCSDKILIPAAAVPARSSLD
        MCRS++ LEATSVVVDSKFN+RP LQPTGNR+LDRRNSLKK       P +AAVSP SPKSKSP PPATKR NDGN+ M S S+KILIPAA    R++LD
Subjt:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKK-------PPSAAVSPISPKSKSPHPPATKRPNDGNS-MTSCSDKILIPAAAVPARSSLD

Query:  RKKSKSFKLSGNGNVISD-------------IDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYVA
        RKKSKSFKL GNGNVI D              +SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSA F+KIVP+DS IKP  E +RCSFITPNSDPIYVA
Subjt:  RKKSKSFKLSGNGNVISD-------------IDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYVA

Query:  YHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYIW
        YHD+EWGVPVHDDK+LFELLVLSVAQVGSDW SILKKRQ FRNAFS+FDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQ+KKEFGS DKYIW
Subjt:  YHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYIW

Query:  GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRR--APPAEAAEVEVT
        GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAA RR  AP     EVE T
Subjt:  GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRR--APPAEAAEVEVT

A0A5A7UM21 Putative GMP synthase (e value: 4.0e-167; % identity: 80.71)
Query:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKK--------PPSAAVSPISPKSKSPHPPATKRPNDGNS-MTSCSDKILIPAAAVPARSSL
        MCRS++ALEATSVVVDSKFN+RP LQPT NR+LDRRNSLKK         P+AAVSP SPKSKSP PPATKR NDGN+ M S S+KILIPAAA   R++L
Subjt:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKK--------PPSAAVSPISPKSKSPHPPATKRPNDGNS-MTSCSDKILIPAAAVPARSSL

Query:  DRKKSKSFKLSGNGNVISD-------------IDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV
        DRKKSKSFKL GNGNVI D              +SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSA F+KIVP+DS IKP  E +RCSFITPNSDPIYV
Subjt:  DRKKSKSFKLSGNGNVISD-------------IDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV

Query:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI
        AYHD+EWGVPVHDDK+LFELLVLSVAQVGSDW SILKKRQ FRNAFS+FDSEIVANFS+KQMVSISTEYGIDINRVRGVVDN+IRILQ+KKEFGS DKYI
Subjt:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI

Query:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRR--APPAEAAEVEVTTEA
        WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAA RR  AP     EVE  T A
Subjt:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRR--APPAEAAEVEVTTEA

A0A6J1D778 uncharacterized protein LOC111017989 (e value: 2.4e-151; % identity: 77.46)
Query:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKK------PPSAAVSPISPKSKSPHPPATKRPND-GNSMTSCSDKILIPAAAVPARSSLDR
        MCRS+Q +EATSVV       R  LQPT NR L RRNSLKK      PP +  SP SPKSKSP PPATKR ND   +M S SDK+++PAAA P   +LDR
Subjt:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKK------PPSAAVSPISPKSKSPHPPATKRPND-GNSMTSCSDKILIPAAAVPARSSLDR

Query:  KKSKSFKLSGNG--------NVISDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYVAYHDQ
        KKSKSFKL G+G        +  S +  +SPGSIAAVRREQVALQQAQRKMKIAHYGRSKSA F+KIVPIDS  KP  E +RCSFITPNSDPIYVAYHD+
Subjt:  KKSKSFKLSGNG--------NVISDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYVAYHDQ

Query:  EWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYIWGFVN
        EWGVPVH+DKVLFELLVLSVAQVGSDW SILKKRQ FRNAFS+FD+E VANFSDKQMVSISTEYGIDINRVRGVVDNAIRIL++KKEFGS DKYIWGFVN
Subjt:  EWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYIWGFVN

Query:  NKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTE
        +KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHL CTL+AA RRAPP  A EVE T+E
Subjt:  NKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTE

A0A6J1FSP1 uncharacterized protein LOC111448434 (e value: 3.4e-166; % identity: 80.1)
Query:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP-ARSSLDRKKSKSF
        MCRS+QALEATSVVVDSKF ARP LQPT NR+LDRRNSLKKPPSAAVSP SPKSKSP PPATKR ND N M S SDKILIPAAA+   +++LDRKKSKSF
Subjt:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP-ARSSLDRKKSKSF

Query:  KLSGNGNVI-------------------SDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV
        KL+GNGNV+                   S +  DSPGSIAAVRREQVALQQAQRKM+IAHYGRSKSA FDK+VP+DS IKP  E +RCSFITPNSDPIYV
Subjt:  KLSGNGNVI-------------------SDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV

Query:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI
        AYHD+EWGVPVHDDK+LFELLVLSVAQVGSDW SILKKRQ FRNAFS+F +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRIL++KKEFGS DKYI
Subjt:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI

Query:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTEA
        WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC++ AADRRAP   A  VE TT A
Subjt:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTEA

A0A6J1J7H3 uncharacterized protein LOC111484173 (e value: 1.9e-164; % identity: 79.34)
Query:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP-ARSSLDRKKSKSF
        MCRS+QALEATSVVVDSKF ARP LQPT NR+LDRRNSLKKPPSAAVSP SPKSKSP PPATKR N+ N M S SDKILIPAAA+   +++LDRKKSKSF
Subjt:  MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP-ARSSLDRKKSKSF

Query:  KLSGNGNVI-------------------SDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV
        KL+GNGNV+                   S +  DSPGSIAAVRREQVALQQAQRKM+IAHYGRSKSA FDK+VP+DS IKP  E +RCSFITPNSDPIYV
Subjt:  KLSGNGNVI-------------------SDI--DSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDS-IKPVEEQKRCSFITPNSDPIYV

Query:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI
        AYHD+EWGVPVHDDK+LFELLVLSVAQVGSDW SILKKRQ FRNAFS+F +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRIL++KKEFGS DKYI
Subjt:  AYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYI

Query:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTEA
        WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ AGLTNDHLTSCHRHLHC++ AA RRAP   A  VE TT A
Subjt:  WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTEA

SwissProt top hits (e value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 (e value: 1.5e-33; % identity: 38.89)
Query:  KRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINR--VRGVVDNA
        +RC ++  + DP+Y+AYHD EWGVP  D K LFE++ L   Q G  W ++LKKR+ +R  F  FD   VA   ++ +  +  + GI  +R  ++ ++ NA
Subjt:  KRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINR--VRGVVDNA

Query:  IRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
           LQ+++       ++W FVN++P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Subjt:  IRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

P44321 DNA-3-methyladenine glycosylase (e value: 9.3e-28; % identity: 35.75)
Query:  RCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVR--GVVDNAI
        RC ++   S  IY+ YHD+EWG P  D + LFE + L   Q G  W ++LKKR+ +R AF  FD + +A  +   + +     G+  +R +   +V NA 
Subjt:  RCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVR--GVVDNAI

Query:  RILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
          L ++K   +   +IW FVN+KP          +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Subjt:  RILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] (e value: 6.6e-42; % identity: 44.33)
Query:  DSIKPVEEQKRCSFITPNSD---PIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDI
        DS + V E+ RC++ T   +    +Y  YHD EWG P+H+DK LFE LVL   Q G  W +ILKKR+ FR AF +FD  IVAN+ + ++  +    GI  
Subjt:  DSIKPVEEQKRCSFITPNSD---PIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDI

Query:  NR--VRGVVDNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR
        NR  +   + NA   + V++EFGS DKYIWGFV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ NDHLTSC +
Subjt:  NR--VRGVVDNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR

Arabidopsis top hits (e value, % identity, alignment)
AT1G15970.1 DNA glycosylase superfamily protein (e value: 2.4e-55; % identity: 38.17)
Query:  RPALQPTGNRLLDRRNSLK--KP--PSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVPARSSLDRK---KSKSFKLSGNGNVISDIDSP
        R  L PTGN+L  +   +K  KP      +     K+K P  PA+ R       + CS  +   +A++ A  S D     +S    ++ + +    +   
Subjt:  RPALQPTGNRLLDRRNSLK--KP--PSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVPARSSLDRK---KSKSFKLSGNGNVISDIDSP

Query:  GSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDSIKPVEEQKRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILK
        GS+++ R+  V  ++ +        GR                     KRC++ITP +DP YVA+HD+EWGVPVHDDK LFELL LS A     W  IL 
Subjt:  GSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDSIKPVEEQKRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILK

Query:  KRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVR
        +R + R  F +FD   VA  +DK++ +  T     ++  ++R ++DN+  + ++  E GSL KY+W FVNNKP   Q++   ++PVKTSK+E ISKD+VR
Subjt:  KRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVR

Query:  RGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIA
        RGFRSV PTV++SFMQAAGLTNDHL  C R+  C + A
Subjt:  RGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIA

AT3G12710.1 DNA glycosylase superfamily protein (e value: 9.7e-97; % identity: 60)
Query:  SKSPHPPATKR----PNDGNSMTSCSDKI----LIPAAAVPARSSLDRKKSKSFKLSGNGNVISDIDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSA
        SK+     TKR    P+  NS+   S+ +    ++   A   R SL+RKKSKSFK   + +     ++PGSIAAVRREQVA QQA RK+KIAHYGRSKS 
Subjt:  SKSPHPPATKR----PNDGNSMTSCSDKI----LIPAAAVPARSSLDRKKSKSFKLSGNGNVISDIDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSA

Query:  ---HFDKIVPIDSIKPVEEQKRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSI
              K+VP+ +  P    +RCSF+TP SDPIYVAYHD+EWGVPVHDDK LFELL LS AQVGSDW S L+KR  +R AF  F++E+VA  ++K+M +I
Subjt:  ---HFDKIVPIDSIKPVEEQKRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSI

Query:  STEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR
        S EY I++++VRGVV+NA +I+++KK F SL+KY+WGFVN+KP S  YK GHKIPVKTSKSE+ISKDMVRRGFR VGPTVVHSFMQAAGLTNDHL +C R
Subjt:  STEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR

Query:  HLHCTLIAAD
        H  CTL+A +
Subjt:  HLHCTLIAAD

AT5G44680.1 DNA glycosylase superfamily protein (e value: 6.5e-93; % identity: 53.76)
Query:  SKFNARPALQPTGNRL--LDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP--ARSSLDRKKSKSFKLSGNGN------
        S+ N RP LQP  N++  LDRRNSLKK P   ++PI+ K  SP P +   P     ++  +  +  PA +     RSS  + K      + +G       
Subjt:  SKFNARPALQPTGNRL--LDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVP--ARSSLDRKKSKSFKLSGNGN------

Query:  VISDIDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHF-DKIVPIDSIKPVEEQKRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQV
        ++     PGSIAA RRE+VA++Q +RK KI+HYGR KS    +K + ++     E++KRCSFIT +SDPIYVAYHD+EWGVPVHDD +LFELLVL+ AQV
Subjt:  VISDIDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHF-DKIVPIDSIKPVEEQKRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQV

Query:  GSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSET
        GSDW S+LK+R  FR AFS F++E+VA+F++K++ SI  +YGI++++V  VVDNA +IL+VK++ GS +KYIWGF+ +KP + +Y S  KIPVKTSKSET
Subjt:  GSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSET

Query:  ISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAA
        ISKDMVRRGFR VGPTV+HS MQAAGLTNDHL +C RHL CT +AA
Subjt:  ISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAA

AT5G57970.1 DNA glycosylase superfamily protein (e value: 2.3e-58; % identity: 54.5)
Query:  EEQKRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDIN--RVRGVV
        E +KRC+++TPNSDP Y+ +HD+EWGVPVHDDK LFELLVLS A     W +IL KRQ FR  F++FD   +   ++K+++   +     ++  ++R V+
Subjt:  EEQKRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDIN--RVRGVV

Query:  DNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC
        +NA +IL+V +E+GS DKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLTSC R  HC
Subjt:  DNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC

AT5G57970.2 DNA glycosylase superfamily protein (e value: 2.3e-58; % identity: 54.5)
Query:  EEQKRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDIN--RVRGVV
        E +KRC+++TPNSDP Y+ +HD+EWGVPVHDDK LFELLVLS A     W +IL KRQ FR  F++FD   +   ++K+++   +     ++  ++R V+
Subjt:  EEQKRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFRNAFSNFDSEIVANFSDKQMVSISTEYGIDIN--RVRGVV

Query:  DNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC
        +NA +IL+V +E+GS DKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLTSC R  HC
Subjt:  DNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC
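In the alignments above, the unlabeled line under each Query line is the BLAST-style match line: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch, the % identity column could be recomputed from a Query line and its match line as shown here. The function name `midline_stats` is illustrative, and the example uses only the first 12 columns of the P05100 alignment, so it is not expected to reproduce the full-length value reported for that hit.

```python
def midline_stats(query, midline):
    """Return (% identity, % positives) for one aligned block.

    Both strings must cover the same alignment columns. Letters in the
    match line count as identities; letters plus '+' count as positives.
    """
    if len(query) != len(midline):
        raise ValueError("query and match line must have equal length")
    if not query:
        raise ValueError("empty alignment block")
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = identical + midline.count("+")
    columns = len(query)
    return 100.0 * identical / columns, 100.0 * positive / columns


# First 12 alignment columns of the P05100 hit (fragment only).
identity, positives = midline_stats("KRCSFITPNSDP", "+RC ++  + DP")
print(f"identity {identity:.2f}%, positives {positives:.2f}%")
```

To cover a whole hit, the Query and match lines of every block would be concatenated before the call, taking care to keep leading and trailing spaces of the match line intact.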


Sequences
CDS sequence
ATGTGTCGCTCCGACCAAGCCTTGGAAGCCACTTCCGTCGTCGTTGATTCCAAATTCAACGCCCGTCCCGCTCTTCAACCCACCGGCAACCGCCTCCTCGACCGCCGTAA
TTCCCTCAAAAAACCCCCCTCCGCCGCCGTCTCACCCATTTCCCCAAAGTCCAAATCCCCCCATCCGCCGGCCACCAAGCGCCCCAACGACGGCAACTCCATGACCTCCT
GCTCCGACAAGATTCTCATCCCCGCCGCCGCCGTTCCCGCCCGGTCTTCCTTGGACAGGAAGAAATCCAAGAGCTTCAAATTGAGCGGAAATGGGAATGTCATTTCCGAC
ATTGACTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAACAGGCGCAGAGGAAAATGAAAATTGCCCATTATGGAAGATCTAAATCTGCCCATTT
TGACAAAATTGTTCCCATTGATTCAATTAAACCTGTTGAAGAACAAAAAAGATGTAGCTTCATCACTCCCAATTCAGATCCAATTTATGTTGCTTATCATGATCAAGAAT
GGGGCGTCCCTGTTCATGATGACAAAGTACTGTTTGAACTGCTGGTTCTGAGTGTGGCTCAAGTGGGTTCTGATTGGGCTTCCATTTTGAAGAAACGCCAAGTTTTCAGA
AATGCATTTTCGAATTTCGATTCAGAAATTGTGGCTAATTTTTCCGACAAACAGATGGTTTCAATCAGCACAGAATATGGGATCGACATTAACAGAGTCCGAGGAGTCGT
CGACAATGCAATCCGAATCCTCCAGGTGAAGAAAGAATTTGGGTCGTTGGACAAATACATTTGGGGATTTGTGAACAACAAGCCATTTTCACCGCAGTACAAATCCGGCC
ACAAAATTCCGGTCAAGACATCAAAATCAGAGACCATAAGCAAAGACATGGTCCGGCGAGGATTCCGGTCGGTCGGTCCGACCGTGGTTCACTCTTTCATGCAAGCCGCC
GGTCTGACCAACGACCATCTCACCAGCTGTCACCGGCACCTCCACTGCACTTTAATCGCCGCCGACCGTCGTGCGCCGCCGGCGGAGGCGGCGGAAGTAGAGGTGACGAC
GGAGGCGGATTCTGTAACTATTTAG
mRNA sequence
TTCATTTCCAAATTTCCTTTATAAAAGCCCTCCCTCTTTCCTTCACTAACTCCCATTTCTGTTCTTTAAAAAAAAAAAAACTCAAATTTCTCAATTTCTTTTCCTTACTA
ATAACAATCCCAAAAAGAAGATCCAAAAAATTAAAAAAAAAACAATCCCAAATAGAAAAACGATGTGTCGCTCCGACCAAGCCTTGGAAGCCACTTCCGTCGTCGTTGAT
TCCAAATTCAACGCCCGTCCCGCTCTTCAACCCACCGGCAACCGCCTCCTCGACCGCCGTAATTCCCTCAAAAAACCCCCCTCCGCCGCCGTCTCACCCATTTCCCCAAA
GTCCAAATCCCCCCATCCGCCGGCCACCAAGCGCCCCAACGACGGCAACTCCATGACCTCCTGCTCCGACAAGATTCTCATCCCCGCCGCCGCCGTTCCCGCCCGGTCTT
CCTTGGACAGGAAGAAATCCAAGAGCTTCAAATTGAGCGGAAATGGGAATGTCATTTCCGACATTGACTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAACAGGTGGCG
CTGCAACAGGCGCAGAGGAAAATGAAAATTGCCCATTATGGAAGATCTAAATCTGCCCATTTTGACAAAATTGTTCCCATTGATTCAATTAAACCTGTTGAAGAACAAAA
AAGATGTAGCTTCATCACTCCCAATTCAGATCCAATTTATGTTGCTTATCATGATCAAGAATGGGGCGTCCCTGTTCATGATGACAAAGTACTGTTTGAACTGCTGGTTC
TGAGTGTGGCTCAAGTGGGTTCTGATTGGGCTTCCATTTTGAAGAAACGCCAAGTTTTCAGAAATGCATTTTCGAATTTCGATTCAGAAATTGTGGCTAATTTTTCCGAC
AAACAGATGGTTTCAATCAGCACAGAATATGGGATCGACATTAACAGAGTCCGAGGAGTCGTCGACAATGCAATCCGAATCCTCCAGGTGAAGAAAGAATTTGGGTCGTT
GGACAAATACATTTGGGGATTTGTGAACAACAAGCCATTTTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTCAAGACATCAAAATCAGAGACCATAAGCAAAGACA
TGGTCCGGCGAGGATTCCGGTCGGTCGGTCCGACCGTGGTTCACTCTTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTCACCAGCTGTCACCGGCACCTCCACTGC
ACTTTAATCGCCGCCGACCGTCGTGCGCCGCCGGCGGAGGCGGCGGAAGTAGAGGTGACGACGGAGGCGGATTCTGTAACTATTTAGAATTGACTTAACAGATAAAAAGG
AAAAAAAAATGATAACCTTTACCGCAAGTCAATCAATGATGATTTGCTTGTTAATTAACTTGATAAACTGTTTTTTTTTTTTTTTTTGGTTTTTGTGGGG
Protein sequence
MCRSDQALEATSVVVDSKFNARPALQPTGNRLLDRRNSLKKPPSAAVSPISPKSKSPHPPATKRPNDGNSMTSCSDKILIPAAAVPARSSLDRKKSKSFKLSGNGNVISD
IDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAHFDKIVPIDSIKPVEEQKRCSFITPNSDPIYVAYHDQEWGVPVHDDKVLFELLVLSVAQVGSDWASILKKRQVFR
NAFSNFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQVKKEFGSLDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAA
GLTNDHLTSCHRHLHCTLIAADRRAPPAEAAEVEVTTEADSVTI
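The CDS and protein sequences above can be checked against each other by translating the CDS. Below is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` is an illustrative helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a CDS given as a string; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon: end of the protein
        protein.append(aa)
    return "".join(protein)


# First 21 and last 18 nucleotides of the CDS listed above.
print(translate("ATGTGTCGCTCCGACCAAGCC"))
print(translate("GATTCTGTAACTATTTAG"))
```

The two fragments translate to MCRSDQA and DSVTI, which agree with the start and end of the protein sequence listed above; passing the full CDS, with its line breaks, would allow the whole protein to be compared.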