CuGenDBv2

Moc11g02120 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g02120
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: DNA glycosylase superfamily protein
Genome location: chr11:1368321..1370441
RNA-Seq Expression: Moc11g02120
Synteny: Moc11g02120
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
IPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase
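
The genome location field can be turned into a span length with a few lines of code. The sketch below is illustrative only: the helper names `Region` and `parse_location` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' string such as 'chr11:1368321..1370441'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Region(seqid, start, end)


region = parse_location("chr11:1368321..1370441")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption the locus spans 2121 bp, which is longer than the coding sequence listed in this record.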


Homology
GenBank top hits (e value, % identity, alignment)
KAG6591330.1 hypothetical protein SDJN03_13676, partial [Cucurbita argyrosperma subsp. sororia]; e value 4.4e-151; % identity 85.21
Query:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
        M VA K  SHAKPVLESRAILGPGGNRDR PEKP+CK E TL  +EKQNKALPA+P+SVIRDN+S+GSSCSSDSLSSNYS KLLNPKVKP  VKPVKAVA
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA

Query:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
        AG + + TTT+PR SVP KRC WITPYSDPLYIAFHDEEWGVP+HDD+KLFELLVLSQALAELTWP IL KRD+FRKVFNDFDPS+IA+FT+NEF TLK 
Subjt:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV

Query:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
        NGIQ+L+EPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPVKTPKAE MSKDL+RRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
Subjt:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQECDANVKDD
        YQECD   KDD
Subjt:  YQECDANVKDD

XP_022141169.1 uncharacterized protein LOC111011623 [Momordica charantia]; e value 3.9e-192; % identity 100
Query:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
        MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA

Query:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
        AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
Subjt:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV

Query:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
        NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
Subjt:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQECDANVKDDMKPRVEDLRLELHNGASEKPCLSRS
        YQECDANVKDDMKPRVEDLRLELHNGASEKPCLSRS
Subjt:  YQECDANVKDDMKPRVEDLRLELHNGASEKPCLSRS

XP_022935907.1 uncharacterized protein LOC111442674 [Cucurbita moschata]; e value 3.6e-153; % identity 82.98
Query:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
        M VA K  SHAKPVLESRAILGPGGNRDR PEKP+CK E TL  +EKQNKALPA+P+SV+RDN+S+GSSCSSDSLSSNYS KLLNPKVKP  VKPVKAVA
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA

Query:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
        AG + + TTT+PR SVP KRC WITPYSDPLYIAFHDEEWGVP+HDD+KLFELLVLSQALAELTWP IL KRD+FRKVFNDFDPS+IA+FT+NEF TLK 
Subjt:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV

Query:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
        NGIQ+L+EPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPVKTPKAE MSKDL+RRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
Subjt:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQECDANVKDDMKPRVEDLRLELHNGASE
        YQECD      MK RVED R EL  GA E
Subjt:  YQECDANVKDDMKPRVEDLRLELHNGASE

XP_023535246.1 uncharacterized protein LOC111796735 [Cucurbita pepo subsp. pepo]; e value 2.4e-149; % identity 81.76
Query:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
        M VA K  SHAKPVLESR ILGPGGNRDR PEKP+CK E TL  +EKQNKALPA+P+SVIRDN S+GSSCSSDSL S+YS KLLNPKVKP  VKPVKAVA
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA

Query:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
        AG + + T+T+P  SVP KRC WITPYSDPLYIAFHDEEWGVP+HDD+KLFELLVLSQALAELTWP IL KRD+FRKVFNDFDPS+IAKFT+NEF TLK 
Subjt:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV

Query:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
        NGIQ+L+EPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPVKTPKAE MSKDL+RRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
Subjt:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQECDANVKDDMKPRVEDLRLELHNGASE
        YQECD      MK RVE  R EL  GA E
Subjt:  YQECDANVKDDMKPRVEDLRLELHNGASE

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida]; e value 2.7e-156; % identity 83.28
Query:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
        M VA K +SHAKPVLESR ILGPGGNRDR PEKP+CK + TL KTEKQN+ALP + +SVIRDNVSVGSSCSSDS+SSNYSAKLL PKVKP AVKPVKAVA
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA

Query:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
        AG + +AT  SP  S+P KRCDWIT +SDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWP ILSKRD+FRKV NDFDPS+IA+FTENEF TLKV
Subjt:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV

Query:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
        N IQ+L+EPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRN FRY RQVPVKTPKAE MSKDLIRRGFRCVGPTVVYSFMQV GIVNDHLVNCFR
Subjt:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQECDANVKDDMKPRVEDLRLELHNGASEKPCLSR
        YQECDA +KDD K RVED R E   GA EKPCL+R
Subjt:  YQECDANVKDDMKPRVEDLRLELHNGASEKPCLSR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CHU1 uncharacterized protein LOC111011623; e value 1.9e-192; % identity 100
Query:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
        MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA

Query:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
        AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
Subjt:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV

Query:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
        NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
Subjt:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQECDANVKDDMKPRVEDLRLELHNGASEKPCLSRS
        YQECDANVKDDMKPRVEDLRLELHNGASEKPCLSRS
Subjt:  YQECDANVKDDMKPRVEDLRLELHNGASEKPCLSRS

A0A6J1F6S0 uncharacterized protein LOC111442674; e value 1.7e-153; % identity 82.98
Query:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
        M VA K  SHAKPVLESRAILGPGGNRDR PEKP+CK E TL  +EKQNKALPA+P+SV+RDN+S+GSSCSSDSLSSNYS KLLNPKVKP  VKPVKAVA
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA

Query:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
        AG + + TTT+PR SVP KRC WITPYSDPLYIAFHDEEWGVP+HDD+KLFELLVLSQALAELTWP IL KRD+FRKVFNDFDPS+IA+FT+NEF TLK 
Subjt:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV

Query:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
        NGIQ+L+EPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPVKTPKAE MSKDL+RRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
Subjt:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQECDANVKDDMKPRVEDLRLELHNGASE
        YQECD      MK RVED R EL  GA E
Subjt:  YQECDANVKDDMKPRVEDLRLELHNGASE

A0A6J1FIT9 uncharacterized protein LOC111446125; e value 1.3e-148; % identity 81.14
Query:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
        M VA K +SHA+PVLESRAILGPGGNRDR PEKP+CK E  L +T KQNKALP V +SV+RDNVSVGSSCSSDSLSSNYSAKLLN K KP   KPVK VA
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA

Query:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
        AG +A+ATTTSP  SV  KRCDWITPYSDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWP ILSKR +FRKVFNDFDPSSIA FTE EF TLKV
Subjt:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV

Query:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
        N  Q+L++ KLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPKAE MSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFR
Subjt:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQECDANVKDDMKPRVEDLRLELHNGASEKPCLS
        YQECDA+VKDDMK RVE+ R EL   A EK  L+
Subjt:  YQECDANVKDDMKPRVEDLRLELHNGASEKPCLS

A0A6J1IHE9 uncharacterized protein LOC111476975; e value 2.2e-148; % identity 81.46
Query:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
        M VA K  SHAKPVLESRAILGPGGNRDR PEKP+CK E TL  +EKQNKALPA+P+SVIRDN+S+GSSCSSDSLSSN SAKLLNPK     VKPVKAVA
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA

Query:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
        AG + + TTT+PR SVP KRC WITPYSDPLYIAFHDEEWGVP+HDD+KLFELLVLSQALAELTWP IL KRD+FRKVFNDFDPS+IA+FT+NEF TLK 
Subjt:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV

Query:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
        NGIQ+L+EPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQ+PVKTPKAE MSKDL+RRGFRCVGPTVVYSFMQVTGIVNDHLV+CFR
Subjt:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQECDANVKDDMKPRVEDLRLELHNGASE
        YQECD      MK RVED   EL  GA E
Subjt:  YQECDANVKDDMKPRVEDLRLELHNGASE

A0A6J1J188 uncharacterized protein LOC111480412; e value 1.9e-147; % identity 80.24
Query:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA
        M VA K +SHA+PVLESRAILGPGGNRDR PEKP+CK E  L +T KQNKALP V +SV+RDN+SVGSSCSSDSLSSNYSAKLLN K KP   KPVK VA
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVA

Query:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
        AG +A+ATTTSP   V  KRCDWITPYSDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWP ILSKR +FR VFNDFDPSSIA+FTE EF TLKV
Subjt:  AGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV

Query:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
        N  Q+L++ KLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPKAE MSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFR
Subjt:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQECDANVKDDMKPRVEDLRLELHNGASEKPCLS
        YQECDA+VKDDMK RVE+ R EL   A EK  L+
Subjt:  YQECDANVKDDMKPRVEDLRLELHNGASEKPCLS

SwissProt top hits (e value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1; e value 1.2e-37; % identity 40.98
Query:  KRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKVNGIQVLTEPKLRAIVENA
        +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W ++L KR+ +R  F+ FDP  +A   E +   L  +   +    K++AI+ NA
Subjt:  KRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKVNGIQVLTEPKLRAIVENA

Query:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
           L+++Q    F ++ WSFVN +P   +     ++P  T  ++++SK L +RGF+ VG T+ YSFMQ  G+VNDH+V C  Y
Subjt:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

P44321 DNA-3-methyladenine glycosylase; e value 3.2e-35; % identity 42.46
Query:  RCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKVNGIQVLTEPKLRAIVENAN
        RC W+   S  +YI +HD+EWG P  D +KLFE + L    A L+W ++L KR+ +R+ F+ FDP  IAK T  +      N   +    KL AIV+NA 
Subjt:  RCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKVNGIQVLTEPKLRAIVENAN

Query:  QVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNC
          L +++   +FS++ WSFVN KPI N     R VP KT  ++++SK L +RGF  +G T  Y+FMQ  G+V+DHL +C
Subjt:  QVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]; e value 3.5e-42; % identity 40.44
Query:  SAKLLNPKVKPYAV-KPVKAVAAGSEADATTTSPRHSVPRKRCDWITPYSD---PLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVF
        SA   +P++ P ++ +     + G EA  +    R  V   RC W T   +    LY  +HD EWG P+H+DKKLFE LVL    A L+W +IL KR+ F
Subjt:  SAKLLNPKVKPYAV-KPVKAVAAGSEADATTTSPRHSVPRKRCDWITPYSD---PLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVF

Query:  RKVFNDFDPSSIAKFTENEFATLKVNGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRC
        R  F+DFDP  +A + E++   L  N   +    K+ A + NA   + +Q+EFGSF  Y W FV  KPI N F     +P  TP ++ ++KDL +RGF+ 
Subjt:  RKVFNDFDPSSIAKFTENEFATLKVNGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRC

Query:  VGPTVVYSFMQVTGIVNDHLVNCFR
        VG T +Y+ MQ  G+VNDHL +CF+
Subjt:  VGPTVVYSFMQVTGIVNDHLVNCFR

Arabidopsis top hits (e value, % identity, alignment)
AT1G15970.1 DNA glycosylase superfamily protein; e value 4.3e-72; % identity 44.13
Query:  MYVAAKFRSHAKPVLESRAILGPGGNR-DRVP-----EKPRCKHETTLTKTEKQNK-ALPAVP-----------DSVIRDNVS---------VGSSCSSD
        M V  +FRS      E R++LGP GN+  R P     EKP  +     +K EK  K   PA P            S++R N +           SSC S 
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNR-DRVP-----EKPRCKHETTLTKTEKQNK-ALPAVP-----------DSVIRDNVS---------VGSSCSSD

Query:  SLSSNYSAKLLNPKVKPYAVKPVKAVAAGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRD
         LS   S+       +  +V   + ++ G E +   +    +  RKRC WITP +DP Y+AFHDEEWGVP+HDDKKLFELL LS ALAEL+W  ILS+R 
Subjt:  SLSSNYSAKLLNPKVKPYAVKPVKAVAAGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRD

Query:  VFRKVFNDFDPSSIAKFTENEFATLKVNGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGF
        + R+VF DFDP ++A+  + +        I +L+E K+R+I++N+  V KI  E GS   Y W+FVN KP +++FRY RQVPVKT KAE +SKDL+RRGF
Subjt:  VFRKVFNDFDPSSIAKFTENEFATLKVNGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGF

Query:  RCVGPTVVYSFMQVTGIVNDHLVNCFRYQEC--DANVKDDMKPRVEDLR
        R V PTV+YSFMQ  G+ NDHL+ CFRYQ+C  DA      K + ++ R
Subjt:  RCVGPTVVYSFMQVTGIVNDHLVNCFRYQEC--DANVKDDMKPRVEDLR

AT1G75090.1 DNA glycosylase superfamily protein; e value 2.0e-93; % identity 54.38
Query:  MYVAAKFRSHAKPVLESRAILGPGGNRDRV-----PEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSS-DSLSSNYSAKLLNPKVKPYAVK
        M + +K RS  KP+ ESRAIL   GNR +V      +KP+     T +   K+    P    SV  D+ S  SS S   S+++  S K+  P  K   V+
Subjt:  MYVAAKFRSHAKPVLESRAILGPGGNRDRV-----PEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSS-DSLSSNYSAKLLNPKVKPYAVK

Query:  PVKAVAAGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENE
         +  V A S A     SP+   P KRC WITP SDP+Y+ FHDEEWGVP+ DDKKLFELLV SQALAE +WPSIL +RD FRK+F +FDPS+IA+FTE  
Subjt:  PVKAVAAGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENE

Query:  FATLKVNGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDH
          +L+VNG  +L+E KLRAIVENA  VLK++QEFGSFSNYCW FVN KP+RN +RY RQVPVK+PKAE +SKD+++RGFRCVGPTV+YSF+Q +GIVNDH
Subjt:  FATLKVNGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDH

Query:  LVNCFRYQECDANVKDDMKPRVEDLRLELHN
        L  CFRYQEC+   + + K    + +L+LH+
Subjt:  LVNCFRYQECDANVKDDMKPRVEDLRLELHN

AT1G80850.1 DNA glycosylase superfamily protein; e value 9.3e-75; % identity 49.01
Query:  ESRAILGPGGNR------DRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKL--LNPKVKPYAVKPVKAVAAGSEADA
        E R++LGP GN+       +  +KP  +    LT TEK  +  P  P  + R+ +S+ +S SSD+ SS  S+ L   +       ++   +V++ S    
Subjt:  ESRAILGPGGNR------DRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKL--LNPKVKPYAVKPVKAVAAGSEADA

Query:  TTTSPRHSVP-------RKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV
          T  R           RKRC WITP SD  YIAFHDEEWGVP+HDDK+LFELL LS ALAEL+W  ILSKR +FR+VF DFDP +I++ T  +  + ++
Subjt:  TTTSPRHSVP-------RKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKV

Query:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
            +L+E KLR+I+ENANQV KI   FGSF  Y W+FVN+KP +++FRY RQVPVKT KAE +SKDL+RRGFR V PTV+YSFMQ  G+ NDHL  CFR
Subjt:  NGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQEC
        + +C
Subjt:  YQEC

AT5G57970.1 DNA glycosylase superfamily protein; e value 3.0e-73; % identity 46.2
Query:  AAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVP-----DSVIRDNVSVGSSCSSDSLSSNYSAKL----------LNPKV
        A+ F +H       R +      R    EK      T    +  Q   L A       +  +  N+S+ +S SSD+   ++ ++           +  + 
Subjt:  AAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVP-----DSVIRDNVSVGSSCSSDSLSSNYSAKL----------LNPKV

Query:  KPYAVKPVKAVAAGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIA
        K Y  KP   V+ G    A  + P  S  +KRC W+TP SDP YI FHDEEWGVP+HDDK+LFELLVLS ALAE TWP+ILSKR  FR+VF DFDP++I 
Subjt:  KPYAVKPVKAVAAGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIA

Query:  KFTENEFATLKVNGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVT
        K  E +          +L++ KLRA++ENA Q+LK+ +E+GSF  Y WSFV  K I ++FRY RQVP KTPKAE +SKDL+RRGFR VGPTVVYSFMQ  
Subjt:  KFTENEFATLKVNGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVT

Query:  GIVNDHLVNCFRYQEC
        GI NDHL +CFR+  C
Subjt:  GIVNDHLVNCFRYQEC

AT5G57970.2 DNA glycosylase superfamily protein; e value 3.0e-73; % identity 46.2
Query:  AAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVP-----DSVIRDNVSVGSSCSSDSLSSNYSAKL----------LNPKV
        A+ F +H       R +      R    EK      T    +  Q   L A       +  +  N+S+ +S SSD+   ++ ++           +  + 
Subjt:  AAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVP-----DSVIRDNVSVGSSCSSDSLSSNYSAKL----------LNPKV

Query:  KPYAVKPVKAVAAGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIA
        K Y  KP   V+ G    A  + P  S  +KRC W+TP SDP YI FHDEEWGVP+HDDK+LFELLVLS ALAE TWP+ILSKR  FR+VF DFDP++I 
Subjt:  KPYAVKPVKAVAAGSEADATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIA

Query:  KFTENEFATLKVNGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVT
        K  E +          +L++ KLRA++ENA Q+LK+ +E+GSF  Y WSFV  K I ++FRY RQVP KTPKAE +SKDL+RRGFR VGPTVVYSFMQ  
Subjt:  KFTENEFATLKVNGIQVLTEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVT

Query:  GIVNDHLVNCFRYQEC
        GI NDHL +CFR+  C
Subjt:  GIVNDHLVNCFRYQEC


Sequences
CDS sequence
ATGTATGTGGCTGCGAAGTTCCGATCGCACGCTAAGCCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGTCCCTGAGAAGCCGAGA
TGCAAACACGAGACGACGTTGACGAAGACGGAGAAGCAGAACAAGGCACTTCCGGCGGTTCCGGATTCGGTTATTCGGGATAATGTCTCCGTCGGTAGTTCATGT
TCTTCCGATTCTTTATCAAGCAACTATTCGGCCAAATTGTTGAATCCGAAAGTGAAGCCCTACGCCGTGAAACCTGTAAAGGCTGTTGCTGCCGGCAGTGAGGCA
GACGCAACCACAACGTCCCCCAGGCACTCCGTTCCGCGTAAACGGTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTTCATGACGAAGAATGG
GGAGTCCCAATTCATGATGACAAGAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAACTTACTTGGCCCTCGATTCTTAGCAAGAGAGACGTATTT
AGGAAAGTTTTTAATGATTTTGACCCATCTTCCATCGCTAAGTTCACAGAGAATGAGTTTGCAACACTAAAAGTTAATGGCATCCAGGTTCTGACTGAACCAAAG
CTTCGTGCAATCGTGGAGAATGCTAATCAAGTACTCAAGATTCAACAGGAATTTGGTTCCTTTAGCAACTACTGTTGGAGCTTTGTTAACAAGAAGCCTATACGA
AACAGATTTCGATATGCTCGTCAAGTACCCGTAAAGACGCCGAAAGCAGAGTCCATGAGCAAGGATCTGATCCGGAGAGGGTTCCGATGTGTCGGGCCAACCGTG
GTTTATTCCTTCATGCAGGTTACCGGAATTGTCAACGATCACTTGGTGAATTGCTTCAGATATCAAGAATGTGATGCAAATGTAAAAGACGATATGAAACCAAGA
GTAGAAGATCTGAGGTTGGAGTTGCATAACGGAGCTTCGGAGAAGCCTTGCTTGTCAAGATCTTGA
mRNA sequence
ATGTATGTGGCTGCGAAGTTCCGATCGCACGCTAAGCCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGTCCCTGAGAAGCCGAGA
TGCAAACACGAGACGACGTTGACGAAGACGGAGAAGCAGAACAAGGCACTTCCGGCGGTTCCGGATTCGGTTATTCGGGATAATGTCTCCGTCGGTAGTTCATGT
TCTTCCGATTCTTTATCAAGCAACTATTCGGCCAAATTGTTGAATCCGAAAGTGAAGCCCTACGCCGTGAAACCTGTAAAGGCTGTTGCTGCCGGCAGTGAGGCA
GACGCAACCACAACGTCCCCCAGGCACTCCGTTCCGCGTAAACGGTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTTCATGACGAAGAATGG
GGAGTCCCAATTCATGATGACAAGAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAACTTACTTGGCCCTCGATTCTTAGCAAGAGAGACGTATTT
AGGAAAGTTTTTAATGATTTTGACCCATCTTCCATCGCTAAGTTCACAGAGAATGAGTTTGCAACACTAAAAGTTAATGGCATCCAGGTTCTGACTGAACCAAAG
CTTCGTGCAATCGTGGAGAATGCTAATCAAGTACTCAAGATTCAACAGGAATTTGGTTCCTTTAGCAACTACTGTTGGAGCTTTGTTAACAAGAAGCCTATACGA
AACAGATTTCGATATGCTCGTCAAGTACCCGTAAAGACGCCGAAAGCAGAGTCCATGAGCAAGGATCTGATCCGGAGAGGGTTCCGATGTGTCGGGCCAACCGTG
GTTTATTCCTTCATGCAGGTTACCGGAATTGTCAACGATCACTTGGTGAATTGCTTCAGATATCAAGAATGTGATGCAAATGTAAAAGACGATATGAAACCAAGA
GTAGAAGATCTGAGGTTGGAGTTGCATAACGGAGCTTCGGAGAAGCCTTGCTTGTCAAGATCTTGA
Protein sequence
MYVAAKFRSHAKPVLESRAILGPGGNRDRVPEKPRCKHETTLTKTEKQNKALPAVPDSVIRDNVSVGSSCSSDSLSSNYSAKLLNPKVKPYAVKPVKAVAAGSEA
DATTTSPRHSVPRKRCDWITPYSDPLYIAFHDEEWGVPIHDDKKLFELLVLSQALAELTWPSILSKRDVFRKVFNDFDPSSIAKFTENEFATLKVNGIQVLTEPK
LRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAESMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQECDANVKDDMKPR
VEDLRLELHNGASEKPCLSRS
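
The CDS and protein sequences listed here can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, the use of the standard nuclear code is an assumption, and only the first 33 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragments copied from the CDS above (first 11 codons; last 3 codons plus stop).
first_codons = "ATGTATGTGGCTGCGAAGTTCCGATCGCACGCT"
last_codons = "TCAAGATCTTGA"

print(translate(first_codons))  # MYVAAKFRSHA
print(translate(last_codons))   # SRS
```

Both fragments agree with the start (MYVAAKFRSHA) and the end (SRS) of the protein sequence listed above. Passing the full CDS to the same function would check the whole sequence.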