; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027644 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027644
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationtig00153055:1427849..1430125
RNA-Seq ExpressionSgr027644
SyntenySgr027644
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591330.1 hypothetical protein SDJN03_13676, partial [Cucurbita argyrosperma subsp. sororia]1.7e-14785.48Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKL SHAK VLESRAILGPGGNRDR PEKPKCK ETL  +EKQNKA P I E VIRDN+S+GSS SSDS  SNYS KLLNPKVK   VKPVKAVAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G D N TTT+PR +VP KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIFRKVFNDFDPS+IAQFT+NEFTTLK  
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPV+TPKAE MSKDL+RRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

Query:  QESNANIKDD
        QE +   KDD
Subjt:  QESNANIKDD

XP_022141169.1 uncharacterized protein LOC111011623 [Momordica charantia]1.0e-16387.5Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHE-TLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVA
        M VA K +SHAK VLESRAILGPGGNRDRVPEKP+CKHE TL KTEKQNKA P + + VIRDNVSVGSS SSDS  SNYSAKLLNPKVK  AVKPVKAVA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHE-TLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVA

Query:  AGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKV
        AG +A+ATTTSPRH+VPRKRCDWITPYSDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWPSILSKRD+FRKVFNDFDPSSIA+FTENEF TLKV
Subjt:  AGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKV

Query:  GGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
         GIQ+L+EPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPV+TPKAE MSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
Subjt:  GGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQESNANIKDDMKPRVEE-RSESLTGAFEKPCLSRS
        YQE +AN+KDDMKPRVE+ R E   GA EKPCLSRS
Subjt:  YQESNANIKDDMKPRVEE-RSESLTGAFEKPCLSRS

XP_022935907.1 uncharacterized protein LOC111442674 [Cucurbita moschata]6.3e-15083.84Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKL SHAK VLESRAILGPGGNRDR PEKPKCK ETL  +EKQNKA P I E V+RDN+S+GSS SSDS  SNYS KLLNPKVK   VKPVKAVAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G D N TTT+PR +VP KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIFRKVFNDFDPS+IAQFT+NEFTTLK  
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPV+TPKAE MSKDL+RRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVE-ERSESLTGAFE
        QE      D MK RVE +RSE LTGA E
Subjt:  QESNANIKDDMKPRVE-ERSESLTGAFE

XP_023535246.1 uncharacterized protein LOC111796735 [Cucurbita pepo subsp. pepo]2.2e-14782.93Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKL SHAK VLESR ILGPGGNRDR PEKPKCK ETL  +EKQNKA P I E VIRDN S+GSS SSDS LS+YS KLLNPKVK   VKPVKAVAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G D N T+T+P  +VP KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIFRKVFNDFDPS+IA+FT+NEFTTLK  
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPV+TPKAE MSKDL+RRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVE-ERSESLTGAFE
        QE      D MK RVE +RSE LTGA E
Subjt:  QESNANIKDDMKPRVE-ERSESLTGAFE

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida]4.1e-15785.93Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKLQSHAK VLESR ILGPGGNRDR PEKPKCK +TL KTEKQN+A P+I E VIRDNVSVGSS SSDS  SNYSAKLL PKVK +AVKPVKAVAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G D NAT  SP  ++P KRCDWIT +SDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKRDIFRKV NDFDPS+IAQFTENEFTTLKV 
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
         IQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRN FRY RQVPV+TPKAE MSKDLIRRGFRCVGPTVVYSFMQV GIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVEE-RSESLTGAFEKPCLSR
        QE +A IKDD K RVE+ RSESLTGA EKPCL+R
Subjt:  QESNANIKDDMKPRVEE-RSESLTGAFEKPCLSR

TrEMBL top hitse value%identityAlignment
A0A6J1CHU1 uncharacterized protein LOC1110116234.9e-16487.5Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHE-TLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVA
        M VA K +SHAK VLESRAILGPGGNRDRVPEKP+CKHE TL KTEKQNKA P + + VIRDNVSVGSS SSDS  SNYSAKLLNPKVK  AVKPVKAVA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHE-TLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVA

Query:  AGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKV
        AG +A+ATTTSPRH+VPRKRCDWITPYSDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWPSILSKRD+FRKVFNDFDPSSIA+FTENEF TLKV
Subjt:  AGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKV

Query:  GGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
         GIQ+L+EPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPV+TPKAE MSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR
Subjt:  GGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFR

Query:  YQESNANIKDDMKPRVEE-RSESLTGAFEKPCLSRS
        YQE +AN+KDDMKPRVE+ R E   GA EKPCLSRS
Subjt:  YQESNANIKDDMKPRVEE-RSESLTGAFEKPCLSRS

A0A6J1F6S0 uncharacterized protein LOC1114426743.1e-15083.84Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKL SHAK VLESRAILGPGGNRDR PEKPKCK ETL  +EKQNKA P I E V+RDN+S+GSS SSDS  SNYS KLLNPKVK   VKPVKAVAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G D N TTT+PR +VP KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIFRKVFNDFDPS+IAQFT+NEFTTLK  
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPV+TPKAE MSKDL+RRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVE-ERSESLTGAFE
        QE      D MK RVE +RSE LTGA E
Subjt:  QESNANIKDDMKPRVE-ERSESLTGAFE

A0A6J1FIT9 uncharacterized protein LOC1114461259.2e-14781.68Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKLQSHA+ VLESRAILGPGGNRDR PEKPKCK E L +T KQNKA PV+ E V+RDNVSVGSS SSDS  SNYSAKLLN K K    KPVK VAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G DANATTTSP  +V  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IFRKVFNDFDPSSIA FTE EFTTLKV 
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
          QLLS+ KLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPV+TPKAE MSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVE-ERSESLTGAFEKPCLS
        QE +A++KDDMK RVE  RSE L  A EK  L+
Subjt:  QESNANIKDDMKPRVE-ERSESLTGAFEKPCLS

A0A6J1IHE9 uncharacterized protein LOC1114769751.2e-14682.62Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKL SHAK VLESRAILGPGGNRDR PEKPKCK ETL  +EKQNKA P I E VIRDN+S+GSS SSDS  SN SAKLLNPK     VKPVKAVAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G D N TTT+PR +VP KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIFRKVFNDFDPS+IAQFT+NEFTTLK  
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQ+PV+TPKAE MSKDL+RRGFRCVGPTVVYSFMQVTGIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVEER-SESLTGAFE
        QE      D MK RVE++ SE LTGA E
Subjt:  QESNANIKDDMKPRVEER-SESLTGAFE

A0A6J1J188 uncharacterized protein LOC1114804122.1e-14681.38Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKLQSHA+ VLESRAILGPGGNRDR PEKPKCK E L +T KQNKA PV+ E V+RDN+SVGSS SSDS  SNYSAKLLN K K    KPVK VAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G DANATTTSP   V  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IFR VFNDFDPSSIAQFTE EFTTLKV 
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
          QLLS+ KLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPV+TPKAE MSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVE-ERSESLTGAFEKPCLS
        QE +A++KDDMK RVE  RSE L  A EK  L+
Subjt:  QESNANIKDDMKPRVE-ERSESLTGAFEKPCLS

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 12.0e-3740.98Show/hide
Query:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENA
        +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W ++L KR+ +R  F+ FDP  +A   E +   L      +    K++AI+ NA
Subjt:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENA

Query:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
           L+++Q    F ++ WSFVN +P   +     ++P  T  ++ +SK L +RGF+ VG T+ YSFMQ  G+VNDH+V C  Y
Subjt:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

P44321 DNA-3-methyladenine glycosylase5.9e-3440.78Show/hide
Query:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENAN
        RC W+   S  +YI +HD+EWG P  D +KLFE + L    A L+W ++L KR+ +R+ F+ FDP  IA+ T  +          +    KL AIV+NA 
Subjt:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENAN

Query:  QVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNC
          L +++   +FS++ WSFVN KPI N     R VP +T  ++ +SK L +RGF  +G T  Y+FMQ  G+V+DHL +C
Subjt:  QVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]2.3e-4141.75Show/hide
Query:  GIDANATTTSPRHAVPRKRCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTL
        G++A  +    R  V   RC W T   +    LY  +HD EWG P+H+DKKLFE LVL    A L+W +IL KR+ FR  F+DFDP  +A + E++   L
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTL

Query:  KVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNC
              + +  K+ A + NA   + +Q+EFGSF  Y W FV  KPI N F     +P  TP ++ ++KDL +RGF+ VG T +Y+ MQ  G+VNDHL +C
Subjt:  KVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNC

Query:  FRYQES
        F+   S
Subjt:  FRYQES

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein2.8e-7143.68Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNR-DRVP-----EKP------------KCKHETLPKTEKQ--NKAFPVIQELVIRDNVSVGSSYSSDSALSNYSA
        MSV  + +S      E R++LGP GN+  R P     EKP            K K  T P + +    +   +   ++ +++ S+ +SYSSD++ S  S+
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNR-DRVP-----EKP------------KCKHETLPKTEKQ--NKAFPVIQELVIRDNVSVGSSYSSDSALSNYSA

Query:  KL------LNPKV--KSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRD
         L         KV  +S +V   + ++ G +     +    A  RKRC WITP +DP Y+AFHDEEWGVPVHDDKKLFELL LS ALAEL+W  ILS+R 
Subjt:  KL------LNPKV--KSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRD

Query:  IFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGF
        I R+VF DFDP ++A+  + + T      I LLSE K+R+I++N+  V KI  E GS   Y W+FVN KP +++FRY RQVPV+T KAE +SKDL+RRGF
Subjt:  IFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGF

Query:  RCVGPTVVYSFMQVTGIVNDHLVNCFRYQESNANIKDDMKPRVEERSE
        R V PTV+YSFMQ  G+ NDHL+ CFRYQ+   + +     + ++++E
Subjt:  RCVGPTVVYSFMQVTGIVNDHLVNCFRYQESNANIKDDMKPRVEERSE

AT1G75090.1 DNA glycosylase superfamily protein1.1e-9154.94Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETL-------PKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVK
        MS+ +KL+S  K + ESRAIL   GNR +V +    K   L       P T+K +  F V  +    D+ S  SS    S  +  S K+  P  K N V+
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETL-------PKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVK

Query:  PVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENE
         +  V A + A     SP+   P KRC WITP SDP+Y+ FHDEEWGVPV DDKKLFELLV SQALAE +WPSIL +RD FRK+F +FDPS+IAQFTE  
Subjt:  PVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENE

Query:  FTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDH
          +L+V G  +LSE KLRAIVENA  VLK++QEFGSFSNYCW FVN KP+RN +RY RQVPV++PKAE +SKD+++RGFRCVGPTV+YSF+Q +GIVNDH
Subjt:  FTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDH

Query:  LVNCFRYQESNANIKDDMKPRVEE
        L  CFRYQE N   + + K    E
Subjt:  LVNCFRYQESNANIKDDMKPRVEE

AT1G80850.1 DNA glycosylase superfamily protein4.9e-7648.43Show/hide
Query:  MSVATKLQSHAKLVLESRAILGPGGNR------DRVPEKPKC-KHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKS--NA
        MS   +++S      E R++LGP GN+       +  +KP   K + L  TEK  +  P+   ++ R+ +S+ +SYSSD++ S  S+ L      S    
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNR------DRVPEKPKC-KHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKS--NA

Query:  VKPVKAVAAGIDANATTTSPRHAVP-------RKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPS
        ++   +V++        T  R           RKRC WITP SD  YIAFHDEEWGVPVHDDK+LFELL LS ALAEL+W  ILSKR +FR+VF DFDP 
Subjt:  VKPVKAVAAGIDANATTTSPRHAVP-------RKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPS

Query:  SIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFM
        +I++ T  + T+ ++    LLSE KLR+I+ENANQV KI   FGSF  Y W+FVN+KP +++FRY RQVPV+T KAEL+SKDL+RRGFR V PTV+YSFM
Subjt:  SIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFM

Query:  QVTGIVNDHLVNCFRYQE
        Q  G+ NDHL  CFR+ +
Subjt:  QVTGIVNDHLVNCFRYQE

AT5G57970.1 DNA glycosylase superfamily protein3.9e-7353.17Show/hide
Query:  IRDNVSVGSSYSSDSALSNYSAKL----------LNPKVKSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKK
        +  N+S+ +S+SSD+++ ++ ++           +  + KS   KP   V+ G    A  + P  +  +KRC W+TP SDP YI FHDEEWGVPVHDDK+
Subjt:  IRDNVSVGSSYSSDSALSNYSAKL----------LNPKVKSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKK

Query:  LFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFR
        LFELLVLS ALAE TWP+ILSKR  FR+VF DFDP++I +  E +          LLS+ KLRA++ENA Q+LK+ +E+GSF  Y WSFV  K I ++FR
Subjt:  LFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFR

Query:  YARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
        Y RQVP +TPKAE++SKDL+RRGFR VGPTVVYSFMQ  GI NDHL +CFR+
Subjt:  YARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY

AT5G57970.2 DNA glycosylase superfamily protein3.9e-7353.17Show/hide
Query:  IRDNVSVGSSYSSDSALSNYSAKL----------LNPKVKSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKK
        +  N+S+ +S+SSD+++ ++ ++           +  + KS   KP   V+ G    A  + P  +  +KRC W+TP SDP YI FHDEEWGVPVHDDK+
Subjt:  IRDNVSVGSSYSSDSALSNYSAKL----------LNPKVKSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKK

Query:  LFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFR
        LFELLVLS ALAE TWP+ILSKR  FR+VF DFDP++I +  E +          LLS+ KLRA++ENA Q+LK+ +E+GSF  Y WSFV  K I ++FR
Subjt:  LFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFR

Query:  YARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY
        Y RQVP +TPKAE++SKDL+RRGFR VGPTVVYSFMQ  GI NDHL +CFR+
Subjt:  YARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGTGGCTACGAAGCTTCAATCACACGCTAAACTGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGTGCCTGAGAAGCCGAAATGCAA
ACACGAGACCTTGCCGAAGACAGAGAAGCAAAATAAGGCATTTCCGGTGATTCAGGAATTGGTTATTCGGGATAATGTCTCCGTCGGGAGCTCCTACTCTTCCGATTCTG
CATTAAGCAACTATTCGGCCAAATTGCTGAATCCAAAAGTGAAATCCAACGCCGTGAAACCTGTAAAGGCTGTTGCTGCAGGCATTGACGCAAACGCAACCACAACGTCC
CCTAGGCACGCGGTTCCGCGGAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTTCACGACGAAGAATGGGGAGTCCCAGTTCATGACGACAA
GAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTCGATTCTTAGCAAGAGAGACATATTTAGGAAAGTTTTTAATGATTTTGACCCAT
CTTCCATCGCACAGTTCACAGAGAATGAGTTTACGACACTAAAAGTAGGTGGCATCCAGCTCCTGTCTGAACCTAAGCTTCGTGCAATCGTGGAGAACGCTAATCAAGTA
CTCAAGATTCAGCAGGAATTCGGTTCCTTTAGCAACTACTGTTGGAGCTTTGTTAACAAGAAGCCTATAAGAAACAGATTTCGATACGCTCGTCAAGTACCTGTAAGGAC
GCCGAAAGCAGAGCTCATGAGCAAGGACTTGATCAGGAGAGGGTTTCGTTGTGTCGGGCCAACCGTGGTTTATTCCTTCATGCAGGTCACTGGAATTGTTAACGATCACT
TGGTGAATTGCTTCAGATATCAAGAAAGCAATGCAAACATAAAAGATGATATGAAACCAAGAGTAGAAGAGAGGTCGGAGTCGCTAACCGGAGCTTTCGAGAAGCCTTGC
TTGTCTAGATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGTGGCTACGAAGCTTCAATCACACGCTAAACTGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGTGCCTGAGAAGCCGAAATGCAA
ACACGAGACCTTGCCGAAGACAGAGAAGCAAAATAAGGCATTTCCGGTGATTCAGGAATTGGTTATTCGGGATAATGTCTCCGTCGGGAGCTCCTACTCTTCCGATTCTG
CATTAAGCAACTATTCGGCCAAATTGCTGAATCCAAAAGTGAAATCCAACGCCGTGAAACCTGTAAAGGCTGTTGCTGCAGGCATTGACGCAAACGCAACCACAACGTCC
CCTAGGCACGCGGTTCCGCGGAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTTCACGACGAAGAATGGGGAGTCCCAGTTCATGACGACAA
GAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTCGATTCTTAGCAAGAGAGACATATTTAGGAAAGTTTTTAATGATTTTGACCCAT
CTTCCATCGCACAGTTCACAGAGAATGAGTTTACGACACTAAAAGTAGGTGGCATCCAGCTCCTGTCTGAACCTAAGCTTCGTGCAATCGTGGAGAACGCTAATCAAGTA
CTCAAGATTCAGCAGGAATTCGGTTCCTTTAGCAACTACTGTTGGAGCTTTGTTAACAAGAAGCCTATAAGAAACAGATTTCGATACGCTCGTCAAGTACCTGTAAGGAC
GCCGAAAGCAGAGCTCATGAGCAAGGACTTGATCAGGAGAGGGTTTCGTTGTGTCGGGCCAACCGTGGTTTATTCCTTCATGCAGGTCACTGGAATTGTTAACGATCACT
TGGTGAATTGCTTCAGATATCAAGAAAGCAATGCAAACATAAAAGATGATATGAAACCAAGAGTAGAAGAGAGGTCGGAGTCGCTAACCGGAGCTTTCGAGAAGCCTTGC
TTGTCTAGATCTTGA
Protein sequenceShow/hide protein sequence
MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSYSSDSALSNYSAKLLNPKVKSNAVKPVKAVAAGIDANATTTS
PRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQV
LKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVRTPKAELMSKDLIRRGFRCVGPTVVYSFMQVTGIVNDHLVNCFRYQESNANIKDDMKPRVEERSESLTGAFEKPC
LSRS