CuGenDBv2

Sgr000501 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr000501
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: DNA glycosylase superfamily protein
Genome location: tig00000246:36543..38825
RNA-Seq Expression: Sgr000501
Synteny: Sgr000501
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
IPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase
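
The genome location in this record is written as `contig:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the location can be parsed and the gene span computed as follows. `parse_location` is a hypothetical helper name used for illustration and is not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00000246:36543..38825")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene spans 2283 bp on contig tig00000246.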


Homology
GenBank top hits (e value, % identity, alignment)
XP_022141169.1 uncharacterized protein LOC111011623 [Momordica charantia] | e value: 8.5e-163 | identity: 87.2%
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHE-TLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVA
        M VA K +SHAK VLESRAILGPGGNRDRVPEKP+CKHE TL KTEKQNKA P + + VIRDNVSVGSSCSSDS  SNYSAKLLNPKVK  AVKPVKAVA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHE-TLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVA

Query:  AGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKV
        AG +A+ATTTSPRH+VPRKRCDWITPYSDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWPSILSKRD+FRKVFNDFDPSSIA+FTENEF TLKV
Subjt:  AGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKV

Query:  GGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFR
         GIQ+L+EPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAE MSK+LIRRGFRCVGPTVVYSFMQV GIVNDHLVNCFR
Subjt:  GGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFR

Query:  YQESNANIKDDMKPRVEE-RSESLTGALEKPYLSRS
        YQE +AN+KDDMKPRVE+ R E   GA EKP LSRS
Subjt:  YQESNANIKDDMKPRVEE-RSESLTGALEKPYLSRS

XP_022935907.1 uncharacterized protein LOC111442674 [Cucurbita moschata] | e value: 9.7e-151 | identity: 84.15%
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKL SHAK VLESRAILGPGGNRDR PEKPKCK ETL  +EKQNKA P I E V+RDN+S+GSSCSSDS  SNYS KLLNPKVK   VKPVKAVAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G D N TTT+PR +VP KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIFRKVFNDFDPS+IAQFT+NEFTTLK  
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPVKTPKAE MSK+L+RRGFRCVGPTVVYSFMQV GIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVE-ERSESLTGALE
        QE      D MK RVE +RSE LTGALE
Subjt:  QESNANIKDDMKPRVE-ERSESLTGALE

XP_023523621.1 uncharacterized protein LOC111787800 [Cucurbita pepo subsp. pepo] | e value: 2.0e-148 | identity: 82.58%
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKLQSHA+ VLESRAILGPGGNRDR PEKPKCK E L +T KQNKA PV+ E V+RDNVSVGSSCSSDS  SNYSAKLLN K K    KPVK VAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G DANATTTSP  +V  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IFRKVFNDFDPSSIAQFTE EFTTLKV 
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
          QLLS+ KLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPKAE MSK+L++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVE-ERSESLTGALEKPYLS
        QE +A++KDDMK RVE  RSE L  ALEK  L+
Subjt:  QESNANIKDDMKPRVE-ERSESLTGALEKPYLS

XP_023535246.1 uncharacterized protein LOC111796735 [Cucurbita pepo subsp. pepo] | e value: 3.5e-148 | identity: 83.23%
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKL SHAK VLESR ILGPGGNRDR PEKPKCK ETL  +EKQNKA P I E VIRDN S+GSSCSSDS LS+YS KLLNPKVK   VKPVKAVAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G D N T+T+P  +VP KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIFRKVFNDFDPS+IA+FT+NEFTTLK  
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPVKTPKAE MSK+L+RRGFRCVGPTVVYSFMQV GIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVE-ERSESLTGALE
        QE      D MK RVE +RSE LTGALE
Subjt:  QESNANIKDDMKPRVE-ERSESLTGALE

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida] | e value: 1.1e-157 | identity: 86.53%
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKLQSHAK VLESR ILGPGGNRDR PEKPKCK +TL KTEKQN+A P+I E VIRDNVSVGSSCSSDS  SNYSAKLL PKVK +AVKPVKAVAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G D NAT  SP  ++P KRCDWIT +SDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKRDIFRKV NDFDPS+IAQFTENEFTTLKV 
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
         IQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRN FRY RQVPVKTPKAE MSK+LIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVEE-RSESLTGALEKPYLSR
        QE +A IKDD K RVE+ RSESLTGALEKP L+R
Subjt:  QESNANIKDDMKPRVEE-RSESLTGALEKPYLSR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CHU1 uncharacterized protein LOC111011623 | e value: 4.1e-163 | identity: 87.2%
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHE-TLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVA
        M VA K +SHAK VLESRAILGPGGNRDRVPEKP+CKHE TL KTEKQNKA P + + VIRDNVSVGSSCSSDS  SNYSAKLLNPKVK  AVKPVKAVA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHE-TLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVA

Query:  AGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKV
        AG +A+ATTTSPRH+VPRKRCDWITPYSDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWPSILSKRD+FRKVFNDFDPSSIA+FTENEF TLKV
Subjt:  AGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKV

Query:  GGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFR
         GIQ+L+EPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAE MSK+LIRRGFRCVGPTVVYSFMQV GIVNDHLVNCFR
Subjt:  GGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFR

Query:  YQESNANIKDDMKPRVEE-RSESLTGALEKPYLSRS
        YQE +AN+KDDMKPRVE+ R E   GA EKP LSRS
Subjt:  YQESNANIKDDMKPRVEE-RSESLTGALEKPYLSRS

A0A6J1F6S0 uncharacterized protein LOC111442674 | e value: 4.7e-151 | identity: 84.15%
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKL SHAK VLESRAILGPGGNRDR PEKPKCK ETL  +EKQNKA P I E V+RDN+S+GSSCSSDS  SNYS KLLNPKVK   VKPVKAVAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G D N TTT+PR +VP KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIFRKVFNDFDPS+IAQFT+NEFTTLK  
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQVPVKTPKAE MSK+L+RRGFRCVGPTVVYSFMQV GIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVE-ERSESLTGALE
        QE      D MK RVE +RSE LTGALE
Subjt:  QESNANIKDDMKPRVE-ERSESLTGALE

A0A6J1FIT9 uncharacterized protein LOC111446125 | e value: 3.7e-148 | identity: 82.28%
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKLQSHA+ VLESRAILGPGGNRDR PEKPKCK E L +T KQNKA PV+ E V+RDNVSVGSSCSSDS  SNYSAKLLN K K    KPVK VAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G DANATTTSP  +V  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IFRKVFNDFDPSSIA FTE EFTTLKV 
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
          QLLS+ KLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPKAE MSK+L++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVE-ERSESLTGALEKPYLS
        QE +A++KDDMK RVE  RSE L  ALEK  L+
Subjt:  QESNANIKDDMKPRVE-ERSESLTGALEKPYLS

A0A6J1IHE9 uncharacterized protein LOC111476975 | e value: 1.9e-147 | identity: 82.93%
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKL SHAK VLESRAILGPGGNRDR PEKPKCK ETL  +EKQNKA P I E VIRDN+S+GSSCSSDS  SN SAKLLNPK     VKPVKAVAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G D N TTT+PR +VP KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIFRKVFNDFDPS+IAQFT+NEFTTLK  
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVLKIQQEFG+FSNYCWSFVNKKPI NRFRYARQ+PVKTPKAE MSK+L+RRGFRCVGPTVVYSFMQV GIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVEER-SESLTGALE
        QE      D MK RVE++ SE LTGALE
Subjt:  QESNANIKDDMKPRVEER-SESLTGALE

A0A6J1J188 uncharacterized protein LOC111480412 | e value: 8.3e-148 | identity: 81.98%
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA
        MSVATKLQSHA+ VLESRAILGPGGNRDR PEKPKCK E L +T KQNKA PV+ E V+RDN+SVGSSCSSDS  SNYSAKLLN K K    KPVK VAA
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAA

Query:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG
        G DANATTTSP   V  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IFR VFNDFDPSSIAQFTE EFTTLKV 
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVG

Query:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
          QLLS+ KLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNR+RY RQVPVKTPKAE MSK+L++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QESNANIKDDMKPRVE-ERSESLTGALEKPYLS
        QE +A++KDDMK RVE  RSE L  ALEK  L+
Subjt:  QESNANIKDDMKPRVE-ERSESLTGALEKPYLS

SwissProt top hits (e value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 | e value: 1.2e-37 | identity: 40.98%
Query:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENA
        +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W ++L KR+ +R  F+ FDP  +A   E +   L      +    K++AI+ NA
Subjt:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENA

Query:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
           L+++Q    F ++ WSFVN +P   +     ++P  T  ++ +SK L +RGF+ VG T+ YSFMQ  G+VNDH+V C  Y
Subjt:  NQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

P44321 DNA-3-methyladenine glycosylase | e value: 2.7e-34 | identity: 41.34%
Query:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENAN
        RC W+   S  +YI +HD+EWG P  D +KLFE + L    A L+W ++L KR+ +R+ F+ FDP  IA+ T  +          +    KL AIV+NA 
Subjt:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENAN

Query:  QVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNC
          L +++   +FS++ WSFVN KPI N     R VP KT  ++ +SK L +RGF  +G T  Y+FMQ  G+V+DHL +C
Subjt:  QVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] | e value: 6.6e-41 | identity: 41.26%
Query:  GIDANATTTSPRHAVPRKRCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTL
        G++A  +    R  V   RC W T   +    LY  +HD EWG P+H+DKKLFE LVL    A L+W +IL KR+ FR  F+DFDP  +A + E++   L
Subjt:  GIDANATTTSPRHAVPRKRCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTL

Query:  KVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNC
              + +  K+ A + NA   + +Q+EFGSF  Y W FV  KPI N F     +P  TP ++ ++K+L +RGF+ VG T +Y+ MQ  G+VNDHL +C
Subjt:  KVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNC

Query:  FRYQES
        F+   S
Subjt:  FRYQES

Arabidopsis top hits (e value, % identity, alignment)
AT1G15970.1 DNA glycosylase superfamily protein | e value: 1.8e-70 | identity: 43.68%
Query:  MSVATKLQSHAKLVLESRAILGPGGNR-DRVP-----EKP------------KCKHETLPKTEKQ--NKAFPVIQELVIRDNVSVGSSCSSDSALSNYSA
        MSV  + +S      E R++LGP GN+  R P     EKP            K K  T P + +    +   +   ++ +++ S+ +S SSD++ S  S+
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNR-DRVP-----EKP------------KCKHETLPKTEKQ--NKAFPVIQELVIRDNVSVGSSCSSDSALSNYSA

Query:  KL------LNPKV--KSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRD
         L         KV  +S +V   + ++ G +     +    A  RKRC WITP +DP Y+AFHDEEWGVPVHDDKKLFELL LS ALAEL+W  ILS+R 
Subjt:  KL------LNPKV--KSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRD

Query:  IFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGF
        I R+VF DFDP ++A+  + + T      I LLSE K+R+I++N+  V KI  E GS   Y W+FVN KP +++FRY RQVPVKT KAE +SK+L+RRGF
Subjt:  IFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGF

Query:  RCVGPTVVYSFMQVAGIVNDHLVNCFRYQESNANIKDDMKPRVEERSE
        R V PTV+YSFMQ AG+ NDHL+ CFRYQ+   + +     + ++++E
Subjt:  RCVGPTVVYSFMQVAGIVNDHLVNCFRYQESNANIKDDMKPRVEERSE

AT1G75090.1 DNA glycosylase superfamily protein | e value: 1.4e-91 | identity: 54.05%
Query:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETL-------PKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVK
        MS+ +KL+S  K + ESRAIL   GNR +V +    K   L       P T+K +  F V  +    D+ S  SS    S  +  S K+  P  K N V+
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETL-------PKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVK

Query:  PVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENE
         +  V A + A     SP+   P KRC WITP SDP+Y+ FHDEEWGVPV DDKKLFELLV SQALAE +WPSIL +RD FRK+F +FDPS+IAQFTE  
Subjt:  PVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENE

Query:  FTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDH
          +L+V G  +LSE KLRAIVENA  VLK++QEFGSFSNYCW FVN KP+RN +RY RQVPVK+PKAE +SK++++RGFRCVGPTV+YSF+Q +GIVNDH
Subjt:  FTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDH

Query:  LVNCFRYQESNANIKDDMKPRVEERSESLTGAL
        L  CFRYQE N   + + K    E    L   L
Subjt:  LVNCFRYQESNANIKDDMKPRVEERSESLTGAL

AT1G80850.1 DNA glycosylase superfamily protein | e value: 2.4e-75 | identity: 48.43%
Query:  MSVATKLQSHAKLVLESRAILGPGGNR------DRVPEKPKC-KHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKS--NA
        MS   +++S      E R++LGP GN+       +  +KP   K + L  TEK  +  P+   ++ R+ +S+ +S SSD++ S  S+ L      S    
Subjt:  MSVATKLQSHAKLVLESRAILGPGGNR------DRVPEKPKC-KHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKS--NA

Query:  VKPVKAVAAGIDANATTTSPRHAVP-------RKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPS
        ++   +V++        T  R           RKRC WITP SD  YIAFHDEEWGVPVHDDK+LFELL LS ALAEL+W  ILSKR +FR+VF DFDP 
Subjt:  VKPVKAVAAGIDANATTTSPRHAVP-------RKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPS

Query:  SIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFM
        +I++ T  + T+ ++    LLSE KLR+I+ENANQV KI   FGSF  Y W+FVN+KP +++FRY RQVPVKT KAEL+SK+L+RRGFR V PTV+YSFM
Subjt:  SIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFM

Query:  QVAGIVNDHLVNCFRYQE
        Q AG+ NDHL  CFR+ +
Subjt:  QVAGIVNDHLVNCFRYQE

AT5G57970.1 DNA glycosylase superfamily protein | e value: 8.7e-73 | identity: 53.57%
Query:  IRDNVSVGSSCSSDSALSNYSAKL----------LNPKVKSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKK
        +  N+S+ +S SSD+++ ++ ++           +  + KS   KP   V+ G    A  + P  +  +KRC W+TP SDP YI FHDEEWGVPVHDDK+
Subjt:  IRDNVSVGSSCSSDSALSNYSAKL----------LNPKVKSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKK

Query:  LFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFR
        LFELLVLS ALAE TWP+ILSKR  FR+VF DFDP++I +  E +          LLS+ KLRA++ENA Q+LK+ +E+GSF  Y WSFV  K I ++FR
Subjt:  LFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFR

Query:  YARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        Y RQVP KTPKAE++SK+L+RRGFR VGPTVVYSFMQ AGI NDHL +CFR+
Subjt:  YARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

AT5G57970.2 DNA glycosylase superfamily protein | e value: 8.7e-73 | identity: 53.57%
Query:  IRDNVSVGSSCSSDSALSNYSAKL----------LNPKVKSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKK
        +  N+S+ +S SSD+++ ++ ++           +  + KS   KP   V+ G    A  + P  +  +KRC W+TP SDP YI FHDEEWGVPVHDDK+
Subjt:  IRDNVSVGSSCSSDSALSNYSAKL----------LNPKVKSNAVKPVKAVAAGIDANATTTSPRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKK

Query:  LFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFR
        LFELLVLS ALAE TWP+ILSKR  FR+VF DFDP++I +  E +          LLS+ KLRA++ENA Q+LK+ +E+GSF  Y WSFV  K I ++FR
Subjt:  LFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQVLKIQQEFGSFSNYCWSFVNKKPIRNRFR

Query:  YARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        Y RQVP KTPKAE++SK+L+RRGFR VGPTVVYSFMQ AGI NDHL +CFR+
Subjt:  YARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY


Sequences
CDS sequence
ATGTCTGTGGCTACGAAGCTTCAATCACACGCTAAACTGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGTGCCTGAGAAGCCGAAATGCAA
ACACGAGACCTTGCCGAAGACAGAGAAGCAAAATAAGGCATTTCCGGTGATTCAGGAATTGGTTATTCGGGATAATGTCTCCGTCGGGAGCTCCTGCTCTTCCGATTCTG
CATTAAGCAACTATTCGGCCAAATTGCTGAATCCAAAAGTGAAATCCAACGCCGTGAAACCTGTAAAGGCTGTTGCTGCAGGCATTGACGCAAACGCAACCACAACGTCC
CCTAGGCACGCGGTTCCGCGGAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTTCACGACGAAGAATGGGGAGTCCCAGTTCATGACGACAA
GAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTCGATTCTTAGCAAGAGAGACATATTTAGGAAAGTTTTTAATGATTTTGACCCAT
CTTCCATCGCACAGTTCACAGAGAATGAGTTTACGACACTAAAAGTAGGTGGGATCCAGCTCCTGTCTGAACCTAAGCTTCGTGCAATCGTGGAGAACGCTAATCAAGTA
CTCAAGATTCAGCAGGAATTCGGTTCCTTTAGCAACTACTGTTGGAGCTTTGTTAACAAGAAGCCTATAAGAAACAGATTTCGATACGCTCGTCAAGTACCCGTAAAGAC
GCCGAAAGCAGAGCTCATGAGCAAGGAGTTGATCAGGAGAGGGTTTCGTTGTGTGGGGCCAACCGTGGTTTATTCCTTCATGCAGGTCGCTGGAATTGTTAACGATCACT
TGGTGAATTGCTTCAGATATCAAGAAAGCAATGCAAACATAAAAGATGATATGAAACCAAGAGTAGAAGAGAGGTCGGAGTCGCTAACCGGAGCTTTGGAGAAGCCTTAC
TTGTCTAGATCTTGA
mRNA sequence
ATGTCTGTGGCTACGAAGCTTCAATCACACGCTAAACTGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGTGCCTGAGAAGCCGAAATGCAA
ACACGAGACCTTGCCGAAGACAGAGAAGCAAAATAAGGCATTTCCGGTGATTCAGGAATTGGTTATTCGGGATAATGTCTCCGTCGGGAGCTCCTGCTCTTCCGATTCTG
CATTAAGCAACTATTCGGCCAAATTGCTGAATCCAAAAGTGAAATCCAACGCCGTGAAACCTGTAAAGGCTGTTGCTGCAGGCATTGACGCAAACGCAACCACAACGTCC
CCTAGGCACGCGGTTCCGCGGAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTTCACGACGAAGAATGGGGAGTCCCAGTTCATGACGACAA
GAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAGCTTACTTGGCCCTCGATTCTTAGCAAGAGAGACATATTTAGGAAAGTTTTTAATGATTTTGACCCAT
CTTCCATCGCACAGTTCACAGAGAATGAGTTTACGACACTAAAAGTAGGTGGGATCCAGCTCCTGTCTGAACCTAAGCTTCGTGCAATCGTGGAGAACGCTAATCAAGTA
CTCAAGATTCAGCAGGAATTCGGTTCCTTTAGCAACTACTGTTGGAGCTTTGTTAACAAGAAGCCTATAAGAAACAGATTTCGATACGCTCGTCAAGTACCCGTAAAGAC
GCCGAAAGCAGAGCTCATGAGCAAGGAGTTGATCAGGAGAGGGTTTCGTTGTGTGGGGCCAACCGTGGTTTATTCCTTCATGCAGGTCGCTGGAATTGTTAACGATCACT
TGGTGAATTGCTTCAGATATCAAGAAAGCAATGCAAACATAAAAGATGATATGAAACCAAGAGTAGAAGAGAGGTCGGAGTCGCTAACCGGAGCTTTGGAGAAGCCTTAC
TTGTCTAGATCTTGA
Protein sequence
MSVATKLQSHAKLVLESRAILGPGGNRDRVPEKPKCKHETLPKTEKQNKAFPVIQELVIRDNVSVGSSCSSDSALSNYSAKLLNPKVKSNAVKPVKAVAAGIDANATTTS
PRHAVPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRKVFNDFDPSSIAQFTENEFTTLKVGGIQLLSEPKLRAIVENANQV
LKIQQEFGSFSNYCWSFVNKKPIRNRFRYARQVPVKTPKAELMSKELIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQESNANIKDDMKPRVEERSESLTGALEKPY
LSRS
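
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly, and translates only the first 36 codons of the CDS to keep the example short. `translate` is a hypothetical helper written for illustration; the same call would apply to the full sequence.

```python
from itertools import product

# Standard genetic code (assumed: NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 108 nt (36 codons) of the Sgr000501 CDS listed in this record.
cds_prefix = (
    "ATGTCTGTGGCTACGAAGCTTCAATCACACGCTAAACTGGTTTTGGAGTCC"
    "CGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGTGCCTGAGAAGCCGAAATGC"
)
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

The translated prefix can then be compared against the start of the protein sequence in this record.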