CuGenDBv2

Sgr019814 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr019814
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: DNA glycosylase superfamily protein
Genome location: tig00153414:529229..532435
RNA-Seq Expression: Sgr019814
Synteny: Sgr019814
Gene Ontology terms: GO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains: IPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase
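The genome location field uses a `contig:start..end` layout. The sketch below shows one way to split such a string into its parts and derive the span length. The names `GenomeLocation` and `parse_location` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A contig name plus start and end coordinates (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'contig:start..end' string into a GenomeLocation."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return GenomeLocation(contig, int(start), int(end))


loc = parse_location("tig00153414:529229..532435")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene span on the contig is 3207 bp.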


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6593364.1 hypothetical protein SDJN03_12840, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.2e-138 | % identity: 76.08
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVA+SDSRPVLGPTGNKAR VETRK GVKPLKKLEKP QEAESKDKR PLSPPQCV +VPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVAA-DIVGCLEPKKRCAWVTSNA-------------------------------------
        NSRASSARGTRQRGPNLRRK  STVKRAEKAVEKVG ESVVAA + VGCLEPKKRCAWVTSN                                      
Subjt:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVAA-DIVGCLEPKKRCAWVTSNA-------------------------------------

Query:  ----AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEV
             KR   +  FL      +   +  +KKMVAPGSAATSLLSE KVRAIIENGRQMCKVIDEF SFNVY+WNFVNHKP ISQFRYPRQVPDKTSKAEV
Subjt:  ----AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEV

Query:  ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL
        ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+ CFRFPECI T EKGERDGD +KPTI EKIP+ALKNLEL
Subjt:  ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL

KAG7025712.1 guaA [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.6e-142 | % identity: 83.53
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVA+SDSRPVLGPTGNKAR VETRK GVKPLKKLEKP QEAESKDKR PLSPPQCV +VPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVAA-DIVGCLEPKKRCAWVTSNAA---------KRVQIKMVFLSCQRCSSFNDEQRQKKM
        NSRASSARGTRQRGPNLRRK  STVKRAEKAVEKVG ESVVAA + VGCLEPKKRCAWVTSN A         KR   +  FL      +   +  +KKM
Subjt:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVAA-DIVGCLEPKKRCAWVTSNAA---------KRVQIKMVFLSCQRCSSFNDEQRQKKM

Query:  VAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL
        VAPGSAATSLLSE KVRAIIENGRQMCKVIDEF SFNVY+WNFVNHKP ISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL
Subjt:  VAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL

Query:  IGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL
        + CFRFPECI T EKGERDGD +KPTI EKIP+ALKNLEL
Subjt:  IGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL

XP_004136097.2 uncharacterized protein LOC101205558 [Cucumis sativus] | e value: 1.2e-137 | % identity: 75.34
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVA+SDSRPVLGPTGNKAR VETRKPGVKPLKKLEKP+QE ESKDKR PLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESV-VAADIVGCLEPKKRCAWVTSNA--------------------------------------
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV V  D VGCLE KKRCAWVT N                                       
Subjt:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESV-VAADIVGCLEPKKRCAWVTSNA--------------------------------------

Query:  ---AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
            KR   + +FL     +    +  +KKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEF SFNVY+WNFVNHKPIISQFRYPRQVPDKTSKAEVI
Subjt:  ---AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECI--GTAEKGERDGDNVKPTINEKIPDALKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRF ECI   TAEKGERDG  +K   NEK+P+ALKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECI--GTAEKGERDGDNVKPTINEKIPDALKNLEL

XP_022155202.1 uncharacterized protein LOC111022341 [Momordica charantia] | e value: 2.0e-140 | % identity: 76.49
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVA+SDSRPVLGPTGNKARPVE RKPG KPLKKLEKP QEAESKDKR PLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVES-VVAADIVGCLEPKKRCAWVTSNA---------------------------------------
        SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVES VV  D V  LEPKKRCAWVT N                                        
Subjt:  SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVES-VVAADIVGCLEPKKRCAWVTSNA---------------------------------------

Query:  --AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
           KR   + +FL      +   +  +KKMVA GSAATSLLSELKVRAIIENGRQMCKVIDEF SF+VYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
Subjt:  --AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRFPECI TAE+GE+DG+ +KP INEKIP+ALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL

XP_023514420.1 uncharacterized protein LOC111778684 [Cucurbita pepo subsp. pepo] | e value: 9.3e-138 | % identity: 75.81
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVA+SDSRPVLGPTGNKAR VETRK GVKPLKKLEKP QEAESKDKR PLSPPQCV +VPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVA-ADIVGCLEPKKRCAWVTSNA-------------------------------------
        NSRASSARGTRQRGPNLRRK  S+VKRAEKAVEKVG ESVVA A+ VGCLEPKKRCAWVTSN                                      
Subjt:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVA-ADIVGCLEPKKRCAWVTSNA-------------------------------------

Query:  ----AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEV
             KR   +  FL      +   +  +KKMVAPGSAATSLLSE KVRAIIENGRQMCKVIDEF SFNVY+WNFVNHKP ISQFRYPRQVPDKTSKAEV
Subjt:  ----AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEV

Query:  ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL
        ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+ CFRFPECI T EKGERDGD +KPTI EKIP+ALKNLEL
Subjt:  ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0K8L6 Uncharacterized protein | e value: 5.9e-138 | % identity: 75.34
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVA+SDSRPVLGPTGNKAR VETRKPGVKPLKKLEKP+QE ESKDKR PLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESV-VAADIVGCLEPKKRCAWVTSNA--------------------------------------
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV V  D VGCLE KKRCAWVT N                                       
Subjt:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESV-VAADIVGCLEPKKRCAWVTSNA--------------------------------------

Query:  ---AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
            KR   + +FL     +    +  +KKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEF SFNVY+WNFVNHKPIISQFRYPRQVPDKTSKAEVI
Subjt:  ---AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECI--GTAEKGERDGDNVKPTINEKIPDALKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRF ECI   TAEKGERDG  +K   NEK+P+ALKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECI--GTAEKGERDGDNVKPTINEKIPDALKNLEL

A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing] | e value: 1.1e-136 | % identity: 75.07
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVA+SDSRPVLGPTGNKAR VETRKPGVKPLKKLEKP+QE ESKDKR PLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESV-VAADIVGCLEPKKRCAWVTSNA--------------------------------------
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV V AD VGCLE KKRCAWVT N                                       
Subjt:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESV-VAADIVGCLEPKKRCAWVTSNA--------------------------------------

Query:  ---AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
            KR   + +FL      +   +  +KKMVAPGSAATSLLSELK+RAIIENGRQMCKVIDEF SFNVY+WNFVNHKPIISQFRYPRQVPDKTSKAEVI
Subjt:  ---AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECI--GTAEKGERDGDNVKPTINEKIPDALKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRF ECI   TAEKGERDG+ +K   NEK+P+ALKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECI--GTAEKGERDGDNVKPTINEKIPDALKNLEL

A0A5A7UYZ9 Putative GMP synthase | e value: 1.1e-136 | % identity: 75.07
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVA+SDSRPVLGPTGNKAR VETRKPGVKPLKKLEKP+QE ESKDKR PLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESV-VAADIVGCLEPKKRCAWVTSNA--------------------------------------
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV V AD VGCLE KKRCAWVT N                                       
Subjt:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESV-VAADIVGCLEPKKRCAWVTSNA--------------------------------------

Query:  ---AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
            KR   + +FL      +   +  +KKMVAPGSAATSLLSELK+RAIIENGRQMCKVIDEF SFNVY+WNFVNHKPIISQFRYPRQVPDKTSKAEVI
Subjt:  ---AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECI--GTAEKGERDGDNVKPTINEKIPDALKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRF ECI   TAEKGERDG+ +K   NEK+P+ALKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECI--GTAEKGERDGDNVKPTINEKIPDALKNLEL

A0A6J1DNQ3 uncharacterized protein LOC111022341 | e value: 9.7e-141 | % identity: 76.49
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVA+SDSRPVLGPTGNKARPVE RKPG KPLKKLEKP QEAESKDKR PLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVES-VVAADIVGCLEPKKRCAWVTSNA---------------------------------------
        SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVES VV  D V  LEPKKRCAWVT N                                        
Subjt:  SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVES-VVAADIVGCLEPKKRCAWVTSNA---------------------------------------

Query:  --AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
           KR   + +FL      +   +  +KKMVA GSAATSLLSELKVRAIIENGRQMCKVIDEF SF+VYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
Subjt:  --AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRFPECI TAE+GE+DG+ +KP INEKIP+ALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL

A0A6J1H7A2 uncharacterized protein LOC111461081 | e value: 2.9e-137 | % identity: 75.54
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVA+SDSRPVLGPTGNKAR VETRK GVKPLKKLEKP QEAESKDKR PLSPPQCV +VPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVAA-DIVGCLEPKKRCAWVTSNA-------------------------------------
        NSRASSARGTRQRGPNLRRK  STVKRAEKAVEKVG ESVVAA + VGCLEPKKRCAWVTSN                                      
Subjt:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVAA-DIVGCLEPKKRCAWVTSNA-------------------------------------

Query:  ----AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEV
             KR   +  FL      +   +  +KKMVAPGSAATSLLSE KVRAIIENGRQMCKVIDEF SFNVY+WNFVNHKP ISQFRYPRQVPDKTSKA+V
Subjt:  ----AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEV

Query:  ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL
        ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+ CFRF ECI T EKGERDGD +KPTI EKIP+ALKNLEL
Subjt:  ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL

SwissProt top hits (columns: hit, e value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 | e value: 5.0e-17 | % identity: 41.94
Query:  KVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFP
        K++AII N R   ++      F  ++W+FVNH+P ++Q     ++P  TS ++ +SK L KRGF+ VG T+ Y+FMQ  GL NDH++GC  +P
Subjt:  KVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFP

P44321 DNA-3-methyladenine glycosylase | e value: 1.1e-13 | % identity: 42.7
Query:  KVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGC
        K+ AI++N +    +     +F+ +IW+FVNHKPI++     R VP KT  ++ +SK L KRGF  +G T  Y FMQ  GL +DHL  C
Subjt:  KVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] | e value: 8.5e-17 | % identity: 44.33
Query:  KVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIG
        K+ A I N +    V  EF SF+ YIW FV  KPII+ F     +P  T  ++ I+KDL KRGF+ VG T +Y  MQ  G+ NDHL  CF+    +G
Subjt:  KVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIG

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G13635.1 DNA glycosylase superfamily protein | e value: 1.6e-31 | % identity: 47.06
Query:  QKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLT
        +K++    S    +L E +VR I++N + + KV++EF SF+ ++W F+++KPII++F+Y R VP ++ KAE+ISKD++KRGFR VGP ++++FMQ AGLT
Subjt:  QKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLT

Query:  NDHLIGCFRFPECIGTAEK
         DHL+ CFR  +C+  AE+
Subjt:  NDHLIGCFRFPECIGTAEK

AT1G15970.1 DNA glycosylase superfamily protein | e value: 7.0e-51 | % identity: 41.94
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQE---AESKDKRG-----PLSP----PQCVSV-PSVLRQQDRHQAILNLSMN
        MS PPR RS+N  E + R VLGPTGNK +    RKP   P  KLEKP  E    +SKD++      P SP     QC S+  S+LR+        + SM 
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQE---AESKDKRG-----PLSP----PQCVSV-PSVLRQQDRHQAILNLSMN

Query:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVAADIVGCLEPKKRCAWVTSNA---------------------------
        AS SSDASS   +S  S A  +  +   +RR  S     + +V K   E  V+ D     + +KRCAW+T  A                           
Subjt:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVAADIVGCLEPKKRCAWVTSNA---------------------------

Query:  --------------AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQ
                      ++R  ++ VF+     +    E   KK+ APG+AA SLLSE+K+R+I++N R + K+I E  S   Y+WNFVN+KP  SQFRY RQ
Subjt:  --------------AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQ

Query:  VPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAE---------KGERDGD
        VP KTSKAE ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHLIGCFR+ +C   AE         K ER+ D
Subjt:  VPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAE---------KGERDGD

AT1G80850.1 DNA glycosylase superfamily protein | e value: 4.1e-51 | % identity: 40
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MS PPR+RS++ ++ + R VLGP GNK +     KP  KP+ +  K     E   +  PLSPP       +LR+         +SM AS SSDASS   +
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVAADIVGCLEPKKRCAWVTSNA----------------------------------------
        S  S    +   G  + R+  +V  +      +  E    A    C + +KRCAW+T  +                                        
Subjt:  SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVAADIVGCLEPKKRCAWVTSNA----------------------------------------

Query:  -AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
         +KR   + VF+     +    E   KK+ +P  AAT+LLSE K+R+I+EN  Q+CK+I  F SF+ YIWNFVN KP  SQFRYPRQVP KTSKAE+ISK
Subjt:  -AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKG
        DLV+RGFRSV PTVIY+FMQ AGLTNDHL  CFR  +C+   E G
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKG

AT5G57970.1 DNA glycosylase superfamily protein | e value: 5.9e-58 | % identity: 42.13
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPP----------QCVSVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVAE+++R  LG T  KA P  T K   K L+KLE+        D++   + P            ++  S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPP----------QCVSVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVAADIVGCLEPKKRCAWVTSNA----------------------------
        S SSDAS DSF+SRAS+ R  R      R K S   +    V +  ++S          E KKRC WVT N+                            
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVAADIVGCLEPKKRCAWVTSNA----------------------------

Query:  -------------AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQV
                     +KR   + VF      +    +  +KK++ PGS A++LLS+LK+RA+IEN RQ+ KVI+E+ SF+ YIW+FV +K I+S+FRY RQV
Subjt:  -------------AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEK
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL  CFRF  CI   E+
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEK

AT5G57970.2 DNA glycosylase superfamily protein | e value: 5.9e-58 | % identity: 42.13
Query:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPP----------QCVSVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVAE+++R  LG T  KA P  T K   K L+KLE+        D++   + P            ++  S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPP----------QCVSVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVAADIVGCLEPKKRCAWVTSNA----------------------------
        S SSDAS DSF+SRAS+ R  R      R K S   +    V +  ++S          E KKRC WVT N+                            
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVAADIVGCLEPKKRCAWVTSNA----------------------------

Query:  -------------AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQV
                     +KR   + VF      +    +  +KK++ PGS A++LLS+LK+RA+IEN RQ+ KVI+E+ SF+ YIW+FV +K I+S+FRY RQV
Subjt:  -------------AKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFSSFNVYIWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEK
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL  CFRF  CI   E+
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEK
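In the alignments above, the middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough sketch, a percent identity can be estimated from the query and middle lines alone. The function name `percent_identity` is hypothetical, and the code assumes identity is counted over the full alignment length including gap columns, which may differ from how the database computed the reported values.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate % identity from a BLAST-style query line and its midline.

    Assumption: identity = identical columns / alignment length,
    where alignment length includes gap columns.
    """
    if not query:
        raise ValueError("empty alignment")
    # The midline may have lost trailing spaces; pad it to the query length.
    midline = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Toy example (not taken from the record): 3 identical columns out of 5.
print(percent_identity("ACDEF", "AC+ F"))
```

For a hit that spans several blocks, the query lines and middle lines would each be concatenated, with the `Query:` label and leading padding stripped, before calling the function.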


Sequences
CDS sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGAGTCCGACTCCCGACCGGTACTTGGGCCTACTGGGAACAAAGCACGACCTGTCGAGACCAGAAAACCTGG
TGTGAAGCCTTTGAAGAAGCTTGAGAAGCCTCAACAAGAAGCTGAATCAAAGGACAAGAGGGGGCCATTGTCACCGCCTCAATGCGTCTCAGTGCCATCGGTTTTGAGGC
AGCAGGACCGCCACCAGGCGATTCTCAATCTGTCGATGAATGCCTCGTGTTCTTCTGATGCGTCGTCTGATTCTTTTAATAGCAGGGCATCTAGTGCAAGAGGTACGAGG
CAGCGCGGTCCGAATTTAAGGAGAAAGCAAAGTACGGTGAAGAGGGCTGAAAAGGCTGTTGAAAAGGTTGGCGTTGAAAGTGTGGTGGCGGCGGATATTGTTGGTTGCTT
AGAGCCCAAAAAACGATGTGCTTGGGTAACATCTAATGCAGCTAAAAGAGTTCAGATAAAAATGGTGTTCTTGTCTTGTCAACGTTGCTCCTCATTCAATGATGAGCAGC
GTCAAAAAAAGATGGTTGCACCTGGAAGTGCTGCTACTTCTCTACTGTCAGAACTTAAGGTGCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATTGATGAA
TTTAGTTCCTTCAACGTGTACATTTGGAACTTTGTCAACCATAAACCTATCATCAGTCAGTTCCGGTACCCACGCCAGGTCCCCGATAAGACATCAAAAGCAGAGGTGAT
TAGCAAGGATCTCGTTAAGAGAGGGTTTCGAAGTGTGGGACCGACAGTCATCTATACATTCATGCAGGTGGCTGGATTAACGAATGACCATCTCATCGGTTGCTTTAGGT
TTCCAGAATGTATAGGGACAGCAGAGAAAGGAGAAAGAGATGGTGACAACGTCAAGCCTACTATTAACGAGAAAATACCAGACGCTCTGAAAAACTTGGAACTATAA
mRNA sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGAGTCCGACTCCCGACCGGTACTTGGGCCTACTGGGAACAAAGCACGACCTGTCGAGACCAGAAAACCTGG
TGTGAAGCCTTTGAAGAAGCTTGAGAAGCCTCAACAAGAAGCTGAATCAAAGGACAAGAGGGGGCCATTGTCACCGCCTCAATGCGTCTCAGTGCCATCGGTTTTGAGGC
AGCAGGACCGCCACCAGGCGATTCTCAATCTGTCGATGAATGCCTCGTGTTCTTCTGATGCGTCGTCTGATTCTTTTAATAGCAGGGCATCTAGTGCAAGAGGTACGAGG
CAGCGCGGTCCGAATTTAAGGAGAAAGCAAAGTACGGTGAAGAGGGCTGAAAAGGCTGTTGAAAAGGTTGGCGTTGAAAGTGTGGTGGCGGCGGATATTGTTGGTTGCTT
AGAGCCCAAAAAACGATGTGCTTGGGTAACATCTAATGCAGCTAAAAGAGTTCAGATAAAAATGGTGTTCTTGTCTTGTCAACGTTGCTCCTCATTCAATGATGAGCAGC
GTCAAAAAAAGATGGTTGCACCTGGAAGTGCTGCTACTTCTCTACTGTCAGAACTTAAGGTGCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATTGATGAA
TTTAGTTCCTTCAACGTGTACATTTGGAACTTTGTCAACCATAAACCTATCATCAGTCAGTTCCGGTACCCACGCCAGGTCCCCGATAAGACATCAAAAGCAGAGGTGAT
TAGCAAGGATCTCGTTAAGAGAGGGTTTCGAAGTGTGGGACCGACAGTCATCTATACATTCATGCAGGTGGCTGGATTAACGAATGACCATCTCATCGGTTGCTTTAGGT
TTCCAGAATGTATAGGGACAGCAGAGAAAGGAGAAAGAGATGGTGACAACGTCAAGCCTACTATTAACGAGAAAATACCAGACGCTCTGAAAAACTTGGAACTATAA
Protein sequence
MSGPPRIRSMNVAESDSRPVLGPTGNKARPVETRKPGVKPLKKLEKPQQEAESKDKRGPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTR
QRGPNLRRKQSTVKRAEKAVEKVGVESVVAADIVGCLEPKKRCAWVTSNAAKRVQIKMVFLSCQRCSSFNDEQRQKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDE
FSSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFPECIGTAEKGERDGDNVKPTINEKIPDALKNLEL
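The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record does not say which table was used, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGTCAGGCCCTCCGAGAATCCGGTCTATG"))  # MSGPPRIRSM
```

The ten residues produced match the start of the protein sequence shown in this record. Passing the full CDS, with its line breaks, should allow the same comparison over the whole sequence.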