; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014047 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014047
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationChr02:7118342..7121064
RNA-Seq ExpressionHG10014047
SyntenyHG10014047
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591330.1 hypothetical protein SDJN03_13676, partial [Cucurbita argyrosperma subsp. sororia]1.3e-13077.64Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV
        MSVATKL SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK  EKQNKALP I ESVIRDN+S+GSSCSSDSLSSNYS KL NPKVKP  VKPVKAVA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV

Query:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN
        GGDPN T T+PRLS+PGKRC WIT YSDPLYIAFHD+EWGVPVHDD+KLFELLVLSQALAELTWPLIL KRD+FRKV NDFDPS+I+QFT+NEFT+LK N
Subjt:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN

Query:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVL                                  KTPKAEFMSKDL+RRGFRCVGPTVVYSFMQV GIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QECDAKIKDDTKL
        QECD + KDD  +
Subjt:  QECDAKIKDDTKL

XP_004147795.1 uncharacterized protein LOC101206397 [Cucumis sativus]4.5e-13178.06Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV
        MSVATKLQSH KP LE RAILGPGGNRDRAP+ PKCK ETLKK EKQ+KALP ISESVIRDNVSVGSSCSSDSLSSNYSAKL    +KPY+VKPV A   
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV

Query:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN
        GGD NAT TSP LSLPGKRCDWITL+SDPLYIAFHD+EWGVP+HDDKKLFELLVLSQALAELTWPLILSKRD+FRKVLNDFDPSSI+QFTENEFT+LKVN
Subjt:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN

Query:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        GIQLLSEPKLRAIV+NANQVL                                  KTPKAEFMSKD+IRRGFRCVGPTVVYSFMQVAGIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QECDAKIKDDTKLRVEDQR
        +ECD K+KDD KLRVED+R
Subjt:  QECDAKIKDDTKLRVEDQR

XP_022141169.1 uncharacterized protein LOC111011623 [Momordica charantia]3.1e-13275.07Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVA
        M VA K +SHAKPVLESRAILGPGGNRDR PEKP+CK E TL K EKQNKALP + +SVIRDNVSVGSSCSSDSLSSNYSAKL NPKVKPY+VKPVKAVA
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVA

Query:  VGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKV
         G + +AT TSPR S+P KRCDWIT YSDPLYIAFHD+EWGVP+HDDKKLFELLVLSQALAELTWP ILSKRD+FRKV NDFDPSSI++FTENEF +LKV
Subjt:  VGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKV

Query:  NGIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFR
        NGIQ+L+EPKLRAIVENANQVL                                  KTPKAE MSKDLIRRGFRCVGPTVVYSFMQV GIVNDHLVNCFR
Subjt:  NGIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFR

Query:  YQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTRS
        YQECDA +KDD K RVED R E L  GA EKPCL+RS
Subjt:  YQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTRS

XP_022935907.1 uncharacterized protein LOC111442674 [Cucurbita moschata]1.2e-13477.81Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV
        MSVATKL SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK  EKQNKALP I ESV+RDN+S+GSSCSSDSLSSNYS KL NPKVKP  VKPVKAVA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV

Query:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN
        GGDPN T T+PRLS+PGKRC WIT YSDPLYIAFHD+EWGVPVHDD+KLFELLVLSQALAELTWPLIL KRD+FRKV NDFDPS+I+QFT+NEFT+LK N
Subjt:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN

Query:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVL                                  KTPKAEFMSKDL+RRGFRCVGPTVVYSFMQV GIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QECDAKIKDDTKLRVEDQRSESLLTGALE
        QECD       KLRVEDQRSE LLTGALE
Subjt:  QECDAKIKDDTKLRVEDQRSESLLTGALE

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida]2.2e-14682.39Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV
        MSVATKLQSHAKPVLESR ILGPGGNRDRAPEKPKCKQ+TLKK EKQN+ALP+ISESVIRDNVSVGSSCSSDS+SSNYSAKL  PKVKP +VKPVKAVA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV

Query:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN
        GGD NAT  SP LSLPGKRCDWITL+SDPLYIAFHD+EWGVPVHDDKKLFELLVLSQALAELTWPLILSKRD+FRKVLNDFDPS+I+QFTENEFT+LKVN
Subjt:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN

Query:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
         IQLLSEPKLRAIVENANQVL                                  KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTR
        QECDAKIKDD KLRVED+RSES LTGALEKPCLTR
Subjt:  QECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTR

TrEMBL top hitse value%identityAlignment
A0A0A0LG22 Uncharacterized protein2.2e-13178.06Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV
        MSVATKLQSH KP LE RAILGPGGNRDRAP+ PKCK ETLKK EKQ+KALP ISESVIRDNVSVGSSCSSDSLSSNYSAKL    +KPY+VKPV A   
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV

Query:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN
        GGD NAT TSP LSLPGKRCDWITL+SDPLYIAFHD+EWGVP+HDDKKLFELLVLSQALAELTWPLILSKRD+FRKVLNDFDPSSI+QFTENEFT+LKVN
Subjt:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN

Query:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        GIQLLSEPKLRAIV+NANQVL                                  KTPKAEFMSKD+IRRGFRCVGPTVVYSFMQVAGIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QECDAKIKDDTKLRVEDQR
        +ECD K+KDD KLRVED+R
Subjt:  QECDAKIKDDTKLRVEDQR

A0A6J1CHU1 uncharacterized protein LOC1110116231.5e-13275.07Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVA
        M VA K +SHAKPVLESRAILGPGGNRDR PEKP+CK E TL K EKQNKALP + +SVIRDNVSVGSSCSSDSLSSNYSAKL NPKVKPY+VKPVKAVA
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVA

Query:  VGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKV
         G + +AT TSPR S+P KRCDWIT YSDPLYIAFHD+EWGVP+HDDKKLFELLVLSQALAELTWP ILSKRD+FRKV NDFDPSSI++FTENEF +LKV
Subjt:  VGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKV

Query:  NGIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFR
        NGIQ+L+EPKLRAIVENANQVL                                  KTPKAE MSKDLIRRGFRCVGPTVVYSFMQV GIVNDHLVNCFR
Subjt:  NGIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFR

Query:  YQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTRS
        YQECDA +KDD K RVED R E L  GA EKPCL+RS
Subjt:  YQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTRS

A0A6J1F6S0 uncharacterized protein LOC1114426745.6e-13577.81Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV
        MSVATKL SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK  EKQNKALP I ESV+RDN+S+GSSCSSDSLSSNYS KL NPKVKP  VKPVKAVA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV

Query:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN
        GGDPN T T+PRLS+PGKRC WIT YSDPLYIAFHD+EWGVPVHDD+KLFELLVLSQALAELTWPLIL KRD+FRKV NDFDPS+I+QFT+NEFT+LK N
Subjt:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN

Query:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVL                                  KTPKAEFMSKDL+RRGFRCVGPTVVYSFMQV GIVNDHLVNCFRY
Subjt:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QECDAKIKDDTKLRVEDQRSESLLTGALE
        QECD       KLRVEDQRSE LLTGALE
Subjt:  QECDAKIKDDTKLRVEDQRSESLLTGALE

A0A6J1FIT9 uncharacterized protein LOC1114461253.2e-13075.45Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV
        MSVATKLQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+  KQNKALPV+SESV+RDNVSVGSSCSSDSLSSNYSAKL N K KP   KPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV

Query:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN
        GGD NAT TSP LS+ GKRCDWIT YSDPLYIAFHD+EWGVPVHDDKKLFELLVLSQALAELTWPLILSKR +FRKV NDFDPSSI+ FTE EFT+LKVN
Subjt:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN

Query:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
          QLLS+ KLRAIVENANQVL                                  KTPKAEFMSKDL++RGFRCVGPTVVYSF+QV+GIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QECDAKIKDDTKLRVEDQRSESLLTGALEKPCLT
        QECDA +KDD KLRVE++RSE LL  ALEK  LT
Subjt:  QECDAKIKDDTKLRVEDQRSESLLTGALEKPCLT

A0A6J1IHE9 uncharacterized protein LOC1114769759.3e-13076.6Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV
        MSVATKL SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK  EKQNKALP I ESVIRDN+S+GSSCSSDSLSSN SAKL NPK     VKPVKAVA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAV

Query:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN
        GGDPN T T+PRLS+PGKRC WIT YSDPLYIAFHD+EWGVPVHDD+KLFELLVLSQALAELTWPLIL KRD+FRKV NDFDPS+I+QFT+NEFT+LK N
Subjt:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN

Query:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
        GIQLLSEPKLRAIVENANQVL                                  KTPKAEFMSKDL+RRGFRCVGPTVVYSFMQV GIVNDHLV+CFRY
Subjt:  GIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

Query:  QECDAKIKDDTKLRVEDQRSESLLTGALE
        QECD       KLRVEDQ SE LLTGALE
Subjt:  QECDAKIKDDTKLRVEDQRSESLLTGALE

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 11.6e-2534.97Show/hide
Query:  KRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENA
        +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W  +L KR+ +R   + FDP  ++   E +   L  +   +    K++AI+ NA
Subjt:  KRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENA

Query:  NQVLK----------------------------------TPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY
           L+                                  T  ++ +SK L +RGF+ VG T+ YSFMQ  G+VNDH+V C  Y
Subjt:  NQVLK----------------------------------TPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRY

P44321 DNA-3-methyladenine glycosylase5.2e-2133.52Show/hide
Query:  RCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENAN
        RC W+   S  +YI +HDKEWG P  D +KLFE + L    A L+W  +L KR+ +R+  + FDP  I++ T  +  +   N   +    KL AIV+NA 
Subjt:  RCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENAN

Query:  QVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNC
          L                                  KT  ++ +SK L +RGF  +G T  Y+FMQ  G+V+DHL +C
Subjt:  QVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]2.3e-2434.78Show/hide
Query:  RCDWITLYSD---PLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVE
        RC W T   +    LY  +HD EWG P+H+DKKLFE LVL    A L+W  IL KR+ FR   +DFDP  ++ + E++   L  N   + +  K+ A + 
Subjt:  RCDWITLYSD---PLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVE

Query:  NANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFR
        NA   +                                   TP ++ ++KDL +RGF+ VG T +Y+ MQ  G+VNDHL +CF+
Subjt:  NANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFR

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein2.0e-5238.4Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNR-DRAP-----EKPKCKQETL-KKVEKQNKALPVIS------------ESVIRDNVS---------VGSSCSSD
        MSV  + +S      E R++LGP GN+  R P     EKP  ++  +  K EK  K     S             S++R N +           SSC S 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNR-DRAP-----EKPKCKQETL-KKVEKQNKALPVIS------------ESVIRDNVS---------VGSSCSSD

Query:  SLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRD
         LS   S+       +  SV   + ++VG +     +    +   KRC WIT  +DP Y+AFHD+EWGVPVHDDKKLFELL LS ALAEL+W  ILS+R 
Subjt:  SLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRD

Query:  LFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQV----------------------------------LKTPKAEFMSKDLIRRGF
        + R+V  DFDP ++++  + + T+     I LLSE K+R+I++N+  V                                  +KT KAEF+SKDL+RRGF
Subjt:  LFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQV----------------------------------LKTPKAEFMSKDLIRRGF

Query:  RCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC--DAKIKDDTKLRVEDQR
        R V PTV+YSFMQ AG+ NDHL+ CFRYQ+C  DA+    TK + +++R
Subjt:  RCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC--DAKIKDDTKLRVEDQR

AT1G75090.1 DNA glycosylase superfamily protein1.4e-6647.17Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKAL--PVISESVIRDNVSVGSSCSS-DSLSSNYSAKLSNPKVKPYSVK---P
        MS+ +KL+S  KP+ ESRAIL   GNR +  +    K+  L     ++ A   P  + SV  D+ S  SS S   S+++  S K++ P  +    K    
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKAL--PVISESVIRDNVSVGSSCSS-DSLSSNYSAKLSNPKVKPYSVK---P

Query:  VKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEF
        V +VAV  D      SP++  P KRC WIT  SDP+Y+ FHD+EWGVPV DDKKLFELLV SQALAE +WP IL +RD FRK+  +FDPS+I+QFTE   
Subjt:  VKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEF

Query:  TSLKVNGIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHL
         SL+VNG  +LSE KLRAIVENA  VL                                  K+PKAE++SKD+++RGFRCVGPTV+YSF+Q +GIVNDHL
Subjt:  TSLKVNGIQLLSEPKLRAIVENANQVL----------------------------------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHL

Query:  VNCFRYQECDAKIKDDTK
          CFRYQEC+ + + +TK
Subjt:  VNCFRYQECDAKIKDDTK

AT1G80850.1 DNA glycosylase superfamily protein1.1e-5341.87Show/hide
Query:  MSVATKLQSHAKPVLESRAILGPGGNR------DRAPEKPKC-KQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVK
        MS   +++S      E R++LGP GN+       +  +KP   K + L   EK  +  P+    + R+ +S+ +S SSD+ SS  S+ LS       S  
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNR------DRAPEKPKC-KQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVK

Query:  PVKAVAVGGDPNATATSPRLSLP--------------GKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLN
          K V       ++++S R +L                KRC WIT  SD  YIAFHD+EWGVPVHDDK+LFELL LS ALAEL+W  ILSKR LFR+V  
Subjt:  PVKAVAVGGDPNATATSPRLSLP--------------GKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLN

Query:  DFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQV----------------------------------LKTPKAEFMSKDLIRRGFRCVGPTV
        DFDP +IS+ T  + TS ++    LLSE KLR+I+ENANQV                                  +KT KAE +SKDL+RRGFR V PTV
Subjt:  DFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQV----------------------------------LKTPKAEFMSKDLIRRGFRCVGPTV

Query:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDT
        +YSFMQ AG+ NDHL  CFR+ +C    KD+T
Subjt:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDT

AT5G57970.1 DNA glycosylase superfamily protein3.1e-5345.74Show/hide
Query:  ESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKV----------KPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHD
        E  +  N+S+ +S SSD+   ++ ++ S  ++          K Y  KP   V+ G    A  + P  S   KRC W+T  SDP YI FHD+EWGVPVHD
Subjt:  ESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKV----------KPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHD

Query:  DKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQVL------------------------
        DK+LFELLVLS ALAE TWP ILSKR  FR+V  DFDP++I +  E +          LLS+ KLRA++ENA Q+L                        
Subjt:  DKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQVL------------------------

Query:  ----------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC
                  KTPKAE +SKDL+RRGFR VGPTVVYSFMQ AGI NDHL +CFR+  C
Subjt:  ----------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC

AT5G57970.2 DNA glycosylase superfamily protein3.1e-5345.74Show/hide
Query:  ESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKV----------KPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHD
        E  +  N+S+ +S SSD+   ++ ++ S  ++          K Y  KP   V+ G    A  + P  S   KRC W+T  SDP YI FHD+EWGVPVHD
Subjt:  ESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKV----------KPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHD

Query:  DKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQVL------------------------
        DK+LFELLVLS ALAE TWP ILSKR  FR+V  DFDP++I +  E +          LLS+ KLRA++ENA Q+L                        
Subjt:  DKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQVL------------------------

Query:  ----------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC
                  KTPKAE +SKDL+RRGFR VGPTVVYSFMQ AGI NDHL +CFR+  C
Subjt:  ----------KTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGTGGCTACGAAGCTCCAATCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCAGAGAAGCCAAAATGTAA
ACAGGAGACTTTGAAGAAGGTAGAGAAGCAGAACAAGGCACTTCCGGTGATTTCTGAATCGGTTATTCGAGACAATGTCTCCGTCGGGAGCTCCTGCTCTTCCGATTCTT
TATCAAGCAATTATTCGGCCAAATTGTCGAATCCCAAAGTGAAGCCCTACTCTGTGAAGCCTGTGAAGGCTGTTGCTGTCGGCGGTGACCCAAACGCTACTGCAACGTCG
CCTAGGCTCTCGCTTCCGGGGAAACGTTGTGATTGGATAACGCTTTATTCGGACCCACTTTACATCGCTTTTCATGACAAAGAATGGGGAGTCCCAGTTCATGACGACAA
GAAGCTATTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAACTTACTTGGCCCTTGATTCTTAGCAAGAGAGATCTATTTAGGAAAGTTTTGAATGACTTTGACCCAT
CTTCCATCTCACAGTTCACCGAGAATGAGTTTACATCACTAAAAGTAAATGGCATCCAGCTCCTGTCTGAACCAAAGCTTCGTGCAATCGTTGAGAACGCTAATCAAGTA
CTCAAGACGCCAAAAGCAGAGTTCATGAGCAAGGATTTGATCAGGAGAGGATTTCGTTGCGTCGGGCCAACTGTGGTTTATTCCTTCATGCAGGTTGCTGGAATTGTTAA
CGATCACTTGGTCAATTGCTTCAGATATCAAGAGTGTGATGCAAAGATAAAAGACGATACGAAACTAAGAGTAGAAGATCAACGATCGGAGTCGTTGCTTACCGGAGCTC
TTGAGAAGCCTTGCTTGACTAGATCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGTGGCTACGAAGCTCCAATCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCAGAGAAGCCAAAATGTAA
ACAGGAGACTTTGAAGAAGGTAGAGAAGCAGAACAAGGCACTTCCGGTGATTTCTGAATCGGTTATTCGAGACAATGTCTCCGTCGGGAGCTCCTGCTCTTCCGATTCTT
TATCAAGCAATTATTCGGCCAAATTGTCGAATCCCAAAGTGAAGCCCTACTCTGTGAAGCCTGTGAAGGCTGTTGCTGTCGGCGGTGACCCAAACGCTACTGCAACGTCG
CCTAGGCTCTCGCTTCCGGGGAAACGTTGTGATTGGATAACGCTTTATTCGGACCCACTTTACATCGCTTTTCATGACAAAGAATGGGGAGTCCCAGTTCATGACGACAA
GAAGCTATTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAACTTACTTGGCCCTTGATTCTTAGCAAGAGAGATCTATTTAGGAAAGTTTTGAATGACTTTGACCCAT
CTTCCATCTCACAGTTCACCGAGAATGAGTTTACATCACTAAAAGTAAATGGCATCCAGCTCCTGTCTGAACCAAAGCTTCGTGCAATCGTTGAGAACGCTAATCAAGTA
CTCAAGACGCCAAAAGCAGAGTTCATGAGCAAGGATTTGATCAGGAGAGGATTTCGTTGCGTCGGGCCAACTGTGGTTTATTCCTTCATGCAGGTTGCTGGAATTGTTAA
CGATCACTTGGTCAATTGCTTCAGATATCAAGAGTGTGATGCAAAGATAAAAGACGATACGAAACTAAGAGTAGAAGATCAACGATCGGAGTCGTTGCTTACCGGAGCTC
TTGAGAAGCCTTGCTTGACTAGATCCTGA
Protein sequenceShow/hide protein sequence
MSVATKLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNATATS
PRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQV
LKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTRS