CuGenDBv2

Spg007289 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg007289
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DNA glycosylase superfamily protein
Genome location: scaffold9:46445787..46448949
RNA-Seq Expression: Spg007289
Synteny: Spg007289
Gene Ontology terms: GO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains: IPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase
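
The genome location in this record is written as `scaffold:start..end`. As a minimal sketch, the Python below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which is common for annotation databases but is not stated on this page; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

# Pattern for "seqid:start..end", e.g. "scaffold9:46445787..46448949".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group("seqid"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold9:46445787..46448949")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 3,163 bp on scaffold9.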


Homology
GenBank top hits | e value | % identity | Alignment
KAG6608520.1 hypothetical protein SDJN03_01862, partial [Cucurbita argyrosperma subsp. sororia] | 7.5e-157 | 81.74
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHA+PVLESRAILGPGGNRDR PEKPKCKQE LK+T K NKALPVVSESVVRDNVSVGSSCSSDS SSNYSAKLLN K KP   KPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV
        GGDANATTTSP LS+  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IF                       RKV
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV

Query:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP
        FNDFDPS+IAQFTE EF+TLKVN  QLLS+ KLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKP RN++RY RQVPVKTPKAEFMSKDLM+RGFRCVGP
Subjt:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP

Query:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLT
        TVVYSFLQVSGIVNDHLVDCFRYQECD +VK++MKLRVE+RRSELL  ALEK  LT
Subjt:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLT

XP_022141169.1 uncharacterized protein LOC111011623 [Momordica charantia] | 5.5e-160 | 80.5
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQE-TLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVA
        M VA K +SHAKPVLESRAILGPGGNRDR PEKP+CK E TL KTEK NKALP V +SV+RDNVSVGSSCSSDS SSNYSAKLLN KVKPYAVKPVK VA
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQE-TLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVA

Query:  VGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARK
         G +A+ATTTSPR S+PRKRCDWITPYSDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWPSILSKRD+F                       RK
Subjt:  VGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARK

Query:  VFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVG
        VFNDFDPS+IA+FTENEF+TLKVNGIQ+L+EPKLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKP RN+FRYARQVPVKTPKAE MSKDL+RRGFRCVG
Subjt:  VFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVG

Query:  PTVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLTRS
        PTVVYSF+QV+GIVNDHLV+CFRYQECD NVKD+MK RVED R EL  GA EKPCL+RS
Subjt:  PTVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLTRS

XP_022940560.1 uncharacterized protein LOC111446125 [Cucurbita moschata] | 7.5e-157 | 81.74
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHA+PVLESRAILGPGGNRDR PEKPKCKQE LK+T K NKALPVVSESVVRDNVSVGSSCSSDS SSNYSAKLLN K KP   KPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV
        GGDANATTTSP LS+  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IF                       RKV
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV

Query:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP
        FNDFDPS+IA FTE EF+TLKVN  QLLS+ KLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKP RN++RY RQVPVKTPKAEFMSKDLM+RGFRCVGP
Subjt:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP

Query:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLT
        TVVYSFLQVSGIVNDHLVDCFRYQECD +VKD+MKLRVE+RRSELL  ALEK  LT
Subjt:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLT

XP_023523621.1 uncharacterized protein LOC111787800 [Cucurbita pepo subsp. pepo] | 2.0e-157 | 82.02
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHA+PVLESRAILGPGGNRDR PEKPKCKQE LK+T K NKALPVVSESVVRDNVSVGSSCSSDS SSNYSAKLLN K KP   KPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV
        GGDANATTTSP LS+  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IF                       RKV
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV

Query:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP
        FNDFDPS+IAQFTE EF+TLKVN  QLLS+ KLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKP RN++RY RQVPVKTPKAEFMSKDLM+RGFRCVGP
Subjt:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP

Query:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLT
        TVVYSFLQVSGIVNDHLVDCFRYQECD +VKD+MKLRVE+RRSELL  ALEK  LT
Subjt:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLT

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida] | 1.7e-161 | 81.79
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHAKPVLESR ILGPGGNRDR PEKPKCKQ+TLKKTEK N+ALP++SESV+RDNVSVGSSCSSDS SSNYSAKLL  KVKP AVKPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV
        GGD NAT  SP LS+P KRCDWIT +SDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKRDIF                       RKV
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV

Query:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP
         NDFDPS IAQFTENEF+TLKVN IQLLSEPKLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKP RN FRY RQVPVKTPKAEFMSKDL+RRGFRCVGP
Subjt:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP

Query:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLTR
        TVVYSF+QV+GIVNDHLV+CFRYQECD  +KD+ KLRVED+RSE LTGALEKPCLTR
Subjt:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLTR

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CHU1 uncharacterized protein LOC111011623 | 2.7e-160 | 80.5
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQE-TLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVA
        M VA K +SHAKPVLESRAILGPGGNRDR PEKP+CK E TL KTEK NKALP V +SV+RDNVSVGSSCSSDS SSNYSAKLLN KVKPYAVKPVK VA
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQE-TLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVA

Query:  VGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARK
         G +A+ATTTSPR S+PRKRCDWITPYSDPLYIAFHDEEWGVP+HDDKKLFELLVLSQALAELTWPSILSKRD+F                       RK
Subjt:  VGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARK

Query:  VFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVG
        VFNDFDPS+IA+FTENEF+TLKVNGIQ+L+EPKLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKP RN+FRYARQVPVKTPKAE MSKDL+RRGFRCVG
Subjt:  VFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVG

Query:  PTVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLTRS
        PTVVYSF+QV+GIVNDHLV+CFRYQECD NVKD+MK RVED R EL  GA EKPCL+RS
Subjt:  PTVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLTRS

A0A6J1F6S0 uncharacterized protein LOC111442674 | 6.8e-156 | 80.63
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKL SHAKPVLESRAILGPGGNRDR PEKPKCKQETLK +EK NKALP + ESVVRDN+S+GSSCSSDS SSNYS KLLN KVKP  VKPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV
        GGD N TTT+PRLS+P KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIF                       RKV
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV

Query:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP
        FNDFDPSTIAQFT+NEF+TLK NGIQLLSEPKLRA+VENANQVLKIQQEFG+FSNYCWSFVNKKP  N+FRYARQVPVKTPKAEFMSKDL+RRGFRCVGP
Subjt:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP

Query:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALE
        TVVYSF+QV+GIVNDHLV+CFRYQEC     D MKLRVED+RSELLTGALE
Subjt:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALE

A0A6J1FIT9 uncharacterized protein LOC111446125 | 3.6e-157 | 81.74
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHA+PVLESRAILGPGGNRDR PEKPKCKQE LK+T K NKALPVVSESVVRDNVSVGSSCSSDS SSNYSAKLLN K KP   KPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV
        GGDANATTTSP LS+  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IF                       RKV
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV

Query:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP
        FNDFDPS+IA FTE EF+TLKVN  QLLS+ KLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKP RN++RY RQVPVKTPKAEFMSKDLM+RGFRCVGP
Subjt:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP

Query:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLT
        TVVYSFLQVSGIVNDHLVDCFRYQECD +VKD+MKLRVE+RRSELL  ALEK  LT
Subjt:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLT

A0A6J1IHE9 uncharacterized protein LOC111476975 | 1.3e-151 | 79.2
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKL SHAKPVLESRAILGPGGNRDR PEKPKCKQETLK +EK NKALP + ESV+RDN+S+GSSCSSDS SSN SAKLLN K     VKPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV
        GGD N TTT+PRLS+P KRC WITPYSDPLYIAFHDEEWGVPVHDD+KLFELLVLSQALAELTWP IL KRDIF                       RKV
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV

Query:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP
        FNDFDPSTIAQFT+NEF+TLK NGIQLLSEPKLRA+VENANQVLKIQQEFG+FSNYCWSFVNKKP  N+FRYARQ+PVKTPKAEFMSKDL+RRGFRCVGP
Subjt:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP

Query:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALE
        TVVYSF+QV+GIVNDHLVDCFRYQEC     D MKLRVED+ SELLTGALE
Subjt:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALE

A0A6J1J188 uncharacterized protein LOC111480412 | 1.8e-156 | 81.18
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV
        MSVATKLQSHA+PVLESRAILGPGGNRDR PEKPKCKQE LK+T K NKALPVVSESVVRDN+SVGSSCSSDS SSNYSAKLLN K KP   KPVK VA 
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAV

Query:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV
        GGDANATTTSP L +  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWP ILSKR IFR+                       V
Subjt:  GGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKV

Query:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP
        FNDFDPS+IAQFTE EF+TLKVN  QLLS+ KLRA+VENANQVLKIQQEFGSFSNYCWSFVNKKP RN++RY RQVPVKTPKAEFMSKDLM+RGFRCVGP
Subjt:  FNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGP

Query:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLT
        TVVYSFLQVSGIVNDHLVDCFRYQECD +VKD+MKLRVE+RRSELL  ALEK  LT
Subjt:  TVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSELLTGALEKPCLT

SwissProt top hits | e value | % identity | Alignment
P05100 DNA-3-methyladenine glycosylase 1 | 2.9e-34 | 35.92
Query:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEF
        +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W ++L KR+ +R+ FH                        FDP  +A   E + 
Subjt:  KRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEF

Query:  STLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHL
          L  +   +    K++A++ NA   L+++Q    F ++ WSFVN +P   +     ++P  T  ++ +SK L +RGF+ VG T+ YSF+Q  G+VNDH+
Subjt:  STLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHL

Query:  VDCFRY
        V C  Y
Subjt:  VDCFRY

P44321 DNA-3-methyladenine glycosylase | 5.0e-31 | 36.63
Query:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEFS
        RC W+   S  +YI +HD+EWG P  D +KLFE + L    A L+W ++L KR+ +R  FH                        FDP  IA+ T  +  
Subjt:  RCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEFS

Query:  TLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLV
            N   +    KL A+V+NA   L +++   +FS++ WSFVN KP  N     R VP KT  ++ +SK L +RGF  +G T  Y+F+Q  G+V+DHL 
Subjt:  TLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLV

Query:  DC
        DC
Subjt:  DC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] | 1.4e-36 | 38.65
Query:  RCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTEN
        RC W T   +    LY  +HD EWG P+H+DKKLFE LVL    A L+W +IL KR+ F                       R  F+DFDP  +A + E+
Subjt:  RCDWITPYSD---PLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTEN

Query:  EFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVND
        +   L  N   + +  K+ A + NA   + +Q+EFGSF  Y W FV  KP  N F     +P  TP ++ ++KDL +RGF+ VG T +Y+ +Q  G+VND
Subjt:  EFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVND

Query:  HLVDCFR
        HL  CF+
Subjt:  HLVDCFR

Arabidopsis top hits | e value | % identity | Alignment
AT1G15970.1 DNA glycosylase superfamily protein | 9.0e-68 | 43.7
Query:  MSVATKLQSHAKPVLESRAILGPGGNR-DREPEKPKCKQETLKKT------EKHNKALPVVS------------ESVVRDN-VSVGSSCSSDSFSSNYSA
        MSV  + +S      E R++LGP GN+  R+P   K ++  ++KT      EK  K     S             S++R N  S+ +S SSD+ SS  S+
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNR-DREPEKPKCKQETLKKT------EKHNKALPVVS------------ESVVRDN-VSVGSSCSSDSFSSNYSA

Query:  KLLNSKVKPYAVKPVKVVAVGGDANAT-----------TTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILS
         L  S     + K  KVV   G  ++T            +    +  RKRC WITP +DP Y+AFHDEEWGVPVHDDKKLFELL LS ALAEL+W  ILS
Subjt:  KLLNSKVKPYAVKPVKVVAVGGDANAT-----------TTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILS

Query:  KRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNK
        +R I                        R+VF DFDP  +A+  + + +      I LLSE K+R++++N+  V KI  E GS   Y W+FVN KPT+++
Subjt:  KRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNK

Query:  FRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        FRY RQVPVKT KAEF+SKDL+RRGFR V PTV+YSF+Q +G+ NDHL+ CFRYQ+C
Subjt:  FRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

AT1G75090.1 DNA glycosylase superfamily protein | 9.3e-89 | 51.87
Query:  MSVATKLQSHAKPVLESRAILGPGGNRDR--EPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVV
        MS+ +KL+S  KP+ ESRAIL   GNR +  + E  K  Q   + T+      P  + SV  D+ S  SS S  S  +  ++  + +  K   V+ +  V
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNRDR--EPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVV

Query:  AVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNAR
         V   A     SP++  P KRC WITP SDP+Y+ FHDEEWGVPV DDKKLFELLV SQALAE +WPSIL +RD F                       R
Subjt:  AVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNAR

Query:  KVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCV
        K+F +FDPS IAQFTE    +L+VNG  +LSE KLRA+VENA  VLK++QEFGSFSNYCW FVN KP RN +RY RQVPVK+PKAE++SKD+M+RGFRCV
Subjt:  KVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCV

Query:  GPTVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSEL
        GPTV+YSFLQ SGIVNDHL  CFRYQEC+   + E K    + + +L
Subjt:  GPTVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDEMKLRVEDRRSEL

AT1G80850.1 DNA glycosylase superfamily protein | 7.4e-70 | 44.19
Query:  MSVATKLQSHAKPVLESRAILGPGGNR------DREPEKPKC-KQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVK
        MS   +++S      E R++LGP GN+       +  +KP   K + L  TEK  +  P+    + R+ +S+ +S SSD+ SS  S+ L  +        
Subjt:  MSVATKLQSHAKPVLESRAILGPGGNR------DREPEKPKC-KQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVK

Query:  PVKVVAVGGDANATTTSPR-------------LSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHF
          +V+   G  +++++  R                 RKRC WITP SD  YIAFHDEEWGVPVHDDK+LFELL LS ALAEL+W  ILSKR +F      
Subjt:  PVKVVAVGGDANATTTSPR-------------LSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHF

Query:  SMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKT
                         R+VF DFDP  I++ T  + ++ ++    LLSE KLR+++ENANQV KI   FGSF  Y W+FVN+KPT+++FRY RQVPVKT
Subjt:  SMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKT

Query:  PKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDE
         KAE +SKDL+RRGFR V PTV+YSF+Q +G+ NDHL  CFR+ +C T  KDE
Subjt:  PKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDTNVKDE

AT5G57970.1 DNA glycosylase superfamily protein | 5.7e-70 | 52.77
Query:  NVSVGSSCSSDSFSSNYSAKLL------NSKVKPYAVKPVKVVAVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVL
        N S  S  S DSF S  S   L       S+ K Y  KP  VV+ G    A  + P  S  +KRC W+TP SDP YI FHDEEWGVPVHDDK+LFELLVL
Subjt:  NVSVGSSCSSDSFSSNYSAKLL------NSKVKPYAVKPVKVVAVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVL

Query:  SQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSN
        S ALAE TWP+ILSKR  F                       R+VF DFDP+ I +  E +          LLS+ KLRAV+ENA Q+LK+ +E+GSF  
Subjt:  SQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSN

Query:  YCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        Y WSFV  K   +KFRY RQVP KTPKAE +SKDL+RRGFR VGPTVVYSF+Q +GI NDHL  CFR+  C
Subjt:  YCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC

AT5G57970.2 DNA glycosylase superfamily protein | 5.7e-70 | 52.77
Query:  NVSVGSSCSSDSFSSNYSAKLL------NSKVKPYAVKPVKVVAVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVL
        N S  S  S DSF S  S   L       S+ K Y  KP  VV+ G    A  + P  S  +KRC W+TP SDP YI FHDEEWGVPVHDDK+LFELLVL
Subjt:  NVSVGSSCSSDSFSSNYSAKLL------NSKVKPYAVKPVKVVAVGGDANATTTSPRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVL

Query:  SQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSN
        S ALAE TWP+ILSKR  F                       R+VF DFDP+ I +  E +          LLS+ KLRAV+ENA Q+LK+ +E+GSF  
Subjt:  SQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEFSTLKVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSN

Query:  YCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC
        Y WSFV  K   +KFRY RQVP KTPKAE +SKDL+RRGFR VGPTVVYSF+Q +GI NDHL  CFR+  C
Subjt:  YCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQEC


Sequences
CDS sequence
ATGTCTGTGGCTACGAAGCTCCAATCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGAGCCTGAGAAGCCGAAATGCAA
ACAGGAGACCTTGAAGAAGACAGAGAAACACAATAAGGCGCTTCCGGTGGTTTCTGAATCGGTTGTTCGGGATAATGTCTCCGTCGGAAGTTCCTGCTCTTCCGATTCTT
TTTCAAGCAACTATTCGGCCAAATTGTTGAATTCTAAGGTGAAGCCCTACGCCGTGAAGCCTGTGAAGGTTGTTGCTGTCGGCGGAGACGCAAACGCAACTACAACGTCG
CCTAGGCTCTCGATTCCACGCAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTTCATGACGAAGAATGGGGAGTCCCAGTTCATGACGACAA
GAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAACTTACCTGGCCCTCGATTCTTAGCAAGAGAGATATATTTAGGTCTACTTTCCATTTTTCTATGTTCT
TTTATTGCCCAAAACTTAAACAAAAATCAGTTCCTAATGCCCGGAAAGTCTTCAATGATTTTGACCCATCTACCATCGCACAGTTCACAGAGAATGAGTTTTCAACGCTC
AAAGTAAATGGCATCCAGCTCCTCTCTGAACCAAAGCTTCGAGCAGTTGTGGAGAACGCTAATCAAGTACTCAAGATTCAACAGGAATTTGGTTCCTTCAGCAACTATTG
TTGGAGCTTCGTTAACAAGAAGCCTACAAGAAACAAATTTCGATATGCCCGTCAAGTGCCGGTAAAGACACCAAAAGCAGAATTCATGAGCAAAGATTTGATGAGGAGAG
GGTTCCGTTGCGTCGGGCCAACTGTGGTATATTCCTTCTTGCAAGTTAGCGGAATTGTTAATGATCACTTGGTTGATTGCTTCAGATATCAAGAGTGCGACACAAACGTA
AAAGATGAGATGAAACTAAGAGTAGAAGATAGGAGATCGGAGTTGCTTACCGGAGCTTTGGAGAAGCCTTGCTTGACTAGATCTTGA
mRNA sequence
ATGTCTGTGGCTACGAAGCTCCAATCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGAGCCTGAGAAGCCGAAATGCAA
ACAGGAGACCTTGAAGAAGACAGAGAAACACAATAAGGCGCTTCCGGTGGTTTCTGAATCGGTTGTTCGGGATAATGTCTCCGTCGGAAGTTCCTGCTCTTCCGATTCTT
TTTCAAGCAACTATTCGGCCAAATTGTTGAATTCTAAGGTGAAGCCCTACGCCGTGAAGCCTGTGAAGGTTGTTGCTGTCGGCGGAGACGCAAACGCAACTACAACGTCG
CCTAGGCTCTCGATTCCACGCAAACGCTGTGATTGGATAACGCCTTATTCTGACCCACTTTACATCGCTTTTCATGACGAAGAATGGGGAGTCCCAGTTCATGACGACAA
GAAGCTGTTTGAGTTACTTGTATTATCACAAGCCTTAGCAGAACTTACCTGGCCCTCGATTCTTAGCAAGAGAGATATATTTAGGTCTACTTTCCATTTTTCTATGTTCT
TTTATTGCCCAAAACTTAAACAAAAATCAGTTCCTAATGCCCGGAAAGTCTTCAATGATTTTGACCCATCTACCATCGCACAGTTCACAGAGAATGAGTTTTCAACGCTC
AAAGTAAATGGCATCCAGCTCCTCTCTGAACCAAAGCTTCGAGCAGTTGTGGAGAACGCTAATCAAGTACTCAAGATTCAACAGGAATTTGGTTCCTTCAGCAACTATTG
TTGGAGCTTCGTTAACAAGAAGCCTACAAGAAACAAATTTCGATATGCCCGTCAAGTGCCGGTAAAGACACCAAAAGCAGAATTCATGAGCAAAGATTTGATGAGGAGAG
GGTTCCGTTGCGTCGGGCCAACTGTGGTATATTCCTTCTTGCAAGTTAGCGGAATTGTTAATGATCACTTGGTTGATTGCTTCAGATATCAAGAGTGCGACACAAACGTA
AAAGATGAGATGAAACTAAGAGTAGAAGATAGGAGATCGGAGTTGCTTACCGGAGCTTTGGAGAAGCCTTGCTTGACTAGATCTTGA
Protein sequence
MSVATKLQSHAKPVLESRAILGPGGNRDREPEKPKCKQETLKKTEKHNKALPVVSESVVRDNVSVGSSCSSDSFSSNYSAKLLNSKVKPYAVKPVKVVAVGGDANATTTS
PRLSIPRKRCDWITPYSDPLYIAFHDEEWGVPVHDDKKLFELLVLSQALAELTWPSILSKRDIFRSTFHFSMFFYCPKLKQKSVPNARKVFNDFDPSTIAQFTENEFSTL
KVNGIQLLSEPKLRAVVENANQVLKIQQEFGSFSNYCWSFVNKKPTRNKFRYARQVPVKTPKAEFMSKDLMRRGFRCVGPTVVYSFLQVSGIVNDHLVDCFRYQECDTNV
KDEMKLRVEDRRSELLTGALEKPCLTRS
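
The relationship between the CDS and the protein sequence listed above can be checked with a short translation sketch. The Python below builds the standard genetic code (NCBI translation table 1) and translates a nucleotide string codon by codon, stopping at the first stop codon. It is an illustration only: the assumption that this record uses the standard nuclear code, and the helper name `translate`, are mine and are not stated on the page. The example input is the first 39 bases of the CDS.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 39 bases of the CDS sequence from this record.
print(translate("ATGTCTGTGGCTACGAAGCTCCAATCGCACGCTAAACCG"))
```

Passing the full CDS (with its line breaks, which the function strips) should reproduce the protein sequence if the standard-code assumption holds.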