; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G016410 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G016410
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationchr04:23889720..23893566
RNA-Seq ExpressionLsi04G016410
SyntenyLsi04G016410
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591330.1 hypothetical protein SDJN03_13676, partial [Cucurbita argyrosperma subsp. sororia]3.7e-13274.55Show/hide
Query:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA
        L SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK  EKQNKALP I ESVIRDN+S+GSSCSSDSLSSNYS KL NPKVKP  VKPVKAVA GGDPN 
Subjt:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA

Query:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS
        T T+PRLS+PGKRC WIT YSDPLYIAFHD+EWGVPVHDD+KLFELLVLSQALAELTWPLIL KRD+FRKV NDFDPS+I+QFT+NEFT+LK NGIQLLS
Subjt:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS

Query:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV
        EPKLRAIVENANQVLK   +                            T   +  + V    I    FRY RQVPVKTPKAEFMSKDL+RRGFRCVGPTV
Subjt:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV

Query:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKL
        VYSFMQV GIVNDHLVNCFRYQECD + KDD  +
Subjt:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKL

XP_022141169.1 uncharacterized protein LOC111011623 [Momordica charantia]6.2e-13573.39Show/hide
Query:  QSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA
        +SHAKPVLESRAILGPGGNRDR PEKP+CK E TL K EKQNKALP + +SVIRDNVSVGSSCSSDSLSSNYSAKL NPKVKPY+VKPVKAVA G + +A
Subjt:  QSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA

Query:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS
        T TSPR S+P KRCDWIT YSDPLYIAFHD+EWGVP+HDDKKLFELLVLSQALAELTWP ILSKRD+FRKV NDFDPSSI++FTENEF +LKVNGIQ+L+
Subjt:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS

Query:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV
        EPKLRAIVENANQVLK   +                    S   +C   +     N  P+       FRY RQVPVKTPKAE MSKDLIRRGFRCVGPTV
Subjt:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV

Query:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTRS
        VYSFMQV GIVNDHLVNCFRYQECDA +KDD K RVED R E L  GA EKPCL+RS
Subjt:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTRS

XP_022935907.1 uncharacterized protein LOC111442674 [Cucurbita moschata]2.5e-13674.86Show/hide
Query:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA
        L SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK  EKQNKALP I ESV+RDN+S+GSSCSSDSLSSNYS KL NPKVKP  VKPVKAVA GGDPN 
Subjt:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA

Query:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS
        T T+PRLS+PGKRC WIT YSDPLYIAFHD+EWGVPVHDD+KLFELLVLSQALAELTWPLIL KRD+FRKV NDFDPS+I+QFT+NEFT+LK NGIQLLS
Subjt:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS

Query:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV
        EPKLRAIVENANQVLK   +                            T   +  + V    I    FRY RQVPVKTPKAEFMSKDL+RRGFRCVGPTV
Subjt:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV

Query:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALE
        VYSFMQV GIVNDHLVNCFRYQECD       KLRVEDQRSE LLTGALE
Subjt:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALE

XP_023535246.1 uncharacterized protein LOC111796735 [Cucurbita pepo subsp. pepo]3.7e-13273.43Show/hide
Query:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA
        L SHAKPVLESR ILGPGGNRDRAPEKPKCKQETLK  EKQNKALP I ESVIRDN S+GSSCSSDSL S+YS KL NPKVKP  VKPVKAVA GGDPN 
Subjt:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA

Query:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS
        T+T+P LS+PGKRC WIT YSDPLYIAFHD+EWGVPVHDD+KLFELLVLSQALAELTWPLIL KRD+FRKV NDFDPS+I++FT+NEFT+LK NGIQLLS
Subjt:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS

Query:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV
        EPKLRAIVENANQVLK   +                            T   +  + V    I    FRY RQVPVKTPKAEFMSKDL+RRGFRCVGPTV
Subjt:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV

Query:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALE
        VYSFMQV GIVNDHLVNCFRYQECD       KLRVE QRSE LLTGALE
Subjt:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALE

XP_038905518.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida]8.3e-14879.49Show/hide
Query:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA
        LQSHAKPVLESR ILGPGGNRDRAPEKPKCKQ+TLKK EKQN+ALP+ISESVIRDNVSVGSSCSSDS+SSNYSAKL  PKVKP +VKPVKAVA GGD NA
Subjt:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA

Query:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS
        T  SP LSLPGKRCDWITL+SDPLYIAFHD+EWGVPVHDDKKLFELLVLSQALAELTWPLILSKRD+FRKVLNDFDPS+I+QFTENEFT+LKVN IQLLS
Subjt:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS

Query:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV
        EPKLRAIVENANQVLK   +                    S   +C   +     N  P+       FRY RQVPVKTPKAEFMSKDLIRRGFRCVGPTV
Subjt:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV

Query:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTR
        VYSFMQVAGIVNDHLVNCFRYQECDAKIKDD KLRVED+RSES LTGALEKPCLTR
Subjt:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTR

TrEMBL top hitse value%identityAlignment
A0A0A0LG22 Uncharacterized protein2.4e-13274.71Show/hide
Query:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA
        LQSH KP LE RAILGPGGNRDRAP+ PKCK ETLKK EKQ+KALP ISESVIRDNVSVGSSCSSDSLSSNYSAKL    +KPY+VKPV A   GGD NA
Subjt:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA

Query:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS
        T TSP LSLPGKRCDWITL+SDPLYIAFHD+EWGVP+HDDKKLFELLVLSQALAELTWPLILSKRD+FRKVLNDFDPSSI+QFTENEFT+LKVNGIQLLS
Subjt:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS

Query:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV
        EPKLRAIV+NANQVLK      ++     ++    ++  + +R          NR+            RY RQVPVKTPKAEFMSKD+IRRGFRCVGPTV
Subjt:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV

Query:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQR
        VYSFMQVAGIVNDHLV+CFRY+ECD K+KDD KLRVED+R
Subjt:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQR

A0A6J1CHU1 uncharacterized protein LOC1110116233.0e-13573.39Show/hide
Query:  QSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA
        +SHAKPVLESRAILGPGGNRDR PEKP+CK E TL K EKQNKALP + +SVIRDNVSVGSSCSSDSLSSNYSAKL NPKVKPY+VKPVKAVA G + +A
Subjt:  QSHAKPVLESRAILGPGGNRDRAPEKPKCKQE-TLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA

Query:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS
        T TSPR S+P KRCDWIT YSDPLYIAFHD+EWGVP+HDDKKLFELLVLSQALAELTWP ILSKRD+FRKV NDFDPSSI++FTENEF +LKVNGIQ+L+
Subjt:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS

Query:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV
        EPKLRAIVENANQVLK   +                    S   +C   +     N  P+       FRY RQVPVKTPKAE MSKDLIRRGFRCVGPTV
Subjt:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV

Query:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTRS
        VYSFMQV GIVNDHLVNCFRYQECDA +KDD K RVED R E L  GA EKPCL+RS
Subjt:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLTRS

A0A6J1F6S0 uncharacterized protein LOC1114426741.2e-13674.86Show/hide
Query:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA
        L SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK  EKQNKALP I ESV+RDN+S+GSSCSSDSLSSNYS KL NPKVKP  VKPVKAVA GGDPN 
Subjt:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA

Query:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS
        T T+PRLS+PGKRC WIT YSDPLYIAFHD+EWGVPVHDD+KLFELLVLSQALAELTWPLIL KRD+FRKV NDFDPS+I+QFT+NEFT+LK NGIQLLS
Subjt:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS

Query:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV
        EPKLRAIVENANQVLK   +                            T   +  + V    I    FRY RQVPVKTPKAEFMSKDL+RRGFRCVGPTV
Subjt:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV

Query:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALE
        VYSFMQV GIVNDHLVNCFRYQECD       KLRVEDQRSE LLTGALE
Subjt:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALE

A0A6J1FIT9 uncharacterized protein LOC1114461254.5e-13172.68Show/hide
Query:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA
        LQSHA+PVLESRAILGPGGNRDRAPEKPKCKQE LK+  KQNKALPV+SESV+RDNVSVGSSCSSDSLSSNYSAKL N K KP   KPVK VA GGD NA
Subjt:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA

Query:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS
        T TSP LS+ GKRCDWIT YSDPLYIAFHD+EWGVPVHDDKKLFELLVLSQALAELTWPLILSKR +FRKV NDFDPSSI+ FTE EFT+LKVN  QLLS
Subjt:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS

Query:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV
        + KLRAIVENANQVLK   +                    S   +C   +     N  P+       +RY RQVPVKTPKAEFMSKDL++RGFRCVGPTV
Subjt:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV

Query:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLT
        VYSF+QV+GIVNDHLV+CFRYQECDA +KDD KLRVE++RSE LL  ALEK  LT
Subjt:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALEKPCLT

A0A6J1IHE9 uncharacterized protein LOC1114769753.4e-13173.43Show/hide
Query:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA
        L SHAKPVLESRAILGPGGNRDRAPEKPKCKQETLK  EKQNKALP I ESVIRDN+S+GSSCSSDSLSSN SAKL NPK     VKPVKAVA GGDPN 
Subjt:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNA

Query:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS
        T T+PRLS+PGKRC WIT YSDPLYIAFHD+EWGVPVHDD+KLFELLVLSQALAELTWPLIL KRD+FRKV NDFDPS+I+QFT+NEFT+LK NGIQLLS
Subjt:  TATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLS

Query:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV
        EPKLRAIVENANQVLK   +                            T   +  + V    I    FRY RQ+PVKTPKAEFMSKDL+RRGFRCVGPTV
Subjt:  EPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTV

Query:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALE
        VYSFMQV GIVNDHLV+CFRYQECD       KLRVEDQ SE LLTGALE
Subjt:  VYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTKLRVEDQRSESLLTGALE

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 13.8e-2633.81Show/hide
Query:  KRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENA
        +RC W++   DPLYIA+HD EWGVP  D KKLFE++ L    A L+W  +L KR+ +R   + FDP  ++   E +   L  +   +    K++AI+ NA
Subjt:  KRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENA

Query:  NQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIV
           L+     +                   + F   F  H+      P  T    L     ++P  T  ++ +SK L +RGF+ VG T+ YSFMQ  G+V
Subjt:  NQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIV

Query:  NDHLVNCFRY
        NDH+V C  Y
Subjt:  NDHLVNCFRY

P44321 DNA-3-methyladenine glycosylase5.6e-2233.5Show/hide
Query:  RCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENAN
        RC W+   S  +YI +HDKEWG P  D +KLFE + L    A L+W  +L KR+ +R+  + FDP  I++ T  +  +   N   +    KL AIV+NA 
Subjt:  RCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENAN

Query:  QVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVN
          L                  ++        F   F  H+   N VP            R VP KT  ++ +SK L +RGF  +G T  Y+FMQ  G+V+
Subjt:  QVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVN

Query:  DHLVNC
        DHL +C
Subjt:  DHLVNC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]5.4e-2534.12Show/hide
Query:  RCDWITLYSD---PLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVE
        RC W T   +    LY  +HD EWG P+H+DKKLFE LVL    A L+W  IL KR+ FR   +DFDP  ++ + E++   L  N   + +  K+ A + 
Subjt:  RCDWITLYSD---PLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVE

Query:  NANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAG
        NA    K  M  Q+                         +  ++    V    I+ A F     +P  TP ++ ++KDL +RGF+ VG T +Y+ MQ  G
Subjt:  NANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAG

Query:  IVNDHLVNCFR
        +VNDHL +CF+
Subjt:  IVNDHLVNCFR

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein3.6e-5639.34Show/hide
Query:  ESRAILGPGGNR-DRAP-----EKPKCKQETL-KKVEKQNKALPVIS------------ESVIRDNVS---------VGSSCSSDSLSSNYSAKLSNPKV
        E R++LGP GN+  R P     EKP  ++  +  K EK  K     S             S++R N +           SSC S  LS   S+       
Subjt:  ESRAILGPGGNR-DRAP-----EKPKCKQETL-KKVEKQNKALPVIS------------ESVIRDNVS---------VGSSCSSDSLSSNYSAKLSNPKV

Query:  KPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSIS
        +  SV   + ++VG +     +    +   KRC WIT  +DP Y+AFHD+EWGVPVHDDKKLFELL LS ALAEL+W  ILS+R + R+V  DFDP +++
Subjt:  KPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSIS

Query:  QFTENEFTSLKVNGIQLLSEPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKA
        +  + + T+     I LLSE K+R+I++N+  V         R+++                     +L ++  N V       + FRY RQVPVKT KA
Subjt:  QFTENEFTSLKVNGIQLLSEPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKA

Query:  EFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC--DAKIKDDTKLRVEDQR
        EF+SKDL+RRGFR V PTV+YSFMQ AG+ NDHL+ CFRYQ+C  DA+    TK + +++R
Subjt:  EFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC--DAKIKDDTKLRVEDQR

AT1G75090.1 DNA glycosylase superfamily protein2.2e-6947.2Show/hide
Query:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKAL--PVISESVIRDNVSVGSSCSS-DSLSSNYSAKLSNPKVKPYSVK---PVKAVAV
        L+S  KP+ ESRAIL   GNR +  +    K+  L     ++ A   P  + SV  D+ S  SS S   S+++  S K++ P  +    K    V +VAV
Subjt:  LQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKAL--PVISESVIRDNVSVGSSCSS-DSLSSNYSAKLSNPKVKPYSVK---PVKAVAV

Query:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN
          D      SP++  P KRC WIT  SDP+Y+ FHD+EWGVPV DDKKLFELLV SQALAE +WP IL +RD FRK+  +FDPS+I+QFTE    SL+VN
Subjt:  GGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVN

Query:  GIQLLSEPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFR
        G  +LSE KLRAIVENA  VLK                 +K        +   F  H+  RN           +RY RQVPVK+PKAE++SKD+++RGFR
Subjt:  GIQLLSEPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFR

Query:  CVGPTVVYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTK
        CVGPTV+YSF+Q +GIVNDHL  CFRYQEC+ + + +TK
Subjt:  CVGPTVVYSFMQVAGIVNDHLVNCFRYQECDAKIKDDTK

AT1G80850.1 DNA glycosylase superfamily protein9.4e-5742.73Show/hide
Query:  ESRAILGPGGNR------DRAPEKPKC-KQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNATA
        E R++LGP GN+       +  +KP   K + L   EK  +  P+    + R+ +S+ +S SSD+ SS  S+ LS       S    K V       +++
Subjt:  ESRAILGPGGNR------DRAPEKPKC-KQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNATA

Query:  TSPRLSLP--------------GKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEF
        +S R +L                KRC WIT  SD  YIAFHD+EWGVPVHDDK+LFELL LS ALAEL+W  ILSKR LFR+V  DFDP +IS+ T  + 
Subjt:  TSPRLSLP--------------GKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEF

Query:  TSLKVNGIQLLSEPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDL
        TS ++    LLSE KLR+I+ENANQV K                                +  ++  N V       + FRY RQVPVKT KAE +SKDL
Subjt:  TSLKVNGIQLLSEPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDL

Query:  IRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQECDAKIKDDT
        +RRGFR V PTV+YSFMQ AG+ NDHL  CFR+ +C    KD+T
Subjt:  IRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQECDAKIKDDT

AT5G57970.1 DNA glycosylase superfamily protein3.2e-5745.26Show/hide
Query:  ESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKV----------KPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHD
        E  +  N+S+ +S SSD+   ++ ++ S  ++          K Y  KP   V+ G    A  + P  S   KRC W+T  SDP YI FHD+EWGVPVHD
Subjt:  ESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKV----------KPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHD

Query:  DKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKA
        DK+LFELLVLS ALAE TWP ILSKR  FR+V  DFDP++I +  E +          LLS+ KLRA++ENA Q+LK   +                   
Subjt:  DKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKA

Query:  ESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC
                 +  ++  + V    IV + FRY RQVP KTPKAE +SKDL+RRGFR VGPTVVYSFMQ AGI NDHL +CFR+  C
Subjt:  ESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC

AT5G57970.2 DNA glycosylase superfamily protein3.2e-5745.26Show/hide
Query:  ESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKV----------KPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHD
        E  +  N+S+ +S SSD+   ++ ++ S  ++          K Y  KP   V+ G    A  + P  S   KRC W+T  SDP YI FHD+EWGVPVHD
Subjt:  ESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKV----------KPYSVKPVKAVAVGGDPNATATSPRLSLPGKRCDWITLYSDPLYIAFHDKEWGVPVHD

Query:  DKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKA
        DK+LFELLVLS ALAE TWP ILSKR  FR+V  DFDP++I +  E +          LLS+ KLRA++ENA Q+LK   +                   
Subjt:  DKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQVLKPTMQPQKRRLVVSTSIPIKWLKA

Query:  ESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC
                 +  ++  + V    IV + FRY RQVP KTPKAE +SKDL+RRGFR VGPTVVYSFMQ AGI NDHL +CFR+  C
Subjt:  ESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCTCCAATCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCAGAGAAGCCAAAATGTAAACAGGAGACTTT
GAAGAAGGTAGAGAAGCAGAACAAGGCACTTCCGGTGATTTCTGAATCGGTTATTCGAGACAATGTCTCCGTCGGGAGCTCCTGCTCTTCCGATTCTTTATCAAGCAATT
ATTCGGCCAAATTGTCGAATCCCAAAGTGAAGCCCTACTCTGTGAAGCCTGTGAAGGCTGTTGCTGTCGGCGGTGACCCAAACGCTACTGCAACGTCGCCTAGGCTCTCG
CTTCCGGGGAAACGTTGTGATTGGATAACGCTTTATTCGGACCCACTTTACATCGCTTTTCATGACAAAGAATGGGGAGTCCCAGTTCATGACGACAAGAAGCTATTTGA
GTTACTTGTATTATCACAAGCCTTAGCAGAACTTACTTGGCCCTTGATTCTTAGCAAGAGAGATCTATTTAGGAAAGTTTTGAATGACTTTGACCCATCTTCCATCTCAC
AGTTCACCGAGAATGAGTTTACATCACTAAAAGTAAATGGCATCCAGCTCCTGTCTGAACCAAAGCTTCGTGCAATCGTTGAGAACGCTAATCAAGTACTCAAGCCAACA
ATGCAACCCCAAAAAAGGCGTTTGGTTGTCTCAACTTCCATCCCTATCAAGTGGTTGAAAGCTGAATCTCTACGTTTTCATTGCCTTTTTACCCTACACAGATTCAACAG
GAATTTGGTTCCTTTAGCAACTATTGTTGGAGCTTTATTTCGTTACACTCGTCAAGTACCTGTAAAGACGCCAAAAGCAGAGTTCATGAGCAAGGATTTGATCAGGAGAG
GATTTCGTTGCGTCGGGCCAACTGTGGTTTATTCCTTCATGCAGGTTGCTGGAATTGTTAACGATCACTTGGTCAATTGCTTCAGATATCAAGAGTGTGATGCAAAGATA
AAAGACGATACGAAACTAAGAGTAGAAGATCAACGATCGGAGTCGTTGCTTACCGGAGCTCTTGAGAAGCCTTGCTTGACTAGATCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCTCCAATCGCACGCTAAACCGGTTTTGGAGTCCCGAGCAATTCTTGGACCCGGCGGGAACAGAGATAGGGCGCCAGAGAAGCCAAAATGTAAACAGGAGACTTT
GAAGAAGGTAGAGAAGCAGAACAAGGCACTTCCGGTGATTTCTGAATCGGTTATTCGAGACAATGTCTCCGTCGGGAGCTCCTGCTCTTCCGATTCTTTATCAAGCAATT
ATTCGGCCAAATTGTCGAATCCCAAAGTGAAGCCCTACTCTGTGAAGCCTGTGAAGGCTGTTGCTGTCGGCGGTGACCCAAACGCTACTGCAACGTCGCCTAGGCTCTCG
CTTCCGGGGAAACGTTGTGATTGGATAACGCTTTATTCGGACCCACTTTACATCGCTTTTCATGACAAAGAATGGGGAGTCCCAGTTCATGACGACAAGAAGCTATTTGA
GTTACTTGTATTATCACAAGCCTTAGCAGAACTTACTTGGCCCTTGATTCTTAGCAAGAGAGATCTATTTAGGAAAGTTTTGAATGACTTTGACCCATCTTCCATCTCAC
AGTTCACCGAGAATGAGTTTACATCACTAAAAGTAAATGGCATCCAGCTCCTGTCTGAACCAAAGCTTCGTGCAATCGTTGAGAACGCTAATCAAGTACTCAAGCCAACA
ATGCAACCCCAAAAAAGGCGTTTGGTTGTCTCAACTTCCATCCCTATCAAGTGGTTGAAAGCTGAATCTCTACGTTTTCATTGCCTTTTTACCCTACACAGATTCAACAG
GAATTTGGTTCCTTTAGCAACTATTGTTGGAGCTTTATTTCGTTACACTCGTCAAGTACCTGTAAAGACGCCAAAAGCAGAGTTCATGAGCAAGGATTTGATCAGGAGAG
GATTTCGTTGCGTCGGGCCAACTGTGGTTTATTCCTTCATGCAGGTTGCTGGAATTGTTAACGATCACTTGGTCAATTGCTTCAGATATCAAGAGTGTGATGCAAAGATA
AAAGACGATACGAAACTAAGAGTAGAAGATCAACGATCGGAGTCGTTGCTTACCGGAGCTCTTGAGAAGCCTTGCTTGACTAGATCCTGACATGTTCGTAGAAAAAAGGT
TATCTCTAACTAATCTTACAAATCACTGTAATAAAATGGTTCTGATGTTCTGATGCTGAGTAAGTTTGAGTTCGTTTGTTTAGAATTTATAATTTCTTATTTGTAGATGA
TGTATTAACATTGATGTTATAATGTAATTGTAATTTGTATTTACAAACAACAAACACCTTTGTTTCCTGTAATTATTTCTCAAACTGCAGAACATGCTTCCATTCCATGG
TAGAACAGAAGAAAGAGGTAATGTTTCTTTAATCCAACATTTATGATTTTGAAGTTGAAGTTTTCATAAGATCAAAGGTTTTATTTTC
Protein sequenceShow/hide protein sequence
MSLQSHAKPVLESRAILGPGGNRDRAPEKPKCKQETLKKVEKQNKALPVISESVIRDNVSVGSSCSSDSLSSNYSAKLSNPKVKPYSVKPVKAVAVGGDPNATATSPRLS
LPGKRCDWITLYSDPLYIAFHDKEWGVPVHDDKKLFELLVLSQALAELTWPLILSKRDLFRKVLNDFDPSSISQFTENEFTSLKVNGIQLLSEPKLRAIVENANQVLKPT
MQPQKRRLVVSTSIPIKWLKAESLRFHCLFTLHRFNRNLVPLATIVGALFRYTRQVPVKTPKAEFMSKDLIRRGFRCVGPTVVYSFMQVAGIVNDHLVNCFRYQECDAKI
KDDTKLRVEDQRSESLLTGALEKPCLTRS