CuGenDBv2

Lag0040205 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0040205
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DNA glycosylase superfamily protein
Genome location: chr13:2885284..2886933
RNA-Seq Expression: Lag0040205
Synteny: Lag0040205
Gene Ontology terms:
    GO:0006284 - base-excision repair (biological process)
    GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
    IPR005019 - Methyladenine glycosylase
    IPR011257 - DNA glycosylase
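
The genome location above is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention) and using a hypothetical helper named `parse_location`, the locus span could be derived like this:

```python
import re

def parse_location(location):
    """Split a 'chr13:2885284..2886933' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr13:2885284..2886933")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

If the coordinates were instead 0-based or half-open, the `+ 1` would need to be dropped.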


Homology
GenBank top hits (e value, % identity, alignment)
XP_004135425.1 uncharacterized protein LOC101218195 [Cucumis sativus] | e value: 1.2e-163 | % identity: 93.83
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ
        MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SS+SLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPERREVP+PKSIQ
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ

Query:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV
          SQEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF+EAFAGFEPS VANMGEKEITD+ASDKAIMLV
Subjt:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL
        ESRVRCIVDNAKCILKIA+DFGSFSNYMWSY+NFKPTINRFR+PRNVPLR+PKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

XP_008446481.2 PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo] | e value: 7.8e-163 | % identity: 94.16
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ
        MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPERREVP+PKSIQ
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ

Query:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV
          SQEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF+EAFAGFEPS VANMGEKEITDIASDKAIMLV
Subjt:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL
        ESRVRCIVDNAKCILKIA+DFGSFSNYMWS +NFKPTINRFR+PRNVPLR+PKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

XP_022149074.1 uncharacterized protein LOC111017575 [Momordica charantia] | e value: 2.7e-163 | % identity: 94.48
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ
        MSSKATVRRQ+LERQTCPKEKDRTSQNILSK LKKIYPIGLQR+SSS SFSSLSLSLSQNSNDSSLTDSS QLDQKISYAIRLI  PPERREVPI KSIQ
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ

Query:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV
          SQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF+EAFAGFEPSTVANMGEKEI DIASDKAIMLV
Subjt:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL
        ESRVRCIVDNAKCILKIA+DFGSFSNYMWSY+NFKPTINR+RYPRNVPLR+PKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRH+ECVNL
Subjt:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

XP_022993235.1 uncharacterized protein LOC111489316 [Cucurbita maxima] | e value: 1.3e-162 | % identity: 93.87
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT--SPPERREVPIPKS
        MSSKATVRR+ILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SSLSLSLSQNSNDSSLTDSSIQLD+KISYAIRLIT   PPERRE P+PKS
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT--SPPERREVPIPKS

Query:  IQPPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIM
        +Q   QELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF+EAFAGFE STVANMGEKEI+DIASDKAIM
Subjt:  IQPPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIM

Query:  LVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECV
        LVESRVRCIVDNAKCILKIA+DFGSFSNYMWSYMNFKPTINRFRYPRNVPLR+PKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRH ECV
Subjt:  LVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

XP_038892395.1 DNA-3-methyladenine glycosylase 1 [Benincasa hispida] | e value: 7.8e-163 | % identity: 94.16
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ
        MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SSLSLSLSQNSNDSSLTDSSIQ DQKISYAIRLIT PPERR+VP+PK+IQ
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ

Query:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV
          SQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF+EAF+GFEPS VANMGE EITDIASDKAIMLV
Subjt:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL
        ESRVRCIVDNAKCILKIA+DFGSFSNYMWSYMNFKPTINRFR+PRNVPLR+PKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRH++CVNL
Subjt:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KUC5 Uncharacterized protein | e value: 5.9e-164 | % identity: 93.83
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ
        MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SS+SLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPERREVP+PKSIQ
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ

Query:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV
          SQEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF+EAFAGFEPS VANMGEKEITD+ASDKAIMLV
Subjt:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL
        ESRVRCIVDNAKCILKIA+DFGSFSNYMWSY+NFKPTINRFR+PRNVPLR+PKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A1S3BEN5 DNA-3-methyladenine glycosylase 1 | e value: 3.8e-163 | % identity: 94.16
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ
        MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPERREVP+PKSIQ
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ

Query:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV
          SQEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF+EAFAGFEPS VANMGEKEITDIASDKAIMLV
Subjt:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL
        ESRVRCIVDNAKCILKIA+DFGSFSNYMWS +NFKPTINRFR+PRNVPLR+PKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A5D3CCU6 DNA-3-methyladenine glycosylase 1 | e value: 3.8e-163 | % identity: 94.16
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ
        MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPERREVP+PKSIQ
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ

Query:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV
          SQEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF+EAFAGFEPS VANMGEKEITDIASDKAIMLV
Subjt:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL
        ESRVRCIVDNAKCILKIA+DFGSFSNYMWS +NFKPTINRFR+PRNVPLR+PKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A6J1D5X7 uncharacterized protein LOC111017575 | e value: 1.3e-163 | % identity: 94.48
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ
        MSSKATVRRQ+LERQTCPKEKDRTSQNILSK LKKIYPIGLQR+SSS SFSSLSLSLSQNSNDSSLTDSS QLDQKISYAIRLI  PPERREVPI KSIQ
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQ

Query:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV
          SQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF+EAFAGFEPSTVANMGEKEI DIASDKAIMLV
Subjt:  PPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL
        ESRVRCIVDNAKCILKIA+DFGSFSNYMWSY+NFKPTINR+RYPRNVPLR+PKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRH+ECVNL
Subjt:  ESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A6J1JY14 uncharacterized protein LOC111489316 | e value: 6.5e-163 | % identity: 93.87
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT--SPPERREVPIPKS
        MSSKATVRR+ILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SSLSLSLSQNSNDSSLTDSSIQLD+KISYAIRLIT   PPERRE P+PKS
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT--SPPERREVPIPKS

Query:  IQPPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIM
        +Q   QELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF+EAFAGFE STVANMGEKEI+DIASDKAIM
Subjt:  IQPPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIM

Query:  LVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECV
        LVESRVRCIVDNAKCILKIA+DFGSFSNYMWSYMNFKPTINRFRYPRNVPLR+PKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRH ECV
Subjt:  LVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

SwissProt top hits (e value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 | e value: 1.4e-34 | % identity: 35.36
Query:  LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDN
        + RC W+  + D  Y+++HD  WGVP  D  +LFE++ L G     +W  ++K+RE ++  F  F+P  VA M E+++  +  D  I+    +++ I+ N
Subjt:  LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDN

Query:  AKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
        A+  L++ ++   F +++WS++N +P + +      +P  T  ++A+SK + KRGF+FVG  I YSFMQA GL  DH+V C
Subjt:  AKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

P44321 DNA-3-methyladenine glycosylase | e value: 2.1e-33 | % identity: 37.43
Query:  RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAK
        RC W+   S   Y+ +HD+ WG P +D  +LFE + L G     +W  ++K+RE ++EAF  F+P  +A M   +I     +  ++   +++  IV NAK
Subjt:  RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAK

Query:  CILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
          L + K   +FS+++WS++N KP +N     R+VP +T  ++A+SK + KRGF F+G    Y+FMQ+ GL  DHL DC
Subjt:  CILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] | e value: 3.3e-39 | % identity: 36.02
Query:  IPKSIQPPSQELGDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEIT
        + KS+   +Q+  +G  E  RC W T   +   K Y  +HD  WG P+++D +LFE L L G     +W  I+K+RE F+ AF  F+P  VAN  E +I 
Subjt:  IPKSIQPPSQELGDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEIT

Query:  DIASDKAIMLVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLV
        ++  ++ I+   +++   + NAK  + + ++FGSF  Y+W ++  KP IN F    ++P  TP ++ I+KD+ KRGF+FVG   +Y+ MQ+ G+  DHL 
Subjt:  DIASDKAIMLVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLV

Query:  DCFRHNECVNL
         CF+ N  + +
Subjt:  DCFRHNECVNL

Arabidopsis top hits (e value, % identity, alignment)
AT1G13635.1 DNA glycosylase superfamily protein | e value: 1.6e-113 | % identity: 67.43
Query:  RRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQPPSQE
        R++I+E+    +EK+ + + N  +KHLK+IYPI LQR TSSS S SS+SLSLSQNS DS  TDS+  L+QKIS A+ LI+S P RRE+ +PKSI  P Q 
Subjt:  RRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQPPSQE

Query:  LGD----GELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVE
          D     E +RCNWIT  SD+ YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E F+EAF  F+P+ VA MGEKEI +IAS+KAIML E
Subjt:  LGD----GELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVE

Query:  SRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNLA
        SRVRCIVDNAKCI K+  +FGSFS+++W +M++KP IN+F+Y RNVPLR+PKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRH +CV+LA
Subjt:  SRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNLA

Query:  ERPWRHI
        ERPWRHI
Subjt:  ERPWRHI

AT1G13635.2 DNA glycosylase superfamily protein | e value: 1.6e-113 | % identity: 67.43
Query:  RRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQPPSQE
        R++I+E+    +EK+ + + N  +KHLK+IYPI LQR TSSS S SS+SLSLSQNS DS  TDS+  L+QKIS A+ LI+S P RRE+ +PKSI  P Q 
Subjt:  RRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQPPSQE

Query:  LGD----GELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVE
          D     E +RCNWIT  SD+ YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E F+EAF  F+P+ VA MGEKEI +IAS+KAIML E
Subjt:  LGD----GELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVE

Query:  SRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNLA
        SRVRCIVDNAKCI K+  +FGSFS+++W +M++KP IN+F+Y RNVPLR+PKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRH +CV+LA
Subjt:  SRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNLA

Query:  ERPWRHI
        ERPWRHI
Subjt:  ERPWRHI

AT1G15970.1 DNA glycosylase superfamily protein | e value: 9.5e-58 | % identity: 43.68
Query:  SSSLSFSSLSLSLSQNSNDSSLTDS---SIQLDQKISYAIRLITSPPERREVPIPKSIQPPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDN
        SS L  +S S++ S +S+ SS  +S   S+         +R   S    R++ + K  +  S +      +RC WIT  +D  YV+FHDE WGVPV+DD 
Subjt:  SSSLSFSSLSLSLSQNSNDSSLTDS---SIQLDQKISYAIRLITSPPERREVPIPKSIQPPSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDN

Query:  RLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRF
        +LFELL LSG L + +WT+I+ RR + +E F  F+P  VA + +K++T   +    +L E ++R I+DN++ + KI  + GS   YMW+++N KPT ++F
Subjt:  RLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFKPTINRF

Query:  RYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNLAE
        RY R VP++T KAE ISKD+V+RGFR V P ++YSFMQAAGLT DHL+ CFR+ +C   AE
Subjt:  RYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNLAE

AT5G57970.1 DNA glycosylase superfamily protein | e value: 6.6e-59 | % identity: 43.66
Query:  LQRTSSSLSFSSLSLSLSQNSNDSSLTD-----SSIQLDQKISYAIRLITSPPERREVPIPKSIQPPSQELGDGELRRCNWITHTSDKAYVSFHDECWGV
        L+R   +L+ S+LSL+ S  S+D+S+       S+ +L +  S   R  + P + R V    ++  P    G    +RC W+T  SD  Y+ FHDE WGV
Subjt:  LQRTSSSLSFSSLSLSLSQNSNDSSLTD-----SSIQLDQKISYAIRLITSPPERREVPIPKSIQPPSQELGDGELRRCNWITHTSDKAYVSFHDECWGV

Query:  PVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFK
        PV+DD RLFELL LSG L ++ W  I+ +R+ F+E FA F+P+ +  + EK+I    S  + +L + ++R +++NA+ ILK+ +++GSF  Y+WS++  K
Subjt:  PVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFK

Query:  PTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNLAER
          +++FRY R VP +TPKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  CFR + C+   ER
Subjt:  PTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNLAER

AT5G57970.2 DNA glycosylase superfamily protein | e value: 6.6e-59 | % identity: 43.66
Query:  LQRTSSSLSFSSLSLSLSQNSNDSSLTD-----SSIQLDQKISYAIRLITSPPERREVPIPKSIQPPSQELGDGELRRCNWITHTSDKAYVSFHDECWGV
        L+R   +L+ S+LSL+ S  S+D+S+       S+ +L +  S   R  + P + R V    ++  P    G    +RC W+T  SD  Y+ FHDE WGV
Subjt:  LQRTSSSLSFSSLSLSLSQNSNDSSLTD-----SSIQLDQKISYAIRLITSPPERREVPIPKSIQPPSQELGDGELRRCNWITHTSDKAYVSFHDECWGV

Query:  PVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFK
        PV+DD RLFELL LSG L ++ W  I+ +R+ F+E FA F+P+ +  + EK+I    S  + +L + ++R +++NA+ ILK+ +++GSF  Y+WS++  K
Subjt:  PVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIAKDFGSFSNYMWSYMNFK

Query:  PTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNLAER
          +++FRY R VP +TPKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  CFR + C+   ER
Subjt:  PTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNLAER
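
In the alignments above, the line between Query and Subjt shows the residue letter where the two sequences are identical, `+` for a conservative substitution, and a space for a mismatch or gap. The following is a minimal sketch of counting identities for a single block, assuming that midline convention; `block_identity` is a hypothetical helper, not part of the database. The % identity reported for each hit covers the whole alignment, so the value for one block will generally differ from it.

```python
def block_identity(query, midline):
    """Return (identities, positives, length, percent identity) for one alignment block."""
    # Pad the midline to the query length: trailing spaces are often lost in copied text.
    midline = midline.ljust(len(query))[:len(query)]
    identities = sum(ch.isalpha() for ch in midline)   # letters mark identical residues
    positives = identities + midline.count("+")        # '+' marks conservative substitutions
    length = len(query)                                # gap characters count toward the length
    return identities, positives, length, 100.0 * identities / length

# Final block of the Q7VG78 alignment shown above.
query = "DCFRHNECVNL"
midline = " CF+ N  + +"
identities, positives, length, percent = block_identity(query, midline)
print(identities, positives, length, round(percent, 2))
```

To reproduce a whole-alignment figure, the identities and lengths of all blocks of one hit would be summed before dividing.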


Sequences
CDS sequence
ATGTCATCCAAAGCCACTGTTAGAAGACAAATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAAAACATTTTGTCCAAACACCTTAAGAAGATTTA
CCCAATTGGGCTTCAAAGAACCAGTTCTTCTTTATCTTTTTCTTCACTGTCATTGTCTTTGTCCCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAACTGG
ATCAGAAAATTTCGTACGCGATTCGCCTTATTACGTCGCCTCCTGAAAGAAGAGAAGTCCCAATACCTAAAAGTATCCAACCACCAAGTCAAGAACTTGGTGATGGGGAA
TTGAGGAGGTGCAACTGGATCACCCATACCAGTGATAAAGCCTATGTATCATTTCATGACGAGTGCTGGGGCGTCCCGGTGTATGACGACAATCGACTTTTCGAGTTACT
CGCACTGTCTGGGATGCTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTCTTCAAGGAAGCTTTTGCTGGATTTGAGCCAAGTACTGTTGCCAACATGG
GGGAGAAAGAGATAACAGATATAGCATCTGATAAGGCCATTATGCTTGTGGAGAGCAGAGTGAGGTGCATAGTAGACAATGCCAAATGCATATTGAAGATAGCCAAAGAT
TTTGGATCGTTCAGTAACTATATGTGGAGCTACATGAACTTTAAACCAACAATAAACAGATTTAGATATCCAAGAAATGTTCCTCTGAGAACTCCCAAAGCAGAAGCCAT
CAGCAAGGATATGGTGAAGCGTGGTTTTCGGTTTGTTGGGCCAGTGATTGTCTATTCATTCATGCAAGCTGCAGGGTTGACGATTGATCATCTTGTCGATTGTTTTCGGC
ACAATGAATGCGTAAATCTTGCAGAAAGACCATGGAGACATATCTGA
mRNA sequence
ATGTCATCCAAAGCCACTGTTAGAAGACAAATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAAAACATTTTGTCCAAACACCTTAAGAAGATTTA
CCCAATTGGGCTTCAAAGAACCAGTTCTTCTTTATCTTTTTCTTCACTGTCATTGTCTTTGTCCCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAACTGG
ATCAGAAAATTTCGTACGCGATTCGCCTTATTACGTCGCCTCCTGAAAGAAGAGAAGTCCCAATACCTAAAAGTATCCAACCACCAAGTCAAGAACTTGGTGATGGGGAA
TTGAGGAGGTGCAACTGGATCACCCATACCAGTGATAAAGCCTATGTATCATTTCATGACGAGTGCTGGGGCGTCCCGGTGTATGACGACAATCGACTTTTCGAGTTACT
CGCACTGTCTGGGATGCTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTCTTCAAGGAAGCTTTTGCTGGATTTGAGCCAAGTACTGTTGCCAACATGG
GGGAGAAAGAGATAACAGATATAGCATCTGATAAGGCCATTATGCTTGTGGAGAGCAGAGTGAGGTGCATAGTAGACAATGCCAAATGCATATTGAAGATAGCCAAAGAT
TTTGGATCGTTCAGTAACTATATGTGGAGCTACATGAACTTTAAACCAACAATAAACAGATTTAGATATCCAAGAAATGTTCCTCTGAGAACTCCCAAAGCAGAAGCCAT
CAGCAAGGATATGGTGAAGCGTGGTTTTCGGTTTGTTGGGCCAGTGATTGTCTATTCATTCATGCAAGCTGCAGGGTTGACGATTGATCATCTTGTCGATTGTTTTCGGC
ACAATGAATGCGTAAATCTTGCAGAAAGACCATGGAGACATATCTGA
Protein sequence
MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITSPPERREVPIPKSIQPPSQELGDGE
LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFKEAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIAKD
FGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHNECVNLAERPWRHI
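
The protein sequence corresponds to the CDS read codon by codon up to the stop codon. The following is a minimal sketch of that relationship, assuming the standard nuclear genetic code; `translate` is a hypothetical helper, applied here to the first 30 and last 21 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTCATCCAAAGCCACTGTTAGAAGACAA"))  # start of the CDS
print(translate("AGACCATGGAGACATATCTGA"))           # end of the CDS, including the stop codon
```

The two fragments translate to the first ten and last six residues of the protein sequence, MSSKATVRRQ and RPWRHI.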