; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015261 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015261
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationscaffold2:2124812..2126285
RNA-Seq ExpressionMS015261
SyntenyMS015261
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135425.1 uncharacterized protein LOC101218195 [Cucumis sativus]8.7e-16293.18Show/hide
Query:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ
        MSSKATVRR +LERQ CPKEKDRTSQNILSK LKKIYPIGLQR++SS S SS+SLSLSQNSNDSSLTDSS QLDQKISYAIRLI PPPERREVP+ KSIQ
Subjt:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ

Query:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV
        QQSQEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEI D+ASDKAIMLV
Subjt:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINR+R+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

XP_008446481.2 PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo]5.6e-16193.51Show/hide
Query:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ
        MSSKATVRR +LERQ CPKEKDRTSQNILSK LKKIYPIGLQR++SS S SSLSLSLSQNSNDSSLTDSS QLDQKISYAIRLI PPPERREVP+ KSIQ
Subjt:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ

Query:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV
        QQSQEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEI DIASDKAIMLV
Subjt:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWS VNFKPTINR+R+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

XP_022149074.1 uncharacterized protein LOC111017575 [Momordica charantia]1.2e-171100Show/hide
Query:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ
        MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ
Subjt:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ

Query:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV
        QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV
Subjt:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

XP_022993235.1 uncharacterized protein LOC111489316 [Cucurbita maxima]1.6e-16092.9Show/hide
Query:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIV--PPPERREVPIVKS
        MSSKATVRR++LERQTCPKEKDRTSQNILSK LKKIYPIGLQR++SS S SSLSLSLSQNSNDSSLTDSS QLD+KISYAIRLI   PPPERRE P+ KS
Subjt:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIV--PPPERREVPIVKS

Query:  IQQQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIM
        +QQQ QELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIM
Subjt:  IQQQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIM

Query:  LVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECV
        LVESRVRCIVDNAKCILKIARDFGSFSNYMWSY+NFKPTINR+RYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRH ECV
Subjt:  LVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

XP_023528370.1 uncharacterized protein LOC111791309 [Cucurbita pepo subsp. pepo]1.3e-16092.63Show/hide
Query:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIV----PPPERREVPIV
        MSSKATVRR++LERQTCPKEKDRTSQNILSK LKKIYPIGLQR++SS S SS SLSLSQNSNDSSLTDSS QLDQKISYAIRLI     PPPERRE P+ 
Subjt:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIV----PPPERREVPIV

Query:  KSIQQQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKA
        KS+QQQ QELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEIADIASDKA
Subjt:  KSIQQQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKA

Query:  IMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSE
        IMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSY+NFKPTINR+RYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRH E
Subjt:  IMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSE

Query:  CVNLAERPWRHI
        CVNLAERPWRHI
Subjt:  CVNLAERPWRHI

TrEMBL top hitse value%identityAlignment
A0A0A0KUC5 Uncharacterized protein4.2e-16293.18Show/hide
Query:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ
        MSSKATVRR +LERQ CPKEKDRTSQNILSK LKKIYPIGLQR++SS S SS+SLSLSQNSNDSSLTDSS QLDQKISYAIRLI PPPERREVP+ KSIQ
Subjt:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ

Query:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV
        QQSQEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEI D+ASDKAIMLV
Subjt:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINR+R+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A1S3BEN5 DNA-3-methyladenine glycosylase 12.7e-16193.51Show/hide
Query:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ
        MSSKATVRR +LERQ CPKEKDRTSQNILSK LKKIYPIGLQR++SS S SSLSLSLSQNSNDSSLTDSS QLDQKISYAIRLI PPPERREVP+ KSIQ
Subjt:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ

Query:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV
        QQSQEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEI DIASDKAIMLV
Subjt:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWS VNFKPTINR+R+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A5D3CCU6 DNA-3-methyladenine glycosylase 12.7e-16193.51Show/hide
Query:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ
        MSSKATVRR +LERQ CPKEKDRTSQNILSK LKKIYPIGLQR++SS S SSLSLSLSQNSNDSSLTDSS QLDQKISYAIRLI PPPERREVP+ KSIQ
Subjt:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ

Query:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV
        QQSQEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEI DIASDKAIMLV
Subjt:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWS VNFKPTINR+R+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A6J1D5X7 uncharacterized protein LOC1110175755.8e-172100Show/hide
Query:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ
        MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ
Subjt:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQ

Query:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV
        QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV
Subjt:  QQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A6J1JY14 uncharacterized protein LOC1114893167.9e-16192.9Show/hide
Query:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIV--PPPERREVPIVKS
        MSSKATVRR++LERQTCPKEKDRTSQNILSK LKKIYPIGLQR++SS S SSLSLSLSQNSNDSSLTDSS QLD+KISYAIRLI   PPPERRE P+ KS
Subjt:  MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIV--PPPERREVPIVKS

Query:  IQQQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIM
        +QQQ QELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIM
Subjt:  IQQQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIM

Query:  LVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECV
        LVESRVRCIVDNAKCILKIARDFGSFSNYMWSY+NFKPTINR+RYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRH ECV
Subjt:  LVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 18.5e-3535.91Show/hide
Query:  LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESRVRCIVDN
        + RC W+  + D  Y+++HD  WGVP  D  +LFE++ L G     +W  ++K+RE +R  F  F+P  VA M E+++  +  D  I+    +++ I+ N
Subjt:  LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESRVRCIVDN

Query:  AKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
        A+  L++ ++   F +++WS+VN +P + +      +P  +  ++A+SK + KRGF+FVG  I YSFMQA GL  DH+V C
Subjt:  AKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

P44321 DNA-3-methyladenine glycosylase4.6e-3337.43Show/hide
Query:  RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESRVRCIVDNAK
        RC W+   S   Y+ +HD+ WG P +D  +LFE + L G     +W  ++K+RE +REAF  F+P  +A M   +I     +  ++   +++  IV NAK
Subjt:  RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESRVRCIVDNAK

Query:  CILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
          L + +   +FS+++WS+VN KP +N     R+VP ++  ++A+SK + KRGF F+G    Y+FMQ+ GL  DHL DC
Subjt:  CILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]5.7e-3936.74Show/hide
Query:  IVPPPERREVPIVKSIQQQSQELGDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS
        ++P    RE  + KS+  ++Q+  +G  E  RC W T   +   K Y  +HD  WG P+++D +LFE L L G     +W  I+K+RE FR AF  F+P 
Subjt:  IVPPPERREVPIVKSIQQQSQELGDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS

Query:  TVANMGEKEIADIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFM
         VAN  E +I ++  ++ I+   +++   + NAK  + + R+FGSF  Y+W +V  KP IN +    ++P  +P ++ I+KD+ KRGF+FVG   +Y+ M
Subjt:  TVANMGEKEIADIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFM

Query:  QAAGLTIDHLVDCFR
        Q+ G+  DHL  CF+
Subjt:  QAAGLTIDHLVDCFR

Arabidopsis top hitse value%identityAlignment
AT1G13635.1 DNA glycosylase superfamily protein3.9e-11267.21Show/hide
Query:  RRQLLERQTCPKEKD-RTSQNILSKQLKKIYPIGLQRS-SSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQQQ-SQ
        R++++E+    +EK+ + + N  +K LK+IYPI LQRS SSSFS SS+SLSLSQNS DS  TDS++ L+QKIS A+ LI   P RRE+ + KSI QQ  Q
Subjt:  RRQLLERQTCPKEKD-RTSQNILSKQLKKIYPIGLQRS-SSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQQQ-SQ

Query:  ELGDG-ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESR
        +     E +RCNWIT  SD+ YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA MGEKEIA+IAS+KAIML ESR
Subjt:  ELGDG-ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESR

Query:  VRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNLAER
        VRCIVDNAKCI K+  +FGSFS+++W ++++KP IN+++Y RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRH +CV+LAER
Subjt:  VRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNLAER

Query:  PWRHI
        PWRHI
Subjt:  PWRHI

AT1G13635.2 DNA glycosylase superfamily protein3.9e-11267.21Show/hide
Query:  RRQLLERQTCPKEKD-RTSQNILSKQLKKIYPIGLQRS-SSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQQQ-SQ
        R++++E+    +EK+ + + N  +K LK+IYPI LQRS SSSFS SS+SLSLSQNS DS  TDS++ L+QKIS A+ LI   P RRE+ + KSI QQ  Q
Subjt:  RRQLLERQTCPKEKD-RTSQNILSKQLKKIYPIGLQRS-SSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQQQ-SQ

Query:  ELGDG-ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESR
        +     E +RCNWIT  SD+ YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA MGEKEIA+IAS+KAIML ESR
Subjt:  ELGDG-ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESR

Query:  VRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNLAER
        VRCIVDNAKCI K+  +FGSFS+++W ++++KP IN+++Y RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRH +CV+LAER
Subjt:  VRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNLAER

Query:  PWRHI
        PWRHI
Subjt:  PWRHI

AT1G75090.1 DNA glycosylase superfamily protein7.8e-6044.49Show/hide
Query:  SLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQQQSQELGD------GELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLAL
        + S +++DSS + SS++     +     +  P +R  V  + ++      + D      G ++RC+WIT  SD  YV FHDE WGVPV DD +LFELL  
Subjt:  SLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQQQSQELGD------GELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLAL

Query:  SGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPL
        S  L +++W  I++RR+ FR+ F  F+PS +A   EK +  +  +  ++L E ++R IV+NAK +LK+ ++FGSFSNY W +VN KP  N YRY R VP+
Subjt:  SGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRNVPL

Query:  RSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNLAER
        +SPKAE ISKDM++RGFR VGP ++YSF+QA+G+  DHL  CFR+ EC    ER
Subjt:  RSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNLAER

AT5G57970.1 DNA glycosylase superfamily protein3.3e-5845.14Show/hide
Query:  SSLSLSLSQNSN---DSSLTDSST-QLDQKISYAIRLIVPPPERREVPIVKSIQQQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFEL
        S+LSL+ S +S+   DS  + +ST +L +  S   R    P + R V  V      S   G    +RC W+T  SD  Y+ FHDE WGVPV+DD RLFEL
Subjt:  SSLSLSLSQNSN---DSSLTDSST-QLDQKISYAIRLIVPPPERREVPIVKSIQQQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFEL

Query:  LALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRN
        L LSG L ++ W  I+ +R+ FRE FA F+P+ +  + EK+I    S  + +L + ++R +++NA+ ILK+  ++GSF  Y+WS+V  K  ++++RY R 
Subjt:  LALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRN

Query:  VPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNLAER
        VP ++PKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  CFR   C+   ER
Subjt:  VPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNLAER

AT5G57970.2 DNA glycosylase superfamily protein3.3e-5845.14Show/hide
Query:  SSLSLSLSQNSN---DSSLTDSST-QLDQKISYAIRLIVPPPERREVPIVKSIQQQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFEL
        S+LSL+ S +S+   DS  + +ST +L +  S   R    P + R V  V      S   G    +RC W+T  SD  Y+ FHDE WGVPV+DD RLFEL
Subjt:  SSLSLSLSQNSN---DSSLTDSST-QLDQKISYAIRLIVPPPERREVPIVKSIQQQSQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFEL

Query:  LALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRN
        L LSG L ++ W  I+ +R+ FRE FA F+P+ +  + EK+I    S  + +L + ++R +++NA+ ILK+  ++GSF  Y+WS+V  K  ++++RY R 
Subjt:  LALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRYRYPRN

Query:  VPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNLAER
        VP ++PKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  CFR   C+   ER
Subjt:  VPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNLAER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCCAAAGCCACTGTTAGAAGACAGCTTCTGGAGAGGCAAACATGTCCTAAAGAGAAGGATAGGACAAGCCAAAACATTTTGTCCAAACAGCTTAAGAAGATTTA
CCCAATTGGGCTTCAAAGAAGCAGTTCATCATTTTCTTTTTCTTCATTGTCATTGTCTTTGTCCCAAAACTCTAATGACTCTTCTCTCACCGACTCCTCGACCCAGCTCG
ATCAGAAGATTTCGTATGCAATTCGCCTAATTGTGCCACCTCCCGAGAGAAGAGAAGTACCGATAGTTAAAAGTATCCAACAACAAAGTCAGGAACTTGGTGATGGAGAA
TTGAGGAGGTGCAACTGGATTACCCATACTAGTGACAAAGCCTATGTATCATTTCACGACGAGTGCTGGGGCGTTCCGGTGTACGACGACAATCGACTTTTCGAGCTGCT
CGCATTGTCTGGCATGTTGATGGACTACAATTGGACCGAAATCGTGAAAAGGAGGGAATTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACTGTTGCCAACATGG
GGGAGAAAGAGATAGCAGATATAGCATCTGATAAGGCCATTATGTTGGTGGAAAGCAGAGTGAGGTGCATTGTAGACAATGCCAAATGCATACTGAAGATAGCTAGAGAT
TTTGGATCTTTCAGTAACTATATGTGGAGCTATGTGAACTTTAAACCAACAATAAACAGATATAGATATCCAAGAAATGTTCCTCTGAGAAGTCCCAAAGCAGAAGCCAT
TAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGTCGGGCCAGTGATTGTCTATTCATTCATGCAAGCTGCAGGGTTGACAATTGATCATCTTGTCGATTGTTTTCGGC
ACAGTGAATGCGTAAATCTTGCGGAAAGACCATGGAGACATATC
mRNA sequenceShow/hide mRNA sequence
ATGTCATCCAAAGCCACTGTTAGAAGACAGCTTCTGGAGAGGCAAACATGTCCTAAAGAGAAGGATAGGACAAGCCAAAACATTTTGTCCAAACAGCTTAAGAAGATTTA
CCCAATTGGGCTTCAAAGAAGCAGTTCATCATTTTCTTTTTCTTCATTGTCATTGTCTTTGTCCCAAAACTCTAATGACTCTTCTCTCACCGACTCCTCGACCCAGCTCG
ATCAGAAGATTTCGTATGCAATTCGCCTAATTGTGCCACCTCCCGAGAGAAGAGAAGTACCGATAGTTAAAAGTATCCAACAACAAAGTCAGGAACTTGGTGATGGAGAA
TTGAGGAGGTGCAACTGGATTACCCATACTAGTGACAAAGCCTATGTATCATTTCACGACGAGTGCTGGGGCGTTCCGGTGTACGACGACAATCGACTTTTCGAGCTGCT
CGCATTGTCTGGCATGTTGATGGACTACAATTGGACCGAAATCGTGAAAAGGAGGGAATTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACTGTTGCCAACATGG
GGGAGAAAGAGATAGCAGATATAGCATCTGATAAGGCCATTATGTTGGTGGAAAGCAGAGTGAGGTGCATTGTAGACAATGCCAAATGCATACTGAAGATAGCTAGAGAT
TTTGGATCTTTCAGTAACTATATGTGGAGCTATGTGAACTTTAAACCAACAATAAACAGATATAGATATCCAAGAAATGTTCCTCTGAGAAGTCCCAAAGCAGAAGCCAT
TAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGTCGGGCCAGTGATTGTCTATTCATTCATGCAAGCTGCAGGGTTGACAATTGATCATCTTGTCGATTGTTTTCGGC
ACAGTGAATGCGTAAATCTTGCGGAAAGACCATGGAGACATATC
Protein sequenceShow/hide protein sequence
MSSKATVRRQLLERQTCPKEKDRTSQNILSKQLKKIYPIGLQRSSSSFSFSSLSLSLSQNSNDSSLTDSSTQLDQKISYAIRLIVPPPERREVPIVKSIQQQSQELGDGE
LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIADIASDKAIMLVESRVRCIVDNAKCILKIARD
FGSFSNYMWSYVNFKPTINRYRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHSECVNLAERPWRHI