CuGenDBv2

Carg01776 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg01776
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: DNA glycosylase superfamily protein
Genome location: Carg_Chr04:9376831..9378865
RNA-Seq Expression: Carg01776
Synteny: Carg01776
Gene Ontology terms:
    GO:0006284 - base-excision repair (biological process)
    GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
    IPR005019 - Methyladenine glycosylase
    IPR011257 - DNA glycosylase
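
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of CuGenDB: the names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # ASSUMPTION: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


# Matches strings such as "Carg_Chr04:9376831..9378865".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Split a location string into sequence ID, start and end."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("Carg_Chr04:9376831..9378865")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 2035 bp on Carg_Chr04.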


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601358.1 hypothetical protein SDJN03_06591, partial [Cucurbita argyrosperma subsp. sororia]; e value: 7.4e-166; % identity: 97.09
Query:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV
        MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV
Subjt:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV

Query:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML
        QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML
Subjt:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML

Query:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESRVRCI         IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
Subjt:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI
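
The % identity figures on this page can be related to the alignment rows. In the alignment above, the middle row shows 300 identical residues over 309 alignment columns (300 residues plus a 9-column gap), and 300/309 rounds to the reported 97.09. The sketch below assumes that % identity is computed as identical columns divided by total alignment columns, gaps included; the function name `percent_identity` is hypothetical and this is not necessarily how the database computes the value.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style query row and its midline row.

    ASSUMPTION: identity = identical columns / all alignment columns,
    counting gap columns in the denominator. In the midline, a letter
    marks an identical residue, '+' a conservative substitution, and a
    space a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline rows must have the same length")
    if not query:
        raise ValueError("empty alignment")
    identical = sum(
        1 for q, m in zip(query, midline) if q != "-" and q == m
    )
    return 100.0 * identical / len(query)


# A fragment shaped like the gapped block above: 14 identities, 9 gap columns.
query_row = "VESRVRCI" + "-" * 9 + "IARDFG"
mid_row = "VESRVRCI" + " " * 9 + "IARDFG"
print(round(percent_identity(query_row, mid_row), 2))

# Whole-alignment check for the first GenBank hit: 300 identities, 309 columns.
print(round(100.0 * 300 / 309, 2))
```

For multi-block alignments such as those shown here, the rows of each block would be concatenated before calling the function.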

KAG7032142.1 guaA [Cucurbita argyrosperma subsp. argyrosperma]; e value: 3.5e-168; % identity: 100
Query:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV
        MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV
Subjt:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV

Query:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML
        QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML
Subjt:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML

Query:  VESRVRCIIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI
        VESRVRCIIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI
Subjt:  VESRVRCIIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI

XP_022956507.1 uncharacterized protein LOC111458228 [Cucurbita moschata]; e value: 4.0e-164; % identity: 96.44
Query:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV
        MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSI LDQKISYAIRLITP PPERREAPLPKSV
Subjt:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV

Query:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML
        QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML
Subjt:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML

Query:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESRVRCI         IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
Subjt:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

XP_022993235.1 uncharacterized protein LOC111489316 [Cucurbita maxima]; e value: 1.1e-161; % identity: 95.48
Query:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLIT-PLPPERREAPLPKS
        MSSKATVRRRILERQTC KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSI LD+KISYAIRLIT P PPERREAPLPKS
Subjt:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLIT-PLPPERREAPLPKS

Query:  VQQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIM
        VQQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIM
Subjt:  VQQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIM

Query:  LVESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
        LVESRVRCI         IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
Subjt:  LVESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

XP_023528370.1 uncharacterized protein LOC111791309 [Cucurbita pepo subsp. pepo]; e value: 4.2e-161; % identity: 94.55
Query:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLIT---PLPPERREAPLP
        MSSKATVRRRILERQTC KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSS SLSLSQNSNDSSLTDSSI LDQKISYAIRLIT   P PPERREAPLP
Subjt:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLIT---PLPPERREAPLP

Query:  KSVQQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKA
        KSVQQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEI+DIASDKA
Subjt:  KSVQQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKA

Query:  IMLVESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGE
        IMLVESRVRCI         IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGE
Subjt:  IMLVESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGE

Query:  CVNLAERPWRHI
        CVNLAERPWRHI
Subjt:  CVNLAERPWRHI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KUC5 Uncharacterized protein; e value: 3.4e-156; % identity: 91.59
Query:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV
        MSSKATVRR ILERQ C KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSS+SLSLSQNSNDSSLTDSSI LDQKISYAIRLITP PPERRE PLPKS+
Subjt:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV

Query:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML
        QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE S VANMGEKEI+D+ASDKAIML
Subjt:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML

Query:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESRVRCI         IARDFGSFSNYMWSY+NFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A1S3BEN5 DNA-3-methyladenine glycosylase 1; e value: 2.2e-155; % identity: 91.91
Query:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV
        MSSKATVRR ILERQ C KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSI LDQKISYAIRLITP PPERRE PLPKS+
Subjt:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV

Query:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML
        QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE S VANMGEKEI+DIASDKAIML
Subjt:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML

Query:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESRVRCI         IARDFGSFSNYMWS +NFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A5D3CCU6 DNA-3-methyladenine glycosylase 1; e value: 2.2e-155; % identity: 91.91
Query:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV
        MSSKATVRR ILERQ C KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSI LDQKISYAIRLITP PPERRE PLPKS+
Subjt:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV

Query:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML
        QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE S VANMGEKEI+DIASDKAIML
Subjt:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML

Query:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESRVRCI         IARDFGSFSNYMWS +NFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A6J1GX19 uncharacterized protein LOC111458228; e value: 2.0e-164; % identity: 96.44
Query:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV
        MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSI LDQKISYAIRLITP PPERREAPLPKSV
Subjt:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSV

Query:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML
        QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML
Subjt:  QQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIML

Query:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESRVRCI         IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
Subjt:  VESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A6J1JY14 uncharacterized protein LOC111489316; e value: 5.3e-162; % identity: 95.48
Query:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLIT-PLPPERREAPLPKS
        MSSKATVRRRILERQTC KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSI LD+KISYAIRLIT P PPERREAPLPKS
Subjt:  MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLIT-PLPPERREAPLPKS

Query:  VQQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIM
        VQQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIM
Subjt:  VQQQCQELGDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIM

Query:  LVESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
        LVESRVRCI         IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
Subjt:  LVESRVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

SwissProt top hits (e value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1; e value: 8.0e-30; % identity: 33.7
Query:  LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVESRVRCIIA-
        + RC W+  + D  Y+++HD  WGVP  D  +LFE++ L G     +W  ++K+RE +R  F  F+   VA M E+++  +  D  I+    +++ II  
Subjt:  LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVESRVRCIIA-

Query:  --------RDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
                ++   F +++WS++N +P + +      +P  +  ++A+SK + KRGF+FVG  I YSFMQA GL  DH+V C
Subjt:  --------RDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

P44321 DNA-3-methyladenine glycosylase; e value: 7.5e-28; % identity: 34.08
Query:  RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVESRVRCII--AR
        RC W+   S   Y+ +HD+ WG P +D  +LFE + L G     +W  ++K+RE +REAF  F+   +A M   +I     +  ++   +++  I+  A+
Subjt:  RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVESRVRCII--AR

Query:  DF-------GSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
         +        +FS+++WS++N KP +N     R+VP ++  ++A+SK + KRGF F+G    Y+FMQ+ GL  DHL DC
Subjt:  DF-------GSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]; e value: 1.1e-34; % identity: 35.78
Query:  LPKSVQQQCQELGDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEIS
        L KS+  + Q+  +G  E  RC W T   +   K Y  +HD  WG P+++D +LFE L L G     +W  I+K+RE FR AF  F+   VAN  E +I 
Subjt:  LPKSVQQQCQELGDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEIS

Query:  DIASDKAIM---------LVESRVRCIIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLV
        ++  ++ I+         ++ ++    + R+FGSF  Y+W ++  KP IN F    ++P  +P ++ I+KD+ KRGF+FVG   +Y+ MQ+ G+  DHL 
Subjt:  DIASDKAIM---------LVESRVRCIIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLV

Query:  DCFR
         CF+
Subjt:  DCFR

Arabidopsis top hits (e value, % identity, alignment)
AT1G13635.1 DNA glycosylase superfamily protein; e value: 2.2e-107; % identity: 65.36
Query:  RRRILERQTCSKEKD-RTSQNILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSVQQQ-C
        R+ I+E+    +EK+ + + N  +KHLK+IYPI LQR T+SS SLSS+SLSLSQNS DS  TDS+  L+QKIS A+ LI+   P RRE  +PKS+ QQ C
Subjt:  RRRILERQTCSKEKD-RTSQNILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSVQQQ-C

Query:  QELGDG-ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVES
        Q+     E +RCNWIT  SD+ YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+ + VA MGEKEI++IAS+KAIML ES
Subjt:  QELGDG-ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVES

Query:  RVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE
        RVRCI         +  +FGSFS+++W +M++KP IN+F+Y RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRHG+CV+LAE
Subjt:  RVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE

Query:  RPWRHI
        RPWRHI
Subjt:  RPWRHI

AT1G13635.2 DNA glycosylase superfamily protein; e value: 2.2e-107; % identity: 65.36
Query:  RRRILERQTCSKEKD-RTSQNILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSVQQQ-C
        R+ I+E+    +EK+ + + N  +KHLK+IYPI LQR T+SS SLSS+SLSLSQNS DS  TDS+  L+QKIS A+ LI+   P RRE  +PKS+ QQ C
Subjt:  RRRILERQTCSKEKD-RTSQNILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSVQQQ-C

Query:  QELGDG-ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVES
        Q+     E +RCNWIT  SD+ YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+ + VA MGEKEI++IAS+KAIML ES
Subjt:  QELGDG-ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVES

Query:  RVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE
        RVRCI         +  +FGSFS+++W +M++KP IN+F+Y RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRHG+CV+LAE
Subjt:  RVRCI---------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE

Query:  RPWRHI
        RPWRHI
Subjt:  RPWRHI

AT1G75090.1 DNA glycosylase superfamily protein; e value: 3.4e-52; % identity: 48.21
Query:  GELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVESRVRCI-
        G ++RC+WIT  SD  YV FHDE WGVPV DD +LFELL  S  L +++W  I++RR+ FR+ F  F+ S +A   EK +  +  +  ++L E ++R I 
Subjt:  GELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVESRVRCI-

Query:  --------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER
                + ++FGSFSNY W ++N KP  N +RY R VP++SPKAE ISKDM++RGFR VGP ++YSF+QA+G+  DHL  CFR+ EC    ER
Subjt:  --------IARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER

AT1G80850.1 DNA glycosylase superfamily protein; e value: 3.4e-52; % identity: 41.38
Query:  LSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSVQQQCQELG-----DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDN
        L  + +S++ S +S+ SS  +SS       S   R++         + L +++ ++  E       DG  +RC WIT  SD+ Y++FHDE WGVPV+DD 
Subjt:  LSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSVQQQCQELG-----DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDN

Query:  RLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVESRVR---------CIIARDFGSFSNYMWSYMNFKPTINRF
        RLFELL+LSG L + +W +I+ +R+LFRE F  F+   ++ +  K+I+        +L E ++R         C I   FGSF  Y+W+++N KPT ++F
Subjt:  RLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVESRVR---------CIIARDFGSFSNYMWSYMNFKPTINRF

Query:  RYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE
        RYPR VP+++ KAE ISKD+V+RGFR V P ++YSFMQ AGLT DHL  CFRH +C+   E
Subjt:  RYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE

AT5G57970.1 DNA glycosylase superfamily protein; e value: 2.2e-51; % identity: 41.42
Query:  LQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSVQQQ----CQELGDGELRRCNWITHTSDKAYVSFHDECWGV
        L+R   +L+ S+LSL+ S  S+D+S+   S H        IR  +     +     P+SV  +        G    +RC W+T  SD  Y+ FHDE WGV
Subjt:  LQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSVQQQ----CQELGDGELRRCNWITHTSDKAYVSFHDECWGV

Query:  PVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVESRVRCII---------ARDFGSFSNYMWSYMNFK
        PV+DD RLFELL LSG L ++ W  I+ +R+ FRE FA F+ + +  + EK+I    S  + +L + ++R +I           ++GSF  Y+WS++  K
Subjt:  PVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVESRVRCII---------ARDFGSFSNYMWSYMNFK

Query:  PTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER
          +++FRY R VP ++PKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  CFR   C+   ER
Subjt:  PTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER


Sequences
CDS sequence
ATGTCATCCAAAGCCACTGTTAGAAGGCGAATTCTGGAGAGGCAAACATGTTCTAAAGAGAAAGATAGAACAAGCCAAAACATATTGTCTAAACACCTTAAGAAGATTTA
CCCAATTGGGCTTCAAAGAACCACTTCATCACTCTCTTTATCTTCATTATCATTGTCTTTGTCCCAAAATTCAAATGATTCTTCTCTTACAGACTCCTCGATCCATCTCG
ATCAGAAGATTTCGTATGCGATTCGTTTGATTACGCCGCTGCCTCCTGAAAGAAGAGAAGCTCCATTGCCTAAGAGTGTCCAACAACAATGTCAGGAACTTGGTGATGGG
GAACTCAGGAGGTGCAACTGGATCACTCATACCAGTGATAAAGCCTATGTATCCTTTCACGACGAGTGTTGGGGCGTCCCAGTGTATGACGACAACCGACTTTTCGAGCT
ACTCGCTCTATCGGGGATGCTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGAGAAGCTTTTGCTGGATTTGAGGCAAGTACTGTGGCCAACA
TGGGGGAGAAAGAGATATCAGATATAGCATCTGACAAGGCCATTATGCTGGTGGAGAGCAGAGTGAGGTGCATAATAGCTAGAGATTTTGGGTCGTTCAGTAACTATATG
TGGAGCTATATGAACTTCAAACCAACAATAAACAGATTTAGATATCCAAGAAATGTTCCCCTGAGGAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCGG
TTTTCGGTTTGTTGGGCCAGTGATTGTCTATTCGTTCATGCAGGCGGCAGGGTTGACGATCGATCATCTTGTGGACTGTTTTCGGCATGGCGAATGCGTAAATCTTGCAG
AAAGACCATGGAGACATATCTGA
mRNA sequence
TATGTCATCCAAAGCCACTGTTAGAAGGCGAATTCTGGAGAGGCAAACATGTTCTAAAGAGAAAGATAGAACAAGCCAAAACATATTGTCTAAACACCTTAAGAAGATTT
ACCCAATTGGGCTTCAAAGAACCACTTCATCACTCTCTTTATCTTCATTATCATTGTCTTTGTCCCAAAATTCAAATGATTCTTCTCTTACAGACTCCTCGATCCATCTC
GATCAGAAGATTTCGTATGCGATTCGTTTGATTACGCCGCTGCCTCCTGAAAGAAGAGAAGCTCCATTGCCTAAGAGTGTCCAACAACAATGTCAGGAACTTGGTGATGG
GGAACTCAGGAGGTGCAACTGGATCACTCATACCAGTGATAAAGCCTATGTATCCTTTCACGACGAGTGTTGGGGCGTCCCAGTGTATGACGACAACCGACTTTTCGAGC
TACTCGCTCTATCGGGGATGCTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGAGAAGCTTTTGCTGGATTTGAGGCAAGTACTGTGGCCAAC
ATGGGGGAGAAAGAGATATCAGATATAGCATCTGACAAGGCCATTATGCTGGTGGAGAGCAGAGTGAGGTGCATAATAGCTAGAGATTTTGGGTCGTTCAGTAACTATAT
GTGGAGCTATATGAACTTCAAACCAACAATAAACAGATTTAGATATCCAAGAAATGTTCCCCTGAGGAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCG
GTTTTCGGTTTGTTGGGCCAGTGATTGTCTATTCGTTCATGCAGGCGGCAGGGTTGACGATCGATCATCTTGTGGACTGTTTTCGGCATGGCGAATGCGTAAATCTTGCA
GAAAGACCATGGAGACATATCTGAGTTCAACATTTCCTACTTTCCCTTCAAAGTTGTGGTTGTTTTGCTTTTGTGAGCATTAAGTTAACAATATAAAAAATTATGCATAG
AGAAAGAGAGAGGAACTAAGATGGAACTTCTCTGCTTCTTTGTTTATCAGTTTCTGCTAGAATGGCCAATTCTGCAACTGAGCAATTCATCAGTTGATATCAAAAAGTGT
GCAGAGCTCAGCATTACAACAGTTGCAGAGCAGCCATGGTAATCGACAAGTTGAAGCTGGTGGAGAATAAGATTGGTGTATTTAATTTGTTTATGCTTCTAATATATAAC
TAATTATTTTAGATGTTTGGTGTGGATCTCTTGGGTCAAAATTGCTTATCTATGAGCTTTCTCATACATTCTAAACAAGCTCTCATTTCCTAAGACATTAAGAACCAAGA
ATGTTCTTTGTTCATGAAGAACTTCATGATACAACTAAGTTCATCAAAGCAGACATGACCATAGAAAACTGCAGTCTAGTTACAGTTCATAAGATCCAAATATGTAATAC
AAACCTCGTTATATCATCCCTCGCCTCGAGAAATGTCGATCGAGTTTCGAGATAGCACTCCAAGCCTGGTCCTTCAAATTGGCTGAGAATCAAGTCCATAGATCATCTTT
CTCCCCCATCTGCTGGGCCAAAAAAGCTTTACCAAGATTAGTTGTAATCCTGCTACGAAGGATTCTCTCAATGTGTGAGATCCCACGCATCAGTTGGAGAGGAGAACGAA
ACATT
Protein sequence
MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSVQQQCQELGDG
ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEASTVANMGEKEISDIASDKAIMLVESRVRCIIARDFGSFSNYM
WSYMNFKPTINRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI
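
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is illustrative only: it assumes the standard nuclear genetic code applies to this gene, the function name `translate` is hypothetical, and only short fragments copied from the start and end of the CDS are translated here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order
# (ASSUMPTION: the standard nuclear code is the right one for this gene).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame coding sequence into one-letter amino acids.

    Unknown codons (for example those containing N) become 'X'.
    With to_stop=True, translation ends before the first stop codon;
    otherwise stop codons are written as '*'.
    """
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS above; the protein sequence begins MSSKATVRRR.
print(translate("ATGTCATCCAAAGCCACTGTTAGAAGGCGA"))

# Last 12 nt of the CDS above (in frame); the protein sequence ends RHI.
print(translate("AGACATATCTGA", to_stop=False))
```

To check the whole record, the CDS lines would be joined into one string and the result compared with the joined protein lines.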