; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015431 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015431
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationChr02:26557782..26559340
RNA-Seq ExpressionHG10015431
SyntenyHG10015431
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0006541 - glutamine metabolic process (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032142.1 guaA [Cucurbita argyrosperma subsp. argyrosperma]6.2e-15795Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR+ILERQTC KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSI LDQKISYAIRLITP PPERRE PLPKS+
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR----IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI
        VESR    IARDFGSFSNYMWSYMNFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI
Subjt:  VESR----IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI

XP_004135425.1 uncharacterized protein LOC101218195 [Cucumis sativus]5.6e-15893.2Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSS+SLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSI
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITD+ASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWSY+NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

XP_008446481.2 PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo]3.6e-15793.53Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSI
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITDIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWS +NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

XP_022956507.1 uncharacterized protein LOC111458228 [Cucurbita moschata]1.2e-15792.88Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR+ILERQTC KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERRE PLPKS+
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWSYMNFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

XP_022993235.1 uncharacterized protein LOC111489316 [Cucurbita maxima]1.1e-15692.58Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT-PPPPERREVPLPKS
        MSSKATVRR+ILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLD+KISYAIRLIT PPPPERRE PLPKS
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT-PPPPERREVPLPKS

Query:  IQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM
        +QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIM
Subjt:  IQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM

Query:  LVESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
        LVESR             IARDFGSFSNYMWSYMNFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
Subjt:  LVESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

TrEMBL top hitse value%identityAlignment
A0A0A0KUC5 Uncharacterized protein2.7e-15893.2Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSS+SLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSI
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITD+ASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWSY+NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A1S3BEN5 DNA-3-methyladenine glycosylase 11.8e-15793.53Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSI
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITDIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWS +NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A5D3CCU6 DNA-3-methyladenine glycosylase 11.8e-15793.53Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSI
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITDIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWS +NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A6J1GX19 uncharacterized protein LOC1114582286.0e-15892.88Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR+ILERQTC KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERRE PLPKS+
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWSYMNFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A6J1JY14 uncharacterized protein LOC1114893165.1e-15792.58Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT-PPPPERREVPLPKS
        MSSKATVRR+ILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLD+KISYAIRLIT PPPPERRE PLPKS
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT-PPPPERREVPLPKS

Query:  IQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM
        +QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIM
Subjt:  IQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM

Query:  LVESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
        LVESR             IARDFGSFSNYMWSYMNFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
Subjt:  LVESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 16.0e-3034.25Show/hide
Query:  LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRIARDFGS
        + RC W+  + D  Y+++HD  WGVP  D  +LFE++ L G     +W  ++K+RE +R  F  F+P  VA M E+++  +  D  I+    +I    G+
Subjt:  LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRIARDFGS

Query:  -------------FSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
                     F +++WS++N +P + +      +P  +  ++A+SK + KRGF+FVG  I YSFMQA GL  DH+V C
Subjt:  -------------FSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

P44321 DNA-3-methyladenine glycosylase7.4e-2834.08Show/hide
Query:  RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRI------AR
        RC W+   S   Y+ +HD+ WG P +D  +LFE + L G     +W  ++K+RE +REAF  F+P  +A M   +I     +  ++   +++      A+
Subjt:  RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRI------AR

Query:  DF-------GSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
         +        +FS+++WS++N KP +N     R+VP ++  ++A+SK + KRGF F+G    Y+FMQ+ GL  DHL DC
Subjt:  DF-------GSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]6.2e-3536.76Show/hide
Query:  LPKSIQQQSQELSDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIT
        L KS+  ++Q+ ++G  E  RC W T   +   K Y  +HD  WG P+++D +LFE L L G     +W  I+K+RE FR AF  F+P  VAN  E +I 
Subjt:  LPKSIQQQSQELSDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIT

Query:  DIASDKAIMLVESRI-------------ARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLV
        ++  ++ I+   ++I              R+FGSF  Y+W ++  KP IN F    ++P  +P ++ I+KD+ KRGF+FVG   +Y+ MQ+ G+  DHL 
Subjt:  DIASDKAIMLVESRI-------------ARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLV

Query:  DCFR
         CF+
Subjt:  DCFR

Arabidopsis top hitse value%identityAlignment
AT1G13635.1 DNA glycosylase superfamily protein1.3e-10464.38Show/hide
Query:  RRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQ-S
        R++I+E+    +EK+ + + N  +KHLK+IYPI LQR T+SS SLSS+SLSLSQNS DS  TDS+  L+QKIS A+ LI+   P RRE+ +PKSI QQ  
Subjt:  RRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQ-S

Query:  QEL-SDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVES
        Q+  S  E +RCNWIT  SD+ YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA MGEKEI +IAS+KAIML ES
Subjt:  QEL-SDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVES

Query:  R-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE
        R             +  +FGSFS+++W +M++KP IN+F++ RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRHG+CV+LAE
Subjt:  R-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE

Query:  RPWRHI
        RPWRHI
Subjt:  RPWRHI

AT1G13635.2 DNA glycosylase superfamily protein1.3e-10464.38Show/hide
Query:  RRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQ-S
        R++I+E+    +EK+ + + N  +KHLK+IYPI LQR T+SS SLSS+SLSLSQNS DS  TDS+  L+QKIS A+ LI+   P RRE+ +PKSI QQ  
Subjt:  RRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQ-S

Query:  QEL-SDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVES
        Q+  S  E +RCNWIT  SD+ YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA MGEKEI +IAS+KAIML ES
Subjt:  QEL-SDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVES

Query:  R-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE
        R             +  +FGSFS+++W +M++KP IN+F++ RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRHG+CV+LAE
Subjt:  R-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE

Query:  RPWRHI
        RPWRHI
Subjt:  RPWRHI

AT1G80850.1 DNA glycosylase superfamily protein2.9e-5141Show/hide
Query:  LSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQE-----LSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDN
        L  + +S++ S +S+ SS  +SS       S   R++           L +++ ++  E       DG  +RC WIT  SD+ Y++FHDE WGVPV+DD 
Subjt:  LSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQE-----LSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDN

Query:  RLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITD--------IASDKAIMLVES-----RIARDFGSFSNYMWSYMNFKPTINRF
        RLFELL+LSG L + +W +I+ +R+LFRE F  F+P  ++ +  K+IT         ++  K   ++E+     +I   FGSF  Y+W+++N KPT ++F
Subjt:  RLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITD--------IASDKAIMLVES-----RIARDFGSFSNYMWSYMNFKPTINRF

Query:  RHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE
        R+PR VP+++ KAE ISKD+V+RGFR V P ++YSFMQ AGLT DHL  CFRH +C+   E
Subjt:  RHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE

AT5G57970.1 DNA glycosylase superfamily protein1.7e-5146.88Show/hide
Query:  RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVE----------
        +RC W+T  SD  Y+ FHDE WGVPV+DD RLFELL LSG L ++ W  I+ +R+ FRE FA F+P+ +  + EK+I    S  + +L +          
Subjt:  RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVE----------

Query:  ---SRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER
            ++  ++GSF  Y+WS++  K  +++FR+ R VP ++PKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  CFR   C+   ER
Subjt:  ---SRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER

AT5G57970.2 DNA glycosylase superfamily protein1.7e-5146.88Show/hide
Query:  RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVE----------
        +RC W+T  SD  Y+ FHDE WGVPV+DD RLFELL LSG L ++ W  I+ +R+ FRE FA F+P+ +  + EK+I    S  + +L +          
Subjt:  RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVE----------

Query:  ---SRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER
            ++  ++GSF  Y+WS++  K  +++FR+ R VP ++PKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  CFR   C+   ER
Subjt:  ---SRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCCAAAGCCACTGTTAGAAGACAGATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAAAACATTTTGTCCAAACACCTTAAGAAGATTTA
CCCAATTGGGTTACAAAGAACCACTTCATCACTATCTTTATCTTCACTATCATTGTCTTTGTCTCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAATTGG
ATCAGAAAATTTCGTACGCAATTCGCCTTATTACGCCGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATCCAACAACAAAGTCAAGAACTTAGTGATGGG
GAATTGAGGAGGTGCAACTGGATCACCCATACCAGTGATAAAGCCTATGTATCTTTTCATGACGAGTGTTGGGGTGTCCCAGTATACGATGACAACCGACTTTTCGAGCT
ACTCGCACTATCTGGGATGTTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACAGTTGCCAATA
TGGGGGAGAAAGAGATAACAGATATAGCTTCTGACAAGGCCATTATGCTGGTGGAGAGCAGAATAGCTAGAGATTTTGGATCGTTTAGTAACTATATGTGGAGCTATATG
AACTTTAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGT
TGGGCCAGTGATTGTCTATTCATTCATGCAAGCGGCTGGGTTGACCATCGATCATCTTGTCGATTGTTTTCGACACGGTGAATGTGTAAATCTTGCAGAAAGGCCATGGA
GACATATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATCCAAAGCCACTGTTAGAAGACAGATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAAAACATTTTGTCCAAACACCTTAAGAAGATTTA
CCCAATTGGGTTACAAAGAACCACTTCATCACTATCTTTATCTTCACTATCATTGTCTTTGTCTCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAATTGG
ATCAGAAAATTTCGTACGCAATTCGCCTTATTACGCCGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATCCAACAACAAAGTCAAGAACTTAGTGATGGG
GAATTGAGGAGGTGCAACTGGATCACCCATACCAGTGATAAAGCCTATGTATCTTTTCATGACGAGTGTTGGGGTGTCCCAGTATACGATGACAACCGACTTTTCGAGCT
ACTCGCACTATCTGGGATGTTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACAGTTGCCAATA
TGGGGGAGAAAGAGATAACAGATATAGCTTCTGACAAGGCCATTATGCTGGTGGAGAGCAGAATAGCTAGAGATTTTGGATCGTTTAGTAACTATATGTGGAGCTATATG
AACTTTAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGT
TGGGCCAGTGATTGTCTATTCATTCATGCAAGCGGCTGGGTTGACCATCGATCATCTTGTCGATTGTTTTCGACACGGTGAATGTGTAAATCTTGCAGAAAGGCCATGGA
GACATATCTGA
Protein sequenceShow/hide protein sequence
MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDG
ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRIARDFGSFSNYMWSYM
NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI