; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G003420 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G003420
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationchr04:3422632..3424259
RNA-Seq ExpressionLsi04G003420
SyntenyLsi04G003420
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0006541 - glutamine metabolic process (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032142.1 guaA [Cucurbita argyrosperma subsp. argyrosperma]2.3e-15694.67Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR+ILERQTC KEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSI LDQKISYAIRLITP PPERRE PLPKS+
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR----IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI
        VESR    IARDFGSFSNYMWSYMNFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI
Subjt:  VESR----IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI

XP_004135425.1 uncharacterized protein LOC101218195 [Cucumis sativus]2.1e-15792.88Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR ILERQ CPKEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSS+SLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSI
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITD+ASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWSY+NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

XP_008446481.2 PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo]1.4e-15693.2Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR ILERQ CPKEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSI
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITDIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWS +NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

XP_022956507.1 uncharacterized protein LOC111458228 [Cucurbita moschata]4.7e-15792.56Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR+ILERQTC KEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERRE PLPKS+
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWSYMNFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

XP_022993235.1 uncharacterized protein LOC111489316 [Cucurbita maxima]4.0e-15692.26Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT-PPPPERREVPLPKS
        MSSKATVRR+ILERQTCPKEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLD+KISYAIRLIT PPPPERRE PLPKS
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT-PPPPERREVPLPKS

Query:  IQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM
        +QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIM
Subjt:  IQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM

Query:  LVESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
        LVESR             IARDFGSFSNYMWSYMNFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
Subjt:  LVESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

TrEMBL top hitse value%identityAlignment
A0A0A0KUC5 Uncharacterized protein1.0e-15792.88Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR ILERQ CPKEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSS+SLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSI
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITD+ASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWSY+NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A1S3BEN5 DNA-3-methyladenine glycosylase 16.7e-15793.2Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR ILERQ CPKEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSI
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITDIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWS +NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A5D3CCU6 DNA-3-methyladenine glycosylase 16.7e-15793.2Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR ILERQ CPKEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSI
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITDIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWS +NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A6J1GX19 uncharacterized protein LOC1114582282.3e-15792.56Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI
        MSSKATVRR+ILERQTC KEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERRE PLPKS+
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSI

Query:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML
        QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIML
Subjt:  QQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIML

Query:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
        VESR             IARDFGSFSNYMWSYMNFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
Subjt:  VESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN

Query:  LAERPWRHI
        LAERPWRHI
Subjt:  LAERPWRHI

A0A6J1JY14 uncharacterized protein LOC1114893161.9e-15692.26Show/hide
Query:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT-PPPPERREVPLPKS
        MSSKATVRR+ILERQTCPKEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLD+KISYAIRLIT PPPPERRE PLPKS
Subjt:  MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT-PPPPERREVPLPKS

Query:  IQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM
        +QQQ QEL DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIM
Subjt:  IQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM

Query:  LVESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
        LVESR             IARDFGSFSNYMWSYMNFKPTINRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
Subjt:  LVESR-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 16.0e-3034.25Show/hide
Query:  LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRIARDFGS
        + RC W+  + D  Y+++HD  WGVP  D  +LFE++ L G     +W  ++K+RE +R  F  F+P  VA M E+++  +  D  I+    +I    G+
Subjt:  LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRIARDFGS

Query:  -------------FSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
                     F +++WS++N +P + +      +P  +  ++A+SK + KRGF+FVG  I YSFMQA GL  DH+V C
Subjt:  -------------FSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

P44321 DNA-3-methyladenine glycosylase7.4e-2834.08Show/hide
Query:  RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRI------AR
        RC W+   S   Y+ +HD+ WG P +D  +LFE + L G     +W  ++K+RE +REAF  F+P  +A M   +I     +  ++   +++      A+
Subjt:  RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRI------AR

Query:  DF-------GSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
         +        +FS+++WS++N KP +N     R+VP ++  ++A+SK + KRGF F+G    Y+FMQ+ GL  DHL DC
Subjt:  DF-------GSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]6.2e-3536.76Show/hide
Query:  LPKSIQQQSQELSDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIT
        L KS+  ++Q+ ++G  E  RC W T   +   K Y  +HD  WG P+++D +LFE L L G     +W  I+K+RE FR AF  F+P  VAN  E +I 
Subjt:  LPKSIQQQSQELSDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIT

Query:  DIASDKAIMLVESRI-------------ARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLV
        ++  ++ I+   ++I              R+FGSF  Y+W ++  KP IN F    ++P  +P ++ I+KD+ KRGF+FVG   +Y+ MQ+ G+  DHL 
Subjt:  DIASDKAIMLVESRI-------------ARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLV

Query:  DCFR
         CF+
Subjt:  DCFR

Arabidopsis top hitse value%identityAlignment
AT1G13635.1 DNA glycosylase superfamily protein5.0e-10464.05Show/hide
Query:  RRQILERQTCPKEKD-RTSQHILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQ-S
        R++I+E+    +EK+ + + +  +KHLK+IYPI LQR T+SS SLSS+SLSLSQNS DS  TDS+  L+QKIS A+ LI+   P RRE+ +PKSI QQ  
Subjt:  RRQILERQTCPKEKD-RTSQHILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQ-S

Query:  QEL-SDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVES
        Q+  S  E +RCNWIT  SD+ YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA MGEKEI +IAS+KAIML ES
Subjt:  QEL-SDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVES

Query:  R-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE
        R             +  +FGSFS+++W +M++KP IN+F++ RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRHG+CV+LAE
Subjt:  R-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE

Query:  RPWRHI
        RPWRHI
Subjt:  RPWRHI

AT1G13635.2 DNA glycosylase superfamily protein5.0e-10464.05Show/hide
Query:  RRQILERQTCPKEKD-RTSQHILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQ-S
        R++I+E+    +EK+ + + +  +KHLK+IYPI LQR T+SS SLSS+SLSLSQNS DS  TDS+  L+QKIS A+ LI+   P RRE+ +PKSI QQ  
Subjt:  RRQILERQTCPKEKD-RTSQHILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQ-S

Query:  QEL-SDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVES
        Q+  S  E +RCNWIT  SD+ YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA MGEKEI +IAS+KAIML ES
Subjt:  QEL-SDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVES

Query:  R-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE
        R             +  +FGSFS+++W +M++KP IN+F++ RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRHG+CV+LAE
Subjt:  R-------------IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE

Query:  RPWRHI
        RPWRHI
Subjt:  RPWRHI

AT1G80850.1 DNA glycosylase superfamily protein2.9e-5141Show/hide
Query:  LSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQE-----LSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDN
        L  + +S++ S +S+ SS  +SS       S   R++           L +++ ++  E       DG  +RC WIT  SD+ Y++FHDE WGVPV+DD 
Subjt:  LSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQE-----LSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDN

Query:  RLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITD--------IASDKAIMLVES-----RIARDFGSFSNYMWSYMNFKPTINRF
        RLFELL+LSG L + +W +I+ +R+LFRE F  F+P  ++ +  K+IT         ++  K   ++E+     +I   FGSF  Y+W+++N KPT ++F
Subjt:  RLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITD--------IASDKAIMLVES-----RIARDFGSFSNYMWSYMNFKPTINRF

Query:  RHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE
        R+PR VP+++ KAE ISKD+V+RGFR V P ++YSFMQ AGLT DHL  CFRH +C+   E
Subjt:  RHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE

AT5G57970.1 DNA glycosylase superfamily protein1.7e-5146.88Show/hide
Query:  RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVE----------
        +RC W+T  SD  Y+ FHDE WGVPV+DD RLFELL LSG L ++ W  I+ +R+ FRE FA F+P+ +  + EK+I    S  + +L +          
Subjt:  RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVE----------

Query:  ---SRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER
            ++  ++GSF  Y+WS++  K  +++FR+ R VP ++PKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  CFR   C+   ER
Subjt:  ---SRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER

AT5G57970.2 DNA glycosylase superfamily protein1.7e-5146.88Show/hide
Query:  RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVE----------
        +RC W+T  SD  Y+ FHDE WGVPV+DD RLFELL LSG L ++ W  I+ +R+ FRE FA F+P+ +  + EK+I    S  + +L +          
Subjt:  RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVE----------

Query:  ---SRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER
            ++  ++GSF  Y+WS++  K  +++FR+ R VP ++PKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  CFR   C+   ER
Subjt:  ---SRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCCAAAGCCACTGTTAGAAGACAGATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAACACATTTTGTCCAAACACCTTAAGAAGATTTA
CCCAATTGGGTTACAAAGAACCACTTCATCACTATCTTTATCTTCACTATCATTGTCTTTGTCTCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAATTGG
ATCAGAAAATTTCGTACGCAATTCGCCTTATTACGCCGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATCCAACAACAAAGTCAAGAACTTAGTGATGGG
GAATTGAGGAGGTGCAACTGGATCACCCATACCAGTGATAAAGCCTATGTATCTTTTCATGACGAGTGTTGGGGTGTCCCAGTATACGATGACAACCGACTTTTCGAGCT
ACTCGCACTATCTGGGATGTTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACAGTTGCCAATA
TGGGGGAGAAAGAGATAACAGATATAGCTTCTGACAAGGCCATTATGCTGGTGGAGAGCAGAATAGCTAGAGATTTTGGATCGTTTAGTAACTATATGTGGAGCTATATG
AACTTTAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGT
TGGGCCAGTGATTGTCTATTCATTCATGCAAGCGGCTGGGTTGACCATCGATCATCTTGTCGATTGTTTTCGACACGGTGAATGTGTAAATCTTGCAGAAAGGCCATGGA
GACATATCTGA
mRNA sequenceShow/hide mRNA sequence
CAAACCTTTATAATCTCAATCCATTTTTGTATCAAAATTGTCCCATTAAGTCTTATCTTATTTTCCAATATGTCATCCAAAGCCACTGTTAGAAGACAGATTCTGGAGAG
GCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAACACATTTTGTCCAAACACCTTAAGAAGATTTACCCAATTGGGTTACAAAGAACCACTTCATCACTATCTTTAT
CTTCACTATCATTGTCTTTGTCTCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAATTGGATCAGAAAATTTCGTACGCAATTCGCCTTATTACGCCGCCG
CCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATCCAACAACAAAGTCAAGAACTTAGTGATGGGGAATTGAGGAGGTGCAACTGGATCACCCATACCAGTGATAA
AGCCTATGTATCTTTTCATGACGAGTGTTGGGGTGTCCCAGTATACGATGACAACCGACTTTTCGAGCTACTCGCACTATCTGGGATGTTGATGGACTACAATTGGACTG
AAATTGTGAAAAGAAGGGAACTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACAGTTGCCAATATGGGGGAGAAAGAGATAACAGATATAGCTTCTGACAAGGCC
ATTATGCTGGTGGAGAGCAGAATAGCTAGAGATTTTGGATCGTTTAGTAACTATATGTGGAGCTATATGAACTTTAAACCTACAATAAACAGATTTAGACATCCAAGAAA
TGTTCCCTTGAGAAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGTTGGGCCAGTGATTGTCTATTCATTCATGCAAGCGGCTGGGT
TGACCATCGATCATCTTGTCGATTGTTTTCGACACGGTGAATGTGTAAATCTTGCAGAAAGGCCATGGAGACATATCTGA
Protein sequenceShow/hide protein sequence
MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDG
ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRIARDFGSFSNYMWSYM
NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI