; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004449 (gene) of Snake gourd v1 genome

Gene IDTan0004449
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA glycosylase superfamily protein
Genome locationLG04:8427077..8429257
RNA-Seq ExpressionTan0004449
SyntenyTan0004449
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135425.1 uncharacterized protein LOC101218195 [Cucumis sativus]1.9e-16494.16Show/hide
Query:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ
        MSSK+TVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SS+SLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVP+ KSIQ
Subjt:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ

Query:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV
         QSQEL DGELRRCNWITHTSD+AYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITD+ASDKAIMLV
Subjt:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWSY+NFKPTINRFR+PRNVPLR+PKAE ISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

XP_008446481.2 PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo]1.2e-16394.48Show/hide
Query:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ
        MSSK+TVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVP+ KSIQ
Subjt:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ

Query:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV
         QSQEL DGELRRCNWITHTSD+AYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITDIASDKAIMLV
Subjt:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWS +NFKPTINRFR+PRNVPLR+PKAE ISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

XP_022149074.1 uncharacterized protein LOC111017575 [Momordica charantia]1.9e-16494.81Show/hide
Query:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ
        MSSK+TVRRQ+LERQTCPKEKDRTSQNILSK LKKIYPIGLQR+SSS SFSSLSLSLSQNSNDSSLTDSS QLDQKISYAIRLI PPPERREVPI KSIQ
Subjt:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ

Query:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV
         QSQELGDGELRRCNWITHTSD+AYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEI DIASDKAIMLV
Subjt:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWSY+NFKPTINR+RYPRNVPLR+PKAE ISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

XP_022993235.1 uncharacterized protein LOC111489316 [Cucurbita maxima]2.1e-16394.19Show/hide
Query:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT--PPPERREVPIAKS
        MSSK+TVRR+ILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SSLSLSLSQNSNDSSLTDSSIQLD+KISYAIRLIT  PPPERRE P+ KS
Subjt:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT--PPPERREVPIAKS

Query:  IQPQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM
        +Q Q QELGDGELRRCNWITHTSD+AYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIM
Subjt:  IQPQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM

Query:  LVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
        LVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLR+PKAE ISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
Subjt:  LVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

XP_023528370.1 uncharacterized protein LOC111791309 [Cucurbita pepo subsp. pepo]4.6e-16393.59Show/hide
Query:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT----PPPERREVPIA
        MSSK+TVRR+ILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SS SLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT    PPPERRE P+ 
Subjt:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT----PPPERREVPIA

Query:  KSIQPQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKA
        KS+Q Q QELGDGELRRCNWITHTSD+AYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI DIASDKA
Subjt:  KSIQPQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKA

Query:  IMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGE
        IMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLR+PKAE ISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGE
Subjt:  IMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGE

Query:  CVNLAERPWRHI
        CVNLAERPWRHI
Subjt:  CVNLAERPWRHI

TrEMBL top hitse value%identityAlignment
A0A0A0KUC5 Uncharacterized protein9.0e-16594.16Show/hide
Query:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ
        MSSK+TVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SS+SLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVP+ KSIQ
Subjt:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ

Query:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV
         QSQEL DGELRRCNWITHTSD+AYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITD+ASDKAIMLV
Subjt:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWSY+NFKPTINRFR+PRNVPLR+PKAE ISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A1S3BEN5 DNA-3-methyladenine glycosylase 15.9e-16494.48Show/hide
Query:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ
        MSSK+TVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVP+ KSIQ
Subjt:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ

Query:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV
         QSQEL DGELRRCNWITHTSD+AYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITDIASDKAIMLV
Subjt:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWS +NFKPTINRFR+PRNVPLR+PKAE ISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A5D3CCU6 DNA-3-methyladenine glycosylase 15.9e-16494.48Show/hide
Query:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ
        MSSK+TVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVP+ KSIQ
Subjt:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ

Query:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV
         QSQEL DGELRRCNWITHTSD+AYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS VANMGEKEITDIASDKAIMLV
Subjt:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWS +NFKPTINRFR+PRNVPLR+PKAE ISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A6J1D5X7 uncharacterized protein LOC1110175759.0e-16594.81Show/hide
Query:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ
        MSSK+TVRRQ+LERQTCPKEKDRTSQNILSK LKKIYPIGLQR+SSS SFSSLSLSLSQNSNDSSLTDSS QLDQKISYAIRLI PPPERREVPI KSIQ
Subjt:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQ

Query:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV
         QSQELGDGELRRCNWITHTSD+AYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEI DIASDKAIMLV
Subjt:  PQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLV

Query:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL
        ESRVRCIVDNAKCILKIARDFGSFSNYMWSY+NFKPTINR+RYPRNVPLR+PKAE ISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRH ECVNL
Subjt:  ESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNL

Query:  AERPWRHI
        AERPWRHI
Subjt:  AERPWRHI

A0A6J1JY14 uncharacterized protein LOC1114893161.0e-16394.19Show/hide
Query:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT--PPPERREVPIAKS
        MSSK+TVRR+ILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRT+SSLS SSLSLSLSQNSNDSSLTDSSIQLD+KISYAIRLIT  PPPERRE P+ KS
Subjt:  MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLIT--PPPERREVPIAKS

Query:  IQPQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM
        +Q Q QELGDGELRRCNWITHTSD+AYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE STVANMGEKEI+DIASDKAIM
Subjt:  IQPQSQELGDGELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIM

Query:  LVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
        LVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLR+PKAE ISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
Subjt:  LVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV

Query:  NLAERPWRHI
        NLAERPWRHI
Subjt:  NLAERPWRHI

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 11.9e-3435.36Show/hide
Query:  LRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDN
        + RC W+  + D  Y+++HD  WGVP  D  +LFE++ L G     +W  ++K+RE +R  F  F+P  VA M E+++  +  D  I+    +++ I+ N
Subjt:  LRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDN

Query:  AKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
        A+  L++ ++   F +++WS++N +P + +      +P  T  ++ +SK + KRGF+FVG  I YSFMQA GL  DH+V C
Subjt:  AKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

P44321 DNA-3-methyladenine glycosylase6.1e-3336.87Show/hide
Query:  RCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAK
        RC W+   S   Y+ +HD+ WG P +D  +LFE + L G     +W  ++K+RE +REAF  F+P  +A M   +I     +  ++   +++  IV NAK
Subjt:  RCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAK

Query:  CILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC
          L + +   +FS+++WS++N KP +N     R+VP +T  ++ +SK + KRGF F+G    Y+FMQ+ GL  DHL DC
Subjt:  CILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]1.5e-3937.75Show/hide
Query:  IAKSIQPQSQELGDG--ELRRCNWITHTSDEA---YVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIT
        + KS+  ++Q+  +G  E  RC W T   + A   Y  +HD  WG P+++D +LFE L L G     +W  I+K+RE FR AF  F+P  VAN  E +I 
Subjt:  IAKSIQPQSQELGDG--ELRRCNWITHTSDEA---YVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEIT

Query:  DIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLV
        ++  ++ I+   +++   + NAK  + + R+FGSF  Y+W ++  KP IN F    ++P  TP ++ I+KD+ KRGF+FVG   +Y+ MQ+ G+  DHL 
Subjt:  DIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLV

Query:  DCFR
         CF+
Subjt:  DCFR

Arabidopsis top hitse value%identityAlignment
AT1G13635.1 DNA glycosylase superfamily protein5.5e-11466.99Show/hide
Query:  MSSKSTVRRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKS
        M   S  R++I+E+    +EK+ + + N  +KHLK+IYPI LQR TSSS S SS+SLSLSQNS DS  TDS+  L+QKIS A+ LI+  P RRE+ + KS
Subjt:  MSSKSTVRRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKS

Query:  IQPQ-SQELGDG-ELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKA
        I  Q  Q+     E +RCNWIT  SDE YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA MGEKEI +IAS+KA
Subjt:  IQPQ-SQELGDG-ELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKA

Query:  IMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGE
        IML ESRVRCIVDNAKCI K+  +FGSFS+++W +M++KP IN+F+Y RNVPLR+PKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRHG+
Subjt:  IMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGE

Query:  CVNLAERPWRHI
        CV+LAERPWRHI
Subjt:  CVNLAERPWRHI

AT1G13635.2 DNA glycosylase superfamily protein5.5e-11466.99Show/hide
Query:  MSSKSTVRRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKS
        M   S  R++I+E+    +EK+ + + N  +KHLK+IYPI LQR TSSS S SS+SLSLSQNS DS  TDS+  L+QKIS A+ LI+  P RRE+ + KS
Subjt:  MSSKSTVRRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKS

Query:  IQPQ-SQELGDG-ELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKA
        I  Q  Q+     E +RCNWIT  SDE YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA MGEKEI +IAS+KA
Subjt:  IQPQ-SQELGDG-ELRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKA

Query:  IMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGE
        IML ESRVRCIVDNAKCI K+  +FGSFS+++W +M++KP IN+F+Y RNVPLR+PKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRHG+
Subjt:  IMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGE

Query:  CVNLAERPWRHI
        CV+LAERPWRHI
Subjt:  CVNLAERPWRHI

AT5G44680.1 DNA glycosylase superfamily protein5.6e-5854.21Show/hide
Query:  RRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNA
        +RC++IT +SD  YV++HD+ WGVPV+DDN LFELL L+G  +  +WT ++KRR  FREAF+GFE   VA+  EK+I  I +D  I L  S+V  +VDNA
Subjt:  RRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNA

Query:  KCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLA
        K ILK+ RD GSF+ Y+W +M  KP   ++   + +P++T K+ETISKDMV+RGFRFVGP +++S MQAAGLT DHL+ C RH EC  +A
Subjt:  KCILKIARDFGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLA

AT5G57970.1 DNA glycosylase superfamily protein3.3e-5844.03Show/hide
Query:  LQRTSSSLSFSSLSLSLSQNSNDSSLTD-----SSIQLDQKISYAIRLITPPPERREVPIAKSIQPQSQELGDGELRRCNWITHTSDEAYVSFHDECWGV
        L+R   +L+ S+LSL+ S  S+D+S+       S+ +L +  S   R  + P + R V    ++   S   G    +RC W+T  SD  Y+ FHDE WGV
Subjt:  LQRTSSSLSFSSLSLSLSQNSNDSSLTD-----SSIQLDQKISYAIRLITPPPERREVPIAKSIQPQSQELGDGELRRCNWITHTSDEAYVSFHDECWGV

Query:  PVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFK
        PV+DD RLFELL LSG L ++ W  I+ +R+ FRE FA F+P+ +  + EK+I    S  + +L + ++R +++NA+ ILK+  ++GSF  Y+WS++  K
Subjt:  PVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFK

Query:  PTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER
          +++FRY R VP +TPKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  CFR   C+   ER
Subjt:  PTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER

AT5G57970.2 DNA glycosylase superfamily protein3.3e-5844.03Show/hide
Query:  LQRTSSSLSFSSLSLSLSQNSNDSSLTD-----SSIQLDQKISYAIRLITPPPERREVPIAKSIQPQSQELGDGELRRCNWITHTSDEAYVSFHDECWGV
        L+R   +L+ S+LSL+ S  S+D+S+       S+ +L +  S   R  + P + R V    ++   S   G    +RC W+T  SD  Y+ FHDE WGV
Subjt:  LQRTSSSLSFSSLSLSLSQNSNDSSLTD-----SSIQLDQKISYAIRLITPPPERREVPIAKSIQPQSQELGDGELRRCNWITHTSDEAYVSFHDECWGV

Query:  PVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFK
        PV+DD RLFELL LSG L ++ W  I+ +R+ FRE FA F+P+ +  + EK+I    S  + +L + ++R +++NA+ ILK+  ++GSF  Y+WS++  K
Subjt:  PVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFK

Query:  PTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER
          +++FRY R VP +TPKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  CFR   C+   ER
Subjt:  PTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCCAAATCCACTGTTAGAAGACAGATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAGAACATTTTGTCCAAACATCTTAAGAAGATTTA
CCCAATTGGCCTGCAAAGAACCAGTTCATCATTGTCTTTCTCTTCACTATCATTGTCTTTGTCCCAAAATTCAAACGACTCTTCTCTTACAGACTCCTCAATCCAACTGG
ATCAGAAAATTTCATACGCAATTCGCCTCATTACGCCGCCTCCTGAAAGAAGAGAAGTCCCAATAGCTAAAAGTATCCAACCACAAAGTCAAGAACTTGGTGATGGGGAA
TTAAGGAGATGCAACTGGATCACGCATACTAGTGATGAAGCCTATGTATCATTTCATGATGAGTGTTGGGGCGTCCCGGTGTATGACGACAATCGACTTTTCGAGCTACT
TGCACTGTCCGGGATGCTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTTTTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACTGTTGCCAACATGG
GGGAGAAAGAGATAACAGATATAGCATCTGACAAGGCCATTATGCTTGTGGAGAGCAGAGTGAGGTGCATAGTAGACAATGCAAAATGCATATTGAAGATAGCAAGAGAT
TTTGGATCATTCAGTAACTATATGTGGAGCTATATGAACTTTAAACCAACAATAAACAGATTTAGATATCCAAGAAATGTTCCCTTGAGAACTCCCAAAGCAGAAACCAT
TAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGTTGGGCCAGTGATTGTCTACTCATTCATGCAAGCTGCAGGATTGACGATTGATCATCTCGTCGATTGTTTTCGGC
ACGGTGAATGCGTAAATCTTGCAGAAAGACCATGGAGACATATCTGA
mRNA sequenceShow/hide mRNA sequence
CTCAATCAACTTTTGTATCAAACTGTCCCATTAAACCTTTTTTTTTCCAGTATGTCATCCAAATCCACTGTTAGAAGACAGATTCTGGAGAGGCAAACATGTCCTAAAGA
GAAAGATAGGACAAGCCAGAACATTTTGTCCAAACATCTTAAGAAGATTTACCCAATTGGCCTGCAAAGAACCAGTTCATCATTGTCTTTCTCTTCACTATCATTGTCTT
TGTCCCAAAATTCAAACGACTCTTCTCTTACAGACTCCTCAATCCAACTGGATCAGAAAATTTCATACGCAATTCGCCTCATTACGCCGCCTCCTGAAAGAAGAGAAGTC
CCAATAGCTAAAAGTATCCAACCACAAAGTCAAGAACTTGGTGATGGGGAATTAAGGAGATGCAACTGGATCACGCATACTAGTGATGAAGCCTATGTATCATTTCATGA
TGAGTGTTGGGGCGTCCCGGTGTATGACGACAATCGACTTTTCGAGCTACTTGCACTGTCCGGGATGCTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAAC
TTTTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACTGTTGCCAACATGGGGGAGAAAGAGATAACAGATATAGCATCTGACAAGGCCATTATGCTTGTGGAGAGCAGA
GTGAGGTGCATAGTAGACAATGCAAAATGCATATTGAAGATAGCAAGAGATTTTGGATCATTCAGTAACTATATGTGGAGCTATATGAACTTTAAACCAACAATAAACAG
ATTTAGATATCCAAGAAATGTTCCCTTGAGAACTCCCAAAGCAGAAACCATTAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGTTGGGCCAGTGATTGTCTACTCAT
TCATGCAAGCTGCAGGATTGACGATTGATCATCTCGTCGATTGTTTTCGGCACGGTGAATGCGTAAATCTTGCAGAAAGACCATGGAGACATATCTGAGTTCAACATTTC
CTAATTTCCCTTCAAAGTTGTGGCCTTTTGTTTTTCTTGTGAGCATTAAGTTAAAATATAAAAATTATACATAGAGAAAGAGAGAAGAACTAAGATGAATTATCAGCCTT
TATTTATCAGCTTCTGTTAAAATGGCCAATTCTGGTACTGGACAACTTAGAGCCTTAAGAATCTGATCTCTTAATATCAACAAGTGCAGAATCAACAAGTGCAGAGCTCA
TAGTTACACCAGTTTCGGAGCAACGGTTATCAACAAGTTGAAGCTGGTGGAGAATAAGAATGGTGTATCTAATTTGTTTAAGTATTTTATTGAGGAGTGTTTCAAAATCC
TCCAGTTATGCTTCTAATATATAAGAAACTAATTATTTTAGATGTTTGGTGTGGATCTTTTGAGTTTCATCTTATTATTGTTTCAGAATCTAATCACATAGATGAAAGCA
GTTGGGGTCAAAAATGCTTGTCTATGTTCTATCTCTTACTTTCTAAACAAGCTCTCATTGCTCAAGACAGAGAGAACTAAAAATGTGCTTAGTTCATAAAGAACTTCAAG
ATATAACTAAAATCAATTAAAGCAGACATGACTAAAGAAAAATG
Protein sequenceShow/hide protein sequence
MSSKSTVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTSSSLSFSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPIAKSIQPQSQELGDGE
LRRCNWITHTSDEAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIARD
FGSFSNYMWSYMNFKPTINRFRYPRNVPLRTPKAETISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI