; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0011968 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0011968
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionAdenine DNA glycosylase
Genome locationchr11:29627430..29629303
RNA-Seq ExpressionIVF0011968
SyntenyIVF0011968
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0000701 - purine-specific mismatch base pair DNA N-glycosylase activity (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
InterPro domainsIPR000445 - Helix-hairpin-helix motif
IPR003265 - HhH-GPD domain
IPR004036 - Endonuclease III-like, conserved site-2
IPR005760 - A/G-specific adenine glycosylase MutY
IPR011257 - DNA glycosylase
IPR015797 - NUDIX hydrolase-like domain superfamily
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
IPR029119 - MutY, C-terminal
IPR044298 - Adenine/Thymine-DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33687.1 A/G-specific adenine DNA glycosylase [Cucumis melo subsp. melo]4.28e-26894.76Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISG PKDPKLIKQVWKAAAQLVD SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD +TRRESIDSLL              +NREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  G
        G
Subjt:  G

KAA0039872.1 adenine DNA glycosylase isoform X1 [Cucumis melo var. makuwa]9.34e-26690.38Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEV-----------
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEV           
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEV-----------

Query:  --------------VPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS
                      VPVVDGNVIRVIARLKAISG PKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS
Subjt:  --------------VPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS

Query:  VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INRED
        VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL              +NRED
Subjt:  VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INRED

Query:  VGDFIHVFTHIRLKIYVEHLVLCLKG
        VGDFIHVFTHIRLKIYVEHLVLCLKG
Subjt:  VGDFIHVFTHIRLKIYVEHLVLCLKG

TYK24629.1 adenine DNA glycosylase isoform X1 [Cucumis melo var. makuwa]1.15e-27296.01Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISG PKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL              +NREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  G
        G
Subjt:  G

XP_008459934.1 PREDICTED: adenine DNA glycosylase isoform X1 [Cucumis melo]4.51e-26794.76Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISG PKDPKLIKQVWKAAAQLVD SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD +TRRESIDSLL              +NREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  G
        G
Subjt:  G

XP_016902459.1 PREDICTED: adenine DNA glycosylase isoform X2 [Cucumis melo]3.23e-26794.76Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISG PKDPKLIKQVWKAAAQLVD SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD +TRRESIDSLL              +NREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  G
        G
Subjt:  G

TrEMBL top hitse value%identityAlignment
A0A1S3CBT2 Adenine DNA glycosylase6.1e-21194.76Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISG PKDPKLIKQVWKAAAQLVD SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD +TRRESIDSLL              +NREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  G
        G
Subjt:  G

A0A1S4E2J7 Adenine DNA glycosylase6.1e-21194.76Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISG PKDPKLIKQVWKAAAQLVD SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD +TRRESIDSLL              +NREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  G
        G
Subjt:  G

A0A5A7T8X3 Adenine DNA glycosylase3.0e-21090.38Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGE------------
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGE            
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGE------------

Query:  -------------VVPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS
                     VVPVVDGNVIRVIARLKAISG PKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS
Subjt:  -------------VVPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSS

Query:  VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INRED
        VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL              +NRED
Subjt:  VLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INRED

Query:  VGDFIHVFTHIRLKIYVEHLVLCLKG
        VGDFIHVFTHIRLKIYVEHLVLCLKG
Subjt:  VGDFIHVFTHIRLKIYVEHLVLCLKG

A0A5D3DMU5 Adenine DNA glycosylase2.0e-21496.01Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISG PKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL              +NREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  G
        G
Subjt:  G

E5GB45 Adenine DNA glycosylase6.1e-21194.76Show/hide
Query:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
        MSDGEKNENEENVKKKTDFRRKKKPTT+RKRR RSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT
Subjt:  MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
        RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRV

Query:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISG PKDPKLIKQVWKAAAQLVD SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD +TRRESIDSLL              +NREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL--------------INREDVGDFIHVFTHIRLKIYVEHLVLCLK

Query:  G
        G
Subjt:  G

SwissProt top hitse value%identityAlignment
F4JRF4 Adenine DNA glycosylase3.6e-11554.01Show/hide
Query:  DGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ
        + E+ E  E  + + D    ++ + E +      +++E      DIED +FS +  Q IR  LLDWYD ++RDLPWR+   + E E RAY VWVSEIMLQ
Subjt:  DGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ

Query:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAG
        QTRVQTV+++Y RWM KWPT+  L +ASLE                   EVNEMWAGLGYYRRARFL EGAKM+V     FP   SSL K+ GIG+YTAG
Subjt:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAG

Query:  AIASIAFGEVVPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVT
        AIASIAF E VPVVDGNVIRV+ARLKAIS  PKD    +  WK AAQLVDPSRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+ + ++ VT
Subjt:  AIASIAFGEVVPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVT

Query:  DYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL----------------INREDVG
        DYP K IK K RHD+  VCV+EI   +     G   RF+LVKRP++GLLAGLWEFPSV L+ EAD  TRR +I+  L                ++RE++G
Subjt:  DYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL----------------INREDVG

Query:  DFIHVFTHIRLKIYVEHLVLCLKG
        +F+H+FTHIR K+YVE LV+ L G
Subjt:  DFIHVFTHIRLKIYVEHLVLCLKG

Q10159 Adenine DNA glycosylase2.6e-5736.81Show/hide
Query:  VQTIRASLLDWYDRSRRDLPWRSL------------DKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE-EVNEMWAGLGYYR
        V+  R SL+ +YD+++R LPWR              D  +P  R Y V VSEIMLQQTRV+TV ++Y +WM   PT++  + A    +V  +W+G+G+Y 
Subjt:  VQTIRASLLDWYDRSRRDLPWRSL------------DKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE-EVNEMWAGLGYYR

Query:  RARFLFEGAKMIVK-EGGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQA
        R + L +  + + K      P+T     K IPG+G YTAGA+ SIA+ +   +VDGNVIRV++R  AI       K    +WK A +LVDP RPGDFNQA
Subjt:  RARFLFEGAKMIVK-EGGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQA

Query:  LMELGATLCTPTNPSCSTCPVFDHCEA--------------LSISKRDSSVLVTD--------------YPAKGIKTKQRHDYSAVCVVEILESQGTSEL
        LMELGA  CTP +P CS CP+ + C+A                I     ++ +TD              YP    KTKQR + + V +      Q T   
Subjt:  LMELGATLCTPTNPSCSTCPVFDHCEA--------------LSISKRDSSVLVTD--------------YPAKGIKTKQRHDYSAVCVVEILESQGTSEL

Query:  GQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEA---DLTTR-RESI--------DSLLINREDVGDFIHVFTHIRLKIYVEHLV
         +   FL+ KRP  GLLAGLW+FP++    E+   D+    ++SI         SL+   +  G ++H+F+HIR   +V + +
Subjt:  GQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEA---DLTTR-RESI--------DSLLINREDVGDFIHVFTHIRLKIYVEHLV

Q8R5G2 Adenine DNA glycosylase4.1e-7941.8Show/hide
Query:  KKKTDFRRKKKPTTERKRRGRSPSKS--------------------EAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEP--ETRAYGVW
        K +   R  KK     KRRG+    S                    +  V    +   I +V   R +LL WYD+ +RDLPWR   K E   + RAY VW
Subjt:  KKKTDFRRKKKPTTERKRRGRSPSKS--------------------EAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEP--ETRAYGVW

Query:  VSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEV
        VSE+MLQQT+V TV+ +Y RWM KWPT+Q L+ ASLEEVN++W+GLGYY R R L EGA+ +V+E GG  P+T  +L++ +PG+G YTAGAIASIAF +V
Subjt:  VSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEV

Query:  VPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA-----------------------
          VVDGNVIRV+ R++AI   P    +   +W  A QLVDP+RPGDFNQA MELGAT+CTP  P C+ CPV   C A                       
Subjt:  VPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA-----------------------

Query:  ---------LSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD-------LTTRRE
                  S +  D ++ V ++P K  +   R +YSA CVVE   + G          LLV+RP+ GLLAGLWEFPSV+L+              +  
Subjt:  ---------LSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD-------LTTRRE

Query:  SIDSLLINREDVGDFIHVFTHIRLKIYVEHLVL
        S        + +G+ IHVF+HI+L   V  L L
Subjt:  SIDSLLINREDVGDFIHVFTHIRLKIYVEHLVL

Q99P21 Adenine DNA glycosylase1.2e-8343.06Show/hide
Query:  KKKPTTERKRRGRSPSKSEAVVDIED-------------------IMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGE--PETRAYGVWVSEIMLQQT
        KK+P   ++RR R+ S S+A     D                   +   + +V   R++LL WYD+ +RDLPWR+L K E   + RAY VWVSE+MLQQT
Subjt:  KKKPTTERKRRGRSPSKSEAVVDIED-------------------IMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGE--PETRAYGVWVSEIMLQQT

Query:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVI
        +V TV+ +Y RWM KWP +Q L+ ASLEEVN++W+GLGYY R R L EGA+ +V+E GG  P+T  +L++ +PG+G YTAGAIASIAF +V  VVDGNV+
Subjt:  RVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVI

Query:  RVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA--------------------------------
        RV+ R++AI   P    +   +W  A QLVDP+RPGDFNQA MELGAT+CTP  P CS CPV   C A                                
Subjt:  RVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA--------------------------------

Query:  LSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSL--------LIN
         S S  D S+ V ++P K  +   R +YSA CVVE   + G          LLV+RPD GLLAGLWEFPSV+L+  ++    +  +  L         I 
Subjt:  LSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSL--------LIN

Query:  REDVGDFIHVFTHIRLKIYVEHLVL
         + +G+ IH+F+HI+L   V  L L
Subjt:  REDVGDFIHVFTHIRLKIYVEHLVL

Q9UIF7 Adenine DNA glycosylase4.9e-8043.22Show/hide
Query:  EAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPW--RSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGL
        +A V    +   +  V   R SLL WYD+ +RDLPW  R+ D+ + + RAY VWVSE+MLQQT+V TV+ +Y  WM KWPT+Q L+ ASLEEVN++WAGL
Subjt:  EAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPW--RSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGL

Query:  GYYRRARFLFEGAKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGD
        GYY R R L EGA+ +V+E GG  P+T  +L++ +PG+G YTAGAIASIAFG+   VVDGNV RV+ R++AI   P    + +Q+W  A QLVDP+RPGD
Subjt:  GYYRRARFLFEGAKMIVKE-GGRFPKTVSSLRK-IPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGD

Query:  FNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKR-----------------------------------DSSVLVTDYPAKGIKTKQRHDYSAVCVV
        FNQA MELGAT+CTP  P CS CPV   C A    ++                                   D ++ V ++P K  +   R + SA CV+
Subjt:  FNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKR-----------------------------------DSSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSL--------LINREDVGDFIHVFTHIRLKIYVEHLVL
        E   + G       ++ LLV+RP+ GLLAGLWEFPSV+ +    L  R+  +  L          +   +G+ +H F+HI+L   V  L L
Subjt:  EILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSL--------LINREDVGDFIHVFTHIRLKIYVEHLVL

Arabidopsis top hitse value%identityAlignment
AT1G05900.2 endonuclease III 27.1e-1025.89Show/hide
Query:  PETRAYGVWVSEIMLQQTRVQ----TVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYY-RRARFLFEGAKMIVKE-GGRFPKTVSSLRKIPGIGEY
        P+ R + V +  ++  QT+       V + +   +L   T + + +A    + E+   +G+Y R+A  + + AK+ + E  G  P+T+  L  +PG+G  
Subjt:  PETRAYGVWVSEIMLQQTRVQ----TVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYY-RRARFLFEGAKMIVKE-GGRFPKTVSSLRKIPGIGEY

Query:  TAGAIASIAFGEVVPV-VDGNVIRVIARL--------KAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHC-EA
         A  +  +A+ +V  + VD +V R+  RL        K  +  P++ ++  Q W    + V        N  L+  G T+CTP  P C TC + + C  A
Subjt:  TAGAIASIAFGEVVPV-VDGNVIRVIARL--------KAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHC-EA

Query:  LSISKRDSSVLVTDYPAKGIKTKQ
           +   SS L      K IK+K+
Subjt:  LSISKRDSSVLVTDYPAKGIKTKQ

AT4G12740.1 HhH-GPD base excision DNA repair family protein2.5e-11654.01Show/hide
Query:  DGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ
        + E+ E  E  + + D    ++ + E +      +++E      DIED +FS +  Q IR  LLDWYD ++RDLPWR+   + E E RAY VWVSEIMLQ
Subjt:  DGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVV---DIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRS-LDKGEPETRAYGVWVSEIMLQ

Query:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAG
        QTRVQTV+++Y RWM KWPT+  L +ASLE                   EVNEMWAGLGYYRRARFL EGAKM+V     FP   SSL K+ GIG+YTAG
Subjt:  QTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAG

Query:  AIASIAFGEVVPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVT
        AIASIAF E VPVVDGNVIRV+ARLKAIS  PKD    +  WK AAQLVDPSRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+ + ++ VT
Subjt:  AIASIAFGEVVPVVDGNVIRVIARLKAISGKPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVT

Query:  DYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL----------------INREDVG
        DYP K IK K RHD+  VCV+EI   +     G   RF+LVKRP++GLLAGLWEFPSV L+ EAD  TRR +I+  L                ++RE++G
Subjt:  DYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLTTRRESIDSLL----------------INREDVG

Query:  DFIHVFTHIRLKIYVEHLVLCLKG
        +F+H+FTHIR K+YVE LV+ L G
Subjt:  DFIHVFTHIRLKIYVEHLVLCLKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGACGGAGAAAAGAATGAGAACGAGGAGAATGTGAAGAAAAAGACTGACTTTCGTCGGAAAAAGAAACCCACGACGGAACGGAAACGCCGGGGCCGAAGTCCGTC
TAAAAGTGAAGCAGTTGTTGACATTGAAGATATTATGTTCAGCATAGACAATGTTCAGACAATCAGGGCCTCGCTCTTGGATTGGTACGACCGTAGCCGCAGGGACCTTC
CATGGAGGAGCTTGGACAAAGGGGAACCTGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAGACCAGAGTTCAGACCGTCGTCCAATTTTACAAC
CGTTGGATGCTTAAATGGCCTACCGTTCAACATCTCTCTCGTGCTTCTCTTGAGGAGGTGAATGAAATGTGGGCAGGCTTGGGGTATTATAGACGAGCTCGTTTTCTTTT
TGAGGGTGCAAAGATGATAGTCAAAGAAGGTGGTAGATTTCCTAAAACAGTTTCTTCCCTTCGAAAAATTCCAGGAATTGGAGAATACACAGCAGGGGCTATTGCCTCTA
TAGCGTTCGGTGAAGTGGTGCCTGTAGTCGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCTATTTCAGGAAAACCAAAAGACCCGAAGTTGATCAAGCAAGTT
TGGAAGGCAGCTGCTCAATTAGTTGATCCTTCCAGGCCTGGGGATTTCAATCAGGCACTCATGGAACTTGGTGCAACTTTATGCACTCCAACAAACCCAAGCTGCTCAAC
GTGCCCCGTGTTTGATCACTGTGAGGCCCTTTCAATCTCAAAGCGTGATAGTTCAGTTCTTGTCACAGATTATCCCGCTAAGGGGATAAAGACCAAACAAAGACATGATT
ATTCTGCTGTATGTGTGGTTGAGATATTGGAAAGTCAGGGTACATCTGAGTTAGGGCAATCTAGTAGATTTCTTCTTGTAAAGAGGCCTGATGAAGGTTTGCTTGCTGGT
CTATGGGAGTTTCCATCTGTCTCGTTGGATGGAGAAGCTGATTTAACCACAAGGAGAGAATCCATCGATAGCCTCTTGATCAATAGAGAAGATGTTGGAGATTTTATCCA
TGTTTTCACGCACATCCGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTTTAAAAGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCGACGGAGAAAAGAATGAGAACGAGGAGAATGTGAAGAAAAAGACTGACTTTCGTCGGAAAAAGAAACCCACGACGGAACGGAAACGCCGGGGCCGAAGTCCGTC
TAAAAGTGAAGCAGTTGTTGACATTGAAGATATTATGTTCAGCATAGACAATGTTCAGACAATCAGGGCCTCGCTCTTGGATTGGTACGACCGTAGCCGCAGGGACCTTC
CATGGAGGAGCTTGGACAAAGGGGAACCTGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAGACCAGAGTTCAGACCGTCGTCCAATTTTACAAC
CGTTGGATGCTTAAATGGCCTACCGTTCAACATCTCTCTCGTGCTTCTCTTGAGGAGGTGAATGAAATGTGGGCAGGCTTGGGGTATTATAGACGAGCTCGTTTTCTTTT
TGAGGGTGCAAAGATGATAGTCAAAGAAGGTGGTAGATTTCCTAAAACAGTTTCTTCCCTTCGAAAAATTCCAGGAATTGGAGAATACACAGCAGGGGCTATTGCCTCTA
TAGCGTTCGGTGAAGTGGTGCCTGTAGTCGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCTATTTCAGGAAAACCAAAAGACCCGAAGTTGATCAAGCAAGTT
TGGAAGGCAGCTGCTCAATTAGTTGATCCTTCCAGGCCTGGGGATTTCAATCAGGCACTCATGGAACTTGGTGCAACTTTATGCACTCCAACAAACCCAAGCTGCTCAAC
GTGCCCCGTGTTTGATCACTGTGAGGCCCTTTCAATCTCAAAGCGTGATAGTTCAGTTCTTGTCACAGATTATCCCGCTAAGGGGATAAAGACCAAACAAAGACATGATT
ATTCTGCTGTATGTGTGGTTGAGATATTGGAAAGTCAGGGTACATCTGAGTTAGGGCAATCTAGTAGATTTCTTCTTGTAAAGAGGCCTGATGAAGGTTTGCTTGCTGGT
CTATGGGAGTTTCCATCTGTCTCGTTGGATGGAGAAGCTGATTTAACCACAAGGAGAGAATCCATCGATAGCCTCTTGATCAATAGAGAAGATGTTGGAGATTTTATCCA
TGTTTTCACGCACATCCGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTTTAAAAGGTTAG
Protein sequenceShow/hide protein sequence
MSDGEKNENEENVKKKTDFRRKKKPTTERKRRGRSPSKSEAVVDIEDIMFSIDNVQTIRASLLDWYDRSRRDLPWRSLDKGEPETRAYGVWVSEIMLQQTRVQTVVQFYN
RWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPKTVSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGKPKDPKLIKQV
WKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKRDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILESQGTSELGQSSRFLLVKRPDEGLLAG
LWEFPSVSLDGEADLTTRRESIDSLLINREDVGDFIHVFTHIRLKIYVEHLVLCLKG