; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10010729 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10010729
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionAdenine DNA glycosylase
Genome locationChr06:25268742..25285075
RNA-Seq ExpressionHG10010729
SyntenyHG10010729
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0000701 - purine-specific mismatch base pair DNA N-glycosylase activity (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR044298 - Adenine/Thymine-DNA glycosylase
IPR032675 - Leucine-rich repeat domain superfamily
IPR029119 - MutY, C-terminal
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
IPR015797 - NUDIX hydrolase-like domain superfamily
IPR011257 - DNA glycosylase
IPR005760 - A/G-specific adenine glycosylase MutY
IPR004036 - Endonuclease III-like, conserved site-2
IPR003603 - U2A'/phosphoprotein 32 family A, C-terminal
IPR001611 - Leucine-rich repeat
IPR000445 - Helix-hairpin-helix motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK24629.1 adenine DNA glycosylase isoform X1 [Cucumis melo var. makuwa]1.0e-19191.11Show/hide
Query:  KQSGHRYWNGTTVADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEE
        K+ G        V DIEDIMFSIDNVQTIRASLL+WYDRSRRDLPWRSLDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQHLSRASLEE
Subjt:  KQSGHRYWNGTTVADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEE

Query:  VNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD
        VNEMWAGLGYYRRARFL EGAKMIVKEGG+FPKTVS+LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD
Subjt:  VNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD

Query:  PSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLA
        PSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAVCVVEILE+QGTS+  +SSRFLLVKRPDEGLLA
Subjt:  PSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLA

Query:  GLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK
        GLWEFPSV LDGEADL TRRESI+SLLSK FGLEPK+NF++V REDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  GLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK

XP_004140565.2 adenine DNA glycosylase isoform X1 [Cucumis sativus]1.6e-18990.03Show/hide
Query:  KQSGHRYWNGTTVADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEE
        K+ G        V DIEDIMFSIDNVQTIRASLL+WYDRSRRDLPWRSLDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQHLSRASLEE
Subjt:  KQSGHRYWNGTTVADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEE

Query:  VNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD
        VNEMWAGLGYYRRARFL EGAKMIVKEGG+FP+TVS+LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD
Subjt:  VNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD

Query:  PSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLA
         SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIK KQRHDYSAVCVVEILE+QGT +  +SSRFLLVKRPDEGLLA
Subjt:  PSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLA

Query:  GLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK
        GLWEFPSV LDGEADL TRRESINSLLSK FGLE K+NF++V REDVGDFIH+FTHIRLKIYVEHLVLCLK
Subjt:  GLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK

XP_016902459.1 PREDICTED: adenine DNA glycosylase isoform X2 [Cucumis melo]1.6e-18993.04Show/hide
Query:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR
        V DIEDIMFSIDNVQTIRASLL+WYDRSRRDLPWRSLDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQHLSRASLEEVNEMWAGLGYYR
Subjt:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR

Query:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM
        RARFL EGAKMIVKEGG+FPKTVS+LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD SRPGDFNQALM
Subjt:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM

Query:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG
        ELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAVCVVEILE+QGTS+  +SSRFLLVKRPDEGLLAGLWEFPSV LDG
Subjt:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG

Query:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EAD  TRRESI+SLLSK FGLEPK+NF++V REDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK

XP_038874808.1 adenine DNA glycosylase isoform X1 [Benincasa hispida]3.5e-19292.76Show/hide
Query:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR
        V DIEDIMFS+DNVQ IRASLLEWYDRSRRDLPWR LDKG+PETRAYGVWVSEIMLQQTRVQTVVH+YNRWM++WPTVQHLSRASLEEVNEMWAGLGYYR
Subjt:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR

Query:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM
        RAR+LLEGAKMIVKEGG+FPKTVS LRKI GIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKL+KQVWKAAAQLVDPSRPGDFNQALM
Subjt:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM

Query:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG
        ELGATLCTPTNPSCSTCPVFDHCEALSISKHD+S+LVTDYPAKGIKTKQRHDYSAVCVVEIL+NQGTS+ E+SSRFLLVKRPDEGLLAGLWEFPSVLLDG
Subjt:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG

Query:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EADL TRRESINS LSK FGLE K+NF++VIREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK

XP_038874810.1 adenine DNA glycosylase isoform X3 [Benincasa hispida]3.5e-19292.76Show/hide
Query:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR
        V DIEDIMFS+DNVQ IRASLLEWYDRSRRDLPWR LDKG+PETRAYGVWVSEIMLQQTRVQTVVH+YNRWM++WPTVQHLSRASLEEVNEMWAGLGYYR
Subjt:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR

Query:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM
        RAR+LLEGAKMIVKEGG+FPKTVS LRKI GIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKL+KQVWKAAAQLVDPSRPGDFNQALM
Subjt:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM

Query:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG
        ELGATLCTPTNPSCSTCPVFDHCEALSISKHD+S+LVTDYPAKGIKTKQRHDYSAVCVVEIL+NQGTS+ E+SSRFLLVKRPDEGLLAGLWEFPSVLLDG
Subjt:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG

Query:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EADL TRRESINS LSK FGLE K+NF++VIREDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK

TrEMBL top hitse value%identityAlignment
A0A0A0KC27 Adenine DNA glycosylase7.9e-19090.03Show/hide
Query:  KQSGHRYWNGTTVADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEE
        K+ G        V DIEDIMFSIDNVQTIRASLL+WYDRSRRDLPWRSLDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQHLSRASLEE
Subjt:  KQSGHRYWNGTTVADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEE

Query:  VNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD
        VNEMWAGLGYYRRARFL EGAKMIVKEGG+FP+TVS+LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD
Subjt:  VNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD

Query:  PSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLA
         SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIK KQRHDYSAVCVVEILE+QGT +  +SSRFLLVKRPDEGLLA
Subjt:  PSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLA

Query:  GLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK
        GLWEFPSV LDGEADL TRRESINSLLSK FGLE K+NF++V REDVGDFIH+FTHIRLKIYVEHLVLCLK
Subjt:  GLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK

A0A1S3CBT2 Adenine DNA glycosylase7.9e-19093.04Show/hide
Query:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR
        V DIEDIMFSIDNVQTIRASLL+WYDRSRRDLPWRSLDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQHLSRASLEEVNEMWAGLGYYR
Subjt:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR

Query:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM
        RARFL EGAKMIVKEGG+FPKTVS+LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD SRPGDFNQALM
Subjt:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM

Query:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG
        ELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAVCVVEILE+QGTS+  +SSRFLLVKRPDEGLLAGLWEFPSV LDG
Subjt:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG

Query:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EAD  TRRESI+SLLSK FGLEPK+NF++V REDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK

A0A1S4E2J7 Adenine DNA glycosylase7.9e-19093.04Show/hide
Query:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR
        V DIEDIMFSIDNVQTIRASLL+WYDRSRRDLPWRSLDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQHLSRASLEEVNEMWAGLGYYR
Subjt:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR

Query:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM
        RARFL EGAKMIVKEGG+FPKTVS+LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD SRPGDFNQALM
Subjt:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM

Query:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG
        ELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAVCVVEILE+QGTS+  +SSRFLLVKRPDEGLLAGLWEFPSV LDG
Subjt:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG

Query:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EAD  TRRESI+SLLSK FGLEPK+NF++V REDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK

A0A5D3DMU5 Adenine DNA glycosylase4.9e-19291.11Show/hide
Query:  KQSGHRYWNGTTVADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEE
        K+ G        V DIEDIMFSIDNVQTIRASLL+WYDRSRRDLPWRSLDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQHLSRASLEE
Subjt:  KQSGHRYWNGTTVADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEE

Query:  VNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD
        VNEMWAGLGYYRRARFL EGAKMIVKEGG+FPKTVS+LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD
Subjt:  VNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD

Query:  PSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLA
        PSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAVCVVEILE+QGTS+  +SSRFLLVKRPDEGLLA
Subjt:  PSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLA

Query:  GLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK
        GLWEFPSV LDGEADL TRRESI+SLLSK FGLEPK+NF++V REDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  GLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK

E5GB45 Adenine DNA glycosylase7.9e-19093.04Show/hide
Query:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR
        V DIEDIMFSIDNVQTIRASLL+WYDRSRRDLPWRSLDKG+PETRAYGVWVSEIMLQQTRVQTVV FYNRWML+WPTVQHLSRASLEEVNEMWAGLGYYR
Subjt:  VADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYR

Query:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM
        RARFL EGAKMIVKEGG+FPKTVS+LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD SRPGDFNQALM
Subjt:  RARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALM

Query:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG
        ELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIKTKQRHDYSAVCVVEILE+QGTS+  +SSRFLLVKRPDEGLLAGLWEFPSV LDG
Subjt:  ELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDG

Query:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK
        EAD  TRRESI+SLLSK FGLEPK+NF++V REDVGDFIHVFTHIRLKIYVEHLVLCLK
Subjt:  EADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLK

SwissProt top hitse value%identityAlignment
F4JRF4 Adenine DNA glycosylase1.4e-11959.26Show/hide
Query:  DIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRS-LDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLE--------------
        DIED +FS +  Q IR  LL+WYD ++RDLPWR+   + + E RAY VWVSEIMLQQTRVQTV+ +Y RWM +WPT+  L +ASLE              
Subjt:  DIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRS-LDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLE--------------

Query:  -----EVNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKA
             EVNEMWAGLGYYRRARFLLEGAKM+V     FP   S+L K+ GIG+YTAGAIASIAF+E VPVVDGNVIRV+ARLKAIS NPKD    +  WK 
Subjt:  -----EVNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKA

Query:  AAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRP
        AAQLVDPSRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+ + ++ VTDYP K IK K RHD+  VCV+EI       +++   RF+LVKRP
Subjt:  AAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRP

Query:  DEGLLAGLWEFPSVLLDGEADLKTRRESINSLLSKF--FGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCL
        ++GLLAGLWEFPSV+L+ EAD  TRR +IN  L +   F +E K+   +V RE++G+F+H+FTHIR K+YVE LV+ L
Subjt:  DEGLLAGLWEFPSVLLDGEADLKTRRESINSLLSKF--FGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCL

P43333 U2 small nuclear ribonucleoprotein A'4.4e-8176.5Show/hide
Query:  EEEEHGNKIAVIENLGATEDQFDAIDLSDNEIVKLENMPYLNRLGTLLINNNRITRINPNIGEFLPKLHTLVLTNNRLVNLVEIDPLASLPKLQFLSLLD
        E +  GNKI VIENLGATEDQFD IDLSDNEIVKLEN PYLNRLGTLLINNNRITRINPN+GEFLPKLH+LVLTNNRLVNLVEIDPLAS+PKLQ+LSLLD
Subjt:  EEEEHGNKIAVIENLGATEDQFDAIDLSDNEIVKLENMPYLNRLGTLLINNNRITRINPNIGEFLPKLHTLVLTNNRLVNLVEIDPLASLPKLQFLSLLD

Query:  NNITKKPNYRLYVIHKLKSVRVLDFKKVRNKERLEARNLFSSKEVEEEAKKESVKTFVPGELENTSKPVEEKQTSNVSAPTPEQIIAIKAAIVNSQTLEE
        NNITKK NYRLYVIHKLKS+RVLDF K++ KER EA +LFSSKE EEE KK S +     E++  S+  E  +T  V APT EQI+AIKAAI+NSQT+EE
Subjt:  NNITKKPNYRLYVIHKLKSVRVLDFKKVRNKERLEARNLFSSKEVEEEAKKESVKTFVPGELENTSKPVEEKQTSNVSAPTPEQIIAIKAAIVNSQTLEE

Query:  VARLEQALKSGQLPADL
        +ARLEQALK GQ+PA L
Subjt:  VARLEQALKSGQLPADL

Q8R5G2 Adenine DNA glycosylase1.7e-8045.03Show/hide
Query:  IDNVQTIRASLLEWYDRSRRDLPWRSLDKGQP--ETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEG
        I +V   R +LL WYD+ +RDLPWR   K +   + RAY VWVSE+MLQQT+V TV+ +Y RWM +WPT+Q L+ ASLEEVN++W+GLGYY R R L EG
Subjt:  IDNVQTIRASLLEWYDRSRRDLPWRSLDKGQP--ETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEG

Query:  AKMIVKE-GGKFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATL
        A+ +V+E GG  P+T   L++ +PG+G YTAGAIASIAFD+V  VVDGNVIRV+ R++AI  +P    +   +W  A QLVDP+RPGDFNQA MELGAT+
Subjt:  AKMIVKE-GGKFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATL

Query:  CTPTNPSCSTCPVFDHCEA--------------------------------LSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSS
        CTP  P C+ CPV   C A                                 S +  D ++ V ++P K  +   R +YSA CVVE     G        
Subjt:  CTPTNPSCSTCPVFDHCEA--------------------------------LSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSS

Query:  RFLLVKRPDEGLLAGLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVL
          LLV+RP+ GLLAGLWEFPSV L  E   + + +++   L  +    P         + +G+ IHVF+HI+L   V  L L
Subjt:  RFLLVKRPDEGLLAGLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVL

Q99P21 Adenine DNA glycosylase1.6e-8345.29Show/hide
Query:  IDNVQTIRASLLEWYDRSRRDLPWRSL--DKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEG
        + +V   R++LL WYD+ +RDLPWR+L  ++   + RAY VWVSE+MLQQT+V TV+ +Y RWM +WP +Q L+ ASLEEVN++W+GLGYY R R L EG
Subjt:  IDNVQTIRASLLEWYDRSRRDLPWRSL--DKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEG

Query:  AKMIVKE-GGKFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATL
        A+ +V+E GG  P+T   L++ +PG+G YTAGAIASIAFD+V  VVDGNV+RV+ R++AI  +P    +   +W  A QLVDP+RPGDFNQA MELGAT+
Subjt:  AKMIVKE-GGKFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATL

Query:  CTPTNPSCSTCPVFDHCEA--------------------------------LSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSS
        CTP  P CS CPV   C A                                 S S  D S+ V ++P K  +   R +YSA CVVE     G        
Subjt:  CTPTNPSCSTCPVFDHCEA--------------------------------LSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSS

Query:  RFLLVKRPDEGLLAGLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVL
          LLV+RPD GLLAGLWEFPSV L  E   + + +++   L ++ G  P      +  + +G+ IH+F+HI+L   V  L L
Subjt:  RFLLVKRPDEGLLAGLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVL

Q9UIF7 Adenine DNA glycosylase4.1e-7942.79Show/hide
Query:  IDNVQTIRASLLEWYDRSRRDLPW--RSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEG
        +  V   R SLL WYD+ +RDLPW  R+ D+   + RAY VWVSE+MLQQT+V TV+++Y  WM +WPT+Q L+ ASLEEVN++WAGLGYY R R L EG
Subjt:  IDNVQTIRASLLEWYDRSRRDLPW--RSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEG

Query:  AKMIVKE-GGKFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATL
        A+ +V+E GG  P+T   L++ +PG+G YTAGAIASIAF +   VVDGNV RV+ R++AI  +P    + +Q+W  A QLVDP+RPGDFNQA MELGAT+
Subjt:  AKMIVKE-GGKFPKTVSALRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATL

Query:  CTPTNPSCSTCPVFDHCEA--------------LSISKH---------------------DSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSE
        CTP  P CS CPV   C A              LS S                       D ++ V ++P K  +   R + SA CV+E           
Subjt:  CTPTNPSCSTCPVFDHCEA--------------LSISKH---------------------DSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSE

Query:  KSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLKALNPVVVV----
          ++ LLV+RP+ GLLAGLWEFPSV  +    L  +R+++   L ++ G  P  +        +G+ +H F+HI+L   V    L L+   PV  V    
Subjt:  KSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLKTRRESINSLLSKFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLKALNPVVVV----

Query:  --VVEEEEH
          + +EE H
Subjt:  --VVEEEEH

Arabidopsis top hitse value%identityAlignment
AT1G05900.2 endonuclease III 24.2e-1025.89Show/hide
Query:  PETRAYGVWVSEIMLQQTRVQ----TVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYY-RRARFLLEGAKMIVKE-GGKFPKTVSALRKIPGIGEY
        P+ R + V +  ++  QT+       V   +   +L   T + + +A    + E+   +G+Y R+A  + + AK+ + E  G  P+T+  L  +PG+G  
Subjt:  PETRAYGVWVSEIMLQQTRVQ----TVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGLGYY-RRARFLLEGAKMIVKE-GGKFPKTVSALRKIPGIGEY

Query:  TAGAIASIAFDEVVPV-VDGNVIRVIARL--------KAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHC-EA
         A  +  +A+++V  + VD +V R+  RL        K  + +P++ ++  Q W    + V        N  L+  G T+CTP  P C TC + + C  A
Subjt:  TAGAIASIAFDEVVPV-VDGNVIRVIARL--------KAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHC-EA

Query:  LSISKHDSSVLVTDYPAKGIKTKQ
           +   SS L      K IK+K+
Subjt:  LSISKHDSSVLVTDYPAKGIKTKQ

AT1G09760.1 U2 small nuclear ribonucleoprotein A3.1e-8276.5Show/hide
Query:  EEEEHGNKIAVIENLGATEDQFDAIDLSDNEIVKLENMPYLNRLGTLLINNNRITRINPNIGEFLPKLHTLVLTNNRLVNLVEIDPLASLPKLQFLSLLD
        E +  GNKI VIENLGATEDQFD IDLSDNEIVKLEN PYLNRLGTLLINNNRITRINPN+GEFLPKLH+LVLTNNRLVNLVEIDPLAS+PKLQ+LSLLD
Subjt:  EEEEHGNKIAVIENLGATEDQFDAIDLSDNEIVKLENMPYLNRLGTLLINNNRITRINPNIGEFLPKLHTLVLTNNRLVNLVEIDPLASLPKLQFLSLLD

Query:  NNITKKPNYRLYVIHKLKSVRVLDFKKVRNKERLEARNLFSSKEVEEEAKKESVKTFVPGELENTSKPVEEKQTSNVSAPTPEQIIAIKAAIVNSQTLEE
        NNITKK NYRLYVIHKLKS+RVLDF K++ KER EA +LFSSKE EEE KK S +     E++  S+  E  +T  V APT EQI+AIKAAI+NSQT+EE
Subjt:  NNITKKPNYRLYVIHKLKSVRVLDFKKVRNKERLEARNLFSSKEVEEEAKKESVKTFVPGELENTSKPVEEKQTSNVSAPTPEQIIAIKAAIVNSQTLEE

Query:  VARLEQALKSGQLPADL
        +ARLEQALK GQ+PA L
Subjt:  VARLEQALKSGQLPADL

AT4G12740.1 HhH-GPD base excision DNA repair family protein1.0e-12059.26Show/hide
Query:  DIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRS-LDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLE--------------
        DIED +FS +  Q IR  LL+WYD ++RDLPWR+   + + E RAY VWVSEIMLQQTRVQTV+ +Y RWM +WPT+  L +ASLE              
Subjt:  DIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRS-LDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLE--------------

Query:  -----EVNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKA
             EVNEMWAGLGYYRRARFLLEGAKM+V     FP   S+L K+ GIG+YTAGAIASIAF+E VPVVDGNVIRV+ARLKAIS NPKD    +  WK 
Subjt:  -----EVNEMWAGLGYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKA

Query:  AAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRP
        AAQLVDPSRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+ + ++ VTDYP K IK K RHD+  VCV+EI       +++   RF+LVKRP
Subjt:  AAQLVDPSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRP

Query:  DEGLLAGLWEFPSVLLDGEADLKTRRESINSLLSKF--FGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCL
        ++GLLAGLWEFPSV+L+ EAD  TRR +IN  L +   F +E K+   +V RE++G+F+H+FTHIR K+YVE LV+ L
Subjt:  DEGLLAGLWEFPSVLLDGEADLKTRRESINSLLSKF--FGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAAACAATCAGGGCATCGCTATTGGAATGGTACGACCGTAGCCGACATTGAAGATATTATGTTCAGCATAGACAATGTTCAAACAATCAGGGCATCGCTATTGGA
ATGGTACGACCGTAGCCGCAGGGACCTTCCATGGAGGAGCTTGGACAAAGGGCAACCTGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAGACGA
GAGTTCAGACCGTCGTCCACTTTTACAACCGTTGGATGCTTAGATGGCCCACCGTTCAACATCTCTCTCGTGCTTCTCTTGAGGAGGTGAATGAAATGTGGGCAGGCTTG
GGGTACTATAGACGAGCTCGTTTTCTTTTGGAGGGTGCAAAGATGATAGTCAAAGAAGGTGGTAAATTTCCTAAAACGGTTTCTGCCCTTAGAAAAATTCCTGGAATTGG
AGAATATACAGCAGGGGCTATTGCCTCCATAGCATTTGATGAAGTGGTGCCTGTGGTTGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCTATTTCAGGAAATC
CCAAAGATCCAAAGTTGATCAAGCAAGTTTGGAAGGCAGCTGCTCAATTAGTTGATCCTTCCAGGCCTGGGGACTTCAATCAGGCACTCATGGAACTTGGTGCAACTTTA
TGCACTCCAACAAACCCAAGCTGCTCAACATGCCCTGTGTTTGATCACTGTGAGGCCCTTTCAATCTCAAAGCATGATAGTTCAGTTCTTGTCACAGATTATCCCGCTAA
GGGGATAAAGACCAAACAAAGACATGATTATTCCGCTGTATGTGTGGTTGAGATATTGGAGAATCAGGGTACATCTCAATCAGAGAAATCTAGTAGATTTCTTCTTGTAA
AGAGGCCTGATGAAGGTTTGCTTGCTGGTCTATGGGAGTTTCCATCTGTCTTGTTGGATGGAGAAGCTGATTTAAAGACAAGGAGAGAATCCATTAATAGCCTCTTGAGT
AAATTCTTTGGACTTGAACCAAAAGAGAATTTTGATATGGTTATTAGAGAAGATGTTGGAGATTTTATCCATGTTTTCACGCACATCCGTCTCAAGATATATGTTGAGCA
CTTGGTGTTATGTTTAAAAGCCCTAAACCCCGTCGTCGTCGTCGTCGTCGAAGAAGAAGAACATGGAAACAAGATAGCAGTGATAGAAAACCTAGGTGCCACCGAGGACC
AATTTGATGCCATTGATTTGTCTGATAATGAGATTGTGAAGCTGGAAAATATGCCATATCTTAATCGATTGGGCACATTGCTGATCAATAATAATAGAATCACTCGTATC
AATCCAAATATTGGAGAGTTCTTGCCAAAATTACATACGCTAGTTCTTACAAACAACAGACTTGTGAACTTGGTAGAGATCGATCCATTGGCATCCCTTCCAAAACTTCA
GTTTCTTAGTTTGTTGGATAACAATATTACGAAGAAGCCAAACTATAGATTATATGTCATTCACAAGTTAAAGTCAGTCCGGGTGCTTGATTTCAAGAAAGTCAGAAACA
AGGAGAGATTGGAGGCTAGGAATTTATTTTCATCAAAAGAAGTTGAAGAAGAGGCAAAGAAGGAATCTGTGAAGACGTTTGTTCCAGGTGAGTTAGAGAATACATCCAAA
CCTGTGGAGGAGAAACAAACTTCAAACGTGTCTGCACCAACACCGGAGCAGATAATAGCTATTAAGGCGGCCATTGTAAATTCTCAAACTCTTGAAGAAGTTGCGAGATT
AGAACAGGCGCTCAAGTCAGGTCAGTTACCTGCAGATTTGAATCTTTTGGAAGATAATACCGTGCCAAACACCACGAAAGATACAGACGATAAGACAATGTCTGATAATG
GAGACGAAGAAAATGTATCCAAGGATGTTAAAGAGCAATCGAATGATGAATCTACACCTATGGAGCAGGGTAAAGAATTTCCTAACAATATGTACGGATCTATTGGCATA
CACATTGGGACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCAAACAATCAGGGCATCGCTATTGGAATGGTACGACCGTAGCCGACATTGAAGATATTATGTTCAGCATAGACAATGTTCAAACAATCAGGGCATCGCTATTGGA
ATGGTACGACCGTAGCCGCAGGGACCTTCCATGGAGGAGCTTGGACAAAGGGCAACCTGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAGACGA
GAGTTCAGACCGTCGTCCACTTTTACAACCGTTGGATGCTTAGATGGCCCACCGTTCAACATCTCTCTCGTGCTTCTCTTGAGGAGGTGAATGAAATGTGGGCAGGCTTG
GGGTACTATAGACGAGCTCGTTTTCTTTTGGAGGGTGCAAAGATGATAGTCAAAGAAGGTGGTAAATTTCCTAAAACGGTTTCTGCCCTTAGAAAAATTCCTGGAATTGG
AGAATATACAGCAGGGGCTATTGCCTCCATAGCATTTGATGAAGTGGTGCCTGTGGTTGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCTATTTCAGGAAATC
CCAAAGATCCAAAGTTGATCAAGCAAGTTTGGAAGGCAGCTGCTCAATTAGTTGATCCTTCCAGGCCTGGGGACTTCAATCAGGCACTCATGGAACTTGGTGCAACTTTA
TGCACTCCAACAAACCCAAGCTGCTCAACATGCCCTGTGTTTGATCACTGTGAGGCCCTTTCAATCTCAAAGCATGATAGTTCAGTTCTTGTCACAGATTATCCCGCTAA
GGGGATAAAGACCAAACAAAGACATGATTATTCCGCTGTATGTGTGGTTGAGATATTGGAGAATCAGGGTACATCTCAATCAGAGAAATCTAGTAGATTTCTTCTTGTAA
AGAGGCCTGATGAAGGTTTGCTTGCTGGTCTATGGGAGTTTCCATCTGTCTTGTTGGATGGAGAAGCTGATTTAAAGACAAGGAGAGAATCCATTAATAGCCTCTTGAGT
AAATTCTTTGGACTTGAACCAAAAGAGAATTTTGATATGGTTATTAGAGAAGATGTTGGAGATTTTATCCATGTTTTCACGCACATCCGTCTCAAGATATATGTTGAGCA
CTTGGTGTTATGTTTAAAAGCCCTAAACCCCGTCGTCGTCGTCGTCGTCGAAGAAGAAGAACATGGAAACAAGATAGCAGTGATAGAAAACCTAGGTGCCACCGAGGACC
AATTTGATGCCATTGATTTGTCTGATAATGAGATTGTGAAGCTGGAAAATATGCCATATCTTAATCGATTGGGCACATTGCTGATCAATAATAATAGAATCACTCGTATC
AATCCAAATATTGGAGAGTTCTTGCCAAAATTACATACGCTAGTTCTTACAAACAACAGACTTGTGAACTTGGTAGAGATCGATCCATTGGCATCCCTTCCAAAACTTCA
GTTTCTTAGTTTGTTGGATAACAATATTACGAAGAAGCCAAACTATAGATTATATGTCATTCACAAGTTAAAGTCAGTCCGGGTGCTTGATTTCAAGAAAGTCAGAAACA
AGGAGAGATTGGAGGCTAGGAATTTATTTTCATCAAAAGAAGTTGAAGAAGAGGCAAAGAAGGAATCTGTGAAGACGTTTGTTCCAGGTGAGTTAGAGAATACATCCAAA
CCTGTGGAGGAGAAACAAACTTCAAACGTGTCTGCACCAACACCGGAGCAGATAATAGCTATTAAGGCGGCCATTGTAAATTCTCAAACTCTTGAAGAAGTTGCGAGATT
AGAACAGGCGCTCAAGTCAGGTCAGTTACCTGCAGATTTGAATCTTTTGGAAGATAATACCGTGCCAAACACCACGAAAGATACAGACGATAAGACAATGTCTGATAATG
GAGACGAAGAAAATGTATCCAAGGATGTTAAAGAGCAATCGAATGATGAATCTACACCTATGGAGCAGGGTAAAGAATTTCCTAACAATATGTACGGATCTATTGGCATA
CACATTGGGACTTGA
Protein sequenceShow/hide protein sequence
MFKQSGHRYWNGTTVADIEDIMFSIDNVQTIRASLLEWYDRSRRDLPWRSLDKGQPETRAYGVWVSEIMLQQTRVQTVVHFYNRWMLRWPTVQHLSRASLEEVNEMWAGL
GYYRRARFLLEGAKMIVKEGGKFPKTVSALRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDPSRPGDFNQALMELGATL
CTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSQSEKSSRFLLVKRPDEGLLAGLWEFPSVLLDGEADLKTRRESINSLLS
KFFGLEPKENFDMVIREDVGDFIHVFTHIRLKIYVEHLVLCLKALNPVVVVVVEEEEHGNKIAVIENLGATEDQFDAIDLSDNEIVKLENMPYLNRLGTLLINNNRITRI
NPNIGEFLPKLHTLVLTNNRLVNLVEIDPLASLPKLQFLSLLDNNITKKPNYRLYVIHKLKSVRVLDFKKVRNKERLEARNLFSSKEVEEEAKKESVKTFVPGELENTSK
PVEEKQTSNVSAPTPEQIIAIKAAIVNSQTLEEVARLEQALKSGQLPADLNLLEDNTVPNTTKDTDDKTMSDNGDEENVSKDVKEQSNDESTPMEQGKEFPNNMYGSIGI
HIGT