; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014539 (gene) of Snake gourd v1 genome

Gene IDTan0014539
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAdenine DNA glycosylase
Genome locationLG06:18785474..18819495
RNA-Seq ExpressionTan0014539
SyntenyTan0014539
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0000701 - purine-specific mismatch base pair DNA N-glycosylase activity (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR044298 - Adenine/Thymine-DNA glycosylase
IPR032675 - Leucine-rich repeat domain superfamily
IPR029119 - MutY, C-terminal
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
IPR015797 - NUDIX hydrolase-like domain superfamily
IPR011257 - DNA glycosylase
IPR004036 - Endonuclease III-like, conserved site-2
IPR003603 - U2A'/phosphoprotein 32 family A, C-terminal
IPR001611 - Leucine-rich repeat
IPR000445 - Helix-hairpin-helix motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN46403.2 hypothetical protein Csa_005328 [Cucumis sativus]1.3e-19788.5Show/hide
Query:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT
        MS GEKNEN+E +K+NTDFR+KKKPT +RKR+ RSPS+ EA VDIEDIMFSID VQTIRASLLDWYDRSRRDLPWRSLDK +PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV
        RVQTV+ FYNRWM KWPTVQHLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEG  FP+TVS LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRV
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV

Query:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKD KLIKQVWKAA QLVD SRPGDFNQALMELGATLCTP +PSCSTCP+FDHCEALSISKHD+SVLVTDYPAKGIK KQRHDYSAVCVV
Subjt:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK
        EILE+QGT    QSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESIN+LLSK FGLE KKNFEIV RED+GDF+H+F+HIRLKIYVEHLVLCLK
Subjt:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK

TYK24629.1 adenine DNA glycosylase isoform X1 [Cucumis melo var. makuwa]5.1e-19989.25Show/hide
Query:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT
        MS GEKNENEEN+K+ TDFR+KKKPT +RKR+ RSPS+ EA VDIEDIMFSID VQTIRASLLDWYDRSRRDLPWRSLDK +PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV
        RVQTV+ FYNRWM KWPTVQHLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEG  FPKTVS LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRV
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV

Query:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKD KLIKQVWKAA QLVDPSRPGDFNQALMELGATLCTP +PSCSTCP+FDHCEALSISK D+SVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK
        EILE+QGTS   QSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADL+TRRESI++LLSK FGLE KKNFEIV RED+GDF+HVF+HIRLKIYVEHLVLCLK
Subjt:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK

XP_004140565.2 adenine DNA glycosylase isoform X1 [Cucumis sativus]1.3e-19788.5Show/hide
Query:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT
        MS GEKNEN+E +K+NTDFR+KKKPT +RKR+ RSPS+ EA VDIEDIMFSID VQTIRASLLDWYDRSRRDLPWRSLDK +PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV
        RVQTV+ FYNRWM KWPTVQHLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEG  FP+TVS LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRV
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV

Query:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKD KLIKQVWKAA QLVD SRPGDFNQALMELGATLCTP +PSCSTCP+FDHCEALSISKHD+SVLVTDYPAKGIK KQRHDYSAVCVV
Subjt:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK
        EILE+QGT    QSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESIN+LLSK FGLE KKNFEIV RED+GDF+H+F+HIRLKIYVEHLVLCLK
Subjt:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK

XP_016902459.1 PREDICTED: adenine DNA glycosylase isoform X2 [Cucumis melo]2.8e-19789Show/hide
Query:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT
        MS GEKNENEEN+K+ TDFR+KKKPT  RKR+ RSPS+ EA VDIEDIMFSID VQTIRASLLDWYDRSRRDLPWRSLDK +PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV
        RVQTV+ FYNRWM KWPTVQHLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEG  FPKTVS LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRV
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV

Query:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKD KLIKQVWKAA QLVD SRPGDFNQALMELGATLCTP +PSCSTCP+FDHCEALSISK D+SVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK
        EILE+QGTS   QSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD STRRESI++LLSK FGLE KKNFEIV RED+GDF+HVF+HIRLKIYVEHLVLCLK
Subjt:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK

XP_031743608.1 adenine DNA glycosylase isoform X5 [Cucumis sativus]1.3e-19788.5Show/hide
Query:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT
        MS GEKNEN+E +K+NTDFR+KKKPT +RKR+ RSPS+ EA VDIEDIMFSID VQTIRASLLDWYDRSRRDLPWRSLDK +PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV
        RVQTV+ FYNRWM KWPTVQHLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEG  FP+TVS LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRV
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV

Query:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKD KLIKQVWKAA QLVD SRPGDFNQALMELGATLCTP +PSCSTCP+FDHCEALSISKHD+SVLVTDYPAKGIK KQRHDYSAVCVV
Subjt:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK
        EILE+QGT    QSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESIN+LLSK FGLE KKNFEIV RED+GDF+H+F+HIRLKIYVEHLVLCLK
Subjt:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK

TrEMBL top hitse value%identityAlignment
A0A0A0KC27 Adenine DNA glycosylase6.1e-19888.5Show/hide
Query:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT
        MS GEKNEN+E +K+NTDFR+KKKPT +RKR+ RSPS+ EA VDIEDIMFSID VQTIRASLLDWYDRSRRDLPWRSLDK +PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV
        RVQTV+ FYNRWM KWPTVQHLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEG  FP+TVS LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRV
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV

Query:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKD KLIKQVWKAA QLVD SRPGDFNQALMELGATLCTP +PSCSTCP+FDHCEALSISKHD+SVLVTDYPAKGIK KQRHDYSAVCVV
Subjt:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK
        EILE+QGT    QSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESIN+LLSK FGLE KKNFEIV RED+GDF+H+F+HIRLKIYVEHLVLCLK
Subjt:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK

A0A1S3CBT2 Adenine DNA glycosylase1.4e-19789Show/hide
Query:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT
        MS GEKNENEEN+K+ TDFR+KKKPT  RKR+ RSPS+ EA VDIEDIMFSID VQTIRASLLDWYDRSRRDLPWRSLDK +PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV
        RVQTV+ FYNRWM KWPTVQHLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEG  FPKTVS LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRV
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV

Query:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKD KLIKQVWKAA QLVD SRPGDFNQALMELGATLCTP +PSCSTCP+FDHCEALSISK D+SVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK
        EILE+QGTS   QSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD STRRESI++LLSK FGLE KKNFEIV RED+GDF+HVF+HIRLKIYVEHLVLCLK
Subjt:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK

A0A1S4E2J7 Adenine DNA glycosylase1.4e-19789Show/hide
Query:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT
        MS GEKNENEEN+K+ TDFR+KKKPT  RKR+ RSPS+ EA VDIEDIMFSID VQTIRASLLDWYDRSRRDLPWRSLDK +PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV
        RVQTV+ FYNRWM KWPTVQHLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEG  FPKTVS LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRV
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV

Query:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKD KLIKQVWKAA QLVD SRPGDFNQALMELGATLCTP +PSCSTCP+FDHCEALSISK D+SVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK
        EILE+QGTS   QSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD STRRESI++LLSK FGLE KKNFEIV RED+GDF+HVF+HIRLKIYVEHLVLCLK
Subjt:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK

A0A5D3DMU5 Adenine DNA glycosylase2.5e-19989.25Show/hide
Query:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT
        MS GEKNENEEN+K+ TDFR+KKKPT +RKR+ RSPS+ EA VDIEDIMFSID VQTIRASLLDWYDRSRRDLPWRSLDK +PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV
        RVQTV+ FYNRWM KWPTVQHLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEG  FPKTVS LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRV
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV

Query:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKD KLIKQVWKAA QLVDPSRPGDFNQALMELGATLCTP +PSCSTCP+FDHCEALSISK D+SVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK
        EILE+QGTS   QSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADL+TRRESI++LLSK FGLE KKNFEIV RED+GDF+HVF+HIRLKIYVEHLVLCLK
Subjt:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK

E5GB45 Adenine DNA glycosylase1.4e-19789Show/hide
Query:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT
        MS GEKNENEEN+K+ TDFR+KKKPT  RKR+ RSPS+ EA VDIEDIMFSID VQTIRASLLDWYDRSRRDLPWRSLDK +PETRAYGVWVSEIMLQQT
Subjt:  MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREA-VDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV
        RVQTV+ FYNRWM KWPTVQHLSRASLEEVNEMWAGLGYYRRARFL EGAKMIVKEG  FPKTVS LRKIPGIGEYTAGAIASIAF EVVPVVDGNVIRV
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRV

Query:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV
        IARLKAISGNPKD KLIKQVWKAA QLVD SRPGDFNQALMELGATLCTP +PSCSTCP+FDHCEALSISK D+SVLVTDYPAKGIKTKQRHDYSAVCVV
Subjt:  IARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVV

Query:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK
        EILE+QGTS   QSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD STRRESI++LLSK FGLE KKNFEIV RED+GDF+HVF+HIRLKIYVEHLVLCLK
Subjt:  EILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLK

SwissProt top hitse value%identityAlignment
F4JRF4 Adenine DNA glycosylase7.5e-12154.99Show/hide
Query:  EKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSERE----AVDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKE-QPETRAYGVWVSEIMLQQT
        E+ E  E  +   D  + ++ +++ + +    +E E      DIED +FS ++ Q IR  LLDWYD ++RDLPWR+   E + E RAY VWVSEIMLQQT
Subjt:  EKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSERE----AVDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKE-QPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAI
        RVQTV+ +Y RWMQKWPT+  L +ASLE                   EVNEMWAGLGYYRRARFLLEGAKM+V    GFP   S L K+ GIG+YTAGAI
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAI

Query:  ASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDY
        ASIAF+E VPVVDGNVIRV+ARLKAIS NPKD    +  WK A QLVDPSRPGDFNQ+LMELGATLCT   PSCS+CP+   C A S+S+ + ++ VTDY
Subjt:  ASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDY

Query:  PAKGIKTKQRHDYSAVCVVEILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSK--YFGLETKKNFEIVIREDIGDF
        P K IK K RHD+  VCV+EI        ++   RF+LVKRP++GLLAGLWEFPSV L+ EAD +TRR +IN  L +   F +E KK   IV RE++G+F
Subjt:  PAKGIKTKQRHDYSAVCVVEILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSK--YFGLETKKNFEIVIREDIGDF

Query:  VHVFSHIRLKIYVEHLVLCLKDFLDHKFQDQ
        VH+F+HIR K+YVE LV+ L    +  F+ Q
Subjt:  VHVFSHIRLKIYVEHLVLCLKDFLDHKFQDQ

P43333 U2 small nuclear ribonucleoprotein A'2.6e-7376.38Show/hide
Query:  QDQFDAIDLSDNEIVKLENMPCLNRLGTLLINNNRITRINPNIGEFLPKLHTLVLTNNRLVNLVEIDPLASLPKLQFLSLLDNNITKKPNYRLYVIHKLK
        +DQFD IDLSDNEIVKLEN P LNRLGTLLINNNRITRINPN+GEFLPKLH+LVLTNNRLVNLVEIDPLAS+PKLQ+LSLLDNNITKK NYRLYVIHKLK
Subjt:  QDQFDAIDLSDNEIVKLENMPCLNRLGTLLINNNRITRINPNIGEFLPKLHTLVLTNNRLVNLVEIDPLASLPKLQFLSLLDNNITKKPNYRLYVIHKLK

Query:  SLRVLDFKKVRNKERLEARNLFSSKEVEEEAKKESVKTFVPGEVENASKPVEEKQTSNMSAPTPEQIIAIKAAIVNSQTLEEVARLEQALKSGQLPADL
        SLRVLDF K++ KER EA +LFSSKE EEE KK S +     EV+  S+  E  +T  + APT EQI+AIKAAI+NSQT+EE+ARLEQALK GQ+PA L
Subjt:  SLRVLDFKKVRNKERLEARNLFSSKEVEEEAKKESVKTFVPGEVENASKPVEEKQTSNMSAPTPEQIIAIKAAIVNSQTLEEVARLEQALKSGQLPADL

Q8R5G2 Adenine DNA glycosylase6.2e-8344.28Show/hide
Query:  RQRRSPSEREAVDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQP--ETRAYGVWVSEIMLQQTRVQTVIHFYNRWMQKWPTVQHLSRASLEE
        +Q+R    +  V    +   I  V   R +LL WYD+ +RDLPWR   KE+   + RAY VWVSE+MLQQT+V TVI +Y RWMQKWPT+Q L+ ASLEE
Subjt:  RQRRSPSEREAVDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQP--ETRAYGVWVSEIMLQQTRVQTVIHFYNRWMQKWPTVQHLSRASLEE

Query:  VNEMWAGLGYYRRARFLLEGAKMIVKE-GSGFPKTVSGLRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLIKQVWKAAGQL
        VN++W+GLGYY R R L EGA+ +V+E G   P+T   L++ +PG+G YTAGAIASIAFD+V  VVDGNVIRV+ R++AI  +P  S +   +W  A QL
Subjt:  VNEMWAGLGYYRRARFLLEGAKMIVKE-GSGFPKTVSGLRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLIKQVWKAAGQL

Query:  VDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEA--------------------------------LSISKHDTSVLVTDYPAKGIKTKQRHDYS
        VDP+RPGDFNQA MELGAT+CTP  P C+ CP+   C A                                 S +  D ++ V ++P K  +   R +YS
Subjt:  VDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEA--------------------------------LSISKHDTSVLVTDYPAKGIKTKQRHDYS

Query:  AVCVVEILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHL
        A CVVE        G+      LLV+RP+ GLLAGLWEFPSV+L+        +  +  L      L T         + +G+ +HVFSHI+L   V  L
Subjt:  AVCVVEILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHL

Query:  VL
         L
Subjt:  VL

Q99P21 Adenine DNA glycosylase3.0e-8544.28Show/hide
Query:  RQRRSPSEREAVDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQ--PETRAYGVWVSEIMLQQTRVQTVIHFYNRWMQKWPTVQHLSRASLEE
        +Q+R    + +V    +   +  V   R++LL WYD+ +RDLPWR+L KE+   + RAY VWVSE+MLQQT+V TVI +Y RWMQKWP +Q L+ ASLEE
Subjt:  RQRRSPSEREAVDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQ--PETRAYGVWVSEIMLQQTRVQTVIHFYNRWMQKWPTVQHLSRASLEE

Query:  VNEMWAGLGYYRRARFLLEGAKMIVKE-GSGFPKTVSGLRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLIKQVWKAAGQL
        VN++W+GLGYY R R L EGA+ +V+E G   P+T   L++ +PG+G YTAGAIASIAFD+V  VVDGNV+RV+ R++AI  +P  + +   +W  A QL
Subjt:  VNEMWAGLGYYRRARFLLEGAKMIVKE-GSGFPKTVSGLRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLIKQVWKAAGQL

Query:  VDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEA--------------------------------LSISKHDTSVLVTDYPAKGIKTKQRHDYS
        VDP+RPGDFNQA MELGAT+CTP  P CS CP+   C A                                 S S  D S+ V ++P K  +   R +YS
Subjt:  VDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEA--------------------------------LSISKHDTSVLVTDYPAKGIKTKQRHDYS

Query:  AVCVVEILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHL
        A CVVE        G+      LLV+RPD GLLAGLWEFPSV+L  E     + +++   L ++ G         +  + +G+ +H+FSHI+L   V  L
Subjt:  AVCVVEILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHL

Query:  VL
         L
Subjt:  VL

Q9UIF7 Adenine DNA glycosylase2.2e-8043.95Show/hide
Query:  RSPSE---REAVDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPW--RSLDKEQPETRAYGVWVSEIMLQQTRVQTVIHFYNRWMQKWPTVQHLSRASLEE
        R P E   + +V    +   + +V   R SLL WYD+ +RDLPW  R+ D+   + RAY VWVSE+MLQQT+V TVI++Y  WMQKWPT+Q L+ ASLEE
Subjt:  RSPSE---REAVDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPW--RSLDKEQPETRAYGVWVSEIMLQQTRVQTVIHFYNRWMQKWPTVQHLSRASLEE

Query:  VNEMWAGLGYYRRARFLLEGAKMIVKE-GSGFPKTVSGLRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLIKQVWKAAGQL
        VN++WAGLGYY R R L EGA+ +V+E G   P+T   L++ +PG+G YTAGAIASIAF +   VVDGNV RV+ R++AI  +P  + + +Q+W  A QL
Subjt:  VNEMWAGLGYYRRARFLLEGAKMIVKE-GSGFPKTVSGLRK-IPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLIKQVWKAAGQL

Query:  VDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEA--------------LSISKH---------------------DTSVLVTDYPAKGIKTKQRH
        VDP+RPGDFNQA MELGAT+CTP  P CS CP+   C A              LS S                       D ++ V ++P K  +   R 
Subjt:  VDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEA--------------LSISKH---------------------DTSVLVTDYPAKGIKTKQRH

Query:  DYSAVCVVEILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYV
        + SA CV   LE  G  G++     LLV+RP+ GLLAGLWEFPSV+ +    L  +R+++   L ++ G     +        +G+ VH FSHI+L   V
Subjt:  DYSAVCVVEILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYV

Query:  EHLVL
          L L
Subjt:  EHLVL

Arabidopsis top hitse value%identityAlignment
AT1G05900.2 endonuclease III 22.2e-1126.36Show/hide
Query:  PETRAYGVWVSEIMLQQTRVQTVIHFYNRWMQK-WPTVQHLSRASLEEVNEMWAGLGYY-RRARFLLEGAKMIVKEGSG-FPKTVSGLRKIPGIGEYTAG
        P+ R + V +  ++  QT+         R  Q    T + + +A    + E+   +G+Y R+A  + + AK+ + E  G  P+T+  L  +PG+G   A 
Subjt:  PETRAYGVWVSEIMLQQTRVQTVIHFYNRWMQK-WPTVQHLSRASLEEVNEMWAGLGYY-RRARFLLEGAKMIVKEGSG-FPKTVSGLRKIPGIGEYTAG

Query:  AIASIAFDEVVPV-VDGNVIRVIARL--------KAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSIS
         +  +A+++V  + VD +V R+  RL        K  + +P+++++  Q W   G+ V        N  L+  G T+CTP+ P C TC I + C +    
Subjt:  AIASIAFDEVVPV-VDGNVIRVIARL--------KAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSIS

Query:  KHDTSVLVTDYPAKGIKTKQ
           TS  +     K IK+K+
Subjt:  KHDTSVLVTDYPAKGIKTKQ

AT1G09760.1 U2 small nuclear ribonucleoprotein A1.9e-7476.38Show/hide
Query:  QDQFDAIDLSDNEIVKLENMPCLNRLGTLLINNNRITRINPNIGEFLPKLHTLVLTNNRLVNLVEIDPLASLPKLQFLSLLDNNITKKPNYRLYVIHKLK
        +DQFD IDLSDNEIVKLEN P LNRLGTLLINNNRITRINPN+GEFLPKLH+LVLTNNRLVNLVEIDPLAS+PKLQ+LSLLDNNITKK NYRLYVIHKLK
Subjt:  QDQFDAIDLSDNEIVKLENMPCLNRLGTLLINNNRITRINPNIGEFLPKLHTLVLTNNRLVNLVEIDPLASLPKLQFLSLLDNNITKKPNYRLYVIHKLK

Query:  SLRVLDFKKVRNKERLEARNLFSSKEVEEEAKKESVKTFVPGEVENASKPVEEKQTSNMSAPTPEQIIAIKAAIVNSQTLEEVARLEQALKSGQLPADL
        SLRVLDF K++ KER EA +LFSSKE EEE KK S +     EV+  S+  E  +T  + APT EQI+AIKAAI+NSQT+EE+ARLEQALK GQ+PA L
Subjt:  SLRVLDFKKVRNKERLEARNLFSSKEVEEEAKKESVKTFVPGEVENASKPVEEKQTSNMSAPTPEQIIAIKAAIVNSQTLEEVARLEQALKSGQLPADL

AT4G12740.1 HhH-GPD base excision DNA repair family protein5.4e-12254.99Show/hide
Query:  EKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSERE----AVDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKE-QPETRAYGVWVSEIMLQQT
        E+ E  E  +   D  + ++ +++ + +    +E E      DIED +FS ++ Q IR  LLDWYD ++RDLPWR+   E + E RAY VWVSEIMLQQT
Subjt:  EKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSERE----AVDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKE-QPETRAYGVWVSEIMLQQT

Query:  RVQTVIHFYNRWMQKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAI
        RVQTV+ +Y RWMQKWPT+  L +ASLE                   EVNEMWAGLGYYRRARFLLEGAKM+V    GFP   S L K+ GIG+YTAGAI
Subjt:  RVQTVIHFYNRWMQKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAI

Query:  ASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDY
        ASIAF+E VPVVDGNVIRV+ARLKAIS NPKD    +  WK A QLVDPSRPGDFNQ+LMELGATLCT   PSCS+CP+   C A S+S+ + ++ VTDY
Subjt:  ASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLIKQVWKAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDY

Query:  PAKGIKTKQRHDYSAVCVVEILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSK--YFGLETKKNFEIVIREDIGDF
        P K IK K RHD+  VCV+EI        ++   RF+LVKRP++GLLAGLWEFPSV L+ EAD +TRR +IN  L +   F +E KK   IV RE++G+F
Subjt:  PAKGIKTKQRHDYSAVCVVEILENQGTSGSEQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINNLLSK--YFGLETKKNFEIVIREDIGDF

Query:  VHVFSHIRLKIYVEHLVLCLKDFLDHKFQDQ
        VH+F+HIR K+YVE LV+ L    +  F+ Q
Subjt:  VHVFSHIRLKIYVEHLVLCLKDFLDHKFQDQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGCGGAGAAAAGAACGAGAACGAGGAGAATCTCAAGCAAAATACTGATTTTCGCCAGAAAAAGAAACCAACGAAGGATCGAAAACGCCAGCGCCGAAGTCCGTC
TGAAAGAGAAGCAGTCGACATTGAAGATATCATGTTCAGCATAGACCAAGTTCAGACAATTAGGGCATCGCTATTGGATTGGTACGACCGTAGCCGTAGGGACCTCCCAT
GGCGGAGCTTGGACAAAGAACAACCCGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAGACGAGAGTTCAGACCGTCATTCACTTTTACAACCGT
TGGATGCAAAAATGGCCCACTGTTCAACATCTCTCTCGTGCTTCTCTCGAGGAGGTGAATGAGATGTGGGCAGGCTTGGGGTACTATAGACGAGCTCGTTTTCTTTTGGA
GGGTGCAAAGATGATAGTCAAAGAAGGTAGTGGATTTCCTAAAACGGTTTCTGGCCTTCGAAAAATTCCTGGAATTGGAGAGTACACAGCTGGGGCTATCGCCTCCATAG
CATTTGATGAAGTGGTGCCTGTGGTTGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCTATTTCAGGAAATCCAAAAGACTCAAAGTTGATTAAGCAAGTTTGG
AAGGCAGCTGGTCAATTAGTTGATCCTTCCAGACCTGGGGACTTCAATCAGGCACTCATGGAACTTGGTGCAACTTTATGCACTCCAATAAGCCCAAGCTGCTCAACATG
CCCCATATTTGATCACTGTGAGGCCCTTTCAATCTCAAAGCATGATACTTCAGTTCTTGTCACAGATTATCCCGCTAAAGGGATAAAGACCAAACAAAGACATGATTATT
CTGCTGTATGCGTGGTTGAGATATTGGAAAATCAGGGTACATCAGGGTCAGAGCAATCTAGTAGATTTCTTCTCGTAAAGAGGCCCGATGAAGGTTTGCTTGCTGGTCTA
TGGGAGTTCCCGTCTGTCTCGTTGGATGGAGAAGCTGATTTAAGTACGAGGAGAGAATCCATTAATAACCTCTTGAGTAAATATTTTGGACTTGAAACAAAAAAGAATTT
TGAAATAGTTATTAGAGAAGATATTGGAGATTTTGTCCATGTTTTCTCGCACATCCGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTCTAAAAGATTTCCTTGACC
ATAAATTTCAGGACCAATTTGATGCCATTGATTTATCTGATAATGAGATTGTGAAGTTGGAAAATATGCCATGTCTTAATCGATTGGGCACATTGCTGATCAATAATAAT
AGAATCACTCGTATCAATCCGAATATTGGAGAGTTCTTGCCAAAATTACATACATTAGTTCTTACGAACAACAGACTTGTGAACTTGGTAGAGATTGACCCGTTGGCATC
CCTTCCAAAACTTCAGTTTCTTAGTTTGTTGGATAACAATATTACGAAGAAGCCAAACTATAGATTGTATGTCATTCACAAGTTGAAGTCACTCCGGGTGCTTGATTTCA
AGAAAGTCAGAAACAAGGAGAGGTTGGAGGCTAGGAATTTATTTTCATCAAAAGAAGTTGAAGAGGAGGCAAAAAAGGAATCTGTGAAGACATTTGTTCCAGGTGAGGTA
GAGAACGCATCCAAACCTGTGGAGGAGAAACAAACTTCAAACATGTCTGCTCCAACACCCGAACAAATTATAGCTATTAAGGCGGCCATTGTTAATTCCCAAACTCTTGA
AGAGGTTGCAAGATTAGAACAGGCGCTAAAGTCAGGTCAGCTACCTGCAGATTTGAATCTTTTGGAAGATAATACTGTGCCAAATACCACAAAAGATACAGACGATAAGA
CAATGTCTGATGGTGGAGATGAAGAAAATGTATCTAAGGATGCAAAAGAACAATCGAATGATGAATCTACACCTTTGGAGCAGGTACGACAACCTTTATGCTATTTGAGC
GATGCCTTTGTGCAGTTTTTTTTTTTTTTTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGCGGAGAAAAGAACGAGAACGAGGAGAATCTCAAGCAAAATACTGATTTTCGCCAGAAAAAGAAACCAACGAAGGATCGAAAACGCCAGCGCCGAAGTCCGTC
TGAAAGAGAAGCAGTCGACATTGAAGATATCATGTTCAGCATAGACCAAGTTCAGACAATTAGGGCATCGCTATTGGATTGGTACGACCGTAGCCGTAGGGACCTCCCAT
GGCGGAGCTTGGACAAAGAACAACCCGAAACACGGGCTTACGGTGTGTGGGTTTCAGAAATAATGCTGCAGCAGACGAGAGTTCAGACCGTCATTCACTTTTACAACCGT
TGGATGCAAAAATGGCCCACTGTTCAACATCTCTCTCGTGCTTCTCTCGAGGAGGTGAATGAGATGTGGGCAGGCTTGGGGTACTATAGACGAGCTCGTTTTCTTTTGGA
GGGTGCAAAGATGATAGTCAAAGAAGGTAGTGGATTTCCTAAAACGGTTTCTGGCCTTCGAAAAATTCCTGGAATTGGAGAGTACACAGCTGGGGCTATCGCCTCCATAG
CATTTGATGAAGTGGTGCCTGTGGTTGATGGTAATGTGATAAGGGTAATTGCTCGATTAAAGGCTATTTCAGGAAATCCAAAAGACTCAAAGTTGATTAAGCAAGTTTGG
AAGGCAGCTGGTCAATTAGTTGATCCTTCCAGACCTGGGGACTTCAATCAGGCACTCATGGAACTTGGTGCAACTTTATGCACTCCAATAAGCCCAAGCTGCTCAACATG
CCCCATATTTGATCACTGTGAGGCCCTTTCAATCTCAAAGCATGATACTTCAGTTCTTGTCACAGATTATCCCGCTAAAGGGATAAAGACCAAACAAAGACATGATTATT
CTGCTGTATGCGTGGTTGAGATATTGGAAAATCAGGGTACATCAGGGTCAGAGCAATCTAGTAGATTTCTTCTCGTAAAGAGGCCCGATGAAGGTTTGCTTGCTGGTCTA
TGGGAGTTCCCGTCTGTCTCGTTGGATGGAGAAGCTGATTTAAGTACGAGGAGAGAATCCATTAATAACCTCTTGAGTAAATATTTTGGACTTGAAACAAAAAAGAATTT
TGAAATAGTTATTAGAGAAGATATTGGAGATTTTGTCCATGTTTTCTCGCACATCCGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTCTAAAAGATTTCCTTGACC
ATAAATTTCAGGACCAATTTGATGCCATTGATTTATCTGATAATGAGATTGTGAAGTTGGAAAATATGCCATGTCTTAATCGATTGGGCACATTGCTGATCAATAATAAT
AGAATCACTCGTATCAATCCGAATATTGGAGAGTTCTTGCCAAAATTACATACATTAGTTCTTACGAACAACAGACTTGTGAACTTGGTAGAGATTGACCCGTTGGCATC
CCTTCCAAAACTTCAGTTTCTTAGTTTGTTGGATAACAATATTACGAAGAAGCCAAACTATAGATTGTATGTCATTCACAAGTTGAAGTCACTCCGGGTGCTTGATTTCA
AGAAAGTCAGAAACAAGGAGAGGTTGGAGGCTAGGAATTTATTTTCATCAAAAGAAGTTGAAGAGGAGGCAAAAAAGGAATCTGTGAAGACATTTGTTCCAGGTGAGGTA
GAGAACGCATCCAAACCTGTGGAGGAGAAACAAACTTCAAACATGTCTGCTCCAACACCCGAACAAATTATAGCTATTAAGGCGGCCATTGTTAATTCCCAAACTCTTGA
AGAGGTTGCAAGATTAGAACAGGCGCTAAAGTCAGGTCAGCTACCTGCAGATTTGAATCTTTTGGAAGATAATACTGTGCCAAATACCACAAAAGATACAGACGATAAGA
CAATGTCTGATGGTGGAGATGAAGAAAATGTATCTAAGGATGCAAAAGAACAATCGAATGATGAATCTACACCTTTGGAGCAGGTACGACAACCTTTATGCTATTTGAGC
GATGCCTTTGTGCAGTTTTTTTTTTTTTTTTTTTAA
Protein sequenceShow/hide protein sequence
MSGGEKNENEENLKQNTDFRQKKKPTKDRKRQRRSPSEREAVDIEDIMFSIDQVQTIRASLLDWYDRSRRDLPWRSLDKEQPETRAYGVWVSEIMLQQTRVQTVIHFYNR
WMQKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLLEGAKMIVKEGSGFPKTVSGLRKIPGIGEYTAGAIASIAFDEVVPVVDGNVIRVIARLKAISGNPKDSKLIKQVW
KAAGQLVDPSRPGDFNQALMELGATLCTPISPSCSTCPIFDHCEALSISKHDTSVLVTDYPAKGIKTKQRHDYSAVCVVEILENQGTSGSEQSSRFLLVKRPDEGLLAGL
WEFPSVSLDGEADLSTRRESINNLLSKYFGLETKKNFEIVIREDIGDFVHVFSHIRLKIYVEHLVLCLKDFLDHKFQDQFDAIDLSDNEIVKLENMPCLNRLGTLLINNN
RITRINPNIGEFLPKLHTLVLTNNRLVNLVEIDPLASLPKLQFLSLLDNNITKKPNYRLYVIHKLKSLRVLDFKKVRNKERLEARNLFSSKEVEEEAKKESVKTFVPGEV
ENASKPVEEKQTSNMSAPTPEQIIAIKAAIVNSQTLEEVARLEQALKSGQLPADLNLLEDNTVPNTTKDTDDKTMSDGGDEENVSKDAKEQSNDESTPLEQVRQPLCYLS
DAFVQFFFFFF