; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005606 (gene) of Snake gourd v1 genome

Gene IDTan0005606
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA-(apurinic or apyrimidinic site) lyase
Genome locationLG11:10677925..10679847
RNA-Seq ExpressionTan0005606
SyntenyTan0005606
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006289 - nucleotide-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0034039 - 8-oxo-7,8-dihydroguanine DNA N-glycosylase activity (molecular function)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR012904 - 8-oxoguanine DNA glycosylase, N-terminal
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149809.2 N-glycosylase/DNA lyase OGG1 [Cucumis sativus]1.2e-18785.93Show/hide
Query:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK
        M S S +P LLM+KRL+PTPPSTPS KP   PP PPTPQ SHSKPTTVS+H+SS NP KTL LL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWK
Subjt:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK

Query:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNN
        QT P  FTGVV SHLISL HLPNGDVSYCLH  STSS +AAARL LLDFLNA ISLS+IWEVFSAADPRFD LARH EGARVLRQDPLECLIQFLCSSNN
Subjt:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNN

Query:  NIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVA
        NIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIG VNAL+AKP GGAEWLLSLRD DLEEVI+ALSTLPGVGPKVA
Subjt:  NIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVA

Query:  ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGN
        ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+A+LPQQKALLP+ LEN KRK+STKQQK+    GN
Subjt:  ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGN

Query:  IDRCE
        ID+CE
Subjt:  IDRCE

XP_008466739.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X1 [Cucumis melo]9.8e-19086.95Show/hide
Query:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK
        M S S +P LLM+KR +PT PSTPS KP   PP PPTPQ SHSKPTTVSIH+SS NP KTLTLL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWK
Subjt:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK

Query:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSN
        QT PL FTGVV SHLISL HLPNG+VSYCLH  STS S +AAARL LLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLIQFLCSSN
Subjt:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV
        NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTG
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALLP+ LEN KRK+STKQQ++    G
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTG

Query:  NIDRCE
        NID+CE
Subjt:  NIDRCE

XP_016903621.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X2 [Cucumis melo]3.9e-18687.85Show/hide
Query:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK
        M S S +P LLM+KR +PT PSTPS KP   PP PPTPQ SHSKPTTVSIH+SS NP KTLTLL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWK
Subjt:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK

Query:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSN
        QT PL FTGVV SHLISL HLPNG+VSYCLH  STS S +AAARL LLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLIQFLCSSN
Subjt:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV
        NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKE
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALLP+ LEN KRK+STKQQ++
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKE

XP_038885235.1 N-glycosylase/DNA lyase OGG1 isoform X1 [Benincasa hispida]8.3e-18985.4Show/hide
Query:  SFSSRPLLMSKRLRPTPPSTPSAKPP--SSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAP
        SF+ +PLLM+KRLRPTPPSTPSAKPP   SPP PPTPQ SHSKPTTVS+HYSS N  KTLT   S SS NWV LNL+KS+L+LPLTFPTGQTFRWKQT+P
Subjt:  SFSSRPLLMSKRLRPTPPSTPSAKPP--SSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAP

Query:  LHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGR
        L FTGVV SHLISL HLPN DVSYCLHSCSTSS +AAARL LLDFLNAGISLS+IWEVF AADPRFD LARHLEGARVLRQDPLECLIQFLCSSNNNIGR
Subjt:  LHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGR

Query:  ITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVA
        ITKMVDYISSLGNYLGN+GGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIG VNAL+AKPGGGAEWLLSLRDLDLEEVI+ALSTLPGVGPKVAACVA
Subjt:  ITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVA

Query:  LFSLDQHHAIPVDTHVWQ------------IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQ
        LFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLP+ LEN KRK+STK Q
Subjt:  LFSLDQHHAIPVDTHVWQ------------IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQ

Query:  KEKEQTGNIDR
        K+K  TGN+D+
Subjt:  KEKEQTGNIDR

XP_038885236.1 N-glycosylase/DNA lyase OGG1 isoform X2 [Benincasa hispida]1.8e-19187.97Show/hide
Query:  SFSSRPLLMSKRLRPTPPSTPSAKPP--SSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAP
        SF+ +PLLM+KRLRPTPPSTPSAKPP   SPP PPTPQ SHSKPTTVS+HYSS N  KTLT   S SS NWV LNL+KS+L+LPLTFPTGQTFRWKQT+P
Subjt:  SFSSRPLLMSKRLRPTPPSTPSAKPP--SSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAP

Query:  LHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGR
        L FTGVV SHLISL HLPN DVSYCLHSCSTSS +AAARL LLDFLNAGISLS+IWEVF AADPRFD LARHLEGARVLRQDPLECLIQFLCSSNNNIGR
Subjt:  LHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGR

Query:  ITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVA
        ITKMVDYISSLGNYLGN+GGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIG VNAL+AKPGGGAEWLLSLRDLDLEEVI+ALSTLPGVGPKVAACVA
Subjt:  ITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVA

Query:  LFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGNIDR
        LFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLP+ LEN KRK+STK QK+K  TGN+D+
Subjt:  LFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGNIDR

TrEMBL top hitse value%identityAlignment
A0A0A0KIU8 DNA-(apurinic or apyrimidinic site) lyase5.8e-18885.93Show/hide
Query:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK
        M S S +P LLM+KRL+PTPPSTPS KP   PP PPTPQ SHSKPTTVS+H+SS NP KTL LL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWK
Subjt:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK

Query:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNN
        QT P  FTGVV SHLISL HLPNGDVSYCLH  STSS +AAARL LLDFLNA ISLS+IWEVFSAADPRFD LARH EGARVLRQDPLECLIQFLCSSNN
Subjt:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNN

Query:  NIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVA
        NIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIG VNAL+AKP GGAEWLLSLRD DLEEVI+ALSTLPGVGPKVA
Subjt:  NIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVA

Query:  ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGN
        ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+A+LPQQKALLP+ LEN KRK+STKQQK+    GN
Subjt:  ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGN

Query:  IDRCE
        ID+CE
Subjt:  IDRCE

A0A1S3CS00 DNA-(apurinic or apyrimidinic site) lyase4.7e-19086.95Show/hide
Query:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK
        M S S +P LLM+KR +PT PSTPS KP   PP PPTPQ SHSKPTTVSIH+SS NP KTLTLL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWK
Subjt:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK

Query:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSN
        QT PL FTGVV SHLISL HLPNG+VSYCLH  STS S +AAARL LLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLIQFLCSSN
Subjt:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV
        NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTG
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALLP+ LEN KRK+STKQQ++    G
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTG

Query:  NIDRCE
        NID+CE
Subjt:  NIDRCE

A0A1S4E5V3 DNA-(apurinic or apyrimidinic site) lyase1.9e-18687.85Show/hide
Query:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK
        M S S +P LLM+KR +PT PSTPS KP   PP PPTPQ SHSKPTTVSIH+SS NP KTLTLL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWK
Subjt:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK

Query:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSN
        QT PL FTGVV SHLISL HLPNG+VSYCLH  STS S +AAARL LLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLIQFLCSSN
Subjt:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV
        NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKE
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALLP+ LEN KRK+STKQQ++
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKE

A0A5D3CBS3 DNA-(apurinic or apyrimidinic site) lyase1.9e-18687.85Show/hide
Query:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK
        M S S +P LLM+KR +PT PSTPS KP   PP PPTPQ SHSKPTTVSIH+SS NP KTLTLL    SPSSSNWV LNL++SDLSLPLTFPTGQTFRWK
Subjt:  MHSFSSRP-LLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLL---NSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWK

Query:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSN
        QT PL FTGVV SHLISL HLPNG+VSYCLH  STS S +AAARL LLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLIQFLCSSN
Subjt:  QTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTS-SVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV
        NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKE
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALLP+ LEN KRK+STKQQ++
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKE

A0A6J1FL67 DNA-(apurinic or apyrimidinic site) lyase2.1e-18585.68Show/hide
Query:  MHSFSSRPLLMSKRLRPTPPSTPSAK----PPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSP---SSSNWVPLNLSKSDLSLPLTFPTGQTF
        M S S R  LM+KRLRPTPPSTPSAK    PPS PP PPTPQ  HSKPTTVS+ +SSN+  KTLT L SP   +SSNWV LNL++SDLSLPLTFPTGQTF
Subjt:  MHSFSSRPLLMSKRLRPTPPSTPSAK----PPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSP---SSSNWVPLNLSKSDLSLPLTFPTGQTF

Query:  RWKQTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCST---SSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQF
        RWKQT+PLHFTGVV  HLISL HLPNGDVSYCLHSCST   SS AAAARL LLDFLNAGISLSAIWEVFSAADPRFD L+RHLEGARVLRQDPLECLIQF
Subjt:  RWKQTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCST---SSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQF

Query:  LCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPG
        LCSSNNNIGRITKMVDYISSLGN+LGN+GGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTV  L+ KPGGGAEWLLSLRDL LEEVI+ L+ LPG
Subjt:  LCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPG

Query:  VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKE
        VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLP+ LEN KRKKSTK+Q+E
Subjt:  VGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKE

Query:  KEQTG
        K  TG
Subjt:  KEQTG

SwissProt top hitse value%identityAlignment
O08760 N-glycosylase/DNA lyase5.3e-5339.29Show/hide
Query:  TLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARL----TLLDFLNAGISLSAIW
        TL +SP+   W  +   +S+L L L   +GQ+FRWK+ +P H++GV+A  + +L      D  YC       S  +   L    TL  +    +SL+ ++
Subjt:  TLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARL----TLLDFLNAGISLSAIW

Query:  EVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFHEFPSLERLS-LVSEAELREAGFGYRAKYIIG
          +++ D  F  +A+  +G R+LRQDP ECL  F+CSSNNNI RIT MV+ +  + G  L  +    +H FP+L  L+   +E  LR+ G GYRA+Y+  
Subjt:  EVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFHEFPSLERLS-LVSEAELREAGFGYRAKYIIG

Query:  TVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGK
        +  A+  + GG A WL  LR    EE   AL TLPGVG KVA C+ L +LD+  A+PVD HVWQIA R     P+ + A+    L N+ +   F + +G 
Subjt:  TVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGK

Query:  YAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQ
        YAGWAQ +LF ADL Q      S+    KRKK +K+
Subjt:  YAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQ

O15527 N-glycosylase/DNA lyase2.9e-5137.5Show/hide
Query:  TLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKH--------LPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISL
        TL ++P+   W  +   +S+L L L  P+GQ+FRW++ +P H++GV+A  + +L          +  GD S    S  T     A R     +    ++L
Subjt:  TLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKH--------LPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISL

Query:  SAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFHEFPSLERLSLVS-EAELREAGFGYRAK
        + ++  + + D  F  +A+  +G R+LRQDP+ECL  F+CSSNNNI RIT MV+ +  + G  L  +    +H FPSL+ L+    EA LR+ G GYRA+
Subjt:  SAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFHEFPSLERLSLVS-EAELREAGFGYRAK

Query:  YIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVS
        Y+  +  A+  + GG A WL  LR+   EE   AL  LPGVG KVA C+ L +LD+  A+PVD H+W IA R     P  + A+  +P+    +   F S
Subjt:  YIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVS

Query:  KYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKK
         +G YAGWAQ +LF ADL Q +       + RK  K
Subjt:  KYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKK

O70249 N-glycosylase/DNA lyase6.9e-5338.69Show/hide
Query:  TLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARL----TLLDFLNAGISLSAIW
        TL +SP+   W  +   +S+L L L   +GQ+FRW++ +P H++GV+A  + +L      D  YC              L    TL  +    +SL+ ++
Subjt:  TLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARL----TLLDFLNAGISLSAIW

Query:  EVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFHEFPSLERLSLVS-EAELREAGFGYRAKYIIG
          +++ D  F  +A+  +G R+LRQDP ECL  F+CSSNNNI RIT MV+ +  + G  L  +    +H FP+L  L+    E  LR+ G GYRA+Y+  
Subjt:  EVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFHEFPSLERLSLVS-EAELREAGFGYRAKYIIG

Query:  TVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGK
        +  A+  + GG A WL  LR    EE   AL TLPGVG KVA C+ L +LD+  A+PVD HVWQIA R     P+ +  +    L N+ +   F + +G 
Subjt:  TVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGK

Query:  YAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQ
        YAGWAQ +LF ADL QQ     S+    KRKK +K+
Subjt:  YAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQ

Q9FNY7 N-glycosylase/DNA lyase OGG17.9e-12667.64Show/hide
Query:  PPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNGD-VSYCLHSCST
        P PT QPS S  +TV    S    P     L+   +  W PL L+ ++L+LPLTFPTGQTFRWK+T  + ++G +  HL+SL+  P  D VSYC+H CST
Subjt:  PPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNGD-VSYCLHSCST

Query:  SSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLER
        S    +A L LLDFLNA ISL+ +W  FS  DPRF  LARHL GARVLRQDPLECLIQFLCSSNNNI RITKMVD++SSLG +LG++ GF+FH+FPSL+R
Subjt:  SSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLER

Query:  LSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGA
        LS VSE E R+AGFGYRAKYI GTVNAL+AKPGGG EWLLSLR ++L+E + AL TLPGVGPKVAAC+ALFSLDQH AIPVDTHVWQIAT YL+P+LAGA
Subjt:  LSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGA

Query:  RLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPS
        +LTPKL  RVAEAFVSKYG+YAGWAQTLLF+A+LP QK LL S
Subjt:  RLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPS

Q9V3I8 N-glycosylase/DNA lyase2.7e-4132.92Show/hide
Query:  LNLSKSDLSLPLTFPTGQTFRWKQTAPLHFT--GVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLAR
        + LS  +  L  T   GQ+FRW+     + T  G V  +   +       ++Y  +  S+          + D+L     L    + + + D   D   +
Subjt:  LNLSKSDLSLPLTFPTGQTFRWKQTAPLHFT--GVVASHLISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLAR

Query:  HL-EGARVLRQDPLECLIQFLCSSNNNIGRITKMVD-YISSLGNYLGNVGGFDFHEFPSLERLSLVS----EAELREAGFGYRAKYIIGTVNALEAKPGG
         L +  R+L Q+P E +  FLCS NNNI RI+ M++ + ++ G  +G+  G D + FP++ R   +      A+LR A FGYRAK+I  T+  ++ K  G
Subjt:  HL-EGARVLRQDPLECLIQFLCSSNNNIGRITKMVD-YISSLGNYLGNVGGFDFHEFPSLERLSLVS----EAELREAGFGYRAKYIIGTVNALEAKPGG

Query:  GAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLFVAD
        G  W +SL+ +  E+  + L+ LPG+G KVA C+ L S+    ++PVD H+++IA  Y +P L G + +T K+   V++ F   +GKYAGWAQ +LF AD
Subjt:  GAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLFVAD

Query:  LPQ---QKALLPSKLENRKRKK
        L Q      +   K  N+K KK
Subjt:  LPQ---QKALLPSKLENRKRKK

Arabidopsis top hitse value%identityAlignment
AT1G21710.1 8-oxoguanine-DNA glycosylase 15.6e-12767.64Show/hide
Query:  PPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNGD-VSYCLHSCST
        P PT QPS S  +TV    S    P     L+   +  W PL L+ ++L+LPLTFPTGQTFRWK+T  + ++G +  HL+SL+  P  D VSYC+H CST
Subjt:  PPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASHLISLKHLPNGD-VSYCLHSCST

Query:  SSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLER
        S    +A L LLDFLNA ISL+ +W  FS  DPRF  LARHL GARVLRQDPLECLIQFLCSSNNNI RITKMVD++SSLG +LG++ GF+FH+FPSL+R
Subjt:  SSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFHEFPSLER

Query:  LSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGA
        LS VSE E R+AGFGYRAKYI GTVNAL+AKPGGG EWLLSLR ++L+E + AL TLPGVGPKVAAC+ALFSLDQH AIPVDTHVWQIAT YL+P+LAGA
Subjt:  LSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGA

Query:  RLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPS
        +LTPKL  RVAEAFVSKYG+YAGWAQTLLF+A+LP QK LL S
Subjt:  RLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPS

AT3G47830.1 DNA glycosylase superfamily protein1.5e-0747.83Show/hide
Query:  LRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRY-LVPELAGARLTPKLCNR
        LR L +EEV   LS   GVGPK  +CV +F+L QH+  PVDTHV++IA     VP+ A    T    NR
Subjt:  LRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRY-LVPELAGARLTPKLCNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACTCATTCTCATCGAGACCCCTTCTAATGTCGAAGAGGCTCAGACCCACTCCACCCTCCACTCCCTCCGCCAAGCCGCCGTCATCGCCGCCGCCTCCTCCGACTCC
TCAACCCTCCCATTCAAAGCCCACCACCGTCTCCATTCACTATTCATCCAACAACCCCCCAAAAACGCTAACCCTCCTCAATTCCCCATCATCCTCCAACTGGGTCCCCC
TCAATCTCTCCAAATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGCTGGAAACAAACCGCCCCTCTTCACTTCACCGGCGTCGTTGCCTCTCAT
CTCATCTCTCTCAAGCACCTTCCAAACGGCGACGTTTCATATTGCCTTCACTCTTGTTCTACATCCTCCGTCGCCGCCGCCGCCAGATTGACCTTGCTTGACTTCCTTAA
CGCCGGCATCTCCCTCAGTGCCATTTGGGAGGTGTTCTCGGCGGCTGATCCGAGATTCGATGGCTTGGCTCGCCATTTGGAGGGTGCTCGAGTTCTCAGGCAAGACCCAC
TTGAGTGTTTGATTCAGTTTTTGTGTTCTTCGAATAACAATATTGGGAGAATCACGAAAATGGTGGATTATATCTCATCGCTAGGGAATTATTTGGGCAATGTTGGGGGC
TTCGATTTCCATGAGTTCCCCTCTTTGGAGAGGCTGTCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGTTTTGGTTACAGGGCTAAATACATAATTGGCACTGTAAA
TGCTTTAGAAGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTATCTCTTCGTGATTTGGATCTTGAAGAGGTGATTGATGCCCTTTCCACTTTACCCGGTGTCGGTCCGA
AGGTGGCAGCTTGTGTTGCTCTCTTCTCTCTCGATCAGCATCATGCCATTCCTGTTGACACACACGTGTGGCAGATTGCTACTAGATACCTTGTCCCTGAGCTTGCTGGT
GCACGTCTAACGCCAAAGCTTTGCAACCGTGTGGCTGAGGCATTTGTCAGCAAGTATGGCAAATATGCTGGTTGGGCTCAAACGCTGCTTTTCGTTGCTGATTTACCTCA
ACAGAAGGCCCTCTTACCATCGAAGCTCGAGAATAGGAAAAGGAAAAAATCTACAAAGCAGCAGAAAGAAAAGGAACAGACTGGTAATATAGATCGATGTGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCACTCATTCTCATCGAGACCCCTTCTAATGTCGAAGAGGCTCAGACCCACTCCACCCTCCACTCCCTCCGCCAAGCCGCCGTCATCGCCGCCGCCTCCTCCGACTCC
TCAACCCTCCCATTCAAAGCCCACCACCGTCTCCATTCACTATTCATCCAACAACCCCCCAAAAACGCTAACCCTCCTCAATTCCCCATCATCCTCCAACTGGGTCCCCC
TCAATCTCTCCAAATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGCTGGAAACAAACCGCCCCTCTTCACTTCACCGGCGTCGTTGCCTCTCAT
CTCATCTCTCTCAAGCACCTTCCAAACGGCGACGTTTCATATTGCCTTCACTCTTGTTCTACATCCTCCGTCGCCGCCGCCGCCAGATTGACCTTGCTTGACTTCCTTAA
CGCCGGCATCTCCCTCAGTGCCATTTGGGAGGTGTTCTCGGCGGCTGATCCGAGATTCGATGGCTTGGCTCGCCATTTGGAGGGTGCTCGAGTTCTCAGGCAAGACCCAC
TTGAGTGTTTGATTCAGTTTTTGTGTTCTTCGAATAACAATATTGGGAGAATCACGAAAATGGTGGATTATATCTCATCGCTAGGGAATTATTTGGGCAATGTTGGGGGC
TTCGATTTCCATGAGTTCCCCTCTTTGGAGAGGCTGTCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGTTTTGGTTACAGGGCTAAATACATAATTGGCACTGTAAA
TGCTTTAGAAGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTATCTCTTCGTGATTTGGATCTTGAAGAGGTGATTGATGCCCTTTCCACTTTACCCGGTGTCGGTCCGA
AGGTGGCAGCTTGTGTTGCTCTCTTCTCTCTCGATCAGCATCATGCCATTCCTGTTGACACACACGTGTGGCAGATTGCTACTAGATACCTTGTCCCTGAGCTTGCTGGT
GCACGTCTAACGCCAAAGCTTTGCAACCGTGTGGCTGAGGCATTTGTCAGCAAGTATGGCAAATATGCTGGTTGGGCTCAAACGCTGCTTTTCGTTGCTGATTTACCTCA
ACAGAAGGCCCTCTTACCATCGAAGCTCGAGAATAGGAAAAGGAAAAAATCTACAAAGCAGCAGAAAGAAAAGGAACAGACTGGTAATATAGATCGATGTGAATAG
Protein sequenceShow/hide protein sequence
MHSFSSRPLLMSKRLRPTPPSTPSAKPPSSPPPPPTPQPSHSKPTTVSIHYSSNNPPKTLTLLNSPSSSNWVPLNLSKSDLSLPLTFPTGQTFRWKQTAPLHFTGVVASH
LISLKHLPNGDVSYCLHSCSTSSVAAAARLTLLDFLNAGISLSAIWEVFSAADPRFDGLARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGG
FDFHEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAG
ARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLPSKLENRKRKKSTKQQKEKEQTGNIDRCE