CuGenDBv2

CSPI06G20160 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G20160
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: DNA-(apurinic or apyrimidinic site) lyase
Genome location: Chr6:18190256..18194522
RNA-Seq Expression: CSPI06G20160
Synteny: CSPI06G20160
Gene Ontology terms:
GO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006289 - nucleotide-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0034039 - 8-oxo-7,8-dihydroguanine DNA N-glycosylase activity (molecular function)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domains: NA
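
The genome location above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(text):
    """Split a 'Chr6:18190256..18194522'-style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Chr6:18190256..18194522")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 4267 bp on Chr6.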


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0055472.1 N-glycosylase/DNA lyase OGG1 isoform X2 [Cucumis melo var. makuwa] | e-value: 6.2e-211 | % identity: 95.75
Query:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
        MPSLSFKPLLLMTKR KPT PSTPSTKPSPPPPSPPTPQLSHSKPTTVS+HHSSKNPNKTL LLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
Subjt:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK

Query:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN
        QTNPLEFTGVVGSHLISLNHLPNG+VS+CLHFS  STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARH EGARVLRQDPLECLIQFLCSSN
Subjt:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGY-RAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPK
        NNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGY RAKYIIG VNALKAKP GGAEWLLSLRDSDLEEVI ALSTLPGVGPK
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGY-RAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPK

Query:  VAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHA
        VAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+AELPQQKALLPA LENTKRKRSTKQQ+DMAHA
Subjt:  VAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHA

XP_004149809.2 N-glycosylase/DNA lyase OGG1 [Cucumis sativus] | e-value: 1.7e-224 | % identity: 99.01
Query:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
        MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
Subjt:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK

Query:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNN
        QTNP EFTGVVGSHLISLNHLPNGDVS+CLHFSSTSSSAAARLALLDFLNA ISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNN
Subjt:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNN

Query:  IGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAA
        IGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVI+ALSTLPGVGPKVAA
Subjt:  IGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAA

Query:  CVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAGNI
        CVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAGNI
Subjt:  CVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAGNI

Query:  DQCE
        DQCE
Subjt:  DQCE

XP_008466739.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X1 [Cucumis melo] | e-value: 4.4e-217 | % identity: 96.06
Query:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
        MPSLSFKPLLLMTKR KPT PSTPSTKPSPPPPSPPTPQLSHSKPTTVS+HHSSKNPNKTL LLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
Subjt:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK

Query:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN
        QTNPLEFTGVVGSHLISLNHLPNG+VS+CLHFS  STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARH EGARVLRQDPLECLIQFLCSSN
Subjt:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKV
        NNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIG VNALKAKP GGAEWLLSLRDSDLEEVI ALSTLPGVGPKV
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAG
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+AELPQQKALLPA LENTKRKRSTKQQ+DMAHAG
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAG

Query:  NIDQCE
        NIDQCE
Subjt:  NIDQCE

XP_016903621.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X2 [Cucumis melo] | e-value: 2.5e-212 | % identity: 95.99
Query:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
        MPSLSFKPLLLMTKR KPT PSTPSTKPSPPPPSPPTPQLSHSKPTTVS+HHSSKNPNKTL LLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
Subjt:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK

Query:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN
        QTNPLEFTGVVGSHLISLNHLPNG+VS+CLHFS  STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARH EGARVLRQDPLECLIQFLCSSN
Subjt:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKV
        NNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIG VNALKAKP GGAEWLLSLRDSDLEEVI ALSTLPGVGPKV
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHA
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+AELPQQKALLPA LENTKRKRSTKQQ+DMAHA
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHA

XP_038885236.1 N-glycosylase/DNA lyase OGG1 isoform X2 [Benincasa hispida] | e-value: 1.5e-196 | % identity: 89.43
Query:  MPSLS--FKPLLLMTKRLKPTPPSTPSTKPS--PPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQT
        MPSLS  FKP LLMTKRL+PTPPSTPS KP   P PPSPPTPQLSHSKPTTVS+H+SSKN NKTL    +PQS SS NWVSLNLT+S+L+LPLTFPTGQT
Subjt:  MPSLS--FKPLLLMTKRLKPTPPSTPSTKPS--PPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQT

Query:  FRWKQTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSST-SSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLC
        FRWKQT+PL+FTGVVGSHLISLNHLPN DVS+CLH  ST SSSAAARLALLDFLNAGISLSSIWEVF AADPRFD LARH EGARVLRQDPLECLIQFLC
Subjt:  FRWKQTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSST-SSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLC

Query:  SSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVG
        SSNNNIGRITKMVDYISSLGNYLGN+GGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKP GGAEWLLSLRD DLEEVI+ALSTLPGVG
Subjt:  SSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVG

Query:  PKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMA
        PKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+A+LPQQKALLPANLEN KRKRSTK QKD A
Subjt:  PKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMA

Query:  HAGNIDQ
        H GN+DQ
Subjt:  HAGNIDQ

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KIU8 DNA-(apurinic or apyrimidinic site) lyase | e-value: 8.1e-225 | % identity: 99.01
Query:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
        MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
Subjt:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK

Query:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNN
        QTNP EFTGVVGSHLISLNHLPNGDVS+CLHFSSTSSSAAARLALLDFLNA ISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNN
Subjt:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNN

Query:  IGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAA
        IGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVI+ALSTLPGVGPKVAA
Subjt:  IGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAA

Query:  CVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAGNI
        CVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAGNI
Subjt:  CVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAGNI

Query:  DQCE
        DQCE
Subjt:  DQCE

A0A1S3CS00 DNA-(apurinic or apyrimidinic site) lyase | e-value: 2.1e-217 | % identity: 96.06
Query:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
        MPSLSFKPLLLMTKR KPT PSTPSTKPSPPPPSPPTPQLSHSKPTTVS+HHSSKNPNKTL LLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
Subjt:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK

Query:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN
        QTNPLEFTGVVGSHLISLNHLPNG+VS+CLHFS  STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARH EGARVLRQDPLECLIQFLCSSN
Subjt:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKV
        NNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIG VNALKAKP GGAEWLLSLRDSDLEEVI ALSTLPGVGPKV
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAG
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+AELPQQKALLPA LENTKRKRSTKQQ+DMAHAG
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAG

Query:  NIDQCE
        NIDQCE
Subjt:  NIDQCE

A0A1S4E5V3 DNA-(apurinic or apyrimidinic site) lyase | e-value: 1.2e-212 | % identity: 95.99
Query:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
        MPSLSFKPLLLMTKR KPT PSTPSTKPSPPPPSPPTPQLSHSKPTTVS+HHSSKNPNKTL LLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
Subjt:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK

Query:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN
        QTNPLEFTGVVGSHLISLNHLPNG+VS+CLHFS  STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARH EGARVLRQDPLECLIQFLCSSN
Subjt:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKV
        NNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIG VNALKAKP GGAEWLLSLRDSDLEEVI ALSTLPGVGPKV
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHA
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+AELPQQKALLPA LENTKRKRSTKQQ+DMAHA
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHA

A0A5A7UI18 DNA-(apurinic or apyrimidinic site) lyase | e-value: 3.0e-211 | % identity: 95.75
Query:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
        MPSLSFKPLLLMTKR KPT PSTPSTKPSPPPPSPPTPQLSHSKPTTVS+HHSSKNPNKTL LLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
Subjt:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK

Query:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN
        QTNPLEFTGVVGSHLISLNHLPNG+VS+CLHFS  STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARH EGARVLRQDPLECLIQFLCSSN
Subjt:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGY-RAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPK
        NNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGY RAKYIIG VNALKAKP GGAEWLLSLRDSDLEEVI ALSTLPGVGPK
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGY-RAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPK

Query:  VAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHA
        VAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+AELPQQKALLPA LENTKRKRSTKQQ+DMAHA
Subjt:  VAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHA

A0A5D3CBS3 DNA-(apurinic or apyrimidinic site) lyase | e-value: 1.2e-212 | % identity: 95.99
Query:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
        MPSLSFKPLLLMTKR KPT PSTPSTKPSPPPPSPPTPQLSHSKPTTVS+HHSSKNPNKTL LLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK
Subjt:  MPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWK

Query:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN
        QTNPLEFTGVVGSHLISLNHLPNG+VS+CLHFS  STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARH EGARVLRQDPLECLIQFLCSSN
Subjt:  QTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFS--STSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSN

Query:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKV
        NNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIG VNALKAKP GGAEWLLSLRDSDLEEVI ALSTLPGVGPKV
Subjt:  NNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHA
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+AELPQQKALLPA LENTKRKRSTKQQ+DMAHA
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHA

SwissProt top hits (e-value, % identity, alignment)
O08760 N-glycosylase/DNA lyase | e-value: 1.4e-51 | % identity: 37.35
Query:  SPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAARL-----ALLDFLNAGISLSSIWEVFS
        S S + W S+   RS+L L L   +GQ+FRWK+ +P  ++GV+   + +L      D  +C  +    S  +         L  +    +SL+ ++  ++
Subjt:  SPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAARL-----ALLDFLNAGISLSSIWEVFS

Query:  AADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLS-LVSEAELREAGFGYRAKYIIGAVNA
        + D  F  +A+ F+G R+LRQDP ECL  F+CSSNNNI RIT MV+ +  + G  L  +    ++ FP+L  L+   +E  LR+ G GYRA+Y+  +  A
Subjt:  AADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLS-LVSEAELREAGFGYRAKYIIGAVNA

Query:  LKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGKYAGW
        +  +  GG  WL  LR +  EE  KAL TLPGVG KVA C+ L +LD+  A+PVD HVWQIA R     P+ + A+    L N+ +   F + +G YAGW
Subjt:  LKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGKYAGW

Query:  AQTLLFIAELPQQKALLPANLENTKRKRSTKQ
        AQ +LF A+L Q      +     KRK+ +K+
Subjt:  AQTLLFIAELPQQKALLPANLENTKRKRSTKQ

O15527 N-glycosylase/DNA lyase | e-value: 3.6e-52 | % identity: 37.23
Query:  WVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAARL-----ALLDFLNAGISLSSIWEVFSAADPRF
        W S+   RS+L L L  P+GQ+FRW++ +P  ++GV+   + +L      +   C  +    S A+        A+  +    ++L+ ++  + + D  F
Subjt:  WVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAARL-----ALLDFLNAGISLSSIWEVFSAADPRF

Query:  DALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLSLVS-EAELREAGFGYRAKYIIGAVNALKAKPV
          +A+ F+G R+LRQDP+ECL  F+CSSNNNI RIT MV+ +  + G  L  +    ++ FPSL+ L+    EA LR+ G GYRA+Y+  +  A+  +  
Subjt:  DALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLSLVS-EAELREAGFGYRAKYIIGAVNALKAKPV

Query:  GGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLF
        GG  WL  LR+S  EE  KAL  LPGVG KVA C+ L +LD+  A+PVD H+W IA R     P  + A+  +P+    +   F S +G YAGWAQ +LF
Subjt:  GGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLF

Query:  IAELPQQKALLPANLENTKRKRSTK
         A+L Q +    A     KR++ +K
Subjt:  IAELPQQKALLPANLENTKRKRSTK

O70249 N-glycosylase/DNA lyase | e-value: 5.1e-51 | % identity: 36.67
Query:  SSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAARL-----ALLDFLNAGISLSSIWEVFSAA
        S + W S+   RS+L L L   +GQ+FRW++ +P  ++GV+   + +L      D  +C  +                 L  +    +SL+ ++  +++ 
Subjt:  SSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAARL-----ALLDFLNAGISLSSIWEVFSAA

Query:  DPRFDALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLSLVS-EAELREAGFGYRAKYIIGAVNALK
        D  F ++A+ F+G R+LRQDP ECL  F+CSSNNNI RIT MV+ +  + G  L  +    ++ FP+L  L+    E  LR+ G GYRA+Y+  +  A+ 
Subjt:  DPRFDALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLSLVS-EAELREAGFGYRAKYIIGAVNALK

Query:  AKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGKYAGWAQ
         +  GG  WL  LR +  EE  KAL TLPGVG KVA C+ L +LD+  A+PVD HVWQIA R     P+ +  +    L N+ +   F + +G YAGWAQ
Subjt:  AKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGKYAGWAQ

Query:  TLLFIAELPQQKALLPANLENTKRKRSTKQ
         +LF A+L QQ     +     KRK+ +K+
Subjt:  TLLFIAELPQQKALLPANLENTKRKRSTKQ

Q9FNY7 N-glycosylase/DNA lyase OGG1 | e-value: 4.0e-120 | % identity: 63.43
Query:  KPTPPSTPSTKPSPPPP-SPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHL
        +P P S PS   +  PP SPP   +   K     LH +                  +  W  L LT ++L+LPLTFPTGQTFRWK+T  ++++G +G HL
Subjt:  KPTPPSTPSTKPSPPPP-SPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHL

Query:  ISLNHLPNGD-VSFCLHFSSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSL
        +SL   P  D VS+C+H S++  S  A LALLDFLNA ISL+ +W  FS  DPRF  LARH  GARVLRQDPLECLIQFLCSSNNNI RITKMVD++SSL
Subjt:  ISLNHLPNGD-VSFCLHFSSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSL

Query:  GNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIP
        G +LG++ GF+F++FPSL+RLS VSE E R+AGFGYRAKYI G VNAL+AKP GG EWLLSLR  +L+E + AL TLPGVGPKVAAC+ALFSLDQH AIP
Subjt:  GNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIP

Query:  VDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALL
        VDTHVWQIAT YL+P+LAGA+LTPKL  RVAEAFVSKYG+YAGWAQTLLFIAELP QK LL
Subjt:  VDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALL

Q9V3I8 N-glycosylase/DNA lyase | e-value: 2.6e-39 | % identity: 32.83
Query:  LNLTRSDLSLPLTFPTGQTFRWKQT---NPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAA----RLALLDFLNAGISLSSIWEVFSAADPRFD
        + L+  +  L  T   GQ+FRW+     N  ++ GVV +    L      + SF  + +  +SS  A       + D+L     L    + + + D  F 
Subjt:  LNLTRSDLSLPLTFPTGQTFRWKQT---NPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAA----RLALLDFLNAGISLSSIWEVFSAADPRFD

Query:  ALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVD-YISSLGNYLGNVGGFDFYEFPSLERLSLVS----EAELREAGFGYRAKYIIGAVNALKAK
              +  R+L Q+P E +  FLCS NNNI RI+ M++ + ++ G  +G+  G D Y FP++ R   +      A+LR A FGYRAK+I   +  ++ K
Subjt:  ALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVD-YISSLGNYLGNVGGFDFYEFPSLERLSLVS----EAELREAGFGYRAKYIIGAVNALKAK

Query:  PVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLF
          GG  W +SL+    E+  + L+ LPG+G KVA C+ L S+    ++PVD H+++IA  Y +P L G + +T K+   V++ F   +GKYAGWAQ +LF
Subjt:  PVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLF

Query:  IAELPQQKALLPANLENTK----RKRSTKQQK
         A+L Q         +NT     +K+S K+ K
Subjt:  IAELPQQKALLPANLENTK----RKRSTKQQK

Arabidopsis top hits (e-value, % identity, alignment)
AT1G21710.1 8-oxoguanine-DNA glycosylase 1 | e-value: 2.8e-121 | % identity: 63.43
Query:  KPTPPSTPSTKPSPPPP-SPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHL
        +P P S PS   +  PP SPP   +   K     LH +                  +  W  L LT ++L+LPLTFPTGQTFRWK+T  ++++G +G HL
Subjt:  KPTPPSTPSTKPSPPPP-SPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQTFRWKQTNPLEFTGVVGSHL

Query:  ISLNHLPNGD-VSFCLHFSSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSL
        +SL   P  D VS+C+H S++  S  A LALLDFLNA ISL+ +W  FS  DPRF  LARH  GARVLRQDPLECLIQFLCSSNNNI RITKMVD++SSL
Subjt:  ISLNHLPNGD-VSFCLHFSSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYISSL

Query:  GNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIP
        G +LG++ GF+F++FPSL+RLS VSE E R+AGFGYRAKYI G VNAL+AKP GG EWLLSLR  +L+E + AL TLPGVGPKVAAC+ALFSLDQH AIP
Subjt:  GNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIP

Query:  VDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALL
        VDTHVWQIAT YL+P+LAGA+LTPKL  RVAEAFVSKYG+YAGWAQTLLFIAELP QK LL
Subjt:  VDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALL

AT3G47830.1 DNA glycosylase superfamily protein | e-value: 8.0e-07 | % identity: 46.38
Query:  LRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRY-LVPELAGARLTPKLCNR
        LR   +EEV   LS   GVGPK  +CV +F+L QH+  PVDTHV++IA     VP+ A    T    NR
Subjt:  LRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRY-LVPELAGARLTPKLCNR
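
The % identity values in these tables appear to follow the usual BLAST convention: identical positions divided by alignment length. The sketch below checks that reading against the AT3G47830.1 hit. It is an illustration under that assumption, not the database's own method. The function name `percent_identity` is hypothetical, and the code counts letters in the midline as identities while treating `+` and spaces as non-identical positions.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline shows an identical residue.

    Assumes a BLAST-style midline: a letter marks an identity, '+' a
    conservative substitution, and a space a mismatch or gap.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Query and midline lines copied from the AT3G47830.1 alignment above.
query = "LRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRY-LVPELAGARLTPKLCNR"
midline = "LR   +EEV   LS   GVGPK  +CV +F+L QH+  PVDTHV++IA     VP+ A    T    NR"
print(percent_identity(query, midline))
```

For this hit the midline has 32 identities over 69 alignment columns, which rounds to the 46.38 reported in the table.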


Sequences
CDS sequence
ATGGTCAGTATTTGGGGGAATGCACCGCTCATAACCCTCCAAATGCCTTCATTGTCATTCAAACCCCTTCTTCTAATGACGAAGAGGCTCAAACCCACTCCTCCCTCCAC
TCCCTCCACCAAGCCATCGCCACCGCCTCCATCTCCTCCTACTCCTCAACTCTCCCATTCAAAACCAACCACTGTCTCCCTCCACCACTCGTCCAAAAACCCAAACAAAA
CCCTACCTCTCCTCAAATCCCCTCAATCCCCATCATCCTCCAACTGGGTCTCTCTAAATCTCACCAGATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACC
TTCCGTTGGAAACAAACCAACCCCCTTGAGTTCACCGGCGTCGTTGGCTCTCATCTCATCTCTCTCAACCATCTTCCAAACGGCGACGTTTCATTTTGCCTTCACTTTTC
TTCTACATCCTCCTCCGCCGCCGCCAGATTGGCATTACTTGATTTTCTTAATGCCGGCATCTCCCTGAGCTCCATTTGGGAGGTATTCTCGGCGGCTGATCCGAGATTCG
ATGCGTTGGCTCGCCATTTTGAGGGGGCTCGAGTTCTCAGGCAAGACCCACTTGAATGTTTGATTCAGTTTTTGTGTTCTTCCAATAATAATATTGGGAGAATCACCAAA
ATGGTGGATTACATCTCATCACTTGGGAATTATTTGGGTAATGTTGGAGGTTTTGATTTCTATGAATTCCCTTCTTTGGAGAGGCTGTCCTTGGTCTCTGAGGCTGAGCT
TAGAGAGGCAGGCTTTGGTTACAGGGCTAAATACATAATTGGCGCTGTGAATGCACTAAAAGCAAAACCTGTTGGAGGTGCAGAATGGCTTCTATCCCTTCGTGATTCGG
ATCTTGAAGAGGTGATCAAAGCCCTTTCTACTTTACCGGGCGTGGGTCCAAAGGTAGCAGCTTGTGTTGCCCTCTTCTCTCTCGACCAGCACCACGCCATTCCTGTTGAC
ACACATGTCTGGCAGATTGCTACAAGGTACCTTGTTCCTGAGCTTGCTGGTGCACGCCTAACACCAAAGCTTTGCAATCGTGTGGCTGAGGCATTTGTCAGCAAGTATGG
AAAATATGCTGGATGGGCTCAAACTCTACTTTTCATCGCTGAGTTGCCTCAACAGAAGGCCCTTTTACCAGCAAATCTTGAGAATACCAAAAGGAAAAGATCTACAAAGC
AGCAGAAAGATATGGCGCATGCTGGTAACATAGACCAATGTGAATAG
mRNA sequence
CTCTTTTACTAGATAACAAAATCTTAGAGCAAAATTAGAAAAATAACTCTTTTTCACATATTTGAAGAGGCAGTTTGGTCGTTGATGGTCAGTATTTGGGGGAATGCACC
GCTCATAACCCTCCAAATGCCTTCATTGTCATTCAAACCCCTTCTTCTAATGACGAAGAGGCTCAAACCCACTCCTCCCTCCACTCCCTCCACCAAGCCATCGCCACCGC
CTCCATCTCCTCCTACTCCTCAACTCTCCCATTCAAAACCAACCACTGTCTCCCTCCACCACTCGTCCAAAAACCCAAACAAAACCCTACCTCTCCTCAAATCCCCTCAA
TCCCCATCATCCTCCAACTGGGTCTCTCTAAATCTCACCAGATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGTTGGAAACAAACCAACCCCCT
TGAGTTCACCGGCGTCGTTGGCTCTCATCTCATCTCTCTCAACCATCTTCCAAACGGCGACGTTTCATTTTGCCTTCACTTTTCTTCTACATCCTCCTCCGCCGCCGCCA
GATTGGCATTACTTGATTTTCTTAATGCCGGCATCTCCCTGAGCTCCATTTGGGAGGTATTCTCGGCGGCTGATCCGAGATTCGATGCGTTGGCTCGCCATTTTGAGGGG
GCTCGAGTTCTCAGGCAAGACCCACTTGAATGTTTGATTCAGTTTTTGTGTTCTTCCAATAATAATATTGGGAGAATCACCAAAATGGTGGATTACATCTCATCACTTGG
GAATTATTTGGGTAATGTTGGAGGTTTTGATTTCTATGAATTCCCTTCTTTGGAGAGGCTGTCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGCTTTGGTTACAGGG
CTAAATACATAATTGGCGCTGTGAATGCACTAAAAGCAAAACCTGTTGGAGGTGCAGAATGGCTTCTATCCCTTCGTGATTCGGATCTTGAAGAGGTGATCAAAGCCCTT
TCTACTTTACCGGGCGTGGGTCCAAAGGTAGCAGCTTGTGTTGCCCTCTTCTCTCTCGACCAGCACCACGCCATTCCTGTTGACACACATGTCTGGCAGATTGCTACAAG
GTACCTTGTTCCTGAGCTTGCTGGTGCACGCCTAACACCAAAGCTTTGCAATCGTGTGGCTGAGGCATTTGTCAGCAAGTATGGAAAATATGCTGGATGGGCTCAAACTC
TACTTTTCATCGCTGAGTTGCCTCAACAGAAGGCCCTTTTACCAGCAAATCTTGAGAATACCAAAAGGAAAAGATCTACAAAGCAGCAGAAAGATATGGCGCATGCTGGT
AACATAGACCAATGTGAATAGCTATATTTAGTTGGTCCCAGGTTTTCGAGGTTGAGGTAAATCCTGATTTCTCTTGTAGAGCAAGATTCATAGGTTTGTTCAAAGATCAA
AGATGAAGATGCGAGAGGTTTGTTCAAAGTACAAATTACTGGTAAATGTGGGATACCGAAGGATGATTGATGTGATTAATGCTAAGCTTTGCTTGCTTCTGAACAAGTAA
AGTTTTCGATAGGAATATAGTACTTAAATATTTTGGTTTGCTTTGGCTCATTGATGGTGTTTCCTCCTTAAGGAAGAATTCAGAATTTATTCACCACTTTTGGGGAAGTC
TCACATTCCATTCCTTTGCAGCATTTCAATTGTAATATGTTAGGAAAGTTCAGTTGCAAAAAAATTTGATTTTCTGTAACAAGAGAACTGTGGTTGTATTTTATTGTAGA
GACTTGACATGAATCAATTAATAATTGAAGTATTGTGTAAATGTTTCTAGGACATCAAAAAGAATGAAATAAGCTGTGGTAAGGGAGAAGATTATTGCTAAAGCTATTTA
TATATTGAAAGCTACTGTGAATTATCATGCCATCTACTTCATGCTAGGCGTATTTGATCATTTATGTGCATTGTGCATCGACCAACAATGTTTTCGTGACATGTGGACAT
GGATCGCCTCGAAAGTTCAGATTTGAGATTGGAATCGAGCATTTGGCCCTCCCTAGACATGCCTGAAACATCACAATGCAATTTTCATGAAAAAACTCACATGTTCTACA
ATGGCTCGGGAGTTTGGCGAGTGACACAGTCCAATAGCATAGCTTTGACAATCACCAGATGGGGTACCAAAGCTTGCAAATAAGATGTTGGAGATTTTCTTTTTACTAGG
GCAGCTTAATTGGACCTTAGGCCTTCTGGTTCTGTTCTTCGCTCTTCTTACCTTTTGTTTCTTTGCACCCATCCACGAAGCTACTAAAGGATAATGTGATTCAGACACTT
GCCCACATGTTTTGGTAATCAAAACGGAATCCAAAGATATCTCAACTGGGTTTCCTGTTTCTTCTTCAAGAATAACTAACTGGTTGTCAGTTGGCTTAAGGAAGGAACGT
GGTACATTATACCTGGATTGAAGGAGAGACAAAATTGTATAAAGATGCAGGAAACTAGTGGAAGGAAATATGAGACATAAGGTATGATATAACACTAAATATCAAGGTGA
GTAAACCATTTACCATTTCTGTGAAGGCTCTCCTTTTGGGGTGAGGAACGAGACCCAGTACCGACCAATGCCCCGGCCGTTAACCCAAACTGCACCCTTCCCCATGGAAC
CAAGGTTCAGTGCAATTGGATCATCACCAGGAGGTGCATCAAACTGAGTCTATCCGATTAGCAATGAATGACAATGACAAAGGTCAGTTAATGAACATTTGAATTCAAGA
TAGTTGCTATTGATTATTGGTAGTACCTTGTACCATGTGAGCGGCTGAGAAGAGTTTCCTAACCTGCTCCACTGAACATTGCTTGACCCAGTGTCTAAAAATATTTGGGA
TTGCTCTCCTGATAGGCCAACCTTTGCAAAAATATTAACAAGAGTCCAGCAACTCTGGTCTCAAGAAATGC
Protein sequence
MVSIWGNAPLITLQMPSLSFKPLLLMTKRLKPTPPSTPSTKPSPPPPSPPTPQLSHSKPTTVSLHHSSKNPNKTLPLLKSPQSPSSSNWVSLNLTRSDLSLPLTFPTGQT
FRWKQTNPLEFTGVVGSHLISLNHLPNGDVSFCLHFSSTSSSAAARLALLDFLNAGISLSSIWEVFSAADPRFDALARHFEGARVLRQDPLECLIQFLCSSNNNIGRITK
MVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGAVNALKAKPVGGAEWLLSLRDSDLEEVIKALSTLPGVGPKVAACVALFSLDQHHAIPVD
THVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFIAELPQQKALLPANLENTKRKRSTKQQKDMAHAGNIDQCE
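
The CDS and protein sequences above can be compared by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state. The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first 30 and last 9 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 9 nucleotides of the CDS sequence listed above.
cds_start = "ATGGTCAGTATTTGGGGGAATGCACCGCTC"
cds_end = "TGTGAATAG"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues of the protein sequence (MVSIWGNAPL) and to its last two residues followed by the stop codon (CE*).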