; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1287 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1287
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionDNA-(apurinic or apyrimidinic site) lyase
Genome locationMC04:20841422..20845736
RNA-Seq ExpressionMC04g1287
SyntenyMC04g1287
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006289 - nucleotide-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0034039 - 8-oxo-7,8-dihydroguanine DNA N-glycosylase activity (molecular function)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR012904 - 8-oxoguanine DNA glycosylase, N-terminal
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149809.2 N-glycosylase/DNA lyase OGG1 [Cucumis sativus]1.27e-20579.53Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKRL+ TPPSTPS K SPPPP PP        PTT+S HHSS NP KT+P            WV LNLT+SDL LPLTFPTGQTFRWKQT P +FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNGDVSYC H  S+S+S   AAAR ALLDFLNA ISLS+IWEVFSAADPRF+ LARH  GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+F+EFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKP GGAEWLLSLRD DLEEVI+ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLF+A+LPQQKALL ++LE +TKRK+STKQQK+
Subjt:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

XP_008466739.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X1 [Cucumis melo]1.67e-20680.31Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKR + T PSTPS K SPPPP PP        PTT+S HHSS NP KT+             WV LNLT+SDL LPLTFPTGQTFRWKQT PL+FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNG+VSYC H  S+STSS+AAA R ALLDFLNAGISLS+IWEVFSAADPRF+ LARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+FHEFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRD DLEEVI ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVA+LPQQKALL + LE +TKRK+STKQQ++
Subjt:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

XP_016903621.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X2 [Cucumis melo]1.49e-20680.31Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKR + T PSTPS K SPPPP PP        PTT+S HHSS NP KT+             WV LNLT+SDL LPLTFPTGQTFRWKQT PL+FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNG+VSYC H  S+STSS+AAA R ALLDFLNAGISLS+IWEVFSAADPRF+ LARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+FHEFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRD DLEEVI ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVA+LPQQKALL + LE +TKRK+STKQQ++
Subjt:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

XP_022141451.1 N-glycosylase/DNA lyase OGG1 [Momordica charantia]2.28e-275100Show/hide
Query:  MFSADAPQPSRMHSLRPLPMTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGS
        MFSADAPQPSRMHSLRPLPMTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGS
Subjt:  MFSADAPQPSRMHSLRPLPMTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGS

Query:  HLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDY
        HLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDY
Subjt:  HLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDY

Query:  ISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQH
        ISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQH
Subjt:  ISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQH

Query:  HAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN
        HAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN
Subjt:  HAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN

XP_038885236.1 N-glycosylase/DNA lyase OGG1 isoform X2 [Benincasa hispida]5.29e-20979.29Show/hide
Query:  PSRMHSLRPLPMTKRLRTTPPSTPSVKSPPPPPPP-----------PTTISGHHSSSNP-KTVP-------KWVPLNLTKSDLFLPLTFPTGQTFRWKQT
        PS   + +PL MTKRLR TPPSTPS K PP P PP           PTT+S H+SS N  KT+         WV LNLTKS+L LPLTFPTGQTFRWKQT
Subjt:  PSRMHSLRPLPMTKRLRTTPPSTPSVKSPPPPPPP-----------PTTISGHHSSSNP-KTVP-------KWVPLNLTKSDLFLPLTFPTGQTFRWKQT

Query:  GPLQFTGAVGSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNN
         PLQFTG VGSHLISL  LPN DVSYC HS   STSS++AAAR ALLDFLNAGISLS+IWEVF AADPRF+VLARHL GARVLRQ PLECL+QFLCSSNN
Subjt:  GPLQFTGAVGSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNN

Query:  NIGRITKMVDYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVA
        NIGRITKMVDYISSLGNYLGN+ GF+F+EFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRDLDLEEVI+ALS+LPG+GPKVA
Subjt:  NIGRITKMVDYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVA

Query:  ACIALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEK
        AC+ALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVADLPQQKALL ++LE + KRK+STK QK+K
Subjt:  ACIALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEK

TrEMBL top hitse value%identityAlignment
A0A0A0KIU8 DNA-(apurinic or apyrimidinic site) lyase6.14e-20679.53Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKRL+ TPPSTPS K SPPPP PP        PTT+S HHSS NP KT+P            WV LNLT+SDL LPLTFPTGQTFRWKQT P +FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNGDVSYC H  S+S+S   AAAR ALLDFLNA ISLS+IWEVFSAADPRF+ LARH  GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+F+EFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKP GGAEWLLSLRD DLEEVI+ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLF+A+LPQQKALL ++LE +TKRK+STKQQK+
Subjt:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

A0A1S3CS00 DNA-(apurinic or apyrimidinic site) lyase8.06e-20780.31Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKR + T PSTPS K SPPPP PP        PTT+S HHSS NP KT+             WV LNLT+SDL LPLTFPTGQTFRWKQT PL+FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNG+VSYC H  S+STSS+AAA R ALLDFLNAGISLS+IWEVFSAADPRF+ LARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+FHEFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRD DLEEVI ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVA+LPQQKALL + LE +TKRK+STKQQ++
Subjt:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

A0A1S4E5V3 DNA-(apurinic or apyrimidinic site) lyase7.23e-20780.31Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKR + T PSTPS K SPPPP PP        PTT+S HHSS NP KT+             WV LNLT+SDL LPLTFPTGQTFRWKQT PL+FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNG+VSYC H  S+STSS+AAA R ALLDFLNAGISLS+IWEVFSAADPRF+ LARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+FHEFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRD DLEEVI ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVA+LPQQKALL + LE +TKRK+STKQQ++
Subjt:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

A0A5D3CBS3 DNA-(apurinic or apyrimidinic site) lyase7.23e-20780.31Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKR + T PSTPS K SPPPP PP        PTT+S HHSS NP KT+             WV LNLT+SDL LPLTFPTGQTFRWKQT PL+FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNG+VSYC H  S+STSS+AAA R ALLDFLNAGISLS+IWEVFSAADPRF+ LARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+FHEFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRD DLEEVI ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVA+LPQQKALL + LE +TKRK+STKQQ++
Subjt:  QHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

A0A6J1CJ84 DNA-(apurinic or apyrimidinic site) lyase1.10e-275100Show/hide
Query:  MFSADAPQPSRMHSLRPLPMTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGS
        MFSADAPQPSRMHSLRPLPMTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGS
Subjt:  MFSADAPQPSRMHSLRPLPMTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGS

Query:  HLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDY
        HLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDY
Subjt:  HLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDY

Query:  ISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQH
        ISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQH
Subjt:  ISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQH

Query:  HAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN
        HAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN
Subjt:  HAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN

SwissProt top hitse value%identityAlignment
O08760 N-glycosylase/DNA lyase4.3e-5238.26Show/hide
Query:  PTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNA
        P+++     SS+P     W  +   +S+L L L   +GQ+FRWK+  P  ++G +   + +L Q    D  YCT    + S  S       + L  +   
Subjt:  PTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNA

Query:  GISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLS-LVSEAELREAGFG
         +SL+ ++  +++ D  F+ +A+   G R+LRQ P ECL  F+CSSNNNI RIT MV+ +  + G  L  ++   +H FP+L  L+   +E  LR+ G G
Subjt:  GISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLS-LVSEAELREAGFG

Query:  YRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAE
        YRA+Y+  S KA+  + GG A WL  LR    EE   AL +LPG+G KVA CI L +LD+  A+PVD HVWQIA R     P+ + A+    L N+ +  
Subjt:  YRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAE

Query:  AFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQ
         F N +G YAGWAQ +LF ADL Q     S   E   KRKK +K+
Subjt:  AFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQ

O15527 N-glycosylase/DNA lyase2.3e-5036.28Show/hide
Query:  GHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNAGISLS
        GH + ++  T   W  +   +S+L L L  P+GQ+FRW++  P  ++G +   + +L Q    +  +CT    + S  S       +A+  +    ++L+
Subjt:  GHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNAGISLS

Query:  AIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLSLVS-EAELREAGFGYRAKY
         ++  + + D  F+ +A+   G R+LRQ P+ECL  F+CSSNNNI RIT MV+ +  + G  L  ++   +H FPSL+ L+    EA LR+ G GYRA+Y
Subjt:  AIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLSLVS-EAELREAGFGYRAKY

Query:  IVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVNK
        +  S +A+  + GG A WL  LR+   EE   AL  LPG+G KVA CI L +LD+  A+PVD H+W IA R     P  + A+  +P+    +   F + 
Subjt:  IVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVNK

Query:  YGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK
        +G YAGWAQ +LF ADL Q +       E   KR+K +K
Subjt:  YGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK

O70249 N-glycosylase/DNA lyase2.8e-5137.92Show/hide
Query:  WVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRF
        W  +   +S+L L L   +GQ+FRW++  P  ++G +   + +L Q    D  YCT    +            + L  +    +SL+ ++  +++ D  F
Subjt:  WVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRF

Query:  EVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLSLVS-EAELREAGFGYRAKYIVGSVKALEAKPG
        + +A+   G R+LRQ P ECL  F+CSSNNNI RIT MV+ +  + G  L  ++   +H FP+L  L+    E  LR+ G GYRA+Y+  S KA+  + G
Subjt:  EVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLSLVS-EAELREAGFGYRAKYIVGSVKALEAKPG

Query:  GGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVNKYGKYAGWAQTLLF
        G A WL  LR    EE   AL +LPG+G KVA CI L +LD+  A+PVD HVWQIA R     P+ +  +    L N+ +   F N +G YAGWAQ +LF
Subjt:  GGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVNKYGKYAGWAQTLLF

Query:  VADLPQQKALLSSHLEKSTKRKKSTKQ
         ADL QQ    +   E   KRKK +K+
Subjt:  VADLPQQKALLSSHLEKSTKRKKSTKQ

Q9FNY7 N-glycosylase/DNA lyase OGG18.4e-12564.71Show/hide
Query:  RTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGD-VSYCTHSESSS
        R  P S PS+ S   PP  P          +    PKW PL LT ++L LPLTFPTGQTFRWK+TG +Q++G +G HL+SL+Q P  D VSYC H  +S 
Subjt:  RTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGD-VSYCTHSESSS

Query:  TSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEFPSLE
         S     A  ALLDFLNA ISL+ +W  FS  DPRF  LARHL GARVLRQ PLECL+QFLCSSNNNI RITKMVD++SSLG +LG+++GFEFH+FPSL+
Subjt:  TSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEFPSLE

Query:  RLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYLVPELAG
        RLS VSE E R+AGFGYRAKYI G+V AL+AKPGGG EWLLSLR ++L+E + AL +LPG+GPKVAACIALFSLDQH AIPVDTHVWQIAT YL+P+LAG
Subjt:  RLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYLVPELAG

Query:  ARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK
        A+LTPKL  RVAEAFV+KYG+YAGWAQTLLF+A+LP QK LL S  +   K  +S +
Subjt:  ARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK

Q9V3I8 N-glycosylase/DNA lyase4.9e-4033.54Show/hide
Query:  LNLTKSDLFLPLTFPTGQTFRWKQT---GPLQFTGAVGSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALL-DFLNAGISLSAIWEVFSAADPRF-
        + L+  +  L  T   GQ+FRW+        ++ G V +    L+Q    + S+ T+    ++S  A     +L+ D+L     L    + + + D  F 
Subjt:  LNLTKSDLFLPLTFPTGQTFRWKQT---GPLQFTGAVGSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALL-DFLNAGISLSAIWEVFSAADPRF-

Query:  EVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVD-YISSLGNYLGNVEGFEFHEFPSLERLSLVS----EAELREAGFGYRAKYIVGSVKALEA
        + L++ +   R+L Q P E +  FLCS NNNI RI+ M++ + ++ G  +G+  G + + FP++ R   +      A+LR A FGYRAK+I  +++ ++ 
Subjt:  EVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVD-YISSLGNYLGNVEGFEFHEFPSLERLSLVS----EAELREAGFGYRAKYIVGSVKALEA

Query:  KPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVNKYGKYAGWAQTLL
        K  GG  W +SL+ +  E+  + L+ LPGIG KVA CI L S+    ++PVD H+++IA  Y +P L G + +T K+   V++ F   +GKYAGWAQ +L
Subjt:  KPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVNKYGKYAGWAQTLL

Query:  FVADLPQ-QKALLSSHLEKSTKRKK
        F ADL Q Q     +  +KS K+ K
Subjt:  FVADLPQ-QKALLSSHLEKSTKRKK

Arabidopsis top hitse value%identityAlignment
AT1G21710.1 8-oxoguanine-DNA glycosylase 16.0e-12664.71Show/hide
Query:  RTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGD-VSYCTHSESSS
        R  P S PS+ S   PP  P          +    PKW PL LT ++L LPLTFPTGQTFRWK+TG +Q++G +G HL+SL+Q P  D VSYC H  +S 
Subjt:  RTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGD-VSYCTHSESSS

Query:  TSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEFPSLE
         S     A  ALLDFLNA ISL+ +W  FS  DPRF  LARHL GARVLRQ PLECL+QFLCSSNNNI RITKMVD++SSLG +LG+++GFEFH+FPSL+
Subjt:  TSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEFPSLE

Query:  RLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYLVPELAG
        RLS VSE E R+AGFGYRAKYI G+V AL+AKPGGG EWLLSLR ++L+E + AL +LPG+GPKVAACIALFSLDQH AIPVDTHVWQIAT YL+P+LAG
Subjt:  RLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYLVPELAG

Query:  ARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK
        A+LTPKL  RVAEAFV+KYG+YAGWAQTLLF+A+LP QK LL S  +   K  +S +
Subjt:  ARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK

AT3G47830.1 DNA glycosylase superfamily protein1.5e-0744.93Show/hide
Query:  LRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRY-LVPELAGARLTPKLCNR
        LR L +EEV   LS   G+GPK  +C+ +F+L QH+  PVDTHV++IA     VP+ A    T    NR
Subjt:  LRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRY-LVPELAGARLTPKLCNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTCGGCGGACGCTCCCCAACCCTCCCGAATGCACTCACTCAGACCCCTTCCAATGACGAAGCGGCTCAGAACCACTCCACCGTCGACTCCCTCCGTCAAGTCTCC
GCCGCCGCCGCCGCCGCCGCCCACCACCATCTCCGGCCACCATTCGTCCAGCAACCCTAAAACCGTCCCCAAGTGGGTCCCCCTCAATCTCACAAAATCAGACCTCTTTT
TGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGGTGGAAGCAAACCGGGCCTCTCCAGTTCACCGGCGCTGTTGGGTCTCATCTCATATCTCTCAAGCAACTTCCAAAC
GGCGACGTTTCGTATTGCACTCACTCTGAGTCTTCTTCAACATCCTCCGCCGCCGCCGCCGCCAGGCAGGCCTTGCTTGATTTCCTTAACGCCGGAATCTCCCTGAGCGC
CATTTGGGAGGTTTTTTCGGCGGCTGATCCGAGGTTCGAGGTGTTGGCGCGCCATTTGGGGGGCGCTCGAGTTCTCAGGCAACACCCTCTCGAGTGTTTGGTTCAGTTTC
TATGTTCTTCAAACAACAATATTGGGAGAATCACCAAAATGGTGGATTACATCTCATCACTAGGGAATTATTTGGGCAATGTTGAAGGTTTTGAGTTCCATGAATTCCCC
TCTTTGGAGCGCTTGTCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGCTTTGGTTACAGGGCTAAATACATAGTTGGAAGTGTAAAGGCACTGGAAGCCAAACCTGG
GGGAGGTGCAGAATGGCTTCTGTCTCTTCGTGATTTGGATCTCGAAGAGGTGATTGATGCGCTTTCTTCGTTACCCGGCATCGGTCCGAAAGTGGCAGCTTGTATTGCTC
TCTTCTCCCTCGATCAGCATCATGCCATTCCTGTAGATACACATGTATGGCAGATTGCTACTAGGTACCTTGTCCCCGAGCTCGCTGGTGCACGTCTGACACCAAAGCTT
TGCAACCGTGTGGCTGAGGCATTCGTCAACAAGTATGGAAAATATGCTGGTTGGGCTCAAACTCTGCTTTTCGTAGCCGATTTACCTCAACAGAAAGCCCTCTTATCATC
ACATCTTGAGAAGAGTACTAAGAGGAAAAAATCTACAAAGCAGCAGAAGGAGAAGAACTGA
mRNA sequenceShow/hide mRNA sequence
GCCTTCTATTCTACTTAGCAACCAACTTGATTATGTGATTTGTCAATCAGGTTTATAGACAAATATTCCATGACACTCTCAAAAAATATATATTGCACTCTCAAAAGTAT
TTTTAAATGTTTTTTTAAAATGAGCCAAAAATACTAGAGAAATACTAAAACTACTCTCAAATATAACGATCTTGATATAAATGTAGGAACTGGCTCCAAAAATCACTGTT
GGTCGTTGGTGGTCAATGTTTTCGGCGGACGCTCCCCAACCCTCCCGAATGCACTCACTCAGACCCCTTCCAATGACGAAGCGGCTCAGAACCACTCCACCGTCGACTCC
CTCCGTCAAGTCTCCGCCGCCGCCGCCGCCGCCGCCCACCACCATCTCCGGCCACCATTCGTCCAGCAACCCTAAAACCGTCCCCAAGTGGGTCCCCCTCAATCTCACAA
AATCAGACCTCTTTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGGTGGAAGCAAACCGGGCCTCTCCAGTTCACCGGCGCTGTTGGGTCTCATCTCATATCTCTC
AAGCAACTTCCAAACGGCGACGTTTCGTATTGCACTCACTCTGAGTCTTCTTCAACATCCTCCGCCGCCGCCGCCGCCAGGCAGGCCTTGCTTGATTTCCTTAACGCCGG
AATCTCCCTGAGCGCCATTTGGGAGGTTTTTTCGGCGGCTGATCCGAGGTTCGAGGTGTTGGCGCGCCATTTGGGGGGCGCTCGAGTTCTCAGGCAACACCCTCTCGAGT
GTTTGGTTCAGTTTCTATGTTCTTCAAACAACAATATTGGGAGAATCACCAAAATGGTGGATTACATCTCATCACTAGGGAATTATTTGGGCAATGTTGAAGGTTTTGAG
TTCCATGAATTCCCCTCTTTGGAGCGCTTGTCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGCTTTGGTTACAGGGCTAAATACATAGTTGGAAGTGTAAAGGCACT
GGAAGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTGTCTCTTCGTGATTTGGATCTCGAAGAGGTGATTGATGCGCTTTCTTCGTTACCCGGCATCGGTCCGAAAGTGG
CAGCTTGTATTGCTCTCTTCTCCCTCGATCAGCATCATGCCATTCCTGTAGATACACATGTATGGCAGATTGCTACTAGGTACCTTGTCCCCGAGCTCGCTGGTGCACGT
CTGACACCAAAGCTTTGCAACCGTGTGGCTGAGGCATTCGTCAACAAGTATGGAAAATATGCTGGTTGGGCTCAAACTCTGCTTTTCGTAGCCGATTTACCTCAACAGAA
AGCCCTCTTATCATCACATCTTGAGAAGAGTACTAAGAGGAAAAAATCTACAAAGCAGCAGAAGGAGAAGAACTGATTCAAGAGGATCGTTCAAGTACAAACTGCTGGCA
AATGTGGGATACGGGAGATGAATGGTGATCGATGACCGAGTTTTGCTTGCTTTTGAAGGAGGTGTATATTTATTTTTTTTAATGTTCCATTTGGCAAAGAAATGCTCTTG
CTTTGAAAATCTTTCCCTAGAAACGTCCGAGGGAAGAGAATCGTTTGGCCAGGTGTGAGTTGGCATATTGATGTTGTTTTCTCCTGAAAAGAGGAATTCAAAATTTAATC
CCCATTATTAGGGAAGTTCATTGACATTCCAGCATTACAGTTGTAATAGTGTTGCTACTCTGAAACATTTCAATTGTAATATGGTAGGAAAATTCACTAGCAAAAAAACT
AATTTTCTGTGACAAGAGAACTGAGGTTGCATTTCATTGATGACATGAATCAATTAATACTTGATGTGTTGCATAATTGTTTCTAAGACCCCTAAAAGAATGAAACAAGC
TGTGATATTGCAGAAGATTATCTAAAGCTATTTATCTATAGAAAGTTATGGCGAATTATCTTGTCGTGTACTGCACGCGGGACATCTTCCATCATTATGTGCATTGTGCA
TCAACCAACAATGTTTTGGTGACGCGTGGACATGGATCGCCACGGAAGTTCAGATTTGTGATCGGAATCGAGCATTTAGTCTTTCCTAGACAAGCCTGAAACATCACAAC
ACAATTTTCATGTGAGTTATTGTCTTGTCTTTACTATCATTTATTTAGTGCAAGGCAATACTTTCAGAGTGGTAATATGTTTCAGGTAAAAAAACTCACATGCTCAACAA
TGGCTCTGGATTTTGGCGAGTGACATGCTCCAGTAGCATAGCTTTGACAGTCGCCAGTAGGGGTCCCAAAGCTTGCAAATAAGATGTTGGAGATGTTCTTGTCATGAGGG
CAGCTTAGTAGGAGCTTCGGCTTTCTGCTTCTGTTCTTACGGTTCGCTCTTTGTTTCTTTGCACCTATCCAAGAAGCTACTTGGGGATAATGTGATTCAGACACTTGCCC
ACATGTTTTGCTAATTGAAACGGCATCCAGTGATATCCCGAGCGGGTTTCCAGTTTCTTCTTCAAGAATAATCAACTGGTTTCCAGTTGGCTCGAGGAAGGAACGTGGTA
CTGTATACCATTTCTGTGAAGGCTCCCCTTTTGAAGTTAGGAAGGAGACCCAGTACCGGCCAATGCCCCTGCCGTTAACCCAAGCTGCACCCTTCCCCATGGAACCAAGG
TTCAAGGCGATCGGGTCATCACCCGGTGGTGCATCGAACCGTGTCTGCCAATTGGCGATGAATGAGAATGACAAAGTTATTAAACAACTGAATTCAATATAGAGTTGCTG
TAGAAATAGTCTAAAACGATTTATAATGTGTAGTACCTTATACCATGTGAGGGGCTGAGAAGAGTTTCCTAACCTGCTCCATTGAATTTCACTCGATCCAGTGTCTAAGA
ATATTAGTGATTGCTCTCCTACCAGGCCAACCTGATTTCCAAATCGTTCAAAAAAACCAGCAAAAATATGAAAGAAGGTTAGATTTTAAGTATATATTCATACACATGTT
GGGAGCTTAGAAAATTAAAGGACCTTGTATCCCCAAGATTTTGCAGAGAAATCTTCGCCTTGAATTTGCACTCTTCGCAGTCCAGCAACTCTGCGCTCGAGATATGCTCC
AGAATCCTTCATATTAGAACTATCAGAACTTACTAAGATGGTGTAACGTGATGATTTTGTTGCAAATATCTTTCAAAGTCAAAAATACATTAACTATGGAAGGGAAGAAG
AAGATTAAATATCCGTACCGGTAAGCCAACCATCACACTGAGCAACGAGATGTTGTTGATGCCATTTCTCAACGTAATACTTTTCTCCAGAGAGAAACTTCTTTCTTTGA
AAGTTCCATGGGCGGAGCCTATAGGGTAATCCTAGCAAAATGCTCAATAACAGTTTCTCCGAATTTCAACCATAAAAAGGAAGACAAGATTAACCTGCATAAACTCCATT
GA
Protein sequenceShow/hide protein sequence
MFSADAPQPSRMHSLRPLPMTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPN
GDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEFP
SLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKL
CNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN