; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g31430 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g31430
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-(apurinic or apyrimidinic site) lyase
Genome locationchr4:23606572..23608303
RNA-Seq ExpressionMoc04g31430
SyntenyMoc04g31430
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006289 - nucleotide-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0034039 - 8-oxo-7,8-dihydroguanine DNA N-glycosylase activity (molecular function)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR012904 - 8-oxoguanine DNA glycosylase, N-terminal
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466739.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X1 [Cucumis melo]6.9e-16177.89Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTV-----------PKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKR + T PSTPS K SPPPP PP        PTT+S HHSS NP KT+             WV LNLT+SDL LPLTFPTGQTFRWKQT PL+FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTV-----------PKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNG+VSYC H  S+STSS +AAAR ALLDFLNAGISLS+IWEVFSAADPRF+ LARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+FHEFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRD DLEEVI ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVA+LPQQKALL + LE +TKRK+STKQQ++
Subjt:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

XP_016903621.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X2 [Cucumis melo]6.9e-16177.89Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTV-----------PKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKR + T PSTPS K SPPPP PP        PTT+S HHSS NP KT+             WV LNLT+SDL LPLTFPTGQTFRWKQT PL+FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTV-----------PKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNG+VSYC H  S+STSS +AAAR ALLDFLNAGISLS+IWEVFSAADPRF+ LARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+FHEFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRD DLEEVI ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVA+LPQQKALL + LE +TKRK+STKQQ++
Subjt:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

XP_022141451.1 N-glycosylase/DNA lyase OGG1 [Momordica charantia]1.1e-20196.83Show/hide
Query:  MTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCTHS
        MTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCTHS
Subjt:  MTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCTHS

Query:  ESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEF
        ESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEF
Subjt:  ESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEF

Query:  PSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLV
        PSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQ        
Subjt:  PSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLV

Query:  WYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN
            IATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN
Subjt:  WYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN

XP_038885235.1 N-glycosylase/DNA lyase OGG1 isoform X1 [Benincasa hispida]1.3e-16779.85Show/hide
Query:  MTKRLRTTPPSTPSVKSPPPPPPP-----------PTTISGHHSSSN-PKTVP-------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGS
        MTKRLR TPPSTPS K PP P PP           PTT+S H+SS N  KT+         WV LNLTKS+L LPLTFPTGQTFRWKQT PLQFTG VGS
Subjt:  MTKRLRTTPPSTPSVKSPPPPPPP-----------PTTISGHHSSSN-PKTVP-------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGS

Query:  HLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDY
        HLISL  LPN DVSYC H  S STSS++AAAR ALLDFLNAGISLS+IWEVF AADPRF+VLARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMVDY
Subjt:  HLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDY

Query:  ISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQH
        ISSLGNYLGN+ GF+F+EFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRDLDLEEVI+ALS+LPG+GPKVAAC+ALFSLDQH
Subjt:  ISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQH

Query:  HAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEK
        HAIPVDTHVWQLI + LL W EKIATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVADLPQQKALL ++LE + KRK+STK QK+K
Subjt:  HAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEK

XP_038885236.1 N-glycosylase/DNA lyase OGG1 isoform X2 [Benincasa hispida]4.1e-16178.09Show/hide
Query:  MTKRLRTTPPSTPSVKSPPPPPPP-----------PTTISGHHSSSN-PKTVP-------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGS
        MTKRLR TPPSTPS K PP P PP           PTT+S H+SS N  KT+         WV LNLTKS+L LPLTFPTGQTFRWKQT PLQFTG VGS
Subjt:  MTKRLRTTPPSTPSVKSPPPPPPP-----------PTTISGHHSSSN-PKTVP-------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGS

Query:  HLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDY
        HLISL  LPN DVSYC H  S STSS++AAAR ALLDFLNAGISLS+IWEVF AADPRF+VLARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMVDY
Subjt:  HLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDY

Query:  ISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQH
        ISSLGNYLGN+ GF+F+EFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRDLDLEEVI+ALS+LPG+GPKVAAC+ALFSLDQH
Subjt:  ISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQH

Query:  HAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEK
        HAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVADLPQQKALL ++LE + KRK+STK QK+K
Subjt:  HAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEK

TrEMBL top hitse value%identityAlignment
A0A0A0KIU8 DNA-(apurinic or apyrimidinic site) lyase2.2e-16077.14Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKRL+ TPPSTPS K SPPPP PP        PTT+S HHSS NP KT+P            WV LNLT+SDL LPLTFPTGQTFRWKQT P +FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTVP-----------KWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNGDVSYC H  S+S+S   AAAR ALLDFLNA ISLS+IWEVFSAADPRF+ LARH  GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+F+EFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKP GGAEWLLSLRD DLEEVI+ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLF+A+LPQQKALL ++LE +TKRK+STKQQK+
Subjt:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

A0A1S3CS00 DNA-(apurinic or apyrimidinic site) lyase3.4e-16177.89Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTV-----------PKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKR + T PSTPS K SPPPP PP        PTT+S HHSS NP KT+             WV LNLT+SDL LPLTFPTGQTFRWKQT PL+FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTV-----------PKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNG+VSYC H  S+STSS +AAAR ALLDFLNAGISLS+IWEVFSAADPRF+ LARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+FHEFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRD DLEEVI ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVA+LPQQKALL + LE +TKRK+STKQQ++
Subjt:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

A0A1S4E5V3 DNA-(apurinic or apyrimidinic site) lyase3.4e-16177.89Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTV-----------PKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKR + T PSTPS K SPPPP PP        PTT+S HHSS NP KT+             WV LNLT+SDL LPLTFPTGQTFRWKQT PL+FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTV-----------PKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNG+VSYC H  S+STSS +AAAR ALLDFLNAGISLS+IWEVFSAADPRF+ LARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+FHEFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRD DLEEVI ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVA+LPQQKALL + LE +TKRK+STKQQ++
Subjt:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

A0A5D3CBS3 DNA-(apurinic or apyrimidinic site) lyase3.4e-16177.89Show/hide
Query:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTV-----------PKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV
        MTKR + T PSTPS K SPPPP PP        PTT+S HHSS NP KT+             WV LNLT+SDL LPLTFPTGQTFRWKQT PL+FTG V
Subjt:  MTKRLRTTPPSTPSVK-SPPPPPPP--------PTTISGHHSSSNP-KTV-----------PKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAV

Query:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV
        GSHLISL  LPNG+VSYC H  S+STSS +AAAR ALLDFLNAGISLS+IWEVFSAADPRF+ LARHL GARVLRQ PLECL+QFLCSSNNNIGRITKMV
Subjt:  GSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMV

Query:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD
        DYISSLGNYLGNV GF+FHEFPSLERLSLVSEAELREAGFGYRAKYI+G+V AL+AKPGGGAEWLLSLRD DLEEVI ALS+LPG+GPKVAAC+ALFSLD
Subjt:  DYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLD

Query:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE
        QHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFV+KYGKYAGWAQTLLFVA+LPQQKALL + LE +TKRK+STKQQ++
Subjt:  QHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKE

A0A6J1CJ84 DNA-(apurinic or apyrimidinic site) lyase5.1e-20296.83Show/hide
Query:  MTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCTHS
        MTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCTHS
Subjt:  MTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCTHS

Query:  ESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEF
        ESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEF
Subjt:  ESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEF

Query:  PSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLV
        PSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQ        
Subjt:  PSLERLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLV

Query:  WYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN
            IATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN
Subjt:  WYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKN

SwissProt top hitse value%identityAlignment
O08760 N-glycosylase/DNA lyase1.4e-5036.9Show/hide
Query:  PTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNA
        P+++     SS+P     W  +   +S+L L L   +GQ+FRWK+  P  ++G +   + +L Q    D  YCT    + S  S       + L  +   
Subjt:  PTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNA

Query:  GISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLS-LVSEAELREAGFG
         +SL+ ++  +++ D  F+ +A+   G R+LRQ P ECL  F+CSSNNNI RIT MV+ +  + G  L  ++   +H FP+L  L+   +E  LR+ G G
Subjt:  GISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLS-LVSEAELREAGFG

Query:  YRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLT
        YRA+Y+  S KA+  + GG A WL  LR    EE   AL +LPG+G KVA CI L +LD+  A+PVD HVWQ+  R    W+         P+ + A+  
Subjt:  YRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLT

Query:  PKLCNR-VAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQ
          L N+ +   F N +G YAGWAQ +LF ADL Q     S   E   KRKK +K+
Subjt:  PKLCNR-VAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQ

O15527 N-glycosylase/DNA lyase9.7e-4934.96Show/hide
Query:  GHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNAGISLS
        GH + ++  T   W  +   +S+L L L  P+GQ+FRW++  P  ++G +   + +L Q    +  +CT    + S  S       +A+  +    ++L+
Subjt:  GHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNAGISLS

Query:  AIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLSLVS-EAELREAGFGYRAKY
         ++  + + D  F+ +A+   G R+LRQ P+ECL  F+CSSNNNI RIT MV+ +  + G  L  ++   +H FPSL+ L+    EA LR+ G GYRA+Y
Subjt:  AIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLSLVS-EAELREAGFGYRAKY

Query:  IVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGAR-LTPKLC
        +  S +A+  + GG A WL  LR+   EE   AL  LPG+G KVA CI L +LD+  A+PVD H+W +  R    W+         P  + A+  +P+  
Subjt:  IVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGAR-LTPKLC

Query:  NRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK
          +   F + +G YAGWAQ +LF ADL Q +       E   KR+K +K
Subjt:  NRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK

O70249 N-glycosylase/DNA lyase2.0e-4936.9Show/hide
Query:  WVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRF
        W  +   +S+L L L   +GQ+FRW++  P  ++G +   + +L Q    D  YCT    +            + L  +    +SL+ ++  +++ D  F
Subjt:  WVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCT--HSESSSTSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRF

Query:  EVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLSLVS-EAELREAGFGYRAKYIVGSVKALEAKPG
        + +A+   G R+LRQ P ECL  F+CSSNNNI RIT MV+ +  + G  L  ++   +H FP+L  L+    E  LR+ G GYRA+Y+  S KA+  + G
Subjt:  EVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVEGFEFHEFPSLERLSLVS-EAELREAGFGYRAKYIVGSVKALEAKPG

Query:  GGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKY
        G A WL  LR    EE   AL +LPG+G KVA CI L +LD+  A+PVD HVWQ+  R    W  K +       LA   L           F N +G Y
Subjt:  GGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEAFVNKYGKY

Query:  AGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQ
        AGWAQ +LF ADL QQ    +   E   KRKK +K+
Subjt:  AGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQ

Q9FNY7 N-glycosylase/DNA lyase OGG13.9e-12262.6Show/hide
Query:  RTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGD-VSYCTHSESSS
        R  P S PS+ S   PP  P          +    PKW PL LT ++L LPLTFPTGQTFRWK+TG +Q++G +G HL+SL+Q P  D VSYC H  +S 
Subjt:  RTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGD-VSYCTHSESSS

Query:  TSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEFPSLE
         S     A  ALLDFLNA ISL+ +W  FS  DPRF  LARHL GARVLRQ PLECL+QFLCSSNNNI RITKMVD++SSLG +LG+++GFEFH+FPSL+
Subjt:  TSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEFPSLE

Query:  RLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEK
        RLS VSE E R+AGFGYRAKYI G+V AL+AKPGGG EWLLSLR ++L+E + AL +LPG+GPKVAACIALFSLDQH AIPVDTHVWQ            
Subjt:  RLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEK

Query:  IATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK
        IAT YL+P+LAGA+LTPKL  RVAEAFV+KYG+YAGWAQTLLF+A+LP QK LL S  +   K  +S +
Subjt:  IATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK

Q9V3I8 N-glycosylase/DNA lyase1.3e-3732.34Show/hide
Query:  LNLTKSDLFLPLTFPTGQTFRWKQT---GPLQFTGAVGSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALL-DFLNAGISLSAIWEVFSAADPRF-
        + L+  +  L  T   GQ+FRW+        ++ G V +    L+Q    + S+ T+    ++S  A     +L+ D+L     L    + + + D  F 
Subjt:  LNLTKSDLFLPLTFPTGQTFRWKQT---GPLQFTGAVGSHLISLKQLPNGDVSYCTHSESSSTSSAAAAARQALL-DFLNAGISLSAIWEVFSAADPRF-

Query:  EVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVD-YISSLGNYLGNVEGFEFHEFPSLERLSLVS----EAELREAGFGYRAKYIVGSVKALEA
        + L++ +   R+L Q P E +  FLCS NNNI RI+ M++ + ++ G  +G+  G + + FP++ R   +      A+LR A FGYRAK+I  +++ ++ 
Subjt:  EVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVD-YISSLGNYLGNVEGFEFHEFPSLERLSLVS----EAELREAGFGYRAKYIVGSVKALEA

Query:  KPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGAR-LTPKLCNRVAEAFVNK
        K  GG  W +SL+ +  E+  + L+ LPGIG KVA CI L S+    ++PVD H++            +IA  Y +P L G + +T K+   V++ F   
Subjt:  KPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGAR-LTPKLCNRVAEAFVNK

Query:  YGKYAGWAQTLLFVADLPQ-QKALLSSHLEKSTKRKK
        +GKYAGWAQ +LF ADL Q Q     +  +KS K+ K
Subjt:  YGKYAGWAQTLLFVADLPQ-QKALLSSHLEKSTKRKK

Arabidopsis top hitse value%identityAlignment
AT1G05900.1 endonuclease III 27.5e-0434.92Show/hide
Query:  DLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATRYLV
        D D+   ++ L SLPG+GPK+A  +   + +    I VDTHV ++  R  L W  K  T+ ++
Subjt:  DLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATRYLV

AT1G05900.2 endonuclease III 27.5e-0436.67Show/hide
Query:  DLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATR
        D D+   ++ L SLPG+GPK+A  +   + +    I VDTHV ++  R  L W  K  T+
Subjt:  DLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATR

AT1G21710.1 8-oxoguanine-DNA glycosylase 12.7e-12362.6Show/hide
Query:  RTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGD-VSYCTHSESSS
        R  P S PS+ S   PP  P          +    PKW PL LT ++L LPLTFPTGQTFRWK+TG +Q++G +G HL+SL+Q P  D VSYC H  +S 
Subjt:  RTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGD-VSYCTHSESSS

Query:  TSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEFPSLE
         S     A  ALLDFLNA ISL+ +W  FS  DPRF  LARHL GARVLRQ PLECL+QFLCSSNNNI RITKMVD++SSLG +LG+++GFEFH+FPSL+
Subjt:  TSSAAAAARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEFPSLE

Query:  RLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEK
        RLS VSE E R+AGFGYRAKYI G+V AL+AKPGGG EWLLSLR ++L+E + AL +LPG+GPKVAACIALFSLDQH AIPVDTHVWQ            
Subjt:  RLSLVSEAELREAGFGYRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEK

Query:  IATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK
        IAT YL+P+LAGA+LTPKL  RVAEAFV+KYG+YAGWAQTLLF+A+LP QK LL S  +   K  +S +
Subjt:  IATRYLVPELAGARLTPKLCNRVAEAFVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTK

AT3G47830.1 DNA glycosylase superfamily protein5.5e-0745.16Show/hide
Query:  LRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATR
        LR L +EEV   LS   G+GPK  +C+ +F+L QH+  PVDTHV+++     L W  K A R
Subjt:  LRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGAAGCGGCTCAGAACCACTCCACCGTCGACTCCCTCCGTCAAGTCTCCGCCGCCGCCGCCGCCGCCGCCCACCACCATCTCCGGCCACCATTCGTCCAGCAACCC
TAAAACCGTCCCCAAGTGGGTCCCCCTCAATCTCACAAAATCAGACCTCTTTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGGTGGAAGCAAACCGGGCCTCTCC
AGTTCACCGGCGCTGTTGGGTCTCATCTCATATCTCTCAAGCAACTTCCAAACGGCGACGTTTCGTATTGCACTCACTCTGAGTCTTCTTCAACATCCTCCGCCGCCGCC
GCCGCCAGGCAGGCCTTGCTTGATTTCCTTAACGCCGGAATCTCCCTGAGCGCCATTTGGGAGGTTTTTTCGGCGGCTGATCCGAGGTTCGAGGTGTTGGCGCGCCATTT
GGGGGGCGCTCGAGTTCTCAGGCAACACCCTCTCGAGTGTTTGGTTCAGTTTCTATGTTCTTCAAACAACAATATTGGGAGAATCACCAAAATGGTGGATTACATCTCAT
CACTAGGGAATTATTTGGGCAATGTTGAAGGTTTTGAGTTCCATGAATTCCCCTCTTTGGAGCGCTTGTCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGCTTTGGT
TACAGGGCTAAATACATAGTTGGAAGTGTAAAGGCACTGGAAGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTGTCTCTTCGTGATTTGGATCTCGAAGAGGTGATTGA
TGCGCTTTCTTCGTTACCCGGCATCGGTCCGAAAGTGGCAGCTTGTATTGCTCTCTTCTCCCTCGATCAGCATCATGCCATTCCTGTAGATACACATGTATGGCAGTTGA
TTGGAAGGCTTCTTCTTGTGTGGTATGAAAAGATTGCTACTAGGTACCTTGTCCCCGAGCTCGCTGGTGCACGTCTGACACCAAAGCTTTGCAACCGTGTGGCTGAGGCA
TTCGTCAACAAGTATGGAAAATATGCTGGTTGGGCTCAAACTCTGCTTTTCGTAGCCGATTTACCTCAACAGAAAGCCCTCTTATCATCACATCTTGAGAAGAGTACTAA
GAGGAAAAAATCTACAAAGCAGCAGAAGGAGAAGAACTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACGAAGCGGCTCAGAACCACTCCACCGTCGACTCCCTCCGTCAAGTCTCCGCCGCCGCCGCCGCCGCCGCCCACCACCATCTCCGGCCACCATTCGTCCAGCAACCC
TAAAACCGTCCCCAAGTGGGTCCCCCTCAATCTCACAAAATCAGACCTCTTTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGGTGGAAGCAAACCGGGCCTCTCC
AGTTCACCGGCGCTGTTGGGTCTCATCTCATATCTCTCAAGCAACTTCCAAACGGCGACGTTTCGTATTGCACTCACTCTGAGTCTTCTTCAACATCCTCCGCCGCCGCC
GCCGCCAGGCAGGCCTTGCTTGATTTCCTTAACGCCGGAATCTCCCTGAGCGCCATTTGGGAGGTTTTTTCGGCGGCTGATCCGAGGTTCGAGGTGTTGGCGCGCCATTT
GGGGGGCGCTCGAGTTCTCAGGCAACACCCTCTCGAGTGTTTGGTTCAGTTTCTATGTTCTTCAAACAACAATATTGGGAGAATCACCAAAATGGTGGATTACATCTCAT
CACTAGGGAATTATTTGGGCAATGTTGAAGGTTTTGAGTTCCATGAATTCCCCTCTTTGGAGCGCTTGTCCTTGGTCTCTGAGGCTGAGCTTAGAGAGGCAGGCTTTGGT
TACAGGGCTAAATACATAGTTGGAAGTGTAAAGGCACTGGAAGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTGTCTCTTCGTGATTTGGATCTCGAAGAGGTGATTGA
TGCGCTTTCTTCGTTACCCGGCATCGGTCCGAAAGTGGCAGCTTGTATTGCTCTCTTCTCCCTCGATCAGCATCATGCCATTCCTGTAGATACACATGTATGGCAGTTGA
TTGGAAGGCTTCTTCTTGTGTGGTATGAAAAGATTGCTACTAGGTACCTTGTCCCCGAGCTCGCTGGTGCACGTCTGACACCAAAGCTTTGCAACCGTGTGGCTGAGGCA
TTCGTCAACAAGTATGGAAAATATGCTGGTTGGGCTCAAACTCTGCTTTTCGTAGCCGATTTACCTCAACAGAAAGCCCTCTTATCATCACATCTTGAGAAGAGTACTAA
GAGGAAAAAATCTACAAAGCAGCAGAAGGAGAAGAACTGGTAA
Protein sequenceShow/hide protein sequence
MTKRLRTTPPSTPSVKSPPPPPPPPTTISGHHSSSNPKTVPKWVPLNLTKSDLFLPLTFPTGQTFRWKQTGPLQFTGAVGSHLISLKQLPNGDVSYCTHSESSSTSSAAA
AARQALLDFLNAGISLSAIWEVFSAADPRFEVLARHLGGARVLRQHPLECLVQFLCSSNNNIGRITKMVDYISSLGNYLGNVEGFEFHEFPSLERLSLVSEAELREAGFG
YRAKYIVGSVKALEAKPGGGAEWLLSLRDLDLEEVIDALSSLPGIGPKVAACIALFSLDQHHAIPVDTHVWQLIGRLLLVWYEKIATRYLVPELAGARLTPKLCNRVAEA
FVNKYGKYAGWAQTLLFVADLPQQKALLSSHLEKSTKRKKSTKQQKEKNW