CuGenDBv2

Sed0022381 (gene) of Chayote v1 genome

Gene ID: Sed0022381
Organism: Sechium edule (Chayote v1)
Description: DNA-(apurinic or apyrimidinic site) lyase
Genome location: LG01:42323008..42325037
RNA-Seq Expression: Sed0022381
Synteny: Sed0022381
Gene Ontology terms:
GO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006289 - nucleotide-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0034039 - 8-oxo-7,8-dihydroguanine DNA N-glycosylase activity (molecular function)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domains:
IPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR012904 - 8-oxoguanine DNA glycosylase, N-terminal
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
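
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, the string can be split into its parts and the gene span computed. The function name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(loc):
    """Split a 'sequence:start..end' string into its parts.

    Assumption (not stated in the record): coordinates are 1-based
    and inclusive, so the span is end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    seq_id, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return seq_id, start, end, end - start + 1

# Location string copied from this record.
print(parse_location("LG01:42323008..42325037"))
```

Under the inclusive-coordinate assumption the gene spans 2030 bp on LG01.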


Homology
GenBank top hits (e value, % identity, alignment)
XP_004149809.2 N-glycosylase/DNA lyase OGG1 [Cucumis sativus]; e value 4.6e-168; identity 78.02%
Query:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSLRQPKRGLTLTL-----------SNWVPLNLSRSDLSLPLTFPTGQTFRWK
        M + S  P L M+KRL+ TPPSTPS K   PPP PPTPQLS SK T VSL    +    TL           SNWV LNL+RSDLSLPLTFPTGQTFRWK
Subjt:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSLRQPKRGLTLTL-----------SNWVPLNLSRSDLSLPLTFPTGQTFRWK

Query:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNN
        QT   +FTGVVG HLISLN LP   VSYCLH SS+SS SAAAR ALLDFLNA ISL ++WEVFSAADP F ALAR  EGARVLRQ PLECL+QFLCSSNN
Subjt:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNN

Query:  NIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVA
        NI RITKMVDYISSLGNYLGNVGGFDF+EFP+LERLSLVSEAELREAGFGYRAKYIIG +NAL+AKP GGAEWLLSLRD DLEEVI+ALSTLPGVGPKVA
Subjt:  NIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVA

Query:  ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQAGN
        ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+A+LPQQKALL ++++N KRK+STK++K+   AGN
Subjt:  ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQAGN

Query:  MNQCE
        ++QCE
Subjt:  MNQCE

XP_008466739.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X1 [Cucumis melo]; e value 7.7e-171; identity 79.06%
Query:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK
        M + S  P L M+KR + T PSTPS K   PPP PPTPQLS SK T VS+    + P + LTL       + SNWV LNL+RSDLSLPLTFPTGQTFRWK
Subjt:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK

Query:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN
        QT  L+FTGVVG HLISLN LP   VSYCLH SS+S+S+SAAAR ALLDFLNAGISL ++WEVFSAADP F ALAR LEGARVLRQ PLECL+QFLCSSN
Subjt:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN

Query:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV
        NNI RITKMVDYISSLGNYLGNVGGFDFHEFP+LERLSLVSEAELREAGFGYRAKYIIGT+NAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Subjt:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQAG
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL ++++N KRK+STK++++   AG
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQAG

Query:  NMNQCE
        N++QCE
Subjt:  NMNQCE

XP_016903621.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X2 [Cucumis melo]; e value 3.0e-167; identity 79.2%
Query:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK
        M + S  P L M+KR + T PSTPS K   PPP PPTPQLS SK T VS+    + P + LTL       + SNWV LNL+RSDLSLPLTFPTGQTFRWK
Subjt:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK

Query:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN
        QT  L+FTGVVG HLISLN LP   VSYCLH SS+S+S+SAAAR ALLDFLNAGISL ++WEVFSAADP F ALAR LEGARVLRQ PLECL+QFLCSSN
Subjt:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN

Query:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV
        NNI RITKMVDYISSLGNYLGNVGGFDFHEFP+LERLSLVSEAELREAGFGYRAKYIIGT+NAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Subjt:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQA
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL ++++N KRK+STK++++   A
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQA

XP_038885235.1 N-glycosylase/DNA lyase OGG1 isoform X1 [Benincasa hispida]; e value 1.8e-167; identity 76.89%
Query:  TFSSTPFLQMSKRLRTTPPSTPSAKP-----PPPPPTPQLSRSKGTAVSLRQPKRGLTLTLS-------NWVPLNLSRSDLSLPLTFPTGQTFRWKQTAA
        +F+  P L M+KRLR TPPSTPSAKP     PP PPTPQLS SK T VS+    +    TL+       NWV LNL++S+L+LPLTFPTGQTFRWKQT+ 
Subjt:  TFSSTPFLQMSKRLRTTPPSTPSAKP-----PPPPPTPQLSRSKGTAVSLRQPKRGLTLTLS-------NWVPLNLSRSDLSLPLTFPTGQTFRWKQTAA

Query:  LQFTGVVGPHLISLNQLPKAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIAR
        LQFTGVVG HLISLN LP + VSYCLHS S+SS+SAAAR ALLDFLNAGISL ++WEVF AADP F  LAR LEGARVLRQ PLECL+QFLCSSNNNI R
Subjt:  LQFTGVVGPHLISLNQLPKAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIAR

Query:  ITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVA
        ITKMVDYISSLGNYLGN+GGFDF+EFP+LERLSLVSEAELREAGFGYRAKYIIG +NAL+AKPGGGAEWLLSLRDLDLEEVI+ALSTLPGVGPKVAACVA
Subjt:  ITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVA

Query:  LFSLDQHHAIPVDTHVWQ------------IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRR
        LFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALL ++++NAKRK+STK +
Subjt:  LFSLDQHHAIPVDTHVWQ------------IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRR

Query:  KEKEQAGNMNQ
        K+K   GN++Q
Subjt:  KEKEQAGNMNQ

XP_038885236.1 N-glycosylase/DNA lyase OGG1 isoform X2 [Benincasa hispida]; e value 3.8e-170; identity 79.2%
Query:  TFSSTPFLQMSKRLRTTPPSTPSAKP-----PPPPPTPQLSRSKGTAVSLRQPKRGLTLTLS-------NWVPLNLSRSDLSLPLTFPTGQTFRWKQTAA
        +F+  P L M+KRLR TPPSTPSAKP     PP PPTPQLS SK T VS+    +    TL+       NWV LNL++S+L+LPLTFPTGQTFRWKQT+ 
Subjt:  TFSSTPFLQMSKRLRTTPPSTPSAKP-----PPPPPTPQLSRSKGTAVSLRQPKRGLTLTLS-------NWVPLNLSRSDLSLPLTFPTGQTFRWKQTAA

Query:  LQFTGVVGPHLISLNQLPKAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIAR
        LQFTGVVG HLISLN LP + VSYCLHS S+SS+SAAAR ALLDFLNAGISL ++WEVF AADP F  LAR LEGARVLRQ PLECL+QFLCSSNNNI R
Subjt:  LQFTGVVGPHLISLNQLPKAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIAR

Query:  ITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVA
        ITKMVDYISSLGNYLGN+GGFDF+EFP+LERLSLVSEAELREAGFGYRAKYIIG +NAL+AKPGGGAEWLLSLRDLDLEEVI+ALSTLPGVGPKVAACVA
Subjt:  ITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVA

Query:  LFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQAGNMNQ
        LFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALL ++++NAKRK+STK +K+K   GN++Q
Subjt:  LFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQAGNMNQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KIU8 DNA-(apurinic or apyrimidinic site) lyase; e value 2.2e-168; identity 78.02%
Query:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSLRQPKRGLTLTL-----------SNWVPLNLSRSDLSLPLTFPTGQTFRWK
        M + S  P L M+KRL+ TPPSTPS K   PPP PPTPQLS SK T VSL    +    TL           SNWV LNL+RSDLSLPLTFPTGQTFRWK
Subjt:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSLRQPKRGLTLTL-----------SNWVPLNLSRSDLSLPLTFPTGQTFRWK

Query:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNN
        QT   +FTGVVG HLISLN LP   VSYCLH SS+SS SAAAR ALLDFLNA ISL ++WEVFSAADP F ALAR  EGARVLRQ PLECL+QFLCSSNN
Subjt:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNN

Query:  NIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVA
        NI RITKMVDYISSLGNYLGNVGGFDF+EFP+LERLSLVSEAELREAGFGYRAKYIIG +NAL+AKP GGAEWLLSLRD DLEEVI+ALSTLPGVGPKVA
Subjt:  NIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVA

Query:  ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQAGN
        ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+A+LPQQKALL ++++N KRK+STK++K+   AGN
Subjt:  ACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQAGN

Query:  MNQCE
        ++QCE
Subjt:  MNQCE

A0A1S3CS00 DNA-(apurinic or apyrimidinic site) lyase; e value 3.7e-171; identity 79.06%
Query:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK
        M + S  P L M+KR + T PSTPS K   PPP PPTPQLS SK T VS+    + P + LTL       + SNWV LNL+RSDLSLPLTFPTGQTFRWK
Subjt:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK

Query:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN
        QT  L+FTGVVG HLISLN LP   VSYCLH SS+S+S+SAAAR ALLDFLNAGISL ++WEVFSAADP F ALAR LEGARVLRQ PLECL+QFLCSSN
Subjt:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN

Query:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV
        NNI RITKMVDYISSLGNYLGNVGGFDFHEFP+LERLSLVSEAELREAGFGYRAKYIIGT+NAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Subjt:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQAG
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL ++++N KRK+STK++++   AG
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQAG

Query:  NMNQCE
        N++QCE
Subjt:  NMNQCE

A0A1S4E5V3 DNA-(apurinic or apyrimidinic site) lyase; e value 1.5e-167; identity 79.2%
Query:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK
        M + S  P L M+KR + T PSTPS K   PPP PPTPQLS SK T VS+    + P + LTL       + SNWV LNL+RSDLSLPLTFPTGQTFRWK
Subjt:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK

Query:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN
        QT  L+FTGVVG HLISLN LP   VSYCLH SS+S+S+SAAAR ALLDFLNAGISL ++WEVFSAADP F ALAR LEGARVLRQ PLECL+QFLCSSN
Subjt:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN

Query:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV
        NNI RITKMVDYISSLGNYLGNVGGFDFHEFP+LERLSLVSEAELREAGFGYRAKYIIGT+NAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Subjt:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQA
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL ++++N KRK+STK++++   A
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQA

A0A5A7UI18 DNA-(apurinic or apyrimidinic site) lyase; e value 3.6e-166; identity 79%
Query:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK
        M + S  P L M+KR + T PSTPS K   PPP PPTPQLS SK T VS+    + P + LTL       + SNWV LNL+RSDLSLPLTFPTGQTFRWK
Subjt:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK

Query:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN
        QT  L+FTGVVG HLISLN LP   VSYCLH SS+S+S+SAAAR ALLDFLNAGISL ++WEVFSAADP F ALAR LEGARVLRQ PLECL+QFLCSSN
Subjt:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN

Query:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGY-RAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPK
        NNI RITKMVDYISSLGNYLGNVGGFDFHEFP+LERLSLVSEAELREAGFGY RAKYIIGT+NAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPK
Subjt:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGY-RAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPK

Query:  VAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQA
        VAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL ++++N KRK+STK++++   A
Subjt:  VAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQA

A0A5D3CBS3 DNA-(apurinic or apyrimidinic site) lyase; e value 1.5e-167; identity 79.2%
Query:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK
        M + S  P L M+KR + T PSTPS K   PPP PPTPQLS SK T VS+    + P + LTL       + SNWV LNL+RSDLSLPLTFPTGQTFRWK
Subjt:  MHTFSSTPFLQMSKRLRTTPPSTPSAK---PPPPPPTPQLSRSKGTAVSL----RQPKRGLTL-------TLSNWVPLNLSRSDLSLPLTFPTGQTFRWK

Query:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN
        QT  L+FTGVVG HLISLN LP   VSYCLH SS+S+S+SAAAR ALLDFLNAGISL ++WEVFSAADP F ALAR LEGARVLRQ PLECL+QFLCSSN
Subjt:  QTAALQFTGVVGPHLISLNQLPKAHVSYCLH-SSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSN

Query:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV
        NNI RITKMVDYISSLGNYLGNVGGFDFHEFP+LERLSLVSEAELREAGFGYRAKYIIGT+NAL+AKPGGGAEWLLSLRD DLEEVI ALSTLPGVGPKV
Subjt:  NNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKV

Query:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQA
        AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL ++++N KRK+STK++++   A
Subjt:  AACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQA

SwissProt top hits (e value, % identity, alignment)
O08760 N-glycosylase/DNA lyase; e value 9.1e-50; identity 38.81%
Query:  RGLTLTLSNWVPLNLSRSDLSLPLTFPTGQTFRWKQTAALQFTGVVGPHLISLNQLPKAHVSYCL----HSSSSSSASAAARSALLDFLNAGISLRALWE
        R L+ + + W  +   RS+L L L   +GQ+FRWK+ +   ++GV+   + +L Q       YC       S  S  +      L  +    +SL  L+ 
Subjt:  RGLTLTLSNWVPLNLSRSDLSLPLTFPTGQTFRWKQTAALQFTGVVGPHLISLNQLPKAHVSYCL----HSSSSSSASAAARSALLDFLNAGISLRALWE

Query:  VFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVDYI-SSLGNYLGNVGGFDFHEFPTLERLS-LVSEAELREAGFGYRAKYIIGT
         +++ D  F  +A+  +G R+LRQ P ECL  F+CSSNNNIARIT MV+ +  + G  L  +    +H FP L  L+   +E  LR+ G GYRA+Y+  +
Subjt:  VFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVDYI-SSLGNYLGNVGGFDFHEFPTLERLS-LVSEAELREAGFGYRAKYIIGT

Query:  INALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGKY
          A+  + GG A WL  LR    EE   AL TLPGVG KVA C+ L +LD+  A+PVD HVWQIA R     P+ + A+    L N+ +   F + +G Y
Subjt:  INALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAEAFVSKYGKY

Query:  AGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKR
        AGWAQ +LF ADL Q      S    AKRKK +KR
Subjt:  AGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKR

O15527 N-glycosylase/DNA lyase; e value 4.1e-50; identity 37.5%
Query:  RQPKRGLTLTLSNWVPLNLSRSDLSLPLTFPTGQTFRWKQTAALQFTGVVGPHLISLNQL-PKAHVS-YCLHSSSSSSASAAARSALLDFLNAGISLRAL
        R   R L  T + W  +   RS+L L L  P+GQ+FRW++ +   ++GV+   + +L Q   + H + Y    S +S  +     A+  +    ++L  L
Subjt:  RQPKRGLTLTLSNWVPLNLSRSDLSLPLTFPTGQTFRWKQTAALQFTGVVGPHLISLNQL-PKAHVS-YCLHSSSSSSASAAARSALLDFLNAGISLRAL

Query:  WEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVDYI-SSLGNYLGNVGGFDFHEFPTLERLSLVS-EAELREAGFGYRAKYII
        +  + + D  F  +A+  +G R+LRQ P+ECL  F+CSSNNNIARIT MV+ +  + G  L  +    +H FP+L+ L+    EA LR+ G GYRA+Y+ 
Subjt:  WEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVDYI-SSLGNYLGNVGGFDFHEFPTLERLSLVS-EAELREAGFGYRAKYII

Query:  GTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVSKYG
         +  A+  + GG A WL  LR+   EE   AL  LPGVG KVA C+ L +LD+  A+PVD H+W IA R     P  + A+  +P+    +   F S +G
Subjt:  GTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLV--PELAGAR-LTPKLCNRVAEAFVSKYG

Query:  KYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTK
         YAGWAQ +LF ADL Q +    +    AKR+K +K
Subjt:  KYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTK

O70249 N-glycosylase/DNA lyase; e value 2.0e-49; identity 37.21%
Query:  TAVSLRQPKRGLTLTLSNWVPLNLSRSDLSLPLTFPTGQTFRWKQTAALQFTGVVGPHLISLNQLPKAHVSYCL----HSSSSSSASAAARSALLDFLNA
        +++S     R LT + + W  +   RS+L L L   +GQ+FRW++ +   ++GV+   + +L Q       YC             +      L  +   
Subjt:  TAVSLRQPKRGLTLTLSNWVPLNLSRSDLSLPLTFPTGQTFRWKQTAALQFTGVVGPHLISLNQLPKAHVSYCL----HSSSSSSASAAARSALLDFLNA

Query:  GISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVDYI-SSLGNYLGNVGGFDFHEFPTLERLSLVS-EAELREAGFG
         +SL  L+  +++ D  F ++A+  +G R+LRQ P ECL  F+CSSNNNIARIT MV+ +  + G  L  +    +H FP L  L+    E  LR+ G G
Subjt:  GISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVDYI-SSLGNYLGNVGGFDFHEFPTLERLSLVS-EAELREAGFG

Query:  YRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAE
        YRA+Y+  +  A+  + GG A WL  LR    EE   AL TLPGVG KVA C+ L +LD+  A+PVD HVWQIA R     P+ +  +    L N+ +  
Subjt:  YRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYL--VPELAGARLTPKLCNR-VAE

Query:  AFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKR
         F + +G YAGWAQ +LF ADL QQ     S    AKRKK +K+
Subjt:  AFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKR

Q9FNY7 N-glycosylase/DNA lyase OGG1; e value 2.8e-123; identity 65.45%
Query:  MSKRLRTTPPSTPSAKPPP--PPPTPQLSRSKGTAVSLRQPKRGLTLTLSNWVPLNLSRSDLSLPLTFPTGQTFRWKQTAALQFTGVVGPHLISLNQLP-
        M +   T+ PS  S   PP  PP TP L +      +   PK         W PL L+ ++L+LPLTFPTGQTFRWK+T A+Q++G +GPHL+SL Q P 
Subjt:  MSKRLRTTPPSTPSAKPPP--PPPTPQLSRSKGTAVSLRQPKRGLTLTLSNWVPLNLSRSDLSLPLTFPTGQTFRWKQTAALQFTGVVGPHLISLNQLP-

Query:  KAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVDYISSLGNYLGNV
           VSYC+H S+S     +A  ALLDFLNA ISL  LW  FS  DP F  LAR L GARVLRQ PLECL+QFLCSSNNNIARITKMVD++SSLG +LG++
Subjt:  KAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVDYISSLGNYLGNV

Query:  GGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQ
         GF+FH+FP+L+RLS VSE E R+AGFGYRAKYI GT+NAL+AKPGGG EWLLSLR ++L+E + AL TLPGVGPKVAAC+ALFSLDQH AIPVDTHVWQ
Subjt:  GGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQ

Query:  IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQS
        IAT YL+P+LAGA+LTPKL  RVAEAFVSKYG+YAGWAQTLLF+A+LP QK LLQS
Subjt:  IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQS

Q9V3I8 N-glycosylase/DNA lyase; e value 8.8e-45; identity 34.97%
Query:  LNLSRSDLSLPLTFPTGQTFRWKQTA---ALQFTGVVGPHLISLNQLPKAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALA
        + LS  +  L  T   GQ+FRW+        ++ GVV      L Q  ++ ++Y  + +SS  A+    S + D+L     L+   + + + D  F  + 
Subjt:  LNLSRSDLSLPLTFPTGQTFRWKQTA---ALQFTGVVGPHLISLNQLPKAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALA

Query:  RLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVD-YISSLGNYLGNVGGFDFHEFPTLERLSLVS----EAELREAGFGYRAKYIIGTINALEAKPGG
         L +  R+L Q P E +  FLCS NNNI RI+ M++ + ++ G  +G+  G D + FPT+ R   +      A+LR A FGYRAK+I  T+  ++ K  G
Subjt:  RLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVD-YISSLGNYLGNVGGFDFHEFPTLERLSLVS----EAELREAGFGYRAKYIIGTINALEAKPGG

Query:  GAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLFVAD
        G  W +SL+ +  E+  + L+ LPG+G KVA C+ L S+    ++PVD H+++IA  Y +P L G + +T K+   V++ F   +GKYAGWAQ +LF AD
Subjt:  GAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGAR-LTPKLCNRVAEAFVSKYGKYAGWAQTLLFVAD

Query:  LPQQKALLQSSVDNAKRKKSTKRRKE
        L Q     Q++   A +KKS K+ K+
Subjt:  LPQQKALLQSSVDNAKRKKSTKRRKE

Arabidopsis top hits (e value, % identity, alignment)
AT1G21710.1 8-oxoguanine-DNA glycosylase 1; e value 2.0e-124; identity 65.45%
Query:  MSKRLRTTPPSTPSAKPPP--PPPTPQLSRSKGTAVSLRQPKRGLTLTLSNWVPLNLSRSDLSLPLTFPTGQTFRWKQTAALQFTGVVGPHLISLNQLP-
        M +   T+ PS  S   PP  PP TP L +      +   PK         W PL L+ ++L+LPLTFPTGQTFRWK+T A+Q++G +GPHL+SL Q P 
Subjt:  MSKRLRTTPPSTPSAKPPP--PPPTPQLSRSKGTAVSLRQPKRGLTLTLSNWVPLNLSRSDLSLPLTFPTGQTFRWKQTAALQFTGVVGPHLISLNQLP-

Query:  KAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVDYISSLGNYLGNV
           VSYC+H S+S     +A  ALLDFLNA ISL  LW  FS  DP F  LAR L GARVLRQ PLECL+QFLCSSNNNIARITKMVD++SSLG +LG++
Subjt:  KAHVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVDYISSLGNYLGNV

Query:  GGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQ
         GF+FH+FP+L+RLS VSE E R+AGFGYRAKYI GT+NAL+AKPGGG EWLLSLR ++L+E + AL TLPGVGPKVAAC+ALFSLDQH AIPVDTHVWQ
Subjt:  GGFDFHEFPTLERLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQ

Query:  IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQS
        IAT YL+P+LAGA+LTPKL  RVAEAFVSKYG+YAGWAQTLLF+A+LP QK LLQS
Subjt:  IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQS

AT3G47830.1 DNA glycosylase superfamily protein; e value 1.5e-07; identity 47.83%
Query:  LRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRY-LVPELAGARLTPKLCNR
        LR L +EEV   LS   GVGPK  +CV +F+L QH+  PVDTHV++IA     VP+ A    T    NR
Subjt:  LRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRY-LVPELAGARLTPKLCNR
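
The unlabeled middle line of each alignment block appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention holds here, the sketch below counts identities and positives for a single block. The helper name `midline_stats` is hypothetical. The % identity reported for each hit covers the whole alignment, so one block will not reproduce it.

```python
def midline_stats(query, midline):
    """Count identities and positives in one alignment block.

    Assumes the BLAST midline convention: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    # Trailing spaces of the midline are often lost in plain text; pad them back.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives, len(query)

# Final block of the first GenBank hit: query "MNQCE", midline "++QCE".
identical, positives, length = midline_stats("MNQCE", "++QCE")
print(f"{identical}/{length} identical, {positives}/{length} positive")
```

Summing these counts over all blocks of a hit and dividing by the total alignment length gives a percentage comparable to the identity column, provided the convention assumed above applies.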


Sequences
CDS sequence
ATGCACACATTCTCATCCACGCCCTTTCTTCAAATGTCGAAGCGGCTCAGAACAACTCCACCTTCCACTCCCTCCGCCAAGCCACCGCCGCCACCTCCAACTCCACAACT
CTCCCGTTCCAAGGGCACCGCCGTCTCCCTCCGCCAACCCAAAAGAGGCCTAACCCTAACCCTCTCCAACTGGGTCCCTCTCAATCTCTCCAGATCCGACCTCTCTTTGC
CCCTCACTTTCCCCACCGGCCAAACATTCCGGTGGAAACAAACCGCCGCTCTTCAGTTCACCGGCGTCGTCGGCCCCCACCTCATCTCTCTCAACCAACTCCCCAAGGCC
CACGTTTCCTATTGCCTTCACTCTTCTTCTTCTTCTTCTGCCTCCGCCGCCGCCAGATCGGCGTTGCTTGATTTCCTTAACGCCGGCATCTCCCTCCGTGCCCTTTGGGA
GGTTTTCTCCGCCGCTGATCCGACATTCCATGCCTTGGCTCGCCTTTTGGAGGGTGCCCGGGTTCTCAGGCAACACCCACTTGAGTGCTTGGTTCAGTTTTTGTGTTCTT
CCAATAATAATATTGCCAGAATTACCAAAATGGTGGATTATATTTCTTCTCTCGGGAATTATTTGGGCAATGTTGGAGGTTTTGATTTCCATGAATTTCCCACTCTGGAG
AGGTTGTCTTTGGTTTCTGAGGCTGAGCTTAGAGAGGCGGGCTTTGGTTACAGGGCTAAATATATAATTGGCACTATAAATGCTTTAGAAGCCAAACCTGGAGGAGGTGC
AGAGTGGCTTTTGTCTCTTCGTGATTTGGATCTTGAAGAGGTGATTGATGCTCTTTCTACTTTGCCGGGCGTTGGTCCCAAGGTAGCGGCTTGTGTTGCTCTCTTTTCCC
TCGATCAGCACCATGCCATTCCTGTTGACACACATGTTTGGCAGATTGCTACCAGGTACCTTGTCCCTGAGCTTGCTGGTGCACGTCTAACACCAAAGCTTTGCAACCGT
GTGGCCGAGGCATTTGTCAGCAAGTATGGAAAATATGCTGGTTGGGCTCAAACTCTGCTTTTCGTTGCTGATTTACCTCAACAGAAGGCCCTCTTACAATCAAGTGTTGA
CAATGCTAAAAGGAAAAAATCTACAAAGCGACGAAAAGAAAAGGAACAGGCTGGTAATATGAATCAATGTGAATTGCTATGTTAA
mRNA sequence
CAATATTATATCAGAATCCCAGTCTCCAAGCCCCCAGATGCACACATTCTCATCCACGCCCTTTCTTCAAATGTCGAAGCGGCTCAGAACAACTCCACCTTCCACTCCCT
CCGCCAAGCCACCGCCGCCACCTCCAACTCCACAACTCTCCCGTTCCAAGGGCACCGCCGTCTCCCTCCGCCAACCCAAAAGAGGCCTAACCCTAACCCTCTCCAACTGG
GTCCCTCTCAATCTCTCCAGATCCGACCTCTCTTTGCCCCTCACTTTCCCCACCGGCCAAACATTCCGGTGGAAACAAACCGCCGCTCTTCAGTTCACCGGCGTCGTCGG
CCCCCACCTCATCTCTCTCAACCAACTCCCCAAGGCCCACGTTTCCTATTGCCTTCACTCTTCTTCTTCTTCTTCTGCCTCCGCCGCCGCCAGATCGGCGTTGCTTGATT
TCCTTAACGCCGGCATCTCCCTCCGTGCCCTTTGGGAGGTTTTCTCCGCCGCTGATCCGACATTCCATGCCTTGGCTCGCCTTTTGGAGGGTGCCCGGGTTCTCAGGCAA
CACCCACTTGAGTGCTTGGTTCAGTTTTTGTGTTCTTCCAATAATAATATTGCCAGAATTACCAAAATGGTGGATTATATTTCTTCTCTCGGGAATTATTTGGGCAATGT
TGGAGGTTTTGATTTCCATGAATTTCCCACTCTGGAGAGGTTGTCTTTGGTTTCTGAGGCTGAGCTTAGAGAGGCGGGCTTTGGTTACAGGGCTAAATATATAATTGGCA
CTATAAATGCTTTAGAAGCCAAACCTGGAGGAGGTGCAGAGTGGCTTTTGTCTCTTCGTGATTTGGATCTTGAAGAGGTGATTGATGCTCTTTCTACTTTGCCGGGCGTT
GGTCCCAAGGTAGCGGCTTGTGTTGCTCTCTTTTCCCTCGATCAGCACCATGCCATTCCTGTTGACACACATGTTTGGCAGATTGCTACCAGGTACCTTGTCCCTGAGCT
TGCTGGTGCACGTCTAACACCAAAGCTTTGCAACCGTGTGGCCGAGGCATTTGTCAGCAAGTATGGAAAATATGCTGGTTGGGCTCAAACTCTGCTTTTCGTTGCTGATT
TACCTCAACAGAAGGCCCTCTTACAATCAAGTGTTGACAATGCTAAAAGGAAAAAATCTACAAAGCGACGAAAAGAAAAGGAACAGGCTGGTAATATGAATCAATGTGAA
TTGCTATGTTAAGTTGGTTTTAAAATTTTATGTTTAGTTTCCAATATTCCAATTGGCATGGAATTGCCCTTGCTTCGAAAATTTTTTGGGTATGTCACGGCACCTTGATG
TTGTTATCTCCCGGAAGAGAGGAATTCAGAATTTAATACTCACCTTGATGTTGTAGGTATACACTGTTAAGCAACTGCTCTTTATTTTTATGCCGCTAGCACTAAGCAGC
TACTCTTGTTGACTTGTACTAATTGTGATATGATGTGACTTCTTTGAAACATTTCCAATTGTAATAATATTGTAATTCACTAGC
Protein sequence
MHTFSSTPFLQMSKRLRTTPPSTPSAKPPPPPPTPQLSRSKGTAVSLRQPKRGLTLTLSNWVPLNLSRSDLSLPLTFPTGQTFRWKQTAALQFTGVVGPHLISLNQLPKA
HVSYCLHSSSSSSASAAARSALLDFLNAGISLRALWEVFSAADPTFHALARLLEGARVLRQHPLECLVQFLCSSNNNIARITKMVDYISSLGNYLGNVGGFDFHEFPTLE
RLSLVSEAELREAGFGYRAKYIIGTINALEAKPGGGAEWLLSLRDLDLEEVIDALSTLPGVGPKVAACVALFSLDQHHAIPVDTHVWQIATRYLVPELAGARLTPKLCNR
VAEAFVSKYGKYAGWAQTLLFVADLPQQKALLQSSVDNAKRKKSTKRRKEKEQAGNMNQCELLC
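
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: the helper `translate` is hypothetical, it assumes the standard nuclear code applies, and it is run only on the first 30 and last 18 nucleotides of the CDS as listed in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    # Ignore any trailing partial codon.
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt and last 18 nt of the CDS shown above.
print(translate("ATGCACACATTCTCATCCACGCCCTTTCTT"))  # start of the protein
print(translate("TGTGAATTGCTATGTTAA"))              # end of the protein plus stop
```

The two fragments translate to `MHTFSSTPFL` and `CELLC*`, which agree with the start and end of the protein sequence listed above. Passing the full CDS, with line breaks removed, would extend the same check to the whole protein.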