; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G004540 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G004540
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDNA-(apurinic or apyrimidinic site) lyase
Genome locationchr08:12180202..12184266
RNA-Seq ExpressionLsi08G004540
SyntenyLsi08G004540
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006289 - nucleotide-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0034039 - 8-oxo-7,8-dihydroguanine DNA N-glycosylase activity (molecular function)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR012904 - 8-oxoguanine DNA glycosylase, N-terminal
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055472.1 N-glycosylase/DNA lyase OGG1 isoform X2 [Cucumis melo var. makuwa]2.1e-19588.63Show/hide
Query:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ
        MPSLSFKPLLLMTK  +PT PSTPS KPSPPPP    SPPTPQLSHS PTTVSIHHSS N NKTLTLLKSP  S SSSNW+SLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI
        TFRWKQT+PL FTGVVGSHLISLNHLPNG+VSYCLH  STS+S  SSAAARLALLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGY-RAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALAT
        QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGY RAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVI AL+T
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGY-RAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALAT

Query:  LPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTAD
        LPGVGPKVAACVALFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL A 
Subjt:  LPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTAD

Query:  LENTKRKRSTKQQKDKAH-EQD
        LENTKRKRSTKQQ+D AH EQD
Subjt:  LENTKRKRSTKQQKDKAH-EQD

XP_004149809.2 N-glycosylase/DNA lyase OGG1 [Cucumis sativus]1.6e-19588.25Show/hide
Query:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ
        MPSLSFKPLLLMTK L+PTPPSTPS KPSPPPP    SPPTPQLSHS PTTVS+HHSS N NKTL LLKSP  S SSSNW+SLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI
        TFRWKQT+P  FTGVVGSHLISLNHLPNGDVSYCLH  STS    SSAAARLALLDFLNA ISLS+IWEVFSAADPRFD LARH EGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL
        QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIG VNALKAKP GGAEWLLSLRDSDLEEVIEAL+TL
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL

Query:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL
        PGVGPKVAACVALFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+A+LPQQKALL A+L
Subjt:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL

Query:  ENTKRKRSTKQQKDKAH
        ENTKRKRSTKQQKD AH
Subjt:  ENTKRKRSTKQQKDKAH

XP_008466739.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X1 [Cucumis melo]1.1e-19688.97Show/hide
Query:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ
        MPSLSFKPLLLMTK  +PT PSTPS KPSPPPP    SPPTPQLSHS PTTVSIHHSS N NKTLTLLKSP  S SSSNW+SLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI
        TFRWKQT+PL FTGVVGSHLISLNHLPNG+VSYCLH  STS+S  SSAAARLALLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL
        QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVI AL+TL
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL

Query:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL
        PGVGPKVAACVALFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL A L
Subjt:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL

Query:  ENTKRKRSTKQQKDKAH
        ENTKRKRSTKQQ+D AH
Subjt:  ENTKRKRSTKQQKDKAH

XP_016903621.1 PREDICTED: N-glycosylase/DNA lyase OGG1 isoform X2 [Cucumis melo]8.7e-19788.84Show/hide
Query:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ
        MPSLSFKPLLLMTK  +PT PSTPS KPSPPPP    SPPTPQLSHS PTTVSIHHSS N NKTLTLLKSP  S SSSNW+SLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI
        TFRWKQT+PL FTGVVGSHLISLNHLPNG+VSYCLH  STS+S  SSAAARLALLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL
        QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVI AL+TL
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL

Query:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL
        PGVGPKVAACVALFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL A L
Subjt:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL

Query:  ENTKRKRSTKQQKDKAH-EQD
        ENTKRKRSTKQQ+D AH EQD
Subjt:  ENTKRKRSTKQQKDKAH-EQD

XP_038885235.1 N-glycosylase/DNA lyase OGG1 isoform X1 [Benincasa hispida]4.2e-19989.26Show/hide
Query:  MPSLS--FKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPT
        MPSLS  FKP LLMTK LRPTPPSTPSAK  PPP  SP SPPTPQLSHS PTTVS+H+SS N+NKTLT       S SS NW+SLNLT+S+L+LPLTFPT
Subjt:  MPSLS--FKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPT

Query:  GQTFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLEC
        GQTFRWKQTSPL FTGVVGSHLISLNHLPN DVSYCLHSCSTSS   SSAAARLALLDFLNAGISLS+IWEVF AADPRFD LARHLEGARVLRQDPLEC
Subjt:  GQTFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLEC

Query:  LIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALA
        LIQFLCSSNNNIGRITKMVDYISSLGNYLGN+GGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIG VNALKAKPGGGAEWLLSLRD DLEEVIEAL+
Subjt:  LIQFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALA

Query:  TLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTA
        TLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFL+ WDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALL A
Subjt:  TLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTA

Query:  DLENTKRKRSTKQQKDKAH
        +LEN KRKRSTK QKDKAH
Subjt:  DLENTKRKRSTKQQKDKAH

TrEMBL top hitse value%identityAlignment
A0A0A0KIU8 DNA-(apurinic or apyrimidinic site) lyase7.9e-19688.25Show/hide
Query:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ
        MPSLSFKPLLLMTK L+PTPPSTPS KPSPPPP    SPPTPQLSHS PTTVS+HHSS N NKTL LLKSP  S SSSNW+SLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI
        TFRWKQT+P  FTGVVGSHLISLNHLPNGDVSYCLH  STS    SSAAARLALLDFLNA ISLS+IWEVFSAADPRFD LARH EGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL
        QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIG VNALKAKP GGAEWLLSLRDSDLEEVIEAL+TL
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL

Query:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL
        PGVGPKVAACVALFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLF+A+LPQQKALL A+L
Subjt:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL

Query:  ENTKRKRSTKQQKDKAH
        ENTKRKRSTKQQKD AH
Subjt:  ENTKRKRSTKQQKDKAH

A0A1S3CS00 DNA-(apurinic or apyrimidinic site) lyase5.5e-19788.97Show/hide
Query:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ
        MPSLSFKPLLLMTK  +PT PSTPS KPSPPPP    SPPTPQLSHS PTTVSIHHSS N NKTLTLLKSP  S SSSNW+SLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI
        TFRWKQT+PL FTGVVGSHLISLNHLPNG+VSYCLH  STS+S  SSAAARLALLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL
        QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVI AL+TL
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL

Query:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL
        PGVGPKVAACVALFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL A L
Subjt:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL

Query:  ENTKRKRSTKQQKDKAH
        ENTKRKRSTKQQ+D AH
Subjt:  ENTKRKRSTKQQKDKAH

A0A1S4E5V3 DNA-(apurinic or apyrimidinic site) lyase4.2e-19788.84Show/hide
Query:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ
        MPSLSFKPLLLMTK  +PT PSTPS KPSPPPP    SPPTPQLSHS PTTVSIHHSS N NKTLTLLKSP  S SSSNW+SLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI
        TFRWKQT+PL FTGVVGSHLISLNHLPNG+VSYCLH  STS+S  SSAAARLALLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL
        QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVI AL+TL
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL

Query:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL
        PGVGPKVAACVALFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL A L
Subjt:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL

Query:  ENTKRKRSTKQQKDKAH-EQD
        ENTKRKRSTKQQ+D AH EQD
Subjt:  ENTKRKRSTKQQKDKAH-EQD

A0A5A7UI18 DNA-(apurinic or apyrimidinic site) lyase1.0e-19588.63Show/hide
Query:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ
        MPSLSFKPLLLMTK  +PT PSTPS KPSPPPP    SPPTPQLSHS PTTVSIHHSS N NKTLTLLKSP  S SSSNW+SLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI
        TFRWKQT+PL FTGVVGSHLISLNHLPNG+VSYCLH  STS+S  SSAAARLALLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGY-RAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALAT
        QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGY RAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVI AL+T
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGY-RAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALAT

Query:  LPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTAD
        LPGVGPKVAACVALFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL A 
Subjt:  LPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTAD

Query:  LENTKRKRSTKQQKDKAH-EQD
        LENTKRKRSTKQQ+D AH EQD
Subjt:  LENTKRKRSTKQQKDKAH-EQD

A0A5D3CBS3 DNA-(apurinic or apyrimidinic site) lyase4.2e-19788.84Show/hide
Query:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ
        MPSLSFKPLLLMTK  +PT PSTPS KPSPPPP    SPPTPQLSHS PTTVSIHHSS N NKTLTLLKSP  S SSSNW+SLNLTRSDLSLPLTFPTGQ
Subjt:  MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQ

Query:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI
        TFRWKQT+PL FTGVVGSHLISLNHLPNG+VSYCLH  STS+S  SSAAARLALLDFLNAGISLS+IWEVFSAADPRFD LARHLEGARVLRQDPLECLI
Subjt:  TFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLI

Query:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL
        QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDF+EFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVI AL+TL
Subjt:  QFLCSSNNNIGRITKMVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATL

Query:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL
        PGVGPKVAACVALFSLDQHHAIPVDTHVWQ            IATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVA+LPQQKALL A L
Subjt:  PGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADL

Query:  ENTKRKRSTKQQKDKAH-EQD
        ENTKRKRSTKQQ+D AH EQD
Subjt:  ENTKRKRSTKQQKDKAH-EQD

SwissProt top hitse value%identityAlignment
O08760 N-glycosylase/DNA lyase2.6e-5037.28Show/hide
Query:  HHSLSSSN--WISLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYC-LHSCSTSSSSTSSAAARLALLDFLNAGISLSAIW
        H +LSSS   W S+   RS+L L L   +GQ+FRWK+ SP H++GV+   + +L      D  YC ++    S  S  +      L  +    +SL+ ++
Subjt:  HHSLSSSN--WISLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYC-LHSCSTSSSSTSSAAARLALLDFLNAGISLSAIW

Query:  EVFSAADPRFDELARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLS-LVSEAELREAGFGYRAKYIIG
          +++ D  F  +A+  +G R+LRQDP ECL  F+CSSNNNI RIT MV+ +  + G  L  +    ++ FP+L  L+   +E  LR+ G GYRA+Y+  
Subjt:  EVFSAADPRFDELARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLS-LVSEAELREAGFGYRAKYIIG

Query:  TVNALKAKPGGGAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNR-V
        +  A+  + GG A WL  LR +  EE  +AL TLPGVG KVA C+ L +LD+  A+PVD HVWQ+  +    W          P+ + A+    L N+ +
Subjt:  TVNALKAKPGGGAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNR-V

Query:  AEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADLENTKRKRSTKQ
           F + +G YAGWAQ +LF ADL Q      +     KRK+ +K+
Subjt:  AEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADLENTKRKRSTKQ

O15527 N-glycosylase/DNA lyase5.2e-5136.05Show/hide
Query:  HHSLSSSN--WISLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWE
        H +L+S+   W S+   RS+L L L  P+GQ+FRW++ SP H++GV+   + +L       +   ++    S +S  +     A+  +    ++L+ ++ 
Subjt:  HHSLSSSN--WISLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWE

Query:  VFSAADPRFDELARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLSLVS-EAELREAGFGYRAKYIIGT
         + + D  F E+A+  +G R+LRQDP+ECL  F+CSSNNNI RIT MV+ +  + G  L  +    ++ FPSL+ L+    EA LR+ G GYRA+Y+  +
Subjt:  VFSAADPRFDELARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLSLVS-EAELREAGFGYRAKYIIGT

Query:  VNALKAKPGGGAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGAR-LTPKLCNRVA
          A+  + GG A WL  LR+S  EE  +AL  LPGVG KVA C+ L +LD+  A+PVD H+W + ++    W          P  + A+  +P+    + 
Subjt:  VNALKAKPGGGAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGAR-LTPKLCNRVA

Query:  EAFVSKYGKYAGWAQTLLFVADLPQQKALLTADLENTKRKRSTK
          F S +G YAGWAQ +LF ADL Q +    A     KR++ +K
Subjt:  EAFVSKYGKYAGWAQTLLFVADLPQQKALLTADLENTKRKRSTK

O70249 N-glycosylase/DNA lyase2.8e-4936.52Show/hide
Query:  HHSLSSSN--WISLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYC-LHSCSTSSSSTSSAAARLALLDFLNAGISLSAIW
        H +L+SS   W S+   RS+L L L   +GQ+FRW++ SP H++GV+   + +L      D  YC ++          +      L  +    +SL+ ++
Subjt:  HHSLSSSN--WISLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYC-LHSCSTSSSSTSSAAARLALLDFLNAGISLSAIW

Query:  EVFSAADPRFDELARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLSLVS-EAELREAGFGYRAKYIIG
          +++ D  F  +A+  +G R+LRQDP ECL  F+CSSNNNI RIT MV+ +  + G  L  +    ++ FP+L  L+    E  LR+ G GYRA+Y+  
Subjt:  EVFSAADPRFDELARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI-SSLGNYLGNVGGFDFYEFPSLERLSLVS-EAELREAGFGYRAKYIIG

Query:  TVNALKAKPGGGAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVA
        +  A+  + GG A WL  LR +  EE  +AL TLPGVG KVA C+ L +LD+  A+PVD HVWQ+  +    W  K +       LA   L         
Subjt:  TVNALKAKPGGGAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVA

Query:  EAFVSKYGKYAGWAQTLLFVADLPQQKALLTADLENTKRKRSTKQ
          F + +G YAGWAQ +LF ADL QQ     +     KRK+ +K+
Subjt:  EAFVSKYGKYAGWAQTLLFVADLPQQKALLTADLENTKRKRSTKQ

Q9FNY7 N-glycosylase/DNA lyase OGG11.6e-12162.47Show/hide
Query:  RPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVV
        RP P S PS   +  PP SP  P TP                        +LK   H   +  W  L LT ++L+LPLTFPTGQTFRWK+T  + ++G +
Subjt:  RPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVV

Query:  GSHLISLNHLPNGD-VSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLIQFLCSSNNNIGRITK
        G HL+SL   P  D VSYC+H CSTS  S     A LALLDFLNA ISL+ +W  FS  DPRF ELARHL GARVLRQDPLECLIQFLCSSNNNI RITK
Subjt:  GSHLISLNHLPNGD-VSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLIQFLCSSNNNIGRITK

Query:  MVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFS
        MVD++SSLG +LG++ GF+F++FPSL+RLS VSE E R+AGFGYRAKYI GTVNAL+AKPGGG EWLLSLR  +L+E + AL TLPGVGPKVAAC+ALFS
Subjt:  MVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFS

Query:  LDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALL
        LDQH AIPVDTHVWQ            IAT YL+P+LAGA+LTPKL  RVAEAFVSKYG+YAGWAQTLLF+A+LP QK LL
Subjt:  LDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALL

Q9V3I8 N-glycosylase/DNA lyase2.9e-3831.14Show/hide
Query:  LNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELA
        + L+  +  L  T   GQ+FRW+     + T   G  + +   +   + S+  +    +SS  ++      + D+L     L    + + + D  F +  
Subjt:  LNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELA

Query:  RHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVD-YISSLGNYLGNVGGFDFYEFPSLERLSLVS----EAELREAGFGYRAKYIIGTVNALKAKPGG
           +  R+L Q+P E +  FLCS NNNI RI+ M++ + ++ G  +G+  G D Y FP++ R   +      A+LR A FGYRAK+I  T+  ++ K  G
Subjt:  RHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVD-YISSLGNYLGNVGGFDFYEFPSLERLSLVS----EAELREAGFGYRAKYIIGTVNALKAKPGG

Query:  GAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGAR-LTPKLCNRVAEAFVSKYGKY
        G  W +SL+    E+  E L  LPG+G KVA C+ L S+    ++PVD H++            +IA  Y +P L G + +T K+   V++ F   +GKY
Subjt:  GAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGAR-LTPKLCNRVAEAFVSKYGKY

Query:  AGWAQTLLFVADLPQQKALLTADLENTKRKRSTK
        AGWAQ +LF ADL Q +   T   +    K+  K
Subjt:  AGWAQTLLFVADLPQQKALLTADLENTKRKRSTK

Arabidopsis top hitse value%identityAlignment
AT1G21710.1 8-oxoguanine-DNA glycosylase 11.2e-12262.47Show/hide
Query:  RPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVV
        RP P S PS   +  PP SP  P TP                        +LK   H   +  W  L LT ++L+LPLTFPTGQTFRWK+T  + ++G +
Subjt:  RPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQTFRWKQTSPLHFTGVV

Query:  GSHLISLNHLPNGD-VSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLIQFLCSSNNNIGRITK
        G HL+SL   P  D VSYC+H CSTS  S     A LALLDFLNA ISL+ +W  FS  DPRF ELARHL GARVLRQDPLECLIQFLCSSNNNI RITK
Subjt:  GSHLISLNHLPNGD-VSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLIQFLCSSNNNIGRITK

Query:  MVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFS
        MVD++SSLG +LG++ GF+F++FPSL+RLS VSE E R+AGFGYRAKYI GTVNAL+AKPGGG EWLLSLR  +L+E + AL TLPGVGPKVAAC+ALFS
Subjt:  MVDYISSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFS

Query:  LDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALL
        LDQH AIPVDTHVWQ            IAT YL+P+LAGA+LTPKL  RVAEAFVSKYG+YAGWAQTLLF+A+LP QK LL
Subjt:  LDQHHAIPVDTHVWQLIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALL

AT3G47830.1 DNA glycosylase superfamily protein8.9e-0643.55Show/hide
Query:  LRDSDLEEVIEALATLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATR
        LR   +EEV   L+   GVGPK  +CV +F+L QH+  PVDTHV+++ +   + W  K A R
Subjt:  LRDSDLEEVIEALATLPGVGPKVAACVALFSLDQHHAIPVDTHVWQLIEKFLVVWDEKIATR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTCATTGTCATTCAAACCTCTTCTTCTAATGACGAAGAGCCTCAGACCCACTCCACCCTCCACTCCCTCCGCCAAGCCATCGCCACCGCCACCGTCATCGCCGCG
GTCTCCTCCGACCCCTCAGCTCTCCCATTCAAACCCAACCACCGTCTCCATCCACCACTCATCCAACAACCAAAACAAAACCCTAACCCTCCTAAAATCCCCCCACCATT
CCCTATCTTCCTCCAATTGGATCTCTCTAAATCTCACCAGATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCCAAACCTTCCGCTGGAAACAAACCAGCCCCCTT
CACTTCACCGGCGTCGTTGGCTCTCATCTTATCTCTCTCAACCATCTTCCAAACGGCGACGTTTCATATTGCCTTCACTCTTGTTCTACATCCTCCTCCTCCACCTCCTC
CGCCGCCGCCAGATTGGCCTTGCTTGATTTCCTTAACGCCGGCATCTCCCTGAGTGCCATTTGGGAGGTTTTCTCGGCGGCTGATCCGAGATTCGATGAGTTGGCTCGCC
ATTTGGAGGGGGCTCGAGTTCTCAGGCAAGACCCACTTGAATGTTTGATTCAGTTTTTGTGTTCTTCCAATAATAATATTGGGAGAATCACTAAAATGGTGGATTATATC
TCATCACTTGGGAATTATTTGGGTAATGTTGGAGGTTTTGATTTCTATGAATTCCCCTCTTTGGAGAGGCTGTCCTTGGTCTCTGAGGCTGAGCTTAGAGAAGCAGGCTT
TGGTTACAGGGCTAAATACATAATTGGCACTGTAAATGCACTAAAGGCCAAACCTGGGGGAGGTGCAGAATGGCTTCTGTCCCTTCGTGATTCGGATCTTGAAGAGGTGA
TTGAAGCCCTTGCTACATTACCGGGCGTAGGTCCGAAGGTAGCAGCTTGTGTTGCTCTCTTCTCTCTCGATCAGCACCACGCCATTCCTGTTGACACGCATGTCTGGCAG
TTGATTGAAAAGTTTCTTGTTGTGTGGGATGAAAAGATTGCTACTAGGTACCTTGTCCCCGAGCTTGCTGGTGCACGTCTAACGCCAAAGCTATGCAATCGTGTGGCTGA
GGCATTTGTCAGCAAATATGGAAAATATGCTGGTTGGGCTCAAACTCTGCTTTTCGTCGCTGATTTGCCTCAACAGAAGGCCCTCTTAACTGCAGATCTTGAGAATACCA
AAAGAAAAAGATCTACAAAGCAGCAGAAGGATAAAGCACATGAGCAAGATCCGTAG
mRNA sequenceShow/hide mRNA sequence
TTTGAACGGAAGCACCACTATAAACCCTCCAAATGCCTTCATTGTCATTCAAACCTCTTCTTCTAATGACGAAGAGCCTCAGACCCACTCCACCCTCCACTCCCTCCGCC
AAGCCATCGCCACCGCCACCGTCATCGCCGCGGTCTCCTCCGACCCCTCAGCTCTCCCATTCAAACCCAACCACCGTCTCCATCCACCACTCATCCAACAACCAAAACAA
AACCCTAACCCTCCTAAAATCCCCCCACCATTCCCTATCTTCCTCCAATTGGATCTCTCTAAATCTCACCAGATCAGACCTCTCTTTGCCTCTCACTTTCCCCACCGGCC
AAACCTTCCGCTGGAAACAAACCAGCCCCCTTCACTTCACCGGCGTCGTTGGCTCTCATCTTATCTCTCTCAACCATCTTCCAAACGGCGACGTTTCATATTGCCTTCAC
TCTTGTTCTACATCCTCCTCCTCCACCTCCTCCGCCGCCGCCAGATTGGCCTTGCTTGATTTCCTTAACGCCGGCATCTCCCTGAGTGCCATTTGGGAGGTTTTCTCGGC
GGCTGATCCGAGATTCGATGAGTTGGCTCGCCATTTGGAGGGGGCTCGAGTTCTCAGGCAAGACCCACTTGAATGTTTGATTCAGTTTTTGTGTTCTTCCAATAATAATA
TTGGGAGAATCACTAAAATGGTGGATTATATCTCATCACTTGGGAATTATTTGGGTAATGTTGGAGGTTTTGATTTCTATGAATTCCCCTCTTTGGAGAGGCTGTCCTTG
GTCTCTGAGGCTGAGCTTAGAGAAGCAGGCTTTGGTTACAGGGCTAAATACATAATTGGCACTGTAAATGCACTAAAGGCCAAACCTGGGGGAGGTGCAGAATGGCTTCT
GTCCCTTCGTGATTCGGATCTTGAAGAGGTGATTGAAGCCCTTGCTACATTACCGGGCGTAGGTCCGAAGGTAGCAGCTTGTGTTGCTCTCTTCTCTCTCGATCAGCACC
ACGCCATTCCTGTTGACACGCATGTCTGGCAGTTGATTGAAAAGTTTCTTGTTGTGTGGGATGAAAAGATTGCTACTAGGTACCTTGTCCCCGAGCTTGCTGGTGCACGT
CTAACGCCAAAGCTATGCAATCGTGTGGCTGAGGCATTTGTCAGCAAATATGGAAAATATGCTGGTTGGGCTCAAACTCTGCTTTTCGTCGCTGATTTGCCTCAACAGAA
GGCCCTCTTAACTGCAGATCTTGAGAATACCAAAAGAAAAAGATCTACAAAGCAGCAGAAGGATAAAGCACATGAGCAAGATCCGTAGGCATTACAGTTTCAAATATTTC
GATGCAAGAGGTTCATTCAAAGTACAAACTGCTGGAGTATAAATTATTTGATGTGATGTGATTACTGCTGAAGTTTGTTTGCTTTTGAATGAGGTGTATAAATTTTCAAT
ATTCCACTTGGCATAGCTATTGCTTAAATGTTTCGGGTCTGCTTTGGCTCATTGATGTAATCTCTACTTTTGGGGAAGTCCTACATTCCATCTTGTATGTAGCTTTCTTT
GAACCATTTCAATTGTAATATGGTAGGAAAATTCACTAGCAAAAAGAATTAATTCTGTGACAAGAGAACTGTGGTTGCATTTCATTGTTGACACTTGACATGAATCAATT
AATAATTGAAGTATTGCGTAAATGTTTCTAGGACTCCTAAAAGAATGAGATAAGCTGTGGTAATGGAGATGATTATTGCTAAAGAGATATCTATTGAAAGCTATGTGAAT
TATCATGACATGTACTTCATGCTTTGCGTCTTCCATCATTTATGTGCATTGTGCATCGACCAACAATGTTTTGGTGACGTGTGGACATGGATCGCCTCTAAAGTTCAGAT
TTGAGATTGGAATCGAGCATTTGGCCCTCCCTAGACACGCCTGAAACATCACAACACAATTTTCATGAAAAAACTCACATGCTCTACAATGGCTCTGGAGTTTGGCGAGT
GACACATTCCAATAGCGTAGCTTTGACAGTCACCGGACGGGGTCCCAAAGCTTGCAAATAAGATGTTGGAGATGTTCTTGTTTGTAGGGCAGCTTAATCGAACCTTAGGC
CTTCTGGTTCTGTTCTTTGTACCACTCGCCGTCTGTTTCTTTGCACCCATCCATGAAGCTACTAAAGGATAATGTGATTCAGATACTTGCCCACATGTTTTGGTAATTGA
AACAGAATCCAAAGATATCCCAATCGGGTTTCCTGTTTCTTCTTCAAGAATAACCAACTGGTTTCCAGCTGGCTTGAGGAAGGAACGTGGTACATTATACCTGGATTGAA
AGGAGAGACAATTGTGTTATCGATACAGGAAACTAGTGGAAGGAAATTTGAAAGATAAGGTATAATATTAAGCTGAGTAAACTGTTTACCATTTCTGTGAAGGCTCCCCT
TTTGGGGTGAGGAAGGAGACCCAGTACCGGCCAATGCCCCAGCCGTTAACCCACGCTGCACCCTTCCCCATTGAACCAAGATTCAGTGCAATGGGGTCATCACCAGGAGG
CGCATCAAACTGAGTCTGTTCAATTAGCGATGAATGACAATGGCAAAGGTCAGTTAATGAACATTTGAGTTCAAGATAGTTGCTGTAGAAAGTAGTCTAAACAATTGATT
ATCGAAAGTACCTTGTACCATGTGAGCGGCTGAGAAGAGTTTCCTAACCTGCTCCACTGAACATCGCTTGACCCCGTGTCTAAAAATATTTGTGATTGCTCTCCTGATAG
GCCAACCTTTGCAAATTTGTTTGGAAAAATCAACAAAAATATAAAAAAGAGGTTAGTTTTAAGTATGTGTTTGTGCATGTGTATTTTTTTATGAATATTACACCAGCTTA
AGTGTTTATGTACTCATATTAGGAGTTTAGAAATTAAAGAACCTTGTATCCCCAAGGTTGTTCCGAGAAATCCTCGCCTTGAATTCTCACTCTTCGTAGTCCAGCAACTC
TGGTCTCAAGAAATGCTCCAGAATCCTTTTTCAAATTAAAACTATCAGAATTCAC
Protein sequenceShow/hide protein sequence
MPSLSFKPLLLMTKSLRPTPPSTPSAKPSPPPPSSPRSPPTPQLSHSNPTTVSIHHSSNNQNKTLTLLKSPHHSLSSSNWISLNLTRSDLSLPLTFPTGQTFRWKQTSPL
HFTGVVGSHLISLNHLPNGDVSYCLHSCSTSSSSTSSAAARLALLDFLNAGISLSAIWEVFSAADPRFDELARHLEGARVLRQDPLECLIQFLCSSNNNIGRITKMVDYI
SSLGNYLGNVGGFDFYEFPSLERLSLVSEAELREAGFGYRAKYIIGTVNALKAKPGGGAEWLLSLRDSDLEEVIEALATLPGVGPKVAACVALFSLDQHHAIPVDTHVWQ
LIEKFLVVWDEKIATRYLVPELAGARLTPKLCNRVAEAFVSKYGKYAGWAQTLLFVADLPQQKALLTADLENTKRKRSTKQQKDKAHEQDP