CuGenDBv2

Cucsat.G16838 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G16838
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: formamidopyrimidine-DNA glycosylase isoform X1
Genome location: ctg24:1865579..1870173
RNA-Seq Expression: Cucsat.G16838
Synteny: Cucsat.G16838
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0006979 - response to oxidative stress (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008534 - oxidized purine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0016829 - lyase activity (molecular function)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domains:
IPR010979 - Ribosomal protein S13-like, H2TH
IPR012319 - Formamidopyrimidine-DNA glycosylase, catalytic domain
IPR015886 - DNA glycosylase/AP lyase, H2TH DNA-binding
IPR020629 - Formamidopyrimidine-DNA glycosylase
IPR035937 - MutM-like, N-terminal
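
The genome location field above uses a `contig:start..end` notation. As a minimal sketch of how that string could be parsed and the locus span computed, the Python below assumes the coordinates are 1-based and inclusive (the record itself does not state the convention). The names `Location` and `parse_location` are hypothetical helpers for illustration, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval on a named contig."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("ctg24:1865579..1870173")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive assumption the locus spans 4595 bp; with half-open coordinates the figure would be one less.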


Homology
GenBank top hits (accession, description, e value, % identity)
KAA0044473.1 formamidopyrimidine-DNA glycosylase isoform X2 [Cucumis melo var. makuwa] | e value: 1.67e-263 | % identity: 90.07
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESL+KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEV-------------------------------IEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQSAATLSKESCAALHKSIQEV                               IEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  NQSAATLSKESCAALHKSIQEV-------------------------------IEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKGNDNKKMNDESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKT
        AFVPELQKLTGAEPKNQNSKRKGNDNKKMNDE DGELVSKT+KTADIKQKPKPKGRSKKPSKRKSKSED+DGSDEEAENDDASDDDNGRPEG KK+G KT
Subjt:  AFVPELQKLTGAEPKNQNSKRKGNDNKKMNDESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKT

Query:  NIGQRFDAASEPDKSLKQTVRSSQIGRRRKKAK
        NIGQRFDAASEP+KSLKQTV+SS+ GRRRKKAK
Subjt:  NIGQRFDAASEPDKSLKQTVRSSQIGRRRKKAK

XP_004152179.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucumis sativus] | e value: 1.05e-279 | % identity: 100
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND
        NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND

Query:  ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK
        ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK
Subjt:  ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK

Query:  AK
        AK
Subjt:  AK

XP_008454182.1 PREDICTED: formamidopyrimidine-DNA glycosylase isoform X1 [Cucumis melo] | e value: 2.96e-270 | % identity: 96.77
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESL+KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND
        NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND

Query:  ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK
        E DGELVSKT+KTADIKQKPKPKGRSKKPSKRKSKSED+DGSDEEAENDDASDDDNGRPEG KK+G KTNIGQRFDAASEP+KSLKQTV+SS+ GRRRKK
Subjt:  ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK

Query:  AK
        AK
Subjt:  AK

XP_008454183.1 PREDICTED: formamidopyrimidine-DNA glycosylase isoform X2 [Cucumis melo] | e value: 4.03e-254 | % identity: 92.06
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESL+KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMN
        NQSAATLSKESCAALHKSIQEV+++A+EV A+S+ FP  W+FH R  K+PG+  V+GKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMN
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMN

Query:  DESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRK
        DE DGELVSKT+KTADIKQKPKPKGRSKKPSKRKSKSED+DGSDEEAENDDASDDDNGRPEG KK+G KTNIGQRFDAASEP+KSLKQTV+SS+ GRRRK
Subjt:  DESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRK

Query:  KAK
        KAK
Subjt:  KAK

XP_011652995.1 formamidopyrimidine-DNA glycosylase isoform X2 [Cucumis sativus] | e value: 1.44e-263 | % identity: 95.29
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMN
        NQSAATLSKESCAALHKSIQEV+++A+EV A+S+ FP  W+FH R  K+PG+  V+GKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMN
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMN

Query:  DESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRK
        DESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRK
Subjt:  DESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRK

Query:  KAK
        KAK
Subjt:  KAK

TrEMBL top hits (accession, description, e value, % identity)
A0A0A0KWY6 FPG_CAT domain-containing protein | e value: 5.10e-280 | % identity: 100
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND
        NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND

Query:  ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK
        ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK
Subjt:  ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK

Query:  AK
        AK
Subjt:  AK

A0A1S3BY09 formamidopyrimidine-DNA glycosylase isoform X2 | e value: 1.95e-254 | % identity: 92.06
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESL+KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMN
        NQSAATLSKESCAALHKSIQEV+++A+EV A+S+ FP  W+FH R  K+PG+  V+GKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMN
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMN

Query:  DESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRK
        DE DGELVSKT+KTADIKQKPKPKGRSKKPSKRKSKSED+DGSDEEAENDDASDDDNGRPEG KK+G KTNIGQRFDAASEP+KSLKQTV+SS+ GRRRK
Subjt:  DESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRK

Query:  KAK
        KAK
Subjt:  KAK

A0A1S3BY51 formamidopyrimidine-DNA glycosylase isoform X1 | e value: 1.43e-270 | % identity: 96.77
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESL+KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND
        NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND

Query:  ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK
        E DGELVSKT+KTADIKQKPKPKGRSKKPSKRKSKSED+DGSDEEAENDDASDDDNGRPEG KK+G KTNIGQRFDAASEP+KSLKQTV+SS+ GRRRKK
Subjt:  ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK

Query:  AK
        AK
Subjt:  AK

A0A5A7TLT5 Formamidopyrimidine-DNA glycosylase isoform X2 | e value: 8.09e-264 | % identity: 90.07
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESL+KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEV-------------------------------IEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQSAATLSKESCAALHKSIQEV                               IEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  NQSAATLSKESCAALHKSIQEV-------------------------------IEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKGNDNKKMNDESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKT
        AFVPELQKLTGAEPKNQNSKRKGNDNKKMNDE DGELVSKT+KTADIKQKPKPKGRSKKPSKRKSKSED+DGSDEEAENDDASDDDNGRPEG KK+G KT
Subjt:  AFVPELQKLTGAEPKNQNSKRKGNDNKKMNDESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKT

Query:  NIGQRFDAASEPDKSLKQTVRSSQIGRRRKKAK
        NIGQRFDAASEP+KSLKQTV+SS+ GRRRKKAK
Subjt:  NIGQRFDAASEPDKSLKQTVRSSQIGRRRKKAK

A0A5D3E227 Formamidopyrimidine-DNA glycosylase isoform X1 | e value: 1.43e-270 | % identity: 96.77
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESL+KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND
        NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMND

Query:  ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK
        E DGELVSKT+KTADIKQKPKPKGRSKKPSKRKSKSED+DGSDEEAENDDASDDDNGRPEG KK+G KTNIGQRFDAASEP+KSLKQTV+SS+ GRRRKK
Subjt:  ESDGELVSKTKKTADIKQKPKPKGRSKKPSKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKK

Query:  AK
        AK
Subjt:  AK

SwissProt top hits (accession, description, e value, % identity)
A5UUN1 Formamidopyrimidine-DNA glycosylase | e value: 2.9e-27 | % identity: 34.04
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEV+ A  ++    VG  I +    D T++++  SP +F   L G+ +    R+ K + L LD     A H  M+G++              V   
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        D  P K++   + LDDG  + F D R+F +  LL+        +  G + L     ++   E L+ +K AIK LLLDQ+ I+GIGN  ADE L++ARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSR
         + A+ LS +  AALH  I+  + +AL  G  + R
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSR

A9B0X2 Formamidopyrimidine-DNA glycosylase | e value: 2.9e-27 | % identity: 32.89
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVE  RR++E+  VG+           K++D  SP  F  ++  + I    R+ K+L + LD+      H  M G + +                
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DE   +++   V LD+G +L F D R+F + SL++         +LGP+ L +   LD+F + L +K   IK  LLDQS ++G+GN  ADE L+ A+IHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALE
         +SA +L+    A L ++I+ V+  ++E
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALE

O80358 Formamidopyrimidine-DNA glycosylase | e value: 3.8e-128 | % identity: 63.93
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ S+LGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE--------PKNQNSKRKG
         Q+A++LSKE C ALH SI+EVIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G +        P  +  K K 
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE--------PKNQNSKRKG

Query:  NDNKKMNDESDGELVSKTKKTADIKQKPKPK-GRSKKP-SKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVR
        +D     DE + E   K  ++A  K+  KP+ GR KKP SK K++  DDDG D EAE +        +P+G+   GTK  I ++ +  +      K   R
Subjt:  NDNKKMNDESDGELVSKTKKTADIKQKPKPK-GRSKKP-SKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVR

Query:  SS
         S
Subjt:  SS

Q03GC2 Formamidopyrimidine-DNA glycosylase | e value: 2.9e-27 | % identity: 32.42
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSD--FEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVN
        MPELPEVE  RR +     GK++   V+    +    VSP    F   L GK IL+  R+GK+L +          H  M G            K S+V+
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSD--FEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVN

Query:  DDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLE--DPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQA
          +E+  K+     ELDDG DL + D R+F +++L+   +   V  +  +GP+   E + L+     L+ +K  +K+ LLDQS I+G+GN  ADEVL+ +
Subjt:  DDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLE--DPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQA

Query:  RIHPNQSAATLSKESCAALHKSIQEVIEKALE---------VGAD--SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL
        +IHP Q + TL+ E  A L +SI E ++ A+E         + AD  +  F N    + R+  P +    G  I  I    R + F P  Q L
Subjt:  RIHPNQSAATLSKESCAALHKSIQEVIEKALE---------VGAD--SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL

Q8FP17 Formamidopyrimidine-DNA glycosylase | e value: 1.1e-26 | % identity: 35.54
Query:  MPELPEVEAARRAIEEHCVGK-VIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPA-------FHFGMAGAIYIKGVAVTNY
        MPELPEVE  RR +EEH VG+ ++  AV+   T        ++ EA+L G  + + +R+GK LWL LD     A        H GM+G + +K       
Subjt:  MPELPEVEAARRAIEEHCVGK-VIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPA-------FHFGMAGAIYIKGVAVTNY

Query:  KRSMVNDDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVP-PISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADE
                D   + + +   ELDDG ++ F D+R F    L E    VP  +S +  D L + + +      LK K   IK LLL+Q  +SGIGN  ADE
Subjt:  KRSMVNDDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVP-PISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADE

Query:  VLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADS
        +L++A IHP Q A+ +S     AL ++ +EV+ +AL+ G  S
Subjt:  VLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADS

Arabidopsis top hits (accession, description, e value, % identity)
AT1G52500.1 MUTM homolog-1 | e value: 3.4e-108 | % identity: 73.48
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ S+LGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFIT
         Q+A++LSKE C ALH SI+EVI+ A++V ADS  FP  W+FH R  KK GK  V+GK  H ++
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFIT

AT1G52500.2 MUTM homolog-1 | e value: 2.7e-129 | % identity: 63.93
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ S+LGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE--------PKNQNSKRKG
         Q+A++LSKE C ALH SI+EVIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G +        P  +  K K 
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE--------PKNQNSKRKG

Query:  NDNKKMNDESDGELVSKTKKTADIKQKPKPK-GRSKKP-SKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVR
        +D     DE + E   K  ++A  K+  KP+ GR KKP SK K++  DDDG D EAE +        +P+G+   GTK  I ++ +  +      K   R
Subjt:  NDNKKMNDESDGELVSKTKKTADIKQKPKPK-GRSKKP-SKRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVR

Query:  SS
         S
Subjt:  SS
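
The alignment blocks above follow the usual three-line BLAST layout, in which the unlabeled middle line shows a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch of how one block could be summarised, the Python below counts those symbols. The function name `midline_stats` is hypothetical, and the per-block figures it returns are not expected to reproduce the % identity column, which is reported per hit and may use a different denominator.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, aligned_length) for one alignment block.

    `query` is the residue string from a 'Query:' line and `midline` is the
    match line printed beneath it, both without their leading labels.
    """
    # Trailing spaces on the match line are easily lost in copied text,
    # so pad it back to the query length before counting.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives, len(query)


# Final block of the KAA0044473.1 alignment shown above.
query = "NIGQRFDAASEPDKSLKQTVRSSQIGRRRKKAK"
midline = "NIGQRFDAASEP+KSLKQTV+SS+ GRRRKKAK"
identical, positives, length = midline_stats(query, midline)
print(f"{identical}/{length} identical, {positives}/{length} positives")
```

For this block the count is 29 identical and 32 positive positions out of 33 aligned columns.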


Sequences
CDS sequence
ATGCCGGAGTTACCGGAGGTGGAGGCGGCGAGGAGAGCCATAGAAGAGCATTGCGTCGGGAAAGTAATCAAGAAGGCGGTAATAGCCGACGATACGAAGGTTATCGACGG
CGTATCACCTTCCGATTTTGAGGCTTCGCTCTTAGGCAAAACCATCCTCTCCGCCCATCGTAAGGGCAAGCACCTGTGGCTCTGTCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTCGGGATGGCAGGTGCCATATATATTAAAGGTGTGGCTGTCACAAACTATAAAAGGTCTATGGTTAATGACGATGATGAGTGGCCTTCTAAGTACTCTAAGTTC
TTTGTTGAGCTCGATGATGGTGTAGACCTATCCTTCACTGACAAAAGGCGATTTGCAAAAGTCTCCCTGCTCGAAGATCCGGCTTCAGTGCCCCCAATATCTAAGCTTGG
CCCAGATGCTCTTCTAGAGCCTATGGCATTGGATGAGTTTATTGAATCCCTGAAGAAGAAGAAACTAGCAATTAAGACTTTATTGCTTGATCAGAGTTACATTTCGGGTA
TTGGCAATTGGGTTGCAGATGAAGTGCTTTATCAAGCAAGAATTCATCCAAATCAAAGTGCTGCAACCCTCTCCAAAGAAAGTTGTGCAGCTTTGCATAAGAGCATACAG
GAGGTAATTGAAAAAGCGCTTGAAGTTGGAGCAGATAGTAGTCGGTTCCCTAATAATTGGATTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAA
GGAAATCCACTTTATCACCACAGGTGGCAGGACATCAGCCTTTGTACCTGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAACTCAAAGAGAAAAGGCAACG
ATAACAAGAAAATGAATGATGAGAGCGATGGTGAACTAGTGAGCAAGACAAAGAAAACTGCCGATATTAAGCAGAAGCCAAAGCCTAAAGGTCGCTCTAAGAAACCTTCA
AAAAGAAAATCCAAAAGCGAGGACGATGATGGGTCTGACGAGGAAGCTGAAAATGATGATGCTAGTGATGATGACAATGGTCGCCCTGAAGGAAAGAAGAAAGTGGGAAC
GAAAACAAATATTGGACAAAGGTTTGATGCTGCTTCTGAACCAGATAAGTCTTTGAAGCAAACGGTTCGGAGCAGTCAAATTGGTAGGCGGAGGAAGAAAGCAAAGTAA
mRNA sequence
ATGCCGGAGTTACCGGAGGTGGAGGCGGCGAGGAGAGCCATAGAAGAGCATTGCGTCGGGAAAGTAATCAAGAAGGCGGTAATAGCCGACGATACGAAGGTTATCGACGG
CGTATCACCTTCCGATTTTGAGGCTTCGCTCTTAGGCAAAACCATCCTCTCCGCCCATCGTAAGGGCAAGCACCTGTGGCTCTGTCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTCGGGATGGCAGGTGCCATATATATTAAAGGTGTGGCTGTCACAAACTATAAAAGGTCTATGGTTAATGACGATGATGAGTGGCCTTCTAAGTACTCTAAGTTC
TTTGTTGAGCTCGATGATGGTGTAGACCTATCCTTCACTGACAAAAGGCGATTTGCAAAAGTCTCCCTGCTCGAAGATCCGGCTTCAGTGCCCCCAATATCTAAGCTTGG
CCCAGATGCTCTTCTAGAGCCTATGGCATTGGATGAGTTTATTGAATCCCTGAAGAAGAAGAAACTAGCAATTAAGACTTTATTGCTTGATCAGAGTTACATTTCGGGTA
TTGGCAATTGGGTTGCAGATGAAGTGCTTTATCAAGCAAGAATTCATCCAAATCAAAGTGCTGCAACCCTCTCCAAAGAAAGTTGTGCAGCTTTGCATAAGAGCATACAG
GAGGTAATTGAAAAAGCGCTTGAAGTTGGAGCAGATAGTAGTCGGTTCCCTAATAATTGGATTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAA
GGAAATCCACTTTATCACCACAGGTGGCAGGACATCAGCCTTTGTACCTGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAACTCAAAGAGAAAAGGCAACG
ATAACAAGAAAATGAATGATGAGAGCGATGGTGAACTAGTGAGCAAGACAAAGAAAACTGCCGATATTAAGCAGAAGCCAAAGCCTAAAGGTCGCTCTAAGAAACCTTCA
AAAAGAAAATCCAAAAGCGAGGACGATGATGGGTCTGACGAGGAAGCTGAAAATGATGATGCTAGTGATGATGACAATGGTCGCCCTGAAGGAAAGAAGAAAGTGGGAAC
GAAAACAAATATTGGACAAAGGTTTGATGCTGCTTCTGAACCAGATAAGTCTTTGAAGCAAACGGTTCGGAGCAGTCAAATTGGTAGGCGGAGGAAGAAAGCAAAGTAA
Protein sequence
MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLCLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDDDEWPSKYSKF
FVELDDGVDLSFTDKRRFAKVSLLEDPASVPPISKLGPDALLEPMALDEFIESLKKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQ
EVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKMNDESDGELVSKTKKTADIKQKPKPKGRSKKPS
KRKSKSEDDDGSDEEAENDDASDDDNGRPEGKKKVGTKTNIGQRFDAASEPDKSLKQTVRSSQIGRRRKKAK
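
The protein sequence should correspond to a translation of the CDS sequence listed earlier. As a minimal sketch of how that could be checked, the Python below translates a nucleotide string using the standard genetic code (NCBI translation table 1), which is assumed here to apply to this nuclear gene. The function name `translate` is a hypothetical helper, and the example uses only the first 24 bases and the last 9 bases of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop line breaks and other whitespace, and normalise case.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCGGAGTTACCGGAGGTGGAG"))  # MPELPEVE
print(translate("GCAAAGTAA"))                 # AK
```

Passing the full CDS text, line breaks included, and comparing the result with the joined protein lines would extend the same check to the whole sequence.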