; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0003402 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0003402
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Descriptionformamidopyrimidine-DNA glycosylase isoform X1
Genome locationchr07:1564901..1569497
RNA-Seq ExpressionPI0003402
SyntenyPI0003402
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008534 - oxidized purine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0016829 - lyase activity (molecular function)
InterPro domainsIPR010979 - Ribosomal protein S13-like, H2TH
IPR012319 - Formamidopyrimidine-DNA glycosylase, catalytic domain
IPR015886 - DNA glycosylase/AP lyase, H2TH DNA-binding
IPR020629 - Formamidopyrimidine-DNA glycosylase
IPR035937 - MutM-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044473.1 formamidopyrimidine-DNA glycosylase isoform X2 [Cucumis melo var. makuwa]5.7e-20689.63Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQE-------------------------------VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQ AATLSKESCAALHKSIQE                               VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  NQGAATLSKESCAALHKSIQE-------------------------------VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKGNDNKKLNDEGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKK
        AFVPELQKLTGAEPKNQNSKRKGNDNKK+NDEGDGELVSKTEKTA+IKQK KPKGRSKKPS KRKSKSED+DGSDE AENDDASDDDNGRPEG KK+GKK
Subjt:  AFVPELQKLTGAEPKNQNSKRKGNDNKKLNDEGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKK

Query:  TNIGQRFDAASEPEKSLKETVRSSQNGRRRKKAK
        TNIGQRFDAASEPEKSLK+TV+SS+NGRRRKKAK
Subjt:  TNIGQRFDAASEPEKSLKETVRSSQNGRRRKKAK

XP_004152179.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucumis sativus]1.6e-20896.03Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLND
        NQ AATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKK+ND
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLND

Query:  EGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRRK
        E DGELVSKT+KTA+IKQK KPKGRSKKPS KRKSKSEDDDGSDE AENDDASDDDNGRPEG+KKVG KTNIGQRFDAASEP+KSLK+TVRSSQ GRRRK
Subjt:  EGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRRK

Query:  KAK
        KAK
Subjt:  KAK

XP_008454182.1 PREDICTED: formamidopyrimidine-DNA glycosylase isoform X1 [Cucumis melo]1.0e-21096.28Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLND
        NQ AATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKK+ND
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLND

Query:  EGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRRK
        EGDGELVSKTEKTA+IKQK KPKGRSKKPS KRKSKSED+DGSDE AENDDASDDDNGRPEG KK+GKKTNIGQRFDAASEPEKSLK+TV+SS+NGRRRK
Subjt:  EGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRRK

Query:  KAK
        KAK
Subjt:  KAK

XP_008454183.1 PREDICTED: formamidopyrimidine-DNA glycosylase isoform X2 [Cucumis melo]2.0e-19891.58Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLN
        NQ AATLSKESCAALHKSIQEV+++A+EV A+S+ FP  W+FH R  K+PG+  V+GKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKK+N
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLN

Query:  DEGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRR
        DEGDGELVSKTEKTA+IKQK KPKGRSKKPS KRKSKSED+DGSDE AENDDASDDDNGRPEG KK+GKKTNIGQRFDAASEPEKSLK+TV+SS+NGRRR
Subjt:  DEGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRR

Query:  KKAK
        KKAK
Subjt:  KKAK

XP_011652995.1 formamidopyrimidine-DNA glycosylase isoform X2 [Cucumis sativus]3.2e-19691.34Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLN
        NQ AATLSKESCAALHKSIQEV+++A+EV A+S+ FP  W+FH R  K+PG+  V+GKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKK+N
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLN

Query:  DEGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRR
        DE DGELVSKT+KTA+IKQK KPKGRSKKPS KRKSKSEDDDGSDE AENDDASDDDNGRPEG+KKVG KTNIGQRFDAASEP+KSLK+TVRSSQ GRRR
Subjt:  DEGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRR

Query:  KKAK
        KKAK
Subjt:  KKAK

TrEMBL top hitse value%identityAlignment
A0A0A0KWY6 FPG_CAT domain-containing protein7.8e-20996.03Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLND
        NQ AATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKK+ND
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLND

Query:  EGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRRK
        E DGELVSKT+KTA+IKQK KPKGRSKKPS KRKSKSEDDDGSDE AENDDASDDDNGRPEG+KKVG KTNIGQRFDAASEP+KSLK+TVRSSQ GRRRK
Subjt:  EGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRRK

Query:  KAK
        KAK
Subjt:  KAK

A0A1S3BY09 formamidopyrimidine-DNA glycosylase isoform X29.6e-19991.58Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLN
        NQ AATLSKESCAALHKSIQEV+++A+EV A+S+ FP  W+FH R  K+PG+  V+GKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKK+N
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLN

Query:  DEGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRR
        DEGDGELVSKTEKTA+IKQK KPKGRSKKPS KRKSKSED+DGSDE AENDDASDDDNGRPEG KK+GKKTNIGQRFDAASEPEKSLK+TV+SS+NGRRR
Subjt:  DEGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRR

Query:  KKAK
        KKAK
Subjt:  KKAK

A0A1S3BY51 formamidopyrimidine-DNA glycosylase isoform X14.9e-21196.28Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLND
        NQ AATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKK+ND
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLND

Query:  EGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRRK
        EGDGELVSKTEKTA+IKQK KPKGRSKKPS KRKSKSED+DGSDE AENDDASDDDNGRPEG KK+GKKTNIGQRFDAASEPEKSLK+TV+SS+NGRRRK
Subjt:  EGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRRK

Query:  KAK
        KAK
Subjt:  KAK

A0A5A7TLT5 Formamidopyrimidine-DNA glycosylase isoform X22.8e-20689.63Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQE-------------------------------VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQ AATLSKESCAALHKSIQE                               VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  NQGAATLSKESCAALHKSIQE-------------------------------VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKGNDNKKLNDEGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKK
        AFVPELQKLTGAEPKNQNSKRKGNDNKK+NDEGDGELVSKTEKTA+IKQK KPKGRSKKPS KRKSKSED+DGSDE AENDDASDDDNGRPEG KK+GKK
Subjt:  AFVPELQKLTGAEPKNQNSKRKGNDNKKLNDEGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKK

Query:  TNIGQRFDAASEPEKSLKETVRSSQNGRRRKKAK
        TNIGQRFDAASEPEKSLK+TV+SS+NGRRRKKAK
Subjt:  TNIGQRFDAASEPEKSLKETVRSSQNGRRRKKAK

A0A5D3E227 Formamidopyrimidine-DNA glycosylase isoform X14.9e-21196.28Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLND
        NQ AATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKK+ND
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLND

Query:  EGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRRK
        EGDGELVSKTEKTA+IKQK KPKGRSKKPS KRKSKSED+DGSDE AENDDASDDDNGRPEG KK+GKKTNIGQRFDAASEPEKSLK+TV+SS+NGRRRK
Subjt:  EGDGELVSKTEKTANIKQKSKPKGRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRRK

Query:  KAK
        KAK
Subjt:  KAK

SwissProt top hitse value%identityAlignment
A5UUN1 Formamidopyrimidine-DNA glycosylase1.9e-2634.04Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEV+ A  ++    VG  I +    D T++++  SP +F   L G+ +    R+ K + L LD     A H  M+G++              V   
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        D  P K++   + LDDG  + F D R+F +  LL         +  G + L     ++   E L  +K AIK LLLDQ+ I+GIGN  ADE L++ARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSR
         + A+ LS +  AALH  I+  + +AL  G  + R
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSR

A9B0X2 Formamidopyrimidine-DNA glycosylase2.9e-2732.46Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVE  RR++E+  VG+           K++D  SP  F  ++  + I    R+ K+L + LD+      H  M G + +                
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DE   +++   V LD+G +L F D R+F + SL+          +LGP+ L +   LD+F + L +K   IK  LLDQS ++G+GN  ADE L+ A+IHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALE
         + A +L+    A L ++I+ V+  ++E
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALE

B0TER7 Formamidopyrimidine-DNA glycosylase1.4e-2631.03Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVE  RR++     G  I+K  +    K+   +  + F  +L G+ I+   R+GK+L L LD       H  M G +         + R    ++
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASV--PPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI
         E    ++ FF  LDDG  L +TD R+F  ++L+   A++  P   +LGP+ L +  +  +F  +L K+K  +K LLLDQS+++G+GN  ADE L +AR+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASV--PPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI

Query:  HPNQGAATLSKESCAALHKSIQEVIEKALEVGADSSR-----------FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQK
        HP++ A +L  E    L+  I+ V+++ ++    S R           F      + R   P +    G EI      GR++ F P  QK
Subjt:  HPNQGAATLSKESCAALHKSIQEVIEKALEVGADSSR-----------FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQK

O80358 Formamidopyrimidine-DNA glycosylase1.6e-12966.49Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ S+LGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDN-KKLN
         Q A++LSKE C ALH SI+EVIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G + +     R      K   
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDN-KKLN

Query:  DEGDGE----LVSKTEKTANIKQKSKPK-GRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNG-RPEGEKKVGKK
        D+GDGE       K +++A  K+  KP+ GR KKP+ K K++  DDDG D  AE +       G +P  ++K  +K
Subjt:  DEGDGE----LVSKTEKTANIKQKSKPK-GRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNG-RPEGEKKVGKK

Q03GC2 Formamidopyrimidine-DNA glycosylase6.5e-2732.42Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSD--FEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVN
        MPELPEVE  RR +     GK++   V+    +    VSP    F   L GK IL+  R+GK+L +          H  M G            K S+V+
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSD--FEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVN

Query:  DDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLK--DPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQA
          +E+  K+     ELDDG DL + D R+F +++L+   +   V  +  +GP+   E + L+     L  +K  +K+ LLDQS I+G+GN  ADEVL+ +
Subjt:  DDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLK--DPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQA

Query:  RIHPNQGAATLSKESCAALHKSIQEVIEKALE---------VGAD--SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL
        +IHP Q + TL+ E  A L +SI E ++ A+E         + AD  +  F N    + R+  P +    G  I  I    R + F P  Q L
Subjt:  RIHPNQGAATLSKESCAALHKSIQEVIEKALE---------VGAD--SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL

Arabidopsis top hitse value%identityAlignment
AT1G52500.1 MUTM homolog-11.2e-10873.48Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ S+LGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFIT
         Q A++LSKE C ALH SI+EVI+ A++V ADS  FP  W+FH R  KK GK  V+GK  H ++
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFIT

AT1G52500.2 MUTM homolog-11.1e-13066.49Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ S+LGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDN-KKLN
         Q A++LSKE C ALH SI+EVIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G + +     R      K   
Subjt:  NQGAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDN-KKLN

Query:  DEGDGE----LVSKTEKTANIKQKSKPK-GRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNG-RPEGEKKVGKK
        D+GDGE       K +++A  K+  KP+ GR KKP+ K K++  DDDG D  AE +       G +P  ++K  +K
Subjt:  DEGDGE----LVSKTEKTANIKQKSKPK-GRSKKPSPKRKSKSEDDDGSDEAAENDDASDDDNG-RPEGEKKVGKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGAGTTACCGGAGGTGGAGGCGGCGAGGAGAGCCATAGAAGAGCACTGCGTCGGGAAAGTAATTAAGAAGGCGGTAATAGCCGACGATACGAAGGTCATCGACGG
CGTATCACCTTCCGATTTCGAGGCTTCGCTCTTAGGCAAAACCATCCTCTCCGCCCATCGTAAGGGCAAGCACCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTCGGGATGGCAGGTGCAATATATATTAAAGGCGTGGCTGTCACAAACTATAAAAGGTCTATGGTTAATGATGACGACGAGTGGCCTTCCAAGTACTCTAAGTTC
TTTGTTGAGCTTGATGATGGTGTAGACCTATCCTTCACTGACAAAAGGCGGTTTGCAAAAGTCTCCTTGCTAAAAGATCCGGCTTCTGTGCCCCCAATATCTAAGCTTGG
CCCAGATGCTCTTCTAGAGCCTATGGCATTGGATGAGTTTATCGAATCCCTGGGGAAGAAGAAACTAGCAATTAAGACTTTATTGCTTGATCAGAGCTACATTTCGGGTA
TTGGCAATTGGGTTGCAGATGAAGTGCTTTATCAAGCAAGAATTCATCCAAATCAAGGTGCTGCAACCCTATCCAAAGAAAGTTGTGCAGCTTTGCATAAGAGCATACAG
GAGGTAATTGAAAAAGCGCTTGAAGTTGGAGCAGATAGTAGTCGGTTCCCTAATAATTGGATTTTCCATTCACGTGAAAAGAAGCCTGGGAAGGCTTTTGTTGATGGTAA
GGAAATCCACTTTATCACCACAGGTGGCAGGACATCAGCCTTTGTACCTGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAACTCAAAGAGAAAAGGCAACG
ATAACAAAAAACTGAATGATGAGGGCGATGGTGAACTAGTGAGCAAGACAGAGAAAACTGCCAATATTAAGCAAAAGTCAAAGCCTAAAGGTCGCTCTAAGAAACCTTCA
CCAAAAAGAAAATCCAAAAGCGAGGACGATGATGGGTCTGACGAGGCAGCTGAAAATGATGATGCTAGTGATGATGACAATGGTCGCCCTGAAGGAGAGAAGAAAGTGGG
AAAGAAAACAAACATTGGGCAAAGGTTTGATGCTGCTTCTGAACCAGAGAAGTCTTTGAAGGAAACGGTTCGGAGCAGTCAAAATGGTAGGCGGAGGAAGAAAGCAAAGT
AA
mRNA sequenceShow/hide mRNA sequence
TCTATCAGTCCTACTTCGTTCCTCTCAAGCACCTCCAAGAGAACAGCGCGCCAAAAGTATCGTTCCGTGTTCCGACCCCCACAACATGCCGGAGTTACCGGAGGTGGAGG
CGGCGAGGAGAGCCATAGAAGAGCACTGCGTCGGGAAAGTAATTAAGAAGGCGGTAATAGCCGACGATACGAAGGTCATCGACGGCGTATCACCTTCCGATTTCGAGGCT
TCGCTCTTAGGCAAAACCATCCTCTCCGCCCATCGTAAGGGCAAGCACCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCATTTCACTTCGGGATGGCAGGTGCAAT
ATATATTAAAGGCGTGGCTGTCACAAACTATAAAAGGTCTATGGTTAATGATGACGACGAGTGGCCTTCCAAGTACTCTAAGTTCTTTGTTGAGCTTGATGATGGTGTAG
ACCTATCCTTCACTGACAAAAGGCGGTTTGCAAAAGTCTCCTTGCTAAAAGATCCGGCTTCTGTGCCCCCAATATCTAAGCTTGGCCCAGATGCTCTTCTAGAGCCTATG
GCATTGGATGAGTTTATCGAATCCCTGGGGAAGAAGAAACTAGCAATTAAGACTTTATTGCTTGATCAGAGCTACATTTCGGGTATTGGCAATTGGGTTGCAGATGAAGT
GCTTTATCAAGCAAGAATTCATCCAAATCAAGGTGCTGCAACCCTATCCAAAGAAAGTTGTGCAGCTTTGCATAAGAGCATACAGGAGGTAATTGAAAAAGCGCTTGAAG
TTGGAGCAGATAGTAGTCGGTTCCCTAATAATTGGATTTTCCATTCACGTGAAAAGAAGCCTGGGAAGGCTTTTGTTGATGGTAAGGAAATCCACTTTATCACCACAGGT
GGCAGGACATCAGCCTTTGTACCTGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAACTCAAAGAGAAAAGGCAACGATAACAAAAAACTGAATGATGAGGG
CGATGGTGAACTAGTGAGCAAGACAGAGAAAACTGCCAATATTAAGCAAAAGTCAAAGCCTAAAGGTCGCTCTAAGAAACCTTCACCAAAAAGAAAATCCAAAAGCGAGG
ACGATGATGGGTCTGACGAGGCAGCTGAAAATGATGATGCTAGTGATGATGACAATGGTCGCCCTGAAGGAGAGAAGAAAGTGGGAAAGAAAACAAACATTGGGCAAAGG
TTTGATGCTGCTTCTGAACCAGAGAAGTCTTTGAAGGAAACGGTTCGGAGCAGTCAAAATGGTAGGCGGAGGAAGAAAGCAAAGTAAGTTTATTTCCTAAGACATCGTAT
GTAGTATTTTGTTTTGTTGTTTTTCTTTTGGGGGGGTTAACTATGAGAACATCTATCTATAGAAGAACTTGCCCTTTTGTCATTACTGGTAATCCTGGCCATAGATGTGG
GGGAATGACTTTTGTATCATCCGGTCATTTTGAAGAATGTTGATATGAAGAATGGAGATGTTCAGCTCGTTTTTTTCTTTCTACATGCACAGTTGCAACTCACATCGTTT
ATATATATATATAGATATAGATATTCGTGAA
Protein sequenceShow/hide protein sequence
MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFEASLLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDDDEWPSKYSKF
FVELDDGVDLSFTDKRRFAKVSLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQGAATLSKESCAALHKSIQ
EVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKGNDNKKLNDEGDGELVSKTEKTANIKQKSKPKGRSKKPS
PKRKSKSEDDDGSDEAAENDDASDDDNGRPEGEKKVGKKTNIGQRFDAASEPEKSLKETVRSSQNGRRRKKAK