; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy02g005200 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy02g005200
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Descriptionformamidopyrimidine-DNA glycosylase isoform X1
Genome locationChr02:6776232..6780474
RNA-Seq ExpressionLcy02g005200
SyntenyLcy02g005200
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008534 - oxidized purine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0016829 - lyase activity (molecular function)
InterPro domainsIPR010979 - Ribosomal protein S13-like, H2TH
IPR012319 - Formamidopyrimidine-DNA glycosylase, catalytic domain
IPR015886 - DNA glycosylase/AP lyase, H2TH DNA-binding
IPR020629 - Formamidopyrimidine-DNA glycosylase
IPR035937 - MutM-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153476.1 formamidopyrimidine-DNA glycosylase isoform X3 [Momordica charantia]2.0e-19087.65Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV K+IKKA+IADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAI+IKGVAVTNYKRSMV DD
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDP SVPPISKLGPDALLEPM L+ F ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND
        +QSAATLSKE+CA LHKCIQEVIEKALEVGADSS+FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL GAEPK QNSKRK++  KQM+D
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND

Query:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQ
        E  GEL SKTK+TADT DTK KPKP GRSKK  TKRKS+S E D SDEE ENDDA  +DDGH VGKKK G+K N+GRIRDASE +KS KQTVQS  NG Q
Subjt:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQ

Query:  RKKSK
        RKK+K
Subjt:  RKKSK

XP_022949541.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita moschata]2.8e-19287.29Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDG+SPSDFEASL+GKTILSAHRKGKH+W+RLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMAL++F ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND
        NQSAATLSKE+CAALHK IQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+N+ K+MND
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND

Query:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAE---KSSKQTVQSSR
        E  GE  SKTKKTADT DTKTK KPKG SKK +TKRKS+ +EDDGSDEEAENDDASD++D  H +GK K G++ NVGR+ DASE+E   K SKQTV SSR
Subjt:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAE---KSSKQTVQSSR

Query:  NGGQRKKSK
        +G QRKK+K
Subjt:  NGGQRKKSK

XP_022998520.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita maxima]4.3e-19387.68Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDG+SPSDFEASL+GKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMAL++F ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND
        NQSAATLSKE+CAALHK IQ+VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+N+ K+MND
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND

Query:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGG
        E  GEL SKTKKTADT DTKTK KPKG SKK +TKRKS+ +EDDGSDEEAENDDASD++D  H +GK K G++ NVGR+ +AS +EK SKQTV SSR+G 
Subjt:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGG

Query:  QRKKSK
        QRKK+K
Subjt:  QRKKSK

XP_023523304.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita pepo subsp. pepo]1.7e-19487.93Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKA+IADD KVIDG+SPSDFEASL+GKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMAL++F ESLGKKKLAIKTLLLDQSYISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND
        NQSAATLSKE+CAALHK IQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+N+ K+MND
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND

Query:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDG-HGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGG
        E  GEL SKTKKTADT DTKTK KPKG SKK +TKRKS+ +EDDGSDEEAENDDASD++D  H +GK K G++ NVGR+ DASE+EK SKQTV SSR+G 
Subjt:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDG-HGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGG

Query:  QRKKSK
        QRKK+K
Subjt:  QRKKSK

XP_038877199.1 formamidopyrimidine-DNA glycosylase isoform X3 [Benincasa hispida]4.4e-19088.45Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDGVSP+DFEASL+GKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVEL+DGVDLSFTDKRRFAKV LLKDPASVPPISKLGPDALLEPMAL++F ES+GKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND
        NQSAATLSKE+CAALHK IQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE KNQNSKRK N+SK+MND
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND

Query:  EDAGELASKTKKTADTADTKTK-PKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRD-ASEAEKSSKQTVQSSRNG
        E A EL SKT+KTADTADTK K PKPKGR KK +TKRKS+SD+ DGS+EEAENDDASD+DDGH VGKKKVG+  N GR+ + ASE EKS KQTV SS++G
Subjt:  EDAGELASKTKKTADTADTKTK-PKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRD-ASEAEKSSKQTVQSSRNG

Query:  GQRKKSK
          RKK+K
Subjt:  GQRKKSK

TrEMBL top hitse value%identityAlignment
A0A1S3BY51 formamidopyrimidine-DNA glycosylase isoform X13.4e-18887.93Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+ KVIKKAVIADD KVIDGVSPSDFEASL+GKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKV LL+DPASVPPISKLGPDALLEPMAL+EF ESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND
        NQSAATLSKE+CAALHK IQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK ND+K+MND
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND

Query:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVG-RIRDASEAEKSSKQTVQSSRNGG
        E  GEL SKT+K   TAD K KPKPKGRSKK  +KRKS+S+++DGSDEEAENDDASD+D+G   G KK+G+K N+G R   ASE EKS KQTVQSSRNG 
Subjt:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVG-RIRDASEAEKSSKQTVQSSRNGG

Query:  QRKKSK
        +RKK+K
Subjt:  QRKKSK

A0A5D3E227 Formamidopyrimidine-DNA glycosylase isoform X13.4e-18887.93Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+ KVIKKAVIADD KVIDGVSPSDFEASL+GKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKV LL+DPASVPPISKLGPDALLEPMAL+EF ESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND
        NQSAATLSKE+CAALHK IQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK ND+K+MND
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND

Query:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVG-RIRDASEAEKSSKQTVQSSRNGG
        E  GEL SKT+K   TAD K KPKPKGRSKK  +KRKS+S+++DGSDEEAENDDASD+D+G   G KK+G+K N+G R   ASE EKS KQTVQSSRNG 
Subjt:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVG-RIRDASEAEKSSKQTVQSSRNGG

Query:  QRKKSK
        +RKK+K
Subjt:  QRKKSK

A0A6J1DKS0 formamidopyrimidine-DNA glycosylase isoform X39.6e-19187.65Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV K+IKKA+IADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAI+IKGVAVTNYKRSMV DD
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDP SVPPISKLGPDALLEPM L+ F ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND
        +QSAATLSKE+CA LHKCIQEVIEKALEVGADSS+FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL GAEPK QNSKRK++  KQM+D
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND

Query:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQ
        E  GEL SKTK+TADT DTK KPKP GRSKK  TKRKS+S E D SDEE ENDDA  +DDGH VGKKK G+K N+GRIRDASE +KS KQTVQS  NG Q
Subjt:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQ

Query:  RKKSK
        RKK+K
Subjt:  RKKSK

A0A6J1GCA1 formamidopyrimidine-DNA glycosylase isoform X11.3e-19287.29Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDG+SPSDFEASL+GKTILSAHRKGKH+W+RLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMAL++F ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND
        NQSAATLSKE+CAALHK IQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+N+ K+MND
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND

Query:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAE---KSSKQTVQSSR
        E  GE  SKTKKTADT DTKTK KPKG SKK +TKRKS+ +EDDGSDEEAENDDASD++D  H +GK K G++ NVGR+ DASE+E   K SKQTV SSR
Subjt:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAE---KSSKQTVQSSR

Query:  NGGQRKKSK
        +G QRKK+K
Subjt:  NGGQRKKSK

A0A6J1KCR6 formamidopyrimidine-DNA glycosylase isoform X12.1e-19387.68Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDG+SPSDFEASL+GKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMAL++F ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND
        NQSAATLSKE+CAALHK IQ+VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+N+ K+MND
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMND

Query:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGG
        E  GEL SKTKKTADT DTKTK KPKG SKK +TKRKS+ +EDDGSDEEAENDDASD++D  H +GK K G++ NVGR+ +AS +EK SKQTV SSR+G 
Subjt:  EDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGG

Query:  QRKKSK
        QRKK+K
Subjt:  QRKKSK

SwissProt top hitse value%identityAlignment
A9B0X2 Formamidopyrimidine-DNA glycosylase8.5e-2732.02Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVE  RR++E+  V +          PK++D  SP  F  ++  + I    R+ K+L + LD+      H  M G + +                
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DE   +++   V LD+G +L F D R+F +  L+          +LGP+ L +   L++F + L +K   IK  LLDQS ++G+GN  ADE L+ A+IHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALE
         +SA +L+    A L + I+ V+  ++E
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALE

B0TER7 Formamidopyrimidine-DNA glycosylase2.7e-2831.38Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVE  RR++        I+K  +   PK+   +  + F  +L G+ I+   R+GK+L L LD       H  M G +         + R    ++
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASV--PPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI
         E    ++ FF  LDDG  L +TD R+F  + L+   A++  P   +LGP+ L +  +  +F  +L K+K  +K LLLDQS+++G+GN  ADE L +AR+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASV--PPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI

Query:  HPNQSAATLSKENCAALHKCIQEVIEKALEVGADSSR-----------FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQK
        HP+++A +L  E    L+ CI+ V+++ ++    S R           F      + R   P +    G EI      GR++ F P  QK
Subjt:  HPNQSAATLSKENCAALHKCIQEVIEKALEVGADSSR-----------FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQK

B8FU83 Formamidopyrimidine-DNA glycosylase9.4e-2629.72Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVE  RR++ +H + + I++ +I   P  ++G     F  ++ G    S  R+GK+L   L+       H  M G             R + +  
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI
         + P K++   ++L  G ++ FTD R+F ++ L++  +    P +++LGP+ L E  +  E    L  +KLAIK  LLDQ+ ++GIGN  ADE L++A I
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI

Query:  HPNQSAATLSKENCAALHKCIQEVIEKALEVGADSSR-----------FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVP
         P + A +L+KE    L+  I +V+E+ +     S R           F      + R  +P K    G  +  I   GR++ F P
Subjt:  HPNQSAATLSKENCAALHKCIQEVIEKALEVGADSSR-----------FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVP

O80358 Formamidopyrimidine-DNA glycosylase2.3e-12867.69Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+ K IK+ +IADD KVI G+SPSDF+ S++GKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM ++EF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSK----
         Q+A++LSKE C ALH  I+EVIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G   K+     KV  +K    
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSK----

Query:  -QMNDEDAGELASKTKKTADTADTKTKPKPK-GRSKKSATKRKSQSDEDDGSDEEAEND
         + +D D  E   +T+K  ++A +K   KP+ GR KK A+K K++  +DDG D EAE +
Subjt:  -QMNDEDAGELASKTKKTADTADTKTKPKPK-GRSKKSATKRKSQSDEDDGSDEEAEND

Q03GC2 Formamidopyrimidine-DNA glycosylase4.2e-2632.08Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSD--FEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVN
        MPELPEVE  RR +      K++   V+    +    VSP    F   L GK IL+  R+GK+L +          H  M G            K S+V+
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSD--FEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVN

Query:  DDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQA
          +E+  K+     ELDDG DL + D R+F ++ L+   +   V  +  +GP+   E + L   T  L  +K  +K+ LLDQS I+G+GN  ADEVL+ +
Subjt:  DDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQA

Query:  RIHPNQSAATLSKENCAALHKCIQEVIEKALE---------VGAD--SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL
        +IHP Q + TL+ E  A L + I E ++ A+E         + AD  +  F N    + R+  P +    G  I  I    R + F P  Q L
Subjt:  RIHPNQSAATLSKENCAALHKCIQEVIEKALE---------VGAD--SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL

Arabidopsis top hitse value%identityAlignment
AT1G52500.1 MUTM homolog-11.1e-10671.97Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+ K IK+ +IADD KVI G+SPSDF+ S++GKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM ++EF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFIT
         Q+A++LSKE C ALH  I+EVI+ A++V ADS  FP  W+FH R  KK GK  V+GK  H ++
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFIT

AT1G52500.2 MUTM homolog-11.6e-12967.69Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+ K IK+ +IADD KVI G+SPSDF+ S++GKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM ++EF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSK----
         Q+A++LSKE C ALH  I+EVIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G   K+     KV  +K    
Subjt:  NQSAATLSKENCAALHKCIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSK----

Query:  -QMNDEDAGELASKTKKTADTADTKTKPKPK-GRSKKSATKRKSQSDEDDGSDEEAEND
         + +D D  E   +T+K  ++A +K   KP+ GR KK A+K K++  +DDG D EAE +
Subjt:  -QMNDEDAGELASKTKKTADTADTKTKPKPK-GRSKKSATKRKSQSDEDDGSDEEAEND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGAGCTACCGGAGGTAGAGGCGGCGAGGAGGGCCATTGAAGAGCATTGCGTCAGGAAAGTCATCAAGAAGGCCGTGATAGCCGACGATCCGAAGGTAATCGACGG
CGTATCGCCCTCCGACTTCGAGGCTTCGCTCGTAGGCAAGACCATTCTCTCCGCCCATCGCAAGGGCAAGCATCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTTGGGATGGCGGGTGCCATATATATCAAGGGCGTAGCTGTCACAAACTATAAAAGGTCTATGGTTAATGATGATGATGAGTGGCCTTCCAAGTACTCTAAGTTC
TTTGTTGAGCTTGACGACGGTGTAGACCTATCCTTCACAGACAAAAGGCGGTTTGCAAAAGTCTGCCTGCTCAAAGATCCAGCTTCAGTGCCCCCAATATCTAAGCTTGG
CCCAGATGCTCTCTTAGAACCTATGGCACTGAATGAGTTTACCGAATCCTTGGGCAAGAAGAAACTGGCAATTAAGACTCTATTGCTTGATCAGAGCTATATTTCAGGTA
TTGGCAATTGGGTCGCAGATGAAGTGCTATATCAAGCGAGAATTCATCCAAATCAAAGTGCTGCAACCTTATCTAAAGAGAATTGTGCAGCTTTGCACAAGTGCATACAA
GAGGTAATTGAAAAAGCACTTGAAGTTGGAGCAGATAGTAGTCGGTTCCCAAATAATTGGATTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAA
GGAAATCCATTTCATCACTACAGGCGGCAGGACATCGGCCTTCGTACCCGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAATTCAAAGAGGAAAGTCAATG
ATAGCAAACAAATGAATGATGAGGATGCTGGTGAACTAGCGAGCAAGACAAAGAAAACTGCAGATACAGCTGATACAAAGACAAAGCCAAAGCCTAAAGGTCGCTCTAAG
AAGTCTGCAACAAAAAGAAAATCCCAAAGCGACGAGGATGATGGCTCTGATGAAGAAGCTGAAAACGACGATGCCAGTGATAACGACGATGGTCATGGTGTTGGAAAGAA
GAAAGTGGGACAGAAAATGAACGTCGGGCGAATACGTGATGCTTCTGAAGCAGAGAAGTCTTCGAAGCAAACAGTTCAAAGCAGTCGAAATGGTGGGCAGAGGAAGAAAT
CAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGGAGCTACCGGAGGTAGAGGCGGCGAGGAGGGCCATTGAAGAGCATTGCGTCAGGAAAGTCATCAAGAAGGCCGTGATAGCCGACGATCCGAAGGTAATCGACGG
CGTATCGCCCTCCGACTTCGAGGCTTCGCTCGTAGGCAAGACCATTCTCTCCGCCCATCGCAAGGGCAAGCATCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTTGGGATGGCGGGTGCCATATATATCAAGGGCGTAGCTGTCACAAACTATAAAAGGTCTATGGTTAATGATGATGATGAGTGGCCTTCCAAGTACTCTAAGTTC
TTTGTTGAGCTTGACGACGGTGTAGACCTATCCTTCACAGACAAAAGGCGGTTTGCAAAAGTCTGCCTGCTCAAAGATCCAGCTTCAGTGCCCCCAATATCTAAGCTTGG
CCCAGATGCTCTCTTAGAACCTATGGCACTGAATGAGTTTACCGAATCCTTGGGCAAGAAGAAACTGGCAATTAAGACTCTATTGCTTGATCAGAGCTATATTTCAGGTA
TTGGCAATTGGGTCGCAGATGAAGTGCTATATCAAGCGAGAATTCATCCAAATCAAAGTGCTGCAACCTTATCTAAAGAGAATTGTGCAGCTTTGCACAAGTGCATACAA
GAGGTAATTGAAAAAGCACTTGAAGTTGGAGCAGATAGTAGTCGGTTCCCAAATAATTGGATTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAA
GGAAATCCATTTCATCACTACAGGCGGCAGGACATCGGCCTTCGTACCCGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAATTCAAAGAGGAAAGTCAATG
ATAGCAAACAAATGAATGATGAGGATGCTGGTGAACTAGCGAGCAAGACAAAGAAAACTGCAGATACAGCTGATACAAAGACAAAGCCAAAGCCTAAAGGTCGCTCTAAG
AAGTCTGCAACAAAAAGAAAATCCCAAAGCGACGAGGATGATGGCTCTGATGAAGAAGCTGAAAACGACGATGCCAGTGATAACGACGATGGTCATGGTGTTGGAAAGAA
GAAAGTGGGACAGAAAATGAACGTCGGGCGAATACGTGATGCTTCTGAAGCAGAGAAGTCTTCGAAGCAAACAGTTCAAAGCAGTCGAAATGGTGGGCAGAGGAAGAAAT
CAAAGTAA
Protein sequenceShow/hide protein sequence
MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDDDEWPSKYSKF
FVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSAATLSKENCAALHKCIQ
EVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADTKTKPKPKGRSK
KSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQRKKSK