; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg038659 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg038659
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionformamidopyrimidine-DNA glycosylase isoform X1
Genome locationscaffold12:6790490..6799585
RNA-Seq ExpressionSpg038659
SyntenySpg038659
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0019104 - DNA N-glycosylase activity (molecular function)
InterPro domainsIPR010979 - Ribosomal protein S13-like, H2TH
IPR012319 - Formamidopyrimidine-DNA glycosylase, catalytic domain
IPR015886 - DNA glycosylase/AP lyase, H2TH DNA-binding
IPR035937 - MutM-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044473.1 formamidopyrimidine-DNA glycosylase isoform X2 [Cucumis melo var. makuwa]9.7e-16671.25Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDGVSPSDFEASL+GKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMAL+EF ESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
        NQSAATLSKE+CAALHK IQEV         K +  + A+S          SF + +  H                             F +G   G   
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT
          VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK ND+K+MNDE  GEL SKT+K   TAD 
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT

Query:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVG-RIRDASEAEKSSKQTVQSSRNGGQRKKSK
        K KPKPKGRSKK  +KRKS+S+++DGSDEEAENDDASD+D+G   G KK+G+K N+G R   ASE EKS KQTVQSSRNG +RKK+K
Subjt:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVG-RIRDASEAEKSSKQTVQSSRNGGQRKKSK

XP_022949541.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita moschata]5.3e-16468.16Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDG+SPSDFEASL+GKTILSAHRKGKH+W+RLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMAL++F ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
        NQSAATLSKE+CAALHK IQE                                                                               
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT
          VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+N+ K+MNDE  GE  SKTKKTADT DT
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT

Query:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAE---KSSKQTVQSSRNGGQRKKSK
        KTK KPKG SKK +TKRKS+ +EDDGSDEEAENDDASD++D  H +GK K G++ NVGR+ DASE+E   K SKQTV SSR+G QRKK+K
Subjt:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAE---KSSKQTVQSSRNGGQRKKSK

XP_022998520.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita maxima]8.2e-16568.38Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDG+SPSDFEASL+GKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMAL++F ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
        NQSAATLSKE+CAALHK IQ+                                                                               
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT
          VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+N+ K+MNDE  GEL SKTKKTADT DT
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT

Query:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQRKKSK
        KTK KPKG SKK +TKRKS+ +EDDGSDEEAENDDASD++D  H +GK K G++ NVGR+ +AS +EK SKQTV SSR+G QRKK+K
Subjt:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQRKKSK

XP_023523304.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita pepo subsp. pepo]3.3e-16668.58Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKA+IADD KVIDG+SPSDFEASL+GKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMAL++F ESLGKKKLAIKTLLLDQSYISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
        NQSAATLSKE+CAALHK IQE                                                                               
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT
          VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+N+ K+MNDE  GEL SKTKKTADT DT
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT

Query:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDG-HGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQRKKSK
        KTK KPKG SKK +TKRKS+ +EDDGSDEEAENDDASD++D  H +GK K G++ NVGR+ DASE+EK SKQTV SSR+G QRKK+K
Subjt:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDG-HGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQRKKSK

XP_038877199.1 formamidopyrimidine-DNA glycosylase isoform X3 [Benincasa hispida]2.0e-16369.47Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDGVSP+DFEASL+GKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMAL++F ES+GKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
        NQSAATLSKE+CAALHK IQE                                                                               
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT
          VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE KNQNSKRK N+SK+MNDE A EL SKT+KTADTADT
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT

Query:  KTK-PKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRD-ASEAEKSSKQTVQSSRNGGQRKKSK
        K K PKPKGR KK +TKRKS+SD+ DGS+EEAENDDASD+DDGH VGKKKVG+  N GR+ + ASE EKS KQTV SS++G  RKK+K
Subjt:  KTK-PKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRD-ASEAEKSSKQTVQSSRNGGQRKKSK

TrEMBL top hitse value%identityAlignment
A0A5A7TLT5 Formamidopyrimidine-DNA glycosylase isoform X24.7e-16671.25Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDGVSPSDFEASL+GKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMAL+EF ESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
        NQSAATLSKE+CAALHK IQEV         K +  + A+S          SF + +  H                             F +G   G   
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT
          VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK ND+K+MNDE  GEL SKT+K   TAD 
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT

Query:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVG-RIRDASEAEKSSKQTVQSSRNGGQRKKSK
        K KPKPKGRSKK  +KRKS+S+++DGSDEEAENDDASD+D+G   G KK+G+K N+G R   ASE EKS KQTVQSSRNG +RKK+K
Subjt:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVG-RIRDASEAEKSSKQTVQSSRNGGQRKKSK

A0A5D3E227 Formamidopyrimidine-DNA glycosylase isoform X11.6e-16168.99Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+ KVIKKAVIADD KVIDGVSPSDFEASL+GKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMAL+EF ESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
        NQSAATLSKE+CAALHK IQE                                                                               
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT
          VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK ND+K+MNDE  GEL SKT+K   TAD 
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT

Query:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVG-RIRDASEAEKSSKQTVQSSRNGGQRKKSK
        K KPKPKGRSKK  +KRKS+S+++DGSDEEAENDDASD+D+G   G KK+G+K N+G R   ASE EKS KQTVQSSRNG +RKK+K
Subjt:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVG-RIRDASEAEKSSKQTVQSSRNGGQRKKSK

A0A6J1DKS0 formamidopyrimidine-DNA glycosylase isoform X32.4e-16268.31Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV K+IKKA+IADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAI+IKGVAVTNYKRSMV DD
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       P SVPPISKLGPDALLEPM L+ F ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
        +QSAATLSKE+CA LHKCIQE                                                                               
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT
          VIEKALEVGADSS+FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL GAEPK QNSKRK++  KQM+DE  GEL SKTK+TADT DT
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT

Query:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQRKKSK
        K KPKP GRSKK  TKRKS+S E D SDEE ENDDA  +DDGH VGKKK G+K N+GRIRDASE +KS KQTVQS  NG QRKK+K
Subjt:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQRKKSK

A0A6J1GCA1 formamidopyrimidine-DNA glycosylase isoform X12.6e-16468.16Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDG+SPSDFEASL+GKTILSAHRKGKH+W+RLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMAL++F ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
        NQSAATLSKE+CAALHK IQE                                                                               
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT
          VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+N+ K+MNDE  GE  SKTKKTADT DT
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT

Query:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAE---KSSKQTVQSSRNGGQRKKSK
        KTK KPKG SKK +TKRKS+ +EDDGSDEEAENDDASD++D  H +GK K G++ NVGR+ DASE+E   K SKQTV SSR+G QRKK+K
Subjt:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAE---KSSKQTVQSSRNGGQRKKSK

A0A6J1KCR6 formamidopyrimidine-DNA glycosylase isoform X14.0e-16568.38Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCV KVIKKAVIADD KVIDG+SPSDFEASL+GKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMAL++F ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
        NQSAATLSKE+CAALHK IQ+                                                                               
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT
          VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+N+ K+MNDE  GEL SKTKKTADT DT
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADT

Query:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQRKKSK
        KTK KPKG SKK +TKRKS+ +EDDGSDEEAENDDASD++D  H +GK K G++ NVGR+ +AS +EK SKQTV SSR+G QRKK+K
Subjt:  KTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDD-GHGVGKKKVGQKMNVGRIRDASEAEKSSKQTVQSSRNGGQRKKSK

SwissProt top hitse value%identityAlignment
A9B0X2 Formamidopyrimidine-DNA glycosylase3.0e-2131.92Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKG------------VA
        MPELPEVE  RR++E+  V +          PK++D  SP  F  ++  + I    R+ K+L + LD+      H  M G + +              VA
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKG------------VA

Query:  VTNYKRSMVNDDDEWPSKYSKF-FVEPASVPPIS-KLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSAATLSK
        + N +    +D    P K+ ++  V+ + V  ++ +LGP+ L +   L++F + L +K   IK  LLDQS ++G+GN  ADE L+ A+IHP +SA +L+ 
Subjt:  VTNYKRSMVNDDDEWPSKYSKF-FVEPASVPPIS-KLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSAATLSK

Query:  ENCAALHKCIQEV
           A L + I+ V
Subjt:  ENCAALHKCIQEV

B0TER7 Formamidopyrimidine-DNA glycosylase9.8e-2030.18Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVE  RR++        I+K  +   PK+   +  + F  +L G+ I+   R+GK+L L LD       H  M G +         + R    ++
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVEPASV-----------------------PPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
         E    ++ FF++  S+                       P   +LGP+ L +  +  +F  +L K+K  +K LLLDQS+++G+GN  ADE L +AR+HP
Subjt:  DEWPSKYSKFFVEPASV-----------------------PPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEV
        +++A +L  E    L+ CI+ V
Subjt:  NQSAATLSKENCAALHKCIQEV

O80358 Formamidopyrimidine-DNA glycosylase3.2e-10351.14Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+ K IK+ +IADD KVI G+SPSDF+ S++GKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVE                       P SV PIS+LGPDALLEPM ++EF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
         Q+A++LSKE C ALH  I+E                                                                               
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSK-----QMNDEDAGELASKTKKTA
          VIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G   K+     KV  +K     + +D D  E   +T+K  
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSK-----QMNDEDAGELASKTKKTA

Query:  DTADTKTKPKPK-GRSKKSATKRKSQSDEDDGSDEEAEND
        ++A +K   KP+ GR KK A+K K++  +DDG D EAE +
Subjt:  DTADTKTKPKPK-GRSKKSATKRKSQSDEDDGSDEEAEND

Q8FP17 Formamidopyrimidine-DNA glycosylase8.3e-1933.94Show/hide
Query:  MPELPEVEAARRAIEEHCV-RKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPA-------FHFGMAGAIYIKGVAVT--
        MPELPEVE  RR +EEH V R ++  AV+            ++ EA+L G  + + +R+GK LWL LD     A        H GM+G + +K    T  
Subjt:  MPELPEVEAARRAIEEHCV-RKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPA-------FHFGMAGAIYIKGVAVT--

Query:  --NYKRSMVNDDDE-WPSKYSKF----FVEPASVPP--ISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSA
             R+ ++D +E W      F      E     P  +S +  D L + + +      L  K   IK LLL+Q  +SGIGN  ADE+L++A IHP Q A
Subjt:  --NYKRSMVNDDDE-WPSKYSKF----FVEPASVPP--ISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSA

Query:  ATLSKENCAALHKCIQEV
        + +S     AL +  +EV
Subjt:  ATLSKENCAALHKCIQEV

Q8NNV7 Formamidopyrimidine-DNA glycosylase1.4e-1835.58Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVID--GVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFP--------AFHFGMAGAIYIK--GVA
        MPELPEVE  RR +E+H V   I  A +       +  G  P + EA++ G  + +A R+GK LWL L   P            H GM+G + IK     
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVID--GVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFP--------AFHFGMAGAIYIK--GVA

Query:  VTNYKRSMV---NDDDEWPSKYSKF-------FVEPASVPP-ISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        ++ + R+ V   N D+ W      F        V+   VP  +S +  D L E    +    +L  +K  IK LLL+Q  +SGIGN  ADE+L+QA+IHP
Subjt:  VTNYKRSMV---NDDDEWPSKYSKF-------FVEPASVPP-ISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLS
         Q A  LS
Subjt:  NQSAATLS

Arabidopsis top hitse value%identityAlignment
AT1G52500.1 MUTM homolog-17.1e-8268.47Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+ K IK+ +IADD KVI G+SPSDF+ S++GKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVE                       P SV PIS+LGPDALLEPM ++EF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEV
         Q+A++LSKE C ALH  I+EV
Subjt:  NQSAATLSKENCAALHKCIQEV

AT1G52500.2 MUTM homolog-12.3e-10451.14Show/hide
Query:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+ K IK+ +IADD KVI G+SPSDF+ S++GKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVE                       P SV PIS+LGPDALLEPM ++EF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR
         Q+A++LSKE C ALH  I+E                                                                               
Subjt:  NQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGLSHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDR

Query:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSK-----QMNDEDAGELASKTKKTA
          VIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G   K+     KV  +K     + +D D  E   +T+K  
Subjt:  SMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVNDSK-----QMNDEDAGELASKTKKTA

Query:  DTADTKTKPKPK-GRSKKSATKRKSQSDEDDGSDEEAEND
        ++A +K   KP+ GR KK A+K K++  +DDG D EAE +
Subjt:  DTADTKTKPKPK-GRSKKSATKRKSQSDEDDGSDEEAEND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGAGCTACCGGAGGTAGAGGCGGCGAGGAGGGCCATTGAAGAGCATTGCGTCAGGAAAGTCATCAAGAAGGCCGTGATAGCCGACGATCCGAAGGTAATCGACGG
CGTATCGCCCTCCGACTTCGAGGCTTCGCTCGTAGGCAAGACCATTCTCTCCGCCCATCGCAAGGGCAAGCATCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTTGGGATGGCGGGTGCCATATATATCAAGGGCGTAGCTGTCACAAACTATAAAAGGTCTATGGTTAATGATGATGATGAGTGGCCTTCCAAGTACTCTAAGTTC
TTTGTTGAGCCAGCTTCAGTGCCCCCAATATCTAAGCTTGGCCCAGATGCTCTCTTAGAACCTATGGCACTGAATGAGTTTACCGAATCCTTGGGCAAGAAGAAACTGGC
AATTAAGACTCTATTGCTTGATCAGAGCTATATTTCAGGTATTGGCAATTGGGTCGCAGATGAAGTGCTATATCAAGCGAGAATTCATCCAAATCAAAGTGCTGCAACCT
TATCTAAAGAGAATTGTGCAGCTTTGCACAAGTGCATACAAGAGGTACGCCTTTGCTCTTTTCCCTCCCCCTCCAAACCATCCCCAACCCTTGCTGCCCAGTCAGGGTTA
AGCCATGCTGCGCTCTTAGAGATTTCTTTTAGTAAAACATTCACTCTTCATGGAAAGTATAGAGGAACCACCTGTAGGTCCTTAAAAGAGCAGTTGAAGTTGATGCTGAG
TCCAACAACTTTCCTGAAGAATGGTTGTTTCATTTTCGGTGGGGAAAAAGGCCTGGACAGGTCAATGGTAATTGAAAAAGCACTTGAAGTTGGAGCAGATAGTAGTCGGT
TCCCAAATAATTGGATTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAAGGAAATCCATTTCATCACTACAGGCGGCAGGACATCGGCCTTCGTA
CCCGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAATTCAAAGAGGAAAGTCAATGATAGCAAACAAATGAATGATGAGGATGCTGGTGAACTAGCGAGCAA
GACAAAGAAAACTGCAGATACAGCTGATACAAAGACAAAGCCAAAGCCTAAAGGTCGCTCTAAGAAGTCTGCAACAAAAAGAAAATCCCAAAGCGACGAGGATGATGGCT
CTGATGAAGAAGCTGAAAACGACGATGCCAGTGATAACGACGATGGTCATGGTGTTGGAAAGAAGAAAGTGGGACAGAAAATGAACGTCGGGCGAATACGTGATGCTTCT
GAAGCAGAGAAGTCTTCGAAGCAAACAGTTCAAAGCAGTCGAAATGGTGGGCAGAGGAAGAAATCAAAGAGAATGGCAGGAGCAGACAAACAAGCAAAACATATATGGAC
GAGGTTGGAGGAGGCAAAATTGGTTGAATGCCTCGTGGAGCTTTCCCACTTAGCTGCAAAGGGTCTACTGAACAAGCCATTTCCTCAGTACGAGGAACTCGCCTTCGTGT
TCGGCAAGGATCGGGCTAGTGGATCGGGGTGCAATGATTATTATGTCCCCATCCCTCCAGTAGAAAATTTGGCCACAGATGTCGAGTTTGAGGATGTCCCCATAACGCCC
ACGAGCCGATCGAGTACAGCGGGGTCCTCGCAAGGACGGAAGAGGAGTAGAGCATCATATGAAGCAGAAGCACTGGAAATAATGAGGCAGGCAGTCAGCATACAGGAGAC
ACAATTCACGAAAATTGCTGACTGGCCAGACACACAAGACGCTAGGGAGTTCAAGAGGCGAGAAACAGTTGGGGAGATGCTCATGGCTCAGTCGGAGCTAACAGATCGCG
AGAGAGTCTCCCTTATGCGTGTCCTCTTCGTCGACACCAAGATGACCAATATGATACTGATTACCGATCATATGTCGTCACATGTTACAGACGATCTGGTGGATGATGTG
TCTGACACAAGCAGCAGTAATGTGGGACCAGCTGAGACATCAACAGGATCAAGTAGTAGGAGACGAACTTCCTATAACAGAGAGATGATTGAGGTCGTGAAGGCTGCAAT
GGATAGCCAAATTACCAGCCTTCAAAAGATCGCATCCTGGCGAGAACAAAAAAACGAACGGGAGGCTGCACGACGAAAGTTGAATGGCCGGTTCGTCCACAACCCTGAAG
CACACTTGTACGAAAAACGAGGATGCGAAGCTGGTAGAATGCTTCGTGTCTTTTGTCCATGTTGGCGGTTGGAGGGTTTGAACCCTGAAGAAACAGTACCAGGCGATTGC
AGAAATGATGGGGCCAGATGCAACGACTTTGGGTGGAACGAAGAATTTAAGTGTATTGAGGTAGAGAAGAAAACATTCGACCTATGGGTGAAGGACCATCCTATAGCAAA
AGGCATGCGAAACAAATCGTTCCAGCATTTTGACGACTTGGCCTTTGTATTCGAAAAGGATCAAGCTACAGAGGCAGGGTTGAAATGTCGTGGAGATATGGCATCAAATG
TGCCAGAGCATATGGAAGAGGAGATACACCTCGGTGGATCTCAAGAGAACAACATCTTAATCTCGTCGTTCACCATGCCTAGTGTGGACATGCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCGGAGCTACCGGAGGTAGAGGCGGCGAGGAGGGCCATTGAAGAGCATTGCGTCAGGAAAGTCATCAAGAAGGCCGTGATAGCCGACGATCCGAAGGTAATCGACGG
CGTATCGCCCTCCGACTTCGAGGCTTCGCTCGTAGGCAAGACCATTCTCTCCGCCCATCGCAAGGGCAAGCATCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTTGGGATGGCGGGTGCCATATATATCAAGGGCGTAGCTGTCACAAACTATAAAAGGTCTATGGTTAATGATGATGATGAGTGGCCTTCCAAGTACTCTAAGTTC
TTTGTTGAGCCAGCTTCAGTGCCCCCAATATCTAAGCTTGGCCCAGATGCTCTCTTAGAACCTATGGCACTGAATGAGTTTACCGAATCCTTGGGCAAGAAGAAACTGGC
AATTAAGACTCTATTGCTTGATCAGAGCTATATTTCAGGTATTGGCAATTGGGTCGCAGATGAAGTGCTATATCAAGCGAGAATTCATCCAAATCAAAGTGCTGCAACCT
TATCTAAAGAGAATTGTGCAGCTTTGCACAAGTGCATACAAGAGGTACGCCTTTGCTCTTTTCCCTCCCCCTCCAAACCATCCCCAACCCTTGCTGCCCAGTCAGGGTTA
AGCCATGCTGCGCTCTTAGAGATTTCTTTTAGTAAAACATTCACTCTTCATGGAAAGTATAGAGGAACCACCTGTAGGTCCTTAAAAGAGCAGTTGAAGTTGATGCTGAG
TCCAACAACTTTCCTGAAGAATGGTTGTTTCATTTTCGGTGGGGAAAAAGGCCTGGACAGGTCAATGGTAATTGAAAAAGCACTTGAAGTTGGAGCAGATAGTAGTCGGT
TCCCAAATAATTGGATTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAAGGAAATCCATTTCATCACTACAGGCGGCAGGACATCGGCCTTCGTA
CCCGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAATTCAAAGAGGAAAGTCAATGATAGCAAACAAATGAATGATGAGGATGCTGGTGAACTAGCGAGCAA
GACAAAGAAAACTGCAGATACAGCTGATACAAAGACAAAGCCAAAGCCTAAAGGTCGCTCTAAGAAGTCTGCAACAAAAAGAAAATCCCAAAGCGACGAGGATGATGGCT
CTGATGAAGAAGCTGAAAACGACGATGCCAGTGATAACGACGATGGTCATGGTGTTGGAAAGAAGAAAGTGGGACAGAAAATGAACGTCGGGCGAATACGTGATGCTTCT
GAAGCAGAGAAGTCTTCGAAGCAAACAGTTCAAAGCAGTCGAAATGGTGGGCAGAGGAAGAAATCAAAGAGAATGGCAGGAGCAGACAAACAAGCAAAACATATATGGAC
GAGGTTGGAGGAGGCAAAATTGGTTGAATGCCTCGTGGAGCTTTCCCACTTAGCTGCAAAGGGTCTACTGAACAAGCCATTTCCTCAGTACGAGGAACTCGCCTTCGTGT
TCGGCAAGGATCGGGCTAGTGGATCGGGGTGCAATGATTATTATGTCCCCATCCCTCCAGTAGAAAATTTGGCCACAGATGTCGAGTTTGAGGATGTCCCCATAACGCCC
ACGAGCCGATCGAGTACAGCGGGGTCCTCGCAAGGACGGAAGAGGAGTAGAGCATCATATGAAGCAGAAGCACTGGAAATAATGAGGCAGGCAGTCAGCATACAGGAGAC
ACAATTCACGAAAATTGCTGACTGGCCAGACACACAAGACGCTAGGGAGTTCAAGAGGCGAGAAACAGTTGGGGAGATGCTCATGGCTCAGTCGGAGCTAACAGATCGCG
AGAGAGTCTCCCTTATGCGTGTCCTCTTCGTCGACACCAAGATGACCAATATGATACTGATTACCGATCATATGTCGTCACATGTTACAGACGATCTGGTGGATGATGTG
TCTGACACAAGCAGCAGTAATGTGGGACCAGCTGAGACATCAACAGGATCAAGTAGTAGGAGACGAACTTCCTATAACAGAGAGATGATTGAGGTCGTGAAGGCTGCAAT
GGATAGCCAAATTACCAGCCTTCAAAAGATCGCATCCTGGCGAGAACAAAAAAACGAACGGGAGGCTGCACGACGAAAGTTGAATGGCCGGTTCGTCCACAACCCTGAAG
CACACTTGTACGAAAAACGAGGATGCGAAGCTGGTAGAATGCTTCGTGTCTTTTGTCCATGTTGGCGGTTGGAGGGTTTGAACCCTGAAGAAACAGTACCAGGCGATTGC
AGAAATGATGGGGCCAGATGCAACGACTTTGGGTGGAACGAAGAATTTAAGTGTATTGAGGTAGAGAAGAAAACATTCGACCTATGGGTGAAGGACCATCCTATAGCAAA
AGGCATGCGAAACAAATCGTTCCAGCATTTTGACGACTTGGCCTTTGTATTCGAAAAGGATCAAGCTACAGAGGCAGGGTTGAAATGTCGTGGAGATATGGCATCAAATG
TGCCAGAGCATATGGAAGAGGAGATACACCTCGGTGGATCTCAAGAGAACAACATCTTAATCTCGTCGTTCACCATGCCTAGTGTGGACATGCCCTAG
Protein sequenceShow/hide protein sequence
MPELPEVEAARRAIEEHCVRKVIKKAVIADDPKVIDGVSPSDFEASLVGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDDDEWPSKYSKF
FVEPASVPPISKLGPDALLEPMALNEFTESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSAATLSKENCAALHKCIQEVRLCSFPSPSKPSPTLAAQSGL
SHAALLEISFSKTFTLHGKYRGTTCRSLKEQLKLMLSPTTFLKNGCFIFGGEKGLDRSMVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFV
PELQKLTGAEPKNQNSKRKVNDSKQMNDEDAGELASKTKKTADTADTKTKPKPKGRSKKSATKRKSQSDEDDGSDEEAENDDASDNDDGHGVGKKKVGQKMNVGRIRDAS
EAEKSSKQTVQSSRNGGQRKKSKRMAGADKQAKHIWTRLEEAKLVECLVELSHLAAKGLLNKPFPQYEELAFVFGKDRASGSGCNDYYVPIPPVENLATDVEFEDVPITP
TSRSSTAGSSQGRKRSRASYEAEALEIMRQAVSIQETQFTKIADWPDTQDAREFKRRETVGEMLMAQSELTDRERVSLMRVLFVDTKMTNMILITDHMSSHVTDDLVDDV
SDTSSSNVGPAETSTGSSSRRRTSYNREMIEVVKAAMDSQITSLQKIASWREQKNEREAARRKLNGRFVHNPEAHLYEKRGCEAGRMLRVFCPCWRLEGLNPEETVPGDC
RNDGARCNDFGWNEEFKCIEVEKKTFDLWVKDHPIAKGMRNKSFQHFDDLAFVFEKDQATEAGLKCRGDMASNVPEHMEEEIHLGGSQENNILISSFTMPSVDMP