; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005743 (gene) of Snake gourd v1 genome

Gene IDTan0005743
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionformamidopyrimidine-DNA glycosylase isoform X1
Genome locationLG04:84483420..84488809
RNA-Seq ExpressionTan0005743
SyntenyTan0005743
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008534 - oxidized purine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0016829 - lyase activity (molecular function)
InterPro domainsIPR010979 - Ribosomal protein S13-like, H2TH
IPR012319 - Formamidopyrimidine-DNA glycosylase, catalytic domain
IPR015886 - DNA glycosylase/AP lyase, H2TH DNA-binding
IPR020629 - Formamidopyrimidine-DNA glycosylase
IPR035937 - MutM-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606816.1 Formamidopyrimidine-DNA glycosylase, partial [Cucurbita argyrosperma subsp. sororia]1.2e-19087.96Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEE+CVGKVIKKAVIADDSKVIDG+SPSDFEAS+LGKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRSVVN+D
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPA VPPISKLGPDALLEPMALD+FIESLGKKKLAIKTLLLDQS+ISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG
        NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+ + K MN 
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG

Query:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESE------KSSKQRVP
        EGVGE VSKTK+TADT DTK K KPKG SKKPSTKRKSK +EDD  DEEAENDDASD++D+SH  GK K GK+ NVG++ DA+ESE      K SKQ VP
Subjt:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESE------KSSKQRVP

Query:  SSRNGRQ
        SSR+  Q
Subjt:  SSRNGRQ

KAG7036522.1 Formamidopyrimidine-DNA glycosylase, partial [Cucurbita argyrosperma subsp. argyrosperma]2.4e-19185.31Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEE+CVGKVIKKAVIADDSKVIDG+SPSDFEAS+LGKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRSVVN+D
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD--------------------PALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSY
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD                    PA VPPISKLGPDALLEPMALD+FIESLGKKKLAIKTLLLDQS+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD--------------------PALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSY

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
        ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG

Query:  AEPKNQNSKRKVKDSKHMNGEGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMR
        AEPKNQNSKRK+ + K MN EGVGE VSKTK+TADT DTK K KPKG SKKPSTKRKSK +EDD  DEEAENDDASD++D+SH  GK K GK+ NVG+M 
Subjt:  AEPKNQNSKRKVKDSKHMNGEGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMR

Query:  DATESE---KSSKQRVPSSRNGRQRKKAK
        DA+ESE   K SKQ VPSSR+GRQRKKAK
Subjt:  DATESE---KSSKQRVPSSRNGRQRKKAK

XP_022949541.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita moschata]6.0e-19589.24Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEE+CVGKVIKKAVIADDSKVIDG+SPSDFEAS+LGKTILSAHRKGKH+W+RLDSPPFP FHFGMAGAIYIKGVAVTNYKRSVVN+D
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPA VPPISKLGPDALLEPMALD+F+ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG
        NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+ + K MN 
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG

Query:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESE---KSSKQRVPSSR
        EGVGE VSKTK+TADT DTK K KPKG SKKPSTKRKSK +EDD  DEEAENDDASD++D+SH  GK K GK+ NVG+M DA+ESE   K SKQ VPSSR
Subjt:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESE---KSSKQRVPSSR

Query:  NGRQRKKAK
        +GRQRKKAK
Subjt:  NGRQRKKAK

XP_022998520.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita maxima]1.6e-19589.66Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEE+CVGKVIKKAVIADDSKVIDG+SPSDFEAS+LGKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRSVVN+D
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPA VPPISKLGPDALLEPMALD+FIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG
        NQSAATLSKESCAALHKSIQ+VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+ + K MN 
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG

Query:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESEKSSKQRVPSSRNGR
        EGVGELVSKTK+TADT DTK K KPKG SKKPSTKRKSK +EDD  DEEAENDDASD++D+SH  GK K GK+ NVG+M +A+ SEK SKQ VPSSR+GR
Subjt:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESEKSSKQRVPSSRNGR

Query:  QRKKAK
        QRKK K
Subjt:  QRKKAK

XP_023523304.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita pepo subsp. pepo]2.4e-19689.66Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEE+CVGKVIKKA+IADDSKVIDG+SPSDFEAS+LGKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRSVVN+D
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPA VPPISKLGPDALLEPMALD+FIESLGKKKLAIKTLLLDQSYISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG
        NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+ + K MN 
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG

Query:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESEKSSKQRVPSSRNGR
        EGVGELVSKTK+TADT DTK K KPKG SKKPSTKRKSK +EDD  DEEAENDDASD++D+ H  GK K GK+ NVG+M DA+ESEK SKQ VPSSR+G+
Subjt:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESEKSSKQRVPSSRNGR

Query:  QRKKAK
        QRKKAK
Subjt:  QRKKAK

TrEMBL top hitse value%identityAlignment
A0A1S3BY51 formamidopyrimidine-DNA glycosylase isoform X11.9e-18688.94Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEE+C+GKVIKKAVIADD+KVIDGVSPSDFEAS+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRS+VNDD
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKV LL+DPA VPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG
        NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK  D+K MN 
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG

Query:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKS-DEDDDDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRD-ATESEKSSKQRVPSSRNG
        EG GELVSKT++   TAD K+KPKPKGRSKKPS KRKSKS DED  DEEAENDDASD DD+    G KK+GKK N+GQ  D A+E EKS KQ V SSRNG
Subjt:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKS-DEDDDDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRD-ATESEKSSKQRVPSSRNG

Query:  RQRKKAK
        R+RKKAK
Subjt:  RQRKKAK

A0A5D3E227 Formamidopyrimidine-DNA glycosylase isoform X11.9e-18688.94Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEE+C+GKVIKKAVIADD+KVIDGVSPSDFEAS+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRS+VNDD
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKV LL+DPA VPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG
        NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK  D+K MN 
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG

Query:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKS-DEDDDDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRD-ATESEKSSKQRVPSSRNG
        EG GELVSKT++   TAD K+KPKPKGRSKKPS KRKSKS DED  DEEAENDDASD DD+    G KK+GKK N+GQ  D A+E EKS KQ V SSRNG
Subjt:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKS-DEDDDDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRD-ATESEKSSKQRVPSSRNG

Query:  RQRKKAK
        R+RKKAK
Subjt:  RQRKKAK

A0A6J1DKS0 formamidopyrimidine-DNA glycosylase isoform X31.6e-18586.7Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEE+CVGK+IKKA+IADD KVIDGVSPSDFEAS++GKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAI+IKGVAVTNYKRS+V DD
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDP  VPPISKLGPDALLEPM LD FIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG
        +QSAATLSKESCA LHK IQEVIEKALEVGADSS+FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL GAEPK QNSKRK+   K M+ 
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG

Query:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDE-DDDDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESEKSSKQRVPSSRNGR
        EGVGELVSKTKETADT DTKKKPKP GRSKK  TKRKSKS E D+ DEE ENDDA   DDD H  GKKK GKK N+G++RDA+E +KS KQ V S  NGR
Subjt:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDE-DDDDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESEKSSKQRVPSSRNGR

Query:  QRKKAK
        QRKKAK
Subjt:  QRKKAK

A0A6J1GCA1 formamidopyrimidine-DNA glycosylase isoform X12.9e-19589.24Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEE+CVGKVIKKAVIADDSKVIDG+SPSDFEAS+LGKTILSAHRKGKH+W+RLDSPPFP FHFGMAGAIYIKGVAVTNYKRSVVN+D
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPA VPPISKLGPDALLEPMALD+F+ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG
        NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+ + K MN 
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG

Query:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESE---KSSKQRVPSSR
        EGVGE VSKTK+TADT DTK K KPKG SKKPSTKRKSK +EDD  DEEAENDDASD++D+SH  GK K GK+ NVG+M DA+ESE   K SKQ VPSSR
Subjt:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESE---KSSKQRVPSSR

Query:  NGRQRKKAK
        +GRQRKKAK
Subjt:  NGRQRKKAK

A0A6J1KCR6 formamidopyrimidine-DNA glycosylase isoform X17.6e-19689.66Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEE+CVGKVIKKAVIADDSKVIDG+SPSDFEAS+LGKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRSVVN+D
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPA VPPISKLGPDALLEPMALD+FIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG
        NQSAATLSKESCAALHKSIQ+VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRK+ + K MN 
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNG

Query:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESEKSSKQRVPSSRNGR
        EGVGELVSKTK+TADT DTK K KPKG SKKPSTKRKSK +EDD  DEEAENDDASD++D+SH  GK K GK+ NVG+M +A+ SEK SKQ VPSSR+GR
Subjt:  EGVGELVSKTKETADTADTKKKPKPKGRSKKPSTKRKSKSDEDD-DDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESEKSSKQRVPSSRNGR

Query:  QRKKAK
        QRKK K
Subjt:  QRKKAK

SwissProt top hitse value%identityAlignment
A5D0T6 Formamidopyrimidine-DNA glycosylase2.1e-2530.48Show/hide
Query:  MPELPEVEAARRAIEENCVG-KVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVND
        MPELPEVE  RR ++    G K+    V+    KVI     S+F+ +I  K IL   R+GK+L + L      A H  M G             R V   
Subjt:  MPELPEVEAARRAIEENCVG-KVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVND

Query:  DDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPAL--VPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQAR
          + P++++     L +G  L F D R+F ++ L+   AL  +  I +LG + L E    +   + L ++   IK LLLDQ++I+G+GN  ADE L++AR
Subjt:  DDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPAL--VPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQAR

Query:  IHPNQSAATLSKESCAALHKSIQEVIEKALE-----------VGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL
        I+P + A TL+    A L+++I++++++ +E               +  +      ++RE KP      G +I     GGR+S + P  QK+
Subjt:  IHPNQSAATLSKESCAALHKSIQEVIEKALE-----------VGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL

A5UUN1 Formamidopyrimidine-DNA glycosylase1.6e-2533.19Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEV+ A  ++    VG  I +    D +++++  SP +F   + G+ +    R+ K + L LD     A H  M+G++              V   
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        D  P K++   + LDDG  + F D R+F +  LL         +  G + L     ++   E L  +K AIK LLLDQ+ I+GIGN  ADE L++ARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSR
         + A+ LS +  AALH  I+  + +AL  G  + R
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSR

A9B0X2 Formamidopyrimidine-DNA glycosylase1.3e-2732.89Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVE  RR++E+  VG+           K++D  SP  F  +I  + I    R+ K+L + LD+      H  M G + +                
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DE   +++   V LD+G +L F D R+F +  L+    +     +LGP+ L +   LD+F + L +K   IK  LLDQS ++G+GN  ADE L+ A+IHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALE
         +SA +L+    A L ++I+ V+  ++E
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALE

O80358 Formamidopyrimidine-DNA glycosylase7.7e-12965.98Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEENC+GK IK+ +IADD+KVI G+SPSDF+ SILGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P  V PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE----PKNQNSKRKVKDSK
         Q+A++LSKE C ALH SI+EVIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G +     K + +KR VK  K
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE----PKNQNSKRKVKDSK

Query:  HMNGEGVGELVSKTKETADTADTKKKPKPK-GRSKKPSTKRKS-KSDEDDDDEEAEND-------------DASDEDDDSHGAGKKKVGKK
          +G+G  E   +T++  ++A +KK  KP+ GR KKP++K K+ +SD+D DD EAE +                 E+  +  AGKK  G+K
Subjt:  HMNGEGVGELVSKTKETADTADTKKKPKPK-GRSKKPSTKRKS-KSDEDDDDEEAEND-------------DASDEDDDSHGAGKKKVGKK

Q03GC2 Formamidopyrimidine-DNA glycosylase1.9e-2631.96Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVE  RR +     GK++   V+     V        F   + GK IL+  R+GK+L +          H  M G            K SVV+  
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLK--DPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI
        +E+  K+     ELDDG DL + D R+F ++ L+   +   V  +  +GP+   E + L+     L  +K  +K+ LLDQS I+G+GN  ADEVL+ ++I
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLK--DPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI

Query:  HPNQSAATLSKESCAALHKSIQEVIEKALE---------VGAD--SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL
        HP Q + TL+ E  A L +SI E ++ A+E         + AD  +  F N    + R+  P +    G  I  I    R + F P  Q L
Subjt:  HPNQSAATLSKESCAALHKSIQEVIEKALE---------VGAD--SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKL

Arabidopsis top hitse value%identityAlignment
AT1G52500.1 MUTM homolog-15.4e-10973.86Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEENC+GK IK+ +IADD+KVI G+SPSDF+ SILGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P  V PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFIT
         Q+A++LSKE C ALH SI+EVI+ A++V ADS  FP  W+FH R  KK GK  V+GK  H ++
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFIT

AT1G52500.2 MUTM homolog-15.5e-13065.98Show/hide
Query:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD
        MPELPEVEAARRAIEENC+GK IK+ +IADD+KVI G+SPSDF+ SILGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDD

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P  V PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE----PKNQNSKRKVKDSK
         Q+A++LSKE C ALH SI+EVIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G +     K + +KR VK  K
Subjt:  NQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE----PKNQNSKRKVKDSK

Query:  HMNGEGVGELVSKTKETADTADTKKKPKPK-GRSKKPSTKRKS-KSDEDDDDEEAEND-------------DASDEDDDSHGAGKKKVGKK
          +G+G  E   +T++  ++A +KK  KP+ GR KKP++K K+ +SD+D DD EAE +                 E+  +  AGKK  G+K
Subjt:  HMNGEGVGELVSKTKETADTADTKKKPKPK-GRSKKPSTKRKS-KSDEDDDDEEAEND-------------DASDEDDDSHGAGKKKVGKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGAGTTACCTGAGGTGGAGGCGGCGAGGAGGGCCATAGAAGAGAATTGCGTCGGGAAAGTGATCAAGAAGGCCGTGATAGCCGACGATTCGAAGGTCATCGACGG
CGTATCGCCCTCCGACTTCGAGGCTTCGATCTTAGGCAAAACCATTCTCTCCGCCCATCGCAAGGGCAAGCACCTCTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTCGGAATGGCGGGTGCCATATATATCAAGGGTGTAGCTGTTACAAACTATAAAAGGTCTGTGGTTAATGATGATGATGAGTGGCCTTCCAAGTACTCTAAGTTC
TTTGTTGAGCTTGACGATGGTGTAGACCTATCTTTCACAGACAAAAGACGGTTTGCAAAAGTCTGCCTGCTCAAAGATCCAGCTTTAGTGCCTCCAATATCTAAGCTTGG
CCCAGATGCTCTCTTAGAGCCTATGGCACTGGATGAGTTTATTGAATCCTTGGGCAAGAAGAAACTGGCTATTAAGACTCTATTGCTTGATCAGAGCTATATTTCGGGAA
TTGGCAATTGGGTTGCAGATGAAGTGCTATATCAAGCGAGAATTCATCCAAATCAAAGTGCTGCAACCCTATCCAAAGAAAGTTGTGCAGCTTTGCACAAGAGCATACAA
GAGGTAATTGAAAAAGCACTTGAAGTTGGAGCAGATAGTAGTCGGTTCCCTAATAATTGGATTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAA
GGAAATCCATTTCATCACTACAGGCGGCAGGACATCGGCATTCGTACCCGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAATTCTAAGCGAAAAGTCAAAG
ATAGCAAACACATGAATGGTGAGGGTGTTGGTGAACTAGTGAGCAAGACAAAGGAAACTGCAGATACAGCTGATACAAAGAAAAAGCCAAAGCCTAAAGGTCGCTCTAAG
AAGCCTTCAACAAAAAGAAAATCCAAAAGTGACGAGGACGATGATGATGAAGAAGCTGAAAACGATGATGCCAGTGACGAAGACGACGACAGTCACGGTGCTGGAAAAAA
GAAAGTAGGAAAGAAAATGAATGTTGGGCAAATGCGTGATGCTACTGAATCGGAGAAGTCTTCGAAGCAAAGGGTTCCGAGCAGTCGAAATGGTAGGCAGAGGAAGAAAG
CAAAGTAA
mRNA sequenceShow/hide mRNA sequence
GTTACAATTCACTCCTCTCCTCTCCCCCGCTCTCGGTTCGTAGCGCCAGCGGTAAACCCCTCTGTGAGTATCACTGTTCAGTCCTACTTTTTCCTCATAAAAGCGTCGTC
GTCCTCCGGCGACACCAGCAAGAGAAAGGTGCGCCAAAGCTTCGTATCTGTTTTCCGACCACCGCACGATGCCGGAGTTACCTGAGGTGGAGGCGGCGAGGAGGGCCATA
GAAGAGAATTGCGTCGGGAAAGTGATCAAGAAGGCCGTGATAGCCGACGATTCGAAGGTCATCGACGGCGTATCGCCCTCCGACTTCGAGGCTTCGATCTTAGGCAAAAC
CATTCTCTCCGCCCATCGCAAGGGCAAGCACCTCTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCATTTCACTTCGGAATGGCGGGTGCCATATATATCAAGGGTGTAG
CTGTTACAAACTATAAAAGGTCTGTGGTTAATGATGATGATGAGTGGCCTTCCAAGTACTCTAAGTTCTTTGTTGAGCTTGACGATGGTGTAGACCTATCTTTCACAGAC
AAAAGACGGTTTGCAAAAGTCTGCCTGCTCAAAGATCCAGCTTTAGTGCCTCCAATATCTAAGCTTGGCCCAGATGCTCTCTTAGAGCCTATGGCACTGGATGAGTTTAT
TGAATCCTTGGGCAAGAAGAAACTGGCTATTAAGACTCTATTGCTTGATCAGAGCTATATTTCGGGAATTGGCAATTGGGTTGCAGATGAAGTGCTATATCAAGCGAGAA
TTCATCCAAATCAAAGTGCTGCAACCCTATCCAAAGAAAGTTGTGCAGCTTTGCACAAGAGCATACAAGAGGTAATTGAAAAAGCACTTGAAGTTGGAGCAGATAGTAGT
CGGTTCCCTAATAATTGGATTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAAGGAAATCCATTTCATCACTACAGGCGGCAGGACATCGGCATT
CGTACCCGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAATTCTAAGCGAAAAGTCAAAGATAGCAAACACATGAATGGTGAGGGTGTTGGTGAACTAGTGA
GCAAGACAAAGGAAACTGCAGATACAGCTGATACAAAGAAAAAGCCAAAGCCTAAAGGTCGCTCTAAGAAGCCTTCAACAAAAAGAAAATCCAAAAGTGACGAGGACGAT
GATGATGAAGAAGCTGAAAACGATGATGCCAGTGACGAAGACGACGACAGTCACGGTGCTGGAAAAAAGAAAGTAGGAAAGAAAATGAATGTTGGGCAAATGCGTGATGC
TACTGAATCGGAGAAGTCTTCGAAGCAAAGGGTTCCGAGCAGTCGAAATGGTAGGCAGAGGAAGAAAGCAAAGTAAGTTTATGTCCTAACAATCGTATGTAGTGTTTTAC
TTTTTTGTTTAATTATGAGAACGACTGCCTGCAGTAGAACTTGCCCTTTTATCATTACTGTTAACCCCAACTGTAGATGTGGGCATGGCTTTTGTATCTTCTGTTCTGTT
CCTCATCATTTTGAAGAAGGTTGATAAGATTAAGAAATGGCAAATCTACCCTTCTGCTTGGTTCGACTCGTTTTCTT
Protein sequenceShow/hide protein sequence
MPELPEVEAARRAIEENCVGKVIKKAVIADDSKVIDGVSPSDFEASILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSVVNDDDEWPSKYSKF
FVELDDGVDLSFTDKRRFAKVCLLKDPALVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQ
EVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKVKDSKHMNGEGVGELVSKTKETADTADTKKKPKPKGRSK
KPSTKRKSKSDEDDDDEEAENDDASDEDDDSHGAGKKKVGKKMNVGQMRDATESEKSSKQRVPSSRNGRQRKKAK