CuGenDBv2

Sed0013008 (gene) of Chayote v1 genome

Gene ID: Sed0013008
Organism: Sechium edule (Chayote v1)
Description: formamidopyrimidine-DNA glycosylase isoform X1
Genome location: LG05:27774614..27780986
RNA-Seq Expression: Sed0013008
Synteny: Sed0013008
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008534 - oxidized purine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0016829 - lyase activity (molecular function)
InterPro domains:
IPR010979 - Ribosomal protein S13-like, H2TH
IPR012319 - Formamidopyrimidine-DNA glycosylase, catalytic domain
IPR015886 - DNA glycosylase/AP lyase, H2TH DNA-binding
IPR020629 - Formamidopyrimidine-DNA glycosylase
IPR035937 - MutM-like, N-terminal
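
The genome location field above follows a `SEQID:START..END` pattern. As a minimal sketch of how such a string could be read programmatically, the Python below splits it into its parts and computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG05:27774614..27780986")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 6373 bp on LG05; with half-open coordinates the figure would be one base shorter.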


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6606816.1 Formamidopyrimidine-DNA glycosylase, partial [Cucurbita argyrosperma subsp. sororia]; e-value: 1.5e-185; identity: 85.26%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVID +SPSDFEA+LLGKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG+DLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALD+FIESLGKKKLAIKTLLLDQS+ISGIGNWVADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND
        NQSAATLSKESC ALHKSIQEVIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG EPKNQNSKRK+N  K++ND
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND

Query:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELE------KSSKQTVS
        E VGE VSKTKKT DT D  TK KPKG SKKPSTKRKSK  +D+ SDEEAE+D+A D++D+ H  GK K GK+TNVG++ DASE E      K SKQTV 
Subjt:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELE------KSSKQTVS

Query:  SSRNSRQ
        SSR+  Q
Subjt:  SSRNSRQ

KAG7036522.1 Formamidopyrimidine-DNA glycosylase, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value: 2.5e-185; identity: 82.52%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVID +SPSDFEA+LLGKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKD--------------------PASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSY
        +EWPSKYSKFFVELDDG+DLSFTDKRRFAKVCLLKD                    PASVPPISKLGPDALLEPMALD+FIESLGKKKLAIKTLLLDQS+
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKD--------------------PASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSY

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
        ISGIGNWVADEVLYQARIHPNQSAATLSKESC ALHKSIQEVIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG

Query:  PEPKNQNSKRKVNNSKQINDEDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMC
         EPKNQNSKRK+N  K++NDE VGE VSKTKKT DT D  TK KPKG SKKPSTKRKSK  +D+ SDEEAE+D+A D++D+ H  GK K GK+TNVG+M 
Subjt:  PEPKNQNSKRKVNNSKQINDEDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMC

Query:  DASELE---KSSKQTVSSSRNSRQRKKAK
        DASE E   K SKQTV SSR+ RQRKKAK
Subjt:  DASELE---KSSKQTVSSSRNSRQRKKAK

XP_022949541.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita moschata]; e-value: 6.4e-189; identity: 86.31%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVID +SPSDFEA+LLGKTILSAHRKGKH+W+RLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG+DLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALD+F+ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND
        NQSAATLSKESC ALHKSIQEVIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG EPKNQNSKRK+N  K++ND
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND

Query:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELE---KSSKQTVSSSR
        E VGE VSKTKKT DT D  TK KPKG SKKPSTKRKSK  +D+ SDEEAE+D+A D++D+ H  GK K GK+TNVG+M DASE E   K SKQTV SSR
Subjt:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELE---KSSKQTVSSSR

Query:  NSRQRKKAK
        + RQRKKAK
Subjt:  NSRQRKKAK

XP_022998520.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita maxima]; e-value: 1.7e-189; identity: 86.7%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVID +SPSDFEA+LLGKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG+DLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALD+FIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND
        NQSAATLSKESC ALHKSIQ+VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG EPKNQNSKRK+N  K++ND
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND

Query:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELEKSSKQTVSSSRNSR
        E VGELVSKTKKT DT D  TK KPKG SKKPSTKRKSK  +D+ SDEEAE+D+A D++D+ H  GK K GK+TNVG+M +AS  EK SKQTV SSR+ R
Subjt:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELEKSSKQTVSSSRNSR

Query:  QRKKAK
        QRKK K
Subjt:  QRKKAK

XP_023523304.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita pepo subsp. pepo]; e-value: 1.2e-190; identity: 86.95%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEEHCVGKVIKKA+IADDSKVID +SPSDFEA+LLGKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG+DLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALD+FIESLGKKKLAIKTLLLDQSYISGIGNW+ADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND
        NQSAATLSKESC ALHKSIQEVIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG EPKNQNSKRK+N  K++ND
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND

Query:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELEKSSKQTVSSSRNSR
        E VGELVSKTKKT DT D  TK KPKG SKKPSTKRKSK  +D+ SDEEAE+D+A D++D+ H  GK K GK+TNVG+M DASE EK SKQTV SSR+ +
Subjt:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELEKSSKQTVSSSRNSR

Query:  QRKKAK
        QRKKAK
Subjt:  QRKKAK

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KWY6 FPG_CAT domain-containing protein; e-value: 6.3e-182; identity: 86.91%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADD+KVID VSPSDFEA+LLGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRS+VNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG+DLSFTDKRRFAKV LL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND
        NQSAATLSKESC ALHKSIQEVIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG EPKNQNSKRK N++K++ND
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND

Query:  EDVGELVSKTKKTEDTGDTKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCD-ASELEKSSKQTVSSSRNSRQ
        E  GELVSKTKKT D    KPKPKGRSKKPS KRKSKS  D+ SDEEAE+D+A D DD+    GKKKVG KTN+GQ  D ASE +KS KQTV SS+  R+
Subjt:  EDVGELVSKTKKTEDTGDTKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCD-ASELEKSSKQTVSSSRNSRQ

Query:  RKKAK
        RKKAK
Subjt:  RKKAK

A0A1S3BY51 formamidopyrimidine-DNA glycosylase isoform X1; e-value: 1.9e-183; identity: 86.91%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADD+KVID VSPSDFEA+LLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRS+VNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG+DLSFTDKRRFAKV LL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND
        NQSAATLSKESC ALHKSIQEVIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG EPKNQNSKRK N++K++ND
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND

Query:  EDVGELVSKTKKTEDTGDTKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCD-ASELEKSSKQTVSSSRNSRQ
        E  GELVSKT+KT D    KPKPKGRSKKPS KRKSKS  ++ SDEEAE+D+A D DD+    G KK+GKKTN+GQ  D ASE EKS KQTV SSRN R+
Subjt:  EDVGELVSKTKKTEDTGDTKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCD-ASELEKSSKQTVSSSRNSRQ

Query:  RKKAK
        RKKAK
Subjt:  RKKAK

A0A5D3E227 Formamidopyrimidine-DNA glycosylase isoform X1; e-value: 1.9e-183; identity: 86.91%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADD+KVID VSPSDFEA+LLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRS+VNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG+DLSFTDKRRFAKV LL+DPASVPPISKLGPDALLEPMALDEFIESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND
        NQSAATLSKESC ALHKSIQEVIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG EPKNQNSKRK N++K++ND
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND

Query:  EDVGELVSKTKKTEDTGDTKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCD-ASELEKSSKQTVSSSRNSRQ
        E  GELVSKT+KT D    KPKPKGRSKKPS KRKSKS  ++ SDEEAE+D+A D DD+    G KK+GKKTN+GQ  D ASE EKS KQTV SSRN R+
Subjt:  EDVGELVSKTKKTEDTGDTKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCD-ASELEKSSKQTVSSSRNSRQ

Query:  RKKAK
        RKKAK
Subjt:  RKKAK

A0A6J1GCA1 formamidopyrimidine-DNA glycosylase isoform X1; e-value: 3.1e-189; identity: 86.31%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVID +SPSDFEA+LLGKTILSAHRKGKH+W+RLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG+DLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALD+F+ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND
        NQSAATLSKESC ALHKSIQEVIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG EPKNQNSKRK+N  K++ND
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND

Query:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELE---KSSKQTVSSSR
        E VGE VSKTKKT DT D  TK KPKG SKKPSTKRKSK  +D+ SDEEAE+D+A D++D+ H  GK K GK+TNVG+M DASE E   K SKQTV SSR
Subjt:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELE---KSSKQTVSSSR

Query:  NSRQRKKAK
        + RQRKKAK
Subjt:  NSRQRKKAK

A0A6J1KCR6 formamidopyrimidine-DNA glycosylase isoform X1; e-value: 8.1e-190; identity: 86.7%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVID +SPSDFEA+LLGKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG+DLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALD+FIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND
        NQSAATLSKESC ALHKSIQ+VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG EPKNQNSKRK+N  K++ND
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQIND

Query:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELEKSSKQTVSSSRNSR
        E VGELVSKTKKT DT D  TK KPKG SKKPSTKRKSK  +D+ SDEEAE+D+A D++D+ H  GK K GK+TNVG+M +AS  EK SKQTV SSR+ R
Subjt:  EDVGELVSKTKKTEDTGD--TKPKPKGRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELEKSSKQTVSSSRNSR

Query:  QRKKAK
        QRKK K
Subjt:  QRKKAK

SwissProt top hits (e-value, % identity, alignment)
A9B0X2 Formamidopyrimidine-DNA glycosylase; e-value: 1.9e-26; identity: 32.02%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVE  RR++E+  VG+           K++D  SP  F  A+  + I    R+ K+L + LD+      H  M G + +                
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +E   +++   V LD+G +L F D R+F +  L+          +LGP+ L +   LD+F + L +K   IK  LLDQS ++G+GN  ADE L+ A+IHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALE
         +SA +L+      L ++I+ V+  ++E
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALE

B0TER7 Formamidopyrimidine-DNA glycosylase; e-value: 1.7e-27; identity: 31.72%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVE  RR++     G  I+K  +    K+  A+  + F  AL G+ I+   R+GK+L L LD       H  M G +         + R    ++
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASV--PPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI
         E    ++ FF  LDDG  L +TD R+F  + L+   A++  P   +LGP+ L +  +  +F  +L K+K  +K LLLDQS+++G+GN  ADE L +AR+
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASV--PPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI

Query:  HPNQSAATLSKESCEALHKSIQEVIEKALEVGADSSR-----------FPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQK
        HP+++A +L  E    L+  I+ V+++ ++    S R           F      + R   P +    G EI      GR++ F P  QK
Subjt:  HPNQSAATLSKESCEALHKSIQEVIEKALEVGADSSR-----------FPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQK

O34403 Formamidopyrimidine-DNA glycosylase; e-value: 9.4e-26; identity: 30.1%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVE  RR +     GK IK   I   + +     P +F   L G+TI S  R+GK L   LD       H+ M   + ++G       +  ++  
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI
        EE   K+      + DG  L + D R+F  + L K  + A   P+S+LGP+   E        + L K   A+KT LLDQ  + G+GN   DE L++A +
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI

Query:  HPNQSAATLSKESCEALHKSIQEVIEKALEVGADSSR-----------FPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQ
        HP   A  LS ++ + LH  I+  +++A++ G  + R           F      + ++ +P K    G  I  I  GGR + F  + Q
Subjt:  HPNQSAATLSKESCEALHKSIQEVIEKALEVGADSSR-----------FPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQ

O80358 Formamidopyrimidine-DNA glycosylase; e-value: 3.6e-126; identity: 65.61%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD+KVI  +SPSDF+ ++LGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        EEWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPE----PKNQNSKRKVNNSK
         Q+A++LSKE CEALH SI+EVIEKA+EV ADSS+FPS W+FH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G +     K + +KR V   +
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPE----PKNQNSKRKVNNSK

Query:  QINDEDVGELVSKTKKTEDTGDTK--PKPK-GRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKK
          +D D  E   +T+K +++  +K   KP+ GR KKP++K K++   D+  D EAE +           A K+K  +K
Subjt:  QINDEDVGELVSKTKKTEDTGDTK--PKPK-GRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKK

Q03GC2 Formamidopyrimidine-DNA glycosylase; e-value: 5.5e-26; identity: 34.91%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSD--FEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVN
        MPELPEVE  RR +     GK++   V+    +    VSP    F   L GK IL+  R+GK+L +          H  M G            K S+V+
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSD--FEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVN

Query:  DDEEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQA
          EE+  K+     ELDDG DL + D R+F ++ L+   +   V  +  +GP+   E + L+     L  +K  +K+ LLDQS I+G+GN  ADEVL+ +
Subjt:  DDEEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQA

Query:  RIHPNQSAATLSKESCEALHKSIQEVIEKALE
        +IHP Q + TL+ E    L +SI E ++ A+E
Subjt:  RIHPNQSAATLSKESCEALHKSIQEVIEKALE

Arabidopsis top hits (e-value, % identity, alignment)
AT1G52500.1 MUTM homolog-1; e-value: 3.1e-109; identity: 73.86%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD+KVI  +SPSDF+ ++LGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        EEWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSR-EKKPGKAFVDGKEIHFIT
         Q+A++LSKE CEALH SI+EVI+ A++V ADS  FP  WLFH R  KK GK  V+GK  H ++
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSR-EKKPGKAFVDGKEIHFIT

AT1G52500.2 MUTM homolog-1; e-value: 2.5e-127; identity: 65.61%
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD+KVI  +SPSDF+ ++LGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDD

Query:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        EEWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +DEF ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  EEWPSKYSKFFVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPE----PKNQNSKRKVNNSK
         Q+A++LSKE CEALH SI+EVIEKA+EV ADSS+FPS W+FH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G +     K + +KR V   +
Subjt:  NQSAATLSKESCEALHKSIQEVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPE----PKNQNSKRKVNNSK

Query:  QINDEDVGELVSKTKKTEDTGDTK--PKPK-GRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKK
          +D D  E   +T+K +++  +K   KP+ GR KKP++K K++   D+  D EAE +           A K+K  +K
Subjt:  QINDEDVGELVSKTKKTEDTGDTK--PKPK-GRSKKPSTKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKK


Sequences
CDS sequence
ATGCCGGAGTTGCCGGAGGTGGAGGCGGCGAGGAGGGCTATAGAAGAGCACTGCGTCGGGAAAGTGATCAAGAAGGCCGTCATCGCCGACGATTCCAAGGTCATCGACGC
CGTTTCGCCCTCCGACTTCGAGGCTGCGCTCTTAGGCAAGACCATCCTCTCCGCCCATCGCAAGGGCAAACACCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTCGGGATGGCAGGTGCCATATATATCAAGGGTGTAGCTGTCACAAACTATAAAAGGTCTATAGTTAATGATGATGAGGAATGGCCTTCCAAGTACTCTAAGTTC
TTTGTTGAGCTTGACGATGGCATAGACCTATCCTTCACAGACAAAAGGCGGTTTGCTAAAGTCTGCCTACTCAAAGATCCAGCTTCAGTGCCTCCAATATCTAAGCTTGG
CCCAGATGCTCTTTTAGAGCCTATGGCACTGGATGAGTTTATTGAATCCTTGGGCAAGAAGAAACTGGCTATTAAGACTCTATTGCTTGATCAGAGCTATATTTCGGGTA
TTGGCAATTGGGTTGCAGATGAAGTGCTATATCAAGCGAGAATTCATCCTAATCAAAGTGCTGCAACCCTATCCAAAGAAAGTTGTGAAGCTTTGCACAAGAGCATACAA
GAGGTGATTGAAAAAGCACTTGAAGTTGGAGCAGATAGTAGTCGGTTCCCTAGTAATTGGCTTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAA
GGAAATCCATTTCATCACTACAGGCGGAAGGACATCGGCCTTTGTACCCGAGTTGCAAAAGCTTACTGGACCCGAACCGAAAAATCAAAATTCAAAGAGAAAAGTTAACA
ATAGCAAACAAATAAATGATGAGGATGTTGGTGAACTAGTGAGCAAGACAAAGAAAACTGAAGATACAGGTGATACAAAGCCAAAGCCTAAAGGTCGCTCTAAGAAGCCT
TCAACAAAAAGAAAATCCAAGAGCGGCAAGGACAATGACTCTGATGAAGAAGCTGAAAGCGACGAAGCTGGTGACAACGACGATGACGATCACGGTGCTGGAAAGAAGAA
AGTAGGAAAGAAAACAAACGTTGGGCAAATGTGTGATGCTTCTGAATTGGAGAAGTCTTCGAAGCAAACGGTTTCGAGCAGTCGAAATAGTAGGCAGAGAAAGAAAGCAA
AGTAA
mRNA sequence
CAGCAGCAATCCCTGGGTCAGTTTCACTGACCAGTCTACCTTTTCCTCGTAAAAAGCTTTGCATCTGTTTATCCGACCACCGCACGATGCCGGAGTTGCCGGAGGTGGAG
GCGGCGAGGAGGGCTATAGAAGAGCACTGCGTCGGGAAAGTGATCAAGAAGGCCGTCATCGCCGACGATTCCAAGGTCATCGACGCCGTTTCGCCCTCCGACTTCGAGGC
TGCGCTCTTAGGCAAGACCATCCTCTCCGCCCATCGCAAGGGCAAACACCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCATTTCACTTCGGGATGGCAGGTGCCA
TATATATCAAGGGTGTAGCTGTCACAAACTATAAAAGGTCTATAGTTAATGATGATGAGGAATGGCCTTCCAAGTACTCTAAGTTCTTTGTTGAGCTTGACGATGGCATA
GACCTATCCTTCACAGACAAAAGGCGGTTTGCTAAAGTCTGCCTACTCAAAGATCCAGCTTCAGTGCCTCCAATATCTAAGCTTGGCCCAGATGCTCTTTTAGAGCCTAT
GGCACTGGATGAGTTTATTGAATCCTTGGGCAAGAAGAAACTGGCTATTAAGACTCTATTGCTTGATCAGAGCTATATTTCGGGTATTGGCAATTGGGTTGCAGATGAAG
TGCTATATCAAGCGAGAATTCATCCTAATCAAAGTGCTGCAACCCTATCCAAAGAAAGTTGTGAAGCTTTGCACAAGAGCATACAAGAGGTGATTGAAAAAGCACTTGAA
GTTGGAGCAGATAGTAGTCGGTTCCCTAGTAATTGGCTTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAAGGAAATCCATTTCATCACTACAGG
CGGAAGGACATCGGCCTTTGTACCCGAGTTGCAAAAGCTTACTGGACCCGAACCGAAAAATCAAAATTCAAAGAGAAAAGTTAACAATAGCAAACAAATAAATGATGAGG
ATGTTGGTGAACTAGTGAGCAAGACAAAGAAAACTGAAGATACAGGTGATACAAAGCCAAAGCCTAAAGGTCGCTCTAAGAAGCCTTCAACAAAAAGAAAATCCAAGAGC
GGCAAGGACAATGACTCTGATGAAGAAGCTGAAAGCGACGAAGCTGGTGACAACGACGATGACGATCACGGTGCTGGAAAGAAGAAAGTAGGAAAGAAAACAAACGTTGG
GCAAATGTGTGATGCTTCTGAATTGGAGAAGTCTTCGAAGCAAACGGTTTCGAGCAGTCGAAATAGTAGGCAGAGAAAGAAAGCAAAGTAAGTTATGTCCTAACGAGCAA
CAGTCATATGTAGTTTTTTGGTTAATTGTTAGAACATCTGCCTACTGTAGAATTTGCATTCTTGTCATTACTGTCAATCCCCCAACTGTAGACGTGGGCATGGCTTTTGT
ATCATCTGTTCTGTTGCACATCCTTTTGAAGAATGTTGATTAAGGTTCTGAAAAGTCAAACCTACCCTTCTTCCCAGTTCGGTTTGATTTTCTTTTCTAAATGCATTCAT
TTTTCTGTGTACCTCAACCGTTTTTGTCTTTGTTGTTCAGACCTGCAAC
Protein sequence
MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDAVSPSDFEAALLGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSIVNDDEEWPSKYSKF
FVELDDGIDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDEFIESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSAATLSKESCEALHKSIQ
EVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGPEPKNQNSKRKVNNSKQINDEDVGELVSKTKKTEDTGDTKPKPKGRSKKP
STKRKSKSGKDNDSDEEAESDEAGDNDDDDHGAGKKKVGKKTNVGQMCDASELEKSSKQTVSSSRNSRQRKKAK
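
The CDS and protein sequences listed for this gene should correspond through translation. As a rough consistency check, the Python sketch below translates a coding sequence codon by codon and stops at the first stop codon. It assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly, and the function name `translate` is hypothetical. Only short fragments copied from the start and end of the CDS are used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard genetic code (assumed), written in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than A, C, G, T raise a KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 13 codons and last 7 codons of the CDS listed above.
print(translate("ATGCCGGAGTTGCCGGAGGTGGAGGCGGCGAGGAGGGCT"))
print(translate("CAGAGAAAGAAAGCAAAGTAA"))
```

The two fragments translate to the first 13 residues (MPELPEVEAARRA) and the last 6 residues (QRKKAK) of the protein sequence shown above.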