; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10002183 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10002183
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionFormamidopyrimidine-DNA glycosylase isoform X2
Genome locationChr11:4256222..4260802
RNA-Seq ExpressionHG10002183
SyntenyHG10002183
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003684 - damaged DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0019104 - DNA N-glycosylase activity (molecular function)
InterPro domainsIPR010979 - Ribosomal protein S13-like, H2TH
IPR012319 - Formamidopyrimidine-DNA glycosylase, catalytic domain
IPR015886 - DNA glycosylase/AP lyase, H2TH DNA-binding
IPR035937 - MutM-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044473.1 formamidopyrimidine-DNA glycosylase isoform X2 [Cucumis melo var. makuwa]1.3e-16067.22Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
        NQSAATLSKESCAALHKSI+EV                                                  LK  +++     +F +   F F    G 
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK
            VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MNDEG G+LVSKT+KTAD K
Subjt:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK

Query:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK
        +KPKPKG                                    G KK+GKKTNIGQ  DAASE EKSLKQTVQSS+NG  RKKAK
Subjt:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK

XP_004152179.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucumis sativus]1.1e-15164.74Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
        NQSAATLSKESCAALHKSI+E                                                                               
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK
            VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MNDE  G+LVSKT KTAD K
Subjt:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK

Query:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK
        +KPKPKG                                    G KKVG KTNIGQ  DAASE +KSLKQTV+SS+ G  RKKAK
Subjt:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK

XP_008454182.1 PREDICTED: formamidopyrimidine-DNA glycosylase isoform X1 [Cucumis melo]9.9e-15665.57Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
        NQSAATLSKESCAALHKSI+E                                                                               
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK
            VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MNDEG G+LVSKT+KTAD K
Subjt:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK

Query:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK
        +KPKPKG                                    G KK+GKKTNIGQ  DAASE EKSLKQTVQSS+NG  RKKAK
Subjt:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK

XP_038877190.1 formamidopyrimidine-DNA glycosylase isoform X1 [Benincasa hispida]3.6e-15062.92Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSP+DFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQ                      
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
                   C    + IR                    ++V    P +     ++ +      RGTTCRSLK+QLKLM SPTAFLKNGCFIFDGEKGL
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSM-----------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND
        DRSM                 VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE KNQ SKRKGN+SKEMND
Subjt:  DRSM-----------------VIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMND

Query:  EGAGKLVSKTDKTADT----KKKPKPKG------------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNG
        EGA +LVSKT+KTADT    KK+PKPKG                                    VG KKVGK TN G+M++AASE EKSLKQTV SS++G
Subjt:  EGAGKLVSKTDKTADT----KKKPKPKG------------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNG

Query:  MGRKKAK
          RKKAK
Subjt:  MGRKKAK

XP_038877199.1 formamidopyrimidine-DNA glycosylase isoform X3 [Benincasa hispida]2.7e-15365.1Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSP+DFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
        NQSAATLSKESCAALHKSI+E                                                                               
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADT-
            VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAE KNQ SKRKGN+SKEMNDEGA +LVSKT+KTADT 
Subjt:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADT-

Query:  ---KKKPKPKG------------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK
           KK+PKPKG                                    VG KKVGK TN G+M++AASE EKSLKQTV SS++G  RKKAK
Subjt:  ---KKKPKPKG------------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK

TrEMBL top hitse value%identityAlignment
A0A0A0KWY6 FPG_CAT domain-containing protein5.5e-15264.74Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWL LDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
        NQSAATLSKESCAALHKSI+E                                                                               
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK
            VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MNDE  G+LVSKT KTAD K
Subjt:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK

Query:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK
        +KPKPKG                                    G KKVG KTNIGQ  DAASE +KSLKQTV+SS+ G  RKKAK
Subjt:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK

A0A1S3BY09 formamidopyrimidine-DNA glycosylase isoform X28.4e-14562.35Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
        NQSAATLSKESCAALHKSI+E                                                                               
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSMVIEKALEVGADSSRFPSNWLFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADT
            V+++A+EV A+S+ FP  WLFH R  K+PG+  V+GKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MNDEG G+LVSKT+KTAD 
Subjt:  DRSMVIEKALEVGADSSRFPSNWLFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADT

Query:  KKKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK
        K+KPKPKG                                    G KK+GKKTNIGQ  DAASE EKSLKQTVQSS+NG  RKKAK
Subjt:  KKKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK

A0A1S3BY51 formamidopyrimidine-DNA glycosylase isoform X14.8e-15665.57Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
        NQSAATLSKESCAALHKSI+E                                                                               
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK
            VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MNDEG G+LVSKT+KTAD K
Subjt:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK

Query:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK
        +KPKPKG                                    G KK+GKKTNIGQ  DAASE EKSLKQTVQSS+NG  RKKAK
Subjt:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK

A0A5A7TLT5 Formamidopyrimidine-DNA glycosylase isoform X26.4e-16167.22Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
        NQSAATLSKESCAALHKSI+EV                                                  LK  +++     +F +   F F    G 
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK
            VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MNDEG G+LVSKT+KTAD K
Subjt:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK

Query:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK
        +KPKPKG                                    G KK+GKKTNIGQ  DAASE EKSLKQTVQSS+NG  RKKAK
Subjt:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK

A0A5D3E227 Formamidopyrimidine-DNA glycosylase isoform X14.8e-15665.57Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEEHC+GKVIKKAVIADDTKVIDGVSPSDFE S+LGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVE                       PASVPPISKLGPDALLEPMALD+FIES+ KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
        NQSAATLSKESCAALHKSI+E                                                                               
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK
            VIEKALEVGADSSRFP+NW+FHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQ SKRKGND+K+MNDEG G+LVSKT+KTAD K
Subjt:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTK

Query:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK
        +KPKPKG                                    G KK+GKKTNIGQ  DAASE EKSLKQTVQSS+NG  RKKAK
Subjt:  KKPKPKG-----------------------------------VGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK

SwissProt top hitse value%identityAlignment
A0PQ49 Formamidopyrimidine-DNA glycosylase4.0e-1933.18Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKV-IDGVSPSDFENSILGKTILSAHRKGKHLWLRLDS--------PPFPAF--HFGMAGAIYIKGVAV
        MPELPEVE  RR +++H VGK +    +     V      P+D    +LG  I    R+GK+LWL LD+         P  A   H GM+G + + GV  
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKV-IDGVSPSDFENSILGKTILSAHRKGKHLWLRLDS--------PPFPAF--HFGMAGAIYIKGVAV

Query:  TNYKR-SMVNDDDEWPSKYSK-----------FFVEPASVP-PISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIH
          + R S V DD    S   +             V+ + VP P++ L  D L     +D  I+ +  K   +K  LLDQ  +SGIGN  ADE L++A++H
Subjt:  TNYKR-SMVNDDDEWPSKYSK-----------FFVEPASVP-PISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIH

Query:  PNQSAATLSKESCAALHKSIREV
          + AATL++    A+  +  +V
Subjt:  PNQSAATLSKESCAALHKSIREV

A9B0X2 Formamidopyrimidine-DNA glycosylase1.0e-2232.86Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKG------------VA
        MPELPEVE  RR++E+  VG+           K++D  SP  F  +I  + I    R+ K+L + LD+      H  M G + +              VA
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKG------------VA

Query:  VTNYKRSMVNDDDEWPSKYSKF-FVEPASVPPIS-KLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSAATLSK
        + N +    +D    P K+ ++  V+ + V  ++ +LGP+ L +   LDDF + + +K   IK  LLDQS ++G+GN  ADE L+ A+IHP +SA +L+ 
Subjt:  VTNYKRSMVNDDDEWPSKYSKF-FVEPASVPPIS-KLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSAATLSK

Query:  ESCAALHKSIREV
           A L ++I+ V
Subjt:  ESCAALHKSIREV

B2HJJ6 Formamidopyrimidine-DNA glycosylase3.1e-1933.63Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKV-IDGVSPSDFENSILGKTILSAHRKGKHLWLRLDS--------PPFPAF--HFGMAGAIYIKGVAV
        MPELPEVE  RR +++H VGK +    +     V      P+D    +LG  I    R+GK+LWL LD+         P  A   H GM+G + + GV  
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKV-IDGVSPSDFENSILGKTILSAHRKGKHLWLRLDS--------PPFPAF--HFGMAGAIYIKGVAV

Query:  TNYKR-SMVNDDDEWPSKYSK-----------FFVEPASVP-PISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIH
          + R S V DD    S   +             V+ + VP P++ L  D L     +D  I+ +  K   IK  LLDQ  +SGIGN  ADE L++A++H
Subjt:  TNYKR-SMVNDDDEWPSKYSK-----------FFVEPASVP-PISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIH

Query:  PNQSAATLSKESCAALHKSIREV
          + AATL++    A+  +  +V
Subjt:  PNQSAATLSKESCAALHKSIREV

O80358 Formamidopyrimidine-DNA glycosylase1.1e-10450.8Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ SILGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVE                       P SV PIS+LGPDALLEPM +D+F ES+ KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
         Q+A++LSKE C ALH SI+E                                                                               
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDS-KEMNDEGAG----KLVSKTDK
            VIEKA+EV ADSS+FPS W+FH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G + +     R      K   D+G G    +   K D+
Subjt:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDS-KEMNDEGAG----KLVSKTDK

Query:  TADTKKKPKPKGVGMKKVGKKTNIGQMVDAASESE
        +A +KK  KP+G   KK   KT   +  D   +SE
Subjt:  TADTKKKPKPKGVGMKKVGKKTNIGQMVDAASESE

Q8FP17 Formamidopyrimidine-DNA glycosylase2.4e-1933.49Show/hide
Query:  MPELPEVEAARRAIEEHCVGK-VIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPA-------FHFGMAGAIYIKGVAVT--
        MPELPEVE  RR +EEH VG+ ++  AV+   T        ++ E ++ G  + + +R+GK LWL LD     A        H GM+G + +K    T  
Subjt:  MPELPEVEAARRAIEEHCVGK-VIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPA-------FHFGMAGAIYIKGVAVT--

Query:  --NYKRSMVNDDDE-WPSKYSKF----FVEPASVPP--ISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSA
             R+ ++D +E W      F      E     P  +S +  D L + + +      +  K   IK LLL+Q  +SGIGN  ADE+L++A IHP Q A
Subjt:  --NYKRSMVNDDDE-WPSKYSKF----FVEPASVPP--ISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSA

Query:  ATLSKESCAALHKSIREV
        + +S     AL ++ REV
Subjt:  ATLSKESCAALHKSIREV

Arabidopsis top hitse value%identityAlignment
AT1G52500.1 MUTM homolog-11.7e-8469.82Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ SILGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVE                       P SV PIS+LGPDALLEPM +D+F ES+ KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREV
         Q+A++LSKE C ALH SI+EV
Subjt:  NQSAATLSKESCAALHKSIREV

AT1G52500.2 MUTM homolog-17.6e-10650.8Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD
        MPELPEVEAARRAIEE+C+GK IK+ +IADD KVI G+SPSDF+ SILGKTI+SA RKGK+LWL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V D 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDD

Query:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVE                       P SV PIS+LGPDALLEPM +D+F ES+ KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVE-----------------------PASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL
         Q+A++LSKE C ALH SI+E                                                                               
Subjt:  NQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRVRHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGL

Query:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDS-KEMNDEGAG----KLVSKTDK
            VIEKA+EV ADSS+FPS W+FH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G + +     R      K   D+G G    +   K D+
Subjt:  DRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQTSKRKGNDS-KEMNDEGAG----KLVSKTDK

Query:  TADTKKKPKPKGVGMKKVGKKTNIGQMVDAASESE
        +A +KK  KP+G   KK   KT   +  D   +SE
Subjt:  TADTKKKPKPKGVGMKKVGKKTNIGQMVDAASESE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGAGCTACCGGAGGTGGAGGCGGCGAGGAGAGCAATAGAAGAGCATTGCGTTGGGAAAGTGATTAAGAAGGCGGTGATAGCCGACGATACGAAGGTCATCGACGG
CGTATCGCCTTCCGATTTCGAGAATTCCATCTTAGGCAAAACCATTCTCTCTGCCCATCGTAAGGGCAAGCACCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTCGGGATGGCGGGTGCCATATATATTAAAGGTGTAGCTGTCACAAACTATAAAAGGTCTATGGTTAATGATGATGACGAGTGGCCTTCCAAGTACTCTAAGTTC
TTTGTTGAGCCAGCTTCAGTGCCCCCAATATCTAAGCTTGGCCCAGATGCTCTTCTAGAGCCTATGGCATTGGATGATTTTATTGAATCCATGGGCAAGAAGAAACTGGC
GATTAAGACTCTATTGCTTGATCAGAGCTACATTTCCGGTATTGGCAATTGGGTTGCAGATGAAGTGCTATATCAAGCAAGAATTCATCCAAATCAAAGTGCTGCAACCC
TATCAAAAGAAAGTTGTGCAGCTTTGCACAAAAGCATACGAGAGGTACGACTTTGCTTTTTTACCAACCCCTCCAAATCATCCCCAACCCTTGCTGCCCTAGTCAGAGTT
AGGCATGCTGTGCCCTTTAGAGATTTTTTTTTTTATAGCAAAACATTCATTCTTCATGGAAAATATAGAGGAACCACCTGTAGGTCCTTAAAAGAGCAGTTGAAGTTGAT
GCTGAGTCCAACAGCTTTCCTGAAGAATGGTTGTTTCATTTTCGATGGGGAAAAAGGCCTGGACAGGTCAATGGTAATTGAAAAAGCGCTTGAAGTTGGAGCAGATAGTA
GTCGGTTCCCTAGTAATTGGCTTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAAGGAAATCCACTTTATTACTACAGGCGGCAGGACGTCGGCC
TTTGTACCCGAGTTGCAAAAGCTTACTGGAGCTGAACCAAAAAATCAAACTTCAAAGAGAAAAGGCAATGATAGCAAAGAAATGAATGATGAGGGTGCTGGTAAATTAGT
GAGCAAGACAGATAAAACTGCTGATACTAAGAAAAAGCCAAAGCCTAAAGGTGTTGGAATGAAGAAAGTGGGAAAGAAAACGAACATTGGGCAAATGGTTGATGCTGCTT
CTGAATCAGAGAAGTCTTTGAAACAAACGGTTCAGAGCAGTAAAAATGGTATGGGGAGGAAGAAAGCAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGGAGCTACCGGAGGTGGAGGCGGCGAGGAGAGCAATAGAAGAGCATTGCGTTGGGAAAGTGATTAAGAAGGCGGTGATAGCCGACGATACGAAGGTCATCGACGG
CGTATCGCCTTCCGATTTCGAGAATTCCATCTTAGGCAAAACCATTCTCTCTGCCCATCGTAAGGGCAAGCACCTGTGGCTCCGCCTCGATTCTCCTCCTTTCCCTGCAT
TTCACTTCGGGATGGCGGGTGCCATATATATTAAAGGTGTAGCTGTCACAAACTATAAAAGGTCTATGGTTAATGATGATGACGAGTGGCCTTCCAAGTACTCTAAGTTC
TTTGTTGAGCCAGCTTCAGTGCCCCCAATATCTAAGCTTGGCCCAGATGCTCTTCTAGAGCCTATGGCATTGGATGATTTTATTGAATCCATGGGCAAGAAGAAACTGGC
GATTAAGACTCTATTGCTTGATCAGAGCTACATTTCCGGTATTGGCAATTGGGTTGCAGATGAAGTGCTATATCAAGCAAGAATTCATCCAAATCAAAGTGCTGCAACCC
TATCAAAAGAAAGTTGTGCAGCTTTGCACAAAAGCATACGAGAGGTACGACTTTGCTTTTTTACCAACCCCTCCAAATCATCCCCAACCCTTGCTGCCCTAGTCAGAGTT
AGGCATGCTGTGCCCTTTAGAGATTTTTTTTTTTATAGCAAAACATTCATTCTTCATGGAAAATATAGAGGAACCACCTGTAGGTCCTTAAAAGAGCAGTTGAAGTTGAT
GCTGAGTCCAACAGCTTTCCTGAAGAATGGTTGTTTCATTTTCGATGGGGAAAAAGGCCTGGACAGGTCAATGGTAATTGAAAAAGCGCTTGAAGTTGGAGCAGATAGTA
GTCGGTTCCCTAGTAATTGGCTTTTCCATTCACGTGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAAGGAAATCCACTTTATTACTACAGGCGGCAGGACGTCGGCC
TTTGTACCCGAGTTGCAAAAGCTTACTGGAGCTGAACCAAAAAATCAAACTTCAAAGAGAAAAGGCAATGATAGCAAAGAAATGAATGATGAGGGTGCTGGTAAATTAGT
GAGCAAGACAGATAAAACTGCTGATACTAAGAAAAAGCCAAAGCCTAAAGGTGTTGGAATGAAGAAAGTGGGAAAGAAAACGAACATTGGGCAAATGGTTGATGCTGCTT
CTGAATCAGAGAAGTCTTTGAAACAAACGGTTCAGAGCAGTAAAAATGGTATGGGGAGGAAGAAAGCAAAGTAA
Protein sequenceShow/hide protein sequence
MPELPEVEAARRAIEEHCVGKVIKKAVIADDTKVIDGVSPSDFENSILGKTILSAHRKGKHLWLRLDSPPFPAFHFGMAGAIYIKGVAVTNYKRSMVNDDDEWPSKYSKF
FVEPASVPPISKLGPDALLEPMALDDFIESMGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIREVRLCFFTNPSKSSPTLAALVRV
RHAVPFRDFFFYSKTFILHGKYRGTTCRSLKEQLKLMLSPTAFLKNGCFIFDGEKGLDRSMVIEKALEVGADSSRFPSNWLFHSREKKPGKAFVDGKEIHFITTGGRTSA
FVPELQKLTGAEPKNQTSKRKGNDSKEMNDEGAGKLVSKTDKTADTKKKPKPKGVGMKKVGKKTNIGQMVDAASESEKSLKQTVQSSKNGMGRKKAK