CuGenDBv2

CmoCh01G001680 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh01G001680
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: formamidopyrimidine-DNA glycosylase isoform X1
Genome location: Cmo_Chr01:725361..730651
RNA-Seq Expression: CmoCh01G001680
Synteny: CmoCh01G001680
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008534 - oxidized purine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0016829 - lyase activity (molecular function)
InterPro domains:
IPR010979 - Ribosomal protein S13-like, H2TH
IPR012319 - Formamidopyrimidine-DNA glycosylase, catalytic domain
IPR015886 - DNA glycosylase/AP lyase, H2TH DNA-binding
IPR020629 - Formamidopyrimidine-DNA glycosylase
IPR035937 - MutM-like, N-terminal
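The genome location listed for this gene (Cmo_Chr01:725361..730651) can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive, as is common for genome browser regions; the page itself does not state the convention. The names `Region` and `parse_region` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'seqid:start..end' string into a Region (hypothetical helper)."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"not a 'seqid:start..end' location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        # Normalise so that start <= end regardless of how the span was written.
        start, end = end, start
    return Region(seqid, start, end)


region = parse_region("Cmo_Chr01:725361..730651")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption the gene span is 5291 bp; a 0-based, half-open convention would give 5290 instead.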


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606816.1 Formamidopyrimidine-DNA glycosylase, partial [Cucurbita argyrosperma subsp. sororia] (e value: 9.0e-213; % identity: 90.87)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMW+RLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDF+ESLGKKKLAIKTLLLDQS+ISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQSAATLSKESCAALHKSIQE                               VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
        AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
Subjt:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK

Query:  AGKRTNVGRMHDASESE---KPSKPSKQTVPSSRSGRQ
        AGKRTNVGR+HDASESE   KPSKPSKQTVPSSRS  Q
Subjt:  AGKRTNVGRMHDASESE---KPSKPSKQTVPSSRSGRQ

KAG7036522.1 Formamidopyrimidine-DNA glycosylase, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 7.4e-215; % identity: 88.26)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMW+RLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD--------------------PASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSY
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD                    PASVPPISKLGPDALLEPMALDDF+ESLGKKKLAIKTLLLDQS+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD--------------------PASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSY

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKP
        ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQE                               VIEKALEVGADSSRFPNNWIFHSREKKP
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKP

Query:  GKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEE
        GKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEE
Subjt:  GKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEE

Query:  AENDDASDDEDNSHDIGKTKAGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
        AENDDASDDEDNSHDIGKTKAGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
Subjt:  AENDDASDDEDNSHDIGKTKAGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK

XP_022949541.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita moschata] (e value: 2.2e-219; % identity: 92.95)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQSAATLSKESCAALHKSIQE                               VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
        AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
Subjt:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK

Query:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
        AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
Subjt:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK

XP_022949542.1 formamidopyrimidine-DNA glycosylase isoform X2 [Cucurbita moschata] (e value: 6.5e-219; % identity: 92.5)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGK                                V+GKEIHFITTGGRTS
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
        AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
Subjt:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK

Query:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
        AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
Subjt:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK

XP_022998520.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita maxima] (e value: 9.0e-213; % identity: 90.91)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMW+RLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDF+ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQSAATLSKESCAALHKSIQ                               KVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
        AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGE VSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
Subjt:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK

Query:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
        AGKRTNVGRMH+AS SE   KPSKQTVPSSRSGRQRKK K
Subjt:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TLT5 Formamidopyrimidine-DNA glycosylase isoform X2 (e value: 6.8e-206; % identity: 86.82)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADD+KVIDG+SPSDFEASLLGKTILSAHRKGKH+W+RLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKV LL+DPASVPPISKLGPDALLEPMALD+F+ESL KKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQSAATLSKESCAALHKSIQEVLKRAV+VDAESN+FPEEWLFHFRWGK+PG+VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
        AFVPELQKLTGAEPKNQNSKRK N+ KKMNDEG GE VSKT+KTA   D K K KPKG SKKPS KRKSK  ++DGSDEEAENDDASDD DN    G  K
Subjt:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK

Query:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
         GK+TN+G+  DA  + +P K  KQTV SSR+GR+RKKAK
Subjt:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK

A0A6J1GCA1 formamidopyrimidine-DNA glycosylase isoform X1 (e value: 1.1e-219; % identity: 92.95)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQSAATLSKESCAALHKSIQE                               VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
        AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
Subjt:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK

Query:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
        AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
Subjt:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK

A0A6J1GCC5 formamidopyrimidine-DNA glycosylase isoform X2 (e value: 3.1e-219; % identity: 92.5)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGK                                V+GKEIHFITTGGRTS
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
        AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
Subjt:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK

Query:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
        AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
Subjt:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK

A0A6J1KAE4 formamidopyrimidine-DNA glycosylase isoform X2 (e value: 3.7e-212; % identity: 90.23)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMW+RLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDF+ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQSAATLSKESCAALHKSIQ+VLKRAVDVDAESNNFPEEWLFHFRWGKKPGK                                V+GKEIHFITTGGRTS
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
        AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGE VSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
Subjt:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK

Query:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
        AGKRTNVGRMH+AS SE   KPSKQTVPSSRSGRQRKK K
Subjt:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK

A0A6J1KCR6 formamidopyrimidine-DNA glycosylase isoform X1 (e value: 4.4e-213; % identity: 90.91)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMW+RLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDF+ESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
        NQSAATLSKESCAALHKSIQ                               KVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
        AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGE VSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK
Subjt:  AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTK

Query:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
        AGKRTNVGRMH+AS SE   KPSKQTVPSSRSGRQRKK K
Subjt:  AGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK

SwissProt top hits (e value, % identity, alignment)
A9B0X2 Formamidopyrimidine-DNA glycosylase (e value: 6.4e-28; % identity: 32.46)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVE  RR++E+  VG+           K++D  SP  F  ++  + I    R+ K++ + LD+      H  M G + +                
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        DE   +++   V LD+G +L F D R+F +  L+          +LGP+ L +   LDDF + L +K   IK  LLDQS ++G+GN  ADE L+ A+IHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVD
         +SA +L+    A L ++I+ VL+ +++
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVD

B0TER7 Formamidopyrimidine-DNA glycosylase (e value: 7.1e-27; % identity: 33.62)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVE  RR++     G  I+K  +    K+   +  + F  +L G+ I+   R+GK++ + LD       H  M G +         + R    E+
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASV--PPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI
         E    ++ FF  LDDG  L +TD R+F  + L+   A++  P   +LGP+ L +  +  DF  +L K+K  +K LLLDQS+++G+GN  ADE L +AR+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASV--PPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI

Query:  HPNQSAATLSKESCAALHKSIQEVLKRAVDVDAES
        HP+++A +L  E    L+  I+ VL+  +D    S
Subjt:  HPNQSAATLSKESCAALHKSIQEVLKRAVDVDAES

O34403 Formamidopyrimidine-DNA glycosylase (e value: 3.0e-25; % identity: 31.66)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVE  RR +     GK IK   I   + +     P +F   L G+TI S  R+GK +   LD       H+ M   + ++G       +  +++ 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI
        +E   K+      + DG  L + D R+F  + L K  + A   P+S+LGP+   E        + L K   A+KT LLDQ  + G+GN   DE L++A +
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARI

Query:  HPNQSAATLSKESCAALHKSIQEVLKRAVDVDAES-----NNFPEEWLF---HFRWGKK
        HP   A  LS ++   LH  I+  L+ A+D    +     N+  E  +F   HF +GKK
Subjt:  HPNQSAATLSKESCAALHKSIQEVLKRAVDVDAES-----NNFPEEWLF---HFRWGKK

O80358 Formamidopyrimidine-DNA glycosylase (e value: 8.7e-126; % identity: 62.53)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEE+C+GK IK+ +IADD+KVI GISPSDF+ S+LGKTI+SA RKGK++W+ LDSPPFP+F FGMAGAIYIKGVAVT YKRS V + 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +D+F ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
         Q+A++LSKE C ALH SI+E                               VIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKINEG-KKMNDEGVG-EPVSKTKKTADTTDTKTKSKPK-GPSKKPSTKRKSKINEDDGSDEEAEND
        A+VPELQKL G + +     R    G K   D+G G E   +T+K  ++  +K   KP+ G  KKP++K K++ ++DDG D EAE +
Subjt:  AFVPELQKLTGAEPKNQNSKRKINEG-KKMNDEGVG-EPVSKTKKTADTTDTKTKSKPK-GPSKKPSTKRKSKINEDDGSDEEAEND

Q03GC2 Formamidopyrimidine-DNA glycosylase (e value: 1.2e-26; % identity: 34.91)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSD--FEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVN
        MPELPEVE  RR +     GK++   V+     V    SP    F   L GK IL+  R+GK++ +          H  M G            K SVV+
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSD--FEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVN

Query:  EDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQA
          +E+  K+     ELDDG DL + D R+F ++ L+   +   V  +  +GP+   E + L+     L  +K  +K+ LLDQS I+G+GN  ADEVL+ +
Subjt:  EDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLK--DPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQA

Query:  RIHPNQSAATLSKESCAALHKSIQEVLKRAVD
        +IHP Q + TL+ E  A L +SI E L+ A++
Subjt:  RIHPNQSAATLSKESCAALHKSIQEVLKRAVD

Arabidopsis top hits (e value, % identity, alignment)
AT1G52500.1 MUTM homolog-1 (e value: 1.5e-112; % identity: 75.89)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEE+C+GK IK+ +IADD+KVI GISPSDF+ S+LGKTI+SA RKGK++W+ LDSPPFP+F FGMAGAIYIKGVAVT YKRS V + 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +D+F ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKV
         Q+A++LSKE C ALH SI+EV++ AV V+A+S  FP EWLFHFRWGKK GKV
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKV

AT1G52500.2 MUTM homolog-1 (e value: 6.2e-127; % identity: 62.53)
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEE+C+GK IK+ +IADD+KVI GISPSDF+ S+LGKTI+SA RKGK++W+ LDSPPFP+F FGMAGAIYIKGVAVT YKRS V + 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +P SV PIS+LGPDALLEPM +D+F ESL KKK+ IK LLLDQ YISGIGNW+ADEVLYQARIHP
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHP

Query:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
         Q+A++LSKE C ALH SI+E                               VIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+
Subjt:  NQSAATLSKESCAALHKSIQEVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS

Query:  AFVPELQKLTGAEPKNQNSKRKINEG-KKMNDEGVG-EPVSKTKKTADTTDTKTKSKPK-GPSKKPSTKRKSKINEDDGSDEEAEND
        A+VPELQKL G + +     R    G K   D+G G E   +T+K  ++  +K   KP+ G  KKP++K K++ ++DDG D EAE +
Subjt:  AFVPELQKLTGAEPKNQNSKRKINEG-KKMNDEGVG-EPVSKTKKTADTTDTKTKSKPK-GPSKKPSTKRKSKINEDDGSDEEAEND


Sequences
CDS sequence
ATGCCGGAGTTACCTGAGGTGGAGGCGGCGAGGAGGGCCATTGAAGAGCATTGTGTCGGGAAAGTCATCAAGAAGGCCGTGATAGCTGACGATTCGAAGGTCATCGACGG
CATATCGCCTTCTGACTTCGAGGCTTCGCTCTTAGGCAAAACCATCCTCTCCGCCCATCGCAAGGGCAAACACATGTGGGTCCGCCTCGATTCTCCTCCTTTCCCTACAT
TTCACTTCGGGATGGCGGGTGCCATATACATCAAGGGCGTAGCTGTCACGAACTATAAAAGGTCTGTGGTTAATGAAGATGATGAGTGGCCTTCCAAGTACTCGAAGTTC
TTTGTTGAGCTTGACGATGGTGTAGACCTATCCTTCACAGACAAAAGGCGGTTTGCAAAAGTCTGCCTGCTGAAAGACCCAGCTTCAGTGCCCCCAATATCTAAGCTTGG
CCCAGATGCTCTTTTAGAGCCTATGGCACTGGATGACTTTGTTGAATCCTTGGGCAAGAAGAAACTGGCTATTAAGACTCTATTGCTTGATCAGAGCTATATTTCGGGTA
TTGGCAATTGGGTTGCAGATGAAGTACTATATCAAGCGAGAATTCATCCAAATCAAAGTGCTGCTACCCTATCCAAAGAAAGTTGTGCAGCTTTGCACAAGAGCATACAA
GAGGTCCTTAAACGAGCAGTTGATGTTGATGCTGAGTCCAACAACTTTCCTGAAGAATGGTTGTTTCATTTTCGGTGGGGGAAAAAGCCTGGGAAGGTAATTGAAAAAGC
GCTTGAAGTTGGAGCAGATAGTAGTCGGTTTCCTAATAATTGGATTTTCCATTCACGCGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAAGGAAATCCATTTCATCA
CTACAGGCGGCAGGACATCGGCCTTCGTACCTGAGTTGCAAAAGCTTACTGGAGCTGAACCAAAAAATCAAAATTCAAAGAGAAAAATCAACGAAGGCAAAAAAATGAAT
GATGAGGGTGTTGGTGAACCAGTGAGCAAGACAAAGAAAACTGCAGATACAACAGATACAAAGACAAAGTCAAAGCCTAAGGGTCCCTCTAAGAAGCCTTCAACCAAAAG
AAAATCCAAAATCAATGAGGACGATGGCTCTGATGAAGAAGCTGAAAACGATGATGCCAGTGATGATGAAGACAACAGTCATGACATTGGAAAGACGAAAGCAGGAAAGA
GGACGAACGTTGGGCGAATGCACGATGCTTCTGAATCGGAGAAGCCTTCGAAGCCTTCGAAGCAAACAGTTCCTAGCAGTCGAAGTGGTAGGCAGAGGAAGAAAGCAAAG
TAA
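The CDS above ends in the stop codon TAA, and it should translate into the protein sequence listed for this gene. A minimal sketch of that translation follows, assuming the standard genetic code; the page does not name the translation table. `CODON_TABLE` and `translate` are illustrative names, not a CuGenDB API.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Whitespace and line breaks are ignored, so the sequence can be pasted
    as wrapped lines. Codons containing unknown bases become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS shown above.
print(translate("ATGCCGGAGTTACCTGAGGTGGAGGCG"))  # prints MPELPEVEA
```

Translating the full CDS and comparing the result with the listed protein sequence is a quick consistency check on a downloaded record.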
mRNA sequence
GTCAGCCGACCAGTACTACTTTCCCCTCGGAAAAGCATCGTCCTCCATCCGCAACCCCAGCAAGAGAAAAGCGCGCCAAAGCTTCTTATCTGCTTTCCAACCACCGCACG
ATGCCGGAGTTACCTGAGGTGGAGGCGGCGAGGAGGGCCATTGAAGAGCATTGTGTCGGGAAAGTCATCAAGAAGGCCGTGATAGCTGACGATTCGAAGGTCATCGACGG
CATATCGCCTTCTGACTTCGAGGCTTCGCTCTTAGGCAAAACCATCCTCTCCGCCCATCGCAAGGGCAAACACATGTGGGTCCGCCTCGATTCTCCTCCTTTCCCTACAT
TTCACTTCGGGATGGCGGGTGCCATATACATCAAGGGCGTAGCTGTCACGAACTATAAAAGGTCTGTGGTTAATGAAGATGATGAGTGGCCTTCCAAGTACTCGAAGTTC
TTTGTTGAGCTTGACGATGGTGTAGACCTATCCTTCACAGACAAAAGGCGGTTTGCAAAAGTCTGCCTGCTGAAAGACCCAGCTTCAGTGCCCCCAATATCTAAGCTTGG
CCCAGATGCTCTTTTAGAGCCTATGGCACTGGATGACTTTGTTGAATCCTTGGGCAAGAAGAAACTGGCTATTAAGACTCTATTGCTTGATCAGAGCTATATTTCGGGTA
TTGGCAATTGGGTTGCAGATGAAGTACTATATCAAGCGAGAATTCATCCAAATCAAAGTGCTGCTACCCTATCCAAAGAAAGTTGTGCAGCTTTGCACAAGAGCATACAA
GAGGTCCTTAAACGAGCAGTTGATGTTGATGCTGAGTCCAACAACTTTCCTGAAGAATGGTTGTTTCATTTTCGGTGGGGGAAAAAGCCTGGGAAGGTAATTGAAAAAGC
GCTTGAAGTTGGAGCAGATAGTAGTCGGTTTCCTAATAATTGGATTTTCCATTCACGCGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAAGGAAATCCATTTCATCA
CTACAGGCGGCAGGACATCGGCCTTCGTACCTGAGTTGCAAAAGCTTACTGGAGCTGAACCAAAAAATCAAAATTCAAAGAGAAAAATCAACGAAGGCAAAAAAATGAAT
GATGAGGGTGTTGGTGAACCAGTGAGCAAGACAAAGAAAACTGCAGATACAACAGATACAAAGACAAAGTCAAAGCCTAAGGGTCCCTCTAAGAAGCCTTCAACCAAAAG
AAAATCCAAAATCAATGAGGACGATGGCTCTGATGAAGAAGCTGAAAACGATGATGCCAGTGATGATGAAGACAACAGTCATGACATTGGAAAGACGAAAGCAGGAAAGA
GGACGAACGTTGGGCGAATGCACGATGCTTCTGAATCGGAGAAGCCTTCGAAGCCTTCGAAGCAAACAGTTCCTAGCAGTCGAAGTGGTAGGCAGAGGAAGAAAGCAAAG
TAAGTTCATGTCCTTATTACCGTATGTAGTGTTTTTTTGTTGATAATTATGAGAACATATGCCTGCAGTTGGAACTTGCCTTCTTGTCATGACTGTTAATCCTAACCGTA
GATGTGGGCATGACTTTTGTATCATCTGTTCTGTTCCCATCATTTTGAAGAATGTTCATAAGATTAGGAAAAGGCAAATTTATCATTCTACTTGGTTCGGCCCATTTCCT
AAATGTTTTTAGTTCATTCATCTCTCAAAGGATATTTGCTACTATTTTATTCCTTACGTTTTGGTGTAATGG
Protein sequence
MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWVRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNEDDEWPSKYSKF
FVELDDGVDLSFTDKRRFAKVCLLKDPASVPPISKLGPDALLEPMALDDFVESLGKKKLAIKTLLLDQSYISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQ
EVLKRAVDVDAESNNFPEEWLFHFRWGKKPGKVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKINEGKKMN
DEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK