; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg04216 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg04216
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionformamidopyrimidine-DNA glycosylase isoform X1
Genome locationCarg_Chr01:711200..716376
RNA-Seq ExpressionCarg04216
SyntenyCarg04216
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008534 - oxidized purine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0016829 - lyase activity (molecular function)
InterPro domainsIPR010979 - Ribosomal protein S13-like, H2TH
IPR012319 - Formamidopyrimidine-DNA glycosylase, catalytic domain
IPR015886 - DNA glycosylase/AP lyase, H2TH DNA-binding
IPR020629 - Formamidopyrimidine-DNA glycosylase
IPR035937 - MutM-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606816.1 Formamidopyrimidine-DNA glycosylase, partial [Cucurbita argyrosperma subsp. sororia]3.0e-21493.91Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD                    PASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
        ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG

Query:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH
        AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGR+H
Subjt:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH

Query:  DASESE---KPSKPSKQTVPSSRSGRQ
        DASESE   KPSKPSKQTVPSSRS  Q
Subjt:  DASESE---KPSKPSKQTVPSSRSGRQ

KAG7036522.1 Formamidopyrimidine-DNA glycosylase, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-258100Show/hide
Query:  AAVKRQPTSTTFPSEKHRPPSATPAREKRAKASYLLSNHRTMPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKG
        AAVKRQPTSTTFPSEKHRPPSATPAREKRAKASYLLSNHRTMPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKG
Subjt:  AAVKRQPTSTTFPSEKHRPPSATPAREKRAKASYLLSNHRTMPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKG

Query:  KHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNEDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPAS
        KHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNEDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPAS
Subjt:  KHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNEDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPAS

Query:  VPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSFISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNN
        VPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSFISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNN
Subjt:  VPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSFISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNN

Query:  WIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSK
        WIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSK
Subjt:  WIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSK

Query:  INEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
        INEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK
Subjt:  INEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMHDASESEKPSKPSKQTVPSSRSGRQRKKAK

XP_022949541.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita moschata]5.3e-21994.64Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMW+RLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD                    PASVPPISKLGPDALLEPMALDDF+ESLGKKKLAIKTLLLDQS+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
        ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG

Query:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH
        AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH
Subjt:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH

Query:  DASESEKPSKPSKQTVPSSRSGRQRKKAK
        DASESEKPSKPSKQTVPSSRSGRQRKKAK
Subjt:  DASESEKPSKPSKQTVPSSRSGRQRKKAK

XP_022998520.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita maxima]7.4e-21393.24Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD                    PASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQS+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
        ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQ+VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG

Query:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH
        AEPKNQNSKRKINEGKKMNDEGVGE VSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH
Subjt:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH

Query:  DASESEKPSKPSKQTVPSSRSGRQRKKAK
        +AS SE   KPSKQTVPSSRSGRQRKK K
Subjt:  DASESEKPSKPSKQTVPSSRSGRQRKKAK

XP_023523304.1 formamidopyrimidine-DNA glycosylase isoform X1 [Cucurbita pepo subsp. pepo]5.7e-21393.01Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKA+IADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD                    PASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQS+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
        ISGIGNW+ADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG

Query:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH
        AEPKNQNSKRKINEGKKMNDEGVGE VSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDN HDIGK KAGKRTNVGRMH
Subjt:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH

Query:  DASESEKPSKPSKQTVPSSRSGRQRKKAK
        DASESE   KPSKQTVPSSRSG+QRKKAK
Subjt:  DASESEKPSKPSKQTVPSSRSGRQRKKAK

TrEMBL top hitse value%identityAlignment
A0A5D3E227 Formamidopyrimidine-DNA glycosylase isoform X18.6e-18382.75Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHC+GKVIKKAVIADD+KVIDG+SPSDFEASLLGKTILSAHRKGKH+WLRLDSPPFP FHFGMAGAIYIKGVAVTNYKRS+VN+D
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKV LL+D                    PASVPPISKLGPDALLEPMALD+FIESL KKKLAIKTLLLDQS+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
        ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG

Query:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH
        AEPKNQNSKRK N+ KKMNDEG GE VSKT+KTA   D K K KPKG SKKPS KRKSK  ++DGSDEEAENDDASDD DN    G  K GK+TN+G+  
Subjt:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH

Query:  DASESEKPSKPSKQTVPSSRSGRQRKKAK
        DA  + +P K  KQTV SSR+GR+RKKAK
Subjt:  DASESEKPSKPSKQTVPSSRSGRQRKKAK

A0A6J1GCA1 formamidopyrimidine-DNA glycosylase isoform X12.6e-21994.64Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMW+RLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD                    PASVPPISKLGPDALLEPMALDDF+ESLGKKKLAIKTLLLDQS+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
        ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG

Query:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH
        AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH
Subjt:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH

Query:  DASESEKPSKPSKQTVPSSRSGRQRKKAK
        DASESEKPSKPSKQTVPSSRSGRQRKKAK
Subjt:  DASESEKPSKPSKQTVPSSRSGRQRKKAK

A0A6J1GCC5 formamidopyrimidine-DNA glycosylase isoform X21.3e-20790.47Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMW+RLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD                    PASVPPISKLGPDALLEPMALDDF+ESLGKKKLAIKTLLLDQS+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLT
        ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEV+++A++V A+S+ FP  W+FH R  KKPGK  V+GKEIHFITTGGRTSAFVPELQKLT
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLT

Query:  GAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRM
        GAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRM
Subjt:  GAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRM

Query:  HDASESEKPSKPSKQTVPSSRSGRQRKKAK
        HDASESEKPSKPSKQTVPSSRSGRQRKKAK
Subjt:  HDASESEKPSKPSKQTVPSSRSGRQRKKAK

A0A6J1KAE4 formamidopyrimidine-DNA glycosylase isoform X21.8e-20189.07Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD                    PASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQS+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLT
        ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQ+V+++A++V A+S+ FP  W+FH R  KKPGK  V+GKEIHFITTGGRTSAFVPELQKLT
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLT

Query:  GAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRM
        GAEPKNQNSKRKINEGKKMNDEGVGE VSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRM
Subjt:  GAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRM

Query:  HDASESEKPSKPSKQTVPSSRSGRQRKKAK
        H+AS SE   KPSKQTVPSSRSGRQRKK K
Subjt:  HDASESEKPSKPSKQTVPSSRSGRQRKKAK

A0A6J1KCR6 formamidopyrimidine-DNA glycosylase isoform X13.6e-21393.24Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKD                    PASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQS+
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
        ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQ+VIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG

Query:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH
        AEPKNQNSKRKINEGKKMNDEGVGE VSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH
Subjt:  AEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRMH

Query:  DASESEKPSKPSKQTVPSSRSGRQRKKAK
        +AS SE   KPSKQTVPSSRSGRQRKK K
Subjt:  DASESEKPSKPSKQTVPSSRSGRQRKKAK

SwissProt top hitse value%identityAlignment
A9B0X2 Formamidopyrimidine-DNA glycosylase2.4e-2530.65Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVE  RR++E+  VG+           K++D  SP  F  ++  + I    R+ K++ + LD+      H  M G + +                
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        DE   +++   V LD+G +L F D R+F +            W+ +  S    + Q        +LGP+ L +   LDDF + L +K   IK  LLDQS 
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALE
        ++G+GN  ADE L+ A+IHP +SA +L+    A L ++I+ V+  ++E
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALE

B0TER7 Formamidopyrimidine-DNA glycosylase7.1e-2530.1Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVE  RR++     G  I+K  +    K+   +  + F  +L G+ I+   R+GK++ L LD       H  M G +         + R    E+
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPA-SVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQS
         E    ++ FF  LDDG  L +TD R+F                    +LT++  + A   P   +LGP+ L +  +  DF  +L K+K  +K LLLDQS
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPA-SVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQS

Query:  FISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSR-----------FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRT
        F++G+GN  ADE L +AR+HP+++A +L  E    L+  I+ V+++ ++    S R           F      + R   P +    G EI      GR+
Subjt:  FISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSR-----------FPNNWIFHSREKKPGKAFVDGKEIHFITTGGRT

Query:  SAFVPELQK
        + F P  QK
Subjt:  SAFVPELQK

O80358 Formamidopyrimidine-DNA glycosylase4.2e-12664.36Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEE+C+GK IK+ +IADD+KVI GISPSDF+ S+LGKTI+SA RKGK++WL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V + 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +                    P SV PIS+LGPDALLEPM +D+F ESL KKK+ IK LLLDQ +
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
        ISGIGNW+ADEVLYQARIHP Q+A++LSKE C ALH SI+EVIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG

Query:  AEPKNQNSKRKINEG-KKMNDEGVG-EPVSKTKKTADTTDTKTKSKPK-GPSKKPSTKRKSKINEDDGSDEEAEND
         + +     R    G K   D+G G E   +T+K  ++  +K   KP+ G  KKP++K K++ ++DDG D EAE +
Subjt:  AEPKNQNSKRKINEG-KKMNDEGVG-EPVSKTKKTADTTDTKTKSKPK-GPSKKPSTKRKSKINEDDGSDEEAEND

Q03GC2 Formamidopyrimidine-DNA glycosylase7.1e-2531.19Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSD--FEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVN
        MPELPEVE  RR +     GK++   V+     V    SP    F   L GK IL+  R+GK++ +          H  M G            K SVV+
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSD--FEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVN

Query:  EDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQ
          +E+  K+     ELDDG DL + D R+F ++ L   VP  +                  V  +  +GP+   E + L+     L  +K  +K+ LLDQ
Subjt:  EDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQ

Query:  SFISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALE---------VGAD--SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGR
        S I+G+GN  ADEVL+ ++IHP Q + TL+ E  A L +SI E ++ A+E         + AD  +  F N    + R+  P +    G  I  I    R
Subjt:  SFISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALE---------VGAD--SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGR

Query:  TSAFVPELQKL
         + F P  Q L
Subjt:  TSAFVPELQKL

Q4JUY8 Formamidopyrimidine-DNA glycosylase1.9e-2530.42Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVE  RR +EEH VG+      +     V  G  P    +SL   T+ +  R+GK +WL          H GM+G             + +V E 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
         +  S + +    L DG +L F D+R F +  L K VP  D W   +   +     P +V  I+    +A+ +  A    +  +  K+ A+KT+LL+Q  
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGAD------------SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRT
        +SGIGN  ADE L+ A + P +SAA LS+ +   + +S  EV+E ALE G              S  F  +   + R  +P K    G  I  +  GGR+
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGAD------------SSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRT

Query:  SAFVPELQK
        + +    Q+
Subjt:  SAFVPELQK

Arabidopsis top hitse value%identityAlignment
AT1G52500.1 MUTM homolog-16.0e-10467.25Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEE+C+GK IK+ +IADD+KVI GISPSDF+ S+LGKTI+SA RKGK++WL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V + 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +                    P SV PIS+LGPDALLEPM +D+F ESL KKK+ IK LLLDQ +
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFIT
        ISGIGNW+ADEVLYQARIHP Q+A++LSKE C ALH SI+EVI+ A++V ADS  FP  W+FH R  KK GK  V+GK  H ++
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSR-EKKPGKAFVDGKEIHFIT

AT1G52500.2 MUTM homolog-13.0e-12764.36Show/hide
Query:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED
        MPELPEVEAARRAIEE+C+GK IK+ +IADD+KVI GISPSDF+ S+LGKTI+SA RKGK++WL LDSPPFP+F FGMAGAIYIKGVAVT YKRS V + 
Subjt:  MPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSPPFPTFHFGMAGAIYIKGVAVTNYKRSVVNED

Query:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF
        +EWPSKYSKFFVELDDG++LSFTDKRRFAKV LL +                    P SV PIS+LGPDALLEPM +D+F ESL KKK+ IK LLLDQ +
Subjt:  DEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDDFIESLGKKKLAIKTLLLDQSF

Query:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG
        ISGIGNW+ADEVLYQARIHP Q+A++LSKE C ALH SI+EVIEKA+EV ADSS+FP+ WIFH+REKKPGKAFVDGK+I FIT GGRT+A+VPELQKL G
Subjt:  ISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTSAFVPELQKLTG

Query:  AEPKNQNSKRKINEG-KKMNDEGVG-EPVSKTKKTADTTDTKTKSKPK-GPSKKPSTKRKSKINEDDGSDEEAEND
         + +     R    G K   D+G G E   +T+K  ++  +K   KP+ G  KKP++K K++ ++DDG D EAE +
Subjt:  AEPKNQNSKRKINEG-KKMNDEGVG-EPVSKTKKTADTTDTKTKSKPK-GPSKKPSTKRKSKINEDDGSDEEAEND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GCAGCAGTAAAACGTCAGCCGACCAGTACTACTTTCCCCTCGGAAAAGCATCGTCCTCCATCCGCAACCCCAGCAAGAGAAAAGCGCGCCAAAGCTTCTTATCTGCTTTC
CAACCACCGCACGATGCCGGAGTTACCTGAGGTGGAGGCGGCGAGGAGGGCTATTGAAGAGCATTGTGTCGGGAAAGTCATCAAGAAGGCCGTGATAGCTGACGATTCGA
AGGTCATCGACGGCATATCGCCCTCTGACTTCGAGGCTTCGCTCTTAGGCAAAACCATCCTCTCCGCCCATCGCAAGGGCAAACACATGTGGCTCCGCCTCGATTCTCCT
CCTTTCCCTACATTTCACTTTGGGATGGCGGGTGCCATATACATCAAGGGCGTAGCTGTCACGAACTATAAAAGATCTGTGGTTAATGAAGATGATGAGTGGCCTTCCAA
GTACTCAAAGTTCTTTGTTGAGCTTGACGATGGTGTAGACCTATCCTTCACAGACAAAAGGCGGTTTGCAAAAGTCTGCCTGCTGAAAGATGTACCTCAATATGACTCAT
GGACATTCTTGTCAGCTTCTCTAACAATTTTGGTTTTACAGCCAGCTTCAGTGCCCCCAATATCTAAGCTTGGCCCAGATGCTCTTTTAGAGCCTATGGCACTAGATGAC
TTTATTGAATCCTTGGGCAAGAAGAAACTGGCTATTAAGACTCTATTGCTTGATCAGAGCTTTATTTCGGGTATCGGCAATTGGGTTGCAGATGAAGTGCTATATCAAGC
GAGAATTCATCCAAATCAAAGTGCTGCTACCCTATCCAAAGAAAGTTGTGCAGCTTTGCACAAGAGCATACAAGAGGTAATTGAAAAAGCGCTTGAAGTTGGAGCAGATA
GTAGTCGGTTTCCTAATAATTGGATTTTCCATTCACGCGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAAGGAAATCCATTTCATCACTACAGGCGGCAGGACATCG
GCCTTCGTACCTGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAATTCAAAGAGAAAAATCAACGAAGGCAAAAAAATGAATGATGAGGGTGTTGGTGAACC
AGTGAGCAAGACAAAGAAAACTGCAGATACAACAGATACAAAGACAAAGTCAAAGCCTAAGGGTCCCTCTAAGAAGCCATCAACCAAAAGAAAATCCAAAATCAATGAGG
ACGATGGCTCTGATGAAGAAGCTGAAAACGATGATGCCAGTGATGATGAAGACAACAGTCATGACATTGGAAAGACGAAAGCGGGAAAGAGGACGAACGTTGGGCGAATG
CACGATGCTTCTGAATCGGAGAAGCCTTCGAAGCCTTCGAAGCAAACAGTTCCTAGCAGTCGAAGTGGTAGGCAGAGGAAGAAAGCAAAGTAA
mRNA sequenceShow/hide mRNA sequence
GCAGCAGTAAAACGTCAGCCGACCAGTACTACTTTCCCCTCGGAAAAGCATCGTCCTCCATCCGCAACCCCAGCAAGAGAAAAGCGCGCCAAAGCTTCTTATCTGCTTTC
CAACCACCGCACGATGCCGGAGTTACCTGAGGTGGAGGCGGCGAGGAGGGCTATTGAAGAGCATTGTGTCGGGAAAGTCATCAAGAAGGCCGTGATAGCTGACGATTCGA
AGGTCATCGACGGCATATCGCCCTCTGACTTCGAGGCTTCGCTCTTAGGCAAAACCATCCTCTCCGCCCATCGCAAGGGCAAACACATGTGGCTCCGCCTCGATTCTCCT
CCTTTCCCTACATTTCACTTTGGGATGGCGGGTGCCATATACATCAAGGGCGTAGCTGTCACGAACTATAAAAGATCTGTGGTTAATGAAGATGATGAGTGGCCTTCCAA
GTACTCAAAGTTCTTTGTTGAGCTTGACGATGGTGTAGACCTATCCTTCACAGACAAAAGGCGGTTTGCAAAAGTCTGCCTGCTGAAAGATGTACCTCAATATGACTCAT
GGACATTCTTGTCAGCTTCTCTAACAATTTTGGTTTTACAGCCAGCTTCAGTGCCCCCAATATCTAAGCTTGGCCCAGATGCTCTTTTAGAGCCTATGGCACTAGATGAC
TTTATTGAATCCTTGGGCAAGAAGAAACTGGCTATTAAGACTCTATTGCTTGATCAGAGCTTTATTTCGGGTATCGGCAATTGGGTTGCAGATGAAGTGCTATATCAAGC
GAGAATTCATCCAAATCAAAGTGCTGCTACCCTATCCAAAGAAAGTTGTGCAGCTTTGCACAAGAGCATACAAGAGGTAATTGAAAAAGCGCTTGAAGTTGGAGCAGATA
GTAGTCGGTTTCCTAATAATTGGATTTTCCATTCACGCGAAAAGAAGCCTGGCAAGGCTTTTGTTGATGGTAAGGAAATCCATTTCATCACTACAGGCGGCAGGACATCG
GCCTTCGTACCTGAGTTGCAAAAGCTTACTGGAGCTGAACCGAAAAATCAAAATTCAAAGAGAAAAATCAACGAAGGCAAAAAAATGAATGATGAGGGTGTTGGTGAACC
AGTGAGCAAGACAAAGAAAACTGCAGATACAACAGATACAAAGACAAAGTCAAAGCCTAAGGGTCCCTCTAAGAAGCCATCAACCAAAAGAAAATCCAAAATCAATGAGG
ACGATGGCTCTGATGAAGAAGCTGAAAACGATGATGCCAGTGATGATGAAGACAACAGTCATGACATTGGAAAGACGAAAGCGGGAAAGAGGACGAACGTTGGGCGAATG
CACGATGCTTCTGAATCGGAGAAGCCTTCGAAGCCTTCGAAGCAAACAGTTCCTAGCAGTCGAAGTGGTAGGCAGAGGAAGAAAGCAAAGTAAGTTCATGTCCTTATCAC
CATATGTAGTGTTTTTTTTGTTGATAATTATGAGAACATATGCCTGCAGTTGGAACTTGCCTTCTTGTCATGACTGTTAATCCTAACCGTAGATGTGGGCATGACTTTTG
TATCATCTGTTCTGTTCCCATCATTTTGAAGAATGTTGATAAGATTGAAAATGCAAA
Protein sequenceShow/hide protein sequence
AAVKRQPTSTTFPSEKHRPPSATPAREKRAKASYLLSNHRTMPELPEVEAARRAIEEHCVGKVIKKAVIADDSKVIDGISPSDFEASLLGKTILSAHRKGKHMWLRLDSP
PFPTFHFGMAGAIYIKGVAVTNYKRSVVNEDDEWPSKYSKFFVELDDGVDLSFTDKRRFAKVCLLKDVPQYDSWTFLSASLTILVLQPASVPPISKLGPDALLEPMALDD
FIESLGKKKLAIKTLLLDQSFISGIGNWVADEVLYQARIHPNQSAATLSKESCAALHKSIQEVIEKALEVGADSSRFPNNWIFHSREKKPGKAFVDGKEIHFITTGGRTS
AFVPELQKLTGAEPKNQNSKRKINEGKKMNDEGVGEPVSKTKKTADTTDTKTKSKPKGPSKKPSTKRKSKINEDDGSDEEAENDDASDDEDNSHDIGKTKAGKRTNVGRM
HDASESEKPSKPSKQTVPSSRSGRQRKKAK