; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh02G006760 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh02G006760
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionDNA glycosylase superfamily protein
Genome locationCma_Chr02:4150878..4153278
RNA-Seq ExpressionCmaCh02G006760
SyntenyCmaCh02G006760
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023007194.1 uncharacterized protein LOC111499758 isoform X1 [Cucurbita maxima]8.5e-20198.63Show/hide
Query:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
        MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
Subjt:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS

Query:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
        PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN     SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
Subjt:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS

Query:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
        DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
Subjt:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS

Query:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
        KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
Subjt:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV

XP_023007195.1 uncharacterized protein LOC111499758 isoform X2 [Cucurbita maxima]1.2e-202100Show/hide
Query:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
        MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
Subjt:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS

Query:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI
        PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI
Subjt:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI

Query:  LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR
        LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR
Subjt:  LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR

Query:  RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
        RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
Subjt:  RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV

XP_023007196.1 uncharacterized protein LOC111499758 isoform X3 [Cucurbita maxima]1.4e-18793.99Show/hide
Query:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
        MCRSDQALES                 TSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
Subjt:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS

Query:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
        PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN     SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
Subjt:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS

Query:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
        DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
Subjt:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS

Query:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
        KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
Subjt:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV

XP_023007197.1 uncharacterized protein LOC111499759 isoform X1 [Cucurbita maxima]6.7e-19897.54Show/hide
Query:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
        MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLN+HPSPSPNITSTSDKILLPLAANG SLSRPRPALDRKKSKSFKPGGNGNVG DNVAEVAS
Subjt:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS

Query:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
        PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN     SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
Subjt:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS

Query:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
        DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
Subjt:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS

Query:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
        KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR APPPEVEETTTGAAGSEAV
Subjt:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV

XP_023007198.1 uncharacterized protein LOC111499759 isoform X2 [Cucurbita maxima]9.4e-20098.89Show/hide
Query:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
        MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLN+HPSPSPNITSTSDKILLPLAANG SLSRPRPALDRKKSKSFKPGGNGNVG DNVAEVAS
Subjt:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS

Query:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI
        PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI
Subjt:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI

Query:  LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR
        LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR
Subjt:  LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR

Query:  RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
        RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR APPPEVEETTTGAAGSEAV
Subjt:  RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV

TrEMBL top hitse value%identityAlignment
A0A6J1KY00 uncharacterized protein LOC111499759 isoform X13.3e-19897.54Show/hide
Query:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
        MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLN+HPSPSPNITSTSDKILLPLAANG SLSRPRPALDRKKSKSFKPGGNGNVG DNVAEVAS
Subjt:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS

Query:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
        PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN     SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
Subjt:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS

Query:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
        DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
Subjt:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS

Query:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
        KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR APPPEVEETTTGAAGSEAV
Subjt:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV

A0A6J1KZV5 uncharacterized protein LOC111499758 isoform X36.8e-18893.99Show/hide
Query:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
        MCRSDQALES                 TSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
Subjt:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS

Query:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
        PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN     SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
Subjt:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS

Query:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
        DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
Subjt:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS

Query:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
        KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
Subjt:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV

A0A6J1L2B3 uncharacterized protein LOC111499758 isoform X25.7e-203100Show/hide
Query:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
        MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
Subjt:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS

Query:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI
        PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI
Subjt:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI

Query:  LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR
        LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR
Subjt:  LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR

Query:  RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
        RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
Subjt:  RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV

A0A6J1L4A1 uncharacterized protein LOC111499758 isoform X14.1e-20198.63Show/hide
Query:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
        MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
Subjt:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS

Query:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
        PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN     SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS
Subjt:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPN-----SDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGS

Query:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
        DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS
Subjt:  DWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTIS

Query:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
        KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
Subjt:  KDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV

A0A6J1L721 uncharacterized protein LOC111499759 isoform X24.5e-20098.89Show/hide
Query:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS
        MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLN+HPSPSPNITSTSDKILLPLAANG SLSRPRPALDRKKSKSFKPGGNGNVG DNVAEVAS
Subjt:  MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVAS

Query:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI
        PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI
Subjt:  PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSI

Query:  LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR
        LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR
Subjt:  LKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIR

Query:  RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV
        RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR APPPEVEETTTGAAGSEAV
Subjt:  RGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSEAV

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 11.6e-3236.87Show/hide
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINR--VRGVVDNAI
        RC ++  + DP+Y+AYHD EWGVP  D + LFE++ L   Q G  W ++LKKR+++R  F  FD   VA   +  +  +  + G+  +R  ++ ++ NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINR--VRGVVDNAI

Query:  RILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTC
          L++++       ++W F+N+ P      +  +IP  TS SD +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Subjt:  RILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTC

P44321 DNA-3-methyladenine glycosylase1.2e-2735.2Show/hide
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVR--GVVDNAI
        RC ++   S  IY+ YHD+EWG P  D Q LFE + L   Q G  W ++LKKR+ +R AF  FD + +A  +   + +     G+  +R +   +V NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVR--GVVDNAI

Query:  RILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTC
          L ++K   +   +IW F+N+ P          +P KT  S  +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Subjt:  RILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]3.9e-3940.84Show/hide
Query:  KAVVEERRCSFITPNSD---PIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINR-
        + V E+ RC++ T   +    +Y  YHD EWG P+H+D+ LFE LVL   Q G  W +ILKKR+ FR AF DFD  +VAN+ + ++  +    G+  NR 
Subjt:  KAVVEERRCSFITPNSD---PIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINR-

Query:  -VRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHR
         +   + NA   + +++EFGS +KYIWGF+   P    ++S   +P  T  SD I+KD+ +RGF+ VG T +++ MQ+ G+ NDHLT+C +
Subjt:  -VRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHR

Arabidopsis top hitse value%identityAlignment
AT1G75090.1 DNA glycosylase superfamily protein2.8e-5339.85Show/hide
Query:  RPALDRKKSKS---FKPGGNGNVGSDNVAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAY
        +P L+ + +KS    KP  N +V +D+ +  +S    ++V            K        +  A    +     KI   V  +RC +ITPNSDPIYV +
Subjt:  RPALDRKKSKS---FKPGGNGNVGSDNVAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAY

Query:  HDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDIN--RVRGVVDNAIRILEIKKEFGSLEKYI
        HDEEWGVPV DD+ LFELLV S A     W SIL++R DFR  F +FD   +A F++++++S+     + ++  ++R +V+NA  +L++K+EFGS   Y 
Subjt:  HDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDIN--RVRGVVDNAIRILEIKKEFGSLEKYI

Query:  WGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTL
        W F+N+ P    Y+   ++PVK+ K++ ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT C R+  C +
Subjt:  WGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTL

AT3G12710.1 DNA glycosylase superfamily protein5.7e-9464.03Show/hide
Query:  GGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA---RFEKIVPFDSKIKAVVEERRCSFITPNS
        G   ++ R +L+RKKSKSFK G   +  S  + E  +PGSIAAVRREQVA QQA RK++IAHYGRSKS       K+VP  +        +RCSF+TP S
Subjt:  GGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA---RFEKIVPFDSKIKAVVEERRCSFITPNS

Query:  DPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGS
        DPIYVAYHDEEWGVPVHDD+TLFELL LS AQVGSDWTS L+KR D+R AF +F+A+VVA  ++++M +IS EY +++++VRGVV+NA +I+EIKK F S
Subjt:  DPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGS

Query:  LEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIA
        LEKY+WGF+N+ P S +YK  HKIPVKTSKS++ISKDM+RRGFR VGPTVVHSFMQAAGLTNDHL TC RH  CTL+A
Subjt:  LEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIA

AT5G44680.1 DNA glycosylase superfamily protein4.6e-8852.89Show/hide
Query:  SKFNPPPLLQPTSNRL--LHRRNSLNRHPS----------PSPNITSTSDKILLPLAANGGSLSRPRPA---LDRKKSKSFKP---GGNGNVGSDNVAEV
        S+ N  P+LQP SN++  L RRNSL + P           PSP   S    I  PL+ N  SL +P  +   L R  S   KP     N + G   V  +
Subjt:  SKFNPPPLLQPTSNRL--LHRRNSLNRHPS----------PSPNITSTSDKILLPLAANGGSLSRPRPA---LDRKKSKSFKP---GGNGNVGSDNVAEV

Query:  A----SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQV
              PGSIAA RRE+VA++Q +RK +I+HYGR KS +  EK +  + +     +++RCSFIT +SDPIYVAYHD+EWGVPVHDD  LFELLVL+ AQV
Subjt:  A----SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQV

Query:  GSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDT
        GSDWTS+LK+R  FR AFS F+A++VA+F+++++ SI ++YG+++++V  VVDNA +IL++K++ GS  KYIWGFM + P +  Y S  KIPVKTSKS+T
Subjt:  GSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDT

Query:  ISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA
        ISKDM+RRGFR VGPTV+HS MQAAGLTNDHL TC RHL CT +AA
Subjt:  ISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA

AT5G57970.1 DNA glycosylase superfamily protein1.5e-5438.26Show/hide
Query:  HPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIV---
        + +P+  ++S+S K  L    N  S+ R R   +   + S     + +   D+    AS G +              R   +    +S  ++   +V   
Subjt:  HPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIV---

Query:  PFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDI
          DS       ++RC+++TPNSDP Y+ +HDEEWGVPVHDD+ LFELLVLS A     W +IL KRQ FR  F+DFD   +   ++++++   S     +
Subjt:  PFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDI

Query:  N--RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC
        +  ++R V++NA +IL++ +E+GS +KYIW F+ N      ++   ++P KT K++ ISKD++RRGFRSVGPTVV+SFMQAAG+TNDHLT+C R  HC
Subjt:  N--RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC

AT5G57970.2 DNA glycosylase superfamily protein1.5e-5438.26Show/hide
Query:  HPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIV---
        + +P+  ++S+S K  L    N  S+ R R   +   + S     + +   D+    AS G +              R   +    +S  ++   +V   
Subjt:  HPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIV---

Query:  PFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDI
          DS       ++RC+++TPNSDP Y+ +HDEEWGVPVHDD+ LFELLVLS A     W +IL KRQ FR  F+DFD   +   ++++++   S     +
Subjt:  PFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDI

Query:  N--RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC
        +  ++R V++NA +IL++ +E+GS +KYIW F+ N      ++   ++P KT K++ ISKD++RRGFRSVGPTVV+SFMQAAG+TNDHLT+C R  HC
Subjt:  N--RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCGTTCCGACCAAGCCTTGGAATCCTCTTCTGTCGTCCTTGATTCCAAATTCAACCCCCCTCCCCTCCTTCAACCCACTTCCAATCGCCTCCTCCACCGCCGTAA
TTCCCTTAACAGACACCCTTCCCCCTCCCCCAACATCACCTCCACCTCTGACAAGATTCTCCTTCCGCTCGCCGCTAACGGCGGCTCTCTGTCTCGCCCCCGGCCTGCCT
TGGATAGGAAGAAATCCAAAAGCTTCAAGCCTGGGGGAAATGGGAACGTGGGTTCTGATAATGTTGCTGAAGTCGCGTCGCCGGGGAGCATCGCCGCCGTGAGAAGAGAG
CAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAGAATTGCCCATTATGGACGGTCTAAATCCGCCCGGTTTGAGAAAATCGTTCCCTTTGATTCTAAGATTAAAGCCGT
TGTTGAAGAGAGAAGATGCAGCTTCATCACTCCCAACTCAGATCCCATTTATGTGGCCTATCACGATGAAGAATGGGGCGTCCCTGTTCATGATGACCAAACGCTGTTTG
AACTGCTGGTTCTGAGTGTGGCCCAAGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGACAAGATTTCAGAAATGCATTTTCAGATTTCGATGCACAAGTGGTGGCG
AATTTTTCCGACAGACAGATGGTTTCGATCAGCTCAGAGTATGGAATGGACATTAACAGAGTCCGAGGAGTGGTCGACAACGCAATCCGGATCCTGGAGATTAAGAAGGA
ATTTGGGTCATTGGAGAAATACATTTGGGGGTTTATGAACAACAACCCATTCTCACCGCACTACAAATCCGCCCACAAAATCCCGGTCAAGACATCAAAATCAGACACCA
TAAGCAAAGACATGATCCGGCGAGGATTCCGGTCCGTCGGTCCCACCGTGGTCCATTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTCACCACCTGCCACAGG
CACCTGCACTGCACATTAATCGCCGCCGGCCGTCACGCTCCACCGCCGGAAGTGGAGGAGACGACGACAGGTGCGGCAGGCTCGGAAGCTGTGTAG
mRNA sequenceShow/hide mRNA sequence
TAGGGTAGTACTTTCGAAAGAAGGCTTCTCGAAAATATTCCCATGTCACCTGCTCTCCCACATTCTCGAGGAACGGAGAACTGACTCCTACTAGGAATGGGCTCGATTTC
TTAGTATATAAGTTTCATAGTGGACATTCTTGGTGTCAGAGCAGCTCATGTACTTGAAGACTAACTCAATGGTCTTCAACCAAGTTTCTATCACCTCTATATCTAGGGTT
CACCCCTCAACTAGTGGGGGTCATACTTCTTAAAGTCCCTTTAGATAACAAGACTTTTCTGACGTAAACTTATCCTGGTGTCCTCGTCTCGATCTGTTTGCCAACACTGC
ATCCACAATGTTTGTTACTTGATTGGTCAAAGTTGTCTGCAACAGGTTTTACAGCTCCTCTACAGACATCGTCGCCATGTTGCACAACGGTGGTGTCATTGTATCTTGTG
CCGGGACTTGGGTTGCCTCTATCTCAGGCGTCTCTTGAGAGGGAACATCGACAGCCTGTGCCATACCACCAATCGTATGACACCTGCTAACACTATCTCTAGCTTAAGAA
GAAGGCTTAAGAAGAAGGTGTGTGTGGTTGTGATGGTGGTGGTGTTTGTTCCCTTCGTTCATCTCTCTTCCAAATTTCCTTTATAAAAAGCCTCTCTTTCCCTCTTCTTC
TCTAACTCCCATTTCTTTCTCTTTCTCTCTCTTCCCCAAAACACAACCAAAAACGATGTGTCGTTCCGACCAAGCCTTGGAATCCTCTTCTGTCGTCCTTGATTCCAAAT
TCAACCCCCCTCCCCTCCTTCAACCCACTTCCAATCGCCTCCTCCACCGCCGTAATTCCCTTAACAGACACCCTTCCCCCTCCCCCAACATCACCTCCACCTCTGACAAG
ATTCTCCTTCCGCTCGCCGCTAACGGCGGCTCTCTGTCTCGCCCCCGGCCTGCCTTGGATAGGAAGAAATCCAAAAGCTTCAAGCCTGGGGGAAATGGGAACGTGGGTTC
TGATAATGTTGCTGAAGTCGCGTCGCCGGGGAGCATCGCCGCCGTGAGAAGAGAGCAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAGAATTGCCCATTATGGACGGT
CTAAATCCGCCCGGTTTGAGAAAATCGTTCCCTTTGATTCTAAGATTAAAGCCGTTGTTGAAGAGAGAAGATGCAGCTTCATCACTCCCAACTCAGATCCCATTTATGTG
GCCTATCACGATGAAGAATGGGGCGTCCCTGTTCATGATGACCAAACGCTGTTTGAACTGCTGGTTCTGAGTGTGGCCCAAGTGGGTTCGGATTGGACTTCAATTTTGAA
GAAACGACAAGATTTCAGAAATGCATTTTCAGATTTCGATGCACAAGTGGTGGCGAATTTTTCCGACAGACAGATGGTTTCGATCAGCTCAGAGTATGGAATGGACATTA
ACAGAGTCCGAGGAGTGGTCGACAACGCAATCCGGATCCTGGAGATTAAGAAGGAATTTGGGTCATTGGAGAAATACATTTGGGGGTTTATGAACAACAACCCATTCTCA
CCGCACTACAAATCCGCCCACAAAATCCCGGTCAAGACATCAAAATCAGACACCATAAGCAAAGACATGATCCGGCGAGGATTCCGGTCCGTCGGTCCCACCGTGGTCCA
TTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTCACCACCTGCCACAGGCACCTGCACTGCACATTAATCGCCGCCGGCCGTCACGCTCCACCGCCGGAAGTGG
AGGAGACGACGACAGGTGCGGCAGGCTCGGAAGCTGTGTAGAATTCAGTGGTTGCTAATTAACTATATAACTGTTTTTATTTTATGTTTTTTGTGTCAAAATGTTTGTAT
GAATAGAATGTCGTGAAGTGGTAGTGACAATGTCGTGTGTGTTGGTCAGTTTGCTTTTGTAAATTCCCATCTCATCCATCCTAATTTCAAACATTATTAAACCATTATTG
TATTATTTTTAGTCTTATTTAGCCTTTTATATTCTCGATTAAGTTATCGAAGGTTCGAGTCTGAAAATAAT
Protein sequenceShow/hide protein sequence
MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPLAANGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVASPGSIAAVRRE
QVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVA
NFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHR
HLHCTLIAAGRHAPPPEVEETTTGAAGSEAV