CuGenDBv2

Cucsat.G6367 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G6367
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Adenine DNA glycosylase
Genome location: ctg1429:309807..314128
RNA-Seq Expression: Cucsat.G6367
Synteny: Cucsat.G6367
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0000701 - purine-specific mismatch base pair DNA N-glycosylase activity (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
InterPro domains:
IPR000445 - Helix-hairpin-helix motif
IPR003265 - HhH-GPD domain
IPR004036 - Endonuclease III-like, conserved site-2
IPR011257 - DNA glycosylase
IPR015797 - NUDIX hydrolase-like domain superfamily
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
IPR029119 - MutY, C-terminal
IPR044298 - Adenine/Thymine-DNA glycosylase
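The genome location in this record is written as `contig:start..end`. As a minimal sketch, that string could be split into its parts as shown here. The helper name `parse_location` and the `Location` type are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed `contig:start..end` location (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates; the page does not say so.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'ctg1429:309807..314128'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"not a contig:start..end location: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(contig, start, end)


loc = parse_location("ctg1429:309807..314128")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene region spans 4322 bp on contig ctg1429.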


Homology
GenBank top hits (e value, % identity, alignment)
KGN46403.2 hypothetical protein Csa_005328 [Cucumis sativus] | e value: 2.24e-255 | identity: 99.73%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
        +IMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV

Query:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
        DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
Subjt:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD

Query:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
        YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
Subjt:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE

Query:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
Subjt:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

XP_004140565.2 adenine DNA glycosylase isoform X1 [Cucumis sativus] | e value: 3.76e-263 | identity: 99.73%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
        +IMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV

Query:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
        DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
Subjt:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD

Query:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
        YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
Subjt:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE

Query:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
Subjt:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

XP_031743605.1 adenine DNA glycosylase isoform X2 [Cucumis sativus] | e value: 1.41e-260 | identity: 99.46%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
        +IMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE VNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV

Query:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
        DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
Subjt:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD

Query:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
        YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
Subjt:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE

Query:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
Subjt:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

XP_031743606.1 adenine DNA glycosylase isoform X3 [Cucumis sativus] | e value: 8.77e-259 | identity: 98.92%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
        +IMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFE   MIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV

Query:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
        DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
Subjt:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD

Query:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
        YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
Subjt:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE

Query:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
Subjt:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

XP_031743607.1 adenine DNA glycosylase isoform X4 [Cucumis sativus] | e value: 3.28e-256 | identity: 98.65%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
        +IMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE VNEMWAGLGYYRRARFLFE   MIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV

Query:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
        DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
Subjt:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD

Query:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
        YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
Subjt:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE

Query:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
Subjt:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KC27 Adenine DNA glycosylase | e value: 1.33e-262 | identity: 99.73%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
        +IMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV

Query:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
        DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
Subjt:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD

Query:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
        YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
Subjt:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE

Query:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
Subjt:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

A0A1S3CBT2 Adenine DNA glycosylase | e value: 8.30e-253 | identity: 96.5%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
        +IMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFP+TVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV

Query:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
        DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIK KQRHD
Subjt:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD

Query:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
        YSAVCVVEILESQGT ELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD STRRESI+SLLSKNFGLE KKNFEIVNREDVGDFIH+FTHIRLKIYVE
Subjt:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE

Query:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSS+  LP KKQKS
Subjt:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

A0A1S4E2J7 Adenine DNA glycosylase | e value: 1.41e-237 | identity: 97.38%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
        +IMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFP+TVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV

Query:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
        DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISK DSSVLVTDYPAKGIK KQRHD
Subjt:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD

Query:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
        YSAVCVVEILESQGT ELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEAD STRRESI+SLLSKNFGLE KKNFEIVNREDVGDFIH+FTHIRLKIYVE
Subjt:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE

Query:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK
        HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK
Subjt:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK

A0A5A7T8X3 Adenine DNA glycosylase | e value: 1.55e-247 | identity: 90.4%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEV----
        +IMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFP+TVSSLRKIPGIGEYTAGAIASIAFGEV    
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEV----

Query:  ---------------------VPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALS
                             VPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVD SRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALS
Subjt:  ---------------------VPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALS

Query:  ISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNF
        ISK DSSVLVTDYPAKGIK KQRHDYSAVCVVEILESQGT ELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADL+TRRESI+SLLSKNFGLE KKNF
Subjt:  ISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNF

Query:  EIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
        EIVNREDVGDFIH+FTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSS+  LPRKKQKS
Subjt:  EIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS

A0A6J1GP17 Adenine DNA glycosylase | e value: 7.95e-223 | identity: 86.76%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV
        +IMLQQTRVQTVV++Y RWM +WPTVQHLSRASLEEVNEMWAGLGYYRRARFL EGAK+IVKEGG FP+TVS+LRKIPGIGEYTAGAIASIAF EVVPVV
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPVV

Query:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD
        DGNVIRVIARLKAIS NPKD KL+KQVWKAAAQLVD SRPGDFNQALMELGATLCTPT+PSCSTCPVFDHCEALS SK DSSVLVTDYPAKG+K KQRHD
Subjt:  DGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDSSVLVTDYPAKGIKIKQRHD

Query:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE
        YSAVCVVEILE+QGT EL QSSRFLLVKRPDEGLLAGLWEFPSV L+GEAD STRRESINSLLSK+FGLE KKNFEIV REDVGDF+H+F+HIRLKIYVE
Subjt:  YSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVE

Query:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQK
        HLVL LKGEGSKLFRKQEKKSI WKCV+NKVMS+MGLTSSVRK YAMVEKF+A K S S   A+  KKQ+
Subjt:  HLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQK

SwissProt top hits (e value, % identity, alignment)
F4JRF4 Adenine DNA glycosylase | e value: 8.4e-115 | identity: 59.89%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIG
        +IMLQQTRVQTV+++Y RWM KWPT+  L +ASLE                   EVNEMWAGLGYYRRARFL EGAKM+V     FP   SSL K+ GIG
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIG

Query:  EYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDS
        +YTAGAIASIAF E VPVVDGNVIRV+ARLKAIS NPKD    +  WK AAQLVD SRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+ + 
Subjt:  EYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDS

Query:  SVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSK--NFGLEAKKNFEIVN
        ++ VTDYP K IK K RHD+  VCV+EI   +     G   RF+LVKRP++GLLAGLWEFPSV L+ EAD +TRR +IN  L +   F +E KK   IV+
Subjt:  SVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSK--NFGLEAKKNFEIVN

Query:  REDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK
        RE++G+F+HIFTHIR K+YVE LV+ L G    LF+ Q K ++ WKCV + V+ST+GLTS+VRK
Subjt:  REDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK

Q10159 Adenine DNA glycosylase | e value: 5.2e-48 | identity: 36.69%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE-EVNEMWAGLGYYRRARFLFEGAKMIVK-EGGRFPRTVSSLRK-IPGIGEYTAGAIASIAFGEVV
        +IMLQQTRV+TV ++Y +WM   PT++  + A    +V  +W+G+G+Y R + L +  + + K      PRT     K IPG+G YTAGA+ SIA+ +  
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE-EVNEMWAGLGYYRRARFLFEGAKMIVK-EGGRFPRTVSSLRK-IPGIGEYTAGAIASIAFGEVV

Query:  PVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEAL---------SISKHD-----SSV
         +VDGNVIRV++R  AI  +    K    +WK A +LVD  RPGDFNQALMELGA  CTP +P CS CP+ + C+A          +  K+D      ++
Subjt:  PVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEAL---------SISKHD-----SSV

Query:  LVTD--------------YPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEA---DLSTR-RESINSLLSK
         +TD              YP    K KQR + + V +      Q T    +   FL+ KRP  GLLAGLW+FP++    E+   D+    ++SI   +S 
Subjt:  LVTD--------------YPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEA---DLSTR-RESINSLLSK

Query:  NFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLV
        +     KK       +  G ++HIF+HIR   +V + +
Subjt:  NFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLV

Q8R5G2 Adenine DNA glycosylase | e value: 1.7e-67 | identity: 39.66%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPRTVSSLRK-IPGIGEYTAGAIASIAFGEVVP
        ++MLQQT+V TV+ +Y RWM KWPT+Q L+ ASLEEVN++W+GLGYY R R L EGA+ +V+E GG  PRT  +L++ +PG+G YTAGAIASIAF +V  
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPRTVSSLRK-IPGIGEYTAGAIASIAFGEVVP

Query:  VVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA-------------------------
        VVDGNVIRV+ R++AI  +P    +   +W  A QLVD +RPGDFNQA MELGAT+CTP  P C+ CPV   C A                         
Subjt:  VVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA-------------------------

Query:  -------LSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKN
                S +  D ++ V ++P K  +   R +YSA CVVE   + G P +      LLV+RP+ GLLAGLWEFPSV+L+      + +    +LL + 
Subjt:  -------LSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKN

Query:  FGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAG--KTSSSSNCAL
            A         + +G+ IH+F+HI+L   V    L L+G+          + + W+   N  +ST     +++K + + E+ + G  K S       
Subjt:  FGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAG--KTSSSSNCAL

Query:  PRKKQK
        P  ++K
Subjt:  PRKKQK

Q99P21 Adenine DNA glycosylase | e value: 2.0e-68 | identity: 40.64%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPRTVSSLRK-IPGIGEYTAGAIASIAFGEVVP
        ++MLQQT+V TV+ +Y RWM KWP +Q L+ ASLEEVN++W+GLGYY R R L EGA+ +V+E GG  PRT  +L++ +PG+G YTAGAIASIAF +V  
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPRTVSSLRK-IPGIGEYTAGAIASIAFGEVVP

Query:  VVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA-------------------------
        VVDGNV+RV+ R++AI  +P    +   +W  A QLVD +RPGDFNQA MELGAT+CTP  P CS CPV   C A                         
Subjt:  VVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA-------------------------

Query:  -------LSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKN
                S S  D S+ V ++P K  +   R +YSA CVVE   + G P +      LLV+RPD GLLAGLWEFPSV+L  E     + +++   L + 
Subjt:  -------LSISKHDSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKN

Query:  FGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAG--KTSSSSNCAL
         G         +  + +G+ IHIF+HI+L   V  L L    +          + + W+   N  +ST     +++K + M E  + G  K S  S    
Subjt:  FGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAG--KTSSSSNCAL

Query:  PRKKQK
        P  ++K
Subjt:  PRKKQK

Q9UIF7 Adenine DNA glycosylase | e value: 2.1e-65 | identity: 43.4%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPRTVSSLRK-IPGIGEYTAGAIASIAFGEVVP
        ++MLQQT+V TV+ +Y  WM KWPT+Q L+ ASLEEVN++WAGLGYY R R L EGA+ +V+E GG  PRT  +L++ +PG+G YTAGAIASIAFG+   
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKE-GGRFPRTVSSLRK-IPGIGEYTAGAIASIAFGEVVP

Query:  VVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA--------------LSISKH-----
        VVDGNV RV+ R++AI  +P    + +Q+W  A QLVD +RPGDFNQA MELGAT+CTP  P CS CPV   C A              LS S       
Subjt:  VVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEA--------------LSISKH-----

Query:  ----------------DSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLL
                        D ++ V ++P K  +   R + SA CV+E   + G       ++ LLV+RP+ GLLAGLWEFPSV+ +    L  +R+++   L
Subjt:  ----------------DSSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLL

Query:  SKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVL
         +  G          +   +G+ +H F+HI+L   V  L L
Subjt:  SKNFGLEAKKNFEIVNREDVGDFIHIFTHIRLKIYVEHLVL

Arabidopsis top hits (e value, % identity, alignment)
AT1G05900.2 endonuclease III 2 | e value: 9.9e-10 | identity: 27.85%
Query:  TVQHLSRASLEEVNEMWAGLGYY-RRARFLFEGAKMIVKE-GGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPV-VDGNVIRVIARL--------KA
        T + + +A    + E+   +G+Y R+A  + + AK+ + E  G  PRT+  L  +PG+G   A  +  +A+ +V  + VD +V R+  RL        K 
Subjt:  TVQHLSRASLEEVNEMWAGLGYY-RRARFLFEGAKMIVKE-GGRFPRTVSSLRKIPGIGEYTAGAIASIAFGEVVPV-VDGNVIRVIARL--------KA

Query:  ISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHC
         + +P++ ++  Q W    + V +      N  L+  G T+CTP  P C TC + + C
Subjt:  ISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHC

AT4G12740.1 HhH-GPD base excision DNA repair family protein | e value: 6.0e-116 | identity: 59.89%
Query:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIG
        +IMLQQTRVQTV+++Y RWM KWPT+  L +ASLE                   EVNEMWAGLGYYRRARFL EGAKM+V     FP   SSL K+ GIG
Subjt:  KIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLE-------------------EVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRTVSSLRKIPGIG

Query:  EYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDS
        +YTAGAIASIAF E VPVVDGNVIRV+ARLKAIS NPKD    +  WK AAQLVD SRPGDFNQ+LMELGATLCT + PSCS+CPV   C A S+S+ + 
Subjt:  EYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHDS

Query:  SVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSK--NFGLEAKKNFEIVN
        ++ VTDYP K IK K RHD+  VCV+EI   +     G   RF+LVKRP++GLLAGLWEFPSV L+ EAD +TRR +IN  L +   F +E KK   IV+
Subjt:  SVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSK--NFGLEAKKNFEIVN

Query:  REDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK
        RE++G+F+HIFTHIR K+YVE LV+ L G    LF+ Q K ++ WKCV + V+ST+GLTS+VRK
Subjt:  REDVGDFIHIFTHIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRK
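In BLAST-style pairwise output such as the alignments in this section, the row between `Query:` and `Subjt:` carries a letter where the two residues are identical, `+` for a conservative substitution, and a space for a mismatch or gap. A rough sketch of deriving a percent identity from that middle row follows. The function name `midline_identity` is hypothetical. Dividing by the aligned length is an assumption, and the page does not say how its %identity column is computed, so results need not match it.

```python
def midline_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style middle row.

    `aligned_length` is passed explicitly because trailing spaces of the
    middle row are often lost when a page is copied as plain text.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    if len(midline) > aligned_length:
        raise ValueError("midline is longer than the stated aligned length")
    # Letters mark identical residues; '+' and ' ' do not count as identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy example (not taken from the record): 6 identical positions out of 8.
print(midline_identity("+IMLQ TR", 8))
```

For a hit spread over several alignment blocks, the identical counts and block lengths would be summed across blocks before dividing.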


Sequences
CDS sequence
ATGTTCAAACAATCAGGGCATCGCTATTGGATTGGTACGACCGTAGCCGCAGGGACCTTCCATGGAGGAGCTTGGACAAAGGGGAACCTGAAACACGGGCTTACGGTGTG
TGGGTTTCAGAAAATAATGCTGCAGCAGACCAGAGTTCAGACCGTCGTCCAATTTTACAACCGTTGGATGCTTAAATGGCCTACCGTTCAACATCTCTCTCGTGCTTCTC
TTGAGGAGGTGAATGAAATGTGGGCAGGCTTGGGGTATTATAGACGAGCTCGTTTTCTTTTTGAGGGTGCAAAGATGATAGTCAAAGAAGGTGGTAGGTTTCCTAGAACA
GTTTCTTCCCTGCGAAAAATTCCAGGAATTGGAGAATACACAGCAGGGGCTATTGCCTCTATAGCGTTCGGTGAAGTGGTGCCTGTGGTCGATGGTAATGTGATAAGGGT
AATTGCTCGATTAAAGGCTATTTCAGGAAACCCAAAAGACCCAAAGTTGATCAAGCAAGTTTGGAAGGCAGCTGCTCAATTAGTTGATCTTTCCAGGCCTGGGGATTTCA
ATCAGGCACTCATGGAACTTGGTGCAACTTTATGCACTCCAACAAACCCAAGCTGCTCAACATGCCCCGTGTTTGATCACTGTGAGGCCCTTTCAATCTCAAAGCATGAT
AGTTCAGTTCTTGTCACAGATTATCCCGCTAAGGGGATAAAGATCAAACAAAGACATGATTACTCTGCTGTATGTGTGGTTGAGATATTGGAAAGTCAGGGTACACCTGA
GTTAGGGCAATCTAGTAGATTTCTTCTTGTAAAGAGGCCTGATGAAGGTTTGCTTGCTGGTCTATGGGAGTTTCCATCTGTCTCGTTGGATGGAGAAGCTGATTTAAGCA
CCAGGAGAGAATCCATTAATAGCCTCTTGAGCAAAAACTTTGGACTTGAAGCAAAAAAGAATTTTGAAATAGTTAATAGAGAAGATGTTGGAGATTTTATCCATATTTTC
ACACACATCCGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTTTAAAAGGTGAAGGTAGCAAGTTGTTTCGGAAACAGGAGAAGAAATCTATATTATGGAAATGTGT
AGAGAACAAGGTTATGTCAACAATGGGCTTGACGTCCAGTGTGAGGAAGGCCTATGCCATGGTCGAGAAATTTCAGGCAGGGAAGACATCTTCTAGTTCTAACTGTGCAC
TACCCAGAAAGAAACAGAAATCTTGA
mRNA sequence
ATGTTCAAACAATCAGGGCATCGCTATTGGATTGGTACGACCGTAGCCGCAGGGACCTTCCATGGAGGAGCTTGGACAAAGGGGAACCTGAAACACGGGCTTACGGTGTG
TGGGTTTCAGAAAATAATGCTGCAGCAGACCAGAGTTCAGACCGTCGTCCAATTTTACAACCGTTGGATGCTTAAATGGCCTACCGTTCAACATCTCTCTCGTGCTTCTC
TTGAGGAGGTGAATGAAATGTGGGCAGGCTTGGGGTATTATAGACGAGCTCGTTTTCTTTTTGAGGGTGCAAAGATGATAGTCAAAGAAGGTGGTAGGTTTCCTAGAACA
GTTTCTTCCCTGCGAAAAATTCCAGGAATTGGAGAATACACAGCAGGGGCTATTGCCTCTATAGCGTTCGGTGAAGTGGTGCCTGTGGTCGATGGTAATGTGATAAGGGT
AATTGCTCGATTAAAGGCTATTTCAGGAAACCCAAAAGACCCAAAGTTGATCAAGCAAGTTTGGAAGGCAGCTGCTCAATTAGTTGATCTTTCCAGGCCTGGGGATTTCA
ATCAGGCACTCATGGAACTTGGTGCAACTTTATGCACTCCAACAAACCCAAGCTGCTCAACATGCCCCGTGTTTGATCACTGTGAGGCCCTTTCAATCTCAAAGCATGAT
AGTTCAGTTCTTGTCACAGATTATCCCGCTAAGGGGATAAAGATCAAACAAAGACATGATTACTCTGCTGTATGTGTGGTTGAGATATTGGAAAGTCAGGGTACACCTGA
GTTAGGGCAATCTAGTAGATTTCTTCTTGTAAAGAGGCCTGATGAAGGTTTGCTTGCTGGTCTATGGGAGTTTCCATCTGTCTCGTTGGATGGAGAAGCTGATTTAAGCA
CCAGGAGAGAATCCATTAATAGCCTCTTGAGCAAAAACTTTGGACTTGAAGCAAAAAAGAATTTTGAAATAGTTAATAGAGAAGATGTTGGAGATTTTATCCATATTTTC
ACACACATCCGTCTCAAGATATATGTTGAGCACTTGGTGTTATGTTTAAAAGGTGAAGGTAGCAAGTTGTTTCGGAAACAGGAGAAGAAATCTATATTATGGAAATGTGT
AGAGAACAAGGTTATGTCAACAATGGGCTTGACGTCCAGTGTGAGGAAGGCCTATGCCATGGTCGAGAAATTTCAGGCAGGGAAGACATCTTCTAGTTCTAACTGTGCAC
TACCCAGAAAGAAACAGAAATCTTGA
Protein sequence
MFKQSGHRYWIGTTVAAGTFHGGAWTKGNLKHGLTVCGFQKIMLQQTRVQTVVQFYNRWMLKWPTVQHLSRASLEEVNEMWAGLGYYRRARFLFEGAKMIVKEGGRFPRT
VSSLRKIPGIGEYTAGAIASIAFGEVVPVVDGNVIRVIARLKAISGNPKDPKLIKQVWKAAAQLVDLSRPGDFNQALMELGATLCTPTNPSCSTCPVFDHCEALSISKHD
SSVLVTDYPAKGIKIKQRHDYSAVCVVEILESQGTPELGQSSRFLLVKRPDEGLLAGLWEFPSVSLDGEADLSTRRESINSLLSKNFGLEAKKNFEIVNREDVGDFIHIF
THIRLKIYVEHLVLCLKGEGSKLFRKQEKKSILWKCVENKVMSTMGLTSSVRKAYAMVEKFQAGKTSSSSNCALPRKKQKS
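The CDS and protein sequences in this record can be cross-checked by translating the CDS. This is a minimal sketch that assumes the standard nuclear genetic code. The `translate` function is written here for illustration and does not come from the database, and unknown codons are reported as `X`.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = False) -> str:
    """Translate a coding sequence; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First six and last three codons of the CDS listed above.
print(translate("ATGTTCAAACAATCAGGG"))
print(translate("AAATCTTGA", to_stop=True))
```

The first call yields `MFKQSG` and the second `KS`, which match the start and end of the protein sequence shown above. Passing the full CDS, with its line breaks, works the same way because whitespace is removed before translation.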