; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026027 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026027
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionApurinic-apyrimidinic endonuclease 2
Genome locationtig00153031:998288..1002466
RNA-Seq ExpressionSgr026027
SyntenySgr026027
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0098506 - polynucleotide 3' dephosphorylation (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0080111 - DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016829 - lyase activity (molecular function)
GO:0046403 - polynucleotide 3'-phosphatase activity (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0008311 - double-stranded DNA 3'-5' exodeoxyribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008081 - phosphoric diester hydrolase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR010666 - Zinc finger, GRF-type
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR004808 - AP endonuclease 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026737.1 DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita argyrosperma subsp. argyrosperma]1.2e-26879.61Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLRPRI Q+GSLLKLLDSFDADIICFQETKLRRQELRADL+IA+GYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPV AEEGF
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS
        TGLLESS  G+GTMPAV EGLEEFSKEELLKVD EGRCIVTDHGHFVLFNIYGPRA+SDD+ER+LFK NFYN+LQKRWE LL++GKRIFVVGDLNIAPTS
Subjt:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA
        MDRCDAGPDFENNEFRRWLRSLLVGC      F++        ++  K   H   +DAYTCWPQSTGAEVFNYGTRIDHILCAG CLH DSNLP H+IV 
Subjt:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA

Query:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI
        CHV+ECDIL+QYKRWKDGNSFRWKGE+T KLEGSDHAPVYASLLEIP+TPQHSTPSLSARY+  IHGLQQTLVSMLLKRQAAE SA+C+ISNSFSR  N+
Subjt:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI

Query:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA
        +LG+CSQ +NGS                   +ETEDSLLKTEESSGG Y E A CN L THESLHTKTLP+NE+RKRVRRSSQMSLKSFF+KN  ISN A
Subjt:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA

Query:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG
        DSSNA+SS+NK+DTS+SN  PIE+PRSDT  ++S +YL+ + DQS +NASSVE+EKSGVALLEWRRIQQVMQNSIPLCK HKE CVARVVKKQGPNNGR 
Subjt:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG

Query:  FYVCARGE
        FYVCAR E
Subjt:  FYVCARGE

XP_022133212.1 DNA-(apurinic or apyrimidinic site) lyase 2 isoform X1 [Momordica charantia]2.6e-27682.4Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLRPRI QFGSL KLLDSFDADIICFQETKLR+QE RADLVIA+GYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSS+EVALPVGAEEGF
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS
        TGLLESSQNGK TMPAV EGLEEFSK ELLKVDSEGRCIVTDHGHFVLFNIYGPRA+SDDSER+LFKL FY++LQKRWE LL +GKRIFVVGDLNIAPTS
Subjt:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA
        MDRCDAGPDFENNEFRRWLRSLLVGC    +                    H   RDAYTCWPQSTGAEVFNYG+RIDHILCAG CLHHDSNLPGHNIVA
Subjt:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA

Query:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI
        CHV+EC+IL QYKRWKDGNS RWKGERTFKLEGSDHAPVYASLLEIP+TPQHSTPSLSARYN MIHGLQQTLVSMLLKRQA EDSASC+ISNS SR  NI
Subjt:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI

Query:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA
        ILGSCSQ ++GS                   LETEDSLLKTEESSGG +PE A CN L T +SL TKTLP+NE+RKRVRRSSQMSLKSFF+KNS ISND+
Subjt:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA

Query:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG
        DSSNADS  NK+DTSQSN   IEVP+SDTQSSNSEQYLDASQDQ QL+ SSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKE CVARVVKKQGPNNGR 
Subjt:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG

Query:  FYVCARGE
        FYVCAR E
Subjt:  FYVCARGE

XP_022926300.1 DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita moschata]1.6e-26578.95Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLRPRI Q+GSLLKLLDSFDADIICFQETKLRRQELRADL+IA+GYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPV AEEGF
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS
        TGLLESS  G+GTMPAV EGLEEFSKEELLKVD EGRCIVTDHGHFVLFNIYGPRA+SDD+ER+LFK NFYN+LQKRWE LL++GKRIFVVGDLNIAPTS
Subjt:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA
        MD CDAGPDFENNEFRRWLRSLLVGC      F++        ++  K   H    DAYTCWPQSTGAEVFNYGTRIDHILCAG CLH DSNLP H+IV 
Subjt:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA

Query:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI
        CHV+ECDIL+QYKRWKDGNSFRWKGE+T KLEGSDHAPVYASLLEIP+TPQHS PSLSARYN  IHGLQQTLVSMLLK+QAAE SA+C+ISNSFS + N 
Subjt:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI

Query:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA
        +LG+CSQ +NGS                   +ETEDSLLKTEESSGG Y E A CN L THESLHTKTLP+NE+RKRVRRSSQMSLKSFF+ NS ISN A
Subjt:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA

Query:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG
        DSSNA+S++NK+DTS+SN  PIE+PRSDT  ++S +YL+ + DQS +NASSVE+EKSGVALLEWRRIQQVMQNSIPLCK HKE CVARVVKKQGPNNGR 
Subjt:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG

Query:  FYVCARGE
        FYVCAR E
Subjt:  FYVCARGE

XP_023517127.1 DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita pepo subsp. pepo]1.0e-26779.61Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLRPRI Q+GSLLKLLDSFDADIICFQETKLRRQELRADL+IA+GYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPV AEEGF
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS
        TGLLESS  G+GTMPA+ EGLEEFSKEELLKVD EGRCIVTDHGHFVLFNIYGPRA+SDD+ER+LFK NFYN+LQKRWE LL++GKRIFVVGDLNIAPTS
Subjt:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA
        MDRCDAGPDFENNEFRRWLRSLLVG       F++        ++  K   H   +DAYTCWPQSTGAEVFNYGTRIDHILCAG CLH DSNLP H+IV 
Subjt:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA

Query:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI
        CHV+ECDIL+QYKRWKDGNSFRWKGE+T KLEGSDHAPVYASLLEIP+TPQHSTPSLSARYN  IHGLQQTLVSMLLKRQAAE SA+C+ISNSFSR  N+
Subjt:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI

Query:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA
        +LG+CSQ +NGS                   +ETEDSLLKTEESSGG Y E A CN L THESLHTKTLP+NE+RKRVRRSSQMSLKSFF+KNS ISN A
Subjt:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA

Query:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG
        DSSNA+SS NK DTS+SN  PIE+PRSDT  ++S +YL+ + DQS +NASSVE+EKSGVALLEWRRIQQVMQNSIPLCK HKE CVARVVKKQGPNNGR 
Subjt:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG

Query:  FYVCARGE
        FYVCAR E
Subjt:  FYVCARGE

XP_038881293.1 DNA-(apurinic or apyrimidinic site) endonuclease 2 [Benincasa hispida]7.4e-27180.26Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLRPRI QFGSLLKLLDSFDADIIC QETKLRRQELRADL+IA+GYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPV AEEGF
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS
        TGLLESSQ+GKGTM  V EG+EEFSKEELLK+DSEGRCIVTDHGHFVLFNIYGPRAESDDSER+LFKL F+N+LQKRWE LL  GKRIFVVGDLNIAPTS
Subjt:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA
        MDRCDAGPDFENNEFRRWLRSLLV      +                    H   RDAYTCWPQSTGAEVFNYGTRIDHILCAG CLHHDSNLPGHNIVA
Subjt:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA

Query:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI
        CH+MECDIL+QYKRWKDGNSFRWKGERT KLEGSDHAPVYASLLEIP+TPQHSTPSLSARYN  IHGLQQTLVSMLLKRQAAEDSA C+ISNSFSR  NI
Subjt:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI

Query:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA
        ILG+CSQ +NGS                   LE EDSLLKTE+ SGGSY E A CN L THE LHTK LP+NE+RKRVRR SQMSLKSFF+KNS +SN+ 
Subjt:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA

Query:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG
        DSSNADSS+NK+DTS+SN  PIE+PRS+TQ S+S Q+L+  + QSQ+NASSVEKEKS VALLEWRRIQQVMQNSIPLCKGHKE CVARVVKKQGPNNGR 
Subjt:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG

Query:  FYVCARGE
        FYVCAR E
Subjt:  FYVCARGE

TrEMBL top hitse value%identityAlignment
A0A1S4DUC2 DNA-(apurinic or apyrimidinic site) endonuclease6.6e-26578.95Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLRPRI QFGSLLKLLDSFDADIIC QETKLRRQELRADLVIA+GYE+FVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPV AEEGF
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS
        TGLLESSQ+GK TM AV EGLEEFSKEELL++DSEGRCIVTDHGHFVLFNIYGPRAESDDS+R+LFKL FYN+LQKRWE LL  GKR+FVVGDLNIAPTS
Subjt:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA
        MDRCDAGPDFENNEFRRWLRSLLV C    +                    H   RDAYTCWPQSTGAEVFNYGTRIDHILCAG CLHHD++LPGHNIVA
Subjt:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA

Query:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI
        CHVMECDIL++YKRWKDGNSFRWKGE++ KLEGSDHAPV ASLLEIP+TPQHSTPSLSARYN  IHGLQQTLVSMLLKRQAAEDSA C+ SNS SR  NI
Subjt:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI

Query:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA
        ILG+CSQ  NGS                   LETEDSLLKT E +GGSY E A CN L +HESLH K LP+NE+RKRV+R SQMSLKSFF+KNS +SNDA
Subjt:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA

Query:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG
        +SSNADS ++K++TS+SN  PIE+PRS+TQ SNS + L+A QDQSQ+NAS VEKEKSGVALLEWRRIQQVMQNSIPLCKGHKE CVARVVKKQGPNNGR 
Subjt:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG

Query:  FYVCARGE
        FYVCAR E
Subjt:  FYVCARGE

A0A5A7SJ45 DNA-(apurinic or apyrimidinic site) endonuclease6.6e-26578.95Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLRPRI QFGSLLKLLDSFDADIIC QETKLRRQELRADLVIA+GYE+FVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPV AEEGF
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS
        TGLLESSQ+GK TM AV EGLEEFSKEELL++DSEGRCIVTDHGHFVLFNIYGPRAESDDS+R+LFKL FYN+LQKRWE LL  GKR+FVVGDLNIAPTS
Subjt:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA
        MDRCDAGPDFENNEFRRWLRSLLV C    +                    H   RDAYTCWPQSTGAEVFNYGTRIDHILCAG CLHHD++LPGHNIVA
Subjt:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA

Query:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI
        CHVMECDIL++YKRWKDGNSFRWKGE++ KLEGSDHAPV ASLLEIP+TPQHSTPSLSARYN  IHGLQQTLVSMLLKRQAAEDSA C+ SNS SR  NI
Subjt:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI

Query:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA
        ILG+CSQ  NGS                   LETEDSLLKT E +GGSY E A CN L +HESLH K LP+NE+RKRV+R SQMSLKSFF+KNS +SNDA
Subjt:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA

Query:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG
        +SSNADS ++K++TS+SN  PIE+PRS+TQ SNS + L+A QDQSQ+NAS VEKEKSGVALLEWRRIQQVMQNSIPLCKGHKE CVARVVKKQGPNNGR 
Subjt:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG

Query:  FYVCARGE
        FYVCAR E
Subjt:  FYVCARGE

A0A6J1BYG7 Apurinic-apyrimidinic endonuclease 21.3e-27682.4Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLRPRI QFGSL KLLDSFDADIICFQETKLR+QE RADLVIA+GYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSS+EVALPVGAEEGF
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS
        TGLLESSQNGK TMPAV EGLEEFSK ELLKVDSEGRCIVTDHGHFVLFNIYGPRA+SDDSER+LFKL FY++LQKRWE LL +GKRIFVVGDLNIAPTS
Subjt:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA
        MDRCDAGPDFENNEFRRWLRSLLVGC    +                    H   RDAYTCWPQSTGAEVFNYG+RIDHILCAG CLHHDSNLPGHNIVA
Subjt:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA

Query:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI
        CHV+EC+IL QYKRWKDGNS RWKGERTFKLEGSDHAPVYASLLEIP+TPQHSTPSLSARYN MIHGLQQTLVSMLLKRQA EDSASC+ISNS SR  NI
Subjt:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI

Query:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA
        ILGSCSQ ++GS                   LETEDSLLKTEESSGG +PE A CN L T +SL TKTLP+NE+RKRVRRSSQMSLKSFF+KNS ISND+
Subjt:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA

Query:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG
        DSSNADS  NK+DTSQSN   IEVP+SDTQSSNSEQYLDASQDQ QL+ SSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKE CVARVVKKQGPNNGR 
Subjt:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG

Query:  FYVCARGE
        FYVCAR E
Subjt:  FYVCARGE

A0A6J1EKQ4 Apurinic-apyrimidinic endonuclease 27.8e-26678.95Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLRPRI Q+GSLLKLLDSFDADIICFQETKLRRQELRADL+IA+GYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPV AEEGF
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS
        TGLLESS  G+GTMPAV EGLEEFSKEELLKVD EGRCIVTDHGHFVLFNIYGPRA+SDD+ER+LFK NFYN+LQKRWE LL++GKRIFVVGDLNIAPTS
Subjt:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA
        MD CDAGPDFENNEFRRWLRSLLVGC      F++        ++  K   H    DAYTCWPQSTGAEVFNYGTRIDHILCAG CLH DSNLP H+IV 
Subjt:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA

Query:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI
        CHV+ECDIL+QYKRWKDGNSFRWKGE+T KLEGSDHAPVYASLLEIP+TPQHS PSLSARYN  IHGLQQTLVSMLLK+QAAE SA+C+ISNSFS + N 
Subjt:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI

Query:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA
        +LG+CSQ +NGS                   +ETEDSLLKTEESSGG Y E A CN L THESLHTKTLP+NE+RKRVRRSSQMSLKSFF+ NS ISN A
Subjt:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA

Query:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG
        DSSNA+S++NK+DTS+SN  PIE+PRSDT  ++S +YL+ + DQS +NASSVE+EKSGVALLEWRRIQQVMQNSIPLCK HKE CVARVVKKQGPNNGR 
Subjt:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG

Query:  FYVCARGE
        FYVCAR E
Subjt:  FYVCARGE

A0A6J1KNT8 DNA-(apurinic or apyrimidinic site) endonuclease2.1e-26378.45Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLRPRI Q+GSLLKLLDSFDADIICFQETKLRRQELRADL+IA+GYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPV AEEGF
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS
        +GLLESS  G+GTMPAV EGLEEFSKEELLKVD EGRCIVTDHGHFVLFNIYGPRA+SDD+ER+LFK NFYN+LQKRWE LL++GKRIFVVGDLNIAPTS
Subjt:  TGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA
        MDRCDAGPDFENNEFRRW+RSLLV C      F++        ++  K   H   +DAYTCW QSTGAEVFNYGTRIDHILCAG CLH DSN PGH+IV 
Subjt:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA

Query:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI
        CHV+ECDIL+QYKRWKDGNSFR KGE+T KLEGSDHAPVYASLLEIP+TPQHSTPSLSARYN  IHGLQQTLVSMLLKRQAAE SA+C+ISNSFSR  NI
Subjt:  CHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENI

Query:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA
        +LG+CSQ +NGS                   ++TEDSLLKTEESSGG Y E A CN L THESLHTKTL +NE+RKRVRRSSQMSLKSFF+KNS ISN A
Subjt:  ILGSCSQ-MNGS-------------------LETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDA

Query:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG
        DSSNA+SS+NK+DTS+SN  PIE+PRSDT  ++S +Y + + DQS +NA SVE+EKSGVALLEWRRIQ+VMQNSIPLCK HKE CVARVVKKQGPNNGR 
Subjt:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG

Query:  FYVCARGE
        FYVCAR E
Subjt:  FYVCARGE

SwissProt top hitse value%identityAlignment
F4JNY0 DNA-(apurinic or apyrimidinic site) endonuclease 22.2e-17254.84Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLR R+ QF SLLKLLDSFDADIICFQETKLRRQEL ADL IA+GYESF SCTRTSEKGRTGYSGVATFCRVKSA SS E ALPV AEEG 
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLES-SQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPT
        TGL+ S S+ GK     V EGLEE+ KEELL +D EGRC++TDHGHFV+FN+YGPRA +DD++R+ FK  FY +L++RWECLL++G+R+FVVGDLNIAP 
Subjt:  TGLLES-SQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPT

Query:  SMDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIV
        +MDRC+AGPDFE NEFR+W RSLL   V    +FS+        ++  K   H   +DA+TCW  S+GAE FNYG+RIDHIL AGSCLH D +  GH+ +
Subjt:  SMDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIV

Query:  ACHVMECDILTQYKRWKDGN-SFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAE
        ACHV ECDILT+YKR+K+ N   RWKG    K +GSDH PV+ S  ++P+ P+HSTP L++RY  MI+G QQTLVS+  KR+A E++ +  +S S S   
Subjt:  ACHVMECDILTQYKRWKDGN-SFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAE

Query:  NII----------LGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTH-------ESLHTKTLPDNESRKRVRR--SSQMSLKSFFRKNSDISND
        N            L +C  M  SLE   S  + + +SG +  E         +        S+    +  +  RK+ R+  SSQ+SLKSFF  NS ++N 
Subjt:  NII----------LGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTH-------ESLHTKTLPDNESRKRVRR--SSQMSLKSFFRKNSDISND

Query:  ADSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGR
         DSS++  S + S   +S   P    + D++ + S      +Q+Q Q  +S+  K+K+  AL+EW+RIQ +MQNSIPLCKGHKE CVARVVKK GP  GR
Subjt:  ADSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGR

Query:  GFYVCARGE
         FYVC+R E
Subjt:  GFYVCARGE

P38207 DNA-(apurinic or apyrimidinic site) endonuclease 23.9e-2026.59Show/hide
Query:  MKIVTYNVNGLR------PRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALP-
        ++ +T+NVNG+R      P      SL  + D F ADII FQE K  +  + +     +G+ SF+S  +T    R GYSGV  + R+         AL  
Subjt:  MKIVTYNVNGLR------PRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALP-

Query:  VGAEEGFTGLLESSQNGKGTMPA----VVEGL-------EEFSKEELLKVDSEGRCIVTDHG-HFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECL
        V AEEG TG L + +NGK +  +    V +G+        +  ++  L++DSEGRC++ +     V+ ++Y P   +   E  +F+L F  +L +R   L
Subjt:  VGAEEGFTGLLESSQNGKGTMPA----VVEGL-------EEFSKEELLKVDSEGRCIVTDHG-HFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECL

Query:  LQRGKRIFVVGDLNIAPTSMDRCDAGPDFE---------------------------NNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFM
         + GK+I ++GD+N+    +D  D    F                            +   RR    +L    ++    S++ ILI         LI   
Subjt:  LQRGKRIFVVGDLNIAPTSMDRCDAGPDFE---------------------------NNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFM

Query:  LR-DAYTCWPQSTGAEVFNYGTRIDHILCA---GSCLHHDSNLPGHNIVACHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEI----
         R   YT W         NYG+RID IL +     C+     LP            DIL                       GSDH PVY+ L  +    
Subjt:  LR-DAYTCWPQSTGAEVFNYGTRIDHILCA---GSCLHHDSNLPGHNIVACHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEI----

Query:  -PETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDS
         P T Q   P   ARY   +      ++ M  K+   ++S
Subjt:  -PETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDS

Q5E9N9 DNA-(apurinic or apyrimidinic site) endonuclease 21.3e-4728.18Show/hide
Query:  MKIVTYNVNGLR----------PRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEV
        +++V++N+NG+R          P      ++ ++LD  DADI+C QETK+ R  L   L I  GY S+ S +R     R+GYSGVATFC+        + 
Subjt:  MKIVTYNVNGLR----------PRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEV

Query:  ALPVGAEEGFTGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECL
        A PV AEEG +GLL S+QNG          +++F++EEL  +DSEGR ++T H             L N+Y P A+    ERL FK+ FY +LQ R E L
Subjt:  ALPVGAEEGFTGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECL

Query:  LQRGKRIFVVGDLNIAPTSMDRCDA--GPDFENNEFRRWLRSLL--VGCVAISLTFSEQNILIGWVLWMEKFLIHFML-----RDAYTCWPQSTGAEVFN
        L  G  + ++GDLN A   +D  DA     FE +  R+W+  LL  +GC + S               M  F+  +       + A+TCW   +GA   N
Subjt:  LQRGKRIFVVGDLNIAPTSMDRCDA--GPDFENNEFRRWLRSLL--VGCVAISLTFSEQNILIGWVLWMEKFLIHFML-----RDAYTCWPQSTGAEVFN

Query:  YGTRIDHILCAGSCLHHDSNLPGHNIVACHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTL
        YG+R+D++L             G   +     +   L                    ++ GSDH PV  ++L +   P    P L   +     G Q  +
Subjt:  YGTRIDHILCAGSCLHHDSNLPGHNIVACHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTL

Query:  VSMLLKRQAAEDSASCRISNSFSRAENIILGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRK
        +  L+                  + + +   S  Q +   +      K    S                    T++ P      R     Q +L S+F+ 
Subjt:  VSMLLKRQAAEDSASCRISNSFSRAENIILGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRK

Query:  NSDISNDADSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKK
        +S            S    S+    +L  +  P++      SE+ + A+  + Q  AS  + EK  +    W+ +     + +PLC GH+E CV R VKK
Subjt:  NSDISNDADSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKK

Query:  QGPNNGRGFYVCAR
         GPN GR FY+CAR
Subjt:  QGPNNGRGFYVCAR

Q68G58 DNA-(apurinic or apyrimidinic site) endonuclease 21.4e-4928.08Show/hide
Query:  MKIVTYNVNGLRPRIVQFG---------SLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVA
        +++V++N+NG+R  +             +L ++LD  DADI+C QETK+ R  L   L I  GY S+ S +R+    R+GYSGVATFC+        + A
Subjt:  MKIVTYNVNGLRPRIVQFG---------SLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVA

Query:  LPVGAEEGFTGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLL
         PV AEEG +G+  +     G        ++EF++EEL  +DSEGR ++T H             L N+Y P A+    ERL FK+ FY +LQ R E LL
Subjt:  LPVGAEEGFTGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLL

Query:  QRGKRIFVVGDLNIAPTSMDRCDAG--PDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKF-LIHFMLRDAYTCWPQSTGAEVFNYGTRIDH
          G  + ++GDLN A   +D CDA     FE +  R+W+  LL      S    E    IG  L+M+ +  +H   + A+TCW   +GA   NYG+R+D+
Subjt:  QRGKRIFVVGDLNIAPTSMDRCDAG--PDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKF-LIHFMLRDAYTCWPQSTGAEVFNYGTRIDH

Query:  ILCAGSCLHHDSNLPGHNIVACHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSML--L
        +L             G   +     +   L                    ++ GSDH PV  ++L +   P    P+L  R+     G Q  ++  L  L
Subjt:  ILCAGSCLHHDSNLPGHNIVACHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSML--L

Query:  KRQAAEDSASCRISNSFSRAENIILGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDIS
        +++   +    + S+   +A+     +C              +  +S GG                                +  Q +L S+F+ +S + 
Subjt:  KRQAAEDSASCRISNSFSRAENIILGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDIS

Query:  NDADSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNN
                      S TS   L  + +    T    +E+   A+  + + N     K++ G     W+ +     + +PLC GH+E CV R VKK GPN 
Subjt:  NDADSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNN

Query:  GRGFYVCAR
        GR FY+CAR
Subjt:  GRGFYVCAR

Q9UBZ4 DNA-(apurinic or apyrimidinic site) endonuclease 24.8e-4728.24Show/hide
Query:  MKIVTYNVNGLR----------PRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEV
        +++V++N+NG+R          P      ++ ++LD  DADI+C QETK+ R  L   L I  GY S+ S +R     R+GYSGVATFC+        + 
Subjt:  MKIVTYNVNGLR----------PRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEV

Query:  ALPVGAEEGFTGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECL
        A PV AEEG +GL  ++QNG          ++EF++EEL  +DSEGR ++T H             L N+Y P A+    ERL+FK+ FY +LQ R E L
Subjt:  ALPVGAEEGFTGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECL

Query:  LQRGKRIFVVGDLNIAPTSMDRCDAG--PDFENNEFRRWLRSLL--VGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRI
        L  G  + ++GDLN A   +D  DA     FE +  R+W+ SLL  +GC + S        +  +  +  K         A+TCW   TGA   NYG+R+
Subjt:  LQRGKRIFVVGDLNIAPTSMDRCDAG--PDFENNEFRRWLRSLL--VGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRI

Query:  DHILCAGSCLHHDSNLPGHNIVACHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLL
        D++L             G   +     +   L                    ++ GSDH PV  ++L +   P    P L  R+     G Q  ++  L+
Subjt:  DHILCAGSCLHHDSNLPGHNIVACHVMECDILTQYKRWKDGNSFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLL

Query:  KRQAAEDSASCRISNSFSRAENIILGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDIS
          + +                 ++  S  Q N     +    K +  S                    T+  P      R     Q +LKS+F+     S
Subjt:  KRQAAEDSASCRISNSFSRAENIILGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTHESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDIS

Query:  NDADSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNN
             ++ D  +       + + P             E+   A   + Q   S  + EK  +    W+ +      + PLC GH+E CV R VKK GPN 
Subjt:  NDADSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNN

Query:  GRGFYVCAR
        GR FY+CAR
Subjt:  GRGFYVCAR

Arabidopsis top hitse value%identityAlignment
AT2G41460.1 apurinic endonuclease-redox protein7.5e-1128.29Show/hide
Query:  MKIVTYNVNGLRPRI-VQFGSLLKLLDSFDADIICFQETKLRRQEL-RADLVIANGYE-SFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAE
        +K++T+NVNGLR  +  +  S L+L    + DI+C QETKL+ +++      + +GY+ SF SC+      + GYSG A   R+K          P+   
Subjt:  MKIVTYNVNGLRPRI-VQFGSLLKLLDSFDADIICFQETKLRRQEL-RADLVIANGYE-SFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAE

Query:  EGFTGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIA
         G TGL     +G                      D+EGR +  +   F L N Y P +  D  +RL +++  ++         L++ K + + GDLN A
Subjt:  EGFTGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIA

Query:  PTSMD
           +D
Subjt:  PTSMD

AT2G41460.2 apurinic endonuclease-redox protein2.9e-0730Show/hide
Query:  MKIVTYNVNGLRPRI-VQFGSLLKLLDSFDADIICFQETKLRRQEL-RADLVIANGYE-SFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAE
        +K++T+NVNGLR  +  +  S L+L    + DI+C QETKL+ +++      + +GY+ SF SC+      + GYSG A   R+K          P+   
Subjt:  MKIVTYNVNGLRPRI-VQFGSLLKLLDSFDADIICFQETKLRRQEL-RADLVIANGYE-SFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAE

Query:  EGFTGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFK
         G TGL     +G                      D+EGR +  +   F L N Y P +  D  +RL+ K
Subjt:  EGFTGLLESSQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFK

AT4G36050.1 endonuclease/exonuclease/phosphatase family protein4.0e-8145.1Show/hide
Query:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA
        MDRC+AGPDFE NEFR+W RSLL   V    +FS+        ++  K   H   +DA+TCW  S+GAE FNYG+RIDHIL AGSCLH D +  GH+ +A
Subjt:  MDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVA

Query:  CHVMECDILTQYKRWKDGN-SFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAEN
        CHV ECDILT+YKR+K+ N   RWKG    K +GSDH PV+ S  ++P+ P+HSTP L++RY  MI+G QQTLVS+  KR+A E++ +  +S S S   N
Subjt:  CHVMECDILTQYKRWKDGN-SFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAEN

Query:  II----------LGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTH-------ESLHTKTLPDNESRKRVRR--SSQMSLKSFFRKNSDISNDA
                    L +C  M  SLE   S  + + +SG +  E         +        S+    +  +  RK+ R+  SSQ+SLKSFF  NS ++N  
Subjt:  II----------LGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTH-------ESLHTKTLPDNESRKRVRR--SSQMSLKSFFRKNSDISNDA

Query:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG
        DSS++  S + S   +S   P    + D++ + S      +Q+Q Q  +S+  K+K+  AL+EW+RIQ +MQNSIPLCKGHKE CVARVVKK GP  GR 
Subjt:  DSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGRG

Query:  FYVCARGE
        FYVC+R E
Subjt:  FYVCARGE

AT4G36050.2 endonuclease/exonuclease/phosphatase family protein1.5e-17354.84Show/hide
Query:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF
        MKIVTYNVNGLR R+ QF SLLKLLDSFDADIICFQETKLRRQEL ADL IA+GYESF SCTRTSEKGRTGYSGVATFCRVKSA SS E ALPV AEEG 
Subjt:  MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGF

Query:  TGLLES-SQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPT
        TGL+ S S+ GK     V EGLEE+ KEELL +D EGRC++TDHGHFV+FN+YGPRA +DD++R+ FK  FY +L++RWECLL++G+R+FVVGDLNIAP 
Subjt:  TGLLES-SQNGKGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPT

Query:  SMDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIV
        +MDRC+AGPDFE NEFR+W RSLL   V    +FS+        ++  K   H   +DA+TCW  S+GAE FNYG+RIDHIL AGSCLH D +  GH+ +
Subjt:  SMDRCDAGPDFENNEFRRWLRSLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIV

Query:  ACHVMECDILTQYKRWKDGN-SFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAE
        ACHV ECDILT+YKR+K+ N   RWKG    K +GSDH PV+ S  ++P+ P+HSTP L++RY  MI+G QQTLVS+  KR+A E++ +  +S S S   
Subjt:  ACHVMECDILTQYKRWKDGN-SFRWKGERTFKLEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAE

Query:  NII----------LGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTH-------ESLHTKTLPDNESRKRVRR--SSQMSLKSFFRKNSDISND
        N            L +C  M  SLE   S  + + +SG +  E         +        S+    +  +  RK+ R+  SSQ+SLKSFF  NS ++N 
Subjt:  NII----------LGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALSTH-------ESLHTKTLPDNESRKRVRR--SSQMSLKSFFRKNSDISND

Query:  ADSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGR
         DSS++  S + S   +S   P    + D++ + S      +Q+Q Q  +S+  K+K+  AL+EW+RIQ +MQNSIPLCKGHKE CVARVVKK GP  GR
Subjt:  ADSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQVMQNSIPLCKGHKERCVARVVKKQGPNNGR

Query:  GFYVCARGE
         FYVC+R E
Subjt:  GFYVCARGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATCGTCACTTACAACGTGAATGGTCTTCGGCCACGCATCGTACAGTTTGGTTCACTCCTTAAACTGCTCGATTCCTTCGACGCCGATATCATTTGCTTTCAGGA
AACGAAATTAAGGAGGCAGGAATTGCGGGCGGATTTAGTCATTGCCAATGGTTACGAATCATTCGTTTCTTGCACTCGTACCTCTGAGAAAGGTCGAACCGGCTACTCAG
GGGTTGCGACGTTTTGCCGTGTTAAGTCAGCATTTTCGAGTAATGAAGTGGCATTGCCAGTTGGTGCGGAGGAAGGCTTCACGGGTCTACTAGAAAGTTCACAAAATGGA
AAAGGCACAATGCCTGCAGTTGTAGAAGGGCTTGAGGAATTTTCTAAAGAGGAGCTTCTTAAAGTAGACAGTGAGGGACGTTGTATAGTCACAGATCATGGTCATTTTGT
TCTCTTCAACATTTATGGACCTCGAGCTGAAAGTGATGATTCCGAAAGGCTTCTATTTAAGTTAAACTTTTACAACATGCTACAGAAAAGATGGGAGTGTCTTCTGCAAC
GGGGAAAAAGGATTTTTGTTGTTGGTGATCTTAATATTGCACCCACTTCTATGGATCGCTGTGATGCAGGACCAGATTTTGAGAATAATGAGTTTCGAAGATGGCTGAGA
TCTTTACTGGTGGGATGTGTGGCCATTTCATTGACATTTTCAGAGCAAAACATCCTGATCGGTTGGGTTTTATGGATGGAGAAGTTTCTGATTCATTTTATGCTAAGAGA
TGCATACACATGCTGGCCTCAAAGCACCGGTGCTGAGGTATTCAATTATGGGACAAGGATCGACCATATATTGTGTGCTGGATCATGCTTACATCATGACAGCAACCTGC
CAGGCCATAATATAGTGGCTTGTCATGTTATGGAATGTGACATACTGACACAGTACAAACGCTGGAAAGATGGAAATTCGTTTAGGTGGAAGGGAGAACGGACCTTTAAA
CTGGAAGGTTCTGATCATGCACCTGTTTATGCAAGTTTACTGGAAATTCCTGAGACCCCTCAACATAGCACTCCATCTTTATCTGCAAGATACAATCGCATGATTCACGG
GCTTCAGCAAACTCTTGTGTCTATGCTACTGAAAAGACAAGCTGCTGAAGATTCAGCATCGTGCAGGATATCAAATTCATTTTCACGTGCTGAGAATATCATCTTAGGGA
GTTGTTCCCAGATGAATGGATCACTTGAAACTGAAGATTCTCTGTTAAAAACAGAAGAGAGTTCTGGTGGAAGTTATCCTGAGGCGGCTGTATGCAACGCCTTAAGTACA
CATGAGTCTCTACATACAAAAACATTACCTGACAATGAATCTAGGAAAAGAGTGAGAAGAAGTTCCCAGATGTCATTAAAGTCATTCTTCCGGAAAAACTCAGATATTAG
CAACGATGCTGACAGCTCTAATGCTGATTCTTCAGTTAACAAATCAGATACCTCCCAGTCTAATCTACATCCTATTGAAGTTCCTAGATCAGATACTCAAAGTAGTAATT
CAGAGCAATATTTAGACGCATCTCAGGATCAGTCTCAATTAAATGCCTCTTCTGTAGAGAAAGAGAAGAGTGGTGTTGCCTTGTTGGAGTGGCGGAGGATACAGCAGGTT
ATGCAGAATAGCATACCTCTTTGCAAGGGCCACAAGGAACGTTGTGTTGCTCGAGTAGTTAAGAAACAAGGTCCTAATAATGGCCGCGGATTTTATGTTTGTGCCCGTGG
TGAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGATCGTCACTTACAACGTGAATGGTCTTCGGCCACGCATCGTACAGTTTGGTTCACTCCTTAAACTGCTCGATTCCTTCGACGCCGATATCATTTGCTTTCAGGA
AACGAAATTAAGGAGGCAGGAATTGCGGGCGGATTTAGTCATTGCCAATGGTTACGAATCATTCGTTTCTTGCACTCGTACCTCTGAGAAAGGTCGAACCGGCTACTCAG
GGGTTGCGACGTTTTGCCGTGTTAAGTCAGCATTTTCGAGTAATGAAGTGGCATTGCCAGTTGGTGCGGAGGAAGGCTTCACGGGTCTACTAGAAAGTTCACAAAATGGA
AAAGGCACAATGCCTGCAGTTGTAGAAGGGCTTGAGGAATTTTCTAAAGAGGAGCTTCTTAAAGTAGACAGTGAGGGACGTTGTATAGTCACAGATCATGGTCATTTTGT
TCTCTTCAACATTTATGGACCTCGAGCTGAAAGTGATGATTCCGAAAGGCTTCTATTTAAGTTAAACTTTTACAACATGCTACAGAAAAGATGGGAGTGTCTTCTGCAAC
GGGGAAAAAGGATTTTTGTTGTTGGTGATCTTAATATTGCACCCACTTCTATGGATCGCTGTGATGCAGGACCAGATTTTGAGAATAATGAGTTTCGAAGATGGCTGAGA
TCTTTACTGGTGGGATGTGTGGCCATTTCATTGACATTTTCAGAGCAAAACATCCTGATCGGTTGGGTTTTATGGATGGAGAAGTTTCTGATTCATTTTATGCTAAGAGA
TGCATACACATGCTGGCCTCAAAGCACCGGTGCTGAGGTATTCAATTATGGGACAAGGATCGACCATATATTGTGTGCTGGATCATGCTTACATCATGACAGCAACCTGC
CAGGCCATAATATAGTGGCTTGTCATGTTATGGAATGTGACATACTGACACAGTACAAACGCTGGAAAGATGGAAATTCGTTTAGGTGGAAGGGAGAACGGACCTTTAAA
CTGGAAGGTTCTGATCATGCACCTGTTTATGCAAGTTTACTGGAAATTCCTGAGACCCCTCAACATAGCACTCCATCTTTATCTGCAAGATACAATCGCATGATTCACGG
GCTTCAGCAAACTCTTGTGTCTATGCTACTGAAAAGACAAGCTGCTGAAGATTCAGCATCGTGCAGGATATCAAATTCATTTTCACGTGCTGAGAATATCATCTTAGGGA
GTTGTTCCCAGATGAATGGATCACTTGAAACTGAAGATTCTCTGTTAAAAACAGAAGAGAGTTCTGGTGGAAGTTATCCTGAGGCGGCTGTATGCAACGCCTTAAGTACA
CATGAGTCTCTACATACAAAAACATTACCTGACAATGAATCTAGGAAAAGAGTGAGAAGAAGTTCCCAGATGTCATTAAAGTCATTCTTCCGGAAAAACTCAGATATTAG
CAACGATGCTGACAGCTCTAATGCTGATTCTTCAGTTAACAAATCAGATACCTCCCAGTCTAATCTACATCCTATTGAAGTTCCTAGATCAGATACTCAAAGTAGTAATT
CAGAGCAATATTTAGACGCATCTCAGGATCAGTCTCAATTAAATGCCTCTTCTGTAGAGAAAGAGAAGAGTGGTGTTGCCTTGTTGGAGTGGCGGAGGATACAGCAGGTT
ATGCAGAATAGCATACCTCTTTGCAAGGGCCACAAGGAACGTTGTGTTGCTCGAGTAGTTAAGAAACAAGGTCCTAATAATGGCCGCGGATTTTATGTTTGTGCCCGTGG
TGAG
Protein sequenceShow/hide protein sequence
MKIVTYNVNGLRPRIVQFGSLLKLLDSFDADIICFQETKLRRQELRADLVIANGYESFVSCTRTSEKGRTGYSGVATFCRVKSAFSSNEVALPVGAEEGFTGLLESSQNG
KGTMPAVVEGLEEFSKEELLKVDSEGRCIVTDHGHFVLFNIYGPRAESDDSERLLFKLNFYNMLQKRWECLLQRGKRIFVVGDLNIAPTSMDRCDAGPDFENNEFRRWLR
SLLVGCVAISLTFSEQNILIGWVLWMEKFLIHFMLRDAYTCWPQSTGAEVFNYGTRIDHILCAGSCLHHDSNLPGHNIVACHVMECDILTQYKRWKDGNSFRWKGERTFK
LEGSDHAPVYASLLEIPETPQHSTPSLSARYNRMIHGLQQTLVSMLLKRQAAEDSASCRISNSFSRAENIILGSCSQMNGSLETEDSLLKTEESSGGSYPEAAVCNALST
HESLHTKTLPDNESRKRVRRSSQMSLKSFFRKNSDISNDADSSNADSSVNKSDTSQSNLHPIEVPRSDTQSSNSEQYLDASQDQSQLNASSVEKEKSGVALLEWRRIQQV
MQNSIPLCKGHKERCVARVVKKQGPNNGRGFYVCARGE