CuGenDBv2

Lsi01G006000 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G006000
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Apurinic-apyrimidinic endonuclease 2
Genome location: chr01:4795937..4801495
RNA-Seq Expression: Lsi01G006000
Synteny: Lsi01G006000
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0098506 - polynucleotide 3' dephosphorylation (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0080111 - DNA demethylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016829 - lyase activity (molecular function)
GO:0046403 - polynucleotide 3'-phosphatase activity (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0008311 - double-stranded DNA 3'-5' exodeoxyribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0008081 - phosphoric diester hydrolase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domains:
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR020847 - AP endonuclease 1, binding site
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR004808 - AP endonuclease 1


Homology

GenBank top hits (e value, % identity, alignment)

KAG7026737.1 DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 8.3e-291; identity: 81.73%)
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLRPRIAQ+GSLLKLLDSFDADIIC QETKLRRQELRADL+IADGYESFVSCTRTSEKGRTGYSGVATFCRV SAFSSNEVALPVRAEEGF
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS
        TGLL+SS  G+GTM AVAEGLEEFSKEELLK+D EGRCIVTDHGHFVLFNIYGPRA+SDD+ERVLFK  FYN+LQKRWEHLL  GKRIFVVGDLNIAPTS
Subjt:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR
        MDRCDAGPDFENNEFRRWLRSL V CGG F DIFRAK+PDR+DAYTCWPQSTGAEVFNYGTRIDH+LCAGPCLH DSNLP HDIV CHV+ECDILSQYKR
Subjt:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR

Query:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN
        WKDGNSFRWKGE+TVKLEGSDHAPVYASLLE+P+ PQHSTP+LSARY+PKIHGLQQTLVSMLLKRQAAE SA C+ISNSFSRGN++LGNCSQG NGSFDN
Subjt:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN

Query:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS
         DL G LP ESCS TN+ETEDSLL+  E SGGGY+EEA CN LITHESLH K LPE ETRKRVRR SQMSLKSFFQKN V+SN ADSSNA+SSINKADTS
Subjt:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS

Query:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN
        ESNPIEIPRS+T I+DSG+YLE    QS INASSVE+EKSGVALLEWRRIQQ                                       GPASNPEAN
Subjt:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN

Query:  CGYFKWAASKSRH
        CGYFKWA SKSRH
Subjt:  CGYFKWAASKSRH

XP_011658045.1 DNA-(apurinic or apyrimidinic site) lyase 2 [Cucumis sativus] (e value: 8.2e-299; identity: 84.69%)
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLRPRI QFGSL KLLDSFDADIICIQETKLRRQELRADLVIADGYE+FVSCTRTSEKGRTGYSGVATFCRV SAFSSNEVALPVRAEEGF
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS
        TGLL+SSQDGK TMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRA+SDDS+RVLFKL FYNVLQKRWEHLLH+GKR+FVVGDLNIAPTS
Subjt:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR
        MDRCDAGPDFENNEFRRWLRSL VACGGRFIDIFRAK+PDRRDAYTCWPQSTGAEVFNYGTRIDH+LCAGPCLHHD+NLPGH+IV CHVMECDILSQYKR
Subjt:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR

Query:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN
        WKDGNS+RWK ERTVKLEGSDHAPV ASLLE+P+ PQHSTP+LSARYNPKIHGLQQTLVSMLLKRQAAEDSAPC+ SNS S GN  LGNCSQG NGSFDN
Subjt:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN

Query:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS
         D  G LPSESCSLTNLETEDSLLE GECSGG YA+EAAC  L THE LH KALPE  TRKRVRRCSQMSLK+FFQKNSVVSNDADSSNADSSI+K DTS
Subjt:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS

Query:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN
        ESN IEIPRSNTQISDSG+ LEAYQGQSQINA+  EKEKSGVA+LEWRRIQQ                                       GPASNPEAN
Subjt:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN

Query:  CGYFKWAASKSRHK
        CGYFKWAASKSRHK
Subjt:  CGYFKWAASKSRHK

XP_016899320.1 PREDICTED: DNA-(apurinic or apyrimidinic site) lyase 2 isoform X1 [Cucumis melo] (e value: 1.9e-303; identity: 85.18%)
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYE+FVSCTRTSEKGRTGYSGVATFCRV SAFSSNEVALPVRAEEGF
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS
        TGLL+SSQDGK TM AVAEGLEEFSKEELL+LDSEGRCIVTDHGHFVLFNIYGPRAESDDS+RVLFKLKFYNVLQKRWEHLLH+GKR+FVVGDLNIAPTS
Subjt:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR
        MDRCDAGPDFENNEFRRWLRSL VACGGRFID+FRAK+PDRRDAYTCWPQSTGAEVFNYGTRIDH+LCAGPCLHHD++LPGH+IV CHVMECDILS+YKR
Subjt:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR

Query:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN
        WKDGNSFRWKGE++VKLEGSDHAPV ASLLE+P+ PQHSTP+LSARYNPKIHGLQQTLVSMLLKRQAAEDSAPC+ SNS SRGNIILGNCSQG NGSF+N
Subjt:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN

Query:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS
         D PG LPSESCSLTNLETEDSLL+ GEC+GG YAEEAACN LI+HESLH KALPE ETRKRV+RCSQMSLKSFFQKNSVVSNDA+SSNADS I+KA+TS
Subjt:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS

Query:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN
        ESNPIEIPRSNTQ S+SG+ LEAYQ QSQINAS VEKEKSGVALLEWRRIQQ                                       GPASNPEAN
Subjt:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN

Query:  CGYFKWAASKSRHK
        CGYFKWAASKSRHK
Subjt:  CGYFKWAASKSRHK

XP_023517127.1 DNA-(apurinic or apyrimidinic site) lyase 2 [Cucurbita pepo subsp. pepo] (e value: 1.7e-288; identity: 81.08%)
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLRPRIAQ+GSLLKLLDSFDADIIC QETKLRRQELRADL+IADGYESFVSCTRTSEKGRTGYSGVATFCRV SAFSSNEVALPVRAEEGF
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS
        TGLL+SS  G+GTM A+AEGLEEFSKEELLK+D EGRCIVTDHGHFVLFNIYGPRA+SDD+ERVLFK  FYN+LQKRWEHLL  GKRIFVVGDLNIAPTS
Subjt:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR
        MDRCDAGPDFENNEFRRWLRSL V  GG F DIFR+K+PDR+DAYTCWPQSTGAEVFNYGTRIDH+LCAGPCLH DSNLP HDIV CHV+ECDILSQYKR
Subjt:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR

Query:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN
        WKDGNSFRWKGE+TVKLEGSDHAPVYASLLE+P+ PQHSTP+LSARYNPKIHGLQQTLVSMLLKRQAAE SA C+ISNSFSRGN++LGNCSQG NGSFDN
Subjt:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN

Query:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS
         DL G LP ESCS T++ETEDSLL+  E SGGGY+EEA CN LITHESLH K LPE ETRKRVRR SQMSLKSFFQKNSV+SN ADSSNA+SS NK DTS
Subjt:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS

Query:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN
        ESNPIEIPRS+T I+DSG+YLE    QS INASSVE+EKSGVALLEWRRIQQ                                       GPASNPEAN
Subjt:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN

Query:  CGYFKWAASKSRH
        CGYFKWA SKSRH
Subjt:  CGYFKWAASKSRH

XP_038881293.1 DNA-(apurinic or apyrimidinic site) endonuclease 2 [Benincasa hispida] (e value: 6.6e-309; identity: 86.81%)
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADL+IADGYESFVSCTRTSEKGRTGYSGVATFCRV SAFSSNEVALPVRAEEGF
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS
        TGLL+SSQDGKGTMA VAEG+EEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKF+NVLQKRWEHLLHLGKRIFVVGDLNIAPTS
Subjt:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR
        MDRCDAGPDFENNEFRRWLRSL VA GG FIDIFRAK+PDRRDAYTCWPQSTGAEVFNYGTRIDH+LCAGPCLHHDSNLPGH+IV CH+MECDILSQYKR
Subjt:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR

Query:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN
        WKDGNSFRWKGERTVKLEGSDHAPVYASLLE+P+ PQHSTP+LSARYNPKIHGLQQTLVSMLLKRQAAEDSAPC+ISNSFSRGNIILGNCSQG NGSFDN
Subjt:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN

Query:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS
         D  GFLP+ESCS TNLE EDSLL+  +CSGG YAEEAACN LITHE LH KALPE ETRKRVRRCSQMSLKSFFQKNSVVSN+ DSSNADSSINKADTS
Subjt:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS

Query:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN
        ESNPIEIPRSNTQISDSGQ+LE  +GQSQINASSVEKEKS VALLEWRRIQQ                                       GPASNPEAN
Subjt:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN

Query:  CGYFKWAASKSRHK
        CGYFKWAASKSRHK
Subjt:  CGYFKWAASKSRHK

TrEMBL top hits (e value, % identity, alignment)

A0A0A0KMF7 Apurinic-apyrimidinic endonuclease 2 (e value: 4.0e-299; identity: 84.69%)
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLRPRI QFGSL KLLDSFDADIICIQETKLRRQELRADLVIADGYE+FVSCTRTSEKGRTGYSGVATFCRV SAFSSNEVALPVRAEEGF
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS
        TGLL+SSQDGK TMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRA+SDDS+RVLFKL FYNVLQKRWEHLLH+GKR+FVVGDLNIAPTS
Subjt:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR
        MDRCDAGPDFENNEFRRWLRSL VACGGRFIDIFRAK+PDRRDAYTCWPQSTGAEVFNYGTRIDH+LCAGPCLHHD+NLPGH+IV CHVMECDILSQYKR
Subjt:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR

Query:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN
        WKDGNS+RWK ERTVKLEGSDHAPV ASLLE+P+ PQHSTP+LSARYNPKIHGLQQTLVSMLLKRQAAEDSAPC+ SNS S GN  LGNCSQG NGSFDN
Subjt:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN

Query:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS
         D  G LPSESCSLTNLETEDSLLE GECSGG YA+EAAC  L THE LH KALPE  TRKRVRRCSQMSLK+FFQKNSVVSNDADSSNADSSI+K DTS
Subjt:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS

Query:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN
        ESN IEIPRSNTQISDSG+ LEAYQGQSQINA+  EKEKSGVA+LEWRRIQQ                                       GPASNPEAN
Subjt:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN

Query:  CGYFKWAASKSRHK
        CGYFKWAASKSRHK
Subjt:  CGYFKWAASKSRHK

A0A1S4DUC2 DNA-(apurinic or apyrimidinic site) endonuclease (e value: 9.2e-304; identity: 85.18%)
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYE+FVSCTRTSEKGRTGYSGVATFCRV SAFSSNEVALPVRAEEGF
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS
        TGLL+SSQDGK TM AVAEGLEEFSKEELL+LDSEGRCIVTDHGHFVLFNIYGPRAESDDS+RVLFKLKFYNVLQKRWEHLLH+GKR+FVVGDLNIAPTS
Subjt:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR
        MDRCDAGPDFENNEFRRWLRSL VACGGRFID+FRAK+PDRRDAYTCWPQSTGAEVFNYGTRIDH+LCAGPCLHHD++LPGH+IV CHVMECDILS+YKR
Subjt:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR

Query:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN
        WKDGNSFRWKGE++VKLEGSDHAPV ASLLE+P+ PQHSTP+LSARYNPKIHGLQQTLVSMLLKRQAAEDSAPC+ SNS SRGNIILGNCSQG NGSF+N
Subjt:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN

Query:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS
         D PG LPSESCSLTNLETEDSLL+ GEC+GG YAEEAACN LI+HESLH KALPE ETRKRV+RCSQMSLKSFFQKNSVVSNDA+SSNADS I+KA+TS
Subjt:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS

Query:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN
        ESNPIEIPRSNTQ S+SG+ LEAYQ QSQINAS VEKEKSGVALLEWRRIQQ                                       GPASNPEAN
Subjt:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN

Query:  CGYFKWAASKSRHK
        CGYFKWAASKSRHK
Subjt:  CGYFKWAASKSRHK

A0A5A7SJ45 DNA-(apurinic or apyrimidinic site) endonuclease (e value: 9.2e-304; identity: 85.18%)
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYE+FVSCTRTSEKGRTGYSGVATFCRV SAFSSNEVALPVRAEEGF
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS
        TGLL+SSQDGK TM AVAEGLEEFSKEELL+LDSEGRCIVTDHGHFVLFNIYGPRAESDDS+RVLFKLKFYNVLQKRWEHLLH+GKR+FVVGDLNIAPTS
Subjt:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR
        MDRCDAGPDFENNEFRRWLRSL VACGGRFID+FRAK+PDRRDAYTCWPQSTGAEVFNYGTRIDH+LCAGPCLHHD++LPGH+IV CHVMECDILS+YKR
Subjt:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR

Query:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN
        WKDGNSFRWKGE++VKLEGSDHAPV ASLLE+P+ PQHSTP+LSARYNPKIHGLQQTLVSMLLKRQAAEDSAPC+ SNS SRGNIILGNCSQG NGSF+N
Subjt:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN

Query:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS
         D PG LPSESCSLTNLETEDSLL+ GEC+GG YAEEAACN LI+HESLH KALPE ETRKRV+RCSQMSLKSFFQKNSVVSNDA+SSNADS I+KA+TS
Subjt:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS

Query:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN
        ESNPIEIPRSNTQ S+SG+ LEAYQ QSQINAS VEKEKSGVALLEWRRIQQ                                       GPASNPEAN
Subjt:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN

Query:  CGYFKWAASKSRHK
        CGYFKWAASKSRHK
Subjt:  CGYFKWAASKSRHK

A0A6J1EKQ4 Apurinic-apyrimidinic endonuclease 2 (e value: 4.1e-288; identity: 81.08%)
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLRPRIAQ+GSLLKLLDSFDADIIC QETKLRRQELRADL+IADGYESFVSCTRTSEKGRTGYSGVATFCRV SAFSSNEVALPVRAEEGF
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS
        TGLL+SS  G+GTM AVAEGLEEFSKEELLK+D EGRCIVTDHGHFVLFNIYGPRA+SDD+ERVLFK  FYN+LQKRWEHLL  GKRIFVVGDLNIAPTS
Subjt:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR
        MD CDAGPDFENNEFRRWLRSL V CGG F DIFRAK+PDR DAYTCWPQSTGAEVFNYGTRIDH+LCAGPCLH DSNLP HDIV CHV+ECDILSQYKR
Subjt:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR

Query:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN
        WKDGNSFRWKGE+TVKLEGSDHAPVYASLLE+P+ PQHS P+LSARYNPKIHGLQQTLVSMLLK+QAAE SA C+ISNSFS GN +LGNCSQG NGSFDN
Subjt:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN

Query:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS
         DL G LP ESCS TN+ETEDSLL+  E SGGGY+EEA CN LITHESLH K LPE ETRKRVRR SQMSLKSFFQ NSV+SN ADSSNA+S+INKADTS
Subjt:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS

Query:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN
        ESNPIEIPRS+T I+DSG+YLE    QS INASSVE+EKSGVALLEWRRIQQ                                       GPASNPEAN
Subjt:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQQ---------------------------------------GPASNPEAN

Query:  CGYFKWAASKSRH
        CGYFKWA SKSRH
Subjt:  CGYFKWAASKSRH

A0A6J1KNT8 DNA-(apurinic or apyrimidinic site) endonuclease (e value: 1.5e-285; identity: 80.59%)
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLRPRIAQ+GSLLKLLDSFDADIIC QETKLRRQELRADL+IADGYESFVSCTRTSEKGRTGYSGVATFCRV SAFSSNEVALPVRAEEGF
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS
        +GLL+SS  G+GTM AVAEGLEEFSKEELLK+D EGRCIVTDHGHFVLFNIYGPRA+SDD+ERVLFK  FYN+LQKRWEHLL  GKRIFVVGDLNIAPTS
Subjt:  TGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTS

Query:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR
        MDRCDAGPDFENNEFRRW+RSL V CGG F DIFRAK+PDR+DAYTCW QSTGAEVFNYGTRIDH+LCAGPCLH DSN PGHDIV CHV+ECDILSQYKR
Subjt:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR

Query:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN
        WKDGNSFR KGE+TVKLEGSDHAPVYASLLE+P+ PQHSTP+LSARYNPKIHGLQQTLVSMLLKRQAAE SA C+ISNSFSRGNI+LGNCSQG NGSFDN
Subjt:  WKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDN

Query:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS
         DL G LP ESCS TN++TEDSLL+  E SGG Y+EEA CN LITHESLH K L E ETRKRVRR SQMSLKSFFQKNSV+SN ADSSNA+SSINKADTS
Subjt:  VDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTS

Query:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQ---------------------------------------QGPASNPEAN
        ESNPIEIPRS+T I+DSG+Y E    QS INA SVE+EKSGVALLEWRRIQ                                       +GPASNPEAN
Subjt:  ESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQ---------------------------------------QGPASNPEAN

Query:  CGYFKWAASKSRH
        CGYFKWA SKSRH
Subjt:  CGYFKWAASKSRH

SwissProt top hits (e value, % identity, alignment)

F4JNY0 DNA-(apurinic or apyrimidinic site) endonuclease 2 (e value: 2.8e-164; identity: 54.84%)
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLR R++QF SLLKLLDSFDADIIC QETKLRRQEL ADL IADGYESF SCTRTSEKGRTGYSGVATFCRV SA SS E ALPV AEEG 
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDS-SQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPT
        TGL++S S+ GK   + VAEGLEE+ KEELL +D EGRC++TDHGHFV+FN+YGPRA +DD++R+ FK +FY VL++RWE LL  G+R+FVVGDLNIAP 
Subjt:  TGLLDS-SQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPT

Query:  SMDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYK
        +MDRC+AGPDFE NEFR+W RSL V  GG F D+FR+K+P+R+DA+TCW  S+GAE FNYG+RIDH+L AG CLH D +  GH  + CHV ECDIL++YK
Subjt:  SMDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYK

Query:  RWKDGN-SFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSF
        R+K+ N   RWKG    K +GSDH PV+ S  +LP++P+HSTP L++RY P I+G QQTLVS+  KR+A E++    +S S S  +     C     G  
Subjt:  RWKDGN-SFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSF

Query:  DNVDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNL---ITHESLHMKALPEKETRKRVRR--CSQMSLKSFFQKNSVVSNDADSSNADSS
         N    G    +SCS  N  T       G       A   + +NL   I   S+    +     RK+ R+   SQ+SLKSFF  NS V+N  DSS++  S
Subjt:  DNVDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNL---ITHESLHMKALPEKETRKRVRR--CSQMSLKSFFQKNSVVSNDADSSNADSS

Query:  INKADTSESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQ
         + +   ES    I   N    +  +   + Q Q Q  +S+  K+K+  AL+EW+RIQ
Subjt:  INKADTSESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQ

P38207 DNA-(apurinic or apyrimidinic site) endonuclease 2 (e value: 2.1e-23; identity: 26.74%)
Query:  MKIVTYNVNGLR------PRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALP-
        ++ +T+NVNG+R      P      SL  + D F ADII  QE K  +  + +     DG+ SF+S  +T    R GYSGV  + R+         AL  
Subjt:  MKIVTYNVNGLR------PRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALP-

Query:  VRAEEGFTGLLDSSQDGKGTMAA----VAEGL-------EEFSKEELLKLDSEGRCIVTDHG-HFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHL
        V+AEEG TG L + ++GK +  +    V +G+        +  ++  L+LDSEGRC++ +     V+ ++Y P   +   E  +F+L+F  VL +R  +L
Subjt:  VRAEEGFTGLLDSSQDGKGTMAA----VAEGL-------EEFSKEELLKLDSEGRCIVTDHG-HFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHL

Query:  LHLGKRIFVVGDLNIAPTSMDRCDAGPDFE---------------------------NNEFRRWLRSL-------QVACGGRFIDIFR-AKNPDRRDAYT
          +GK+I ++GD+N+    +D  D    F                            +   RR    +         +  G  ID  R  +  +R   YT
Subjt:  LHLGKRIFVVGDLNIAPTSMDRCDAGPDFE---------------------------NNEFRRWLRSL-------QVACGGRFIDIFR-AKNPDRRDAYT

Query:  CWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKRWKDGNSFRWKGERTVKLEGSDHAPVYASLLEL-----PEVPQHSTP
         W         NYG+RID +L +   L  +  +   DI+       DIL                       GSDH PVY+ L  L     P   Q   P
Subjt:  CWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKRWKDGNSFRWKGERTVKLEGSDHAPVYASLLEL-----PEVPQHSTP

Query:  ALSARYNPKIHGLQQTLVSMLLKRQAAEDS
           ARY  K +     ++ M  K+   ++S
Subjt:  ALSARYNPKIHGLQQTLVSMLLKRQAAEDS

Q5E9N9 DNA-(apurinic or apyrimidinic site) endonuclease 2 (e value: 1.3e-49; identity: 29.74%)
Query:  MKIVTYNVNGLR----------PRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEV
        +++V++N+NG+R          P      ++ ++LD  DADI+C+QETK+ R  L   L I +GY S+ S +R     R+GYSGVATFC+        + 
Subjt:  MKIVTYNVNGLR----------PRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEV

Query:  ALPVRAEEGFTGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHL
        A PV AEEG +GLL +     G        +++F++EEL  LDSEGR ++T H             L N+Y P A+    ER+ FK++FY +LQ R E L
Subjt:  ALPVRAEEGFTGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHL

Query:  LHLGKRIFVVGDLNIAPTSMDRCDA--GPDFENNEFRRWLRSLQVACG-------GRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPC
        L  G  + ++GDLN A   +D  DA     FE +  R+W+  L    G       G FID +R   P ++ A+TCW   +GA   NYG+R+D+VL     
Subjt:  LHLGKRIFVVGDLNIAPTSMDRCDA--GPDFENNEFRRWLRSLQVACG-------GRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPC

Query:  LHHDSNLPGHDIVVCHVMECDILSQYKRWKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLL--------K
                G   +V    +   L                    ++ GSDH PV  ++L +  VP    P L   + P+  G Q  ++  L+        K
Subjt:  LHHDSNLPGHDIVVCHVMECDILSQYKRWKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLL--------K

Query:  RQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDNVD--LPGFLPSES---CSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKET
        + A + S   ++    ++  +          GS       +  F PS S    S  +L +  +L+ P        +EE    N++  ++   +A  EKE 
Subjt:  RQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDNVD--LPGFLPSES---CSLTNLETEDSLLEPGECSGGGYAEEAACNNLITHESLHMKALPEKET

Query:  R
        R
Subjt:  R

Q68G58 DNA-(apurinic or apyrimidinic site) endonuclease 2 (e value: 3.2e-51; identity: 33.08%)
Query:  MKIVTYNVNGLRPRIAQFG---------SLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVA
        +++V++N+NG+R  +             +L ++LD  DADI+C+QETK+ R  L   L I +GY S+ S +R+    R+GYSGVATFC+        + A
Subjt:  MKIVTYNVNGLRPRIAQFG---------SLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVA

Query:  LPVRAEEGFTGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLL
         PV AEEG +G+  +     G        ++EF++EEL  LDSEGR ++T H             L N+Y P A+    ER+ FK++FY +LQ R E LL
Subjt:  LPVRAEEGFTGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLL

Query:  HLGKRIFVVGDLNIAPTSMDRCDAG--PDFENNEFRRWLRSLQVACG-------GRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCL
          G  + ++GDLN A   +D CDA     FE +  R+W+  L    G       G F+D +R  +P ++ A+TCW   +GA   NYG+R+D+VL      
Subjt:  HLGKRIFVVGDLNIAPTSMDRCDAG--PDFENNEFRRWLRSLQVACG-------GRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCL

Query:  HHDSNLPGHDIVVCHVMECDILSQYKRWKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLL
               G   +V    +   L                    ++ GSDH PV  ++L +  VP    PAL  R+ P+  G Q  ++  L+
Subjt:  HHDSNLPGHDIVVCHVMECDILSQYKRWKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLL

Q9UBZ4 DNA-(apurinic or apyrimidinic site) endonuclease 2 (e value: 1.4e-51; identity: 33.76%)
Query:  MKIVTYNVNGLR----------PRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEV
        +++V++N+NG+R          P      ++ ++LD  DADI+C+QETK+ R  L   L I +GY S+ S +R     R+GYSGVATFC+ N        
Subjt:  MKIVTYNVNGLR----------PRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEV

Query:  ALPVRAEEGFTGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHL
        A PV AEEG +GL  +     G        ++EF++EEL  LDSEGR ++T H             L N+Y P A+    ER++FK++FY +LQ R E L
Subjt:  ALPVRAEEGFTGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDH---------GHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHL

Query:  LHLGKRIFVVGDLNIAPTSMDRCDAG--PDFENNEFRRWLRSLQVACG-------GRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPC
        L  G  + ++GDLN A   +D  DA     FE +  R+W+ SL    G       G FID +R   P +  A+TCW   TGA   NYG+R+D+VL     
Subjt:  LHLGKRIFVVGDLNIAPTSMDRCDAG--PDFENNEFRRWLRSLQVACG-------GRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPC

Query:  LHHDSNLPGHDIVVCHVMECDILSQYKRWKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLL
                G   +V    +   L                    ++ GSDH PV  ++L +  VP    P L  R+ P+  G Q  ++  L+
Subjt:  LHHDSNLPGHDIVVCHVMECDILSQYKRWKDGNSFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLL

Arabidopsis top hits (e value, % identity, alignment)

AT2G41460.1 apurinic endonuclease-redox protein (e value: 3.7e-15; identity: 26.84%)
Query:  MKIVTYNVNGLRPRIA-QFGSLLKLLDSFDADIICIQETKLRRQEL-RADLVIADGYE-SFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAE
        +K++T+NVNGLR  +  +  S L+L    + DI+C+QETKL+ +++      + DGY+ SF SC+      + GYSG A   R+          L VR  
Subjt:  MKIVTYNVNGLRPRIA-QFGSLLKLLDSFDADIICIQETKLRRQEL-RADLVIADGYE-SFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAE

Query:  EGFTGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIA
         G +G                              D+EGR +  +   F L N Y P +  D  +R+ ++++ ++         L   K + + GDLN A
Subjt:  EGFTGLLDSSQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIA

Query:  PTSMDRCDAGPDFENNEFRRWLRSLQVA--CGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVL
           +D  +   +  +  F    R    A      F+D FR ++P     YT W    G    N G R+D+ L
Subjt:  PTSMDRCDAGPDFENNEFRRWLRSLQVA--CGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVL

AT2G41460.2 apurinic endonuclease-redox protein (e value: 4.2e-06; identity: 38.1%)
Query:  MKIVTYNVNGLRPRIA-QFGSLLKLLDSFDADIICIQETKLRRQEL-RADLVIADGYE-SFVSCTRTSEKGRTGYSGVATFCRV
        +K++T+NVNGLR  +  +  S L+L    + DI+C+QETKL+ +++      + DGY+ SF SC+      + GYSG A   R+
Subjt:  MKIVTYNVNGLRPRIA-QFGSLLKLLDSFDADIICIQETKLRRQEL-RADLVIADGYE-SFVSCTRTSEKGRTGYSGVATFCRV

AT4G36050.1 endonuclease/exonuclease/phosphatase family protein    5.1e-81    41.67
Query:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR
        MDRC+AGPDFE NEFR+W RSL V  GG F D+FR+K+P+R+DA+TCW  S+GAE FNYG+RIDH+L AG CLH D +  GH  + CHV ECDIL++YKR
Subjt:  MDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKR

Query:  WKDGN-SFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFD
        +K+ N   RWKG    K +GSDH PV+ S  +LP++P+HSTP L++RY P I+G QQTLVS+  KR+A E++    +S S S  +     C     G   
Subjt:  WKDGN-SFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFD

Query:  NVDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNL---ITHESLHMKALPEKETRKRVRR--CSQMSLKSFFQKNSVVSNDADSSNADSSI
        N    G    +SCS  N  T       G       A   + +NL   I   S+    +     RK+ R+   SQ+SLKSFF  NS V+N  DSS++  S 
Subjt:  NVDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNL---ITHESLHMKALPEKETRKRVRR--CSQMSLKSFFQKNSVVSNDADSSNADSSI

Query:  NKADTSESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQ---------------------------------------QGPA
        + +   ES    I   N    +  +   + Q Q Q  +S+  K+K+  AL+EW+RIQ                                       +GP+
Subjt:  NKADTSESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQ---------------------------------------QGPA

Query:  SNPEANCGYFKWAASKSRHK
        SNPEANCGYFKWA+SK R K
Subjt:  SNPEANCGYFKWAASKSRHK

AT4G36050.2 endonuclease/exonuclease/phosphatase family protein    2.0e-165    54.84
Query:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF
        MKIVTYNVNGLR R++QF SLLKLLDSFDADIIC QETKLRRQEL ADL IADGYESF SCTRTSEKGRTGYSGVATFCRV SA SS E ALPV AEEG 
Subjt:  MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGF

Query:  TGLLDS-SQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPT
        TGL++S S+ GK   + VAEGLEE+ KEELL +D EGRC++TDHGHFV+FN+YGPRA +DD++R+ FK +FY VL++RWE LL  G+R+FVVGDLNIAP 
Subjt:  TGLLDS-SQDGKGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPT

Query:  SMDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYK
        +MDRC+AGPDFE NEFR+W RSL V  GG F D+FR+K+P+R+DA+TCW  S+GAE FNYG+RIDH+L AG CLH D +  GH  + CHV ECDIL++YK
Subjt:  SMDRCDAGPDFENNEFRRWLRSLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYK

Query:  RWKDGN-SFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSF
        R+K+ N   RWKG    K +GSDH PV+ S  +LP++P+HSTP L++RY P I+G QQTLVS+  KR+A E++    +S S S  +     C     G  
Subjt:  RWKDGN-SFRWKGERTVKLEGSDHAPVYASLLELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSF

Query:  DNVDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNL---ITHESLHMKALPEKETRKRVRR--CSQMSLKSFFQKNSVVSNDADSSNADSS
         N    G    +SCS  N  T       G       A   + +NL   I   S+    +     RK+ R+   SQ+SLKSFF  NS V+N  DSS++  S
Subjt:  DNVDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAACNNL---ITHESLHMKALPEKETRKRVRR--CSQMSLKSFFQKNSVVSNDADSSNADSS

Query:  INKADTSESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQ
         + +   ES    I   N    +  +   + Q Q Q  +S+  K+K+  AL+EW+RIQ
Subjt:  INKADTSESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRIQ


Sequences
CDS sequence
ATGAAGATAGTGACTTACAACGTGAATGGTCTTAGGCCACGCATCGCACAGTTTGGTTCACTTCTTAAACTGCTCGATTCCTTCGATGCCGATATAATTTGCATTCAGGA
AACGAAATTAAGGAGGCAGGAATTGCGAGCGGATTTAGTCATCGCTGATGGTTACGAATCATTCGTTTCTTGCACCCGTACCTCTGAGAAAGGTCGAACCGGCTACTCAG
GGGTTGCTACGTTTTGCCGTGTTAATTCAGCATTTTCGAGTAATGAAGTAGCATTGCCAGTTCGGGCGGAGGAAGGCTTCACAGGTCTTCTAGATAGTTCGCAGGATGGA
AAAGGCACAATGGCTGCAGTTGCAGAAGGGCTTGAGGAATTTTCGAAAGAGGAGCTCCTTAAATTAGACAGTGAGGGGCGCTGTATTGTCACAGATCATGGTCATTTTGT
TCTCTTCAACATTTATGGCCCTCGAGCTGAAAGTGATGATTCAGAAAGGGTTCTATTCAAGTTAAAATTTTACAATGTGCTACAGAAAAGATGGGAGCATCTTCTGCATC
TAGGAAAAAGGATATTTGTCGTTGGTGATCTTAATATTGCACCTACTTCAATGGATCGCTGTGATGCAGGACCAGATTTTGAGAATAATGAGTTTCGGAGATGGCTGAGA
TCTTTACAGGTGGCATGTGGTGGCCGTTTCATTGACATTTTCAGAGCAAAAAATCCTGATCGAAGAGATGCATATACATGCTGGCCTCAAAGTACAGGTGCCGAGGTATT
CAATTATGGGACAAGGATTGATCATGTATTGTGTGCTGGACCATGCTTACATCATGACAGTAACCTGCCAGGCCATGATATTGTAGTTTGTCATGTTATGGAATGTGACA
TACTGTCACAGTACAAACGCTGGAAAGATGGAAATTCATTTAGATGGAAGGGAGAGCGGACTGTTAAACTAGAAGGTTCTGATCATGCACCGGTTTATGCAAGTCTTCTG
GAACTGCCTGAGGTTCCTCAACATAGTACTCCAGCTTTATCTGCAAGATACAATCCCAAGATTCACGGGCTTCAGCAAACTCTTGTGTCAATGCTTCTGAAAAGACAAGC
TGCTGAAGATTCAGCACCGTGCAGAATATCAAATTCATTTTCACGTGGGAACATCATCTTAGGGAATTGTTCTCAGGGACATAATGGATCATTTGATAATGTTGACCTAC
CTGGCTTTCTTCCTAGTGAATCTTGTTCTTTGACAAACCTAGAAACAGAAGATTCTCTATTAGAACCAGGAGAGTGTTCTGGTGGAGGTTATGCTGAGGAGGCTGCATGC
AACAACTTAATTACGCACGAGTCTCTACATATGAAAGCATTGCCTGAGAAGGAAACTAGGAAAAGAGTTAGAAGGTGTTCCCAGATGTCATTAAAGTCGTTCTTCCAGAA
AAACTCAGTTGTTAGCAACGATGCTGACAGCTCTAATGCAGATTCTTCTATTAACAAAGCAGATACCTCCGAATCTAATCCTATTGAAATTCCTAGATCAAATACTCAAA
TTAGCGATTCAGGCCAATATTTAGAAGCATACCAGGGTCAGTCTCAAATTAATGCCTCTTCTGTAGAGAAAGAAAAGAGTGGTGTTGCCTTGTTGGAGTGGCGGAGGATA
CAGCAGGGCCCCGCATCTAATCCGGAAGCAAATTGTGGTTACTTTAAATGGGCGGCTTCCAAATCTCGGCATAAATGA
mRNA sequence
AACGTTTTAATAATGAAAAGGAAAAAATGGACCCCTTAGGGAGACAGACTCCTTGCCGCGAGAACGTTGCGCGCTACGACGCAAGCCGACGTTTCCGCCAAGCACCGTCG
TTTTTCCATCTTCCGGTGGTCGGTTTTTCGCCGGCGCTGCTGCAGTAGACGGGTTGCCTAGCTATTCAGGACAAATTATTGTCCATCTCTTCTCAGAGAGAGGAAGACAA
TTTACATTGAAATGAAAAGGCAACGAATTACAAGATGAAGATAGTGACTTACAACGTGAATGGTCTTAGGCCACGCATCGCACAGTTTGGTTCACTTCTTAAACTGCTCG
ATTCCTTCGATGCCGATATAATTTGCATTCAGGAAACGAAATTAAGGAGGCAGGAATTGCGAGCGGATTTAGTCATCGCTGATGGTTACGAATCATTCGTTTCTTGCACC
CGTACCTCTGAGAAAGGTCGAACCGGCTACTCAGGGGTTGCTACGTTTTGCCGTGTTAATTCAGCATTTTCGAGTAATGAAGTAGCATTGCCAGTTCGGGCGGAGGAAGG
CTTCACAGGTCTTCTAGATAGTTCGCAGGATGGAAAAGGCACAATGGCTGCAGTTGCAGAAGGGCTTGAGGAATTTTCGAAAGAGGAGCTCCTTAAATTAGACAGTGAGG
GGCGCTGTATTGTCACAGATCATGGTCATTTTGTTCTCTTCAACATTTATGGCCCTCGAGCTGAAAGTGATGATTCAGAAAGGGTTCTATTCAAGTTAAAATTTTACAAT
GTGCTACAGAAAAGATGGGAGCATCTTCTGCATCTAGGAAAAAGGATATTTGTCGTTGGTGATCTTAATATTGCACCTACTTCAATGGATCGCTGTGATGCAGGACCAGA
TTTTGAGAATAATGAGTTTCGGAGATGGCTGAGATCTTTACAGGTGGCATGTGGTGGCCGTTTCATTGACATTTTCAGAGCAAAAAATCCTGATCGAAGAGATGCATATA
CATGCTGGCCTCAAAGTACAGGTGCCGAGGTATTCAATTATGGGACAAGGATTGATCATGTATTGTGTGCTGGACCATGCTTACATCATGACAGTAACCTGCCAGGCCAT
GATATTGTAGTTTGTCATGTTATGGAATGTGACATACTGTCACAGTACAAACGCTGGAAAGATGGAAATTCATTTAGATGGAAGGGAGAGCGGACTGTTAAACTAGAAGG
TTCTGATCATGCACCGGTTTATGCAAGTCTTCTGGAACTGCCTGAGGTTCCTCAACATAGTACTCCAGCTTTATCTGCAAGATACAATCCCAAGATTCACGGGCTTCAGC
AAACTCTTGTGTCAATGCTTCTGAAAAGACAAGCTGCTGAAGATTCAGCACCGTGCAGAATATCAAATTCATTTTCACGTGGGAACATCATCTTAGGGAATTGTTCTCAG
GGACATAATGGATCATTTGATAATGTTGACCTACCTGGCTTTCTTCCTAGTGAATCTTGTTCTTTGACAAACCTAGAAACAGAAGATTCTCTATTAGAACCAGGAGAGTG
TTCTGGTGGAGGTTATGCTGAGGAGGCTGCATGCAACAACTTAATTACGCACGAGTCTCTACATATGAAAGCATTGCCTGAGAAGGAAACTAGGAAAAGAGTTAGAAGGT
GTTCCCAGATGTCATTAAAGTCGTTCTTCCAGAAAAACTCAGTTGTTAGCAACGATGCTGACAGCTCTAATGCAGATTCTTCTATTAACAAAGCAGATACCTCCGAATCT
AATCCTATTGAAATTCCTAGATCAAATACTCAAATTAGCGATTCAGGCCAATATTTAGAAGCATACCAGGGTCAGTCTCAAATTAATGCCTCTTCTGTAGAGAAAGAAAA
GAGTGGTGTTGCCTTGTTGGAGTGGCGGAGGATACAGCAGGGCCCCGCATCTAATCCGGAAGCAAATTGTGGTTACTTTAAATGGGCGGCTTCCAAATCTCGGCATAAAT
GA
Protein sequence
MKIVTYNVNGLRPRIAQFGSLLKLLDSFDADIICIQETKLRRQELRADLVIADGYESFVSCTRTSEKGRTGYSGVATFCRVNSAFSSNEVALPVRAEEGFTGLLDSSQDG
KGTMAAVAEGLEEFSKEELLKLDSEGRCIVTDHGHFVLFNIYGPRAESDDSERVLFKLKFYNVLQKRWEHLLHLGKRIFVVGDLNIAPTSMDRCDAGPDFENNEFRRWLR
SLQVACGGRFIDIFRAKNPDRRDAYTCWPQSTGAEVFNYGTRIDHVLCAGPCLHHDSNLPGHDIVVCHVMECDILSQYKRWKDGNSFRWKGERTVKLEGSDHAPVYASLL
ELPEVPQHSTPALSARYNPKIHGLQQTLVSMLLKRQAAEDSAPCRISNSFSRGNIILGNCSQGHNGSFDNVDLPGFLPSESCSLTNLETEDSLLEPGECSGGGYAEEAAC
NNLITHESLHMKALPEKETRKRVRRCSQMSLKSFFQKNSVVSNDADSSNADSSINKADTSESNPIEIPRSNTQISDSGQYLEAYQGQSQINASSVEKEKSGVALLEWRRI
QQGPASNPEANCGYFKWAASKSRHK
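
The CDS and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code applies and that the CDS ends with a stop codon; the function names `translate` and `cds_matches_protein` are hypothetical helpers introduced for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS string (line breaks allowed) into one-letter amino acids."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def cds_matches_protein(cds: str, protein: str) -> bool:
    """Return True if the CDS translates to the protein followed by one stop codon."""
    translated = translate(cds)
    expected = "".join(protein.split()).upper()
    return translated.endswith("*") and translated[:-1] == expected


# First 30 nucleotides of the CDS record above.
print(translate("ATGAAGATAGTGACTTACAACGTGAATGGT"))  # MKIVTYNVNG
```

Pasting the full CDS and protein records into `cds_matches_protein` would perform the same check over the whole gene; line breaks in the pasted text are ignored.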