; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G16775 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G16775
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionEndonuclease III homolog
Genome locationctg24:970385..973297
RNA-Seq ExpressionCucsat.G16775
SyntenyCucsat.G16775
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0032259 - methylation (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0000703 - oxidized pyrimidine nucleobase lesion DNA N-glycosylase activity (molecular function)
InterPro domainsIPR030841 - Endonuclease III-like protein 1
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
IPR011257 - DNA glycosylase
IPR004036 - Endonuclease III-like, conserved site-2
IPR004035 - Endonuclease III, iron-sulphur binding site
IPR003651 - Endonuclease III-like, iron-sulphur cluster loop motif
IPR003265 - HhH-GPD domain
IPR000445 - Helix-hairpin-helix motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044590.1 endonuclease III-like protein 1 [Cucumis melo var. makuwa]4.34e-26795.63Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKI+TKRS PPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSR+ KPP DLLLNGIE S PTTHKG A R KPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  ---AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI
           AALRLQESGLLTA+AMDKADEETIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI
Subjt:  ---AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI

Query:  CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

TYK16994.1 endonuclease III-like protein 1 [Cucumis melo var. makuwa]7.01e-25596.96Show/hide
Query:  MSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIE
        MSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSR+ KPP DLLLNGIE
Subjt:  MSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIE

Query:  DSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETI
         S PTTHKG A RGKPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTA+AMDKADEETI
Subjt:  DSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETI

Query:  KSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE
        KSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE
Subjt:  KSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE

Query:  LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

XP_004152104.1 endonuclease III homolog 1, chloroplastic [Cucumis sativus]9.02e-282100Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

XP_008453986.1 PREDICTED: endonuclease III homolog 1, chloroplastic-like [Cucumis melo]1.26e-27297.15Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSR+ KPP DLLLNGIE S PTTHKG A RGKPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTA+AMDKADEETIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

XP_038878188.1 endonuclease III homolog 2, chloroplastic-like isoform X3 [Benincasa hispida]5.16e-25692.23Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        M FACPIR PA SITFARRITCS MSKGS SSLPTSSNEVPPNPGIS VKSSNGVSE ETRVFVRRRVKK AE Q SG EVEPK+D KR CPPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRK K PLD LLNGIE SNPT  KG AE GKPPVNWEKVL+GIR+MRSSEEAPVDTMGCG+AGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTADAMDKADE TIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELL+LPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LGWVSGKGSKQKTSTPEETRV LELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

TrEMBL top hitse value%identityAlignment
A0A0A0KU72 ENDO3c domain-containing protein3.97e-25392.75Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LGWVSGKGSKQ                            VGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

A0A1S3BX24 Endonuclease III homolog6.11e-27397.15Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSR+ KPP DLLLNGIE S PTTHKG A RGKPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTA+AMDKADEETIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

A0A5A7TN21 Endonuclease III homolog2.10e-26795.63Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKI+TKRS PPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSR+ KPP DLLLNGIE S PTTHKG A R KPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  ---AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI
           AALRLQESGLLTA+AMDKADEETIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI
Subjt:  ---AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI

Query:  CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

A0A5D3D186 Endonuclease III homolog3.39e-25596.96Show/hide
Query:  MSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIE
        MSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSR+ KPP DLLLNGIE
Subjt:  MSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIE

Query:  DSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETI
         S PTTHKG A RGKPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTA+AMDKADEETI
Subjt:  DSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETI

Query:  KSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE
        KSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE
Subjt:  KSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE

Query:  LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

A0A6J1GW15 Endonuclease III homolog1.96e-23687.24Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MF ACPIR  ALSITFARRITC+ MSK SSSS+P +SNE PP  GIS V+SSNGVS+ ETRVFVRR VKK AE Q SG ++E K D KR CPP+IEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRK KPPLD LL  IEDSNPT  KG AE GK PV+WEKVL+GIREMRSSE APVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTADAMDKADE TIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDV+GICVDTHVHRICNR
Subjt:  AALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTK
        LGWVSGKGSKQKTS+PEETRV LELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVS LCPSAFKE+SSPSPKLK SSSTK
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTK

SwissProt top hitse value%identityAlignment
A7M7B9 Endonuclease III-like protein 11.3e-6153.3Show/hide
Query:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRAGST-LPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAK
        P NW++ L+ IREMR   +APVD MG  +   T  PP+  R+ VL S +LSSQTKD VT  A LRL++ G LT D++ + D+ T+  +IYPVGF+  K K
Subjt:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRAGST-LPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAK

Query:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL
         +K+   I   KYGGDIP ++ EL+ LPG+GPK+AHL M +AWN V GI VDTHVHRI NRL WV     K++T  PEETRV LE WLP++ W  IN LL
Subjt:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL

Query:  VGFGQTICTPLRPKCGNCSVSDLCPSA
        VGFGQ  C P+ P+C  C   D+CP+A
Subjt:  VGFGQTICTPLRPKCGNCSVSDLCPSA

B9DFZ0 Endonuclease III homolog 2, chloroplastic5.5e-10859.65Show/hide
Query:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW
        S+ ++++G SE ETRV +R +R+K+                 K  C  P+IED  +K+T  +  SR  K  L+  +   E S   +    A  G PP NW
Subjt:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW

Query:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKI
        EKVL+GIR+M+ SEEAPV+ + C R GS LPPKERRF VL  +LLSSQTK+H+T  A  RL ++GLLT +A+DKADE TIK LIYPVGFY+ KA N+KK+
Subjt:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKI

Query:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQ
        A+ICLM+Y GDIPR+L ELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKTS+PEETRV L+ WLPK EWV IN LLVGFGQ
Subjt:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQ

Query:  TICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        TICTPLRP CG CS++++CPSAFKE+ S S KLK S  +KKL
Subjt:  TICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

P78549 Endonuclease III-like protein 11.5e-6052.86Show/hide
Query:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAK
        P +W++ L  IR MR+ ++APVD +G      S+ PPK RR+ VL S +LSSQTKD VT GA  RL+  G LT D++ + D+ T+  LIYPVGF+ +K K
Subjt:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAK

Query:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL
         +K+ + I    YGGDIP S+AEL+ LPG+GPK+AHL M +AW  V GI VDTHVHRI NRL W     +K+ T +PEETR  LE WLP+E W  IN LL
Subjt:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL

Query:  VGFGQTICTPLRPKCGNCSVSDLCPSA
        VGFGQ  C P+ P+C  C    LCP+A
Subjt:  VGFGQTICTPLRPKCGNCSVSDLCPSA

Q2KID2 Endonuclease III-like protein 11.2e-5952.42Show/hide
Query:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAK
        P +W + L  IR MRS ++APVD +G       +  PK RR+ VL S +LSSQTKD VT GA  RL+  G LT D++ + D+ T+ +LIYPVGF+ +K K
Subjt:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAK

Query:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL
         +K+ + I   +Y GDIP S+AEL+ LPG+GPK+AHL M +AW  V GI VDTHVHRI NRL W     +K+ T +PEETR  LE WLP+E W  IN LL
Subjt:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL

Query:  VGFGQTICTPLRPKCGNCSVSDLCPSA
        VGFGQ  C P+RP+C  C    LCP+A
Subjt:  VGFGQTICTPLRPKCGNCSVSDLCPSA

Q9SIC4 Endonuclease III homolog 1, chloroplastic4.7e-10757.95Show/hide
Query:  FARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKP
        F R  T S    G+ SS    S +       S+ + + G S  ETRV+ R++  K    +         ++T + C  P+IEDFA+K+T  SP S +   
Subjt:  FARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKP

Query:  PLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTAD
                 E S   T    A  G PP NW +VL+GIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V + A  RL ++GLLT +
Subjt:  PLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTAD

Query:  AMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTS
        A+DKADE TIK LIYPVGFY+ KA  +KKIARICL+KY GDIP SL +LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKT+
Subjt:  AMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTS

Query:  TPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK
        +PEETRV L+ WLPKEEWV INPLLVGFGQ ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK S+ +K+
Subjt:  TPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK

Arabidopsis top hitse value%identityAlignment
AT1G05900.1 endonuclease III 23.3e-7656.18Show/hide
Query:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW
        S+ ++++G SE ETRV +R +R+K+                 K  C  P+IED  +K+T  +  SR  K  L+  +   E S   +    A  G PP NW
Subjt:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW

Query:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKI
        EKVL+GIR+M+ SEEAPV+ + C R GS LPPKERRF VL  +LLSSQTK+H+T  A  RL ++GLLT +A+DKADE TIK LIYPVGFY+ KA N+KK+
Subjt:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKI

Query:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ
        A+ICLM+Y GDIPR+L ELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQ
Subjt:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ

AT1G05900.2 endonuclease III 23.9e-10959.65Show/hide
Query:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW
        S+ ++++G SE ETRV +R +R+K+                 K  C  P+IED  +K+T  +  SR  K  L+  +   E S   +    A  G PP NW
Subjt:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW

Query:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKI
        EKVL+GIR+M+ SEEAPV+ + C R GS LPPKERRF VL  +LLSSQTK+H+T  A  RL ++GLLT +A+DKADE TIK LIYPVGFY+ KA N+KK+
Subjt:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKI

Query:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQ
        A+ICLM+Y GDIPR+L ELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKTS+PEETRV L+ WLPK EWV IN LLVGFGQ
Subjt:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQ

Query:  TICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        TICTPLRP CG CS++++CPSAFKE+ S S KLK S  +KKL
Subjt:  TICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

AT2G31450.1 DNA glycosylase superfamily protein3.3e-10857.95Show/hide
Query:  FARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKP
        F R  T S    G+ SS    S +       S+ + + G S  ETRV+ R++  K    +         ++T + C  P+IEDFA+K+T  SP S +   
Subjt:  FARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKP

Query:  PLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTAD
                 E S   T    A  G PP NW +VL+GIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V + A  RL ++GLLT +
Subjt:  PLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTAD

Query:  AMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTS
        A+DKADE TIK LIYPVGFY+ KA  +KKIARICL+KY GDIP SL +LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKT+
Subjt:  AMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTS

Query:  TPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK
        +PEETRV L+ WLPKEEWV INPLLVGFGQ ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK S+ +K+
Subjt:  TPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK

AT2G31450.2 DNA glycosylase superfamily protein6.0e-11061Show/hide
Query:  ISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW
        +S   S+ G S  ETRV+ R++  K    +         ++T + C  P+IEDFA+K+T  SP S +            E S   T    A  G PP NW
Subjt:  ISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW

Query:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKI
         +VL+GIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V + A  RL ++GLLT +A+DKADE TIK LIYPVGFY+ KA  +KKI
Subjt:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKADEETIKSLIYPVGFYSTKAKNLKKI

Query:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQ
        ARICL+KY GDIP SL +LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKT++PEETRV L+ WLPKEEWV INPLLVGFGQ
Subjt:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQ

Query:  TICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK
         ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK S+ +K+
Subjt:  TICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK

AT4G12740.1 HhH-GPD base excision DNA repair family protein4.0e-0525.69Show/hide
Query:  DEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEET
        +++ +  +   +G+Y  +A+ L + A++ +    G  P   + L+ + GIG   A  I  +A+N+   + VD +V R+  RL  +S    K + +     
Subjt:  DEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEET

Query:  RVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLC
        ++  +L  P       N  L+  G T+CT  +P C +C VS  C
Subjt:  RVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTTTGCTTGTCCTATTCGAATACCGGCGCTTTCAATCACATTTGCTCGAAGAATTACATGCAGCGCCATGTCGAAAGGAAGTTCGTCCTCCTTGCCAACAAGTTC
GAACGAAGTCCCTCCAAACCCGGGAATTTCGAGTGTTAAGTCCTCGAATGGCGTTTCTGAGCCTGAAACTCGGGTATTTGTGAGGAGAAGAGTGAAAAAGATTGCAGAAA
GTCAAGATAGTGGGTTTGAAGTTGAACCTAAAATCGACACTAAACGCTCCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGG
AAGTTAAAGCCTCCTCTAGATCTTCTTCTCAACGGAATTGAAGATTCTAATCCAACTACACATAAAGGCAAGGCAGAACGAGGTAAACCACCTGTGAATTGGGAAAAAGT
TCTTAAAGGAATTCGGGAAATGAGGTCCTCTGAAGAAGCTCCAGTAGATACCATGGGATGTGGACGAGCTGGTAGTACTCTTCCTCCCAAGGAAAGAAGATTTGCTGTCT
TGGCATCTTCTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGAGCAGCATTGCGTCTCCAAGAAAGTGGTCTTCTTACAGCCGATGCCATGGACAAAGCTGAT
GAAGAAACCATTAAAAGTTTGATTTACCCGGTTGGATTTTATTCTACAAAGGCTAAGAATTTGAAGAAGATTGCAAGAATATGTCTTATGAAGTATGGTGGGGACATACC
TAGATCATTGGCGGAGCTACTTCTGCTACCTGGGATAGGCCCTAAGATTGCACATTTGATCATGATTATGGCTTGGAACGATGTTCAGGGGATATGTGTAGACACTCATG
TGCATCGCATTTGCAATCGGCTTGGATGGGTGTCTGGGAAAGGCTCAAAACAGAAAACATCAACTCCTGAAGAAACTCGAGTAGGATTAGAACTGTGGCTGCCAAAAGAA
GAATGGGTTCCAATTAATCCTCTTCTGGTTGGATTTGGACAGACTATTTGCACTCCCCTTAGACCCAAATGTGGAAATTGCAGTGTTAGTGACCTATGCCCATCCGCATT
CAAAGAGTCTTCAAGCCCATCTCCCAAATTAAAGGGTTCAAGTTCTACCAAAAAGTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTTTTGCTTGTCCTATTCGAATACCGGCGCTTTCAATCACATTTGCTCGAAGAATTACATGCAGCGCCATGTCGAAAGGAAGTTCGTCCTCCTTGCCAACAAGTTC
GAACGAAGTCCCTCCAAACCCGGGAATTTCGAGTGTTAAGTCCTCGAATGGCGTTTCTGAGCCTGAAACTCGGGTATTTGTGAGGAGAAGAGTGAAAAAGATTGCAGAAA
GTCAAGATAGTGGGTTTGAAGTTGAACCTAAAATCGACACTAAACGCTCCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGG
AAGTTAAAGCCTCCTCTAGATCTTCTTCTCAACGGAATTGAAGATTCTAATCCAACTACACATAAAGGCAAGGCAGAACGAGGTAAACCACCTGTGAATTGGGAAAAAGT
TCTTAAAGGAATTCGGGAAATGAGGTCCTCTGAAGAAGCTCCAGTAGATACCATGGGATGTGGACGAGCTGGTAGTACTCTTCCTCCCAAGGAAAGAAGATTTGCTGTCT
TGGCATCTTCTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGAGCAGCATTGCGTCTCCAAGAAAGTGGTCTTCTTACAGCCGATGCCATGGACAAAGCTGAT
GAAGAAACCATTAAAAGTTTGATTTACCCGGTTGGATTTTATTCTACAAAGGCTAAGAATTTGAAGAAGATTGCAAGAATATGTCTTATGAAGTATGGTGGGGACATACC
TAGATCATTGGCGGAGCTACTTCTGCTACCTGGGATAGGCCCTAAGATTGCACATTTGATCATGATTATGGCTTGGAACGATGTTCAGGGGATATGTGTAGACACTCATG
TGCATCGCATTTGCAATCGGCTTGGATGGGTGTCTGGGAAAGGCTCAAAACAGAAAACATCAACTCCTGAAGAAACTCGAGTAGGATTAGAACTGTGGCTGCCAAAAGAA
GAATGGGTTCCAATTAATCCTCTTCTGGTTGGATTTGGACAGACTATTTGCACTCCCCTTAGACCCAAATGTGGAAATTGCAGTGTTAGTGACCTATGCCCATCCGCATT
CAAAGAGTCTTCAAGCCCATCTCCCAAATTAAAGGGTTCAAGTTCTACCAAAAAGTTATGA
Protein sequenceShow/hide protein sequence
MFFACPIRIPALSITFARRITCSAMSKGSSSSLPTSSNEVPPNPGISSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSR
KLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADAMDKAD
EETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKE
EWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL