; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G03310 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G03310
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionEndonuclease III homolog
Genome locationChr4:2038900..2045039
RNA-Seq ExpressionCSPI04G03310
SyntenyCSPI04G03310
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0000703 - oxidized pyrimidine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR030841 - Endonuclease III-like protein 1
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
IPR011257 - DNA glycosylase
IPR004036 - Endonuclease III-like, conserved site-2
IPR004035 - Endonuclease III, iron-sulphur binding site
IPR000445 - Helix-hairpin-helix motif
IPR003265 - HhH-GPD domain
IPR003651 - Endonuclease III-like, iron-sulphur cluster loop motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044590.1 endonuclease III-like protein 1 [Cucumis melo var. makuwa]2.5e-20694.86Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSS+PTSSNEVPPNPG SSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKI+TKRS PPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTH-
        KRTKDSPGSR+ KPP DLLLNGIE S PTTHKG A R KPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTH 
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTH-

Query:  --GAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI
          GAALRLQESGLLTA++MDKADEETIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI
Subjt:  --GAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI

Query:  CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

TYK16994.1 endonuclease III-like protein 1 [Cucumis melo var. makuwa]8.0e-19796.13Show/hide
Query:  MSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIE
        MSKGSSSS+PTSSNEVPPNPG SSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSR+ KPP DLLLNGIE
Subjt:  MSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIE

Query:  DSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETI
         S PTTHKG A RGKPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTA++MDKADEETI
Subjt:  DSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETI

Query:  KSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE
        KSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE
Subjt:  KSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE

Query:  LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

XP_004152104.1 endonuclease III homolog 1, chloroplastic [Cucumis sativus]1.8e-21799.22Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSS+PTSSNEVPPNPG SSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTAD+MDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

XP_008453986.1 PREDICTED: endonuclease III homolog 1, chloroplastic-like [Cucumis melo]1.7e-21096.37Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSS+PTSSNEVPPNPG SSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSR+ KPP DLLLNGIE S PTTHKG A RGKPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTA++MDKADEETIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

XP_038878188.1 endonuclease III homolog 2, chloroplastic-like isoform X3 [Benincasa hispida]7.2e-19891.45Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        M FACPIR PA SITFARRITCS MSKGS SS+PTSSNEVPPNPG S VKSSNGVSE ETRVFVRRRVKK AE Q SG EVEPK+D KR CPPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRK K PLD LLNGIE SNPT  KG AE GKPPVNWEKVL+GIR+MRSSEEAPVDTMGCG+AGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTAD+MDKADE TIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELL+LPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LGWVSGKGSKQKTSTPEETRV LELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

TrEMBL top hitse value%identityAlignment
A0A0A0KU72 ENDO3c domain-containing protein2.8e-19591.97Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSS+PTSSNEVPPNPG SSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTAD+MDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LGWVSGKGSKQ                            VGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

A0A1S3BX24 Endonuclease III homolog8.0e-21196.37Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSS+PTSSNEVPPNPG SSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSR+ KPP DLLLNGIE S PTTHKG A RGKPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTA++MDKADEETIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

A0A5A7TN21 Endonuclease III homolog1.2e-20694.86Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MFFACPIRIPALSITFARRITCSAMSKGSSSS+PTSSNEVPPNPG SSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKI+TKRS PPNIEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTH-
        KRTKDSPGSR+ KPP DLLLNGIE S PTTHKG A R KPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTH 
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTH-

Query:  --GAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI
          GAALRLQESGLLTA++MDKADEETIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI
Subjt:  --GAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRI

Query:  CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  CNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

A0A5D3D186 Endonuclease III homolog3.9e-19796.13Show/hide
Query:  MSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIE
        MSKGSSSS+PTSSNEVPPNPG SSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSR+ KPP DLLLNGIE
Subjt:  MSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIE

Query:  DSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETI
         S PTTHKG A RGKPP NWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTA++MDKADEETI
Subjt:  DSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETI

Query:  KSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE
        KSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE
Subjt:  KSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLE

Query:  LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
Subjt:  LWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

A0A6J1GW15 Endonuclease III homolog1.9e-18386.98Show/hide
Query:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF
        MF ACPIR  ALSITFARRITC+ MSK SSSSMP +SNE P  PG S V+SSNGVS+ ETRVFVRR VKK AE Q SG ++E K D KR CPP+IEDFAF
Subjt:  MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAF

Query:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRK KPPLD LL  IEDSNPT  KG AE GK PV+WEKVL+GIREMRSSE APVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
        AALRLQESGLLTAD+MDKADE TIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDV+GICVDTHVHRICNR
Subjt:  AALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTK
        LGWVSGKGSKQKTS+PEETRV LELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVS LCPSAFKE+SSPSPKLK SSSTK
Subjt:  LGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTK

SwissProt top hitse value%identityAlignment
A7M7B9 Endonuclease III-like protein 16.0e-6253.74Show/hide
Query:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRAGST-LPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAK
        P NW++ L+ IREMR   +APVD MG  +   T  PP+  R+ VL S +LSSQTKD VT  A LRL++ G LT DS+ + D+ T+  +IYPVGF+  K K
Subjt:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRAGST-LPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAK

Query:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL
         +K+   I   KYGGDIP ++ EL+ LPG+GPK+AHL M +AWN V GI VDTHVHRI NRL WV     K++T  PEETRV LE WLP++ W  IN LL
Subjt:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL

Query:  VGFGQTICTPLRPKCGNCSVSDLCPSA
        VGFGQ  C P+ P+C  C   D+CP+A
Subjt:  VGFGQTICTPLRPKCGNCSVSDLCPSA

B9DFZ0 Endonuclease III homolog 2, chloroplastic7.2e-10859.36Show/hide
Query:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW
        S+ ++++G SE ETRV +R +R+K+                 K  C  P+IED  +K+T  +  SR  K  L+  +   E S   +    A  G PP NW
Subjt:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW

Query:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKI
        EKVL+GIR+M+ SEEAPV+ + C R GS LPPKERRF VL  +LLSSQTK+H+T  A  RL ++GLLT +++DKADE TIK LIYPVGFY+ KA N+KK+
Subjt:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKI

Query:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQ
        A+ICLM+Y GDIPR+L ELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKTS+PEETRV L+ WLPK EWV IN LLVGFGQ
Subjt:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQ

Query:  TICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        TICTPLRP CG CS++++CPSAFKE+ S S KLK S  +KKL
Subjt:  TICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

P78549 Endonuclease III-like protein 16.6e-6153.3Show/hide
Query:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAK
        P +W++ L  IR MR+ ++APVD +G      S+ PPK RR+ VL S +LSSQTKD VT GA  RL+  G LT DS+ + D+ T+  LIYPVGF+ +K K
Subjt:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAK

Query:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL
         +K+ + I    YGGDIP S+AEL+ LPG+GPK+AHL M +AW  V GI VDTHVHRI NRL W     +K+ T +PEETR  LE WLP+E W  IN LL
Subjt:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL

Query:  VGFGQTICTPLRPKCGNCSVSDLCPSA
        VGFGQ  C P+ P+C  C    LCP+A
Subjt:  VGFGQTICTPLRPKCGNCSVSDLCPSA

Q2KID2 Endonuclease III-like protein 15.6e-6052.86Show/hide
Query:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAK
        P +W + L  IR MRS ++APVD +G       +  PK RR+ VL S +LSSQTKD VT GA  RL+  G LT DS+ + D+ T+ +LIYPVGF+ +K K
Subjt:  PVNWEKVLKGIREMRSSEEAPVDTMGCGRA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAK

Query:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL
         +K+ + I   +Y GDIP S+AEL+ LPG+GPK+AHL M +AW  V GI VDTHVHRI NRL W     +K+ T +PEETR  LE WLP+E W  IN LL
Subjt:  NLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLL

Query:  VGFGQTICTPLRPKCGNCSVSDLCPSA
        VGFGQ  C P+RP+C  C    LCP+A
Subjt:  VGFGQTICTPLRPKCGNCSVSDLCPSA

Q9SIC4 Endonuclease III homolog 1, chloroplastic6.1e-10757.68Show/hide
Query:  FARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKP
        F R  T S    G+ SS    S +       S+ + + G S  ETRV+ R++  K    +         ++T + C  P+IEDFA+K+T  SP S +   
Subjt:  FARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKP

Query:  PLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTAD
                 E S   T    A  G PP NW +VL+GIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V + A  RL ++GLLT +
Subjt:  PLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTAD

Query:  SMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTS
        ++DKADE TIK LIYPVGFY+ KA  +KKIARICL+KY GDIP SL +LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKT+
Subjt:  SMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTS

Query:  TPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK
        +PEETRV L+ WLPKEEWV INPLLVGFGQ ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK S+ +K+
Subjt:  TPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK

Arabidopsis top hitse value%identityAlignment
AT1G05900.1 endonuclease III 27.5e-7655.81Show/hide
Query:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW
        S+ ++++G SE ETRV +R +R+K+                 K  C  P+IED  +K+T  +  SR  K  L+  +   E S   +    A  G PP NW
Subjt:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW

Query:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKI
        EKVL+GIR+M+ SEEAPV+ + C R GS LPPKERRF VL  +LLSSQTK+H+T  A  RL ++GLLT +++DKADE TIK LIYPVGFY+ KA N+KK+
Subjt:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKI

Query:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ
        A+ICLM+Y GDIPR+L ELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQ
Subjt:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ

AT1G05900.2 endonuclease III 25.1e-10959.36Show/hide
Query:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW
        S+ ++++G SE ETRV +R +R+K+                 K  C  P+IED  +K+T  +  SR  K  L+  +   E S   +    A  G PP NW
Subjt:  SSVKSSNGVSEPETRVFVR-RRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNW

Query:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKI
        EKVL+GIR+M+ SEEAPV+ + C R GS LPPKERRF VL  +LLSSQTK+H+T  A  RL ++GLLT +++DKADE TIK LIYPVGFY+ KA N+KK+
Subjt:  EKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKI

Query:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQ
        A+ICLM+Y GDIPR+L ELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKTS+PEETRV L+ WLPK EWV IN LLVGFGQ
Subjt:  ARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQ

Query:  TICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL
        TICTPLRP CG CS++++CPSAFKE+ S S KLK S  +KKL
Subjt:  TICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL

AT2G31450.1 DNA glycosylase superfamily protein4.3e-10857.68Show/hide
Query:  FARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKP
        F R  T S    G+ SS    S +       S+ + + G S  ETRV+ R++  K    +         ++T + C  P+IEDFA+K+T  SP S +   
Subjt:  FARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKP

Query:  PLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTAD
                 E S   T    A  G PP NW +VL+GIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V + A  RL ++GLLT +
Subjt:  PLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTAD

Query:  SMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTS
        ++DKADE TIK LIYPVGFY+ KA  +KKIARICL+KY GDIP SL +LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKT+
Subjt:  SMDKADEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTS

Query:  TPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK
        +PEETRV L+ WLPKEEWV INPLLVGFGQ ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK S+ +K+
Subjt:  TPEETRVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK

AT2G31450.2 DNA glycosylase superfamily protein1.8e-10960.88Show/hide
Query:  SSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWE
        S   S+ G S  ETRV+ R++  K    +         ++T + C  P+IEDFA+K+T  SP S +            E S   T    A  G PP NW 
Subjt:  SSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCP-PNIEDFAFKRTKDSPGSRKLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWE

Query:  KVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIA
        +VL+GIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V + A  RL ++GLLT +++DKADE TIK LIYPVGFY+ KA  +KKIA
Subjt:  KVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKADEETIKSLIYPVGFYSTKAKNLKKIA

Query:  RICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQT
        RICL+KY GDIP SL +LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKT++PEETRV L+ WLPKEEWV INPLLVGFGQ 
Subjt:  RICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKEEWVPINPLLVGFGQT

Query:  ICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK
        ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK S+ +K+
Subjt:  ICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKK

AT4G12740.1 HhH-GPD base excision DNA repair family protein4.0e-0525.69Show/hide
Query:  DEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEET
        +++ +  +   +G+Y  +A+ L + A++ +    G  P   + L+ + GIG   A  I  +A+N+   + VD +V R+  RL  +S    K + +     
Subjt:  DEETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEET

Query:  RVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLC
        ++  +L  P       N  L+  G T+CT  +P C +C VS  C
Subjt:  RVGLELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTTTGCTTGTCCTATTCGAATACCGGCGCTTTCAATCACATTTGCTCGAAGAATTACATGCAGCGCCATGTCGAAAGGAAGTTCGTCCTCCATGCCAACAAGTTC
GAACGAAGTCCCTCCAAACCCGGGAACTTCGAGTGTTAAGTCCTCGAATGGCGTTTCTGAGCCTGAAACTCGGGTATTTGTGAGGAGAAGAGTGAAAAAGATTGCAGAAA
GTCAAGATAGTGGGTTTGAAGTTGAACCTAAAATCGACACTAAACGCTCCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGG
AAGTTAAAGCCTCCTCTAGATCTTCTTCTCAACGGAATTGAAGATTCTAATCCAACTACACATAAAGGCAAGGCAGAACGAGGTAAACCACCTGTGAATTGGGAAAAAGT
TCTTAAAGGAATTCGGGAAATGAGGTCCTCTGAAGAAGCTCCAGTAGATACAATGGGATGTGGACGAGCTGGTAGTACTCTTCCTCCCAAGGAAAGAAGATTTGCTGTCT
TGGCATCTTCTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGAGCAGCATTGCGTCTCCAAGAAAGTGGTCTTCTTACAGCCGATTCCATGGACAAAGCTGAT
GAAGAAACCATTAAAAGTTTGATTTATCCGGTTGGATTTTATTCTACAAAGGCTAAGAATTTGAAGAAGATTGCAAGAATATGTCTTATGAAGTATGGTGGAGACATACC
TAGATCATTGGCGGAGCTACTTCTGCTACCTGGGATAGGCCCTAAGATTGCACATTTGATCATGATTATGGCTTGGAACGATGTTCAGGGGATATGTGTAGACACTCATG
TGCATCGCATTTGCAATCGGCTTGGATGGGTATCTGGGAAAGGCTCAAAACAGAAAACATCAACTCCTGAAGAAACTCGAGTAGGATTAGAACTGTGGCTGCCAAAAGAA
GAATGGGTTCCAATTAATCCTCTTCTGGTTGGATTTGGACAGACTATTTGCACTCCCCTTAGACCCAAATGTGGAAATTGCAGTGTTAGTGACCTATGCCCATCCGCATT
CAAAGAGTCTTCAAGCCCATCTCCCAAATTAAAGGGTTCAAGTTCTACCAAAAAGTTATGA
mRNA sequenceShow/hide mRNA sequence
TGGAAAAAAAAATCTTCCTTTCAAACATCCTATTCTTTGGTTGTACCTATTTCCACGTAGTTCCTTCCATTACAATGTGATAACAACAATCATATGATTATGAATTTAAA
AAATTTGATGTAATTCAAGTTTCATCAAGATATTTTTCATTGCAGATTGTAATAATGAAAAACTCACAATGTCAATCTTGAGAGATGTGGTGGTATCAAAGAAGACGACA
TAGAGAGCCCAGTACCCATCTTGATCAACCAATCTGCGGTTTCCAAGAAGCAAATATTTGGAACTTGAGCGTGGAAAAATGGACCCTTTTTAGATTGGATTGTTCTACTT
CTACTTATTACCATGTTTTTTGCTTGTCCTATTCGAATACCGGCGCTTTCAATCACATTTGCTCGAAGAATTACATGCAGCGCCATGTCGAAAGGAAGTTCGTCCTCCAT
GCCAACAAGTTCGAACGAAGTCCCTCCAAACCCGGGAACTTCGAGTGTTAAGTCCTCGAATGGCGTTTCTGAGCCTGAAACTCGGGTATTTGTGAGGAGAAGAGTGAAAA
AGATTGCAGAAAGTCAAGATAGTGGGTTTGAAGTTGAACCTAAAATCGACACTAAACGCTCCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCC
CCTGGATCAAGGAAGTTAAAGCCTCCTCTAGATCTTCTTCTCAACGGAATTGAAGATTCTAATCCAACTACACATAAAGGCAAGGCAGAACGAGGTAAACCACCTGTGAA
TTGGGAAAAAGTTCTTAAAGGAATTCGGGAAATGAGGTCCTCTGAAGAAGCTCCAGTAGATACAATGGGATGTGGACGAGCTGGTAGTACTCTTCCTCCCAAGGAAAGAA
GATTTGCTGTCTTGGCATCTTCTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGAGCAGCATTGCGTCTCCAAGAAAGTGGTCTTCTTACAGCCGATTCCATG
GACAAAGCTGATGAAGAAACCATTAAAAGTTTGATTTATCCGGTTGGATTTTATTCTACAAAGGCTAAGAATTTGAAGAAGATTGCAAGAATATGTCTTATGAAGTATGG
TGGAGACATACCTAGATCATTGGCGGAGCTACTTCTGCTACCTGGGATAGGCCCTAAGATTGCACATTTGATCATGATTATGGCTTGGAACGATGTTCAGGGGATATGTG
TAGACACTCATGTGCATCGCATTTGCAATCGGCTTGGATGGGTATCTGGGAAAGGCTCAAAACAGAAAACATCAACTCCTGAAGAAACTCGAGTAGGATTAGAACTGTGG
CTGCCAAAAGAAGAATGGGTTCCAATTAATCCTCTTCTGGTTGGATTTGGACAGACTATTTGCACTCCCCTTAGACCCAAATGTGGAAATTGCAGTGTTAGTGACCTATG
CCCATCCGCATTCAAAGAGTCTTCAAGCCCATCTCCCAAATTAAAGGGTTCAAGTTCTACCAAAAAGTTATGATCATCTTTCTGTGTGGATGAATATATTTACTAGTCAT
ACATTTCCCTCGTTTTTGCTTTTCTATCTTTTTTTTTGGGCACCATAATGGTTTAGAGATTTAGAGCTTCCATTATGGTTTCTATCACCAAGTGTTGATAGATAGATAGA
TAGGTGGATATTGAAGGATCTCAATTGGTCAGTGTTGTGTAGAACAAACATTGTTACTGATGTTATGATGTATTGATGTTCTTGGTACATGTCGTACATACCATAATTTC
TTCATCATTATTATCAACTAATCCTTCATGAAACTGTT
Protein sequenceShow/hide protein sequence
MFFACPIRIPALSITFARRITCSAMSKGSSSSMPTSSNEVPPNPGTSSVKSSNGVSEPETRVFVRRRVKKIAESQDSGFEVEPKIDTKRSCPPNIEDFAFKRTKDSPGSR
KLKPPLDLLLNGIEDSNPTTHKGKAERGKPPVNWEKVLKGIREMRSSEEAPVDTMGCGRAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAALRLQESGLLTADSMDKAD
EETIKSLIYPVGFYSTKAKNLKKIARICLMKYGGDIPRSLAELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVGLELWLPKE
EWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKGSSSTKKL