; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G000400 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G000400
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionEndonuclease III homolog
Genome locationchr07:472462..479367
RNA-Seq ExpressionLsi07G000400
SyntenyLsi07G000400
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000703 - oxidized pyrimidine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domainsIPR000445 - Helix-hairpin-helix motif
IPR003265 - HhH-GPD domain
IPR003651 - Endonuclease III-like, iron-sulphur cluster loop motif
IPR004035 - Endonuclease III, iron-sulphur binding site
IPR004036 - Endonuclease III-like, conserved site-2
IPR011257 - DNA glycosylase
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152104.1 endonuclease III homolog 1, chloroplastic [Cucumis sativus]6.0e-16278.28Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        MF ACPIR  ALSITFARRITCS MSKGS SS+PTSSNEVPPN GIS VKSS+GVSE  TRVFVRRRVKK AE QDSG EVEP ID KR CPPNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---
        KRTKDSPGSRK KPPLD L+NGIEDSNPT  K  AERGKPPVNWE+VL+GIREMRSSEEAPVDTMGCG+AGSTLPP        KER F   +  L+   
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---

Query:  VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV
          DH   + GAALRLQESGLLTADAMDKADE TIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICV
Subjt:  VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV

Query:  DTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL
        DTHVHRICNRLGWVSGKGSKQ                            VGFGQTICTPLRPKCGNC VSDLCPSAFKESSSPSPKLKGSSS KKL
Subjt:  DTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL

XP_038878185.1 endonuclease III homolog 2, chloroplastic-like isoform X1 [Benincasa hispida]7.6e-16575.35Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAK------------
        M  ACPIRT A SITFARRITCSGMSKGSLSS+PTSSNEVPPN GISGVKSS+GVSES TRVFVRRRVKKNAEGQ SGLEVEP +DAK            
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAK------------

Query:  ------------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQA
                          RCCPPNIEDFAFKRTKDSPGSRKSK PLDPL+NGIE SNPTRQK IAE GKPPVNWE+VLEGIR+MRSSEEAPVDTMGCGQA
Subjt:  ------------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQA

Query:  GSTLPPKQQVNKNFKERNFPSFSEYLM---VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSL
        GSTLPP        KER F   +  L+     DH   + GAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSL
Subjt:  GSTLPPKQQVNKNFKERNFPSFSEYLM---VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSL

Query:  EELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVS
        EELL+LPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ                            VGFGQTICTPLRPKCGNC VS
Subjt:  EELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVS

Query:  DLCPSAFKESSSPSPKLKGSSSAKKL
        DLCPSAFKESSSPSPKLKGSSS KKL
Subjt:  DLCPSAFKESSSPSPKLKGSSSAKKL

XP_038878187.1 endonuclease III homolog 2, chloroplastic-like isoform X2 [Benincasa hispida]1.3e-16174.65Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAK------------
        M  ACPIRT A SITFARRITCSGMSKGSLSS+PTSSNEVPPN    GVKSS+GVSES TRVFVRRRVKKNAEGQ SGLEVEP +DAK            
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAK------------

Query:  ------------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQA
                          RCCPPNIEDFAFKRTKDSPGSRKSK PLDPL+NGIE SNPTRQK IAE GKPPVNWE+VLEGIR+MRSSEEAPVDTMGCGQA
Subjt:  ------------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQA

Query:  GSTLPPKQQVNKNFKERNFPSFSEYLM---VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSL
        GSTLPP        KER F   +  L+     DH   + GAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSL
Subjt:  GSTLPPKQQVNKNFKERNFPSFSEYLM---VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSL

Query:  EELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVS
        EELL+LPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ                            VGFGQTICTPLRPKCGNC VS
Subjt:  EELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVS

Query:  DLCPSAFKESSSPSPKLKGSSSAKKL
        DLCPSAFKESSSPSPKLKGSSS KKL
Subjt:  DLCPSAFKESSSPSPKLKGSSSAKKL

XP_038878188.1 endonuclease III homolog 2, chloroplastic-like isoform X3 [Benincasa hispida]1.3e-16981.06Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        M  ACPIRT A SITFARRITCSGMSKGSLSS+PTSSNEVPPN GISGVKSS+GVSES TRVFVRRRVKKNAEGQ SGLEVEP +DAKRCCPPNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---
        KRTKDSPGSRKSK PLDPL+NGIE SNPTRQK IAE GKPPVNWE+VLEGIR+MRSSEEAPVDTMGCGQAGSTLPP        KER F   +  L+   
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---

Query:  VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV
          DH   + GAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELL+LPGIGPKIAHLIMIMAWNDVQGICV
Subjt:  VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV

Query:  DTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL
        DTHVHRICNRLGWVSGKGSKQ                            VGFGQTICTPLRPKCGNC VSDLCPSAFKESSSPSPKLKGSSS KKL
Subjt:  DTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL

XP_038878189.1 endonuclease III homolog 2, chloroplastic-like isoform X4 [Benincasa hispida]2.4e-16680.3Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        M  ACPIRT A SITFARRITCSGMSKGSLSS+PTSSNEVPPN    GVKSS+GVSES TRVFVRRRVKKNAEGQ SGLEVEP +DAKRCCPPNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---
        KRTKDSPGSRKSK PLDPL+NGIE SNPTRQK IAE GKPPVNWE+VLEGIR+MRSSEEAPVDTMGCGQAGSTLPP        KER F   +  L+   
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---

Query:  VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV
          DH   + GAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELL+LPGIGPKIAHLIMIMAWNDVQGICV
Subjt:  VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV

Query:  DTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL
        DTHVHRICNRLGWVSGKGSKQ                            VGFGQTICTPLRPKCGNC VSDLCPSAFKESSSPSPKLKGSSS KKL
Subjt:  DTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL

TrEMBL top hitse value%identityAlignment
A0A0A0KU72 ENDO3c domain-containing protein8.8e-16784.24Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        MF ACPIR  ALSITFARRITCS MSKGS SS+PTSSNEVPPN GIS VKSS+GVSE  TRVFVRRRVKK AE QDSG EVEP ID KR CPPNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---
        KRTKDSPGSRK KPPLD L+NGIEDSNPT  K  AERGKPPVNWE+VL+GIREMRSSEEAPVDTMGCG+AGSTLPP        KER F   +  L+   
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---

Query:  VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV
          DH   + GAALRLQESGLLTADAMDKADE TIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICV
Subjt:  VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV

Query:  DTHVHRICNRLGWVSGKGSKQVGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL
        DTHVHRICNRLGWVSGKGSKQVGFGQTICTPLRPKCGNC VSDLCPSAFKESSSPSPKLKGSSS KKL
Subjt:  DTHVHRICNRLGWVSGKGSKQVGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL

A0A1S3BX24 Endonuclease III homolog3.0e-15977.27Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        MF ACPIR  ALSITFARRITCS MSKGS SS+PTSSNEVPPN GIS VKSS+GVSE  TRVFVRRRVKK AE QDSG EVEP ID KR CPPNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---
        KRTKDSPGSR+SKPP D L+NGIE S PT  K  A RGKPP NWE+VL+GIREMRSSEEAPVDTMGCG+AGSTLPP        KER F   +  L+   
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---

Query:  VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV
          DH   + GAALRLQESGLLTA+AMDKADE TIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV
Subjt:  VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV

Query:  DTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL
        DTHVHRICNRLGWVSGKGSKQ                            VGFGQTICTPLRPKCGNC VSDLCPSAFKESSSPSPKLKGSSS KKL
Subjt:  DTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL

A0A5A7TN21 Endonuclease III homolog2.2e-15776.57Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        MF ACPIR  ALSITFARRITCS MSKGS SS+PTSSNEVPPN GIS VKSS+GVSE  TRVFVRRRVKK AE QDSG EVEP I+ KR  PPNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---
        KRTKDSPGSR+SKPP D L+NGIE S PT  K  A R KPP NWE+VL+GIREMRSSEEAPVDTMGCG+AGSTLPP        KER F   +  L+   
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM---

Query:  VFDHFER-SAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC
          DH    +AGAALRLQESGLLTA+AMDKADE TIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC
Subjt:  VFDHFER-SAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC

Query:  VDTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL
        VDTHVHRICNRLGWVSGKGSKQ                            VGFGQTICTPLRPKCGNC VSDLCPSAFKESSSPSPKLKGSSS KKL
Subjt:  VDTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL

A0A6J1CTU5 Endonuclease III homolog2.0e-15575.63Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVP-PNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFA
        MFL C I T  L I FARRITC  MSKGSLSS+PTSSN+ P  ++GISGV+SS+GVSES TRVFVRRRVKKNAE QDSG +VEPN+DAKRCCPP+IEDFA
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVP-PNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFA

Query:  FKRTKDSPGSRKSK-PPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM-
        FKRTK+SPGS KSK PPLDPLV GIE SNP RQK IAERGKPPVNWE++LEGIREMRSSE+APVDTMGCGQA STLPP        KER F   +  L+ 
Subjt:  FKRTKDSPGSRKSK-PPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM-

Query:  --VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGI
            DH   + GAA RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGI
Subjt:  --VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGI

Query:  CVDTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL
        CVDTHVHRI NRLGWVSGKGSKQ                            VGFGQTICTPLRPKCGNC VSDLCPSAFKESSSPSPKLK SSS KKL
Subjt:  CVDTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL

A0A6J1CVP5 Endonuclease III homolog8.5e-15475.31Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        MFL C I T  L I FARRITC  MSKGSLSS+PTSSN+ P +   SGV+SS+GVSES TRVFVRRRVKKNAE QDSG +VEPN+DAKRCCPP+IEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSK-PPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM--
        KRTK+SPGS KSK PPLDPLV GIE SNP RQK IAERGKPPVNWE++LEGIREMRSSE+APVDTMGCGQA STLPP        KER F   +  L+  
Subjt:  KRTKDSPGSRKSK-PPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLM--

Query:  -VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC
           DH   + GAA RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC
Subjt:  -VFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC

Query:  VDTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL
        VDTHVHRI NRLGWVSGKGSKQ                            VGFGQTICTPLRPKCGNC VSDLCPSAFKESSSPSPKLK SSS KKL
Subjt:  VDTHVHRICNRLGWVSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL

SwissProt top hitse value%identityAlignment
A7M7B9 Endonuclease III-like protein 12.5e-4140.32Show/hide
Query:  SNPTRQKSIAERGKP----------PVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAALRLQES
        S  TR+  IA   +P          P NW++ LE IREMR   +APVD MG  +   T  P Q +      R     S  L      + ++ A LRL++ 
Subjt:  SNPTRQKSIAERGKP----------PVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAALRLQES

Query:  GLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKG
        G LT D++ + D+AT+  +IYPVGF+  K K +K+   I   KYGGDIP ++EEL+ LPG+GPK+AHL M +AWN V GI VDTHVHRI NRL WV  + 
Subjt:  GLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKG

Query:  SKQ-----------------------VGFGQTICTPLRPKCGNCCVSDLCPSA
                                  VGFGQ  C P+ P+C  C   D+CP+A
Subjt:  SKQ-----------------------VGFGQTICTPLRPKCGNCCVSDLCPSA

B9DFZ0 Endonuclease III homolog 2, chloroplastic1.4e-8149.18Show/hide
Query:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS
        +LSS  + S E       S  +++ G SES TRV +R +R+K+ + E        E       C  P+IED  +K+T  +  SR  K  L+  +   E S
Subjt:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS

Query:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-RLQESGLLTADAMDK
                A  G PP NWE+VLEGIR+M+ SEEAPV+ + C + GS LPP        KER F      L+     E   GAA+ RL ++GLLT +A+DK
Subjt:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-RLQESGLLTADAMDK

Query:  ADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ-------
        ADE+TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQ       
Subjt:  ADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ-------

Query:  ---------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL
                             VGFGQTICTPLRP CG C ++++CPSAFKE+ S S KLK S  +KKL
Subjt:  ---------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL

P78549 Endonuclease III-like protein 11.7e-3740.87Show/hide
Query:  PVNWEEVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVG
        P +W++ L  IR MR+ ++APVD +G      S+ PPK +       R     S  L      + +AGA  RL+  G LT D++ + D+AT+  LIYPVG
Subjt:  PVNWEEVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAALRLQESGLLTADAMDKADEATIKSLIYPVG

Query:  FYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ---------------------
        F+ +K K +K+ + I    YGGDIP S+ EL+ LPG+GPK+AHL M +AW  V GI VDTHVHRI NRL W   K +K                      
Subjt:  FYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ---------------------

Query:  ---VGFGQTICTPLRPKCGNCCVSDLCPSA
           VGFGQ  C P+ P+C  C    LCP+A
Subjt:  ---VGFGQTICTPLRPKCGNCCVSDLCPSA

Q2KID2 Endonuclease III-like protein 18.2e-3739.57Show/hide
Query:  PVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFER-SAGAALRLQESGLLTADAMDKADEATIKSLIYPVG
        P +W + L+ IR MRS ++APVD +G   A     P    + + K R +      ++     ++ +AGA  RL+  G LT D++ + D++T+ +LIYPVG
Subjt:  PVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFER-SAGAALRLQESGLLTADAMDKADEATIKSLIYPVG

Query:  FYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ---------------------
        F+ +K K +K+ + I   +Y GDIP S+ EL+ LPG+GPK+AHL M +AW  V GI VDTHVHRI NRL W   K +K                      
Subjt:  FYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ---------------------

Query:  ---VGFGQTICTPLRPKCGNCCVSDLCPSA
           VGFGQ  C P+RP+C  C    LCP+A
Subjt:  ---VGFGQTICTPLRPKCGNCCVSDLCPSA

Q9SIC4 Endonuclease III homolog 1, chloroplastic1.7e-7947.91Show/hide
Query:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK
        F R  T S    G++SS    S +       S  + + G S S TRV+ R++  K    +     SG  V  +   K C  P+IEDFA+K+T  SP S +
Subjt:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK

Query:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-
        S                    S+   G PP NW EVLEGIR+MRSSE+APVD+MGC +AGS LPP         ER F      L+     ++   AA+ 
Subjt:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-

Query:  RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGW
        RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGW
Subjt:  RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGW

Query:  VSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKK
        VS  G+KQ                            VGFGQ ICTP+RP+C  C VS LCP+AFKE+SSPS KLK S+ +K+
Subjt:  VSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKK

Arabidopsis top hitse value%identityAlignment
AT1G05900.1 endonuclease III 23.2e-6851.02Show/hide
Query:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS
        +LSS  + S E       S  +++ G SES TRV +R +R+K+ + E        E       C  P+IED  +K+T  +  SR  K  L+  +   E S
Subjt:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS

Query:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-RLQESGLLTADAMDK
                A  G PP NWE+VLEGIR+M+ SEEAPV+ + C + GS LPP        KER F      L+     E   GAA+ RL ++GLLT +A+DK
Subjt:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-RLQESGLLTADAMDK

Query:  ADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQV
        ADE+TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQV
Subjt:  ADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQV

AT1G05900.2 endonuclease III 21.0e-8249.18Show/hide
Query:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS
        +LSS  + S E       S  +++ G SES TRV +R +R+K+ + E        E       C  P+IED  +K+T  +  SR  K  L+  +   E S
Subjt:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS

Query:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-RLQESGLLTADAMDK
                A  G PP NWE+VLEGIR+M+ SEEAPV+ + C + GS LPP        KER F      L+     E   GAA+ RL ++GLLT +A+DK
Subjt:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-RLQESGLLTADAMDK

Query:  ADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ-------
        ADE+TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQ       
Subjt:  ADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ-------

Query:  ---------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL
                             VGFGQTICTPLRP CG C ++++CPSAFKE+ S S KLK S  +KKL
Subjt:  ---------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL

AT2G31450.1 DNA glycosylase superfamily protein1.2e-8047.91Show/hide
Query:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK
        F R  T S    G++SS    S +       S  + + G S S TRV+ R++  K    +     SG  V  +   K C  P+IEDFA+K+T  SP S +
Subjt:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK

Query:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-
        S                    S+   G PP NW EVLEGIR+MRSSE+APVD+MGC +AGS LPP         ER F      L+     ++   AA+ 
Subjt:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-

Query:  RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGW
        RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGW
Subjt:  RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGW

Query:  VSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKK
        VS  G+KQ                            VGFGQ ICTP+RP+C  C VS LCP+AFKE+SSPS KLK S+ +K+
Subjt:  VSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKK

AT2G31450.2 DNA glycosylase superfamily protein1.6e-8048.17Show/hide
Query:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK
        F R  T S    G++SS    S  +     +S   S+ G S S TRV+ R++  K    +     SG  V  +   K C  P+IEDFA+K+T  SP S +
Subjt:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK

Query:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-
        S                    S+   G PP NW EVLEGIR+MRSSE+APVD+MGC +AGS LPP         ER F      L+     ++   AA+ 
Subjt:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAAL-

Query:  RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGW
        RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGW
Subjt:  RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGW

Query:  VSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKK
        VS  G+KQ                            VGFGQ ICTP+RP+C  C VS LCP+AFKE+SSPS KLK S+ +K+
Subjt:  VSGKGSKQ----------------------------VGFGQTICTPLRPKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTTGGCTTGTCCTATTCGAACATCGGCGCTTTCGATTACATTTGCGCGAAGAATTACATGCAGCGGCATGTCGAAAGGAAGCTTGTCTTCCATGCCTACAAGCTC
AAACGAAGTCCCCCCAAATGCAGGAATTTCAGGTGTTAAGTCCTCCGATGGCGTTTCTGAGTCTGTAACTCGTGTATTTGTGAGGAGAAGAGTGAAAAAGAATGCTGAAG
GTCAAGATAGCGGGCTTGAAGTTGAACCTAATATCGACGCTAAACGCTGCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGG
AAGTCAAAGCCTCCTCTAGATCCCCTTGTCAACGGAATTGAAGATTCTAATCCAACTAGACAGAAAAGCATTGCAGAAAGAGGTAAACCACCTGTGAATTGGGAAGAAGT
CCTTGAAGGAATTCGGGAGATGAGATCCTCTGAAGAAGCTCCAGTTGATACCATGGGATGTGGGCAAGCTGGTAGTACTCTTCCTCCCAAGCAACAAGTAAACAAAAATT
TTAAGGAGAGGAATTTTCCTTCGTTCTCTGAATATCTGATGGTTTTTGACCATTTCGAGAGGAGTGCAGGAGCAGCATTGCGTCTCCAAGAAAGTGGTCTTCTTACTGCT
GATGCCATGGACAAAGCTGATGAAGCAACCATTAAAAGCTTGATTTACCCGGTTGGGTTTTATTCTACGAAGGCTAAGAATTTGAAGAAGATTGCAAAAATATGTCTCAT
GAAGTATGGTGGGGACATACCTAGATCATTGGAGGAGCTACTTCTGCTACCTGGGATAGGCCCTAAGATTGCACATTTGATCATGATTATGGCTTGGAACGATGTTCAGG
GGATATGTGTTGATACTCATGTGCATCGCATTTGCAATCGGCTTGGATGGGTGTCTGGGAAAGGCTCAAAACAGGTTGGATTTGGACAGACTATTTGCACTCCCCTTAGA
CCCAAATGTGGAAATTGCTGTGTTAGTGACCTGTGTCCATCTGCATTTAAGGAGTCTTCAAGCCCATCTCCCAAATTAAAGGGTTCAAGTTCTGCCAAAAAGCTATGA
mRNA sequenceShow/hide mRNA sequence
AAAATATCTATGCCAATCTTGAGAGATAAGGTATCAAAGAAAGCATAAAGAGCCCAATTACCCATCTTGATCAGCCAATCTGCGGCTTCCGTAATGCAAAATTTTGAAGC
TTGAGCCTGGAAGGATGGACCCCTTTTAGATTGGATTGTTCTAATTCTACCCATTACCATGTTTTTGGCTTGTCCTATTCGAACATCGGCGCTTTCGATTACATTTGCGC
GAAGAATTACATGCAGCGGCATGTCGAAAGGAAGCTTGTCTTCCATGCCTACAAGCTCAAACGAAGTCCCCCCAAATGCAGGAATTTCAGGTGTTAAGTCCTCCGATGGC
GTTTCTGAGTCTGTAACTCGTGTATTTGTGAGGAGAAGAGTGAAAAAGAATGCTGAAGGTCAAGATAGCGGGCTTGAAGTTGAACCTAATATCGACGCTAAACGCTGCTG
TCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGGAAGTCAAAGCCTCCTCTAGATCCCCTTGTCAACGGAATTGAAGATTCTAATC
CAACTAGACAGAAAAGCATTGCAGAAAGAGGTAAACCACCTGTGAATTGGGAAGAAGTCCTTGAAGGAATTCGGGAGATGAGATCCTCTGAAGAAGCTCCAGTTGATACC
ATGGGATGTGGGCAAGCTGGTAGTACTCTTCCTCCCAAGCAACAAGTAAACAAAAATTTTAAGGAGAGGAATTTTCCTTCGTTCTCTGAATATCTGATGGTTTTTGACCA
TTTCGAGAGGAGTGCAGGAGCAGCATTGCGTCTCCAAGAAAGTGGTCTTCTTACTGCTGATGCCATGGACAAAGCTGATGAAGCAACCATTAAAAGCTTGATTTACCCGG
TTGGGTTTTATTCTACGAAGGCTAAGAATTTGAAGAAGATTGCAAAAATATGTCTCATGAAGTATGGTGGGGACATACCTAGATCATTGGAGGAGCTACTTCTGCTACCT
GGGATAGGCCCTAAGATTGCACATTTGATCATGATTATGGCTTGGAACGATGTTCAGGGGATATGTGTTGATACTCATGTGCATCGCATTTGCAATCGGCTTGGATGGGT
GTCTGGGAAAGGCTCAAAACAGGTTGGATTTGGACAGACTATTTGCACTCCCCTTAGACCCAAATGTGGAAATTGCTGTGTTAGTGACCTGTGTCCATCTGCATTTAAGG
AGTCTTCAAGCCCATCTCCCAAATTAAAGGGTTCAAGTTCTGCCAAAAAGCTATGATCATCTTTTTGTATGAATTGAAATATTTACTAGTCGCACATTTCCTCCCAAGTT
TTGGTTTTGTATCTTTTTTTGGCTCTATAATGGTTTACAGATTTAAAGCTTCCATTATGGTTTCTATCACCAATTGTTAATAGATAGATAGGTAGATATTGAAGGATCTC
AATTGGCCAATATTGTGTAGAACGAAACTTGTTATTGATGTATATCAATGTTCTCGGTACAGATTTTACATCATACTATAATTTTTTCATCATTGTTATTATTTTTGAC
Protein sequenceShow/hide protein sequence
MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSR
KSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKQQVNKNFKERNFPSFSEYLMVFDHFERSAGAALRLQESGLLTA
DAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQVGFGQTICTPLR
PKCGNCCVSDLCPSAFKESSSPSPKLKGSSSAKKL