; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001784 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001784
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionEndonuclease III homolog
Genome locationChr11:450631..456884
RNA-Seq ExpressionHG10001784
SyntenyHG10001784
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000703 - oxidized pyrimidine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
InterPro domainsIPR000445 - Helix-hairpin-helix motif
IPR003265 - HhH-GPD domain
IPR004036 - Endonuclease III-like, conserved site-2
IPR011257 - DNA glycosylase
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152104.1 endonuclease III homolog 1, chloroplastic [Cucumis sativus]2.2e-16989.53Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        MF ACPIR  ALSITFARRITCS MSKGS SS+PTSSNEVPPN GIS VKSS+GVSE  TRVFVRRRVKK AE QDSG EVEP ID KR CPPNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRK KPPLD L+NGIEDSNPT  K  AERGKPPVNWE+VL+GIREMRSSEEAPVDTMGCG+AGSTLPPKERRFAVLASSLLSSQTKDHVTH 
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
           GAALRLQESGLLTADAMDKADE TIKSLIYPVGFYSTKAKNLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
Subjt:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR

Query:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        ICNRLGWVSGKGSKQKTSTPEETRV LELWLPK+EWVPINPLLV
Subjt:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV

XP_038878185.1 endonuclease III homolog 2, chloroplastic-like isoform X1 [Benincasa hispida]9.5e-17385.56Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAK------------
        M  ACPIRT A SITFARRITCSGMSKGSLSS+PTSSNEVPPN GISGVKSS+GVSES TRVFVRRRVKKNAEGQ SGLEVEP +DAK            
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAK------------

Query:  ------------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQA
                          RCCPPNIEDFAFKRTKDSPGSRKSK PLDPL+NGIE SNPTRQK IAE GKPPVNWE+VLEGIR+MRSSEEAPVDTMGCGQA
Subjt:  ------------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQA

Query:  GSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLL
        GSTLPPKERRFAVLASSLLSSQTKDHVTH    GAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELL+L
Subjt:  GSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLL

Query:  PGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        PGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPK+EWVPINPLLV
Subjt:  PGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV

XP_038878187.1 endonuclease III homolog 2, chloroplastic-like isoform X2 [Benincasa hispida]1.7e-16984.76Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAK------------
        M  ACPIRT A SITFARRITCSGMSKGSLSS+PTSSNEVPPN    GVKSS+GVSES TRVFVRRRVKKNAEGQ SGLEVEP +DAK            
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAK------------

Query:  ------------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQA
                          RCCPPNIEDFAFKRTKDSPGSRKSK PLDPL+NGIE SNPTRQK IAE GKPPVNWE+VLEGIR+MRSSEEAPVDTMGCGQA
Subjt:  ------------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQA

Query:  GSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLL
        GSTLPPKERRFAVLASSLLSSQTKDHVTH    GAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELL+L
Subjt:  GSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLL

Query:  PGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        PGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPK+EWVPINPLLV
Subjt:  PGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV

XP_038878188.1 endonuclease III homolog 2, chloroplastic-like isoform X3 [Benincasa hispida]1.7e-17793.02Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        M  ACPIRT A SITFARRITCSGMSKGSLSS+PTSSNEVPPN GISGVKSS+GVSES TRVFVRRRVKKNAEGQ SGLEVEP +DAKRCCPPNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRKSK PLDPL+NGIE SNPTRQK IAE GKPPVNWE+VLEGIR+MRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTH 
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
           GAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELL+LPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
Subjt:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR

Query:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        ICNRLGWVSGKGSKQKTSTPEETRVALELWLPK+EWVPINPLLV
Subjt:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV

XP_038878189.1 endonuclease III homolog 2, chloroplastic-like isoform X4 [Benincasa hispida]3.0e-17492.15Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        M  ACPIRT A SITFARRITCSGMSKGSLSS+PTSSNEVPPN    GVKSS+GVSES TRVFVRRRVKKNAEGQ SGLEVEP +DAKRCCPPNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRKSK PLDPL+NGIE SNPTRQK IAE GKPPVNWE+VLEGIR+MRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTH 
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
           GAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELL+LPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
Subjt:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR

Query:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        ICNRLGWVSGKGSKQKTSTPEETRVALELWLPK+EWVPINPLLV
Subjt:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV

TrEMBL top hitse value%identityAlignment
A0A1S3BX24 Endonuclease III homolog1.1e-16688.37Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        MF ACPIR  ALSITFARRITCS MSKGS SS+PTSSNEVPPN GIS VKSS+GVSE  TRVFVRRRVKK AE QDSG EVEP ID KR CPPNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSR+SKPP D L+NGIE S PT  K  A RGKPP NWE+VL+GIREMRSSEEAPVDTMGCG+AGSTLPPKERRFAVLASSLLSSQTKDHVTH 
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
           GAALRLQESGLLTA+AMDKADE TIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
Subjt:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR

Query:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        ICNRLGWVSGKGSKQKTSTPEETRV LELWLPK+EWVPINPLLV
Subjt:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV

A0A5A7TN21 Endonuclease III homolog1.6e-16588.08Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        MF ACPIR  ALSITFARRITCS MSKGS SS+PTSSNEVPPN GIS VKSS+GVSE  TRVFVRRRVKK AE QDSG EVEP I+ KR  PPNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSR+SKPP D L+NGIE S PT  K  A R KPP NWE+VL+GIREMRSSEEAPVDTMGCG+AGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
         +AGAALRLQESGLLTA+AMDKADE TIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
Subjt:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR

Query:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        ICNRLGWVSGKGSKQKTSTPEETRV LELWLPK+EWVPINPLLV
Subjt:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV

A0A6J1CTU5 Endonuclease III homolog5.6e-16386.42Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVP-PNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFA
        MFL C I T  L I FARRITC  MSKGSLSS+PTSSN+ P  ++GISGV+SS+GVSES TRVFVRRRVKKNAE QDSG +VEPN+DAKRCCPP+IEDFA
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVP-PNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFA

Query:  FKRTKDSPGSRKSK-PPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVT
        FKRTK+SPGS KSK PPLDPLV GIE SNP RQK IAERGKPPVNWE++LEGIREMRSSE+APVDTMGCGQA STLPPKERRF+VLASSLLSSQTKDHVT
Subjt:  FKRTKDSPGSRKSK-PPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVT

Query:  HGKSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHV
        H    GAA RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHV
Subjt:  HGKSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHV

Query:  HRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        HRI NRLGWVSGKGSKQKTSTPEETRVALELWLPK+EWVPIN LLV
Subjt:  HRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV

A0A6J1GW15 Endonuclease III homolog5.1e-16487.5Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        MFLACPIR SALSITFARRITC+GMSK S SSMP +SNE PP  GISGV+SS+GVS+S TRVFVRR VKK AEGQ SGL++E   D KRCCPP+IEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRKSKPPLD L+  IEDSNPTRQK +AE GK PV+WE+VLEGIREMRSSE APVDTMGCG+AGSTLPPKERRFAVLASSLLSSQTKDHVTH 
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
           GAALRLQESGLLTADAMDKADE+TIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDV+GICVDTHVHR
Subjt:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR

Query:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        ICNRLGWVSGKGSKQKTS+PEETRVALELWLPK+EWVPINPLLV
Subjt:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV

A0A6J1IUE8 Endonuclease III homolog2.1e-16287.5Show/hide
Query:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF
        MFLACPIR SALSITFARRITC+GMSK S SSMP +SNE  P  GISGV+SS+GVS+S TRVFVRR VKK AEGQ SGL++E   D K CC PNIEDFAF
Subjt:  MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRKSKPPLD L+  IEDSNPTRQK +AE GKPPV+WE+VLEGIREMRSSE APVDTMGCG+AGSTLPPKERRFAVLASSLLSSQTKDHVTH 
Subjt:  KRTKDSPGSRKSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
           GAALRLQESGLLTADAMDKADE+TIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR
Subjt:  KSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHR

Query:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        ICNRLGWVSGKGSKQKTS+PEETRVALELWLPK+EWVPINPLLV
Subjt:  ICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPINPLLV

SwissProt top hitse value%identityAlignment
A7M7B9 Endonuclease III-like protein 11.0e-5251.74Show/hide
Query:  SNPTRQKSIAERGKP----------PVNWEEVLEGIREMRSSEEAPVDTMGCGQAGST-LPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQESGL
        S  TR+  IA   +P          P NW++ LE IREMR   +APVD MG  +   T  PP+  R+ VL S +LSSQTKD VT    + A LRL++ G 
Subjt:  SNPTRQKSIAERGKP----------PVNWEEVLEGIREMRSSEEAPVDTMGCGQAGST-LPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQESGL

Query:  LTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSK
        LT D++ + D+AT+  +IYPVGF+  K K +K+   I   KYGGDIP ++EEL+ LPG+GPK+AHL M +AWN V GI VDTHVHRI NRL WV     K
Subjt:  LTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSK

Query:  QKTSTPEETRVALELWLPKDEWVPINPLLV
        ++T  PEETRVALE WLP+D W  IN LLV
Subjt:  QKTSTPEETRVALELWLPKDEWVPINPLLV

B9DFZ0 Endonuclease III homolog 2, chloroplastic2.6e-8857.05Show/hide
Query:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS
        +LSS  + S E       S  +++ G SES TRV +R +R+K+ + E        E       C  P+IED  +K+T  +  SR  K  L+  +   E S
Subjt:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS

Query:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAAL-RLQESGLLTADAMDKADE
                A  G PP NWE+VLEGIR+M+ SEEAPV+ + C + GS LPPKERRF VL  +LLSSQTK+H+T     GAA+ RL ++GLLT +A+DKADE
Subjt:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAAL-RLQESGLLTADAMDKADE

Query:  ATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRV
        +TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKTS+PEETRV
Subjt:  ATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRV

Query:  ALELWLPKDEWVPINPLLV
        AL+ WLPK EWV IN LLV
Subjt:  ALELWLPKDEWVPINPLLV

P78549 Endonuclease III-like protein 12.1e-5052.68Show/hide
Query:  PVNWEEVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYS
        P +W++ L  IR MR+ ++APVD +G      S+ PPK RR+ VL S +LSSQTKD VT    AGA  RL+  G LT D++ + D+AT+  LIYPVGF+ 
Subjt:  PVNWEEVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYS

Query:  TKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPI
        +K K +K+ + I    YGGDIP S+ EL+ LPG+GPK+AHL M +AW  V GI VDTHVHRI NRL W     +K+ T +PEETR ALE WLP++ W  I
Subjt:  TKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPI

Query:  NPLLV
        N LLV
Subjt:  NPLLV

Q2KID2 Endonuclease III-like protein 18.9e-4951.22Show/hide
Query:  PVNWEEVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYS
        P +W + L+ IR MRS ++APVD +G       +  PK RR+ VL S +LSSQTKD VT    AGA  RL+  G LT D++ + D++T+ +LIYPVGF+ 
Subjt:  PVNWEEVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYS

Query:  TKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPI
        +K K +K+ + I   +Y GDIP S+ EL+ LPG+GPK+AHL M +AW  V GI VDTHVHRI NRL W     +K+ T +PEETR ALE WLP++ W  I
Subjt:  TKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELWLPKDEWVPI

Query:  NPLLV
        N LLV
Subjt:  NPLLV

Q9SIC4 Endonuclease III homolog 1, chloroplastic5.2e-8955.86Show/hide
Query:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK
        F R  T S    G++SS    S +       S  + + G S S TRV+ R++  K    +     SG  V  +   K C  P+IEDFA+K+T  SP S +
Subjt:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK

Query:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQE
        S                    S+   G PP NW EVLEGIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V +     A  RL +
Subjt:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQE

Query:  SGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGK
        +GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGWVS  
Subjt:  SGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGK

Query:  GSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        G+KQKT++PEETRVAL+ WLPK+EWV INPLLV
Subjt:  GSKQKTSTPEETRVALELWLPKDEWVPINPLLV

Arabidopsis top hitse value%identityAlignment
AT1G05900.1 endonuclease III 26.7e-7654.83Show/hide
Query:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS
        +LSS  + S E       S  +++ G SES TRV +R +R+K+ + E        E       C  P+IED  +K+T  +  SR  K  L+  +   E S
Subjt:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS

Query:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAAL-RLQESGLLTADAMDKADE
                A  G PP NWE+VLEGIR+M+ SEEAPV+ + C + GS LPPKERRF VL  +LLSSQTK+H+T     GAA+ RL ++GLLT +A+DKADE
Subjt:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAAL-RLQESGLLTADAMDKADE

Query:  ATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ
        +TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQ
Subjt:  ATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ

AT1G05900.2 endonuclease III 21.8e-8957.05Show/hide
Query:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS
        +LSS  + S E       S  +++ G SES TRV +R +R+K+ + E        E       C  P+IED  +K+T  +  SR  K  L+  +   E S
Subjt:  SLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVR-RRVKK-NAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLVNGIEDS

Query:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAAL-RLQESGLLTADAMDKADE
                A  G PP NWE+VLEGIR+M+ SEEAPV+ + C + GS LPPKERRF VL  +LLSSQTK+H+T     GAA+ RL ++GLLT +A+DKADE
Subjt:  NPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAAL-RLQESGLLTADAMDKADE

Query:  ATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRV
        +TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRICNRLGWVS  G+KQKTS+PEETRV
Subjt:  ATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRV

Query:  ALELWLPKDEWVPINPLLV
        AL+ WLPK EWV IN LLV
Subjt:  ALELWLPKDEWVPINPLLV

AT2G31450.1 DNA glycosylase superfamily protein3.7e-9055.86Show/hide
Query:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK
        F R  T S    G++SS    S +       S  + + G S S TRV+ R++  K    +     SG  V  +   K C  P+IEDFA+K+T  SP S +
Subjt:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK

Query:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQE
        S                    S+   G PP NW EVLEGIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V +     A  RL +
Subjt:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQE

Query:  SGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGK
        +GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGWVS  
Subjt:  SGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGK

Query:  GSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        G+KQKT++PEETRVAL+ WLPK+EWV INPLLV
Subjt:  GSKQKTSTPEETRVALELWLPKDEWVPINPLLV

AT2G31450.2 DNA glycosylase superfamily protein3.7e-9056.16Show/hide
Query:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK
        F R  T S    G++SS    S  +     +S   S+ G S S TRV+ R++  K    +     SG  V  +   K C  P+IEDFA+K+T  SP S +
Subjt:  FARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQD----SGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSRK

Query:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQE
        S                    S+   G PP NW EVLEGIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V +     A  RL +
Subjt:  SKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQE

Query:  SGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGK
        +GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNRLGWVS  
Subjt:  SGLLTADAMDKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGK

Query:  GSKQKTSTPEETRVALELWLPKDEWVPINPLLV
        G+KQKT++PEETRVAL+ WLPK+EWV INPLLV
Subjt:  GSKQKTSTPEETRVALELWLPKDEWVPINPLLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTTGGCTTGTCCTATTCGAACATCGGCGCTTTCGATTACATTTGCGCGAAGAATTACATGCAGCGGCATGTCGAAAGGAAGCTTGTCTTCCATGCCTACAAGCTC
AAACGAAGTCCCCCCAAATGCAGGAATTTCAGGTGTTAAGTCCTCCGATGGCGTTTCTGAGTCTGTAACTCGTGTATTTGTGAGGAGAAGAGTGAAAAAGAATGCTGAAG
GTCAAGATAGCGGGCTTGAAGTTGAACCTAATATCGACGCTAAACGCTGCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGG
AAGTCAAAGCCTCCTCTAGATCCCCTTGTCAACGGAATTGAAGATTCTAATCCAACTAGACAGAAAAGCATTGCAGAAAGAGGTAAACCACCTGTGAATTGGGAAGAAGT
CCTTGAAGGAATTCGGGAGATGAGATCCTCTGAAGAAGCTCCAGTTGATACCATGGGATGTGGGCAAGCTGGTAGTACTCTTCCTCCCAAGGAAAGAAGATTCGCTGTCT
TGGCATCTTCTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGCAAGAGTGCAGGAGCAGCATTGCGTCTCCAAGAAAGTGGTCTTCTTACTGCTGATGCCATG
GACAAAGCTGATGAAGCAACCATTAAAAGCTTGATTTACCCGGTTGGGTTTTATTCTACGAAGGCTAAGAATTTGAAGAAGATTGCAAAAATATGTCTCATGAAGTATGG
TGGGGACATACCTAGATCATTGGAGGAGCTACTTCTGCTACCTGGGATAGGCCCTAAGATTGCACATTTGATCATGATTATGGCTTGGAACGATGTTCAGGGGATATGTG
TTGATACTCATGTGCATCGCATTTGCAATCGGCTTGGATGGGTGTCTGGGAAAGGCTCAAAACAGAAAACATCCACTCCTGAAGAAACTCGAGTGGCATTAGAACTGTGG
CTGCCAAAAGATGAATGGGTTCCAATTAATCCTCTTCTGGTTAAGAAACATCCTCCATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTTTTGGCTTGTCCTATTCGAACATCGGCGCTTTCGATTACATTTGCGCGAAGAATTACATGCAGCGGCATGTCGAAAGGAAGCTTGTCTTCCATGCCTACAAGCTC
AAACGAAGTCCCCCCAAATGCAGGAATTTCAGGTGTTAAGTCCTCCGATGGCGTTTCTGAGTCTGTAACTCGTGTATTTGTGAGGAGAAGAGTGAAAAAGAATGCTGAAG
GTCAAGATAGCGGGCTTGAAGTTGAACCTAATATCGACGCTAAACGCTGCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGG
AAGTCAAAGCCTCCTCTAGATCCCCTTGTCAACGGAATTGAAGATTCTAATCCAACTAGACAGAAAAGCATTGCAGAAAGAGGTAAACCACCTGTGAATTGGGAAGAAGT
CCTTGAAGGAATTCGGGAGATGAGATCCTCTGAAGAAGCTCCAGTTGATACCATGGGATGTGGGCAAGCTGGTAGTACTCTTCCTCCCAAGGAAAGAAGATTCGCTGTCT
TGGCATCTTCTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGCAAGAGTGCAGGAGCAGCATTGCGTCTCCAAGAAAGTGGTCTTCTTACTGCTGATGCCATG
GACAAAGCTGATGAAGCAACCATTAAAAGCTTGATTTACCCGGTTGGGTTTTATTCTACGAAGGCTAAGAATTTGAAGAAGATTGCAAAAATATGTCTCATGAAGTATGG
TGGGGACATACCTAGATCATTGGAGGAGCTACTTCTGCTACCTGGGATAGGCCCTAAGATTGCACATTTGATCATGATTATGGCTTGGAACGATGTTCAGGGGATATGTG
TTGATACTCATGTGCATCGCATTTGCAATCGGCTTGGATGGGTGTCTGGGAAAGGCTCAAAACAGAAAACATCCACTCCTGAAGAAACTCGAGTGGCATTAGAACTGTGG
CTGCCAAAAGATGAATGGGTTCCAATTAATCCTCTTCTGGTTAAGAAACATCCTCCATTGTAA
Protein sequenceShow/hide protein sequence
MFLACPIRTSALSITFARRITCSGMSKGSLSSMPTSSNEVPPNAGISGVKSSDGVSESVTRVFVRRRVKKNAEGQDSGLEVEPNIDAKRCCPPNIEDFAFKRTKDSPGSR
KSKPPLDPLVNGIEDSNPTRQKSIAERGKPPVNWEEVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGKSAGAALRLQESGLLTADAM
DKADEATIKSLIYPVGFYSTKAKNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQKTSTPEETRVALELW
LPKDEWVPINPLLVKKHPPL