; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005241 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005241
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionEndonuclease III homolog
Genome locationscaffold11:33739565..33752072
RNA-Seq ExpressionSpg005241
SyntenySpg005241
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0019104 - DNA N-glycosylase activity (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
InterPro domainsIPR000445 - Helix-hairpin-helix motif
IPR003265 - HhH-GPD domain
IPR003651 - Endonuclease III-like, iron-sulphur cluster loop motif
IPR004035 - Endonuclease III, iron-sulphur binding site
IPR004036 - Endonuclease III-like, conserved site-2
IPR011257 - DNA glycosylase
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
IPR030841 - Endonuclease III-like protein 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145215.1 endonuclease III homolog 1, chloroplastic-like isoform X1 [Momordica charantia]6.0e-15650.21Show/hide
Query:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE----SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFA
        MFLTC I TP L IAFARRITC  MSKGSLSSL TSSNK P +    SGV+SSNGVSESETRVFVRRRVKKNAEVQDSG +VEPNVDAKRCCPP+IEDFA
Subjt:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE----SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFA

Query:  FKRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLT
        FKRTK+SPGS                          W+ KP                                                   PPLDPL+T
Subjt:  FKRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLT

Query:  EIEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIF
         IE SNP RQKGIAE GKPP NWEK+LEGIREMRSSE+APVDTMGCGQA STLPPKERRFSVLASSLLSSQTKDHVTH                      
Subjt:  EIEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIF

Query:  SLIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGI
                   GAA RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGI
Subjt:  SLIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGI

Query:  CVDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAK
        CVDTHVHRI NRLGWVSGKGSKQ                                                                             
Subjt:  CVDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAK

Query:  ESFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWS
                                                                                                            
Subjt:  ESFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWS

Query:  IALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSST
                                    KTSTPEETRVALELWLPKEEWVPIN LLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSST
Subjt:  IALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSST

Query:  KKL
        KKL
Subjt:  KKL

XP_022145216.1 endonuclease III homolog 1, chloroplastic-like isoform X2 [Momordica charantia]2.7e-15650.43Show/hide
Query:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAFKR
        MFLTC I TP L IAFARRITC  MSKGSLSSL TSSNK P + SGV+SSNGVSESETRVFVRRRVKKNAEVQDSG +VEPNVDAKRCCPP+IEDFAFKR
Subjt:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE
        TK+SPGS                          W+ KP                                                   PPLDPL+T IE
Subjt:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE

Query:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI
         SNP RQKGIAE GKPP NWEK+LEGIREMRSSE+APVDTMGCGQA STLPPKERRFSVLASSLLSSQTKDHVTH                         
Subjt:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI

Query:  DKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVD
                GAA RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVD
Subjt:  DKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVD

Query:  THVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESF
        THVHRI NRLGWVSGKGSKQ                                                                                
Subjt:  THVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESF

Query:  SSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIAL
                                                                                                            
Subjt:  SSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIAL

Query:  PTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
                                 KTSTPEETRVALELWLPKEEWVPIN LLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
Subjt:  PTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL

XP_038878187.1 endonuclease III homolog 2, chloroplastic-like isoform X2 [Benincasa hispida]1.6e-15649.11Show/hide
Query:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAK---------------
        M   CPIRTP  SI FARRITC+GMSKGSLSSL TSSN+VPP  GVKSSNGVSESETRVFVRRRVKKNAE Q SGLEVEP VDAK               
Subjt:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAK---------------

Query:  ---------------RCCPPNIEDFAFKRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQ
                       RCCPPNIEDFAFKRTKDSPGS+                                                               
Subjt:  ---------------RCCPPNIEDFAFKRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQ

Query:  FYIHVWKWKYATIEGKSKPPLDPLLTEIEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKD
                       KSK PLDPLL  IE SNPTRQKGIAEGGKPP NWEKVLEGIR+MRSSEEAPVDTMGCGQAGSTLPPKERRF+VLASSLLSSQTKD
Subjt:  FYIHVWKWKYATIEGKSKPPLDPLLTEIEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKD

Query:  HVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEE
        HVTH                                 GAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEE
Subjt:  HVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEE

Query:  LLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEW
        LL+LPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQ                                                   
Subjt:  LLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEW

Query:  AALSNLIPSIHLTDCQGKWIWKLDAKESFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGN
                                                                                                            
Subjt:  AALSNLIPSIHLTDCQGKWIWKLDAKESFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGN

Query:  LSATSLYTARSQGKFGMRFLHPLGWSIALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNC
                                                              KTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNC
Subjt:  LSATSLYTARSQGKFGMRFLHPLGWSIALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNC

Query:  SVSDLCPSAFKESSSPSPKLKRSSSTKKL
        SVSDLCPSAFKESSSPSPKLK SSSTKKL
Subjt:  SVSDLCPSAFKESSSPSPKLKRSSSTKKL

XP_038878188.1 endonuclease III homolog 2, chloroplastic-like isoform X3 [Benincasa hispida]3.1e-16051.14Show/hide
Query:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE---SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAF
        M   CPIRTP  SI FARRITC+GMSKGSLSSL TSSN+VPP    SGVKSSNGVSESETRVFVRRRVKKNAE Q SGLEVEP VDAKRCCPPNIEDFAF
Subjt:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE---SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAF

Query:  KRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTE
        KRTKDSPGS+                                                                              KSK PLDPLL  
Subjt:  KRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTE

Query:  IEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFS
        IE SNPTRQKGIAEGGKPP NWEKVLEGIR+MRSSEEAPVDTMGCGQAGSTLPPKERRF+VLASSLLSSQTKDHVTH                       
Subjt:  IEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFS

Query:  LIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC
                  GAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELL+LPGIGPKIAHLIMIMAWNDVQGIC
Subjt:  LIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC

Query:  VDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKE
        VDTHVHRICNRLGWVSGKGSKQ                                                                              
Subjt:  VDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKE

Query:  SFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSI
                                                                                                            
Subjt:  SFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSI

Query:  ALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK
                                   KTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLK SSSTK
Subjt:  ALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK

Query:  KL
        KL
Subjt:  KL

XP_038878189.1 endonuclease III homolog 2, chloroplastic-like isoform X4 [Benincasa hispida]2.8e-16151.22Show/hide
Query:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAFKRT
        M   CPIRTP  SI FARRITC+GMSKGSLSSL TSSN+VPP  GVKSSNGVSESETRVFVRRRVKKNAE Q SGLEVEP VDAKRCCPPNIEDFAFKRT
Subjt:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAFKRT

Query:  KDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIED
        KDSPGS+                                                                              KSK PLDPLL  IE 
Subjt:  KDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIED

Query:  SNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLID
        SNPTRQKGIAEGGKPP NWEKVLEGIR+MRSSEEAPVDTMGCGQAGSTLPPKERRF+VLASSLLSSQTKDHVTH                          
Subjt:  SNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLID

Query:  KRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDT
               GAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELL+LPGIGPKIAHLIMIMAWNDVQGICVDT
Subjt:  KRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDT

Query:  HVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESFS
        HVHRICNRLGWVSGKGSKQ                                                                                 
Subjt:  HVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESFS

Query:  SKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIALP
                                                                                                            
Subjt:  SKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIALP

Query:  TFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
                                KTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLK SSSTKKL
Subjt:  TFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL

TrEMBL top hitse value%identityAlignment
A0A1S3BX24 Endonuclease III homolog5.8e-14948.58Show/hide
Query:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESG---VKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAF
        MF  CPIR P LSI FARRITC+ MSKGS SSL TSSN+VPP  G   VKSSNGVSE ETRVFVRRRVKK AE QDSG EVEP +D KR CPPNIEDFAF
Subjt:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESG---VKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAF

Query:  KRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTE
        KRTKDSPGS+                                                                              +SKPP D LL  
Subjt:  KRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTE

Query:  IEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFS
        IE S PT  KG A  GKPP NWEKVL+GIREMRSSEEAPVDTMGCG+AGSTLPPKERRF+VLASSLLSSQTKDHVTH                       
Subjt:  IEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFS

Query:  LIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC
                  GAALRLQESGLLTA+AMDKADE TIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC
Subjt:  LIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC

Query:  VDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKE
        VDTHVHRICNRLGWVSGKGSKQ                                                                              
Subjt:  VDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKE

Query:  SFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSI
                                                                                                            
Subjt:  SFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSI

Query:  ALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK
                                   KTSTPEETRV LELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLK SSSTK
Subjt:  ALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK

Query:  KL
        KL
Subjt:  KL

A0A5A7TN21 Endonuclease III homolog4.9e-14848.43Show/hide
Query:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESG---VKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAF
        MF  CPIR P LSI FARRITC+ MSKGS SSL TSSN+VPP  G   VKSSNGVSE ETRVFVRRRVKK AE QDSG EVEP ++ KR  PPNIEDFAF
Subjt:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESG---VKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAF

Query:  KRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTE
        KRTKDSPGS+                                                                              +SKPP D LL  
Subjt:  KRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTE

Query:  IEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFS
        IE S PT  KG A   KPP NWEKVL+GIREMRSSEEAPVDTMGCG+AGSTLPPKERRF+VLASSLLSSQTKDHVTHG                      
Subjt:  IEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFS

Query:  LIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC
                +AGAALRLQESGLLTA+AMDKADE TIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC
Subjt:  LIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGIC

Query:  VDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKE
        VDTHVHRICNRLGWVSGKGSKQ                                                                              
Subjt:  VDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKE

Query:  SFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSI
                                                                                                            
Subjt:  SFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSI

Query:  ALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK
                                   KTSTPEETRV LELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLK SSSTK
Subjt:  ALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK

Query:  KL
        KL
Subjt:  KL

A0A6J1CTU5 Endonuclease III homolog2.9e-15650.21Show/hide
Query:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE----SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFA
        MFLTC I TP L IAFARRITC  MSKGSLSSL TSSNK P +    SGV+SSNGVSESETRVFVRRRVKKNAEVQDSG +VEPNVDAKRCCPP+IEDFA
Subjt:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE----SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFA

Query:  FKRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLT
        FKRTK+SPGS                          W+ KP                                                   PPLDPL+T
Subjt:  FKRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLT

Query:  EIEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIF
         IE SNP RQKGIAE GKPP NWEK+LEGIREMRSSE+APVDTMGCGQA STLPPKERRFSVLASSLLSSQTKDHVTH                      
Subjt:  EIEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIF

Query:  SLIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGI
                   GAA RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGI
Subjt:  SLIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGI

Query:  CVDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAK
        CVDTHVHRI NRLGWVSGKGSKQ                                                                             
Subjt:  CVDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAK

Query:  ESFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWS
                                                                                                            
Subjt:  ESFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWS

Query:  IALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSST
                                    KTSTPEETRVALELWLPKEEWVPIN LLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSST
Subjt:  IALPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSST

Query:  KKL
        KKL
Subjt:  KKL

A0A6J1CVP5 Endonuclease III homolog1.3e-15650.43Show/hide
Query:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAFKR
        MFLTC I TP L IAFARRITC  MSKGSLSSL TSSNK P + SGV+SSNGVSESETRVFVRRRVKKNAEVQDSG +VEPNVDAKRCCPP+IEDFAFKR
Subjt:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE
        TK+SPGS                          W+ KP                                                   PPLDPL+T IE
Subjt:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE

Query:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI
         SNP RQKGIAE GKPP NWEK+LEGIREMRSSE+APVDTMGCGQA STLPPKERRFSVLASSLLSSQTKDHVTH                         
Subjt:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI

Query:  DKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVD
                GAA RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVD
Subjt:  DKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVD

Query:  THVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESF
        THVHRI NRLGWVSGKGSKQ                                                                                
Subjt:  THVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESF

Query:  SSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIAL
                                                                                                            
Subjt:  SSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIAL

Query:  PTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
                                 KTSTPEETRVALELWLPKEEWVPIN LLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
Subjt:  PTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL

A0A6J1GW15 Endonuclease III homolog1.4e-14748.14Show/hide
Query:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAFKR
        MFL CPIR   LSI FARRITC GMSK S SS+  +SN+ PP  SGV+SSNGVS+SETRVFVRR VKK AE Q SGL++E   D KRCCPP+IEDFAFKR
Subjt:  MFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE
        TKDSPGS+                                                                              KSKPPLD LL  IE
Subjt:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE

Query:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI
        DSNPTRQKG+AEGGK P +WEKVLEGIREMRSSE APVDTMGCG+AGSTLPPKERRF+VLASSLLSSQTKDHVTH                         
Subjt:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI

Query:  DKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVD
                GAALRLQESGLLTADAMDKADE+TIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDV+GICVD
Subjt:  DKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVD

Query:  THVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESF
        THVHRICNRLGWVSGKGSKQ                                                                                
Subjt:  THVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESF

Query:  SSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIAL
                                                                                                            
Subjt:  SSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIAL

Query:  PTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK
                                 KTS+PEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVS LCPSAFKE+SSPSPKLKRSSSTK
Subjt:  PTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK

SwissProt top hitse value%identityAlignment
A7M7B9 Endonuclease III-like protein 14.7e-3945.96Show/hide
Query:  PANWEKVLEGIREMRSSEEAPVDTMGCGQAGST-LPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAGAALRLQ
        P NW++ LE IREMR   +APVD MG  +   T  PP+  R+ VL S +LSSQTKD VT                                 + A LRL+
Subjt:  PANWEKVLEGIREMRSSEEAPVDTMGCGQAGST-LPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAGAALRLQ

Query:  ESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWV
        + G LT D++ + D+AT+  +IYPVGF+  K + +K+   I   KYGGDIP ++EEL+ LPG+GPK+AHL M +AWN V GI VDTHVHRI NRL WV
Subjt:  ESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGWV

A7M7B9 Endonuclease III-like protein 16.5e-1258.93Show/hide
Query:  KKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSA
        K+T  PEETRVALE WLP++ W  IN LLVGFGQ  C P+ P+C  C   D+CP+A
Subjt:  KKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSA

B9DFZ0 Endonuclease III homolog 2, chloroplastic5.9e-7431.81Show/hide
Query:  TCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKNAEVQD----SGLEVEPNVDAKRCCPPNIEDFAFKR
        T PI   VL+    R      +S  S  S+S  S  +  +S  ++++G SESETRV +R++  K  +++     S  E +   D   C  P+IED  +K+
Subjt:  TCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKNAEVQD----SGLEVEPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE
        T  +  S++                   R ++++ +K T    SA+S                                                     
Subjt:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE

Query:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI
             +  G+   G PP NWEKVLEGIR+M+ SEEAPV+ + C + GS LPPKERRF VL  +LLSSQTK+H+T                          
Subjt:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI

Query:  DKRSFRSAGAAL-RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV
                GAA+ RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAHL++ +AWNDVQGICV
Subjt:  DKRSFRSAGAAL-RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV

Query:  DTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKES
        DTHVHRICNRLGWVS  G+KQ                                                                               
Subjt:  DTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKES

Query:  FSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIA
                                                                                                            
Subjt:  FSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIA

Query:  LPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK
                                  KTS+PEETRVAL+ WLPK EWV IN LLVGFGQTICTPLRP CG CS++++CPSAFKE+ S S KLK+S  +KK
Subjt:  LPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK

Query:  L
        L
Subjt:  L

P78549 Endonuclease III-like protein 13.4e-3743.54Show/hide
Query:  EGGKP-------PANWEKVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKR
        EG +P       P +W++ L  IR MR+ ++APVD +G      S+ PPK RR+ VL S +LSSQTKD VT                             
Subjt:  EGGKP-------PANWEKVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKR

Query:  SFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHV
            AGA  RL+  G LT D++ + D+AT+  LIYPVGF+ +K + +K+ + I    YGGDIP S+ EL+ LPG+GPK+AHL M +AW  V GI VDTHV
Subjt:  SFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHV

Query:  HRICNRLGW
        HRI NRL W
Subjt:  HRICNRLGW

P78549 Endonuclease III-like protein 11.9e-1158.93Show/hide
Query:  KKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSA
        K T +PEETR ALE WLP+E W  IN LLVGFGQ  C P+ P+C  C    LCP+A
Subjt:  KKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSA

Q2KID2 Endonuclease III-like protein 18.4e-3643.15Show/hide
Query:  PANWEKVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAGAALRLQ
        P +W + L+ IR MRS ++APVD +G       +  PK RR+ VL S +LSSQTKD VT                                 AGA  RL+
Subjt:  PANWEKVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAGAALRLQ

Query:  ESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGW
          G LT D++ + D++T+ +LIYPVGF+ +K + +K+ + I   +Y GDIP S+ EL+ LPG+GPK+AHL M +AW  V GI VDTHVHRI NRL W
Subjt:  ESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNRLGW

Q2KID2 Endonuclease III-like protein 18.4e-1260.71Show/hide
Query:  KKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSA
        K T +PEETR ALE WLP+E W  IN LLVGFGQ  C P+RP+C  C    LCP+A
Subjt:  KKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSA

Q9SIC4 Endonuclease III homolog 1, chloroplastic2.0e-7437.14Show/hide
Query:  IAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAG
        +   G PP NW +VLEGIR+MRSSE+APVD+MGC +AGS LPP ERRF+VL  +LLSSQTKD V +    A+H                           
Subjt:  IAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAG

Query:  AALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
           RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESFSSKSLVTSL
        LGWVS  G+KQ                                                                                         
Subjt:  LGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESFSSKSLVTSL

Query:  VSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIALPTFLPLLTL
                                                                                                            
Subjt:  VSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIALPTFLPLLTL

Query:  SANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK
                        KT++PEETRVAL+ WLPKEEWV INPLLVGFGQ ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK+S+ +K+
Subjt:  SANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK

Arabidopsis top hitse value%identityAlignment
AT1G05900.1 endonuclease III 24.2e-6739.81Show/hide
Query:  TCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKNAEVQD----SGLEVEPNVDAKRCCPPNIEDFAFKR
        T PI   VL+    R      +S  S  S+S  S  +  +S  ++++G SESETRV +R++  K  +++     S  E +   D   C  P+IED  +K+
Subjt:  TCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKNAEVQD----SGLEVEPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE
        T  +  S++                   R ++++ +K T    SA+S                                                     
Subjt:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE

Query:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI
             +  G+   G PP NWEKVLEGIR+M+ SEEAPV+ + C + GS LPPKERRF VL  +LLSSQTK+H+T                          
Subjt:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI

Query:  DKRSFRSAGAAL-RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV
                GAA+ RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAHL++ +AWNDVQGICV
Subjt:  DKRSFRSAGAAL-RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV

Query:  DTHVHRICNRLGWVSGKGSKQI
        DTHVHRICNRLGWVS  G+KQ+
Subjt:  DTHVHRICNRLGWVSGKGSKQI

AT1G05900.2 endonuclease III 24.2e-7531.81Show/hide
Query:  TCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKNAEVQD----SGLEVEPNVDAKRCCPPNIEDFAFKR
        T PI   VL+    R      +S  S  S+S  S  +  +S  ++++G SESETRV +R++  K  +++     S  E +   D   C  P+IED  +K+
Subjt:  TCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKNAEVQD----SGLEVEPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE
        T  +  S++                   R ++++ +K T    SA+S                                                     
Subjt:  TKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCLGKQFYIHVWKWKYATIEGKSKPPLDPLLTEIE

Query:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI
             +  G+   G PP NWEKVLEGIR+M+ SEEAPV+ + C + GS LPPKERRF VL  +LLSSQTK+H+T                          
Subjt:  DSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLI

Query:  DKRSFRSAGAAL-RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV
                GAA+ RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAHL++ +AWNDVQGICV
Subjt:  DKRSFRSAGAAL-RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICV

Query:  DTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKES
        DTHVHRICNRLGWVS  G+KQ                                                                               
Subjt:  DTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKES

Query:  FSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIA
                                                                                                            
Subjt:  FSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIA

Query:  LPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK
                                  KTS+PEETRVAL+ WLPK EWV IN LLVGFGQTICTPLRP CG CS++++CPSAFKE+ S S KLK+S  +KK
Subjt:  LPTFLPLLTLSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK

Query:  L
        L
Subjt:  L

AT2G31450.1 DNA glycosylase superfamily protein1.4e-7537.14Show/hide
Query:  IAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAG
        +   G PP NW +VLEGIR+MRSSE+APVD+MGC +AGS LPP ERRF+VL  +LLSSQTKD V +    A+H                           
Subjt:  IAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAG

Query:  AALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
           RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESFSSKSLVTSL
        LGWVS  G+KQ                                                                                         
Subjt:  LGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESFSSKSLVTSL

Query:  VSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIALPTFLPLLTL
                                                                                                            
Subjt:  VSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIALPTFLPLLTL

Query:  SANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK
                        KT++PEETRVAL+ WLPKEEWV INPLLVGFGQ ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK+S+ +K+
Subjt:  SANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK

AT2G31450.2 DNA glycosylase superfamily protein1.4e-7537.14Show/hide
Query:  IAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAG
        +   G PP NW +VLEGIR+MRSSE+APVD+MGC +AGS LPP ERRF+VL  +LLSSQTKD V +    A+H                           
Subjt:  IAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKPRAVHYSLFPRDVKLSLLIFSLIDKRSFRSAG

Query:  AALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR
           RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRICNR
Subjt:  AALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRICNR

Query:  LGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESFSSKSLVTSL
        LGWVS  G+KQ                                                                                         
Subjt:  LGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKESFSSKSLVTSL

Query:  VSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIALPTFLPLLTL
                                                                                                            
Subjt:  VSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIALPTFLPLLTL

Query:  SANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK
                        KT++PEETRVAL+ WLPKEEWV INPLLVGFGQ ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK+S+ +K+
Subjt:  SANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CAAAACGCACTATGCCATGCCATCTTGAGAGATCTGGTGTCAGACAAGGCATCGAGCCCAATTATCCATTTCGACCAGCCAATCTGGGGCTTTCACAAAGCAAAATTTTG
GAATTTGGAGCCTGGACCCTTTTTAAATTGGATCGTTCTACCGATGGCCATGTTTCTTACTTGTCCTATTAGAACTCCGGTGCTTTCGATTGCATTTGCGCGAAGAATTA
CATGCAACGGCATGTCGAAAGGAAGTTTATCTTCCTTGTCAACAAGCTCAAACAAAGTCCCTCCAGAATCAGGTGTTAAGTCCTCCAATGGCGTTTCTGAGTCTGAAACT
CGTGTATTCGTGAGGAGGAGGGTGAAAAAGAATGCAGAAGTTCAAGATAGCGGGCTTGAAGTTGAACCTAATGTCGACGCTAAACGCTGCTGTCCTCCTAATATTGAAGA
TTTTGCATTCAAACGAACAAAGGATTCCCCCGGATCAAAAAGTGGCAAAAAGTTTATCAAAAGGTTACCAAGGCTAGTGCTAGTTTTAAGGCATGCTATGAGGCCTGTGT
CTGCTTGGCAGGTGAAGCCAACCTCTCGAAGGTCTTCTGCTGCTAGCGGGCATCGATGTCAGCCAACCAGTGGAGAGGTTAGCAGGTCCTTTTGGACTGCCGACTGCCTG
GGAAAGCAATTTTATATTCATGTATGGAAATGGAAGTATGCAACGATTGAAGGGAAGTCGAAGCCTCCTCTAGATCCTCTTCTCACAGAAATTGAAGATTCTAATCCAAC
TAGACAAAAAGGCATAGCAGAAGGAGGCAAACCACCTGCGAATTGGGAAAAAGTCCTTGAAGGAATTCGTGAAATGAGATCCTCTGAAGAAGCTCCAGTAGATACCATGG
GATGTGGGCAAGCTGGTAGTACTCTTCCTCCCAAGGAAAGAAGATTTTCTGTCTTGGCATCTTCTCTTCTCTCTAGCCAAACCAAAGACCACGTGACTCATGGCAAGCCC
AGGGCAGTACATTATTCGTTGTTTCCCAGAGATGTTAAGCTCTCTCTTCTAATATTTTCTTTAATTGACAAAAGATCTTTCAGGAGTGCAGGAGCAGCATTGCGTCTCCA
AGAAAGTGGTCTTCTTACTGCTGATGCTATGGACAAAGCTGATGAAGCAACCATTAAAAGCTTGATTTACCCGGTTGGATTTTATTCTACAAAGGCTAGGAATTTGAAGA
AGATTGCAAAAATATGTCTTATGAAGTATGGTGGGGATATACCTAGATCATTGGAGGAGCTACTTCTACTACCTGGCATAGGTCCTAAGATTGCGCATTTGATCATGATT
ATGGCATGGAACGATGTTCAGGGGATATGTGTAGATACTCACGTGCATCGCATTTGCAATCGGCTTGGATGGGTGTCTGGAAAAGGCTCAAAACAGATTGGAGACAAGCC
TCTCATTTTGGTTTTTCCTCGTTTATTTGCTATAACTGATTCTAAGAATGCTTCCATTGCTGAAGTTTGGGATTTGGTAAATGGGTCCTCGATTTTGAACTTTTGTTGGA
ATTTTAAGGTATATGGAATTAATGAATGGGCAGCCCTCTCCAATTTGATCCCTTCCATTCACCTCACTGATTGCCAGGGCAAATGGATTTGGAAGCTAGATGCAAAGGAA
TCCTTCTCATCAAAATCTTTGGTTACAAGCCTTGTATCCGGAAATCAATCACAAACCAAGGAGTTATACAAAAAGATTTGGAAGGAGCCCTGCCCTAAGATGATAAAGCT
TTTTTCGTGGGAGATCAGCCATTCCTGTTTTAATACCCTTGATAGACTTCAAAAGAGATGCCCTTGTTATCACTCCCCTAGTACTGTCTTTGCAAGAAAGCAGGGGAATC
TCTCAGCCACATCCTTATACACTGCTCGTTCACAAGGAAAATTTGGAATGAGATTTTTGCACCCTCTGGGGTGGAGCATTGCTCTGCCGACGTTTCTTCCTCTTCTTACT
CTTTCTGCAAATTCTTCCTCCATAGTCACTTTGCAAATCTGCCTCCTCAAGAAAACATCCACTCCTGAAGAAACTCGAGTGGCGTTAGAACTGTGGCTGCCGAAGGAAGA
ATGGGTTCCAATTAATCCTCTTCTGGTTGGATTTGGACAGACTATCTGCACTCCCCTTAGACCCAAATGTGGAAATTGCAGTGTTAGCGACTTGTGTCCATCTGCATTCA
AGGAGTCTTCAAGCCCATCTCCCAAATTAAAGCGTTCAAGTTCCACCAAAAAGCTATGA
mRNA sequenceShow/hide mRNA sequence
CAAAACGCACTATGCCATGCCATCTTGAGAGATCTGGTGTCAGACAAGGCATCGAGCCCAATTATCCATTTCGACCAGCCAATCTGGGGCTTTCACAAAGCAAAATTTTG
GAATTTGGAGCCTGGACCCTTTTTAAATTGGATCGTTCTACCGATGGCCATGTTTCTTACTTGTCCTATTAGAACTCCGGTGCTTTCGATTGCATTTGCGCGAAGAATTA
CATGCAACGGCATGTCGAAAGGAAGTTTATCTTCCTTGTCAACAAGCTCAAACAAAGTCCCTCCAGAATCAGGTGTTAAGTCCTCCAATGGCGTTTCTGAGTCTGAAACT
CGTGTATTCGTGAGGAGGAGGGTGAAAAAGAATGCAGAAGTTCAAGATAGCGGGCTTGAAGTTGAACCTAATGTCGACGCTAAACGCTGCTGTCCTCCTAATATTGAAGA
TTTTGCATTCAAACGAACAAAGGATTCCCCCGGATCAAAAAGTGGCAAAAAGTTTATCAAAAGGTTACCAAGGCTAGTGCTAGTTTTAAGGCATGCTATGAGGCCTGTGT
CTGCTTGGCAGGTGAAGCCAACCTCTCGAAGGTCTTCTGCTGCTAGCGGGCATCGATGTCAGCCAACCAGTGGAGAGGTTAGCAGGTCCTTTTGGACTGCCGACTGCCTG
GGAAAGCAATTTTATATTCATGTATGGAAATGGAAGTATGCAACGATTGAAGGGAAGTCGAAGCCTCCTCTAGATCCTCTTCTCACAGAAATTGAAGATTCTAATCCAAC
TAGACAAAAAGGCATAGCAGAAGGAGGCAAACCACCTGCGAATTGGGAAAAAGTCCTTGAAGGAATTCGTGAAATGAGATCCTCTGAAGAAGCTCCAGTAGATACCATGG
GATGTGGGCAAGCTGGTAGTACTCTTCCTCCCAAGGAAAGAAGATTTTCTGTCTTGGCATCTTCTCTTCTCTCTAGCCAAACCAAAGACCACGTGACTCATGGCAAGCCC
AGGGCAGTACATTATTCGTTGTTTCCCAGAGATGTTAAGCTCTCTCTTCTAATATTTTCTTTAATTGACAAAAGATCTTTCAGGAGTGCAGGAGCAGCATTGCGTCTCCA
AGAAAGTGGTCTTCTTACTGCTGATGCTATGGACAAAGCTGATGAAGCAACCATTAAAAGCTTGATTTACCCGGTTGGATTTTATTCTACAAAGGCTAGGAATTTGAAGA
AGATTGCAAAAATATGTCTTATGAAGTATGGTGGGGATATACCTAGATCATTGGAGGAGCTACTTCTACTACCTGGCATAGGTCCTAAGATTGCGCATTTGATCATGATT
ATGGCATGGAACGATGTTCAGGGGATATGTGTAGATACTCACGTGCATCGCATTTGCAATCGGCTTGGATGGGTGTCTGGAAAAGGCTCAAAACAGATTGGAGACAAGCC
TCTCATTTTGGTTTTTCCTCGTTTATTTGCTATAACTGATTCTAAGAATGCTTCCATTGCTGAAGTTTGGGATTTGGTAAATGGGTCCTCGATTTTGAACTTTTGTTGGA
ATTTTAAGGTATATGGAATTAATGAATGGGCAGCCCTCTCCAATTTGATCCCTTCCATTCACCTCACTGATTGCCAGGGCAAATGGATTTGGAAGCTAGATGCAAAGGAA
TCCTTCTCATCAAAATCTTTGGTTACAAGCCTTGTATCCGGAAATCAATCACAAACCAAGGAGTTATACAAAAAGATTTGGAAGGAGCCCTGCCCTAAGATGATAAAGCT
TTTTTCGTGGGAGATCAGCCATTCCTGTTTTAATACCCTTGATAGACTTCAAAAGAGATGCCCTTGTTATCACTCCCCTAGTACTGTCTTTGCAAGAAAGCAGGGGAATC
TCTCAGCCACATCCTTATACACTGCTCGTTCACAAGGAAAATTTGGAATGAGATTTTTGCACCCTCTGGGGTGGAGCATTGCTCTGCCGACGTTTCTTCCTCTTCTTACT
CTTTCTGCAAATTCTTCCTCCATAGTCACTTTGCAAATCTGCCTCCTCAAGAAAACATCCACTCCTGAAGAAACTCGAGTGGCGTTAGAACTGTGGCTGCCGAAGGAAGA
ATGGGTTCCAATTAATCCTCTTCTGGTTGGATTTGGACAGACTATCTGCACTCCCCTTAGACCCAAATGTGGAAATTGCAGTGTTAGCGACTTGTGTCCATCTGCATTCA
AGGAGTCTTCAAGCCCATCTCCCAAATTAAAGCGTTCAAGTTCCACCAAAAAGCTATGA
Protein sequenceShow/hide protein sequence
QNALCHAILRDLVSDKASSPIIHFDQPIWGFHKAKFWNLEPGPFLNWIVLPMAMFLTCPIRTPVLSIAFARRITCNGMSKGSLSSLSTSSNKVPPESGVKSSNGVSESET
RVFVRRRVKKNAEVQDSGLEVEPNVDAKRCCPPNIEDFAFKRTKDSPGSKSGKKFIKRLPRLVLVLRHAMRPVSAWQVKPTSRRSSAASGHRCQPTSGEVSRSFWTADCL
GKQFYIHVWKWKYATIEGKSKPPLDPLLTEIEDSNPTRQKGIAEGGKPPANWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFSVLASSLLSSQTKDHVTHGKP
RAVHYSLFPRDVKLSLLIFSLIDKRSFRSAGAALRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLIMI
MAWNDVQGICVDTHVHRICNRLGWVSGKGSKQIGDKPLILVFPRLFAITDSKNASIAEVWDLVNGSSILNFCWNFKVYGINEWAALSNLIPSIHLTDCQGKWIWKLDAKE
SFSSKSLVTSLVSGNQSQTKELYKKIWKEPCPKMIKLFSWEISHSCFNTLDRLQKRCPCYHSPSTVFARKQGNLSATSLYTARSQGKFGMRFLHPLGWSIALPTFLPLLT
LSANSSSIVTLQICLLKKTSTPEETRVALELWLPKEEWVPINPLLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL