CuGenDBv2

Tan0007314 (gene) of Snake gourd v1 genome

Gene ID: Tan0007314
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Endonuclease III homolog
Genome location: LG10:14855792..14891244
RNA-Seq Expression: Tan0007314
Synteny: Tan0007314
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0016829 - lyase activity (molecular function)
GO:0019104 - DNA N-glycosylase activity (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
InterPro domains:
IPR000445 - Helix-hairpin-helix motif
IPR003265 - HhH-GPD domain
IPR003651 - Endonuclease III-like, iron-sulphur cluster loop motif
IPR004035 - Endonuclease III, iron-sulphur binding site
IPR011257 - DNA glycosylase
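
The genome location listed above uses a `seqid:start..end` notation. As a minimal sketch, the snippet below parses such a string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `Location` and `parse_location` are hypothetical and are not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval such as LG10:14855792..14891244."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# seqid, a colon, then two integers separated by ".."
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:\s]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


if __name__ == "__main__":
    loc = parse_location("LG10:14855792..14891244")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 35453 bp.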


Homology
GenBank top hits (e value, % identity, alignment)
XP_022145216.1 endonuclease III homolog 1, chloroplastic-like isoform X2 [Momordica charantia] (e value: 2.2e-144; % identity: 73.51)
Query:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKR
        M  TC I TP L IAFARRITC  MSKG LSS  TSSNK P + SGV+SSNGVSESETRVFVRRRVKK AEVQDSG +V+PNVDAKRCCPP+IEDFAFKR
Subjt:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSRKSK-PPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGA
        TK+SPGS KSK PPLDPL+TGIE S P RQKGIAE GKPP+NWEK+LEGIREMRSSE+APVDTMGCGQA STLPPKERRF+VLASSLLSSQTKDHVTHGA
Subjt:  TKDSPGSRKSK-PPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGA

Query:  VLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH--------------------------
          RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAH                          
Subjt:  VLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH--------------------------

Query:  -------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL
                                             LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK L
Subjt:  -------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL

XP_038878185.1 endonuclease III homolog 2, chloroplastic-like isoform X1 [Benincasa hispida] (e value: 1.1e-146; % identity: 70.67)
Query:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE---SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAK------------
        M+F CPIRTPA SI FARRITC+GMSKG LSS  TSSN+VPP    SGVKSSNGVSESETRVFVRRRVKK AE Q SGLEV+P VDAK            
Subjt:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE---SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAK------------

Query:  ------------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQA
                          RCCPPNIEDFAFKRTKDSPGSRKSK PLDPLL GIE S PTRQKGIAEGGKPP+NWEKVLEGIR+MRSSEEAPVDTMGCGQA
Subjt:  ------------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQA

Query:  GSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIG
        GSTLPPKERRFAVLASSLLSSQTKDHVTHGA LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELL+LPGIG
Subjt:  GSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIG

Query:  PKIAH---------------------------------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKES
        PKIAH                                                               LVGFGQTICTPLRPKCGNCSVSDLCPSAFKES
Subjt:  PKIAH---------------------------------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKES

Query:  SSPSPKLKRSSSTKNL
        SSPSPKLK SSSTK L
Subjt:  SSPSPKLKRSSSTKNL

XP_038878187.1 endonuclease III homolog 2, chloroplastic-like isoform X2 [Benincasa hispida] (e value: 9.7e-148; % identity: 70.94)
Query:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAK---------------
        M+F CPIRTPA SI FARRITC+GMSKG LSS  TSSN+VPP  GVKSSNGVSESETRVFVRRRVKK AE Q SGLEV+P VDAK               
Subjt:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAK---------------

Query:  ---------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGST
                       RCCPPNIEDFAFKRTKDSPGSRKSK PLDPLL GIE S PTRQKGIAEGGKPP+NWEKVLEGIR+MRSSEEAPVDTMGCGQAGST
Subjt:  ---------------RCCPPNIEDFAFKRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGST

Query:  LPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKI
        LPPKERRFAVLASSLLSSQTKDHVTHGA LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELL+LPGIGPKI
Subjt:  LPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKI

Query:  AH---------------------------------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSP
        AH                                                               LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSP
Subjt:  AH---------------------------------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSP

Query:  SPKLKRSSSTKNL
        SPKLK SSSTK L
Subjt:  SPKLKRSSSTKNL

XP_038878188.1 endonuclease III homolog 2, chloroplastic-like isoform X3 [Benincasa hispida] (e value: 1.9e-151; % identity: 76.17)
Query:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE---SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAF
        M+F CPIRTPA SI FARRITC+GMSKG LSS  TSSN+VPP    SGVKSSNGVSESETRVFVRRRVKK AE Q SGLEV+P VDAKRCCPPNIEDFAF
Subjt:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE---SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRKSK PLDPLL GIE S PTRQKGIAEGGKPP+NWEKVLEGIR+MRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH-------------------------
        A LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELL+LPGIGPKIAH                         
Subjt:  AVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH-------------------------

Query:  --------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL
                                              LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLK SSSTK L
Subjt:  --------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL

XP_038878189.1 endonuclease III homolog 2, chloroplastic-like isoform X4 [Benincasa hispida] (e value: 1.7e-152; % identity: 76.5)
Query:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRT
        M+F CPIRTPA SI FARRITC+GMSKG LSS  TSSN+VPP  GVKSSNGVSESETRVFVRRRVKK AE Q SGLEV+P VDAKRCCPPNIEDFAFKRT
Subjt:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRT

Query:  KDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVL
        KDSPGSRKSK PLDPLL GIE S PTRQKGIAEGGKPP+NWEKVLEGIR+MRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGA L
Subjt:  KDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVL

Query:  RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH----------------------------
        RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELL+LPGIGPKIAH                            
Subjt:  RLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH----------------------------

Query:  -----------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL
                                           LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLK SSSTK L
Subjt:  -----------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KU72 ENDO3c domain-containing protein (e value: 4.4e-146; % identity: 78.21)
Query:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESG---VKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAF
        M F CPIR PALSI FARRITC+ MSKG  SS  TSSN+VPP  G   VKSSNGVSE ETRVFVRRRVKK AE QDSG EV+P +D KR CPPNIEDFAF
Subjt:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESG---VKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSRK KPPLD LL GIEDS PT  KG AE GKPP+NWEKVL+GIREMRSSEEAPVDTMGCG+AGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHL------------------------
        A LRLQESGLLTADAMDKADE TIKSLIYPVGFYSTKA+NLKKIA+ICLMKYGGDIPRSL ELLLLPGIGPKIAHL                        
Subjt:  AVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHL------------------------

Query:  -----------VGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL
                   VGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLK SSSTK L
Subjt:  -----------VGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL

A0A1S3BX24 Endonuclease III homolog (e value: 7.2e-141; % identity: 72.02)
Query:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESG---VKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAF
        M F CPIR PALSI FARRITC+ MSKG  SS  TSSN+VPP  G   VKSSNGVSE ETRVFVRRRVKK AE QDSG EV+P +D KR CPPNIEDFAF
Subjt:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESG---VKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAF

Query:  KRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG
        KRTKDSPGSR+SKPP D LL GIE S PT  KG A  GKPP NWEKVL+GIREMRSSEEAPVDTMGCG+AGSTLPPKERRFAVLASSLLSSQTKDHVTHG
Subjt:  KRTKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHG

Query:  AVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH-------------------------
        A LRLQESGLLTA+AMDKADE TIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH                         
Subjt:  AVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH-------------------------

Query:  --------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL
                                              LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLK SSSTK L
Subjt:  --------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL

A0A6J1CTU5 Endonuclease III homolog (e value: 2.4e-144; % identity: 72.94)
Query:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE----SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFA
        M  TC I TP L IAFARRITC  MSKG LSS  TSSNK P +    SGV+SSNGVSESETRVFVRRRVKK AEVQDSG +V+PNVDAKRCCPP+IEDFA
Subjt:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE----SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFA

Query:  FKRTKDSPGSRKSK-PPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVT
        FKRTK+SPGS KSK PPLDPL+TGIE S P RQKGIAE GKPP+NWEK+LEGIREMRSSE+APVDTMGCGQA STLPPKERRF+VLASSLLSSQTKDHVT
Subjt:  FKRTKDSPGSRKSK-PPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVT

Query:  HGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH-----------------------
        HGA  RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAH                       
Subjt:  HGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH-----------------------

Query:  ----------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL
                                                LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK L
Subjt:  ----------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL

A0A6J1CVP5 Endonuclease III homolog (e value: 1.1e-144; % identity: 73.51)
Query:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKR
        M  TC I TP L IAFARRITC  MSKG LSS  TSSNK P + SGV+SSNGVSESETRVFVRRRVKK AEVQDSG +V+PNVDAKRCCPP+IEDFAFKR
Subjt:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSRKSK-PPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGA
        TK+SPGS KSK PPLDPL+TGIE S P RQKGIAE GKPP+NWEK+LEGIREMRSSE+APVDTMGCGQA STLPPKERRF+VLASSLLSSQTKDHVTHGA
Subjt:  TKDSPGSRKSK-PPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGA

Query:  VLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH--------------------------
          RLQE+GLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICL+KYGGDIPRSLEELLLLPGIGPKIAH                          
Subjt:  VLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH--------------------------

Query:  -------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL
                                             LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK L
Subjt:  -------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL

A0A6J1GW15 Endonuclease III homolog (e value: 2.0e-138; % identity: 71.47)
Query:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKR
        M   CPIR  ALSI FARRITC GMSK   SS   +SN+ PP  SGV+SSNGVS+SETRVFVRR VKKKAE Q SGL+++   D KRCCPP+IEDFAFKR
Subjt:  MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPE-SGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAV
        TKDSPGSRKSKPPLD LL  IEDS PTRQKG+AEGGK P++WEKVLEGIREMRSSE APVDTMGCG+AGSTLPPKERRFAVLASSLLSSQTKDHVTHGA 
Subjt:  TKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAV

Query:  LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH---------------------------
        LRLQESGLLTADAMDKADE+TIKSLIYPVGFYSTKA+NLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH                           
Subjt:  LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH---------------------------

Query:  ------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK
                                            LVGFGQTICTPLRPKCGNCSVS LCPSAFKE+SSPSPKLKRSSSTK
Subjt:  ------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK

SwissProt top hits (e value, % identity, alignment)
A7M7B9 Endonuclease III-like protein 1 (e value: 2.0e-34; % identity: 37.39)
Query:  SYPTRQKGIAEGGK-PPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGST-LPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEAT
        +Y    K  + G K  P NW++ LE IREMR   +APVD MG  +   T  PP+  R+ VL S +LSSQTKD VT  A+LRL++ G LT D++ + D+AT
Subjt:  SYPTRQKGIAEGGK-PPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGST-LPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEAT

Query:  IKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH------------------------------------------------
        +  +IYPVGF+  K + +K+   I   KYGGDIP ++EEL+ LPG+GPK+AH                                                
Subjt:  IKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH------------------------------------------------

Query:  ----------LVGFGQTICTPLRPKCGNCSVSDLCPSA
                  LVGFGQ  C P+ P+C  C   D+CP+A
Subjt:  ----------LVGFGQTICTPLRPKCGNCSVSDLCPSA

B9DFZ0 Endonuclease III homolog 2, chloroplastic (e value: 2.6e-71; % identity: 43.75)
Query:  TCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKR
        T PI    L+    R      +S     S S  S  +  +S  ++++G SESETRV +R++  K+ +++     S  E     D   C  P+IED  +K+
Subjt:  TCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAV
        T  +  SR  K  L+  +   E S        A  G PP NWEKVLEGIR+M+ SEEAPV+ + C + GS LPPKERRF VL  +LLSSQTK+H+T  AV
Subjt:  TKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAV

Query:  LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH---------------------------
         RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAH                           
Subjt:  LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH---------------------------

Query:  ------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL
                                            LVGFGQTICTPLRP CG CS++++CPSAFKE+ S S KLK+S  +K L
Subjt:  ------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL

O35980 Endonuclease III-like protein 1 (e value: 2.9e-30; % identity: 36.49)
Query:  PMNWEKVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAR
        P NW++ L  IR MRS ++APVD +G      ++  PK RR+ VL S +LSSQTKD VT GA+ RL+  G LT +++ + D+ T+  LIYPVGF+  K +
Subjt:  PMNWEKVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKAR

Query:  NLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH----------------------------------------------------------LVGFGQ
         +K+   I   +Y GDIP S+ EL+ LPG+GPK+AH                                                          LVGFGQ
Subjt:  NLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH----------------------------------------------------------LVGFGQ

Query:  TICTPLRPKCGNCSVSDLCPSA
         IC P+ P+C  C    LCP+A
Subjt:  TICTPLRPKCGNCSVSDLCPSA

P78549 Endonuclease III-like protein 1 (e value: 5.3e-32; % identity: 36.75)
Query:  EGGKP-------PMNWEKVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSL
        EG +P       P +W++ L  IR MR+ ++APVD +G      S+ PPK RR+ VL S +LSSQTKD VT GA+ RL+  G LT D++ + D+AT+  L
Subjt:  EGGKP-------PMNWEKVLEGIREMRSSEEAPVDTMGCGQA-GSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEATIKSL

Query:  IYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH----------------------------------------------------
        IYPVGF+ +K + +K+ + I    YGGDIP S+ EL+ LPG+GPK+AH                                                    
Subjt:  IYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH----------------------------------------------------

Query:  ------LVGFGQTICTPLRPKCGNCSVSDLCPSA
              LVGFGQ  C P+ P+C  C    LCP+A
Subjt:  ------LVGFGQTICTPLRPKCGNCSVSDLCPSA

Q9SIC4 Endonuclease III homolog 1, chloroplastic (e value: 9.3e-69; % identity: 44.77)
Query:  FARRITCNGMSKGCLSSFSTSSNKVP---PESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRK
        F R  T +    G +SS    S K      +S  + + G S SETRV+ R++  K+   +     SG  V+ +   K C  P+IEDFA+K+T  SP S +
Subjt:  FARRITCNGMSKGCLSSFSTSSNKVP---PESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRK

Query:  SKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLL
        S           E S       +   G PP NW +VLEGIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V + A+ RL ++GLL
Subjt:  SKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLL

Query:  TADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH------------------------------------
        T +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AH                                    
Subjt:  TADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH------------------------------------

Query:  ---------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK
                                   LVGFGQ ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK+S+ +K
Subjt:  ---------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK

Arabidopsis top hits (e value, % identity, alignment)
AT1G05900.1 endonuclease III 2 (e value: 1.5e-61; % identity: 49.82)
Query:  TCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKR
        T PI    L+    R      +S     S S  S  +  +S  ++++G SESETRV +R++  K+ +++     S  E     D   C  P+IED  +K+
Subjt:  TCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAV
        T  +  SR  K  L+  +   E S        A  G PP NWEKVLEGIR+M+ SEEAPV+ + C + GS LPPKERRF VL  +LLSSQTK+H+T  AV
Subjt:  TKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAV

Query:  LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLV
         RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAHLV
Subjt:  LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLV

AT1G05900.2 endonuclease III 2 (e value: 1.9e-72; % identity: 43.75)
Query:  TCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKR
        T PI    L+    R      +S     S S  S  +  +S  ++++G SESETRV +R++  K+ +++     S  E     D   C  P+IED  +K+
Subjt:  TCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKR

Query:  TKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAV
        T  +  SR  K  L+  +   E S        A  G PP NWEKVLEGIR+M+ SEEAPV+ + C + GS LPPKERRF VL  +LLSSQTK+H+T  AV
Subjt:  TKDSPGSRKSKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAV

Query:  LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH---------------------------
         RL ++GLLT +A+DKADE+TIK LIYPVGFY+ KA N+KK+AKICLM+Y GDIPR+LEELL LPG+GPKIAH                           
Subjt:  LRLQESGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH---------------------------

Query:  ------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL
                                            LVGFGQTICTPLRP CG CS++++CPSAFKE+ S S KLK+S  +K L
Subjt:  ------------------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL

AT2G31450.1 DNA glycosylase superfamily protein (e value: 6.6e-70; % identity: 44.77)
Query:  FARRITCNGMSKGCLSSFSTSSNKVP---PESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRK
        F R  T +    G +SS    S K      +S  + + G S SETRV+ R++  K+   +     SG  V+ +   K C  P+IEDFA+K+T  SP S +
Subjt:  FARRITCNGMSKGCLSSFSTSSNKVP---PESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRK

Query:  SKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLL
        S           E S       +   G PP NW +VLEGIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V + A+ RL ++GLL
Subjt:  SKPPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLL

Query:  TADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH------------------------------------
        T +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AH                                    
Subjt:  TADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH------------------------------------

Query:  ---------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK
                                   LVGFGQ ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK+S+ +K
Subjt:  ---------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK

AT2G31450.2 DNA glycosylase superfamily protein (e value: 6.0e-71; % identity: 45.55)
Query:  FARRITCNGMSKGCLSSFSTSSNKVP-PESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSK
        F R  T +    G +SS    S K   P S   S+ G S SETRV+ R++  K+   +     SG  V+ +   K C  P+IEDFA+K+T  SP S +S 
Subjt:  FARRITCNGMSKGCLSSFSTSSNKVP-PESGVKSSNGVSESETRVFVRRRVKKKAEVQD----SGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSK

Query:  PPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTA
                  E S       +   G PP NW +VLEGIR+MRSSE+APVD+MGC +AGS LPP ERRFAVL  +LLSSQTKD V + A+ RL ++GLLT 
Subjt:  PPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTA

Query:  DAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH--------------------------------------
        +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+KY GDIP SL++LL LPGIGPK+AH                                      
Subjt:  DAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAH--------------------------------------

Query:  -------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK
                                 LVGFGQ ICTP+RP+C  CSVS LCP+AFKE+SSPS KLK+S+ +K
Subjt:  -------------------------LVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTK


Sequences
CDS sequence
ATGCTTTTTACTTGTCCTATTAGAACTCCGGCGCTTTCGATTGCATTTGCGCGAAGAATTACATGCAACGGCATGTCGAAAGGATGTTTATCTTCCTTCTCAACTAGCTC
AAACAAAGTCCCTCCAGAATCAGGTGTCAAGTCTTCCAATGGCGTTTCTGAGTCTGAAACTCGTGTATTCGTGAGGAGAAGAGTGAAAAAGAAGGCAGAAGTTCAAGATA
GCGGGCTTGAAGTTGACCCTAATGTCGACGCTAAACGCTGCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGGAAGTCAAAG
CCTCCACTAGATCCTCTTCTCACAGGAATTGAAGATTCCTATCCAACTAGACAAAAAGGCATTGCAGAAGGAGGTAAACCACCCATGAATTGGGAAAAAGTCCTTGAAGG
AATTCGTGAAATGAGATCCTCTGAAGAAGCTCCAGTAGATACCATGGGATGTGGGCAAGCTGGGAGTACTCTTCCTCCCAAGGAAAGAAGATTTGCTGTCTTGGCATCTT
CTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGAGCGGTATTGCGTCTCCAGGAAAGTGGTCTTCTTACTGCTGATGCCATGGACAAAGCTGATGAAGCAACC
ATTAAAAGCTTGATTTACCCGGTTGGATTTTATTCTACAAAGGCTAGGAATTTGAAGAAGATTGCAAAAATATGTCTTATGAAGTATGGTGGGGACATACCTAGATCATT
GGAAGAGCTACTTCTACTACCTGGGATAGGTCCTAAGATTGCACATTTGGTCGGATTTGGACAGACTATTTGCACTCCTCTTAGACCCAAATGTGGAAATTGCAGTGTTA
GTGACTTGTGCCCATCTGCATTCAAGGAATCATCAAGCCCATCTCCCAAATTAAAGCGTTCAAGTTCCACCAAAAACCTATGA
mRNA sequence
ATGCTTTTTACTTGTCCTATTAGAACTCCGGCGCTTTCGATTGCATTTGCGCGAAGAATTACATGCAACGGCATGTCGAAAGGATGTTTATCTTCCTTCTCAACTAGCTC
AAACAAAGTCCCTCCAGAATCAGGTGTCAAGTCTTCCAATGGCGTTTCTGAGTCTGAAACTCGTGTATTCGTGAGGAGAAGAGTGAAAAAGAAGGCAGAAGTTCAAGATA
GCGGGCTTGAAGTTGACCCTAATGTCGACGCTAAACGCTGCTGTCCTCCTAATATTGAAGATTTTGCATTCAAAAGAACAAAGGATTCCCCTGGATCAAGGAAGTCAAAG
CCTCCACTAGATCCTCTTCTCACAGGAATTGAAGATTCCTATCCAACTAGACAAAAAGGCATTGCAGAAGGAGGTAAACCACCCATGAATTGGGAAAAAGTCCTTGAAGG
AATTCGTGAAATGAGATCCTCTGAAGAAGCTCCAGTAGATACCATGGGATGTGGGCAAGCTGGGAGTACTCTTCCTCCCAAGGAAAGAAGATTTGCTGTCTTGGCATCTT
CTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGAGCGGTATTGCGTCTCCAGGAAAGTGGTCTTCTTACTGCTGATGCCATGGACAAAGCTGATGAAGCAACC
ATTAAAAGCTTGATTTACCCGGTTGGATTTTATTCTACAAAGGCTAGGAATTTGAAGAAGATTGCAAAAATATGTCTTATGAAGTATGGTGGGGACATACCTAGATCATT
GGAAGAGCTACTTCTACTACCTGGGATAGGTCCTAAGATTGCACATTTGGTCGGATTTGGACAGACTATTTGCACTCCTCTTAGACCCAAATGTGGAAATTGCAGTGTTA
GTGACTTGTGCCCATCTGCATTCAAGGAATCATCAAGCCCATCTCCCAAATTAAAGCGTTCAAGTTCCACCAAAAACCTATGA
Protein sequence
MLFTCPIRTPALSIAFARRITCNGMSKGCLSSFSTSSNKVPPESGVKSSNGVSESETRVFVRRRVKKKAEVQDSGLEVDPNVDAKRCCPPNIEDFAFKRTKDSPGSRKSK
PPLDPLLTGIEDSYPTRQKGIAEGGKPPMNWEKVLEGIREMRSSEEAPVDTMGCGQAGSTLPPKERRFAVLASSLLSSQTKDHVTHGAVLRLQESGLLTADAMDKADEAT
IKSLIYPVGFYSTKARNLKKIAKICLMKYGGDIPRSLEELLLLPGIGPKIAHLVGFGQTICTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKNL
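
The protein sequence above should correspond to a translation of the CDS sequence. As a minimal sketch for checking that relationship, the snippet below translates a nucleotide string codon by codon. It assumes the standard nuclear genetic code, which this page does not state explicitly; the function name `translate` is illustrative, and the example input is only the first 24 bases of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# for each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in directly. Codons that are not in
    the table (for example, ones containing N) become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First eight codons of the CDS sequence shown on this page.
    print(translate("ATGCTTTTTACTTGTCCTATTAGA"))  # prints MLFTCPIR
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence, after joining its wrapped lines, is one way to confirm that the two records agree.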