; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019331 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019331
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionabasic site processing protein YoqW isoform X2
Genome locationtig00153343:1914999..1930480
RNA-Seq ExpressionSgr019331
SyntenySgr019331
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0006974 - cellular response to DNA damage stimulus (biological process)
GO:0018142 - protein-DNA covalent cross-linking (biological process)
GO:0003697 - single-stranded DNA binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR003738 - SOS response associated peptidase (SRAP)
IPR036590 - SOS response associated peptidase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593042.1 Abasic site processing protein HMCES, partial [Cucurbita argyrosperma subsp. sororia]3.2e-16078.28Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS
        MCGRARCTLR DDI RACHRTGGP+R+LNMDR                        FRPL+NASPGSDLPVVRRDD+S GGGVVLQCMKWGLIPSFTGKS
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS

Query:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
        EKPN+FKMFNARSESM EKASFRRLVPK RCLVAVEGFYEWKKDGSRKQPYYIHFKDG+PLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
Subjt:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP

Query:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL
        VILGDKER+DMWLNDSSSSKYD VL+ YEAPDLVWYPVTP+MGK SFDGPDCIKEIQLKTDGNNLISKFFS+KET+KE SDSQEKTSCNTSVKPEPSQ+L
Subjt:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL

Query:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKRD-------MNTSIGDYGKIGSSPKKRKKESLKT
        EEHKRDE H ASS SI   +S+D+L KC S +A+ C+ KRD           + D  KI SS K RKK SLKT
Subjt:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKRD-------MNTSIGDYGKIGSSPKKRKKESLKT

XP_011659220.1 uncharacterized protein LOC101206083 isoform X1 [Cucumis sativus]9.2e-15275.07Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS
        MCGRARCTLRADDI RACHRTGGPVR+LNMDR                        FRPL+NASPGSDLPVVRRDD+S  GGVVLQCMKWGLIPSFT K 
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS

Query:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
        EKPN+FKMFNARSES+ EKASF RLVPK RCLVAVEGFYEWKKDGS+KQPYYIHFKDG+PL  AALYD WEN EGELLYTFTILTTSSSPAL+WLHDRMP
Subjt:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP

Query:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL
        VILGDKERMDMWLNDSSSSKYD VL+ YEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK DG+NLISKFFS+KET+KE S SQEKT  NTSVKPE S SL
Subjt:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL

Query:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKR-------DMNTSIGDYGKIGSSPKKRKKESLKT
        EEHKR+   GASS     EESKD L KCSS+++   Q+KR       D+ + + DY K+GSSPK RKK +LKT
Subjt:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKR-------DMNTSIGDYGKIGSSPKKRKKESLKT

XP_011659221.1 uncharacterized protein LOC101206083 isoform X2 [Cucumis sativus]3.3e-14974.53Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS
        MCGRARCTLRADDI RACHRTGGPVR+LNMDR                        FRPL+NASPGSDLPVVRRDD+S  GGVVLQCMKWGLIPSFT K 
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS

Query:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
        EKPN+FKMFNARSES+ EKASF RLVPK RCLVAVEGFYEWKKDGS+KQPYYIHFKDG+PL  AALYD WEN EGELLYTFTILTTSSSPAL+WLHDRMP
Subjt:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP

Query:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL
        VILGDKERMDMWLNDSSSSKYD VL+ YEAPDLVWYPVTPSMGKPSFDGPDCIKE  LK DG+NLISKFFS+KET+KE S SQEKT  NTSVKPE S SL
Subjt:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL

Query:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKR-------DMNTSIGDYGKIGSSPKKRKKESLKT
        EEHKR+   GASS     EESKD L KCSS+++   Q+KR       D+ + + DY K+GSSPK RKK +LKT
Subjt:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKR-------DMNTSIGDYGKIGSSPKKRKKESLKT

XP_038896829.1 abasic site processing protein YoqW isoform X1 [Benincasa hispida]3.6e-15677.15Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS
        MCGRARCTLRADDIPRACHRTGG VRTLNMDR                        FRPL+NASPGSDLPVVRRDD+SG GGVVLQCMKWGLIPSFT K 
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS

Query:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
        EKPN+FKMFNARSES+ EKASFRRLVPK RCLVAVEGFYEWKKDGS+KQPYYIHFKDGRPLV AALYD WENPEGELLYTFTILTTS+SPAL WLHDRMP
Subjt:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP

Query:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL
        VILGDKERMDMWLNDSSSSKYD VL+ YEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK DG+NLISKFF +KE +KE SDSQEKTSCNT VKPE S SL
Subjt:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL

Query:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKRD-------MNTSIGDYGKIGSSPKKRKKESLK
        EEHK D    ASS     EESKD L KCSSE+A  CQ+KRD         + + DY K+GSSPKKRKK +LK
Subjt:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKRD-------MNTSIGDYGKIGSSPKKRKKESLK

XP_038896830.1 abasic site processing protein HMCES isoform X2 [Benincasa hispida]1.7e-15376.61Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS
        MCGRARCTLRADDIPRACHRTGG VRTLNMDR                        FRPL+NASPGSDLPVVRRDD+SG GGVVLQCMKWGLIPSFT K 
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS

Query:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
        EKPN+FKMFNARSES+ EKASFRRLVPK RCLVAVEGFYEWKKDGS+KQPYYIHFKDGRPLV AALYD WENPEGELLYTFTILTTS+SPAL WLHDRMP
Subjt:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP

Query:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL
        VILGDKERMDMWLNDSSSSKYD VL+ YEAPDLVWYPVTPSMGKPSFDGPDCIKE  LK DG+NLISKFF +KE +KE SDSQEKTSCNT VKPE S SL
Subjt:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL

Query:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKRD-------MNTSIGDYGKIGSSPKKRKKESLK
        EEHK D    ASS     EESKD L KCSSE+A  CQ+KRD         + + DY K+GSSPKKRKK +LK
Subjt:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKRD-------MNTSIGDYGKIGSSPKKRKKESLK

TrEMBL top hitse value%identityAlignment
A0A0A0K6X8 Uncharacterized protein4.5e-15275.07Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS
        MCGRARCTLRADDI RACHRTGGPVR+LNMDR                        FRPL+NASPGSDLPVVRRDD+S  GGVVLQCMKWGLIPSFT K 
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS

Query:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
        EKPN+FKMFNARSES+ EKASF RLVPK RCLVAVEGFYEWKKDGS+KQPYYIHFKDG+PL  AALYD WEN EGELLYTFTILTTSSSPAL+WLHDRMP
Subjt:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP

Query:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL
        VILGDKERMDMWLNDSSSSKYD VL+ YEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK DG+NLISKFFS+KET+KE S SQEKT  NTSVKPE S SL
Subjt:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSL

Query:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKR-------DMNTSIGDYGKIGSSPKKRKKESLKT
        EEHKR+   GASS     EESKD L KCSS+++   Q+KR       D+ + + DY K+GSSPK RKK +LKT
Subjt:  EEHKRDEIHGASSYSIGAEESKDSLVKCSSESAAACQMKR-------DMNTSIGDYGKIGSSPKKRKKESLKT

A0A1S3C6L7 putative SOS response-associated peptidase YobE isoform X12.5e-12680.14Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS
        MCGRARCTLRADDI RACHRTGGPVR+LNMDR                        FRPL+NASPGSDLPVVRRDD+S  GGVVLQCMKWGLIPSFT K 
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS

Query:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
        EKPN+FKMFNARSES+ EK SF RLVPK RCLVAVEGFYEWKKDGS+KQPYYIHFKDG+PL  AALYD WEN EGELLYTFTILTTS SPAL+WLHDRMP
Subjt:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP

Query:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRK
        VILGDKERMDMWL+DSSSSKYD V + YEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK DG+NLISKFFS+KET+K
Subjt:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRK

A0A6J1DZF4 LOW QUALITY PROTEIN: uncharacterized protein LOC1110244882.6e-12881.52Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS
        MCGRARCTL A D+PRACHR  GP+RTLN+ R                        FRPLYNASPGSDLPVVRRDD+SGGGGVVLQCMKWGLIPSFT KS
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS

Query:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
        EKPN++KMFNARSES+ EKASFRRLVPKSRCLVAVEGFYEWKKDGS+KQPYY+HFKDGRPLVFAALYD WENPEGELLYTFTILTTSSSPALEWLHDRMP
Subjt:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP

Query:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETR
        VI GDKERMDMWLND SSSK+D VL+ YEAPDLVWYPVTPSMGK SFDGPDCIKEIQLKT+GNNLISKFFS+KET+
Subjt:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETR

A0A6J1H9B1 LOW QUALITY PROTEIN: uncharacterized protein LOC1114612586.0e-13381.75Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS
        MCGRARCTLR DDI RACHRTGGP+R+LNMDR                        FRPL+NASPGSDLPVVRRDD+S GGGVVLQCMKWGLIPSFTGKS
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS

Query:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
        EKPN+FKMFNARSESM EKASFRRLVPK RCLVAVEGFYEWKKDGS+KQPYYIHFKDG+PLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
Subjt:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP

Query:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEK
        VILGDKER+DMWLNDSSSSKYD VL+ YEAPDLVWYPVTP+MGK SFDGPDCIKEIQLKTDGNNLISKFFS+KET K    + ++
Subjt:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEK

A0A6J1KPK2 LOW QUALITY PROTEIN: uncharacterized protein LOC1114975574.3e-13180.7Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS
        MCGRARCTLR DDI RACHRTGGP+R+LNMDR                        FRPL+NASPGSDLPVVRRDD+S GGGVVLQCMKWGLIPSFTGKS
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKS

Query:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP
        EKPN+FKMFNARSES+ EKASFRRLVPK RCLVAVEGFYEWKKDGS+KQPYYIHFKDG+PLVFAALYDSWENPEGE LYTFTILTTSSSPALEWLHDRMP
Subjt:  EKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMP

Query:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEK
        VILGDKERMDMWLNDSSSSKYD VL+ YEAPDLVWYPVTP+MGK SFDGPDCIKEIQ K+DGNNLISKFFS+KET K    + ++
Subjt:  VILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEK

SwissProt top hitse value%identityAlignment
O31916 Abasic site processing protein YoqW9.9e-3237.31Show/hide
Query:  FRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKSEKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKK-DGSRKQPYYIHF
        + P YN +P  ++  +  D    G    L  ++WGLIP +  K EK   +KM NAR+E++ EK SFR+ +   RC++  + FYEWK+ D   K P  I  
Subjt:  FRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKSEKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKK-DGSRKQPYYIHF

Query:  KDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMPVILGDKERMDMWLNDSSSSK--YDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCI
        K      FA LY+ W  PEG  LYT TI+TT  +  +E +HDRMPVIL D+   + WLN  ++       +L+ Y+A D+  Y V+  +  P  + P+ I
Subjt:  KDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMPVILGDKERMDMWLNDSSSSK--YDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCI

Query:  K
        +
Subjt:  K

O64131 SOS response-associated protein yoqW9.9e-3237.31Show/hide
Query:  FRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKSEKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKK-DGSRKQPYYIHF
        + P YN +P  ++  +  D    G    L  ++WGLIP +  K EK   +KM NAR+E++ EK SFR+ +   RC++  + FYEWK+ D   K P  I  
Subjt:  FRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKSEKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKK-DGSRKQPYYIHF

Query:  KDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMPVILGDKERMDMWLNDSSSSK--YDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCI
        K      FA LY+ W  PEG  LYT TI+TT  +  +E +HDRMPVIL D+   + WLN  ++       +L+ Y+A D+  Y V+  +  P  + P+ I
Subjt:  KDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMPVILGDKERMDMWLNDSSSSK--YDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCI

Query:  K
        +
Subjt:  K

Q5ZJT1 Abasic site processing protein HMCES3.4e-3233.22Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPV------VRRDDDSGGGGVVLQCMKWGLIP
        MCGR  C+L A  + RAC            DR        +LR        GR   +RP YN  P S  PV      V++D DS     VL  M+WGL+P
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPV------VRRDDDSGGGGVVLQCMKWGLIP

Query:  SFTGKSEKPN--HFKMFNARSESMCEKASFR-RLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHF------------------KDGRPLVFAALYDSWENP
        S+  K + P+   FK  N RS++M  K+S++  L+   RC+V  +GFYEW++ G  KQPY+I+F                  +  R L  A ++D WE P
Subjt:  SFTGKSEKPN--HFKMFNARSESMCEKASFR-RLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHF------------------KDGRPLVFAALYDSWENP

Query:  E-GELLYTFTILTTSSSPALEWLHDRMPVILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQL
        + GE LYT+TI+T  +S  + ++H RMP IL   E ++ WL+ +     + +     A ++ ++PV+  +     D P+C+  I+L
Subjt:  E-GELLYTFTILTTSSSPALEWLHDRMPVILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQL

Q6IND6 Abasic site processing protein HMCES2.0e-3231.11Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVV------RRDDDSGGGGVVLQCMKWGLIP
        MCGR  CTL  DD+ +AC       R       D                 G    ++P YN SP S+ PV+      ++D DS     VL  M+WGLIP
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVV------RRDDDSGGGGVVLQCMKWGLIP

Query:  S-FTGKSEKPNHFKMFNARSESMCEKASFRR-LVPKSRCLVAVEGFYEWKKDGSRKQPYYIHF-----------------KDGRPLVFAALYDSWENPE-
        S F         +K  N RS+++ EKA ++  L    RC+V  +GFYEWK+    KQPYYI+F                    R L  A L+D WE P  
Subjt:  S-FTGKSEKPNHFKMFNARSESMCEKASFRR-LVPKSRCLVAVEGFYEWKKDGSRKQPYYIHF-----------------KDGRPLVFAALYDSWENPE-

Query:  GELLYTFTILTTSSSPALEWLHDRMPVILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLIS-------
        GE LY++T++T  SS  +  +HDRMP IL   E +  WL+    S  D +   +   ++ ++PV+  +     +  +CI  + L       +S       
Subjt:  GELLYTFTILTTSSSPALEWLHDRMPVILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLIS-------

Query:  KFFSSKETRKETSDS
        ++  +K  +KE S S
Subjt:  KFFSSKETRKETSDS

Q6P7N4 Abasic site processing protein HMCES1.6e-3431.76Show/hide
Query:  MCGRARCTLRADDIPRAC---HRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVV------RRDDDSGGGGVVLQCMKWG
        MCGR  CTL  DD+ +AC    + GG       D                    G    ++P YN SP S+ PV+      ++D DS     VL  M+WG
Subjt:  MCGRARCTLRADDIPRAC---HRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVV------RRDDDSGGGGVVLQCMKWG

Query:  LIPS-FTGKSEKPNHFKMFNARSESMCEKASFR-RLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHF-----------------KDGRPLVFAALYDSWEN
        LIPS F         +K  N RS++M EKA ++  L    RC+V  +GFYEW++  S KQPYYI+F                    R L  A L+D WE 
Subjt:  LIPS-FTGKSEKPNHFKMFNARSESMCEKASFR-RLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHF-----------------KDGRPLVFAALYDSWEN

Query:  PE-GELLYTFTILTTSSSPALEWLHDRMPVILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLIS----
        P  GE LY++T++T  SS  + W+HDRMP IL   E +  WL+       D +   +   ++ ++PV+  +     + P+C+  I L       +S    
Subjt:  PE-GELLYTFTILTTSSSPALEWLHDRMPVILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLIS----

Query:  ---KFFSSKETRKETSDS
            +  +K  +KE S S
Subjt:  ---KFFSSKETRKETSDS

Arabidopsis top hitse value%identityAlignment
AT2G26470.1 unknown protein1.1e-10259.8Show/hide
Query:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDS-GGGGVVLQCMKWGLIPSFTGK
        MCGR RCTLR DD+PRA HR   P R L++DR                        +RP YN +PGS +PV+RRD++   G GVV+ CMKWGL+PSFT K
Subjt:  MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDS-GGGGVVLQCMKWGLIPSFTGK

Query:  SEKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRM
        ++KP+ FKMFNARSES+ EKASFRRL+PK+RCLVAV+GFYEWKK+GS+KQPYYIHF+DGRPLVFAAL+D+W+N  GE LYTFTILTT+SS AL+WLHDRM
Subjt:  SEKPNHFKMFNARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRM

Query:  PVILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEK-TSCN--TSVKPEP
        PVILGDK+ +D WL+D S++K   +L  YE  DLVWYPVT ++GKP+FDGP+CI++I LKT  N+LISKFFS+K+ + +  D + K T  N    +K EP
Subjt:  PVILGDKERMDMWLNDSSSSKYDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEK-TSCN--TSVKPEP

Query:  S
        +
Subjt:  S


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCGGGAGGGCCCGTTGTACTCTTAGAGCCGATGACATCCCCAGGGCCTGCCACCGCACCGGCGGCCCCGTCCGTACCCTCAACATGGACCGGTTAGACGACCCAGT
TCTTTCTTTTTTCCTTCGTTCTCTTTCTATTCCATGTCTACCGGGAAGGTTCAGAACTTTTCGTCCGCTGTACAATGCGTCACCGGGTTCGGATTTGCCGGTTGTTCGTC
GAGATGATGATTCAGGTGGCGGAGGAGTCGTCCTCCAGTGCATGAAATGGGGGCTCATTCCTAGTTTTACTGGGAAATCCGAGAAGCCTAATCACTTCAAGATGTTCAAT
GCTCGCTCAGAGTCCATGTGTGAAAAGGCCTCTTTTCGCCGTCTAGTTCCTAAAAGCAGATGCCTTGTGGCAGTGGAAGGCTTCTATGAGTGGAAAAAGGATGGATCAAG
AAAGCAGCCGTATTATATTCATTTTAAGGATGGGCGGCCACTTGTTTTTGCTGCTTTATATGATTCTTGGGAAAATCCTGAAGGTGAATTACTTTACACTTTTACTATTC
TTACAACTTCATCATCTCCAGCTTTAGAGTGGTTGCACGATAGGATGCCTGTTATTTTGGGTGACAAAGAACGGATGGATATGTGGTTGAATGATTCTTCATCTTCCAAG
TATGATATCGTCCTTGAACAATATGAGGCTCCTGATCTGGTGTGGTACCCTGTAACTCCTTCCATGGGCAAGCCATCATTTGACGGGCCGGACTGCATCAAGGAGATACA
GTTAAAGACTGATGGAAATAACCTCATCTCCAAATTTTTCTCTTCGAAGGAAACTAGAAAGGAAACTTCAGACTCACAAGAGAAAACTTCCTGTAACACATCTGTGAAGC
CCGAGCCATCGCAAAGTCTGGAAGAACACAAAAGAGATGAAATTCATGGAGCTTCGTCCTACTCCATAGGAGCTGAAGAATCAAAGGATAGTCTTGTTAAGTGTTCTTCT
GAGAGTGCAGCAGCATGCCAAATGAAACGGGATATGAACACCAGCATCGGTGACTACGGTAAGATAGGCAGTAGTCCAAAGAAAAGGAAGAAGGAAAGCCTGAAGACTGT
CGCTGTAGTTTGTTTATCTTGCTGTGTTAGCTGCTGCTGTTACCTTATTGTCGGCTTAAGCATCCGAAATGGACCTACCACCCCCCATTGCACCACCGCCCGCGACCCGA
GACAGAACCCCCGCCGCCTATTCCCCAGCCGTATTGTGCTGCCCCGCCGCCGTCTCGACCTCCGTCGTGGGCGGCACCAACCCACTGACACGGCGGAGTTCAAGCACGTG
GGTATGGCCATGGCGTCTCTCTTCAACATAGCCCTAAAGGGTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGCGGGAGGGCCCGTTGTACTCTTAGAGCCGATGACATCCCCAGGGCCTGCCACCGCACCGGCGGCCCCGTCCGTACCCTCAACATGGACCGGTTAGACGACCCAGT
TCTTTCTTTTTTCCTTCGTTCTCTTTCTATTCCATGTCTACCGGGAAGGTTCAGAACTTTTCGTCCGCTGTACAATGCGTCACCGGGTTCGGATTTGCCGGTTGTTCGTC
GAGATGATGATTCAGGTGGCGGAGGAGTCGTCCTCCAGTGCATGAAATGGGGGCTCATTCCTAGTTTTACTGGGAAATCCGAGAAGCCTAATCACTTCAAGATGTTCAAT
GCTCGCTCAGAGTCCATGTGTGAAAAGGCCTCTTTTCGCCGTCTAGTTCCTAAAAGCAGATGCCTTGTGGCAGTGGAAGGCTTCTATGAGTGGAAAAAGGATGGATCAAG
AAAGCAGCCGTATTATATTCATTTTAAGGATGGGCGGCCACTTGTTTTTGCTGCTTTATATGATTCTTGGGAAAATCCTGAAGGTGAATTACTTTACACTTTTACTATTC
TTACAACTTCATCATCTCCAGCTTTAGAGTGGTTGCACGATAGGATGCCTGTTATTTTGGGTGACAAAGAACGGATGGATATGTGGTTGAATGATTCTTCATCTTCCAAG
TATGATATCGTCCTTGAACAATATGAGGCTCCTGATCTGGTGTGGTACCCTGTAACTCCTTCCATGGGCAAGCCATCATTTGACGGGCCGGACTGCATCAAGGAGATACA
GTTAAAGACTGATGGAAATAACCTCATCTCCAAATTTTTCTCTTCGAAGGAAACTAGAAAGGAAACTTCAGACTCACAAGAGAAAACTTCCTGTAACACATCTGTGAAGC
CCGAGCCATCGCAAAGTCTGGAAGAACACAAAAGAGATGAAATTCATGGAGCTTCGTCCTACTCCATAGGAGCTGAAGAATCAAAGGATAGTCTTGTTAAGTGTTCTTCT
GAGAGTGCAGCAGCATGCCAAATGAAACGGGATATGAACACCAGCATCGGTGACTACGGTAAGATAGGCAGTAGTCCAAAGAAAAGGAAGAAGGAAAGCCTGAAGACTGT
CGCTGTAGTTTGTTTATCTTGCTGTGTTAGCTGCTGCTGTTACCTTATTGTCGGCTTAAGCATCCGAAATGGACCTACCACCCCCCATTGCACCACCGCCCGCGACCCGA
GACAGAACCCCCGCCGCCTATTCCCCAGCCGTATTGTGCTGCCCCGCCGCCGTCTCGACCTCCGTCGTGGGCGGCACCAACCCACTGACACGGCGGAGTTCAAGCACGTG
GGTATGGCCATGGCGTCTCTCTTCAACATAGCCCTAAAGGGTTCTTGA
Protein sequenceShow/hide protein sequence
MCGRARCTLRADDIPRACHRTGGPVRTLNMDRLDDPVLSFFLRSLSIPCLPGRFRTFRPLYNASPGSDLPVVRRDDDSGGGGVVLQCMKWGLIPSFTGKSEKPNHFKMFN
ARSESMCEKASFRRLVPKSRCLVAVEGFYEWKKDGSRKQPYYIHFKDGRPLVFAALYDSWENPEGELLYTFTILTTSSSPALEWLHDRMPVILGDKERMDMWLNDSSSSK
YDIVLEQYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTDGNNLISKFFSSKETRKETSDSQEKTSCNTSVKPEPSQSLEEHKRDEIHGASSYSIGAEESKDSLVKCSS
ESAAACQMKRDMNTSIGDYGKIGSSPKKRKKESLKTVAVVCLSCCVSCCCYLIVGLSIRNGPTTPHCTTARDPRQNPRRLFPSRIVLPRRRLDLRRGRHQPTDTAEFKHV
GMAMASLFNIALKGS