; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G008260 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G008260
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionabasic site processing protein YoqW isoform X4
Genome locationCG_Chr09:7662028..7670932
RNA-Seq ExpressionClCG09G008260
SyntenyClCG09G008260
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0006974 - cellular response to DNA damage stimulus (biological process)
GO:0018142 - protein-DNA covalent cross-linking (biological process)
GO:0003697 - single-stranded DNA binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR003738 - SOS response associated peptidase (SRAP)
IPR036590 - SOS response associated peptidase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593042.1 Abasic site processing protein HMCES, partial [Cucurbita argyrosperma subsp. sororia]7.2e-16883.24Show/hide
Query:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR
        MCGRARCTLR DDI+RACHRTG  +R+LNMDRFRPLFNASPGSDLPVVRRDDES GG VVLQCMKWGLIPSFT KSEKPNYFKMFNARSES+ EKASFRR
Subjt:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV
        LVPKRRCLVAVEGFYEWKKDGS+KQPYYIHFKDGQPLV AALYD WENP+GELLYTFTILTTSSSPALEWLHDRMPVILGDKE++D+WLNDSSSSKYD V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV

Query:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSV-----ESKDC
        LKPY APDLVWYPVTP+MGK SFDGPDCIKEIQLK DG+NLISKFFSAKE KKE SDSQEKTSC TSVK E S +LEEHKRD +  ASS      +S+D 
Subjt:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSV-----ESKDC

Query:  LAKCSSDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK
        LAKC S TA TC+ KRDREG SSES+ GV+D SK+ SS KIRKK SLKTG +N+STLFSYFGRK
Subjt:  LAKCSSDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK

XP_011659220.1 uncharacterized protein LOC101206083 isoform X1 [Cucumis sativus]3.9e-18289.42Show/hide
Query:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR
        MCGRARCTLRADDITRACHRTG  VR+LNMDRFRPLFNASPGSDLPVVRRDDES  G VVLQCMKWGLIPSFTEK EKPNYFKMFNARSESI EKASF R
Subjt:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPL LAALYDCWEN +GELLYTFTILTTSSSPAL+WLHDRMPVILGDKE+MD+WLNDSSSSKYD+V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV

Query:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSVESKDCLAKCS
        LKPY APDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKE KKE+S SQEKT   TSVK EASPSLEEHKR+VN  ASS ESKDCLAKCS
Subjt:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSVESKDCLAKCS

Query:  SDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK
        SDT+ T QIKRDRE ISS+ KSG+DDYSKVGSSPKIRKKG+LKTGNDNQ TLFSYFG+K
Subjt:  SDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK

XP_011659221.1 uncharacterized protein LOC101206083 isoform X2 [Cucumis sativus]1.4e-17988.86Show/hide
Query:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR
        MCGRARCTLRADDITRACHRTG  VR+LNMDRFRPLFNASPGSDLPVVRRDDES  G VVLQCMKWGLIPSFTEK EKPNYFKMFNARSESI EKASF R
Subjt:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPL LAALYDCWEN +GELLYTFTILTTSSSPAL+WLHDRMPVILGDKE+MD+WLNDSSSSKYD+V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV

Query:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSVESKDCLAKCS
        LKPY APDLVWYPVTPSMGKPSFDGPDCIKE  LKNDGSNLISKFFSAKE KKE+S SQEKT   TSVK EASPSLEEHKR+VN  ASS ESKDCLAKCS
Subjt:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSVESKDCLAKCS

Query:  SDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK
        SDT+ T QIKRDRE ISS+ KSG+DDYSKVGSSPKIRKKG+LKTGNDNQ TLFSYFG+K
Subjt:  SDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK

XP_038896829.1 abasic site processing protein YoqW isoform X1 [Benincasa hispida]3.9e-19092.76Show/hide
Query:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR
        MCGRARCTLRADDI RACHRTG RVRTLNMDRFRPLFNASPGSDLPVVRRDDESG G VVLQCMKWGLIPSFTEK EKPNYFKMFNARSESIREKASFRR
Subjt:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDG+PLVLAALYDCWENP+GELLYTFTILTTS+SPAL WLHDRMPVILGDKE+MD+WLNDSSSSKYDTV
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV

Query:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSVESKDCLAKCS
        LKPY APDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFF AKEIKKEHSDSQEKTSC T VK EASPSLEEHK DVNL ASS ESKDCLAKCS
Subjt:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSVESKDCLAKCS

Query:  SDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK
        S+TA TCQIKRDRE ISS SKSGVDDYSKVGSSPK RKKG+LK GNDNQSTLFSYFGRK
Subjt:  SDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK

XP_038896830.1 abasic site processing protein HMCES isoform X2 [Benincasa hispida]1.4e-18792.2Show/hide
Query:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR
        MCGRARCTLRADDI RACHRTG RVRTLNMDRFRPLFNASPGSDLPVVRRDDESG G VVLQCMKWGLIPSFTEK EKPNYFKMFNARSESIREKASFRR
Subjt:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDG+PLVLAALYDCWENP+GELLYTFTILTTS+SPAL WLHDRMPVILGDKE+MD+WLNDSSSSKYDTV
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV

Query:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSVESKDCLAKCS
        LKPY APDLVWYPVTPSMGKPSFDGPDCIKE  LKNDGSNLISKFF AKEIKKEHSDSQEKTSC T VK EASPSLEEHK DVNL ASS ESKDCLAKCS
Subjt:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSVESKDCLAKCS

Query:  SDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK
        S+TA TCQIKRDRE ISS SKSGVDDYSKVGSSPK RKKG+LK GNDNQSTLFSYFGRK
Subjt:  SDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK

TrEMBL top hitse value%identityAlignment
A0A0A0K6X8 Uncharacterized protein1.9e-18289.42Show/hide
Query:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR
        MCGRARCTLRADDITRACHRTG  VR+LNMDRFRPLFNASPGSDLPVVRRDDES  G VVLQCMKWGLIPSFTEK EKPNYFKMFNARSESI EKASF R
Subjt:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPL LAALYDCWEN +GELLYTFTILTTSSSPAL+WLHDRMPVILGDKE+MD+WLNDSSSSKYD+V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV

Query:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSVESKDCLAKCS
        LKPY APDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKE KKE+S SQEKT   TSVK EASPSLEEHKR+VN  ASS ESKDCLAKCS
Subjt:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSVESKDCLAKCS

Query:  SDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK
        SDT+ T QIKRDRE ISS+ KSG+DDYSKVGSSPKIRKKG+LKTGNDNQ TLFSYFG+K
Subjt:  SDTARTCQIKRDREGISSESKSGVDDYSKVGSSPKIRKKGSLKTGNDNQSTLFSYFGRK

A0A1S3C6L7 putative SOS response-associated peptidase YobE isoform X13.3e-13490.59Show/hide
Query:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR
        MCGRARCTLRADDITRACHRTG  VR+LNMDRFRPLFNASPGSDLPVVRRDDES  G VVLQCMKWGLIPSFTEK EKPNYFKMFNARSESI EK SF R
Subjt:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDG+PL LAALYDCWEN +GELLYTFTILTTS SPAL+WLHDRMPVILGDKE+MD+WL+DSSSSKYDTV
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV

Query:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEH
         KPY APDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKE KK +
Subjt:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEH

A0A1S3C774 embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein isoform X21.5e-13189.8Show/hide
Query:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR
        MCGRARCTLRADDITRACHRTG  VR+LNMDRFRPLFNASPGSDLPVVRRDDES  G VVLQCMKWGLIPSFTEK EKPNYFKMFNARSESI EK SF R
Subjt:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDG+PL LAALYDCWEN +GELLYTFTILTTS SPAL+WLHDRMPVILGDKE+MD+WL+DSSSSKYDTV
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV

Query:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEH
         KPY APDLVWYPVTPSMGKPSFDGPDCIKE  LKNDGSNLISKFFSAKE KK +
Subjt:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEH

A0A6J1H9B1 LOW QUALITY PROTEIN: uncharacterized protein LOC1114612581.8e-13287.36Show/hide
Query:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR
        MCGRARCTLR DDI+RACHRTG  +R+LNMDRFRPLFNASPGSDLPVVRRDDES GG VVLQCMKWGLIPSFT KSEKPNYFKMFNARSES+ EKASFRR
Subjt:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLV AALYD WENP+GELLYTFTILTTSSSPALEWLHDRMPVILGDKE++D+WLNDSSSSKYD V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV

Query:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEK
        LKPY APDLVWYPVTP+MGK SFDGPDCIKEIQLK DG+NLISKFFSAKE  K +  + ++
Subjt:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEK

A0A6J1KPK2 LOW QUALITY PROTEIN: uncharacterized protein LOC1114975575.3e-13287.36Show/hide
Query:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR
        MCGRARCTLR DDI+RACHRTG  +R+LNMDRFRPLFNASPGSDLPVVRRDDES GG VVLQCMKWGLIPSFT KSEKPNYFKMFNARSESI EKASFRR
Subjt:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLV AALYD WENP+GE LYTFTILTTSSSPALEWLHDRMPVILGDKE+MD+WLNDSSSSKYD V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTV

Query:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEK
        LKPY APDLVWYPVTP+MGK SFDGPDCIKEIQ K+DG+NLISKFFSAKE  K +  + ++
Subjt:  LKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEK

SwissProt top hitse value%identityAlignment
Q5XIJ1 Abasic site processing protein HMCES2.0e-3531.39Show/hide
Query:  MCGRARCTLRADDITRAC---HRTGAR--VRTLNMDRFRPLFNASPGSDLPV----VRRDDESGGGEVVLQCMKWGLIPS-FTEKSEKPNYFKMFNARSE
        MCGR  C L  D +TRAC    R G R   +  + D++ P +N SP S  PV    +  + ++   + ++  M+WGL+PS F E       F   N RS+
Subjt:  MCGRARCTLRADDITRAC---HRTGAR--VRTLNMDRFRPLFNASPGSDLPV----VRRDDESGGGEVVLQCMKWGLIPS-FTEKSEKPNYFKMFNARSE

Query:  SIREKASFRRLVPK-RRCLVAVEGFYEWKK--DGSKKQPYYIHF------KDGQP------------------LVLAALYDCWENPKGELLYTFTILTTS
        +I EK SF+  + K RRC+V  +GFYEW++    +++QPY+I+F      K G+                   L +A ++DCWE PKGE LY+++I+T  
Subjt:  SIREKASFRRLVPK-RRCLVAVEGFYEWKK--DGSKKQPYYIHF------KDGQP------------------LVLAALYDCWENPKGELLYTFTILTTS

Query:  SSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTVLKPYGAPDLVWYPVTPSMGKPSFDGPDC-------IKEIQLKNDGSNLISKFFSAKEIKKEHS
        S   L  +H RMP IL  +E +  WL+    S  + +   +   ++ ++PV+P +     + P+C       +K+    +  S  + ++ + K  KKE  
Subjt:  SSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTVLKPYGAPDLVWYPVTPSMGKPSFDGPDC-------IKEIQLKNDGSNLISKFFSAKEIKKEHS

Query:  DSQEKTSCG
        DS +K + G
Subjt:  DSQEKTSCG

Q5ZJT1 Abasic site processing protein HMCES4.0e-3634.96Show/hide
Query:  MCGRARCTLRADDITRAC---HRTGARVRT--LNMDRFRPLFNASPGSDLPV------VRRDDESGGGEVVLQCMKWGLIPS-FTEKSEKPNYFKMFNAR
        MCGR  C+L A  + RAC    R G R +   L   R+RP +N  P S  PV      V++D +S   E VL  M+WGL+PS F E       FK  N R
Subjt:  MCGRARCTLRADDITRAC---HRTGARVRT--LNMDRFRPLFNASPGSDLPV------VRRDDESGGGEVVLQCMKWGLIPS-FTEKSEKPNYFKMFNAR

Query:  SESIREKASFR-RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQP------------------LVLAALYDCWENPK-GELLYTFTILTTSSSPAL
        S+++  K+S++  L+  +RC+V  +GFYEW++ G  KQPY+I+F   +                   L +A ++DCWE PK GE LYT+TI+T  +S  +
Subjt:  SESIREKASFR-RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQP------------------LVLAALYDCWENPK-GELLYTFTILTTSSSPAL

Query:  EWLHDRMPVILGDKEQMDIWLNDSSSSKYDTVLKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQL
         ++H RMP IL   E ++ WL+ +     + +     A ++ ++PV+  +     D P+C+  I+L
Subjt:  EWLHDRMPVILGDKEQMDIWLNDSSSSKYDTVLKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQL

Q6IND6 Abasic site processing protein HMCES3.3e-3833.78Show/hide
Query:  MCGRARCTLRADDITRAC-------HRTGARVRTLNMDRFRPLFNASPGSDLPVV----RRDDESGGGEVVLQCMKWGLIPS-FTEKSEKPNYFKMFNAR
        MCGR  CTL  DD+++AC        +   + R  + D+++P +N SP S+ PV+        ++   E VL  M+WGLIPS F E       +K  N R
Subjt:  MCGRARCTLRADDITRAC-------HRTGARVRTLNMDRFRPLFNASPGSDLPVV----RRDDESGGGEVVLQCMKWGLIPS-FTEKSEKPNYFKMFNAR

Query:  SESIREKASFRR-LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFK----------------DGQPLV-LAALYDCWENPK-GELLYTFTILTTSSSPALE
        S++I EKA ++  L   RRC+V  +GFYEWK+   +KQPYYI+F                 +GQ L+ +A L+DCWE P  GE LY++T++T  SS  + 
Subjt:  SESIREKASFRR-LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFK----------------DGQPLV-LAALYDCWENPK-GELLYTFTILTTSSSPALE

Query:  WLHDRMPVILGDKEQMDIWLNDSSSSKYDTVLKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK-------NDGSNLISKFFSAKEIKKEHSDS
         +HDRMP IL   E +  WL+    S  D +   +   ++ ++PV+  +     +  +CI  + L        +  S  + ++   K  KKE S S
Subjt:  WLHDRMPVILGDKEQMDIWLNDSSSSKYDTVLKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK-------NDGSNLISKFFSAKEIKKEHSDS

Q6P7N4 Abasic site processing protein HMCES1.3e-3934.8Show/hide
Query:  MCGRARCTLRADDITRAC---HRTGARV----RTLNMDRFRPLFNASPGSDLPVV----RRDDESGGGEVVLQCMKWGLIPS-FTEKSEKPNYFKMFNAR
        MCGR  CTL  DD+ +AC    + G R     R  + D+++P +N SP S+ PV+        ++   E VL  M+WGLIPS F E       +K  N R
Subjt:  MCGRARCTLRADDITRAC---HRTGARV----RTLNMDRFRPLFNASPGSDLPVV----RRDDESGGGEVVLQCMKWGLIPS-FTEKSEKPNYFKMFNAR

Query:  SESIREKASFR-RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFK----------------DGQPLV-LAALYDCWENPK-GELLYTFTILTTSSSPALE
        S+++ EKA ++  L   +RC+V  +GFYEW++  S+KQPYYI+F                 +GQ L+ +A L+DCWE P  GE LY++T++T  SS  + 
Subjt:  SESIREKASFR-RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFK----------------DGQPLV-LAALYDCWENPK-GELLYTFTILTTSSSPALE

Query:  WLHDRMPVILGDKEQMDIWLNDSSSSKYDTVLKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEI---QLKNDGSNLISK----FFSAKEIKKEHSDS
        W+HDRMP IL   E +  WL+       D +   +   ++ ++PV+  +     + P+C+  I   Q K    +  SK    +   K  KKE S S
Subjt:  WLHDRMPVILGDKEQMDIWLNDSSSSKYDTVLKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEI---QLKNDGSNLISK----FFSAKEIKKEHSDS

Q8R1M0 Abasic site processing protein HMCES6.4e-3430.74Show/hide
Query:  MCGRARCTLRADDITRAC---HRTGAR--VRTLNMDRFRPLFNASPGSDLPV----VRRDDESGGGEVVLQCMKWGLIPS-FTEKSEKPNYFKMFNARSE
        MCGR  C L  + +TRAC    R G R   +  + D++ P +N SP S  PV    +  + ++   + ++  M+WGL+PS F E       F   N RS+
Subjt:  MCGRARCTLRADDITRAC---HRTGAR--VRTLNMDRFRPLFNASPGSDLPV----VRRDDESGGGEVVLQCMKWGLIPS-FTEKSEKPNYFKMFNARSE

Query:  SIREKASFRRLVPK-RRCLVAVEGFYEWKK--DGSKKQPYYIHF------KDG------------------QPLVLAALYDCWENPKGELLYTFTILTTS
        +I EK SF+  + K RRC+V  +GFYEW++    +++QPY+I+F      K G                  + L +A ++DCWE P GE LY+++I+T  
Subjt:  SIREKASFRRLVPK-RRCLVAVEGFYEWKK--DGSKKQPYYIHF------KDG------------------QPLVLAALYDCWENPKGELLYTFTILTTS

Query:  SSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTVLKPYGAPDLVWYPVTPSMGKPSFDGPDC-------IKEIQLKNDGSNLISKFFSAKEIKKEHS
        S   L  +H RMP IL  +E +  WL+    +  + +   +   ++ ++PV+P +     + P+C       +K+    N  S  + ++ + K  KKE  
Subjt:  SSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTVLKPYGAPDLVWYPVTPSMGKPSFDGPDC-------IKEIQLKNDGSNLISKFFSAKEIKKEHS

Query:  DSQEKTSCG
        DS +K + G
Subjt:  DSQEKTSCG

Arabidopsis top hitse value%identityAlignment
AT2G26470.1 unknown protein3.9e-10362.14Show/hide
Query:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGE-VVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFR
        MCGR RCTLR DD+ RA HR     R L++DR+RP +N +PGS +PV+RRD+E   G+ VV+ CMKWGL+PSFT+K++KP++FKMFNARSES+ EKASFR
Subjt:  MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGE-VVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFR

Query:  RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDT
        RL+PK RCLVAV+GFYEWKK+GSKKQPYYIHF+DG+PLV AAL+D W+N  GE LYTFTILTT+SS AL+WLHDRMPVILGDK+ +D WL+D S++K   
Subjt:  RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDT

Query:  VLKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEK-TSCGTSVKHEASPSLEE
        +L PY   DLVWYPVT ++GKP+FDGP+CI++I LK   ++LISKFFS K+ K +  D + K T     V  +  P+ E+
Subjt:  VLKPYGAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEK-TSCGTSVKHEASPSLEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCGGAAGAGCCCGTTGTACTCTTCGAGCTGATGATATCACCAGGGCCTGCCACCGCACCGGCGCCCGCGTCCGCACCCTCAACATGGACCGTTTTCGTCCGCTGTT
CAATGCCTCCCCGGGCTCGGATTTGCCGGTTGTTCGTCGAGACGATGAATCTGGTGGCGGAGAGGTCGTCCTCCAGTGCATGAAATGGGGGCTGATTCCTAGTTTTACTG
AGAAATCCGAGAAACCTAATTACTTCAAGATGTTCAATGCTCGCTCAGAATCCATACGTGAGAAGGCCTCTTTTCGCCGTCTAGTTCCTAAAAGAAGGTGCCTTGTGGCA
GTGGAAGGGTTCTATGAGTGGAAAAAGGATGGATCAAAAAAGCAGCCGTATTATATCCATTTTAAGGATGGGCAGCCACTTGTTCTTGCTGCTTTATATGATTGTTGGGA
AAATCCTAAAGGTGAATTACTTTACACTTTTACCATTCTTACAACTTCATCATCTCCAGCTTTGGAGTGGTTGCATGATAGGATGCCTGTAATTTTGGGTGACAAAGAAC
AGATGGATATATGGTTGAATGATTCTTCATCGTCCAAGTATGATACTGTTCTTAAACCATATGGGGCTCCTGATTTGGTATGGTACCCTGTAACTCCATCCATGGGTAAG
CCATCATTTGATGGGCCAGACTGCATTAAGGAGATACAGCTGAAGAATGATGGAAGCAACCTCATCTCCAAATTTTTCTCTGCAAAAGAAATTAAAAAGGAACATTCAGA
CTCACAAGAGAAAACCTCCTGTGGCACATCTGTGAAGCATGAGGCATCGCCAAGTCTAGAAGAACACAAAAGAGATGTAAATCTTGAAGCTTCATCTGTAGAATCAAAGG
ATTGTCTTGCAAAGTGTTCATCCGATACTGCACGAACATGTCAAATAAAACGGGACCGTGAAGGCATCTCATCCGAGTCGAAAAGTGGCGTGGATGACTACAGTAAGGTA
GGAAGCAGTCCAAAGATAAGAAAGAAGGGAAGCCTGAAGACTGGTAATGACAACCAATCAACCCTCTTTTCATACTTTGGGAGGAAATAG
mRNA sequenceShow/hide mRNA sequence
CCAACAGTCAAGCTTCTGAACTCTACCTTTCCTTTCCAGTAGAATACTATACAGAGTGTAGCAAAGCATACAACAGACGGATGTGCGGAAGAGCCCGTTGTACTCTTCGA
GCTGATGATATCACCAGGGCCTGCCACCGCACCGGCGCCCGCGTCCGCACCCTCAACATGGACCGTTTTCGTCCGCTGTTCAATGCCTCCCCGGGCTCGGATTTGCCGGT
TGTTCGTCGAGACGATGAATCTGGTGGCGGAGAGGTCGTCCTCCAGTGCATGAAATGGGGGCTGATTCCTAGTTTTACTGAGAAATCCGAGAAACCTAATTACTTCAAGA
TGTTCAATGCTCGCTCAGAATCCATACGTGAGAAGGCCTCTTTTCGCCGTCTAGTTCCTAAAAGAAGGTGCCTTGTGGCAGTGGAAGGGTTCTATGAGTGGAAAAAGGAT
GGATCAAAAAAGCAGCCGTATTATATCCATTTTAAGGATGGGCAGCCACTTGTTCTTGCTGCTTTATATGATTGTTGGGAAAATCCTAAAGGTGAATTACTTTACACTTT
TACCATTCTTACAACTTCATCATCTCCAGCTTTGGAGTGGTTGCATGATAGGATGCCTGTAATTTTGGGTGACAAAGAACAGATGGATATATGGTTGAATGATTCTTCAT
CGTCCAAGTATGATACTGTTCTTAAACCATATGGGGCTCCTGATTTGGTATGGTACCCTGTAACTCCATCCATGGGTAAGCCATCATTTGATGGGCCAGACTGCATTAAG
GAGATACAGCTGAAGAATGATGGAAGCAACCTCATCTCCAAATTTTTCTCTGCAAAAGAAATTAAAAAGGAACATTCAGACTCACAAGAGAAAACCTCCTGTGGCACATC
TGTGAAGCATGAGGCATCGCCAAGTCTAGAAGAACACAAAAGAGATGTAAATCTTGAAGCTTCATCTGTAGAATCAAAGGATTGTCTTGCAAAGTGTTCATCCGATACTG
CACGAACATGTCAAATAAAACGGGACCGTGAAGGCATCTCATCCGAGTCGAAAAGTGGCGTGGATGACTACAGTAAGGTAGGAAGCAGTCCAAAGATAAGAAAGAAGGGA
AGCCTGAAGACTGGTAATGACAACCAATCAACCCTCTTTTCATACTTTGGGAGGAAATAGACAGGCCTTACCTTGTTTCGAAACAAATATTATCCAGATAGGTAAATCGA
TTACTCGTGCGTGACGTCTCGTGTTTATATGTATGCTGTTTAGTTTGTTTATCTTGCTGTGTTAGATGCTGCGGAATGCCGAGGTACGTGGA
Protein sequenceShow/hide protein sequence
MCGRARCTLRADDITRACHRTGARVRTLNMDRFRPLFNASPGSDLPVVRRDDESGGGEVVLQCMKWGLIPSFTEKSEKPNYFKMFNARSESIREKASFRRLVPKRRCLVA
VEGFYEWKKDGSKKQPYYIHFKDGQPLVLAALYDCWENPKGELLYTFTILTTSSSPALEWLHDRMPVILGDKEQMDIWLNDSSSSKYDTVLKPYGAPDLVWYPVTPSMGK
PSFDGPDCIKEIQLKNDGSNLISKFFSAKEIKKEHSDSQEKTSCGTSVKHEASPSLEEHKRDVNLEASSVESKDCLAKCSSDTARTCQIKRDREGISSESKSGVDDYSKV
GSSPKIRKKGSLKTGNDNQSTLFSYFGRK