; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G15020 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G15020
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionabasic site processing protein YoqW isoform X2
Genome locationChr7:13698487..13706513
RNA-Seq ExpressionCSPI07G15020
SyntenyCSPI07G15020
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0006974 - cellular response to DNA damage stimulus (biological process)
GO:0018142 - protein-DNA covalent cross-linking (biological process)
GO:0003697 - single-stranded DNA binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsIPR003738 - SOS response associated peptidase (SRAP)
IPR036590 - SOS response associated peptidase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593042.1 Abasic site processing protein HMCES, partial [Cucurbita argyrosperma subsp. sororia]4.1e-16380.77Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
        MCGRARCTLR DDI+RACH TGGP+RSLNMDRFRPLFNASPGSDLPVVRRDDES  GGVVLQCMKWGLIPSFT K EKPNYFKMFNARSES+ EKASF R
Subjt:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
        LVPKRRCLVAVEGFYEWKKDGS+KQPYYIHFKDGQPL  AALYD WEN EGELLYTFTILTTSSSPAL+WLHDRMPVILGDKER+DMWLNDSSSSKYD+V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV

Query:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASS-----EESKDC
        LKPYEAPDLVWYPVTP+MGK SFDGPDCIKEIQLK DG+NLISKFFSAKETKKE S SQEKT  NTSVKPE S +LEEHKR+ +  ASS      +S+D 
Subjt:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASS-----EESKDC

Query:  LAKCSSDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
        LAKC S T+ T + KRDRE  SS+ + G++D SK+ SS KIRKK +LKTG +N+ TLFSYFG+K
Subjt:  LAKCSSDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK

XP_011659220.1 uncharacterized protein LOC101206083 isoform X1 [Cucumis sativus]5.5e-20899.72Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
        MCGRARCTLRADDITRACH TGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
Subjt:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV

Query:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS
        LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS
Subjt:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS

Query:  SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
        SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
Subjt:  SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK

XP_011659221.1 uncharacterized protein LOC101206083 isoform X2 [Cucumis sativus]1.9e-20599.16Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
        MCGRARCTLRADDITRACH TGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
Subjt:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV

Query:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS
        LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKE  LKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS
Subjt:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS

Query:  SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
        SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
Subjt:  SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK

XP_038896829.1 abasic site processing protein YoqW isoform X1 [Benincasa hispida]5.0e-18590.25Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
        MCGRARCTLRADDI RACH TGG VR+LNMDRFRPLFNASPGSDLPVVRRDDES DGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESI EKASF R
Subjt:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDG+PL LAALYDCWEN EGELLYTFTILTTS+SPAL WLHDRMPVILGDKERMDMWLNDSSSSKYD+V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV

Query:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS
        LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFF AKE KKE+S SQEKT  NT VKPEASPSLEEHK +VN  ASSEESKDCLAKCS
Subjt:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS

Query:  SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
        S+T+ T QIKRDREDISS  KSG+DDYSKVGSSPK RKKGNLK GNDNQ TLFSYFG+K
Subjt:  SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK

XP_038896830.1 abasic site processing protein HMCES isoform X2 [Benincasa hispida]1.8e-18289.69Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
        MCGRARCTLRADDI RACH TGG VR+LNMDRFRPLFNASPGSDLPVVRRDDES DGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESI EKASF R
Subjt:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDG+PL LAALYDCWEN EGELLYTFTILTTS+SPAL WLHDRMPVILGDKERMDMWLNDSSSSKYD+V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV

Query:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS
        LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKE  LKNDGSNLISKFF AKE KKE+S SQEKT  NT VKPEASPSLEEHK +VN  ASSEESKDCLAKCS
Subjt:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS

Query:  SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
        S+T+ T QIKRDREDISS  KSG+DDYSKVGSSPK RKKGNLK GNDNQ TLFSYFG+K
Subjt:  SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK

TrEMBL top hitse value%identityAlignment
A0A0A0K6X8 Uncharacterized protein2.7e-20899.72Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
        MCGRARCTLRADDITRACH TGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
Subjt:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV

Query:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS
        LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS
Subjt:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCS

Query:  SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
        SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
Subjt:  SDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK

A0A1S3C6L7 putative SOS response-associated peptidase YobE isoform X19.3e-14597.23Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
        MCGRARCTLRADDITRACH TGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEK SFHR
Subjt:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDG+PLALAALYDCWENLEGELLYTFTILTTS SPALKWLHDRMPVILGDKERMDMWL+DSSSSKYD+V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV

Query:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKK
         KPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKK
Subjt:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKK

A0A1S3C774 embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein isoform X24.3e-14296.44Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
        MCGRARCTLRADDITRACH TGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEK SFHR
Subjt:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDG+PLALAALYDCWENLEGELLYTFTILTTS SPALKWLHDRMPVILGDKERMDMWL+DSSSSKYD+V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV

Query:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKK
         KPYEAPDLVWYPVTPSMGKPSFDGPDCIKE  LKNDGSNLISKFFSAKETKK
Subjt:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKK

A0A5A7UR90 Embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein isoform X22.1e-14192.03Show/hide
Query:  FNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKER
        FNARSESIHEK SFHRLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDG+PLALAALYDCWENLEGELLYTFTILTTS SPALKWLHDRMPVILGDKER
Subjt:  FNARSESIHEKASFHRLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKER

Query:  MDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKET-KKEYSVSQEKTCSNTSVKPEASPSLEEHKREV
        MDMWL+DSSSSKYD+V KPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKET KKE+S SQ+KT SNTSVKPEASPSLEEHKRE 
Subjt:  MDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKET-KKEYSVSQEKTCSNTSVKPEASPSLEEHKREV

Query:  NRGASSEESKDCLAKCSSDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK
        N GASSEES+DCLAKCSS TSLTYQIKRDREDISS  KSG+DDYSK GS PKIRKKGNLKTGNDNQLTL SYFG+K
Subjt:  NRGASSEESKDCLAKCSSDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSPKIRKKGNLKTGNDNQLTLFSYFGKK

A0A6J1H9B1 LOW QUALITY PROTEIN: uncharacterized protein LOC1114612585.7e-13490.91Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR
        MCGRARCTLR DDI+RACH TGGP+RSLNMDRFRPLFNASPGSDLPVVRRDDES  GGVVLQCMKWGLIPSFT K EKPNYFKMFNARSES+ EKASF R
Subjt:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHR

Query:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV
        LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPL  AALYD WEN EGELLYTFTILTTSSSPAL+WLHDRMPVILGDKER+DMWLNDSSSSKYD+V
Subjt:  LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSV

Query:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKK
        LKPYEAPDLVWYPVTP+MGK SFDGPDCIKEIQLK DG+NLISKFFSAKET K
Subjt:  LKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKK

SwissProt top hitse value%identityAlignment
Q5XIJ1 Abasic site processing protein HMCES4.2e-3330.75Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRS-----LNMDRFRPLFNASPGSDLPVV------RRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFNAR
        MCGR  C L  D +TRAC       R       + D++ P +N SP S  PV+       +D +SSD   ++  M+WGL+PS F E       F   N R
Subjt:  MCGRARCTLRADDITRACHCTGGPVRS-----LNMDRFRPLFNASPGSDLPVV------RRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFNAR

Query:  SESIHEKASFHRLVPK-RRCLVAVEGFYEWKK--DGSKKQPYYIHF------KDGQP------------------LALAALYDCWENLEGELLYTFTILT
        S++I EK SF   + K RRC+V  +GFYEW++    +++QPY+I+F      K G+                   L +A ++DCWE  +GE LY+++I+T
Subjt:  SESIHEKASFHRLVPK-RRCLVAVEGFYEWKK--DGSKKQPYYIHF------KDGQP------------------LALAALYDCWENLEGELLYTFTILT

Query:  TSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDC-------IKEIQLKNDGSNLISKFFSAKETKKE
          S   L  +H RMP IL  +E +  WL+    S  +++   +   ++ ++PV+P +     + P+C       +K+    +  S  + ++ + K  KKE
Subjt:  TSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDC-------IKEIQLKNDGSNLISKFFSAKETKKE

Query:  YSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASS
           S +K  S   +   +S  L++      RGASS
Subjt:  YSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASS

Q5ZJT1 Abasic site processing protein HMCES8.4e-3433.46Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRS-----LNMDRFRPLFNASPGSDLPV------VRRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFNAR
        MCGR  C+L A  + RAC       R      L   R+RP +N  P S  PV      V++D +SS+   VL  M+WGL+PS F E       FK  N R
Subjt:  MCGRARCTLRADDITRACHCTGGPVRS-----LNMDRFRPLFNASPGSDLPV------VRRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFNAR

Query:  SESIHEKASFH-RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQP------------------LALAALYDCWENLE-GELLYTFTILTTSSSPAL
        S+++  K+S+   L+  +RC+V  +GFYEW++ G  KQPY+I+F   +                   L +A ++DCWE  + GE LYT+TI+T  +S  +
Subjt:  SESIHEKASFH-RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQP------------------LALAALYDCWENLE-GELLYTFTILTTSSSPAL

Query:  KWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQL
         ++H RMP IL   E ++ WL+ +     +++     A ++ ++PV+  +     D P+C+  I+L
Subjt:  KWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQL

Q6IND6 Abasic site processing protein HMCES1.1e-3633.89Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRSL-------NMDRFRPLFNASPGSDLPVV------RRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFN
        MCGR  CTL  DD+++AC       R         + D+++P +N SP S+ PV+      ++D +SS+   VL  M+WGLIPS F E       +K  N
Subjt:  MCGRARCTLRADDITRACHCTGGPVRSL-------NMDRFRPLFNASPGSDLPVV------RRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFN

Query:  ARSESIHEKASFHR-LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFK----------------DGQP-LALAALYDCWENLE-GELLYTFTILTTSSSPA
         RS++I EKA +   L   RRC+V  +GFYEWK+   +KQPYYI+F                 +GQ  L +A L+DCWE    GE LY++T++T  SS  
Subjt:  ARSESIHEKASFHR-LVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFK----------------DGQP-LALAALYDCWENLE-GELLYTFTILTTSSSPA

Query:  LKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK-------NDGSNLISKFFSAKETKKEYSVS
        +  +HDRMP IL   E +  WL+    S  D++   +   ++ ++PV+  +     +  +CI  + L        +  S  + ++   K  KKE S S
Subjt:  LKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK-------NDGSNLISKFFSAKETKKEYSVS

Q6P7N4 Abasic site processing protein HMCES9.5e-3834.9Show/hide
Query:  MCGRARCTLRADDITRAC---HCTGGPV----RSLNMDRFRPLFNASPGSDLPVV------RRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFN
        MCGR  CTL  DD+ +AC      GG      R  + D+++P +N SP S+ PV+      ++D +SS+   VL  M+WGLIPS F E       +K  N
Subjt:  MCGRARCTLRADDITRAC---HCTGGPV----RSLNMDRFRPLFNASPGSDLPVV------RRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFN

Query:  ARSESIHEKASFH-RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFK----------------DGQP-LALAALYDCWENLE-GELLYTFTILTTSSSPA
         RS+++ EKA +   L   +RC+V  +GFYEW++  S+KQPYYI+F                 +GQ  L +A L+DCWE    GE LY++T++T  SS  
Subjt:  ARSESIHEKASFH-RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFK----------------DGQP-LALAALYDCWENLE-GELLYTFTILTTSSSPA

Query:  LKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEI---QLKNDGSNLISK----FFSAKETKKEYSVS
        + W+HDRMP IL   E +  WL+       D++   +   ++ ++PV+  +     + P+C+  I   Q K    +  SK    +   K  KKE S S
Subjt:  LKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEI---QLKNDGSNLISK----FFSAKETKKEYSVS

Q8R1M0 Abasic site processing protein HMCES1.0e-3130.15Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRS-----LNMDRFRPLFNASPGSDLPVV------RRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFNAR
        MCGR  C L  + +TRAC       R       + D++ P +N SP S  PV+       +D +SSD   ++  M+WGL+PS F E       F   N R
Subjt:  MCGRARCTLRADDITRACHCTGGPVRS-----LNMDRFRPLFNASPGSDLPVV------RRDDESSDGGVVLQCMKWGLIPS-FTEKFEKPNYFKMFNAR

Query:  SESIHEKASFHRLVPK-RRCLVAVEGFYEWKK--DGSKKQPYYIHF------KDG------------------QPLALAALYDCWENLEGELLYTFTILT
        S++I EK SF   + K RRC+V  +GFYEW++    +++QPY+I+F      K G                  + L +A ++DCWE   GE LY+++I+T
Subjt:  SESIHEKASFHRLVPK-RRCLVAVEGFYEWKK--DGSKKQPYYIHF------KDG------------------QPLALAALYDCWENLEGELLYTFTILT

Query:  TSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDC-------IKEIQLKNDGSNLISKFFSAKETKKE
          S   L  +H RMP IL  +E +  WL+    +  +++   +   ++ ++PV+P +     + P+C       +K+    N  S  + ++ + K  KKE
Subjt:  TSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDC-------IKEIQLKNDGSNLISKFFSAKETKKE

Query:  YSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASS
           S +K  S   +   +S  L++      RGA+S
Subjt:  YSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASS

Arabidopsis top hitse value%identityAlignment
AT2G26470.1 unknown protein4.3e-10262.14Show/hide
Query:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDG-GVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFH
        MCGR RCTLR DD+ RA H    P R L++DR+RP +N +PGS +PV+RRD+E   G GVV+ CMKWGL+PSFT+K +KP++FKMFNARSES+ EKASF 
Subjt:  MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDG-GVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFH

Query:  RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDS
        RL+PK RCLVAV+GFYEWKK+GSKKQPYYIHF+DG+PL  AAL+D W+N  GE LYTFTILTT+SS AL+WLHDRMPVILGDK+ +D WL+D S++K   
Subjt:  RLVPKRRCLVAVEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDS

Query:  VLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEK-TCSNTSVKPEASPSLEE
        +L PYE  DLVWYPVT ++GKP+FDGP+CI++I LK   ++LISKFFS K+ K +    + K T +N  V  +  P+ E+
Subjt:  VLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEK-TCSNTSVKPEASPSLEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCGGAAGAGCCCGTTGTACTCTTCGAGCTGATGACATCACCAGGGCCTGCCACTGCACCGGCGGCCCCGTACGCTCCCTCAACATGGACCGTTTTCGTCCGCTGTT
CAATGCCTCCCCGGGCTCCGATTTACCGGTTGTTCGTCGAGACGATGAATCTAGTGATGGAGGAGTCGTCCTTCAGTGCATGAAATGGGGGCTTATTCCTAGTTTTACTG
AGAAATTCGAGAAACCTAATTACTTCAAGATGTTCAATGCTCGCTCAGAGTCCATACATGAAAAGGCCTCTTTTCACCGTCTAGTTCCTAAAAGAAGGTGCCTTGTGGCA
GTGGAAGGGTTCTACGAGTGGAAAAAGGACGGATCAAAAAAGCAGCCGTATTATATTCATTTTAAGGATGGGCAGCCACTTGCTCTTGCTGCTTTATATGATTGTTGGGA
AAACCTTGAAGGTGAATTACTTTACACTTTCACCATTCTAACAACTTCATCATCTCCAGCTTTGAAGTGGTTGCACGATAGGATGCCTGTAATATTGGGTGACAAAGAAC
GTATGGATATGTGGTTGAATGATTCTTCATCGTCCAAGTATGATTCCGTCCTTAAACCATACGAGGCTCCTGATTTGGTATGGTACCCTGTAACTCCTTCCATGGGAAAG
CCTTCATTTGATGGGCCAGACTGCATCAAGGAGATACAGCTAAAGAATGATGGAAGCAATCTCATCTCCAAATTTTTCTCTGCAAAAGAAACAAAAAAGGAATATTCGGT
CTCACAAGAGAAAACTTGCTCTAACACATCTGTGAAGCCCGAGGCATCTCCAAGTCTAGAAGAGCACAAAAGAGAAGTAAATCGTGGAGCTTCATCTGAAGAATCAAAGG
ATTGTCTTGCAAAGTGTTCATCTGATACTTCACTAACATATCAAATAAAACGAGATCGTGAAGACATCTCATCCGACTTGAAAAGTGGCATGGACGACTACAGCAAGGTA
GGCAGCAGTCCAAAGATACGGAAGAAGGGAAACCTGAAAACTGGTAATGACAACCAATTAACCCTCTTTTCATACTTTGGAAAGAAATAG
mRNA sequenceShow/hide mRNA sequence
GTGGGAATGTAAGAAAATGGATGTGTTCATCGGTTTGCTTCTTCCCGCCTTTTGCGTTTCTTCACGTCCAGAACTTTCCGTTCAGGCTGAGCTGAGCACCTTCTCCAAGG
CAAGGCTTCCGCCATAGCCACCCCAACAGTCAAAGCTTCTCAACTCCACTCTTCCTTTCCCGTAGAATACTACAAAGACTGTAGCAAAGCATATTAGAGACGGATGTGCG
GAAGAGCCCGTTGTACTCTTCGAGCTGATGACATCACCAGGGCCTGCCACTGCACCGGCGGCCCCGTACGCTCCCTCAACATGGACCGTTTTCGTCCGCTGTTCAATGCC
TCCCCGGGCTCCGATTTACCGGTTGTTCGTCGAGACGATGAATCTAGTGATGGAGGAGTCGTCCTTCAGTGCATGAAATGGGGGCTTATTCCTAGTTTTACTGAGAAATT
CGAGAAACCTAATTACTTCAAGATGTTCAATGCTCGCTCAGAGTCCATACATGAAAAGGCCTCTTTTCACCGTCTAGTTCCTAAAAGAAGGTGCCTTGTGGCAGTGGAAG
GGTTCTACGAGTGGAAAAAGGACGGATCAAAAAAGCAGCCGTATTATATTCATTTTAAGGATGGGCAGCCACTTGCTCTTGCTGCTTTATATGATTGTTGGGAAAACCTT
GAAGGTGAATTACTTTACACTTTCACCATTCTAACAACTTCATCATCTCCAGCTTTGAAGTGGTTGCACGATAGGATGCCTGTAATATTGGGTGACAAAGAACGTATGGA
TATGTGGTTGAATGATTCTTCATCGTCCAAGTATGATTCCGTCCTTAAACCATACGAGGCTCCTGATTTGGTATGGTACCCTGTAACTCCTTCCATGGGAAAGCCTTCAT
TTGATGGGCCAGACTGCATCAAGGAGATACAGCTAAAGAATGATGGAAGCAATCTCATCTCCAAATTTTTCTCTGCAAAAGAAACAAAAAAGGAATATTCGGTCTCACAA
GAGAAAACTTGCTCTAACACATCTGTGAAGCCCGAGGCATCTCCAAGTCTAGAAGAGCACAAAAGAGAAGTAAATCGTGGAGCTTCATCTGAAGAATCAAAGGATTGTCT
TGCAAAGTGTTCATCTGATACTTCACTAACATATCAAATAAAACGAGATCGTGAAGACATCTCATCCGACTTGAAAAGTGGCATGGACGACTACAGCAAGGTAGGCAGCA
GTCCAAAGATACGGAAGAAGGGAAACCTGAAAACTGGTAATGACAACCAATTAACCCTCTTTTCATACTTTGGAAAGAAATAGATAGGCCTGCTTTGTTTCAAAACAGAC
AGGTGTGCGCTGCATCTCATATGTTTATATATGCCATTTATTTTGTTTATCTTGGTGTGTTAGTTGCTGCTGACTGCTGAGGTACGTGGAAGGTTTTTATTTTTTTTTAA
TGAACATTTCGATTGGGTTAAATCTTAAATGCAGTGCATTTTTTGTTGTATAAAGGGCAGTCTTCTGTAGCTTAGAAGGGC
Protein sequenceShow/hide protein sequence
MCGRARCTLRADDITRACHCTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGGVVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVA
VEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKWLHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGK
PSFDGPDCIKEIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSEESKDCLAKCSSDTSLTYQIKRDREDISSDLKSGMDDYSKV
GSSPKIRKKGNLKTGNDNQLTLFSYFGKK