CuGenDBv2

MS004793 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS004793
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: abasic site processing protein YoqW isoform X4
Genome location: scaffold176:474622..482659
RNA-Seq Expression: MS004793
Synteny: MS004793
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0006974 - cellular response to DNA damage stimulus (biological process)
GO:0018142 - protein-DNA covalent cross-linking (biological process)
GO:0003697 - single-stranded DNA binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
InterPro domains:
IPR003738 - SOS response associated peptidase (SRAP)
IPR036590 - SOS response associated peptidase-like
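The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits that string into its parts and computes the span of the gene. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The source page does not confirm the convention.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return seqid, start, end, end - start + 1


seqid, start, end, span = parse_location("scaffold176:474622..482659")
print(seqid, start, end, span)  # scaffold176 474622 482659 8038
```

Under the inclusive-coordinate assumption, the gene spans 8038 bp on scaffold176.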


Homology
GenBank top hits (e value, % identity, alignment)
KAG6593042.1 Abasic site processing protein HMCES, partial [Cucurbita argyrosperma subsp. sororia] | e value: 4.4e-168 | % identity: 81.08
Query:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR
        MCGRARCTL   D+ RACHR  GP+R+LN+ RFRPL+NASPGSDLPVVRRDDES GGGVVLQCMKWGLIPSFT KSEKPNY+KMFNARSES+ EKASFRR
Subjt:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR

Query:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS
        LVPK RCLVAVEGFYEWKKDGS+KQPYY+HFKDG+PLVFAALYD WENPEG     ELLYTFTILTTSSSPALEWLHDRMPVI GDKER+DMWLND SSS
Subjt:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS

Query:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA
        K+D VLKPYEAPDLVWYPVTP+MGK SFDGPDCIKEIQLKT+GNNLISKFFSAKETKKE SDSQE+TS CNTSVKPEPSQ LEEH+RD +H ASSCS+  
Subjt:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA

Query:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK
         +S+DNLAKC S +A+TC+ KRDRE  SS+S IG+ND  KI SSSKIRKK  +K G +N+STLFSYFGRK
Subjt:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK

XP_011659220.1 uncharacterized protein LOC101206083 isoform X1 [Cucumis sativus] | e value: 1.4e-161 | % identity: 79.19
Query:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR
        MCGRARCTL A D+ RACHR  GP+R+LN+ RFRPL+NASPGSDLPVVRRDDES  GGVVLQCMKWGLIPSFT+K EKPNY+KMFNARSESI EKASF R
Subjt:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR

Query:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS
        LVPK RCLVAVEGFYEWKKDGSKKQPYY+HFKDG+PL  AALYD WEN EG     ELLYTFTILTTSSSPAL+WLHDRMPVI GDKERMDMWLND SSS
Subjt:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS

Query:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA
        K+D+VLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK +G+NLISKFFSAKETKKE S SQE+T   NTSVKPE S  LEEH+R+VN GASS     
Subjt:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA

Query:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK
        EESKD LAKCSS+++ T Q+KRDRE+ISSD   G++DY K+GSS KIRKKG +K G+DNQ TLFSYFG+K
Subjt:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK

XP_011659221.1 uncharacterized protein LOC101206083 isoform X2 [Cucumis sativus] | e value: 6.3e-159 | % identity: 78.65
Query:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR
        MCGRARCTL A D+ RACHR  GP+R+LN+ RFRPL+NASPGSDLPVVRRDDES  GGVVLQCMKWGLIPSFT+K EKPNY+KMFNARSESI EKASF R
Subjt:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR

Query:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS
        LVPK RCLVAVEGFYEWKKDGSKKQPYY+HFKDG+PL  AALYD WEN EG     ELLYTFTILTTSSSPAL+WLHDRMPVI GDKERMDMWLND SSS
Subjt:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS

Query:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA
        K+D+VLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKE  LK +G+NLISKFFSAKETKKE S SQE+T   NTSVKPE S  LEEH+R+VN GASS     
Subjt:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA

Query:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK
        EESKD LAKCSS+++ T Q+KRDRE+ISSD   G++DY K+GSS KIRKKG +K G+DNQ TLFSYFG+K
Subjt:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK

XP_038896829.1 abasic site processing protein YoqW isoform X1 [Benincasa hispida] | e value: 8.2e-167 | % identity: 81.89
Query:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR
        MCGRARCTL A D+PRACHR  G +RTLN+ RFRPL+NASPGSDLPVVRRDDESG GGVVLQCMKWGLIPSFT+K EKPNY+KMFNARSESIREKASFRR
Subjt:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR

Query:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS
        LVPK RCLVAVEGFYEWKKDGSKKQPYY+HFKDGRPLV AALYD WENPEG     ELLYTFTILTTS+SPAL WLHDRMPVI GDKERMDMWLND SSS
Subjt:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS

Query:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA
        K+DTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK +G+NLISKFF AKE KKE SDSQE+TS CNT VKPE S  LEEH+ DVN  ASS     
Subjt:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA

Query:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK
        EESKD LAKCSSE+A TCQ+KRDRE+ISS S  G++DY K+GSS K RKKG +K G+DNQSTLFSYFGRK
Subjt:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK

XP_038896830.1 abasic site processing protein HMCES isoform X2 [Benincasa hispida] | e value: 3.8e-164 | % identity: 81.35
Query:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR
        MCGRARCTL A D+PRACHR  G +RTLN+ RFRPL+NASPGSDLPVVRRDDESG GGVVLQCMKWGLIPSFT+K EKPNY+KMFNARSESIREKASFRR
Subjt:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR

Query:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS
        LVPK RCLVAVEGFYEWKKDGSKKQPYY+HFKDGRPLV AALYD WENPEG     ELLYTFTILTTS+SPAL WLHDRMPVI GDKERMDMWLND SSS
Subjt:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS

Query:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA
        K+DTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKE  LK +G+NLISKFF AKE KKE SDSQE+TS CNT VKPE S  LEEH+ DVN  ASS     
Subjt:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA

Query:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK
        EESKD LAKCSSE+A TCQ+KRDRE+ISS S  G++DY K+GSS K RKKG +K G+DNQSTLFSYFGRK
Subjt:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6X8 Uncharacterized protein | e value: 6.6e-162 | % identity: 79.19
Query:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR
        MCGRARCTL A D+ RACHR  GP+R+LN+ RFRPL+NASPGSDLPVVRRDDES  GGVVLQCMKWGLIPSFT+K EKPNY+KMFNARSESI EKASF R
Subjt:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR

Query:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS
        LVPK RCLVAVEGFYEWKKDGSKKQPYY+HFKDG+PL  AALYD WEN EG     ELLYTFTILTTSSSPAL+WLHDRMPVI GDKERMDMWLND SSS
Subjt:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS

Query:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA
        K+D+VLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLK +G+NLISKFFSAKETKKE S SQE+T   NTSVKPE S  LEEH+R+VN GASS     
Subjt:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGA

Query:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK
        EESKD LAKCSS+++ T Q+KRDRE+ISSD   G++DY K+GSS KIRKKG +K G+DNQ TLFSYFG+K
Subjt:  EESKDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK

A0A6J1DZF4 LOW QUALITY PROTEIN: uncharacterized protein LOC111024488 | e value: 5.6e-145 | % identity: 97.66
Query:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR
        MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR
Subjt:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR

Query:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLNDSSSK
        LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEG     ELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLNDSSSK
Subjt:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLNDSSSK

Query:  FDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETK
        FDTVLKPYEAPDLVWYPVTPSMGK SFDGPDCIKEIQLKTEGNNLISKFFSAKETK
Subjt:  FDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETK

A0A6J1H9B1 LOW QUALITY PROTEIN: uncharacterized protein LOC111461258 | e value: 2.8e-128 | % identity: 83.27
Query:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR
        MCGRARCTL   D+ RACHR  GP+R+LN+ RFRPL+NASPGSDLPVVRRDDES GGGVVLQCMKWGLIPSFT KSEKPNY+KMFNARSES+ EKASFRR
Subjt:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR

Query:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS
        LVPK RCLVAVEGFYEWKKDGSKKQPYY+HFKDG+PLVFAALYD WENPEG     ELLYTFTILTTSSSPALEWLHDRMPVI GDKER+DMWLND SSS
Subjt:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS

Query:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSP
        K+D VLKPYEAPDLVWYPVTP+MGK SFDGPDCIKEIQLKT+GNNLISKFFSAKET K    + +   P
Subjt:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSP

A0A6J1KPK2 LOW QUALITY PROTEIN: uncharacterized protein LOC111497557 | e value: 3.1e-127 | % identity: 82.9
Query:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR
        MCGRARCTL   D+ RACHR  GP+R+LN+ RFRPL+NASPGSDLPVVRRDDES GGGVVLQCMKWGLIPSFT KSEKPNY+KMFNARSESI EKASFRR
Subjt:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRR

Query:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS
        LVPK RCLVAVEGFYEWKKDGSKKQPYY+HFKDG+PLVFAALYD WENPEG     E LYTFTILTTSSSPALEWLHDRMPVI GDKERMDMWLND SSS
Subjt:  LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SSS

Query:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSP
        K+D VLKPYEAPDLVWYPVTP+MGK SFDGPDCIKEIQ K++GNNLISKFFSAKET K    + +   P
Subjt:  KFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSP

A0A6P4AC51 uncharacterized protein LOC107426796 | e value: 1.5e-126 | % identity: 64.64
Query:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGG--VVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASF
        MCGRARCTL A D+PRACHR  G +RT+NI R+RP YN SPGS+LPVVRR D S  GG  VVL+CMKWGLIPSFT K+EKP++YKMFNARSESI EKASF
Subjt:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGG--VVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASF

Query:  RRLVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLNDSS
        RRLVP+SRCLVAVEGFYEWKKDGSKKQPYY+HFKDGRPLVFAALYD WEN EG     E+ YTFTILTTSSS AL+WLHDRMPVI GDKE  D WL  SS
Subjt:  RRLVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLNDSS

Query:  -SKFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPE-----PSQPLEEHE-RDVNHG
         +KFDT+LKPYE  DLVWYPVTP+MGKPSFDGP+CIKEI+LKTEG+NL+SKFFS K  KKE+    E+ S  + SVK +       +P EE E R+ N G
Subjt:  -SKFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPE-----PSQPLEEHE-RDVNHG

Query:  ASSCSVGAEES-KDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDN-QSTLFSYFGR
         SS +   E+  K +      + A  CQ KR  E +S+DS +  ++  K+  +S  +KKG +K   DN Q TLFSYFG+
Subjt:  ASSCSVGAEES-KDNLAKCSSESAATCQMKRDRENISSDSNIGINDYGKIGSSSKIRKKGGMKPGSDN-QSTLFSYFGR

SwissProt top hits (e value, % identity, alignment)
O31916 Abasic site processing protein YoqW | e value: 3.0e-31 | % identity: 36.89
Query:  FRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRRLVPKSRCLVAVEGFYEWKK-DGSKKQPYYVHF
        + P YN +P  ++  +  D    G    L  ++WGLIP +  K EK   YKM NAR+E++ EK SFR+ +   RC++  + FYEWK+ D   K P  +  
Subjt:  FRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRRLVPKSRCLVAVEGFYEWKK-DGSKKQPYYVHF

Query:  KDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLNDSSSKFD---TVLKPYEAPDLVWYPVTPSMGKPSFD
        K      FA LY+ W  PEG       LYT TI+TT  +  +E +HDRMPVI  D+   + WLN  ++  D   ++L+PY+A D+  Y V+  +  P  +
Subjt:  KDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLNDSSSKFD---TVLKPYEAPDLVWYPVTPSMGKPSFD

Query:  GPDCIK
         P+ I+
Subjt:  GPDCIK

O64131 SOS response-associated protein yoqW | e value: 3.0e-31 | % identity: 36.89
Query:  FRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRRLVPKSRCLVAVEGFYEWKK-DGSKKQPYYVHF
        + P YN +P  ++  +  D    G    L  ++WGLIP +  K EK   YKM NAR+E++ EK SFR+ +   RC++  + FYEWK+ D   K P  +  
Subjt:  FRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRRLVPKSRCLVAVEGFYEWKK-DGSKKQPYYVHF

Query:  KDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLNDSSSKFD---TVLKPYEAPDLVWYPVTPSMGKPSFD
        K      FA LY+ W  PEG       LYT TI+TT  +  +E +HDRMPVI  D+   + WLN  ++  D   ++L+PY+A D+  Y V+  +  P  +
Subjt:  KDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLNDSSSKFD---TVLKPYEAPDLVWYPVTPSMGKPSFD

Query:  GPDCIK
         P+ I+
Subjt:  GPDCIK

Q5ZJT1 Abasic site processing protein HMCES | e value: 3.0e-31 | % identity: 32.84
Query:  MCGRARCTLTAADVPRAC------HRAAGPLRTLNIHRFRPLYNASPGSDLPV------VRRDDESGGGGVVLQCMKWGLIPS-FTDKSEKPNYYKMFNA
        MCGR  C+L AA + RAC       R   P   L   R+RP YN  P S  PV      V++D +S     VL  M+WGL+PS F +       +K  N 
Subjt:  MCGRARCTLTAADVPRAC------HRAAGPLRTLNIHRFRPLYNASPGSDLPV------VRRDDESGGGGVVLQCMKWGLIPS-FTDKSEKPNYYKMFNA

Query:  RSESIREKASFR-RLVPKSRCLVAVEGFYEWKKDGSKKQPYYVHF------------------KDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTS
        RS+++  K+S++  L+   RC+V  +GFYEW++ G  KQPY+++F                  +  R L  A ++D WE P+G     E LYT+TI+T  
Subjt:  RSESIREKASFR-RLVPKSRCLVAVEGFYEWKKDGSKKQPYYVHF------------------KDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTS

Query:  SSPALEWLHDRMPVIFGDKERMDMWLNDSSSKFDTVLKPYE-APDLVWYPVTPSMGKPSFDGPDCIKEIQL
        +S  + ++H RMP I    E ++ WL+ +       +K    A ++ ++PV+  +     D P+C+  I+L
Subjt:  SSPALEWLHDRMPVIFGDKERMDMWLNDSSSKFDTVLKPYE-APDLVWYPVTPSMGKPSFDGPDCIKEIQL

Q6IND6 Abasic site processing protein HMCES | e value: 2.1e-32 | % identity: 31.79
Query:  MCGRARCTLTAADVPRAC-------HRAAGPLRTLNIHRFRPLYNASPGSDLPVV------RRDDESGGGGVVLQCMKWGLIPS-FTDKSEKPNYYKMFN
        MCGR  CTL   DV +AC        +     R  +  +++P YN SP S+ PV+      ++D +S     VL  M+WGLIPS F +       YK  N
Subjt:  MCGRARCTLTAADVPRAC-------HRAAGPLRTLNIHRFRPLYNASPGSDLPVV------RRDDESGGGGVVLQCMKWGLIPS-FTDKSEKPNYYKMFN

Query:  ARSESIREKASFRR-LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHF-----------------KDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTS
         RS++I EKA ++  L    RC+V  +GFYEWK+   +KQPYY++F                    R L  A L+D WE P G     E LY++T++T  
Subjt:  ARSESIREKASFRR-LVPKSRCLVAVEGFYEWKKDGSKKQPYYVHF-----------------KDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTS

Query:  SSPALEWLHDRMPVIFGDKERMDMWLNDSSSKFDTVLK-PYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLIS-------KFFSAKETKKETS
        SS  +  +HDRMP I    E +  WL+         LK  +   ++ ++PV+  +     +  +CI  + L  +    +S       ++   K  KKE S
Subjt:  SSPALEWLHDRMPVIFGDKERMDMWLNDSSSKFDTVLK-PYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLIS-------KFFSAKETKKETS

Query:  DS
         S
Subjt:  DS

Q6P7N4 Abasic site processing protein HMCES | e value: 6.6e-34 | % identity: 32.45
Query:  MCGRARCTLTAADVPRAC-------HRAAGPLRTLNIHRFRPLYNASPGSDLPVV------RRDDESGGGGVVLQCMKWGLIPS-FTDKSEKPNYYKMFN
        MCGR  CTL   DV +AC        R     R  +  +++P YN SP S+ PV+      ++D +S     VL  M+WGLIPS F +       YK  N
Subjt:  MCGRARCTLTAADVPRAC-------HRAAGPLRTLNIHRFRPLYNASPGSDLPVV------RRDDESGGGGVVLQCMKWGLIPS-FTDKSEKPNYYKMFN

Query:  ARSESIREKASFR-RLVPKSRCLVAVEGFYEWKKDGSKKQPYYVHF-----------------KDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTS
         RS+++ EKA ++  L    RC+V  +GFYEW++  S+KQPYY++F                    R L  A L+D WE P G     E LY++T++T  
Subjt:  ARSESIREKASFR-RLVPKSRCLVAVEGFYEWKKDGSKKQPYYVHF-----------------KDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTS

Query:  SSPALEWLHDRMPVIFGDKERMDMWLNDSSSKFDTVLK-PYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLIS-------KFFSAKETKKETS
        SS  + W+HDRMP I    E +  WL+         LK  +   ++ ++PV+  +     + P+C+  I L  +    +S        +   K  KKE S
Subjt:  SSPALEWLHDRMPVIFGDKERMDMWLNDSSSKFDTVLK-PYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLIS-------KFFSAKETKKETS

Query:  DS
         S
Subjt:  DS

Arabidopsis top hits (e value, % identity, alignment)
AT2G26470.1 unknown protein | e value: 3.1e-103 | % identity: 62.77
Query:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDES-GGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFR
        MCGR RCTL   DVPRA HR   P R L++ R+RP YN +PGS +PV+RRD+E   G GVV+ CMKWGL+PSFT K++KP+++KMFNARSES+ EKASFR
Subjt:  MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDES-GGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFR

Query:  RLVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SS
        RL+PK+RCLVAV+GFYEWKK+GSKKQPYY+HF+DGRPLVFAAL+D W+N  G     E LYTFTILTT+SS AL+WLHDRMPVI GDK+ +D WL+D S+
Subjt:  RLVPKSRCLVAVEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLND-SS

Query:  SKFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCN--TSVKPEPS
        +K   +L PYE  DLVWYPVT ++GKP+FDGP+CI++I LKT  N+LISKFFS K+ K +  D + +++  N    +K EP+
Subjt:  SKFDTVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCN--TSVKPEPS


Sequences
CDS sequence
ATGTGCGGTAGAGCCCGTTGTACTCTCACAGCCGCCGACGTCCCCAGGGCCTGCCACCGCGCCGCTGGCCCCCTCCGTACCCTCAACATCCACCGTTTTCGCCCGCTGTA
TAATGCCTCGCCGGGCTCGGATTTGCCGGTTGTTCGTCGAGATGATGAATCGGGTGGCGGAGGAGTCGTCCTCCAGTGCATGAAATGGGGGCTCATTCCTAGTTTTACTG
ACAAATCCGAGAAACCTAATTACTACAAGATGTTCAATGCTCGTTCAGAGTCCATACGTGAGAAGGCCTCTTTTCGCCGTCTAGTTCCTAAAAGCAGATGCCTTGTGGCA
GTGGAAGGGTTCTATGAGTGGAAAAAGGATGGATCAAAAAAGCAGCCGTATTACGTTCATTTTAAGGATGGGCGGCCACTTGTTTTTGCTGCTTTATATGATCATTGGGA
AAATCCTGAAGGCAAGTTTCTTTTATGTGAATTACTTTACACTTTTACTATTTTGACGACTTCATCATCTCCAGCTTTAGAGTGGTTGCACGATAGGATGCCTGTAATTT
TTGGTGATAAAGAACGGATGGATATGTGGTTGAATGATTCATCTTCCAAGTTTGATACTGTCCTTAAACCGTATGAGGCTCCTGATTTGGTGTGGTACCCTGTAACTCCT
TCCATGGGCAAGCCATCATTTGACGGACCAGACTGCATTAAGGAGATACAGTTAAAGACTGAAGGAAACAACCTCATCTCCAAATTTTTCTCTGCAAAGGAAACTAAGAA
GGAAACTTCAGACTCCCAAGAGGAGACATCTCCCTGCAACACATCTGTGAAGCCCGAGCCATCACAACCTCTGGAAGAACATGAAAGAGATGTAAATCATGGAGCTTCAT
CCTGCTCTGTAGGAGCCGAAGAATCAAAGGATAATCTTGCAAAGTGTTCTTCTGAGAGTGCAGCAACATGCCAAATGAAACGTGACCGTGAAAACATCTCATCTGATTCG
AACATCGGCATCAACGACTATGGCAAGATAGGTAGTAGTTCGAAGATAAGGAAGAAGGGAGGCATGAAGCCTGGTAGTGACAACCAGTCAACTCTGTTTTCATATTTTGG
GAGGAAA
mRNA sequence
ATGTGCGGTAGAGCCCGTTGTACTCTCACAGCCGCCGACGTCCCCAGGGCCTGCCACCGCGCCGCTGGCCCCCTCCGTACCCTCAACATCCACCGTTTTCGCCCGCTGTA
TAATGCCTCGCCGGGCTCGGATTTGCCGGTTGTTCGTCGAGATGATGAATCGGGTGGCGGAGGAGTCGTCCTCCAGTGCATGAAATGGGGGCTCATTCCTAGTTTTACTG
ACAAATCCGAGAAACCTAATTACTACAAGATGTTCAATGCTCGTTCAGAGTCCATACGTGAGAAGGCCTCTTTTCGCCGTCTAGTTCCTAAAAGCAGATGCCTTGTGGCA
GTGGAAGGGTTCTATGAGTGGAAAAAGGATGGATCAAAAAAGCAGCCGTATTACGTTCATTTTAAGGATGGGCGGCCACTTGTTTTTGCTGCTTTATATGATCATTGGGA
AAATCCTGAAGGCAAGTTTCTTTTATGTGAATTACTTTACACTTTTACTATTTTGACGACTTCATCATCTCCAGCTTTAGAGTGGTTGCACGATAGGATGCCTGTAATTT
TTGGTGATAAAGAACGGATGGATATGTGGTTGAATGATTCATCTTCCAAGTTTGATACTGTCCTTAAACCGTATGAGGCTCCTGATTTGGTGTGGTACCCTGTAACTCCT
TCCATGGGCAAGCCATCATTTGACGGACCAGACTGCATTAAGGAGATACAGTTAAAGACTGAAGGAAACAACCTCATCTCCAAATTTTTCTCTGCAAAGGAAACTAAGAA
GGAAACTTCAGACTCCCAAGAGGAGACATCTCCCTGCAACACATCTGTGAAGCCCGAGCCATCACAACCTCTGGAAGAACATGAAAGAGATGTAAATCATGGAGCTTCAT
CCTGCTCTGTAGGAGCCGAAGAATCAAAGGATAATCTTGCAAAGTGTTCTTCTGAGAGTGCAGCAACATGCCAAATGAAACGTGACCGTGAAAACATCTCATCTGATTCG
AACATCGGCATCAACGACTATGGCAAGATAGGTAGTAGTTCGAAGATAAGGAAGAAGGGAGGCATGAAGCCTGGTAGTGACAACCAGTCAACTCTGTTTTCATATTTTGG
GAGGAAA
Protein sequence
MCGRARCTLTAADVPRACHRAAGPLRTLNIHRFRPLYNASPGSDLPVVRRDDESGGGGVVLQCMKWGLIPSFTDKSEKPNYYKMFNARSESIREKASFRRLVPKSRCLVA
VEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDHWENPEGKFLLCELLYTFTILTTSSSPALEWLHDRMPVIFGDKERMDMWLNDSSSKFDTVLKPYEAPDLVWYPVTP
SMGKPSFDGPDCIKEIQLKTEGNNLISKFFSAKETKKETSDSQEETSPCNTSVKPEPSQPLEEHERDVNHGASSCSVGAEESKDNLAKCSSESAATCQMKRDRENISSDS
NIGINDYGKIGSSSKIRKKGGMKPGSDNQSTLFSYFGRK
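
The protein sequence listed above should correspond to a codon-by-codon translation of the CDS sequence. The sketch below shows one way to check that with the standard genetic code, using only the Python standard library. The helper name `translate` is hypothetical, and the use of the standard (nuclear) codon table is an assumption, since the page does not name the table it used.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; unknown codons become 'X', stops '*'."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First ten codons of the CDS shown on this page.
print(translate("ATGTGCGGTAGAGCCCGTTGTACTCTCACA"))  # MCGRARCTLT
```

Passing the full CDS (with line breaks left in) to `translate` and comparing the result with the protein sequence, after joining its lines, is a quick consistency check between the two fields.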