; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g1289 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g1289
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionDRMBL domain-containing protein
Genome locationMC02:12735682..12762671
RNA-Seq ExpressionMC02g1289
SyntenyMC02g1289
Gene Ontology termsGO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0031848 - protection from non-homologous end joining at telomere (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domainsIPR011084 - DNA repair metallo-beta-lactamase
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035278.1 5' exonuclease Apollo, partial [Cucurbita argyrosperma subsp. argyrosperma]0.080.48Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE MKAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE+  TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWV STTPGCRAMDLDYVKKK SC+SLTSNGLIWKLFG+ EESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+EM
Subjt:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS
        L++LSSSNLPPLTLFGRARL A++A++L EEV YPST N EPVEAVG KV DLSIHDAN G+ SD+ S++S NEVNS+ K K FAND  LL D  AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS

Query:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCC-NSHLLSVGSSKGFNDKFRKLYRSMNV
        +R  LH SEVKV SMN+ +PP+ V S VEEL++H Q+ R KGN+SL DCEDVG++PETH GKL+ DDRI  C  NSH LSVGSSKGFND+FRKLYRSMNV
Subjt:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCC-NSHLLSVGSSKGFNDKFRKLYRSMNV

Query:  PVPKPLPSLVELMKSRKRAKKNAYF
         VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  PVPKPLPSLVELMKSRKRAKKNAYF

XP_022156681.1 protein artemis [Momordica charantia]0.098.73Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG-
        MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG-

Query:  -------NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
               NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
Subjt:  -------NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI
        QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI
Subjt:  QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL
        WHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL
Subjt:  WHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL

Query:  YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDGLLVDN
        YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDGLLVDN
Subjt:  YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDGLLVDN

Query:  NASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR
        NASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR
Subjt:  NASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR

Query:  SMNVPVPKPLPSLVELMKSRKRAKKNAYF
        SMNVPVPKPLPSLVELMKSRKRAKKNAYF
Subjt:  SMNVPVPKPLPSLVELMKSRKRAKKNAYF

XP_022948240.1 uncharacterized protein LOC111451874 isoform X2 [Cucurbita moschata]0.079.81Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE+  TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWV STTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+E 
Subjt:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS
        L++LS SNLPPLTLFGRARL  K+A++L EEV YPST N EPVEAVG KV DLSIHDAN G+ SD+ S++S+NEVNS+ KH+KFAND  LL D +AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS

Query:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP
        +R RLH SEV+V SMN+ +PP+ V S VEEL++H Q+ R KG++SL DCEDV ++P+TH GKL+ DDR+ V  NSH+LSVGSSKGFND+FRKLYRSMNV 
Subjt:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP

Query:  VPKPLPSLVELMKSRKRAKKNAYF
        VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  VPKPLPSLVELMKSRKRAKKNAYF

XP_023007140.1 uncharacterized protein LOC111499723 isoform X2 [Cucurbita maxima]0.079.65Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE   TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWV STTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+E 
Subjt:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS
        L++LS SNLPPLTLFGRARL A++A++LLEEV YPS  N EPVEAVG KV DLSIHDAN G+ SD+ S++S+NEVNS+ KH+KFAN   LL D  AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS

Query:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP
        +R RLH+SEVKV SMN+ +PP+ V S VEEL+ H Q+ R KGN+SL DCEDV ++PET  GKL+ DDRI  C NSH+LSVGSSKGFN +FRKLYRSMNV 
Subjt:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP

Query:  VPKPLPSLVELMKSRKRAKKNAYF
        VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  VPKPLPSLVELMKSRKRAKKNAYF

XP_023532364.1 uncharacterized protein LOC111794563 isoform X2 [Cucurbita pepo subsp. pepo]0.079.78Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDPDG FTVTV DAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE+  TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWV STTPGCRAMDLDYVKKK SCSSLTSNGLIW+LFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+E 
Subjt:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS
        L++LSSSNLPPLTLFGRARL  K+A++L EEV YPST N EPVEAVG KV DLSIHDAN G+ SD+ S++S+NE+NS+ KH+KFAN+  LL D +AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS

Query:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP
        +  RLH SEVKV SMN+ +PP+ V S VEEL++H Q+ R  GN+SL DCEDV ++PETH GKL+ DDRI  C NSH+LSVGSSKGFND+FRKLYRSMNV 
Subjt:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP

Query:  VPKPLPSLVELMKSRKRAKKNAY
        VP+PLPSLVELMKSRKRAK+NAY
Subjt:  VPKPLPSLVELMKSRKRAKKNAY

TrEMBL top hitse value%identityAlignment
A0A6J1DVN9 protein artemis0.098.73Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG-
        MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG-

Query:  -------NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
               NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
Subjt:  -------NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI
        QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI
Subjt:  QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL
        WHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL
Subjt:  WHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL

Query:  YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDGLLVDN
        YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDGLLVDN
Subjt:  YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDGLLVDN

Query:  NASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR
        NASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR
Subjt:  NASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR

Query:  SMNVPVPKPLPSLVELMKSRKRAKKNAYF
        SMNVPVPKPLPSLVELMKSRKRAKKNAYF
Subjt:  SMNVPVPKPLPSLVELMKSRKRAKKNAYF

A0A6J1G8U0 uncharacterized protein LOC111451874 isoform X10.078.8Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  --------NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
                NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  --------NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHG
        SQTFGSKIFVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE+  TRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVK
        IWHVCYSMHSSKEELEWALQIL PKWV STTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVK

Query:  LYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLV
        LYA P+E L++LS SNLPPLTLFGRARL  K+A++L EEV YPST N EPVEAVG KV DLSIHDAN G+ SD+ S++S+NEVNS+ KH+KFAND  LL 
Subjt:  LYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLV

Query:  DNNASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRK
        D +AS  S+R RLH SEV+V SMN+ +PP+ V S VEEL++H Q+ R KG++SL DCEDV ++P+TH GKL+ DDR+ V  NSH+LSVGSSKGFND+FRK
Subjt:  DNNASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRK

Query:  LYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF
        LYRSMNV VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  LYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF

A0A6J1G997 uncharacterized protein LOC111451874 isoform X20.079.81Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE+  TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWV STTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+E 
Subjt:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS
        L++LS SNLPPLTLFGRARL  K+A++L EEV YPST N EPVEAVG KV DLSIHDAN G+ SD+ S++S+NEVNS+ KH+KFAND  LL D +AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS

Query:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP
        +R RLH SEV+V SMN+ +PP+ V S VEEL++H Q+ R KG++SL DCEDV ++P+TH GKL+ DDR+ V  NSH+LSVGSSKGFND+FRKLYRSMNV 
Subjt:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP

Query:  VPKPLPSLVELMKSRKRAKKNAYF
        VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  VPKPLPSLVELMKSRKRAKKNAYF

A0A6J1L262 uncharacterized protein LOC111499723 isoform X20.079.65Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE   TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWV STTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+E 
Subjt:  HSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS
        L++LS SNLPPLTLFGRARL A++A++LLEEV YPS  N EPVEAVG KV DLSIHDAN G+ SD+ S++S+NEVNS+ KH+KFAN   LL D  AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLVDNNASISS

Query:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP
        +R RLH+SEVKV SMN+ +PP+ V S VEEL+ H Q+ R KGN+SL DCEDV ++PET  GKL+ DDRI  C NSH+LSVGSSKGFN +FRKLYRSMNV 
Subjt:  ERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP

Query:  VPKPLPSLVELMKSRKRAKKNAYF
        VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  VPKPLPSLVELMKSRKRAKKNAYF

A0A6J1L450 protein artemis isoform X10.078.64Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVP-HSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  --------NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
                NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  --------NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHG
        SQTFGSKIFVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE   TRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVK
        IWHVCYSMHSSKEELEWALQIL PKWV STTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVK

Query:  LYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLV
        LYA P+E L++LS SNLPPLTLFGRARL A++A++LLEEV YPS  N EPVEAVG KV DLSIHDAN G+ SD+ S++S+NEVNS+ KH+KFAN   LL 
Subjt:  LYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDG-LLV

Query:  DNNASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRK
        D  AS  S+R RLH+SEVKV SMN+ +PP+ V S VEEL+ H Q+ R KGN+SL DCEDV ++PET  GKL+ DDRI  C NSH+LSVGSSKGFN +FRK
Subjt:  DNNASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRK

Query:  LYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF
        LYRSMNV VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  LYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF

SwissProt top hitse value%identityAlignment
Q6PJP8 DNA cross-link repair 1A protein2.7e-1627.48Show/hide
Query:  HFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG---------NFGNILHTGDCRLTPEC
        +FLTH H DH AG+  H +FP+Y + +T  + L++   +++     + +    +V        V + DA+HCPG         N   ILHTGD R  P  
Subjt:  HFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG---------NFGNILHTGDCRLTPEC

Query:  LQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDA-PLVYLICNL--LGQEDILQQVSQTFGSKIFVDESMKAGYKALE
         +SL    +          + +++LD T+      FPS+   I   IN  ++     P   ++C    +G+E +   ++   GSK+ + +     YK L+
Subjt:  LQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDA-PLVYLICNL--LGQEDILQQVSQTFGSKIFVDESMKAGYKALE

Query:  LID-PEI---LTQDP-SSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSMHSSKEELEW
         ++ PEI   +T D  SS  HLL       +  +  L+     + ++ L  RP T W  H   S  FT    +I +  K    I+ + YS HSS  E++ 
Subjt:  LID-PEI---LTQDP-SSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSMHSSKEELEW

Query:  ALQILGPKWVFST
         +Q L P+ +  T
Subjt:  ALQILGPKWVFST

Q9H816 5' exonuclease Apollo4.5e-1929.59Show/hide
Query:  PFSVDTWT-PSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVK-DPDG--AFTVTVFDAHHCPGN-----
        P +VD W+   +      FL+H H DHT G+    + P+Y + +T  ++ +H  Q+       +E+G+S V+  D  G    TVT+ DA+HCPG+     
Subjt:  PFSVDTWT-PSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVK-DPDG--AFTVTVFDAHHCPGN-----

Query:  ---FGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGS
           FG IL+TGD R TP  L+  P    GK       Q+  ++LD T        PSR  + HQI+  I KHP   +   + + LG+E +L+Q++  F +
Subjt:  ---FGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGS

Query:  KIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQ
         + +        + L L D     ++ + R H +    ++C     +LR  QT   H  + I P+++
Subjt:  KIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQ

Q9JIC3 DNA cross-link repair 1A protein6.3e-1326.2Show/hide
Query:  HFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG---------NFGNILHTGDCRLTPEC
        +FLTH H DH AG+    + P+Y + +T  ++ +   ++++     + +    VV     +  V + DA+HCPG         N   ILHTGD R  P  
Subjt:  HFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG---------NFGNILHTGDCRLTPEC

Query:  LQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDA-PLVYLICN--LLGQEDILQQVSQTFGSKIFVDESMKAGYKALE
         +S   +  G+       ++  +FLD T+      FPS+   I   IN  ++     P   ++C    +G+E +   ++   GSK+ + +     YK L+
Subjt:  LQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDA-PLVYLICN--LLGQEDILQQVSQTFGSKIFVDESMKAGYKALE

Query:  LID-PE----ILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSMHSSKEELEW
         ++ PE    I T    S  HLL       +  +  L+     +  + L  RP T W  H   S   T+   II +  +    I+ + YS HSS  E++ 
Subjt:  LID-PE----ILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSMHSSKEELEW

Query:  ALQILGPKWVFST
         +Q L P+ +  T
Subjt:  ALQILGPKWVFST

Arabidopsis top hitse value%identityAlignment
AT1G19025.1 DNA repair metallo-beta-lactamase family protein7.9e-13643.41Show/hide
Query:  MPIEMPQGLPFSVDT---WTPSSKQKHHHFLTHAHRDHTAGIVPHS--SFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAH
        M IEMP+GLPF+VDT   +T + ++K HHFLTHAH+DHT G+ P +   FPIYSTSLT  ++LQ FPQ+++S FV +EIGQS++V DPDG F VT FDA+
Subjt:  MPIEMPQGLPFSVDT---WTPSSKQKHHHFLTHAHRDHTAGIVPHS--SFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAH

Query:  HCP--------GNFGNILHTGDCRLTPECLQSLPEKYRGKS-GKEPRCQLDLIFLDCTFGR--FFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQ
        HCP        G+FGNILHTGDCRLT +CL SLPEKY G+S G +P+C L  IFLDCTFG+    Q+FP++HS+I QIINCIW HPDAP+VYL C++LGQ
Subjt:  HCP--------GNFGNILHTGDCRLTPECLQSLPEKYRGKS-GKEPRCQLDLIFLDCTFGR--FFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQ

Query:  EDILQQVSQTFGSKIFVDESMKAG-YKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSE-----VFTTR
        ED+L +VS+TFGSKI+VD++     +++L +I PEI+++DPSSRFH+  GFPKL +R    L +A++  Q EPLIIRPS QWYV ++  +     +   R
Subjt:  EDILQQVSQTFGSKIFVDESMKAG-YKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSE-----VFTTR

Query:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLDVSTMEVGCSPMVE
        K   SEA+KD+ G+WHVCYSMHSS+ ELE A+Q+L PKWV ST P CRAM+L+YVKK    S  + +   WKL  I  E+S  +  D  T+ + C  M E
Subjt:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLDVSTMEVGCSPMVE

Query:  APAQINIDPQLQPVKLYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNS
             +   +L+PV   +  K+ L  LS  N  P+TLFGRAR  ++E D L E                  KVI           + +            
Subjt:  APAQINIDPQLQPVKLYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNS

Query:  EEKHKKFANDGLLVDNNASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLS
                                 +L+V  V     +DT                         E  V+ E                  T+   S   S
Subjt:  EEKHKKFANDGLLVDNNASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLS

Query:  VGSSKGFNDKFRKLYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF
          + K  +   RKLYRSMN PVP+PLPSL+ELM +RKR++ +  F
Subjt:  VGSSKGFNDKFRKLYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGATCGAAATGCCCCAAGGCCTGCCATTCTCGGTGGATACATGGACTCCATCTTCCAAGCAGAAGCACCACCATTTTCTAACGCACGCTCACAGGGATCACACCGC
CGGGATTGTCCCCCATTCTTCCTTCCCAATTTATTCTACTTCTCTCACCAAAATCATCGTTCTTCAGCACTTCCCTCAGATTGAGGATTCGTTGTTTGTCTGTATCGAGA
TCGGGCAATCGCTGGTAGTCAAAGATCCTGATGGTGCTTTCACTGTCACAGTTTTCGACGCTCATCACTGTCCTGGCAATTTTGGCAATATTCTGCATACGGGCGATTGC
AGACTAACTCCTGAGTGTCTACAGAGCTTACCGGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCTAGATGTCAACTGGATTTGATTTTTCTAGACTGCACATTTGGTAG
ATTCTTTCAACAATTCCCCAGCAGGCATTCATCAATACATCAGATTATTAATTGCATATGGAAACATCCCGATGCTCCGTTAGTGTATCTGATTTGCAATCTTCTAGGAC
AGGAAGATATATTGCAACAAGTGTCCCAAACATTTGGTTCAAAGATTTTTGTTGATGAATCCATGAAAGCAGGTTACAAGGCTCTTGAACTTATAGATCCTGAGATCCTC
ACTCAAGATCCATCCTCCCGTTTTCATTTGCTTGGTGGATTCCCTAAACTATGTCAAAGAGCAAAACCACTGCTTCGGGATGCCCAGACCAATTTTCAGCATGAACCTCT
CATAATCCGCCCTTCGACCCAGTGGTATGTTCATGAGGAACTGTCAGAGGTTTTCACCACAAGGAAACAGATAATTAGTGAAGCAATCAAAGATCAGCATGGTATTTGGC
ATGTCTGTTACTCGATGCACTCGTCGAAGGAAGAACTTGAATGGGCCTTGCAAATTTTAGGCCCTAAGTGGGTTTTTTCGACCACTCCTGGTTGTCGTGCCATGGATTTG
GATTACGTGAAGAAGAAACTCAGTTGCTCTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTATAGCTGAGGAAAGTTCTTCAGATTTAGATGTTTCAACGAT
GGAAGTGGGCTGTTCCCCTATGGTTGAAGCACCCGCTCAAATCAACATAGATCCTCAACTGCAGCCCGTGAAACTGTATGCTTTCCCTAAAGAAATGCTGAATGTTTTGT
CTTCAAGCAACTTGCCACCTCTTACATTATTTGGACGAGCTAGACTTGGCGCAAAAGAGGCTGATTTGTTGCTGGAAGAAGTTCCATATCCGTCTACAGGGAATGATGAG
CCTGTAGAAGCAGTTGGAGGTAAGGTAATAGACTTGTCAATTCATGATGCAAATAATGGTCAATTGAGCGACGAATCATCAGAAAATTCTGAAAACGAAGTTAATTCTGA
AGAGAAGCACAAGAAGTTTGCAAATGATGGGTTATTAGTTGACAATAACGCCTCTATTAGCTCTGAACGGGTTAGGCTCCATGTTTCTGAAGTAAAAGTCACGTCCATGA
ATGACACTCATCCACCAAAACCAGTAGGCAGTAACGTAGAAGAACTTTATATCCATGTGCAAAAAGGTAGAGTGAAGGGAAATGAGTCGTTAGTTGATTGTGAAGATGTT
GGTAGTATTCCCGAAACACATGCTGGGAAGTTAATAGATGACAGGATAACAGTCTGTTGTAATTCACATCTTTTAAGTGTTGGATCTTCAAAGGGTTTCAATGACAAGTT
TAGAAAGTTGTACAGGTCGATGAACGTTCCTGTGCCCAAGCCTCTTCCTTCATTAGTAGAACTCATGAAATCGAGAAAACGCGCAAAGAAGAATGCATACTTCTAG
mRNA sequenceShow/hide mRNA sequence
GGAAGCCTCGAGAGAGAACCATGAATTTTCTGATTAGGGCTCAAATTCAATTCTTTCGCTTTCGTCTTCCTCTGCCATCGAAATTCCCCCACAGAATCCAGAGATGCCGA
TCGAAATGCCCCAAGGCCTGCCATTCTCGGTGGATACATGGACTCCATCTTCCAAGCAGAAGCACCACCATTTTCTAACGCACGCTCACAGGGATCACACCGCCGGGATT
GTCCCCCATTCTTCCTTCCCAATTTATTCTACTTCTCTCACCAAAATCATCGTTCTTCAGCACTTCCCTCAGATTGAGGATTCGTTGTTTGTCTGTATCGAGATCGGGCA
ATCGCTGGTAGTCAAAGATCCTGATGGTGCTTTCACTGTCACAGTTTTCGACGCTCATCACTGTCCTGGCAATTTTGGCAATATTCTGCATACGGGCGATTGCAGACTAA
CTCCTGAGTGTCTACAGAGCTTACCGGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCTAGATGTCAACTGGATTTGATTTTTCTAGACTGCACATTTGGTAGATTCTTT
CAACAATTCCCCAGCAGGCATTCATCAATACATCAGATTATTAATTGCATATGGAAACATCCCGATGCTCCGTTAGTGTATCTGATTTGCAATCTTCTAGGACAGGAAGA
TATATTGCAACAAGTGTCCCAAACATTTGGTTCAAAGATTTTTGTTGATGAATCCATGAAAGCAGGTTACAAGGCTCTTGAACTTATAGATCCTGAGATCCTCACTCAAG
ATCCATCCTCCCGTTTTCATTTGCTTGGTGGATTCCCTAAACTATGTCAAAGAGCAAAACCACTGCTTCGGGATGCCCAGACCAATTTTCAGCATGAACCTCTCATAATC
CGCCCTTCGACCCAGTGGTATGTTCATGAGGAACTGTCAGAGGTTTTCACCACAAGGAAACAGATAATTAGTGAAGCAATCAAAGATCAGCATGGTATTTGGCATGTCTG
TTACTCGATGCACTCGTCGAAGGAAGAACTTGAATGGGCCTTGCAAATTTTAGGCCCTAAGTGGGTTTTTTCGACCACTCCTGGTTGTCGTGCCATGGATTTGGATTACG
TGAAGAAGAAACTCAGTTGCTCTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTATAGCTGAGGAAAGTTCTTCAGATTTAGATGTTTCAACGATGGAAGTG
GGCTGTTCCCCTATGGTTGAAGCACCCGCTCAAATCAACATAGATCCTCAACTGCAGCCCGTGAAACTGTATGCTTTCCCTAAAGAAATGCTGAATGTTTTGTCTTCAAG
CAACTTGCCACCTCTTACATTATTTGGACGAGCTAGACTTGGCGCAAAAGAGGCTGATTTGTTGCTGGAAGAAGTTCCATATCCGTCTACAGGGAATGATGAGCCTGTAG
AAGCAGTTGGAGGTAAGGTAATAGACTTGTCAATTCATGATGCAAATAATGGTCAATTGAGCGACGAATCATCAGAAAATTCTGAAAACGAAGTTAATTCTGAAGAGAAG
CACAAGAAGTTTGCAAATGATGGGTTATTAGTTGACAATAACGCCTCTATTAGCTCTGAACGGGTTAGGCTCCATGTTTCTGAAGTAAAAGTCACGTCCATGAATGACAC
TCATCCACCAAAACCAGTAGGCAGTAACGTAGAAGAACTTTATATCCATGTGCAAAAAGGTAGAGTGAAGGGAAATGAGTCGTTAGTTGATTGTGAAGATGTTGGTAGTA
TTCCCGAAACACATGCTGGGAAGTTAATAGATGACAGGATAACAGTCTGTTGTAATTCACATCTTTTAAGTGTTGGATCTTCAAAGGGTTTCAATGACAAGTTTAGAAAG
TTGTACAGGTCGATGAACGTTCCTGTGCCCAAGCCTCTTCCTTCATTAGTAGAACTCATGAAATCGAGAAAACGCGCAAAGAAGAATGCATACTTCTAGTCAGAACAGGC
CAAGCAAGAAAGGAAAACAACGACCTTACTTGAACAGGAACTGTTCTACAACAGCAAGGAAAACAGCATATATTACTCATTGGAGGCTTGCCGAAGTGATAAGTAGGAAT
GAGCATCTAAACCCTGCAAAAAGGGGAAAAAATTTCAAAGATTCGATGTTCAAAGCGATGGAAGAGAGAGAACCCATAAGTGAGTGCGGCTCCTTTAACGGCGGCAAGGA
GCATTCGGTGGTAGGGTTGTTGCTTATTCGGCGAATCGATGGTGATCACTTTCTGAAGATGATGATGAGGTTATTGAATAAAAAATTAAAGCTTTCTTGGATGAAAATGC
GTTGAGTATATCCTTTACTGCATTGGGTCTTTGATGTGGCTTAGGAAGAATAAGCCGAATACTCTAATATCCATAGTTTACATCACAAGGTATACAACATCAAAATTGGA
GAAATGGTCACTTATCTTATTTACCTGTTTTTAACTATTTTGAGCATTAAAATGCATTTAAAGTAAGTTATTCTAGTTAATTCAAAACTTGTGATATAGTTACCTATTTA
GATTAGCCTTACATCAAGTTTGAAATATATGTTGCGTTATATTTGTGAATTGATTTGTCATTTTGAAATAAAATGAAATGGCTTCAAATACGGAATTCCTCTAGGGCAAA
ATATAATTTTGTTAATGTTGATTAATTAGTAAGGATGTTTATTTGCTGATTTTGATAATGAAATCCTTCACTCAATTGTAAAGGATTTTTCTAGTGACAAAACATATAGC
AAACTAATTTCTATGCATACGCTGATTCTAGCCGCACAACATTTTTTTGTTATTCTTTTTGTTACGTTTACTTCTTCATCGTTGCTTTTAATAGCAGTATAGCAAATATT
CTCATTTTTTGTATGTGTCAATCATCACTTACCGCTGTAAGAAGAATATGACATTGATGTTTGTTTGACACTAGGGGTGATGTTTTTGCCTAATGTATTCCTATTGATAC
TTCTTTCTCTTCCATTAATACATATTTGTGTGGCTATCCTGCAGCAAAGATAGCCCATTAAGACTTAATGTTTATATGTGTCATGCATATAATATAAAGTTTAGTTGGAT
AATTGTGTAGTTTAAAGTTTAGTTGGATGATTATGTTGCTATCCTGTAGTAAAGATAGCCAATTAAGACTTAGTGTTTATATTTGTGTCATTACGTAATGCATGTTGGAG
TTAAAAAGAAAATGTGGGAGTTAATATTAATATTTTCTTAATTATTCGTTTTATTTTTTTGGAGGAGTAACTTGCGGCAGCAACATTTTGAGAAAGGAGAATGTATAGGG
AAGAATTTCGGCGTATTTCGACATGTACAAGGTTCATGTAATGTGAATGTAGTTTGTTGGGCATTCTCTTATTGTAGTCATAGATAGTATAGTATTTGTATAAAATATCC
TAATGTACGAATTTGATATGACTTTGATCTAGATAATTTTTTTGAAACGTGTACAGTGTTATAAAATGTATTTCATGGCTGTTATGAAGTTATCATATATAAACTTTTAT
TGATATTACAG
Protein sequenceShow/hide protein sequence
MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPGNFGNILHTGDC
RLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKIFVDESMKAGYKALELIDPEIL
TQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILGPKWVFSTTPGCRAMDL
DYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDE
PVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHKKFANDGLLVDNNASISSERVRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDV
GSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF