CuGenDBv2

MS004496 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS004496
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: DRMBL domain-containing protein
Genome location: scaffold1189:29382..36863
RNA-Seq Expression: MS004496
Synteny: MS004496
Gene Ontology terms:
  GO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
  GO:0031848 - protection from non-homologous end joining at telomere (biological process)
  GO:0036297 - interstrand cross-link repair (biological process)
  GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0003684 - damaged DNA binding (molecular function)
  GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domains:
  IPR011084 - DNA repair metallo-beta-lactamase
  IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like
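
The genome location in this record is written as `scaffold:start..end`. A minimal Python sketch for splitting such a string into its parts follows. The names `Location` and `parse_location` are hypothetical helpers, not part of CuGenDB, and the span length assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; the field names are illustrative."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "scaffold1189:29382..36863".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("scaffold1189:29382..36863")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 7482 bp on scaffold1189.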


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7035278.1 5' exonuclease Apollo, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 8.9e-291 | identity: 80.64%
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE MKAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE+  TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWVVSTTPGCRAMDLDYVKKK SC+SLTSNGLIWKLFG+ EESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+EM
Subjt:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS
        L++LSSSNLPPLTLFGRARL A++A++L EEV YPST N EPVEAVG KV DLSIHDA NG+ SD+ S++S NEVNS+ K  KFAND  LL D  AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS

Query:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVC-CNSHLLSVGSSKGFNDKFRKLYRSMNV
        +R  LH SEVKV SMN+ +PP+ V S VEEL++H Q+ R KGN+SL DCEDVG++PETH GKL+ DDRI  C  NSH LSVGSSKGFND+FRKLYRSMNV
Subjt:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVC-CNSHLLSVGSSKGFNDKFRKLYRSMNV

Query:  PVPKPLPSLVELMKSRKRAKKNAYF
         VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  PVPKPLPSLVELMKSRKRAKKNAYF

XP_022156681.1 protein artemis [Momordica charantia] | e-value: 0.0e+00 | identity: 98.25%
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCP--
        MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCP  
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCP--

Query:  ------GNFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
              GNFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
Subjt:  ------GNFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI
        QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI
Subjt:  QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL
        WHVCYSMHSSKEELEWALQILGPKWV STTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL
Subjt:  WHVCYSMHSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL

Query:  YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDGLLVDN
        YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKH KFANDGLLVDN
Subjt:  YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDGLLVDN

Query:  NASISSERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR
        NASISSER RLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR
Subjt:  NASISSERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR

Query:  SMNVPVPKPLPSLVELMKSRKRAKKNAYF
        SMNVPVPKPLPSLVELMKSRKRAKKNAYF
Subjt:  SMNVPVPKPLPSLVELMKSRKRAKKNAYF

XP_022948240.1 uncharacterized protein LOC111451874 isoform X2 [Cucurbita moschata] | e-value: 1.2e-290 | identity: 79.97%
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE+  TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+E 
Subjt:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS
        L++LS SNLPPLTLFGRARL  K+A++L EEV YPST N EPVEAVG KV DLSIHDA NG+ SD+ S++S+NEVNS+ KH KFAND  LL D +AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS

Query:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP
        +R RLH SEV+V SMN+ +PP+ V S VEEL++H Q+ R KG++SL DCEDV ++P+TH GKL+ DDR+ V  NSH+LSVGSSKGFND+FRKLYRSMNV 
Subjt:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP

Query:  VPKPLPSLVELMKSRKRAKKNAYF
        VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  VPKPLPSLVELMKSRKRAKKNAYF

XP_023007140.1 uncharacterized protein LOC111499723 isoform X2 [Cucurbita maxima] | e-value: 3.8e-289 | identity: 79.81%
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE   TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+E 
Subjt:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS
        L++LS SNLPPLTLFGRARL A++A++LLEEV YPS  N EPVEAVG KV DLSIHDA NG+ SD+ S++S+NEVNS+ KH KFAN   LL D  AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS

Query:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP
        +R RLH+SEVKV SMN+ +PP+ V S VEEL+ H Q+ R KGN+SL DCEDV ++PET  GKL+ DDRI  C NSH+LSVGSSKGFN +FRKLYRSMNV 
Subjt:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP

Query:  VPKPLPSLVELMKSRKRAKKNAYF
        VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  VPKPLPSLVELMKSRKRAKKNAYF

XP_023532364.1 uncharacterized protein LOC111794563 isoform X2 [Cucurbita pepo subsp. pepo] | e-value: 5.2e-291 | identity: 79.94%
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDPDG FTVTV DAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE+  TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWVVSTTPGCRAMDLDYVKKK SCSSLTSNGLIW+LFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+E 
Subjt:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS
        L++LSSSNLPPLTLFGRARL  K+A++L EEV YPST N EPVEAVG KV DLSIHDA NG+ SD+ S++S+NE+NS+ KH KFAN+  LL D +AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS

Query:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP
        +  RLH SEVKV SMN+ +PP+ V S VEEL++H Q+ R  GN+SL DCEDV ++PETH GKL+ DDRI  C NSH+LSVGSSKGFND+FRKLYRSMNV 
Subjt:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP

Query:  VPKPLPSLVELMKSRKRAKKNAY
        VP+PLPSLVELMKSRKRAK+NAY
Subjt:  VPKPLPSLVELMKSRKRAKKNAY
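
The % identity column can be related to the alignment text: in BLAST-style output the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. A minimal Python sketch, assuming that convention and taking the denominator to be the number of alignment columns (gaps included), is given below. The function name `percent_identity` is hypothetical, and the sample strings are the last block of the XP_023532364.1 alignment with the leading padding trimmed.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity over one alignment block.

    Assumes a BLAST-style midline: a letter marks an identical pair,
    '+' a conservative substitution, and a space a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    # A column counts as identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return 100.0 * identical / len(query)


query = "VPKPLPSLVELMKSRKRAKKNAY"
midline = "VP+PLPSLVELMKSRKRAK+NAY"
print(f"{percent_identity(query, midline):.2f}")
```

This covers a single block only; the value reported for a hit is computed over the whole alignment, so the per-block figure will generally differ from the table entry.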

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1DVN9 protein artemis | e-value: 0.0e+00 | identity: 98.25%
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCP--
        MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCP  
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCP--

Query:  ------GNFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
              GNFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
Subjt:  ------GNFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI
        QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI
Subjt:  QTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL
        WHVCYSMHSSKEELEWALQILGPKWV STTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL
Subjt:  WHVCYSMHSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKL

Query:  YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDGLLVDN
        YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKH KFANDGLLVDN
Subjt:  YAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDGLLVDN

Query:  NASISSERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR
        NASISSER RLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR
Subjt:  NASISSERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYR

Query:  SMNVPVPKPLPSLVELMKSRKRAKKNAYF
        SMNVPVPKPLPSLVELMKSRKRAKKNAYF
Subjt:  SMNVPVPKPLPSLVELMKSRKRAKKNAYF

A0A6J1G8U0 uncharacterized protein LOC111451874 isoform X1 | e-value: 9.0e-289 | identity: 78.96%
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCP-
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDPDG FTVTVFDAHHCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCP-

Query:  -------GNFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
               GNFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  -------GNFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHG
        SQTFGSKIFVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE+  TRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVK
        IWHVCYSMHSSKEELEWALQIL PKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVK

Query:  LYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLV
        LYA P+E L++LS SNLPPLTLFGRARL  K+A++L EEV YPST N EPVEAVG KV DLSIHDA NG+ SD+ S++S+NEVNS+ KH KFAND  LL 
Subjt:  LYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLV

Query:  DNNASISSERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRK
        D +AS  S+R RLH SEV+V SMN+ +PP+ V S VEEL++H Q+ R KG++SL DCEDV ++P+TH GKL+ DDR+ V  NSH+LSVGSSKGFND+FRK
Subjt:  DNNASISSERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRK

Query:  LYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF
        LYRSMNV VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  LYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF

A0A6J1G997 uncharacterized protein LOC111451874 isoform X2 | e-value: 5.6e-291 | identity: 79.97%
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE+  TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+E 
Subjt:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS
        L++LS SNLPPLTLFGRARL  K+A++L EEV YPST N EPVEAVG KV DLSIHDA NG+ SD+ S++S+NEVNS+ KH KFAND  LL D +AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS

Query:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP
        +R RLH SEV+V SMN+ +PP+ V S VEEL++H Q+ R KG++SL DCEDV ++P+TH GKL+ DDR+ V  NSH+LSVGSSKGFND+FRKLYRSMNV 
Subjt:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP

Query:  VPKPLPSLVELMKSRKRAKKNAYF
        VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  VPKPLPSLVELMKSRKRAKKNAYF

A0A6J1L262 uncharacterized protein LOC111499723 isoform X2 | e-value: 1.8e-289 | identity: 79.81%
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKI

Query:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM
        FVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE   TRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM
        HSSKEELEWALQIL PKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP KLYA P+E 
Subjt:  HSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEM

Query:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS
        L++LS SNLPPLTLFGRARL A++A++LLEEV YPS  N EPVEAVG KV DLSIHDA NG+ SD+ S++S+NEVNS+ KH KFAN   LL D  AS  S
Subjt:  LNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLVDNNASISS

Query:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP
        +R RLH+SEVKV SMN+ +PP+ V S VEEL+ H Q+ R KGN+SL DCEDV ++PET  GKL+ DDRI  C NSH+LSVGSSKGFN +FRKLYRSMNV 
Subjt:  ERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVP

Query:  VPKPLPSLVELMKSRKRAKKNAYF
        VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  VPKPLPSLVELMKSRKRAKKNAYF

A0A6J1L450 protein artemis isoform X1 | e-value: 2.9e-287 | identity: 78.8%
Query:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCP-
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDP+G FTVTVFDAHHCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGI-VPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCP-

Query:  -------GNFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
               GNFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  -------GNFGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHG
        SQTFGSKIFVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE   TRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVK
        IWHVCYSMHSSKEELEWALQIL PKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLD S +EV CSP+VE     ++DPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVK

Query:  LYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLV
        LYA P+E L++LS SNLPPLTLFGRARL A++A++LLEEV YPS  N EPVEAVG KV DLSIHDA NG+ SD+ S++S+NEVNS+ KH KFAN   LL 
Subjt:  LYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDG-LLV

Query:  DNNASISSERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRK
        D  AS  S+R RLH+SEVKV SMN+ +PP+ V S VEEL+ H Q+ R KGN+SL DCEDV ++PET  GKL+ DDRI  C NSH+LSVGSSKGFN +FRK
Subjt:  DNNASISSERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLI-DDRITVCCNSHLLSVGSSKGFNDKFRK

Query:  LYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF
        LYRSMNV VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  LYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF

SwissProt top hits (e-value, % identity, alignment)
Q6PJP8 DNA cross-link repair 1A protein | e-value: 1.2e-16 | identity: 27.48%
Query:  HFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG---------NFGNILHTGDCRLTPEC
        +FLTH H DH AG+  H +FP+Y + +T  + L++   +++     + +    +V        V + DA+HCPG         N   ILHTGD R  P  
Subjt:  HFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG---------NFGNILHTGDCRLTPEC

Query:  LQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDA-PLVYLICNL--LGQEDILQQVSQTFGSKIFVDESMKAGYKALE
         +SL    +          + +++LD T+      FPS+   I   IN  ++     P   ++C    +G+E +   ++   GSK+ + +     YK L+
Subjt:  LQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDA-PLVYLICNL--LGQEDILQQVSQTFGSKIFVDESMKAGYKALE

Query:  LID-PEI---LTQDP-SSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSMHSSKEELEW
         ++ PEI   +T D  SS  HLL       +  +  L+     + ++ L  RP T W  H   S  FT    +I +  K    I+ + YS HSS  E++ 
Subjt:  LID-PEI---LTQDP-SSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSMHSSKEELEW

Query:  ALQILGPKWVVST
         +Q L P+ ++ T
Subjt:  ALQILGPKWVVST

Q9H816 5' exonuclease Apollo | e-value: 4.5e-19 | identity: 29.59%
Query:  PFSVDTWT-PSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVK-DPDG--AFTVTVFDAHHCPGN-----
        P +VD W+   +      FL+H H DHT G+    + P+Y + +T  ++ +H  Q+       +E+G+S V+  D  G    TVT+ DA+HCPG+     
Subjt:  PFSVDTWT-PSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVK-DPDG--AFTVTVFDAHHCPGN-----

Query:  ---FGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGS
           FG IL+TGD R TP  L+  P    GK       Q+  ++LD T        PSR  + HQI+  I KHP   +   + + LG+E +L+Q++  F +
Subjt:  ---FGNILHTGDCRLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGS

Query:  KIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQ
         + +        + L L D     ++ + R H +    ++C     +LR  QT   H  + I P+++
Subjt:  KIFVDESMKAGYKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQ

Q9JIC3 DNA cross-link repair 1A protein | e-value: 3.7e-13 | identity: 26.2%
Query:  HFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG---------NFGNILHTGDCRLTPEC
        +FLTH H DH AG+    + P+Y + +T  ++ +   ++++     + +    VV     +  V + DA+HCPG         N   ILHTGD R  P  
Subjt:  HFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPG---------NFGNILHTGDCRLTPEC

Query:  LQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDA-PLVYLICN--LLGQEDILQQVSQTFGSKIFVDESMKAGYKALE
         +S   +  G+       ++  +FLD T+      FPS+   I   IN  ++     P   ++C    +G+E +   ++   GSK+ + +     YK L+
Subjt:  LQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDA-PLVYLICN--LLGQEDILQQVSQTFGSKIFVDESMKAGYKALE

Query:  LID-PE----ILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSMHSSKEELEW
         ++ PE    I T    S  HLL       +  +  L+     +  + L  RP T W  H   S   T+   II +  +    I+ + YS HSS  E++ 
Subjt:  LID-PE----ILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSMHSSKEELEW

Query:  ALQILGPKWVVST
         +Q L P+ ++ T
Subjt:  ALQILGPKWVVST

Arabidopsis top hits (e-value, % identity, alignment)
AT1G19025.1 DNA repair metallo-beta-lactamase family protein | e-value: 2.7e-136 | identity: 43.57%
Query:  MPIEMPQGLPFSVDT---WTPSSKQKHHHFLTHAHRDHTAGIVPHS--SFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAH
        M IEMP+GLPF+VDT   +T + ++K HHFLTHAH+DHT G+ P +   FPIYSTSLT  ++LQ FPQ+++S FV +EIGQS++V DPDG F VT FDA+
Subjt:  MPIEMPQGLPFSVDT---WTPSSKQKHHHFLTHAHRDHTAGIVPHS--SFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAH

Query:  HCP--------GNFGNILHTGDCRLTPECLQSLPEKYRGKS-GKEPRCQLDLIFLDCTFGR--FFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQ
        HCP        G+FGNILHTGDCRLT +CL SLPEKY G+S G +P+C L  IFLDCTFG+    Q+FP++HS+I QIINCIW HPDAP+VYL C++LGQ
Subjt:  HCP--------GNFGNILHTGDCRLTPECLQSLPEKYRGKS-GKEPRCQLDLIFLDCTFGR--FFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQ

Query:  EDILQQVSQTFGSKIFVDESMKAG-YKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSE-----VFTTR
        ED+L +VS+TFGSKI+VD++     +++L +I PEI+++DPSSRFH+  GFPKL +R    L +A++  Q EPLIIRPS QWYV ++  +     +   R
Subjt:  EDILQQVSQTFGSKIFVDESMKAG-YKALELIDPEILTQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSE-----VFTTR

Query:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLDVSTMEVGCSPMVE
        K   SEA+KD+ G+WHVCYSMHSS+ ELE A+Q+L PKWVVST P CRAM+L+YVKK    S  + +   WKL  I  E+S  +  D  T+ + C  M E
Subjt:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILGPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLDVSTMEVGCSPMVE

Query:  APAQINIDPQLQPVKLYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNS
             +   +L+PV   +  K+ L  LS  N  P+TLFGRAR  ++E D L E                  KVI           + +            
Subjt:  APAQINIDPQLQPVKLYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDEPVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNS

Query:  EEKHNKFANDGLLVDNNASISSERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLS
                                 +L+V  V     +DT                         E  V+ E                  T+   S   S
Subjt:  EEKHNKFANDGLLVDNNASISSERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDVGSIPETHAGKLIDDRITVCCNSHLLS

Query:  VGSSKGFNDKFRKLYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF
          + K  +   RKLYRSMN PVP+PLPSL+ELM +RKR++ +  F
Subjt:  VGSSKGFNDKFRKLYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF


Sequences
CDS sequence
ATGCCGATCGAAATGCCCCAAGGCCTGCCATTCTCGGTGGATACATGGACTCCATCTTCCAAGCAGAAGCACCACCATTTTCTAACGCACGCTCACAGGGATCACACCGC
CGGGATTGTCCCCCATTCTTCCTTCCCAATTTATTCTACTTCTCTCACCAAAATCATCGTTCTTCAGCACTTCCCTCAGATTGAGGATTCGTTGTTTGTCTGTATCGAGA
TCGGGCAATCGCTGGTAGTCAAAGATCCTGATGGTGCTTTCACTGTCACAGTTTTCGACGCTCATCACTGTCCTGGCAATTTTGGCAATATTCTGCATACGGGCGATTGC
AGACTAACTCCTGAGTGTCTACAGAGCTTACCGGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCTAGATGTCAACTGGATTTGATTTTTCTAGACTGCACATTTGGTAG
ATTCTTTCAACAATTCCCCAGCAGGCATTCATCAATACATCAGATTATTAATTGCATATGGAAACATCCCGATGCTCCGTTAGTGTATCTGATTTGCAATCTTCTAGGAC
AGGAAGATATATTGCAACAAGTGTCCCAAACATTTGGTTCAAAGATTTTTGTTGATGAATCCATGAAAGCAGGTTACAAGGCTCTTGAACTTATAGATCCTGAGATCCTC
ACTCAAGATCCATCCTCCCGTTTTCATCTGCTTGGTGGATTCCCTAAACTATGTCAAAGAGCAAAACCACTGCTTCGGGATGCCCAGACCAATTTTCAGCATGAACCTCT
CATAATCCGCCCTTCGACCCAGTGGTATGTTCATGAGGAACTGTCAGAGGTTTTCACCACAAGGAAACAGATAATTAGTGAAGCAATCAAAGATCAGCATGGTATTTGGC
ATGTCTGTTACTCGATGCACTCGTCGAAGGAAGAACTTGAATGGGCCTTGCAAATTTTAGGCCCTAAGTGGGTTGTTTCGACCACTCCTGGTTGTCGTGCCATGGATTTG
GATTACGTGAAGAAGAAACTCAGTTGCTCTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTATAGCTGAGGAAAGTTCTTCAGATTTAGATGTTTCAACGAT
GGAAGTGGGCTGTTCCCCTATGGTTGAAGCACCCGCTCAAATCAACATAGATCCTCAACTGCAGCCCGTGAAACTGTATGCTTTCCCTAAAGAAATGCTGAATGTTTTGT
CTTCAAGCAACTTGCCACCTCTTACATTATTTGGACGAGCTAGACTTGGCGCAAAAGAGGCTGATTTGTTGCTGGAAGAAGTTCCATATCCGTCTACAGGGAATGATGAG
CCTGTAGAAGCAGTTGGAGGTAAGGTAATAGACTTGTCAATTCATGATGCAAATAATGGTCAATTGAGCGACGAATCATCAGAAAATTCTGAAAACGAAGTTAACTCTGA
AGAGAAGCACAATAAGTTTGCAAATGATGGGTTATTAGTTGACAATAACGCCTCTATTAGCTCTGAACGGTTTAGGCTCCATGTTTCTGAAGTAAAAGTCACGTCCATGA
ATGACACTCATCCACCAAAACCAGTAGGCAGTAACGTAGAAGAACTTTATATCCATGTGCAAAAAGGTAGAGTGAAGGGAAATGAGTCGTTAGTTGATTGTGAAGATGTC
GGTAGTATTCCCGAAACACATGCTGGGAAGTTAATAGATGACAGGATAACAGTCTGTTGTAATTCACATCTTTTAAGTGTTGGATCTTCAAAGGGTTTCAATGACAAGTT
TAGAAAGTTGTACAGGTCGATGAACGTTCCTGTGCCCAAGCCTCTTCCTTCATTAGTAGAACTCATGAAATCGAGAAAACGCGCAAAGAAGAATGCATACTTC
mRNA sequence
ATGCCGATCGAAATGCCCCAAGGCCTGCCATTCTCGGTGGATACATGGACTCCATCTTCCAAGCAGAAGCACCACCATTTTCTAACGCACGCTCACAGGGATCACACCGC
CGGGATTGTCCCCCATTCTTCCTTCCCAATTTATTCTACTTCTCTCACCAAAATCATCGTTCTTCAGCACTTCCCTCAGATTGAGGATTCGTTGTTTGTCTGTATCGAGA
TCGGGCAATCGCTGGTAGTCAAAGATCCTGATGGTGCTTTCACTGTCACAGTTTTCGACGCTCATCACTGTCCTGGCAATTTTGGCAATATTCTGCATACGGGCGATTGC
AGACTAACTCCTGAGTGTCTACAGAGCTTACCGGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCTAGATGTCAACTGGATTTGATTTTTCTAGACTGCACATTTGGTAG
ATTCTTTCAACAATTCCCCAGCAGGCATTCATCAATACATCAGATTATTAATTGCATATGGAAACATCCCGATGCTCCGTTAGTGTATCTGATTTGCAATCTTCTAGGAC
AGGAAGATATATTGCAACAAGTGTCCCAAACATTTGGTTCAAAGATTTTTGTTGATGAATCCATGAAAGCAGGTTACAAGGCTCTTGAACTTATAGATCCTGAGATCCTC
ACTCAAGATCCATCCTCCCGTTTTCATCTGCTTGGTGGATTCCCTAAACTATGTCAAAGAGCAAAACCACTGCTTCGGGATGCCCAGACCAATTTTCAGCATGAACCTCT
CATAATCCGCCCTTCGACCCAGTGGTATGTTCATGAGGAACTGTCAGAGGTTTTCACCACAAGGAAACAGATAATTAGTGAAGCAATCAAAGATCAGCATGGTATTTGGC
ATGTCTGTTACTCGATGCACTCGTCGAAGGAAGAACTTGAATGGGCCTTGCAAATTTTAGGCCCTAAGTGGGTTGTTTCGACCACTCCTGGTTGTCGTGCCATGGATTTG
GATTACGTGAAGAAGAAACTCAGTTGCTCTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTATAGCTGAGGAAAGTTCTTCAGATTTAGATGTTTCAACGAT
GGAAGTGGGCTGTTCCCCTATGGTTGAAGCACCCGCTCAAATCAACATAGATCCTCAACTGCAGCCCGTGAAACTGTATGCTTTCCCTAAAGAAATGCTGAATGTTTTGT
CTTCAAGCAACTTGCCACCTCTTACATTATTTGGACGAGCTAGACTTGGCGCAAAAGAGGCTGATTTGTTGCTGGAAGAAGTTCCATATCCGTCTACAGGGAATGATGAG
CCTGTAGAAGCAGTTGGAGGTAAGGTAATAGACTTGTCAATTCATGATGCAAATAATGGTCAATTGAGCGACGAATCATCAGAAAATTCTGAAAACGAAGTTAACTCTGA
AGAGAAGCACAATAAGTTTGCAAATGATGGGTTATTAGTTGACAATAACGCCTCTATTAGCTCTGAACGGTTTAGGCTCCATGTTTCTGAAGTAAAAGTCACGTCCATGA
ATGACACTCATCCACCAAAACCAGTAGGCAGTAACGTAGAAGAACTTTATATCCATGTGCAAAAAGGTAGAGTGAAGGGAAATGAGTCGTTAGTTGATTGTGAAGATGTC
GGTAGTATTCCCGAAACACATGCTGGGAAGTTAATAGATGACAGGATAACAGTCTGTTGTAATTCACATCTTTTAAGTGTTGGATCTTCAAAGGGTTTCAATGACAAGTT
TAGAAAGTTGTACAGGTCGATGAACGTTCCTGTGCCCAAGCCTCTTCCTTCATTAGTAGAACTCATGAAATCGAGAAAACGCGCAAAGAAGAATGCATACTTC
Protein sequence
MPIEMPQGLPFSVDTWTPSSKQKHHHFLTHAHRDHTAGIVPHSSFPIYSTSLTKIIVLQHFPQIEDSLFVCIEIGQSLVVKDPDGAFTVTVFDAHHCPGNFGNILHTGDC
RLTPECLQSLPEKYRGKSGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKIFVDESMKAGYKALELIDPEIL
TQDPSSRFHLLGGFPKLCQRAKPLLRDAQTNFQHEPLIIRPSTQWYVHEELSEVFTTRKQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILGPKWVVSTTPGCRAMDL
DYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSTMEVGCSPMVEAPAQINIDPQLQPVKLYAFPKEMLNVLSSSNLPPLTLFGRARLGAKEADLLLEEVPYPSTGNDE
PVEAVGGKVIDLSIHDANNGQLSDESSENSENEVNSEEKHNKFANDGLLVDNNASISSERFRLHVSEVKVTSMNDTHPPKPVGSNVEELYIHVQKGRVKGNESLVDCEDV
GSIPETHAGKLIDDRITVCCNSHLLSVGSSKGFNDKFRKLYRSMNVPVPKPLPSLVELMKSRKRAKKNAYF
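
The protein sequence corresponds to a codon-by-codon translation of the CDS. A minimal Python sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1); the name `translate` and the `X` placeholder for unrecognised codons are illustrative choices, not something the database specifies. The example input is the first 24 nucleotides of the CDS listed above.

```python
# Standard genetic code (NCBI table 1), with codons ordered by T, C, A, G
# at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for pos in range(0, len(seq), 3):
        # Codons with ambiguous bases are reported as 'X' (an assumption).
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


print(translate("ATGCCGATCGAAATGCCCCAAGGC"))  # first eight codons of the CDS
```

The eight codons shown translate to MPIEMPQG, the first eight residues of the protein sequence; passing the full CDS with its line breaks works the same way, since whitespace is removed before translation.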