CuGenDBv2

Sgr015412 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015412
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: DRMBL domain-containing protein
Genome location: tig00003469:1515743..1536811
RNA-Seq Expression: Sgr015412
Synteny: Sgr015412
Gene Ontology terms:
GO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0031848 - protection from non-homologous end joining at telomere (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domains:
IPR011084 - DNA repair metallo-beta-lactamase
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like
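The genome location field uses a contig:start..end notation. As a minimal sketch, the Python below splits such a string into its parts and derives the span length. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A contig name plus start and end coordinates."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'contig:start..end' string into a GenomeLocation."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end = match.groups()
    return GenomeLocation(contig, int(start), int(end))


loc = parse_location("tig00003469:1515743..1536811")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 21069 positions on contig tig00003469.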


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6605321.1 5' exonuclease Apollo, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 4.0e-299; % identity: 81.93)
Query:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG
        MPIEMP GLPFSVDTWTPSS  KRHHFLTHAH DHT GI  AHSSFPI+STF+TK+IVLQ FPQL DSLFVCIE+GQTLV+KDPDG FTVTVFDA+HCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG

Query:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        A MFLFEGNFGNILHTGDCRLTPEC+QNLPEKY GK+GKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG
        SQTFGSKIFV+E  KAGYKALELIDPDILTQDPSS FHLL GFP LC+ AK+LLA+A+TN Q EPL+IRPSTQWYVREELSE+CN+RK IISEAIKDQHG
Subjt:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVKKK SC+SLTSNGLIWKLFG+ EESSSDLD S +EV CSP+ E  T  +++PQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK

Query:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD
        LY V +EML+ILSSSNLPPLTLFGRARLAAEDA++L EEVSYPST+NEPVE+VGDKVADLSIHD NGR SD+ SK+S N+V+S+GK +KFAND  LLAD+
Subjt:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD

Query:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVC-SNSHLLSVGSSKGFNDKFRKL
          S CSDR  LH S VKVVSMNN +PPE VSS+V EL++HEQ SR +GNKSLDD EDVG+VPET   KLV DDRIA C SNSH LSVGSSKGFND+FRKL
Subjt:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVC-SNSHLLSVGSSKGFNDKFRKL

Query:  YRSKNVPVPEPLPSLVKLMKSRKRAKRNAYF
        YRS NV VPEPLPSLV+LMKSRKRAKRNAYF
Subjt:  YRSKNVPVPEPLPSLVKLMKSRKRAKRNAYF
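
The unlabeled middle row of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the Python below counts identities and positives from one such row. The function name `midline_stats` is a hypothetical helper, and the result covers only the block shown, so it will not necessarily match the whole-alignment % identity reported for the hit.

```python
def midline_stats(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST-style midline row."""
    # Letters in the midline mark positions where query and subject agree.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives


# Midline of the final block of the KAG6605321.1 alignment above.
midline = "YRS NV VPEPLPSLV+LMKSRKRAKRNAYF"
identities, positives = midline_stats(midline)
print(identities, positives, len(midline))
```

For this 31-column block the count is 28 identities and 29 positives.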

XP_022156681.1 protein artemis [Momordica charantia] (e-value: 0.0e+00; % identity: 84.26)
Query:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGA
        MPIEMPQGLPFSVDTWTPSS  K HHFLTHAHRDHT GIV HSSFPIYST LTK IVLQ FPQ++DSLFVCIE+GQ+LV+KDPDG FTVTVFDA+HCPGA
Subjt:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGA

Query:  AMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
         MFLFEGNFGNILHTGDCRLTPEC+Q+LPEKYRGK+GKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
Subjt:  AMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHGI
        QTFGSKIFV+ES KAGYKALELIDP+ILTQDPSS FHLL GFP LC+RAK LL DA+TN QHEPLIIRPSTQWYV EELSEV  +RK IISEAIKDQHGI
Subjt:  QTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVKL
        WHVCYSMHSSKEELEWALQIL PKWV STTPGCRA+DLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVS MEVGCSPM EAP QINI+PQLQPVKL
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVKL

Query:  YDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDN-EPVESVGDKVADLSIHD-DNGRLSDELSKNSKNKVHSEGKHEKFANDGLLADD
        Y   KEMLN+LSSSNLPPLTLFGRARL A++ADLL EEV YPST N EPVE+VG KV DLSIHD +NG+LSDE S+NS+N+V+SE KH+KFANDGLL D+
Subjt:  YDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDN-EPVESVGDKVADLSIHD-DNGRLSDELSKNSKNKVHSEGKHEKFANDGLLADD

Query:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLVDDRIAVCSNSHLLSVGSSKGFNDKFRKLYR
        N S+ S+RVRLHVS VKV SMN+T PP+ V S V ELYIH Q+ RV+GN+SL D EDVGS+PET   KL+DDRI VC NSHLLSVGSSKGFNDKFRKLYR
Subjt:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLVDDRIAVCSNSHLLSVGSSKGFNDKFRKLYR

Query:  SKNVPVPEPLPSLVKLMKSRKRAKRNAYF
        S NVPVP+PLPSLV+LMKSRKRAK+NAYF
Subjt:  SKNVPVPEPLPSLVKLMKSRKRAKRNAYF

XP_022948238.1 uncharacterized protein LOC111451874 isoform X1 [Cucurbita moschata] (e-value: 5.6e-301; % identity: 81.75)
Query:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG
        MPIEMP GLPFSVDTWTPSS  KRHHFLTHAH DHT GI  AHSSFPI+STF+TK+IVLQ FPQL DSLFVCIE+GQTLV+KDPDG FTVTVFDA+HCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG

Query:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        A MFLFEGNFGNILHTGDCRLTPEC+QNLPEKYRGK+GKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG
        SQTFGSKIFV+E +KAGYKALELIDPDILTQDPSS FHLL GFP LC+ AKALLA+A+TN Q EPL+IRPSTQWYVREELSE+CN+RK IISEAIKDQHG
Subjt:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLD SV+EV CSP+ E  T  +++PQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK

Query:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD
        LY V +E L+ILS SNLPPLTLFGRARLA +DA++L EEVSYPST+NEPVE+VGDKVADLSIHD NGR SD+ SK+SKN+V+S+GKHEKFAND  LLAD+
Subjt:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD

Query:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY
        + S CSDR RLH S V+VVSMNN +PPE VSS+V EL++HEQ SR +G+KSLDD EDV +VP+T   KLV DDR+ V SNSH+LSVGSSKGFND+FRKLY
Subjt:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY

Query:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF
        RS NV VPEPLPSLV+LMKSRKRAKRNAYF
Subjt:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF

XP_023007139.1 protein artemis isoform X1 [Cucurbita maxima] (e-value: 3.6e-300; % identity: 81.75)
Query:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG
        MPIEMP GLPFSVDTWTPSS  KRHHFLTHAH DHT GI  AHSSFPI+STF+TK+IVLQ FPQL DSLFVCIE+GQTLV+KDP+G FTVTVFDA+HCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG

Query:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        A MFLFEGNFGNILHTGDCRLTPEC+QNLPEKYRGK+GKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG
        SQTFGSKIFV+E +KAGYKALELIDPDILTQDPSS FHLL GFP LC+ AKALLA+A+TN Q EPL+IRPSTQWYVREELSE CN+RK IISEAIKDQHG
Subjt:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLD S +EV CSP+ E  T  +++PQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK

Query:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD
        LY V +E L+ILS SNLPPLTLFGRARL AEDA++L EEVSYPS +NEPVE+VGDKVADLSIHD NGR SD+ SK+SKN+V+S+GKHEKFAN   LLAD+
Subjt:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD

Query:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY
          S CSDR RLH+S VKVVSMNN +PPE VSS+V EL+ HEQ SR +GNKSLDD EDV +VPET   KLV DDRIA CSNSH+LSVGSSKGFN +FRKLY
Subjt:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY

Query:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF
        RS NV VPEPLPSLV+LMKSRKRAKRNAYF
Subjt:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF

XP_023532363.1 uncharacterized protein LOC111794563 isoform X1 [Cucurbita pepo subsp. pepo] (e-value: 1.1e-301; % identity: 81.88)
Query:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG
        MPIEMP GLPFSVDTWTPSS  KRHHFLTHAH DHT GI  AHSSFPI+STF+TK+IVLQ FPQL DSLFVCIE+GQTLV+KDPDG FTVTV DA+HCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG

Query:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        A MFLFEGNFGNILHTGDCRLTPEC+QNLPEKYRGK+GKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG
        SQTFGSKIFV+E +KAGYKALELIDPDILTQDPSS FHLL GFP LC+ AKALLA+A+TN Q EPL+IRPSTQWYVREELSE+CN+RK IISEAIKDQHG
Subjt:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVKKK SCSSLTSNGLIW+LFG+AEESSSDLD S +EV CSP+ E  T  +++PQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK

Query:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD
        LY V +E L+ILSSSNLPPLTLFGRARLA +DA++L EEVSYPST+NEPVE+VGDKVADLSIHD NGR SD+ SK+SKN+++S+GKHEKFAN+  LLAD+
Subjt:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD

Query:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY
        + S CSD  RLH S VKVVSMNN +PPE VSS+V EL++HEQ SR  GNKSLDD EDV +VPET   KLV DDRIA CSNSH+LSVGSSKGFND+FRKLY
Subjt:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY

Query:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAY
        RS NV VPEPLPSLV+LMKSRKRAKRNAY
Subjt:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAY

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1DVN9 protein artemis (e-value: 0.0e+00; % identity: 84.26)
Query:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGA
        MPIEMPQGLPFSVDTWTPSS  K HHFLTHAHRDHT GIV HSSFPIYST LTK IVLQ FPQ++DSLFVCIE+GQ+LV+KDPDG FTVTVFDA+HCPGA
Subjt:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGA

Query:  AMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
         MFLFEGNFGNILHTGDCRLTPEC+Q+LPEKYRGK+GKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
Subjt:  AMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHGI
        QTFGSKIFV+ES KAGYKALELIDP+ILTQDPSS FHLL GFP LC+RAK LL DA+TN QHEPLIIRPSTQWYV EELSEV  +RK IISEAIKDQHGI
Subjt:  QTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVKL
        WHVCYSMHSSKEELEWALQIL PKWV STTPGCRA+DLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVS MEVGCSPM EAP QINI+PQLQPVKL
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVKL

Query:  YDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDN-EPVESVGDKVADLSIHD-DNGRLSDELSKNSKNKVHSEGKHEKFANDGLLADD
        Y   KEMLN+LSSSNLPPLTLFGRARL A++ADLL EEV YPST N EPVE+VG KV DLSIHD +NG+LSDE S+NS+N+V+SE KH+KFANDGLL D+
Subjt:  YDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDN-EPVESVGDKVADLSIHD-DNGRLSDELSKNSKNKVHSEGKHEKFANDGLLADD

Query:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLVDDRIAVCSNSHLLSVGSSKGFNDKFRKLYR
        N S+ S+RVRLHVS VKV SMN+T PP+ V S V ELYIH Q+ RV+GN+SL D EDVGS+PET   KL+DDRI VC NSHLLSVGSSKGFNDKFRKLYR
Subjt:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLVDDRIAVCSNSHLLSVGSSKGFNDKFRKLYR

Query:  SKNVPVPEPLPSLVKLMKSRKRAKRNAYF
        S NVPVP+PLPSLV+LMKSRKRAK+NAYF
Subjt:  SKNVPVPEPLPSLVKLMKSRKRAKRNAYF

A0A6J1G8U0 uncharacterized protein LOC111451874 isoform X1 (e-value: 2.7e-301; % identity: 81.75)
Query:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG
        MPIEMP GLPFSVDTWTPSS  KRHHFLTHAH DHT GI  AHSSFPI+STF+TK+IVLQ FPQL DSLFVCIE+GQTLV+KDPDG FTVTVFDA+HCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG

Query:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        A MFLFEGNFGNILHTGDCRLTPEC+QNLPEKYRGK+GKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG
        SQTFGSKIFV+E +KAGYKALELIDPDILTQDPSS FHLL GFP LC+ AKALLA+A+TN Q EPL+IRPSTQWYVREELSE+CN+RK IISEAIKDQHG
Subjt:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLD SV+EV CSP+ E  T  +++PQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK

Query:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD
        LY V +E L+ILS SNLPPLTLFGRARLA +DA++L EEVSYPST+NEPVE+VGDKVADLSIHD NGR SD+ SK+SKN+V+S+GKHEKFAND  LLAD+
Subjt:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD

Query:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY
        + S CSDR RLH S V+VVSMNN +PPE VSS+V EL++HEQ SR +G+KSLDD EDV +VP+T   KLV DDR+ V SNSH+LSVGSSKGFND+FRKLY
Subjt:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY

Query:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF
        RS NV VPEPLPSLV+LMKSRKRAKRNAYF
Subjt:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF

A0A6J1G997 uncharacterized protein LOC111451874 isoform X2 (e-value: 8.5e-295; % identity: 80.63)
Query:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG
        MPIEMP GLPFSVDTWTPSS  KRHHFLTHAH DHT GI  AHSSFPI+STF+TK+IVLQ FPQL DSLFVCIE+GQTLV+KDPDG FTVTVFDA+HCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG

Query:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
               GNFGNILHTGDCRLTPEC+QNLPEKYRGK+GKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG
        SQTFGSKIFV+E +KAGYKALELIDPDILTQDPSS FHLL GFP LC+ AKALLA+A+TN Q EPL+IRPSTQWYVREELSE+CN+RK IISEAIKDQHG
Subjt:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLD SV+EV CSP+ E  T  +++PQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK

Query:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD
        LY V +E L+ILS SNLPPLTLFGRARLA +DA++L EEVSYPST+NEPVE+VGDKVADLSIHD NGR SD+ SK+SKN+V+S+GKHEKFAND  LLAD+
Subjt:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD

Query:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY
        + S CSDR RLH S V+VVSMNN +PPE VSS+V EL++HEQ SR +G+KSLDD EDV +VP+T   KLV DDR+ V SNSH+LSVGSSKGFND+FRKLY
Subjt:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY

Query:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF
        RS NV VPEPLPSLV+LMKSRKRAKRNAYF
Subjt:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF

A0A6J1L262 uncharacterized protein LOC111499723 isoform X2 (e-value: 5.5e-294; % identity: 80.63)
Query:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG
        MPIEMP GLPFSVDTWTPSS  KRHHFLTHAH DHT GI  AHSSFPI+STF+TK+IVLQ FPQL DSLFVCIE+GQTLV+KDP+G FTVTVFDA+HCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG

Query:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
               GNFGNILHTGDCRLTPEC+QNLPEKYRGK+GKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG
        SQTFGSKIFV+E +KAGYKALELIDPDILTQDPSS FHLL GFP LC+ AKALLA+A+TN Q EPL+IRPSTQWYVREELSE CN+RK IISEAIKDQHG
Subjt:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLD S +EV CSP+ E  T  +++PQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK

Query:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD
        LY V +E L+ILS SNLPPLTLFGRARL AEDA++L EEVSYPS +NEPVE+VGDKVADLSIHD NGR SD+ SK+SKN+V+S+GKHEKFAN   LLAD+
Subjt:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD

Query:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY
          S CSDR RLH+S VKVVSMNN +PPE VSS+V EL+ HEQ SR +GNKSLDD EDV +VPET   KLV DDRIA CSNSH+LSVGSSKGFN +FRKLY
Subjt:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY

Query:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF
        RS NV VPEPLPSLV+LMKSRKRAKRNAYF
Subjt:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF

A0A6J1L450 protein artemis isoform X1 (e-value: 1.8e-300; % identity: 81.75)
Query:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG
        MPIEMP GLPFSVDTWTPSS  KRHHFLTHAH DHT GI  AHSSFPI+STF+TK+IVLQ FPQL DSLFVCIE+GQTLV+KDP+G FTVTVFDA+HCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGI-VAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPG

Query:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        A MFLFEGNFGNILHTGDCRLTPEC+QNLPEKYRGK+GKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG
        SQTFGSKIFV+E +KAGYKALELIDPDILTQDPSS FHLL GFP LC+ AKALLA+A+TN Q EPL+IRPSTQWYVREELSE CN+RK IISEAIKDQHG
Subjt:  SQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRA+DLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLD S +EV CSP+ E  T  +++PQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVK

Query:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD
        LY V +E L+ILS SNLPPLTLFGRARL AEDA++L EEVSYPS +NEPVE+VGDKVADLSIHD NGR SD+ SK+SKN+V+S+GKHEKFAN   LLAD+
Subjt:  LYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDG-LLADD

Query:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY
          S CSDR RLH+S VKVVSMNN +PPE VSS+V EL+ HEQ SR +GNKSLDD EDV +VPET   KLV DDRIA CSNSH+LSVGSSKGFN +FRKLY
Subjt:  NTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKV-ELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLV-DDRIAVCSNSHLLSVGSSKGFNDKFRKLY

Query:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF
        RS NV VPEPLPSLV+LMKSRKRAKRNAYF
Subjt:  RSKNVPVPEPLPSLVKLMKSRKRAKRNAYF

SwissProt top hits (e-value, % identity, alignment)
D2H8V8 5' exonuclease Apollo (e-value: 7.5e-22; % identity: 35.35)
Query:  PFSVDTWT-PSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIK-DPDG--PFTVTVFDAYHCPGAAMFLF
        P +VD W+   +   R  FL+H H DHT G+ +  + P+Y + +T A ++ +  Q+       +E+G++ V+  D  G    TVT+ DA HCPG+ MFLF
Subjt:  PFSVDTWT-PSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIK-DPDG--PFTVTVFDAYHCPGAAMFLF

Query:  EGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTF
        EG FG IL+TGD R TP  ++  P    GK       Q+  ++LD T        PSR  +  QI+  I KHP   +   + + LG+E +L+Q++  F
Subjt:  EGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTF

Q4KLY6 5' exonuclease Apollo (e-value: 6.3e-21; % identity: 32.74)
Query:  IEMPQGLPFSVDTWT-PSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQT-LVIKDPDG--PFTVTVFDAYHCP
        + +PQ  P +VD W+   +   R  FL+H H DHT G+ +  + P+Y + +T A +L +  Q+       +E+G++ +++ D  G    TVT+ DA HCP
Subjt:  IEMPQGLPFSVDTWT-PSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQT-LVIKDPDG--PFTVTVFDAYHCP

Query:  GAAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQ
        G+ MFLFEG FG IL+TGD R TP  ++  P    GK       Q+  ++LD T        PSR  +  QII  I + P   +   + + LG+E +L+Q
Subjt:  GAAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQ

Query:  VSQTFGSKIFVNESSKAGYKALELID
        ++  F + + ++       + L L D
Subjt:  VSQTFGSKIFVNESSKAGYKALELID

Q5QJC3 5' exonuclease Apollo (e-value: 6.8e-23; % identity: 34.22)
Query:  GLPFSVDTWT-PSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGAAMFLFE
        G P +VD W+   +   R  FL+H H DHT G+ +  S P+Y + LT A +L    ++       +E+GQ+  + +     TVT+ DA HCPG+ MFLFE
Subjt:  GLPFSVDTWT-PSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGAAMFLFE

Query:  GNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSK
        G FG IL+TGD R +P  +Q  P      +G+    ++D ++LD T  R     PSR  +  Q    I +HP   +V  + + LG+E++L  ++  FG+ 
Subjt:  GNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSK

Query:  IFVNESSKAGYKALELIDPDILTQD
        + V+ S     + LEL  P++ T +
Subjt:  IFVNESSKAGYKALELIDPDILTQD

Q8C7W7 5' exonuclease Apollo (e-value: 2.3e-23; % identity: 29.39)
Query:  IEMPQGLPFSVDTWT-PSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIK-DPDG--PFTVTVFDAYHCP
        + +PQ  P +VD W+   +   R  FLTH H DHT G+ +  + P+Y + +T A +L +  Q+       +E+G++ V+  D  G    TVT+ DA HCP
Subjt:  IEMPQGLPFSVDTWT-PSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIK-DPDG--PFTVTVFDAYHCP

Query:  GAAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQ
        G+ MFLFEG FG IL+TGD R TP  ++  P    GK       Q+  ++LD T        PSR  +  QI+  I + P   +   + + LG+E +L+Q
Subjt:  GAAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQ

Query:  VSQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQH
        ++  F + + ++       + L L D     ++ +   H ++    +C  A       + N  H  + I P+                    S  ++  H
Subjt:  VSQTFGSKIFVNESSKAGYKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQH

Query:  -GIWHVCYSMHSSKEELEWALQILAPKWVV
          I+ V YS HSS  EL   +  L P  VV
Subjt:  -GIWHVCYSMHSSKEELEWALQILAPKWVV

Q9H816 5' exonuclease Apollo (e-value: 3.0e-23; % identity: 36.36)
Query:  PFSVDTWT-PSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIK-DPDG--PFTVTVFDAYHCPGAAMFLF
        P +VD W+   +   R  FL+H H DHT G+ +  + P+Y + +T A +L +  Q+       +E+G++ V+  D  G    TVT+ DA HCPG+ MFLF
Subjt:  PFSVDTWT-PSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIK-DPDG--PFTVTVFDAYHCPGAAMFLF

Query:  EGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTF
        EG FG IL+TGD R TP  ++  P    GK       Q+  ++LD T        PSR  + HQI+  I KHP   +   + + LG+E +L+Q++  F
Subjt:  EGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTF

Arabidopsis top hits (e-value, % identity, alignment)
AT1G19025.1 DNA repair metallo-beta-lactamase family protein (e-value: 1.1e-140; % identity: 43.3)
Query:  MPIEMPQGLPFSVDT---WTPSSNIKRHHFLTHAHRDHTTGIVAHS--SFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAY
        M IEMP+GLPF+VDT   +T +   KRHHFLTHAH+DHT G+   +   FPIYST LT +++LQ+FPQLD+S FV +E+GQ++++ DPDG F VT FDA 
Subjt:  MPIEMPQGLPFSVDT---WTPSSNIKRHHFLTHAHRDHTTGIVAHS--SFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAY

Query:  HCPGAAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGK-NGKEPRCQLDLIFLDCTFGR--FFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQ
        HCPGA MFLFEG+FGNILHTGDCRLT +C+ +LPEKY G+ +G +P+C L  IFLDCTFG+    Q+FP++HS+I QIINCIW HPDAP+VYL C++LGQ
Subjt:  HCPGAAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGK-NGKEPRCQLDLIFLDCTFGR--FFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQ

Query:  EDILQQVSQTFGSKIFVNESSKAG-YKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSE-----VCNSR
        ED+L +VS+TFGSKI+V++++    +++L +I P+I+++DPSS FH+  GFP L ER  A LA+AR+ +Q EPLIIRPS QWYV ++  +     +   R
Subjt:  EDILQQVSQTFGSKIFVNESSKAG-YKALELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSE-----VCNSR

Query:  KLIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLDVSVMEVGCSPMAE
        K+  SEA+KD+ G+WHVCYSMHSS+ ELE A+Q+L+PKWVVST P CRA++L+YVKK    S  + +   WKL  I  E+S  +  D   + + C  M+E
Subjt:  KLIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLDVSVMEVGCSPMAE

Query:  APTQINIEPQLQPVKLYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEG
             + + +L+PV      K+ L  LS  N  P+TLFGRAR ++++ D L E                                       +  +H++ 
Subjt:  APTQINIEPQLQPVKLYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEG

Query:  KHEKFANDGLLADDNTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKVELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLVDDRIAVCSNSHLLSVGS
         + K           TS   +++      VKVV                               ++ ++D     +T  E +  +   + S S   S  +
Subjt:  KHEKFANDGLLADDNTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKVELYIHEQRSRVEGNKSLDDIEDVGSVPETCFEKLVDDRIAVCSNSHLLSVGS

Query:  SKGFNDKFRKLYRSKNVPVPEPLPSLVKLMKSRKRAKRNAYF
         K  +   RKLYRS N PVP PLPSL++LM +RKR++ +  F
Subjt:  SKGFNDKFRKLYRSKNVPVPEPLPSLVKLMKSRKRAKRNAYF

AT1G27410.1 DNA repair metallo-beta-lactamase family protein (e-value: 5.1e-26; % identity: 27.95)
Query:  MPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGIV-AHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEM--GQTLVIKDPDGPFTVTV----FDAYHC
        M  GL  SVD W    N  + +FLTH H DHT G+    S  P+Y +  T ++   +FP  D SL   + +    +L ++ P    TV +     DA+HC
Subjt:  MPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGIV-AHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEM--GQTLVIKDPDGPFTVTV----FDAYHC

Query:  PGAAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQ
        PG+ MFLF G+FG  L+TGD R   +              + P   +D+++LD T+      FPSR  ++  + + I  HP   ++  + + LG+ED+L 
Subjt:  PGAAMFLFEGNFGNILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQ

Query:  QVSQTFGSKIFVNESSKAGYKALELID-PDILTQDP-----------SSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRP-------STQWYVREE
         VS+    KI+V        + + L+   DI T D            S     LEG  ++C     + +         P + RP       S  +     
Subjt:  QVSQTFGSKIFVNESSKAGYKALELID-PDILTQDP-----------SSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRP-------STQWYVREE

Query:  LSEVCNSRKLIISEAIKDQHG-IWHVCYSMHSSKEELEWALQILAPK
         +E  +++K + + A+   H  ++ V YS HS  EE+   ++++ PK
Subjt:  LSEVCNSRKLIISEAIKDQHG-IWHVCYSMHSSKEELEWALQILAPK

AT3G26680.1 DNA repair metallo-beta-lactamase family protein (e-value: 3.9e-18; % identity: 26.59)
Query:  GLPFSVDTWTPSS-NIKRHHFLTHAHRDHTTGIV-AHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGAAMFLF
        G PF+VD +          +FLTH H DH  G+  A S  PIY + LT  + L+    ++ S    +E+     I        VT+ +A HCPGAA+  F
Subjt:  GLPFSVDTWTPSS-NIKRHHFLTHAHRDHTTGIV-AHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGAAMFLF

Query:  EGNFGN-ILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQII----NCIWKHPDAPLVYLICNLLGQEDILQQVS
            G   LHTGD R + + +Q  P  +  +        + +++LD T+     +FPS+   +  ++    + + K P   L+ +    +G+E +   ++
Subjt:  EGNFGN-ILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQII----NCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVNESSKAGYKAL--ELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQH
        +  G KIF N S +   ++   + I  ++ T   ++C H+L       ER    L   R   Q+  ++    T W   E++ E  +    +I    + + 
Subjt:  QTFGSKIFVNESSKAGYKAL--ELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQH

Query:  GIWHVCYSMHSSKEELEWALQILAPKWVVST
         I+ V YS HSS  EL   +Q L P  ++ T
Subjt:  GIWHVCYSMHSSKEELEWALQILAPKWVVST

AT3G26680.2 DNA repair metallo-beta-lactamase family protein (e-value: 3.9e-18; % identity: 26.59)
Query:  GLPFSVDTWTPSS-NIKRHHFLTHAHRDHTTGIV-AHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGAAMFLF
        G PF+VD +          +FLTH H DH  G+  A S  PIY + LT  + L+    ++ S    +E+     I        VT+ +A HCPGAA+  F
Subjt:  GLPFSVDTWTPSS-NIKRHHFLTHAHRDHTTGIV-AHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGAAMFLF

Query:  EGNFGN-ILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQII----NCIWKHPDAPLVYLICNLLGQEDILQQVS
            G   LHTGD R + + +Q  P  +  +        + +++LD T+     +FPS+   +  ++    + + K P   L+ +    +G+E +   ++
Subjt:  EGNFGN-ILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQII----NCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVNESSKAGYKAL--ELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQH
        +  G KIF N S +   ++   + I  ++ T   ++C H+L       ER    L   R   Q+  ++    T W   E++ E  +    +I    + + 
Subjt:  QTFGSKIFVNESSKAGYKAL--ELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQH

Query:  GIWHVCYSMHSSKEELEWALQILAPKWVVST
         I+ V YS HSS  EL   +Q L P  ++ T
Subjt:  GIWHVCYSMHSSKEELEWALQILAPKWVVST

AT3G26680.3 DNA repair metallo-beta-lactamase family protein    e value: 3.9e-18    % identity: 26.59
Query:  GLPFSVDTWTPSS-NIKRHHFLTHAHRDHTTGIV-AHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGAAMFLF
        G PF+VD +          +FLTH H DH  G+  A S  PIY + LT  + L+    ++ S    +E+     I        VT+ +A HCPGAA+  F
Subjt:  GLPFSVDTWTPSS-NIKRHHFLTHAHRDHTTGIV-AHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGAAMFLF

Query:  EGNFGN-ILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQII----NCIWKHPDAPLVYLICNLLGQEDILQQVS
            G   LHTGD R + + +Q  P  +  +        + +++LD T+     +FPS+   +  ++    + + K P   L+ +    +G+E +   ++
Subjt:  EGNFGN-ILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQII----NCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVNESSKAGYKAL--ELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQH
        +  G KIF N S +   ++   + I  ++ T   ++C H+L       ER    L   R   Q+  ++    T W   E++ E  +    +I    + + 
Subjt:  QTFGSKIFVNESSKAGYKAL--ELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQH

Query:  GIWHVCYSMHSSKEELEWALQILAPKWVVST
         I+ V YS HSS  EL   +Q L P  ++ T
Subjt:  GIWHVCYSMHSSKEELEWALQILAPKWVVST


Sequences
CDS sequence
ATGCCGATCGAAATGCCCCAAGGGCTGCCGTTCTCGGTGGATACATGGACTCCATCTTCCAACATAAAGCGCCACCATTTCCTAACGCACGCTCACAGGGACCACACCAC
TGGGATTGTCGCCCATTCTTCTTTCCCGATTTATTCTACTTTTCTCACCAAGGCCATCGTTCTTCAGCAGTTCCCTCAGCTTGATGATTCATTGTTTGTCTGTATCGAGA
TGGGGCAAACGCTGGTCATCAAAGATCCTGATGGACCCTTCACCGTCACAGTTTTCGATGCTTATCACTGCCCTGGAGCTGCTATGTTCTTATTTGAAGGCAATTTTGGC
AATATTCTACATACGGGTGATTGCAGACTAACTCCTGAGTGCATACAGAACTTACCTGAGAAGTATCGTGGAAAAAATGGTAAAGAGCCGAGATGTCAACTGGATCTGAT
TTTTCTAGATTGCACTTTTGGCAGATTCTTTCAACAATTCCCCAGCAGGCACTCATCAATACATCAGATTATTAATTGCATATGGAAACATCCTGATGCTCCTTTAGTAT
ATCTGATTTGCAATCTCCTAGGACAGGAAGATATATTGCAACAAGTGTCCCAAACATTTGGTTCAAAGATATTTGTTAATGAATCCTCGAAAGCAGGTTACAAGGCTCTT
GAACTTATAGATCCTGACATCCTCACTCAAGATCCATCCTCCTGCTTTCATCTGCTTGAAGGATTCCCTAGTCTATGTGAAAGAGCTAAAGCACTGCTTGCAGATGCCCG
GACCAATGTTCAGCATGAACCTCTCATAATCCGCCCTTCAACCCAGTGGTATGTTCGTGAGGAATTGTCAGAGGTTTGCAACTCAAGGAAACTAATAATTAGTGAAGCAA
TCAAAGATCAGCATGGTATTTGGCACGTCTGTTACTCTATGCACTCGTCGAAGGAAGAACTAGAATGGGCATTGCAAATTTTGGCACCTAAGTGGGTTGTTTCAACCACT
CCTGGTTGTCGGGCCATTGATTTGGATTACGTGAAAAAGAAACTCAGTTGCTCTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTATAGCTGAGGAAAGTTC
TTCAGATTTAGATGTTTCAGTGATGGAAGTGGGCTGTTCCCCTATGGCTGAAGCACCCACTCAAATAAATATAGAGCCTCAACTACAGCCAGTGAAACTGTATGATGTTT
GTAAAGAAATGTTAAATATTTTGTCTTCAAGCAACTTGCCACCTCTCACATTATTTGGACGAGCTCGACTTGCTGCTGAAGATGCTGATTTGTTGCAGGAAGAAGTTTCA
TATCCGTCTACAGACAATGAGCCTGTAGAATCAGTTGGAGATAAGGTTGCAGACTTGTCCATTCATGATGATAATGGTAGACTGAGTGACGAATTATCAAAGAATTCTAA
AAACAAGGTTCACTCTGAAGGAAAACACGAGAAGTTTGCGAATGATGGGTTATTAGCTGATGATAACACCTCTCTTTGCTCTGATCGGGTTAGGCTCCATGTTTCTGGAG
TAAAAGTTGTGTCCATGAACAACACTGATCCACCAGAAGGAGTTAGCAGTAAGGTAGAACTCTATATCCATGAGCAAAGAAGTAGAGTGGAGGGAAACAAGTCGTTAGAT
GATATTGAAGATGTTGGTAGTGTTCCTGAAACATGCTTTGAGAAGTTAGTAGATGATAGGATAGCAGTATGTAGTAATTCACATCTTTTAAGTGTTGGATCTTCAAAGGG
TTTTAATGACAAGTTCAGAAAGTTGTACAGGTCAAAGAATGTTCCTGTGCCCGAGCCTCTTCCTTCACTGGTGAAACTTATGAAATCTAGAAAACGCGCTAAGAGGAATG
CATATTTCTAG
mRNA sequence
ATGCCGATCGAAATGCCCCAAGGGCTGCCGTTCTCGGTGGATACATGGACTCCATCTTCCAACATAAAGCGCCACCATTTCCTAACGCACGCTCACAGGGACCACACCAC
TGGGATTGTCGCCCATTCTTCTTTCCCGATTTATTCTACTTTTCTCACCAAGGCCATCGTTCTTCAGCAGTTCCCTCAGCTTGATGATTCATTGTTTGTCTGTATCGAGA
TGGGGCAAACGCTGGTCATCAAAGATCCTGATGGACCCTTCACCGTCACAGTTTTCGATGCTTATCACTGCCCTGGAGCTGCTATGTTCTTATTTGAAGGCAATTTTGGC
AATATTCTACATACGGGTGATTGCAGACTAACTCCTGAGTGCATACAGAACTTACCTGAGAAGTATCGTGGAAAAAATGGTAAAGAGCCGAGATGTCAACTGGATCTGAT
TTTTCTAGATTGCACTTTTGGCAGATTCTTTCAACAATTCCCCAGCAGGCACTCATCAATACATCAGATTATTAATTGCATATGGAAACATCCTGATGCTCCTTTAGTAT
ATCTGATTTGCAATCTCCTAGGACAGGAAGATATATTGCAACAAGTGTCCCAAACATTTGGTTCAAAGATATTTGTTAATGAATCCTCGAAAGCAGGTTACAAGGCTCTT
GAACTTATAGATCCTGACATCCTCACTCAAGATCCATCCTCCTGCTTTCATCTGCTTGAAGGATTCCCTAGTCTATGTGAAAGAGCTAAAGCACTGCTTGCAGATGCCCG
GACCAATGTTCAGCATGAACCTCTCATAATCCGCCCTTCAACCCAGTGGTATGTTCGTGAGGAATTGTCAGAGGTTTGCAACTCAAGGAAACTAATAATTAGTGAAGCAA
TCAAAGATCAGCATGGTATTTGGCACGTCTGTTACTCTATGCACTCGTCGAAGGAAGAACTAGAATGGGCATTGCAAATTTTGGCACCTAAGTGGGTTGTTTCAACCACT
CCTGGTTGTCGGGCCATTGATTTGGATTACGTGAAAAAGAAACTCAGTTGCTCTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTATAGCTGAGGAAAGTTC
TTCAGATTTAGATGTTTCAGTGATGGAAGTGGGCTGTTCCCCTATGGCTGAAGCACCCACTCAAATAAATATAGAGCCTCAACTACAGCCAGTGAAACTGTATGATGTTT
GTAAAGAAATGTTAAATATTTTGTCTTCAAGCAACTTGCCACCTCTCACATTATTTGGACGAGCTCGACTTGCTGCTGAAGATGCTGATTTGTTGCAGGAAGAAGTTTCA
TATCCGTCTACAGACAATGAGCCTGTAGAATCAGTTGGAGATAAGGTTGCAGACTTGTCCATTCATGATGATAATGGTAGACTGAGTGACGAATTATCAAAGAATTCTAA
AAACAAGGTTCACTCTGAAGGAAAACACGAGAAGTTTGCGAATGATGGGTTATTAGCTGATGATAACACCTCTCTTTGCTCTGATCGGGTTAGGCTCCATGTTTCTGGAG
TAAAAGTTGTGTCCATGAACAACACTGATCCACCAGAAGGAGTTAGCAGTAAGGTAGAACTCTATATCCATGAGCAAAGAAGTAGAGTGGAGGGAAACAAGTCGTTAGAT
GATATTGAAGATGTTGGTAGTGTTCCTGAAACATGCTTTGAGAAGTTAGTAGATGATAGGATAGCAGTATGTAGTAATTCACATCTTTTAAGTGTTGGATCTTCAAAGGG
TTTTAATGACAAGTTCAGAAAGTTGTACAGGTCAAAGAATGTTCCTGTGCCCGAGCCTCTTCCTTCACTGGTGAAACTTATGAAATCTAGAAAACGCGCTAAGAGGAATG
CATATTTCTAG
Protein sequence
MPIEMPQGLPFSVDTWTPSSNIKRHHFLTHAHRDHTTGIVAHSSFPIYSTFLTKAIVLQQFPQLDDSLFVCIEMGQTLVIKDPDGPFTVTVFDAYHCPGAAMFLFEGNFG
NILHTGDCRLTPECIQNLPEKYRGKNGKEPRCQLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKIFVNESSKAGYKAL
ELIDPDILTQDPSSCFHLLEGFPSLCERAKALLADARTNVQHEPLIIRPSTQWYVREELSEVCNSRKLIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTT
PGCRAIDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDVSVMEVGCSPMAEAPTQINIEPQLQPVKLYDVCKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVS
YPSTDNEPVESVGDKVADLSIHDDNGRLSDELSKNSKNKVHSEGKHEKFANDGLLADDNTSLCSDRVRLHVSGVKVVSMNNTDPPEGVSSKVELYIHEQRSRVEGNKSLD
DIEDVGSVPETCFEKLVDDRIAVCSNSHLLSVGSSKGFNDKFRKLYRSKNVPVPEPLPSLVKLMKSRKRAKRNAYF