; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009220 (gene) of Snake gourd v1 genome

Gene IDTan0009220
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDRMBL domain-containing protein
Genome locationLG07:14548890..14563164
RNA-Seq ExpressionTan0009220
SyntenyTan0009220
Gene Ontology termsGO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0031848 - protection from non-homologous end joining at telomere (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domainsIPR011084 - DNA repair metallo-beta-lactamase
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605321.1 5' exonuclease Apollo, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0090.02Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GIAAAHSSFPI+ST +TKSIVLQ FPQLHDSLFV IEVGQ LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKY GKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELI+PDILTQDPSSRFHLL GFPK CQ+AK+LLA+AQTNFQPEPL+IRPSTQWYVREE SE+CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK SC+SLTSNGLIWKLFG++EESSSDLDASAIEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE
        LYAVP+EML+ILSSSNLPPLTLFGRARLAAEDA++L EEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDK S++S NEVNS+GK +KFANDR LLADE
Subjt:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE

Query:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGC-SNSHLLSVGSSKGFNDRFRKL
         AS CSDRA LH SEVKVVSMNNNNPPEAVSSE EELH+HEQESR KGN+SLDDCEDV T  ETHIGKLV DDRI GC SNSH LSVGSSKGFNDRFRKL
Subjt:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGC-SNSHLLSVGSSKGFNDRFRKL

Query:  YRSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        YRSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  YRSMNVPVPEPLPSLVELMKSRKRAKRNAYF

XP_022948238.1 uncharacterized protein LOC111451874 isoform X1 [Cucurbita moschata]0.0e+0089.52Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GIAAAHSSFPI+ST +TKSIVLQ FPQLHDSLFV IEVGQ LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELI+PDILTQDPSSRFHLL GFPK CQ+AKALLA+AQTNFQPEPL+IRPSTQWYVREE SE+CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+ EESSSDLDAS IEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE
        LYAVP+E L+ILS SNLPPLTLFGRARLA +DA++L EEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDK S++SKNEVNS+GKHEKFANDR LLADE
Subjt:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE

Query:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY
        +AS CSDRARLH SEV+VVSMNNNNPPEAVSSE EELH+HEQESR KG++SLDDCEDV T  +THIGKLV DDR+   SNSH+LSVGSSKGFNDRFRKLY
Subjt:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

XP_023007139.1 protein artemis isoform X1 [Cucurbita maxima]0.0e+0089.84Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GIAAAHSSFPI+ST +TKSIVLQ FPQLHDSLFV IEVGQ LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELI+PDILTQDPSSRFHLL GFPK CQ+AKALLA+AQTNFQPEPL+IRPSTQWYVREE SE CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+ EESSSDLDASAIEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE
        LYAVP+E L+ILS SNLPPLTLFGRARL AEDA++L EEVSYPS ENEPVEAVGDKVADLSIHDANGRPSDK S++SKNEVNS+GKHEKFAN R LLADE
Subjt:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE

Query:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY
         AS CSDRARLH+SEVKVVSMNNNNPPEAVSSE EELH HEQESR KGN+SLDDCEDV T  ET IGKLV DDRI GCSNSH+LSVGSSKGFN RFRKLY
Subjt:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

XP_023532363.1 uncharacterized protein LOC111794563 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0089.98Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GIAAAHSSFPI+ST +TKSIVLQ FPQLHDSLFV IEVGQ LVVKDPDG FTVTV DAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELI+PDILTQDPSSRFHLL GFPK CQ+AKALLA+AQTNFQPEPL+IRPSTQWYVREE SE+CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK SCSSLTSNGLIW+LFG+ EESSSDLDASAIEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE
        LYAVP+E L+ILSSSNLPPLTLFGRARLA +DA++L EEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDK S++SKNE+NS+GKHEKFAN+R LLADE
Subjt:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE

Query:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY
        +AS CSD ARLH SEVKVVSMNNNNPPEAVSSE EELH+HEQESR  GN+SLDDCEDV T  ETHIGKLV DDRI GCSNSH+LSVGSSKGFNDRFRKLY
Subjt:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY
        RSMNV VPEPLPSLVELMKSRKRAKRNAY
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY

XP_023532364.1 uncharacterized protein LOC111794563 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0088.71Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GIAAAHSSFPI+ST +TKSIVLQ FPQLHDSLFV IEVGQ LVVKDPDG FTVTV DAHHCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELI+PDILTQDPSSRFHLL GFPK CQ+AKALLA+AQTNFQPEPL+IRPSTQWYVREE SE+CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK SCSSLTSNGLIW+LFG+ EESSSDLDASAIEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE
        LYAVP+E L+ILSSSNLPPLTLFGRARLA +DA++L EEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDK S++SKNE+NS+GKHEKFAN+R LLADE
Subjt:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE

Query:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY
        +AS CSD ARLH SEVKVVSMNNNNPPEAVSSE EELH+HEQESR  GN+SLDDCEDV T  ETHIGKLV DDRI GCSNSH+LSVGSSKGFNDRFRKLY
Subjt:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY
        RSMNV VPEPLPSLVELMKSRKRAKRNAY
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY

TrEMBL top hitse value%identityAlignment
A0A6J1DVN9 protein artemis0.0e+0085.42Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG
        MPIEMPQGLPFSVDTWTPSSKQK HHFLTHAHRDHT GI   HSSFPIYSTSLTK IVLQ FPQ+ DSLFV IE+GQ LVVKDPDGAFTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG
        SQTFGSKIFVDE  KAGYKALELI+P+ILTQDPSSRFHLL GFPK CQ AK LL DAQTNFQ EPLIIRPSTQWYV EE SEV  TRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQIL PKWV STTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGI EESSSDLD S +EV CSP+VEAP Q ++DPQLQPVK
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTEN-EPVEAVGDKVADLSIHDA-NGRPSDKSSENSKNEVNSEGKHEKFANDRLLAD
        LYA PKEMLN+LSSSNLPPLTLFGRARL A++ADLL EEV YPST N EPVEAVG KV DLSIHDA NG+ SD+SSENS+NEVNSE KH+KFAND LL D
Subjt:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTEN-EPVEAVGDKVADLSIHDA-NGRPSDKSSENSKNEVNSEGKHEKFANDRLLAD

Query:  ENASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKL
         NAS+ S+R RLHVSEVKV SMN+ +PP+ V S  EEL+IH Q+ RVKGNESL DCEDV +  ETH GKL+ DDRI  C NSHLLSVGSSKGFND+FRKL
Subjt:  ENASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKL

Query:  YRSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        YRSMNVPVP+PLPSLVELMKSRKRAK+NAYF
Subjt:  YRSMNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1G8U0 uncharacterized protein LOC111451874 isoform X10.0e+0089.52Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GIAAAHSSFPI+ST +TKSIVLQ FPQLHDSLFV IEVGQ LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELI+PDILTQDPSSRFHLL GFPK CQ+AKALLA+AQTNFQPEPL+IRPSTQWYVREE SE+CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+ EESSSDLDAS IEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE
        LYAVP+E L+ILS SNLPPLTLFGRARLA +DA++L EEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDK S++SKNEVNS+GKHEKFANDR LLADE
Subjt:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE

Query:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY
        +AS CSDRARLH SEV+VVSMNNNNPPEAVSSE EELH+HEQESR KG++SLDDCEDV T  +THIGKLV DDR+   SNSH+LSVGSSKGFNDRFRKLY
Subjt:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1G997 uncharacterized protein LOC111451874 isoform X20.0e+0088.25Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GIAAAHSSFPI+ST +TKSIVLQ FPQLHDSLFV IEVGQ LVVKDPDG FTVTVFDAHHCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELI+PDILTQDPSSRFHLL GFPK CQ+AKALLA+AQTNFQPEPL+IRPSTQWYVREE SE+CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+ EESSSDLDAS IEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE
        LYAVP+E L+ILS SNLPPLTLFGRARLA +DA++L EEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDK S++SKNEVNS+GKHEKFANDR LLADE
Subjt:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE

Query:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY
        +AS CSDRARLH SEV+VVSMNNNNPPEAVSSE EELH+HEQESR KG++SLDDCEDV T  +THIGKLV DDR+   SNSH+LSVGSSKGFNDRFRKLY
Subjt:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1L262 uncharacterized protein LOC111499723 isoform X20.0e+0088.57Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GIAAAHSSFPI+ST +TKSIVLQ FPQLHDSLFV IEVGQ LVVKDP+G FTVTVFDAHHCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELI+PDILTQDPSSRFHLL GFPK CQ+AKALLA+AQTNFQPEPL+IRPSTQWYVREE SE CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+ EESSSDLDASAIEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE
        LYAVP+E L+ILS SNLPPLTLFGRARL AEDA++L EEVSYPS ENEPVEAVGDKVADLSIHDANGRPSDK S++SKNEVNS+GKHEKFAN R LLADE
Subjt:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE

Query:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY
         AS CSDRARLH+SEVKVVSMNNNNPPEAVSSE EELH HEQESR KGN+SLDDCEDV T  ET IGKLV DDRI GCSNSH+LSVGSSKGFN RFRKLY
Subjt:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1L450 protein artemis isoform X10.0e+0089.84Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GIAAAHSSFPI+ST +TKSIVLQ FPQLHDSLFV IEVGQ LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELI+PDILTQDPSSRFHLL GFPK CQ+AKALLA+AQTNFQPEPL+IRPSTQWYVREE SE CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+ EESSSDLDASAIEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE
        LYAVP+E L+ILS SNLPPLTLFGRARL AEDA++L EEVSYPS ENEPVEAVGDKVADLSIHDANGRPSDK S++SKNEVNS+GKHEKFAN R LLADE
Subjt:  LYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDR-LLADE

Query:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY
         AS CSDRARLH+SEVKVVSMNNNNPPEAVSSE EELH HEQESR KGN+SLDDCEDV T  ET IGKLV DDRI GCSNSH+LSVGSSKGFN RFRKLY
Subjt:  NASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

SwissProt top hitse value%identityAlignment
B0V2S2 5' exonuclease Apollo1.0e-2334.05Show/hide
Query:  PFSVDTW-TPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQ--VLVVKD-PDGAFTVTVFDAHHCPGAVMFL
        P +VD W        R  FL+H H DHT G+++  S  PIY + LT  ++  +  Q+       +E+GQ  +L++ D      TV + DA+HCPGAVMFL
Subjt:  PFSVDTW-TPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQ--VLVVKD-PDGAFTVTVFDAHHCPGAVMFL

Query:  FEGNFGNILHTGDCRLTPE-----CLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        F+G FG  L+TGD R TP      CLQN                +D+++LD T     +  PSR  +  QI   I  HP   +V  + + LG+E +L  +
Subjt:  FEGNFGNILHTGDCRLTPE-----CLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELINPDILTQD
        +  F + + VD       + L+L  PD+ T D
Subjt:  SQTFGSKIFVDEYTKAGYKALELINPDILTQD

D2H8V8 5' exonuclease Apollo5.2e-2333.46Show/hide
Query:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVK-DPDG--AFTVTVFDAHHCPGAVMFL
        P +VD W+   +   R  FL+H H DHT G+++  +  P+Y + +T  +V +   Q+       +EVG+  V+  D  G    TVT+ DA+HCPG+VMFL
Subjt:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVK-DPDG--AFTVTVFDAHHCPGAVMFL

Query:  FEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKL----DLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
        FEG FG IL+TGD R TP  L            KEP  KL      ++LD T        PSR  +  QI+  I KHP   +   + + LG+E +L+Q++
Subjt:  FEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKL----DLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVDEYTKAGYKALELIN----PDILT-QDPSSRFHLLDGFPKQCQSA
          F + + +        + LEL+      D+ T ++ + R H +D   + C SA
Subjt:  QTFGSKIFVDEYTKAGYKALELIN----PDILT-QDPSSRFHLLDGFPKQCQSA

Q4KLY6 5' exonuclease Apollo8.9e-2328.96Show/hide
Query:  IEMPQGLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQ---VLVVKDPDGAFTVTVFDAHHC
        + +PQ  P +VD W+   +   R  FL+H H DHT G+++  +  P+Y + +T  ++ ++  Q+       +E+G+   +L+ +      TVT+ DA+HC
Subjt:  IEMPQGLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQ---VLVVKDPDGAFTVTVFDAHHC

Query:  PGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQ
        PG+VMFLFEG FG IL+TGD R TP  L+  P    GK       ++  ++LD T        PSR  +  QII  I + P   +   + + LG+E +L+
Subjt:  PGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQ

Query:  QVSQTFGSKIFVDEYTKAGYKALELIN----PDILT-QDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISE
        Q++  F + + +        + LEL+      D+ T ++ + R H +D   + C SA                      QW     +  +  T ++I S 
Subjt:  QVSQTFGSKIFVDEYTKAGYKALELIN----PDILT-QDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISE

Query:  AIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVV
               I+ + YS HSS  EL   +  L P  VV
Subjt:  AIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVV

Q5QJC3 5' exonuclease Apollo3.6e-2434.32Show/hide
Query:  GLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPGAVMFLF
        G P +VD W+   +   R  FL+H H DHT G+++  S  P+Y + LT  ++  +  ++       +EVGQ   V +     TVT+ DA+HCPG+VMFLF
Subjt:  GLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPGAVMFLF

Query:  EGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGS
        EG FG IL+TGD R +P  +Q  P      SG+    ++D ++LD T  R     PSR  +  Q    I +HP   +V  + + LG+E++L  ++  FG+
Subjt:  EGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGS

Query:  KIFVDEYTKAGYKALELINPDIL-TQDPSSRFHLLD
         + V        + LEL  P++  T++ + R H +D
Subjt:  KIFVDEYTKAGYKALELINPDIL-TQDPSSRFHLLD

Q9H816 5' exonuclease Apollo6.2e-2433.61Show/hide
Query:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVK-DPDG--AFTVTVFDAHHCPGAVMFL
        P +VD W+   +   R  FL+H H DHT G+++  +  P+Y + +T  + L +  Q+       +EVG+  V+  D  G    TVT+ DA+HCPG+VMFL
Subjt:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVK-DPDG--AFTVTVFDAHHCPGAVMFL

Query:  FEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFG
        FEG FG IL+TGD R TP  L+  P    GK       ++  ++LD T        PSR  + HQI+  I KHP   +   + + LG+E +L+Q++  F 
Subjt:  FEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFG

Query:  SKIFVDEYTKAGYKALELIN----PDILT-QDPSSRFHLLD
        + + +        + LEL+      D+ T ++ + R H +D
Subjt:  SKIFVDEYTKAGYKALELIN----PDILT-QDPSSRFHLLD

Arabidopsis top hitse value%identityAlignment
AT1G19025.1 DNA repair metallo-beta-lactamase family protein1.0e-14344.34Show/hide
Query:  MPIEMPQGLPFSVDT---WTPSSKQKRHHFLTHAHRDHTTGIAAAH-SSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAH
        M IEMP+GLPF+VDT   +T + ++KRHHFLTHAH+DHT G++ ++   FPIYSTSLT S++LQ+FPQL +S FVR+E+GQ ++V DPDG F VT FDA+
Subjt:  MPIEMPQGLPFSVDT---WTPSSKQKRHHFLTHAHRDHTTGIAAAH-SSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAH

Query:  HCPGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKS-GKEPRCKLDLIFLDCTFGR--FFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQ
        HCPGAVMFLFEG+FGNILHTGDCRLT +CL +LPEKY G+S G +P+C L  IFLDCTFG+    Q+FP++HS+I QIINCIW HPDAP+VYL C++LGQ
Subjt:  HCPGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKS-GKEPRCKLDLIFLDCTFGR--FFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQ

Query:  EDILQQVSQTFGSKIFVDEYTKAG-YKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREE-----SSEVCNTR
        ED+L +VS+TFGSKI+VD+ T    +++L +I P+I+++DPSSRFH+  GFPK  +   A LA+A++  Q EPLIIRPS QWYV ++     S  +   R
Subjt:  EDILQQVSQTFGSKIFVDEYTKAG-YKALELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREE-----SSEVCNTR

Query:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESS--SDLDASAIEVSCSPIVE
        K   SEA+KD+ G+WHVCYSMHSS+ ELE A+Q+L+PKWVVST P CRAM+L+YVKK    S  + +   WKL  ID E+S  +  D   + +SC  + E
Subjt:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESS--SDLDASAIEVSCSPIVE

Query:  APTQKDMDPQLQPVKLYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEG
                 +L+PV   +  K+ L  LS  N  P+TLFGRAR ++++ D L E                                       +  ++++ 
Subjt:  APTQKDMDPQLQPVKLYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEG

Query:  KHEKFANDRLLADENASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRV-KGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLS
         + K +                  L    VKVV        E +  + +E  + ++   +   + S + C+D++  L                       
Subjt:  KHEKFANDRLLADENASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRV-KGNESLDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLS

Query:  VGSSKGFNDRFRKLYRSMNVPVPEPLPSLVELMKSRKRAKRNAYF
                   RKLYRSMN PVP PLPSL+ELM +RKR++ +  F
Subjt:  VGSSKGFNDRFRKLYRSMNVPVPEPLPSLVELMKSRKRAKRNAYF

AT1G27410.1 DNA repair metallo-beta-lactamase family protein2.2e-2929.41Show/hide
Query:  MPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLF--VRIEVGQVLVVKDPDGAFTVTV----FDAHHC
        M  GL  SVD W   S+    +FLTH H DHT G++   S  P+Y +  T S+   +FP    SL   V +     L ++ P    TV +     DAHHC
Subjt:  MPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLF--VRIEVGQVLVVKDPDGAFTVTV----FDAHHC

Query:  PGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQ
        PG++MFLF G+FG  L+TGD R   +              + P   +D+++LD T+      FPSR  ++  + + I  HP   ++  + + LG+ED+L 
Subjt:  PGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQ

Query:  QVSQTFGSKIFVDEYTKAGYKALELIN-PDILTQDPS-SRFHLLDGFPKQCQSAKALLADAQT-NFQPE--PLIIRP-------STQWYVREESSEVCNT
         VS+    KI+V        + + L+   DI T D S +R   +  +    Q+ + L     T    P   P + RP       S  +      +E  + 
Subjt:  QVSQTFGSKIFVDEYTKAGYKALELIN-PDILTQDPS-SRFHLLDGFPKQCQSAKALLADAQT-NFQPE--PLIIRP-------STQWYVREESSEVCNT

Query:  RKQIISEAIKDQHG-IWHVCYSMHSSKEELEWALQILAPK
        +K++ + A+   H  ++ V YS HS  EE+   ++++ PK
Subjt:  RKQIISEAIKDQHG-IWHVCYSMHSSKEELEWALQILAPK

AT1G66730.1 DNA LIGASE 64.4e-1725.15Show/hide
Query:  FSVDTW-TPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPGAVMFLFE--
        F VD +  P        FL+H H DH +G++++ S   IY +  T  +V  +  Q+       + + Q++ +   DG+  V + +A+HCPGAV FLF+  
Subjt:  FSVDTW-TPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPGAVMFLFE--

Query:  ---GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICN-LLGQEDILQQVSQT
             F   +HTGD R   E          G  G       D +FLD T+      FPS+  S+  +++ I K  +  +++L+   ++G+E IL ++++ 
Subjt:  ---GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICN-LLGQEDILQQVSQT

Query:  FGSKIFVDEYTKAGYKALELINPDILTQDPS-SRFHLLDGFPKQCQSAKALLADAQTNFQP------EPLIIRPSTQWYVREESSEVCNTRKQIISEAIK
           KI VD    +    L      + T+D + S  H++            +L +    F+P      E ++ +   +      +      ++   +   K
Subjt:  FGSKIFVDEYTKAGYKALELINPDILTQDPS-SRFHLLDGFPKQCQSAKALLADAQTNFQP------EPLIIRPSTQWYVREESSEVCNTRKQIISEAIK

Query:  DQHGIWHVCYSMHSSKEELEWALQILAPKWVVST
        D   I  V YS HS+ +EL   ++ L PK V+ T
Subjt:  DQHGIWHVCYSMHSSKEELEWALQILAPKWVVST

AT3G26680.1 DNA repair metallo-beta-lactamase family protein4.4e-1726.87Show/hide
Query:  GLPFSVDTWTPSSKQK-RHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPGAVMFLF
        G PF+VD +     Q    +FLTH H DH  G+  A S  PIY +SLT S +L+    ++ S    +E    L V+       VT+ +A+HCPGA +  F
Subjt:  GLPFSVDTWTPSSKQK-RHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPGAVMFLF

Query:  EGNFGN-ILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQII----NCIWKHPDAPLVYLICNLLGQEDILQQVS
            G   LHTGD R + + +Q  P  +          ++ +++LD T+     +FPS+   +  ++    + + K P   L+ +    +G+E +   ++
Subjt:  EGNFGN-ILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQII----NCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVDEYTKAGYKAL--ELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEP----LIIRPSTQWYVREESSEVCNTRKQIISEAI
        +  G KIF +   +   ++   + I+ ++ T   ++  H+L        S K    D       E     L  RP T W   E+  E       +I    
Subjt:  QTFGSKIFVDEYTKAGYKAL--ELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEP----LIIRPSTQWYVREESSEVCNTRKQIISEAI

Query:  KDQHGIWHVCYSMHSSKEELEWALQILAPKWVVST
        + +  I+ V YS HSS  EL   +Q L P  ++ T
Subjt:  KDQHGIWHVCYSMHSSKEELEWALQILAPKWVVST

AT3G26680.2 DNA repair metallo-beta-lactamase family protein4.4e-1726.87Show/hide
Query:  GLPFSVDTWTPSSKQK-RHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPGAVMFLF
        G PF+VD +     Q    +FLTH H DH  G+  A S  PIY +SLT S +L+    ++ S    +E    L V+       VT+ +A+HCPGA +  F
Subjt:  GLPFSVDTWTPSSKQK-RHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPGAVMFLF

Query:  EGNFGN-ILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQII----NCIWKHPDAPLVYLICNLLGQEDILQQVS
            G   LHTGD R + + +Q  P  +          ++ +++LD T+     +FPS+   +  ++    + + K P   L+ +    +G+E +   ++
Subjt:  EGNFGN-ILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQII----NCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVDEYTKAGYKAL--ELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEP----LIIRPSTQWYVREESSEVCNTRKQIISEAI
        +  G KIF +   +   ++   + I+ ++ T   ++  H+L        S K    D       E     L  RP T W   E+  E       +I    
Subjt:  QTFGSKIFVDEYTKAGYKAL--ELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEP----LIIRPSTQWYVREESSEVCNTRKQIISEAI

Query:  KDQHGIWHVCYSMHSSKEELEWALQILAPKWVVST
        + +  I+ V YS HSS  EL   +Q L P  ++ T
Subjt:  KDQHGIWHVCYSMHSSKEELEWALQILAPKWVVST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGATCGAAATGCCCCAAGGGCTGCCATTCTCGGTGGATACATGGACTCCATCTTCCAAGCAAAAGCGCCACCATTTTCTAACGCACGCTCACAGGGATCACACCAC
TGGAATTGCCGCAGCCCATTCTTCCTTCCCGATTTATTCTACTTCTCTCACCAAATCCATCGTTCTCCAGCAGTTCCCTCAGCTTCATGATTCGTTGTTTGTACGTATCG
AGGTGGGGCAAGTGCTGGTCGTCAAAGATCCTGATGGTGCTTTCACTGTCACAGTTTTCGATGCTCATCACTGCCCTGGAGCTGTTATGTTCTTATTTGAAGGCAATTTT
GGCAATATTCTGCATACGGGTGATTGCAGACTAACTCCTGAGTGCCTACAGAACTTACCTGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCGAGATGTAAACTGGATCT
GATTTTTCTAGATTGCACATTTGGTAGATTTTTTCAACAATTCCCCAGCAGGCATTCATCAATACATCAGATTATTAATTGCATATGGAAACACCCAGATGCTCCTTTAG
TATATCTGATTTGCAATCTTCTAGGACAGGAAGATATATTGCAACAAGTATCCCAAACATTTGGTTCAAAGATATTTGTTGACGAATACACTAAAGCAGGTTACAAGGCT
CTTGAACTTATAAATCCTGACATCCTCACTCAAGATCCATCCTCCCGCTTTCATCTGCTTGATGGATTCCCTAAACAATGTCAAAGTGCAAAAGCACTGCTTGCAGATGC
CCAGACCAATTTTCAGCCCGAACCTCTCATAATCCGCCCTTCGACCCAGTGGTATGTTCGTGAGGAATCATCAGAGGTTTGCAACACAAGGAAACAAATAATTAGTGAAG
CAATTAAAGATCAGCATGGTATTTGGCATGTCTGTTATTCGATGCACTCGTCGAAGGAAGAACTAGAGTGGGCCTTGCAAATTTTAGCACCAAAATGGGTTGTTTCAACC
ACTCCTGGTTGTCGGGCTATGGATTTGGACTACGTGAAAAAGAAACTCAGTTGCTCTAGTTTAACTTCCAATGGTCTAATCTGGAAGCTTTTTGGTATAGATGAGGAAAG
TTCTTCAGATTTAGATGCTTCAGCGATTGAAGTGAGCTGTTCCCCTATAGTTGAAGCACCCACTCAAAAAGATATGGACCCTCAACTACAGCCTGTGAAACTATATGCTG
TTCCTAAAGAAATGTTAAACATTTTGTCTTCAAGCAACTTGCCACCTCTCACACTATTCGGACGAGCTAGACTTGCCGCTGAAGATGCAGATTTGTTGCAGGAAGAAGTT
TCATATCCATCTACAGAGAATGAGCCTGTAGAAGCAGTTGGAGATAAAGTAGCAGACTTGTCCATTCATGATGCAAATGGTAGACCGAGTGACAAATCATCAGAAAATTC
AAAAAATGAAGTCAACTCTGAAGGAAAACATGAAAAGTTCGCAAATGACAGGTTATTAGCTGATGAAAACGCCTCTCTTTGCTCCGATCGGGCTAGGCTCCATGTTTCTG
AAGTAAAAGTTGTGTCCATGAATAACAATAACCCACCAGAAGCAGTCAGCAGTGAGGCAGAAGAACTTCATATCCACGAGCAAGAAAGTAGAGTGAAGGGAAACGAGTCG
TTAGACGATTGTGAAGATGTCAATACCGCTCTCGAAACACACATTGGGAAGTTAGTAAATGATGACAGAATAGTAGGCTGTAGTAATTCACATCTTTTAAGTGTAGGATC
TTCAAAGGGGTTTAATGACAGGTTTAGAAAGTTGTACAGGTCAATGAATGTTCCTGTGCCTGAGCCTCTTCCTTCGCTCGTGGAGCTTATGAAATCGAGAAAACGGGCGA
AGCGGAATGCATATTTCTAG
mRNA sequenceShow/hide mRNA sequence
TTTGCTTGCAGTGGAAGCTTTGATTGAGACGGCTATGAGTTTTCAAATTAGGGCTCATAAGTTTCATTTTCGTCTTCGTCTTCCTCTCCCATCGGAATTTTCCCTCAGAG
CTCGAAGATGCCGATCGAAATGCCCCAAGGGCTGCCATTCTCGGTGGATACATGGACTCCATCTTCCAAGCAAAAGCGCCACCATTTTCTAACGCACGCTCACAGGGATC
ACACCACTGGAATTGCCGCAGCCCATTCTTCCTTCCCGATTTATTCTACTTCTCTCACCAAATCCATCGTTCTCCAGCAGTTCCCTCAGCTTCATGATTCGTTGTTTGTA
CGTATCGAGGTGGGGCAAGTGCTGGTCGTCAAAGATCCTGATGGTGCTTTCACTGTCACAGTTTTCGATGCTCATCACTGCCCTGGAGCTGTTATGTTCTTATTTGAAGG
CAATTTTGGCAATATTCTGCATACGGGTGATTGCAGACTAACTCCTGAGTGCCTACAGAACTTACCTGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCGAGATGTAAAC
TGGATCTGATTTTTCTAGATTGCACATTTGGTAGATTTTTTCAACAATTCCCCAGCAGGCATTCATCAATACATCAGATTATTAATTGCATATGGAAACACCCAGATGCT
CCTTTAGTATATCTGATTTGCAATCTTCTAGGACAGGAAGATATATTGCAACAAGTATCCCAAACATTTGGTTCAAAGATATTTGTTGACGAATACACTAAAGCAGGTTA
CAAGGCTCTTGAACTTATAAATCCTGACATCCTCACTCAAGATCCATCCTCCCGCTTTCATCTGCTTGATGGATTCCCTAAACAATGTCAAAGTGCAAAAGCACTGCTTG
CAGATGCCCAGACCAATTTTCAGCCCGAACCTCTCATAATCCGCCCTTCGACCCAGTGGTATGTTCGTGAGGAATCATCAGAGGTTTGCAACACAAGGAAACAAATAATT
AGTGAAGCAATTAAAGATCAGCATGGTATTTGGCATGTCTGTTATTCGATGCACTCGTCGAAGGAAGAACTAGAGTGGGCCTTGCAAATTTTAGCACCAAAATGGGTTGT
TTCAACCACTCCTGGTTGTCGGGCTATGGATTTGGACTACGTGAAAAAGAAACTCAGTTGCTCTAGTTTAACTTCCAATGGTCTAATCTGGAAGCTTTTTGGTATAGATG
AGGAAAGTTCTTCAGATTTAGATGCTTCAGCGATTGAAGTGAGCTGTTCCCCTATAGTTGAAGCACCCACTCAAAAAGATATGGACCCTCAACTACAGCCTGTGAAACTA
TATGCTGTTCCTAAAGAAATGTTAAACATTTTGTCTTCAAGCAACTTGCCACCTCTCACACTATTCGGACGAGCTAGACTTGCCGCTGAAGATGCAGATTTGTTGCAGGA
AGAAGTTTCATATCCATCTACAGAGAATGAGCCTGTAGAAGCAGTTGGAGATAAAGTAGCAGACTTGTCCATTCATGATGCAAATGGTAGACCGAGTGACAAATCATCAG
AAAATTCAAAAAATGAAGTCAACTCTGAAGGAAAACATGAAAAGTTCGCAAATGACAGGTTATTAGCTGATGAAAACGCCTCTCTTTGCTCCGATCGGGCTAGGCTCCAT
GTTTCTGAAGTAAAAGTTGTGTCCATGAATAACAATAACCCACCAGAAGCAGTCAGCAGTGAGGCAGAAGAACTTCATATCCACGAGCAAGAAAGTAGAGTGAAGGGAAA
CGAGTCGTTAGACGATTGTGAAGATGTCAATACCGCTCTCGAAACACACATTGGGAAGTTAGTAAATGATGACAGAATAGTAGGCTGTAGTAATTCACATCTTTTAAGTG
TAGGATCTTCAAAGGGGTTTAATGACAGGTTTAGAAAGTTGTACAGGTCAATGAATGTTCCTGTGCCTGAGCCTCTTCCTTCGCTCGTGGAGCTTATGAAATCGAGAAAA
CGGGCGAAGCGGAATGCATATTTCTAGTCAGAACATGTCAAGCAAGAAGGGAACACAGCATAGCTCATGGCTATTGGTGGCTTGCTAAGTGATAAGGTACACAATTTTTA
TCTGTTGTATAGTAAAAGTATATAGAGATGATATTTACATTACTCCAAAGTTGGGAAAGCTTAGAGACATTGTTGTTACATTTTTTTTTTCTTTTTTCAAATTCTTGGTT
GTTCAATGTATGAATAAATCTGGTTAATTCTTATCCTATGAATAAAAG
Protein sequenceShow/hide protein sequence
MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAAHSSFPIYSTSLTKSIVLQQFPQLHDSLFVRIEVGQVLVVKDPDGAFTVTVFDAHHCPGAVMFLFEGNF
GNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSSIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKIFVDEYTKAGYKA
LELINPDILTQDPSSRFHLLDGFPKQCQSAKALLADAQTNFQPEPLIIRPSTQWYVREESSEVCNTRKQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVST
TPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIDEESSSDLDASAIEVSCSPIVEAPTQKDMDPQLQPVKLYAVPKEMLNILSSSNLPPLTLFGRARLAAEDADLLQEEV
SYPSTENEPVEAVGDKVADLSIHDANGRPSDKSSENSKNEVNSEGKHEKFANDRLLADENASLCSDRARLHVSEVKVVSMNNNNPPEAVSSEAEELHIHEQESRVKGNES
LDDCEDVNTALETHIGKLVNDDRIVGCSNSHLLSVGSSKGFNDRFRKLYRSMNVPVPEPLPSLVELMKSRKRAKRNAYF