CuGenDBv2

Spg027216 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg027216
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DRMBL domain-containing protein
Genome location: scaffold8:3933599..3939854
RNA-Seq Expression: Spg027216
Synteny: Spg027216
Gene Ontology terms:
GO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0031848 - protection from non-homologous end joining at telomere (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domains:
IPR011084 - DNA repair metallo-beta-lactamase
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like
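
The genome location in this record is written as scaffold:start..end. The following is a minimal sketch of how such a string could be split into its parts and turned into a span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


scaffold, start, end = parse_location("scaffold8:3933599..3939854")
print(scaffold, start, end, span_length(start, end))
```

Under that assumption the gene spans 6256 positions on scaffold8. If the coordinates were 0-based or half-open, the length would differ by one.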


Homology
GenBank top hits (e value, % identity, alignment)
KAG6605321.1 5' exonuclease Apollo, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 86.53)
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKY GKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INC+WKHPDAP VYLIC+ LGQE+ILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AK+LLA+AQTN Q EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK SC+SLTSNGLIWKLFG+ EESSSDLDAS IEV CSPIVE ST KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK

Query:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+EML IL S NLPPLT FGRARL AED ++L EEVSYPSTE EPVEAVGDKVADLSIHDAN R S +  ++S NEVNS+ K +KF ND  LL ++
Subjt:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGC-SNSHLLSVGSSKGFNDRFRKL
         AS CSDR  LH SEVKVV MNN N PEAVSSEVEELHVHEQ  R KGN++LD CEDV TV ETH GKLV DDRIAGC SNSH LSVGSSKGFNDRFRKL
Subjt:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGC-SNSHLLSVGSSKGFNDRFRKL

Query:  YRSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        YRSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  YRSMNVPVPEPLPSLVELMKSRKRAKRNAYF
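
In BLAST-style output the middle line of each block repeats the residue letter where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. A rough sketch of counting identities from that middle line, using only the final block of the alignment above, might look like this. The helper name `midline_identity` is hypothetical, and the result covers this one block only, not the whole hit.

```python
def midline_identity(midline: str, aligned_length: int) -> float:
    """Percent of alignment columns whose midline shows a residue letter."""
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Final block of the KAG6605321.1 alignment, copied from the record.
query = "YRSMNVPVPEPLPSLVELMKSRKRAKRNAYF"
midline = "YRSMNV VPEPLPSLVELMKSRKRAKRNAYF"

print(round(midline_identity(midline, len(query)), 2))
```

Applying the same count across every block of a hit, and dividing by the total number of aligned columns, should approximate the % identity reported for that hit, assuming the page computes identity over the full alignment length.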

XP_022948238.1 uncharacterized protein LOC111451874 isoform X1 [Cucurbita moschata] (e value: 0.0e+00; % identity: 86.35)
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INC+WKHPDAP VYLIC+ LGQE+ILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTN Q EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLDASVIEV CSPIVE ST KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK

Query:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLT FGRARL  +D ++L EEVSYPSTE EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF ND  LL ++
Subjt:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
        +AS CSDR RLH SEV+VV MNN N PEAVSSEVEELHVHEQ  R KG+++LD CEDV TV +TH GKLV DDR+   SNSH+LSVGSSKGFNDRFRKLY
Subjt:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

XP_023007139.1 protein artemis isoform X1 [Cucurbita maxima] (e value: 0.0e+00; % identity: 86.35)
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INC+WKHPDAP VYLIC+ LGQE+ILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTN Q EPL+IRPSTQWYVREE SE CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLDAS IEV CSPIVE ST KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK

Query:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLT FGRARL AED ++L EEVSYPS E EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF N   LL ++
Subjt:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
         AS CSDR RLH+SEVKVV MNN N PEAVSSEVEELH HEQ  R KGN++LD CEDV TV ET  GKLV DDRIAGCSNSH+LSVGSSKGFN RFRKLY
Subjt:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

XP_023532363.1 uncharacterized protein LOC111794563 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 86.65)
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTV DAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INC+WKHPDAP VYLIC+ LGQE+ILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTN Q EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK SCSSLTSNGLIW+LFG+AEESSSDLDAS IEV CSPIVE ST KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK

Query:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL S NLPPLT FGRARL  +D ++L EEVSYPSTE EPVEAVGDKVADLSIHDAN R S +  ++SKNE+NS+ KHEKF N+  LL ++
Subjt:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
        +AS CSD  RLH SEVKVV MNN N PEAVSSEVEELHVHEQ  R  GN++LD CEDV TV ETH GKLV DDRIAGCSNSH+LSVGSSKGFNDRFRKLY
Subjt:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY
        RSMNV VPEPLPSLVELMKSRKRAKRNAY
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY

XP_023532364.1 uncharacterized protein LOC111794563 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 85.37)
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTV DAHHCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INC+WKHPDAP VYLIC+ LGQE+ILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTN Q EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK SCSSLTSNGLIW+LFG+AEESSSDLDAS IEV CSPIVE ST KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK

Query:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL S NLPPLT FGRARL  +D ++L EEVSYPSTE EPVEAVGDKVADLSIHDAN R S +  ++SKNE+NS+ KHEKF N+  LL ++
Subjt:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
        +AS CSD  RLH SEVKVV MNN N PEAVSSEVEELHVHEQ  R  GN++LD CEDV TV ETH GKLV DDRIAGCSNSH+LSVGSSKGFNDRFRKLY
Subjt:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY
        RSMNV VPEPLPSLVELMKSRKRAKRNAY
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DVN9 protein artemis (e value: 6.1e-309; % identity: 83.33)
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGA
        MPIEMPQGLPFSVDTWTPSSKQK HHFLTHAHRDHT GI  HSSFPIYST LTK IVLQ FPQ+ DSLFVCIE+GQ+LVVKDPDGAFTVTVFDAHHCPGA
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGA

Query:  VMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQVS
        VMFLFEGNFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQIINC+WKHPDAP VYLICNLLGQE+ILQQVS
Subjt:  VMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQVS

Query:  QTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHGI
        QTFGSKIFVDE  KAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL DAQTN Q EPLIIRPSTQWYV EE SE+  TRKQIISEAIKDQHGI
Subjt:  QTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVKL
        WHVCYSMHSSKEELEWALQIL PKWV STTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLD S +EV CSP+VEA  Q ++DPQLQPVKL
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVKL

Query:  YAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPST-ETEPVEAVGDKVADLSIHDANN-RLSGETPENSKNEVNSEEKHEKFVNDGLLTEK
        YA PKEML +L S NLPPLT FGRARL A++ DLL EEV YPST   EPVEAVG KV DLSIHDANN +LS E+ ENS+NEVNSEEKH+KF NDGLL + 
Subjt:  YAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPST-ETEPVEAVGDKVADLSIHDANN-RLSGETPENSKNEVNSEEKHEKFVNDGLLTEK

Query:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
        NAS+ S+R+RLHVSEVKV  MN+T+ P+ V S VEEL++H Q  RVKGNE+L  CEDV ++ ETH GKL+ DDRI  C NSHLLSVGSSKGFND+FRKLY
Subjt:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNVPVP+PLPSLVELMKSRKRAK+NAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1G8U0 uncharacterized protein LOC111451874 isoform X1 (e value: 0.0e+00; % identity: 86.35)
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INC+WKHPDAP VYLIC+ LGQE+ILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTN Q EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLDASVIEV CSPIVE ST KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK

Query:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLT FGRARL  +D ++L EEVSYPSTE EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF ND  LL ++
Subjt:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
        +AS CSDR RLH SEV+VV MNN N PEAVSSEVEELHVHEQ  R KG+++LD CEDV TV +TH GKLV DDR+   SNSH+LSVGSSKGFNDRFRKLY
Subjt:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1G997 uncharacterized protein LOC111451874 isoform X2 (e value: 0.0e+00; % identity: 85.08)
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTVFDAHHCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INC+WKHPDAP VYLIC+ LGQE+ILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTN Q EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLDASVIEV CSPIVE ST KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK

Query:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLT FGRARL  +D ++L EEVSYPSTE EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF ND  LL ++
Subjt:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
        +AS CSDR RLH SEV+VV MNN N PEAVSSEVEELHVHEQ  R KG+++LD CEDV TV +TH GKLV DDR+   SNSH+LSVGSSKGFNDRFRKLY
Subjt:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1L262 uncharacterized protein LOC111499723 isoform X2 (e value: 0.0e+00; % identity: 85.08)
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDP+G FTVTVFDAHHCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INC+WKHPDAP VYLIC+ LGQE+ILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTN Q EPL+IRPSTQWYVREE SE CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLDAS IEV CSPIVE ST KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK

Query:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLT FGRARL AED ++L EEVSYPS E EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF N   LL ++
Subjt:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
         AS CSDR RLH+SEVKVV MNN N PEAVSSEVEELH HEQ  R KGN++LD CEDV TV ET  GKLV DDRIAGCSNSH+LSVGSSKGFN RFRKLY
Subjt:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1L450 protein artemis isoform X1 (e value: 0.0e+00; % identity: 86.35)
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INC+WKHPDAP VYLIC+ LGQE+ILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQV

Query:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTN Q EPL+IRPSTQWYVREE SE CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLDAS IEV CSPIVE ST KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVK

Query:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLT FGRARL AED ++L EEVSYPS E EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF N   LL ++
Subjt:  LYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
         AS CSDR RLH+SEVKVV MNN N PEAVSSEVEELH HEQ  R KGN++LD CEDV TV ET  GKLV DDRIAGCSNSH+LSVGSSKGFN RFRKLY
Subjt:  NASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

SwissProt top hits (e value, % identity, alignment)
D2H8V8 5' exonuclease Apollo (e value: 5.6e-25; % identity: 33.87)
Query:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCPGAVMFLF
        P +VD W+   +   R  FL+H H DHT G+++  + P+Y + +T  +V +   Q+       +EVG++ V+  D  G    TVT+ DA+HCPG+VMFLF
Subjt:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCPGAVMFLF

Query:  EGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKL----DLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQVSQ
        EG FG IL+TGD R TP  L            KEP  KL      ++LD T        PSR  A  QI+  + KHP   ++ +    LG+E +L+Q++ 
Subjt:  EGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKL----DLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQVSQ

Query:  TFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSA
         F + + +        + L L D   L ++ + R H +D   ++C SA
Subjt:  TFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSA

Q4KLY6 5' exonuclease Apollo (e value: 9.5e-25; % identity: 29.48)
Query:  IEMPQGLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQA-LVVKDPDG--AFTVTVFDAHHCP
        + +PQ  P +VD W+   +   R  FL+H H DHT G+++  + P+Y + +T  ++ ++  Q+       +E+G++ +++ D  G    TVT+ DA+HCP
Subjt:  IEMPQGLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQA-LVVKDPDG--AFTVTVFDAHHCP

Query:  GAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQ
        G+VMFLFEG FG IL+TGD R TP  L+  P    GK       ++  ++LD T        PSR  A  QII  + + P   ++ +    LG+E +L+Q
Subjt:  GAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQ

Query:  VSQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQH
        ++  F + + +        + L L D     ++ + R H +D   ++C SA                      QW     +  I  T ++I S       
Subjt:  VSQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQH

Query:  GIWHVCYSMHSSKEELEWALQILAPKWVV
         I+ + YS HSS  EL   +  L P  VV
Subjt:  GIWHVCYSMHSSKEELEWALQILAPKWVV

Q5QJC3 5' exonuclease Apollo (e value: 3.3e-25; % identity: 34.89)
Query:  GLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE
        G P +VD W+   +   R  FL+H H DHT G+++  S P+Y + LT  ++  +  ++       +EVGQ+  V +     TVT+ DA+HCPG+VMFLFE
Subjt:  GLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE

Query:  GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQVSQTFGSK
        G FG IL+TGD R +P  +Q  P      SG+    ++D ++LD T  R     PSR  A  Q    + +HP    V  + + LG+EE+L  ++  FG+ 
Subjt:  GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQVSQTFGSK

Query:  IFVDEYTKAGYKALELIDPDIL-TQDPSSRFHLLD
        + V        + LEL  P++  T++ + R H +D
Subjt:  IFVDEYTKAGYKALELIDPDIL-TQDPSSRFHLLD

Q8C7W7 5' exonuclease Apollo (e value: 4.3e-25; % identity: 30.4)
Query:  IEMPQGLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCP
        + +PQ  P +VD W+   +   R  FLTH H DHT G+++  + P+Y + +T + +L +  Q+       +EVG++ V+  D  G    TVT+ DA+HCP
Subjt:  IEMPQGLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCP

Query:  GAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQ
        G+VMFLFEG FG IL+TGD R TP  L+  P    GK       ++  ++LD T        PSR  A  QI+  + + P   ++ +    LG+E +L+Q
Subjt:  GAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQ

Query:  VSQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQH
        ++  F + + +        + L L D     ++ + R H +D   ++C SA                      QW     +  I  T +++ S       
Subjt:  VSQTFGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQH

Query:  GIWHVCYSMHSSKEELEWALQILAPKWVV
         I+ V YS HSS  EL   +  L P  VV
Subjt:  GIWHVCYSMHSSKEELEWALQILAPKWVV

Q9H816 5' exonuclease Apollo (e value: 1.5e-25; % identity: 33.47)
Query:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCPGAVMFLF
        P +VD W+   +   R  FL+H H DHT G+++  + P+Y + +T  + L +  Q+       +EVG++ V+  D  G    TVT+ DA+HCPG+VMFLF
Subjt:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCPGAVMFLF

Query:  EGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQVSQTFGS
        EG FG IL+TGD R TP  L+  P    GK       ++  ++LD T        PSR  A HQI+  + KHP   ++ +    LG+E +L+Q++  F +
Subjt:  EGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQVSQTFGS

Query:  KIFVDEYTKAGYKALELID----PDILT-QDPSSRFHLLDGFPKLCQS
         + +        + LEL+      D+ T ++ + R H +D   ++C S
Subjt:  KIFVDEYTKAGYKALELID----PDILT-QDPSSRFHLLDGFPKLCQS

Arabidopsis top hits (e value, % identity, alignment)
AT1G19025.1 DNA repair metallo-beta-lactamase family protein (e value: 3.0e-143; % identity: 44.34)
Query:  MPIEMPQGLPFSVDT---WTPSSKQKRHHFLTHAHRDHTTGIAAHS--SFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAH
        M IEMP+GLPF+VDT   +T + ++KRHHFLTHAH+DHT G++  +   FPIYST LT S++LQ+FPQL +S FV +E+GQ+++V DPDG F VT FDA+
Subjt:  MPIEMPQGLPFSVDT---WTPSSKQKRHHFLTHAHRDHTTGIAAHS--SFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAH

Query:  HCPGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKS-GKEPRCKLDLIFLDCTFGR--FFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQ
        HCPGAVMFLFEG+FGNILHTGDCRLT +CL +LPEKY G+S G +P+C L  IFLDCTFG+    Q+FP++HSAI QIINC+W HPDAP VYL C++LGQ
Subjt:  HCPGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKS-GKEPRCKLDLIFLDCTFGR--FFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQ

Query:  EEILQQVSQTFGSKIFVDEYTKAG-YKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREE-----SSEICNTR
        E++L +VS+TFGSKI+VD+ T    +++L +I P+I+++DPSSRFH+  GFPKL +   A LA+A++ +QSEPLIIRPS QWYV ++     S  I   R
Subjt:  EEILQQVSQTFGSKIFVDEYTKAG-YKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREE-----SSEICNTR

Query:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLDASVIEVSCSPIVE
        K   SEA+KD+ G+WHVCYSMHSS+ ELE A+Q+L+PKWVVST P CRAM+L+YVKK    S  + +   WKL  I  E+S  +  D   + +SC  + E
Subjt:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLDASVIEVSCSPIVE

Query:  ASTQKDMDPQLQPV-KLYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSE
                 +L+PV +  +  K++L + P  +L P+T FGRAR  +++ D L E                                              
Subjt:  ASTQKDMDPQLQPV-KLYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVSYPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSE

Query:  EKHEKFVNDGLLTEKNASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLS
            K ++   +  K + +      L    VKVV        E +  + +E  V ++   +            ST  ET                     
Subjt:  EKHEKFVNDGLLTEKNASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEALDHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLS

Query:  VGSSKGFNDRFRKLYRSMNVPVPEPLPSLVELMKSRKRAKRNAYF
            K  +   RKLYRSMN PVP PLPSL+ELM +RKR++ +  F
Subjt:  VGSSKGFNDRFRKLYRSMNVPVPEPLPSLVELMKSRKRAKRNAYF

AT1G27410.1 DNA repair metallo-beta-lactamase family protein (e value: 6.1e-27; % identity: 28.53)
Query:  MPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEV--GQALVVKDPDGAFTVTV----FDAHHC
        M  GL  SVD W   S+    +FLTH H DHT G++   S  P+Y +  T S+   +FP    SL   + +    +L ++ P    TV +     DAHHC
Subjt:  MPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEV--GQALVVKDPDGAFTVTV----FDAHHC

Query:  PGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQ
        PG++MFLF G+FG  L+TGD R   +              + P   +D+++LD T+      FPSR  A+  + + +  HP +  + +  + LG+E++L 
Subjt:  PGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQ

Query:  QVSQTFGSKIFVDEYTKAGYKALELID-PDILTQDPS-SRFHLLDGFPKLCQSAKALLADAQT---NVQSEPLIIRP-------STQWYVREESSEICNT
         VS+    KI+V        + + L+   DI T D S +R   +  +    Q+ + L     T        P + RP       S  +      +E  + 
Subjt:  QVSQTFGSKIFVDEYTKAGYKALELID-PDILTQDPS-SRFHLLDGFPKLCQSAKALLADAQT---NVQSEPLIIRP-------STQWYVREESSEICNT

Query:  RKQIISEAIKDQHG-IWHVCYSMHSSKEELEWALQILAPK
        +K++ + A+   H  ++ V YS HS  EE+   ++++ PK
Subjt:  RKQIISEAIKDQHG-IWHVCYSMHSSKEELEWALQILAPK

AT1G66730.1 DNA LIGASE 6 (e value: 5.3e-15; % identity: 24.39)
Query:  FSVDTW-TPSSKQKRHHFLTHAHRDHTTGIAAH-SSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE--
        F VD +  P        FL+H H DH +G+++  S   IY +  T  +V  +  Q+       + + Q + +   DG+  V + +A+HCPGAV FLF+  
Subjt:  FSVDTW-TPSSKQKRHHFLTHAHRDHTTGIAAH-SSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE--

Query:  ---GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICN-LLGQEEILQQVSQT
             F   +HTGD R   E          G  G       D +FLD T+      FPS+  ++  +++ + K  +   ++L+   ++G+E+IL ++++ 
Subjt:  ---GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICN-LLGQEEILQQVSQT

Query:  FGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVR-EESSEICNTRKQIISEAIKDQHGIW
           KI VD    +    L   +  + T+D +     + G+  L ++      +    V+   +++       V    +      ++   +   KD   I 
Subjt:  FGSKIFVDEYTKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVR-EESSEICNTRKQIISEAIKDQHGIW

Query:  HVCYSMHSSKEELEWALQILAPKWVVST
         V YS HS+ +EL   ++ L PK V+ T
Subjt:  HVCYSMHSSKEELEWALQILAPKWVVST

AT2G45700.1 sterile alpha motif (SAM) domain-containing protein (e value: 2.3e-18; % identity: 25.69)
Query:  GLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE
        G PF VD +   ++   H FLTH H DH  G+  + S   IY + +T  +V  +     + L V +++GQ + +   D    VT FDA+HCPG++M LFE
Subjt:  GLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE

Query:  -GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAP-SVYLICN-LLGQEEILQQVSQTF
          N   +LHTGD R + E    L   +           +  + LD T+      FP + + I  ++  +      P +++LI +  +G+E +  +V++  
Subjt:  -GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAP-SVYLICN-LLGQEEILQQVSQTF

Query:  GSKIFVDEYTKAGYKALELIDPDI---LTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHGI
          KI+++       + L     DI     ++  S  H++  +          +A+  TN  S  LI+  S   +   ++ +    R+      I+     
Subjt:  GSKIFVDEYTKAGYKALELIDPDI---LTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVV
        + V YS HSS  EL+  +Q ++P+ ++
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVV

AT3G26680.1 DNA repair metallo-beta-lactamase family protein (e value: 1.6e-14; % identity: 26.27)
Query:  GLPFSVDTWTPSSKQK-RHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLF
        G PF+VD +     Q    +FLTH H DH  G+  A S  PIY + LT S +L+    ++ S    +E    L V+       VT+ +A+HCPGA +  F
Subjt:  GLPFSVDTWTPSSKQK-RHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLF

Query:  EGNFGN-ILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQII----NCVWKHPDAPSVYLICNLLGQEEILQQVS
            G   LHTGD R + + +Q  P  +          ++ +++LD T+     +FPS+   +  ++    + + K P    + +    +G+E +   ++
Subjt:  EGNFGN-ILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQII----NCVWKHPDAPSVYLICNLLGQEEILQQVS

Query:  QTFGSKIFVDEYTKAGYKAL--ELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEP----LIIRPSTQWYVREESSEICNTRKQIISEAI
        +  G KIF +   +   ++   + I  ++ T   ++  H+L        S K    D    +  E     L  RP T W   E+  E       +I    
Subjt:  QTFGSKIFVDEYTKAGYKAL--ELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEP----LIIRPSTQWYVREESSEICNTRKQIISEAI

Query:  KDQHGIWHVCYSMHSSKEELEWALQILAPKWVVST
        + +  I+ V YS HSS  EL   +Q L P  ++ T
Subjt:  KDQHGIWHVCYSMHSSKEELEWALQILAPKWVVST


Sequences
CDS sequence
ATGCCGATCGAAATGCCCCAAGGGCTGCCATTCTCGGTGGATACATGGACTCCATCTTCCAAGCAAAAGCGCCACCATTTTCTGACGCACGCTCACAGGGATCATACCAC
TGGAATTGCCGCCCATTCTTCCTTCCCCATTTATTCTACTTTTCTCACCAAATCCATCGTTCTTCAGCAGTTCCCTCAGCTTCATGATTCGTTGTTTGTATGTATCGAGG
TGGGGCAAGCGCTGGTCGTCAAAGATCCTGATGGAGCTTTCACTGTTACAGTTTTCGATGCTCATCACTGCCCTGGAGCTGTTATGTTCTTATTTGAAGGCAATTTTGGC
AATATTCTGCATACGGGTGATTGCAGACTAACTCCTGAGTGCCTACAGAACTTACCTGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCGAGATGTAAACTGGATCTGAT
TTTTCTAGATTGCACATTTGGTAGATTCTTTCAACAATTCCCCAGCAGGCATTCAGCAATACATCAGATTATTAATTGCGTATGGAAACATCCCGATGCTCCTTCAGTAT
ATCTGATTTGCAATCTTCTAGGACAGGAAGAGATATTGCAACAAGTGTCCCAAACGTTTGGTTCAAAGATATTTGTTGATGAGTACACGAAAGCAGGTTACAAGGCTCTT
GAACTTATAGATCCTGACATTCTCACTCAAGATCCATCCTCCCGCTTCCATCTGCTTGATGGATTCCCTAAACTATGTCAAAGTGCAAAAGCGCTGCTTGCAGATGCCCA
GACCAATGTCCAGTCTGAACCTCTCATAATCCGCCCTTCAACCCAGTGGTATGTTCGTGAGGAATCGTCAGAGATTTGCAACACAAGGAAACAAATAATTAGTGAAGCAA
TTAAAGATCAGCATGGTATTTGGCATGTCTGTTACTCGATGCATTCGTCGAAGGAAGAACTAGAATGGGCCTTGCAAATTTTAGCACCAAAATGGGTTGTTTCAACCACT
CCTGGTTGTCGGGCCATGGATTTGGATTACGTGAAAAAGAAACTCAGTTGTTCTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTATAGCTGAGGAAAGTTC
TTCAGATTTAGATGCTTCAGTGATTGAAGTGAGCTGTTCCCCTATAGTTGAAGCATCCACTCAAAAAGATATGGACCCTCAACTACAGCCTGTGAAATTATATGCTGTTC
CTAAAGAAATGTTAAAAATTTTACCTTCAGGAAACTTGCCACCTCTCACATTTTTCGGCCGAGCTAGACTTGACGCTGAAGATGTCGATTTGTTGCAGGAAGAAGTTTCA
TATCCGTCTACAGAGACTGAGCCTGTAGAAGCAGTTGGAGATAAAGTAGCAGACTTGTCCATTCATGATGCAAACAATAGACTGAGTGGCGAAACACCAGAAAATTCTAA
AAACGAAGTTAACTCTGAAGAAAAACACGAGAAGTTTGTTAATGATGGGTTATTAACTGAGAAAAATGCCTCTCTTTGCTCTGATCGGATTAGACTCCATGTTTCTGAAG
TAAAAGTTGTGTACATGAATAACACTAACACTCCAGAAGCAGTGAGCAGTGAGGTAGAAGAACTTCATGTCCATGAGCAAGGAATTAGAGTAAAGGGAAACGAGGCGTTA
GACCATTGTGAAGATGTCAGTACTGTTGCCGAAACACACTTTGGCAAGTTAGTAAATGATGACAGAATAGCAGGCTGTAGTAATTCACATCTTTTAAGTGTTGGATCTTC
AAAGGGTTTTAATGACAGGTTTAGAAAGCTGTACAGGTCAATGAATGTCCCTGTGCCTGAGCCTCTTCCTTCGCTGGTGGAGCTTATGAAATCAAGAAAACGGGCAAAGA
GGAATGCATATTTCTAG
mRNA sequence
ATGCCGATCGAAATGCCCCAAGGGCTGCCATTCTCGGTGGATACATGGACTCCATCTTCCAAGCAAAAGCGCCACCATTTTCTGACGCACGCTCACAGGGATCATACCAC
TGGAATTGCCGCCCATTCTTCCTTCCCCATTTATTCTACTTTTCTCACCAAATCCATCGTTCTTCAGCAGTTCCCTCAGCTTCATGATTCGTTGTTTGTATGTATCGAGG
TGGGGCAAGCGCTGGTCGTCAAAGATCCTGATGGAGCTTTCACTGTTACAGTTTTCGATGCTCATCACTGCCCTGGAGCTGTTATGTTCTTATTTGAAGGCAATTTTGGC
AATATTCTGCATACGGGTGATTGCAGACTAACTCCTGAGTGCCTACAGAACTTACCTGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCGAGATGTAAACTGGATCTGAT
TTTTCTAGATTGCACATTTGGTAGATTCTTTCAACAATTCCCCAGCAGGCATTCAGCAATACATCAGATTATTAATTGCGTATGGAAACATCCCGATGCTCCTTCAGTAT
ATCTGATTTGCAATCTTCTAGGACAGGAAGAGATATTGCAACAAGTGTCCCAAACGTTTGGTTCAAAGATATTTGTTGATGAGTACACGAAAGCAGGTTACAAGGCTCTT
GAACTTATAGATCCTGACATTCTCACTCAAGATCCATCCTCCCGCTTCCATCTGCTTGATGGATTCCCTAAACTATGTCAAAGTGCAAAAGCGCTGCTTGCAGATGCCCA
GACCAATGTCCAGTCTGAACCTCTCATAATCCGCCCTTCAACCCAGTGGTATGTTCGTGAGGAATCGTCAGAGATTTGCAACACAAGGAAACAAATAATTAGTGAAGCAA
TTAAAGATCAGCATGGTATTTGGCATGTCTGTTACTCGATGCATTCGTCGAAGGAAGAACTAGAATGGGCCTTGCAAATTTTAGCACCAAAATGGGTTGTTTCAACCACT
CCTGGTTGTCGGGCCATGGATTTGGATTACGTGAAAAAGAAACTCAGTTGTTCTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTATAGCTGAGGAAAGTTC
TTCAGATTTAGATGCTTCAGTGATTGAAGTGAGCTGTTCCCCTATAGTTGAAGCATCCACTCAAAAAGATATGGACCCTCAACTACAGCCTGTGAAATTATATGCTGTTC
CTAAAGAAATGTTAAAAATTTTACCTTCAGGAAACTTGCCACCTCTCACATTTTTCGGCCGAGCTAGACTTGACGCTGAAGATGTCGATTTGTTGCAGGAAGAAGTTTCA
TATCCGTCTACAGAGACTGAGCCTGTAGAAGCAGTTGGAGATAAAGTAGCAGACTTGTCCATTCATGATGCAAACAATAGACTGAGTGGCGAAACACCAGAAAATTCTAA
AAACGAAGTTAACTCTGAAGAAAAACACGAGAAGTTTGTTAATGATGGGTTATTAACTGAGAAAAATGCCTCTCTTTGCTCTGATCGGATTAGACTCCATGTTTCTGAAG
TAAAAGTTGTGTACATGAATAACACTAACACTCCAGAAGCAGTGAGCAGTGAGGTAGAAGAACTTCATGTCCATGAGCAAGGAATTAGAGTAAAGGGAAACGAGGCGTTA
GACCATTGTGAAGATGTCAGTACTGTTGCCGAAACACACTTTGGCAAGTTAGTAAATGATGACAGAATAGCAGGCTGTAGTAATTCACATCTTTTAAGTGTTGGATCTTC
AAAGGGTTTTAATGACAGGTTTAGAAAGCTGTACAGGTCAATGAATGTCCCTGTGCCTGAGCCTCTTCCTTCGCTGGTGGAGCTTATGAAATCAAGAAAACGGGCAAAGA
GGAATGCATATTTCTAG
Protein sequence
MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFEGNFG
NILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCVWKHPDAPSVYLICNLLGQEEILQQVSQTFGSKIFVDEYTKAGYKAL
ELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNVQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTT
PGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEASTQKDMDPQLQPVKLYAVPKEMLKILPSGNLPPLTFFGRARLDAEDVDLLQEEVS
YPSTETEPVEAVGDKVADLSIHDANNRLSGETPENSKNEVNSEEKHEKFVNDGLLTEKNASLCSDRIRLHVSEVKVVYMNNTNTPEAVSSEVEELHVHEQGIRVKGNEAL
DHCEDVSTVAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLYRSMNVPVPEPLPSLVELMKSRKRAKRNAYF