; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg11696 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg11696
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionDRMBL domain-containing protein
Genome locationCarg_Chr02:3537707..3542474
RNA-Seq ExpressionCarg11696
SyntenyCarg11696
Gene Ontology termsGO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0031848 - protection from non-homologous end joining at telomere (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domainsIPR011084 - DNA repair metallo-beta-lactamase
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605321.1 5' exonuclease Apollo, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0098.57Show/hide
Query:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCP-
        MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCP 
Subjt:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCP-

Query:  -------GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV
               GNFGNILHTGDCRLTPECLQNLPEKY GKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV
Subjt:  -------GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV

Query:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK

Query:  LYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGKQKFANDRVLLADEC
        LYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGKQKFANDRVLLADEC
Subjt:  LYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGKQKFANDRVLLADEC

Query:  ASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLY
        ASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLY
Subjt:  ASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLY

Query:  RSMNVAVPEPLPSLVELMKSRKRAKRNAYF
        RSMNVAVPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVAVPEPLPSLVELMKSRKRAKRNAYF

KAG7035278.1 5' exonuclease Apollo, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG
        MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG
Subjt:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI

Query:  FVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
        FVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREM
        HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREM
Subjt:  HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREM

Query:  LDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGKQKFANDRVLLADECASHCSDRA
        LDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGKQKFANDRVLLADECASHCSDRA
Subjt:  LDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGKQKFANDRVLLADECASHCSDRA

Query:  SLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAVP
        SLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAVP
Subjt:  SLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAVP

Query:  EPLPSLVELMKSRKRAKRNAYF
        EPLPSLVELMKSRKRAKRNAYF
Subjt:  EPLPSLVELMKSRKRAKRNAYF

XP_022948240.1 uncharacterized protein LOC111451874 isoform X2 [Cucurbita moschata]0.0e+0095.83Show/hide
Query:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG
        MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI

Query:  FVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
        FVDEY KAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAK+LLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREM
        HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPS +SLTSNGLIWKLFGL EESSSDLDAS IEVRCSPIVETSTLKDMDPQLQPAKLYAVPRE 
Subjt:  HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREM

Query:  LDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADECASHCSDR
        LDILS SNLPPLTLFGRARLA +DANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHS NEVNSKGK +KFANDRVLLADE ASHCSDR
Subjt:  LDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADECASHCSDR

Query:  ASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAV
        A LHTSEV+VVSMNNNNPPEAVSSEVEELHVHEQESRGKG+KSLDDCEDV TVP+THIGKLVKDDR+    SNSH LSVGSSKGFNDRFRKLYRSMNVAV
Subjt:  ASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAV

Query:  PEPLPSLVELMKSRKRAKRNAYF
        PEPLPSLVELMKSRKRAKRNAYF
Subjt:  PEPLPSLVELMKSRKRAKRNAYF

XP_023532363.1 uncharacterized protein LOC111794563 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0095.24Show/hide
Query:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCP-
        MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDG FTVTV DAHHCP 
Subjt:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCP-

Query:  -------GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV
Subjt:  -------GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV

Query:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAK+LLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSC+SLTSNGLIW+LFGL EESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK

Query:  LYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADE
        LYAVPRE LDILSSSNLPPLTLFGRARLA +DANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHS NE+NSKGK +KFAN+RVLLADE
Subjt:  LYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADE

Query:  CASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKL
         ASHCSD A LHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRG GNKSLDDCEDV TVPETHIGKLVKDDRIAGC SNSH LSVGSSKGFNDRFRKL
Subjt:  CASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKL

Query:  YRSMNVAVPEPLPSLVELMKSRKRAKRNAY
        YRSMNVAVPEPLPSLVELMKSRKRAKRNAY
Subjt:  YRSMNVAVPEPLPSLVELMKSRKRAKRNAY

XP_023532364.1 uncharacterized protein LOC111794563 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0096.46Show/hide
Query:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG
        MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDG FTVTV DAHHCPG
Subjt:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI

Query:  FVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
        FVDEY KAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAK+LLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREM
        HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSC+SLTSNGLIW+LFGL EESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPRE 
Subjt:  HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREM

Query:  LDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADECASHCSDR
        LDILSSSNLPPLTLFGRARLA +DANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHS NE+NSKGK +KFAN+RVLLADE ASHCSD 
Subjt:  LDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADECASHCSDR

Query:  ASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAV
        A LHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRG GNKSLDDCEDV TVPETHIGKLVKDDRIAGC SNSH LSVGSSKGFNDRFRKLYRSMNVAV
Subjt:  ASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAV

Query:  PEPLPSLVELMKSRKRAKRNAY
        PEPLPSLVELMKSRKRAKRNAY
Subjt:  PEPLPSLVELMKSRKRAKRNAY

TrEMBL top hitse value%identityAlignment
A0A6J1DVN9 protein artemis1.7e-28779.46Show/hide
Query:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCP-
        MPIEMP GLPFSVDTWTPSSKQK HHFLTHAH DHT GI   HSSFPI+ST +TK IVLQHFPQ+ DSLFVCIE+GQ+LVVKDPDG FTVTVFDAHHCP 
Subjt:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCP-

Query:  -------GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV
               GNFGNILHTGDCRLTPECLQ+LPEKYRGKSGKEPRC+LDLIFLDCTFGRFFQQFPSRHS+IHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  -------GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV

Query:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDE MKAGYKALELIDP+ILTQDPSSRFHLL GFPKLCQ AK LL +AQTNFQ EPL+IRPSTQWYV EELSE+  TRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK
        IWHVCYSMHSSKEELEWALQIL PKWV STTPGCRAMDLDYVKKK SC+SLTSNGLIWKLFG+ EESSSDLD S +EV CSP+VE     ++DPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK

Query:  LYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTEN-EPVEAVGDKVADLSIHDA-NGRPSDKPSKHSINEVNSKGK-QKFANDRVLLA
        LYA P+EML++LSSSNLPPLTLFGRARL A++A++L EEV YPST N EPVEAVG KV DLSIHDA NG+ SD+ S++S NEVNS+ K +KFAND  LL 
Subjt:  LYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTEN-EPVEAVGDKVADLSIHDA-NGRPSDKPSKHSINEVNSKGK-QKFANDRVLLA

Query:  DECASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFR
        D  AS  S+R  LH SEVKV SMN+ +PP+ V S VEEL++H Q+ R KGN+SL DCEDVG++PETH GKL+ DDRI  C  NSH LSVGSSKGFND+FR
Subjt:  DECASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFR

Query:  KLYRSMNVAVPEPLPSLVELMKSRKRAKRNAYF
        KLYRSMNV VP+PLPSLVELMKSRKRAK+NAYF
Subjt:  KLYRSMNVAVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1G8U0 uncharacterized protein LOC111451874 isoform X10.0e+0094.61Show/hide
Query:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCP-
        MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDG FTVTVFDAHHCP 
Subjt:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCP-

Query:  -------GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV
Subjt:  -------GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV

Query:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAK+LLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPS +SLTSNGLIWKLFGL EESSSDLDAS IEVRCSPIVETSTLKDMDPQLQPAK
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK

Query:  LYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADE
        LYAVPRE LDILS SNLPPLTLFGRARLA +DANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHS NEVNSKGK +KFANDRVLLADE
Subjt:  LYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADE

Query:  CASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKL
         ASHCSDRA LHTSEV+VVSMNNNNPPEAVSSEVEELHVHEQESRGKG+KSLDDCEDV TVP+THIGKLVKDDR+    SNSH LSVGSSKGFNDRFRKL
Subjt:  CASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKL

Query:  YRSMNVAVPEPLPSLVELMKSRKRAKRNAYF
        YRSMNVAVPEPLPSLVELMKSRKRAKRNAYF
Subjt:  YRSMNVAVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1G997 uncharacterized protein LOC111451874 isoform X20.0e+0095.83Show/hide
Query:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG
        MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI

Query:  FVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
        FVDEY KAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAK+LLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREM
        HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPS +SLTSNGLIWKLFGL EESSSDLDAS IEVRCSPIVETSTLKDMDPQLQPAKLYAVPRE 
Subjt:  HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREM

Query:  LDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADECASHCSDR
        LDILS SNLPPLTLFGRARLA +DANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHS NEVNSKGK +KFANDRVLLADE ASHCSDR
Subjt:  LDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADECASHCSDR

Query:  ASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAV
        A LHTSEV+VVSMNNNNPPEAVSSEVEELHVHEQESRGKG+KSLDDCEDV TVP+THIGKLVKDDR+    SNSH LSVGSSKGFNDRFRKLYRSMNVAV
Subjt:  ASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAV

Query:  PEPLPSLVELMKSRKRAKRNAYF
        PEPLPSLVELMKSRKRAKRNAYF
Subjt:  PEPLPSLVELMKSRKRAKRNAYF

A0A6J1L262 uncharacterized protein LOC111499723 isoform X20.0e+0095.67Show/hide
Query:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG
        MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG

Query:  NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI
        NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI
Subjt:  NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKI

Query:  FVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM
        FVDEY KAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAK+LLANAQTNFQPEPLVIRPSTQWYVREELSE CNTRKQIISEAIKDQHGIWHVCYSM
Subjt:  FVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSM

Query:  HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREM
        HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK KPSC+SLTS+GLIWKLFGL EESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPRE 
Subjt:  HSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREM

Query:  LDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADECASHCSDR
        LDILS SNLPPLTLFGRARL AEDAN+L EEVSYPS ENEPVEAVGDKVADLSIHDANGRPSDKPSKHS NEVNSKGK +KFAN RVLLADECASHCSDR
Subjt:  LDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADECASHCSDR

Query:  ASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAV
        A LH SEVKVVSMNNNNPPEAVSSEVEELH HEQESRGKGNKSLDDCEDV TVPET IGKLVKDDRIAGC SNSH LSVGSSKGFN RFRKLYRSMNVAV
Subjt:  ASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAV

Query:  PEPLPSLVELMKSRKRAKRNAYF
        PEPLPSLVELMKSRKRAKRNAYF
Subjt:  PEPLPSLVELMKSRKRAKRNAYF

A0A6J1L450 protein artemis isoform X10.0e+0094.45Show/hide
Query:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCP-
        MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDP+G FTVTVFDAHHCP 
Subjt:  MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCP-

Query:  -------GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV
Subjt:  -------GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQV

Query:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAK+LLANAQTNFQPEPLVIRPSTQWYVREELSE CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK KPSC+SLTS+GLIWKLFGL EESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAK

Query:  LYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADE
        LYAVPRE LDILS SNLPPLTLFGRARL AEDAN+L EEVSYPS ENEPVEAVGDKVADLSIHDANGRPSDKPSKHS NEVNSKGK +KFAN RVLLADE
Subjt:  LYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGK-QKFANDRVLLADE

Query:  CASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKL
        CASHCSDRA LH SEVKVVSMNNNNPPEAVSSEVEELH HEQESRGKGNKSLDDCEDV TVPET IGKLVKDDRIAGC SNSH LSVGSSKGFN RFRKL
Subjt:  CASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKL

Query:  YRSMNVAVPEPLPSLVELMKSRKRAKRNAYF
        YRSMNVAVPEPLPSLVELMKSRKRAKRNAYF
Subjt:  YRSMNVAVPEPLPSLVELMKSRKRAKRNAYF

SwissProt top hitse value%identityAlignment
Q38961 DNA cross-link repair protein SNM19.7e-1425.45Show/hide
Query:  PGLPFSVDTWTPSSKQK-RHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG----N
        PG PF+VD +     Q    +FLTH H DH IG+  A S  PI+ + +T S +L+    ++ S    +E+     +        VT+ +A+HCPG    +
Subjt:  PGLPFSVDTWTPSSKQK-RHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG----N

Query:  FGNI-----LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVI----NCIWKHPDAPLVYLICSHLGQEDILQQV
        F  +     LHTGD R + + +Q  P  +          ++ +++LD T+     +FPS+   +  V+    + + K P   L+ +    +G+E +   +
Subjt:  FGNI-----LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVI----NCIWKHPDAPLVYLICSHLGQEDILQQV

Query:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSS--RFHLLHGFPKLCQTAKSLLANAQTNFQP--EPLVIRPSTQWYVREELSEICNTRKQIISEAIK
        ++  G KIF +   +   + L+    D ++++ S+  +   LH  P      + L  + +   +     L  RP T W   E++ E       +I    +
Subjt:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSS--RFHLLHGFPKLCQTAKSLLANAQTNFQP--EPLVIRPSTQWYVREELSEICNTRKQIISEAIK

Query:  DQHGIWHVCYSMHSSKEELEWALQILAPKWVVST
         +  I+ V YS HSS  EL   +Q L P  ++ T
Subjt:  DQHGIWHVCYSMHSSKEELEWALQILAPKWVVST

Q5QJC3 5' exonuclease Apollo1.2e-1932.48Show/hide
Query:  PGLPFSVDTWT-PSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPGN----
        PG P +VD W+   +   R  FL+H H DHT+G+++  S  P++ + +T  + L H  ++       +EVGQ+  V +     TVT+ DA+HCPG+    
Subjt:  PGLPFSVDTWT-PSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPGN----

Query:  ----FGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFG
            FG IL+TGD R +P  +Q  P      SG+    ++D ++LD T  R     PSR  A  Q    I +HP   +V  + S LG+E++L  ++  FG
Subjt:  ----FGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFG

Query:  SKIFVDEYMKAGYKALELIDPDIL-TQDPSSRFH
        + + V        + LEL  P++  T++ + R H
Subjt:  SKIFVDEYMKAGYKALELIDPDIL-TQDPSSRFH

Q86KS1 DNA cross-link repair 1 protein2.9e-1023.82Show/hide
Query:  GLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPGNF-----
        G  F VD +   S+   H+FLTH H DH +GI    S   I+ T  T  +V  H   +     V  E  + + ++       V   D++HCPG+      
Subjt:  GLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPGNF-----

Query:  -------------GNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSH-LGQEDI
                      +ILHTGD R   + + N P   +G++       +  ++LD T+      FP +   I QV + + K  D   ++L  ++ +G+E I
Subjt:  -------------GNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSH-LGQEDI

Query:  LQQVSQTFGSKIFVDEYMKAGYKALE-LIDPDILTQDPSSRFHLLHGFPKLCQTAKSL-----LANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQI
        L ++++  G  + V     A    L   +D +  T +      L+  F  +  +  S      L ++  N     +  RP T W             K+ 
Subjt:  LQQVSQTFGSKIFVDEYMKAGYKALE-LIDPDILTQDPSSRFHLLHGFPKLCQTAKSL-----LANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQI

Query:  ISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVST
        I+   +     + V YS HSS  EL   +    P  ++ T
Subjt:  ISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVST

Q8C7W7 5' exonuclease Apollo1.6e-1629.44Show/hide
Query:  PGLPFSVDTWT-PSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVK-DPDGH--FTVTVFDAHHCPGN-
        P  P +VD W+   +   R  FLTH H DHT+G+++  +  P++ + IT + +L    Q+       +EVG++ V+  D  G    TVT+ DA+HCPG+ 
Subjt:  PGLPFSVDTWT-PSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVK-DPDGH--FTVTVFDAHHCPGN-

Query:  -------FGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQ
               FG IL+TGD R TP  L+  P    GK       ++  ++LD T        PSR  A  Q++  I + P   +   + S LG+E +L+Q++ 
Subjt:  -------FGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQ

Query:  TFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTA
         F + + +        + L L D     ++ + R H +    ++C +A
Subjt:  TFGSKIFVDEYMKAGYKALELIDPDILTQDPSSRFHLLHGFPKLCQTA

Q9H816 5' exonuclease Apollo4.5e-1931.12Show/hide
Query:  PGLPFSVDTWT-PSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVK-DPDGH--FTVTVFDAHHCPGN-
        P  P +VD W+   +   R  FL+H H DHT+G+++  +  P++ + IT  ++ +H  Q+       +EVG++ V+  D  G    TVT+ DA+HCPG+ 
Subjt:  PGLPFSVDTWT-PSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVK-DPDGH--FTVTVFDAHHCPGN-

Query:  -------FGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQ
               FG IL+TGD R TP  L+  P    GK       ++  ++LD T        PSR  A HQ++  I KHP   +   + S LG+E +L+Q++ 
Subjt:  -------FGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQ

Query:  TFGSKIFVDEYMKAGYKALELID----PDILT-QDPSSRFH
         F + + +        + LEL+      D+ T ++ + R H
Subjt:  TFGSKIFVDEYMKAGYKALELID----PDILT-QDPSSRFH

Arabidopsis top hitse value%identityAlignment
AT1G19025.1 DNA repair metallo-beta-lactamase family protein3.1e-13241.71Show/hide
Query:  MPIEMPPGLPFSVDT---WTPSSKQKRHHFLTHAHMDHTIGIAAAH-SSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAH
        M IEMP GLPF+VDT   +T + ++KRHHFLTHAH DHT+G++ ++   FPI+ST +T S++LQ FPQL +S FV +E+GQ+++V DPDG F VT FDA+
Subjt:  MPIEMPPGLPFSVDT---WTPSSKQKRHHFLTHAHMDHTIGIAAAH-SSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAH

Query:  HCP--------GNFGNILHTGDCRLTPECLQNLPEKYRGKS-GKEPRCKLDLIFLDCTFGR--FFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQ
        HCP        G+FGNILHTGDCRLT +CL +LPEKY G+S G +P+C L  IFLDCTFG+    Q+FP++HSAI Q+INCIW HPDAP+VYL C  LGQ
Subjt:  HCP--------GNFGNILHTGDCRLTPECLQNLPEKYRGKS-GKEPRCKLDLIFLDCTFGR--FFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQ

Query:  EDILQQVSQTFGSKIFVDEYMKAG-YKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSE-----ICNTR
        ED+L +VS+TFGSKI+VD+      +++L +I P+I+++DPSSRFH+  GFPKL +   + LA A++  Q EPL+IRPS QWYV ++  +     I   R
Subjt:  EDILQQVSQTFGSKIFVDEYMKAG-YKALELIDPDILTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSE-----ICNTR

Query:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESS--SDLDASAIEVRCSPIVE
        K   SEA+KD+ G+WHVCYSMHSS+ ELE A+Q+L+PKWVVST P CRAM+L+YVKK    +  + +   WKL  ++ E+S  +  D   + + C  + E
Subjt:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKPSCTSLTSNGLIWKLFGLEEESS--SDLDASAIEVRCSPIVE

Query:  TSTLKDMDPQLQPAKLYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKG
           L     +L+P    +  ++ L  LS  N  P+TLFGRAR ++++ + L                                                 
Subjt:  TSTLKDMDPQLQPAKLYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENEPVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKG

Query:  KQKFANDRVLLADECASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLS
             ++R ++  +C  +      L    VKVV        E +  + +E  V E+ES                                 C+  S   S
Subjt:  KQKFANDRVLLADECASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVGTVPETHIGKLVKDDRIAGCSSNSHGLS

Query:  VGSSKGFNDRFRKLYRSMNVAVPEPLPSLVELMKSRKRAKRNAYF
          + K  +   RKLYRSMN  VP PLPSL+ELM +RKR++ +  F
Subjt:  VGSSKGFNDRFRKLYRSMNVAVPEPLPSLVELMKSRKRAKRNAYF

AT2G45700.1 sterile alpha motif (SAM) domain-containing protein5.7e-1725.53Show/hide
Query:  PGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG------
        PG PF VD +   ++   H FLTH H+DH  G+  + S   I+ + +T  +V        + L V +++GQ + +   D    VT FDA+HCPG      
Subjt:  PGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG------

Query:  ---NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAP-LVYLICSH-LGQEDILQQVSQT
           N   +LHTGD R + E    L   +           +  + LD T+      FP + + I  V+  I      P  ++LI S+ +G+E +  +V++ 
Subjt:  ---NFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAP-LVYLICSH-LGQEDILQQVSQT

Query:  FGSKIFVDEYMKAGYKALELIDPDI---LTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG
           KI+++       + L     DI     ++  S  H++  +          +AN  TN     +   P T W   +       T+K+     ++    
Subjt:  FGSKIFVDEYMKAGYKALELIDPDI---LTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHG

Query:  I-WHVCYSMHSSKEELEWALQILAPKWVV
        I + V YS HSS  EL+  +Q ++P+ ++
Subjt:  I-WHVCYSMHSSKEELEWALQILAPKWVV

AT3G26680.1 DNA repair metallo-beta-lactamase family protein6.9e-1525.45Show/hide
Query:  PGLPFSVDTWTPSSKQK-RHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG----N
        PG PF+VD +     Q    +FLTH H DH IG+  A S  PI+ + +T S +L+    ++ S    +E+     +        VT+ +A+HCPG    +
Subjt:  PGLPFSVDTWTPSSKQK-RHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG----N

Query:  FGNI-----LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVI----NCIWKHPDAPLVYLICSHLGQEDILQQV
        F  +     LHTGD R + + +Q  P  +          ++ +++LD T+     +FPS+   +  V+    + + K P   L+ +    +G+E +   +
Subjt:  FGNI-----LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVI----NCIWKHPDAPLVYLICSHLGQEDILQQV

Query:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSS--RFHLLHGFPKLCQTAKSLLANAQTNFQP--EPLVIRPSTQWYVREELSEICNTRKQIISEAIK
        ++  G KIF +   +   + L+    D ++++ S+  +   LH  P      + L  + +   +     L  RP T W   E++ E       +I    +
Subjt:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSS--RFHLLHGFPKLCQTAKSLLANAQTNFQP--EPLVIRPSTQWYVREELSEICNTRKQIISEAIK

Query:  DQHGIWHVCYSMHSSKEELEWALQILAPKWVVST
         +  I+ V YS HSS  EL   +Q L P  ++ T
Subjt:  DQHGIWHVCYSMHSSKEELEWALQILAPKWVVST

AT3G26680.2 DNA repair metallo-beta-lactamase family protein6.9e-1525.45Show/hide
Query:  PGLPFSVDTWTPSSKQK-RHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG----N
        PG PF+VD +     Q    +FLTH H DH IG+  A S  PI+ + +T S +L+    ++ S    +E+     +        VT+ +A+HCPG    +
Subjt:  PGLPFSVDTWTPSSKQK-RHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG----N

Query:  FGNI-----LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVI----NCIWKHPDAPLVYLICSHLGQEDILQQV
        F  +     LHTGD R + + +Q  P  +          ++ +++LD T+     +FPS+   +  V+    + + K P   L+ +    +G+E +   +
Subjt:  FGNI-----LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVI----NCIWKHPDAPLVYLICSHLGQEDILQQV

Query:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSS--RFHLLHGFPKLCQTAKSLLANAQTNFQP--EPLVIRPSTQWYVREELSEICNTRKQIISEAIK
        ++  G KIF +   +   + L+    D ++++ S+  +   LH  P      + L  + +   +     L  RP T W   E++ E       +I    +
Subjt:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSS--RFHLLHGFPKLCQTAKSLLANAQTNFQP--EPLVIRPSTQWYVREELSEICNTRKQIISEAIK

Query:  DQHGIWHVCYSMHSSKEELEWALQILAPKWVVST
         +  I+ V YS HSS  EL   +Q L P  ++ T
Subjt:  DQHGIWHVCYSMHSSKEELEWALQILAPKWVVST

AT3G26680.3 DNA repair metallo-beta-lactamase family protein6.9e-1525.45Show/hide
Query:  PGLPFSVDTWTPSSKQK-RHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG----N
        PG PF+VD +     Q    +FLTH H DH IG+  A S  PI+ + +T S +L+    ++ S    +E+     +        VT+ +A+HCPG    +
Subjt:  PGLPFSVDTWTPSSKQK-RHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPG----N

Query:  FGNI-----LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVI----NCIWKHPDAPLVYLICSHLGQEDILQQV
        F  +     LHTGD R + + +Q  P  +          ++ +++LD T+     +FPS+   +  V+    + + K P   L+ +    +G+E +   +
Subjt:  FGNI-----LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVI----NCIWKHPDAPLVYLICSHLGQEDILQQV

Query:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSS--RFHLLHGFPKLCQTAKSLLANAQTNFQP--EPLVIRPSTQWYVREELSEICNTRKQIISEAIK
        ++  G KIF +   +   + L+    D ++++ S+  +   LH  P      + L  + +   +     L  RP T W   E++ E       +I    +
Subjt:  SQTFGSKIFVDEYMKAGYKALELIDPDILTQDPSS--RFHLLHGFPKLCQTAKSLLANAQTNFQP--EPLVIRPSTQWYVREELSEICNTRKQIISEAIK

Query:  DQHGIWHVCYSMHSSKEELEWALQILAPKWVVST
         +  I+ V YS HSS  EL   +Q L P  ++ T
Subjt:  DQHGIWHVCYSMHSSKEELEWALQILAPKWVVST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGATCGAAATGCCCCCAGGCCTGCCATTCTCGGTGGATACATGGACTCCTTCTTCCAAGCAAAAGCGCCACCATTTTCTAACGCACGCCCACATGGATCACACCAT
TGGAATTGCCGCCGCCCATTCCTCCTTCCCTATTTTTTCTACTTTTATCACCAAATCGATTGTTCTTCAGCACTTCCCTCAGCTTCATGATTCGTTGTTCGTATGTATCG
AGGTGGGGCAAACGCTGGTCGTAAAAGATCCTGATGGTCATTTCACTGTTACAGTTTTCGATGCTCATCATTGCCCTGGCAATTTTGGCAATATTCTACATACGGGTGAT
TGCAGACTAACTCCTGAGTGCCTACAGAACTTACCTGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCAAGGTGTAAACTGGATCTGATTTTTCTAGATTGCACATTTGG
TAGATTCTTTCAACAATTCCCCAGCAGACATTCAGCAATACATCAGGTTATTAATTGCATATGGAAACATCCTGATGCTCCTTTGGTATATCTGATTTGCAGTCATCTAG
GACAGGAAGATATATTGCAACAAGTATCCCAAACATTTGGTTCAAAGATATTTGTTGATGAATACATGAAAGCAGGTTACAAGGCTCTTGAACTTATAGATCCTGACATC
CTCACTCAAGATCCATCCTCCCGCTTTCATCTGCTTCATGGATTCCCTAAACTATGTCAAACTGCAAAATCACTGCTTGCAAATGCCCAGACCAATTTTCAGCCGGAACC
TCTCGTAATACGCCCTTCGACTCAGTGGTATGTTCGTGAGGAATTGTCAGAGATTTGCAACACAAGGAAACAAATAATTAGTGAAGCAATTAAAGATCAGCACGGTATTT
GGCATGTCTGTTACTCAATGCACTCGTCGAAGGAAGAACTAGAATGGGCCTTGCAAATTTTAGCACCAAAATGGGTTGTTTCAACCACTCCTGGTTGTCGGGCTATGGAT
TTGGATTACGTGAAAAAGAAACCCAGTTGCACTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTTTAGAAGAGGAAAGTTCTTCAGATTTAGATGCTTCAGC
GATTGAAGTGAGGTGTTCTCCCATAGTTGAAACATCCACTCTAAAAGATATGGATCCTCAACTTCAGCCTGCAAAATTGTATGCAGTTCCTAGAGAAATGTTAGACATTT
TGTCTTCAAGCAACTTGCCACCTCTCACATTATTCGGACGAGCTAGACTCGCCGCCGAAGATGCCAATATGTTACCGGAAGAAGTTTCATACCCATCAACAGAGAATGAG
CCTGTAGAAGCAGTTGGAGATAAAGTAGCAGACTTGTCCATTCACGATGCAAACGGTAGACCGAGTGACAAACCATCAAAACATTCTATAAACGAAGTTAACTCCAAAGG
GAAACAGAAATTTGCAAACGACAGGGTATTATTAGCTGATGAATGCGCCTCTCATTGCTCTGATCGGGCTAGCCTCCATACTTCTGAAGTAAAAGTTGTGTCCATGAACA
ACAATAACCCACCAGAAGCAGTCAGCAGTGAGGTAGAAGAACTTCATGTCCATGAGCAAGAAAGTAGAGGTAAGGGAAACAAATCGTTAGACGATTGTGAAGATGTCGGT
ACCGTTCCCGAAACACACATTGGGAAGTTAGTAAAGGATGACAGAATAGCAGGGTGTAGTAGTAATTCACATGGTTTAAGTGTTGGATCTTCAAAAGGTTTTAATGACAG
GTTTAGAAAGCTGTACAGGTCAATGAATGTAGCTGTGCCAGAGCCTCTTCCTTCGCTTGTGGAGCTTATGAAATCCAGAAAACGGGCAAAGCGGAATGCATATTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCGATCGAAATGCCCCCAGGCCTGCCATTCTCGGTGGATACATGGACTCCTTCTTCCAAGCAAAAGCGCCACCATTTTCTAACGCACGCCCACATGGATCACACCAT
TGGAATTGCCGCCGCCCATTCCTCCTTCCCTATTTTTTCTACTTTTATCACCAAATCGATTGTTCTTCAGCACTTCCCTCAGCTTCATGATTCGTTGTTCGTATGTATCG
AGGTGGGGCAAACGCTGGTCGTAAAAGATCCTGATGGTCATTTCACTGTTACAGTTTTCGATGCTCATCATTGCCCTGGCAATTTTGGCAATATTCTACATACGGGTGAT
TGCAGACTAACTCCTGAGTGCCTACAGAACTTACCTGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCAAGGTGTAAACTGGATCTGATTTTTCTAGATTGCACATTTGG
TAGATTCTTTCAACAATTCCCCAGCAGACATTCAGCAATACATCAGGTTATTAATTGCATATGGAAACATCCTGATGCTCCTTTGGTATATCTGATTTGCAGTCATCTAG
GACAGGAAGATATATTGCAACAAGTATCCCAAACATTTGGTTCAAAGATATTTGTTGATGAATACATGAAAGCAGGTTACAAGGCTCTTGAACTTATAGATCCTGACATC
CTCACTCAAGATCCATCCTCCCGCTTTCATCTGCTTCATGGATTCCCTAAACTATGTCAAACTGCAAAATCACTGCTTGCAAATGCCCAGACCAATTTTCAGCCGGAACC
TCTCGTAATACGCCCTTCGACTCAGTGGTATGTTCGTGAGGAATTGTCAGAGATTTGCAACACAAGGAAACAAATAATTAGTGAAGCAATTAAAGATCAGCACGGTATTT
GGCATGTCTGTTACTCAATGCACTCGTCGAAGGAAGAACTAGAATGGGCCTTGCAAATTTTAGCACCAAAATGGGTTGTTTCAACCACTCCTGGTTGTCGGGCTATGGAT
TTGGATTACGTGAAAAAGAAACCCAGTTGCACTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTTTAGAAGAGGAAAGTTCTTCAGATTTAGATGCTTCAGC
GATTGAAGTGAGGTGTTCTCCCATAGTTGAAACATCCACTCTAAAAGATATGGATCCTCAACTTCAGCCTGCAAAATTGTATGCAGTTCCTAGAGAAATGTTAGACATTT
TGTCTTCAAGCAACTTGCCACCTCTCACATTATTCGGACGAGCTAGACTCGCCGCCGAAGATGCCAATATGTTACCGGAAGAAGTTTCATACCCATCAACAGAGAATGAG
CCTGTAGAAGCAGTTGGAGATAAAGTAGCAGACTTGTCCATTCACGATGCAAACGGTAGACCGAGTGACAAACCATCAAAACATTCTATAAACGAAGTTAACTCCAAAGG
GAAACAGAAATTTGCAAACGACAGGGTATTATTAGCTGATGAATGCGCCTCTCATTGCTCTGATCGGGCTAGCCTCCATACTTCTGAAGTAAAAGTTGTGTCCATGAACA
ACAATAACCCACCAGAAGCAGTCAGCAGTGAGGTAGAAGAACTTCATGTCCATGAGCAAGAAAGTAGAGGTAAGGGAAACAAATCGTTAGACGATTGTGAAGATGTCGGT
ACCGTTCCCGAAACACACATTGGGAAGTTAGTAAAGGATGACAGAATAGCAGGGTGTAGTAGTAATTCACATGGTTTAAGTGTTGGATCTTCAAAAGGTTTTAATGACAG
GTTTAGAAAGCTGTACAGGTCAATGAATGTAGCTGTGCCAGAGCCTCTTCCTTCGCTTGTGGAGCTTATGAAATCCAGAAAACGGGCAAAGCGGAATGCATATTTCTAG
Protein sequenceShow/hide protein sequence
MPIEMPPGLPFSVDTWTPSSKQKRHHFLTHAHMDHTIGIAAAHSSFPIFSTFITKSIVLQHFPQLHDSLFVCIEVGQTLVVKDPDGHFTVTVFDAHHCPGNFGNILHTGD
CRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQVINCIWKHPDAPLVYLICSHLGQEDILQQVSQTFGSKIFVDEYMKAGYKALELIDPDI
LTQDPSSRFHLLHGFPKLCQTAKSLLANAQTNFQPEPLVIRPSTQWYVREELSEICNTRKQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMD
LDYVKKKPSCTSLTSNGLIWKLFGLEEESSSDLDASAIEVRCSPIVETSTLKDMDPQLQPAKLYAVPREMLDILSSSNLPPLTLFGRARLAAEDANMLPEEVSYPSTENE
PVEAVGDKVADLSIHDANGRPSDKPSKHSINEVNSKGKQKFANDRVLLADECASHCSDRASLHTSEVKVVSMNNNNPPEAVSSEVEELHVHEQESRGKGNKSLDDCEDVG
TVPETHIGKLVKDDRIAGCSSNSHGLSVGSSKGFNDRFRKLYRSMNVAVPEPLPSLVELMKSRKRAKRNAYF