; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007694 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007694
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDRMBL domain-containing protein
Genome locationchr9:3191585..3200212
RNA-Seq ExpressionLag0007694
SyntenyLag0007694
Gene Ontology termsGO:0006303 - double-strand break repair via nonhomologous end joining (biological process)
GO:0031848 - protection from non-homologous end joining at telomere (biological process)
GO:0036297 - interstrand cross-link repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003684 - damaged DNA binding (molecular function)
GO:0035312 - 5'-3' exodeoxyribonuclease activity (molecular function)
InterPro domainsIPR011084 - DNA repair metallo-beta-lactamase
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605321.1 5' exonuclease Apollo, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0086.85Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKY GKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AK+LLA+AQTNFQ EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK SC+SLTSNGLIWKLFG+ EESSSDLDAS IEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+EML IL S NLPPLTLFGRARLAAEDA++L EEVSYPS E EPVEAVGDKVADLSIHDAN R S +  ++S NEVNS+ K +KF ND  LL ++
Subjt:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGC-SNSHLLSVGSSKGFNDRFRKL
          S CSDR   H SEVKVVSMNN NPPEAVS EVEELHVHEQ  R KGN+ LD CED  T+ ETH GKLV DDRIAGC SNSH LSVGSSKGFNDRFRKL
Subjt:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGC-SNSHLLSVGSSKGFNDRFRKL

Query:  YRSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        YRSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  YRSMNVPVPEPLPSLVELMKSRKRAKRNAYF

XP_022948238.1 uncharacterized protein LOC111451874 isoform X1 [Cucurbita moschata]0.0e+0086.51Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTNFQ EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLDASVIEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLTLFGRARLA +DA++L EEVSYPS E EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF ND  LL ++
Subjt:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
        + S CSDR R H SEV+VVSMNN NPPEAVS EVEELHVHEQ  R KG++ LD CED  T+ +TH GKLV DDR+   SNSH+LSVGSSKGFNDRFRKLY
Subjt:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

XP_023007139.1 protein artemis isoform X1 [Cucurbita maxima]0.0e+0086.67Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTNFQ EPL+IRPSTQWYVREE SE CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLDAS IEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLTLFGRARL AEDA++L EEVSYPSIE EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF N   LL ++
Subjt:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
          S CSDR R H+SEVKVVSMNN NPPEAVS EVEELH HEQ  R KGN+ LD CED  T+ ET  GKLV DDRIAGCSNSH+LSVGSSKGFN RFRKLY
Subjt:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

XP_023532363.1 uncharacterized protein LOC111794563 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0086.8Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTV DAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTNFQ EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK SCSSLTSNGLIW+LFG+AEESSSDLDAS IEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL S NLPPLTLFGRARLA +DA++L EEVSYPS E EPVEAVGDKVADLSIHDAN R S +  ++SKNE+NS+ KHEKF N+  LL ++
Subjt:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
        + S CSD  R H SEVKVVSMNN NPPEAVS EVEELHVHEQ  R  GN+ LD CED  T+ ETH GKLV DDRIAGCSNSH+LSVGSSKGFNDRFRKLY
Subjt:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY
        RSMNV VPEPLPSLVELMKSRKRAKRNAY
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY

XP_023532364.1 uncharacterized protein LOC111794563 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0085.53Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTV DAHHCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTNFQ EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK SCSSLTSNGLIW+LFG+AEESSSDLDAS IEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL S NLPPLTLFGRARLA +DA++L EEVSYPS E EPVEAVGDKVADLSIHDAN R S +  ++SKNE+NS+ KHEKF N+  LL ++
Subjt:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
        + S CSD  R H SEVKVVSMNN NPPEAVS EVEELHVHEQ  R  GN+ LD CED  T+ ETH GKLV DDRIAGCSNSH+LSVGSSKGFNDRFRKLY
Subjt:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY
        RSMNV VPEPLPSLVELMKSRKRAKRNAY
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAY

TrEMBL top hitse value%identityAlignment
A0A0A0LMF2 DRMBL domain-containing protein0.0e+0084.87Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGA
        MPIEMPQGLPFSVDTW+PSSK+KRHHFLTHAHRDHTTGI  H SFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQ+LVVKDPDGAFTVTVFDAHHCPGA
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGA

Query:  VMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
        VMFLFEG FGN+LHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHS+IHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS
Subjt:  VMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVS

Query:  QTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHGI
        QTFGSKIF DE +KAGYKALELI+PDILTQDPSSRFHLLDGFPKLCQ+A+ LLADAQTNF SEPL+IRPSTQWYVREE SEI N+RKQIISEAIKDQHGI
Subjt:  QTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVKL
        WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK+KLS SSLTSNGLIWKLFGIAE+SSSDLDASVIEVSCS IVEAPTQ++++PQ Q VK+
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVKL

Query:  YAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDGLLTEKNT
        Y VPQEML IL S NLPPLTLFGRARLA EDA LL EEVSYPS E EPVEAVGD VA+LSIHDA  +LSG+S  NSK+EV+ E KHEKF ND L  + N 
Subjt:  YAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDGLLTEKNT

Query:  SLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLYRS
        SL S+  +  +SE+KV+S NN N PE  + +VEE HVHEQ  RVK  EL D+C+D S I ETH GK+VN+DRIAGCSNSHLLSVGSSKGFND+FRKLYRS
Subjt:  SLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLYRS

Query:  MNVPVPEPLPSLVELMKSRKRAKRNAYF
        MNVPVPEPLPSLVELMKSRKR KRN YF
Subjt:  MNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1G8U0 uncharacterized protein LOC111451874 isoform X10.0e+0086.51Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTNFQ EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLDASVIEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLTLFGRARLA +DA++L EEVSYPS E EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF ND  LL ++
Subjt:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
        + S CSDR R H SEV+VVSMNN NPPEAVS EVEELHVHEQ  R KG++ LD CED  T+ +TH GKLV DDR+   SNSH+LSVGSSKGFNDRFRKLY
Subjt:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1G997 uncharacterized protein LOC111451874 isoform X20.0e+0085.24Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDPDG FTVTVFDAHHCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTNFQ EPL+IRPSTQWYVREE SEICNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKK S SSLTSNGLIWKLFG+AEESSSDLDASVIEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLTLFGRARLA +DA++L EEVSYPS E EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF ND  LL ++
Subjt:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
        + S CSDR R H SEV+VVSMNN NPPEAVS EVEELHVHEQ  R KG++ LD CED  T+ +TH GKLV DDR+   SNSH+LSVGSSKGFNDRFRKLY
Subjt:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1L262 uncharacterized protein LOC111499723 isoform X20.0e+0085.4Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDP+G FTVTVFDAHHCP 
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
               GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTNFQ EPL+IRPSTQWYVREE SE CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLDAS IEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLTLFGRARL AEDA++L EEVSYPSIE EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF N   LL ++
Subjt:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
          S CSDR R H+SEVKVVSMNN NPPEAVS EVEELH HEQ  R KGN+ LD CED  T+ ET  GKLV DDRIAGCSNSH+LSVGSSKGFN RFRKLY
Subjt:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

A0A6J1L450 protein artemis isoform X10.0e+0086.67Show/hide
Query:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG
        MPIEMP GLPFSVDTWTPSSKQKRHHFLTHAH DHT GI AAHSSFPI+STF+TKSIVLQ FPQLHDSLFVCIEVGQ LVVKDP+G FTVTVFDAHHCPG
Subjt:  MPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGI-AAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPG

Query:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV
        AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQ+INCIWKHPDAPLVYLIC+ LGQEDILQQV
Subjt:  AVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQV

Query:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG
        SQTFGSKIFVDEY KAGYKALELIDPDILTQDPSSRFHLL GFPKLCQ+AKALLA+AQTNFQ EPL+IRPSTQWYVREE SE CNTRKQIISEAIKDQHG
Subjt:  SQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHG

Query:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK
        IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVK K SCSSLTS+GLIWKLFG+AEESSSDLDAS IEV CSPIVE  T KDMDPQLQP K
Subjt:  IWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVSCSPIVEAPTQKDMDPQLQPVK

Query:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK
        LYAVP+E L IL   NLPPLTLFGRARL AEDA++L EEVSYPSIE EPVEAVGDKVADLSIHDAN R S +  ++SKNEVNS+ KHEKF N   LL ++
Subjt:  LYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEKFVNDG-LLTEK

Query:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
          S CSDR R H+SEVKVVSMNN NPPEAVS EVEELH HEQ  R KGN+ LD CED  T+ ET  GKLV DDRIAGCSNSH+LSVGSSKGFN RFRKLY
Subjt:  NTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY

Query:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF
        RSMNV VPEPLPSLVELMKSRKRAKRNAYF
Subjt:  RSMNVPVPEPLPSLVELMKSRKRAKRNAYF

SwissProt top hitse value%identityAlignment
D2H8V8 5' exonuclease Apollo7.2e-2634.68Show/hide
Query:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCPGAVMFLF
        P +VD W+   +   R  FL+H H DHT G+++  + P+Y + +T  +V +   Q+       +EVG++ V+  D  G    TVT+ DA+HCPG+VMFLF
Subjt:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCPGAVMFLF

Query:  EGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKL----DLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQ
        EG FG IL+TGD R TP  L            KEP  KL      ++LD T        PSR  A  QI+  I KHP   +   + + LG+E +L+Q++ 
Subjt:  EGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKL----DLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQ

Query:  TFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSA
         F + + +   R    + L L D   L ++ + R H +D   ++C SA
Subjt:  TFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSA

Q4KLY6 5' exonuclease Apollo1.2e-2530.09Show/hide
Query:  IEMPQGLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQA-LVVKDPDG--AFTVTVFDAHHCP
        + +PQ  P +VD W+   +   R  FL+H H DHT G+++  + P+Y + +T  ++ ++  Q+       +E+G++ +++ D  G    TVT+ DA+HCP
Subjt:  IEMPQGLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQA-LVVKDPDG--AFTVTVFDAHHCP

Query:  GAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQ
        G+VMFLFEG FG IL+TGD R TP  L+  P    GK       ++  ++LD T        PSR  A  QII  I + P   +   + + LG+E +L+Q
Subjt:  GAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQ

Query:  VSQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQH
        ++  F + + +   R    + L L D     ++ + R H +D   ++C SA                      QW     +  I  T ++I S       
Subjt:  VSQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQH

Query:  GIWHVCYSMHSSKEELEWALQILAPKWVV
         I+ + YS HSS  EL   +  L P  VV
Subjt:  GIWHVCYSMHSSKEELEWALQILAPKWVV

Q5QJC3 5' exonuclease Apollo5.5e-2635.32Show/hide
Query:  GLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE
        G P +VD W+   +   R  FL+H H DHT G+++  S P+Y + LT  ++  +  ++       +EVGQ+  V +     TVT+ DA+HCPG+VMFLFE
Subjt:  GLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE

Query:  GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSK
        G FG IL+TGD R +P  +Q  P      SG+    ++D ++LD T  R     PSR  A  Q    I +HP   +V  + + LG+E++L  ++  FG+ 
Subjt:  GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSK

Query:  IFVDEYRKAGYKALELIDPDIL-TQDPSSRFHLLD
        + V   R    + LEL  P++  T++ + R H +D
Subjt:  IFVDEYRKAGYKALELIDPDIL-TQDPSSRFHLLD

Q8C7W7 5' exonuclease Apollo5.5e-2631Show/hide
Query:  IEMPQGLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCP
        + +PQ  P +VD W+   +   R  FLTH H DHT G+++  + P+Y + +T + +L +  Q+       +EVG++ V+  D  G    TVT+ DA+HCP
Subjt:  IEMPQGLPFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCP

Query:  GAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQ
        G+VMFLFEG FG IL+TGD R TP  L+  P    GK       ++  ++LD T        PSR  A  QI+  I + P   +   + + LG+E +L+Q
Subjt:  GAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQ

Query:  VSQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQH
        ++  F + + +   R    + L L D     ++ + R H +D   ++C SA                      QW     +  I  T +++ S       
Subjt:  VSQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQH

Query:  GIWHVCYSMHSSKEELEWALQILAPKWVV
         I+ V YS HSS  EL   +  L P  VV
Subjt:  GIWHVCYSMHSSKEELEWALQILAPKWVV

Q9H816 5' exonuclease Apollo2.5e-2634.16Show/hide
Query:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCPGAVMFLF
        P +VD W+   +   R  FL+H H DHT G+++  + P+Y + +T  + L +  Q+       +EVG++ V+  D  G    TVT+ DA+HCPG+VMFLF
Subjt:  PFSVDTWT-PSSKQKRHHFLTHAHRDHTTGIAAHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVK-DPDG--AFTVTVFDAHHCPGAVMFLF

Query:  EGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGS
        EG FG IL+TGD R TP  L+  P    GK       ++  ++LD T        PSR  A HQI+  I KHP   +   + + LG+E +L+Q++  F +
Subjt:  EGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGS

Query:  KIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQS
         + +   R    + L L D     ++ + R H +D   ++C S
Subjt:  KIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQS

Arabidopsis top hitse value%identityAlignment
AT1G19025.1 DNA repair metallo-beta-lactamase family protein4.3e-14344.65Show/hide
Query:  MPIEMPQGLPFSVDT---WTPSSKQKRHHFLTHAHRDHTTGIAAHS--SFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAH
        M IEMP+GLPF+VDT   +T + ++KRHHFLTHAH+DHT G++  +   FPIYST LT S++LQ+FPQL +S FV +E+GQ+++V DPDG F VT FDA+
Subjt:  MPIEMPQGLPFSVDT---WTPSSKQKRHHFLTHAHRDHTTGIAAHS--SFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAH

Query:  HCPGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKS-GKEPRCKLDLIFLDCTFGR--FFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQ
        HCPGAVMFLFEG+FGNILHTGDCRLT +CL +LPEKY G+S G +P+C L  IFLDCTFG+    Q+FP++HSAI QIINCIW HPDAP+VYL C++LGQ
Subjt:  HCPGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKS-GKEPRCKLDLIFLDCTFGR--FFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQ

Query:  EDILQQVSQTFGSKIFVDEYRKAG-YKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREE-----SSEICNTR
        ED+L +VS+TFGSKI+VD+      +++L +I P+I+++DPSSRFH+  GFPKL +   A LA+A++  QSEPLIIRPS QWYV ++     S  I   R
Subjt:  EDILQQVSQTFGSKIFVDEYRKAG-YKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREE-----SSEICNTR

Query:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLDASVIEVSCSPIVE
        K   SEA+KD+ G+WHVCYSMHSS+ ELE A+Q+L+PKWVVST P CRAM+L+YVKK    S  + +   WKL  I  E+S  +  D   + +SC  + E
Subjt:  KQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESS--SDLDASVIEVSCSPIVE

Query:  APTQKDMDPQLQPV-KLYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSE
                 +L+PV +  +  +++L + P  +L P+TLFGRAR ++++ D L                                                
Subjt:  APTQKDMDPQLQPV-KLYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSE

Query:  EKHEKFVNDGLLTEKNTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLS
          HE+ V         TS   +++      VKVV        E +  + +E  V ++   +            ST  ET                     
Subjt:  EKHEKFVNDGLLTEKNTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLS

Query:  VGSSKGFNDRFRKLYRSMNVPVPEPLPSLVELMKSRKRAKRNAYF
            K  +   RKLYRSMN PVP PLPSL+ELM +RKR++ +  F
Subjt:  VGSSKGFNDRFRKLYRSMNVPVPEPLPSLVELMKSRKRAKRNAYF

AT1G27410.1 DNA repair metallo-beta-lactamase family protein4.6e-2829.41Show/hide
Query:  MPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEV--GQALVVKDPDGAFTVTV----FDAHHC
        M  GL  SVD W   S+    +FLTH H DHT G++   S  P+Y +  T S+   +FP    SL   + +    +L ++ P    TV +     DAHHC
Subjt:  MPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEV--GQALVVKDPDGAFTVTV----FDAHHC

Query:  PGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQ
        PG++MFLF G+FG  L+TGD R   +              + P   +D+++LD T+      FPSR  A+  + + I  HP   ++  + + LG+ED+L 
Subjt:  PGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQ

Query:  QVSQTFGSKIFVDEYRKAGYKALELID-PDILTQDPS-SRFHLLDGFPKLCQSAKALLADAQT---NFQSEPLIIRP-------STQWYVREESSEICNT
         VS+    KI+V   R    + + L+   DI T D S +R   +  +    Q+ + L     T        P + RP       S  +      +E  + 
Subjt:  QVSQTFGSKIFVDEYRKAGYKALELID-PDILTQDPS-SRFHLLDGFPKLCQSAKALLADAQT---NFQSEPLIIRP-------STQWYVREESSEICNT

Query:  RKQIISEAIKDQHG-IWHVCYSMHSSKEELEWALQILAPK
        +K++ + A+   H  ++ V YS HS  EE+   ++++ PK
Subjt:  RKQIISEAIKDQHG-IWHVCYSMHSSKEELEWALQILAPK

AT1G66730.1 DNA LIGASE 65.3e-1624.92Show/hide
Query:  FSVDTW-TPSSKQKRHHFLTHAHRDHTTGIAAH-SSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE--
        F VD +  P        FL+H H DH +G+++  S   IY +  T  +V  +  Q+       + + Q + +   DG+  V + +A+HCPGAV FLF+  
Subjt:  FSVDTW-TPSSKQKRHHFLTHAHRDHTTGIAAH-SSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE--

Query:  ---GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICN-LLGQEDILQQVSQT
             F   +HTGD R   E          G  G       D +FLD T+      FPS+  ++  +++ I K  +  +++L+   ++G+E IL ++++ 
Subjt:  ---GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAPLVYLICN-LLGQEDILQQVSQT

Query:  FGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNF--QSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHGI
           KI VD  + +    L   +  + T+D +     + G+  L ++        + NF   +E ++ +   +      +      ++   +   KD   I
Subjt:  FGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNF--QSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVVST
          V YS HS+ +EL   ++ L PK V+ T
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVVST

AT2G45700.1 sterile alpha motif (SAM) domain-containing protein1.5e-1825.99Show/hide
Query:  GLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE
        G PF VD +   ++   H FLTH H DH  G+  + S   IY + +T  +V  +     + L V +++GQ + +   D    VT FDA+HCPG++M LFE
Subjt:  GLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFE

Query:  -GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAP-LVYLICN-LLGQEDILQQVSQTF
          N   +LHTGD R + E    L   +           +  + LD T+      FP + + I  ++  I      P  ++LI +  +G+E +  +V++  
Subjt:  -GNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQIINCIWKHPDAP-LVYLICN-LLGQEDILQQVSQTF

Query:  GSKIFVDEYRKAGYKALELIDPDI---LTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHGI
          KI+++  +    + L     DI     ++  S  H++  +          +A+  TN  S  LI+  S   +   ++ +    R+      I+     
Subjt:  GSKIFVDEYRKAGYKALELIDPDI---LTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIRPSTQWYVREESSEICNTRKQIISEAIKDQHGI

Query:  WHVCYSMHSSKEELEWALQILAPKWVV
        + V YS HSS  EL+  +Q ++P+ ++
Subjt:  WHVCYSMHSSKEELEWALQILAPKWVV

AT3G26680.1 DNA repair metallo-beta-lactamase family protein5.3e-1626.1Show/hide
Query:  SSSAIGIPPQNSKMPIEMPQ---------GLPFSVDTWTPSSKQK-RHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQ
        ++SA+G   ++S    + P+         G PF+VD +     Q    +FLTH H DH  G+  A S  PIY + LT S +L+    ++ S    +E   
Subjt:  SSSAIGIPPQNSKMPIEMPQ---------GLPFSVDTWTPSSKQK-RHHFLTHAHRDHTTGIA-AHSSFPIYSTFLTKSIVLQQFPQLHDSLFVCIEVGQ

Query:  ALVVKDPDGAFTVTVFDAHHCPGAVMFLFEGNFGN-ILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQII----
         L V+       VT+ +A+HCPGA +  F    G   LHTGD R + + +Q  P  +          ++ +++LD T+     +FPS+   +  ++    
Subjt:  ALVVKDPDGAFTVTVFDAHHCPGAVMFLFEGNFGN-ILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQQFPSRHSAIHQII----

Query:  NCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKIFVDEYRKAGYKAL--ELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEP----L
        + + K P   L+ +    +G+E +   +++  G KIF +  R+   ++   + I  ++ T   ++  H+L        S K    D       E     L
Subjt:  NCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKIFVDEYRKAGYKAL--ELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEP----L

Query:  IIRPSTQWYVREESSEICNTRKQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVST
          RP T W   E+  E       +I    + +  I+ V YS HSS  EL   +Q L P  ++ T
Subjt:  IIRPSTQWYVREESSEICNTRKQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCCCAACAGTTGAAAACGACAAAAGTAAGTCTGAAGCTTCCTGATGACTTCAGATTCACCAATGAGACGACGTCGTCGGCTCCTGTAAAGCATGTCATGCTCAA
ATCCCCATTTCTTCAAGGGCTCATAGTTTCATTTTCGTCTTCCTCTGCCATCGGAATTCCCCCTCAGAACTCCAAGATGCCGATCGAAATGCCCCAAGGGCTGCCATTCT
CGGTGGATACATGGACTCCATCTTCCAAGCAAAAGCGCCACCATTTTCTGACGCACGCTCACAGGGATCATACCACTGGAATTGCCGCCCATTCTTCCTTCCCCATTTAT
TCTACTTTTCTCACCAAATCCATCGTTCTTCAGCAGTTCCCTCAGCTTCATGATTCGTTGTTTGTATGTATCGAGGTGGGGCAAGCGCTGGTCGTCAAAGATCCTGATGG
AGCTTTCACTGTTACAGTTTTCGATGCTCATCACTGCCCTGGAGCTGTTATGTTCTTATTTGAAGGCAATTTTGGAAATATTCTGCATACGGGTGATTGCAGACTAACTC
CTGAGTGCCTACAGAACTTACCTGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCGAGATGTAAACTGGATCTGATTTTTCTAGATTGCACATTTGGTAGATTCTTTCAA
CAATTCCCCAGCAGGCATTCAGCAATACATCAGATTATTAATTGCATATGGAAACATCCCGATGCTCCTTTAGTATATCTGATTTGCAATCTTCTAGGACAGGAAGATAT
ATTGCAACAAGTGTCCCAAACATTTGGTTCAAAGATATTTGTTGATGAGTACAGGAAAGCAGGTTACAAGGCTCTTGAACTTATAGATCCTGACATTCTCACTCAAGATC
CATCCTCCCGCTTCCATTTGCTTGATGGATTCCCTAAACTATGTCAAAGTGCAAAAGCGCTGCTTGCAGATGCCCAGACCAATTTCCAGTCTGAACCTCTCATAATCCGC
CCTTCAACCCAGTGGTATGTTCGTGAGGAATCGTCAGAGATTTGCAACACAAGGAAACAAATAATTAGTGAAGCAATTAAAGATCAGCATGGTATTTGGCATGTCTGTTA
CTCGATGCATTCATCGAAGGAAGAACTAGAATGGGCCTTGCAAATTTTAGCACCAAAATGGGTTGTTTCAACCACGCCTGGTTGTCGGGCCATGGATTTGGATTACGTGA
AAAAGAAACTCAGTTGTTCTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTATAGCTGAGGAAAGCTCTTCAGATTTAGATGCTTCAGTGATTGAAGTGAGC
TGTTCCCCTATAGTTGAAGCACCCACTCAAAAAGATATGGACCCTCAACTACAGCCTGTGAAACTATATGCTGTTCCTCAAGAAATGTTAAAAATTTTGCCTTCAGGCAA
CTTGCCACCTCTCACATTATTCGGCCGAGCTAGACTTGCCGCTGAAGATGCCGATTTGTTGCAGGAAGAAGTTTCATATCCTTCTATAGAGACTGAGCCTGTAGAAGCAG
TTGGAGATAAAGTAGCAGACTTGTCCATTCATGATGCAAACAATAGACTGAGTGGCGAATCACCAGAAAATTCTAAAAACGAAGTTAACTCCGAAGAAAAACACGAGAAG
TTTGTGAATGATGGGTTATTAACCGAGAAAAACACCTCTCTTTGCTCCGACCGGATTAGATTCCATGTTTCTGAAGTAAAAGTTGTGTCCATGAATAACGCTAACCCACC
AGAAGCAGTGAGCAGAGAGGTAGAAGAACTTCATGTCCATGAGCAAGGAATTAGAGTAAAGGGAAACGAGTTGTTAGACCATTGTGAAGATAACAGTACTATTGCCGAAA
CACACTTTGGGAAGTTAGTAAATGATGACAGAATAGCAGGCTGTAGTAATTCACATCTTTTAAGTGTTGGATCTTCAAAGGGTTTTAATGACAGGTTTAGAAAGCTGTAC
AGGTCAATGAATGTCCCTGTGCCTGAGCCTCTTCCTTCTCTGGTGGAGCTTATGAAATCAAGAAAACGGGCAAAGAGGAATGCATATTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCCCCAACAGTTGAAAACGACAAAAGTAAGTCTGAAGCTTCCTGATGACTTCAGATTCACCAATGAGACGACGTCGTCGGCTCCTGTAAAGCATGTCATGCTCAA
ATCCCCATTTCTTCAAGGGCTCATAGTTTCATTTTCGTCTTCCTCTGCCATCGGAATTCCCCCTCAGAACTCCAAGATGCCGATCGAAATGCCCCAAGGGCTGCCATTCT
CGGTGGATACATGGACTCCATCTTCCAAGCAAAAGCGCCACCATTTTCTGACGCACGCTCACAGGGATCATACCACTGGAATTGCCGCCCATTCTTCCTTCCCCATTTAT
TCTACTTTTCTCACCAAATCCATCGTTCTTCAGCAGTTCCCTCAGCTTCATGATTCGTTGTTTGTATGTATCGAGGTGGGGCAAGCGCTGGTCGTCAAAGATCCTGATGG
AGCTTTCACTGTTACAGTTTTCGATGCTCATCACTGCCCTGGAGCTGTTATGTTCTTATTTGAAGGCAATTTTGGAAATATTCTGCATACGGGTGATTGCAGACTAACTC
CTGAGTGCCTACAGAACTTACCTGAGAAGTATCGTGGAAAAAGTGGTAAAGAGCCGAGATGTAAACTGGATCTGATTTTTCTAGATTGCACATTTGGTAGATTCTTTCAA
CAATTCCCCAGCAGGCATTCAGCAATACATCAGATTATTAATTGCATATGGAAACATCCCGATGCTCCTTTAGTATATCTGATTTGCAATCTTCTAGGACAGGAAGATAT
ATTGCAACAAGTGTCCCAAACATTTGGTTCAAAGATATTTGTTGATGAGTACAGGAAAGCAGGTTACAAGGCTCTTGAACTTATAGATCCTGACATTCTCACTCAAGATC
CATCCTCCCGCTTCCATTTGCTTGATGGATTCCCTAAACTATGTCAAAGTGCAAAAGCGCTGCTTGCAGATGCCCAGACCAATTTCCAGTCTGAACCTCTCATAATCCGC
CCTTCAACCCAGTGGTATGTTCGTGAGGAATCGTCAGAGATTTGCAACACAAGGAAACAAATAATTAGTGAAGCAATTAAAGATCAGCATGGTATTTGGCATGTCTGTTA
CTCGATGCATTCATCGAAGGAAGAACTAGAATGGGCCTTGCAAATTTTAGCACCAAAATGGGTTGTTTCAACCACGCCTGGTTGTCGGGCCATGGATTTGGATTACGTGA
AAAAGAAACTCAGTTGTTCTAGTTTAACTTCCAATGGCCTAATCTGGAAGCTTTTTGGTATAGCTGAGGAAAGCTCTTCAGATTTAGATGCTTCAGTGATTGAAGTGAGC
TGTTCCCCTATAGTTGAAGCACCCACTCAAAAAGATATGGACCCTCAACTACAGCCTGTGAAACTATATGCTGTTCCTCAAGAAATGTTAAAAATTTTGCCTTCAGGCAA
CTTGCCACCTCTCACATTATTCGGCCGAGCTAGACTTGCCGCTGAAGATGCCGATTTGTTGCAGGAAGAAGTTTCATATCCTTCTATAGAGACTGAGCCTGTAGAAGCAG
TTGGAGATAAAGTAGCAGACTTGTCCATTCATGATGCAAACAATAGACTGAGTGGCGAATCACCAGAAAATTCTAAAAACGAAGTTAACTCCGAAGAAAAACACGAGAAG
TTTGTGAATGATGGGTTATTAACCGAGAAAAACACCTCTCTTTGCTCCGACCGGATTAGATTCCATGTTTCTGAAGTAAAAGTTGTGTCCATGAATAACGCTAACCCACC
AGAAGCAGTGAGCAGAGAGGTAGAAGAACTTCATGTCCATGAGCAAGGAATTAGAGTAAAGGGAAACGAGTTGTTAGACCATTGTGAAGATAACAGTACTATTGCCGAAA
CACACTTTGGGAAGTTAGTAAATGATGACAGAATAGCAGGCTGTAGTAATTCACATCTTTTAAGTGTTGGATCTTCAAAGGGTTTTAATGACAGGTTTAGAAAGCTGTAC
AGGTCAATGAATGTCCCTGTGCCTGAGCCTCTTCCTTCTCTGGTGGAGCTTATGAAATCAAGAAAACGGGCAAAGAGGAATGCATATTTCTAG
Protein sequenceShow/hide protein sequence
MDPQQLKTTKVSLKLPDDFRFTNETTSSAPVKHVMLKSPFLQGLIVSFSSSSAIGIPPQNSKMPIEMPQGLPFSVDTWTPSSKQKRHHFLTHAHRDHTTGIAAHSSFPIY
STFLTKSIVLQQFPQLHDSLFVCIEVGQALVVKDPDGAFTVTVFDAHHCPGAVMFLFEGNFGNILHTGDCRLTPECLQNLPEKYRGKSGKEPRCKLDLIFLDCTFGRFFQ
QFPSRHSAIHQIINCIWKHPDAPLVYLICNLLGQEDILQQVSQTFGSKIFVDEYRKAGYKALELIDPDILTQDPSSRFHLLDGFPKLCQSAKALLADAQTNFQSEPLIIR
PSTQWYVREESSEICNTRKQIISEAIKDQHGIWHVCYSMHSSKEELEWALQILAPKWVVSTTPGCRAMDLDYVKKKLSCSSLTSNGLIWKLFGIAEESSSDLDASVIEVS
CSPIVEAPTQKDMDPQLQPVKLYAVPQEMLKILPSGNLPPLTLFGRARLAAEDADLLQEEVSYPSIETEPVEAVGDKVADLSIHDANNRLSGESPENSKNEVNSEEKHEK
FVNDGLLTEKNTSLCSDRIRFHVSEVKVVSMNNANPPEAVSREVEELHVHEQGIRVKGNELLDHCEDNSTIAETHFGKLVNDDRIAGCSNSHLLSVGSSKGFNDRFRKLY
RSMNVPVPEPLPSLVELMKSRKRAKRNAYF