; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC04G076700 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC04G076700
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptiontyrosyl-DNA phosphodiesterase 1
Genome locationCicolChr04:31821604..31832917
RNA-Seq ExpressionCcUC04G076700
SyntenyCcUC04G076700
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003690 - double-stranded DNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0017005 - 3'-tyrosyl-DNA phosphodiesterase activity (molecular function)
InterPro domainsIPR000253 - Forkhead-associated (FHA) domain
IPR008984 - SMAD/FHA domain superfamily
IPR010347 - Tyrosyl-DNA phosphodiesterase I
IPR041388 - PNK, FHA domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN49134.1 hypothetical protein Csa_003275 [Cucumis sativus]0.0e+0086.38Show/hide
Query:  MCSVSARRFSTTT---NNQLRRSPAFLAPQPPFSLSSPQILQYSSSPFKPLSLNHRHPQLSSPLASSGPSP------LCSAMARLQKVGYLVPLDKNLEE
        MCS   RRFST +   +N+LRRS AFLA +PP+S S P+IL YSSSPFK LSL +R  QL SPL SS PSP      LC  MARLQ VGYLVPLDKNLE 
Subjt:  MCSVSARRFSTTT---NNQLRRSPAFLAPQPPFSLSSPQILQYSSSPFKPLSLNHRHPQLSSPLASSGPSP------LCSAMARLQKVGYLVPLDKNLEE

Query:  DNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELIPGHYLFKYASHCFNTRPSS
        DNSGLKI LSEGPN IGRSNVLVS+KRISRKHITLT STDG AKLLVEG NPVVIN  DGRKKLG R SV+IRDGDVIELIPGHY FKYASHCFN+RP S
Subjt:  DNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELIPGHYLFKYASHCFNTRPSS

Query:  EDLGQKRVRQVA-DDIFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDIVQGDILFAVLSNYMV
        EDLGQKRVRQVA D I ER AKR EMGSPLEN+QSG SQ +E NSVEAIRNF + DDRLPMTFRLLSVKGLPPWANTSCVRITDI+QGDILFAVLSNYMV
Subjt:  EDLGQKRVRQVA-DDIFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDIVQGDILFAVLSNYMV

Query:  DIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS
        DIDWLIPACPALAKVP VLVIHGE DGT+DNMKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS
Subjt:  DIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS

Query:  STRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKEFQRSPLVYQFSSLGS
        S+RGCAFEDDLVDYL+ALKWPEF A+FP  GN NINP FFRKFDYS AAVRLIASVPGYHTGRYLKKWGHMKLRSVLQEC+FDKEFQRSPLVYQFSSLGS
Subjt:  STRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKEFQRSPLVYQFSSLGS

Query:  LDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHIKTFARYNGQKLAWLV
        L+EKWMAEFAASLSSG + DKTPLGLGEPLIVWPTVEDVRCSLEGYAAG+A+PSPLKNVEKGFL KYWAKW S+HSGRCHAMPHIKTFARYNGQKLAWLV
Subjt:  LDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHIKTFARYNGQKLAWLV

Query:  LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNK---SRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQP
        LTSSNLS+AAWGALQKNNSQLMIRSYELGVLFLPQKR+D SFSCTK+GGSAQNK   SRPSE LE KTELVTLAWQEN+KRESLSEVIQLPIPYELPPQP
Subjt:  LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNK---SRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQP

Query:  YGPQDVPWSWDRRYTQKDVHGAVWPRQ
        YGP+DVPWSW+RRYTQKDVHGAVWPRQ
Subjt:  YGPQDVPWSWDRRYTQKDVHGAVWPRQ

XP_022978031.1 tyrosyl-DNA phosphodiesterase 1 isoform X2 [Cucurbita maxima]0.0e+0091.83Show/hide
Query:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELI
        MARLQ VGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKL VEGANPVVIN  DGRKKLG R SV++RDG+VIELI
Subjt:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELI

Query:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVR
        PGHYLFKYA+HC NTRP SEDLGQKRVRQ+ DD I E KAKRVEMG PLEN Q+GISQ ++DNSVEAIRNF V DD+LPMTFRLLSVKGLPPWANTSCVR
Subjt:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVR

Query:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
        I+D++QGDILFAVLSNYMVD+DWLIPACP LAKVPHVLVIHGE DGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
Subjt:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW

Query:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV
        NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGN N+NPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLRSVLQECV
Subjt:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV

Query:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
        FDKEF+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNV+KGFLRKYWAKWKSYHSGRCHA
Subjt:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA

Query:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
        MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQEN+KRESL
Subjt:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        SEVIQLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQV LYAS
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

XP_023544483.1 tyrosyl-DNA phosphodiesterase 1 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0092.4Show/hide
Query:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELIPGHY
        +KVGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKLLVEGANPVVIN  DGRKKLG R SV++RDG+VIELIPGHY
Subjt:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELIPGHY

Query:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDI
        LFKYA+HC NTRP SEDLGQKRVR++ DD I ERKAKRVEMG PLEN Q+GISQ ++DNSVEAIRNF V DD+LPMTFRLLSVKGLPPWANTSCVRI+D+
Subjt:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDI

Query:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
        +QGDILFAVLSNYMVDIDWLIPACP LAKVPHVLV HGE DGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIR+VVHTANLIYVDWNNKS
Subjt:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS

Query:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE
        QGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGN NINPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLRSVLQECVFDKE
Subjt:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE

Query:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI
        F+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI
Subjt:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI

Query:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI
        KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI
Subjt:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI

Query:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        QLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQVQLYAS
Subjt:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

XP_023544484.1 tyrosyl-DNA phosphodiesterase 1 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0092.3Show/hide
Query:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELI
        MAR Q VGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKLLVEGANPVVIN  DGRKKLG R SV++RDG+VIELI
Subjt:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELI

Query:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVR
        PGHYLFKYA+HC NTRP SEDLGQKRVR++ DD I ERKAKRVEMG PLEN Q+GISQ ++DNSVEAIRNF V DD+LPMTFRLLSVKGLPPWANTSCVR
Subjt:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVR

Query:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
        I+D++QGDILFAVLSNYMVDIDWLIPACP LAKVPHVLV HGE DGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIR+VVHTANLIYVDW
Subjt:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW

Query:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV
        NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGN NINPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLRSVLQECV
Subjt:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV

Query:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
        FDKEF+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
Subjt:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA

Query:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
        MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
Subjt:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        SEVIQLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQVQLYAS
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

XP_038881476.1 tyrosyl-DNA phosphodiesterase 1 isoform X1 [Benincasa hispida]0.0e+0093.84Show/hide
Query:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELI
        MARLQ VGYLVPLDKNLEEDN GLKIPLSEGPN +GRSNVLVSD+RISRKHITLT STDG AKLLVEG NPVVIN  DGRKKLG R SVVIRDGDVIELI
Subjt:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELI

Query:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVR
        PGHYLFKYASHCF+TRPS EDLGQKRVRQVADD I ERKAKRVEM SP ENLQSG SQ +EDNSVEAIRNF + DDRLPMTFRLLSVKGLPPWANTSCVR
Subjt:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVR

Query:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
        ITDI+QGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGE DGT+DNMKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
Subjt:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW

Query:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV
        NNKSQGLWMQDFPWKDQNSSTRGC FEDDLVDYL+ALKWPEF ANFPA GN NINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQEC+
Subjt:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV

Query:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
        FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFL+KYWAKWKSYHSGRCHA
Subjt:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA

Query:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
        MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHD SFSCTKSGGSA NK RPSENLEEKTELVTLAWQENRK+ESL
Subjt:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        SEVIQLPIPY+LPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

TrEMBL top hitse value%identityAlignment
A0A0A0KJY5 FHA domain-containing protein0.0e+0086.38Show/hide
Query:  MCSVSARRFSTTT---NNQLRRSPAFLAPQPPFSLSSPQILQYSSSPFKPLSLNHRHPQLSSPLASSGPSP------LCSAMARLQKVGYLVPLDKNLEE
        MCS   RRFST +   +N+LRRS AFLA +PP+S S P+IL YSSSPFK LSL +R  QL SPL SS PSP      LC  MARLQ VGYLVPLDKNLE 
Subjt:  MCSVSARRFSTTT---NNQLRRSPAFLAPQPPFSLSSPQILQYSSSPFKPLSLNHRHPQLSSPLASSGPSP------LCSAMARLQKVGYLVPLDKNLEE

Query:  DNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELIPGHYLFKYASHCFNTRPSS
        DNSGLKI LSEGPN IGRSNVLVS+KRISRKHITLT STDG AKLLVEG NPVVIN  DGRKKLG R SV+IRDGDVIELIPGHY FKYASHCFN+RP S
Subjt:  DNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELIPGHYLFKYASHCFNTRPSS

Query:  EDLGQKRVRQVA-DDIFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDIVQGDILFAVLSNYMV
        EDLGQKRVRQVA D I ER AKR EMGSPLEN+QSG SQ +E NSVEAIRNF + DDRLPMTFRLLSVKGLPPWANTSCVRITDI+QGDILFAVLSNYMV
Subjt:  EDLGQKRVRQVA-DDIFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDIVQGDILFAVLSNYMV

Query:  DIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS
        DIDWLIPACPALAKVP VLVIHGE DGT+DNMKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS
Subjt:  DIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS

Query:  STRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKEFQRSPLVYQFSSLGS
        S+RGCAFEDDLVDYL+ALKWPEF A+FP  GN NINP FFRKFDYS AAVRLIASVPGYHTGRYLKKWGHMKLRSVLQEC+FDKEFQRSPLVYQFSSLGS
Subjt:  STRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKEFQRSPLVYQFSSLGS

Query:  LDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHIKTFARYNGQKLAWLV
        L+EKWMAEFAASLSSG + DKTPLGLGEPLIVWPTVEDVRCSLEGYAAG+A+PSPLKNVEKGFL KYWAKW S+HSGRCHAMPHIKTFARYNGQKLAWLV
Subjt:  LDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHIKTFARYNGQKLAWLV

Query:  LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNK---SRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQP
        LTSSNLS+AAWGALQKNNSQLMIRSYELGVLFLPQKR+D SFSCTK+GGSAQNK   SRPSE LE KTELVTLAWQEN+KRESLSEVIQLPIPYELPPQP
Subjt:  LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNK---SRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQP

Query:  YGPQDVPWSWDRRYTQKDVHGAVWPRQ
        YGP+DVPWSW+RRYTQKDVHGAVWPRQ
Subjt:  YGPQDVPWSWDRRYTQKDVHGAVWPRQ

A0A6J1GE46 tyrosyl-DNA phosphodiesterase 1 isoform X20.0e+0091.99Show/hide
Query:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELI
        MARLQ VGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKLLVEGANPVVIN  DGRKKLG R SV++RDG+VIELI
Subjt:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELI

Query:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVR
        PGHYLFKYA+HC NTRP SEDLGQKRVR++ DD I ERKAKRVEMG  LEN Q+GISQ ++DNSVEAIRNF V DD+LPMTFRLLSVKGLPPWANTSCVR
Subjt:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVR

Query:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
        I+D++QGDILFAVLSNYMVDIDWLIPACP LAKVPHVLV HGE DGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIR+VVHTANLIYVDW
Subjt:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW

Query:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV
        NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGN NINPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLR VLQECV
Subjt:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV

Query:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
        FDKEF+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
Subjt:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA

Query:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
        MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRK ESL
Subjt:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        SEVIQLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQVQLYAS
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

A0A6J1GF28 tyrosyl-DNA phosphodiesterase 1 isoform X10.0e+0091.94Show/hide
Query:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELIPGHY
        +KVGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKLLVEGANPVVIN  DGRKKLG R SV++RDG+VIELIPGHY
Subjt:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELIPGHY

Query:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDI
        LFKYA+HC NTRP SEDLGQKRVR++ DD I ERKAKRVEMG  LEN Q+GISQ ++DNSVEAIRNF V DD+LPMTFRLLSVKGLPPWANTSCVRI+D+
Subjt:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDI

Query:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
        +QGDILFAVLSNYMVDIDWLIPACP LAKVPHVLV HGE DGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIR+VVHTANLIYVDWNNKS
Subjt:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS

Query:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE
        QGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGN NINPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLR VLQECVFDKE
Subjt:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE

Query:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI
        F+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI
Subjt:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI

Query:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI
        KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRK ESLSEVI
Subjt:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI

Query:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        QLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQVQLYAS
Subjt:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

A0A6J1ILM7 tyrosyl-DNA phosphodiesterase 1 isoform X20.0e+0091.83Show/hide
Query:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELI
        MARLQ VGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKL VEGANPVVIN  DGRKKLG R SV++RDG+VIELI
Subjt:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELI

Query:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVR
        PGHYLFKYA+HC NTRP SEDLGQKRVRQ+ DD I E KAKRVEMG PLEN Q+GISQ ++DNSVEAIRNF V DD+LPMTFRLLSVKGLPPWANTSCVR
Subjt:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVR

Query:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
        I+D++QGDILFAVLSNYMVD+DWLIPACP LAKVPHVLVIHGE DGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
Subjt:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW

Query:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV
        NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGN N+NPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLRSVLQECV
Subjt:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV

Query:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
        FDKEF+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNV+KGFLRKYWAKWKSYHSGRCHA
Subjt:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA

Query:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
        MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQEN+KRESL
Subjt:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        SEVIQLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQV LYAS
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

A0A6J1INX7 tyrosyl-DNA phosphodiesterase 1 isoform X10.0e+0091.78Show/hide
Query:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELIPGHY
        +KVGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKL VEGANPVVIN  DGRKKLG R SV++RDG+VIELIPGHY
Subjt:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELIPGHY

Query:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDI
        LFKYA+HC NTRP SEDLGQKRVRQ+ DD I E KAKRVEMG PLEN Q+GISQ ++DNSVEAIRNF V DD+LPMTFRLLSVKGLPPWANTSCVRI+D+
Subjt:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-IFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDI

Query:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
        +QGDILFAVLSNYMVD+DWLIPACP LAKVPHVLVIHGE DGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
Subjt:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS

Query:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE
        QGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGN N+NPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLRSVLQECVFDKE
Subjt:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE

Query:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI
        F+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNV+KGFLRKYWAKWKSYHSGRCHAMPHI
Subjt:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI

Query:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI
        KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQEN+KRESLSEVI
Subjt:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI

Query:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        QLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQV LYAS
Subjt:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

SwissProt top hitse value%identityAlignment
Q4G056 Tyrosyl-DNA phosphodiesterase 14.6e-7635.96Show/hide
Query:  AKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDIVQ---GDILFAVLSNYMVDIDWLIPACPALAKVPH
        A++VE  SP ++ ++  +    + S E    + + D   P  F L  V G+    N+  + I DI+    G ++ +   NY  D++WLI   P   +   
Subjt:  AKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDIVQ---GDILFAVLSNYMVDIDWLIPACPALAKVPH

Query:  VLVIHGES-DGTVDNMKRKKP-DNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQD-FPWKDQNSSTRG---CAFEDDL
        +L++HG+  +   D   + KP  N  L +  L I+FGTHH+K + L+Y  G+R+V+HT+NLI  DW+ K+QG+W+   +P   Q + T G     F+ DL
Subjt:  VLVIHGES-DGTVDNMKRKKP-DNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQD-FPWKDQNSSTRG---CAFEDDL

Query:  VDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQ-ECVFDKEFQRSPLVYQFSSLGSL---DEKWM-
          YL A   P                   ++ D S   V LI S PG   G +   WGH +LR +LQ         +  P+V QFSS+GSL   + KW+ 
Subjt:  VDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQ-ECVFDKEFQRSPLVYQFSSLGSL---DEKWM-

Query:  AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNVEK-GFLRKYWAKWKSYHSGRCHAMPHIKTFARYNG--QKLAWLVLT
        +EF  SL +     +TP     PL +++P+VE+VR SLEGY AG ++P  ++  EK  +L  Y+ KW +  SGR +AMPHIKT+ R +    KLAW ++T
Subjt:  AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNVEK-GFLRKYWAKWKSYHSGRCHAMPHIKTFARYNG--QKLAWLVLT

Query:  SSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQPYGPQD
        S+NLSKAAWGAL+KN +QLMIRSYELGVLFLP      +F                        L T   ++     S   +   P+PY+LPP+ YG +D
Subjt:  SSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQPYGPQD

Query:  VPWSWDRRYTQ-KDVHGAVW
         PW W+  Y +  D HG +W
Subjt:  VPWSWDRRYTQ-KDVHGAVW

Q8BJ37 Tyrosyl-DNA phosphodiesterase 13.6e-7636.15Show/hide
Query:  AKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDIVQ---GDILFAVLSNYMVDIDWLIPACPALAKVPH
        A++V   SP  +L+   +    + S E    + + D   P  F L  V G+    N+  + I DI+    G ++ +   NY  D+DWLI   P   +   
Subjt:  AKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDIVQ---GDILFAVLSNYMVDIDWLIPACPALAKVPH

Query:  VLVIHGES-DGTVDNMKRKKP-DNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQD-FPWKDQNSSTRG---CAFEDDL
        +L++HG+  +   D   + KP  N  L +  L I+FGTHH+K + L+Y  G+R+V+HT+NLI  DW+ K+QG+W+   +P  DQ S T G     F+ DL
Subjt:  VLVIHGES-DGTVDNMKRKKP-DNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQD-FPWKDQNSSTRG---CAFEDDL

Query:  VDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQ-ECVFDKEFQRSPLVYQFSSLGSL---DEKWM-
          YL A   P                   ++ D S   V LI S PG   G +   WGH +LR +LQ       + +  P+V QFSS+GSL   + KW+ 
Subjt:  VDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQ-ECVFDKEFQRSPLVYQFSSLGSL---DEKWM-

Query:  AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNVEK-GFLRKYWAKWKSYHSGRCHAMPHIKTFARYNG--QKLAWLVLT
        +EF  SL +     + P     PL +++P+VE+VR SLEGY AG ++P  ++  EK  +L  Y+ KW +  SGR +AMPHIKT+ R +    KLAW ++T
Subjt:  AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNVEK-GFLRKYWAKWKSYHSGRCHAMPHIKTFARYNG--QKLAWLVLT

Query:  SSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQPYGPQD
        S+NLSKAAWGAL+KN +QLMIRSYELGVLFLP      +F                        L T   ++     S       P+PY+LPP+ Y  +D
Subjt:  SSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQPYGPQD

Query:  VPWSWDRRYTQ-KDVHGAVW
         PW W+  Y +  D HG +W
Subjt:  VPWSWDRRYTQ-KDVHGAVW

Q8H1D9 Tyrosyl-DNA phosphodiesterase 14.1e-24263.86Show/hide
Query:  KVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCS-DG-RKKLGHRGSVVIRDGDVIELIPGH
        +V YL+PL  +L+EDNS  +I LSEGPNIIGR NV + DKR+SRKHIT+ +ST G A L V+G NPVVI  S DG RKK+     V + + D+IELIPGH
Subjt:  KVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCS-DG-RKKLGHRGSVVIRDGDVIELIPGH

Query:  YLFKYASHCFNTRPSSEDLGQKRVRQVADDIFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDI
        + FK      N R +      K+ R+  DD                              VEAIR F   +++LP TFRLLSV  LP WANTSCV I D+
Subjt:  YLFKYASHCFNTRPSSEDLGQKRVRQVADDIFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDI

Query:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
        ++GD++ A+LSNYMVDIDWL+ ACP LA +P V+VIHGE DG  + ++RKKP NWILHKP LPISFGTHHSKAIFLVYPRG+R+VVHTANLI+VDWNNKS
Subjt:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS

Query:  QGLWMQDFPWKDQNSS-TRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDK
        QGLWMQDFPWKD +    +GC FE DL+DYLN LKWPEF+AN P  GN+ IN +FF+KFDYS+A VRLIASVPGYHTG  L KWGHMKLR++LQEC+FD+
Subjt:  QGLWMQDFPWKDQNSS-TRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDK

Query:  EFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH
        EF+RSPL+YQFSSLGSLDEKW+AEF  SLSSG + DKTPLG G+ LI+WPTVEDVRCSLEGYAAGNAIPSPLKNVEK FL+KYWA+WK+ HS R  AMPH
Subjt:  EFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH

Query:  IKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQ--KRHDCSFSCTKSGGSAQN-KSRPSENLEEKTELVTLAWQENRKRESL
        IKTF RYN QK+AW +LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLP   K   C FSCT+S  S    K    + +E++++LVT+ WQ +R    L
Subjt:  IKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQ--KRHDCSFSCTKSGGSAQN-KSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPR
         E+I LP+PY+LPP+PY P+DVPWSWDR Y++KDV+G VWPR
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPR

Q9NUW8 Tyrosyl-DNA phosphodiesterase 12.7e-7633.33Show/hide
Query:  SSEDLGQKRVRQVADDIFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRL-----------------PMTFRLLSVKGLPPWANTSCVRI
        S ++L  +  ++ A+ +  +K K  ++ +P      G +Q  E++   A    +  +D                   P  F L  V G+ P  N+  + I
Subjt:  SSEDLGQKRVRQVADDIFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRL-----------------PMTFRLLSVKGLPPWANTSCVRI

Query:  TDIVQ---GDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNM--KRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLI
         DI+    G ++ +   NY  D+DWL+   P   +   +L++HG+      ++  + K  +N  L +  L I+FGTHH+K + L+Y  G+R+V+HT+NLI
Subjt:  TDIVQ---GDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNM--KRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLI

Query:  YVDWNNKSQGLWMQ-------DFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGH
        + DW+ K+QG+W+        D   K   S T    F+ DL+ YL A   P        +           K D S   V LI S PG   G     WGH
Subjt:  YVDWNNKSQGLWMQ-------DFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGH

Query:  MKLRSVLQECVFDKEFQRS-PLVYQFSSLGSL---DEKWM-AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNVEK-GF
         +L+ +L++         S P+V QFSS+GSL   + KW+ +EF  S+ +     KTP     PL +++P+VE+VR SLEGY AG ++P  ++  EK  +
Subjt:  MKLRSVLQECVFDKEFQRS-PLVYQFSSLGSL---DEKWM-AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNVEK-GF

Query:  LRKYWAKWKSYHSGRCHAMPHIKTFARYNG--QKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENL
        L  Y+ KW +  SGR +AMPHIKT+ R +    K+AW ++TS+NLSKAAWGAL+KN +QLMIRSYELGVLFLP      SF   +   +           
Subjt:  LRKYWAKWKSYHSGRCHAMPHIKTFARYNG--QKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENL

Query:  EEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQ-KDVHGAVW
                          S   +   P+PY+LPP+ YG +D PW W+  Y +  D HG +W
Subjt:  EEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQ-KDVHGAVW

Q9TXV7 Probable tyrosyl-DNA phosphodiesterase4.4e-4233.16Show/hide
Query:  NYMVDIDWLIPAC-PALAKVPHVLVIHGESDGTVDNMKRKKPDNWI-LHKPPLPISFGTHHSKAIFLVYPRG-IRMVVHTANLIYVDWNNKSQGLWMQDF
        ++M+D ++LI +  P+L + P  LV+ G  D   D +K  K    + +    LPI FGTHH+K   L    G   ++V TANL+  DW  K+Q  +  +F
Subjt:  NYMVDIDWLIPAC-PALAKVPHVLVIHGESDGTVDNMKRKKPDNWI-LHKPPLPISFGTHHSKAIFLVYPRG-IRMVVHTANLIYVDWNNKSQGLWMQDF

Query:  PWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQE-CVFDKEF---QRS
          K  + +     F+DDL++YL+  +             L+      +K D+S  + RLI S PGYHT    ++ GH +L  +L E   FD  +   +R 
Subjt:  PWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQE-CVFDKEF---QRS

Query:  PLVYQFSSLGSLDE---KWM-AEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH
          V Q SS+GSL      W   +F  SL   + + K      +  +V+P+VEDVR S +GYA G ++P     +  + +L+    KW+S    R +A+PH
Subjt:  PLVYQFSSLGSLDE---KWM-AEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH

Query:  IKTFARYNGQKLAWLVLTSSNLSKAAWGAL----QKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAW
         KT+ +Y+ +   W +LTS+NLSKAAWG +     KN  QLMIRS+E+GVL     R +  F       SA ++   ++   EK +++   W
Subjt:  IKTFARYNGQKLAWLVLTSSNLSKAAWGAL----QKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAW

Arabidopsis top hitse value%identityAlignment
AT5G07400.1 forkhead-associated domain-containing protein / FHA domain-containing protein7.8e-1025.91Show/hide
Query:  ANTSCVRITDIVQ-----GDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKP------DNWILHKPPLP--ISFG---------
        ++T C R+  + +       I    L+ +  DI W +  C     +P  +  H        N   +         N  +  PP P  I+FG         
Subjt:  ANTSCVRITDIVQ-----GDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKP------DNWILHKPPLP--ISFG---------

Query:  THHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWK---DQNSSTRGCAFEDDLVDYLNALKWPEFSANFP--ALGNLNINPS------FFR
         HH K   L     IR+++ +ANL+   WN+ +  +W QDFP +   D  S    C  E +     + LK P+F A     A   L   PS       F 
Subjt:  THHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWK---DQNSSTRGCAFEDDLVDYLNALKWPEFSANFP--ALGNLNINPS------FFR

Query:  KFDYSNAAVRLIASVPGYHTGR--YLKKWGHMKLRSVLQECVFDKEF
        K+++ ++A  L+ASVPG H+ +  YL + G           +F +EF
Subjt:  KFDYSNAAVRLIASVPGYHTGR--YLKKWGHMKLRSVLQECVFDKEF

AT5G15170.1 tyrosyl-DNA phosphodiesterase-related2.9e-24363.86Show/hide
Query:  KVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCS-DG-RKKLGHRGSVVIRDGDVIELIPGH
        +V YL+PL  +L+EDNS  +I LSEGPNIIGR NV + DKR+SRKHIT+ +ST G A L V+G NPVVI  S DG RKK+     V + + D+IELIPGH
Subjt:  KVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCS-DG-RKKLGHRGSVVIRDGDVIELIPGH

Query:  YLFKYASHCFNTRPSSEDLGQKRVRQVADDIFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDI
        + FK      N R +      K+ R+  DD                              VEAIR F   +++LP TFRLLSV  LP WANTSCV I D+
Subjt:  YLFKYASHCFNTRPSSEDLGQKRVRQVADDIFERKAKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDI

Query:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
        ++GD++ A+LSNYMVDIDWL+ ACP LA +P V+VIHGE DG  + ++RKKP NWILHKP LPISFGTHHSKAIFLVYPRG+R+VVHTANLI+VDWNNKS
Subjt:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGESDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS

Query:  QGLWMQDFPWKDQNSS-TRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDK
        QGLWMQDFPWKD +    +GC FE DL+DYLN LKWPEF+AN P  GN+ IN +FF+KFDYS+A VRLIASVPGYHTG  L KWGHMKLR++LQEC+FD+
Subjt:  QGLWMQDFPWKDQNSS-TRGCAFEDDLVDYLNALKWPEFSANFPALGNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDK

Query:  EFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH
        EF+RSPL+YQFSSLGSLDEKW+AEF  SLSSG + DKTPLG G+ LI+WPTVEDVRCSLEGYAAGNAIPSPLKNVEK FL+KYWA+WK+ HS R  AMPH
Subjt:  EFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH

Query:  IKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQ--KRHDCSFSCTKSGGSAQN-KSRPSENLEEKTELVTLAWQENRKRESL
        IKTF RYN QK+AW +LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLP   K   C FSCT+S  S    K    + +E++++LVT+ WQ +R    L
Subjt:  IKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQ--KRHDCSFSCTKSGGSAQN-KSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPR
         E+I LP+PY+LPP+PY P+DVPWSWDR Y++KDV+G VWPR
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTTCAGTCTCTGCCCGCCGCTTCAGTACCACCACCAACAACCAACTCCGTCGTTCCCCGGCTTTCCTCGCCCCACAGCCCCCATTTTCATTGTCATCTCCA
CAGATTTTGCAGTATTCCTCTTCTCCGTTCAAACCTCTCTCCTTAAACCATCGCCATCCTCAGCTTAGTTCTCCATTGGCTTCATCAGGCCCTTCTCCTCTCTGC
TCAGCGATGGCTCGCCTCCAGAAGGTTGGGTATTTGGTTCCGCTTGATAAAAATCTGGAGGAAGACAATTCGGGGTTAAAGATACCTCTCTCTGAGGGCCCAAAT
ATCATCGGCCGTAGCAACGTTCTCGTTTCTGATAAGAGGATCAGCCGCAAGCACATCACTCTTACCATTTCCACTGACGGCCCTGCTAAACTACTTGTGGAAGGT
GCGAATCCAGTTGTTATTAATTGTAGTGATGGTAGAAAGAAACTTGGCCATCGTGGAAGTGTGGTAATTCGGGACGGGGATGTTATAGAGTTGATTCCAGGCCAT
TATCTTTTCAAGTATGCATCTCATTGTTTCAATACAAGGCCCAGTTCTGAGGATTTGGGACAGAAGAGAGTTAGACAAGTGGCGGACGATATTTTTGAGAGAAAG
GCTAAGAGGGTTGAAATGGGCAGCCCTTTGGAGAATCTACAATCTGGAATCTCTCAGCCTAGGGAAGATAATAGTGTGGAAGCTATTCGTAATTTTCGCGTTTCC
GACGACAGATTGCCAATGACTTTCAGACTTTTGAGCGTTAAAGGCCTGCCACCATGGGCTAATACTTCATGTGTGAGGATTACTGATATTGTACAGGGGGACATT
CTCTTTGCCGTCCTGTCAAATTACATGGTGGATATCGATTGGTTAATACCTGCATGTCCTGCTCTTGCAAAAGTTCCTCACGTGCTGGTTATTCATGGTGAAAGT
GATGGAACAGTGGATAATATGAAGAGGAAGAAGCCTGATAATTGGATTTTGCACAAACCACCACTACCCATATCTTTTGGGACTCACCATTCAAAAGCAATATTT
CTTGTCTATCCTAGAGGAATAAGAATGGTTGTACACACTGCAAATCTAATCTATGTTGATTGGAACAACAAAAGCCAAGGTTTATGGATGCAAGATTTCCCCTGG
AAAGATCAAAATTCCTCTACAAGAGGATGCGCATTTGAAGATGACTTGGTTGACTATCTTAATGCTTTGAAGTGGCCAGAGTTTTCTGCTAATTTTCCTGCACTT
GGAAACCTCAACATCAATCCATCTTTTTTCAGAAAGTTTGATTATAGCAACGCAGCGGTTAGATTGATTGCTTCTGTGCCTGGATATCATACGGGTCGCTATTTA
AAGAAGTGGGGCCATATGAAGCTACGTTCTGTTCTCCAGGAGTGTGTTTTTGATAAAGAGTTTCAGAGATCTCCTCTTGTATACCAGTTCTCTTCCCTTGGATCG
CTGGATGAGAAATGGATGGCTGAGTTTGCAGCATCACTGTCATCAGGTTCCTCTGCTGATAAAACACCTCTCGGTCTCGGGGAACCACTAATAGTATGGCCTACC
GTGGAAGACGTCAGATGTTCTCTGGAGGGTTATGCTGCTGGAAATGCCATTCCTAGTCCGTTAAAGAATGTGGAGAAGGGATTTTTAAGAAAGTATTGGGCAAAA
TGGAAATCATACCATAGTGGCCGATGTCATGCAATGCCACACATAAAGACATTTGCTCGTTATAATGGTCAGAAACTTGCTTGGTTAGTGCTAACGTCATCCAAC
CTCAGCAAAGCTGCCTGGGGAGCACTTCAGAAGAACAATTCTCAGCTAATGATTCGTTCTTACGAGCTTGGGGTGCTCTTTCTTCCTCAAAAAAGACATGATTGC
AGTTTTTCTTGTACCAAGAGTGGAGGTTCAGCACAGAACAAGTCAAGACCATCAGAGAACTTGGAAGAAAAAACAGAACTGGTGACATTGGCTTGGCAAGAAAAC
AGGAAAAGAGAATCATTGTCTGAAGTAATTCAATTGCCTATACCTTATGAACTCCCTCCTCAGCCATATGGCCCTCAAGATGTACCCTGGTCTTGGGACCGTCGC
TATACCCAAAAAGATGTTCACGGTGCGGTTTGGCCACGTCAAGTTCAGCTTTATGCTTCCTAG
mRNA sequenceShow/hide mRNA sequence
CGCAGAAGCAAAGTCTTAAAATTTCCCAATTCATTCAGTTTTCTTTTTCCTCTGTTTCTCATGGACGCTTCTTCCCGGCTTCTTTCCTCCGCGCTCAAATAGAAG
AATCATCTTCATCTCCAAGTTTCACAACGCAGTCAAAATCAAAATCAATCAGATCTCGGATCTTTTGTGTGAGAAAGAATTCTCTAATTCGTATTCGTAATCTAG
TGATTTTATAGCAGTCTGCAAGGAGAGAAAGAAAGACGAAGAAGGATCTTTGATTTCTGAGTTCTCCAACTTCTTCCTATAAATTCAAACTCATTTTCCTTCATT
TTCTTTCTTCAATGGAATTGGCGGACAATCCAAAGTCGGTAATTACTTCTGGCGAAGTGAATAATGGGTTAAAAGAAAAAGCAGTGATTTCCGATCAATCAAACA
GACTTCAGTACAAAACAAAACCCGATAGCTTCATCATCGATATGGACAATTTTTCCAACAAGGAAATTTCTCACAATTCAAGAATTACAAAGAATTTTAGCAGAA
AAGGAATGTTACGTGGCGGAAACAAGATGGGGCAAGATCATGGTGACGGTGACGATGGGGCAGGAGAGTCCATCTCCCCAATAGGAGGAGGGTCGAGCGTGCTCG
AAAAGCAGGTGGCGGTGGGCATAGAAGAATTGGGCGGTGGCGCACTCAGCCACGAGATAGCGATTAGTAATGCCCACGGCGGCGGCGGCGAAAGGCTGAGCTTTA
GAAGAAACAGCTTCAAGCGGTCTCCACAACCTTGCAGCTGGTACTTGGACCCCAGGAAGATTTTTTTGTTCTGCGCCACGCTGTCGTGCGTGGGAACAATGTTGC
TGATATACTTGGCATTTTCATCTGGGATTCTCAAAGTAGAAGAAAGCGAGTTGGATTCGTAGAGCTCAACAGTCCCAAACCGTTAGGGAGAGAACAGACCCAGGC
CCAAGACGGTGCGTTCCATCAGAGTCTGACCACCGTGGCATGTGTTCAGTCTCTGCCCGCCGCTTCAGTACCACCACCAACAACCAACTCCGTCGTTCCCCGGCT
TTCCTCGCCCCACAGCCCCCATTTTCATTGTCATCTCCACAGATTTTGCAGTATTCCTCTTCTCCGTTCAAACCTCTCTCCTTAAACCATCGCCATCCTCAGCTT
AGTTCTCCATTGGCTTCATCAGGCCCTTCTCCTCTCTGCTCAGCGATGGCTCGCCTCCAGAAGGTTGGGTATTTGGTTCCGCTTGATAAAAATCTGGAGGAAGAC
AATTCGGGGTTAAAGATACCTCTCTCTGAGGGCCCAAATATCATCGGCCGTAGCAACGTTCTCGTTTCTGATAAGAGGATCAGCCGCAAGCACATCACTCTTACC
ATTTCCACTGACGGCCCTGCTAAACTACTTGTGGAAGGTGCGAATCCAGTTGTTATTAATTGTAGTGATGGTAGAAAGAAACTTGGCCATCGTGGAAGTGTGGTA
ATTCGGGACGGGGATGTTATAGAGTTGATTCCAGGCCATTATCTTTTCAAGTATGCATCTCATTGTTTCAATACAAGGCCCAGTTCTGAGGATTTGGGACAGAAG
AGAGTTAGACAAGTGGCGGACGATATTTTTGAGAGAAAGGCTAAGAGGGTTGAAATGGGCAGCCCTTTGGAGAATCTACAATCTGGAATCTCTCAGCCTAGGGAA
GATAATAGTGTGGAAGCTATTCGTAATTTTCGCGTTTCCGACGACAGATTGCCAATGACTTTCAGACTTTTGAGCGTTAAAGGCCTGCCACCATGGGCTAATACT
TCATGTGTGAGGATTACTGATATTGTACAGGGGGACATTCTCTTTGCCGTCCTGTCAAATTACATGGTGGATATCGATTGGTTAATACCTGCATGTCCTGCTCTT
GCAAAAGTTCCTCACGTGCTGGTTATTCATGGTGAAAGTGATGGAACAGTGGATAATATGAAGAGGAAGAAGCCTGATAATTGGATTTTGCACAAACCACCACTA
CCCATATCTTTTGGGACTCACCATTCAAAAGCAATATTTCTTGTCTATCCTAGAGGAATAAGAATGGTTGTACACACTGCAAATCTAATCTATGTTGATTGGAAC
AACAAAAGCCAAGGTTTATGGATGCAAGATTTCCCCTGGAAAGATCAAAATTCCTCTACAAGAGGATGCGCATTTGAAGATGACTTGGTTGACTATCTTAATGCT
TTGAAGTGGCCAGAGTTTTCTGCTAATTTTCCTGCACTTGGAAACCTCAACATCAATCCATCTTTTTTCAGAAAGTTTGATTATAGCAACGCAGCGGTTAGATTG
ATTGCTTCTGTGCCTGGATATCATACGGGTCGCTATTTAAAGAAGTGGGGCCATATGAAGCTACGTTCTGTTCTCCAGGAGTGTGTTTTTGATAAAGAGTTTCAG
AGATCTCCTCTTGTATACCAGTTCTCTTCCCTTGGATCGCTGGATGAGAAATGGATGGCTGAGTTTGCAGCATCACTGTCATCAGGTTCCTCTGCTGATAAAACA
CCTCTCGGTCTCGGGGAACCACTAATAGTATGGCCTACCGTGGAAGACGTCAGATGTTCTCTGGAGGGTTATGCTGCTGGAAATGCCATTCCTAGTCCGTTAAAG
AATGTGGAGAAGGGATTTTTAAGAAAGTATTGGGCAAAATGGAAATCATACCATAGTGGCCGATGTCATGCAATGCCACACATAAAGACATTTGCTCGTTATAAT
GGTCAGAAACTTGCTTGGTTAGTGCTAACGTCATCCAACCTCAGCAAAGCTGCCTGGGGAGCACTTCAGAAGAACAATTCTCAGCTAATGATTCGTTCTTACGAG
CTTGGGGTGCTCTTTCTTCCTCAAAAAAGACATGATTGCAGTTTTTCTTGTACCAAGAGTGGAGGTTCAGCACAGAACAAGTCAAGACCATCAGAGAACTTGGAA
GAAAAAACAGAACTGGTGACATTGGCTTGGCAAGAAAACAGGAAAAGAGAATCATTGTCTGAAGTAATTCAATTGCCTATACCTTATGAACTCCCTCCTCAGCCA
TATGGCCCTCAAGATGTACCCTGGTCTTGGGACCGTCGCTATACCCAAAAAGATGTTCACGGTGCGGTTTGGCCACGTCAAGTTCAGCTTTATGCTTCCTAGGAT
TCTTGATATCCTCTTTCCCCTCCAAGTACTTATTAAACTTAGGGTTCATTTGGACGGTTCCTTTTCATCTTACTATATCCCTCTGGATTGGAAGTACTAATTTGG
GTCACTCTTTTCTAGGAAAAGGTGGGAAAGTTGGAAGCAGGAAGAGTTGTTAATTAGACAAAAGACATTCGGCTCCATGATATTCTTGTATCAACTCATATCCTA
GGGAACAAAATGTTTTTGAATTATTCTTCATATTAAAGAATATTTTGTCTATTAGGAGTCTTTAAGGAAGTAGTAGTAACAGTAAGGCATACTTTGCTATTGACC
AAAATGTCATAAATTTCAAGAATCGCTACGCTCACGTGGTTGAACTAAAAAACATGTATATTCAAGAGCAGATTGATGGAAACTTAACACATAAATCACATCAAT
AAAATCACCGCCGTGGAAATGTGAGAAGCAATCTCATTAATAGGCACTCTTTGACTATTAGTAGACATAAAATCATTAGTTATTGGATTAGCAGATGTAATCTTT
TTGAGTATTTATACCTCAGGCCGAGGTGATGGCGTGATGCAGTAATTGAAGTTCAAGTTTTTCTTTCCATTGATCAATTAATTAAGAGATGTAAACGGAT
Protein sequenceShow/hide protein sequence
MCSVSARRFSTTTNNQLRRSPAFLAPQPPFSLSSPQILQYSSSPFKPLSLNHRHPQLSSPLASSGPSPLCSAMARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPN
IIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGANPVVINCSDGRKKLGHRGSVVIRDGDVIELIPGHYLFKYASHCFNTRPSSEDLGQKRVRQVADDIFERK
AKRVEMGSPLENLQSGISQPREDNSVEAIRNFRVSDDRLPMTFRLLSVKGLPPWANTSCVRITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGES
DGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPAL
GNLNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPT
VEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDC
SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS