; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G22690 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G22690
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptiontyrosyl-DNA phosphodiesterase 1
Genome locationClcChr01:33734765..33741732
RNA-Seq ExpressionClc01G22690
SyntenyClc01G22690
Gene Ontology termsGO:0000012 - single strand break repair (biological process)
GO:0006302 - double-strand break repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003690 - double-stranded DNA binding (molecular function)
GO:0003697 - single-stranded DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0017005 - 3'-tyrosyl-DNA phosphodiesterase activity (molecular function)
InterPro domainsIPR000253 - Forkhead-associated (FHA) domain
IPR008984 - SMAD/FHA domain superfamily
IPR010347 - Tyrosyl-DNA phosphodiesterase I


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN49134.1 hypothetical protein Csa_003275 [Cucumis sativus]0.0e+0087.48Show/hide
Query:  MCSVSARRFSTTT---NNQLRRSSAFLAPQPPFSLSSPQILQYSSSPFKPLSLNHRHRQLSSPLASSGPSP------LCSAMARLQKVGYLVPLDKNLEE
        MCS   RRFST +   +N+LRRSSAFLA +PP+S S P+IL YSSSPFK LSL +R RQL SPL SS PSP      LC  MARLQ VGYLVPLDKNLE 
Subjt:  MCSVSARRFSTTT---NNQLRRSSAFLAPQPPFSLSSPQILQYSSSPFKPLSLNHRHRQLSSPLASSGPSP------LCSAMARLQKVGYLVPLDKNLEE

Query:  DNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELIPGHYLFKYASHCFNTRPSS
        DNSGLKI LSEGPN IGRSNVLVS+KRISRKHITLT STDG AKLLVEGTNPVVINSGDGRKKLG R SV+IRDGDVIELIPGHY FKYASHCFN+RP S
Subjt:  DNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELIPGHYLFKYASHCFNTRPSS

Query:  EDLGQKRVRQVA-DDISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDIVQGDILFAVLSNYMV
        EDLGQKRVRQVA D ISER AKR EMGSPLEN+QSG SQS+E NSVEAIRNF + DDRLP+TFRLLSVKGLPPWANTSCVRITDI+QGDILFAVLSNYMV
Subjt:  EDLGQKRVRQVA-DDISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDIVQGDILFAVLSNYMV

Query:  DIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS
        DIDWLIPACPALAKVP VLVIHGEGDGT+DNMKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS
Subjt:  DIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS

Query:  STRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKEFQRSPLVYQFSSLGS
        S+RGCAFEDDLVDYL+ALKWPEF A+FP  GNFNINP FFRKFDYS AAVRLIASVPGYHTGRYLKKWGHMKLRSVLQEC+FDKEFQRSPLVYQFSSLGS
Subjt:  STRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKEFQRSPLVYQFSSLGS

Query:  LDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHIKTFARYNGQKLAWLV
        L+EKWMAEFAASLSSG + DKTPLGLGEPLIVWPTVEDVRCSLEGYAAG+A+PSPLKNVEKGFL KYWAKW S+HSGRCHAMPHIKTFARYNGQKLAWLV
Subjt:  LDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHIKTFARYNGQKLAWLV

Query:  LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNK---SRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQP
        LTSSNLS+AAWGALQKNNSQLMIRSYELGVLFLPQKR+D SFSCTK+GGSAQNK   SRPSE LE KTELVTLAWQEN+KRESLSEVIQLPIPYELPPQP
Subjt:  LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNK---SRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQP

Query:  YGPQDVPWSWDRRYTQKDVHGAVWPRQ
        YGP+DVPWSW+RRYTQKDVHGAVWPRQ
Subjt:  YGPQDVPWSWDRRYTQKDVHGAVWPRQ

XP_022978031.1 tyrosyl-DNA phosphodiesterase 1 isoform X2 [Cucurbita maxima]0.0e+0092.45Show/hide
Query:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELI
        MARLQ VGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKL VEG NPVVINSGDGRKKLG R SV++RDG+VIELI
Subjt:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELI

Query:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVR
        PGHYLFKYA+HC NTRP SEDLGQKRVRQ+ DD ISE KAKRVEMG PLEN Q+GISQS++DNSVEAIRNF V DD+LP+TFRLLSVKGLPPWANTSCVR
Subjt:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVR

Query:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
        I+D++QGDILFAVLSNYMVD+DWLIPACP LAKVPHVLVIHGEGDGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
Subjt:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW

Query:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV
        NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGNFN+NPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLRSVLQECV
Subjt:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV

Query:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
        FDKEF+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNV+KGFLRKYWAKWKSYHSGRCHA
Subjt:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA

Query:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
        MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQEN+KRESL
Subjt:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        SEVIQLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQV LYAS
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

XP_023544483.1 tyrosyl-DNA phosphodiesterase 1 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0093.02Show/hide
Query:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELIPGHY
        +KVGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKLLVEG NPVVINSGDGRKKLG R SV++RDG+VIELIPGHY
Subjt:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELIPGHY

Query:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDI
        LFKYA+HC NTRP SEDLGQKRVR++ DD ISERKAKRVEMG PLEN Q+GISQS++DNSVEAIRNF V DD+LP+TFRLLSVKGLPPWANTSCVRI+D+
Subjt:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDI

Query:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
        +QGDILFAVLSNYMVDIDWLIPACP LAKVPHVLV HGEGDGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIR+VVHTANLIYVDWNNKS
Subjt:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS

Query:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE
        QGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLRSVLQECVFDKE
Subjt:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE

Query:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI
        F+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI
Subjt:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI

Query:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI
        KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI
Subjt:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI

Query:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        QLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQVQLYAS
Subjt:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

XP_023544484.1 tyrosyl-DNA phosphodiesterase 1 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0092.91Show/hide
Query:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELI
        MAR Q VGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKLLVEG NPVVINSGDGRKKLG R SV++RDG+VIELI
Subjt:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELI

Query:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVR
        PGHYLFKYA+HC NTRP SEDLGQKRVR++ DD ISERKAKRVEMG PLEN Q+GISQS++DNSVEAIRNF V DD+LP+TFRLLSVKGLPPWANTSCVR
Subjt:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVR

Query:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
        I+D++QGDILFAVLSNYMVDIDWLIPACP LAKVPHVLV HGEGDGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIR+VVHTANLIYVDW
Subjt:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW

Query:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV
        NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLRSVLQECV
Subjt:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV

Query:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
        FDKEF+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
Subjt:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA

Query:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
        MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
Subjt:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        SEVIQLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQVQLYAS
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

XP_038881476.1 tyrosyl-DNA phosphodiesterase 1 isoform X1 [Benincasa hispida]0.0e+0094.76Show/hide
Query:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELI
        MARLQ VGYLVPLDKNLEEDN GLKIPLSEGPN +GRSNVLVSD+RISRKHITLT STDG AKLLVEGTNPVVINSGDGRKKLG R SVVIRDGDVIELI
Subjt:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELI

Query:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVR
        PGHYLFKYASHCF+TRPS EDLGQKRVRQVADD ISERKAKRVEM SP ENLQSG SQS+EDNSVEAIRNF + DDRLP+TFRLLSVKGLPPWANTSCVR
Subjt:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVR

Query:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
        ITDI+QGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGT+DNMKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
Subjt:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW

Query:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV
        NNKSQGLWMQDFPWKDQNSSTRGC FEDDLVDYL+ALKWPEF ANFPA GNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQEC+
Subjt:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV

Query:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
        FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFL+KYWAKWKSYHSGRCHA
Subjt:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA

Query:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
        MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHD SFSCTKSGGSA NK RPSENLEEKTELVTLAWQENRK+ESL
Subjt:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        SEVIQLPIPY+LPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

TrEMBL top hitse value%identityAlignment
A0A0A0KJY5 FHA domain-containing protein0.0e+0087.48Show/hide
Query:  MCSVSARRFSTTT---NNQLRRSSAFLAPQPPFSLSSPQILQYSSSPFKPLSLNHRHRQLSSPLASSGPSP------LCSAMARLQKVGYLVPLDKNLEE
        MCS   RRFST +   +N+LRRSSAFLA +PP+S S P+IL YSSSPFK LSL +R RQL SPL SS PSP      LC  MARLQ VGYLVPLDKNLE 
Subjt:  MCSVSARRFSTTT---NNQLRRSSAFLAPQPPFSLSSPQILQYSSSPFKPLSLNHRHRQLSSPLASSGPSP------LCSAMARLQKVGYLVPLDKNLEE

Query:  DNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELIPGHYLFKYASHCFNTRPSS
        DNSGLKI LSEGPN IGRSNVLVS+KRISRKHITLT STDG AKLLVEGTNPVVINSGDGRKKLG R SV+IRDGDVIELIPGHY FKYASHCFN+RP S
Subjt:  DNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELIPGHYLFKYASHCFNTRPSS

Query:  EDLGQKRVRQVA-DDISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDIVQGDILFAVLSNYMV
        EDLGQKRVRQVA D ISER AKR EMGSPLEN+QSG SQS+E NSVEAIRNF + DDRLP+TFRLLSVKGLPPWANTSCVRITDI+QGDILFAVLSNYMV
Subjt:  EDLGQKRVRQVA-DDISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDIVQGDILFAVLSNYMV

Query:  DIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS
        DIDWLIPACPALAKVP VLVIHGEGDGT+DNMKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS
Subjt:  DIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNS

Query:  STRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKEFQRSPLVYQFSSLGS
        S+RGCAFEDDLVDYL+ALKWPEF A+FP  GNFNINP FFRKFDYS AAVRLIASVPGYHTGRYLKKWGHMKLRSVLQEC+FDKEFQRSPLVYQFSSLGS
Subjt:  STRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKEFQRSPLVYQFSSLGS

Query:  LDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHIKTFARYNGQKLAWLV
        L+EKWMAEFAASLSSG + DKTPLGLGEPLIVWPTVEDVRCSLEGYAAG+A+PSPLKNVEKGFL KYWAKW S+HSGRCHAMPHIKTFARYNGQKLAWLV
Subjt:  LDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHIKTFARYNGQKLAWLV

Query:  LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNK---SRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQP
        LTSSNLS+AAWGALQKNNSQLMIRSYELGVLFLPQKR+D SFSCTK+GGSAQNK   SRPSE LE KTELVTLAWQEN+KRESLSEVIQLPIPYELPPQP
Subjt:  LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNK---SRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQP

Query:  YGPQDVPWSWDRRYTQKDVHGAVWPRQ
        YGP+DVPWSW+RRYTQKDVHGAVWPRQ
Subjt:  YGPQDVPWSWDRRYTQKDVHGAVWPRQ

A0A6J1GE46 tyrosyl-DNA phosphodiesterase 1 isoform X20.0e+0092.6Show/hide
Query:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELI
        MARLQ VGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKLLVEG NPVVINSGDGRKKLG R SV++RDG+VIELI
Subjt:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELI

Query:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVR
        PGHYLFKYA+HC NTRP SEDLGQKRVR++ DD ISERKAKRVEMG  LEN Q+GISQS++DNSVEAIRNF V DD+LP+TFRLLSVKGLPPWANTSCVR
Subjt:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVR

Query:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
        I+D++QGDILFAVLSNYMVDIDWLIPACP LAKVPHVLV HGEGDGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIR+VVHTANLIYVDW
Subjt:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW

Query:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV
        NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLR VLQECV
Subjt:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV

Query:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
        FDKEF+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
Subjt:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA

Query:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
        MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRK ESL
Subjt:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        SEVIQLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQVQLYAS
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

A0A6J1GF28 tyrosyl-DNA phosphodiesterase 1 isoform X10.0e+0092.56Show/hide
Query:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELIPGHY
        +KVGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKLLVEG NPVVINSGDGRKKLG R SV++RDG+VIELIPGHY
Subjt:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELIPGHY

Query:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDI
        LFKYA+HC NTRP SEDLGQKRVR++ DD ISERKAKRVEMG  LEN Q+GISQS++DNSVEAIRNF V DD+LP+TFRLLSVKGLPPWANTSCVRI+D+
Subjt:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDI

Query:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
        +QGDILFAVLSNYMVDIDWLIPACP LAKVPHVLV HGEGDGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIR+VVHTANLIYVDWNNKS
Subjt:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS

Query:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE
        QGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLR VLQECVFDKE
Subjt:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE

Query:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI
        F+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI
Subjt:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI

Query:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI
        KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRK ESLSEVI
Subjt:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI

Query:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        QLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQVQLYAS
Subjt:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

A0A6J1ILM7 tyrosyl-DNA phosphodiesterase 1 isoform X20.0e+0092.45Show/hide
Query:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELI
        MARLQ VGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKL VEG NPVVINSGDGRKKLG R SV++RDG+VIELI
Subjt:  MARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELI

Query:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVR
        PGHYLFKYA+HC NTRP SEDLGQKRVRQ+ DD ISE KAKRVEMG PLEN Q+GISQS++DNSVEAIRNF V DD+LP+TFRLLSVKGLPPWANTSCVR
Subjt:  PGHYLFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVR

Query:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
        I+D++QGDILFAVLSNYMVD+DWLIPACP LAKVPHVLVIHGEGDGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW
Subjt:  ITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDW

Query:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV
        NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGNFN+NPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLRSVLQECV
Subjt:  NNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECV

Query:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA
        FDKEF+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNV+KGFLRKYWAKWKSYHSGRCHA
Subjt:  FDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHA

Query:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL
        MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQEN+KRESL
Subjt:  MPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        SEVIQLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQV LYAS
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

A0A6J1INX7 tyrosyl-DNA phosphodiesterase 1 isoform X10.0e+0092.4Show/hide
Query:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELIPGHY
        +KVGYLVPLDKNLEE NS LKIPLS+GPNIIGRSNVLVSDKRISRKHITLT STDG AKL VEG NPVVINSGDGRKKLG R SV++RDG+VIELIPGHY
Subjt:  QKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELIPGHY

Query:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDI
        LFKYA+HC NTRP SEDLGQKRVRQ+ DD ISE KAKRVEMG PLEN Q+GISQS++DNSVEAIRNF V DD+LP+TFRLLSVKGLPPWANTSCVRI+D+
Subjt:  LFKYASHCFNTRPSSEDLGQKRVRQVADD-ISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDI

Query:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
        +QGDILFAVLSNYMVD+DWLIPACP LAKVPHVLVIHGEGDGT+D+MKRKKP NWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
Subjt:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS

Query:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE
        QGLWMQDFPWKDQNSSTRGCAFEDDLVDYL+ALKWPEF ANFPALGNFN+NPSFFRKFDYSNAAVRLIASVPGYHTGR+LKKWGHMKLRSVLQECVFDKE
Subjt:  QGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKE

Query:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI
        F+RSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNV+KGFLRKYWAKWKSYHSGRCHAMPHI
Subjt:  FQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPHI

Query:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI
        KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKR D SFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQEN+KRESLSEVI
Subjt:  KTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVI

Query:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS
        QLP+PYELPPQPYGPQDVPWSWDRRYTQKDV GAVWPRQV LYAS
Subjt:  QLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS

SwissProt top hitse value%identityAlignment
Q4G056 Tyrosyl-DNA phosphodiesterase 11.0e-7535.78Show/hide
Query:  RQVADDISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDIVQ---GDILFAVLSNYMVDIDWLI
        ++ A   +   A++VE  SP ++ ++  +    + S E    + + D   P  F L  V G+    N+  + I DI+    G ++ +   NY  D++WLI
Subjt:  RQVADDISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDIVQ---GDILFAVLSNYMVDIDWLI

Query:  PACPALAKVPHVLVIHGE-GDGTVDNMKRKKP-DNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQD-FPWKDQNSSTR
           P   +   +L++HG+  +   D   + KP  N  L +  L I+FGTHH+K + L+Y  G+R+V+HT+NLI  DW+ K+QG+W+   +P   Q + T 
Subjt:  PACPALAKVPHVLVIHGE-GDGTVDNMKRKKP-DNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQD-FPWKDQNSSTR

Query:  G---CAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQ-ECVFDKEFQRSPLVYQFSSLG
        G     F+ DL  YL A        N P L  +       ++ D S   V LI S PG   G +   WGH +LR +LQ         +  P+V QFSS+G
Subjt:  G---CAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQ-ECVFDKEFQRSPLVYQFSSLG

Query:  SL---DEKWM-AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNVEK-GFLRKYWAKWKSYHSGRCHAMPHIKTFARYNG
        SL   + KW+ +EF  SL +     +TP     PL +++P+VE+VR SLEGY AG ++P  ++  EK  +L  Y+ KW +  SGR +AMPHIKT+ R + 
Subjt:  SL---DEKWM-AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNVEK-GFLRKYWAKWKSYHSGRCHAMPHIKTFARYNG

Query:  --QKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPY
           KLAW ++TS+NLSKAAWGAL+KN +QLMIRSYELGVLFLP      +F                        L T   ++     S   +   P+PY
Subjt:  --QKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPY

Query:  ELPPQPYGPQDVPWSWDRRYTQ-KDVHGAVW
        +LPP+ YG +D PW W+  Y +  D HG +W
Subjt:  ELPPQPYGPQDVPWSWDRRYTQ-KDVHGAVW

Q8BJ37 Tyrosyl-DNA phosphodiesterase 16.1e-7636.54Show/hide
Query:  AKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDIVQ---GDILFAVLSNYMVDIDWLIPACPALAKVPH
        A++V   SP  +L+   +    + S E    + + D   P  F L  V G+    N+  + I DI+    G ++ +   NY  D+DWLI   P   +   
Subjt:  AKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDIVQ---GDILFAVLSNYMVDIDWLIPACPALAKVPH

Query:  VLVIHGE-GDGTVDNMKRKKP-DNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQD-FPWKDQNSSTRG---CAFEDDL
        +L++HG+  +   D   + KP  N  L +  L I+FGTHH+K + L+Y  G+R+V+HT+NLI  DW+ K+QG+W+   +P  DQ S T G     F+ DL
Subjt:  VLVIHGE-GDGTVDNMKRKKP-DNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQD-FPWKDQNSSTRG---CAFEDDL

Query:  VDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQ-ECVFDKEFQRSPLVYQFSSLGSL---DEKWM-
          YL A        N P L  +       ++ D S   V LI S PG   G +   WGH +LR +LQ       + +  P+V QFSS+GSL   + KW+ 
Subjt:  VDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQ-ECVFDKEFQRSPLVYQFSSLGSL---DEKWM-

Query:  AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNVEK-GFLRKYWAKWKSYHSGRCHAMPHIKTFARYNG--QKLAWLVLT
        +EF  SL +     + P     PL +++P+VE+VR SLEGY AG ++P  ++  EK  +L  Y+ KW +  SGR +AMPHIKT+ R +    KLAW ++T
Subjt:  AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNVEK-GFLRKYWAKWKSYHSGRCHAMPHIKTFARYNG--QKLAWLVLT

Query:  SSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQPYGPQD
        S+NLSKAAWGAL+KN +QLMIRSYELGVLFLP      +F                        L T   ++     S       P+PY+LPP+ Y  +D
Subjt:  SSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQPYGPQD

Query:  VPWSWDRRYTQ-KDVHGAVW
         PW W+  Y +  D HG +W
Subjt:  VPWSWDRRYTQ-KDVHGAVW

Q8H1D9 Tyrosyl-DNA phosphodiesterase 11.3e-24364.33Show/hide
Query:  KVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVI-NSGDG-RKKLGHRGSVVIRDGDVIELIPGH
        +V YL+PL  +L+EDNS  +I LSEGPNIIGR NV + DKR+SRKHIT+ +ST G A L V+GTNPVVI +SGDG RKK+     V + + D+IELIPGH
Subjt:  KVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVI-NSGDG-RKKLGHRGSVVIRDGDVIELIPGH

Query:  YLFKYASHCFNTRPSSEDLGQKRVRQVADDISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDI
        + FK      N R +      K+ R+  DD                              VEAIR F   +++LP TFRLLSV  LP WANTSCV I D+
Subjt:  YLFKYASHCFNTRPSSEDLGQKRVRQVADDISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDI

Query:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
        ++GD++ A+LSNYMVDIDWL+ ACP LA +P V+VIHGEGDG  + ++RKKP NWILHKP LPISFGTHHSKAIFLVYPRG+R+VVHTANLI+VDWNNKS
Subjt:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS

Query:  QGLWMQDFPWKDQNSS-TRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDK
        QGLWMQDFPWKD +    +GC FE DL+DYLN LKWPEF+AN P  GN  IN +FF+KFDYS+A VRLIASVPGYHTG  L KWGHMKLR++LQEC+FD+
Subjt:  QGLWMQDFPWKDQNSS-TRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDK

Query:  EFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH
        EF+RSPL+YQFSSLGSLDEKW+AEF  SLSSG + DKTPLG G+ LI+WPTVEDVRCSLEGYAAGNAIPSPLKNVEK FL+KYWA+WK+ HS R  AMPH
Subjt:  EFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH

Query:  IKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQ--KRHDCSFSCTKSGGSAQN-KSRPSENLEEKTELVTLAWQENRKRESL
        IKTF RYN QK+AW +LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLP   K   C FSCT+S  S    K    + +E++++LVT+ WQ +R    L
Subjt:  IKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQ--KRHDCSFSCTKSGGSAQN-KSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPR
         E+I LP+PY+LPP+PY P+DVPWSWDR Y++KDV+G VWPR
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPR

Q9NUW8 Tyrosyl-DNA phosphodiesterase 11.8e-7533.57Show/hide
Query:  SSEDLG---QKRVRQVADDISERKAKRV--EMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRL-----------------PLTFRLLSVKGLPPWANT
        S EDLG        ++  ++ +++A++V  +    +     G +Q  E++   A    +  +D                   P  F L  V G+ P  N+
Subjt:  SSEDLG---QKRVRQVADDISERKAKRV--EMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRL-----------------PLTFRLLSVKGLPPWANT

Query:  SCVRITDIVQ---GDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNM--KRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVH
          + I DI+    G ++ +   NY  D+DWL+   P   +   +L++HG+      ++  + K  +N  L +  L I+FGTHH+K + L+Y  G+R+V+H
Subjt:  SCVRITDIVQ---GDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNM--KRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVH

Query:  TANLIYVDWNNKSQGLWMQ-------DFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYL
        T+NLI+ DW+ K+QG+W+        D   K   S T    F+ DL+ YL A        N P+L  +        K D S   V LI S PG   G   
Subjt:  TANLIYVDWNNKSQGLWMQ-------DFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYL

Query:  KKWGHMKLRSVLQECVFDKEFQRS-PLVYQFSSLGSL---DEKWM-AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNV
          WGH +L+ +L++         S P+V QFSS+GSL   + KW+ +EF  S+ +     KTP     PL +++P+VE+VR SLEGY AG ++P  ++  
Subjt:  KKWGHMKLRSVLQECVFDKEFQRS-PLVYQFSSLGSL---DEKWM-AEFAASLSSGSSADKTPLGLGEPL-IVWPTVEDVRCSLEGYAAGNAIPSPLKNV

Query:  EK-GFLRKYWAKWKSYHSGRCHAMPHIKTFARYNG--QKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSR
        EK  +L  Y+ KW +  SGR +AMPHIKT+ R +    K+AW ++TS+NLSKAAWGAL+KN +QLMIRSYELGVLFLP      SF   +   +      
Subjt:  EK-GFLRKYWAKWKSYHSGRCHAMPHIKTFARYNG--QKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSR

Query:  PSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQ-KDVHGAVW
                               S   +   P+PY+LPP+ YG +D PW W+  Y +  D HG +W
Subjt:  PSENLEEKTELVTLAWQENRKRESLSEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQ-KDVHGAVW

Q9TXV7 Probable tyrosyl-DNA phosphodiesterase1.7e-4132.91Show/hide
Query:  NYMVDIDWLIPAC-PALAKVPHVLVIHGEGDGTVDNMKRKKPDNWI-LHKPPLPISFGTHHSKAIFLVYPRG-IRMVVHTANLIYVDWNNKSQGLWMQDF
        ++M+D ++LI +  P+L + P  LV+ G  D   D +K  K    + +    LPI FGTHH+K   L    G   ++V TANL+  DW  K+Q  +  +F
Subjt:  NYMVDIDWLIPAC-PALAKVPHVLVIHGEGDGTVDNMKRKKPDNWI-LHKPPLPISFGTHHSKAIFLVYPRG-IRMVVHTANLIYVDWNNKSQGLWMQDF

Query:  PWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQE-CVFDKEF---QRS
          K  + +     F+DDL++YL+  +              +      +K D+S  + RLI S PGYHT    ++ GH +L  +L E   FD  +   +R 
Subjt:  PWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQE-CVFDKEF---QRS

Query:  PLVYQFSSLGSLDE---KWM-AEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH
          V Q SS+GSL      W   +F  SL   + + K      +  +V+P+VEDVR S +GYA G ++P     +  + +L+    KW+S    R +A+PH
Subjt:  PLVYQFSSLGSLDE---KWM-AEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIP-SPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH

Query:  IKTFARYNGQKLAWLVLTSSNLSKAAWGAL----QKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAW
         KT+ +Y+ +   W +LTS+NLSKAAWG +     KN  QLMIRS+E+GVL     R +  F       SA ++   ++   EK +++   W
Subjt:  IKTFARYNGQKLAWLVLTSSNLSKAAWGAL----QKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTLAW

Arabidopsis top hitse value%identityAlignment
AT5G07400.1 forkhead-associated domain-containing protein / FHA domain-containing protein2.9e-0925.1Show/hide
Query:  ANTSCVRITDIVQ-----GDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKP------DNWILHKPPLP--ISFG---------
        ++T C R+  + +       I    L+ +  DI W +  C     +P  +  H        N   +         N  +  PP P  I+FG         
Subjt:  ANTSCVRITDIVQ-----GDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKP------DNWILHKPPLP--ISFG---------

Query:  THHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWK---DQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNIN--PS------FFR
         HH K   L     IR+++ +ANL+   WN+ +  +W QDFP +   D  S    C  E +     + LK P+F A         +   PS       F 
Subjt:  THHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWK---DQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNIN--PS------FFR

Query:  KFDYSNAAVRLIASVPGYHTGR--YLKKWGHMKLRSVLQECVFDKEF
        K+++ ++A  L+ASVPG H+ +  YL + G           +F +EF
Subjt:  KFDYSNAAVRLIASVPGYHTGR--YLKKWGHMKLRSVLQECVFDKEF

AT5G15170.1 tyrosyl-DNA phosphodiesterase-related9.2e-24564.33Show/hide
Query:  KVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVI-NSGDG-RKKLGHRGSVVIRDGDVIELIPGH
        +V YL+PL  +L+EDNS  +I LSEGPNIIGR NV + DKR+SRKHIT+ +ST G A L V+GTNPVVI +SGDG RKK+     V + + D+IELIPGH
Subjt:  KVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRSNVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVI-NSGDG-RKKLGHRGSVVIRDGDVIELIPGH

Query:  YLFKYASHCFNTRPSSEDLGQKRVRQVADDISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDI
        + FK      N R +      K+ R+  DD                              VEAIR F   +++LP TFRLLSV  LP WANTSCV I D+
Subjt:  YLFKYASHCFNTRPSSEDLGQKRVRQVADDISERKAKRVEMGSPLENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDI

Query:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS
        ++GD++ A+LSNYMVDIDWL+ ACP LA +P V+VIHGEGDG  + ++RKKP NWILHKP LPISFGTHHSKAIFLVYPRG+R+VVHTANLI+VDWNNKS
Subjt:  VQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNWILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKS

Query:  QGLWMQDFPWKDQNSS-TRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDK
        QGLWMQDFPWKD +    +GC FE DL+DYLN LKWPEF+AN P  GN  IN +FF+KFDYS+A VRLIASVPGYHTG  L KWGHMKLR++LQEC+FD+
Subjt:  QGLWMQDFPWKDQNSS-TRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAVRLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDK

Query:  EFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH
        EF+RSPL+YQFSSLGSLDEKW+AEF  SLSSG + DKTPLG G+ LI+WPTVEDVRCSLEGYAAGNAIPSPLKNVEK FL+KYWA+WK+ HS R  AMPH
Subjt:  EFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVEKGFLRKYWAKWKSYHSGRCHAMPH

Query:  IKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQ--KRHDCSFSCTKSGGSAQN-KSRPSENLEEKTELVTLAWQENRKRESL
        IKTF RYN QK+AW +LTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLP   K   C FSCT+S  S    K    + +E++++LVT+ WQ +R    L
Subjt:  IKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQ--KRHDCSFSCTKSGGSAQN-KSRPSENLEEKTELVTLAWQENRKRESL

Query:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPR
         E+I LP+PY+LPP+PY P+DVPWSWDR Y++KDV+G VWPR
Subjt:  SEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTTCAGTCTCTGCCCGCCGCTTCAGTACCACCACCAACAACCAACTCCGTCGCTCATCGGCTTTCCTCGCCCCACAGCCCCCATTTTCATTGTCATCTCCACAGAT
TTTGCAGTATTCCTCTTCTCCCTTCAAACCTCTCTCCTTAAACCATCGCCACCGTCAGCTTAGTTCGCCATTGGCTTCATCAGGCCCTTCTCCTCTCTGCTCAGCGATGG
CTCGCCTCCAGAAGGTTGGGTATTTGGTTCCGCTTGATAAAAATCTGGAGGAAGACAATTCGGGGTTAAAGATACCTCTCTCTGAGGGCCCAAATATCATCGGCCGTAGC
AACGTTCTCGTTTCTGATAAGAGGATCAGCCGCAAGCACATCACTCTTACCATTTCCACTGACGGCCCTGCTAAACTACTTGTGGAAGGTACGAATCCAGTTGTTATTAA
TTCTGGTGATGGTAGAAAGAAACTTGGCCATCGTGGAAGTGTGGTAATTCGGGACGGGGATGTTATAGAGTTGATTCCAGGCCATTATCTTTTCAAGTATGCATCTCATT
GTTTCAATACAAGGCCCAGTTCTGAGGATTTGGGACAGAAGAGAGTTAGACAAGTGGCGGACGATATTTCTGAGAGAAAGGCTAAGAGGGTTGAAATGGGAAGCCCTTTG
GAGAATCTACAATCTGGGATCTCGCAGTCTAGGGAAGATAATAGTGTGGAAGCTATTCGTAATTTTCGCGTTTCCGACGACAGATTGCCATTGACTTTCAGACTTTTGAG
CGTTAAAGGCCTGCCACCATGGGCTAATACATCATGTGTGAGGATTACTGATATTGTTCAGGGGGACATTCTCTTTGCCGTCCTGTCAAATTACATGGTGGATATCGATT
GGTTAATACCTGCATGTCCTGCTCTTGCAAAAGTTCCTCACGTGCTGGTTATTCATGGTGAGGGTGATGGAACAGTGGATAATATGAAGAGGAAGAAGCCTGATAATTGG
ATTTTGCACAAACCACCACTACCCATATCTTTTGGGACTCACCATTCAAAAGCAATATTTCTTGTCTATCCTAGAGGAATAAGAATGGTTGTACACACTGCGAATCTAAT
CTATGTTGATTGGAACAACAAAAGCCAAGGTTTATGGATGCAAGATTTCCCCTGGAAAGATCAAAATTCCTCTACAAGAGGATGCGCATTTGAAGATGACTTGGTTGACT
ATCTTAATGCTTTGAAGTGGCCAGAGTTTTCTGCTAATTTTCCTGCACTTGGAAACTTCAACATCAATCCATCTTTTTTCAGAAAGTTTGATTATAGCAACGCAGCGGTT
AGATTGATTGCTTCTGTGCCTGGATATCATACGGGTCGCTATTTAAAGAAGTGGGGCCATATGAAGCTACGTTCTGTTCTCCAGGAGTGTGTTTTTGATAAAGAGTTTCA
GAGATCTCCTCTTGTATACCAGTTCTCTTCCCTTGGATCGCTGGATGAGAAATGGATGGCTGAGTTTGCAGCGTCACTGTCATCTGGTTCCTCCGCTGATAAAACACCTC
TCGGTCTTGGGGAACCACTAATAGTATGGCCTACCGTGGAAGACGTCAGATGTTCTCTGGAGGGTTATGCTGCTGGAAACGCCATTCCTAGTCCATTAAAGAATGTGGAG
AAGGGATTTTTAAGAAAGTATTGGGCAAAATGGAAATCATACCATAGTGGCCGATGTCATGCAATGCCACACATAAAGACATTTGCTCGTTATAATGGTCAGAAACTTGC
TTGGTTGGTGCTAACGTCATCCAACCTCAGCAAAGCTGCCTGGGGAGCACTTCAGAAGAACAATTCTCAGCTAATGATTCGTTCTTATGAGCTTGGGGTGCTCTTTCTTC
CTCAAAAAAGACATGATTGCAGTTTTTCTTGTACCAAGAGTGGAGGTTCAGCACAGAACAAGTCAAGACCATCAGAGAACTTAGAAGAAAAAACAGAACTGGTGACATTG
GCTTGGCAAGAAAACAGGAAAAGAGAATCATTGTCTGAAGTAATTCAATTGCCTATACCTTATGAACTCCCTCCTCAGCCATATGGCCCTCAAGATGTACCCTGGTCTTG
GGACCGTCGCTATACCCAAAAAGATGTTCACGGTGCGGTTTGGCCACGTCAAGTTCAGCTTTATGCTTCCTAG
mRNA sequenceShow/hide mRNA sequence
TTAGGGAGAGAACAGACCCAGGCCCAAGACGGTGCGTTTCATCATAGTCTGACCACCGTGGCATGTGTTCAGTCTCTGCCCGCCGCTTCAGTACCACCACCAACAACCAA
CTCCGTCGCTCATCGGCTTTCCTCGCCCCACAGCCCCCATTTTCATTGTCATCTCCACAGATTTTGCAGTATTCCTCTTCTCCCTTCAAACCTCTCTCCTTAAACCATCG
CCACCGTCAGCTTAGTTCGCCATTGGCTTCATCAGGCCCTTCTCCTCTCTGCTCAGCGATGGCTCGCCTCCAGAAGGTTGGGTATTTGGTTCCGCTTGATAAAAATCTGG
AGGAAGACAATTCGGGGTTAAAGATACCTCTCTCTGAGGGCCCAAATATCATCGGCCGTAGCAACGTTCTCGTTTCTGATAAGAGGATCAGCCGCAAGCACATCACTCTT
ACCATTTCCACTGACGGCCCTGCTAAACTACTTGTGGAAGGTACGAATCCAGTTGTTATTAATTCTGGTGATGGTAGAAAGAAACTTGGCCATCGTGGAAGTGTGGTAAT
TCGGGACGGGGATGTTATAGAGTTGATTCCAGGCCATTATCTTTTCAAGTATGCATCTCATTGTTTCAATACAAGGCCCAGTTCTGAGGATTTGGGACAGAAGAGAGTTA
GACAAGTGGCGGACGATATTTCTGAGAGAAAGGCTAAGAGGGTTGAAATGGGAAGCCCTTTGGAGAATCTACAATCTGGGATCTCGCAGTCTAGGGAAGATAATAGTGTG
GAAGCTATTCGTAATTTTCGCGTTTCCGACGACAGATTGCCATTGACTTTCAGACTTTTGAGCGTTAAAGGCCTGCCACCATGGGCTAATACATCATGTGTGAGGATTAC
TGATATTGTTCAGGGGGACATTCTCTTTGCCGTCCTGTCAAATTACATGGTGGATATCGATTGGTTAATACCTGCATGTCCTGCTCTTGCAAAAGTTCCTCACGTGCTGG
TTATTCATGGTGAGGGTGATGGAACAGTGGATAATATGAAGAGGAAGAAGCCTGATAATTGGATTTTGCACAAACCACCACTACCCATATCTTTTGGGACTCACCATTCA
AAAGCAATATTTCTTGTCTATCCTAGAGGAATAAGAATGGTTGTACACACTGCGAATCTAATCTATGTTGATTGGAACAACAAAAGCCAAGGTTTATGGATGCAAGATTT
CCCCTGGAAAGATCAAAATTCCTCTACAAGAGGATGCGCATTTGAAGATGACTTGGTTGACTATCTTAATGCTTTGAAGTGGCCAGAGTTTTCTGCTAATTTTCCTGCAC
TTGGAAACTTCAACATCAATCCATCTTTTTTCAGAAAGTTTGATTATAGCAACGCAGCGGTTAGATTGATTGCTTCTGTGCCTGGATATCATACGGGTCGCTATTTAAAG
AAGTGGGGCCATATGAAGCTACGTTCTGTTCTCCAGGAGTGTGTTTTTGATAAAGAGTTTCAGAGATCTCCTCTTGTATACCAGTTCTCTTCCCTTGGATCGCTGGATGA
GAAATGGATGGCTGAGTTTGCAGCGTCACTGTCATCTGGTTCCTCCGCTGATAAAACACCTCTCGGTCTTGGGGAACCACTAATAGTATGGCCTACCGTGGAAGACGTCA
GATGTTCTCTGGAGGGTTATGCTGCTGGAAACGCCATTCCTAGTCCATTAAAGAATGTGGAGAAGGGATTTTTAAGAAAGTATTGGGCAAAATGGAAATCATACCATAGT
GGCCGATGTCATGCAATGCCACACATAAAGACATTTGCTCGTTATAATGGTCAGAAACTTGCTTGGTTGGTGCTAACGTCATCCAACCTCAGCAAAGCTGCCTGGGGAGC
ACTTCAGAAGAACAATTCTCAGCTAATGATTCGTTCTTATGAGCTTGGGGTGCTCTTTCTTCCTCAAAAAAGACATGATTGCAGTTTTTCTTGTACCAAGAGTGGAGGTT
CAGCACAGAACAAGTCAAGACCATCAGAGAACTTAGAAGAAAAAACAGAACTGGTGACATTGGCTTGGCAAGAAAACAGGAAAAGAGAATCATTGTCTGAAGTAATTCAA
TTGCCTATACCTTATGAACTCCCTCCTCAGCCATATGGCCCTCAAGATGTACCCTGGTCTTGGGACCGTCGCTATACCCAAAAAGATGTTCACGGTGCGGTTTGGCCACG
TCAAGTTCAGCTTTATGCTTCCTAGGATTCTTGATATCCTCTTTCCCCTCCAAGTATTTATTAAACTTAGGGTTCATTTGGACGGTTCCTTTTCATCTTACTATATCCCT
CTGGATTGGAAGTGCTAATTTGGGTCACTCTTTCTTAGGAAAAGGTGGGAAAGTTGGAAGCAGGAAGAGTTGTTAATTAGACAAAAGACATTCGGCTCCATGATATTCTT
GTATCAACTCATATCCCAGGGAACAAAATGTTTTTGAATTATTCTTCATATTAAAGAATATTTTGTCTATTAGGAGGAGTCTTTAAGAAAGTAGTAGTAACAGTCAGGCA
TATTTTGCTGTTGACCAAAATGTCATAAATTTGAAGAATCGCTACGCTCACGTGGTTGAACTAAAAAACATGTATATTCAAAAGCAGATTGATGGAAACTTAACACATAA
ATCACATCAATCAAATCACCGCCGTGGAAATATGGGAAGCAATCTCATTAATAGGCACTCTTTGACTATTAGTAGACATAATATCATTAGTTATTGGATTGGCAGATGTA
ATCTTTTTGAGTATTTATACCTCTGGCCGAGGTGATGGCGTGATGCAGTAATTAAACTTCAATTTTTTCTTTCCATTGATCAATTAATTAAGAGATGTAAACG
Protein sequenceShow/hide protein sequence
MCSVSARRFSTTTNNQLRRSSAFLAPQPPFSLSSPQILQYSSSPFKPLSLNHRHRQLSSPLASSGPSPLCSAMARLQKVGYLVPLDKNLEEDNSGLKIPLSEGPNIIGRS
NVLVSDKRISRKHITLTISTDGPAKLLVEGTNPVVINSGDGRKKLGHRGSVVIRDGDVIELIPGHYLFKYASHCFNTRPSSEDLGQKRVRQVADDISERKAKRVEMGSPL
ENLQSGISQSREDNSVEAIRNFRVSDDRLPLTFRLLSVKGLPPWANTSCVRITDIVQGDILFAVLSNYMVDIDWLIPACPALAKVPHVLVIHGEGDGTVDNMKRKKPDNW
ILHKPPLPISFGTHHSKAIFLVYPRGIRMVVHTANLIYVDWNNKSQGLWMQDFPWKDQNSSTRGCAFEDDLVDYLNALKWPEFSANFPALGNFNINPSFFRKFDYSNAAV
RLIASVPGYHTGRYLKKWGHMKLRSVLQECVFDKEFQRSPLVYQFSSLGSLDEKWMAEFAASLSSGSSADKTPLGLGEPLIVWPTVEDVRCSLEGYAAGNAIPSPLKNVE
KGFLRKYWAKWKSYHSGRCHAMPHIKTFARYNGQKLAWLVLTSSNLSKAAWGALQKNNSQLMIRSYELGVLFLPQKRHDCSFSCTKSGGSAQNKSRPSENLEEKTELVTL
AWQENRKRESLSEVIQLPIPYELPPQPYGPQDVPWSWDRRYTQKDVHGAVWPRQVQLYAS