CuGenDBv2

CSPI05G28070 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G28070
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: methyl-CpG-binding domain protein 4-like protein
Genome location: Chr5:26568383..26571343
RNA-Seq Expression: CSPI05G28070
Synteny: CSPI05G28070
Gene Ontology terms:
    GO:0006284 - base-excision repair (biological process)
    GO:0003677 - DNA binding (molecular function)
    GO:0003824 - catalytic activity (molecular function)
InterPro domains:
    IPR003265 - HhH-GPD domain
    IPR011257 - DNA glycosylase
    IPR045138 - Methyl-CpG binding protein MeCP2/MBD4
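
The genome location in the record above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and its length computed as follows. `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a string such as 'Chr5:26568383..26571343' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Chr5:26568383..26571343")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Chr5 26568383 26571343 2961
```

Under that assumption the gene spans 2961 bp on Chr5.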


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0054277.1 methyl-CpG-binding domain protein 4-like protein [Cucumis melo var. makuwa] (e value: 2.7e-228, % identity: 90.22)
Query:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN
        MA+TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYF++SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKS TDT+ RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY A    + C
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC

XP_004142362.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucumis sativus] (e value: 3.0e-251, % identity: 98.26)
Query:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN
        MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN
Subjt:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
Subjt:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC
        ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY A    + C
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC

XP_008460559.1 PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo] (e value: 2.7e-228, % identity: 90.22)
Query:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN
        MA+TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYF++SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKS TDT+ RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY A    + C
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC

XP_031741432.1 methyl-CpG-binding domain protein 4 isoform X2 [Cucumis sativus] (e value: 7.4e-250, % identity: 98.9)
Query:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN
        MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN
Subjt:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
Subjt:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAY
        ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVG  L +
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAY

XP_038892490.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Benincasa hispida] (e value: 7.1e-160, % identity: 66.53)
Query:  ASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDST--------QHSPLSTLHDLQTPEPSNHHNESL-------------
        A+T SI+ NLTPPSSSSYP DLFS+F FRG+SRSR    PSKS+QQ+P   QD T        QHSP++T  DLQ  EP NH N+SL             
Subjt:  ASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDST--------QHSPLSTLHDLQTPEPSNHHNESL-------------

Query:  -ASPSSEVHEPPILTLEDLQNGKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKER
         +SPSS+V+EPPILTLEDLQN K   Q PK+P LARR+L+FYREFGFD+K+ Q TSHSVLNS P QEG R+ SRYFQNS+STQQ +R VSRYFQ+SVK+R
Subjt:  -ASPSSEVHEPPILTLEDLQNGKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKER

Query:  TAHYEDENDGGNLTEQPSKRSSKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFF
         AH EDE++  NLTEQPSKRSSKRRRKDV P SDNSKTN HS+GK +RS+QKSGTD +VRIVS YFQ+ EK++E+DR                       
Subjt:  TAHYEDENDGGNLTEQPSKRSSKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFF

Query:  LKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNR
                     EAT+Q+NQ AKS KRVRKPVNERK++DKTSS+KPRTTLTAAEL LEAYRRKS  DTWKPP SG RLLQ DHAYDPWRVLVICMLLNR
Subjt:  LKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNR

Query:  TSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC
        TSGQQAKEVIPKLF LCPNPKATL+VS+EQIEDIIRPLG  RKRSRTM  LSEMYLKE+WSHVTQLPGVGKY A    + C
Subjt:  TSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC
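
In the alignments above, the unlabeled middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. As a rough sketch of how identities can be counted from that middle line, the snippet below uses the last 14 columns of one of the blocks above. `alignment_stats` is a hypothetical helper; the % identity reported in each hit header is computed over the full alignment, so a single block will generally give a different figure.

```python
def alignment_stats(query, midline):
    """Return (identities, positives, length) for one aligned block.

    An identity is a column where the midline repeats the query residue;
    a '+' in the midline marks a conservative substitution.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Last 14 columns of an alignment block (query line and middle line).
query = "GVGKYLAYPCTLSC"
midline = "GVGKY A    + C"

identities, positives, length = alignment_stats(query, midline)
print(identities, positives, length)  # 7 8 14
print(f"{100 * identities / length:.2f}% identity over this block")  # 50.00%
```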

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KRW9 ENDO3c domain-containing protein (e value: 1.8e-257, % identity: 99.78)
Query:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN
        MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN
Subjt:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
Subjt:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC
        ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC

A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein (e value: 1.3e-228, % identity: 90.22)
Query:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN
        MA+TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYF++SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKS TDT+ RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY A    + C
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein (e value: 1.3e-228, % identity: 90.22)
Query:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN
        MA+TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYF++SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKS TDT+ RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY A    + C
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein (e value: 5.0e-143, % identity: 58.99)
Query:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSK----SAQQDPNPYQDST--------QHSPLSTLHDLQTPEPSNHHNE----------
        M +TT ++PNL+PPSSSS+P  LFS+F F+G S SRFRFPPSK    S +Q+P P +D T        Q+SP+STL  LQT E SNH             
Subjt:  MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSK----SAQQDPNPYQDST--------QHSPLSTLHDLQTPEPSNHHNE----------

Query:  -----------------------SLASPSSE-----VHEPPILTLEDLQNGKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRV
                               S  +P+SE     VHEPPILTLED+QN K   Q   +P LARRVL FYR+FGFD++++Q T  SV NS+P Q   RV
Subjt:  -----------------------SLASPSSE-----VHEPPILTLEDLQNGKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRV

Query:  VSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGG--NLTEQPSKRS-----SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSG
        VSR+FQ S+S QQ +RIVSRYFQ S  ER AH EDE++    N+T+QP KRS      KRRRKDV   SDNSK    S+ K++R V++SGTD +VR VS 
Subjt:  VSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGG--NLTEQPSKRS-----SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSG

Query:  YFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRK
        YFQ+ EK+ E++ EVSP LQNSK+ QQ E++VSRFF KS +Q+ VNNQ+E  +  +QCAKSVKR+RKP  ERK +DK S+ +PRTTL+A ELFLEAYRRK
Subjt:  YFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRK

Query:  SPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVT
        S  DTWKPP SG RLLQ DHAYDPWRVLVICMLLNRT+GQQAKEVIPKLF+LCP+PK+ LEVS+EQIEDIIRPLG  RKRS T+ RLSEMYLKESWSHVT
Subjt:  SPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVT

Query:  QLPGVGKYLAYPCTLSC
        QLPGVGKY A    + C
Subjt:  QLPGVGKYLAYPCTLSC

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X1 (e value: 1.2e-131, % identity: 65.2)
Query:  TLHDLQTPEPSNHHNESLASPSSE-----VHEPPILTLEDLQNGKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQN
        T+ D+Q   P         +P+SE      HEPPILTLEDLQN K   Q   +P LARRVL F R+FGFD++++Q T  SV NS+P Q   RVVSR+FQ 
Subjt:  TLHDLQTPEPSNHHNESLASPSSE-----VHEPPILTLEDLQNGKLPRQSPKQPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQN

Query:  SRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS-----SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSL
        S+S QQ +RIVSRYFQ S  ER AH EDE+D  N+T+QP KRS      KRRRKDV   SDNSK    S+ K++RS++KSGTD +VRIVS YFQ+ EK+ 
Subjt:  SRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS-----SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSL

Query:  EMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPP
        E++ EVSPSLQNSK+NQQEE+VVSRFF KS + + VNNQ+E  +  +QCAKSVKR+RKP  ERK +DK S+ KPRTTL+A ELFLEAYRRKS  DTWKPP
Subjt:  EMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPP

Query:  TSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYL
         SG RLLQ DHAYDPWRVLVICMLLNRT+GQQAKEVIPKLF+LCP+PK+ LEVS+EQIEDIIRPLG  RKRS T+ RLSEMYLKESWSHVTQLPGVGKY 
Subjt:  TSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYL

Query:  AYPCTLSC
        A    + C
Subjt:  AYPCTLSC

SwissProt top hits (columns: e value, % identity, alignment)
O95243 Methyl-CpG-binding domain protein 4 (e value: 7.5e-11, % identity: 24.48)
Query:  QESVKERTAHYEDENDGGNLTEQPSKRSSKRRRKDVTPGSDNSKTNHHSVGKTARSVQK--SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQ
        Q+S  +RT    D    G      S+ +S  ++K+    S +S +N  S  KT+  + K  S  D++      + + YE +     E+   ++  +  + 
Subjt:  QESVKERTAHYEDENDGGNLTEQPSKRSSKRRRKDVTPGSDNSKTNHHSVGKTARSVQK--SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQ

Query:  EEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKR-VRKPVNERKEKDKTSSTKPRTTLTAAELF----LEAYRRKSPYDTWKPPTSGTRLLQHDHAY
            +    LK G           +E  N C+ + K    + + +     +T   + +T+L  +  +    L   RRK+ +  W PP S   L+Q    +
Subjt:  EEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKR-VRKPVNERKEKDKTSSTKPRTTLTAAELF----LEAYRRKSPYDTWKPPTSGTRLLQHDHAY

Query:  DPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY
        DPW++L+  + LNRTSG+ A  V+ K     P+ +         + ++++PLG Y  R++T+ + S+ YL + W +  +L G+GKY
Subjt:  DPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY

Q0IGK1 Methyl-CpG-binding domain protein 4-like protein (e value: 2.2e-34, % identity: 39.59)
Query:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K              VNE +KE
Subjt:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE

Query:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR
        K +     P     L+ ++   + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K   EV  E+IE++I+
Subjt:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR

Query:  PLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC
        PLG  +KR++ + RLS  YL+ESW+HVTQL GVGKY A    + C
Subjt:  PLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC

Q7LX22 Thymine/uracil-DNA glycosylase (e value: 1.3e-07, % identity: 32.08)
Query:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHV-------TQLPG
        G + L    A DPW VLV  +LL +T+ +Q  ++  +     P+P    + S E+I+ II+PLG    R+  + +LSE  ++     +         LPG
Subjt:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHV-------TQLPG

Query:  VGKYLA
        VG Y A
Subjt:  VGKYLA

Q9YDP0 Thymine-DNA glycosylase (e value: 1.2e-08, % identity: 30.19)
Query:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMY-------LKESWSHVTQLPG
        G + L   +  DPW +LV   LL +T+ +Q   V  +     PNPKA      +++ ++IRPLG   +R++ +  L++         +  S   + +LPG
Subjt:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMY-------LKESWSHVTQLPG

Query:  VGKYLA
        VG Y+A
Subjt:  VGKYLA

Q9Z2D7 Methyl-CpG-binding domain protein 4 (e value: 8.5e-15, % identity: 28.3)
Query:  QLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELF----LEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKL
        ++  C+++ K       +     +T   K +T+L  +  +    L   RRKS +  W PP S   L+Q    +DPW++L+  + LNRTSG+ A  V+ + 
Subjt:  QLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELF----LEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKL

Query:  FSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY
            P+ +         + ++++PLG Y  R++T+ + S+ YL + W +  +L G+GKY
Subjt:  FSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY
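
The SwissProt hits above are not listed in order of significance. A smaller e value indicates a more significant match, so one way to rank them is to sort on that column. The sketch below does this with the accessions, e values, and % identity figures as read from the hit headers in this section; the `Hit` type is a hypothetical container introduced only for illustration.

```python
from typing import NamedTuple

class Hit(NamedTuple):
    accession: str
    evalue: float
    identity: float  # percent identity as reported on the page

# Values read from the SwissProt hit headers in this section.
hits = [
    Hit("O95243", 7.5e-11, 24.48),
    Hit("Q0IGK1", 2.2e-34, 39.59),
    Hit("Q7LX22", 1.3e-07, 32.08),
    Hit("Q9YDP0", 1.2e-08, 30.19),
    Hit("Q9Z2D7", 8.5e-15, 28.3),
]

# Smaller e value first, i.e. most significant hit first.
ranked = sorted(hits, key=lambda hit: hit.evalue)
for hit in ranked:
    print(f"{hit.accession}\t{hit.evalue:.1e}\t{hit.identity}")
```

With these values, Q0IGK1 ranks first and Q7LX22 last.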

Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G07930.1 DNA glycosylase superfamily protein (e value: 7.7e-11, % identity: 34.34)
Query:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K              VNE +KE
Subjt:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE

Query:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTS
        K +     P     L+ ++   + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TS
Subjt:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTS

AT3G07930.2 DNA glycosylase superfamily protein (e value: 4.1e-12, % identity: 34.91)
Query:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K              VNE +KE
Subjt:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE

Query:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQ
        K +     P     L+ ++   + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q
Subjt:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQ

AT3G07930.3 DNA glycosylase superfamily protein (e value: 1.5e-35, % identity: 39.59)
Query:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K              VNE +KE
Subjt:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE

Query:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR
        K +     P     L+ ++   + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K   EV  E+IE++I+
Subjt:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR

Query:  PLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC
        PLG  +KR++ + RLS  YL+ESW+HVTQL GVGKY A    + C
Subjt:  PLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYLAYPCTLSC


Sequences
CDS sequence
ATGGCTTCAACGACAAGCATCCACCCTAACCTCACCCCACCGTCATCTTCTTCGTATCCCCACGATTTGTTTTCTGAATTCGTCTTTCGAGGTACTTCTCGCTCCAGATT
TCGCTTTCCTCCTTCCAAATCCGCTCAACAAGACCCTAATCCCTACCAGGATTCTACCCAGCACTCCCCACTGTCTACTCTTCATGATCTCCAAACTCCAGAACCCAGCA
ATCATCACAACGAATCCTTAGCATCCCCATCTTCTGAAGTCCACGAGCCTCCTATATTAACACTAGAGGATCTTCAAAATGGAAAACTACCCCGTCAATCGCCAAAACAG
CCTTCACTCGCTCGTAGAGTCTTGTCTTTCTACCGAGAGTTCGGATTTGATAAAAAATTGTTGCAAGCAACTTCGCATTCTGTCCTGAATTCAGTACCTGCTCAAGAAGG
GACCCGTGTGGTTTCGCGTTATTTCCAAAACTCAAGATCAACCCAACAAAGCAAACGAATTGTCTCACGATATTTTCAAGAATCGGTGAAGGAACGAACAGCTCATTATG
AGGATGAGAACGATGGTGGCAATCTCACAGAGCAGCCAAGTAAAAGATCAAGCAAAAGGAGGAGGAAAGACGTAACCCCCGGCTCCGATAACTCAAAAACAAATCATCAT
TCAGTGGGAAAGACTGCACGCTCTGTTCAGAAGTCGGGAACAGATACACAAGTGCGAATTGTTTCGGGCTATTTTCAGAGTTATGAAAAGAGTCTTGAAATGGATCGAGA
AGTATCACCTTCTTTACAAAATTCAAAATCAAATCAACAAGAAGAGAAAGTGGTCTCACGTTTCTTTCTAAAATCAGGGAAACAACAAGCCGTGAACAATCAGGAAGAGG
CTACAGAGCAGCTAAATCAGTGTGCGAAATCTGTTAAAAGGGTCCGTAAACCAGTCAATGAAAGGAAAGAGAAGGATAAGACAAGTTCTACTAAACCTCGGACCACTCTG
ACTGCTGCAGAGTTGTTTTTGGAAGCTTACAGAAGGAAATCGCCATATGATACATGGAAGCCTCCTACCTCTGGAACTCGCCTTCTCCAACATGATCATGCGTACGACCC
TTGGAGGGTTCTAGTCATATGTATGCTCCTCAACCGGACAAGTGGGCAACAGGCAAAAGAAGTGATACCTAAACTCTTCAGTTTGTGTCCCAATCCAAAGGCTACTTTGG
AGGTATCACGTGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTCTATAGGAAAAGATCACGAACAATGCATCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGC
CATGTCACCCAACTTCCTGGTGTCGGCAAGTATTTAGCCTATCCTTGTACACTTTCCTGTTGA
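
The CDS above can be checked against the protein sequence in this record by translating it codon by codon. The sketch below assumes the standard genetic code and builds the codon table from the conventional TCAG ordering. `translate` is a hypothetical helper, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... (TCAG order).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGGCTTCAACGACAAGCATCCACCCTAAC"))  # MASTTSIHPN
```

The output matches the first ten residues of the protein sequence in this record. Passing the full CDS to the same function would allow the complete comparison.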
mRNA sequence
AAAAGCGCTAAAATGGTAGAAAGGATTTATAACCCAAGGAGAAGCGCGCGAAGACAGTGAAGCGATTGTTCTTGTTTTCGGTGCTCCGTAGCGAATCCCGCCATGGCTTC
AACGACAAGCATCCACCCTAACCTCACCCCACCGTCATCTTCTTCGTATCCCCACGATTTGTTTTCTGAATTCGTCTTTCGAGGTACTTCTCGCTCCAGATTTCGCTTTC
CTCCTTCCAAATCCGCTCAACAAGACCCTAATCCCTACCAGGATTCTACCCAGCACTCCCCACTGTCTACTCTTCATGATCTCCAAACTCCAGAACCCAGCAATCATCAC
AACGAATCCTTAGCATCCCCATCTTCTGAAGTCCACGAGCCTCCTATATTAACACTAGAGGATCTTCAAAATGGAAAACTACCCCGTCAATCGCCAAAACAGCCTTCACT
CGCTCGTAGAGTCTTGTCTTTCTACCGAGAGTTCGGATTTGATAAAAAATTGTTGCAAGCAACTTCGCATTCTGTCCTGAATTCAGTACCTGCTCAAGAAGGGACCCGTG
TGGTTTCGCGTTATTTCCAAAACTCAAGATCAACCCAACAAAGCAAACGAATTGTCTCACGATATTTTCAAGAATCGGTGAAGGAACGAACAGCTCATTATGAGGATGAG
AACGATGGTGGCAATCTCACAGAGCAGCCAAGTAAAAGATCAAGCAAAAGGAGGAGGAAAGACGTAACCCCCGGCTCCGATAACTCAAAAACAAATCATCATTCAGTGGG
AAAGACTGCACGCTCTGTTCAGAAGTCGGGAACAGATACACAAGTGCGAATTGTTTCGGGCTATTTTCAGAGTTATGAAAAGAGTCTTGAAATGGATCGAGAAGTATCAC
CTTCTTTACAAAATTCAAAATCAAATCAACAAGAAGAGAAAGTGGTCTCACGTTTCTTTCTAAAATCAGGGAAACAACAAGCCGTGAACAATCAGGAAGAGGCTACAGAG
CAGCTAAATCAGTGTGCGAAATCTGTTAAAAGGGTCCGTAAACCAGTCAATGAAAGGAAAGAGAAGGATAAGACAAGTTCTACTAAACCTCGGACCACTCTGACTGCTGC
AGAGTTGTTTTTGGAAGCTTACAGAAGGAAATCGCCATATGATACATGGAAGCCTCCTACCTCTGGAACTCGCCTTCTCCAACATGATCATGCGTACGACCCTTGGAGGG
TTCTAGTCATATGTATGCTCCTCAACCGGACAAGTGGGCAACAGGCAAAAGAAGTGATACCTAAACTCTTCAGTTTGTGTCCCAATCCAAAGGCTACTTTGGAGGTATCA
CGTGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTCTATAGGAAAAGATCACGAACAATGCATCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGCCATGTCAC
CCAACTTCCTGGTGTCGGCAAGTATTTAGCCTATCCTTGTACACTTTCCTGTTGACTATTCTATTTCTTCAGTCTAAGATGCTTATTTTAGTCATGTCCCAAAACTTAAA
TTATGAACTCATCCATCATTAAAGGCTATGATGGTGGTGAATCACAAACGTGGGACTATGTATCATCGCCTAAGAAGCACAGACACTTCAATTTGGATAGCATGTCCATG
TTGGACACTTGGCGGGCACCTATTGCACGCTTGCTAGTCCAACAAATGTATTAGACATACATAGAACACTTATTGAGTAAGTTAAAAAGACACATATATGACTATAATAA
CAACTTTTGAGCATGAAATACATCAAGTTAAGTCTTTTAAGCATATAAATGCATTAATTCATTTGCTATGAATTTTCTTTTACTATAAAAATGATATATATATT
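
The mRNA above contains the CDS flanked by untranslated regions. One simple way to locate the boundaries is to search the mRNA for the full CDS string instead of the first `ATG`, because an upstream `ATG` can occur in the 5' UTR. The sketch below illustrates this on short toy sequences that are hypothetical and not taken from this record; `locate_cds` is likewise an illustrative helper name.

```python
def locate_cds(mrna, cds):
    """Return (cds_start, five_prime_utr, three_prime_utr) for a CDS inside an mRNA.

    cds_start is a 0-based offset into the mRNA string.
    """
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    return start, mrna[:start], mrna[start + len(cds):]

# Toy sequences (hypothetical): the 5' UTR contains an upstream ATG.
toy_cds = "ATGGCTTCATGA"
toy_mrna = "AAATGGTAGCC" + toy_cds + "CTATTC"

start, utr5, utr3 = locate_cds(toy_mrna, toy_cds)
print(toy_mrna.find("ATG"))  # 2, the upstream ATG, not the CDS start
print(start, len(utr5), utr3)  # 11 11 CTATTC
```

Applying `locate_cds` to the mRNA and CDS strings in this record would give the lengths of the two untranslated regions.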
Protein sequence
MASTTSIHPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLASPSSEVHEPPILTLEDLQNGKLPRQSPKQ
PSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRSSKRRRKDVTPGSDNSKTNHH
SVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTL
TAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWS
HVTQLPGVGKYLAYPCTLSC