CuGenDBv2

Cucsat.G8307 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G8307
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: methyl-CpG-binding domain protein 4-like protein
Genome location: ctg1557:4790297..4798325
RNA-Seq Expression: Cucsat.G8307
Synteny: Cucsat.G8307
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA
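
The genome location above is given in a `contig:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the string can be split into its parts and the gene span computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates, so both end positions are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string such as the one in this record."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return Location(contig, start, end)


loc = parse_location("ctg1557:4790297..4798325")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the locus spans 8029 bp, which is longer than the coding sequence listed further down, as expected for a gene that contains introns.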


Homology
GenBank top hits    e value    % identity    Alignment
KAA0054277.1 methyl-CpG-binding domain protein 4-like protein [Cucumis melo var. makuwa]    2.23e-285    89.89
Query:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN
        MA+TTS NPNL PPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SL SPSSE  EPPILTLEDLQN
Subjt:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSR+FQN+RSTQQ +RIVSRYF++SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKS TDT+ RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGES
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVG  G ++
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGES

XP_004142362.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucumis sativus]    4.12e-313    94.5
Query:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN
        MASTTS +PNL PPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESL SPSSEVHEPPILTLEDLQN
Subjt:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLPRQSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSR+FQN+RSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQT------WDYVSSPKKH
        ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVG  G ++        W  V  PK H
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQT------WDYVSSPKKH

XP_008460559.1 PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo]    5.29e-286    87.53
Query:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN
        MA+TTS NPNL PPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SL SPSSE  EPPILTLEDLQN
Subjt:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSR+FQN+RSTQQ +RIVSRYF++SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKS TDT+ RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQT------WDYVSSPKKH
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVG  G ++        W  V  PK H
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQT------WDYVSSPKKH

XP_031741432.1 methyl-CpG-binding domain protein 4 isoform X2 [Cucumis sativus]    2.86e-312    98.22
Query:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN
        MASTTS +PNL PPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESL SPSSEVHEPPILTLEDLQN
Subjt:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLPRQSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSR+FQN+RSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVG
        ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVG
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVG

XP_038892490.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Benincasa hispida]    7.31e-197    64.78
Query:  ASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQ--------HSPLSTLHDLQTPEPSNHHNESLV------------
        A+T S N NL PPSSSSYP DLFS+F FRG+SRSR    PSKS+QQ+P   QD TQ        HSP++T  DLQ  EP NH N+SL             
Subjt:  ASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQ--------HSPLSTLHDLQTPEPSNHHNESLV------------

Query:  --SPSSEVHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKER
          SPSS+V+EPPILTLEDLQN K   Q PKKP LARR+L+FYREFGFD+K+ Q TSHSVLNS P QEG R+ SR+FQN++STQQ +R VSRYFQ+SVK+R
Subjt:  --SPSSEVHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKER

Query:  TAHYEDENDGGNLTEQPSKRSSKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFF
         AH EDE++  NLTEQPSKRSSKRRRKDV P SDNSKTN HS+GK +RS+QKSGTD +VRIVS YFQ+ EK++E+DR                       
Subjt:  TAHYEDENDGGNLTEQPSKRSSKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFF

Query:  LKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNR
                     EAT+Q+NQ AKS KRVRKPVNERK++DKTSS+KPRTTLTAAEL LEAYRRKS  DTWKPP SG RLLQ DHAYDPWRVLVICMLLNR
Subjt:  LKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNR

Query:  TSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQT------WDYVSSPKKH
        TSGQQAKEVIPKLF LCPNPKATL+VS+EQIEDIIRPLG  RKRSRTM  LSEMYLKE+WSHVTQLPGVG  G ++        W+ V  PK H
Subjt:  TSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQT------WDYVSSPKKH

TrEMBL top hits    e value    % identity    Alignment
A0A0A0KRW9 ENDO3c domain-containing protein    1.15e-312    98.22
Query:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN
        MASTTS +PNL PPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESL SPSSEVHEPPILTLEDLQN
Subjt:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLPRQSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSR+FQN+RSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVG
        ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVG
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVG

A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein    2.56e-286    87.53
Query:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN
        MA+TTS NPNL PPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SL SPSSE  EPPILTLEDLQN
Subjt:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSR+FQN+RSTQQ +RIVSRYF++SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKS TDT+ RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQT------WDYVSSPKKH
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVG  G ++        W  V  PK H
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQT------WDYVSSPKKH

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein    1.08e-285    89.89
Query:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN
        MA+TTS NPNL PPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SL SPSSE  EPPILTLEDLQN
Subjt:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSR+FQN+RSTQQ +RIVSRYF++SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKS TDT+ RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGES
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVG  G ++
Subjt:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGES

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein    3.45e-175    58.11
Query:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSK----SAQQDPNPYQDSTQ--------HSPLSTLHDLQTPEPSNHHNE----------
        M +TT  NPNL PPSSSS+P  LFS+F F+G S SRFRFPPSK    S +Q+P P +D TQ        +SP+STL  LQT E SNH             
Subjt:  MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSK----SAQQDPNPYQDSTQ--------HSPLSTLHDLQTPEPSNHHNE----------

Query:  -----------------------SLVSPSSE-----VHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRV
                               S  +P+SE     VHEPPILTLED+QN K   Q   +P LARRVL FYR+FGFD++++Q T  SV NS+P Q   RV
Subjt:  -----------------------SLVSPSSE-----VHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRV

Query:  VSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGG--NLTEQPSKRSS-----KRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSG
        VSRHFQ ++S QQ +RIVSRYFQ S  ER AH EDE++    N+T+QP KRS      KRRRKDV   SDNSK    S+ K++R V++SGTD +VR VS 
Subjt:  VSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGG--NLTEQPSKRSS-----KRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSG

Query:  YFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRK
        YFQ+ EK+ E++ EVSP LQNSK+ QQ E++VSRFF KS +Q+ VNNQ+E  +  +QCAKSVKR+RKP  ERK +DK S+ +PRTTL+A ELFLEAYRRK
Subjt:  YFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRK

Query:  SPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVT
        S  DTWKPP SG RLLQ DHAYDPWRVLVICMLLNRT+GQQAKEVIPKLF+LCP+PK+ LEVS+EQIEDIIRPLG  RKRS T+ RLSEMYLKESWSHVT
Subjt:  SPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVT

Query:  QLPGVGYDGGESQT------WDYVSSPKKH
        QLPGVG  G ++        W  V  PK H
Subjt:  QLPGVGYDGGESQT------WDYVSSPKKH

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X1    1.21e-161    66.84
Query:  VHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDE
         HEPPILTLEDLQN K   Q   KP LARRVL F R+FGFD++++Q T  SV NS+P Q   RVVSRHFQ ++S QQ +RIVSRYFQ S  ER AH EDE
Subjt:  VHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDE

Query:  NDGGNLTEQPSKRSS-----KRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLK
        +D  N+T+QP KRS      KRRRKDV   SDNSK    S+ K++RS++KSGTD +VRIVS YFQ+ EK+ E++ EVSPSLQNSK+NQQEE+VVSRFF K
Subjt:  NDGGNLTEQPSKRSS-----KRRRKDVTPGSDNSKTNHHSVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLK

Query:  SGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTS
        S + + VNNQ+E  +  +QCAKSVKR+RKP  ERK +DK S+ KPRTTL+A ELFLEAYRRKS  DTWKPP SG RLLQ DHAYDPWRVLVICMLLNRT+
Subjt:  SGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTS

Query:  GQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQT------WDYVSSPKKH
        GQQAKEVIPKLF+LCP+PK+ LEVS+EQIEDIIRPLG  RKRS T+ RLSEMYLKESWSHVTQLPGVG  G ++        W  V  PK H
Subjt:  GQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQT------WDYVSSPKKH

SwissProt top hits    e value    % identity    Alignment
O95243 Methyl-CpG-binding domain protein 4    5.2e-10    24.14
Query:  QESVKERTAHYEDENDGGNLTEQPSKRSSKRRRKDVTPGSDNSKTNHHSVGKTARSVQK--SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQ
        Q+S  +RT    D    G      S+ +S  ++K+    S +S +N  S  KT+  + K  S  D++      + + YE +     E+   ++  +  + 
Subjt:  QESVKERTAHYEDENDGGNLTEQPSKRSSKRRRKDVTPGSDNSKTNHHSVGKTARSVQK--SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQ

Query:  EEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKR-VRKPVNERKEKDKTSSTKPRTTLTAAELF----LEAYRRKSPYDTWKPPTSGTRLLQHDHAY
            +    LK G           +E  N C+ + K    + + +     +T   + +T+L  +  +    L   RRK+ +  W PP S   L+Q    +
Subjt:  EEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKR-VRKPVNERKEKDKTSSTKPRTTLTAAELF----LEAYRRKSPYDTWKPPTSGTRLLQHDHAY

Query:  DPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGES
        DPW++L+  + LNRTSG+ A  V+ K     P+ +         + ++++PLG Y  R++T+ + S+ YL + W +  +L G+G  G +S
Subjt:  DPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGES

Q0IGK1 Methyl-CpG-binding domain protein 4-like protein    4.8e-32    37.98
Query:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K              VNE +KE
Subjt:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE

Query:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR
        K +     P     L+ ++   + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K   EV  E+IE++I+
Subjt:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR

Query:  PLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQ------TWDYVSSPKKH
        PLG  +KR++ + RLS  YL+ESW+HVTQL GVG    ++        WD V  P  H
Subjt:  PLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQ------TWDYVSSPKKH

Q9YDP0 Thymine-DNA glycosylase    3.1e-07    28.45
Query:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMY-------LKESWSHVTQLPG
        G + L   +  DPW +LV   LL +T+ +Q   V  +     PNPKA      +++ ++IRPLG   +R++ +  L++         +  S   + +LPG
Subjt:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMY-------LKESWSHVTQLPG

Query:  VGYDGGESQTWDYVSS
        VG         DY++S
Subjt:  VGYDGGESQTWDYVSS

Q9Z2D7 Methyl-CpG-binding domain protein 4    5.9e-14    27.61
Query:  QLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELF----LEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKL
        ++  C+++ K       +     +T   K +T+L  +  +    L   RRKS +  W PP S   L+Q    +DPW++L+  + LNRTSG+ A  V+ + 
Subjt:  QLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTLTAAELF----LEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKL

Query:  FSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGES
            P+ +         + ++++PLG Y  R++T+ + S+ YL + W +  +L G+G  G +S
Subjt:  FSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGES

Arabidopsis top hits    e value    % identity    Alignment
AT3G07930.1 DNA glycosylase superfamily protein    8.2e-11    34.34
Query:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K              VNE +KE
Subjt:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE

Query:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTS
        K +     P     L+ ++   + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TS
Subjt:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTS

AT3G07930.2 DNA glycosylase superfamily protein    4.4e-12    34.91
Query:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K              VNE +KE
Subjt:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE

Query:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQ
        K +     P     L+ ++   + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q
Subjt:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQ

AT3G07930.3 DNA glycosylase superfamily protein    3.4e-33    37.98
Query:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K              VNE +KE
Subjt:  SGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRK-------------PVNE-RKE

Query:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR
        K +     P     L+ ++   + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K   EV  E+IE++I+
Subjt:  KDKTSSTKP--RTTLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR

Query:  PLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQ------TWDYVSSPKKH
        PLG  +KR++ + RLS  YL+ESW+HVTQL GVG    ++        WD V  P  H
Subjt:  PLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGESQ------TWDYVSSPKKH
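
In the alignment blocks above, the unlabelled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. Because the `Subjt:` rows in this dump appear to repeat the `Query:` rows verbatim, the middle row is the only place where differences between query and hit can be read off. The sketch below counts identities and positives for one block. It assumes the 8-character `Query:  ` prefix seen in this record, and `block_stats` is a hypothetical helper name, not part of any BLAST tool.

```python
PREFIX_WIDTH = len("Query:  ")  # the label column is 8 characters wide in this dump


def block_stats(query_line: str, midline_line: str) -> tuple[int, int, int]:
    """Return (aligned_length, identities, positives) for one alignment block."""
    query = query_line[PREFIX_WIDTH:]
    # Trailing blanks on the midline may have been stripped, so pad it back.
    midline = midline_line[PREFIX_WIDTH:].ljust(len(query))[: len(query)]
    identities = sum(ch.isalpha() for ch in midline)   # letters mark identical residues
    positives = identities + midline.count("+")        # '+' marks conservative changes
    return len(query), identities, positives


# Last block of the first GenBank hit, copied from the record.
query_line = "Query:  ATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGYDGGES"
midline_line = "        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVG  G ++"

aligned, identities, positives = block_stats(query_line, midline_line)
print(f"{identities}/{aligned} identities, {positives}/{aligned} positives")
```

Summing the three counts over all blocks of a hit and dividing identities by aligned length gives a percent identity that can be compared with the table value. Small differences are possible, since the page may compute its figure over a different alignment span.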


Sequences
CDS sequence
ATGGCTTCAACGACAAGCTTCAACCCTAACCTCATCCCACCGTCATCTTCTTCGTATCCCCACGATTTGTTTTCTGAATTCGTCTTTCGAGGTACTTCTCGCTCCAGATT
TCGCTTTCCTCCTTCCAAATCCGCTCAACAAGACCCTAATCCCTACCAGGATTCTACCCAGCACTCCCCACTGTCTACTCTTCATGATCTCCAAACTCCAGAACCCAGCA
ATCATCACAACGAATCCTTAGTATCCCCATCTTCTGAAGTCCACGAGCCTCCTATATTAACACTAGAGGATCTTCAAAATGGAAAACTACCCCGTCAATCGCCAAAAAAG
CCTTCACTCGCTCGTAGAGTCTTGTCTTTCTACCGAGAGTTCGGATTTGATAAAAAATTGTTGCAAGCAACTTCGCATTCTGTCCTGAATTCAGTACCTGCTCAAGAAGG
GACCCGTGTGGTTTCGCGTCATTTCCAAAACACAAGATCAACCCAACAAAGCAAACGAATTGTCTCACGATATTTTCAAGAATCGGTGAAGGAACGAACAGCTCATTATG
AGGATGAGAACGATGGTGGCAATCTCACAGAGCAGCCAAGTAAAAGATCAAGCAAAAGGAGGAGGAAAGACGTAACCCCCGGCTCCGATAACTCAAAAACAAATCATCAT
TCAGTGGGAAAGACTGCACGCTCTGTTCAGAAGTCGGGAACAGATACACAAGTGCGAATTGTTTCGGGCTATTTTCAGAGTTATGAAAAGAGTCTTGAAATGGATCGAGA
AGTATCACCTTCTTTACAAAATTCAAAATCAAATCAACAAGAAGAGAAAGTGGTCTCACGTTTCTTTCTAAAATCAGGGAAACAACAAGCCGTGAACAATCAGGAAGAGG
CTACAGAGCAGCTAAATCAGTGTGCGAAATCTGTTAAAAGGGTCCGTAAACCAGTCAATGAAAGGAAAGAGAAGGATAAGACAAGTTCTACTAAACCTCGGACCACTCTG
ACTGCTGCAGAGTTGTTTTTGGAAGCTTACAGAAGGAAATCGCCATATGATACATGGAAGCCTCCTACCTCTGGAACTCGCCTTCTCCAACATGATCATGCGTACGACCC
TTGGAGGGTTCTAGTCATATGTATGCTCCTCAACCGGACAAGTGGGCAACAGGCAAAAGAAGTGATACCTAAACTCTTCAGTTTGTGTCCCAATCCAAAGGCTACTTTGG
AGGTATCACGTGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTCTATAGGAAAAGATCACGAACAATGCATCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGC
CATGTCACCCAACTTCCTGGTGTCGGCTATGATGGTGGTGAATCACAAACGTGGGACTATGTATCATCGCCTAAGAAGCACAGACACTTCAATTTGGATAGCATGTTCAT
GTTGGACACTTGGCGGGCATCTATTGCACGCTTGCTAGTCCAACAAATGTATTAG
mRNA sequence
ATGGCTTCAACGACAAGCTTCAACCCTAACCTCATCCCACCGTCATCTTCTTCGTATCCCCACGATTTGTTTTCTGAATTCGTCTTTCGAGGTACTTCTCGCTCCAGATT
TCGCTTTCCTCCTTCCAAATCCGCTCAACAAGACCCTAATCCCTACCAGGATTCTACCCAGCACTCCCCACTGTCTACTCTTCATGATCTCCAAACTCCAGAACCCAGCA
ATCATCACAACGAATCCTTAGTATCCCCATCTTCTGAAGTCCACGAGCCTCCTATATTAACACTAGAGGATCTTCAAAATGGAAAACTACCCCGTCAATCGCCAAAAAAG
CCTTCACTCGCTCGTAGAGTCTTGTCTTTCTACCGAGAGTTCGGATTTGATAAAAAATTGTTGCAAGCAACTTCGCATTCTGTCCTGAATTCAGTACCTGCTCAAGAAGG
GACCCGTGTGGTTTCGCGTCATTTCCAAAACACAAGATCAACCCAACAAAGCAAACGAATTGTCTCACGATATTTTCAAGAATCGGTGAAGGAACGAACAGCTCATTATG
AGGATGAGAACGATGGTGGCAATCTCACAGAGCAGCCAAGTAAAAGATCAAGCAAAAGGAGGAGGAAAGACGTAACCCCCGGCTCCGATAACTCAAAAACAAATCATCAT
TCAGTGGGAAAGACTGCACGCTCTGTTCAGAAGTCGGGAACAGATACACAAGTGCGAATTGTTTCGGGCTATTTTCAGAGTTATGAAAAGAGTCTTGAAATGGATCGAGA
AGTATCACCTTCTTTACAAAATTCAAAATCAAATCAACAAGAAGAGAAAGTGGTCTCACGTTTCTTTCTAAAATCAGGGAAACAACAAGCCGTGAACAATCAGGAAGAGG
CTACAGAGCAGCTAAATCAGTGTGCGAAATCTGTTAAAAGGGTCCGTAAACCAGTCAATGAAAGGAAAGAGAAGGATAAGACAAGTTCTACTAAACCTCGGACCACTCTG
ACTGCTGCAGAGTTGTTTTTGGAAGCTTACAGAAGGAAATCGCCATATGATACATGGAAGCCTCCTACCTCTGGAACTCGCCTTCTCCAACATGATCATGCGTACGACCC
TTGGAGGGTTCTAGTCATATGTATGCTCCTCAACCGGACAAGTGGGCAACAGGCAAAAGAAGTGATACCTAAACTCTTCAGTTTGTGTCCCAATCCAAAGGCTACTTTGG
AGGTATCACGTGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTCTATAGGAAAAGATCACGAACAATGCATCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGC
CATGTCACCCAACTTCCTGGTGTCGGCTATGATGGTGGTGAATCACAAACGTGGGACTATGTATCATCGCCTAAGAAGCACAGACACTTCAATTTGGATAGCATGTTCAT
GTTGGACACTTGGCGGGCATCTATTGCACGCTTGCTAGTCCAACAAATGTATTAG
Protein sequence
MASTTSFNPNLIPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPSNHHNESLVSPSSEVHEPPILTLEDLQNGKLPRQSPKK
PSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRHFQNTRSTQQSKRIVSRYFQESVKERTAHYEDENDGGNLTEQPSKRSSKRRRKDVTPGSDNSKTNHH
SVGKTARSVQKSGTDTQVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKVVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKEKDKTSSTKPRTTL
TAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWS
HVTQLPGVGYDGGESQTWDYVSSPKKHRHFNLDSMFMLDTWRASIARLLVQQMY
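
As a consistency check between the CDS and protein sequences listed above, the CDS can be translated codon by codon and compared with the protein. The sketch below assumes the standard nuclear genetic code and, to stay short, uses only the first 60 nucleotides of the CDS. `translate` is a hypothetical helper written for this illustration, not a database or library function.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 nucleotides of the CDS and first 20 residues of the protein from this record.
cds_prefix = "ATGGCTTCAACGACAAGCTTCAACCCTAACCTCATCCCACCGTCATCTTCTTCGTATCCC"
protein_prefix = "MASTTSFNPNLIPPSSSSYP"

print(translate(cds_prefix) == protein_prefix)
```

To check the whole record, the wrapped CDS lines would be joined into one string before calling `translate`, and the result compared with the joined protein lines.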