; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ChyUNG229070 (gene) of Cucumber (hystrix) v1 genome

Gene IDChyUNG229070
OrganismCucumis hystrix (Cucumber (hystrix) v1)
Descriptionmethyl-CpG-binding domain protein 4-like protein
Genome locationscaffold26_size3936274:3818319..3869621
RNA-Seq ExpressionChyUNG229070
SyntenyChyUNG229070
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR045138 - Methyl-CpG binding protein MeCP2/MBD4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054277.1 methyl-CpG-binding domain protein 4-like protein [Cucumis melo var. makuwa]3.06e-28591.78Show/hide
Query:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN
        MA+TTSINPNLTPPSSSSY HDLFSEFVFRGT RSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EPNNHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQ+ +RIVSRYF+KSVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ
        SKRRRKDV P S NSKTNHHS+GKTSRSVQKS TDTR RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEE TEQLNQ
Subjt:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ

Query:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
         AKSVKRVRKPVNERKQK+KTSSTKPRT+LTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
        ATLE SREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
Subjt:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK

XP_004142362.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucumis sativus]3.37e-30596.22Show/hide
Query:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN
        MASTTSI+PNLTPPSSSSY HDLFSEFVFRGT RSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEP+NHHNESLASPSSEVHEPPILTLEDLQN
Subjt:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS
        GKLPRQSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQ+SKRIVSRYFQ+SVKERTAHYEDENDGGNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ
        SKRRRKDV PGSDNSKTNHHSVGKT+RSVQKSGTDT+VRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEE TEQLNQ
Subjt:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ

Query:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
         AKSVKR+RKPVNERK+KDKTSSTKPRT+LTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
        ATLE SREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
Subjt:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK

XP_008460559.1 PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo]2.95e-28591.78Show/hide
Query:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN
        MA+TTSINPNLTPPSSSSY HDLFSEFVFRGT RSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EPNNHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQ+ +RIVSRYF+KSVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ
        SKRRRKDV P S NSKTNHHS+GKTSRSVQKS TDTR RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEE TEQLNQ
Subjt:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ

Query:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
         AKSVKRVRKPVNERKQK+KTSSTKPRT+LTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
        ATLE SREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
Subjt:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK

XP_031741432.1 methyl-CpG-binding domain protein 4 isoform X2 [Cucumis sativus]8.17e-30596.21Show/hide
Query:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN
        MASTTSI+PNLTPPSSSSY HDLFSEFVFRGT RSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEP+NHHNESLASPSSEVHEPPILTLEDLQN
Subjt:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS
        GKLPRQSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQ+SKRIVSRYFQ+SVKERTAHYEDENDGGNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ
        SKRRRKDV PGSDNSKTNHHSVGKT+RSVQKSGTDT+VRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEE TEQLNQ
Subjt:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ

Query:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
         AKSVKR+RKPVNERK+KDKTSSTKPRT+LTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVG
        ATLE SREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVG
Subjt:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVG

XP_038892490.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Benincasa hispida]2.61e-19667.3Show/hide
Query:  ASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQ--------HSPLSTLHDLQTPEPNNHHNESLA------------
        A+T SIN NLTPPSSSSY  DLFS+F FRG+ RSR    PSKS+QQ+P   QD TQ        HSP++T  DLQ  EP NH N+SL+            
Subjt:  ASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQ--------HSPLSTLHDLQTPEPNNHHNESLA------------

Query:  --SPSSEVHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKER
          SPSS+V+EPPILTLEDLQN K   Q PKKP LARR+L+FYREFGFD+K+ Q TSHSVLNS P QEG R+ SRYFQNS+STQ+ +R VSRYFQKSVK+R
Subjt:  --SPSSEVHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKER

Query:  TAHYEDENDGGNLTEQPSKRSSKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFF
         AH EDE++  NLTEQPSKRSSKRRRKDV P SDNSKTN HS+GK SRS+QKSGTD RVRIVS YFQ+ EK++E+DR                       
Subjt:  TAHYEDENDGGNLTEQPSKRSSKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFF

Query:  LKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNR
                     E T+Q+NQ AKS KRVRKPVNERKQ+DKTSS+KPRT+LTAAEL LEAYRRKS  DTWKPP SG RLLQ DHAYDPWRVLVICMLLNR
Subjt:  LKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNR

Query:  TSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
        TSGQQAKEVIPKLF LCPNPKATL+ S+EQIEDIIRPLG  RKRSRTM  LSEMYLKE+WSHVTQLPGVGK
Subjt:  TSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK

TrEMBL top hitse value%identityAlignment
A0A0A0KRW9 ENDO3c domain-containing protein4.6e-24196.22Show/hide
Query:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN
        MASTTSI+PNLTPPSSSSY HDLFSEFVFRGT RSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEP+NHHNESLASPSSEVHEPPILTLEDLQN
Subjt:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS
        GKLPRQSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQ+SKRIVSRYFQ+SVKERTAHYEDENDGGNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ
        SKRRRKDV PGSDNSKTNHHSVGKT+RSVQKSGTDT+VRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEE TEQLNQ
Subjt:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ

Query:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
         AKSVKR+RKPVNERK+KDKTSSTKPRT+LTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
        ATLE SREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
Subjt:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK

A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein7.2e-22691.78Show/hide
Query:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN
        MA+TTSINPNLTPPSSSSY HDLFSEFVFRGT RSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EPNNHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQ+ +RIVSRYF+KSVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ
        SKRRRKDV P S NSKTNHHS+GKTSRSVQKS TDTR RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEE TEQLNQ
Subjt:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ

Query:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
         AKSVKRVRKPVNERKQK+KTSSTKPRT+LTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
        ATLE SREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
Subjt:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein7.2e-22691.78Show/hide
Query:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN
        MA+TTSINPNLTPPSSSSY HDLFSEFVFRGT RSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EPNNHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQN

Query:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS
        GKLP QSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQ+ +RIVSRYF+KSVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ
        SKRRRKDV P S NSKTNHHS+GKTSRSVQKS TDTR RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEE TEQLNQ
Subjt:  SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQ

Query:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
         AKSVKRVRKPVNERKQK+KTSSTKPRT+LTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  LAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
        ATLE SREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
Subjt:  ATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein6.7e-13959.09Show/hide
Query:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSK----SAQQDPNPYQDST--------QHSPLSTLHDLQTPEPNNH-------------
        M +TT +NPNL+PPSSSS+   LFS+F F+G   SRFRFPPSK    S +Q+P P +D T        Q+SP+STL  LQT E N+              
Subjt:  MASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSK----SAQQDPNPYQDST--------QHSPLSTLHDLQTPEPNNH-------------

Query:  -------------------HNESLASPSSE-----VHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVV
                              S  +P+SE     VHEPPILTLED+QN K   Q   +P LARRVL FYR+FGFD++++Q T  SV NS+P Q   RVV
Subjt:  -------------------HNESLASPSSE-----VHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVV

Query:  SRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGG--NLTEQPSKRS-----SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGY
        SR+FQ S+S Q+ +RIVSRYFQ S  ER AH EDE++    N+T+QP KRS      KRRRKDVA  SDNSK    S+ K+SR V++SGTD RVR VS Y
Subjt:  SRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGG--NLTEQPSKRS-----SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGY

Query:  FQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKS
        FQ+ EK+ E++ EVSP LQNSK+ QQ E++VSRFF KS +Q+ VNNQ+EV +  +Q AKSVKR+RKP  ERK +DK S+ +PRT+L+A ELFLEAYRRKS
Subjt:  FQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKS

Query:  PYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQ
          DTWKPP SG RLLQ DHAYDPWRVLVICMLLNRT+GQQAKEVIPKLF+LCP+PK+ LE S+EQIEDIIRPLG  RKRS T+ RLSEMYLKESWSHVTQ
Subjt:  PYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQ

Query:  LPGVGK
        LPGVGK
Subjt:  LPGVGK

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X12.8e-12966.08Show/hide
Query:  TLHDLQTPEPNNHHNESLASPSSE-----VHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQN
        T+ D+Q   P         +P+SE      HEPPILTLEDLQN K   Q   KP LARRVL F R+FGFD++++Q T  SV NS+P Q   RVVSR+FQ 
Subjt:  TLHDLQTPEPNNHHNESLASPSSE-----VHEPPILTLEDLQNGKLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQN

Query:  SRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS-----SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSL
        S+S Q+ +RIVSRYFQ S  ER AH EDE+D  N+T+QP KRS      KRRRKDVA  SDNSK    S+ K+SRS++KSGTD RVRIVS YFQ+ EK+ 
Subjt:  SRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRS-----SKRRRKDVAPGSDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSL

Query:  EMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPP
        E++ EVSPSLQNSK+NQQEE++VSRFF KS + + VNNQ+EV +  +Q AKSVKR+RKP  ERK +DK S+ KPRT+L+A ELFLEAYRRKS  DTWKPP
Subjt:  EMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPP

Query:  TSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
         SG RLLQ DHAYDPWRVLVICMLLNRT+GQQAKEVIPKLF+LCP+PK+ LE S+EQIEDIIRPLG  RKRS T+ RLSEMYLKESWSHVTQLPGVGK
Subjt:  TSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK

SwissProt top hitse value%identityAlignment
O95243 Methyl-CpG-binding domain protein 43.3e-1025.09Show/hide
Query:  QKSVKERTAHYEDENDGGNLTEQPSKRSSKRRRKDVAPGSDNSKTNHHSVGKTSRSVQK--SGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQ
        QKS  +RT    D    G      S+ +S  ++K+    S +S +N  S  KTS  + K  S  D      S + + YE +     E+   ++      +
Subjt:  QKSVKERTAHYEDENDGGNLTEQPSKRSSKRRRKDVAPGSDNSKTNHHSVGKTSRSVQK--SGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQ

Query:  EEKMVSRFFLKSGKQQAVN---NQEEVTEQLNQLAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDP
         ++ +    LK G +   N    +++ T +      ++ R +  +  RK     SS   + +L+          R+  +  W PP S   L+Q    +DP
Subjt:  EEKMVSRFFLKSGKQQAVN---NQEEVTEQLNQLAKSVKRVRKPVNERKQKDKTSSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDP

Query:  WRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
        W++L+  + LNRTSG+ A  V+ K     P+ +    A    + ++++PLG Y  R++T+ + S+ YL + W +  +L G+GK
Subjt:  WRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK

Q0IGK1 Methyl-CpG-binding domain protein 4-like protein1.8e-3239.41Show/hide
Query:  SGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVN------------ERKQKD
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +  V+EQ NQ  K ++   K V                QK+
Subjt:  SGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVN------------ERKQKD

Query:  KTSSTKPRTSLTAAELFL-----EAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDII
        K+ + + +T + +  L L     + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K   E   E+IE++I
Subjt:  KTSSTKPRTSLTAAELFL-----EAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDII

Query:  RPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
        +PLG  +KR++ + RLS  YL+ESW+HVTQL GVGK
Subjt:  RPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK

Q9YDP0 Thymine-DNA glycosylase1.5e-0730.28Show/hide
Query:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMY-------LKESWSHVTQLPG
        G + L   +  DPW +LV   LL +T+ +Q   V  +     PNPKA   A  +++ ++IRPLG   +R++ +  L++         +  S   + +LPG
Subjt:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIRPLGFYRKRSRTMHRLSEMY-------LKESWSHVTQLPG

Query:  VGKLKQIEV
        VG     EV
Subjt:  VGKLKQIEV

Q9Z2D7 Methyl-CpG-binding domain protein 49.9e-1532.59Show/hide
Query:  KTSSTKPRTSLTAAELF----LEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIR
        +T   K +TSL  +  +    L   RRKS +  W PP S   L+Q    +DPW++L+  + LNRTSG+ A  V+ +     P+ +    A    + ++++
Subjt:  KTSSTKPRTSLTAAELF----LEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIR

Query:  PLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
        PLG Y  R++T+ + S+ YL + W +  +L G+GK
Subjt:  PLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK

Arabidopsis top hitse value%identityAlignment
AT3G07930.1 DNA glycosylase superfamily protein3.1e-1134.13Show/hide
Query:  SGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVN------------ERKQKD
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +  V+EQ NQ  K ++   K V                QK+
Subjt:  SGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVN------------ERKQKD

Query:  KTSSTKPRTSLTAAELFL-----EAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTS
        K+ + + +T + +  L L     + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TS
Subjt:  KTSSTKPRTSLTAAELFL-----EAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTS

AT3G07930.2 DNA glycosylase superfamily protein1.6e-1234.71Show/hide
Query:  SGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVN------------ERKQKD
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +  V+EQ NQ  K ++   K V                QK+
Subjt:  SGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVN------------ERKQKD

Query:  KTSSTKPRTSLTAAELFL-----EAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQ
        K+ + + +T + +  L L     + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q
Subjt:  KTSSTKPRTSLTAAELFL-----EAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQ

AT3G07930.3 DNA glycosylase superfamily protein1.3e-3339.41Show/hide
Query:  SGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVN------------ERKQKD
        S    +V  VS YFQ+   S + D ++   + +S+S +   K  S+  +K  +      +  V+EQ NQ  K ++   K V                QK+
Subjt:  SGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVN------------ERKQKD

Query:  KTSSTKPRTSLTAAELFL-----EAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDII
        K+ + + +T + +  L L     + Y RK+P +TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K   E   E+IE++I
Subjt:  KTSSTKPRTSLTAAELFL-----EAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDII

Query:  RPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK
        +PLG  +KR++ + RLS  YL+ESW+HVTQL GVGK
Subjt:  RPLGFYRKRSRTMHRLSEMYLKESWSHVTQLPGVGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCAGGTTCAACGAATCCCGCCATGGCTTCAACGACAAGCATCAACCCTAACCTCACCCCACCATCATCTTCTTCGTATCACCACGATTTGTTTTCCGAATTCGT
CTTTCGAGGTACTTTTCGCTCCAGATTTCGCTTTCCTCCTTCCAAATCCGCTCAACAAGACCCTAATCCCTACCAGGATTCTACCCAGCACTCCCCACTTTCTACTCTTC
ATGATCTCCAAACTCCAGAACCCAACAATCATCACAACGAATCCTTAGCATCCCCATCTTCTGAAGTCCACGAGCCTCCTATATTAACACTAGAGGATCTTCAAAATGGA
AAACTACCCCGTCAATCGCCAAAAAAGCCTTCACTCGCTCGTAGAGTCTTGTCTTTCTACCGAGAGTTCGGATTTGATAAAAAATTGTTGCAAGCAACTTCGCATTCTGT
CCTGAATTCAGTACCTGCTCAAGAAGGGACCCGTGTGGTTTCGCGTTATTTCCAAAACTCAAGATCAACTCAAAAAAGCAAACGAATTGTCTCACGATATTTTCAAAAAT
CGGTGAAGGAACGAACAGCTCATTATGAGGATGAGAACGATGGTGGCAATCTCACAGAGCAGCCAAGTAAAAGATCAAGCAAAAGGAGGAGGAAAGACGTAGCCCCCGGC
TCCGATAACTCAAAAACAAATCATCATTCAGTGGGAAAAACTTCACGGTCTGTTCAGAAGTCGGGAACAGATACACGAGTGCGAATTGTTTCGGGCTATTTTCAAAGTTA
TGAAAAGAGTCTTGAAATGGATCGAGAAGTATCACCTTCTTTACAAAATTCAAAATCAAATCAACAAGAAGAGAAAATGGTCTCACGTTTCTTTCTAAAATCAGGGAAAC
AACAAGCCGTGAACAATCAGGAAGAGGTTACAGAGCAGCTAAATCAGCTTGCGAAATCTGTTAAAAGGGTCCGTAAACCAGTCAATGAAAGGAAACAGAAGGATAAGACA
AGTTCTACTAAACCTCGGACCAGTCTGACTGCTGCAGAGTTGTTTTTGGAAGCTTACAGAAGGAAATCGCCATATGATACATGGAAGCCCCCTACCTCTGGAACTCGCCT
TCTCCAACATGATCATGCGTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTCAACCGGACAAGTGGGCAACAGGCAAAAGAAGTGATACCTAAACTCTTCAGTT
TGTGTCCCAATCCAAAGGCTACTTTGGAGGCATCACGTGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTCTATAGGAAAAGATCACGAACAATGCATCGTTTATCT
GAGATGTATTTAAAAGAAAGTTGGAGCCATGTCACCCAACTTCCTGGTGTCGGCAAGCTTAAGCAAATAGAAGTTCACAGAAAAGACAACAAGTACAGGGAAATCGAAAT
AACAAGCGAATTTTCAGCACCATTTGAGAGAGTGGACCCCACAACTGATTCCTCAGTTCACTTCCCAAACCTCATCCGTGTCCTGTGTTGCTCAGCTGCTGTAGGTTGGT
TTCGCCTGACAAGCATAGAAAAAACCGCACTCGAAGGAACTGGCATGCCTAAAAAGCCCGTCTTCCTGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCAGGTTCAACGAATCCCGCCATGGCTTCAACGACAAGCATCAACCCTAACCTCACCCCACCATCATCTTCTTCGTATCACCACGATTTGTTTTCCGAATTCGT
CTTTCGAGGTACTTTTCGCTCCAGATTTCGCTTTCCTCCTTCCAAATCCGCTCAACAAGACCCTAATCCCTACCAGGATTCTACCCAGCACTCCCCACTTTCTACTCTTC
ATGATCTCCAAACTCCAGAACCCAACAATCATCACAACGAATCCTTAGCATCCCCATCTTCTGAAGTCCACGAGCCTCCTATATTAACACTAGAGGATCTTCAAAATGGA
AAACTACCCCGTCAATCGCCAAAAAAGCCTTCACTCGCTCGTAGAGTCTTGTCTTTCTACCGAGAGTTCGGATTTGATAAAAAATTGTTGCAAGCAACTTCGCATTCTGT
CCTGAATTCAGTACCTGCTCAAGAAGGGACCCGTGTGGTTTCGCGTTATTTCCAAAACTCAAGATCAACTCAAAAAAGCAAACGAATTGTCTCACGATATTTTCAAAAAT
CGGTGAAGGAACGAACAGCTCATTATGAGGATGAGAACGATGGTGGCAATCTCACAGAGCAGCCAAGTAAAAGATCAAGCAAAAGGAGGAGGAAAGACGTAGCCCCCGGC
TCCGATAACTCAAAAACAAATCATCATTCAGTGGGAAAAACTTCACGGTCTGTTCAGAAGTCGGGAACAGATACACGAGTGCGAATTGTTTCGGGCTATTTTCAAAGTTA
TGAAAAGAGTCTTGAAATGGATCGAGAAGTATCACCTTCTTTACAAAATTCAAAATCAAATCAACAAGAAGAGAAAATGGTCTCACGTTTCTTTCTAAAATCAGGGAAAC
AACAAGCCGTGAACAATCAGGAAGAGGTTACAGAGCAGCTAAATCAGCTTGCGAAATCTGTTAAAAGGGTCCGTAAACCAGTCAATGAAAGGAAACAGAAGGATAAGACA
AGTTCTACTAAACCTCGGACCAGTCTGACTGCTGCAGAGTTGTTTTTGGAAGCTTACAGAAGGAAATCGCCATATGATACATGGAAGCCCCCTACCTCTGGAACTCGCCT
TCTCCAACATGATCATGCGTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTCAACCGGACAAGTGGGCAACAGGCAAAAGAAGTGATACCTAAACTCTTCAGTT
TGTGTCCCAATCCAAAGGCTACTTTGGAGGCATCACGTGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTCTATAGGAAAAGATCACGAACAATGCATCGTTTATCT
GAGATGTATTTAAAAGAAAGTTGGAGCCATGTCACCCAACTTCCTGGTGTCGGCAAGCTTAAGCAAATAGAAGTTCACAGAAAAGACAACAAGTACAGGGAAATCGAAAT
AACAAGCGAATTTTCAGCACCATTTGAGAGAGTGGACCCCACAACTGATTCCTCAGTTCACTTCCCAAACCTCATCCGTGTCCTGTGTTGCTCAGCTGCTGTAGGTTGGT
TTCGCCTGACAAGCATAGAAAAAACCGCACTCGAAGGAACTGGCATGCCTAAAAAGCCCGTCTTCCTGGAATAG
Protein sequenceShow/hide protein sequence
MASGSTNPAMASTTSINPNLTPPSSSSYHHDLFSEFVFRGTFRSRFRFPPSKSAQQDPNPYQDSTQHSPLSTLHDLQTPEPNNHHNESLASPSSEVHEPPILTLEDLQNG
KLPRQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSVPAQEGTRVVSRYFQNSRSTQKSKRIVSRYFQKSVKERTAHYEDENDGGNLTEQPSKRSSKRRRKDVAPG
SDNSKTNHHSVGKTSRSVQKSGTDTRVRIVSGYFQSYEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEVTEQLNQLAKSVKRVRKPVNERKQKDKT
SSTKPRTSLTAAELFLEAYRRKSPYDTWKPPTSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEASREQIEDIIRPLGFYRKRSRTMHRLS
EMYLKESWSHVTQLPGVGKLKQIEVHRKDNKYREIEITSEFSAPFERVDPTTDSSVHFPNLIRVLCCSAAVGWFRLTSIEKTALEGTGMPKKPVFLE