; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0028387 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0028387
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Descriptionmethyl-CpG-binding domain protein 4-like protein
Genome locationchr10:434306..439119
RNA-Seq ExpressionPI0028387
SyntenyPI0028387
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR045138 - Methyl-CpG binding protein MeCP2/MBD4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054277.1 methyl-CpG-binding domain protein 4-like protein [Cucumis melo var. makuwa]9.7e-23792.47Show/hide
Query:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---
        MA TTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKS  QNPNPYQDSTQHSP+STLYD QTSEPNNHHNKSLASPSSEA EPPILTLED   
Subjt:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---

Query:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS
                  PSLARRVL FYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYF+KSVKERAAHYEDEND GNLTEQPSKRS
Subjt:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV PS  NSKTNHHSMGK SRSVQKS TDTR RIVSGYFQ SEK LEMDRE+SPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQK+KTSS KPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS

XP_004142362.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucumis sativus]2.5e-24589.96Show/hide
Query:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---
        MA TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKS QQ+PNPYQDSTQHSPLSTL+D QT EP+NHHN+SLASPSSE  EPPILTLED   
Subjt:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---

Query:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS
                  PSLARRVL FYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYFQ+SVKER AHYEDENDGGNLTEQPSKRS
Subjt:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P  DNSKTNHHS+GK +RSVQKSGTDT+VRIVSGYFQ+ EK LEMDRE+SPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERK+KDKTSS KPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

XP_008460559.1 PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo]1.6e-25293.03Show/hide
Query:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---
        MA TTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKS  QNPNPYQDSTQHSP+STLYD QTSEPNNHHNKSLASPSSEA EPPILTLED   
Subjt:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---

Query:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS
                  PSLARRVL FYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYF+KSVKERAAHYEDEND GNLTEQPSKRS
Subjt:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV PS  NSKTNHHSMGK SRSVQKS TDTR RIVSGYFQ SEK LEMDRE+SPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQK+KTSS KPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

XP_031741432.1 methyl-CpG-binding domain protein 4 isoform X2 [Cucumis sativus]1.2e-21889.09Show/hide
Query:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---
        MA TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKS QQ+PNPYQDSTQHSPLSTL+D QT EP+NHHN+SLASPSSE  EPPILTLED   
Subjt:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---

Query:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS
                  PSLARRVL FYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYFQ+SVKER AHYEDENDGGNLTEQPSKRS
Subjt:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P  DNSKTNHHS+GK +RSVQKSGTDT+VRIVSGYFQ+ EK LEMDRE+SPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERK+KDKTSS KPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVG
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVG
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVG

XP_038892490.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Benincasa hispida]3.3e-18470.53Show/hide
Query:  AVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDST--------QHSPLSTLYDFQTSEPNNHHNKSL-------------
        A T SIN NLTPPSSSSYP DLFS+F FRG+SRSR    PSKS+QQNP   QD T        QHSP++T  D Q SEP NH NKSL             
Subjt:  AVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDST--------QHSPLSTLYDFQTSEPNNHHNKSL-------------

Query:  -ASPSSEALEPPILTLED-------------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKER
         +SPSS+  EPPILTLED             P LARR+L FYREFGFD+K+ Q TSHSVLNSEPVQEG R+ SRYFQNS+STQQ ER VSRYFQKSVK+R
Subjt:  -ASPSSEALEPPILTLED-------------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKER

Query:  AAHYEDENDGGNLTEQPSKRSSKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFF
         AH EDE++  NLTEQPSKRSSKRRRKDV PS DNSKTN HSMGK SRS+QKSGTD RVRIVS YFQNSEK +E+DR                       
Subjt:  AAHYEDENDGGNLTEQPSKRSSKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFF

Query:  LKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNR
                     EAT+Q+NQ AKS KRVRKPVNERKQ+DKTSS+KPRTTLTAAEL LEAYRRKS DDTWKPPPSG RLLQ DHAYDPWRVLVICMLLNR
Subjt:  LKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNR

Query:  TSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWD
        TSGQQAKEVIPKLF LCPNPKATL+VS+EQIEDIIRPLGL RKRSRTM  LSEMYLKE+WSHVTQLPGVGKYGADAHAIFCTGYW+EV+PKDHMLNYYW+
Subjt:  TSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWD

Query:  FLHSIKHLL
        FLHSI+HLL
Subjt:  FLHSIKHLL

TrEMBL top hitse value%identityAlignment
A0A0A0KRW9 ENDO3c domain-containing protein1.0e-22087.83Show/hide
Query:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---
        MA TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKS QQ+PNPYQDSTQHSPLSTL+D QT EP+NHHN+SLASPSSE  EPPILTLED   
Subjt:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---

Query:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS
                  PSLARRVL FYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYFQ+SVKER AHYEDENDGGNLTEQPSKRS
Subjt:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P  DNSKTNHHS+GK +RSVQKSGTDT+VRIVSGYFQ+ EK LEMDRE+SPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERK+KDKTSS KPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFC
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY A    + C
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFC

A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein8.0e-25393.03Show/hide
Query:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---
        MA TTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKS  QNPNPYQDSTQHSP+STLYD QTSEPNNHHNKSLASPSSEA EPPILTLED   
Subjt:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---

Query:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS
                  PSLARRVL FYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYF+KSVKERAAHYEDEND GNLTEQPSKRS
Subjt:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV PS  NSKTNHHSMGK SRSVQKS TDTR RIVSGYFQ SEK LEMDRE+SPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQK+KTSS KPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein4.7e-23792.47Show/hide
Query:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---
        MA TTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKS  QNPNPYQDSTQHSP+STLYD QTSEPNNHHNKSLASPSSEA EPPILTLED   
Subjt:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLED---

Query:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS
                  PSLARRVL FYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYF+KSVKERAAHYEDEND GNLTEQPSKRS
Subjt:  ----------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRS

Query:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV PS  NSKTNHHSMGK SRSVQKS TDTR RIVSGYFQ SEK LEMDRE+SPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQK+KTSS KPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein4.3e-16663Show/hide
Query:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSK----STQQNPNPYQDST--------QHSPLSTLYDFQTSEPNNHHNKSLA-------
        M  TT +NPNL+PPSSSS+P  LFS+F F+G S SRFRFPPSK    S +QNP P +D T        Q+SP+STL   QTSE N  H K+ A       
Subjt:  MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSK----STQQNPNPYQDST--------QHSPLSTLYDFQTSEPNNHHNKSLA-------

Query:  ---------------------------SPSSE-----ALEPPILTLED-------------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTR
                                   +P+SE       EPPILTLED             P LARRVL FYR+FGFD++++Q T  SV NS PVQ   R
Subjt:  ---------------------------SPSSE-----ALEPPILTLED-------------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTR

Query:  VVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGG--NLTEQPSKRS-----SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVS
        VVSR+FQ S+S QQ ERIVSRYFQ S  ERAAH EDE++    N+T+QP KRS      KRRRKDVA S DNSK    S+ K SR V++SGTD RVR VS
Subjt:  VVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGG--NLTEQPSKRS-----SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVS

Query:  GYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRR
         YFQNSEK  E++ E+SP LQNSK+ QQ E++VSRFF KS +Q+ VNNQ+E  +  +QCAKSVKR+RKP  ERK +DK  SA+PRTTL+A ELFLEAYRR
Subjt:  GYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRR

Query:  KSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV
        KS DDTWKPPPSG RLLQ DHAYDPWRVLVICMLLNRT+GQQAKEVIPKLF+LCP+PK+ LEVS+EQIEDIIRPLGL RKRS T+ RLSEMYLKESWSHV
Subjt:  KSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV

Query:  TQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        TQLPGVGKYGADAHAIFCTGYW+EV PKDHMLNYYW+FLHSIKHLL
Subjt:  TQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X11.4e-15371.7Show/hide
Query:  SPSSE-----ALEPPILTLED-------------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSV
        +P+SE     A EPPILTLED             P LARRVL F R+FGFD++++Q T  SV NS PVQ   RVVSR+FQ S+S QQ ERIVSRYFQ S 
Subjt:  SPSSE-----ALEPPILTLED-------------PSLARRVLCFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSV

Query:  KERAAHYEDENDGGNLTEQPSKRS-----SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQE
         ERAAH EDE+D  N+T+QP KRS      KRRRKDVA S DNSK    S+ K SRS++KSGTD RVRIVS YFQNSEK  E++ E+SPSLQNSK+NQQE
Subjt:  KERAAHYEDENDGGNLTEQPSKRS-----SKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSGTDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQE

Query:  EKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVL
        E++VSRFF KS + + VNNQ+E  +  +QCAKSVKR+RKP  ERK +DK  SAKPRTTL+A ELFLEAYRRKS DDTWKPPPSG RLLQ DHAYDPWRVL
Subjt:  EKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVL

Query:  VICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKD
        VICMLLNRT+GQQAKEVIPKLF+LCP+PK+ LEVS+EQIEDIIRPLGL RKRS T+ RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+EV PKD
Subjt:  VICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKD

Query:  HMLNYYWDFLHSIKHLL
        HMLNYYW+FLHSIKHLL
Subjt:  HMLNYYWDFLHSIKHLL

SwissProt top hitse value%identityAlignment
O95243 Methyl-CpG-binding domain protein 48.5e-2637.14Show/hide
Query:  RKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSH
        R+     W PP S   L+Q    +DPW++L+  + LNRTSG+ A  V+ K     P+ +         + ++++PLGLY  R++T+ + S+ YL + W +
Subjt:  RKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSH

Query:  VTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
          +L G+GKYG D++ IFC   W +V P+DH LN Y D+L
Subjt:  VTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL

Q0IGK1 Methyl-CpG-binding domain protein 4-like protein2.9e-5042.57Show/hide
Query:  VAPSYDNSKTNHHSM-GKPSRSV-QKSG---TDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCA
        V+P +  S  +  S  G  S SV  K G      +V  VS YFQ S    + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  
Subjt:  VAPSYDNSKTNHHSM-GKPSRSV-QKSG---TDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCA

Query:  KSVKRVRKPVN------------ERKQKDKTSSAKPRTTLTAAELFL-----EAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQA
        K ++   K V                QK+K+ + + +T + +  L L     + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q 
Subjt:  KSVKRVRKPVN------------ERKQKDKTSSAKPRTTLTAAELFL-----EAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQA

Query:  KEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
        + VI  LF LC + K   EV  E+IE++I+PLGL +KR++ + RLS  YL+ESW+HVTQL GVGKY ADA+AIFC G W  V+P DHMLNYYWD+L
Subjt:  KEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL

Q7LX22 Thymine/uracil-DNA glycosylase1.0e-0732.08Show/hide
Query:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV-------TQLPG
        G + L    A DPW VLV  +LL +T+ +Q  ++  +     P+P    + S E+I+ II+PLG+   R+  + +LSE  ++     +         LPG
Subjt:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV-------TQLPG

Query:  VGKYGA
        VG Y A
Subjt:  VGKYGA

Q9YDP0 Thymine-DNA glycosylase6.1e-0829.81Show/hide
Query:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMY-------LKESWSHVTQLPG
        G + L   +  DPW +LV   LL +T+ +Q   V  +     PNPKA      +++ ++IRPLG+  +R++ +  L++         +  S   + +LPG
Subjt:  GTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMY-------LKESWSHVTQLPG

Query:  VGKY
        VG Y
Subjt:  VGKY

Q9Z2D7 Methyl-CpG-binding domain protein 43.6e-2429.13Show/hide
Query:  VRIVSG---YFQNSEKCLEMDRE---LSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKDKTSSAKPRTTLT
        +++ SG    F ++E   E +RE   L      SK +++ E  +    L+ G             ++  C+++ K       +     +T   K +T+L 
Subjt:  VRIVSG---YFQNSEKCLEMDRE---LSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKDKTSSAKPRTTLT

Query:  AAELF----LEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTM
         +  +    L   RRKS    W PP S   L+Q    +DPW++L+  + LNRTSG+ A  V+ +     P+ +         + ++++PLGLY  R++T+
Subjt:  AAELF----LEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTM

Query:  HRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
         + S+ YL + W +  +L G+GKYG D++ IFC   W +V P+DH LN Y D+L
Subjt:  HRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL

Arabidopsis top hitse value%identityAlignment
AT3G07930.1 DNA glycosylase superfamily protein9.3e-1233.67Show/hide
Query:  VAPSYDNSKTNHHSM-GKPSRSV-QKSG---TDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCA
        V+P +  S  +  S  G  S SV  K G      +V  VS YFQ S    + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  
Subjt:  VAPSYDNSKTNHHSM-GKPSRSV-QKSG---TDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCA

Query:  KSVKRVRKPVN------------ERKQKDKTSSAKPRTTLTAAELFL-----EAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTS
        K ++   K V                QK+K+ + + +T + +  L L     + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TS
Subjt:  KSVKRVRKPVN------------ERKQKDKTSSAKPRTTLTAAELFL-----EAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTS

AT3G07930.2 DNA glycosylase superfamily protein4.9e-1334.17Show/hide
Query:  VAPSYDNSKTNHHSM-GKPSRSV-QKSG---TDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCA
        V+P +  S  +  S  G  S SV  K G      +V  VS YFQ S    + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  
Subjt:  VAPSYDNSKTNHHSM-GKPSRSV-QKSG---TDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCA

Query:  KSVKRVRKPVN------------ERKQKDKTSSAKPRTTLTAAELFL-----EAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQ
        K ++   K V                QK+K+ + + +T + +  L L     + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q
Subjt:  KSVKRVRKPVN------------ERKQKDKTSSAKPRTTLTAAELFL-----EAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQ

AT3G07930.3 DNA glycosylase superfamily protein2.1e-5142.57Show/hide
Query:  VAPSYDNSKTNHHSM-GKPSRSV-QKSG---TDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCA
        V+P +  S  +  S  G  S SV  K G      +V  VS YFQ S    + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  
Subjt:  VAPSYDNSKTNHHSM-GKPSRSV-QKSG---TDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCA

Query:  KSVKRVRKPVN------------ERKQKDKTSSAKPRTTLTAAELFL-----EAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQA
        K ++   K V                QK+K+ + + +T + +  L L     + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q 
Subjt:  KSVKRVRKPVN------------ERKQKDKTSSAKPRTTLTAAELFL-----EAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQA

Query:  KEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
        + VI  LF LC + K   EV  E+IE++I+PLGL +KR++ + RLS  YL+ESW+HVTQL GVGKY ADA+AIFC G W  V+P DHMLNYYWD+L
Subjt:  KEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGTAACAACAAGCATCAACCCTAACCTCACCCCACCATCCTCTTCTTCGTATCCCCACGATTTGTTTTCCGAATTCGTCTTTCGAGGTACTTCTCGCTCCAGATT
TCGCTTTCCTCCTTCCAAATCGACTCAACAAAACCCTAATCCCTACCAGGATTCTACCCAGCACTCTCCACTTTCTACTCTTTATGATTTCCAAACTTCAGAACCCAACA
ATCATCACAACAAATCCTTAGCATCCCCATCTTCTGAAGCCCTCGAGCCTCCTATATTAACACTAGAGGATCCTTCACTCGCTCGTAGAGTCTTGTGTTTTTACCGAGAG
TTCGGATTTGATAAAAAATTGTTGCAAGCAACTTCGCATTCTGTCCTGAATTCAGAACCTGTTCAAGAAGGGACCCGTGTGGTTTCGCGTTATTTCCAAAACTCAAGATC
AACCCAACAACGCGAACGAATTGTCTCACGATACTTTCAAAAATCGGTGAAGGAACGAGCAGCCCATTATGAGGATGAGAATGATGGTGGCAATCTAACAGAGCAGCCAA
GTAAAAGATCAAGCAAAAGGAGGAGGAAAGACGTAGCCCCCAGCTACGATAACTCAAAAACAAATCACCATTCAATGGGAAAGCCTTCACGCTCTGTTCAGAAGTCGGGA
ACAGATACACGAGTGCGAATTGTTTCGGGCTATTTTCAAAATTCTGAAAAGTGTCTTGAAATGGATCGAGAACTTTCACCTTCTTTACAAAATTCAAAATCAAATCAACA
AGAAGAGAAAATGGTCTCACGTTTCTTTCTAAAATCAGGGAAACAACAAGCCGTGAACAATCAGGAAGAGGCTACAGAGCAGCTAAATCAGTGTGCGAAATCTGTTAAAA
GGGTCCGTAAACCAGTCAATGAAAGGAAACAGAAGGATAAGACAAGTTCTGCTAAACCTCGGACCACTCTTACTGCTGCAGAGTTGTTTTTGGAAGCTTACAGAAGGAAA
TCGCCAGATGATACATGGAAGCCTCCTCCCTCTGGAACTCGCCTTCTCCAACATGATCATGCGTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTCAACCGGAC
AAGTGGGCAACAGGCAAAAGAAGTGATACCTAAACTCTTCAGTTTGTGTCCCAATCCAAAGGCTACGTTGGAGGTATCACGTGAGCAGATAGAAGATATCATTCGACCTC
TTGGTTTATATAGGAAAAGATCACGAACAATGCATCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGCCATGTCACCCAGCTTCCTGGTGTCGGCAAGTATGGTGCT
GATGCACATGCGATATTCTGCACTGGATATTGGAGTGAAGTAGAACCTAAAGATCACATGCTTAATTATTACTGGGATTTTCTCCACAGCATCAAACACCTGCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGTAACAACAAGCATCAACCCTAACCTCACCCCACCATCCTCTTCTTCGTATCCCCACGATTTGTTTTCCGAATTCGTCTTTCGAGGTACTTCTCGCTCCAGATT
TCGCTTTCCTCCTTCCAAATCGACTCAACAAAACCCTAATCCCTACCAGGATTCTACCCAGCACTCTCCACTTTCTACTCTTTATGATTTCCAAACTTCAGAACCCAACA
ATCATCACAACAAATCCTTAGCATCCCCATCTTCTGAAGCCCTCGAGCCTCCTATATTAACACTAGAGGATCCTTCACTCGCTCGTAGAGTCTTGTGTTTTTACCGAGAG
TTCGGATTTGATAAAAAATTGTTGCAAGCAACTTCGCATTCTGTCCTGAATTCAGAACCTGTTCAAGAAGGGACCCGTGTGGTTTCGCGTTATTTCCAAAACTCAAGATC
AACCCAACAACGCGAACGAATTGTCTCACGATACTTTCAAAAATCGGTGAAGGAACGAGCAGCCCATTATGAGGATGAGAATGATGGTGGCAATCTAACAGAGCAGCCAA
GTAAAAGATCAAGCAAAAGGAGGAGGAAAGACGTAGCCCCCAGCTACGATAACTCAAAAACAAATCACCATTCAATGGGAAAGCCTTCACGCTCTGTTCAGAAGTCGGGA
ACAGATACACGAGTGCGAATTGTTTCGGGCTATTTTCAAAATTCTGAAAAGTGTCTTGAAATGGATCGAGAACTTTCACCTTCTTTACAAAATTCAAAATCAAATCAACA
AGAAGAGAAAATGGTCTCACGTTTCTTTCTAAAATCAGGGAAACAACAAGCCGTGAACAATCAGGAAGAGGCTACAGAGCAGCTAAATCAGTGTGCGAAATCTGTTAAAA
GGGTCCGTAAACCAGTCAATGAAAGGAAACAGAAGGATAAGACAAGTTCTGCTAAACCTCGGACCACTCTTACTGCTGCAGAGTTGTTTTTGGAAGCTTACAGAAGGAAA
TCGCCAGATGATACATGGAAGCCTCCTCCCTCTGGAACTCGCCTTCTCCAACATGATCATGCGTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTCAACCGGAC
AAGTGGGCAACAGGCAAAAGAAGTGATACCTAAACTCTTCAGTTTGTGTCCCAATCCAAAGGCTACGTTGGAGGTATCACGTGAGCAGATAGAAGATATCATTCGACCTC
TTGGTTTATATAGGAAAAGATCACGAACAATGCATCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGCCATGTCACCCAGCTTCCTGGTGTCGGCAAGTATGGTGCT
GATGCACATGCGATATTCTGCACTGGATATTGGAGTGAAGTAGAACCTAAAGATCACATGCTTAATTATTACTGGGATTTTCTCCACAGCATCAAACACCTGCTCTGA
Protein sequenceShow/hide protein sequence
MAVTTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSTQQNPNPYQDSTQHSPLSTLYDFQTSEPNNHHNKSLASPSSEALEPPILTLEDPSLARRVLCFYRE
FGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDGGNLTEQPSKRSSKRRRKDVAPSYDNSKTNHHSMGKPSRSVQKSG
TDTRVRIVSGYFQNSEKCLEMDRELSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKDKTSSAKPRTTLTAAELFLEAYRRK
SPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGQQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGA
DAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL