CuGenDBv2

Cmc04g0108261 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0108261
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: methyl-CpG-binding domain protein 4-like protein
Genome location: CMiso1.1chr04:27395392..27399990
RNA-Seq Expression: Cmc04g0108261
Synteny: Cmc04g0108261
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR045138 - Methyl-CpG binding protein MeCP2/MBD4
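The genome location field above follows a seqid:start..end pattern. As a minimal sketch, it can be split into its parts and the locus span computed as below. The names GenomeLocation and parse_location are hypothetical helpers introduced for illustration, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a 'seqid:start..end' string into sequence ID and coordinates."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return GenomeLocation(seqid, start, end)


loc = parse_location("CMiso1.1chr04:27395392..27399990")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 4599 bases on CMiso1.1chr04.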


Homology
GenBank top hits (e value, % identity, alignment)
KAA0054277.1 methyl-CpG-binding domain protein 4-like protein [Cucumis melo var. makuwa] | e value: 8.7e-257 | % identity: 98.92
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA QNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYF+KSVKERAAHYEDENDDGNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKS TDTR RIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS

XP_004142362.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucumis sativus] | e value: 1.1e-254 | % identity: 92.83
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MA+TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQ+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLP QSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYFQ+SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKSGTDT+VRIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

XP_008460559.1 PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo] | e value: 1.5e-272 | % identity: 99.18
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA QNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYF+KSVKERAAHYEDENDDGNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKS TDTR RIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

XP_031741432.1 methyl-CpG-binding domain protein 4 isoform X2 [Cucumis sativus] | e value: 4.9e-228 | % identity: 92.2
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MA+TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQ+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLP QSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYFQ+SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKSGTDT+VRIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVG
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVG
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVG

XP_038892490.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Benincasa hispida] | e value: 7.5e-192 | % identity: 72.69
Query:  AATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDST--------QHSPISTLYDLQTSEPNNHHNKSL-------------
        AAT SIN NLTPPSSSSYP DLFS+F FRG+SRSR    PSKS+QQNP   QD T        QHSPI+T  DLQ SEP NH NKSL             
Subjt:  AATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDST--------QHSPISTLYDLQTSEPNNHHNKSL-------------

Query:  -ASPSSEADEPPILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKER
         +SPSS+  EPPILTLEDLQN K  LQ PKKP LARR+L+FYREFGFD+K+ Q TSHSVLNSEPVQEG R+ SRYFQNS+STQQ ER VSRYFQKSVK+R
Subjt:  -ASPSSEADEPPILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKER

Query:  AAHYEDENDDGNLTEQPSKRSSKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFF
         AH EDE++D NLTEQPSKRSSKRRRKDVDPSS NSKTN HSMGK SRS+QKSGTD RVRIVS YFQ SEK++E+DR                       
Subjt:  AAHYEDENDDGNLTEQPSKRSSKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFF

Query:  LKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNR
                     EAT+Q+NQ AKS KRVRKPVNERKQ++KTSS+KPRTTLTAAEL LEAYRRKS DDTWKPPPSG RLLQ DHAYDPWRVLVICMLLNR
Subjt:  LKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNR

Query:  TSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWD
        TSG+QAKEVIPKLF LCPNPKATL+VS+EQIEDIIRPLGL RKRSRTM  LSEMYLKE+WSHVTQLPGVGKYGADAHAIFCTGYW+EV+PKDHMLNYYW+
Subjt:  TSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWD

Query:  FLHSIKHLL
        FLHSI+HLL
Subjt:  FLHSIKHLL
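The % identity values listed for each hit can be related to the alignment display: in the line between Query and Subjt, a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these per segment. The function names are hypothetical, and it assumes identity is the number of identical columns divided by the total number of alignment columns, gaps included.

```python
from typing import Iterable, Tuple


def identity_stats(query: str, match_line: str) -> Tuple[int, int, int]:
    """Return (identical, positives, columns) for one alignment segment."""
    columns = len(query)
    # Trailing spaces of the match line are often lost in plain text, so pad it.
    padded = match_line.ljust(columns)
    identical = sum(1 for ch in padded if ch.isalpha())
    conservative = sum(1 for ch in padded if ch == "+")
    return identical, identical + conservative, columns


def percent_identity(segments: Iterable[Tuple[str, str]]) -> float:
    """Percent identity over several (query, match_line) segments."""
    identical = 0
    columns = 0
    for query, match_line in segments:
        ident, _positives, cols = identity_stats(query, match_line)
        identical += ident
        columns += cols
    return round(100.0 * identical / columns, 2)


# Two short excerpts taken from the alignments in this record.
print(identity_stats("TGYWS", "TGYW+"))
print(percent_identity([("SKSAQQNP", "SKSA QNP")]))
```

Applied to all segments of a hit, this should come close to the listed % identity, provided the displayed alignment is complete.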

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KRW9 ENDO3c domain-containing protein | e value: 4.4e-230 | % identity: 90.87
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MA+TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQ+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLP QSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYFQ+SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKSGTDT+VRIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFC
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY A    + C
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFC

A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein | e value: 7.1e-273 | % identity: 99.18
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA QNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYF+KSVKERAAHYEDENDDGNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKS TDTR RIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein | e value: 4.2e-257 | % identity: 98.92
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA QNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYF+KSVKERAAHYEDENDDGNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKS TDTR RIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein | e value: 4.3e-169 | % identity: 63.74
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSK----SAQQNPNPYQDST--------QHSPISTLYDLQTSEPNNHHNKSLA-------
        M ATT +NPNL+PPSSSS+P  LFS+F F+G S SRFRFPPSK    S +QNP P +D T        Q+SPISTL  LQTSE N  H K+ A       
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSK----SAQQNPNPYQDST--------QHSPISTLYDLQTSEPNNHHNKSLA-------

Query:  ---------------------------SPSSEAD-----EPPILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTR
                                   +P+SE +     EPPILTLED+QN K   Q   +P LARRVL FYR+FGFD++++Q T  SV NS PVQ   R
Subjt:  ---------------------------SPSSEAD-----EPPILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTR

Query:  VVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDG--NLTEQPSKRS-----SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVS
        VVSR+FQ S+S QQ ERIVSRYFQ S  ERAAH EDE++D   N+T+QP KRS      KRRRKDV  SS NSK    S+ K+SR V++SGTD RVR VS
Subjt:  VVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDG--NLTEQPSKRS-----SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSGTDTRVRIVS

Query:  GYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRR
         YFQ SEK+ E++ EVSP LQNSK+ QQ E++VSRFF KS +Q+ VNNQ+E  +  +QCAKSVKR+RKP  ERK ++K S+ +PRTTL+A ELFLEAYRR
Subjt:  GYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRR

Query:  KSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV
        KS DDTWKPPPSG RLLQ DHAYDPWRVLVICMLLNRT+G+QAKEVIPKLF+LCP+PK+ LEVS+EQIEDIIRPLGL RKRS T+ RLSEMYLKESWSHV
Subjt:  KSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV

Query:  TQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        TQLPGVGKYGADAHAIFCTGYW+EV PKDHMLNYYW+FLHSIKHLL
Subjt:  TQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X1 | e value: 8.4e-157 | % identity: 66.74
Query:  HSPISTLYDLQTSEPNNHHNKSLA----------------------------------SPSSE-----ADEPPILTLEDLQNGKLPLQSPKKPSLARRVL
        +SPISTL  LQTSE N  H K+ A                                  +P+SE     A EPPILTLEDLQN K   Q   KP LARRVL
Subjt:  HSPISTLYDLQTSEPNNHHNKSLA----------------------------------SPSSE-----ADEPPILTLEDLQNGKLPLQSPKKPSLARRVL

Query:  SFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS-----SKRRRKDVDPSSV
         F R+FGFD++++Q T  SV NS PVQ   RVVSR+FQ S+S QQ ERIVSRYFQ S  ERAAH EDE+DD N+T+QP KRS      KRRRKDV  SS 
Subjt:  SFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRS-----SKRRRKDVDPSSV

Query:  NSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVN
        NSK    S+ K+SRS++KSGTD RVRIVS YFQ SEK+ E++ EVSPSLQNSK+NQQEE++VSRFF KS + + VNNQ+E  +  +QCAKSVKR+RKP  
Subjt:  NSKTNHHSMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVN

Query:  ERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDI
        ERK ++K S+ KPRTTL+A ELFLEAYRRKS DDTWKPPPSG RLLQ DHAYDPWRVLVICMLLNRT+G+QAKEVIPKLF+LCP+PK+ LEVS+EQIEDI
Subjt:  ERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDI

Query:  IRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        IRPLGL RKRS T+ RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+EV PKDHMLNYYW+FLHSIKHLL
Subjt:  IRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

SwissProt top hits (e value, % identity, alignment)
O95243 Methyl-CpG-binding domain protein 4 | e value: 6.7e-26 | % identity: 37.14
Query:  RKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSH
        R+     W PP S   L+Q    +DPW++L+  + LNRTSG+ A  V+ K     P+ +         + ++++PLGLY  R++T+ + S+ YL + W +
Subjt:  RKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSH

Query:  VTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
          +L G+GKYG D++ IFC   W +V P+DH LN Y D+L
Subjt:  VTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL

Q0IGK1 Methyl-CpG-binding domain protein 4-like protein | e value: 2.3e-50 | % identity: 43.98
Query:  SGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST
        S    +V  VS YFQ S  S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K V   +       Q N++   
Subjt:  SGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST

Query:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR
        K R           L+ ++   + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K   EV  E+IE++I+
Subjt:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR

Query:  PLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
        PLGL +KR++ + RLS  YL+ESW+HVTQL GVGKY ADA+AIFC G W  V+P DHMLNYYWD+L
Subjt:  PLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL

Q7LX22 Thymine/uracil-DNA glycosylase | e value: 8.2e-08 | % identity: 32.08
Query:  GTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV-------TQLPG
        G + L    A DPW VLV  +LL +T+ +Q  ++  +     P+P    + S E+I+ II+PLG+   R+  + +LSE  ++     +         LPG
Subjt:  GTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV-------TQLPG

Query:  VGKYGA
        VG Y A
Subjt:  VGKYGA

Q9YDP0 Thymine-DNA glycosylase | e value: 2.2e-08 | % identity: 30.77
Query:  GTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMY-------LKESWSHVTQLPG
        G + L   +  DPW +LV   LL +T+ RQ   V  +     PNPKA      +++ ++IRPLG+  +R++ +  L++         +  S   + +LPG
Subjt:  GTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMY-------LKESWSHVTQLPG

Query:  VGKY
        VG Y
Subjt:  VGKY

Q9Z2D7 Methyl-CpG-binding domain protein 4 | e value: 9.6e-25 | % identity: 32.28
Query:  QLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELF----LEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKL
        ++  C+++ K       +     +T   K +T+L  +  +    L   RRKS    W PP S   L+Q    +DPW++L+  + LNRTSG+ A  V+ + 
Subjt:  QLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELF----LEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKL

Query:  FSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
            P+ +         + ++++PLGLY  R++T+ + S+ YL + W +  +L G+GKYG D++ IFC   W +V P+DH LN Y D+L
Subjt:  FSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL

Arabidopsis top hits (e value, % identity, alignment)
AT3G07930.1 DNA glycosylase superfamily protein | e value: 7.3e-12 | % identity: 34.34
Query:  SGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST
        S    +V  VS YFQ S  S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K V   +       Q N++   
Subjt:  SGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST

Query:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTS
        K R           L+ ++   + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TS
Subjt:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTS

AT3G07930.2 DNA glycosylase superfamily protein | e value: 3.9e-13 | % identity: 34.91
Query:  SGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST
        S    +V  VS YFQ S  S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K V   +       Q N++   
Subjt:  SGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST

Query:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQ
        K R           L+ ++   + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q
Subjt:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQ

AT3G07930.3 DNA glycosylase superfamily protein | e value: 1.6e-51 | % identity: 43.98
Query:  SGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST
        S    +V  VS YFQ S  S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K V   +       Q N++   
Subjt:  SGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST

Query:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR
        K R           L+ ++   + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K   EV  E+IE++I+
Subjt:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR

Query:  PLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
        PLGL +KR++ + RLS  YL+ESW+HVTQL GVGKY ADA+AIFC G W  V+P DHMLNYYWD+L
Subjt:  PLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL


Sequences
CDS sequence
ATGGCTGCAACAACAAGCATCAACCCTAACCTCACCCCACCATCCTCTTCTTCATATCCCCACGATTTGTTTTCCGAATTCGTCTTTCGAGGTACTTCTCGCTCCAGATT
TCGCTTTCCTCCTTCTAAATCCGCTCAACAAAACCCTAATCCGTACCAGGATTCCACCCAGCACTCTCCAATTTCTACTCTTTATGATCTCCAAACGTCAGAACCCAACA
ATCATCACAACAAATCCTTAGCATCCCCATCTTCTGAAGCCGACGAGCCTCCTATATTAACACTAGAGGATCTTCAAAATGGAAAACTACCCCTTCAATCGCCAAAAAAG
CCTTCACTCGCTCGTAGAGTCTTGTCTTTTTACCGAGAGTTCGGATTTGATAAAAAATTGTTGCAAGCAACTTCGCATTCTGTCCTGAATTCAGAACCTGTTCAAGAAGG
GACCCGTGTGGTTTCGCGTTATTTCCAAAACTCAAGATCAACCCAACAACGCGAACGAATTGTCTCACGATATTTTCAAAAATCGGTGAAGGAACGAGCAGCCCATTATG
AGGATGAGAATGATGATGGCAATCTCACAGAGCAGCCAAGTAAAAGATCTAGCAAAAGGAGGAGGAAAGACGTAGACCCCAGCTCCGTTAACTCAAAAACAAATCATCAT
TCAATGGGAAAGACTTCGCGCTCTGTTCAGAAGTCGGGAACGGATACACGAGTGCGAATTGTTTCGGGCTATTTTCAATATTCTGAAAAGAGTCTTGAAATGGATCGAGA
AGTATCACCTTCTTTACAAAATTCAAAATCAAATCAACAAGAAGAGAAAATGGTCTCACGTTTCTTTCTAAAGTCAGGGAAACAACAAGCCGTGAACAATCAGGAAGAGG
CTACAGAGCAGCTAAATCAGTGTGCGAAATCTGTTAAAAGGGTGCGTAAACCAGTCAATGAAAGGAAACAGAAGAACAAGACAAGTTCTACTAAACCTCGGACCACTCTT
ACTGCTGCAGAGTTGTTTTTGGAAGCTTACAGAAGGAAATCGCCAGATGATACATGGAAGCCTCCTCCCTCTGGAACTCGCCTTCTCCAACACGATCATGCGTACGACCC
TTGGAGGGTTCTAGTCATATGTATGCTCCTCAACCGGACGAGTGGGCGACAGGCAAAAGAAGTGATACCTAAACTCTTCAGTTTGTGTCCCAATCCAAAGGCTACTTTGG
AGGTATCACGTGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTATATAGGAAAAGATCACGAACAATGCATCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGC
CATGTCACCCAGCTTCCTGGTGTCGGCAAGTATGGTGCCGATGCACATGCGATATTCTGCACTGGATATTGGAGTGAAGTAGAACCTAAAGATCACATGCTTAATTATTA
CTGGGATTTTCTCCACAGTATCAAACACCTTCTCTGA
mRNA sequence
AAATAAATAAAAAGCGCTAAAATGGTAGAAAGGATTTATAACCCAAGGAGAAGCGCGCGAAGGCAGTGAAGTGATTGTTCTTGTTTCTGGTGTTCCGTAGCGAATCCCGC
CATGGCTGCAACAACAAGCATCAACCCTAACCTCACCCCACCATCCTCTTCTTCATATCCCCACGATTTGTTTTCCGAATTCGTCTTTCGAGGTACTTCTCGCTCCAGAT
TTCGCTTTCCTCCTTCTAAATCCGCTCAACAAAACCCTAATCCGTACCAGGATTCCACCCAGCACTCTCCAATTTCTACTCTTTATGATCTCCAAACGTCAGAACCCAAC
AATCATCACAACAAATCCTTAGCATCCCCATCTTCTGAAGCCGACGAGCCTCCTATATTAACACTAGAGGATCTTCAAAATGGAAAACTACCCCTTCAATCGCCAAAAAA
GCCTTCACTCGCTCGTAGAGTCTTGTCTTTTTACCGAGAGTTCGGATTTGATAAAAAATTGTTGCAAGCAACTTCGCATTCTGTCCTGAATTCAGAACCTGTTCAAGAAG
GGACCCGTGTGGTTTCGCGTTATTTCCAAAACTCAAGATCAACCCAACAACGCGAACGAATTGTCTCACGATATTTTCAAAAATCGGTGAAGGAACGAGCAGCCCATTAT
GAGGATGAGAATGATGATGGCAATCTCACAGAGCAGCCAAGTAAAAGATCTAGCAAAAGGAGGAGGAAAGACGTAGACCCCAGCTCCGTTAACTCAAAAACAAATCATCA
TTCAATGGGAAAGACTTCGCGCTCTGTTCAGAAGTCGGGAACGGATACACGAGTGCGAATTGTTTCGGGCTATTTTCAATATTCTGAAAAGAGTCTTGAAATGGATCGAG
AAGTATCACCTTCTTTACAAAATTCAAAATCAAATCAACAAGAAGAGAAAATGGTCTCACGTTTCTTTCTAAAGTCAGGGAAACAACAAGCCGTGAACAATCAGGAAGAG
GCTACAGAGCAGCTAAATCAGTGTGCGAAATCTGTTAAAAGGGTGCGTAAACCAGTCAATGAAAGGAAACAGAAGAACAAGACAAGTTCTACTAAACCTCGGACCACTCT
TACTGCTGCAGAGTTGTTTTTGGAAGCTTACAGAAGGAAATCGCCAGATGATACATGGAAGCCTCCTCCCTCTGGAACTCGCCTTCTCCAACACGATCATGCGTACGACC
CTTGGAGGGTTCTAGTCATATGTATGCTCCTCAACCGGACGAGTGGGCGACAGGCAAAAGAAGTGATACCTAAACTCTTCAGTTTGTGTCCCAATCCAAAGGCTACTTTG
GAGGTATCACGTGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTATATAGGAAAAGATCACGAACAATGCATCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAG
CCATGTCACCCAGCTTCCTGGTGTCGGCAAGTATGGTGCCGATGCACATGCGATATTCTGCACTGGATATTGGAGTGAAGTAGAACCTAAAGATCACATGCTTAATTATT
ACTGGGATTTTCTCCACAGTATCAAACACCTTCTCTGATCTTATTTGACTGTAGATGGTTTTGTTCGAGAAGCAGAAGTTGTGGATTTCAGGCTCTACATCCCTTTCTAC
GCATATTTTGGAATGTTCTGACAAAAAGAAACTCTTTTGTGCTATCGGTTTTGTAAATGTTATTTGATGTTGTTGGGACTTGGAGTTTGGTTGGGAATTATTTAAACTTG
TTATCGATCTCTCCTATTATTAACATTATATACTAGTAAGAGGAGAAAAGGAAATTCTCCTCCAACATTAGCTAGTTATGTTAGACTGTAGCTGAAGCTGTGTACTGTTT
GTGAGGTCAATGGATTTTGTTCATGTTTTGTCTCCAAACGTGTCATCACCAAGGGTGGCTTACCGTTCACTAAATATTTTGTATATGGCTGTAACCTTTACATTTTAATC
TACTGGTTGATGTACTTGGTTTCAATTTAAAATCCATTCGACTAGTTTCTGCATTTGTTCCACGT
Protein sequence
MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAQQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQNGKLPLQSPKK
PSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFQKSVKERAAHYEDENDDGNLTEQPSKRSSKRRRKDVDPSSVNSKTNHH
SMGKTSRSVQKSGTDTRVRIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTL
TAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWS
HVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
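The CDS and protein sequences above are related by translation. As a minimal sketch, the following translates a coding sequence codon by codon, which allows the two records to be checked against each other. It assumes the standard genetic code applies to this nuclear gene. The translate function and CODON_TABLE are illustrative names and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; whitespace and line breaks are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases are reported as X.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS listed in this record.
print(translate("ATGGCTGCAACAACAAGC"))
print(translate("CACCTTCTCTGA"))
```

The two excerpts translate to MAATTS and HLL, which agree with the start and end of the protein sequence listed above. Passing the full CDS, with its line breaks, should reproduce the whole protein.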