; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0022876 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0022876
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Descriptionmethyl-CpG-binding domain protein 4-like protein
Genome locationchr10:26324840..26329354
RNA-Seq ExpressionIVF0022876
SyntenyIVF0022876
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR045138 - Methyl-CpG binding protein MeCP2/MBD4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054277.1 methyl-CpG-binding domain protein 4-like protein [Cucumis melo var. makuwa]0.099.78Show/hide
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS

XP_004142362.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucumis sativus]0.092.01Show/hide
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MA+TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLP QSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYF++SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKS TDT+ RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

XP_008460559.1 PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo]0.0100Show/hide
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

XP_031741432.1 methyl-CpG-binding domain protein 4 isoform X2 [Cucumis sativus]1.45e-28591.31Show/hide
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MA+TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLP QSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYF++SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKS TDT+ RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVG
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVG
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVG

XP_038892490.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Benincasa hispida]2.23e-23771.91Show/hide
Query:  AATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQ--------HSPISTLYDLQTSEPNNHHNKSLA------------
        AAT SIN NLTPPSSSSYP DLFS+F FRG+SRSR    PSKS+ QNP   QD TQ        HSPI+T  DLQ SEP NH NKSL+            
Subjt:  AATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQ--------HSPISTLYDLQTSEPNNHHNKSLA------------

Query:  --SPSSEADEPPILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKER
          SPSS+  EPPILTLEDLQN K  LQ PKKP LARR+L+FYREFGFD+K+ Q TSHSVLNSEPVQEG R+ SRYFQNS+STQQ ER VSRYF+KSVK+R
Subjt:  --SPSSEADEPPILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKER

Query:  AAHYEDENDDGNLTEQPSKRSSKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFF
         AH EDE++D NLTEQPSKRSSKRRRKDVDPSS NSKTN HSMGK SRS+QKS TD R RIVS YFQ SEK++E+DR                       
Subjt:  AAHYEDENDDGNLTEQPSKRSSKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFF

Query:  LKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNR
                     EAT+Q+NQ AKS KRVRKPVNERKQ++KTSS+KPRTTLTAAEL LEAYRRKS DDTWKPPPSG RLLQ DHAYDPWRVLVICMLLNR
Subjt:  LKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNR

Query:  TSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWD
        TSG+QAKEVIPKLF LCPNPKATL+VS+EQIEDIIRPLGL RKRSRTM  LSEMYLKE+WSHVTQLPGVGKYGADAHAIFCTGYW+EV+PKDHMLNYYW+
Subjt:  TSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWD

Query:  FLHSIKHLL
        FLHSI+HLL
Subjt:  FLHSIKHLL

TrEMBL top hitse value%identityAlignment
A0A0A0KRW9 ENDO3c domain-containing protein1.6e-22790Show/hide
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MA+TTSI+PNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSA Q+PNPYQDSTQHSP+STL+DLQT EP+NHHN+SLASPSSE  EPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLP QSPK+PSLARRVLSFYREFGFDKKLLQATSHSVLNS P QEGTRVVSRYFQNSRSTQQ +RIVSRYF++SVKER AHYEDEND GNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDV P S NSKTNHHS+GKT+RSVQKS TDT+ RIVSGYFQ  EKSLEMDREVSPSLQNSKSNQQEEK+VSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKR+RKPVNERK+K+KTSSTKPRTTLTAAELFLEAYRRKSP DTWKPP SGTRLLQHDHAYDPWRVLVICMLLNRTSG+QAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFC
        ATLEVSREQIEDIIRPLG YRKRSRTMHRLSEMYLKESWSHVTQLPGVGKY A    + C
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFC

A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein2.6e-275100Show/hide
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein1.5e-25999.78Show/hide
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
        MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQN

Query:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS
        GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS
Subjt:  GKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS

Query:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
        SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ
Subjt:  SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQ

Query:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
        CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK
Subjt:  CAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPK

Query:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS
        ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+
Subjt:  ATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWS

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein5.3e-16763.19Show/hide
Query:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSK----SAHQNPNPYQDST--------QHSPISTLYDLQTSEPNNHHNKSLA-------
        M ATT +NPNL+PPSSSS+P  LFS+F F+G S SRFRFPPSK    S  QNP P +D T        Q+SPISTL  LQTSE N  H K+ A       
Subjt:  MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSK----SAHQNPNPYQDST--------QHSPISTLYDLQTSEPNNHHNKSLA-------

Query:  ---------------------------SPSSEAD-----EPPILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTR
                                   +P+SE +     EPPILTLED+QN K   Q   +P LARRVL FYR+FGFD++++Q T  SV NS PVQ   R
Subjt:  ---------------------------SPSSEAD-----EPPILTLEDLQNGKLPLQSPKKPSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTR

Query:  VVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDG--NLTEQPSKRS-----SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVS
        VVSR+FQ S+S QQ ERIVSRYF+ S  ERAAH EDE++D   N+T+QP KRS      KRRRKDV  SS NSK    S+ K+SR V++S TD R R VS
Subjt:  VVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDG--NLTEQPSKRS-----SKRRRKDVDPSSVNSKTNHHSMGKTSRSVQKSRTDTRARIVS

Query:  GYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRR
         YFQ SEK+ E++ EVSP LQNSK+ QQ E++VSRFF KS +Q+ VNNQ+E  +  +QCAKSVKR+RKP  ERK ++K S+ +PRTTL+A ELFLEAYRR
Subjt:  GYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELFLEAYRR

Query:  KSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV
        KS DDTWKPPPSG RLLQ DHAYDPWRVLVICMLLNRT+G+QAKEVIPKLF+LCP+PK+ LEVS+EQIEDIIRPLGL RKRS T+ RLSEMYLKESWSHV
Subjt:  KSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV

Query:  TQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        TQLPGVGKYGADAHAIFCTGYW+EV PKDHMLNYYW+FLHSIKHLL
Subjt:  TQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X11.0e-15466.11Show/hide
Query:  HSPISTLYDLQTSEPNNHHNKSLA----------------------------------SPSSE-----ADEPPILTLEDLQNGKLPLQSPKKPSLARRVL
        +SPISTL  LQTSE N  H K+ A                                  +P+SE     A EPPILTLEDLQN K   Q   KP LARRVL
Subjt:  HSPISTLYDLQTSEPNNHHNKSLA----------------------------------SPSSE-----ADEPPILTLEDLQNGKLPLQSPKKPSLARRVL

Query:  SFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS-----SKRRRKDVDPSSV
         F R+FGFD++++Q T  SV NS PVQ   RVVSR+FQ S+S QQ ERIVSRYF+ S  ERAAH EDE+DD N+T+QP KRS      KRRRKDV  SS 
Subjt:  SFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRS-----SKRRRKDVDPSSV

Query:  NSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVN
        NSK    S+ K+SRS++KS TD R RIVS YFQ SEK+ E++ EVSPSLQNSK+NQQEE++VSRFF KS + + VNNQ+E  +  +QCAKSVKR+RKP  
Subjt:  NSKTNHHSMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVN

Query:  ERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDI
        ERK ++K S+ KPRTTL+A ELFLEAYRRKS DDTWKPPPSG RLLQ DHAYDPWRVLVICMLLNRT+G+QAKEVIPKLF+LCP+PK+ LEVS+EQIEDI
Subjt:  ERKQKNKTSSTKPRTTLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDI

Query:  IRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL
        IRPLGL RKRS T+ RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+EV PKDHMLNYYW+FLHSIKHLL
Subjt:  IRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL

SwissProt top hitse value%identityAlignment
O95243 Methyl-CpG-binding domain protein 46.7e-2637.14Show/hide
Query:  RKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSH
        R+     W PP S   L+Q    +DPW++L+  + LNRTSG+ A  V+ K     P+ +         + ++++PLGLY  R++T+ + S+ YL + W +
Subjt:  RKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSH

Query:  VTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
          +L G+GKYG D++ IFC   W +V P+DH LN Y D+L
Subjt:  VTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL

Q0IGK1 Methyl-CpG-binding domain protein 4-like protein3.9e-5043.61Show/hide
Query:  SRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST
        S+   +   VS YFQ S  S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K V   +       Q N++   
Subjt:  SRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST

Query:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR
        K R           L+ ++   + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K   EV  E+IE++I+
Subjt:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR

Query:  PLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
        PLGL +KR++ + RLS  YL+ESW+HVTQL GVGKY ADA+AIFC G W  V+P DHMLNYYWD+L
Subjt:  PLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL

Q7LX22 Thymine/uracil-DNA glycosylase8.2e-0832.08Show/hide
Query:  GTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV-------TQLPG
        G + L    A DPW VLV  +LL +T+ +Q  ++  +     P+P    + S E+I+ II+PLG+   R+  + +LSE  ++     +         LPG
Subjt:  GTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHV-------TQLPG

Query:  VGKYGA
        VG Y A
Subjt:  VGKYGA

Q9YDP0 Thymine-DNA glycosylase2.2e-0830.77Show/hide
Query:  GTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMY-------LKESWSHVTQLPG
        G + L   +  DPW +LV   LL +T+ RQ   V  +     PNPKA      +++ ++IRPLG+  +R++ +  L++         +  S   + +LPG
Subjt:  GTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMY-------LKESWSHVTQLPG

Query:  VGKY
        VG Y
Subjt:  VGKY

Q9Z2D7 Methyl-CpG-binding domain protein 49.6e-2532.28Show/hide
Query:  QLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELF----LEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKL
        ++  C+++ K       +     +T   K +T+L  +  +    L   RRKS    W PP S   L+Q    +DPW++L+  + LNRTSG+ A  V+ + 
Subjt:  QLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTLTAAELF----LEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKL

Query:  FSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
            P+ +         + ++++PLGLY  R++T+ + S+ YL + W +  +L G+GKYG D++ IFC   W +V P+DH LN Y D+L
Subjt:  FSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL

Arabidopsis top hitse value%identityAlignment
AT3G07930.1 DNA glycosylase superfamily protein1.3e-1133.73Show/hide
Query:  SRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST
        S+   +   VS YFQ S  S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K V   +       Q N++   
Subjt:  SRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST

Query:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTS
        K R           L+ ++   + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TS
Subjt:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTS

AT3G07930.2 DNA glycosylase superfamily protein6.6e-1334.32Show/hide
Query:  SRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST
        S+   +   VS YFQ S  S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K V   +       Q N++   
Subjt:  SRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST

Query:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQ
        K R           L+ ++   + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q
Subjt:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQ

AT3G07930.3 DNA glycosylase superfamily protein2.8e-5143.61Show/hide
Query:  SRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST
        S+   +   VS YFQ S  S + D ++   + +S+S +   K  S+  +K  +      +   +EQ NQ  K ++   K V   +       Q N++   
Subjt:  SRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERK-------QKNKTSST

Query:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR
        K R           L+ ++   + Y RK+PD+TW PP S   LLQ DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K   EV  E+IE++I+
Subjt:  KPRT---------TLTAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIR

Query:  PLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL
        PLGL +KR++ + RLS  YL+ESW+HVTQL GVGKY ADA+AIFC G W  V+P DHMLNYYWD+L
Subjt:  PLGLYRKRSRTMHRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCAACAACAAGCATCAACCCTAACCTCACCCCACCATCCTCTTCTTCATATCCCCACGATTTGTTTTCCGAATTCGTCTTTCGAGGTACTTCTCGCTCCAGATT
TCGCTTTCCTCCTTCTAAATCCGCTCATCAAAACCCTAATCCGTACCAGGATTCCACCCAGCACTCTCCAATTTCTACTCTTTATGATCTCCAAACGTCAGAACCCAACA
ATCATCACAACAAATCCTTAGCATCCCCATCTTCTGAAGCCGACGAGCCTCCTATATTAACACTAGAGGATCTTCAAAATGGAAAACTACCCCTTCAATCGCCAAAAAAG
CCTTCACTCGCTCGTAGAGTCTTGTCTTTTTACCGAGAGTTCGGATTTGATAAAAAATTGTTGCAAGCAACTTCGCATTCTGTCCTGAATTCAGAACCTGTTCAAGAAGG
GACCCGTGTGGTTTCGCGTTATTTCCAAAACTCAAGATCAACCCAACAACGCGAACGAATTGTCTCACGATATTTTAAAAAATCGGTGAAGGAACGAGCAGCCCATTATG
AGGATGAGAATGATGATGGCAATCTCACAGAGCAGCCAAGTAAAAGATCTAGCAAAAGGAGGAGGAAAGACGTAGACCCCAGCTCCGTTAACTCAAAAACAAATCATCAT
TCAATGGGAAAGACTTCGCGCTCTGTTCAGAAGTCGAGAACGGATACACGAGCGCGAATTGTTTCGGGCTATTTTCAATATTCTGAAAAGAGTCTTGAAATGGATCGAGA
AGTATCACCTTCTTTACAAAATTCAAAATCAAATCAACAAGAAGAGAAAATGGTCTCACGTTTCTTTCTAAAGTCAGGGAAACAACAAGCCGTGAACAATCAGGAAGAGG
CTACAGAGCAGCTAAATCAGTGTGCGAAATCTGTTAAAAGGGTGCGTAAACCAGTCAATGAAAGGAAACAGAAGAACAAGACAAGTTCTACTAAACCTCGGACCACTCTT
ACTGCTGCAGAGTTGTTTTTGGAAGCTTACAGAAGGAAATCGCCAGATGATACATGGAAGCCTCCTCCCTCTGGAACTCGCCTTCTCCAACACGATCATGCGTACGACCC
TTGGAGGGTTCTAGTCATATGTATGCTCCTCAACCGGACGAGTGGGCGACAGGCAAAAGAAGTGATACCTAAACTCTTCAGTTTGTGTCCCAATCCAAAGGCTACTTTGG
AGGTATCACGTGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTATATAGGAAAAGATCACGAACAATGCATCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGC
CATGTCACCCAGCTTCCTGGTGTCGGCAAGTATGGTGCCGATGCACATGCGATATTCTGCACTGGATATTGGAGTGAAGTAGAACCTAAAGATCACATGCTTAATTATTA
CTGGGATTTTCTCCACAGTATCAAACACCTTCTCTGA
mRNA sequenceShow/hide mRNA sequence
GCGCGCGAAGGCAGTGAAGTGATTGTTCTTGTTTCTGGTGTTCCGTAGCGAATCCCGCCATGGCTGCAACAACAAGCATCAACCCTAACCTCACCCCACCATCCTCTTCT
TCATATCCCCACGATTTGTTTTCCGAATTCGTCTTTCGAGGTACTTCTCGCTCCAGATTTCGCTTTCCTCCTTCTAAATCCGCTCATCAAAACCCTAATCCGTACCAGGA
TTCCACCCAGCACTCTCCAATTTCTACTCTTTATGATCTCCAAACGTCAGAACCCAACAATCATCACAACAAATCCTTAGCATCCCCATCTTCTGAAGCCGACGAGCCTC
CTATATTAACACTAGAGGATCTTCAAAATGGAAAACTACCCCTTCAATCGCCAAAAAAGCCTTCACTCGCTCGTAGAGTCTTGTCTTTTTACCGAGAGTTCGGATTTGAT
AAAAAATTGTTGCAAGCAACTTCGCATTCTGTCCTGAATTCAGAACCTGTTCAAGAAGGGACCCGTGTGGTTTCGCGTTATTTCCAAAACTCAAGATCAACCCAACAACG
CGAACGAATTGTCTCACGATATTTTAAAAAATCGGTGAAGGAACGAGCAGCCCATTATGAGGATGAGAATGATGATGGCAATCTCACAGAGCAGCCAAGTAAAAGATCTA
GCAAAAGGAGGAGGAAAGACGTAGACCCCAGCTCCGTTAACTCAAAAACAAATCATCATTCAATGGGAAAGACTTCGCGCTCTGTTCAGAAGTCGAGAACGGATACACGA
GCGCGAATTGTTTCGGGCTATTTTCAATATTCTGAAAAGAGTCTTGAAATGGATCGAGAAGTATCACCTTCTTTACAAAATTCAAAATCAAATCAACAAGAAGAGAAAAT
GGTCTCACGTTTCTTTCTAAAGTCAGGGAAACAACAAGCCGTGAACAATCAGGAAGAGGCTACAGAGCAGCTAAATCAGTGTGCGAAATCTGTTAAAAGGGTGCGTAAAC
CAGTCAATGAAAGGAAACAGAAGAACAAGACAAGTTCTACTAAACCTCGGACCACTCTTACTGCTGCAGAGTTGTTTTTGGAAGCTTACAGAAGGAAATCGCCAGATGAT
ACATGGAAGCCTCCTCCCTCTGGAACTCGCCTTCTCCAACACGATCATGCGTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTCAACCGGACGAGTGGGCGACA
GGCAAAAGAAGTGATACCTAAACTCTTCAGTTTGTGTCCCAATCCAAAGGCTACTTTGGAGGTATCACGTGAGCAGATAGAAGATATCATTCGACCTCTTGGTTTATATA
GGAAAAGATCACGAACAATGCATCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGCCATGTCACCCAGCTTCCTGGTGTCGGCAAGTATGGTGCCGATGCACATGCG
ATATTCTGCACTGGATATTGGAGTGAAGTAGAACCTAAAGATCACATGCTTAATTATTACTGGGATTTTCTCCACAGTATCAAACACCTTCTCTGATCTTATTTGACTGT
AGATGGTTTTGTTCGAGAAGCAGAAGTTGTGGATTTCAGGCTCTACATCCCTTTCTACGCATATTTTGGAATGTTCTGACAAAAAGAAACTCTTTTGTGCTATCGGTTTT
GTAAATGTTATTTGATGTTGTTGGGACTTGGAGTTTGGTTGGGAATTATTTAAACTTGTTATCGATCTCTCCTATTATTAACATTATATACTAGTAAGGGGAGAAAAGGA
AATTCTCCTCCAACATTAGCTAGTTATGTTAGACTGTAGCTGAAGCTGTGTACTGTTTGTGAGGTCAATGGATTTTGTTCATGTTTTGTCTCCAAACGTGTCATCACCAA
GGGTGGCTTACCGTTCACTAAATATTTTGTATATGGCTGTAACCTTTGCATTTTAATCTACTGGTTGATGTACTTGGTTTCAATTTAAAA
Protein sequenceShow/hide protein sequence
MAATTSINPNLTPPSSSSYPHDLFSEFVFRGTSRSRFRFPPSKSAHQNPNPYQDSTQHSPISTLYDLQTSEPNNHHNKSLASPSSEADEPPILTLEDLQNGKLPLQSPKK
PSLARRVLSFYREFGFDKKLLQATSHSVLNSEPVQEGTRVVSRYFQNSRSTQQRERIVSRYFKKSVKERAAHYEDENDDGNLTEQPSKRSSKRRRKDVDPSSVNSKTNHH
SMGKTSRSVQKSRTDTRARIVSGYFQYSEKSLEMDREVSPSLQNSKSNQQEEKMVSRFFLKSGKQQAVNNQEEATEQLNQCAKSVKRVRKPVNERKQKNKTSSTKPRTTL
TAAELFLEAYRRKSPDDTWKPPPSGTRLLQHDHAYDPWRVLVICMLLNRTSGRQAKEVIPKLFSLCPNPKATLEVSREQIEDIIRPLGLYRKRSRTMHRLSEMYLKESWS
HVTQLPGVGKYGADAHAIFCTGYWSEVEPKDHMLNYYWDFLHSIKHLL