; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G001200 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G001200
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionmethyl-CpG-binding domain protein 4-like protein
Genome locationchr11:1351569..1352846
RNA-Seq ExpressionLsi11G001200
SyntenyLsi11G001200
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR045138 - Methyl-CpG binding protein MeCP2/MBD4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054277.1 methyl-CpG-binding domain protein 4-like protein [Cucumis melo var. makuwa]3.1e-14973.21Show/hide
Query:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQ
        M AT SINPNLTPPSSSS+P DLFS+F FRG+SRSRFRFPPSKS  QNP       N     TQHSPISTL DLQTSEP NH NK  A            
Subjt:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQ

Query:  NCPNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRY
                  S SSEA EPP+LTL+DLQN K   Q P+KPSLARRVL FYREFGFD+K++Q TSHSVLN EPVQ+G RVVSRYFQNS+STQQ ERIVSRY
Subjt:  NCPNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRY

Query:  FQNSEKEQAAHIED--DDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSK
        F+ S KE+AAH ED  DD NLTEQ SKRS     SKRRRKDV PSS NSKTN HSMGK SRSVQKS TD R RIVS YFQ SEK+LE+DREVSPSLQNSK
Subjt:  FQNSEKEQAAHIED--DDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSK

Query:  SNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYD
        SNQQ E++VSRFF KS KQQAVN+Q+EATEQLNQCAKSVKRVRKPVN RK ++K+SS KPRTTL+AAELFLEAYRRKS DDTWKPPPSG RLLQ DHAYD
Subjt:  SNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYD

Query:  PWRVLVICMLLNRTTGQQ
        PWRVLVICMLLNRT+G+Q
Subjt:  PWRVLVICMLLNRTTGQQ

KAG7022375.1 Methyl-CpG-binding domain protein 4-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]4.9e-14770.41Show/hide
Query:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSK----STQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPF
        MTAT  +NPN +PP SSS PD LFSQFAF+G S SRFRFPPSK    S +QNPT EDFTQ  + LM Q+SPISTLE LQTSE  NHQ      EIPI   
Subjt:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSK----STQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPF

Query:  EDLQNCPNCEI------------PITSLSS----EAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVV
        EDLQ+ P  EI            P T  S      AHEPP+LTL+DLQNAK+DHQP  KP LARRVLRFYR+FGFD+++VQ T   V N  PVQ   RVV
Subjt:  EDLQNCPNCEI------------PITSLSS----EAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVV

Query:  SRYFQNSKSTQQGERIVSRYFQNSEKEQAAHIEDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNS
        SR+FQ SKSTQQGERIVSRYFQ+SE EQA+H ED+D N T+Q  KRS VG+Y KRRRKDVAPSSDNSK  Q S+ K+SRSV+KSGTDKRVRIVSRYFQNS
Subjt:  SRYFQNSKSTQQGERIVSRYFQNSEKEQAAHIEDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNS

Query:  EKNLEVDREVSPSLQNSKSNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDT
        EKN EV+ EVSPSLQNSK+NQQ E++VSRFFQKS++Q+ VN+QQE T+Q +QCAKSVKR+RKP   RK RDK  SA+PRTTLSA ELFLEAYRRKS DDT
Subjt:  EKNLEVDREVSPSLQNSKSNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDT

Query:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQ
        WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQ
Subjt:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQ

XP_008460559.1 PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo]3.1e-14973.21Show/hide
Query:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQ
        M AT SINPNLTPPSSSS+P DLFS+F FRG+SRSRFRFPPSKS  QNP       N     TQHSPISTL DLQTSEP NH NK  A            
Subjt:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQ

Query:  NCPNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRY
                  S SSEA EPP+LTL+DLQN K   Q P+KPSLARRVL FYREFGFD+K++Q TSHSVLN EPVQ+G RVVSRYFQNS+STQQ ERIVSRY
Subjt:  NCPNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRY

Query:  FQNSEKEQAAHIED--DDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSK
        F+ S KE+AAH ED  DD NLTEQ SKRS     SKRRRKDV PSS NSKTN HSMGK SRSVQKS TD R RIVS YFQ SEK+LE+DREVSPSLQNSK
Subjt:  FQNSEKEQAAHIED--DDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSK

Query:  SNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYD
        SNQQ E++VSRFF KS KQQAVN+Q+EATEQLNQCAKSVKRVRKPVN RK ++K+SS KPRTTL+AAELFLEAYRRKS DDTWKPPPSG RLLQ DHAYD
Subjt:  SNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYD

Query:  PWRVLVICMLLNRTTGQQ
        PWRVLVICMLLNRT+G+Q
Subjt:  PWRVLVICMLLNRTTGQQ

XP_038892490.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Benincasa hispida]2.1e-15072.84Show/hide
Query:  ATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQNC
        ATASIN NLTPPSSSS+PDDLFSQFAFRGSSRSR    PSKS+QQNPTS+DFTQNTTIL+ QHSPI+T EDLQ SEPKNHQNK  +REIPICPF+     
Subjt:  ATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQNC

Query:  PNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRYFQ
           EIPI+S SS+ +EPP+LTL+DLQNAK   QPP+KP LARR+L FYREFGFDQK+ Q TSHSVLN EPVQ+GAR+ SRYFQNSKSTQQGER VSRYFQ
Subjt:  PNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRYFQ

Query:  NSEKEQAAH--IEDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSKSN
         S K++ AH   ED+D NLTEQ SKRS     SKRRRKDV PSSDNSKTNQHSMGKASRS+QKSGTDKRVRIVSRYFQNSEKN+EVDR            
Subjt:  NSEKEQAAH--IEDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSKSN

Query:  QQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPW
                                EAT+Q+NQ AKS KRVRKPVN RK RDK+SS+KPRTTL+AAEL LEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPW
Subjt:  QQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPW

Query:  RVLVICMLLNRTTGQQ
        RVLVICMLLNRT+GQQ
Subjt:  RVLVICMLLNRTTGQQ

XP_038892491.1 uncharacterized protein LOC120081563 isoform X2 [Benincasa hispida]2.1e-15072.84Show/hide
Query:  ATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQNC
        ATASIN NLTPPSSSS+PDDLFSQFAFRGSSRSR    PSKS+QQNPTS+DFTQNTTIL+ QHSPI+T EDLQ SEPKNHQNK  +REIPICPF+     
Subjt:  ATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQNC

Query:  PNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRYFQ
           EIPI+S SS+ +EPP+LTL+DLQNAK   QPP+KP LARR+L FYREFGFDQK+ Q TSHSVLN EPVQ+GAR+ SRYFQNSKSTQQGER VSRYFQ
Subjt:  PNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRYFQ

Query:  NSEKEQAAH--IEDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSKSN
         S K++ AH   ED+D NLTEQ SKRS     SKRRRKDV PSSDNSKTNQHSMGKASRS+QKSGTDKRVRIVSRYFQNSEKN+EVDR            
Subjt:  NSEKEQAAH--IEDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSKSN

Query:  QQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPW
                                EAT+Q+NQ AKS KRVRKPVN RK RDK+SS+KPRTTL+AAEL LEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPW
Subjt:  QQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPW

Query:  RVLVICMLLNRTTGQQ
        RVLVICMLLNRT+GQQ
Subjt:  RVLVICMLLNRTTGQQ

TrEMBL top hitse value%identityAlignment
A0A0A0KRW9 ENDO3c domain-containing protein2.9e-14570.1Show/hide
Query:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQ
        M +T SI+PNLTPPSSSS+P DLFS+F FRG+SRSRFRFPPSKS QQ+P       N     TQHSP+STL DLQT EP NH N+  A            
Subjt:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQ

Query:  NCPNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRY
                  S SSE HEPP+LTL+DLQN K   Q P++PSLARRVL FYREFGFD+K++Q TSHSVLN  P Q+G RVVSRYFQNS+STQQ +RIVSRY
Subjt:  NCPNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRY

Query:  FQNSEKEQAAHIED--DDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSK
        FQ S KE+ AH ED  D  NLTEQ SKRS     SKRRRKDV P SDNSKTN HS+GK +RSVQKSGTD +VRIVS YFQ+ EK+LE+DREVSPSLQNSK
Subjt:  FQNSEKEQAAHIED--DDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSK

Query:  SNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYD
        SNQQ E++VSRFF KS KQQAVN+Q+EATEQLNQCAKSVKR+RKPVN RK +DK+SS KPRTTL+AAELFLEAYRRKS  DTWKPP SG RLLQ DHAYD
Subjt:  SNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYD

Query:  PWRVLVICMLLNRTTGQQ
        PWRVLVICMLLNRT+GQQ
Subjt:  PWRVLVICMLLNRTTGQQ

A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein1.5e-14973.21Show/hide
Query:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQ
        M AT SINPNLTPPSSSS+P DLFS+F FRG+SRSRFRFPPSKS  QNP       N     TQHSPISTL DLQTSEP NH NK  A            
Subjt:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQ

Query:  NCPNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRY
                  S SSEA EPP+LTL+DLQN K   Q P+KPSLARRVL FYREFGFD+K++Q TSHSVLN EPVQ+G RVVSRYFQNS+STQQ ERIVSRY
Subjt:  NCPNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRY

Query:  FQNSEKEQAAHIED--DDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSK
        F+ S KE+AAH ED  DD NLTEQ SKRS     SKRRRKDV PSS NSKTN HSMGK SRSVQKS TD R RIVS YFQ SEK+LE+DREVSPSLQNSK
Subjt:  FQNSEKEQAAHIED--DDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSK

Query:  SNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYD
        SNQQ E++VSRFF KS KQQAVN+Q+EATEQLNQCAKSVKRVRKPVN RK ++K+SS KPRTTL+AAELFLEAYRRKS DDTWKPPPSG RLLQ DHAYD
Subjt:  SNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYD

Query:  PWRVLVICMLLNRTTGQQ
        PWRVLVICMLLNRT+G+Q
Subjt:  PWRVLVICMLLNRTTGQQ

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein1.5e-14973.21Show/hide
Query:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQ
        M AT SINPNLTPPSSSS+P DLFS+F FRG+SRSRFRFPPSKS  QNP       N     TQHSPISTL DLQTSEP NH NK  A            
Subjt:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQ

Query:  NCPNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRY
                  S SSEA EPP+LTL+DLQN K   Q P+KPSLARRVL FYREFGFD+K++Q TSHSVLN EPVQ+G RVVSRYFQNS+STQQ ERIVSRY
Subjt:  NCPNCEIPITSLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRY

Query:  FQNSEKEQAAHIED--DDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSK
        F+ S KE+AAH ED  DD NLTEQ SKRS     SKRRRKDV PSS NSKTN HSMGK SRSVQKS TD R RIVS YFQ SEK+LE+DREVSPSLQNSK
Subjt:  FQNSEKEQAAHIED--DDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSK

Query:  SNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYD
        SNQQ E++VSRFF KS KQQAVN+Q+EATEQLNQCAKSVKRVRKPVN RK ++K+SS KPRTTL+AAELFLEAYRRKS DDTWKPPPSG RLLQ DHAYD
Subjt:  SNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYD

Query:  PWRVLVICMLLNRTTGQQ
        PWRVLVICMLLNRT+G+Q
Subjt:  PWRVLVICMLLNRTTGQQ

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein1.0e-14267.95Show/hide
Query:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSK----STQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPF
        MTAT  +NPNL+PPSSSS PD LFSQFAF+G S SRFRFPPSK    S +QNPT EDFTQ  T LM Q+SPISTLE LQTSE  NHQ     +EIPI   
Subjt:  MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSK----STQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPF

Query:  EDLQNCPN-----------CEIPITSLSSE-----AHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVV
        EDLQ+ P             E+   + +SE      HEPP+LTL+D+QNAK+DHQP  +P LARRVLRFYR+FGFD+++VQ T  SV N  PVQ+  RVV
Subjt:  EDLQNCPN-----------CEIPITSLSSE-----AHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVV

Query:  SRYFQNSKSTQQGERIVSRYFQNSEKEQAAH----IEDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRY
        SR+FQ SKS QQGERIVSRYFQ+SE E+AAH     ED+D N+T+Q  KRS VG Y KRRRKDVA SSDNSK  Q S+ K+SR V++SGTDKRVR VSRY
Subjt:  SRYFQNSKSTQQGERIVSRYFQNSEKEQAAH----IEDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRY

Query:  FQNSEKNLEVDREVSPSLQNSKSNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKS
        FQNSEKN EV+ EVSP LQNSK+ QQ E+IVSRFFQKS++Q+ VN+QQE  +  +QCAKSVKR+RKP   RK RDK  SA+PRTTLSA ELFLEAYRRKS
Subjt:  FQNSEKNLEVDREVSPSLQNSKSNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKS

Query:  SDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQ
        SDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQ
Subjt:  SDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQ

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X13.8e-12169.44Show/hide
Query:  MTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQNCPNCEI------------PITSLSSE----AHEPPLLTLDDLQNAKADHQPPRKPSLARR
        M  +SPISTLE LQTSE  NHQ      EIPI   E LQ+ P  EI            P T  S      AHEPP+LTL+DLQNAK+DHQP  KP LARR
Subjt:  MTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQNCPNCEI------------PITSLSSE----AHEPPLLTLDDLQNAKADHQPPRKPSLARR

Query:  VLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRYFQNSEKEQAAH--IEDDDANLTEQISKRSMVGDYSKRRRKDVAPS
        VLRF R+FGFD+++VQ T  SV N  PVQ+  RVVSR+FQ SKS QQGERIVSRYFQ+SE E+AAH   EDDD N+T+Q  KRS VG Y KRRRKDVA S
Subjt:  VLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRYFQNSEKEQAAH--IEDDDANLTEQISKRSMVGDYSKRRRKDVAPS

Query:  SDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSKSNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKP
        SDNSK  Q S+ K+SRS++KSGTDKRVRIVSRYFQNSEKN EV+ EVSPSLQNSK+NQQ E++VSRFFQKS++ + VN+QQE  +  +QCAKSVKR+RKP
Subjt:  SDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSKSNQQAEQIVSRFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKP

Query:  VNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQ
           RK RDK  SAKPRTTLSA ELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQ
Subjt:  VNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQ

SwissProt top hitse value%identityAlignment
Q0IGK1 Methyl-CpG-binding domain protein 4-like protein2.0e-1031.7Show/hide
Query:  EDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSV-QKSGTDK---RVRIVSRYFQNSEKNLEVDREVSPSLQNSKSNQQAEQIVS
        +DDD ++++   +R    ++    R+       ++ + Q   G  S SV  K G  K   +V  VS YFQ S  + + D ++  S Q+ ++ ++     S
Subjt:  EDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSV-QKSGTDK---RVRIVSRYFQNSEKNLEVDREVSPSLQNSKSNQQAEQIVS

Query:  RFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRK--HRD--------KSSSAKPRTT------LSAAELFLEAYRRKSSDDTWKPPPSGIRLLQ
        +   K ++      +   +EQ NQ  K ++   K V V +  H D        K  S   R T      LS ++   + Y RK+ D+TW PP S   LLQ
Subjt:  RFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRK--HRD--------KSSSAKPRTT------LSAAELFLEAYRRKSSDDTWKPPPSGIRLLQ

Query:  QDHAYDPWRVLVICMLLNRTTGQQ
        +DH +DPWRVLVICMLLN+T+G Q
Subjt:  QDHAYDPWRVLVICMLLNRTTGQQ

Arabidopsis top hitse value%identityAlignment
AT3G07930.1 DNA glycosylase superfamily protein3.5e-1031.22Show/hide
Query:  EDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSV-QKSGTDK---RVRIVSRYFQNSEKNLEVDREVSPSLQNSKSNQQAEQIVS
        +DDD ++++   +R    ++    R+       ++ + Q   G  S SV  K G  K   +V  VS YFQ S  + + D ++  S Q+ ++ ++     S
Subjt:  EDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSV-QKSGTDK---RVRIVSRYFQNSEKNLEVDREVSPSLQNSKSNQQAEQIVS

Query:  RFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRK--HRD--------KSSSAKPRTT------LSAAELFLEAYRRKSSDDTWKPPPSGIRLLQ
        +   K ++      +   +EQ NQ  K ++   K V V +  H D        K  S   R T      LS ++   + Y RK+ D+TW PP S   LLQ
Subjt:  RFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRK--HRD--------KSSSAKPRTT------LSAAELFLEAYRRKSSDDTWKPPPSGIRLLQ

Query:  QDHAYDPWRVLVICMLLNRTT
        +DH +DPWRVLVICMLLN+T+
Subjt:  QDHAYDPWRVLVICMLLNRTT

AT3G07930.2 DNA glycosylase superfamily protein4.9e-1232Show/hide
Query:  EDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSV-QKSGTDK---RVRIVSRYFQNSEKNLEVDREVSPSLQNSKSNQQAEQIVS
        +DDD ++++   +R    ++    R+       ++ + Q   G  S SV  K G  K   +V  VS YFQ S  + + D ++  S Q+ ++ ++     S
Subjt:  EDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSV-QKSGTDK---RVRIVSRYFQNSEKNLEVDREVSPSLQNSKSNQQAEQIVS

Query:  RFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRK--HRD--------KSSSAKPRTT------LSAAELFLEAYRRKSSDDTWKPPPSGIRLLQ
        +   K ++      +   +EQ NQ  K ++   K V V +  H D        K  S   R T      LS ++   + Y RK+ D+TW PP S   LLQ
Subjt:  RFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRK--HRD--------KSSSAKPRTT------LSAAELFLEAYRRKSSDDTWKPPPSGIRLLQ

Query:  QDHAYDPWRVLVICMLLNRTTGQQV
        +DH +DPWRVLVICMLLN+T+G QV
Subjt:  QDHAYDPWRVLVICMLLNRTTGQQV

AT3G07930.3 DNA glycosylase superfamily protein1.4e-1131.7Show/hide
Query:  EDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSV-QKSGTDK---RVRIVSRYFQNSEKNLEVDREVSPSLQNSKSNQQAEQIVS
        +DDD ++++   +R    ++    R+       ++ + Q   G  S SV  K G  K   +V  VS YFQ S  + + D ++  S Q+ ++ ++     S
Subjt:  EDDDANLTEQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSV-QKSGTDK---RVRIVSRYFQNSEKNLEVDREVSPSLQNSKSNQQAEQIVS

Query:  RFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRK--HRD--------KSSSAKPRTT------LSAAELFLEAYRRKSSDDTWKPPPSGIRLLQ
        +   K ++      +   +EQ NQ  K ++   K V V +  H D        K  S   R T      LS ++   + Y RK+ D+TW PP S   LLQ
Subjt:  RFFQKSQKQQAVNSQQEATEQLNQCAKSVKRVRKPVNVRK--HRD--------KSSSAKPRTT------LSAAELFLEAYRRKSSDDTWKPPPSGIRLLQ

Query:  QDHAYDPWRVLVICMLLNRTTGQQ
        +DH +DPWRVLVICMLLN+T+G Q
Subjt:  QDHAYDPWRVLVICMLLNRTTGQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGCAACAGCTAGCATCAATCCTAACCTCACCCCTCCATCCTCTTCTTCACATCCCGACGATTTGTTTTCCCAATTCGCCTTTCGAGGTAGTTCCCGCTCCAGATT
TCGCTTTCCTCCTTCCAAATCCACTCAACAAAACCCTACGTCCGAGGATTTTACCCAAAACACCACGATTCTCATGACCCAACACTCTCCAATTTCAACTCTTGAGGATC
TCCAAACTTCAGAACCCAAGAATCATCAGAACAAACCCTTTGCCCGCGAGATTCCCATTTGCCCTTTTGAGGATCTTCAAAACTGTCCAAACTGTGAGATTCCAATAACA
TCCCTCTCTTCTGAAGCGCACGAGCCTCCTCTATTAACACTAGACGATCTTCAAAATGCAAAAGCAGACCATCAACCGCCAAGAAAGCCTTCACTGGCGCGTAGAGTGTT
ACGTTTTTACCGAGAGTTCGGATTTGATCAAAAAATGGTGCAAACAACTTCGCATTCTGTCCTAAATTTAGAACCAGTTCAACAAGGGGCCCGTGTGGTTTCGCGTTATT
TCCAAAACTCAAAATCAACTCAACAAGGAGAACGAATTGTCTCACGATACTTTCAAAACTCGGAGAAGGAACAAGCAGCCCATATTGAGGATGATGATGCCAATCTCACA
GAGCAGATAAGTAAAAGATCAATGGTGGGAGACTACAGCAAGAGGAGGAGGAAAGACGTAGCCCCCAGCTCCGATAACTCAAAAACAAATCAACATTCAATGGGAAAAGC
TTCACGCTCTGTTCAGAAGTCAGGAACAGATAAACGAGTGCGAATTGTTTCGCGCTATTTCCAAAATTCAGAAAAGAATCTTGAAGTGGATCGAGAAGTTTCACCTTCCT
TACAAAATTCAAAATCAAATCAACAAGCAGAGCAAATTGTCTCACGTTTCTTTCAAAAATCACAAAAGCAACAAGCCGTGAACAGTCAGCAAGAGGCTACAGAGCAGCTA
AATCAGTGTGCGAAATCTGTTAAAAGGGTCCGCAAACCAGTTAATGTAAGGAAACATAGGGATAAATCAAGTTCTGCTAAACCTCGGACCACTCTTTCTGCTGCAGAGTT
GTTTTTGGAAGCTTATAGAAGGAAATCGTCAGATGATACATGGAAGCCTCCTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCGTACGACCCTTGGAGGGTTCTAG
TCATATGTATGCTCCTTAACCGGACAACTGGGCAACAGGTATCTATCATCCGCTATCCATTTACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGCAACAGCTAGCATCAATCCTAACCTCACCCCTCCATCCTCTTCTTCACATCCCGACGATTTGTTTTCCCAATTCGCCTTTCGAGGTAGTTCCCGCTCCAGATT
TCGCTTTCCTCCTTCCAAATCCACTCAACAAAACCCTACGTCCGAGGATTTTACCCAAAACACCACGATTCTCATGACCCAACACTCTCCAATTTCAACTCTTGAGGATC
TCCAAACTTCAGAACCCAAGAATCATCAGAACAAACCCTTTGCCCGCGAGATTCCCATTTGCCCTTTTGAGGATCTTCAAAACTGTCCAAACTGTGAGATTCCAATAACA
TCCCTCTCTTCTGAAGCGCACGAGCCTCCTCTATTAACACTAGACGATCTTCAAAATGCAAAAGCAGACCATCAACCGCCAAGAAAGCCTTCACTGGCGCGTAGAGTGTT
ACGTTTTTACCGAGAGTTCGGATTTGATCAAAAAATGGTGCAAACAACTTCGCATTCTGTCCTAAATTTAGAACCAGTTCAACAAGGGGCCCGTGTGGTTTCGCGTTATT
TCCAAAACTCAAAATCAACTCAACAAGGAGAACGAATTGTCTCACGATACTTTCAAAACTCGGAGAAGGAACAAGCAGCCCATATTGAGGATGATGATGCCAATCTCACA
GAGCAGATAAGTAAAAGATCAATGGTGGGAGACTACAGCAAGAGGAGGAGGAAAGACGTAGCCCCCAGCTCCGATAACTCAAAAACAAATCAACATTCAATGGGAAAAGC
TTCACGCTCTGTTCAGAAGTCAGGAACAGATAAACGAGTGCGAATTGTTTCGCGCTATTTCCAAAATTCAGAAAAGAATCTTGAAGTGGATCGAGAAGTTTCACCTTCCT
TACAAAATTCAAAATCAAATCAACAAGCAGAGCAAATTGTCTCACGTTTCTTTCAAAAATCACAAAAGCAACAAGCCGTGAACAGTCAGCAAGAGGCTACAGAGCAGCTA
AATCAGTGTGCGAAATCTGTTAAAAGGGTCCGCAAACCAGTTAATGTAAGGAAACATAGGGATAAATCAAGTTCTGCTAAACCTCGGACCACTCTTTCTGCTGCAGAGTT
GTTTTTGGAAGCTTATAGAAGGAAATCGTCAGATGATACATGGAAGCCTCCTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCGTACGACCCTTGGAGGGTTCTAG
TCATATGTATGCTCCTTAACCGGACAACTGGGCAACAGGTATCTATCATCCGCTATCCATTTACTTGA
Protein sequenceShow/hide protein sequence
MTATASINPNLTPPSSSSHPDDLFSQFAFRGSSRSRFRFPPSKSTQQNPTSEDFTQNTTILMTQHSPISTLEDLQTSEPKNHQNKPFAREIPICPFEDLQNCPNCEIPIT
SLSSEAHEPPLLTLDDLQNAKADHQPPRKPSLARRVLRFYREFGFDQKMVQTTSHSVLNLEPVQQGARVVSRYFQNSKSTQQGERIVSRYFQNSEKEQAAHIEDDDANLT
EQISKRSMVGDYSKRRRKDVAPSSDNSKTNQHSMGKASRSVQKSGTDKRVRIVSRYFQNSEKNLEVDREVSPSLQNSKSNQQAEQIVSRFFQKSQKQQAVNSQQEATEQL
NQCAKSVKRVRKPVNVRKHRDKSSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQVSIIRYPFT