; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G04880 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G04880
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionmethyl-CpG-binding domain protein 4-like protein
Genome locationClcChr08:14996780..15001742
RNA-Seq ExpressionClc08G04880
SyntenyClc08G04880
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR011257 - DNA glycosylase
IPR045138 - Methyl-CpG binding protein MeCP2/MBD4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7022375.1 Methyl-CpG-binding domain protein 4-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]5.0e-19470.93Show/hide
Query:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFP----PSESTQQNPTSQDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPS
        MTAT  +NPN +PP SSSFPD LFSQFAF+G S SRF FP    PSES +QNPT +DFTQ  + LM Q+SPISTLE  Q SE+ NHQ      +I I   
Subjt:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFP----PSESTQQNPTSQDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPS

Query:  DDLQNCPNCEI------------PVTSLSS----EAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVV
        +DLQ+ P  EI            P T  S      AHEPPILTL+DLQNAK DH P  KP LARRVLRFYR+FGFD+++VQ T     N  PVQ   RVV
Subjt:  DDLQNCPNCEI------------PVTSLSS----EAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVV

Query:  SRYFQNSKSTQQGERIVARYFQNSEKERAARNEDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNS
        SR+FQ SKSTQQGERIV+RYFQ+SE E+A+ NED+D + T+Q  KRS VG Y KRRRK VAPSSD SK  Q S+ K+SRSV+KSGTD+RVRIVSRYFQNS
Subjt:  SRYFQNSKSTQQGERIVARYFQNSEKERAARNEDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNS

Query:  EKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDT
        EKN EV+ EVSP L++SK+NQQ E++VSRFFQKS +Q+ VN+QQE T+Q +Q AKSVKR+RKP  ERK RDK  SA+PRTTLSA ELFLEAYRRKS DDT
Subjt:  EKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDT

Query:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGV
        WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLF+LCP+ ++ALEVS EQIEDIIRPLGLQRKRS T+QRLSEMYLKESWSHVTQLPGV
Subjt:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGV

Query:  GKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL
        GKYGADAHAIFCTGYW EV+P+DHMLNYYW+FLHSIKHLL
Subjt:  GKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL

XP_004142362.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucumis sativus]1.7e-19471.89Show/hide
Query:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDL
        M +T SI+PNLTPPSSSS+P DLFS+F FRG+SRSRF FPPS+S QQ+P   QD         TQHSP+STL D Q  E  NH N+ LA           
Subjt:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDL

Query:  QNCPNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVAR
                   S SSE HEPPILTL+DLQN K     P++PSLARRVL FYREFGFD+K++Q TSHS LN  P Q+G RVVSRYFQNS+STQQ +RIV+R
Subjt:  QNCPNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVAR

Query:  YFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS
        YFQ S KER A  ED  D  + TEQ SKRS     SKRRRK V P SD SKTN HS+GK +RSVQKSGTD +VRIVS YFQ+ EK+LE+DREVSP L++S
Subjt:  YFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS

Query:  KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAY
        KSNQQ E++VSRFF KS KQQAVN+Q+EATEQLNQ AKSVKR+RKPVNERK++DKTSS KPRTTL+AAELFLEAYRRKS  DTWKPP SG RLLQ DHAY
Subjt:  KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAY

Query:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWN
        DPWRVLVICMLLNRT+GQQAKEVIPKLFSLCPN +A LEVS EQIEDIIRPLG  RKRSRTM RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+
Subjt:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWN

Query:  EVVPEDHMLNYYWDFLHSIKHLL
        EV P+DHMLNYYWDFLHSIKHLL
Subjt:  EVVPEDHMLNYYWDFLHSIKHLL

XP_008460559.1 PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo]1.0e-19974.76Show/hide
Query:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDL
        M AT SINPNLTPPSSSS+P DLFS+F FRG+SRSRF FPPS+S  QNP   QD         TQHSPISTL D Q SE  NH NK LA           
Subjt:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDL

Query:  QNCPNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVAR
                   S SSEA EPPILTL+DLQN K     P+KPSLARRVL FYREFGFD+K++Q TSHS LN EPVQ+G RVVSRYFQNS+STQQ ERIV+R
Subjt:  QNCPNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVAR

Query:  YFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS
        YF+ S KERAA  ED  DD + TEQ SKRS     SKRRRK V PSS  SKTN HSMGK SRSVQKS TD R RIVS YFQ SEK+LE+DREVSP L++S
Subjt:  YFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS

Query:  KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAY
        KSNQQ E+MVSRFF KS KQQAVN+Q+EATEQLNQ AKSVKRVRKPVNERK ++KTSS KPRTTL+AAELFLEAYRRKS DDTWKPPPSG RLLQ DHAY
Subjt:  KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAY

Query:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWN
        DPWRVLVICMLLNRT+G+QAKEVIPKLFSLCPN +A LEVS EQIEDIIRPLGL RKRSRTM RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+
Subjt:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWN

Query:  EVVPEDHMLNYYWDFLHSIKHLL
        EV P+DHMLNYYWDFLHSIKHLL
Subjt:  EVVPEDHMLNYYWDFLHSIKHLL

XP_022931728.1 methyl-CpG-binding domain protein 4-like protein [Cucurbita moschata]1.8e-19169.67Show/hide
Query:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFP----PSESTQQNPTSQDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPS
        MTAT  +NPNL+PPSSSSFPD LFSQFAF+G S SRF FP    PSES +QNPT +DFTQ  T LM Q+SPISTLE  Q SES NHQ     ++I I   
Subjt:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFP----PSESTQQNPTSQDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPS

Query:  DDLQNCPN-----------CEIPVTSLSSE-----AHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVV
        +DLQ+ P             E+   + +SE      HEPPILTL+D+QNAK DH P  +P LARRVLRFYR+FGFD+++VQ T  S  N  PVQ+  RVV
Subjt:  DDLQNCPN-----------CEIPVTSLSSE-----AHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVV

Query:  SRYFQNSKSTQQGERIVARYFQNSEKERAARNEDDDAD----FTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRY
        SR+FQ SKS QQGERIV+RYFQ+SE ERAA NED+D D     T+Q  KRS VG Y KRRRK VA SSD SK  Q S+ K+SR V++SGTD+RVR VSRY
Subjt:  SRYFQNSKSTQQGERIVARYFQNSEKERAARNEDDDAD----FTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRY

Query:  FQNSEKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKS
        FQNSEKN EV+ EVSP L++SK+ QQ E++VSRFFQKS +Q+ VN+QQE  +  +Q AKSVKR+RKP  ERK RDK  SA+PRTTLSA ELFLEAYRRKS
Subjt:  FQNSEKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKS

Query:  SDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQ
        SDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLF+LCP+ ++ALEVS EQIEDIIRPLGLQRKRS T+QRLSEMYLKESWSHVTQ
Subjt:  SDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQ

Query:  LPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL
        LPGVGKYGADAHAIFCTGYW EV+P+DHMLNYYW+FLHSIKHLL
Subjt:  LPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL

XP_038892490.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Benincasa hispida]3.1e-20475Show/hide
Query:  ATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTSQDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNC
        ATASIN NLTPPSSSS+PDDLFSQFAFRGSSRSR C  PS+S+QQNPTSQDFTQNTTIL+ QHSPI+T ED Q SE KNHQNK L+R+I ICP       
Subjt:  ATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTSQDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNC

Query:  PNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVARYFQ
           EIP++S SS+ +EPPILTL+DLQNAKP   PP+KP LARR+L FYREFGFDQK+ Q TSHS LN EPVQ+GAR+ SRYFQNSKSTQQGER V+RYFQ
Subjt:  PNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVARYFQ

Query:  NSEKERAARNEDDDAD--FTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSSKSN
         S K+R A NED+D D   TEQ SKRS     SKRRRK V PSSD SKTNQHSMGKASRS+QKSGTD+RVRIVSRYFQNSEKN+EVDR            
Subjt:  NSEKERAARNEDDDAD--FTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSSKSN

Query:  QQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPW
                                EAT+Q+NQRAKS KRVRKPVNERK RDKTSS+KPRTTL+AAEL LEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPW
Subjt:  QQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPW

Query:  RVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWNEVV
        RVLVICMLLNRT+GQQAKEVIPKLF LCPN +A L+VS EQIEDIIRPLGLQRKRSRTMQ LSEMYLKE+WSHVTQLPGVGKYGADAHAIFCTGYWNEV 
Subjt:  RVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWNEVV

Query:  PEDHMLNYYWDFLHSIKHLL
        P+DHMLNYYW+FLHSI+HLL
Subjt:  PEDHMLNYYWDFLHSIKHLL

TrEMBL top hitse value%identityAlignment
A0A0A0KRW9 ENDO3c domain-containing protein1.3e-17169.49Show/hide
Query:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDL
        M +T SI+PNLTPPSSSS+P DLFS+F FRG+SRSRF FPPS+S QQ+P   QD         TQHSP+STL D Q  E  NH N+ LA           
Subjt:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDL

Query:  QNCPNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVAR
                   S SSE HEPPILTL+DLQN K     P++PSLARRVL FYREFGFD+K++Q TSHS LN  P Q+G RVVSRYFQNS+STQQ +RIV+R
Subjt:  QNCPNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVAR

Query:  YFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS
        YFQ S KER A  ED  D  + TEQ SKRS     SKRRRK V P SD SKTN HS+GK +RSVQKSGTD +VRIVS YFQ+ EK+LE+DREVSP L++S
Subjt:  YFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS

Query:  KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAY
        KSNQQ E++VSRFF KS KQQAVN+Q+EATEQLNQ AKSVKR+RKPVNERK++DKTSS KPRTTL+AAELFLEAYRRKS  DTWKPP SG RLLQ DHAY
Subjt:  KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAY

Query:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFC
        DPWRVLVICMLLNRT+GQQAKEVIPKLFSLCPN +A LEVS EQIEDIIRPLG  RKRSRTM RLSEMYLKESWSHVTQLPGVGKY A    + C
Subjt:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFC

A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein5.0e-20074.76Show/hide
Query:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDL
        M AT SINPNLTPPSSSS+P DLFS+F FRG+SRSRF FPPS+S  QNP   QD         TQHSPISTL D Q SE  NH NK LA           
Subjt:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDL

Query:  QNCPNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVAR
                   S SSEA EPPILTL+DLQN K     P+KPSLARRVL FYREFGFD+K++Q TSHS LN EPVQ+G RVVSRYFQNS+STQQ ERIV+R
Subjt:  QNCPNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVAR

Query:  YFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS
        YF+ S KERAA  ED  DD + TEQ SKRS     SKRRRK V PSS  SKTN HSMGK SRSVQKS TD R RIVS YFQ SEK+LE+DREVSP L++S
Subjt:  YFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS

Query:  KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAY
        KSNQQ E+MVSRFF KS KQQAVN+Q+EATEQLNQ AKSVKRVRKPVNERK ++KTSS KPRTTL+AAELFLEAYRRKS DDTWKPPPSG RLLQ DHAY
Subjt:  KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAY

Query:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWN
        DPWRVLVICMLLNRT+G+QAKEVIPKLFSLCPN +A LEVS EQIEDIIRPLGL RKRSRTM RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+
Subjt:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWN

Query:  EVVPEDHMLNYYWDFLHSIKHLL
        EV P+DHMLNYYWDFLHSIKHLL
Subjt:  EVVPEDHMLNYYWDFLHSIKHLL

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein1.3e-18772.99Show/hide
Query:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDL
        M AT SINPNLTPPSSSS+P DLFS+F FRG+SRSRF FPPS+S  QNP   QD         TQHSPISTL D Q SE  NH NK LA           
Subjt:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTS-QDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDL

Query:  QNCPNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVAR
                   S SSEA EPPILTL+DLQN K     P+KPSLARRVL FYREFGFD+K++Q TSHS LN EPVQ+G RVVSRYFQNS+STQQ ERIV+R
Subjt:  QNCPNCEIPVTSLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVAR

Query:  YFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS
        YF+ S KERAA  ED  DD + TEQ SKRS     SKRRRK V PSS  SKTN HSMGK SRSVQKS TD R RIVS YFQ SEK+LE+DREVSP L++S
Subjt:  YFQNSEKERAARNED--DDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSS

Query:  KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAY
        KSNQQ E+MVSRFF KS KQQAVN+Q+EATEQLNQ AKSVKRVRKPVNERK ++KTSS KPRTTL+AAELFLEAYRRKS DDTWKPPPSG RLLQ DHAY
Subjt:  KSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAY

Query:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWN
        DPWRVLVICMLLNRT+G+QAKEVIPKLFSLCPN +A LEVS EQIEDIIRPLGL RKRSRTM RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWN
Subjt:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWN

Query:  EVVPEDHMLNY
          V E  ++++
Subjt:  EVVPEDHMLNY

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein8.6e-19269.67Show/hide
Query:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFP----PSESTQQNPTSQDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPS
        MTAT  +NPNL+PPSSSSFPD LFSQFAF+G S SRF FP    PSES +QNPT +DFTQ  T LM Q+SPISTLE  Q SES NHQ     ++I I   
Subjt:  MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFP----PSESTQQNPTSQDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPS

Query:  DDLQNCPN-----------CEIPVTSLSSE-----AHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVV
        +DLQ+ P             E+   + +SE      HEPPILTL+D+QNAK DH P  +P LARRVLRFYR+FGFD+++VQ T  S  N  PVQ+  RVV
Subjt:  DDLQNCPN-----------CEIPVTSLSSE-----AHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVV

Query:  SRYFQNSKSTQQGERIVARYFQNSEKERAARNEDDDAD----FTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRY
        SR+FQ SKS QQGERIV+RYFQ+SE ERAA NED+D D     T+Q  KRS VG Y KRRRK VA SSD SK  Q S+ K+SR V++SGTD+RVR VSRY
Subjt:  SRYFQNSKSTQQGERIVARYFQNSEKERAARNEDDDAD----FTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRY

Query:  FQNSEKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKS
        FQNSEKN EV+ EVSP L++SK+ QQ E++VSRFFQKS +Q+ VN+QQE  +  +Q AKSVKR+RKP  ERK RDK  SA+PRTTLSA ELFLEAYRRKS
Subjt:  FQNSEKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKS

Query:  SDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQ
        SDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLF+LCP+ ++ALEVS EQIEDIIRPLGLQRKRS T+QRLSEMYLKESWSHVTQ
Subjt:  SDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQ

Query:  LPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL
        LPGVGKYGADAHAIFCTGYW EV+P+DHMLNYYW+FLHSIKHLL
Subjt:  LPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X17.1e-17070.86Show/hide
Query:  MTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEI------------PVTSLSSE----AHEPPILTLDDLQNAKPDHYPPRKPSLARR
        M  +SPISTLE  Q SE+ NHQ      +I I   + LQ+ P  EI            P T  S      AHEPPILTL+DLQNAK DH P  KP LARR
Subjt:  MTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEI------------PVTSLSSE----AHEPPILTLDDLQNAKPDHYPPRKPSLARR

Query:  VLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVARYFQNSEKERAARN--EDDDADFTEQTSKRSMVGGYSKRRRKYVAPS
        VLRF R+FGFD+++VQ T  S  N  PVQ+  RVVSR+FQ SKS QQGERIV+RYFQ+SE ERAA N  EDDD + T+Q  KRS VG Y KRRRK VA S
Subjt:  VLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVARYFQNSEKERAARN--EDDDADFTEQTSKRSMVGGYSKRRRKYVAPS

Query:  SDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKP
        SD SK  Q S+ K+SRS++KSGTD+RVRIVSRYFQNSEKN EV+ EVSP L++SK+NQQ E++VSRFFQKS + + VN+QQE  +  +Q AKSVKR+RKP
Subjt:  SDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRKP

Query:  VNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIE
          ERK RDK  SAKPRTTLSA ELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLF+LCP+ ++ALEVS EQIE
Subjt:  VNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIE

Query:  DIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL
        DIIRPLGLQRKRS T+QRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW EV+P+DHMLNYYW+FLHSIKHLL
Subjt:  DIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL

SwissProt top hitse value%identityAlignment
O95243 Methyl-CpG-binding domain protein 41.1e-2638.57Show/hide
Query:  RKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSH
        R+ +   W PP S   L+Q+   +DPW++L+  + LNRT+G+ A  V+ K     P+AE A       + ++++PLGL   R++T+ + S+ YL + W +
Subjt:  RKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSH

Query:  VTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFL
          +L G+GKYG D++ IFC   W +V PEDH LN Y D+L
Subjt:  VTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFL

Q0IGK1 Methyl-CpG-binding domain protein 4-like protein1.7e-4841.25Show/hide
Query:  EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR---RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQT
        +DDD   ++   +R     +    R+        + + Q   G  S SV  K G  +   +V  VS YFQ S     + ++    +     R   S +Q 
Subjt:  EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR---RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQT

Query:  E-QMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQ
        + + VS +FQ+S   +  N   +A + L    K VK  R        VNE    K R+   +      LS ++   + Y RK+ D+TW PP S   LLQ+
Subjt:  E-QMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQ

Query:  DHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCT
        DH +DPWRVLVICMLLN+T+G Q + VI  LF LC +A+ A EV  E+IE++I+PLGLQ+KR++ +QRLS  YL+ESW+HVTQL GVGKY ADA+AIFC 
Subjt:  DHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCT

Query:  GYWNEVVPEDHMLNYYWDFL
        G W+ V P DHMLNYYWD+L
Subjt:  GYWNEVVPEDHMLNYYWDFL

Q7LX22 Thymine/uracil-DNA glycosylase3.3e-0732.99Show/hide
Query:  AYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHV-------TQLPGVGKYGA
        A DPW VLV  +LL +TT +Q  ++  +     P+     + S E+I+ II+PLG++  R+  +++LSE  ++     +         LPGVG Y A
Subjt:  AYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHV-------TQLPGVGKYGA

Q9YDP0 Thymine-DNA glycosylase1.3e-0630.11Show/hide
Query:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMY-------LKESWSHVTQLPGVGKY
        DPW +LV   LL +TT +Q   V  +     PN +A      +++ ++IRPLG++ +R++ +  L++         +  S   + +LPGVG Y
Subjt:  DPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMY-------LKESWSHVTQLPGVGKY

Q9Z2D7 Methyl-CpG-binding domain protein 44.2e-2638.57Show/hide
Query:  RKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSH
        R+ S   W PP S   L+Q+   +DPW++L+  + LNRT+G+ A  V+ +     P+AE A       + ++++PLGL   R++T+ + S+ YL + W +
Subjt:  RKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSH

Query:  VTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFL
          +L G+GKYG D++ IFC   W +V PEDH LN Y D+L
Subjt:  VTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFL

Arabidopsis top hitse value%identityAlignment
AT3G07930.1 DNA glycosylase superfamily protein1.1e-0831.36Show/hide
Query:  EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR---RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQT
        +DDD   ++   +R     +    R+        + + Q   G  S SV  K G  +   +V  VS YFQ S     + ++    +     R   S +Q 
Subjt:  EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR---RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQT

Query:  E-QMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQ
        + + VS +FQ+S   +  N   +A + L    K VK  R        VNE    K R+   +      LS ++   + Y RK+ D+TW PP S   LLQ+
Subjt:  E-QMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQ

Query:  DHAYDPWRVLVICMLLNRTT
        DH +DPWRVLVICMLLN+T+
Subjt:  DHAYDPWRVLVICMLLNRTT

AT3G07930.2 DNA glycosylase superfamily protein5.6e-1031.84Show/hide
Query:  EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR---RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQT
        +DDD   ++   +R     +    R+        + + Q   G  S SV  K G  +   +V  VS YFQ S     + ++    +     R   S +Q 
Subjt:  EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR---RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQT

Query:  E-QMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQ
        + + VS +FQ+S   +  N   +A + L    K VK  R        VNE    K R+   +      LS ++   + Y RK+ D+TW PP S   LLQ+
Subjt:  E-QMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQ

Query:  DHAYDPWRVLVICMLLNRTTGQQ
        DH +DPWRVLVICMLLN+T+G Q
Subjt:  DHAYDPWRVLVICMLLNRTTGQQ

AT3G07930.3 DNA glycosylase superfamily protein1.2e-4941.25Show/hide
Query:  EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR---RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQT
        +DDD   ++   +R     +    R+        + + Q   G  S SV  K G  +   +V  VS YFQ S     + ++    +     R   S +Q 
Subjt:  EDDDADFTEQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSV-QKSGTDR---RVRIVSRYFQNS-----EKNLEVDREVSPCLRSSKSNQQT

Query:  E-QMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQ
        + + VS +FQ+S   +  N   +A + L    K VK  R        VNE    K R+   +      LS ++   + Y RK+ D+TW PP S   LLQ+
Subjt:  E-QMVSRFFQKSAKQQAVNSQQEATEQLNQRAKSVKRVRK------PVNE---RKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQ

Query:  DHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCT
        DH +DPWRVLVICMLLN+T+G Q + VI  LF LC +A+ A EV  E+IE++I+PLGLQ+KR++ +QRLS  YL+ESW+HVTQL GVGKY ADA+AIFC 
Subjt:  DHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCT

Query:  GYWNEVVPEDHMLNYYWDFL
        G W+ V P DHMLNYYWD+L
Subjt:  GYWNEVVPEDHMLNYYWDFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGCAACAGCAAGCATCAATCCTAACCTCACCCCTCCGTCCTCTTCTTCCTTTCCCGACGATTTGTTTTCCCAATTCGCCTTTCGAGGTAGTTCGCGCTCCAGATT
TTGCTTTCCTCCTTCAGAATCCACTCAACAAAACCCTACGTCCCAGGATTTTACCCAAAACACTACGATTCTCATGACCCAACACTCTCCAATTTCCACTCTTGAGGATT
TCCAAATTTCAGAATCCAAGAATCATCAGAACAAACCCTTAGCCCGCAAGATTTCCATTTGCCCTTCTGATGATCTTCAAAACTGTCCAAACTGTGAGATTCCGGTAACA
TCCCTCTCTTCTGAAGCGCACGAGCCTCCTATTTTAACACTAGACGATCTTCAAAATGCCAAACCAGACCATTACCCGCCAAGAAAGCCTTCACTGGCGCGTAGAGTGTT
ACGTTTTTACCGAGAATTCGGATTTGATCAAAAAATGGTGCAAACAACTTCGCATTCTGACCTAAATTTAGAACCAGTTCAACAAGGGGCCCGTGTGGTTTCGCGATATT
TCCAAAACTCAAAATCAACCCAACAAGGTGAACGAATTGTCGCACGATACTTTCAAAACTCGGAGAAGGAACGAGCAGCCCGTAATGAGGATGATGATGCCGATTTCACA
GAGCAGACAAGTAAAAGATCAATGGTGGGAGGCTACAGCAAAAGGAGGAGGAAATACGTGGCCCCCAGCTCCGATAAGTCAAAAACAAATCAACATTCAATGGGAAAAGC
TTCACGCTCTGTTCAGAAGTCAGGAACAGATAGACGAGTTCGAATTGTTTCGCGCTATTTCCAAAATTCAGAAAAGAATCTTGAAGTGGATCGAGAAGTTTCACCTTGCT
TACGAAGTTCAAAATCAAATCAACAAACGGAGCAAATGGTCTCACGTTTCTTTCAAAAATCAGCAAAGCAACAAGCCGTGAACAGTCAGCAAGAGGCTACAGAGCAGCTA
AATCAGCGTGCGAAATCTGTTAAAAGGGTCCGTAAACCAGTTAATGAAAGGAAAGATAGGGATAAAACAAGTTCTGCTAAACCTCGGACCACTCTTTCTGCTGCAGAGTT
GTTTTTGGAAGCTTATAGAAGGAAATCGTCAGATGATACATGGAAGCCTCCTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCCTACGACCCTTGGAGGGTTCTAG
TCATATGTATGCTCCTTAACCGGACAACTGGGCAGCAGGCAAAAGAAGTAATACCAAAACTCTTCAGTTTGTGTCCCAATGCAGAGGCTGCTTTGGAGGTATCACATGAG
CAGATAGAAGATATCATTCGACCTCTTGGTTTACAAAGAAAAAGATCACGAACAATGCAGCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGCCATGTCACCCAGCT
TCCTGGTGTTGGCAAGTATGGAGCTGATGCACATGCAATATTCTGCACTGGATATTGGAATGAAGTAGTACCTGAAGATCACATGCTTAATTATTACTGGGATTTTCTCC
ACAGCATCAAACACCTGCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGCAACAGCAAGCATCAATCCTAACCTCACCCCTCCGTCCTCTTCTTCCTTTCCCGACGATTTGTTTTCCCAATTCGCCTTTCGAGGTAGTTCGCGCTCCAGATT
TTGCTTTCCTCCTTCAGAATCCACTCAACAAAACCCTACGTCCCAGGATTTTACCCAAAACACTACGATTCTCATGACCCAACACTCTCCAATTTCCACTCTTGAGGATT
TCCAAATTTCAGAATCCAAGAATCATCAGAACAAACCCTTAGCCCGCAAGATTTCCATTTGCCCTTCTGATGATCTTCAAAACTGTCCAAACTGTGAGATTCCGGTAACA
TCCCTCTCTTCTGAAGCGCACGAGCCTCCTATTTTAACACTAGACGATCTTCAAAATGCCAAACCAGACCATTACCCGCCAAGAAAGCCTTCACTGGCGCGTAGAGTGTT
ACGTTTTTACCGAGAATTCGGATTTGATCAAAAAATGGTGCAAACAACTTCGCATTCTGACCTAAATTTAGAACCAGTTCAACAAGGGGCCCGTGTGGTTTCGCGATATT
TCCAAAACTCAAAATCAACCCAACAAGGTGAACGAATTGTCGCACGATACTTTCAAAACTCGGAGAAGGAACGAGCAGCCCGTAATGAGGATGATGATGCCGATTTCACA
GAGCAGACAAGTAAAAGATCAATGGTGGGAGGCTACAGCAAAAGGAGGAGGAAATACGTGGCCCCCAGCTCCGATAAGTCAAAAACAAATCAACATTCAATGGGAAAAGC
TTCACGCTCTGTTCAGAAGTCAGGAACAGATAGACGAGTTCGAATTGTTTCGCGCTATTTCCAAAATTCAGAAAAGAATCTTGAAGTGGATCGAGAAGTTTCACCTTGCT
TACGAAGTTCAAAATCAAATCAACAAACGGAGCAAATGGTCTCACGTTTCTTTCAAAAATCAGCAAAGCAACAAGCCGTGAACAGTCAGCAAGAGGCTACAGAGCAGCTA
AATCAGCGTGCGAAATCTGTTAAAAGGGTCCGTAAACCAGTTAATGAAAGGAAAGATAGGGATAAAACAAGTTCTGCTAAACCTCGGACCACTCTTTCTGCTGCAGAGTT
GTTTTTGGAAGCTTATAGAAGGAAATCGTCAGATGATACATGGAAGCCTCCTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCCTACGACCCTTGGAGGGTTCTAG
TCATATGTATGCTCCTTAACCGGACAACTGGGCAGCAGGCAAAAGAAGTAATACCAAAACTCTTCAGTTTGTGTCCCAATGCAGAGGCTGCTTTGGAGGTATCACATGAG
CAGATAGAAGATATCATTCGACCTCTTGGTTTACAAAGAAAAAGATCACGAACAATGCAGCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGCCATGTCACCCAGCT
TCCTGGTGTTGGCAAGTATGGAGCTGATGCACATGCAATATTCTGCACTGGATATTGGAATGAAGTAGTACCTGAAGATCACATGCTTAATTATTACTGGGATTTTCTCC
ACAGCATCAAACACCTGCTCTGA
Protein sequenceShow/hide protein sequence
MTATASINPNLTPPSSSSFPDDLFSQFAFRGSSRSRFCFPPSESTQQNPTSQDFTQNTTILMTQHSPISTLEDFQISESKNHQNKPLARKISICPSDDLQNCPNCEIPVT
SLSSEAHEPPILTLDDLQNAKPDHYPPRKPSLARRVLRFYREFGFDQKMVQTTSHSDLNLEPVQQGARVVSRYFQNSKSTQQGERIVARYFQNSEKERAARNEDDDADFT
EQTSKRSMVGGYSKRRRKYVAPSSDKSKTNQHSMGKASRSVQKSGTDRRVRIVSRYFQNSEKNLEVDREVSPCLRSSKSNQQTEQMVSRFFQKSAKQQAVNSQQEATEQL
NQRAKSVKRVRKPVNERKDRDKTSSAKPRTTLSAAELFLEAYRRKSSDDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTTGQQAKEVIPKLFSLCPNAEAALEVSHE
QIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWNEVVPEDHMLNYYWDFLHSIKHLL