; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036630 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036630
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionmethyl-CpG-binding domain protein 4-like protein
Genome locationchr3:49689115..49708735
RNA-Seq ExpressionLag0036630
SyntenyLag0036630
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR045138 - Methyl-CpG binding protein MeCP2/MBD4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7022375.1 Methyl-CpG-binding domain protein 4-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]6.1e-22777.22Show/hide
Query:  MTSNLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTAENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVP
        M  N SPP SSS+PD LFSQFAF+G S SRFRFP SKCPS+SN++NPT E+ TQKR  LMA ++PIS L+ LQ SE NHQK A  HEIPIL IEDLQ+  
Subjt:  MTSNLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTAENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVP

Query:  PDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV
               PKR I  L +ED +EVSP T +SE ER  AHEPPILTLEDLQ AK+DHQPA KPPLARRVL+FYR+FGFD+Q+VQ+TPP VRNS+PVQ   RV
Subjt:  PDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV

Query:  VSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQN
        VSR+FQ SKS+QQGERIVSRYFQ+ EI QA+HNEDED N T+QP KRS VG+Y KRRRKDVAPSSDNSKA Q S+ K+SRSV+KSG DKRVRIVSRYFQN
Subjt:  VSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQN

Query:  SEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDT
        SEK+ E E  VSP+LQN  +NQQGERVVSRFFQ S +Q+VVNN+QEV +QP+QC KSVKRIR+PAKERK RDK SARPR+TL A +LFLEAYRRKS DDT
Subjt:  SEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDT

Query:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGV
        WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAK+VIPKLFTLCP+PK+ALEVS EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQLPGV
Subjt:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGV

Query:  GKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        GKYGADAHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  GKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

XP_008460559.1 PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo]1.8e-17566.54Show/hide
Query:  NLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTA-ENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVPPD
        NL+PPSSSSYP DLFS+F FRGTS SRFRFP    PSKS  +NP   ++ TQ       HS PIS L DLQ SEPN                        
Subjt:  NLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTA-ENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVPPD

Query:  HQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVS
                     N  +    SP   SSEA+     EPPILTLEDLQ  K   Q  KKP LARRVL FYR+FGFD++++Q T  SV NS PVQ+G RVVS
Subjt:  HQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVS

Query:  RYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDE--DANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQN
        RYFQNS+S+QQ ERIVSRYF+     +AAH EDE  D N TEQPSKRS     SKRRRKDV PSS NSK N HSMGKTSRSVQKS  D R RIVS YFQ 
Subjt:  RYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDE--DANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQN

Query:  SEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDK-SSARPRSTLPAAKLFLEAYRRKSADD
        SEKS E ++ VSP+LQN  SNQQ E++VSRFF  S KQQ VNN++E  EQ NQC KSVKR+R+P  ERKQ++K SS +PR+TL AA+LFLEAYRRKS DD
Subjt:  SEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDK-SSARPRSTLPAAKLFLEAYRRKSADD

Query:  TWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPG
        TWKPPPSG RLLQ DHAYDPWRVLVICMLLNRTSG+QAK+VIPKLF+LCPNPKA LEVS EQIEDIIRPLG  RKRSRTM RLSEMYL+ESWSHVTQLPG
Subjt:  TWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPG

Query:  VGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        VGKYGADAHAIFCTGYW+EV PKDHMLNYYW+FLHSIKHLL
Subjt:  VGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

XP_022931728.1 methyl-CpG-binding domain protein 4-like protein [Cucurbita moschata]9.0e-22375.18Show/hide
Query:  MTSNLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTAENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVP
        M  NLSPPSSSS+PD LFSQFAF+G S SRFRFP SKCPS+SN++NPT E+ TQKR  LMA ++PIS L+ LQ SE NHQK A   EIPIL IEDLQ+  
Subjt:  MTSNLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTAENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVP

Query:  PDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV
               PKR    L +ED +EVSP T +SE ER L HEPPILTLED+Q AK+DHQPA +PPLARRVL+FYR+FGFD+Q+VQ+TPPSVRNS+PVQ+  RV
Subjt:  PDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV

Query:  VSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHN----EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSR
        VSR+FQ SKS+QQGERIVSRYFQ+ EI +AAHN    EDED N T+QP KRS VG Y KRRRKDVA SSDNSKA Q S+ K+SR V++SG DKRVR VSR
Subjt:  VSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHN----EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSR

Query:  YFQNSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKS
        YFQNSEK+ E E  VSP LQN  + QQGER+VSRFFQ S +Q+VVNN+QEV++ P+QC KSVKRIR+PAKERK RDK SARPR+TL A +LFLEAYRRKS
Subjt:  YFQNSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKS

Query:  ADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQ
        +DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAK+VIPKLFTLCP+PK+ALEVS EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQ
Subjt:  ADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQ

Query:  LPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        LPGVGKYGADAHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  LPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

XP_022969557.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucurbita maxima]2.4e-19976.6Show/hide
Query:  MAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVPPDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAK
        MA ++PIS L+ LQ SE NHQK A  HEIPIL IE LQ+         PKR I  L +ED +EVSP T +SE ER LAHEPPILTLEDLQ AK+DHQPA 
Subjt:  MAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVPPDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAK

Query:  KPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDE--DANFTEQPSKRSMVGDYSKRR
        KPPLARRVL+F R+FGFD+Q+VQ+TPPSVRNS+PVQ+  RVVSR+FQ SKS+QQGERIVSRYFQ+ EI +AAHNEDE  D N T+QP KRS VG Y KRR
Subjt:  KPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDE--DANFTEQPSKRSMVGDYSKRR

Query:  RKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKS
        RKDVA SSDNSKA Q S+ K+SRS++KSG DKRVRIVSRYFQNSEK+ E E  VSP+LQN  +NQQ ERVVSRFFQ S + +VVNN+QEV++ P+QC KS
Subjt:  RKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKS

Query:  VKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEV
        VKRIR+PAKERK RDK SA+PR+TL A +LFLEAYRRKS+DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAK+VIPKLFTLCP+PK+ALEV
Subjt:  VKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEV

Query:  SHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        S EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQLPGVGKYGADAHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  SHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

XP_023529473.1 methyl-CpG-binding domain protein 4-like protein [Cucurbita pepo subsp. pepo]6.5e-22175.46Show/hide
Query:  MTSNLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTAENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVP
        M  NLSPPSSSS+PD       F     SRFRFP SKCPS SN +NPT E+ TQKR  LMA ++PIS L+ LQ SE NHQK A  HEIPIL IEDLQ+  
Subjt:  MTSNLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTAENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVP

Query:  PDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV
               PKR    L +ED ++VSP T +SE ER LAHEPPILTLEDLQ AK+DHQPA KPPLARRVL+FYR+FGFD+Q+VQ+TPPSVRNS+PVQ+  RV
Subjt:  PDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV

Query:  VSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHN--EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYF
        VSR+FQ SKS+QQGERIVSRYFQ+ EI QAAHN  EDED N T+QP KRS VG+Y KRRRKDVA SSDNSKA Q S+ K+SRSV+KSG+DKRVRIVSRYF
Subjt:  VSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHN--EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYF

Query:  QNSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSAD
        QNSEK+ E E  VSP+LQN  + QQGER+VSRFFQ S +Q+VVNN+QEV++ P+QC KSVKRIR+PAKERK RDK SARPR+TL A +LFLEAYRRKS+D
Subjt:  QNSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSAD

Query:  DTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLP
        DTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAKDVIPKLFTLCP+PK+ALEVS EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQLP
Subjt:  DTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLP

Query:  GVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        GVGKYGADAHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  GVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

TrEMBL top hitse value%identityAlignment
A0A0A0KRW9 ENDO3c domain-containing protein4.0e-15262.06Show/hide
Query:  NLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTA-ENLTQKRRILMAHSTPISALQDLQNSEP-NHQKQAFRHEIPILSIEDLQEVPP
        NL+PPSSSSYP DLFS+F FRGTS SRFRFP    PSKS Q++P   ++ TQ       HS P+S L DLQ  EP NH  ++                  
Subjt:  NLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTA-ENLTQKRRILMAHSTPISALQDLQNSEP-NHQKQAFRHEIPILSIEDLQEVPP

Query:  DHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVV
                              SP+++         HEPPILTLEDLQ  K   Q  K+P LARRVL FYR+FGFD++++Q T  SV NSVP Q+G RVV
Subjt:  DHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVV

Query:  SRYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDED--ANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQ
        SRYFQNS+S+QQ +RIVSRYFQ     + AH EDE+   N TEQPSKRS     SKRRRKDV P SDNSK N HS+GKT+RSVQKSG D +VRIVS YFQ
Subjt:  SRYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDED--ANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQ

Query:  NSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDK-SSARPRSTLPAAKLFLEAYRRKSAD
        + EKS E ++ VSP+LQN  SNQQ E+VVSRFF  S KQQ VNN++E  EQ NQC KSVKR+R+P  ERK++DK SS +PR+TL AA+LFLEAYRRKS  
Subjt:  NSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDK-SSARPRSTLPAAKLFLEAYRRKSAD

Query:  DTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLP
        DTWKPP SG RLLQ DHAYDPWRVLVICMLLNRTSGQQAK+VIPKLF+LCPNPKA LEVS EQIEDIIRPLGF RKRSRTM RLSEMYL+ESWSHVTQLP
Subjt:  DTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLP

Query:  GVGKYGADAHAIFC
        GVGKY A    + C
Subjt:  GVGKYGADAHAIFC

A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein9.0e-17666.54Show/hide
Query:  NLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTA-ENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVPPD
        NL+PPSSSSYP DLFS+F FRGTS SRFRFP    PSKS  +NP   ++ TQ       HS PIS L DLQ SEPN                        
Subjt:  NLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTA-ENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVPPD

Query:  HQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVS
                     N  +    SP   SSEA+     EPPILTLEDLQ  K   Q  KKP LARRVL FYR+FGFD++++Q T  SV NS PVQ+G RVVS
Subjt:  HQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVS

Query:  RYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDE--DANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQN
        RYFQNS+S+QQ ERIVSRYF+     +AAH EDE  D N TEQPSKRS     SKRRRKDV PSS NSK N HSMGKTSRSVQKS  D R RIVS YFQ 
Subjt:  RYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDE--DANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQN

Query:  SEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDK-SSARPRSTLPAAKLFLEAYRRKSADD
        SEKS E ++ VSP+LQN  SNQQ E++VSRFF  S KQQ VNN++E  EQ NQC KSVKR+R+P  ERKQ++K SS +PR+TL AA+LFLEAYRRKS DD
Subjt:  SEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDK-SSARPRSTLPAAKLFLEAYRRKSADD

Query:  TWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPG
        TWKPPPSG RLLQ DHAYDPWRVLVICMLLNRTSG+QAK+VIPKLF+LCPNPKA LEVS EQIEDIIRPLG  RKRSRTM RLSEMYL+ESWSHVTQLPG
Subjt:  TWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPG

Query:  VGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        VGKYGADAHAIFCTGYW+EV PKDHMLNYYW+FLHSIKHLL
Subjt:  VGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein8.7e-16364.27Show/hide
Query:  NLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTA-ENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVPPD
        NL+PPSSSSYP DLFS+F FRGTS SRFRFP    PSKS  +NP   ++ TQ       HS PIS L DLQ SEPN                        
Subjt:  NLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTA-ENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVPPD

Query:  HQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVS
                     N  +    SP   SSEA+     EPPILTLEDLQ  K   Q  KKP LARRVL FYR+FGFD++++Q T  SV NS PVQ+G RVVS
Subjt:  HQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVS

Query:  RYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDE--DANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQN
        RYFQNS+S+QQ ERIVSRYF+     +AAH EDE  D N TEQPSKRS     SKRRRKDV PSS NSK N HSMGKTSRSVQKS  D R RIVS YFQ 
Subjt:  RYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDE--DANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQN

Query:  SEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDK-SSARPRSTLPAAKLFLEAYRRKSADD
        SEKS E ++ VSP+LQN  SNQQ E++VSRFF  S KQQ VNN++E  EQ NQC KSVKR+R+P  ERKQ++K SS +PR+TL AA+LFLEAYRRKS DD
Subjt:  SEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDK-SSARPRSTLPAAKLFLEAYRRKSADD

Query:  TWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPG
        TWKPPPSG RLLQ DHAYDPWRVLVICMLLNRTSG+QAK+VIPKLF+LCPNPKA LEVS EQIEDIIRPLG  RKRSRTM RLSEMYL+ESWSHVTQLPG
Subjt:  TWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPG

Query:  VGKYGADAHAIFCTGYWNEVLPKDHMLNY
        VGKYGADAHAIFCTGYWN  + +  ++++
Subjt:  VGKYGADAHAIFCTGYWNEVLPKDHMLNY

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein4.4e-22375.18Show/hide
Query:  MTSNLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTAENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVP
        M  NLSPPSSSS+PD LFSQFAF+G S SRFRFP SKCPS+SN++NPT E+ TQKR  LMA ++PIS L+ LQ SE NHQK A   EIPIL IEDLQ+  
Subjt:  MTSNLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTAENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVP

Query:  PDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV
               PKR    L +ED +EVSP T +SE ER L HEPPILTLED+Q AK+DHQPA +PPLARRVL+FYR+FGFD+Q+VQ+TPPSVRNS+PVQ+  RV
Subjt:  PDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV

Query:  VSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHN----EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSR
        VSR+FQ SKS+QQGERIVSRYFQ+ EI +AAHN    EDED N T+QP KRS VG Y KRRRKDVA SSDNSKA Q S+ K+SR V++SG DKRVR VSR
Subjt:  VSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHN----EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSR

Query:  YFQNSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKS
        YFQNSEK+ E E  VSP LQN  + QQGER+VSRFFQ S +Q+VVNN+QEV++ P+QC KSVKRIR+PAKERK RDK SARPR+TL A +LFLEAYRRKS
Subjt:  YFQNSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKS

Query:  ADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQ
        +DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAK+VIPKLFTLCP+PK+ALEVS EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQ
Subjt:  ADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQ

Query:  LPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        LPGVGKYGADAHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  LPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X11.2e-19976.6Show/hide
Query:  MAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVPPDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAK
        MA ++PIS L+ LQ SE NHQK A  HEIPIL IE LQ+         PKR I  L +ED +EVSP T +SE ER LAHEPPILTLEDLQ AK+DHQPA 
Subjt:  MAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVPPDHQSSQPKRNIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAK

Query:  KPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDE--DANFTEQPSKRSMVGDYSKRR
        KPPLARRVL+F R+FGFD+Q+VQ+TPPSVRNS+PVQ+  RVVSR+FQ SKS+QQGERIVSRYFQ+ EI +AAHNEDE  D N T+QP KRS VG Y KRR
Subjt:  KPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQNSKSSQQGERIVSRYFQNPEIVQAAHNEDE--DANFTEQPSKRSMVGDYSKRR

Query:  RKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKS
        RKDVA SSDNSKA Q S+ K+SRS++KSG DKRVRIVSRYFQNSEK+ E E  VSP+LQN  +NQQ ERVVSRFFQ S + +VVNN+QEV++ P+QC KS
Subjt:  RKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHEAEQNVSPTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKS

Query:  VKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEV
        VKRIR+PAKERK RDK SA+PR+TL A +LFLEAYRRKS+DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAK+VIPKLFTLCP+PK+ALEV
Subjt:  VKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEV

Query:  SHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        S EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQLPGVGKYGADAHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  SHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

SwissProt top hitse value%identityAlignment
O95243 Methyl-CpG-binding domain protein 43.7e-2536.43Show/hide
Query:  RKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSH
        R+ A   W PP S   L+Q+   +DPW++L+  + LNRTSG+ A  V+ K     P+ + A       + ++++PLG    R++T+ + S+ YL + W +
Subjt:  RKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSH

Query:  VTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL
          +L G+GKYG D++ IFC   W +V P+DH LN Y ++L
Subjt:  VTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL

Q0IGK1 Methyl-CpG-binding domain protein 4-like protein1.9e-5041.46Show/hide
Query:  EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSM-GKTSRSV-QKSGRDK---RVRIVSRYFQNSEKSHEAEQNVSPTLQNLN------SNQ
        +D+D + ++   +R    ++    R+ V+P    S  +Q S  G  S SV  K G  K   +V  VS YFQ S  S      VS +    N        Q
Subjt:  EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSM-GKTSRSV-QKSGRDK---RVRIVSRYFQNSEKSHEAEQNVSPTLQNLN------SNQ

Query:  QGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVK-----------------RIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPP
           R VS +FQ          E  V EQPNQ  K ++                 ++ E  KE+ +  + +      L  ++   + Y RK+ D+TW PP 
Subjt:  QGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVK-----------------RIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPP

Query:  SGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGA
        S   LLQ+DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K A EV  E+IE++I+PLG Q+KR++ +QRLS  YLQESW+HVTQL GVGKY A
Subjt:  SGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGA

Query:  DAHAIFCTGYWNEVLPKDHMLNYYWEFL
        DA+AIFC G W+ V P DHMLNYYW++L
Subjt:  DAHAIFCTGYWNEVLPKDHMLNYYWEFL

Q7LX22 Thymine/uracil-DNA glycosylase4.1e-0834.02Show/hide
Query:  AYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHV-------TQLPGVGKYGA
        A DPW VLV  +LL +T+ +Q  D+  +     P+P    + S E+I+ II+PLG +  R+  +++LSE  ++     +         LPGVG Y A
Subjt:  AYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHV-------TQLPGVGKYGA

Q9YDP0 Thymine-DNA glycosylase2.0e-0731.18Show/hide
Query:  DPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMY-------LQESWSHVTQLPGVGKY
        DPW +LV   LL +T+ +Q   V  +     PNPKA      +++ ++IRPLG + +R++ +  L++         +  S   + +LPGVG Y
Subjt:  DPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMY-------LQESWSHVTQLPGVGKY

Q9Z2D7 Methyl-CpG-binding domain protein 46.9e-2435Show/hide
Query:  RKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSH
        R+ +   W PP S   L+Q+   +DPW++L+  + LNRTSG+ A  V+ +     P+ + A       + ++++PLG    R++T+ + S+ YL + W +
Subjt:  RKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSH

Query:  VTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL
          +L G+GKYG D++ IFC   W +V P+DH LN Y ++L
Subjt:  VTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL

Arabidopsis top hitse value%identityAlignment
AT3G07930.1 DNA glycosylase superfamily protein4.0e-1132.46Show/hide
Query:  EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSM-GKTSRSV-QKSGRDK---RVRIVSRYFQNSEKSHEAEQNVSPTLQNLN------SNQ
        +D+D + ++   +R    ++    R+ V+P    S  +Q S  G  S SV  K G  K   +V  VS YFQ S  S      VS +    N        Q
Subjt:  EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSM-GKTSRSV-QKSGRDK---RVRIVSRYFQNSEKSHEAEQNVSPTLQNLN------SNQ

Query:  QGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVK-----------------RIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPP
           R VS +FQ          E  V EQPNQ  K ++                 ++ E  KE+ +  + +      L  ++   + Y RK+ D+TW PP 
Subjt:  QGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVK-----------------RIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPP

Query:  SGIRLLQQDHAYDPWRVLVICMLLNRTS
        S   LLQ+DH +DPWRVLVICMLLN+TS
Subjt:  SGIRLLQQDHAYDPWRVLVICMLLNRTS

AT3G07930.2 DNA glycosylase superfamily protein2.1e-1232.9Show/hide
Query:  EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSM-GKTSRSV-QKSGRDK---RVRIVSRYFQNSEKSHEAEQNVSPTLQNLN------SNQ
        +D+D + ++   +R    ++    R+ V+P    S  +Q S  G  S SV  K G  K   +V  VS YFQ S  S      VS +    N        Q
Subjt:  EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSM-GKTSRSV-QKSGRDK---RVRIVSRYFQNSEKSHEAEQNVSPTLQNLN------SNQ

Query:  QGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVK-----------------RIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPP
           R VS +FQ          E  V EQPNQ  K ++                 ++ E  KE+ +  + +      L  ++   + Y RK+ D+TW PP 
Subjt:  QGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVK-----------------RIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPP

Query:  SGIRLLQQDHAYDPWRVLVICMLLNRTSGQQ
        S   LLQ+DH +DPWRVLVICMLLN+TSG Q
Subjt:  SGIRLLQQDHAYDPWRVLVICMLLNRTSGQQ

AT3G07930.3 DNA glycosylase superfamily protein1.4e-5141.46Show/hide
Query:  EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSM-GKTSRSV-QKSGRDK---RVRIVSRYFQNSEKSHEAEQNVSPTLQNLN------SNQ
        +D+D + ++   +R    ++    R+ V+P    S  +Q S  G  S SV  K G  K   +V  VS YFQ S  S      VS +    N        Q
Subjt:  EDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSM-GKTSRSV-QKSGRDK---RVRIVSRYFQNSEKSHEAEQNVSPTLQNLN------SNQ

Query:  QGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVK-----------------RIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPP
           R VS +FQ          E  V EQPNQ  K ++                 ++ E  KE+ +  + +      L  ++   + Y RK+ D+TW PP 
Subjt:  QGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKSVK-----------------RIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPP

Query:  SGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGA
        S   LLQ+DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K A EV  E+IE++I+PLG Q+KR++ +QRLS  YLQESW+HVTQL GVGKY A
Subjt:  SGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGA

Query:  DAHAIFCTGYWNEVLPKDHMLNYYWEFL
        DA+AIFC G W+ V P DHMLNYYW++L
Subjt:  DAHAIFCTGYWNEVLPKDHMLNYYWEFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTCCAACCTCTCCCCTCCTTCATCTTCTTCATATCCTGACGATTTGTTTTCCCAATTCGCCTTTCGAGGTACTTCCTGCTCCAGATTTCGCTTTCCTTCTTCCAA
ATGTCCCTCCAAATCGAATCAAGAAAACCCTACGGCGGAAAATCTTACCCAAAAGAGGAGGATTCTCATGGCGCATTCCACTCCAATTTCAGCTCTCCAGGATCTTCAAA
ATTCCGAACCTAATCATCAGAAACAAGCCTTTCGGCATGAGATTCCCATTTTGTCTATTGAGGATCTTCAAGAAGTTCCACCCGATCACCAGTCTTCTCAACCGAAGCGT
AACATTCCGGTATTAAACCTAGAGGATTCCCGAGAAGTTTCGCCTAACACCCAATCTTCAGAAGCGGAGAGAGGTTTAGCGCACGAACCTCCTATATTAACTCTAGAGGA
TCTTCAAAAGGCGAAAGCAGACCATCAACCGGCAAAAAAGCCTCCACTGGCGCGTAGGGTCTTACAATTTTACCGGAAGTTCGGATTTGATCAACAAATGGTGCAAAGAA
CTCCGCCTTCTGTCCGAAATTCAGTACCAGTTCAACAAGGTGTACGTGTAGTTTCGCGTTATTTCCAGAATTCAAAATCATCTCAACAAGGAGAACGAATTGTCTCACGC
TACTTTCAAAACCCGGAGATTGTACAAGCAGCCCACAATGAGGATGAGGATGCCAATTTCACGGAGCAGCCAAGCAAAAGATCAATGGTGGGGGATTACAGCAAAAGGAG
GAGGAAAGACGTAGCTCCCAGCTCTGATAATTCAAAAGCAAATCAACATTCAATGGGAAAAACTTCGCGCTCTGTTCAAAAGTCAGGAAGAGATAAACGAGTGCGAATTG
TTTCGCGCTATTTCCAAAATTCAGAAAAGAGTCATGAAGCAGAGCAAAATGTTTCACCTACTTTACAAAATTTAAATTCAAATCAACAAGGAGAGCGAGTAGTCTCACGT
TTCTTTCAAAATTCAGCTAAACAACAAGTAGTGAACAATGAGCAAGAGGTTGTAGAGCAGCCAAATCAGTGTGTAAAATCTGTTAAAAGAATCCGTGAACCGGCCAAAGA
AAGGAAACAGAGGGATAAAAGTTCTGCCAGGCCTAGATCCACTCTTCCTGCTGCCAAGTTGTTTTTGGAAGCTTATAGAAGAAAATCCGCAGATGATACATGGAAGCCTC
CTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCGTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTTAACCGGACATCTGGGCAGCAGGCAAAAGACGTG
ATACCTAAACTCTTCACTTTGTGTCCTAATCCAAAGGCTGCTTTGGAGGTATCACATGAGCAGATAGAAGATATTATACGACCTCTTGGTTTTCAAAGAAAAAGATCGCG
CACAATGCAACGTTTATCTGAGATGTATTTACAAGAAAGTTGGAGTCATGTCACTCAACTTCCTGGCGTTGGCAAGTACGGAGCTGATGCACATGCAATATTCTGCACTG
GATATTGGAATGAAGTACTACCTAAAGATCACATGCTTAATTATTATTGGGAGTTTCTCCACAGCATAAAACACTTGCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTTCCAACCTCTCCCCTCCTTCATCTTCTTCATATCCTGACGATTTGTTTTCCCAATTCGCCTTTCGAGGTACTTCCTGCTCCAGATTTCGCTTTCCTTCTTCCAA
ATGTCCCTCCAAATCGAATCAAGAAAACCCTACGGCGGAAAATCTTACCCAAAAGAGGAGGATTCTCATGGCGCATTCCACTCCAATTTCAGCTCTCCAGGATCTTCAAA
ATTCCGAACCTAATCATCAGAAACAAGCCTTTCGGCATGAGATTCCCATTTTGTCTATTGAGGATCTTCAAGAAGTTCCACCCGATCACCAGTCTTCTCAACCGAAGCGT
AACATTCCGGTATTAAACCTAGAGGATTCCCGAGAAGTTTCGCCTAACACCCAATCTTCAGAAGCGGAGAGAGGTTTAGCGCACGAACCTCCTATATTAACTCTAGAGGA
TCTTCAAAAGGCGAAAGCAGACCATCAACCGGCAAAAAAGCCTCCACTGGCGCGTAGGGTCTTACAATTTTACCGGAAGTTCGGATTTGATCAACAAATGGTGCAAAGAA
CTCCGCCTTCTGTCCGAAATTCAGTACCAGTTCAACAAGGTGTACGTGTAGTTTCGCGTTATTTCCAGAATTCAAAATCATCTCAACAAGGAGAACGAATTGTCTCACGC
TACTTTCAAAACCCGGAGATTGTACAAGCAGCCCACAATGAGGATGAGGATGCCAATTTCACGGAGCAGCCAAGCAAAAGATCAATGGTGGGGGATTACAGCAAAAGGAG
GAGGAAAGACGTAGCTCCCAGCTCTGATAATTCAAAAGCAAATCAACATTCAATGGGAAAAACTTCGCGCTCTGTTCAAAAGTCAGGAAGAGATAAACGAGTGCGAATTG
TTTCGCGCTATTTCCAAAATTCAGAAAAGAGTCATGAAGCAGAGCAAAATGTTTCACCTACTTTACAAAATTTAAATTCAAATCAACAAGGAGAGCGAGTAGTCTCACGT
TTCTTTCAAAATTCAGCTAAACAACAAGTAGTGAACAATGAGCAAGAGGTTGTAGAGCAGCCAAATCAGTGTGTAAAATCTGTTAAAAGAATCCGTGAACCGGCCAAAGA
AAGGAAACAGAGGGATAAAAGTTCTGCCAGGCCTAGATCCACTCTTCCTGCTGCCAAGTTGTTTTTGGAAGCTTATAGAAGAAAATCCGCAGATGATACATGGAAGCCTC
CTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCGTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTTAACCGGACATCTGGGCAGCAGGCAAAAGACGTG
ATACCTAAACTCTTCACTTTGTGTCCTAATCCAAAGGCTGCTTTGGAGGTATCACATGAGCAGATAGAAGATATTATACGACCTCTTGGTTTTCAAAGAAAAAGATCGCG
CACAATGCAACGTTTATCTGAGATGTATTTACAAGAAAGTTGGAGTCATGTCACTCAACTTCCTGGCGTTGGCAAGTACGGAGCTGATGCACATGCAATATTCTGCACTG
GATATTGGAATGAAGTACTACCTAAAGATCACATGCTTAATTATTATTGGGAGTTTCTCCACAGCATAAAACACTTGCTCTGA
Protein sequenceShow/hide protein sequence
MTSNLSPPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPSKSNQENPTAENLTQKRRILMAHSTPISALQDLQNSEPNHQKQAFRHEIPILSIEDLQEVPPDHQSSQPKR
NIPVLNLEDSREVSPNTQSSEAERGLAHEPPILTLEDLQKAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQNSKSSQQGERIVSR
YFQNPEIVQAAHNEDEDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHEAEQNVSPTLQNLNSNQQGERVVSR
FFQNSAKQQVVNNEQEVVEQPNQCVKSVKRIREPAKERKQRDKSSARPRSTLPAAKLFLEAYRRKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDV
IPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL