CuGenDBv2

Spg037662 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg037662
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: methyl-CpG-binding domain protein 4-like protein
Genome location: scaffold2:50923111..50929713
RNA-Seq Expression: Spg037662
Synteny: Spg037662
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR045138 - Methyl-CpG binding protein MeCP2/MBD4
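
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span of the gene could be computed as follows. `Location` and `parse_location` are hypothetical names introduced for illustration and are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("scaffold2:50923111..50929713")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the reported span covers the whole gene model, including any introns, so it is expected to be longer than the CDS.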


Homology
GenBank top hits (e value, % identity, alignment)
KAG7022375.1 Methyl-CpG-binding domain protein 4-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 9.7e-225 | % identity: 76.78
Query:  SPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDHQSS
        SP SSS+PD LFSQFAF+G S SRFRFP SKCP++SN++NPT E+ TQKR  LMAQ +PIS L+ LQ SE N QK + WHEIPIL IEDLQ+        
Subjt:  SPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDHQSS

Query:  QPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQ
         PKR I  L +ED +EVSP T +SE ER  AHEPPILTLEDLQNAK+DHQPA KPPLARRVL+FYR+FGFD+Q+VQ+TPP VRNS+PVQ   RVVSR+FQ
Subjt:  QPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQ

Query:  NSKSSQQGERIVSRYFQNPEVVQAAHNEDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHE
         SKS+QQGERIVSRYFQ+ E+ QA+HNED D N T+QP KRS VG+Y KRRRKDVAPSSDNSKA Q S+ K+SRSV+KSG DKRVRIVSRYFQNSEK+ E
Subjt:  NSKSSQQGERIVSRYFQNPEVVQAAHNEDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHE

Query:  AEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSADDTWKPPPS
         E  VS +LQN  +NQQGERVVSRFFQ S +Q+VVNN+QEV +QP+QC K VKRIR+PAKERK RDK SARPR+ L A ELFLEAYRRKS DDTWKPPPS
Subjt:  AEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSADDTWKPPPS

Query:  GIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGAD
        GIRLLQQDHAYDPWRVLVICMLLNRT+GQQAK+VIPKLFTLCP+PK+ALEVS EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQLPGVGKYGAD
Subjt:  GIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGAD

Query:  AHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        AHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  AHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

XP_008460559.1 PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo] | e value: 8.6e-173 | % identity: 65.56
Query:  NLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDH
        NL+ PSSSSYP DLFS+F FRGTS SRFRFP    P+KS  +NP     +        Q +PIS L DLQ SEPN                        H
Subjt:  NLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDH

Query:  QSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSR
          S          L  P        SSEA+     EPPILTLEDLQN K   Q  KKP LARRVL FYR+FGFD++++Q T  SV NS PVQ+G RVVSR
Subjt:  QSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSR

Query:  YFQNSKSSQQGERIVSRYFQNPEVVQAAHNED--GDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNS
        YFQNS+S+QQ ERIVSRYF+     +AAH ED   D N TEQPSKRS     SKRRRKDV PSS NSK N HSMGKTSRSVQKS  D R RIVS YFQ S
Subjt:  YFQNSKSSQQGERIVSRYFQNPEVVQAAHNED--GDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNS

Query:  EKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDK-SSARPRSNLPAAELFLEAYRRKSADDT
        EKS E ++ VS +LQN  SNQQ E++VSRFF  S KQQ VNN++E  EQ NQC K VKR+R+P  ERKQ++K SS +PR+ L AAELFLEAYRRKS DDT
Subjt:  EKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDK-SSARPRSNLPAAELFLEAYRRKSADDT

Query:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGV
        WKPPPSG RLLQ DHAYDPWRVLVICMLLNRTSG+QAK+VIPKLF+LCPNPKA LEVS EQIEDIIRPLG  RKRSRTM RLSEMYL+ESWSHVTQLPGV
Subjt:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGV

Query:  GKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        GKYGADAHAIFCTGYW+EV PKDHMLNYYW+FLHSIKHLL
Subjt:  GKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

XP_022931728.1 methyl-CpG-binding domain protein 4-like protein [Cucurbita moschata] | e value: 6.7e-218 | % identity: 74.08
Query:  MISNLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVP
        M  NLS PSSSS+PD LFSQFAF+G S SRFRFP SKCP++SN++NPT E+ TQKR  LMAQ +PIS L+ LQ SE N QK +   EIPIL IEDLQ+  
Subjt:  MISNLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVP

Query:  PDHQSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV
               PKR    L +ED +EVSP T +SE ER L HEPPILTLED+QNAK+DHQPA +PPLARRVL+FYR+FGFD+Q+VQ+TPPSVRNS+PVQ+  RV
Subjt:  PDHQSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV

Query:  VSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHN----EDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSR
        VSR+FQ SKS+QQGERIVSRYFQ+ E+ +AAHN    ED D N T+QP KRS VG Y KRRRKDVA SSDNSKA Q S+ K+SR V++SG DKRVR VSR
Subjt:  VSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHN----EDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSR

Query:  YFQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKS
        YFQNSEK+ E E  VS  LQN  + QQGER+VSRFFQ S +Q+VVNN+QEV++ P+QC K VKRIR+PAKERK RDK SARPR+ L A ELFLEAYRRKS
Subjt:  YFQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKS

Query:  ADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQ
        +DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAK+VIPKLFTLCP+PK+ALEVS EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQ
Subjt:  ADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQ

Query:  LPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        LPGVGKYGADAHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  LPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

XP_022969557.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucurbita maxima] | e value: 7.2e-196 | % identity: 75.57
Query:  MAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDHQSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAK
        MA  +PIS L+ LQ SE N QK +  HEIPIL IE LQ+         PKR I  L +ED +EVSP T +SE ER LAHEPPILTLEDLQNAK+DHQPA 
Subjt:  MAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDHQSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAK

Query:  KPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHN--EDGDANFTEQPSKRSMVGDYSKRR
        KPPLARRVL+F R+FGFD+Q+VQ+TPPSVRNS+PVQ+  RVVSR+FQ SKS+QQGERIVSRYFQ+ E+ +AAHN  ED D N T+QP KRS VG Y KRR
Subjt:  KPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHN--EDGDANFTEQPSKRSMVGDYSKRR

Query:  RKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKP
        RKDVA SSDNSKA Q S+ K+SRS++KSG DKRVRIVSRYFQNSEK+ E E  VS +LQN  +NQQ ERVVSRFFQ S + +VVNN+QEV++ P+QC K 
Subjt:  RKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKP

Query:  VKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEV
        VKRIR+PAKERK RDK SA+PR+ L A ELFLEAYRRKS+DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAK+VIPKLFTLCP+PK+ALEV
Subjt:  VKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEV

Query:  SHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        S EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQLPGVGKYGADAHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  SHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

XP_023529473.1 methyl-CpG-binding domain protein 4-like protein [Cucurbita pepo subsp. pepo] | e value: 3.7e-216 | % identity: 74.35
Query:  MISNLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVP
        M  NLS PSSSS+PD       F     SRFRFP SKCP+ SN +NPT E+ TQKR  LMAQ +PIS L+ LQ SE N QK +  HEIPIL IEDLQ+  
Subjt:  MISNLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVP

Query:  PDHQSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV
               PKR    L +ED ++VSP T +SE ER LAHEPPILTLEDLQNAK+DHQPA KPPLARRVL+FYR+FGFD+Q+VQ+TPPSVRNS+PVQ+  RV
Subjt:  PDHQSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV

Query:  VSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHN--EDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYF
        VSR+FQ SKS+QQGERIVSRYFQ+ E+ QAAHN  ED D N T+QP KRS VG+Y KRRRKDVA SSDNSKA Q S+ K+SRSV+KSG+DKRVRIVSRYF
Subjt:  VSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHN--EDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYF

Query:  QNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSAD
        QNSEK+ E E  VS +LQN  + QQGER+VSRFFQ S +Q+VVNN+QEV++ P+QC K VKRIR+PAKERK RDK SARPR+ L A ELFLEAYRRKS+D
Subjt:  QNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSAD

Query:  DTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLP
        DTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAKDVIPKLFTLCP+PK+ALEVS EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQLP
Subjt:  DTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLP

Query:  GVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        GVGKYGADAHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  GVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KRW9 ENDO3c domain-containing protein | e value: 9.3e-149 | % identity: 60.94
Query:  NLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDH
        NL+ PSSSSYP DLFS+F FRGTS SRFRFP    P+KS Q++P     +        Q +P+S L DLQ  EP+                        H
Subjt:  NLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDH

Query:  QSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSR
          S                 SP+++         HEPPILTLEDLQN K   Q  K+P LARRVL FYR+FGFD++++Q T  SV NSVP Q+G RVVSR
Subjt:  QSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSR

Query:  YFQNSKSSQQGERIVSRYFQNPEVVQAAHNEDGD--ANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNS
        YFQNS+S+QQ +RIVSRYFQ     + AH ED +   N TEQPSKRS     SKRRRKDV P SDNSK N HS+GKT+RSVQKSG D +VRIVS YFQ+ 
Subjt:  YFQNSKSSQQGERIVSRYFQNPEVVQAAHNEDGD--ANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNS

Query:  EKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDK-SSARPRSNLPAAELFLEAYRRKSADDT
        EKS E ++ VS +LQN  SNQQ E+VVSRFF  S KQQ VNN++E  EQ NQC K VKR+R+P  ERK++DK SS +PR+ L AAELFLEAYRRKS  DT
Subjt:  EKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDK-SSARPRSNLPAAELFLEAYRRKSADDT

Query:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGV
        WKPP SG RLLQ DHAYDPWRVLVICMLLNRTSGQQAK+VIPKLF+LCPNPKA LEVS EQIEDIIRPLGF RKRSRTM RLSEMYL+ESWSHVTQLPGV
Subjt:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGV

Query:  GKYGADAHAIFC
        GKY A    + C
Subjt:  GKYGADAHAIFC

A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein | e value: 4.2e-173 | % identity: 65.56
Query:  NLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDH
        NL+ PSSSSYP DLFS+F FRGTS SRFRFP    P+KS  +NP     +        Q +PIS L DLQ SEPN                        H
Subjt:  NLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDH

Query:  QSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSR
          S          L  P        SSEA+     EPPILTLEDLQN K   Q  KKP LARRVL FYR+FGFD++++Q T  SV NS PVQ+G RVVSR
Subjt:  QSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSR

Query:  YFQNSKSSQQGERIVSRYFQNPEVVQAAHNED--GDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNS
        YFQNS+S+QQ ERIVSRYF+     +AAH ED   D N TEQPSKRS     SKRRRKDV PSS NSK N HSMGKTSRSVQKS  D R RIVS YFQ S
Subjt:  YFQNSKSSQQGERIVSRYFQNPEVVQAAHNED--GDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNS

Query:  EKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDK-SSARPRSNLPAAELFLEAYRRKSADDT
        EKS E ++ VS +LQN  SNQQ E++VSRFF  S KQQ VNN++E  EQ NQC K VKR+R+P  ERKQ++K SS +PR+ L AAELFLEAYRRKS DDT
Subjt:  EKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDK-SSARPRSNLPAAELFLEAYRRKSADDT

Query:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGV
        WKPPPSG RLLQ DHAYDPWRVLVICMLLNRTSG+QAK+VIPKLF+LCPNPKA LEVS EQIEDIIRPLG  RKRSRTM RLSEMYL+ESWSHVTQLPGV
Subjt:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGV

Query:  GKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        GKYGADAHAIFCTGYW+EV PKDHMLNYYW+FLHSIKHLL
Subjt:  GKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein | e value: 4.0e-160 | % identity: 63.26
Query:  NLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDH
        NL+ PSSSSYP DLFS+F FRGTS SRFRFP    P+KS  +NP     +        Q +PIS L DLQ SEPN                        H
Subjt:  NLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDH

Query:  QSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSR
          S          L  P        SSEA+     EPPILTLEDLQN K   Q  KKP LARRVL FYR+FGFD++++Q T  SV NS PVQ+G RVVSR
Subjt:  QSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSR

Query:  YFQNSKSSQQGERIVSRYFQNPEVVQAAHNED--GDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNS
        YFQNS+S+QQ ERIVSRYF+     +AAH ED   D N TEQPSKRS     SKRRRKDV PSS NSK N HSMGKTSRSVQKS  D R RIVS YFQ S
Subjt:  YFQNSKSSQQGERIVSRYFQNPEVVQAAHNED--GDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNS

Query:  EKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDK-SSARPRSNLPAAELFLEAYRRKSADDT
        EKS E ++ VS +LQN  SNQQ E++VSRFF  S KQQ VNN++E  EQ NQC K VKR+R+P  ERKQ++K SS +PR+ L AAELFLEAYRRKS DDT
Subjt:  EKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDK-SSARPRSNLPAAELFLEAYRRKSADDT

Query:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGV
        WKPPPSG RLLQ DHAYDPWRVLVICMLLNRTSG+QAK+VIPKLF+LCPNPKA LEVS EQIEDIIRPLG  RKRSRTM RLSEMYL+ESWSHVTQLPGV
Subjt:  WKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGV

Query:  GKYGADAHAIFCTGYWNEVLPKDHMLNY
        GKYGADAHAIFCTGYWN  + +  ++++
Subjt:  GKYGADAHAIFCTGYWNEVLPKDHMLNY

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein | e value: 3.2e-218 | % identity: 74.08
Query:  MISNLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVP
        M  NLS PSSSS+PD LFSQFAF+G S SRFRFP SKCP++SN++NPT E+ TQKR  LMAQ +PIS L+ LQ SE N QK +   EIPIL IEDLQ+  
Subjt:  MISNLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVP

Query:  PDHQSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV
               PKR    L +ED +EVSP T +SE ER L HEPPILTLED+QNAK+DHQPA +PPLARRVL+FYR+FGFD+Q+VQ+TPPSVRNS+PVQ+  RV
Subjt:  PDHQSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRV

Query:  VSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHN----EDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSR
        VSR+FQ SKS+QQGERIVSRYFQ+ E+ +AAHN    ED D N T+QP KRS VG Y KRRRKDVA SSDNSKA Q S+ K+SR V++SG DKRVR VSR
Subjt:  VSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHN----EDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSR

Query:  YFQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKS
        YFQNSEK+ E E  VS  LQN  + QQGER+VSRFFQ S +Q+VVNN+QEV++ P+QC K VKRIR+PAKERK RDK SARPR+ L A ELFLEAYRRKS
Subjt:  YFQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKS

Query:  ADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQ
        +DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAK+VIPKLFTLCP+PK+ALEVS EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQ
Subjt:  ADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQ

Query:  LPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        LPGVGKYGADAHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  LPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X1 | e value: 3.5e-196 | % identity: 75.57
Query:  MAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDHQSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAK
        MA  +PIS L+ LQ SE N QK +  HEIPIL IE LQ+         PKR I  L +ED +EVSP T +SE ER LAHEPPILTLEDLQNAK+DHQPA 
Subjt:  MAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDHQSSQPKRNIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAK

Query:  KPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHN--EDGDANFTEQPSKRSMVGDYSKRR
        KPPLARRVL+F R+FGFD+Q+VQ+TPPSVRNS+PVQ+  RVVSR+FQ SKS+QQGERIVSRYFQ+ E+ +AAHN  ED D N T+QP KRS VG Y KRR
Subjt:  KPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHN--EDGDANFTEQPSKRSMVGDYSKRR

Query:  RKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKP
        RKDVA SSDNSKA Q S+ K+SRS++KSG DKRVRIVSRYFQNSEK+ E E  VS +LQN  +NQQ ERVVSRFFQ S + +VVNN+QEV++ P+QC K 
Subjt:  RKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKP

Query:  VKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEV
        VKRIR+PAKERK RDK SA+PR+ L A ELFLEAYRRKS+DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRT+GQQAK+VIPKLFTLCP+PK+ALEV
Subjt:  VKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEV

Query:  SHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
        S EQIEDIIRPLG QRKRS T+QRLSEMYL+ESWSHVTQLPGVGKYGADAHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL
Subjt:  SHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL

SwissProt top hits (e value, % identity, alignment)
O95243 Methyl-CpG-binding domain protein 4 | e value: 3.7e-25 | % identity: 36.43
Query:  RKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSH
        R+ A   W PP S   L+Q+   +DPW++L+  + LNRTSG+ A  V+ K     P+ + A       + ++++PLG    R++T+ + S+ YL + W +
Subjt:  RKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSH

Query:  VTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL
          +L G+GKYG D++ IFC   W +V P+DH LN Y ++L
Subjt:  VTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL

Q0IGK1 Methyl-CpG-binding domain protein 4-like protein | e value: 8.1e-49 | % identity: 38.99
Query:  VRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHNEDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRY
        VR VS YFQ S  SQQ +              +  +++G +    +  + S     S   + D    S +     +  G + R V       +VR VS Y
Subjt:  VRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHNEDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRY

Query:  FQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSA
        FQ S  S +  Q   + L+N     +    VSR+F     Q                      + E  KE+ +  + +      L  ++   + Y RK+ 
Subjt:  FQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSA

Query:  DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQL
        D+TW PP S   LLQ+DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K A EV  E+IE++I+PLG Q+KR++ +QRLS  YLQESW+HVTQL
Subjt:  DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQL

Query:  PGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL
         GVGKY ADA+AIFC G W+ V P DHMLNYYW++L
Subjt:  PGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL

Q7LX22 Thymine/uracil-DNA glycosylase | e value: 4.1e-08 | % identity: 34.02
Query:  AYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHV-------TQLPGVGKYGA
        A DPW VLV  +LL +T+ +Q  D+  +     P+P    + S E+I+ II+PLG +  R+  +++LSE  ++     +         LPGVG Y A
Subjt:  AYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHV-------TQLPGVGKYGA

Q9YDP0 Thymine-DNA glycosylase | e value: 2.0e-07 | % identity: 31.18
Query:  DPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMY-------LQESWSHVTQLPGVGKY
        DPW +LV   LL +T+ +Q   V  +     PNPKA      +++ ++IRPLG + +R++ +  L++         +  S   + +LPGVG Y
Subjt:  DPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMY-------LQESWSHVTQLPGVGKY

Q9Z2D7 Methyl-CpG-binding domain protein 4 | e value: 6.9e-24 | % identity: 35
Query:  RKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSH
        R+ +   W PP S   L+Q+   +DPW++L+  + LNRTSG+ A  V+ +     P+ + A       + ++++PLG    R++T+ + S+ YL + W +
Subjt:  RKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSH

Query:  VTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL
          +L G+GKYG D++ IFC   W +V P+DH LN Y ++L
Subjt:  VTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL

Arabidopsis top hits (e value, % identity, alignment)
AT3G07930.1 DNA glycosylase superfamily protein | e value: 1.7e-09 | % identity: 29.24
Query:  VRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHNEDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRY
        VR VS YFQ S  SQQ +              +  +++G +    +  + S     S   + D    S +     +  G + R V       +VR VS Y
Subjt:  VRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHNEDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRY

Query:  FQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSA
        FQ S  S +  Q   + L+N     +    VSR+F     Q                      + E  KE+ +  + +      L  ++   + Y RK+ 
Subjt:  FQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSA

Query:  DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTS
        D+TW PP S   LLQ+DH +DPWRVLVICMLLN+TS
Subjt:  DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTS

AT3G07930.2 DNA glycosylase superfamily protein | e value: 9.0e-11 | % identity: 29.71
Query:  VRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHNEDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRY
        VR VS YFQ S  SQQ +              +  +++G +    +  + S     S   + D    S +     +  G + R V       +VR VS Y
Subjt:  VRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHNEDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRY

Query:  FQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSA
        FQ S  S +  Q   + L+N     +    VSR+F     Q                      + E  KE+ +  + +      L  ++   + Y RK+ 
Subjt:  FQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSA

Query:  DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQ
        D+TW PP S   LLQ+DH +DPWRVLVICMLLN+TSG Q
Subjt:  DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQ

AT3G07930.3 DNA glycosylase superfamily protein | e value: 5.7e-50 | % identity: 38.99
Query:  VRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHNEDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRY
        VR VS YFQ S  SQQ +              +  +++G +    +  + S     S   + D    S +     +  G + R V       +VR VS Y
Subjt:  VRVVSRYFQNSKSSQQGERIVSRYFQNPEVVQAAHNEDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRY

Query:  FQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSA
        FQ S  S +  Q   + L+N     +    VSR+F     Q                      + E  KE+ +  + +      L  ++   + Y RK+ 
Subjt:  FQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSRFFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSA

Query:  DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQL
        D+TW PP S   LLQ+DH +DPWRVLVICMLLN+TSG Q + VI  LF LC + K A EV  E+IE++I+PLG Q+KR++ +QRLS  YLQESW+HVTQL
Subjt:  DDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQL

Query:  PGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL
         GVGKY ADA+AIFC G W+ V P DHMLNYYW++L
Subjt:  PGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFL
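
The alignments above appear to follow the usual BLAST protein layout, in which the unlabeled middle line repeats the residue letter at identical positions, shows `+` for conservative substitutions, and leaves a space for mismatches or gaps. As a minimal sketch under that assumption, the identity of a single alignment block could be counted as below, using the final block of the KAG7022375.1 alignment as input. `block_identity` is a hypothetical helper introduced for illustration. The % identity values listed for each hit cover the full alignment, so one block on its own is not expected to reproduce them.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned length) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # Letters in the midline mark identical residues; '+' and spaces do not count.
    identical = sum(1 for char in midline if char.isalpha())
    return identical, len(query)


query = "AHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL"
midline = "AHAIFCTGYW EVLPKDHMLNYYWEFLHSIKHLL"
identical, length = block_identity(query, midline)
print(f"{identical}/{length} identical ({100 * identical / length:.1f}%)")
```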


Sequences
CDS sequence
ATGATTTCCAACCTCTCCTCTCCTTCATCTTCTTCATATCCCGACGATTTGTTTTCTCAATTCGCCTTTCGAGGTACTTCCTGCTCCAGATTTCGCTTTCCTTCTTCCAA
ATGTCCCGCCAAATCGAATCAAGAAAACCCTACGGCGGAAAATCTTACCCAAAAGAGGAGGATTCTCATGGCGCAGGCCACTCCAATTTCAGCTCTCCAGGATCTCCAAA
ATTCAGAACCTAATCAACAGAAACAATCCTTTTGGCATGAGATTCCCATTTTGTCTATTGAGGATCTTCAAGAAGTTCCACCCGATCACCAGTCTTCTCAACCGAAGCGT
AACATTCCGGTATTACCCCTAGAGGATCCCCGAGAAGTTTCGCCTAACACCCAATCTTCAGAAGCGGAGAGAGGTTTAGCGCACGAACCTCCTATATTAACTCTAGAGGA
TCTTCAAAATGCGAAAGCAGACCATCAACCGGCAAAAAAACCTCCACTGGCGCGTAGGGTCTTACAATTTTACCGGAAGTTCGGATTTGATCAACAAATGGTGCAAAGAA
CTCCGCCTTCTGTCCGAAATTCAGTACCAGTTCAACAAGGTGTACGTGTAGTTTCGCGTTATTTCCAGAATTCAAAATCATCTCAACAAGGAGAACGAATTGTCTCACGC
TACTTTCAAAACCCGGAGGTTGTACAAGCAGCCCATAATGAGGATGGGGATGCCAATTTCACGGAGCAGCCAAGCAAAAGATCGATGGTGGGGGACTACAGCAAAAGGAG
GAGGAAAGACGTAGCTCCCAGCTCTGATAATTCAAAAGCAAATCAACATTCAATGGGAAAGACTTCGCGCTCTGTTCAAAAGTCAGGAAGAGATAAACGAGTGCGAATTG
TTTCGCGCTATTTCCAAAATTCAGAAAAGAGTCATGAAGCAGAGCAAAATGTTTCACGTACTTTACAAAATTTAAATTCAAATCAACAAGGAGAGCGAGTAGTCTCACGT
TTCTTTCAAAATTCAGCAAAACAACAAGTAGTGAACAATGAGCAAGAGGTTGTAGAGCAGCCAAATCAGTGTGTAAAACCTGTTAAAAGAATCCGTGAACCGGCCAAAGA
AAGGAAACAGAGGGATAAAAGTTCTGCCAGGCCTAGATCCAATCTTCCTGCTGCCGAGTTGTTTTTGGAAGCTTATAGAAGAAAATCCGCAGATGATACATGGAAGCCTC
CTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCGTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTTAACCGTACATCTGGGCAGCAGGCAAAAGACGTG
ATACCTAAACTCTTCACTTTGTGTCCTAATCCAAAGGCTGCTTTGGAGGTATCACATGAGCAGATAGAAGATATTATACGACCTCTTGGTTTTCAAAGAAAAAGATCGCG
CACAATGCAACGATTGTCTGAGATGTATTTACAAGAAAGTTGGAGTCATGTCACTCAGCTTCCTGGCGTTGGCAAGTACGGAGCTGATGCACATGCAATATTCTGCACTG
GATATTGGAATGAAGTACTACCTAAAGATCACATGCTTAATTATTATTGGGAGTTTCTCCACAGCATAAAACATTTGCTCTGA
mRNA sequence
ATGATTTCCAACCTCTCCTCTCCTTCATCTTCTTCATATCCCGACGATTTGTTTTCTCAATTCGCCTTTCGAGGTACTTCCTGCTCCAGATTTCGCTTTCCTTCTTCCAA
ATGTCCCGCCAAATCGAATCAAGAAAACCCTACGGCGGAAAATCTTACCCAAAAGAGGAGGATTCTCATGGCGCAGGCCACTCCAATTTCAGCTCTCCAGGATCTCCAAA
ATTCAGAACCTAATCAACAGAAACAATCCTTTTGGCATGAGATTCCCATTTTGTCTATTGAGGATCTTCAAGAAGTTCCACCCGATCACCAGTCTTCTCAACCGAAGCGT
AACATTCCGGTATTACCCCTAGAGGATCCCCGAGAAGTTTCGCCTAACACCCAATCTTCAGAAGCGGAGAGAGGTTTAGCGCACGAACCTCCTATATTAACTCTAGAGGA
TCTTCAAAATGCGAAAGCAGACCATCAACCGGCAAAAAAACCTCCACTGGCGCGTAGGGTCTTACAATTTTACCGGAAGTTCGGATTTGATCAACAAATGGTGCAAAGAA
CTCCGCCTTCTGTCCGAAATTCAGTACCAGTTCAACAAGGTGTACGTGTAGTTTCGCGTTATTTCCAGAATTCAAAATCATCTCAACAAGGAGAACGAATTGTCTCACGC
TACTTTCAAAACCCGGAGGTTGTACAAGCAGCCCATAATGAGGATGGGGATGCCAATTTCACGGAGCAGCCAAGCAAAAGATCGATGGTGGGGGACTACAGCAAAAGGAG
GAGGAAAGACGTAGCTCCCAGCTCTGATAATTCAAAAGCAAATCAACATTCAATGGGAAAGACTTCGCGCTCTGTTCAAAAGTCAGGAAGAGATAAACGAGTGCGAATTG
TTTCGCGCTATTTCCAAAATTCAGAAAAGAGTCATGAAGCAGAGCAAAATGTTTCACGTACTTTACAAAATTTAAATTCAAATCAACAAGGAGAGCGAGTAGTCTCACGT
TTCTTTCAAAATTCAGCAAAACAACAAGTAGTGAACAATGAGCAAGAGGTTGTAGAGCAGCCAAATCAGTGTGTAAAACCTGTTAAAAGAATCCGTGAACCGGCCAAAGA
AAGGAAACAGAGGGATAAAAGTTCTGCCAGGCCTAGATCCAATCTTCCTGCTGCCGAGTTGTTTTTGGAAGCTTATAGAAGAAAATCCGCAGATGATACATGGAAGCCTC
CTCCCTCTGGAATTCGCCTTCTCCAACAGGATCATGCGTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTTAACCGTACATCTGGGCAGCAGGCAAAAGACGTG
ATACCTAAACTCTTCACTTTGTGTCCTAATCCAAAGGCTGCTTTGGAGGTATCACATGAGCAGATAGAAGATATTATACGACCTCTTGGTTTTCAAAGAAAAAGATCGCG
CACAATGCAACGATTGTCTGAGATGTATTTACAAGAAAGTTGGAGTCATGTCACTCAGCTTCCTGGCGTTGGCAAGTACGGAGCTGATGCACATGCAATATTCTGCACTG
GATATTGGAATGAAGTACTACCTAAAGATCACATGCTTAATTATTATTGGGAGTTTCTCCACAGCATAAAACATTTGCTCTGA
Protein sequence
MISNLSSPSSSSYPDDLFSQFAFRGTSCSRFRFPSSKCPAKSNQENPTAENLTQKRRILMAQATPISALQDLQNSEPNQQKQSFWHEIPILSIEDLQEVPPDHQSSQPKR
NIPVLPLEDPREVSPNTQSSEAERGLAHEPPILTLEDLQNAKADHQPAKKPPLARRVLQFYRKFGFDQQMVQRTPPSVRNSVPVQQGVRVVSRYFQNSKSSQQGERIVSR
YFQNPEVVQAAHNEDGDANFTEQPSKRSMVGDYSKRRRKDVAPSSDNSKANQHSMGKTSRSVQKSGRDKRVRIVSRYFQNSEKSHEAEQNVSRTLQNLNSNQQGERVVSR
FFQNSAKQQVVNNEQEVVEQPNQCVKPVKRIREPAKERKQRDKSSARPRSNLPAAELFLEAYRRKSADDTWKPPPSGIRLLQQDHAYDPWRVLVICMLLNRTSGQQAKDV
IPKLFTLCPNPKAALEVSHEQIEDIIRPLGFQRKRSRTMQRLSEMYLQESWSHVTQLPGVGKYGADAHAIFCTGYWNEVLPKDHMLNYYWEFLHSIKHLL
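
The protein sequence corresponds to the conceptual translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code, this can be spot-checked on the first six and last five codons of the CDS listed above. `translate` and `CODON_TABLE` are hypothetical names written for illustration only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 18 and last 15 nucleotides of the CDS sequence in this record.
print(translate("ATGATTTCCAACCTCTCC"))
print(translate("AAACATTTGCTCTGA"))
```

The same function can be applied to the full CDS after joining its lines into one string. The trailing `*` marks the stop codon, which is not part of the protein sequence.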