; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006661 (gene) of Snake gourd v1 genome

Gene IDTan0006661
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionmethyl-CpG-binding domain protein 4-like protein
Genome locationLG09:491365..498132
RNA-Seq ExpressionTan0006661
SyntenyTan0006661
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR011257 - DNA glycosylase
IPR045138 - Methyl-CpG binding protein MeCP2/MBD4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588518.1 Methyl-CpG-binding domain protein 4-like protein, partial [Cucurbita argyrosperma subsp. sororia]4.8e-17678.01Show/hide
Query:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTPKDFTQKSRILMRQDSPIS------TSEATHQKKSSGHEIPISSIG
        MTATTI+NPN SPPSSS +PD LFSQFAF+G SSSRFRFPPSKCPSESN QNPTP+DFTQK   LM Q+SPIS      TSEA HQK ++ HEIPI  I 
Subjt:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTPKDFTQKSRILMRQDSPIS------TSEATHQKKSSGHEIPISSIG

Query:  DLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVS
        DL D+PKREI TLT+EDVQEVSP  PT E ERV AHEPPILTLEDLQN KSDHQPA KPPLARRVL+FYRQFGFD+ + Q+TPP VRNS+PVQ    VVS
Subjt:  DLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVS

Query:  RYFQKAKPTQQRERIVSRYFQNSEMERAAHNEDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSK
        R+FQ++K TQQ ERIVSRYFQ+SE+E+A+HNEDED N T+QP KRS VG+Y + RRKDV PSSDNSKAYQ S+RK+SRSV+ SG DKRVR VSRYFQNS+
Subjt:  RYFQKAKPTQQRERIVSRYFQNSEMERAAHNEDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSK

Query:  KNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK
        KNPE+E E+S SLQNS+ NQQGERVVSRFFQKSE+Q+VVNNQQEV +QP+QCA+SVKRIRKPAKERKVRDK SARPR TLSA ELFLEAYRRKS DDTWK
Subjt:  KNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK

Query:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTTG
        PP SGIRLLQQDHAYDPWRVLVICMLLNRTTG
Subjt:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTTG

KAG7022375.1 Methyl-CpG-binding domain protein 4-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]8.3e-23780.67Show/hide
Query:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTPKDFTQKSRILMRQDSPIS------TSEATHQKKSSGHEIPISSIG
        MTATTI+NPN SPPSSS +PD LFSQFAF+G SSSRFRFPPSKCPSESN QNPTP+DFTQK   LM Q+SPIS      TSEA HQK ++ HEIPI  I 
Subjt:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTPKDFTQKSRILMRQDSPIS------TSEATHQKKSSGHEIPISSIG

Query:  DLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVS
        DL D+PKREI TLT+EDVQEVSP  PT E ERV AHEPPILTLEDLQN KSDHQPA KPPLARRVL+FYRQFGFD+ + Q+TPP VRNS+PVQ    VVS
Subjt:  DLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVS

Query:  RYFQKAKPTQQRERIVSRYFQNSEMERAAHNEDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSK
        R+FQ++K TQQ ERIVSRYFQ+SE+E+A+HNEDED N T+QP KRS VG+Y + RRKDV PSSDNSKAYQ S+RK+SRSV+ SG DKRVR VSRYFQNS+
Subjt:  RYFQKAKPTQQRERIVSRYFQNSEMERAAHNEDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSK

Query:  KNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK
        KNPE+E E+S SLQNS+ NQQGERVVSRFFQKSE+Q+VVNNQQEV +QP+QCA+SVKRIRKPAKERKVRDK SARPR TLSA ELFLEAYRRKS DDTWK
Subjt:  KNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK

Query:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGK
        PP SGIRLLQQDHAYDPWRVLVICMLLNRTTG QAKEVIPKLFTLCP+PK+ LEVSQEQIEDIIRPLGLQRKRS T+QRLSEMYLKESWSHVTQLPGVGK
Subjt:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGK

Query:  YGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL
        YGADAHAIFCTGYWT+V PKDHMLNYYWEFL SIKHLL
Subjt:  YGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL

XP_022931728.1 methyl-CpG-binding domain protein 4-like protein [Cucurbita moschata]5.0e-23479.15Show/hide
Query:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTPKDFTQKSRILMRQDSPIS------TSEATHQKKSSGHEIPISSIG
        MTATTI+NPN+SPPSSS++PD LFSQFAF+G SSSRFRFPPSKCPSESN QNPTP+DFTQK   LM Q+SPIS      TSE+ HQK ++G EIPI  I 
Subjt:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTPKDFTQKSRILMRQDSPIS------TSEATHQKKSSGHEIPISSIG

Query:  DLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVS
        DL DNPKR   TLT+EDVQEVSP  PT E ERVL HEPPILTLED+QN KSDHQPA +PPLARRVL+FYRQFGFD+ + Q+TPPSVRNS+PVQ+   VVS
Subjt:  DLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVS

Query:  RYFQKAKPTQQRERIVSRYFQNSEMERAAHN----EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYF
        R+FQ++K  QQ ERIVSRYFQ+SE+ERAAHN    EDED NVT+QP KRSRVG Y + RRKDV  SSDNSKAYQ S+RK+SR V+ SG DKRVR VSRYF
Subjt:  RYFQKAKPTQQRERIVSRYFQNSEMERAAHN----EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYF

Query:  QNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLD
        QNS+KNPE+E E+S  LQNS+  QQGER+VSRFFQKSE+Q+VVNNQQEVI+ P+QCA+SVKRIRKPAKERKVRDK SARPR TLSA ELFLEAYRRKS D
Subjt:  QNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLD

Query:  DTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLP
        DTWKPP SGIRLLQQDHAYDPWRVLVICMLLNRTTG QAKEVIPKLFTLCP+PK+ LEVSQEQIEDIIRPLGLQRKRS T+QRLSEMYLKESWSHVTQLP
Subjt:  DTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLP

Query:  GVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL
        GVGKYGADAHAIFCTGYWT+V PKDHMLNYYWEFL SIKHLL
Subjt:  GVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL

XP_022969557.1 methyl-CpG-binding domain protein 4-like protein isoform X1 [Cucurbita maxima]1.2e-20379.79Show/hide
Query:  MRQDSPIS------TSEATHQKKSSGHEIPISSIGDLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRV
        M  +SPIS      TSEA HQK ++GHEIPI  I  L D+PKREI TLT+EDVQEVSP  PT E ERVLAHEPPILTLEDLQN KSDHQPA KPPLARRV
Subjt:  MRQDSPIS------TSEATHQKKSSGHEIPISSIGDLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRV

Query:  LQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVSRYFQKAKPTQQRERIVSRYFQNSEMERAAHNEDE--DTNVTEQPSKRSRVGDYSRGRRKDVVPSS
        L+F RQFGFD+ + Q+TPPSVRNS+PVQ+   VVSR+FQ++K  QQ ERIVSRYFQ+SE+ERAAHNEDE  D NVT+QP KRSRVG Y + RRKDV  SS
Subjt:  LQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVSRYFQKAKPTQQRERIVSRYFQNSEMERAAHNEDE--DTNVTEQPSKRSRVGDYSRGRRKDVVPSS

Query:  DNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPA
        DNSKAYQ S+RK+SRS++ SG DKRVR VSRYFQNS+KNPE+E E+S SLQNS+ NQQ ERVVSRFFQKSE+ +VVNNQQEVI+ P+QCA+SVKRIRKPA
Subjt:  DNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPA

Query:  KERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDI
        KERKVRDK SA+PR TLSA ELFLEAYRRKS DDTWKPP SGIRLLQQDHAYDPWRVLVICMLLNRTTG QAKEVIPKLFTLCP+PK+ LEVSQEQIEDI
Subjt:  KERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDI

Query:  IRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL
        IRPLGLQRKRS T+QRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWT+V PKDHMLNYYWEFL SIKHLL
Subjt:  IRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL

XP_023529473.1 methyl-CpG-binding domain protein 4-like protein [Cucurbita pepo subsp. pepo]4.9e-22978.33Show/hide
Query:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTPKDFTQKSRILMRQDSPIS------TSEATHQKKSSGHEIPISSIG
        M+ATTI+NPN+SPPSSS++PD       F    SSRFRFPPSKCPS+SN QNPTP+DFTQK   LM Q+SPIS      TSE+ HQK + GHEIPI  I 
Subjt:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTPKDFTQKSRILMRQDSPIS------TSEATHQKKSSGHEIPISSIG

Query:  DLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVS
        DL DNPKR   TLT+EDVQ+VSP  PT E ERVLAHEPPILTLEDLQN KSDHQPA KPPLARRVL+FYRQFGFD+ + Q+TPPSVRNS+PVQ+   VVS
Subjt:  DLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVS

Query:  RYFQKAKPTQQRERIVSRYFQNSEMERAAHN--EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQN
        R+FQ++K TQQ ERIVSRYFQ+SE+E+AAHN  EDED NVT+QP KRSRVG+Y + RRKDV  SSDNSKAYQ S+RK+SRSV+ SG DKRVR VSRYFQN
Subjt:  RYFQKAKPTQQRERIVSRYFQNSEMERAAHN--EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQN

Query:  SKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDT
        S+KNPE+E E+S SLQNS+  QQGER+VSRFFQKSE+Q+VVNNQQEVI+ P+QCA+SVKRIRKPAKERKVRDK SARPR TLSA ELFLEAYRRKS DDT
Subjt:  SKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDT

Query:  WKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGV
        WKPP SGIRLLQQDHAYDPWRVLVICMLLNRTTG QAK+VIPKLFTLCP+PK+ LEVSQEQIEDIIRPLGLQRKRS T+QRLSEMYLKESWSHVTQLPGV
Subjt:  WKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGV

Query:  GKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL
        GKYGADAHAIFCTGYWT+V PKDHMLNYYWEFL SIKHLL
Subjt:  GKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL

TrEMBL top hitse value%identityAlignment
A0A0A0KRW9 ENDO3c domain-containing protein7.0e-14961.81Show/hide
Query:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTP-KDFTQKSRILMRQDSPISTSEATHQKKSSGHEIPISSIGDLLDN
        M +TT I+PN++PPSSS+YP DLFS+F FRG+S SRFRFPPSK    S  Q+P P +D T        Q SP+ST       + S H        + L +
Subjt:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTP-KDFTQKSRILMRQDSPISTSEATHQKKSSGHEIPISSIGDLLDN

Query:  PKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVSRYFQK
        P  E+                         HEPPILTLEDLQN K   Q  K+P LARRVL FYR+FGFD+ + Q T  SV NS+P Q+G  VVSRYFQ 
Subjt:  PKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVSRYFQK

Query:  AKPTQQRERIVSRYFQNSEMERAAHNEDEDT--NVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSKKNP
        ++ TQQ +RIVSRYFQ S  ER AH EDE+   N+TEQPSKRS     S+ RRKDV P SDNSK   HS+ K +RSVQ SG D +VR VS YFQ+ +K+ 
Subjt:  AKPTQQRERIVSRYFQNSEMERAAHNEDEDT--NVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSKKNP

Query:  ELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDK-SSARPRGTLSAAELFLEAYRRKSLDDTWKPP
        E++RE+S SLQNS+ NQQ E+VVSRFF KS KQQ VNNQ+E  EQ NQCA+SVKR+RKP  ERK +DK SS +PR TL+AAELFLEAYRRKS  DTWKPP
Subjt:  ELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDK-SSARPRGTLSAAELFLEAYRRKSLDDTWKPP

Query:  SSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYG
        +SG RLLQ DHAYDPWRVLVICMLLNRT+G QAKEVIPKLF+LCPNPKATLEVS+EQIEDIIRPLG  RKRSRTM RLSEMYLKESWSHVTQLPGVGKY 
Subjt:  SSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYG

Query:  ADAHAIFC
        A    + C
Subjt:  ADAHAIFC

A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein1.4e-17366.42Show/hide
Query:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTP-KDFTQKSRILMRQDSPISTSEATHQKKSSGHEIPISSIGDLLDN
        M ATT INPN++PPSSS+YP DLFS+F FRG+S SRFRFPPSK    S HQNP P +D TQ S I    D  + TSE  +    S            L +
Subjt:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTP-KDFTQKSRILMRQDSPISTSEATHQKKSSGHEIPISSIGDLLDN

Query:  PKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVSRYFQK
        P  E                         A EPPILTLEDLQN K   Q  KKP LARRVL FYR+FGFD+ + Q T  SV NS PVQ+G  VVSRYFQ 
Subjt:  PKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVSRYFQK

Query:  AKPTQQRERIVSRYFQNSEMERAAHNEDE--DTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSKKNP
        ++ TQQRERIVSRYF+ S  ERAAH EDE  D N+TEQPSKRS     S+ RRKDV PSS NSK   HSM K SRSVQ S  D R R VS YFQ S+K+ 
Subjt:  AKPTQQRERIVSRYFQNSEMERAAHNEDE--DTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSKKNP

Query:  ELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDK-SSARPRGTLSAAELFLEAYRRKSLDDTWKPP
        E++RE+S SLQNS+ NQQ E++VSRFF KS KQQ VNNQ+E  EQ NQCA+SVKR+RKP  ERK ++K SS +PR TL+AAELFLEAYRRKS DDTWKPP
Subjt:  ELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDK-SSARPRGTLSAAELFLEAYRRKSLDDTWKPP

Query:  SSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYG
         SG RLLQ DHAYDPWRVLVICMLLNRT+G QAKEVIPKLF+LCPNPKATLEVS+EQIEDIIRPLGL RKRSRTM RLSEMYLKESWSHVTQLPGVGKYG
Subjt:  SSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYG

Query:  ADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL
        ADAHAIFCTGYW++V PKDHMLNYYW+FL SIKHLL
Subjt:  ADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein3.0e-16065.82Show/hide
Query:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTP-KDFTQKSRILMRQDSPISTSEATHQKKSSGHEIPISSIGDLLDN
        M ATT INPN++PPSSS+YP DLFS+F FRG+S SRFRFPPSK    S HQNP P +D TQ S I    D  + TSE  +    S            L +
Subjt:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTP-KDFTQKSRILMRQDSPISTSEATHQKKSSGHEIPISSIGDLLDN

Query:  PKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVSRYFQK
        P  E                         A EPPILTLEDLQN K   Q  KKP LARRVL FYR+FGFD+ + Q T  SV NS PVQ+G  VVSRYFQ 
Subjt:  PKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVSRYFQK

Query:  AKPTQQRERIVSRYFQNSEMERAAHNEDE--DTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSKKNP
        ++ TQQRERIVSRYF+ S  ERAAH EDE  D N+TEQPSKRS     S+ RRKDV PSS NSK   HSM K SRSVQ S  D R R VS YFQ S+K+ 
Subjt:  AKPTQQRERIVSRYFQNSEMERAAHNEDE--DTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSKKNP

Query:  ELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDK-SSARPRGTLSAAELFLEAYRRKSLDDTWKPP
        E++RE+S SLQNS+ NQQ E++VSRFF KS KQQ VNNQ+E  EQ NQCA+SVKR+RKP  ERK ++K SS +PR TL+AAELFLEAYRRKS DDTWKPP
Subjt:  ELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDK-SSARPRGTLSAAELFLEAYRRKSLDDTWKPP

Query:  SSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYG
         SG RLLQ DHAYDPWRVLVICMLLNRT+G QAKEVIPKLF+LCPNPKATLEVS+EQIEDIIRPLGL RKRSRTM RLSEMYLKESWSHVTQLPGVGKYG
Subjt:  SSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYG

Query:  ADAHAIFCTGYW
        ADAHAIFCTGYW
Subjt:  ADAHAIFCTGYW

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein2.4e-23479.15Show/hide
Query:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTPKDFTQKSRILMRQDSPIS------TSEATHQKKSSGHEIPISSIG
        MTATTI+NPN+SPPSSS++PD LFSQFAF+G SSSRFRFPPSKCPSESN QNPTP+DFTQK   LM Q+SPIS      TSE+ HQK ++G EIPI  I 
Subjt:  MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTPKDFTQKSRILMRQDSPIS------TSEATHQKKSSGHEIPISSIG

Query:  DLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVS
        DL DNPKR   TLT+EDVQEVSP  PT E ERVL HEPPILTLED+QN KSDHQPA +PPLARRVL+FYRQFGFD+ + Q+TPPSVRNS+PVQ+   VVS
Subjt:  DLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVS

Query:  RYFQKAKPTQQRERIVSRYFQNSEMERAAHN----EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYF
        R+FQ++K  QQ ERIVSRYFQ+SE+ERAAHN    EDED NVT+QP KRSRVG Y + RRKDV  SSDNSKAYQ S+RK+SR V+ SG DKRVR VSRYF
Subjt:  RYFQKAKPTQQRERIVSRYFQNSEMERAAHN----EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYF

Query:  QNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLD
        QNS+KNPE+E E+S  LQNS+  QQGER+VSRFFQKSE+Q+VVNNQQEVI+ P+QCA+SVKRIRKPAKERKVRDK SARPR TLSA ELFLEAYRRKS D
Subjt:  QNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLD

Query:  DTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLP
        DTWKPP SGIRLLQQDHAYDPWRVLVICMLLNRTTG QAKEVIPKLFTLCP+PK+ LEVSQEQIEDIIRPLGLQRKRS T+QRLSEMYLKESWSHVTQLP
Subjt:  DTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLP

Query:  GVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL
        GVGKYGADAHAIFCTGYWT+V PKDHMLNYYWEFL SIKHLL
Subjt:  GVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X15.9e-20479.79Show/hide
Query:  MRQDSPIS------TSEATHQKKSSGHEIPISSIGDLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRV
        M  +SPIS      TSEA HQK ++GHEIPI  I  L D+PKREI TLT+EDVQEVSP  PT E ERVLAHEPPILTLEDLQN KSDHQPA KPPLARRV
Subjt:  MRQDSPIS------TSEATHQKKSSGHEIPISSIGDLLDNPKREIPTLTLEDVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRV

Query:  LQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVSRYFQKAKPTQQRERIVSRYFQNSEMERAAHNEDE--DTNVTEQPSKRSRVGDYSRGRRKDVVPSS
        L+F RQFGFD+ + Q+TPPSVRNS+PVQ+   VVSR+FQ++K  QQ ERIVSRYFQ+SE+ERAAHNEDE  D NVT+QP KRSRVG Y + RRKDV  SS
Subjt:  LQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVSRYFQKAKPTQQRERIVSRYFQNSEMERAAHNEDE--DTNVTEQPSKRSRVGDYSRGRRKDVVPSS

Query:  DNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPA
        DNSKAYQ S+RK+SRS++ SG DKRVR VSRYFQNS+KNPE+E E+S SLQNS+ NQQ ERVVSRFFQKSE+ +VVNNQQEVI+ P+QCA+SVKRIRKPA
Subjt:  DNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPA

Query:  KERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDI
        KERKVRDK SA+PR TLSA ELFLEAYRRKS DDTWKPP SGIRLLQQDHAYDPWRVLVICMLLNRTTG QAKEVIPKLFTLCP+PK+ LEVSQEQIEDI
Subjt:  KERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDI

Query:  IRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL
        IRPLGLQRKRS T+QRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWT+V PKDHMLNYYWEFL SIKHLL
Subjt:  IRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL

SwissProt top hitse value%identityAlignment
O95243 Methyl-CpG-binding domain protein 41.5e-1823.68Show/hide
Query:  RNSIPVQQGVPVVSRYFQKAKPTQQRERIVSRYFQ-NSEMERAAHNEDEDTNVTEQPSKRSR---VGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQN
        +  + + +G+P+        K  +   +  S + Q +S+ E   +  D ++    Q S+  R   + D         V S +NS   +     +S S  N
Subjt:  RNSIPVQQGVPVVSRYFQKAKPTQQRERIVSRYFQ-NSEMERAAHNEDEDTNVTEQPSKRSR---VGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQN

Query:  SGADKRVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSA
          ++++   +   F ++K +   E+     L++ EI  + E V        E+++ ++   +++++ ++   +    RK     K+  + +  PR  +  
Subjt:  SGADKRVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSA

Query:  AE---LFLEAYRRKSLD-------DTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRK
         +    F   Y +++L          W PP S   L+Q+   +DPW++L+  + LNRT+G  A  V+ K     P+ +         + ++++PLGL   
Subjt:  AE---LFLEAYRRKSLD-------DTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRK

Query:  RSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFL
        R++T+ + S+ YL + W +  +L G+GKYG D++ IFC   W QV P+DH LN Y ++L
Subjt:  RSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFL

Q0IGK1 Methyl-CpG-binding domain protein 4-like protein2.1e-4940.96Show/hide
Query:  EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKA-SRSV-QNSGADK---RVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGE---
        +D+D +V++   +R    ++    R+ V P    S   Q S     S SV    G  K   +V  VS YFQ S  + + + +I  S Q+    ++G    
Subjt:  EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKA-SRSV-QNSGADK---RVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGE---

Query:  ----RVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRK-------------------PAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK
            R VS +FQ+S           V EQPNQ  + ++   K                     K R VR      P   LS ++   + Y RK+ D+TW 
Subjt:  ----RVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRK-------------------PAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK

Query:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGK
        PP S   LLQ+DH +DPWRVLVICMLLN+T+G Q + VI  LF LC + K   EV +E+IE++I+PLGLQ+KR++ +QRLS  YL+ESW+HVTQL GVGK
Subjt:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGK

Query:  YGADAHAIFCTGYWTQVAPKDHMLNYYWEFLR
        Y ADA+AIFC G W +V P DHMLNYYW++LR
Subjt:  YGADAHAIFCTGYWTQVAPKDHMLNYYWEFLR

Q7LX22 Thymine/uracil-DNA glycosylase1.2e-0734.02Show/hide
Query:  AYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHV-------TQLPGVGKYGA
        A DPW VLV  +LL +TT  Q  ++  +     P+P    + S E+I+ II+PLG++  R+  +++LSE  ++     +         LPGVG Y A
Subjt:  AYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHV-------TQLPGVGKYGA

Q9YDP0 Thymine-DNA glycosylase5.2e-0830.33Show/hide
Query:  LEAYRRKSLDDTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMY--
        +EA RR+ ++  W     G + L   +  DPW +LV   LL +TT  Q   V  +     PNPKA     ++++ ++IRPLG++ +R++ +  L++    
Subjt:  LEAYRRKSLDDTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMY--

Query:  -----LKESWSHVTQLPGVGKY
             +  S   + +LPGVG Y
Subjt:  -----LKESWSHVTQLPGVGKY

Q9Z2D7 Methyl-CpG-binding domain protein 43.2e-2128.4Show/hide
Query:  FQNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSL
        F +++   E  RE +  L++ EI  +G+R       K E        Q+  E P+ C+++ K       +     ++    R T   +  F   Y +++L
Subjt:  FQNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSL

Query:  D-------DTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKES
                  W PP S   L+Q+   +DPW++L+  + LNRT+G  A  V+ +     P+ +         + ++++PLGL   R++T+ + S+ YL + 
Subjt:  D-------DTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKES

Query:  WSHVTQLPGVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFL
        W +  +L G+GKYG D++ IFC   W QV P+DH LN Y ++L
Subjt:  WSHVTQLPGVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFL

Arabidopsis top hitse value%identityAlignment
AT3G07930.1 DNA glycosylase superfamily protein4.4e-1032.03Show/hide
Query:  EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKA-SRSV-QNSGADK---RVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGE---
        +D+D +V++   +R    ++    R+ V P    S   Q S     S SV    G  K   +V  VS YFQ S  + + + +I  S Q+    ++G    
Subjt:  EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKA-SRSV-QNSGADK---RVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGE---

Query:  ----RVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRK-------------------PAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK
            R VS +FQ+S           V EQPNQ  + ++   K                     K R VR      P   LS ++   + Y RK+ D+TW 
Subjt:  ----RVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRK-------------------PAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK

Query:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTT
        PP S   LLQ+DH +DPWRVLVICMLLN+T+
Subjt:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTT

AT3G07930.2 DNA glycosylase superfamily protein1.8e-1132.48Show/hide
Query:  EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKA-SRSV-QNSGADK---RVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGE---
        +D+D +V++   +R    ++    R+ V P    S   Q S     S SV    G  K   +V  VS YFQ S  + + + +I  S Q+    ++G    
Subjt:  EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKA-SRSV-QNSGADK---RVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGE---

Query:  ----RVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRK-------------------PAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK
            R VS +FQ+S           V EQPNQ  + ++   K                     K R VR      P   LS ++   + Y RK+ D+TW 
Subjt:  ----RVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRK-------------------PAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK

Query:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQ
        PP S   LLQ+DH +DPWRVLVICMLLN+T+G Q
Subjt:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQ

AT3G07930.3 DNA glycosylase superfamily protein1.5e-5040.96Show/hide
Query:  EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKA-SRSV-QNSGADK---RVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGE---
        +D+D +V++   +R    ++    R+ V P    S   Q S     S SV    G  K   +V  VS YFQ S  + + + +I  S Q+    ++G    
Subjt:  EDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKA-SRSV-QNSGADK---RVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGE---

Query:  ----RVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRK-------------------PAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK
            R VS +FQ+S           V EQPNQ  + ++   K                     K R VR      P   LS ++   + Y RK+ D+TW 
Subjt:  ----RVVSRFFQKSEKQQVVNNQQEVIEQPNQCAESVKRIRK-------------------PAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWK

Query:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGK
        PP S   LLQ+DH +DPWRVLVICMLLN+T+G Q + VI  LF LC + K   EV +E+IE++I+PLGLQ+KR++ +QRLS  YL+ESW+HVTQL GVGK
Subjt:  PPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLCPNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGK

Query:  YGADAHAIFCTGYWTQVAPKDHMLNYYWEFLR
        Y ADA+AIFC G W +V P DHMLNYYW++LR
Subjt:  YGADAHAIFCTGYWTQVAPKDHMLNYYWEFLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGCAACTACGATCATCAACCCTAATGTCTCCCCTCCATCCTCTTCTACATATCCGGATGATTTGTTTTCCCAATTCGCCTTTCGAGGTAGTTCCTCCTCCAGATT
TCGCTTTCCTCCTTCCAAATGCCCATCCGAATCGAATCATCAAAACCCTACGCCGAAGGATTTTACTCAAAAGAGTAGGATTCTCATGAGGCAAGACTCTCCAATTTCAA
CTTCAGAAGCGACTCATCAGAAGAAATCCTCAGGCCACGAGATTCCGATTTCGTCTATTGGGGATCTTCTAGATAACCCGAAGCGTGAGATTCCCACATTAACCCTAGAG
GATGTCCAAGAAGTTTCACCTAATGCACCAACTTTAGAAGCGGAGAGAGTTTTAGCGCACGAGCCTCCTATATTAACTCTAGAGGATCTTCAAAATACAAAATCAGACCA
TCAACCGGCAAAAAAGCCTCCACTGGCGCGTAGGGTCTTACAGTTTTACCGGCAGTTCGGGTTTGATCAACCAATGGCGCAAAGAACTCCGCCTTCTGTCCGAAATTCAA
TACCAGTTCAACAAGGTGTACCCGTAGTTTCACGTTATTTCCAGAAAGCAAAACCAACCCAACAAAGAGAACGAATTGTATCGCGCTACTTTCAAAACTCGGAGATGGAA
CGAGCAGCCCATAATGAGGATGAGGATACCAATGTCACAGAACAGCCAAGCAAAAGATCAAGGGTCGGAGATTATAGCAGAGGGAGGAGGAAAGACGTAGTTCCCAGCTC
CGATAATTCAAAAGCGTATCAACACTCAATGAGAAAAGCTTCACGTTCTGTTCAAAACTCAGGAGCTGACAAACGGGTGCGAAATGTTTCACGCTATTTCCAAAATTCAA
AAAAGAATCCTGAATTGGAGCGAGAAATTTCACATTCGTTACAAAATTCAGAAATAAATCAACAAGGAGAGCGCGTAGTCTCACGTTTCTTTCAAAAATCAGAAAAACAA
CAAGTGGTGAACAATCAGCAAGAGGTTATAGAGCAGCCAAATCAGTGTGCAGAATCTGTTAAAAGAATCCGTAAACCAGCCAAGGAAAGGAAAGTGAGGGATAAAAGTTC
TGCCAGGCCTAGAGGCACTCTTTCTGCTGCCGAGTTGTTTTTGGAAGCTTATAGAAGAAAATCGTTAGATGATACATGGAAGCCTCCTTCCTCTGGGATTCGCCTTCTCC
AACAGGATCATGCGTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCCTTAACCGGACGACTGGGGTGCAGGCAAAAGAAGTGATACCTAAACTCTTCACTCTGTGT
CCCAATCCAAAGGCTACTTTGGAGGTATCACAAGAGCAAATAGAAGATATTATTCGACCTCTTGGTCTTCAAAGAAAAAGATCGCGAACAATGCAGCGTTTATCTGAGAT
GTATTTAAAAGAAAGTTGGAGTCATGTCACTCAGCTTCCTGGTGTTGGCAAGTATGGAGCTGATGCACATGCAATATTTTGCACTGGATATTGGACCCAAGTAGCGCCTA
AAGATCACATGCTTAATTATTACTGGGAGTTTCTCCGCAGCATAAAACACCTGCTCTGA
mRNA sequenceShow/hide mRNA sequence
AAAGTTCATAAAGTGATCATTTCCTGTGCGCTAAATGCTCTTCCTTCTCGGCAGCCGGCATGACTGCAACTACGATCATCAACCCTAATGTCTCCCCTCCATCCTCTTCT
ACATATCCGGATGATTTGTTTTCCCAATTCGCCTTTCGAGGTAGTTCCTCCTCCAGATTTCGCTTTCCTCCTTCCAAATGCCCATCCGAATCGAATCATCAAAACCCTAC
GCCGAAGGATTTTACTCAAAAGAGTAGGATTCTCATGAGGCAAGACTCTCCAATTTCAACTTCAGAAGCGACTCATCAGAAGAAATCCTCAGGCCACGAGATTCCGATTT
CGTCTATTGGGGATCTTCTAGATAACCCGAAGCGTGAGATTCCCACATTAACCCTAGAGGATGTCCAAGAAGTTTCACCTAATGCACCAACTTTAGAAGCGGAGAGAGTT
TTAGCGCACGAGCCTCCTATATTAACTCTAGAGGATCTTCAAAATACAAAATCAGACCATCAACCGGCAAAAAAGCCTCCACTGGCGCGTAGGGTCTTACAGTTTTACCG
GCAGTTCGGGTTTGATCAACCAATGGCGCAAAGAACTCCGCCTTCTGTCCGAAATTCAATACCAGTTCAACAAGGTGTACCCGTAGTTTCACGTTATTTCCAGAAAGCAA
AACCAACCCAACAAAGAGAACGAATTGTATCGCGCTACTTTCAAAACTCGGAGATGGAACGAGCAGCCCATAATGAGGATGAGGATACCAATGTCACAGAACAGCCAAGC
AAAAGATCAAGGGTCGGAGATTATAGCAGAGGGAGGAGGAAAGACGTAGTTCCCAGCTCCGATAATTCAAAAGCGTATCAACACTCAATGAGAAAAGCTTCACGTTCTGT
TCAAAACTCAGGAGCTGACAAACGGGTGCGAAATGTTTCACGCTATTTCCAAAATTCAAAAAAGAATCCTGAATTGGAGCGAGAAATTTCACATTCGTTACAAAATTCAG
AAATAAATCAACAAGGAGAGCGCGTAGTCTCACGTTTCTTTCAAAAATCAGAAAAACAACAAGTGGTGAACAATCAGCAAGAGGTTATAGAGCAGCCAAATCAGTGTGCA
GAATCTGTTAAAAGAATCCGTAAACCAGCCAAGGAAAGGAAAGTGAGGGATAAAAGTTCTGCCAGGCCTAGAGGCACTCTTTCTGCTGCCGAGTTGTTTTTGGAAGCTTA
TAGAAGAAAATCGTTAGATGATACATGGAAGCCTCCTTCCTCTGGGATTCGCCTTCTCCAACAGGATCATGCGTACGACCCTTGGAGGGTTCTAGTCATATGTATGCTCC
TTAACCGGACGACTGGGGTGCAGGCAAAAGAAGTGATACCTAAACTCTTCACTCTGTGTCCCAATCCAAAGGCTACTTTGGAGGTATCACAAGAGCAAATAGAAGATATT
ATTCGACCTCTTGGTCTTCAAAGAAAAAGATCGCGAACAATGCAGCGTTTATCTGAGATGTATTTAAAAGAAAGTTGGAGTCATGTCACTCAGCTTCCTGGTGTTGGCAA
GTATGGAGCTGATGCACATGCAATATTTTGCACTGGATATTGGACCCAAGTAGCGCCTAAAGATCACATGCTTAATTATTACTGGGAGTTTCTCCGCAGCATAAAACACC
TGCTCTGAAGTCTGATCTTATCTGACTGTAGATGGTTTGGGAAGTTGTAAATTTCATGGTCTACTTCCTCTCTCTCTATATATATATTATATTTTGGAACGTTACATTTT
TTTGTGCTCTCGGTTTTGTAAATGTTCTTTTTGTTGTTGTTGGGAGTCTGTGAGAAGGGGAGAAATGGGAATTCTCCTCCAACAGCTAATTAGACTGTAGGCTGAATGAA
GTAGCGTGAGGGTTATGCTATGTTTTGTCTGGGGAAAGGTCGACTAAAGATCGGATCTTTTGTAATAAGACTATGGGCTGTAACCTTCGTAGCTCGCATTTAATCTATGA
GTATATGATAAGGGATAACTTTGTGCCA
Protein sequenceShow/hide protein sequence
MTATTIINPNVSPPSSSTYPDDLFSQFAFRGSSSSRFRFPPSKCPSESNHQNPTPKDFTQKSRILMRQDSPISTSEATHQKKSSGHEIPISSIGDLLDNPKREIPTLTLE
DVQEVSPNAPTLEAERVLAHEPPILTLEDLQNTKSDHQPAKKPPLARRVLQFYRQFGFDQPMAQRTPPSVRNSIPVQQGVPVVSRYFQKAKPTQQRERIVSRYFQNSEME
RAAHNEDEDTNVTEQPSKRSRVGDYSRGRRKDVVPSSDNSKAYQHSMRKASRSVQNSGADKRVRNVSRYFQNSKKNPELEREISHSLQNSEINQQGERVVSRFFQKSEKQ
QVVNNQQEVIEQPNQCAESVKRIRKPAKERKVRDKSSARPRGTLSAAELFLEAYRRKSLDDTWKPPSSGIRLLQQDHAYDPWRVLVICMLLNRTTGVQAKEVIPKLFTLC
PNPKATLEVSQEQIEDIIRPLGLQRKRSRTMQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWTQVAPKDHMLNYYWEFLRSIKHLL