; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g1464 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g1464
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionmethyl-CpG-binding domain protein 4-like protein
Genome locationMC02:21113301..21119768
RNA-Seq ExpressionMC02g1464
SyntenyMC02g1464
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
IPR045138 - Methyl-CpG binding protein MeCP2/MBD4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7022375.1 Methyl-CpG-binding domain protein 4-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]4.03e-16957.95Show/hide
Query:  PNLCSYPDDLFSQFAFRGNGCSRFRSTPAK----SDQRKPTVPAEDFTQ-------------------TSEPNHQ--ASGHEIPILTLEDLQD-------
        P   S+PD LFSQFAF+G   SRFR  P+K    S+++ PT   EDFTQ                   TSE NHQ  A+ HEIPIL +EDLQD       
Subjt:  PNLCSYPDDLFSQFAFRGNGCSRFRSTPAK----SDQRKPTVPAEDFTQ-------------------TSEPNHQ--ASGHEIPILTLEDLQD-------

Query:  ------VHQSSKPN----------HEIPILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQ-------------HGVRK
              V + S             HE PILTLEDL NAK   Q    P   R +L  YR+FGFD ++VQKT P VR+S  VQ                ++
Subjt:  ------VHQSSKPN----------HEIPILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQ-------------HGVRK

Query:  GEPIVSRYFQSSE------DEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSV----------GTDLPS------FQNSGKNQE-----GN
        GE IVSRYFQ SE      +ED D N T++  KR  VG+Y  R R DVAP+S  +KA Q S+          GTD         FQNS KN E       
Subjt:  GEPIVSRYFQSSE------DEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSV----------GTDLPS------FQNSGKNQE-----GN

Query:  SFKNSKPNQEGERVVSRFFQKSAKQQAVNNQ-EGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLLQ
        S +NSK NQ+GERVVSRFFQKS +Q+ VNNQ E  ++ +QCA+ VKR RK AKERK   K  S+RPRTTLSA ELFLEAYRRKS DDTW PPPS IRLLQ
Subjt:  SFKNSKPNQEGERVVSRFFQKSAKQQAVNNQ-EGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLLQ

Query:  QDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFC
        QDHAYDPWRVL+IC+LLNRT+G+QAK+VIPKLFTLCP+PK+ALEVS EQIEDIIRPLGLQRKRS TIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFC
Subjt:  QDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFC

Query:  TGYWDQVLPKDHMLNYYWEFLQSIKEHL
        TGYW +VLPKDHMLNYYWEFL SIK  L
Subjt:  TGYWDQVLPKDHMLNYYWEFLQSIKEHL

XP_008460559.1 PREDICTED: methyl-CpG-binding domain protein 4-like protein [Cucumis melo]8.79e-15956.97Show/hide
Query:  NPNLC-----SYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNH--------------EIP
        NPNL      SYP DLFS+F FRG   SRFR  P+KS  + P  P +D TQ S           PI TL DLQ    +S+PN+              E P
Subjt:  NPNLC-----SYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNH--------------EIP

Query:  ILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR-------------KGEPIVSRYFQSS--------EDEDLDSN
        ILTLEDL N K   Q+ + P   R +L+ YR+FGFD KL+Q TS  V +SE VQ G R             + E IVSRYF+ S        EDE+ D N
Subjt:  ILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR-------------KGEPIVSRYFQSS--------EDEDLDSN

Query:  VTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSVG----------TDLPS------FQNSGKNQEGN-----SFKNSKPNQEGERVVSRFFQKSAKQQ
        +T + +KR      S R R DV P+S  +K N HS+G          TD  +      FQ S K+ E +     S +NSK NQ+ E++VSRFF KS KQQ
Subjt:  VTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSVG----------TDLPS------FQNSGKNQEGN-----SFKNSKPNQEGERVVSRFFQKSAKQQ

Query:  AVNNQEGA-EESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAK
        AVNNQE A E+ NQCA+ VKR RK   ERK+ +K +S++PRTTL+AAELFLEAYRRKS DDTW PPPS  RLLQ DHAYDPWRVL+IC+LLNRTSGRQAK
Subjt:  AVNNQEGA-EESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAK

Query:  DVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKE
        +VIPKLF+LCPNPKA LEVS EQIEDIIRPLGL RKRSRT+ RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW +V PKDHMLNYYW+FL SIK 
Subjt:  DVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKE

Query:  HL
         L
Subjt:  HL

XP_022156995.1 methyl-CpG-binding domain protein 4-like protein [Momordica charantia]0.0100Show/hide
Query:  MAMSSSSSSSNNPNLCSYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNHEIPILTLEDLH
        MAMSSSSSSSNNPNLCSYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNHEIPILTLEDLH
Subjt:  MAMSSSSSSSNNPNLCSYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNHEIPILTLEDLH

Query:  NAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVRKGEPIVSRYFQSSEDEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTT
        NAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVRKGEPIVSRYFQSSEDEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTT
Subjt:  NAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVRKGEPIVSRYFQSSEDEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTT

Query:  KANQHSVGTDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFL
        KANQHSVGTDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFL
Subjt:  KANQHSVGTDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFL

Query:  EAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKE
        EAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKE
Subjt:  EAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKE

Query:  SWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL
        SWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL
Subjt:  SWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL

XP_022931728.1 methyl-CpG-binding domain protein 4-like protein [Cucurbita moschata]8.88e-16757.25Show/hide
Query:  NPNLC-----SYPDDLFSQFAFRGNGCSRFRSTPAK----SDQRKPTVPAEDFTQ-------------------TSEPNHQ--ASGHEIPILTLEDLQD-
        NPNL      S+PD LFSQFAF+G   SRFR  P+K    S+++ PT   EDFTQ                   TSE NHQ  A+G EIPIL +EDLQD 
Subjt:  NPNLC-----SYPDDLFSQFAFRGNGCSRFRSTPAK----SDQRKPTVPAEDFTQ-------------------TSEPNHQ--ASGHEIPILTLEDLQD-

Query:  ------------VHQSSKPN----------HEIPILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR--------
                    V + S             HE PILTLED+ NAK   Q    P   R +L  YR+FGFD ++VQKT P VR+S  VQ   R        
Subjt:  ------------VHQSSKPN----------HEIPILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR--------

Query:  -----KGEPIVSRYFQSSE----------DEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSV----------GTDLPS------FQNSGK
             +GE IVSRYFQ SE          DED D NVT++  KR  VG Y  R R DVA +S  +KA Q S+          GTD         FQNS K
Subjt:  -----KGEPIVSRYFQSSE----------DEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSV----------GTDLPS------FQNSGK

Query:  NQE-----GNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQ-EGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWD
        N E         +NSK  Q+GER+VSRFFQKS +Q+ VNNQ E  +  +QCA+ VKR RK AKERK   K  S+RPRTTLSA ELFLEAYRRKSSDDTW 
Subjt:  NQE-----GNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQ-EGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWD

Query:  PPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGK
        PPPS IRLLQQDHAYDPWRVL+IC+LLNRT+G+QAK+VIPKLFTLCP+PK+ALEVS EQIEDIIRPLGLQRKRS TIQRLSEMYLKESWSHVTQLPGVGK
Subjt:  PPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGK

Query:  YGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL
        YGADAHAIFCTGYW +VLPKDHMLNYYWEFL SIK  L
Subjt:  YGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL

XP_023529473.1 methyl-CpG-binding domain protein 4-like protein [Cucurbita pepo subsp. pepo]1.14e-16256.9Show/hide
Query:  NPNLCSYPDDLFSQFAFRGNGCSRFRSTPAK--SDQRKPTVPAEDFTQ-------------------TSEPNHQ--ASGHEIPILTLEDLQD--------
        NPNL       F  F+ + +  SRFR  P+K  SD        EDFTQ                   TSE NHQ  A GHEIPIL +EDLQD        
Subjt:  NPNLCSYPDDLFSQFAFRGNGCSRFRSTPAK--SDQRKPTVPAEDFTQ-------------------TSEPNHQ--ASGHEIPILTLEDLQD--------

Query:  -----VHQSSKPN----------HEIPILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR-------------KG
             V Q S             HE PILTLEDL NAK   Q    P   R +L  YR+FGFD ++VQKT P VR+S  VQ   R             +G
Subjt:  -----VHQSSKPN----------HEIPILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR-------------KG

Query:  EPIVSRYFQSSE--------DEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSVGTDLPS----------------FQNSGKNQE-----G
        E IVSRYFQ SE        DED D NVT++  KR  VG+Y  R R DVA +S  +KA Q S+     S                FQNS KN E      
Subjt:  EPIVSRYFQSSE--------DEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSVGTDLPS----------------FQNSGKNQE-----G

Query:  NSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQ-EGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLL
         S +NSK  Q+GER+VSRFFQKS +Q+ VNNQ E  +  +QCA+ VKR RK AKERK   K  S+RPRTTLSA ELFLEAYRRKSSDDTW PPPS IRLL
Subjt:  NSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQ-EGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLL

Query:  QQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIF
        QQDHAYDPWRVL+IC+LLNRT+G+QAKDVIPKLFTLCP+PK+ALEVS EQIEDIIRPLGLQRKRS TIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIF
Subjt:  QQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIF

Query:  CTGYWDQVLPKDHMLNYYWEFLQSIKEHL
        CTGYW +VLPKDHMLNYYWEFL SIK  L
Subjt:  CTGYWDQVLPKDHMLNYYWEFLQSIKEHL

TrEMBL top hitse value%identityAlignment
A0A1S3CCU6 methyl-CpG-binding domain protein 4-like protein4.25e-15956.97Show/hide
Query:  NPNLC-----SYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNH--------------EIP
        NPNL      SYP DLFS+F FRG   SRFR  P+KS  + P  P +D TQ S           PI TL DLQ    +S+PN+              E P
Subjt:  NPNLC-----SYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNH--------------EIP

Query:  ILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR-------------KGEPIVSRYFQSS--------EDEDLDSN
        ILTLEDL N K   Q+ + P   R +L+ YR+FGFD KL+Q TS  V +SE VQ G R             + E IVSRYF+ S        EDE+ D N
Subjt:  ILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR-------------KGEPIVSRYFQSS--------EDEDLDSN

Query:  VTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSVG----------TDLPS------FQNSGKNQEGN-----SFKNSKPNQEGERVVSRFFQKSAKQQ
        +T + +KR      S R R DV P+S  +K N HS+G          TD  +      FQ S K+ E +     S +NSK NQ+ E++VSRFF KS KQQ
Subjt:  VTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSVG----------TDLPS------FQNSGKNQEGN-----SFKNSKPNQEGERVVSRFFQKSAKQQ

Query:  AVNNQEGA-EESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAK
        AVNNQE A E+ NQCA+ VKR RK   ERK+ +K +S++PRTTL+AAELFLEAYRRKS DDTW PPPS  RLLQ DHAYDPWRVL+IC+LLNRTSGRQAK
Subjt:  AVNNQEGA-EESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAK

Query:  DVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKE
        +VIPKLF+LCPNPKA LEVS EQIEDIIRPLGL RKRSRT+ RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW +V PKDHMLNYYW+FL SIK 
Subjt:  DVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKE

Query:  HL
         L
Subjt:  HL

A0A5D3CU57 Methyl-CpG-binding domain protein 4-like protein3.66e-14554.9Show/hide
Query:  NPNLC-----SYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNH--------------EIP
        NPNL      SYP DLFS+F FRG   SRFR  P+KS  + P  P +D TQ S           PI TL DLQ    +S+PN+              E P
Subjt:  NPNLC-----SYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNH--------------EIP

Query:  ILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR-------------KGEPIVSRYFQSS--------EDEDLDSN
        ILTLEDL N K   Q+ + P   R +L+ YR+FGFD KL+Q TS  V +SE VQ G R             + E IVSRYF+ S        EDE+ D N
Subjt:  ILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR-------------KGEPIVSRYFQSS--------EDEDLDSN

Query:  VTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSVG----------TDLPS------FQNSGKNQEGN-----SFKNSKPNQEGERVVSRFFQKSAKQQ
        +T + +KR      S R R DV P+S  +K N HS+G          TD  +      FQ S K+ E +     S +NSK NQ+ E++VSRFF KS KQQ
Subjt:  VTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSVG----------TDLPS------FQNSGKNQEGN-----SFKNSKPNQEGERVVSRFFQKSAKQQ

Query:  AVNNQEGA-EESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAK
        AVNNQE A E+ NQCA+ VKR RK   ERK+ +K +S++PRTTL+AAELFLEAYRRKS DDTW PPPS  RLLQ DHAYDPWRVL+IC+LLNRTSGRQAK
Subjt:  AVNNQEGA-EESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAK

Query:  DVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNY
        +VIPKLF+LCPNPKA LEVS EQIEDIIRPLGL RKRSRT+ RLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW+  + +  ++++
Subjt:  DVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNY

A0A6J1DRX8 methyl-CpG-binding domain protein 4-like protein0.0100Show/hide
Query:  MAMSSSSSSSNNPNLCSYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNHEIPILTLEDLH
        MAMSSSSSSSNNPNLCSYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNHEIPILTLEDLH
Subjt:  MAMSSSSSSSNNPNLCSYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNHEIPILTLEDLH

Query:  NAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVRKGEPIVSRYFQSSEDEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTT
        NAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVRKGEPIVSRYFQSSEDEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTT
Subjt:  NAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVRKGEPIVSRYFQSSEDEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTT

Query:  KANQHSVGTDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFL
        KANQHSVGTDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFL
Subjt:  KANQHSVGTDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFL

Query:  EAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKE
        EAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKE
Subjt:  EAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKE

Query:  SWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL
        SWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL
Subjt:  SWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL

A0A6J1EZJ4 methyl-CpG-binding domain protein 4-like protein4.30e-16757.25Show/hide
Query:  NPNLC-----SYPDDLFSQFAFRGNGCSRFRSTPAK----SDQRKPTVPAEDFTQ-------------------TSEPNHQ--ASGHEIPILTLEDLQD-
        NPNL      S+PD LFSQFAF+G   SRFR  P+K    S+++ PT   EDFTQ                   TSE NHQ  A+G EIPIL +EDLQD 
Subjt:  NPNLC-----SYPDDLFSQFAFRGNGCSRFRSTPAK----SDQRKPTVPAEDFTQ-------------------TSEPNHQ--ASGHEIPILTLEDLQD-

Query:  ------------VHQSSKPN----------HEIPILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR--------
                    V + S             HE PILTLED+ NAK   Q    P   R +L  YR+FGFD ++VQKT P VR+S  VQ   R        
Subjt:  ------------VHQSSKPN----------HEIPILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVR--------

Query:  -----KGEPIVSRYFQSSE----------DEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSV----------GTDLPS------FQNSGK
             +GE IVSRYFQ SE          DED D NVT++  KR  VG Y  R R DVA +S  +KA Q S+          GTD         FQNS K
Subjt:  -----KGEPIVSRYFQSSE----------DEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSV----------GTDLPS------FQNSGK

Query:  NQE-----GNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQ-EGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWD
        N E         +NSK  Q+GER+VSRFFQKS +Q+ VNNQ E  +  +QCA+ VKR RK AKERK   K  S+RPRTTLSA ELFLEAYRRKSSDDTW 
Subjt:  NQE-----GNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQ-EGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWD

Query:  PPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGK
        PPPS IRLLQQDHAYDPWRVL+IC+LLNRT+G+QAK+VIPKLFTLCP+PK+ALEVS EQIEDIIRPLGLQRKRS TIQRLSEMYLKESWSHVTQLPGVGK
Subjt:  PPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGK

Query:  YGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL
        YGADAHAIFCTGYW +VLPKDHMLNYYWEFL SIK  L
Subjt:  YGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL

A0A6J1HWM5 methyl-CpG-binding domain protein 4-like protein isoform X14.15e-15659.87Show/hide
Query:  DFTQTSEPNHQ--ASGHEIPIL----------------TLEDLQDVHQSSKPN-------HEIPILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFD
        +  QTSE NHQ  A+GHEIPIL                T+ED+Q+V   +  +       HE PILTLEDL NAK   Q    P   R +L   R+FGFD
Subjt:  DFTQTSEPNHQ--ASGHEIPIL----------------TLEDLQDVHQSSKPN-------HEIPILTLEDLHNAKPFRQTIQNPRSPRTILNLYRKFGFD

Query:  IKLVQKTSPLVRHSEAVQHGVR-------------KGEPIVSRYFQSSE--------DEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSV
         ++VQKT P VR+S  VQ   R             +GE IVSRYFQ SE        DED D NVT++  KR  VG Y  R R DVA +S  +KA Q S+
Subjt:  IKLVQKTSPLVRHSEAVQHGVR-------------KGEPIVSRYFQSSE--------DEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSV

Query:  ----------GTDLPS------FQNSGKNQE-----GNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQ-EGAEESNQCARCVKRKRKRAKERKRTSKRN
                  GTD         FQNS KN E       S +NSK NQ+ ERVVSRFFQKS + + VNNQ E  +  +QCA+ VKR RK AKERK   K  
Subjt:  ----------GTDLPS------FQNSGKNQE-----GNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQ-EGAEESNQCARCVKRKRKRAKERKRTSKRN

Query:  SSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRK
        S++PRTTLSA ELFLEAYRRKSSDDTW PPPS IRLLQQDHAYDPWRVL+IC+LLNRT+G+QAK+VIPKLFTLCP+PK+ALEVS EQIEDIIRPLGLQRK
Subjt:  SSRPRTTLSAAELFLEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRK

Query:  RSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL
        RS TIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYW +VLPKDHMLNYYWEFL SIK  L
Subjt:  RSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL

SwissProt top hitse value%identityAlignment
O95243 Methyl-CpG-binding domain protein 42.9e-2331.02Show/hide
Query:  NSFKNSKPNQEGERVVSRFFQK---SAKQQAVNNQE---------GAEESNQCARCVKRKRKRAKERKRTSKRNS-SRPRTTLSAAELF----LEAYRRK
        N F ++K ++  E+    F +      K + V  +E         G+E  N C+   K        ++ T  R    R +T+L  +  +    L   RRK
Subjt:  NSFKNSKPNQEGERVVSRFFQK---SAKQQAVNNQE---------GAEESNQCARCVKRKRKRAKERKRTSKRNS-SRPRTTLSAAELF----LEAYRRK

Query:  SSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVT
        +    W PP S   L+Q+   +DPW++L+  + LNRTSG+ A  V+ K     P+ + A       + ++++PLGL   R++TI + S+ YL + W +  
Subjt:  SSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVT

Query:  QLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL
        +L G+GKYG D++ IFC   W QV P+DH LN Y ++L    E L
Subjt:  QLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL

Q0IGK1 Methyl-CpG-binding domain protein 4-like protein2.3e-4945.93Show/hide
Query:  TDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKR---------KRAKERKRTSKRNSSRPRTTLSAAELF
        +D+ S   SG+N      K S   Q   R VS +FQ+S   +  N  +  +      + VK  R           +++ K  + R +      LS ++  
Subjt:  TDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKR---------KRAKERKRTSKRNSSRPRTTLSAAELF

Query:  LEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLK
         + Y RK+ D+TW PP S   LLQ+DH +DPWRVL+IC+LLN+TSG Q + VI  LF LC + K A EV  E+IE++I+PLGLQ+KR++ IQRLS  YL+
Subjt:  LEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLK

Query:  ESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQ
        ESW+HVTQL GVGKY ADA+AIFC G WD+V P DHMLNYYW++L+
Subjt:  ESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQ

Q7LX22 Thymine/uracil-DNA glycosylase4.0e-0930.43Show/hide
Query:  AYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHV-------TQLPGVGKYGADAH
        A DPW VL+  LLL +T+ +Q  D+  +     P+P    + S E+I+ II+PLG++  R+  +++LSE  ++     +         LPGVG Y A   
Subjt:  AYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHV-------TQLPGVGKYGADAH

Query:  AIFCTGYWDQVLPKD
         +   G  + +L ++
Subjt:  AIFCTGYWDQVLPKD

Q9YDP0 Thymine-DNA glycosylase2.0e-0828.32Show/hide
Query:  DPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMY-------LKESWSHVTQLPGVGKYGADAHAI
        DPW +L+   LL +T+ RQ   V  +     PNPKA      +++ ++IRPLG++ +R++ +  L++         +  S   + +LPGVG Y A    +
Subjt:  DPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMY-------LKESWSHVTQLPGVGKYGADAHAI

Query:  FCTGYWDQVLPKD
           G  + +L ++
Subjt:  FCTGYWDQVLPKD

Q9Z2D7 Methyl-CpG-binding domain protein 41.4e-2230.18Show/hide
Query:  EGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELF----LEAYRRKSSDDTWDPPPSEIRLLQQDHAYD
        E E + S+  +K          +   E   C++  K       +     +    + +T+L  +  +    L   RRKS    W PP S   L+Q+   +D
Subjt:  EGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELF----LEAYRRKSSDDTWDPPPSEIRLLQQDHAYD

Query:  PWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWDQ
        PW++L+  + LNRTSG+ A  V+ +     P+ + A       + ++++PLGL   R++TI + S+ YL + W +  +L G+GKYG D++ IFC   W Q
Subjt:  PWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHAIFCTGYWDQ

Query:  VLPKDHMLNYYWEFLQSIKEHL
        V P+DH LN Y ++L    E L
Subjt:  VLPKDHMLNYYWEFLQSIKEHL

Arabidopsis top hitse value%identityAlignment
AT3G07930.1 DNA glycosylase superfamily protein1.2e-0833.79Show/hide
Query:  TDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKR---------KRAKERKRTSKRNSSRPRTTLSAAELF
        +D+ S   SG+N      K S   Q   R VS +FQ+S   +  N  +  +      + VK  R           +++ K  + R +      LS ++  
Subjt:  TDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKR---------KRAKERKRTSKRNSSRPRTTLSAAELF

Query:  LEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTS
         + Y RK+ D+TW PP S   LLQ+DH +DPWRVL+IC+LLN+TS
Subjt:  LEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTS

AT3G07930.2 DNA glycosylase superfamily protein4.9e-1034.46Show/hide
Query:  TDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKR---------KRAKERKRTSKRNSSRPRTTLSAAELF
        +D+ S   SG+N      K S   Q   R VS +FQ+S   +  N  +  +      + VK  R           +++ K  + R +      LS ++  
Subjt:  TDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKR---------KRAKERKRTSKRNSSRPRTTLSAAELF

Query:  LEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQ
         + Y RK+ D+TW PP S   LLQ+DH +DPWRVL+IC+LLN+TSG Q
Subjt:  LEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQ

AT3G07930.3 DNA glycosylase superfamily protein1.7e-5045.93Show/hide
Query:  TDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKR---------KRAKERKRTSKRNSSRPRTTLSAAELF
        +D+ S   SG+N      K S   Q   R VS +FQ+S   +  N  +  +      + VK  R           +++ K  + R +      LS ++  
Subjt:  TDLPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKR---------KRAKERKRTSKRNSSRPRTTLSAAELF

Query:  LEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLK
         + Y RK+ D+TW PP S   LLQ+DH +DPWRVL+IC+LLN+TSG Q + VI  LF LC + K A EV  E+IE++I+PLGLQ+KR++ IQRLS  YL+
Subjt:  LEAYRRKSSDDTWDPPPSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLK

Query:  ESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQ
        ESW+HVTQL GVGKY ADA+AIFC G WD+V P DHMLNYYW++L+
Subjt:  ESWSHVTQLPGVGKYGADAHAIFCTGYWDQVLPKDHMLNYYWEFLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATGAGTTCGAGTTCGAGCTCGAGCAACAACCCTAATCTGTGTTCATATCCTGACGATTTGTTTTCCCAATTCGCATTTCGGGGGAATGGGTGTTCCAGA
TTCCGCTCTACTCCTGCCAAATCGGATCAACGAAAACCAACGGTGCCGGCGGAGGATTTTACCCAAACTTCAGAACCAAATCATCAAGCCTCAGGGCACGAGATT
CCGATTTTGACTCTTGAGGATCTTCAAGATGTTCACCAGTCTTCTAAACCAAACCATGAGATTCCTATATTAACTCTAGAGGATCTCCACAATGCAAAACCATTC
CGTCAAACCATACAAAATCCTCGATCGCCTCGTACAATCTTAAATCTTTACCGAAAGTTCGGATTTGATATAAAATTGGTGCAAAAAACTTCACCTCTTGTCCGA
CATTCAGAAGCAGTTCAACACGGGGTACGTAAAGGAGAACCAATTGTCTCGCGCTACTTCCAGAGCTCGGAGGATGAGGATTTGGATTCCAATGTCACAAACCGC
TCAAATAAAAGATTAATGGTGGGAGATTACAGCGGAAGGGGGAGGAATGACGTAGCCCCCACCTCCGGTACTACAAAAGCAAATCAACATTCAGTGGGAACAGAT
TTACCCTCTTTCCAAAACTCAGGAAAGAATCAAGAAGGAAACTCATTCAAAAATTCCAAACCAAACCAAGAAGGAGAGCGAGTAGTTTCGCGTTTCTTTCAAAAA
TCAGCAAAACAACAAGCCGTGAACAATCAGGAGGGTGCAGAGGAGTCAAATCAGTGTGCAAGATGTGTTAAAAGAAAACGTAAACGAGCCAAGGAAAGGAAACGG
ACAAGTAAAAGAAATTCTTCAAGACCTAGGACCACTCTTTCTGCTGCCGAGTTGTTTTTGGAAGCTTATAGAAGGAAATCCTCAGATGACACATGGGACCCTCCT
CCCTCTGAAATTCGCCTTCTCCAACAGGATCATGCGTACGACCCTTGGAGGGTTCTACTCATATGTTTGCTTCTTAACCGGACAAGTGGGCGGCAGGCAAAAGAT
GTGATACCTAAACTCTTCACTTTGTGTCCCAATCCAAAGGCTGCTTTGGAGGTATCACATGAGCAGATAGAAGATATTATTCGACCTCTTGGTCTACAAAGAAAA
AGATCACGAACAATTCAGCGTTTGTCTGAGATGTATTTAAAAGAAAGTTGGAGTCATGTCACTCAGCTTCCTGGTGTTGGCAAGTATGGAGCTGATGCACATGCA
ATATTTTGCACTGGGTATTGGGACCAAGTATTACCTAAAGATCACATGCTTAATTATTACTGGGAGTTTCTCCAGAGCATAAAGGAACACCTCTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAGAAAAAGAAAAAAGAGTAAAATATAATATTAAAAAGGAAATTTCCAGAATCGTGCCGCTTAGGCTTACAAGTAAGGAGAAGCGGAAGGCGGCAAAAA
TTGCGAGGCTAATCTGGAAGCAGCCGATGCCATGGCCATGAGTTCGAGTTCGAGCTCGAGCAACAACCCTAATCTGTGTTCATATCCTGACGATTTGTTTTCCCA
ATTCGCATTTCGGGGGAATGGGTGTTCCAGATTCCGCTCTACTCCTGCCAAATCGGATCAACGAAAACCAACGGTGCCGGCGGAGGATTTTACCCAAACTTCAGA
ACCAAATCATCAAGCCTCAGGGCACGAGATTCCGATTTTGACTCTTGAGGATCTTCAAGATGTTCACCAGTCTTCTAAACCAAACCATGAGATTCCTATATTAAC
TCTAGAGGATCTCCACAATGCAAAACCATTCCGTCAAACCATACAAAATCCTCGATCGCCTCGTACAATCTTAAATCTTTACCGAAAGTTCGGATTTGATATAAA
ATTGGTGCAAAAAACTTCACCTCTTGTCCGACATTCAGAAGCAGTTCAACACGGGGTACGTAAAGGAGAACCAATTGTCTCGCGCTACTTCCAGAGCTCGGAGGA
TGAGGATTTGGATTCCAATGTCACAAACCGCTCAAATAAAAGATTAATGGTGGGAGATTACAGCGGAAGGGGGAGGAATGACGTAGCCCCCACCTCCGGTACTAC
AAAAGCAAATCAACATTCAGTGGGAACAGATTTACCCTCTTTCCAAAACTCAGGAAAGAATCAAGAAGGAAACTCATTCAAAAATTCCAAACCAAACCAAGAAGG
AGAGCGAGTAGTTTCGCGTTTCTTTCAAAAATCAGCAAAACAACAAGCCGTGAACAATCAGGAGGGTGCAGAGGAGTCAAATCAGTGTGCAAGATGTGTTAAAAG
AAAACGTAAACGAGCCAAGGAAAGGAAACGGACAAGTAAAAGAAATTCTTCAAGACCTAGGACCACTCTTTCTGCTGCCGAGTTGTTTTTGGAAGCTTATAGAAG
GAAATCCTCAGATGACACATGGGACCCTCCTCCCTCTGAAATTCGCCTTCTCCAACAGGATCATGCGTACGACCCTTGGAGGGTTCTACTCATATGTTTGCTTCT
TAACCGGACAAGTGGGCGGCAGGCAAAAGATGTGATACCTAAACTCTTCACTTTGTGTCCCAATCCAAAGGCTGCTTTGGAGGTATCACATGAGCAGATAGAAGA
TATTATTCGACCTCTTGGTCTACAAAGAAAAAGATCACGAACAATTCAGCGTTTGTCTGAGATGTATTTAAAAGAAAGTTGGAGTCATGTCACTCAGCTTCCTGG
TGTTGGCAAGTATGGAGCTGATGCACATGCAATATTTTGCACTGGGTATTGGGACCAAGTATTACCTAAAGATCACATGCTTAATTATTACTGGGAGTTTCTCCA
GAGCATAAAGGAACACCTCTGATCTTATCCTGACTGTAGATGGTTTTGTACGAGAGCACACAAGTTATAAATTTCAGGGCCTACTTGTCCCCATATCTATATATA
TTTTTGAGTGTTAATATTTTAGAAAACCAAAGTTTTGTGCTCTGGGTTTTGTTTTTTGTTGTTAGAAGGAAGGGAGGGGGATGGGAATCTTGTCTGATTATGGAT
GATTTTGTTATGGTGACCAGGTCTTGTTTTACTTGTTCACTTGTTGGAAGTTGTTGGTAAATGGCACCAAGCAATTATCTGTTTCAAGCTAATCAACTTTTGAAA
ACAAATAATGTAATAATTATTCATGATTTCATGTAGAATTTGAC
Protein sequenceShow/hide protein sequence
MAMSSSSSSSNNPNLCSYPDDLFSQFAFRGNGCSRFRSTPAKSDQRKPTVPAEDFTQTSEPNHQASGHEIPILTLEDLQDVHQSSKPNHEIPILTLEDLHNAKPF
RQTIQNPRSPRTILNLYRKFGFDIKLVQKTSPLVRHSEAVQHGVRKGEPIVSRYFQSSEDEDLDSNVTNRSNKRLMVGDYSGRGRNDVAPTSGTTKANQHSVGTD
LPSFQNSGKNQEGNSFKNSKPNQEGERVVSRFFQKSAKQQAVNNQEGAEESNQCARCVKRKRKRAKERKRTSKRNSSRPRTTLSAAELFLEAYRRKSSDDTWDPP
PSEIRLLQQDHAYDPWRVLLICLLLNRTSGRQAKDVIPKLFTLCPNPKAALEVSHEQIEDIIRPLGLQRKRSRTIQRLSEMYLKESWSHVTQLPGVGKYGADAHA
IFCTGYWDQVLPKDHMLNYYWEFLQSIKEHL