CuGenDBv2

PI0016555 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0016555
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: SWR1-complex protein 4
Genome location: chr01:25486890..25494020
RNA-Seq Expression: PI0016555
Synteny: PI0016555
Gene Ontology terms:
GO:0000122 - negative regulation of transcription by RNA polymerase II (biological process)
GO:0006281 - DNA repair (biological process)
GO:0043486 - histone exchange (biological process)
GO:0043967 - histone H4 acetylation (biological process)
GO:0043968 - histone H2A acetylation (biological process)
GO:0000812 - Swr1 complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0003714 - transcription corepressor activity (molecular function)
InterPro domains:
IPR001005 - SANT/Myb domain
IPR008468 - DNA methyltransferase 1-associated 1
IPR009057 - Homeobox-like domain superfamily
IPR027109 - SWR1-complex protein 4/DNA methyltransferase 1-associated protein 1
IPR032563 - DAMP1, SANT/Myb-like domain
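
The genome location in the record above is written as chromosome, then start and end separated by two dots. The following is a minimal sketch of how such a string could be split and the span length computed. The helper names parse_location and span_length are hypothetical and not part of CuGenDB, and the coordinates are assumed (not stated on the page) to be 1-based and inclusive.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("chr01:25486890..25494020")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene region spans 7131 bp, which is longer than the CDS listed further down, as expected for a gene with introns.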


Homology
GenBank top hits (e value, % identity, alignment)
XP_008451441.1 PREDICTED: SWR1-complex protein 4 [Cucumis melo] | e value: 2.0e-228 | % identity: 95.49
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAP+MPAIDTSELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTK +            KITESRKAERVAEESELPVTSNAVPEVTERV VPGD VPS+SNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
        SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL

XP_011659406.1 SWR1-complex protein 4 [Cucumis sativus] | e value: 2.8e-227 | % identity: 95.03
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAID SELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYR SRAIVAARGSISRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTK +            KITE+RKAERVAEESELPVTSNAVPEVTERV VPGD VPS+SNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
        SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL

XP_023549249.1 SWR1-complex protein 4-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 1.2e-214 | % identity: 90.54
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK LKD SWTKEETDQLFDLCERFDLRF+VIADRFPS RTVEELKERYY AS+AI+ ARG  SRE SGNTPAKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKH------------EKITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTK             +KI ESRKAERVAEES+L VTSNAVPEVTER  VPG++VPSVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKH------------EKITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY+EAPGTPKDR FIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQS-KRPRKQKGSDL
        S+SFGGERF KRDQKRKATGRLSEAPSSPAQS KRPRKQKGSDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQS-KRPRKQKGSDL

XP_038898526.1 SWR1-complex protein 4 isoform X1 [Benincasa hispida] | e value: 4.5e-225 | % identity: 94.36
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRA KDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKD SWTKEETDQLFDLCERFDLRFIVI+DRFPSARTVEELKERYYRASRAIVAARG  SRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTK +            KITESR+AERVAEESELPVTSNAVPEVTERV VP DTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
        S+SFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL

XP_038898527.1 SWR1-complex protein 4 isoform X2 [Benincasa hispida] | e value: 4.2e-215 | % identity: 91.2
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRA KDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKD SWTKEETDQLFDLCERFDLRFIVI+DRFPSARTVEELKERYYRASRAIVAARG  SRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTK +            KITESR+AERVAEESELPVTSNAVPEVTERV VP DTVPSVSNVQPPPPAAVPST              
Subjt:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
        S+SFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K9K0 SANT domain-containing protein | e value: 1.4e-227 | % identity: 95.03
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAID SELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYR SRAIVAARGSISRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTK +            KITE+RKAERVAEESELPVTSNAVPEVTERV VPGD VPS+SNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
        SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL

A0A1S3BSL6 SWR1-complex protein 4 | e value: 9.5e-229 | % identity: 95.49
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAP+MPAIDTSELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTK +            KITESRKAERVAEESELPVTSNAVPEVTERV VPGD VPS+SNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
        SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL

A0A6J1DID3 SWR1-complex protein 4 | e value: 2.9e-209 | % identity: 88.04
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPK T P+PQ+KKPRAQKDAQRKRDGISREVYALTGGL PIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSV+VVKYTDEEYEKHL D  WTKEETDQLF LCERFDLRFIVIADRF S RTVEELKERYYRASRAI+ AR  +SRE SGNT  KDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
         EIERKRALSMVLSQTK +            KITESR+AERVAEESELPV SN VPEV ER  VPGD+VPS+SNVQP PPAA PST+ ADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPY +APGTPKDR+FI D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
        SV+FGGERF KRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL

A0A6J1GQH2 SWR1-complex protein 4-like isoform X1 | e value: 1.9e-213 | % identity: 89.86
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK LKD SWTKEETDQLFDLCERFDLRF+VIADRFPS RTVEELKERYY AS+AI+ ARG  SRE  GNTPAKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKH------------EKITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEI+RKRALSMVLSQTK             +KI ESRKAERVAEES+L VTSNAVPEVTER  VPG++VPSVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKH------------EKITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY+EAPGTPK+R FIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQS-KRPRKQKGSDL
        S+SFGGERF KRDQKRKATGRLSEAPSSPAQS KRPRKQKGSDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQS-KRPRKQKGSDL

A0A6J1JX24 SWR1-complex protein 4-like isoform X3 | e value: 3.6e-212 | % identity: 89.41
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK LKD SWTKEETDQLFDLCERFDLRF+VIADRFPS RTVEELKERYY AS+AI+ ARG  SRE SGNTPAKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKH------------EKITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEI+RKRALSMVLSQTK             +KI ESRKAERVAEES+L VTSN VPEVTER  VPG++V SVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKH------------EKITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY+EAPGTPKDR FIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQS-KRPRKQKGSDL
        S++FGGERF KRDQKRKATGRLSEAPSSPAQS KRPRKQKGSDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQS-KRPRKQKGSDL

SwissProt top hits (e value, % identity, alignment)
O14308 SWR1-complex protein 4 | e value: 1.4e-30 | % identity: 28.71
Query:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTG-GLAPIMPAIDTSELKKRPPSDEKI-TWQWLPFTNSARKDNLQLYHWVRVVNGIPPT
        D +D+  LP    P     K +++   +R+ +GISRE+Y+L G   AP+  AI   + K++P    K   W   PF+ S+RKD+  L+HWV + + +   
Subjt:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTG-GLAPIMPAIDTSELKKRPPSDEKI-TWQWLPFTNSARKDNLQLYHWVRVVNGIPPT

Query:  GDYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSA-----RTVEELKERYYRASRAIVAARGSISRESSGNTPAK
          Y F K+N  + ++ YTDEEY+ +LKD  W K+ETD LF LC+ +DLRF VIADR+ +      RT+E+LK+R+Y  SR I+ AR  I+  ++  +   
Subjt:  GDYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSA-----RTVEELKERYYRASRAIVAARGSISRESSGNTPAK

Query:  D--PYNVSQEIERKRALSMVLSQTKHE-----------KITESRKAERVAEESEL--------------------------------PVTSNAVPEVTER
        +   YN  QE+ RK+ L  + S+T  E           K  E+ +A+ +++  E+                                  T N V E    
Subjt:  D--PYNVSQEIERKRALSMVLSQTKHE-----------KITESRKAERVAEESEL--------------------------------PVTSNAVPEVTER

Query:  VAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRML---PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRK
         + P   V SV N    P A     +      T     +     ++  T+   Q + A  +S      +RV   + +L V+ +  +PT     + +EL+ 
Subjt:  VAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRML---PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRK

Query:  EILTLLNLQKQLQNKEAE
         I++LL L++++     E
Subjt:  EILTLLNLQKQLQNKEAE

Q7K3D8 DNA methyltransferase 1-associated protein 1 | e value: 1.4e-32 | % identity: 30.19
Query:  DAKDILGLPKNTLP--------LPQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMPAIDTS--------ELKKRPPSDEKITWQWLPFTNSARK
        D +DIL + +   P          +++     K A R+ +G+ REV+AL         P++P  DT+        E K R    +   W+W PF+N AR 
Subjt:  DAKDILGLPKNTLP--------LPQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMPAIDTS--------ELKKRPPSDEKITWQWLPFTNSARK

Query:  DNLQLYHWVRVVNGIPPTGDYSFAKYNKSVEVVKYTDEEYEKHLKD--ASWTKEETDQLFDLCERFDLRFIVIADRF----PSARTVEELKERYYRASRA
        D+   +HW RV +    + DY FAK+NK +EV  YT  EY  HL++   +W+K +TD LFDL  RFDLRFIV+ADR+       +TVEELKERYY     
Subjt:  DNLQLYHWVRVVNGIPPTGDYSFAKYNKSVEVVKYTDEEYEKHLKD--ASWTKEETDQLFDLCERFDLRFIVIADRF----PSARTVEELKERYYRASRA

Query:  IVAARGSISRESSGNTPAKDPYNVSQEIERKRALSMVLSQTKHE-----------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNV-
        +  A+   S +          Y+V  E  RK  L  +  +T  +           K  E+RK ER  +  +L    +   +  E  +    T      + 
Subjt:  IVAARGSISRESSGNTPAKDPYNVSQEIERKRALSMVLSQTKHE-----------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNV-

Query:  -----QPPPPAAVPSTV----VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNL
             Q P P+ V S V    +  +    A LR   V LR+  ++       ++ G R +K +EQ +Q+  V+  P  PT+ +C    ELR +++ L  L
Subjt:  -----QPPPPAAVPSTV----VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNL

Query:  QKQLQNKEAEGSSFRDSPYTEAPG
        +  L     E  S +       PG
Subjt:  QKQLQNKEAEGSSFRDSPYTEAPG

Q8VZL6 SWR1-complex protein 4 | e value: 1.5e-146 | % identity: 65.01
Query:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGD
        DAKDILGLPK  L L QEKK R QK++ RK DGISREVYALTGG+AP+MP+ID   LK+RPP+DEK+ W+WL FTNSARKD+LQLYHWVRVVN +PPTGD
Subjt:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGD

Query:  YSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVSQ
        YSFAKYNKSV+++KYTDEEYE HL D+ WTKEETDQLF+ C+ FDLRF+VIADRFP +RTVEELK+RYY  +RA++ AR     + + +   K+PY++++
Subjt:  YSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVSQ

Query:  EIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTV-VADNASTLASLRM
        + ERKRALSMVLSQ++H+            +ITE R A R AEE ++    NA  +  + V VPG +V   SN Q P  A  PST+ +AD ASTLASLRM
Subjt:  EIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTV-VADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        L VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL VNLKP+VPTK VC EHLELRKEILTLLNLQKQLQ KE+EGSS R+  Y   P TPKDR F  D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
          SFG ER  K++QKRK  GR ++ P SPA  KRPRK K SDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL

Q9JI44 DNA methyltransferase 1-associated protein 1 | e value: 5.5e-32 | % identity: 31.87
Query:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPP
        + K  +  P +KK +   +    ++ +G+ REVYAL         P++P+ DT +    +K +  S +   W+W+PFTN ARKD    +HW R       
Subjt:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPP

Query:  TGDYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFP----SARTVEELKERYYRASRAIVAARGSISRESSGNTPAK
          DY FA++NK+V+V  Y+++EY+ +L D +WTK ETD LFDL  RFDLRF+VI DR+       R+VE+LKERYY      + A+ +  R   G     
Subjt:  TGDYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFP----SARTVEELKERYYRASRAIVAARGSISRESSGNTPAK

Query:  DPYNVSQEIERKRALSMVLSQTKHE-----------KITESRKAERVAEESEL-----PVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVAD
          ++   E  RK  L  + ++T  +           +  E+RK ER     +L        + A    TER A P   +P     + P   AVP T    
Subjt:  DPYNVSQEIERKRALSMVLSQTKHE-----------KITESRKAERVAEESEL-----PVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVAD

Query:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE
                +   V LR+  ++       SS G + IK +EQ L +L V L P  PT+ +     ELR +++ L  L++   N E E
Subjt:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE

Q9NPF5 DNA methyltransferase 1-associated protein 1 | e value: 5.5e-32 | % identity: 31.87
Query:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPP
        + K  +  P +KK +   +    ++ +G+ REVYAL         P++P+ DT +    +K +  S +   W+W+PFTN ARKD    +HW R       
Subjt:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPP

Query:  TGDYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFP----SARTVEELKERYYRASRAIVAARGSISRESSGNTPAK
          DY FA++NK+V+V  Y+++EY+ +L D +WTK ETD LFDL  RFDLRF+VI DR+       R+VE+LKERYY      + A+ +  R   G     
Subjt:  TGDYSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFP----SARTVEELKERYYRASRAIVAARGSISRESSGNTPAK

Query:  DPYNVSQEIERKRALSMVLSQTKHE-----------KITESRKAERVAEESEL-----PVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVAD
          ++   E  RK  L  + ++T  +           +  E+RK ER     +L        + A    TER A P   +P     + P   AVP T    
Subjt:  DPYNVSQEIERKRALSMVLSQTKHE-----------KITESRKAERVAEESEL-----PVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVAD

Query:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE
                +   V LR+  ++       SS G + IK +EQ L +L V L P  PT+ +     ELR +++ L  L++   N E E
Subjt:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE

Arabidopsis top hits (e value, % identity, alignment)
AT2G47210.1 myb-like transcription factor family protein | e value: 1.1e-147 | % identity: 65.01
Query:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGD
        DAKDILGLPK  L L QEKK R QK++ RK DGISREVYALTGG+AP+MP+ID   LK+RPP+DEK+ W+WL FTNSARKD+LQLYHWVRVVN +PPTGD
Subjt:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGD

Query:  YSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVSQ
        YSFAKYNKSV+++KYTDEEYE HL D+ WTKEETDQLF+ C+ FDLRF+VIADRFP +RTVEELK+RYY  +RA++ AR     + + +   K+PY++++
Subjt:  YSFAKYNKSVEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVSQ

Query:  EIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTV-VADNASTLASLRM
        + ERKRALSMVLSQ++H+            +ITE R A R AEE ++    NA  +  + V VPG +V   SN Q P  A  PST+ +AD ASTLASLRM
Subjt:  EIERKRALSMVLSQTKHE------------KITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTV-VADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        L VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL VNLKP+VPTK VC EHLELRKEILTLLNLQKQLQ KE+EGSS R+  Y   P TPKDR F  D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
          SFG ER  K++QKRK  GR ++ P SPA  KRPRK K SDL
Subjt:  SVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL


Sequences
CDS sequence
ATGGATGCCAAGGATATTTTGGGCTTGCCCAAAAATACGCTGCCTTTACCCCAAGAGAAGAAGCCTAGGGCTCAGAAAGATGCCCAGAGAAAGCGAGATGGGATTTCTCG
GGAGGTTTATGCTCTTACTGGTGGTCTGGCGCCTATTATGCCCGCAATCGATACGTCTGAGCTGAAGAAGCGACCTCCATCAGATGAGAAGATTACGTGGCAGTGGCTTC
CCTTTACAAATTCTGCTAGAAAGGATAATTTGCAGCTTTACCATTGGGTTAGAGTTGTAAATGGCATTCCACCAACAGGTGACTATTCTTTTGCAAAGTACAACAAGTCT
GTTGAAGTTGTCAAATACACGGATGAGGAGTACGAGAAGCATTTGAAAGATGCTTCATGGACAAAGGAGGAGACAGATCAATTATTTGACTTGTGTGAACGGTTTGATCT
TCGCTTCATTGTGATAGCTGACAGGTTTCCATCAGCAAGGACAGTGGAGGAACTGAAGGAGCGGTATTATCGTGCATCAAGAGCAATTGTGGCTGCTAGAGGATCAATAT
CTCGGGAGAGTTCAGGAAATACTCCCGCCAAGGATCCTTACAATGTCTCACAAGAGATTGAGCGCAAACGAGCATTGTCCATGGTTCTCTCCCAAACAAAACACGAAAAA
ATAACTGAATCACGCAAAGCTGAAAGAGTGGCTGAAGAATCTGAGTTGCCTGTCACTTCAAATGCTGTCCCAGAAGTTACTGAAAGGGTTGCCGTTCCTGGAGATACCGT
ACCATCTGTATCCAATGTGCAGCCCCCTCCTCCAGCAGCTGTACCTTCAACTGTAGTGGCAGATAATGCTTCTACTCTTGCTTCTCTTCGCATGCTTCCTGTATACTTGA
GAACATATGCACTCGAGCAAATGGTACAAGCTGCAAGCTCATCTGCTGGCCTTCGGACCATCAAGCGAGTTGAACAGACATTACAAGATCTTTCGGTTAATTTAAAACCC
AGGGTTCCAACAAAAGCTGTCTGTGCAGAGCATCTTGAATTAAGAAAAGAAATATTGACTCTACTTAATCTTCAAAAGCAGTTGCAAAATAAGGAGGCAGAAGGTTCTTC
TTTCCGTGACAGTCCATACACTGAGGCACCTGGCACACCTAAGGATCGCACTTTTATTGCTGATTCTGTGAGTTTTGGAGGGGAAAGGTTTGGTAAACGGGATCAAAAGC
GCAAGGCCACTGGAAGATTATCTGAAGCTCCATCATCACCAGCTCAATCTAAAAGGCCAAGAAAACAGAAGGGATCCGATCTGTGA
mRNA sequence
ATGGATGCCAAGGATATTTTGGGCTTGCCCAAAAATACGCTGCCTTTACCCCAAGAGAAGAAGCCTAGGGCTCAGAAAGATGCCCAGAGAAAGCGAGATGGGATTTCTCG
GGAGGTTTATGCTCTTACTGGTGGTCTGGCGCCTATTATGCCCGCAATCGATACGTCTGAGCTGAAGAAGCGACCTCCATCAGATGAGAAGATTACGTGGCAGTGGCTTC
CCTTTACAAATTCTGCTAGAAAGGATAATTTGCAGCTTTACCATTGGGTTAGAGTTGTAAATGGCATTCCACCAACAGGTGACTATTCTTTTGCAAAGTACAACAAGTCT
GTTGAAGTTGTCAAATACACGGATGAGGAGTACGAGAAGCATTTGAAAGATGCTTCATGGACAAAGGAGGAGACAGATCAATTATTTGACTTGTGTGAACGGTTTGATCT
TCGCTTCATTGTGATAGCTGACAGGTTTCCATCAGCAAGGACAGTGGAGGAACTGAAGGAGCGGTATTATCGTGCATCAAGAGCAATTGTGGCTGCTAGAGGATCAATAT
CTCGGGAGAGTTCAGGAAATACTCCCGCCAAGGATCCTTACAATGTCTCACAAGAGATTGAGCGCAAACGAGCATTGTCCATGGTTCTCTCCCAAACAAAACACGAAAAA
ATAACTGAATCACGCAAAGCTGAAAGAGTGGCTGAAGAATCTGAGTTGCCTGTCACTTCAAATGCTGTCCCAGAAGTTACTGAAAGGGTTGCCGTTCCTGGAGATACCGT
ACCATCTGTATCCAATGTGCAGCCCCCTCCTCCAGCAGCTGTACCTTCAACTGTAGTGGCAGATAATGCTTCTACTCTTGCTTCTCTTCGCATGCTTCCTGTATACTTGA
GAACATATGCACTCGAGCAAATGGTACAAGCTGCAAGCTCATCTGCTGGCCTTCGGACCATCAAGCGAGTTGAACAGACATTACAAGATCTTTCGGTTAATTTAAAACCC
AGGGTTCCAACAAAAGCTGTCTGTGCAGAGCATCTTGAATTAAGAAAAGAAATATTGACTCTACTTAATCTTCAAAAGCAGTTGCAAAATAAGGAGGCAGAAGGTTCTTC
TTTCCGTGACAGTCCATACACTGAGGCACCTGGCACACCTAAGGATCGCACTTTTATTGCTGATTCTGTGAGTTTTGGAGGGGAAAGGTTTGGTAAACGGGATCAAAAGC
GCAAGGCCACTGGAAGATTATCTGAAGCTCCATCATCACCAGCTCAATCTAAAAGGCCAAGAAAACAGAAGGGATCCGATCTGTGA
Protein sequence
MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGDYSFAKYNKS
VEVVKYTDEEYEKHLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVSQEIERKRALSMVLSQTKHEK
ITESRKAERVAEESELPVTSNAVPEVTERVAVPGDTVPSVSNVQPPPPAAVPSTVVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKP
RVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIADSVSFGGERFGKRDQKRKATGRLSEAPSSPAQSKRPRKQKGSDL
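
The protein sequence above appears to be the CDS read codon by codon under the standard genetic code, ending at the final TGA stop codon. The following is a minimal sketch of that translation for checking the two sequences against each other. The function name translate is hypothetical, the codon table is the standard nuclear code (an assumption; the page does not state which table was used), and unknown codons are reported as X.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGATGCCAAGGATATTTTGGGC"))
```

For the first eight codons this prints MDAKDILG, matching the start of the protein sequence. Pasting the full CDS into translate and comparing the result with the listed protein is a quick consistency check.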