CuGenDBv2

Chy1G017770 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy1G017770
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: SWR1-complex protein 4
Genome location: chrH01:23397278..23404768
RNA-Seq Expression: Chy1G017770
Synteny: Chy1G017770
Gene Ontology terms:
GO:0000122 - negative regulation of transcription by RNA polymerase II (biological process)
GO:0006281 - DNA repair (biological process)
GO:0043486 - histone exchange (biological process)
GO:0043967 - histone H4 acetylation (biological process)
GO:0043968 - histone H2A acetylation (biological process)
GO:0000812 - Swr1 complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0003714 - transcription corepressor activity (molecular function)
InterPro domains:
IPR001005 - SANT/Myb domain
IPR008468 - DNA methyltransferase 1-associated 1
IPR009057 - Homeobox-like domain superfamily
IPR027109 - SWR1-complex protein 4/DNA methyltransferase 1-associated protein 1
IPR032563 - DAMP1, SANT/Myb-like domain
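
The genome location field above has the form chromosome:start..end. As a minimal sketch, it could be parsed and its span computed as follows. The `Region` type and `parse_location` function are hypothetical names introduced for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'chrom:start..end' string into a Region."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.groups()
    return Region(chrom, int(start), int(end))


region = parse_location("chrH01:23397278..23404768")
print(region.chrom, region.start, region.end, region.length)
# prints: chrH01 23397278 23404768 7491
```

Under that assumption the gene spans 7491 bp on chrH01.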


Homology
GenBank top hits (e value, % identity, alignment)
KAA0046714.1 SWR1-complex protein 4 [Cucumis melo var. makuwa] | e value: 0.0 | % identity: 96.95
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAP+MPAID SELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+RKAERVAEESELPVTSNAVPEVTERVVVPGDN+PSISNVQPPPPAAVPST+VADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SGKDLVNGIKNARPLEDYLKLHHHQLNLKGQENRRDPICDPPGGWRLLLEDTEEYNLAFGASLSELNFDAGVVKRRDFLIRGVISICGIEL
        SGK LVNGIKNARP EDYLKLHHHQLNLKGQENRRDPICDPPGGWRLLLE +EEYNLAF ASLSELNFD GVVKRRD LI GVISICGIEL
Subjt:  SGKDLVNGIKNARPLEDYLKLHHHQLNLKGQENRRDPICDPPGGWRLLLEDTEEYNLAFGASLSELNFDAGVVKRRDFLIRGVISICGIEL

XP_008451441.1 PREDICTED: SWR1-complex protein 4 [Cucumis melo] | e value: 5.57e-273 | % identity: 98.5
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAP+MPAID SELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+RKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPST+VADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  S
        S
Subjt:  S

XP_011659406.1 SWR1-complex protein 4 [Cucumis sativus] | e value: 2.04e-275 | % identity: 99.5
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYR SRAIVAARGSISRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPST+VADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  S
        S
Subjt:  S

XP_022991705.1 SWR1-complex protein 4-like isoform X1 [Cucurbita maxima] | e value: 1.83e-279 | % identity: 86.2
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAID SELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKD SWTKEETDQLFDLCERFDLRF+VIADRFPS RTVEELKERYY AS+AI+ ARG  SRE SGNTPAKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM
        QEI+RKRALSMVLSQTKQ+ERKDAEVLAEAKKI E+RKAERVAEES+L VTSN VPEVTER VVPG++V S+SNVQPPPPAAVPST+VADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY+EAPGTPKDR FIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  S----------------------------GKDLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP
        S                            GK L+NG KN RPLEDYLKLHHHQLNL KGQENRRDPICDPP
Subjt:  S----------------------------GKDLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP

XP_038898526.1 SWR1-complex protein 4 isoform X1 [Benincasa hispida] | e value: 1.18e-267 | % identity: 96.51
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRA KDAQRKRDGISREVYALTGGLAPIMPAID SELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEKYLKD SWTKEETDQLFDLCERFDLRFIVI+DRFPSARTVEELKERYYRASRAIVAARG  SRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+R+AERVAEESELPVTSNAVPEVTERVVVP D VPS+SNVQPPPPAAVPST+VADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  S
        S
Subjt:  S

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K9K0 SANT domain-containing protein | e value: 7.6e-217 | % identity: 99.5
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYR SRAIVAARGSISRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPST+VADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  S
        S
Subjt:  S

A0A1S3BSL6 SWR1-complex protein 4 | e value: 5.5e-215 | % identity: 98.5
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAP+MPAID SELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+RKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPST+VADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  S
        S
Subjt:  S

A0A5A7TUG7 SWR1-complex protein 4 | e value: 2.6e-265 | % identity: 93.58
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAP+MPAID SELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+RKAERVAEESELPVTSNAVPEVTERVVVPGDN+PSISNVQPPPPAAVPST+VADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  SGKDLVNGIKNARPLEDYLKLHHHQLNLKGQENRRDPICDPPGGWRLLLEDTEEYNLAFGASLSELNFDAGVVKRRDFLIRGVISICGIELIAACLLHFI
        SGK LVNGIKNARP EDYLKLHHHQLNLKGQENRRDPICDPPGGWRLLLE +EEYNLAF ASLSELNFD GVVKRRD LI GVISICGIEL  +  ++FI
Subjt:  SGKDLVNGIKNARPLEDYLKLHHHQLNLKGQENRRDPICDPPGGWRLLLEDTEEYNLAFGASLSELNFDAGVVKRRDFLIRGVISICGIELIAACLLHFI

Query:  FITKPKLWFASSVS
           K  ++    VS
Subjt:  FITKPKLWFASSVS

A0A6J1JRI6 SWR1-complex protein 4-like isoform X2 | e value: 7.2e-207 | % identity: 82.59
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAID SELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNK                   SWTKEETDQLFDLCERFDLRF+VIADRFPS RTVEELKERYY AS+AI+ ARG  SRE SGNTPAKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM
        QEI+RKRALSMVLSQTKQ+ERKDAEVLAEAKKI E+RKAERVAEES+L VTSN VPEVTER VVPG++V S+SNVQPPPPAAVPST+VADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY+EAPGTPKDR FIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  S----------------------------GKDLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP
        S                            GK L+NG KN RPLEDYLKLHHHQLNL KGQENRRDPICDPP
Subjt:  S----------------------------GKDLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP

A0A6J1JTQ5 SWR1-complex protein 4-like isoform X1 | e value: 5.6e-220 | % identity: 86.2
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAID SELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKD SWTKEETDQLFDLCERFDLRF+VIADRFPS RTVEELKERYY AS+AI+ ARG  SRE SGNTPAKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM
        QEI+RKRALSMVLSQTKQ+ERKDAEVLAEAKKI E+RKAERVAEES+L VTSN VPEVTER VVPG++V S+SNVQPPPPAAVPST+VADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY+EAPGTPKDR FIAD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Query:  S----------------------------GKDLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP
        S                            GK L+NG KN RPLEDYLKLHHHQLNL KGQENRRDPICDPP
Subjt:  S----------------------------GKDLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP

SwissProt top hits (e value, % identity, alignment)
O14308 SWR1-complex protein 4 | e value: 7.8e-33 | % identity: 29.36
Query:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTG-GLAPIMPAIDVSELKKRPPSDEKI-TWQWLPFSNSARKDNLQLYHWVRVVNGIPPT
        D +D+  LP    P     K +++   +R+ +GISRE+Y+L G   AP+  AI   + K++P    K   W   PFS S+RKD+  L+HWV + + +   
Subjt:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTG-GLAPIMPAIDVSELKKRPPSDEKI-TWQWLPFSNSARKDNLQLYHWVRVVNGIPPT

Query:  GDYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSA-----RTVEELKERYYRASRAIVAARGSISRESSGNTPAK
          Y F K+N  + ++ YTDEEY+ YLKD  W K+ETD LF LC+ +DLRF VIADR+ +      RT+E+LK+R+Y  SR I+ AR  I+  ++  +   
Subjt:  GDYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSA-----RTVEELKERYYRASRAIVAARGSISRESSGNTPAK

Query:  D--PYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESEL--------------------------------PVTSNAVPEVTE
        +   YN  QE+ RK+ L  + S+T ++  ++  +  E K+I E  +A+ +++  E+                                  T N V E   
Subjt:  D--PYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESEL--------------------------------PVTSNAVPEVTE

Query:  RVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRML---PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELR
            P   V S+ N    P A     I      T     +     ++  T+   Q + A  +S      +RV   + +L V+ +  +PT     + +EL+
Subjt:  RVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRML---PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELR

Query:  KEILTLLNLQKQLQNKEAE
          I++LL L++++     E
Subjt:  KEILTLLNLQKQLQNKEAE

Q7K3D8 DNA methyltransferase 1-associated protein 1 | e value: 7.5e-36 | % identity: 31.07
Query:  DAKDILGLPKNTLP--------LPQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMP-------AIDVSELKKRPPSDEKITWQWLPFSNSARKD
        D +DIL + +   P          +++     K A R+ +G+ REV+AL         P++P            E K R    +   W+W PFSN AR D
Subjt:  DAKDILGLPKNTLP--------LPQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMP-------AIDVSELKKRPPSDEKITWQWLPFSNSARKD

Query:  NLQLYHWVRVVNGIPPTGDYSFAKYNKSVEVVKYTDEEYEKYLKD--ASWTKEETDQLFDLCERFDLRFIVIADRF----PSARTVEELKERYYRASRAI
        +   +HW RV +    + DY FAK+NK +EV  YT  EY  +L++   +W+K +TD LFDL  RFDLRFIV+ADR+       +TVEELKERYY     +
Subjt:  NLQLYHWVRVVNGIPPTGDYSFAKYNKSVEVVKYTDEEYEKYLKD--ASWTKEETDQLFDLCERFDLRFIVIADRF----PSARTVEELKERYYRASRAI

Query:  VAARGSISRESSGNTPAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNV-
          A+   S +          Y+V  E  RK  L  +  +T QQ  ++  ++ E KKI EARK ER  +  +L    +   +  E       N PS     
Subjt:  VAARGSISRESSGNTPAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNV-

Query:  ---------QPPPPAAVPSTI----VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILT
                 Q P P+ V S +    +  +    A LR   V LR+  ++       ++ G R +K +EQ +Q+  V+  P  PT+ +C    ELR +++ 
Subjt:  ---------QPPPPAAVPSTI----VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILT

Query:  LLNLQKQLQNKEAEGSSFRDSPYTEAPG
        L  L+  L     E  S +       PG
Subjt:  LLNLQKQLQNKEAEGSSFRDSPYTEAPG

Q8VZL6 SWR1-complex protein 4 | e value: 5.4e-143 | % identity: 67.75
Query:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD
        DAKDILGLPK  L L QEKK R QK++ RK DGISREVYALTGG+AP+MP+ID   LK+RPP+DEK+ W+WL F+NSARKD+LQLYHWVRVVN +PPTGD
Subjt:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD

Query:  YSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVSQ
        YSFAKYNKSV+++KYTDEEYE +L D+ WTKEETDQLF+ C+ FDLRF+VIADRFP +RTVEELK+RYY  +RA++ AR     + + +   K+PY++++
Subjt:  YSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVSQ

Query:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTI-VADNASTLASLRM
        + ERKRALSMVLSQ++ QE+KDAE+LAEAK+ITE R A R AEE ++    NA  +  +  VVPG +V   SN Q P  A  PST+ +AD ASTLASLRM
Subjt:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTI-VADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        L VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL VNLKP+VPTK VC EHLELRKEILTLLNLQKQLQ KE+EGSS R+  Y   P TPKDR F  D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD

Q9JI44 DNA methyltransferase 1-associated protein 1 | e value: 3.1e-34 | % identity: 32.12
Query:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDVSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP
        + K  +  P +KK +   +    ++ +G+ REVYAL         P++P+ D  +    +K +  S +   W+W+PF+N ARKD    +HW R       
Subjt:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDVSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP

Query:  TGDYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFP----SARTVEELKERYYRASRAIVAARGSISRESSGNTPAK
          DY FA++NK+V+V  Y+++EY+ YL D +WTK ETD LFDL  RFDLRF+VI DR+       R+VE+LKERYY      + A+ +  R   G     
Subjt:  TGDYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFP----SARTVEELKERYYRASRAIVAARGSISRESSGNTPAK

Query:  DPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTER----VVVPGDNVPSISNVQPPPPAAVPSTIVAD
          ++   E  RK  L  + ++T +Q  ++  +L E +KI EARK ER     +L     A     E+       P   +P     + P   AVP T    
Subjt:  DPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTER----VVVPGDNVPSISNVQPPPPAAVPSTIVAD

Query:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE
             A ++          L        SS G + IK +EQ L +L V L P  PT+ +     ELR +++ L  L++   N E E
Subjt:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE

Q9NPF5 DNA methyltransferase 1-associated protein 1 | e value: 3.1e-34 | % identity: 32.12
Query:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDVSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP
        + K  +  P +KK +   +    ++ +G+ REVYAL         P++P+ D  +    +K +  S +   W+W+PF+N ARKD    +HW R       
Subjt:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDVSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP

Query:  TGDYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFP----SARTVEELKERYYRASRAIVAARGSISRESSGNTPAK
          DY FA++NK+V+V  Y+++EY+ YL D +WTK ETD LFDL  RFDLRF+VI DR+       R+VE+LKERYY      + A+ +  R   G     
Subjt:  TGDYSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFP----SARTVEELKERYYRASRAIVAARGSISRESSGNTPAK

Query:  DPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTER----VVVPGDNVPSISNVQPPPPAAVPSTIVAD
          ++   E  RK  L  + ++T +Q  ++  +L E +KI EARK ER     +L     A     E+       P   +P     + P   AVP T    
Subjt:  DPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTER----VVVPGDNVPSISNVQPPPPAAVPSTIVAD

Query:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE
             A ++          L        SS G + IK +EQ L +L V L P  PT+ +     ELR +++ L  L++   N E E
Subjt:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE

Arabidopsis top hits (e value, % identity, alignment)
AT2G47210.1 myb-like transcription factor family protein | e value: 3.8e-144 | % identity: 67.75
Query:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD
        DAKDILGLPK  L L QEKK R QK++ RK DGISREVYALTGG+AP+MP+ID   LK+RPP+DEK+ W+WL F+NSARKD+LQLYHWVRVVN +PPTGD
Subjt:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD

Query:  YSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVSQ
        YSFAKYNKSV+++KYTDEEYE +L D+ WTKEETDQLF+ C+ FDLRF+VIADRFP +RTVEELK+RYY  +RA++ AR     + + +   K+PY++++
Subjt:  YSFAKYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVSQ

Query:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTI-VADNASTLASLRM
        + ERKRALSMVLSQ++ QE+KDAE+LAEAK+ITE R A R AEE ++    NA  +  +  VVPG +V   SN Q P  A  PST+ +AD ASTLASLRM
Subjt:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTI-VADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
        L VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL VNLKP+VPTK VC EHLELRKEILTLLNLQKQLQ KE+EGSS R+  Y   P TPKDR F  D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIAD
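
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, identities and positives for a single alignment segment could be counted from the midline alone. The names `SegmentStats` and `midline_stats` are hypothetical and not part of any BLAST tool. The example segment is the first 20 columns of the last block of the KAA0046714.1 alignment.

```python
from typing import NamedTuple


class SegmentStats(NamedTuple):
    """Counts for one alignment segment (hypothetical helper type)."""
    length: int
    identical: int
    positives: int
    percent_identity: float


def midline_stats(query: str, midline: str) -> SegmentStats:
    """Count identities and positives from a BLAST-style midline.

    Assumes letters mark identities, '+' marks conservative
    substitutions, and spaces mark mismatches or gaps.
    """
    # Trailing spaces are often stripped from the midline; restore them.
    padded = midline.ljust(len(query))
    identical = sum(1 for c in padded if c.isalpha())
    positives = identical + padded.count("+")
    percent = 100.0 * identical / len(query) if query else 0.0
    return SegmentStats(len(query), identical, positives, percent)


stats = midline_stats("SGKDLVNGIKNARPLEDYLK", "SGK LVNGIKNARP EDYLK")
print(stats.identical, stats.length, stats.percent_identity)
# prints: 18 20 90.0
```

The % identity reported for each hit is computed over the full alignment, so a value derived from one segment will generally differ from it.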


Sequences
CDS sequence
ATGGATGCTAAGGATATCTTGGGCTTGCCCAAAAACACGCTGCCTTTACCTCAAGAGAAGAAGCCTAGGGCTCAGAAAGATGCCCAGAGAAAGCGAGATGGGATT
TCCCGGGAGGTTTATGCTCTTACTGGTGGTCTGGCGCCTATTATGCCCGCAATCGATGTGTCTGAGCTGAAGAAGCGACCTCCATCAGATGAGAAGATTACGTGG
CAGTGGCTTCCTTTTTCAAATTCTGCTAGAAAGGATAATCTGCAGCTGTACCATTGGGTTAGAGTTGTAAATGGCATTCCACCAACAGGGGACTATTCTTTTGCA
AAGTACAACAAGTCTGTTGAAGTTGTCAAATACACGGATGAGGAGTACGAGAAGTATTTGAAAGACGCTTCGTGGACAAAGGAGGAGACAGATCAATTATTTGAT
TTGTGCGAACGGTTTGATCTTCGCTTCATTGTGATAGCCGACAGGTTTCCGTCAGCAAGGACAGTGGAGGAACTGAAGGAGCGGTATTATCGTGCATCAAGAGCA
ATTGTGGCTGCTAGAGGATCAATATCTCGGGAGAGTTCAGGAAATACTCCCGCCAAGGATCCTTACAACGTCTCACAAGAGATTGAGCGCAAGCGAGCATTGTCC
ATGGTTCTCTCCCAAACAAAACAGCAAGAACGAAAAGATGCAGAGGTTCTTGCTGAAGCGAAAAAAATAACTGAAGCACGCAAAGCTGAAAGAGTGGCTGAAGAA
TCTGAGTTGCCTGTCACTTCAAATGCTGTCCCAGAAGTTACTGAAAGGGTTGTCGTTCCTGGAGATAACGTACCATCTATATCCAATGTGCAGCCCCCTCCTCCA
GCAGCCGTACCTTCAACCATAGTGGCAGATAATGCTTCTACTCTTGCTTCTCTTCGCATGCTTCCTGTATACTTGAGAACGTATGCACTCGAGCAAATGGTACAA
GCTGCAAGCTCATCTGCTGGCCTTCGAACCATCAAGCGAGTTGAACAGACATTACAAGATCTTTCGGTTAATTTAAAACCCAGGGTTCCAACAAAAGCTGTCTGT
GCAGAGCATCTTGAATTAAGAAAAGAAATATTGACTCTACTGAATCTTCAAAAGCAGTTGCAAAATAAGGAGGCAGAAGGTTCTTCTTTCCGTGACAGTCCATAC
ACTGAGGCACCTGGCACACCTAAGGATCGCACTTTTATTGCTGATTCTGGGAAAGATTTGGTAAACGGGATCAAAAACGCAAGGCCACTGGAAGATTATCTGAAG
CTCCATCATCACCAGCTCAATCTAAAAGGCCAAGAAAACAGAAGGGATCCGATCTGTGATCCTCCAGGAGGCTGGCGGCTTTTGCTGGAAGACACTGAAGAATAT
AATTTAGCTTTTGGAGCATCATTATCTGAACTGAACTTTGATGCTGGTGTGGTTAAGAGGCGTGATTTCCTCATCCGTGGAGTCATTAGCATCTGTGGAATCGAG
TTAATAGCTGCCTGTTTACTTCATTTTATCTTCATCACGAAGCCAAAGCTCTGGTTTGCTTCATCTGTTAGCTAG
mRNA sequence
ATGGATGCTAAGGATATCTTGGGCTTGCCCAAAAACACGCTGCCTTTACCTCAAGAGAAGAAGCCTAGGGCTCAGAAAGATGCCCAGAGAAAGCGAGATGGGATT
TCCCGGGAGGTTTATGCTCTTACTGGTGGTCTGGCGCCTATTATGCCCGCAATCGATGTGTCTGAGCTGAAGAAGCGACCTCCATCAGATGAGAAGATTACGTGG
CAGTGGCTTCCTTTTTCAAATTCTGCTAGAAAGGATAATCTGCAGCTGTACCATTGGGTTAGAGTTGTAAATGGCATTCCACCAACAGGGGACTATTCTTTTGCA
AAGTACAACAAGTCTGTTGAAGTTGTCAAATACACGGATGAGGAGTACGAGAAGTATTTGAAAGACGCTTCGTGGACAAAGGAGGAGACAGATCAATTATTTGAT
TTGTGCGAACGGTTTGATCTTCGCTTCATTGTGATAGCCGACAGGTTTCCGTCAGCAAGGACAGTGGAGGAACTGAAGGAGCGGTATTATCGTGCATCAAGAGCA
ATTGTGGCTGCTAGAGGATCAATATCTCGGGAGAGTTCAGGAAATACTCCCGCCAAGGATCCTTACAACGTCTCACAAGAGATTGAGCGCAAGCGAGCATTGTCC
ATGGTTCTCTCCCAAACAAAACAGCAAGAACGAAAAGATGCAGAGGTTCTTGCTGAAGCGAAAAAAATAACTGAAGCACGCAAAGCTGAAAGAGTGGCTGAAGAA
TCTGAGTTGCCTGTCACTTCAAATGCTGTCCCAGAAGTTACTGAAAGGGTTGTCGTTCCTGGAGATAACGTACCATCTATATCCAATGTGCAGCCCCCTCCTCCA
GCAGCCGTACCTTCAACCATAGTGGCAGATAATGCTTCTACTCTTGCTTCTCTTCGCATGCTTCCTGTATACTTGAGAACGTATGCACTCGAGCAAATGGTACAA
GCTGCAAGCTCATCTGCTGGCCTTCGAACCATCAAGCGAGTTGAACAGACATTACAAGATCTTTCGGTTAATTTAAAACCCAGGGTTCCAACAAAAGCTGTCTGT
GCAGAGCATCTTGAATTAAGAAAAGAAATATTGACTCTACTGAATCTTCAAAAGCAGTTGCAAAATAAGGAGGCAGAAGGTTCTTCTTTCCGTGACAGTCCATAC
ACTGAGGCACCTGGCACACCTAAGGATCGCACTTTTATTGCTGATTCTGGGAAAGATTTGGTAAACGGGATCAAAAACGCAAGGCCACTGGAAGATTATCTGAAG
CTCCATCATCACCAGCTCAATCTAAAAGGCCAAGAAAACAGAAGGGATCCGATCTGTGATCCTCCAGGAGGCTGGCGGCTTTTGCTGGAAGACACTGAAGAATAT
AATTTAGCTTTTGGAGCATCATTATCTGAACTGAACTTTGATGCTGGTGTGGTTAAGAGGCGTGATTTCCTCATCCGTGGAGTCATTAGCATCTGTGGAATCGAG
TTAATAGCTGCCTGTTTACTTCATTTTATCTTCATCACGAAGCCAAAGCTCTGGTTTGCTTCATCTGTTAGCTAG
Protein sequence
MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDVSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGDYSFA
KYNKSVEVVKYTDEEYEKYLKDASWTKEETDQLFDLCERFDLRFIVIADRFPSARTVEELKERYYRASRAIVAARGSISRESSGNTPAKDPYNVSQEIERKRALS
MVLSQTKQQERKDAEVLAEAKKITEARKAERVAEESELPVTSNAVPEVTERVVVPGDNVPSISNVQPPPPAAVPSTIVADNASTLASLRMLPVYLRTYALEQMVQ
AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYTEAPGTPKDRTFIADSGKDLVNGIKNARPLEDYLK
LHHHQLNLKGQENRRDPICDPPGGWRLLLEDTEEYNLAFGASLSELNFDAGVVKRRDFLIRGVISICGIELIAACLLHFIFITKPKLWFASSVS
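
The protein sequence above should correspond to a translation of the CDS. As a small sketch, assuming the standard nuclear genetic code, the CDS could be translated and compared against the listed protein. The `translate` function and `CODON_TABLE` are hypothetical helpers written for this illustration. Only the first seven codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varies fastest). Assumption: the standard nuclear code
# applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 nucleotides of the CDS sequence listed above.
print(translate("ATGGATGCTAAGGATATCTTG"))
# prints: MDAKDIL
```

To check the whole record, join the CDS lines into one string, pass it to `translate`, and compare the result with the joined protein lines.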