; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh17G009950 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh17G009950
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionSWR1-complex protein 4
Genome locationCmo_Chr17:8581387..8587023
RNA-Seq ExpressionCmoCh17G009950
SyntenyCmoCh17G009950
Gene Ontology termsGO:0000122 - negative regulation of transcription by RNA polymerase II (biological process)
GO:0006281 - DNA repair (biological process)
GO:0043486 - histone exchange (biological process)
GO:0043967 - histone H4 acetylation (biological process)
GO:0043968 - histone H2A acetylation (biological process)
GO:0000812 - Swr1 complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0003714 - transcription corepressor activity (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR008468 - DNA methyltransferase 1-associated 1
IPR009057 - Homeobox-like domain superfamily
IPR027109 - SWR1-complex protein 4/DNA methyltransferase 1-associated protein 1
IPR032563 - DAMP1, SANT/Myb-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046714.1 SWR1-complex protein 4 [Cucumis melo var. makuwa]4.9e-22291.01Show/hide
Query:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAP+MPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
        DYSFAKYNKSVEVVKYTDEEYEK LKD SWTKEETDQLFDLCERFDLRF+VIADRFPS RTVEELKERYY AS+AI+ ARG  SRE  GNTPAKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS

Query:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEI+RKRALSMVLSQTKQ ERKDAEVLAEAKKI ESRKAERVAEES+L VTSNAVPEVTER VVPG+++PS+SNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY+EAPGTPK+R FIAD
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFGGLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPP
        S    GL+N IKN RP EDYLKLHHHQLNL KGQENRRDPICDPP
Subjt:  SMSFGGLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPP

KAG6575746.1 SWR1-complex protein 4, partial [Cucurbita argyrosperma subsp. sororia]3.5e-22099.26Show/hide
Query:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
        DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREG GNTPAKDPFNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS

Query:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIDRKRALSMVLSQTKQ ERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPK+RAFIAD
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFGG
        SMSFGG
Subjt:  SMSFGG

XP_022954223.1 SWR1-complex protein 4-like isoform X1 [Cucurbita moschata]6.4e-222100Show/hide
Query:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
        DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS

Query:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFGG
        SMSFGG
Subjt:  SMSFGG

XP_022991705.1 SWR1-complex protein 4-like isoform X1 [Cucurbita maxima]8.8e-24092.63Show/hide
Query:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
        DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREG GNTPAKDPFNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS

Query:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIDRKRALSMVLSQTKQ ERKDAEVLAEAKKIIESRKAERVAEESKLAVTSN VPEVTERAVVPGESV SVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPK+RAFIAD
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFG--------------------------GLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI
        SM+FG                          GLLN  KNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI
Subjt:  SMSFG--------------------------GLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI

XP_022991706.1 SWR1-complex protein 4-like isoform X2 [Cucurbita maxima]3.6e-22588.63Show/hide
Query:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
        DYSFAKYNK                   SWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREG GNTPAKDPFNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS

Query:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIDRKRALSMVLSQTKQ ERKDAEVLAEAKKIIESRKAERVAEESKLAVTSN VPEVTERAVVPGESV SVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPK+RAFIAD
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFG--------------------------GLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI
        SM+FG                          GLLN  KNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI
Subjt:  SMSFG--------------------------GLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI

TrEMBL top hitse value%identityAlignment
A0A5A7TUG7 SWR1-complex protein 42.4e-22291.01Show/hide
Query:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAP+MPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
        DYSFAKYNKSVEVVKYTDEEYEK LKD SWTKEETDQLFDLCERFDLRF+VIADRFPS RTVEELKERYY AS+AI+ ARG  SRE  GNTPAKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS

Query:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEI+RKRALSMVLSQTKQ ERKDAEVLAEAKKI ESRKAERVAEES+L VTSNAVPEVTER VVPG+++PS+SNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY+EAPGTPK+R FIAD
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFGGLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPP
        S    GL+N IKN RP EDYLKLHHHQLNL KGQENRRDPICDPP
Subjt:  SMSFGGLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPP

A0A6J1GQH2 SWR1-complex protein 4-like isoform X13.1e-222100Show/hide
Query:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
        DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS

Query:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFGG
        SMSFGG
Subjt:  SMSFGG

A0A6J1JRI6 SWR1-complex protein 4-like isoform X21.7e-22588.63Show/hide
Query:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
        DYSFAKYNK                   SWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREG GNTPAKDPFNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS

Query:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIDRKRALSMVLSQTKQ ERKDAEVLAEAKKIIESRKAERVAEESKLAVTSN VPEVTERAVVPGESV SVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPK+RAFIAD
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFG--------------------------GLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI
        SM+FG                          GLLN  KNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI
Subjt:  SMSFG--------------------------GLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI

A0A6J1JTQ5 SWR1-complex protein 4-like isoform X14.3e-24092.63Show/hide
Query:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
        DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREG GNTPAKDPFNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS

Query:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIDRKRALSMVLSQTKQ ERKDAEVLAEAKKIIESRKAERVAEESKLAVTSN VPEVTERAVVPGESV SVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPK+RAFIAD
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFG--------------------------GLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI
        SM+FG                          GLLN  KNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI
Subjt:  SMSFG--------------------------GLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDPICDPPVVCI

A0A6J1JX24 SWR1-complex protein 4-like isoform X34.6e-21898.28Show/hide
Query:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS
        DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREG GNTPAKDPFNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVS

Query:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM
        QEIDRKRALSMVLSQTKQ ERKDAEVLAEAKKIIESRKAERVAEESKLAVTSN VPEVTERAVVPGESV SVSNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPK+RAFIAD
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFGG
        SM+FGG
Subjt:  SMSFGG

SwissProt top hitse value%identityAlignment
O14308 SWR1-complex protein 41.2e-3229.69Show/hide
Query:  DAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTG-GLAPIMPAIDTSELKKRPPSDEKI-TWQWLPFSNSARKDNLQLYHWVRVVNGIPPT
        D +D+  LP    P     K +++   +R+ +GISRE+Y+L G   AP+  AI   + K++P    K   W   PFS S+RKD+  L+HWV + + +   
Subjt:  DAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTG-GLAPIMPAIDTSELKKRPPSDEKI-TWQWLPFSNSARKDNLQLYHWVRVVNGIPPT

Query:  GDYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPST-----RTVEELKERYYHASKAILGARGP-----TSREGLG
          Y F K+N  + ++ YTDEEY+ +LKD  W K+ETD LF LC+ +DLRF VIADR+ +      RT+E+LK+R+Y  S+ IL AR P      ++  L 
Subjt:  GDYSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPST-----RTVEELKERYYHASKAILGARGP-----TSREGLG

Query:  NTPAKDPFNVSQEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESR--------KAERVAEESK-----------------------LAVTSNAVPEV
        NT     +N  QE+ RK+ L  + S+T +   ++  +  E K+I  S+        +  R+ +E K                          T N V E 
Subjt:  NTPAKDPFNVSQEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESR--------KAERVAEESK-----------------------LAVTSNAVPEV

Query:  TERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRML---PVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLE
           +  P   V SV N    P A     +      T     +     ++  T+   Q + A  +S      +RV   + +L V+ +  +PT     + +E
Subjt:  TERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRML---PVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLE

Query:  LRKEILTLLNLQKQLQNKEAE
        L+  I++LL L++++     E
Subjt:  LRKEILTLLNLQKQLQNKEAE

Q7K3D8 DNA methyltransferase 1-associated protein 15.0e-3631.24Show/hide
Query:  DAKDILGLPKNTLP--------LSQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMPAIDTS--------ELKKRPPSDEKITWQWLPFSNSARK
        D +DIL + +   P         ++++     K A R+ +G+ REV+AL         P++P  DT+        E K R    +   W+W PFSN AR 
Subjt:  DAKDILGLPKNTLP--------LSQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMPAIDTS--------ELKKRPPSDEKITWQWLPFSNSARK

Query:  DNLQLYHWVRVVNGIPPTGDYSFAKYNKSVEVVKYTDEEYEKFLKD--PSWTKEETDQLFDLCERFDLRFVVIADRF----PSTRTVEELKERYYHASKA
        D+   +HW RV +    + DY FAK+NK +EV  YT  EY   L++   +W+K +TD LFDL  RFDLRF+V+ADR+      T+TVEELKERYY     
Subjt:  DNLQLYHWVRVVNGIPPTGDYSFAKYNKSVEVVKYTDEEYEKFLKD--PSWTKEETDQLFDLCERFDLRFVVIADRF----PSTRTVEELKERYYHASKA

Query:  ILGARGPTSREGLGNTPAKDPFNVSQEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNV
        +  A+  TS + +        ++V  E  RK  L  +  +T Q   ++  ++ E KK IE+RK ER  +   L    +   +  E A     + PS    
Subjt:  ILGARGPTSREGLGNTPAKDPFNVSQEIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNV

Query:  ----------QPPPPAAVPSTV----VADNASTLASLRMLPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEIL
                  Q P P+ V S V    +  +    A LR   V LR+  ++       ++ G R +K +EQ +Q+  V+  P  PT+ +C    ELR +++
Subjt:  ----------QPPPPAAVPSTV----VADNASTLASLRMLPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEIL

Query:  TLLNLQKQLQNKEAEGSSFRESPYSEAPG
         L  L+  L     E  S +    +  PG
Subjt:  TLLNLQKQLQNKEAEGSSFRESPYSEAPG

Q8VZL6 SWR1-complex protein 41.2e-14365.55Show/hide
Query:  DAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD
        DAKDILGLPK  L L+QEKK R QK++ RK DGISREVYALTGG+AP+MP+ID   LK+RPP+DEK+ W+WL F+NSARKD+LQLYHWVRVVN +PPTGD
Subjt:  DAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD

Query:  YSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVSQ
        YSFAKYNKSV+++KYTDEEYE  L D  WTKEETDQLF+ C+ FDLRFVVIADRFP +RTVEELK+RYY  ++A+L AR  +  +   +   K+P+++++
Subjt:  YSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVSQ

Query:  EIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTV-VADNASTLASLRM
        + +RKRALSMVLSQ++  E+KDAE+LAEAK+I E R A R AEE  ++   NA  +  +  VVPG SV   SN Q P  A  PST+ +AD ASTLASLRM
Subjt:  EIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTV-VADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        L VYLRTY LEQMV AASS+ GLRTIKRVEQTLQDL VNLKP+VPTK VC EHLELRKEILTLLNLQKQLQ KE+EGSS RE  Y+  P TPK+R F  D
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFGGLLNEIKNERPLE
          SFG        ERP++
Subjt:  SMSFGGLLNEIKNERPLE

Q9JI44 DNA methyltransferase 1-associated protein 11.0e-3633.68Show/hide
Query:  PLSQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGDYSFAK
        P  ++ K  ++    ++ +G+ REVYAL         P++P+ DT +    +K +  S +   W+W+PF+N ARKD    +HW R         DY FA+
Subjt:  PLSQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGDYSFAK

Query:  YNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFP----STRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDP-FNVSQ
        +NK+V+V  Y+++EY+ +L D +WTK ETD LFDL  RFDLRFVVI DR+       R+VE+LKERYYH    +   R       +  T  K P F+   
Subjt:  YNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFP----STRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDP-FNVSQ

Query:  EIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKL-----AVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLA
        E  RK  L  + ++T +   ++  +L E +K IE+RK ER      L     A  + A    TER   P + +P     + P   AVP T          
Subjt:  EIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKL-----AVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLA

Query:  SLRMLPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE
          +   V LR+  ++       SS G + IK +EQ L +L V L P  PT+ +     ELR +++ L  L++   N E E
Subjt:  SLRMLPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE

Q9NPF5 DNA methyltransferase 1-associated protein 11.0e-3633.68Show/hide
Query:  PLSQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGDYSFAK
        P  ++ K  ++    ++ +G+ REVYAL         P++P+ DT +    +K +  S +   W+W+PF+N ARKD    +HW R         DY FA+
Subjt:  PLSQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGDYSFAK

Query:  YNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFP----STRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDP-FNVSQ
        +NK+V+V  Y+++EY+ +L D +WTK ETD LFDL  RFDLRFVVI DR+       R+VE+LKERYYH    +   R       +  T  K P F+   
Subjt:  YNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFP----STRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDP-FNVSQ

Query:  EIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKL-----AVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLA
        E  RK  L  + ++T +   ++  +L E +K IE+RK ER      L     A  + A    TER   P + +P     + P   AVP T          
Subjt:  EIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKL-----AVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLA

Query:  SLRMLPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE
          +   V LR+  ++       SS G + IK +EQ L +L V L P  PT+ +     ELR +++ L  L++   N E E
Subjt:  SLRMLPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE

Arabidopsis top hitse value%identityAlignment
AT2G47210.1 myb-like transcription factor family protein8.8e-14565.55Show/hide
Query:  DAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD
        DAKDILGLPK  L L+QEKK R QK++ RK DGISREVYALTGG+AP+MP+ID   LK+RPP+DEK+ W+WL F+NSARKD+LQLYHWVRVVN +PPTGD
Subjt:  DAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD

Query:  YSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVSQ
        YSFAKYNKSV+++KYTDEEYE  L D  WTKEETDQLF+ C+ FDLRFVVIADRFP +RTVEELK+RYY  ++A+L AR  +  +   +   K+P+++++
Subjt:  YSFAKYNKSVEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVSQ

Query:  EIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTV-VADNASTLASLRM
        + +RKRALSMVLSQ++  E+KDAE+LAEAK+I E R A R AEE  ++   NA  +  +  VVPG SV   SN Q P  A  PST+ +AD ASTLASLRM
Subjt:  EIDRKRALSMVLSQTKQNERKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTV-VADNASTLASLRM

Query:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD
        L VYLRTY LEQMV AASS+ GLRTIKRVEQTLQDL VNLKP+VPTK VC EHLELRKEILTLLNLQKQLQ KE+EGSS RE  Y+  P TPK+R F  D
Subjt:  LPVYLRTYALEQMVHAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIAD

Query:  SMSFGGLLNEIKNERPLE
          SFG        ERP++
Subjt:  SMSFGGLLNEIKNERPLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCCAAGGATATCTTGGGGTTGCCCAAAAACACGCTGCCTTTATCCCAAGAGAAGAAGCCTAGGGCTCAGAAAGATGCCCAGAGAAAGCGAGATGGAATTTCTCG
GGAGGTTTATGCGCTTACTGGTGGTCTTGCGCCTATTATGCCAGCAATCGATACATCCGAGCTGAAGAAGCGACCTCCATCAGATGAGAAGATTACATGGCAATGGCTTC
CTTTTTCAAACTCTGCGAGAAAGGATAATTTGCAGCTTTACCACTGGGTTCGAGTTGTAAATGGCATTCCACCAACAGGTGACTATTCCTTTGCAAAGTACAACAAGTCT
GTTGAAGTTGTCAAATACACAGATGAGGAGTACGAGAAGTTTTTAAAAGACCCTTCATGGACAAAGGAGGAGACAGATCAATTATTCGACTTGTGCGAAAGGTTCGATCT
TCGCTTTGTTGTGATAGCCGACAGGTTTCCATCAACAAGGACAGTGGAGGAACTGAAGGAGCGATATTATCATGCATCAAAAGCAATTTTGGGTGCTAGAGGACCAACAT
CTCGGGAGGGTTTGGGGAATACTCCTGCCAAGGATCCTTTCAACGTCTCACAAGAGATCGACCGCAAACGGGCATTGTCCATGGTTCTCTCTCAAACAAAACAGAACGAA
CGAAAAGATGCAGAGGTTCTTGCTGAAGCAAAAAAAATAATCGAATCACGCAAAGCTGAAAGAGTGGCTGAAGAATCTAAGTTGGCTGTCACTTCAAATGCTGTCCCAGA
AGTTACCGAAAGGGCCGTTGTTCCTGGAGAGTCGGTACCATCTGTATCCAATGTGCAGCCTCCACCTCCAGCAGCTGTACCTTCAACTGTAGTAGCAGATAACGCTTCTA
CTCTAGCTTCTCTTCGCATGCTTCCTGTATACTTGAGAACATATGCACTTGAGCAAATGGTACATGCTGCAAGCTCATCTGCCGGCCTTAGGACTATCAAACGAGTCGAA
CAGACCCTACAAGATCTTTCGGTTAATCTAAAACCCAGGGTTCCAACAAAAGCCGTGTGTGCAGAGCATCTTGAATTAAGAAAAGAAATATTGACACTGCTGAATCTTCA
AAAGCAGTTGCAAAATAAGGAGGCAGAAGGTTCTTCTTTCCGTGAGAGTCCATACAGCGAGGCACCTGGCACACCCAAGGAACGCGCTTTTATTGCTGATTCTATGAGTT
TTGGAGGTTTGTTAAACGAGATCAAAAACGAAAGGCCACTGGAAGATTATCTGAAGCTCCATCATCACCAGCTCAATCTAATAAAAGGCCAAGAAAACAGAAGGGATCCG
ATCTGTGATCCTCCAGTGGTATGCATCTAA
mRNA sequenceShow/hide mRNA sequence
TTTCTTATTTGGAAAATCGTAAATAAAACAACTAATTTTTCACTTAATCATGGCTCGTTGTTCGTCGCTTGCAGGTGAATTAAGGAAATCCAATAATAACAAGTTGTCTT
CCTCTGCTCACAGTGATTTGGCGCTCGAATGATGCTCAAAAAATGAAACTAGAATGATCTTCGTCGAATCGATCATATCAAAATCCCGATTGTATGGTACCCATGTGATG
AAATTCGATCGGAGGAACATTAGTGTGTAAATCATTCGATTTGTGAACTTGAGTTTTCCGGCATTGCGAAGTAGAAAAAAAAGCCCTAGAATTCGAGGGAAGTCTGATAT
GGATGCCAAGGATATCTTGGGGTTGCCCAAAAACACGCTGCCTTTATCCCAAGAGAAGAAGCCTAGGGCTCAGAAAGATGCCCAGAGAAAGCGAGATGGAATTTCTCGGG
AGGTTTATGCGCTTACTGGTGGTCTTGCGCCTATTATGCCAGCAATCGATACATCCGAGCTGAAGAAGCGACCTCCATCAGATGAGAAGATTACATGGCAATGGCTTCCT
TTTTCAAACTCTGCGAGAAAGGATAATTTGCAGCTTTACCACTGGGTTCGAGTTGTAAATGGCATTCCACCAACAGGTGACTATTCCTTTGCAAAGTACAACAAGTCTGT
TGAAGTTGTCAAATACACAGATGAGGAGTACGAGAAGTTTTTAAAAGACCCTTCATGGACAAAGGAGGAGACAGATCAATTATTCGACTTGTGCGAAAGGTTCGATCTTC
GCTTTGTTGTGATAGCCGACAGGTTTCCATCAACAAGGACAGTGGAGGAACTGAAGGAGCGATATTATCATGCATCAAAAGCAATTTTGGGTGCTAGAGGACCAACATCT
CGGGAGGGTTTGGGGAATACTCCTGCCAAGGATCCTTTCAACGTCTCACAAGAGATCGACCGCAAACGGGCATTGTCCATGGTTCTCTCTCAAACAAAACAGAACGAACG
AAAAGATGCAGAGGTTCTTGCTGAAGCAAAAAAAATAATCGAATCACGCAAAGCTGAAAGAGTGGCTGAAGAATCTAAGTTGGCTGTCACTTCAAATGCTGTCCCAGAAG
TTACCGAAAGGGCCGTTGTTCCTGGAGAGTCGGTACCATCTGTATCCAATGTGCAGCCTCCACCTCCAGCAGCTGTACCTTCAACTGTAGTAGCAGATAACGCTTCTACT
CTAGCTTCTCTTCGCATGCTTCCTGTATACTTGAGAACATATGCACTTGAGCAAATGGTACATGCTGCAAGCTCATCTGCCGGCCTTAGGACTATCAAACGAGTCGAACA
GACCCTACAAGATCTTTCGGTTAATCTAAAACCCAGGGTTCCAACAAAAGCCGTGTGTGCAGAGCATCTTGAATTAAGAAAAGAAATATTGACACTGCTGAATCTTCAAA
AGCAGTTGCAAAATAAGGAGGCAGAAGGTTCTTCTTTCCGTGAGAGTCCATACAGCGAGGCACCTGGCACACCCAAGGAACGCGCTTTTATTGCTGATTCTATGAGTTTT
GGAGGTTTGTTAAACGAGATCAAAAACGAAAGGCCACTGGAAGATTATCTGAAGCTCCATCATCACCAGCTCAATCTAATAAAAGGCCAAGAAAACAGAAGGGATCCGAT
CTGTGATCCTCCAGTGGTATGCATCTAAAAGGCTAATGCTGGAGCAGGCAGCTTTCCATGATGCCCTTTTGGAGCATCAGTTTCTGATTGAGTAAGGATGTTGTGATGAA
CGTTGATACTGGTTAAGAGACTCGATTTTCATGGTCGTCTGTGCAATCGGGTTAACAACTGTCTGTTTAGTTCATTTATCTTCATCACGACAGCAAAGCTCTGGTTTTCT
TACATCTACTAGCTAGTTTGGTCTTTCTTTTGTTGGCAGAGTTAGCAGGATAGTTCTTATATGATGGTGTGTTGCATGTAAATCTAGCCAGAAACGAGGAAGATTAACTC
CTTTTTATACACTACTTTCATTTTGTGTATTAGTTTTTGTTCTTCAACCTAAAAGAATCATGATGTAATTTTCTTTTCTTTTCTATTGTTGACAAGTAACGTAATATCAA
ATGGGTCTAGTTTCTATAATAATTTGCTATTCT
Protein sequenceShow/hide protein sequence
MDAKDILGLPKNTLPLSQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGDYSFAKYNKS
VEVVKYTDEEYEKFLKDPSWTKEETDQLFDLCERFDLRFVVIADRFPSTRTVEELKERYYHASKAILGARGPTSREGLGNTPAKDPFNVSQEIDRKRALSMVLSQTKQNE
RKDAEVLAEAKKIIESRKAERVAEESKLAVTSNAVPEVTERAVVPGESVPSVSNVQPPPPAAVPSTVVADNASTLASLRMLPVYLRTYALEQMVHAASSSAGLRTIKRVE
QTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYSEAPGTPKERAFIADSMSFGGLLNEIKNERPLEDYLKLHHHQLNLIKGQENRRDP
ICDPPVVCI