CuGenDBv2

CcUC09G167600 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC09G167600
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: SWR1-complex protein 4
Genome location: CicolChr09:4194406..4202998
RNA-Seq Expression: CcUC09G167600
Synteny: CcUC09G167600
Gene Ontology terms:
GO:0000122 - negative regulation of transcription by RNA polymerase II (biological process)
GO:0006281 - DNA repair (biological process)
GO:0043486 - histone exchange (biological process)
GO:0043967 - histone H4 acetylation (biological process)
GO:0043968 - histone H2A acetylation (biological process)
GO:0000812 - Swr1 complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0003714 - transcription corepressor activity (molecular function)
InterPro domains:
IPR001005 - SANT/Myb domain
IPR008468 - DNA methyltransferase 1-associated 1
IPR009057 - Homeobox-like domain superfamily
IPR027109 - SWR1-complex protein 4/DNA methyltransferase 1-associated protein 1
IPR032563 - DAMP1, SANT/Myb-like domain
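
The genome location field above can be split into a sequence ID and a coordinate pair. The following is a minimal sketch, assuming the `seqid:start..end` layout seen in this record and 1-based inclusive coordinates (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re

# Assumed layout: <sequence id>:<start>..<end>
# Assumption: coordinates are 1-based and inclusive.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the genomic span under the inclusive-coordinate assumption."""
    return end - start + 1


seqid, start, end = parse_location("CicolChr09:4194406..4202998")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 8593 bp on CicolChr09, which is longer than the mRNA listed in this record, as expected for a gene with introns.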


Homology
GenBank top hits (e value, % identity, alignment)
KAA0046714.1 SWR1-complex protein 4 [Cucumis melo var. makuwa] (e value: 6.8e-227; % identity: 92.83)
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAP+MPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKD SWTKEETDQLFDLCE FDLRFIVIADRFPS RTVEELKERYYR SRAI+AARG  SRESSGNT AKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+R+AE+VAEESELPVTSNAVPEVTE+ VVPG+++PSISNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY EAPGTPK       
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------

Query:  -GKGLVNGIKNARPLEDYLKLHHHQLNLKGQENRRDPICDPPGSFR
         GKGLVNGIKNARP EDYLKLHHHQLNLKGQENRRDPICDPPG +R
Subjt:  -GKGLVNGIKNARPLEDYLKLHHHQLNLKGQENRRDPICDPPGSFR

XP_011659406.1 SWR1-complex protein 4 [Cucumis sativus] (e value: 6.2e-204; % identity: 94.72)
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAID SELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEKYLKD SWTKEETDQLFDLCE FDLRFIVIADRFPS RTVEELKERYYRVSRAI+AARG  SRESSGNT AKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEAR+AE+VAEESELPVTSNAVPEVTE+ VVPG++VPSISNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGKGLV
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY EAPGTPK +  +
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGKGLV

XP_022991705.1 SWR1-complex protein 4-like isoform X1 [Cucurbita maxima] (e value: 3.2e-216; % identity: 85.77)
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKD SWTKEETDQLFDLCE FDLRF+VIADRFPSTRTVEELKERYY  S+AIL ARGPTSRE SGNT AKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM
        QEI+RKRALSMVLSQTKQ+ERKDAEVLAEAKKI E+R+AE+VAEES+L VTSN VPEVTE+AVVPGESV S+SNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPY+EAPGTPK       
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------

Query:  -----------------------------GKGLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP
                                     GKGL+NG KN RPLEDYLKLHHHQLNL KGQENRRDPICDPP
Subjt:  -----------------------------GKGLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP

XP_022991706.1 SWR1-complex protein 4-like isoform X2 [Cucurbita maxima] (e value: 4.0e-203; % identity: 82.17)
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS
        DYSFAKYNK                   SWTKEETDQLFDLCE FDLRF+VIADRFPSTRTVEELKERYY  S+AIL ARGPTSRE SGNT AKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM
        QEI+RKRALSMVLSQTKQ+ERKDAEVLAEAKKI E+R+AE+VAEES+L VTSN VPEVTE+AVVPGESV S+SNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPY+EAPGTPK       
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------

Query:  -----------------------------GKGLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP
                                     GKGL+NG KN RPLEDYLKLHHHQLNL KGQENRRDPICDPP
Subjt:  -----------------------------GKGLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP

XP_038898526.1 SWR1-complex protein 4 isoform X1 [Benincasa hispida] (e value: 1.4e-203; % identity: 94.22)
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRA KDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEKYLKD SWTKEETDQLFDLCE FDLRFIVI+DRFPS RTVEELKERYYR SRAI+AARGPTSRESSGNT AKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+RRAE+VAEESELPVTSNAVPEVTE+ VVP ++VPS+SNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGKGLV
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPY EAPGTPK +  +
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGKGLV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K9K0 SANT domain-containing protein (e value: 3.0e-204; % identity: 94.72)
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAID SELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEKYLKD SWTKEETDQLFDLCE FDLRFIVIADRFPS RTVEELKERYYRVSRAI+AARG  SRESSGNT AKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEAR+AE+VAEESELPVTSNAVPEVTE+ VVPG++VPSISNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGKGLV
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY EAPGTPK +  +
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGKGLV

A0A1S3BSL6 SWR1-complex protein 4 (e value: 1.7e-202; % identity: 93.72)
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAP+MPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKD SWTKEETDQLFDLCE FDLRFIVIADRFPS RTVEELKERYYR SRAI+AARG  SRESSGNT AKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+R+AE+VAEESELPVTSNAVPEVTE+ VVPG++VPSISNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGKGLV
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY EAPGTPK +  +
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGKGLV

A0A5A7TUG7 SWR1-complex protein 4 (e value: 3.3e-227; % identity: 92.83)
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAP+MPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKD SWTKEETDQLFDLCE FDLRFIVIADRFPS RTVEELKERYYR SRAI+AARG  SRESSGNT AKDPYNVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM
        QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+R+AE+VAEESELPVTSNAVPEVTE+ VVPG+++PSISNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY EAPGTPK       
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------

Query:  -GKGLVNGIKNARPLEDYLKLHHHQLNLKGQENRRDPICDPPGSFR
         GKGLVNGIKNARP EDYLKLHHHQLNLKGQENRRDPICDPPG +R
Subjt:  -GKGLVNGIKNARPLEDYLKLHHHQLNLKGQENRRDPICDPPGSFR

A0A6J1JRI6 SWR1-complex protein 4-like isoform X2 (e value: 2.0e-203; % identity: 82.17)
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS
        DYSFAKYNK                   SWTKEETDQLFDLCE FDLRF+VIADRFPSTRTVEELKERYY  S+AIL ARGPTSRE SGNT AKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM
        QEI+RKRALSMVLSQTKQ+ERKDAEVLAEAKKI E+R+AE+VAEES+L VTSN VPEVTE+AVVPGESV S+SNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPY+EAPGTPK       
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------

Query:  -----------------------------GKGLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP
                                     GKGL+NG KN RPLEDYLKLHHHQLNL KGQENRRDPICDPP
Subjt:  -----------------------------GKGLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP

A0A6J1JTQ5 SWR1-complex protein 4-like isoform X1 (e value: 1.5e-216; % identity: 85.77)
Query:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTLPL QEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP G
Subjt:  MDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS
        DYSFAKYNKSVEVVKYTDEEYEK+LKD SWTKEETDQLFDLCE FDLRF+VIADRFPSTRTVEELKERYY  S+AIL ARGPTSRE SGNT AKDP+NVS
Subjt:  DYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS

Query:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM
        QEI+RKRALSMVLSQTKQ+ERKDAEVLAEAKKI E+R+AE+VAEES+L VTSN VPEVTE+AVVPGESV S+SNVQPPPPAAVPSTVVADNASTLASLRM
Subjt:  QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPY+EAPGTPK       
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPK-------

Query:  -----------------------------GKGLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP
                                     GKGL+NG KN RPLEDYLKLHHHQLNL KGQENRRDPICDPP
Subjt:  -----------------------------GKGLVNGIKNARPLEDYLKLHHHQLNL-KGQENRRDPICDPP

SwissProt top hits (e value, % identity, alignment)
O14308 SWR1-complex protein 4 (e value: 3.7e-34; % identity: 29.59)
Query:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTG-GLAPIMPAIDTSELKKRPPSDEKI-TWQWLPFSNSARKDNLQLYHWVRVVNGIPPT
        D +D+  LP    P     K +++   +R+ +GISRE+Y+L G   AP+  AI   + K++P    K   W   PFS S+RKD+  L+HWV + + +   
Subjt:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTG-GLAPIMPAIDTSELKKRPPSDEKI-TWQWLPFSNSARKDNLQLYHWVRVVNGIPPT

Query:  GDYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPST-----RTVEELKERYYRVSRAILAARGPTSRESSGNTTAK
          Y F K+N  + ++ YTDEEY+ YLKD  W K+ETD LF LC+ +DLRF VIADR+ +      RT+E+LK+R+Y VSR IL AR P +  ++  ++  
Subjt:  GDYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPST-----RTVEELKERYYRVSRAILAARGPTSRESSGNTTAK

Query:  D--PYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESEL--------------------------------PVTSNAVPEVTE
        +   YN  QE+ RK+ L  + S+T ++  ++  +  E K+I E  +A+ +++  E+                                  T N V E   
Subjt:  D--PYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESEL--------------------------------PVTSNAVPEVTE

Query:  KAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRML---PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELR
         +  P   V S+ N    P A     +      T     +     ++  T+   Q + A  +S      +RV   + +L V+ +  +PT     + +EL+
Subjt:  KAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRML---PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELR

Query:  KEILTLLNLQKQLQNKEAE
          I++LL L++++     E
Subjt:  KEILTLLNLQKQLQNKEAE

Q7K3D8 DNA methyltransferase 1-associated protein 1 (e value: 1.2e-35; % identity: 31.32)
Query:  SPRIRGRSDMDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMPAIDTS--------ELKKRPPSDEKITWQWLPFSNSA
        S  +R   DM+  +   + +++    +++     K A R+ +G+ REV+AL         P++P  DT+        E K R    +   W+W PFSN A
Subjt:  SPRIRGRSDMDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYAL----TGGLAPIMPAIDTS--------ELKKRPPSDEKITWQWLPFSNSA

Query:  RKDNLQLYHWVRVVNGIPPTGDYSFAKYNKSVEVVKYTDEEYEKYLKD--TSWTKEETDQLFDLCEWFDLRFIVIADRF----PSTRTVEELKERYYRVS
        R D+   +HW RV +    + DY FAK+NK +EV  YT  EY  +L++   +W+K +TD LFDL   FDLRFIV+ADR+      T+TVEELKERYY V 
Subjt:  RKDNLQLYHWVRVVNGIPPTGDYSFAKYNKSVEVVKYTDEEYEKYLKD--TSWTKEETDQLFDLCEWFDLRFIVIADRF----PSTRTVEELKERYYRVS

Query:  RAILAARGPTSRESSGNTTAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSIS
          +  A+  TS +          Y+V  E  RK  L  +  +T QQ  ++  ++ E KKI EAR+ E+  +  +L    +   +  E A     + PS  
Subjt:  RAILAARGPTSRESSGNTTAKDPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSIS

Query:  NV----------QPPPPAAVPSTV----VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKE
                    Q P P+ V S V    +  +    A LR   V LR+  ++       ++ G R +K +EQ +Q+  V+  P  PT+ +C    ELR +
Subjt:  NV----------QPPPPAAVPSTV----VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKE

Query:  ILTLLNLQKQLQNKEAEGSSFRESPYNEAPG
        ++ L  L+  L     E  S +       PG
Subjt:  ILTLLNLQKQLQNKEAEGSSFRESPYNEAPG

Q8VZL6 SWR1-complex protein 4 (e value: 2.7e-141; % identity: 68.35)
Query:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD
        DAKDILGLPK  L L QEKK R QK++ RK DGISREVYALTGG+AP+MP+ID   LK+RPP+DEK+ W+WL F+NSARKD+LQLYHWVRVVN +PPTGD
Subjt:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD

Query:  YSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVSQ
        YSFAKYNKSV+++KYTDEEYE +L D+ WTKEETDQLF+ C+ FDLRF+VIADRFP +RTVEELK+RYY V+RA+L AR  +  + + +   K+PY++++
Subjt:  YSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVSQ

Query:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTV-VADNASTLASLRM
        + ERKRALSMVLSQ++ QE+KDAE+LAEAK+ITE R A + AEE ++    NA  +  +  VVPG SV   SN Q P  A  PST+ +AD ASTLASLRM
Subjt:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTV-VADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGK
        L VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL VNLKP+VPTK VC EHLELRKEILTLLNLQKQLQ KE+EGSS RE  Y   P TPK +
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGK

Q9JI44 DNA methyltransferase 1-associated protein 1 (e value: 8.3e-34; % identity: 31.87)
Query:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP
        + K  +  P +KK +   +    ++ +G+ REVYAL         P++P+ DT +    +K +  S +   W+W+PF+N ARKD    +HW R       
Subjt:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP

Query:  TGDYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFP----STRTVEELKERYYRVSRAILAARGPTSRESSGNTTAK
          DY FA++NK+V+V  Y+++EY+ YL D +WTK ETD LFDL   FDLRF+VI DR+       R+VE+LKERYY +      A+    R   G     
Subjt:  TGDYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFP----STRTVEELKERYYRVSRAILAARGPTSRESSGNTTAK

Query:  DPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTE----KAVVPGESVPSISNVQPPPPAAVPSTVVAD
          ++   E  RK  L  + ++T +Q  ++  +L E +KI EAR+ E+     +L     A     E    +   P + +P     + P   AVP T    
Subjt:  DPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTE----KAVVPGESVPSISNVQPPPPAAVPSTVVAD

Query:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE
                +   V LR+  ++       SS G + IK +EQ L +L V L P  PT+ +     ELR +++ L  L++   N E E
Subjt:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE

Q9NPF5 DNA methyltransferase 1-associated protein 1 (e value: 8.3e-34; % identity: 31.87)
Query:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP
        + K  +  P +KK +   +    ++ +G+ REVYAL         P++P+ DT +    +K +  S +   W+W+PF+N ARKD    +HW R       
Subjt:  LPKNTLPLPQEKKPRAQKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPP

Query:  TGDYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFP----STRTVEELKERYYRVSRAILAARGPTSRESSGNTTAK
          DY FA++NK+V+V  Y+++EY+ YL D +WTK ETD LFDL   FDLRF+VI DR+       R+VE+LKERYY +      A+    R   G     
Subjt:  TGDYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFP----STRTVEELKERYYRVSRAILAARGPTSRESSGNTTAK

Query:  DPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTE----KAVVPGESVPSISNVQPPPPAAVPSTVVAD
          ++   E  RK  L  + ++T +Q  ++  +L E +KI EAR+ E+     +L     A     E    +   P + +P     + P   AVP T    
Subjt:  DPYNVSQEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTE----KAVVPGESVPSISNVQPPPPAAVPSTVVAD

Query:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE
                +   V LR+  ++       SS G + IK +EQ L +L V L P  PT+ +     ELR +++ L  L++   N E E
Subjt:  NASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE

Arabidopsis top hits (e value, % identity, alignment)
AT2G47210.1 myb-like transcription factor family protein (e value: 1.9e-142; % identity: 68.35)
Query:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD
        DAKDILGLPK  L L QEKK R QK++ RK DGISREVYALTGG+AP+MP+ID   LK+RPP+DEK+ W+WL F+NSARKD+LQLYHWVRVVN +PPTGD
Subjt:  DAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWVRVVNGIPPTGD

Query:  YSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVSQ
        YSFAKYNKSV+++KYTDEEYE +L D+ WTKEETDQLF+ C+ FDLRF+VIADRFP +RTVEELK+RYY V+RA+L AR  +  + + +   K+PY++++
Subjt:  YSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVSQ

Query:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTV-VADNASTLASLRM
        + ERKRALSMVLSQ++ QE+KDAE+LAEAK+ITE R A + AEE ++    NA  +  +  VVPG SV   SN Q P  A  PST+ +AD ASTLASLRM
Subjt:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTV-VADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGK
        L VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL VNLKP+VPTK VC EHLELRKEILTLLNLQKQLQ KE+EGSS RE  Y   P TPK +
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGK
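
The unlabeled line between each Query and Subjt pair in the alignments above appears to follow the BLAST-style match-line convention: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. As a rough sketch under that assumption, the identity of an alignment segment can be estimated from the match line alone. `percent_identity` is a hypothetical helper, and the value it gives for a fragment will generally differ from the whole-alignment % identity reported for a hit.

```python
def percent_identity(query, midline):
    """Estimate % identity for one alignment segment.

    Assumption: the match line uses a letter for an identical residue,
    '+' for a conservative substitution, and a space otherwise.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment segment")
    # Trailing spaces are often stripped from match lines, so pad to length.
    midline = midline.ljust(length)[:length]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / length


# First 12 columns of the AT2G47210.1 alignment shown above.
print(round(percent_identity("DAKDILGLPKNT", "DAKDILGLPK  "), 2))
```

Working from the match line rather than comparing the Query and Subjt rows is a deliberate choice here, because the match line already encodes which columns are identical.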


Sequences
CDS sequence
ATGGGTGTTTTCCGGCATTGCGAAGTAGAAAGAAGCCCTAGAATTCGGGGAAGGTCTGATATGGATGCCAAGGATATCTTGGGCTTGCCCAAGAATACGCTTCCTTTACC
CCAAGAGAAGAAGCCTAGGGCTCAGAAAGATGCTCAGAGAAAGCGAGATGGAATTTCTCGGGAGGTTTATGCACTTACTGGTGGTCTGGCGCCTATTATGCCCGCAATCG
ATACATCTGAGCTGAAGAAGCGACCTCCATCTGATGAGAAGATTACGTGGCAGTGGCTTCCTTTTTCAAATTCTGCTAGAAAGGATAATTTGCAGCTTTACCACTGGGTT
AGAGTTGTAAATGGCATCCCACCAACAGGTGACTATTCCTTTGCTAAGTACAACAAGTCTGTTGAAGTTGTCAAATACACGGATGAGGAGTACGAGAAGTATTTGAAAGA
CACTTCGTGGACAAAGGAGGAGACAGATCAATTATTTGACTTGTGCGAATGGTTTGATCTTCGCTTCATTGTGATAGCTGACAGGTTTCCATCAACAAGGACAGTGGAGG
AACTGAAGGAGCGATATTATCGTGTGTCAAGAGCAATTTTGGCTGCAAGAGGACCAACATCTCGGGAGAGTTCAGGAAATACTACCGCCAAGGATCCTTACAATGTCTCA
CAAGAGATTGAGCGCAAACGGGCATTGTCCATGGTTCTCTCCCAAACCAAACAGCAAGAACGAAAAGATGCAGAGGTTCTTGCTGAAGCAAAAAAGATAACTGAAGCACG
CAGAGCTGAAAAGGTGGCTGAAGAATCTGAGTTGCCTGTCACTTCAAACGCTGTCCCAGAAGTTACTGAAAAGGCTGTCGTTCCTGGAGAGTCTGTGCCATCTATATCCA
ATGTGCAGCCCCCACCTCCAGCAGCTGTACCTTCAACTGTAGTGGCTGATAATGCTTCTACTCTTGCTTCTCTTCGCATGCTTCCTGTATACTTGAGAACGTATGCACTT
GAGCAAATGGTACAAGCTGCAAGCTCATCTGCTGGCCTTCGGACTATCAAGCGAGTTGAACAGACATTACAAGATCTTTCGGTTAATTTAAAGCCCAGGGTTCCAACAAA
AGCTGTCTGTGCAGAGCATCTTGAATTAAGAAAAGAAATATTGACTCTACTTAATCTTCAAAAGCAGTTGCAAAATAAGGAGGCAGAAGGTTCTTCTTTCCGTGAGAGTC
CATACAATGAGGCACCTGGCACACCCAAGGGGAAAGGTTTGGTAAACGGGATCAAAAACGCAAGGCCACTGGAAGATTATCTGAAGCTCCATCATCACCAGCTCAATCTA
AAAGGCCAAGAAAACAGAAGGGATCCGATCTGTGATCCTCCAGGCAGCTTTCGCTCGAAGGCACTGGAGAATATTATTTGA
mRNA sequence
ATGGGTGTTTTCCGGCATTGCGAAGTAGAAAGAAGCCCTAGAATTCGGGGAAGGTCTGATATGGATGCCAAGGATATCTTGGGCTTGCCCAAGAATACGCTTCCTTTACC
CCAAGAGAAGAAGCCTAGGGCTCAGAAAGATGCTCAGAGAAAGCGAGATGGAATTTCTCGGGAGGTTTATGCACTTACTGGTGGTCTGGCGCCTATTATGCCCGCAATCG
ATACATCTGAGCTGAAGAAGCGACCTCCATCTGATGAGAAGATTACGTGGCAGTGGCTTCCTTTTTCAAATTCTGCTAGAAAGGATAATTTGCAGCTTTACCACTGGGTT
AGAGTTGTAAATGGCATCCCACCAACAGGTGACTATTCCTTTGCTAAGTACAACAAGTCTGTTGAAGTTGTCAAATACACGGATGAGGAGTACGAGAAGTATTTGAAAGA
CACTTCGTGGACAAAGGAGGAGACAGATCAATTATTTGACTTGTGCGAATGGTTTGATCTTCGCTTCATTGTGATAGCTGACAGGTTTCCATCAACAAGGACAGTGGAGG
AACTGAAGGAGCGATATTATCGTGTGTCAAGAGCAATTTTGGCTGCAAGAGGACCAACATCTCGGGAGAGTTCAGGAAATACTACCGCCAAGGATCCTTACAATGTCTCA
CAAGAGATTGAGCGCAAACGGGCATTGTCCATGGTTCTCTCCCAAACCAAACAGCAAGAACGAAAAGATGCAGAGGTTCTTGCTGAAGCAAAAAAGATAACTGAAGCACG
CAGAGCTGAAAAGGTGGCTGAAGAATCTGAGTTGCCTGTCACTTCAAACGCTGTCCCAGAAGTTACTGAAAAGGCTGTCGTTCCTGGAGAGTCTGTGCCATCTATATCCA
ATGTGCAGCCCCCACCTCCAGCAGCTGTACCTTCAACTGTAGTGGCTGATAATGCTTCTACTCTTGCTTCTCTTCGCATGCTTCCTGTATACTTGAGAACGTATGCACTT
GAGCAAATGGTACAAGCTGCAAGCTCATCTGCTGGCCTTCGGACTATCAAGCGAGTTGAACAGACATTACAAGATCTTTCGGTTAATTTAAAGCCCAGGGTTCCAACAAA
AGCTGTCTGTGCAGAGCATCTTGAATTAAGAAAAGAAATATTGACTCTACTTAATCTTCAAAAGCAGTTGCAAAATAAGGAGGCAGAAGGTTCTTCTTTCCGTGAGAGTC
CATACAATGAGGCACCTGGCACACCCAAGGGGAAAGGTTTGGTAAACGGGATCAAAAACGCAAGGCCACTGGAAGATTATCTGAAGCTCCATCATCACCAGCTCAATCTA
AAAGGCCAAGAAAACAGAAGGGATCCGATCTGTGATCCTCCAGGCAGCTTTCGCTCGAAGGCACTGGAGAATATTATTTGAGCTAACTATGCCCTTTGGAGCATCATTAT
CTGAACTGAACTCTGATGCTGGTTTGGTTAAGAGACACAATTTTCCTGGTCATCTGTGCAACCGAGTTAATGACGGTCTGTTTGCTCATTTATCGTCATCATGAAGCTAA
AGCTCTGGTTTTCTTACATTTATTAGCTAGGTTTGTCTTTCTTCTGTTGGCATAGTCAGCAGGATAGTTCTCCTAATGTGATGGTGTATTATATGTAAATCTACCAAGAA
ATGAGGAAAAATATCTTCCATAACTTTCATTCAGTAGACTAAAAGAATGATGATGTAATTTTCTTTTCTTCTTCCCCTTTTACTGTTGACGAATCAACATATATTAGCAA
GAGTATGTTTCCTGATGA
Protein sequence
MGVFRHCEVERSPRIRGRSDMDAKDILGLPKNTLPLPQEKKPRAQKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPSDEKITWQWLPFSNSARKDNLQLYHWV
RVVNGIPPTGDYSFAKYNKSVEVVKYTDEEYEKYLKDTSWTKEETDQLFDLCEWFDLRFIVIADRFPSTRTVEELKERYYRVSRAILAARGPTSRESSGNTTAKDPYNVS
QEIERKRALSMVLSQTKQQERKDAEVLAEAKKITEARRAEKVAEESELPVTSNAVPEVTEKAVVPGESVPSISNVQPPPPAAVPSTVVADNASTLASLRMLPVYLRTYAL
EQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRESPYNEAPGTPKGKGLVNGIKNARPLEDYLKLHHHQLNL
KGQENRRDPICDPPGSFRSKALENII
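
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly; `translate` is a hypothetical helper written for illustration and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (for example line breaks in a pasted sequence) is ignored;
    codons with unexpected characters are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGGGTGTTTTCCGGCATTGCGAAGTAGAA"))
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence gives a quick consistency check for the record.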