; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029560 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029560
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSWR1-complex protein 4
Genome locationtig00153403:1930664..1945267
RNA-Seq ExpressionSgr029560
SyntenySgr029560
Gene Ontology termsGO:0000122 - negative regulation of transcription by RNA polymerase II (biological process)
GO:0006281 - DNA repair (biological process)
GO:0043486 - histone exchange (biological process)
GO:0043967 - histone H4 acetylation (biological process)
GO:0043968 - histone H2A acetylation (biological process)
GO:0000812 - Swr1 complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0003714 - transcription corepressor activity (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR027109 - SWR1-complex protein 4/DNA methyltransferase 1-associated protein 1
IPR032563 - DAMP1, SANT/Myb-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046714.1 SWR1-complex protein 4 [Cucumis melo var. makuwa]1.9e-18277.26Show/hide
Query:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG
        MDAKDILGLPKNTLP+PQEKKPRA KDAQRKRDGISREVYALTGGLAP+MPAIDTSELKKRPP+DEKITWQWLPF+++ARKDNLQLYHWVRVVN  PPTG
Subjt:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG

Query:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS
        DYSFAKYNKSV+VV+YTDEEYEK+L D SWTKEETDQLFDLCERFDLRFIVIADRFPS RTVEELKERYYRASRAI+ AR    RE SGN   KDPYNVS
Subjt:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM
         EIERKRALSMVLSQTKQQERKDAEVLAEAKKI ESRKAERVAEESELPVTSNAVP++ + +      I +I+NVQPPPP A PST+VADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS                                LQNKEAEGSSFRDSPY EAPGTPKDR+FI D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD

Query:  SMSFGGDRFGKRDQKRKAQEGYLKLHHHQLNLKGQENRRDPISDPPGVFRALI
        S    G       +  +  E YLKLHHHQLNLKGQENRRDPI DPPG +R L+
Subjt:  SMSFGGDRFGKRDQKRKAQEGYLKLHHHQLNLKGQENRRDPISDPPGVFRALI

KAG6593255.1 SWR1-complex protein 4, partial [Cucurbita argyrosperma subsp. sororia]8.9e-17780.14Show/hide
Query:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG
        MDAKDILGLPKNTL IP EKK RA KDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPP+DEKITWQWLPF ++ARKDNLQLYHWVRVVN +PPTG
Subjt:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG

Query:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS
        DYSFAKYNKSV++V+YTDE+YEKYLN+PSWTKEETDQLFDLCERFDLRF+VIADRFPSTRTVEELKERYYRASRAIL AR P PRE SGNPL KDPYNVS
Subjt:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM
         EIERKRALSMVLSQTKQQERKDAEVLAEAKKI ESR+ ERV E+SELPVTSNAVP   +        + +++NVQPP P AAPSTLVADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS                                LQNKEAEGSSFR+ PYNEAPGTPKDRSFIPD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD

Query:  SMSFGGDRFGKRDQKRKA
        SMS GG+R GKRDQKRKA
Subjt:  SMSFGGDRFGKRDQKRKA

XP_022153204.1 SWR1-complex protein 4 [Momordica charantia]1.4e-17780.86Show/hide
Query:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG
        MDAKDILGLPK T PIPQ+KKPRA KDAQRKRDGISREVYALTGGL PIMPAIDTSELKKRPP+DEKITWQWLPFT++ARKDNLQLYHWVRVVN +PPTG
Subjt:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG

Query:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS
        DYSFAKYNKSVDVV+YTDEEYEK+LNDP WTKEETDQLF LCERFDLRFIVIADRF STRTVEELKERYYRASRAI+VARAP  REVSGN LVKDPYNVS
Subjt:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM
        HEIERKRALSMVLSQTKQQERKDAEVLAEAKKI ESR+AERVAEESELPV SN VP++ +        + +++NVQP PP AAPSTL ADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS                                LQNKEAEGSSFRDSPYN+APGTPKDRSFIPD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD

Query:  SMSFGGDRFGKRDQKRKA
        S++FGG+RF KRDQKRKA
Subjt:  SMSFGGDRFGKRDQKRKA

XP_022991705.1 SWR1-complex protein 4-like isoform X1 [Cucurbita maxima]2.3e-17772.73Show/hide
Query:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG
        MDAKDILGLPKNTLP+ QEKKPRA KDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPP+DEKITWQWLPF+++ARKDNLQLYHWVRVVN +PP G
Subjt:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG

Query:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS
        DYSFAKYNKSV+VV+YTDEEYEK+L DPSWTKEETDQLFDLCERFDLRF+VIADRFPSTRTVEELKERYY AS+AIL AR P+ RE SGN   KDP+NVS
Subjt:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLL-KGLSFLQIRITITNVQPPPPVAAPSTLVADNASTLASLRM
         EI+RKRALSMVLSQTKQ+ERKDAEVLAEAKKIIESRKAERVAEES+L VTSN VP++  + +   +   +++NVQPPPP A PST+VADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLL-KGLSFLQIRITITNVQPPPPVAAPSTLVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLS                                LQNKEAEGSSFR+SPY+EAPGTPKDR+FI D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD

Query:  SMSFGGD--RFGKRD----------------------QKRKAQEGYLKLHHHQLNL-KGQENRRDPISDPPGV
        SM+FGG   RF ++                       +  +  E YLKLHHHQLNL KGQENRRDPI DPP V
Subjt:  SMSFGGD--RFGKRD----------------------QKRKAQEGYLKLHHHQLNL-KGQENRRDPISDPPGV

XP_038898526.1 SWR1-complex protein 4 isoform X1 [Benincasa hispida]1.1e-17980.62Show/hide
Query:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG
        MDAKDILGLPKNTLP+PQEKKPRA KDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPP+DEKITWQWLPFT++ARKDNLQLYHWVRVVN +PPTG
Subjt:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG

Query:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS
        DYSFAKYNKSV+VV+YTDEEYEKYL DPSWTKEETDQLFDLCERFDLRFIVI+DRFPS RTVEELKERYYRASRAI+ AR P+ RE SGN   KDPYNVS
Subjt:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM
         EIERKRALSMVLSQTKQQERKDAEVLAEAKKI ESR+AERVAEESELPVTSNAVP++ + +   +  + +++NVQPPPP A PST+VADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS                                LQNKEAEGSSFR+SPY EAPGTPKDR+FI D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD

Query:  SMSFGGDRFGKRDQKRKA
        SMSFGG+RFGKRDQKRKA
Subjt:  SMSFGGDRFGKRDQKRKA

TrEMBL top hitse value%identityAlignment
A0A0A0K9K0 SANT domain-containing protein1.3e-17679.9Show/hide
Query:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG
        MDAKDILGLPKNTLP+PQEKKPRA KDAQRKRDGISREVYALTGGLAPIMPAID SELKKRPP+DEKITWQWLPF+++ARKDNLQLYHWVRVVN +PPTG
Subjt:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG

Query:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS
        DYSFAKYNKSV+VV+YTDEEYEKYL D SWTKEETDQLFDLCERFDLRFIVIADRFPS RTVEELKERYYR SRAI+ AR    RE SGN   KDPYNVS
Subjt:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM
         EIERKRALSMVLSQTKQQERKDAEVLAEAKKI E+RKAERVAEESELPVTSNAVP++ + +      + +I+NVQPPPP A PST+VADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS                                LQNKEAEGSSFRDSPY EAPGTPKDR+FI D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD

Query:  SMSFGGDRFGKRDQKRKA
        S+SFGG+RFGKRDQKRKA
Subjt:  SMSFGGDRFGKRDQKRKA

A0A1S3BSL6 SWR1-complex protein 49.6e-17780.14Show/hide
Query:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG
        MDAKDILGLPKNTLP+PQEKKPRA KDAQRKRDGISREVYALTGGLAP+MPAIDTSELKKRPP+DEKITWQWLPF+++ARKDNLQLYHWVRVVN  PPTG
Subjt:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG

Query:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS
        DYSFAKYNKSV+VV+YTDEEYEK+L D SWTKEETDQLFDLCERFDLRFIVIADRFPS RTVEELKERYYRASRAI+ AR    RE SGN   KDPYNVS
Subjt:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM
         EIERKRALSMVLSQTKQQERKDAEVLAEAKKI ESRKAERVAEESELPVTSNAVP++ + +      + +I+NVQPPPP A PST+VADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS                                LQNKEAEGSSFRDSPY EAPGTPKDR+FI D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD

Query:  SMSFGGDRFGKRDQKRKA
        S+SFGG+RFGKRDQKRKA
Subjt:  SMSFGGDRFGKRDQKRKA

A0A5A7TUG7 SWR1-complex protein 49.0e-18377.26Show/hide
Query:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG
        MDAKDILGLPKNTLP+PQEKKPRA KDAQRKRDGISREVYALTGGLAP+MPAIDTSELKKRPP+DEKITWQWLPF+++ARKDNLQLYHWVRVVN  PPTG
Subjt:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG

Query:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS
        DYSFAKYNKSV+VV+YTDEEYEK+L D SWTKEETDQLFDLCERFDLRFIVIADRFPS RTVEELKERYYRASRAI+ AR    RE SGN   KDPYNVS
Subjt:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM
         EIERKRALSMVLSQTKQQERKDAEVLAEAKKI ESRKAERVAEESELPVTSNAVP++ + +      I +I+NVQPPPP A PST+VADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS                                LQNKEAEGSSFRDSPY EAPGTPKDR+FI D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD

Query:  SMSFGGDRFGKRDQKRKAQEGYLKLHHHQLNLKGQENRRDPISDPPGVFRALI
        S    G       +  +  E YLKLHHHQLNLKGQENRRDPI DPPG +R L+
Subjt:  SMSFGGDRFGKRDQKRKAQEGYLKLHHHQLNLKGQENRRDPISDPPGVFRALI

A0A6J1DID3 SWR1-complex protein 46.6e-17880.86Show/hide
Query:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG
        MDAKDILGLPK T PIPQ+KKPRA KDAQRKRDGISREVYALTGGL PIMPAIDTSELKKRPP+DEKITWQWLPFT++ARKDNLQLYHWVRVVN +PPTG
Subjt:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG

Query:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS
        DYSFAKYNKSVDVV+YTDEEYEK+LNDP WTKEETDQLF LCERFDLRFIVIADRF STRTVEELKERYYRASRAI+VARAP  REVSGN LVKDPYNVS
Subjt:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM
        HEIERKRALSMVLSQTKQQERKDAEVLAEAKKI ESR+AERVAEESELPV SN VP++ +        + +++NVQP PP AAPSTL ADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRI-TITNVQPPPPVAAPSTLVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS                                LQNKEAEGSSFRDSPYN+APGTPKDRSFIPD
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD

Query:  SMSFGGDRFGKRDQKRKA
        S++FGG+RF KRDQKRKA
Subjt:  SMSFGGDRFGKRDQKRKA

A0A6J1JTQ5 SWR1-complex protein 4-like isoform X11.1e-17772.73Show/hide
Query:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG
        MDAKDILGLPKNTLP+ QEKKPRA KDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPP+DEKITWQWLPF+++ARKDNLQLYHWVRVVN +PP G
Subjt:  MDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTG

Query:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS
        DYSFAKYNKSV+VV+YTDEEYEK+L DPSWTKEETDQLFDLCERFDLRF+VIADRFPSTRTVEELKERYY AS+AIL AR P+ RE SGN   KDP+NVS
Subjt:  DYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLL-KGLSFLQIRITITNVQPPPPVAAPSTLVADNASTLASLRM
         EI+RKRALSMVLSQTKQ+ERKDAEVLAEAKKIIESRKAERVAEES+L VTSN VP++  + +   +   +++NVQPPPP A PST+VADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLL-KGLSFLQIRITITNVQPPPPVAAPSTLVADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD
        LPVYLRTYALEQMV AASSSAGLRTIKRVEQTLQDLS                                LQNKEAEGSSFR+SPY+EAPGTPKDR+FI D
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLS--------------------------------LQNKEAEGSSFRDSPYNEAPGTPKDRSFIPD

Query:  SMSFGGD--RFGKRD----------------------QKRKAQEGYLKLHHHQLNL-KGQENRRDPISDPPGV
        SM+FGG   RF ++                       +  +  E YLKLHHHQLNL KGQENRRDPI DPP V
Subjt:  SMSFGGD--RFGKRD----------------------QKRKAQEGYLKLHHHQLNL-KGQENRRDPISDPPGV

SwissProt top hitse value%identityAlignment
O14308 SWR1-complex protein 41.1e-3336.72Show/hide
Query:  DAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTG-GLAPIMPAIDTSELKKRPPTDEKI-TWQWLPFTSTARKDNLQLYHWVRVVNAVPPT
        D +D+  LP    P     K ++    +R+ +GISRE+Y+L G   AP+  AI   + K++P    K   W   PF+ ++RKD+  L+HWV + + V   
Subjt:  DAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTG-GLAPIMPAIDTSELKKRPPTDEKI-TWQWLPFTSTARKDNLQLYHWVRVVNAVPPT

Query:  GDYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPST-----RTVEELKERYYRASRAILVARAP--SPREVSGNPL
          Y F K+N  + ++ YTDEEY+ YL D  W K+ETD LF LC+ +DLRF VIADR+ +      RT+E+LK+R+Y  SR IL+AR P  S      + L
Subjt:  GDYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPST-----RTVEELKERYYRASRAILVARAP--SPREVSGNPL

Query:  VKDPYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESEL
            YN   E+ RK+ L  + S+T ++  ++  +  E K+ IE+ +A+ +++  E+
Subjt:  VKDPYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESEL

Q7K3D8 DNA methyltransferase 1-associated protein 11.3e-2930.87Show/hide
Query:  DAKDILGLPKNTLP--------IPQEKKPRAPKDAQRKRDGISREVYAL----TGGLAPIMPAIDTS--------ELKKRPPTDEKITWQWLPFTSTARK
        D +DIL + +   P          +++     K A R+ +G+ REV+AL         P++P  DT+        E K R    +   W+W PF++ AR 
Subjt:  DAKDILGLPKNTLP--------IPQEKKPRAPKDAQRKRDGISREVYAL----TGGLAPIMPAIDTS--------ELKKRPPTDEKITWQWLPFTSTARK

Query:  DNLQLYHWVRVVNAVPPTGDYSFAKYNKSVDVVRYTDEEYEKYL--NDPSWTKEETDQLFDLCERFDLRFIVIADRF----PSTRTVEELKERYYRASRA
        D+   +HW RV +    + DY FAK+NK ++V  YT  EY  +L  N  +W+K +TD LFDL  RFDLRFIV+ADR+      T+TVEELKERYY     
Subjt:  DNLQLYHWVRVVNAVPPTGDYSFAKYNKSVDVVRYTDEEYEKYL--NDPSWTKEETDQLFDLCERFDLRFIVIADRF----PSTRTVEELKERYYRASRA

Query:  ILVARAPSPREVSGNPLVKDPYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAER-------------VAEESELPVTSNAVPKLLKGL
        ++   A +  + S   +    Y+V HE  RK  L  +  +T QQ  ++  ++ E KK IE+RK ER               +++E    + +  K  K L
Subjt:  ILVARAPSPREVSGNPLVKDPYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAER-------------VAEESELPVTSNAVPKLLKGL

Query:  SFLQIRITITNVQPPPPVAAPSTL----VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSL
           ++       Q P P    S +    +  +    A LR   V LR+  ++       ++ G R +K +EQ +Q+  +
Subjt:  SFLQIRITITNVQPPPPVAAPSTL----VADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSL

Q8VZL6 SWR1-complex protein 41.0e-12760.58Show/hide
Query:  DAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTGD
        DAKDILGLPK  L + QEKK R  K++ RK DGISREVYALTGG+AP+MP+ID   LK+RPP DEK+ W+WL FT++ARKD+LQLYHWVRVVN VPPTGD
Subjt:  DAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTGD

Query:  YSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVSH
        YSFAKYNKSVD+++YTDEEYE +L D  WTKEETDQLF+ C+ FDLRF+VIADRFP +RTVEELK+RYY  +RA+L ARA SP +V+ +PL+K+PY+++ 
Subjt:  YSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVSH

Query:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRITITNVQPPPPVAAPSTL-VADNASTLASLRML
        + ERKRALSMVLSQ++ QE+KDAE+LAEAK+I E R A R AEE ++    NA      G+   +     +N Q P    APSTL +AD ASTLASLRML
Subjt:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRITITNVQPPPPVAAPSTL-VADNASTLASLRML

Query:  PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDL--------------------------------SLQNKEAEGSSFRDSPYNEAPGTPKDRSFIPDS
         VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL                                 LQ KE+EGSS R+  Y   P TPKDR F PD 
Subjt:  PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDL--------------------------------SLQNKEAEGSSFRDSPYNEAPGTPKDRSFIPDS

Query:  MSFGGDRFGKRDQKRK
         SFG +R  K++QKRK
Subjt:  MSFGGDRFGKRDQKRK

Q9JI44 DNA methyltransferase 1-associated protein 11.6e-3231.4Show/hide
Query:  LPKNTLPIPQEKKPRAPKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPP
        + K  +  P +KK +   +    ++ +G+ REVYAL         P++P+ DT +    +K +  + +   W+W+PFT+ ARKD    +HW R   A   
Subjt:  LPKNTLPIPQEKKPRAPKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPP

Query:  TGDYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFP----STRTVEELKERYYRASRAILVARAPSPREVSGNPLVK
          DY FA++NK+V V  Y+++EY+ YL+D +WTK ETD LFDL  RFDLRF+VI DR+       R+VE+LKERYY      + A+  + R V G  L  
Subjt:  TGDYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFP----STRTVEELKERYYRASRAILVARAPSPREVSGNPLVK

Query:  DPYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRITITNVQPPPPVAAPSTLVADNASTL
          ++  HE  RK  L  + ++T +Q  ++  +L E +K IE+RK ER     +L        KL+        +       P   +              
Subjt:  DPYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRITITNVQPPPPVAAPSTLVADNASTL

Query:  ASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSLQ
        A ++          L        SS G + IK +EQ L +L ++
Subjt:  ASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSLQ

Q9NPF5 DNA methyltransferase 1-associated protein 11.6e-3231.4Show/hide
Query:  LPKNTLPIPQEKKPRAPKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPP
        + K  +  P +KK +   +    ++ +G+ REVYAL         P++P+ DT +    +K +  + +   W+W+PFT+ ARKD    +HW R   A   
Subjt:  LPKNTLPIPQEKKPRAPKDAQ--RKRDGISREVYAL----TGGLAPIMPAIDTSE----LKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPP

Query:  TGDYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFP----STRTVEELKERYYRASRAILVARAPSPREVSGNPLVK
          DY FA++NK+V V  Y+++EY+ YL+D +WTK ETD LFDL  RFDLRF+VI DR+       R+VE+LKERYY      + A+  + R V G  L  
Subjt:  TGDYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFP----STRTVEELKERYYRASRAILVARAPSPREVSGNPLVK

Query:  DPYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRITITNVQPPPPVAAPSTLVADNASTL
          ++  HE  RK  L  + ++T +Q  ++  +L E +K IE+RK ER     +L        KL+        +       P   +              
Subjt:  DPYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRITITNVQPPPPVAAPSTLVADNASTL

Query:  ASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSLQ
        A ++          L        SS G + IK +EQ L +L ++
Subjt:  ASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSLQ

Arabidopsis top hitse value%identityAlignment
AT2G47210.1 myb-like transcription factor family protein7.4e-12960.58Show/hide
Query:  DAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTGD
        DAKDILGLPK  L + QEKK R  K++ RK DGISREVYALTGG+AP+MP+ID   LK+RPP DEK+ W+WL FT++ARKD+LQLYHWVRVVN VPPTGD
Subjt:  DAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAIDTSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTGD

Query:  YSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVSH
        YSFAKYNKSVD+++YTDEEYE +L D  WTKEETDQLF+ C+ FDLRF+VIADRFP +RTVEELK+RYY  +RA+L ARA SP +V+ +PL+K+PY+++ 
Subjt:  YSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEELKERYYRASRAILVARAPSPREVSGNPLVKDPYNVSH

Query:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRITITNVQPPPPVAAPSTL-VADNASTLASLRML
        + ERKRALSMVLSQ++ QE+KDAE+LAEAK+I E R A R AEE ++    NA      G+   +     +N Q P    APSTL +AD ASTLASLRML
Subjt:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRITITNVQPPPPVAAPSTL-VADNASTLASLRML

Query:  PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDL--------------------------------SLQNKEAEGSSFRDSPYNEAPGTPKDRSFIPDS
         VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL                                 LQ KE+EGSS R+  Y   P TPKDR F PD 
Subjt:  PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDL--------------------------------SLQNKEAEGSSFRDSPYNEAPGTPKDRSFIPDS

Query:  MSFGGDRFGKRDQKRK
         SFG +R  K++QKRK
Subjt:  MSFGGDRFGKRDQKRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTACTTGGAGAGCATTACTCATGCCGATAGTAAACTTGAAGAATCATACTTGGGTGTACATTATCAACACCAGAGGAACATTACTCTGCACATTCGGCCTCTGAA
CTTGAGTTATCCGGCATTACAAACTAGAAAGAAAGCCCTAGAATTCGGGGAGTCTGTCATGGATGCCAAGGATATCTTGGGCTTGCCCAAAAACACGCTGCCTATACCCC
AAGAGAAGAAACCTAGGGCTCCCAAGGATGCCCAGAGAAAGCGAGATGGTATTTCCCGGGAGGTTTATGCGCTTACTGGTGGTCTGGCACCTATTATGCCGGCAATCGAT
ACATCTGAGCTGAAAAAGCGACCTCCAACAGATGAGAAGATTACTTGGCAGTGGCTTCCTTTCACAAGTACTGCTAGAAAAGACAATTTGCAGCTTTACCATTGGGTTAG
AGTTGTAAATGCCGTTCCACCAACAGGTGACTATTCCTTTGCGAAGTATAACAAGTCTGTTGACGTTGTCAGATACACAGATGAGGAGTACGAGAAGTATTTGAATGACC
CTTCATGGACGAAGGAGGAGACAGATCAATTATTTGACTTGTGTGAACGGTTCGATCTTCGCTTCATTGTGATAGCTGACAGGTTTCCATCAACAAGGACAGTGGAGGAA
CTGAAGGAGCGATATTATCGTGCATCTAGAGCAATTTTGGTTGCTAGAGCACCATCACCTCGGGAGGTTTCAGGGAATCCTCTTGTCAAGGATCCTTACAATGTCTCACA
TGAGATTGAGCGCAAACGGGCATTGTCCATGGTTCTCTCCCAAACAAAACAGCAAGAACGGAAAGATGCAGAGGTTCTTGCTGAAGCAAAAAAAATAATTGAATCACGCA
AAGCTGAAAGAGTGGCTGAAGAATCTGAGTTGCCTGTCACATCAAATGCTGTTCCGAAGTTGCTGAAAGGATTGTCGTTCCTGCAGATTCGTATCACCATTACCAATGTG
CAGCCCCCTCCTCCAGTCGCTGCACCTTCAACTTTAGTGGCAGATAATGCTTCTACTCTTGCTTCTCTTCGCATGCTTCCTGTGTATTTGAGAACATATGCACTCGAGCA
AATGGTACAAGCTGCAAGCTCATCTGCTGGACTTCGAACTATCAAGCGAGTTGAACAAACATTACAAGATCTCTCGTTGCAAAATAAGGAGGCAGAAGGTTCATCTTTCC
GTGACAGTCCATACAACGAGGCACCTGGCACACCCAAGGATCGCAGTTTTATTCCTGATTCTATGAGTTTTGGAGGGGATAGGTTTGGCAAACGGGATCAGAAACGTAAG
GCCCAGGAAGGTTATCTGAAGCTCCATCATCACCAGCTCAATCTAAAAGGCCAAGAAAACAGAAGGGATCCGATCTCTGATCCTCCGGGTGTATTCAGGGCCCTCATACT
TCACAAGGCAATCGCCACCGTCGATCGATATTACTTCCACGCTCCCGCCATAGTTCCTTATGGCTGGCCTCAATATGTCAAGCATTCAGCACGAGTACAAAAGCGTCACA
GAATAGTGAAACTCGTTCAACTCGTTTTGTATTTGTTCAACTACAGAAGCTCACCTCAGATCAGAAGTCCACCCACCTTGAAGTTTGAGGGAAACTACACCATCTTCAAC
AGACACAACATCGACATTTCCTCCATCTGCGATGAGATAAGGCCGACGTCCTCAAGAACCAAGTCCACGTTTCCAACGGTCAGTTCAAACTTCTGGGCCGAGTACAGTCC
TGGCGAGGAGCTTGTGGATGGACCCGAATTCGAAGCTGAAGCTCTGATGGCTCGTCTGGGCAAGACCCTTTTCAGGGAAACAGGTCCTTGCTGGTGCTGGCCGGAGGTAG
TCAGAGACGCCATGGCTGAAGAAACGCAGAGGAAAATGGCCGATGGGCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGATGTACTTGGAGAGCATTACTCATGCCGATAGTAAACTTGAAGAATCATACTTGGGTGTACATTATCAACACCAGAGGAACATTACTCTGCACATTCGGCCTCTGAA
CTTGAGTTATCCGGCATTACAAACTAGAAAGAAAGCCCTAGAATTCGGGGAGTCTGTCATGGATGCCAAGGATATCTTGGGCTTGCCCAAAAACACGCTGCCTATACCCC
AAGAGAAGAAACCTAGGGCTCCCAAGGATGCCCAGAGAAAGCGAGATGGTATTTCCCGGGAGGTTTATGCGCTTACTGGTGGTCTGGCACCTATTATGCCGGCAATCGAT
ACATCTGAGCTGAAAAAGCGACCTCCAACAGATGAGAAGATTACTTGGCAGTGGCTTCCTTTCACAAGTACTGCTAGAAAAGACAATTTGCAGCTTTACCATTGGGTTAG
AGTTGTAAATGCCGTTCCACCAACAGGTGACTATTCCTTTGCGAAGTATAACAAGTCTGTTGACGTTGTCAGATACACAGATGAGGAGTACGAGAAGTATTTGAATGACC
CTTCATGGACGAAGGAGGAGACAGATCAATTATTTGACTTGTGTGAACGGTTCGATCTTCGCTTCATTGTGATAGCTGACAGGTTTCCATCAACAAGGACAGTGGAGGAA
CTGAAGGAGCGATATTATCGTGCATCTAGAGCAATTTTGGTTGCTAGAGCACCATCACCTCGGGAGGTTTCAGGGAATCCTCTTGTCAAGGATCCTTACAATGTCTCACA
TGAGATTGAGCGCAAACGGGCATTGTCCATGGTTCTCTCCCAAACAAAACAGCAAGAACGGAAAGATGCAGAGGTTCTTGCTGAAGCAAAAAAAATAATTGAATCACGCA
AAGCTGAAAGAGTGGCTGAAGAATCTGAGTTGCCTGTCACATCAAATGCTGTTCCGAAGTTGCTGAAAGGATTGTCGTTCCTGCAGATTCGTATCACCATTACCAATGTG
CAGCCCCCTCCTCCAGTCGCTGCACCTTCAACTTTAGTGGCAGATAATGCTTCTACTCTTGCTTCTCTTCGCATGCTTCCTGTGTATTTGAGAACATATGCACTCGAGCA
AATGGTACAAGCTGCAAGCTCATCTGCTGGACTTCGAACTATCAAGCGAGTTGAACAAACATTACAAGATCTCTCGTTGCAAAATAAGGAGGCAGAAGGTTCATCTTTCC
GTGACAGTCCATACAACGAGGCACCTGGCACACCCAAGGATCGCAGTTTTATTCCTGATTCTATGAGTTTTGGAGGGGATAGGTTTGGCAAACGGGATCAGAAACGTAAG
GCCCAGGAAGGTTATCTGAAGCTCCATCATCACCAGCTCAATCTAAAAGGCCAAGAAAACAGAAGGGATCCGATCTCTGATCCTCCGGGTGTATTCAGGGCCCTCATACT
TCACAAGGCAATCGCCACCGTCGATCGATATTACTTCCACGCTCCCGCCATAGTTCCTTATGGCTGGCCTCAATATGTCAAGCATTCAGCACGAGTACAAAAGCGTCACA
GAATAGTGAAACTCGTTCAACTCGTTTTGTATTTGTTCAACTACAGAAGCTCACCTCAGATCAGAAGTCCACCCACCTTGAAGTTTGAGGGAAACTACACCATCTTCAAC
AGACACAACATCGACATTTCCTCCATCTGCGATGAGATAAGGCCGACGTCCTCAAGAACCAAGTCCACGTTTCCAACGGTCAGTTCAAACTTCTGGGCCGAGTACAGTCC
TGGCGAGGAGCTTGTGGATGGACCCGAATTCGAAGCTGAAGCTCTGATGGCTCGTCTGGGCAAGACCCTTTTCAGGGAAACAGGTCCTTGCTGGTGCTGGCCGGAGGTAG
TCAGAGACGCCATGGCTGAAGAAACGCAGAGGAAAATGGCCGATGGGCGATGA
Protein sequenceShow/hide protein sequence
MMYLESITHADSKLEESYLGVHYQHQRNITLHIRPLNLSYPALQTRKKALEFGESVMDAKDILGLPKNTLPIPQEKKPRAPKDAQRKRDGISREVYALTGGLAPIMPAID
TSELKKRPPTDEKITWQWLPFTSTARKDNLQLYHWVRVVNAVPPTGDYSFAKYNKSVDVVRYTDEEYEKYLNDPSWTKEETDQLFDLCERFDLRFIVIADRFPSTRTVEE
LKERYYRASRAILVARAPSPREVSGNPLVKDPYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKIIESRKAERVAEESELPVTSNAVPKLLKGLSFLQIRITITNV
QPPPPVAAPSTLVADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSLQNKEAEGSSFRDSPYNEAPGTPKDRSFIPDSMSFGGDRFGKRDQKRK
AQEGYLKLHHHQLNLKGQENRRDPISDPPGVFRALILHKAIATVDRYYFHAPAIVPYGWPQYVKHSARVQKRHRIVKLVQLVLYLFNYRSSPQIRSPPTLKFEGNYTIFN
RHNIDISSICDEIRPTSSRTKSTFPTVSSNFWAEYSPGEELVDGPEFEAEALMARLGKTLFRETGPCWCWPEVVRDAMAEETQRKMADGR