; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012084 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012084
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSWR1-complex protein 4
Genome locationscaffold708:165303..171913
RNA-Seq ExpressionMS012084
SyntenyMS012084
Gene Ontology termsGO:0000122 - negative regulation of transcription by RNA polymerase II (biological process)
GO:0006281 - DNA repair (biological process)
GO:0043486 - histone exchange (biological process)
GO:0043967 - histone H4 acetylation (biological process)
GO:0043968 - histone H2A acetylation (biological process)
GO:0000812 - Swr1 complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0003714 - transcription corepressor activity (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR008468 - DNA methyltransferase 1-associated 1
IPR009057 - Homeobox-like domain superfamily
IPR027109 - SWR1-complex protein 4/DNA methyltransferase 1-associated protein 1
IPR032563 - DAMP1, SANT/Myb-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451441.1 PREDICTED: SWR1-complex protein 4 [Cucumis melo]5.6e-20486.68Show/hide
Query:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNT P+PQ+KKPRAQKDAQRKRDGISREVYALTGGL P+MPAIDTSELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
        DYSFAKYNKSV+VVKYTDEEYEKHL D  WTKEETDQLFDLCERFDLRFIVIADRF S RTVEELKERYYRASRAI+ AR  +SRE SGNT  KDPYNVS
Subjt:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
         EIERKRALSMVLSQTKQQERKDAEVLAEAKKITESR+AERVAEESELPV SN VPEV ER VVPGD+VPS+SNVQP PPAA PST+ ADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPY +APGTP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        KATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL

XP_011659406.1 SWR1-complex protein 4 [Cucumis sativus]8.1e-20386.23Show/hide
Query:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNT P+PQ+KKPRAQKDAQRKRDGISREVYALTGGL PIMPAID SELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
        DYSFAKYNKSV+VVKYTDEEYEK+L D  WTKEETDQLFDLCERFDLRFIVIADRF S RTVEELKERYYR SRAI+ AR  +SRE SGNT  KDPYNVS
Subjt:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
         EIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+R+AERVAEESELPV SN VPEV ER VVPGD+VPS+SNVQP PPAA PST+ ADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPY +APGTP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        KATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL

XP_022153204.1 SWR1-complex protein 4 [Momordica charantia]4.6e-22294.13Show/hide
Query:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPK TPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
        DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLF LCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
Subjt:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
        HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        KATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL

XP_022959680.1 SWR1-complex protein 4-like [Cucurbita moschata]5.2e-20285.33Show/hide
Query:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTP IPQ+KKPRAQKDAQRKRDGISREVYALTGGL PIMPAIDTSELKKRPPSDEKITWQWLPF NSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
        DYSFAKYNKSV++VKYTDE+YEK+LN+P WTKEETDQLFDLCERFDLRF+VIADRF STRTVEELKERYYRASRAI+ A+ P  RE SGNTL KDPY+VS
Subjt:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
         EIER+RALSMVLSQTKQQERKDAEVLAEAKKITESRR ERV E+SELPV SN VP   ERAVVPGDS+PS+SNVQP PPAAAPSTL ADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTK+VCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+ PYN+APGTP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        KATGRLSEAPS PAQ+KRPRKQKGSDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL

XP_038898526.1 SWR1-complex protein 4 isoform X1 [Benincasa hispida]4.3e-20486.91Show/hide
Query:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNT P+PQ+KKPRA KDAQRKRDGISREVYALTGGL PIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
        DYSFAKYNKSV+VVKYTDEEYEK+L DP WTKEETDQLFDLCERFDLRFIVI+DRF S RTVEELKERYYRASRAI+ AR P SRE SGNT  KDPYNVS
Subjt:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
         EIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPV SN VPEV ER VVP D+VPS+SNVQP PPAA PST+ ADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+SPY +APGTP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        KATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL

TrEMBL top hitse value%identityAlignment
A0A0A0K9K0 SANT domain-containing protein3.9e-20386.23Show/hide
Query:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNT P+PQ+KKPRAQKDAQRKRDGISREVYALTGGL PIMPAID SELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
        DYSFAKYNKSV+VVKYTDEEYEK+L D  WTKEETDQLFDLCERFDLRFIVIADRF S RTVEELKERYYR SRAI+ AR  +SRE SGNT  KDPYNVS
Subjt:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
         EIERKRALSMVLSQTKQQERKDAEVLAEAKKITE+R+AERVAEESELPV SN VPEV ER VVPGD+VPS+SNVQP PPAA PST+ ADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPY +APGTP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        KATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL

A0A1S3BSL6 SWR1-complex protein 42.7e-20486.68Show/hide
Query:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNT P+PQ+KKPRAQKDAQRKRDGISREVYALTGGL P+MPAIDTSELKKRPPSDEKITWQWLPF+NSARKDNLQLYHWVRVVNG PPTG
Subjt:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
        DYSFAKYNKSV+VVKYTDEEYEKHL D  WTKEETDQLFDLCERFDLRFIVIADRF S RTVEELKERYYRASRAI+ AR  +SRE SGNT  KDPYNVS
Subjt:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
         EIERKRALSMVLSQTKQQERKDAEVLAEAKKITESR+AERVAEESELPV SN VPEV ER VVPGD+VPS+SNVQP PPAA PST+ ADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPY +APGTP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        KATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL

A0A6J1DID3 SWR1-complex protein 42.2e-22294.13Show/hide
Query:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPK TPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
        DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLF LCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
Subjt:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
        HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        KATGRLSEAPSSPAQSKRPRKQKGSDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL

A0A6J1H708 SWR1-complex protein 4-like2.5e-20285.33Show/hide
Query:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNTP IPQ+KKPRAQKDAQRKRDGISREVYALTGGL PIMPAIDTSELKKRPPSDEKITWQWLPF NSARKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
        DYSFAKYNKSV++VKYTDE+YEK+LN+P WTKEETDQLFDLCERFDLRF+VIADRF STRTVEELKERYYRASRAI+ A+ P  RE SGNTL KDPY+VS
Subjt:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
         EIER+RALSMVLSQTKQQERKDAEVLAEAKKITESRR ERV E+SELPV SN VP   ERAVVPGDS+PS+SNVQP PPAAAPSTL ADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTK+VCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+ PYN+APGTP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        KATGRLSEAPS PAQ+KRPRKQKGSDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL

A0A6J1KWJ6 SWR1-complex protein 4-like1.0e-19884.65Show/hide
Query:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG
        MDAKDILGLPKNT  IPQ+KK RAQKDAQRKRDGISREVYALTGGL PIMPAIDTSELKKRPPSDEKITWQWLPF NS RKDNLQLYHWVRVVNGIPPTG
Subjt:  MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTG

Query:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS
        DYSFAKYNKSV++VKYTDE+YEK+LN+P WTKEETDQLFDLCERFDLRF+VIADRF STRTVEELKERYYRASRAI+ AR P+ RE SGN L KDPYNVS
Subjt:  DYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVS

Query:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM
         E ERKRALSMVLSQTKQQERKDAEVLAEAKKITESRR ERV E+SELPV  N VP   ERAVVPGDSVPS+SNVQP  PAAAPSTL ADNASTLASLRM
Subjt:  HEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTK+VCAEHLELRKEILTLLNLQKQLQNKEAEGSSFR+ PYN+APGTP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        KATGRLSEAPS PAQ+KRPRKQKGSDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL

SwissProt top hitse value%identityAlignment
O14308 SWR1-complex protein 47.0e-3227.68Show/hide
Query:  DAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTG-GLPPIMPAIDTSELKKRPPSDEKI-TWQWLPFTNSARKDNLQLYHWVRVVNGIPPT
        D +D+  L    PP     K +++   +R+ +GISRE+Y+L G    P+  AI   + K++P    K   W   PF+ S+RKD+  L+HWV + + +   
Subjt:  DAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTG-GLPPIMPAIDTSELKKRPPSDEKI-TWQWLPFTNSARKDNLQLYHWVRVVNGIPPT

Query:  GDYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSST-----RTVEELKERYYRASRAIMVARAPVSREVSGNTLVK
          Y F K+N  + ++ YTDEEY+ +L D  W K+ETD LF LC+ +DLRF VIADR+ +      RT+E+LK+R+Y  SR I++AR P++   +  + + 
Subjt:  GDYSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSST-----RTVEELKERYYRASRAIMVARAPVSREVSGNTLVK

Query:  D--PYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESEL--------------------------------PVASNVVPEVAE
        +   YN   E+ RK+ L  + S+T ++  ++  +  E K+I E+ +A+ +++  E+                                    N V E   
Subjt:  D--PYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESEL--------------------------------PVASNVVPEVAE

Query:  RAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRML---PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELR
         +  P   V S+ N    P A +   +      T     +     ++  T+   Q + A  +S      +RV   + +L V+ +  +PT     + +EL+
Subjt:  RAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRML---PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELR

Query:  KEILTLLNLQKQLQNKEAE
          I++LL L++++     E
Subjt:  KEILTLLNLQKQLQNKEAE

Q7K3D8 DNA methyltransferase 1-associated protein 19.4e-3732.63Show/hide
Query:  DAKDILGLPK-NTPPIPQD-----KKPRAQ--KDAQRKRDGISREVYAL----TGGLPPIMPAIDTS--------ELKKRPPSDEKITWQWLPFTNSARK
        D +DIL + + NTP + +D     KK   +  K A R+ +G+ REV+AL        PP++P  DT+        E K R    +   W+W PF+N AR 
Subjt:  DAKDILGLPK-NTPPIPQD-----KKPRAQ--KDAQRKRDGISREVYAL----TGGLPPIMPAIDTS--------ELKKRPPSDEKITWQWLPFTNSARK

Query:  DNLQLYHWVRVVNGIPPTGDYSFAKYNKSVDVVKYTDEEYEKHLNDPL--WTKEETDQLFDLCERFDLRFIVIADRFS----STRTVEELKERYYRASRA
        D+   +HW RV +    + DY FAK+NK ++V  YT  EY  HL + +  W+K +TD LFDL  RFDLRFIV+ADR++     T+TVEELKERYY     
Subjt:  DNLQLYHWVRVVNGIPPTGDYSFAKYNKSVDVVKYTDEEYEKHLNDPL--WTKEETDQLFDLCERFDLRFIVIADRFS----STRTVEELKERYYRASRA

Query:  IMVARAPVSREVSGNTLVKDPYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNV
        +  A+   S +      V   Y+V HE  RK  L  +  +T QQ  ++  ++ E KKI E+R+ ER  +  +L    +   +  E A     + PS    
Subjt:  IMVARAPVSREVSGNTLVKDPYNVSHEIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNV

Query:  ----------QPSPPAAAPSTLGA----DNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEIL
                  Q   P+   S + A     +    A LR   V LR+  ++       ++ G R +K +EQ +Q+  V+  P  PT+ +C    ELR +++
Subjt:  ----------QPSPPAAAPSTLGA----DNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEIL

Query:  TLLNLQKQLQNKEAEGSSFRDSPYNDAPG
         L  L+  L     E  S +       PG
Subjt:  TLLNLQKQLQNKEAEGSSFRDSPYNDAPG

Q8VZL6 SWR1-complex protein 46.3e-14264.33Show/hide
Query:  DAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGD
        DAKDILGLPK    + Q+KK R QK++ RK DGISREVYALTGG+ P+MP+ID   LK+RPP+DEK+ W+WL FTNSARKD+LQLYHWVRVVN +PPTGD
Subjt:  DAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGD

Query:  YSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVSH
        YSFAKYNKSVD++KYTDEEYE HL D +WTKEETDQLF+ C+ FDLRF+VIADRF  +RTVEELK+RYY  +RA++ ARA    +V+ + L+K+PY+++ 
Subjt:  YSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVSH

Query:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLG-ADNASTLASLRM
        + ERKRALSMVLSQ++ QE+KDAE+LAEAK+ITE R A R AEE ++    N   + A+  VVPG SV   SN Q    A APSTL  AD ASTLASLRM
Subjt:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLG-ADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        L VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL VNLKP+VPTK VC EHLELRKEILTLLNLQKQLQ KE+EGSS R+  Y   P TP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        K  GR ++ P SPA  KRPRK K SDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL

Q9JI44 DNA methyltransferase 1-associated protein 14.2e-3733.6Show/hide
Query:  PQDKKPRAQKDAQ--RKRDGISREVYAL----TGGLPPIMPAIDTSE----LKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGDYSFAK
        P  KK +   +    ++ +G+ REVYAL        PP++P+ DT +    +K +  S +   W+W+PFTN ARKD    +HW R         DY FA+
Subjt:  PQDKKPRAQKDAQ--RKRDGISREVYAL----TGGLPPIMPAIDTSE----LKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGDYSFAK

Query:  YNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFS----STRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVSHE
        +NK+V V  Y+++EY+ +L+D  WTK ETD LFDL  RFDLRF+VI DR+       R+VE+LKERYY      + A+    R V G  L    ++  HE
Subjt:  YNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFS----STRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVSHE

Query:  IERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAE-RAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRML
          RK  L  + ++T +Q  ++  +L E +KI E+R+ ER     +L          AE R          L   + +   A P T G          +  
Subjt:  IERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAE-RAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRML

Query:  PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE
         V LR+  ++       SS G + IK +EQ L +L V L P  PT+ +     ELR +++ L  L++   N E E
Subjt:  PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE

Q9NPF5 DNA methyltransferase 1-associated protein 14.2e-3733.6Show/hide
Query:  PQDKKPRAQKDAQ--RKRDGISREVYAL----TGGLPPIMPAIDTSE----LKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGDYSFAK
        P  KK +   +    ++ +G+ REVYAL        PP++P+ DT +    +K +  S +   W+W+PFTN ARKD    +HW R         DY FA+
Subjt:  PQDKKPRAQKDAQ--RKRDGISREVYAL----TGGLPPIMPAIDTSE----LKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGDYSFAK

Query:  YNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFS----STRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVSHE
        +NK+V V  Y+++EY+ +L+D  WTK ETD LFDL  RFDLRF+VI DR+       R+VE+LKERYY      + A+    R V G  L    ++  HE
Subjt:  YNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFS----STRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVSHE

Query:  IERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAE-RAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRML
          RK  L  + ++T +Q  ++  +L E +KI E+R+ ER     +L          AE R          L   + +   A P T G          +  
Subjt:  IERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAE-RAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRML

Query:  PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE
         V LR+  ++       SS G + IK +EQ L +L V L P  PT+ +     ELR +++ L  L++   N E E
Subjt:  PVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAE

Arabidopsis top hitse value%identityAlignment
AT2G47210.1 myb-like transcription factor family protein4.5e-14364.33Show/hide
Query:  DAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGD
        DAKDILGLPK    + Q+KK R QK++ RK DGISREVYALTGG+ P+MP+ID   LK+RPP+DEK+ W+WL FTNSARKD+LQLYHWVRVVN +PPTGD
Subjt:  DAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGD

Query:  YSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVSH
        YSFAKYNKSVD++KYTDEEYE HL D +WTKEETDQLF+ C+ FDLRF+VIADRF  +RTVEELK+RYY  +RA++ ARA    +V+ + L+K+PY+++ 
Subjt:  YSFAKYNKSVDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVSH

Query:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLG-ADNASTLASLRM
        + ERKRALSMVLSQ++ QE+KDAE+LAEAK+ITE R A R AEE ++    N   + A+  VVPG SV   SN Q    A APSTL  AD ASTLASLRM
Subjt:  EIERKRALSMVLSQTKQQERKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLG-ADNASTLASLRM

Query:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------
        L VYLRTY LEQMVQAASS+ GLRTIKRVEQTLQDL VNLKP+VPTK VC EHLELRKEILTLLNLQKQLQ KE+EGSS R+  Y   P TP        
Subjt:  LPVYLRTYALEQMVQAASSSAGLRTIKRVEQTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTP--------

Query:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL
                        K  GR ++ P SPA  KRPRK K SDL
Subjt:  ----------------KATGRLSEAPSSPAQSKRPRKQKGSDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCCAAGGATATCTTGGGCCTGCCCAAAAATACGCCGCCTATACCCCAGGACAAGAAACCTAGGGCTCAGAAGGATGCCCAGAGGAAGCGAGATGGTATTTCTCG
AGAGGTTTACGCGCTTACTGGTGGTCTGCCGCCTATTATGCCGGCAATTGATACATCTGAGCTGAAAAAACGTCCTCCATCAGACGAGAAGATTACGTGGCAGTGGCTTC
CTTTCACAAATTCTGCTAGAAAAGACAATTTGCAGCTTTACCATTGGGTTAGAGTTGTAAATGGCATTCCGCCAACAGGTGACTATTCCTTTGCAAAGTATAACAAGTCA
GTTGACGTTGTCAAATACACGGATGAGGAGTATGAGAAGCATTTGAACGACCCGTTGTGGACGAAGGAGGAGACAGATCAATTATTTGACTTGTGCGAACGGTTTGATCT
TCGCTTCATTGTGATAGCTGACAGGTTTTCATCAACTAGGACAGTGGAGGAACTGAAGGAGCGATATTATCGTGCATCTAGAGCAATTATGGTTGCTAGAGCACCAGTGT
CTCGCGAGGTTTCAGGGAATACTCTCGTCAAGGATCCTTACAATGTCTCACATGAGATTGAGCGGAAACGGGCATTGTCCATGGTTCTCTCCCAAACAAAACAGCAAGAA
CGAAAAGATGCAGAGGTTCTTGCTGAAGCAAAAAAGATAACTGAATCACGCAGAGCGGAAAGAGTGGCTGAAGAATCTGAGTTGCCTGTTGCATCCAATGTTGTTCCAGA
AGTTGCTGAAAGGGCTGTTGTTCCTGGAGATTCTGTACCATCTTTATCCAATGTGCAGCCCTCTCCTCCGGCAGCTGCACCTTCAACTTTAGGGGCAGATAACGCTTCAA
CTCTGGCTTCCCTTCGCATGCTTCCTGTGTACTTGAGAACGTATGCACTTGAGCAAATGGTACAAGCTGCAAGTTCATCCGCTGGGCTTCGGACGATCAAGCGAGTTGAA
CAAACATTACAAGATCTCTCGGTTAATCTAAAACCCAGGGTTCCAACAAAAGCTGTCTGTGCAGAGCATCTTGAATTAAGAAAAGAAATCTTGACTCTACTGAATCTTCA
AAAGCAGTTGCAAAATAAGGAGGCAGAAGGTTCATCTTTCCGTGACAGTCCATACAACGATGCACCTGGCACACCCAAGGCCACTGGAAGACTGTCTGAAGCTCCATCAT
CACCAGCTCAATCTAAAAGGCCAAGAAAACAGAAGGGATCCGATCTG
mRNA sequenceShow/hide mRNA sequence
ATGGATGCCAAGGATATCTTGGGCCTGCCCAAAAATACGCCGCCTATACCCCAGGACAAGAAACCTAGGGCTCAGAAGGATGCCCAGAGGAAGCGAGATGGTATTTCTCG
AGAGGTTTACGCGCTTACTGGTGGTCTGCCGCCTATTATGCCGGCAATTGATACATCTGAGCTGAAAAAACGTCCTCCATCAGACGAGAAGATTACGTGGCAGTGGCTTC
CTTTCACAAATTCTGCTAGAAAAGACAATTTGCAGCTTTACCATTGGGTTAGAGTTGTAAATGGCATTCCGCCAACAGGTGACTATTCCTTTGCAAAGTATAACAAGTCA
GTTGACGTTGTCAAATACACGGATGAGGAGTATGAGAAGCATTTGAACGACCCGTTGTGGACGAAGGAGGAGACAGATCAATTATTTGACTTGTGCGAACGGTTTGATCT
TCGCTTCATTGTGATAGCTGACAGGTTTTCATCAACTAGGACAGTGGAGGAACTGAAGGAGCGATATTATCGTGCATCTAGAGCAATTATGGTTGCTAGAGCACCAGTGT
CTCGCGAGGTTTCAGGGAATACTCTCGTCAAGGATCCTTACAATGTCTCACATGAGATTGAGCGGAAACGGGCATTGTCCATGGTTCTCTCCCAAACAAAACAGCAAGAA
CGAAAAGATGCAGAGGTTCTTGCTGAAGCAAAAAAGATAACTGAATCACGCAGAGCGGAAAGAGTGGCTGAAGAATCTGAGTTGCCTGTTGCATCCAATGTTGTTCCAGA
AGTTGCTGAAAGGGCTGTTGTTCCTGGAGATTCTGTACCATCTTTATCCAATGTGCAGCCCTCTCCTCCGGCAGCTGCACCTTCAACTTTAGGGGCAGATAACGCTTCAA
CTCTGGCTTCCCTTCGCATGCTTCCTGTGTACTTGAGAACGTATGCACTTGAGCAAATGGTACAAGCTGCAAGTTCATCCGCTGGGCTTCGGACGATCAAGCGAGTTGAA
CAAACATTACAAGATCTCTCGGTTAATCTAAAACCCAGGGTTCCAACAAAAGCTGTCTGTGCAGAGCATCTTGAATTAAGAAAAGAAATCTTGACTCTACTGAATCTTCA
AAAGCAGTTGCAAAATAAGGAGGCAGAAGGTTCATCTTTCCGTGACAGTCCATACAACGATGCACCTGGCACACCCAAGGCCACTGGAAGACTGTCTGAAGCTCCATCAT
CACCAGCTCAATCTAAAAGGCCAAGAAAACAGAAGGGATCCGATCTG
Protein sequenceShow/hide protein sequence
MDAKDILGLPKNTPPIPQDKKPRAQKDAQRKRDGISREVYALTGGLPPIMPAIDTSELKKRPPSDEKITWQWLPFTNSARKDNLQLYHWVRVVNGIPPTGDYSFAKYNKS
VDVVKYTDEEYEKHLNDPLWTKEETDQLFDLCERFDLRFIVIADRFSSTRTVEELKERYYRASRAIMVARAPVSREVSGNTLVKDPYNVSHEIERKRALSMVLSQTKQQE
RKDAEVLAEAKKITESRRAERVAEESELPVASNVVPEVAERAVVPGDSVPSLSNVQPSPPAAAPSTLGADNASTLASLRMLPVYLRTYALEQMVQAASSSAGLRTIKRVE
QTLQDLSVNLKPRVPTKAVCAEHLELRKEILTLLNLQKQLQNKEAEGSSFRDSPYNDAPGTPKATGRLSEAPSSPAQSKRPRKQKGSDL