CuGenDBv2

Tan0019483 (gene) of Snake gourd v1 genome

Gene ID: Tan0019483
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Bromo domain-containing protein
Genome location: LG05:84335859..84340412
RNA-Seq Expression: Tan0019483
Synteny: Tan0019483
Gene Ontology terms:
GO:0016573 - histone acetylation (biological process)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR036427 - Bromodomain-like superfamily
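
The genome location above follows a `SEQID:START..END` pattern. As a minimal sketch, the coordinates can be split out and the span of the gene computed as follows. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(loc):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


seqid, start, end = parse_location("LG05:84335859..84340412")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 4554 bp on LG05, which is longer than the mRNA sequence listed further down and would be consistent with the presence of introns.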


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146636.1 uncharacterized protein LOC101217843 isoform X1 [Cucumis sativus]; e value: 1.7e-196; % identity: 80.55
Query:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA++  WDTW+ELLLGGA+LRHGT DWNLVA ELRSRI RPYACTPEVCKAKYEDL+KRFVGCKAWYEELRR+R+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKATNELSAGSFTQENR-TCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSR
        ALKSRSG DKSLVN S+RSESWGAVQK TNELSA SFTQENR TCSS+ECQPAP+S +E EIKPE    QSLE GK S IGKLG VLYE+QGG +RKRSR
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKATNELSAGSFTQENR-TCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSR

Query:  GKRKRKDCNKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIRQ
        GKRKRKDCN++VKEGS+GENNLSESANPSTVS SKENSCCNSFEARE SDANEASRSS MDGVDVLMAAFN+VA++KSAS+FRRRLDSQ+R RYKK+IRQ
Subjt:  GKRKRKDCNKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIRQ

Query:  HLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSRK
        HLDIETIRSRVASH ITT+ ELYRDLLLLANNALVFYS NSREHQSAVLLRRLI+STF+K  K+SS++VAHN  N+RT+  + +AKPRRSQPAKRN S++
Subjt:  HLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSRK

Query:  EVNSGDVKTPSGNRRRRSNANSHP-SVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK
        E N GDVKTP GNRRR++N+++ P S+GLAKKETS S +KK P GTRK V GTSKSER +ATG+RGRKRG+TK
Subjt:  EVNSGDVKTPSGNRRRRSNANSHP-SVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK
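
In the alignment blocks, the unlabelled middle line appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a similar substitution, and a space marks a mismatch or gap. A minimal sketch of how per-block identity could be counted from that line is given below. The function name `midline_stats` is hypothetical, the convention is an assumption about this page, and the fragment is only the first 20 columns of the Cucumis sativus hit, so the result is not expected to equal the % identity reported for the full alignment.

```python
def midline_stats(query, midline):
    """Return (identical, positives, length) for one alignment block."""
    # Trailing spaces of the midline are easily lost in copies,
    # so pad it to the length of the query line before counting.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    # Positives are identical residues plus '+' (similar) positions.
    positives = identical + midline.count("+")
    return identical, positives, len(query)


# First 20 alignment columns of the XP_004146636.1 hit.
query = "MGAEAIQKRWDTWEELLLGG"
midline = "MGAEA++  WDTW+ELLLGG"

identical, positives, length = midline_stats(query, midline)
print(f"identity:  {100 * identical / length:.2f}%")
print(f"positives: {100 * positives / length:.2f}%")
```

Summing the three counts over every block of a hit, rather than a single fragment, gives a figure that can be compared with the reported % identity.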

XP_008442126.1 PREDICTED: uncharacterized protein LOC103486076 isoform X1 [Cucumis melo]; e value: 2.1e-199; % identity: 82.03
Query:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA+ KRWDTW+ELLLGGA++RHGTGDWNLVA ELRSRI RPY CTPEVCKAKYEDL+KRFVGCKAWYEELR++RIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKATNELSAGSFTQENR-TCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSR
        ALKSRSG DKSLVN S+RSESWGAVQK TNE SA SFTQENR TCSS+ECQPAP+  EE EIKPE    QSLEWGK   IGKLG VLYE+QGG +RKRSR
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKATNELSAGSFTQENR-TCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSR

Query:  GKRKRKDCNKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIRQ
        GKRKRKDCN++VKEGS+GENNLSESANPSTVS SKENSCCNSFEARESSDANEASRSS MDGVDVLMA FNSVA++KSASVFRRRLDSQ+R RYKK+IRQ
Subjt:  GKRKRKDCNKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIRQ

Query:  HLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSRK
        HLDIETIRSRVASHYITT+KELYRDLLLLANNALVFYS NSREHQSAV LRRLI+STFQKL K+SS++VAHN  NQRT+  + +AKPRRSQPAKRN S++
Subjt:  HLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSRK

Query:  EVNSGDVKTPSGNRRRRSNANSHP-SVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK
        E N GDVKTP+GNRRRR+N+++ P S+GL+KKETS ST KK P G RK V GTSKSER +ATG+RGRKRGRTK
Subjt:  EVNSGDVKTPSGNRRRRSNANSHP-SVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK

XP_022954655.1 uncharacterized protein LOC111456852 isoform X1 [Cucurbita moschata]; e value: 4.5e-194; % identity: 81.61
Query:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAIQKRWDTWEELLLGGA+LRHGT DWNLVAAELR+RIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRGK
        ALKSRSGDKSLVNSS RSESWG V K TNELSAGSFTQENRTCSS+EC+ AP  A+E EIKPE SQ + LEWGKV               GTV+KRSRGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRGK

Query:  RKRKDC-NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDG--VDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIR
        RKRKDC ++DVKEGSTGENNLSESANPSTVSHSK+NSCCNSFE RESSDANEASRSS MDG  VDVLMAAFN+VA+NKSA+VFRRRLDSQKRGRYKK+IR
Subjt:  RKRKDC-NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDG--VDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIR

Query:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSR
        QHLDIETIRSRVAS YITTQKELYRDLLLLANNALVFY PN+RE++SAVLLRRLIT+TFQKLFKNS        H++RT+  +QMAK  R QPAKRN SR
Subjt:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSR

Query:  KEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK
        KEVN GD KTPSGN RRRSNANSH SVGLAK ETS STVK+ P GTRK VVGTSKSER AAT  RGRKRGR K
Subjt:  KEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK

XP_022994396.1 uncharacterized protein LOC111490126 isoform X1 [Cucurbita maxima]; e value: 6.1e-199; % identity: 82.49
Query:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAIQKRWDTWEELLLGGA+LRHGT DWNLVAAELR+RIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRI+ELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRGK
        ALKSRSGDKSLVNSS RSESWG V K TNELSAGSFTQENRTCSS+EC+ AP  A+E EIKPE SQ + LEWGKV               GTV+KRSRGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRGK

Query:  RKRKDC--NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDG--VDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKII
        RKRKDC  ++DVKEGSTGENNLSESANPSTVSHSK+NSCCNSFE RESSDANEASRSS MDG  VDVLMAAFN+VA+NKSA VFRRRLDSQKRGRYKK+I
Subjt:  RKRKDC--NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDG--VDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKII

Query:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVS
        RQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFY PN+REH+SAVLLRRLITSTFQKLFKNS        H +RT+  +QMAKP R QPAKR  S
Subjt:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVS

Query:  RKEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK
        RKEVN GD KTPSGNRRRRSNANSH SVGLAK ETS STVK+ P GTRK VVGTSKSE+ AATGVRGRKRGRTK
Subjt:  RKEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK

XP_023542669.1 uncharacterized protein LOC111802504 isoform X1 [Cucurbita pepo subsp. pepo]; e value: 1.5e-194; % identity: 81.61
Query:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAIQK+WDTWEELLLGGA+LRHGT DWNLVAAELR+RIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRGK
        ALKSRSGDKSLVNSS RSESWG V K TNELSAGSFTQENRTCSS+EC+ AP  A+E EIKPE SQ + L+WGKV               GT +KRSRGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRGK

Query:  RKRKDC-NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDG--VDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIR
        RKRKDC ++DVKEGSTGENNLSESANPSTVSHSK+NSCCNSFE RESSDANEASRSS MDG  VDVLMAAFN+VA+NKSASVFRRRLDSQKRGRYKK+IR
Subjt:  RKRKDC-NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDG--VDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIR

Query:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSR
        QHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFY PN+RE++SAVLLRRLITSTFQKLFKNS        H++RT+  +Q+AKP R QPAKRN SR
Subjt:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSR

Query:  KEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK
        KEVN GD KTPSGN RRRSNANSH SVGLAK ETS STVK+ P GTRK VVGT KSER AAT  RGRKRGRTK
Subjt:  KEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LV17 Bromo domain-containing protein; e value: 8.0e-197; % identity: 80.55
Query:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA++  WDTW+ELLLGGA+LRHGT DWNLVA ELRSRI RPYACTPEVCKAKYEDL+KRFVGCKAWYEELRR+R+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKATNELSAGSFTQENR-TCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSR
        ALKSRSG DKSLVN S+RSESWGAVQK TNELSA SFTQENR TCSS+ECQPAP+S +E EIKPE    QSLE GK S IGKLG VLYE+QGG +RKRSR
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKATNELSAGSFTQENR-TCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSR

Query:  GKRKRKDCNKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIRQ
        GKRKRKDCN++VKEGS+GENNLSESANPSTVS SKENSCCNSFEARE SDANEASRSS MDGVDVLMAAFN+VA++KSAS+FRRRLDSQ+R RYKK+IRQ
Subjt:  GKRKRKDCNKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIRQ

Query:  HLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSRK
        HLDIETIRSRVASH ITT+ ELYRDLLLLANNALVFYS NSREHQSAVLLRRLI+STF+K  K+SS++VAHN  N+RT+  + +AKPRRSQPAKRN S++
Subjt:  HLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSRK

Query:  EVNSGDVKTPSGNRRRRSNANSHP-SVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK
        E N GDVKTP GNRRR++N+++ P S+GLAKKETS S +KK P GTRK V GTSKSER +ATG+RGRKRG+TK
Subjt:  EVNSGDVKTPSGNRRRRSNANSHP-SVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK

A0A1S3B4Z1 uncharacterized protein LOC103486076 isoform X1; e value: 1.0e-199; % identity: 82.03
Query:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA+ KRWDTW+ELLLGGA++RHGTGDWNLVA ELRSRI RPY CTPEVCKAKYEDL+KRFVGCKAWYEELR++RIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKATNELSAGSFTQENR-TCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSR
        ALKSRSG DKSLVN S+RSESWGAVQK TNE SA SFTQENR TCSS+ECQPAP+  EE EIKPE    QSLEWGK   IGKLG VLYE+QGG +RKRSR
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKATNELSAGSFTQENR-TCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSR

Query:  GKRKRKDCNKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIRQ
        GKRKRKDCN++VKEGS+GENNLSESANPSTVS SKENSCCNSFEARESSDANEASRSS MDGVDVLMA FNSVA++KSASVFRRRLDSQ+R RYKK+IRQ
Subjt:  GKRKRKDCNKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIRQ

Query:  HLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSRK
        HLDIETIRSRVASHYITT+KELYRDLLLLANNALVFYS NSREHQSAV LRRLI+STFQKL K+SS++VAHN  NQRT+  + +AKPRRSQPAKRN S++
Subjt:  HLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSRK

Query:  EVNSGDVKTPSGNRRRRSNANSHP-SVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK
        E N GDVKTP+GNRRRR+N+++ P S+GL+KKETS ST KK P G RK V GTSKSER +ATG+RGRKRGRTK
Subjt:  EVNSGDVKTPSGNRRRRSNANSHP-SVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK

A0A6J1CGL2 uncharacterized protein LOC111010637; e value: 1.1e-185; % identity: 78.35
Query:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MG EAI++RWDTWEELLLGGAVLRHGTGDWNLVAAELR+RIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRR+RIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKAT-NELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRG
        ALKSRSGDK +VNS SRSESWGAVQK T NELSAGSFTQE RTCSSLEC+ AP+SAEE+EIK E    Q     KVS+I KL G+LY SQGGTVRKR RG
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKAT-NELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRG

Query:  KRKRKDC----------NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMD--GVDVLMAAFNSVADNKSASVFRRRLDSQ
        KRKRK+C          N+DVKEGS GENNLSES NP+TVS     SCCNSFE    SDANEA RSS MD  GVDVLMAAFNSVA +KSASVFRRRLDSQ
Subjt:  KRKRKDC----------NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMD--GVDVLMAAFNSVADNKSASVFRRRLDSQ

Query:  KRGRYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRR
        KRGRYKK+IRQHLDIE IRSRV SHYITT KELYRDLLLLANNALVFYS NSREHQSAVLLR +ITS F+KLFKNSS+VV HNHH Q+T+  + + KPRR
Subjt:  KRGRYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRR

Query:  SQPAKRNVSRKEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETS--GSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK
        SQPAK NVS+KE N  DVKT +G RRR + AN H SVGL KKETS   ST KKGP  TRK VVGTSKSER +ATG RGRKRGRTK
Subjt:  SQPAKRNVSRKEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETS--GSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK

A0A6J1GT05 uncharacterized protein LOC111456852 isoform X1; e value: 2.2e-194; % identity: 81.61
Query:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAIQKRWDTWEELLLGGA+LRHGT DWNLVAAELR+RIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRGK
        ALKSRSGDKSLVNSS RSESWG V K TNELSAGSFTQENRTCSS+EC+ AP  A+E EIKPE SQ + LEWGKV               GTV+KRSRGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRGK

Query:  RKRKDC-NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDG--VDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIR
        RKRKDC ++DVKEGSTGENNLSESANPSTVSHSK+NSCCNSFE RESSDANEASRSS MDG  VDVLMAAFN+VA+NKSA+VFRRRLDSQKRGRYKK+IR
Subjt:  RKRKDC-NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDG--VDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIR

Query:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSR
        QHLDIETIRSRVAS YITTQKELYRDLLLLANNALVFY PN+RE++SAVLLRRLIT+TFQKLFKNS        H++RT+  +QMAK  R QPAKRN SR
Subjt:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSR

Query:  KEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK
        KEVN GD KTPSGN RRRSNANSH SVGLAK ETS STVK+ P GTRK VVGTSKSER AAT  RGRKRGR K
Subjt:  KEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X1; e value: 2.9e-199; % identity: 82.49
Query:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAIQKRWDTWEELLLGGA+LRHGT DWNLVAAELR+RIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRI+ELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRGK
        ALKSRSGDKSLVNSS RSESWG V K TNELSAGSFTQENRTCSS+EC+ AP  A+E EIKPE SQ + LEWGKV               GTV+KRSRGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRGK

Query:  RKRKDC--NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDG--VDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKII
        RKRKDC  ++DVKEGSTGENNLSESANPSTVSHSK+NSCCNSFE RESSDANEASRSS MDG  VDVLMAAFN+VA+NKSA VFRRRLDSQKRGRYKK+I
Subjt:  RKRKDC--NKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDG--VDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKII

Query:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVS
        RQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFY PN+REH+SAVLLRRLITSTFQKLFKNS        H +RT+  +QMAKP R QPAKR  S
Subjt:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVS

Query:  RKEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK
        RKEVN GD KTPSGNRRRRSNANSH SVGLAK ETS STVK+ P GTRK VVGTSKSE+ AATGVRGRKRGRTK
Subjt:  RKEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRTK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G61215.1 bromodomain 4; e value: 6.1e-64; % identity: 38.72
Query:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M    ++  W TWEELLLGGAVLRHGTGDW +VA ELRS  + P   TPE+CKAKY+DL+KR+VGCKAW+EEL+++R+ EL+ AL  SEDSIGSLESKL+
Subjt:  MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNS-----------SSRSESWG-AVQKATNE--LSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKV-SNIGKLGGVL
        +LKS S D+   N+           S +SE  G    K T++   S GSFTQ+  T ++     +P +  E  +  E  +++ L    +  ++   GG +
Subjt:  ALKSRSGDKSLVNS-----------SSRSESWG-AVQKATNE--LSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKV-SNIGKLGGVL

Query:  YESQGGTVRKRSRGKRKRKDCN----KDVKEGSTGENN--LSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVADNKSASV
          S      ++ RGKRKRKDC+    K+V E S  E +     SA+ +++  SKE           +S ++  SR  ++     LM  +N++A N+ A V
Subjt:  YESQGGTVRKRSRGKRKRKDCN----KDVKEGSTGENN--LSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVADNKSASV

Query:  FRRRLDSQKRGRYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLF-----KNSSSVVAHNHH--
        FRRRLDSQKRGRYKK++R+H+D++T++SR+    I++ KEL+RD LL+ANNA +FYS N+RE++SAV LR ++T + +         + SS+ A +    
Subjt:  FRRRLDSQKRGRYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLF-----KNSSSVVAHNHH--

Query:  --NQRTRASNQMAKPRRSQPAKRNVSRKEVNSGDVKTPS-GNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRT
          +Q++ + +        +P       K V     KT S GN+R  ++      V   K   +G   KKG    + G       E PA   + GRKR R 
Subjt:  --NQRTRASNQMAKPRRSQPAKRNVSRKEVNSGDVKTPS-GNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGRT

Query:  K
        +
Subjt:  K

AT2G42150.1 DNA-binding bromodomain-containing protein; e value: 1.9e-20; % identity: 28.57
Query:  QKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRF------------VGCKAWYEELRRQRIMELRQALEHSEDSIGS
        ++ W TWEELLL  AV RHGT  WN V+AE++       + T   C+ KY DL+ RF            +    W EELR+ R+ ELR+ +E  + SI +
Subjt:  QKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRF------------VGCKAWYEELRRQRIMELRQALEHSEDSIGS

Query:  LESKLEALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVR
        L+SK++ L+    + S +   + +E+    +K        S + E      ++     IS +  EI  E ++ +    G      KL G        + R
Subjt:  LESKLEALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVR

Query:  KRSRGKRKRKDCNKD-VKEGSTGENNLSE--SANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGV---DVLMAAFNSVADNKSASVFRRRLDSQK
               K    N + V+  S  E   SE  ++    ++   ++S     +     D  + S +S  D       L++    +  +   S F RRL+ Q+
Subjt:  KRSRGKRKRKDCNKD-VKEGSTGENNLSE--SANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGV---DVLMAAFNSVADNKSASVFRRRLDSQK

Query:  RGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSS
           Y  IIR+H+D E IR RV    Y + +   +RDLLLL NNA VFY   S E + A  L +L+        K  S+
Subjt:  RGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSS

AT2G44430.1 DNA-binding bromodomain-containing protein; e value: 1.5e-17; % identity: 25.36
Query:  WDTWEELLLGGAVLRHGTGDWNLVAAELRSR-IVRPYACTPEVCKAKYEDLQKRF---------------------VGCK-AWYEELRRQRIMELRQALE
        W TWEELLL  AV RHG GDW+ VA E+RSR  +     +   C+ KY DL++RF                     VG    W E+LR  R+ ELR+ +E
Subjt:  WDTWEELLLGGAVLRHGTGDWNLVAAELRSR-IVRPYACTPEVCKAKYEDLQKRF---------------------VGCK-AWYEELRRQRIMELRQALE

Query:  HSEDSIGSLESKLEALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLY
          + SI SL+ K++ L+    ++ +       E+    +++ N+             S  E +   +SA E   +   S ++S          ++ G   
Subjt:  HSEDSIGSLESKLEALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLY

Query:  ESQGGTVRKRSRGKRKRKDCNKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEA------RESSDANEASRSSNMDGVDVLMAAFNSVADNKSASVF
          +    R+   G  K  D +   K+ +  E      +  S  SHS E     + E+      R+   A E   + +      L++  + +  +   S+F
Subjt:  ESQGGTVRKRSRGKRKRKDCNKDVKEGSTGENNLSESANPSTVSHSKENSCCNSFEA------RESSDANEASRSSNMDGVDVLMAAFNSVADNKSASVF

Query:  RRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRAS
         RRL SQ+   YK +++QHLDIETI+ ++    Y ++    YRDL LL  NA+VF+  +S E  +A  LR +++   +K    +   +       +  AS
Subjt:  RRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRAS

Query:  NQMAKPRRSQPAKRNVSRKEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGR
           +    ++ +  ++SR++ +SG +      R   + A+   S    K +T   T+ +       GV  + ++ + AA      K G+
Subjt:  NQMAKPRRSQPAKRNVSRKEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGPVGTRKGVVGTSKSERPAATGVRGRKRGR

AT3G57980.1 DNA-binding bromodomain-containing protein; e value: 4.6e-19; % identity: 27.85
Query:  EELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRF------------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSL
        EELLL  AV RHGT  W+ VA+E+  +       T   C+ KY DL++RF                  +    W EELR+ R+ ELR+ +E  + SI SL
Subjt:  EELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRF------------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSL

Query:  ESKLEALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRK
        + K++ L+    +KSL   +S  +        T E    S        + L+  P P           T+++  +         ++GG           K
Subjt:  ESKLEALKSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRK

Query:  RSRGKRKRKDCNKDVKEGSTGE------------NNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVAD-------NKS
         +R    R  C    KE    E             ++ ES        + +     SF  +E+ D ++         V+ +      ++D       +  
Subjt:  RSRGKRKRKDCNKDVKEGSTGE------------NNLSESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVAD-------NKS

Query:  ASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLI
         S F RRL++Q+   Y +IIRQH+D E IRSRV   +Y T + + +RDLLLL NN  VFY   S E  +A  L +LI
Subjt:  ASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLI

AT3G60110.1 DNA-binding bromodomain-containing protein; e value: 7.3e-17; % identity: 24.43
Query:  IQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRF--------------------VGCKAWYEELRRQRIMELRQAL
        I++ W TWEEL+L  AV RH   DW+ VA E+++R       +   C+ KY+DL++RF                    VG  +W E+LR   + ELR+ +
Subjt:  IQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRF--------------------VGCKAWYEELRRQRIMELRQAL

Query:  EHSEDSIGSLESKLEAL-KSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGV
        +  +DSI SL+ K++ L + + GD        +++    V+   N  +  S   +NR+ +                  E++ + S++  K+++  +L G 
Subjt:  EHSEDSIGSLESKLEAL-KSRSGDKSLVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGV

Query:  LYESQGGTVRKRSRGKRKRKDCNKD-VKEGSTGE------NNLSESANPSTVSHSKENSCCNSFEARE-SSDANEASRSSNMDGVDVLMAAFNSVADNKS
                  K  +     ++ + D V +  T E      +  SE +N   +  S  ++C    + ++  S        S  D    L+     +  +  
Subjt:  LYESQGGTVRKRSRGKRKRKDCNKD-VKEGSTGE------NNLSESANPSTVSHSKENSCCNSFEARE-SSDANEASRSSNMDGVDVLMAAFNSVADNKS

Query:  ASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQK-----------------
         SVF  RL SQ    YK++IRQHLD++TI  ++    Y+++    YRDL LL  NA+VF+  +S E  +A  LR L+++  +K                 
Subjt:  ASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRRLITSTFQK-----------------

Query:  LFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSRKEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVK
        + +  SSV++     +++ A  +   P  S   K     +EV+   + T +     RS+  +   + +  K+T     K
Subjt:  LFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSRKEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVK


Sequences
CDS sequence
ATGGGAGCGGAGGCGATACAAAAGAGGTGGGATACGTGGGAAGAACTTTTATTAGGAGGGGCCGTACTCCGACACGGAACCGGCGACTGGAATCTCGTCGCGGCGGAGCT
CCGGTCGAGGATTGTTCGTCCGTACGCCTGCACGCCCGAGGTTTGTAAAGCAAAATATGAAGACTTGCAAAAGCGATTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GGCGGCAACGAATCATGGAACTAAGACAAGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTAAAGTCTAGGAGTGGAGACAAGTCT
CTTGTCAATAGCTCTAGCAGATCAGAATCTTGGGGAGCTGTTCAGAAAGCAACGAATGAGCTATCTGCCGGTAGTTTCACACAGGAAAACAGGACGTGCAGTTCACTTGA
ATGTCAGCCAGCCCCGATATCGGCCGAAGAGATGGAGATTAAACCAGAAACATCGCAGTCGCAGTCTCTCGAATGGGGAAAAGTATCGAACATTGGCAAGTTGGGAGGGG
TGTTATATGAAAGCCAAGGAGGAACAGTGAGGAAGAGATCAAGAGGGAAGAGAAAGAGGAAGGATTGTAATAAGGATGTTAAGGAAGGAAGTACTGGGGAAAATAACTTG
TCTGAATCAGCTAACCCTTCAACTGTTTCTCATTCTAAAGAAAATTCATGCTGCAACTCGTTTGAGGCTCGCGAATCTTCTGATGCAAATGAAGCTAGCAGAAGCTCAAA
CATGGATGGAGTTGATGTTCTAATGGCTGCTTTTAACTCTGTTGCAGACAATAAAAGCGCCTCCGTATTTCGACGTCGCCTTGATAGCCAGAAGAGAGGAAGATACAAGA
AAATAATCCGGCAACATTTGGATATTGAAACAATAAGATCAAGAGTTGCAAGTCATTACATAACGACGCAAAAGGAGCTGTACAGAGATCTGCTGTTGCTTGCTAACAAT
GCGCTCGTCTTCTACTCGCCGAACTCCCGGGAGCATCAGTCTGCAGTGCTTCTAAGACGCCTCATTACAAGTACATTTCAGAAGCTTTTTAAGAACTCTAGCAGTGTGGT
AGCCCACAATCACCACAACCAGAGAACACGAGCATCTAATCAGATGGCAAAACCACGTCGTTCGCAGCCTGCAAAGCGTAATGTATCTCGAAAAGAAGTCAATTCAGGAG
ATGTCAAAACTCCAAGTGGAAATAGAAGAAGAAGAAGTAATGCCAATTCACATCCCTCAGTGGGATTAGCAAAGAAAGAAACTTCGGGTTCTACTGTAAAGAAAGGTCCT
GTTGGGACGAGAAAGGGTGTCGTTGGGACGTCGAAAAGTGAACGACCTGCAGCAACAGGCGTTAGGGGAAGGAAAAGAGGGAGAACGAAGTAA
mRNA sequence
CTGATTTCCACAACAAAGGTGGACTTGATTTTGAACTCGTTTATTTCTTTTTTATTTTTTTAAATTCAATTTCTTTATGGAGATTTAATTTTCTGCTTCTCTTTTCTTTT
TATTTTTTATTTTGTCATCTTTGGTTCTTCTTCAGCCTCTATTAAAAGTAACGAAACCCCCAATTCTCGCGATCACCAATAAAACCTCAAAACCCAATACGCACAAATAA
CCAATTTTTCTTTTACAAAAAAAAAAAAAACACAACAAAAAAACAAATGATGAAATGAACCCTTCAACCGACCCATCCCCATCCCAAAACCGCTAATTCGCACACCGGAA
AACCATCTTCCTCCTTCAATACTTCTCCCTTAAACCGCTTTTCTCCTTTTCCGATTAGGCCTCCGCTTTATTTGAAGGCGAGCTAGGGTTCCGGTGATTCATACGGAGCG
AATTTGAGATAAATATGGGAGCGGAGGCGATACAAAAGAGGTGGGATACGTGGGAAGAACTTTTATTAGGAGGGGCCGTACTCCGACACGGAACCGGCGACTGGAATCTC
GTCGCGGCGGAGCTCCGGTCGAGGATTGTTCGTCCGTACGCCTGCACGCCCGAGGTTTGTAAAGCAAAATATGAAGACTTGCAAAAGCGATTTGTTGGATGCAAAGCTTG
GTATGAGGAGCTTCGGCGGCAACGAATCATGGAACTAAGACAAGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTAAAGTCTAGGA
GTGGAGACAAGTCTCTTGTCAATAGCTCTAGCAGATCAGAATCTTGGGGAGCTGTTCAGAAAGCAACGAATGAGCTATCTGCCGGTAGTTTCACACAGGAAAACAGGACG
TGCAGTTCACTTGAATGTCAGCCAGCCCCGATATCGGCCGAAGAGATGGAGATTAAACCAGAAACATCGCAGTCGCAGTCTCTCGAATGGGGAAAAGTATCGAACATTGG
CAAGTTGGGAGGGGTGTTATATGAAAGCCAAGGAGGAACAGTGAGGAAGAGATCAAGAGGGAAGAGAAAGAGGAAGGATTGTAATAAGGATGTTAAGGAAGGAAGTACTG
GGGAAAATAACTTGTCTGAATCAGCTAACCCTTCAACTGTTTCTCATTCTAAAGAAAATTCATGCTGCAACTCGTTTGAGGCTCGCGAATCTTCTGATGCAAATGAAGCT
AGCAGAAGCTCAAACATGGATGGAGTTGATGTTCTAATGGCTGCTTTTAACTCTGTTGCAGACAATAAAAGCGCCTCCGTATTTCGACGTCGCCTTGATAGCCAGAAGAG
AGGAAGATACAAGAAAATAATCCGGCAACATTTGGATATTGAAACAATAAGATCAAGAGTTGCAAGTCATTACATAACGACGCAAAAGGAGCTGTACAGAGATCTGCTGT
TGCTTGCTAACAATGCGCTCGTCTTCTACTCGCCGAACTCCCGGGAGCATCAGTCTGCAGTGCTTCTAAGACGCCTCATTACAAGTACATTTCAGAAGCTTTTTAAGAAC
TCTAGCAGTGTGGTAGCCCACAATCACCACAACCAGAGAACACGAGCATCTAATCAGATGGCAAAACCACGTCGTTCGCAGCCTGCAAAGCGTAATGTATCTCGAAAAGA
AGTCAATTCAGGAGATGTCAAAACTCCAAGTGGAAATAGAAGAAGAAGAAGTAATGCCAATTCACATCCCTCAGTGGGATTAGCAAAGAAAGAAACTTCGGGTTCTACTG
TAAAGAAAGGTCCTGTTGGGACGAGAAAGGGTGTCGTTGGGACGTCGAAAAGTGAACGACCTGCAGCAACAGGCGTTAGGGGAAGGAAAAGAGGGAGAACGAAGTAAATG
GTAAAAAGTTAAAACCTTCTCTTGATAGATTCTTAGACCAGAACTTGTAAGTTGTTTGTAAATCAGGTAGCTCCAGGCTTCAAGGATGTGTTTGGCTATAGAAAGATTGG
AAAGATAATGGGAATTGGAATACTCATTGTGTCATTTCGTTTCATGGAATGTTTTTTTTTTTTTATTTTGAATTCATTTGTTTCAAAGAAAGCAAATGGAACGTGTTTCA
ATTGCTGAACATAAATCTCATTATATTGAAGCAAAATTTAGAATAATTGATCGATCATTCGCATAAACTGAAATTCAACTTGCAGGACAATAATTGCAAGTATTTATAAA
ATTTTGGCTTGTTCTTTGTTTCT
Protein sequence
MGAEAIQKRWDTWEELLLGGAVLRHGTGDWNLVAAELRSRIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLEALKSRSGDKS
LVNSSSRSESWGAVQKATNELSAGSFTQENRTCSSLECQPAPISAEEMEIKPETSQSQSLEWGKVSNIGKLGGVLYESQGGTVRKRSRGKRKRKDCNKDVKEGSTGENNL
SESANPSTVSHSKENSCCNSFEARESSDANEASRSSNMDGVDVLMAAFNSVADNKSASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANN
ALVFYSPNSREHQSAVLLRRLITSTFQKLFKNSSSVVAHNHHNQRTRASNQMAKPRRSQPAKRNVSRKEVNSGDVKTPSGNRRRRSNANSHPSVGLAKKETSGSTVKKGP
VGTRKGVVGTSKSERPAATGVRGRKRGRTK
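
The CDS and protein sequences above can be cross-checked by translating one into the other. The sketch below assumes the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB. It is applied only to the first 30 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS shown above.
print(translate("ATGGGAGCGGAGGCGATACAAAAGAGGTGG"))
print(translate("AGAACGAAGTAA"))
```

The two fragments translate to `MGAEAIQKRW` and `RTK`, matching the start and end of the protein sequence. To check the whole record, the wrapped CDS lines would be joined into one string before calling `translate`, and the result compared with the joined protein lines.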