; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0010049 (gene) of Chayote v1 genome

Gene IDSed0010049
OrganismSechium edule (Chayote v1)
DescriptionBromo domain-containing protein
Genome locationLG08:36428756..36433264
RNA-Seq ExpressionSed0010049
SyntenySed0010049
Gene Ontology termsGO:0016573 - histone acetylation (biological process)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001487 - Bromodomain
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573239.1 hypothetical protein SDJN03_27126, partial [Cucurbita argyrosperma subsp. sororia]1.2e-16474.03Show/hide
Query:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE
        MGAEA++KRWDTW+ELLLGGA+LRHG  DWNLVAAELR+RI      TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMEL++ALEHSEDSI        
Subjt:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE

Query:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC-DR
            GSGD SLVNSS RSES   VHKPTNELS  SFTQENRTCSS+EC+ A   A+ETEIKPEASQ + LEWG        GTV+K+SRGKRKRKDC  R
Subjt:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC-DR

Query:  DVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETIR
        DVKEGS+GENNLSESANPSTVSHS +NS C+SFEPRESSDANEASRSSTMD   VDVLM AFN+VA+NK+AS+FRRRLDSQKRGRYKKLIRQH+DIETIR
Subjt:  DVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETIR

Query:  SRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKT
        SRVASH ITTQKELYRDLLLLANNALVFY PN+RE++SAVLLR  ITSTF+ L  KN     SH +RTQT D MAKP R QPA + N S+KEVNP D KT
Subjt:  SRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKT

Query:  PTGS-RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
        P+G+ R+SNA SHSSVGLAK ++S STVK+ P GTRK VV  SK +  AAT  RGRKRGRTK
Subjt:  PTGS-RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK

XP_008442126.1 PREDICTED: uncharacterized protein LOC103486076 isoform X1 [Cucumis melo]1.2e-16472.63Show/hide
Query:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRI---FLCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE
        MGAEA+ KRWDTWQELLLGGA++RHG  DWNLVA ELRSRI   +LCTPEVCKAKYEDL+KRFVGCKAWYEELR++RIMEL+QALEHSEDSIGSLESKLE
Subjt:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRI---FLCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE

Query:  ALKSGSG-DNSLVNSSSRSESCEAVHKPTNELSVDSFTQENR-TCSSIECQPASLSAEETEIKPEASQSQSLEWG-----GDVSSI----WKGTVRKKSR
        ALKS SG D SLVN S+RSES  AV KPTNE S  SFTQENR TCSSIECQPA L  EETEIKPE    QSLEWG     G +  +      G +RK+SR
Subjt:  ALKSGSG-DNSLVNSSSRSESCEAVHKPTNELSVDSFTQENR-TCSSIECQPASLSAEETEIKPEASQSQSLEWG-----GDVSSI----WKGTVRKKSR

Query:  GKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMDRVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQ
        GKRKRKDC+R+VKEGSSGENNLSESANPSTVS S ENS C+SFE RESSDANEASRSSTMD VDVLM  FNSVA++K+AS+FRRRLDSQ+R RYKKLIRQ
Subjt:  GKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMDRVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQ

Query:  HMDIETIRSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISH---SERTQTCDPMAKPRRSQPATKLNVS
        H+DIETIRSRVASH ITT+KELYRDLLLLANNALVFYS NSREHQSAV LR  I+STF+ L+ K+S+++++H   ++RTQTCD +AKPRRSQPA K N S
Subjt:  HMDIETIRSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISH---SERTQTCDPMAKPRRSQPATKLNVS

Query:  KKEVNPVDVKTPTGSRK---SNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
        ++E NP DVKTP G+R+   +++   SS+GL+KK++S ST KK PGG RK V   SKS+  +ATG+RGRKRGRTK
Subjt:  KKEVNPVDVKTPTGSRK---SNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK

XP_022954655.1 uncharacterized protein LOC111456852 isoform X1 [Cucurbita moschata]9.8e-17075.76Show/hide
Query:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE
        MGAEA++KRWDTW+ELLLGGA+LRHG  DWNLVAAELR+RI      TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMEL++ALEHSEDSIGSLESKLE
Subjt:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE

Query:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC-DR
        ALKS SGD SLVNSS RSES   VHKPTNELS  SFTQENRTCSS+EC+ A   A+ETEIKPEASQ + LEWG        GTV+K+SRGKRKRKDC  R
Subjt:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC-DR

Query:  DVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETIR
        DVKEGS+GENNLSESANPSTVSHS +NS C+SFEPRESSDANEASRSSTMD   VDVLM AFN+VA+NK+A++FRRRLDSQKRGRYKKLIRQH+DIETIR
Subjt:  DVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETIR

Query:  SRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKT
        SRVAS  ITTQKELYRDLLLLANNALVFY PN+RE++SAVLLR  IT+TF+ L  KN     SH +RTQT D MAK  R QPA K N S+KEVNP D KT
Subjt:  SRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKT

Query:  PTGS-RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
        P+G+ R+SNA SHSSVGLAK ++S STVK+ P GTRK VV  SKS+  AAT  RGRKRGR K
Subjt:  PTGS-RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK

XP_022994396.1 uncharacterized protein LOC111490126 isoform X1 [Cucurbita maxima]2.5e-17376.51Show/hide
Query:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE
        MGAEA++KRWDTW+ELLLGGA+LRHG  DWNLVAAELR+RI      TPEVCKAKYEDLQKRFVGCKAWYEELRRQRI+EL++ALEHSEDSIGSLESKLE
Subjt:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE

Query:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC--D
        ALKS SGD SLVNSS RSES   VHKPTNELS  SFTQENRTCSS+EC+ A   A+ETEIKPEASQ + LEWG        GTV+K+SRGKRKRKDC   
Subjt:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC--D

Query:  RDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETI
        RDVKEGS+GENNLSESANPSTVSHS +NS C+SFEPRESSDANEASRSSTMD   VDVLM AFN+VA+NK+A +FRRRLDSQKRGRYKKLIRQH+DIETI
Subjt:  RDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETI

Query:  RSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVK
        RSRVASH ITTQKELYRDLLLLANNALVFY PN+REH+SAVLLR  ITSTF+ L  KN     SH +RTQT D MAKP R QPA K   S+KEVNP D K
Subjt:  RSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVK

Query:  TPTGS--RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
        TP+G+  R+SNA SHSSVGLAK ++S STVK+ P GTRK VV  SKS+  AATGVRGRKRGRTK
Subjt:  TPTGS--RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK

XP_023542669.1 uncharacterized protein LOC111802504 isoform X1 [Cucurbita pepo subsp. pepo]3.4e-17075.76Show/hide
Query:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE
        MGAEA++K+WDTW+ELLLGGA+LRHG  DWNLVAAELR+RI      TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMEL++ALEHSEDSIGSLESKLE
Subjt:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE

Query:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC-DR
        ALKS SGD SLVNSS RSES   VHKPTNELS  SFTQENRTCSS+EC+ A   A+ETEIKPEASQ + L+WG        GT +K+SRGKRKRKDC  R
Subjt:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC-DR

Query:  DVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETIR
        DVKEGS+GENNLSESANPSTVSHS +NS C+SFEPRESSDANEASRSSTMD   VDVLM AFN+VA+NK+AS+FRRRLDSQKRGRYKKLIRQH+DIETIR
Subjt:  DVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETIR

Query:  SRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKT
        SRVASH ITTQKELYRDLLLLANNALVFY PN+RE++SAVLLR  ITSTF+ L  KN     SH +RTQT D +AKP R QPA K N S+KEVNP D KT
Subjt:  SRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKT

Query:  PTGS-RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
        P+G+ R+SNA SHSSVGLAK ++S STVK+ P GTRK VV   KS+  AAT  RGRKRGRTK
Subjt:  PTGS-RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK

TrEMBL top hitse value%identityAlignment
A0A0A0LV17 Bromo domain-containing protein1.1e-16372.27Show/hide
Query:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRI---FLCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE
        MGAEA++  WDTWQELLLGGA+LRHG ADWNLVA ELRSRI   + CTPEVCKAKYEDL+KRFVGCKAWYEELRR+R+MEL+QALEHSEDSIGSLESKLE
Subjt:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRI---FLCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE

Query:  ALKSGSG-DNSLVNSSSRSESCEAVHKPTNELSVDSFTQENR-TCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWK----------GTVRKKS
        ALKS SG D SLVN S+RSES  AV KPTNELS  SFTQENR TCSSIECQPA LS +ETEIKPE    QSLE  G  S I K          G +RK+S
Subjt:  ALKSGSG-DNSLVNSSSRSESCEAVHKPTNELSVDSFTQENR-TCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWK----------GTVRKKS

Query:  RGKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMDRVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIR
        RGKRKRKDC+R+VKEGSSGENNLSESANPSTVS S ENS C+SFE RE SDANEASRSS MD VDVLM AFN+VA++K+AS+FRRRLDSQ+R RYKKLIR
Subjt:  RGKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMDRVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIR

Query:  QHMDIETIRSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISH---SERTQTCDPMAKPRRSQPATKLNV
        QH+DIETIRSRVASH ITT+ ELYRDLLLLANNALVFYS NSREHQSAVLLR  I+STFE  + K+S+++++H   ++RTQTCD +AKPRRSQPA K N 
Subjt:  QHMDIETIRSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISH---SERTQTCDPMAKPRRSQPATKLNV

Query:  SKKEVNPVDVKTPTGSRK---SNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
        S++E NP DVKTP G+R+   +++   SS+GLAKK++S S +KK PGGTRK V   SKS+  +ATG+RGRKRG+TK
Subjt:  SKKEVNPVDVKTPTGSRK---SNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK

A0A1S3B4Z1 uncharacterized protein LOC103486076 isoform X16.0e-16572.63Show/hide
Query:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRI---FLCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE
        MGAEA+ KRWDTWQELLLGGA++RHG  DWNLVA ELRSRI   +LCTPEVCKAKYEDL+KRFVGCKAWYEELR++RIMEL+QALEHSEDSIGSLESKLE
Subjt:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRI---FLCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE

Query:  ALKSGSG-DNSLVNSSSRSESCEAVHKPTNELSVDSFTQENR-TCSSIECQPASLSAEETEIKPEASQSQSLEWG-----GDVSSI----WKGTVRKKSR
        ALKS SG D SLVN S+RSES  AV KPTNE S  SFTQENR TCSSIECQPA L  EETEIKPE    QSLEWG     G +  +      G +RK+SR
Subjt:  ALKSGSG-DNSLVNSSSRSESCEAVHKPTNELSVDSFTQENR-TCSSIECQPASLSAEETEIKPEASQSQSLEWG-----GDVSSI----WKGTVRKKSR

Query:  GKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMDRVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQ
        GKRKRKDC+R+VKEGSSGENNLSESANPSTVS S ENS C+SFE RESSDANEASRSSTMD VDVLM  FNSVA++K+AS+FRRRLDSQ+R RYKKLIRQ
Subjt:  GKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMDRVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQ

Query:  HMDIETIRSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISH---SERTQTCDPMAKPRRSQPATKLNVS
        H+DIETIRSRVASH ITT+KELYRDLLLLANNALVFYS NSREHQSAV LR  I+STF+ L+ K+S+++++H   ++RTQTCD +AKPRRSQPA K N S
Subjt:  HMDIETIRSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISH---SERTQTCDPMAKPRRSQPATKLNVS

Query:  KKEVNPVDVKTPTGSRK---SNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
        ++E NP DVKTP G+R+   +++   SS+GL+KK++S ST KK PGG RK V   SKS+  +ATG+RGRKRGRTK
Subjt:  KKEVNPVDVKTPTGSRK---SNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK

A0A6J1CGL2 uncharacterized protein LOC1110106375.4e-15069.06Show/hide
Query:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRI---FLCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE
        MG EA+E+RWDTW+ELLLGGAVLRHG  DWNLVAAELR+RI   + CTPEVCKAKYEDLQKRFVGCKAWYEELRR+RIMEL+QALEHSEDSIGSLESKLE
Subjt:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRI---FLCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE

Query:  ALKSGSGDNSLVNSSSRSESCEAVHKPT-NELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWK----------GTVRKKSR
        ALKS SGD  +VNS SRSES  AV K T NELS  SFTQE RTCSS+EC+ A LSAEE EIK EA   Q       VSSI K          GTVRK+ R
Subjt:  ALKSGSGDNSLVNSSSRSESCEAVHKPT-NELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWK----------GTVRKKSR

Query:  GKRKRKDC----------DRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDS
        GKRKRK+C          +RDVKEGS GENNLSES NP+TVS S     C+SFEP   SDANEA RSS MD   VDVLM AFNSVA +K+AS+FRRRLDS
Subjt:  GKRKRKDC----------DRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDS

Query:  QKRGRYKKLIRQHMDIETIRSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHS---ERTQTCDPMAKP
        QKRGRYKK+IRQH+DIE IRSRV SH ITT KELYRDLLLLANNALVFYS NSREHQSAVLLR  ITS F+ L  KNS++V+ H+   ++TQ  DP+ KP
Subjt:  QKRGRYKKLIRQHMDIETIRSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHS---ERTQTCDPMAKP

Query:  RRSQPATKLNVSKKEVNPVDVKTPTGSRKSN--AQSHSSVGLAKKKSS--GSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
        RRSQPA K NVS+KE N  DVKT  G R+    A  HSSVGL KK++S   ST KKGPG TRK VV  SKS+  +ATG RGRKRGRTK
Subjt:  RRSQPATKLNVSKKEVNPVDVKTPTGSRKSN--AQSHSSVGLAKKKSS--GSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK

A0A6J1GT05 uncharacterized protein LOC111456852 isoform X14.7e-17075.76Show/hide
Query:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE
        MGAEA++KRWDTW+ELLLGGA+LRHG  DWNLVAAELR+RI      TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMEL++ALEHSEDSIGSLESKLE
Subjt:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE

Query:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC-DR
        ALKS SGD SLVNSS RSES   VHKPTNELS  SFTQENRTCSS+EC+ A   A+ETEIKPEASQ + LEWG        GTV+K+SRGKRKRKDC  R
Subjt:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC-DR

Query:  DVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETIR
        DVKEGS+GENNLSESANPSTVSHS +NS C+SFEPRESSDANEASRSSTMD   VDVLM AFN+VA+NK+A++FRRRLDSQKRGRYKKLIRQH+DIETIR
Subjt:  DVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETIR

Query:  SRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKT
        SRVAS  ITTQKELYRDLLLLANNALVFY PN+RE++SAVLLR  IT+TF+ L  KN     SH +RTQT D MAK  R QPA K N S+KEVNP D KT
Subjt:  SRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKT

Query:  PTGS-RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
        P+G+ R+SNA SHSSVGLAK ++S STVK+ P GTRK VV  SKS+  AAT  RGRKRGR K
Subjt:  PTGS-RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X11.2e-17376.51Show/hide
Query:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE
        MGAEA++KRWDTW+ELLLGGA+LRHG  DWNLVAAELR+RI      TPEVCKAKYEDLQKRFVGCKAWYEELRRQRI+EL++ALEHSEDSIGSLESKLE
Subjt:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF---LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLE

Query:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC--D
        ALKS SGD SLVNSS RSES   VHKPTNELS  SFTQENRTCSS+EC+ A   A+ETEIKPEASQ + LEWG        GTV+K+SRGKRKRKDC   
Subjt:  ALKSGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDC--D

Query:  RDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETI
        RDVKEGS+GENNLSESANPSTVSHS +NS C+SFEPRESSDANEASRSSTMD   VDVLM AFN+VA+NK+A +FRRRLDSQKRGRYKKLIRQH+DIETI
Subjt:  RDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMD--RVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETI

Query:  RSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVK
        RSRVASH ITTQKELYRDLLLLANNALVFY PN+REH+SAVLLR  ITSTF+ L  KN     SH +RTQT D MAKP R QPA K   S+KEVNP D K
Subjt:  RSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVK

Query:  TPTGS--RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
        TP+G+  R+SNA SHSSVGLAK ++S STVK+ P GTRK VV  SKS+  AATGVRGRKRGRTK
Subjt:  TPTGS--RKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G61215.1 bromodomain 44.1e-6538.66Show/hide
Query:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF--LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLEA
        M    ME  W TW+ELLLGGAVLRHG  DW +VA ELRS     + TPE+CKAKY+DL+KR+VGCKAW+EEL+++R+ ELK AL  SEDSIGSLESKL++
Subjt:  MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIF--LCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLEA

Query:  LKSGSGDNSLVNSSSRSESCEAVHKPTNE--------------LSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKG-----
        LKS S D    N+   S +      P +E               SV SFTQ+  T ++   +  S    E  +  E  +++ L       S++ G     
Subjt:  LKSGSGDNSLVNSSSRSESCEAVHKPTNE--------------LSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKG-----

Query:  -TVRKKSRGKRKRKDCD----RDVKEGSSGENN--LSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMDRVDVLMTAFNSVADNKNASIFRRRL
         ++RKK RGKRKRKDC     ++V E S+ E +     SA+ +++  S E           +S ++  SR  ++     LM  +N++A N+ A +FRRRL
Subjt:  -TVRKKSRGKRKRKDCD----RDVKEGSSGENN--LSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMDRVDVLMTAFNSVADNKNASIFRRRL

Query:  DSQKRGRYKKLIRQHMDIETIRSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKN-----------STSVISHSER
        DSQKRGRYKKL+R+HMD++T++SR+    I++ KEL+RD LL+ANNA +FYS N+RE++SAV LRD +T +    L+++           ST V+   ++
Subjt:  DSQKRGRYKKLIRQHMDIETIRSRVASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKN-----------STSVISHSER

Query:  TQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKTPTGSRKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
        + +          +P T  +  K  V  +  KT   S + N +S + + ++  KSS +  KKG    + G       + PA   + GRKR R +
Subjt:  TQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKTPTGSRKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK

AT2G42150.1 DNA-binding bromodomain-containing protein3.6e-2128.92Show/hide
Query:  EKRWDTWQELLLGGAVLRHGHADWNLVAAELRS---RIFLCTPEVCKAKYEDLQKRF------------VGCKAWYEELRRQRIMELKQALEHSEDSIGS
        ++ W TW+ELLL  AV RHG   WN V+AE++     +   T   C+ KY DL+ RF            +    W EELR+ R+ EL++ +E  + SI +
Subjt:  EKRWDTWQELLLGGAVLRHGHADWNLVAAELRS---RIFLCTPEVCKAKYEDLQKRF------------VGCKAWYEELRRQRIMELKQALEHSEDSIGS

Query:  LESKLEALK-----------SGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGT
        L+SK++ L+           + + +  L     RS+S E V  P  +L        N T S     P  + +E TE + E + S     GG+     + +
Subjt:  LESKLEALK-----------SGSGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGT

Query:  VRKKSRGKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFE-PRESS---DANEASRSSTMD---RVDVLMTAFNSVADNKNASIFRRRLD
         R       K    + +  E  S    L ES + ++    I +    S   PR+ +   D  + S +S  D       L++    +  +   S F RRL+
Subjt:  VRKKSRGKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFE-PRESS---DANEASRSSTMD---RVDVLMTAFNSVADNKNASIFRRRLD

Query:  SQKRGRYKKLIRQHMDIETIRSRVASHCITTQK-ELYRDLLLLANNALVFYSPNSREHQSAVLLRDFI----TSTFEMLLSKNSTSVISHSERTQTCDPM
         Q+   Y  +IR+H+D E IR RV      + +   +RDLLLL NNA VFY   S E + A  L   +    T+T + L +++  S IS  +      P 
Subjt:  SQKRGRYKKLIRQHMDIETIRSRVASHCITTQK-ELYRDLLLLANNALVFYSPNSREHQSAVLLRDFI----TSTFEMLLSKNSTSVISHSERTQTCDPM

Query:  AKPRRSQP
        +KP  S+P
Subjt:  AKPRRSQP

AT2G44430.1 DNA-binding bromodomain-containing protein1.5e-1925.61Show/hide
Query:  EAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRI----FLCTPEVCKAKYEDLQKRF---------------------VGCK-AWYEELRRQRIME
        ++  + W TW+ELLL  AV RHG  DW+ VA E+RSR      L +   C+ KY DL++RF                     VG    W E+LR  R+ E
Subjt:  EAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRI----FLCTPEVCKAKYEDLQKRF---------------------VGCK-AWYEELRRQRIME

Query:  LKQALEHSEDSIGSLESKLEALKSGSG--------DNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEW
        L++ +E  + SI SL+ K++ L+            +N      S ++  E+ H+     + +   +ENR+ +      A+   EE     E SQ+     
Subjt:  LKQALEHSEDSIGSLESKLEALKSGSG--------DNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEW

Query:  GGDVSSIWKGTVRKKSRGKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEP------RESSDANEASRSSTMDRVDVLMTAFNSVADNK
                    R+   G  K  D D   K+ ++ E      +  S  SHS E     + E       R+   A E    S   +   L++  + +  + 
Subjt:  GGDVSSIWKGTVRKKSRGKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEP------RESSDANEASRSSTMDRVDVLMTAFNSVADNK

Query:  NASIFRRRLDSQKRGRYKKLIRQHMDIETIRSRVASHCITTQKEL-YRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERT
          S+F RRL SQ+   YK +++QH+DIETI+ ++      +   + YRDL LL  NA+VF+  +S E  +A  LR  ++        K    +I      
Subjt:  NASIFRRRLDSQKRGRYKKLIRQHMDIETIRSRVASHCITTQKEL-YRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERT

Query:  QTCDPMAKPRRSQPATKLNVSK-KEVNPVDVKTPTGSRKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGR
        Q    M   +     +  ++S+ K   P+ V     S  + A   SS    K  +   T+ +       GV    ++   AA      K G+
Subjt:  QTCDPMAKPRRSQPATKLNVSK-KEVNPVDVKTPTGSRKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGR

AT3G57980.1 DNA-binding bromodomain-containing protein5.1e-1526.46Show/hide
Query:  QELLLGGAVLRHGHADWNLVAAEL---RSRIFLCTPEVCKAKYEDLQKRF------------------VGCKAWYEELRRQRIMELKQALEHSEDSIGSL
        +ELLL  AV RHG   W+ VA+E+    S     T   C+ KY DL++RF                  +    W EELR+ R+ EL++ +E  + SI SL
Subjt:  QELLLGGAVLRHGHADWNLVAAEL---RSRIFLCTPEVCKAKYEDLQKRF------------------VGCKAWYEELRRQRIMELKQALEHSEDSIGSL

Query:  ESKLEAL-----KSGSGDNSLVN-----SSSRSESCEAVHKPTNELSVDSFTQEN-------------RTCSSIECQPASLSAEETEIKPEASQSQSLEW
        + K++ L     KS   +NS ++       + +ES      P  EL       +N             +    ++ +P  +  E+ + KP    S     
Subjt:  ESKLEAL-----KSGSGDNSLVN-----SSSRSESCEAVHKPTNELSVDSFTQEN-------------RTCSSIECQPASLSAEETEIKPEASQSQSLEW

Query:  GGDVSSIWKGTVRKKSRGKRKRKDCDRDVK--EGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMDRVDV----LMTAFNSVADNK
         G   S+ K + R  +  KR+  D    V+  + S GE +  E+++  + +        D  +P      +   +S T++++ V    L      +  + 
Subjt:  GGDVSSIWKGTVRKKSRGKRKRKDCDRDVK--EGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTMDRVDV----LMTAFNSVADNK

Query:  NASIFRRRLDSQKRGRYKKLIRQHMDIETIRSRV-ASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERT
          S F RRL++Q+   Y ++IRQH+D E IRSRV   +  T + + +RDLLLL NN  VFY   S E  +A  L   I       + K            
Subjt:  NASIFRRRLDSQKRGRYKKLIRQHMDIETIRSRV-ASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSERT

Query:  QTCDPMAKPRRSQPATKLNVSKKEVNPVDVKTPTGSRKSNAQSHSSVGLAKKKSSGSTVKK
        QT  P   P+     T    SK+EV    +K          +  SS+ +    S   T+KK
Subjt:  QTCDPMAKPRRSQPATKLNVSKKEVNPVDVKTPTGSRKSNAQSHSSVGLAKKKSSGSTVKK

AT3G60110.1 DNA-binding bromodomain-containing protein3.2e-1724.65Show/hide
Query:  WDTWQELLLGGAVLRHGHADWNLVAAEL--RSRIFLCTPEV-CKAKYEDLQKRF--------------------VGCKAWYEELRRQRIMELKQALEHSE
        W TW+EL+L  AV RH  +DW+ VA E+  RSR  L    V C+ KY+DL++RF                    VG  +W E+LR   + EL++ ++  +
Subjt:  WDTWQELLLGGAVLRHGHADWNLVAAEL--RSRIFLCTPEV-CKAKYEDLQKRF--------------------VGCKAWYEELRRQRIMELKQALEHSE

Query:  DSIGSLE---SKLEALKSG-SGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVR
        DSI SL+    KLE  K G  GDN                KP  +L  D            E +P  ++ E TE   +   ++S+      +S+ K    
Subjt:  DSIGSLE---SKLEALKSG-SGDNSLVNSSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVR

Query:  KKSRGKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTM---------------------DRVDVLMTAFNSVAD
                R D D+ VK   +  N   +  N +      E +     E   S + +E+  S+ +                     D+   L+     +  
Subjt:  KKSRGKRKRKDCDRDVKEGSSGENNLSESANPSTVSHSIENSYCDSFEPRESSDANEASRSSTM---------------------DRVDVLMTAFNSVAD

Query:  NKNASIFRRRLDSQKRGRYKKLIRQHMDIETIRSRV-ASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSE
        +   S+F  RL SQ    YK+LIRQH+D++TI  ++     +++    YRDL LL  NA+VF+  +S E  +A  LR  +++  +    K    VI    
Subjt:  NKNASIFRRRLDSQKRGRYKKLIRQHMDIETIRSRV-ASHCITTQKELYRDLLLLANNALVFYSPNSREHQSAVLLRDFITSTFEMLLSKNSTSVISHSE

Query:  RTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKTPTGSRKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK
         +      +      P  K + + K+ +P         +KS   S   +      +S  + ++    T K +  ++K           +K+  TK
Subjt:  RTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKTPTGSRKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKPPAATGVRGRKRGRTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCGGAGGCGATGGAAAAGAGGTGGGACACGTGGCAAGAACTATTACTAGGAGGAGCCGTACTCCGCCACGGACACGCCGACTGGAACCTCGTCGCGGCGGAGCT
CCGGTCCAGGATTTTTCTCTGCACACCCGAGGTTTGTAAAGCCAAATATGAAGATTTGCAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTCGGCGGCAAC
GAATTATGGAACTAAAACAAGCTCTAGAGCATTCTGAAGACTCAATAGGTTCATTGGAATCAAAGCTTGAAGCTCTCAAGTCTGGGAGTGGAGACAACTCTCTTGTCAAT
AGTTCTAGCAGATCAGAATCTTGTGAAGCTGTTCACAAACCAACAAATGAGCTGTCTGTTGATAGCTTCACACAGGAAAACAGGACGTGCAGTTCGATCGAATGTCAGCC
TGCTTCGTTATCAGCCGAAGAGACAGAGATAAAACCAGAAGCATCGCAGTCGCAGTCTCTCGAATGGGGCGGCGATGTATCGAGCATTTGGAAAGGAACAGTGAGGAAGA
AATCCAGAGGGAAGAGAAAAAGGAAGGATTGTGATAGGGATGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCTGAATCAGCTAACCCTTCAACTGTTTCTCATTCT
ATAGAAAACTCATACTGCGATTCGTTTGAGCCACGTGAATCGTCTGATGCAAATGAAGCTAGTAGAAGCTCGACCATGGATCGAGTTGATGTTCTTATGACTGCTTTTAA
CTCTGTTGCAGACAATAAAAATGCCTCCATATTTCGTCGCCGCCTTGATAGTCAGAAAAGAGGAAGATACAAGAAATTAATCCGCCAACACATGGATATTGAAACAATAA
GATCAAGAGTTGCAAGTCATTGCATAACGACGCAAAAGGAACTGTACAGAGATCTTCTGTTGCTTGCTAACAACGCACTCGTCTTCTACTCGCCGAATTCCCGCGAGCAT
CAGTCTGCAGTGCTTCTTAGAGACTTCATTACAAGTACATTTGAGATGCTTCTTTCTAAGAACTCTACCAGTGTGATATCCCACAGCGAGAGAACTCAAACCTGTGATCC
GATGGCAAAACCACGTCGTTCGCAGCCTGCTACGAAACTTAATGTATCTAAAAAAGAAGTCAATCCAGTAGATGTCAAGACTCCGACTGGAAGTAGAAAAAGTAATGCTC
AATCCCATTCCTCAGTGGGATTAGCAAAGAAAAAATCTTCGGGTTCTACGGTAAAGAAAGGCCCCGGTGGGACGAGAAAGGGTGTCGTGCAGATGTCGAAAAGTAAACCA
CCTGCAGCAACTGGTGTTAGAGGAAGAAAAAGAGGAAGAACAAAGTAA
mRNA sequenceShow/hide mRNA sequence
CTCAAAACCATTTTTTGTTTTACAACTTAAAAAAAAGAACCCTCCAACCAACCTATCTCCATTCTCACCCTTCCAAAACCGCTAATTCGCAGGCCGGAAAACCACTTTCC
TCCTTCAATACTACCTCTCCCTAAACAGCTTTTCGCTTTTCTCCGATAAGGGCTTTGGAGCTAGGGTTCCGGTGACAAATACGGAGCGAATTCGTCGGCGCGTGAGGCAG
AAATATGGGAGCGGAGGCGATGGAAAAGAGGTGGGACACGTGGCAAGAACTATTACTAGGAGGAGCCGTACTCCGCCACGGACACGCCGACTGGAACCTCGTCGCGGCGG
AGCTCCGGTCCAGGATTTTTCTCTGCACACCCGAGGTTTGTAAAGCCAAATATGAAGATTTGCAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTCGGCGG
CAACGAATTATGGAACTAAAACAAGCTCTAGAGCATTCTGAAGACTCAATAGGTTCATTGGAATCAAAGCTTGAAGCTCTCAAGTCTGGGAGTGGAGACAACTCTCTTGT
CAATAGTTCTAGCAGATCAGAATCTTGTGAAGCTGTTCACAAACCAACAAATGAGCTGTCTGTTGATAGCTTCACACAGGAAAACAGGACGTGCAGTTCGATCGAATGTC
AGCCTGCTTCGTTATCAGCCGAAGAGACAGAGATAAAACCAGAAGCATCGCAGTCGCAGTCTCTCGAATGGGGCGGCGATGTATCGAGCATTTGGAAAGGAACAGTGAGG
AAGAAATCCAGAGGGAAGAGAAAAAGGAAGGATTGTGATAGGGATGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCTGAATCAGCTAACCCTTCAACTGTTTCTCA
TTCTATAGAAAACTCATACTGCGATTCGTTTGAGCCACGTGAATCGTCTGATGCAAATGAAGCTAGTAGAAGCTCGACCATGGATCGAGTTGATGTTCTTATGACTGCTT
TTAACTCTGTTGCAGACAATAAAAATGCCTCCATATTTCGTCGCCGCCTTGATAGTCAGAAAAGAGGAAGATACAAGAAATTAATCCGCCAACACATGGATATTGAAACA
ATAAGATCAAGAGTTGCAAGTCATTGCATAACGACGCAAAAGGAACTGTACAGAGATCTTCTGTTGCTTGCTAACAACGCACTCGTCTTCTACTCGCCGAATTCCCGCGA
GCATCAGTCTGCAGTGCTTCTTAGAGACTTCATTACAAGTACATTTGAGATGCTTCTTTCTAAGAACTCTACCAGTGTGATATCCCACAGCGAGAGAACTCAAACCTGTG
ATCCGATGGCAAAACCACGTCGTTCGCAGCCTGCTACGAAACTTAATGTATCTAAAAAAGAAGTCAATCCAGTAGATGTCAAGACTCCGACTGGAAGTAGAAAAAGTAAT
GCTCAATCCCATTCCTCAGTGGGATTAGCAAAGAAAAAATCTTCGGGTTCTACGGTAAAGAAAGGCCCCGGTGGGACGAGAAAGGGTGTCGTGCAGATGTCGAAAAGTAA
ACCACCTGCAGCAACTGGTGTTAGAGGAAGAAAAAGAGGAAGAACAAAGTAA
Protein sequenceShow/hide protein sequence
MGAEAMEKRWDTWQELLLGGAVLRHGHADWNLVAAELRSRIFLCTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELKQALEHSEDSIGSLESKLEALKSGSGDNSLVN
SSSRSESCEAVHKPTNELSVDSFTQENRTCSSIECQPASLSAEETEIKPEASQSQSLEWGGDVSSIWKGTVRKKSRGKRKRKDCDRDVKEGSSGENNLSESANPSTVSHS
IENSYCDSFEPRESSDANEASRSSTMDRVDVLMTAFNSVADNKNASIFRRRLDSQKRGRYKKLIRQHMDIETIRSRVASHCITTQKELYRDLLLLANNALVFYSPNSREH
QSAVLLRDFITSTFEMLLSKNSTSVISHSERTQTCDPMAKPRRSQPATKLNVSKKEVNPVDVKTPTGSRKSNAQSHSSVGLAKKKSSGSTVKKGPGGTRKGVVQMSKSKP
PAATGVRGRKRGRTK