; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036064 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036064
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionBromo domain-containing protein
Genome locationscaffold5:42124396..42128440
RNA-Seq ExpressionSpg036064
SyntenySpg036064
Gene Ontology termsGO:0016573 - histone acetylation (biological process)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146636.1 uncharacterized protein LOC101217843 isoform X1 [Cucumis sativus]1.5e-19780.89Show/hide
Query:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA+   WDTW+ELLLGGA+LRHGTADWNLVA ELR+RI RPYACTPEVCKAKYEDL+KRFVGCKAWYEELRR+R+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANEPSAGSFTRENR-TCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSG DKSLVN S+RSESWGAVQKP NE SA SFT+ENR TCSS+ECQPAPLS +ETEIKP+  QSLE+GK SRIGKLG VLYE+QGG +RKR RGK
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANEPSAGSFTRENR-TCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHL
        RKRKDCNR+VKEGSSGENNLSES NPSTVS SKENSCCNSFEARE SDANEASRSS +DGV+VLMAAFN+VAE+KSAS+FRRRLDSQ+R RYKK+IRQHL
Subjt:  RKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHL

Query:  DIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVSQKQ
        DIETIRSRVASH ITT+ ELYRDLLLLANNA+VFYS NSREHQSAV LR LI+STF+K  K+SS+M AHN   N+RT+T DL+AKPRR QPAKRN SQ++
Subjt:  DIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVSQKQ

Query:  GNPGDVKTPSGNRRRRSN-ANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK
         NPGDVKTP GNRRR++N +N  SS+GLAKKET  S  KK PGGTRKA  GTSKS+RSATG+RGRKRG+TK
Subjt:  GNPGDVKTPSGNRRRRSN-ANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK

XP_008442126.1 PREDICTED: uncharacterized protein LOC103486076 isoform X1 [Cucumis melo]4.6e-19981.95Show/hide
Query:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA+   WDTW+ELLLGGA++RHGT DWNLVA ELR+RI RPY CTPEVCKAKYEDL+KRFVGCKAWYEELR++RIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANEPSAGSFTRENR-TCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSG DKSLVN S+RSESWGAVQKP NE SA SFT+ENR TCSS+ECQPAPL  EETEIKP+  QSLE GK  RIGKLG VLYE+QGG +RKR RGK
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANEPSAGSFTRENR-TCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHL
        RKRKDCNR+VKEGSSGENNLSES NPSTVS SKENSCCNSFEARESSDANEASRSST+DGV+VLMA FNSVAE+KSASVFRRRLDSQ+R RYKK+IRQHL
Subjt:  RKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHL

Query:  DIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVSQKQ
        DIETIRSRVASHYITT+KELYRDLLLLANNA+VFYS NSREHQSAV LR LI+STFQKL K+SS+M AHN   NQRT+T DL+AKPRR QPAKRN SQ++
Subjt:  DIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVSQKQ

Query:  GNPGDVKTPSGNRRRRSN-ANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK
         NPGDVKTP+GNRRRR+N +N  SS+GL+KKET  ST KK PGG RKA  GTSKS+RSATG+RGRKRGRTK
Subjt:  GNPGDVKTPSGNRRRRSN-ANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK

XP_022139813.1 uncharacterized protein LOC111010637 [Momordica charantia]1.2e-18377.18Show/hide
Query:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MG EAI+  WDTWEELLLGGAVLRHGT DWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRR+RIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQK-PANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKRRGKRK
        ALKSRSGDK +VNS SRSESWGAVQK  +NE SAGSFT+E RTCSS+EC+ APLS EE EIK +A     Q K+S I KL G+LY SQGGT+RKRRGKRK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQK-PANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKRRGKRK

Query:  RKDC----------NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTID--GVEVLMAAFNSVAENKSASVFRRRLDSQKRG
        RK+C          NRDVKEGS GENNLSES NP+TVS     SCCNSFE    SDANEA RSS +D  GV+VLMAAFNSVA++KSASVFRRRLDSQKRG
Subjt:  RKDC----------NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTID--GVEVLMAAFNSVAENKSASVFRRRLDSQKRG

Query:  RYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQ
        RYKK+IRQHLDIE IRSRV SHYITT KELYRDLLLLANNA+VFYS NSREHQSAV LRG+ITS F+KLFKNSS++  H NHH Q+T+  D + KPRR Q
Subjt:  RYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQ

Query:  PAKRNVSQKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETP--ASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK
        PAK NVSQK+GN  DVKT +G RRR + AN HSSVGL KKET   AST KKGPG TRKA VGTSKS+RSATG RGRKRGRTK
Subjt:  PAKRNVSQKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETP--ASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK

XP_022994396.1 uncharacterized protein LOC111490126 isoform X1 [Cucurbita maxima]4.8e-18879.58Show/hide
Query:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAI   WDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRI+ELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQ--SLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSGDKSLVNSS RSESWG V KP NE SAGSFT+ENRTCSSVEC+ AP   +ETEIKP+ASQ   LE GK+               GT++KR RGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQ--SLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDC--NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKII
        RKRKDC  +RDVKEGS+GENNLSES NPSTVSHSK+NSCCNSFE RESSDANEASRSST+DG  V+VLMAAFN+VAENKSA VFRRRLDSQKRGRYKK+I
Subjt:  RKRKDC--NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKII

Query:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNV
        RQHLDIETIRSRVASHYITTQKELYRDLLLLANNA+VFY PN+REH+SAV LR LITSTFQKLFKNS         H +RT+T D MAKP RLQPAKR  
Subjt:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNV

Query:  SQKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK
        S+K+ NPGD KTPSGNRRRRSNANSHSSVGLAK ET AST K+ P GTRK+ VGTSKS++S ATGVRGRKRGRTK
Subjt:  SQKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK

XP_023542669.1 uncharacterized protein LOC111802504 isoform X1 [Cucurbita pepo subsp. pepo]1.9e-18479.11Show/hide
Query:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAI   WDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQ--SLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSGDKSLVNSS RSESWG V KP NE SAGSFT+ENRTCSSVEC+ AP   +ETEIKP+ASQ   L+ GK+               GT +KR RGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQ--SLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDC-NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIR
        RKRKDC +RDVKEGS+GENNLSES NPSTVSHSK+NSCCNSFE RESSDANEASRSST+DG  V+VLMAAFN+VAENKSASVFRRRLDSQKRGRYKK+IR
Subjt:  RKRKDC-NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIR

Query:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVS
        QHLDIETIRSRVASHYITTQKELYRDLLLLANNA+VFY PN+RE++SAV LR LITSTFQKLFKNS         H++RT+T D +AKP RLQPAKRN S
Subjt:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVS

Query:  QKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK
        +K+ NPGD KTPSGN RRRSNANSHSSVGLAK ET AST K+ P GTRK+ VGT KS+RS AT  RGRKRGRTK
Subjt:  QKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK

TrEMBL top hitse value%identityAlignment
A0A0A0LV17 Bromo domain-containing protein7.2e-19880.89Show/hide
Query:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA+   WDTW+ELLLGGA+LRHGTADWNLVA ELR+RI RPYACTPEVCKAKYEDL+KRFVGCKAWYEELRR+R+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANEPSAGSFTRENR-TCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSG DKSLVN S+RSESWGAVQKP NE SA SFT+ENR TCSS+ECQPAPLS +ETEIKP+  QSLE+GK SRIGKLG VLYE+QGG +RKR RGK
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANEPSAGSFTRENR-TCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHL
        RKRKDCNR+VKEGSSGENNLSES NPSTVS SKENSCCNSFEARE SDANEASRSS +DGV+VLMAAFN+VAE+KSAS+FRRRLDSQ+R RYKK+IRQHL
Subjt:  RKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHL

Query:  DIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVSQKQ
        DIETIRSRVASH ITT+ ELYRDLLLLANNA+VFYS NSREHQSAV LR LI+STF+K  K+SS+M AHN   N+RT+T DL+AKPRR QPAKRN SQ++
Subjt:  DIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVSQKQ

Query:  GNPGDVKTPSGNRRRRSN-ANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK
         NPGDVKTP GNRRR++N +N  SS+GLAKKET  S  KK PGGTRKA  GTSKS+RSATG+RGRKRG+TK
Subjt:  GNPGDVKTPSGNRRRRSN-ANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK

A0A1S3B4Z1 uncharacterized protein LOC103486076 isoform X12.2e-19981.95Show/hide
Query:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA+   WDTW+ELLLGGA++RHGT DWNLVA ELR+RI RPY CTPEVCKAKYEDL+KRFVGCKAWYEELR++RIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANEPSAGSFTRENR-TCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSG DKSLVN S+RSESWGAVQKP NE SA SFT+ENR TCSS+ECQPAPL  EETEIKP+  QSLE GK  RIGKLG VLYE+QGG +RKR RGK
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANEPSAGSFTRENR-TCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHL
        RKRKDCNR+VKEGSSGENNLSES NPSTVS SKENSCCNSFEARESSDANEASRSST+DGV+VLMA FNSVAE+KSASVFRRRLDSQ+R RYKK+IRQHL
Subjt:  RKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHL

Query:  DIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVSQKQ
        DIETIRSRVASHYITT+KELYRDLLLLANNA+VFYS NSREHQSAV LR LI+STFQKL K+SS+M AHN   NQRT+T DL+AKPRR QPAKRN SQ++
Subjt:  DIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVSQKQ

Query:  GNPGDVKTPSGNRRRRSN-ANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK
         NPGDVKTP+GNRRRR+N +N  SS+GL+KKET  ST KK PGG RKA  GTSKS+RSATG+RGRKRGRTK
Subjt:  GNPGDVKTPSGNRRRRSN-ANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK

A0A6J1CGL2 uncharacterized protein LOC1110106375.9e-18477.18Show/hide
Query:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MG EAI+  WDTWEELLLGGAVLRHGT DWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRR+RIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQK-PANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKRRGKRK
        ALKSRSGDK +VNS SRSESWGAVQK  +NE SAGSFT+E RTCSS+EC+ APLS EE EIK +A     Q K+S I KL G+LY SQGGT+RKRRGKRK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQK-PANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKRRGKRK

Query:  RKDC----------NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTID--GVEVLMAAFNSVAENKSASVFRRRLDSQKRG
        RK+C          NRDVKEGS GENNLSES NP+TVS     SCCNSFE    SDANEA RSS +D  GV+VLMAAFNSVA++KSASVFRRRLDSQKRG
Subjt:  RKDC----------NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTID--GVEVLMAAFNSVAENKSASVFRRRLDSQKRG

Query:  RYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQ
        RYKK+IRQHLDIE IRSRV SHYITT KELYRDLLLLANNA+VFYS NSREHQSAV LRG+ITS F+KLFKNSS++  H NHH Q+T+  D + KPRR Q
Subjt:  RYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQ

Query:  PAKRNVSQKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETP--ASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK
        PAK NVSQK+GN  DVKT +G RRR + AN HSSVGL KKET   AST KKGPG TRKA VGTSKS+RSATG RGRKRGRTK
Subjt:  PAKRNVSQKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETP--ASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK

A0A6J1GT05 uncharacterized protein LOC111456852 isoform X11.7e-18378.69Show/hide
Query:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAI   WDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQ--SLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSGDKSLVNSS RSESWG V KP NE SAGSFT+ENRTCSSVEC+ AP   +ETEIKP+ASQ   LE GK+               GT++KR RGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQ--SLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDC-NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIR
        RKRKDC +RDVKEGS+GENNLSES NPSTVSHSK+NSCCNSFE RESSDANEASRSST+DG  V+VLMAAFN+VAENKSA+VFRRRLDSQKRGRYKK+IR
Subjt:  RKRKDC-NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIR

Query:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVS
        QHLDIETIRSRVAS YITTQKELYRDLLLLANNA+VFY PN+RE++SAV LR LIT+TFQKLFKNS         H++RT+T D MAK  RLQPAKRN S
Subjt:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVS

Query:  QKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK
        +K+ NPGD KTPSGN RRRSNANSHSSVGLAK ET AST K+ P GTRK+ VGTSKS+RS AT  RGRKRGR K
Subjt:  QKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X12.3e-18879.58Show/hide
Query:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAI   WDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRI+ELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQ--SLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSGDKSLVNSS RSESWG V KP NE SAGSFT+ENRTCSSVEC+ AP   +ETEIKP+ASQ   LE GK+               GT++KR RGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQ--SLEQGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDC--NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKII
        RKRKDC  +RDVKEGS+GENNLSES NPSTVSHSK+NSCCNSFE RESSDANEASRSST+DG  V+VLMAAFN+VAENKSA VFRRRLDSQKRGRYKK+I
Subjt:  RKRKDC--NRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKII

Query:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNV
        RQHLDIETIRSRVASHYITTQKELYRDLLLLANNA+VFY PN+REH+SAV LR LITSTFQKLFKNS         H +RT+T D MAKP RLQPAKR  
Subjt:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNV

Query:  SQKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK
        S+K+ NPGD KTPSGNRRRRSNANSHSSVGLAK ET AST K+ P GTRK+ VGTSKS++S ATGVRGRKRGRTK
Subjt:  SQKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G61215.1 bromodomain 42.5e-6537.62Show/hide
Query:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M    +++ W TWEELLLGGAVLRHGT DW +VA ELR+  + P   TPE+CKAKY+DL+KR+VGCKAW+EEL+++R+ EL+ AL  SEDSIGSLESKL+
Subjt:  MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKPANE--------------PSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYES
        +LKS S D+   N+   S +      P +E               S GSFT++  T ++             E K +A   +EQ K   +  L   ++ES
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKPANE--------------PSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYES

Query:  QGG-------TLRKRRGKRKRKDCN----RDVKEGSSGENN--LSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKS
          G       ++RK+RGKRKRKDC+    ++V E S+ E +     S + +++  SKE           +S ++  SR  ++   + LM  +N++A+N+ 
Subjt:  QGG-------TLRKRRGKRKRKDCN----RDVKEGSSGENN--LSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKS

Query:  ASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQR
        A VFRRRLDSQKRGRYKK++R+H+D++T++SR+    I++ KEL+RD LL+ANNA +FYS N+RE++SAV LR ++T + +           H  H +  
Subjt:  ASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQR

Query:  T--RTSDLMAKPRRLQPAKR-NVSQKQGNPG--DVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSAT------GVRGRK
        T   T  ++   +   P+ R +++ K+   G   +KT   +  + S+  +  SV     + P S  K    G +  AV   K  R A        + GRK
Subjt:  T--RTSDLMAKPRRLQPAKR-NVSQKQGNPG--DVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSAT------GVRGRK

Query:  RGRTK
        R R +
Subjt:  RGRTK

AT2G42150.1 DNA-binding bromodomain-containing protein3.7e-2128.17Show/hide
Query:  SWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSLE
        +W TWEELLL  AV RHGT  WN V+AE++       + T   C+ KY DL+ RF            +    W EELR+ R+ ELR+ +E  + SI +L+
Subjt:  SWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSLE

Query:  SKLEALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKRRG
        SK++ L+    + S +   + +E+    +K            + R+ S       P+ +    I PD  + +      R  ++ G    S GG  +    
Subjt:  SKLEALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKRRG

Query:  KRKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANE--------------------ASRSSTIDGVEVLMAAFNSVAENKSASV
           R  C    KE ++     SE V P +V+   E+    S     +SD                       +++  T++  + L++    +  +   S 
Subjt:  KRKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANE--------------------ASRSSTIDGVEVLMAAFNSVAENKSASV

Query:  FRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSS
        F RRL+ Q+   Y  IIR+H+D E IR RV    Y + +   +RDLLLL NNA VFY   S E + A  L  L+        K  S+
Subjt:  FRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSS

AT2G44430.1 DNA-binding bromodomain-containing protein1.2e-1926.92Show/hide
Query:  SWDTWEELLLGGAVLRHGTADWNLVAAELRAR-IVRPYACTPEVCKAKYEDLQKRF---------------------VGCK-AWYEELRRQRIMELRQAL
        +W TWEELLL  AV RHG  DW+ VA E+R+R  +     +   C+ KY DL++RF                     VG    W E+LR  R+ ELR+ +
Subjt:  SWDTWEELLLGGAVLRHGTADWNLVAAELRAR-IVRPYACTPEVCKAKYEDLQKRF---------------------VGCK-AWYEELRRQRIMELRQAL

Query:  EHSEDSIGSLESKLEALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYE
        E  + SI SL+ K++ L+               E     +KP  E        EN    S   + A  + EE++ + + S + E    +  G+   V  +
Subjt:  EHSEDSIGSLESKLEALKSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYE

Query:  SQGGTLRKRRGKRKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVE----VLMAAFNSVAENKSASVFRRRL
            T     G  K  D +   K+ ++ E         S  SHS E     + E++      +   +  I   E     L++  + +  +   S+F RRL
Subjt:  SQGGTLRKRRGKRKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVE----VLMAAFNSVAENKSASVFRRRL

Query:  DSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLM
         SQ+   YK +++QHLDIETI+ ++    Y ++    YRDL LL  NAIVF+  +S E  +A  LR +++   +K    +            R+  +D  
Subjt:  DSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLM

Query:  AKPRRLQPAKRNVSQKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKR
             L        QK   P  V      ++RRS         ++ K +P+S++      T++  +   K D  ATGVR  +R
Subjt:  AKPRRLQPAKRNVSQKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKR

AT3G57980.1 DNA-binding bromodomain-containing protein1.2e-1928.24Show/hide
Query:  EELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSL
        EELLL  AV RHGT  W+ VA+E+  +       T   C+ KY DL++RF                  +    W EELR+ R+ ELR+ +E  + SI SL
Subjt:  EELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSL

Query:  ESKLEALKSRSGDKSLVNSSS-----------RSESWGAVQKPA-----------NEPSAGS--FTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQG
        + K++ L+    +KSL   +S            +ES      P            N P  GS    R  +    V+ +P  +  E+ + KP A +   +G
Subjt:  ESKLEALKSRSGDKSLVNSSS-----------RSESWGAVQKPA-----------NEPSAGS--FTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQG

Query:  KLSRIGKLGGVLYESQGGTLRKRRGKRKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANE---ASRSSTIDGVEVLMAAFNSV
            + K                  +  R +  R+  +      ++ ES        + +     SF  +E+ D ++     +S T++ + V     +  
Subjt:  KLSRIGKLGGVLYESQGGTLRKRRGKRKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANE---ASRSSTIDGVEVLMAAFNSV

Query:  AE----NKSASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLI
         E    +   S F RRL++Q+   Y +IIRQH+D E IRSRV   +Y T + + +RDLLLL NN  VFY   S E  +A  L  LI
Subjt:  AE----NKSASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLI

AT3G60110.1 DNA-binding bromodomain-containing protein4.5e-1924.39Show/hide
Query:  IDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF--------------------VGCKAWYEELRRQRIMELRQAL
        I   W TWEEL+L  AV RH  +DW+ VA E++AR       +   C+ KY+DL++RF                    VG  +W E+LR   + ELR+ +
Subjt:  IDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF--------------------VGCKAWYEELRRQRIMELRQAL

Query:  EHSEDSIGSLESKLEAL-KSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLY
        +  +DSI SL+ K++ L + + GD                    N+P             + E +P  ++ E TE   D ++S+                
Subjt:  EHSEDSIGSLESKLEAL-KSRSGDKSLVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLY

Query:  ESQGGTLRKRRGKRKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTI---------------------DGVEVLMA
        ES       +     R D ++ VK   +  N   + VN +     +E +     E   S + +E+  S+ +                     D  + L+ 
Subjt:  ESQGGTLRKRRGKRKRKDCNRDVKEGSSGENNLSESVNPSTVSHSKENSCCNSFEARESSDANEASRSSTI---------------------DGVEVLMA

Query:  AFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQ----KLFK
            +  +   SVF  RL SQ    YK++IRQHLD++TI  ++    Y+++    YRDL LL  NAIVF+  +S E  +A  LR L+++  +    KL  
Subjt:  AFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQ----KLFK

Query:  NSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVSQKQGNPGDVK-----------TPSGNRRRRSNANSHSSVGLAKKETPASTAK
              A ++   Q++    L+   ++    K+          D K           T +     RS+  +   + +  K+T    AK
Subjt:  NSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVSQKQGNPGDVK-----------TPSGNRRRRSNANSHSSVGLAKKETPASTAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCGGAGGCCATAGACAACAGCTGGGACACCTGGGAAGAGCTTCTATTGGGAGGCGCCGTACTCCGCCACGGCACCGCCGACTGGAACCTCGTCGCGGCCGAGCT
CCGGGCCAGGATTGTTCGTCCGTACGCCTGTACGCCCGAGGTTTGTAAGGCCAAATATGAAGACTTACAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GGCGGCAACGAATCATGGAACTAAGACAAGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTGGAAGCTCTTAAGTCTAGAAGTGGAGACAAGTCT
CTTGTCAATAGCTCTAGCAGATCAGAATCTTGGGGAGCTGTTCAGAAGCCAGCAAATGAGCCATCTGCCGGTAGCTTCACACGGGAAAACAGGACGTGCAGTTCGGTCGA
ATGTCAGCCAGCTCCGTTGTCGGTTGAAGAGACGGAGATTAAGCCAGATGCCTCGCAGTCTCTCGAACAGGGAAAGTTGTCGAGGATTGGGAAGTTGGGTGGGGTATTAT
ATGAAAGCCAAGGAGGAACATTGAGGAAGAGAAGAGGGAAGAGAAAGAGGAAGGATTGTAATAGGGATGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCCGAATCA
GTTAACCCTTCAACTGTTTCTCATTCTAAAGAAAACTCATGCTGCAACTCGTTTGAGGCACGTGAATCTTCTGATGCAAATGAAGCTAGCAGAAGCTCAACTATTGATGG
AGTTGAAGTTCTAATGGCTGCTTTTAACTCTGTTGCAGAGAATAAAAGTGCCTCTGTATTTCGTCGTCGCCTTGATAGTCAGAAGAGAGGAAGATACAAGAAAATAATCC
GTCAACACTTGGATATTGAAACAATAAGGTCAAGAGTTGCAAGTCATTACATAACGACGCAAAAGGAGCTGTACAGAGATCTGCTGTTGCTTGCTAACAATGCTATCGTC
TTCTACTCGCCGAATTCCCGGGAGCATCAGTCTGCAGTGCATCTCAGAGGCCTCATTACAAGTACATTTCAGAAGCTTTTTAAGAACTCTAGCAGTATGGCAGCCCACAA
CAACCACCACAACCAAAGAACACGAACCTCTGATCTGATGGCGAAACCGCGTCGTTTGCAGCCTGCTAAGCGTAATGTATCGCAAAAGCAAGGCAATCCAGGAGATGTCA
AAACTCCAAGTGGAAATAGAAGAAGAAGAAGTAATGCTAATTCTCATTCCTCAGTGGGATTAGCAAAGAAAGAAACTCCAGCCTCTACAGCAAAGAAAGGCCCTGGTGGG
ACGAGGAAGGCCGCGGTTGGGACGTCGAAAAGCGATCGATCTGCAACTGGCGTTAGGGGAAGAAAAAGAGGGAGAACAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGCGGAGGCCATAGACAACAGCTGGGACACCTGGGAAGAGCTTCTATTGGGAGGCGCCGTACTCCGCCACGGCACCGCCGACTGGAACCTCGTCGCGGCCGAGCT
CCGGGCCAGGATTGTTCGTCCGTACGCCTGTACGCCCGAGGTTTGTAAGGCCAAATATGAAGACTTACAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GGCGGCAACGAATCATGGAACTAAGACAAGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTGGAAGCTCTTAAGTCTAGAAGTGGAGACAAGTCT
CTTGTCAATAGCTCTAGCAGATCAGAATCTTGGGGAGCTGTTCAGAAGCCAGCAAATGAGCCATCTGCCGGTAGCTTCACACGGGAAAACAGGACGTGCAGTTCGGTCGA
ATGTCAGCCAGCTCCGTTGTCGGTTGAAGAGACGGAGATTAAGCCAGATGCCTCGCAGTCTCTCGAACAGGGAAAGTTGTCGAGGATTGGGAAGTTGGGTGGGGTATTAT
ATGAAAGCCAAGGAGGAACATTGAGGAAGAGAAGAGGGAAGAGAAAGAGGAAGGATTGTAATAGGGATGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCCGAATCA
GTTAACCCTTCAACTGTTTCTCATTCTAAAGAAAACTCATGCTGCAACTCGTTTGAGGCACGTGAATCTTCTGATGCAAATGAAGCTAGCAGAAGCTCAACTATTGATGG
AGTTGAAGTTCTAATGGCTGCTTTTAACTCTGTTGCAGAGAATAAAAGTGCCTCTGTATTTCGTCGTCGCCTTGATAGTCAGAAGAGAGGAAGATACAAGAAAATAATCC
GTCAACACTTGGATATTGAAACAATAAGGTCAAGAGTTGCAAGTCATTACATAACGACGCAAAAGGAGCTGTACAGAGATCTGCTGTTGCTTGCTAACAATGCTATCGTC
TTCTACTCGCCGAATTCCCGGGAGCATCAGTCTGCAGTGCATCTCAGAGGCCTCATTACAAGTACATTTCAGAAGCTTTTTAAGAACTCTAGCAGTATGGCAGCCCACAA
CAACCACCACAACCAAAGAACACGAACCTCTGATCTGATGGCGAAACCGCGTCGTTTGCAGCCTGCTAAGCGTAATGTATCGCAAAAGCAAGGCAATCCAGGAGATGTCA
AAACTCCAAGTGGAAATAGAAGAAGAAGAAGTAATGCTAATTCTCATTCCTCAGTGGGATTAGCAAAGAAAGAAACTCCAGCCTCTACAGCAAAGAAAGGCCCTGGTGGG
ACGAGGAAGGCCGCGGTTGGGACGTCGAAAAGCGATCGATCTGCAACTGGCGTTAGGGGAAGAAAAAGAGGGAGAACAAAGTAA
Protein sequenceShow/hide protein sequence
MGAEAIDNSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLEALKSRSGDKS
LVNSSSRSESWGAVQKPANEPSAGSFTRENRTCSSVECQPAPLSVEETEIKPDASQSLEQGKLSRIGKLGGVLYESQGGTLRKRRGKRKRKDCNRDVKEGSSGENNLSES
VNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIV
FYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNNHHNQRTRTSDLMAKPRRLQPAKRNVSQKQGNPGDVKTPSGNRRRRSNANSHSSVGLAKKETPASTAKKGPGG
TRKAAVGTSKSDRSATGVRGRKRGRTK