CuGenDBv2

Lag0024861 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024861
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Bromo domain-containing protein
Genome location: chr10:6469623..6473133
RNA-Seq Expression: Lag0024861
Synteny: Lag0024861
Gene Ontology terms:
GO:0016573 - histone acetylation (biological process)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR036427 - Bromodomain-like superfamily
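The genome location recorded for this gene, chr10:6469623..6473133, can be split into its parts programmatically. The following is a minimal sketch only; the `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'chrom:start..end' interval (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr10:6469623..6473133'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(match["chrom"], start, end)


loc = parse_location("chr10:6469623..6473133")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval spans 3511 positions.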


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146636.1 uncharacterized protein LOC101217843 isoform X1 [Cucumis sativus] (e value: 1.2e-196, % identity: 81.1)
Query:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA+   WDTW+ELLLGGA+LRHGTADWNLVA ELR+RI RPYACTPEVCKAKYEDL+KRFVGCKAWYEELRR+R+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANELSAGSFTRENR-TCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSG DKSLVN S+RSESWGAVQKP NELSA SFT+ENR TCSS+ECQPAPLS +ETEIKPE  QSLERGK SRIGKLG VLYE+QGG +RKR RGK
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANELSAGSFTRENR-TCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQH
        RKRKDCNR +VKEGSSGENNLSE  NPSTVS SKENSCCNSFEARE SDANEASRSS +DGV+VLMAAFN+VAE+KSAS+FRRRLDSQ+R RYKK+IRQH
Subjt:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQH

Query:  LDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAH---NQRTRTSDPMAKPRRSQPAKRNVSQKQ
        LDIETIRSRVASH ITT+ ELYRDLLLLANNA+VFYS NSREHQSAV LR LI+STF+K  K+SS+M AH   N+RT+T D +AKPRRSQPAKRN SQ++
Subjt:  LDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAH---NQRTRTSDPMAKPRRSQPAKRNVSQKQ

Query:  GNPGDVKTPSGNRRRRSN-ANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK
         NPGDVKTP GNRRR++N +N  SS+GLAKKET  S  KK PGGTRKA  GTSKS+RSATG+RGRKRG+TK
Subjt:  GNPGDVKTPSGNRRRRSN-ANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK

XP_008442126.1 PREDICTED: uncharacterized protein LOC103486076 isoform X1 [Cucumis melo] (e value: 5.6e-197, % identity: 81.95)
Query:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA+ K WDTW+ELLLGGA++RHGT DWNLVA ELR+RI RPY CTPEVCKAKYEDL+KRFVGCKAWYEELR++RIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANELSAGSFTRENR-TCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSG DKSLVN S+RSESWGAVQKP NE SA SFT+ENR TCSS+ECQPAPL  EETEIKPE  QSLE GK  RIGKLG VLYE+QGG +RKR RGK
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANELSAGSFTRENR-TCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQH
        RKRKDCNR +VKEGSSGENNLSE  NPSTVS SKENSCCNSFEARESSDANEASRSST+DGV+VLMA FNSVAE+KSASVFRRRLDSQ+R RYKK+IRQH
Subjt:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQH

Query:  LDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAH---NQRTRTSDPMAKPRRSQPAKRNVSQKQ
        LDIETIRSRVASHYITT+KELYRDLLLLANNA+VFYS NSREHQSAV LR LI+STFQKL K+SS+M AH   NQRT+T D +AKPRRSQPAKRN SQ++
Subjt:  LDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAH---NQRTRTSDPMAKPRRSQPAKRNVSQKQ

Query:  GNPGDVKTPSGNRRRRSN-ANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK
         NPGDVKTP+GNRRRR+N +N  SS+GL+KKET  ST KK PGG RKA  GTSKS+RSATG+RGRKRGRTK
Subjt:  GNPGDVKTPSGNRRRRSN-ANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK

XP_022954655.1 uncharacterized protein LOC111456852 isoform X1 [Cucurbita moschata] (e value: 2.1e-183, % identity: 79.15)
Query:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAI K WDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQ--SLERGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSGDKSLVNSS RSESWG V KP NELSAGSFT+ENRTCSSVEC+ AP   +ETEIKPEASQ   LE GK+               GT++KR RGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQ--SLERGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIR
        RKRKDC+  DVKEGS+GENNLSE  NPSTVSHSK+NSCCNSFE RESSDANEASRSST+DG  V+VLMAAFN+VAENKSA+VFRRRLDSQKRGRYKK+IR
Subjt:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIR

Query:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTRTSDPMAKPRRSQPAKRNVSQKQG
        QHLDIETIRSRVAS YITTQKELYRDLLLLANNA+VFY PN+RE++SAV LR LIT+TFQKLFKNS     H++RT+T D MAK  R QPAKRN S+K+ 
Subjt:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTRTSDPMAKPRRSQPAKRNVSQKQG

Query:  NPGDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK
        NPGD KTPSGN RRRSNANS SSVGLAK ET AST K+ P GTRK+ VGTSKS+RS AT  RGRKRGR K
Subjt:  NPGDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK

XP_022994396.1 uncharacterized protein LOC111490126 isoform X1 [Cucurbita maxima] (e value: 3.1e-187, % identity: 80.04)
Query:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAI K WDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRI+ELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQ--SLERGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSGDKSLVNSS RSESWG V KP NELSAGSFT+ENRTCSSVEC+ AP   +ETEIKPEASQ   LE GK+               GT++KR RGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQ--SLERGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRG-DVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKII
        RKRKDC+   DVKEGS+GENNLSE  NPSTVSHSK+NSCCNSFE RESSDANEASRSST+DG  V+VLMAAFN+VAENKSA VFRRRLDSQKRGRYKK+I
Subjt:  RKRKDCNRG-DVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKII

Query:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTRTSDPMAKPRRSQPAKRNVSQKQ
        RQHLDIETIRSRVASHYITTQKELYRDLLLLANNA+VFY PN+REH+SAV LR LITSTFQKLFKNS     H +RT+T D MAKP R QPAKR  S+K+
Subjt:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTRTSDPMAKPRRSQPAKRNVSQKQ

Query:  GNPGDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK
         NPGD KTPSGNRRRRSNANS SSVGLAK ET AST K+ P GTRK+ VGTSKS++S ATGVRGRKRGRTK
Subjt:  GNPGDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK

XP_023542669.1 uncharacterized protein LOC111802504 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.1e-184, % identity: 80.13)
Query:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAI K WDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKR-RGKRK
        ALKSRSGDKSLVNSS RSESWG V KP NELSAGSFT+ENRTCSSVEC+ AP   +ETEIKPEASQ        R  K G V      GT +KR RGKRK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKR-RGKRK

Query:  RKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQH
        RKDC+  DVKEGS+GENNLSE  NPSTVSHSK+NSCCNSFE RESSDANEASRSST+DG  V+VLMAAFN+VAENKSASVFRRRLDSQKRGRYKK+IRQH
Subjt:  RKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQH

Query:  LDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTRTSDPMAKPRRSQPAKRNVSQKQGNP
        LDIETIRSRVASHYITTQKELYRDLLLLANNA+VFY PN+RE++SAV LR LITSTFQKLFKNS     H++RT+T D +AKP R QPAKRN S+K+ NP
Subjt:  LDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTRTSDPMAKPRRSQPAKRNVSQKQGNP

Query:  GDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK
        GD KTPSGN RRRSNANS SSVGLAK ET AST K+ P GTRK+ VGT KS+RS AT  RGRKRGRTK
Subjt:  GDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LV17 Bromo domain-containing protein (e value: 6.1e-197, % identity: 81.1)
Query:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA+   WDTW+ELLLGGA+LRHGTADWNLVA ELR+RI RPYACTPEVCKAKYEDL+KRFVGCKAWYEELRR+R+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANELSAGSFTRENR-TCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSG DKSLVN S+RSESWGAVQKP NELSA SFT+ENR TCSS+ECQPAPLS +ETEIKPE  QSLERGK SRIGKLG VLYE+QGG +RKR RGK
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANELSAGSFTRENR-TCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQH
        RKRKDCNR +VKEGSSGENNLSE  NPSTVS SKENSCCNSFEARE SDANEASRSS +DGV+VLMAAFN+VAE+KSAS+FRRRLDSQ+R RYKK+IRQH
Subjt:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQH

Query:  LDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAH---NQRTRTSDPMAKPRRSQPAKRNVSQKQ
        LDIETIRSRVASH ITT+ ELYRDLLLLANNA+VFYS NSREHQSAV LR LI+STF+K  K+SS+M AH   N+RT+T D +AKPRRSQPAKRN SQ++
Subjt:  LDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAH---NQRTRTSDPMAKPRRSQPAKRNVSQKQ

Query:  GNPGDVKTPSGNRRRRSN-ANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK
         NPGDVKTP GNRRR++N +N  SS+GLAKKET  S  KK PGGTRKA  GTSKS+RSATG+RGRKRG+TK
Subjt:  GNPGDVKTPSGNRRRRSN-ANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK

A0A1S3B4Z1 uncharacterized protein LOC103486076 isoform X1 (e value: 2.7e-197, % identity: 81.95)
Query:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEA+ K WDTW+ELLLGGA++RHGT DWNLVA ELR+RI RPY CTPEVCKAKYEDL+KRFVGCKAWYEELR++RIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANELSAGSFTRENR-TCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSG DKSLVN S+RSESWGAVQKP NE SA SFT+ENR TCSS+ECQPAPL  EETEIKPE  QSLE GK  RIGKLG VLYE+QGG +RKR RGK
Subjt:  ALKSRSG-DKSLVNSSSRSESWGAVQKPANELSAGSFTRENR-TCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQH
        RKRKDCNR +VKEGSSGENNLSE  NPSTVS SKENSCCNSFEARESSDANEASRSST+DGV+VLMA FNSVAE+KSASVFRRRLDSQ+R RYKK+IRQH
Subjt:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQH

Query:  LDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAH---NQRTRTSDPMAKPRRSQPAKRNVSQKQ
        LDIETIRSRVASHYITT+KELYRDLLLLANNA+VFYS NSREHQSAV LR LI+STFQKL K+SS+M AH   NQRT+T D +AKPRRSQPAKRN SQ++
Subjt:  LDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAH---NQRTRTSDPMAKPRRSQPAKRNVSQKQ

Query:  GNPGDVKTPSGNRRRRSN-ANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK
         NPGDVKTP+GNRRRR+N +N  SS+GL+KKET  ST KK PGG RKA  GTSKS+RSATG+RGRKRGRTK
Subjt:  GNPGDVKTPSGNRRRRSN-ANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK

A0A6J1CGL2 uncharacterized protein LOC111010637 (e value: 2.5e-182, % identity: 76.92)
Query:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MG EAI++ WDTWEELLLGGAVLRHGT DWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRR+RIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQK-PANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKRRGKRK
        ALKSRSGDK +VNS SRSESWGAVQK  +NELSAGSFT+E RTCSS+EC+ APLS EE EIK EA     + K+S I KL G+LY SQGGT+RKRRGKRK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQK-PANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKRRGKRK

Query:  RKDCNRG---------DVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTID--GVEVLMAAFNSVAENKSASVFRRRLDSQKRG
        RK+CN           DVKEGS GENNLSE  NP+TVS     SCCNSFE    SDANEA RSS +D  GV+VLMAAFNSVA++KSASVFRRRLDSQKRG
Subjt:  RKDCNRG---------DVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTID--GVEVLMAAFNSVAENKSASVFRRRLDSQKRG

Query:  RYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHN---QRTRTSDPMAKPRRSQP
        RYKK+IRQHLDIE IRSRV SHYITT KELYRDLLLLANNA+VFYS NSREHQSAV LRG+ITS F+KLFKNSS++  HN   Q+T+  DP+ KPRRSQP
Subjt:  RYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHN---QRTRTSDPMAKPRRSQP

Query:  AKRNVSQKQGNPGDVKTPSGNRRRRSNANSQSSVGLAKKETP--ASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK
        AK NVSQK+GN  DVKT +G RRR + AN  SSVGL KKET   AST KKGPG TRKA VGTSKS+RSATG RGRKRGRTK
Subjt:  AKRNVSQKQGNPGDVKTPSGNRRRRSNANSQSSVGLAKKETP--ASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKRGRTK

A0A6J1GT05 uncharacterized protein LOC111456852 isoform X1 (e value: 1.0e-183, % identity: 79.15)
Query:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAI K WDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQ--SLERGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSGDKSLVNSS RSESWG V KP NELSAGSFT+ENRTCSSVEC+ AP   +ETEIKPEASQ   LE GK+               GT++KR RGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQ--SLERGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIR
        RKRKDC+  DVKEGS+GENNLSE  NPSTVSHSK+NSCCNSFE RESSDANEASRSST+DG  V+VLMAAFN+VAENKSA+VFRRRLDSQKRGRYKK+IR
Subjt:  RKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIR

Query:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTRTSDPMAKPRRSQPAKRNVSQKQG
        QHLDIETIRSRVAS YITTQKELYRDLLLLANNA+VFY PN+RE++SAV LR LIT+TFQKLFKNS     H++RT+T D MAK  R QPAKRN S+K+ 
Subjt:  QHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTRTSDPMAKPRRSQPAKRNVSQKQG

Query:  NPGDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK
        NPGD KTPSGN RRRSNANS SSVGLAK ET AST K+ P GTRK+ VGTSKS+RS AT  RGRKRGR K
Subjt:  NPGDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X1 (e value: 1.5e-187, % identity: 80.04)
Query:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGAEAI K WDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRRQRI+ELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQ--SLERGKLSRIGKLGGVLYESQGGTLRKR-RGK
        ALKSRSGDKSLVNSS RSESWG V KP NELSAGSFT+ENRTCSSVEC+ AP   +ETEIKPEASQ   LE GK+               GT++KR RGK
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQ--SLERGKLSRIGKLGGVLYESQGGTLRKR-RGK

Query:  RKRKDCNRG-DVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKII
        RKRKDC+   DVKEGS+GENNLSE  NPSTVSHSK+NSCCNSFE RESSDANEASRSST+DG  V+VLMAAFN+VAENKSA VFRRRLDSQKRGRYKK+I
Subjt:  RKRKDCNRG-DVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDG--VEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKII

Query:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTRTSDPMAKPRRSQPAKRNVSQKQ
        RQHLDIETIRSRVASHYITTQKELYRDLLLLANNA+VFY PN+REH+SAV LR LITSTFQKLFKNS     H +RT+T D MAKP R QPAKR  S+K+
Subjt:  RQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTRTSDPMAKPRRSQPAKRNVSQKQ

Query:  GNPGDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK
         NPGD KTPSGNRRRRSNANS SSVGLAK ET AST K+ P GTRK+ VGTSKS++S ATGVRGRKRGRTK
Subjt:  GNPGDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRS-ATGVRGRKRGRTK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G61215.1 bromodomain 4 (e value: 1.0e-63, % identity: 37.18)
Query:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M    ++  W TWEELLLGGAVLRHGT DW +VA ELR+  + P   TPE+CKAKY+DL+KR+VGCKAW+EEL+++R+ EL+ AL  SEDSIGSLESKL+
Subjt:  MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSRSESWGAVQKPANE--------------LSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYES
        +LKS S D+   N+   S +      P +E               S GSFT++  T ++             E K EA   +E+ K   +  L   ++ES
Subjt:  ALKSRSGDKSLVNSSSRSESWGAVQKPANE--------------LSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYES

Query:  QGG-------TLRKRRGKRKRKDCNRG---DVKEGSSGENN--LSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKS
          G       ++RK+RGKRKRKDC+     +V E S+ E +       + +++  SKE           +S ++  SR  ++   + LM  +N++A+N+ 
Subjt:  QGG-------TLRKRRGKRKRKDCNRG---DVKEGSSGENN--LSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKS

Query:  ASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLF-----KNSSSMAAHNQ
        A VFRRRLDSQKRGRYKK++R+H+D++T++SR+    I++ KEL+RD LL+ANNA +FYS N+RE++SAV LR ++T + +         + SS+ A + 
Subjt:  ASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLF-----KNSSSMAAHNQ

Query:  RTRTSDPMAKPRRSQPAKRNVSQKQGNPG--DVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSAT------GVRGRKRG
        +      + +   S   + +++ K+   G   +KT   +  + S+  ++ SV     + P S  K    G +  AV   K  R A        + GRKR 
Subjt:  RTRTSDPMAKPRRSQPAKRNVSQKQGNPG--DVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSAT------GVRGRKRG

Query:  RTK
        R +
Subjt:  RTK

AT2G42150.1 DNA-binding bromodomain-containing protein (e value: 4.8e-21, % identity: 28.01)
Query:  KSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSL
        ++W TWEELLL  AV RHGT  WN V+AE++       + T   C+ KY DL+ RF            +    W EELR+ R+ ELR+ +E  + SI +L
Subjt:  KSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSL

Query:  ESKLEALK-----------SRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPE-ASQSLERGKLSRIGKLGGVL
        +SK++ L+           + + +  L     RS+S   V  P  +L        N T S     P  +  E TE + E A       KL+      G  
Subjt:  ESKLEALK-----------SRSGDKSLVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPE-ASQSLERGKLSRIGKLGGVL

Query:  YESQGGTLRKRRGKRKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLD
           +                   + ++G+S    ++  V  S     K  S     E  +   +  +++  T++  + L++    +  +   S F RRL+
Subjt:  YESQGGTLRKRRGKRKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLD

Query:  SQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLI----TSTFQKLFKNSSSMAAHNQRTRTSDPMA
         Q+   Y  IIR+H+D E IR RV    Y + +   +RDLLLL NNA VFY   S E + A  L  L+    T+T + L        +  +    + P +
Subjt:  SQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLI----TSTFQKLFKNSSSMAAHNQRTRTSDPMA

Query:  KPRRSQP
        KP  S+P
Subjt:  KPRRSQP

AT2G44430.1 DNA-binding bromodomain-containing protein (e value: 1.0e-18, % identity: 27.03)
Query:  KSWDTWEELLLGGAVLRHGTADWNLVAAELRAR-IVRPYACTPEVCKAKYEDLQKRF---------------------VGCK-AWYEELRRQRIMELRQA
        ++W TWEELLL  AV RHG  DW+ VA E+R+R  +     +   C+ KY DL++RF                     VG    W E+LR  R+ ELR+ 
Subjt:  KSWDTWEELLLGGAVLRHGTADWNLVAAELRAR-IVRPYACTPEVCKAKYEDLQKRF---------------------VGCK-AWYEELRRQRIMELRQA

Query:  LEHSEDSIGSLESKLEALK------SRSGDKSLVNSSSRSESWGAVQKPANEL--SAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQSLERGKLSRI
        +E  + SI SL+ K++ L+          D        RSE+ G+  +   +   +A    RENR+ +      A    EE     E SQ+         
Subjt:  LEHSEDSIGSLESKLEALK------SRSGDKSLVNSSSRSESWGAVQKPANEL--SAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQSLERGKLSRI

Query:  GKLGGVLYESQGGTLRKRRGKRKRKDCNRGDVKEGS---SGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENK
                E   G  +        KD    + +EGS     E + S+ +  S  S SK        + R+   A E   + +    + L++  + +  + 
Subjt:  GKLGGVLYESQGGTLRKRRGKRKRKDCNRGDVKEGS---SGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENK

Query:  SASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTR
          S+F RRL SQ+   YK +++QHLDIETI+ ++    Y ++    YRDL LL  NAIVF+  +S E  +A  LR ++                 +Q  R
Subjt:  SASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTR

Query:  TSDPMAKPRRSQPAKRNVSQKQGNPGDVKTPSGNRRRRSNAN----SQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKR
             A PR     K+  S  +    D +T   +  R+ ++      +    ++ K +P+S++      T++  +   K D  ATGVR  +R
Subjt:  TSDPMAKPRRSQPAKRNVSQKQGNPGDVKTPSGNRRRRSNAN----SQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVRGRKR

AT3G57980.1 DNA-binding bromodomain-containing protein (e value: 3.0e-15, % identity: 26.44)
Query:  EELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSL
        EELLL  AV RHGT  W+ VA+E+  +       T   C+ KY DL++RF                  +    W EELR+ R+ ELR+ +E  + SI SL
Subjt:  EELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSL

Query:  ESKLEALKSRSGDKSLVNSSS-----------RSESWGAVQKPANELSAGSFTREN-------------RTCSSVECQPAPLSVEETEIKPEASQSLERG
        + K++ L+    +KSL   +S            +ES      P  EL       +N             +    V+ +P  +  E+ + KP A +   RG
Subjt:  ESKLEALKSRSGDKSLVNSSS-----------RSESWGAVQKPANELSAGSFTREN-------------RTCSSVECQPAPLSVEETEIKPEASQSLERG

Query:  KLSRIGKLGGVLYESQGGTLRKRRGKRKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANE--ASRSSTIDGVEVLMAAFNSV
            + K        + G       +   +     D KE S G+++ S P   +      +N         +S   N+         D +E+L +     
Subjt:  KLSRIGKLGGVLYESQGGTLRKRRGKRKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANE--ASRSSTIDGVEVLMAAFNSV

Query:  AENKSASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFK---------
          +   S F RRL++Q+   Y +IIRQH+D E IRSRV   +Y T + + +RDLLLL NN  VFY   S E  +A  L  LI    Q  FK         
Subjt:  AENKSASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITSTFQKLFK---------

Query:  NSSSMAAHNQRTRTSDPMAKPRRSQP---AKRNVSQKQGNPGDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVR
           ++    +  + S    KP  S P    ++  S    +P  V      + R      +  V   ++  P+   +K P  ++K A G + S     G R
Subjt:  NSSSMAAHNQRTRTSDPMAKPRRSQP---AKRNVSQKQGNPGDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRKAAVGTSKSDRSATGVR

Query:  GRK
          K
Subjt:  GRK

AT3G60110.1 DNA-binding bromodomain-containing protein (e value: 1.8e-20, % identity: 25.65)
Query:  IDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF--------------------VGCKAWYEELRRQRIMELRQAL
        I + W TWEEL+L  AV RH  +DW+ VA E++AR       +   C+ KY+DL++RF                    VG  +W E+LR   + ELR+ +
Subjt:  IDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF--------------------VGCKAWYEELRRQRIMELRQAL

Query:  EHSEDSIGSLESKLEAL---------------------------KSRSGDKSLVNSSSRSESWGAVQKPA--NELSAGSFTRENRTCSSVECQPAPLSVE
        +  +DSI SL+ K++ L                           ++   D+    S + S S  +V K A  + L      + N   +S    P P++  
Subjt:  EHSEDSIGSLESKLEAL---------------------------KSRSGDKSLVNSSSRSESWGAVQKPA--NELSAGSFTRENRTCSSVECQPAPLSVE

Query:  ETEIKPEASQSLERGKLSRIGKLGGVLYESQGGT---LRKRRGKRKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRS
        ET  + E + S +R ++S  G+L       + GT   L KR+G++ R     G VK  S+G+   S+P+                               
Subjt:  ETEIKPEASQSLERGKLSRIGKLGGVLYESQGGT---LRKRRGKRKRKDCNRGDVKEGSSGENNLSEPVNPSTVSHSKENSCCNSFEARESSDANEASRS

Query:  STIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITS
          ID +++       +  +   SVF  RL SQ    YK++IRQHLD++TI  ++    Y+++    YRDL LL  NAIVF+  +S E  +A  LR L+++
Subjt:  STIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNAIVFYSPNSREHQSAVHLRGLITS

Query:  TFQK--------LFKNSSSMAAHNQRT---------RTSDPMAKPRRSQPAKRNVSQKQGNPGDVK--TPSGNRRRRSNANSQSSVGLAKKETPASTAK
          +K        + K+ +  +   Q++         + S  + K   S  +++   +K     + K  T +     RS+  +   + +  K+T    AK
Subjt:  TFQK--------LFKNSSSMAAHNQRT---------RTSDPMAKPRRSQPAKRNVSQKQGNPGDVK--TPSGNRRRRSNANSQSSVGLAKKETPASTAK


Sequences
CDS sequence
ATGGGAGCGGAGGCCATAGACAAGAGCTGGGACACCTGGGAAGAGCTTCTATTGGGAGGCGCCGTACTCCGCCACGGCACCGCCGACTGGAACCTCGTCGCGGCCGAGCT
CCGGGCCAGGATTGTTCGTCCGTACGCCTGTACGCCCGAGGTTTGTAAGGCCAAATATGAAGACTTACAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GGCGGCAACGAATCATGGAACTAAGACAAGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTTAAGTCTAGGAGTGGAGACAAGTCT
CTTGTGAATAGCTCTAGCAGATCAGAATCTTGGGGAGCTGTTCAGAAGCCAGCGAATGAGCTATCTGCCGGTAGCTTCACACGAGAAAACAGGACGTGCAGTTCGGTCGA
ATGTCAGCCAGCTCCGTTGTCGGTCGAAGAGACGGAGATTAAGCCAGAAGCCTCGCAGTCTCTCGAACGGGGAAAGTTGTCGAGGATTGGGAAGTTGGGTGGGGTATTAT
ATGAAAGCCAAGGAGGAACATTGAGGAAGAGAAGAGGGAAGAGAAAGAGGAAGGATTGTAATAGGGGGGATGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCCGAA
CCAGTTAACCCTTCAACTGTTTCTCATTCTAAAGAAAACTCATGCTGCAACTCGTTTGAGGCTCGTGAATCTTCTGATGCAAATGAAGCTAGCAGAAGCTCAACTATTGA
TGGAGTTGAAGTTCTAATGGCTGCTTTTAACTCTGTTGCAGAGAATAAAAGTGCCTCTGTATTTCGTCGTCGCCTTGATAGTCAGAAGAGAGGAAGATACAAGAAAATAA
TCCGTCAACACTTGGATATTGAAACAATAAGGTCAAGAGTTGCAAGTCATTACATAACGACGCAAAAGGAGCTGTACAGAGATCTGCTGTTGCTTGCTAACAATGCTATC
GTCTTCTACTCGCCGAATTCCCGGGAACATCAGTCTGCAGTGCATCTCAGAGGCCTCATTACAAGTACATTTCAGAAGCTTTTTAAGAACTCTAGCAGTATGGCAGCCCA
CAACCAAAGAACACGAACCTCTGATCCGATGGCGAAACCGCGTCGTTCGCAGCCTGCTAAGCGTAATGTATCGCAAAAGCAAGGCAATCCAGGAGATGTCAAAACTCCAA
GTGGAAATAGAAGAAGAAGAAGTAATGCTAATTCTCAATCCTCAGTGGGGTTAGCAAAGAAAGAAACTCCAGCCTCTACAGCAAAGAAAGGCCCTGGTGGGACGAGAAAG
GCCGCCGTTGGGACGTCGAAAAGCGATCGATCTGCAACTGGCGTTAGGGGAAGGAAAAGAGGGAGAACAAAGTAA
mRNA sequence
ATGGGAGCGGAGGCCATAGACAAGAGCTGGGACACCTGGGAAGAGCTTCTATTGGGAGGCGCCGTACTCCGCCACGGCACCGCCGACTGGAACCTCGTCGCGGCCGAGCT
CCGGGCCAGGATTGTTCGTCCGTACGCCTGTACGCCCGAGGTTTGTAAGGCCAAATATGAAGACTTACAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GGCGGCAACGAATCATGGAACTAAGACAAGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTTAAGTCTAGGAGTGGAGACAAGTCT
CTTGTGAATAGCTCTAGCAGATCAGAATCTTGGGGAGCTGTTCAGAAGCCAGCGAATGAGCTATCTGCCGGTAGCTTCACACGAGAAAACAGGACGTGCAGTTCGGTCGA
ATGTCAGCCAGCTCCGTTGTCGGTCGAAGAGACGGAGATTAAGCCAGAAGCCTCGCAGTCTCTCGAACGGGGAAAGTTGTCGAGGATTGGGAAGTTGGGTGGGGTATTAT
ATGAAAGCCAAGGAGGAACATTGAGGAAGAGAAGAGGGAAGAGAAAGAGGAAGGATTGTAATAGGGGGGATGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCCGAA
CCAGTTAACCCTTCAACTGTTTCTCATTCTAAAGAAAACTCATGCTGCAACTCGTTTGAGGCTCGTGAATCTTCTGATGCAAATGAAGCTAGCAGAAGCTCAACTATTGA
TGGAGTTGAAGTTCTAATGGCTGCTTTTAACTCTGTTGCAGAGAATAAAAGTGCCTCTGTATTTCGTCGTCGCCTTGATAGTCAGAAGAGAGGAAGATACAAGAAAATAA
TCCGTCAACACTTGGATATTGAAACAATAAGGTCAAGAGTTGCAAGTCATTACATAACGACGCAAAAGGAGCTGTACAGAGATCTGCTGTTGCTTGCTAACAATGCTATC
GTCTTCTACTCGCCGAATTCCCGGGAACATCAGTCTGCAGTGCATCTCAGAGGCCTCATTACAAGTACATTTCAGAAGCTTTTTAAGAACTCTAGCAGTATGGCAGCCCA
CAACCAAAGAACACGAACCTCTGATCCGATGGCGAAACCGCGTCGTTCGCAGCCTGCTAAGCGTAATGTATCGCAAAAGCAAGGCAATCCAGGAGATGTCAAAACTCCAA
GTGGAAATAGAAGAAGAAGAAGTAATGCTAATTCTCAATCCTCAGTGGGGTTAGCAAAGAAAGAAACTCCAGCCTCTACAGCAAAGAAAGGCCCTGGTGGGACGAGAAAG
GCCGCCGTTGGGACGTCGAAAAGCGATCGATCTGCAACTGGCGTTAGGGGAAGGAAAAGAGGGAGAACAAAGTAA
Protein sequence
MGAEAIDKSWDTWEELLLGGAVLRHGTADWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLEALKSRSGDKS
LVNSSSRSESWGAVQKPANELSAGSFTRENRTCSSVECQPAPLSVEETEIKPEASQSLERGKLSRIGKLGGVLYESQGGTLRKRRGKRKRKDCNRGDVKEGSSGENNLSE
PVNPSTVSHSKENSCCNSFEARESSDANEASRSSTIDGVEVLMAAFNSVAENKSASVFRRRLDSQKRGRYKKIIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAI
VFYSPNSREHQSAVHLRGLITSTFQKLFKNSSSMAAHNQRTRTSDPMAKPRRSQPAKRNVSQKQGNPGDVKTPSGNRRRRSNANSQSSVGLAKKETPASTAKKGPGGTRK
AAVGTSKSDRSATGVRGRKRGRTK
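The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a rough sketch of that relationship, the code below translates a DNA string with the standard genetic code; it assumes the standard nuclear code applies to this gene, and the `translate` helper and `CODON_TABLE` name are introduced here purely for illustration. Only short excerpts from the start and end of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 27 nucleotides of the CDS listed above.
start_peptide = translate("ATGGGAGCGGAGGCCATAGACAAGAGC")
# Last 9 nucleotides of the CDS, including the TAA stop codon.
end_peptide = translate("ACAAAGTAA")
print(start_peptide, end_peptide)
```

The two excerpts translate to `MGAEAIDKS` and `TK`, matching the first and last residues of the listed protein sequence.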