CuGenDBv2

PI0023930 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0023930
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Bromo domain-containing protein
Genome location: chr12:4724200..4728785
RNA-Seq Expression: PI0023930
Synteny: PI0023930
Gene Ontology terms:
    GO:0016573 - histone acetylation (biological process)
    GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
    GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR001487 - Bromodomain
    IPR036427 - Bromodomain-like superfamily
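
The genome location above uses a "chromosome:start..end" notation. As a minimal sketch, the following Python splits such a string into its parts and reports the length of the gene span. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

chrom, start, end = parse_location("chr12:4724200..4728785")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 4586 bp on chr12, introns and untranslated regions included.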


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146636.1 uncharacterized protein LOC101217843 isoform X1 [Cucumis sativus] (e value: 5.4e-232; % identity: 92.77)
Query:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE
        MGAEAL+  WDTW ELLLGGAILRHGTADWNLVA ELR+RI RPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKR+MELRQ LEHSEDSIGSLESKLE
Subjt:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSA SFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLER KASRI KLG VLYENQGGIIRK SRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMAAFN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV
        DIETIRSRVASH ITTK ELYRDLLLLANNAL+FYSRNSREHQSAVLLRRLISSTF+K MKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQ+E 
Subjt:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV

Query:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK
        NPGDVKTP GNRRR+NNSSNPPSS+GLAKKETSTS +KKAPGGTRKA G TSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK
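
In BLAST-style output such as the block above, the unlabeled middle row marks each aligned column: a letter for an identical residue, "+" for a conservative substitution, and a space for a mismatch or gap. The sketch below counts those marks for one row. The helper name `midline_stats` is hypothetical, and the figure it prints covers only the 20-column fragment used here, not the whole-alignment % identity reported for the hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, aligned_length) for one alignment row."""
    # Trailing spaces are often stripped from a midline, so pad it back
    # to the length of the query row before counting.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    # "Positives" are identical columns plus conservative substitutions.
    positive = identical + midline.count("+")
    return identical, positive, len(midline)

# First 20 columns of the first Query/midline pair of the top GenBank hit.
query = "MGAEALEKRWDTWHELLLGG"
midline = "MGAEAL+  WDTW ELLLGG"
identical, positive, length = midline_stats(query, midline)
print(f"identical {identical}/{length}, positives {positive}/{length}")
print(f"identity over this fragment: {100 * identical / length:.1f}%")
```

Summing the counts over every row of a hit and dividing by the total aligned length gives a whole-alignment figure comparable to the reported one, provided gap columns are included in the length.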

XP_008442126.1 PREDICTED: uncharacterized protein LOC103486076 isoform X1 [Cucumis melo] (e value: 4.6e-231; % identity: 93.19)
Query:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE
        MGAEAL KRWDTW ELLLGGAI+RHGT DWNLVA ELR+RI RPY CTPEVCKAKYEDLKKRFVGCKAWYEELR+KRIMELRQ LEHSEDSIGSLESKLE
Subjt:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SA SFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE  K+ RI KLG VLYENQGGIIRK SRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMA FNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV
        DIETIRSRVASHYITTKKELYRDLLLLANNAL+FYSRNSREHQSAV LRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQ+E 
Subjt:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV

Query:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK
        NPGDVKTPNGNRRRRNNSSNPPSS+GL+KKETSTST KKAPGG RKA G TSKSERSATGIRGRKRGRTK
Subjt:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK

XP_008442135.1 PREDICTED: uncharacterized protein LOC103486076 isoform X2 [Cucumis melo] (e value: 7.4e-213; % identity: 88.09)
Query:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE
        MGAEAL KRWDTW ELLLGGAI+RHGT DWNLVA ELR+RI RPY CTPEVCKAKYEDLKKRFVGCK                          SLESKLE
Subjt:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SA SFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE  K+ RI KLG VLYENQGGIIRK SRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMA FNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV
        DIETIRSRVASHYITTKKELYRDLLLLANNAL+FYSRNSREHQSAV LRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQ+E 
Subjt:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV

Query:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK
        NPGDVKTPNGNRRRRNNSSNPPSS+GL+KKETSTST KKAPGG RKA G TSKSERSATGIRGRKRGRTK
Subjt:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK

XP_031736491.1 uncharacterized protein LOC101217843 isoform X2 [Cucumis sativus] (e value: 1.5e-194; % identity: 81.49)
Query:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE
        MGAEAL+  WDTW ELLLGGAILRHGTADWNLVA ELR+RI RPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKR+MELRQ LEHSEDSIGSLESKLE
Subjt:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSA SFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLER KASRI KLG VLYENQGGIIRK SRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMAAFN+VAEDKSAS+FRRRLDS              
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV
                                                  QSAVLLRRLISSTF+K MKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQ+E 
Subjt:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV

Query:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK
        NPGDVKTP GNRRR+NNSSNPPSS+GLAKKETSTS +KKAPGGTRKA G TSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK

XP_038894005.1 uncharacterized protein LOC120082772 [Benincasa hispida] (e value: 1.7e-188; % identity: 87.89)
Query:  EVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIE
        + C+AKYEDLKKRFVGCKAWYEELRR+RIMELRQ LEHSEDSIGSLESKLEALKSRSGSDKSLVN STRSESWGAVQKPTNELSAGSFTQEN TTCSSIE
Subjt:  EVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIE

Query:  CQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDA
        CQPAPLST+ETEIKPEP +SLER KASRI KLGGVLYE+QGG +RK SRGKRKRKDCNREVKEGSSGENNLS+S NPSTVSQSKENSCCNSFEARESSDA
Subjt:  CQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDA

Query:  NEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLR
        NEASRSSTMDGVDVLMAAFNSVAE+K+A+VFRRRLDSQRR RYKKLIRQHLDIETIRSRVASHY TTKKELYRDLLLLANNA++FYS NSREHQSAVLLR
Subjt:  NEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLR

Query:  RLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEVNPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAG
         LISSTFQKLMKSSSNMVAH+  NQRTQTCDL+AKPRRSQPAKRN  QKEVNPGDVKTPNG   RR N++NP SSM LAKKETSTS VKK PGGTRKA G
Subjt:  RLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEVNPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAG

Query:  WTSKSERSATGIRGRKRGRTK
          SKS +SAT ++GRKRGRTK
Subjt:  WTSKSERSATGIRGRKRGRTK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LV17 Bromo domain-containing protein (e value: 2.6e-232; % identity: 92.77)
Query:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE
        MGAEAL+  WDTW ELLLGGAILRHGTADWNLVA ELR+RI RPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKR+MELRQ LEHSEDSIGSLESKLE
Subjt:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSA SFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLER KASRI KLG VLYENQGGIIRK SRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMAAFN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV
        DIETIRSRVASH ITTK ELYRDLLLLANNAL+FYSRNSREHQSAVLLRRLISSTF+K MKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQ+E 
Subjt:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV

Query:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK
        NPGDVKTP GNRRR+NNSSNPPSS+GLAKKETSTS +KKAPGGTRKA G TSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK

A0A1S3B4K1 uncharacterized protein LOC103486076 isoform X2 (e value: 3.6e-213; % identity: 88.09)
Query:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE
        MGAEAL KRWDTW ELLLGGAI+RHGT DWNLVA ELR+RI RPY CTPEVCKAKYEDLKKRFVGCK                          SLESKLE
Subjt:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SA SFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE  K+ RI KLG VLYENQGGIIRK SRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMA FNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV
        DIETIRSRVASHYITTKKELYRDLLLLANNAL+FYSRNSREHQSAV LRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQ+E 
Subjt:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV

Query:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK
        NPGDVKTPNGNRRRRNNSSNPPSS+GL+KKETSTST KKAPGG RKA G TSKSERSATGIRGRKRGRTK
Subjt:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK

A0A1S3B4Z1 uncharacterized protein LOC103486076 isoform X1 (e value: 2.2e-231; % identity: 93.19)
Query:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE
        MGAEAL KRWDTW ELLLGGAI+RHGT DWNLVA ELR+RI RPY CTPEVCKAKYEDLKKRFVGCKAWYEELR+KRIMELRQ LEHSEDSIGSLESKLE
Subjt:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SA SFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE  K+ RI KLG VLYENQGGIIRK SRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMA FNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV
        DIETIRSRVASHYITTKKELYRDLLLLANNAL+FYSRNSREHQSAV LRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQ+E 
Subjt:  DIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEV

Query:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK
        NPGDVKTPNGNRRRRNNSSNPPSS+GL+KKETSTST KKAPGG RKA G TSKSERSATGIRGRKRGRTK
Subjt:  NPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK

A0A5A7UTW9 Putative Bromodomain 4 (e value: 6.8e-188; % identity: 94.13)
Query:  MELRQDLEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRI
        MELRQ LEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SA SFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE  K+ RI
Subjt:  MELRQDLEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRI

Query:  WKLGGVLYENQGGIIRKGSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSAS
         KLG VLYENQGGIIRK SRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMA FNSVAEDKSAS
Subjt:  WKLGGVLYENQGGIIRKGSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSAS

Query:  VFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQT
        VFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNAL+FYSRNSREHQSAV LRRLISSTFQKLMKSSSNMVAHNTPNQRTQT
Subjt:  VFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQT

Query:  CDLIAKPRRSQPAKRNESQKEVNPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK
        CDLIAKPRRSQPAKRNESQ+E NPGDVKTPNGNRRRRNNSSNPPSS+GL+KKETSTST KKAPGG RKA G TSKSERSATGIRGRKRGRTK
Subjt:  CDLIAKPRRSQPAKRNESQKEVNPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X1 (e value: 8.6e-175; % identity: 76.42)
Query:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE
        MGAEA++KRWDTW ELLLGGAILRHGT DWNLVA ELRARIVRP A TPEVCKAKYEDL+KRFVGCKAWYEELRR+RI+ELR+ LEHSEDSIGSLESKLE
Subjt:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK
        ALKSRSG DKSLVN S RSESWG V KPTNELSAGSFTQENR TCSS+EC+ AP    ETEIKPE  Q        R  + G V      G ++K SRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGK

Query:  RKRKDC--NREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDG--VDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLI
        RKRKDC  +R+VKEGS+GENNLSESANPSTVS SK+NSCCNSFE RESSDANEASRSSTMDG  VDVLMAAFN+VAE+KSA VFRRRLDSQ+R RYKKLI
Subjt:  RKRKDC--NREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDG--VDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLI

Query:  RQHLDIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNES
        RQHLDIETIRSRVASHYITT+KELYRDLLLLANNAL+FY  N+REH+SAVLLRRLI+STFQKL K        N+  +RTQT D +AKP R QPAKR ES
Subjt:  RQHLDIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNES

Query:  QKEVNPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERS-ATGIRGRKRGRTK
        +KEVNPGD KTP+GNRRRR+N +N  SS+GLAK ETS STVK+ P GTRK+   TSKSE+S ATG+RGRKRGRTK
Subjt:  QKEVNPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERS-ATGIRGRKRGRTK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G61215.1 bromodomain 4 (e value: 2.6e-59; % identity: 37.1)
Query:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE
        M    +E  W TW ELLLGGA+LRHGT DW +VA ELR+  + P   TPE+CKAKY+DL+KR+VGCKAW+EEL++KR+ EL+  L  SEDSIGSLESKL+
Subjt:  MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLE

Query:  ALKSRSG---------SDKSL-VNGSTRSESWG-AVQKPTNE--LSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIW--KLGGVL
        +LKS S          S ++L +  S +SE  G    K T++   S GSFTQ+  TT           +    E K E    +E+EK   +    +   +
Subjt:  ALKSRSG---------SDKSL-VNGSTRSESWG-AVQKPTNE--LSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIW--KLGGVL

Query:  YENQGGII--RKGSRGKRKRKDCN----REVKEGSSGENN--LSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSA
        Y   G ++   +  RGKRKRKDC+    +EV E S+ E +     SA+ +++ +SKE           +S ++  SR  ++     LM  +N++A+++ A
Subjt:  YENQGGII--RKGSRGKRKRKDCN----REVKEGSSGENN--LSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSA

Query:  SVFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQ
         VFRRRLDSQ+R RYKKL+R+H+D++T++SR+    I++ KEL+RD LL+ANNA IFYS+N+RE++SAV LR +++ + +  +  + +   H +      
Subjt:  SVFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQ

Query:  TCDLIAKPRRSQPAKRNE-SQKEVNPG--DVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK
        T  ++   + + P+ R   + K+   G   +KT   +  + ++  N  S   L      +S   K     RK     ++   S   + GRKR R +
Subjt:  TCDLIAKPRRSQPAKRNE-SQKEVNPG--DVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKSERSATGIRGRKRGRTK

AT2G42150.1 DNA-binding bromodomain-containing protein (e value: 5.8e-22; % identity: 26.34)
Query:  EKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRF------------VGCKAWYEELRRKRIMELRQDLEHSEDSIGS
        ++ W TW ELLL  A+ RHGT  WN V+ E++       + T   C+ KY DLK RF            +    W EELR+ R+ ELR+++E  + SI +
Subjt:  EKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRF------------VGCKAWYEELRRKRIMELRQDLEHSEDSIGS

Query:  LESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIR
        L+SK++ L+     + S +   T +E+    +K           +E   +   +   P  L  +     P+ + S   E+   +   GG   +  G    
Subjt:  LESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIR

Query:  KGSRGKRKRKDCNREVKEGSSGENNLSESANPST----VSQSKENSCCNSFEARESSDANEASRSSTMDGV---DVLMAAFNSVAEDKSASVFRRRLDSQ
        +GS    +++      +        L ES + ++    ++   ++S     +     D  + S +S  D       L++    +      S F RRL+ Q
Subjt:  KGSRGKRKRKDCNREVKEGSSGENNLSESANPST----VSQSKENSCCNSFEARESSDANEASRSSTMDGV---DVLMAAFNSVAEDKSASVFRRRLDSQ

Query:  RRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNM--VAHNTPNQRT----QTCD
            Y  +IR+H+D E IR RV    Y + +   +RDLLLL NNA +FY R S E + A  L +L+       +K  SN   ++ + P +       +  
Subjt:  RRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNM--VAHNTPNQRT----QTCD

Query:  LIAKPRRSQP
        + +KPR S P
Subjt:  LIAKPRRSQP

AT2G44430.1 DNA-binding bromodomain-containing protein (e value: 4.4e-14; % identity: 25.19)
Query:  WDTWHELLLGGAILRHGTADWNLVAMELRAR-IVRPYACTPEVCKAKYEDLKKRF---------------------VGCK-AWYEELRRKRIMELRQDLE
        W TW ELLL  A+ RHG  DW+ VA E+R+R  +     +   C+ KY DLK+RF                     VG    W E+LR  R+ ELR+++E
Subjt:  WDTWHELLLGGAILRHGTADWNLVAMELRAR-IVRPYACTPEVCKAKYEDLKKRF---------------------VGCK-AWYEELRRKRIMELRQDLE

Query:  HSEDSIGSLESKLEALKSRS--GSDKSLVNG---STRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEI-KPEPLQSLEREKASRIWK
          + SI SL+ K++ L+     G +K  +       RSE+ G+ +    E +  +  + +R   S  E      + +E  +   EP Q+           
Subjt:  HSEDSIGSLESKLEALKSRS--GSDKSLVNG---STRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEI-KPEPLQSLEREKASRIWK

Query:  LGGVLYENQGGIIRKGSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEA------RESSDANEASRSSTMDGVDVLMAAFNSVAED
                     R+   G  K  D +   K+ ++ E      +  S  S S E     + E+      R+   A E   + +      L++  + +   
Subjt:  LGGVLYENQGGIIRKGSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEA------RESSDANEASRSSTMDGVDVLMAAFNSVAED

Query:  KSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQK-LMKSSSNMVAHNTP
           S+F RRL SQ    YK +++QHLDIETI+ ++    Y ++    YRDL LL  NA++F+  +S E  +A  LR ++S   +K   K+   ++     
Subjt:  KSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQK-LMKSSSNMVAHNTP

Query:  NQRTQTCD-----------------LIAKPRRSQPAKRNESQKEVN-PGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKS
          R+   D                 ++ K RRS  AK + S    +   D K    +  + N ++   SS    K     +   K   G  K     SK+
Subjt:  NQRTQTCD-----------------LIAKPRRSQPAKRNESQKEVN-PGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKAPGGTRKAAGWTSKS

Query:  ERSATGIRGRKRGRTK
          S      +  G+T+
Subjt:  ERSATGIRGRKRGRTK

AT3G57980.1 DNA-binding bromodomain-containing protein (e value: 9.2e-20; % identity: 28.76)
Query:  ELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRF------------------VGCKAWYEELRRKRIMELRQDLEHSEDSIGSLE
        ELLL  A+ RHGT  W+ VA E+  +       T   C+ KY DLK+RF                  +    W EELR+ R+ ELR+++E  + SI SL+
Subjt:  ELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRF------------------VGCKAWYEELRRKRIMELRQDLEHSEDSIGSLE

Query:  SKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIK--PEPLQS---LEREKASRIWKLGGVLYENQGG
         K++ L+     +KSL             +   ++L   + T+EN T   +    P       TE+K  P+P  +      E  +R  K+   + E    
Subjt:  SKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIK--PEPLQS---LEREKASRIWKLGGVLYENQGG

Query:  IIRKGSRGKRKRKD-----CNREVKEGSSGE------------NNLSESANPSTVSQSKENSCCNSFEARESSDANE---ASRSSTMDGVDVLMAAFNSV
        I  + +  K  R+D     C    KE    E             ++ ES       ++ +     SF  +E+ D ++     +S T++ + V     +  
Subjt:  IIRKGSRGKRKRKD-----CNREVKEGSSGE------------NNLSESANPSTVSQSKENSCCNSFEARESSDANE---ASRSSTMDGVDVLMAAFNSV

Query:  AE----DKSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLI
         E        S F RRL++Q  S Y ++IRQH+D E IRSRV   +Y T + + +RDLLLL NN  +FY   S E  +A  L +LI
Subjt:  AE----DKSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLI

AT3G60110.1 DNA-binding bromodomain-containing protein (e value: 3.5e-19; % identity: 24.26)
Query:  LEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRF--------------------VGCKAWYEELRRKRIMELRQDL
        +++ W TW EL+L  A+ RH  +DW+ VA E++AR       +   C+ KY+DLK+RF                    VG  +W E+LR   + ELR+++
Subjt:  LEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRF--------------------VGCKAWYEELRRKRIMELRQDL

Query:  EHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVL
        +  +DSI SL+ K++ L+     D    +G  + +      KP          + NR T  S       ++   +    + +   +R    ++ K     
Subjt:  EHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVL

Query:  YENQGGIIRKGSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLD
               + K    + + +  ++  +  +SGE  L ES   + + + K          +  S        S  D    L+     +      SVF  RL 
Subjt:  YENQGGIIRKGSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLD

Query:  SQRRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQK--------LMKSSS--------NMVA
        SQ    YK+LIRQHLD++TI  ++    Y+++    YRDL LL  NA++F+  +S E  +A  LR L+S+  +K        ++KS +        + V 
Subjt:  SQRRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALIFYSRNSREHQSAVLLRRLISSTFQK--------LMKSSS--------NMVA

Query:  HNTPNQRTQTCDLIAKPRRSQPAKRNESQKEVNPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVK
           P ++  +      P  S   K  +  +EV+   + T       R +S      + +  K+T T   K
Subjt:  HNTPNQRTQTCDLIAKPRRSQPAKRNESQKEVNPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVK


Sequences
CDS sequence
ATGGGAGCGGAAGCGTTGGAGAAGAGGTGGGATACGTGGCACGAGCTTTTATTAGGTGGTGCCATACTCCGCCATGGTACTGCTGATTGGAACCTGGTCGCCATGGAGCT
CCGGGCGAGAATTGTTCGTCCGTACGCCTGCACCCCCGAGGTTTGTAAGGCGAAATATGAAGACTTGAAGAAGCGTTTCGTTGGATGCAAAGCTTGGTATGAGGAGCTGC
GACGAAAAAGAATCATGGAACTAAGACAGGATCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTTAAGTCAAGGAGTGGATCAGACAAG
TCTCTTGTCAATGGCTCTACCAGATCAGAATCTTGGGGAGCTGTTCAGAAACCAACCAATGAGCTATCTGCCGGTAGCTTCACGCAGGAAAACAGGACGACGTGCAGTTC
GATCGAGTGTCAGCCAGCTCCATTGTCGACCAAAGAGACAGAGATTAAACCAGAACCATTACAGTCTCTCGAACGGGAAAAAGCCTCAAGAATTTGGAAGTTGGGAGGGG
TATTGTATGAAAACCAAGGAGGAATAATTAGGAAGGGATCAAGAGGGAAGAGAAAGAGGAAGGATTGTAATCGGGAAGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTG
TCTGAATCAGCTAACCCTTCCACTGTTTCACAGTCTAAAGAAAACTCATGTTGCAACTCGTTTGAGGCACGTGAATCTTCGGATGCAAATGAAGCTAGCAGAAGCTCAAC
CATGGATGGTGTCGATGTTTTAATGGCTGCTTTTAACTCTGTTGCGGAGGACAAAAGTGCCTCTGTATTTCGTCGTCGCCTTGATAGTCAGAGGAGAAGTAGATATAAGA
AACTAATCAGGCAACATTTGGATATTGAAACAATAAGATCAAGAGTTGCAAGTCATTACATAACGACAAAAAAGGAGTTGTACAGAGATCTGTTGTTGCTTGCTAACAAT
GCACTCATTTTCTACTCGCGGAATTCCCGGGAGCATCAGTCTGCAGTGTTGCTCAGAAGACTCATTTCAAGTACATTTCAGAAGCTAATGAAGAGCTCTAGCAATATGGT
AGCTCATAACACCCCCAACCAGAGAACACAAACCTGTGATCTGATAGCAAAACCGCGTCGTTCGCAGCCAGCTAAACGTAATGAATCTCAAAAAGAAGTCAATCCAGGAG
ATGTTAAAACTCCAAATGGAAATAGAAGAAGAAGAAATAATAGTTCTAATCCTCCTTCATCAATGGGGTTGGCAAAGAAAGAAACTTCGACTTCTACGGTAAAGAAAGCC
CCTGGTGGGACGAGAAAGGCTGCCGGTTGGACATCAAAAAGCGAACGATCTGCAACTGGCATTAGGGGAAGGAAAAGAGGGAGAACGAAGTAA
mRNA sequence
CATATATTTTTCTTTTCTAATTTAAAAATAATTTTGTATTTTGTATTTGTTCTTTTTGTTTTGTTTTTTTAATGTTTTTCTTTTGCCCTTCTTCAACCCCTCCTATTAAA
AGTAACGAGAAACCCCAATTCCCGCTATAACCAATAATACCCAAAACCCATTACACAAATAACTTTTTTTTGGTGTAAAGGAAACAAATGAAATGAACCCTCCAACCTAC
CCATCCCCATCCAAAACCTTTTCCCCTTCTCTGAAATTTTCCTTTGAATTTGTAGTCGTGCTAGGGTTCCGAGGAATATCCCGTGATTAATATGGGAGCGGAAGCGTTGG
AGAAGAGGTGGGATACGTGGCACGAGCTTTTATTAGGTGGTGCCATACTCCGCCATGGTACTGCTGATTGGAACCTGGTCGCCATGGAGCTCCGGGCGAGAATTGTTCGT
CCGTACGCCTGCACCCCCGAGGTTTGTAAGGCGAAATATGAAGACTTGAAGAAGCGTTTCGTTGGATGCAAAGCTTGGTATGAGGAGCTGCGACGAAAAAGAATCATGGA
ACTAAGACAGGATCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTTAAGTCAAGGAGTGGATCAGACAAGTCTCTTGTCAATGGCTCTA
CCAGATCAGAATCTTGGGGAGCTGTTCAGAAACCAACCAATGAGCTATCTGCCGGTAGCTTCACGCAGGAAAACAGGACGACGTGCAGTTCGATCGAGTGTCAGCCAGCT
CCATTGTCGACCAAAGAGACAGAGATTAAACCAGAACCATTACAGTCTCTCGAACGGGAAAAAGCCTCAAGAATTTGGAAGTTGGGAGGGGTATTGTATGAAAACCAAGG
AGGAATAATTAGGAAGGGATCAAGAGGGAAGAGAAAGAGGAAGGATTGTAATCGGGAAGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCTGAATCAGCTAACCCTT
CCACTGTTTCACAGTCTAAAGAAAACTCATGTTGCAACTCGTTTGAGGCACGTGAATCTTCGGATGCAAATGAAGCTAGCAGAAGCTCAACCATGGATGGTGTCGATGTT
TTAATGGCTGCTTTTAACTCTGTTGCGGAGGACAAAAGTGCCTCTGTATTTCGTCGTCGCCTTGATAGTCAGAGGAGAAGTAGATATAAGAAACTAATCAGGCAACATTT
GGATATTGAAACAATAAGATCAAGAGTTGCAAGTCATTACATAACGACAAAAAAGGAGTTGTACAGAGATCTGTTGTTGCTTGCTAACAATGCACTCATTTTCTACTCGC
GGAATTCCCGGGAGCATCAGTCTGCAGTGTTGCTCAGAAGACTCATTTCAAGTACATTTCAGAAGCTAATGAAGAGCTCTAGCAATATGGTAGCTCATAACACCCCCAAC
CAGAGAACACAAACCTGTGATCTGATAGCAAAACCGCGTCGTTCGCAGCCAGCTAAACGTAATGAATCTCAAAAAGAAGTCAATCCAGGAGATGTTAAAACTCCAAATGG
AAATAGAAGAAGAAGAAATAATAGTTCTAATCCTCCTTCATCAATGGGGTTGGCAAAGAAAGAAACTTCGACTTCTACGGTAAAGAAAGCCCCTGGTGGGACGAGAAAGG
CTGCCGGTTGGACATCAAAAAGCGAACGATCTGCAACTGGCATTAGGGGAAGGAAAAGAGGGAGAACGAAGTAAATGGTAAAAATTTCAAAACTATTTCTTGGTAGATTC
TCAGGCTAGAACTTGTAAGTTGTAAACGAGGTAGGCTGAGGCTTTAAGGATGTTTTTGGCTCTAGAAAATTGGAAAGATAATGGGAATTGGAATACTCGTCCTGTCTTAT
TGTTTGATGGGATTGATTTTTGTATTTGTTTCATAAGACAATTTGTTCTATAGAATGTTAATTCCAATGAAAAA
Protein sequence
MGAEALEKRWDTWHELLLGGAILRHGTADWNLVAMELRARIVRPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQDLEHSEDSIGSLESKLEALKSRSGSDK
SLVNGSTRSESWGAVQKPTNELSAGSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLEREKASRIWKLGGVLYENQGGIIRKGSRGKRKRKDCNREVKEGSSGENNL
SESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMAAFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANN
ALIFYSRNSREHQSAVLLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQKEVNPGDVKTPNGNRRRRNNSSNPPSSMGLAKKETSTSTVKKA
PGGTRKAAGWTSKSERSATGIRGRKRGRTK
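
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are illustrative. It is applied only to the first 30 nucleotides of the CDS, not to the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as "X".
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGGGAGCGGAAGCGTTGGAGAAGAGGTGG"
print(translate(cds_start))  # MGAEALEKRW
```

The printed peptide matches the first ten residues of the protein sequence. Joining the wrapped CDS lines into one string and passing it to `translate` would extend the check to the whole sequence.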