; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G14369 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G14369
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionBromo domain-containing protein
Genome locationctg1869:6548804..6552838
RNA-Seq ExpressionCucsat.G14369
SyntenyCucsat.G14369
Gene Ontology termsGO:0016573 - histone acetylation (biological process)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001487 - Bromodomain
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056951.1 putative Bromodomain 4 [Cucumis melo var. makuwa]6.05e-23993.88Show/hide
Query:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRI
        MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE GK+ RI
Subjt:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRI

Query:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSAS
        GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS
Subjt:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSAS

Query:  LFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQT
        +FRRRLDSQRRSRYKKLIRQHLDIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPN+RTQT
Subjt:  LFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQT

Query:  CDLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        CDLIAKPRRSQPAKRNESQREANPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  CDLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

XP_004146636.1 uncharacterized protein LOC101217843 isoform X1 [Cucumis sativus]0.0100Show/hide
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE
        MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

XP_008442126.1 PREDICTED: uncharacterized protein LOC103486076 isoform X1 [Cucumis melo]1.48e-29593.4Show/hide
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE
        MGAEALK  WDTWQELLLGGAI+RHGT DWNLVATELRSRIARPY CTPEVCKAKYEDLKKRFVGCKAWYEELR+KR+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE GK+ RIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

XP_008442135.1 PREDICTED: uncharacterized protein LOC103486076 isoform X2 [Cucumis melo]3.20e-27288.3Show/hide
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE
        MGAEALK  WDTWQELLLGGAI+RHGT DWNLVATELRSRIARPY CTPEVCKAKYEDLKKRFVGCK                          SLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE GK+ RIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

XP_031736491.1 uncharacterized protein LOC101217843 isoform X2 [Cucumis sativus]1.03e-27088.09Show/hide
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE
        MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQ             
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA
                                                   SAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

TrEMBL top hitse value%identityAlignment
A0A0A0LV17 Bromo domain-containing protein0.0100Show/hide
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE
        MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

A0A1S3B4K1 uncharacterized protein LOC103486076 isoform X21.55e-27288.3Show/hide
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE
        MGAEALK  WDTWQELLLGGAI+RHGT DWNLVATELRSRIARPY CTPEVCKAKYEDLKKRFVGCK                          SLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE GK+ RIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

A0A1S3B4Z1 uncharacterized protein LOC103486076 isoform X17.15e-29693.4Show/hide
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE
        MGAEALK  WDTWQELLLGGAI+RHGT DWNLVATELRSRIARPY CTPEVCKAKYEDLKKRFVGCKAWYEELR+KR+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE GK+ RIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

A0A5A7UTW9 Putative Bromodomain 42.93e-23993.88Show/hide
Query:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRI
        MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE GK+ RI
Subjt:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRI

Query:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSAS
        GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS
Subjt:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSAS

Query:  LFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQT
        +FRRRLDSQRRSRYKKLIRQHLDIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPN+RTQT
Subjt:  LFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQT

Query:  CDLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        CDLIAKPRRSQPAKRNESQREANPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  CDLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X19.13e-21373.58Show/hide
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE
        MGAEA++  WDTW+ELLLGGAILRHGT DWNLVA ELR+RI RP A TPEVCKAKYEDL+KRFVGCKAWYEELRR+R++ELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEP--LQSLERGKASRIGKLGEVLYENQGGIIRKRSR
        ALKSRSG DKSLVN S RSESWG V KPTNELSA SFTQENRT CSS+EC+ AP    ETEIKPE   L+ LE GK                G ++KRSR
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEP--LQSLERGKASRIGKLGEVLYENQGGIIRKRSR

Query:  GKRKRKDCN--REVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDV--LMAAFNTVAEDKSASLFRRRLDSQRRSRYKK
        GKRKRKDC+  R+VKEGS+GENNLSESANPSTVS SK+NSCCNSFE RE SDANEASRSS MDGVDV  LMAAFN VAE+KSA +FRRRLDSQ+R RYKK
Subjt:  GKRKRKDCN--REVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDV--LMAAFNTVAEDKSASLFRRRLDSQRRSRYKK

Query:  LIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRN
        LIRQHLDIETIRSRVASH ITT+ ELYRDLLLLANNALVFY  N+REH+SAVLLRRLI+STF+K  K+S          KRTQT D +AKP R QPAKR 
Subjt:  LIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRN

Query:  ESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSA-TGIRGRKRGKTK
        ES++E NPGD KTP GNRRR++N+ N  SS+GLAK ETS S +K+ P GTRK+V GTSKSE+SA TG+RGRKRG+TK
Subjt:  ESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSA-TGIRGRKRGKTK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G61215.1 bromodomain 41.9e-5736.06Show/hide
Query:  WDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLEALKSRSGSD
        W TW+ELLLGGA+LRHGT DW +VA ELRS  + P   TPE+CKAKY+DL+KR+VGCKAW+EEL++KR+ EL+ AL  SEDSIGSLESKL++LKS S +D
Subjt:  WDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLEALKSRSGSD

Query:  KSLVNGSTRSESWGAVQKPTNE---LSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRI--GKLGEVLYENQGGII--RKRSRGKRK
        +   N    S +      P +E      S  T ++ ++  S   Q    +    E K E    +E+ K   +    + E +Y   G ++   ++ RGKRK
Subjt:  KSLVNGSTRSESWGAVQKPTNE---LSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRI--GKLGEVLYENQGGII--RKRSRGKRK

Query:  RKDCN----REVKEGSSGENN--LSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLI
        RKDC+    +EV E S+ E +     SA+ +++ +SKE +          S ++  SR  ++     LM  +NT+A+++ A +FRRRLDSQ+R RYKKL+
Subjt:  RKDCN----REVKEGSSGENN--LSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLI

Query:  RQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNE-
        R+H+D++T++SR+   +I++  EL+RD LL+ANNA +FYS+N+RE++SAV LR +++ +    +  + +   H +      T  ++   + + P+ R   
Subjt:  RQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNE-

Query:  --SQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
           +       +KT   +  + ++  N  S   L      +S   K     RK  G  +     +  + GRKR + +
Subjt:  --SQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

AT2G42150.1 DNA-binding bromodomain-containing protein1.2e-1928.02Show/hide
Query:  KMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF------------VGCKAWYEELRRKRMMELRQALEHSEDSIGS
        K  W TW+ELLL  A+ RHGT  WN V+ E++       + T   C+ KY DLK RF            +    W EELR+ R+ ELR+ +E  + SI +
Subjt:  KMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF------------VGCKAWYEELRRKRMMELRQALEHSEDSIGS

Query:  LESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQEN---RTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGG
        L+SK++ L+     + S +   T +E+    +K   E S S     N   +    +I   P  + ++ TE + E   S   G  S++   GE        
Subjt:  LESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQEN---RTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGG

Query:  IIRKR--SRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGV---DVLMAAFNTVAEDKSASLFRRRLDS
         + K   +  +R       E+ E   G +   E      ++   ++S     +     D  + S +SA D       L++    +      S F RRL+ 
Subjt:  IIRKR--SRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGV---DVLMAAFNTVAEDKSASLFRRRLDS

Query:  QRRSRYKKLIRQHLDIETIRSRVASHNITT-KMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNM--VAHNTPNKRT----QTC
        Q    Y  +IR+H+D E IR RV      + ++  +RDLLLL NNA VFY R S E + A  L +L+       +K  SN   ++ + P +       + 
Subjt:  QRRSRYKKLIRQHLDIETIRSRVASHNITT-KMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNM--VAHNTPNKRT----QTC

Query:  DLIAKPRRSQP---AKRNESQREANPGDVKTPKGNRRRK
         + +KPR S P   A R  S   A P  +  P  +++ K
Subjt:  DLIAKPRRSQP---AKRNESQREANPGDVKTPKGNRRRK

AT2G44430.1 DNA-binding bromodomain-containing protein6.2e-1625.88Show/hide
Query:  WDTWQELLLGGAILRHGTADWNLVATELRSRIARPY-ACTPEVCKAKYEDLKKRF---------------------VGCK-AWYEELRRKRMMELRQALE
        W TW+ELLL  A+ RHG  DW+ VATE+RSR +  +   +   C+ KY DLK+RF                     VG    W E+LR  R+ ELR+ +E
Subjt:  WDTWQELLLGGAILRHGTADWNLVATELRSRIARPY-ACTPEVCKAKYEDLKKRF---------------------VGCK-AWYEELRRKRMMELRQALE

Query:  HSEDSIGSLESKLEALKSRS--GSDKSLVNG---STRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEI-KPEPLQSLERGKASRIGK
          + SI SL+ K++ L+     G +K  +       RSE+ G+ +    E + S+  + +R   S  E      + +E  +   EP Q+           
Subjt:  HSEDSIGSLESKLEALKSRS--GSDKSLVNG---STRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEI-KPEPLQSLERGKASRIGK

Query:  LGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKE----NSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKS
                     R+   G  K  D +   K+ ++ E      +  S  S S E     +  + ++ +           SA      L++  + +     
Subjt:  LGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKE----NSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKS

Query:  ASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNI-TTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQM-KSSSNMVAHNTPNK
         SLF RRL SQ    YK +++QHLDIETI+ ++   +  ++ +  YRDL LL  NA+VF+  +S E  +A  LR ++S    K+  K+   ++       
Subjt:  ASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNI-TTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQM-KSSSNMVAHNTPNK

Query:  RTQTCD-----------------LIAKPRRSQPAKRNESQRE-ANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSER
        R+   D                 ++ K RRS  AK + S    +   D K    +  + N ++   SS    K     +   K   G  K     SK+  
Subjt:  RTQTCD-----------------LIAKPRRSQPAKRNESQRE-ANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSER

Query:  SATGIRGRKRGKTK
        S      +  GKT+
Subjt:  SATGIRGRKRGKTK

AT3G57980.1 DNA-binding bromodomain-containing protein6.4e-1325.73Show/hide
Query:  QELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF------------------VGCKAWYEELRRKRMMELRQALEHSEDSIGSL
        +ELLL  A+ RHGT  W+ VA+E+  + +     T   C+ KY DLK+RF                  +    W EELR+ R+ ELR+ +E  + SI SL
Subjt:  QELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF------------------VGCKAWYEELRRKRMMELRQALEHSEDSIGSL

Query:  ESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIK--PEPLQS---LERGKASRIGKLGEVLYENQG
        + K++ L+     +KSL             +   ++L   + T+EN T   +    P       TE+K  P+P  +         +R  K+ E + E   
Subjt:  ESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIK--PEPLQS---LERGKASRIGKLGEVLYENQG

Query:  GIIRKRSRGKRKRKD-----CNREVKEGSSGE------------NNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAE
         I  + +  K  R+D     C    KE    E             ++ ES       ++ +     SF  +E  D ++         V+ +      +++
Subjt:  GIIRKRSRGKRKRKD-----CNREVKEGSSGE------------NNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAE

Query:  -------DKSASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHNITTKMELYRDLLLLANNALVFYSRNSREHQSA----VLLRRLISSTFEKQMK
                   S F RRL++Q  S Y ++IRQH+D E IRSRV   +  T + + +RDLLLL NN  VFY   S E  +A     L+++ +S    KQ  
Subjt:  -------DKSASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHNITTKMELYRDLLLLANNALVFYSRNSREHQSA----VLLRRLISSTFEKQMK

Query:  SSSNMVAHNTPNKRTQTCDLIAKPRRSQP---AKRNESQREANPGDVK---------TPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVG
              A  T  +  +   L  KP  S P    ++  S    +P  V           P  + ++ +       S    K   S  M + A   T K VG
Subjt:  SSSNMVAHNTPNKRTQTCDLIAKPRRSQP---AKRNESQREANPGDVK---------TPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVG

Query:  GTSKSERSATGIRGRKR
          +       GI  R R
Subjt:  GTSKSERSATGIRGRKR

AT3G60110.1 DNA-binding bromodomain-containing protein9.2e-2023.88Show/hide
Query:  LKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF--------------------VGCKAWYEELRRKRMMELRQAL
        +K +W TW+EL+L  A+ RH  +DW+ VA E+++R       +   C+ KY+DLK+RF                    VG  +W E+LR   M ELR+ +
Subjt:  LKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF--------------------VGCKAWYEELRRKRMMELRQAL

Query:  EHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVL
        +  +DSI SL+ K++ L+     D    +G  + +      KP          + NR T  S       ++   +    + +   +R    ++ K  E  
Subjt:  EHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVL

Query:  YENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLD
               + K    + + +  ++  +  +SGE  L ES   + + + K          +  S        SA D    L+     +      S+F  RL 
Subjt:  YENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLD

Query:  SQRRSRYKKLIRQHLDIETIRSRVASHN-ITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMV-----AHNTPNKRTQTC
        SQ    YK+LIRQHLD++TI  ++   + +++ +  YRDL LL  NA+VF+  +S E  +A  LR L+S+  +K+     + V       +   +++   
Subjt:  SQRRSRYKKLIRQHLDIETIRSRVASHN-ITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMV-----AHNTPNKRTQTC

Query:  DLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRK----------NNSSNPPSSLGLAKKETSTSMLK
         L+   ++S   K+      +   D K  +     K           +S      + +  K+T T   K
Subjt:  DLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRK----------NNSSNPPSSLGLAKKETSTSMLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCGGAAGCATTGAAGATGATGTGGGATACGTGGCAAGAGCTTTTATTAGGTGGCGCCATACTCCGCCACGGTACCGCCGACTGGAACCTCGTCGCCACCGAGCT
CCGGTCCAGGATTGCTCGTCCGTACGCCTGCACCCCCGAGGTTTGTAAGGCGAAATATGAAGACTTGAAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GACGAAAAAGAATGATGGAACTAAGACAGGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTTAAGTCAAGGAGTGGATCAGACAAG
TCTCTTGTCAATGGCTCTACCAGATCAGAATCTTGGGGAGCTGTTCAGAAGCCAACGAATGAGCTATCTGCCAGTAGCTTCACGCAGGAAAACAGGACGACATGCAGTTC
GATCGAGTGCCAGCCAGCTCCATTGTCGACCAAAGAGACTGAGATTAAACCAGAACCCTTGCAGTCTCTGGAACGAGGAAAAGCCTCGAGAATTGGGAAGTTGGGAGAGG
TATTGTATGAAAACCAAGGAGGAATAATTAGGAAGAGATCAAGAGGGAAAAGAAAGAGGAAGGATTGTAATAGGGAAGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTG
TCTGAATCAGCTAACCCTTCAACTGTTTCACAGTCTAAAGAAAACTCATGTTGCAACTCGTTTGAGGCACGTGAACCTTCGGATGCAAATGAAGCTAGCAGAAGCTCAGC
CATGGATGGTGTTGATGTTTTAATGGCTGCTTTTAACACTGTTGCAGAGGACAAAAGTGCCTCCCTATTTCGTCGTCGCCTTGATAGTCAGAGGAGAAGTAGATATAAGA
AACTAATCAGGCAACATTTGGATATTGAAACAATAAGATCAAGAGTTGCAAGTCATAACATTACGACAAAAATGGAGTTGTACAGAGATCTGTTGTTGCTTGCTAACAAC
GCACTCGTCTTCTACTCACGGAATTCCCGTGAGCATCAGTCTGCAGTGTTGCTCAGAAGACTCATTTCAAGTACATTTGAGAAGCAAATGAAGAGCTCTAGCAATATGGT
AGCTCATAACACCCCCAACAAGAGAACACAAACCTGTGATCTGATAGCAAAACCGCGTCGTTCGCAGCCAGCTAAACGTAATGAATCCCAAAGAGAAGCCAATCCAGGAG
ATGTTAAAACTCCAAAGGGAAATAGAAGAAGAAAAAATAATAGCTCTAATCCTCCTTCCTCGTTGGGGTTGGCAAAGAAAGAAACTTCGACTTCTATGCTAAAGAAAGCC
CCTGGTGGGACGAGAAAGGCTGTCGGTGGGACATCGAAAAGTGAACGATCTGCAACTGGCATCAGGGGAAGGAAAAGAGGGAAAACGAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGCGGAAGCATTGAAGATGATGTGGGATACGTGGCAAGAGCTTTTATTAGGTGGCGCCATACTCCGCCACGGTACCGCCGACTGGAACCTCGTCGCCACCGAGCT
CCGGTCCAGGATTGCTCGTCCGTACGCCTGCACCCCCGAGGTTTGTAAGGCGAAATATGAAGACTTGAAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GACGAAAAAGAATGATGGAACTAAGACAGGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTTAAGTCAAGGAGTGGATCAGACAAG
TCTCTTGTCAATGGCTCTACCAGATCAGAATCTTGGGGAGCTGTTCAGAAGCCAACGAATGAGCTATCTGCCAGTAGCTTCACGCAGGAAAACAGGACGACATGCAGTTC
GATCGAGTGCCAGCCAGCTCCATTGTCGACCAAAGAGACTGAGATTAAACCAGAACCCTTGCAGTCTCTGGAACGAGGAAAAGCCTCGAGAATTGGGAAGTTGGGAGAGG
TATTGTATGAAAACCAAGGAGGAATAATTAGGAAGAGATCAAGAGGGAAAAGAAAGAGGAAGGATTGTAATAGGGAAGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTG
TCTGAATCAGCTAACCCTTCAACTGTTTCACAGTCTAAAGAAAACTCATGTTGCAACTCGTTTGAGGCACGTGAACCTTCGGATGCAAATGAAGCTAGCAGAAGCTCAGC
CATGGATGGTGTTGATGTTTTAATGGCTGCTTTTAACACTGTTGCAGAGGACAAAAGTGCCTCCCTATTTCGTCGTCGCCTTGATAGTCAGAGGAGAAGTAGATATAAGA
AACTAATCAGGCAACATTTGGATATTGAAACAATAAGATCAAGAGTTGCAAGTCATAACATTACGACAAAAATGGAGTTGTACAGAGATCTGTTGTTGCTTGCTAACAAC
GCACTCGTCTTCTACTCACGGAATTCCCGTGAGCATCAGTCTGCAGTGTTGCTCAGAAGACTCATTTCAAGTACATTTGAGAAGCAAATGAAGAGCTCTAGCAATATGGT
AGCTCATAACACCCCCAACAAGAGAACACAAACCTGTGATCTGATAGCAAAACCGCGTCGTTCGCAGCCAGCTAAACGTAATGAATCCCAAAGAGAAGCCAATCCAGGAG
ATGTTAAAACTCCAAAGGGAAATAGAAGAAGAAAAAATAATAGCTCTAATCCTCCTTCCTCGTTGGGGTTGGCAAAGAAAGAAACTTCGACTTCTATGCTAAAGAAAGCC
CCTGGTGGGACGAGAAAGGCTGTCGGTGGGACATCGAAAAGTGAACGATCTGCAACTGGCATCAGGGGAAGGAAAAGAGGGAAAACGAAGTAA
Protein sequenceShow/hide protein sequence
MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRMMELRQALEHSEDSIGSLESKLEALKSRSGSDK
SLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTKETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNL
SESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNITTKMELYRDLLLLANN
ALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNKRTQTCDLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKA
PGGTRKAVGGTSKSERSATGIRGRKRGKTK