CuGenDBv2

CSPI01G11730 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G11730
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Bromo domain-containing protein
Genome location: Chr1:7374310..7378347
RNA-Seq Expression: CSPI01G11730
Synteny: CSPI01G11730
Gene Ontology terms:
GO:0016573 - histone acetylation (biological process)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR001487 - Bromodomain
IPR036427 - Bromodomain-like superfamily
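
The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database: the function name `parse_location` is hypothetical, and it assumes the `Chr:start..end` coordinates are 1-based and inclusive, which this page does not state. Under that assumption the gene spans 4038 bp.

```python
import re


def parse_location(location):
    """Split a string such as 'Chr1:7374310..7378347' into its parts.

    Returns (chromosome, start, end, span). The helper name is hypothetical.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: both coordinates are 1-based and inclusive,
    # so the span is end - start + 1.
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("Chr1:7374310..7378347")
print(chrom, start, end, span)
```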


Homology
GenBank top hits (e value, % identity, alignment)
KAA0056951.1 putative Bromodomain 4 [Cucumis melo var. makuwa] | e value: 2.2e-188 | % identity: 94.39
Query:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRI
        MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL TEETEIKPEPLQSLE GK+ RI
Subjt:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRI

Query:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSAS
        GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS
Subjt:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSAS

Query:  LFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQT
        +FRRRLDSQRRSRYKKLIRQHLDIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPNQRTQT
Subjt:  LFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQT

Query:  CDLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        CDLIAKPRRSQPAKRNESQREANPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  CDLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

XP_004146636.1 uncharacterized protein LOC101217843 isoform X1 [Cucumis sativus] | e value: 9.9e-250 | % identity: 99.36
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKR+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLST+ETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

XP_008442126.1 PREDICTED: uncharacterized protein LOC103486076 isoform X1 [Cucumis melo] | e value: 4.9e-233 | % identity: 94.04
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MGAEALK  WDTWQELLLGGAI+RHGT DWNLVATELRSRIARPY CTPEVCKAKYEDLKKRFVGCKAWYEELR+KRIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL TEETEIKPEPLQSLE GK+ RIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

XP_008442135.1 PREDICTED: uncharacterized protein LOC103486076 isoform X2 [Cucumis melo] | e value: 3.0e-214 | % identity: 88.72
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MGAEALK  WDTWQELLLGGAI+RHGT DWNLVATELRSRIARPY CTPEVCKAKYEDLKKRFVGCK                          SLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL TEETEIKPEPLQSLE GK+ RIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

XP_031736491.1 uncharacterized protein LOC101217843 isoform X2 [Cucumis sativus] | e value: 1.5e-210 | % identity: 87.45
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKR+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLST+ETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDS              
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
                                                  QSAVLLRRLISSTFEKQMKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LV17 Bromo domain-containing protein | e value: 4.8e-250 | % identity: 99.36
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKR+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLST+ETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

A0A1S3B4K1 uncharacterized protein LOC103486076 isoform X2 | e value: 1.5e-214 | % identity: 88.72
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MGAEALK  WDTWQELLLGGAI+RHGT DWNLVATELRSRIARPY CTPEVCKAKYEDLKKRFVGCK                          SLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL TEETEIKPEPLQSLE GK+ RIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

A0A1S3B4Z1 uncharacterized protein LOC103486076 isoform X1 | e value: 2.4e-233 | % identity: 94.04
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MGAEALK  WDTWQELLLGGAI+RHGT DWNLVATELRSRIARPY CTPEVCKAKYEDLKKRFVGCKAWYEELR+KRIMELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL TEETEIKPEPLQSLE GK+ RIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        NPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

A0A5A7UTW9 Putative Bromodomain 4 | e value: 1.1e-188 | % identity: 94.39
Query:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRI
        MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL TEETEIKPEPLQSLE GK+ RI
Subjt:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRI

Query:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSAS
        GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS
Subjt:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSAS

Query:  LFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQT
        +FRRRLDSQRRSRYKKLIRQHLDIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPNQRTQT
Subjt:  LFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQT

Query:  CDLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
        CDLIAKPRRSQPAKRNESQREANPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  CDLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X1 | e value: 5.4e-169 | % identity: 73.58
Query:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MGAEA++  WDTW+ELLLGGAILRHGT DWNLVA ELR+RI RP A TPEVCKAKYEDL+KRFVGCKAWYEELRR+RI+ELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPE--PLQSLERGKASRIGKLGEVLYENQGGIIRKRSR
        ALKSRSG DKSLVN S RSESWG V KPTNELSA SFTQENR TCSS+EC+ AP   +ETEIKPE   L+ LE GK                G ++KRSR
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPE--PLQSLERGKASRIGKLGEVLYENQGGIIRKRSR

Query:  GKRKRKDC--NREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDG--VDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKK
        GKRKRKDC  +R+VKEGS+GENNLSESANPSTVS SK+NSCCNSFE RE SDANEASRSS MDG  VDVLMAAFN VAE+KSA +FRRRLDSQ+R RYKK
Subjt:  GKRKRKDC--NREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDG--VDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKK

Query:  LIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRN
        LIRQHLDIETIRSRVASH ITT+ ELYRDLLLLANNALVFY  N+REH+SAVLLRRLI+STF+K  K        N+  +RTQT D +AKP R QPAKR 
Subjt:  LIRQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRN

Query:  ESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERS-ATGIRGRKRGKTK
        ES++E NPGD KTP GNRRR++N +N  SS+GLAK ETS S +K+ P GTRK+V GTSKSE+S ATG+RGRKRG+TK
Subjt:  ESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERS-ATGIRGRKRGKTK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G61215.1 bromodomain 4 | e value: 3.8e-58 | % identity: 36.06
Query:  WDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLEALKSRSGSD
        W TW+ELLLGGA+LRHGT DW +VA ELRS  + P   TPE+CKAKY+DL+KR+VGCKAW+EEL++KR+ EL+ AL  SEDSIGSLESKL++LKS S +D
Subjt:  WDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLEALKSRSGSD

Query:  KSLVNGSTRSESWGAVQKPTNE---LSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRI--GKLGEVLYENQGGII--RKRSRGKRK
        +   N    S +      P +E      S  T ++ ++  S   Q    +    E K E    +E+ K   +    + E +Y   G ++   ++ RGKRK
Subjt:  KSLVNGSTRSESWGAVQKPTNE---LSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRI--GKLGEVLYENQGGII--RKRSRGKRK

Query:  RKDCN----REVKEGSSGENN--LSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLI
        RKDC+    +EV E S+ E +     SA+ +++ +SKE +          S ++  SR  ++     LM  +NT+A+++ A +FRRRLDSQ+R RYKKL+
Subjt:  RKDCN----REVKEGSSGENN--LSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLI

Query:  RQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNE-
        R+H+D++T++SR+   +I++  EL+RD LL+ANNA +FYS+N+RE++SAV LR +++ +    +  + +   H +      T  ++   + + P+ R   
Subjt:  RQHLDIETIRSRVASHNITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNE-

Query:  --SQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK
           +       +KT   +  + ++  N  S   L      +S   K     RK  G  +     +  + GRKR + +
Subjt:  --SQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIRGRKRGKTK

AT2G42150.1 DNA-binding bromodomain-containing protein | e value: 1.1e-20 | % identity: 28.25
Query:  KMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF------------VGCKAWYEELRRKRIMELRQALEHSEDSIGS
        K  W TW+ELLL  A+ RHGT  WN V+ E++       + T   C+ KY DLK RF            +    W EELR+ R+ ELR+ +E  + SI +
Subjt:  KMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF------------VGCKAWYEELRRKRIMELRQALEHSEDSIGS

Query:  LESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQEN---RTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGG
        L+SK++ L+     + S +   T +E+    +K   E S S     N   +    +I   P  + +E TE + E   S   G  S++   GE        
Subjt:  LESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQEN---RTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGG

Query:  IIRKR--SRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGV---DVLMAAFNTVAEDKSASLFRRRLDS
         + K   +  +R       E+ E   G +   E      ++   ++S     +     D  + S +SA D       L++    +      S F RRL+ 
Subjt:  IIRKR--SRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGV---DVLMAAFNTVAEDKSASLFRRRLDS

Query:  QRRSRYKKLIRQHLDIETIRSRVASHNITT-KMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNM--VAHNTPNQRT----QTC
        Q    Y  +IR+H+D E IR RV      + ++  +RDLLLL NNA VFY R S E + A  L +L+       +K  SN   ++ + P +       + 
Subjt:  QRRSRYKKLIRQHLDIETIRSRVASHNITT-KMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNM--VAHNTPNQRT----QTC

Query:  DLIAKPRRSQP---AKRNESQREANPGDVKTPKGNRRRK
         + +KPR S P   A R  S   A P  +  P  +++ K
Subjt:  DLIAKPRRSQP---AKRNESQREANPGDVKTPKGNRRRK

AT2G44430.1 DNA-binding bromodomain-containing protein | e value: 3.3e-17 | % identity: 26.07
Query:  WDTWQELLLGGAILRHGTADWNLVATELRSRIARPY-ACTPEVCKAKYEDLKKRF---------------------VGCK-AWYEELRRKRIMELRQALE
        W TW+ELLL  A+ RHG  DW+ VATE+RSR +  +   +   C+ KY DLK+RF                     VG    W E+LR  R+ ELR+ +E
Subjt:  WDTWQELLLGGAILRHGTADWNLVATELRSRIARPY-ACTPEVCKAKYEDLKKRF---------------------VGCK-AWYEELRRKRIMELRQALE

Query:  HSEDSIGSLESKLEALKSRS--GSDKSLVNG---STRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEI-KPEPLQSLERGKASRIGK
          + SI SL+ K++ L+     G +K  +       RSE+ G+ +    E + S+  + +R   S  E      + EE  +   EP Q+           
Subjt:  HSEDSIGSLESKLEALKSRS--GSDKSLVNG---STRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEI-KPEPLQSLERGKASRIGK

Query:  LGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKE----NSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKS
                     R+   G  K  D +   K+ ++ E      +  S  S S E     +  + ++ +           SA      L++  + +     
Subjt:  LGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKE----NSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKS

Query:  ASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNI-TTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQM-KSSSNMVAHNTPNQ
         SLF RRL SQ    YK +++QHLDIETI+ ++   +  ++ +  YRDL LL  NA+VF+  +S E  +A  LR ++S    K+  K+   ++       
Subjt:  ASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNI-TTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQM-KSSSNMVAHNTPNQ

Query:  RTQTCD-----------------LIAKPRRSQPAKRNESQRE-ANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSER
        R+   D                 ++ K RRS  AK + S    +   D K    +  + N ++   SS    K     +   K   G  K     SK+  
Subjt:  RTQTCD-----------------LIAKPRRSQPAKRNESQRE-ANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSER

Query:  SATGIRGRKRGKTK
        S      +  GKT+
Subjt:  SATGIRGRKRGKTK

AT3G57980.1 DNA-binding bromodomain-containing protein | e value: 2.2e-13 | % identity: 25.2
Query:  QELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF------------------VGCKAWYEELRRKRIMELRQALEHSEDSIGSL
        +ELLL  A+ RHGT  W+ VA+E+  + +     T   C+ KY DLK+RF                  +    W EELR+ R+ ELR+ +E  + SI SL
Subjt:  QELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF------------------VGCKAWYEELRRKRIMELRQALEHSEDSIGSL

Query:  ESKLEALK---------SRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLY
        + K++ L+           S  D+        +ES      P  EL  S    +N     S     A    E  + +P  +   +  +       G    
Subjt:  ESKLEALK---------SRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLY

Query:  ENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAE-------DKSASL
        E+    + K S     R +  RE  +      ++ ES       ++ +     SF  +E  D ++         V+ +      +++           S 
Subjt:  ENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAE-------DKSASL

Query:  FRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHNITTKMELYRDLLLLANNALVFYSRNSREHQSA----VLLRRLISSTFEKQMKSSSNMVAHNTPNQ
        F RRL++Q  S Y ++IRQH+D E IRSRV   +  T + + +RDLLLL NN  VFY   S E  +A     L+++ +S    KQ        A  T  +
Subjt:  FRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHNITTKMELYRDLLLLANNALVFYSRNSREHQSA----VLLRRLISSTFEKQMKSSSNMVAHNTPNQ

Query:  RTQTCDLIAKPRRSQP---AKRNESQREANPGDVK---------TPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIR
          +   L  KP  S P    ++  S    +P  V           P  + ++ +       S    K   S  M + A   T K VG  +       GI 
Subjt:  RTQTCDLIAKPRRSQP---AKRNESQREANPGDVK---------TPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKAPGGTRKAVGGTSKSERSATGIR

Query:  GRKR
         R R
Subjt:  GRKR

AT3G60110.1 DNA-binding bromodomain-containing protein | e value: 7.1e-20 | % identity: 23.88
Query:  LKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF--------------------VGCKAWYEELRRKRIMELRQAL
        +K +W TW+EL+L  A+ RH  +DW+ VA E+++R       +   C+ KY+DLK+RF                    VG  +W E+LR   + ELR+ +
Subjt:  LKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRF--------------------VGCKAWYEELRRKRIMELRQAL

Query:  EHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVL
        +  +DSI SL+ K++ L+     D    +G  + +      KP          + NR T  S       ++   +    + +   +R    ++ K  E  
Subjt:  EHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVL

Query:  YENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLD
               + K    + + +  ++  +  +SGE  L ES   + + + K          +  S        SA D    L+     +      S+F  RL 
Subjt:  YENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLD

Query:  SQRRSRYKKLIRQHLDIETIRSRVASHN-ITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMV-----AHNTPNQRTQTC
        SQ    YK+LIRQHLD++TI  ++   + +++ +  YRDL LL  NA+VF+  +S E  +A  LR L+S+  +K+     + V       +   Q++   
Subjt:  SQRRSRYKKLIRQHLDIETIRSRVASHN-ITTKMELYRDLLLLANNALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMV-----AHNTPNQRTQTC

Query:  DLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRK----------NNSSNPPSSLGLAKKETSTSMLK
         L+   ++S   K+      +   D K  +     K           +S      + +  K+T T   K
Subjt:  DLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRK----------NNSSNPPSSLGLAKKETSTSMLK
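
The alignment blocks above use a BLAST-style middle line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows one way to count identities from a single block. The function name `alignment_stats` is hypothetical, and it is an assumption that the % identity values listed on this page were computed in this way over the full alignment. The example input is the final seven columns of the first GenBank alignment.

```python
def alignment_stats(query_line, midline):
    """Return (identities, positives, length, percent_identity) for one block.

    query_line: the aligned query residues (gap characters included).
    midline:    the match line printed between the query and subject rows.
    """
    length = len(query_line)
    # Trailing spaces of the midline are often lost in copied text,
    # so pad (or trim) it to the width of the query line.
    midline = midline.ljust(length)[:length]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    percent = 100.0 * identities / length if length else 0.0
    return identities, positives, length, percent


identities, positives, length, pct = alignment_stats("RKRGKTK", "RKRG+TK")
print(identities, positives, length, round(pct, 2))
```

For a whole hit, the identity and length counts would be summed over all of its blocks before taking the percentage.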


Sequences
CDS sequence
ATGGGAGCGGAAGCTTTGAAGATGATGTGGGATACGTGGCAAGAGCTTTTATTAGGTGGCGCCATACTCCGCCACGGTACCGCCGACTGGAACCTCGTCGCCACCGAGCT
CCGGTCCAGGATTGCTCGTCCGTACGCCTGCACCCCCGAGGTTTGTAAGGCGAAATATGAAGACTTGAAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GACGAAAAAGAATCATGGAACTAAGACAGGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTTAAGTCAAGGAGTGGATCAGACAAG
TCTCTTGTCAATGGCTCTACCAGATCAGAATCTTGGGGAGCTGTTCAGAAACCAACCAATGAGCTATCTGCCAGTAGCTTCACGCAGGAAAACAGGACGACATGCAGTTC
GATCGAGTGCCAGCCAGCTCCATTGTCGACCGAAGAGACTGAGATTAAACCAGAACCCTTGCAGTCTCTGGAACGAGGAAAAGCCTCGAGAATTGGGAAGTTGGGAGAGG
TATTGTATGAAAACCAAGGAGGAATAATTAGGAAGAGATCAAGAGGGAAAAGAAAGAGGAAGGATTGTAATAGGGAAGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTG
TCTGAATCAGCTAACCCTTCAACTGTTTCACAGTCTAAAGAAAACTCATGTTGCAACTCGTTTGAGGCACGTGAACCTTCGGATGCAAATGAAGCTAGCAGAAGCTCAGC
CATGGATGGTGTTGATGTTTTAATGGCTGCTTTTAACACTGTTGCAGAGGACAAAAGTGCCTCCCTATTTCGTCGTCGCCTTGATAGTCAGAGGAGAAGTAGATATAAGA
AACTAATCAGGCAACATTTGGATATTGAAACAATAAGATCAAGAGTTGCAAGTCATAACATTACGACAAAAATGGAGTTGTACAGAGATCTGTTGTTGCTTGCTAACAAC
GCACTCGTCTTCTACTCACGGAATTCCCGTGAGCATCAGTCTGCAGTGTTGCTCAGAAGACTCATTTCAAGTACATTTGAGAAGCAAATGAAGAGCTCTAGCAATATGGT
AGCTCATAACACCCCCAACCAGAGAACACAAACCTGTGATCTGATAGCAAAACCGCGTCGTTCGCAGCCAGCTAAACGTAATGAATCCCAAAGAGAAGCCAATCCAGGAG
ATGTTAAAACTCCAAAGGGAAATAGAAGACGAAAAAATAATAGCTCTAATCCTCCTTCCTCGTTGGGGTTGGCAAAGAAAGAAACTTCGACTTCTATGCTAAAGAAAGCC
CCTGGTGGGACGAGAAAGGCTGTCGGTGGGACATCGAAAAGTGAACGATCTGCAACTGGCATCAGGGGAAGGAAAAGAGGGAAAACGAAGTAA
mRNA sequence
ATTTTCTATTTGTTCTTTTGTATTTTGTTTGTCTTTTGCCCTTCTTCGACCCCTTCTATTAAAAGCAATGAGAAACCCCAATTCTCGCCCTCAACAATAATACCCAAAAA
AATTCCATTACGCACAATTCACTTCACACATAACCTTTTTTATTTTTTATTTTTTATTTTTTAAAAACATAGAAACAAAGCAATGAACCCTCCAACGTACCCCCACCCAT
TCCCATCCAAAACAGCCAATTCCTCCGCCGGAAAACCATAACCTCAGACATTTTCCCTTTTTATTTGAAGCCGAGTTAGGGTTCTGGGTCATTAATATGGGAGCGGAAGC
TTTGAAGATGATGTGGGATACGTGGCAAGAGCTTTTATTAGGTGGCGCCATACTCCGCCACGGTACCGCCGACTGGAACCTCGTCGCCACCGAGCTCCGGTCCAGGATTG
CTCGTCCGTACGCCTGCACCCCCGAGGTTTGTAAGGCGAAATATGAAGACTTGAAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTCGACGAAAAAGAATC
ATGGAACTAAGACAGGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTTAAGTCAAGGAGTGGATCAGACAAGTCTCTTGTCAATGG
CTCTACCAGATCAGAATCTTGGGGAGCTGTTCAGAAACCAACCAATGAGCTATCTGCCAGTAGCTTCACGCAGGAAAACAGGACGACATGCAGTTCGATCGAGTGCCAGC
CAGCTCCATTGTCGACCGAAGAGACTGAGATTAAACCAGAACCCTTGCAGTCTCTGGAACGAGGAAAAGCCTCGAGAATTGGGAAGTTGGGAGAGGTATTGTATGAAAAC
CAAGGAGGAATAATTAGGAAGAGATCAAGAGGGAAAAGAAAGAGGAAGGATTGTAATAGGGAAGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCTGAATCAGCTAA
CCCTTCAACTGTTTCACAGTCTAAAGAAAACTCATGTTGCAACTCGTTTGAGGCACGTGAACCTTCGGATGCAAATGAAGCTAGCAGAAGCTCAGCCATGGATGGTGTTG
ATGTTTTAATGGCTGCTTTTAACACTGTTGCAGAGGACAAAAGTGCCTCCCTATTTCGTCGTCGCCTTGATAGTCAGAGGAGAAGTAGATATAAGAAACTAATCAGGCAA
CATTTGGATATTGAAACAATAAGATCAAGAGTTGCAAGTCATAACATTACGACAAAAATGGAGTTGTACAGAGATCTGTTGTTGCTTGCTAACAACGCACTCGTCTTCTA
CTCACGGAATTCCCGTGAGCATCAGTCTGCAGTGTTGCTCAGAAGACTCATTTCAAGTACATTTGAGAAGCAAATGAAGAGCTCTAGCAATATGGTAGCTCATAACACCC
CCAACCAGAGAACACAAACCTGTGATCTGATAGCAAAACCGCGTCGTTCGCAGCCAGCTAAACGTAATGAATCCCAAAGAGAAGCCAATCCAGGAGATGTTAAAACTCCA
AAGGGAAATAGAAGACGAAAAAATAATAGCTCTAATCCTCCTTCCTCGTTGGGGTTGGCAAAGAAAGAAACTTCGACTTCTATGCTAAAGAAAGCCCCTGGTGGGACGAG
AAAGGCTGTCGGTGGGACATCGAAAAGTGAACGATCTGCAACTGGCATCAGGGGAAGGAAAAGAGGGAAAACGAAGTAAATGGTAAAAAGTTCAAAACTATTTCTTGATA
GATTCTCAGGCTAGAACTTGTATGTTGTAAACCAGGTAGACTGAGGCTTTAAGGATGTTTTTGGCTCTAGAAAAATTGGAAAGATAATGGGA
Protein sequence
MGAEALKMMWDTWQELLLGGAILRHGTADWNLVATELRSRIARPYACTPEVCKAKYEDLKKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLEALKSRSGSDK
SLVNGSTRSESWGAVQKPTNELSASSFTQENRTTCSSIECQPAPLSTEETEIKPEPLQSLERGKASRIGKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNL
SESANPSTVSQSKENSCCNSFEAREPSDANEASRSSAMDGVDVLMAAFNTVAEDKSASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHNITTKMELYRDLLLLANN
ALVFYSRNSREHQSAVLLRRLISSTFEKQMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREANPGDVKTPKGNRRRKNNSSNPPSSLGLAKKETSTSMLKKA
PGGTRKAVGGTSKSERSATGIRGRKRGKTK
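
The protein sequence can be checked against the CDS by translating it with the standard genetic code. The sketch below is self-contained and illustrative only: the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the standard nuclear code applies to this gene. The example translates the first 30 nucleotides of the CDS listed above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove line breaks and spaces so wrapped sequence text can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGGAGCGGAAGCTTTGAAGATGATGTGG"))  # MGAEALKMMW
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence, after removing its line breaks, gives a quick consistency check between the two records.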