; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0019723 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0019723
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionBromo domain-containing protein
Genome locationchr12:21537427..21541333
RNA-Seq ExpressionIVF0019723
SyntenyIVF0019723
Gene Ontology termsGO:0016573 - histone acetylation (biological process)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056951.1 putative Bromodomain 4 [Cucumis melo var. makuwa]1.09e-257100Show/hide
Query:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRI
        MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRI
Subjt:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRI

Query:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSAS
        GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSAS
Subjt:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSAS

Query:  VFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQT
        VFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQT
Subjt:  VFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQT

Query:  CDLIAKPRRSQPAKRNESQREANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
        CDLIAKPRRSQPAKRNESQREANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
Subjt:  CDLIAKPRRSQPAKRNESQREANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK

XP_004146636.1 uncharacterized protein LOC101217843 isoform X1 [Cucumis sativus]4.92e-29493.4Show/hide
Query:  MGAEALKR-WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLE
        MGAEALK  WDTWQELLLGGAI+RHGT DWNLVATELRSRIARPY CTPEVCKAKYEDLKKRFVGCKAWYEELR+KR+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKR-WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE GK+ RIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
        NPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK

XP_008442126.1 PREDICTED: uncharacterized protein LOC103486076 isoform X1 [Cucumis melo]0.0100Show/hide
Query:  MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEA
        MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEA
Subjt:  MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEA

Query:  LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR
        LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR
Subjt:  LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR

Query:  KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD
        KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD
Subjt:  KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD

Query:  IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN
        IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN
Subjt:  IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN

Query:  PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
        PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
Subjt:  PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK

XP_008442135.1 PREDICTED: uncharacterized protein LOC103486076 isoform X2 [Cucumis melo]3.08e-29594.46Show/hide
Query:  MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEA
        MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCK                          SLESKLEA
Subjt:  MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEA

Query:  LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR
        LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR
Subjt:  LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR

Query:  KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD
        KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD
Subjt:  KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD

Query:  IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN
        IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN
Subjt:  IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN

Query:  PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
        PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
Subjt:  PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK

XP_031736491.1 uncharacterized protein LOC101217843 isoform X2 [Cucumis sativus]9.25e-24681.91Show/hide
Query:  MGAEALKR-WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLE
        MGAEALK  WDTWQELLLGGAI+RHGT DWNLVATELRSRIARPY CTPEVCKAKYEDLKKRFVGCKAWYEELR+KR+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALKR-WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE GK+ RIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS+FRRRLDSQ             
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
                                                   SAV LRRLISSTF+K MKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
        NPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK

TrEMBL top hitse value%identityAlignment
A0A0A0LV17 Bromo domain-containing protein4.5e-23293.4Show/hide
Query:  MGAEALK-RWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLE
        MGAEALK  WDTWQELLLGGAI+RHGT DWNLVATELRSRIARPY CTPEVCKAKYEDLKKRFVGCKAWYEELR+KR+MELRQALEHSEDSIGSLESKLE
Subjt:  MGAEALK-RWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGK
        ALKSRSGSDKSLVNGSTRSESWGAVQKPTNE SASSFTQENRTTCSSIECQPAPL T+ETEIKPEPLQSLE GK+ RIGKLGEVLYENQGGIIRKRSRGK
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARE SDANEASRSS MDGVDVLMA FN+VAEDKSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA
        DIETIRSRVASH ITTK ELYRDLLLLANNALVFYSRNSREHQSAV LRRLISSTF+K MKSSSNMVAHNTPN+RTQTCDLIAKPRRSQPAKRNESQREA
Subjt:  DIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREA

Query:  NPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
        NPGDVKTP GNRRR+NNSSNPPSSLGL+KKETSTS  KKAPGG RKAVGGTSKSERSATGIRGRKRG+TK
Subjt:  NPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK

A0A1S3B4K1 uncharacterized protein LOC103486076 isoform X21.5e-23294.46Show/hide
Query:  MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEA
        MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCK                          SLESKLEA
Subjt:  MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEA

Query:  LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR
        LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR
Subjt:  LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR

Query:  KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD
        KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD
Subjt:  KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD

Query:  IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN
        IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN
Subjt:  IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN

Query:  PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
        PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
Subjt:  PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK

A0A1S3B4Z1 uncharacterized protein LOC103486076 isoform X18.7e-252100Show/hide
Query:  MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEA
        MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEA
Subjt:  MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEA

Query:  LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR
        LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR
Subjt:  LKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKR

Query:  KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD
        KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD
Subjt:  KRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLD

Query:  IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN
        IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN
Subjt:  IETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREAN

Query:  PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
        PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
Subjt:  PGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK

A0A5A7UTW9 Putative Bromodomain 41.3e-202100Show/hide
Query:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRI
        MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRI
Subjt:  MELRQALEHSEDSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRI

Query:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSAS
        GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSAS
Subjt:  GKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSAS

Query:  VFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQT
        VFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQT
Subjt:  VFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQT

Query:  CDLIAKPRRSQPAKRNESQREANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
        CDLIAKPRRSQPAKRNESQREANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
Subjt:  CDLIAKPRRSQPAKRNESQREANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X16.8e-17274.84Show/hide
Query:  MGAEAL-KRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLE
        MGAEA+ KRWDTW+ELLLGGAI+RHGT DWNLVA ELR+RI RP   TPEVCKAKYEDL+KRFVGCKAWYEELR++RI+ELR+ALEHSEDSIGSLESKLE
Subjt:  MGAEAL-KRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPE--PLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSR
        ALKSRSG DKSLVN S RSESWG V KPTNE SA SFTQENR TCSS+EC+ AP L +ETEIKPE   L+ LEWGK                G ++KRSR
Subjt:  ALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPE--PLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSR

Query:  GKRKRKDC--NREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDG--VDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKK
        GKRKRKDC  +R+VKEGS+GENNLSESANPSTVS SK+NSCCNSFE RESSDANEASRSSTMDG  VDVLMA FN+VAE+KSA VFRRRLDSQ+R RYKK
Subjt:  GKRKRKDC--NREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDG--VDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKK

Query:  LIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRN
        LIRQHLDIETIRSRVASHYITT+KELYRDLLLLANNALVFY  N+REH+SAV LRRLI+STFQKL K        N+  +RTQT D +AKP R QPAKR 
Subjt:  LIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRN

Query:  ESQREANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERS-ATGIRGRKRGRTK
        ES++E NPGD KTP+GNRRRR+N +N  SS+GL+K ETS ST K+ P G RK+V GTSKSE+S ATG+RGRKRGRTK
Subjt:  ESQREANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERS-ATGIRGRKRGRTK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G61215.1 bromodomain 41.7e-6137.58Show/hide
Query:  WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEALKSRSG--
        W TW+ELLLGGA++RHGTGDW +VA ELRS  + P + TPE+CKAKY+DL+KR+VGCKAW+EEL++KR+ EL+ AL  SEDSIGSLESKL++LKS S   
Subjt:  WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEALKSRSG--

Query:  -------SDKSL-VNGSTRSESWG-AVQKPTNE--QSASSFTQENRTTCS-SIECQ-PAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGII-
               S ++L +  S +SE  G    K T++   S  SFTQ+  TT + S E +  AP++ E+           E  K L    + E +Y   G ++ 
Subjt:  -------SDKSL-VNGSTRSESWG-AVQKPTNE--QSASSFTQENRTTCS-SIECQ-PAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGII-

Query:  -RKRSRGKRKRKDCN----REVKEGSSGENN--LSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDS
          ++ RGKRKRKDC+    +EV E S+ E +     SA+ +++ +SKE           +S ++  SR  ++     LM ++N++A+++ A VFRRRLDS
Subjt:  -RKRSRGKRKRKDCN----REVKEGSSGENN--LSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDS

Query:  QRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPR
        Q+R RYKKL+R+H+D++T++SR+    I++ KEL+RD LL+ANNA +FYS+N+RE++SAV LR +++ + +  +  + +   H +      T  ++   +
Subjt:  QRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPR

Query:  RSQPAKRNE---SQREANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK
         + P+ R      +       +KT   +  + ++  N  S   L      +S   K    +RK  G  +     +  + GRKR R +
Subjt:  RSQPAKRNE---SQREANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKSERSATGIRGRKRGRTK

AT2G42150.1 DNA-binding bromodomain-containing protein9.2e-2028.24Show/hide
Query:  WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLC--TPEVCKAKYEDLKKRF------------VGCKAWYEELRQKRIMELRQALEHSEDSIGSL
        W TW+ELLL  A+ RHGT  WN V+ E++     P LC  T   C+ KY DLK RF            +    W EELR+ R+ ELR+ +E  + SI +L
Subjt:  WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLC--TPEVCKAKYEDLKKRF------------VGCKAWYEELRQKRIMELRQALEHSEDSIGSL

Query:  ESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQEN-RTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKL-GEVLYENQGGII
        +SK++ L+     + S +   T +E+    +K     S         +    +I   P  + +E TE + E +     G+S    KL GE         +
Subjt:  ESKLEALKSRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQEN-RTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKL-GEVLYENQGGII

Query:  RKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYK
         K      +R +    V E    E+  S     ++  QS  +         +  D +  S          L++    +      S F RRL+ Q    Y 
Subjt:  RKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYK

Query:  KLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNM--VAHNTPNQRT----QTCDLIAKPR
         +IR+H+D E IR RV    Y + +   +RDLLLL NNA VFY R S E + A  L +L+       +K  SN   ++ + P +       +  + +KPR
Subjt:  KLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNM--VAHNTPNQRT----QTCDLIAKPR

Query:  RSQP---AKRNESQREANPGDVKTPNGNRRRR
         S P   A R  S   A P  +  P  +++ +
Subjt:  RSQP---AKRNESQREANPGDVKTPNGNRRRR

AT2G44430.1 DNA-binding bromodomain-containing protein2.3e-1826.74Show/hide
Query:  WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPY-LCTPEVCKAKYEDLKKRF---------------------VGCK-AWYEELRQKRIMELRQALE
        W TW+ELLL  A+ RHG GDW+ VATE+RSR +  + L +   C+ KY DLK+RF                     VG    W E+LR  R+ ELR+ +E
Subjt:  WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPY-LCTPEVCKAKYEDLKKRF---------------------VGCK-AWYEELRQKRIMELRQALE

Query:  HSEDSIGSLESKLEALKSRS--GSDKSLVNG---STRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEI-KPEPLQSLEWGKSLRIGK
          + SI SL+ K++ L+     G +K  +       RSE+ G+ +    E++ S+  + +R   S  E        EE  +   EP Q+           
Subjt:  HSEDSIGSLESKLEALKSRS--GSDKSLVNG---STRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEI-KPEPLQSLEWGKSLRIGK

Query:  LGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEA------RESSDANEASRSSTMDGVDVLMALFNSVAED
                     R+   G  K  D +   K+ ++ E      +  S  S S E     + E+      R+   A E   + +      L++L + +   
Subjt:  LGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEA------RESSDANEASRSSTMDGVDVLMALFNSVAED

Query:  KSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQK-LMKSSSNMVAHNTP
           S+F RRL SQ    YK +++QHLDIETI+ ++    Y ++    YRDL LL  NA+VF+  +S E  +A  LR ++S   +K   K+   ++     
Subjt:  KSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQK-LMKSSSNMVAHNTP

Query:  NQRTQTCD-----------------LIAKPRRSQPAKRNESQRE-ANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKS
          R+   D                 ++ K RRS  AK + S    +   D K    +  + N ++   SS   +K     +   K   G  K     SK+
Subjt:  NQRTQTCD-----------------LIAKPRRSQPAKRNESQRE-ANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAPGGIRKAVGGTSKS

Query:  ERSATGIRGRKRGRTK
          S      +  G+T+
Subjt:  ERSATGIRGRKRGRTK

AT3G57980.1 DNA-binding bromodomain-containing protein2.0e-1928.76Show/hide
Query:  QELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRF------------------VGCKAWYEELRQKRIMELRQALEHSEDSIGSL
        +ELLL  A+ RHGT  W+ VA+E+  + +     T   C+ KY DLK+RF                  +    W EELR+ R+ ELR+ +E  + SI SL
Subjt:  QELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRF------------------VGCKAWYEELRQKRIMELRQALEHSEDSIGSL

Query:  ESKLEALK---------SRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLY
        + K++ L+           S  D+        +ES      P  E   S    +N     S     A  + E  + +P            RIG  GE   
Subjt:  ESKLEALK---------SRSGSDKSLVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLY

Query:  ENQGGIIRKRSRGKRKRKDCNREVKEGSSGE------------NNLSESANPSTVSQSKENSCCNSFEARESSDANE---ASRSSTMDGVDV----LMAL
        +N      K +R    R  C    KE    E             ++ ES       ++ +     SF  +E+ D ++     +S T++ + V    L   
Subjt:  ENQGGIIRKRSRGKRKRKDCNREVKEGSSGE------------NNLSESANPSTVSQSKENSCCNSFEARESSDANE---ASRSSTMDGVDV----LMAL

Query:  FNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLI
           +      S F RRL++Q  S Y ++IRQH+D E IRSRV   +Y T + + +RDLLLL NN  VFY   S E  +A  L +LI
Subjt:  FNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLI

AT3G60110.1 DNA-binding bromodomain-containing protein1.7e-2124.94Show/hide
Query:  WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRF--------------------VGCKAWYEELRQKRIMELRQALEHSE
        W TW+EL+L  A+ RH   DW+ VA E+++R     + +   C+ KY+DLK+RF                    VG  +W E+LR   + ELR+ ++  +
Subjt:  WDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRF--------------------VGCKAWYEELRQKRIMELRQALEHSE

Query:  DSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKP--TNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYE
        DSI SL+ K++ L+     D    +G  + +      KP   N ++  S   +NR+   S        + +   +  +           ++ K  E    
Subjt:  DSIGSLESKLEALKSRSGSDKSLVNGSTRSESWGAVQKP--TNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYE

Query:  NQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQ
             + K    + + +  ++  +  +SGE  L ES   + + + K          +  S        S  D    L+ +   +      SVF  RL SQ
Subjt:  NQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLSESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQ

Query:  RRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMV-----AHNTPNQRTQTCDL
            YK+LIRQHLD++TI  ++    Y+++    YRDL LL  NA+VF+  +S E  +A  LR L+S+  +K      + V       +   Q++    L
Subjt:  RRSRYKKLIRQHLDIETIRSRV-ASHYITTKKELYRDLLLLANNALVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMV-----AHNTPNQRTQTCDL

Query:  IAKPRRSQPAKRNESQREANPGDVK
        +   ++S   K+      +   D K
Subjt:  IAKPRRSQPAKRNESQREANPGDVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCGGAAGCATTGAAGAGGTGGGATACGTGGCAGGAGCTTTTATTAGGTGGCGCCATTGTCCGCCACGGTACCGGTGACTGGAACCTCGTCGCCACGGAGCTCCG
GTCGAGGATTGCTCGTCCGTACCTCTGCACTCCCGAGGTTTGTAAGGCGAAATATGAAGACTTGAAGAAGCGTTTCGTTGGATGCAAAGCTTGGTATGAGGAGCTTCGAC
AAAAAAGAATCATGGAACTAAGACAGGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTTAAGTCAAGGAGTGGATCAGACAAGTCT
CTTGTCAATGGCTCTACCAGGTCAGAATCTTGGGGAGCTGTTCAGAAACCAACCAATGAGCAATCTGCCAGTAGCTTCACGCAGGAAAACAGGACGACCTGCAGTTCAAT
CGAGTGCCAGCCAGCTCCATTGTTGACTGAAGAGACAGAGATTAAACCAGAACCATTGCAATCTCTCGAATGGGGAAAATCCTTGAGAATTGGGAAGTTGGGAGAGGTAT
TGTATGAAAACCAAGGAGGAATAATTAGGAAGAGATCAAGAGGGAAGAGAAAGAGGAAGGACTGTAATAGGGAAGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCT
GAATCAGCTAACCCTTCAACTGTTTCACAGTCTAAAGAAAACTCATGTTGCAACTCGTTTGAGGCACGTGAATCTTCGGATGCAAATGAAGCTAGCAGAAGCTCAACCAT
GGATGGTGTTGATGTTTTAATGGCTCTTTTTAACTCTGTTGCAGAGGACAAAAGTGCCTCCGTATTTCGTCGTCGCCTTGATAGTCAGAGGAGAAGTAGATATAAGAAAC
TAATCAGGCAACATTTGGATATTGAAACAATAAGATCAAGAGTTGCAAGTCATTACATAACGACTAAAAAGGAGTTGTACAGAGATCTGTTGTTGCTTGCTAACAACGCA
CTCGTCTTCTACTCACGGAATTCCCGGGAGCATCAGTCTGCAGTGTCGCTCAGAAGACTCATTTCAAGTACATTTCAGAAGCTAATGAAGAGCTCTAGCAATATGGTAGC
TCATAACACCCCCAACCAGAGAACACAAACCTGCGATCTGATAGCGAAACCGCGTCGTTCGCAGCCAGCTAAACGTAATGAATCTCAAAGAGAAGCCAATCCAGGAGATG
TTAAAACTCCAAATGGAAATAGAAGAAGAAGAAATAATAGTTCTAATCCTCCTTCCTCATTGGGGTTGTCAAAGAAGGAAACTTCGACTTCTACGCCAAAGAAAGCCCCT
GGTGGGATAAGAAAGGCTGTCGGTGGGACATCGAAAAGCGAACGATCTGCAACTGGCATCAGGGGAAGAAAAAGAGGGAGAACGAAGTAA
mRNA sequenceShow/hide mRNA sequence
CCTCTCCCTCTCCCCCTCTCCTTCTATTAAAAGTAATGAGAAACCCCAATTCTCGCCATCTACAAAAATACCCAAAATCTATTGCACACAAATAACCTTTTCTTTGTTTC
ACCAGGAAAAAGAACAGAAAAAAAAAAGGACAAATCCTCCAACCTACCCATTCTCATTCAAAACACCTAATTCCCACACCCAGCTTTTGCTCTTGCAGATATTTTTCCTT
TAATTTGAAGCCGGAGGGTTCTGAGGAAGTTCCGATTAATATGGGAGCGGAAGCATTGAAGAGGTGGGATACGTGGCAGGAGCTTTTATTAGGTGGCGCCATTGTCCGCC
ACGGTACCGGTGACTGGAACCTCGTCGCCACGGAGCTCCGGTCGAGGATTGCTCGTCCGTACCTCTGCACTCCCGAGGTTTGTAAGGCGAAATATGAAGACTTGAAGAAG
CGTTTCGTTGGATGCAAAGCTTGGTATGAGGAGCTTCGACAAAAAAGAATCATGGAACTAAGACAGGCTCTAGAGCATTCTGAAGACTCAATAGGGTCATTGGAATCAAA
GCTTGAAGCTCTTAAGTCAAGGAGTGGATCAGACAAGTCTCTTGTCAATGGCTCTACCAGGTCAGAATCTTGGGGAGCTGTTCAGAAACCAACCAATGAGCAATCTGCCA
GTAGCTTCACGCAGGAAAACAGGACGACCTGCAGTTCAATCGAGTGCCAGCCAGCTCCATTGTTGACTGAAGAGACAGAGATTAAACCAGAACCATTGCAATCTCTCGAA
TGGGGAAAATCCTTGAGAATTGGGAAGTTGGGAGAGGTATTGTATGAAAACCAAGGAGGAATAATTAGGAAGAGATCAAGAGGGAAGAGAAAGAGGAAGGACTGTAATAG
GGAAGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCTGAATCAGCTAACCCTTCAACTGTTTCACAGTCTAAAGAAAACTCATGTTGCAACTCGTTTGAGGCACGTG
AATCTTCGGATGCAAATGAAGCTAGCAGAAGCTCAACCATGGATGGTGTTGATGTTTTAATGGCTCTTTTTAACTCTGTTGCAGAGGACAAAAGTGCCTCCGTATTTCGT
CGTCGCCTTGATAGTCAGAGGAGAAGTAGATATAAGAAACTAATCAGGCAACATTTGGATATTGAAACAATAAGATCAAGAGTTGCAAGTCATTACATAACGACTAAAAA
GGAGTTGTACAGAGATCTGTTGTTGCTTGCTAACAACGCACTCGTCTTCTACTCACGGAATTCCCGGGAGCATCAGTCTGCAGTGTCGCTCAGAAGACTCATTTCAAGTA
CATTTCAGAAGCTAATGAAGAGCTCTAGCAATATGGTAGCTCATAACACCCCCAACCAGAGAACACAAACCTGCGATCTGATAGCGAAACCGCGTCGTTCGCAGCCAGCT
AAACGTAATGAATCTCAAAGAGAAGCCAATCCAGGAGATGTTAAAACTCCAAATGGAAATAGAAGAAGAAGAAATAATAGTTCTAATCCTCCTTCCTCATTGGGGTTGTC
AAAGAAGGAAACTTCGACTTCTACGCCAAAGAAAGCCCCTGGTGGGATAAGAAAGGCTGTCGGTGGGACATCGAAAAGCGAACGATCTGCAACTGGCATCAGGGGAAGAA
AAAGAGGGAGAACGAAGTAAATGGTAAAATTTCAAAACTGTTTCTAGATAGATTCTCAGGCTAGAACTTGTAAGTTGTAAACCAGGTAGGCTGACTGAGGCTTTAAGAAT
GTTTTTGGCTCTAGTAGAAAAATTGGAAAGATAATGGGAATTGGATAACTCGTCCTGTCTTATTGTTTGATGGGATTGATTTTTGTATTTGTTTCATAATACAATTTGTT
CTACAGAATGTTAATTTCCATGAAAAATCAGAACAATATTGTCAGTTGTTAGGTAATGATATGATC
Protein sequenceShow/hide protein sequence
MGAEALKRWDTWQELLLGGAIVRHGTGDWNLVATELRSRIARPYLCTPEVCKAKYEDLKKRFVGCKAWYEELRQKRIMELRQALEHSEDSIGSLESKLEALKSRSGSDKS
LVNGSTRSESWGAVQKPTNEQSASSFTQENRTTCSSIECQPAPLLTEETEIKPEPLQSLEWGKSLRIGKLGEVLYENQGGIIRKRSRGKRKRKDCNREVKEGSSGENNLS
ESANPSTVSQSKENSCCNSFEARESSDANEASRSSTMDGVDVLMALFNSVAEDKSASVFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTKKELYRDLLLLANNA
LVFYSRNSREHQSAVSLRRLISSTFQKLMKSSSNMVAHNTPNQRTQTCDLIAKPRRSQPAKRNESQREANPGDVKTPNGNRRRRNNSSNPPSSLGLSKKETSTSTPKKAP
GGIRKAVGGTSKSERSATGIRGRKRGRTK