; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021176 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021176
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionBromo domain-containing protein
Genome locationChr05:6169955..6174684
RNA-Seq ExpressionHG10021176
SyntenyHG10021176
Gene Ontology termsGO:0016573 - histone acetylation (biological process)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146636.1 uncharacterized protein LOC101217843 isoform X1 [Cucumis sativus]9.3e-21686.17Show/hide
Query:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M AEAL+  WDTW+ELLLGGA+LRHGT DWNLVATELR+RI RPY+CTPEVCKAKYEDL+KRFVGCKAWYEELRR+R+MELRQALEHSEDSIGSLESKLE
Subjt:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK
        ALKSRSGSDKS VNGS+RSE WGAVQKPTNELSA SFTQENR TCSS+ECQPAPLST+ETEIKPEP QSLE+GKASRIGKL  VLYE+QGG +RKRSRGK
Subjt:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSES NPSTVSQSKENSCCNSFEARE SDANEASRSS M+GVDVLM AFN+VAE+KSASLFRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV
        DIETIRSRVASH ITT+ ELYRDLLLLANNAL+FYS NSREHQSAVLLR LISSTF+K MKSSSNMVAHN   +RTQT DL+AKPRR QPAKRN SQ+E 
Subjt:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV

Query:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK
        NPGDVKTP GNRRR+N+S+NP SS+GLAKKETST  +KK PGGTR AVGGTSKSERSATG+RGRKRG+TK
Subjt:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK

XP_008442126.1 PREDICTED: uncharacterized protein LOC103486076 isoform X1 [Cucumis melo]4.9e-21787.45Show/hide
Query:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M AEAL KRWDTW+ELLLGGA++RHGTGDWNLVATELR+RI RPY CTPEVCKAKYEDL+KRFVGCKAWYEELR++RIMELRQALEHSEDSIGSLESKLE
Subjt:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK
        ALKSRSGSDKS VNGS+RSE WGAVQKPTNE SA SFTQENR TCSS+ECQPAPL TEETEIKPEP QSLE GK+ RIGKL  VLYE+QGG +RKRSRGK
Subjt:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSES NPSTVSQSKENSCCNSFEARESSDANEASRSSTM+GVDVLM  FNSVAE+KSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV
        DIETIRSRVASHYITT+KELYRDLLLLANNAL+FYS NSREHQSAV LR LISSTFQKLMKSSSNMVAHN   QRTQT DL+AKPRR QPAKRN SQ+E 
Subjt:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV

Query:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK
        NPGDVKTPNGNRRRRN+S+NP SS+GL+KKETST T KK PGG R AVGGTSKSERSATG+RGRKRGRTK
Subjt:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK

XP_008442135.1 PREDICTED: uncharacterized protein LOC103486076 isoform X2 [Cucumis melo]1.0e-19882.34Show/hide
Query:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M AEAL KRWDTW+ELLLGGA++RHGTGDWNLVATELR+RI RPY CTPEVCKAKYEDL+KRFVGCK                          SLESKLE
Subjt:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK
        ALKSRSGSDKS VNGS+RSE WGAVQKPTNE SA SFTQENR TCSS+ECQPAPL TEETEIKPEP QSLE GK+ RIGKL  VLYE+QGG +RKRSRGK
Subjt:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSES NPSTVSQSKENSCCNSFEARESSDANEASRSSTM+GVDVLM  FNSVAE+KSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV
        DIETIRSRVASHYITT+KELYRDLLLLANNAL+FYS NSREHQSAV LR LISSTFQKLMKSSSNMVAHN   QRTQT DL+AKPRR QPAKRN SQ+E 
Subjt:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV

Query:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK
        NPGDVKTPNGNRRRRN+S+NP SS+GL+KKETST T KK PGG R AVGGTSKSERSATG+RGRKRGRTK
Subjt:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK

XP_022139813.1 uncharacterized protein LOC111010637 [Momordica charantia]7.4e-18177.48Show/hide
Query:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M  EA+E+RWDTWEELLLGGAVLRHGTGDWNLVA ELRARIVRPY+CTPEVCKAKYEDL+KRFVGCKAWYEELRR+RIMELRQALEHSEDSIGSLESKLE
Subjt:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPT-NELSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK
        ALKSRSG DK  VN  SRSE WGAVQK T NELSAGSFTQE RTCSSLEC+ APLS EE EIK E +++L Q K S I KLRG+LY SQGGTVRKR RGK
Subjt:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPT-NELSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK

Query:  RKRKDC----------NREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTME--GVDVLMTAFNSVAENKSASLFRRRLDSQR
        RKRK+C          NR+VKEGS GENNLSES NP+TVSQ    SCCNSFE    SDANEA RSS M+  GVDVLM AFNSVA++KSAS+FRRRLDSQ+
Subjt:  RKRKDC----------NREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTME--GVDVLMTAFNSVAENKSASLFRRRLDSQR

Query:  RSRYKKLIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRL
        R RYKK+IRQHLDIE IRSRV SHYITT KELYRDLLLLANNAL+FYS NSREHQSAVLLRG+I+S F+KL K+SS +V HN HKQ+TQ ID V KPRR 
Subjt:  RSRYKKLIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRL

Query:  QPAKRNVSQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETS--TPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK
        QPAK NVSQKE N  DVKT NG RRR N  ANPHSS+GL KKETS    T KKGPG TR AV GTSKSERSATG RGRKRGRTK
Subjt:  QPAKRNVSQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETS--TPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK

XP_038894005.1 uncharacterized protein LOC120082772 [Benincasa hispida]1.2e-19488.81Show/hide
Query:  EVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLEALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENRTCSSLEC
        + C+AKYEDL+KRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLEALKSRSGSDKS VN S+RSE WGAVQKPTNELSAGSFTQEN TCSS+EC
Subjt:  EVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLEALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENRTCSSLEC

Query:  QPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGKRKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDAN
        QPAPLSTEETEIKPEPS+SLE+GKASRIGKL GVLYESQGGTVRKRSRGKRKRKDCNREVKEGSSGENNLS+S NPSTVSQSKENSCCNSFEARESSDAN
Subjt:  QPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGKRKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDAN

Query:  EASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRG
        EASRSSTM+GVDVLM AFNSVAENK+A++FRRRLDSQRR RYKKLIRQHLDIETIRSRVASHY TT+KELYRDLLLLANNA++FYSPNSREHQSAVLLR 
Subjt:  EASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRG

Query:  LISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGG
        LISSTFQKLMKSSSNMVAH+PH QRTQT DL+AKPRR QPAKRNV QKEVNPGDVKTPNG   RR ++ANPHSSM LAKKETST  VKKGPGGTR AVGG
Subjt:  LISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGG

Query:  TSKSERSATGVRGRKRGRTK
         SKS +SAT V+GRKRGRTK
Subjt:  TSKSERSATGVRGRKRGRTK

TrEMBL top hitse value%identityAlignment
A0A0A0LV17 Bromo domain-containing protein4.5e-21686.17Show/hide
Query:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M AEAL+  WDTW+ELLLGGA+LRHGT DWNLVATELR+RI RPY+CTPEVCKAKYEDL+KRFVGCKAWYEELRR+R+MELRQALEHSEDSIGSLESKLE
Subjt:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK
        ALKSRSGSDKS VNGS+RSE WGAVQKPTNELSA SFTQENR TCSS+ECQPAPLST+ETEIKPEP QSLE+GKASRIGKL  VLYE+QGG +RKRSRGK
Subjt:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSES NPSTVSQSKENSCCNSFEARE SDANEASRSS M+GVDVLM AFN+VAE+KSASLFRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV
        DIETIRSRVASH ITT+ ELYRDLLLLANNAL+FYS NSREHQSAVLLR LISSTF+K MKSSSNMVAHN   +RTQT DL+AKPRR QPAKRN SQ+E 
Subjt:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV

Query:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK
        NPGDVKTP GNRRR+N+S+NP SS+GLAKKETST  +KK PGGTR AVGGTSKSERSATG+RGRKRG+TK
Subjt:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK

A0A1S3B4K1 uncharacterized protein LOC103486076 isoform X25.0e-19982.34Show/hide
Query:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M AEAL KRWDTW+ELLLGGA++RHGTGDWNLVATELR+RI RPY CTPEVCKAKYEDL+KRFVGCK                          SLESKLE
Subjt:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK
        ALKSRSGSDKS VNGS+RSE WGAVQKPTNE SA SFTQENR TCSS+ECQPAPL TEETEIKPEP QSLE GK+ RIGKL  VLYE+QGG +RKRSRGK
Subjt:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSES NPSTVSQSKENSCCNSFEARESSDANEASRSSTM+GVDVLM  FNSVAE+KSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV
        DIETIRSRVASHYITT+KELYRDLLLLANNAL+FYS NSREHQSAV LR LISSTFQKLMKSSSNMVAHN   QRTQT DL+AKPRR QPAKRN SQ+E 
Subjt:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV

Query:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK
        NPGDVKTPNGNRRRRN+S+NP SS+GL+KKETST T KK PGG R AVGGTSKSERSATG+RGRKRGRTK
Subjt:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK

A0A1S3B4Z1 uncharacterized protein LOC103486076 isoform X12.4e-21787.45Show/hide
Query:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M AEAL KRWDTW+ELLLGGA++RHGTGDWNLVATELR+RI RPY CTPEVCKAKYEDL+KRFVGCKAWYEELR++RIMELRQALEHSEDSIGSLESKLE
Subjt:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK
        ALKSRSGSDKS VNGS+RSE WGAVQKPTNE SA SFTQENR TCSS+ECQPAPL TEETEIKPEP QSLE GK+ RIGKL  VLYE+QGG +RKRSRGK
Subjt:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENR-TCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK

Query:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL
        RKRKDCNREVKEGSSGENNLSES NPSTVSQSKENSCCNSFEARESSDANEASRSSTM+GVDVLM  FNSVAE+KSAS+FRRRLDSQRRSRYKKLIRQHL
Subjt:  RKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHL

Query:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV
        DIETIRSRVASHYITT+KELYRDLLLLANNAL+FYS NSREHQSAV LR LISSTFQKLMKSSSNMVAHN   QRTQT DL+AKPRR QPAKRN SQ+E 
Subjt:  DIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEV

Query:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK
        NPGDVKTPNGNRRRRN+S+NP SS+GL+KKETST T KK PGG R AVGGTSKSERSATG+RGRKRGRTK
Subjt:  NPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK

A0A6J1CGL2 uncharacterized protein LOC1110106373.6e-18177.48Show/hide
Query:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M  EA+E+RWDTWEELLLGGAVLRHGTGDWNLVA ELRARIVRPY+CTPEVCKAKYEDL+KRFVGCKAWYEELRR+RIMELRQALEHSEDSIGSLESKLE
Subjt:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPT-NELSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK
        ALKSRSG DK  VN  SRSE WGAVQK T NELSAGSFTQE RTCSSLEC+ APLS EE EIK E +++L Q K S I KLRG+LY SQGGTVRKR RGK
Subjt:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPT-NELSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGK

Query:  RKRKDC----------NREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTME--GVDVLMTAFNSVAENKSASLFRRRLDSQR
        RKRK+C          NR+VKEGS GENNLSES NP+TVSQ    SCCNSFE    SDANEA RSS M+  GVDVLM AFNSVA++KSAS+FRRRLDSQ+
Subjt:  RKRKDC----------NREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTME--GVDVLMTAFNSVAENKSASLFRRRLDSQR

Query:  RSRYKKLIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRL
        R RYKK+IRQHLDIE IRSRV SHYITT KELYRDLLLLANNAL+FYS NSREHQSAVLLRG+I+S F+KL K+SS +V HN HKQ+TQ ID V KPRR 
Subjt:  RSRYKKLIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRL

Query:  QPAKRNVSQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETS--TPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK
        QPAK NVSQKE N  DVKT NG RRR N  ANPHSS+GL KKETS    T KKGPG TR AV GTSKSERSATG RGRKRGRTK
Subjt:  QPAKRNVSQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETS--TPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X13.6e-18177.1Show/hide
Query:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M AEA++KRWDTWEELLLGGA+LRHGT DWNLVA ELRARIVRP + TPEVCKAKYEDL+KRFVGCKAWYEELRRQRI+ELR+ALEHSEDSIGSLESKLE
Subjt:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQ--SLEQGKASRIGKLRGVLYESQGGTVRKRSRG
        ALKSRSG DKS VN S RSE WG V KPTNELSAGSFTQENRTCSS+EC+ AP   +ETEIKPE SQ   LE GK                GTV+KRSRG
Subjt:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQ--SLEQGKASRIGKLRGVLYESQGGTVRKRSRG

Query:  KRKRKDC--NREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEG--VDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKL
        KRKRKDC  +R+VKEGS+GENNLSES NPSTVS SK+NSCCNSFE RESSDANEASRSSTM+G  VDVLM AFN+VAENKSA +FRRRLDSQ+R RYKKL
Subjt:  KRKRKDC--NREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEG--VDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKL

Query:  IRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNV
        IRQHLDIETIRSRVASHYITTQKELYRDLLLLANNAL+FY PN+REH+SAVLLR LI+STFQKL K        N H++RTQT D +AKP RLQPAKR  
Subjt:  IRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNV

Query:  SQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERS-ATGVRGRKRGRTK
        S+KEVNPGD KTP+GNRRRR+ +AN HSS+GLAK ETS  TVK+ P GTR +V GTSKSE+S ATGVRGRKRGRTK
Subjt:  SQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERS-ATGVRGRKRGRTK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G61215.1 bromodomain 48.8e-6337.85Show/hide
Query:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        M    +E  W TWEELLLGGAVLRHGTGDW +VA ELR+  + P   TPE+CKAKY+DLRKR+VGCKAW+EEL+++R+ EL+ AL  SEDSIGSLESKL+
Subjt:  MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNE--------------LSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYE
        +LKS S +D+   N    S        P +E               S GSFTQ+  T ++             E K E    +EQ K   +  L   ++E
Subjt:  ALKSRSGSDKSFVNGSSRSEFWGAVQKPTNE--------------LSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYE

Query:  S---QGGTV---RKRSRGKRKRKDCN----REVKEGSSGENN--LSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENK
        S    GG V    ++ RGKRKRKDC+    +EV E S+ E +     S + +++ +SKE           +S ++  SR  ++     LM  +N++A+N+
Subjt:  S---QGGTV---RKRSRGKRKRKDCN----REVKEGSSGENN--LSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENK

Query:  SASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQR
         A +FRRRLDSQ+R RYKKL+R+H+D++T++SR+    I++ KEL+RD LL+ANNA IFYS N+RE++SAV LR +++ + +  +        H PH+  
Subjt:  SASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQR

Query:  T---QTIDLVAKPRRLQPAKR-NVSQKEVNPG--DVKTPNGNRRRRNDSANPHSSMGL-AKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGR
             T  +V   +   P+ R +++ K+   G   +KT   +  + +   N  S   L      S+   KKG    ++  G  +     +  + GRKR R
Subjt:  T---QTIDLVAKPRRLQPAKR-NVSQKEVNPG--DVKTPNGNRRRRNDSANPHSSMGL-AKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGR

Query:  TK
         +
Subjt:  TK

AT2G42150.1 DNA-binding bromodomain-containing protein1.4e-2027.1Show/hide
Query:  EKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRF------------VGCKAWYEELRRQRIMELRQALEHSEDSIGS
        ++ W TWEELLL  AV RHGT  WN V+ E++       S T   C+ KY DL+ RF            +    W EELR+ R+ ELR+ +E  + SI +
Subjt:  EKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRF------------VGCKAWYEELRRQRIMELRQALEHSEDSIGS

Query:  LESKLEALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRK
        L+SK++ L+     + SF+   + +E     +K            + R+ S       P+      I P+P + +      R  ++ G      GG   K
Subjt:  LESKLEALKSRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRK

Query:  RSRGKRKRKDCNREVKEGSSGENNL----------SESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTME-----GVDVLMTAFNSVAENKSASLF
         +     R  C    KE ++    +          SE G       + +     S   + +S+ ++  +S T           L++    +  +   S F
Subjt:  RSRGKRKRKDCNREVKEGSSGENNL----------SESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTME-----GVDVLMTAFNSVAENKSASLF

Query:  RRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNM--VAHNPHKQRTQ
         RRL+ Q    Y  +IR+H+D E IR RV    Y + +   +RDLLLL NNA +FY   S E + A  L  L+       +K  SN   ++ +P K+   
Subjt:  RRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNM--VAHNPHKQRTQ

Query:  TI----DLVAKPRRLQP
         I     + +KPR   P
Subjt:  TI----DLVAKPRRLQP

AT2G44430.1 DNA-binding bromodomain-containing protein8.3e-2127.68Show/hide
Query:  WDTWEELLLGGAVLRHGTGDWNLVATELRAR-IVRPYSCTPEVCKAKYEDLRKRF---------------------VGCK-AWYEELRRQRIMELRQALE
        W TWEELLL  AV RHG GDW+ VATE+R+R  +     +   C+ KY DL++RF                     VG    W E+LR  R+ ELR+ +E
Subjt:  WDTWEELLLGGAVLRHGTGDWNLVATELRAR-IVRPYSCTPEVCKAKYEDLRKRF---------------------VGCK-AWYEELRRQRIMELRQALE

Query:  HSEDSIGSLESKLEALKSRS--GSDKSFVNG---SSRSEFWGAVQKPTNEL--SAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKAS----
          + SI SL+ K++ L+     G +K  +       RSE  G+  +   +   +A    +ENR+ +      A    EE     EPSQ+ E    +    
Subjt:  HSEDSIGSLESKLEALKSRS--GSDKSFVNG---SSRSEFWGAVQKPTNEL--SAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKAS----

Query:  ---RIGKLRGVLYESQGGTVRKRSRGKRKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAE
            + K      E +G      SRG              +S  + L ESG   +  + K      + E R +   ++            L++  + +  
Subjt:  ---RIGKLRGVLYESQGGTVRKRSRGKRKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAE

Query:  NKSASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQK-LMKSSSNMVAHNP
        +   SLF RRL SQ    YK +++QHLDIETI+ ++    Y ++    YRDL LL  NA++F+  +S E  +A  LR ++S   +K   K+   ++    
Subjt:  NKSASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQK-LMKSSSNMVAHNP

Query:  HKQRTQTIDLVAKPRRLQPAKRNVSQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKR
           R+   D       L        QK   P  V      RR  +  A+P SS    K +T   T+             + + +  ATGVR  +R
Subjt:  HKQRTQTIDLVAKPRRLQPAKRNVSQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKR

AT3G57980.1 DNA-binding bromodomain-containing protein3.5e-1928.5Show/hide
Query:  EELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRF------------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSL
        EELLL  AV RHGT  W+ VA+E+  +     + T   C+ KY DL++RF                  +    W EELR+ R+ ELR+ +E  + SI SL
Subjt:  EELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRF------------------VGCKAWYEELRRQRIMELRQALEHSEDSIGSL

Query:  ESKLEALK---------SRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENRTCSSLECQ------PAPLSTEETEIKPEPSQSLEQGKASRIGKL
        + K++ L+           S  D+      + +E       P  EL       +N   +  E          P+  E   I  E +      + S  G  
Subjt:  ESKLEALK---------SRSGSDKSFVNGSSRSEFWGAVQKPTNELSAGSFTQENRTCSSLECQ------PAPLSTEETEIKPEPSQSLEQGKASRIGKL

Query:  RGVLYESQGGTVRKRSRGKRKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANE---ASRSSTMEGVDV----LMTAFNSVAEN
          V  ES       R+  KR+  D    V+       ++ ES       ++ +     SF  +E+ D ++     +S T+  + V    L      +  +
Subjt:  RGVLYESQGGTVRKRSRGKRKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANE---ASRSSTMEGVDV----LMTAFNSVAEN

Query:  KSASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLI
           S F RRL++Q  S Y ++IRQH+D E IRSRV   +Y T + + +RDLLLL NN  +FY   S E  +A  L  LI
Subjt:  KSASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLI

AT3G60110.1 DNA-binding bromodomain-containing protein4.3e-1723.98Show/hide
Query:  LEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRF--------------------VGCKAWYEELRRQRIMELRQAL
        +++ W TWEEL+L  AV RH   DW+ VA E++AR       +   C+ KY+DL++RF                    VG  +W E+LR   + ELR+ +
Subjt:  LEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRF--------------------VGCKAWYEELRRQRIMELRQAL

Query:  EHSEDSIGSLESKLEALKSRSGSDKSFVNGSSRSEFWGAVQKP--TNELSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGV
        +  +DSI SL+ K++ L+     D    +G ++ +      KP   N  +  S   +NR+ +      A +       + +  + ++  + SR       
Subjt:  EHSEDSIGSLESKLEALKSRSGSDKSFVNGSSRSEFWGAVQKP--TNELSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGV

Query:  LYESQGGTVRKRSRGKRKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRL
                V K    + + +  ++  +  +SGE  L ESG  + + + K          +  S        S  +    L+     +  +   S+F  RL
Subjt:  LYESQGGTVRKRSRGKRKRKDCNREVKEGSSGENNLSESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRL

Query:  DSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNP-----HKQRTQT
         SQ    YK+LIRQHLD++TI  ++    Y+++    YRDL LL  NA++F+  +S E  +A  LR L+S+  +K      + V  +       +Q++  
Subjt:  DSQRRSRYKKLIRQHLDIETIRSRV-ASHYITTQKELYRDLLLLANNALIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNP-----HKQRTQT

Query:  IDLVAKPRRLQPAKRNVSQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK
        + LV   ++    K+             +P+ + R++++      S  +++++  T T       T +A      S+  A   +  K GR K
Subjt:  IDLVAKPRRLQPAKRNVSQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGPGGTRNAVGGTSKSERSATGVRGRKRGRTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGCGGAGGCGTTAGAGAAGAGGTGGGATACGTGGGAAGAGCTTCTATTAGGTGGCGCCGTACTCCGCCACGGTACCGGTGACTGGAATCTCGTCGCGACGGAGCT
CCGGGCGAGGATTGTTCGTCCGTACTCCTGCACCCCTGAGGTTTGTAAGGCCAAATATGAAGACTTGCGGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GACGGCAACGAATCATGGAACTAAGGCAAGCTCTAGAGCATTCTGAAGACTCAATAGGATCATTGGAATCAAAGCTTGAAGCTCTTAAGTCTAGGAGTGGATCAGACAAG
TCTTTTGTCAATGGCTCTAGCAGATCAGAATTTTGGGGAGCTGTTCAGAAACCAACCAATGAGCTATCTGCCGGTAGCTTCACACAGGAAAACAGGACGTGCAGTTCGTT
GGAATGCCAGCCAGCTCCGTTGTCGACTGAAGAAACGGAGATTAAACCGGAACCATCGCAGTCTCTCGAACAGGGAAAAGCCTCAAGAATTGGGAAGTTGAGAGGGGTAT
TGTATGAAAGCCAAGGAGGAACAGTAAGGAAGAGATCAAGAGGGAAGAGAAAGAGGAAGGATTGTAATAGGGAAGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCC
GAATCAGGTAACCCTTCAACTGTTTCACAGTCTAAAGAAAACTCGTGTTGCAACTCGTTTGAGGCTCGTGAATCTTCTGATGCAAATGAAGCTAGCAGAAGCTCAACCAT
GGAGGGAGTTGATGTTCTAATGACTGCTTTTAACTCTGTTGCAGAGAACAAAAGTGCCTCCTTATTTCGTCGTCGCCTTGATAGTCAGAGAAGAAGTAGATACAAGAAAT
TAATCCGGCAACATTTGGATATTGAAACAATAAGGTCAAGAGTTGCAAGTCATTACATAACGACGCAAAAGGAGCTGTACAGGGATCTGCTGTTGCTTGCTAACAATGCC
CTCATCTTCTACTCGCCGAATTCCCGGGAACATCAGTCTGCAGTGCTACTCAGAGGCCTCATTTCAAGTACATTTCAGAAGCTAATGAAGAGCTCTAGCAATATGGTAGC
CCACAACCCCCACAAACAGAGAACACAAACCATTGATCTGGTGGCAAAACCTCGTCGTTTGCAGCCTGCTAAACGTAATGTATCTCAGAAAGAAGTCAATCCAGGAGATG
TTAAAACTCCAAATGGAAATAGAAGAAGAAGAAATGATAGTGCTAATCCCCATTCCTCAATGGGGTTGGCAAAGAAAGAAACTTCGACTCCTACAGTAAAGAAAGGCCCT
GGTGGGACGAGAAATGCCGTCGGTGGGACGTCGAAAAGTGAACGATCTGCAACTGGCGTTAGGGGAAGGAAAAGAGGGAGAACGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGCGGAGGCGTTAGAGAAGAGGTGGGATACGTGGGAAGAGCTTCTATTAGGTGGCGCCGTACTCCGCCACGGTACCGGTGACTGGAATCTCGTCGCGACGGAGCT
CCGGGCGAGGATTGTTCGTCCGTACTCCTGCACCCCTGAGGTTTGTAAGGCCAAATATGAAGACTTGCGGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GACGGCAACGAATCATGGAACTAAGGCAAGCTCTAGAGCATTCTGAAGACTCAATAGGATCATTGGAATCAAAGCTTGAAGCTCTTAAGTCTAGGAGTGGATCAGACAAG
TCTTTTGTCAATGGCTCTAGCAGATCAGAATTTTGGGGAGCTGTTCAGAAACCAACCAATGAGCTATCTGCCGGTAGCTTCACACAGGAAAACAGGACGTGCAGTTCGTT
GGAATGCCAGCCAGCTCCGTTGTCGACTGAAGAAACGGAGATTAAACCGGAACCATCGCAGTCTCTCGAACAGGGAAAAGCCTCAAGAATTGGGAAGTTGAGAGGGGTAT
TGTATGAAAGCCAAGGAGGAACAGTAAGGAAGAGATCAAGAGGGAAGAGAAAGAGGAAGGATTGTAATAGGGAAGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCC
GAATCAGGTAACCCTTCAACTGTTTCACAGTCTAAAGAAAACTCGTGTTGCAACTCGTTTGAGGCTCGTGAATCTTCTGATGCAAATGAAGCTAGCAGAAGCTCAACCAT
GGAGGGAGTTGATGTTCTAATGACTGCTTTTAACTCTGTTGCAGAGAACAAAAGTGCCTCCTTATTTCGTCGTCGCCTTGATAGTCAGAGAAGAAGTAGATACAAGAAAT
TAATCCGGCAACATTTGGATATTGAAACAATAAGGTCAAGAGTTGCAAGTCATTACATAACGACGCAAAAGGAGCTGTACAGGGATCTGCTGTTGCTTGCTAACAATGCC
CTCATCTTCTACTCGCCGAATTCCCGGGAACATCAGTCTGCAGTGCTACTCAGAGGCCTCATTTCAAGTACATTTCAGAAGCTAATGAAGAGCTCTAGCAATATGGTAGC
CCACAACCCCCACAAACAGAGAACACAAACCATTGATCTGGTGGCAAAACCTCGTCGTTTGCAGCCTGCTAAACGTAATGTATCTCAGAAAGAAGTCAATCCAGGAGATG
TTAAAACTCCAAATGGAAATAGAAGAAGAAGAAATGATAGTGCTAATCCCCATTCCTCAATGGGGTTGGCAAAGAAAGAAACTTCGACTCCTACAGTAAAGAAAGGCCCT
GGTGGGACGAGAAATGCCGTCGGTGGGACGTCGAAAAGTGAACGATCTGCAACTGGCGTTAGGGGAAGGAAAAGAGGGAGAACGAAGTGA
Protein sequenceShow/hide protein sequence
MRAEALEKRWDTWEELLLGGAVLRHGTGDWNLVATELRARIVRPYSCTPEVCKAKYEDLRKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLEALKSRSGSDK
SFVNGSSRSEFWGAVQKPTNELSAGSFTQENRTCSSLECQPAPLSTEETEIKPEPSQSLEQGKASRIGKLRGVLYESQGGTVRKRSRGKRKRKDCNREVKEGSSGENNLS
ESGNPSTVSQSKENSCCNSFEARESSDANEASRSSTMEGVDVLMTAFNSVAENKSASLFRRRLDSQRRSRYKKLIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNA
LIFYSPNSREHQSAVLLRGLISSTFQKLMKSSSNMVAHNPHKQRTQTIDLVAKPRRLQPAKRNVSQKEVNPGDVKTPNGNRRRRNDSANPHSSMGLAKKETSTPTVKKGP
GGTRNAVGGTSKSERSATGVRGRKRGRTK