CuGenDBv2

MC03g0824 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g0824
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Bromo domain-containing protein
Genome location: MC03:14898548..14902578
RNA-Seq Expression: MC03g0824
Synteny: MC03g0824
Gene Ontology terms:
GO:0016573 - histone acetylation (biological process)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR036427 - Bromodomain-like superfamily
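
The genome location field packs a chromosome name and two coordinates into one string. A minimal Python sketch for splitting it and computing the length of the locus follows. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `Location` and `parse_location` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the record (assumed 1-based, inclusive)."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive coordinates: both end points count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string of the form 'CHROM:START..END'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("MC03:14898548..14902578")
print(loc.chrom, loc.start, loc.end, loc.span)  # MC03 14898548 14902578 4031
```

Under that assumption the locus covers 4031 bp on MC03.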


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146636.1 uncharacterized protein LOC101217843 isoform X1 [Cucumis sativus] | e value: 2.12e-210 | % identity: 72.58
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MG EA++  WDTW+ELLLGGA+LRHGT DWNLVA ELR+RI RPYACTPEVCKAKYEDL+KRFVGCKAWYEELRRKR+MELRQALEHSEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRT-CSSLECEAAPLSAEEIEIKAEA-EALQ-DKVSSIEKLRGILYGSQGGTVRKR-RG
        ALKSRSG DK +VN  +RSESWGAVQK T NELSA SFTQE RT CSS+EC+ APLS +E EIK E  ++L+  K S I KL  +LY +QGG +RKR RG
Subjt:  ALKSRSG-DKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRT-CSSLECEAAPLSAEEIEIKAEA-EALQ-DKVSSIEKLRGILYGSQGGTVRKR-RG

Query:  KRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQ
        KRKRK+CN          R+VKEGS GENNLSES NP+TVSQS    CCNSFE   PSDANEA RSSAMD  GVDVLMAAFN+VA+ KSAS+FRRRLDSQ
Subjt:  KRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQ

Query:  KRGRYKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRR
        +R RYKK+IRQHLDIE IRSRV SH ITT  ELYRDLLLLANNALVFYSRNSREHQSAVLLR +I+S F+K  K+SS +V HN   ++TQ  D + KPRR
Subjt:  KRGRYKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRR

Query:  SQPAKHNVSQKEGNLRDVKTQNGGRRR-GNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK
        SQPAK N SQ+E N  DVKT  G RRR  N +NP SS+GL KKETS   S  KK PG TRKAV GTSKSERSATG RGRKRG+TK
Subjt:  SQPAKHNVSQKEGNLRDVKTQNGGRRR-GNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK

XP_008442126.1 PREDICTED: uncharacterized protein LOC103486076 isoform X1 [Cucumis melo] | e value: 8.29e-210 | % identity: 72.99
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MG EA+ +RWDTW+ELLLGGA++RHGTGDWNLVA ELR+RI RPY CTPEVCKAKYEDL+KRFVGCKAWYEELR+KRIMELRQALEHSEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRT-CSSLECEAAPLSAEEIEIKAEA-EALQ-DKVSSIEKLRGILYGSQGGTVRKR-RG
        ALKSRSG DK +VN  +RSESWGAVQK T NE SA SFTQE RT CSS+EC+ APL  EE EIK E  ++L+  K   I KL  +LY +QGG +RKR RG
Subjt:  ALKSRSG-DKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRT-CSSLECEAAPLSAEEIEIKAEA-EALQ-DKVSSIEKLRGILYGSQGGTVRKR-RG

Query:  KRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQ
        KRKRK+CN          R+VKEGS GENNLSES NP+TVSQS    CCNSFE    SDANEA RSS MD  GVDVLMA FNSVA+ KSASVFRRRLDSQ
Subjt:  KRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQ

Query:  KRGRYKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRR
        +R RYKK+IRQHLDIE IRSRV SHYITT KELYRDLLLLANNALVFYSRNSREHQSAV LR +I+S F+KL K+SS +V HN   Q+TQ  D + KPRR
Subjt:  KRGRYKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRR

Query:  SQPAKHNVSQKEGNLRDVKTQNGGRRR-GNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK
        SQPAK N SQ+E N  DVKT NG RRR  N +NP SS+GL KKETS   ST KK PG  RKAV GTSKSERSATG RGRKRGRTK
Subjt:  SQPAKHNVSQKEGNLRDVKTQNGGRRR-GNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK

XP_022139813.1 uncharacterized protein LOC111010637 [Momordica charantia] | e value: 0.0 | % identity: 100
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKRRGKRKRK
        ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKRRGKRKRK
Subjt:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKRRGKRKRK

Query:  NCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGRYKKVI
        NCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGRYKKVI
Subjt:  NCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGRYKKVI

Query:  RQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPAKHNVS
        RQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPAKHNVS
Subjt:  RQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPAKHNVS

Query:  QKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK
        QKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK
Subjt:  QKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK

XP_022994396.1 uncharacterized protein LOC111490126 isoform X1 [Cucurbita maxima] | e value: 1.28e-204 | % identity: 71.73
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MG EAI++RWDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRR+RI+ELR+ALEHSEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKR-RGKRKR
        ALKSRSGDK +VNS  RSESWG V K T NELSAGSFTQE RTCSS+EC +AP  A+E EIK EA  L+            L   + GTV+KR RGKRKR
Subjt:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKR-RGKRKR

Query:  KNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGR
        K+C+        S+RDVKEGS GENNLSES NP+TVS S    CCNSFEP   SDANEA RSS MDGV VDVLMAAFN+VA++KSA VFRRRLDSQKRGR
Subjt:  KNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGR

Query:  YKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPA
        YKK+IRQHLDIE IRSRV SHYITT KELYRDLLLLANNALVFY  N+REH+SAVLLR +ITS F+KLFKNS        H+++TQ  D + KP R QPA
Subjt:  YKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPA

Query:  KHNVSQKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSA-TGTRGRKRGRTK
        K   S+KE N  D KT +G RRR + AN HSSVGL K ETS  AST K+ P  TRK+VVGTSKSE+SA TG RGRKRGRTK
Subjt:  KHNVSQKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSA-TGTRGRKRGRTK

XP_023542669.1 uncharacterized protein LOC111802504 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 2.78e-203 | % identity: 71.67
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MG EAI+++WDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRR+RIMELR+ALEHSEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKRRGKRKRK
        ALKSRSGDK +VNS  RSESWG V K T NELSAGSFTQE RTCSS+EC +AP  A+E EIK EA           +LR + +G  G   ++ RGKRKRK
Subjt:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKRRGKRKRK

Query:  NCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGRY
        +C         S+RDVKEGS GENNLSES NP+TVS S    CCNSFEP   SDANEA RSS MDGV VDVLMAAFN+VA++KSASVFRRRLDSQKRGRY
Subjt:  NCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGRY

Query:  KKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPAK
        KK+IRQHLDIE IRSRV SHYITT KELYRDLLLLANNALVFY  N+RE++SAVLLR +ITS F+KLFKNS        H ++TQ  D V KP R QPAK
Subjt:  KKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPAK

Query:  HNVSQKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSA-TGTRGRKRGRTK
         N S+KE N  D KT +G RRR N AN HSSVGL K ETS  AST K+ P  TRK+VVGT KSERSA T  RGRKRGRTK
Subjt:  HNVSQKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSA-TGTRGRKRGRTK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LV17 Bromo domain-containing protein | e value: 1.03e-210 | % identity: 72.58
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MG EA++  WDTW+ELLLGGA+LRHGT DWNLVA ELR+RI RPYACTPEVCKAKYEDL+KRFVGCKAWYEELRRKR+MELRQALEHSEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRT-CSSLECEAAPLSAEEIEIKAEA-EALQ-DKVSSIEKLRGILYGSQGGTVRKR-RG
        ALKSRSG DK +VN  +RSESWGAVQK T NELSA SFTQE RT CSS+EC+ APLS +E EIK E  ++L+  K S I KL  +LY +QGG +RKR RG
Subjt:  ALKSRSG-DKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRT-CSSLECEAAPLSAEEIEIKAEA-EALQ-DKVSSIEKLRGILYGSQGGTVRKR-RG

Query:  KRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQ
        KRKRK+CN          R+VKEGS GENNLSES NP+TVSQS    CCNSFE   PSDANEA RSSAMD  GVDVLMAAFN+VA+ KSAS+FRRRLDSQ
Subjt:  KRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQ

Query:  KRGRYKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRR
        +R RYKK+IRQHLDIE IRSRV SH ITT  ELYRDLLLLANNALVFYSRNSREHQSAVLLR +I+S F+K  K+SS +V HN   ++TQ  D + KPRR
Subjt:  KRGRYKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRR

Query:  SQPAKHNVSQKEGNLRDVKTQNGGRRR-GNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK
        SQPAK N SQ+E N  DVKT  G RRR  N +NP SS+GL KKETS   S  KK PG TRKAV GTSKSERSATG RGRKRG+TK
Subjt:  SQPAKHNVSQKEGNLRDVKTQNGGRRR-GNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK

A0A1S3B4Z1 uncharacterized protein LOC103486076 isoform X1 | e value: 4.01e-210 | % identity: 72.99
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MG EA+ +RWDTW+ELLLGGA++RHGTGDWNLVA ELR+RI RPY CTPEVCKAKYEDL+KRFVGCKAWYEELR+KRIMELRQALEHSEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSG-DKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRT-CSSLECEAAPLSAEEIEIKAEA-EALQ-DKVSSIEKLRGILYGSQGGTVRKR-RG
        ALKSRSG DK +VN  +RSESWGAVQK T NE SA SFTQE RT CSS+EC+ APL  EE EIK E  ++L+  K   I KL  +LY +QGG +RKR RG
Subjt:  ALKSRSG-DKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRT-CSSLECEAAPLSAEEIEIKAEA-EALQ-DKVSSIEKLRGILYGSQGGTVRKR-RG

Query:  KRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQ
        KRKRK+CN          R+VKEGS GENNLSES NP+TVSQS    CCNSFE    SDANEA RSS MD  GVDVLMA FNSVA+ KSASVFRRRLDSQ
Subjt:  KRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQ

Query:  KRGRYKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRR
        +R RYKK+IRQHLDIE IRSRV SHYITT KELYRDLLLLANNALVFYSRNSREHQSAV LR +I+S F+KL K+SS +V HN   Q+TQ  D + KPRR
Subjt:  KRGRYKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRR

Query:  SQPAKHNVSQKEGNLRDVKTQNGGRRR-GNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK
        SQPAK N SQ+E N  DVKT NG RRR  N +NP SS+GL KKETS   ST KK PG  RKAV GTSKSERSATG RGRKRGRTK
Subjt:  SQPAKHNVSQKEGNLRDVKTQNGGRRR-GNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK

A0A6J1CGL2 uncharacterized protein LOC111010637 | e value: 0.0 | % identity: 100
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKRRGKRKRK
        ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKRRGKRKRK
Subjt:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKRRGKRKRK

Query:  NCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGRYKKVI
        NCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGRYKKVI
Subjt:  NCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGRYKKVI

Query:  RQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPAKHNVS
        RQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPAKHNVS
Subjt:  RQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPAKHNVS

Query:  QKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK
        QKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK
Subjt:  QKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK

A0A6J1GT05 uncharacterized protein LOC111456852 isoform X1 | e value: 2.08e-200 | % identity: 71.31
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MG EAI++RWDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRR+RIMELR+ALEHSEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKR-RGKRKR
        ALKSRSGDK +VNS  RSESWG V K T NELSAGSFTQE RTCSS+EC +AP  A+E EIK EA  L+            L   + GTV+KR RGKRKR
Subjt:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKR-RGKRKR

Query:  KNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGR
        K+C         S+RDVKEGS GENNLSES NP+TVS S    CCNSFEP   SDANEA RSS MDGV VDVLMAAFN+VA++KSA+VFRRRLDSQKRGR
Subjt:  KNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGR

Query:  YKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPA
        YKK+IRQHLDIE IRSRV S YITT KELYRDLLLLANNALVFY  N+RE++SAVLLR +IT+ F+KLFKNS        H ++TQ  D + K  R QPA
Subjt:  YKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPA

Query:  KHNVSQKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSA-TGTRGRKRGRTK
        K N S+KE N  D KT +G RRR N AN HSSVGL K ETS  AST K+ P  TRK+VVGTSKSERSA T  RGRKRGR K
Subjt:  KHNVSQKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSA-TGTRGRKRGRTK

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X1 | e value: 6.21e-205 | % identity: 71.73
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        MG EAI++RWDTWEELLLGGA+LRHGT DWNLVAAELRARIVRP A TPEVCKAKYEDLQKRFVGCKAWYEELRR+RI+ELR+ALEHSEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKR-RGKRKR
        ALKSRSGDK +VNS  RSESWG V K T NELSAGSFTQE RTCSS+EC +AP  A+E EIK EA  L+            L   + GTV+KR RGKRKR
Subjt:  ALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKR-RGKRKR

Query:  KNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGR
        K+C+        S+RDVKEGS GENNLSES NP+TVS S    CCNSFEP   SDANEA RSS MDGV VDVLMAAFN+VA++KSA VFRRRLDSQKRGR
Subjt:  KNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQS----CCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGR

Query:  YKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPA
        YKK+IRQHLDIE IRSRV SHYITT KELYRDLLLLANNALVFY  N+REH+SAVLLR +ITS F+KLFKNS        H+++TQ  D + KP R QPA
Subjt:  YKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPA

Query:  KHNVSQKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSA-TGTRGRKRGRTK
        K   S+KE N  D KT +G RRR + AN HSSVGL K ETS  AST K+ P  TRK+VVGTSKSE+SA TG RGRKRGRTK
Subjt:  KHNVSQKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSA-TGTRGRKRGRTK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G61215.1 bromodomain 4 | e value: 1.3e-66 | % identity: 38.05
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE
        M T  +E  W TWEELLLGGAVLRHGTGDW +VA ELR+  + P   TPE+CKAKY+DL+KR+VGCKAW+EEL++KR+ EL+ AL  SEDSIGSLESKL+
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKCVVNS-----------CSRSESWG-AVQKRTSNELSA-GSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQG
        +LKS S D+C  N+             +SE  G    K TS +LS+ GSFTQ+  T ++   EA   +   IE +   + L + +         +YG  G
Subjt:  ALKSRSGDKCVVNS-----------CSRSESWG-AVQKRTSNELSA-GSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQG

Query:  ---GTVRKRRGKRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVF
            ++RK+RGKRKRK+C+ +          V+E  + + +   +        S   S E    S +     S A+       LM  +N++AQ++ A VF
Subjt:  ---GTVRKRRGKRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVF

Query:  RRRLDSQKRGRYKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKN-----------SSTVVPH
        RRRLDSQKRGRYKK++R+H+D++ ++SR+    I++ KEL+RD LL+ANNA +FYS+N+RE++SAV LR I+T   +                 ST V  
Subjt:  RRRLDSQKRGRYKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKN-----------SSTVVPH

Query:  NHHKQKTQIIDPVVKPRRSQPAKHNVSQKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGR
         H K  +  +   +  ++ +   H +     ++   KT + G +R     P S+V          ++ GKKG    +         E  A    GRKR R
Subjt:  NHHKQKTQIIDPVVKPRRSQPAKHNVSQKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSERSATGTRGRKRGR

Query:  TK
         +
Subjt:  TK

AT2G42150.1 DNA-binding bromodomain-containing protein | e value: 2.4e-20 | % identity: 27.49
Query:  ERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------VGCKAWYEELRRKRIMELRQALEHSEDSIGS
        ++ W TWEELLL  AV RHGT  WN V+AE++       + T   C+ KY DL+ RF            +    W EELR+ R+ ELR+ +E  + SI +
Subjt:  ERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------VGCKAWYEELRRKRIMELRQALEHSEDSIGS

Query:  LESKLEALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKRR
        L+SK++ L+    +   +   + +E+    +K+  ++         ++  +        +S +  EI +E    +++++          GS GG  +   
Subjt:  LESKLEALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKRR

Query:  GKRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCN------SFEPHGPS--DANEACRSSAMD-GVGVDVLMAAFNSVAQSKSASVFR
            R +C +       ++  V+  S+ E  L ES + A+  +   +      S    G S  D  +   +SA D  V    L++    +      S F 
Subjt:  GKRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCN------SFEPHGPS--DANEACRSSAMD-GVGVDVLMAAFNSVAQSKSASVFR

Query:  RRLDSQKRGRYKKVIRQHLDIEAIRSRVTSH-YITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIID
        RRL+ Q+   Y  +IR+H+D E IR RV    Y +     +RDLLLL NNA VFY R S E + A  L  ++  +     K  S     +    K +++ 
Subjt:  RRLDSQKRGRYKKVIRQHLDIEAIRSRVTSH-YITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIID

Query:  -PVVKPRRSQP
         P  KP  S+P
Subjt:  -PVVKPRRSQP

AT2G44430.1 DNA-binding bromodomain-containing protein | e value: 4.5e-14 | % identity: 25.24
Query:  WDTWEELLLGGAVLRHGTGDWNLVAAELRAR-IVRPYACTPEVCKAKYEDLQKRF---------------------VGCK-AWYEELRRKRIMELRQALE
        W TWEELLL  AV RHG GDW+ VA E+R+R  +     +   C+ KY DL++RF                     VG    W E+LR  R+ ELR+ +E
Subjt:  WDTWEELLLGGAVLRHGTGDWNLVAAELRAR-IVRPYACTPEVCKAKYEDLQKRF---------------------VGCK-AWYEELRRKRIMELRQALE

Query:  HSEDSIGSLESKLEALK------SRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRG
          + SI SL+ K++ L+          D        RSE+ G+  +     +SA   +       +     A    EE     E    ++  S  +K   
Subjt:  HSEDSIGSLESKLEALK------SRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRG

Query:  ILYGSQGGTVRKRRGKRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKS
                 V K     + +  + +  S  + + +     +GE+  SES                 G +    +  S +        L++  + +     
Subjt:  ILYGSQGGTVRKRRGKRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKS

Query:  ASVFRRRLDSQKRGRYKKVIRQHLDIEAIRSRV-TSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKK--------LFKNSST--
         S+F RRL SQ+   YK +++QHLDIE I+ ++    Y ++    YRDL LL  NA+VF+  +S E  +A  LR +++ + +K        L K  ++  
Subjt:  ASVFRRRLDSQKRGRYKKVIRQHLDIEAIRSRV-TSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKK--------LFKNSST--

Query:  --------VVPHNHHKQKTQIIDPVVKPRRSQPAKHNVSQKEGNLR-DVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSE
                    +  +QK+     V K RRS  AK + S    + + D K +     + NIA    S     K  ++ A+  K G G  ++     SK+ 
Subjt:  --------VVPHNHHKQKTQIIDPVVKPRRSQPAKHNVSQKEGNLR-DVKTQNGGRRRGNIANPHSSVGLPKKETSLGASTGKKGPGATRKAVVGTSKSE

Query:  RSATGTRGRKRGRTK
         S   +  +  G+T+
Subjt:  RSATGTRGRKRGRTK

AT3G57980.1 DNA-binding bromodomain-containing protein | e value: 1.8e-15 | % identity: 27.46
Query:  EELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------------VGCKAWYEELRRKRIMELRQALEHSEDSIGSL
        EELLL  AV RHGT  W+ VA+E+  +       T   C+ KY DL++RF                  +    W EELR+ R+ ELR+ +E  + SI SL
Subjt:  EELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF------------------VGCKAWYEELRRKRIMELRQALEHSEDSIGSL

Query:  ESKLEALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRT----------CSSLECEAAPLSAEEIEIKAE---AEALQDKVSSIEKLRGIL
        + K++ L+          +          +  T +  ++G    E++             S     A   AE ++ +      E   +K +  +  RG  
Subjt:  ESKLEALKSRSGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRT----------CSSLECEAAPLSAEEIEIKAE---AEALQDKVSSIEKLRGIL

Query:  YGSQGGTVRKRRGKRKRKNCNT----NSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVL-MAAFNSVAQ
           +       R + KR+  ++     S        D KE S G+++ S  P   TV Q              +   +S  ++ + V+   ++ F  + Q
Subjt:  YGSQGGTVRKRRGKRKRKNCNT----NSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVL-MAAFNSVAQ

Query:  SKS-ASVFRRRLDSQKRGRYKKVIRQHLDIEAIRSRVTSHYITTLK-ELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVP
        S    S F RRL++Q+   Y ++IRQH+D E IRSRV   Y  T + + +RDLLLL NN  VFY   S E  +A  L  +I  K +  FK     +P
Subjt:  SKS-ASVFRRRLDSQKRGRYKKVIRQHLDIEAIRSRVTSHYITTLK-ELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVP

AT3G60110.1 DNA-binding bromodomain-containing protein | e value: 2.4e-15 | % identity: 24.03
Query:  IERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF--------------------VGCKAWYEELRRKRIMELRQAL
        I++ W TWEEL+L  AV RH   DW+ VA E++AR       +   C+ KY+DL++RF                    VG  +W E+LR   + ELR+ +
Subjt:  IERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRF--------------------VGCKAWYEELRRKRIMELRQAL

Query:  EHSEDSIGSLESKLEALKSR----SGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLE--CEAAPLSAEEIEIKAEAEALQDKVSSIEKLR
        +  +DSI SL+ K++ L+       GD         ++     ++ T ++        E  + +S++   +   L  +++ +KA   +       + K  
Subjt:  EHSEDSIGSLESKLEALKSR----SGDKCVVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLE--CEAAPLSAEEIEIKAEAEALQDKVSSIEKLR

Query:  GILYGSQGGTVRKRRGKRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSK
              +  TV KR         + +  SNC   R  ++   G                        G   A +  +           L+     +    
Subjt:  GILYGSQGGTVRKRRGKRKRKNCNTNSNSNCNSNRDVKEGSIGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSK

Query:  SASVFRRRLDSQKRGRYKKVIRQHLDIEAIRSRV-TSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNH---
          SVF  RL SQ    YK++IRQHLD++ I  ++    Y+++    YRDL LL  NA+VF+  +S E  +A  LR +++++ KK        V  +    
Subjt:  SASVFRRRLDSQKRGRYKKVIRQHLDIEAIRSRV-TSHYITTLKELYRDLLLLANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNH---

Query:  --HKQKTQIIDPVVKPRRSQPAKHNVSQKEGNLRDVK
           +QK+ ++  V   ++S   K          +D K
Subjt:  --HKQKTQIIDPVVKPRRSQPAKHNVSQKEGNLRDVK
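
In the alignments above, the unlabelled middle row appears to follow the usual BLAST midline convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention holds here, a short Python sketch can tally identities and positives for any aligned segment. `alignment_stats` is a hypothetical helper, and the result for a single segment will not in general equal the whole-alignment % identity reported for each hit.

```python
def alignment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one aligned segment.

    Assumes a BLAST-style midline: letters are identities, '+' marks a
    conservative substitution, spaces are mismatches or gaps.
    """
    # Trailing spaces are often stripped from the midline; pad it back out
    # so that it covers the same columns as the query row.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# First ten columns of the first GenBank alignment on this page.
ident, pos, length = alignment_stats("MGTEAIERRW", "MG EA++  W")
print(ident, pos, length)                 # 5 7 10
print(round(100 * ident / length, 2))     # 50.0
```

Summing the three counts over every segment of an alignment before dividing gives a percentage comparable to the reported value, subject to the same assumption about the midline.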


Sequences
CDS sequence
ATGGGAACGGAGGCGATTGAGAGGAGGTGGGATACGTGGGAAGAGCTGCTATTGGGAGGGGCCGTACTCCGGCACGGTACCGGCGACTGGAACCTCGTCGCGGCGGAGCT
CCGAGCGAGGATTGTTCGTCCGTACGCTTGCACACCCGAGGTTTGTAAGGCCAAATATGAAGACTTGCAGAAGCGTTTCGTTGGATGCAAAGCTTGGTACGAGGAGCTAC
GGCGGAAACGAATCATGGAACTAAGGCAAGCTCTGGAACATTCTGAAGATTCAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTCAAGTCTAGGAGTGGAGATAAGTGT
GTTGTCAATAGCTGTAGTAGATCAGAATCTTGGGGAGCTGTTCAGAAACGAACGTCGAACGAGCTATCTGCTGGTAGCTTCACGCAGGAAATCCGAACTTGCAGTTCGCT
CGAATGTGAGGCAGCTCCATTGTCAGCTGAAGAGATTGAGATCAAAGCAGAAGCAGAAGCATTGCAAGACAAAGTTTCGAGCATTGAGAAGTTGAGAGGGATACTATATG
GAAGCCAAGGAGGAACAGTCAGGAAGAGAAGAGGGAAGAGAAAGAGGAAGAATTGTAATACTAATAGCAATAGCAATTGTAATAGTAATAGGGATGTCAAGGAAGGAAGC
ATTGGAGAAAATAACTTGTCCGAATCACCTAATCCTGCTACTGTTTCTCAATCATGCTGCAACTCATTCGAGCCTCATGGACCGTCGGATGCAAATGAAGCCTGCAGGAG
CTCAGCCATGGATGGGGTAGGAGTTGATGTTCTAATGGCTGCTTTTAATTCTGTTGCACAGAGCAAAAGTGCCTCCGTATTTCGTCGTCGCCTCGATAGTCAGAAGAGAG
GAAGATACAAGAAAGTAATCAGGCAACACTTGGATATTGAAGCAATAAGGTCAAGAGTTACAAGTCATTACATAACGACGCTAAAGGAGCTGTACAGAGATCTGCTGCTG
CTTGCTAATAACGCTCTGGTATTTTACTCGAGGAATTCGCGGGAGCACCAGTCTGCTGTGCTTCTCAGAGGTATCATTACAAGTAAATTTAAGAAGCTTTTTAAGAACTC
CAGCACTGTGGTACCCCACAACCACCACAAGCAGAAAACACAGATCATTGATCCGGTGGTGAAACCACGTCGTTCGCAGCCTGCGAAGCATAATGTATCACAGAAAGAAG
GCAATCTAAGAGATGTCAAAACTCAAAATGGCGGAAGAAGAAGGGGTAATATTGCTAATCCCCATTCCTCAGTGGGACTACCAAAGAAAGAGACTTCACTTGGGGCATCC
ACGGGAAAGAAAGGCCCTGGTGCGACAAGAAAGGCTGTCGTTGGGACATCGAAAAGCGAACGATCTGCAACTGGCACCAGGGGAAGGAAAAGAGGGAGAACAAAGTGA
mRNA sequence
CAGCATTGGAAAACGACTAGTCCACCATCCATTGACCATTTTTTTAGAAAATATAGAAGCTTTTAGGATTAATTAGCAAAATACTTTTTTTTTTTCAGGGTGTGAACTCC
TACCTAATCACGTATTCAAAATTTTGTGCTATACTTTTATCATTTGAAGTATCTCTAATTGAATTTTAAAATGTTACAATATTTTATTGATTAGTACTACAATAATATTT
TTACAGTGTTTATAATTTTAAATGAACGCGTATCTTTCATGTTTTTTCCGATTCTCACATCCACGGTGGACTCGGAATCGGATTTTGGGCTCTCGTTATCTTTTTTTATT
TAAATTTTTTTTAGGAGGATTTATTTAATTTTATGGGTTTTTTCTTTTAAAAGTTTTCCCTTTGTCATCTTCGGTCCTTCTTCAACCTCCCCTATTAAAAGTAACGAAGA
GCCCTATTTTTCGCGATCCACCACTAATACCAAAAACCCATTATACAAAATAACCAGAGAAAAAACAAAAACAAAAAAAATTTGAAAGAAAAGGAGAATATGAAATGAAC
CGTCCAACCAACCCATCCGAAACCGCCTTTTCGTACCCCGGAAAACCATATTCCTCGTTAAATACTTCTCCCTTAAACCGCTTTTCACTTTTTCCGAGGCCAGCTAGGGT
TCCGGCGGAGGTTCGGTGATAAATCCGGAGAGAATTCGCCGGCGCGTGAGAGAGGGGGTATGGGAACGGAGGCGATTGAGAGGAGGTGGGATACGTGGGAAGAGCTGCTA
TTGGGAGGGGCCGTACTCCGGCACGGTACCGGCGACTGGAACCTCGTCGCGGCGGAGCTCCGAGCGAGGATTGTTCGTCCGTACGCTTGCACACCCGAGGTTTGTAAGGC
CAAATATGAAGACTTGCAGAAGCGTTTCGTTGGATGCAAAGCTTGGTACGAGGAGCTACGGCGGAAACGAATCATGGAACTAAGGCAAGCTCTGGAACATTCTGAAGATT
CAATAGGGTCATTGGAATCAAAGCTTGAAGCTCTCAAGTCTAGGAGTGGAGATAAGTGTGTTGTCAATAGCTGTAGTAGATCAGAATCTTGGGGAGCTGTTCAGAAACGA
ACGTCGAACGAGCTATCTGCTGGTAGCTTCACGCAGGAAATCCGAACTTGCAGTTCGCTCGAATGTGAGGCAGCTCCATTGTCAGCTGAAGAGATTGAGATCAAAGCAGA
AGCAGAAGCATTGCAAGACAAAGTTTCGAGCATTGAGAAGTTGAGAGGGATACTATATGGAAGCCAAGGAGGAACAGTCAGGAAGAGAAGAGGGAAGAGAAAGAGGAAGA
ATTGTAATACTAATAGCAATAGCAATTGTAATAGTAATAGGGATGTCAAGGAAGGAAGCATTGGAGAAAATAACTTGTCCGAATCACCTAATCCTGCTACTGTTTCTCAA
TCATGCTGCAACTCATTCGAGCCTCATGGACCGTCGGATGCAAATGAAGCCTGCAGGAGCTCAGCCATGGATGGGGTAGGAGTTGATGTTCTAATGGCTGCTTTTAATTC
TGTTGCACAGAGCAAAAGTGCCTCCGTATTTCGTCGTCGCCTCGATAGTCAGAAGAGAGGAAGATACAAGAAAGTAATCAGGCAACACTTGGATATTGAAGCAATAAGGT
CAAGAGTTACAAGTCATTACATAACGACGCTAAAGGAGCTGTACAGAGATCTGCTGCTGCTTGCTAATAACGCTCTGGTATTTTACTCGAGGAATTCGCGGGAGCACCAG
TCTGCTGTGCTTCTCAGAGGTATCATTACAAGTAAATTTAAGAAGCTTTTTAAGAACTCCAGCACTGTGGTACCCCACAACCACCACAAGCAGAAAACACAGATCATTGA
TCCGGTGGTGAAACCACGTCGTTCGCAGCCTGCGAAGCATAATGTATCACAGAAAGAAGGCAATCTAAGAGATGTCAAAACTCAAAATGGCGGAAGAAGAAGGGGTAATA
TTGCTAATCCCCATTCCTCAGTGGGACTACCAAAGAAAGAGACTTCACTTGGGGCATCCACGGGAAAGAAAGGCCCTGGTGCGACAAGAAAGGCTGTCGTTGGGACATCG
AAAAGCGAACGATCTGCAACTGGCACCAGGGGAAGGAAAAGAGGGAGAACAAAGTGAATGGTAAGAAGTTAAAACTTGCTCTGGATAAATATTTTCAGATTAGAACTTGT
AAGTTGTAACCCAGGTAGCTCCAGGTTTCTAGGATGTGTTTGGCTTTGGCCACTGGCCAGATTGGAAAGATAAATGGGAATTGGAATGTTCATTCTATCATTTTGTTTGA
TGGGATTGATCTTTATTTTGATTCTCATTTGTCCGTTGGATGATGGAATGGAAACTTTATCCATTTTCAATGGAAGGTCCGTTTGAATTTATTAGATGCATTGGATTTGA
TGAAGACAAAACTTGAAACTCATTTTATTGAAGCGAATTTTTTAATAATTCATCTATGATGAGAATGGAAT
Protein sequence
MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRKRIMELRQALEHSEDSIGSLESKLEALKSRSGDKC
VVNSCSRSESWGAVQKRTSNELSAGSFTQEIRTCSSLECEAAPLSAEEIEIKAEAEALQDKVSSIEKLRGILYGSQGGTVRKRRGKRKRKNCNTNSNSNCNSNRDVKEGS
IGENNLSESPNPATVSQSCCNSFEPHGPSDANEACRSSAMDGVGVDVLMAAFNSVAQSKSASVFRRRLDSQKRGRYKKVIRQHLDIEAIRSRVTSHYITTLKELYRDLLL
LANNALVFYSRNSREHQSAVLLRGIITSKFKKLFKNSSTVVPHNHHKQKTQIIDPVVKPRRSQPAKHNVSQKEGNLRDVKTQNGGRRRGNIANPHSSVGLPKKETSLGAS
TGKKGPGATRKAVVGTSKSERSATGTRGRKRGRTK
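
The protein sequence above can be checked against the CDS by translating the CDS with the standard genetic code. The following is a minimal Python sketch of that check, assuming the standard nuclear code applies and that the CDS starts in frame; `translate` is a hypothetical helper written for illustration. It is applied to the first 27 and the last 30 nucleotides of the CDS, copied from the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the reading frame
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 27 nt of the CDS: should match the start of the protein sequence.
print(translate("ATGGGAACGGAGGCGATTGAGAGGAGG"))      # MGTEAIERR
# Last 30 nt of the CDS, including the TGA stop codon.
print(translate("AGGGGAAGGAAAAGAGGGAGAACAAAGTGA"))   # RGRKRGRTK
```

Both fragments agree with the first and last residues of the protein sequence; passing the complete CDS with line breaks removed would extend the same check to the full length.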