CuGenDBv2

Sed0025637 (gene) of Chayote v1 genome

Gene ID: Sed0025637
Organism: Sechium edule (Chayote v1)
Description: Bromo domain-containing protein
Genome location: LG01:15459339..15465423
RNA-Seq Expression: Sed0025637
Synteny: Sed0025637
Gene Ontology terms:
GO:0016573 - histone acetylation (biological process)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR036427 - Bromodomain-like superfamily
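The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, and assuming the coordinates are 1-based and inclusive (the record does not say so), the location can be split and the span measured as below. `parse_location` and `span_length` are hypothetical helper names for illustration, not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a string such as 'LG01:15459339..15465423' into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seq_id, start, end = parse_location("LG01:15459339..15465423")
print(seq_id, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 6085 bp on LG01, which is longer than the mRNA listed further down and so is consistent with the locus containing introns.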


Homology
GenBank top hits | e value | % identity | Alignment
XP_004146636.1 uncharacterized protein LOC101217843 isoform X1 [Cucumis sativus] | 7.7e-178 | 75.37
Query:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE
        MGAEA +  WDTW ELLLG A+LRHGT DWNLVA ELR R+ RPYACTPEVCKAKYE+L+KRFVGCKAWYEELRR+R+MELR+ALE SEDSIGSLESKLE
Subjt:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSKSESWGAVQKPMNELSAGSFTQENR-TCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-R
        ALKSRSG DKSLVN S++SESWGAVQKP NELSA SFTQENR TCSS+EC+ APLS +E E+KP+  QS  LE GK SRIGK GEVLYE+QG  +RKR R
Subjt:  ALKSRSG-DKSLVNSSSKSESWGAVQKPMNELSAGSFTQENR-TCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-R

Query:  GKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVIR
        GKRKRKDC+R+VKEGSSGENNLSESANPSTVS SKENSCCNSFE RE SDANEASRSS +DGVDVLMAA N+VA++KSA++FRRRLDS Q+R RYKK+IR
Subjt:  GKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVIR

Query:  QHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNVP
        QHLDIETIRSR+ASH ITT+ ELYRDLLLL NNALV YS +SREHQSAVLLR LI+STF+K + K+SSN  A    NK  QT     KPRR QPAKRN  
Subjt:  QHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNVP

Query:  KKEVDPGDAKTPSGNRRRRSN-ANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
        ++E +PGD KTP GNRRR++N +N  SS+GLAKKETS S +KK  GGT+K V GTSKSERS ATG+RGRKRG+TK
Subjt:  KKEVDPGDAKTPSGNRRRRSN-ANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK

XP_008442126.1 PREDICTED: uncharacterized protein LOC103486076 isoform X1 [Cucumis melo] | 9.1e-179 | 76
Query:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE
        MGAEA  KRWDTW ELLLG A++RHGTGDWNLVA ELR R+ RPY CTPEVCKAKYE+L+KRFVGCKAWYEELR++RIMELR+ALE SEDSIGSLESKLE
Subjt:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSKSESWGAVQKPMNELSAGSFTQENR-TCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-R
        ALKSRSG DKSLVN S++SESWGAVQKP NE SA SFTQENR TCSS+EC+ APL  EE E+KP+  QS  LEWGK  RIGK GEVLYE+QG  +RKR R
Subjt:  ALKSRSG-DKSLVNSSSKSESWGAVQKPMNELSAGSFTQENR-TCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-R

Query:  GKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVIR
        GKRKRKDC+R+VKEGSSGENNLSESANPSTVS SKENSCCNSFE RE+SDANEASRSST+DGVDVLMA  NSVA++KSA++FRRRLDS Q+R RYKK+IR
Subjt:  GKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVIR

Query:  QHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNVP
        QHLDIETIRSR+ASHYITT+KELYRDLLLL NNALV YS +SREHQSAV LR LI+STFQKL+ K+SSN  A    N+  QT     KPRR QPAKRN  
Subjt:  QHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNVP

Query:  KKEVDPGDAKTPSGNRRRRSN-ANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
        ++E +PGD KTP+GNRRRR+N +N  SS+GL+KKETS S  KK  GG +K V GTSKSERS ATG+RGRKRGRTK
Subjt:  KKEVDPGDAKTPSGNRRRRSN-ANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK

XP_022954655.1 uncharacterized protein LOC111456852 isoform X1 [Cucurbita moschata] | 2.0e-178 | 76.63
Query:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE
        MGAEA +KRWDTW+ELLLG A+LRHGT DWNLVAAELR R+VRP A TPEVCKAKYE+LQKRFVGCKAWYEELRRQRIMELR+ALE SEDSIGSLESKLE
Subjt:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-RGK
        ALKSRSGDKSLVNSS +SESWG V KP NELSAGSFTQENRTCSS+ECR+AP   +E E+KP+A Q + LEWGKV                TV+KR RGK
Subjt:  ALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-RGK

Query:  RKRKDC-SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDG--VDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVI
        RKRKDC SRDVKEGS+GENNLSESANPSTVSHSK+NSCCNSFEPRE+SDANEASRSST+DG  VDVLMAA N+VA+NKSA +FRRRLDS QKRGRYKK+I
Subjt:  RKRKDC-SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDG--VDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVI

Query:  RQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNV
        RQHLDIETIRSR+AS YITTQKELYRDLLLL NNALV Y P++RE++SAVLLR LIT+TFQKL  KNS        H+K  QT  Q  K  RLQPAKRN 
Subjt:  RQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNV

Query:  PKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
         +KEV+PGDAKTPSGN RRRSNANS SSVGLAK ETSAS +K+   GT+K VVGTSKSERSAAT  RGRKRGR K
Subjt:  PKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK

XP_022994396.1 uncharacterized protein LOC111490126 isoform X1 [Cucurbita maxima] | 1.6e-183 | 77.52
Query:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE
        MGAEA +KRWDTW+ELLLG A+LRHGT DWNLVAAELR R+VRP A TPEVCKAKYE+LQKRFVGCKAWYEELRRQRI+ELR+ALE SEDSIGSLESKLE
Subjt:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-RGK
        ALKSRSGDKSLVNSS +SESWG V KP NELSAGSFTQENRTCSS+ECR+AP   +E E+KP+A Q + LEWGKV                TV+KR RGK
Subjt:  ALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-RGK

Query:  RKRKDC--SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDG--VDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKV
        RKRKDC  SRDVKEGS+GENNLSESANPSTVSHSK+NSCCNSFEPRE+SDANEASRSST+DG  VDVLMAA N+VA+NKSA +FRRRLDS QKRGRYKK+
Subjt:  RKRKDC--SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDG--VDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKV

Query:  IRQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRN
        IRQHLDIETIRSR+ASHYITTQKELYRDLLLL NNALV Y P++REH+SAVLLR LITSTFQKL  KNS        H K  QT  Q  KP RLQPAKR 
Subjt:  IRQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRN

Query:  VPKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
          +KEV+PGDAKTPSGNRRRRSNANS SSVGLAK ETSAS +K+   GT+K VVGTSKSE+SAATGVRGRKRGRTK
Subjt:  VPKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK

XP_023542669.1 uncharacterized protein LOC111802504 isoform X1 [Cucurbita pepo subsp. pepo] | 2.4e-179 | 76.37
Query:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE
        MGAEA +K+WDTW+ELLLG A+LRHGT DWNLVAAELR R+VRP A TPEVCKAKYE+LQKRFVGCKAWYEELRRQRIMELR+ALE SEDSIGSLESKLE
Subjt:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKRRGKR
        ALKSRSGDKSLVNSS +SESWG V KP NELSAGSFTQENRTCSS+ECR+AP   +E E+KP+A Q + L+WGKV              G   ++ RGKR
Subjt:  ALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKRRGKR

Query:  KRKDC-SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDG--VDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVIR
        KRKDC SRDVKEGS+GENNLSESANPSTVSHSK+NSCCNSFEPRE+SDANEASRSST+DG  VDVLMAA N+VA+NKSA++FRRRLDS QKRGRYKK+IR
Subjt:  KRKDC-SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDG--VDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVIR

Query:  QHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNVP
        QHLDIETIRSR+ASHYITTQKELYRDLLLL NNALV Y P++RE++SAVLLR LITSTFQKL  KNS        H+K  QT  Q  KP RLQPAKRN  
Subjt:  QHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNVP

Query:  KKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
        +KEV+PGDAKTPSGN RRRSNANS SSVGLAK ETSAS +K+   GT+K VVGT KSERSAAT  RGRKRGRTK
Subjt:  KKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
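
The % identity column can be related to the alignment rows above. Assuming the unlabelled middle row follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), a rough sketch for counting matches over one segment is shown here. `midline_stats` is a hypothetical helper, and the two strings are the first 21 columns of the last alignment block of the Cucumis sativus hit.

```python
def midline_stats(query, midline):
    """Return (identities, positives, length) for one aligned segment."""
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    # Letters in the midline mark columns where query and subject are identical.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


query = "KKEVDPGDAKTPSGNRRRRSN"
midline = "++E +PGD KTP GNRRR++N"
identities, positives, length = midline_stats(query, midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```

Summing identities and lengths over every block of a hit, then dividing, should approximate the reported % identity; small differences are possible if the database computes the value over a different alignment span.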

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LV17 Bromo domain-containing protein | 3.7e-178 | 75.37
Query:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE
        MGAEA +  WDTW ELLLG A+LRHGT DWNLVA ELR R+ RPYACTPEVCKAKYE+L+KRFVGCKAWYEELRR+R+MELR+ALE SEDSIGSLESKLE
Subjt:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSKSESWGAVQKPMNELSAGSFTQENR-TCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-R
        ALKSRSG DKSLVN S++SESWGAVQKP NELSA SFTQENR TCSS+EC+ APLS +E E+KP+  QS  LE GK SRIGK GEVLYE+QG  +RKR R
Subjt:  ALKSRSG-DKSLVNSSSKSESWGAVQKPMNELSAGSFTQENR-TCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-R

Query:  GKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVIR
        GKRKRKDC+R+VKEGSSGENNLSESANPSTVS SKENSCCNSFE RE SDANEASRSS +DGVDVLMAA N+VA++KSA++FRRRLDS Q+R RYKK+IR
Subjt:  GKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVIR

Query:  QHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNVP
        QHLDIETIRSR+ASH ITT+ ELYRDLLLL NNALV YS +SREHQSAVLLR LI+STF+K + K+SSN  A    NK  QT     KPRR QPAKRN  
Subjt:  QHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNVP

Query:  KKEVDPGDAKTPSGNRRRRSN-ANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
        ++E +PGD KTP GNRRR++N +N  SS+GLAKKETS S +KK  GGT+K V GTSKSERS ATG+RGRKRG+TK
Subjt:  KKEVDPGDAKTPSGNRRRRSN-ANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK

A0A1S3B4Z1 uncharacterized protein LOC103486076 isoform X1 | 4.4e-179 | 76
Query:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE
        MGAEA  KRWDTW ELLLG A++RHGTGDWNLVA ELR R+ RPY CTPEVCKAKYE+L+KRFVGCKAWYEELR++RIMELR+ALE SEDSIGSLESKLE
Subjt:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE

Query:  ALKSRSG-DKSLVNSSSKSESWGAVQKPMNELSAGSFTQENR-TCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-R
        ALKSRSG DKSLVN S++SESWGAVQKP NE SA SFTQENR TCSS+EC+ APL  EE E+KP+  QS  LEWGK  RIGK GEVLYE+QG  +RKR R
Subjt:  ALKSRSG-DKSLVNSSSKSESWGAVQKPMNELSAGSFTQENR-TCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-R

Query:  GKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVIR
        GKRKRKDC+R+VKEGSSGENNLSESANPSTVS SKENSCCNSFE RE+SDANEASRSST+DGVDVLMA  NSVA++KSA++FRRRLDS Q+R RYKK+IR
Subjt:  GKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVIR

Query:  QHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNVP
        QHLDIETIRSR+ASHYITT+KELYRDLLLL NNALV YS +SREHQSAV LR LI+STFQKL+ K+SSN  A    N+  QT     KPRR QPAKRN  
Subjt:  QHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNVP

Query:  KKEVDPGDAKTPSGNRRRRSN-ANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
        ++E +PGD KTP+GNRRRR+N +N  SS+GL+KKETS S  KK  GG +K V GTSKSERS ATG+RGRKRGRTK
Subjt:  KKEVDPGDAKTPSGNRRRRSN-ANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK

A0A6J1CGL2 uncharacterized protein LOC111010637 | 3.3e-166 | 72.63
Query:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE
        MG EA E+RWDTW+ELLLG AVLRHGTGDWNLVAAELR R+VRPYACTPEVCKAKYE+LQKRFVGCKAWYEELRR+RIMELR+ALE SEDSIGSLESKLE
Subjt:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSKSESWGAVQK-PMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKRRGK
        ALKSRSGDK +VNS S+SESWGAVQK   NELSAGSFTQE RTCSSLEC  APLS EE+E+K +A   Q     KVS I K   +LY SQG TVRKRRGK
Subjt:  ALKSRSGDKSLVNSSSKSESWGAVQK-PMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKRRGK

Query:  RKRKDC----------SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTID--GVDVLMAALNSVADNKSATIFRRRLDSQQ
        RKRK+C          +RDVKEGS GENNLSES NP+TVS     SCCNSFEP   SDANEA RSS +D  GVDVLMAA NSVA +KSA++FRRRLDS Q
Subjt:  RKRKDC----------SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTID--GVDVLMAALNSVADNKSATIFRRRLDSQQ

Query:  KRGRYKKVIRQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPR
        KRGRYKKVIRQHLDIE IRSR+ SHYITT KELYRDLLLL NNALV YS +SREHQSAVLLRG+ITS F+KL  KNSS      HH +  Q      KPR
Subjt:  KRGRYKKVIRQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPR

Query:  RLQPAKRNVPKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETS--ASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
        R QPAK NV +KE +  D KT +G RRR + AN  SSVGL KKETS  AS  KKG G T+K VVGTSKSERS ATG RGRKRGRTK
Subjt:  RLQPAKRNVPKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETS--ASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK

A0A6J1GT05 uncharacterized protein LOC111456852 isoform X1 | 9.9e-179 | 76.63
Query:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE
        MGAEA +KRWDTW+ELLLG A+LRHGT DWNLVAAELR R+VRP A TPEVCKAKYE+LQKRFVGCKAWYEELRRQRIMELR+ALE SEDSIGSLESKLE
Subjt:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-RGK
        ALKSRSGDKSLVNSS +SESWG V KP NELSAGSFTQENRTCSS+ECR+AP   +E E+KP+A Q + LEWGKV                TV+KR RGK
Subjt:  ALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-RGK

Query:  RKRKDC-SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDG--VDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVI
        RKRKDC SRDVKEGS+GENNLSESANPSTVSHSK+NSCCNSFEPRE+SDANEASRSST+DG  VDVLMAA N+VA+NKSA +FRRRLDS QKRGRYKK+I
Subjt:  RKRKDC-SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDG--VDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVI

Query:  RQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNV
        RQHLDIETIRSR+AS YITTQKELYRDLLLL NNALV Y P++RE++SAVLLR LIT+TFQKL  KNS        H+K  QT  Q  K  RLQPAKRN 
Subjt:  RQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNV

Query:  PKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
         +KEV+PGDAKTPSGN RRRSNANS SSVGLAK ETSAS +K+   GT+K VVGTSKSERSAAT  RGRKRGR K
Subjt:  PKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X1 | 7.8e-184 | 77.52
Query:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE
        MGAEA +KRWDTW+ELLLG A+LRHGT DWNLVAAELR R+VRP A TPEVCKAKYE+LQKRFVGCKAWYEELRRQRI+ELR+ALE SEDSIGSLESKLE
Subjt:  MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLE

Query:  ALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-RGK
        ALKSRSGDKSLVNSS +SESWG V KP NELSAGSFTQENRTCSS+ECR+AP   +E E+KP+A Q + LEWGKV                TV+KR RGK
Subjt:  ALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKR-RGK

Query:  RKRKDC--SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDG--VDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKV
        RKRKDC  SRDVKEGS+GENNLSESANPSTVSHSK+NSCCNSFEPRE+SDANEASRSST+DG  VDVLMAA N+VA+NKSA +FRRRLDS QKRGRYKK+
Subjt:  RKRKDC--SRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDG--VDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKV

Query:  IRQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRN
        IRQHLDIETIRSR+ASHYITTQKELYRDLLLL NNALV Y P++REH+SAVLLR LITSTFQKL  KNS        H K  QT  Q  KP RLQPAKR 
Subjt:  IRQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRN

Query:  VPKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
          +KEV+PGDAKTPSGNRRRRSNANS SSVGLAK ETSAS +K+   GT+K VVGTSKSE+SAATGVRGRKRGRTK
Subjt:  VPKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G61215.1 bromodomain 4 | 2.7e-64 | 39.11
Query:  EKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLEALKSRS
        E  W TW+ELLLG AVLRHGTGDW +VA ELR   + P   TPE+CKAKY++L+KR+VGCKAW+EEL+++R+ EL+ AL +SEDSIGSLESKL++LKS S
Subjt:  EKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLEALKSRS

Query:  GDKSLVNSSSKSESWGAVQKPMNE--------------LSAGSFTQENRTCS--SLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQG
         D+   N+   S +      P +E               S GSFTQ+  T +  S E ++      E E   D   +   E    S  G  G+VL     
Subjt:  GDKSLVNSSSKSESWGAVQKPMNE--------------LSAGSFTQENRTCS--SLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQG

Query:  RTVRKRRGKRKRKDCS----RDVKEGSSGENN--LSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLD
         ++RK+RGKRKRKDCS    ++V E S+ E +     SA+ +++  SKE            S ++  SR  ++     LM   N++A N+ A +FRRRLD
Subjt:  RTVRKRRGKRKRKDCS----RDVKEGSSGENN--LSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLD

Query:  SQQKRGRYKKVIRQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKN---------SSNTAAQIHHNK
        S QKRGRYKK++R+H+D++T++SRI    I++ KEL+RD LL+ NNA + YS ++RE++SAV LR ++T + +  L ++         + +T   + H K
Subjt:  SQQKRGRYKKVIRQHLDIETIRSRIASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKN---------SSNTAAQIHHNK

Query:  GIQTSGQTTKPRRLQPAKRNVPKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
            S +T+   + +P     P K V    AKT S     R N  S + + ++  ++SA+  KKGT   K G       E  A   + GRKR R +
Subjt:  GIQTSGQTTKPRRLQPAKRNVPKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK

AT2G42150.1 DNA-binding bromodomain-containing protein | 1.9e-17 | 26.42
Query:  EKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRF------------VGCKAWYEELRRQRIMELRKALERSEDSIGS
        ++ W TW+ELLL  AV RHGT  WN V+AE++       + T   C+ KY +L+ RF            +    W EELR+ R+ ELR+ +E+ + SI +
Subjt:  EKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRF------------VGCKAWYEELRRQRIMELRKALERSEDSIGS

Query:  LESKLEALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVR
        L+SK++ L+    + S +   +++E+    +K   E S       N     +    +P   E      +  +      G  S++   GE        +V 
Subjt:  LESKLEALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVR

Query:  KRRGKRKRKDCSRDVKEGSSGENNLSESAN-PSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLDSQQKRGRYK
        K       +     V E    E+  S      S V  S       + EP +   +  +++  T++    L++ +  +  +   + F RRL+ +Q+   Y 
Subjt:  KRRGKRKRKDCSRDVKEGSSGENNLSESAN-PSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLDSQQKRGRYK

Query:  KVIRQHLDIETIRSRI-ASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPA
         +IR+H+D E IR R+    Y + +   +RDLLLLVNNA V Y   S E + A  L  L+       L   S+     I   K    +  ++KP   +P 
Subjt:  KVIRQHLDIETIRSRI-ASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPA

Query:  KRNVP
        + +VP
Subjt:  KRNVP

AT2G44430.1 DNA-binding bromodomain-containing protein | 9.5e-17 | 25.2
Query:  WDTWDELLLGAAVLRHGTGDWNLVAAELRPR-MVRPYACTPEVCKAKYEELQKRF---------------------VGCK-AWYEELRRQRIMELRKALE
        W TW+ELLL  AV RHG GDW+ VA E+R R  +     +   C+ KY +L++RF                     VG    W E+LR  R+ ELR+ +E
Subjt:  WDTWDELLLGAAVLRHGTGDWNLVAAELRPR-MVRPYACTPEVCKAKYEELQKRF---------------------VGCK-AWYEELRRQRIMELRKALE

Query:  RSEDSIGSLESKLEALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLY
        R + SI SL+ K++ L              + E     +KP  E        EN    S     A  + EE     D       E    +  G+   V  
Subjt:  RSEDSIGSLESKLEALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLY

Query:  ESQGRTVRKRRGKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVD----VLMAALNSVADNKSATIFRRR
        +   +T     G  K  D     K+ ++ E      +  S  SHS E     + E +      +   +  I   +     L++ L+ +  +   ++F RR
Subjt:  ESQGRTVRKRRGKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVD----VLMAALNSVADNKSATIFRRR

Query:  LDSQQKRGRYKKVIRQHLDIETIRSRI-ASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSG
        L SQ+ +  YK +++QHLDIETI+ ++    Y ++    YRDL LL  NA+V +  SS E  +A  LR +++   +K   K       Q       + SG
Subjt:  LDSQQKRGRYKKVIRQHLDIETIRSRI-ASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSG

Query:  QTTKPRRLQPAKRNVPKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGR
          +     + +  ++ +++   G        R   + A+  SS    K +T    + +       GV  + ++ + AA      K G+
Subjt:  QTTKPRRLQPAKRNVPKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGR

AT3G57980.1 DNA-binding bromodomain-containing protein | 2.3e-18 | 28.31
Query:  DELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRF------------------VGCKAWYEELRRQRIMELRKALERSEDSIGSL
        +ELLL  AV RHGT  W+ VA+E+  +       T   C+ KY +L++RF                  +    W EELR+ R+ ELR+ +ER + SI SL
Subjt:  DELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRF------------------VGCKAWYEELRRQRIMELRKALERSEDSIGSL

Query:  ESKLEALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWG--KVSRIGKFGEVLYESQGR--
        + K++ L+    +KSL   +S             +L   + T+EN T S      + +   E++  PD P       G    +R  K  E + E   R  
Subjt:  ESKLEALKSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWG--KVSRIGKFGEVLYESQGR--

Query:  ----TVRKRRGKRKRKDCSRDVKEGSSGE------------NNLSESANPSTVSHSKENSCCNSFEPRETSDANE---ASRSSTIDGVDV----LMAALN
              +  R    R  C    KE    E             ++ ES        + +     SF  +ET D ++     +S T++ + V    L   + 
Subjt:  ----TVRKRRGKRKRKDCSRDVKEGSSGE------------NNLSESANPSTVSHSKENSCCNSFEPRETSDANE---ASRSSTIDGVDV----LMAALN

Query:  SVADNKSATIFRRRLDSQQKRGRYKKVIRQHLDIETIRSRI-ASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLI
         +  +   + F RRL++Q+    Y ++IRQH+D E IRSR+   +Y T + + +RDLLLL+NN  V Y   S E  +A  L  LI
Subjt:  SVADNKSATIFRRRLDSQQKRGRYKKVIRQHLDIETIRSRI-ASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLI

AT3G60110.1 DNA-binding bromodomain-containing protein | 1.8e-15 | 24.69
Query:  WDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRF--------------------VGCKAWYEELRRQRIMELRKALERSE
        W TW+EL+L  AV RH   DW+ VA E++ R       +   C+ KY++L++RF                    VG  +W E+LR   + ELR+ ++R +
Subjt:  WDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRF--------------------VGCKAWYEELRRQRIMELRKALERSE

Query:  DSIGSLESKLEAL-KSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEV----
        DSI SL+ K++ L + + GD        K++    V+  +N  +  S   +NR   S+    +  S +++         + ++  + SR      V    
Subjt:  DSIGSLESKLEAL-KSRSGDKSLVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEV----

Query:  LYESQGRTVRKRRGKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRE-TSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRL
          E + RTV KR                       SE +N   +  S  ++C    + ++  S        S  D    L+  +  +  +   ++F  RL
Subjt:  LYESQGRTVRKRRGKRKRKDCSRDVKEGSSGENNLSESANPSTVSHSKENSCCNSFEPRE-TSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRL

Query:  DSQQKRGRYKKVIRQHLDIETIRSRI-ASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQ
         SQ  +  YK++IRQHLD++TI  ++    Y+++    YRDL LL  NA+V +  SS E  +A  LR L+++  +K           ++ H   I++  +
Subjt:  DSQQKRGRYKKVIRQHLDIETIRSRI-ASHYITTQKELYRDLLLLVNNALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQ

Query:  TTKPRRLQPAKRNVPKKEVDPGDAKT-PSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK
        ++  R+       VP K+      KT PS + R++    SQ         T+A+   + +  T K +   +K  ++       +K+  TK
Subjt:  TTKPRRLQPAKRNVPKKEVDPGDAKT-PSGNRRRRSNANSQSSVGLAKKETSASAIKKGTGGTKKGVVGTSKSERSAATGVRGRKRGRTK


Sequences
CDS sequence
ATGGGAGCAGAGGCGGCAGAGAAGAGATGGGACACGTGGGATGAACTTTTATTAGGAGCCGCCGTACTCCGGCACGGAACCGGCGACTGGAACCTCGTCGCGGCGGAGCT
CCGGCCAAGGATGGTTCGTCCGTACGCCTGCACGCCCGAGGTTTGTAAAGCCAAATATGAAGAATTACAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GGCGGCAAAGAATTATGGAACTAAGAAAAGCTCTAGAGCGTTCTGAAGATTCAATAGGGTCATTAGAATCAAAGCTTGAAGCTCTCAAGTCTAGGAGTGGAGACAAGTCT
CTTGTCAATAGCTCAAGCAAATCAGAATCATGGGGAGCTGTTCAGAAACCAATGAATGAGCTATCCGCCGGTAGCTTCACGCAGGAAAACAGGACGTGCAGCTCGCTCGA
ATGTCGGACAGCTCCATTGTCGAACGAAGAGATGGAGATGAAACCAGATGCGCCGCAGTCGCAGTTTCTCGAATGGGGGAAGGTATCGAGAATCGGGAAGTTTGGAGAGG
TGTTATATGAAAGCCAAGGAAGAACAGTAAGGAAGAGAAGAGGGAAGAGAAAGAGGAAGGATTGTAGTAGGGATGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCT
GAATCAGCTAACCCTTCAACTGTTTCTCATTCTAAAGAAAACTCATGCTGCAACTCGTTTGAGCCTCGGGAAACATCTGATGCAAATGAAGCTAGCAGAAGCTCAACCAT
TGATGGAGTTGATGTACTTATGGCTGCTCTTAATTCTGTTGCCGATAATAAAAGTGCCACGATATTTCGCCGTCGCCTTGATAGTCAGCAGAAGAGAGGAAGATACAAGA
AAGTAATCCGGCAACACTTGGATATTGAAACAATAAGATCAAGAATTGCAAGTCATTACATAACGACACAAAAGGAGCTGTACAGAGATCTGCTGCTGCTTGTTAACAAT
GCTCTCGTTTTGTACTCGCCGAGCTCGCGGGAGCATCAGTCTGCTGTGCTTCTCAGAGGCCTCATTACAAGTACATTTCAAAAGCTTCTTTGTAAGAATTCTAGCAATAC
AGCAGCCCAAATCCACCACAACAAGGGAATACAAACCTCTGGTCAGACAACAAAACCGCGTCGTTTGCAGCCTGCTAAACGCAATGTACCTAAAAAAGAAGTCGATCCAG
GAGATGCCAAAACGCCAAGTGGGAATAGAAGAAGACGAAGTAATGCTAATTCTCAGTCCTCAGTGGGATTAGCAAAGAAAGAAACTTCAGCTTCTGCCATAAAGAAAGGC
ACCGGTGGGACGAAAAAGGGTGTCGTTGGGACGTCGAAAAGCGAACGATCTGCAGCAACTGGTGTTAGGGGAAGGAAAAGAGGGAGAACGAAGTAA
mRNA sequence
ATGGGAGCAGAGGCGGCAGAGAAGAGATGGGACACGTGGGATGAACTTTTATTAGGAGCCGCCGTACTCCGGCACGGAACCGGCGACTGGAACCTCGTCGCGGCGGAGCT
CCGGCCAAGGATGGTTCGTCCGTACGCCTGCACGCCCGAGGTTTGTAAAGCCAAATATGAAGAATTACAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GGCGGCAAAGAATTATGGAACTAAGAAAAGCTCTAGAGCGTTCTGAAGATTCAATAGGGTCATTAGAATCAAAGCTTGAAGCTCTCAAGTCTAGGAGTGGAGACAAGTCT
CTTGTCAATAGCTCAAGCAAATCAGAATCATGGGGAGCTGTTCAGAAACCAATGAATGAGCTATCCGCCGGTAGCTTCACGCAGGAAAACAGGACGTGCAGCTCGCTCGA
ATGTCGGACAGCTCCATTGTCGAACGAAGAGATGGAGATGAAACCAGATGCGCCGCAGTCGCAGTTTCTCGAATGGGGGAAGGTATCGAGAATCGGGAAGTTTGGAGAGG
TGTTATATGAAAGCCAAGGAAGAACAGTAAGGAAGAGAAGAGGGAAGAGAAAGAGGAAGGATTGTAGTAGGGATGTTAAGGAAGGAAGTAGTGGGGAAAATAACTTGTCT
GAATCAGCTAACCCTTCAACTGTTTCTCATTCTAAAGAAAACTCATGCTGCAACTCGTTTGAGCCTCGGGAAACATCTGATGCAAATGAAGCTAGCAGAAGCTCAACCAT
TGATGGAGTTGATGTACTTATGGCTGCTCTTAATTCTGTTGCCGATAATAAAAGTGCCACGATATTTCGCCGTCGCCTTGATAGTCAGCAGAAGAGAGGAAGATACAAGA
AAGTAATCCGGCAACACTTGGATATTGAAACAATAAGATCAAGAATTGCAAGTCATTACATAACGACACAAAAGGAGCTGTACAGAGATCTGCTGCTGCTTGTTAACAAT
GCTCTCGTTTTGTACTCGCCGAGCTCGCGGGAGCATCAGTCTGCTGTGCTTCTCAGAGGCCTCATTACAAGTACATTTCAAAAGCTTCTTTGTAAGAATTCTAGCAATAC
AGCAGCCCAAATCCACCACAACAAGGGAATACAAACCTCTGGTCAGACAACAAAACCGCGTCGTTTGCAGCCTGCTAAACGCAATGTACCTAAAAAAGAAGTCGATCCAG
GAGATGCCAAAACGCCAAGTGGGAATAGAAGAAGACGAAGTAATGCTAATTCTCAGTCCTCAGTGGGATTAGCAAAGAAAGAAACTTCAGCTTCTGCCATAAAGAAAGGC
ACCGGTGGGACGAAAAAGGGTGTCGTTGGGACGTCGAAAAGCGAACGATCTGCAGCAACTGGTGTTAGGGGAAGGAAAAGAGGGAGAACGAAGTAAACTTTATCTTATTA
GATTTCTCAGGCCAGAGCTTGTAACTTTATAGGATCTGTTTTGCGTTAGAAAGTTTGGAAATAGTGGGAATTGGAATACTCATTCTGTCATTTCGTTTTATGGGAATGAT
TTTTGTTTTGATTCTCATTTTATTGAAGCAAAATTTAGAATGAAGTTCAAGCTTCCAGCCACATAAACTGAAATTGAAACTTGCAG
Protein sequence
MGAEAAEKRWDTWDELLLGAAVLRHGTGDWNLVAAELRPRMVRPYACTPEVCKAKYEELQKRFVGCKAWYEELRRQRIMELRKALERSEDSIGSLESKLEALKSRSGDKS
LVNSSSKSESWGAVQKPMNELSAGSFTQENRTCSSLECRTAPLSNEEMEMKPDAPQSQFLEWGKVSRIGKFGEVLYESQGRTVRKRRGKRKRKDCSRDVKEGSSGENNLS
ESANPSTVSHSKENSCCNSFEPRETSDANEASRSSTIDGVDVLMAALNSVADNKSATIFRRRLDSQQKRGRYKKVIRQHLDIETIRSRIASHYITTQKELYRDLLLLVNN
ALVLYSPSSREHQSAVLLRGLITSTFQKLLCKNSSNTAAQIHHNKGIQTSGQTTKPRRLQPAKRNVPKKEVDPGDAKTPSGNRRRRSNANSQSSVGLAKKETSASAIKKG
TGGTKKGVVGTSKSERSAATGVRGRKRGRTK
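
The CDS and protein sequences above can be cross-checked against each other. The sketch below assumes the standard nuclear genetic code and translates only the first 45 nucleotides of the CDS, which should reproduce the first 15 residues of the protein; `translate` is a hypothetical helper written for this illustration rather than a function from any particular library.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 45 nucleotides of the CDS listed above.
cds_start = "ATGGGAGCAGAGGCGGCAGAGAAGAGATGGGACACGTGGGATGAA"
print(translate(cds_start))
```

Joining the CDS lines into one string and passing it to `translate` would allow the full protein to be compared in the same way; the same CDS also appears at the start of the mRNA sequence, followed by the 3' untranslated region.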