; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G26970 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G26970
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionBromo domain-containing protein
Genome locationChr4:23874818..23878991
RNA-Seq ExpressionCSPI04G26970
SyntenyCSPI04G26970
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001487 - Bromodomain
IPR018359 - Bromodomain, conserved site
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608638.1 Bromodomain and PHD finger-containing protein 3, partial [Cucurbita argyrosperma subsp. sororia]2.5e-20180.62Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK
        MKK+KKKGRPSLLDLQKRFLKQQKLQ EQHQ+P +AFD +SNP++P SC+N NVH  TE+V      DDDDERIEKKHKPLLGLTSRQN PTLS YSLRK
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK

Query:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPA-HGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTV
         A Y E+SE+ALKRRR  A QFGSS+V E K  KATD A +GSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFS+PVD N+LPDYHVII+NPMDFGTV
Subjt:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPA-HGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTV

Query:  RTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSL-GIG-NAIDS-NGAEFC
        R KLD GAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAK+DFENLRRESSDESE EQKVVRRGRPPGKS K+ L G+G N ++S  GAE C
Subjt:  RTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSL-GIG-NAIDS-NGAEFC

Query:  SGATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTY-NRSTSCGNWPSV
        SGAT ASG DDS NVNGYNLRRARS+FRPLP DP  RTS A AQHGETLASWLPEWK EFPASVLK VLKSGKNDNMAV+ENRR TY N S SCGN  SV
Subjt:  SGATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTY-NRSTSCGNWPSV

Query:  FGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEML-QQHHLLPLDGGSSDMKTVAES
        FG+LDGDLKQLITVGLHAEHGY RSLALF ADLGP VW IA KKI+  SRELGRVLIQEIEML QQ  +LP DGGS+D KT AES
Subjt:  FGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEML-QQHHLLPLDGGSSDMKTVAES

XP_004145600.1 uncharacterized protein LOC101217603 [Cucumis sativus]0.0e+00100Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK
        MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK

Query:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVR
        LASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVR
Subjt:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVR

Query:  TKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSGAT
        TKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSGAT
Subjt:  TKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSGAT

Query:  LASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFGDLD
        LASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFGDLD
Subjt:  LASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFGDLD

Query:  GDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTANIGVSNNFLKLGEDADNEI
        GDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTANIGVSNNFLKLGEDADNEI
Subjt:  GDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTANIGVSNNFLKLGEDADNEI

Query:  DRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSSSDNTDLLNQTGMP
        DRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSSSDNTDLLNQTGMP
Subjt:  DRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSSSDNTDLLNQTGMP

Query:  KLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL
        KLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL
Subjt:  KLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL

XP_008452972.1 PREDICTED: uncharacterized protein LOC103493819 [Cucumis melo]0.0e+0090.26Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAA--ADDDDERIEKKHKPLLGLTSRQNCPTLSGYSL
        MKKKKKKGRPSLLDLQKRFLKQQKLQ EQHQEP +AF F SNP+NPPSCRN NVH  TEQVPITAA  ADDDDERIEKKHKPLLGLTSRQNCPTLSGYSL
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAA--ADDDDERIEKKHKPLLGLTSRQNCPTLSGYSL

Query:  RKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGT
        RK ASYGEESETALKRRRTGAAQFGSSQVREDKALKATD AHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGT
Subjt:  RKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGT

Query:  VRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSG
        VRTKLD GAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLG+ N I+SNGAEFCSG
Subjt:  VRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSG

Query:  ATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTST-AQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFG
        ATLASGCDDSYNVNGY+LRRARSTFRPLPADPLARTS+ AQ QHGETLASWLPEWK EFPASVLKGVLKSGKNDNMAVNENRRDTYNRS SCGNWPSVFG
Subjt:  ATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTST-AQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFG

Query:  DLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTA----------NIGVSN
        DLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKI+ ISRELGRVLI EIEMLQQH LLPLDGGSSDMKTVA+STA          NIGVSN
Subjt:  DLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTA----------NIGVSN

Query:  NFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSS
        NFLKLGEDA +EIDRVRN ES+TVLLDRSRG+ GSTTCIPNEQN++VPSNIH TNGNLFPHF QEM+MVRLDSILGGTS S+YSTCP+ LNNASFQ+PSS
Subjt:  NFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSS

Query:  SDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL
        SDNTDLL+Q GMPKLAEE ASQSHA  HSPAR   QD+VDAQQDKRSEKAHWQELSTRPVLDSI F+ DLNFGLGLSAAP+SNLQILSQIQPDLVLQL
Subjt:  SDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL

XP_022137702.1 uncharacterized protein LOC111009072 [Momordica charantia]1.7e-25068.64Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAA----DDDDERIEKKHKPLLGLTSRQNCPTLSGY
        MKK+KKKGRPSL+DLQKRFLKQQKLQ E+H +P +AF+F+SNPR P S R+ NVH   E V          DDDDERIEKKHKPLLGLTS QN PTLS  
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAA----DDDDERIEKKHKPLLGLTSRQNCPTLSGY

Query:  SLRKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDF
         LR  AS  E+SE ALKRRR GA QFGS +V E KALKATD  HGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPN+LPDYHVII+NPMDF
Subjt:  SLRKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDF

Query:  GTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFC
        GTVR KLD GAYANLEQFEEDIFLICSNAMKYN SDTVF+RQA++IQELAKKDFENLR++SSD+SEPEQKVVRRGRPPGKS K+SL +GN I+S G EFC
Subjt:  GTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFC

Query:  S-GATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSV
        +   TLASGCDDS +VNGYNLRR+RS+FRPL +DPL RTS+   QHGETLASWLPEWK EFPASVLKGVLKSGKNDNMAV+ENRRDTYN S   GNWPSV
Subjt:  S-GATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSV

Query:  FGDLDGDLKQLITVGLHAEHGYARSLALFAAD-LGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGS-SDMKTVAEST----------ANI
        FGD DGDLKQLITV     H    SL  F  + +   +W IA KKI+ ISRELG VL QEIEML+Q  +LP+DGGS SD KTVAES           +N 
Subjt:  FGDLDGDLKQLITVGLHAEHGYARSLALFAAD-LGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGS-SDMKTVAEST----------ANI

Query:  GVSNNFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPL------
        G+S+NFLK  E  D EIDR R+ +S+T+LLDRSRG + STTCIPNE+  ++PSNIH    N  PHFS EM+MVRLDSI+GGTSCS+ S+ P  +      
Subjt:  GVSNNFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPL------

Query:  -NNASFQVPSSSDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKR--SEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQIL
         N ASFQ+PS   + DLL+   M +L+ ED S S    HSPAR   Q+ ++A++D++   EK  WQELST PVLDS+ F+PDLNFGLG S AP+SNLQIL
Subjt:  -NNASFQVPSSSDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKR--SEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQIL

Query:  SQIQPDLVLQL
        SQIQPDLVLQL
Subjt:  SQIQPDLVLQL

XP_038900278.1 uncharacterized protein LOC120087359 [Benincasa hispida]0.0e+0082.02Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAA------DDDDERIEKKHKPLLGLTSRQNCPTLS
        MKKKKKKGRPSLLDLQKRFLKQQKLQ EQHQ+P +AFDF+SNP NP SCRN NV   T++VP  AAA      DDDDERIEKKHKPLLGLTSRQN PT S
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAA------DDDDERIEKKHKPLLGLTSRQNCPTLS

Query:  GYSLRKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPM
        GYSLRK ASYGE+SE  LKRRR GAAQFGSS+VRE+KALKATD AHG QVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPN+LPDYH IIQNPM
Subjt:  GYSLRKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPM

Query:  DFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAE
        DFGTVR KLD GAYANL+QFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLR+ESSDESEPEQKVVRRGRPPGKSLK+SLG+GN I+SNGAE
Subjt:  DFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAE

Query:  FCSGATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPS
        FCSGAT ASGCDDSYNVNGYNLRR+RS FRPLPADPLARTST  AQHGETLASWLPEWK EFPASVLKGVLKSGKNDNMAVNENRRDTYN + SCGNWPS
Subjt:  FCSGATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPS

Query:  VFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAEST----------ANIG
        VFG+  GDLKQLITVGLHAEHGYARSLALFAADLGP VWNIAL+KI+ ISRELGRVLIQEIEM QQ+ +LPLDGGSS MKTVAEST          +NIG
Subjt:  VFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAEST----------ANIG

Query:  VSNNFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQN---ILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSE--------YSTC
        VSNN LKLGEDADNEIDRVRN+ES+TVLLDRSRG+IGSTTCIPNEQN   ++VPSNI  TNGNLFPHFSQEM+MVRLDSILGGTSCS+        + T 
Subjt:  VSNNFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQN---ILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSE--------YSTC

Query:  PAPLNNASFQVPSSSDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQI
        PA +NNASFQ+P+ + + DLLNQ GMPKLAEE ASQSH P HS +   FQD++D QQD+R+EK   QELST PVLDSI F+PDLNFGLGLSAAP+SNLQI
Subjt:  PAPLNNASFQVPSSSDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQI

Query:  LSQIQPDLVLQL
        LSQIQPDLVLQL
Subjt:  LSQIQPDLVLQL

TrEMBL top hitse value%identityAlignment
A0A0A0L1D3 Bromo domain-containing protein0.0e+00100Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK
        MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK

Query:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVR
        LASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVR
Subjt:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVR

Query:  TKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSGAT
        TKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSGAT
Subjt:  TKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSGAT

Query:  LASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFGDLD
        LASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFGDLD
Subjt:  LASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFGDLD

Query:  GDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTANIGVSNNFLKLGEDADNEI
        GDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTANIGVSNNFLKLGEDADNEI
Subjt:  GDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTANIGVSNNFLKLGEDADNEI

Query:  DRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSSSDNTDLLNQTGMP
        DRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSSSDNTDLLNQTGMP
Subjt:  DRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSSSDNTDLLNQTGMP

Query:  KLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL
        KLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL
Subjt:  KLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL

A0A1S3BVX0 uncharacterized protein LOC1034938190.0e+0090.26Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAA--ADDDDERIEKKHKPLLGLTSRQNCPTLSGYSL
        MKKKKKKGRPSLLDLQKRFLKQQKLQ EQHQEP +AF F SNP+NPPSCRN NVH  TEQVPITAA  ADDDDERIEKKHKPLLGLTSRQNCPTLSGYSL
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAA--ADDDDERIEKKHKPLLGLTSRQNCPTLSGYSL

Query:  RKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGT
        RK ASYGEESETALKRRRTGAAQFGSSQVREDKALKATD AHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGT
Subjt:  RKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGT

Query:  VRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSG
        VRTKLD GAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLG+ N I+SNGAEFCSG
Subjt:  VRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSG

Query:  ATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTST-AQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFG
        ATLASGCDDSYNVNGY+LRRARSTFRPLPADPLARTS+ AQ QHGETLASWLPEWK EFPASVLKGVLKSGKNDNMAVNENRRDTYNRS SCGNWPSVFG
Subjt:  ATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTST-AQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFG

Query:  DLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTA----------NIGVSN
        DLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKI+ ISRELGRVLI EIEMLQQH LLPLDGGSSDMKTVA+STA          NIGVSN
Subjt:  DLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTA----------NIGVSN

Query:  NFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSS
        NFLKLGEDA +EIDRVRN ES+TVLLDRSRG+ GSTTCIPNEQN++VPSNIH TNGNLFPHF QEM+MVRLDSILGGTS S+YSTCP+ LNNASFQ+PSS
Subjt:  NFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSS

Query:  SDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL
        SDNTDLL+Q GMPKLAEE ASQSHA  HSPAR   QD+VDAQQDKRSEKAHWQELSTRPVLDSI F+ DLNFGLGLSAAP+SNLQILSQIQPDLVLQL
Subjt:  SDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL

A0A5A7VEA9 Bromodomain domain-containing protein0.0e+0090.26Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAA--ADDDDERIEKKHKPLLGLTSRQNCPTLSGYSL
        MKKKKKKGRPSLLDLQKRFLKQQKLQ EQHQEP +AF F SNP+NPPSCRN NVH  TEQVPITAA  ADDDDERIEKKHKPLLGLTSRQNCPTLSGYSL
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAA--ADDDDERIEKKHKPLLGLTSRQNCPTLSGYSL

Query:  RKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGT
        RK ASYGEESETALKRRRTGAAQFGSSQVREDKALKATD AHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGT
Subjt:  RKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGT

Query:  VRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSG
        VRTKLD GAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLG+ N I+SNGAEFCSG
Subjt:  VRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSG

Query:  ATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTST-AQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFG
        ATLASGCDDSYNVNGY+LRRARSTFRPLPADPLARTS+ AQ QHGETLASWLPEWK EFPASVLKGVLKSGKNDNMAVNENRRDTYNRS SCGNWPSVFG
Subjt:  ATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTST-AQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFG

Query:  DLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTA----------NIGVSN
        DLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKI+ ISRELGRVLI EIEMLQQH LLPLDGGSSDMKTVA+STA          NIGVSN
Subjt:  DLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTA----------NIGVSN

Query:  NFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSS
        NFLKLGEDA +EIDRVRN ES+TVLLDRSRG+ GSTTCIPNEQN++VPSNIH TNGNLFPHF QEM+MVRLDSILGGTS S+YSTCP+ LNNASFQ+PSS
Subjt:  NFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSS

Query:  SDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL
        SDNTDLL+Q GMPKLAEE ASQSHA  HSPAR   QD+VDAQQDKRSEKAHWQELSTRPVLDSI F+ DLNFGLGLSAAP+SNLQILSQIQPDLVLQL
Subjt:  SDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQILSQIQPDLVLQL

A0A6J1C905 uncharacterized protein LOC1110090728.2e-25168.64Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAA----DDDDERIEKKHKPLLGLTSRQNCPTLSGY
        MKK+KKKGRPSL+DLQKRFLKQQKLQ E+H +P +AF+F+SNPR P S R+ NVH   E V          DDDDERIEKKHKPLLGLTS QN PTLS  
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAA----DDDDERIEKKHKPLLGLTSRQNCPTLSGY

Query:  SLRKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDF
         LR  AS  E+SE ALKRRR GA QFGS +V E KALKATD  HGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPN+LPDYHVII+NPMDF
Subjt:  SLRKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDF

Query:  GTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFC
        GTVR KLD GAYANLEQFEEDIFLICSNAMKYN SDTVF+RQA++IQELAKKDFENLR++SSD+SEPEQKVVRRGRPPGKS K+SL +GN I+S G EFC
Subjt:  GTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFC

Query:  S-GATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSV
        +   TLASGCDDS +VNGYNLRR+RS+FRPL +DPL RTS+   QHGETLASWLPEWK EFPASVLKGVLKSGKNDNMAV+ENRRDTYN S   GNWPSV
Subjt:  S-GATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSV

Query:  FGDLDGDLKQLITVGLHAEHGYARSLALFAAD-LGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGS-SDMKTVAEST----------ANI
        FGD DGDLKQLITV     H    SL  F  + +   +W IA KKI+ ISRELG VL QEIEML+Q  +LP+DGGS SD KTVAES           +N 
Subjt:  FGDLDGDLKQLITVGLHAEHGYARSLALFAAD-LGPAVWNIALKKIKGISRELGRVLIQEIEMLQQHHLLPLDGGS-SDMKTVAEST----------ANI

Query:  GVSNNFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPL------
        G+S+NFLK  E  D EIDR R+ +S+T+LLDRSRG + STTCIPNE+  ++PSNIH    N  PHFS EM+MVRLDSI+GGTSCS+ S+ P  +      
Subjt:  GVSNNFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFSQEMKMVRLDSILGGTSCSEYSTCPAPL------

Query:  -NNASFQVPSSSDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKR--SEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQIL
         N ASFQ+PS   + DLL+   M +L+ ED S S    HSPAR   Q+ ++A++D++   EK  WQELST PVLDS+ F+PDLNFGLG S AP+SNLQIL
Subjt:  -NNASFQVPSSSDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKR--SEKAHWQELSTRPVLDSIAFDPDLNFGLGLSAAPNSNLQIL

Query:  SQIQPDLVLQL
        SQIQPDLVLQL
Subjt:  SQIQPDLVLQL

A0A6J1FLH0 uncharacterized protein LOC111446319 isoform X11.3e-20080.45Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK
        MKK+KKKGRPSLLDLQKRFLKQQKLQ EQHQ+P +AFD +SNP++P SC+N NVH  TE+V      DDDDERIEKKHKPLLGLTSRQN PTLS YSLRK
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK

Query:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPA-HGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTV
         A Y E+SE+ALKRRR  A QFGSS+V E K  KATD A +GSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFS+PVD N+LPDYHVII+NPMDFGTV
Subjt:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPA-HGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTV

Query:  RTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSL-GIG-NAIDS-NGAEFC
        R KLD GAY+NLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAK+DFENLRRESSDESE EQKVVRRGRPPGKS K+ L G+G N ++S  GAE C
Subjt:  RTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSL-GIG-NAIDS-NGAEFC

Query:  SGATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTY-NRSTSCGNWPSV
        SGAT ASG DDS NVNGYNLRRARS+FRPLP DP  RTS A AQHGETLASWLPEWK EFPASVLKGVLKSGKNDNMAV+ENRR TY N S SCGN  SV
Subjt:  SGATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTY-NRSTSCGNWPSV

Query:  FGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEML-QQHHLLPLD-GGSSDMKTVAES
        FG+LDGDLKQLITVGLHAEHGY RSLALF ADLGP VW IA KKI+  SRELGRVLIQEIEML QQ  +LP D GGS+D KT AES
Subjt:  FGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGISRELGRVLIQEIEML-QQHHLLPLD-GGSSDMKTVAES

SwissProt top hitse value%identityAlignment
B2KF05 Bromodomain and PHD finger-containing protein 32.5e-1844.44Show/hide
Query:  LPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFE
        +P   LL   LD LQ+KD+  +F+EPV  +++PDY   I  PMDF T+R KL+   Y  LE+FEED  LI +N MKYNA DT+F R A  +++L      
Subjt:  LPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFE

Query:  NLRRESSD
        + RR++ +
Subjt:  NLRRESSD

B2RRD7 Peregrin1.4e-1644.55Show/hide
Query:  LLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRE
        LL   L++LQ+KDT  +FSEPV  +++PDY   I+ PMDF T++  L+   Y N + FEED  LI SN +KYNA DT+F+R A  ++E         RR+
Subjt:  LLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRE

Query:  S
        +
Subjt:  S

O95696 Bromodomain-containing protein 13.6e-1746Show/hide
Query:  LLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRE
        LL  +LD+LQ KD   +F++PV   ++PDY   I++PMDF T+R +L+   Y NL +FEED  LI  N MKYNA DTVF+R A  +++         RRE
Subjt:  LLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRE

P55201 Peregrin1.4e-1644.55Show/hide
Query:  LLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRE
        LL   L++LQ+KDT  +FSEPV  +++PDY   I+ PMDF T++  L+   Y N + FEED  LI SN +KYNA DT+F+R A  ++E         RR+
Subjt:  LLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRE

Query:  S
        +
Subjt:  S

Q9ULD4 Bromodomain and PHD finger-containing protein 31.9e-1844.44Show/hide
Query:  LPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFE
        +P   LL   LD LQ+KD   +F+EPV+ +++PDY   I  PMDF T+R KL+   Y  LE+FEED  LI +N MKYNA DT+F R A  +++L      
Subjt:  LPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFE

Query:  NLRRESSD
        + RR++ +
Subjt:  NLRRESSD

Arabidopsis top hitse value%identityAlignment
AT1G20670.1 DNA-binding bromodomain-containing protein9.1e-10148.08Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLK--QQKLQEEQ------HQEPSDAFDFSSN---PRNPPSC------RNLNVHSG-TEQVPITAAADDDDERIEKKHKPLL
        M KKKKKGRPSLLDLQKR +K  QQ+LQ++Q      HQ+  D    ++N    +NP S       RN N + G +  +      +DDDER EKKHK L 
Subjt:  MKKKKKKGRPSLLDLQKRFLK--QQKLQEEQ------HQEPSDAFDFSSN---PRNPPSC------RNLNVHSG-TEQVPITAAADDDDERIEKKHKPLL

Query:  GLTSRQNCPTLSGYSLRKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPND
        GL S  +  + +  S   L    +  E +  RR   AA  GS     +KA KATD   GS VESGPTT LPDKKLL+FILDRLQKKDT+GV+S+PVDP +
Subjt:  GLTSRQNCPTLSGYSLRKLASYGEESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPND

Query:  LPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQ-------KVVRRGRPPG
        LPDY  II+NPMDF T+R KLD GAY+ LEQFE D+FLIC+NAM+YN++DTV++RQAR+IQELAKKDFENLR++S DE    Q       KV RRGRPP 
Subjt:  LPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQ-------KVVRRGRPPG

Query:  KSLKKSLGIGNAIDSNGAEFCSGATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAV
        K  + S     +ID   +E  + A +     + ++   YNLR+A  +++   A+   R +     + ET + W  +W+ EFP+SV+K V K G   +  V
Subjt:  KSLKKSLGIGNAIDSNGAEFCSGATLASGCDDSYNVNGYNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAV

Query:  NENRRDTYNRSTSCGNWPSVFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGI
        ++NRRDTYN  ++    PSV   L+ +LKQLI VGL+ E+GYA+SLA +AA+LGP  W IA ++I+ +
Subjt:  NENRRDTYNRSTSCGNWPSVFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIKGI

AT1G76380.1 DNA-binding bromodomain-containing protein5.2e-8845.3Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK
        + K+KKKGRPSLLDLQKR LKQQ+L + ++    +  +  S+ RNP      N  + + +   + + DDDDER +KKH+ L GL S +      G     
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK

Query:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATD-PAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTV
          S G + ++  + RR    +   S    +KA KATD     S VES P   LPDKKLL FILDR+QKKDT+GV+S+P DP +LPDY+ II+NPMDF T+
Subjt:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATD-PAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTV

Query:  RTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDES----EPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEF
        R KL+ GAY  LEQFE+D+FLIC+NAM+YN++DTV++RQAR++ ELAKKDF NLR+ES  E       + KVV+RGRPPG  LKK L   + ID   ++ 
Subjt:  RTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDES----EPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEF

Query:  CSGATLASGCDDSYNVNG-YNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPS
         + A   +   DS  ++G YNLR+   ++        A TS     + E  +  L +W+ EFP SV+K V K G  +   V+ENRRDTYN++++     S
Subjt:  CSGATLASGCDDSYNVNG-YNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPS

Query:  VFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVW---NIALKKIKGISRELG
        +F  LD +LKQL  VGL AE+GYARSLA +AA++GP  W   N+ ++K+     E G
Subjt:  VFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVW---NIALKKIKGISRELG

AT1G76380.2 DNA-binding bromodomain-containing protein8.0e-8945.41Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK
        + K+KKKGRPSLLDLQKR LKQQ+L + ++    +  +  S+ RNP      N  + + +   + + DDDDER +KKH+ L GL S +      G     
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK

Query:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATD--PAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGT
          S G + ++  + RR    +   S    +KA KATD     GS VES P   LPDKKLL FILDR+QKKDT+GV+S+P DP +LPDY+ II+NPMDF T
Subjt:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATD--PAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGT

Query:  VRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDES----EPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAE
        +R KL+ GAY  LEQFE+D+FLIC+NAM+YN++DTV++RQAR++ ELAKKDF NLR+ES  E       + KVV+RGRPPG  LKK L   + ID   ++
Subjt:  VRTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDES----EPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAE

Query:  FCSGATLASGCDDSYNVNG-YNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWP
          + A   +   DS  ++G YNLR+   ++        A TS     + E  +  L +W+ EFP SV+K V K G  +   V+ENRRDTYN++++     
Subjt:  FCSGATLASGCDDSYNVNG-YNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWP

Query:  SVFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVW---NIALKKIKGISRELG
        S+F  LD +LKQL  VGL AE+GYARSLA +AA++GP  W   N+ ++K+     E G
Subjt:  SVFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVW---NIALKKIKGISRELG

AT1G76380.3 DNA-binding bromodomain-containing protein4.4e-8745.08Show/hide
Query:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK
        + K+KKKGRPSLLDLQKR LKQQ+L + ++    +  +  S+ RNP      N  + + +   + + DDDDER +KKH+ L GL S +      G     
Subjt:  MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRK

Query:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATD-PAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTV
          S G + ++  + RR    +   S    +KA KATD     S VES P   LPDKKLL FILDR+QKKDT+GV+S+P DP +LPDY+ II+NPMDF T+
Subjt:  LASYGEESETALKRRRTGAAQFGSSQVREDKALKATD-PAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTV

Query:  RTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDES----EPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEF
        R KL+ GAY  LEQFE ++FLIC+NAM+YN++DTV++RQAR++ ELAKKDF NLR+ES  E       + KVV+RGRPPG  LKK L   + ID   ++ 
Subjt:  RTKLDEGAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDES----EPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEF

Query:  CSGATLASGCDDSYNVNG-YNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPS
         + A   +   DS  ++G YNLR+   ++        A TS     + E  +  L +W+ EFP SV+K V K G  +   V+ENRRDTYN++++     S
Subjt:  CSGATLASGCDDSYNVNG-YNLRRARSTFRPLPADPLARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPS

Query:  VFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVW---NIALKKIKGISRELG
        +F  LD +LKQL  VGL AE+GYARSLA +AA++GP  W   N+ ++K+     E G
Subjt:  VFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVW---NIALKKIKGISRELG

AT5G55040.1 DNA-binding bromodomain-containing protein1.6e-5237.89Show/hide
Query:  EESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDE
        EE +  ++           S+  +DK  K    +      S  TT + DKK L  ILD+LQKKD +GV++EPVDP +LPDYH +I++PMDF TVR KL  
Subjt:  EESETALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDE

Query:  GAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLR---RESSDESEPEQKVVRRGRPPGKSLKKSLG---IGNAIDSNGAEFCSGA
        G+Y+ LE+ E D+ LICSNAM+YN+SDTV+++QAR+IQE+ K+ FE  R   + +  E + ++KV      P  S+KK +      N +++ G++F SGA
Subjt:  GAYANLEQFEEDIFLICSNAMKYNASDTVFFRQARSIQELAKKDFENLR---RESSDESEPEQKVVRRGRPPGKSLKKSLG---IGNAIDSNGAEFCSGA

Query:  TLASGCDD-----SYNVNGYNLRRARSTFRPLPADPLARTSTA---QAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGN
         LASG        S  + G+         +    D L   +T+     +  E L+S              KG+          V E+RR TY  S   G+
Subjt:  TLASGCDD-----SYNVNGYNLRRARSTFRPLPADPLARTSTA---QAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGN

Query:  -WPSVFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIK
           S+F   + ++KQ + VGLHAEH Y RSLA FAA LGP  W IA ++I+
Subjt:  -WPSVFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGAAGAAGAAAAAGGGTCGCCCTTCTCTTTTAGATCTCCAAAAGCGCTTCCTCAAACAGCAAAAGTTGCAGGAGGAACAGCATCAAGAACCGTCTGACGCTTT
CGACTTTTCCTCCAACCCTAGGAACCCACCTTCTTGTCGGAATCTCAATGTTCACTCCGGAACAGAGCAAGTCCCCATCACTGCCGCTGCCGACGACGACGACGAGCGCA
TTGAGAAGAAGCATAAGCCCTTACTCGGTTTAACGTCTCGTCAAAACTGTCCAACATTGTCCGGTTATTCTTTGCGTAAATTGGCCTCCTATGGCGAAGAGTCCGAGACA
GCCCTCAAGCGTCGCAGAACCGGCGCCGCCCAGTTTGGATCTTCTCAAGTGAGAGAAGATAAAGCTTTGAAAGCGACAGACCCTGCACATGGGTCGCAGGTGGAGTCTGG
TCCCACGACAACTTTGCCAGACAAAAAGTTGTTGATTTTCATCCTTGATAGGCTTCAAAAAAAAGACACCCATGGAGTATTTTCCGAACCCGTGGATCCAAACGATCTTC
CCGATTACCATGTCATTATTCAAAATCCTATGGATTTTGGGACTGTGAGAACGAAATTAGATGAAGGAGCTTACGCAAATTTGGAACAGTTTGAAGAAGACATTTTTTTG
ATATGTTCGAATGCAATGAAGTATAATGCTTCTGATACTGTTTTCTTCCGTCAGGCACGATCAATACAAGAACTTGCAAAGAAGGATTTTGAGAATTTGAGGCGAGAGAG
CAGCGATGAAAGTGAACCAGAACAGAAAGTTGTGAGGAGAGGTAGGCCTCCAGGAAAGAGCCTGAAGAAATCATTAGGTATTGGTAACGCTATTGACAGTAATGGAGCAG
AGTTCTGCTCAGGTGCTACTCTTGCTTCTGGATGTGATGATTCCTACAATGTCAATGGTTACAATTTGAGAAGAGCTCGTTCCACCTTCAGACCCCTGCCTGCAGATCCT
CTAGCAAGAACATCTACAGCACAAGCTCAACATGGCGAAACTTTGGCCAGCTGGTTGCCTGAATGGAAAATTGAGTTTCCAGCTTCTGTTTTAAAGGGTGTTCTTAAAAG
TGGAAAGAATGATAATATGGCTGTGAATGAGAATAGACGTGATACCTATAATCGTTCCACGTCATGTGGGAATTGGCCATCTGTCTTTGGTGACCTTGATGGAGACTTGA
AGCAACTAATTACTGTGGGTTTGCATGCCGAGCATGGTTATGCAAGAAGCTTAGCTCTCTTTGCGGCCGATCTTGGTCCTGCAGTTTGGAATATTGCTTTAAAGAAAATC
AAAGGTATTAGTCGGGAGCTGGGGCGAGTATTAATTCAAGAAATTGAAATGCTACAGCAGCATCATTTGTTGCCTCTGGATGGAGGCTCATCCGACATGAAAACAGTGGC
AGAAAGCACGGCCAACATTGGTGTTTCAAATAATTTTTTGAAACTCGGCGAAGATGCTGATAATGAAATTGATAGAGTGAGAAATGCTGAAAGCAAGACAGTGTTGCTAG
ACAGAAGTAGAGGTTTAATAGGATCAACAACTTGTATACCGAATGAACAGAACATACTAGTTCCTTCAAACATCCACCCCACAAATGGTAATCTTTTCCCCCATTTTTCA
CAAGAAATGAAAATGGTAAGACTTGATTCTATTCTTGGTGGGACGTCTTGCTCAGAATACTCTACATGCCCAGCTCCTCTCAACAATGCTTCTTTCCAAGTCCCTTCCAG
TTCAGATAATACGGATTTGTTAAACCAAACTGGAATGCCAAAACTTGCAGAAGAAGATGCCTCTCAGTCACACGCTCCGTGGCATTCACCAGCCCGCGCCTGCTTTCAGG
ACACAGTCGATGCACAACAAGACAAGCGTAGTGAAAAGGCACATTGGCAAGAATTATCAACACGTCCTGTGCTAGATTCAATCGCCTTCGATCCTGACCTTAATTTTGGA
TTAGGCTTATCTGCTGCACCCAATTCTAATCTGCAGATTCTTTCTCAAATTCAACCCGATTTAGTATTGCAGCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGAAGAAGAAAAAGGGTCGCCCTTCTCTTTTAGATCTCCAAAAGCGCTTCCTCAAACAGCAAAAGTTGCAGGAGGAACAGCATCAAGAACCGTCTGACGCTTT
CGACTTTTCCTCCAACCCTAGGAACCCACCTTCTTGTCGGAATCTCAATGTTCACTCCGGAACAGAGCAAGTCCCCATCACTGCCGCTGCCGACGACGACGACGAGCGCA
TTGAGAAGAAGCATAAGCCCTTACTCGGTTTAACGTCTCGTCAAAACTGTCCAACATTGTCCGGTTATTCTTTGCGTAAATTGGCCTCCTATGGCGAAGAGTCCGAGACA
GCCCTCAAGCGTCGCAGAACCGGCGCCGCCCAGTTTGGATCTTCTCAAGTGAGAGAAGATAAAGCTTTGAAAGCGACAGACCCTGCACATGGGTCGCAGGTGGAGTCTGG
TCCCACGACAACTTTGCCAGACAAAAAGTTGTTGATTTTCATCCTTGATAGGCTTCAAAAAAAAGACACCCATGGAGTATTTTCCGAACCCGTGGATCCAAACGATCTTC
CCGATTACCATGTCATTATTCAAAATCCTATGGATTTTGGGACTGTGAGAACGAAATTAGATGAAGGAGCTTACGCAAATTTGGAACAGTTTGAAGAAGACATTTTTTTG
ATATGTTCGAATGCAATGAAGTATAATGCTTCTGATACTGTTTTCTTCCGTCAGGCACGATCAATACAAGAACTTGCAAAGAAGGATTTTGAGAATTTGAGGCGAGAGAG
CAGCGATGAAAGTGAACCAGAACAGAAAGTTGTGAGGAGAGGTAGGCCTCCAGGAAAGAGCCTGAAGAAATCATTAGGTATTGGTAACGCTATTGACAGTAATGGAGCAG
AGTTCTGCTCAGGTGCTACTCTTGCTTCTGGATGTGATGATTCCTACAATGTCAATGGTTACAATTTGAGAAGAGCTCGTTCCACCTTCAGACCCCTGCCTGCAGATCCT
CTAGCAAGAACATCTACAGCACAAGCTCAACATGGCGAAACTTTGGCCAGCTGGTTGCCTGAATGGAAAATTGAGTTTCCAGCTTCTGTTTTAAAGGGTGTTCTTAAAAG
TGGAAAGAATGATAATATGGCTGTGAATGAGAATAGACGTGATACCTATAATCGTTCCACGTCATGTGGGAATTGGCCATCTGTCTTTGGTGACCTTGATGGAGACTTGA
AGCAACTAATTACTGTGGGTTTGCATGCCGAGCATGGTTATGCAAGAAGCTTAGCTCTCTTTGCGGCCGATCTTGGTCCTGCAGTTTGGAATATTGCTTTAAAGAAAATC
AAAGGTATTAGTCGGGAGCTGGGGCGAGTATTAATTCAAGAAATTGAAATGCTACAGCAGCATCATTTGTTGCCTCTGGATGGAGGCTCATCCGACATGAAAACAGTGGC
AGAAAGCACGGCCAACATTGGTGTTTCAAATAATTTTTTGAAACTCGGCGAAGATGCTGATAATGAAATTGATAGAGTGAGAAATGCTGAAAGCAAGACAGTGTTGCTAG
ACAGAAGTAGAGGTTTAATAGGATCAACAACTTGTATACCGAATGAACAGAACATACTAGTTCCTTCAAACATCCACCCCACAAATGGTAATCTTTTCCCCCATTTTTCA
CAAGAAATGAAAATGGTAAGACTTGATTCTATTCTTGGTGGGACGTCTTGCTCAGAATACTCTACATGCCCAGCTCCTCTCAACAATGCTTCTTTCCAAGTCCCTTCCAG
TTCAGATAATACGGATTTGTTAAACCAAACTGGAATGCCAAAACTTGCAGAAGAAGATGCCTCTCAGTCACACGCTCCGTGGCATTCACCAGCCCGCGCCTGCTTTCAGG
ACACAGTCGATGCACAACAAGACAAGCGTAGTGAAAAGGCACATTGGCAAGAATTATCAACACGTCCTGTGCTAGATTCAATCGCCTTCGATCCTGACCTTAATTTTGGA
TTAGGCTTATCTGCTGCACCCAATTCTAATCTGCAGATTCTTTCTCAAATTCAACCCGATTTAGTATTGCAGCTGTGAGGACATGATATTTGTTATGTATACGATACAAT
TAATATTGCAGTTATGTACAGATAGTAACTTCAGTTTATCTCATTTTCTCGATCTAGACCTTAGTGATTTGTACAATTATTGTCTAGATCCAACTCATTCATCTGTTTAC
GGGTTCAAACCAAAACCAAAATCAATATACAGTGAACTTTTCAAGAACAAATGATGTGTTAATTGTCTCCATTATTGCTTTTTCCCAAA
Protein sequenceShow/hide protein sequence
MKKKKKKGRPSLLDLQKRFLKQQKLQEEQHQEPSDAFDFSSNPRNPPSCRNLNVHSGTEQVPITAAADDDDERIEKKHKPLLGLTSRQNCPTLSGYSLRKLASYGEESET
ALKRRRTGAAQFGSSQVREDKALKATDPAHGSQVESGPTTTLPDKKLLIFILDRLQKKDTHGVFSEPVDPNDLPDYHVIIQNPMDFGTVRTKLDEGAYANLEQFEEDIFL
ICSNAMKYNASDTVFFRQARSIQELAKKDFENLRRESSDESEPEQKVVRRGRPPGKSLKKSLGIGNAIDSNGAEFCSGATLASGCDDSYNVNGYNLRRARSTFRPLPADP
LARTSTAQAQHGETLASWLPEWKIEFPASVLKGVLKSGKNDNMAVNENRRDTYNRSTSCGNWPSVFGDLDGDLKQLITVGLHAEHGYARSLALFAADLGPAVWNIALKKI
KGISRELGRVLIQEIEMLQQHHLLPLDGGSSDMKTVAESTANIGVSNNFLKLGEDADNEIDRVRNAESKTVLLDRSRGLIGSTTCIPNEQNILVPSNIHPTNGNLFPHFS
QEMKMVRLDSILGGTSCSEYSTCPAPLNNASFQVPSSSDNTDLLNQTGMPKLAEEDASQSHAPWHSPARACFQDTVDAQQDKRSEKAHWQELSTRPVLDSIAFDPDLNFG
LGLSAAPNSNLQILSQIQPDLVLQL