; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0005924 (gene) of Chayote v1 genome

Gene IDSed0005924
OrganismSechium edule (Chayote v1)
DescriptionBromo domain-containing protein
Genome locationLG07:5116665..5121695
RNA-Seq ExpressionSed0005924
SyntenySed0005924
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR017930 - Myb domain
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461248.1 PREDICTED: uncharacterized protein LOC103499890 [Cucumis melo]2.9e-27477.37Show/hide
Query:  ESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAE
        ESE+  A WGTWEELLLACAVKRHGFKDWNSV+ME+Q RSSLPHLLTTARNCELKF DLKRRF S QNDA  N N+ GI+D +D+A+PWVDELRKLRVAE
Subjt:  ESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAE

Query:  LRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCG
        LRREV RYDVSINSLQLKVK+LEEER+QG+ND EA  G PDLKTESRERRSENDKK FGEPD RSGPNGTV KPP V GEDSDREDFSVNQSNSTGSK G
Subjt:  LRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCG

Query:  NQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEVQSSASLTG
        N K          KPDF GSYRPE+NRR VEPAGP SDDGSTDTVVK  T D S+TK KKET+RV DSSELADSEAQS+GG T  TRESSEVQSSASLTG
Subjt:  NQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEVQSSASLTG

Query:  ---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNA
           RK +L+ EISGGSSGNEPRR+A +KSRRFDEVL++IRAHKHGSLFESRL SQETE YKGM+RQHLDLE VQ K+NSGSYSSSN+AFYRDLLLL NN 
Subjt:  ---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNA

Query:  ITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDD
        +TFFPKSSKE++ ACELR LVSNEMKKSL  A+ DPLPEVVDS P  PSQSKGPDLEG+Q+L  KQKSSVPI+VCRKRSKIS+  ST+G GEKGDRSNDD
Subjt:  ITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDD

Query:  EKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSNKKQ---TTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLK
        EK A DLKSSIK+ S +VE+  TTKD+KVKEKP TGARSMRRSND+ATNS+GPS+ K+   T+ WKP SSANE   P  DKKKSE VALEKKRSAADFLK
Subjt:  EKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSNKKQ---TTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLK

Query:  RIKQNSPAETTKKNGRGGST-NIGNAAAGEQKKGSGKSDKGKEKATS-VTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKEPLKR
        RIKQNSPAETTK+NGRGGS+  +GNA   EQKKG+ K+DK KE+ +S +  SNDKKRPKED SPSKRSVGRPPKKA EAEP  TP KRAREGGGKEPLKR
Subjt:  RIKQNSPAETTKKNGRGGST-NIGNAAAGEQKKGSGKSDKGKEKATS-VTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKEPLKR

Query:  ARKRAKK
         RK++K+
Subjt:  ARKRAKK

XP_022959831.1 uncharacterized protein LOC111460774 [Cucurbita moschata]1.1e-27377.95Show/hide
Query:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDEL
        MAEDKE ES +S A WGTWEELLLA AV RHGFKDWNSVAME+Q RSSLPH LTTA NCELKFLDLKRRF S QNDA  + N  GI+DNVDSAVPWVDEL
Subjt:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDEL

Query:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN
        RKLRVAELRREV  YDVSINSLQLKVK LEEER QGLNDSEAG G PDLKTE+RERRSENDKKLFGEPD RSG  GTV KP  V GEDSDREDFSVNQSN
Subjt:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN

Query:  STGSKCGNQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKS-DTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEV
        STGSK GN+K          KP+F GS RPE NRRA E AGP SDDGSTDTVVK  T D S  TK+KKETKRV DSSELADSEA+SNGGET ATRESSEV
Subjt:  STGSKCGNQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKS-DTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEV

Query:  QSSASLTG---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRD
        QSSASLTG   R+  L+KEISGGSSGNEPRRT AVKSRRFDEVL+MIR+HKHGSLFESRL SQETE YK MIRQHLDLETVQAK+NSGSY  S++AFYRD
Subjt:  QSSASLTG---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRD

Query:  LLLLCNNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKG-PDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLG
        LLLLCNNA+TFFP SSKES+ A ELR LVS EMKKS+P AR DP P    S P   SQSKG PDLEG+QSL  K+KS VPI+VCRKRSKISSKLS+SGLG
Subjt:  LLLLCNNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKG-PDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLG

Query:  EKGDRSNDDEKSAIDLKSSIKMASTIVEDQGTTKD--NKVKEKPITGARSMRRSNDNATNSTGPSNKKQTT--GWKPGSSANEPGTPNSDKKKSEIVALE
        EK DR+NDDEK A+DLKSS+KMAST+VED GTTKD  +K+KE+ ITGARSMRRSND+ATNS+GPS KKQTT  GWKP +SANE  TP  DKKKSE VALE
Subjt:  EKGDRSNDDEKSAIDLKSSIKMASTIVEDQGTTKD--NKVKEKPITGARSMRRSNDNATNSTGPSNKKQTT--GWKPGSSANEPGTPNSDKKKSEIVALE

Query:  KKRSAADFLKRIKQNSPAETTKKNGRGGSTNIGN----AAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPA--
        KKRSAADFLKRIKQNSPAETTK+NGRGGS+N  N    AAAGEQKKGSGK+  GKEKA+S+  SN+KKR KED SPSKRSVGRPPKKAVEA P   PA  
Subjt:  KKRSAADFLKRIKQNSPAETTKKNGRGGSTNIGN----AAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPA--

Query:  KRAREGGGKEPLKRARKRAKK
        KRAREGGGKEPLKRARKRA++
Subjt:  KRAREGGGKEPLKRARKRAKK

XP_023004478.1 dentin sialophosphoprotein-like [Cucurbita maxima]1.6e-27277.39Show/hide
Query:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDEL
        MAEDKE ES +S A WGTWEELLLA AV RHGFKDWNSVAME+Q RSSLPH LTTA NCELKFLDLKRRF + QNDA  N N  GI+DNVDSAVPWVDEL
Subjt:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDEL

Query:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN
        RKLRVAELRREV  YDVSINSLQLKVK LEEER QGLNDSEAG G PDLKTE+RERRSENDKKLFGEPD RSG  GTV KP  V GEDSDREDFSVNQSN
Subjt:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN

Query:  STGSKCGNQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVD-SSELADSEAQSNGGETAATRESSEVQ
        STGSK GN+K          K +F GS RPE NRRA EP+GP SDDGSTDTVVK  T D S TK+KKETKRVD SSELADSEA+SNGG T ATRESSEVQ
Subjt:  STGSKCGNQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVD-SSELADSEAQSNGGETAATRESSEVQ

Query:  SSASLTG---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDL
        SSASLTG   R+  L+KEISGGSSGNEPRRT AVKSRRFDEVL+MIR+HKHGSLFESRL SQETE YKGMIRQHLDLETVQAK+NSGSY  S++AFYRDL
Subjt:  SSASLTG---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDL

Query:  LLLCNNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYP-SQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGE
        LLLCNNA+TFFP SSKES+ A ELR LVS EM KS+P AR+DP P    S+P  P SQSKGPDLEG+QSL  K+KS VPI+VCRKRSKISSKLS+SGLGE
Subjt:  LLLCNNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYP-SQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGE

Query:  KGDRSNDDEKSAIDLKSSIKMASTIVEDQGTTKD--NKVKEKPITGARSMRRSNDNATNSTGPSNKKQTT--GWKPGSSANEPGTPNSDKKKSEIVALEK
        K DR+NDDEK A+DLKSS+KMAST+VED GT KD  +K+KE+ ITGARSMRRSND+ATNS+GPS KKQTT  GWK  +SANE  T   DKKKSE VALEK
Subjt:  KGDRSNDDEKSAIDLKSSIKMASTIVEDQGTTKD--NKVKEKPITGARSMRRSNDNATNSTGPSNKKQTT--GWKPGSSANEPGTPNSDKKKSEIVALEK

Query:  KRSAADFLKRIKQNSPAETTKKNGRGGSTNIGN-----AAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPA--
        KRSAADFLKRIKQNSPAETTK+NGRGGS+N  N     AAAGEQKKGSGK+  GKEKA+S+  SN+KKR KED SPSKRSVGRPPKKAVEA P   PA  
Subjt:  KRSAADFLKRIKQNSPAETTKKNGRGGSTNIGN-----AAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPA--

Query:  KRAREGGGKEPLKRARKRAKK
        KRAREGGGKEPLKRARKRA++
Subjt:  KRAREGGGKEPLKRARKRAKK

XP_023513859.1 bromodomain-containing protein bet-1 [Cucurbita pepo subsp. pepo]5.3e-27678.41Show/hide
Query:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDEL
        MAEDKE ES +S A WGTWEELLLA AV RHGFKDWNSVAME+Q RSSLPH LTTA NCELKFLDLKRRF S QNDA  N N  GI+DNVDSAVPWVDEL
Subjt:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDEL

Query:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN
        RKLRVAELRREV  YDVSINSLQLKVK LEEER QGLNDSEAG G PDLKTE+RERRSENDKKLFGEPD RSG  GTV KP  V GEDSDREDFSVNQSN
Subjt:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN

Query:  STGSKCGNQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKS-DTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEV
        STGSK GN+K          KP+F GS RPE NRRA E AGP SDDGSTDTVVK  T D S  TK+KKETKRV DSSELADSEA+SNGGET ATRESSEV
Subjt:  STGSKCGNQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKS-DTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEV

Query:  QSSASLTG---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRD
        QSSASLTG   R+  L+KEISGGSSGNEPRRT AVKSRRFDEVL+MIR+HKHGSLFESRL SQETE YKGMIRQHLDLETVQAK+NSGSY SS++AFYRD
Subjt:  QSSASLTG---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRD

Query:  LLLLCNNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYP-SQSKG-PDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGL
        LLLLCNNA+TFFP SSKE + A ELR LVS EMKKS+P AR DP P    S+P  P SQSKG PDLEG+QSL  K+KS VPI+VCRKRSKISSKLS+SGL
Subjt:  LLLLCNNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYP-SQSKG-PDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGL

Query:  GEKGDRSNDDEKSAIDLKSSIKMASTIVEDQGTTKD--NKVKEKPITGARSMRRSNDNATNSTGPSNKKQTT--GWKPGSSANEPGTPNSDKKKSEIVAL
        GEK DR+NDDEK A+DLKSS+KMAST+VED GTTKD  +K+KE+ ITGARSMRRSND+ATNS+GPS KKQTT  GWKP +SANE  TP  D KKSE VAL
Subjt:  GEKGDRSNDDEKSAIDLKSSIKMASTIVEDQGTTKD--NKVKEKPITGARSMRRSNDNATNSTGPSNKKQTT--GWKPGSSANEPGTPNSDKKKSEIVAL

Query:  EKKRSAADFLKRIKQNSPAETTKKNGRGGSTNIGN--AAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRA
        EKKRSAADFLKRIKQNSPAETTK+NGRGGS+N  N  AAAGEQKKGSGK+  GKEKA+S+  SN+KKR KED SPSKRSVGRPPKKAVEA P   P KRA
Subjt:  EKKRSAADFLKRIKQNSPAETTKKNGRGGSTNIGN--AAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRA

Query:  REGGGKEPLKRARKRAKK
        REGGGKEPLKRARKRA++
Subjt:  REGGGKEPLKRARKRAKK

XP_038897226.1 uncharacterized protein LOC120085356 [Benincasa hispida]4.0e-27979.29Show/hide
Query:  ESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAE
        +SE+  A WGTWEELLLACAVKRHGFKDWNSVAME+Q RSSLPHLLTTARNCELKF DLKRRF S +NDA  ++N  GI+D VDSAVPWVDELRKLRVAE
Subjt:  ESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAE

Query:  LRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCG
        LRREV RYDVSINSLQLKVK+LEEER+QG+ND EA  G PDLKTESRERRSENDK LFGEPD RSGPNGTVAKP  V GEDSDREDFSVNQSNSTGSK G
Subjt:  LRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCG

Query:  NQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEVQSSASLTG
        N+K          KPDFAGSYRPE+NRRA EPAGP SDDGSTDTVVK  T D S+TK KKET+RV DSSELADSEAQSNGG T  TRESSEVQSSASLTG
Subjt:  NQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEVQSSASLTG

Query:  ---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNA
           RK +L+KEISGGSSGNEPRRTAAVKS+RFDEVL+ IRAHKHGSLFESRL SQETE YK MIRQHLDLE VQ K+NSGSYSSS+ AFYRDLLLL NN 
Subjt:  ---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNA

Query:  ITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDD
        +TFFPKSSKE + AC+LR L+SNEMKKSL  ARIDP PEVVDS+P  PS+SKGPDLEG+QSL  KQKSSVPIIVCRKRSKISSK S++GLGEKG+RSNDD
Subjt:  ITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDD

Query:  EKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSN-KKQTTG--WKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLK
        EK A+DLKSSIK+ S +VED+ TTKD+KVKEKPITGARSMRRSND+ATNS+GPS+ KKQ T   WKP SSANE  TP  DKKKSE VALEKKRSAADFLK
Subjt:  EKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSN-KKQTTG--WKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLK

Query:  RIKQNSPAETTKKNGRGGSTNIGNAAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKEPLKRAR
        RIKQNSPAET K+NGRGGS+++G+ A  EQKKGSGKSDKGKEK +++  SND KRPKED SPSKRSVGRPPKKA EA+P  TP KRAREGGGKEPLKR R
Subjt:  RIKQNSPAETTKKNGRGGSTNIGNAAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKEPLKRAR

Query:  KRAKK
        K+AK+
Subjt:  KRAKK

TrEMBL top hitse value%identityAlignment
A0A0A0K641 Bromo domain-containing protein3.0e-27276.49Show/hide
Query:  ESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAE
        +SE+  A WGT EELLLACAVKRHGFKDWNSV+MELQ RSSLP LLTTARNCELKF DLKRRF S QNDA  N N+ GI+D +D+A+PWVDELRKLRVAE
Subjt:  ESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAE

Query:  LRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCG
        LRREV RYDVSINSLQLKVK+LEEER+QG+ND EA  G PDLKTESRERRSENDKK FGEPD RSGPNGTV KPP V GEDSDRE+FSVNQSNSTGSK G
Subjt:  LRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCG

Query:  NQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEVQSSASLTG
        N+K          KPDFAGSYRPE+NR   EPAGP SDDGSTDTVVK  T D S+TK KKET+RV DSSELADSEAQS+GG T  TRESSEVQSSASLTG
Subjt:  NQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEVQSSASLTG

Query:  R---KIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNA
        R   K +L+KEISGGSSGNEPRR+  +KSRRFDEVL++IRAHKHGSLFESRL SQETE YKGM+RQHLDLE VQ+K+ SGSYSSSN+AFYRDLLLL NN 
Subjt:  R---KIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNA

Query:  ITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDD
        +TFFPKSSKE++ ACELR L+SNEMKKSL  A+ DPLPEVVDS+P  PS+SKGPDLEG+QSL  KQKSSVPI+VCRKRSKIS+  ST+G+GEKG+RSNDD
Subjt:  ITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDD

Query:  EKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSNKKQ---TTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLK
        EK A DLKSSIK AS +VED+ TTKD+KVKEKP TGARSMRRSND+ATNS+GPS+ K+   T+ WKP SSANE   P  DKKKSE VALEKKRSAADFLK
Subjt:  EKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSNKKQ---TTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLK

Query:  RIKQNSPAETTKKNGRGGSTNIGNAAAGEQKKGSGKSDKGKEK-ATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKEPLKRA
        RIKQNSPAETTK+NGRGGS+   + A  EQKKGS K++KGKE+ +T++  SND+KRPKED SPSKRSVGRPPKKA EAEP  TP KRAREGGGKEPLKR 
Subjt:  RIKQNSPAETTKKNGRGGSTNIGNAAAGEQKKGSGKSDKGKEK-ATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKEPLKRA

Query:  RKRAKK
        RK++K+
Subjt:  RKRAKK

A0A1S3CFH2 uncharacterized protein LOC1034998901.4e-27477.37Show/hide
Query:  ESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAE
        ESE+  A WGTWEELLLACAVKRHGFKDWNSV+ME+Q RSSLPHLLTTARNCELKF DLKRRF S QNDA  N N+ GI+D +D+A+PWVDELRKLRVAE
Subjt:  ESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAE

Query:  LRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCG
        LRREV RYDVSINSLQLKVK+LEEER+QG+ND EA  G PDLKTESRERRSENDKK FGEPD RSGPNGTV KPP V GEDSDREDFSVNQSNSTGSK G
Subjt:  LRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCG

Query:  NQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEVQSSASLTG
        N K          KPDF GSYRPE+NRR VEPAGP SDDGSTDTVVK  T D S+TK KKET+RV DSSELADSEAQS+GG T  TRESSEVQSSASLTG
Subjt:  NQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEVQSSASLTG

Query:  ---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNA
           RK +L+ EISGGSSGNEPRR+A +KSRRFDEVL++IRAHKHGSLFESRL SQETE YKGM+RQHLDLE VQ K+NSGSYSSSN+AFYRDLLLL NN 
Subjt:  ---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNA

Query:  ITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDD
        +TFFPKSSKE++ ACELR LVSNEMKKSL  A+ DPLPEVVDS P  PSQSKGPDLEG+Q+L  KQKSSVPI+VCRKRSKIS+  ST+G GEKGDRSNDD
Subjt:  ITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDD

Query:  EKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSNKKQ---TTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLK
        EK A DLKSSIK+ S +VE+  TTKD+KVKEKP TGARSMRRSND+ATNS+GPS+ K+   T+ WKP SSANE   P  DKKKSE VALEKKRSAADFLK
Subjt:  EKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSNKKQ---TTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLK

Query:  RIKQNSPAETTKKNGRGGST-NIGNAAAGEQKKGSGKSDKGKEKATS-VTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKEPLKR
        RIKQNSPAETTK+NGRGGS+  +GNA   EQKKG+ K+DK KE+ +S +  SNDKKRPKED SPSKRSVGRPPKKA EAEP  TP KRAREGGGKEPLKR
Subjt:  RIKQNSPAETTKKNGRGGST-NIGNAAAGEQKKGSGKSDKGKEKATS-VTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKEPLKR

Query:  ARKRAKK
         RK++K+
Subjt:  ARKRAKK

A0A5A7UYV1 Histone H3.v11.4e-27477.37Show/hide
Query:  ESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAE
        ESE+  A WGTWEELLLACAVKRHGFKDWNSV+ME+Q RSSLPHLLTTARNCELKF DLKRRF S QNDA  N N+ GI+D +D+A+PWVDELRKLRVAE
Subjt:  ESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAE

Query:  LRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCG
        LRREV RYDVSINSLQLKVK+LEEER+QG+ND EA  G PDLKTESRERRSENDKK FGEPD RSGPNGTV KPP V GEDSDREDFSVNQSNSTGSK G
Subjt:  LRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCG

Query:  NQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEVQSSASLTG
        N K          KPDF GSYRPE+NRR VEPAGP SDDGSTDTVVK  T D S+TK KKET+RV DSSELADSEAQS+GG T  TRESSEVQSSASLTG
Subjt:  NQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEVQSSASLTG

Query:  ---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNA
           RK +L+ EISGGSSGNEPRR+A +KSRRFDEVL++IRAHKHGSLFESRL SQETE YKGM+RQHLDLE VQ K+NSGSYSSSN+AFYRDLLLL NN 
Subjt:  ---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNA

Query:  ITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDD
        +TFFPKSSKE++ ACELR LVSNEMKKSL  A+ DPLPEVVDS P  PSQSKGPDLEG+Q+L  KQKSSVPI+VCRKRSKIS+  ST+G GEKGDRSNDD
Subjt:  ITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDD

Query:  EKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSNKKQ---TTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLK
        EK A DLKSSIK+ S +VE+  TTKD+KVKEKP TGARSMRRSND+ATNS+GPS+ K+   T+ WKP SSANE   P  DKKKSE VALEKKRSAADFLK
Subjt:  EKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSNKKQ---TTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLK

Query:  RIKQNSPAETTKKNGRGGST-NIGNAAAGEQKKGSGKSDKGKEKATS-VTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKEPLKR
        RIKQNSPAETTK+NGRGGS+  +GNA   EQKKG+ K+DK KE+ +S +  SNDKKRPKED SPSKRSVGRPPKKA EAEP  TP KRAREGGGKEPLKR
Subjt:  RIKQNSPAETTKKNGRGGST-NIGNAAAGEQKKGSGKSDKGKEKATS-VTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKEPLKR

Query:  ARKRAKK
         RK++K+
Subjt:  ARKRAKK

A0A6J1H5Z4 uncharacterized protein LOC1114607745.4e-27477.95Show/hide
Query:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDEL
        MAEDKE ES +S A WGTWEELLLA AV RHGFKDWNSVAME+Q RSSLPH LTTA NCELKFLDLKRRF S QNDA  + N  GI+DNVDSAVPWVDEL
Subjt:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDEL

Query:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN
        RKLRVAELRREV  YDVSINSLQLKVK LEEER QGLNDSEAG G PDLKTE+RERRSENDKKLFGEPD RSG  GTV KP  V GEDSDREDFSVNQSN
Subjt:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN

Query:  STGSKCGNQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKS-DTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEV
        STGSK GN+K          KP+F GS RPE NRRA E AGP SDDGSTDTVVK  T D S  TK+KKETKRV DSSELADSEA+SNGGET ATRESSEV
Subjt:  STGSKCGNQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKS-DTKRKKETKRV-DSSELADSEAQSNGGETAATRESSEV

Query:  QSSASLTG---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRD
        QSSASLTG   R+  L+KEISGGSSGNEPRRT AVKSRRFDEVL+MIR+HKHGSLFESRL SQETE YK MIRQHLDLETVQAK+NSGSY  S++AFYRD
Subjt:  QSSASLTG---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRD

Query:  LLLLCNNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKG-PDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLG
        LLLLCNNA+TFFP SSKES+ A ELR LVS EMKKS+P AR DP P    S P   SQSKG PDLEG+QSL  K+KS VPI+VCRKRSKISSKLS+SGLG
Subjt:  LLLLCNNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKG-PDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLG

Query:  EKGDRSNDDEKSAIDLKSSIKMASTIVEDQGTTKD--NKVKEKPITGARSMRRSNDNATNSTGPSNKKQTT--GWKPGSSANEPGTPNSDKKKSEIVALE
        EK DR+NDDEK A+DLKSS+KMAST+VED GTTKD  +K+KE+ ITGARSMRRSND+ATNS+GPS KKQTT  GWKP +SANE  TP  DKKKSE VALE
Subjt:  EKGDRSNDDEKSAIDLKSSIKMASTIVEDQGTTKD--NKVKEKPITGARSMRRSNDNATNSTGPSNKKQTT--GWKPGSSANEPGTPNSDKKKSEIVALE

Query:  KKRSAADFLKRIKQNSPAETTKKNGRGGSTNIGN----AAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPA--
        KKRSAADFLKRIKQNSPAETTK+NGRGGS+N  N    AAAGEQKKGSGK+  GKEKA+S+  SN+KKR KED SPSKRSVGRPPKKAVEA P   PA  
Subjt:  KKRSAADFLKRIKQNSPAETTKKNGRGGSTNIGN----AAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPA--

Query:  KRAREGGGKEPLKRARKRAKK
        KRAREGGGKEPLKRARKRA++
Subjt:  KRAREGGGKEPLKRARKRAKK

A0A6J1KZN3 dentin sialophosphoprotein-like7.8e-27377.39Show/hide
Query:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDEL
        MAEDKE ES +S A WGTWEELLLA AV RHGFKDWNSVAME+Q RSSLPH LTTA NCELKFLDLKRRF + QNDA  N N  GI+DNVDSAVPWVDEL
Subjt:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDEL

Query:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN
        RKLRVAELRREV  YDVSINSLQLKVK LEEER QGLNDSEAG G PDLKTE+RERRSENDKKLFGEPD RSG  GTV KP  V GEDSDREDFSVNQSN
Subjt:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN

Query:  STGSKCGNQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVD-SSELADSEAQSNGGETAATRESSEVQ
        STGSK GN+K          K +F GS RPE NRRA EP+GP SDDGSTDTVVK  T D S TK+KKETKRVD SSELADSEA+SNGG T ATRESSEVQ
Subjt:  STGSKCGNQKG--------AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVD-SSELADSEAQSNGGETAATRESSEVQ

Query:  SSASLTG---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDL
        SSASLTG   R+  L+KEISGGSSGNEPRRT AVKSRRFDEVL+MIR+HKHGSLFESRL SQETE YKGMIRQHLDLETVQAK+NSGSY  S++AFYRDL
Subjt:  SSASLTG---RKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDL

Query:  LLLCNNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYP-SQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGE
        LLLCNNA+TFFP SSKES+ A ELR LVS EM KS+P AR+DP P    S+P  P SQSKGPDLEG+QSL  K+KS VPI+VCRKRSKISSKLS+SGLGE
Subjt:  LLLCNNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYP-SQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGE

Query:  KGDRSNDDEKSAIDLKSSIKMASTIVEDQGTTKD--NKVKEKPITGARSMRRSNDNATNSTGPSNKKQTT--GWKPGSSANEPGTPNSDKKKSEIVALEK
        K DR+NDDEK A+DLKSS+KMAST+VED GT KD  +K+KE+ ITGARSMRRSND+ATNS+GPS KKQTT  GWK  +SANE  T   DKKKSE VALEK
Subjt:  KGDRSNDDEKSAIDLKSSIKMASTIVEDQGTTKD--NKVKEKPITGARSMRRSNDNATNSTGPSNKKQTT--GWKPGSSANEPGTPNSDKKKSEIVALEK

Query:  KRSAADFLKRIKQNSPAETTKKNGRGGSTNIGN-----AAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPA--
        KRSAADFLKRIKQNSPAETTK+NGRGGS+N  N     AAAGEQKKGSGK+  GKEKA+S+  SN+KKR KED SPSKRSVGRPPKKAVEA P   PA  
Subjt:  KRSAADFLKRIKQNSPAETTKKNGRGGSTNIGN-----AAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPA--

Query:  KRAREGGGKEPLKRARKRAKK
        KRAREGGGKEPLKRARKRA++
Subjt:  KRAREGGGKEPLKRARKRAKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G61215.1 bromodomain 42.1e-2025.42Show/hide
Query:  WGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAELRREVHRY
        WGTWEELLL  AV RHG  DW  VA EL++  SLP +  T   C+ K+ DL++R++                        W +EL+K RVAEL+  + + 
Subjt:  WGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAELRREVHRY

Query:  DVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCGN-QKGAKP
        + SI SL+ K++ L+ E     ND            E  +   ++ + L  EP  +S   G      +   +D+ ++  SV           N    AK 
Subjt:  DVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCGN-QKGAKP

Query:  DFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVDSSELADSEAQSNGGETAATRESSEVQSSASLTGRKIVLKKEISGGSSGN
        +       EK +  +      S  G    V+       S  K++ + KR D S     E      E +A  ES    +SA +     + + + +  +S +
Subjt:  DFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVDSSELADSEAQSNGGETAATRESSEVQSSASLTGRKIVLKKEISGGSSGN

Query:  EPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNAITFFPKSSKESIGACELRY
        + R  +    +   ++   I  ++   +F  RL SQ+   YK ++R+H+DL+TVQ+++N  S SS+    +RD LL+ NNA  F+ K+++E   A  LR 
Subjt:  EPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNAITFFPKSSKESIGACELRY

Query:  LVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPII----VCRKRSKISSKLSTSGLGEKGDRSNDDEKSAIDLK-SSIKMA
        +V+  ++  L            D  P   S + G     T+ +   QKS+ P +      +K    +  L T         S  +++S  DL  S++K +
Subjt:  LVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPII----VCRKRSKISSKLSTSGLGEKGDRSNDDEKSAIDLK-SSIKMA

Query:  STIVEDQGTTKDNKVKEKPITGARSMRRSND
        +   +     KD +   + +     M R  +
Subjt:  STIVEDQGTTKDNKVKEKPITGARSMRRSND

AT2G42150.1 DNA-binding bromodomain-containing protein2.2e-4631.72Show/hide
Query:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLT-TARNCELKFLDLKRRFIS----PQNDADSNRNSVGISDNVDSAVP
        MA+ +   +      W TWEELLLACAV RHG + WNSV+ E+Q  S  P+L + TA  C  K+ DLK RF      P++ A+             S  P
Subjt:  MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLT-TARNCELKFLDLKRRFIS----PQNDADSNRNSVGISDNVDSAVP

Query:  WVDELRKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVA-KPPDVAGEDSDREDF
        W++ELRKLRV ELRREV +YD+SI++LQ KVK+LEEER+      +    N DL  E ++ RS++ + +   P +    N T++  P ++  E+++RE+ 
Subjt:  WVDELRKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVA-KPPDVAGEDSDREDF

Query:  SVNQSNSTGSKCGNQKGAKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVDSSELADSEAQSNGGETAATRESSEVQSSA
                GS  G  K A  D                    S  GS ++V      +K  T   +  + V  +EL +SE  ++ GE      +S+VQSSA
Subjt:  SVNQSNSTGSKCGNQKGAKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVDSSELADSEAQSNGGETAATRESSEVQSSA

Query:  SL--TGRKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLC
        SL   G     K++ S  S+     +   V+S+     +E++ +H  GS F  RL  QET  Y  +IR+H+D E ++ +V  G Y S  I F+RDLLLL 
Subjt:  SL--TGRKIVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLC

Query:  NNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVP-IIVCRKRSKISSK---LSTSGLGEK
        NNA  F+ + S E   A +L  LV  +M  +L            D     P + +   +  ++ + +K + SVP I+ CRKRS +++K   L   G  +K
Subjt:  NNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVP-IIVCRKRSKISSK---LSTSGLGEK

Query:  GDRSND----DEKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGP----SNKKQTTGWKPGSSANEPGTPNSDKKKSEIVA
          +++     DEK   D            E  G   D+ +  K +T  R         T+STG     ++K + +      S ++    + +KK      
Subjt:  GDRSND----DEKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGP----SNKKQTTGWKPGSSANEPGTPNSDKKKSEIVA

Query:  LEKKRSAADFLKRIKQNSPAETTKKNGRGGSTNIGNAAAGEQKK-GSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRS-VGRPPKKAVEAEPTSTP--A
          KK+ AA FL+R+K  S  +T K++    S+  G     EQ+K  S K+D  K     +  +N K       SP KRS  GR  ++      +S P  A
Subjt:  LEKKRSAADFLKRIKQNSPAETTKKNGRGGSTNIGNAAAGEQKK-GSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRS-VGRPPKKAVEAEPTSTP--A

Query:  KRAREGGGKEPLKRARKRAKKR
        KR+R+ G KE       R KKR
Subjt:  KRAREGGGKEPLKRARKRAKKR

AT2G44430.1 DNA-binding bromodomain-containing protein2.4e-8538.35Show/hide
Query:  EDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQND--ADSNRNSVGISDNVDSAVPWVDEL
        ED     +     WGTWEELLLACAVKRHGF DW+SVA E+++RSSL HLL +A +C  K+ DLKRRF   +      +        + V + +PW+++L
Subjt:  EDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQND--ADSNRNSVGISDNVDSAVPWVDEL

Query:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN
        R LRVAELRREV RYD SI SLQLKVK+LEEER       E G   PDL+ E +E RSEND           G      +    A E+SDRE+ S+N+SN
Subjt:  RKLRVAELRREVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSN

Query:  STGSKCGNQKGAKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVDSSELADSEAQSNGGETAATRESSEVQSSASLTGRK
        ST +  G ++    D              EP+    DD   D        +K  T  ++E   V           S G E + + E  E  +S S   R 
Subjt:  STGSKCGNQKGAKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVDSSELADSEAQSNGGETAATRESSEVQSSASLTGRK

Query:  IVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNAITFFP
           K+   GG+      R+A  KS+    +L++IR+H  GSLFE RL SQE + YK M++QHLD+ET+Q K+  GSY SS++ FYRDL LL  NAI FFP
Subjt:  IVLKKEISGGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNAITFFP

Query:  KSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDDEKSAI
         SS ES+ A ELR +VS EM+K    A     P ++    A   +S   D E + S  ++QKSS P++VC+KR  +S+K S S          DD K   
Subjt:  KSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDDEKSAI

Query:  DLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSND------NATNSTGPSNKKQTTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLKRI
                  T+ E+         K+   TG RS RR+N       N T +    NK++ T  K  SS +     ++ K + + V+ +KK+S ADFLKR+
Subjt:  DLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSND------NATNSTGPSNKKQTTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLKRI

Query:  KQNSPAETTKKNGRGGSTNIGNAAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGG-----GKEPLK
        K+NSP +  K   + G         G  KK S    K +E  +S   S  KK+ + + +P KR+ GRP KK  EA   S   KR R+ G      K+P K
Subjt:  KQNSPAETTKKNGRGGSTNIGNAAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGG-----GKEPLK

Query:  RARK
        R RK
Subjt:  RARK

AT3G57980.1 DNA-binding bromodomain-containing protein8.2e-4932.34Show/hide
Query:  EELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRF----ISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAELRREVHRY
        EELLLACAV RHG   W+SVA E+  ++S    L TA +C  K+ DLKRRF    +SP   AD    +  I     S+VPW++ELRKLRV ELRREV RY
Subjt:  EELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRF----ISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAELRREVHRY

Query:  DVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDK-------KLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCGN
        D+SI+SLQLKVK LE+ER++ L   +  N + D   E++E  +E+         +L   PD      GT ++  + A + ++  D    + N  G +  +
Subjt:  DVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDK-------KLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCGN

Query:  QKGAKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVDSSELADSEAQSNGGETAATRESSEVQSSASLTGRKIVLKKEIS
        +K A+ D                    S  GS ++V K+  SD+++ KR+      DS EL +S  +S G E   T+E+S+ QSSAS   RK  + ++  
Subjt:  QKGAKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVDSSELADSEAQSNGGETAATRESSEVQSSASLTGRKIVLKKEIS

Query:  GGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNAITFFPKSSKESIG
             +       V+S+   + +E++++H  GS F  RL +QET  Y  +IRQH+D E ++++V  G Y ++   F+RDLLLL NN   F+ + S E   
Subjt:  GGSSGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNAITFFPKSSKESIG

Query:  ACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDDEKSAIDLKSSIKM
        A +L  L+  +M   +P   + P  E  D+      + K   L+ T         SVPII CRKRS ++ +   S          +  K    +  ++  
Subjt:  ACELRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDDEKSAIDLKSSIKM

Query:  ASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSNKKQTTGWKPGSSANEPGTPN---------SDKKKSEIVALEKKRSAADFLKRIKQNSP
             E++G   D    EKPI   +  R +  +     G  N K  T    G S N   +PN           KK        KK+SAA FLKR+K  S 
Subjt:  ASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSNKKQTTGWKPGSSANEPGTPN---------SDKKKSEIVALEKKRSAADFLKRIKQNSP

Query:  AETTKKNGRGGSTNIGNAAAGEQKKGSGKSDK--------GKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKE--PLK
        +ET  +  +  S+N       EQ+K + KS+K        G+++ T   P+ +K       SP+K++ G   K+       S  AKR  E   KE     
Subjt:  AETTKKNGRGGSTNIGNAAAGEQKKGSGKSDK--------GKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGGGKE--PLK

Query:  RARKRAKK
        R +KR+K+
Subjt:  RARKRAKK

AT3G60110.1 DNA-binding bromodomain-containing protein6.5e-7035.49Show/hide
Query:  VWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAELRREVHR
        VWGTWEEL+L CAVKRH F DW+SVA E+Q RS    L+ +A NC LK+ DLKRRF    +  D N  +    ++    + W+++LR L +AELRREV R
Subjt:  VWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAELRREVHR

Query:  YDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGE--DSDRED-FSVNQSNSTGSKCGNQKG
         D SI SLQLKVK+LEEE+     D + G+  PDLK         ND+                 KP  V  E  +SDR+D  S+N+SNST S       
Subjt:  YDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGE--DSDRED-FSVNQSNSTGSKCGNQKG

Query:  AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVDSSELADSEAQSNGGETAATRESSEVQSSASLTGRKIVLKKEISGGS
         + D     +  +N R  +P          D V K  T ++ +    K ++  +S EL +S                    +++  G++   K    GG 
Subjt:  AKPDFAGSYRPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVDSSELADSEAQSNGGETAATRESSEVQSSASLTGRKIVLKKEISGGS

Query:  SGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNAITFFPKSSKESIGACE
         G    ++A  KS+   +++++IR+H  GS+FESRL SQ+T+ YK +IRQHLD++T++ K+  GSY SS+++FYRDL LL  NAI FFP SS ESI A E
Subjt:  SGNEPRRTAAVKSRRFDEVLEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNAITFFPKSSKESIGACE

Query:  LRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSV-PIIVCRKRSKISSKLSTSGLGEKGDRSNDDEKSAIDLKSSIKMAS
        LR LVSNEMKK     R   L   V  + A             +S  ++QKSSV  ++ C+K+S    K S S       R  D++KS            
Subjt:  LRYLVSNEMKKSLPFARIDPLPEVVDSAPAYPSQSKGPDLEGTQSLFTKQKSSV-PIIVCRKRSKISSKLSTSGLGEKGDRSNDDEKSAIDLKSSIKMAS

Query:  TIVEDQGTTKDNKVKEKPITGARSMRRSN-------DNATNSTGPSNKKQTTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLKRIKQNSP---A
             Q  +++  V     T ARS RR++        +       +N K+ T  K  SS ++      + K  +    +KK+S ADFLKRIK+NSP    
Subjt:  TIVEDQGTTKDNKVKEKPITGARSMRRSN-------DNATNSTGPSNKKQTTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLKRIKQNSP---A

Query:  ETTKKNGRGGSTNIGNAAAGEQKK-GSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGG--GKEPLKRARK
        ETT KN +    N+      ++K  G+ K +  K K   +  S  KK+ + + + SK S  R  K+  E    +T  KR RE G   K+P KR+R+
Subjt:  ETTKKNGRGGSTNIGNAAAGEQKK-GSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVGRPPKKAVEAEPTSTPAKRAREGG--GKEPLKRARK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAGATAAAGAGCCGGAGAGTGAACGGAGCGCCGCCGTGTGGGGGACGTGGGAGGAGCTTTTACTGGCCTGTGCCGTTAAGCGCCATGGATTCAAGGACTGGAA
TTCCGTCGCCATGGAGCTTCAGACTCGTTCCTCGCTTCCTCATCTCCTAACCACCGCCCGTAACTGCGAACTCAAATTCCTTGACCTTAAACGACGGTTCATTTCTCCAC
AAAACGACGCCGATTCGAACCGGAATAGTGTTGGAATTTCCGATAACGTCGACAGCGCCGTACCTTGGGTAGATGAGTTGAGGAAACTCCGCGTTGCCGAACTCCGACGA
GAGGTTCACCGCTATGACGTTTCAATCAACTCGTTGCAGTTGAAGGTCAAGGAATTGGAGGAGGAGCGAAAACAAGGCCTGAACGACAGCGAGGCCGGCAATGGAAATCC
AGATCTGAAAACAGAGTCCAGAGAACGACGATCAGAAAACGACAAAAAACTCTTCGGAGAACCAGACCGCCGGTCCGGACCGAACGGAACAGTCGCGAAGCCACCAGACG
TTGCCGGTGAAGACTCCGACCGGGAGGATTTCTCCGTCAACCAGTCCAATTCCACCGGCTCGAAGTGCGGCAACCAAAAAGGCGCTAAACCGGATTTTGCCGGTTCGTAC
AGACCGGAAAAGAACCGGAGAGCCGTCGAGCCAGCCGGACCGCTGTCGGATGACGGCAGTACGGACACGGTGGTCAAGAAATCTACGAGCGACAAATCTGACACGAAAAG
GAAAAAGGAAACAAAGCGAGTTGACTCGTCTGAGTTAGCGGACTCGGAGGCTCAGTCAAACGGCGGAGAGACGGCGGCGACGAGGGAGAGCAGCGAGGTACAGAGCTCGG
CAAGTTTGACGGGGAGGAAGATAGTCCTCAAAAAGGAGATCTCCGGCGGCAGCAGCGGCAATGAACCCCGCCGCACCGCCGCCGTCAAGTCACGGCGGTTCGACGAGGTT
TTGGAGATGATACGCGCGCATAAACATGGCTCCTTGTTCGAATCTCGTCTTCTAAGTCAGGAGACCGAAGTTTACAAGGGCATGATTCGACAACATTTGGACCTAGAAAC
AGTTCAAGCCAAGGTTAATTCTGGCTCTTATTCGTCCTCCAACATTGCATTTTATCGCGATCTATTGCTTCTTTGCAACAATGCAATAACCTTCTTCCCAAAATCATCCA
AGGAGTCGATAGGTGCTTGTGAGCTCCGCTATCTTGTCTCCAATGAAATGAAGAAGAGTCTTCCATTTGCTCGAATTGACCCTTTACCTGAAGTAGTGGATTCAGCCCCT
GCATACCCTTCTCAATCCAAGGGTCCTGATCTTGAAGGTACCCAATCCTTGTTTACAAAGCAGAAGTCCTCTGTCCCTATAATAGTTTGTCGAAAACGAAGCAAAATTTC
ATCAAAGCTTTCGACGAGTGGTCTTGGCGAAAAAGGTGATCGAAGCAATGATGATGAGAAGTCGGCCATTGATCTAAAATCAAGTATCAAAATGGCTTCCACTATTGTGG
AAGACCAGGGTACTACAAAAGACAACAAGGTTAAAGAGAAACCTATAACTGGAGCTAGAAGCATGCGAAGAAGCAACGATAATGCCACAAACTCTACCGGTCCTAGCAAC
AAGAAACAGACTACTGGATGGAAACCTGGTTCTTCAGCAAACGAACCAGGGACACCAAATTCAGACAAGAAGAAATCCGAAATAGTTGCGCTAGAGAAGAAGCGCAGTGC
AGCAGATTTTTTGAAGAGGATCAAGCAGAACTCACCTGCAGAGACTACAAAGAAAAATGGCAGAGGTGGAAGTACTAATATTGGTAATGCAGCTGCGGGAGAACAAAAGA
AAGGGAGTGGCAAAAGTGATAAAGGAAAGGAAAAAGCGACATCGGTAACACCAAGCAATGACAAGAAGCGCCCTAAGGAGGACACCAGTCCTTCCAAGCGCAGCGTCGGG
CGACCGCCAAAGAAGGCAGTGGAGGCCGAACCAACTTCTACTCCAGCAAAGAGAGCCAGGGAAGGTGGTGGAAAAGAACCCCTAAAGCGGGCTAGAAAAAGGGCTAAGAA
AAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATTTACCAAACGAAAACGGGAGATTTTTCATTTTCTCTCTCAATTTTTCTTAAAAAAAAAAAAGAAAAAAAAACTCAATTCATTTAAATATTTTTAAACGAGTTTCGAAC
TTGCGTCTCTTCCATTTTTGTTTCTCTTTTACGCCGCAGAGAAGAAGAACAAAACTCAGAAACCCTAATCATAAATCTCTCTAATGGATGAACTTTCCCTTTCTGTTTAA
ATCTTGTCGTGATTTTTTCCAAGGAATTTTGTAAATGAAAAGAGAATTGGATCCTATATGCTGTTTGTTGTTGTTGTTGTAATCTAATGGCGGAAGATAAAGAGCCGGAG
AGTGAACGGAGCGCCGCCGTGTGGGGGACGTGGGAGGAGCTTTTACTGGCCTGTGCCGTTAAGCGCCATGGATTCAAGGACTGGAATTCCGTCGCCATGGAGCTTCAGAC
TCGTTCCTCGCTTCCTCATCTCCTAACCACCGCCCGTAACTGCGAACTCAAATTCCTTGACCTTAAACGACGGTTCATTTCTCCACAAAACGACGCCGATTCGAACCGGA
ATAGTGTTGGAATTTCCGATAACGTCGACAGCGCCGTACCTTGGGTAGATGAGTTGAGGAAACTCCGCGTTGCCGAACTCCGACGAGAGGTTCACCGCTATGACGTTTCA
ATCAACTCGTTGCAGTTGAAGGTCAAGGAATTGGAGGAGGAGCGAAAACAAGGCCTGAACGACAGCGAGGCCGGCAATGGAAATCCAGATCTGAAAACAGAGTCCAGAGA
ACGACGATCAGAAAACGACAAAAAACTCTTCGGAGAACCAGACCGCCGGTCCGGACCGAACGGAACAGTCGCGAAGCCACCAGACGTTGCCGGTGAAGACTCCGACCGGG
AGGATTTCTCCGTCAACCAGTCCAATTCCACCGGCTCGAAGTGCGGCAACCAAAAAGGCGCTAAACCGGATTTTGCCGGTTCGTACAGACCGGAAAAGAACCGGAGAGCC
GTCGAGCCAGCCGGACCGCTGTCGGATGACGGCAGTACGGACACGGTGGTCAAGAAATCTACGAGCGACAAATCTGACACGAAAAGGAAAAAGGAAACAAAGCGAGTTGA
CTCGTCTGAGTTAGCGGACTCGGAGGCTCAGTCAAACGGCGGAGAGACGGCGGCGACGAGGGAGAGCAGCGAGGTACAGAGCTCGGCAAGTTTGACGGGGAGGAAGATAG
TCCTCAAAAAGGAGATCTCCGGCGGCAGCAGCGGCAATGAACCCCGCCGCACCGCCGCCGTCAAGTCACGGCGGTTCGACGAGGTTTTGGAGATGATACGCGCGCATAAA
CATGGCTCCTTGTTCGAATCTCGTCTTCTAAGTCAGGAGACCGAAGTTTACAAGGGCATGATTCGACAACATTTGGACCTAGAAACAGTTCAAGCCAAGGTTAATTCTGG
CTCTTATTCGTCCTCCAACATTGCATTTTATCGCGATCTATTGCTTCTTTGCAACAATGCAATAACCTTCTTCCCAAAATCATCCAAGGAGTCGATAGGTGCTTGTGAGC
TCCGCTATCTTGTCTCCAATGAAATGAAGAAGAGTCTTCCATTTGCTCGAATTGACCCTTTACCTGAAGTAGTGGATTCAGCCCCTGCATACCCTTCTCAATCCAAGGGT
CCTGATCTTGAAGGTACCCAATCCTTGTTTACAAAGCAGAAGTCCTCTGTCCCTATAATAGTTTGTCGAAAACGAAGCAAAATTTCATCAAAGCTTTCGACGAGTGGTCT
TGGCGAAAAAGGTGATCGAAGCAATGATGATGAGAAGTCGGCCATTGATCTAAAATCAAGTATCAAAATGGCTTCCACTATTGTGGAAGACCAGGGTACTACAAAAGACA
ACAAGGTTAAAGAGAAACCTATAACTGGAGCTAGAAGCATGCGAAGAAGCAACGATAATGCCACAAACTCTACCGGTCCTAGCAACAAGAAACAGACTACTGGATGGAAA
CCTGGTTCTTCAGCAAACGAACCAGGGACACCAAATTCAGACAAGAAGAAATCCGAAATAGTTGCGCTAGAGAAGAAGCGCAGTGCAGCAGATTTTTTGAAGAGGATCAA
GCAGAACTCACCTGCAGAGACTACAAAGAAAAATGGCAGAGGTGGAAGTACTAATATTGGTAATGCAGCTGCGGGAGAACAAAAGAAAGGGAGTGGCAAAAGTGATAAAG
GAAAGGAAAAAGCGACATCGGTAACACCAAGCAATGACAAGAAGCGCCCTAAGGAGGACACCAGTCCTTCCAAGCGCAGCGTCGGGCGACCGCCAAAGAAGGCAGTGGAG
GCCGAACCAACTTCTACTCCAGCAAAGAGAGCCAGGGAAGGTGGTGGAAAAGAACCCCTAAAGCGGGCTAGAAAAAGGGCTAAGAAAAGGTGACCATCTTAGTTGCTGCA
GTTTATTGTTTGTATATTAATAGACCTGTAGTGTATCATCAATCTGTATTGCATTCAATTATTTTTTTTTACTTTTAATTTGTACCACGAAAAACTTGGGCACTGTTTTC
ATTAGTTTTCTGCTACAGCTAGTTTAATCAGATAGTTACAATTCAATTGGCAATTTGATTTGGAAAGATCTGGACCGTTGATTGAAGAGAACCAGTTTGTTCTTTGCACA
AGAGTCCAAGTGTGGTGGAGATTGTAGATGAGCATAAGCTATGTCTAAAGAAAGTTCCCACCACAACCAC
Protein sequenceShow/hide protein sequence
MAEDKEPESERSAAVWGTWEELLLACAVKRHGFKDWNSVAMELQTRSSLPHLLTTARNCELKFLDLKRRFISPQNDADSNRNSVGISDNVDSAVPWVDELRKLRVAELRR
EVHRYDVSINSLQLKVKELEEERKQGLNDSEAGNGNPDLKTESRERRSENDKKLFGEPDRRSGPNGTVAKPPDVAGEDSDREDFSVNQSNSTGSKCGNQKGAKPDFAGSY
RPEKNRRAVEPAGPLSDDGSTDTVVKKSTSDKSDTKRKKETKRVDSSELADSEAQSNGGETAATRESSEVQSSASLTGRKIVLKKEISGGSSGNEPRRTAAVKSRRFDEV
LEMIRAHKHGSLFESRLLSQETEVYKGMIRQHLDLETVQAKVNSGSYSSSNIAFYRDLLLLCNNAITFFPKSSKESIGACELRYLVSNEMKKSLPFARIDPLPEVVDSAP
AYPSQSKGPDLEGTQSLFTKQKSSVPIIVCRKRSKISSKLSTSGLGEKGDRSNDDEKSAIDLKSSIKMASTIVEDQGTTKDNKVKEKPITGARSMRRSNDNATNSTGPSN
KKQTTGWKPGSSANEPGTPNSDKKKSEIVALEKKRSAADFLKRIKQNSPAETTKKNGRGGSTNIGNAAAGEQKKGSGKSDKGKEKATSVTPSNDKKRPKEDTSPSKRSVG
RPPKKAVEAEPTSTPAKRAREGGGKEPLKRARKRAKKR