; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy05g004000 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy05g004000
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionBromo domain-containing protein
Genome locationChr05:3601461..3606198
RNA-Seq ExpressionLcy05g004000
SyntenyLcy05g004000
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR017930 - Myb domain
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593331.1 hypothetical protein SDJN03_12807, partial [Cucurbita argyrosperma subsp. sororia]3.6e-30483.89Show/hide
Query:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL
        MAED++R++ Q+ ATWGTWEELLLA AV RHGFKDWNSVAMEVQARSSLPH LT A NCELKFLDLKRRFTSFQNDAV  +NGAGI D VDS VPWVDEL
Subjt:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        RKLRVAELRREV+ YDVSINSLQLKVKRLEEER+QGLND EAGTGKPDLKTEARERRSENDKKLFGEPDHRSG  GTV KP AV GEDSDRED SVNQSN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKSGNRKSTA+N KSET+P+F GS RPE NRRA E AGPQSDDGSTDTVVKNPTCDTS  TKKK ETKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL
        SSASLTGRMKR+R LRKEISGGSSGNEPRRT AVKSR+FDEVLQMIR+HK+GSLFESRLQSQET EYKGMIRQHLDLETVQAK+NSGSY SSSLAFYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL

Query:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SRSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLG
        LLLCNNA TFFP SSKES+AA ELRLLVS EM KS+PVAR DPSP    SSP  P S+SKG  DLEGSQSL AK+KS VPI+VCRKRSKISSKLSSSGLG
Subjt:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SRSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLG

Query:  EKGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SGKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK
        EK D++NDDEK A+DLKS++KMAST VEDHGTTKD S K+KE+ +TGARSMRRSNDSA NSSGPSIKKQ TNSGWKP+S NETETP PDKKKSETVALEK
Subjt:  EKGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SGKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK
        KRSAADFLKRIKQNSPAET KRNGRGGS++  N    AAAGEQKKGSGK+   KE+  SIKQSNEKK  KED SPSKRSVGRPPKK  E  PDP PAPTK
Subjt:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK

Query:  RAREGGGKEPLKRAKKRARR
        RAREGGGKEPLKRA+KRARR
Subjt:  RAREGGGKEPLKRAKKRARR

XP_022144471.1 uncharacterized protein LOC111014149 [Momordica charantia]1.7e-30683.73Show/hide
Query:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL
        MAED++R++EQ+ A+WGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLL  A NCELKFLDLKRRFTSFQNDAVF QNG GIADK+D+ +PWVDEL
Subjt:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKP--PAVPGEDSDREDLSVNQ
        RKLRVAELRREV+RYDVSI+SLQLKVKRLEEERE GLNDR+AGTGKPDLKTE+RERRSENDKKLFGEPD+RSGPNGTV+KP  PAVPGEDSDRED SVNQ
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKP--PAVPGEDSDREDLSVNQ

Query:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV
        SNSTGSKSGNRKST +N KSET+P+F GSFR EQNRRA EPAGPQSDDGSTDTV KNPTCD SET+KK + +   DSSELADSEAQSNGGETATRESSEV
Subjt:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV

Query:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD
        QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTA VKSR+FD++LQMIRAHK+GS FESRLQSQE+EEYKGM+RQHLDLE VQAK+NSGSY+SSSLAFYRD
Subjt:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD

Query:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGE
        LLLLCNNA  FFPK SKE++AACELRLLVSNE+KKSL V RTDPSPEVVDSSPA+PSRSKG DLEGS SLLAKQKSSVPIIVCRKRSKIS+KLSSS LGE
Subjt:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGE

Query:  KGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDS-ANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR
        K D+S++DEKSAVD+K T+K AS AVEDHGTTKDS KVKEKPVTGARSMRRSNDS  NSSGPSIKKQN +SGWKP S NE ET  PDKKK ETVALEKKR
Subjt:  KGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDS-ANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR

Query:  SAADFLKRIKQNSPAETMKRNGRGGS-TSVGNAAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGG
        SAADFLKRIKQNSPAET+KRN R GS +S GNA A EQKKGS KSDK KER  S++QSN+KK  K+DASPSKRSVGRPPKK    +  P PTKRAREGGG
Subjt:  SAADFLKRIKQNSPAETMKRNGRGGS-TSVGNAAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGG

Query:  KEPLKRAKKRARR
        KEPLKR KKR+RR
Subjt:  KEPLKRAKKRARR

XP_022959831.1 uncharacterized protein LOC111460774 [Cucurbita moschata]1.7e-30684.03Show/hide
Query:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL
        MAED++R++ Q+ ATWGTWEELLLA AV RHGFKDWNSVAMEVQARSSLPH LT A NCELKFLDLKRRFTSFQNDAV  +NGAGI D VDS VPWVDEL
Subjt:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        RKLRVAELRREV+ YDVSINSLQLKVKRLEEER+QGLND EAGTGKPDLKTEARERRSENDKKLFGEPDHRSG  GTV KP AVPGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKSGNRKSTA+N KSET+P+F GS RPE NRRA E AGPQSDDGSTDTVVKNPTCDTS  TKKKKETKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL
        SSASLTGRMKR+R LRKEISGGSSGNEPRRT AVKSR+FDEVLQMIR+HK+GSLFESRLQSQETEEYK MIRQHLDLETVQAK+NSGSY  SSLAFYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL

Query:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SRSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLG
        LLLCNNA TFFP SSKES+AA ELRLLVS EMKKS+PVAR DPSP    SSP  P S+SKG  DLEGSQSL AK+KS VPI+VCRKRSKISSKLSSSGLG
Subjt:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SRSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLG

Query:  EKGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SGKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK
        EK D++NDDEK A+DLKS++KMAST VEDHGTTKD S K+KE+ +TGARSMRRSNDSA NSSGPSIKKQ TNSGWKP+S NETETP PDKKKSETVALEK
Subjt:  EKGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SGKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK
        KRSAADFLKRIKQNSPAET KRNGRGGS++  N    AAAGEQKKGSGK+   KE+  SI+QSNEKK  KED SPSKRSVGRPPKK  E  PDP PAPTK
Subjt:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK

Query:  RAREGGGKEPLKRAKKRARR
        RAREGGGKEPLKRA+KRARR
Subjt:  RAREGGGKEPLKRAKKRARR

XP_023513859.1 bromodomain-containing protein bet-1 [Cucurbita pepo subsp. pepo]4.6e-30784.22Show/hide
Query:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL
        MAED++R++ Q+ ATWGTWEELLLA AV RHGFKDWNSVAMEVQARSSLPH LT A NCELKFLDLKRRFTSFQNDAV  +NGAGI D VDS VPWVDEL
Subjt:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        RKLRVAELRREV+ YDVSINSLQLKVKRLEEER+QGLND EAGTGKPDLKTEARERRSENDKKLFGEPDHRSG  GTV KP AVPGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKSGNRKSTA+N KSET+P+F GS RPE NRRA E AGPQSDDGSTDTVVKNPTCDTS  TKKKKETKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL
        SSASLTGRMKR+R LRKEISGGSSGNEPRRT AVKSR+FDEVLQMIR+HK+GSLFESRLQSQETEEYKGMIRQHLDLETVQAK+NSGSY SSSLAFYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL

Query:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SRSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLG
        LLLCNNA TFFP SSKE +AA ELRLLVS EMKKS+PVAR DPSP    SSP  P S+SKG  DLEGSQSL AK+KS VPI+VCRKRSKISSKLSSSGLG
Subjt:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SRSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLG

Query:  EKGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SGKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK
        EK D++NDDEK A+DLKS++KMAST VEDHGTTKD S K+KE+ +TGARSMRRSNDSA NSSGPSIKKQ TNSGWKP+S NETETP PD KKSETVALEK
Subjt:  EKGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SGKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN--AAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRARE
        KRSAADFLKRIKQNSPAET KRNGRGGS++  N  AAAGEQKKGSGK+   KE+  SI+QSNEKK  KED SPSKRSVGRPPKK  E +P PAPTKRARE
Subjt:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN--AAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRARE

Query:  GGGKEPLKRAKKRARR
        GGGKEPLKRA+KRARR
Subjt:  GGGKEPLKRAKKRARR

XP_038897226.1 uncharacterized protein LOC120085356 [Benincasa hispida]0.0e+0086.93Show/hide
Query:  DTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDELRKLRVAE
        D+EQ TATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLT ARNCELKF DLKRRFTSF+NDAV  QNGAGIADKVDS VPWVDELRKLRVAE
Subjt:  DTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDELRKLRVAE

Query:  LRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSG
        LRREV+RYDVSINSLQLKVK+LEEEREQG+NDREA TGKPDLKTE+RERRSENDK LFGEPDHRSGPNGTVAKP AVPGEDSDRED SVNQSNSTGSKSG
Subjt:  LRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSG

Query:  NRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR
        NRKSTA+  KSET+PDF GS+RPEQNRRA EPAGPQSDDGSTDTVVKNPTCD SET KKKET+RVDDSSELADSEAQSNGG T TRESSEVQSSASLTGR
Subjt:  NRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR

Query:  MKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAA
        MKRKRLLRKEISGGSSGNEPRRTAAVKS++FDEVLQ IRAHK+GSLFESRLQSQETEEYK MIRQHLDLE VQ KINSGSY SSS AFYRDLLLL NN  
Subjt:  MKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAA

Query:  TFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEKGDQSNDDE
        TFFPKSSKE +AAC+LRLL+SNEMKKSL VAR DPSPEVVDSSP IPSRSKG DLEGSQSLLAKQKSSVPIIVCRKRSKISSK SS+GLGEKG++SNDDE
Subjt:  TFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEKGDQSNDDE

Query:  KSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR
        K AVDLKS+IK+ S  VED  TTKDS KVKEKP+TGARSMRRSNDSA NSSGP SIKKQNTNS WKPSS NETETP PDKKKSETVALEKKRSAADFLKR
Subjt:  KSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR

Query:  IKQNSPAETMKRNGRGGSTSVGNAAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRAKK
        IKQNSPAET+KRNGRGGS+SVG+ A  EQKKGSGKSDK KE+  ++KQSN+K+ PKEDASPSKRSVGRPPKK AE D  P P KRAREGGGKEPLKR +K
Subjt:  IKQNSPAETMKRNGRGGSTSVGNAAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRAKK

Query:  RARR
        +A+R
Subjt:  RARR

TrEMBL top hitse value%identityAlignment
A0A1S3CFH2 uncharacterized protein LOC1034998909.9e-30083Show/hide
Query:  DTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDELRKLRVAE
        ++EQ TATWGTWEELLLACAVKRHGFKDWNSV+MEVQARSSLPHLLT ARNCELKF DLKRRFTSFQNDAV   N +GIADK+D+ +PWVDELRKLRVAE
Subjt:  DTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDELRKLRVAE

Query:  LRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSG
        LRREV+RYDVSINSLQLKVK+LEEEREQG+NDREA TGKPDLKTE+RERRSENDKK FGEPDHRSGPNGTV KPP VPGEDSDRED SVNQSNSTGSKSG
Subjt:  LRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSG

Query:  NRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR
        N KSTA+  KSET+PDF GS+RPEQNRR VEPAGPQSDDGSTDTVVKNPTCD SET KKKET+RVDDSSELADSEAQS+GG T TRESSEVQSSASLTGR
Subjt:  NRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR

Query:  MKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAA
        MKRKRLLR EISGGSSGNEPRR+A +KSR+FDEVLQ+IRAHK+GSLFESRLQSQETEEYKGM+RQHLDLE VQ KINSGSY+SS+LAFYRDLLLL NN  
Subjt:  MKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAA

Query:  TFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEKGDQSNDDE
        TFFPKSSKE++AACELRLLVSNEMKKSL VA+TDP PEVVDS P IPS+SKG DLEGSQ+LLAKQKSSVPI+VCRKRSKIS+  S++G GEKGD+SNDDE
Subjt:  TFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEKGDQSNDDE

Query:  KSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR
        K A DLKS+IK+ S  VE+  TTKDS KVKEKP TGARSMRRSNDSA NSSGP S KKQN  S WKPSS NETE P PDKKKSETVALEKKRSAADFLKR
Subjt:  KSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR

Query:  IKQNSPAETMKRNGRGGST-SVGNAAAGEQKKGSGKSDKSKERTPS-IKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRA
        IKQNSPAET KRNGRGGS+  VGNA   EQKKG+ K+DK+KER  S +KQSN+KK PKEDASPSKRSVGRPPKK AE +P P P KRAREGGGKEPLKR 
Subjt:  IKQNSPAETMKRNGRGGST-SVGNAAAGEQKKGSGKSDKSKERTPS-IKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRA

Query:  KKRARR
        +K+++R
Subjt:  KKRARR

A0A5A7UYV1 Histone H3.v19.9e-30083Show/hide
Query:  DTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDELRKLRVAE
        ++EQ TATWGTWEELLLACAVKRHGFKDWNSV+MEVQARSSLPHLLT ARNCELKF DLKRRFTSFQNDAV   N +GIADK+D+ +PWVDELRKLRVAE
Subjt:  DTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDELRKLRVAE

Query:  LRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSG
        LRREV+RYDVSINSLQLKVK+LEEEREQG+NDREA TGKPDLKTE+RERRSENDKK FGEPDHRSGPNGTV KPP VPGEDSDRED SVNQSNSTGSKSG
Subjt:  LRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSG

Query:  NRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR
        N KSTA+  KSET+PDF GS+RPEQNRR VEPAGPQSDDGSTDTVVKNPTCD SET KKKET+RVDDSSELADSEAQS+GG T TRESSEVQSSASLTGR
Subjt:  NRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR

Query:  MKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAA
        MKRKRLLR EISGGSSGNEPRR+A +KSR+FDEVLQ+IRAHK+GSLFESRLQSQETEEYKGM+RQHLDLE VQ KINSGSY+SS+LAFYRDLLLL NN  
Subjt:  MKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAA

Query:  TFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEKGDQSNDDE
        TFFPKSSKE++AACELRLLVSNEMKKSL VA+TDP PEVVDS P IPS+SKG DLEGSQ+LLAKQKSSVPI+VCRKRSKIS+  S++G GEKGD+SNDDE
Subjt:  TFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEKGDQSNDDE

Query:  KSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR
        K A DLKS+IK+ S  VE+  TTKDS KVKEKP TGARSMRRSNDSA NSSGP S KKQN  S WKPSS NETE P PDKKKSETVALEKKRSAADFLKR
Subjt:  KSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR

Query:  IKQNSPAETMKRNGRGGST-SVGNAAAGEQKKGSGKSDKSKERTPS-IKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRA
        IKQNSPAET KRNGRGGS+  VGNA   EQKKG+ K+DK+KER  S +KQSN+KK PKEDASPSKRSVGRPPKK AE +P P P KRAREGGGKEPLKR 
Subjt:  IKQNSPAETMKRNGRGGST-SVGNAAAGEQKKGSGKSDKSKERTPS-IKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRA

Query:  KKRARR
        +K+++R
Subjt:  KKRARR

A0A6J1CTH9 uncharacterized protein LOC1110141498.4e-30783.73Show/hide
Query:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL
        MAED++R++EQ+ A+WGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLL  A NCELKFLDLKRRFTSFQNDAVF QNG GIADK+D+ +PWVDEL
Subjt:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKP--PAVPGEDSDREDLSVNQ
        RKLRVAELRREV+RYDVSI+SLQLKVKRLEEERE GLNDR+AGTGKPDLKTE+RERRSENDKKLFGEPD+RSGPNGTV+KP  PAVPGEDSDRED SVNQ
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKP--PAVPGEDSDREDLSVNQ

Query:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV
        SNSTGSKSGNRKST +N KSET+P+F GSFR EQNRRA EPAGPQSDDGSTDTV KNPTCD SET+KK + +   DSSELADSEAQSNGGETATRESSEV
Subjt:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV

Query:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD
        QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTA VKSR+FD++LQMIRAHK+GS FESRLQSQE+EEYKGM+RQHLDLE VQAK+NSGSY+SSSLAFYRD
Subjt:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD

Query:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGE
        LLLLCNNA  FFPK SKE++AACELRLLVSNE+KKSL V RTDPSPEVVDSSPA+PSRSKG DLEGS SLLAKQKSSVPIIVCRKRSKIS+KLSSS LGE
Subjt:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGE

Query:  KGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDS-ANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR
        K D+S++DEKSAVD+K T+K AS AVEDHGTTKDS KVKEKPVTGARSMRRSNDS  NSSGPSIKKQN +SGWKP S NE ET  PDKKK ETVALEKKR
Subjt:  KGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDS-ANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR

Query:  SAADFLKRIKQNSPAETMKRNGRGGS-TSVGNAAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGG
        SAADFLKRIKQNSPAET+KRN R GS +S GNA A EQKKGS KSDK KER  S++QSN+KK  K+DASPSKRSVGRPPKK    +  P PTKRAREGGG
Subjt:  SAADFLKRIKQNSPAETMKRNGRGGS-TSVGNAAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGG

Query:  KEPLKRAKKRARR
        KEPLKR KKR+RR
Subjt:  KEPLKRAKKRARR

A0A6J1H5Z4 uncharacterized protein LOC1114607748.4e-30784.03Show/hide
Query:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL
        MAED++R++ Q+ ATWGTWEELLLA AV RHGFKDWNSVAMEVQARSSLPH LT A NCELKFLDLKRRFTSFQNDAV  +NGAGI D VDS VPWVDEL
Subjt:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        RKLRVAELRREV+ YDVSINSLQLKVKRLEEER+QGLND EAGTGKPDLKTEARERRSENDKKLFGEPDHRSG  GTV KP AVPGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKSGNRKSTA+N KSET+P+F GS RPE NRRA E AGPQSDDGSTDTVVKNPTCDTS  TKKKKETKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL
        SSASLTGRMKR+R LRKEISGGSSGNEPRRT AVKSR+FDEVLQMIR+HK+GSLFESRLQSQETEEYK MIRQHLDLETVQAK+NSGSY  SSLAFYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL

Query:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SRSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLG
        LLLCNNA TFFP SSKES+AA ELRLLVS EMKKS+PVAR DPSP    SSP  P S+SKG  DLEGSQSL AK+KS VPI+VCRKRSKISSKLSSSGLG
Subjt:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SRSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLG

Query:  EKGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SGKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK
        EK D++NDDEK A+DLKS++KMAST VEDHGTTKD S K+KE+ +TGARSMRRSNDSA NSSGPSIKKQ TNSGWKP+S NETETP PDKKKSETVALEK
Subjt:  EKGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SGKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK
        KRSAADFLKRIKQNSPAET KRNGRGGS++  N    AAAGEQKKGSGK+   KE+  SI+QSNEKK  KED SPSKRSVGRPPKK  E  PDP PAPTK
Subjt:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK

Query:  RAREGGGKEPLKRAKKRARR
        RAREGGGKEPLKRA+KRARR
Subjt:  RAREGGGKEPLKRAKKRARR

A0A6J1KZN3 dentin sialophosphoprotein-like6.6e-30483.17Show/hide
Query:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL
        MAED++R++ Q+ ATWGTWEELLLA AV RHGFKDWNSVAMEVQARSSLPH LT A NCELKFLDLKRRFT+FQNDAV  +NGAGI D VDS VPWVDEL
Subjt:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        RKLRVAELRREV+ YDVSINSLQLKVKRLEEER+QGLND EAGTGKPDLKTEARERRSENDKKLFGEPDHRSG  GTV KP AVPGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQS
        STGSKSGNRKSTA+N KSET+ +F GS RPE NRRA EP+GPQSDDGSTDTVVKNPTCDTS TKKKKETKRVD+SSELADSEA+SNGG TATRESSEVQS
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQS

Query:  SASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLL
        SASLTGRMKR+R LRKEISGGSSGNEPRRT AVKSR+FDEVLQMIR+HK+GSLFESRLQSQETEEYKGMIRQHLDLETVQAK+NSGSY  SSLAFYRDLL
Subjt:  SASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLL

Query:  LLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEK
        LLCNNA TFFP SSKES+AA ELRLLVS EM KS+PVAR DPSP    SSP  P S+SKG DLEGSQSL AK+KS VPI+VCRKRSKISSKLSSSGLGEK
Subjt:  LLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEK

Query:  GDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SGKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR
         D++NDDEK A+DLKS++KMAST VEDHGT KD S K+KE+ +TGARSMRRSNDSA NSSGPSIKKQ TNSGWK +S NETET  PDKKKSETVALEKKR
Subjt:  GDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SGKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR

Query:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGN-----AAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAE--PDPAPAPTKR
        SAADFLKRIKQNSPAET KRNGRGGS++  N     AAAGEQKKGSGK+   KE+  SI+QSNEKK  KED SPSKRSVGRPPKK  E  PDP PAPTKR
Subjt:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGN-----AAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAE--PDPAPAPTKR

Query:  AREGGGKEPLKRAKKRARR
        AREGGGKEPLKRA+KRARR
Subjt:  AREGGGKEPLKRAKKRARR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G61215.1 bromodomain 41.3e-2026.39Show/hide
Query:  TEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDELRKLRVAEL
        T    + WGTWEELLL  AV RHG  DW  VA E+++  SLP + T    C+ K+ DL++R+   +                     W +EL+K RVAEL
Subjt:  TEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDELRKLRVAEL

Query:  RREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGN
        +  + + + SI SL+ K++ L+ E     ND            E  +   ++ + L  EP            P +  G +   +D S + S S GS +  
Subjt:  RREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGN

Query:  RKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRM
        ++ T  N+  E + +       E+ +  +     +S  G    V+       S  KK+ + KR D S  +     +     +A  ES    +SA +    
Subjt:  RKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRM

Query:  KRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAAT
            + R + +  +S ++ R  +    ++  ++   I  ++   +F  RL SQ+   YK ++R+H+DL+TVQ++IN G   SS+   +RD LL+ NNAA 
Subjt:  KRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAAT

Query:  FFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPE-----VVDSSPAIPSRSKGTDLEGSQSLLAK--QKSSVPI-IVCRKRSKISSK----------
        F+ K+++E  +A  LR +V+  ++  L    T+  P         S+  +    K T      SL AK  +  + P+  V +  +K SS+          
Subjt:  FFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPE-----VVDSSPAIPSRSKGTDLEGSQSLLAK--QKSSVPI-IVCRKRSKISSK----------

Query:  ---LSSSGLGEKGDQSNDDEKSA
           + SS  G+KG     D + A
Subjt:  ---LSSSGLGEKGDQSNDDEKSA

AT2G42150.1 DNA-binding bromodomain-containing protein1.6e-5534.33Show/hide
Query:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLT-NARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDE
        MA+  +  T     TW TWEELLLACAV RHG + WNSV+ E+Q  S  P+L +  A  C  K+ DLK RFT    +    ++ A I     ST PW++E
Subjt:  MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLT-NARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDE

Query:  LRKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQS
        LRKLRV ELRREVE+YD+SI++LQ KVK+LEEERE           KPD +TE  +   + ++   GEP         V  PP             +N++
Subjt:  LRKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQS

Query:  NSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ
         S   K    ++T      E E +  GS   E      + AG  S  GS ++V K PT ++   +    T       EL +SE  ++ GE  T   S+VQ
Subjt:  NSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTA---AVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFY
        SSASL          RK  S     ++   +A    V+S+     ++++ +H  GS F  RL+ QET EY  +IR+H+D E ++ ++  G Y S  + F+
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTA---AVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFY

Query:  RDLLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSL-------PVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVP-IIVCRKRSKIS
        RDLLLL NNA  F+ + S E   A +L  LV  +M  +L        ++ + P  EVV    AIPS         S+ + +K + SVP I+ CRKRS ++
Subjt:  RDLLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSL-------PVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVP-IIVCRKRSKIS

Query:  SK---LSSSGLGEKGDQSND----DEKSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETP
        +K   L   G  +K  +++     DEK   D            E  G   D   +        + M R   S+       K  N N   + SS N  ++ 
Subjt:  SK---LSSSGLGEKGDQSND----DEKSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETP

Query:  NPDKK-KSETVALEKKRSAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAAGEQKK-GSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRS-VGRPPKKT
        +  KK   E     KK+ AA FL+R+K  S  +T+KR+    S++ G     EQ+K  S K+D  K   P I+Q+N+K      ASP KRS  GR  ++ 
Subjt:  NPDKK-KSETVALEKKRSAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAAGEQKK-GSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRS-VGRPPKKT

Query:  AEPDPAPAP--TKRAREGGGKEPLK----RAKKRARR
        A P  +  P   KR+R+ G KE       R KKRARR
Subjt:  AEPDPAPAP--TKRAREGGGKEPLK----RAKKRARR

AT2G44430.1 DNA-binding bromodomain-containing protein2.0e-9039.11Show/hide
Query:  EDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFG--QNGAGIADKVDSTVPWVDEL
        ED     +  T  WGTWEELLLACAVKRHGF DW+SVA EV++RSSL HLL +A +C  K+ DLKRRF   +   V    +      ++V + +PW+++L
Subjt:  EDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFG--QNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        R LRVAELRREVERYD SI SLQLKVK+LEEE       RE G  KPDL+ E +E RSEND     E +HR        +      E+SDRE+ S+N+SN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQS
        ST +                          E+     EP+  + DD   D   KNP  D            V+  +  A+ E  S    +    S E+  
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQS

Query:  SASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLL
        S +   + KRKR  RK+   G       R+A  KS+    +L +IR+H  GSLFE RL+SQE ++YK M++QHLD+ET+Q K+  GSY SSSL FYRDL 
Subjt:  SASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLL

Query:  LLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEKG
        LL  NA  FFP SS ESMAA ELR +VS EM+K    A     P ++    A   RS   D E S S L++QKSS P++VC+KR  +S+K S S      
Subjt:  LLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEKG

Query:  DQSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSA------NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALE
                           +S + +D    +   + K+   TG RS RR+N  A        +G    KQ        SS + +   +  K + +TV+ +
Subjt:  DQSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSA------NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALE

Query:  KKRSAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREG
        KK+S ADFLKR+K+NSP +  K   + G         G  KK S K+   + R+ S+     KK  + + +P KR+ GRP KKTAE   A A  KR R+ 
Subjt:  KKRSAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAAGEQKKGSGKSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREG

Query:  G-----GKEPLKRAKK
        G      K+P KR +K
Subjt:  G-----GKEPLKRAKK

AT3G57980.1 DNA-binding bromodomain-containing protein8.3e-4931.62Show/hide
Query:  EELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDELRKLRVAELRREVERYDVSI
        EELLLACAV RHG   W+SVA EV  ++S    LT A +C  K+ DLKRRF+  +N    G           S+VPW++ELRKLRV ELRREVERYD+SI
Subjt:  EELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDELRKLRVAELRREVERYDVSI

Query:  NSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGNRKSTAKNFKSE
        +SLQLKVK LE+ERE+ L    +     D   E +E  +E+         + SG   T  K    P ++S            TGS++ NR         E
Subjt:  NSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGNRKSTAKNFKSE

Query:  TEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEIS
             GG    E      +PA   S  GS ++V K       E+ + +  +  +DS EL +S  +S  GE  T+E+S+ QSSAS      RK  + ++  
Subjt:  TEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEIS

Query:  GGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAATFFPKSSKESMA
             +       V+S+   + ++++++H  GS F  RL++QET +Y  +IRQH+D E +++++  G Y ++   F+RDLLLL NN   F+ + S E  A
Subjt:  GGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAATFFPKSSKESMA

Query:  ACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEKGDQSNDDEKSAVDLKSTIKM
        A +L  L+  +M   +P     P  E  D+        K + L        K   SVPII CRKRS ++ +  +S                  LK   ++
Subjt:  ACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEKGDQSNDDEKSAVDLKSTIKM

Query:  ASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETPN---------PDKKKSETVALEKKRSAADFLKRIKQNS
          T  E   + ++ G+  +K      S + +  +A S+   +  +N  +      +N   +PN           KK   T    KK+SAA FLKR+K  S
Subjt:  ASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETPN---------PDKKKSETVALEKKRSAADFLKRIKQNS

Query:  PAETMKRNGRGGSTSVGNAAAGEQKKGSGKSDK-SKERTPSIKQSNEKKHPK-EDASPSKRSVGRPPKKTAEPDPA--PAPTKRAREGGGKEPLKRAKK
         +ET+    +  S++    A  EQ+K + KS+K    + P+ ++    K P  E  SP+K++ G   K+      A   + T     G    P KR+K+
Subjt:  PAETMKRNGRGGSTSVGNAAAGEQKKGSGKSDK-SKERTPSIKQSNEKKHPK-EDASPSKRSVGRPPKKTAEPDPA--PAPTKRAREGGGKEPLKRAKK

AT3G60110.1 DNA-binding bromodomain-containing protein9.5e-6937.1Show/hide
Query:  DRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDST--VPWVDELR
        D + +  Q    WGTWEEL+L CAVKRH F DW+SVA EVQARS    L+ +A NC LK+ DLKRRF    +  V  +N    A++ D    + W+++LR
Subjt:  DRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDST--VPWVDELR

Query:  KLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGE--DSDRED-LSVNQ
         L +AELRREV+R D SI SLQLKVK+LEEE+     D + G  KPDLK         ND+                 KP  V  E  +SDR+D  S+N+
Subjt:  KLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGE--DSDRED-LSVNQ

Query:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV
        SNST S                  D     +  +N R  +P          D V K    +T E +++  +KR +           SN GE         
Subjt:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV

Query:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD
            S T     KR  +K  SGG  G    ++A  KS+   +++++IR+H  GS+FESRL+SQ+T++YK +IRQHLD++T++ K+  GSY SSSL+FYRD
Subjt:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD

Query:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSV-PIIVCRKRSKISSKLSSSGLG
        L LL  NA  FFP SS ES+AA ELR LVSNEMKK     RT      V  S A             +S +++QKSSV  ++ C+K+S    K S S   
Subjt:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSV-PIIVCRKRSKISSKLSSSGLG

Query:  EKGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR
         + D+    E S   + +T    S A     T+K+   V +   TG    R  N        +IKKQ T++  + S  ++ E     K + +TVA +KK+
Subjt:  EKGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR

Query:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAA--GEQKKGSG--KSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGR-PPKKTAEPDPAPAPTKRAR
        S ADFLKRIK+NSP +  +   +    + GN       QKK  G  K + SK +   ++ S  KK  + + + SK S  R   K+TAE     A  KR R
Subjt:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAA--GEQKKGSG--KSDKSKERTPSIKQSNEKKHPKEDASPSKRSVGR-PPKKTAEPDPAPAPTKRAR

Query:  EGGGKEPLKRAKKRARR
        E G  +  K+ +KR+RR
Subjt:  EGGGKEPLKRAKKRARR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAGATAGAGACCGGGACACTGAACAGACCACCGCCACATGGGGGACATGGGAGGAGCTTTTACTGGCGTGTGCCGTCAAGCGCCATGGATTTAAGGACTGGAA
TTCCGTCGCCATGGAAGTTCAGGCTCGTTCTTCTCTCCCTCATCTTCTTACCAACGCTCGTAACTGTGAACTCAAATTCCTTGACCTCAAACGACGCTTTACTTCCTTCC
AAAACGACGCCGTTTTTGGCCAGAACGGCGCCGGAATTGCCGATAAGGTCGATAGCACTGTACCTTGGGTCGATGAGTTGAGGAAACTCCGTGTTGCTGAACTACGACGA
GAGGTCGAGCGCTATGACGTTTCAATCAACTCGTTGCAGTTGAAGGTCAAGAGATTGGAGGAGGAGCGAGAACAAGGCCTGAACGATAGAGAGGCCGGCACTGGAAAACC
AGATCTAAAAACGGAGGCCAGAGAACGACGATCAGAAAACGACAAAAAGCTCTTCGGAGAACCAGACCACCGGTCCGGACCGAACGGAACAGTCGCGAAGCCACCGGCGG
TTCCCGGCGAAGATTCCGACCGCGAGGATTTATCCGTCAACCAATCCAATTCCACCGGTTCCAAAAGCGGTAACCGAAAAAGCACGGCCAAAAACTTCAAATCCGAAACC
GAACCGGACTTCGGTGGTTCATTCAGACCAGAACAGAACCGGAGAGCCGTCGAACCAGCCGGACCGCAGTCGGATGACGGCAGTACGGACACTGTGGTTAAGAACCCGAC
GTGCGACACATCAGAAACGAAAAAGAAAAAGGAGACGAAGCGAGTGGACGACTCGTCCGAGTTAGCGGACTCGGAGGCTCAGTCAAACGGCGGAGAGACGGCGACGAGGG
AAAGCAGCGAAGTACAAAGCTCGGCGAGTTTGACGGGGAGGATGAAGAGGAAACGACTCCTCAGAAAGGAGATCTCCGGCGGTAGTAGCGGCAATGAACCTCGCCGGACG
GCCGCCGTCAAGTCACGGCAGTTTGACGAAGTTTTGCAGATGATACGAGCGCACAAAAATGGCTCATTGTTCGAATCCCGTCTCCAGAGTCAGGAGACGGAAGAGTACAA
AGGCATGATCCGGCAGCATTTGGACCTGGAAACAGTTCAGGCCAAGATCAATTCTGGCTCTTATGCGTCCTCCAGCCTTGCATTTTACCGAGATCTCTTGCTTCTTTGCA
ACAATGCGGCTACCTTCTTCCCCAAATCTTCCAAGGAGTCGATGGCTGCTTGTGAACTCCGTCTTCTTGTTTCCAATGAAATGAAGAAGAGTCTGCCGGTTGCTCGGACT
GACCCTTCACCTGAAGTAGTGGATTCGTCCCCAGCGATTCCTTCCCGATCCAAGGGTACTGATCTTGAAGGATCACAGTCCTTGCTTGCCAAGCAGAAGTCCTCTGTTCC
TATAATTGTTTGTCGGAAACGGAGCAAAATTTCATCGAAGCTTTCATCGAGTGGTCTTGGTGAAAAAGGTGATCAGAGCAATGATGATGAAAAGTCAGCTGTTGATCTAA
AATCAACTATTAAAATGGCTTCAACTGCTGTTGAAGACCATGGCACTACAAAAGATAGTGGCAAGGTTAAAGAGAAACCTGTAACTGGAGCTAGAAGCATGCGGAGGAGC
AACGACAGTGCCAACTCTAGTGGTCCTAGCATCAAGAAACAGAATACGAATTCTGGATGGAAACCCAGTTCGACTAACGAAACAGAGACACCAAACCCAGATAAGAAGAA
ATCTGAGACGGTTGCACTAGAGAAGAAGCGTAGTGCAGCAGATTTCTTGAAGAGGATCAAGCAGAACTCCCCTGCAGAGACTATGAAGAGAAATGGCAGAGGTGGAAGTA
CTAGCGTTGGTAATGCAGCCGCCGGAGAACAAAAGAAGGGCAGTGGCAAAAGCGATAAATCAAAGGAGAGGACGCCATCGATAAAGCAGAGCAACGAGAAGAAGCACCCA
AAGGAGGATGCCAGCCCTTCCAAGCGCAGTGTCGGGCGGCCACCTAAGAAGACAGCAGAGCCCGACCCTGCTCCTGCTCCAACAAAAAGAGCCAGAGAAGGTGGTGGAAA
AGAACCTCTAAAGCGGGCAAAAAAAAGGGCTAGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
TTGGTTTTCCGATTTCTCATTTTACTTGCTTTCTCTCAAATCGAAACTCAAATTAATTTCTTTTAAAAACGAGTTTTGGATTTAATTTTTCTTTTTCTTTCTTTTGTTCC
CTTCACGCCGCAGAGGAGAAGAACACGACGCGCAGAAACCCTAATAACAAACTCTCTAATGGAGGAACTTTCCCTTTTTGTTTCTCTTTCTCTGTGATTTTTTAAAAATT
AAAAGAGGGAAAAGAGAATTGGAGATATGAGAAAGAAGACCGCCTAATCGGGAATCTCTATGCTGTGTTTCTTCTTCTAATCTAATCTAATGGCGGAAGATAGAGACCGG
GACACTGAACAGACCACCGCCACATGGGGGACATGGGAGGAGCTTTTACTGGCGTGTGCCGTCAAGCGCCATGGATTTAAGGACTGGAATTCCGTCGCCATGGAAGTTCA
GGCTCGTTCTTCTCTCCCTCATCTTCTTACCAACGCTCGTAACTGTGAACTCAAATTCCTTGACCTCAAACGACGCTTTACTTCCTTCCAAAACGACGCCGTTTTTGGCC
AGAACGGCGCCGGAATTGCCGATAAGGTCGATAGCACTGTACCTTGGGTCGATGAGTTGAGGAAACTCCGTGTTGCTGAACTACGACGAGAGGTCGAGCGCTATGACGTT
TCAATCAACTCGTTGCAGTTGAAGGTCAAGAGATTGGAGGAGGAGCGAGAACAAGGCCTGAACGATAGAGAGGCCGGCACTGGAAAACCAGATCTAAAAACGGAGGCCAG
AGAACGACGATCAGAAAACGACAAAAAGCTCTTCGGAGAACCAGACCACCGGTCCGGACCGAACGGAACAGTCGCGAAGCCACCGGCGGTTCCCGGCGAAGATTCCGACC
GCGAGGATTTATCCGTCAACCAATCCAATTCCACCGGTTCCAAAAGCGGTAACCGAAAAAGCACGGCCAAAAACTTCAAATCCGAAACCGAACCGGACTTCGGTGGTTCA
TTCAGACCAGAACAGAACCGGAGAGCCGTCGAACCAGCCGGACCGCAGTCGGATGACGGCAGTACGGACACTGTGGTTAAGAACCCGACGTGCGACACATCAGAAACGAA
AAAGAAAAAGGAGACGAAGCGAGTGGACGACTCGTCCGAGTTAGCGGACTCGGAGGCTCAGTCAAACGGCGGAGAGACGGCGACGAGGGAAAGCAGCGAAGTACAAAGCT
CGGCGAGTTTGACGGGGAGGATGAAGAGGAAACGACTCCTCAGAAAGGAGATCTCCGGCGGTAGTAGCGGCAATGAACCTCGCCGGACGGCCGCCGTCAAGTCACGGCAG
TTTGACGAAGTTTTGCAGATGATACGAGCGCACAAAAATGGCTCATTGTTCGAATCCCGTCTCCAGAGTCAGGAGACGGAAGAGTACAAAGGCATGATCCGGCAGCATTT
GGACCTGGAAACAGTTCAGGCCAAGATCAATTCTGGCTCTTATGCGTCCTCCAGCCTTGCATTTTACCGAGATCTCTTGCTTCTTTGCAACAATGCGGCTACCTTCTTCC
CCAAATCTTCCAAGGAGTCGATGGCTGCTTGTGAACTCCGTCTTCTTGTTTCCAATGAAATGAAGAAGAGTCTGCCGGTTGCTCGGACTGACCCTTCACCTGAAGTAGTG
GATTCGTCCCCAGCGATTCCTTCCCGATCCAAGGGTACTGATCTTGAAGGATCACAGTCCTTGCTTGCCAAGCAGAAGTCCTCTGTTCCTATAATTGTTTGTCGGAAACG
GAGCAAAATTTCATCGAAGCTTTCATCGAGTGGTCTTGGTGAAAAAGGTGATCAGAGCAATGATGATGAAAAGTCAGCTGTTGATCTAAAATCAACTATTAAAATGGCTT
CAACTGCTGTTGAAGACCATGGCACTACAAAAGATAGTGGCAAGGTTAAAGAGAAACCTGTAACTGGAGCTAGAAGCATGCGGAGGAGCAACGACAGTGCCAACTCTAGT
GGTCCTAGCATCAAGAAACAGAATACGAATTCTGGATGGAAACCCAGTTCGACTAACGAAACAGAGACACCAAACCCAGATAAGAAGAAATCTGAGACGGTTGCACTAGA
GAAGAAGCGTAGTGCAGCAGATTTCTTGAAGAGGATCAAGCAGAACTCCCCTGCAGAGACTATGAAGAGAAATGGCAGAGGTGGAAGTACTAGCGTTGGTAATGCAGCCG
CCGGAGAACAAAAGAAGGGCAGTGGCAAAAGCGATAAATCAAAGGAGAGGACGCCATCGATAAAGCAGAGCAACGAGAAGAAGCACCCAAAGGAGGATGCCAGCCCTTCC
AAGCGCAGTGTCGGGCGGCCACCTAAGAAGACAGCAGAGCCCGACCCTGCTCCTGCTCCAACAAAAAGAGCCAGAGAAGGTGGTGGAAAAGAACCTCTAAAGCGGGCAAA
AAAAAGGGCTAGAAGGTGACCGTCCTAGTCACCAGCAGACGATTTGTTTGTGTATATTGTAGACGTAGCCTGCAGTGTATCAATCTGTATTGCATTTAATTATTTTTTAC
TTCTAATTTGTAACACGAAAAAGTTAGGCACTATTTTGTTTAGTTTTCTACTAAAAAACTAGTTAACCAGTAGTTAAAATTCAATTAGCACCTTTCTATAATTCAAACTC
ATTTTCGCTGCAATGATTTGGAAAGATCTGGACCGTCAATTAGAGAGAACCAGTTTGTTCCTAGTACAAGAGTACATGTGTGGTGGATATTGGAGATGAGCAAAAGCTTT
AATGGACACGTGGAGAATGGTAATTGGGTAGCCACCAGCGAAAGGCCGCCGCACAGGCTTTTTTTTTTTTTTTTAACAAGTATTCTGACTGTCTAAAGAGAGTTC
Protein sequenceShow/hide protein sequence
MAEDRDRDTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGQNGAGIADKVDSTVPWVDELRKLRVAELRR
EVERYDVSINSLQLKVKRLEEEREQGLNDREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGNRKSTAKNFKSET
EPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEISGGSSGNEPRRT
AAVKSRQFDEVLQMIRAHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVART
DPSPEVVDSSPAIPSRSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKLSSSGLGEKGDQSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSGKVKEKPVTGARSMRRS
NDSANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAAGEQKKGSGKSDKSKERTPSIKQSNEKKHP
KEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRAKKRARR