; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002808 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002808
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionBromo domain-containing protein
Genome locationchr4:45829772..45833797
RNA-Seq ExpressionLag0002808
SyntenyLag0002808
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR017930 - Myb domain
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593331.1 hypothetical protein SDJN03_12807, partial [Cucurbita argyrosperma subsp. sororia]2.9e-30183.61Show/hide
Query:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL
        MAEDK+R + Q+ ATWGTWEELLLA AV RHGFKDWNSVAME+QARSSLPH LT A NCELKFLDLKRRFTSFQNDAV   NGAGI D VDS VPWVDEL
Subjt:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        RKLRVAELRREV+ YDVSINSLQLKVKRLEEER+QGLN+ EAGTGKPDLKTEARERRSENDKKLFGEPDHRSG  GTV KP AV GEDSDRED SVNQSN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKSGNRKSTA+N KSET+P+F GS RPE NRRA E AGPQSDDGSTDTVVKNPTCDTS  TKKK ETKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL
        SSASLTGRMKR+R LRKEISGGSSGNEPRRT AVKSR+FDEVLQMIR HK+GSLFESRLQSQET EYKGMIRQHLDLETVQAK+NSGSY SSSLAFYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL

Query:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SQSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLG
        LLLCNNA TFFP SSKES+AA ELRLLVS EM KS+PVAR DPSP    SSP  P SQSKG  DLEGSQSL AK+KS VPI+VCRKRSKISSK SSS LG
Subjt:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SQSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLG

Query:  EKGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SSKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK
        EK DR+NDDEK A+DLKS++KMAST VEDHGTTKD SSK+KE+ +TGARSMRRSNDSA NSSGPSIKKQ TNSGWKP+S NETETP PDKKKSETVALEK
Subjt:  EKGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SSKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK
        KRSAADFLKRIKQNSPAET KRNGRGGS++  N    AA GEQKKGSGK+   KE+  SIKQ+NEKK  KED SPSKRSVGRPPKK  E  PDP PAPTK
Subjt:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK

Query:  RAREGGGKEPLKRAKKRARR
        RAREGGGKEPLKRA+KRARR
Subjt:  RAREGGGKEPLKRAKKRARR

XP_022144471.1 uncharacterized protein LOC111014149 [Momordica charantia]2.0e-30282.72Show/hide
Query:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL
        MAEDK+R +EQ+ A+WGTWEELLLACAVKRHGFKDWNSVAME+QARSSLPHLL  A NCELKFLDLKRRFTSFQNDAVF  NG GIADK+D+ +PWVDEL
Subjt:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKP--PAVPGEDSDREDLSVNQ
        RKLRVAELRREV+RYDVSI+SLQLKVKRLEEERE GLN+R+AGTGKPDLKTE+RERRSENDKKLFGEPD+RSGPNGTV+KP  PAVPGEDSDRED SVNQ
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKP--PAVPGEDSDREDLSVNQ

Query:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV
        SNSTGSKSGNRKST +N KSET+P+F GSFR EQNRRA EPAGPQSDDGSTDTV KNPTCD SET+KK + +   DSSELADSEAQSNGGETATRESSEV
Subjt:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV

Query:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD
        QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTA VKSR+FD++LQMIR HK+GS FESRLQSQE+EEYKGM+RQHLDLE VQAK+NSGSY+SSSLAFYRD
Subjt:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD

Query:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGE
        LLLLCNNA  FFPK SKE++AACELRLLVSNE+KKSL V RTDPSPEVVDSSPA+PS+SKG DLEGS SLLAKQKSSVPIIVCRKRSKIS+K  SSSLGE
Subjt:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGE

Query:  KGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDS-ANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR
        K DRS++DEKSAVD+K T+K AS AVEDHGTTKD SKVKEKPVTGARSMRRSNDS  NSSGPSIKKQN +SGWKP S NE ET  PDKKK ETVALEKKR
Subjt:  KGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDS-ANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR

Query:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGK
        SAADFLKRIKQNSPAET+KRN R GS S    A  EQKKGS KSDK KER  S++Q+N+KK  K+DASPSKRSVGRPPKK    +  P PTKRAREGGGK
Subjt:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGK

Query:  EPLKRAKKRARR
        EPLKR KKR+RR
Subjt:  EPLKRAKKRARR

XP_022959831.1 uncharacterized protein LOC111460774 [Cucurbita moschata]1.8e-30383.75Show/hide
Query:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL
        MAEDK+R + Q+ ATWGTWEELLLA AV RHGFKDWNSVAME+QARSSLPH LT A NCELKFLDLKRRFTSFQNDAV   NGAGI D VDS VPWVDEL
Subjt:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        RKLRVAELRREV+ YDVSINSLQLKVKRLEEER+QGLN+ EAGTGKPDLKTEARERRSENDKKLFGEPDHRSG  GTV KP AVPGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKSGNRKSTA+N KSET+P+F GS RPE NRRA E AGPQSDDGSTDTVVKNPTCDTS  TKKKKETKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL
        SSASLTGRMKR+R LRKEISGGSSGNEPRRT AVKSR+FDEVLQMIR HK+GSLFESRLQSQETEEYK MIRQHLDLETVQAK+NSGSY  SSLAFYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL

Query:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SQSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLG
        LLLCNNA TFFP SSKES+AA ELRLLVS EMKKS+PVAR DPSP    SSP  P SQSKG  DLEGSQSL AK+KS VPI+VCRKRSKISSK SSS LG
Subjt:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SQSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLG

Query:  EKGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SSKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK
        EK DR+NDDEK A+DLKS++KMAST VEDHGTTKD SSK+KE+ +TGARSMRRSNDSA NSSGPSIKKQ TNSGWKP+S NETETP PDKKKSETVALEK
Subjt:  EKGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SSKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK
        KRSAADFLKRIKQNSPAET KRNGRGGS++  N    AA GEQKKGSGK+   KE+  SI+Q+NEKK  KED SPSKRSVGRPPKK  E  PDP PAPTK
Subjt:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK

Query:  RAREGGGKEPLKRAKKRARR
        RAREGGGKEPLKRA+KRARR
Subjt:  RAREGGGKEPLKRAKKRARR

XP_023513859.1 bromodomain-containing protein bet-1 [Cucurbita pepo subsp. pepo]3.6e-30483.94Show/hide
Query:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL
        MAEDK+R + Q+ ATWGTWEELLLA AV RHGFKDWNSVAME+QARSSLPH LT A NCELKFLDLKRRFTSFQNDAV   NGAGI D VDS VPWVDEL
Subjt:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        RKLRVAELRREV+ YDVSINSLQLKVKRLEEER+QGLN+ EAGTGKPDLKTEARERRSENDKKLFGEPDHRSG  GTV KP AVPGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKSGNRKSTA+N KSET+P+F GS RPE NRRA E AGPQSDDGSTDTVVKNPTCDTS  TKKKKETKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL
        SSASLTGRMKR+R LRKEISGGSSGNEPRRT AVKSR+FDEVLQMIR HK+GSLFESRLQSQETEEYKGMIRQHLDLETVQAK+NSGSY SSSLAFYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL

Query:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SQSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLG
        LLLCNNA TFFP SSKE +AA ELRLLVS EMKKS+PVAR DPSP    SSP  P SQSKG  DLEGSQSL AK+KS VPI+VCRKRSKISSK SSS LG
Subjt:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SQSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLG

Query:  EKGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SSKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK
        EK DR+NDDEK A+DLKS++KMAST VEDHGTTKD SSK+KE+ +TGARSMRRSNDSA NSSGPSIKKQ TNSGWKP+S NETETP PD KKSETVALEK
Subjt:  EKGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SSKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN--AAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRARE
        KRSAADFLKRIKQNSPAET KRNGRGGS++  N  AA GEQKKGSGK+   KE+  SI+Q+NEKK  KED SPSKRSVGRPPKK  E +P PAPTKRARE
Subjt:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN--AAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRARE

Query:  GGGKEPLKRAKKRARR
        GGGKEPLKRA+KRARR
Subjt:  GGGKEPLKRAKKRARR

XP_038897226.1 uncharacterized protein LOC120085356 [Benincasa hispida]5.4e-31086.34Show/hide
Query:  TEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDELRKLRVAEL
        +EQ TATWGTWEELLLACAVKRHGFKDWNSVAME+QARSSLPHLLT ARNCELKF DLKRRFTSF+NDAV   NGAGIADKVDS VPWVDELRKLRVAEL
Subjt:  TEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDELRKLRVAEL

Query:  RREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGN
        RREV+RYDVSINSLQLKVK+LEEEREQG+N+REA TGKPDLKTE+RERRSENDK LFGEPDHRSGPNGTVAKP AVPGEDSDRED SVNQSNSTGSKSGN
Subjt:  RREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGN

Query:  RKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRM
        RKSTA+  KSET+PDF GS+RPEQNRRA EPAGPQSDDGSTDTVVKNPTCD SET KKKET+RVDDSSELADSEAQSNGG T TRESSEVQSSASLTGRM
Subjt:  RKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRM

Query:  KRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAAT
        KRKRLLRKEISGGSSGNEPRRTAAVKS++FDEVLQ IR HK+GSLFESRLQSQETEEYK MIRQHLDLE VQ KINSGSY SSS AFYRDLLLL NN  T
Subjt:  KRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAAT

Query:  FFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGEKGDRSNDDEK
        FFPKSSKE +AAC+LRLL+SNEMKKSL VAR DPSPEVVDSSP IPS+SKG DLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSS+ LGEKG+RSNDDEK
Subjt:  FFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGEKGDRSNDDEK

Query:  SAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRI
         AVDLKS+IK+ S  VED  TTKD SKVKEKP+TGARSMRRSNDSA NSSGP SIKKQNTNS WKPSS NETETP PDKKKSETVALEKKRSAADFLKRI
Subjt:  SAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRI

Query:  KQNSPAETMKRNGRGGSTSVGNAAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRAKKR
        KQNSPAET+KRNGRGGS+SVG+ AP EQKKGSGKSDK KE+  ++KQ+N+K+ PKEDASPSKRSVGRPPKK AE D  P P KRAREGGGKEPLKR +K+
Subjt:  KQNSPAETMKRNGRGGSTSVGNAAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRAKKR

Query:  ARR
        A+R
Subjt:  ARR

TrEMBL top hitse value%identityAlignment
A0A1S3CFH2 uncharacterized protein LOC1034998902.6e-30083.26Show/hide
Query:  TEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDELRKLRVAEL
        +EQ TATWGTWEELLLACAVKRHGFKDWNSV+ME+QARSSLPHLLT ARNCELKF DLKRRFTSFQNDAV  HN +GIADK+D+ +PWVDELRKLRVAEL
Subjt:  TEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDELRKLRVAEL

Query:  RREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGN
        RREV+RYDVSINSLQLKVK+LEEEREQG+N+REA TGKPDLKTE+RERRSENDKK FGEPDHRSGPNGTV KPP VPGEDSDRED SVNQSNSTGSKSGN
Subjt:  RREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGN

Query:  RKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRM
         KSTA+  KSET+PDF GS+RPEQNRR VEPAGPQSDDGSTDTVVKNPTCD SET KKKET+RVDDSSELADSEAQS+GG T TRESSEVQSSASLTGRM
Subjt:  RKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRM

Query:  KRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAAT
        KRKRLLR EISGGSSGNEPRR+A +KSR+FDEVLQ+IR HK+GSLFESRLQSQETEEYKGM+RQHLDLE VQ KINSGSY+SS+LAFYRDLLLL NN  T
Subjt:  KRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAAT

Query:  FFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGEKGDRSNDDEK
        FFPKSSKE++AACELRLLVSNEMKKSL VA+TDP PEVVDS P IPSQSKG DLEGSQ+LLAKQKSSVPI+VCRKRSKIS+ PS++  GEKGDRSNDDEK
Subjt:  FFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGEKGDRSNDDEK

Query:  SAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRI
         A DLKS+IK+ S  VE+  TTKD SKVKEKP TGARSMRRSNDSA NSSGP S KKQN  S WKPSS NETE P PDKKKSETVALEKKRSAADFLKRI
Subjt:  SAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRI

Query:  KQNSPAETMKRNGRGGST-SVGNAAPGEQKKGSGKSDKAKERTPS-IKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRAK
        KQNSPAET KRNGRGGS+  VGNA P EQKKG+ K+DKAKER  S +KQ+N+KK PKEDASPSKRSVGRPPKK AE +P P P KRAREGGGKEPLKR +
Subjt:  KQNSPAETMKRNGRGGST-SVGNAAPGEQKKGSGKSDKAKERTPS-IKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRAK

Query:  KRARR
        K+++R
Subjt:  KRARR

A0A5A7UYV1 Histone H3.v12.6e-30083.26Show/hide
Query:  TEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDELRKLRVAEL
        +EQ TATWGTWEELLLACAVKRHGFKDWNSV+ME+QARSSLPHLLT ARNCELKF DLKRRFTSFQNDAV  HN +GIADK+D+ +PWVDELRKLRVAEL
Subjt:  TEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDELRKLRVAEL

Query:  RREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGN
        RREV+RYDVSINSLQLKVK+LEEEREQG+N+REA TGKPDLKTE+RERRSENDKK FGEPDHRSGPNGTV KPP VPGEDSDRED SVNQSNSTGSKSGN
Subjt:  RREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGN

Query:  RKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRM
         KSTA+  KSET+PDF GS+RPEQNRR VEPAGPQSDDGSTDTVVKNPTCD SET KKKET+RVDDSSELADSEAQS+GG T TRESSEVQSSASLTGRM
Subjt:  RKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRM

Query:  KRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAAT
        KRKRLLR EISGGSSGNEPRR+A +KSR+FDEVLQ+IR HK+GSLFESRLQSQETEEYKGM+RQHLDLE VQ KINSGSY+SS+LAFYRDLLLL NN  T
Subjt:  KRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAAT

Query:  FFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGEKGDRSNDDEK
        FFPKSSKE++AACELRLLVSNEMKKSL VA+TDP PEVVDS P IPSQSKG DLEGSQ+LLAKQKSSVPI+VCRKRSKIS+ PS++  GEKGDRSNDDEK
Subjt:  FFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGEKGDRSNDDEK

Query:  SAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRI
         A DLKS+IK+ S  VE+  TTKD SKVKEKP TGARSMRRSNDSA NSSGP S KKQN  S WKPSS NETE P PDKKKSETVALEKKRSAADFLKRI
Subjt:  SAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSA-NSSGP-SIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRI

Query:  KQNSPAETMKRNGRGGST-SVGNAAPGEQKKGSGKSDKAKERTPS-IKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRAK
        KQNSPAET KRNGRGGS+  VGNA P EQKKG+ K+DKAKER  S +KQ+N+KK PKEDASPSKRSVGRPPKK AE +P P P KRAREGGGKEPLKR +
Subjt:  KQNSPAETMKRNGRGGST-SVGNAAPGEQKKGSGKSDKAKERTPS-IKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRAK

Query:  KRARR
        K+++R
Subjt:  KRARR

A0A6J1CTH9 uncharacterized protein LOC1110141499.6e-30382.72Show/hide
Query:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL
        MAEDK+R +EQ+ A+WGTWEELLLACAVKRHGFKDWNSVAME+QARSSLPHLL  A NCELKFLDLKRRFTSFQNDAVF  NG GIADK+D+ +PWVDEL
Subjt:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKP--PAVPGEDSDREDLSVNQ
        RKLRVAELRREV+RYDVSI+SLQLKVKRLEEERE GLN+R+AGTGKPDLKTE+RERRSENDKKLFGEPD+RSGPNGTV+KP  PAVPGEDSDRED SVNQ
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKP--PAVPGEDSDREDLSVNQ

Query:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV
        SNSTGSKSGNRKST +N KSET+P+F GSFR EQNRRA EPAGPQSDDGSTDTV KNPTCD SET+KK + +   DSSELADSEAQSNGGETATRESSEV
Subjt:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV

Query:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD
        QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTA VKSR+FD++LQMIR HK+GS FESRLQSQE+EEYKGM+RQHLDLE VQAK+NSGSY+SSSLAFYRD
Subjt:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD

Query:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGE
        LLLLCNNA  FFPK SKE++AACELRLLVSNE+KKSL V RTDPSPEVVDSSPA+PS+SKG DLEGS SLLAKQKSSVPIIVCRKRSKIS+K  SSSLGE
Subjt:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGE

Query:  KGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDS-ANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR
        K DRS++DEKSAVD+K T+K AS AVEDHGTTKD SKVKEKPVTGARSMRRSNDS  NSSGPSIKKQN +SGWKP S NE ET  PDKKK ETVALEKKR
Subjt:  KGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDS-ANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR

Query:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGK
        SAADFLKRIKQNSPAET+KRN R GS S    A  EQKKGS KSDK KER  S++Q+N+KK  K+DASPSKRSVGRPPKK    +  P PTKRAREGGGK
Subjt:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGK

Query:  EPLKRAKKRARR
        EPLKR KKR+RR
Subjt:  EPLKRAKKRARR

A0A6J1H5Z4 uncharacterized protein LOC1114607748.7e-30483.75Show/hide
Query:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL
        MAEDK+R + Q+ ATWGTWEELLLA AV RHGFKDWNSVAME+QARSSLPH LT A NCELKFLDLKRRFTSFQNDAV   NGAGI D VDS VPWVDEL
Subjt:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        RKLRVAELRREV+ YDVSINSLQLKVKRLEEER+QGLN+ EAGTGKPDLKTEARERRSENDKKLFGEPDHRSG  GTV KP AVPGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKSGNRKSTA+N KSET+P+F GS RPE NRRA E AGPQSDDGSTDTVVKNPTCDTS  TKKKKETKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTS-ETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL
        SSASLTGRMKR+R LRKEISGGSSGNEPRRT AVKSR+FDEVLQMIR HK+GSLFESRLQSQETEEYK MIRQHLDLETVQAK+NSGSY  SSLAFYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDL

Query:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SQSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLG
        LLLCNNA TFFP SSKES+AA ELRLLVS EMKKS+PVAR DPSP    SSP  P SQSKG  DLEGSQSL AK+KS VPI+VCRKRSKISSK SSS LG
Subjt:  LLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SQSKG-TDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLG

Query:  EKGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SSKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK
        EK DR+NDDEK A+DLKS++KMAST VEDHGTTKD SSK+KE+ +TGARSMRRSNDSA NSSGPSIKKQ TNSGWKP+S NETETP PDKKKSETVALEK
Subjt:  EKGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SSKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK
        KRSAADFLKRIKQNSPAET KRNGRGGS++  N    AA GEQKKGSGK+   KE+  SI+Q+NEKK  KED SPSKRSVGRPPKK  E  PDP PAPTK
Subjt:  KRSAADFLKRIKQNSPAETMKRNGRGGSTSVGN----AAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAE--PDPAPAPTK

Query:  RAREGGGKEPLKRAKKRARR
        RAREGGGKEPLKRA+KRARR
Subjt:  RAREGGGKEPLKRAKKRARR

A0A6J1KZN3 dentin sialophosphoprotein-like5.3e-30182.89Show/hide
Query:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL
        MAEDK+R + Q+ ATWGTWEELLLA AV RHGFKDWNSVAME+QARSSLPH LT A NCELKFLDLKRRFT+FQNDAV   NGAGI D VDS VPWVDEL
Subjt:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        RKLRVAELRREV+ YDVSINSLQLKVKRLEEER+QGLN+ EAGTGKPDLKTEARERRSENDKKLFGEPDHRSG  GTV KP AVPGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQS
        STGSKSGNRKSTA+N KSET+ +F GS RPE NRRA EP+GPQSDDGSTDTVVKNPTCDTS TKKKKETKRVD+SSELADSEA+SNGG TATRESSEVQS
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQS

Query:  SASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLL
        SASLTGRMKR+R LRKEISGGSSGNEPRRT AVKSR+FDEVLQMIR HK+GSLFESRLQSQETEEYKGMIRQHLDLETVQAK+NSGSY  SSLAFYRDLL
Subjt:  SASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLL

Query:  LLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGEK
        LLCNNA TFFP SSKES+AA ELRLLVS EM KS+PVAR DPSP    SSP  P SQSKG DLEGSQSL AK+KS VPI+VCRKRSKISSK SSS LGEK
Subjt:  LLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIP-SQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGEK

Query:  GDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SSKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR
         DR+NDDEK A+DLKS++KMAST VEDHGT KD SSK+KE+ +TGARSMRRSNDSA NSSGPSIKKQ TNSGWK +S NETET  PDKKKSETVALEKKR
Subjt:  GDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKD-SSKVKEKPVTGARSMRRSNDSA-NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR

Query:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGN-----AAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAE--PDPAPAPTKR
        SAADFLKRIKQNSPAET KRNGRGGS++  N     AA GEQKKGSGK+   KE+  SI+Q+NEKK  KED SPSKRSVGRPPKK  E  PDP PAPTKR
Subjt:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGN-----AAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAE--PDPAPAPTKR

Query:  AREGGGKEPLKRAKKRARR
        AREGGGKEPLKRA+KRARR
Subjt:  AREGGGKEPLKRAKKRARR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G61215.1 bromodomain 44.8e-2025.14Show/hide
Query:  TEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDELRKLRVAEL
        T    + WGTWEELLL  AV RHG  DW  VA EL++  SLP + T    C+ K+ DL++R+   +                     W +EL+K RVAEL
Subjt:  TEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDELRKLRVAEL

Query:  RREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGN
        +  + + + SI SL+ K++ L+ E                   E  +   ++ + L  EP            P +  G +   +D S + S S GS +  
Subjt:  RREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGN

Query:  RKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRM
        ++ T  N+  E + +       E+ +  +     +S  G    V+       S  KK+ + KR D S  +     +     +A  ES    +SA +    
Subjt:  RKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRM

Query:  KRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAAT
            + R + +  +S ++ R  +    ++  ++   I +++   +F  RL SQ+   YK ++R+H+DL+TVQ++IN G   SS+   +RD LL+ NNAA 
Subjt:  KRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAAT

Query:  FFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPII---VCRKRSKISSKPSSSSLGEKGDRSN-
        F+ K+++E  +A  LR +V+  ++  L            D  P   S + G+    ++ ++  QKS+ P +   +  K+ +  + P  + + +    S+ 
Subjt:  FFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPII---VCRKRSKISSKPSSSSLGEKGDRSN-

Query:  DDEKSAVDLK-STIKMASTAVEDHGTTKDSSKVK---EKPVTGARSMRR
         +++S  DL  S +K ++   +     KD  +     E P    R   R
Subjt:  DDEKSAVDLK-STIKMASTAVEDHGTTKDSSKVK---EKPVTGARSMRR

AT2G42150.1 DNA-binding bromodomain-containing protein2.7e-5534.6Show/hide
Query:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLT-NARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDE
        MA+ ++  T     TW TWEELLLACAV RHG + WNSV+ E+Q  S  P+L +  A  C  K+ DLK RFT    +     + A I     ST PW++E
Subjt:  MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLT-NARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDE

Query:  LRKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQS
        LRKLRV ELRREVE+YD+SI++LQ KVK+LEEERE           KPD +TE  +   + ++   GEP         V  PP             +N++
Subjt:  LRKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQS

Query:  NSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ
         S   K    ++T      E E +  GS   E      + AG  S  GS ++V K PT ++   +    T       EL +SE  ++ GE  T   S+VQ
Subjt:  NSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTA---AVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFY
        SSASL          RK  S     ++   +A    V+S+     ++++  H  GS F  RL+ QET EY  +IR+H+D E ++ ++  G Y S  + F+
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNEPRRTA---AVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFY

Query:  RDLLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSL-------PVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVP-IIVCRKRSKIS
        RDLLLL NNA  F+ + S E   A +L  LV  +M  +L        ++ + P  EVV    AIPS         S+ + +K + SVP I+ CRKRS ++
Subjt:  RDLLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSL-------PVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVP-IIVCRKRSKIS

Query:  SK------PSSSSLGEKGDRSND-DEKSAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETP
        +K      P      +K D   D DEK   D            E  G   D S +        + M R   S+       K  N N   + SS N  ++ 
Subjt:  SK------PSSSSLGEKGDRSND-DEKSAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETP

Query:  NPDKK-KSETVALEKKRSAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAPGEQKK-GSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRS-VGRPPKKT
        +  KK   E     KK+ AA FL+R+K  S  +T+KR+    S++ G     EQ+K  S K+D  K   P I+Q N+K      ASP KRS  GR  ++ 
Subjt:  NPDKK-KSETVALEKKRSAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAPGEQKK-GSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRS-VGRPPKKT

Query:  AEPDPAPAP--TKRAREGGGKEPLK----RAKKRARR
        A P  +  P   KR+R+ G KE       R KKRARR
Subjt:  AEPDPAPAP--TKRAREGGGKEPLK----RAKKRARR

AT2G44430.1 DNA-binding bromodomain-containing protein3.3e-9039.55Show/hide
Query:  EDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFG--HNGAGIADKVDSTVPWVDEL
        ED     +  T  WGTWEELLLACAVKRHGF DW+SVA E+++RSSL HLL +A +C  K+ DLKRRF   +   V           ++V + +PW+++L
Subjt:  EDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFG--HNGAGIADKVDSTVPWVDEL

Query:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN
        R LRVAELRREVERYD SI SLQLKVK+LEEE       RE G  KPDL+ E +E RSEND     E +HR        +      E+SDRE+ S+N+SN
Subjt:  RKLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSN

Query:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQS
        ST +                          E+     EP+  + DD   D   KNP  D            V+  +  A+ E  S    +    S E+  
Subjt:  STGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQS

Query:  SASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLL
        S +   + KRKR  RK+   G       R+A  KS+    +L +IR H  GSLFE RL+SQE ++YK M++QHLD+ET+Q K+  GSY SSSL FYRDL 
Subjt:  SASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLL

Query:  LLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSK--PSSSSLGE
        LL  NA  FFP SS ESMAA ELR +VS EM+K    A     P ++    A   +S   D E S S L++QKSS P++VC+KR  +S+K  PSSSS  +
Subjt:  LLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSK--PSSSSLGE

Query:  KGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSA------NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVA
        K     DD K                      +  S+ K+   TG RS RR+N  A        +G    KQ        SS + +   +  K + +TV+
Subjt:  KGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSA------NSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVA

Query:  LEKKRSAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAR
         +KK+S ADFLKR+K+NSP +  K   + G         G  KK S K+   + R+ S+     KK  + + +P KR+ GRP KKTAE   A A  KR R
Subjt:  LEKKRSAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAPGEQKKGSGKSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGRPPKKTAEPDPAPAPTKRAR

Query:  EGG-----GKEPLKRAKK
        + G      K+P KR +K
Subjt:  EGG-----GKEPLKRAKK

AT3G57980.1 DNA-binding bromodomain-containing protein4.1e-4831.76Show/hide
Query:  EELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDELRKLRVAELRREVERYDVSI
        EELLLACAV RHG   W+SVA E+  ++S    LT A +C  K+ DLKRRF+  +N    G           S+VPW++ELRKLRV ELRREVERYD+SI
Subjt:  EELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDELRKLRVAELRREVERYDVSI

Query:  NSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGNRKSTAKNFKSE
        +SLQLKVK LE+ERE+ L    +     D   E +E  +E+         + SG   T  K    P ++S            TGS++ NR         E
Subjt:  NSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGNRKSTAKNFKSE

Query:  TEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEIS
             GG    E      +PA   S  GS ++V K       E+ + +  +  +DS EL +S  +S  GE  T+E+S+ QSSAS      RK  + ++  
Subjt:  TEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEIS

Query:  GGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAATFFPKSSKESMA
             +       V+S+   + +++++ H  GS F  RL++QET +Y  +IRQH+D E +++++  G Y ++   F+RDLLLL NN   F+ + S E  A
Subjt:  GGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAATFFPKSSKESMA

Query:  ACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGEKGDRSNDDEKSAVDLKSTIKM
        A +L  L+  +M   +P     P  E  D+      + K + L        K   SVPII CRKRS ++ +  +S          +  K    +  T+  
Subjt:  ACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGEKGDRSNDDEKSAVDLKSTIKM

Query:  ASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETPN---------PDKKKSETVALEKKRSAADFLKRIKQNS
           + E+ G   D     EKP+    S + +  +A S+   +  +N  +      +N   +PN           KK   T    KK+SAA FLKR+K  S
Subjt:  ASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETPN---------PDKKKSETVALEKKRSAADFLKRIKQNS

Query:  PAETMKRNGRGGSTSVGNAAPGEQKKGSGKSDKAKE-RTPSIKQNNEKKNPK-EDASPSKRSVGRPPKKTAEPDPA--PAPTKRAREGGGKEPLKRAKK
         +ET+    +  S++    A  EQ+K + KS+K    + P+ ++    K P  E  SP+K++ G   K+      A   + T     G    P KR+K+
Subjt:  PAETMKRNGRGGSTSVGNAAPGEQKKGSGKSDKAKE-RTPSIKQNNEKKNPK-EDASPSKRSVGRPPKKTAEPDPA--PAPTKRAREGGGKEPLKRAKK

AT3G60110.1 DNA-binding bromodomain-containing protein6.8e-6736.68Show/hide
Query:  DKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDST--VPWVDELR
        D + +  Q    WGTWEEL+L CAVKRH F DW+SVA E+QARS    L+ +A NC LK+ DLKRRF    +  V   N    A++ D    + W+++LR
Subjt:  DKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDST--VPWVDELR

Query:  KLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGE--DSDRED-LSVNQ
         L +AELRREV+R D SI SLQLKVK+LEEE++      + G  KPDLK         ND+                 KP  V  E  +SDR+D  S+N+
Subjt:  KLRVAELRREVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGE--DSDRED-LSVNQ

Query:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV
        SNST S                  D     +  +N R  +P          D V K    +T E +++  +KR +           SN GE         
Subjt:  SNSTGSKSGNRKSTAKNFKSETEPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEV

Query:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD
            S T     KR  +K  SGG  G    ++A  KS+   +++++IR H  GS+FESRL+SQ+T++YK +IRQHLD++T++ K+  GSY SSSL+FYRD
Subjt:  QSSASLTGRMKRKRLLRKEISGGSSGNEPRRTAAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRD

Query:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSV-PIIVCRKRSKISSKPSSSSLG
        L LL  NA  FFP SS ES+AA ELR LVSNEMKK     RT      V  S A             +S +++QKSSV  ++ C+K+S    K S SS  
Subjt:  LLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVARTDPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSV-PIIVCRKRSKISSKPSSSSLG

Query:  EKGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR
         + D     E S   + +T    S A     T+K+ + V +   TG    R  N        +IKKQ T++  + S  ++ E     K + +TVA +KK+
Subjt:  EKGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRSNDSANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKR

Query:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAP--GEQKKGSG--KSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGR-PPKKTAEPDPAPAPTKRAR
        S ADFLKRIK+NSP +  +   +    + GN       QKK  G  K + +K +   ++ +  KK  + + + SK S  R   K+TAE     A  KR R
Subjt:  SAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAP--GEQKKGSG--KSDKAKERTPSIKQNNEKKNPKEDASPSKRSVGR-PPKKTAEPDPAPAPTKRAR

Query:  EGGGKEPLKRAKKRARR
        E G  +  K+ +KR+RR
Subjt:  EGGGKEPLKRAKKRARR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAGATAAAGACCGGCACACTGAACAGACCACCGCCACATGGGGGACATGGGAGGAGCTTTTACTGGCCTGTGCCGTCAAGCGCCATGGATTTAAGGACTGGAA
TTCCGTCGCCATGGAACTTCAGGCTCGTTCTTCTCTCCCTCATCTTCTTACCAACGCTCGTAACTGTGAACTCAAATTCCTTGACCTCAAACGACGCTTTACTTCCTTCC
AAAACGACGCCGTTTTTGGCCACAACGGCGCCGGAATTGCCGATAAGGTCGATAGCACTGTACCTTGGGTCGATGAGTTGAGGAAACTCCGTGTTGCTGAACTACGACGA
GAGGTCGAGCGCTATGACGTTTCAATCAACTCGTTGCAGTTGAAGGTCAAGAGATTGGAGGAGGAGCGAGAACAAGGCCTGAACAATAGAGAGGCCGGCACTGGAAAACC
AGATCTAAAAACGGAGGCCAGAGAACGACGATCAGAAAACGACAAAAAGCTCTTCGGAGAACCAGACCACCGGTCCGGACCGAACGGAACAGTCGCGAAGCCACCGGCCG
TTCCCGGCGAAGATTCCGACCGCGAGGATTTATCCGTCAACCAATCCAATTCCACCGGTTCCAAAAGCGGTAACCGAAAAAGCACGGCCAAAAACTTCAAATCCGAAACC
GAACCGGACTTCGGTGGTTCATTCAGACCAGAACAGAACCGGAGAGCCGTCGAACCAGCCGGACCGCAGTCGGATGACGGCAGTACGGACACTGTGGTTAAGAACCCGAC
GTGCGACACATCAGAAACGAAAAAGAAAAAGGAGACGAAGCGAGTGGACGACTCGTCCGAGTTAGCGGACTCGGAGGCTCAGTCAAACGGCGGAGAGACGGCGACGAGGG
AAAGCAGCGAAGTACAAAGCTCGGCGAGTTTGACGGGGAGGATGAAGAGGAAACGACTCCTCAGAAAGGAGATCTCCGGCGGTAGTAGCGGCAATGAACCTCGCCGGACG
GCCGCCGTCAAGTCACGGCAGTTTGACGAAGTTTTGCAGATGATACGTGAGCACAAAAATGGCTCATTGTTCGAATCCCGTCTTCAGAGTCAGGAGACGGAAGAGTACAA
AGGCATGATCCGGCAGCATTTGGACCTGGAAACAGTTCAGGCCAAGATCAATTCTGGCTCTTATGCGTCCTCCAGCCTTGCATTTTACCGAGATCTGTTGCTTCTTTGCA
ACAATGCGGCTACCTTCTTCCCCAAATCTTCCAAGGAGTCGATGGCTGCTTGTGAACTCCGTCTTCTTGTTTCCAATGAAATGAAGAAGAGTCTGCCGGTTGCTCGGACT
GACCCTTCACCTGAAGTAGTGGATTCGTCCCCAGCGATTCCTTCCCAATCCAAGGGTACTGATCTTGAAGGATCACAGTCCTTGCTTGCCAAGCAGAAGTCCTCTGTTCC
TATAATTGTTTGTCGGAAACGGAGCAAAATTTCATCTAAGCCTTCATCAAGTAGCCTTGGTGAAAAAGGTGATCGGAGCAATGATGATGAAAAGTCAGCTGTTGATCTAA
AATCAACTATTAAAATGGCTTCAACTGCTGTTGAAGACCATGGCACTACAAAAGATAGCAGCAAGGTTAAAGAGAAACCTGTAACTGGAGCTAGAAGCATGCGGAGGAGC
AACGACAGTGCCAACTCTAGTGGTCCTAGCATCAAGAAACAGAATACAAATTCTGGATGGAAACCCAGTTCGACTAACGAAACAGAGACACCAAACCCAGATAAGAAGAA
ATCTGAGACGGTTGCACTAGAGAAGAAGCGTAGTGCAGCAGATTTCTTGAAGAGGATCAAGCAGAACTCCCCTGCAGAGACTATGAAGAGAAATGGCAGAGGTGGAAGTA
CTAGTGTTGGTAATGCAGCCCCTGGAGAACAAAAGAAGGGCAGTGGCAAAAGCGATAAAGCAAAGGAGAGGACGCCATCGATAAAGCAGAACAACGAGAAGAAGAACCCA
AAGGAGGATGCCAGCCCTTCCAAGCGCAGTGTCGGGCGGCCGCCTAAGAAGACAGCAGAGCCCGACCCTGCTCCTGCTCCAACAAAAAGAGCCAGAGAAGGTGGTGGAAA
AGAACCTCTAAAGCGGGCAAAAAAAAGGGCTAGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAAGATAAAGACCGGCACACTGAACAGACCACCGCCACATGGGGGACATGGGAGGAGCTTTTACTGGCCTGTGCCGTCAAGCGCCATGGATTTAAGGACTGGAA
TTCCGTCGCCATGGAACTTCAGGCTCGTTCTTCTCTCCCTCATCTTCTTACCAACGCTCGTAACTGTGAACTCAAATTCCTTGACCTCAAACGACGCTTTACTTCCTTCC
AAAACGACGCCGTTTTTGGCCACAACGGCGCCGGAATTGCCGATAAGGTCGATAGCACTGTACCTTGGGTCGATGAGTTGAGGAAACTCCGTGTTGCTGAACTACGACGA
GAGGTCGAGCGCTATGACGTTTCAATCAACTCGTTGCAGTTGAAGGTCAAGAGATTGGAGGAGGAGCGAGAACAAGGCCTGAACAATAGAGAGGCCGGCACTGGAAAACC
AGATCTAAAAACGGAGGCCAGAGAACGACGATCAGAAAACGACAAAAAGCTCTTCGGAGAACCAGACCACCGGTCCGGACCGAACGGAACAGTCGCGAAGCCACCGGCCG
TTCCCGGCGAAGATTCCGACCGCGAGGATTTATCCGTCAACCAATCCAATTCCACCGGTTCCAAAAGCGGTAACCGAAAAAGCACGGCCAAAAACTTCAAATCCGAAACC
GAACCGGACTTCGGTGGTTCATTCAGACCAGAACAGAACCGGAGAGCCGTCGAACCAGCCGGACCGCAGTCGGATGACGGCAGTACGGACACTGTGGTTAAGAACCCGAC
GTGCGACACATCAGAAACGAAAAAGAAAAAGGAGACGAAGCGAGTGGACGACTCGTCCGAGTTAGCGGACTCGGAGGCTCAGTCAAACGGCGGAGAGACGGCGACGAGGG
AAAGCAGCGAAGTACAAAGCTCGGCGAGTTTGACGGGGAGGATGAAGAGGAAACGACTCCTCAGAAAGGAGATCTCCGGCGGTAGTAGCGGCAATGAACCTCGCCGGACG
GCCGCCGTCAAGTCACGGCAGTTTGACGAAGTTTTGCAGATGATACGTGAGCACAAAAATGGCTCATTGTTCGAATCCCGTCTTCAGAGTCAGGAGACGGAAGAGTACAA
AGGCATGATCCGGCAGCATTTGGACCTGGAAACAGTTCAGGCCAAGATCAATTCTGGCTCTTATGCGTCCTCCAGCCTTGCATTTTACCGAGATCTGTTGCTTCTTTGCA
ACAATGCGGCTACCTTCTTCCCCAAATCTTCCAAGGAGTCGATGGCTGCTTGTGAACTCCGTCTTCTTGTTTCCAATGAAATGAAGAAGAGTCTGCCGGTTGCTCGGACT
GACCCTTCACCTGAAGTAGTGGATTCGTCCCCAGCGATTCCTTCCCAATCCAAGGGTACTGATCTTGAAGGATCACAGTCCTTGCTTGCCAAGCAGAAGTCCTCTGTTCC
TATAATTGTTTGTCGGAAACGGAGCAAAATTTCATCTAAGCCTTCATCAAGTAGCCTTGGTGAAAAAGGTGATCGGAGCAATGATGATGAAAAGTCAGCTGTTGATCTAA
AATCAACTATTAAAATGGCTTCAACTGCTGTTGAAGACCATGGCACTACAAAAGATAGCAGCAAGGTTAAAGAGAAACCTGTAACTGGAGCTAGAAGCATGCGGAGGAGC
AACGACAGTGCCAACTCTAGTGGTCCTAGCATCAAGAAACAGAATACAAATTCTGGATGGAAACCCAGTTCGACTAACGAAACAGAGACACCAAACCCAGATAAGAAGAA
ATCTGAGACGGTTGCACTAGAGAAGAAGCGTAGTGCAGCAGATTTCTTGAAGAGGATCAAGCAGAACTCCCCTGCAGAGACTATGAAGAGAAATGGCAGAGGTGGAAGTA
CTAGTGTTGGTAATGCAGCCCCTGGAGAACAAAAGAAGGGCAGTGGCAAAAGCGATAAAGCAAAGGAGAGGACGCCATCGATAAAGCAGAACAACGAGAAGAAGAACCCA
AAGGAGGATGCCAGCCCTTCCAAGCGCAGTGTCGGGCGGCCGCCTAAGAAGACAGCAGAGCCCGACCCTGCTCCTGCTCCAACAAAAAGAGCCAGAGAAGGTGGTGGAAA
AGAACCTCTAAAGCGGGCAAAAAAAAGGGCTAGAAGGTGA
Protein sequenceShow/hide protein sequence
MAEDKDRHTEQTTATWGTWEELLLACAVKRHGFKDWNSVAMELQARSSLPHLLTNARNCELKFLDLKRRFTSFQNDAVFGHNGAGIADKVDSTVPWVDELRKLRVAELRR
EVERYDVSINSLQLKVKRLEEEREQGLNNREAGTGKPDLKTEARERRSENDKKLFGEPDHRSGPNGTVAKPPAVPGEDSDREDLSVNQSNSTGSKSGNRKSTAKNFKSET
EPDFGGSFRPEQNRRAVEPAGPQSDDGSTDTVVKNPTCDTSETKKKKETKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEISGGSSGNEPRRT
AAVKSRQFDEVLQMIREHKNGSLFESRLQSQETEEYKGMIRQHLDLETVQAKINSGSYASSSLAFYRDLLLLCNNAATFFPKSSKESMAACELRLLVSNEMKKSLPVART
DPSPEVVDSSPAIPSQSKGTDLEGSQSLLAKQKSSVPIIVCRKRSKISSKPSSSSLGEKGDRSNDDEKSAVDLKSTIKMASTAVEDHGTTKDSSKVKEKPVTGARSMRRS
NDSANSSGPSIKKQNTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRIKQNSPAETMKRNGRGGSTSVGNAAPGEQKKGSGKSDKAKERTPSIKQNNEKKNP
KEDASPSKRSVGRPPKKTAEPDPAPAPTKRAREGGGKEPLKRAKKRARR