; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016628 (gene) of Snake gourd v1 genome

Gene IDTan0016628
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBromo domain-containing protein
Genome locationLG06:4073691..4078513
RNA-Seq ExpressionTan0016628
SyntenyTan0016628
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR017930 - Myb domain
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593331.1 hypothetical protein SDJN03_12807, partial [Cucurbita argyrosperma subsp. sororia]1.6e-29983.5Show/hide
Query:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL
        MAEDKERES QSIATWGTWEELLLA AV RHGFKDWNSVAMEVQARSSLPH LTTA NCELKFLDLKRRFTS QNDAVL+ NGAGI D VD+AVPWVDEL
Subjt:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL

Query:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN
        RKLRVAELRREVQ YDVSINSLQLKVKRLEEER+QGLNDSEAGTGKPDLKTE+RERRSENDKK FGEPDHRSG  GTV KP  + GEDSDRED SVNQSN
Subjt:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN

Query:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKG-TKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKS          K ETK EFTGS RPE NRRAAE AGPQSDDGSTDTVVKNPTCDTS T KKK  TKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKG-TKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDL
        SSASLTGRMKR+R LRKEISGGSSGN+PRRT AVKSRRFDEVLQMIR+HKHGSLFESRLQSQET EYK M+RQHLDLETVQAK+NSGSY SSSL+FYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDL

Query:  LLLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKG-PDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLG
        LLLCNNAVTFFP SSKESVAA E RLLVS EM KS+ VAR  PSP    S+P NP SQSKG PDLEGSQ L AK+KS VPI+VCRKRSKISSKLSS+GLG
Subjt:  LLLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKG-PDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLG

Query:  ENGDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEK
        E  DR+NDDEK A+DLKSS+KMAS  VED GTTKD SSK+KE+ ITGARSMRRSNDSATNSSGPSIKKQTTNSGWKP+S NETETP PDKKKSETVALEK
Subjt:  ENGDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETIIKRNGRGGSSNIGN----AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEA--DPVPIST
        KRSAADFLKRIKQNSPAET  KRNGRGGSSN  N    AA GE KKGSGK+  GKE+ASSI+QSN+KKR K+D SPSKRSVGRPPKKA EA  DP P  T
Subjt:  KRSAADFLKRIKQNSPAETIIKRNGRGGSSNIGN----AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEA--DPVPIST

Query:  KRAREGGGKEPLKRARKRPRR
        KRAREGGGKEPLKRARKR RR
Subjt:  KRAREGGGKEPLKRARKRPRR

XP_022959831.1 uncharacterized protein LOC111460774 [Cucurbita moschata]1.4e-30384.05Show/hide
Query:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL
        MAEDKERES QSIATWGTWEELLLA AV RHGFKDWNSVAMEVQARSSLPH LTTA NCELKFLDLKRRFTS QNDAVL+ NGAGI D VD+AVPWVDEL
Subjt:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL

Query:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN
        RKLRVAELRREVQ YDVSINSLQLKVKRLEEER+QGLNDSEAGTGKPDLKTE+RERRSENDKK FGEPDHRSG  GTV KP  +PGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN

Query:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTI-KKKGTKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKS          K ETK EFTGS RPE NRRAAE AGPQSDDGSTDTVVKNPTCDTS T  KKK TKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTI-KKKGTKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDL
        SSASLTGRMKR+R LRKEISGGSSGN+PRRT AVKSRRFDEVLQMIR+HKHGSLFESRLQSQETEEYKVM+RQHLDLETVQAK+NSGSY  SSL+FYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDL

Query:  LLLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKG-PDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLG
        LLLCNNAVTFFP SSKESVAA E RLLVS EMKKS+ VAR  PSP    S+P NP SQSKG PDLEGSQ L AK+KS VPI+VCRKRSKISSKLSS+GLG
Subjt:  LLLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKG-PDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLG

Query:  ENGDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEK
        E  DR+NDDEK A+DLKSS+KMAS  VED GTTKD SSK+KE+ ITGARSMRRSNDSATNSSGPSIKKQTTNSGWKP+S NETETP PDKKKSETVALEK
Subjt:  ENGDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETIIKRNGRGGSSNIGN----AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEA--DPVPIST
        KRSAADFLKRIKQNSPAET  KRNGRGGSSN  N    AA GE KKGSGK+  GKE+ASSIRQSN+KKR K+D SPSKRSVGRPPKKA EA  DP P  T
Subjt:  KRSAADFLKRIKQNSPAETIIKRNGRGGSSNIGN----AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEA--DPVPIST

Query:  KRAREGGGKEPLKRARKRPRR
        KRAREGGGKEPLKRARKR RR
Subjt:  KRAREGGGKEPLKRARKRPRR

XP_023004478.1 dentin sialophosphoprotein-like [Cucurbita maxima]1.8e-30083.19Show/hide
Query:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL
        MAEDKERES QSIATWGTWEELLLA AV RHGFKDWNSVAMEVQARSSLPH LTTA NCELKFLDLKRRFT+ QNDAVLN NGAGI D VD+AVPWVDEL
Subjt:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL

Query:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN
        RKLRVAELRREVQ YDVSINSLQLKVKRLEEER+QGLNDSEAGTGKPDLKTE+RERRSENDKK FGEPDHRSG  GTV KP  +PGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN

Query:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQS
        STGSKS          K ETK+EFTGS RPE NRRAAEP+GPQSDDGSTDTVVKNPTCDTS T KKK TKRVD+SSELADSEA+SNGG TATRESSEVQS
Subjt:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQS

Query:  SASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLL
        SASLTGRMKR+R LRKEISGGSSGN+PRRT AVKSRRFDEVLQMIR+HKHGSLFESRLQSQETEEYK M+RQHLDLETVQAK+NSGSY  SSL+FYRDLL
Subjt:  SASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLL

Query:  LLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGEN
        LLCNNAVTFFP SSKESVAA E RLLVS EM KS+ VAR  PSP    S+P NP SQSKGPDLEGSQ L AK+KS VPI+VCRKRSKISSKLSS+GLGE 
Subjt:  LLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGEN

Query:  GDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKR
         DR+NDDEK A+DLKSS+KMAS  VED GT KD SSK+KE+ ITGARSMRRSNDSATNSSGPSIKKQTTNSGWK +S NETET  PDKKKSETVALEKKR
Subjt:  GDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKR

Query:  SAADFLKRIKQNSPAETIIKRNGRGGSSNIGN-----AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEA--DPVPISTK
        SAADFLKRIKQNSPAET  KRNGRGGSSN  N     AA GE KKGSGK+  GKE+ASSIRQSN+KKR K+D SPSKRSVGRPPKKA EA  DP P  TK
Subjt:  SAADFLKRIKQNSPAETIIKRNGRGGSSNIGN-----AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEA--DPVPISTK

Query:  RAREGGGKEPLKRARKRPRR
        RAREGGGKEPLKRARKR RR
Subjt:  RAREGGGKEPLKRARKRPRR

XP_023513859.1 bromodomain-containing protein bet-1 [Cucurbita pepo subsp. pepo]8.0e-30484.24Show/hide
Query:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL
        MAEDKERES QSIATWGTWEELLLA AV RHGFKDWNSVAMEVQARSSLPH LTTA NCELKFLDLKRRFTS QNDAVLN NGAGI D VD+AVPWVDEL
Subjt:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL

Query:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN
        RKLRVAELRREVQ YDVSINSLQLKVKRLEEER+QGLNDSEAGTGKPDLKTE+RERRSENDKK FGEPDHRSG  GTV KP  +PGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN

Query:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTI-KKKGTKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKS          K ETK EFTGS RPE NRRAAE AGPQSDDGSTDTVVKNPTCDTS T  KKK TKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTI-KKKGTKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDL
        SSASLTGRMKR+R LRKEISGGSSGN+PRRT AVKSRRFDEVLQMIR+HKHGSLFESRLQSQETEEYK M+RQHLDLETVQAK+NSGSY SSSL+FYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDL

Query:  LLLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKG-PDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLG
        LLLCNNAVTFFP SSKE VAA E RLLVS EMKKS+ VAR  PSP    S+P NP SQSKG PDLEGSQ L AK+KS VPI+VCRKRSKISSKLSS+GLG
Subjt:  LLLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKG-PDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLG

Query:  ENGDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEK
        E  DR+NDDEK A+DLKSS+KMAS  VED GTTKD SSK+KE+ ITGARSMRRSNDSATNSSGPSIKKQTTNSGWKP+S NETETP PD KKSETVALEK
Subjt:  ENGDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETIIKRNGRGGSSNIGN--AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAR
        KRSAADFLKRIKQNSPAET  KRNGRGGSSN  N  AA GE KKGSGK+  GKE+ASSIRQSN+KKR K+D SPSKRSVGRPPKKA EA+P P  TKRAR
Subjt:  KRSAADFLKRIKQNSPAETIIKRNGRGGSSNIGN--AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAR

Query:  EGGGKEPLKRARKRPRR
        EGGGKEPLKRARKR RR
Subjt:  EGGGKEPLKRARKRPRR

XP_038897226.1 uncharacterized protein LOC120085356 [Benincasa hispida]5.3e-30885.53Show/hide
Query:  ESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKLRVAE
        +SEQ  ATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKF DLKRRFTS +NDAVL+ NGAGIADKVD+AVPWVDELRKLRVAE
Subjt:  ESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKLRVAE

Query:  LRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSNSTGSKS-
        LRREVQRYDVSINSLQLKVK+LEEEREQG+ND EA TGKPDLKTESRERRSENDK  FGEPDHRSGPNGTVAKP  +PGEDSDRED SVNQSNSTGSKS 
Subjt:  LRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSNSTGSKS-

Query:  --------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR
                AK ETK +F GSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCD S+T KKK T+RVDDSSELADSEAQSNGG T TRESSEVQSSASLTGR
Subjt:  --------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR

Query:  MKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAV
        MKRKRLLRKEISGGSSGN+PRRT+AVKS+RFDEVLQ IRAHKHGSLFESRLQSQETEEYK M+RQHLDLE VQ K+NSGSYSSS  +FYRDLLLL NN V
Subjt:  MKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAV

Query:  TFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGENGDRSNDDE
        TFFPKSSKE VAAC+ RLL+SNEMKKSLQVAR  PSPEVVDS+P  PS+SKGPDLEGSQ LLAKQKSSVPIIVCRKRSKISSK SSTGLGE G+RSNDDE
Subjt:  TFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGENGDRSNDDE

Query:  KSAIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGP-SIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR
        K A+DLKSSIK+ SN VED+ TTKD SKVKEKPITGARSMRRSNDSATNSSGP SIKKQ TNS WKPSS NETETP PDKKKSETVALEKKRSAADFLKR
Subjt:  KSAIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGP-SIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR

Query:  IKQNSPAETIIKRNGRGGSSNIGNAATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGGGKEPLKRAR
        IKQNSPAET IKRNGRGGSS++G+ A  E KKGSGKSDKGKE+ S+++QSND KRPK+DASPSKRSVGRPPKKAAEADP PI  KRAREGGGKEPLKR R
Subjt:  IKQNSPAETIIKRNGRGGSSNIGNAATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGGGKEPLKRAR

Query:  KRPRR
        K+ +R
Subjt:  KRPRR

TrEMBL top hitse value%identityAlignment
A0A1S3CFH2 uncharacterized protein LOC1034998901.3e-29983.29Show/hide
Query:  ESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKLRVAE
        ESEQ  ATWGTWEELLLACAVKRHGFKDWNSV+MEVQARSSLPHLLTTARNCELKF DLKRRFTS QNDAVLNHN +GIADK+DTA+PWVDELRKLRVAE
Subjt:  ESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKLRVAE

Query:  LRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSNSTGSKS-
        LRREVQRYDVSINSLQLKVK+LEEEREQG+ND EA TGKPDLKTESRERRSENDKK FGEPDHRSGPNGTV KPP +PGEDSDRED SVNQSNSTGSKS 
Subjt:  LRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSNSTGSKS-

Query:  --------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR
                AK ETK +FTGSYRPEQNRR  EPAGPQSDDGSTDTVVKNPTCD S+T KKK T+RVDDSSELADSEAQS+GG T TRESSEVQSSASLTGR
Subjt:  --------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR

Query:  MKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAV
        MKRKRLLR EISGGSSGN+PRR++ +KSRRFDEVLQ+IRAHKHGSLFESRLQSQETEEYK MVRQHLDLE VQ K+NSGSYSSS+L+FYRDLLLL NN V
Subjt:  MKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAV

Query:  TFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGENGDRSNDDE
        TFFPKSSKE+VAACE RLLVSNEMKKSL+VA+T P PEVVDS P  PSQSKGPDLEGSQ LLAKQKSSVPI+VCRKRSKIS+  S+TG GE GDRSNDDE
Subjt:  TFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGENGDRSNDDE

Query:  KSAIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGP-SIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR
        K A DLKSSIK+ SN VE+  TTKD SKVKEKP TGARSMRRSNDSATNSSGP S KKQ   S WKPSS NETE P PDKKKSETVALEKKRSAADFLKR
Subjt:  KSAIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGP-SIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR

Query:  IKQNSPAETIIKRNGRGGSSNIGNAATGELKKGSGKSDKGKERASS-IRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGGGKEPLKRA
        IKQNSPAET  KRNGRGGSS     AT E KKG+ K+DK KER SS ++QSNDKKRPK+DASPSKRSVGRPPKKAAEA+P P   KRAREGGGKEPLKR 
Subjt:  IKQNSPAETIIKRNGRGGSSNIGNAATGELKKGSGKSDKGKERASS-IRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGGGKEPLKRA

Query:  RKRPRR
        RK+ +R
Subjt:  RKRPRR

A0A5A7UYV1 Histone H3.v11.3e-29983.29Show/hide
Query:  ESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKLRVAE
        ESEQ  ATWGTWEELLLACAVKRHGFKDWNSV+MEVQARSSLPHLLTTARNCELKF DLKRRFTS QNDAVLNHN +GIADK+DTA+PWVDELRKLRVAE
Subjt:  ESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKLRVAE

Query:  LRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSNSTGSKS-
        LRREVQRYDVSINSLQLKVK+LEEEREQG+ND EA TGKPDLKTESRERRSENDKK FGEPDHRSGPNGTV KPP +PGEDSDRED SVNQSNSTGSKS 
Subjt:  LRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSNSTGSKS-

Query:  --------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR
                AK ETK +FTGSYRPEQNRR  EPAGPQSDDGSTDTVVKNPTCD S+T KKK T+RVDDSSELADSEAQS+GG T TRESSEVQSSASLTGR
Subjt:  --------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGR

Query:  MKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAV
        MKRKRLLR EISGGSSGN+PRR++ +KSRRFDEVLQ+IRAHKHGSLFESRLQSQETEEYK MVRQHLDLE VQ K+NSGSYSSS+L+FYRDLLLL NN V
Subjt:  MKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAV

Query:  TFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGENGDRSNDDE
        TFFPKSSKE+VAACE RLLVSNEMKKSL+VA+T P PEVVDS P  PSQSKGPDLEGSQ LLAKQKSSVPI+VCRKRSKIS+  S+TG GE GDRSNDDE
Subjt:  TFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGENGDRSNDDE

Query:  KSAIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGP-SIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR
        K A DLKSSIK+ SN VE+  TTKD SKVKEKP TGARSMRRSNDSATNSSGP S KKQ   S WKPSS NETE P PDKKKSETVALEKKRSAADFLKR
Subjt:  KSAIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGP-SIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKR

Query:  IKQNSPAETIIKRNGRGGSSNIGNAATGELKKGSGKSDKGKERASS-IRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGGGKEPLKRA
        IKQNSPAET  KRNGRGGSS     AT E KKG+ K+DK KER SS ++QSNDKKRPK+DASPSKRSVGRPPKKAAEA+P P   KRAREGGGKEPLKR 
Subjt:  IKQNSPAETIIKRNGRGGSSNIGNAATGELKKGSGKSDKGKERASS-IRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGGGKEPLKRA

Query:  RKRPRR
        RK+ +R
Subjt:  RKRPRR

A0A6J1CTH9 uncharacterized protein LOC1110141493.7e-29982.89Show/hide
Query:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL
        MAEDKERESEQS A+WGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLL TA NCELKFLDLKRRFTS QNDAV N NG GIADK+D A+PWVDEL
Subjt:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL

Query:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKP--PGIPGEDSDREDLSVNQ
        RKLRVAELRREVQRYDVSI+SLQLKVKRLEEERE GLND +AGTGKPDLKTESRERRSENDKK FGEPD+RSGPNGTV+KP  P +PGEDSDRED SVNQ
Subjt:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKP--PGIPGEDSDREDLSVNQ

Query:  SNSTGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEV
        SNSTGSKS          K ETK EF GS+R EQNRRA+EPAGPQSDDGSTDTV KNPTCD S+T +KK  +   DSSELADSEAQSNGGETATRESSEV
Subjt:  SNSTGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEV

Query:  QSSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRD
        QSSASLTGRMKRKRLLRKEISGGSSGN+PRRT+ VKSRRFD++LQMIRAHKHGS FESRLQSQE+EEYK MVRQHLDLE VQAKVNSGSYSSSSL+FYRD
Subjt:  QSSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRD

Query:  LLLLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGE
        LLLLCNNAV FFPK SKE+VAACE RLLVSNE+KKSLQV RT PSPEVVDS+PA PS+SKG DLEGS  LLAKQKSSVPIIVCRKRSKIS+KLSS+ LGE
Subjt:  LLLLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGE

Query:  NGDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKR
          DRS++DEKSA+D+K ++K ASN VED GTTKD SKVKEKP+TGARSMRRSNDS TNSSGPSIKKQ  +SGWKP S NE ET  PDKKK ETVALEKKR
Subjt:  NGDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKR

Query:  SAADFLKRIKQNSPAETIIKRNGRGGSSNIGNAATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGGG
        SAADFLKRIKQNSPAET IKRN R GS +    AT E KKGS KSDKGKERA S+RQSNDKKR KDDASPSKRSVGRPPKK AEA+PVP  TKRAREGGG
Subjt:  SAADFLKRIKQNSPAETIIKRNGRGGSSNIGNAATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGGG

Query:  KEPLKRARKRPRR
        KEPLKR +KR RR
Subjt:  KEPLKRARKRPRR

A0A6J1H5Z4 uncharacterized protein LOC1114607746.6e-30484.05Show/hide
Query:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL
        MAEDKERES QSIATWGTWEELLLA AV RHGFKDWNSVAMEVQARSSLPH LTTA NCELKFLDLKRRFTS QNDAVL+ NGAGI D VD+AVPWVDEL
Subjt:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL

Query:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN
        RKLRVAELRREVQ YDVSINSLQLKVKRLEEER+QGLNDSEAGTGKPDLKTE+RERRSENDKK FGEPDHRSG  GTV KP  +PGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN

Query:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTI-KKKGTKRVDDSSELADSEAQSNGGETATRESSEVQ
        STGSKS          K ETK EFTGS RPE NRRAAE AGPQSDDGSTDTVVKNPTCDTS T  KKK TKRVDDSSELADSEA+SNGGETATRESSEVQ
Subjt:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTI-KKKGTKRVDDSSELADSEAQSNGGETATRESSEVQ

Query:  SSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDL
        SSASLTGRMKR+R LRKEISGGSSGN+PRRT AVKSRRFDEVLQMIR+HKHGSLFESRLQSQETEEYKVM+RQHLDLETVQAK+NSGSY  SSL+FYRDL
Subjt:  SSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDL

Query:  LLLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKG-PDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLG
        LLLCNNAVTFFP SSKESVAA E RLLVS EMKKS+ VAR  PSP    S+P NP SQSKG PDLEGSQ L AK+KS VPI+VCRKRSKISSKLSS+GLG
Subjt:  LLLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKG-PDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLG

Query:  ENGDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEK
        E  DR+NDDEK A+DLKSS+KMAS  VED GTTKD SSK+KE+ ITGARSMRRSNDSATNSSGPSIKKQTTNSGWKP+S NETETP PDKKKSETVALEK
Subjt:  ENGDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEK

Query:  KRSAADFLKRIKQNSPAETIIKRNGRGGSSNIGN----AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEA--DPVPIST
        KRSAADFLKRIKQNSPAET  KRNGRGGSSN  N    AA GE KKGSGK+  GKE+ASSIRQSN+KKR K+D SPSKRSVGRPPKKA EA  DP P  T
Subjt:  KRSAADFLKRIKQNSPAETIIKRNGRGGSSNIGN----AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEA--DPVPIST

Query:  KRAREGGGKEPLKRARKRPRR
        KRAREGGGKEPLKRARKR RR
Subjt:  KRAREGGGKEPLKRARKRPRR

A0A6J1KZN3 dentin sialophosphoprotein-like8.9e-30183.19Show/hide
Query:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL
        MAEDKERES QSIATWGTWEELLLA AV RHGFKDWNSVAMEVQARSSLPH LTTA NCELKFLDLKRRFT+ QNDAVLN NGAGI D VD+AVPWVDEL
Subjt:  MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDEL

Query:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN
        RKLRVAELRREVQ YDVSINSLQLKVKRLEEER+QGLNDSEAGTGKPDLKTE+RERRSENDKK FGEPDHRSG  GTV KP  +PGEDSDRED SVNQSN
Subjt:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN

Query:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQS
        STGSKS          K ETK+EFTGS RPE NRRAAEP+GPQSDDGSTDTVVKNPTCDTS T KKK TKRVD+SSELADSEA+SNGG TATRESSEVQS
Subjt:  STGSKS---------AKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQS

Query:  SASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLL
        SASLTGRMKR+R LRKEISGGSSGN+PRRT AVKSRRFDEVLQMIR+HKHGSLFESRLQSQETEEYK M+RQHLDLETVQAK+NSGSY  SSL+FYRDLL
Subjt:  SASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLL

Query:  LLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGEN
        LLCNNAVTFFP SSKESVAA E RLLVS EM KS+ VAR  PSP    S+P NP SQSKGPDLEGSQ L AK+KS VPI+VCRKRSKISSKLSS+GLGE 
Subjt:  LLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANP-SQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGEN

Query:  GDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKR
         DR+NDDEK A+DLKSS+KMAS  VED GT KD SSK+KE+ ITGARSMRRSNDSATNSSGPSIKKQTTNSGWK +S NETET  PDKKKSETVALEKKR
Subjt:  GDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKD-SSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKR

Query:  SAADFLKRIKQNSPAETIIKRNGRGGSSNIGN-----AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEA--DPVPISTK
        SAADFLKRIKQNSPAET  KRNGRGGSSN  N     AA GE KKGSGK+  GKE+ASSIRQSN+KKR K+D SPSKRSVGRPPKKA EA  DP P  TK
Subjt:  SAADFLKRIKQNSPAETIIKRNGRGGSSNIGN-----AATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEA--DPVPISTK

Query:  RAREGGGKEPLKRARKRPRR
        RAREGGGKEPLKRARKR RR
Subjt:  RAREGGGKEPLKRARKRPRR

SwissProt top hitse value%identityAlignment
A2BIL7 Tyrosine-protein kinase BAZ1B2.1e-0426.88Show/hide
Query:  KKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETE
        K K  K+    +  + S+  +N  +T+ + S++ +S A+ +  +    L+R+      S   P R   V+ ++ +E+LQ I   +H   F   + ++E E
Subjt:  KKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETE

Query:  EYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTFFPKSSKESVAACEFR
        +Y+ ++   +DL T+Q K  S  Y S+S  F  D+ L+ +NA  +   SS  +V  C  R
Subjt:  EYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTFFPKSSKESVAACEFR

Arabidopsis top hitse value%identityAlignment
AT1G61215.1 bromodomain 41.3e-1724.67Show/hide
Query:  WGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKLRVAELRREVQRY
        WGTWEELLL  AV RHG  DW  VA E+++  SLP +  T   C+ K+ DL++R+   +                     W +EL+K RVAEL+  + + 
Subjt:  WGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKLRVAELRREVQRY

Query:  DVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSNSTGSKSAKF--ETKA
        + SI SL+ K++ L+ E     ND            E  +   ++ +    EP  +S   G          +D+ ++  SV         +  +  E K+
Subjt:  DVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSNSTGSKSAKF--ETKA

Query:  EFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEISGGS
        E       E+ +        +S  G    V+ +         KK+G ++  D S     E       +A  ES    +SA +        + R + +  +
Subjt:  EFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEISGGS

Query:  SGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTFFPKSSKESVAACE
        S +  R  S    +   ++   I  ++   +F  RL SQ+   YK +VR+H+DL+TVQ+++N  S SS+    +RD LL+ NNA  F+ K+++E  +A  
Subjt:  SGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTFFPKSSKESVAACE

Query:  FRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPII---VCRKRSKISSKLSSTGLGENGDRSN-DDEKSAIDLK-SSI
         R +V+  ++  L            D  P   S + G     ++ ++  QKS+ P +   +  K+ +  +    T + +    S+  +++S  DL  S++
Subjt:  FRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPII---VCRKRSKISSKLSSTGLGENGDRSN-DDEKSAIDLK-SSI

Query:  KMASNTVEDQGTTKDSSKVK---EKPITGARSMRR
        K ++   +     KD  +     E P    R   R
Subjt:  KMASNTVEDQGTTKDSSKVK---EKPITGARSMRR

AT2G42150.1 DNA-binding bromodomain-containing protein3.6e-5234.28Show/hide
Query:  TWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLT-TARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKLRVAELRREVQ
        TW TWEELLLACAV RHG + WNSV+ E+Q  S  P+L + TA  C  K+ DLK RFT    +  +  + A I     +  PW++ELRKLRV ELRREV+
Subjt:  TWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLT-TARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKLRVAELRREVQ

Query:  RYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPP-GIPGEDSDREDLSVNQSNSTGSKSAKFETK
        +YD+SI++LQ KVK+LEEERE           KPD +TE+ +   + ++   GEP         V  PP  +  E    +   +   N+        E +
Subjt:  RYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPP-GIPGEDSDREDLSVNQSNSTGSKSAKFETK

Query:  AEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSS--ELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEIS
         E  GS   E     ++ AG  S  GS ++V K PT +         ++RV+  S  EL +SE  ++ GE  T   S+VQSSASL    K      KE  
Subjt:  AEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSS--ELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEIS

Query:  GGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTFFPKSSKESVA
          +S  D      V+S+     ++++ +H  GS F  RL+ QET EY  ++R+H+D E ++ +V  G Y S  ++F+RDLLLL NNA  F+ + S E   
Subjt:  GGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTFFPKSSKESVA

Query:  ACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVP-IIVCRKRSKISSK-LSSTGLGENGDRSNDDEKSAIDLKSSI
        A +   LV  +M  +L+    G S    D    +P + +   +  S+ + +K + SVP I+ CRKRS +++K L     G +      D     D K   
Subjt:  ACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVP-IIVCRKRSKISSK-LSSTGLGENGDRSNDDEKSAIDLKSSI

Query:  KMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKK-KSETVALEKKRSAADFLKRIKQNSPAETI
         ++    E  G   D S +        + M R   S+T       K    N   + SS N  ++ +  KK   E     KK+ AA FL+R+K  S  +T 
Subjt:  KMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKK-KSETVALEKKRSAADFLKRIKQNSPAETI

Query:  IKRNGRGGSSNIGNAATGELKK-GSGKSDKGKERASSIRQSNDKKRPKDDASPSKRS-VGRPPKK--AAEADPVPISTKRAREGGGKEPLK----RARKR
        +KR+    SS  G     E +K  S K+D  K     IRQ+N K      ASP KRS  GR  ++  A  +   PI  KR+R+ G KE       R +KR
Subjt:  IKRNGRGGSSNIGNAATGELKK-GSGKSDKGKERASSIRQSNDKKRPKDDASPSKRS-VGRPPKK--AAEADPVPISTKRAREGGGKEPLK----RARKR

Query:  PRR
         RR
Subjt:  PRR

AT2G44430.1 DNA-binding bromodomain-containing protein3.0e-9139.27Show/hide
Query:  EDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLN--HNGAGIADKVDTAVPWVDEL
        ED     +     WGTWEELLLACAVKRHGF DW+SVA EV++RSSL HLL +A +C  K+ DLKRRF   +   V           ++V   +PW+++L
Subjt:  EDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLN--HNGAGIADKVDTAVPWVDEL

Query:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN
        R LRVAELRREV+RYD SI SLQLKVK+LEEER       E G  KPDL+ E +E RSEND     E +HR               E+SDRE+ S+N+SN
Subjt:  RKLRVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSN

Query:  STGSKSAKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMK
        ST +                 E+     EP+  + DD   D   KNP     D + K  T   ++   +      S G E +   S E+  S +   + K
Subjt:  STGSKSAKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMK

Query:  RKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTF
        RKR  RK+   G       R++  KS+    +L +IR+H  GSLFE RL+SQE ++YK MV+QHLD+ET+Q K+  GSY SSSL FYRDL LL  NA+ F
Subjt:  RKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTF

Query:  FPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGENGDRSNDDEKS
        FP SS ES+AA E R +VS EM+K      TG +   +    A+  +S   D E S   L++QKSS P++VC+KR  +S+K S +               
Subjt:  FPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGENGDRSNDDEKS

Query:  AIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSA-----TNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFL
             SS     +T E+       S+ K+   TG RS RR+N  A        +G    KQ        SS + +   +  K + +TV+ +KK+S ADFL
Subjt:  AIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSA-----TNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFL

Query:  KRIKQNSPAETIIKRNGRGGSSNIGNAATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGG-----GK
        KR+K+NSP +    +N  GG+          +KK S K+   + R+SS+     KK+ + + +P KR+ GRP KK AEA     S KR R+ G      K
Subjt:  KRIKQNSPAETIIKRNGRGGSSNIGNAATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGG-----GK

Query:  EPLKRARK
        +P KR RK
Subjt:  EPLKRARK

AT3G57980.1 DNA-binding bromodomain-containing protein3.8e-4631.05Show/hide
Query:  EELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTA-----VPWVDELRKLRVAELRREVQR
        EELLLACAV RHG   W+SVA EV  ++S    L TA +C  K+ DLKRRF+        N    G AD+   A     VPW++ELRKLRV ELRREV+R
Subjt:  EELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTA-----VPWVDELRKLRVAELRREVQR

Query:  YDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSNSTGSKSAKFETKAE
        YD+SI+SLQLKVK LE+ERE+ L    +   +     E+      N   P  E  +   PN                        NS G+ S       +
Subjt:  YDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSNSTGSKSAKFETKAE

Query:  FTGSYRPEQNRRAAE-----PAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEI
               E NR   E     PA   S  GS ++V K      SD  + K  +  +DS EL +S  +S  GE  T+E+S+ QSSAS      RK  + ++ 
Subjt:  FTGSYRPEQNRRAAE-----PAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEI

Query:  SGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTFFPKSSKESV
              +       V+S+   + ++++++H  GS F  RL++QET +Y  ++RQH+D E ++++V  G Y ++   F+RDLLLL NN   F+ + S E  
Subjt:  SGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTFFPKSSKESV

Query:  AACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGENGDRSNDDEKSAIDLKSSIK
        AA +   L+  +M  S ++ +    P   D+   +  + K   L        K   SVPII CRKRS ++ +  ++          +  K    +  ++ 
Subjt:  AACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGENGDRSNDDEKSAIDLKSSIK

Query:  MASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSG-----PSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRIKQNSPA
            + E++G   D     EKPI   +  R +  S     G      S+    +N G  P+ ++  +     KK   T    KK+SAA FLKR+K  S +
Subjt:  MASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSG-----PSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRIKQNSPA

Query:  ETIIKRNGRGGSSNIGNAATGELKKGSGKSDK--GKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGGGKE--PLKRARKRP
        ET+++   +  SSN    A  E +K + KS+K    +  +  ++   K+   +  SP+K++ G   K+   +    +  KR  E   KE     R +KR 
Subjt:  ETIIKRNGRGGSSNIGNAATGELKKGSGKSDK--GKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGGGKE--PLKRARKRP

Query:  RR
        +R
Subjt:  RR

AT3G60110.1 DNA-binding bromodomain-containing protein7.2e-6936.22Show/hide
Query:  DKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKL
        D      Q    WGTWEEL+L CAVKRH F DW+SVA EVQARS    L+ +A NC LK+ DLKRRF    +    N   A   +     + W+++LR L
Subjt:  DKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKL

Query:  RVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGE--DSDRED-LSVNQSN
         +AELRREVQR D SI SLQLKVK+LEEE+     D + G  KPDLK         ND+                 KP  +  E  +SDR+D  S+N+SN
Subjt:  RVAELRREVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGE--DSDRED-LSVNQSN

Query:  STGSKSAKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMK
        ST S   K        G    + N  +  P                      D + K  T   ++ +    SE  SN GE             S T    
Subjt:  STGSKSAKFETKAEFTGSYRPEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMK

Query:  RKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTF
         KR  +K  SGG  G    +++  KS+   +++++IR+H  GS+FESRL+SQ+T++YK ++RQHLD++T++ K+  GSY SSSLSFYRDL LL  NA+ F
Subjt:  RKRLLRKEISGGSSGNDPRRTSAVKSRRFDEVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTF

Query:  FPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSV-PIIVCRKRSKISSKLSSTGLGENGDRSNDDEK
        FP SS ES+AA E R LVSNEMKK     RTG     V  + A  S             +++QKSSV  ++ C+K+S    K S +    +  R  D++K
Subjt:  FPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDSAPANPSQSKGPDLEGSQCLLAKQKSSV-PIIVCRKRSKISSKLSSTGLGENGDRSNDDEK

Query:  SAIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRIK
        S    +  I   + T     T+  SS+   K I        + D+ T  +  +IKKQ T++  + S  ++ E     K + +TVA +KK+S ADFLKRIK
Subjt:  SAIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSGPSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRIK

Query:  QNSP--AETIIKRNGRGGSSNIGNAATGELKK-GSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGG--GKEPLK
        +NSP   +    +N +    N+      + K  G+ K +  K +   +R S  KK+ + + + SK S  R  K+  E   V  + KR RE G   K+P K
Subjt:  QNSP--AETIIKRNGRGGSSNIGNAATGELKK-GSGKSDKGKERASSIRQSNDKKRPKDDASPSKRSVGRPPKKAAEADPVPISTKRAREGG--GKEPLK

Query:  RARK
        R+R+
Subjt:  RARK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAGATAAAGAGCGGGAGAGTGAACAGAGCATCGCCACATGGGGGACATGGGAGGAGCTTTTACTGGCCTGTGCAGTCAAGCGCCATGGATTTAAGGACTGGAA
TTCCGTCGCCATGGAAGTTCAGGCTCGTTCTTCTCTCCCTCATCTCCTAACCACCGCCCGCAACTGCGAACTCAAATTCCTTGACCTTAAACGACGGTTCACTTCCCTCC
AAAACGACGCCGTTTTGAACCACAACGGCGCTGGAATTGCCGATAAGGTTGATACCGCCGTACCTTGGGTCGATGAGTTGAGGAAACTCCGGGTTGCTGAACTCCGACGA
GAGGTCCAGCGCTATGACGTTTCAATCAACTCGTTGCAGTTGAAGGTCAAGAGATTGGAGGAGGAGCGAGAACAGGGCCTGAACGACAGCGAGGCCGGCACTGGAAAACC
AGATCTAAAAACGGAGTCCAGAGAACGACGATCAGAAAACGACAAAAAACCCTTCGGAGAACCAGACCACCGGTCCGGACCGAACGGAACAGTCGCGAAGCCGCCGGGGA
TTCCCGGAGAAGATTCCGACCGGGAGGATTTATCCGTCAACCAATCCAATTCCACCGGTTCCAAAAGCGCCAAATTCGAAACCAAAGCGGAATTTACCGGTTCATACCGA
CCAGAACAGAACCGAAGAGCCGCCGAACCAGCCGGACCGCAGTCGGATGACGGCAGTACGGACACTGTGGTTAAGAACCCGACGTGCGACACGTCTGATACGATAAAGAA
AAAGGGGACGAAGCGAGTCGATGACTCGTCTGAGTTGGCGGACTCGGAGGCGCAGTCAAACGGCGGAGAGACGGCGACGAGGGAGAGCAGTGAAGTACAGAGCTCGGCGA
GTTTGACAGGGAGGATGAAGAGGAAGAGGCTCCTCAGAAAGGAGATCTCCGGCGGTAGCAGCGGCAATGATCCTCGCCGGACGTCCGCCGTCAAGTCACGGCGGTTCGAC
GAGGTTTTGCAGATGATACGAGCGCATAAACATGGCTCCTTGTTCGAATCCCGTCTTCAAAGTCAGGAGACGGAAGAATACAAGGTCATGGTTCGGCAGCATTTAGATCT
AGAAACAGTTCAGGCCAAGGTCAATTCTGGTTCTTATTCTTCCTCCAGCCTCTCATTTTACCGCGATCTCTTACTCCTTTGCAACAATGCGGTTACCTTCTTCCCCAAAT
CCTCCAAGGAGTCGGTAGCTGCTTGTGAGTTCCGTCTTCTTGTCTCCAATGAAATGAAGAAGAGTCTGCAGGTTGCTCGAACTGGTCCTTCACCTGAAGTAGTGGATTCA
GCCCCTGCAAATCCTTCCCAATCCAAGGGTCCTGATCTTGAAGGATCACAGTGCTTGCTTGCCAAGCAGAAGTCCTCTGTTCCTATAATAGTTTGTCGGAAACGGAGCAA
AATTTCATCTAAGCTTTCATCAACTGGACTTGGTGAAAACGGTGATCGAAGCAATGATGATGAGAAGTCAGCCATTGATCTAAAATCAAGTATTAAAATGGCTTCTAACA
CTGTTGAAGACCAGGGCACTACAAAAGATAGCAGCAAGGTTAAAGAGAAACCTATTACTGGAGCCAGGAGCATGCGAAGGAGCAACGACAGTGCCACAAACTCTAGTGGT
CCTAGCATCAAGAAACAGACTACGAATTCTGGATGGAAACCCAGTTCGACCAATGAAACAGAGACACCAAATCCAGATAAGAAGAAATCCGAGACAGTTGCCCTAGAAAA
GAAGCGCAGTGCAGCAGATTTCTTGAAGAGGATCAAGCAGAACTCACCTGCAGAGACTATTATAAAGAGAAATGGCAGAGGTGGAAGTAGTAATATTGGTAATGCAGCTA
CTGGAGAACTAAAGAAGGGCAGTGGCAAAAGTGATAAAGGAAAGGAAAGAGCATCATCAATAAGGCAAAGCAATGACAAGAAGCGTCCCAAGGATGATGCCAGCCCTTCC
AAGCGCAGTGTCGGGCGACCACCCAAGAAGGCAGCGGAGGCCGATCCTGTTCCCATTTCAACAAAAAGAGCCAGGGAAGGTGGTGGAAAAGAACCTCTAAAGCGGGCAAG
AAAAAGGCCTAGAAGGTGA
mRNA sequenceShow/hide mRNA sequence
AGAAAACTTACCAAAAAAACCAAACAAAAACGGGAGATTTTTGGTTTTCTCATTTTGCTTCGTTTCTCTCAAATGTTCAAAAAAGAAAAAAAAAAAAAAAGAAACCGAAT
TAATTATTTTTAAACGAGTTTCGGATTTTCGATTTTATTTTATTTTTTTCTCCTTTTGTTCCTTTCACGCCGCAGAGAAGAAGAATACGACGCAGAAGAAACCCTAATAA
CAGACTCTCTAATGGAGGAACTTTCCCTTTTTGTTTCTCTTTCTCTGTGATTTTTTGAAAGATTAAAAGAGGGAAAACAGAATTGGAGATATGAGAAAGACGACCGGCTA
ATCGGGAATCTCTATGCTGTTTTTCTTCTTGTTGTAATCTAATGGCGGAAGATAAAGAGCGGGAGAGTGAACAGAGCATCGCCACATGGGGGACATGGGAGGAGCTTTTA
CTGGCCTGTGCAGTCAAGCGCCATGGATTTAAGGACTGGAATTCCGTCGCCATGGAAGTTCAGGCTCGTTCTTCTCTCCCTCATCTCCTAACCACCGCCCGCAACTGCGA
ACTCAAATTCCTTGACCTTAAACGACGGTTCACTTCCCTCCAAAACGACGCCGTTTTGAACCACAACGGCGCTGGAATTGCCGATAAGGTTGATACCGCCGTACCTTGGG
TCGATGAGTTGAGGAAACTCCGGGTTGCTGAACTCCGACGAGAGGTCCAGCGCTATGACGTTTCAATCAACTCGTTGCAGTTGAAGGTCAAGAGATTGGAGGAGGAGCGA
GAACAGGGCCTGAACGACAGCGAGGCCGGCACTGGAAAACCAGATCTAAAAACGGAGTCCAGAGAACGACGATCAGAAAACGACAAAAAACCCTTCGGAGAACCAGACCA
CCGGTCCGGACCGAACGGAACAGTCGCGAAGCCGCCGGGGATTCCCGGAGAAGATTCCGACCGGGAGGATTTATCCGTCAACCAATCCAATTCCACCGGTTCCAAAAGCG
CCAAATTCGAAACCAAAGCGGAATTTACCGGTTCATACCGACCAGAACAGAACCGAAGAGCCGCCGAACCAGCCGGACCGCAGTCGGATGACGGCAGTACGGACACTGTG
GTTAAGAACCCGACGTGCGACACGTCTGATACGATAAAGAAAAAGGGGACGAAGCGAGTCGATGACTCGTCTGAGTTGGCGGACTCGGAGGCGCAGTCAAACGGCGGAGA
GACGGCGACGAGGGAGAGCAGTGAAGTACAGAGCTCGGCGAGTTTGACAGGGAGGATGAAGAGGAAGAGGCTCCTCAGAAAGGAGATCTCCGGCGGTAGCAGCGGCAATG
ATCCTCGCCGGACGTCCGCCGTCAAGTCACGGCGGTTCGACGAGGTTTTGCAGATGATACGAGCGCATAAACATGGCTCCTTGTTCGAATCCCGTCTTCAAAGTCAGGAG
ACGGAAGAATACAAGGTCATGGTTCGGCAGCATTTAGATCTAGAAACAGTTCAGGCCAAGGTCAATTCTGGTTCTTATTCTTCCTCCAGCCTCTCATTTTACCGCGATCT
CTTACTCCTTTGCAACAATGCGGTTACCTTCTTCCCCAAATCCTCCAAGGAGTCGGTAGCTGCTTGTGAGTTCCGTCTTCTTGTCTCCAATGAAATGAAGAAGAGTCTGC
AGGTTGCTCGAACTGGTCCTTCACCTGAAGTAGTGGATTCAGCCCCTGCAAATCCTTCCCAATCCAAGGGTCCTGATCTTGAAGGATCACAGTGCTTGCTTGCCAAGCAG
AAGTCCTCTGTTCCTATAATAGTTTGTCGGAAACGGAGCAAAATTTCATCTAAGCTTTCATCAACTGGACTTGGTGAAAACGGTGATCGAAGCAATGATGATGAGAAGTC
AGCCATTGATCTAAAATCAAGTATTAAAATGGCTTCTAACACTGTTGAAGACCAGGGCACTACAAAAGATAGCAGCAAGGTTAAAGAGAAACCTATTACTGGAGCCAGGA
GCATGCGAAGGAGCAACGACAGTGCCACAAACTCTAGTGGTCCTAGCATCAAGAAACAGACTACGAATTCTGGATGGAAACCCAGTTCGACCAATGAAACAGAGACACCA
AATCCAGATAAGAAGAAATCCGAGACAGTTGCCCTAGAAAAGAAGCGCAGTGCAGCAGATTTCTTGAAGAGGATCAAGCAGAACTCACCTGCAGAGACTATTATAAAGAG
AAATGGCAGAGGTGGAAGTAGTAATATTGGTAATGCAGCTACTGGAGAACTAAAGAAGGGCAGTGGCAAAAGTGATAAAGGAAAGGAAAGAGCATCATCAATAAGGCAAA
GCAATGACAAGAAGCGTCCCAAGGATGATGCCAGCCCTTCCAAGCGCAGTGTCGGGCGACCACCCAAGAAGGCAGCGGAGGCCGATCCTGTTCCCATTTCAACAAAAAGA
GCCAGGGAAGGTGGTGGAAAAGAACCTCTAAAGCGGGCAAGAAAAAGGCCTAGAAGGTGACTGTCTCAGTCGCCCGTAGATGATTGTTTGTATATTAATAGACGTAGCCT
GCAGTGTATTAATCAACCTGTATTGCATTTAATTATTTTTTTACTTCTAATTTGTAACACGAAAAAGTTGGGCACTATTTTCTTTTGTTTCGTACCAAAACTAGTTAACC
AGATCACCACCTTTCTAAGGAACTGCAAATTCAATATCA
Protein sequenceShow/hide protein sequence
MAEDKERESEQSIATWGTWEELLLACAVKRHGFKDWNSVAMEVQARSSLPHLLTTARNCELKFLDLKRRFTSLQNDAVLNHNGAGIADKVDTAVPWVDELRKLRVAELRR
EVQRYDVSINSLQLKVKRLEEEREQGLNDSEAGTGKPDLKTESRERRSENDKKPFGEPDHRSGPNGTVAKPPGIPGEDSDREDLSVNQSNSTGSKSAKFETKAEFTGSYR
PEQNRRAAEPAGPQSDDGSTDTVVKNPTCDTSDTIKKKGTKRVDDSSELADSEAQSNGGETATRESSEVQSSASLTGRMKRKRLLRKEISGGSSGNDPRRTSAVKSRRFD
EVLQMIRAHKHGSLFESRLQSQETEEYKVMVRQHLDLETVQAKVNSGSYSSSSLSFYRDLLLLCNNAVTFFPKSSKESVAACEFRLLVSNEMKKSLQVARTGPSPEVVDS
APANPSQSKGPDLEGSQCLLAKQKSSVPIIVCRKRSKISSKLSSTGLGENGDRSNDDEKSAIDLKSSIKMASNTVEDQGTTKDSSKVKEKPITGARSMRRSNDSATNSSG
PSIKKQTTNSGWKPSSTNETETPNPDKKKSETVALEKKRSAADFLKRIKQNSPAETIIKRNGRGGSSNIGNAATGELKKGSGKSDKGKERASSIRQSNDKKRPKDDASPS
KRSVGRPPKKAAEADPVPISTKRAREGGGKEPLKRARKRPRR