; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016340 (gene) of Snake gourd v1 genome

Gene IDTan0016340
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBromodomain domain-containing protein
Genome locationLG01:116504189..116508907
RNA-Seq ExpressionTan0016340
SyntenyTan0016340
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001487 - Bromodomain
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608638.1 Bromodomain and PHD finger-containing protein 3, partial [Cucurbita argyrosperma subsp. sororia]1.2e-12385.52Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKS-PSLG-NPIIN-
        MDFGTVRAKLDGGAYANLEQFEEDI LICSNAMKYNASDTVFFRQARSIQELAK+DFENLR+ESSDESE EQKVVRRGRPPGKSQK+    LG NP+ + 
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKS-PSLG-NPIIN-

Query:  NGAEFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST-SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTY-NHSLSCG
         GAE CSGAT  SG DDS NVNGYNLRRARSSFRPLP DP VRTS  +QHGETLASWLPEWKNEFPASVLK VLKSGKNDNMAVDENRR TY NHSLSCG
Subjt:  NGAEFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST-SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTY-NHSLSCG

Query:  NWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLK-QQRMSPVDGSSSNTKTVAQS
        N  SVFGNLDGDLKQLITVGLHAEHGY RSLALF ADLGPVVWKIA KKIES SRELGRVLIQEIEMLK QQRM P DG S++TKT A+S
Subjt:  NWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLK-QQRMSPVDGSSSNTKTVAQS

XP_004145600.1 uncharacterized protein LOC101217603 [Cucumis sativus]2.6e-20876.08Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA
        MDFGTVR KLD GAYANLEQFEEDI LICSNAMKYNASDTVFFRQARSIQELAKKDFENLR+ESSDESEPEQKVVRRGRPPGKS KKS  +GN I +NGA
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA

Query:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST--SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNWP
        EFCSGATL SGCDDS NVNGYNLRRARS+FRPLPADPL RTST  +QHGETLASWLPEWK EFPASVLKGVLKSGKNDNMAV+ENRRDTYN S SCGNWP
Subjt:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST--SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNWP

Query:  SVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSNI
        SVFG+LDGDLKQLITVGLHAEHGYARSLALFAADLGP VW IALKKI+ ISRELGRVLIQEIEML+Q  + P+DG SS+ KTVA+S+          +NI
Subjt:  SVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSNI

Query:  GISNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLPA
        G+SNNFLK GEDA  EIDR RN++S T+LLDRSRG+IGSTTCIPNEQ  + PSNIH TNG+  PHF QEM+MVRLD+I+GG SCS+ S+ P         
Subjt:  GISNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLPA

Query:  LNNASFQNPAGAGDMDLLSQPEMLKLA-EDGSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQILS
        LNNASFQ P+ + + DLL+Q  M KLA ED S+SH+  HSP R  FQ++++ QQ++   EK  WQELST PVLDSI F+PDLNFGLGLSAAP+S LQILS
Subjt:  LNNASFQNPAGAGDMDLLSQPEMLKLA-EDGSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQILS

Query:  QIQPDLVLQL
        QIQPDLVLQL
Subjt:  QIQPDLVLQL

XP_008452972.1 PREDICTED: uncharacterized protein LOC103493819 [Cucumis melo]1.1e-21979.06Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA
        MDFGTVR KLDGGAYANLEQFEEDI LICSNAMKYNASDTVFFRQARSIQELAKKDFENLR+ESSDESEPEQKVVRRGRPPGKS KKS  L NPI +NGA
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA

Query:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST---SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNW
        EFCSGATL SGCDDS NVNGY+LRRARS+FRPLPADPL RTS+   +QHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAV+ENRRDTYN S+SCGNW
Subjt:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST---SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNW

Query:  PSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSN
        PSVFG+LDGDLKQLITVGLHAEHGYARSLALFAADLGP VW IALKKIESISRELGRVLI EIEML+Q R+ P+DG SS+ KTVA+S+ +LPC SISGSN
Subjt:  PSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSN

Query:  IGISNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLP
        IG+SNNFLK GEDA  EIDR RN +S+T+LLDRSRGV GSTTCIPNEQ  + PSNIH TNG+  PHF QEMRMVRLD+I+GG S SD        + T P
Subjt:  IGISNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLP

Query:  ALNNASFQNPAGAGDMDLLSQPEMLKLAED-GSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQIL
        +LNNASFQ P+ + + DLLSQ  M KLAE+  S+SH+L HSP RV  Q+S++ QQ++   EK  WQELST PVLDSITFN DLNFGLGLSAAPSS LQIL
Subjt:  ALNNASFQNPAGAGDMDLLSQPEMLKLAED-GSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQIL

Query:  SQIQPDLVLQL
        SQIQPDLVLQL
Subjt:  SQIQPDLVLQL

XP_022137702.1 uncharacterized protein LOC111009072 [Momordica charantia]1.2e-20576.47Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA
        MDFGTVRAKLDGGAYANLEQFEEDI LICSNAMKYN SDTVF+RQA++IQELAKKDFENLRQ+SSD+SEPEQKVVRRGRPPGKSQK+S SLGNPI + G 
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA

Query:  EFCS-GATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRT-STSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNWP
        EFC+   TL SGCDDSN+VNGYNLRR+RSSFRPL +DPLVRT ST+QHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSL  GNWP
Subjt:  EFCS-GATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRT-STSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNWP

Query:  SVFGNLDGDLKQLITVGLHAEHGYARSLALFAAD-LGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSS-SNTKTVAQSSGILPCGSISGS
        SVFG+ DGDLKQLITV     H    SL  F  + +   +WKIA KKIESISRELG VL QEIEML+Q RM P+DG S S+TKTVA+S+GILPC SISGS
Subjt:  SVFGNLDGDLKQLITVGLHAEHGYARSLALFAAD-LGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSS-SNTKTVAQSSGILPCGSISGS

Query:  NIGISNNFLKQGEDAEIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLPA
        N GIS+NFLK  ED EIDRER++QS+TILLDRSRG + STTCIPNE+KT+ PSNIH+   +F PHF  EMRMVRLD+I+GG SCSDDSSVP Q HCT PA
Subjt:  NIGISNNFLKQGEDAEIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLPA

Query:  LNNASFQNPAGAGDMDLLSQPEMLKLAEDGSKSHSLRHSPPRVNFQESIETQQ-EQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQILS
         N ASFQ P  +GDMDLLS  +M +L+EDGS S   +HSP RV  QE IE ++ E+GLGEK RWQELSTHPVLDS+TFNPDLNFGLG S APSS LQILS
Subjt:  LNNASFQNPAGAGDMDLLSQPEMLKLAEDGSKSHSLRHSPPRVNFQESIETQQ-EQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQILS

Query:  QIQPDLVLQL
        QIQPDLVLQL
Subjt:  QIQPDLVLQL

XP_038900278.1 uncharacterized protein LOC120087359 [Benincasa hispida]3.0e-22882.39Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA
        MDFGTVRAKLDGGAYANL+QFEEDI LICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKS K+S  LGNPI +NGA
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA

Query:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTSTSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNWPSV
        EFCSGAT  SGCDDS NVNGYNLRR+RS+FRPLPADPL RTST+QHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAV+ENRRDTYN ++SCGNWPSV
Subjt:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTSTSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNWPSV

Query:  FGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSNIGI
        FGN  GDLKQLITVGLHAEHGYARSLALFAADLGPVVW IAL+KIESISRELGRVLIQEIEM +Q RM P+DG SS  KTVA+S+ ILPC SISGSNIG+
Subjt:  FGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSNIGI

Query:  SNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNE---QKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLP
        SNN LK GEDA  EIDR RNS+S+T+LLDRSRGVIGSTTCIPNE   QK I PSNI  TNG+  PHF QEMRMVRLD+I+GG SCSD  SVPCQ H T P
Subjt:  SNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNE---QKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLP

Query:  ALNNASFQNPAGAGDMDLLSQPEMLKLAED-GSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQIL
        A+NNASFQ PAGAGDMDLL+Q  M KLAE+  S+SH+ RHS   V FQ+SI+TQQ++   EK R QELST PVLDSITFNPDLNFGLGLSAAPSS LQIL
Subjt:  ALNNASFQNPAGAGDMDLLSQPEMLKLAED-GSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQIL

Query:  SQIQPDLVLQL
        SQIQPDLVLQL
Subjt:  SQIQPDLVLQL

TrEMBL top hitse value%identityAlignment
A0A0A0L1D3 Bromo domain-containing protein1.3e-20876.08Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA
        MDFGTVR KLD GAYANLEQFEEDI LICSNAMKYNASDTVFFRQARSIQELAKKDFENLR+ESSDESEPEQKVVRRGRPPGKS KKS  +GN I +NGA
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA

Query:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST--SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNWP
        EFCSGATL SGCDDS NVNGYNLRRARS+FRPLPADPL RTST  +QHGETLASWLPEWK EFPASVLKGVLKSGKNDNMAV+ENRRDTYN S SCGNWP
Subjt:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST--SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNWP

Query:  SVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSNI
        SVFG+LDGDLKQLITVGLHAEHGYARSLALFAADLGP VW IALKKI+ ISRELGRVLIQEIEML+Q  + P+DG SS+ KTVA+S+          +NI
Subjt:  SVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSNI

Query:  GISNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLPA
        G+SNNFLK GEDA  EIDR RN++S T+LLDRSRG+IGSTTCIPNEQ  + PSNIH TNG+  PHF QEM+MVRLD+I+GG SCS+ S+ P         
Subjt:  GISNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLPA

Query:  LNNASFQNPAGAGDMDLLSQPEMLKLA-EDGSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQILS
        LNNASFQ P+ + + DLL+Q  M KLA ED S+SH+  HSP R  FQ++++ QQ++   EK  WQELST PVLDSI F+PDLNFGLGLSAAP+S LQILS
Subjt:  LNNASFQNPAGAGDMDLLSQPEMLKLA-EDGSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQILS

Query:  QIQPDLVLQL
        QIQPDLVLQL
Subjt:  QIQPDLVLQL

A0A1S3BVX0 uncharacterized protein LOC1034938195.5e-22079.06Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA
        MDFGTVR KLDGGAYANLEQFEEDI LICSNAMKYNASDTVFFRQARSIQELAKKDFENLR+ESSDESEPEQKVVRRGRPPGKS KKS  L NPI +NGA
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA

Query:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST---SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNW
        EFCSGATL SGCDDS NVNGY+LRRARS+FRPLPADPL RTS+   +QHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAV+ENRRDTYN S+SCGNW
Subjt:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST---SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNW

Query:  PSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSN
        PSVFG+LDGDLKQLITVGLHAEHGYARSLALFAADLGP VW IALKKIESISRELGRVLI EIEML+Q R+ P+DG SS+ KTVA+S+ +LPC SISGSN
Subjt:  PSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSN

Query:  IGISNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLP
        IG+SNNFLK GEDA  EIDR RN +S+T+LLDRSRGV GSTTCIPNEQ  + PSNIH TNG+  PHF QEMRMVRLD+I+GG S SD        + T P
Subjt:  IGISNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLP

Query:  ALNNASFQNPAGAGDMDLLSQPEMLKLAED-GSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQIL
        +LNNASFQ P+ + + DLLSQ  M KLAE+  S+SH+L HSP RV  Q+S++ QQ++   EK  WQELST PVLDSITFN DLNFGLGLSAAPSS LQIL
Subjt:  ALNNASFQNPAGAGDMDLLSQPEMLKLAED-GSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQIL

Query:  SQIQPDLVLQL
        SQIQPDLVLQL
Subjt:  SQIQPDLVLQL

A0A5A7VEA9 Bromodomain domain-containing protein5.5e-22079.06Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA
        MDFGTVR KLDGGAYANLEQFEEDI LICSNAMKYNASDTVFFRQARSIQELAKKDFENLR+ESSDESEPEQKVVRRGRPPGKS KKS  L NPI +NGA
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA

Query:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST---SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNW
        EFCSGATL SGCDDS NVNGY+LRRARS+FRPLPADPL RTS+   +QHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAV+ENRRDTYN S+SCGNW
Subjt:  EFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST---SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNW

Query:  PSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSN
        PSVFG+LDGDLKQLITVGLHAEHGYARSLALFAADLGP VW IALKKIESISRELGRVLI EIEML+Q R+ P+DG SS+ KTVA+S+ +LPC SISGSN
Subjt:  PSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSN

Query:  IGISNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLP
        IG+SNNFLK GEDA  EIDR RN +S+T+LLDRSRGV GSTTCIPNEQ  + PSNIH TNG+  PHF QEMRMVRLD+I+GG S SD        + T P
Subjt:  IGISNNFLKQGEDA--EIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLP

Query:  ALNNASFQNPAGAGDMDLLSQPEMLKLAED-GSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQIL
        +LNNASFQ P+ + + DLLSQ  M KLAE+  S+SH+L HSP RV  Q+S++ QQ++   EK  WQELST PVLDSITFN DLNFGLGLSAAPSS LQIL
Subjt:  ALNNASFQNPAGAGDMDLLSQPEMLKLAED-GSKSHSLRHSPPRVNFQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQIL

Query:  SQIQPDLVLQL
        SQIQPDLVLQL
Subjt:  SQIQPDLVLQL

A0A6J1C905 uncharacterized protein LOC1110090725.9e-20676.47Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA
        MDFGTVRAKLDGGAYANLEQFEEDI LICSNAMKYN SDTVF+RQA++IQELAKKDFENLRQ+SSD+SEPEQKVVRRGRPPGKSQK+S SLGNPI + G 
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGA

Query:  EFCS-GATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRT-STSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNWP
        EFC+   TL SGCDDSN+VNGYNLRR+RSSFRPL +DPLVRT ST+QHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSL  GNWP
Subjt:  EFCS-GATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRT-STSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNWP

Query:  SVFGNLDGDLKQLITVGLHAEHGYARSLALFAAD-LGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSS-SNTKTVAQSSGILPCGSISGS
        SVFG+ DGDLKQLITV     H    SL  F  + +   +WKIA KKIESISRELG VL QEIEML+Q RM P+DG S S+TKTVA+S+GILPC SISGS
Subjt:  SVFGNLDGDLKQLITVGLHAEHGYARSLALFAAD-LGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSS-SNTKTVAQSSGILPCGSISGS

Query:  NIGISNNFLKQGEDAEIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLPA
        N GIS+NFLK  ED EIDRER++QS+TILLDRSRG + STTCIPNE+KT+ PSNIH+   +F PHF  EMRMVRLD+I+GG SCSDDSSVP Q HCT PA
Subjt:  NIGISNNFLKQGEDAEIDRERNSQSDTILLDRSRGVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLPA

Query:  LNNASFQNPAGAGDMDLLSQPEMLKLAEDGSKSHSLRHSPPRVNFQESIETQQ-EQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQILS
         N ASFQ P  +GDMDLLS  +M +L+EDGS S   +HSP RV  QE IE ++ E+GLGEK RWQELSTHPVLDS+TFNPDLNFGLG S APSS LQILS
Subjt:  LNNASFQNPAGAGDMDLLSQPEMLKLAEDGSKSHSLRHSPPRVNFQESIETQQ-EQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQILS

Query:  QIQPDLVLQL
        QIQPDLVLQL
Subjt:  QIQPDLVLQL

A0A6J1FLH0 uncharacterized protein LOC111446319 isoform X16.3e-12385.22Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKS-PSLG-NPIIN-
        MDFGTVRAKLDGGAY+NLEQFEEDI LICSNAMKYNASDTVFFRQARSIQELAK+DFENLR+ESSDESE EQKVVRRGRPPGKSQK+    LG NP+ + 
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKS-PSLG-NPIIN-

Query:  NGAEFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST-SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTY-NHSLSCG
         GAE CSGAT  SG DDS NVNGYNLRRARSSFRPLP DP VRTS  +QHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRR TY NHSLSCG
Subjt:  NGAEFCSGATLPSGCDDSNNVNGYNLRRARSSFRPLPADPLVRTST-SQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTY-NHSLSCG

Query:  NWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLK-QQRMSPVD-GSSSNTKTVAQS
        N  SVFGNLDGDLKQLITVGLHAEHGY RSLALF ADLGPVVWKIA KKIES SRELGRVLIQEIEMLK QQRM P D G S++TKT A+S
Subjt:  NWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLK-QQRMSPVD-GSSSNTKTVAQS

SwissProt top hitse value%identityAlignment
B2KF05 Bromodomain and PHD finger-containing protein 36.5e-0843.94Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSD
        MDF T+R KL+   Y  LE+FEED  LI +N MKYNA DT+F R A  +++L      + R+++ +
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSD

G5E8P1 Bromodomain-containing protein 11.4e-0739.77Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQE-------LAKKDFENL-RQESSDESEPEQKVVRRGRP
        MDF T+R +L+   Y NL  FEED  LI  N MKYNA DTVF+R A  +++        A+++ E++  +E+S    PE+ +    RP
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQE-------LAKKDFENL-RQESSDESEPEQKVVRRGRP

O95696 Bromodomain-containing protein 15.5e-0738.64Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQE-------LAKKDFENL-RQESSDESEPEQKVVRRGRP
        MDF T+R +L+   Y NL +FEED  LI  N MKYNA DTVF+R A  +++        A+++ +++  +E+S    PE+      RP
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQE-------LAKKDFENL-RQESSDESEPEQKVVRRGRP

P55201 Peregrin1.2e-0649.02Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQE
        MDF T++  L+   Y N + FEED  LI SN +KYNA DT+F+R A  ++E
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQE

Q9ULD4 Bromodomain and PHD finger-containing protein 35.0e-0843.94Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSD
        MDF T+R KL+   Y  LE+FEED  LI +N MKYNA DT+F R A  +++L      + R+++ +
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSD

Arabidopsis top hitse value%identityAlignment
AT1G20670.1 DNA-binding bromodomain-containing protein3.0e-6151.17Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQ-------KVVRRGRPPGKSQKKSPSLGN
        MDF T+R KLD GAY+ LEQFE D+ LIC+NAM+YN++DTV++RQAR+IQELAKKDFENLRQ+S DE    Q       KV RRGRPP K  + S     
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQ-------KVVRRGRPPGKSQKKSPSLGN

Query:  PIINNGAEFCSGATLPSGCDDSNNVNG-YNLRRARSSFRPLPADPLVRTSTSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSL
         I    +E  + A +P   D SN  +G YNLR+A  S++   A+  VR   + + ET + W  +W++EFP+SV+K V K G   +  VD+NRRDTYNH  
Subjt:  PIINNGAEFCSGATLPSGCDDSNNVNG-YNLRRARSSFRPLPADPLVRTSTSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSL

Query:  SCGNWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESI
        +    PSV   L+ +LKQLI VGL+ E+GYA+SLA +AA+LGPV WKIA ++IE++
Subjt:  SCGNWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESI

AT1G76380.1 DNA-binding bromodomain-containing protein7.5e-5242.07Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDES----EPEQKVVRRGRPPGKSQKKSPSLGNPII
        MDF T+R KL+ GAY  LEQFE+D+ LIC+NAM+YN++DTV++RQAR++ ELAKKDF NLRQES  E       + KVV+RGRPPG   KK   L   +I
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDES----EPEQKVVRRGRPPGKSQKKSPSLGNPII

Query:  N-NGAEFCSGATLPSGCDDSNNVNG-YNLRRARSSFRPLPADPLVRTSTSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSC
        +   ++  + A   +   DS+ ++G YNLR+   S+    A+  VR   + + E  +  L +W+ EFP SV+K V K G  +   VDENRRDTYN + + 
Subjt:  N-NGAEFCSGATLPSGCDDSNNVNG-YNLRRARSSFRPLPADPLVRTSTSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSC

Query:  GNWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESI---SRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVA-----QSSG
            S+F  LD +LKQL  VGL AE+GYARSLA +AA++GPV W  A  +IE +     E G   + E      Q+ + + G    +   A     QSS 
Subjt:  GNWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESI---SRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVA-----QSSG

Query:  IL-PCGSISGSNIGISNNFLKQGEDAEI
        I+ P  S+S S IG  ++  +  E  ++
Subjt:  IL-PCGSISGSNIGISNNFLKQGEDAEI

AT1G76380.2 DNA-binding bromodomain-containing protein7.5e-5242.07Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDES----EPEQKVVRRGRPPGKSQKKSPSLGNPII
        MDF T+R KL+ GAY  LEQFE+D+ LIC+NAM+YN++DTV++RQAR++ ELAKKDF NLRQES  E       + KVV+RGRPPG   KK   L   +I
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDES----EPEQKVVRRGRPPGKSQKKSPSLGNPII

Query:  N-NGAEFCSGATLPSGCDDSNNVNG-YNLRRARSSFRPLPADPLVRTSTSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSC
        +   ++  + A   +   DS+ ++G YNLR+   S+    A+  VR   + + E  +  L +W+ EFP SV+K V K G  +   VDENRRDTYN + + 
Subjt:  N-NGAEFCSGATLPSGCDDSNNVNG-YNLRRARSSFRPLPADPLVRTSTSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSC

Query:  GNWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESI---SRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVA-----QSSG
            S+F  LD +LKQL  VGL AE+GYARSLA +AA++GPV W  A  +IE +     E G   + E      Q+ + + G    +   A     QSS 
Subjt:  GNWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESI---SRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVA-----QSSG

Query:  IL-PCGSISGSNIGISNNFLKQGEDAEI
        I+ P  S+S S IG  ++  +  E  ++
Subjt:  IL-PCGSISGSNIGISNNFLKQGEDAEI

AT1G76380.3 DNA-binding bromodomain-containing protein6.4e-5141.77Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDES----EPEQKVVRRGRPPGKSQKKSPSLGNPII
        MDF T+R KL+ GAY  LEQFE ++ LIC+NAM+YN++DTV++RQAR++ ELAKKDF NLRQES  E       + KVV+RGRPPG   KK   L   +I
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDES----EPEQKVVRRGRPPGKSQKKSPSLGNPII

Query:  N-NGAEFCSGATLPSGCDDSNNVNG-YNLRRARSSFRPLPADPLVRTSTSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSC
        +   ++  + A   +   DS+ ++G YNLR+   S+    A+  VR   + + E  +  L +W+ EFP SV+K V K G  +   VDENRRDTYN + + 
Subjt:  N-NGAEFCSGATLPSGCDDSNNVNG-YNLRRARSSFRPLPADPLVRTSTSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSC

Query:  GNWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESI---SRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVA-----QSSG
            S+F  LD +LKQL  VGL AE+GYARSLA +AA++GPV W  A  +IE +     E G   + E      Q+ + + G    +   A     QSS 
Subjt:  GNWPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIESI---SRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVA-----QSSG

Query:  IL-PCGSISGSNIGISNNFLKQGEDAEI
        I+ P  S+S S IG  ++  +  E  ++
Subjt:  IL-PCGSISGSNIGISNNFLKQGEDAEI

AT5G55040.1 DNA-binding bromodomain-containing protein4.4e-3639.27Show/hide
Query:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPG--KSQKKSPSLGNPIINN
        MDF TVR KL  G+Y+ LE+ E D+LLICSNAM+YN+SDTV+++QAR+IQE+ K+ FE  R +    +E E K   + +P    K Q + P   N +   
Subjt:  MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPG--KSQKKSPSLGNPIINN

Query:  GAEFCSGATLPSGCDDSNN-VNGYNLRRARSSFRPLPADPLVRTSTS-----QHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSL
        G++F SGA L SG    N  V+       + S+     D L   +TS     +  E L+S              KG+          V+E+RR TY  S 
Subjt:  GAEFCSGATLPSGCDDSNN-VNGYNLRRARSSFRPLPADPLVRTSTS-----QHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSL

Query:  SCGN-WPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIES---ISRELGRVLIQEIEML
          G+   S+F   + ++KQ + VGLHAEH Y RSLA FAA LGPV WKIA ++IE       + GR  + E E L
Subjt:  SCGN-WPSVFGNLDGDLKQLITVGLHAEHGYARSLALFAADLGPVVWKIALKKIES---ISRELGRVLIQEIEML


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTTGGGACTGTAAGGGCTAAATTAGATGGAGGAGCTTACGCAAATTTGGAACAGTTTGAGGAAGATATTCTTTTGATATGTTCAAATGCGATGAAGTATAATGC
TTCAGATACAGTTTTCTTCCGTCAGGCACGATCCATACAAGAACTTGCAAAGAAGGATTTTGAGAATTTGAGGCAAGAAAGCAGTGATGAAAGTGAACCAGAACAGAAAG
TTGTAAGGAGGGGTAGGCCACCAGGAAAGAGCCAGAAGAAATCTCCAAGTCTTGGTAACCCTATTATAAACAATGGGGCAGAGTTTTGCTCAGGTGCTACTCTTCCTTCT
GGATGTGATGATTCCAACAACGTCAATGGTTACAATTTGAGAAGAGCACGTTCTTCCTTTCGGCCCCTGCCTGCAGATCCTCTAGTCAGGACATCGACATCTCAACACGG
CGAAACTTTGGCCAGCTGGTTGCCTGAATGGAAAAATGAATTTCCAGCTTCAGTTTTGAAGGGTGTTCTTAAAAGTGGAAAGAATGATAATATGGCTGTGGATGAGAATA
GACGTGATACCTATAATCATTCATTGTCCTGCGGGAATTGGCCATCTGTCTTTGGCAATCTTGATGGAGACTTGAAGCAACTAATTACTGTAGGTTTGCATGCTGAGCAT
GGTTATGCAAGAAGCCTAGCTCTCTTTGCAGCTGATCTTGGTCCCGTAGTTTGGAAGATTGCTTTGAAGAAAATTGAAAGTATTAGTCGGGAGCTGGGGCGAGTATTGAT
ACAAGAAATTGAAATGCTGAAGCAGCAGCGCATGTCGCCTGTGGATGGAAGCTCATCCAACACAAAAACAGTAGCGCAGAGCTCAGGCATTCTTCCATGTGGAAGCATAT
CTGGCTCTAACATTGGTATTTCGAATAATTTTTTGAAACAAGGCGAAGATGCTGAAATTGACAGAGAGAGAAATTCCCAAAGCGACACAATATTGCTAGACAGAAGTAGA
GGCGTGATAGGATCTACAACGTGTATACCGAATGAGCAGAAGACGATAACTCCTTCGAACATTCACCAAACAAATGGTGATTTTCCCCCCCATTTTTTTCAAGAAATGAG
AATGGTAAGACTTGATGCCATTATTGGTGGAGCGTCTTGTTCAGACGACTCCTCAGTGCCTTGCCAAACGCATTGCACGTTGCCAGCTCTCAACAATGCTTCTTTCCAAA
ACCCTGCTGGTGCTGGTGATATGGATTTACTAAGCCAACCTGAAATGCTAAAACTTGCAGAAGATGGCTCTAAGTCACACTCTCTGCGGCATTCACCGCCCCGTGTCAAC
TTTCAGGAGTCAATCGAAACACAACAAGAGCAGGGGCTTGGTGAAAAAGAACGTTGGCAAGAACTATCAACTCATCCTGTGCTAGATTCAATCACCTTCAATCCCGACCT
TAATTTTGGATTAGGCTTATCTGCTGCACCCAGTTCTAAGCTGCAGATTCTTTCCCAAATTCAACCGGATTTAGTATTGCAGCTCTGA
mRNA sequenceShow/hide mRNA sequence
GCGACTCGCACGTGCCTAGAAATCCCTGCATCAGTGCATCGTCCGCCGTGAAAACTGAAATCAAACAACTCTCTGCGTTTGGTTCCTCCGGAAAAGGGAAGAACAATGGG
CGAGGTTTCCAAATCGACAATGAAGAAGCGAAAGAAAAAGGGTCGCCCTTCTCTTTTAGATCTCCAAAAGCGCTTCCTCAAACAGCAAAAGTTGCAGGAACAGCACCAAC
AACCGTCGAATGCTTTCGACTTTGCCTCGAACCCTAGGAGTCCATCATCTTGTCGCAATCGTAATATTCACCCGGCAACAGAGCGCGTCACCGGCGGTGACGAAGGCGAC
GATGACGACGAGCGCGTTGAGAAGAAGCACAAGCCTTTACTCGGTTTAACTTCCCGCCAAAACTATCCAACGTTGTCCGCTTATTCTTTACATAAATCCGAAGATTCCGA
GGCGGCCCTCAAACGTCGCAGAATCGGCGCCGCCCAGTTTGGATCTTGTGAAGTGAGTGAAAAAGCTTTGAAAGCGACAGACACTGCTCATGGTATCTAATTCTTAACTA
AGCTCAGTAGTAGTAGTTGTTGTTGTTGTTGAAGAAGGCGCTGTTGTGAACTTTATTTGCTGATACCATTTATACGTGGGTAGGGTCGGAGGTGGAGTCTGGTCCCACGA
CAACTTTGCCAGACAAAAAGTTGTTGATTTTCATCCTTGATAGACTTCAAAAGTGCGACTGTTCATTTGTTTTGTTATCGTTTTCCCTCATTATTGTTTGCCATTTAATA
GTCGTTTTGATTAGTGCTATCTGTTTTCTTCCAGAAAAGACACCCATGGAGTATTTTCCGAACCCGTGGATCCAAACGAGCTTCCCGACTACCATGTCATTATAGAGAAT
CCTATGGATTTTGGGACTGTAAGGGCTAAATTAGATGGAGGAGCTTACGCAAATTTGGAACAGTTTGAGGAAGATATTCTTTTGATATGTTCAAATGCGATGAAGTATAA
TGCTTCAGATACAGTTTTCTTCCGTCAGGCACGATCCATACAAGAACTTGCAAAGAAGGATTTTGAGAATTTGAGGCAAGAAAGCAGTGATGAAAGTGAACCAGAACAGA
AAGTTGTAAGGAGGGGTAGGCCACCAGGAAAGAGCCAGAAGAAATCTCCAAGTCTTGGTAACCCTATTATAAACAATGGGGCAGAGTTTTGCTCAGGTGCTACTCTTCCT
TCTGGATGTGATGATTCCAACAACGTCAATGGTTACAATTTGAGAAGAGCACGTTCTTCCTTTCGGCCCCTGCCTGCAGATCCTCTAGTCAGGACATCGACATCTCAACA
CGGCGAAACTTTGGCCAGCTGGTTGCCTGAATGGAAAAATGAATTTCCAGCTTCAGTTTTGAAGGGTGTTCTTAAAAGTGGAAAGAATGATAATATGGCTGTGGATGAGA
ATAGACGTGATACCTATAATCATTCATTGTCCTGCGGGAATTGGCCATCTGTCTTTGGCAATCTTGATGGAGACTTGAAGCAACTAATTACTGTAGGTTTGCATGCTGAG
CATGGTTATGCAAGAAGCCTAGCTCTCTTTGCAGCTGATCTTGGTCCCGTAGTTTGGAAGATTGCTTTGAAGAAAATTGAAAGTATTAGTCGGGAGCTGGGGCGAGTATT
GATACAAGAAATTGAAATGCTGAAGCAGCAGCGCATGTCGCCTGTGGATGGAAGCTCATCCAACACAAAAACAGTAGCGCAGAGCTCAGGCATTCTTCCATGTGGAAGCA
TATCTGGCTCTAACATTGGTATTTCGAATAATTTTTTGAAACAAGGCGAAGATGCTGAAATTGACAGAGAGAGAAATTCCCAAAGCGACACAATATTGCTAGACAGAAGT
AGAGGCGTGATAGGATCTACAACGTGTATACCGAATGAGCAGAAGACGATAACTCCTTCGAACATTCACCAAACAAATGGTGATTTTCCCCCCCATTTTTTTCAAGAAAT
GAGAATGGTAAGACTTGATGCCATTATTGGTGGAGCGTCTTGTTCAGACGACTCCTCAGTGCCTTGCCAAACGCATTGCACGTTGCCAGCTCTCAACAATGCTTCTTTCC
AAAACCCTGCTGGTGCTGGTGATATGGATTTACTAAGCCAACCTGAAATGCTAAAACTTGCAGAAGATGGCTCTAAGTCACACTCTCTGCGGCATTCACCGCCCCGTGTC
AACTTTCAGGAGTCAATCGAAACACAACAAGAGCAGGGGCTTGGTGAAAAAGAACGTTGGCAAGAACTATCAACTCATCCTGTGCTAGATTCAATCACCTTCAATCCCGA
CCTTAATTTTGGATTAGGCTTATCTGCTGCACCCAGTTCTAAGCTGCAGATTCTTTCCCAAATTCAACCGGATTTAGTATTGCAGCTCTGAAGACATGTTTGGGATTTAT
ACAATACAAGTAATGATGCAGCTCTGTACAGATAGTAACTTCAGTATATGTCTCATTTTCTCGATCTAGACCAGACCATAGGGACTAGTGCAATTATTGTCTGGATCGTC
CTCATTTATCTGTTTACGTGGACTAACCAAAACCAAAATCAATAAACAAGGAAATTTTTCATAACAATCATGTGTCAACTGTCTGCATTATTGCCTTCCCTAACCCATTG
ACTTTTTTGGCATCTGATGTACATGTAACGTATACTGTTCTTCATTTAAATTCTGATTGTAATGAACAGTTCATAATCTGATTTTTTGGGTTTTGTTTTTGGTTCACTAG
GTTGAGATCTTTTGGCCGTCCCATGTAATACTTGTACCAGACAGCCAAACATTGTTGAGAGCCAGACCCGCAATGGTTTGAGTAAGACAGCGAAGCTGAAAATGAATTGG
GGTTTTGCTGGTCTTTCTAAACACTTGCATCCTTGATTTGGGTCGAAATCCA
Protein sequenceShow/hide protein sequence
MDFGTVRAKLDGGAYANLEQFEEDILLICSNAMKYNASDTVFFRQARSIQELAKKDFENLRQESSDESEPEQKVVRRGRPPGKSQKKSPSLGNPIINNGAEFCSGATLPS
GCDDSNNVNGYNLRRARSSFRPLPADPLVRTSTSQHGETLASWLPEWKNEFPASVLKGVLKSGKNDNMAVDENRRDTYNHSLSCGNWPSVFGNLDGDLKQLITVGLHAEH
GYARSLALFAADLGPVVWKIALKKIESISRELGRVLIQEIEMLKQQRMSPVDGSSSNTKTVAQSSGILPCGSISGSNIGISNNFLKQGEDAEIDRERNSQSDTILLDRSR
GVIGSTTCIPNEQKTITPSNIHQTNGDFPPHFFQEMRMVRLDAIIGGASCSDDSSVPCQTHCTLPALNNASFQNPAGAGDMDLLSQPEMLKLAEDGSKSHSLRHSPPRVN
FQESIETQQEQGLGEKERWQELSTHPVLDSITFNPDLNFGLGLSAAPSSKLQILSQIQPDLVLQL