CuGenDBv2

Sed0007311 (gene) of Chayote v1 genome

Gene ID: Sed0007311
Organism: Sechium edule (Chayote v1)
Description: transcription factor bHLH123-like isoform X2
Genome location: LG13:13532768..13537081
RNA-Seq Expression: Sed0007311
Synteny: Sed0007311
Gene Ontology terms:
  GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
  GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
  GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
  IPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
  IPR036638 - Helix-loop-helix DNA-binding domain superfamily
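
The genome location field (LG13:13532768..13537081) packs a sequence identifier and two coordinates into one string. A minimal sketch of splitting it apart is shown here; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re

# Assumed layout of the location string on this page: <sequence id>:<start>..<end>
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a string such as 'LG13:13532768..13537081' into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG13:13532768..13537081")
print(seqid, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.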


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6600752.1 Transcription factor basic helix-loop-helix 123, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.3e-164 | % identity: 73.86
Query:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETGQH-----Q
        MAEEFQS+GNWWDA+RSRYEA   PSSS+ITTFVDH DS   A DPNLHIMGLGLDW+Q P FRGG EKA ESSFRSMLQPDNMNLNMQETGQH     Q
Subjt:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETGQH-----Q

Query:  HIQWMRSEKQYLGETPAAATEFKPINRGFSLD---QPQPQFSPSQY----SAVTSFSIDN-AGSATLYGNSATLLQGLIGG----EQQGSGVGMNFPYSG
         IQWMRSEK Y GE+P  ATEFKPINRGFSLD   QP PQFSPS Y    S +TSF +D  A +A LYGNS TLLQGL+ G    +QQ SG GMNFPY+ 
Subjt:  HIQWMRSEKQYLGETPAAATEFKPINRGFSLD---QPQPQFSPSQY----SAVTSFSIDN-AGSATLYGNSATLLQGLIGG----EQQGSGVGMNFPYSG

Query:  HYGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHNQQ
        H+G+NS ELMAA SWS SKVP FLRNSPPKGAAP P  S LQFSNNT FWNASD+KD R SY P  YNAA  A    ++ SE       GKKSGND NQQ
Subjt:  HYGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHNQQ

Query:  T---GAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS--GAVVQQQHQQQSCEKTKDGE
            G     KRPRNE    LPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS  GAVVQQQ QQ+ CEK K+GE
Subjt:  T---GAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS--GAVVQQQHQQQSCEKTKDGE

Query:  GGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
        G ++DLRSRGLCLVPVSSTFPVTHETTVDFWTP+FGGTFR
Subjt:  GGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

XP_023546795.1 transcription factor bHLH123-like [Cucurbita pepo subsp. pepo] | e value: 3.0e-165 | % identity: 74.04
Query:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETGQH-----Q
        MAEEFQSTGNWWDASRSRYEA   PSSS+ITTFVDH DS   A DPNLHIMGLGLDW+Q P FRGG EKA ESSFRSMLQPDNMNLNMQETGQH     Q
Subjt:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETGQH-----Q

Query:  HIQWMRSEKQYLGETPAAATEFKPINRGFSLD---QPQPQFSPSQY----SAVTSFSIDNAGSATLYGNSATLLQGLIGG----EQQGSGVGMNFPYSGH
         IQWMRSEK Y GE+P  ATEFKPINRGFSLD   QP PQFSPS Y    S +TSF ID A +A LYGNS TLLQGL+ G    +QQ SG GMNFPY+ H
Subjt:  HIQWMRSEKQYLGETPAAATEFKPINRGFSLD---QPQPQFSPSQY----SAVTSFSIDNAGSATLYGNSATLLQGLIGG----EQQGSGVGMNFPYSGH

Query:  YGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAPFA-DKSKQNISEVGDSGTVGKKSGNDHNQQ
        +G+NS ELMAA SWS SKVP FLRNSPPKGAAP P+ S LQFSNNT FWNASD+KD R SY P  YNAA  A DKSK            GKKSGND NQQ
Subjt:  YGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAPFA-DKSKQNISEVGDSGTVGKKSGNDHNQQ

Query:  T--------GAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTK
                 G     KRPRNE    LPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSG  VQQQ QQ+ CEK K
Subjt:  T--------GAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTK

Query:  DGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
        +GEG ++DLRSRGLCLVPVSSTFPVTHETTVDFWTP+FGGTFR
Subjt:  DGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

XP_038903335.1 transcription factor bHLH123 isoform X1 [Benincasa hispida] | e value: 4.4e-169 | % identity: 77.48
Query:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETG--------
        MAEEFQS+GNWW+ASR+RYEA   PSSSSITTFVDHSDSA  ASDPNLHIMGLGLDW+Q P FRGG EKA E SFRSMLQP+NMNLNMQETG        
Subjt:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETG--------

Query:  --QHQHIQWMRSEKQYLGETPAAATEFKPINRGFSLD--QPQPQF-SPSQYSA----VTSFSIDNAGSATLYGNSATLLQGLI--GGEQQ----GSGVGM
          Q Q IQWMRSEK Y GE+P  ATEFK INRG  LD  Q QPQF SPS YS+    VTSF ID A    LYGNSATLLQGL+  G EQQ       VGM
Subjt:  --QHQHIQWMRSEKQYLGETPAAATEFKPINRGFSLD--QPQPQF-SPSQYSA----VTSFSIDNAGSATLYGNSATLLQGLI--GGEQQ----GSGVGM

Query:  NFPYSGHYGLNSGELM-AAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYN-AAPFADKSKQNISEVGDS-GTVGK
        NFPY+ H+G+NSGELM    SWS SK+PQFLRNSPPKGAA    HS LQFSNNT FWNASD+KDVRPSYFPP YN AA FA+K K NISEVGDS  TV K
Subjt:  NFPYSGHYGLNSGELM-AAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYN-AAPFADKSKQNISEVGDS-GTVGK

Query:  KSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKT
        KSGND NQQ+ A   AKRPRNE PSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGA VQ QHQQQ CEK+
Subjt:  KSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKT

Query:  KDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
        K+GEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
Subjt:  KDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

XP_038903336.1 transcription factor bHLH123 isoform X2 [Benincasa hispida] | e value: 6.4e-168 | % identity: 77.03
Query:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETG--------
        MAEEFQS+GNWW+ASR+RYEA   PSSSSITTFVDHSDSA  ASDPNLHIMGLGLDW+QP    GG EKA E SFRSMLQP+NMNLNMQETG        
Subjt:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETG--------

Query:  --QHQHIQWMRSEKQYLGETPAAATEFKPINRGFSLD--QPQPQF-SPSQYSA----VTSFSIDNAGSATLYGNSATLLQGLI--GGEQQ----GSGVGM
          Q Q IQWMRSEK Y GE+P  ATEFK INRG  LD  Q QPQF SPS YS+    VTSF ID A    LYGNSATLLQGL+  G EQQ       VGM
Subjt:  --QHQHIQWMRSEKQYLGETPAAATEFKPINRGFSLD--QPQPQF-SPSQYSA----VTSFSIDNAGSATLYGNSATLLQGLI--GGEQQ----GSGVGM

Query:  NFPYSGHYGLNSGELM-AAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYN-AAPFADKSKQNISEVGDS-GTVGK
        NFPY+ H+G+NSGELM    SWS SK+PQFLRNSPPKGAA    HS LQFSNNT FWNASD+KDVRPSYFPP YN AA FA+K K NISEVGDS  TV K
Subjt:  NFPYSGHYGLNSGELM-AAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYN-AAPFADKSKQNISEVGDS-GTVGK

Query:  KSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKT
        KSGND NQQ+ A   AKRPRNE PSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGA VQ QHQQQ CEK+
Subjt:  KSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKT

Query:  KDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
        K+GEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
Subjt:  KDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

XP_038903337.1 transcription factor bHLH123 isoform X3 [Benincasa hispida] | e value: 9.2e-167 | % identity: 77.03
Query:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETG--------
        MAEEFQS+GNWW+ASR+RYEA   PSSSSITTFVDHSDSA  ASDPNLHIMGLGLDW+Q P FRGG EKA E SFRSMLQP+NMNLNMQETG        
Subjt:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETG--------

Query:  --QHQHIQWMRSEKQYLGETPAAATEFKPINRGFSLD--QPQPQF-SPSQYSA----VTSFSIDNAGSATLYGNSATLLQGLI--GGEQQ----GSGVGM
          Q Q IQWMRSEK Y GE+P  ATEFK INRG  LD  Q QPQF SPS YS+    VTSF ID A    LYGNSATLLQGL+  G EQQ       VGM
Subjt:  --QHQHIQWMRSEKQYLGETPAAATEFKPINRGFSLD--QPQPQF-SPSQYSA----VTSFSIDNAGSATLYGNSATLLQGLI--GGEQQ----GSGVGM

Query:  NFPYSGHYGLNSGELM-AAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYN-AAPFADKSKQNISEVGDS-GTVGK
        NFPY+ H+G+NSGELM    SWS SK+PQFLRNSPPKGAA    HS LQFSNNT FWNASD+KDVRPSYFPP YN AA FA+K K NISEVGDS  TV K
Subjt:  NFPYSGHYGLNSGELM-AAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYN-AAPFADKSKQNISEVGDS-GTVGK

Query:  KSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKT
        KSGND NQQ+ A   AKRPRNE PSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQ  VLSTPYLKSGA VQ QHQQQ CEK+
Subjt:  KSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKT

Query:  KDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
        K+GEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
Subjt:  KDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0L564 BHLH domain-containing protein | e value: 6.0e-164 | % identity: 75.11
Query:  MAEEFQSTGNWWDASRSRYEAPSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETG----------Q
        MAEEFQS+GNWW+A+ SR  +PSSSSITTFVDHSDSA AASDPNLHIMGLGLDW+Q P FRGG EKA E SFRSMLQPDNMNLNM+ETG          Q
Subjt:  MAEEFQSTGNWWDASRSRYEAPSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETG----------Q

Query:  HQHIQWMRSEKQYLGETPAAATEFKPINRGFSLD----------QPQPQF-SPSQY----SAVTSFSIDNAGSATLYGNSATLLQGLI--GGEQQGS---
         Q IQWMRSEK Y GE+P  AT+FKPINRGFSLD          Q QPQF SPS Y    SAVTS+ ID   +A LYGNSATLLQGL+  GGEQQ     
Subjt:  HQHIQWMRSEKQYLGETPAAATEFKPINRGFSLD----------QPQPQF-SPSQY----SAVTSFSIDNAGSATLYGNSATLLQGLI--GGEQQGS---

Query:  --GVGMNFPYSGHYGLNSGELM-AAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAP-FADKSKQNISEVGDS
           +GMNFPY+ H+G+NSGELM    SWS SKVP +LRNSPPK  A    HS LQFSNNT FWNASD+K+VRPSYF P YNAA  F +KSK NISEVGDS
Subjt:  --GVGMNFPYSGHYGLNSGELM-AAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAP-FADKSKQNISEVGDS

Query:  GTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQ
         T  KKSGND+NQQ+ A   AKRPRNE PSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQ
Subjt:  GTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQ

Query:  SCEKT-KDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
          EK+ K+GEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
Subjt:  SCEKT-KDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

A0A6J1DW21 transcription factor bHLH123 isoform X1 | e value: 1.5e-159 | % identity: 74.17
Query:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTF---------------VDHSDSAVAASDPNLHIMGLGL-----DWSQPPNFRGGAEKAPESSFRSMLQ
        MAEEF S+GNWWDASRSRY+A   PSSSS+TTF               +  SDS  A++DPNLH MGLGL     DW+  P  RGG EKA ESSFRSMLQ
Subjt:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTF---------------VDHSDSAVAASDPNLHIMGLGL-----DWSQPPNFRGGAEKAPESSFRSMLQ

Query:  PDNMNLNMQETG-QHQHIQWMRSEKQYLGETPAAATEFKPINRGFSLD--QPQP-QFSPSQY----SAVTSFSIDNAGSATLYGNSATLLQGLIGG--EQ
         +    NMQETG Q Q IQWMRSEK + GETP  A+EFK INRGFSLD  Q QP QFSP QY    SAVTSF +D A  A LYG+SATLLQGL+GG  +Q
Subjt:  PDNMNLNMQETG-QHQHIQWMRSEKQYLGETPAAATEFKPINRGFSLD--QPQP-QFSPSQY----SAVTSFSIDNAGSATLYGNSATLLQGLIGG--EQ

Query:  QGS--GVGMNFPYSGHYGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSH-SHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAP-FADKSKQNISEV
        Q S     +NFPYS H+ +NSGELM  P WS SKVP FLRNSPPK AAPAP H SHL FSNNTPFWNASD+KDVRPS+FP  YN AP + DKSKQNISEV
Subjt:  QGS--GVGMNFPYSGHYGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSH-SHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAP-FADKSKQNISEV

Query:  GDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQH
         DSGTVGKKSGNDHNQQTG  A AKRPRNE PSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSG  VQQQH
Subjt:  GDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQH

Query:  -QQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
         QQQS EK+K+GEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
Subjt:  -QQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

A0A6J1FLL0 transcription factor bHLH123-like isoform X2 | e value: 6.9e-160 | % identity: 73.64
Query:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETGQH-----Q
        MAEEFQS+GNWWDA+RSRYEA   PSSS+ITTFVDH DS   A DPNLHIMGLGLDW+Q P FRGG EKA ESSFRSMLQPDNMNLNMQETGQH     Q
Subjt:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETGQH-----Q

Query:  HIQWMRSEKQYLGETPAAATEFKPINRGFSLD---QPQPQFSPSQY----SAVTSFSIDN-AGSATLYGNSATLLQGLIGG----EQQGSGVGMNFPYSG
         IQWMRSEK Y GE+P  ATEFKPINRGFSLD   QP PQFSPS Y    S +TSF +D  A +A LYGNS TLLQGL+ G    +QQ SG GMNFPY+ 
Subjt:  HIQWMRSEKQYLGETPAAATEFKPINRGFSLD---QPQPQFSPSQY----SAVTSFSIDN-AGSATLYGNSATLLQGLIGG----EQQGSGVGMNFPYSG

Query:  HYGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAPFA-DKSKQNISEVGDSGTVGKKSGNDHNQ
        H+G+NS ELMAA SWS SKVP FLRNSPPKGAAP P  S LQFSNNT FWNASD+KD R SY P  YNAA  A DKSK            GKKSGND NQ
Subjt:  HYGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAPFA-DKSKQNISEVGDSGTVGKKSGNDHNQ

Query:  QT--GAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS--GAVVQQQHQQQSCEKTKDGE
        Q   G     KRPRNE    LPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQ  VLSTPYLKS  GAVVQQQ QQ+ CEK  +GE
Subjt:  QT--GAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS--GAVVQQQHQQQSCEKTKDGE

Query:  GGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
        G ++DLRSRGLCLVPVSSTFPVTHETTVDFWTP+FGGTFR
Subjt:  GGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

A0A6J1FU65 transcription factor bHLH123-like isoform X1 | e value: 2.5e-162 | % identity: 74.09
Query:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETGQH-----Q
        MAEEFQS+GNWWDA+RSRYEA   PSSS+ITTFVDH DS   A DPNLHIMGLGLDW+Q P FRGG EKA ESSFRSMLQPDNMNLNMQETGQH     Q
Subjt:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETGQH-----Q

Query:  HIQWMRSEKQYLGETPAAATEFKPINRGFSLD---QPQPQFSPSQY----SAVTSFSIDN-AGSATLYGNSATLLQGLIGG----EQQGSGVGMNFPYSG
         IQWMRSEK Y GE+P  ATEFKPINRGFSLD   QP PQFSPS Y    S +TSF +D  A +A LYGNS TLLQGL+ G    +QQ SG GMNFPY+ 
Subjt:  HIQWMRSEKQYLGETPAAATEFKPINRGFSLD---QPQPQFSPSQY----SAVTSFSIDN-AGSATLYGNSATLLQGLIGG----EQQGSGVGMNFPYSG

Query:  HYGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAPFA-DKSKQNISEVGDSGTVGKKSGNDHNQ
        H+G+NS ELMAA SWS SKVP FLRNSPPKGAAP P  S LQFSNNT FWNASD+KD R SY P  YNAA  A DKSK            GKKSGND NQ
Subjt:  HYGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAPFA-DKSKQNISEVGDSGTVGKKSGNDHNQ

Query:  QT--GAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS--GAVVQQQHQQQSCEKTKDGE
        Q   G     KRPRNE    LPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS  GAVVQQQ QQ+ CEK  +GE
Subjt:  QT--GAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS--GAVVQQQHQQQSCEKTKDGE

Query:  GGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
        G ++DLRSRGLCLVPVSSTFPVTHETTVDFWTP+FGGTFR
Subjt:  GGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

A0A6J1JW46 transcription factor bHLH123-like isoform X1 | e value: 3.7e-161 | % identity: 74.37
Query:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETGQH-----Q
        MAEEFQSTGNWWDASRSRYEA   PSSS+ITTFVDH D+   A DPNLHIMGLGLDW+Q P FRGG EK  ESSFRSMLQPDNMNLNMQETGQH     Q
Subjt:  MAEEFQSTGNWWDASRSRYEA---PSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETGQH-----Q

Query:  HIQWMRSEKQYLGETPAAATEFKPINRGFSLD---QPQPQFSPSQY----SAVTSFSIDNAGSATLYGNSATLLQGLIGG----EQQGSGVGMNFPYSGH
         IQWMRSEK Y GE+P  ATEFKPINRGFSLD   QP PQFSPS Y    S VTSF ID+A +A LY NS TLLQGL+ G    +QQ SG GMNFPY+ H
Subjt:  HIQWMRSEKQYLGETPAAATEFKPINRGFSLD---QPQPQFSPSQY----SAVTSFSIDNAGSATLYGNSATLLQGLIGG----EQQGSGVGMNFPYSGH

Query:  YGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHN-QQ
        +G+NS ELMAA SWS S VP FLRNSPPKGAAP P+ S LQFSNNT FWNASD+KD R SY P  YN A  A    ++ SE       GKKSGND N QQ
Subjt:  YGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWNASDVKDVRPSYFPPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHN-QQ

Query:  TGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS--GAVVQQQHQQQSCEKTKDGEGGK
        TG     KRPRNE    LPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS  GAVVQQQ QQ+ CEK K+GEG +
Subjt:  TGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS--GAVVQQQHQQQSCEKTKDGEGGK

Query:  QDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
        +DLRSRGLCLVPVSSTFPVTHETTVDFWTP FGGTFR
Subjt:  QDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q7XHI5 Transcription factor bHLH133 | e value: 1.6e-25 | % identity: 40.11
Query:  NISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYL--KSG
        N+SE   S  +G    N            K+P+ ++PS     KVRKEK+G RI +L QLVSPFGKTDTASVLSEAI YI+FLH Q+  LS PY    S 
Subjt:  NISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYL--KSG

Query:  AVVQQQHQQQSC----------------------------EKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
          +  QH Q++                             +K+   E   +DLRSRGLCLVP+S T  V  +   D+W P FG T +
Subjt:  AVVQQQHQQQSC----------------------------EKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

Q8GXT3 Transcription factor bHLH123 | e value: 2.4e-64 | % identity: 43.4
Query:  EFQSTGNWWDASRSRYEAPSSSSITTFVDHSDSAV---------AASDPNLHIMGLGLDWSQP-----PNFRGGAEKAPESSFRSMLQPDNMNL------
        +F ++G+WW  S S   + SSS   + ++   SAV          A+D +L ++GLGL    P      +   G  KA E+SF  MLQ +N+NL      
Subjt:  EFQSTGNWWDASRSRYEAPSSSSITTFVDHSDSAV---------AASDPNLHIMGLGLDWSQP-----PNFRGGAEKAPESSFRSMLQPDNMNL------

Query:  NMQETGQHQHIQWMRSEK----QYLGETPAAATEFKPI------NRGFSLDQPQPQFSPSQYSAVTS-------FSIDNAGSATLYGNSATLLQGLIG--
        N   T      Q   S+     Q L   P   ++FKP       NRGF LD    QFSP   S+  S       F++DN+ +A +Y  + T      G  
Subjt:  NMQETGQHQHIQWMRSEK----QYLGETPAAATEFKPI------NRGFSLDQPQPQFSPSQYSAVTS-------FSIDNAGSATLYGNSATLLQGLIG--

Query:  GEQQGSGVGMN--FPYSGH---------YGLNSG--ELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWN--------ASDVKDVRPSYF
          QQ  G G +   P   H         +G ++G  + MA+   ST     FLR+SPP    P P HS L+FSNN  FWN        A    D   ++F
Subjt:  GEQQGSGVGMN--FPYSGH---------YGLNSG--ELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWN--------ASDVKDVRPSYF

Query:  ----PPLYNAAPFADKSKQNISEVGDSGT-VGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYI
            PP  +   F D+  +NISE+ DS +   K+ GNDH         AKR ++E  SP PAFK RKEKMGDRI ALQQLVSPFGKTD ASVLSEAIEYI
Subjt:  ----PPLYNAAPFADKSKQNISEVGDSGT-VGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYI

Query:  KFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
        KFLH+QVS LS PY+KSGA +Q Q    S E     E    DLRSRGLCLVPVSSTFPVTH+TTVDFWTPTFGGTFR
Subjt:  KFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

Q8VZ22 Transcription factor bHLH103 | e value: 2.5e-29 | % identity: 59.84
Query:  PAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVS--VLSTPYLKS-GAVVQQQHQQQSCEKTKDGE-GGKQDL
        P KRPR E PS  P+FKVRKEK+GDRITALQQLVSPFGKTDTASVL +AI+YIKFL EQ++  V ++P+L S G+  Q+Q   +S   T +     +QDL
Subjt:  PAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVS--VLSTPYLKS-GAVVQQQHQQQSCEKTKDGE-GGKQDL

Query:  RSRGLCLVPVSSTF--PVTHETTVDFW
        RSRGLCL+P+SSTF  P  H  T   W
Subjt:  RSRGLCLVPVSSTF--PVTHETTVDFW

Q94JL3 Transcription factor bHLH112 | e value: 3.2e-45 | % identity: 54.55
Query:  FSNNT---PFWNAS---DVKDVRPSYF--PPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITA
        FSNN    PFWN+S   ++ +  PS F   P   +    DK+K N+     S ++ +   N+        + AK+PR   PSPLP FKVRKE + D+IT+
Subjt:  FSNNT---PFWNAS---DVKDVRPSYF--PPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITA

Query:  LQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGG
        LQQLVSPFGKTDTASVL EAIEYIKFLH+QV+VLSTPY+K GA  QQQ Q     K++D E    +LR  GLCLVP+SSTFPV +ETT DFWTPTFGG
Subjt:  LQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGG

Q9M0X8 Transcription factor bHLH114 | e value: 6.1e-28 | % identity: 51.33
Query:  SKQNISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS
        SK+ I++   + T  +    + N    +    KRPR E  SPLP+FKVRKEK+GDRITALQQLVSPFGKTDTASVL+EA+EYIKFL EQV+VLS P   +
Subjt:  SKQNISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKS

Query:  GAVVQQQHQQQSCEKTKDGEGGKQ--------DLRSRGLCLVPVSSTFPV
           VQQQ           GE  +         DL SRGLCL+P+S+++PV
Subjt:  GAVVQQQHQQQSCEKTKDGEGGKQ--------DLRSRGLCLVPVSSTFPV

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G61660.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 2.3e-46 | % identity: 54.55
Query:  FSNNT---PFWNAS---DVKDVRPSYF--PPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITA
        FSNN    PFWN+S   ++ +  PS F   P   +    DK+K N+     S ++ +   N+        + AK+PR   PSPLP FKVRKE + D+IT+
Subjt:  FSNNT---PFWNAS---DVKDVRPSYF--PPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITA

Query:  LQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGG
        LQQLVSPFGKTDTASVL EAIEYIKFLH+QV+VLSTPY+K GA  QQQ Q     K++D E    +LR  GLCLVP+SSTFPV +ETT DFWTPTFGG
Subjt:  LQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGG

AT1G61660.3 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 2.5e-45 | % identity: 54.04
Query:  FSNNT---PFWNAS---DVKDVRPSYF--PPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITA
        FSNN    PFWN+S   ++ +  PS F   P   +    DK+K        S ++ +   N+        + AK+PR   PSPLP FKVRKE + D+IT+
Subjt:  FSNNT---PFWNAS---DVKDVRPSYF--PPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITA

Query:  LQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGG
        LQQLVSPFGKTDTASVL EAIEYIKFLH+QV+VLSTPY+K GA  QQQ Q     K++D E    +LR  GLCLVP+SSTFPV +ETT DFWTPTFGG
Subjt:  LQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGG

AT1G61660.4 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 1.4e-32 | % identity: 45.96
Query:  FSNNT---PFWNAS---DVKDVRPSYF--PPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITA
        FSNN    PFWN+S   ++ +  PS F   P   +    DK+K N+     S ++ +   N+        + AK+PR   PSPLP F             
Subjt:  FSNNT---PFWNAS---DVKDVRPSYF--PPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITA

Query:  LQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGG
                 KTDTASVL EAIEYIKFLH+QV+VLSTPY+K GA  QQQ Q     K++D E    +LR  GLCLVP+SSTFPV +ETT DFWTPTFGG
Subjt:  LQQLVSPFGKTDTASVLSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGG

AT3G20640.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 1.7e-65 | % identity: 43.4
Query:  EFQSTGNWWDASRSRYEAPSSSSITTFVDHSDSAV---------AASDPNLHIMGLGLDWSQP-----PNFRGGAEKAPESSFRSMLQPDNMNL------
        +F ++G+WW  S S   + SSS   + ++   SAV          A+D +L ++GLGL    P      +   G  KA E+SF  MLQ +N+NL      
Subjt:  EFQSTGNWWDASRSRYEAPSSSSITTFVDHSDSAV---------AASDPNLHIMGLGLDWSQP-----PNFRGGAEKAPESSFRSMLQPDNMNL------

Query:  NMQETGQHQHIQWMRSEK----QYLGETPAAATEFKPI------NRGFSLDQPQPQFSPSQYSAVTS-------FSIDNAGSATLYGNSATLLQGLIG--
        N   T      Q   S+     Q L   P   ++FKP       NRGF LD    QFSP   S+  S       F++DN+ +A +Y  + T      G  
Subjt:  NMQETGQHQHIQWMRSEK----QYLGETPAAATEFKPI------NRGFSLDQPQPQFSPSQYSAVTS-------FSIDNAGSATLYGNSATLLQGLIG--

Query:  GEQQGSGVGMN--FPYSGH---------YGLNSG--ELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWN--------ASDVKDVRPSYF
          QQ  G G +   P   H         +G ++G  + MA+   ST     FLR+SPP    P P HS L+FSNN  FWN        A    D   ++F
Subjt:  GEQQGSGVGMN--FPYSGH---------YGLNSG--ELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSHLQFSNNTPFWN--------ASDVKDVRPSYF

Query:  ----PPLYNAAPFADKSKQNISEVGDSGT-VGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYI
            PP  +   F D+  +NISE+ DS +   K+ GNDH         AKR ++E  SP PAFK RKEKMGDRI ALQQLVSPFGKTD ASVLSEAIEYI
Subjt:  ----PPLYNAAPFADKSKQNISEVGDSGT-VGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYI

Query:  KFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
        KFLH+QVS LS PY+KSGA +Q Q    S E     E    DLRSRGLCLVPVSSTFPVTH+TTVDFWTPTFGGTFR
Subjt:  KFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR

AT4G21340.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 1.7e-30 | % identity: 59.84
Query:  PAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVS--VLSTPYLKS-GAVVQQQHQQQSCEKTKDGE-GGKQDL
        P KRPR E PS  P+FKVRKEK+GDRITALQQLVSPFGKTDTASVL +AI+YIKFL EQ++  V ++P+L S G+  Q+Q   +S   T +     +QDL
Subjt:  PAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASVLSEAIEYIKFLHEQVS--VLSTPYLKS-GAVVQQQHQQQSCEKTKDGE-GGKQDL

Query:  RSRGLCLVPVSSTF--PVTHETTVDFW
        RSRGLCL+P+SSTF  P  H  T   W
Subjt:  RSRGLCLVPVSSTF--PVTHETTVDFW
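
In the alignments above, the unlabeled line between each Query and Subjt pair follows the usual BLAST midline convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a single alignment block, using the final block of the first GenBank hit as input. The function name `midline_stats` is hypothetical, and the reported % identity on this page is presumably computed over the whole alignment, so a per-block figure will not necessarily match it.

```python
def midline_stats(query, midline):
    """Count identities and positives in one BLAST-style alignment block.

    `query` is the aligned query row (gap characters included) and
    `midline` is the row printed beneath it. Trailing spaces on the
    midline are often stripped, so it is padded to the query width.
    """
    width = len(query)
    padded = midline.ljust(width)[:width]
    identities = sum(1 for ch in padded if ch.isalpha())  # letters = identical residues
    positives = identities + padded.count("+")            # '+' = conservative substitution
    return identities, positives, width


# Final block of the KAG6600752.1 alignment, copied from this page.
query = "GGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR"
midline = "G ++DLRSRGLCLVPVSSTFPVTHETTVDFWTP+FGGTFR"

identities, positives, width = midline_stats(query, midline)
print(f"{identities}/{width} identical, {positives}/{width} positive")
```

Summing the three counts over every block of a hit gives an overall identity fraction that can be compared with the tabulated value.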


Sequences
CDS sequence
ATGGCGGAGGAGTTTCAAAGCACCGGAAACTGGTGGGACGCTTCTAGAAGCCGCTATGAAGCCCCTTCGTCCTCCTCCATCACCACCTTTGTCGACCACTCCGACTCTGC
CGTTGCAGCCAGTGACCCCAATTTGCATATCATGGGCTTGGGCCTCGATTGGAGCCAGCCCCCCAACTTCCGGGGCGGGGCCGAGAAGGCACCCGAGAGCAGTTTCCGAT
CGATGCTGCAGCCTGATAATATGAATTTGAATATGCAAGAAACAGGACAGCACCAACATATTCAATGGATGAGATCGGAGAAGCAATATTTGGGGGAGACTCCGGCGGCT
GCGACTGAGTTCAAGCCAATAAACAGAGGGTTTTCGTTGGATCAGCCTCAGCCGCAGTTCAGCCCCTCTCAGTACAGCGCCGTCACCAGTTTTTCCATAGACAACGCCGG
CTCCGCTACTTTGTACGGTAATTCCGCAACATTATTACAAGGCTTAATAGGCGGCGAGCAGCAAGGTTCGGGGGTGGGTATGAACTTCCCCTACAGCGGTCACTACGGAT
TGAACTCCGGCGAGTTAATGGCGGCGCCGTCCTGGTCGACCTCTAAAGTACCACAGTTCTTAAGAAACTCGCCGCCGAAAGGTGCAGCGCCGGCGCCATCGCATAGTCAC
TTACAATTTTCTAACAATACGCCGTTCTGGAACGCGTCGGATGTAAAAGACGTGAGGCCCAGTTATTTTCCGCCGCTGTATAACGCCGCACCTTTCGCCGATAAATCAAA
GCAGAATATATCAGAAGTTGGTGATTCGGGAACAGTGGGTAAAAAAAGTGGAAATGATCATAATCAACAAACGGGTGCTATTGCTCCTGCTAAAAGGCCTCGAAATGAAA
TCCCTTCGCCATTGCCAGCTTTTAAGGTGAGGAAAGAGAAGATGGGAGACAGAATCACTGCGCTCCAACAACTCGTTTCACCTTTCGGAAAGACCGATACAGCTTCAGTG
CTGTCCGAAGCCATTGAATACATAAAGTTTCTCCATGAACAAGTCAGTGTTTTGAGCACTCCATATTTGAAGAGTGGAGCTGTAGTACAGCAACAACATCAGCAGCAGAG
CTGTGAGAAAACAAAGGATGGAGAAGGTGGTAAACAAGATCTAAGAAGCAGAGGGCTTTGTTTAGTTCCAGTTTCAAGTACATTTCCAGTCACTCACGAAACCACAGTTG
ATTTTTGGACTCCAACCTTCGGAGGAACATTCAGATAA
mRNA sequence
TGCCGCGAACCGTTGTCAATCGATTTTGTAAATAAAATCTCCACAAAGAGTCATAGAACAATTGAGAAAAGTCGGAAAGGAAAACCCAAATTTGAAATCAAAATTGTAGA
AAATCTTGGAAACTGGATTCGAGATAGAATAGCCATGGCGGAGGAGTTTCAAAGCACCGGAAACTGGTGGGACGCTTCTAGAAGCCGCTATGAAGCCCCTTCGTCCTCCT
CCATCACCACCTTTGTCGACCACTCCGACTCTGCCGTTGCAGCCAGTGACCCCAATTTGCATATCATGGGCTTGGGCCTCGATTGGAGCCAGCCCCCCAACTTCCGGGGC
GGGGCCGAGAAGGCACCCGAGAGCAGTTTCCGATCGATGCTGCAGCCTGATAATATGAATTTGAATATGCAAGAAACAGGACAGCACCAACATATTCAATGGATGAGATC
GGAGAAGCAATATTTGGGGGAGACTCCGGCGGCTGCGACTGAGTTCAAGCCAATAAACAGAGGGTTTTCGTTGGATCAGCCTCAGCCGCAGTTCAGCCCCTCTCAGTACA
GCGCCGTCACCAGTTTTTCCATAGACAACGCCGGCTCCGCTACTTTGTACGGTAATTCCGCAACATTATTACAAGGCTTAATAGGCGGCGAGCAGCAAGGTTCGGGGGTG
GGTATGAACTTCCCCTACAGCGGTCACTACGGATTGAACTCCGGCGAGTTAATGGCGGCGCCGTCCTGGTCGACCTCTAAAGTACCACAGTTCTTAAGAAACTCGCCGCC
GAAAGGTGCAGCGCCGGCGCCATCGCATAGTCACTTACAATTTTCTAACAATACGCCGTTCTGGAACGCGTCGGATGTAAAAGACGTGAGGCCCAGTTATTTTCCGCCGC
TGTATAACGCCGCACCTTTCGCCGATAAATCAAAGCAGAATATATCAGAAGTTGGTGATTCGGGAACAGTGGGTAAAAAAAGTGGAAATGATCATAATCAACAAACGGGT
GCTATTGCTCCTGCTAAAAGGCCTCGAAATGAAATCCCTTCGCCATTGCCAGCTTTTAAGGTGAGGAAAGAGAAGATGGGAGACAGAATCACTGCGCTCCAACAACTCGT
TTCACCTTTCGGAAAGACCGATACAGCTTCAGTGCTGTCCGAAGCCATTGAATACATAAAGTTTCTCCATGAACAAGTCAGTGTTTTGAGCACTCCATATTTGAAGAGTG
GAGCTGTAGTACAGCAACAACATCAGCAGCAGAGCTGTGAGAAAACAAAGGATGGAGAAGGTGGTAAACAAGATCTAAGAAGCAGAGGGCTTTGTTTAGTTCCAGTTTCA
AGTACATTTCCAGTCACTCACGAAACCACAGTTGATTTTTGGACTCCAACCTTCGGAGGAACATTCAGATAAATAAAAAAAATAAAAATGCTTTCTTACGTTTGGCTTTA
AAATATATAGTTAATTATATAATATCATACAGTTATGAATACATTATTATTATTATGTTCAACTAGTACATATCTTATCAAAACCCTACCAACATAATTTGGGGATATTG
ATAAGATCATTACTGTACTAGTAATTAACAATATCTTAAAGGGAAAAAGAAGGGCAGTATAACCCTCCAAAGCGAACTTGAGAGGCCGTGCTAGAGGCAAAATGATGCAG
CTTTAGGGCCACTATTTTAGGAGTATTTTTATTTCTTTTTTGTACTCAATTCAAAATAATCACAATTTAGAAGAAAAAAAATGGTAATTAGGATTGAGTCATTTATAAGA
GC
Protein sequence
MAEEFQSTGNWWDASRSRYEAPSSSSITTFVDHSDSAVAASDPNLHIMGLGLDWSQPPNFRGGAEKAPESSFRSMLQPDNMNLNMQETGQHQHIQWMRSEKQYLGETPAA
ATEFKPINRGFSLDQPQPQFSPSQYSAVTSFSIDNAGSATLYGNSATLLQGLIGGEQQGSGVGMNFPYSGHYGLNSGELMAAPSWSTSKVPQFLRNSPPKGAAPAPSHSH
LQFSNNTPFWNASDVKDVRPSYFPPLYNAAPFADKSKQNISEVGDSGTVGKKSGNDHNQQTGAIAPAKRPRNEIPSPLPAFKVRKEKMGDRITALQQLVSPFGKTDTASV
LSEAIEYIKFLHEQVSVLSTPYLKSGAVVQQQHQQQSCEKTKDGEGGKQDLRSRGLCLVPVSSTFPVTHETTVDFWTPTFGGTFR
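
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the `translate` helper and the table construction are illustrative and are not part of CuGenDB. Only the first and last few codons of the CDS are used as inputs here, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string to protein; '*' marks a stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases such as N
    are not handled in this sketch and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons and last three codons of the CDS listed on this page.
print(translate("ATGGCGGAGGAGTTTCAAAGCACC"))
print(translate("TTCAGATAA"))
```

Passing the full CDS (with line breaks left in) and stripping the trailing `*` should yield a string that can be compared directly with the protein sequence.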