; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003841 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003841
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptiontranscription factor bHLH68-like
Genome locationscaffold127:873094..876454
RNA-Seq ExpressionMS003841
SyntenyMS003841
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142511.1 transcription factor bHLH68 isoform X1 [Cucumis sativus]2.1e-13371.68Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA
        MNRGVLQSSV+QQMMG+ENPNWW+  NI GSMR+ST  Q  +      +SS       S NILFPH  +SLPFP   + D+QD   +PESWSQLLLGGL 
Subjt:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA

Query:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH--HHQLDHVKKE------QYVYGH---GDGGGRVGG--RSEDFQL-PHKQSWSHMIPASSPQSCVTS
        GE D+QKG MG F++KK LEDWEEEILNSN ++  H Q   VKKE       YVYGH   G GGG VGG    +D+QL   KQ+WS MI +SSPQSCVTS
Subjt:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH--HHQLDHVKKE------QYVYGH---GDGGGRVGG--RSEDFQL-PHKQSWSHMIPASSPQSCVTS

Query:  FSSNMLDFSNNS------PPDSRPRHP--LPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL
        FSSNMLDFSNN+         SRPR+P  + DRSSECNSN  GGA KKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL
Subjt:  FSSNMLDFSNNS------PPDSRPRHP--LPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL

Query:  QSQIEALSLPYLGNASSSTRQQQHSV-QVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        QSQIEALSLPYLGN S STRQ QH    +QGERNC+FPEDPGQLLNEN LKRKGVSEQ+EE K+DLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  QSQIEALSLPYLGNASSSTRQQQHSV-QVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

XP_008462742.1 PREDICTED: transcription factor bHLH68 isoform X1 [Cucumis melo]8.4e-13573.05Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA
        MNRGVLQSSV+QQMMG+ENPNWW+  NI GSMR+ST  Q  +      +SSS      S NILFPH  +SLPFP   + D+QD   +PESWSQLLLGGL 
Subjt:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA

Query:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH---HHQLDHVKKE------QYVYGHGDGGGRVGGRSEDFQL-PHKQSWSHMIPASSPQSCVTSFSSN
         E D+QKG MG F++KK LEDWEEEILNSN ++   H Q   VKKE       YVYGHG GGG  GG  +D+QL   KQ+WS MI +SSPQSCVTSFSSN
Subjt:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH---HHQLDHVKKE------QYVYGHGDGGGRVGGRSEDFQL-PHKQSWSHMIPASSPQSCVTSFSSN

Query:  MLDFSNNS------PPDSRPRHPLP--DRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI
        MLDFSNN+         SRPR+P P  DRSSECNSN  GGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI
Subjt:  MLDFSNNS------PPDSRPRHPLP--DRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI

Query:  EALSLPYLGNASSSTRQ---QQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        EALSLPYLGN S STRQ   QQHSVQ  GERNC+FPEDPGQLLNEN LKRKGVSEQ+EE K+DLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  EALSLPYLGNASSSTRQ---QQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

XP_022145064.1 transcription factor bHLH68-like [Momordica charantia]8.3e-17585.68Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMRTSTQNQQPAAVT-AAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQLPESWSQLLLGGLAGE
        MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMRTSTQNQQPAAVT AAA SSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQLPESWSQLLLGGLAGE
Subjt:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMRTSTQNQQPAAVT-AAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQLPESWSQLLLGGLAGE

Query:  LDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSNMLDFSNNSPPDS
        LDEQKG MGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSNMLDFSNNSPPDS
Subjt:  LDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSNMLDFSNNSPPDS

Query:  RPRHPLPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQH
        RPRHPLPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIE + L               
Subjt:  RPRHPLPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQH

Query:  SVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGSFR
                                         DEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGSFR
Subjt:  SVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGSFR

XP_031744355.1 transcription factor bHLH68 isoform X2 [Cucumis sativus]9.7e-12368.59Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA
        MNRGVLQSSV+QQMMG+ENPNWW+  NI GSMR+ST  Q  +      +SS       S NILFPH  +SLPFP   + D+QD   +PESWSQLLLGGL 
Subjt:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA

Query:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH--HHQLDHVKKE------QYVYGH---GDGGGRVGG--RSEDFQL-PHKQSWSHMIPASSPQSCVTS
        GE D+QKG MG F++KK LEDWEEEILNSN ++  H Q   VKKE       YVYGH   G GGG VGG    +D+QL   KQ+WS MI +SSPQSCVTS
Subjt:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH--HHQLDHVKKE------QYVYGH---GDGGGRVGG--RSEDFQL-PHKQSWSHMIPASSPQSCVTS

Query:  FSSNMLDFSNNS------PPDSRPRHP--LPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL
        FSSNMLDFSNN+         SRPR+P  + DRSSECNSN  GGA KKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL
Subjt:  FSSNMLDFSNNS------PPDSRPRHP--LPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL

Query:  QSQIEALSLPYLGNASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        QSQIEALSLPYLGN S STRQ QH                  LLNEN LKRKGVSEQ+EE K+DLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  QSQIEALSLPYLGNASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

XP_038879708.1 transcription factor bHLH68 isoform X1 [Benincasa hispida]4.8e-13071.68Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQD--QLPESWSQLLLGGLA
        MNRGVLQSSV+QQMMG+ENPNWW+  NI G +R++T  Q         +S +     NSPNILFPH  +SLPFP   + D QD   LPESWSQLLLGGL 
Subjt:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQD--QLPESWSQLLLGGLA

Query:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDH--VKKE------QYVYGHGDGGGRVGGRSEDFQL-PHKQSWSHMIPASSPQSCVTSFSSNM
        GE D+QKG MG F++KK LE WEEEILNSN +++ Q +   VKKE       YVYGHG          +D+QL   KQ+WS MI +SSPQSCVTSFSSNM
Subjt:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDH--VKKE------QYVYGHGDGGGRVGGRSEDFQL-PHKQSWSHMIPASSPQSCVTSFSSNM

Query:  LDFSNNSPPD-----SRPRHPLP-DRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEAL
        LDFSNN+  +     SRPRHP   DRSSECNSN TGGA KKARVQ SSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEAL
Subjt:  LDFSNNSPPD-----SRPRHPLP-DRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEAL

Query:  SLPYLGNASSSTRQQQHSV-QVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        SLPYLGNAS STRQQQ     VQGERNC+FPEDPGQLLNEN LKRKGVSEQDEE K+DLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  SLPYLGNASSSTRQQQHSV-QVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

TrEMBL top hitse value%identityAlignment
A0A0A0M120 BHLH domain-containing protein1.0e-13371.68Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA
        MNRGVLQSSV+QQMMG+ENPNWW+  NI GSMR+ST  Q  +      +SS       S NILFPH  +SLPFP   + D+QD   +PESWSQLLLGGL 
Subjt:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA

Query:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH--HHQLDHVKKE------QYVYGH---GDGGGRVGG--RSEDFQL-PHKQSWSHMIPASSPQSCVTS
        GE D+QKG MG F++KK LEDWEEEILNSN ++  H Q   VKKE       YVYGH   G GGG VGG    +D+QL   KQ+WS MI +SSPQSCVTS
Subjt:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH--HHQLDHVKKE------QYVYGH---GDGGGRVGG--RSEDFQL-PHKQSWSHMIPASSPQSCVTS

Query:  FSSNMLDFSNNS------PPDSRPRHP--LPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL
        FSSNMLDFSNN+         SRPR+P  + DRSSECNSN  GGA KKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL
Subjt:  FSSNMLDFSNNS------PPDSRPRHP--LPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL

Query:  QSQIEALSLPYLGNASSSTRQQQHSV-QVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        QSQIEALSLPYLGN S STRQ QH    +QGERNC+FPEDPGQLLNEN LKRKGVSEQ+EE K+DLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  QSQIEALSLPYLGNASSSTRQQQHSV-QVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

A0A1S3CHN0 transcription factor bHLH68 isoform X21.6e-11867.51Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA
        MNRGVLQSSV+QQMMG+ENPNWW+  NI GSMR+ST  Q  +      +SSS      S NILFPH  +SLPFP   + D+QD   +PESWSQLLLGGL 
Subjt:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA

Query:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH---HHQLDHVKKE------QYVYGHGDGGGRVGGRSEDFQL-PHKQSWSHMIPASSPQSCVTSFSSN
         E D+QKG MG F++KK LEDWEEEILNSN ++   H Q   VKKE       YVYGHG GGG  GG  +D+QL   KQ+WS MI +SSPQSCVTSFSSN
Subjt:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH---HHQLDHVKKE------QYVYGHGDGGGRVGGRSEDFQL-PHKQSWSHMIPASSPQSCVTSFSSN

Query:  MLDFSNNS------PPDSRPRHPLP--DRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI
        MLDFSNN+         SRPR+P P  DRSSECNSN  GGAAKKARVQSSSNQTTF                      KTDTASVLLEAIGYIRFLQSQI
Subjt:  MLDFSNNS------PPDSRPRHPLP--DRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI

Query:  EALSLPYLGNASSSTRQ---QQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        EALSLPYLGN S STRQ   QQHSVQ  GERNC+FPEDPGQLLNEN LKRKGVSEQ+EE K+DLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  EALSLPYLGNASSSTRQ---QQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

A0A1S3CJ89 transcription factor bHLH68 isoform X14.1e-13573.05Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA
        MNRGVLQSSV+QQMMG+ENPNWW+  NI GSMR+ST  Q  +      +SSS      S NILFPH  +SLPFP   + D+QD   +PESWSQLLLGGL 
Subjt:  MNRGVLQSSVVQQMMGAENPNWWS-NNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQ--LPESWSQLLLGGLA

Query:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH---HHQLDHVKKE------QYVYGHGDGGGRVGGRSEDFQL-PHKQSWSHMIPASSPQSCVTSFSSN
         E D+QKG MG F++KK LEDWEEEILNSN ++   H Q   VKKE       YVYGHG GGG  GG  +D+QL   KQ+WS MI +SSPQSCVTSFSSN
Subjt:  GELDEQKGGMGQFEAKKLLEDWEEEILNSNGHH---HHQLDHVKKE------QYVYGHGDGGGRVGGRSEDFQL-PHKQSWSHMIPASSPQSCVTSFSSN

Query:  MLDFSNNS------PPDSRPRHPLP--DRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI
        MLDFSNN+         SRPR+P P  DRSSECNSN  GGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI
Subjt:  MLDFSNNS------PPDSRPRHPLP--DRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI

Query:  EALSLPYLGNASSSTRQ---QQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        EALSLPYLGN S STRQ   QQHSVQ  GERNC+FPEDPGQLLNEN LKRKGVSEQ+EE K+DLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  EALSLPYLGNASSSTRQ---QQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

A0A6J1CU45 transcription factor bHLH68-like4.0e-17585.68Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMRTSTQNQQPAAVT-AAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQLPESWSQLLLGGLAGE
        MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMRTSTQNQQPAAVT AAA SSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQLPESWSQLLLGGLAGE
Subjt:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMRTSTQNQQPAAVT-AAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQLPESWSQLLLGGLAGE

Query:  LDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSNMLDFSNNSPPDS
        LDEQKG MGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSNMLDFSNNSPPDS
Subjt:  LDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSNMLDFSNNSPPDS

Query:  RPRHPLPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQH
        RPRHPLPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIE + L               
Subjt:  RPRHPLPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQH

Query:  SVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGSFR
                                         DEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGSFR
Subjt:  SVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGSFR

A0A6J1F5W4 transcription factor bHLH685.7e-12168.27Show/hide
Query:  MNRGVLQSSVV-QQMMGAENPNWWSNNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFP---FPSWCDSQDQLPESWSQLLLGGL
        MNRG LQSSV+ QQMMG+ENP+WW+    GS+RTS+  Q           SSP    NSPN LFPH  +SLPFP      +   +DQLPESWS LLLGGL
Subjt:  MNRGVLQSSVV-QQMMGAENPNWWSNNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFP---FPSWCDSQDQLPESWSQLLLGGL

Query:  AGELDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLD-HVKKEQ--------YVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSN
         GE D+QKG MG     K LEDWEEEILN+N   HHQ    VKKE         YVYGHG   G V G +    LP KQ+WS +IP SSPQSCVTSFSSN
Subjt:  AGELDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLD-HVKKEQ--------YVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSN

Query:  MLDFSN---NSPPDSRPRHPLPDRSSECNSNAT-GGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALS
        MLDFSN   NSPPDSRPRHP PDRSSECNSNA  GGA KKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL SQIEALS
Subjt:  MLDFSN---NSPPDSRPRHPLPDRSSECNSNAT-GGAAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALS

Query:  LPYLGNASSSTRQ-QQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGSFR
        LPYLGN S+STRQ QQHS  VQGE N +FPE                ++QDEE K+DLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG FR
Subjt:  LPYLGNASSSTRQ-QQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGSFR

SwissProt top hitse value%identityAlignment
Q7XHI5 Transcription factor bHLH1331.4e-6346.31Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMR--TSTQNQQPAAVTAAATSSSP--FLCPNSPNILFPHY--SNSLPFPFPSWCDSQD---QLPESWSQL
        MNRGVL+SS VQ +  A NPNWW NN+   +R  T   + +P + TA   S  P  F  P S +   P +   NS P  F SW +  D     P S SQL
Subjt:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMR--TSTQNQQPAAVTAAATSSSP--FLCPNSPNILFPHY--SNSLPFPFPSWCDSQD---QLPESWSQL

Query:  LLGGLAGELDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSP--QSCVTSFSSNML
        LLGGL          MG           EEE +    HHHHQ  H   +     + +        S   +  +  S+  M   +SP  +SC T  ++N  
Subjt:  LLGGLAGELDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSP--QSCVTSFSSNML

Query:  DFSNNSPPDSRPRHPLPDRSSECNSNATGG---AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPY
        + +NN+              SECNS+   G   A KK ++Q  S+Q+T KVRKEKLG RI +LHQLVSPFGKTDTASVL EAIGYIRFL SQIEALSLPY
Subjt:  DFSNNSPPDSRPRHPLPDRSSECNSNATGG---AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPY

Query:  LGNASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGV---------SEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFG
         G  S +    QH+   Q   N +FPEDPGQL+NE  +KR GV         S  +EEP +DLRSRGLCLVP+SCTLQVGSDNGADYWAPAFG
Subjt:  LGNASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGV---------SEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFG

Q8GXT3 Transcription factor bHLH1231.0e-2645.56Show/hide
Query:  DSRPRHPLPDRSSECNSNATGG-----AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASS
        D +P++    R S  N    GG     AAK+A+ +++S    FK RKEK+GDRI AL QLVSPFGKTD ASVL EAI YI+FL  Q+ ALS PY+ + +S
Subjt:  DSRPRHPLPDRSSECNSNATGG-----AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASS

Query:  STRQQ-QHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGG
           QQ  HS ++                           E  EEP  DLRSRGLCLVPVS T  V  D   D+W P FGG
Subjt:  STRQQ-QHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGG

Q8S3D1 Transcription factor bHLH682.2e-7747.67Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMR--TSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHY-------------SNSLPF--PFPSWCDSQDQ
        MNRGVL+SS VQQ+M A NPNWW  N+ G MR        Q A +    T ++ +L P      FPH+             S SLP      SW +S D 
Subjt:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMR--TSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHY-------------SNSLPF--PFPSWCDSQDQ

Query:  LPESW--SQLLLGGLAGELDEQKGGMGQ-----------FEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSH
         PESW  SQLLLGGL    +E+   M             F+ K  LE+WEE++L+     H Q   V  +                 ++  + +   +  
Subjt:  LPESW--SQLLLGGLAGELDEQKGGMGQ-----------FEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSH

Query:  MIPASSP-QSCVTSF--------------SSNMLDFS--NNSPPDSRPRHPLPDRSSECNSNATGGAA-KKARVQ-SSSNQTTFKVRKEKLGDRITALHQ
          P S P +SCVT+               ++NMLDFS  +N    S  RH  PDRSSECNS   GG+  KK R+Q S S+Q+T KVRKEKLG RI ALHQ
Subjt:  MIPASSP-QSCVTSF--------------SSNMLDFS--NNSPPDSRPRHPLPDRSSECNSNATGGAA-KKARVQ-SSSNQTTFKVRKEKLGDRITALHQ

Query:  LVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLG-NASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQ-------DEEPKRDLRSR
        LVSPFGKTDTASVL EAIGYIRFLQSQIEALS PY G  AS + R QQH   +QG+R+C+FPEDPGQL+N+  +KR+G S          EEPK+DLRSR
Subjt:  LVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLG-NASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQ-------DEEPKRDLRSR

Query:  GLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        GLCLVP+SCTLQVGSDNGADYWAPA G  G
Subjt:  GLCLVPVSCTLQVGSDNGADYWAPAFGGGG

Q94JL3 Transcription factor bHLH1122.1e-2747.37Show/hide
Query:  AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQHSVQVQGERNCMFPEDPGQLL
        AAKK RV + S   TFKVRKE L D+IT+L QLVSPFGKTDTASVL EAI YI+FL  Q+  LS PY+   +S+ +QQ    Q+ G+             
Subjt:  AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQHSVQVQGERNCMFPEDPGQLL

Query:  NENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGG
                    QDE    +LR  GLCLVP+S T  V ++  AD+W P FGG
Subjt:  NENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGG

Q9SFZ3 Transcription factor bHLH1103.8e-2943.29Show/hide
Query:  GRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSNMLDFSNNSPPDSRPRHPLPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALH
        G   G    F LP        +P  S     +S +  M  FSN        RH     +    + A   A+KK RV+S S+   FKVRKEKLGDRI AL 
Subjt:  GRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSNMLDFSNNSPPDSRPRHPLPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALH

Query:  QLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPV
        QLVSPFGKTDTASVL+EAIGYI+FLQSQIE LS+PY+                +  RN   P    QL++++        E DEE  RDLRSRGLCLVP+
Subjt:  QLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPV

Query:  SCTLQVGSDN-------GADYW--APAFGGG
        SC   V  D        G  +W   P FGGG
Subjt:  SCTLQVGSDN-------GADYW--APAFGGG

Arabidopsis top hitse value%identityAlignment
AT1G27660.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.7e-3043.29Show/hide
Query:  GRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSNMLDFSNNSPPDSRPRHPLPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALH
        G   G    F LP        +P  S     +S +  M  FSN        RH     +    + A   A+KK RV+S S+   FKVRKEKLGDRI AL 
Subjt:  GRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSNMLDFSNNSPPDSRPRHPLPDRSSECNSNATGGAAKKARVQSSSNQTTFKVRKEKLGDRITALH

Query:  QLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPV
        QLVSPFGKTDTASVL+EAIGYI+FLQSQIE LS+PY+                +  RN   P    QL++++        E DEE  RDLRSRGLCLVP+
Subjt:  QLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQDEEPKRDLRSRGLCLVPV

Query:  SCTLQVGSDN-------GADYW--APAFGGG
        SC   V  D        G  +W   P FGGG
Subjt:  SCTLQVGSDN-------GADYW--APAFGGG

AT1G61660.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.5e-2847.37Show/hide
Query:  AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQHSVQVQGERNCMFPEDPGQLL
        AAKK RV + S   TFKVRKE L D+IT+L QLVSPFGKTDTASVL EAI YI+FL  Q+  LS PY+   +S+ +QQ    Q+ G+             
Subjt:  AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQHSVQVQGERNCMFPEDPGQLL

Query:  NENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGG
                    QDE    +LR  GLCLVP+S T  V ++  AD+W P FGG
Subjt:  NENGLKRKGVSEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGG

AT2G20100.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein9.8e-6546.31Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMR--TSTQNQQPAAVTAAATSSSP--FLCPNSPNILFPHY--SNSLPFPFPSWCDSQD---QLPESWSQL
        MNRGVL+SS VQ +  A NPNWW NN+   +R  T   + +P + TA   S  P  F  P S +   P +   NS P  F SW +  D     P S SQL
Subjt:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMR--TSTQNQQPAAVTAAATSSSP--FLCPNSPNILFPHY--SNSLPFPFPSWCDSQD---QLPESWSQL

Query:  LLGGLAGELDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSP--QSCVTSFSSNML
        LLGGL          MG           EEE +    HHHHQ  H   +     + +        S   +  +  S+  M   +SP  +SC T  ++N  
Subjt:  LLGGLAGELDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSP--QSCVTSFSSNML

Query:  DFSNNSPPDSRPRHPLPDRSSECNSNATGG---AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPY
        + +NN+              SECNS+   G   A KK ++Q  S+Q+T KVRKEKLG RI +LHQLVSPFGKTDTASVL EAIGYIRFL SQIEALSLPY
Subjt:  DFSNNSPPDSRPRHPLPDRSSECNSNATGG---AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPY

Query:  LGNASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGV---------SEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFG
         G  S +    QH+   Q   N +FPEDPGQL+NE  +KR GV         S  +EEP +DLRSRGLCLVP+SCTLQVGSDNGADYWAPAFG
Subjt:  LGNASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGV---------SEQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFG

AT2G20100.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.2e-4142.17Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMR--TSTQNQQPAAVTAAATSSSP--FLCPNSPNILFPHY--SNSLPFPFPSWCDSQD---QLPESWSQL
        MNRGVL+SS VQ +  A NPNWW NN+   +R  T   + +P + TA   S  P  F  P S +   P +   NS P  F SW +  D     P S SQL
Subjt:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMR--TSTQNQQPAAVTAAATSSSP--FLCPNSPNILFPHY--SNSLPFPFPSWCDSQD---QLPESWSQL

Query:  LLGGLAGELDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSP--QSCVTSFSSNML
        LLGGL          MG           EEE +    HHHHQ  H   +     + +        S   +  +  S+  M   +SP  +SC T  ++N  
Subjt:  LLGGLAGELDEQKGGMGQFEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSP--QSCVTSFSSNML

Query:  DFSNNSPPDSRPRHPLPDRSSECNSNATGG---AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPY
        + +NN+              SECNS+   G   A KK ++Q  S+Q+T KVRKEKLG RI +LHQLVSPFGKTDTASVL EAIGYIRFL SQIEALSLPY
Subjt:  DFSNNSPPDSRPRHPLPDRSSECNSNATGG---AAKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPY

Query:  LGNASSSTRQQQHSVQVQGERNCMFPEDPGQL
         G  S +    QH+   Q   N +FPEDPGQ+
Subjt:  LGNASSSTRQQQHSVQVQGERNCMFPEDPGQL

AT4G29100.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.6e-7847.67Show/hide
Query:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMR--TSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHY-------------SNSLPF--PFPSWCDSQDQ
        MNRGVL+SS VQQ+M A NPNWW  N+ G MR        Q A +    T ++ +L P      FPH+             S SLP      SW +S D 
Subjt:  MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMR--TSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHY-------------SNSLPF--PFPSWCDSQDQ

Query:  LPESW--SQLLLGGLAGELDEQKGGMGQ-----------FEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSH
         PESW  SQLLLGGL    +E+   M             F+ K  LE+WEE++L+     H Q   V  +                 ++  + +   +  
Subjt:  LPESW--SQLLLGGLAGELDEQKGGMGQ-----------FEAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSH

Query:  MIPASSP-QSCVTSF--------------SSNMLDFS--NNSPPDSRPRHPLPDRSSECNSNATGGAA-KKARVQ-SSSNQTTFKVRKEKLGDRITALHQ
          P S P +SCVT+               ++NMLDFS  +N    S  RH  PDRSSECNS   GG+  KK R+Q S S+Q+T KVRKEKLG RI ALHQ
Subjt:  MIPASSP-QSCVTSF--------------SSNMLDFS--NNSPPDSRPRHPLPDRSSECNSNATGGAA-KKARVQ-SSSNQTTFKVRKEKLGDRITALHQ

Query:  LVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLG-NASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQ-------DEEPKRDLRSR
        LVSPFGKTDTASVL EAIGYIRFLQSQIEALS PY G  AS + R QQH   +QG+R+C+FPEDPGQL+N+  +KR+G S          EEPK+DLRSR
Subjt:  LVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLG-NASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVSEQ-------DEEPKRDLRSR

Query:  GLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        GLCLVP+SCTLQVGSDNGADYWAPA G  G
Subjt:  GLCLVPVSCTLQVGSDNGADYWAPAFGGGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGAGGGGTTCTGCAAAGCTCAGTTGTGCAACAGATGATGGGAGCAGAAAACCCTAATTGGTGGAGTAATAATATTGGTGGGAGTATGAGAACAAGCACACAAAA
CCAACAGCCAGCAGCAGTAACAGCAGCAGCAACTTCTTCTTCTCCATTTTTATGTCCTAATTCTCCCAATATTCTATTCCCTCACTACTCCAATTCCCTTCCTTTTCCTT
TTCCTTCTTGGTGTGATTCTCAGGACCAGCTTCCAGAGTCTTGGAGCCAACTTCTCTTGGGAGGTTTGGCTGGGGAATTGGATGAACAGAAGGGGGGTATGGGGCAGTTT
GAAGCCAAGAAATTGCTGGAGGATTGGGAAGAAGAGATTTTGAATAGTAACGGCCACCACCACCACCAGCTTGATCATGTCAAGAAGGAGCAGTACGTGTATGGACATGG
CGACGGCGGTGGCCGTGTCGGTGGCCGCAGCGAGGATTTTCAGCTGCCTCATAAACAAAGCTGGTCCCATATGATTCCTGCTTCTTCTCCTCAGTCATGCGTCACAAGTT
TTAGTAGCAATATGTTGGATTTTTCTAACAACTCGCCTCCTGATTCCCGCCCCCGACATCCCCTACCGGATCGCTCGTCTGAGTGTAACAGTAATGCAACTGGTGGGGCT
GCGAAGAAGGCGAGGGTTCAATCCTCTTCAAATCAAACCACTTTCAAGGTGAGGAAGGAGAAATTAGGTGACAGAATTACAGCTCTCCACCAGCTAGTTTCCCCATTTGG
GAAGACTGACACAGCTTCAGTTTTGTTAGAAGCTATTGGGTACATCAGATTCCTTCAGAGTCAAATTGAGGCCCTCAGTTTACCCTACTTGGGCAATGCTTCCTCAAGTA
CGAGGCAACAACAACATTCTGTTCAAGTTCAAGGAGAAAGAAATTGTATGTTTCCTGAAGACCCTGGCCAGCTCTTGAACGAGAATGGCTTGAAGAGGAAGGGAGTCTCT
GAGCAGGATGAAGAGCCAAAGAGAGATCTGAGGAGTAGAGGGTTGTGTCTGGTTCCAGTGTCCTGCACCCTGCAAGTTGGGAGTGACAATGGAGCAGATTATTGGGCTCC
GGCGTTTGGTGGCGGTGGCAGTTTCCGG
mRNA sequenceShow/hide mRNA sequence
ATGAATAGAGGGGTTCTGCAAAGCTCAGTTGTGCAACAGATGATGGGAGCAGAAAACCCTAATTGGTGGAGTAATAATATTGGTGGGAGTATGAGAACAAGCACACAAAA
CCAACAGCCAGCAGCAGTAACAGCAGCAGCAACTTCTTCTTCTCCATTTTTATGTCCTAATTCTCCCAATATTCTATTCCCTCACTACTCCAATTCCCTTCCTTTTCCTT
TTCCTTCTTGGTGTGATTCTCAGGACCAGCTTCCAGAGTCTTGGAGCCAACTTCTCTTGGGAGGTTTGGCTGGGGAATTGGATGAACAGAAGGGGGGTATGGGGCAGTTT
GAAGCCAAGAAATTGCTGGAGGATTGGGAAGAAGAGATTTTGAATAGTAACGGCCACCACCACCACCAGCTTGATCATGTCAAGAAGGAGCAGTACGTGTATGGACATGG
CGACGGCGGTGGCCGTGTCGGTGGCCGCAGCGAGGATTTTCAGCTGCCTCATAAACAAAGCTGGTCCCATATGATTCCTGCTTCTTCTCCTCAGTCATGCGTCACAAGTT
TTAGTAGCAATATGTTGGATTTTTCTAACAACTCGCCTCCTGATTCCCGCCCCCGACATCCCCTACCGGATCGCTCGTCTGAGTGTAACAGTAATGCAACTGGTGGGGCT
GCGAAGAAGGCGAGGGTTCAATCCTCTTCAAATCAAACCACTTTCAAGGTGAGGAAGGAGAAATTAGGTGACAGAATTACAGCTCTCCACCAGCTAGTTTCCCCATTTGG
GAAGACTGACACAGCTTCAGTTTTGTTAGAAGCTATTGGGTACATCAGATTCCTTCAGAGTCAAATTGAGGCCCTCAGTTTACCCTACTTGGGCAATGCTTCCTCAAGTA
CGAGGCAACAACAACATTCTGTTCAAGTTCAAGGAGAAAGAAATTGTATGTTTCCTGAAGACCCTGGCCAGCTCTTGAACGAGAATGGCTTGAAGAGGAAGGGAGTCTCT
GAGCAGGATGAAGAGCCAAAGAGAGATCTGAGGAGTAGAGGGTTGTGTCTGGTTCCAGTGTCCTGCACCCTGCAAGTTGGGAGTGACAATGGAGCAGATTATTGGGCTCC
GGCGTTTGGTGGCGGTGGCAGTTTCCGG
Protein sequenceShow/hide protein sequence
MNRGVLQSSVVQQMMGAENPNWWSNNIGGSMRTSTQNQQPAAVTAAATSSSPFLCPNSPNILFPHYSNSLPFPFPSWCDSQDQLPESWSQLLLGGLAGELDEQKGGMGQF
EAKKLLEDWEEEILNSNGHHHHQLDHVKKEQYVYGHGDGGGRVGGRSEDFQLPHKQSWSHMIPASSPQSCVTSFSSNMLDFSNNSPPDSRPRHPLPDRSSECNSNATGGA
AKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASSSTRQQQHSVQVQGERNCMFPEDPGQLLNENGLKRKGVS
EQDEEPKRDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGSFR