CuGenDBv2

Sgr027208 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr027208
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: basic helix-loop-helix (bHLH) DNA-binding superfamily protein
Genome location: tig00153048:1989608..1992828
RNA-Seq Expression: Sgr027208
Synteny: Sgr027208
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily
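The genome location above follows a `contig:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the helper below (`parse_location` is a hypothetical name, not part of CuGenDB) assumes the coordinates are 1-based and inclusive at both ends, which the page itself does not state.

```python
import re


def parse_location(location: str) -> dict:
    """Split a 'contig:start..end' string into contig, start, end and length."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return {"contig": contig, "start": start, "end": end, "length": end - start + 1}


loc = parse_location("tig00153048:1989608..1992828")
print(loc)
```

Under that coordinate assumption, the gene model spans 3,221 bp on contig tig00153048.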


Homology
GenBank top hits (e value, % identity, alignment)
XP_004142511.1 transcription factor bHLH68 isoform X1 [Cucumis sativus] (e value: 1.1e-139; % identity: 73.82)
Query:  MNRGVLQSSVVQQMMAAENPNWWNMN--SATMRTS--SQQPSP----FLSPSPNILFPHSSLSPASLPFPS--WYDAQDH--LPESWSQLLLGGLVGEDD
        MNRGVLQSSV+QQMM +ENPNWWNM   S +MR+S  +Q  SP      S S NILFPHS     SLPFPS  +YDAQDH  +PESWSQLLLGGLVGE D
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNMN--SATMRTS--SQQPSP----FLSPSPNILFPHSSLSPASLPFPS--WYDAQDH--LPESWSQLLLGGLVGEDD

Query:  DQKECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGH--------------GGEDYQLPPKQSSWSSPLMPASSPQSCV
        DQK C MG FQSKKLEDWEEEILN+N      Q     VD+KKE S HASS  YVYGH              GG+DYQL   + +W SP++ +SSPQSCV
Subjt:  DQKECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGH--------------GGEDYQLPPKQSSWSSPLMPASSPQSCV

Query:  TSFSSNMLDFSNNS------PPDSRPRHPPP--DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIR
        TSFSSNMLDFSNN+         SRPR+PP   DRSSECNSN  GGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIR
Subjt:  TSFSSNMLDFSNNS------PPDSRPRHPPP--DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIR

Query:  FLQSQIEALSLPYLGNASGSTRQ---QQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGG
        FLQSQIEALSLPYLGN SGSTRQ   QQHS+QGERNC+FPEDPGQLLNENCLKRKG SEQ+EE KKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGG
Subjt:  FLQSQIEALSLPYLGNASGSTRQ---QQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGG

Query:  G
        G
Subjt:  G
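In BLAST-style alignments like the one above, the unlabeled middle row carries the match information: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these over a short fragment copied from the first alignment block. The function name `midline_stats` is hypothetical, and the figure it yields covers only this fragment, not the full-length alignment behind the reported % identity.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for a BLAST midline."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 26 alignment columns of the XP_004142511.1 hit.
query = "MNRGVLQSSVVQQMMAAENPNWWNMN"
# Pad to the query length so the trailing mismatch column is not lost.
midline = "MNRGVLQSSV+QQMM +ENPNWWNM".ljust(len(query))

ident, pos, length = midline_stats(midline)
print(f"identities {ident}/{length}, positives {pos}/{length}")
```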

XP_008462742.1 PREDICTED: transcription factor bHLH68 isoform X1 [Cucumis melo] (e value: 2.7e-138; % identity: 74.24)
Query:  MNRGVLQSSVVQQMMAAENPNWWNMN--SATMRTS--SQQPSP-----FLSPSPNILFPHSSLSPASLPFPS--WYDAQDH--LPESWSQLLLGGLVGED
        MNRGVLQSSV+QQMM +ENPNWWNM   S +MR+S  +Q  SP       S S NILFPHS     SLPFPS  +YDAQDH  +PESWSQLLLGGLV E 
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNMN--SATMRTS--SQQPSP-----FLSPSPNILFPHSSLSPASLPFPS--WYDAQDH--LPESWSQLLLGGLVGED

Query:  DDQKECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGH-------GGEDYQLPPKQSSWSSPLMPASSPQSCVTSFSSN
        DDQK C MG FQSKKLEDWEEEILN+N            VD+KKE S HA+S  YVYGH       GG+DYQL   + +W SP++ +SSPQSCVTSFSSN
Subjt:  DDQKECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGH-------GGEDYQLPPKQSSWSSPLMPASSPQSCVTSFSSN

Query:  MLDFSNNS------PPDSRPRHPPP--DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI
        MLDFSNN+         SRPR+PPP  DRSSECNSN  GGA KKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI
Subjt:  MLDFSNNS------PPDSRPRHPPP--DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI

Query:  EALSLPYLGNASGSTRQ---QQHSVQ-GERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        EALSLPYLGN SGSTRQ   QQHSVQ GERNC+FPEDPGQLLNENCLKRKG SEQ+EE KKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  EALSLPYLGNASGSTRQ---QQHSVQ-GERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

XP_022933813.1 transcription factor bHLH68 [Cucurbita moschata] (e value: 1.2e-130; % identity: 73.18)
Query:  MNRGVLQSSVV-QQMMAAENPNWWNM-NSATMRTSSQQPSPFLSPSPNILFPHSSLSPASLPFPS--WYD---AQDHLPESWSQLLLGGLVGEDDDQKEC
        MNRG LQSSV+ QQMM +ENP+WWNM NS ++RTSS Q S   + SPN LFPHS     SLPFPS   YD    +D LPESWS LLLGGLVGE DDQK C
Subjt:  MNRGVLQSSVV-QQMMAAENPNWWNM-NSATMRTSSQQPSPFLSPSPNILFPHSSLSPASLPFPS--WYD---AQDHLPESWSQLLLGGLVGEDDDQKEC

Query:  IMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHG-------GEDY-QLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFS
        + G FQ KKLEDWEEEILN N      Q+    VD+KKE SAHA+SC YVYGHG       G++Y  LPPKQ +WSS ++P SSPQSCVTSFSSNMLDFS
Subjt:  IMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHG-------GEDY-QLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFS

Query:  N---NSPPDSRPRHPPPDRSSECNSNAT-GGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLG
        N   NSPPDSRPRHPPPDRSSECNSNA  GGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL SQIEALSLPYLG
Subjt:  N---NSPPDSRPRHPPPDRSSECNSNAT-GGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLG

Query:  NASGSTRQ-QQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        N S STRQ QQHSVQGE N +FPE                ++QDEE KKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  NASGSTRQ-QQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

XP_038879708.1 transcription factor bHLH68 isoform X1 [Benincasa hispida] (e value: 3.3e-144; % identity: 77.55)
Query:  MNRGVLQSSVVQQMMAAENPNWWNM---NSATMRTSSQQPSP---FLSPSPNILFPHSSLSPASLPFPS--WYDAQD--HLPESWSQLLLGGLVGEDDDQ
        MNRGVLQSSV+QQMM +ENPNWWNM   +     T++QQ SP     + SPNILFPHS     SLPFPS  +YD QD  HLPESWSQLLLGGLVGE DDQ
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNM---NSATMRTSSQQPSP---FLSPSPNILFPHSSLSPASLPFPS--WYDAQD--HLPESWSQLLLGGLVGEDDDQ

Query:  KECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFSNNSPP
        K C MG FQSKKLE WEEEILN+N      QN    VD+KKE SA AS+  YVYGHGG+DYQL  ++ +W SP++ +SSPQSCVTSFSSNMLDFSNN+  
Subjt:  KECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFSNNSPP

Query:  D-----SRPRHPPP-DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNAS
        +     SRPRHPP  DRSSECNSN TGGAVKKARVQ SSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNAS
Subjt:  D-----SRPRHPPP-DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNAS

Query:  GSTR---QQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        GSTR   QQQHSVQGERNC+FPEDPGQLLNENCLKRKG SEQDEE KKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  GSTR---QQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

XP_038879709.1 transcription factor bHLH68 isoform X2 [Benincasa hispida] (e value: 1.0e-132; % identity: 73.89)
Query:  MNRGVLQSSVVQQMMAAENPNWWNM---NSATMRTSSQQPSP---FLSPSPNILFPHSSLSPASLPFPS--WYDAQD--HLPESWSQLLLGGLVGEDDDQ
        MNRGVLQSSV+QQMM +ENPNWWNM   +     T++QQ SP     + SPNILFPHS     SLPFPS  +YD QD  HLPESWSQLLLGGLVGE DDQ
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNM---NSATMRTSSQQPSP---FLSPSPNILFPHSSLSPASLPFPS--WYDAQD--HLPESWSQLLLGGLVGEDDDQ

Query:  KECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFSNNSPP
        K C MG FQSKKLE WEEEILN+N      QN    VD+KKE SA AS+  YVYGHGG+DYQL  ++ +W SP++ +SSPQSCVTSFSSNMLDFSNN+  
Subjt:  KECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFSNNSPP

Query:  D-----SRPRHPPP-DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNAS
        +     SRPRHPP  DRSSECNSN TGGAVKKARVQ SSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNAS
Subjt:  D-----SRPRHPPP-DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNAS

Query:  GSTR---QQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        GSTR   QQQHSVQGERNC+FPEDPG               QDEE KKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  GSTR---QQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0M120 BHLH domain-containing protein (e value: 5.4e-140; % identity: 73.82)
Query:  MNRGVLQSSVVQQMMAAENPNWWNMN--SATMRTS--SQQPSP----FLSPSPNILFPHSSLSPASLPFPS--WYDAQDH--LPESWSQLLLGGLVGEDD
        MNRGVLQSSV+QQMM +ENPNWWNM   S +MR+S  +Q  SP      S S NILFPHS     SLPFPS  +YDAQDH  +PESWSQLLLGGLVGE D
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNMN--SATMRTS--SQQPSP----FLSPSPNILFPHSSLSPASLPFPS--WYDAQDH--LPESWSQLLLGGLVGEDD

Query:  DQKECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGH--------------GGEDYQLPPKQSSWSSPLMPASSPQSCV
        DQK C MG FQSKKLEDWEEEILN+N      Q     VD+KKE S HASS  YVYGH              GG+DYQL   + +W SP++ +SSPQSCV
Subjt:  DQKECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGH--------------GGEDYQLPPKQSSWSSPLMPASSPQSCV

Query:  TSFSSNMLDFSNNS------PPDSRPRHPPP--DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIR
        TSFSSNMLDFSNN+         SRPR+PP   DRSSECNSN  GGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIR
Subjt:  TSFSSNMLDFSNNS------PPDSRPRHPPP--DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIR

Query:  FLQSQIEALSLPYLGNASGSTRQ---QQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGG
        FLQSQIEALSLPYLGN SGSTRQ   QQHS+QGERNC+FPEDPGQLLNENCLKRKG SEQ+EE KKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGG
Subjt:  FLQSQIEALSLPYLGNASGSTRQ---QQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGG

Query:  G
        G
Subjt:  G

A0A1S3CJ89 transcription factor bHLH68 isoform X1 (e value: 1.3e-138; % identity: 74.24)
Query:  MNRGVLQSSVVQQMMAAENPNWWNMN--SATMRTS--SQQPSP-----FLSPSPNILFPHSSLSPASLPFPS--WYDAQDH--LPESWSQLLLGGLVGED
        MNRGVLQSSV+QQMM +ENPNWWNM   S +MR+S  +Q  SP       S S NILFPHS     SLPFPS  +YDAQDH  +PESWSQLLLGGLV E 
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNMN--SATMRTS--SQQPSP-----FLSPSPNILFPHSSLSPASLPFPS--WYDAQDH--LPESWSQLLLGGLVGED

Query:  DDQKECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGH-------GGEDYQLPPKQSSWSSPLMPASSPQSCVTSFSSN
        DDQK C MG FQSKKLEDWEEEILN+N            VD+KKE S HA+S  YVYGH       GG+DYQL   + +W SP++ +SSPQSCVTSFSSN
Subjt:  DDQKECIMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGH-------GGEDYQLPPKQSSWSSPLMPASSPQSCVTSFSSN

Query:  MLDFSNNS------PPDSRPRHPPP--DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI
        MLDFSNN+         SRPR+PPP  DRSSECNSN  GGA KKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI
Subjt:  MLDFSNNS------PPDSRPRHPPP--DRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQI

Query:  EALSLPYLGNASGSTRQ---QQHSVQ-GERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        EALSLPYLGN SGSTRQ   QQHSVQ GERNC+FPEDPGQLLNENCLKRKG SEQ+EE KKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  EALSLPYLGNASGSTRQ---QQHSVQ-GERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

A0A2C9UAT7 BHLH domain-containing protein (e value: 2.4e-124; % identity: 66.49)
Query:  MNRGVLQSSVVQQMMAAENPNWWNMNSATMRTSSQQPSPFLSP-------SPNILFPHSSLSPASLPFPSWYDAQDHLPESWSQLLLGGLVGEDDDQKEC
        MNRGVLQSS   Q + A NPNWWN+N+    T  QQ SPF+ P       +P+     SS + +SL  PSW+D QD LPESWSQLL+GGLV ED+     
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNMNSATMRTSSQQPSPFLSP-------SPNILFPHSSLSPASLPFPSWYDAQDHLPESWSQLLLGGLVGEDDDQKEC

Query:  IMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFSNNSPPDSR
         M  FQ+KKLE+WEE++L+++     A + + +VD+K+E SA+    +YVYGH  ED+Q    + SWS  + PASSP+SCVTSFSSNMLDFS N      
Subjt:  IMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFSNNSPPDSR

Query:  PRHPPPDRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASGSTR-----
         +HPPPDRSSECNS ATGGA+KKARVQ SS Q+TFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLG+ S + R     
Subjt:  PRHPPPDRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASGSTR-----

Query:  -QQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQ---DEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR
         QQQ SVQGERNC+FPEDPGQLLN+NC+KRKGAS+Q   +EEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPA GGGGFR
Subjt:  -QQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQ---DEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR

A0A6J1F5W4 transcription factor bHLH68 (e value: 5.9e-131; % identity: 73.18)
Query:  MNRGVLQSSVV-QQMMAAENPNWWNM-NSATMRTSSQQPSPFLSPSPNILFPHSSLSPASLPFPS--WYD---AQDHLPESWSQLLLGGLVGEDDDQKEC
        MNRG LQSSV+ QQMM +ENP+WWNM NS ++RTSS Q S   + SPN LFPHS     SLPFPS   YD    +D LPESWS LLLGGLVGE DDQK C
Subjt:  MNRGVLQSSVV-QQMMAAENPNWWNM-NSATMRTSSQQPSPFLSPSPNILFPHSSLSPASLPFPS--WYD---AQDHLPESWSQLLLGGLVGEDDDQKEC

Query:  IMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHG-------GEDY-QLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFS
        + G FQ KKLEDWEEEILN N      Q+    VD+KKE SAHA+SC YVYGHG       G++Y  LPPKQ +WSS ++P SSPQSCVTSFSSNMLDFS
Subjt:  IMGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHG-------GEDY-QLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFS

Query:  N---NSPPDSRPRHPPPDRSSECNSNAT-GGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLG
        N   NSPPDSRPRHPPPDRSSECNSNA  GGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL SQIEALSLPYLG
Subjt:  N---NSPPDSRPRHPPPDRSSECNSNAT-GGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLG

Query:  NASGSTRQ-QQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
        N S STRQ QQHSVQGE N +FPE                ++QDEE KKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG
Subjt:  NASGSTRQ-QQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGG

A0A6J1L807 transcription factor bHLH68 isoform X1 (e value: 6.6e-130; % identity: 72.18)
Query:  MNRGVLQSSVV-QQMMAAENPNWWNM-NSATMRTSSQQPSPFLSPSPNILFPHSSLSPASLPFPSWYDAQDHLPESWSQLLLGGLVGEDDDQKECIMGQF
        MNRG LQSSV+ QQ+M +ENP+WW+M NS ++RTSS Q S   + SPN LFPHSSL   S     +   +D LPESWS LLLGGLVGE DDQK C+ G F
Subjt:  MNRGVLQSSVV-QQMMAAENPNWWNM-NSATMRTSSQQPSPFLSPSPNILFPHSSLSPASLPFPSWYDAQDHLPESWSQLLLGGLVGEDDDQKECIMGQF

Query:  QSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGG-------EDY-QLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFSN---N
        Q KKLEDWEEEILN N      Q+    VD+KKE S HA+SC YVYGHGG       ++Y  LPPKQ +WSS ++P SSPQSCVTSFSSNMLDFSN   N
Subjt:  QSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGG-------EDY-QLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFSN---N

Query:  SPPDSRPRHPPPDRSSECNSNAT-GGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASGS
        SPPDSRPRH PPDRSSECNSNA  GGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFL SQIEALSLPYLGN S S
Subjt:  SPPDSRPRHPPPDRSSECNSNAT-GGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASGS

Query:  TRQ-QQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR
        TRQ QQHSVQGE N +FPED               ++QDEE KKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR
Subjt:  TRQ-QQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR

SwissProt top hits (e value, % identity, alignment)
Q7XHI5 Transcription factor bHLH133 (e value: 1.4e-65; % identity: 44.8)
Query:  MNRGVLQSSVVQQMMAAENPNWWNMNSATMRTSSQQPSPFLS---PSPNILFP------------HSSLSPASLP------FPSWYDAQD---HLPESWS
        MNRGVL+SS VQ + AA NPNWWN  S  +R     P+P +S   PS     P             SS SP+  P      F SW +  D     P S S
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNMNSATMRTSSQQPSPFLS---PSPNILFP------------HSSLSPASLP------FPSWYDAQD---HLPESWS

Query:  QLLLGGLVGEDDDQKECI--------MGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPAS
        QLLLGGL+  ++++ E +           +Q+K++++WEE++L +                 K++S++ +S    YG               SSP  P +
Subjt:  QLLLGGLVGEDDDQKECI--------MGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPAS

Query:  SPQSCVTSFSSNMLDFSNNSPPDSRPRHPPPDRSSECNSNATGG---AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYI
          +SC T  ++N  + +NN+              SECNS+   G   A KK ++Q  S+Q+T KVRKEKLG RI +LHQLVSPFGKTDTASVL EAIGYI
Subjt:  SPQSCVTSFSSNMLDFSNNSPPDSRPRHPPPDRSSECNSNATGG---AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYI

Query:  RFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNENCLKR--------KGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWA
        RFL SQIEALSLPY G  S +    QH+ Q   N +FPEDPGQL+NE C+KR           S  +EEP KDLRSRGLCLVP+SCTLQVGSDNGADYWA
Subjt:  RFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNENCLKR--------KGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWA

Query:  PAFG
        PAFG
Subjt:  PAFG

Q8GXT3 Transcription factor bHLH123 (e value: 5.0e-26; % identity: 45.51)
Query:  DSRPRHPPPDRSSECNSNATGG-----AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYL-GNAS
        D +P++    R S  N    GG     A K+A+ +++S    FK RKEK+GDRI AL QLVSPFGKTD ASVL EAI YI+FL  Q+ ALS PY+   AS
Subjt:  DSRPRHPPPDRSSECNSNATGG-----AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYL-GNAS

Query:  GSTRQQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGG
           +Q  HS +                          E  EEP  DLRSRGLCLVPVS T  V  D   D+W P FGG
Subjt:  GSTRQQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGG

Q8S3D1 Transcription factor bHLH68 (e value: 1.4e-84; % identity: 50.72)
Query:  MNRGVLQSSVVQQMMAAENPNWWNMNSA-----TMRTSSQQP-SPFLSPSPNIL--------FPH----------SSLSPASLP----FPSWYDAQDHLP
        MNRGVL+SS VQQ+MAA NPNWWN++        +    Q P  P ++P+ N L        FPH          SS S  SLP      SW ++ D  P
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNMNSA-----TMRTSSQQP-SPFLSPSPNIL--------FPH----------SSLSPASLP----FPSWYDAQDHLP

Query:  ESW--SQLLLGGLVGEDDDQKECIMGQ----------FQSK-KLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSS
        ESW  SQLLLGGL+  ++++ E +             FQ K +LE+WEE++L++   QQA+      VD+K+E + + ++     G+       PP +S 
Subjt:  ESW--SQLLLGGLVGEDDDQKECIMGQ----------FQSK-KLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSS

Query:  WSSPLMPASSPQSCVTSFSSNMLDFS--NNSPPDSRPRHPPPDRSSECNSNATGGAV-KKARVQ-SSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTA
         ++    + +      + ++NMLDFS  +N    S  RH PPDRSSECNS   GG+  KK R+Q S S+Q+T KVRKEKLG RI ALHQLVSPFGKTDTA
Subjt:  WSSPLMPASSPQSCVTSFSSNMLDFS--NNSPPDSRPRHPPPDRSSECNSNATGGAV-KKARVQ-SSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTA

Query:  SVLLEAIGYIRFLQSQIEALSLPYLG-NASGSTRQQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQ-------DEEPKKDLRSRGLCLVPVSCTLQV
        SVL EAIGYIRFLQSQIEALS PY G  ASG+ R QQH +QG+R+C+FPEDPGQL+N+ C+KR+GAS          EEPKKDLRSRGLCLVP+SCTLQV
Subjt:  SVLLEAIGYIRFLQSQIEALSLPYLG-NASGSTRQQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQ-------DEEPKKDLRSRGLCLVPVSCTLQV

Query:  GSDNGADYWAPAFGGGGF
        GSDNGADYWAPA G  GF
Subjt:  GSDNGADYWAPAFGGGGF

Q94JL3 Transcription factor bHLH112 (e value: 3.7e-29; % identity: 47.4)
Query:  AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNE
        A KK RV + S   TFKVRKE L D+IT+L QLVSPFGKTDTASVL EAI YI+FL  Q+  LS PY+    G++ QQQ  + G+               
Subjt:  AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNE

Query:  NCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR
               +  QDE    +LR  GLCLVP+S T  V ++  AD+W P FGG  FR
Subjt:  NCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR

Q9SFZ3 Transcription factor bHLH110 (e value: 4.8e-29; % identity: 44.72)
Query:  VTSFSSNMLDFSNNSPPDSRPRHPPPDRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIE
        ++S  ++ ++  +N P  S  +      +++   NA+    KK RV+S S+   FKVRKEKLGDRI AL QLVSPFGKTDTASVL+EAIGYI+FLQSQIE
Subjt:  VTSFSSNMLDFSNNSPPDSRPRHPPPDRSSECNSNATGGAVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIE

Query:  ALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDN-------GADYW--APAFGGG
         LS+PY+              +  RN   P    QL+++       + E DEE  +DLRSRGLCLVP+SC   V  D        G  +W   P FGGG
Subjt:  ALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDN-------GADYW--APAFGGG

Arabidopsis top hits (e value, % identity, alignment)
AT1G61660.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 2.6e-30; % identity: 47.4)
Query:  AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNE
        A KK RV + S   TFKVRKE L D+IT+L QLVSPFGKTDTASVL EAI YI+FL  Q+  LS PY+    G++ QQQ  + G+               
Subjt:  AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNE

Query:  NCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR
               +  QDE    +LR  GLCLVP+S T  V ++  AD+W P FGG  FR
Subjt:  NCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR

AT1G61660.3 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 2.6e-30; % identity: 47.4)
Query:  AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNE
        A KK RV + S   TFKVRKE L D+IT+L QLVSPFGKTDTASVL EAI YI+FL  Q+  LS PY+    G++ QQQ  + G+               
Subjt:  AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNE

Query:  NCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR
               +  QDE    +LR  GLCLVP+S T  V ++  AD+W P FGG  FR
Subjt:  NCLKRKGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR

AT2G20100.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 1.0e-66; % identity: 44.8)
Query:  MNRGVLQSSVVQQMMAAENPNWWNMNSATMRTSSQQPSPFLS---PSPNILFP------------HSSLSPASLP------FPSWYDAQD---HLPESWS
        MNRGVL+SS VQ + AA NPNWWN  S  +R     P+P +S   PS     P             SS SP+  P      F SW +  D     P S S
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNMNSATMRTSSQQPSPFLS---PSPNILFP------------HSSLSPASLP------FPSWYDAQD---HLPESWS

Query:  QLLLGGLVGEDDDQKECI--------MGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPAS
        QLLLGGL+  ++++ E +           +Q+K++++WEE++L +                 K++S++ +S    YG               SSP  P +
Subjt:  QLLLGGLVGEDDDQKECI--------MGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPAS

Query:  SPQSCVTSFSSNMLDFSNNSPPDSRPRHPPPDRSSECNSNATGG---AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYI
          +SC T  ++N  + +NN+              SECNS+   G   A KK ++Q  S+Q+T KVRKEKLG RI +LHQLVSPFGKTDTASVL EAIGYI
Subjt:  SPQSCVTSFSSNMLDFSNNSPPDSRPRHPPPDRSSECNSNATGG---AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYI

Query:  RFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNENCLKR--------KGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWA
        RFL SQIEALSLPY G  S +    QH+ Q   N +FPEDPGQL+NE C+KR           S  +EEP KDLRSRGLCLVP+SCTLQVGSDNGADYWA
Subjt:  RFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNENCLKR--------KGASEQDEEPKKDLRSRGLCLVPVSCTLQVGSDNGADYWA

Query:  PAFG
        PAFG
Subjt:  PAFG

AT2G20100.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 6.7e-42; % identity: 40.41)
Query:  MNRGVLQSSVVQQMMAAENPNWWNMNSATMRTSSQQPSPFLS---PSPNILFP------------HSSLSPASLP------FPSWYDAQD---HLPESWS
        MNRGVL+SS VQ + AA NPNWWN  S  +R     P+P +S   PS     P             SS SP+  P      F SW +  D     P S S
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNMNSATMRTSSQQPSPFLS---PSPNILFP------------HSSLSPASLP------FPSWYDAQD---HLPESWS

Query:  QLLLGGLVGEDDDQKECI--------MGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPAS
        QLLLGGL+  ++++ E +           +Q+K++++WEE++L +                 K++S++ +S    YG               SSP  P +
Subjt:  QLLLGGLVGEDDDQKECI--------MGQFQSKKLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPAS

Query:  SPQSCVTSFSSNMLDFSNNSPPDSRPRHPPPDRSSECNSNATGG---AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYI
          +SC T  ++N  + +NN+              SECNS+   G   A KK ++Q  S+Q+T KVRKEKLG RI +LHQLVSPFGKTDTASVL EAIGYI
Subjt:  SPQSCVTSFSSNMLDFSNNSPPDSRPRHPPPDRSSECNSNATGG---AVKKARVQSSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYI

Query:  RFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQL
        RFL SQIEALSLPY G  S +    QH+ Q   N +FPEDPGQ+
Subjt:  RFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQL

AT4G29100.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 9.8e-86; % identity: 50.72)
Query:  MNRGVLQSSVVQQMMAAENPNWWNMNSA-----TMRTSSQQP-SPFLSPSPNIL--------FPH----------SSLSPASLP----FPSWYDAQDHLP
        MNRGVL+SS VQQ+MAA NPNWWN++        +    Q P  P ++P+ N L        FPH          SS S  SLP      SW ++ D  P
Subjt:  MNRGVLQSSVVQQMMAAENPNWWNMNSA-----TMRTSSQQP-SPFLSPSPNIL--------FPH----------SSLSPASLP----FPSWYDAQDHLP

Query:  ESW--SQLLLGGLVGEDDDQKECIMGQ----------FQSK-KLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSS
        ESW  SQLLLGGL+  ++++ E +             FQ K +LE+WEE++L++   QQA+      VD+K+E + + ++     G+       PP +S 
Subjt:  ESW--SQLLLGGLVGEDDDQKECIMGQ----------FQSK-KLEDWEEEILNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSS

Query:  WSSPLMPASSPQSCVTSFSSNMLDFS--NNSPPDSRPRHPPPDRSSECNSNATGGAV-KKARVQ-SSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTA
         ++    + +      + ++NMLDFS  +N    S  RH PPDRSSECNS   GG+  KK R+Q S S+Q+T KVRKEKLG RI ALHQLVSPFGKTDTA
Subjt:  WSSPLMPASSPQSCVTSFSSNMLDFS--NNSPPDSRPRHPPPDRSSECNSNATGGAV-KKARVQ-SSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTA

Query:  SVLLEAIGYIRFLQSQIEALSLPYLG-NASGSTRQQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQ-------DEEPKKDLRSRGLCLVPVSCTLQV
        SVL EAIGYIRFLQSQIEALS PY G  ASG+ R QQH +QG+R+C+FPEDPGQL+N+ C+KR+GAS          EEPKKDLRSRGLCLVP+SCTLQV
Subjt:  SVLLEAIGYIRFLQSQIEALSLPYLG-NASGSTRQQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQ-------DEEPKKDLRSRGLCLVPVSCTLQV

Query:  GSDNGADYWAPAFGGGGF
        GSDNGADYWAPA G  GF
Subjt:  GSDNGADYWAPAFGGGGF


Sequences
CDS sequence
ATGAATAGAGGGGTTTTGCAGAGCTCAGTTGTGCAACAGATGATGGCTGCAGAAAACCCTAACTGGTGGAATATGAATAGTGCCACCATGAGGACAAGCTCACAACAACC
TTCGCCTTTCTTATCTCCTTCTCCCAATATTCTTTTCCCCCACTCTTCCCTTTCTCCTGCTTCCCTTCCTTTTCCTTCCTGGTATGATGCCCAGGACCATCTTCCAGAGT
CTTGGAGCCAACTTCTCCTGGGAGGTTTGGTTGGTGAAGATGATGATCAGAAGGAGTGTATTATGGGTCAGTTTCAATCCAAGAAACTGGAGGACTGGGAAGAAGAGATA
TTGAATAACAACGGGCAGCAGCAGGCTGCTCAAAATACAAACGGCGTCGTTGATTTGAAGAAAGAGCAGTCGGCTCATGCAAGCAGCTGCACCTACGTGTATGGCCATGG
CGGCGAGGATTATCAGCTACCTCCAAAACAAAGTTCCTGGTCCAGCCCCTTGATGCCGGCGTCGTCTCCTCAGTCATGCGTCACAAGTTTCAGCAGCAATATGTTGGATT
TTTCAAACAACTCGCCTCCTGATTCCCGCCCTAGACATCCCCCACCAGATCGCTCCTCTGAGTGTAACAGCAATGCAACTGGTGGGGCTGTGAAGAAGGCGAGGGTTCAA
TCCTCTTCAAACCAAACCACTTTCAAGGTGAGGAAGGAAAAATTAGGTGACAGAATTACAGCTCTCCACCAGCTAGTTTCCCCATTTGGGAAGACTGACACAGCTTCAGT
TTTGTTAGAGGCTATTGGGTACATCAGATTCCTTCAGAGTCAAATTGAGGCCCTCAGCTTACCCTACTTGGGCAATGCTTCGGGAAGTACGAGGCAACAACAACATTCCG
TTCAAGGAGAAAGAAATTGTATGTTTCCTGAAGACCCTGGTCAGCTCTTGAACGAGAACTGCTTGAAGAGGAAGGGCGCCTCTGAGCAGGATGAAGAGCCAAAGAAGGAC
CTCAGAAGTAGAGGGTTGTGTCTGGTTCCGGTGTCCTGCACCCTGCAAGTTGGGAGTGACAACGGAGCAGATTATTGGGCTCCGGCGTTCGGGGGCGGCGGTTTCCGGTA
G
mRNA sequence
ATGAATAGAGGGGTTTTGCAGAGCTCAGTTGTGCAACAGATGATGGCTGCAGAAAACCCTAACTGGTGGAATATGAATAGTGCCACCATGAGGACAAGCTCACAACAACC
TTCGCCTTTCTTATCTCCTTCTCCCAATATTCTTTTCCCCCACTCTTCCCTTTCTCCTGCTTCCCTTCCTTTTCCTTCCTGGTATGATGCCCAGGACCATCTTCCAGAGT
CTTGGAGCCAACTTCTCCTGGGAGGTTTGGTTGGTGAAGATGATGATCAGAAGGAGTGTATTATGGGTCAGTTTCAATCCAAGAAACTGGAGGACTGGGAAGAAGAGATA
TTGAATAACAACGGGCAGCAGCAGGCTGCTCAAAATACAAACGGCGTCGTTGATTTGAAGAAAGAGCAGTCGGCTCATGCAAGCAGCTGCACCTACGTGTATGGCCATGG
CGGCGAGGATTATCAGCTACCTCCAAAACAAAGTTCCTGGTCCAGCCCCTTGATGCCGGCGTCGTCTCCTCAGTCATGCGTCACAAGTTTCAGCAGCAATATGTTGGATT
TTTCAAACAACTCGCCTCCTGATTCCCGCCCTAGACATCCCCCACCAGATCGCTCCTCTGAGTGTAACAGCAATGCAACTGGTGGGGCTGTGAAGAAGGCGAGGGTTCAA
TCCTCTTCAAACCAAACCACTTTCAAGGTGAGGAAGGAAAAATTAGGTGACAGAATTACAGCTCTCCACCAGCTAGTTTCCCCATTTGGGAAGACTGACACAGCTTCAGT
TTTGTTAGAGGCTATTGGGTACATCAGATTCCTTCAGAGTCAAATTGAGGCCCTCAGCTTACCCTACTTGGGCAATGCTTCGGGAAGTACGAGGCAACAACAACATTCCG
TTCAAGGAGAAAGAAATTGTATGTTTCCTGAAGACCCTGGTCAGCTCTTGAACGAGAACTGCTTGAAGAGGAAGGGCGCCTCTGAGCAGGATGAAGAGCCAAAGAAGGAC
CTCAGAAGTAGAGGGTTGTGTCTGGTTCCGGTGTCCTGCACCCTGCAAGTTGGGAGTGACAACGGAGCAGATTATTGGGCTCCGGCGTTCGGGGGCGGCGGTTTCCGGTA
G
Protein sequence
MNRGVLQSSVVQQMMAAENPNWWNMNSATMRTSSQQPSPFLSPSPNILFPHSSLSPASLPFPSWYDAQDHLPESWSQLLLGGLVGEDDDQKECIMGQFQSKKLEDWEEEI
LNNNGQQQAAQNTNGVVDLKKEQSAHASSCTYVYGHGGEDYQLPPKQSSWSSPLMPASSPQSCVTSFSSNMLDFSNNSPPDSRPRHPPPDRSSECNSNATGGAVKKARVQ
SSSNQTTFKVRKEKLGDRITALHQLVSPFGKTDTASVLLEAIGYIRFLQSQIEALSLPYLGNASGSTRQQQHSVQGERNCMFPEDPGQLLNENCLKRKGASEQDEEPKKD
LRSRGLCLVPVSCTLQVGSDNGADYWAPAFGGGGFR
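The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below does this for the first ten codons only, assuming the standard genetic code (NCBI translation table 1); the helper `translate` and the constant names are illustrative, not part of any CuGenDB tooling.

```python
BASES = "TCAG"
# Standard genetic code, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the CDS and first 10 residues of the protein.
cds_start = "ATGAATAGAGGGGTTTTGCAGAGCTCAGTT"
protein_start = "MNRGVLQSSV"
print(translate(cds_start) == protein_start)
```

Joining the full CDS lines into one string and passing it to the same function would extend the check to the whole protein.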