; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS006071 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS006071
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionBHLH domain-containing protein
Genome locationscaffold254:3569659..3570374
RNA-Seq ExpressionMS006071
SyntenyMS006071
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582238.1 Transcription factor basic helix-loop-helix 95, partial [Cucurbita argyrosperma subsp. sororia]5.1e-7080.98Show/hide
Query:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LLP ILPV +SEAA+ NKASTSRKRRRA LEA GG+Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQLMRLEM
Subjt:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI
        +KK P ESV  TM+ PSTNSDS  GGGVIVS S NIVLFGI+ ASVRRGMVTQILMAFER+QAEVLAANVAVSHGNL+LT+TASVHGY+EN IE+I+NDI
Subjt:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI

Query:  LSLNK
        LSL K
Subjt:  LSLNK

XP_022138122.1 uncharacterized protein LOC111009370 [Momordica charantia]3.1e-9999.51Show/hide
Query:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK
        MEACTDSVIPLL QILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK
Subjt:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK

Query:  KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
        KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
Subjt:  KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS

Query:  LNK
        LNK
Subjt:  LNK

XP_022955986.1 uncharacterized protein LOC111457820 [Cucurbita moschata]5.1e-7080.98Show/hide
Query:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LLP ILPV +SEAA+ NKASTSRKRRRA LEA GG+Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQLMRLEM
Subjt:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI
        +KK P ESV  TM+ PSTNSDS  GGGVIVS S NIVLFGI+ ASVRRGMVTQILMAFER+QAEVLAANVAVSHGNL+LT+TASVHGY+EN IE+I+NDI
Subjt:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI

Query:  LSLNK
        LSL K
Subjt:  LSLNK

XP_022979931.1 uncharacterized protein LOC111479473 [Cucurbita maxima]2.2e-6878.95Show/hide
Query:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LLP ILPV +SEAA+ NKASTSRKR RA LEA GG Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQLMRLEM
Subjt:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGG----GGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKI
        +KK P ESV  TM+ PSTNSDS GG    GGVIVS S NIVLFGI+ ASVRRGMVT+ILMAFER+QAEVLAANVAVSHGNL+LT+TASVHGY+EN IE+I
Subjt:  KKKLPSESVIATMIPPSTNSDSSGG----GGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKI

Query:  KNDILSLNK
        +NDILSL K
Subjt:  KNDILSLNK

XP_023528352.1 uncharacterized protein LOC111791298 [Cucurbita pepo subsp. pepo]1.1e-6979.71Show/hide
Query:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LLP ILPV +SEAA+ NKASTSRKRRRA LEA GG+Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQL RLEM
Subjt:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGG--GGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKN
        +KK P ESV  TM+ PSTNSDS GG  GGVIVS S NIVLFGI+ ASVRRGMVTQILMAFER+QAEVLAANVAVSHGNL+LT+TAS+HGY+EN IE+I+N
Subjt:  KKKLPSESVIATMIPPSTNSDSSGG--GGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKN

Query:  DILSLNK
        DILSL K
Subjt:  DILSLNK

TrEMBL top hitse value%identityAlignment
A0A0A0L5R7 BHLH domain-containing protein1.0e-6071.15Show/hide
Query:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEAC+DSVIPL P ILPV   EA +  +AS SRKR RA LEA GG+Q K + KRKEM++SFDVL+SLVPNLSPKATRE IVS  IQFI+FL+KQLMRLEM
Subjt:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSG---GGGVIVSASGNIVLFG-ILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIK
        +KK  SESV    + P+TNSDSSG   G GVIVS SGNIVLFG I+ASV+RG+VTQIL+ FER++AEVLAANV VSHGNL+LT+TASVHGY+EN IE+I+
Subjt:  KKKLPSESVIATMIPPSTNSDSSG---GGGVIVSASGNIVLFG-ILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIK

Query:  NDILSLNK
        NDIL L K
Subjt:  NDILSLNK

A0A5A7U767 Transcription factor bHLH95-like1.0e-6070.53Show/hide
Query:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEAC+DSV+PL P ILP+   EA +  +AS SRKR RA LEA GG+Q K + KRKEM++SFDVL+SLVPNLSPKATRE IVS  IQFI+FL+KQLMRLEM
Subjt:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGGG--GVIVSASGNIVLFG-ILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKN
        +KK  SESV    + P++NSDSSGG   GVIVS SGNIVLFG I+ASV+RGMVTQIL+ FER++ EVLAANV VSHGNL+LT+TASVHGY+EN IE+I+N
Subjt:  KKKLPSESVIATMIPPSTNSDSSGGG--GVIVSASGNIVLFG-ILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKN

Query:  DILSLNK
        DILSL K
Subjt:  DILSLNK

A0A6J1CA71 uncharacterized protein LOC1110093701.5e-9999.51Show/hide
Query:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK
        MEACTDSVIPLL QILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK
Subjt:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMK

Query:  KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
        KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
Subjt:  KKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS

Query:  LNK
        LNK
Subjt:  LNK

A0A6J1GWN9 uncharacterized protein LOC1114578202.5e-7080.98Show/hide
Query:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LLP ILPV +SEAA+ NKASTSRKRRRA LEA GG+Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQLMRLEM
Subjt:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI
        +KK P ESV  TM+ PSTNSDS  GGGVIVS S NIVLFGI+ ASVRRGMVTQILMAFER+QAEVLAANVAVSHGNL+LT+TASVHGY+EN IE+I+NDI
Subjt:  KKKLPSESVIATMIPPSTNSDSSGGGGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDI

Query:  LSLNK
        LSL K
Subjt:  LSLNK

A0A6J1IS55 uncharacterized protein LOC1114794731.0e-6878.95Show/hide
Query:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM
        MEACTDSV+ LLP ILPV +SEAA+ NKASTSRKR RA LEA GG Q KGR KRKEM++SFDVLQSLVPNLSPKATRE IVSETIQFI+ L+KQLMRLEM
Subjt:  MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQ-KGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEM

Query:  KKKLPSESVIATMIPPSTNSDSSGG----GGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKI
        +KK P ESV  TM+ PSTNSDS GG    GGVIVS S NIVLFGI+ ASVRRGMVT+ILMAFER+QAEVLAANVAVSHGNL+LT+TASVHGY+EN IE+I
Subjt:  KKKLPSESVIATMIPPSTNSDSSGG----GGVIVSASGNIVLFGIL-ASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKI

Query:  KNDILSLNK
        +NDILSL K
Subjt:  KNDILSLNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G01260.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.1e-0422.95Show/hide
Query:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST
        V   E+ ++      R+      EA   ++  R +R+++NQ F  L+S+VPN+S K  + +++ + + +I+ L  +L  +E +++          +  S+
Subjt:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST

Query:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
        N   S    + V  SG  V   I   +     ++I  AFE ++ EV+ +N+ VS   +       +H ++  + E  K  ++S
Subjt:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS

AT1G01260.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.1e-0422.95Show/hide
Query:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST
        V   E+ ++      R+      EA   ++  R +R+++NQ F  L+S+VPN+S K  + +++ + + +I+ L  +L  +E +++          +  S+
Subjt:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST

Query:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
        N   S    + V  SG  V   I   +     ++I  AFE ++ EV+ +N+ VS   +       +H ++  + E  K  ++S
Subjt:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS

AT1G01260.3 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.1e-0422.95Show/hide
Query:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST
        V   E+ ++      R+      EA   ++  R +R+++NQ F  L+S+VPN+S K  + +++ + + +I+ L  +L  +E +++          +  S+
Subjt:  VRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIATMIPPST

Query:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS
        N   S    + V  SG  V   I   +     ++I  AFE ++ EV+ +N+ VS   +       +H ++  + E  K  ++S
Subjt:  NSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCCTGCACCGACTCTGTCATTCCACTTTTGCCGCAGATTTTGCCGGTCCGTGAATCTGAAGCCGCCGATCACAACAAGGCTTCCACCTCGAGAAAGCGTCGCAG
AGCCGATCTGGAGGCCGGCGGAGGTCTACAGAAAGGGAGAGCGAAGAGGAAGGAGATGAACCAGAGTTTCGATGTTCTTCAATCTCTCGTCCCCAATCTCTCTCCCAAGG
CCACGAGGGAGAATATTGTTTCCGAAACGATCCAGTTCATCGATTTTCTGGAGAAGCAGTTGATGAGGCTGGAAATGAAGAAGAAATTGCCATCGGAATCGGTGATCGCG
ACGATGATTCCGCCGAGTACGAACTCGGATTCATCCGGCGGAGGCGGCGTTATCGTCTCGGCCTCCGGCAACATCGTGTTGTTTGGGATTCTTGCTTCTGTTCGACGAGG
TATGGTGACACAGATTTTAATGGCGTTTGAAAGAAACCAGGCTGAAGTTCTTGCAGCAAATGTTGCAGTCAGCCATGGAAATTTAAGTTTGACAATCACGGCTTCTGTAC
ACGGTTACATTGAGAATGCCATAGAGAAGATTAAAAACGATATCCTGAGCTTAAACAAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCCTGCACCGACTCTGTCATTCCACTTTTGCCGCAGATTTTGCCGGTCCGTGAATCTGAAGCCGCCGATCACAACAAGGCTTCCACCTCGAGAAAGCGTCGCAG
AGCCGATCTGGAGGCCGGCGGAGGTCTACAGAAAGGGAGAGCGAAGAGGAAGGAGATGAACCAGAGTTTCGATGTTCTTCAATCTCTCGTCCCCAATCTCTCTCCCAAGG
CCACGAGGGAGAATATTGTTTCCGAAACGATCCAGTTCATCGATTTTCTGGAGAAGCAGTTGATGAGGCTGGAAATGAAGAAGAAATTGCCATCGGAATCGGTGATCGCG
ACGATGATTCCGCCGAGTACGAACTCGGATTCATCCGGCGGAGGCGGCGTTATCGTCTCGGCCTCCGGCAACATCGTGTTGTTTGGGATTCTTGCTTCTGTTCGACGAGG
TATGGTGACACAGATTTTAATGGCGTTTGAAAGAAACCAGGCTGAAGTTCTTGCAGCAAATGTTGCAGTCAGCCATGGAAATTTAAGTTTGACAATCACGGCTTCTGTAC
ACGGTTACATTGAGAATGCCATAGAGAAGATTAAAAACGATATCCTGAGCTTAAACAAG
Protein sequenceShow/hide protein sequence
MEACTDSVIPLLPQILPVRESEAADHNKASTSRKRRRADLEAGGGLQKGRAKRKEMNQSFDVLQSLVPNLSPKATRENIVSETIQFIDFLEKQLMRLEMKKKLPSESVIA
TMIPPSTNSDSSGGGGVIVSASGNIVLFGILASVRRGMVTQILMAFERNQAEVLAANVAVSHGNLSLTITASVHGYIENAIEKIKNDILSLNK