; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1836 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1836
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionTranscription factor bHLH61 isoform 1
Genome locationMC04:25581653..25583680
RNA-Seq ExpressionMC04g1836
SyntenyMC04g1836
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022134038.1 uncharacterized protein LOC111006410, partial [Momordica charantia]6.64e-97100Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG
        MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG

Query:  LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSE
        LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSE
Subjt:  LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSE

XP_022964721.1 uncharacterized protein LOC111464709 isoform X2 [Cucurbita moschata]8.42e-8588.46Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQD   VQNSIHPN  PMVTVE LVKGFSINVFSEKSCQGLLVSILE FE
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE

Query:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD
        ELGLNVLEARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWS++ EQD
Subjt:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida]1.58e-8688.89Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PM-VTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD   VQNSIHPNPL     PM VTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PM-VTVETLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD
        ILEVFEELGLNV+EARVSCTD+FQLQAI EI+E+GEEAIDAQAVKEAVVQAIKSW +S EQD
Subjt:  ILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD

XP_038884498.1 uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida]1.58e-8688.89Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PMVTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAVLHEKLQLLRSITNSHA LNKASIIVDASKYIEELKQKVERLNQD   VQNSIHPNPL     PMVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD
        ILEVFEELGLNV+EARVSCTD+FQLQAI EI+E+GEEAIDAQAVKEAVVQAIKSW +S EQD
Subjt:  ILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida]2.27e-8889.44Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PMVTVETLVKGFSINVFSEKSCQGLLVSI
        MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD   VQNSIHPNPL     PMVTVE LVKGFSINVFSEKSCQGLLVSI
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PMVTVETLVKGFSINVFSEKSCQGLLVSI

Query:  LEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD
        LEVFEELGLNV+EARVSCTD+FQLQAI EI+E+GEEAIDAQAVKEAVVQAIKSW +S EQD
Subjt:  LEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD

TrEMBL top hitse value%identityAlignment
A0A6J1C0X4 uncharacterized protein LOC1110064103.21e-97100Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG
        MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG

Query:  LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSE
        LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSE
Subjt:  LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSE

A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X12.44e-8487.18Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQD   VQNSIHPN    VTVE LVKGFSINVFSEKSCQGLLVSILE FE
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE

Query:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD
        ELGLNVLEARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWS++ EQD
Subjt:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD

A0A6J1HJR0 uncharacterized protein LOC111464709 isoform X24.08e-8588.46Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQD   VQNSIHPN  PMVTVE LVKGFSINVFSEKSCQGLLVSILE FE
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE

Query:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD
        ELGLNVLEARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWS++ EQD
Subjt:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD

A0A6J1JTS8 uncharacterized protein LOC111487778 isoform X24.08e-8588.46Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQD   VQNSIHPN  PMVTVE LVKGFSINVFSEKSCQGLLVSILE FE
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE

Query:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD
        ELGLNVLEARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWS++ EQD
Subjt:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X12.44e-8487.18Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQD   VQNSIHPN    VTVE LVKGFSINVFSEKSCQGLLVSILE FE
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE

Query:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD
        ELGLNVLEARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWS++ EQD
Subjt:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD

SwissProt top hitse value%identityAlignment
Q9LSL1 Transcription factor bHLH938.5e-0426.71Show/hide
Query:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQ-----NSIHPNPL-----PMVTVETLVKG---FSINVFSEKS--
        +++   ++  L+++L +LRSI      +++ SI+ DA  Y++EL  K+ +L  + Q     N+ H + L      +   E LV+    F I+   E +  
Subjt:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQ-----NSIHPNPL-----PMVTVETLVKG---FSINVFSEKS--

Query:  ---CQ---GLLVSILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV
           C    GLL+S +   E LGL + +  +SC   F LQA      +  + I ++ +K+A+
Subjt:  ---CQ---GLLVSILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV

Q9LXA9 Transcription factor bHLH613.1e-0627.74Show/hide
Query:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQN-SIHPNPLPMVTVETLVKGF--------SINVFSEKSC---QG
        +++   ++  L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D Q    + +   ++T E++V+           +N   +  C    G
Subjt:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQN-SIHPNPLPMVTVETLVKGF--------SINVFSEKSC---QG

Query:  LLVSILEVFEELGLNVLEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA   E+ EQ    + ++A K+A+++
Subjt:  LLVSILEVFEELGLNVLEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ

Arabidopsis top hitse value%identityAlignment
AT1G29270.1 unknown protein3.7e-1034.38Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITN-SHALNKASIIV-DASKYIEELKQKVERLNQDVQN-------SIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVS
        MV+ E KK     K   L+++T+   ++++ S+++ +A  YI  LK ++E L ++ ++       S+H      V VE + + F + + S +  +  LV+
Subjt:  MVSREHKKAVLHEKLQLLRSITN-SHALNKASIIV-DASKYIEELKQKVERLNQDVQN-------SIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVLEARVSCTDSFQLQAI
        ILE FEE+GLNV +AR SC DSF ++AI
Subjt:  ILEVFEELGLNVLEARVSCTDSFQLQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)2.5e-4364.1Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPN--PLPMVTVETLVKGFSINVFSEKSCQGLLVSILEV
        MVSRE K+  L EK QLLRSITNSHA N  SII+DASKYI++LKQKVER NQD    Q+S  P     PMVTVETL KGF INVFS K+  G+LVS+LE 
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPN--PLPMVTVETLVKGFSINVFSEKSCQGLLVSILEV

Query:  FEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSE
        FE++GLNVLEAR SCTDSF L A+G  +E GE  +DA+AVK+AV  AI+SW E ++
Subjt:  FEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSE

AT3G56220.1 transcription regulators1.3e-3958.97Show/hide
Query:  MVSREHKK-AVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLN------QDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSIL
        MVSREHK+ + L EK  LLRSIT+SHA ++ SIIVDASKYI++LKQKVE++N      Q  + S  PN  PMVTVETL KGF I V S K+  G+LV +L
Subjt:  MVSREHKK-AVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLN------QDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSIL

Query:  EVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        E FE+LGL+V+EARVSCTD+F L AIG  +    + IDA+AVK+AV +AI++WS+S
Subjt:  EVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.2e-0727.74Show/hide
Query:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQN-SIHPNPLPMVTVETLVKGF--------SINVFSEKSC---QG
        +++   ++  L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D Q    + +   ++T E++V+           +N   +  C    G
Subjt:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQN-SIHPNPLPMVTVETLVKGF--------SINVFSEKSC---QG

Query:  LLVSILEVFEELGLNVLEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA   E+ EQ    + ++A K+A+++
Subjt:  LLVSILEVFEELGLNVLEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ

AT5G65640.1 beta HLH protein 936.1e-0526.71Show/hide
Query:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQ-----NSIHPNPL-----PMVTVETLVKG---FSINVFSEKS--
        +++   ++  L+++L +LRSI      +++ SI+ DA  Y++EL  K+ +L  + Q     N+ H + L      +   E LV+    F I+   E +  
Subjt:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQ-----NSIHPNPL-----PMVTVETLVKG---FSINVFSEKS--

Query:  ---CQ---GLLVSILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV
           C    GLL+S +   E LGL + +  +SC   F LQA      +  + I ++ +K+A+
Subjt:  ---CQ---GLLVSILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCTAGAGAGCACAAGAAGGCAGTTCTGCATGAGAAGCTCCAATTACTTCGTTCAATTACCAACTCTCATGCTCTAAACAAGGCCTCGATAATAGTGGATGCATC
AAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGACGTTCAAAATTCAATCCACCCAAATCCACTTCCCATGGTTACAGTGGAAACCCTAGTGAAGG
GATTTTCTATAAATGTATTTTCAGAAAAGAGCTGCCAAGGCCTCCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTTAATGTTCTTGAAGCTAGGGTTTCCTGT
ACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGACGAACAAGGAGAGGAAGCCATTGACGCTCAAGCTGTGAAAGAAGCAGTCGTTCAAGCTATAAAGAGCTGGAG
CGAAAGCAGTGAACAAGATTAG
mRNA sequenceShow/hide mRNA sequence
AAATTTCTTGTCCACAATTGTCACAAGTAGTACTACTACTTTAAGTTACTATATACATTACTACATACGCATTAGGGTTAATTTGTTGGTTTGTGTGTGGAAGATTGAGA
TGTGGTTTTATTTGAATTGTACAAACTGTAAAGTGAGCATACACTTGTTATTGTTAATGGGGAATGATTCATTGAGGAAAAAAACAACTGAAAAAAAAAAGAAAAGAATG
TTAAGTAGGAAAAGAAAAATGATAGAAGGGCAATTTAGTAAGTAGAGTAGTTGGCATGGTATTTTATATATATTTGGTTGTAAAGGAGAGCAGTCATGTGGGAAGCTAGC
AAACAGAAGATCATATAGATTTGAGGGCTCATCATGGCCTTATAAAAACACCCTCAAAGTTTTTTCTTTTTTTGCAGAGATACTAAAGTGGAACAAAGAGAGAAAAAAGA
TAGAAGAAGAAGAAGAAGGCTCATAAAAGAGGAATAAGGAGGCTGTGAAAAAAAAAAAAAAAGGATTGAAAAAAGAAGGTTGAATCATCCATATCCATGGTTTCTAGAGA
GCACAAGAAGGCAGTTCTGCATGAGAAGCTCCAATTACTTCGTTCAATTACCAACTCTCATGCTCTAAACAAGGCCTCGATAATAGTGGATGCATCAAAATATATCGAGG
AGCTAAAACAGAAAGTAGAAAGATTGAATCAAGACGTTCAAAATTCAATCCACCCAAATCCACTTCCCATGGTTACAGTGGAAACCCTAGTGAAGGGATTTTCTATAAAT
GTATTTTCAGAAAAGAGCTGCCAAGGCCTCCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTTAATGTTCTTGAAGCTAGGGTTTCCTGTACTGATAGTTTCCA
ATTACAAGCTATTGGAGAAATTGACGAACAAGGAGAGGAAGCCATTGACGCTCAAGCTGTGAAAGAAGCAGTCGTTCAAGCTATAAAGAGCTGGAGCGAAAGCAGTGAAC
AAGATTAGAAGAAGGAAAGAAAAAAAAAATCCTTAATTTCTCCAACTTGTTCCATTCTCGGCATTTCTTTCTTCTTCTTCTTTTTTTTTGGGGGGGGGGGNGGGGGGGTC
TTACCCCTGATCTTTGCTTGTAATGTTTAGTAAAAATTAATCAATGGAAAACCAATTGATTAATCTTCTCCAAAAATAAAGGAATCGAAAATTGAATATATAAACA
Protein sequenceShow/hide protein sequence
MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVLEARVSC
TDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSESSEQD