; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003371 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003371
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTranscription factor bHLH61 isoform 1
Genome locationscaffold234:2402987..2404271
RNA-Seq ExpressionMS003371
SyntenyMS003371
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022134038.1 uncharacterized protein LOC111006410, partial [Momordica charantia]9.0e-72100Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG
        MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG

Query:  LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
Subjt:  LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

XP_022964721.1 uncharacterized protein LOC111464709 isoform X2 [Cucurbita moschata]6.4e-6288.82Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQD   VQNSIHPN  PMVTVE LVKGFSINVFSEKSCQGLLVSILE FE
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE

Query:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        ELGLNVLEARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWS++
Subjt:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida]2.6e-6389.24Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PM-VTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD   VQNSIHPNPL     PM VTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PM-VTVETLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        ILEVFEELGLNV+EARVSCTD+FQLQAI EI+E+GEEAIDAQAVKEAVVQAIKSW +S
Subjt:  ILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

XP_038884498.1 uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida]2.6e-6389.24Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PMVTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAVLHEKLQLLRSITNSHA LNKASIIVDASKYIEELKQKVERLNQD   VQNSIHPNPL     PMVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        ILEVFEELGLNV+EARVSCTD+FQLQAI EI+E+GEEAIDAQAVKEAVVQAIKSW +S
Subjt:  ILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida]1.1e-6489.81Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PMVTVETLVKGFSINVFSEKSCQGLLVSI
        MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD   VQNSIHPNPL     PMVTVE LVKGFSINVFSEKSCQGLLVSI
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPL-----PMVTVETLVKGFSINVFSEKSCQGLLVSI

Query:  LEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        LEVFEELGLNV+EARVSCTD+FQLQAI EI+E+GEEAIDAQAVKEAVVQAIKSW +S
Subjt:  LEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

TrEMBL top hitse value%identityAlignment
A0A6J1C0X4 uncharacterized protein LOC1110064104.3e-72100Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG
        MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELG

Query:  LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
Subjt:  LNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X11.2e-6187.5Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQD   VQNSIHPN    VTVE LVKGFSINVFSEKSCQGLLVSILE FE
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE

Query:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        ELGLNVLEARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWS++
Subjt:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

A0A6J1HJR0 uncharacterized protein LOC111464709 isoform X23.1e-6288.82Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQD   VQNSIHPN  PMVTVE LVKGFSINVFSEKSCQGLLVSILE FE
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE

Query:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        ELGLNVLEARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWS++
Subjt:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

A0A6J1JTS8 uncharacterized protein LOC111487778 isoform X23.1e-6288.82Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQD   VQNSIHPN  PMVTVE LVKGFSINVFSEKSCQGLLVSILE FE
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE

Query:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        ELGLNVLEARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWS++
Subjt:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X11.2e-6187.5Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQD   VQNSIHPN    VTVE LVKGFSINVFSEKSCQGLLVSILE FE
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFE

Query:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        ELGLNVLEARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWS++
Subjt:  ELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

SwissProt top hitse value%identityAlignment
Q9LSL1 Transcription factor bHLH936.4e-0426.71Show/hide
Query:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQ-----NSIHPNPL-----PMVTVETLVKG---FSINVFSEKS--
        +++   ++  L+++L +LRSI      +++ SI+ DA  Y++EL  K+ +L  + Q     N+ H + L      +   E LV+    F I+   E +  
Subjt:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQ-----NSIHPNPL-----PMVTVETLVKG---FSINVFSEKS--

Query:  ---CQ---GLLVSILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV
           C    GLL+S +   E LGL + +  +SC   F LQA      +  + I ++ +K+A+
Subjt:  ---CQ---GLLVSILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV

Q9LXA9 Transcription factor bHLH613.1e-0627.74Show/hide
Query:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQN-SIHPNPLPMVTVETLVKGF--------SINVFSEKSC---QG
        +++   ++  L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D Q    + +   ++T E++V+           +N   +  C    G
Subjt:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQN-SIHPNPLPMVTVETLVKGF--------SINVFSEKSC---QG

Query:  LLVSILEVFEELGLNVLEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA   E+ EQ    + ++A K+A+++
Subjt:  LLVSILEVFEELGLNVLEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ

Arabidopsis top hitse value%identityAlignment
AT1G29270.1 unknown protein3.6e-1034.38Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITN-SHALNKASIIV-DASKYIEELKQKVERLNQDVQN-------SIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVS
        MV+ E KK     K   L+++T+   ++++ S+++ +A  YI  LK ++E L ++ ++       S+H      V VE + + F + + S +  +  LV+
Subjt:  MVSREHKKAVLHEKLQLLRSITN-SHALNKASIIV-DASKYIEELKQKVERLNQDVQN-------SIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVLEARVSCTDSFQLQAI
        ILE FEE+GLNV +AR SC DSF ++AI
Subjt:  ILEVFEELGLNVLEARVSCTDSFQLQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)1.9e-4365.36Show/hide
Query:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPN--PLPMVTVETLVKGFSINVFSEKSCQGLLVSILEV
        MVSRE K+  L EK QLLRSITNSHA N  SII+DASKYI++LKQKVER NQD    Q+S  P     PMVTVETL KGF INVFS K+  G+LVS+LE 
Subjt:  MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD---VQNSIHPN--PLPMVTVETLVKGFSINVFSEKSCQGLLVSILEV

Query:  FEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSE
        FE++GLNVLEAR SCTDSF L A+G  +E GE  +DA+AVK+AV  AI+SW E
Subjt:  FEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSE

AT3G56220.1 transcription regulators9.6e-4058.97Show/hide
Query:  MVSREHKK-AVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLN------QDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSIL
        MVSREHK+ + L EK  LLRSIT+SHA ++ SIIVDASKYI++LKQKVE++N      Q  + S  PN  PMVTVETL KGF I V S K+  G+LV +L
Subjt:  MVSREHKK-AVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLN------QDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSIL

Query:  EVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES
        E FE+LGL+V+EARVSCTD+F L AIG  +    + IDA+AVK+AV +AI++WS+S
Subjt:  EVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.2e-0727.74Show/hide
Query:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQN-SIHPNPLPMVTVETLVKGF--------SINVFSEKSC---QG
        +++   ++  L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D Q    + +   ++T E++V+           +N   +  C    G
Subjt:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQN-SIHPNPLPMVTVETLVKGF--------SINVFSEKSC---QG

Query:  LLVSILEVFEELGLNVLEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA   E+ EQ    + ++A K+A+++
Subjt:  LLVSILEVFEELGLNVLEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ

AT5G65640.1 beta HLH protein 934.5e-0526.71Show/hide
Query:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQ-----NSIHPNPL-----PMVTVETLVKG---FSINVFSEKS--
        +++   ++  L+++L +LRSI      +++ SI+ DA  Y++EL  K+ +L  + Q     N+ H + L      +   E LV+    F I+   E +  
Subjt:  MVSREHKKAVLHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDVQ-----NSIHPNPL-----PMVTVETLVKG---FSINVFSEKS--

Query:  ---CQ---GLLVSILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV
           C    GLL+S +   E LGL + +  +SC   F LQA      +  + I ++ +K+A+
Subjt:  ---CQ---GLLVSILEVFEELGLNVLEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCTAGAGAGCACAAGAAGGCAGTTCTGCATGAGAAGCTCCAATTACTTCGTTCAATTACCAACTCTCATGCTCTAAACAAGGCCTCGATAATAGTGGATGCATC
AAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGACGTTCAAAATTCAATCCACCCAAATCCACTTCCCATGGTTACAGTGGAAACCCTAGTGAAGG
GATTTTCTATAAATGTATTTTCAGAAAAGAGCTGCCAAGGCCTCCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTTAATGTTCTTGAAGCTAGGGTTTCCTGT
ACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGACGAACAAGGAGAGGAAGCCATTGACGCTCAAGCTGTGAAAGAAGCAGTCGTTCAAGCTATAAAGAGCTGGAG
CGAAAGC
mRNA sequenceShow/hide mRNA sequence
ATGGTTTCTAGAGAGCACAAGAAGGCAGTTCTGCATGAGAAGCTCCAATTACTTCGTTCAATTACCAACTCTCATGCTCTAAACAAGGCCTCGATAATAGTGGATGCATC
AAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGACGTTCAAAATTCAATCCACCCAAATCCACTTCCCATGGTTACAGTGGAAACCCTAGTGAAGG
GATTTTCTATAAATGTATTTTCAGAAAAGAGCTGCCAAGGCCTCCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTTAATGTTCTTGAAGCTAGGGTTTCCTGT
ACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGACGAACAAGGAGAGGAAGCCATTGACGCTCAAGCTGTGAAAGAAGCAGTCGTTCAAGCTATAAAGAGCTGGAG
CGAAAGC
Protein sequenceShow/hide protein sequence
MVSREHKKAVLHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDVQNSIHPNPLPMVTVETLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVLEARVSC
TDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKSWSES