CuGenDBv2

Sed0025063 (gene) of Chayote v1 genome

Gene ID: Sed0025063
Organism: Sechium edule (Chayote v1)
Description: Transcription factor bHLH61-like protein
Genome location: LG11:35079889..35081607
RNA-Seq Expression: Sed0025063
Synteny: Sed0025063
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains: IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_022134038.1 uncharacterized protein LOC111006410, partial [Momordica charantia]; e value: 1.5e-61; % identity: 89.61
Query:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ-VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSILEV
        MVSREHKK  LHEKLQLLRSITNSHALNKASIIVDAS+YIEELK KVERLNQ VQNS HPNPL    PM VTVE+L KGFSINVFSEKSCQGLLVSILEV
Subjt:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ-VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSILEV

Query:  FEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQS
        FEELGLNV+EARVSCTDSFQLQA GEIDEQGEEAIDAQAVKEAVVQAIKSWS+S
Subjt:  FEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQS

XP_038884496.1 uncharacterized protein LOC120075304 isoform X1 [Benincasa hispida]; e value: 3.6e-63; % identity: 86.5
Query:  MVSREHKKETLHEKLQLLRSITNSHA-LNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHH-PMQVTVESLAKGFSINVFSEKSCQGLLV
        MVSREHKK  LHEKLQLLRSITNSHA LNKASIIVDAS+YIEELK KVERLNQ    VQNS HPNPLSH + PMQVTVE L KGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKETLHEKLQLLRSITNSHA-LNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHH-PMQVTVESLAKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD
        SILEVFEELGLNVIEARVSCTD+FQLQA  EI+E+GEEAIDAQAVKEAVVQAIKSW QSG QD
Subjt:  SILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida]; e value: 1.5e-64; % identity: 87.04
Query:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHH-PMQVTVESLAKGFSINVFSEKSCQGLLVS
        MVSREHKK  LHEKLQLLRSITNSHALNKASIIVDAS+YIEELK KVERLNQ    VQNS HPNPLSH + PMQVTVE L KGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHH-PMQVTVESLAKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD
        ILEVFEELGLNVIEARVSCTD+FQLQA  EI+E+GEEAIDAQAVKEAVVQAIKSW QSG QD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD

XP_038884498.1 uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida]; e value: 3.0e-62; % identity: 85.19
Query:  MVSREHKKETLHEKLQLLRSITNSHA-LNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVS
        MVSREHKK  LHEKLQLLRSITNSHA LNKASIIVDAS+YIEELK KVERLNQ    VQNS HPNPLSH +   VTVE L KGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKETLHEKLQLLRSITNSHA-LNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD
        ILEVFEELGLNVIEARVSCTD+FQLQA  EI+E+GEEAIDAQAVKEAVVQAIKSW QSG QD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida]; e value: 1.2e-63; % identity: 85.71
Query:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSI
        MVSREHKK  LHEKLQLLRSITNSHALNKASIIVDAS+YIEELK KVERLNQ    VQNS HPNPLSH +   VTVE L KGFSINVFSEKSCQGLLVSI
Subjt:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSI

Query:  LEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD
        LEVFEELGLNVIEARVSCTD+FQLQA  EI+E+GEEAIDAQAVKEAVVQAIKSW QSG QD
Subjt:  LEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C0X4 uncharacterized protein LOC111006410; e value: 7.3e-62; % identity: 89.61
Query:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ-VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSILEV
        MVSREHKK  LHEKLQLLRSITNSHALNKASIIVDAS+YIEELK KVERLNQ VQNS HPNPL    PM VTVE+L KGFSINVFSEKSCQGLLVSILEV
Subjt:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ-VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSILEV

Query:  FEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQS
        FEELGLNV+EARVSCTDSFQLQA GEIDEQGEEAIDAQAVKEAVVQAIKSWS+S
Subjt:  FEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQS

A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X1; e value: 2.8e-61; % identity: 83.23
Query:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSI
        MVSREHKK  LHEKLQLLRSITNSHALNK SIIVDAS+YIEELK KVERLNQ    VQNS HPN     HPMQVTVE+L KGFSINVFSEKSCQGLLVSI
Subjt:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSI

Query:  LEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD
        LE FEELGLNV+EARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWSQ+G QD
Subjt:  LEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD

A0A6J1HJR0 uncharacterized protein LOC111464709 isoform X2; e value: 2.6e-59; % identity: 82.61
Query:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSI
        MVSREHKK  LHEKLQLLRSITNSHALNK SIIVDAS+YIEELK KVERLNQ    VQNS HPN     HPM VTVE+L KGFSINVFSEKSCQGLLVSI
Subjt:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSI

Query:  LEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD
        LE FEELGLNV+EARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWSQ+G QD
Subjt:  LEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD

A0A6J1JTS8 uncharacterized protein LOC111487778 isoform X2; e value: 2.6e-59; % identity: 82.61
Query:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSI
        MVSREHKK  LHEKLQLLRSITNSHALNK SIIVDAS+YIEELK KVERLNQ    VQNS HPN     HPM VTVE+L KGFSINVFSEKSCQGLLVSI
Subjt:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSI

Query:  LEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD
        LE FEELGLNV+EARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWSQ+G QD
Subjt:  LEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X1; e value: 2.8e-61; % identity: 83.23
Query:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSI
        MVSREHKK  LHEKLQLLRSITNSHALNK SIIVDAS+YIEELK KVERLNQ    VQNS HPN     HPMQVTVE+L KGFSINVFSEKSCQGLLVSI
Subjt:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ----VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSI

Query:  LEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD
        LE FEELGLNV+EARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIKSWSQ+G QD
Subjt:  LEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD

SwissProt top hits (e value, % identity, alignment)
Q9LSL1 Transcription factor bHLH93; e value: 2.3e-04; % identity: 24.84
Query:  MVSREHKKETLHEKLQLLRSIT-NSHALNKASIIVDASRYIEELKHKVERLNQVQNSSHPNPLSHHHPMQVTVESL---------AKGFSINVFSEKS--
        +++   +++ L+++L +LRSI      +++ SI+ DA  Y++EL  K+ +L   +     +  SHH  +   ++ L         +  F I+   E +  
Subjt:  MVSREHKKETLHEKLQLLRSIT-NSHALNKASIIVDASRYIEELKHKVERLNQVQNSSHPNPLSHHHPMQVTVESL---------AKGFSINVFSEKS--

Query:  ---CQ---GLLVSILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAV
           C    GLL+S +   E LGL + +  +SC   F LQA+     +  + I ++ +K+A+
Subjt:  ---CQ---GLLVSILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAV

Q9LXA9 Transcription factor bHLH61; e value: 1.6e-05; % identity: 26.58
Query:  MVSREHKKETLHEKLQLLRSIT-NSHALNKASIIVDASRYIEELKHKVERLNQVQNSSHPNPLSHHHPMQVTVESLAKGF--------SINVFSEKSC--
        +++   +++ L+++L LLRSI      +++ SI+ DA  Y++EL   ++++N++Q        + H    +T ES+ +           +N   +  C  
Subjt:  MVSREHKKETLHEKLQLLRSIT-NSHALNKASIIVDASRYIEELKHKVERLNQVQNSSHPNPLSHHHPMQVTVESLAKGF--------SINVFSEKSC--

Query:  -QGLLVSILEVFEELGLNVIEARVSCTDSFQLQAT-GEIDEQGEEAIDAQAVKEAVVQ
          GL+VS +   E LGL + +  +SC   F LQA+  E+ EQ    + ++A K+A+++
Subjt:  -QGLLVSILEVFEELGLNVIEARVSCTDSFQLQAT-GEIDEQGEEAIDAQAVKEAVVQ

Arabidopsis top hits (e value, % identity, alignment)
AT1G29270.1 unknown protein; e value: 4.9e-10; % identity: 34.4
Query:  MVSREHKKETLHEKLQLLRSITN-SHALNKASIIV-DASRYIEELKHKVERL-NQVQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSIL
        MV+ E KK     K   L+++T+   ++++ S+++ +A  YI  LK ++E L  + ++       S H   +V VE + + F + + S +  +  LV+IL
Subjt:  MVSREHKKETLHEKLQLLRSITN-SHALNKASIIV-DASRYIEELKHKVERL-NQVQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSIL

Query:  EVFEELGLNVIEARVSCTDSFQLQA
        E FEE+GLNV +AR SC DSF ++A
Subjt:  EVFEELGLNVIEARVSCTDSFQLQA

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1); e value: 1.7e-39; % identity: 60.39
Query:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ--VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSILE
        MVSRE K+ +L EK QLLRSITNSHA N  SII+DAS+YI++LK KVER NQ      S   P     PM VTVE+L KGF INVFS K+  G+LVS+LE
Subjt:  MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQ--VQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSILE

Query:  VFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQ
         FE++GLNV+EAR SCTDSF L A G  +E GE  +DA+AVK+AV  AI+SW +
Subjt:  VFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQ

AT3G56220.1 transcription regulators; e value: 4.7e-37; % identity: 54.04
Query:  MVSREHKK-ETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQV-------QNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGL
        MVSREHK+  +L EK  LLRSIT+SHA ++ SIIVDAS+YI++LK KVE++N         + SS PNP+       VTVE+L KGF I V S K+  G+
Subjt:  MVSREHKK-ETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQV-------QNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGL

Query:  LVSILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQS
        LV +LE FE+LGL+V+EARVSCTD+F L A G  +    + IDA+AVK+AV +AI++WS S
Subjt:  LVSILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQS

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein; e value: 1.1e-06; % identity: 26.58
Query:  MVSREHKKETLHEKLQLLRSIT-NSHALNKASIIVDASRYIEELKHKVERLNQVQNSSHPNPLSHHHPMQVTVESLAKGF--------SINVFSEKSC--
        +++   +++ L+++L LLRSI      +++ SI+ DA  Y++EL   ++++N++Q        + H    +T ES+ +           +N   +  C  
Subjt:  MVSREHKKETLHEKLQLLRSIT-NSHALNKASIIVDASRYIEELKHKVERLNQVQNSSHPNPLSHHHPMQVTVESLAKGF--------SINVFSEKSC--

Query:  -QGLLVSILEVFEELGLNVIEARVSCTDSFQLQAT-GEIDEQGEEAIDAQAVKEAVVQ
          GL+VS +   E LGL + +  +SC   F LQA+  E+ EQ    + ++A K+A+++
Subjt:  -QGLLVSILEVFEELGLNVIEARVSCTDSFQLQAT-GEIDEQGEEAIDAQAVKEAVVQ

AT5G65640.1 beta HLH protein 93; e value: 1.6e-05; % identity: 24.84
Query:  MVSREHKKETLHEKLQLLRSIT-NSHALNKASIIVDASRYIEELKHKVERLNQVQNSSHPNPLSHHHPMQVTVESL---------AKGFSINVFSEKS--
        +++   +++ L+++L +LRSI      +++ SI+ DA  Y++EL  K+ +L   +     +  SHH  +   ++ L         +  F I+   E +  
Subjt:  MVSREHKKETLHEKLQLLRSIT-NSHALNKASIIVDASRYIEELKHKVERLNQVQNSSHPNPLSHHHPMQVTVESL---------AKGFSINVFSEKS--

Query:  ---CQ---GLLVSILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAV
           C    GLL+S +   E LGL + +  +SC   F LQA+     +  + I ++ +K+A+
Subjt:  ---CQ---GLLVSILEVFEELGLNVIEARVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAV


Sequences
CDS sequence
ATGGTTTCTAGAGAGCACAAGAAGGAAACTTTGCATGAAAAGCTTCAATTACTTCGTTCTATTACCAATTCTCATGCTCTAAACAAGGCCTCGATTATAGTGGATGCATC
AAGATATATTGAGGAGCTAAAACACAAAGTTGAAAGATTGAATCAAGTTCAAAATTCAAGCCACCCAAATCCACTTTCTCATCATCATCCCATGCAGGTTACAGTGGAAT
CCTTAGCAAAGGGATTTTCTATAAATGTATTTTCAGAAAAAAGCTGTCAAGGCCTCCTTGTCTCAATATTAGAAGTCTTTGAAGAATTGGGACTTAATGTTATTGAAGCT
AGGGTTTCTTGTACTGATAGTTTCCAATTACAAGCTACTGGAGAAATTGATGAACAAGGAGAAGAAGCCATTGATGCTCAAGCTGTGAAAGAAGCTGTAGTTCAAGCTAT
AAAGAGCTGGAGCCAAAGCGGTGGACAAGATTAA
mRNA sequence
AAAAAAGAATCCATGGTTTCTAGAGAGCACAAGAAGGAAACTTTGCATGAAAAGCTTCAATTACTTCGTTCTATTACCAATTCTCATGCTCTAAACAAGGCCTCGATTAT
AGTGGATGCATCAAGATATATTGAGGAGCTAAAACACAAAGTTGAAAGATTGAATCAAGTTCAAAATTCAAGCCACCCAAATCCACTTTCTCATCATCATCCCATGCAGG
TTACAGTGGAATCCTTAGCAAAGGGATTTTCTATAAATGTATTTTCAGAAAAAAGCTGTCAAGGCCTCCTTGTCTCAATATTAGAAGTCTTTGAAGAATTGGGACTTAAT
GTTATTGAAGCTAGGGTTTCTTGTACTGATAGTTTCCAATTACAAGCTACTGGAGAAATTGATGAACAAGGAGAAGAAGCCATTGATGCTCAAGCTGTGAAAGAAGCTGT
AGTTCAAGCTATAAAGAGCTGGAGCCAAAGCGGTGGACAAGATTAAAAAAAAATCAAATCCTTAATTAATTTCTCCATTTTTTTTCAATCGCAGCCTTTTTTTTTCTTCT
TATTTATCTCTGTTTGTATTGTAGTAAAAATTAATCAATGGAAAAAACCGATTGATTAATCGTTCAAAATAATGGAAAACTCCTCACCTTTTCCACACATGGCC
Protein sequence
MVSREHKKETLHEKLQLLRSITNSHALNKASIIVDASRYIEELKHKVERLNQVQNSSHPNPLSHHHPMQVTVESLAKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEA
RVSCTDSFQLQATGEIDEQGEEAIDAQAVKEAVVQAIKSWSQSGGQD
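
The CDS and protein sequences above should correspond under the standard genetic code. The following is a minimal sketch of how that could be checked, not part of the CuGenDB record: the function name `translate` and the table construction are illustrative assumptions, and it assumes the standard nuclear codon table applies and that the input is an uppercase or lowercase DNA string in frame from its first base.

```python
from itertools import product

# Standard genetic code, built by pairing codons in TCAG order with the
# conventional 64-letter amino acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in as-is.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS sequence listed above.
print(translate("ATGGTTTCTAGAGAGCACAAGAAG"))  # prints MVSREHKK
```

Passing the full CDS sequence to `translate` and comparing the result with the listed protein sequence would confirm whether the two entries agree.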