CuGenDBv2

Lag0030247 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030247
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transcription factor bHLH61 isoform 1
Genome location: chr8:45742322..45743773
RNA-Seq Expression: Lag0030247
Synteny: Lag0030247
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains: IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology
GenBank top hits: e value, % identity, alignment
XP_022964721.1 uncharacterized protein LOC111464709 isoform X2 [Cucurbita moschata] (e value: 1.7e-68; % identity: 88.27)
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN       HPMVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        +LEAFEELGLNV+EARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD
Subjt:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
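
The % identity reported for this hit can be reproduced from the alignment itself. The sketch below assumes the usual BLAST convention: letters on the middle line mark identical residues, "+" marks conservative substitutions, blanks mark mismatches, and identity is the number of identical positions divided by the alignment length. The function name percent_identity and the constants QUERY and MIDLINE are illustrative and not part of CuGenDB.

```python
def percent_identity(query_blocks, midline_blocks):
    """Percent identity = identical positions / alignment length * 100.

    query_blocks   -- the "Query:" rows of the alignment (gaps included)
    midline_blocks -- the unlabeled rows between Query and Subjt
    """
    # Alignment length is the total number of aligned columns.
    aligned = sum(len(q) for q in query_blocks)
    # A letter on the midline means query and subject residues are identical.
    identical = sum(ch.isalpha() for m in midline_blocks for ch in m)
    return round(100.0 * identical / aligned, 2)


# Rows copied from the XP_022964721.1 alignment above.
QUERY = [
    "MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS",
    "VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD",
]
MIDLINE = [
    "MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN       HPMVTVE LVKGFSINVFSEKSCQGLLVS",
    "+LEAFEELGLNV+EARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD",
]

if __name__ == "__main__":
    print(percent_identity(QUERY, MIDLINE))  # 88.27 (143 identical of 162 columns)
```

Counting only letters makes the calculation insensitive to how many blanks the page extraction preserved on the middle line.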

XP_038884496.1 uncharacterized protein LOC120075304 isoform X1 [Benincasa hispida] (e value: 7.7e-69; % identity: 88.96)
Query:  MVSREHKKAALHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLV
        MVSREHKKA LHEKLQLLRSITNSHA LNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPNPLSH +    VTVE LVKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLV

Query:  SVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        S+LE FEELGLNVIEARVSCTD+FQLQAI EI+E+GEEAIDAQAVKEAVVQAIK+W QSGEQD
Subjt:  SVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida] (e value: 3.1e-70; % identity: 89.51)
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPNPLSH +    VTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        +LE FEELGLNVIEARVSCTD+FQLQAI EI+E+GEEAIDAQAVKEAVVQAIK+W QSGEQD
Subjt:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

XP_038884498.1 uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida] (e value: 2.0e-69; % identity: 90.18)
Query:  MVSREHKKAALHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLV
        MVSREHKKA LHEKLQLLRSITNSHA LNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPNPLS H + PMVTVE LVKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHA-LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLV

Query:  SVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        S+LE FEELGLNVIEARVSCTD+FQLQAI EI+E+GEEAIDAQAVKEAVVQAIK+W QSGEQD
Subjt:  SVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida] (e value: 8.3e-71; % identity: 90.74)
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPNPLS H + PMVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        +LE FEELGLNVIEARVSCTD+FQLQAI EI+E+GEEAIDAQAVKEAVVQAIK+W QSGEQD
Subjt:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

TrEMBL top hits: e value, % identity, alignment
A0A6J1C0X4 uncharacterized protein LOC111006410 (e value: 4.6e-67; % identity: 90)
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQD   VQNSIHPNPL      PMVTVETLVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGE
        +LE FEELGLNV+EARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIK+WS+S E
Subjt:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGE

A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X1 (e value: 2.1e-67; % identity: 87.73)
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPM-VTVETLVKGFSINVFSEKSCQGLLV
        MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN       HPM VTVE LVKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPM-VTVETLVKGFSINVFSEKSCQGLLV

Query:  SVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        S+LEAFEELGLNV+EARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD
Subjt:  SVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

A0A6J1HJR0 uncharacterized protein LOC111464709 isoform X2 (e value: 8.3e-69; % identity: 88.27)
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN       HPMVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        +LEAFEELGLNV+EARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD
Subjt:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

A0A6J1JTS8 uncharacterized protein LOC111487778 isoform X2 (e value: 8.3e-69; % identity: 88.27)
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN       HPMVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        +LEAFEELGLNV+EARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD
Subjt:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X1 (e value: 2.1e-67; % identity: 87.73)
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPM-VTVETLVKGFSINVFSEKSCQGLLV
        MVSREHKKAALHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN       HPM VTVE LVKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPM-VTVETLVKGFSINVFSEKSCQGLLV

Query:  SVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        S+LEAFEELGLNV+EARVSCTD+FQLQA  EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD
Subjt:  SVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

SwissProt top hits: e value, % identity, alignment
Q9LSL1 Transcription factor bHLH93 (e value: 5.3e-04; % identity: 26.67)
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHH-----HPMVTVETLVKG---FSINVFSE
        +++   ++  L+++L +LRSI      +++ SI+ DA  Y++EL  K+ +L  +   + NS +    SHH         +   E LV+    F I+   E
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHH-----HPMVTVETLVKG---FSINVFSE

Query:  KS-----CQ---GLLVSVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV
         +     C    GLL+S +   E LGL + +  +SC   F LQA      +  + I ++ +K+A+
Subjt:  KS-----CQ---GLLVSVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV

Q9LXA9 Transcription factor bHLH61 (e value: 1.9e-06; % identity: 27.1)
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSC---QG
        +++   ++  L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D   + ++ H + L  +      +++  V    +N   +  C    G
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSC---QG

Query:  LLVSVLEAFEELGLNVIEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA   E+ EQ    + ++A K+A+++
Subjt:  LLVSVLEAFEELGLNVIEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ

Arabidopsis top hits: e value, % identity, alignment
AT1G29270.1 unknown protein (e value: 3.9e-10; % identity: 33.85)
Query:  MVSREHKKAALHEKLQLLRSITN-SHALNKASIIV-DASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLL
        MV+ E KK A   K   L+++T+   ++++ S+++ +A  YI  LK ++E L ++       +        H    V VE + + F + + S +  +  L
Subjt:  MVSREHKKAALHEKLQLLRSITN-SHALNKASIIV-DASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLL

Query:  VSVLEAFEELGLNVIEARVSCTDSFQLQAI
        V++LEAFEE+GLNV +AR SC DSF ++AI
Subjt:  VSVLEAFEELGLNVIEARVSCTDSFQLQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1) (e value: 9.2e-44; % identity: 63.06)
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS
        MVSRE K+ +L EK QLLRSITNSHA N  SII+DASKYI++LKQKVER NQD    Q+S  P         PMVTVETL KGF INVFS K+  G+LVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVS

Query:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQ
        VLEAFE++GLNV+EAR SCTDSF L A+G  +E GE  +DA+AVK+AV  AI++W +
Subjt:  VLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQ

AT3G56220.1 transcription regulators (e value: 1.8e-39; % identity: 57.41)
Query:  MVSREHKK-AALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQN---SIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQG
        MVSREHK+ ++L EK  LLRSIT+SHA ++ SIIVDASKYI++LKQKVE++N    + Q+   S  PN        PMVTVETL KGF I V S K+  G
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQN---SIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQG

Query:  LLVSVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQS
        +LV VLE FE+LGL+V+EARVSCTD+F L AIG  +    + IDA+AVK+AV +AI+ WS S
Subjt:  LLVSVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQS

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 1.4e-07; % identity: 27.1)
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSC---QG
        +++   ++  L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D   + ++ H + L  +      +++  V    +N   +  C    G
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSC---QG

Query:  LLVSVLEAFEELGLNVIEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA   E+ EQ    + ++A K+A+++
Subjt:  LLVSVLEAFEELGLNVIEARVSCTDSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ

AT5G65640.1 beta HLH protein 93 (e value: 3.8e-05; % identity: 26.67)
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHH-----HPMVTVETLVKG---FSINVFSE
        +++   ++  L+++L +LRSI      +++ SI+ DA  Y++EL  K+ +L  +   + NS +    SHH         +   E LV+    F I+   E
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHH-----HPMVTVETLVKG---FSINVFSE

Query:  KS-----CQ---GLLVSVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV
         +     C    GLL+S +   E LGL + +  +SC   F LQA      +  + I ++ +K+A+
Subjt:  KS-----CQ---GLLVSVLEAFEELGLNVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAV


Sequences
CDS sequence
ATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTTCAATTACTTCGTTCTATTACCAACTCTCATGCTCTAAACAAGGCCTCGATTATAGTGGATGCATC
AAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATAGCAACCGTTCAAAATTCAATCCACCCAAATCCACTTTCTCATCATCATCATCATCCCA
TGGTTACAGTGGAAACCCTAGTAAAGGGATTTTCTATAAATGTATTTTCAGAAAAAAGCTGTCAAGGCCTCCTTGTCTCAGTATTAGAAGCCTTTGAAGAGCTTGGGCTT
AATGTTATTGAAGCTAGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGACGAACAAGGAGAAGAAGCCATTGATGCTCAAGCTGTGAAAGAAGC
AGTAGTTCAAGCTATAAAGAACTGGAGCCAAAGCGGTGAACAAGATTAA
mRNA sequence
ATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTTCAATTACTTCGTTCTATTACCAACTCTCATGCTCTAAACAAGGCCTCGATTATAGTGGATGCATC
AAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATAGCAACCGTTCAAAATTCAATCCACCCAAATCCACTTTCTCATCATCATCATCATCCCA
TGGTTACAGTGGAAACCCTAGTAAAGGGATTTTCTATAAATGTATTTTCAGAAAAAAGCTGTCAAGGCCTCCTTGTCTCAGTATTAGAAGCCTTTGAAGAGCTTGGGCTT
AATGTTATTGAAGCTAGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGACGAACAAGGAGAAGAAGCCATTGATGCTCAAGCTGTGAAAGAAGC
AGTAGTTCAAGCTATAAAGAACTGGAGCCAAAGCGGTGAACAAGATTAA
Protein sequence
MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGL
NVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
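
The CDS and protein sequences listed above can be cross-checked against each other with a short translation sketch. It assumes the standard nuclear genetic code and reading frame 1. The names CODON_TABLE and translate are illustrative helpers, not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# CDS of Lag0030247 as listed on this page (line breaks removed).
CDS = (
    "ATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTTCAATTACTTCGTTCTATTACCAACTCTCATGCTCTAAACAAGGCCTCGATTATAGTGGATGCATC"
    "AAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATAGCAACCGTTCAAAATTCAATCCACCCAAATCCACTTTCTCATCATCATCATCATCCCA"
    "TGGTTACAGTGGAAACCCTAGTAAAGGGATTTTCTATAAATGTATTTTCAGAAAAAAGCTGTCAAGGCCTCCTTGTCTCAGTATTAGAAGCCTTTGAAGAGCTTGGGCTT"
    "AATGTTATTGAAGCTAGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGACGAACAAGGAGAAGAAGCCATTGATGCTCAAGCTGTGAAAGAAGC"
    "AGTAGTTCAAGCTATAAAGAACTGGAGCCAAAGCGGTGAACAAGATTAA"
)

# Protein sequence of Lag0030247 as listed on this page.
PROTEIN = (
    "MVSREHKKAALHEKLQLLRSITNSHALNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHHHPMVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGL"
    "NVIEARVSCTDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD"
)

if __name__ == "__main__":
    print(translate(CDS) == PROTEIN)
```

The CDS is 489 nucleotides long, which corresponds to 162 codons plus a TAA stop codon and matches the 162-residue protein.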