CuGenDBv2

Spg039748 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg039748
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Transcription factor bHLH61-like protein
Genome location: scaffold10:45649596..45651243
RNA-Seq Expression: Spg039748
Synteny: Spg039748
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains: NA
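The genome location field packs the scaffold name and the coordinates into one string. A minimal sketch of splitting it apart follows; the function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold10:45649596..45651243")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_length = end - start + 1
print(scaffold, start, end, span_length)
```

Under that assumption the gene spans 1648 bp on scaffold10.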


Homology
GenBank top hits | e value | % identity | Alignment
XP_022964712.1 uncharacterized protein LOC111464709 isoform X1 [Cucurbita moschata] | e value: 3.5e-55 | % identity: 87.41
Query:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA
        LNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN     HPMQVTVE LVKGFSINVFSEKSCQGLLVS+LEAFEELGLNV+EARVSCTD+FQLQA
Subjt:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA

Query:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
          EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD
Subjt:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

XP_038884496.1 uncharacterized protein LOC120075304 isoform X1 [Benincasa hispida] | e value: 1.2e-58 | % identity: 90.51
Query:  QLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHH-PMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQL
        QLNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPNPLSH + PMQVTVE LVKGFSINVFSEKSCQGLLVS+LE FEELGLNVIEARVSCTD+FQL
Subjt:  QLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHH-PMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQL

Query:  QAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        QAI EI+E+GEEAIDAQAVKEAVVQAIK+W QSGEQD
Subjt:  QAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida] | e value: 5.8e-58 | % identity: 90.44
Query:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHH-PMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQ
        LNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPNPLSH + PMQVTVE LVKGFSINVFSEKSCQGLLVS+LE FEELGLNVIEARVSCTD+FQLQ
Subjt:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHH-PMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQ

Query:  AIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        AI EI+E+GEEAIDAQAVKEAVVQAIK+W QSGEQD
Subjt:  AIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

XP_038884498.1 uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida] | e value: 9.9e-58 | % identity: 88.97
Query:  QLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQ
        QLNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPNPLSH +   VTVE LVKGFSINVFSEKSCQGLLVS+LE FEELGLNVIEARVSCTD+FQLQ
Subjt:  QLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQ

Query:  AIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        AI EI+E+GEEAIDAQAVKEAVVQAIK+W QSGEQD
Subjt:  AIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida] | e value: 4.9e-57 | % identity: 88.89
Query:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA
        LNKASIIVDASKYIEELKQKVERLNQDI+TVQNSIHPNPLSH +   VTVE LVKGFSINVFSEKSCQGLLVS+LE FEELGLNVIEARVSCTD+FQLQA
Subjt:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA

Query:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        I EI+E+GEEAIDAQAVKEAVVQAIK+W QSGEQD
Subjt:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
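The % identity column can be cross-checked against the alignment midline. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and uses the first GenBank hit, XP_022964712.1, as input. The function name `percent_identity` is hypothetical, and the alignment length is taken from the query lines.

```python
def percent_identity(query_blocks, midline_blocks):
    """Percent identity over all alignment blocks.

    Identical positions are the alphabetic characters of the midline;
    the alignment length is the total length of the query lines.
    """
    aligned = sum(len(block) for block in query_blocks)
    identical = sum(ch.isalpha() for block in midline_blocks for ch in block)
    return 100.0 * identical / aligned


# Query and midline lines of the XP_022964712.1 alignment, without the
# "Query:  " prefix and the matching eight-space indent of the midline.
QUERY = [
    "LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA",
    "IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD",
]
MIDLINE = [
    "LNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN     HPMQVTVE LVKGFSINVFSEKSCQGLLVS+LEAFEELGLNV+EARVSCTD+FQLQA",
    "  EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD",
]

print(round(percent_identity(QUERY, MIDLINE), 2))
```

For this hit the midline has 118 identical positions over 135 aligned positions, which agrees with the listed 87.41.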

TrEMBL top hits | e value | % identity | Alignment
A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X1 | e value: 1.7e-55 | % identity: 87.41
Query:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA
        LNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN     HPMQVTVE LVKGFSINVFSEKSCQGLLVS+LEAFEELGLNV+EARVSCTD+FQLQA
Subjt:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA

Query:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
          EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD
Subjt:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

A0A6J1HJR0 uncharacterized protein LOC111464709 isoform X2 | e value: 1.6e-53 | % identity: 86.67
Query:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA
        LNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN     HPM VTVE LVKGFSINVFSEKSCQGLLVS+LEAFEELGLNV+EARVSCTD+FQLQA
Subjt:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA

Query:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
          EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD
Subjt:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

A0A6J1JTS8 uncharacterized protein LOC111487778 isoform X2 | e value: 1.6e-53 | % identity: 86.67
Query:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA
        LNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN     HPM VTVE LVKGFSINVFSEKSCQGLLVS+LEAFEELGLNV+EARVSCTD+FQLQA
Subjt:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA

Query:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
          EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD
Subjt:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X1 | e value: 1.7e-55 | % identity: 87.41
Query:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA
        LNK SIIVDASKYIEELKQKVERLNQDIATVQNSIHPN     HPMQVTVE LVKGFSINVFSEKSCQGLLVS+LEAFEELGLNV+EARVSCTD+FQLQA
Subjt:  LNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQA

Query:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
          EI+EQGEEA+DAQAVKEAVV+AIK+WSQ+GEQD
Subjt:  IGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

A0A6J1K1N3 uncharacterized protein LOC111489817 isoform X1 | e value: 1.6e-53 | % identity: 86.76
Query:  QLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQ
        QLNKASIIVDASKYIEELKQKVERLNQDI+TVQ SI        HPMQVTVE+L KGFSINVFSEKSCQGLLVS+LEAFEELGLNV+EARVSCTDSFQLQ
Subjt:  QLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQ

Query:  AIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
        AI EI+E+GEEAIDAQAVKEAVVQAIK WSQSGEQD
Subjt:  AIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD

SwissProt top hits | e value | % identity | Alignment
Q9LSE2 Transcription factor ICE1 | e value: 5.0e-04 | % identity: 24.66
Query:  QLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLV-----------------------------KGFSINVFSEKSCQGL
        ++++ASI+ DA  Y++EL Q++  L+ ++ +      P   S  HP+  T +TL                              +  +I++F  +   GL
Subjt:  QLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLV-----------------------------KGFSINVFSEKSCQGL

Query:  LVSVLEAFEELGLNVIEARVSCTDSFQLQAI-GEIDEQGEEAIDAQ
        L++ ++A + LGL+V +A +SC + F L     E  ++G+E +  Q
Subjt:  LVSVLEAFEELGLNVIEARVSCTDSFQLQAI-GEIDEQGEEAIDAQ

Q9LXA9 Transcription factor bHLH61 | e value: 5.9e-05 | % identity: 27.48
Query:  VWQLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPM-QVTVETLVKGFSINVFSEKSC---QGLLVSVLEAFEELGLNVIEARVSCT
        + ++++ SI+ DA  Y++EL  K+ +L +D   + ++ H + L  +  M + +++  V    +N   +  C    GL+VS +   E LGL + +  +SC 
Subjt:  VWQLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPM-QVTVETLVKGFSINVFSEKSC---QGLLVSVLEAFEELGLNVIEARVSCT

Query:  DSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ
          F LQA   E+ EQ    + ++A K+A+++
Subjt:  DSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ

Arabidopsis top hits | e value | % identity | Alignment
AT1G29270.1 unknown protein | e value: 3.1e-09 | % identity: 36.36
Query:  KASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQAI
        ++ +I +A  YI  LK ++E L ++   ++ +      S H   +V VE + + F + + S +  +  LV++LEAFEE+GLNV +AR SC DSF ++AI
Subjt:  KASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1) | e value: 9.0e-33 | % identity: 61.24
Query:  NKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQAI
        N  SII+DASKYI++LKQKVER NQD    Q+S    P     PM VTVETL KGF INVFS K+  G+LVSVLEAFE++GLNV+EAR SCTDSF L A+
Subjt:  NKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQLQAI

Query:  GEIDEQGEEAIDAQAVKEAVVQAIKNWSQ
        G  +E GE  +DA+AVK+AV  AI++W +
Subjt:  GEIDEQGEEAIDAQAVKEAVVQAIKNWSQ

AT3G26744.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 3.6e-05 | % identity: 24.66
Query:  QLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLV-----------------------------KGFSINVFSEKSCQGL
        ++++ASI+ DA  Y++EL Q++  L+ ++ +      P   S  HP+  T +TL                              +  +I++F  +   GL
Subjt:  QLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLV-----------------------------KGFSINVFSEKSCQGL

Query:  LVSVLEAFEELGLNVIEARVSCTDSFQLQAI-GEIDEQGEEAIDAQ
        L++ ++A + LGL+V +A +SC + F L     E  ++G+E +  Q
Subjt:  LVSVLEAFEELGLNVIEARVSCTDSFQLQAI-GEIDEQGEEAIDAQ

AT3G56220.1 transcription regulators | e value: 1.9e-30 | % identity: 54.89
Query:  NKASIIVDASKYIEELKQKVERLNQDIATVQN---SIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQL
        ++ SIIVDASKYI++LKQKVE++N    + Q+   S  PNP+       VTVETL KGF I V S K+  G+LV VLE FE+LGL+V+EARVSCTD+F L
Subjt:  NKASIIVDASKYIEELKQKVERLNQDIATVQN---SIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSCTDSFQL

Query:  QAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQS
         AIG  +    + IDA+AVK+AV +AI+ WS S
Subjt:  QAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQS

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 4.2e-06 | % identity: 27.48
Query:  VWQLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPM-QVTVETLVKGFSINVFSEKSC---QGLLVSVLEAFEELGLNVIEARVSCT
        + ++++ SI+ DA  Y++EL  K+ +L +D   + ++ H + L  +  M + +++  V    +N   +  C    GL+VS +   E LGL + +  +SC 
Subjt:  VWQLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPM-QVTVETLVKGFSINVFSEKSC---QGLLVSVLEAFEELGLNVIEARVSCT

Query:  DSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ
          F LQA   E+ EQ    + ++A K+A+++
Subjt:  DSFQLQA-IGEIDEQGEEAIDAQAVKEAVVQ


Sequences
CDS sequence
ATGGGAAGCAATGATTTTCTAACAGAAAAAGTTTTGATTTTGTGTGTGTGGCAGCTAAACAAGGCCTCGATTATAGTGGATGCATCAAAATATATCGAGGAGCTAAAACA
GAAAGTAGAAAGATTGAATCAAGATATAGCAACCGTTCAAAATTCAATCCACCCAAATCCACTTTCTCATCATCATCCCATGCAGGTTACAGTGGAAACCCTAGTAAAGG
GATTTTCTATAAATGTATTTTCAGAAAAAAGCTGTCAAGGCCTCCTTGTCTCAGTATTAGAAGCCTTTGAAGAGCTGGGGCTTAATGTTATTGAAGCTAGGGTTTCCTGT
ACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGACGAACAAGGAGAAGAAGCCATTGATGCTCAAGCTGTGAAAGAAGCAGTAGTTCAAGCTATAAAGAACTGGAG
CCAAAGCGGTGAACAAGATTAA
mRNA sequence
ATGGGAAGCAATGATTTTCTAACAGAAAAAGTTTTGATTTTGTGTGTGTGGCAGCTAAACAAGGCCTCGATTATAGTGGATGCATCAAAATATATCGAGGAGCTAAAACA
GAAAGTAGAAAGATTGAATCAAGATATAGCAACCGTTCAAAATTCAATCCACCCAAATCCACTTTCTCATCATCATCCCATGCAGGTTACAGTGGAAACCCTAGTAAAGG
GATTTTCTATAAATGTATTTTCAGAAAAAAGCTGTCAAGGCCTCCTTGTCTCAGTATTAGAAGCCTTTGAAGAGCTGGGGCTTAATGTTATTGAAGCTAGGGTTTCCTGT
ACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGACGAACAAGGAGAAGAAGCCATTGATGCTCAAGCTGTGAAAGAAGCAGTAGTTCAAGCTATAAAGAACTGGAG
CCAAAGCGGTGAACAAGATTAA
Protein sequence
MGSNDFLTEKVLILCVWQLNKASIIVDASKYIEELKQKVERLNQDIATVQNSIHPNPLSHHHPMQVTVETLVKGFSINVFSEKSCQGLLVSVLEAFEELGLNVIEARVSC
TDSFQLQAIGEIDEQGEEAIDAQAVKEAVVQAIKNWSQSGEQD
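The relationship between the CDS and protein sequences above can be checked by translating the CDS with the standard genetic code. The sketch below is an illustration only: the `translate` helper is hypothetical, it assumes the standard nuclear codon table applies, and it is run on just the first 33 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest), matching the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS sequence listed above.
cds_start = "ATGGGAAGCAATGATTTTCTAACAGAAAAAGTT"
print(translate(cds_start))
```

The printed peptide, MGSNDFLTEKV, matches the first eleven residues of the listed protein sequence. Passing the full CDS (with line breaks removed) to the same function would allow the whole protein to be compared.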