; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025147 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025147
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionBHLH domain-containing protein
Genome locationtig00003412:1705157..1705637
RNA-Seq ExpressionSgr025147
SyntenySgr025147
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:2000112 - regulation of cellular macromolecule biosynthetic process (biological process)
GO:0090575 - RNA polymerase II transcription factor complex (cellular component)
GO:0000977 - RNA polymerase II regulatory region sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR015660 - Achaete-scute transcription factor-related


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031567.1 transcription factor bHLH55-like [Cucumis melo var. makuwa]9.9e-2767.35Show/hide
Query:  GKNSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLVMG
        GKN  +    SSSS+SPQL I +MG+SLEIVLSSG++N +L CET+RILQ+EG EVVNASFSV+G SV HT+HA+LGDS+VEFG  K  ERL+RLV G
Subjt:  GKNSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLVMG

XP_008455310.1 PREDICTED: transcription factor bHLH55-like [Cucumis melo]9.9e-2767.35Show/hide
Query:  GKNSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLVMG
        GKN  +    SSSS+SPQL I +MG+SLEIVLSSG++N +L CET+RILQ+EG EVVNASFSV+G SV HT+HA+LGDS+VEFG  K  ERL+RLV G
Subjt:  GKNSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLVMG

XP_011659708.1 transcription factor bHLH162 [Cucumis sativus]1.4e-2569.15Show/hide
Query:  NSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLV
        N  SS +SSSSS+SPQL I +MG+SLEI+LSSG +N +L CET+RIL++EG EVV+ASFSV+GNSV HT+HAQLGDS+VEFG  K  ERL RLV
Subjt:  NSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLV

XP_022147999.1 transcription factor bHLH162 isoform X2 [Momordica charantia]4.1e-2568.93Show/hide
Query:  MGKNSPSSE----ASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRL
        MGK S ++     +SSSS A+PQL I E+GRSL+I LSSG +N FLF ETIRILQQEGAEVV+ASFS A NSVLHT+HAQLG+S+VEFG AKV ERL+ L
Subjt:  MGKNSPSSE----ASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRL

Query:  VMG
        V G
Subjt:  VMG

XP_038888628.1 transcription factor bHLH162-like [Benincasa hispida]4.7e-2971.29Show/hide
Query:  GKNSPS---SEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLVM
        GKN  S     +SSSSSASPQLNI +MG+S+EIVLSSGL+N +LFCET+RILQ+EG EV+NASFSV+GNSV HT+HAQLGDS+VEFG  K  ERL+RLV 
Subjt:  GKNSPS---SEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLVM

Query:  G
        G
Subjt:  G

TrEMBL top hitse value%identityAlignment
A0A0A0K2C9 BHLH domain-containing protein6.9e-2669.15Show/hide
Query:  NSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLV
        N  SS +SSSSS+SPQL I +MG+SLEI+LSSG +N +L CET+RIL++EG EVV+ASFSV+GNSV HT+HAQLGDS+VEFG  K  ERL RLV
Subjt:  NSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLV

A0A1S3C1C4 transcription factor bHLH55-like4.8e-2767.35Show/hide
Query:  GKNSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLVMG
        GKN  +    SSSS+SPQL I +MG+SLEIVLSSG++N +L CET+RILQ+EG EVVNASFSV+G SV HT+HA+LGDS+VEFG  K  ERL+RLV G
Subjt:  GKNSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLVMG

A0A5D3C788 Transcription factor bHLH55-like4.8e-2767.35Show/hide
Query:  GKNSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLVMG
        GKN  +    SSSS+SPQL I +MG+SLEIVLSSG++N +L CET+RILQ+EG EVVNASFSV+G SV HT+HA+LGDS+VEFG  K  ERL+RLV G
Subjt:  GKNSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLVMG

A0A6J1D2P8 transcription factor bHLH162 isoform X22.0e-2568.93Show/hide
Query:  MGKNSPSSE----ASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRL
        MGK S ++     +SSSS A+PQL I E+GRSL+I LSSG +N FLF ETIRILQQEGAEVV+ASFS A NSVLHT+HAQLG+S+VEFG AKV ERL+ L
Subjt:  MGKNSPSSE----ASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRL

Query:  VMG
        V G
Subjt:  VMG

A0A6J1D2U8 transcription factor bHLH168 isoform X12.0e-2568.93Show/hide
Query:  MGKNSPSSE----ASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRL
        MGK S ++     +SSSS A+PQL I E+GRSL+I LSSG +N FLF ETIRILQQEGAEVV+ASFS A NSVLHT+HAQLG+S+VEFG AKV ERL+ L
Subjt:  MGKNSPSSE----ASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRL

Query:  VMG
        V G
Subjt:  VMG

SwissProt top hitse value%identityAlignment
F4JIJ7 Transcription factor bHLH1621.1e-0736.73Show/hide
Query:  SSEASSSSSAS-----PQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQE-GAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGA-AKVRERLRRLV
        SS  SSS   S     P++ I E G    I L + L + F+FCE IR+L +E GAE+ +A +S+  ++V HT+H ++ +   ++GA +++ ERL ++V
Subjt:  SSEASSSSSAS-----PQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQE-GAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGA-AKVRERLRRLV

Arabidopsis top hitse value%identityAlignment
AT4G20970.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein7.6e-0936.73Show/hide
Query:  SSEASSSSSAS-----PQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQE-GAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGA-AKVRERLRRLV
        SS  SSS   S     P++ I E G    I L + L + F+FCE IR+L +E GAE+ +A +S+  ++V HT+H ++ +   ++GA +++ ERL ++V
Subjt:  SSEASSSSSAS-----PQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQE-GAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGA-AKVRERLRRLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTAAGAACTCCCCGAGCTCGGAGGCTTCGTCGTCTTCTTCTGCATCACCTCAGCTCAACATTTGCGAGATGGGTCGGAGTCTAGAGATAGTTCTGAGCAGTGGGTT
GAACAATCACTTCCTCTTCTGCGAAACCATTCGCATTCTTCAACAAGAAGGAGCTGAAGTCGTCAATGCCAGCTTCTCCGTGGCTGGGAACTCAGTTCTTCACACTGTCC
ATGCACAACTTGGGGATTCTGTGGTTGAATTTGGAGCGGCGAAAGTGAGAGAGAGACTGAGGAGATTGGTTATGGGTCGACCAGCGACATGGAATTGCAGAAGGAGCAGT
GGTGGGACTTCGATTTCCCACCTGAGAGATGGGATCTTTAAGCAACAAAATCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTAAGAACTCCCCGAGCTCGGAGGCTTCGTCGTCTTCTTCTGCATCACCTCAGCTCAACATTTGCGAGATGGGTCGGAGTCTAGAGATAGTTCTGAGCAGTGGGTT
GAACAATCACTTCCTCTTCTGCGAAACCATTCGCATTCTTCAACAAGAAGGAGCTGAAGTCGTCAATGCCAGCTTCTCCGTGGCTGGGAACTCAGTTCTTCACACTGTCC
ATGCACAACTTGGGGATTCTGTGGTTGAATTTGGAGCGGCGAAAGTGAGAGAGAGACTGAGGAGATTGGTTATGGGTCGACCAGCGACATGGAATTGCAGAAGGAGCAGT
GGTGGGACTTCGATTTCCCACCTGAGAGATGGGATCTTTAAGCAACAAAATCCTTAA
Protein sequenceShow/hide protein sequence
MGKNSPSSEASSSSSASPQLNICEMGRSLEIVLSSGLNNHFLFCETIRILQQEGAEVVNASFSVAGNSVLHTVHAQLGDSVVEFGAAKVRERLRRLVMGRPATWNCRRSS
GGTSISHLRDGIFKQQNP