CuGenDBv2

Sgr024810 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024810
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: BHLH domain-containing protein
Genome location: tig00002486:3027017..3029500
RNA-Seq Expression: Sgr024810
Synteny: Sgr024810
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
  GO:0043565 - sequence-specific DNA binding (molecular function)
  GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
  IPR036638 - Helix-loop-helix DNA-binding domain superfamily
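
The genome location field follows a contig:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the field could be parsed and the span length computed as follows. The names Location and parse_location are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Normalise so that start <= end, in case a reversed range is supplied.
    if start > end:
        start, end = end, start
    return Location(contig, start, end)


loc = parse_location("tig00002486:3027017..3029500")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span reported for this gene covers 2484 bases on tig00002486.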


Homology
GenBank top hits (description; e value; % identity; alignment)
KAG6582238.1 Transcription factor basic helix-loop-helix 95, partial [Cucurbita argyrosperma subsp. sororia]; e value 1.7e-67; identity 74.76%
Query:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE
        M+ CTDSV+ LLP I+PVH SEA + KAS SRKRRRALEANGG+Q KGREKRKEMSESFDVLQSLVPN+SPK             ATRE IVSETIQFIE
Subjt:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE

Query:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS-SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--STVH
         LQKQLMRLEM+KKP ESVTMLPSTNSDS  GGVIVSVS NIVLFGI+ ASVRRGMVTQILM FERH+AEVLAANVAV HG LTLTVTASVHG+  +T+ 
Subjt:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS-SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--STVH

Query:  WSKRNNLSFK
          + + LS K
Subjt:  WSKRNNLSFK

XP_022955986.1 uncharacterized protein LOC111457820 [Cucurbita moschata]; e value 1.7e-67; identity 74.76%
Query:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE
        M+ CTDSV+ LLP I+PVH SEA + KAS SRKRRRALEANGG+Q KGREKRKEMSESFDVLQSLVPN+SPK             ATRE IVSETIQFIE
Subjt:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE

Query:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS-SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--STVH
         LQKQLMRLEM+KKP ESVTMLPSTNSDS  GGVIVSVS NIVLFGI+ ASVRRGMVTQILM FERH+AEVLAANVAV HG LTLTVTASVHG+  +T+ 
Subjt:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS-SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--STVH

Query:  WSKRNNLSFK
          + + LS K
Subjt:  WSKRNNLSFK

XP_022979931.1 uncharacterized protein LOC111479473 [Cucurbita maxima]; e value 8.0e-65; identity 72.09%
Query:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE
        M+ CTDSV+ LLP I+PVH SEA + KAS SRKR RALEANGG Q KGREKRKEMSESFDVLQSLVPN+SPK             ATRE IVSETIQFIE
Subjt:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE

Query:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS------SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF-
         LQKQLMRLEM+KKP ESVTMLPSTNSDS       GGVIVSVS NIVLFGI+ ASVRRGMVT+ILM FERH+AEVLAANVAV HG LTLTVTASVHG+ 
Subjt:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS------SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF-

Query:  -STVHWSKRNNLSFK
         +T+   + + LS K
Subjt:  -STVHWSKRNNLSFK

XP_023528352.1 uncharacterized protein LOC111791298 [Cucurbita pepo subsp. pepo]; e value 2.5e-66; identity 72.77%
Query:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE
        M+ CTDSV+ LLP I+PVH SEA + KAS SRKRRRALEANGG+Q KGREKRKEMSESFDVLQSLVPN+SPK             ATRE IVSETIQFIE
Subjt:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE

Query:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS----SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--S
         LQKQL RLEM+KKP ESVTMLPSTNSDS     GGVIVSVS NIVLFGI+ ASVRRGMVTQILM FERH+AEVLAANVAV HG LTLTVTAS+HG+  +
Subjt:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS----SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--S

Query:  TVHWSKRNNLSFK
        T+   + + LS K
Subjt:  TVHWSKRNNLSFK

XP_038885840.1 transcription factor bHLH95-like [Benincasa hispida]; e value 5.9e-68; identity 74.65%
Query:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE
        M+ CTDSV+PL P I+PVH SEA ++KAS SRKR RALEANGG+Q K REKRKEMSESFDVLQSLVPN+SPK             ATRE IVSETIQFIE
Subjt:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE

Query:  FLQKQLMRLEMKKKPSESVTMLPSTNSDSS----GGVIVSVSGNIVLFG-ILASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--S
        FLQKQLMRLEM+KK SESVTMLPSTNSDSS    GGVIVSVSGNIVLFG I+ASV+RGMVTQILMVFERH+AEVLAANVAV HG LTLTVTASVHG+  +
Subjt:  FLQKQLMRLEMKKKPSESVTMLPSTNSDSS----GGVIVSVSGNIVLFG-ILASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--S

Query:  TVHWSKRNNLSFK
        T+   K + LS K
Subjt:  TVHWSKRNNLSFK

TrEMBL top hits (description; e value; % identity; alignment)
A0A1S4DSM0 transcription factor bHLH95-like; e value 2.0e-61; identity 67.14%
Query:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE
        M+ C+DSV+PL P I+P+H  EA +++AS SRKR RALEANGG+Q K +EKRKEMSESFDVL+SLVPN+SPK             ATRE IVS  IQFIE
Subjt:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE

Query:  FLQKQLMRLEMKKKPSESVTMLPSTNSDSSG----GVIVSVSGNIVLFG-ILASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--S
        FLQKQLMRLEM+KK SESVT+LP++NSDSSG    GVIVS+SGNIVLFG I+ASV+RGMVTQIL+VFERH+ EVLAANV V HG LTLTVTASVHG+  +
Subjt:  FLQKQLMRLEMKKKPSESVTMLPSTNSDSSG----GVIVSVSGNIVLFG-ILASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--S

Query:  TVHWSKRNNLSFK
        T+   + + LS K
Subjt:  TVHWSKRNNLSFK

A0A5A7U767 Transcription factor bHLH95-like; e value 2.0e-61; identity 67.14%
Query:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE
        M+ C+DSV+PL P I+P+H  EA +++AS SRKR RALEANGG+Q K +EKRKEMSESFDVL+SLVPN+SPK             ATRE IVS  IQFIE
Subjt:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE

Query:  FLQKQLMRLEMKKKPSESVTMLPSTNSDSSG----GVIVSVSGNIVLFG-ILASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--S
        FLQKQLMRLEM+KK SESVT+LP++NSDSSG    GVIVS+SGNIVLFG I+ASV+RGMVTQIL+VFERH+ EVLAANV V HG LTLTVTASVHG+  +
Subjt:  FLQKQLMRLEMKKKPSESVTMLPSTNSDSSG----GVIVSVSGNIVLFG-ILASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--S

Query:  TVHWSKRNNLSFK
        T+   + + LS K
Subjt:  TVHWSKRNNLSFK

A0A6J1CA71 uncharacterized protein LOC111009370; e value 1.4e-62; identity 75.88%
Query:  MDGCTDSVIPLLPQIMPVHDSEAVD-EKASFSRKRRRA-LEANGGIQKGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFI
        M+ CTDSVIPLL QI+PV +SEA D  KAS SRKRRRA LEA GG+QKGR KRKEM++SFDVLQSLVPN+SPK             ATRENIVSETIQFI
Subjt:  MDGCTDSVIPLLPQIMPVHDSEAVD-EKASFSRKRRRA-LEANGGIQKGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFI

Query:  EFLQKQLMRLEMKKK-PSESV--TML-PSTNSDSS--GGVIVSVSGNIVLFGILASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF
        +FL+KQLMRLEMKKK PSESV  TM+ PSTNSDSS  GGVIVS SGNIVLFGILASVRRGMVTQILM FER++AEVLAANVAV HG L+LT+TASVHG+
Subjt:  EFLQKQLMRLEMKKK-PSESV--TML-PSTNSDSS--GGVIVSVSGNIVLFGILASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF

A0A6J1GWN9 uncharacterized protein LOC111457820; e value 8.3e-68; identity 74.76%
Query:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE
        M+ CTDSV+ LLP I+PVH SEA + KAS SRKRRRALEANGG+Q KGREKRKEMSESFDVLQSLVPN+SPK             ATRE IVSETIQFIE
Subjt:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE

Query:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS-SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--STVH
         LQKQLMRLEM+KKP ESVTMLPSTNSDS  GGVIVSVS NIVLFGI+ ASVRRGMVTQILM FERH+AEVLAANVAV HG LTLTVTASVHG+  +T+ 
Subjt:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS-SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF--STVH

Query:  WSKRNNLSFK
          + + LS K
Subjt:  WSKRNNLSFK

A0A6J1IS55 uncharacterized protein LOC111479473; e value 3.9e-65; identity 72.09%
Query:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE
        M+ CTDSV+ LLP I+PVH SEA + KAS SRKR RALEANGG Q KGREKRKEMSESFDVLQSLVPN+SPK             ATRE IVSETIQFIE
Subjt:  MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQ-KGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIE

Query:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS------SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF-
         LQKQLMRLEM+KKP ESVTMLPSTNSDS       GGVIVSVS NIVLFGI+ ASVRRGMVT+ILM FERH+AEVLAANVAV HG LTLTVTASVHG+ 
Subjt:  FLQKQLMRLEMKKKPSESVTMLPSTNSDS------SGGVIVSVSGNIVLFGIL-ASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGF-

Query:  -STVHWSKRNNLSFK
         +T+   + + LS K
Subjt:  -STVHWSKRNNLSFK
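
The % identity values listed with each hit can be related to the middle line of an alignment block, where a letter marks an identical residue, "+" marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below assumes identity is the number of identical columns divided by the total number of alignment columns. The function name percent_identity is hypothetical, and because the alignment rows shown here may have lost whitespace during extraction, the result is not expected to reproduce the reported figures exactly.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Estimate % identity for one alignment block from its midline.

    ASSUMPTION: identity = identical columns / all alignment columns,
    where the alignment width is taken from the (gapped) query row.
    """
    width = len(query_row)
    if width == 0:
        raise ValueError("empty alignment row")
    # Trailing blanks of the midline are often stripped; pad them back,
    # and trim any excess so both rows cover the same columns.
    padded = midline.ljust(width)[:width]
    # Letters in the midline mark identical residues; '+' and ' ' do not.
    identical = sum(ch.isalpha() for ch in padded)
    return 100.0 * identical / width


# First ten columns of the first GenBank alignment above.
print(percent_identity("MDGCTDSVIP", "M+ CTDSV+ "))
```

For a full alignment, the blocks would be concatenated (query rows together, midlines together) before calling the function, so that the ratio is taken over the whole alignment rather than per block.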

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGACGGCTGCACCGACTCTGTGATTCCACTTTTGCCGCAGATAATGCCAGTGCATGATTCTGAAGCAGTCGACGAGAAGGCTTCGTTCTCAAGAAAGCGTCGCAGAGC
CCTGGAAGCCAACGGAGGTATACAGAAGGGGAGAGAGAAGAGGAAGGAGATGAGCGAGAGTTTCGATGTTCTTCAATCTCTCGTCCCCAATATCTCTCCCAAGGTTAATC
GATTTGATTCTCGTTGTCGTATATACTTAGACGCTACGAGGGAGAATATTGTTTCCGAGACGATCCAGTTCATCGAGTTTCTGCAGAAGCAGTTGATGAGGCTGGAGATG
AAGAAGAAACCATCGGAATCGGTGACAATGCTTCCCAGTACGAACTCGGATTCATCAGGCGGCGTCATCGTCTCGGTCTCCGGCAACATTGTGTTGTTTGGGATTCTTGC
TTCTGTTCGACGAGGTATGGTGACACAGATTTTAATGGTGTTTGAAAGACACCGGGCTGAAGTTCTAGCAGCAAATGTTGCAGTCGGCCATGGCAAATTAACTTTAACAG
TCACAGCTTCTGTACACGGATTCAGCACAGTCCACTGGAGCAAAAGAAATAACTTGAGCTTCAAGGTCATACCCAACATAGTAGTTTTGCAGATGAAAGTTCCCCAATAC
AGAAATTGGAGATCCAGAACGCAGAAGGGCAAGGCAGATAACTCCATCATCCTCTATCTTCACGAAGGGGTGTTTGGTTCAAGCCCACACTGCCCATATAACTCTGCATA
CCTCCTGTTAAAGGAGCATCTGAACAGCCAAAGTTCAAATAGCCAACATCCACAAGTTTCCCATTTGAGAAGAATTGCAGGTTTGGAAGCCAGTCAAGGAATTGCAAAAG
TTAGAGCCACATGGCTCCAACTCATAGGTGAAGGATTTGGAGGAGTGGAACTTGGTGTTGGTGGGGCCTTTTTCTGGCTCACATTGATTACTGCAGTTTGA
mRNA sequence
ATGGACGGCTGCACCGACTCTGTGATTCCACTTTTGCCGCAGATAATGCCAGTGCATGATTCTGAAGCAGTCGACGAGAAGGCTTCGTTCTCAAGAAAGCGTCGCAGAGC
CCTGGAAGCCAACGGAGGTATACAGAAGGGGAGAGAGAAGAGGAAGGAGATGAGCGAGAGTTTCGATGTTCTTCAATCTCTCGTCCCCAATATCTCTCCCAAGGTTAATC
GATTTGATTCTCGTTGTCGTATATACTTAGACGCTACGAGGGAGAATATTGTTTCCGAGACGATCCAGTTCATCGAGTTTCTGCAGAAGCAGTTGATGAGGCTGGAGATG
AAGAAGAAACCATCGGAATCGGTGACAATGCTTCCCAGTACGAACTCGGATTCATCAGGCGGCGTCATCGTCTCGGTCTCCGGCAACATTGTGTTGTTTGGGATTCTTGC
TTCTGTTCGACGAGGTATGGTGACACAGATTTTAATGGTGTTTGAAAGACACCGGGCTGAAGTTCTAGCAGCAAATGTTGCAGTCGGCCATGGCAAATTAACTTTAACAG
TCACAGCTTCTGTACACGGATTCAGCACAGTCCACTGGAGCAAAAGAAATAACTTGAGCTTCAAGGTCATACCCAACATAGTAGTTTTGCAGATGAAAGTTCCCCAATAC
AGAAATTGGAGATCCAGAACGCAGAAGGGCAAGGCAGATAACTCCATCATCCTCTATCTTCACGAAGGGGTGTTTGGTTCAAGCCCACACTGCCCATATAACTCTGCATA
CCTCCTGTTAAAGGAGCATCTGAACAGCCAAAGTTCAAATAGCCAACATCCACAAGTTTCCCATTTGAGAAGAATTGCAGGTTTGGAAGCCAGTCAAGGAATTGCAAAAG
TTAGAGCCACATGGCTCCAACTCATAGGTGAAGGATTTGGAGGAGTGGAACTTGGTGTTGGTGGGGCCTTTTTCTGGCTCACATTGATTACTGCAGTTTGA
Protein sequence
MDGCTDSVIPLLPQIMPVHDSEAVDEKASFSRKRRRALEANGGIQKGREKRKEMSESFDVLQSLVPNISPKVNRFDSRCRIYLDATRENIVSETIQFIEFLQKQLMRLEM
KKKPSESVTMLPSTNSDSSGGVIVSVSGNIVLFGILASVRRGMVTQILMVFERHRAEVLAANVAVGHGKLTLTVTASVHGFSTVHWSKRNNLSFKVIPNIVVLQMKVPQY
RNWRSRTQKGKADNSIILYLHEGVFGSSPHCPYNSAYLLLKEHLNSQSSNSQHPQVSHLRRIAGLEASQGIAKVRATWLQLIGEGFGGVELGVGGAFFWLTLITAV
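
The protein sequence corresponds to a conceptual translation of the CDS. As a minimal sketch, assuming the standard genetic code (the record does not say which table was used), the CDS could be translated with plain Python as below. The names CODON_TABLE and translate are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (such as the line wrapping used on this page) is removed
    first. Codons containing unexpected characters become 'X', and a
    trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS shown above.
print(translate("ATGGACGGCTGCACCGACTCTGTGATTCCA"))
```

The first ten codons of the CDS translate to MDGCTDSVIP, which matches the start of the protein sequence, and the final codons GCA GTT TGA give AV followed by a stop, which matches its end.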