; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg02145 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg02145
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionbasic helix-loop-helix (bHLH) DNA-binding superfamily protein
Genome locationCarg_Chr15:893352..894492
RNA-Seq ExpressionCarg02145
SyntenyCarg02145
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016033.1 Transcription factor bHLH61 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-72100Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG
        MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG

Query:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD
        LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD
Subjt:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD

XP_022939239.1 uncharacterized protein LOC111445214 isoform X3 [Cucurbita moschata]9.2e-7298.04Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG
        MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVS+LEAFEELG
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG

Query:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD
        LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQ+VKEAVV+AIKIWSQSGEQD
Subjt:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD

XP_022993986.1 uncharacterized protein LOC111489817 isoform X3 [Cucurbita maxima]2.1e-7197.39Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG
        MVSREHNNA+LHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVS+LEAFEELG
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG

Query:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD
        LNVLEARVSCTDSFQLQAIAEIEE+GEEAIDAQAVKEAVV+AIKIWSQSGEQD
Subjt:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD

XP_023551076.1 uncharacterized protein LOC111809011 isoform X1 [Cucurbita pepo subsp. pepo]1.0e-7098.05Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPM-VTVESLAKGFSINVFSEKSCQGLLVSVLEAFEEL
        MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPM VTVESLAKGFSINVFSEKSCQGLLVS+LEAFEEL
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPM-VTVESLAKGFSINVFSEKSCQGLLVSVLEAFEEL

Query:  GLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD
        GLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVV+AIKIWSQSGEQD
Subjt:  GLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD

XP_023551078.1 uncharacterized protein LOC111809011 isoform X3 [Cucurbita pepo subsp. pepo]4.1e-7298.69Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG
        MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVS+LEAFEELG
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG

Query:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD
        LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVV+AIKIWSQSGEQD
Subjt:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD

TrEMBL top hitse value%identityAlignment
A0A6J1FFD0 uncharacterized protein LOC111445214 isoform X43.2e-7097.39Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG
        MVSREHNNASLHHNLQLLRSITNSHA LNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVS+LEAFEELG
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG

Query:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD
        LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQ+VKEAVV+AIKIWSQSGEQD
Subjt:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD

A0A6J1FG89 uncharacterized protein LOC111445214 isoform X11.1e-7097.4Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPM-VTVESLAKGFSINVFSEKSCQGLLVSVLEAFEEL
        MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPM VTVESLAKGFSINVFSEKSCQGLLVS+LEAFEEL
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPM-VTVESLAKGFSINVFSEKSCQGLLVSVLEAFEEL

Query:  GLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD
        GLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQ+VKEAVV+AIKIWSQSGEQD
Subjt:  GLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD

A0A6J1FGK6 uncharacterized protein LOC111445214 isoform X34.5e-7298.04Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG
        MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVS+LEAFEELG
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG

Query:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD
        LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQ+VKEAVV+AIKIWSQSGEQD
Subjt:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD

A0A6J1JXW1 uncharacterized protein LOC111489817 isoform X39.9e-7297.39Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG
        MVSREHNNA+LHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVS+LEAFEELG
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELG

Query:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD
        LNVLEARVSCTDSFQLQAIAEIEE+GEEAIDAQAVKEAVV+AIKIWSQSGEQD
Subjt:  LNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD

A0A6J1K1N3 uncharacterized protein LOC111489817 isoform X12.4e-7096.75Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPM-VTVESLAKGFSINVFSEKSCQGLLVSVLEAFEEL
        MVSREHNNA+LHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPM VTVESLAKGFSINVFSEKSCQGLLVS+LEAFEEL
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPM-VTVESLAKGFSINVFSEKSCQGLLVSVLEAFEEL

Query:  GLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD
        GLNVLEARVSCTDSFQLQAIAEIEE+GEEAIDAQAVKEAVV+AIKIWSQSGEQD
Subjt:  GLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD

SwissProt top hitse value%identityAlignment
Q10S44 Transcription factor BHLH36.5e-0420.62Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQ--------------
        +++       L+  L +LRSI    +++++ SI+ D   Y++EL ++++ L ++I      +  + T++  + G +  +    S +              
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQ--------------

Query:  -------GLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAV
               G+L+S + A E LGL + +  VSC   F +QA    E+   + +    +K+ +
Subjt:  -------GLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAV

Q9LPW3 Transcription factor SCREAM28.5e-0422.29Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTV---QTSIHPMV-TVESLA------------------------
        +++       L+  L +LRS+    +++++ASI+ DA  Y++EL Q++  L+ ++ +     +S+HP+  T ++L+                        
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTV---QTSIHPMV-TVESLA------------------------

Query:  ------KGFSINVFSEKSCQGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVE
              K  +I++F  +   GLL+S + A + LGL+V +A +SC + F L      + Q +  +  + +K  +++
Subjt:  ------KGFSINVFSEKSCQGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVE

Q9LXA9 Transcription factor bHLH611.8e-0627.92Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIH--PMVTVESLAKGF--------SINVFSEKSC---QG
        +++       L+  L LLRSI     ++++ SI+ DA  Y++EL  K+ +L +D   + ++ H   ++T ES+ +           +N   +  C    G
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIH--PMVTVESLAKGF--------SINVFSEKSC---QG

Query:  LLVSVLEAFEELGLNVLEARVSCTDSFQLQAIA-EIEEQGEEAIDAQAVKEAVV
        L+VS +   E LGL + +  +SC   F LQA   E+ EQ    + ++A K+A++
Subjt:  LLVSVLEAFEELGLNVLEARVSCTDSFQLQAIA-EIEEQGEEAIDAQAVKEAVV

Arabidopsis top hitse value%identityAlignment
AT1G12860.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein6.1e-0522.29Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTV---QTSIHPMV-TVESLA------------------------
        +++       L+  L +LRS+    +++++ASI+ DA  Y++EL Q++  L+ ++ +     +S+HP+  T ++L+                        
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTV---QTSIHPMV-TVESLA------------------------

Query:  ------KGFSINVFSEKSCQGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVE
              K  +I++F  +   GLL+S + A + LGL+V +A +SC + F L      + Q +  +  + +K  +++
Subjt:  ------KGFSINVFSEKSCQGLLVSVLEAFEELGLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVE

AT1G29270.1 unknown protein6.3e-1033.33Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIV-DASKYIEELKQKVERLNQDIS----TVQTSIHPM--VTVESLAKGFSINVFSEKSCQGLLVSVL
        MV+ E    +       L+++T+    +++ S+++ +A  YI  LK ++E L ++      T + S+H    V VE + + F + + S +  +  LV++L
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIV-DASKYIEELKQKVERLNQDIS----TVQTSIHPM--VTVESLAKGFSINVFSEKSCQGLLVSVL

Query:  EAFEELGLNVLEARVSCTDSFQLQAI
        EAFEE+GLNV +AR SC DSF ++AI
Subjt:  EAFEELGLNVLEARVSCTDSFQLQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)2.2e-3961.69Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIH------PMVTVESLAKGFSINVFSEKSCQGLLVSVLE
        MVSRE    SL    QLLRSITNSHA+ N  SII+DASKYI++LKQKVER NQD +  Q+S        PMVTVE+L KGF INVFS K+  G+LVSVLE
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIH------PMVTVESLAKGFSINVFSEKSCQGLLVSVLE

Query:  AFEELGLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQ
        AFE++GLNVLEAR SCTDSF L A+    E GE  +DA+AVK+AV +AI+ W +
Subjt:  AFEELGLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQ

AT3G56220.1 transcription regulators4.3e-3556.13Show/hide
Query:  MVSREH-NNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQ-----TSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLE
        MVSREH   +SL     LLRSIT+SHA+ ++ SIIVDASKYI++LKQKVE++N   ++ Q     +  +PMVTVE+L KGF I V S K+  G+LV VLE
Subjt:  MVSREH-NNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQ-----TSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLE

Query:  AFEELGLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQS
         FE+LGL+V+EARVSCTD+F L AI        + IDA+AVK+AV EAI+ WS S
Subjt:  AFEELGLNVLEARVSCTDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQS

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.3e-0727.92Show/hide
Query:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIH--PMVTVESLAKGF--------SINVFSEKSC---QG
        +++       L+  L LLRSI     ++++ SI+ DA  Y++EL  K+ +L +D   + ++ H   ++T ES+ +           +N   +  C    G
Subjt:  MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIH--PMVTVESLAKGF--------SINVFSEKSC---QG

Query:  LLVSVLEAFEELGLNVLEARVSCTDSFQLQAIA-EIEEQGEEAIDAQAVKEAVV
        L+VS +   E LGL + +  +SC   F LQA   E+ EQ    + ++A K+A++
Subjt:  LLVSVLEAFEELGLNVLEARVSCTDSFQLQAIA-EIEEQGEEAIDAQAVKEAVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCTAGAGAGCACAACAACGCATCTCTTCATCACAACCTCCAATTACTTCGCTCTATTACCAATTCTCATGCTCAGCTCAACAAGGCCTCGATTATAGTGGATGC
ATCAAAATACATCGAGGAGCTTAAACAAAAAGTAGAAAGATTGAATCAAGATATCTCAACCGTTCAAACTTCAATCCATCCCATGGTGACAGTGGAAAGCCTAGCAAAGG
GGTTTTCCATTAATGTATTTTCAGAAAAGAGCTGCCAAGGCCTGCTCGTGTCGGTATTAGAAGCCTTTGAAGAGCTGGGGCTGAATGTTCTTGAAGCTAGGGTTTCTTGT
ACAGATAGTTTCCAATTACAAGCTATTGCAGAAATTGAGGAACAAGGAGAAGAAGCCATTGATGCTCAAGCTGTGAAAGAAGCAGTAGTTGAAGCTATAAAGATCTGGAG
CCAAAGCGGTGAACAAGATTAA
mRNA sequenceShow/hide mRNA sequence
TATAATTCAAATCATGTCTTATATATATATATATATATGGTTGTAGAGGAAGCTAGCCAACACAAGTTGGTATAGATTTGAGGCCTCACCATGCCCTTATAAAAACACAC
ACAAGAAAAAGAAACATATAACACACAAACAAAAAAGCTACAATCCATCCATGGTTTCTAGAGAGCACAACAACGCATCTCTTCATCACAACCTCCAATTACTTCGCTCT
ATTACCAATTCTCATGCTCAGCTCAACAAGGCCTCGATTATAGTGGATGCATCAAAATACATCGAGGAGCTTAAACAAAAAGTAGAAAGATTGAATCAAGATATCTCAAC
CGTTCAAACTTCAATCCATCCCATGGTGACAGTGGAAAGCCTAGCAAAGGGGTTTTCCATTAATGTATTTTCAGAAAAGAGCTGCCAAGGCCTGCTCGTGTCGGTATTAG
AAGCCTTTGAAGAGCTGGGGCTGAATGTTCTTGAAGCTAGGGTTTCTTGTACAGATAGTTTCCAATTACAAGCTATTGCAGAAATTGAGGAACAAGGAGAAGAAGCCATT
GATGCTCAAGCTGTGAAAGAAGCAGTAGTTGAAGCTATAAAGATCTGGAGCCAAAGCGGTGAACAAGATTAAAAACATCCTTAATTTCTCCAGCTTGTTCCAATTGCGGC
TGTTTTTTTTTTTTT
Protein sequenceShow/hide protein sequence
MVSREHNNASLHHNLQLLRSITNSHAQLNKASIIVDASKYIEELKQKVERLNQDISTVQTSIHPMVTVESLAKGFSINVFSEKSCQGLLVSVLEAFEELGLNVLEARVSC
TDSFQLQAIAEIEEQGEEAIDAQAVKEAVVEAIKIWSQSGEQD