; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019557 (gene) of Snake gourd v1 genome

Gene IDTan0019557
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBHLH domain-containing protein
Genome locationLG05:84930836..84933783
RNA-Seq ExpressionTan0019557
SyntenyTan0019557
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0090575 - RNA polymerase II transcription factor complex (cellular component)
GO:0000977 - RNA polymerase II regulatory region sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR015660 - Achaete-scute transcription factor-related
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573268.1 Transcription factor basic helix-loop-helix 162, partial [Cucurbita argyrosperma subsp. sororia]3.9e-5166.67Show/hide
Query:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAK-TLPDQLENATNYIKELEEKVEKLKEKREKLM--DEKQRRTCNEIKRR
        MANN IHCPSSA   DRKL E NRR EM  LFS L+SLVP++SST    EA+ TL DQLENATNYIK+L+E VEKLKEK+EKLM   E+  R+ +EIK R
Subjt:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAK-TLPDQLENATNYIKELEEKVEKLKEKREKLM--DEKQRRTCNEIKRR

Query:  LLLLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV
         LL+QVEAHQVGSS+E LLTTGSDYH VL+Q+LQL+QENG +IV+++ S +  RVFHKI+A++VGEG  S   +GER+CETV KKFVSQY KD QY V
Subjt:  LLLLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV

KAG6573271.1 Transcription factor basic helix-loop-helix 162, partial [Cucurbita argyrosperma subsp. sororia]8.2e-4961.54Show/hide
Query:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLL
        MA+N I+CP S   TD K+ E NRR+EM  L S L+SLVP++SST       TLPDQLENATNYIK+L+E VEKLKEKREKLM   ++ T     +  L+
Subjt:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLL

Query:  LQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV
        +QVEAH+VGS +E+LLTTGS Y  VLRQI+QL+QENG EIV ++QS +  R FHKI+AQ+VGEG  S G+ GER+CETV KKFVS+Y KD QYTV
Subjt:  LQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV

XP_022994108.1 transcription factor bHLH162-like [Cucurbita maxima]6.3e-4961.03Show/hide
Query:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLL
        MA+N I+CPSSA  TD+++ E NRR EM  L S L+SLVP+++ST       TLPDQLENATNYIK+L+E VEKLKEKREKLM   ++ T     +  ++
Subjt:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLL

Query:  LQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV
        +QVEAH VGSS+E+LLTTGSDYH VLRQI+QL+QENG EIV ++QS +  R FHKI+AQ+ GEG    G+ GER+CE V KKFVS Y KD QYTV
Subjt:  LQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV

XP_022994474.1 transcription factor bHLH167-like [Cucurbita maxima]5.1e-5165.82Show/hide
Query:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAK-TLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLL
        MANN IHCPSS    DRKL E NRR EM  LFS L+SLVP++SST    EA+ TL DQLENATNYIK+L+E VEKLKEKREKLM  ++  T     +  L
Subjt:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAK-TLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLL

Query:  LLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV
        L+QVEAHQVGSS+EVLLTTGSDY FVL QILQL+QENG +IV+++ S +  RVFHKI+A++VGEG  S  + GER+CETV KKFVSQY KD QY V
Subjt:  LLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV

XP_023542777.1 uncharacterized protein LOC111802586 [Cucurbita pepo subsp. pepo]1.3e-5166.83Show/hide
Query:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAK-TLPDQLENATNYIKELEEKVEKLKEKREKLM---DEKQRRTCNEIKR
        MANN IHCPSSA   DRKL E NRR EM  LFS L+SLVP++SST    EA+ TL DQLENATNYIK+L+E VEKLKEK+EKLM   +E  RR  +EIK 
Subjt:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAK-TLPDQLENATNYIKELEEKVEKLKEKREKLM---DEKQRRTCNEIKR

Query:  RLLLLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV
        R LL+QVEAHQVGSS+E LLTTGS+YH VL+QILQL+QENG +IV+++ S +  RVFHKI+A++VGEG  S  ++GER+CETV KKFVSQY KD QY V
Subjt:  RLLLLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV

TrEMBL top hitse value%identityAlignment
A0A1S3B660 transcription factor bHLH118-like4.5e-4556.59Show/hide
Query:  NNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEA-KTLPDQLENATNYIKELEEKVEKLKEKREKLM-----------DEKQRRT
        +NPI C  +   +DRK  E NRRKEM  LFS LNSL+P+ +S     EA +T+PDQLE+ATNYIKEL++ ++KLKEK+E+LM           + ++RR 
Subjt:  NNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEA-KTLPDQLENATNYIKELEEKVEKLKEKREKLM-----------DEKQRRT

Query:  CNEIKRRLLLLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKD
             +  LLLQV+AHQ+GSS+EV LTTGSDYHF+L+Q+L+L+Q+NGAEI++++QSM T RVFHKI AQV GEG+  G  DGER+C+TV KKFVSQY KD
Subjt:  CNEIKRRLLLLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKD

Query:  GQYTV
         Q +V
Subjt:  GQYTV

A0A6J1GT92 uncharacterized protein LOC1114572295.2e-4965.15Show/hide
Query:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAK-TLPDQLENATNYIKELEEKVEKLKEKREKLM--DEKQRRTCNEIKRR
        MANN IH PSSA   DRKL E NRR EM  LFS L+SLVP++SS    +EA+ TL DQLENATNYIK+L+E VEKLKEK+EKLM   E+  R+ +EIK R
Subjt:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAK-TLPDQLENATNYIKELEEKVEKLKEKREKLM--DEKQRRTCNEIKRR

Query:  LLLLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV
         LL+QVEAHQVGSS+E LLTT SDYH VL+Q+LQL+QENG +IV+++ S +  RVFHKI+A++VGEG  S   +GER+CETV KKFVSQY KD QY V
Subjt:  LLLLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV

A0A6J1JZ96 transcription factor bHLH167-like7.7e-4563.54Show/hide
Query:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLL
        MA+N IH  S++  TDRK        EM  LFS L+SLVP++ ST       TLP Q+ENATNYIK+L+E VEKLKEKREKL+  ++     EIK R LL
Subjt:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLL

Query:  LQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQ
        +QVEAHQVGSS+EVLLTTGSDY  VL QILQL+QENG +I+H++ S I  RVFHKIVAQ+VGEGM+S G DGER+CETV KKFVSQY KDG+
Subjt:  LQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQ

A0A6J1K2Y3 transcription factor bHLH167-like2.5e-5165.82Show/hide
Query:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAK-TLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLL
        MANN IHCPSS    DRKL E NRR EM  LFS L+SLVP++SST    EA+ TL DQLENATNYIK+L+E VEKLKEKREKLM  ++  T     +  L
Subjt:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAK-TLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLL

Query:  LLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV
        L+QVEAHQVGSS+EVLLTTGSDY FVL QILQL+QENG +IV+++ S +  RVFHKI+A++VGEG  S  + GER+CETV KKFVSQY KD QY V
Subjt:  LLQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV

A0A6J1K483 transcription factor bHLH162-like3.0e-4961.03Show/hide
Query:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLL
        MA+N I+CPSSA  TD+++ E NRR EM  L S L+SLVP+++ST       TLPDQLENATNYIK+L+E VEKLKEKREKLM   ++ T     +  ++
Subjt:  MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLL

Query:  LQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV
        +QVEAH VGSS+E+LLTTGSDYH VLRQI+QL+QENG EIV ++QS +  R FHKI+AQ+ GEG    G+ GER+CE V KKFVS Y KD QYTV
Subjt:  LQVEAHQVGSSMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV

SwissProt top hitse value%identityAlignment
F4I4E1 Transcription factor bHLH1678.9e-0629.48Show/hide
Query:  SSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLLLQVEAHQVG
        SS++   R L E +RR  M HLFS L+S   H S T  L     +P  ++ AT+Y+ +L+E V  LKEK+  L+   Q    N  +   LL ++      
Subjt:  SSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLLLQVEAHQVG

Query:  SSMEV-LLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKK
        S++E+ L+   +    +L +++ + +E GA+++  +   +  R  + I+AQ +   ++  G D  R+ E V+K
Subjt:  SSMEV-LLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKK

F4JIJ7 Transcription factor bHLH1621.0e-1434.48Show/hide
Query:  TTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLL-----------LLQ
        + DRK  E NRR +M  L+S L SL+PH SST    E  TLPDQL+ A NYIK+L+  VEK +E++  L+        N +    +           L +
Subjt:  TTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLL-----------LLQ

Query:  VEAHQVGSSMEVLLTTGSDYHFVLRQILQ-LIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERL
        +E  + GS   + L T  ++ F+  +I++ L +E GAEI H   S++   VFH +  +V      +     ERL
Subjt:  VEAHQVGSSMEVLLTTGSDYHFVLRQILQ-LIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERL

Arabidopsis top hitse value%identityAlignment
AT1G10585.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein6.3e-0729.48Show/hide
Query:  SSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLLLQVEAHQVG
        SS++   R L E +RR  M HLFS L+S   H S T  L     +P  ++ AT+Y+ +L+E V  LKEK+  L+   Q    N  +   LL ++      
Subjt:  SSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLLLQVEAHQVG

Query:  SSMEV-LLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKK
        S++E+ L+   +    +L +++ + +E GA+++  +   +  R  + I+AQ +   ++  G D  R+ E V+K
Subjt:  SSMEV-LLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKK

AT4G20970.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein7.5e-1634.48Show/hide
Query:  TTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLL-----------LLQ
        + DRK  E NRR +M  L+S L SL+PH SST    E  TLPDQL+ A NYIK+L+  VEK +E++  L+        N +    +           L +
Subjt:  TTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLL-----------LLQ

Query:  VEAHQVGSSMEVLLTTGSDYHFVLRQILQ-LIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERL
        +E  + GS   + L T  ++ F+  +I++ L +E GAEI H   S++   VFH +  +V      +     ERL
Subjt:  VEAHQVGSSMEVLLTTGSDYHFVLRQILQ-LIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAACAACCCCATTCATTGTCCATCATCAGCAGTAACCACCGACCGAAAACTCACAGAGACAAATAGAAGAAAGGAAATGATCCATCTTTTCTCCGGCCTCAATTC
TCTCGTCCCCCATCGAAGCTCAACGTGCATGCTGTTGGAAGCTAAAACGCTGCCGGATCAGCTGGAAAATGCCACAAATTACATAAAAGAATTGGAGGAGAAGGTGGAGA
AATTGAAAGAGAAGAGAGAGAAGCTAATGGATGAAAAACAAAGAAGAACCTGCAATGAAATTAAACGGAGATTATTATTGCTGCAAGTTGAAGCTCATCAAGTGGGTTCT
TCAATGGAGGTTCTTTTGACAACTGGATCTGATTATCACTTTGTTTTAAGACAAATCCTTCAGCTGATTCAAGAAAATGGAGCTGAGATCGTCCATCTCAGTCAGTCCAT
GATCACAGTTCGAGTTTTTCACAAGATAGTAGCTCAGGTGGTTGGAGAAGGGATGGCCTCCGGAGGCAGTGATGGTGAAAGGCTTTGCGAGACTGTGAAGAAGAAGTTTG
TTTCACAGTACAATAAAGATGGCCAATACACTGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAACAACCCCATTCATTGTCCATCATCAGCAGTAACCACCGACCGAAAACTCACAGAGACAAATAGAAGAAAGGAAATGATCCATCTTTTCTCCGGCCTCAATTC
TCTCGTCCCCCATCGAAGCTCAACGTGCATGCTGTTGGAAGCTAAAACGCTGCCGGATCAGCTGGAAAATGCCACAAATTACATAAAAGAATTGGAGGAGAAGGTGGAGA
AATTGAAAGAGAAGAGAGAGAAGCTAATGGATGAAAAACAAAGAAGAACCTGCAATGAAATTAAACGGAGATTATTATTGCTGCAAGTTGAAGCTCATCAAGTGGGTTCT
TCAATGGAGGTTCTTTTGACAACTGGATCTGATTATCACTTTGTTTTAAGACAAATCCTTCAGCTGATTCAAGAAAATGGAGCTGAGATCGTCCATCTCAGTCAGTCCAT
GATCACAGTTCGAGTTTTTCACAAGATAGTAGCTCAGGTGGTTGGAGAAGGGATGGCCTCCGGAGGCAGTGATGGTGAAAGGCTTTGCGAGACTGTGAAGAAGAAGTTTG
TTTCACAGTACAATAAAGATGGCCAATACACTGTCTAA
Protein sequenceShow/hide protein sequence
MANNPIHCPSSAVTTDRKLTETNRRKEMIHLFSGLNSLVPHRSSTCMLLEAKTLPDQLENATNYIKELEEKVEKLKEKREKLMDEKQRRTCNEIKRRLLLLQVEAHQVGS
SMEVLLTTGSDYHFVLRQILQLIQENGAEIVHLSQSMITVRVFHKIVAQVVGEGMASGGSDGERLCETVKKKFVSQYNKDGQYTV