; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS016285 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS016285
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionMyb family transcription factor
Genome locationscaffold1407:95801..96255
RNA-Seq ExpressionMS016285
SyntenyMS016285
Gene Ontology termsGO:0009913 - epidermal cell differentiation (biological process)
GO:0010063 - positive regulation of trichoblast fate specification (biological process)
GO:0010376 - stomatal complex formation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3960445.1 hypothetical protein CMV_014845 [Castanea mollissima]5.5e-1959.3Show/hide
Query:  KKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        K M K+PRKQ+K+R+S SE  VS            S+EW  I+++EQEEDLICRM++LVGDRWDLIAGR+PGR   EIERFWI++H
Subjt:  KKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

KAF3960446.1 hypothetical protein CMV_014845 [Castanea mollissima]1.2e-1858.14Show/hide
Query:  KKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        K M K+PRKQ+K+R+S SE             +V+S+EW  I+++EQEEDLICRM++LVGDRWDLIAGR+PGR   EIERFWI++H
Subjt:  KKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

XP_022146038.1 transcription factor CPC-like isoform X2 [Momordica charantia]9.7e-3282.8Show/hide
Query:  MEDEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHAS
        MEDEAKKM KQP+KQSKSRNSSSE             +VTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHAS
Subjt:  MEDEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHAS

XP_023912293.1 transcription factor CPC-like isoform X1 [Quercus suber]5.5e-1959.52Show/hide
Query:  MAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        M K+PRKQ+K+R+S+SE  VS            S+EW  I+++EQEEDLICRM++LVGDRWDLIAGRIPGR   +IERFWI++H
Subjt:  MAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

XP_038874276.1 transcription factor CPC-like [Benincasa hispida]1.4e-2572.04Show/hide
Query:  MEDEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHAS
        ME EAKK AKQP+KQSK R S+SE             +VTSLEW++IQIS+QEEDLI RMH+LVGDRWDLIAGRIPGRTAVEIERFWILKH S
Subjt:  MEDEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHAS

TrEMBL top hitse value%identityAlignment
A0A2I4E811 transcription factor CPC-like1.1e-1757.14Show/hide
Query:  MAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        M K+PRKQ+KS  SS+             ++V+S+EW+ I+++EQEEDLI RM+KLVGDRWDLIAGR+PGR   EIERFWI++H
Subjt:  MAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

A0A2K1XT14 SANT domain-containing protein1.1e-1758.33Show/hide
Query:  MAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        M ++ +KQ+K+ +  SE   S +LL   L +V+S+EW+ I +SEQEEDLI RMH LVGDRW LIAGRIPGR A EIERFW+++H
Subjt:  MAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

A0A654F2G6 SANT domain-containing protein5.0e-1855.06Show/hide
Query:  DEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        D+A+KM K+ R+QSK++ S SE             +V+S+EW+ +++SE+EEDLI RM+KLVGDRW+LIAGRIPGRT  EIER+W++KH
Subjt:  DEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

A0A6J1CXI2 transcription factor CPC-like isoform X24.7e-3282.8Show/hide
Query:  MEDEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHAS
        MEDEAKKM KQP+KQSKSRNSSSE             +VTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHAS
Subjt:  MEDEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHAS

A0A7J9N1M8 SANT domain-containing protein2.5e-1759.49Show/hide
Query:  RKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        RKQ K+    SEG ++L     ++ +V+S+EW+ I +SEQEEDLI RM+KLVGD+W LIAGRIPGR A EIERFWI++H
Subjt:  RKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

SwissProt top hitse value%identityAlignment
O22059 Transcription factor CPC1.4e-2055.06Show/hide
Query:  DEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        D+A+KM K+ R+QSK++ S SE             +V+S+EW+ +++SE+EEDLI RM+KLVGDRW+LIAGRIPGRT  EIER+W++KH
Subjt:  DEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

Q84RD1 MYB-like transcription factor ETC22.2e-1560.71Show/hide
Query:  KKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHA
        ++V+S+EW+ I ++EQEEDLI RM++LVG+RWDLIAGR+ GR A EIER+WI++++
Subjt:  KKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHA

Q8GV05 Transcription factor TRY2.0e-1662.5Show/hide
Query:  KKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHA
        ++V+S+EW+ I ++EQEEDLI RM++LVGDRWDLIAGR+PGR   EIER+WI++++
Subjt:  KKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHA

Q9LNI5 MYB-like transcription factor ETC11.7e-1870.91Show/hide
Query:  KKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        ++V+SLEW+ I ++++EEDLICRM+KLVG+RWDLIAGRIPGRTA EIERFW++K+
Subjt:  KKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

Q9M157 MYB-like transcription factor ETC32.8e-1852.94Show/hide
Query:  KMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        +  KQP+  S   +SS E              V+SLEW+++ +S++EEDL+ RMHKLVGDRW+LIAGRIPGRTA EIERFW++K+
Subjt:  KMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

Arabidopsis top hitse value%identityAlignment
AT1G01380.1 Homeodomain-like superfamily protein1.2e-1970.91Show/hide
Query:  KKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        ++V+SLEW+ I ++++EEDLICRM+KLVG+RWDLIAGRIPGRTA EIERFW++K+
Subjt:  KKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

AT2G46410.1 Homeodomain-like superfamily protein9.7e-2255.06Show/hide
Query:  DEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        D+A+KM K+ R+QSK++ S SE             +V+S+EW+ +++SE+EEDLI RM+KLVGDRW+LIAGRIPGRT  EIER+W++KH
Subjt:  DEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

AT4G01060.1 CAPRICE-like MYB32.4e-2056.47Show/hide
Query:  KMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        +  KQP K +    SSSEGT           +V+SLEW+++ +S++EEDL+ RMHKLVGDRW+LIAGRIPGRTA EIERFW++K+
Subjt:  KMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

AT4G01060.2 CAPRICE-like MYB31.2e-1970.37Show/hide
Query:  KVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        +V+SLEW+++ +S++EEDL+ RMHKLVGDRW+LIAGRIPGRTA EIERFW++K+
Subjt:  KVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH

AT4G01060.3 CAPRICE-like MYB32.0e-1952.94Show/hide
Query:  KMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH
        +  KQP+  S   +SS E              V+SLEW+++ +S++EEDL+ RMHKLVGDRW+LIAGRIPGRTA EIERFW++K+
Subjt:  KMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATGAAGCTAAGAAGATGGCGAAACAGCCACGGAAACAGTCTAAGAGCAGGAATTCAAGCTCGGAAGGTACGGTTTCTCTGGTTTTGCTGCCATGGCTACTGAA
AAAAGTTACCAGTTTGGAGTGGAAACTCATTCAAATCAGTGAACAGGAGGAGGATCTCATATGTAGAATGCACAAGCTTGTTGGAGACAGGTGGGATTTGATCGCTGGAC
GAATTCCGGGACGTACTGCAGTGGAAATTGAGAGGTTTTGGATCTTGAAACATGCTTCC
mRNA sequenceShow/hide mRNA sequence
ATGGAAGATGAAGCTAAGAAGATGGCGAAACAGCCACGGAAACAGTCTAAGAGCAGGAATTCAAGCTCGGAAGGTACGGTTTCTCTGGTTTTGCTGCCATGGCTACTGAA
AAAAGTTACCAGTTTGGAGTGGAAACTCATTCAAATCAGTGAACAGGAGGAGGATCTCATATGTAGAATGCACAAGCTTGTTGGAGACAGGTGGGATTTGATCGCTGGAC
GAATTCCGGGACGTACTGCAGTGGAAATTGAGAGGTTTTGGATCTTGAAACATGCTTCC
Protein sequenceShow/hide protein sequence
MEDEAKKMAKQPRKQSKSRNSSSEGTVSLVLLPWLLKKVTSLEWKLIQISEQEEDLICRMHKLVGDRWDLIAGRIPGRTAVEIERFWILKHAS