; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023593 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023593
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHTH myb-type domain-containing protein
Genome locationtig00000892:4824087..4826202
RNA-Seq ExpressionSgr023593
SyntenySgr023593
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR044848 - PHR1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604776.1 putative Myb family transcription factor, partial [Cucurbita argyrosperma subsp. sororia]2.1e-3980.67Show/hide
Query:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
        FDLNEEA  ID            EENSSSNNGS+ EEK++E +R +G VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
Subjt:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA

Query:  HVKSHLQMYRSKKLDESGQ
        HVKSHLQMYRSKKLDESGQ
Subjt:  HVKSHLQMYRSKKLDESGQ

KAG7034901.1 putative Myb family transcription factor, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-3980.67Show/hide
Query:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
        FDLNEEA  ID            EENSSSNNGS+ EEK++E +R +G VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
Subjt:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA

Query:  HVKSHLQMYRSKKLDESGQ
        HVKSHLQMYRSKKLDESGQ
Subjt:  HVKSHLQMYRSKKLDESGQ

XP_022947230.1 uncharacterized protein LOC111451155 [Cucurbita moschata]2.1e-3980.67Show/hide
Query:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
        FDLNEEA  ID            EENSSSNNGS+ EEK++E +R +G VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
Subjt:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA

Query:  HVKSHLQMYRSKKLDESGQ
        HVKSHLQMYRSKKLDESGQ
Subjt:  HVKSHLQMYRSKKLDESGQ

XP_022970836.1 two-component response regulator ARR10-like [Cucurbita maxima]2.1e-3980.67Show/hide
Query:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
        FDLNEEA  ID            EENSSSNNGS+ EEK++E +R +G VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
Subjt:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA

Query:  HVKSHLQMYRSKKLDESGQ
        HVKSHLQMYRSKKLDESGQ
Subjt:  HVKSHLQMYRSKKLDESGQ

XP_023533211.1 uncharacterized protein LOC111795167 [Cucurbita pepo subsp. pepo]2.1e-3980.67Show/hide
Query:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
        FDLNEEA  ID            EENSSSNNGS+ EEK++E +R +G VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
Subjt:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA

Query:  HVKSHLQMYRSKKLDESGQ
        HVKSHLQMYRSKKLDESGQ
Subjt:  HVKSHLQMYRSKKLDESGQ

TrEMBL top hitse value%identityAlignment
A0A0A0KBB2 HTH myb-type domain-containing protein6.5e-3977.6Show/hide
Query:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSMEEKDDEQ-------RRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNV
        FDLNEEAKGIDE   +   G     N+SSNNGSMEE + E+       R   G+VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNV
Subjt:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSMEEKDDEQ-------RRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNV

Query:  RGLSIAHVKSHLQMYRSKKLDESGQ
        RGLSIAHVKSHLQMYRSKKLDESGQ
Subjt:  RGLSIAHVKSHLQMYRSKKLDESGQ

A0A1S3BJ91 probable transcription factor KAN2 isoform X25.0e-3979.2Show/hide
Query:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSMEEKD-----DEQRRST--GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNV
        FDLNEEAKG+DE   +   G     N+SSNNGSMEE +     DEQR  +  GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNV
Subjt:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSMEEKD-----DEQRRST--GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNV

Query:  RGLSIAHVKSHLQMYRSKKLDESGQ
        RGLSIAHVKSHLQMYRSKKLDESGQ
Subjt:  RGLSIAHVKSHLQMYRSKKLDESGQ

A0A1S3BK41 uncharacterized protein LOC103490495 isoform X15.0e-3979.2Show/hide
Query:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSMEEKD-----DEQRRST--GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNV
        FDLNEEAKG+DE   +   G     N+SSNNGSMEE +     DEQR  +  GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNV
Subjt:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSMEEKD-----DEQRRST--GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNV

Query:  RGLSIAHVKSHLQMYRSKKLDESGQ
        RGLSIAHVKSHLQMYRSKKLDESGQ
Subjt:  RGLSIAHVKSHLQMYRSKKLDESGQ

A0A6J1G6A3 uncharacterized protein LOC1114511551.0e-3980.67Show/hide
Query:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
        FDLNEEA  ID            EENSSSNNGS+ EEK++E +R +G VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
Subjt:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA

Query:  HVKSHLQMYRSKKLDESGQ
        HVKSHLQMYRSKKLDESGQ
Subjt:  HVKSHLQMYRSKKLDESGQ

A0A6J1I531 two-component response regulator ARR10-like1.0e-3980.67Show/hide
Query:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
        FDLNEEA  ID            EENSSSNNGS+ EEK++E +R +G VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA
Subjt:  FDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM-EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIA

Query:  HVKSHLQMYRSKKLDESGQ
        HVKSHLQMYRSKKLDESGQ
Subjt:  HVKSHLQMYRSKKLDESGQ

SwissProt top hitse value%identityAlignment
A0A0P0X0C0 Myb family transcription factor MPH19.4e-1966.67Show/hide
Query:  VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRS
        VR+Y+RS++PR+RWT ++H  FV AVE LGGQ+ ATPK +LQLM V+G+SI+H+KSHLQMYRS
Subjt:  VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRS

A3AWH5 Myb family transcription factor MOF14.8e-2366.23Show/hide
Query:  GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDESGQGRVS
        G+VR+Y RSK+PRLRWT +LH +FV A+E LGGQ++ATPKL+LQLM V+GL+I+HVKSHLQMYR  +L   G GR S
Subjt:  GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDESGQGRVS

Q700D9 Putative Myb family transcription factor At1g146002.9e-2060.26Show/hide
Query:  GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDESGQGRVSN
        G VR Y RS +PRLRWTP+LH +FV+AV+ LGGQ +ATPKLVL++M+V+GL+I+HVKSHLQMYR  ++   G+   S+
Subjt:  GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDESGQGRVSN

Q93WJ9 Transcription repressor KAN11.7e-1567.24Show/hide
Query:  KMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKK
        + PR+RWT  LH  FV+AVE LGG ERATPK VL+LM+V+ L++AHVKSHLQMYR+ K
Subjt:  KMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKK

Q9FJV5 Probable transcription factor KAN48.8e-1754.02Show/hide
Query:  RRSTGSVRKYSRS-KMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDES---GQGRVSNQ
        +RS+ S+    RS + PR+RWT  LH  FV+AV+ LGG ERATPK VL+LMNV+ L++AHVKSHLQMYR+ K  +    G+G+V  +
Subjt:  RRSTGSVRKYSRS-KMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDES---GQGRVSNQ

Arabidopsis top hitse value%identityAlignment
AT1G14600.1 Homeodomain-like superfamily protein2.1e-2160.26Show/hide
Query:  GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDESGQGRVSN
        G VR Y RS +PRLRWTP+LH +FV+AV+ LGGQ +ATPKLVL++M+V+GL+I+HVKSHLQMYR  ++   G+   S+
Subjt:  GSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDESGQGRVSN

AT2G02060.1 Homeodomain-like superfamily protein2.1e-2175.81Show/hide
Query:  VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYR
        VR Y RS +PRLRWTPDLH  FV+AVE LGGQ RATPKLVL++M+V+GL+I+HVKSHLQMYR
Subjt:  VRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYR

AT2G38300.1 myb-like HTH transcriptional regulator family protein4.2e-3058.91Show/hide
Query:  EGSGKSCSVVRFDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSMEEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQ
        EG GK+        N E +  +E+E+++E G   EE+  S+N ++EE D + +     VR Y RSK+PRLRWTPDLHL FV AVERLGGQERATPKLV Q
Subjt:  EGSGKSCSVVRFDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSMEEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQ

Query:  LMNVRGLSIAHVKSHLQMYRSKKLDESGQ
        +MN++GLSIAHVKSHLQMYRSKK+D+ GQ
Subjt:  LMNVRGLSIAHVKSHLQMYRSKKLDESGQ

AT2G40260.1 Homeodomain-like superfamily protein1.4e-3060.8Show/hide
Query:  NEEAKGIDEDEDDDERGGCAEE----NSSSNNGSMEEKD-----DEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVR
        NEE    D+DE+DDE G   EE    + S ++ S EE+      D+ +++ GSVR Y+RSK PRLRWTP+LH+ F+ AVERLGG +RATPKLVLQLMNV+
Subjt:  NEEAKGIDEDEDDDERGGCAEE----NSSSNNGSMEEKD-----DEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVR

Query:  GLSIAHVKSHLQMYRSKKLDESGQG
        GLSIAHVKSHLQMYRSKK DE  +G
Subjt:  GLSIAHVKSHLQMYRSKKLDESGQG

AT2G42660.1 Homeodomain-like superfamily protein4.3e-2765.26Show/hide
Query:  EENSSSNNGSMEEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDESGQ
        EE    NN      +DE  + T +VR+Y RS MPRLRWTPDLHL+FV AV+RLGG +RATPKLVL++MN++GLSIAHVKSHLQMYRSKKL+ S +
Subjt:  EENSSSNNGSMEEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDESGQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCAGTACCCCCCTAGTTGCAGTTGCAGAGGTCAACTATTAAACATCATCACGGATGGGCAGCAGGCCACACATAATAGAAAGACCCGCAGAGAGAGAGAGAGAAA
TCAAGAGAGAGAGAGAGAGAGACAAGCTGTCAAAGAGAACAATCGAAGAGAGGAGAGAGAAGATGAGACGATAAATAATGGGGGAGAAGGGAGTGGGAAGTCGTGTTCAG
TTGTGAGGTTTGATTTGAACGAAGAAGCTAAAGGCATTGATGAGGATGAGGATGATGATGAGAGAGGTGGATGTGCAGAAGAGAACTCGAGCAGTAACAATGGAAGCATG
GAGGAGAAAGATGATGAGCAGAGGAGGTCGACGGGGAGTGTTAGAAAATATTCGAGATCCAAAATGCCAAGGCTTCGTTGGACTCCTGATCTTCACCTAGCCTTTGTAAA
TGCTGTTGAAAGACTCGGTGGACAAGAAAGAGCAACTCCAAAGTTGGTTCTTCAGTTGATGAATGTGAGAGGGCTCAGCATTGCTCATGTAAAGAGTCATTTGCAGATGT
ATCGAAGTAAGAAGTTGGATGAGTCTGGACAAGGTAGAGTCTCTAATCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGCAGTACCCCCCTAGTTGCAGTTGCAGAGGTCAACTATTAAACATCATCACGGATGGGCAGCAGGCCACACATAATAGAAAGACCCGCAGAGAGAGAGAGAGAAA
TCAAGAGAGAGAGAGAGAGAGACAAGCTGTCAAAGAGAACAATCGAAGAGAGGAGAGAGAAGATGAGACGATAAATAATGGGGGAGAAGGGAGTGGGAAGTCGTGTTCAG
TTGTGAGGTTTGATTTGAACGAAGAAGCTAAAGGCATTGATGAGGATGAGGATGATGATGAGAGAGGTGGATGTGCAGAAGAGAACTCGAGCAGTAACAATGGAAGCATG
GAGGAGAAAGATGATGAGCAGAGGAGGTCGACGGGGAGTGTTAGAAAATATTCGAGATCCAAAATGCCAAGGCTTCGTTGGACTCCTGATCTTCACCTAGCCTTTGTAAA
TGCTGTTGAAAGACTCGGTGGACAAGAAAGAGCAACTCCAAAGTTGGTTCTTCAGTTGATGAATGTGAGAGGGCTCAGCATTGCTCATGTAAAGAGTCATTTGCAGATGT
ATCGAAGTAAGAAGTTGGATGAGTCTGGACAAGGTAGAGTCTCTAATCAATAG
Protein sequenceShow/hide protein sequence
MGQYPPSCSCRGQLLNIITDGQQATHNRKTRRERERNQERERERQAVKENNRREEREDETINNGGEGSGKSCSVVRFDLNEEAKGIDEDEDDDERGGCAEENSSSNNGSM
EEKDDEQRRSTGSVRKYSRSKMPRLRWTPDLHLAFVNAVERLGGQERATPKLVLQLMNVRGLSIAHVKSHLQMYRSKKLDESGQGRVSNQ