; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019208 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019208
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTranscription factor bHLH35-like protein
Genome locationtig00153293:658811..659546
RNA-Seq ExpressionSgr019208
SyntenySgr019208
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034655.1 hypothetical protein SDJN02_04385, partial [Cucurbita argyrosperma subsp. argyrosperma]5.3e-6677.05Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK
        MVST Q+RF SR+KLRLVRSL T +S G QS  FWNA LF+HKLKLKLEAIEREYSNLL+MKREYLNSVK +H PK          EVKV+KVGEE  VK
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK

Query:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI
        VRCEKGGDRLV+VLEAFDKMGLNVLQA+VSC++ FSMEAIAVAEDEQSLN+ DIT+AIN AIDGK LAANQ QKETPEKD EI
Subjt:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI

XP_022142089.1 uncharacterized protein LOC111012303 [Momordica charantia]2.9e-6476.63Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYL-NSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTV
        MVSTLQR+FASRRKLRLVR LA +ESPG QSCVFWNA LFIHKLKLKLEAIEREYSNLLA KRE L NSVK  H PK          EV+V+K+G+EF V
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYL-NSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTV

Query:  KVRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI
        ++RCEKGGDRLVSVLEAFDKMGLNVL+ARVSCTD FSMEA+AVAE EQSL+VRDI +AINVAIDGK LAANQ Q+E PE++ EI
Subjt:  KVRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI

XP_022925942.1 uncharacterized protein LOC111433204 [Cucurbita moschata]1.5e-6576.5Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK
        MVST Q+RF SR+KLRLVRSL T +S G QS  FWNA LF+HKLKLKLEAIEREYSNLL+MKREYLNSVK +H PK          EVKV+KVGEE  VK
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK

Query:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI
        VRCEKGGDRLV+VLEAFDKMGLNVLQA+VSC++ FSMEAIAVAEDEQSLN+ DIT+AIN AIDGK LAANQ QKETP+KD EI
Subjt:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI

XP_022978770.1 uncharacterized protein LOC111478631 [Cucurbita maxima]1.8e-6677.6Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK
        MVST Q+RF SR+KLRLVRSL T +S G QS  FWNA LF+HKLKLKLEAIEREYSNLL+MKREYLNSVK +H PK          EVKV+KVGEEF VK
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK

Query:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI
        VRCEKGGDRLV+VLEAFDKMGLNVLQA+VSC++ FSMEAIAVAEDEQSLN+ DIT+AIN AIDGK LAANQ QKETPEKD EI
Subjt:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI

XP_023544520.1 uncharacterized protein LOC111804071 [Cucurbita pepo subsp. pepo]3.1e-6677.05Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK
        MVST Q+RF SR+KLRLVRSL T +S G QS  FWNA LF+HKLKLKLEAIEREYSNLL+MKREYLNSVK +H PK          EVKV+KVGEEF VK
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK

Query:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI
        VRCEKGGDRLV+VLEAFDKMGLNVLQA+VSC++ FSMEAIAVAEDEQSLN+ DIT+AIN AI+GK LAANQ QKETPEKD EI
Subjt:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI

TrEMBL top hitse value%identityAlignment
A0A5A7UBN8 Uncharacterized protein3.0e-5971.74Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK
        MVSTLQ+RF SR KLRLVRSL T ES G Q+CVFWNA LFIHKLKLKLEAIEREYSNLL MKREYLNS+K  H  K          EVKV+K GEEF VK
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK

Query:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAED-EQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI
        VRCEKGGDRLVSVLEAF+KMGLNV++ARVSCT+ F MEAIAVAED  Q LN+ DIT AINVAID K LA NQH  + P  D+++
Subjt:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAED-EQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI

A0A6J1CKK9 uncharacterized protein LOC1110123031.4e-6476.63Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYL-NSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTV
        MVSTLQR+FASRRKLRLVR LA +ESPG QSCVFWNA LFIHKLKLKLEAIEREYSNLLA KRE L NSVK  H PK          EV+V+K+G+EF V
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYL-NSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTV

Query:  KVRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI
        ++RCEKGGDRLVSVLEAFDKMGLNVL+ARVSCTD FSMEA+AVAE EQSL+VRDI +AINVAIDGK LAANQ Q+E PE++ EI
Subjt:  KVRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI

A0A6J1ED01 uncharacterized protein LOC1114332047.4e-6676.5Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK
        MVST Q+RF SR+KLRLVRSL T +S G QS  FWNA LF+HKLKLKLEAIEREYSNLL+MKREYLNSVK +H PK          EVKV+KVGEE  VK
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK

Query:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI
        VRCEKGGDRLV+VLEAFDKMGLNVLQA+VSC++ FSMEAIAVAEDEQSLN+ DIT+AIN AIDGK LAANQ QKETP+KD EI
Subjt:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI

A0A6J1IR65 uncharacterized protein LOC1114786318.8e-6777.6Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK
        MVST Q+RF SR+KLRLVRSL T +S G QS  FWNA LF+HKLKLKLEAIEREYSNLL+MKREYLNSVK +H PK          EVKV+KVGEEF VK
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK

Query:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI
        VRCEKGGDRLV+VLEAFDKMGLNVLQA+VSC++ FSMEAIAVAEDEQSLN+ DIT+AIN AIDGK LAANQ QKETPEKD EI
Subjt:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI

E5GC31 Uncharacterized protein3.0e-5971.74Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK
        MVSTLQ+RF SR KLRLVRSL T ES G Q+CVFWNA LFIHKLKLKLEAIEREYSNLL MKREYLNS+K  H  K          EVKV+K GEEF VK
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK

Query:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAED-EQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI
        VRCEKGGDRLVSVLEAF+KMGLNV++ARVSCT+ F MEAIAVAED  Q LN+ DIT AINVAID K LA NQH  + P  D+++
Subjt:  VRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAED-EQSLNVRDITQAINVAIDGK-LAANQHQKETPEKDSEI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G29270.1 unknown protein2.0e-2340.24Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGH-QSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTV
        MV++ Q++ AS+ K   +++L   +   H QS V   A L+I  LKL++EA++REY +L   K+E L+                  QEVKV+K+GE F V
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGH-QSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTV

Query:  KVRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSL-NVRDITQAINVAI
        K++  +G + LV++LEAF++MGLNV QAR SC D F+MEAI   + +  L +V D+TQ +  A+
Subjt:  KVRCEKGGDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSL-NVRDITQAINVAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)3.0e-1132.93Show/hide
Query:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK
        MVS  Q+R + + K +L+RS+ TN    + + +  +A  +I KLK K+E   ++ +           S      PK           V V+ + + F + 
Subjt:  MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVK

Query:  VRCEKG-GDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVA-EDEQSLNVRDITQAINVAI
        V   K     LVSVLEAF+ +GLNVL+AR SCTD FS+ A+ +  ED ++++   + QA+  AI
Subjt:  VRCEKG-GDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVA-EDEQSLNVRDITQAINVAI

AT3G56220.1 transcription regulators3.7e-0931.33Show/hide
Query:  MVSTLQRRFAS-RRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTV
        MVS   +R +S R K  L+RS+  + +    S +  +A  +I KLK K+E I    +N    ++ +  S                +  V V+ + + F +
Subjt:  MVSTLQRRFAS-RRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTV

Query:  KVRCEKG-GDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVA--EDEQSLNVRDITQAINVAI
        KV   K     LV VLE F+ +GL+V++ARVSCTD FS+ AI  +  +D   ++   + QA+  AI
Subjt:  KVRCEKG-GDRLVSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVA--EDEQSLNVRDITQAINVAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCTACATTACAAAGGAGATTCGCCTCACGCAGGAAGCTACGCCTTGTTCGATCGCTGGCCACCAATGAATCACCGGGACATCAGAGTTGTGTCTTCTGGAATGC
TTTTCTCTTCATTCATAAGCTTAAGCTCAAGCTGGAAGCAATCGAGAGAGAGTATTCGAATCTATTGGCCATGAAAAGAGAATACTTGAATTCAGTAAAGCATTTACATA
TCCCCAAGATGTGTTCTTGTGATGGGATGGGTGACCAGGAAGTGAAGGTAGACAAGGTTGGAGAAGAGTTTACAGTGAAAGTGAGATGCGAAAAGGGAGGGGACAGATTG
GTTTCAGTATTGGAGGCCTTTGACAAAATGGGTCTCAATGTTCTGCAAGCTAGAGTTTCATGTACCGATGGTTTTTCCATGGAAGCCATTGCTGTAGCTGAAGATGAACA
ATCTCTTAATGTAAGAGACATAACTCAAGCTATCAATGTAGCCATTGATGGGAAATTAGCTGCAAATCAGCATCAGAAAGAAACTCCTGAGAAAGATTCTGAAATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTCTACATTACAAAGGAGATTCGCCTCACGCAGGAAGCTACGCCTTGTTCGATCGCTGGCCACCAATGAATCACCGGGACATCAGAGTTGTGTCTTCTGGAATGC
TTTTCTCTTCATTCATAAGCTTAAGCTCAAGCTGGAAGCAATCGAGAGAGAGTATTCGAATCTATTGGCCATGAAAAGAGAATACTTGAATTCAGTAAAGCATTTACATA
TCCCCAAGATGTGTTCTTGTGATGGGATGGGTGACCAGGAAGTGAAGGTAGACAAGGTTGGAGAAGAGTTTACAGTGAAAGTGAGATGCGAAAAGGGAGGGGACAGATTG
GTTTCAGTATTGGAGGCCTTTGACAAAATGGGTCTCAATGTTCTGCAAGCTAGAGTTTCATGTACCGATGGTTTTTCCATGGAAGCCATTGCTGTAGCTGAAGATGAACA
ATCTCTTAATGTAAGAGACATAACTCAAGCTATCAATGTAGCCATTGATGGGAAATTAGCTGCAAATCAGCATCAGAAAGAAACTCCTGAGAAAGATTCTGAAATTTAG
Protein sequenceShow/hide protein sequence
MVSTLQRRFASRRKLRLVRSLATNESPGHQSCVFWNAFLFIHKLKLKLEAIEREYSNLLAMKREYLNSVKHLHIPKMCSCDGMGDQEVKVDKVGEEFTVKVRCEKGGDRL
VSVLEAFDKMGLNVLQARVSCTDGFSMEAIAVAEDEQSLNVRDITQAINVAIDGKLAANQHQKETPEKDSEI