CuGenDBv2

Sgr005026 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr005026
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: HTH-type transcriptional regulator protein ptxE
Genome location: tig00003509:134540..135013
RNA-Seq Expression: Sgr005026
Synteny: Sgr005026
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
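
The genome location field follows a `contig:start..end` pattern. As a minimal sketch, the snippet below splits such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(location: str):
    """Split 'contig:start..end' into its parts and compute the span length.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return contig, start, end, end - start + 1

contig, start, end, span = parse_location("tig00003509:134540..135013")
print(contig, start, end, span)  # tig00003509 134540 135013 474
```

Under that assumption the span is 474 bp, the same length as the CDS listed in the Sequences section of this record.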


Homology
GenBank top hits | e value | % identity
XP_022149511.1 uncharacterized protein LOC111017924 [Momordica charantia] | 6.8e-62 | 87.9
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDSTNVAT+KLIL+DG LLE+SYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDD DELQLGQLYFALPLDRLNQPLHAEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
        MKAGG   EKCGSRRT ISP L SDEE  KV +RSVAKK  SG RRKFTAKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
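
In BLAST-style alignments like the one above, the middle line shows a letter where query and subject residues are identical, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below shows how a percent identity can be derived from such a midline; `percent_identity` is a hypothetical helper, the input is a short made-up midline rather than one copied from this record, and the result is not guaranteed to reproduce the values reported in the hit tables.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    Letters mark identical residues; '+' and spaces mark non-identical columns.
    The midline must cover exactly the aligned columns (no padding).
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)

# Illustrative midline: three identical columns and one conservative substitution.
print(percent_identity("MG+C"))  # 75.0
```

When applying this to the alignment blocks in this record, the midlines of all blocks for one hit would need to be concatenated first, with the leading indentation removed.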

XP_022947655.1 uncharacterized protein LOC111451454 [Cucurbita moschata] | 4.1e-59 | 78.98
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NV+T+KLILSDG LLE+SYPVKVSYVL KDPASFICNSD+MDF+DVV+A+DDDDELQLGQLYFALPL++LN+PLHAE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
        MKAGGGGSEKCGSRR     ++ S+EEL+K PR+ V K   SGG RKFTAKL AIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE

XP_023007426.1 uncharacterized protein LOC111499928 [Cucurbita maxima] | 8.3e-60 | 79.62
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NV+T+KLILSDG LLE+SYPVKVSYVLQKDPASFICNSD+MDF+DVV+A+DD+DELQLGQLYFALPL++LN+PLHAE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
        MKAGGGGSEKCGSRR    P++ S+EEL+K PRR V K   +GG RKFTAKL AIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE

XP_023533314.1 uncharacterized protein LOC111795245 isoform X1 [Cucurbita pepo subsp. pepo] | 4.1e-59 | 78.98
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NV+T+KLILSDG LLE+SYPVKVSYVL KDPASFICNSD+MDF+DVV+A+DDDDELQLGQLYFALPL++LN+PLHAE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
        MKAGGGGSEKCGSRR    P++  +EEL+K PR+ V K   SGG RKFTAKL AIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE

XP_038902080.1 uncharacterized protein LOC120088720 [Benincasa hispida] | 4.4e-61 | 81.53
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGIC+SSD+ NVAT+KLIL+DG L+EFSYPVKVSY+LQK PASFICNSDEMDFDDVV A+DDDDELQLGQLYFALPLDRLNQPL AEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
        MKAGGGG+EKCGSRRT ISP+  SDEE +K PR+ +  K  SG  RKFTAKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE

TrEMBL top hits | e value | % identity
A0A6J1D8L2 uncharacterized protein LOC111017924 | 3.3e-62 | 87.9
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDSTNVAT+KLIL+DG LLE+SYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDD DELQLGQLYFALPLDRLNQPLHAEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
        MKAGG   EKCGSRRT ISP L SDEE  KV +RSVAKK  SG RRKFTAKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE

A0A6J1EJ90 uncharacterized protein LOC111434966 | 2.1e-56 | 77.07
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NVAT+KLIL+DG LLEFSYPVKVS++L K PA+FICNSD+MDFDD V A+ DDD LQLG LYFALPLDRLNQPLH EEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
        +KAG GG+EK GSRRT +SPL  SDEE +K P RS+ K   SGG RKF AKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE

A0A6J1G7I2 uncharacterized protein LOC111451454 | 2.0e-59 | 78.98
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NV+T+KLILSDG LLE+SYPVKVSYVL KDPASFICNSD+MDF+DVV+A+DDDDELQLGQLYFALPL++LN+PLHAE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
        MKAGGGGSEKCGSRR     ++ S+EEL+K PR+ V K   SGG RKFTAKL AIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE

A0A6J1I7K7 uncharacterized protein LOC111470350 | 1.7e-55 | 75.8
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NVAT+KLIL+DG LLEFSYPVKVS++L K PA+FICNSD+MDFDD V A+ DDD LQLG LYFALPLDRLNQPLH EEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
        +KAG GG+EK GSRRT +SP+  SDEE +K PRR + K   SG  RKF AKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE

A0A6J1L0H7 uncharacterized protein LOC111499928 | 4.0e-60 | 79.62
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NV+T+KLILSDG LLE+SYPVKVSYVLQKDPASFICNSD+MDF+DVV+A+DD+DELQLGQLYFALPL++LN+PLHAE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
        MKAGGGGSEKCGSRR    P++ S+EEL+K PRR V K   +GG RKFTAKL AIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G76600.1 unknown protein | 2.1e-16 | 37.14
Query:  MGICISSDS----TNVATSKLILSDGRLLEFSYPVKVSYVLQKDPAS----------FICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHA
        MG+C+S +     ++  T+K++  +G L E+  PV  S VL+ +  S          F+CNSD + +DD + AI+ D+ LQ  Q+YF LP+ +    L A
Subjt:  MGICISSDS----TNVATSKLILSDGRLLEFSYPVKVSYVLQKDPAS----------FICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHA

Query:  EEMAALAVKASSALMKAGGGGSEKCGSRRTPISPLLVSDE
         +MAALAVKAS A+ KA G  + +  S R  ISP++  ++
Subjt:  EEMAALAVKASSALMKAGGGGSEKCGSRRTPISPLLVSDE

AT2G23690.1 unknown protein | 7.5e-43 | 59.51
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGIC S +ST VAT+KLIL DGR++EF+ PVKV YVLQK+P  FICNSD+MDFD+VVSAI  D+E QLGQLYFALPL  L+  L AEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGG-GSEKCGSRRTPISPLLVSDEELKKV-----PRRSVAKKAASGGRRKFTAKLSAIPE
        M++GG  G +KC  RR  +SP++ S   +  V      R    +     GRRK+ AKLS I E
Subjt:  MKAGGG-GSEKCGSRRTPISPLLVSDEELKKV-----PRRSVAKKAASGGRRKFTAKLSAIPE

AT3G50800.1 unknown protein | 1.7e-34 | 51.52
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MG C S +S    T+KLIL DG L EFS PVKV  +LQK+P SF+CNSD+MDFDD V A+   ++L+ G+LYF LPL  LN PL A+EMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASG--------GRRKFTAKLSAIPE
         K+GGGG             L  +DE++ +   R V +    G        GRRKFTA+LS+I E
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASG--------GRRKFTAKLSAIPE

AT4G37240.1 unknown protein | 7.3e-38 | 61.64
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGIC SS+ST VAT+KLIL DGR++EF+ PVKV YVL K P  FICNSD+MDFDD V+AI  D+ELQLGQ+YFALPL  L QPL AEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRR
        M+ GGG     G RR  + P +VSD+   +V        + SG R+
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRR

AT5G66580.1 unknown protein | 2.4e-33 | 52.87
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MG C S +S    ++KLIL DG L EFS PVKV  +LQK+P SF+CNSDEMDFDD VSA+  ++EL+ GQLYF LPL  LN PL AEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
         K+GG G    G      S      + +  V       +    G+R+FTA LS I E
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE


Sequences
CDS sequence
ATGGGTATTTGCATTTCTTCCGATTCCACCAATGTTGCTACGTCGAAACTGATACTATCCGACGGAAGATTGCTGGAATTTTCCTACCCAGTTAAAGTCTCATACGTGTT
ACAGAAGGATCCGGCGAGTTTCATCTGCAACTCCGACGAGATGGATTTCGACGACGTCGTTTCCGCCATAGACGACGACGACGAGCTCCAACTCGGCCAGCTCTACTTCG
CCTTGCCTTTGGATAGGCTGAACCAGCCGCTGCACGCCGAGGAAATGGCTGCATTGGCCGTCAAAGCCAGCTCCGCGCTCATGAAGGCCGGCGGAGGCGGAAGTGAAAAA
TGTGGGTCTCGTCGGACGCCGATCTCGCCGCTGCTGGTTTCCGACGAGGAGTTGAAGAAAGTCCCACGAAGAAGCGTTGCGAAGAAAGCAGCGAGTGGTGGAAGGAGGAA
ATTCACGGCGAAACTGAGTGCAATTCCGGAATAA
mRNA sequence
ATGGGTATTTGCATTTCTTCCGATTCCACCAATGTTGCTACGTCGAAACTGATACTATCCGACGGAAGATTGCTGGAATTTTCCTACCCAGTTAAAGTCTCATACGTGTT
ACAGAAGGATCCGGCGAGTTTCATCTGCAACTCCGACGAGATGGATTTCGACGACGTCGTTTCCGCCATAGACGACGACGACGAGCTCCAACTCGGCCAGCTCTACTTCG
CCTTGCCTTTGGATAGGCTGAACCAGCCGCTGCACGCCGAGGAAATGGCTGCATTGGCCGTCAAAGCCAGCTCCGCGCTCATGAAGGCCGGCGGAGGCGGAAGTGAAAAA
TGTGGGTCTCGTCGGACGCCGATCTCGCCGCTGCTGGTTTCCGACGAGGAGTTGAAGAAAGTCCCACGAAGAAGCGTTGCGAAGAAAGCAGCGAGTGGTGGAAGGAGGAA
ATTCACGGCGAAACTGAGTGCAATTCCGGAATAA
Protein sequence
MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSALMKAGGGGSEK
CGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE
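
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, and the sequences are copied from this record with the line breaks removed.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

CDS = (
    "ATGGGTATTTGCATTTCTTCCGATTCCACCAATGTTGCTACGTCGAAACTGATACTATCCGACGGAAGATTGCTGGAATTTTCCTACCCAGTTAAAGTCTCATACGTGTT"
    "ACAGAAGGATCCGGCGAGTTTCATCTGCAACTCCGACGAGATGGATTTCGACGACGTCGTTTCCGCCATAGACGACGACGACGAGCTCCAACTCGGCCAGCTCTACTTCG"
    "CCTTGCCTTTGGATAGGCTGAACCAGCCGCTGCACGCCGAGGAAATGGCTGCATTGGCCGTCAAAGCCAGCTCCGCGCTCATGAAGGCCGGCGGAGGCGGAAGTGAAAAA"
    "TGTGGGTCTCGTCGGACGCCGATCTCGCCGCTGCTGGTTTCCGACGAGGAGTTGAAGAAAGTCCCACGAAGAAGCGTTGCGAAGAAAGCAGCGAGTGGTGGAAGGAGGAA"
    "ATTCACGGCGAAACTGAGTGCAATTCCGGAATAA"
)
PROTEIN = (
    "MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSALMKAGGGGSEK"
    "CGSRRTPISPLLVSDEELKKVPRRSVAKKAASGGRRKFTAKLSAIPE"
)

translated = translate(CDS)
# Compare the translation (minus the trailing stop) with the listed protein.
print(translated.rstrip("*") == PROTEIN)
```

The comparison strips the trailing `*` because the listed protein sequence does not include a symbol for the terminal stop codon.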