; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021418 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021418
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHTH-type transcriptional regulator protein ptxE
Genome locationtig00153666:947114..947587
RNA-Seq ExpressionSgr021418
SyntenySgr021418
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149511.1 uncharacterized protein LOC111017924 [Momordica charantia]3.4e-6186.62Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDSTNVAT+KLIL+DG LLE+SYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDD DELQLGQLYFALPLDRLNQPLHAEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE
        MKAGG   EKCGSRRT ISP L SDEE  KV +RSVAKK + G R+KFTAKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE

XP_022947655.1 uncharacterized protein LOC111451454 [Cucurbita moschata]2.7e-5877.71Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NV+T+KLILSDG LLE+SYPVKVSYVL KDPASFICNSD+MDF+DVV+A+DDDDELQLGQLYFALPL++LN+PLHAE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE
        MKAGGGGSEKCGSRR     ++ S+EEL+K PR+ V K    GG +KFTAKL AIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE

XP_023007426.1 uncharacterized protein LOC111499928 [Cucurbita maxima]1.8e-5978.98Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NV+T+KLILSDG LLE+SYPVKVSYVLQKDPASFICNSD+MDF+DVV+A+DD+DELQLGQLYFALPL++LN+PLHAE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE
        MKAGGGGSEKCGSRR    P++ S+EEL+K PRR V K    GG +KFTAKL AIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE

XP_023533314.1 uncharacterized protein LOC111795245 isoform X1 [Cucurbita pepo subsp. pepo]2.0e-5877.71Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NV+T+KLILSDG LLE+SYPVKVSYVL KDPASFICNSD+MDF+DVV+A+DDDDELQLGQLYFALPL++LN+PLHAE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE
        MKAGGGGSEKCGSRR    P++  +EEL+K PR+ V K    GG +KFTAKL AIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE

XP_038902080.1 uncharacterized protein LOC120088720 [Benincasa hispida]1.7e-6081.01Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGIC+SSD+ NVAT+KLIL+DG L+EFSYPVKVSY+LQK PASFICNSDEMDFDDVV A+DDDDELQLGQLYFALPLDRLNQPL AEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGR-KKFTAKLSAIPE
        MKAGGGG+EKCGSRRT ISP+  SDEE +K PR+ + K   G GR +KFTAKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGR-KKFTAKLSAIPE

TrEMBL top hitse value%identityAlignment
A0A6J1D8L2 uncharacterized protein LOC1110179241.6e-6186.62Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDSTNVAT+KLIL+DG LLE+SYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDD DELQLGQLYFALPLDRLNQPLHAEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE
        MKAGG   EKCGSRRT ISP L SDEE  KV +RSVAKK + G R+KFTAKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE

A0A6J1EJ90 uncharacterized protein LOC1114349664.6e-5677.07Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NVAT+KLIL+DG LLEFSYPVKVS++L K PA+FICNSD+MDFDD V A+ DDD LQLG LYFALPLDRLNQPLH EEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE
        +KAG GG+EK GSRRT +SPL  SDEE +K P RS+ +K +GGGR KF AKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE

A0A6J1G7I2 uncharacterized protein LOC1114514541.3e-5877.71Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NV+T+KLILSDG LLE+SYPVKVSYVL KDPASFICNSD+MDF+DVV+A+DDDDELQLGQLYFALPL++LN+PLHAE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE
        MKAGGGGSEKCGSRR     ++ S+EEL+K PR+ V K    GG +KFTAKL AIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE

A0A6J1I7K7 uncharacterized protein LOC1114703503.9e-5575.8Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NVAT+KLIL+DG LLEFSYPVKVS++L K PA+FICNSD+MDFDD V A+ DDD LQLG LYFALPLDRLNQPLH EEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE
        +KAG GG+EK GSRRT +SP+  SDEE +K PRR + +K +G GR KF AKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE

A0A6J1L0H7 uncharacterized protein LOC1114999288.9e-6078.98Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGICISSDS NV+T+KLILSDG LLE+SYPVKVSYVLQKDPASFICNSD+MDF+DVV+A+DD+DELQLGQLYFALPL++LN+PLHAE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE
        MKAGGGGSEKCGSRR    P++ S+EEL+K PRR V K    GG +KFTAKL AIPE
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G76600.1 unknown protein2.1e-1637.14Show/hide
Query:  MGICISSDS----TNVATSKLILSDGRLLEFSYPVKVSYVLQKDPAS----------FICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHA
        MG+C+S +     ++  T+K++  +G L E+  PV  S VL+ +  S          F+CNSD + +DD + AI+ D+ LQ  Q+YF LP+ +    L A
Subjt:  MGICISSDS----TNVATSKLILSDGRLLEFSYPVKVSYVLQKDPAS----------FICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHA

Query:  EEMAALAVKASSALMKAGGGGSEKCGSRRTPISPLLVSDE
         +MAALAVKAS A+ KA G  + +  S R  ISP++  ++
Subjt:  EEMAALAVKASSALMKAGGGGSEKCGSRRTPISPLLVSDE

AT2G23690.1 unknown protein3.4e-4359.51Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGIC S +ST VAT+KLIL DGR++EF+ PVKV YVLQK+P  FICNSD+MDFD+VVSAI  D+E QLGQLYFALPL  L+  L AEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGG-GSEKCGSRRTPISPLLVSDEELKKV-----PRRSVAKKAAGGGRKKFTAKLSAIPE
        M++GG  G +KC  RR  +SP++ S   +  V      R    +   G GR+K+ AKLS I E
Subjt:  MKAGGG-GSEKCGSRRTPISPLLVSDEELKKV-----PRRSVAKKAAGGGRKKFTAKLSAIPE

AT3G50800.1 unknown protein5.8e-3551.52Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MG C S +S    T+KLIL DG L EFS PVKV  +LQK+P SF+CNSD+MDFDD V A+   ++L+ G+LYF LPL  LN PL A+EMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGG--------GRKKFTAKLSAIPE
         K+GGGG             L  +DE++ +   R V +   GG        GR+KFTA+LS+I E
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGG--------GRKKFTAKLSAIPE

AT4G37240.1 unknown protein1.1e-3856.9Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MGIC SS+ST VAT+KLIL DGR++EF+ PVKV YVL K P  FICNSD+MDFDD V+AI  D+ELQLGQ+YFALPL  L QPL AEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKV-----------PRRSVAKKAAGGG------RKKFTAKLSAIPE
        M+ GGG     G RR  + P +VSD+   +V            RR V     GG       RK + A+LS I E
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKV-----------PRRSVAKKAAGGG------RKKFTAKLSAIPE

AT5G66580.1 unknown protein1.1e-3352.87Show/hide
Query:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL
        MG C S +S    ++KLIL DG L EFS PVKV  +LQK+P SF+CNSDEMDFDD VSA+  ++EL+ GQLYF LPL  LN PL AEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE
         K+GG G    G      S      + +  V       +  G G+++FTA LS I E
Subjt:  MKAGGGGSEKCGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTATTTGCATTTCTTCCGATTCCACCAATGTTGCTACGTCGAAACTGATACTATCCGACGGAAGATTGCTGGAATTCTCCTACCCAGTTAAAGTCTCATACGTGTT
ACAGAAGGATCCGGCGAGTTTCATCTGCAACTCCGACGAGATGGATTTCGACGACGTCGTTTCCGCCATAGACGACGACGACGAGCTCCAACTCGGCCAGCTCTACTTCG
CCTTGCCTTTGGATAGGCTGAACCAGCCGCTGCACGCCGAGGAAATGGCTGCATTGGCCGTCAAAGCCAGCTCCGCGCTCATGAAGGCCGGCGGAGGCGGAAGTGAAAAA
TGTGGGTCTCGTCGGACGCCGATCTCGCCGCTGCTGGTTTCCGACGAGGAGTTGAAGAAAGTCCCACGAAGAAGCGTTGCCAAGAAAGCAGCGGGTGGTGGAAGAAAAAA
ATTCACGGCGAAACTGAGTGCAATTCCGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTATTTGCATTTCTTCCGATTCCACCAATGTTGCTACGTCGAAACTGATACTATCCGACGGAAGATTGCTGGAATTCTCCTACCCAGTTAAAGTCTCATACGTGTT
ACAGAAGGATCCGGCGAGTTTCATCTGCAACTCCGACGAGATGGATTTCGACGACGTCGTTTCCGCCATAGACGACGACGACGAGCTCCAACTCGGCCAGCTCTACTTCG
CCTTGCCTTTGGATAGGCTGAACCAGCCGCTGCACGCCGAGGAAATGGCTGCATTGGCCGTCAAAGCCAGCTCCGCGCTCATGAAGGCCGGCGGAGGCGGAAGTGAAAAA
TGTGGGTCTCGTCGGACGCCGATCTCGCCGCTGCTGGTTTCCGACGAGGAGTTGAAGAAAGTCCCACGAAGAAGCGTTGCCAAGAAAGCAGCGGGTGGTGGAAGAAAAAA
ATTCACGGCGAAACTGAGTGCAATTCCGGAATAA
Protein sequenceShow/hide protein sequence
MGICISSDSTNVATSKLILSDGRLLEFSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDDDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASSALMKAGGGGSEK
CGSRRTPISPLLVSDEELKKVPRRSVAKKAAGGGRKKFTAKLSAIPE