; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g1148 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g1148
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionHTH-type transcriptional regulator protein ptxE
Genome locationMC02:9934606..9935061
RNA-Seq ExpressionMC02g1148
SyntenyMC02g1148
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149511.1 uncharacterized protein LOC111017924 [Momordica charantia]6.87e-99100Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGGEKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        MKAGGEKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
Subjt:  MKAGGEKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

XP_022971678.1 uncharacterized protein LOC111470350 [Cucurbita maxima]3.58e-7277.42Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NVATAKLILTDGTLLE+SYPVKVS++L K PA+FICNSD+MDFDD V A+ D D LQLG LYFALPLDRLNQPLH EEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        +KAG    EK GSRRTA+SP  FSDEEF K  R   +KGSGR RKF AKLSAIPE
Subjt:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

XP_023007426.1 uncharacterized protein LOC111499928 [Cucurbita maxima]7.87e-7177.07Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVLQKDPASFICNSD+MDF+DVV+A+DD DELQLGQLYFALPL++LN+PLHAE+MAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGS--GRRRKFTAKLSAIPE
        MKAGG   EKCGSRR    P +FS+EE  K  R   KKG+  G  RKFTAKL AIPE
Subjt:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGS--GRRRKFTAKLSAIPE

XP_023512544.1 uncharacterized protein LOC111777254 [Cucurbita pepo subsp. pepo]4.17e-7176.77Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NVATAKLILTDGTLLE+SYPVKVS++L K PA+FICNSD+MDFDD V A+ D D LQLG LYFALPLDRLNQPLH EEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        +KAG    EK GSRRTA+SP  FSDEEF K  R   +KGSG  RKF AKLSAIPE
Subjt:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

XP_038902080.1 uncharacterized protein LOC120088720 [Benincasa hispida]3.68e-8184.52Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGIC+SSD+ NVATAKLILTDGTL+E+SYPVKVSY+LQK PASFICNSDEMDFDDVV A+DD DELQLGQLYFALPLDRLNQPL AEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        MKAGG   EKCGSRRTAISP  FSDEEF K  R   KKGSGR RKFTAKLSAIPE
Subjt:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

TrEMBL top hitse value%identityAlignment
A0A6J1D8L2 uncharacterized protein LOC1110179243.32e-99100Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGGEKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        MKAGGEKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
Subjt:  MKAGGEKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

A0A6J1EJ90 uncharacterized protein LOC1114349662.35e-7076.13Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NVATAKLILTDGTLLE+SYPVKVS++L K PA+FICNSD+MDFDD V A+ D D LQLG LYFALPLDRLNQPLH EEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        +KAG    EK GSRRTA+SP  FSDEEF K      +KGSG  RKF AKLSAIPE
Subjt:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

A0A6J1G7I2 uncharacterized protein LOC1114514543.13e-7076.43Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVL KDPASFICNSD+MDF+DVV+A+DD DELQLGQLYFALPL++LN+PLHAE+MAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKG--SGRRRKFTAKLSAIPE
        MKAGG   EKCGSRR  +    FS+EE  K  R   KKG  SG  RKFTAKL AIPE
Subjt:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKG--SGRRRKFTAKLSAIPE

A0A6J1I7K7 uncharacterized protein LOC1114703501.73e-7277.42Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NVATAKLILTDGTLLE+SYPVKVS++L K PA+FICNSD+MDFDD V A+ D D LQLG LYFALPLDRLNQPLH EEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        +KAG    EK GSRRTA+SP  FSDEEF K  R   +KGSGR RKF AKLSAIPE
Subjt:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

A0A6J1L0H7 uncharacterized protein LOC1114999283.81e-7177.07Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVLQKDPASFICNSD+MDF+DVV+A+DD DELQLGQLYFALPL++LN+PLHAE+MAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGS--GRRRKFTAKLSAIPE
        MKAGG   EKCGSRR    P +FS+EE  K  R   KKG+  G  RKFTAKL AIPE
Subjt:  MKAGG---EKCGSRRTAISPALFSDEEFGKVQRSVAKKGS--GRRRKFTAKLSAIPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G76600.1 unknown protein6.2e-1840.3Show/hide
Query:  MGICISSDS----TNVATAKLILTDGTLLEYSYPVKVSYVLQKDPAS----------FICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHA
        MG+C+S +     ++  TAK++  +G L EY  PV  S VL+ +  S          F+CNSD + +DD + AI+  + LQ  Q+YF LP+ +    L A
Subjt:  MGICISSDS----TNVATAKLILTDGTLLEYSYPVKVSYVLQKDPAS----------FICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHA

Query:  EEMAALAVKASAALMKAGGEKCGSRRTA-ISPAL
         +MAALAVKAS A+ KA G+K   RR+  ISP +
Subjt:  EEMAALAVKASAALMKAGGEKCGSRRTA-ISPAL

AT2G23690.1 unknown protein1.2e-4058.28Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGIC S +ST VATAKLIL DG ++E++ PVKV YVLQK+P  FICNSD+MDFD+VVSAI   +E QLGQLYFALPL  L+  L AEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG----EKCGSRRTAISPALFSDEEFGKV-----QRSVAKKGSG--RRRKFTAKLSAIPE
        M++GG    +KC  RR  +SP +FS      V      R+  ++G G   RRK+ AKLS I E
Subjt:  MKAGG----EKCGSRRTAISPALFSDEEFGKV-----QRSVAKKGSG--RRRKFTAKLSAIPE

AT3G50800.1 unknown protein2.3e-3351.23Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MG C S +S    TAKLIL DGTL E+S PVKV  +LQK+P SF+CNSD+MDFDD V A+   ++L+ G+LYF LPL  LN PL A+EMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGGEKCGSRRTAISPALFSDEEFGKVQ-RSVAKKGSG---------RRRKFTAKLSAIPE
         K+GG              ++DE+ G+ + R V + G G          RRKFTA+LS+I E
Subjt:  MKAGGEKCGSRRTAISPALFSDEEFGKVQ-RSVAKKGSG---------RRRKFTAKLSAIPE

AT4G37240.1 unknown protein5.1e-3660.84Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGIC SS+ST VATAKLIL DG ++E++ PVKV YVL K P  FICNSD+MDFDD V+AI   +ELQLGQ+YFALPL  L QPL AEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGGEKCGSRRTAISPALFSDEEFGKVQR--SVAKKGSGRRR
        M+ GG  C  RR  + P + SD+   +V         GSGRR+
Subjt:  MKAGGEKCGSRRTAISPALFSDEEFGKVQR--SVAKKGSGRRR

AT5G66580.1 unknown protein8.9e-3353.21Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MG C S +S    +AKLIL DGTL E+S PVKV  +LQK+P SF+CNSDEMDFDD VSA+   +EL+ GQLYF LPL  LN PL AEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGGE--KCGSRRTAISPALFSDEEFGKVQ-RSVAKKGSGR-RRKFTAKLSAIPE
         K+GG     G      S   +  +    V+      +G G+ +R+FTA LS I E
Subjt:  MKAGGE--KCGSRRTAISPALFSDEEFGKVQ-RSVAKKGSGR-RRKFTAKLSAIPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTATTTGCATATCGTCGGATTCTACGAATGTTGCTACGGCGAAACTGATTCTGACCGACGGAACATTGCTGGAATACTCGTACCCAGTTAAGGTCTCTTACGTGTT
GCAGAAAGATCCGGCGAGTTTTATCTGCAACTCCGACGAGATGGATTTCGACGACGTCGTTTCCGCCATTGACGACGGCGACGAGCTCCAACTCGGCCAGCTCTACTTCG
CCTTGCCGCTGGACAGGCTCAACCAGCCGCTGCACGCCGAGGAGATGGCCGCATTGGCCGTCAAGGCCAGCGCCGCCCTCATGAAGGCCGGCGGCGAAAAATGTGGGTCG
CGCCGGACTGCGATCTCGCCGGCGCTGTTTTCCGACGAGGAGTTCGGGAAAGTTCAGCGAAGTGTTGCGAAGAAAGGGAGTGGTAGAAGAAGGAAATTTACGGCGAAGCT
GAGTGCAATTCCGGAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTATTTGCATATCGTCGGATTCTACGAATGTTGCTACGGCGAAACTGATTCTGACCGACGGAACATTGCTGGAATACTCGTACCCAGTTAAGGTCTCTTACGTGTT
GCAGAAAGATCCGGCGAGTTTTATCTGCAACTCCGACGAGATGGATTTCGACGACGTCGTTTCCGCCATTGACGACGGCGACGAGCTCCAACTCGGCCAGCTCTACTTCG
CCTTGCCGCTGGACAGGCTCAACCAGCCGCTGCACGCCGAGGAGATGGCCGCATTGGCCGTCAAGGCCAGCGCCGCCCTCATGAAGGCCGGCGGCGAAAAATGTGGGTCG
CGCCGGACTGCGATCTCGCCGGCGCTGTTTTCCGACGAGGAGTTCGGGAAAGTTCAGCGAAGTGTTGCGAAGAAAGGGAGTGGTAGAAGAAGGAAATTTACGGCGAAGCT
GAGTGCAATTCCGGAA
Protein sequenceShow/hide protein sequence
MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAALMKAGGEKCGS
RRTAISPALFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE