; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020377 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020377
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionHTH-type transcriptional regulator protein ptxE
Genome locationscaffold211:48685..49140
RNA-Seq ExpressionMS020377
SyntenyMS020377
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149511.1 uncharacterized protein LOC111017924 [Momordica charantia]8.8e-7599.34Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGGEKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        MKAGGEKCGSRRTAISPA+FSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
Subjt:  MKAGGEKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

XP_022971678.1 uncharacterized protein LOC111470350 [Cucurbita maxima]1.0e-5477.42Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NVATAKLILTDGTLLE+SYPVKVS++L K PA+FICNSD+MDFDD V A+ D D LQLG LYFALPLDRLNQPLH EEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        +KAG    EK GSRRTA+SP  FSDEEF K  R   +KGSGR RKF AKLSAIPE
Subjt:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

XP_023007426.1 uncharacterized protein LOC111499928 [Cucurbita maxima]1.1e-5377.07Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVLQKDPASFICNSD+MDF+DVV+A+DD DELQLGQLYFALPL++LN+PLHAE+MAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKG--SGRRRKFTAKLSAIPE
        MKAGG   EKCGSRR    P +FS+EE  K  R   KKG  +G  RKFTAKL AIPE
Subjt:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKG--SGRRRKFTAKLSAIPE

XP_023512544.1 uncharacterized protein LOC111777254 [Cucurbita pepo subsp. pepo]6.6e-5476.77Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NVATAKLILTDGTLLE+SYPVKVS++L K PA+FICNSD+MDFDD V A+ D D LQLG LYFALPLDRLNQPLH EEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        +KAG    EK GSRRTA+SP  FSDEEF K  R   +KGSG  RKF AKLSAIPE
Subjt:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

XP_038902080.1 uncharacterized protein LOC120088720 [Benincasa hispida]1.5e-6184.52Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGIC+SSD+ NVATAKLILTDGTL+E+SYPVKVSY+LQK PASFICNSDEMDFDDVV A+DD DELQLGQLYFALPLDRLNQPL AEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        MKAGG   EKCGSRRTAISP  FSDEEF K  R   KKGSGR RKFTAKLSAIPE
Subjt:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

TrEMBL top hitse value%identityAlignment
A0A6J1D8L2 uncharacterized protein LOC1110179244.3e-7599.34Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGGEKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        MKAGGEKCGSRRTAISPA+FSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
Subjt:  MKAGGEKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

A0A6J1EJ90 uncharacterized protein LOC1114349662.1e-5376.13Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NVATAKLILTDGTLLE+SYPVKVS++L K PA+FICNSD+MDFDD V A+ D D LQLG LYFALPLDRLNQPLH EEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        +KAG    EK GSRRTA+SP  FSDEEF K      +KGSG  RKF AKLSAIPE
Subjt:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

A0A6J1G7I2 uncharacterized protein LOC1114514542.1e-5376.43Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVL KDPASFICNSD+MDF+DVV+A+DD DELQLGQLYFALPL++LN+PLHAE+MAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKK--GSGRRRKFTAKLSAIPE
        MKAGG   EKCGSRR  +    FS+EE  K  R   KK  GSG  RKFTAKL AIPE
Subjt:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKK--GSGRRRKFTAKLSAIPE

A0A6J1I7K7 uncharacterized protein LOC1114703504.9e-5577.42Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NVATAKLILTDGTLLE+SYPVKVS++L K PA+FICNSD+MDFDD V A+ D D LQLG LYFALPLDRLNQPLH EEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE
        +KAG    EK GSRRTA+SP  FSDEEF K  R   +KGSGR RKF AKLSAIPE
Subjt:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE

A0A6J1L0H7 uncharacterized protein LOC1114999285.4e-5477.07Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVLQKDPASFICNSD+MDF+DVV+A+DD DELQLGQLYFALPL++LN+PLHAE+MAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKG--SGRRRKFTAKLSAIPE
        MKAGG   EKCGSRR    P +FS+EE  K  R   KKG  +G  RKFTAKL AIPE
Subjt:  MKAGG---EKCGSRRTAISPAMFSDEEFGKVQRSVAKKG--SGRRRKFTAKLSAIPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G76600.1 unknown protein8.1e-1840.3Show/hide
Query:  MGICISSDS----TNVATAKLILTDGTLLEYSYPVKVSYVLQKDPAS----------FICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHA
        MG+C+S +     ++  TAK++  +G L EY  PV  S VL+ +  S          F+CNSD + +DD + AI+  + LQ  Q+YF LP+ +    L A
Subjt:  MGICISSDS----TNVATAKLILTDGTLLEYSYPVKVSYVLQKDPAS----------FICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHA

Query:  EEMAALAVKASAALMKAGGEKCGSRRTA-ISPAM
         +MAALAVKAS A+ KA G+K   RR+  ISP +
Subjt:  EEMAALAVKASAALMKAGGEKCGSRRTA-ISPAM

AT2G23690.1 unknown protein2.0e-4058.28Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGIC S +ST VATAKLIL DG ++E++ PVKV YVLQK+P  FICNSD+MDFD+VVSAI   +E QLGQLYFALPL  L+  L AEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGG----EKCGSRRTAISPAMFSDEEFGKV-----QRSVAKKGSG--RRRKFTAKLSAIPE
        M++GG    +KC  RR  +SP +FS      V      R+  ++G G   RRK+ AKLS I E
Subjt:  MKAGG----EKCGSRRTAISPAMFSDEEFGKV-----QRSVAKKGSG--RRRKFTAKLSAIPE

AT3G50800.1 unknown protein2.3e-3351.23Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MG C S +S    TAKLIL DGTL E+S PVKV  +LQK+P SF+CNSD+MDFDD V A+   ++L+ G+LYF LPL  LN PL A+EMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGGEKCGSRRTAISPAMFSDEEFGKVQ-RSVAKKGSG---------RRRKFTAKLSAIPE
         K+GG              ++DE+ G+ + R V + G G          RRKFTA+LS+I E
Subjt:  MKAGGEKCGSRRTAISPAMFSDEEFGKVQ-RSVAKKGSG---------RRRKFTAKLSAIPE

AT4G37240.1 unknown protein8.6e-3660.84Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MGIC SS+ST VATAKLIL DG ++E++ PVKV YVL K P  FICNSD+MDFDD V+AI   +ELQLGQ+YFALPL  L QPL AEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGGEKCGSRRTAISPAMFSDEEFGKVQR--SVAKKGSGRRR
        M+ GG  C  RR  + P + SD+   +V         GSGRR+
Subjt:  MKAGGEKCGSRRTAISPAMFSDEEFGKVQR--SVAKKGSGRRR

AT5G66580.1 unknown protein1.2e-3253.21Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL
        MG C S +S    +AKLIL DGTL E+S PVKV  +LQK+P SF+CNSDEMDFDD VSA+   +EL+ GQLYF LPL  LN PL AEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAAL

Query:  MKAGGE--KCGSRRTAISPAMFSDEEFGKVQ-RSVAKKGSGR-RRKFTAKLSAIPE
         K+GG     G      S   +  +    V+      +G G+ +R+FTA LS I E
Subjt:  MKAGGE--KCGSRRTAISPAMFSDEEFGKVQ-RSVAKKGSGR-RRKFTAKLSAIPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTATTTGCATATCGTCGGATTCTACGAATGTTGCTACGGCGAAACTGATTCTGACAGACGGAACATTGCTGGAATACTCGTACCCAGTTAAGGTCTCTTACGTGTT
GCAGAAAGATCCGGCGAGTTTTATCTGCAACTCCGACGAGATGGATTTCGACGACGTCGTTTCCGCCATTGACGACGGCGACGAGCTCCAACTCGGCCAGCTCTACTTCG
CCTTGCCGCTGGACAGGCTCAACCAGCCGCTGCACGCCGAGGAGATGGCCGCATTGGCCGTCAAGGCCAGCGCCGCCCTCATGAAGGCCGGCGGCGAAAAATGTGGGTCG
CGCCGGACTGCGATCTCGCCGGCGATGTTTTCCGACGAGGAGTTCGGGAAAGTTCAGCGAAGTGTTGCGAAGAAAGGGAGTGGTAGAAGAAGGAAATTTACGGCGAAGCT
GAGTGCAATTCCGGAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTATTTGCATATCGTCGGATTCTACGAATGTTGCTACGGCGAAACTGATTCTGACAGACGGAACATTGCTGGAATACTCGTACCCAGTTAAGGTCTCTTACGTGTT
GCAGAAAGATCCGGCGAGTTTTATCTGCAACTCCGACGAGATGGATTTCGACGACGTCGTTTCCGCCATTGACGACGGCGACGAGCTCCAACTCGGCCAGCTCTACTTCG
CCTTGCCGCTGGACAGGCTCAACCAGCCGCTGCACGCCGAGGAGATGGCCGCATTGGCCGTCAAGGCCAGCGCCGCCCTCATGAAGGCCGGCGGCGAAAAATGTGGGTCG
CGCCGGACTGCGATCTCGCCGGCGATGTTTTCCGACGAGGAGTTCGGGAAAGTTCAGCGAAGTGTTGCGAAGAAAGGGAGTGGTAGAAGAAGGAAATTTACGGCGAAGCT
GAGTGCAATTCCGGAA
Protein sequenceShow/hide protein sequence
MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVSAIDDGDELQLGQLYFALPLDRLNQPLHAEEMAALAVKASAALMKAGGEKCGS
RRTAISPAMFSDEEFGKVQRSVAKKGSGRRRKFTAKLSAIPE