CuGenDBv2

MS021114 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS021114
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Imidazole glycerol phosphate synthase hisHF, chloroplastic
Genome location: scaffold290:1371307..1371954
RNA-Seq Expression: MS021114
Synteny: MS021114
Gene Ontology terms: NA
InterPro domains: IPR008004 - Protein OCTOPUS-like


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004148742.1 uncharacterized protein LOC101206077 [Cucumis sativus] (e-value: 1.1e-67, identity: 68.91%)
Query:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA----TASSSSSSSSSSFSR------GSVGRVSSLIDSEPAFRRSRSVAA
        MAFYGD+DD WKCPKHPSKRRR GICPLCLRDRLV+LCPDCA VRPC C A    T +SSSSSSSSSFSR      GSVGR+S+LID EPAFRRSRS+ A
Subjt:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA----TASSSSSSSSSSFSR------GSVGRVSSLIDSEPAFRRSRSVAA

Query:  AVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKFV
        A+PFL  RSRFV D      GDDCSS SG SAR + SFWS+F S KSKK  DGG +EAA            A VEEAMRR +MIRSRSV AV+DS G+ V
Subjt:  AVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKFV

Query:  RPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG
        R PP K + WYFPSPIK FRQ K+PKPVLTERSPLHRG
Subjt:  RPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG

XP_022145394.1 uncharacterized protein LOC111014850 [Momordica charantia] (e-value: 9.5e-115, identity: 99.07%)
Query:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSATASSSSSSSSSSFSRGSVGRVSSLIDSEPAFRRSRSVAAAVPFLRPRSR
        MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSATASSSSSSSSSSFSRGSVGRVSSLIDSEPAFRRSRSVAAAVPFLRPRSR
Subjt:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSATASSSSSSSSSSFSRGSVGRVSSLIDSEPAFRRSRSVAAAVPFLRPRSR

Query:  FVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPAKGRGWYFPSPIKVFRQP
        FVVDV GDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGG IEAAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPAKGRGWYFPSPIKVFRQP
Subjt:  FVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPAKGRGWYFPSPIKVFRQP

Query:  KIPKPVLTERSPLHRG
        KIPKPVLTERSPLHRG
Subjt:  KIPKPVLTERSPLHRG

XP_022923201.1 uncharacterized protein LOC111430968 [Cucurbita moschata] (e-value: 1.3e-71, identity: 70.71%)
Query:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA--TASSSSSSSSSSFSR---------GSVGRVSSLIDSEPAFRRSRSVA
        MAFYGD+DD WKCPKHPSKRRR GICPLCLRDRLV+LCPDCA VRPC C A  T +SSSSSSSSSFSR         GSVGR+S+LID EPAFRRSRS+ 
Subjt:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA--TASSSSSSSSSSFSR---------GSVGRVSSLIDSEPAFRRSRSVA

Query:  AAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKF
        AA+PFLR RSRFV D      GDDCSS SG SAR A SFWS+F S K+KKS+DGG IEAA            A VEEAMRRT+MIRSRSV AVSDSSG+F
Subjt:  AAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKF

Query:  VRPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG
        VR  P K + WYFPSPIK FRQ K+PK VLTERSPLHRG
Subjt:  VRPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG

XP_022965250.1 uncharacterized protein LOC111465171 [Cucurbita maxima] (e-value: 6.5e-71, identity: 70.29%)
Query:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA--TASSSSSSSSSSFSR---------GSVGRVSSLIDSEPAFRRSRSVA
        MAFYGD+DD WKCPKHPSKRRR GICPLCLRDRL+SLCPDCA VRPC C A  T +SSSSSSSSSFSR         GSVGR+S+LID EPAFRRSRS+ 
Subjt:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA--TASSSSSSSSSSFSR---------GSVGRVSSLIDSEPAFRRSRSVA

Query:  AAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKF
        AA+PFLR RSRFV D      GDDCSS SG SAR A SFWS+F S K+KKS+DGG IEAA            A VEEAMRRT+MIRSRSV AVSDSSG+F
Subjt:  AAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKF

Query:  VRPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG
        VR  P K + WYFPSPIK FRQ K+ K VLTERSPLHRG
Subjt:  VRPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG

XP_023552753.1 uncharacterized protein LOC111810294 [Cucurbita pepo subsp. pepo] (e-value: 7.6e-72, identity: 71.13%)
Query:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA--TASSSSSSSSSSFSR---------GSVGRVSSLIDSEPAFRRSRSVA
        MAFYGD+DD WKCPKHPSKRRR GICPLCLRDRLV+LCPDCA VRPC C A  T +SSSSSSSSSFSR         GSVGR+S+LID EPAFRRSRS+ 
Subjt:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA--TASSSSSSSSSSFSR---------GSVGRVSSLIDSEPAFRRSRSVA

Query:  AAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKF
        AA+PFLR RSRFV D      GDDCSS SG SAR A SFWS+F S K+KKS+DGG IEAA            A VEEAMRRT+MIRSRSV AVSDSSG+F
Subjt:  AAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKF

Query:  VRPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG
        VR  PAK + WYFPSPIK FRQ K+PK VLTERSPLHRG
Subjt:  VRPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L6G5 Uncharacterized protein (e-value: 5.5e-68, identity: 68.91%)
Query:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA----TASSSSSSSSSSFSR------GSVGRVSSLIDSEPAFRRSRSVAA
        MAFYGD+DD WKCPKHPSKRRR GICPLCLRDRLV+LCPDCA VRPC C A    T +SSSSSSSSSFSR      GSVGR+S+LID EPAFRRSRS+ A
Subjt:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA----TASSSSSSSSSSFSR------GSVGRVSSLIDSEPAFRRSRSVAA

Query:  AVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKFV
        A+PFL  RSRFV D      GDDCSS SG SAR + SFWS+F S KSKK  DGG +EAA            A VEEAMRR +MIRSRSV AV+DS G+ V
Subjt:  AVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKFV

Query:  RPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG
        R PP K + WYFPSPIK FRQ K+PKPVLTERSPLHRG
Subjt:  RPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG

A0A5A7UH78 Uncharacterized protein (e-value: 2.3e-66, identity: 68.97%)
Query:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA----TASSSSSSSSSSFSR------GSVGRVSSLIDSEPAFRRSRSVAA
        MAFYGD+DD WKCPKHPSKRRR GICPLCLRDRLV+LCPDCA VRPC C A    T +SSSSSSSSSFSR      GSVGR+S+LID EPAFRRSRS+ A
Subjt:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA----TASSSSSSSSSSFSR------GSVGRVSSLIDSEPAFRRSRSVAA

Query:  AVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIE------AAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPAK
        A+PFL  RSRFV D       DDCSS+   SAR + SFWS+F S KSKK  DGG I+        A VEEAMRR +MIRSRSV AV+DS G+ VR PP K
Subjt:  AVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIE------AAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPAK

Query:  GRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG
         R WYFPSPIK FRQ KI KPVLTERSPLHRG
Subjt:  GRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG

A0A6J1CV41 uncharacterized protein LOC111014850 (e-value: 4.6e-115, identity: 99.07%)
Query:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSATASSSSSSSSSSFSRGSVGRVSSLIDSEPAFRRSRSVAAAVPFLRPRSR
        MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSATASSSSSSSSSSFSRGSVGRVSSLIDSEPAFRRSRSVAAAVPFLRPRSR
Subjt:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSATASSSSSSSSSSFSRGSVGRVSSLIDSEPAFRRSRSVAAAVPFLRPRSR

Query:  FVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPAKGRGWYFPSPIKVFRQP
        FVVDV GDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGG IEAAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPAKGRGWYFPSPIKVFRQP
Subjt:  FVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPAKGRGWYFPSPIKVFRQP

Query:  KIPKPVLTERSPLHRG
        KIPKPVLTERSPLHRG
Subjt:  KIPKPVLTERSPLHRG

A0A6J1E664 uncharacterized protein LOC111430968 (e-value: 6.3e-72, identity: 70.71%)
Query:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA--TASSSSSSSSSSFSR---------GSVGRVSSLIDSEPAFRRSRSVA
        MAFYGD+DD WKCPKHPSKRRR GICPLCLRDRLV+LCPDCA VRPC C A  T +SSSSSSSSSFSR         GSVGR+S+LID EPAFRRSRS+ 
Subjt:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA--TASSSSSSSSSSFSR---------GSVGRVSSLIDSEPAFRRSRSVA

Query:  AAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKF
        AA+PFLR RSRFV D      GDDCSS SG SAR A SFWS+F S K+KKS+DGG IEAA            A VEEAMRRT+MIRSRSV AVSDSSG+F
Subjt:  AAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKF

Query:  VRPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG
        VR  P K + WYFPSPIK FRQ K+PK VLTERSPLHRG
Subjt:  VRPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG

A0A6J1HN71 uncharacterized protein LOC111465171 (e-value: 3.1e-71, identity: 70.29%)
Query:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA--TASSSSSSSSSSFSR---------GSVGRVSSLIDSEPAFRRSRSVA
        MAFYGD+DD WKCPKHPSKRRR GICPLCLRDRL+SLCPDCA VRPC C A  T +SSSSSSSSSFSR         GSVGR+S+LID EPAFRRSRS+ 
Subjt:  MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSA--TASSSSSSSSSSFSR---------GSVGRVSSLIDSEPAFRRSRSVA

Query:  AAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKF
        AA+PFLR RSRFV D      GDDCSS SG SAR A SFWS+F S K+KKS+DGG IEAA            A VEEAMRRT+MIRSRSV AVSDSSG+F
Subjt:  AAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAA------------AGVEEAMRRTMMIRSRSVAAVSDSSGKF

Query:  VRPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG
        VR  P K + WYFPSPIK FRQ K+ K VLTERSPLHRG
Subjt:  VRPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G32690.1 unknown protein (e-value: 2.5e-28, identity: 42.24%)
Query:  DDDDHWKCPKHPSKR---RRTGICPLCLRDRLVSLCPDCATVRPCACSATA----SSSSSSSSSSFS------------RGSVGRVSSLIDSEPAFRRSR
        D+D   +C KH SKR      G+CP CL +RL SLCPDCA   PC+CS+ A    S SSSSSSSSFS             GSVGRV+SLI+ EPAFRRS+
Subjt:  DDDDHWKCPKHPSKR---RRTGICPLCLRDRLVSLCPDCATVRPCACSATA----SSSSSSSSSSFS------------RGSVGRVSSLIDSEPAFRRSR

Query:  SVAAAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPA--K
        S+  A+P ++P S  V+D          S      ++   SFW +F   +                    +  +M +SRSV AV+   G    P PA  K
Subjt:  SVAAAVPFLRPRSRFVVDVDGDGDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPA--K

Query:  GRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG
        G+GW FPSPIKVFRQ ++ K +  +RSPL+RG
Subjt:  GRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG

AT2G35200.1 unknown protein (e-value: 7.0e-23, identity: 38.94%)
Query:  KCPKHPSK-RRRTGICPLCLRDRLVSLCPDCATVRPCACSATAS-SSSSSSSSSFSR-GSVGRVSSLIDSEPAFRRSRSVAAAVPFLRPRSRFVVDVDGD
        KC +H  +     G+CP CL +RL SLCPDCA   PC+C+  AS SS      SF+R GSVGRV++LI+ EPAFRRS S+     F   +   +++ + D
Subjt:  KCPKHPSK-RRRTGICPLCLRDRLVSLCPDCATVRPCACSATAS-SSSSSSSSSFSR-GSVGRVSSLIDSEPAFRRSRSVAAAVPFLRPRSRFVVDVDGD

Query:  GDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPAKGRGWYFPSPIKVFRQPKIPKPVLT
                      RG    W +F   + ++++    I+AA          +M +S SVA     S     P  +KG GWYFPSPIKVFRQ ++ K +  
Subjt:  GDGDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPAKGRGWYFPSPIKVFRQPKIPKPVLT

Query:  ERSPLHRG
        +RSPL+RG
Subjt:  ERSPLHRG
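
The % identity values reported for each hit can be related to the alignment rows: in BLAST-style output, the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. A minimal sketch of that calculation for a single aligned block follows. The function name `percent_identity` and the short example strings are hypothetical and not taken from this record; for an alignment split over several blocks, the identical counts and lengths would need to be summed across blocks before dividing.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent identity for one aligned block, BLAST-style.

    query_row: the aligned query residues (gaps as '-'), without the 'Query:' prefix.
    midline:   the matching middle line, without its leading indentation.
    """
    length = len(query_row)
    if length == 0:
        raise ValueError("empty alignment row")
    # Trailing blanks of the midline are often stripped; pad it back to full width.
    midline = midline.ljust(length)[:length]
    # Letters mark identical positions; '+' and ' ' are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / length


# Hypothetical toy block: 3 identical positions out of 5 aligned columns.
print(percent_identity("MAFYG", "MA+ G"))
```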


Sequences
CDS sequence
ATGGCCTTCTATGGCGATGACGACGACCACTGGAAATGTCCAAAACACCCTTCCAAGCGGCGGAGAACCGGAATCTGCCCTCTCTGCCTCCGCGACCGCCTCGTCTCTCT
CTGCCCTGACTGCGCCACCGTCCGCCCCTGCGCCTGCTCCGCCACCGCCTCTTCCTCTTCCTCTTCTTCCTCCTCGTCTTTCTCCCGCGGATCCGTCGGGCGCGTCTCCA
GTTTGATCGACAGCGAGCCGGCGTTCCGCCGCTCGCGCTCCGTCGCCGCGGCGGTTCCGTTTCTGCGACCGAGATCGAGATTCGTCGTCGATGTCGACGGCGACGGCGAC
GGGGACGACTGTTCCTCGGCGTCCGGTTACAGCGCCAGAGGAGCGCTGTCGTTCTGGTCGATGTTCTGGTCGCGGAAGAGCAAGAAGAGCCAGGACGGCGGAGGAATCGA
GGCGGCGGCGGGAGTCGAGGAAGCGATGCGGCGGACGATGATGATCCGGTCGAGATCGGTTGCGGCGGTTTCTGACTCCAGCGGGAAGTTCGTGCGGCCGCCGCCGGCGA
AGGGGCGGGGATGGTACTTTCCGAGTCCGATCAAAGTATTCCGGCAACCGAAGATTCCCAAGCCCGTTCTTACGGAACGGTCTCCGTTACACAGAGGT
mRNA sequence
ATGGCCTTCTATGGCGATGACGACGACCACTGGAAATGTCCAAAACACCCTTCCAAGCGGCGGAGAACCGGAATCTGCCCTCTCTGCCTCCGCGACCGCCTCGTCTCTCT
CTGCCCTGACTGCGCCACCGTCCGCCCCTGCGCCTGCTCCGCCACCGCCTCTTCCTCTTCCTCTTCTTCCTCCTCGTCTTTCTCCCGCGGATCCGTCGGGCGCGTCTCCA
GTTTGATCGACAGCGAGCCGGCGTTCCGCCGCTCGCGCTCCGTCGCCGCGGCGGTTCCGTTTCTGCGACCGAGATCGAGATTCGTCGTCGATGTCGACGGCGACGGCGAC
GGGGACGACTGTTCCTCGGCGTCCGGTTACAGCGCCAGAGGAGCGCTGTCGTTCTGGTCGATGTTCTGGTCGCGGAAGAGCAAGAAGAGCCAGGACGGCGGAGGAATCGA
GGCGGCGGCGGGAGTCGAGGAAGCGATGCGGCGGACGATGATGATCCGGTCGAGATCGGTTGCGGCGGTTTCTGACTCCAGCGGGAAGTTCGTGCGGCCGCCGCCGGCGA
AGGGGCGGGGATGGTACTTTCCGAGTCCGATCAAAGTATTCCGGCAACCGAAGATTCCCAAGCCCGTTCTTACGGAACGGTCTCCGTTACACAGAGGT
Protein sequence
MAFYGDDDDHWKCPKHPSKRRRTGICPLCLRDRLVSLCPDCATVRPCACSATASSSSSSSSSSFSRGSVGRVSSLIDSEPAFRRSRSVAAAVPFLRPRSRFVVDVDGDGD
GDDCSSASGYSARGALSFWSMFWSRKSKKSQDGGGIEAAAGVEEAMRRTMMIRSRSVAAVSDSSGKFVRPPPAKGRGWYFPSPIKVFRQPKIPKPVLTERSPLHRG
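
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS with the standard genetic code. The sketch below is one way to do that in plain Python; the helper name `translate` is hypothetical, and the example uses only the first 30 and last 15 nucleotides of the CDS shown in this record rather than the full sequence. It assumes the nuclear standard code applies and that the sequence is in frame from the first base.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Unknown codons (e.g. containing N) are reported as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


# First 30 nt of the CDS in this record; the protein begins MAFYGDDDDH.
print(translate("ATGGCCTTCTATGGCGATGACGACGACCAC"))
# Last 15 nt of the CDS in this record; the protein ends PLHRG.
print(translate("CCGTTACACAGAGGT"))
```

To check the whole record, the six CDS lines would be joined into one string and the result compared against the two joined protein lines.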