CuGenDBv2

MC00g0518 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC00g0518
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protein EMB-1-like
Genome location: scaffold196:635600..636362
RNA-Seq Expression: MC00g0518
Synteny: MC00g0518
Gene Ontology terms:
    GO:0009737 - response to abscisic acid (biological process)
    GO:0048700 - acquisition of desiccation tolerance in seed (biological process)
InterPro domains:
    IPR000389 - Small hydrophilic plant seed protein
    IPR022377 - Small hydrophilic plant seed protein, conserved site
    IPR038956 - Late embryogenesis abundant protein, LEA_5 subgroup


Homology
GenBank top hits (e-value, % identity, alignment):
KAG7033764.1 Em-like protein GEA1, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value 2.34e-51; identity 94.51%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ+MGRKGGLSN+G+ GGERAAEEGVEIDESKFR K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

XP_004148611.1 protein EMB-1 [Cucumis sativus]; e-value 1.41e-51; identity 93.41%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        MSSEQER +LDARARQGETV+PGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ+MGRKGGLSNTG+SGGERAAEEGVEIDESKFR K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

XP_008464209.1 PREDICTED: protein EMB-1-like [Cucumis melo]; e-value 3.67e-52; identity 95.6%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        MSSEQER ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ+MGRKGGLSNTG+SGGERAAEEGVEIDESKF TK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

XP_022144068.1 protein EMB-1-like [Momordica charantia]; e-value 1.07e-55; identity 100%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

XP_022949998.1 protein EMB-1 [Cucurbita moschata]; e-value 8.47e-53; identity 94.51%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ+MGRKGGLSN+G+ GGERAAEEGVEIDESKFR K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

TrEMBL top hits (e-value, % identity, alignment):
A0A1S3CKY4 protein EMB-1-like; e-value 1.78e-52; identity 95.6%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        MSSEQER ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ+MGRKGGLSNTG+SGGERAAEEGVEIDESKF TK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

A0A5D3E041 Protein EMB-1-like; e-value 1.78e-52; identity 95.6%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        MSSEQER ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ+MGRKGGLSNTG+SGGERAAEEGVEIDESKF TK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

A0A6J1CSC5 protein EMB-1-like; e-value 5.17e-56; identity 100%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

A0A6J1GDN9 protein EMB-1; e-value 4.10e-53; identity 94.51%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ+MGRKGGLSN+G+ GGERAAEEGVEIDESKFR K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

A0A6J1ISS4 protein EMB-1; e-value 4.10e-53; identity 94.51%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ+MGRKGGLSN+G+ GGERAAEEGVEIDESKFR K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

SwissProt top hits (e-value, % identity, alignment):
P04568 Em protein; e-value 2.2e-31; identity 76.67%
Query:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        S +QER +LD +AR+GETVVPGGTGGKSLEAQE+LAEGRSRGGQTR+EQ+G EGY QMGRKGGLS    SGG+RAA EG++IDESKF+TK
Subjt:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

P17639 Protein EMB-1; e-value 7.3e-35; identity 82.42%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        M+S+QE+ ELDARARQGETVVPGGTGGKSLEAQ+HLAEGRS+GGQTRKEQLG EGY +MGRKGGLSN  +SGGERA +EG++IDESKFRTK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

P42755 Em protein H5; e-value 2.6e-32; identity 80%
Query:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        S +QER ELD  AR+GETVVPGGTGGKSLEAQEHLA+GRSRGG+TRKEQLG EGY++MGRKGGLS    SGGERAA EG+EIDESKF+TK
Subjt:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

Q02973 Em-like protein GEA6; e-value 3.4e-32; identity 80.22%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        M+S+QE+ +LD RA++GETVVPGGTGGKS EAQ+HLAEGRSRGGQTRKEQLG EGYQQMGRKGGLS     GGE A EEGVEIDESKFRTK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

Q05190 Late embryogenesis abundant protein B19.1A; e-value 3.7e-31; identity 75.56%
Query:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        S +QER +LD +AR+GETVVPGGTGGKSLEAQ++LAEGRSRGGQTR+EQ+G EGY +MGRKGGLS+   SGGERAA EG++IDESKF+TK
Subjt:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

Arabidopsis top hits (e-value, % identity, alignment):
AT2G40170.1 Stress induced protein; e-value 2.4e-33; identity 80.22%
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
        M+S+QE+ +LD RA++GETVVPGGTGGKS EAQ+HLAEGRSRGGQTRKEQLG EGYQQMGRKGGLS     GGE A EEGVEIDESKFRTK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK

AT3G51810.1 Stress induced protein; e-value 4.5e-24; identity 47.33%
Query:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ--------------------------------------------
        S +  R ELD +A+QGETVVPGGTGG SLEAQEHLAEGRS+GGQTRKEQLGHEGYQ                                            
Subjt:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ--------------------------------------------

Query:  ----------------QMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
                        +MGRKGGLS    SGGERA EEG+EIDESKF  K
Subjt:  ----------------QMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
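
The % identity values listed for each hit can be checked against the middle (match) line of the corresponding alignment. The sketch below is illustrative only: it assumes the usual BLAST-style convention in which a letter on the match line marks an identical residue, `+` a conservative substitution, and a space a mismatch, and it assumes an ungapped alignment so that the match-line length equals the alignment length. The function name `percent_identity` is hypothetical, and whether CuGenDBv2 computes the column exactly this way is not stated in the record.

```python
def percent_identity(midline: str) -> float:
    """Percent identity of an ungapped BLAST-style alignment.

    Assumption: on the match line a letter means an identical residue,
    while '+' (conservative substitution) and ' ' (mismatch) do not count.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Match line of the KAG7033764.1 alignment, copied from the record above
# (internal spaces are significant: they mark mismatched positions).
MIDLINE = (
    "MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ+MGRKGGLSN+G+ "
    "GGERAAEEGVEIDESKFR K"
)

if __name__ == "__main__":
    print(len(MIDLINE), round(percent_identity(MIDLINE), 2))
```

For this hit, 86 of the 91 aligned positions are identical, which rounds to the 94.51 shown in the table. Gapped alignments such as AT3G51810.1 would need the gap columns counted in the alignment length, which this sketch does not attempt.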


Sequences
CDS sequence
ATGTCGTCGGAGCAGGAGAGGTGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTCGTCCCTGGTGGAACTGGGGGCAAGAGTCTCGAAGCTCAGGAACACCTTGC
TGAAGGGCGGAGCCGTGGGGGGCAGACAAGGAAGGAGCAGCTGGGCCACGAAGGGTACCAACAGATGGGCCGCAAAGGAGGGCTAAGCAACACGGGTCTCTCGGGCGGAG
AGCGCGCTGCCGAGGAAGGCGTTGAGATTGATGAGTCCAAATTCAGGACCAAA
mRNA sequence
ATGTCGTCGGAGCAGGAGAGGTGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTCGTCCCTGGTGGAACTGGGGGCAAGAGTCTCGAAGCTCAGGAACACCTTGC
TGAAGGGCGGAGCCGTGGGGGGCAGACAAGGAAGGAGCAGCTGGGCCACGAAGGGTACCAACAGATGGGCCGCAAAGGAGGGCTAAGCAACACGGGTCTCTCGGGCGGAG
AGCGCGCTGCCGAGGAAGGCGTTGAGATTGATGAGTCCAAATTCAGGACCAAA
Protein sequence
MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK
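
As a consistency check between the CDS and protein sequences listed above, the CDS can be translated codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration and are not part of CuGenDBv2.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumption:
# NCBI translation table 1 applies to this nuclear gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS and protein sequences copied from the record above.
CDS = (
    "ATGTCGTCGGAGCAGGAGAGGTGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTCGTCCCTGGTGGAACTGGGGGCAAGAGTCTCGAAGCTCAGGAACACCTTGC"
    "TGAAGGGCGGAGCCGTGGGGGGCAGACAAGGAAGGAGCAGCTGGGCCACGAAGGGTACCAACAGATGGGCCGCAAAGGAGGGCTAAGCAACACGGGTCTCTCGGGCGGAG"
    "AGCGCGCTGCCGAGGAAGGCGTTGAGATTGATGAGTCCAAATTCAGGACCAAA"
)
PROTEIN = (
    "MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQQMGRKGGLSNTGLSGGERAAEEGVEIDESKFRTK"
)


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, ignoring any trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(len(CDS), len(PROTEIN), translate(CDS) == PROTEIN)
```

The listed CDS is 273 nucleotides long and carries no terminal stop codon, so it translates to exactly the 91 residues of the protein sequence.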