; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000655 (gene) of Snake gourd v1 genome

Gene IDTan0000655
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein EMB-1
Genome locationLG03:63791103..63792004
RNA-Seq ExpressionTan0000655
SyntenyTan0000655
Gene Ontology termsGO:0009737 - response to abscisic acid (biological process)
GO:0048700 - acquisition of desiccation tolerance in seed (biological process)
InterPro domainsIPR000389 - Small hydrophilic plant seed protein
IPR022377 - Small hydrophilic plant seed protein, conserved site
IPR038956 - Late embryogenesis abundant protein, LEA_5 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033764.1 Em-like protein GEA1, partial [Cucurbita argyrosperma subsp. argyrosperma]3.2e-40100Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

XP_004148611.1 protein EMB-1 [Cucumis sativus]2.8e-3693.41Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        MSSEQER +LDARARQGETV+PGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSN+GM GGERAAEEGVEIDESKFR K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

XP_008464209.1 PREDICTED: protein EMB-1-like [Cucumis melo]2.2e-3694.51Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        MSSEQER ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSN+GM GGERAAEEGVEIDESKF  K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

XP_022144068.1 protein EMB-1-like [Momordica charantia]8.8e-3894.51Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ+MGRKGGLSN+G+ GGERAAEEGVEIDESKFR K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

XP_022949998.1 protein EMB-1 [Cucurbita moschata]3.2e-40100Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

TrEMBL top hitse value%identityAlignment
A0A1S3CKY4 protein EMB-1-like1.1e-3694.51Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        MSSEQER ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSN+GM GGERAAEEGVEIDESKF  K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

A0A5D3E041 Protein EMB-1-like1.1e-3694.51Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        MSSEQER ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSN+GM GGERAAEEGVEIDESKF  K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

A0A6J1CSC5 protein EMB-1-like4.3e-3894.51Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ+MGRKGGLSN+G+ GGERAAEEGVEIDESKFR K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

A0A6J1GDN9 protein EMB-11.6e-40100Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

A0A6J1ISS4 protein EMB-11.6e-40100Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

SwissProt top hitse value%identityAlignment
P17639 Protein EMB-12.8e-3482.42Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        M+S+QE+ ELDARARQGETVVPGGTGGKSLEAQ+HLAEGRS+GGQTRKEQLG EGY EMGRKGGLSN+ M GGERA +EG++IDESKFR K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

P42755 Em protein H52.2e-3178.89Show/hide
Query:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        S +QER ELD  AR+GETVVPGGTGGKSLEAQEHLA+GRSRGG+TRKEQLG EGY+EMGRKGGLS     GGERAA EG+EIDESKF+ K
Subjt:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

Q02973 Em-like protein GEA62.6e-3279.12Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        M+S+QE+ +LD RA++GETVVPGGTGGKS EAQ+HLAEGRSRGGQTRKEQLG EGYQ+MGRKGGLS    PGGE A EEGVEIDESKFR K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

Q05190 Late embryogenesis abundant protein B19.1A2.4e-3074.44Show/hide
Query:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        S +QER +LD +AR+GETVVPGGTGGKSLEAQ++LAEGRSRGGQTR+EQ+G EGY EMGRKGGLS++   GGERAA EG++IDESKF+ K
Subjt:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

Q40864 Em-like protein7.5e-3279.78Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFR
        M  +Q+R ELDA+AR+GETVVPGGTGGKSL+AQE LAEGRSRGGQTRKEQ+G EGYQEMGRKGGLS++G PGGERA+EEG  IDESK+R
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFR

Arabidopsis top hitse value%identityAlignment
AT2G40170.1 Stress induced protein1.8e-3379.12Show/hide
Query:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
        M+S+QE+ +LD RA++GETVVPGGTGGKS EAQ+HLAEGRSRGGQTRKEQLG EGYQ+MGRKGGLS    PGGE A EEGVEIDESKFR K
Subjt:  MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK

AT3G51810.1 Stress induced protein1.7e-2347.33Show/hide
Query:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ--------------------------------------------
        S +  R ELD +A+QGETVVPGGTGG SLEAQEHLAEGRS+GGQTRKEQLGHEGYQ                                            
Subjt:  SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ--------------------------------------------

Query:  ----------------EMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK
                        EMGRKGGLS     GGERA EEG+EIDESKF  K
Subjt:  ----------------EMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCGGAGCAGGAAAGATGTGAACTCGACGCCAGGGCCAGGCAAGGGGAAACGGTCGTCCCCGGTGGAACTGGAGGCAAGAGTCTCGAAGCTCAGGAGCACCTCGC
TGAAGGGCGGAGCCGAGGGGGCCAGACAAGGAAGGAGCAGCTGGGACACGAAGGGTACCAAGAGATGGGCCGCAAAGGAGGGTTGAGCAACTCGGGTATGCCAGGAGGAG
AGCGTGCTGCTGAGGAAGGGGTTGAAATTGATGAGTCCAAGTTCAGGGCTAAGTAA
mRNA sequenceShow/hide mRNA sequence
GATTGTCAAAGCTAAGTTTTTCAAAGAGTCAAAGAGAGAGATGTCGTCGGAGCAGGAAAGATGTGAACTCGACGCCAGGGCCAGGCAAGGGGAAACGGTCGTCCCCGGTG
GAACTGGAGGCAAGAGTCTCGAAGCTCAGGAGCACCTCGCTGAAGGGCGGAGCCGAGGGGGCCAGACAAGGAAGGAGCAGCTGGGACACGAAGGGTACCAAGAGATGGGC
CGCAAAGGAGGGTTGAGCAACTCGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGATGAGTCCAAGTTCAGGGCTAAGTAAAAGCTTTCACAATA
TGACCGAGTTTCAAGTTTCGAATTCCAATTTCAGCTTTAGTTTCTTTTAGTGTCTGCTTTCTCTAAAAGCATCTAGAATAAGTGTGGTCTTGTCGGGTGTGTATATGGAT
GTTTCTGCATTGAGTGAGAAATTAACTTTTGGTCTTTCTTTTAAATTTCTGCCTGTGAAGGATAAAGTGAGCACCACCTAAAATCCTACAGGTCTAAACAACAAGACATT
CATAAAAT
Protein sequenceShow/hide protein sequence
MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK