; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0681 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0681
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionLate embryogenesis abundant protein
Genome locationMC02:5498596..5499165
RNA-Seq ExpressionMC02g0681
SyntenyMC02g0681
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149721.1 uncharacterized protein LOC111018077 [Momordica charantia]5.58e-7998.39Show/hide
Query:  MVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAGRLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVAST
        MV LSVRNPNKVAFKYSD TAVIRYRGEEFGEAPIPAGRLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVAST
Subjt:  MVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAGRLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVAST

Query:  ACDFAVDIKNRSIGDQQCHYRTQL
        ACDFAVDIKNRSIGDQQCHYRTQL
Subjt:  ACDFAVDIKNRSIGDQQCHYRTQL

XP_022948127.1 uncharacterized protein LOC111451800 [Cucurbita moschata]1.34e-7866.67Show/hide
Query:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG
        K+R  C+ +LL++I LVI ++IL F VFKPK+PTI VDS+SLLDL +SLD  R  VDLNLTL+V L+V NPNKVAF++SD TAV+ YRGEE  EAPIP+G
Subjt:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG

Query:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL
        RL+ADGT+ MNL+LTM+ADRLLAK EL  DV+AGELPIST+ARL GKV VIGVFKI VVA ++CD  +DI+ R++ DQ+C YRT+L
Subjt:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL

XP_023007254.1 uncharacterized protein LOC111499794 [Cucurbita maxima]6.02e-8268.28Show/hide
Query:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG
        K+R  C+ +LL++I LVIL++IL F VFKPK+PTI VDSVSLLDL +SL+  R  VDLNLTL+V L+V NPNKVAF++SD TAV+ YRGEE  EAPIP+G
Subjt:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG

Query:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL
        RL+ DGT+ MNL+LTM+ADRLLAK ELF DV+AGELPIST+ARL+GK+TVIGVFKI VVA ++CD  +DI+NRS+ DQ+C YRT+L
Subjt:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL

XP_023512272.1 uncharacterized protein LOC111777064 [Cucurbita pepo subsp. pepo]6.74e-7869.23Show/hide
Query:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG
        K R  C+ +LL+V  LVI ++IL F  FKPKRPTIAVDSVSLLDL +SLD  RL+VDLNL+L++ LSV NPNKVAF+YS STAV+ YRGEE GEAPIPAG
Subjt:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG

Query:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHY
        RL AD T+ MNL+LTM+ADRLLAK ELF D ++GE+PI+ + RLSG V VIGVFKIHVVAS++CDF + I NRSI DQ+CHY
Subjt:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHY

XP_023534551.1 uncharacterized protein LOC111796093 [Cucurbita pepo subsp. pepo]4.68e-7965.59Show/hide
Query:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG
        K+R  C+ +LL++I LVI ++IL F VFKPK+PTI VDS+SLLDL +SL+  R  VDLNLTL+V L++ NPNKVAF++SD TAV+ YRGEE  EAPIP+G
Subjt:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG

Query:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL
        RL+ADGT+ MNL+LTM+ADR+LAK ELF DV+ GELPIST+ARL+GKVTVIGVFKI VVA ++CD  ++I+NR++ DQ+C YRT+L
Subjt:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL

TrEMBL top hitse value%identityAlignment
A0A6J1D989 uncharacterized protein LOC1110180772.70e-7998.39Show/hide
Query:  MVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAGRLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVAST
        MV LSVRNPNKVAFKYSD TAVIRYRGEEFGEAPIPAGRLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVAST
Subjt:  MVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAGRLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVAST

Query:  ACDFAVDIKNRSIGDQQCHYRTQL
        ACDFAVDIKNRSIGDQQCHYRTQL
Subjt:  ACDFAVDIKNRSIGDQQCHYRTQL

A0A6J1FYG9 uncharacterized protein LOC1114486492.53e-7566.48Show/hide
Query:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG
        K R  C+ +LL+V  L+I ++IL F  FKPKRPTIAVDSVSLLDL +SLD  RL+VDLNL+L++ LS+ NPNKVAF+YS +TAV+ YRGEE GEAPIPAG
Subjt:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG

Query:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHY
         L AD T+ MNL+LTM+ADRLLAK ELF D ++GE+PI+ + RLSG V VIGVFKIHVVAS++CD  + I NRSI DQ+CHY
Subjt:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHY

A0A6J1G8C1 uncharacterized protein LOC1114518006.47e-7966.67Show/hide
Query:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG
        K+R  C+ +LL++I LVI ++IL F VFKPK+PTI VDS+SLLDL +SLD  R  VDLNLTL+V L+V NPNKVAF++SD TAV+ YRGEE  EAPIP+G
Subjt:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG

Query:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL
        RL+ADGT+ MNL+LTM+ADRLLAK EL  DV+AGELPIST+ARL GKV VIGVFKI VVA ++CD  +DI+ R++ DQ+C YRT+L
Subjt:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL

A0A6J1JFV2 uncharacterized protein LOC1114840295.10e-7567.58Show/hide
Query:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG
        K R  C+ +LL+V  LVI ++IL F  FKPKRPTIAVDSVSLLDL +SLD  RL+VDLNL L++ LSV NPNKVAF+YS STAV+ YRGEE GE PIPAG
Subjt:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG

Query:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHY
        RL AD T+ MNL+L M+ADRLLAK ELF D ++GE+PI+ + RLSG V VIGVFKIHVVAS++CD  + I NRSI DQ+CHY
Subjt:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHY

A0A6J1L4F9 uncharacterized protein LOC1114997942.91e-8268.28Show/hide
Query:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG
        K+R  C+ +LL++I LVIL++IL F VFKPK+PTI VDSVSLLDL +SL+  R  VDLNLTL+V L+V NPNKVAF++SD TAV+ YRGEE  EAPIP+G
Subjt:  KQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAG

Query:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL
        RL+ DGT+ MNL+LTM+ADRLLAK ELF DV+AGELPIST+ARL+GK+TVIGVFKI VVA ++CD  +DI+NRS+ DQ+C YRT+L
Subjt:  RLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL

SwissProt top hitse value%identityAlignment
Q6DST1 Late embryogenesis abundant protein At1g640653.6e-0428.09Show/hide
Query:  CLGILLAVIGLVILLVILGFPVF-KPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGE-EFGEAPIPAGRLA
        CL   L +I ++  L ++   +F +  +P I   S+S  DL    + T      N TL+  +S+RN N  AF++ DST  + Y      GE  I   R+ 
Subjt:  CLGILLAVIGLVILLVILGFPVF-KPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGE-EFGEAPIPAGRLA

Query:  ADGTQGMNLSLTMIAD-RLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQC
        A  T  +   +  I   RLL   +L  D+  G L + + A + G++ V+G  K   V+  +C   +++  R I +  C
Subjt:  ADGTQGMNLSLTMIAD-RLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQC

Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.9e-0925.27Show/hide
Query:  QRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAGR
        +R+F + I L     +++       +F P  P I +  V +  + V   P      +++TL+V L V N +  +F ++D    I YRG+  G      G 
Subjt:  QRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAGR

Query:  LAADGTQGMNLSLTMIADRLLAKPE---LFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQC
        + A G+  ++    +  D ++  P+   L  D+  G +   T    +GK+ V+  F+  + A  AC   VD  N++I  Q C
Subjt:  LAADGTQGMNLSLTMIADRLLAKPE---LFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQC

AT1G64450.1 Glycine-rich protein family3.3e-0832.43Show/hide
Query:  CLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAGRLAAD
        C    + ++ L+++L+++ F VFKPK P I+V++V L   AVS +        N +    ++VRNPN+  F + DS+  + Y G + G   IPAG++ + 
Subjt:  CLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAGRLAAD

Query:  GTQGMNLSLTM
          Q M  + T+
Subjt:  GTQGMNLSLTM

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family7.0e-1932.76Show/hide
Query:  LVILLVILGFPVFKPKRPTIAVDSVSL--LDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAGRLAADGTQGMNLS
        L  +++ L F VF+ K P I ++ V +  LD     +  +L +  N++++V +SV+NPN  +FKYS++T  I Y+G   GEA    G+     T  MN++
Subjt:  LVILLVILGFPVFKPKRPTIAVDSVSL--LDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAGRLAADGTQGMNLS

Query:  LTMIADRLLAKPELFPDVV-AGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL
        + ++ DR+L+ P L  ++  +G + + +Y R+ GKV ++G+ K HV     C  AV+I  ++I D  C  +  L
Subjt:  LTMIADRLLAKPELFPDVV-AGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.8e-1527.23Show/hide
Query:  LAKQRKFCL--GILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAP
        ++K+R  C+  GI+  +  + +  +IL   VFKPK P +   S ++  ++ ++      V LN TL + + ++NPN   F+Y     ++ YR    G   
Subjt:  LAKQRKFCL--GILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAP

Query:  IPAGRLAADGTQGMNLSLTMIADRLLAK-PELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL
        +P+  L A G+  +   L +  D+ +A   ++  DV+ G++ + T A++ GK+T++G+FKI + + + C+  +   +  + DQ C  +T+L
Subjt:  IPAGRLAADGTQGMNLSLTMIADRLLAK-PELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family6.3e-4445.31Show/hide
Query:  KLAKQR--KFCLGI-LLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGE
        KL ++R  K C+   +L ++ + I++VIL F +FKPKRPT  +DSV++  L  S++P  L V LNLTL V LS++NPN++ F Y  S+A++ YRG+  GE
Subjt:  KLAKQR--KFCLGI-LLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGE

Query:  APIPAGRLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL
        AP+PA R+AA  T  +N++LT++ADRLL++ +L  DV+AG +P++T+ +++GKVTV+ +FKI V +S++CD ++ + +R++  Q C Y T+L
Subjt:  APIPAGRLAADGTQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GCTAAACTGGCGAAACAACGAAAATTTTGCTTGGGCATTTTACTCGCCGTAATTGGGCTCGTAATTTTACTCGTGATTTTAGGGTTTCCAGTTTTCAAGCCCAAACGGCC
CACAATCGCCGTCGATTCGGTTTCTCTGCTCGATCTGGCCGTCTCTCTGGACCCCACGAGGCTCGCCGTCGATCTGAATCTCACTCTGATGGTGGCTCTCTCCGTCAGGA
ACCCTAATAAAGTGGCCTTCAAATACTCCGATAGCACCGCCGTCATCAGATACAGAGGGGAAGAATTCGGAGAAGCGCCGATTCCGGCCGGCCGGTTGGCGGCCGACGGG
ACCCAGGGAATGAACCTCTCACTGACGATGATTGCGGACCGGCTGCTCGCCAAGCCGGAGCTCTTCCCCGACGTGGTCGCCGGAGAACTTCCGATCAGCACTTACGCCAG
ACTTTCCGGTAAAGTGACGGTAATCGGCGTGTTCAAGATTCATGTTGTGGCCTCGACGGCTTGTGATTTCGCCGTCGACATTAAAAACCGAAGTATTGGAGATCAGCAGT
GCCATTATCGAACTCAGCTC
mRNA sequenceShow/hide mRNA sequence
GCTAAACTGGCGAAACAACGAAAATTTTGCTTGGGCATTTTACTCGCCGTAATTGGGCTCGTAATTTTACTCGTGATTTTAGGGTTTCCAGTTTTCAAGCCCAAACGGCC
CACAATCGCCGTCGATTCGGTTTCTCTGCTCGATCTGGCCGTCTCTCTGGACCCCACGAGGCTCGCCGTCGATCTGAATCTCACTCTGATGGTGGCTCTCTCCGTCAGGA
ACCCTAATAAAGTGGCCTTCAAATACTCCGATAGCACCGCCGTCATCAGATACAGAGGGGAAGAATTCGGAGAAGCGCCGATTCCGGCCGGCCGGTTGGCGGCCGACGGG
ACCCAGGGAATGAACCTCTCACTGACGATGATTGCGGACCGGCTGCTCGCCAAGCCGGAGCTCTTCCCCGACGTGGTCGCCGGAGAACTTCCGATCAGCACTTACGCCAG
ACTTTCCGGTAAAGTGACGGTAATCGGCGTGTTCAAGATTCATGTTGTGGCCTCGACGGCTTGTGATTTCGCCGTCGACATTAAAAACCGAAGTATTGGAGATCAGCAGT
GCCATTATCGAACTCAGCTC
Protein sequenceShow/hide protein sequence
AKLAKQRKFCLGILLAVIGLVILLVILGFPVFKPKRPTIAVDSVSLLDLAVSLDPTRLAVDLNLTLMVALSVRNPNKVAFKYSDSTAVIRYRGEEFGEAPIPAGRLAADG
TQGMNLSLTMIADRLLAKPELFPDVVAGELPISTYARLSGKVTVIGVFKIHVVASTACDFAVDIKNRSIGDQQCHYRTQL