; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0609 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0609
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionEpidermal patterning factor-like protein
Genome locationMC04:5388441..5389494
RNA-Seq ExpressionMC04g0609
SyntenyMC04g0609
Gene Ontology termsGO:0010374 - stomatal complex development (biological process)
InterPro domainsIPR039455 - EPIDERMAL PATTERNING FACTOR-like protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB1208372.1 EPIDERMAL PATTERNING FACTOR-like protein 4 [Morella rubra]9.16e-2952.63Show/hide
Query:  MDHPKVINAIALILALSLLSLSCFVHSRTLSTAPSPAPEGG-GIGSH-----KGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYP
        MD   ++  +A  L L+     C VH R L TAPSPAPE G  IG H     KG  + + MR GSFPA C SKCN C+PC+ V+VSVR+M L EN EYYP
Subjt:  MDHPKVINAIALILALSLLSLSCFVHSRTLSTAPSPAPEGG-GIGSH-----KGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYP

Query:  QVWRCMCHHDIFFP
        QVW+CMC  ++F P
Subjt:  QVWRCMCHHDIFFP

KAE8645843.1 hypothetical protein Csa_017281 [Cucumis sativus]2.23e-3778.95Show/hide
Query:  PAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRS-MELAEN--IEYYPQVWRCMCHHDIFFP
        P PEG G+GSHKGV++ RH RKGSFP VCDSKCN CKPC LVQVSVRS M+L EN  IEYYPQVWRCMCHH+IFFP
Subjt:  PAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRS-MELAEN--IEYYPQVWRCMCHHDIFFP

KAG6571922.1 EPIDERMAL PATTERNING FACTOR-like protein 4, partial [Cucurbita argyrosperma subsp. sororia]4.03e-5075.24Show/hide
Query:  KVINAIALILALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAEN-IEYYPQVWRCMCHH
        K+  A+AL+ AL+LLS SC V SRTL TAPSPAPEG G G HKGV   +HMRKGSFP  CDSKCN CKPC LVQVSVRSM L EN IEYYPQVWRCMCHH
Subjt:  KVINAIALILALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAEN-IEYYPQVWRCMCHH

Query:  DIFFP
        +IFFP
Subjt:  DIFFP

KAG7011608.1 EPIDERMAL PATTERNING FACTOR-like protein 4, partial [Cucurbita argyrosperma subsp. argyrosperma]2.76e-5075.24Show/hide
Query:  KVINAIALILALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAEN-IEYYPQVWRCMCHH
        K+  A+AL+ AL+LLS SC V SRTL TAPSPAPEG G G HKGV   +HMRKGSFP  CDSKCN CKPC LVQVSVRSM L EN IEYYPQVWRCMCHH
Subjt:  KVINAIALILALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAEN-IEYYPQVWRCMCHH

Query:  DIFFP
        +IFFP
Subjt:  DIFFP

XP_021617896.1 EPIDERMAL PATTERNING FACTOR-like protein 4 [Manihot esculenta]6.85e-2652.48Show/hide
Query:  ILALSLLSLSCFVHSRTLSTAPSPAP-----EGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCHHDIFF
        IL   +L  S  VHSR L+ +PSPAP     E GG+    G   +    +GSFPA C  KCN CKPC+ VQVSV +ME  EN EYYPQVW+C+C  DIF 
Subjt:  ILALSLLSLSCFVHSRTLSTAPSPAP-----EGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCHHDIFF

Query:  P
        P
Subjt:  P

TrEMBL top hitse value%identityAlignment
A0A2C9VK80 Epidermal patterning factor-like protein3.32e-2652.48Show/hide
Query:  ILALSLLSLSCFVHSRTLSTAPSPAP-----EGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCHHDIFF
        IL   +L  S  VHSR L+ +PSPAP     E GG+    G   +    +GSFPA C  KCN CKPC+ VQVSV +ME  EN EYYPQVW+C+C  DIF 
Subjt:  ILALSLLSLSCFVHSRTLSTAPSPAP-----EGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCHHDIFF

Query:  P
        P
Subjt:  P

A0A2P5XK54 Epidermal patterning factor-like protein2.03e-2551.38Show/hide
Query:  MDHPKVINAIALI-LALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRC
        M+H K I  I +  L L++L  S  VHSR L+ +PSP+PE  G+     +  TR  R GS PA CDSKC  C+PC+ V+VSVR+ EL EN EYYPQVW+C
Subjt:  MDHPKVINAIALI-LALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRC

Query:  MCHHDIFFP
        MC  +I+ P
Subjt:  MCHHDIFFP

A0A5D2H9D7 Epidermal patterning factor-like protein2.03e-2551.38Show/hide
Query:  MDHPKVINAIALI-LALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRC
        M+H K I  I +  L L++L  S  VHSR L+ +PSP+PE  G+     +  TR  R GS PA CDSKC  C+PC+ V+VSVR+ EL EN EYYPQVW+C
Subjt:  MDHPKVINAIALI-LALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRC

Query:  MCHHDIFFP
        MC  +I+ P
Subjt:  MCHHDIFFP

A0A5D2RAU7 Epidermal patterning factor-like protein1.01e-2551.38Show/hide
Query:  MDHPKVINAIALI-LALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRC
        M+H K I  I +  L L++L  S  VHSR L+ +PSP+PE  G+     +  TR  R GS PA CDSKC  C+PC+ V+VSVR+ EL EN EYYPQVW+C
Subjt:  MDHPKVINAIALI-LALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRC

Query:  MCHHDIFFP
        MC  +I+ P
Subjt:  MCHHDIFFP

A0A6A1V6A3 Epidermal patterning factor-like protein4.43e-2952.63Show/hide
Query:  MDHPKVINAIALILALSLLSLSCFVHSRTLSTAPSPAPEGG-GIGSH-----KGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYP
        MD   ++  +A  L L+     C VH R L TAPSPAPE G  IG H     KG  + + MR GSFPA C SKCN C+PC+ V+VSVR+M L EN EYYP
Subjt:  MDHPKVINAIALILALSLLSLSCFVHSRTLSTAPSPAPEGG-GIGSH-----KGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYP

Query:  QVWRCMCHHDIFFP
        QVW+CMC  ++F P
Subjt:  QVWRCMCHHDIFFP

SwissProt top hitse value%identityAlignment
Q1G3V9 EPIDERMAL PATTERNING FACTOR-like protein 81.2e-0432.84Show/hide
Query:  GSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSM--ELAENIEYYPQVWRCMCHHDIFFP
        G+H      +    GS P VC +KC +CKPC+     +R    +  ++  YYP  W C C   +F P
Subjt:  GSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSM--ELAENIEYYPQVWRCMCHHDIFFP

Q2V3I3 EPIDERMAL PATTERNING FACTOR-like protein 45.8e-0734.29Show/hide
Query:  VINAIALILALSLLSLSCFVHS--RTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCHH
        ++ A+     L L S S  V +  R +         GG I S+K     R    GS P  C SKC  C+PC  V V ++   L+  +EYYP+ WRC C +
Subjt:  VINAIALILALSLLSLSCFVHS--RTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCHH

Query:  DIFFP
         +F P
Subjt:  DIFFP

Q9LUH9 EPIDERMAL PATTERNING FACTOR-like protein 57.6e-0732.08Show/hide
Query:  LALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKG-----------VFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCH
        + L  L +  F+   + S+A S     GG+G  K            V + R    GS P +C  KC  C+PC  V V ++   L   +EYYP+ WRC C 
Subjt:  LALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKG-----------VFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCH

Query:  HDIFFP
        + +F P
Subjt:  HDIFFP

Arabidopsis top hitse value%identityAlignment
AT1G80133.1 unknown protein8.6e-0632.84Show/hide
Query:  GSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSM--ELAENIEYYPQVWRCMCHHDIFFP
        G+H      +    GS P VC +KC +CKPC+     +R    +  ++  YYP  W C C   +F P
Subjt:  GSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSM--ELAENIEYYPQVWRCMCHHDIFFP

AT3G22820.1 allergen-related5.4e-0832.08Show/hide
Query:  LALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKG-----------VFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCH
        + L  L +  F+   + S+A S     GG+G  K            V + R    GS P +C  KC  C+PC  V V ++   L   +EYYP+ WRC C 
Subjt:  LALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKG-----------VFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCH

Query:  HDIFFP
        + +F P
Subjt:  HDIFFP

AT4G14723.1 BEST Arabidopsis thaliana protein match is: allergen-related (TAIR:AT3G22820.1)4.1e-0834.29Show/hide
Query:  VINAIALILALSLLSLSCFVHS--RTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCHH
        ++ A+     L L S S  V +  R +         GG I S+K     R    GS P  C SKC  C+PC  V V ++   L+  +EYYP+ WRC C +
Subjt:  VINAIALILALSLLSLSCFVHS--RTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCHH

Query:  DIFFP
         +F P
Subjt:  DIFFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCACCCAAAGGTCATAAACGCTATTGCACTCATTCTCGCTCTCTCGTTGCTGAGTCTTTCATGTTTCGTTCACTCAAGAACTCTCTCGACCGCTCCATCGCCGGC
ACCTGAAGGAGGGGGAATTGGAAGCCATAAAGGAGTGTTTGAAACGAGGCATATGAGGAAGGGCTCATTTCCAGCAGTGTGTGATTCAAAGTGCAACCATTGCAAACCTT
GCATACTTGTTCAAGTATCAGTTAGATCCATGGAATTGGCTGAGAATATTGAATACTATCCTCAAGTATGGAGGTGCATGTGTCATCACGATATATTTTTTCCATAA
mRNA sequenceShow/hide mRNA sequence
CCAAAATCCCCTCAAAACCAACATCCGTAGACACAAACAAAAGCATGGATCACCCAAAGGTCATAAACGCTATTGCACTCATTCTCGCTCTCTCGTTGCTGAGTCTTTCA
TGTTTCGTTCACTCAAGAACTCTCTCGACCGCTCCATCGCCGGCACCTGAAGGAGGGGGAATTGGAAGCCATAAAGGAGTGTTTGAAACGAGGCATATGAGGAAGGGCTC
ATTTCCAGCAGTGTGTGATTCAAAGTGCAACCATTGCAAACCTTGCATACTTGTTCAAGTATCAGTTAGATCCATGGAATTGGCTGAGAATATTGAATACTATCCTCAAG
TATGGAGGTGCATGTGTCATCACGATATATTTTTTCCATAAAATAAAGTCCTACGTCAATATCAATGAAAATATCAAAATCTCGATTTCATCATATGTGAATAAAAATGC
ATGTTGGAGGTGTCGATGAATATTTCTATAAAAAAAAAATTATGAAAAGCAAGAGAAAGTCATACAAATTATTTAAAGTTATATACAGATACATGTATATTTTTTGTTTT
TCTAAATATTAGTACTTATATTATAGGGCTCAAGTATTTACAAAATCTGTCCTGGGATTACACATGTTATAAAGTTTAAAGTTAGAAGGTAAATAATATATACATATTGC
TATTAATTATCTATATTATATATATATAACTTGGAGAGAATATACTTGTCAATCTCCACTTGAAGGTGCAAAACATGTAATCCTTGTCCCATCTACACACAAAGTTAAAT
AAATAGAAAAATTATATGGGTA
Protein sequenceShow/hide protein sequence
MDHPKVINAIALILALSLLSLSCFVHSRTLSTAPSPAPEGGGIGSHKGVFETRHMRKGSFPAVCDSKCNHCKPCILVQVSVRSMELAENIEYYPQVWRCMCHHDIFFP