CuGenDBv2

MC01g0968 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g0968
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Epidermal patterning factor-like protein
Genome location: MC01:15336005..15337683
RNA-Seq Expression: MC01g0968
Synteny: MC01g0968
Gene Ontology terms:
  GO:0010052 - guard cell differentiation (biological process)
  GO:0005576 - extracellular region (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR039455 - EPIDERMAL PATTERNING FACTOR-like protein
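
The genome location listed for this gene uses the compact `CHROM:START..END` notation. As a minimal sketch of how such a string can be read, the snippet below splits it into its parts and computes the span length. The helper name `parse_location` is hypothetical, and the length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'CHROM:START..END' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("MC01:15336005..15337683")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 1679 bp on MC01.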


Homology
GenBank top hits (e value, % identity, alignment)
KAG7026905.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.04e-64; % identity: 78.74)
Query:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN
        M CECNNNGVIGRSRILCAT+ FLF LILASTQMRFMAE          V E+KVVLRGQIGSRPPKCERRCSWC HCEAIQVP NPQKS     S   N
Subjt:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

XP_022132530.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Momordica charantia] (e value: 3.87e-79; % identity: 92.13)
Query:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN
        MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE          VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN
Subjt:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        MAYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

XP_022963149.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita moschata] (e value: 1.02e-64; % identity: 80.31)
Query:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN
        M CECNNNGVIGRSRILCATV FLF LILASTQMRFMAE          V E+KVVLRGQIGSRPPKCERRCSWC HCEAIQVP NPQKS     SA  N
Subjt:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

XP_023003267.1 EPIDERMAL PATTERNING FACTOR-like protein 2 isoform X2 [Cucurbita maxima] (e value: 1.01e-64; % identity: 79.53)
Query:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN
        M CECNNNGVIGR RILCATV FLFLLILASTQMRFMAE          V E+KVVLRGQIGSRPPKCERRCSWC HCEAIQVP NPQKS     S   N
Subjt:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

XP_023518506.1 EPIDERMAL PATTERNING FACTOR-like protein 2 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 2.04e-64; % identity: 78.74)
Query:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN
        M CECNNNGVIGRSRILCAT+ FLF LILASTQMRFMAE          V E+KVVLRGQIGSRPPKCERRCSWC HCEAIQVP NPQKS     S   N
Subjt:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BSQ3 Epidermal patterning factor-like protein (e value: 1.87e-79; % identity: 92.13)
Query:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN
        MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE          VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN
Subjt:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        MAYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

A0A6J1GDI3 Epidermal patterning factor-like protein (e value: 4.02e-63; % identity: 77.34)
Query:  MSCECNNNGV-IGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFN
        M CECNNNGV IGRSRILCATV FL LLILASTQMR  AE          V E+K +LRGQIGS+PPKCERRCSWCGHCEAIQVP NPQKSA   SSA  
Subjt:  MSCECNNNGV-IGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFN

Query:  NMAYARDEASNYKPMSWKCKCGSLIFNP
        N+ YARDEASNYKPMSWKCKCGSLIFNP
Subjt:  NMAYARDEASNYKPMSWKCKCGSLIFNP

A0A6J1HJ94 Epidermal patterning factor-like protein (e value: 4.93e-65; % identity: 80.31)
Query:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN
        M CECNNNGVIGRSRILCATV FLF LILASTQMRFMAE          V E+KVVLRGQIGSRPPKCERRCSWC HCEAIQVP NPQKS     SA  N
Subjt:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

A0A6J1KLZ1 Epidermal patterning factor-like protein (e value: 4.90e-65; % identity: 79.53)
Query:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN
        M CECNNNGVIGR RILCATV FLFLLILASTQMRFMAE          V E+KVVLRGQIGSRPPKCERRCSWC HCEAIQVP NPQKS     S   N
Subjt:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

A0A6J1KNP8 Epidermal patterning factor-like protein (e value: 1.26e-64; % identity: 79.53)
Query:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN
        M CECNNNGVIGR RILCATV FLFLLILASTQMRFMAE          V E+KVVLRGQIGSRPPKCERRCSWC HCEAIQVP NPQKS     S   N
Subjt:  MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAE----------VGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

SwissProt top hits (e value, % identity, alignment)
C4B8C4 EPIDERMAL PATTERNING FACTOR-like protein 3 (e value: 3.7e-07; % identity: 37.68)
Query:  EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKC
        EE V  R +IGS+PP CE++C  C  CEAIQ PT             +++ +     +NY+P  W+C C
Subjt:  EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKC

Q2V3I3 EPIDERMAL PATTERNING FACTOR-like protein 4 (e value: 2.9e-04; % identity: 31.88)
Query:  GQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKCGSLIFNP
        G  GS PP C  +C  C  C+ + VP  P  S                    Y P +W+CKCG+ +F P
Subjt:  GQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKCGSLIFNP

Q9LFT5 EPIDERMAL PATTERNING FACTOR-like protein 1 (e value: 2.5e-11; % identity: 33.33)
Query:  VPFLFLLILASTQMRFMAEVG---EEKVVL---RGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAF--------NNMAYARDEASNYKPMSW
        +P + +L++      F+  +      +V L   + ++GS PP C  RC+ C  C AIQVPT P +S  T  + F        +++    D+ SNYKPM W
Subjt:  VPFLFLLILASTQMRFMAEVG---EEKVVL---RGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAF--------NNMAYARDEASNYKPMSW

Query:  KCKCGSLIFNP
        KC C    +NP
Subjt:  KCKCGSLIFNP

Q9LUH9 EPIDERMAL PATTERNING FACTOR-like protein 5 (e value: 8.5e-04; % identity: 31.88)
Query:  GQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKCGSLIFNP
        G  GS PP C  +C  C  C+A+ VP  P                       Y P +W+CKCG+ +F P
Subjt:  GQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKCGSLIFNP

Q9T068 EPIDERMAL PATTERNING FACTOR-like protein 2 (e value: 1.5e-21; % identity: 49.57)
Query:  LFLLILASTQMRFMAEVGEE--------------KVVLRGQIGSRPPKCER-RCSWCGHCEAIQVPTNPQ-------KSANTNSSAFNNMAYAR-DEASN
        L LLIL ST    MA    E              K+++RG IGSRPP+CER RC  CGHCEAIQVPTNPQ        +++++SS   ++ Y R D+++N
Subjt:  LFLLILASTQMRFMAEVGEE--------------KVVLRGQIGSRPPKCER-RCSWCGHCEAIQVPTNPQ-------KSANTNSSAFNNMAYAR-DEASN

Query:  YKPMSWKCKCGSLIFNP
        YKPMSWKCKCG+ I+NP
Subjt:  YKPMSWKCKCGSLIFNP

Arabidopsis top hits (e value, % identity, alignment)
AT3G13898.1 unknown protein (e value: 2.6e-08; % identity: 37.68)
Query:  EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKC
        EE V  R +IGS+PP CE++C  C  CEAIQ PT             +++ +     +NY+P  W+C C
Subjt:  EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKC

AT3G22820.1 allergen-related (e value: 6.1e-05; % identity: 31.88)
Query:  GQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKCGSLIFNP
        G  GS PP C  +C  C  C+A+ VP  P                       Y P +W+CKCG+ +F P
Subjt:  GQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKCGSLIFNP

AT4G14723.1 BEST Arabidopsis thaliana protein match is: allergen-related (TAIR:AT3G22820.1) (e value: 2.1e-05; % identity: 31.88)
Query:  GQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKCGSLIFNP
        G  GS PP C  +C  C  C+ + VP  P  S                    Y P +W+CKCG+ +F P
Subjt:  GQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKCGSLIFNP

AT4G37810.1 unknown protein (e value: 1.1e-22; % identity: 49.57)
Query:  LFLLILASTQMRFMAEVGEE--------------KVVLRGQIGSRPPKCER-RCSWCGHCEAIQVPTNPQ-------KSANTNSSAFNNMAYAR-DEASN
        L LLIL ST    MA    E              K+++RG IGSRPP+CER RC  CGHCEAIQVPTNPQ        +++++SS   ++ Y R D+++N
Subjt:  LFLLILASTQMRFMAEVGEE--------------KVVLRGQIGSRPPKCER-RCSWCGHCEAIQVPTNPQ-------KSANTNSSAFNNMAYAR-DEASN

Query:  YKPMSWKCKCGSLIFNP
        YKPMSWKCKCG+ I+NP
Subjt:  YKPMSWKCKCGSLIFNP

AT5G10310.1 unknown protein (e value: 1.8e-12; % identity: 33.33)
Query:  VPFLFLLILASTQMRFMAEVG---EEKVVL---RGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAF--------NNMAYARDEASNYKPMSW
        +P + +L++      F+  +      +V L   + ++GS PP C  RC+ C  C AIQVPT P +S  T  + F        +++    D+ SNYKPM W
Subjt:  VPFLFLLILASTQMRFMAEVG---EEKVVL---RGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAF--------NNMAYARDEASNYKPMSW

Query:  KCKCGSLIFNP
        KC C    +NP
Subjt:  KCKCGSLIFNP


Sequences
CDS sequence
ATGAGTTGTGAGTGTAACAACAATGGCGTCATTGGGCGCAGCAGAATCTTGTGTGCGACTGTTCCTTTTCTCTTTCTTCTGATTTTGGCATCGACCCAGATGAGATTCAT
GGCTGAAGTGGGTGAAGAGAAAGTGGTATTGAGAGGACAAATTGGGTCAAGGCCTCCAAAATGTGAAAGGAGATGCAGCTGGTGTGGGCACTGTGAGGCCATTCAGGTGC
CTACAAACCCACAAAAATCAGCCAACACAAACTCTTCAGCATTCAACAACATGGCTTATGCAAGAGATGAAGCCTCCAATTACAAGCCCATGAGCTGGAAATGCAAATGT
GGGAGCCTAATCTTCAACCCC
mRNA sequence
TTCCCATCCCCTCACCATTTTTTTTTTTGCTCTTGCTCAACAAGTTCAAACAGACAAAATACAAATGTCGGTTAGGCTACACTTATATTTATGAATGAAGCAATTAAAGT
TAAACCATGGTTTATTAATTCAATTCAAACTGGGGTTTTACAAGTTCAGAAATTGACAATCTGGGTCGACGATATAATAATCAAATTTTTGGGGGGCATGGCAGAAAATT
GCCGAAACCGAAACGAAATCCTGTAGACCTCTTGCTTGGGAAGAAGAAAGAAGGATATGGATATGGATATGGCGTGATTAGGTGGGAGTGTTCAGAACAAACTCACAAGA
AAACTTCATTAAACCAATAGAAAAAGAAAAAAAAAAAAAGACAAAAGGGAAACGAAAACGAAATCCCACTCCCACACAGCTGCCATTGCGCTGCACACCTCTGGTTATGG
AAAAACTTTGCAGCATTTCGGAGTATTTCCAAATTTTCCTCTCGTCTTCTGCCTCCTCCTTTGTCTTTTGCTTCGTCGTTTCACTGTTCCAATAAACTCTTCTTCCATCT
TCTTTTCTCTTTCCTCAATTATTTTTTCCCATTTCCAATCCAAAGTTCAAACCCCTTCATTTTTGTTTTCTTTTTTACTAGTTAATTTTTGGGTTATAGTTTTTTGGCCT
TGGCCTTCGGGGCTCTCCTTTCGAGTTCTGAACTTCTGAGGGCTTCTGCACTTTTCGGTACTCCCTCTTTTTTCTTTCGGTCTCTGCAAATTTCATTTCTTTTTTCCTTT
TCTTTTTCATTTGTCTTCCACGTTCCATCATCTCAAATATCTTTCCCAGACTAAGACAGATTCAGAGCAAATTATATTCATATATTTGCACGTACAAGCCAGCAATAAAA
ATAAAACAAAATCAGAAGCTAGCTAATCTCGCCGATTCAGATTAACTCTCACTTCACTCTTCAACTTCATTCGCCACCCACTTTTTTTTCCCCTTTATCTCATTCCTTTT
TATCCCTTTCTTGTGAATAACTTTGCAGCTTGGTTTTATTTTTAACCTCCCCTTCTCCAGGGAGAATTCCCTGTCTCTTTCGCACCTCCACTGAGAAAAAAGAAGAATGA
GTTGTGAGTGTAACAACAATGGCGTCATTGGGCGCAGCAGAATCTTGTGTGCGACTGTTCCTTTTCTCTTTCTTCTGATTTTGGCATCGACCCAGATGAGATTCATGGCT
GAAGTGGGTGAAGAGAAAGTGGTATTGAGAGGACAAATTGGGTCAAGGCCTCCAAAATGTGAAAGGAGATGCAGCTGGTGTGGGCACTGTGAGGCCATTCAGGTGCCTAC
AAACCCACAAAAATCAGCCAACACAAACTCTTCAGCATTCAACAACATGGCTTATGCAAGAGATGAAGCCTCCAATTACAAGCCCATGAGCTGGAAATGCAAATGTGGGA
GCCTAATCTTCAACCCC
Protein sequence
MSCECNNNGVIGRSRILCATVPFLFLLILASTQMRFMAEVGEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPTNPQKSANTNSSAFNNMAYARDEASNYKPMSWKCKC
GSLIFNP
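
The CDS and protein sequences above are related by codon translation. The following is a minimal sketch of that relationship, not the database's own pipeline: it assumes the standard genetic code (NCBI translation table 1) and translates only the first 30 nucleotides of the CDS. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard genetic code (NCBI translation table 1), codons ordered
# by first, second, then third base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence codon by codon, ignoring a trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 nucleotides of the CDS sequence shown above.
cds_prefix = "ATGAGTTGTGAGTGTAACAACAATGGCGTC"
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

The result, `MSCECNNNGV`, matches the first ten residues of the protein sequence listed for MC01g0968.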