; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0024449 (gene) of Chayote v1 genome

Gene IDSed0024449
OrganismSechium edule (Chayote v1)
DescriptionEpidermal patterning factor-like protein
Genome locationLG01:9256390..9257595
RNA-Seq ExpressionSed0024449
SyntenySed0024449
Gene Ontology termsGO:0010052 - guard cell differentiation (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR039455 - EPIDERMAL PATTERNING FACTOR-like protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065061.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucumis melo var. makuwa]3.3e-4163.87Show/hide
Query:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA------------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLH
        MGS  IW   RNKHV I ++FL VFI I  ++SIEGRGIQA   E K A            EK+GMMM +QIGSRPP CRR+C ECG  CEAVQVPV LH
Subjt:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA------------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLH

Query:  DSHQNQRK--RRTQAS-------SNHDVALSSEDETSNYKPISWKCKCGNFIFNP
        DS+QNQ+K  RR ++S       S HDVALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  DSHQNQRK--RRTQAS-------SNHDVALSSEDETSNYKPISWKCKCGNFIFNP

XP_004148430.2 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucumis sativus]1.0e-4265.33Show/hide
Query:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA-------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQN
        MGS  IW   RNKH+ I ++FL VFI I  ++SIEGRGIQA   E K A       EK+GMMM +QIGSRPP CRR+CRECG  CEAVQVPV LHDS+QN
Subjt:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA-------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQN

Query:  QRKRRTQ---------ASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP
        QRK R +          +S HDVALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  QRKRRTQ---------ASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP

XP_008444980.1 PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucumis melo]2.5e-4164.29Show/hide
Query:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA-----------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHD
        MGS  IW   RNKHV I ++FL VFI I  ++SIEGRGIQA   E K A           EK+GMMM +QIGSRPP CRR+C ECG  CEAVQVPV LHD
Subjt:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA-----------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHD

Query:  SHQNQRK--RRTQAS-------SNHDVALSSEDETSNYKPISWKCKCGNFIFNP
        S+QNQ+K  RR ++S       S HDVALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  SHQNQRK--RRTQAS-------SNHDVALSSEDETSNYKPISWKCKCGNFIFNP

XP_022131875.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Momordica charantia]4.3e-4160.53Show/hide
Query:  MGSLPIWQRNK-HVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKAEKVGM---------MMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQ
        MGSL  W RNK H+ IS++ L V IF+  IS  EGRGI  P+ E +K E  G          M+ SQIGSRPP CRRRCRECG  CEA+QVPV LHDS Q
Subjt:  MGSLPIWQRNK-HVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKAEKVGM---------MMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQ

Query:  NQRKRRTQ----------ASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP
        NQRK+R+            +SN D+ALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  NQRKRRTQ----------ASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP

XP_038886514.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Benincasa hispida]1.2e-4365.99Show/hide
Query:  MGSLPIWQRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKK--------AEKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHD-SHQN
        MGS   W R KHV IS++FL V I IH  S +EGRGIQ  +   +          EK GMMM +QIGSRPP CRRRCRECG  CEAVQVPV LHD SHQN
Subjt:  MGSLPIWQRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKK--------AEKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHD-SHQN

Query:  QRKRRTQASSN------HDVALSSEDETSNYKPISWKCKCGNFIFNP
        QRK+RT  SS+      HDVALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  QRKRRTQASSN------HDVALSSEDETSNYKPISWKCKCGNFIFNP

TrEMBL top hitse value%identityAlignment
A0A0A0LPQ0 Epidermal patterning factor-like protein4.9e-4365.33Show/hide
Query:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA-------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQN
        MGS  IW   RNKH+ I ++FL VFI I  ++SIEGRGIQA   E K A       EK+GMMM +QIGSRPP CRR+CRECG  CEAVQVPV LHDS+QN
Subjt:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA-------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQN

Query:  QRKRRTQ---------ASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP
        QRK R +          +S HDVALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  QRKRRTQ---------ASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP

A0A1S3BBM2 Epidermal patterning factor-like protein1.2e-4164.29Show/hide
Query:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA-----------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHD
        MGS  IW   RNKHV I ++FL VFI I  ++SIEGRGIQA   E K A           EK+GMMM +QIGSRPP CRR+C ECG  CEAVQVPV LHD
Subjt:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA-----------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHD

Query:  SHQNQRK--RRTQAS-------SNHDVALSSEDETSNYKPISWKCKCGNFIFNP
        S+QNQ+K  RR ++S       S HDVALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  SHQNQRK--RRTQAS-------SNHDVALSSEDETSNYKPISWKCKCGNFIFNP

A0A5A7VH43 Epidermal patterning factor-like protein1.6e-4163.87Show/hide
Query:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA------------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLH
        MGS  IW   RNKHV I ++FL VFI I  ++SIEGRGIQA   E K A            EK+GMMM +QIGSRPP CRR+C ECG  CEAVQVPV LH
Subjt:  MGSLPIW--QRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKA------------EKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLH

Query:  DSHQNQRK--RRTQAS-------SNHDVALSSEDETSNYKPISWKCKCGNFIFNP
        DS+QNQ+K  RR ++S       S HDVALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  DSHQNQRK--RRTQAS-------SNHDVALSSEDETSNYKPISWKCKCGNFIFNP

A0A6J1BQW9 Epidermal patterning factor-like protein2.1e-4160.53Show/hide
Query:  MGSLPIWQRNK-HVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKAEKVGM---------MMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQ
        MGSL  W RNK H+ IS++ L V IF+  IS  EGRGI  P+ E +K E  G          M+ SQIGSRPP CRRRCRECG  CEA+QVPV LHDS Q
Subjt:  MGSLPIWQRNK-HVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKAEKVGM---------MMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQ

Query:  NQRKRRTQ----------ASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP
        NQRK+R+            +SN D+ALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  NQRKRRTQ----------ASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP

A0A6J1KP85 Epidermal patterning factor-like protein1.1e-3959.87Show/hide
Query:  MGSLPIWQRNKHVPIS-IIFLFVFIFIHAISSIEGRGIQAPVKEDKKAE------KVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQR
        MGSL IW RNKH+ I  + FL VF+FIH  S  EGRG Q P+ ED+K E      ++  M  SQIGS+PP CRRRCRECG  CEAVQVPV +HDS   ++
Subjt:  MGSLPIWQRNKHVPIS-IIFLFVFIFIHAISSIEGRGIQAPVKEDKKAE------KVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQR

Query:  KRRTQ------------ASSNHD-VALSSEDETSNYKPISWKCKCGNFIFNP
        +RRT+            +SS H+ VALSSEDETSNYKPISWKCKCGNFIFNP
Subjt:  KRRTQ------------ASSNHD-VALSSEDETSNYKPISWKCKCGNFIFNP

SwissProt top hitse value%identityAlignment
Q9LFT5 EPIDERMAL PATTERNING FACTOR-like protein 11.3e-0536.59Show/hide
Query:  SQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQRKRRTQASSNHDVALSSE-----DETSNYKPISWKCKCGNFIFNP
        +++GS PP C  RC  C   C A+QVP          R  R    S   V   S      D+ SNYKP+ WKC C    +NP
Subjt:  SQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQRKRRTQASSNHDVALSSE-----DETSNYKPISWKCKCGNFIFNP

Q9T068 EPIDERMAL PATTERNING FACTOR-like protein 23.5e-1437.69Show/hide
Query:  IWQRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKAEK-VGMMMGSQIGSRPPICRR-RCRECGSRCEAVQVPVGLHDS-HQNQRKRRTQASSN
        +W  N  +   ++ L +    H      GR     V+  K  ++ V MMM   IGSRPP C R RCR CG  CEA+QVP       H       + +S  
Subjt:  IWQRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKAEK-VGMMMGSQIGSRPPICRR-RCRECGSRCEAVQVPVGLHDS-HQNQRKRRTQASSN

Query:  HDVALSSEDETSNYKPISWKCKCGNFIFNP
          +  +  D+++NYKP+SWKCKCGN I+NP
Subjt:  HDVALSSEDETSNYKPISWKCKCGNFIFNP

Arabidopsis top hitse value%identityAlignment
AT2G30370.1 allergen-related7.6e-0429.55Show/hide
Query:  KKAEKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQRKRRTQASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP
        K+AE   ++ G  +GS PP C  +C  C + C+ V VPV                             T+ Y P +W+CKCGN ++ P
Subjt:  KKAEKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQRKRRTQASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP

AT3G22820.1 allergen-related2.6e-0427.42Show/hide
Query:  LFVFIFIHAISSIEGRGIQAP-------VKEDKKAEKVGMMMGSQ----IGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQRKRRTQASSNHDVALS
        L V+ F+   SS     +Q P        KE  ++   G ++  +     GS PP+CR +C +C   C+AV VP+                     + + 
Subjt:  LFVFIFIHAISSIEGRGIQAP-------VKEDKKAEKVGMMMGSQ----IGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQRKRRTQASSNHDVALS

Query:  SEDETSNYKPISWKCKCGNFIFNP
         E     Y P +W+CKCGN +F P
Subjt:  SEDETSNYKPISWKCKCGNFIFNP

AT4G14723.1 BEST Arabidopsis thaliana protein match is: allergen-related (TAIR:AT3G22820.1)5.8e-0433.78Show/hide
Query:  GSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQRKRRTQASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP
        GS PP CR +C +C   C+ V VP+                       LS   E   Y P +W+CKCGN +F P
Subjt:  GSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQRKRRTQASSNHDVALSSEDETSNYKPISWKCKCGNFIFNP

AT4G37810.1 unknown protein2.5e-1537.69Show/hide
Query:  IWQRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKAEK-VGMMMGSQIGSRPPICRR-RCRECGSRCEAVQVPVGLHDS-HQNQRKRRTQASSN
        +W  N  +   ++ L +    H      GR     V+  K  ++ V MMM   IGSRPP C R RCR CG  CEA+QVP       H       + +S  
Subjt:  IWQRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKAEK-VGMMMGSQIGSRPPICRR-RCRECGSRCEAVQVPVGLHDS-HQNQRKRRTQASSN

Query:  HDVALSSEDETSNYKPISWKCKCGNFIFNP
          +  +  D+++NYKP+SWKCKCGN I+NP
Subjt:  HDVALSSEDETSNYKPISWKCKCGNFIFNP

AT5G10310.1 unknown protein9.5e-0736.59Show/hide
Query:  SQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQRKRRTQASSNHDVALSSE-----DETSNYKPISWKCKCGNFIFNP
        +++GS PP C  RC  C   C A+QVP          R  R    S   V   S      D+ SNYKP+ WKC C    +NP
Subjt:  SQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQRKRRTQASSNHDVALSSE-----DETSNYKPISWKCKCGNFIFNP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAGCCTTCCAATTTGGCAGAGAAACAAACATGTTCCAATTTCCATCATCTTCCTCTTTGTTTTCATCTTCATTCATGCCATATCCTCAATTGAAGGAAGAGGAAT
TCAGGCACCAGTAAAGGAGGATAAGAAGGCAGAGAAGGTGGGGATGATGATGGGGAGTCAAATCGGGTCGCGGCCGCCGATCTGCCGAAGAAGATGCAGGGAATGCGGCA
GCCGTTGCGAGGCAGTTCAAGTACCAGTGGGGCTGCATGATTCACATCAGAATCAAAGAAAAAGAAGAACCCAAGCTTCTTCTAATCATGATGTGGCTCTATCAAGTGAA
GATGAGACCTCAAATTACAAACCCATAAGCTGGAAATGCAAGTGTGGAAACTTCATCTTCAACCCATGA
mRNA sequenceShow/hide mRNA sequence
AATTTCACAACCATTTCCCCCTCTTATAACTTATAAACCTTTTCCCTTATTTAAACACCTTATTCTTCTCCTTCTTTTATTCTTACCTTTAACATTCCATATCCATCCAT
CTCCTTTTTCTTTCTTTTTTCAGTCTCTGAAAAGATGGGCAGCCTTCCAATTTGGCAGAGAAACAAACATGTTCCAATTTCCATCATCTTCCTCTTTGTTTTCATCTTCA
TTCATGCCATATCCTCAATTGAAGGAAGAGGAATTCAGGCACCAGTAAAGGAGGATAAGAAGGCAGAGAAGGTGGGGATGATGATGGGGAGTCAAATCGGGTCGCGGCCG
CCGATCTGCCGAAGAAGATGCAGGGAATGCGGCAGCCGTTGCGAGGCAGTTCAAGTACCAGTGGGGCTGCATGATTCACATCAGAATCAAAGAAAAAGAAGAACCCAAGC
TTCTTCTAATCATGATGTGGCTCTATCAAGTGAAGATGAGACCTCAAATTACAAACCCATAAGCTGGAAATGCAAGTGTGGAAACTTCATCTTCAACCCATGATGTATCT
TTTTTCCCACAATTCAATATTTGAATAAAAACAGCAGCTTGTAAAGATGATCAAAAGCTGGAAATCAGCTCTTCTTCTGATCCAACTTCCAAGGTTCCATGAAAGGAAGG
GACAGCAGCTAGGTGAAGTATGCAGAGTTTTAGTAAATGTATCTAAAGGGAAAATGAAATGAGAGTTTGAAAGAGAGGAAAATGAGGTGGCAGTGTTGTGTGTCAGGTGA
GAATTCAG
Protein sequenceShow/hide protein sequence
MGSLPIWQRNKHVPISIIFLFVFIFIHAISSIEGRGIQAPVKEDKKAEKVGMMMGSQIGSRPPICRRRCRECGSRCEAVQVPVGLHDSHQNQRKRRTQASSNHDVALSSE
DETSNYKPISWKCKCGNFIFNP