; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026198 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026198
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionEpidermal patterning factor-like protein
Genome locationtig00153031:2735063..2735666
RNA-Seq ExpressionSgr026198
SyntenySgr026198
Gene Ontology termsGO:0010052 - guard cell differentiation (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR039455 - EPIDERMAL PATTERNING FACTOR-like protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132530.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Momordica charantia]1.1e-5787.4Show/hide
Query:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN
        M CEC NNGVIGRSRI CATVP LFLL+LASTQ+RFMAEGR + KRGQTV EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVP NPQKS + NSSA  N
Subjt:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        MAYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

XP_022963149.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita moschata]1.9e-5485.04Show/hide
Query:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN
        MGCEC NNGVIGRSRI CATV  LF L+LASTQ+RFMAEGRS+SK G+TVSE+KVVLRGQIGSRPPKCERRCSWC HCEAIQVPANPQK     SSA+KN
Subjt:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

XP_023003266.1 EPIDERMAL PATTERNING FACTOR-like protein 2 isoform X1 [Cucurbita maxima]7.1e-5484.25Show/hide
Query:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN
        MGCEC NNGVIGR RI CATV  LFLL+LASTQ+RFMAEGRS+SK G+TVSE+KVVLRGQIGSRPPKCERRCSWC HCEAIQVPANPQK     SS +KN
Subjt:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

XP_023003267.1 EPIDERMAL PATTERNING FACTOR-like protein 2 isoform X2 [Cucurbita maxima]7.1e-5484.25Show/hide
Query:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN
        MGCEC NNGVIGR RI CATV  LFLL+LASTQ+RFMAEGRS+SK G+TVSE+KVVLRGQIGSRPPKCERRCSWC HCEAIQVPANPQK     SS +KN
Subjt:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

XP_038882800.1 EPIDERMAL PATTERNING FACTOR-like protein 2 isoform X1 [Benincasa hispida]1.9e-5482.03Show/hide
Query:  MGCECNNGVIG-RSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVS-EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALK
        MGCECNNGV+G RSRI C+TV +LF L+LASTQ+RFMAEGR +S+ G+TV+ E+KVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKS  +NSS +K
Subjt:  MGCECNNGVIG-RSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVS-EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALK

Query:  NMAYARDEASNYKPMSWKCKCGSLIFNP
        N+AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  NMAYARDEASNYKPMSWKCKCGSLIFNP

TrEMBL top hitse value%identityAlignment
A0A6J1BSQ3 Epidermal patterning factor-like protein5.1e-5887.4Show/hide
Query:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN
        M CEC NNGVIGRSRI CATVP LFLL+LASTQ+RFMAEGR + KRGQTV EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVP NPQKS + NSSA  N
Subjt:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        MAYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

A0A6J1GDI3 Epidermal patterning factor-like protein2.5e-5278.91Show/hide
Query:  MGCECNNG--VIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALK
        MGCECNN   VIGRSRI CATV  L LL+LASTQ+R  AEGRS+S R +TV+E+K +LRGQIGS+PPKCERRCSWCGHCEAIQVPANPQKS ++ SSA+K
Subjt:  MGCECNNG--VIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALK

Query:  NMAYARDEASNYKPMSWKCKCGSLIFNP
        N+ YARDEASNYKPMSWKCKCGSLIFNP
Subjt:  NMAYARDEASNYKPMSWKCKCGSLIFNP

A0A6J1HJ94 Epidermal patterning factor-like protein9.1e-5585.04Show/hide
Query:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN
        MGCEC NNGVIGRSRI CATV  LF L+LASTQ+RFMAEGRS+SK G+TVSE+KVVLRGQIGSRPPKCERRCSWC HCEAIQVPANPQK     SSA+KN
Subjt:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

A0A6J1KLZ1 Epidermal patterning factor-like protein3.5e-5484.25Show/hide
Query:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN
        MGCEC NNGVIGR RI CATV  LFLL+LASTQ+RFMAEGRS+SK G+TVSE+KVVLRGQIGSRPPKCERRCSWC HCEAIQVPANPQK     SS +KN
Subjt:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

A0A6J1KNP8 Epidermal patterning factor-like protein3.5e-5484.25Show/hide
Query:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN
        MGCEC NNGVIGR RI CATV  LFLL+LASTQ+RFMAEGRS+SK G+TVSE+KVVLRGQIGSRPPKCERRCSWC HCEAIQVPANPQK     SS +KN
Subjt:  MGCEC-NNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKN

Query:  MAYARDEASNYKPMSWKCKCGSLIFNP
        +AYARDEASNYKPMSWKCKCGSLIFNP
Subjt:  MAYARDEASNYKPMSWKCKCGSLIFNP

SwissProt top hitse value%identityAlignment
C4B8C4 EPIDERMAL PATTERNING FACTOR-like protein 31.5e-0636.23Show/hide
Query:  EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNMAYARDEASNYKPMSWKCKC
        EE V  R +IGS+PP CE++C  C  CEAIQ P             + ++ +     +NY+P  W+C C
Subjt:  EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNMAYARDEASNYKPMSWKCKC

Q2V3I3 EPIDERMAL PATTERNING FACTOR-like protein 41.1e-0429.37Show/hide
Query:  GVIGRSRIFCATVPILFLLL-LASTQLRFMAEGRSMSKR------GQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNM
        G   R R F     + F LL L S      A+GR + +R      G  +   K    G  GS PP C  +C  C  C+ + VP  P  S           
Subjt:  GVIGRSRIFCATVPILFLLL-LASTQLRFMAEGRSMSKR------GQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNM

Query:  AYARDEASNYKPMSWKCKCGSLIFNP
                 Y P +W+CKCG+ +F P
Subjt:  AYARDEASNYKPMSWKCKCGSLIFNP

Q9LFT5 EPIDERMAL PATTERNING FACTOR-like protein 17.3e-0930.43Show/hide
Query:  VPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSAL--------KNMAYARDEASNYK
        +P++ +LL+      F+   +        + E+K     ++GS PP C  RC+ C  C AIQVP  P +S     +           ++    D+ SNYK
Subjt:  VPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSAL--------KNMAYARDEASNYK

Query:  PMSWKCKCGSLIFNP
        PM WKC C    +NP
Subjt:  PMSWKCKCGSLIFNP

Q9LUH9 EPIDERMAL PATTERNING FACTOR-like protein 52.4e-0430.86Show/hide
Query:  GQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNMAYARDEASNYKPMSWKCKCGSLIFNP
        GQ V ++++   G  GS PP C  +C  C  C+A+ VP  P                       Y P +W+CKCG+ +F P
Subjt:  GQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNMAYARDEASNYKPMSWKCKCGSLIFNP

Q9T068 EPIDERMAL PATTERNING FACTOR-like protein 21.2e-2250Show/hide
Query:  ILFLLLLASTQLRFMAEGR------SMSKRGQTVSEEKVVLRGQIGSRPPKCER-RCSWCGHCEAIQVPANPQ-------KSGSRNSSALKNMAYAR-DE
        +L LL+L ST    MA GR        +K G    + K+++RG IGSRPP+CER RC  CGHCEAIQVP NPQ        + S +SS   ++ Y R D+
Subjt:  ILFLLLLASTQLRFMAEGR------SMSKRGQTVSEEKVVLRGQIGSRPPKCER-RCSWCGHCEAIQVPANPQ-------KSGSRNSSALKNMAYAR-DE

Query:  ASNYKPMSWKCKCGSLIFNP
        ++NYKPMSWKCKCG+ I+NP
Subjt:  ASNYKPMSWKCKCGSLIFNP

Arabidopsis top hitse value%identityAlignment
AT3G13898.1 unknown protein1.1e-0736.23Show/hide
Query:  EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNMAYARDEASNYKPMSWKCKC
        EE V  R +IGS+PP CE++C  C  CEAIQ P             + ++ +     +NY+P  W+C C
Subjt:  EEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNMAYARDEASNYKPMSWKCKC

AT3G22820.1 allergen-related1.7e-0530.86Show/hide
Query:  GQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNMAYARDEASNYKPMSWKCKCGSLIFNP
        GQ V ++++   G  GS PP C  +C  C  C+A+ VP  P                       Y P +W+CKCG+ +F P
Subjt:  GQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNMAYARDEASNYKPMSWKCKCGSLIFNP

AT4G14723.1 BEST Arabidopsis thaliana protein match is: allergen-related (TAIR:AT3G22820.1)7.7e-0629.37Show/hide
Query:  GVIGRSRIFCATVPILFLLL-LASTQLRFMAEGRSMSKR------GQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNM
        G   R R F     + F LL L S      A+GR + +R      G  +   K    G  GS PP C  +C  C  C+ + VP  P  S           
Subjt:  GVIGRSRIFCATVPILFLLL-LASTQLRFMAEGRSMSKR------GQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNM

Query:  AYARDEASNYKPMSWKCKCGSLIFNP
                 Y P +W+CKCG+ +F P
Subjt:  AYARDEASNYKPMSWKCKCGSLIFNP

AT4G37810.1 unknown protein8.2e-2450Show/hide
Query:  ILFLLLLASTQLRFMAEGR------SMSKRGQTVSEEKVVLRGQIGSRPPKCER-RCSWCGHCEAIQVPANPQ-------KSGSRNSSALKNMAYAR-DE
        +L LL+L ST    MA GR        +K G    + K+++RG IGSRPP+CER RC  CGHCEAIQVP NPQ        + S +SS   ++ Y R D+
Subjt:  ILFLLLLASTQLRFMAEGR------SMSKRGQTVSEEKVVLRGQIGSRPPKCER-RCSWCGHCEAIQVPANPQ-------KSGSRNSSALKNMAYAR-DE

Query:  ASNYKPMSWKCKCGSLIFNP
        ++NYKPMSWKCKCG+ I+NP
Subjt:  ASNYKPMSWKCKCGSLIFNP

AT5G10310.1 unknown protein5.2e-1030.43Show/hide
Query:  VPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSAL--------KNMAYARDEASNYK
        +P++ +LL+      F+   +        + E+K     ++GS PP C  RC+ C  C AIQVP  P +S     +           ++    D+ SNYK
Subjt:  VPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSAL--------KNMAYARDEASNYK

Query:  PMSWKCKCGSLIFNP
        PM WKC C    +NP
Subjt:  PMSWKCKCGSLIFNP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGTGAGTGTAACAATGGCGTCATTGGCCGCAGCAGAATCTTTTGTGCCACTGTTCCTATTCTCTTTCTTCTGTTATTGGCATCGACCCAGCTGAGATTCATGGC
TGAAGGCAGATCGATGTCAAAGAGAGGCCAGACAGTGAGTGAAGAGAAAGTGGTATTAAGAGGTCAAATAGGGTCAAGGCCCCCAAAATGCGAGAGAAGATGCAGCTGGT
GTGGACACTGTGAGGCCATTCAAGTGCCTGCAAACCCGCAAAAATCAGGCAGCAGAAACTCTTCAGCACTGAAGAACATGGCTTATGCAAGGGACGAGGCCTCCAATTAC
AAGCCCATGAGCTGGAAATGCAAATGCGGGAGCTTAATCTTCAACCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTGTGAGTGTAACAATGGCGTCATTGGCCGCAGCAGAATCTTTTGTGCCACTGTTCCTATTCTCTTTCTTCTGTTATTGGCATCGACCCAGCTGAGATTCATGGC
TGAAGGCAGATCGATGTCAAAGAGAGGCCAGACAGTGAGTGAAGAGAAAGTGGTATTAAGAGGTCAAATAGGGTCAAGGCCCCCAAAATGCGAGAGAAGATGCAGCTGGT
GTGGACACTGTGAGGCCATTCAAGTGCCTGCAAACCCGCAAAAATCAGGCAGCAGAAACTCTTCAGCACTGAAGAACATGGCTTATGCAAGGGACGAGGCCTCCAATTAC
AAGCCCATGAGCTGGAAATGCAAATGCGGGAGCTTAATCTTCAACCCTTAA
Protein sequenceShow/hide protein sequence
MGCECNNGVIGRSRIFCATVPILFLLLLASTQLRFMAEGRSMSKRGQTVSEEKVVLRGQIGSRPPKCERRCSWCGHCEAIQVPANPQKSGSRNSSALKNMAYARDEASNY
KPMSWKCKCGSLIFNP