; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G004890 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G004890
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionEpidermal patterning factor-like protein
Genome locationCG_Chr05:4725637..4726030
RNA-Seq ExpressionClCG05G004890
SyntenyClCG05G004890
Gene Ontology termsGO:0010052 - guard cell differentiation (biological process)
GO:0005576 - extracellular region (cellular component)
InterPro domainsIPR039455 - EPIDERMAL PATTERNING FACTOR-like protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597420.1 EPIDERMAL PATTERNING FACTOR-like protein 4, partial [Cucurbita argyrosperma subsp. sororia]2.6e-2764.21Show/hide
Query:  HPSNYYYHFSLLL-VLFAFAAL---LSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        H  + + +FSLLL +L AFAA+    +A+     W+G     +GPGSSPPTCR +CG+CWPC+PVHVPIQPG+SVPLEYYPEAWRCKCGNKLFMP
Subjt:  HPSNYYYHFSLLL-VLFAFAAL---LSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

KAG7615994.1 hypothetical protein ISN45_At04g015230 [Arabidopsis thaliana x Arabidopsis arenosa]4.9e-2665.56Show/hide
Query:  LLVLFAFAALLSA----------SPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        LL LF+ ++++SA          S   GG++G  K   GPGSSPPTCR KCGKC PC+PVHVPIQPGLS+PLEYYPEAWRCKCGNKLFMP
Subjt:  LLVLFAFAALLSA----------SPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

KGN56945.1 hypothetical protein Csa_009821 [Cucumis sativus]2.9e-3470.09Show/hide
Query:  MGS--LPPLHPSNYYYHFSLL-LVLFAF------AALLSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKC
        MGS  LP  HPS  ++HFSLL L LFAF      ++  S+S ++GGW+GD+K LVGPGSSPPTC  KCG+C PCEPVHVPIQPGLS+PLEYYPEAWRCKC
Subjt:  MGS--LPPLHPSNYYYHFSLL-LVLFAF------AALLSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKC

Query:  GNKLFMP
        GNKLFMP
Subjt:  GNKLFMP

XP_008438622.1 PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 4 [Cucumis melo]8.3e-3466.37Show/hide
Query:  MGS--LPPLHPSNYYYHFSLL-LVLFAF------------AALLSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPE
        MGS  LP  HPS  ++HFSLL L LFAF            ++  S+S ++GGW+GD+K LVGPGSSPPTC  KCG+C PCEPVHVPIQPGLS+PLEYYPE
Subjt:  MGS--LPPLHPSNYYYHFSLL-LVLFAF------------AALLSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPE

Query:  AWRCKCGNKLFMP
        AWRCKCGNKLFMP
Subjt:  AWRCKCGNKLFMP

XP_038905188.1 EPIDERMAL PATTERNING FACTOR-like protein 4 [Benincasa hispida]2.7e-4081.91Show/hide
Query:  PSNYYYHFSLLLVLF----AFAALLSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        PSNYYYHFSLLL+LF     F  L+SASP+SGGW+GD+K LVGPGS PPTCR KCG+C PCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
Subjt:  PSNYYYHFSLLLVLF----AFAALLSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

TrEMBL top hitse value%identityAlignment
A0A0A0L4P0 Epidermal patterning factor-like protein1.4e-3470.09Show/hide
Query:  MGS--LPPLHPSNYYYHFSLL-LVLFAF------AALLSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKC
        MGS  LP  HPS  ++HFSLL L LFAF      ++  S+S ++GGW+GD+K LVGPGSSPPTC  KCG+C PCEPVHVPIQPGLS+PLEYYPEAWRCKC
Subjt:  MGS--LPPLHPSNYYYHFSLL-LVLFAF------AALLSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKC

Query:  GNKLFMP
        GNKLFMP
Subjt:  GNKLFMP

A0A178V213 Epidermal patterning factor-like protein2.4e-2665.56Show/hide
Query:  LLVLFAFAALLSA----------SPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        LL LF+ ++++SA          S   GG++G  K   GPGSSPPTCR KCGKC PC+PVHVPIQPGLS+PLEYYPEAWRCKCGNKLFMP
Subjt:  LLVLFAFAALLSA----------SPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

A0A1S3AWH2 Epidermal patterning factor-like protein4.0e-3466.37Show/hide
Query:  MGS--LPPLHPSNYYYHFSLL-LVLFAF------------AALLSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPE
        MGS  LP  HPS  ++HFSLL L LFAF            ++  S+S ++GGW+GD+K LVGPGSSPPTC  KCG+C PCEPVHVPIQPGLS+PLEYYPE
Subjt:  MGS--LPPLHPSNYYYHFSLL-LVLFAF------------AALLSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPE

Query:  AWRCKCGNKLFMP
        AWRCKCGNKLFMP
Subjt:  AWRCKCGNKLFMP

A0A6D2KFV2 Epidermal patterning factor-like protein5.3e-2663.83Show/hide
Query:  LVLFAFAALLSASP--DSGGWVG-------------DKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        L  FAF  L+SA+    +GGW+G             DK+   GPGSSPPTCR KCGKC PC+PVHVPIQPG+S+PLEYYPEAWRCKCGNKLFMP
Subjt:  LVLFAFAALLSASP--DSGGWVG-------------DKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

V4P8Z8 Epidermal patterning factor-like protein5.3e-2664.13Show/hide
Query:  LVLFAFAALLSASPDS-GGWVGDK------------KILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        +  FA   L SAS  S GGW+G +            +   GPGSSPPTCR KCGKC PC+PVHVPIQPGLS+PLEYYPEAWRCKCGNKLFMP
Subjt:  LVLFAFAALLSASPDS-GGWVGDK------------KILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

SwissProt top hitse value%identityAlignment
Q1PEY6 EPIDERMAL PATTERNING FACTOR-like protein 64.9e-2170.18Show/hide
Query:  KKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        ++IL G GSSPP C  KCG+C PC+PVHVP+ PG  V  EYYPEAWRCKCGNKL+MP
Subjt:  KKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

Q2V3I3 EPIDERMAL PATTERNING FACTOR-like protein 43.2e-2864.44Show/hide
Query:  LLVLFAFAALLSA----------SPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        LL LF+ ++++SA          S   GG++   K   GPGSSPPTCR KCGKC PC+PVHVPIQPGLS+PLEYYPEAWRCKCGNKLFMP
Subjt:  LLVLFAFAALLSA----------SPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

Q7M1E7 Polygalacturonase4.8e-0844.9Show/hide
Query:  SSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        SSPP C+ KC  C PC+P  + + P  + P +YYP+ W C C NK++ P
Subjt:  SSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

Q9FY19 Polygalacturonase9.6e-0941.38Show/hide
Query:  DKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        D+K+     SSPP C+ KC  C PC+P  + + P  + P +YYP+ W C C NK++ P
Subjt:  DKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

Q9LUH9 EPIDERMAL PATTERNING FACTOR-like protein 54.3e-2577.42Show/hide
Query:  GWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        G + D+K L GPGS PP CR KCGKC PC+ VHVPIQPGL +PLEYYPEAWRCKCGNKLFMP
Subjt:  GWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

Arabidopsis top hitse value%identityAlignment
AT1G80133.1 unknown protein3.0e-0534.88Show/hide
Query:  LLVLFAFAALLSASPDSGGWVGDKKILVG-PGSSPPTCRFKCGKCWPCEPVHVPIQPGL-----SVPLEYYPEAWRCKCGNKLFMP
        L V   F +LLS    SG     +++     GS PP C  KC  C PC P    I+        S P  YYP  W C+C +++F P
Subjt:  LLVLFAFAALLSASPDSGGWVGDKKILVG-PGSSPPTCRFKCGKCWPCEPVHVPIQPGL-----SVPLEYYPEAWRCKCGNKLFMP

AT2G30370.1 allergen-related3.5e-2270.18Show/hide
Query:  KKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        ++IL G GSSPP C  KCG+C PC+PVHVP+ PG  V  EYYPEAWRCKCGNKL+MP
Subjt:  KKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

AT2G30370.2 allergen-related3.5e-2270.18Show/hide
Query:  KKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        ++IL G GSSPP C  KCG+C PC+PVHVP+ PG  V  EYYPEAWRCKCGNKL+MP
Subjt:  KKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

AT3G22820.1 allergen-related3.1e-2677.42Show/hide
Query:  GWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        G + D+K L GPGS PP CR KCGKC PC+ VHVPIQPGL +PLEYYPEAWRCKCGNKLFMP
Subjt:  GWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP

AT4G14723.1 BEST Arabidopsis thaliana protein match is: allergen-related (TAIR:AT3G22820.1)2.3e-2964.44Show/hide
Query:  LLVLFAFAALLSA----------SPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP
        LL LF+ ++++SA          S   GG++   K   GPGSSPPTCR KCGKC PC+PVHVPIQPGLS+PLEYYPEAWRCKCGNKLFMP
Subjt:  LLVLFAFAALLSA----------SPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCACTTCCCCCTCTCCATCCATCCAATTATTATTACCATTTCTCTCTTCTTCTCGTTCTCTTCGCTTTCGCCGCTTTACTCTCCGCCTCTCCAGACAGCGGAGG
GTGGGTTGGGGATAAGAAGATCTTGGTTGGACCAGGATCGTCGCCGCCGACGTGTCGATTTAAGTGTGGGAAATGCTGGCCCTGTGAACCGGTTCATGTTCCAATTCAAC
CGGGTTTGAGTGTGCCATTGGAGTATTACCCTGAAGCTTGGAGATGCAAGTGTGGGAATAAGTTGTTCATGCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGATCACTTCCCCCTCTCCATCCATCCAATTATTATTACCATTTCTCTCTTCTTCTCGTTCTCTTCGCTTTCGCCGCTTTACTCTCCGCCTCTCCAGACAGCGGAGG
GTGGGTTGGGGATAAGAAGATCTTGGTTGGACCAGGATCGTCGCCGCCGACGTGTCGATTTAAGTGTGGGAAATGCTGGCCCTGTGAACCGGTTCATGTTCCAATTCAAC
CGGGTTTGAGTGTGCCATTGGAGTATTACCCTGAAGCTTGGAGATGCAAGTGTGGGAATAAGTTGTTCATGCCTTGA
Protein sequenceShow/hide protein sequence
MGSLPPLHPSNYYYHFSLLLVLFAFAALLSASPDSGGWVGDKKILVGPGSSPPTCRFKCGKCWPCEPVHVPIQPGLSVPLEYYPEAWRCKCGNKLFMP