; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G06800 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G06800
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionEpidermal patterning factor-like protein
Genome locationClcChr06:7063349..7064299
RNA-Seq ExpressionClc06G06800
SyntenyClc06G06800
Gene Ontology termsGO:0010052 - guard cell differentiation (biological process)
GO:0005576 - extracellular region (cellular component)
InterPro domainsIPR039455 - EPIDERMAL PATTERNING FACTOR-like protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593874.1 EPIDERMAL PATTERNING FACTOR-like protein 4, partial [Cucurbita argyrosperma subsp. sororia]1.2e-4482.46Show/hide
Query:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYP
        MAVVLLHHHH   R     TLFL A VL LL SAIAARPTPI+E  +  G EERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPI PGLSLPLEYYP
Subjt:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYP

Query:  EAWRCKCGNSLYMP
        EAWRCKCGNSLYMP
Subjt:  EAWRCKCGNSLYMP

XP_008449315.1 PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 4 [Cucumis melo]5.8e-3976.92Show/hide
Query:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE-----IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLE
        MAV+LLHH    RR+H  LTLFL    L LLTS+IAARPTPI+E     + AGT ERV+TRRRL+GPGSSPPTCRSKCGSC PCTAVHVPIQPGLSLPLE
Subjt:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE-----IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLE

Query:  YYPEAWRCKCGNSLYMP
        YYPEAWRCKCGN+LYMP
Subjt:  YYPEAWRCKCGNSLYMP

XP_022930459.1 EPIDERMAL PATTERNING FACTOR-like protein 5 [Cucurbita moschata]6.6e-4379.66Show/hide
Query:  MAVVLLHHHHRRRRSHSSLTLFLLAAV----LFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPL
        MAVVLLHHHH   R     TLFL A V    L LL SAIAARPTPI+E  +  G EERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPI PGLSLPL
Subjt:  MAVVLLHHHHRRRRSHSSLTLFLLAAV----LFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPL

Query:  EYYPEAWRCKCGNSLYMP
        EYYPEAWRCKCGNSLYMP
Subjt:  EYYPEAWRCKCGNSLYMP

XP_023000566.1 EPIDERMAL PATTERNING FACTOR-like protein 5 [Cucurbita maxima]3.0e-4382.46Show/hide
Query:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYP
        MAVVLLHHHH   R     TLFL A VL LL SAIAARPTPI+E  +  G EERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPI PGLSLPLEYYP
Subjt:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYP

Query:  EAWRCKCGNSLYMP
        EAWRCKCGNSLYMP
Subjt:  EAWRCKCGNSLYMP

XP_023513820.1 EPIDERMAL PATTERNING FACTOR-like protein 5 [Cucurbita pepo subsp. pepo]1.2e-4482.46Show/hide
Query:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYP
        MAVVLLHHHH   R     TLFL A VL LL SAIAARPTPI+E  +  G EERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPI PGLSLPLEYYP
Subjt:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYP

Query:  EAWRCKCGNSLYMP
        EAWRCKCGNSLYMP
Subjt:  EAWRCKCGNSLYMP

TrEMBL top hitse value%identityAlignment
A0A0A0LKT0 Epidermal patterning factor-like protein4.9e-3670.34Show/hide
Query:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE------IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPL
        MA++LL HH RR    + L+L L    L LLTS+IAARPTPI++      +  GT+E  +TRRRLSGPGSSPPTCRSKCGSC PCTAVHVPIQPGLSLPL
Subjt:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE------IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPL

Query:  EYYPEAWRCKCGNSLYMP
        EYYPEAWRCKCGN+LYMP
Subjt:  EYYPEAWRCKCGNSLYMP

A0A1S3BMN7 Epidermal patterning factor-like protein2.8e-3976.92Show/hide
Query:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE-----IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLE
        MAV+LLHH    RR+H  LTLFL    L LLTS+IAARPTPI+E     + AGT ERV+TRRRL+GPGSSPPTCRSKCGSC PCTAVHVPIQPGLSLPLE
Subjt:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE-----IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLE

Query:  YYPEAWRCKCGNSLYMP
        YYPEAWRCKCGN+LYMP
Subjt:  YYPEAWRCKCGNSLYMP

A0A5A7UQS2 Epidermal patterning factor-like protein2.8e-3976.92Show/hide
Query:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE-----IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLE
        MAV+LLHH    RR+H  LTLFL    L LLTS+IAARPTPI+E     + AGT ERV+TRRRL+GPGSSPPTCRSKCGSC PCTAVHVPIQPGLSLPLE
Subjt:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE-----IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLE

Query:  YYPEAWRCKCGNSLYMP
        YYPEAWRCKCGN+LYMP
Subjt:  YYPEAWRCKCGNSLYMP

A0A6J1ERG1 Epidermal patterning factor-like protein3.2e-4379.66Show/hide
Query:  MAVVLLHHHHRRRRSHSSLTLFLLAAV----LFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPL
        MAVVLLHHHH   R     TLFL A V    L LL SAIAARPTPI+E  +  G EERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPI PGLSLPL
Subjt:  MAVVLLHHHHRRRRSHSSLTLFLLAAV----LFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPL

Query:  EYYPEAWRCKCGNSLYMP
        EYYPEAWRCKCGNSLYMP
Subjt:  EYYPEAWRCKCGNSLYMP

A0A6J1KG67 Epidermal patterning factor-like protein1.4e-4382.46Show/hide
Query:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYP
        MAVVLLHHHH   R     TLFL A VL LL SAIAARPTPI+E  +  G EERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPI PGLSLPLEYYP
Subjt:  MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEE--IRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYP

Query:  EAWRCKCGNSLYMP
        EAWRCKCGNSLYMP
Subjt:  EAWRCKCGNSLYMP

SwissProt top hitse value%identityAlignment
Q1PEY6 EPIDERMAL PATTERNING FACTOR-like protein 66.2e-2071.93Show/hide
Query:  RRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYMP
        RR L G GSSPP C SKCG CTPC  VHVP+ PG  +  EYYPEAWRCKCGN LYMP
Subjt:  RRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYMP

Q2V3I3 EPIDERMAL PATTERNING FACTOR-like protein 41.9e-2455.66Show/hide
Query:  RRRRSHSSLTLFLLAAV-LFLLTSAIAARPTPIEEIRAGTE---ERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCG
        RRRR      L   A + LF  +S ++A    I + R G++     + + +R  GPGSSPPTCRSKCG C PC  VHVPIQPGLS+PLEYYPEAWRCKCG
Subjt:  RRRRSHSSLTLFLLAAV-LFLLTSAIAARPTPIEEIRAGTE---ERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCG

Query:  NSLYMP
        N L+MP
Subjt:  NSLYMP

Q7M1E7 Polygalacturonase4.6e-0742.86Show/hide
Query:  SSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYMP
        SSPP C++KC  C PC    + + P  + P +YYP+ W C C N +Y P
Subjt:  SSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYMP

Q9FY19 Polygalacturonase6.0e-0742.86Show/hide
Query:  SSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYMP
        SSPP C++KC  C PC    + + P  + P +YYP+ W C C N +Y P
Subjt:  SSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYMP

Q9LUH9 EPIDERMAL PATTERNING FACTOR-like protein 51.7e-2555.45Show/hide
Query:  TLFLLAAVLFLLTSAIAARPTPI--------EEIRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYM
        TL + A +LF  +S+ A+   P         E  R+G   ++V ++RL GPGS PP CR KCG C PC AVHVPIQPGL +PLEYYPEAWRCKCGN L+M
Subjt:  TLFLLAAVLFLLTSAIAARPTPI--------EEIRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYM

Query:  P
        P
Subjt:  P

Arabidopsis top hitse value%identityAlignment
AT2G30370.1 allergen-related4.4e-2171.93Show/hide
Query:  RRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYMP
        RR L G GSSPP C SKCG CTPC  VHVP+ PG  +  EYYPEAWRCKCGN LYMP
Subjt:  RRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYMP

AT2G30370.2 allergen-related4.4e-2171.93Show/hide
Query:  RRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYMP
        RR L G GSSPP C SKCG CTPC  VHVP+ PG  +  EYYPEAWRCKCGN LYMP
Subjt:  RRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYMP

AT3G22820.1 allergen-related1.2e-2655.45Show/hide
Query:  TLFLLAAVLFLLTSAIAARPTPI--------EEIRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYM
        TL + A +LF  +S+ A+   P         E  R+G   ++V ++RL GPGS PP CR KCG C PC AVHVPIQPGL +PLEYYPEAWRCKCGN L+M
Subjt:  TLFLLAAVLFLLTSAIAARPTPI--------EEIRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLYM

Query:  P
        P
Subjt:  P

AT4G14723.1 BEST Arabidopsis thaliana protein match is: allergen-related (TAIR:AT3G22820.1)1.3e-2555.66Show/hide
Query:  RRRRSHSSLTLFLLAAV-LFLLTSAIAARPTPIEEIRAGTE---ERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCG
        RRRR      L   A + LF  +S ++A    I + R G++     + + +R  GPGSSPPTCRSKCG C PC  VHVPIQPGLS+PLEYYPEAWRCKCG
Subjt:  RRRRSHSSLTLFLLAAV-LFLLTSAIAARPTPIEEIRAGTE---ERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCG

Query:  NSLYMP
        N L+MP
Subjt:  NSLYMP

AT4G37810.1 unknown protein2.8e-0731.93Show/hide
Query:  LTLFLLAAVLFLLTSAIAARPTPIEEIRAGTEERVVTRRRLSGPGSSPPTC-RSKCGSCTPCTAVHVPIQPGLSL--PL---------------------
        L L +L +  F L +     P  +E  ++G ++  +  R L   GS PP C R +C SC  C A+ VP  P   L  PL                     
Subjt:  LTLFLLAAVLFLLTSAIAARPTPIEEIRAGTEERVVTRRRLSGPGSSPPTC-RSKCGSCTPCTAVHVPIQPGLSL--PL---------------------

Query:  -EYYPEAWRCKCGNSLYMP
          Y P +W+CKCGNS+Y P
Subjt:  -EYYPEAWRCKCGNSLYMP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTAGTCCTCCTCCACCACCACCACCGTCGCCGCCGCAGCCACAGCTCTCTAACCCTTTTTCTTCTCGCCGCGGTTCTCTTCCTCCTCACTTCCGCCATTGCCGC
CCGCCCCACCCCCATCGAGGAAATAAGAGCAGGAACAGAGGAGCGAGTTGTGACACGGCGAAGACTCAGCGGGCCAGGGTCGTCACCGCCGACTTGTAGATCGAAGTGCG
GCAGTTGCACGCCGTGTACGGCGGTGCACGTGCCAATTCAACCGGGACTGAGTTTGCCGTTGGAATACTACCCTGAAGCTTGGCGGTGCAAATGCGGCAACTCCCTCTAT
ATGCCATGA
mRNA sequenceShow/hide mRNA sequence
CCCCACTCTCTAAAAATCTCTCTTTGTTTATTTCTTGAGGCCCTTTGACCCAAAGACCACAAAAAGAAAGGAAGGAAGGAAGGAAGGAAGAGAAGAAAATTTAAGGAAAA
AAAAGAAAAGAAAAGAAAACAAATTTATATATGGATTGAGATATGACAGCTCTCTGTTTTTTCTCGTTTTCTGTTTCGATTTTATGACCTTACTCCACTTCACTTCCTCC
TCCATTTCTGGCCGGCGATGGCCGTAGTCCTCCTCCACCACCACCACCGTCGCCGCCGCAGCCACAGCTCTCTAACCCTTTTTCTTCTCGCCGCGGTTCTCTTCCTCCTC
ACTTCCGCCATTGCCGCCCGCCCCACCCCCATCGAGGAAATAAGAGCAGGAACAGAGGAGCGAGTTGTGACACGGCGAAGACTCAGCGGGCCAGGGTCGTCACCGCCGAC
TTGTAGATCGAAGTGCGGCAGTTGCACGCCGTGTACGGCGGTGCACGTGCCAATTCAACCGGGACTGAGTTTGCCGTTGGAATACTACCCTGAAGCTTGGCGGTGCAAAT
GCGGCAACTCCCTCTATATGCCATGAAATTTTAGCAAAGTTCTGTGACATTAAACACTCTTTTTTTTTTTCTTTTCTCATTTCTTCGTCCTCCTCCTCTTTGTCAAATAT
ATTCCGGTCGATTGTTTTTGCCGCTCGTAGTCACTTCAATGGAGATTTTTTTCCTTTTGTTTCCTCTTATTTTTTTCTTAAAATGTCTATATGATATAGTGTCATCGGAA
CTCGAAATCCCCCATTGTTTTTTGTTTTGCTGCGTTGGGAAAGAAAGTTAAGAACTTTGGAAATA
Protein sequenceShow/hide protein sequence
MAVVLLHHHHRRRRSHSSLTLFLLAAVLFLLTSAIAARPTPIEEIRAGTEERVVTRRRLSGPGSSPPTCRSKCGSCTPCTAVHVPIQPGLSLPLEYYPEAWRCKCGNSLY
MP