CuGenDBv2

Clc09G06000 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G06000
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Pollen Ole e 1 allergen and extensin family protein
Genome location: ClcChr09:4720122..4721484
RNA-Seq Expression: Clc09G06000
Synteny: Clc09G06000
Gene Ontology terms: GO:0071944 - cell periphery (cellular component)
InterPro domains: NA
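The genome location string above packs the chromosome and both coordinates into one field. A minimal sketch of how it could be split and the genomic span computed, assuming the coordinates are 1-based and inclusive at both ends (the record itself does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("ClcChr09:4720122..4721484")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 1363 bp on ClcChr09.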


Homology
GenBank top hits (e value, % identity, alignment)
KAG6575812.1 Proline-rich protein 3, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.6e-78, % identity: 88.17)
Query:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL
        MASL A+FLSLLVI+ SA  D+N GG  YDLMTPKL KE+RLLSTMIGI+GIILYK GSTI PL+GGLARITC+AVDEYGYEAASYTFLSDSSDANGYFL
Subjt:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL

Query:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
        ATLSPSEVEDKRELKECKAFLE+SPLENCQ+PSDLNNGVSGA LHSYKLLVHNKMKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS

XP_022953725.1 proline-rich protein 3-like [Cucurbita moschata] (e value: 2.1e-77, % identity: 88.17)
Query:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL
        MASL A+FLSLLVI+ SA  D+N GG  YDLMTPKL KE+RLLSTMIGI+GIILYK GSTI PL+GGLARITC+AVDEYGYEAASYTFLSDSSDANGYFL
Subjt:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL

Query:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
        ATLS SEVEDKRELKECKAFLE+SPLENCQ+PSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS

XP_022991245.1 proline-rich protein 3-like [Cucurbita maxima] (e value: 1.9e-78, % identity: 88.17)
Query:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL
        MASL A+FLSLLVI+ SA  D+N GG  YDLMTP L KE+RLLSTMIGI+GIILYKFGSTI+PL+GGLARITC+AVDEYGYEAASYTFLSDSSDANGYFL
Subjt:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL

Query:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
        ATLSPSEV+DKRELKECKAFLE+SPLENCQ+PSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS

XP_023548997.1 proline-rich protein 3-like [Cucurbita pepo subsp. pepo] (e value: 3.6e-77, % identity: 87.57)
Query:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL
        MASL A+FLSLLVI+ SA  D++ GG  YDLMTPKL  E+RLLSTMIGI+GIILYK GSTI PL+GGLARITC+AVDEYGYEAASYTFLSDSSDANGYFL
Subjt:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL

Query:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
        ATLSPSEVEDKRELKECKAFLEVSPLENCQ+PSDLNNGVSGALLHSY+LLVHNKMKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS

XP_038896334.1 proline-rich protein 3-like [Benincasa hispida] (e value: 6.4e-82, % identity: 93.49)
Query:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL
        MAS+HAIFLSL VII SA+ D+NGG SNY LMTPK+GKEERLLSTMIGIEGIILYKFGSTIAPL GGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL
Subjt:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL

Query:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
        ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHN MKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K5H5 Uncharacterized protein (e value: 5.8e-73, % identity: 84.97)
Query:  MASLHAIFLSLLV---IIGSASSDNNGGGSNYDLMTPKLGK-EERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDAN
        M SLHA+F SLLV   I+GSA+ D   GGS YD MTPKL K +ERLLSTMIGIEGIILYKFGS+I+PLQGGLARITC+ VDEYGYEAASYTFLS+SSD N
Subjt:  MASLHAIFLSLLV---IIGSASSDNNGGGSNYDLMTPKLGK-EERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDAN

Query:  GYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
        GYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYK LVHN MKLFSVGPFLFTCQ+
Subjt:  GYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS

A0A1S3BRS8 proline-rich protein 3-like (e value: 1.4e-71, % identity: 84.39)
Query:  MASLHAIFLSLLV---IIGSASSDNNGGGSNYDLMTPKLGK-EERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDAN
        M SLHA+ LSLLV   I+GSA+SD   G S YD MT KL K +ERLLSTMIGIEGIILYKFGS+I+PLQGGLARITC+ VDEYGYEAASYTFLS+SSD N
Subjt:  MASLHAIFLSLLV---IIGSASSDNNGGGSNYDLMTPKLGK-EERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDAN

Query:  GYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
        GYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYK LVHN MKLFSVGPFLFTCQ+
Subjt:  GYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS

A0A5A7VL80 Proline-rich protein 3-like (e value: 4.2e-71, % identity: 83.82)
Query:  MASLHAIFLSLLV---IIGSASSDNNGGGSNYDLMTPKLGK-EERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDAN
        M SLHA+ LSLLV   I+GSA+ D   G S YD MT KL K +ERLLSTMIGIEGIILYKFGS+I+PLQGGLARITC+ VDEYGYEAASYTFLS+SSD N
Subjt:  MASLHAIFLSLLV---IIGSASSDNNGGGSNYDLMTPKLGK-EERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDAN

Query:  GYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
        GYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYK LVHN MKLFSVGPFLFTCQ+
Subjt:  GYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS

A0A6J1GQG7 proline-rich protein 3-like (e value: 1.0e-77, % identity: 88.17)
Query:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL
        MASL A+FLSLLVI+ SA  D+N GG  YDLMTPKL KE+RLLSTMIGI+GIILYK GSTI PL+GGLARITC+AVDEYGYEAASYTFLSDSSDANGYFL
Subjt:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL

Query:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
        ATLS SEVEDKRELKECKAFLE+SPLENCQ+PSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS

A0A6J1JSC5 proline-rich protein 3-like (e value: 9.3e-79, % identity: 88.17)
Query:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL
        MASL A+FLSLLVI+ SA  D+N GG  YDLMTP L KE+RLLSTMIGI+GIILYKFGSTI+PL+GGLARITC+AVDEYGYEAASYTFLSDSSDANGYFL
Subjt:  MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL

Query:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
        ATLSPSEV+DKRELKECKAFLE+SPLENCQ+PSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS

SwissProt top hits (e value, % identity, alignment)
O81417 Protein SEED AND ROOT HAIR PROTECTIVE PROTEIN (e value: 1.4e-23, % identity: 43.44)
Query:  IGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHS
        I +EGII  K G    P+QG  ARI C  VD YG E    + LS  +DA GYF+AT+ PS++   R + +CK +L  SPL +C  P+D+N GV G  L +
Subjt:  IGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHS

Query:  YKLLVHNKMKLFSVGPFLFTCQ
        Y++L     KL+  GPF +T +
Subjt:  YKLLVHNKMKLFSVGPFLFTCQ

Q9FZ35 Proline-rich protein 1 (e value: 3.2e-12, % identity: 34.71)
Query:  IEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYK
        + GIIL K G    P+QG  A+I C     Y          SD +D  GYF   L+       + L  C+  L  SP+E C++P+++N G++G     Y 
Subjt:  IEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYK

Query:  LLVHNKMKLFSVGPFLFTCQS
              +KLF+VGPF FT  S
Subjt:  LLVHNKMKLFSVGPFLFTCQS

Q9LZJ7 Proline-rich protein 3 (e value: 1.9e-12, % identity: 32.5)
Query:  IEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGA--LLHS
        ++GIIL K G    P+ G   +I C     YG         S+ +D+ GYF  +L+       ++L  C+  L +SP+E C++P+++N G++G    L+ 
Subjt:  IEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGA--LLHS

Query:  YKLLVHNKMKLFSVGPFLFT
        Y+      ++LFSVGPF +T
Subjt:  YKLLVHNKMKLFSVGPFLFT

Arabidopsis top hits (e value, % identity, alignment)
AT1G54970.1 proline-rich protein 1 (e value: 2.3e-13, % identity: 34.71)
Query:  IEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYK
        + GIIL K G    P+QG  A+I C     Y          SD +D  GYF   L+       + L  C+  L  SP+E C++P+++N G++G     Y 
Subjt:  IEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYK

Query:  LLVHNKMKLFSVGPFLFTCQS
              +KLF+VGPF FT  S
Subjt:  LLVHNKMKLFSVGPFLFTCQS

AT2G47530.1 Pollen Ole e 1 allergen and extensin family protein (e value: 1.9e-15, % identity: 36.89)
Query:  IGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALL--
        I IEG IL K G    P+QGG  ++ C  VD YG   A  T  S  +D  GYF   ++         +  CK  LE SP+  C++P+++N GV+GA L  
Subjt:  IGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALL--

Query:  HSYKLLVHNKMKLFSVGPFLFT
         + K L H+ + L+++ PF F+
Subjt:  HSYKLLVHNKMKLFSVGPFLFT

AT2G47540.1 Pollen Ole e 1 allergen and extensin family protein (e value: 1.0e-37, % identity: 55.22)
Query:  EERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKR---ELKECKAFLEVSPLENCQSPSDL
        E  LLS+MIG++G+I  K GS + P+QG +AR+TCE  DEYGYEA   T LS ++DA GYFLATLS SEV+D +   ++KEC+AFLE+SP + C  P+++
Subjt:  EERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKR---ELKECKAFLEVSPLENCQSPSDL

Query:  NNGVSGALLHSYKLLVHN-KMKLFSVGPFLFTCQ
        N G+SGA+L +Y+LL +  KMKLF+VGPF+F+ +
Subjt:  NNGVSGALLHSYKLLVHN-KMKLFSVGPFLFTCQ

AT3G62680.1 proline-rich protein 3 (e value: 1.3e-13, % identity: 32.5)
Query:  IEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGA--LLHS
        ++GIIL K G    P+ G   +I C     YG         S+ +D+ GYF  +L+       ++L  C+  L +SP+E C++P+++N G++G    L+ 
Subjt:  IEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGA--LLHS

Query:  YKLLVHNKMKLFSVGPFLFT
        Y+      ++LFSVGPF +T
Subjt:  YKLLVHNKMKLFSVGPFLFT

AT4G02270.1 root hair specific 13 (e value: 9.9e-25, % identity: 43.44)
Query:  IGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHS
        I +EGII  K G    P+QG  ARI C  VD YG E    + LS  +DA GYF+AT+ PS++   R + +CK +L  SPL +C  P+D+N GV G  L +
Subjt:  IGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHS

Query:  YKLLVHNKMKLFSVGPFLFTCQ
        Y++L     KL+  GPF +T +
Subjt:  YKLLVHNKMKLFSVGPFLFTCQ
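
The % identity values reported for the hits above can be cross-checked against the alignment midlines. The sketch below assumes the usual BLAST-style convention, in which the line between Query and Subjt shows a letter for an identical residue, `+` for a conservative substitution, and a space otherwise, and in which identity is the number of identical columns divided by the alignment length. `percent_identity` is a hypothetical helper. The example uses the two segments of the first GenBank hit (KAG6575812.1); the alignment length is taken from the Query lines so that stripped trailing spaces in a midline do not affect the result.

```python
def percent_identity(query_segments: list[str], midline_segments: list[str]) -> float:
    """Percent identity = identical columns / alignment length * 100."""
    # Alignment length includes gap columns, which appear as '-' in the Query line.
    alignment_length = sum(len(segment) for segment in query_segments)
    # Assumption: letters in the midline mark identical residues only.
    identical = sum(ch.isalpha() for segment in midline_segments for ch in segment)
    return 100.0 * identical / alignment_length


query = [
    "MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFL",
    "ATLSPSEVEDKRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS",
]
midline = [
    "MASL A+FLSLLVI+ SA  D+N GG  YDLMTPKL KE+RLLSTMIGI+GIILYK GSTI PL+GGLARITC+AVDEYGYEAASYTFLSDSSDANGYFL",
    "ATLSPSEVEDKRELKECKAFLE+SPLENCQ+PSDLNNGVSGA LHSYKLLVHNKMKLFSVGPFLFTCQS",
]
print(round(percent_identity(query, midline), 2))
```

For this hit the midline has 149 identical columns over 169 aligned positions, which rounds to the 88.17 listed for KAG6575812.1.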


Sequences
CDS sequence
ATGGCTTCTCTACATGCAATTTTCTTGTCACTTTTGGTCATTATTGGTTCAGCTAGTAGTGATAACAATGGTGGTGGGTCTAATTATGATCTTATGACACCCAAATTGGG
TAAGGAAGAAAGGCTTCTCTCTACCATGATTGGTATTGAAGGAATTATTCTCTACAAATTTGGCTCAACAATTGCCCCTCTTCAAGGAGGTTTGGCAAGAATCACATGTG
AAGCAGTGGATGAGTATGGCTATGAGGCAGCTTCTTATACATTTTTAAGTGATTCAAGTGATGCAAATGGCTACTTCTTGGCAACATTATCTCCTTCAGAGGTAGAAGAC
AAGAGAGAGTTGAAGGAATGCAAAGCTTTTTTGGAGGTTTCACCATTAGAGAACTGTCAATCTCCTTCTGACCTCAACAATGGAGTCTCTGGTGCTCTTCTCCATTCTTA
CAAACTTTTGGTCCATAATAAGATGAAACTCTTCTCTGTTGGGCCTTTCCTTTTCACTTGCCAAAGTTAA
mRNA sequence
ATGGCTTCTCTACATGCAATTTTCTTGTCACTTTTGGTCATTATTGGTTCAGCTAGTAGTGATAACAATGGTGGTGGGTCTAATTATGATCTTATGACACCCAAATTGGG
TAAGGAAGAAAGGCTTCTCTCTACCATGATTGGTATTGAAGGAATTATTCTCTACAAATTTGGCTCAACAATTGCCCCTCTTCAAGGAGGTTTGGCAAGAATCACATGTG
AAGCAGTGGATGAGTATGGCTATGAGGCAGCTTCTTATACATTTTTAAGTGATTCAAGTGATGCAAATGGCTACTTCTTGGCAACATTATCTCCTTCAGAGGTAGAAGAC
AAGAGAGAGTTGAAGGAATGCAAAGCTTTTTTGGAGGTTTCACCATTAGAGAACTGTCAATCTCCTTCTGACCTCAACAATGGAGTCTCTGGTGCTCTTCTCCATTCTTA
CAAACTTTTGGTCCATAATAAGATGAAACTCTTCTCTGTTGGGCCTTTCCTTTTCACTTGCCAAAGTTAA
Protein sequence
MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVED
KRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS
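
The CDS and protein sequences listed above can be checked against each other by translating the CDS. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper and the sequences are copied from this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


CDS = (
    "ATGGCTTCTCTACATGCAATTTTCTTGTCACTTTTGGTCATTATTGGTTCAGCTAGTAGTGATAACAATGGTGGTGGGTCTAATTATGATCTTATGACACCCAAATTGGG"
    "TAAGGAAGAAAGGCTTCTCTCTACCATGATTGGTATTGAAGGAATTATTCTCTACAAATTTGGCTCAACAATTGCCCCTCTTCAAGGAGGTTTGGCAAGAATCACATGTG"
    "AAGCAGTGGATGAGTATGGCTATGAGGCAGCTTCTTATACATTTTTAAGTGATTCAAGTGATGCAAATGGCTACTTCTTGGCAACATTATCTCCTTCAGAGGTAGAAGAC"
    "AAGAGAGAGTTGAAGGAATGCAAAGCTTTTTTGGAGGTTTCACCATTAGAGAACTGTCAATCTCCTTCTGACCTCAACAATGGAGTCTCTGGTGCTCTTCTCCATTCTTA"
    "CAAACTTTTGGTCCATAATAAGATGAAACTCTTCTCTGTTGGGCCTTTCCTTTTCACTTGCCAAAGTTAA"
)
PROTEIN = (
    "MASLHAIFLSLLVIIGSASSDNNGGGSNYDLMTPKLGKEERLLSTMIGIEGIILYKFGSTIAPLQGGLARITCEAVDEYGYEAASYTFLSDSSDANGYFLATLSPSEVED"
    "KRELKECKAFLEVSPLENCQSPSDLNNGVSGALLHSYKLLVHNKMKLFSVGPFLFTCQS"
)

translated = translate(CDS)
# The CDS ends with a stop codon, which the listed protein sequence omits.
print(translated.rstrip("*") == PROTEIN)
```

The dictionary is built from the compact 64-letter amino acid string rather than typed codon by codon, which keeps the table short and less prone to transcription slips.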