CuGenDBv2

CmoCh02G008920 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh02G008920
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: classical arabinogalactan protein 1-like
Genome location: Cmo_Chr02:5503832..5504338
RNA-Seq Expression: CmoCh02G008920
Synteny: CmoCh02G008920
Gene Ontology terms:
GO:0016310 - phosphorylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0016301 - kinase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6605547.1 hypothetical protein SDJN03_02864, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.1e-77 | % identity: 98.81
Query:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
        MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPP SDLTP PSPARVPSPSPAPAPSLTADS
Subjt:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS

Query:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
        DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
Subjt:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL

XP_022958106.1 lysine-rich arabinogalactan protein 18-like [Cucurbita moschata] | e value: 2.2e-79 | % identity: 100
Query:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
        MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
Subjt:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS

Query:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
        DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
Subjt:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL

XP_022995398.1 lysine-rich arabinogalactan protein 18-like [Cucurbita maxima] | e value: 1.0e-76 | % identity: 97.62
Query:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
        MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPP SDLTP PSPARVPSPSPAPAPSLTADS
Subjt:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS

Query:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
        DFRNSIANGGGGETE SKGGMNGGKKAGIAVGVIAAACFVG+GGIVYKKRQDNIRRSQFGNAARSSFL
Subjt:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL

XP_023513399.1 classical arabinogalactan protein 2-like [Cucurbita pepo subsp. pepo] | e value: 1.3e-55 | % identity: 77.98
Query:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
        MAK +A+CC LL+     M+ AFSLEQ  +VPPSPSPESAA+ PP+ SPTPFPHAP SSP ESPL SPPAPP SDL P PSPAR PSPS  PAPS  ADS
Subjt:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS

Query:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
        DFRNSI+N GG E+E+SKGGMNGGKKAGIAVGVIAAACFVG+GGIVYKKRQDNIRRSQ+GNAARSSFL
Subjt:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL

XP_023534499.1 alpha carbonic anhydrase 8-like [Cucurbita pepo subsp. pepo] | e value: 4.0e-76 | % identity: 98.21
Query:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
        MAKSVAF CLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPP SDLTP PSPARVPSPSPAPAPSLTADS
Subjt:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS

Query:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
        DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
Subjt:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL

TrEMBL top hits (e value, % identity, alignment)
A0A5D3CV84 Pollen-specific leucine-rich repeat extensin-like protein 1 | e value: 2.1e-46 | % identity: 67.86
Query:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
        MAK++   CLL +F+++F++ +FSLE   ++PPS SP    +SPP+PSPTPFP+ PASSPAESPL SPPAPP SDL P PSPA V SP+P P+ SL ADS
Subjt:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS

Query:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
        D  N  + GGG ++E SKGGMNGGKKAGIAVGVIAA CFVG+GG VYKKRQDNIRRSQ+G+AARSSFL
Subjt:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL

A0A6J1FYT3 alpha carbonic anhydrase 8-like | e value: 8.4e-56 | % identity: 78.57
Query:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
        MAK +AFCC LL+     M+ AFSLEQ  EVPPSP PESAA  PP+ SPTPFPHAPASSP ESPL SPPAPP SDL P PSPAR PSPS  PAPS  AD 
Subjt:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS

Query:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
        DFRNSI+N GG E+E+SKGGMNGGKKAGIAVGVIAAACFVG+GGIVYKKRQDNIRRSQ+GNAARSSFL
Subjt:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL

A0A6J1H149 lysine-rich arabinogalactan protein 18-like | e value: 1.1e-79 | % identity: 100
Query:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
        MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
Subjt:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS

Query:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
        DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
Subjt:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL

A0A6J1JA87 proline-rich receptor-like protein kinase PERK12 | e value: 2.7e-54 | % identity: 76.79
Query:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
        MAK +AFCC LL+     M+ AFSLEQ  EVPPS  PESA   PP+ SP PFPH PASSP ESPL SPPAPP SDL P PSPARVPSPSP  AP+  ADS
Subjt:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS

Query:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
        DFRNSI+N GG E+E+SKGGMNGGKKAGIAVGVIAAACFVG+GGIVYKKRQDNIRRSQ+GNAARSSFL
Subjt:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL

A0A6J1K7T9 lysine-rich arabinogalactan protein 18-like | e value: 5.1e-77 | % identity: 97.62
Query:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS
        MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPP SDLTP PSPARVPSPSPAPAPSLTADS
Subjt:  MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADS

Query:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
        DFRNSIANGGGGETE SKGGMNGGKKAGIAVGVIAAACFVG+GGIVYKKRQDNIRRSQFGNAARSSFL
Subjt:  DFRNSIANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL

SwissProt top hits (e value, % identity, alignment)
Q9FPQ6 Vegetative cell wall protein gp1 | e value: 7.2e-04 | % identity: 47.95
Query:  PVEVPPSPSPESAADSP--PVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTA
        P  VPPSP+P   + +P  P PSP P P  P  SP+ SP  SP   P    +P PSP+ +PSPSP P+PS  A
Subjt:  PVEVPPSPSPESAADSP--PVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTA

Arabidopsis top hits (e value, % identity, alignment)
AT2G28440.1 proline-rich family protein | e value: 1.2e-09 | % identity: 39.58
Query:  PVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADSDFRNSIANGGGGETETSKG---GMNGG
        P     SP PES ADSP  P P P P +P SSP+       PAP   D    P P     PSPAP+P L    D + S  +  G E    +G   GM+G 
Subjt:  PVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADSDFRNSIANGGGGETETSKG---GMNGG

Query:  KKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
        +KAGIA+G I     + +G +VYKKR+DN+ R+++       FL
Subjt:  KKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL

AT3G45230.1 hydroxyproline-rich glycoprotein family protein | e value: 6.4e-16 | % identity: 47.97
Query:  PSPSPESAADSPPVPSPTPFPHAPASSPAESPLK-SPPAPPRSDLTPCPSPARVPSPSP-------------APAPSLTADSDFRNSIANGGGGE--TET
        P+PSP+  ADSP + +  P      +SPAESP++ S P  P ++ +P PSPA  PS SP             +P+PS  A SD  +S   G  GE     
Subjt:  PSPSPESAADSPPVPSPTPFPHAPASSPAESPLK-SPPAPPRSDLTPCPSPARVPSPSP-------------APAPSLTADSDFRNSIANGGGGE--TET

Query:  SKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAAR
        S GGM+GGKK G+A G IAA C VGV G VYKKRQ+NIRRS++G AAR
Subjt:  SKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAAR

AT5G60630.1 FUNCTIONS IN: molecular_function unknown | e value: 2.4e-07 | % identity: 45
Query:  PRSDLTPCPSPARVPSPSPAPAPSLTADSDFRNSIANGGGGETETSKGGMN---GGKKAGIA-VGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSF
        P+S+    PSP          +  LT  +D   + +   GGE E S GG N   GGKK GIA VG IAAA  VG GG V KKR++NIRRS++G A+   F
Subjt:  PRSDLTPCPSPARVPSPSPAPAPSLTADSDFRNSIANGGGGETETSKGGMN---GGKKAGIA-VGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSF


Sequences
CDS sequence
ATGGCGAAATCCGTTGCTTTCTGTTGCCTTCTTTTGATGTTCGTTGCGGTGTTCATGGACGCCGCTTTCTCTTTGGAACAACCGGTGGAAGTTCCGCCTTCTCCA
TCTCCTGAATCTGCTGCAGATTCTCCTCCAGTGCCCTCCCCTACTCCCTTCCCTCACGCTCCTGCTAGTTCTCCGGCGGAATCACCTTTAAAATCTCCTCCTGCG
CCACCGCGCTCAGATCTCACTCCCTGTCCATCTCCAGCGCGTGTTCCATCTCCGTCTCCGGCTCCAGCCCCTTCACTCACTGCCGACAGTGATTTTCGTAACAGT
ATTGCCAATGGCGGTGGAGGAGAGACGGAAACTTCCAAGGGCGGGATGAATGGCGGTAAGAAGGCTGGAATTGCAGTTGGAGTGATTGCTGCTGCGTGTTTTGTG
GGAGTAGGAGGAATCGTTTACAAGAAGCGCCAAGACAACATTCGCCGATCTCAGTTCGGGAATGCCGCTAGGTCATCATTCCTATGA
mRNA sequence
ATGGCGAAATCCGTTGCTTTCTGTTGCCTTCTTTTGATGTTCGTTGCGGTGTTCATGGACGCCGCTTTCTCTTTGGAACAACCGGTGGAAGTTCCGCCTTCTCCA
TCTCCTGAATCTGCTGCAGATTCTCCTCCAGTGCCCTCCCCTACTCCCTTCCCTCACGCTCCTGCTAGTTCTCCGGCGGAATCACCTTTAAAATCTCCTCCTGCG
CCACCGCGCTCAGATCTCACTCCCTGTCCATCTCCAGCGCGTGTTCCATCTCCGTCTCCGGCTCCAGCCCCTTCACTCACTGCCGACAGTGATTTTCGTAACAGT
ATTGCCAATGGCGGTGGAGGAGAGACGGAAACTTCCAAGGGCGGGATGAATGGCGGTAAGAAGGCTGGAATTGCAGTTGGAGTGATTGCTGCTGCGTGTTTTGTG
GGAGTAGGAGGAATCGTTTACAAGAAGCGCCAAGACAACATTCGCCGATCTCAGTTCGGGAATGCCGCTAGGTCATCATTCCTATGA
Protein sequence
MAKSVAFCCLLLMFVAVFMDAAFSLEQPVEVPPSPSPESAADSPPVPSPTPFPHAPASSPAESPLKSPPAPPRSDLTPCPSPARVPSPSPAPAPSLTADSDFRNS
IANGGGGETETSKGGMNGGKKAGIAVGVIAAACFVGVGGIVYKKRQDNIRRSQFGNAARSSFL
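
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration, and only the first and last lines of the CDS are used as input. It assumes the standard nuclear code applies and that each fragment starts in frame (the CDS lines before the last total 420 nt, a multiple of three).

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid lookup; "*" marks a stop codon.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon (hypothetical helper)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last lines of the CDS sequence from this record.
CDS_FIRST_LINE = (
    "ATGGCGAAATCCGTTGCTTTCTGTTGCCTTCTTTTGATGTTCGTTGCGGTGTTCATGGAC"
    "GCCGCTTTCTCTTTGGAACAACCGGTGGAAGTTCCGCCTTCTCCA"
)
CDS_LAST_LINE = (
    "GGAGTAGGAGGAATCGTTTACAAGAAGCGCCAAGACAACATTCGCCGATCTCAGTTCGGG"
    "AATGCCGCTAGGTCATCATTCCTATGA"
)

if __name__ == "__main__":
    # Compare these with the start and end of the protein sequence above.
    print(translate(CDS_FIRST_LINE))
    print(translate(CDS_LAST_LINE))
```

The translated fragments should match the beginning and end of the listed protein sequence, with the final `TGA` codon rendered as `*`. The full 507 nt CDS (168 codons plus the stop) spans the same length as the genome location 5503832..5504338.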