; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G07100 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G07100
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionclassical arabinogalactan protein 9
Genome locationClcChr02:7198242..7201352
RNA-Seq ExpressionClc02G07100
SyntenyClc02G07100
Gene Ontology termsGO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR044981 - Lysine-rich arabinogalactan protein AGP9/17/18


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645846.1 hypothetical protein Csa_017309 [Cucumis sativus]2.8e-3967.29Show/hide
Query:  MMDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPASPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPP
        MMD +ALLCFT ISIAF +A AQSP+S PT TP+PPTT+TPP  ++PPP + PPPASPPPA+PPPA+PPPATPPPA+PPPA+PPPA+PPPA+PPPATPPP
Subjt:  MMDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPASPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPP

Query:  AT-PPPATPPP---ATPPPATPPSAPTTSPPAAVPAQGPSPVSSPPAPAMEAPGPAGP-ESPTPSQNDNLKDLNLQPHLKKVYSTQSSGVENVWRKETMV
        A+ PPPATPPP   A+PP A P  AP+     A  A  PSPVSSPPAP++EAPGPAGP +SPTPSQNDN                  SGVE VWRKE+MV
Subjt:  AT-PPPATPPP---ATPPPATPPSAPTTSPPAAVPAQGPSPVSSPPAPAMEAPGPAGP-ESPTPSQNDNLKDLNLQPHLKKVYSTQSSGVENVWRKETMV

Query:  GSILIGMGYVFMML
        GSI+IGMGYVF+ML
Subjt:  GSILIGMGYVFMML

KAG6571935.1 hypothetical protein SDJN03_28663, partial [Cucurbita argyrosperma subsp. sororia]2.2e-4768.24Show/hide
Query:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPASPPPAT---------------PPPATPPPATPPPATPPPATPPPA
        MD QAL C T ISIAFA+AGAQ P+S PT TPAPPTTTT P  ATPPPVSAPPPA+PPPAT               PPPATPPPA+PPPATPPPA+PPPA
Subjt:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPASPPPAT---------------PPPATPPPATPPPATPPPATPPPA

Query:  TPPPATPPPATPPPATPPPATPPPATPPPATPPSAPTTSPPAAVP-----------AQGPSPVSSPPAPAMEAPGPAGPESPTPSQNDNLKDLNLQPHLK
        TPPPA+PPPATPPPA+PPPATPPPATPPPATPP AP  SPPAAVP           A GP+PVSSPP P+ E PGPAGPESPTPSQNDN           
Subjt:  TPPPATPPPATPPPATPPPATPPPATPPPATPPSAPTTSPPAAVP-----------AQGPSPVSSPPAPAMEAPGPAGPESPTPSQNDNLKDLNLQPHLK

Query:  KVYSTQSSGVENVWRKETMVGSILIGMGYVFMM
               SGVE  WRKETMVGS+LIGMGYV MM
Subjt:  KVYSTQSSGVENVWRKETMVGSILIGMGYVFMM

XP_008455018.1 PREDICTED: classical arabinogalactan protein 9 [Cucumis melo]4.5e-4568.44Show/hide
Query:  MMDRQA-LLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPAS--PPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPAT
        MMD +A LLCFTFISIAFAIAGAQSP+S PT TP+PPTT+ PP  +TPPPVS+PPPA+  PP ATPPPA+PPPA+PPPA+PPPATPPPA+PPPA+PPPA+
Subjt:  MMDRQA-LLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPAS--PPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPAT

Query:  PPPATPPPATPPPATPPPATPPSAPTTSPPAAVP------------AQGPSPVSSPPAPAMEAPGPAGP-ESPTPSQNDNLKDLNLQPHLKKVYSTQSSG
        PPPA+PPPA+PPPA+PPPA PP AP  SPP AVP            A GP+PVSSPPAP++EAPGPAGP +SPTPSQNDN                  SG
Subjt:  PPPATPPPATPPPATPPPATPPSAPTTSPPAAVP------------AQGPSPVSSPPAPAMEAPGPAGP-ESPTPSQNDNLKDLNLQPHLKKVYSTQSSG

Query:  VENVWRKETMVGSILIGMGYVFMML
        VE VWRKE+MVGSI+IGMGYVF+ML
Subjt:  VENVWRKETMVGSILIGMGYVFMML

XP_022952421.1 classical arabinogalactan protein 9-like [Cucurbita moschata]6.5e-4467.84Show/hide
Query:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSA---------PPPASPPPATPPPATPPPATPPPATPPPATPPPATPPPAT
        MD QAL CFT I +AFA+AGAQ P+S PT    P TTT PP  ATPPPVSA         PPPASPPPATPPPA+PPPATPPPA+PPPA+PPPATPPPA+
Subjt:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSA---------PPPASPPPATPPPATPPPATPPPATPPPATPPPATPPPAT

Query:  PPPATPPPATPPPATPPPATPPPATPPSAPTTSPPAAVP-----------AQGPSPVSSPPAPAMEAPGPAGPESPTPSQNDNLKDLNLQPHLKKVYSTQ
        PPPATPPPA+PPPATPPPATPPPATPP AP  SPPAAVP           A GP+PVSSPP P+ E PGPAGPESPTPSQNDN                 
Subjt:  PPPATPPPATPPPATPPPATPPPATPPSAPTTSPPAAVP-----------AQGPSPVSSPPAPAMEAPGPAGPESPTPSQNDNLKDLNLQPHLKKVYSTQ

Query:  SSGVENVWRKETMVGSILIGMGYVFMM
         SGVE  WRKETMVGS+LIGMGYV MM
Subjt:  SSGVENVWRKETMVGSILIGMGYVFMM

XP_023553858.1 classical arabinogalactan protein 9-like [Cucurbita pepo subsp. pepo]3.7e-4770.4Show/hide
Query:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSA-----PPPASPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPA
        MD QAL CFT ISIAFA+AGAQ P+S PT T APPTTTT P  ATPPPVS+     PPPASPPPATPPPA+PPPATPPPA+PPPATPPPA+PPPATPPPA
Subjt:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSA-----PPPASPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPA

Query:  TPPPATPPPATPPPATPPPATPPSAPTTSPPAAVP-----------AQGPSPVSSPPAPAMEAPGPAGPESPTPSQNDNLKDLNLQPHLKKVYSTQSSGV
        +PPPATPPPA+PPPATPPPATPP  P  SPPAAVP           A GP+PVSSPP P+ E PGPAGPESPTPSQNDN                  SGV
Subjt:  TPPPATPPPATPPPATPPPATPPSAPTTSPPAAVP-----------AQGPSPVSSPPAPAMEAPGPAGPESPTPSQNDNLKDLNLQPHLKKVYSTQSSGV

Query:  ENVWRKETMVGSILIGMGYVFMM
        E  WRKETMVGS+LIGMGYV MM
Subjt:  ENVWRKETMVGSILIGMGYVFMM

TrEMBL top hitse value%identityAlignment
A0A0A0K6D8 Uncharacterized protein5.2e-3967.14Show/hide
Query:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPASPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPA
        MD +ALLCFT ISIAF +A AQSP+S PT TP+PPTT+TPP  ++PPP + PPPASPPPA+PPPA+PPPATPPPA+PPPA+PPPA+PPPA+PPPATPPPA
Subjt:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPASPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPA

Query:  T-PPPATPPP---ATPPPATPPSAPTTSPPAAVPAQGPSPVSSPPAPAMEAPGPAGP-ESPTPSQNDNLKDLNLQPHLKKVYSTQSSGVENVWRKETMVG
        + PPPATPPP   A+PP A P  AP+     A  A  PSPVSSPPAP++EAPGPAGP +SPTPSQNDN                  SGVE VWRKE+MVG
Subjt:  T-PPPATPPP---ATPPPATPPSAPTTSPPAAVPAQGPSPVSSPPAPAMEAPGPAGP-ESPTPSQNDNLKDLNLQPHLKKVYSTQSSGVENVWRKETMVG

Query:  SILIGMGYVFMML
        SI+IGMGYVF+ML
Subjt:  SILIGMGYVFMML

A0A1S3BZW9 classical arabinogalactan protein 92.2e-4568.44Show/hide
Query:  MMDRQA-LLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPAS--PPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPAT
        MMD +A LLCFTFISIAFAIAGAQSP+S PT TP+PPTT+ PP  +TPPPVS+PPPA+  PP ATPPPA+PPPA+PPPA+PPPATPPPA+PPPA+PPPA+
Subjt:  MMDRQA-LLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPAS--PPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPAT

Query:  PPPATPPPATPPPATPPPATPPSAPTTSPPAAVP------------AQGPSPVSSPPAPAMEAPGPAGP-ESPTPSQNDNLKDLNLQPHLKKVYSTQSSG
        PPPA+PPPA+PPPA+PPPA PP AP  SPP AVP            A GP+PVSSPPAP++EAPGPAGP +SPTPSQNDN                  SG
Subjt:  PPPATPPPATPPPATPPPATPPSAPTTSPPAAVP------------AQGPSPVSSPPAPAMEAPGPAGP-ESPTPSQNDNLKDLNLQPHLKKVYSTQSSG

Query:  VENVWRKETMVGSILIGMGYVFMML
        VE VWRKE+MVGSI+IGMGYVF+ML
Subjt:  VENVWRKETMVGSILIGMGYVFMML

A0A5A7SJX5 Classical arabinogalactan protein 92.2e-4568.44Show/hide
Query:  MMDRQA-LLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPAS--PPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPAT
        MMD +A LLCFTFISIAFAIAGAQSP+S PT TP+PPTT+ PP  +TPPPVS+PPPA+  PP ATPPPA+PPPA+PPPA+PPPATPPPA+PPPA+PPPA+
Subjt:  MMDRQA-LLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPAS--PPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPAT

Query:  PPPATPPPATPPPATPPPATPPSAPTTSPPAAVP------------AQGPSPVSSPPAPAMEAPGPAGP-ESPTPSQNDNLKDLNLQPHLKKVYSTQSSG
        PPPA+PPPA+PPPA+PPPA PP AP  SPP AVP            A GP+PVSSPPAP++EAPGPAGP +SPTPSQNDN                  SG
Subjt:  PPPATPPPATPPPATPPPATPPSAPTTSPPAAVP------------AQGPSPVSSPPAPAMEAPGPAGP-ESPTPSQNDNLKDLNLQPHLKKVYSTQSSG

Query:  VENVWRKETMVGSILIGMGYVFMML
        VE VWRKE+MVGSI+IGMGYVF+ML
Subjt:  VENVWRKETMVGSILIGMGYVFMML

A0A6J1GK65 classical arabinogalactan protein 9-like3.1e-4467.84Show/hide
Query:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSA---------PPPASPPPATPPPATPPPATPPPATPPPATPPPATPPPAT
        MD QAL CFT I +AFA+AGAQ P+S PT    P TTT PP  ATPPPVSA         PPPASPPPATPPPA+PPPATPPPA+PPPA+PPPATPPPA+
Subjt:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSA---------PPPASPPPATPPPATPPPATPPPATPPPATPPPATPPPAT

Query:  PPPATPPPATPPPATPPPATPPPATPPSAPTTSPPAAVP-----------AQGPSPVSSPPAPAMEAPGPAGPESPTPSQNDNLKDLNLQPHLKKVYSTQ
        PPPATPPPA+PPPATPPPATPPPATPP AP  SPPAAVP           A GP+PVSSPP P+ E PGPAGPESPTPSQNDN                 
Subjt:  PPPATPPPATPPPATPPPATPPPATPPSAPTTSPPAAVP-----------AQGPSPVSSPPAPAMEAPGPAGPESPTPSQNDNLKDLNLQPHLKKVYSTQ

Query:  SSGVENVWRKETMVGSILIGMGYVFMM
         SGVE  WRKETMVGS+LIGMGYV MM
Subjt:  SSGVENVWRKETMVGSILIGMGYVFMM

A0A6J1I8J9 classical arabinogalactan protein 9-like1.7e-2655.42Show/hide
Query:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPP----------------------TTTTPPSTATPPPVSAPPPASPPPATPPPATPPPATPPPATPP
        MD QAL CFT ISIAFA+AGAQ P+S PT TPAPP                                                   ATPPPA+PPPATPP
Subjt:  MDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPP----------------------TTTTPPSTATPPPVSAPPPASPPPATPPPATPPPATPPPATPP

Query:  PATPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPSAPTTSPPAAVPAQGPS-----------PVSSPPAPAMEAPGPAGPESPTPSQNDNLKDL
        PA+PPPATPPPA+PPPATPPPA+PPPATPPPATPPPATPP AP  SPPAAVPA  PS           PVSSPP P+ E PGPAGPESPTPSQNDN    
Subjt:  PATPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPSAPTTSPPAAVPAQGPS-----------PVSSPPAPAMEAPGPAGPESPTPSQNDNLKDL

Query:  NLQPHLKKVYSTQSSGVENVWRKETMVGSILIGMGYVFMM
                      SGVE  WRKET++GS+LIGMGYV MM
Subjt:  NLQPHLKKVYSTQSSGVENVWRKETMVGSILIGMGYVFMM

SwissProt top hitse value%identityAlignment
Q9C5S0 Classical arabinogalactan protein 98.3e-1053.85Show/hide
Query:  ALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPP--ASPPPAT--PPPATPPP--ATPPPATPPPATPPPATPPPATPPPATPP
        A++C   I+    + G Q+P S PT TPAPPT TTPP  ATPPPVSAPPP   SPPP T  PPPA PPP  ++PPPA+PPPATPPP   PP  PP A+PP
Subjt:  ALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPP--ASPPPAT--PPPATPPP--ATPPPATPPPATPPPATPPPATPPPATPP

Query:  PATPPP-ATPPP---ATPPPATPPSAPTTSPPAAVPAQGPSPVSSPPAPAMEAPGPAGPE-SPTPSQNDNLKDLNLQPHLKKVYSTQSSGVENVW
        PATPPP ATPPP   A+PP   P  APTT P     +  PSP SSPP P+ +APGP+    SP PS      D+N Q    K+ S+   G   VW
Subjt:  PATPPP-ATPPP---ATPPPATPPSAPTTSPPAAVPAQGPSPVSSPPAPAMEAPGPAGPE-SPTPSQNDNLKDLNLQPHLKKVYSTQSSGVENVW

Arabidopsis top hitse value%identityAlignment
AT2G14890.1 arabinogalactan protein 95.9e-1153.85Show/hide
Query:  ALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPP--ASPPPAT--PPPATPPP--ATPPPATPPPATPPPATPPPATPPPATPP
        A++C   I+    + G Q+P S PT TPAPPT TTPP  ATPPPVSAPPP   SPPP T  PPPA PPP  ++PPPA+PPPATPPP   PP  PP A+PP
Subjt:  ALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPP--ASPPPAT--PPPATPPP--ATPPPATPPPATPPPATPPPATPPPATPP

Query:  PATPPP-ATPPP---ATPPPATPPSAPTTSPPAAVPAQGPSPVSSPPAPAMEAPGPAGPE-SPTPSQNDNLKDLNLQPHLKKVYSTQSSGVENVW
        PATPPP ATPPP   A+PP   P  APTT P     +  PSP SSPP P+ +APGP+    SP PS      D+N Q    K+ S+   G   VW
Subjt:  PATPPP-ATPPP---ATPPPATPPSAPTTSPPAAVPAQGPSPVSSPPAPAMEAPGPAGPE-SPTPSQNDNLKDLNLQPHLKKVYSTQSSGVENVW

AT2G14890.2 arabinogalactan protein 92.9e-1057.99Show/hide
Query:  ALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPP--ASPPPAT--PPPATPPP--ATPPPATPPPATPPPATPPPATPPPATPP
        A++C   I+    + G Q+P S PT TPAPPT TTPP  ATPPPVSAPPP   SPPP T  PPPA PPP  ++PPPA+PPPATPPP   PP  PP A+PP
Subjt:  ALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPP--ASPPPAT--PPPATPPP--ATPPPATPPPATPPPATPPPATPPPATPP

Query:  PATPPP-ATPPP---ATPPPATPPSAPTTSPPAAVPAQGPSPVSSPPAPAMEAPGPAGPE-SPTPSQND
        PATPPP ATPPP   A+PP   P  APTT P     +  PSP SSPP P+ +APGP+    SP PS  D
Subjt:  PATPPP-ATPPP---ATPPPATPPSAPTTSPPAAVPAQGPSPVSSPPAPAMEAPGPAGPE-SPTPSQND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGATCGTCAAGCTTTGTTGTGTTTCACTTTCATCTCCATTGCCTTTGCCATCGCCGGAGCTCAGTCTCCCGCCAGTTCTCCCACCTACACCCCTGCTCCTCCCAC
CACCACCACCCCCCCTTCTACCGCCACTCCTCCCCCTGTTTCAGCTCCCCCACCGGCATCCCCACCTCCAGCAACTCCTCCTCCGGCAACTCCACCTCCGGCAACTCCCC
CTCCGGCTACTCCCCCTCCGGCTACTCCTCCTCCGGCAACTCCCCCTCCGGCTACTCCTCCTCCGGCAACTCCCCCTCCGGCCACTCCCCCACCGGCCACTCCTCCACCT
GCAACGCCACCACCCGCAACTCCACCGTCTGCACCAACGACATCCCCACCAGCAGCGGTTCCGGCTCAGGGTCCGTCCCCAGTTTCGAGCCCGCCGGCGCCGGCAATGGA
AGCTCCAGGACCTGCTGGCCCTGAATCTCCTACTCCGTCTCAGAACGACAATCTTAAAGATTTGAATCTACAGCCCCATTTAAAAAAGGTTTATTCCACACAAAGTAGTG
GAGTGGAGAACGTTTGGAGAAAGGAGACCATGGTGGGGAGCATATTGATTGGAATGGGATATGTATTTATGATGCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGGATCGTCAAGCTTTGTTGTGTTTCACTTTCATCTCCATTGCCTTTGCCATCGCCGGAGCTCAGTCTCCCGCCAGTTCTCCCACCTACACCCCTGCTCCTCCCAC
CACCACCACCCCCCCTTCTACCGCCACTCCTCCCCCTGTTTCAGCTCCCCCACCGGCATCCCCACCTCCAGCAACTCCTCCTCCGGCAACTCCACCTCCGGCAACTCCCC
CTCCGGCTACTCCCCCTCCGGCTACTCCTCCTCCGGCAACTCCCCCTCCGGCTACTCCTCCTCCGGCAACTCCCCCTCCGGCCACTCCCCCACCGGCCACTCCTCCACCT
GCAACGCCACCACCCGCAACTCCACCGTCTGCACCAACGACATCCCCACCAGCAGCGGTTCCGGCTCAGGGTCCGTCCCCAGTTTCGAGCCCGCCGGCGCCGGCAATGGA
AGCTCCAGGACCTGCTGGCCCTGAATCTCCTACTCCGTCTCAGAACGACAATCTTAAAGATTTGAATCTACAGCCCCATTTAAAAAAGGTTTATTCCACACAAAGTAGTG
GAGTGGAGAACGTTTGGAGAAAGGAGACCATGGTGGGGAGCATATTGATTGGAATGGGATATGTATTTATGATGCTTTAGGAAGAAGAAAGGGGTTGGACATAGTTGATA
TTATAATGTAATGGGTCATTTCTTCCTTCTGCAGTGCCCCCCTCTTTTTTGCTGTCTATGTGTTGCCTTTTCCATTATTCTTCATATTGGTTGCTCTATATTTTAGTTCA
TTTCGATTATTATTAGCCCATGTCTTGTTTGGGACTCTTGATTTATTATTATTAGCCTTTTGGGCTCTCTTTATATATAGATATATATGATTACCCATTTGCTTGGGATT
CTTTTGAGGTAAACAGCTTTTGTATCAACCCTATTTAGAGGGACTTGTTTTATGTTTTTAGTTTCATTATTATTGCTCACTT
Protein sequenceShow/hide protein sequence
MMDRQALLCFTFISIAFAIAGAQSPASSPTYTPAPPTTTTPPSTATPPPVSAPPPASPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPPATPPP
ATPPPATPPSAPTTSPPAAVPAQGPSPVSSPPAPAMEAPGPAGPESPTPSQNDNLKDLNLQPHLKKVYSTQSSGVENVWRKETMVGSILIGMGYVFMML