; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019159 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019159
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionclassical arabinogalactan protein 4
Genome locationChr04:17950788..17951360
RNA-Seq ExpressionHG10019159
SyntenyHG10019159
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649311.1 hypothetical protein Csa_014808 [Cucumis sativus]9.1e-6183.52Show/hide
Query:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP
        MASST+LN+LT+A LLIS AANSP PSPAP+ ESP+WKWTP+NEPPSSPPT EPP+ TTSPPSPTISPP TPPELSPVP+++APT PP  NPPTLSPTAP
Subjt:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP

Query:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAI
        PPAVKSPSHAPSPAKTKAPAPAP+K SP PAKAPKSSKAP SSPPSPNG+MEPP PSVAPA  KEGNG GANRFAI
Subjt:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAI

KAG7014869.1 hypothetical protein SDJN02_22499, partial [Cucurbita argyrosperma subsp. argyrosperma]8.2e-5474.21Show/hide
Query:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPVTTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPP
        MAS  VLNV+T ALLLISAAANSPLPSPAPSPESP WKWTP+ E PSSPP +EPP          SP  +PPELSPVPS+  PT PP  NPPTLSP A P
Subjt:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPVTTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPP

Query:  PAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA
        PAVKSP HAPSP+KTKAPAPAP+KMSP PA+APKSS  P SSPP+PNG MEPP PSVAP  PK  NGG ANRFAIGGSVA GLM AALIA
Subjt:  PAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA

XP_008462882.1 PREDICTED: classical arabinogalactan protein 4 [Cucumis melo]5.5e-6682.72Show/hide
Query:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP
        MASST+LN+LT+A LLISAAANSP PSPAPS ESP+WKWTP N+PPSSPPT EPP+ TTSPPSPTISPPATPPELSPVPS+++PT PP  NPPTLSPTAP
Subjt:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP

Query:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA
        PPAV SPSHAPSPAKTKAPAPAP+K+SP PAKAPKSSKAP SSPPSPNG+MEPP PS APA  KEGNG GANRFAIGGS+A G MIAA IA
Subjt:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA

XP_011654351.1 classical arabinogalactan protein 4 [Cucumis sativus]2.7e-6583.25Show/hide
Query:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP
        MASST+LN+LT+A LLIS AANSP PSPAP+ ESP+WKWTP+NEPPSSPPT EPP+ TTSPPSPTISPP TPPELSPVP+++APT PP  NPPTLSPTAP
Subjt:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP

Query:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA
        PPAVKSPSHAPSPAKTKAPAPAP+K SP PAKAPKSSKAP SSPPSPNG+MEPP PSVAPA  KEGNG GANRFAIGGS+A GLMIAA IA
Subjt:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA

XP_038877309.1 vegetative cell wall protein gp1-like [Benincasa hispida]4.3e-7187.89Show/hide
Query:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPVTTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPP
        MASSTVL VLTVALLLISAAANSP+PSPAP PESP+WKWTP+NEPPSSPP  EPP+TTSPPSPTISPPATPPE+SP+PS++APTLPPK NPPTLSPTA P
Subjt:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPVTTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPP

Query:  PAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA
        P VKSPSHAPSPAKTKAPAPAP+KMSP PAKAPKSSKAPASSPPSPNG MEPPAPSVAPA  K+ NGG ANRFAIGGSVAVGLMIAALIA
Subjt:  PAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA

TrEMBL top hitse value%identityAlignment
A0A0A0L0Q7 Uncharacterized protein1.3e-6583.25Show/hide
Query:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP
        MASST+LN+LT+A LLIS AANSP PSPAP+ ESP+WKWTP+NEPPSSPPT EPP+ TTSPPSPTISPP TPPELSPVP+++APT PP  NPPTLSPTAP
Subjt:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP

Query:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA
        PPAVKSPSHAPSPAKTKAPAPAP+K SP PAKAPKSSKAP SSPPSPNG+MEPP PSVAPA  KEGNG GANRFAIGGS+A GLMIAA IA
Subjt:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA

A0A1S3CHW9 classical arabinogalactan protein 42.7e-6682.72Show/hide
Query:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP
        MASST+LN+LT+A LLISAAANSP PSPAPS ESP+WKWTP N+PPSSPPT EPP+ TTSPPSPTISPPATPPELSPVPS+++PT PP  NPPTLSPTAP
Subjt:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP

Query:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA
        PPAV SPSHAPSPAKTKAPAPAP+K+SP PAKAPKSSKAP SSPPSPNG+MEPP PS APA  KEGNG GANRFAIGGS+A G MIAA IA
Subjt:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA

A0A5A7V3Q7 Classical arabinogalactan protein 42.7e-6682.72Show/hide
Query:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP
        MASST+LN+LT+A LLISAAANSP PSPAPS ESP+WKWTP N+PPSSPPT EPP+ TTSPPSPTISPPATPPELSPVPS+++PT PP  NPPTLSPTAP
Subjt:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPV-TTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAP

Query:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA
        PPAV SPSHAPSPAKTKAPAPAP+K+SP PAKAPKSSKAP SSPPSPNG+MEPP PS APA  KEGNG GANRFAIGGS+A G MIAA IA
Subjt:  PPAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA

A0A6J1E3R0 extensin-like1.3e-5272.63Show/hide
Query:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPVTTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPP
        MAS  VLNV+T ALLLISA+ANSPLPSPAPSPESP WKWTP+ + PSSPP +EPP          SP  +PPELSPVPS+  PT PP  NPPTLSP A P
Subjt:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPVTTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPP

Query:  PAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA
        PAVKSP HAPSP+KTKAPAPAP+KMS  PA+APKSS AP SSPP+PNG MEPP PSVAP  PK  NGG ANRFAIGG+VA GLM AALIA
Subjt:  PAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA

A0A6J1JCM0 extensin-like2.0e-5072.11Show/hide
Query:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPVTTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPP
        MAS  VLNV+T ALLLISA+ANSPLPS APSPES  WKWTP+ E PSSPP +EPP          SP  +PPELSPVPS+  PT P   NPPTLSP A P
Subjt:  MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPVTTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPP

Query:  PAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA
        PAVKSP HAPSP+KTKAPAPAP+KMSP PA+APKSS APASSPP+PNG MEPP PSVAP  PK  NGG ANRFAI  SVA GLM AALIA
Subjt:  PAVKSPSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA

SwissProt top hitse value%identityAlignment
Q9FPQ6 Vegetative cell wall protein gp15.1e-0644.3Show/hide
Query:  SPLPSPAPSPESPSWKWTPNNEPPSS----PPTVEPPVTTS--PPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPP-PAVKSPSHAPSPAKT
        SP   P+PSP SP+    P+  PPS     PP+  PPV  S  PPSPT   P+ P   SP P + AP +PP   PP+ +P  PP PA  SP   PSPA  
Subjt:  SPLPSPAPSPESPSWKWTPNNEPPSS----PPTVEPPVTTS--PPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPP-PAVKSPSHAPSPAKT

Query:  KAPAPAPTKMSP--PPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVP
          P+PAP   SP  PP+  P S   P+ +PPSP     PP PS  P  P
Subjt:  KAPAPAPTKMSP--PPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVP

Arabidopsis top hitse value%identityAlignment
AT1G44191.1 ECA1 gametogenesis related family protein4.4e-0537.11Show/hide
Query:  ISAAANSPLPSPAPSPESPSWK----WTPNNEPPS---SPPTVEPPVTTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPPPAVKS----
        +S +  SP   P+P P +PS +     +P   PPS   SPP   P  +  PP P+ SPP  P +  P P   +P   PK +PP   P++PPP+ K     
Subjt:  ISAAANSPLPSPAPSPESPSWK----WTPNNEPPS---SPPTVEPPVTTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPPPAVKS----

Query:  PSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKE
        P  +PSP K   P P P K  P P K      +P  SPP P     PP PS  P  PK+
Subjt:  PSHAPSPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTCCACCGTTCTGAATGTTCTCACGGTGGCGCTCCTGCTGATCTCCGCCGCTGCCAACTCCCCGCTACCGTCTCCAGCACCGAGCCCCGAATCACCGTCGTG
GAAATGGACTCCCAATAACGAGCCACCGTCCTCTCCACCGACAGTGGAACCGCCAGTGACGACGTCTCCACCTTCACCGACAATATCTCCGCCAGCCACCCCACCGGAGT
TGAGTCCCGTTCCATCAACCCATGCACCGACTCTGCCGCCGAAGACTAACCCACCCACATTGTCGCCGACCGCTCCACCGCCAGCAGTGAAGAGTCCCAGTCATGCGCCA
TCGCCGGCAAAAACGAAAGCACCAGCTCCGGCTCCGACCAAAATGAGTCCGCCGCCGGCCAAGGCACCGAAATCTTCTAAAGCTCCGGCGAGTAGTCCTCCGTCACCGAA
TGGAAGGATGGAACCACCGGCGCCATCAGTAGCTCCGGCAGTGCCGAAGGAGGGTAACGGCGGCGGTGCAAACAGATTCGCCATTGGTGGGTCTGTTGCAGTTGGATTAA
TGATTGCGGCTTTAATCGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCTCCACCGTTCTGAATGTTCTCACGGTGGCGCTCCTGCTGATCTCCGCCGCTGCCAACTCCCCGCTACCGTCTCCAGCACCGAGCCCCGAATCACCGTCGTG
GAAATGGACTCCCAATAACGAGCCACCGTCCTCTCCACCGACAGTGGAACCGCCAGTGACGACGTCTCCACCTTCACCGACAATATCTCCGCCAGCCACCCCACCGGAGT
TGAGTCCCGTTCCATCAACCCATGCACCGACTCTGCCGCCGAAGACTAACCCACCCACATTGTCGCCGACCGCTCCACCGCCAGCAGTGAAGAGTCCCAGTCATGCGCCA
TCGCCGGCAAAAACGAAAGCACCAGCTCCGGCTCCGACCAAAATGAGTCCGCCGCCGGCCAAGGCACCGAAATCTTCTAAAGCTCCGGCGAGTAGTCCTCCGTCACCGAA
TGGAAGGATGGAACCACCGGCGCCATCAGTAGCTCCGGCAGTGCCGAAGGAGGGTAACGGCGGCGGTGCAAACAGATTCGCCATTGGTGGGTCTGTTGCAGTTGGATTAA
TGATTGCGGCTTTAATCGCTTAA
Protein sequenceShow/hide protein sequence
MASSTVLNVLTVALLLISAAANSPLPSPAPSPESPSWKWTPNNEPPSSPPTVEPPVTTSPPSPTISPPATPPELSPVPSTHAPTLPPKTNPPTLSPTAPPPAVKSPSHAP
SPAKTKAPAPAPTKMSPPPAKAPKSSKAPASSPPSPNGRMEPPAPSVAPAVPKEGNGGGANRFAIGGSVAVGLMIAALIA