Optimizing main-memory join on modern hardware

Manegold, Stefan; Boncz, Peter; Kersten, Martin

In the past decade, the exponential growth in commodity CPU speed has far outpaced advances in memory latency. A second trend is that CPU performance advances are brought about not only by increased clock rates, but also by increasing parallelism inside the CPU. Current database systems have not yet adapted to these trends, and show poor utilization of both CPU and memory resources on current hardware. In this article, we show how the use of these resources can be optimized for large joins and translate these insights into guidelines for future database architectures, encompassing data structures, algorithms, cost modeling, and implementation. In particular, we discuss how vertically fragmented data structures optimize cache performance on sequential data access. On the algorithmic side, we refine the partitioned hash-join with a new partitioning algorithm called radix-cluster, which is specifically designed to optimize memory access. The performance of this algorithm is quantified using a detailed analytical model that incorporates memory access costs in terms of a limited number of parameters, such as cache sizes and miss penalties. We also present a calibration tool that extracts such parameters automatically from any computer hardware. The accuracy of our models is confirmed by exhaustive experiments conducted with the Monet database system on three different hardware platforms. Finally, we investigate the effect of implementation techniques that optimize CPU resource usage. Our experiments show that large joins can be accelerated by almost an order of magnitude on modern RISC hardware when both memory and CPU resources are optimized.
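As a rough illustration of the partitioned hash-join with multi-pass radix clustering mentioned in the abstract, the following Python sketch may help. It is not the Monet implementation: the function names (`hash_key`, `radix_cluster`, `partitioned_hash_join`), the tuple layout `(key, value)`, and the parameters `total_bits` and `bits_per_pass` are assumptions made for this example. The idea shown is that both inputs are split into 2^total_bits clusters on the low bits of the key hash, a few bits per pass so that each pass writes to only a small number of output regions, and matching clusters are then joined with a small hash table.

```python
from collections import defaultdict


def hash_key(k):
    # Assumption: Python's built-in hash stands in for the join hash function.
    return hash(k)


def radix_cluster(tuples, key, total_bits, bits_per_pass):
    """Split tuples into 2**total_bits clusters on the low bits of the key hash.

    Each pass looks at no more than bits_per_pass bits, so a single pass
    fans out into at most 2**bits_per_pass buckets per input cluster.
    """
    clusters = [list(tuples)]
    done = 0
    while done < total_bits:
        bits = min(bits_per_pass, total_bits - done)
        shift = total_bits - done - bits  # highest unprocessed radix bits first
        mask = (1 << bits) - 1
        next_clusters = []
        for cluster in clusters:
            buckets = [[] for _ in range(1 << bits)]
            for t in cluster:
                buckets[(hash_key(key(t)) >> shift) & mask].append(t)
            next_clusters.extend(buckets)
        clusters = next_clusters
        done += bits
    return clusters


def partitioned_hash_join(left, right, total_bits=4, bits_per_pass=2):
    """Equi-join two lists of (key, value) tuples, cluster by cluster."""
    first = lambda t: t[0]
    left_clusters = radix_cluster(left, first, total_bits, bits_per_pass)
    right_clusters = radix_cluster(right, first, total_bits, bits_per_pass)
    result = []
    for lpart, rpart in zip(left_clusters, right_clusters):
        # Build a small hash table on one cluster, probe with the other.
        table = defaultdict(list)
        for k, v in lpart:
            table[k].append(v)
        for k, w in rpart:
            for v in table.get(k, ()):
                result.append((k, v, w))
    return result
```

In this sketch, `bits_per_pass` is the knob that trades the number of passes against the fan-out of each pass. Python lists do not model cache or TLB behaviour, so the code only illustrates the control flow, not the performance effect.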

Additional Metadata
Theme	Information (theme 2)
Publisher	CWI
Series	Information Systems [INS]
Organisation	Database Architectures
Citation	Manegold, S., Boncz, P., & Kersten, M. (1999). Optimizing main-memory join on modern hardware. Information Systems [INS]. CWI.


See Also
article Optimizing main-memory join on modern hardware P.A. Boncz (Peter), S. Manegold (Stefan) and M.L. Kersten (Martin)