Optimizing main-memory join on modern hardware

Boncz, Peter; Manegold, Stefan; Kersten, Martin

P.A. Boncz (Peter), S. Manegold (Stefan) and M.L. Kersten (Martin)

2002-07-01

Optimizing main-memory join on modern hardware

IEEE Transactions on Knowledge and Data Engineering , Volume 14 - Issue 4 p. 709- 730

In the past decade, the exponential growth in commodity CPUs speed has far outpaced advances in memory latency. A second trend is that CPU performance advances are not only brought by increased clock rate, but also by increasing parallelism inside the CPU. Current database systems have not yet adapted to these trends, and show poor utilization of both CPU and memory resources on current hardware. In this article, we show how these resources can be optimized for large joins and translate these insights into guidelines for future database architectures, encompassing data structures, algorithms, cost modeling, and implementation. In particular, we discuss how vertically fragmented data structures optimize cache performance on sequential data access. On the algorithmic side, we refine the partitioned hash-join with a new partitioning algorithm called radix-cluster, which is specifically designed to optimize memory access. The performance of this algorithm is quantified using a detailed analytical model that incorporates memory access costs in terms of a limited number of parameters, such as cache sizes and miss penalties. We also present a calibration tool that extracts such parameters automatically from any computer hardware. The accuracy of our models is proven by exhaustive experiments conducted with the Monet database system on three different hardware platforms. Finally, we investigate the effect of implementation techniques that optimize CPU resource usage. Our experiments show that large joins can be accelerated almost an order of magnitude on modern RISC hardware when both memory and CPU resources are optimized.

Additional Metadata
THEME	Information (theme 2)
Publisher	I.E.E.E. Computer Society Press
Journal	IEEE Transactions on Knowledge and Data Engineering
Organisation	Database Architectures
Citation APA APA Style APA-ALL Style AAA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Boncz, P., Manegold, S.& Kersten, M. (2002). Optimizing main-memory join on modern hardware. IEEE Transactions on Knowledge and Data Engineering, 14(4), 709–730.

Free Full Text ( Author Manuscript , 3mb )

See Also
techReport Optimizing main-memory join on modern hardware S. Manegold (Stefan), P.A. Boncz (Peter) and M.L. Kersten (Martin)
techReport Optimizing main-memory join on modern hardware S. Manegold (Stefan), P.A. Boncz (Peter) and M.L. Kersten (Martin)

Optimizing main-memory join on modern hardware

Publication

Publication

techReport
Optimizing main-memory join on modern hardware

techReport
Optimizing main-memory join on modern hardware

Address

CWI researchers

Questions or comments?

Optimizing main-memory join on modern hardware

Publication

Publication

techReport Optimizing main-memory join on modern hardware

techReport Optimizing main-memory join on modern hardware

Workflow

Workflow

Add Content

techReport
Optimizing main-memory join on modern hardware

techReport
Optimizing main-memory join on modern hardware