Architecture-Conscious Hashing

M. Zukowski (Marcin), S. A. B. C. Héman (Sándor) and P. A. Boncz (Peter)

2006

Presented at the International Workshop on Data Management on New Hardware, Chicago, IL, USA

Hashing is one of the fundamental techniques used to implement query processing operators such as grouping, aggregation and join. This paper studies the interaction between modern computer architecture and hash-based query processing techniques. First, we focus on extracting maximum hashing performance from super-scalar CPUs. In particular, we discuss fast hash functions and ways to efficiently handle multi-column keys, and we propose using a recently introduced hashing scheme, Cuckoo Hashing, in place of the commonly used bucket-chained hashing. In the second part of the paper, we focus on CPU cache usage: we dynamically partition data streams so that the partial hash tables fit in the CPU cache. Conventional partitioning works as a separate preparatory phase, forcing materialization, which may require I/O if the stream does not fit in RAM. We introduce best-effort partitioning, a technique that interleaves partitioning with the execution of hash-based query processing operators and avoids I/O. In the process, we show how to prevent cacheline alignment issues in partitioning, which can strongly decrease throughput. We also demonstrate overall query processing performance when CPU-efficient hashing and best-effort partitioning are combined.
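
For readers unfamiliar with the Cuckoo Hashing scheme named in the abstract, the following is a minimal illustrative sketch of the general two-table technique, not the implementation evaluated in the paper. The class name `CuckooHashTable`, the parameters `capacity` and `max_kicks`, and the use of Python's built-in `hash()` to derive two hash functions are all assumptions made for this example.

```python
class CuckooHashTable:
    """Illustrative two-table cuckoo hash map (a sketch, not the paper's code).

    Every key has exactly one candidate slot in each of the two tables,
    so a lookup inspects at most two slots and never follows a chain.
    """

    def __init__(self, capacity=8, max_kicks=32):
        self.capacity = capacity      # slots per table (assumed parameter)
        self.max_kicks = max_kicks    # displacement limit before growing
        self.tables = [[None] * capacity, [None] * capacity]
        self.size = 0

    def _slot(self, which, key):
        # Assumption: two hash functions derived by salting Python's hash()
        # with the table index. A real engine would use cheaper functions.
        return hash((which, key)) % self.capacity

    def get(self, key, default=None):
        # At most two probes: one candidate slot per table.
        for which in (0, 1):
            entry = self.tables[which][self._slot(which, key)]
            if entry is not None and entry[0] == key:
                return entry[1]
        return default

    def put(self, key, value):
        # If the key is already present, overwrite its value in place.
        for which in (0, 1):
            i = self._slot(which, key)
            entry = self.tables[which][i]
            if entry is not None and entry[0] == key:
                self.tables[which][i] = (key, value)
                return
        # Otherwise insert, evicting ("kicking") occupants to their
        # alternative table until an empty slot is found.
        entry = (key, value)
        which = 0
        for _ in range(self.max_kicks):
            i = self._slot(which, entry[0])
            evicted = self.tables[which][i]
            self.tables[which][i] = entry
            if evicted is None:
                self.size += 1
                return
            entry = evicted
            which = 1 - which
        # Too many displacements (probably a cycle): grow and reinsert.
        self._grow(entry)

    def _grow(self, pending):
        # Collect all stored entries plus the one still waiting for a slot.
        entries = [e for table in self.tables for e in table if e is not None]
        entries.append(pending)
        self.capacity *= 2
        self.tables = [[None] * self.capacity, [None] * self.capacity]
        self.size = 0
        for k, v in entries:
            self.put(k, v)
```

A table built this way answers `get` with at most two slot inspections regardless of load, in contrast to bucket-chained hashing, where a lookup may walk a chain of arbitrary length. Insertion cost is less predictable, since an insert may displace several existing entries or trigger a rebuild.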

Additional Metadata
THEME	Information (theme 2)
Publisher	ACM
Project	Ambient Multimedia Databases
Conference	International Workshop on Data Management on New Hardware
Organisation	Database Architectures
Citation	Zukowski, M., Héman, S., & Boncz, P. (2006). Architecture-Conscious Hashing. Proceedings of International Workshop on Data Management on New Hardware 2006, 1–8.
