Benchmarking optimization algorithms for auto-tuning GPU kernels

Schoonhoven, Richard; van Werkhoven, Ben; Batenburg, Joost

doi:10.1109/TEVC.2022.3210654

RA Schoonhoven (Richard), B.J.C. van Werkhoven (Ben) and K.J. Batenburg (Joost)

2022-09-29

Benchmarking optimization algorithms for auto-tuning GPU kernels

IEEE Transactions on Evolutionary Computation

Recent years have witnessed phenomenal growth in the application, and capabilities of Graphical Processing Units (GPUs) due to their high parallel computation power at relatively low cost. However, writing a computationally efficient GPU program (kernel) is challenging, and generally only certain specific kernel configurations lead to significant increases in performance. Auto-tuning is the process of automatically optimizing software for highly-efficient execution on a target hardware platform. Auto-tuning is particularly useful for GPU programming, as a single kernel requires re-tuning after code changes, for different input data, and for different architectures. However, the discrete, and non-convex nature of the search space creates a challenging optimization problem. In this work, we investigate which algorithm produces the fastest kernels if the time-budget for the tuning task is varied. We conduct a survey by performing experiments on 26 different kernel spaces, from 9 different GPUs, for 16 different evolutionary black-box optimization algorithms. We then analyze these results and introduce a novel metric based on the PageRank centrality concept as a tool for gaining insight into the difficulty of the optimization problem. We demonstrate that our metric correlates strongly with observed tuning performance.

Additional Metadata
Keywords	Kernel, Graphics processing units, Optimization, Codes, Search problems, Measurement, Benchmark testing, GPU computing, Auto-tuning, Performance optimization, Evolutionary computing, Fitness landscape analysis
Persistent URL	doi.org/10.1109/TEVC.2022.3210654
Journal	IEEE Transactions on Evolutionary Computation
Project	Real-Time 3D Tomography , the Center for Optimal, Real-Time Machine Studies of the Explosive Universe
Grant	This work was funded by the The Netherlands Organisation for Scientific Research (NWO); grant id nwo/639.073.506 - Real-Time 3D Tomography, This work was funded by the The Netherlands Organisation for Scientific Research (NWO); grant id 1160.18.316 - the Center for Optimal, Real-Time Machine Studies of the Explosive Universe (CORTEX)
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Schoonhoven, R., van Werkhoven, B., & Batenburg, J. (2022). Benchmarking optimization algorithms for auto-tuning GPU kernels. IEEE Transactions on Evolutionary Computation. doi:10.1109/TEVC.2022.3210654

View at Publisher

Full Text ( Author Manuscript , 8mb )

Benchmarking optimization algorithms for auto-tuning GPU kernels

Publication

Publication

Address

CWI researchers

Questions or comments?

Benchmarking optimization algorithms for auto-tuning GPU kernels

Publication

Publication

Workflow

Workflow

Add Content