To be greedy, or not to be — That is the question for Population Based Training variants

Chebykin, Aleksandr; Alderliesten, Tanja; Bosman, Peter

A. Chebykin (Aleksandr), T. Alderliesten (Tanja) and P.A.N. Bosman (Peter)

2025-06-02

Transactions on Machine Learning Research, Volume 2025

Achieving excellent results with neural networks requires careful hyperparameter tuning, which can be automated via hyperparameter optimization algorithms such as Population Based Training (PBT). PBT stands out for its capability to efficiently optimize hyperparameter schedules in parallel and within the wall-clock time of training a single network. Several PBT variants have been proposed that improve performance in the experimental settings considered in the associated publications. However, the experimental settings and tasks vary across publications, and the best previous PBT variant is not always included in the comparisons, which leaves the relative performance of PBT variants unclear. In this work, we empirically evaluate five single-objective PBT variants on a set of image classification and reinforcement learning tasks with different setups (such as increasingly large search spaces). We find that the Bayesian Optimization (BO) variants of PBT tend to behave more greedily than the non-BO ones, which is beneficial when aggressively pursuing short-term gains improves long-term performance and harmful otherwise. This is a previously overlooked caveat to the reported improvements of the BO PBT variants. Examining their theoretical properties, we find that the returns of BO PBT variants are guaranteed to asymptotically approach the returns of the greedy hyperparameter schedule (rather than the optimal one, as claimed in prior work). Together with our empirical results, this leads us to conclude that there is currently no single best PBT variant capable of outperforming others both when pursuing short-term gains is helpful in the long term, and when it is harmful.
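The distinction between a greedy and an optimal hyperparameter schedule can be illustrated with a small toy problem. The sketch below is not taken from the paper: the two actions (`high_lr`, `low_lr`), the gain values, and the `penalty` parameter are hypothetical choices made only for illustration. Choosing `low_lr` yields a larger immediate gain but multiplies all later gains by `penalty`. The greedy schedule maximizes the score after each single step, while the optimal schedule is found by brute force over all schedules.

```python
from itertools import product

# Hypothetical toy actions; they stand in for hyperparameter choices.
ACTIONS = ("high_lr", "low_lr")


def step(score, scale, action, penalty):
    """Apply one training interval and return (new_score, new_scale).

    Assumption of this toy: "low_lr" gains 2 * scale now but multiplies
    the scale of all later gains by `penalty`; "high_lr" gains 1 * scale
    and leaves the scale unchanged.
    """
    if action == "low_lr":
        return score + 2.0 * scale, scale * penalty
    return score + 1.0 * scale, scale


def final_score(schedule, penalty):
    """Score reached after following the whole schedule."""
    score, scale = 0.0, 1.0
    for action in schedule:
        score, scale = step(score, scale, action, penalty)
    return score


def greedy_schedule(n_steps, penalty):
    """Pick, at every step, the action with the best immediate score."""
    score, scale = 0.0, 1.0
    chosen = []
    for _ in range(n_steps):
        best = max(ACTIONS, key=lambda a: step(score, scale, a, penalty)[0])
        score, scale = step(score, scale, best, penalty)
        chosen.append(best)
    return tuple(chosen)


def optimal_schedule(n_steps, penalty):
    """Brute-force search over all schedules for the best final score."""
    return max(product(ACTIONS, repeat=n_steps),
               key=lambda s: final_score(s, penalty))


if __name__ == "__main__":
    for penalty in (1.0, 0.25):
        g = greedy_schedule(3, penalty)
        o = optimal_schedule(3, penalty)
        print(penalty, g, final_score(g, penalty), o, final_score(o, penalty))
```

With `penalty = 1.0` (short-term gains carry no later cost) the greedy and optimal schedules coincide. With `penalty = 0.25` the greedy schedule ends at a lower final score than the optimal one, which is the kind of setting in which greedier behaviour would be expected to hurt.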

Additional Metadata
Journal	Transactions on Machine Learning Research
Organisation	Centrum Wiskunde & Informatica, Amsterdam (CWI), The Netherlands
Citation	Chebykin, A., Alderliesten, T., & Bosman, P. (2025). To be greedy, or not to be — That is the question for Population Based Training variants. Transactions on Machine Learning Research, 2025.
