2011
Adaptive Hedge
Publication
Presented at the 25th Annual Conference on Neural Information Processing Systems, NIPS 2011 (December 2011), Granada, Spain
Most methods for decision-theoretic online learning are based on the Hedge algorithm, which takes a parameter called the learning rate. In most previous analyses the learning rate was carefully tuned to obtain optimal worst-case performance, leading to suboptimal performance on easy instances, for example when there exists an action that is significantly better than all others. We propose a new way of setting the learning rate, which adapts to the difficulty of the learning problem: in the worst case our procedure still guarantees optimal performance, but on easy instances it achieves much smaller regret. In particular, our adaptive method achieves constant regret in a probabilistic setting, when there exists an action that on average obtains strictly smaller loss than all other actions. We also provide a simulation study comparing our approach to existing methods.
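To illustrate the role of the learning rate discussed in the abstract, the following is a minimal sketch of the standard (non-adaptive) Hedge update, where each action's weight is proportional to the exponentiated negative cumulative loss. This is not the paper's adaptive tuning procedure, only the baseline algorithm it builds on; the function name and example losses are illustrative.

```python
import math

def hedge_weights(cumulative_losses, eta):
    """Exponential-weights distribution of the Hedge algorithm:
    weight_i is proportional to exp(-eta * L_i), where L_i is the
    cumulative loss of action i and eta > 0 is the learning rate."""
    exps = [math.exp(-eta * L) for L in cumulative_losses]
    total = sum(exps)
    return [e / total for e in exps]

# Illustrative losses: action 0 is clearly best so far. A larger
# learning rate eta concentrates the weights on it more aggressively,
# which helps on easy instances but can hurt in the worst case.
losses = [2.0, 5.0, 5.5]
print(hedge_weights(losses, eta=0.1))
print(hedge_weights(losses, eta=2.0))
```

The trade-off visible here is exactly what the paper addresses: rather than fixing `eta` for worst-case guarantees, their method adapts it to the observed difficulty of the instance.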
| Additional Metadata | |
|---|---|
| Publisher | The MIT Press |
| Series | Advances in Neural Information Processing Systems |
| Project | Learning when all models are wrong |
| Conference | 25th Annual Conference on Neural Information Processing Systems, NIPS 2011 |
| Organisation | Algorithms and Complexity |
| Citation | van Erven, T., Grünwald, P., Koolen-Wijkstra, W., & de Rooij, S. (2011). Adaptive Hedge. In Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011. The MIT Press. |