Regret games paper

Degenne, Rémy; Shao, Han; Koolen-Wijkstra, Wouter

Welcome to tidnabbil! Here you can find algorithms for learning in structured bandit models in the fixed-confidence and regret settings. These methods are based on iterated saddle point solvers and come with guarantees that, in particular, imply asymptotic optimality. This repository is made available in the hope that the library is useful to others. The code for the experiments in our structured bandit papers is included to ensure reproducibility and to provide examples to get you started. Cheers!

Additional Metadata
Organisation	Machine Learning
Citation	Degenne, R., Shao, H., & Koolen-Wijkstra, W. (2020). Regret games paper.

Additional Files
View at Bitbucket

See Also
inProceedings Structure adaptive algorithms for stochastic bandits R.R.B.P. Degenne (Rémy), H. Shao (Han) and W.M. Koolen-Wijkstra (Wouter)
