Learning on a Budget Using Distributional RL

Serrano, Jonathan; Morales, Eduardo; Hernandez-Leal, Pablo; Bloembergen, Daniel; Kaisers, Michael

J. Serrano (Jonathan), E.F. Morales (Eduardo), P. Hernandez-Leal (Pablo), D. Bloembergen (Daniel) and M. Kaisers (Michael)

2018

Learning on a Budget Using Distributional RL

Presented at the Adaptive and Learning Agents (January 2018)

Agents acting in real-world scenarios often have constraints such as finite budgets or daily job performance targets. While repeated (episodic) tasks can be solved with existing RL algorithms, methods need to be extended if the repetition depends on performance. Recent work has introduced a distributional perspective on reinforcement learning, providing a model of episodic returns. Inspired by these results we contribute the new budget- and risk-aware distributional reinforcement learning (BRAD-RL) algorithm that bootstraps from the C51 distributional output and then uses value iteration to estimate the value of starting an episode with a certain amount of budget. With this strategy we can make budget-wise action selection within each episode and maximize the return across episodes. Experiments in a grid-world domain highlight the benefits of our algorithm, maximizing discounted future returns when low cumulative performance may terminate repetition.

Additional Metadata
Project	Demand response for grid-friendly quasi-autarkic energy cooperatives
Conference	Adaptive and Learning Agents
Grant	This work was funded by the The Netherlands Organisation for Scientific Research (NWO); grant id nwo/651.001.003 - Demand response for grid-friendly quasi-autarkic energy cooperatives
Organisation	Intelligent and autonomous systems
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Serrano, J., Morales, E., Hernandez-Leal, P., Bloembergen, D., & Kaisers, M. (2018). Learning on a Budget Using Distributional RL. In Proceedings of Adaptive and Learning Agents (ALA) Workshop, 2018.

Full Text ( Final Version , 664kb )

Learning on a Budget Using Distributional RL

Publication

Publication

Address

CWI researchers

Questions or comments?

Learning on a Budget Using Distributional RL

Publication

Publication

Workflow

Workflow

Add Content