COllective INtelligence with sequences of actions
The design of a Multi-Agent System (MAS) to perform well on a collective task is non-trivial. Straightforward application of learning in a MAS can lead to sub optimal solutions as agents compete or interfere. The COllective INtelligence (COIN) framework of Wolpert et al. proposes an engineering solution for MASs where agents learn to focus on actions which support a common task. As a case study, we investigate the performance of COIN for representative token retrieval problems found to be difficult for agents using classic Reinforcement Learning (RL). We further investigate several techniques from RL (model-based learning, $Q(lambda))$ to scale application of the COIN framework. Lastly, the COIN framework is extended to improve performance for sequences of actions.
|Learning (acm I.2.6), Distributed Artificial Intelligence (acm I.2.11)|
|Distributed systems (msc 68M14), Learning and adaptive systems (msc 68T05)|
|Software (theme 1), Logistics (theme 3), Energy (theme 4)|
|Software Engineering [SEN]|
|Organisation||Intelligent and autonomous systems|
't Hoen, P.J, & Bohte, S.M. (2003). COllective INtelligence with sequences of actions. Software Engineering [SEN]. CWI.