Heuristic Selection of Actions in Multiagent Reinforcement Learning.

Bianchi, Reinaldo A. C.; Ribeiro, Carlos H. C.; Costa, Anna Helena Reali

Tools

Export citation

Search in Google Scholar

Heuristic Selection of Actions in Multiagent Reinforcement Learning.

Proceedings article published in 2007 by Reinaldo A. C. Bianchi

, Carlos H. C. Ribeiro, Anna Helena Reali Costa

This paper is available in a repository.

Full text: Download

Preprint: policy unknown

Upload

Postprint: policy unknown

Upload

Published version: policy unknown

Upload

Abstract

This work presents a new algorithm, called Heuris- tically Accelerated Minimax-Q (HAMMQ), that al- lows the use of heuristics to speed up the well- known Multiagent Reinforcement Learning algo- rithm Minimax-Q. A heuristic function H that in- fluences the choice of the actions characterises the HAMMQ algorithm. This function is associated with a preference policy that indicates that a cer- tain action must be taken instead of another. A set of empirical evaluations were conducted for the proposed algorithm in a simplified simulator for the robot soccer domain, and experimental results show that even very simple heuristics enhances sig- nificantly the performance of the multiagent rein- forcement learning algorithm.

Links

Tools

Heuristic Selection of Actions in Multiagent Reinforcement Learning.

Abstract