Correlated Gaussian Multi-Objective Multi-Armed Bandit across Arms Algorithm

Yahyaa, Saba Q.; Drugan, Madalina M.

Published in

2015 IEEE Symposium Series on Computational Intelligence

DOI: 10.1109/ssci.2015.93

Tools

Export citation

Search in Google Scholar

Correlated Gaussian Multi-Objective Multi-Armed Bandit across Arms Algorithm

Proceedings article published in 2015 by Saba Q. Yahyaa, Madalina M. Drugan

This paper is available in a repository.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Stochastic multi-objective multi-armed bandit problem, (M OM AB), is a stochastic multi-armed problem where each arm generates a vector of rewards instead of a single scalar reward. The goal of (MOMAB) is to minimize the regret of playing suboptimal arms while playing fairly the Pareto optimal arms. In this paper, we consider Gaussian correlation across arms in (MOMAB), meaning that the generated reward vector of an arm gives us information not only about that arm itself but also on all the available arms. We call this framework the correlated-M OM AB problem. We extended Gittins index policy to correlated (M OM AB) because Gittins index has been used before to model the correlation between arms. We empirically compared Gittins index policy with multi-objective upper confidence bound policy on a test suite of correlated-M OM AB problems. We conclude that the performance of these policies depend on the number of arms and objectives.

Published in

Links

Tools

Correlated Gaussian Multi-Objective Multi-Armed Bandit across Arms Algorithm

Abstract