Chapman, Archie C.; Jennings, Nicholas R.; Leslie, David S. - Business School, University of Sydney - 2011
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that are initially unknown and which must be estimated over time from noisy observations. These games arise in many real-world applications, whenever rewards for actions cannot be prespecified and must...