Bolton, P.; Harris, C. - Tilburg University, Center for Economic Research - 1996
This paper extends the classic two-armed bandit problem to a many-agent setting in which I players each face the same experimentation problem. The main change from the single-agent problem is that an agent can now learn from the current experimentation of other agents. Information is therefore...