Showing 1 - 5 of 5
This paper extends the classic two-armed bandit problem to a many-agent setting in which I players each face the same experi- mentation problem.The main change from the single-agent prob- lem is that an agent can now learn from the current experimentation of other agents.Information is therefore...
Persistent link: https://www.econbiz.de/10011090884
Persistent link: https://www.econbiz.de/10011091242
Persistent link: https://www.econbiz.de/10011091597
Persistent link: https://www.econbiz.de/10011091806
Persistent link: https://www.econbiz.de/10011092284