Showing 1 - 10 of 87
We consider two players facing identical discrete-time bandit problems with a safe and a risky arm. In any period, the risky arm yields either a success or a failure, and the first success reveals the risky arm to dominate the safe one. When payoffs are public information, the ensuing free-rider...
Persistent link: https://www.econbiz.de/10009658114
Persistent link: https://www.econbiz.de/10011549152
Persistent link: https://www.econbiz.de/10002699193
Persistent link: https://www.econbiz.de/10011792230
Economists tend to assume that agents maximize their expected utility. However, many different experiments have questioned expected utility maximization by showing that human behavior can be characterized as random. This paper proposes Thompson Sampling as a theory of human behavior across very...
Persistent link: https://www.econbiz.de/10012099162
We analyze a two-player game of strategic experimentation with two-armed bandits. Each player has to decide in continuous time whether to use a safe arm with a known payoff or a risky arm whose likelihood of delivering payoffs is initially unknown. The quality of the risky arms is perfectly...
Persistent link: https://www.econbiz.de/10003951567
Persistent link: https://www.econbiz.de/10011392709
Persistent link: https://www.econbiz.de/10011336576
Persistent link: https://www.econbiz.de/10011312645
We study social learning in a large population of agents who only observe the actions taken by their neighbours. Agents have to choose one, out of two, reversible actions, each optimal in one, out of two, unknown states of the world. Each agent chooses rationally, on the basis of private...
Persistent link: https://www.econbiz.de/10009752451