Showing 1 - 10 of 61
Economists tend to assume that agents maximize their expected utility. However, many different experiments have questioned expected utility maximization by showing that human behavior can be characterized as random. This paper proposes Thompson Sampling as a theory of human behavior across very...
Persistent link: https://www.econbiz.de/10012099162
We analyze a two-player game of strategic experimentation with two-armed bandits. Each player has to decide in continuous time whether to use a safe arm with a known payoff or a risky arm whose likelihood of delivering payoffs is initially unknown. The quality of the risky arms is perfectly...
Persistent link: https://www.econbiz.de/10003951567
Persistent link: https://www.econbiz.de/10011392709
We study social learning in a large population of agents who only observe the actions taken by their neighbours. Agents have to choose one, out of two, reversible actions, each optimal in one, out of two, unknown states of the world. Each agent chooses rationally, on the basis of private...
Persistent link: https://www.econbiz.de/10009752451
Persistent link: https://www.econbiz.de/10010231789
We analyze a two-player game of strategic experimentation with two-armed bandits. Each player has to decide in continuous time whether to use a safe arm with a known payoff or a risky arm whose likelihood of delivering payoffs is initially unknown. The quality of the risky arms is perfectly...
Persistent link: https://www.econbiz.de/10010364305
Persistent link: https://www.econbiz.de/10011483559
Persistent link: https://www.econbiz.de/10011485724
We introduce uncertainty into Holmstrom and Milgrom (1987) to study optimal long-term contracting with learning. In a dynamic relationship, the agent's shirking not only reduces current performance but also increases the agent’s information rent due to the persistent belief manipulation...
Persistent link: https://www.econbiz.de/10011557712
Persistent link: https://www.econbiz.de/10010486678