Showing 1 - 1 of 1
In this paper we study learning procedures when counterfactuals (payo s of not-chosen actions) are not observed. The decision maker reasons in two steps: First, she updates her propensities for each action after every payo experience, where propensity is de ned as how much she prefers each...
Persistent link: https://www.econbiz.de/10008485536