We evaluate the asymptotic performance of boundedly rational strategies in multi-armed bandit problems, where performance is measured by the tendency (in the limit) to play optimal actions either (i) in isolation or (ii) in networks of other learners. We show that, for many strategies...
Persistent link: https://www.econbiz.de/10010845512
Persistent link: https://www.econbiz.de/10010152274