Mayo-Wilson, Conor; Zollman, Kevin; Danks, David - In: International Journal of Game Theory 42 (2013) 3, pp. 695-723
We evaluate the asymptotic performance of boundedly-rational strategies in multi-armed bandit problems, where performance is measured in terms of the tendency (in the limit) to play optimal actions in either (i) isolation or (ii) networks of other learners. We show that, for many strategies...