NESTOUR, Élise PAYZAN LE - 2010
We study learning in a bandit problem where the outcome probabilities of six arms switch (jump) over time a restless bandit. In the experiment, optimal Bayesian learning tracks the jumps through learning of the probability of a jump or direct jump detection and, once a jump has occurred,...