Glazebrook, K. D.; Gittins, J. C. - In: Stochastic Processes and their Applications 46 (1993) 2, pp. 301-326
Following major theoretical advances in the study of multi-armed bandit problems, Gittins proposed a forwards induction (FI) approach to the development of policies for Markov decision processes (MDP's). Considerable computational savings are often possible over conventional dynamic programming....