Chen, Richard; Feinberg, Eugene - In: Computational Statistics 66 (2007) 1, pp. 165-179
This paper addresses constrained Markov decision processes, with expected discounted total cost criteria, which are controlled by non-randomized policies. A dynamic programming approach is used to construct optimal policies. The convergence of the series of finite horizon value functions to the...