Showing 1 - 1 of 1
This paper addresses constrained Markov decision processes, with expected discounted total cost criteria, which are controlled by non-randomized policies. A dynamic programming approach is used to construct optimal policies. The convergence of the series of finite horizon value functions to the...
Persistent link: https://www.econbiz.de/10010847733