Hordijk, A.; Kallenberg, L. C. M. - In: Management Science 25 (1979) 4, pp. 352-362
In this paper we show that for a finite Markov decision process an average optimal policy can be found by solving only one linear programming problem. Also the relation between the set of feasible solutions of the linear program and the set of stationary policies is analyzed.