Nowak, Andrzej S. - In: Mathematical Methods of Operations Research 49 (1999) 3, pp. 475-482
We extend a result by Cavazos-Cadena and Lasserre on the existence of strong 1-optimal stationary policies in Markov decision chains with countable state spaces, uniformly ergodic transition probabilities, and bounded costs to a larger class of models with unbounded costs and the so-called...