Cruz-Suárez, Daniel; Montes-de-Oca, Raúl; … - In: Mathematical Methods of Operations Research 60 (2004) 3, pp. 415-436
This paper presents three conditions. Each of them guarantees the uniqueness of optimal policies of discounted Markov decision processes. The conditions presented here impose hypotheses specifically on the state space X, the action space A, the admissible action sets A(x),x∈X, the transition...