Abbad, Mohammed; Daoui, Cherki - In: Mathematical Methods of Operations Research 53 (2001) 3, pp. 451-463
We consider discrete time Markov Decision Process (MDP) with finite state and action spaces under average reward optimality criterion. The decomposition theory, in Ross and Varadarajan [11], leads to a natural partition of the state space into strongly communicating classes and a set of states...