Cavazos-Cadena, Rolando; Montes-De-Oca, Raúl - In: Computational Statistics 49 (1999) 3, pp. 441-456
This work concerns controlled Markov chains with denumerable state space and discrete time parameter. The reward function is assumed to be≤0 and the performance of a control policy is measured by the expected total-reward criterion. Within this context, sufficient conditions are given so that...