Similar Search Results

Conditions for the uniqueness of optimal policies of discounted Markov decision processes

Cruz-Suárez, Daniel; Montes-de-Oca, Raúl; … - In: Mathematical Methods of Operations Research 60 (2004) 3, pp. 415-436

This paper presents three conditions. Each of them guarantees the uniqueness of optimal policies of discounted Markov decision processes. The conditions presented here impose hypotheses specifically on the state space X, the action space A, the admissible action sets A(x),x∈X, the transition...

Persistent link: https://www.econbiz.de/10010999728

Conditions for the uniqueness of optimal policies of discounted Markov decision processes

Cruz-Suárez, Daniel; Montes-de-Oca, Raúl; … - In: Mathematical Methods of Operations Research 60 (2004) 3, pp. 415-436

Persistent link: https://www.econbiz.de/10007784036

Nearly optimal stationary policies in negative dynamic programming

Cavazos-Cadena, Rolando; Montes-De-Oca, Raúl - In: Mathematical Methods of Operations Research 49 (1999) 3, pp. 441-456

This work concerns controlled Markov chains with denumerable state space and discrete time parameter. The reward function is assumed to be ≤ 0 and the performance of a control policy is measured by the expected total-reward criterion. Within this context, sufficient conditions are given so that...

Persistent link: https://www.econbiz.de/10010847483

Application of average dynamic programming to inventory systems

Vega-Amaya, Oscar; Montes-de-Oca, Raúl - In: Mathematical Methods of Operations Research 47 (1998) 3, pp. 451-471

We show the existence of an average cost (AC-) optimal policy for an inventory system with uncountable state space; in fact, the AC-optimal cost and an AC-optimal stationary policy are explicitly computed. In order to do this, we use a variant of the vanishing discount factor approach, which has been...

Persistent link: https://www.econbiz.de/10010847854

Discounted cost optimality problem: stability with respect to weak metrics

Gordienko, Evgueni; Lemus-Rodríguez, Enrique; … - In: Mathematical Methods of Operations Research 68 (2008) 1, pp. 77-96

We find inequalities to estimate the stability (robustness) of a discounted cost optimization problem for discrete-time Markov control processes on a Borel state space. The one-stage cost is allowed to be unbounded. Unlike the known results in this area, we consider a perturbation of transition...

Persistent link: https://www.econbiz.de/10010847899

Approximation of average cost optimal policies for general Markov decision processes with unbounded costs

Gordienko, Evgueni; Montes-De-Oca, Raúl; … - In: Mathematical Methods of Operations Research 45 (1997) 2, pp. 245-263

The aim of the paper is to show that Lyapunov-like ergodicity conditions on Markov decision processes with Borel state space and possibly unbounded cost provide the approximation of an average cost optimal policy by solving n-stage optimization problems (n = 1, 2, ...). The approach used ensures...

Persistent link: https://www.econbiz.de/10010847951

Nearly optimal stationary policies in negative dynamic programming

Cavazos-Cadena, Rolando; Montes-De-Oca, Raúl - In: Mathematical Methods of Operations Research 49 (1999) 3, pp. 441-456

Persistent link: https://www.econbiz.de/10010999528

An envelope theorem and some applications to discounted Markov decision processes

Cruz-Suárez, Hugo; Montes-de-Oca, Raúl - In: Mathematical Methods of Operations Research 67 (2008) 2, pp. 299-321

In this paper, an Envelope Theorem (ET) will be established for optimization problems on Euclidean spaces. In general, the Envelope Theorems permit analyzing an optimization problem and giving the solution by means of differentiability techniques. The ET will be presented in two versions. One of...

Persistent link: https://www.econbiz.de/10010999686

Average cost Markov control processes: stability with respect to the Kantorovich metric

Gordienko, Evgueni; Lemus-Rodríguez, Enrique; … - In: Mathematical Methods of Operations Research 70 (2009) 1, pp. 13-33

We study perturbations of a discrete-time Markov control process on a general state space. The amount of perturbation is measured by means of the Kantorovich distance. We assume that an average (per unit of time on the infinite horizon) optimal control policy can be found for the perturbed...

Persistent link: https://www.econbiz.de/10010999704

Nearly optimal policies in risk-sensitive positive dynamic programming on discrete spaces

Cavazos-Cadena, Rolando; Montes-de-Oca, Raúl - In: Mathematical Methods of Operations Research 52 (2000) 1, pp. 133-167

This note concerns Markov decision processes on a discrete state space. It is supposed that the reward function is nonnegative, and that the decision maker has a nonnull constant risk-sensitivity, which leads to grade random rewards via the expectation of an exponential utility function. The...

Persistent link: https://www.econbiz.de/10010999736