Similar Search Results

Semi-infinite weighted Markov decision processes with perturbation

Abbad, Mohammed; Rahhali, Khalid - In: Computational Statistics 60 (2004) 2, pp. 251-265

In this paper, Weighted reward Perturbed Markov Decision Processes with finite state and countable action spaces (semi-infinite WMDP for short) are considered. The ”weighted reward” refers to appropriately normalized convex combination of the discounted and the long-run average reward...

Persistent link: https://www.econbiz.de/10010759177

Weighted singularly perturbed hybrid stochastic systems

Liu, Ke; Filar, Jerzy A. - In: Computational Statistics 62 (2005) 1, pp. 41-54

In this paper weighted singularly perturbed hybrid stochastic systems are discussed. Under some reasonable assumptions, it is shown that there exists a uniformly δ-optimal policy when the perturbation is sufficiently small. Copyright Springer-Verlag Berlin Heidelberg 2005

Persistent link: https://www.econbiz.de/10010847546

Perturbation of linear quadratic systems with jump parameters and hybrid controls

Azouzi, Rachid El; Abbad, Mohammed; Altman, Eitan - In: Computational Statistics 51 (2000) 3, pp. 399-417

We consider the problem of the perturbation of a class of linear-quadratic differential games with piecewise deterministic dynamics, where the changes from one structure (for the dynamics) to another are governed by a finite-state Markov process. Player 1 controls the continuous dynamics,...

Persistent link: https://www.econbiz.de/10010847587

Semi-infinite discounted Markov decision processes: Policy improvement and singular perturbations

Abbad, Mohammed; Rahhali, Khalid - In: Computational Statistics 54 (2001) 2, pp. 279-290

In this paper, Discounted Markov Decision Processes with finite state and countable action set (semi-infinite DMDP for short) are considered. A policy improvement finite algorithm which finds a nearly optimal deterministic strategy is presented. The steps of the algorithm are based on the...

Persistent link: https://www.econbiz.de/10010847635

Nearly optimal stationary policies in negative dynamic programming

Cavazos-Cadena, Rolando; Montes-De-Oca, Raúl - In: Computational Statistics 49 (1999) 3, pp. 441-456

This work concerns controlled Markov chains with denumerable state space and discrete time parameter. The reward function is assumed to be≤0 and the performance of a control policy is measured by the expected total-reward criterion. Within this context, sufficient conditions are given so that...

Persistent link: https://www.econbiz.de/10010847483

Risk-sensitive capacity control in revenue management

Barz, C.; Waldmann, K. - In: Computational Statistics 65 (2007) 3, pp. 565-579

Both the static and the dynamic single-leg revenue management problem are studied from the perspective of a risk-averse decision maker. Structural results well-known from the risk-neutral case are extended to the risk-averse case on the basis of an exponential utility function. In particular,...

Persistent link: https://www.econbiz.de/10010847579

Compactness of the space of non-randomized policies in countable-state sequential decision processes

Chen, Richard; Feinberg, Eugene - In: Computational Statistics 71 (2010) 2, pp. 307-323

For sequential decision processes with countable state spaces, we prove compactness of the set of strategic measures corresponding to nonrandomized policies. For the Borel state case, this set may not be compact (Piunovskiy, Optimal control of random sequences in problems with constraints....

Persistent link: https://www.econbiz.de/10010847644

Multi-policy iteration with a distributed voting

Chang, Hyeong Soo - In: Computational Statistics 60 (2004) 2, pp. 299-310

We present a novel simulation-based algorithm, as an extension of the well-known policy iteration algorithm, by combining multi-policy improvement with a distributed simulation-based voting policy evaluation, for approximately solving Markov Decision Processes (MDPs) with infinite horizon...

Persistent link: https://www.econbiz.de/10010847739

Dynamic order replenishment policy in internet-based supply chains

Berman, Oded; Kim, Eungab - In: Computational Statistics 53 (2001) 3, pp. 371-390

We consider a problem of dynamic replenishment of parts in the supply chain consisting of single class of customers, company, and supplier. Customers request a service via the WEB-based ordering system and the company supports service using parts which are procured from the supplier. The...

Persistent link: https://www.econbiz.de/10010847765

The finiteness of the reward function and the optimal value function in Markov decision processes

Hu, Qiying; Xu, Chen - In: Computational Statistics 49 (1999) 2, pp. 255-266

This paper studies the discrete time Markov decision processes (MDP) with expected discounted total reward, where the state space is countable, the action space is measurable, the reward function is extended real-valued, and the discount rate may be any real number. Two conditions (GC) and (C)...

Persistent link: https://www.econbiz.de/10010847961