Similar Search Results

Weighted Markov decision processes with perturbation

Liu, Ke; Filar, Jerzy A. - In: Mathematical Methods of Operations Research 53 (2001) 3, pp. 465-480

In this paper we consider the weighted reward Markov decision process, with perturbation. The “weighted reward” refers to appropriately normalized convex combination of the discounted and the long-run average reward criteria. This criterion allows the controller to trade-off short-term costs...

Persistent link: https://www.econbiz.de/10010999741

Semi-infinite weighted Markov decision processes with perturbation

Abbad, Mohammed; Rahhali, Khalid - In: Mathematical Methods of Operations Research 60 (2004) 2, pp. 251-265

In this paper, Weighted reward Perturbed Markov Decision Processes with finite state and countable action spaces (semi-infinite WMDP for short) are considered. The ”weighted reward” refers to appropriately normalized convex combination of the discounted and the long-run average reward...

Persistent link: https://www.econbiz.de/10010999575

Weighted singularly perturbed hybrid stochastic systems

Liu, Ke; Filar, Jerzy A. - In: Mathematical Methods of Operations Research 62 (2005) 1, pp. 41-54

In this paper weighted singularly perturbed hybrid stochastic systems are discussed. Under some reasonable assumptions, it is shown that there exists a uniformly δ-optimal policy when the perturbation is sufficiently small. Copyright Springer-Verlag Berlin Heidelberg 2005

Persistent link: https://www.econbiz.de/10010999595

Perturbation of linear quadratic systems with jump parameters and hybrid controls

Azouzi, Rachid El; Abbad, Mohammed; Altman, Eitan - In: Mathematical Methods of Operations Research 51 (2000) 3, pp. 399-417

We consider the problem of the perturbation of a class of linear-quadratic differential games with piecewise deterministic dynamics, where the changes from one structure (for the dynamics) to another are governed by a finite-state Markov process. Player 1 controls the continuous dynamics,...

Persistent link: https://www.econbiz.de/10010950019

Semi-infinite discounted Markov decision processes: Policy improvement and singular perturbations

Abbad, Mohammed; Rahhali, Khalid - In: Mathematical Methods of Operations Research 54 (2001) 2, pp. 279-290

In this paper, Discounted Markov Decision Processes with finite state and countable action set (semi-infinite DMDP for short) are considered. A policy improvement finite algorithm which finds a nearly optimal deterministic strategy is presented. The steps of the algorithm are based on the...

Persistent link: https://www.econbiz.de/10010950056

A multi-objective approach for PH-graphs with applications to stochastic shortest paths

Buchholz, Peter; Dohndorf, Iryna - In: Mathematical Methods of Operations Research 93 (2020) 1, pp. 153-178

Stochastic shortest path problems (SSPPs) have many applications in practice and are subject of ongoing research for many years. This paper considers a variant of SSPPs where times or costs to pass an edge in a graph are, possibly correlated, random variables. There are two general goals one can...

Persistent link: https://www.econbiz.de/10014504089

Dynamic inventory strategies for profit maximization in a service facility with stochastic service, demand and lead time

Berman, Oded; Kim, Eungab - In: Mathematical Methods of Operations Research 60 (2004) 3, pp. 497-521

This paper addresses an optimal inventory control in a supply chain in which customers arrive at a facility according to a Poisson process and the facility provides service which takes exponential amounts of time, using items supplied by an outside supplier with exponential lead time process....

Persistent link: https://www.econbiz.de/10010949966

Asymptotic linear programming and policy improvement for singularly perturbed Markov decision processes

Altman, Eitan; Avrachenkov, Konstantin E.; Filar, Jerzy A. - In: Mathematical Methods of Operations Research 49 (1999) 1, pp. 97-109

In this paper we consider a singularly perturbed Markov decision process with finitely many states and actions and the limiting expected average reward criterion. We make no assumptions about the underlying ergodic structure. We present algorithms for the computation of a uniformly optimal...

Persistent link: https://www.econbiz.de/10010949977

Risk-sensitive capacity control in revenue management

Barz, C.; Waldmann, K. - In: Mathematical Methods of Operations Research 65 (2007) 3, pp. 565-579

Both the static and the dynamic single-leg revenue management problem are studied from the perspective of a risk-averse decision maker. Structural results well-known from the risk-neutral case are extended to the risk-averse case on the basis of an exponential utility function. In particular,...

Persistent link: https://www.econbiz.de/10010950011

On asymptotic optimization of a class of nonlinear stochastic hybrid systems

Shi, Peng; Altman, Eitan; Gaitsgory, Vladimir - In: Mathematical Methods of Operations Research 47 (1998) 2, pp. 289-315

We consider the problem of control for continuous time stochastic hybrid systems in finite time horizon. The systems considered are nonlinear: the state evolution is a nonlinear function of both the control and the state. The control parameters change at discrete times according to an underlying...

Persistent link: https://www.econbiz.de/10010950071