Showing 1 - 10 of 27
In this paper we consider the weighted reward Markov decision process, with perturbation. The “weighted reward” refers to appropriately normalized convex combination of the discounted and the long-run average reward criteria. This criterion allows the controller to trade-off short-term costs...
Persistent link: https://www.econbiz.de/10010999741
In this paper, Weighted reward Perturbed Markov Decision Processes with finite state and countable action spaces (semi-infinite WMDP for short) are considered. The ”weighted reward” refers to appropriately normalized convex combination of the discounted and the long-run average reward...
Persistent link: https://www.econbiz.de/10010999575
In this paper weighted singularly perturbed hybrid stochastic systems are discussed. Under some reasonable assumptions, it is shown that there exists a uniformly δ-optimal policy when the perturbation is sufficiently small. Copyright Springer-Verlag Berlin Heidelberg 2005
Persistent link: https://www.econbiz.de/10010999595
We consider the problem of the perturbation of a class of linear-quadratic differential games with piecewise deterministic dynamics, where the changes from one structure (for the dynamics) to another are governed by a finite-state Markov process. Player 1 controls the continuous dynamics,...
Persistent link: https://www.econbiz.de/10010950019
In this paper, Discounted Markov Decision Processes with finite state and countable action set (semi-infinite DMDP for short) are considered. A policy improvement finite algorithm which finds a nearly optimal deterministic strategy is presented. The steps of the algorithm are based on the...
Persistent link: https://www.econbiz.de/10010950056
Stochastic shortest path problems (SSPPs) have many applications in practice and are subject of ongoing research for many years. This paper considers a variant of SSPPs where times or costs to pass an edge in a graph are, possibly correlated, random variables. There are two general goals one can...
Persistent link: https://www.econbiz.de/10014504089
This work concerns controlled Markov chains with denumerable state space and discrete time parameter. The reward function is assumed to be≤0 and the performance of a control policy is measured by the expected total-reward criterion. Within this context, sufficient conditions are given so that...
Persistent link: https://www.econbiz.de/10010999528
For sequential decision processes with countable state spaces, we prove compactness of the set of strategic measures corresponding to nonrandomized policies. For the Borel state case, this set may not be compact (Piunovskiy, Optimal control of random sequences in problems with constraints....
Persistent link: https://www.econbiz.de/10010999682
In this paper we consider Markov Decision Processes with discounted cost and a random rate in Borel spaces. We establish the dynamic programming algorithm in finite and infinity horizon cases. We provide conditions for the existence of measurable selectors. And we show an example of...
Persistent link: https://www.econbiz.de/10010999690
This paper considers an assembly system where a firm produces a single product which is assembled using two types of components (component 1 and component 2). The components are provided by individual suppliers (supplier 1 and supplier 2). We assume that the firm makes different procurement...
Persistent link: https://www.econbiz.de/10010999700