Showing 1 - 10 of 14
Persistent link: https://www.econbiz.de/10001568127
The discrete time discounted multi-objective Markov decision model with incomplete state observations is presented. The lexicographically order optimal problem is shown to be equivalent to the optimal problem of a one-reward Markov decision model with complete state observations. Copyright...
Persistent link: https://www.econbiz.de/10010847968
Persistent link: https://www.econbiz.de/10006618230
The discrete time discounted multi-objective Markov decision model with incomplete state observations is presented. The lexicographically order optimal problem is shown to be equivalent to the optimal problem of a one-reward Markov decision model with complete state observations. Copyright...
Persistent link: https://www.econbiz.de/10010794922
Dynamic principal agent models are formulated based on constrained Markov decision process (CMDP), in which conditions are given that the state space of the system is countable and the agent chooses his actions from a countable action set. If the principal has finite alternative contracts to...
Persistent link: https://www.econbiz.de/10010847844
Dynamic principal agent models are formulated based on constrained Markov decision process (CMDP), in which conditions are given that the state space of the system is countable and the agent chooses his actions from a countable action set. If the principal has finite alternative contracts to...
Persistent link: https://www.econbiz.de/10010950240
Persistent link: https://www.econbiz.de/10006612656
Persistent link: https://www.econbiz.de/10010520460
The goal of this paper is provide a theory of K-person non-stationary Markov games with unbounded rewards, for a countable state space and action spaces. We investigate both the finite and infinite horizon problems. We define the concept of strong Nash equilibrium and present conditions for both...
Persistent link: https://www.econbiz.de/10010950241
Persistent link: https://www.econbiz.de/10006622517