Time aggregated Markov decision processes via standard dynamic programming
Year of publication: | 2011 |
---|---|
Authors: | Arruda, Edilson Fernandes de ; Fragoso, Marcelo D. |
Published in: | Operations research letters. - Amsterdam [u.a.] : Elsevier, ISSN 0167-6377, ZDB-ID 720735-9. - Vol. 39.2011, 3, p. 193-197 |
Subject: | Theorie | Theory | Mathematische Optimierung | Mathematical programming | Markov-Kette | Markov chain | Entscheidung | Decision | Dynamische Optimierung | Dynamic programming | Aggregation |
-
Improved and generalized upper bounds on the complexity of policy iteration
Scherrer, Bruno, (2016)
-
Finitely additive dynamic programming
Sudderth, William D., (2016)
-
Schlosser, Rainer, (2022)
-
Accelerating the convergence of value iteration by using partial transition functions
Arruda, Edilson Fernandes de, (2013)
-
Arruda, Edilson Fernandes de, (2021)
-
Siqueira, Cecília L., (2018)