Convergence of finite memory Q learning for POMDPs and near optimality of learned policies under filter stability
Ali Devran Kara, Serdar Yüksel
Year of publication: |
2023
|
---|---|
Authors: | Kara, Ali Devran ; Yüksel, Serdar |
Published in: |
Mathematics of operations research. - Hanover, Md. : INFORMS, ISSN 1526-5471, ZDB-ID 2004273-5. - Vol. 48.2023, 4, p. 2066-2093
|
Subject: | nonlinear filtering | partially observed MDP | reinforcement learning | Theorie | Theory | Lernprozess | Learning process | Lernen | Learning | Zustandsraummodell | State space model | Zeitreihenanalyse | Time series analysis |
Saved in:
Online Resource
Saved in favorites
Similar items by subject
-
Learning with uncertain inflation target
Marzioni, Stefano, (2023)
-
Nakagawa, Hidetoshi, (2014)
-
Piecewise-linear approximations and filtering for DSGE models with occasionally-binding constraints
Aruoba, S. Borağan, (2021)
- More ...
Similar items by person