Convergence of finite memory Q learning for POMDPs and near optimality of learned policies under filter stability

Ali Devran Kara, Serdar Yüksel

Year of publication:	2023
Authors:	Kara, Ali Devran ; Yüksel, Serdar
Published in:	Mathematics of operations research. - Hanover, Md. : INFORMS, ISSN 1526-5471, ZDB-ID 2004273-5. - Vol. 48.2023, 4, p. 2066-2093
Subject:	nonlinear filtering \| partially observed MDP \| reinforcement learning \| Theorie \| Theory \| Lernprozess \| Learning process \| Lernen \| Learning \| Zustandsraummodell \| State space model \| Zeitreihenanalyse \| Time series analysis

Online Resource

Check full text access |

More access options

doi.org

Check Google Scholar

In libraries world-wide (WorldCat)

In German libraries (KVK)

Type of publication:	Article
Type of publication (narrower categories):	Aufsatz in Zeitschrift ; Article in journal
Language:	English
Other identifiers:	10.1287/moor.2022.1331 [DOI]
Source:	ECONIS - Online Catalogue of the ZBW

Persistent link: https://www.econbiz.de/10014437816

A service of the