Proximal reinforcement learning : efficient off-policy evaluation in partially observed Markov decision processes

Andrew Bennett, Nathan Kallus

Year of publication:	2024
Authors:	Bennett, Andrew ; Kallus, Nathan
Published in:	Operations research. - Linthicum, Md. : INFORMS, ISSN 1526-5463, ZDB-ID 2019440-7. - Vol. 72.2024, 3, p. 1071-1086
Subject:	Machine Learning and Data Science | offline reinforcement learning | semiparametric efficiency | unmeasured confounding | Artificial intelligence | Learning | Learning process | Theory | Markov chain

Online Resource

Type of publication:	Article
Type of publication (narrower categories):	Article in journal
Language:	English
Other identifiers:	10.1287/opre.2021.0781 [DOI]
Source:	ECONIS - Online Catalogue of the ZBW

Persistent link: https://www.econbiz.de/10014557447
