Reliable off-policy evaluation for reinforcement learning

Jie Wang, Rui Gao, Hongyuan Zha

Year of publication:	2024
Authors:	Wang, Jie ; Gao, Rui ; Zha, Hongyuan
Published in:	Operations research. - Linthicum, Md. : INFORMS, ISSN 1526-5463, ZDB-ID 2019440-7. - Vol. 72.2024, 2, p. 699-716
Subject:	reinforcement learning | Simulation | uncertainty quantification | Wasserstein robust optimization | Learning | Learning process | Theory | Robust statistics | Mathematical programming

Type of publication:	Article
Type of publication (narrower categories):	Article in journal
Language:	English
Other identifiers:	10.1287/opre.2022.2382 [DOI]
Source:	ECONIS - Online Catalogue of the ZBW

Persistent link: https://www.econbiz.de/10014520884