Reliable off-policy evaluation for reinforcement learning
Year of publication: |
2024
|
---|---|
Authors: | Wang, Jie ; Gao, Rui ; Zha, Hongyuan |
Published in: |
Operations research. - Linthicum, Md. : INFORMS, ISSN 1526-5463, ZDB-ID 2019440-7. - Vol. 72.2024, 2, p. 699-716
|
Subject: | reinforcement learning | Simulation | uncertainty quantification | Wasserstein robust optimization | Lernen | Learning | Lernprozess | Learning process | Theorie | Theory | Robustes Verfahren | Robust statistics | Mathematische Optimierung | Mathematical programming |
-
A unified framework for stochastic optimization
Powell, Warren B., (2019)
-
Sequential interdiction with incomplete information and learning
Borrero, Juan S., (2019)
-
Learning in sequential bilevel linear programming
Borrero, Juan S., (2022)
- More ...
-
Two-way Poisson mixture models for simultaneous document classification and word clustering
Li, Jia, (2006)
-
2nd Special issue on matrix computations and statistics
Barlow, Jesse L., (2006)
-
Barlow, Jesse L., (2002)
- More ...