Reliable off-policy evaluation for reinforcement learning
| Year of publication: | 2024 |
|---|---|
| Authors: | Wang, Jie ; Gao, Rui ; Zha, Hongyuan |
| Published in: | Operations research. - Linthicum, Md. : INFORMS, ISSN 1526-5463, ZDB-ID 2019440-7. - Vol. 72.2024, 2, p. 699-716 |
| Subject: | reinforcement learning ; Simulation ; uncertainty quantification ; Wasserstein robust optimization ; Theorie ; Theory ; Lernprozess ; Learning process ; Lernen ; Learning ; Robustes Verfahren ; Robust statistics |
- Reinforcement learning in robust Markov decision processes
  Lim, Shiau Hong, (2016)
- A unified framework for stochastic optimization
  Powell, Warren B., (2019)
- Lee, Elliot, (2015)
- Barlow, Jesse L., (2002)
- Machine Learning and Robust Data Mining
  Croux, Christophe, (2007)
- Continuum Isomap for manifold learnings
  Zha, Hongyuan, (2007)