Policy learning with adaptively collected data
Year of publication: |
2024
|
---|---|
Authors: | Zhan, Ruohan ; Ren, Zhimei ; Athey, Susan ; Zhou, Zhengyuan |
Published in: |
Management science : journal of the Institute for Operations Research and the Management Sciences. - Hanover, Md. : INFORMS, ISSN 1526-5501, ZDB-ID 2023019-9. - Vol. 70.2024, 8, p. 5270-5297
|
Subject: | adaptive data collection | contextual bandits | minimax optimality | off-line policy learning | personalized decision making | Experiment | Lernprozess | Learning process | Entscheidung | Decision | Theorie | Theory |
-
Distributionally robust batch contextual bandits
Si, Nian, (2023)
-
Small-loss bounds for online learning with partial information
Lykouris, Thodoris, (2022)
-
Online decision making with high-dimensional covariates
Bastani, Hamsa, (2020)
- More ...
-
Dynamic batch learning in high-dimensional sparse linear contextual bandits
Ren, Zhimei, (2024)
-
Offline multi-action policy learning : generalization and optimization
Zhou, Zhengyuan, (2023)
-
Optimal No-Regret Learning in Strongly Monotone Games with Bandit Feedback
Lin, Tianyi, (2021)
- More ...