Delay-adaptive learning in generalized linear contextual bandits
Year of publication: | 2024 |
---|---|
Authors: | Blanchet, Jose ; Xu, Renyuan ; Zhou, Zhengyuan |
Published in: | Mathematics of operations research. - Hanover, Md. : INFORMS, ISSN 1526-5471, ZDB-ID 2004273-5. - Vol. 49.2024, 1, p. 326-345 |
Subject: | contextual bandits | delayed feedback | generalized linear model | MLE | Lernprozess | Learning process | Experiment | Schätztheorie | Estimation theory |
- Smooth contextual bandits : bridging the parametric and nondifferentiable regret regimes
  Hu, Yichun, (2022)
- LocalGLMnet : interpretable deep learning for tabular data
  Richman, Ronald, (2023)
- Distributionally robust batch contextual bandits
  Si, Nian, (2023)
- Optimal No-Regret Learning in Strongly Monotone Games with Bandit Feedback
  Lin, Tianyi, (2021)
- Deterministic and stochastic wireless network games : equilibrium, dynamics, and price of anarchy
  Zhou, Zhengyuan, (2018)