Multi-armed bandit with sub-exponential rewards
Year of publication: | 2021 |
---|---|
Authors: | Jia, Huiwen ; Shi, Cong ; Shen, Siqian |
Published in: | Operations research letters. - Amsterdam [u.a.] : Elsevier, ISSN 0167-6377, ZDB-ID 720735-9. - Vol. 49.2021, 5, p. 728-733 |
Subject: | Multi-armed bandit | Sub-exponential reward | Unbounded reward | Upper confidence bound |
-
Sample-path optimality and variance-maximization for Markov decision processes
Zhu, Q., (2007)
-
Bayesian estimation of probabilities of default for low default portfolios
Tasche, Dirk, (2013)
-
Online learning and pricing for service systems with reusable resources
Jia, Huiwen, (2024)
-
Online Learning and Pricing for Service Systems with Reusable Resources
Jia, Huiwen, (2022)
-
Online Learning and Pricing for Network Revenue Management with Reusable Resources
Jia, Huiwen, (2022)