Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-Stationary Rewards
Year of publication: |
2020
|
---|---|
Authors: | Besbes, Omar |
Other Persons: | Gur, Yonatan (contributor) ; Zeevi, Assaf (contributor) |
Publisher: |
[2020]: [S.l.] : SSRN |
Subject: | Theorie | Theory | Mathematische Optimierung | Mathematical programming |
Extent: | 1 Online-Ressource (30 p) |
---|---|
Type of publication: | Book / Working Paper |
Language: | English |
Notes: | In: Stochastic Systems 9 (4), 319-337 (2019) Nach Informationen von SSRN wurde die ursprüngliche Fassung des Dokuments December 1, 2019 erstellt |
Other identifiers: | 10.2139/ssrn.2436629 [DOI] |
Source: | ECONIS - Online Catalogue of the ZBW |
-
Optimal proactive monitor placement & scheduling for IoT networks
Mostafa, Basma, (2022)
-
Eirinakis, Pavlos, (2024)
-
A hybrid ANN-MILP model for agile recovery production planning for PPE products under sharp demands
Babazadeh, Reza, (2025)
- More ...
-
Optimal Exploration-Exploitation in a Multi-armed-Bandit Problem with Non-stationary Rewards
Besbes, Omar, (2014)
-
Optimization in Online Content Recommendation Services: Beyond Click-Through-Rates
Besbes, Omar, (2014)
-
Non-Stationary Stochastic Optimization
Besbes, Omar, (2015)
- More ...