X-Armed Bandits
Year of publication: |
2011-04-19
|
---|---|
Authors: | Bubeck, Sébastien ; Munos, Rémi ; Stoltz, Gilles ; Szepesvari, Csaba |
Institutions: | HAL |
Subject: | bandits with infinitely many arms | optimistic online optimization | regret bounds | minimax rates |
Extent: | application/pdf |
---|---|
Series: | |
Type of publication: | Book / Working Paper |
Language: | English |
Notes: | View the original document on HAL open archive server: http://hal.archives-ouvertes.fr/hal-00450235/en/ Published, Journal of Machine Learning Research, 2011, 12, 1655-1695 |
Source: |
-
Policy choice in time series by empirical welfare maximization
Kitagawa, Toru, (2022)
-
Policy choice in time series by empirical welfare maximization
Kitagawa, Toru, (2022)
-
Agrawal, Shipra, (2022)
- More ...
-
Online Optimization in X-Armed Bandits
Bubeck, Sébastien, (2008)
-
Pure Exploration for Multi-Armed Bandit Problems
Bubeck, Sébastien, (2010)
-
Do countries falsify economic data strategically? Some evidence that they might.
Michalski, Tomasz, (2013)
- More ...