Explore first, exploit next: the true shape of regret in bandit problems
Year of publication: 2019
Authors: Garivier, Aurélien; Ménard, Pierre; Stoltz, Gilles
Published in: Mathematics of Operations Research. - Catonsville, MD: INFORMS, ISSN 0364-765X, ZDB-ID 195683-8. - Vol. 44 (2019), no. 2, pp. 377-399
Subjects: multiarmed bandits | cumulative regret | information-theoretic proof techniques | nonasymptotic lower bounds | theory | decision under uncertainty | decision
- Bayesian incentive-compatible bandit exploration / Mansour, Yishay (2020)
- Does personalized information improve health plan choices when individuals are distracted? / Kaumann, Cornel (2018)
- Optimal screening for hepatocellular carcinoma: a restless bandit model / Lee, Elliot (2019)
- Joint Estimation of Intersecting Context Tree Models / Galves, Antonio (2013)
- Trente ans de traduction poétique en France [Thirty years of poetic translation in France] / Ménard, Pierre (1994)