Showing 41 - 50 of 89
Persistent link: https://www.econbiz.de/10006418041
Regret minimization in repeated matrix games has been extensively studied ever since Hannan's seminal paper [Hannan, J., 1957. Approximation to Bayes risk in repeated play. In: Dresher, M., Tucker, A.W., Wolfe, P. (Eds.), Contributions to the Theory of Games, vol. III. Ann. of Math. Stud., vol....
Persistent link: https://www.econbiz.de/10005413696
We propose simple randomized strategies for sequential prediction under imperfect monitoring, that is, when the forecaster does not have access to the past outcomes but rather to a feedback signal. The proposed strategies are consistent in the sense that they achieve, asymptotically, the best...
Persistent link: https://www.econbiz.de/10009151309
We provide yet another proof of the existence of calibrated forecasters; it has two merits. First, it is valid for an arbitrary finite number of outcomes. Second, it is short and simple and it follows from a direct application of Blackwell's approachability theorem to carefully chosen...
Persistent link: https://www.econbiz.de/10009151328
We consider a multiarmed bandit problem where the expected reward of each arm is a linear function of an unknown scalar with a prior distribution. The objective is to choose a sequence of arms that maximizes the expected total (or discounted total) reward. We demonstrate the effectiveness of a...
Persistent link: https://www.econbiz.de/10009432173
Persistent link: https://www.econbiz.de/10003977866
Persistent link: https://www.econbiz.de/10003985721
Persistent link: https://www.econbiz.de/10003962626
Persistent link: https://www.econbiz.de/10011422750
Persistent link: https://www.econbiz.de/10011304103