Showing 1 - 10 of 19
Persistent link: https://www.econbiz.de/10009835382
Persistent link: https://www.econbiz.de/10009995834
Persistent link: https://www.econbiz.de/10005413666
Regret minimization in repeated matrix games has been extensively studied ever since Hannan's seminal paper [Hannan, J., 1957. Approximation to Bayes risk in repeated play. In: Dresher, M., Tucker, A.W., Wolfe, P. (Eds.), Contributions to the Theory of Games, vol. III. Ann. of Math. Stud., vol....
Persistent link: https://www.econbiz.de/10005413696
Persistent link: https://www.econbiz.de/10006417239
Persistent link: https://www.econbiz.de/10006418041
We consider a finite two-player zero-sum game with vector-valued rewards. We study the question of whether a given polyhedral set D is "approachable," that is, whether Player 1 (the "decision maker") can guarantee that the long-term average reward belongs to D, for any strategy of Player 2 (the...
Persistent link: https://www.econbiz.de/10005066714
We consider a finite-state, finite-action, infinite-horizon, discounted reward Markov decision process and study the bias and variance in the value function estimates that result from empirical estimates of the model parameters. We provide closed-form approximations for the bias and variance,...
Persistent link: https://www.econbiz.de/10009209247
Persistent link: https://www.econbiz.de/10008214454
Persistent link: https://www.econbiz.de/10008214462