Similar Search Results

A Distributional Interpretation of Robust Optimization

Xu, Huan; Caramanis, Constantine; Mannor, Shie - In: Mathematics of operations research 37 (2012) 1, pp. 95-111

Persistent link: https://www.econbiz.de/10009835382

Optimization Under Probabilistic Envelope Constraints

Xu, Huan; Caramanis, Constantine; Mannor, Shie - In: Operations research : the journal of the Operations … 60 (2012) 3, pp. 682-700

Persistent link: https://www.econbiz.de/10009995834

A contract-based model for directed network formation

Johari, Ramesh; Mannor, Shie; Tsitsiklis, John N. - In: Games and Economic Behavior 56 (2006) 2, pp. 201-224

Persistent link: https://www.econbiz.de/10005413666

Regret minimization in repeated matrix games with variable stage duration

Mannor, Shie; Shimkin, Nahum - In: Games and Economic Behavior 63 (2008) 1, pp. 227-258

Regret minimization in repeated matrix games has been extensively studied ever since Hannan's seminal paper [Hannan, J., 1957. Approximation to Bayes risk in repeated play. In: Dresher, M., Tucker, A.W., Wolfe, P. (Eds.), Contributions to the Theory of Games, vol. III. Ann. of Math. Stud., vol....

Persistent link: https://www.econbiz.de/10005413696

On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies

Mannor, Shie; Tsitsiklis, John N. - In: Mathematics of operations research 30 (2005) 3, pp. 545-561

Persistent link: https://www.econbiz.de/10006417239

The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes

Mannor, Shie; Shimkin, Nahum - In: Mathematics of operations research 28 (2003) 2, pp. 327-345

Persistent link: https://www.econbiz.de/10006418041

Approachability in repeated games: Computational aspects and a Stackelberg variant

Mannor, Shie; Tsitsiklis, John N. - In: Games and Economic Behavior 66 (2009) 1, pp. 315-325

We consider a finite two-player zero-sum game with vector-valued rewards. We study the question of whether a given polyhedral set D is "approachable," that is, whether Player 1 (the "decision maker") can guarantee that the long-term average reward belongs to D, for any strategy of Player 2 (the...

Persistent link: https://www.econbiz.de/10005066714

Bias and Variance Approximation in Value Function Estimates

Mannor, Shie; Simester, Duncan; Sun, Peng; Tsitsiklis, … - In: Management Science 53 (2007) 2, pp. 308-322

We consider a finite-state, finite-action, infinite-horizon, discounted reward Markov decision process and study the bias and variance in the value function estimates that result from empirical estimates of the model parameters. We provide closed-form approximations for the bias and variance,...

Persistent link: https://www.econbiz.de/10009209247

Basis Function Adaptation in Temporal Difference Reinforcement Learning

Menache, Ishai; Mannor, Shie; Shimkin, Nahum - 2005

Persistent link: https://www.econbiz.de/10008214454

A Tutorial on the Cross-Entropy Method

de Boer, Pieter-Tjerk; Kroese, Dirk P.; Mannor, Shie; … - 2005

Persistent link: https://www.econbiz.de/10008214462