Similar Search Results

Phase Transitions and Cyclic Phenomena in Bandits with Switching Constraints

Simchi-Levi, David - 2020

We consider the classical stochastic multi-armed bandit problem with a constraint on the total cost incurred by switching between actions. We prove matching upper and lower bounds on regret and provide near-optimal algorithms for this problem. Surprisingly, we discover phase transitions and...

Persistent link: https://www.econbiz.de/10012849391