Lehrer, Ehud; Smorodinsky, Rann - 2012
Consider an agent who faces a sequential decision problem. At each stage the agent takes an action and observes a stochastic outcome e.g., daily prices, weather conditions, opponents’ actions in a repeated game, etc. The agent’s stage-utility depends on his action, the observed outcome and...