Similar Search Results

Pathwise Optimization for Optimal Stopping Problems

Desai, Vijay V.; Farias, Vivek F.; Moallemi, Ciamac C. - In: Management Science 58 (2012) 12, pp. 2292-2308

We introduce the pathwise optimization (PO) method, a new convex optimization procedure to produce upper and lower bounds on the optimal value (the "price") of a high-dimensional optimal stopping problem. The PO method builds on a dual characterization of optimal stopping problems as...

Persistent link: https://www.econbiz.de/10010990541

Approximate dynamic programming via a smoothed linear program

Desai, Vijay V.; Farias, Vivek F.; Moallemi, Ciamac C. - In: Operations research 60 (2012) 3, pp. 655-674

Persistent link: https://www.econbiz.de/10009575346

Pathwise optimization for optimal stopping problems

Desai, Vijay V.; Farias, Vivek F.; Moallemi, Ciamac C. - In: Management science : journal of the Institute for … 58 (2012) 12, pp. 2292-2308

Persistent link: https://www.econbiz.de/10009701851

Pathwise Optimization for Optimal Stopping Problems

Desai, Vijay V.; Farias, Vivek F.; Moallemi, Ciamac C. - In: Management science : journal of the Institute for … 58 (2012) 12, pp. 2292-2308

Persistent link: https://www.econbiz.de/10010055802

Approximate Dynamic Programming via a Smoothed Linear Program

Desai, Vijay V.; Farias, Vivek F.; Moallemi, Ciamac C. - In: Operations research : the journal of the Operations … 60 (2012) 3, pp. 655-674

Persistent link: https://www.econbiz.de/10009995832

Near-optimal A-B testing

Bhat, Nikhil; Farias, Vivek F.; Moallemi, Ciamac C.; … - In: Management science : journal of the Institute for … 66 (2020) 10, pp. 4477-4495

Persistent link: https://www.econbiz.de/10012305255

Universal Reinforcement Learning

Moallemi, Ciamac C. - 2012

We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can influence future observations and costs. The goal is to minimize the long-term average cost. We propose a novel algorithm, known as...

Persistent link: https://www.econbiz.de/10013113812