We introduce the pathwise optimization (PO) method, a new convex optimization procedure to produce upper and lower bounds on the optimal value (the "price") of a high-dimensional optimal stopping problem. The PO method builds on a dual characterization of optimal stopping problems as...
Persistent link: https://www.econbiz.de/10010990541
Persistent link: https://www.econbiz.de/10009575346
Persistent link: https://www.econbiz.de/10009701851
Persistent link: https://www.econbiz.de/10010055802
Persistent link: https://www.econbiz.de/10009995832
Persistent link: https://www.econbiz.de/10012305255
We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can influence future observations and costs. The goal is to minimize the long-term average cost. We propose a novel algorithm, known as...
Persistent link: https://www.econbiz.de/10013113812