Yang, Bo; Nadarajah, Selvaprabu; Secomandi, Nicola - 2021
We study merchant energy production modeled as a compound switching and timing option. The resulting Markov decision process is intractable. Least squares Monte Carlo combined with information relaxation and duality is a state-of-the-art reinforcement learning methodology to obtain operating...