Persistent link: https://www.econbiz.de/10014497340
In stochastic lot sizing subject to dynamic and random demand, the minimization of operational costs is not the only conceivable objective. Minimizing the tardiness in customer demand satisfaction is no less important. Furthermore, the decision maker is interested in production plan stability. …
Persistent link: https://www.econbiz.de/10014366827
Persistent link: https://www.econbiz.de/10013369271
Persistent link: https://www.econbiz.de/10011414783
Persistent link: https://www.econbiz.de/10011414799
Persistent link: https://www.econbiz.de/10012197155
The literature on learning in unknown environments emphasises reinforcing on actions which produce positive results …
Persistent link: https://www.econbiz.de/10011517970
Persistent link: https://www.econbiz.de/10011817848
The advent of reinforcement learning (RL) in financial markets is driven by several advantages inherent to this field … one integrated step, thereby closely aligning the machine learning problem with the objectives of the investor. At the … supervised learning methods, the RL research community has made considerable advances in the finance domain. The present paper …
Persistent link: https://www.econbiz.de/10011904954
dynamics and causal effects on observed variables. Using this connection, we develop two Reinforcement Learning methods termed … Direct Augmented V-Learning (DAV-Learning) and Safe Augmented V-Learning (SAV-Learning), which enable using the observed data … to efficiently learn an optimal treatment regime. We establish theoretical results for these learning methods, including …
Persistent link: https://www.econbiz.de/10012803079