Persistent link: https://www.econbiz.de/10014329888
The rapid changes in the finance industry due to the increasing amount of data have revolutionized the techniques for data processing and data analysis and brought new theoretical and computational challenges. In contrast to classical stochastic control theory and other analytical approaches for...
Persistent link: https://www.econbiz.de/10013403064
We explore reinforcement learning methods for finding the optimal policy in the linear quadratic regulator (LQR) problem. In particular, we consider the convergence of policy gradient methods in the settings of known and unknown parameters. We are able to produce a global linear convergence...
Persistent link: https://www.econbiz.de/10013251559
We consider a general-sum N-player linear-quadratic game with stochastic dynamics over a finite horizon and prove the global convergence of the natural policy gradient method to the Nash equilibrium. To prove convergence of the method, we require a certain amount of noise in the system....
Persistent link: https://www.econbiz.de/10013217478