Showing 1 - 10 of 25
Persistent link: https://www.econbiz.de/10012886883
Persistent link: https://www.econbiz.de/10003965031
Persistent link: https://www.econbiz.de/10009272035
We consider a general-sum N-player linear-quadratic game with stochastic dynamics over a finite horizon and prove the global convergence of the natural policy gradient method to the Nash equilibrium. In order to prove convergence of the method we require a certain amount of noise in the system....
Persistent link: https://www.econbiz.de/10013217478
We explore reinforcement learning methods for finding the optimal policy in the linear quadratic regulator (LQR) problem. In particular we consider the convergence of policy gradient methods in the setting of known and unknown parameters. We are able to produce a global linear convergence...
Persistent link: https://www.econbiz.de/10013251559
Persistent link: https://www.econbiz.de/10012289282
Persistent link: https://www.econbiz.de/10012265221
Persistent link: https://www.econbiz.de/10014340947
Persistent link: https://www.econbiz.de/10014384165
Persistent link: https://www.econbiz.de/10014514707