Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming

Year of publication:	2012
Authors:	Bertsekas, Dimitri P. ; Yu, Huizhen
Published in:	Mathematics of operations research. - Linthicum, Md : Inst, ISSN 0364-765X, ZDB-ID 1956838. - Vol. 37.2012, 1 (13.6.), p. 66-95

Type of publication:	Article
Source:	OLC-SSG Economic Sciences

Persistent link: https://www.econbiz.de/10009835381