//-->
Q-learning and enhanced policy iteration in discounted dynamic programming
Bertsekas, Dimitri P., (2012)
On near optimality of the set of finite-state controllers for average cost POMDP
Yu, Huizhen, (2008)
Error bounds for approximations from projected linear equations
Yu, Huizhen, (2010)