//-->
Error bounds for approximations from projected linear equations
Yu, Huizhen, (2010)
Q-learning and policy iteration algorithms for stochastic shortest path problems
Yu, Huizhen, (2013)
On boundedness of Q-learning iterates for stochastic shortest path problems