Similar Search Results

Eligibility Traces for Off-Policy Policy Evaluation

Precup, Doina - 2000

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference methods. Here we generalize eligibility traces to off-policy learning, in which one learns about a policy different from...

Persistent link: https://www.econbiz.de/10009468145

Overcommitment in cloud services : bin packing with chance constraints

Cohen, Maxime C.; Keller, Philipp W.; Mirrokni, Vahab; … - In: Management science : journal of the Institute for … 65 (2019) 7, pp. 3255-3271

Persistent link: https://www.econbiz.de/10012039988