Nonasymptotic analysis of Monte Carlo tree search
| Year of publication: | 2022 |
|---|---|
| Authors: | Shah, Devavrat ; Xie, Qiaomin ; Xu, Zhi |
| Published in: | Operations research. - Linthicum, Md. : INFORMS, ISSN 1526-5463, ZDB-ID 2019440-7. - Vol. 70.2022, 6, p. 3234-3260 |
| Subject: | Machine Learning and Data Science ; Monte Carlo tree search ; Nonstationary multi-armed bandit ; reinforcement learning ; Monte-Carlo-Simulation ; Monte Carlo simulation ; Künstliche Intelligenz ; Artificial intelligence ; Lernprozess ; Learning process ; Suchtheorie ; Search theory |
- Mo, Zhaobin, (2023)
- Online model-based reinforcement learning for decision-making in long distance routes
  Alcaraz, Juan J., (2022)
- Fast global convergence of natural policy gradient methods with entropy regularization
  Cen, Shicong, (2022)
- Greed works : online algorithms for unrelated machine stochastic scheduling
  Gupta, Varun, (2020)
- Xie, Qiaomin, (2023)
- Large shareholder participation behaviors, managers’ risk-taking and firm innovation performance
  Zhang, Feng, (2018)