He, Zhongzhi Lawrence - 2022
This paper formulates a gradient-based reinforcement learning (GRL) model within a game-theoretic machine learning framework where players start from their initial circumstances with dispersed information, using the expected gradient to update choice propensities, and converge to the predicted...