Castro, Pablo S.; Ajit, Desai; Du, Han; Garratt, Rod; … - 2021 - Last updated: February 1, 2021
This paper uses reinforcement learning (RL) to approximate the policy rules of banks participating in a high-value payments system. The objective of the agents is to learn a policy function for the choice of amount of liquidity provided to the system at the beginning of the day. Individual...