Liu, Xiaotian; Hu, Ming; Peng, Yijie; Yang, Yaodong - 2022
We apply Multi-Agent Deep Reinforcement Learning (MADRL) to inventory management problems with multiple echelons and … constructed by single-agent deep reinforcement learning and other heuristic policies. Also, the application of HAPPO results in a … less significant bullwhip effect than policies constructed by single-agent deep reinforcement learning where information is …