Kato, Masahiro; Nakagawa, Kei - 2021
Risk management is critical in decision-making, and mean-variance (MV) trade-off is one of the most common criteria. However, in reinforcement learning (RL) under a dynamic environment, MV control is not as easy as that under a static environment owing to computational difficulties. For MV...