Using deep reinforcement learning with hierarchical risk parity for portfolio optimization
Adrian Millea and Abbas Edalat
We devise a hierarchical decision-making architecture for We devise a hierarchical decision-making architecture for portfolio optimization on multiple markets. At the highest level a Deep Reinforcement Learning (DRL) agent selects among a number of discrete actions, representing low-level agents. For the low-level agents, we use a set of Hierarchical Risk Parity (HRP) and Hierarchical Equal Risk Contribution (HERC) models with different hyperparameters, which all run in parallel, off-market (in a simulation). The information on which the DRL agent decides which of the low-level agents should act next is constituted by the stacking of the recent performances of all agents. Thus, the modelling resembles a statefull, non-stationary, multi-arm bandit, where the performance of the individual arms changes with time and is assumed to be dependent on the recent history. We perform experiments on the cryptocurrency market (117 assets), on the stock market (46 assets) and on the foreign exchange market (28 pairs) showing the excellent robustness and performance of the overall system. Moreover, we eliminate the need for retraining and are able to deal with large testing sets successfully.portfolio optimization on multiple markets. At the highest level a Deep Reinforcement Learning (DRL) agent selects among a number of discrete actions, representing low-level agents. For the low-level agents, we use a set of Hierarchical Risk Parity (HRP) and Hierarchical Equal Risk Contribution (HERC) models with different hyperparameters, which all run in parallel, off-market (in a simulation). The information on which the DRL agent decides which of the low-level agents should act next is constituted by the stacking of the recent performances of all agents. Thus, the modelling resembles a statefull, non-stationary, multi-arm bandit, where the performance of the individual arms changes with time and is assumed to be dependent on the recent history. We perform experiments on the cryptocurrency market (117 assets), on the stock market (46 assets) and on the foreign exchange market (28 pairs) showing the excellent robustness and performance of the overall system. Moreover, we eliminate the need for retraining and are able to deal with large testing sets successfully.
Year of publication: |
2022
|
---|---|
Authors: | Millea, Adrian ; Edalat, Abbas |
Published in: |
International Journal of Financial Studies : open access journal. - Basel : MDPI, ISSN 2227-7072, ZDB-ID 2704235-2. - Vol. 11.2023, 1, Art.-No. 10, p. 1-16
|
Subject: | Deep Reinforcement Learning | Hierarchical Risk Parity | Hierarchical Equal Risk Contribution | portfolio optimization | cryptocurrencies | stocks | foreign exchange | Portfolio-Management | Portfolio selection | Theorie | Theory | Lernprozess | Learning process | Risiko | Risk | Risikomanagement | Risk management |
Saved in:
freely available
Saved in favorites
Similar items by subject
-
Risk budgeting using a generalized diversity index
Koumou, Gilles Boevi, (2023)
-
Financial Risk Meter for emerging markets
Ben Amor, Souhir, (2022)
-
Strategic Sourcing Under Severe Disruption Risk : Learning Failures and Under-Diversification Bias
Goldschmidt, Kyle, (2020)
- More ...
Similar items by person