Similar Search Results

Exploiting distributional temporal difference learning to deal with tail risk

Bossaerts, Peter L.; Huang, Shijie; Yadav, Nitin - In: Risks 8 (2020) 4, pp. 1-20

In traditional Reinforcement Learning (RL), agents learn to optimize actions in a dynamic context based on recursive estimation of expected values. We show that this form of machine learning fails when rewards (returns) are affected by tail risk, i.e., leptokurtosis. Here, we adapt a recent...

Persistent link: https://www.econbiz.de/10013200646

"Lucas" in the laboratory

Asparouhova, Elena; Bossaerts, Peter; Roy, Nilanjan; … - 2015

The Lucas asset pricing model is studied here in a controlled setting. Participants could trade two long-lived securities in a continuous open-book system. The experimental design emulated the stationary, infinite-horizon setting of the model and incentivized participants to smooth consumption...

Persistent link: https://www.econbiz.de/10011381924