Showing 1 - 10 of 33
Persistent link: https://www.econbiz.de/10011888709
Persistent link: https://www.econbiz.de/10014491079
In traditional Reinforcement Learning (RL), agents learn to optimize actions in a dynamic context based on recursive estimation of expected values. We show that this form of machine learning fails when rewards (returns) are affected by tail risk, i.e., leptokurtosis. Here, we adapt a recent...
Persistent link: https://www.econbiz.de/10012387629
Persistent link: https://www.econbiz.de/10014547375
Persistent link: https://www.econbiz.de/10011982686
Persistent link: https://www.econbiz.de/10011637571
Persistent link: https://www.econbiz.de/10011951116
Persistent link: https://www.econbiz.de/10011953559
Persistent link: https://www.econbiz.de/10013167518
Persistent link: https://www.econbiz.de/10012162015