Showing 181 - 190 of 5,915
We propose a unified framework to study policy evaluation (PE) and the associated temporal difference (TD) methods for reinforcement learning in continuous time and space. We show that PE is equivalent to maintaining the martingale condition of a process. From this perspective, we find that the...
Persistent link: https://www.econbiz.de/10013213830
We consider reinforcement learning (RL) in continuous time and study the problem of achieving the best trade-off between exploration of a black box environment and exploitation of current knowledge. We propose an entropy-regularized reward function involving the differential entropy of the...
Persistent link: https://www.econbiz.de/10014033099
We formulate and carry out an analytical treatment of a single-period portfolio choice model featuring a reference point in wealth, S-shaped utility (value) functions with loss aversion, and probability weighting under Kahneman and Tversky's cumulative prospect theory (CPT). We introduce a new...
Persistent link: https://www.econbiz.de/10013134324
In this work, the time chart of Dow Jones Industrial Average (DJIA) index is analyzed and approach of recession time term is predicted, which may be hallmark of a worldwide economic crisis. However, the methods used for the prediction will be disclosed a few years from now. On the other hand,...
Persistent link: https://www.econbiz.de/10008511740
This paper establishes a non-stochastic analogue of the celebrated result by Dubins and Schwarz about reduction of continuous martingales to Brownian motion via time change. We consider an idealized financial security with continuous price path, without making any stochastic assumptions. It is...
Persistent link: https://www.econbiz.de/10008511741
This paper considers an optimal control of a big financial company with debt liability under bankrupt probability constraints. The company, which faces constant liability payments and has choices to choose various production/business policies from an available set of control policies with...
Persistent link: https://www.econbiz.de/10008511742
As a consequence of the dependence experienced in loan portfolios, the standard binomial test which is based on the assumption of independence does not appear appropriate for validating probabilities of default (PDs). The model underlying the new rules for minimum capital requirements (Basle II)...
Persistent link: https://www.econbiz.de/10009203574
This paper elaborates on the validation requirements for rating systems and probabilities of default (PDs) which were introduced with the New Capital Standards (Basel II). We start in Section 2 with some introductory remarks on the topics and approaches that will be discussed later on. Then we...
Persistent link: https://www.econbiz.de/10009203575
The pricing and hedging of a general class of options (including American, Bermudan and European options) on multiple assets are studied in the context of currency markets where trading is subject to proportional transaction costs, and where the existence of a risk-free num\'eraire is not...
Persistent link: https://www.econbiz.de/10009203576
Order flow in equity markets is remarkably persistent in the sense that order signs (to buy or sell) are positively autocorrelated out to time lags of tens of thousands of orders, corresponding to many days. Two possible explanations are herding, corresponding to positive correlation in the...
Persistent link: https://www.econbiz.de/10009203577