This paper shows how reinforcement learning can be used to derive optimal hedging strategies for derivatives when there … the basic reinforcement learning approach in a number of ways. First, it uses two different Q-functions so that both the …. This approach increases the range of objective functions that can be used. Second, it uses a learning algorithm that allows …