Risk-averse trust region optimization for reward-volatility reduction

Lorenzo Bisi; Luca Sabbioni; Edoardo Vittori; Matteo Papini; Marcello Restelli

Conference ProceedingsOPEN ACCESS

Risk-averse trust region optimization for reward-volatility reduction

IJCAI International Joint Conference on Artificial Intelligence (2020) 2021-January 4583-4589

DOI: 10.24963/ijcai.2020/632

28Citations

23Readers

Abstract

The use of reinforcement learning in algorithmic trading is of growing interest, since it offers the opportunity of making profit through the development of autonomous artificial traders, that do not depend on hard-coded rules. In such a framework, keeping uncertainty under control is as important as maximizing expected returns. Risk aversion has been addressed in reinforcement learning through measures related to the distribution of returns. However, in trading it is essential to keep under control the risk of portfolio positions in the intermediate steps. In this paper, we define a novel measure of risk, which we call reward volatility, consisting of the variance of the rewards under the state-occupancy measure. This new risk measure is shown to bound the return variance so that reducing the former also constrains the latter. We derive a policy gradient theorem with a new objective function that exploits the mean-volatility relationship. Furthermore, we adapt TRPO, the well-known policy gradient algorithm with monotonic improvement guarantees, in a risk-averse manner. Finally, we test the proposed approach in two financial environments using real market data.

Cite

CITATION STYLE

APA

Bisi, L., Sabbioni, L., Vittori, E., Papini, M., & Restelli, M. (2020). Risk-averse trust region optimization for reward-volatility reduction. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2021-January, pp. 4583–4589). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2020/632

Risk-averse trust region optimization for reward-volatility reduction

Abstract

Cite

Register to see more suggestions