Abstract
The task of portfolio management is the selection of portfolio allocations for every single time step during an investment period while adjusting the risk-return profile of the portfolio to the investor's individual level of risk preference. In practice, it can be hard for investors to quantify their individual risk preference. As an alternative, approximating the risk-return Pareto front allows for the comparison of different optimized portfolio allocations and hence for the selection of the most suitable risk level. Furthermore, an approximation of the Pareto front allows the analysis of the overall risk sensitivity of various investment policies. In this paper, we propose a deep reinforcement learning (RL) based approach, in which a single meta agent generates optimized portfolio allocation policies for any level of risk preference in a given interval. Our method is more efficient than previous approaches, as it requires training only a single agent for the full approximate risk-return Pareto front. Additionally, it is more stable in training and only requires per-time-step market risk estimations that are independent of the policy. Such per-time-step risk control is a common regulatory requirement for, e.g., insurance companies. We benchmark our meta agent against other state-of-the-art risk-aware RL methods using a realistic environment based on real-world Nasdaq-100 data. Our evaluation shows that the proposed meta agent outperforms various benchmark approaches by generating strategies with better risk-return profiles.
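To illustrate the idea of a single meta agent covering a whole interval of risk preferences, the following minimal sketch shows a risk-preference-conditioned policy network in Python/PyTorch. It is an assumption-laden illustration, not the authors' implementation: the class name `RiskConditionedPolicy`, the network architecture, the uniform sampling of the risk-preference level, and the long-only softmax allocation are all hypothetical choices made here for clarity.

```python
import torch
import torch.nn as nn


class RiskConditionedPolicy(nn.Module):
    """Illustrative meta-agent policy: maps a market state plus a scalar
    risk-preference level to portfolio weights over n_assets
    (long-only, weights sum to 1 via softmax)."""

    def __init__(self, state_dim: int, n_assets: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + 1, hidden),  # +1 input for the risk-preference level
            nn.ReLU(),
            nn.Linear(hidden, hidden),
            nn.ReLU(),
            nn.Linear(hidden, n_assets),
        )

    def forward(self, state: torch.Tensor, risk_pref: torch.Tensor) -> torch.Tensor:
        # Concatenate the market state with the risk-preference scalar.
        x = torch.cat([state, risk_pref.unsqueeze(-1)], dim=-1)
        return torch.softmax(self.net(x), dim=-1)  # portfolio allocation


# Hypothetical usage: sampling a risk preference per episode lets one agent
# cover the whole interval; sweeping it at evaluation time traces out an
# approximate risk-return Pareto front.
policy = RiskConditionedPolicy(state_dim=32, n_assets=100)
state = torch.randn(4, 32)   # batch of market-state features (illustrative)
lam = torch.rand(4)          # risk-preference levels sampled from [0, 1]
weights = policy(state, lam) # shape (4, 100), each row sums to 1
```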
| Document type: | Conference paper |
|---|---|
| Faculty: | Mathematics, Informatics and Statistics > Computer Science |
| Subject areas: | 000 Computer science, information & general works > 004 Computer science |
| ISBN: | 978-3-031-26421-4; 978-3-031-26422-1 |
| Place: | Cham |
| Language: | English |
| Document ID: | 121867 |
| Date published on Open Access LMU: | 29 Oct 2024 12:35 |
| Last modified: | 29 Oct 2024 12:35 |