Reinforcement learning for deep portfolio optimization

Portfolio optimization is an important financial task that has received widespread attention in the field of artificial intelligence. In this paper, a novel deep portfolio optimization (DPO) framework was proposed, combining deep learning and reinforcement learning with modern portfolio theory. DPO...

Full description

Saved in:

Bibliographic Details
Main Authors:	Ruyu Yan, Jiafei Jin, Kun Han
Format:	Article
Language:	English
Published:	AIMS Press 2024-09-01
Series:	Electronic Research Archive
Subjects:	portfolio optimization transaction costs risk management deep learning reinforcement learning
Online Access:	https://www.aimspress.com/article/doi/10.3934/era.2024239
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832590826867261440
author	Ruyu Yan Jiafei Jin Kun Han
author_facet	Ruyu Yan Jiafei Jin Kun Han
author_sort	Ruyu Yan
collection	DOAJ
description	Portfolio optimization is an important financial task that has received widespread attention in the field of artificial intelligence. In this paper, a novel deep portfolio optimization (DPO) framework was proposed, combining deep learning and reinforcement learning with modern portfolio theory. DPO not only has the advantages of machine learning methods in investment decision-making, but also retains the essence of modern portfolio theory in portfolio optimization. Additionaly, it was crucial to simultaneously consider the time series and complex asset correlations of financial market information. Therefore, in order to improve DPO performance, features of assets information were extracted and fused. In addition, a novel risk-cost reward function was proposed, which realized optimal portfolio decision-making considering transaction cost and risk factors through reinforcement learning. Our results showed the superiority and generalization of the DPO framework for portfolio optimization tasks. Experiments conducted on two real-world datasets validated that DPO achieved the highest accumulative portfolio value compared to other strategies, demonstrating strong profitability. Its Sharpe ratio and maximum drawdown also performed excellently, indicating good economic benefits and achieving a trade-off between portfolio returns and risk. Additionally, the extraction and fusion of financial information features can significantly improve the applicability and effectiveness of DPO.
format	Article
id	doaj-art-8a5191538429415ca081748abfe2b2b1
institution	Kabale University
issn	2688-1594
language	English
publishDate	2024-09-01
publisher	AIMS Press
record_format	Article
series	Electronic Research Archive
spelling	doaj-art-8a5191538429415ca081748abfe2b2b12025-01-23T07:52:41ZengAIMS PressElectronic Research Archive2688-15942024-09-013295176520010.3934/era.2024239Reinforcement learning for deep portfolio optimizationRuyu Yan0Jiafei Jin1Kun Han2School of Management, Harbin Institute of Technology, Harbin 150000, ChinaSchool of Management, Harbin Institute of Technology, Harbin 150000, ChinaFaculty of Computing, Harbin Institute of Technology, Harbin 150000, ChinaPortfolio optimization is an important financial task that has received widespread attention in the field of artificial intelligence. In this paper, a novel deep portfolio optimization (DPO) framework was proposed, combining deep learning and reinforcement learning with modern portfolio theory. DPO not only has the advantages of machine learning methods in investment decision-making, but also retains the essence of modern portfolio theory in portfolio optimization. Additionaly, it was crucial to simultaneously consider the time series and complex asset correlations of financial market information. Therefore, in order to improve DPO performance, features of assets information were extracted and fused. In addition, a novel risk-cost reward function was proposed, which realized optimal portfolio decision-making considering transaction cost and risk factors through reinforcement learning. Our results showed the superiority and generalization of the DPO framework for portfolio optimization tasks. Experiments conducted on two real-world datasets validated that DPO achieved the highest accumulative portfolio value compared to other strategies, demonstrating strong profitability. Its Sharpe ratio and maximum drawdown also performed excellently, indicating good economic benefits and achieving a trade-off between portfolio returns and risk. Additionally, the extraction and fusion of financial information features can significantly improve the applicability and effectiveness of DPO.https://www.aimspress.com/article/doi/10.3934/era.2024239portfolio optimizationtransaction costsrisk managementdeep learningreinforcement learning
spellingShingle	Ruyu Yan Jiafei Jin Kun Han Reinforcement learning for deep portfolio optimization Electronic Research Archive portfolio optimization transaction costs risk management deep learning reinforcement learning
title	Reinforcement learning for deep portfolio optimization
title_full	Reinforcement learning for deep portfolio optimization
title_fullStr	Reinforcement learning for deep portfolio optimization
title_full_unstemmed	Reinforcement learning for deep portfolio optimization
title_short	Reinforcement learning for deep portfolio optimization
title_sort	reinforcement learning for deep portfolio optimization
topic	portfolio optimization transaction costs risk management deep learning reinforcement learning
url	https://www.aimspress.com/article/doi/10.3934/era.2024239
work_keys_str_mv	AT ruyuyan reinforcementlearningfordeepportfoliooptimization AT jiafeijin reinforcementlearningfordeepportfoliooptimization AT kunhan reinforcementlearningfordeepportfoliooptimization

Reinforcement learning for deep portfolio optimization

Similar Items