Reinforcement learning for deep portfolio optimization

Portfolio optimization is an important financial task that has received widespread attention in the field of artificial intelligence. In this paper, a novel deep portfolio optimization (DPO) framework was proposed, combining deep learning and reinforcement learning with modern portfolio theory. DPO...

Full description

Saved in:
Bibliographic Details
Main Authors: Ruyu Yan, Jiafei Jin, Kun Han
Format: Article
Language:English
Published: AIMS Press 2024-09-01
Series:Electronic Research Archive
Subjects:
Online Access:https://www.aimspress.com/article/doi/10.3934/era.2024239
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832590826867261440
author Ruyu Yan
Jiafei Jin
Kun Han
author_facet Ruyu Yan
Jiafei Jin
Kun Han
author_sort Ruyu Yan
collection DOAJ
description Portfolio optimization is an important financial task that has received widespread attention in the field of artificial intelligence. In this paper, a novel deep portfolio optimization (DPO) framework was proposed, combining deep learning and reinforcement learning with modern portfolio theory. DPO not only has the advantages of machine learning methods in investment decision-making, but also retains the essence of modern portfolio theory in portfolio optimization. Additionaly, it was crucial to simultaneously consider the time series and complex asset correlations of financial market information. Therefore, in order to improve DPO performance, features of assets information were extracted and fused. In addition, a novel risk-cost reward function was proposed, which realized optimal portfolio decision-making considering transaction cost and risk factors through reinforcement learning. Our results showed the superiority and generalization of the DPO framework for portfolio optimization tasks. Experiments conducted on two real-world datasets validated that DPO achieved the highest accumulative portfolio value compared to other strategies, demonstrating strong profitability. Its Sharpe ratio and maximum drawdown also performed excellently, indicating good economic benefits and achieving a trade-off between portfolio returns and risk. Additionally, the extraction and fusion of financial information features can significantly improve the applicability and effectiveness of DPO.
format Article
id doaj-art-8a5191538429415ca081748abfe2b2b1
institution Kabale University
issn 2688-1594
language English
publishDate 2024-09-01
publisher AIMS Press
record_format Article
series Electronic Research Archive
spelling doaj-art-8a5191538429415ca081748abfe2b2b12025-01-23T07:52:41ZengAIMS PressElectronic Research Archive2688-15942024-09-013295176520010.3934/era.2024239Reinforcement learning for deep portfolio optimizationRuyu Yan0Jiafei Jin1Kun Han2School of Management, Harbin Institute of Technology, Harbin 150000, ChinaSchool of Management, Harbin Institute of Technology, Harbin 150000, ChinaFaculty of Computing, Harbin Institute of Technology, Harbin 150000, ChinaPortfolio optimization is an important financial task that has received widespread attention in the field of artificial intelligence. In this paper, a novel deep portfolio optimization (DPO) framework was proposed, combining deep learning and reinforcement learning with modern portfolio theory. DPO not only has the advantages of machine learning methods in investment decision-making, but also retains the essence of modern portfolio theory in portfolio optimization. Additionaly, it was crucial to simultaneously consider the time series and complex asset correlations of financial market information. Therefore, in order to improve DPO performance, features of assets information were extracted and fused. In addition, a novel risk-cost reward function was proposed, which realized optimal portfolio decision-making considering transaction cost and risk factors through reinforcement learning. Our results showed the superiority and generalization of the DPO framework for portfolio optimization tasks. Experiments conducted on two real-world datasets validated that DPO achieved the highest accumulative portfolio value compared to other strategies, demonstrating strong profitability. Its Sharpe ratio and maximum drawdown also performed excellently, indicating good economic benefits and achieving a trade-off between portfolio returns and risk. Additionally, the extraction and fusion of financial information features can significantly improve the applicability and effectiveness of DPO.https://www.aimspress.com/article/doi/10.3934/era.2024239portfolio optimizationtransaction costsrisk managementdeep learningreinforcement learning
spellingShingle Ruyu Yan
Jiafei Jin
Kun Han
Reinforcement learning for deep portfolio optimization
Electronic Research Archive
portfolio optimization
transaction costs
risk management
deep learning
reinforcement learning
title Reinforcement learning for deep portfolio optimization
title_full Reinforcement learning for deep portfolio optimization
title_fullStr Reinforcement learning for deep portfolio optimization
title_full_unstemmed Reinforcement learning for deep portfolio optimization
title_short Reinforcement learning for deep portfolio optimization
title_sort reinforcement learning for deep portfolio optimization
topic portfolio optimization
transaction costs
risk management
deep learning
reinforcement learning
url https://www.aimspress.com/article/doi/10.3934/era.2024239
work_keys_str_mv AT ruyuyan reinforcementlearningfordeepportfoliooptimization
AT jiafeijin reinforcementlearningfordeepportfoliooptimization
AT kunhan reinforcementlearningfordeepportfoliooptimization