Reinforcement learning for deep portfolio optimization
Portfolio optimization is an important financial task that has received widespread attention in the field of artificial intelligence. In this paper, a novel deep portfolio optimization (DPO) framework was proposed, combining deep learning and reinforcement learning with modern portfolio theory. DPO...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
AIMS Press
2024-09-01
|
Series: | Electronic Research Archive |
Subjects: | |
Online Access: | https://www.aimspress.com/article/doi/10.3934/era.2024239 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832590826867261440 |
---|---|
author | Ruyu Yan Jiafei Jin Kun Han |
author_facet | Ruyu Yan Jiafei Jin Kun Han |
author_sort | Ruyu Yan |
collection | DOAJ |
description | Portfolio optimization is an important financial task that has received widespread attention in the field of artificial intelligence. In this paper, a novel deep portfolio optimization (DPO) framework was proposed, combining deep learning and reinforcement learning with modern portfolio theory. DPO not only has the advantages of machine learning methods in investment decision-making, but also retains the essence of modern portfolio theory in portfolio optimization. Additionaly, it was crucial to simultaneously consider the time series and complex asset correlations of financial market information. Therefore, in order to improve DPO performance, features of assets information were extracted and fused. In addition, a novel risk-cost reward function was proposed, which realized optimal portfolio decision-making considering transaction cost and risk factors through reinforcement learning. Our results showed the superiority and generalization of the DPO framework for portfolio optimization tasks. Experiments conducted on two real-world datasets validated that DPO achieved the highest accumulative portfolio value compared to other strategies, demonstrating strong profitability. Its Sharpe ratio and maximum drawdown also performed excellently, indicating good economic benefits and achieving a trade-off between portfolio returns and risk. Additionally, the extraction and fusion of financial information features can significantly improve the applicability and effectiveness of DPO. |
format | Article |
id | doaj-art-8a5191538429415ca081748abfe2b2b1 |
institution | Kabale University |
issn | 2688-1594 |
language | English |
publishDate | 2024-09-01 |
publisher | AIMS Press |
record_format | Article |
series | Electronic Research Archive |
spelling | doaj-art-8a5191538429415ca081748abfe2b2b12025-01-23T07:52:41ZengAIMS PressElectronic Research Archive2688-15942024-09-013295176520010.3934/era.2024239Reinforcement learning for deep portfolio optimizationRuyu Yan0Jiafei Jin1Kun Han2School of Management, Harbin Institute of Technology, Harbin 150000, ChinaSchool of Management, Harbin Institute of Technology, Harbin 150000, ChinaFaculty of Computing, Harbin Institute of Technology, Harbin 150000, ChinaPortfolio optimization is an important financial task that has received widespread attention in the field of artificial intelligence. In this paper, a novel deep portfolio optimization (DPO) framework was proposed, combining deep learning and reinforcement learning with modern portfolio theory. DPO not only has the advantages of machine learning methods in investment decision-making, but also retains the essence of modern portfolio theory in portfolio optimization. Additionaly, it was crucial to simultaneously consider the time series and complex asset correlations of financial market information. Therefore, in order to improve DPO performance, features of assets information were extracted and fused. In addition, a novel risk-cost reward function was proposed, which realized optimal portfolio decision-making considering transaction cost and risk factors through reinforcement learning. Our results showed the superiority and generalization of the DPO framework for portfolio optimization tasks. Experiments conducted on two real-world datasets validated that DPO achieved the highest accumulative portfolio value compared to other strategies, demonstrating strong profitability. Its Sharpe ratio and maximum drawdown also performed excellently, indicating good economic benefits and achieving a trade-off between portfolio returns and risk. Additionally, the extraction and fusion of financial information features can significantly improve the applicability and effectiveness of DPO.https://www.aimspress.com/article/doi/10.3934/era.2024239portfolio optimizationtransaction costsrisk managementdeep learningreinforcement learning |
spellingShingle | Ruyu Yan Jiafei Jin Kun Han Reinforcement learning for deep portfolio optimization Electronic Research Archive portfolio optimization transaction costs risk management deep learning reinforcement learning |
title | Reinforcement learning for deep portfolio optimization |
title_full | Reinforcement learning for deep portfolio optimization |
title_fullStr | Reinforcement learning for deep portfolio optimization |
title_full_unstemmed | Reinforcement learning for deep portfolio optimization |
title_short | Reinforcement learning for deep portfolio optimization |
title_sort | reinforcement learning for deep portfolio optimization |
topic | portfolio optimization transaction costs risk management deep learning reinforcement learning |
url | https://www.aimspress.com/article/doi/10.3934/era.2024239 |
work_keys_str_mv | AT ruyuyan reinforcementlearningfordeepportfoliooptimization AT jiafeijin reinforcementlearningfordeepportfoliooptimization AT kunhan reinforcementlearningfordeepportfoliooptimization |