4.8 Article

Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach

Journal

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.techfore.2023.122944

Keywords

Portfolio optimization; Deep reinforcement learning; Hyper-heuristic; Decision making; Uncertainty


This study proposes a novel DRL hyper-heuristic framework for the multi-period portfolio optimization problem. Compared with traditional DRL algorithms, the approach improves performance by searching over low-level trading strategies, and it leverages data-driven methods and multidimensional states to obtain additional information. Experimental results demonstrate significant performance gains on real-world capital market problem instances.
Portfolio optimization concerns periodically allocating limited funds across a variety of potential assets in order to satisfy investors' risk appetites and return goals. Recently, Deep Reinforcement Learning (DRL) has shown promising capabilities in sequential decision-making problems. However, traditional DRL algorithms operate directly in the space of low-level actions, which scales poorly and becomes intractable in real-world problem instances as the dimensionality of the environment increases. To address this, this work proposes a novel DRL hyper-heuristic framework for the multi-period portfolio optimization problem. Instead of searching the entire action domain, the proposed approach searches more effectively over well-developed low-level trading strategies. In addition, the approach is data driven and respects the nature of the problem: it takes advantage of expert domain knowledge and uses multidimensional states to leverage additional, diverse information from alternative views of the environment. The proposed approach is evaluated on five real-world capital market problem instances, and numerous experimental results demonstrate that it achieves notable performance gains over state-of-the-art trading strategies as well as a traditional DRL baseline method. The data come from five stock indices and cover the period from 2012 to 2022. The study may have salient policy implications for formulating investment strategies and establishing effective regulatory frameworks.
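The hyper-heuristic idea in the abstract, a high-level agent that picks among ready-made low-level trading strategies instead of choosing portfolio weights directly, can be illustrated with a toy sketch. This is not the authors' method: a simple epsilon-greedy bandit stands in for the DRL agent, the three strategies (`equal_weight`, `follow_winner`, `follow_loser`) are hypothetical placeholders, and the returns are synthetic rather than drawn from the five stock indices used in the paper.

```python
import random

# Hypothetical low-level strategies: each maps the return history
# (a list of per-period lists of asset returns) to portfolio weights.
def equal_weight(history):
    n = len(history[-1])
    return [1.0 / n] * n

def follow_winner(history):
    last = history[-1]
    best = max(range(len(last)), key=lambda i: last[i])
    return [1.0 if i == best else 0.0 for i in range(len(last))]

def follow_loser(history):
    last = history[-1]
    worst = min(range(len(last)), key=lambda i: last[i])
    return [1.0 if i == worst else 0.0 for i in range(len(last))]

STRATEGIES = [equal_weight, follow_winner, follow_loser]

def run_selector(returns, epsilon=0.1, seed=0):
    """Epsilon-greedy selection over strategies (a stand-in for a DRL agent)."""
    rng = random.Random(seed)
    counts = [0] * len(STRATEGIES)
    values = [0.0] * len(STRATEGIES)  # running mean reward per strategy
    wealth = 1.0
    choices = []
    for t in range(1, len(returns)):
        if rng.random() < epsilon:
            k = rng.randrange(len(STRATEGIES))  # explore
        else:
            k = max(range(len(STRATEGIES)), key=lambda i: values[i])  # exploit
        weights = STRATEGIES[k](returns[:t])  # strategy sees past data only
        period_return = sum(w * r for w, r in zip(weights, returns[t]))
        wealth *= 1.0 + period_return
        counts[k] += 1
        values[k] += (period_return - values[k]) / counts[k]
        choices.append(k)
    return wealth, choices

def synthetic_returns(periods=200, assets=4, seed=1):
    """Synthetic Gaussian returns, for illustration only."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0005, 0.01) for _ in range(assets)]
            for _ in range(periods)]

if __name__ == "__main__":
    final_wealth, picks = run_selector(synthetic_returns())
    print(final_wealth, [picks.count(k) for k in range(len(STRATEGIES))])
```

The selector's action space has only three entries, one per strategy, regardless of how many assets are traded. That is the scalability argument the abstract makes against operating directly in the space of low-level actions.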

Authors

