Journal
MATHEMATICAL FINANCE
Volume 30, Issue 4, Pages 1273-1308
Publisher
WILEY
DOI: 10.1111/mafi.12281
Keywords
empirical study; entropy regularization; Gaussian distribution; mean-variance portfolio selection; policy improvement; reinforcement learning; simulation; stochastic control; theorem; value function
Funding
- Nie Center for Intelligent Asset Management at Columbia
- Columbia University
Abstract
We approach the continuous-time mean-variance portfolio selection problem with reinforcement learning (RL). The goal is to achieve the best trade-off between exploration and exploitation, and we formulate it as an entropy-regularized, relaxed stochastic control problem. We prove that the optimal feedback policy for this problem must be Gaussian, with time-decaying variance. We then prove a policy improvement theorem, based on which we devise an implementable RL algorithm. In our simulation and empirical studies, this algorithm and its variant outperform both traditional and deep neural-network-based algorithms.
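The abstract's central qualitative result, a Gaussian exploratory policy whose variance decays over time, can be sketched in code. This is a minimal illustration, not the paper's exact formulas: the parameter names (`rho`, `sigma`, `lam`, `target`) and the specific linear-feedback mean are assumptions made for demonstration.

```python
import numpy as np

def exploratory_policy_params(t, x, target=1.0, T=1.0,
                              rho=0.5, sigma=0.2, lam=0.1):
    """Return (mean, variance) of a Gaussian exploratory policy.

    Illustrative assumptions: the mean is a linear feedback on the gap
    between current wealth x and a target level, and the variance is
    proportional to the exploration temperature lam and decays as t
    approaches the horizon T -- mirroring the paper's qualitative
    finding of time-decaying exploration, not reproducing its formulas.
    """
    mean = -(rho / sigma) * (x - target)
    var = (lam / (2.0 * sigma**2)) * np.exp(rho**2 * (T - t))
    return mean, var

def sample_allocation(t, x, rng, **kwargs):
    """Draw one risky-asset allocation from the exploratory policy."""
    mean, var = exploratory_policy_params(t, x, **kwargs)
    return rng.normal(mean, np.sqrt(var))
```

For example, evaluating the policy at the start and at the end of the horizon shows the exploration variance shrinking while the state-feedback mean is unchanged for a fixed wealth level.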