Close the Gaps: A Learning-While-Doing Algorithm for Single-Product Revenue Management Problems

OPERATIONS RESEARCH (2014)

Journal

OPERATIONS RESEARCH

Volume 62, Issue 2, Pages 318-331

Publisher

INFORMS

DOI: 10.1287/opre.2013.1245

Categories

Management; Operations Research & Management Science

Funding

Major Program of the National Science Foundation of China [71320107001]
National Science Foundation of China [70971047, 71371078]
Program for New Century Talents in University, China [NCET-10-0382]
Research Fund for the Doctoral Program of Higher Education [RFDP 20110142110066]

Abstract

We consider a retailer selling a single product with limited on-hand inventory over a finite selling season. Customer demand arrives according to a Poisson process, the rate of which is influenced by a single action taken by the retailer (such as price adjustment, sales commission, or advertisement intensity). The relationship between the action and the demand rate is not known in advance. However, the retailer is able to learn the optimal action on the fly as she maximizes her total expected revenue based on the observed demand reactions. Using the pricing problem as an example, we propose a dynamic learning-while-doing algorithm that only involves function value estimation to achieve a near-optimal performance. Our algorithm employs a series of shrinking price intervals and iteratively tests prices within each interval using a set of carefully chosen parameters. We prove that the performance of our algorithm is among the best of all possible algorithms in terms of the asymptotic regret (the relative loss compared to the full information optimal solution). Our result closes the performance gaps between parametric and nonparametric learning and between the post-price mechanism and the customer-bidding mechanism. An important managerial insight from this research is that the value of information on both the parametric form of the demand function and each customer's exact reservation price is less important than prior literature suggests. Our results also suggest that firms would be better off performing dynamic learning and action concurrently rather than sequentially.
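
The idea of testing prices inside a series of shrinking intervals can be illustrated with a small simulation. The following is a minimal sketch only, not the algorithm from the paper: the function names, the equal-length test slots, the evenly spaced price grid, and the fixed `shrink` factor are all assumptions made for illustration, whereas the paper relies on carefully chosen parameters that the abstract does not specify.

```python
import random


def poisson_count(rng, rate, duration):
    """Count arrivals of a Poisson process with the given rate over a duration."""
    if rate <= 0 or duration <= 0:
        return 0
    elapsed, arrivals = 0.0, 0
    while True:
        elapsed += rng.expovariate(rate)  # exponential inter-arrival time
        if elapsed > duration:
            return arrivals
        arrivals += 1


def learn_while_selling(demand_rate, low, high, horizon, inventory,
                        rounds=4, grid_size=5, shrink=0.5, seed=0):
    """Hypothetical learning-while-doing loop over shrinking price intervals.

    demand_rate is the (unknown to the seller) map from price to Poisson rate;
    the seller only observes sales. Returns (revenue, final price interval).
    """
    rng = random.Random(seed)
    slot = horizon / (rounds * grid_size)  # assumption: equal test time per price
    revenue = 0.0
    for _ in range(rounds):
        step = (high - low) / (grid_size - 1)
        prices = [low + i * step for i in range(grid_size)]
        best_price, best_rate = prices[0], -1.0
        for price in prices:
            if inventory == 0:  # sold out: the selling season is over
                return revenue, (low, high)
            sold = min(poisson_count(rng, demand_rate(price), slot), inventory)
            inventory -= sold
            revenue += price * sold  # revenue is earned while learning
            est_rate = price * sold / slot  # function value estimate only
            if est_rate > best_rate:
                best_price, best_rate = price, est_rate
        # Shrink the interval around the empirically best price.
        half_width = shrink * (high - low) / 2
        low, high = max(low, best_price - half_width), min(high, best_price + half_width)
    return revenue, (low, high)


if __name__ == "__main__":
    # Assumed linear demand curve, used only to drive the simulation.
    total, interval = learn_while_selling(lambda p: 10.0 - p, 1.0, 9.0,
                                          horizon=100.0, inventory=10_000)
    print(total, interval)
```

Because sales made during the test slots count toward revenue, learning and earning happen concurrently in this sketch, which is the behavior the abstract contrasts with learning first and pricing afterward.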
