4.7 Article

Dynamic Inventory Repositioning in On-Demand Rental Networks

Journal

MANAGEMENT SCIENCE
Volume 68, Issue 11, Pages 7861-7878

Publisher

INFORMS
DOI: 10.1287/mnsc.2021.4286

Keywords

rental networks; inventory repositioning; optimal policies; approximate dynamic programming algorithms; stochastic dual dynamic programming

Funding

  1. US National Science Foundation (NS) [SCC-1831140]
  2. Singapore Ministry of Education [R-266-000-128-133]
  3. Extreme Science and Engineering Discovery Environment [NSF] [ACI-1548562]
  4. Bridges-2 system [NSF] [ACI-1445606]

Ask authors/readers for more resources

The paper explores an optimal policy for systems with a general network structure in the context of on-demand rental services. It demonstrates that the optimal policy can be described in terms of a specific region in the state space and proposes a provably convergent approximate dynamic programming algorithm to handle high-dimensional problems.
We consider a rental service with a fixed number of rental units distributed across multiple locations. The units are accessed by customers without prior reservation and on an on-demand basis. Customers can decide on how long to keep a unit and where to return it. Because of the randomness in demand and in returns, there is a need to periodically reposition inventory away from some locations and into others. In deciding on how much inventory to reposition and where, the system manager balances potential lost sales with repositioning costs. Although the problem is increasingly common in applications involving on-demand rental services, not much is known about the nature of the optimal policy for systems with a general network structure or about effective approaches to solving the problem. In this paper, first, we show that the optimal policy in each period can be described in terms of a well-specified region over the state space. Within this region, it is optimal not to reposition any inventory, whereas, outside the region, it is optimal to reposition but only such that the system moves to a new state that is on the boundary of the no-repositioning region. We also provide a simple check for when a state is in the no-repositioning region. Second, we leverage the features of the optimal policy, along with properties of the optimal cost function, to propose a provably convergent approximate dynamic programming algorithm to tackle problems with a large number of dimensions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available