Journal
OPERATIONS RESEARCH
Volume -, Issue -, Pages -
Publisher
INFORMS
DOI: 10.1287/opre.2022.2305
Keywords
Markov decision process; average cost; optimality inequality; partial observations; lost sales
This paper investigates a partially observable lost-sales inventory system and proves the existence of a stationary optimal policy for average cost minimization using the vanishing discount factor approach. The key contribution of this study is a method to verify the uniform boundedness of the relative discounted value function, a crucial condition in the vanishing discount factor approach. Additionally, a valid policy is constructed to "copy" the actions of another policy for a process with a different initial state.
We consider a partially observable lost-sales inventory system, in which the inventory level is observed only when it reaches zero. We use the vanishing discount factor approach to prove the existence of a stationary optimal policy for average cost minimization. As our main methodological contribution, we provide a way to verify the key condition of the vanishing discount factor approach: the uniform boundedness of the relative discounted value function. To accomplish that, we construct a valid policy, which, in a certain sense, "copies" the actions of another policy for the process with a different initial state. To the best of our knowledge, this paper is the first one on partially observable inventory models under the average cost criterion.
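For orientation, the vanishing discount factor approach named in the abstract can be sketched in generic notation. This is the standard outline for a general Markov decision process, not the paper's own formulation: the symbols $v_\beta$, $m_\beta$, $h_\beta$, $c$, $q$, $A(x)$, $\rho$, and $h$ are assumptions introduced here for illustration, and in a partially observable model the state $x$ would typically be a belief (information) state.

```latex
% Assumed notation: v_\beta is the optimal expected total discounted cost
% for discount factor \beta \in (0,1); h_\beta is the relative discounted value function.
v_\beta(x) = \inf_{\pi} \mathbb{E}_x^{\pi} \sum_{t=0}^{\infty} \beta^{t}\, c(x_t, a_t),
\qquad
m_\beta = \inf_{x} v_\beta(x),
\qquad
h_\beta(x) = v_\beta(x) - m_\beta .

% Boundedness condition, written in a commonly used pointwise form
% (the abstract refers to uniform boundedness; the paper's exact statement may differ).
\sup_{\beta \in [\beta_0,\, 1)} h_\beta(x) < \infty
\quad \text{for every state } x \text{ and some } \beta_0 \in (0,1).

% Average cost optimality inequality, with \rho a limit point of (1-\beta)\, m_\beta
% and h a lower limit of h_\beta as \beta \uparrow 1.
\rho + h(x) \;\ge\; \min_{a \in A(x)}
\Big[\, c(x,a) + \int h(y)\, q(dy \mid x, a) \Big].
```

Under suitable continuity and compactness assumptions, a stationary policy that attains the minimum on the right-hand side has average cost $\rho$, which is how the boundedness condition leads to the existence result described above.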