4.5 Article

Comparing neuro-dynamic programming algorithms for the vehicle routing problem with stochastic demands

Journal

COMPUTERS & OPERATIONS RESEARCH
Volume 27, Issue 11-12, Pages 1201-1225

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/S0305-0548(99)00146-X

Keywords

stochastic vehicle routing; neuro-dynamic programming; rollout policies; heuristics

Ask authors/readers for more resources

The paper considers a version of the vehicle routing problem where customers' demands are uncertain. The focus is on dynamically routing a single vehicle to serve the demands of a known set of geographically dispersed customers during real-time operations. The goal consists of minimizing the expected distance traveled in order to serve all customers' demands. Since actual demand is revealed upon arrival of the vehicle at the location of each customer, fully exploiting this feature requires a dynamic approach. This work studies the suitability of the emerging field of neuro-dynamic programming (NDP) in providing approximate solutions to this difficult stochastic combinatorial optimization problem. The paper compares the performance of two NDP algorithms: optimistic approximate policy iteration and a rollout policy. While the former improves the performance of a nearest-neighbor policy by 2.3%, the computational results indicate that the rollout policy generates higher quality solutions. The implication for the practitioner is that the rollout policy is a promising candidate for vehicle routing applications where a dynamic approach is required.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available