Article

Learning agents for the multi-mode project scheduling problem

Journal

JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY
Volume 62, Issue 2, Pages 281-290

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1057/jors.2010.101

Keywords

project scheduling; multi-agent reinforcement learning; learning automata

Intelligent optimization refers to integrating learning mechanisms into (meta-)heuristic search. In this paper, we use multi-agent reinforcement learning to build high-quality solutions for the multi-mode resource-constrained project scheduling problem (MRCPSP). A network of distributed reinforcement learning agents cooperates to jointly learn a well-performing constructive heuristic. Each agent is responsible for one activity and uses two simple learning devices, called learning automata: one learns to select an order for the successor activities, the other learns to select a mode. By coupling the reward signals for both learning tasks, we can clearly show the advantage of using reinforcement learning in search. Comparative results show that our method can compete with the best-performing algorithms for the MRCPSP while using only simple learning schemes, without the burden of complex fine-tuning.

Journal of the Operational Research Society (2011) 62, 281-290. doi:10.1057/jors.2010.101. Published online 25 August 2010.
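The agent structure described in the abstract (one agent per activity, holding one learning automaton for the successor order and one for the mode, both driven by the same reward) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the abstract does not name the update rule, so the sketch uses the standard linear reward-inaction scheme, and the class names, the learning rate, and the toy reward (1 when a hypothetical "best" mode is picked, 0 otherwise) are invented. In the actual problem the reward would presumably depend on the quality of the constructed schedule.

```python
import itertools
import random


class LearningAutomaton:
    """Probability vector over actions, updated by linear reward-inaction.

    Assumption: the update scheme is chosen for illustration only.
    """

    def __init__(self, n_actions, learning_rate=0.1, rng=None):
        self.probs = [1.0 / n_actions] * n_actions
        self.learning_rate = learning_rate
        self.rng = rng if rng is not None else random.Random()

    def choose(self):
        # Sample an action index according to the current probabilities.
        return self.rng.choices(range(len(self.probs)), weights=self.probs)[0]

    def update(self, action, reward):
        # reward is expected in [0, 1]; a reward of 0 leaves probs unchanged.
        step = self.learning_rate * reward
        self.probs = [
            p + step * (1.0 - p) if i == action else p - step * p
            for i, p in enumerate(self.probs)
        ]


class ActivityAgent:
    """Hypothetical agent for one activity: an order automaton and a mode automaton."""

    def __init__(self, successors, n_modes, rng=None):
        self.orders = list(itertools.permutations(successors))
        self.order_la = LearningAutomaton(len(self.orders), rng=rng)
        self.mode_la = LearningAutomaton(n_modes, rng=rng)
        self._last = None

    def act(self):
        # Pick a successor order and a mode, and remember both choices.
        self._last = (self.order_la.choose(), self.mode_la.choose())
        return self.orders[self._last[0]], self._last[1]

    def learn(self, reward):
        # Coupled reward: both automata receive the same signal.
        order_idx, mode_idx = self._last
        self.order_la.update(order_idx, reward)
        self.mode_la.update(mode_idx, reward)


def demo(steps=500, seed=0):
    """Toy run: mode 2 is arbitrarily treated as the rewarded mode."""
    rng = random.Random(seed)
    agent = ActivityAgent(successors=["B", "C"], n_modes=3, rng=rng)
    for _ in range(steps):
        _, mode = agent.act()
        agent.learn(1.0 if mode == 2 else 0.0)
    return agent
```

Calling `demo()` returns an agent whose mode probabilities have shifted toward the rewarded mode. Because the update only moves probability mass toward a rewarded action, each probability vector keeps summing to 1 and no separate normalization step is needed.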
