Journal
2020 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-BESANCON 2020)
Volume -, Issue -, Pages 349-353Publisher
IEEE COMPUTER SOC
DOI: 10.1109/PHM-Besancon49106.2020.00068
Keywords
deep reinforcement learning; actor-critic; pointer network; multi-head attention; permutation flowshop
Categories
Ask authors/readers for more resources
Permutation flowshop problem is a classic problem in combinatorial optimization. In this paper, we propose a deep reinforcement learning (DRL) model with heterogeneous network according to the different task characteristics of actor and critic in actor-critic model. The actor, acting as the policy network, is mainly responsible for strategy search and composed of LSTMs; the critic, acting as the value network, is mainly responsible for strategy evaluation and formed by the attention network. In order to increase the exploration ability of the model, we adopt the epsilon-greedy strategy to further improve the effectiveness of the model. The experimental results on multiple data sets show that our model achieves better performance on permutation flowshop problem.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available