4.7 Article

Multi-Agent Imitation Learning for Pervasive Edge Computing: A Decentralized Computation Offloading Algorithm

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TPDS.2020.3023936

关键词

Edge computing; Task analysis; Performance evaluation; Computational modeling; Games; Processor scheduling; Cloud computing; Pervasive edge computing; computation offloading; imitation learning; decentralized execution

资金

  1. Hong Kong RGC Research Impact Fund (RIF) [R5060-19]
  2. General Research Fund (GRF) [152221/19E]
  3. NationalNatural Science Foundation of China [61872310, 61971084, 62001073]

向作者/读者索取更多资源

Pervasive edge computing is a decentralized computing approach where users schedule based on their own utility, but ensuring fairness among devices is a challenge. Researchers propose an algorithm based on game theory and imitation learning to reduce task completion time and achieve significant advantages.
Pervasive edge computing refers to one kind of edge computing that merely relies on edge devices with sensing, storage and communication abilities to realize peer-to-peer offloading without centralized management. Due to lack of unified coordination, users always pursue profits by maximizing their own utilities. However, on one hand, users may not make appropriate scheduling decisions based on their local observations. On the other hand, how to guarantee the fairness among different edge devices in the fully decentralized environment is rather challenging. To solve the above issues, we propose a decentrailized computation offloading algorithm with the purpose of minimizing average task completion time in the pervasive edge computing networks. We first derive a Nash equilibrium among devices by stochastic game theories based on the full observations of system states. After that, we design a traffic offloading algorithm based on partial observations by integrating general adversarial imitation learning. Multiple experts can provide demonstrations, so that devices can mimic the behaviors of corresponding experts by minimizing the gaps between the distributions of their observation-action pairs. At last, theoretical and performance results show that our solution has a significant advantage compared with other representative algorithms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据