4.8 Article

Joint Multiuser DNN Partitioning and Computational Resource Allocation for Collaborative Edge Intelligence

期刊

IEEE INTERNET OF THINGS JOURNAL
卷 8, 期 12, 页码 9511-9522

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2020.3010258

关键词

Servers; Computational modeling; Task analysis; Resource management; Artificial intelligence; Internet of Things; Optimization; Computation offloading; computational resource allocation; deep neural network (DNN) partitioning; mobile-edge computing (MEC)

资金

  1. National Key Research and Development Program of China [2017YFB1001703]
  2. National Science Foundation of China [U1711265, 61972432]
  3. Program for Guangdong Introducing Innovative and Entrepreneurial Teams [2017ZT07X355]
  4. Pearl River Talent Recruitment Program [2017GC010465]

向作者/读者索取更多资源

Mobile-edge computing (MEC) serves as a promising architecture supporting edge intelligence services by providing resources to the network edge. Optimizing DNN partitioning and resource allocation in a multiuser resource-constrained environment is a key research area.
Mobile-edge computing (MEC) has emerged as a promising supporting architecture providing a variety of resources to the network edge, thus acting as an enabler for edge intelligence services empowering massive mobile and Internet-of-Things (IoT) devices with artificial intelligence (AI) capability. With the assistance of edge servers, user equipments (UEs) are able to run deep neural network (DNN)-based AI applications, which are generally resource hungry and computation intensive such that an individual UE can hardly afford by itself in real time. However, the resources in each individual edge server are typically limited. Therefore, any resource optimization involving edge servers is by nature a resource-constrained optimization problem and needs to be tackled in such a realistic context. Motivated by this observation, we investigate the optimization problem of DNN partitioning (an emerging DNN offloading scheme) in a realistic multiuser resource-constrained condition that rarely considered in previous works. Despite the extremely large solution space, we reveal several properties of this specific optimization problem of joint multi-UE DNN partitioning and computational resource allocation. We propose an algorithm called iterative alternating optimization (IAO) that can achieve the optimal solution in polynomial time. In addition, we present a rigorous theoretic analysis of our algorithm in terms of time complexity and performance under realistic estimation error. Moreover, we build a prototype that implements our framework and conducts extensive experiments using realistic DNN models, whose results demonstrate its effectiveness and efficiency.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据