4.8 Article

Distributed Dynamic Resource Management and Pricing in the IoT Systems With Blockchain-as-a-Service and UAV-Enabled Mobile Edge Computing

期刊

IEEE INTERNET OF THINGS JOURNAL
卷 7, 期 3, 页码 1974-1993

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2019.2961958

关键词

Bayesian methods; blockchain; deep learning; game theory; incomplete information; Internet of Things (IoT); Markov processes; mobile-edge computing (MEC); reinforcement learning (RL); resource allocation; unmanned aerial vehicles (UAVs)

资金

  1. National Natural Science Foundation of China [61950410603]
  2. Singapore NRF National Satellite of Excellence, Design Science and Technology for Secure Critical Infra-Structure NSoE [DeST-SCI2019-0007]
  3. Singapore MOE [2017-T1-002-007 RG122/17, MOE2014-T2-2-015 ARC4/15]
  4. Singapore NRF [2015-NRF-ISF001-2277]
  5. Singapore EMA Energy Resilience [NRF2017 EWT-EP003-041]
  6. A*STAR-NTUSUTD Joint Research Grant Call on Artificial Intelligence for the Future of Manufacturing RGANS 1906 [WASP/NTU M4082187 (4080)]

向作者/读者索取更多资源

In this article, we study the pricing and resource management in the Internet of Things (IoT) system with blockchain-as-a-service (BaaS) and mobile-edge computing (MEC). The BaaS model includes the cloud-based server to perform blockchain tasks and the set of peers to collect data from local IoT devices. The MEC model consists of the set of terrestrial and aerial base stations (BSs), i.e., unmanned aerial vehicles (UAVs), to forward the tasks of peers to the BaaS server. Each BS is also equipped with an MEC server to run some blockchain tasks. As the BSs can be privately owned or controlled by different operators, there is no information exchange among them. We show that the resource management and pricing in the BaaS-MEC system are modeled as a stochastic Stackelberg game with multiple leaders and incomplete information about actions of leaders/BSs and followers/peers. We formulate a novel hierarchical reinforcement learning (RL) algorithm for the decision makings of BSs and peers. We also develop an unsupervised hierarchical deep learning (HDL) algorithm that combines deep $Q$ -learning (DQL) for BSs with the Bayesian deep learning (BDL) for peers. We prove that the proposed algorithms converge to stable states in which the peers' actions are the best responses to optimal actions of BSs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据