4.8 Article

Dual-Timescale Resource Allocation for Collaborative Service Caching and Computation Offloading in IoT Systems

期刊

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
卷 19, 期 2, 页码 1735-1746

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TII.2022.3186039

关键词

Collaborative service caching and computing; edge computing; hierarchical reinforcement learning; Internet of Things (IoT)

向作者/读者索取更多资源

Edge computing is important for future Internet of Things systems, as it can reduce service latency and energy consumption by offloading computational tasks to edge servers. Caching appropriate services in the edge server can improve the quality of service, but it requires joint optimization of resource allocation considering different timescales of caching and offloading operations. This article proposes a novel hierarchical deep reinforcement learning scheme to optimize collaborative service caching and computation offloading.
Edge computing has been envisioned as a key enabler to provide computation-intensive and delay-sensitive services in the future Internet of Things systems. By offloading the computational tasks to the edge server, both the service latency and energy consumption can be reduced. Since devices may request various types of computing services, caching appropriate services in the edge server to immediately provide computing resources can improve the quality of service. Nevertheless, it brings new challenges to jointly optimize the resource allocation, where the timeliness of caching and offloading operations are different. In this article, we first formulate the collaborative service caching and computation offloading as a dual-timescale resource allocation problem to minimize the costs of latency and energy consumption. Under this framework, a novel scheme based on hierarchical deep reinforcement learning is proposed to output collaborative caching and computing actions. Specifically, the proposed approach contains the service caching policy and the device computing policy with hierarchical action-value functions, which allows a flexible configuration of caching timescales. The simulation results demonstrate that the proposed policy outperforms the existing schemes on convergence performance and various parameters.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据