☆ 4.7 Article

Reinforcement Learning for Adaptive Caching With Dynamic Storage Pricing

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS (2019)

期刊

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS

卷 37, 期 10, 页码 2267-2281

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/JSAC.2019.2933780

关键词

Dynamic caching; fetching; dynamic programming; value iteration; Q-learning

类别

Engineering, Electrical & Electronic Telecommunications

资金

USA NSF [1508993, 1514056, 1711471, 1901134]
Spanish MINECO grant OMICROM [TEC2013-41604-R]
URJC Mobility Program

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Small base stations (SBs) of fifth-generation (SG) cellular networks are envisioned to have storage devices to locally serve requests for reusable and popular contents by caching them at the edge of the network, close to the end users. The ultimate goal is to smartly utilize a limited storage capacity to serve locally contents that are frequently requested instead of fetching them from the cloud, contributing to a better overall network performance and service experience. To enable the SBs with efficient fetch-cache decision-making schemes operating in dynamic settings, this paper introduces simple but flexible generic time-varying fetching and caching costs, which are then used to formulate a constrained minimization of the aggregate cost across files and time. Since caching decisions per time slot influence the content availability in future slots, the novel formulation for optimal fetch-cache decisions falls into the class of dynamic programming. Under this generic formulation, first by considering stationary distributions for the costs as well as file popularities, an efficient reinforcement learning-based solver known as value iteration algorithm can be used to solve the emerging optimization problem. Later, it is shown that practical limitations on cache capacity can be handled using a particular instance of this generic dynamic pricing formulation. Under this setting, to provide a light-weight online solver for the corresponding optimization, the well-known reinforcement learning algorithm, Q-learning, is employed to find optimal fetch-cache decisions. Numerical tests corroborating the merits of the proposed approach wrap up the paper.

Reinforcement Learning for Adaptive Caching With Dynamic Storage Pricing

期刊

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Reinforcement Learning for Adaptive Caching With Dynamic Storage Pricing

期刊

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文