4.7 Article

Partially Observable Minimum-Age Scheduling: The Greedy Policy

期刊

IEEE TRANSACTIONS ON COMMUNICATIONS
卷 70, 期 1, 页码 404-418

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCOMM.2021.3123362

关键词

Indexes; Markov processes; Costs; Robot sensing systems; Wireless sensor networks; Unmanned aerial vehicles; Surveillance; Age of information; multi-armed bandit; greedy policy; POMDP; recurrence relation

资金

  1. RGC General Research Funds [14205020]
  2. CUHK [4055126]

向作者/读者索取更多资源

This paper investigates the minimum-age scheduling problem in wireless sensor networks and proposes a greedy policy to minimize the expected age-of-information. By introducing a relaxed greedy policy and formulating the sampling process of each arm as a partially observable Markov decision process, the paper validates that the relaxed greedy policy is an effective approximation to the greedy policy in terms of expected age-of-information.
This paper studies the minimum-age scheduling problem in a wireless sensor network where an access point (AP) monitors the state of an object via a set of sensors. The freshness of the sensed state, measured by the age-of-information (AoI), varies at different sensors and is not directly observable to the AP. The AP has to decide which sensor to query/sample in order to get the most updated state information of the object (i.e., the state information with the minimum AoI). In this paper, we formulate the minimum-age scheduling problem as a multi-armed bandit problem with partially observable arms and explore the greedy policy to minimize the expected AoI sampled over an infinite horizon. To analyze the performance of the greedy policy, we 1) put forth a relaxed greedy policy that decouples the sampling processes of the arms, 2) formulate the sampling process of each arm as a partially observable Markov decision process (POMDP), and 3) derive the average sampled AoI under the relaxed greedy policy as a sum of the average AoI sampled from individual arms. Numerical and simulation results validate that the relaxed greedy policy is an excellent approximation to the greedy policy in terms of the expected AoI sampled over an infinite horizon.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据