Article

Partially Observable Minimum-Age Scheduling: The Greedy Policy

Journal

IEEE TRANSACTIONS ON COMMUNICATIONS
Volume 70, Issue 1, Pages 404-418

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCOMM.2021.3123362

Keywords

Indexes; Markov processes; Costs; Robot sensing systems; Wireless sensor networks; Unmanned aerial vehicles; Surveillance; Age of information; multi-armed bandit; greedy policy; POMDP; recurrence relation

Funding

  1. RGC General Research Funds [14205020]
  2. CUHK [4055126]

This paper investigates the minimum-age scheduling problem in wireless sensor networks and explores a greedy policy for minimizing the expected age-of-information (AoI). It introduces a relaxed greedy policy that decouples the arms, models the sampling process of each arm as a partially observable Markov decision process, and shows through numerical and simulation results that the relaxed greedy policy closely approximates the greedy policy in terms of expected AoI.
This paper studies the minimum-age scheduling problem in a wireless sensor network where an access point (AP) monitors the state of an object via a set of sensors. The freshness of the sensed state, measured by the age-of-information (AoI), varies at different sensors and is not directly observable to the AP. The AP has to decide which sensor to query/sample in order to get the most updated state information of the object (i.e., the state information with the minimum AoI). In this paper, we formulate the minimum-age scheduling problem as a multi-armed bandit problem with partially observable arms and explore the greedy policy to minimize the expected AoI sampled over an infinite horizon. To analyze the performance of the greedy policy, we 1) put forth a relaxed greedy policy that decouples the sampling processes of the arms, 2) formulate the sampling process of each arm as a partially observable Markov decision process (POMDP), and 3) derive the average sampled AoI under the relaxed greedy policy as a sum of the average AoI sampled from individual arms. Numerical and simulation results validate that the relaxed greedy policy is an excellent approximation to the greedy policy in terms of the expected AoI sampled over an infinite horizon.
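
To make the greedy (myopic) rule described in the abstract concrete, here is a minimal Python sketch. It is not the paper's model or analysis. The refresh dynamics are an assumption made only for illustration: in each slot, sensor i receives a fresh update with probability `p_refresh[i]` (its AoI resets to 0), and otherwise its AoI grows by 1. All function and parameter names are hypothetical. The AP tracks the expected AoI of each sensor, samples the sensor with the smallest expected AoI, and replaces its belief for that sensor with the observed value.

```python
import random


def predict_expected_age(mean_age, p_refresh):
    """One-slot belief update under the ASSUMED dynamics:
    AoI resets to 0 with probability p_refresh, else grows by 1."""
    return (1.0 - p_refresh) * (mean_age + 1.0)


def greedy_pick(expected_ages):
    """Index of the arm (sensor) with the smallest expected AoI."""
    return min(range(len(expected_ages)), key=lambda i: expected_ages[i])


def simulate_greedy(p_refresh, num_slots, seed=0):
    """Average AoI sampled by the greedy rule over num_slots slots."""
    rng = random.Random(seed)
    n = len(p_refresh)
    true_age = [0] * n    # hidden from the AP
    belief = [0.0] * n    # AP's expected AoI per sensor
    total = 0
    for _ in range(num_slots):
        for i in range(n):
            # Hidden AoI evolves (assumed model); the AP only updates its belief.
            refreshed = rng.random() < p_refresh[i]
            true_age[i] = 0 if refreshed else true_age[i] + 1
            belief[i] = predict_expected_age(belief[i], p_refresh[i])
        arm = greedy_pick(belief)
        total += true_age[arm]              # AoI actually sampled
        belief[arm] = float(true_age[arm])  # sampling reveals the true AoI
    return total / num_slots


if __name__ == "__main__":
    print(simulate_greedy([0.2, 0.5, 0.8], num_slots=10_000))
```

Because the assumed update is linear in the AoI, propagating only the mean is exact for this toy model. The paper's POMDP formulation and its relaxed greedy policy are more involved than this sketch.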
