4.7 Article

Path Planning for Adaptive CSI Map Construction With A3C in Dynamic Environments

Journal

IEEE TRANSACTIONS ON MOBILE COMPUTING
Volume 22, Issue 5, Pages 2925-2937

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TMC.2021.3131318

Keywords

Location awareness; Path planning; Task analysis; Fingerprint recognition; Wireless fidelity; Robot sensing systems; Heuristic algorithms; CSI; reinforcement learning; fingerprint localization

Ask authors/readers for more resources

In this paper, a novel data collection strategy based on reinforcement learning is proposed for fingerprint localization. By utilizing multivariate Gaussian process and mutual information, a small amount of real data is collected in advance to predict the rewards of sampling points. Then, an optimization problem is transformed into a sequential decision process and A3C algorithm is used to exploit the informative path. Extensive experiments validate the performance of the proposed algorithm and compared to existing algorithms, our system achieves similar indoor localization accuracy while reducing the workload of CSI collection.
With the growing demand of Location-Based Service, the fingerprint localization based on Channel State Information (CSI) has become a vital positioning technology because it has easy implementation, low device cost and adequate accuracy which benefits from fine-grained information provided by CSI. However, the main drawback is that the approach has to construct the fingerprint map manually during the off-line stage, which is tedious and time-consuming. In this paper, we propose a novel data collection strategy for path planning based on reinforcement learning, namely Asynchronous Advantage Actor-Critic (A3C). Given the limited exploration step length, it needs to maximize the informative CSI data for reducing manual cost. We collect a small amount of real data in advance to predict the rewards of all sampling points by multivariate Gaussian process and mutual information. Then the optimization problem is transformed into a sequential decision process, which can exploit the informative path by A3C. We complete the proposed algorithm in two real-world dynamic environments and extensive experiments verify its performance. Compared to coverage path planning and several existing algorithms, our system not only can achieve similar indoor localization accuracy, but also reduce the CSI collection task.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available