4.7 Article

Semi-Data-Aided Channel Estimation for MIMO Systems via Reinforcement Learning

Journal

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS
Volume 22, Issue 7, Pages 4565-4579

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TWC.2022.3227312

Keywords

Channel estimation; Symbols; MIMO communication; Wireless communication; Estimation; Iterative methods; Iterative decoding; Multiple-input multiple-output (MIMO); channel estimation; data-aided channel estimation; Index Terms; reinforcement learning; Monte Carlo tree search

Ask authors/readers for more resources

This paper proposes a semi-data-aided channel estimator that improves channel estimation accuracy by utilizing data symbols as pilot signals. The method leverages reinforcement learning to select reliable detected symbols and updates the channel estimate using the selected symbols as additional pilot signals. It defines a Markov decision process (MDP) to determine whether to use each detected symbol and develops a RL algorithm based on Monte Carlo tree search to find an effective policy. The proposed channel estimator requires less computational complexity compared to conventional iterative data-aided channel estimators and effectively mitigates channel estimation error and detection performance loss caused by insufficient pilot signals.
Data-aided channel estimation is a promising solution to improve channel estimation accuracy by exploiting data symbols as pilot signals for updating an initial channel estimate. In this paper, we propose a semi-data-aided channel estimator for multiple-input multiple-output communication systems. Our strategy is to leverage reinforcement learning (RL) for selecting reliable detected symbols, then update the channel estimate by utilizing only the selected symbols as additional pilot signals. Towards this end, we first define a Markov decision process (MDP) which sequentially decides whether to use each detected symbol as an additional pilot signal. We then develop an RL algorithm to find an effective policy of the MDP based on a Monte Carlo tree search approach. In this algorithm, we exploit the a-posteriori probability for approximating both the optimal future actions and the corresponding state transitions of the MDP and derive a closed-form expression for the optimal policy under the approximations. A key advantage of the proposed channel estimator is that it requires less computational complexity than conventional iterative data-aided channel estimators. Simulation results demonstrate that the proposed channel estimator effectively mitigates both channel estimation error and detection performance loss caused by insufficient pilot signals.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available