期刊
IEEE INTERNET OF THINGS JOURNAL
卷 8, 期 15, 页码 12289-12310出版社
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2021.3062659
关键词
Jamming; Interference; Reinforcement learning; Games; Receivers; Signal to noise ratio; Convergence; Anti-jamming; domain knowledge; reinforcement learning (RL); unmanned aerial vehicle (UAV) networks
资金
- NSFC [62071483]
The article proposes a knowledge-based reinforcement learning method that compresses the state space to improve algorithm convergence speed, enhancing the efficiency of UAV networks in combating smart jammers.
The unmanned aerial vehicles (UAVs) networks are very vulnerable to smart jammers that can choose their jamming strategy based on the ongoing channel state accordingly. Although reinforcement learning (RL) algorithms can give UAV networks the ability to make intelligent decisions, the high-dimensional state space makes it difficult for algorithms to converge quickly. This article proposes a knowledge-based RL method, which uses domain knowledge to compress the state space that the agent needs to explore and then improve the algorithm convergence speed. Specifically, we use the inertial law of the aircraft and the law of signal attenuation in free space to guide the highly efficient exploration of the UAVs in the state space. We incorporate the performance indicators of the receiver and the subjective value of the task into the design of the reward function, and build a virtual environment for pretraining to accelerate the convergence of anti-jamming decisions. In addition, the algorithm proposed is completely based on observable data, which is more realistic than those studies that assume the position or the channel strategy of the jammer. The simulation shows that the proposed algorithm can outperform the benchmarks of model-free RL algorithm in terms of converge speed and averaged reward.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据