4.4 Article

A Deep Q-Network-Based Collaborative Control Research for Smart Ammunition Formation

期刊

出版社

HINDAWI LTD
DOI: 10.1155/2022/2021693

关键词

-

资金

  1. National Natural Science Foundation of China [62003314, 51909245]
  2. Aeronautical Science Foundation of China [2019020U0002]
  3. Youth Science and Technology Research Fund, Shanxi Province Applied Basic Research Project [201901D211244]

向作者/读者索取更多资源

This paper investigates an intelligent deep Q-network (DQN) based control algorithm for the smart ammunition formation (SAF) collaborative control in a high dynamic and uncertain flight environment. The algorithm is developed with a dynamic model and reinforcement learning technique, showing novel performance in the SAF control.
The smart ammunition formation (SAF) system model usually has the characteristics of complexity, time variation, and nonlinearity. With the consideration of random factors, such as sensor error and environmental disturbance, the system model cannot be modeled accurately. To deal with this problem, this paper investigated an intelligent deep Q-network- (DQN-) based control algorithm for the SAF collaborative control, which deals with the high dynamics and uncertainty in the SAF flight environment. In the environment description of the SAF, we built a dynamic model to represent the system joint states, which referred to the smart ammunition's velocity, the trajectory inclination angle, the ballistic deflection angle, and the relative position between different formation nodes. Next, we describe the SAF collaborative control process as a Markov decision process (MDP) with the application of the reinforcement learning (RL) technique. Then, the basic framework e-imitation action-selecting strategy and the algorithm details were developed to address the SAF control problem based on the DQN scheme. Finally, the numerical simulation was carried out to verify the effectiveness and portability of the DQN-based algorithm. The average total reward curve showed a reasonable convergence, and the relative kinematic relationship among the formation nodes met the requirements of the controller design. It illustrated that the DQN-based algorithm obtained a novel performance in the SAF collaborative control.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据