4.7 Article

Adversarial retraining attack of asynchronous advantage actor-critic based pathfinding

Journal

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS
Volume 36, Issue 5, Pages 2323-2346

Publisher

WILEY
DOI: 10.1002/int.22380

Keywords

A3C; evasion attack; pathfinding; reinforcement learning; retraining attack

Funding

  1. National Natural Science Foundation of China [61972025, 61802389, 61672092, U1811264, 61966009]
  2. National Key R&D Program of China [2020YFB1005604, 2020YFB2103800]
  3. Fundamental Research Funds for the Central Universities of China [2018JBZ103, 2019RC008]
  4. Science and Technology on Information Assurance Laboratory, Guangxi Key Laboratory of Trusted Software [KX201902]

Ask authors/readers for more resources

Pathfinding is crucial in real-world scenarios and undergoing a revolution in efficient parallel learning with the development of reinforcement learning. This study is the first to explore adversarial attacks on A3C, achieving a high success rate and discussing defense strategies.
Pathfinding becomes an important component in many real-world scenarios, such as popular warehouse systems and autonomous aircraft towing vehicles. With the development of reinforcement learning (RL) especially in the context of asynchronous advantage actor-critic (A3C), pathfinding is undergoing a revolution in terms of efficient parallel learning. Similar to other artificial intelligence-based applications, A3C-based pathfinding is also threatened by the adversarial attack. In this paper, we are the first to study the adversarial attack to A3C, that can unexpectedly wake up longtime retraining mechanism until successful pathfinding. We also discover an attack example generation to launch the attack based on gradient band, in which only one baffle of extremely few unit lengths can successfully perform the attack. Experiments with detailed analysis are conducted to show a high attack success rate of 95% with an average baffle length of 2.95. We also discuss defense suggestions leveraging the insights from our analysis.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available