4.7 Article

Reinforcement Learning Control of a Flexible Two-Link Manipulator: An Experimental Investigation

Journal

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
Volume 51, Issue 12, Pages 7326-7336

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TSMC.2020.2975232

Keywords

Mathematical model; Artificial neural networks; Vibrations; Manipulators; Heuristic algorithms; Robot kinematics; Flexible structure; neural networks (NNs); reinforcement learning (RL); robots; vibration control

Funding

  1. National Natural Science Foundation of China [61933001, 61873298]
  2. National Key Research and Development Program of China [2019YFB1703600]
  3. Joint Funds of Equipment Preresearch and Ministry of Education of China [6141A02033339]
  4. Beijing Top Discipline for Artificial Intelligent Science and Engineering, University Science and Technology Beijing

Ask authors/readers for more resources

This article discusses the control design and experiment validation of a flexible two-link manipulator system represented by ordinary differential equations, introducing a reinforcement learning control strategy based on actor-critic structure. The closed-loop system with the proposed control algorithm is proven to be semi-global uniform ultimate bounded by Lyapunov's direct method. Simulation and experiments demonstrate the effectiveness of the control approach in vibration suppression and trajectory tracking.
This article discusses the control design and experiment validation of a flexible two-link manipulator (FTLM) system represented by ordinary differential equations (ODEs). A reinforcement learning (RL) control strategy is developed that is based on actor-critic structure to enable vibration suppression while retaining trajectory tracking. Subsequently, the closed-loop system with the proposed RL control algorithm is proved to be semi-global uniform ultimate bounded (SGUUB) by Lyapunov's direct method. In the simulations, the control approach presented has been tested on the discretized ODE dynamic model and the analytical claims have been justified under the existence of uncertainty. Eventually, a series of experiments in a Quanser laboratory platform are investigated to demonstrate the effectiveness of the presented control and its application effect is compared with PD control.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available