4.7 Article

Robotic Visual-Inertial Calibration via Deep Deterministic Policy Gradient Learning

Journal

IEEE SENSORS JOURNAL
Volume 22, Issue 14, Pages 14448-14457

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JSEN.2022.3171818

Keywords

Calibration; Sensors; Cameras; Observability; Reinforcement learning; Training; Navigation; Visual-inertial calibration; deep deterministic policy gradient; observability analysis; partially observable Markov decision process

Funding

  1. National Key Research and Development Program [2021YFB2501603]

Ask authors/readers for more resources

This research proposes a visual-inertial calibration method using deep deterministic policy gradient learning. By analyzing nonlinear observability and establishing a relationship model, it achieves the self-calibration process of visual-inertial systems, and solves the problems of hyperparameter training and network instability through a reinforcement learning network model.
Visual-inertial calibration is important in robotic vision navigation systems, and calibration errors will reduce navigation accuracy for the longtime autonomous operation. Aiming at the problems of the complicated offline calibration process and the high calculation cost of self-calibration, a novel visual-inertial calibration method using deep deterministic policy gradient learning is proposed. Firstly, the error model of visual-inertial calibration is established considering the intrinsic and extrinsic parameters of the camera and IMU simultaneously. Secondly, the nonlinear observable analysis of the visual-inertial system is carried out. The rank decomposition of the Fisher information matrix is used to establish the relationship between the parameters to be calibrated and the nonlinear observability. Then, the visual-inertial self-calibration process is modeled as a partially observable Markov decision process to facilitate the design and optimization of subsequent reinforcement learning policies. Finally, a reinforcement learning network model is established for visual-inertial calibration using deep deterministic policy gradient, which is used to determine unobservable motion sequences. Meanwhile, the experience playback and target network are adopted in visual-inertial calibration algorithm to solve the problem of hyperparameter training and the instability of the network. Experiments in two different environments show that the proposed method achieves comparable performance comparing with the informative segment approach and the batch calibration approach. Moreover, the proposed method has the shortest trajectory length selected for calibration.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available