4.6 Article

Enhancing Conventional Geometry-Based Visual Odometry Pipeline Through Integration of Deep Descriptors

Journal

IEEE ACCESS
Volume 11, Issue -, Pages 58294-58307

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2023.3284463

Keywords

Cameras; Robot vision systems; Autonomous automobiles; Benchmark testing; Visualization; Deep learning; Pose estimation; Neural networks; Mobile robots; Odometry; Deep descriptors; deep neural networks; driverless vehicles; interest point detectors; mobile robots; visual odometry

Ask authors/readers for more resources

Geometry-based Visual Odometry (VO) techniques estimate camera motion from visual data obtained from one or more cameras using multi-view geometry. Although effective, these methods do not perform well in challenging cases. In this work, we propose to integrate deep descriptors into the traditional VO pipeline to improve correspondence between image points. Experimental results using conventional and deep descriptors show that our VO method effectively minimizes drift and produces better camera trajectories, performing competitively with state-of-the-art works.
Geometry-based Visual Odometry (VO) techniques are renowned in the fields of computer vision and robotics. They use methods from multi-view geometry to estimate camera motion from visual data obtained from one or more cameras. Tracking the camera motion precisely between different views is dependent on the correct estimation of correspondences between salient points of the views. In practice, geometry-based methods are found to be quite effective but do not perform well in challenging cases due to tracking failures caused by abrupt motion, occlusions, textureless and low-light scenes, etc. On the contrary, end-to-end learning from visual data using deep neural networks is an emerging area of research and deals with challenging cases successfully. Despite being computationally expensive, these methods do not outperform their counterparts in conditions favorable to geometry-based methods. Considering these facts in this work, our goal is to integrate deep descriptors to improve the correspondence between image points for tracking in a traditional geometry-based VO pipeline. We propose a simple stereo VO pipeline inspired by popular techniques found in the literature. Two conventional and four deep descriptors have been used in our experiments conducted on various image sequences of the challenging KITTI benchmark dataset. We have determined empirically that deep descriptors can effectively minimize drift in the VO estimates and produce better camera trajectories. The experimental results on the KITTI dataset demonstrate that our VO method performs at par with the state-of-the-art works reported in the literature.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available