4.7 Article

Inverse Depth Parametrization for Monocular SLAM

Journal

IEEE TRANSACTIONS ON ROBOTICS
Volume 24, Issue 5, Pages 932-945

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TRO.2008.2003276

Keywords

Monocular simultaneous localization and mapping (SLAM); real-time vision

Categories

Funding

  1. Spanish [PR2007-0427, DPI2006-13578, DGA(CONSI+D)-CAI IT12-06]
  2. Engineering and Physical Sciences Research Council (EP-SRC) [GR/T24685]
  3. University of Oxford
  4. University of Zaragoza
  5. Imperial College London
  6. Robotics Advancement
  7. Web-Publishing of Sensorial and Elaborated Extensive Datasets (RAWSEEDS) [FP6-IST-045144]
  8. EPSRC Advanced Research Fellowship

Ask authors/readers for more resources

We present a new parametrization for point features within monocular simultaneous localization and mapping (SLAM) that permits efficient and accurate representation of uncertainty during undelayed initialization and beyond, all within the standard extended Kalman filter (EKF). The key concept is direct parametrization of the inverse depth of features relative to the camera locations from which they were first viewed, which produces measurement equations with a high degree of linearity. Importantly, our parametrization can cope with features over a huge range of depths, even those that are so far from the camera that they present little parallax during motion-maintaining sufficient representative uncertainty that these points retain the opportunity to come in smoothly from infinity if the camera makes larger movements. Feature initialization is undelayed in the sense that even distant features are immediately used to improve camera motion estimates, acting initially as bearing references but not permanently labeled as such. The inverse depth parametrization remains well behaved for features at all stages of SLAM processing, but has the drawback in computational terms that each point is represented by a 6-D state vector as opposed to the standard three of a Euclidean XYZ representation. We show that once the depth estimate of a feature is sufficiently accurate, its representation can safely be converted to the Euclidean XYZ form, and propose a linearity index that allows automatic detection and conversion to maintain maximum efficiency only low parallax features need be maintained in inverse depth form for long periods. We present a real-time implementation at 30 Hz, where the parametrization is validated in a fully automatic 3-D SLAM system featuring a hand-held single camera with no additional sensing. Experiments show robust operation in challenging indoor and outdoor environments with a very large ranges of scene depth, varied motion, and also real time 360 degrees loop closing.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available