4.6 Article

SphereVLAD++: Attention-Based and Signal-Enhanced Viewpoint Invariant Descriptor

Journal

IEEE ROBOTICS AND AUTOMATION LETTERS
Volume 8, Issue 1, Pages 256-263

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LRA.2022.3223555

Keywords

3D Place Recognition; Attention; Viewpoint-invariant Localization

Categories

Ask authors/readers for more resources

We propose SphereVLAD++, an attention-enhanced viewpoint invariant place recognition method, which projects point clouds onto a spherical perspective and captures contextual connections between local features and global 3D geometry distribution. It outperforms all relative state-of-the-art 3D place recognition methods, achieving successful retrieval rates of 7.06% and 28.15% under small or even totally reversed viewpoint differences. It also has low computation requirements and high time efficiency, making it suitable for low-cost robots.
LiDAR-based localization approach is a fundamental module for large-scale navigation tasks, such as last-mile delivery and autonomous driving, and localization robustness highly relies on viewpoints and 3D feature extraction. Our previous work provides a viewpoint-invariant descriptor to deal with viewpoint differences; however, the global descriptor suffers from a low signal-noise ratio in unsupervised clustering, reducing the distinguishable feature extraction ability. In this work, we develop SphereVLAD++, an attention-enhanced viewpoint invariant place recognition method. SphereVLAD++ projects the point cloud on the spherical perspective for each unique area and captures the contextual connections between local features and their dependencies with global 3D geometry distribution. In return, clustered elements within the global descriptor are conditioned on local and global geometries and support the original viewpoint-invariant property of SphereVLAD. In the experiments, we evaluated the localization performance of SphereVLAD++ on both the public KITTI360 dataset and self-generated datasets from the city of Pittsburgh. The experiment results show that SphereVLAD++ outperforms all relative state-of-the-art 3D place recognition methods under small or even totally reversed viewpoint differences and shows 7.06% and 28.15% successful retrieval rates with better than the second best. Low computation requirements and high time efficiency also help its application for low-cost robots.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available