☆ 4.6 Article

SphereVLAD++: Attention-Based and Signal-Enhanced Viewpoint Invariant Descriptor

IEEE ROBOTICS AND AUTOMATION LETTERS (2023)

Journal

IEEE ROBOTICS AND AUTOMATION LETTERS

Volume 8, Issue 1, Pages 256-263

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/LRA.2022.3223555

Keywords

3D Place Recognition; Attention; Viewpoint-invariant Localization

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

We propose SphereVLAD++, an attention-enhanced viewpoint invariant place recognition method, which projects point clouds onto a spherical perspective and captures contextual connections between local features and global 3D geometry distribution. It outperforms all relative state-of-the-art 3D place recognition methods, achieving successful retrieval rates of 7.06% and 28.15% under small or even totally reversed viewpoint differences. It also has low computation requirements and high time efficiency, making it suitable for low-cost robots.

LiDAR-based localization approach is a fundamental module for large-scale navigation tasks, such as last-mile delivery and autonomous driving, and localization robustness highly relies on viewpoints and 3D feature extraction. Our previous work provides a viewpoint-invariant descriptor to deal with viewpoint differences; however, the global descriptor suffers from a low signal-noise ratio in unsupervised clustering, reducing the distinguishable feature extraction ability. In this work, we develop SphereVLAD++, an attention-enhanced viewpoint invariant place recognition method. SphereVLAD++ projects the point cloud on the spherical perspective for each unique area and captures the contextual connections between local features and their dependencies with global 3D geometry distribution. In return, clustered elements within the global descriptor are conditioned on local and global geometries and support the original viewpoint-invariant property of SphereVLAD. In the experiments, we evaluated the localization performance of SphereVLAD++ on both the public KITTI360 dataset and self-generated datasets from the city of Pittsburgh. The experiment results show that SphereVLAD++ outperforms all relative state-of-the-art 3D place recognition methods under small or even totally reversed viewpoint differences and shows 7.06% and 28.15% successful retrieval rates with better than the second best. Low computation requirements and high time efficiency also help its application for low-cost robots.

SphereVLAD++: Attention-Based and Signal-Enhanced Viewpoint Invariant Descriptor

Journal

IEEE ROBOTICS AND AUTOMATION LETTERS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

SphereVLAD++: Attention-Based and Signal-Enhanced Viewpoint Invariant Descriptor

Journal

IEEE ROBOTICS AND AUTOMATION LETTERS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper