3.8 Proceedings Paper

Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition

出版社

IEEE COMPUTER SOC
DOI: 10.1109/CVPR46437.2021.01392

关键词

-

资金

  1. Australian Government [AUSMURIB000001]
  2. ONR MURI [N00014-19-1-2571]

向作者/读者索取更多资源

Visual Place Recognition is a challenging task that requires dealing with appearance and viewpoint changes in a changing world. Patch-NetVLAD introduces a novel approach that combines local and global descriptor methods to achieve superior performance and computational efficiency, making it suitable for enhancing both stand-alone place recognition capabilities and SLAM systems.
Visual Place Recognition is a challenging task for robotics and autonomous systems, which must deal with the twin problems of appearance and viewpoint change in an always changing world. This paper introduces Patch-NetVLAD, which provides a novel formulation for combining the advantages of both local and global descriptor methods by deriving patch-level features from NetVLAD residuals. Unlike the fixed spatial neighborhood regime of existing local keypoint features, our method enables aggregation and matching of deep-learned local features defined over the feature-space grid. We further introduce a multi-scale fusion of patch features that have complementary scales (i.e. patch sizes) via an integral feature space and show that the fused features are highly invariant to both condition (season, structure, and illumination) and viewpoint (translation and rotation) changes. Patch-NetVLAD achieves state-of-the-art visual place recognition results in computationally limited scenarios, validated on a range of challenging real-world datasets, including winning the Facebook Mapillary Visual Place Recognition Challenge at ECCV2020. It is also adaptable to user requirements, with a speed-optimised version operating over an order of magnitude faster than the state-of-the-art. By combining superior performance with improved computational efficiency in a configurable framework, Patch-NetVLAD is well suited to enhance both stand-alone place recognition capabilities and the overall performance of SLAM systems.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据