☆ 4.7 Article

Implicit neural refinement based multi-view stereo network with adaptive correlation

IMAGE AND VISION COMPUTING (2022)

Journal

IMAGE AND VISION COMPUTING

Volume 124, Issue -, Pages -

Publisher

ELSEVIER

DOI: 10.1016/j.imavis.2022.104511

Keywords

Multi -view stereo; Implicit neural representation; Adaptive aggregation; Feature pyramid; Coarse -to -fine

Funding

National Natural Science Founda-tion of China [51807003]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

In this paper, the authors propose an end-to-end trainable framework called ACINR-MVSNet for multi-view stereo (MVS) with adaptive group-wise correlation and implicit neural depth refinement. The framework consists of a one-stage MVS architecture followed by refinement modules and an implicit neural refinement module. An adaptive group-wise correlation similarity measure is proposed to solve visibility problems, and a pyramid-based feature extraction network is utilized to gather context-aware information. The experiments demonstrate the effectiveness and generalization of the proposed approach.

In this paper, we propose ACINR-MVSNet, an end-to-end trainable framework with adaptive group-wise corre-lation and implicit neural depth refinement for multi-view stereo (MVS). Previous learning-based MVS methods have demonstrated their outstanding performance, and most of them estimate depth maps in a coarse-to-fine manner. However, in a commonly used multi-stage cascaded framework, the previous wrong estimation might lead to error propagation. In contrast, we focus on another coarse-to-fine structure, i.e., one-stage MVS ar-chitecture followed by refinement modules. Inspired by implicit neural representation, we propose an implicit neural refinement module to refine the coarse depth map. Guided by the corresponding reference image, it can better recover finer details, especially those in boundary areas. To solve the visibility problem in complex sce-narios while maintaining efficiency, we propose an adaptive group-wise correlation similarity measure for cost volume construction. Besides, we present a pyramid-based feature extraction network with a repeated top -down and bottom-up structure to gather more context-aware information, which can better meet the challenges in ill-posed regions. This novel feature extractor is also utilized to construct an enhanced Gauss-Newton refine-ment module for further upsampling and optimizing. Extensive experiments on the DTU, the Tanks & Temples and the BlendedMVS datasets demonstrate the effectiveness and generalization of our approach, which can achieve better or competitive results compared to state-of-the-art methods. The code will be available at https://github.com/BoyangSONG/ACINR-MVSNet.(c) 2022 Elsevier B.V. All rights reserved.

Implicit neural refinement based multi-view stereo network with adaptive correlation

Journal

IMAGE AND VISION COMPUTING

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Implicit neural refinement based multi-view stereo network with adaptive correlation

Journal

IMAGE AND VISION COMPUTING

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper