4.7 Article

Implicit neural refinement based multi-view stereo network with adaptive correlation

Journal

IMAGE AND VISION COMPUTING
Volume 124, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.imavis.2022.104511

Keywords

Multi -view stereo; Implicit neural representation; Adaptive aggregation; Feature pyramid; Coarse -to -fine

Funding

  1. National Natural Science Founda-tion of China [51807003]

Ask authors/readers for more resources

In this paper, the authors propose an end-to-end trainable framework called ACINR-MVSNet for multi-view stereo (MVS) with adaptive group-wise correlation and implicit neural depth refinement. The framework consists of a one-stage MVS architecture followed by refinement modules and an implicit neural refinement module. An adaptive group-wise correlation similarity measure is proposed to solve visibility problems, and a pyramid-based feature extraction network is utilized to gather context-aware information. The experiments demonstrate the effectiveness and generalization of the proposed approach.
In this paper, we propose ACINR-MVSNet, an end-to-end trainable framework with adaptive group-wise corre-lation and implicit neural depth refinement for multi-view stereo (MVS). Previous learning-based MVS methods have demonstrated their outstanding performance, and most of them estimate depth maps in a coarse-to-fine manner. However, in a commonly used multi-stage cascaded framework, the previous wrong estimation might lead to error propagation. In contrast, we focus on another coarse-to-fine structure, i.e., one-stage MVS ar-chitecture followed by refinement modules. Inspired by implicit neural representation, we propose an implicit neural refinement module to refine the coarse depth map. Guided by the corresponding reference image, it can better recover finer details, especially those in boundary areas. To solve the visibility problem in complex sce-narios while maintaining efficiency, we propose an adaptive group-wise correlation similarity measure for cost volume construction. Besides, we present a pyramid-based feature extraction network with a repeated top -down and bottom-up structure to gather more context-aware information, which can better meet the challenges in ill-posed regions. This novel feature extractor is also utilized to construct an enhanced Gauss-Newton refine-ment module for further upsampling and optimizing. Extensive experiments on the DTU, the Tanks & Temples and the BlendedMVS datasets demonstrate the effectiveness and generalization of our approach, which can achieve better or competitive results compared to state-of-the-art methods. The code will be available at https://github.com/BoyangSONG/ACINR-MVSNet.(c) 2022 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available