☆ 4.5 Article

Adaptive depth estimation for pyramid multi-view stereo

COMPUTERS & GRAPHICS-UK (2021)

期刊

COMPUTERS & GRAPHICS-UK

卷 97, 期 -, 页码 268-278

出版社

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.cag.2021.04.016

关键词

3D Reconstruction; Multi-View Stereo; Deep Learning

类别

Computer Science, Software Engineering

资金

Key Technological Innovation Projects of Hubei Province [2018AAA062]
NSFC [61972298]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a MVS network for efficient high-resolution depth estimation by adaptively refining and upsampling the depth map to the desired resolution, reducing excessive computation on accurate positions. Experimental results show that the method can generate comparable results with state-of-the-art learning methods, reconstructing more geometric details and consuming less GPU memory.

In this paper, we propose a Multi-View Stereo (MVS) network which can perform efficient high-resolution depth estimation with low memory consumption. Classical learning-based MVS approaches typically construct 3D cost volumes to regress depth information, making the output resolution rather limited as the memory consumption grows cubically with the input resolution. Although recent approaches have made significant progress in scalability by introducing the coarse-to-fine fashion or sequential cost map regularization, the memory consumption still grows quadratically with input resolution and is not friendly for commodity GPU. Observing that the surfaces of most objects in real world are locally smooth, we assume that most of the depth hypotheses upsampled from a well-estimated depth map are accurate. Based on the assumption, we propose a pyramid MVS network based on the adaptive depth estimation, which gradually refines and upsamples the depth map to the desired resolution. Instead of estimating depth hypotheses for all pixels in the depth map, our method only performs prediction at adaptively selected locations, alleviating excessive computation on well-estimated positions. To estimate depth hypotheses for sparse selected locations, we propose the lightweight pixelwise depth estimation network, which can estimate depth value for each selected location independently. Experiments demonstrate that our method can generate results comparable with the state-of-the-art learning-based methods while reconstructing more geometric details and consuming less GPU memory. (c) 2021 Elsevier Ltd. All rights reserved.

Adaptive depth estimation for pyramid multi-view stereo

期刊

COMPUTERS & GRAPHICS-UK

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Adaptive depth estimation for pyramid multi-view stereo

期刊

COMPUTERS & GRAPHICS-UK

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文