4.6 Article

IV-Net: single-view 3D volume reconstruction by fusing features of image and recovered volume

Journal

VISUAL COMPUTER
Volume -, Issue -, Pages -

Publisher

SPRINGER
DOI: 10.1007/s00371-022-02725-6

Keywords

Single-view 3D reconstruction; Multi-scale convolution; Deep learning; Residual convolutional neural network

Funding

  1. National Natural Science Foundation of China [11471093]


Researchers propose a voxel-based network, IV-Net, for single-view 3D reconstruction. This network combines features from images and recovered volumes, and utilizes multi-scale convolutional blocks and an IV refiner to improve the accuracy of shape and detail reconstruction.
Single-view 3D reconstruction aims to recover the 3D shape of an object from a single image and has attracted increasing attention in recent years. Most previous works are devoted to learning a mapping from 2D to 3D, and the lack of spatial information about objects leads to inaccurate reconstruction of object details. To address this issue, we propose a novel voxel-based network for single-view 3D reconstruction, named IV-Net, which fuses features of the image and the recovered volume. Using a pre-trained baseline, it obtains image features and a coarse volume from each input image, where the recovered volume contains spatial semantic information. Specifically, a multi-scale convolutional block is designed to improve the 2D encoder by extracting multi-scale image information. To recover a more accurate shape and finer details of the object, an IV refiner is further used to reconstruct the final volume. We conduct experimental evaluations on both the synthetic ShapeNet dataset and the real-world Pix3D dataset, and the results of comparative experiments indicate that IV-Net outperforms state-of-the-art approaches in terms of accuracy and number of parameters.
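The abstract does not specify the layout of the multi-scale convolutional block, so the following is only a minimal sketch of the general idea: the same image is filtered in parallel with kernels of several sizes, and the results are stacked as channels. The function names, the kernel sizes (1, 3, 5), and the fixed averaging weights are all assumptions made for illustration; in a trained network the weights would be learned, and this is not the authors' implementation.

```python
from typing import List, Sequence

Grid = List[List[float]]


def conv2d_same(image: Grid, kernel: Grid) -> Grid:
    """Cross-correlate image with a square, odd-sized kernel.

    Zero padding keeps the output the same size as the input.
    """
    h, w = len(image), len(image[0])
    k = len(kernel)
    pad = k // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in range(k):
                for dj in range(k):
                    y, x = i + di - pad, j + dj - pad
                    # Positions outside the image count as zeros.
                    if 0 <= y < h and 0 <= x < w:
                        acc += image[y][x] * kernel[di][dj]
            out[i][j] = acc
    return out


def box_kernel(size: int) -> Grid:
    """Hypothetical stand-in for learned weights: a uniform averaging kernel."""
    weight = 1.0 / (size * size)
    return [[weight] * size for _ in range(size)]


def multi_scale_block(image: Grid, kernel_sizes: Sequence[int] = (1, 3, 5)) -> List[Grid]:
    """Filter the image once per kernel size and stack the results as channels."""
    return [conv2d_same(image, box_kernel(s)) for s in kernel_sizes]
```

Calling `multi_scale_block(image)` on an H x W grid returns three H x W feature maps, one per receptive-field size, so later layers can see both fine and coarse image structure at the same spatial position.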
