期刊
VISUAL COMPUTER
卷 -, 期 -, 页码 -出版社
SPRINGER
DOI: 10.1007/s00371-022-02725-6
关键词
Single-view 3D reconstruction; Multi-scale convolution; Deep learning; Residual convolutional neural network
资金
- National Natural Science Foundation of China [11471093]
Researchers propose a voxel-based network, IV-Net, for single-view 3D reconstruction. This network combines features from images and recovered volumes, and utilizes multi-scale convolutional blocks and an IV refiner to improve the accuracy of shape and detail reconstruction.
Single-view 3D reconstruction aims to recover the 3D shape from one image of an object and has attracted increasingly attention in recent years. Mostly, previous works are devoted to learning a mapping from 2 to 3D, and lack of spatial information of objects will cause inaccurate reconstruction on the details of objects. To address this issue, for single-view 3D reconstruction, we propose a novel voxel-based network by fusing features of image and recovered volume, named IV-Net. By a pre-trained baseline, it achieves image feature and a coarse volume from each image input, where the recovered volume contains spatial semantic information. Specially, the multi-scale convolutional block is designed to improve 2D encoder by extracting multi-scale image information. To recover more accurate shape and details of the object, an IV refiner is further used to reconstruct the final volume. We conduct experimental evaluations on both synthetic ShapeNet dataset and real-world Pix3D dataset, and results of comparative experiments indicate that our IV-Net outperforms state-of-the-art approaches about accuracy and parameters.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据