4.7 Article

Remote Sensing Novel View Synthesis With Implicit Multiplane Representations

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TGRS.2022.3197409

关键词

Three-dimensional displays; Rendering (computer graphics); Remote sensing; Optimization; Image reconstruction; Neural networks; Estimation; Implicit neural network; multiplane images (MPIs); novel view synthesis; remote sensing (RS)

资金

  1. National Natural Science Foundation of China [62125102]

向作者/读者索取更多资源

Novel view synthesis of remote sensing scenes plays a significant role in scene visualization, human-computer interaction, and various downstream applications. This article proposes a novel RS view synthesis method based on implicit neural representations, which outperforms previous state-of-the-art methods in terms of reconstruction accuracy, visual fidelity, and time efficiency.
Novel view synthesis of remote sensing (RS) scenes is of great significance for scene visualization, human-computer interaction, and various downstream applications. Despite the recent advances in computer graphics and photogrammetry technology, generating novel views is still challenging particularly for RS images due to its high complexity, view sparsity, and limited view-perspective variations. In this article, we propose a novel RS view synthesis method by leveraging the recent advances in implicit neural representations. Considering the overhead and far depth imaging of RS images, we represent the 3-D space by combining implicit multiplane images (ImMPI) representation and deep neural networks. The 3-D scene is reconstructed under a self-supervised optimization paradigm through a differentiable multiplane renderer with multiview input constraints. Images from any novel views thus can be freely rendered on the basis of the reconstructed model. As a by-product, the depth maps corresponding to the given viewpoint can be generated along with the rendering output. We refer to our method as ImMPIs. To further improve the view synthesis under sparse-view inputs, we explore the learning-based initialization of RS 3-D scenes and proposed a neural-network-based prior extractor to accelerate the optimization process. In addition, we propose a new dataset for RS novel view synthesis with multiview real-world Google Earth images. Extensive experiments demonstrate the superiority of the ImMPI over previous state-of-the-art methods in terms of reconstruction accuracy, visual fidelity, and time efficiency. Ablation experiments also suggest the effectiveness of our methodology design.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据