☆ 4.6 Article

Panoramic image generation using deep neural networks

SOFT COMPUTING (2023)

期刊

SOFT COMPUTING

卷 27, 期 13, 页码 8679-8695

出版社

SPRINGER

DOI: 10.1007/s00500-023-08056-5

关键词

Computer vision; Convolutional neural networks; Homography matrix; Image stitching

类别

Computer Science, Artificial Intelligence Computer Science, Interdisciplinary Applications

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

A traditional approach using RANSAC algorithm on SIFT correspondences for panoramic image generation is not robust enough for highly varying natural images, and deep learning approach has not been extensively explored. This paper proposes a deep learning model for panoramic image generation and achieves superior results compared to the state-of-the-art SIFT+RANSAC algorithm. A novel panoramic image generation dataset is also introduced.

A traditional approach for panoramic image generation consists of a random sample consensus (RANSAC) algorithm on a set of scale-invariant feature transform (SIFT) correspondences to generate a homography matrix between two images. Although producing adequate results for some type of images, hand-crafted SIFT features are not robust enough for highly varying natural images and the iterative RANSAC algorithm with its randomness does not always find the desired homography matrix. Recently, deep neural networks have been producing significant results in many challenging computer vision problems by learning features from large amounts of data. However, only very few recent works have been applied deep learning to panoramic image generation with the objective of finding feature correspondences and estimating homography matrix. Moreover, the absence of a proper dataset for the image stitching task hinders the standardization of models and comparison of their results. This paper attempts to generate panoramic images by extensively experimenting with various approaches using deep neural networks. The best proposed deep learning model achieved 7.31 and 1.07 pixels of the average absolute value loss for corner difference in X and Y directions, respectively. At the same time, qualitative results demonstrate superiority in comparison with the state-of-the-art SIFT+RANSAC algorithm. Specifically, in 72% of time the proposed framework either produced better results than SIFT+RANSAC or results of the proposed approach and SIFT+RANSAC were indistinguishable. Although SIFT + RANSAC produces better results in 28% of the time with respect to the loss function, our results are still visually comparable in many of these cases. Finally, a novel panoramic image generation dataset is introduced in this paper.

Panoramic image generation using deep neural networks

期刊

SOFT COMPUTING

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Panoramic image generation using deep neural networks

期刊

SOFT COMPUTING

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文