Article

Multimodal Image Fusion Framework for End-to-End Remote Sensing Image Registration

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TGRS.2023.3247642

Keywords

Remote sensing; Feature extraction; Image matching; Image registration; Task analysis; Convolutional neural networks; Image fusion; End-to-end registration; multimodal fusion; remote sensing image; spatial transformer networks

Abstract

We formulate registration as a function that maps the input reference and sensed images directly to eight displacement parameters between prescribed matching points, in contrast to conventional pipelines (feature extraction, description, matching, and geometric constraints). The projection transformation matrix (PTM) is then computed within the neural network and used to warp the sensed image, uniting all matching steps under one framework. In this article, we propose a multimodal image fusion network with self-attention to merge the feature representations of the reference and sensed images. The fused information is then used to regress the displacement parameters of the prescribed points, yielding the PTM between the reference and sensed images. Finally, the PTM is fed into a spatial transformer network (STN), which warps the sensed image into the coordinate frame of the reference image, achieving end-to-end matching. In addition, a dual-supervised loss function is proposed to optimize the network from both the prescribed-point displacement and the overall pixel-matching perspectives. The effectiveness of our method is validated by qualitative and quantitative experimental results on multimodal remote sensing image matching tasks. The code is available at: https://github.com/liliangzhi110/E2EIR.
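
The abstract describes a pipeline of regressing eight per-corner displacement parameters, solving for the PTM (a 3x3 homography) from those four corner correspondences, and warping the sensed image with an STN under a dual-supervised loss. The PyTorch sketch below illustrates the last two stages. It is a minimal illustration, not the authors' released implementation (see the repository linked above); the function names (four_point_to_homography, stn_warp, dual_supervised_loss), the corner ordering, the convention that the homography maps reference coordinates to sensed coordinates, and the loss weighting are all assumptions.

    import torch
    import torch.nn.functional as F

    def four_point_to_homography(corners, deltas):
        # corners: (B, 4, 2) reference corner coordinates, in pixels
        # deltas:  (B, 4, 2) regressed displacements (the eight parameters)
        # Solves the 8x8 DLT system for the homography h, with h33 fixed to 1.
        b = corners.shape[0]
        src = corners
        dst = corners + deltas
        x, y = src[..., 0], src[..., 1]
        u, v = dst[..., 0], dst[..., 1]
        ones = torch.ones_like(x)
        zeros = torch.zeros_like(x)
        # Two DLT rows per correspondence: one for u, one for v.
        rows_u = torch.stack([x, y, ones, zeros, zeros, zeros, -u * x, -u * y], dim=-1)
        rows_v = torch.stack([zeros, zeros, zeros, x, y, ones, -v * x, -v * y], dim=-1)
        A = torch.zeros(b, 8, 8, dtype=src.dtype, device=src.device)
        A[:, 0::2] = rows_u
        A[:, 1::2] = rows_v
        rhs = dst.reshape(b, 8, 1)  # interleaved [u1, v1, u2, v2, ...]
        h = torch.linalg.solve(A, rhs).squeeze(-1)  # (B, 8), differentiable
        H = torch.cat([h, torch.ones(b, 1, dtype=src.dtype, device=src.device)], dim=1)
        return H.reshape(b, 3, 3)

    def stn_warp(sensed, H):
        # STN-style inverse warp: H maps reference pixel coordinates to sensed
        # pixel coordinates, so sampling through H resamples the sensed image
        # into the reference frame.
        b, _, hgt, wid = sensed.shape
        ys, xs = torch.meshgrid(
            torch.arange(hgt, dtype=sensed.dtype, device=sensed.device),
            torch.arange(wid, dtype=sensed.dtype, device=sensed.device),
            indexing="ij",
        )
        grid = torch.stack([xs, ys, torch.ones_like(xs)], dim=-1).reshape(1, -1, 3)
        pts = grid @ H.transpose(1, 2)            # (B, H*W, 3)
        pts = pts[..., :2] / pts[..., 2:3]        # assumes w stays away from 0
        # Normalize pixel coordinates to [-1, 1] for grid_sample.
        gx = 2.0 * pts[..., 0] / (wid - 1) - 1.0
        gy = 2.0 * pts[..., 1] / (hgt - 1) - 1.0
        sample_grid = torch.stack([gx, gy], dim=-1).reshape(b, hgt, wid, 2)
        return F.grid_sample(sensed, sample_grid, align_corners=True)

    def dual_supervised_loss(pred_deltas, gt_deltas, warped, reference, lam=1.0):
        # Illustrative two-term loss: a prescribed-point displacement term plus
        # an overall pixel term. The paper's exact formulation may differ.
        corner_term = F.l1_loss(pred_deltas, gt_deltas)
        pixel_term = F.l1_loss(warped, reference)
        return corner_term + lam * pixel_term

    # Example usage with random data in place of network outputs.
    B, H_img, W_img = 2, 128, 128
    sensed = torch.rand(B, 3, H_img, W_img)
    corners = torch.tensor(
        [[0.0, 0.0], [W_img - 1.0, 0.0], [W_img - 1.0, H_img - 1.0], [0.0, H_img - 1.0]]
    ).expand(B, 4, 2)
    deltas = torch.randn(B, 4, 2) * 2.0           # stands in for the regressor output
    H = four_point_to_homography(corners, deltas)
    warped = stn_warp(sensed, H)                  # (B, 3, 128, 128)

Because the DLT solve and the grid sampling are both differentiable, gradients from both loss terms can flow back through the PTM into the displacement regressor, which is what makes the framework trainable end to end.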

