4.7 Article

HMSM-Net: Hierarchical multi-scale matching network for disparity estimation of high-resolution satellite stereo images

期刊

出版社

ELSEVIER
DOI: 10.1016/j.isprsjprs.2022.04.020

关键词

Satellite stereo images; Disparity estimation; Convolutional neural network; Hierarchical multi-scale matching; GaoFen-7 dataset

资金

  1. High-Resolution Remote Sensing Application Demonstration System for Urban Fine Management [06-Y30F04-9001-20/22]
  2. National Natural Science Foundation of China [42001413]

向作者/读者索取更多资源

This paper proposes a hierarchical multi-scale matching network (HMSM-Net) for the disparity estimation of high-resolution satellite stereo images. The network leverages multi-scale features and a hierarchical coarse-to-fine matching strategy to handle challenging regions and achieves superior accuracy compared to state-of-the-art methods. The authors also create a dense stereo matching dataset and provide the source codes and evaluation dataset for further research.
Disparity estimation of satellite stereo images is an essential and challenging task in photogrammetry and remote sensing. Recent researches have greatly promoted the development of disparity estimation algorithms by using CNN (Convolutional Neural Networks) based deep learning techniques. However, it is still difficult to handle intractable regions that are mainly caused by occlusions, disparity discontinuities, texture-less areas, and re-petitive patterns. Besides, the lack of training datasets for satellite stereo images remains another major issue that blocks the usage of CNN techniques due to the difficulty of obtaining ground-truth disparities. In this paper, we propose an end-to-end disparity learning model, termed hierarchical multi-scale matching network (HMSM-Net), for the disparity estimation of high-resolution satellite stereo images. First, multi-scale cost volumes are con-structed by using pyramidal features that capture spatial information of multiple levels, which learn corre-spondences at multiple scales and enable HMSM-Net to be more robust in intractable regions. Second, stereo matching is executed in a hierarchical coarse-to-fine manner by applying supervision to each scale, which allows a lower scale to act as prior knowledge and guides a higher scale to attain finer matching results. Third, a refinement module that incorporates the intensity and gradient information of the input left image is designed to regress a detailed full-resolution disparity map for local structure preservation. For network training and testing, a dense stereo matching dataset is created and published by using GaoFen-7 satellite stereo images. Finally, the proposed network is evaluated on the Urban Semantic 3D and GaoFen-7 datasets. Experimental results demonstrate that HMSM-Net achieves superior accuracy compared with state-of-the-art methods, and the improvement on intractable regions is noteworthy. Additionally, results and comparisons of different methods on the GaoFen-7 dataset show that it can severs as a challenging benchmark for performance assessment of methods applied to disparity estimation of satellite stereo images. The source codes and evaluation dataset are made publicly available at https://github.com/Sheng029/HMSM-Net.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据