☆ 4.7 Article

A deep learning method for building height estimation using high-resolution multi-view imagery over urban areas: A case study of 42 Chinese cities

REMOTE SENSING OF ENVIRONMENT (2021)

期刊

REMOTE SENSING OF ENVIRONMENT

卷 264, 期 -, 页码 -

出版社

ELSEVIER SCIENCE INC

DOI: 10.1016/j.rse.2021.112590

关键词

Building height; High-resolution; Multi-view; ZY-3; Multi-task; Deep learning

类别

Environmental Sciences Remote Sensing Imaging Science & Photographic Technology

资金

National Natural Science Foundation of China [41771360, 41971295]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The study focuses on using high-resolution multi-view satellite images for building height estimation, proposing the M(3)Net deep network model that can accurately estimate building height at a spatial resolution of 2.5 meters. Testing on 42 Chinese cities showed that M(3)Net outperformed the RF method with a lower root mean square error (RMSE), and the inclusion of ZY-3 multi-view images significantly reduced the uncertainty of building height prediction.

Knowledge of building height is critical for understanding the urban development process. High-resolution optical satellite images can provide fine spatial details within urban areas, while they have not been applied to building height estimation over multiple cities and the feasibility of mapping building height at a fine scale (< 5 m) remains understudied. Multi-view satellite images can describe vertical information of buildings, due to the inconsistent response of buildings (e.g., spectral and structural variations) to different viewing angles, but they have not been employed to deep learning-based building height estimation. In this context, we introduce high-resolution ZY-3 multi-view images to estimate building height at a spatial resolution of 2.5 m. We propose a multi-spectral, multi-view, and multi-task deep network (called M(3)Net) for building height estimation, where ZY-3 multi-spectral and multi-view images are fused in a multi-task learning framework. A random forest (RF) method using multi-source features is also carried out for comparison. We select 42 Chinese cities with diverse building types to test the proposed method. Results show that the M(3)Net obtains a lower root mean square error (RMSE) than the RF, and the inclusion of ZY-3 multi-view images can significantly lower the uncertainty of building height prediction. Comparison with two existing state-of-the-art studies further confirms the superiority of our method, especially the efficacy of the M(3)Net in alleviating the saturation effect of high-rise building height estimation. Compared to the vanilla single/multi-task models, the M(3)Net also achieves a lower RMSE. Moreover, the spatial-temporal transferability test indicates the robustness of the M(3)Net to imaging conditions and building styles. The test of our method on a relatively large area (covering about 14,120 km(2)) further validates the scalability of our method from the perspectives of both efficacy and quality. The source code will be made available at https://github.com/lauraset/BuildingHeightModel.

A deep learning method for building height estimation using high-resolution multi-view imagery over urban areas: A case study of 42 Chinese cities

期刊

REMOTE SENSING OF ENVIRONMENT

出版社

ELSEVIER SCIENCE INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A deep learning method for building height estimation using high-resolution multi-view imagery over urban areas: A case study of 42 Chinese cities

期刊

REMOTE SENSING OF ENVIRONMENT

出版社

ELSEVIER SCIENCE INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文