4.7 Article

3DCentripetalNet: Building height retrieval from monocular remote sensing imagery

出版社

ELSEVIER
DOI: 10.1016/j.jag.2023.103311

关键词

Building height retrieval; Monocular imagery; Building footprint generation; Height estimation

向作者/读者索取更多资源

Three-dimensional (3D) building structures play a vital role in understanding urban dynamics. Monocular remote sensing imagery is a cost-effective data source for large-scale building height retrieval. However, existing methods fail to consider the information of neighboring pixels belonging to the same building. Therefore, this study proposes a novel representation called 3D centripetal shifts, which incorporates both planar and vertical structures of buildings, and presents a robust solution named 3DCentripetalNet for building height retrieval.
Three-dimensional (3D) building structures are vital to understanding urban dynamics. Monocular remote sensing imagery is a cost-effective data source for large-scale building height retrieval when compared to LiDAR data and multi-view imagery. Existing methods learn building footprints and height maps per pixel via either a multi-task network or two separate networks, however, failing to consider the information of neighboring pixels that belong to the identical building. Therefore, we propose learning a novel representation for 3D buildings, namely 3D centripetal shifts, a unified representation of individual building instances. Our method is termed as 3DCentripetalNet and learns the 3D centripetal shift representation that incorporates planar and vertical structures of buildings. Afterward, a decoupling module is devised to learn building corner points. Finally, a 3D modeling module is designed to retrieve building height from the learned 3D centripetal shift map and corner points. We investigate the proposed 3DCentripetalNet on two datasets with different spatial resolutions, i.e., the ISPRS Vaihingen dataset (9 cm/pixel) and the Urban 3D dataset (50 cm/pixel). Experimental results suggest that 3DCentripetalNet is able to preserve sharp building boundaries, largely alleviate false detections, and significantly outperform other competitors. Thus, we believe that 3DCentripetalNet is a robust solution for the task of building height retrieval from monocular imagery.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据