4.7 Article

Deep residential representations: Using unsupervised learning to unlock elevation data for geo-demographic prediction

Journal

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING
Volume 187, Issue -, Pages 378-392

Publisher

ELSEVIER
DOI: 10.1016/j.isprsjprs.2022.03.015

Keywords

LiDAR; Geo-demographics; Self-supervised learning; Deep learning

Funding

  1. Economic and Social Research Council [ES/P000673/1]
  2. Natural Sciences and Engineering Research Council of Canada (NSERC) [RGPIN-2020-07114]
  3. Canada Research Chairs program

Ask authors/readers for more resources

LiDAR technology provides detailed three-dimensional elevation maps of urban and rural landscapes. This paper proposes a convenient task-agnostic tile elevation embedding method using unsupervised Deep Learning. The potential of the embeddings is tested by predicting deprivation indices, showing improved performance compared to using standard demographic features alone. The paper also demonstrates the coherent tile segments generated by the embedding pipeline using Deep Learning and K-means clustering.
LiDAR (short for Light Detection And Ranging or Laser Imaging, Detection, And Ranging) technology can be used to provide detailed three-dimensional elevation maps of urban and rural landscapes. The geographically granular and open-source nature of this data lends itself to an array of societal, organisational and business applications where geo-demographic type data is utilised. However, the complexity involved in processing this multi-dimensional data in raw form has thus far restricted its practical adoption. This paper proposes a series of convenient task-agnostic tile elevation embeddings to address this challenge, using recent advances from unsupervised Deep Learning. We test the potential of our embeddings by predicting seven English indices of deprivation (2019) for small geographies in the Greater London area. These indices cover a range of socioeconomic outcomes and serve as a proxy for a wide variety of potential downstream tasks to which the embeddings can be applied. We consider the suitability of this data not just on its own but also as an auxiliary source of data in combination with demographic features, thus providing a realistic use case for the embeddings. Having trialled various model/embedding configurations, we find that our best performing embeddings lead to RootMean-Squared-Error (RMSE) improvements of up to 21% over using standard demographic features alone. We also demonstrate how our embedding pipeline, using Deep Learning combined with K-means clustering, produces coherent tile segments which allow the latent embedding features to be interpreted.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available