4.7 Article

Assessing the representational accuracy of data-driven models: The case of the effect of urban green infrastructure on temperature

Journal

ENVIRONMENTAL MODELLING & SOFTWARE
Volume 141, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.envsoft.2021.105048

Keywords

Urban heat; Machine learning; Representational accuracy; Interpretable machine learning; Data-driven modelling

Funding

  1. Swiss National Science Foundation, National Research Programme 75 Big Data [167215]

Ask authors/readers for more resources

This study discusses the accuracy of using data-driven models for environmental science predictions and the methods for studying the phenomenon. It proposes three criteria to assess the relationships learned by machine learning algorithms to improve the understanding of modeled phenomena.
Data-driven modelling with machine learning (ML) is already being used for predictions in environmental science. However, it is less clear to what extent data-driven models that successfully predict a phenomenon are representationally accurate and thus increase our understanding of the phenomenon. Besides empirical accuracy, we propose three criteria to indirectly assess the relationships learned by the ML algorithms and how they relate to a phenomenon under investigation: first, consistency of the outcomes with background knowledge; second, the adequacy of the measurements, datasets and methods used to construct a data-driven model; third, the robustness of interpretable machine learning analyses across different ML algorithms. We apply the three criteria with a case study modelling of the effect of different urban green infrastructure types on temperature and show that our approach improves the assessment of representational accuracy and reduces representational uncertainty, which can improve the understanding of modelled phenomena.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available