4.5 Article

Learning earth system models from observations: machine learning or data assimilation?

Publisher

ROYAL SOC
DOI: 10.1098/rsta.2020.0089

Keywords

machine learning; data assimilation; earth system modelling; inverse methods; remote sensing

Ask authors/readers for more resources

Recent progress in machine learning has inspired the idea of directly improving earth system models from observations, with data assimilation and machine learning sharing many similarities that can be unified under a Bayesian framework. Bayesian networks provide a practical framework for integrating data assimilation and machine learning to improve physical models of earth system processes.
Recent progress in machine learning (ML) inspires the idea of improving (or learning) earth system models directly from the observations. Earth sciences already use data assimilation (DA), which underpins decades of progress in weather forecasting. DA and ML have many similarities: they are both inverse methods that can be united under a Bayesian (probabilistic) framework. ML could benefit from approaches used in DA, which has evolved to deal with real observations-these are uncertain, sparsely sampled, and only indirectly sensitive to the processes of interest. DA could also become more like ML and start learning improved models of the earth system, using parameter estimation, or by directly incorporating machine-learnable models. DA follows the Bayesian approach more exactly in terms of representing uncertainty, and in retaining existing physical knowledge, which helps to better constrain the learnt aspects of models. This article makes equivalences between DA and ML in the unifying framework of Bayesian networks. These help illustrate the equivalences between four-dimensional variational (4D-Var) DA and a recurrent neural network (RNN), for example. More broadly, Bayesian networks are graphical representations of the knowledge and processes embodied in earth system models, giving a framework for organising modelling components and knowledge, whether coming from physical equations or learnt from observations. Their full Bayesian solution is not computationally feasible but these networks can be solved with approximate methods already used in DA and ML, so they could provide a practical framework for the unification of the two. Development of all these approaches could address the strand challenge of making better use of observations, improve physical models of earth system processes. This article is part of the theme issue 'Machine learning weather and climate modeling.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available