4.6 Article

Nonequilibrium statistical mechanics and optimal prediction of partially-observed complex systems

Journal

NEW JOURNAL OF PHYSICS
Volume 24, Issue 10, Pages -

Publisher

IOP Publishing Ltd
DOI: 10.1088/1367-2630/ac95b7

Keywords

nonequilibrium statistical mechanics; partially-observed systems; Koopman operator; Mori-Zwanzig; data-driven models

Funding

  1. National Science Foundation [DMS-1440415]
  2. US Department of Energy through the LANL/LDRD Program
  3. US Department of Energy through the Center for Nonlinear Studies
  4. APRA E Program: Design Intelligence Fostering Formidable Energy Reduction and Enabling Novel Totally Impactful Advanced Technology Enhancements (DIFFERENTIATE) [DE-AR0001202]
  5. Templeton World Charity Foundation [TWCF0570]
  6. Foundational Questions Institute
  7. Fetzer Franklin Fund [FQXI-RFP-CPW-2007]
  8. US Army Research Laboratory
  9. US Army Research Office [W911NF-21-1-0048, W911NF-18-1-0028]
  10. US Department of Energy [DE-SC0017324]
  11. U.S. Department of Energy (DOE) [DE-SC0017324] Funding Source: U.S. Department of Energy (DOE)

Ask authors/readers for more resources

This article explains the success of data-driven models in partially-observed systems. Data-driven models use techniques such as delay-coordinate embeddings to implicitly model the effects of missing degrees of freedom, outperforming physics-based models in systems with few observable degrees of freedom.
Only a subset of degrees of freedom are typically accessible or measurable in real-world systems. As a consequence, the proper setting for empirical modeling is that of partially-observed systems. Notably, data-driven models consistently outperform physics-based models for systems with few observable degrees of freedom; e.g. hydrological systems. Here, we provide an operator-theoretic explanation for this empirical success. To predict a partially-observed system's future behavior with physics-based models, the missing degrees of freedom must be explicitly accounted for using data assimilation and model parametrization. Data-driven models, in contrast, employ delay-coordinate embeddings and their evolution under the Koopman operator to implicitly model the effects of the missing degrees of freedom. We describe in detail the statistical physics of partial observations underlying data-driven models using novel maximum entropy and maximum caliber measures. The resulting nonequilibrium Wiener projections applied to the Mori-Zwanzig formalism reveal how data-driven models may converge to the true dynamics of the observable degrees of freedom. Additionally, this framework shows how data-driven models infer the effects of unobserved degrees of freedom implicitly, in much the same way that physics models infer the effects explicitly. This provides a unified implicit-explicit modeling framework for predicting partially-observed systems, with hybrid physics-informed machine learning methods combining both implicit and explicit aspects.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available