4.7 Article

Inference of stochastic time series with missing data

Journal

PHYSICAL REVIEW E
Volume 104, Issue 2, Pages -

Publisher

AMER PHYSICAL SOC
DOI: 10.1103/PhysRevE.104.024119

Keywords

-

Funding

  1. Intramural Research Program of the National Institutes of Health, NIDDK
  2. New Faculty Startup Fund from Seoul National University
  3. National Research Foundation of Korea (NRF) Grant - Korea government (MSIT) [2019R1F1A1052916]
  4. National Research Foundation of Korea [2019R1F1A1052916] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

Inferring dynamics from incomplete time series data is challenging, but an expectation maximization algorithm proposed in this study demonstrates effectiveness in restoring missing data points and inferring underlying network models. Balancing consistency between observed and missing data points is crucial for accurate model inference during iterative processes.
Inferring dynamics from time series is an important objective in data analysis. In particular, it is challenging to infer stochastic dynamics given incomplete data. We propose an expectation maximization (EM) algorithm that iterates between alternating two steps: E-step restores missing data points, while M-step infers an underlying network model from the restored data. Using synthetic data of a kinetic Ising model, we confirm that the algorithm works for restoring missing data points as well as inferring the underlying model. At the initial iteration of the EM algorithm, the model inference shows better model-data consistency with observed data points than with missing data points. As we keep iterating, however, missing data points show better model-data consistency. We find that demanding equal consistency of observed and missing data points provides an effective stopping criterion for the iteration to prevent going beyond the most accurate model inference. Using the EM algorithm and the stopping criterion together, we infer missing data points from a time-series data of real neuronal activities. Our method reproduces collective properties of neuronal activities such as correlations and firing statistics even when 70% of data points are masked as missing points.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available