☆ 4.7 Article

LSAR: Efficient Leverage Score Sampling Algorithm for the Analysis of Big Time Series Data

JOURNAL OF MACHINE LEARNING RESEARCH (2022)

Journal

JOURNAL OF MACHINE LEARNING RESEARCH

Volume 23, Issue -, Pages 1-36

Publisher

MICROTOME PUBL

Keywords

autoregressive model; maximum likelihood estimation; big data regime; ran-domized numerical linear algebra; sampling

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This paper applies methods from RandNLA to develop improved algorithms for analyzing large-scale time series data. A fast algorithm is developed to estimate the leverage scores of an AR model in big data regimes, showing high accuracy. Using these theoretical results, an efficient algorithm called LSAR is proposed to fit an appropriate AR model to big time series data, with high probability of finding maximum likelihood estimates and significantly improving running time compared to state-of-the-art alternatives.

We apply methods from randomized numerical linear algebra (RandNLA) to develop improved algorithms for the analysis of large-scale time series data. We first develop a new fast algorithm to estimate the leverage scores of an autoregressive (AR) model in big data regimes. We show that the accuracy of approximations lies within (1 + O (epsilon)) of the true leverage scores with high probability. These theoretical results are subsequently exploited to develop an efficient algorithm, called LSAR, for fitting an appropriate AR model to big time series data. Our proposed algorithm is guaranteed, with high probability, to find the maximum likelihood estimates of the parameters of the underlying true AR model and has a worst case running time that significantly improves those of the state-of-the-art alternatives in big data regimes. Empirical results on large-scale synthetic as well as real data highly support the theoretical results and reveal the efficacy of this new approach.

LSAR: Efficient Leverage Score Sampling Algorithm for the Analysis of Big Time Series Data

Journal

JOURNAL OF MACHINE LEARNING RESEARCH

Publisher

MICROTOME PUBL

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

LSAR: Efficient Leverage Score Sampling Algorithm for the Analysis of Big Time Series Data

Journal

JOURNAL OF MACHINE LEARNING RESEARCH

Publisher

MICROTOME PUBL

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper