☆ 4.7 Article

Asymptotic Analysis of Sampling Estimators for Randomized Numerical Linear Algebra Algorithms

JOURNAL OF MACHINE LEARNING RESEARCH (2022)

期刊

JOURNAL OF MACHINE LEARNING RESEARCH

卷 23, 期 -, 页码 -

出版社

MICROTOME PUBL

关键词

least squares; randomized numerical linear algebra; leverage scores; asymp-totic distribution; mean squared error; asymptotic mean squared error

类别

Automation & Control Systems Computer Science, Artificial Intelligence

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

In this article, we develop an asymptotic analysis to derive the distribution of RandNLA sampling estimators for the least-squares problem. We show that the sampling estimator is asymptotically normally distributed under mild regularity conditions and is asymptotically unbiased in both full sample approximation and model parameter inference settings. Based on our asymptotic analysis, we identify optimal sampling probabilities using two criteria and propose several new optimal sampling probability distributions. Our theoretical and empirical results provide insights on the role of leverage in the sampling process and demonstrate improvements over existing methods.

The statistical analysis of Randomized Numerical Linear Algebra (RandNLA) algorithms within the past few years has mostly focused on their performance as point estimators. However, this is insufficient for conducting statistical inference, e.g., constructing confidence intervals and hypothesis testing, since the distribution of the estimator is lacking. In this article, we develop an asymptotic analysis to derive the distribution of RandNLA sampling estimators for the least-squares problem. In particular, we derive the asymptotic distribution of a general sampling estimator with arbitrary sampling probabilities in a fixed design setting. The analysis is conducted in two complementary settings, i.e., when the objective of interest is to approximate the full sample estimator, and when it is to infer the underlying ground truth model parameters. For each setting, we show that the sampling estimator is asymptotically normally distributed under mild regularity conditions. Moreover, the sampling estimator is asymptotically unbiased in both settings. Based on our asymptotic analysis, we use two criteria, the Asymptotic Mean Squared Error (AMSE) and the Expected Asymptotic Mean Squared Error (EAMSE), to identify optimal sampling probabilities. Several of these optimal sampling probability distributions are new to the literature, e.g., the root leverage sampling estimator and the predictor length sampling estimator. Our theoretical results clarify the role of leverage in the sampling process, and our empirical results demonstrate improvements over existing methods.

Asymptotic Analysis of Sampling Estimators for Randomized Numerical Linear Algebra Algorithms

期刊

JOURNAL OF MACHINE LEARNING RESEARCH

出版社

MICROTOME PUBL

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Asymptotic Analysis of Sampling Estimators for Randomized Numerical Linear Algebra Algorithms

期刊

JOURNAL OF MACHINE LEARNING RESEARCH

出版社

MICROTOME PUBL

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文