4.7 Article

Asymptotic Analysis of Sampling Estimators for Randomized Numerical Linear Algebra Algorithms

期刊

出版社

MICROTOME PUBL

关键词

least squares; randomized numerical linear algebra; leverage scores; asymp-totic distribution; mean squared error; asymptotic mean squared error

向作者/读者索取更多资源

In this article, we develop an asymptotic analysis to derive the distribution of RandNLA sampling estimators for the least-squares problem. We show that the sampling estimator is asymptotically normally distributed under mild regularity conditions and is asymptotically unbiased in both full sample approximation and model parameter inference settings. Based on our asymptotic analysis, we identify optimal sampling probabilities using two criteria and propose several new optimal sampling probability distributions. Our theoretical and empirical results provide insights on the role of leverage in the sampling process and demonstrate improvements over existing methods.
The statistical analysis of Randomized Numerical Linear Algebra (RandNLA) algorithms within the past few years has mostly focused on their performance as point estimators. However, this is insufficient for conducting statistical inference, e.g., constructing confidence intervals and hypothesis testing, since the distribution of the estimator is lacking. In this article, we develop an asymptotic analysis to derive the distribution of RandNLA sampling estimators for the least-squares problem. In particular, we derive the asymptotic distribution of a general sampling estimator with arbitrary sampling probabilities in a fixed design setting. The analysis is conducted in two complementary settings, i.e., when the objective of interest is to approximate the full sample estimator, and when it is to infer the underlying ground truth model parameters. For each setting, we show that the sampling estimator is asymptotically normally distributed under mild regularity conditions. Moreover, the sampling estimator is asymptotically unbiased in both settings. Based on our asymptotic analysis, we use two criteria, the Asymptotic Mean Squared Error (AMSE) and the Expected Asymptotic Mean Squared Error (EAMSE), to identify optimal sampling probabilities. Several of these optimal sampling probability distributions are new to the literature, e.g., the root leverage sampling estimator and the predictor length sampling estimator. Our theoretical results clarify the role of leverage in the sampling process, and our empirical results demonstrate improvements over existing methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据