4.6 Article

NONADAPTIVE QUASI-OPTIMAL POINTS SELECTION FOR LEAST SQUARES LINEAR REGRESSION

期刊

SIAM JOURNAL ON SCIENTIFIC COMPUTING
卷 38, 期 1, 页码 A385-A411

出版社

SIAM PUBLICATIONS
DOI: 10.1137/15M1015868

关键词

linear regression; least squares; stochastic collocation

资金

  1. National Science Foundation (NSF) [1418771]
  2. Air Force Office of Scientific Research (AFOSR) [FA95501410022]
  3. Department of Energy [DESC0011615]
  4. Direct For Mathematical & Physical Scien
  5. Division Of Mathematical Sciences [1656459] Funding Source: National Science Foundation
  6. Division Of Mathematical Sciences
  7. Direct For Mathematical & Physical Scien [1418771] Funding Source: National Science Foundation

向作者/读者索取更多资源

In this paper we present a quasi-optimal sample set for ordinary least squares (OLS) regression. The quasi-optimal set is designed in such a way that, for a given number of samples, it can deliver the regression result as close as possible to the result obtained by a (much) larger set of candidate samples. The quasi-optimal set is determined by maximizing a quantity measuring the mutual column orthogonality and the determinant of the model matrix. This procedure is nonadaptive, in the sense that it does not depend on the sample data. This is useful in practice, as it allows one to determine, prior to the potentially expensive data collection procedure, where to sample the underlying system. In addition to presenting the theoretical motivation of the quasi-optimal set, we also present its efficient implementation via a greedy algorithm, along with several numerical examples to demonstrate its efficacy. Since the quasi-optimal set allows one to obtain a near optimal regression result under any affordable number of samples, it notably outperforms other standard choices of sample sets. An immediate application of the quasi-optimal set is stochastic collocation for uncertainty quantification, where data collection usually requires expensive PDE solutions. It is demonstrated that the quasi-optimal set OLS can deliver very competitive results, compared to the generalized polynomial chaos Galerkin method, and yet it remains fully nonintrusive and is much more flexible for practical computations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据