4.6 Article

Linear or Nonlinear? Automatic Structure Discovery for Partially Linear Models

期刊

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
卷 106, 期 495, 页码 1099-1112

出版社

TAYLOR & FRANCIS INC
DOI: 10.1198/jasa.2011.tm10281

关键词

Model selection; RKHS; Semiparametric regression; Shrinkage; Smoothing splines

资金

  1. NSF [DMS-0645293, DMS-0906497, DMS-0747575]
  2. NIH [NIH/NCI R01 CA-085848, NIH/NCI R01 CA149569, NIH/NCI P01 CA142538]
  3. Division Of Mathematical Sciences
  4. Direct For Mathematical & Physical Scien [0906497, 1347844] Funding Source: National Science Foundation

向作者/读者索取更多资源

Partially linear models provide a useful class of tools for modeling complex data by naturally incorporating a combination of linear and nonlinear effects within one framework. One key question in partially linear models is the choice of model structure, that is, how to decide which covariates are linear and which are nonlinear. This is a fundamental, yet largely unsolved problem for partially linear models. In practice, one often assumes that the model structure is given or known and then makes estimation and inference based on that structure. Alternatively, there are two methods in common use for tackling the problem: hypotheses testing and visual screening based on the marginal fits. Both methods are quite useful in practice but have their drawbacks. First, it is difficult to construct a powerful procedure for testing multiple hypotheses of linear against nonlinear fits. Second, the screening procedure based on the scatterplots of individual covariate fits may provide an educated guess on the regression function form, but the procedure is ad hoc and lacks theoretical justifications. In this article, we propose a new approach to structure selection for partially linear models, called the LAND (Linear And Nonlinear Discoverer). The procedure is developed in an elegant mathematical framework and possesses desired theoretical and computational properties. Under certain regularity conditions, we show that the LAND estimator is able to identify the underlying true model structure correctly and at the same time estimate the multivariate regression function consistently. The convergence rate of the new estimator is established as well. We further propose an iterative algorithm to implement the procedure and illustrate its performance by simulated and real examples. Supplementary materials for this article are available online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据