4.7 Article

Ordered homogeneity pursuit lasso for group variable selection with applications to spectroscopic data

期刊

出版社

ELSEVIER SCIENCE BV
DOI: 10.1016/j.chemolab.2017.07.004

关键词

Lasso; Homogeneity pursuit; Sparse learning; Variable ordering; Grouping effect; Partial least squares

资金

  1. National Natural Science Foundation of China [11271374, 11561010]
  2. Key Laboratory for Mixed and Missing Data Statistics of the Education Department of Guangxi Province [GXMMSL201404]
  3. Mathematics and Interdisciplinary Sciences Project
  4. Central South University

向作者/读者索取更多资源

In high-dimensional data modeling, variable selection methods have been a popular choice to improve the prediction accuracy by effectively selecting the subset of informative variables, and such methods can enhance the model interpretability with sparse representation. In this study, we propose a novel group variable selection method named ordered homogeneity pursuit lasso (OHPL) that takes the homogeneity structure in high dimensional data into account. OHPL is particularly useful in high-dimensional datasets with strongly correlated variables. We illustrate the approach using three real-world spectroscopic datasets and compare it with four state-of-the-art variable selection methods. The benchmark results on real-world data show that the proposed method is capable of identifying a small number of influential groups and has better prediction performance than its competitors. The OHPL method and the spectroscopic datasets are implemented and included in an R package OHPL available from https://ohpl.io.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据