4.3 Article

Optimism Bias Correction in Omics Studies with Big Data: Assessment of Penalized Methods on Simulated Data

期刊

OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY
卷 23, 期 4, 页码 207-213

出版社

MARY ANN LIEBERT, INC
DOI: 10.1089/omi.2018.0191

关键词

optimism bias; LASSO; variable selection; parameter estimation

向作者/读者索取更多资源

Big Data generated by omics technologies require simultaneous analyses of large numbers of variables. This leads to complex model selection and parameter estimates that show optimism bias. This study on simulated data sets examined optimism-bias correction by penalty regression methods in case-control studies that involve clinical and omics variables. Least absolute shrinkage and selection operator (LASSO)-based methods (LASSO-penalized logistic regression, adaptive LASSO, and regularized LASSO for selection + ridge regression) were evaluated using power, the false positive rate (FPR), false discovery rate (FDR), and by estimated versus theoretical parameter comparisons. The ordinary LASSO overcorrects the optimism bias. The adaptive LASSO with LASSO estimation of the weights was unable to provide a sufficient correction. Importantly, the adaptive LASSO with ridge estimation of the weights showed the best parameter estimation. The regularized LASSO selection showed a slight optimism bias that decreased with the increase in the training set size. The optimism bias decreased with the increase of the number of variables selected among truly differentially expressed variables; however, power, FPR, and FDR were correlated. A compromise between model selection and estimation accuracy should be found. These results might prove useful because Big Data analyses are becoming commonplace in omics/multiomics studies in integrative biology, precision medicine, and planetary health.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据