期刊
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS
卷 10, 期 4, 页码 -出版社
WILEY
DOI: 10.1002/wics.1430
关键词
-
Support vector machine (SVM) classification is a statistical learning method which easily accommodates large numbers of predictors and can discover both linear and nonlinear relationships between the predictors and outcomes. A common challenge is constructing an SVM when the training set includes observations with missing predictor values. In this paper, we identify when missing data can bias an SVM classifier. Because the missing data mechanisms which bias SVMs differ from the traditional framework of missing-at-random and missing-not-at-random, we argue for an SVM-specific framework for understanding missing data. Furthermore, we compare a number of missing data strategies for SVMs in a simulation study and real data example, and we make recommendations for SVM users based on the simulation study.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据