☆ 4.3 Article

Scoring Algorithms for a Computer-Based Cognitive Screening Tool: An Illustrative Example of Overfitting Machine Learning Approaches and the Impact on Estimates of Classification Accuracy

PSYCHOLOGICAL ASSESSMENT (2019)

期刊

PSYCHOLOGICAL ASSESSMENT

卷 31, 期 11, 页码 1377-1382

出版社

AMER PSYCHOLOGICAL ASSOC

DOI: 10.1037/pas0000764

关键词

mild cognitive impairment; accuracy; computerized testing; machine learning

类别

Psychology, Clinical

资金

Canadian Institutes for Health Research
Canadian Consortium on Neurodegeneration in Aging (CCNA)
Canadian Institutes of Health Research
Saskatchewan Health Research Foundation
Department of Family and Community Medicine, University of Toronto
Sunnybrook Health Sciences Centre

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Computerized cognitive screening tools, such as the self-administered Computerized Assessment of Memory Cognitive Impairment (CAMCI), require little training and ensure standardized administration and could be an ideal test for primary care settings. We conducted a secondary analysis of a data set including 887 older adults (M age = 72.7 years, SD = 7.1 years; 32.1% male; M years education = 13.4, SD = 2.7 years) with CAMCI scores and independent diagnoses of mild cognitive impairment (MCI). A study by the CAMCI developers used a portion of this data set with a machine learning decision tree model and suggested that the CAMCI had high classification accuracy for MCI (sensitivity = 0.86, specificity = 0.94). We found similar support for accuracy (sensitivity = 0.94, specificity = 0.94) by overfitting a decision tree model, but we found evidence of lower accuracy in a cross-validation sample (sensitivity = 0.62, specificity = 0.66). A logistic regression model, however, discriminated modestly in both training (sensitivity = 0.72, specificity = 0.80) and cross-validation data sets (sensitivity = 0.69, specificity = 0.74). Evidence for strong accuracy when overfitting a decision tree model and substantially reduced accuracy in cross-validation samples was replicated across 500 bootstrapped samples. In contrast, the evidence for accuracy of the logistic regression model was similar in the training and cross-validation samples. The logistic regression model produced accuracy estimates consistent with other published CAMCI studies, suggesting evidence for classification accuracy of the CAMCI for MCI is likely modest. This case study illustrates the general need for cross-validation and careful evaluation of the generalizability of machine learning models.

Scoring Algorithms for a Computer-Based Cognitive Screening Tool: An Illustrative Example of Overfitting Machine Learning Approaches and the Impact on Estimates of Classification Accuracy

期刊

PSYCHOLOGICAL ASSESSMENT

出版社

AMER PSYCHOLOGICAL ASSOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Scoring Algorithms for a Computer-Based Cognitive Screening Tool: An Illustrative Example of Overfitting Machine Learning Approaches and the Impact on Estimates of Classification Accuracy

期刊

PSYCHOLOGICAL ASSESSMENT

出版社

AMER PSYCHOLOGICAL ASSOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文