☆ 4.7 Article Proceedings Paper

Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance

NEURAL NETWORKS (2008)

期刊

NEURAL NETWORKS

卷 21, 期 2-3, 页码 427-436

出版社

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.neunet.2007.12.031

关键词

classification; feed-forward neural networks; class imbalance; computer-aided diagnosis

类别

Computer Science, Artificial Intelligence Neurosciences

资金

NCI NIH HHS [R01 CA101911, R01 CA095061-04, R01 CA112437-03, R01 CA095061, R01 CA112437, R01 CA101911-04] Funding Source: Medline

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

This Study investigates the effect of class imbalance in training data when developing neural network classifiers for computer-aided medical diagnosis. The investigation is performed in the presence of other characteristics that are typical among medical data, namely small training sample size, large number of features, and correlations between features. Two methods of neural network training are explored: classical backpropagation (BP) and particle swarm optimization (PSO) with clinically relevant training criteria. An experimental Study is performed using simulated data and the conclusions are further validated on real clinical data for breast cancer diagnosis. The results show that classifier performance deteriorates with even modest class imbalance in the training data. Further, it is shown that BP is generally preferable over PSO for imbalanced training data especially with small data sample and large number of features. Finally, it is shown that there is no clear preference between oversampling and no compensation approach and some guidance is provided regarding a proper selection. (C) 2007 Elsevier Ltd. All rights reserved.

Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance

期刊

NEURAL NETWORKS

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Training neural network classifiers for medical decision making: The effects of imbalanced datasets on classification performance

期刊

NEURAL NETWORKS

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文