4.6 Article

Ensemble Learners of Multiple Deep CNNs for Pulmonary Nodules Classification Using CT Images

期刊

IEEE ACCESS
卷 7, 期 -, 页码 110358-110371

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2019.2933670

关键词

Lung cancer; pulmonary nodules; CT; machine learning; ensemble learning; convolutional neural networks

资金

  1. National Natural Science Foundation of China [81671773, 61672146]
  2. Fundamental Research Funds for the Central Universities [N172008008, N180719020]
  3. Open Program of Neusoft Research of Intelligent Healthcare Technology Company Ltd. [NRIHTOP1803]

向作者/读者索取更多资源

Various deep convolutional neural networks (CNNs) have been used to distinguish between benign and malignant pulmonary nodules using CT images. However, single learner usually presents unsatisfied performance due to limited hypothesis space, or falling into local minima, or wrong selection of hypothesis space. To tackle these issues, we propose to build ensemble learners through fusing multiple deep CNN learners for pulmonary nodules classification. CT image patches of 743 nodules are extracted from LIDC-IDRI database and utilized. First, eight deep CNN learners with different architectures are trained and evaluated by 10-fold cross-validation. Each nodule has eight predictions from the eight primary learners. Second, we fuse these eight predictions by the strategies of majority voting (VOT), averaging (AVE), or machine learning. Specifically, different machine learning algorithms including K-Nearest-Neighbor (KNN), Support Vector Machines (SVM), Naive Bayes (NB), Decision Trees (DT), Multi-layer Perceptron (MLP), Random Forests (RF), Gradient Boosting Regression Trees (GBRT) and Adaptive Boosting (AdaBoost) are implemented. Moreover, the correlation coefficients between the predictions of 10 ensemble learners are calculated, and the hierarchical clustering dendrogram is drawn. It is found that the ensemble learners achieve higher prediction accuracy (84.0% vs 81.7%) than single CNN learner. The overlap ratio among the 10 ensemble learners is much higher than that of the 8 primary learners (62.9% vs 33.2%). In addition, it is shown that ensemble learners are roughly divided into three categories: the first (SVM, MLP, GBRT and RF) achieves the best performance; the second (VOT and AVE) is better than the third (AdaBoost, DT, NB and KNN). VOT and AVE yield higher recall than the machine learning algorithms. These results indicate that ensemble learners based on multiple CNN learners can achieve better performances for pulmonary nodules classification using CT images and that preferred fusion strategies include SVM, MLP, GBRT and RF.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据