4.6 Article

Classification of complete blood count and haemoglobin typing data by a C4.5 decision tree, a naive Bayes classifier and a multilayer perceptron for thalassaemia screening

期刊

BIOMEDICAL SIGNAL PROCESSING AND CONTROL
卷 7, 期 2, 页码 202-212

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.bspc.2011.03.007

关键词

C4.5; Complete blood count; Haemoglobin typing; Multilayer perceptron; Naive Bayes classifier; Thalassaemia

资金

  1. Thailand Research Fund (TRF) through Royal Golden Jubilee Ph.D. Programme [PHD/1.E.KN.51/A.1, PHD/1.E.KN.50/A.1, PHD/1.E.KN.49/A.1]
  2. Faculty of Engineering of the King Mongkut's University of Technology North Bangkok
  3. Mahidol Research Grant
  4. Office of the Higher Education Commission

向作者/读者索取更多资源

This article presents the classification of blood characteristics by a C4.5 decision tree, a naive Bayes classifier and a multilayer perceptron for thalassaemia screening. The aim is to classify eighteen classes of thalassaemia abnormality, which have a high prevalence in Thailand, and one control class by inspecting data characterised by a complete blood count (CBC) and haemoglobin typing. Two indices namely a haemoglobin concentration (HB) and a mean corpuscular volume (MCV) are the chosen CBC attributes. On the other hand, known types of haemoglobin from six ranges of retention time identified via high performance liquid chromatography (HPLC) are the chosen haemoglobin typing attributes. The stratified 10-fold cross-validation results indicate that the best classification performance with average accuracy of 93.23% (standard deviation = 1.67%) and 92.60% (standard deviation = 1.75%) is achieved when the naive Bayes classifier and the multilayer perceptron are respectively applied to samples which have been pre-processed by attribute discretisation. The results also suggest that the HB attribute is redundant. Moreover, the achieved classification performance is significantly higher than that obtained using only haemoglobin typing attributes as classifier inputs. Subsequently, the naive Bayes classifier and the multilayer perceptron are applied to an additional data set in a clinical trial which respectively results in accuracy of 99.39% and 99.71%. These results suggest that a combination of CBC and haemoglobin typing analysis with a naive Bayes classifier or a multilayer perceptron is highly suitable for automatic thalassaemia screening. (C) 2011 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据