☆ 4.7 Article

A fuzzy-based data transformation for feature extraction to increase classification performance with small medical data sets

ARTIFICIAL INTELLIGENCE IN MEDICINE (2011)

期刊

ARTIFICIAL INTELLIGENCE IN MEDICINE

卷 52, 期 1, 页码 45-52

出版社

ELSEVIER

DOI: 10.1016/j.artmed.2011.02.001

关键词

Feature extraction; Kernel principal component analysis; Support vector machine; Binary classification

类别

Computer Science, Artificial Intelligence Engineering, Biomedical Medical Informatics

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Objective: Medical data sets are usually small and have very high dimensionality. Too many attributes will make the analysis less efficient and will not necessarily increase accuracy, while too few data will decrease the modeling stability. Consequently, the main objective of this study is to extract the optimal subset of features to increase analytical performance when the data set is small. Methods: This paper proposes a fuzzy-based non-linear transformation method to extend classification related information from the original data attribute values for a small data set. Based on the new transformed data set, this study applies principal component analysis (PCA) to extract the optimal subset of features. Finally, we use the transformed data with these optimal features as the input data for a learning tool, a support vector machine (SVM). Six medical data sets: Pima Indians' diabetes, Wisconsin diagnostic breast cancer, Parkinson disease, echocardiogram, BUPA liver disorders dataset, and bladder cancer cases in Taiwan, are employed to illustrate the approach presented in this paper. Results: This research uses the t-test to evaluate the classification accuracy for a single data set; and uses the Friedman test to show the proposed method is better than other methods over the multiple data sets. The experiment results indicate that the proposed method has better classification performance than either PCA or kernel principal component analysis (KPCA) when the data set is small, and suggest creating new purpose-related information to improve the analysis performance. Conclusion: This paper has shown that feature extraction is important as a function of feature selection for efficient data analysis. When the data set is small, using the fuzzy-based transformation method presented in this work to increase the information available produces better results than the PCA and KPCA approaches. (C) 2011 Elsevier B.V. All rights reserved.

A fuzzy-based data transformation for feature extraction to increase classification performance with small medical data sets

期刊

ARTIFICIAL INTELLIGENCE IN MEDICINE

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A fuzzy-based data transformation for feature extraction to increase classification performance with small medical data sets

期刊

ARTIFICIAL INTELLIGENCE IN MEDICINE

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文