4.6 Article

A novel method for prediction of skin disease through supervised classification techniques

期刊

SOFT COMPUTING
卷 26, 期 19, 页码 10527-10533

出版社

SPRINGER
DOI: 10.1007/s00500-022-07435-8

关键词

Skin disease; Feature selection; Supervised classification; KNN; SVM; Random forest

向作者/读者索取更多资源

Skin diseases have a significant impact on both the physical and psychological health of patients, and accurately predicting the disease cases is crucial for effective treatment. However, selecting appropriate features from the vast amount of healthcare data available is challenging. This study aims to identify significant attributes and remove irrelevant features to improve the performance of the model.
Skin diseases are the most important worrying problems in societies because it affects the patients both physically and psychologically. Skin disease is one of the highly prone to risk with an association of climatic factors around the world. Predicting the skin disease cases associated with influencing factors is the most crucial task. It is very difficult task to identify the appropriate and optimal features for skin disease from the large volume of health sector data available in the world. Previous researchers applied different types of ensemble features selection techniques for the appropriate selection of features which gives highest accuracy with minimum computation time. Classification rate of any algorithm depends on feature extraction techniques and classifier used for classification purpose. Data availability is one of the most significant drawbacks in the health sector if data are available that might be in raw format. Filling missing value and type conversion almost takes 70% of the time. The missing value can be addressed by statistical parameters such as mean, average and median with stand mechanism in machine learning. The objective of this paper is the selection of significant attributes and removes irrelevant features that affect model performance. The performance of skin disease data can be experimented through K-nearest neighbor, support vector machine and random forest classifier. The entire research is carried out on the real-time dataset. The efficiency of the proposed approach is measured through confusion matrix, accuracy, F-measure, precision and recall.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据