4.4 Article

Performance Assessment of Machine Learning Classifiers Using Selective Feature Approaches for Cervical Cancer Detection

期刊

WIRELESS PERSONAL COMMUNICATIONS
卷 124, 期 3, 页码 2335-2366

出版社

SPRINGER
DOI: 10.1007/s11277-022-09467-7

关键词

Cervical cancer; Scaling; Oversampling; Cross-validation; Parameter selection; Feature selection

向作者/读者索取更多资源

This article discusses the use of machine learning methods for cervical cancer diagnosis, addressing the issues of data imbalance and non-uniform scaling. By comparing the performance of different machine learning algorithms under different data processing methods, it is found that the random forest predictor performs the best when optimized with feature selection methods.
Worldwide, cervical cancer is the leading cause of death among women from cancer. The symptoms of this gynecological disease are difficult to recognize at early stage, especially in those countries that don't have facility of screening programs. In diagnosis of cervical cancer, machine learning methods can be used to detect the malignous cancer cells at initial stage. The foremost apprehension in disease diagnosis involves data imbalance issue and non-uniform scaling in dataset. In this article, a prevalent oversampling approach Synthetic Minority Oversampling Technique along with fivefold cross-validation is being used on unscaled and scaled data to handle these issues. A promising comparison is been made among the performance of most prevalent machine learning (ML) classifiers such as Naive Bayes, Logistic Regression, K-Nearest Neighbor, Support Vector Machine (SVM), Linear Discriminant analysis, Multi-Layer Perceptron, Decision Tree (DT) and Random Forest (RF) on unscaled data and scaled data obtained by Min-Max scaling, Standard scaling and Normalization. RF, SVM and DT are the top three ML algorithms obtained in cervical cancer diagnosis for which optimization possibilities are explored with feature selection methods as Univariate feature selection and Recursive feature elimination (RFE). Overall performance of Random Forest predictor with RFE (RF-RFE) is superior to all others being implemented.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据