4.7 Article

Feature Selection for Classification using Principal Component Analysis and Information Gain

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 174, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2021.114765

关键词

Feature selection; Classification; Dimensionality reduction; Filter model; Information gain; Principal component analysis

向作者/读者索取更多资源

This study investigates the application of feature selection and classification in various fields, addressing the challenges of high dimensionality in datasets and the negative impact of irrelevant and redundant attributes on classification algorithms. To improve classification performance, a hybrid filter model based on principal component analysis and information gain is proposed and applied to machine learning techniques, demonstrating enhanced accuracy, precision, and recall.
Feature Selection and classification have previously been widely applied in various areas like business, medical and media fields. High dimensionality in datasets is one of the main challenges that has been experienced in classifying data, data mining and sentiment analysis. Irrelevant and redundant attributes have also had a negative impact on the complexity and operation of algorithms for classifying data. Consequently, the algorithms record poor results or performance. Some existing work use all attributes for classification, some of which are insignificant for the task, thereby leading to poor performance. This paper therefore develops a hybrid filter model for feature selection based on principal component analysis and information gain. The hybrid model is then applied to support classification using machine learning techniques e.g. the Naive Bayes technique. Experimental results demonstrate that the hybrid filter model reduces data dimensions, selects appropriate feature sets, and reduces training time, hence providing better classification performance as measured by accuracy, precision and recall..

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据