4.7 Article

Multi-objective PSO based online feature selection for multi-label classification

期刊

KNOWLEDGE-BASED SYSTEMS
卷 222, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.knosys.2021.106966

关键词

Online feature selection (OFS); Particle swarm optimization (PSO); Multi-label classification; Multi-objective optimization; Redundant feature; Non-significant feature

向作者/读者索取更多资源

This paper presents an adaptive feature selection algorithm for multi-label classification scenarios, real-time selecting the optimal feature subset online. Through a three-phase filtering process, the algorithm improves the accuracy and efficiency of feature selection.
Feature selection approaches aim to select a set of prominent features that best describe the data to improve the efficiency without degrading the performance of the model. In many real-world applications such as social networks, it is not easy to get a static feature set; rather, new features arrive continuously in the system. Therefore, online feature selection (OFS) strategies have become popular in dealing with such problems. Recent years have also witnessed the prominence of multi label classification frameworks where multiple class labels can be associated with a single instance. The proposed method considers the multi-label learning and the arrival of features in an online fashion. The method automatically determines the best subset of features that is suitable for multi label classification. A three-phase filtering process is applied to select the appropriate features. The first phase is an evolutionary-based particle swarm optimization (PSO) technique that applies to the group of incoming features in a multi-objective framework. The second phase checks the redundancy of features selected in the current group to the already selected features and finally, the third phase finds the features in the already selected feature list that becomes non-significant on the selection of newly arrived features and discards them. The proposed algorithm is tested on fourteen multi-label data sets collected from various domains such as biology, music, and text. From the results, it is observed that the first and second phases are sufficient to select the appropriate feature set. The efficacy of the proposed algorithm can be verified from the obtained results. It outperforms the results obtained by state-of-the-art approaches in most cases. (C) 2021 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据