4.7 Review

Feature selection for online streaming high-dimensional data: A state-of-the-art review

Related references

Note: Only part of the references are listed.
Article Computer Science, Artificial Intelligence

A framework for feature selection through boosting

Ahmad Alsahaf et al.

Summary: This study introduces a novel feature selection framework based on boosting algorithm for selecting informative feature sets in classification problems. Comparative experiments on benchmark datasets show that the proposed method achieves higher accuracy with fewer features on most datasets, and the selected features exhibit lower redundancy.

EXPERT SYSTEMS WITH APPLICATIONS (2022)

Review Mathematics

A Review of the Modification Strategies of the Nature Inspired Algorithms for Feature Selection Problem

Ruba Abu Khurma et al.

Summary: This survey provides a research repository and reference for researchers planning to develop new Nature-inspired Algorithms for Feature Selection problems (NIAs-FS). It includes a thorough literature review, with a focus on feature selection, optimization algorithms, and modifications applied to NIAs in order to solve FS problems. The survey presents an overview of 156 articles that discuss NIAs modifications for tackling FS, supported by analytical views, visualized statistics, applied examples, open-source software systems, and discussions on open issues related to FS and NIAs. The survey also summarizes the main foundations of NIAs-FS, including the most popular operator (chaotic maps) and the most widely used modification technique (hybridization with a classifier). Microarray and medical applications are identified as the dominant domains for modified and used NIA-FS. Despite the popularity of NIAs-FS, there is still a need for further investigation in many areas.

MATHEMATICS (2022)

Article Computer Science, Artificial Intelligence

A K-Means clustering and SVM based hybrid concept drift detection technique for network anomaly detection

Meenal Jain et al.

Summary: This paper introduces the field of data stream mining and its application in anomaly detection in network traffic. Due to concept drift in the data streams, traditional machine learning algorithms face challenges in accuracy and false alarms. To address this issue, the paper proposes two new techniques for concept drift detection and utilizes sliding window and K-Means Clustering for data reduction and training dataset enhancement. Experimental results demonstrate improved classification accuracy and performance metrics using the proposed approach.

EXPERT SYSTEMS WITH APPLICATIONS (2022)

Article Computer Science, Artificial Intelligence

Adaptive ensemble of self-adjusting nearest neighbor subspaces for multi-label drifting data streams

Gavin Alberghini et al.

Summary: This paper introduces a novel ensemble method AESAKNNS for multi-label drifting streams, which adapts to concept drift by training base classifiers on different subspaces and monitoring drift occurrences. Experimental results support the better performance of AESAKNNS compared to other classifiers in diverse multi-label datasets.

NEUROCOMPUTING (2022)

Article Computer Science, Artificial Intelligence

An evidential reasoning rule based feature selection for improving trauma outcome prediction

Fatima Almaghrabi et al.

Summary: Key features for accurately predicting patient outcomes can be selected through random forest, ReliefF, and evidential reasoning (ER) rule. The impact of outcome class imbalance on feature selection is discussed, with synthetic minority over-sampling technique (SMOTE) showing differences in selected features. The highest prediction performance is achieved by the ER rule for selecting features, followed by ReliefF and random forest.

APPLIED SOFT COMPUTING (2021)

Article Computer Science, Artificial Intelligence

Online feature selection system for big data classification based on multi-objective automated negotiation

Fatma BenSaid et al.

Summary: Feature Selection plays a crucial role in learning and classification tasks by selecting relevant and non-redundant features. This paper introduced an online feature selection system MOANOFS that utilizes Multi-Objective Automated Negotiation to enhance classification performance for ultra-high dimensional databases.

PATTERN RECOGNITION (2021)

Article Computer Science, Artificial Intelligence

Condition-CNN: A hierarchical multi-label fashion image classification model

Brendan Kolisnik et al.

Summary: We propose a hierarchical image classification model, Condition-CNN, which improves prediction accuracy and reduces training time by using the Teacher Forcing training algorithm and conditional probabilities. The validation results show that Condition-CNN achieves higher prediction accuracy for Level 1, 2, and 3 classes compared to other baseline CNN models.

EXPERT SYSTEMS WITH APPLICATIONS (2021)

Article Computer Science, Artificial Intelligence

Online group streaming feature selection considering feature interaction

Peng Zhou et al.

Summary: This paper focuses on the interaction of features within and between streaming groups, proposing an Online Group Streaming Feature Selection method named OGSFS-FI, which consists of two stages: online intra-group selection and online inter-group selection. The method utilizes a new pair selection strategy and the elastic net method for efficient and effective feature selection.

KNOWLEDGE-BASED SYSTEMS (2021)

Article Computer Science, Information Systems

Multi-objective Cuckoo Search-based Streaming Feature Selection for Multi-label Dataset

Dipanjyoti Paul et al.

Summary: Feature selection is crucial for selecting relevant features and eliminating irrelevant or redundant features, particularly in multi-label datasets where label co-relation and grouping features play a significant role in improving model efficiency and quality.

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2021)

Article Computer Science, Information Systems

Towards an Unsupervised Feature Selection Method for Effective Dynamic Features

Naif Almusallam et al.

Summary: This paper proposes an efficient unsupervised feature selection method for streaming features applications, addressing the challenges of feature selection in dynamic features applications and achieving accurate determination of representative streaming features.

IEEE ACCESS (2021)

Article Computer Science, Theory & Methods

Fuzzy Rank Based Parallel Online Feature Selection Method using Multiple Sliding Windows

B. Venkatesh et al.

Summary: This paper proposes a parallel online feature selection method using multiple sliding windows and fuzzy fast-mRMR feature selection analysis, achieving significant results in selecting minimum redundant and maximum relevant features. It demonstrates outstanding performance on benchmark datasets with over 95% accuracy and overcomes existing drawbacks in online streaming feature selection methods.

OPEN COMPUTER SCIENCE (2021)

Article Computer Science, Artificial Intelligence

Adaptive Quick Reduct for Feature Drift Detection

Alessio Ferone et al.

Summary: Data streams are prevalent due to the widespread use of low-cost mobile devices, sensors, wireless networks, and the Internet of Things. This paper introduces a variation of the QuickReduct algorithm designed to handle data streams, which effectively addresses the issue of feature drift. Experiments conducted on five publicly available datasets with artificially injected drift have confirmed the effectiveness of the proposed method.

ALGORITHMS (2021)

Article Computer Science, Information Systems

Minimizing the Overlapping Degree to Improve Class-Imbalanced Learning Under Sparse Feature Selection: Application to Fraud Detection

El Barakaz Fatima et al.

Summary: Research focuses on the classification of class-imbalanced data, proposing three feature selection algorithms to improve classification performance. Experimental results show that the proposed algorithms manage the variation of false discovery rate during the selection of main features.

IEEE ACCESS (2021)

Review Computer Science, Artificial Intelligence

A review of unsupervised feature selection methods

Saul Solorio-Fernandez et al.

ARTIFICIAL INTELLIGENCE REVIEW (2020)

Article Computer Science, Artificial Intelligence

Ensemble feature selection for high-dimensional data: a stability analysis across multiple domains

Barbara Pes

NEURAL COMPUTING & APPLICATIONS (2020)

Article Computer Science, Artificial Intelligence

An intelligent clustering algorithm for high-dimensional multiview data in big data applications

Qian Tao et al.

NEUROCOMPUTING (2020)

Article Computer Science, Artificial Intelligence

Sparse feature selection: Relevance, redundancy and locality structure preserving guided by pairwise constraints

Zahir Noorie et al.

APPLIED SOFT COMPUTING (2020)

Article Computer Science, Interdisciplinary Applications

Benchmark for filter methods for feature selection in high-dimensional classification data

Andrea Bommert et al.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2020)

Article Computer Science, Artificial Intelligence

A multi-objective genetic algorithm for text feature selection using the relative discriminative criterion

Mahdieh Labani et al.

EXPERT SYSTEMS WITH APPLICATIONS (2020)

Article Computer Science, Theory & Methods

A general framework based on dynamic multi-objective evolutionary algorithms for handling feature drifts on data streams

Shaaban Sahmoud et al.

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE (2020)

Article Computer Science, Artificial Intelligence

Feature selection for multi-label learning with streaming label

Jinghua Liu et al.

NEUROCOMPUTING (2020)

Article Biology

A new feature selection algorithm based on relevance, redundancy and complementarity

Chao Li et al.

COMPUTERS IN BIOLOGY AND MEDICINE (2020)

Article Computer Science, Artificial Intelligence

Distributed Feature Selection for Big Data Using Fuzzy Rough Sets

Linghe Kong et al.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2020)

Article Telecommunications

Active feature acquisition on data streams under feature drift

Christian Beyer et al.

ANNALS OF TELECOMMUNICATIONS (2020)

Article Computer Science, Artificial Intelligence

Overview and comparative study of dimensionality reduction techniques for high dimensional data

Shaeela Ayesha et al.

INFORMATION FUSION (2020)

Review Computer Science, Artificial Intelligence

Adaptations of data mining methodologies: a systematic literature review

Veronika Plotnikova et al.

PEERJ COMPUTER SCIENCE (2020)

Article Computer Science, Artificial Intelligence

Embedded chaotic whale survival algorithm for filter-wrapper feature selection

Ritam Guha et al.

SOFT COMPUTING (2020)

Article Multidisciplinary Sciences

New Online Streaming Feature Selection Based on Neighborhood Rough Set for Medical Data

Dingfei Lei et al.

SYMMETRY-BASEL (2020)

Article Computer Science, Artificial Intelligence

Lessons learned from data stream classification applied to credit scoring

Jean Paul Barddal et al.

EXPERT SYSTEMS WITH APPLICATIONS (2020)

Article Computer Science, Artificial Intelligence

A systematic evaluation of filter Unsupervised Feature Selection methods

Saul Solorio-Fernandez et al.

EXPERT SYSTEMS WITH APPLICATIONS (2020)

Article Computer Science, Theory & Methods

A comprehensive survey of anomaly detection techniques for high dimensional big data

Srikanth Thudumu et al.

JOURNAL OF BIG DATA (2020)

Article Computer Science, Artificial Intelligence

Novel multi-label feature selection via label symmetric uncertainty correlation learning and feature redundancy evaluation

Jianhua Dai et al.

KNOWLEDGE-BASED SYSTEMS (2020)

Article Computer Science, Artificial Intelligence

Online Streaming Feature Selection via Multi-Conditional Independence and Mutual Information Entropy

Hongyi Wang et al.

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS (2020)

Article Computer Science, Information Systems

Malicious Text Identification: Deep Learning from Public Comments and Emails

Asma Baccouche et al.

INFORMATION (2020)

Article Computer Science, Information Systems

Markov Boundary Learning With Streaming Data for Supervised Classification

Chaofan Liu et al.

IEEE ACCESS (2020)

Article Computer Science, Information Systems

Improved Feature Selection Model for Big Data Analytics

Ibrahim M. El-Hasnony et al.

IEEE ACCESS (2020)

Article Computer Science, Artificial Intelligence

Multi-label feature selection based on information entropy fusion in multi-source decision system

Wenbin Qian et al.

EVOLUTIONARY INTELLIGENCE (2020)

Review Computer Science, Information Systems

Feature selection for text classification: A review

Xuelian Deng et al.

MULTIMEDIA TOOLS AND APPLICATIONS (2019)

Article Computer Science, Artificial Intelligence

Online streaming feature selection: a minimum redundancy, maximum significance approach

Mohammad Masoud Javidi et al.

PATTERN ANALYSIS AND APPLICATIONS (2019)

Article Computer Science, Artificial Intelligence

A ranking-based feature selection approach for handwritten character recognition

Nicole Dalia Cilia et al.

PATTERN RECOGNITION LETTERS (2019)

Article Computer Science, Information Systems

Robust clinical marker identification for diabetic kidney disease with ensemble feature selection

Xing Song et al.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2019)

Article Computer Science, Artificial Intelligence

OFS-Density: A novel online streaming feature selection method

Peng Zhou et al.

PATTERN RECOGNITION (2019)

Article Computer Science, Information Systems

Online streaming feature selection using adapted Neighborhood Rough Set

Peng Zhou et al.

INFORMATION SCIENCES (2019)

Article Computer Science, Artificial Intelligence

A label-specific multi-label feature selection algorithm based on the Pareto dominance concept

Shima Kashef et al.

PATTERN RECOGNITION (2019)

Article Biochemical Research Methods

FeatureSelect: a software for feature selection based on machine learning approaches

Yosef Masoudi-Sobhanzadeh et al.

BMC BIOINFORMATICS (2019)

Article Computer Science, Artificial Intelligence

Hybrid fast unsupervised feature selection for high-dimensional data

Zhaleh Manbari et al.

EXPERT SYSTEMS WITH APPLICATIONS (2019)

Article Biotechnology & Applied Microbiology

Frequency based feature selection method using whale algorithm

Hossein Nematzadeh et al.

GENOMICS (2019)

Review Computer Science, Artificial Intelligence

Ensembles for feature selection: A review and future trends

Veronica Bolon-Canedo et al.

INFORMATION FUSION (2019)

Article Computer Science, Artificial Intelligence

A new multi-objective wrapper method for feature selection - Accuracy and stability analysis for BCI

Jesus Gonzalez et al.

NEUROCOMPUTING (2019)

Article Computer Science, Information Systems

Feature Selection and Its Use in Big Data: Challenges, Methods, and Trends

Miao Rong et al.

IEEE ACCESS (2019)

Article Computer Science, Information Systems

Online Feature Selection for Streaming Features Using Self-Adaptation Sliding-Window Sampling

Dianlong You et al.

IEEE ACCESS (2019)

Article Computer Science, Information Systems

An Overview on Concepts Drift Learning

Adriana Sayuri Iwashita et al.

IEEE ACCESS (2019)

Article Computer Science, Information Systems

Addressing Feature Drift in Data Streams Using Iterative Subset Selection

Lanqin Yuan et al.

APPLIED COMPUTING REVIEW (2019)

Article Automation & Control Systems

An Embedded Feature Selection Method for Imbalanced Data Classification

Haoyue Liu et al.

IEEE-CAA JOURNAL OF AUTOMATICA SINICA (2019)

Article Computer Science, Information Systems

Swarm intelligent based online feature selection (OFS) and weighted entropy frequent pattern mining (WEFPM) algorithm for big data analysis

S. Gayathri Devi et al.

CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS (2019)

Article Computer Science, Artificial Intelligence

Streamwise feature selection: a rough set method

Mohammad Masoud Javidi et al.

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS (2018)

Article Computer Science, Artificial Intelligence

On the scalability of feature selection methods on high-dimensional data

V. Bolon-Canedo et al.

KNOWLEDGE AND INFORMATION SYSTEMS (2018)

Article Computer Science, Artificial Intelligence

OSFSMI: Online stream feature selection method based on mutual information

Maryam Rahmaninia et al.

APPLIED SOFT COMPUTING (2018)

Article Automation & Control Systems

A novel multivariate filter method for feature selection in text classification problems

Mahdieh Labani et al.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2018)

Article Computer Science, Artificial Intelligence

Ultra High-Dimensional Nonlinear Feature Selection for Big Biological Data

Makoto Yamada et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2018)

Article Biochemical Research Methods

An Organelle Correlation-Guided Feature Selection Approach for Classifying Multi-Label Subcellular Bio-lmages

Wei Shao et al.

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (2018)

Article Computer Science, Information Systems

Streaming feature-based causal structure learning algorithm with symmetrical uncertainty

Jing Yang et al.

INFORMATION SCIENCES (2018)

Article Computer Science, Artificial Intelligence

A novel intrusion detection system for wireless mesh network with hybrid feature selection technique based on GA and MI

R. Vijayanand et al.

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS (2018)

Article Computer Science, Artificial Intelligence

Online Multi-label Group Feature Selection

Jinghua Liu et al.

KNOWLEDGE-BASED SYSTEMS (2018)

Article Computer Science, Artificial Intelligence

Feature selection in machine learning: A new perspective

Jie Cai et al.

NEUROCOMPUTING (2018)

Article Computer Science, Artificial Intelligence

Online multi-label streaming feature selection based on neighborhood rough set

Jinghua Liu et al.

PATTERN RECOGNITION (2018)

Review Computer Science, Artificial Intelligence

Multilabel feature selection: A comprehensive review and guiding experiments

Shima Kashef et al.

WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY (2018)

Article Computer Science, Artificial Intelligence

Large-dimensionality small-instance set feature selection: A hybrid bio-inspired heuristic approach

Hossam M. Zawbaa et al.

SWARM AND EVOLUTIONARY COMPUTATION (2018)

Article Computer Science, Information Systems

QER: a new feature selection method for sentiment analysis

Tuba Parlar et al.

HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES (2018)

Article Computer Science, Artificial Intelligence

Real-time feature selection technique with concept drift detection using adaptive micro-clusters for data stream mining

Mahmood Shakir Hammoodi et al.

KNOWLEDGE-BASED SYSTEMS (2018)

Article

A new online field feature selection algorithm based on streaming data

Zhenjiang Zhang et al.

Journal of Ambient Intelligence and Humanized Computing (2018)

Article Chemistry, Multidisciplinary

Online Streaming Feature Selection via Conditional Independence

Dianlong You et al.

APPLIED SCIENCES-BASEL (2018)

Article Computer Science, Artificial Intelligence

Text feature selection with a robust weight scheme and dynamic dimension reduction to text document clustering

Laith Mohammad Abualigah et al.

EXPERT SYSTEMS WITH APPLICATIONS (2017)

Article Computer Science, Artificial Intelligence

Challenges of Feature Selection for Big Data Analytics

Jundong Li et al.

IEEE INTELLIGENT SYSTEMS (2017)

Article Computer Science, Artificial Intelligence

Streaming Feature Selection for Multilabel Learning Based on Fuzzy Mutual Information

Yaojin Lin et al.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2017)

Article Computer Science, Artificial Intelligence

KEEL 3.0: An Open Source Software for Multi-Stage Analysis in Data Mining

Isaac Triguero et al.

International Journal of Computational Intelligence Systems (2017)

Article Computer Science, Software Engineering

A survey on feature drift adaptation: Definition, benchmark, challenges and future directions

Jean Paul Barddal et al.

JOURNAL OF SYSTEMS AND SOFTWARE (2017)

Review Computer Science, Artificial Intelligence

Recent advances in feature selection and its applications

Yun Li et al.

KNOWLEDGE AND INFORMATION SYSTEMS (2017)

Article Computer Science, Artificial Intelligence

Centralized vs. distributed feature selection methods based on data complexity measures

L. Moran-Fernandez et al.

KNOWLEDGE-BASED SYSTEMS (2017)

Article Computer Science, Artificial Intelligence

Online feature selection for high-dimensional class-imbalanced data

Peng Zhou et al.

KNOWLEDGE-BASED SYSTEMS (2017)

Article Computer Science, Artificial Intelligence

A survey on data preprocessing for data stream mining: Current status and future directions

Sergio Ramirez-Gallego et al.

NEUROCOMPUTING (2017)

Article Computer Science, Information Systems

Large-Scale Online Feature Selection for Ultra-High Dimensional Sparse Data

Yue Wu et al.

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2017)

Article Computer Science, Artificial Intelligence

A Social-aware online short-text feature selection technique for social media

Antonela Tommasel et al.

Information Fusion (2017)

Article Business

Big data: Dimensions, evolution, impacts, and challenges

In Lee

BUSINESS HORIZONS (2017)

Article Computer Science, Artificial Intelligence

Online streaming feature selection using rough sets

S. Eskandari et al.

INTERNATIONAL JOURNAL OF APPROXIMATE REASONING (2016)

Article Computer Science, Artificial Intelligence

LOFS: A library of online streaming feature selection

Kui Yu et al.

KNOWLEDGE-BASED SYSTEMS (2016)

Article Computer Science, Artificial Intelligence

Feature subset selection based on fuzzy neighborhood rough sets

Changzhong Wang et al.

KNOWLEDGE-BASED SYSTEMS (2016)

Review Computer Science, Artificial Intelligence

A systematic review of multi-label feature selection and a new method based on label construction

Newton Spolaor et al.

NEUROCOMPUTING (2016)

Article Computer Science, Artificial Intelligence

Feature Selection via Global Redundancy Minimization

De Wang et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2015)

Article Computer Science, Artificial Intelligence

Online Feature Selection with Group Structure Analysis

Jing Wang et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2015)

Article Computer Science, Artificial Intelligence

The Emerging Big Dimensionality

Yiteng Zhai et al.

IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE (2014)

Article Computer Science, Artificial Intelligence

Online Feature Selection and Its Applications

Jialei Wang et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2014)

Review Computer Science, Artificial Intelligence

A review of feature selection methods on synthetic data

Veronica Bolon-Canedo et al.

KNOWLEDGE AND INFORMATION SYSTEMS (2013)

Article Computer Science, Hardware & Architecture

Exploring Causal Relationships with Streaming Features

Kui Yu et al.

COMPUTER JOURNAL (2012)

Article Computer Science, Artificial Intelligence

Fast feature selection aimed at high-dimensional data via hybrid-sequential-ranked searches

R. Ruiz et al.

EXPERT SYSTEMS WITH APPLICATIONS (2012)

Article Computer Science, Artificial Intelligence

Neighborhood Rough Sets for Dynamic Data Mining

Junbo Zhang et al.

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS (2012)

Review Computer Science, Artificial Intelligence

Advances in data stream mining

Mohamed Medhat Gaber

WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY (2012)