☆ 4.7 Article

A novel probabilistic feature selection method for text classification

KNOWLEDGE-BASED SYSTEMS (2012)

Journal

KNOWLEDGE-BASED SYSTEMS

Volume 36, Issue -, Pages 226-235

Publisher

ELSEVIER

DOI: 10.1016/j.knosys.2012.06.005

Keywords

Feature selection; Filter; Pattern recognition; Text classification; Dimension reduction

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

High dimensionality of the feature space is one of the most important concerns in text classification problems due to processing time and accuracy considerations. Selection of distinctive features is therefore essential for text classification. This study proposes a novel filter based probabilistic feature selection method, namely distinguishing feature selector (DFS), for text classification. The proposed method is compared with well-known filter approaches including chi square, information gain, Gini index and deviation from Poisson distribution. The comparison is carried out for different datasets, classification algorithms, and success measures. Experimental results explicitly indicate that DFS offers a competitive performance with respect to the abovementioned approaches in terms of classification accuracy, dimension reduction rate and processing time. (C) 2012 Elsevier B.V. All rights reserved.

A novel probabilistic feature selection method for text classification

Journal

KNOWLEDGE-BASED SYSTEMS

Publisher

ELSEVIER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

A novel probabilistic feature selection method for text classification

Journal

KNOWLEDGE-BASED SYSTEMS

Publisher

ELSEVIER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper