Journal
INFORMATION PROCESSING & MANAGEMENT
Volume 56, Issue 1, Pages 167-191Publisher
ELSEVIER SCI LTD
DOI: 10.1016/j.ipm.2018.09.004
Keywords
Text classification; Feature selection; Hebb rule
Funding
- National Sciences Foundation [71731006]
Ask authors/readers for more resources
Text documents usually contain high dimensional non-discriminative (irrelevant and noisy) terms which lead to steep computational costs and poor learning performance of text classification. One of the effective solutions for this problem is feature selection which aims to identify discriminative terms from text data. This paper proposes a method termed Hebb rule based feature selection (HRFS). HRFS is based on supervised Hebb rule and assumes that terms and classes are neurons and select terms under the assumption that a term is discriminative if it keeps exciting the corresponding classes. This assumption can be explained as a term is highly correlated with a class if it is able to keep exciting the class according to the original Hebb postulate. Six benchmarking datasets are used to compare HRFS with other seven feature selection methods. Experimental results indicate that HRFS is effective to achieve better performance than the compared methods. HRFS can identify discriminative terms in the view of synapse between neurons. Moreover, HRFS is also efficient because it can be described in the view of matrix operation to decrease complexity of feature selection.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available