4.7 Article

A feature selection method based on improved fisher's discriminant ratio for text sentiment classification

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 38, 期 7, 页码 8696-8702

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2011.01.077

关键词

Fisher's discriminant ratio; Feature selection; Text sentiment classification; Support vector machine

资金

  1. National Natural Science Foundation [60875040, 60970014]
  2. Ministry of Education of China [200801080006]
  3. Natural Science Foundation of Shanxi Province [2010011021-1]
  4. Shanxi Foundation of Tackling Key Problem in Science and Technology [051129]
  5. Office of Taiyuan City [09121001]

向作者/读者索取更多资源

Owing to its openness, virtualization and sharing criterion, the Internet has been rapidly becoming a platform for people to express their opinion, attitude, feeling and emotion. As the subjectivity texts are often too many for people to go through, how to automatically classify them into different sentiment orientation categories (e.g. positive/negative) has become an important research problem. In this paper, based on Fisher's discriminant ratio, an effective feature selection method is proposed for subjectivity text sentiment classification. In order to validate the proposed method, we compared it with the method based on Information Gain while Support Vector Machine is adopted as the classifier. Two experiments are conducted by combining different feature selection methods with two kinds of candidate feature sets. Under 2739 subjectivity documents of COAE2008s and 1006 car-related subjectivity documents, the experimental results indicate that the Fisher's discriminant ratio based on word frequency estimation has the best performance respectively with accuracy 86.61% and 82.80% under two corpus while the candidate features are the words which appear in both positive and negative texts. (C) 2011 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据