4.7 Article

Supervised sentiment analysis in Czech social media

期刊

INFORMATION PROCESSING & MANAGEMENT
卷 50, 期 5, 页码 693-707

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.ipm.2014.05.001

关键词

Sentiment analysis; Czech language; Social media; Machine learning; Feature selection

资金

  1. Advanced computing and information systems [SGS-2013-029]
  2. University of West Bohemia
  3. European Regional Development Fund (ERDF), project NTIS - New Technologies for Information Society, European Center of Excellence [CZ.1.05/1.1.00/02.0090]
  4. programme Projects of Large Infrastructure for Research, Development, and Innovations [LM2010005]
  5. programme Center CERIT Scientific Cloud, part of the Operational Program Research and Development for Innovations [CZ. 1.05/3.2.00/08.0144]

向作者/读者索取更多资源

This article describes in-depth research on machine learning methods for sentiment analysis of Czech social media. Whereas in English, Chinese, or Spanish this field has a long history and evaluation datasets for various domains are widely available, in the case of the Czech language no systematic research has yet been conducted. We tackle this issue and establish a common ground for further research by providing a large human-annotated Czech social media corpus. Furthermore, we evaluate state-of-the-art supervised machine learning methods for sentiment analysis. We explore different pre-processing techniques and employ various features and classifiers. We also experiment with five different feature selection algorithms and investigate the influence of named entity recognition and preprocessing on sentiment classification performance. Moreover, in addition to our newly created social media dataset, we also report results for other popular domains, such as movie and product reviews. We believe that this article will not only extend the current sentiment analysis research to another family of languages, but will also encourage competition, potentially leading to the production of high-end commercial solutions. (C) 2014 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据