☆ 4.5 Article

Cross lingual opinion holder extraction based on multi-kernel SVMs and transfer learning

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS (2015)

期刊

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS

卷 18, 期 2, 页码 299-316

出版社

SPRINGER

DOI: 10.1007/s11280-013-0246-0

关键词

Opinion holder extraction; Cross lingual; Multi-kernel SVMs; Transfer learning

类别

Computer Science, Information Systems Computer Science, Software Engineering

资金

MOE Specialized Research Fund for the Doctoral Program of Higher Education [20122302120070]
National Natural Science Foundation of China [61203378]
Open Projects Program of National Laboratory of Pattern Recognition Shenzhen Foundational Research Funding [JCYJ20120613152557576]
Shenzhen International Cooperation Research Funding [GJHZ20120613110641217]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Fine grained opinion analysis has much higher demand for annotated corpus which makes high quality analysis difficult when there are insufficient resources. In this paper we explore the use of cross lingual resources for opinion mining for resource poor languages. This paper presents a novel approach for cross lingual opinion holder extraction through leveraging finely annotated opinion corpus selectively from a source language as the supplementary training samples for the target language. Firstly, the opinion corpus in the source language with fine grained annotations are translated and projected to the target language to generate the training samples. Then, a classifier based on multi-kernel Support Vector Machines (SVMs) is developed to identify opinion holders in the target language, which uses a tree kernel based on syntactic features and a polynomial kernel based on semantic features, respectively. The two kernels are further improved by incorporating a pivot function based on word pair similarity. To reduce the noise of low quality translated samples, a Transfer learning algorithm is applied to select high quality translated samples iteratively for training the multi-kernel classifiers on the target language. Evaluations on transferring MPQA, an English opinion corpus (as the source language), to Chinese opinion analysis (as the target language) show that the opinion holder extraction performance on NTCIR-7 MOAT dataset is improved, which is higher than the Conditional Random Fields (CRFs) based approach and most reported systems in NTCIR-7 MOAT evaluation.

Cross lingual opinion holder extraction based on multi-kernel SVMs and transfer learning

期刊

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Cross lingual opinion holder extraction based on multi-kernel SVMs and transfer learning

期刊

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文