Article

Enhancing diversity and coverage of document summaries through subspace clustering and clustering-based optimization

Journal

INFORMATION SCIENCES
Volume 279, Pages 764-775

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2014.04.028

Keywords

Document summarization; Information diversity; Information coverage; Subspace clustering

Funding

  1. National Natural Science Foundation of China - China [61272291, 61202188, 61303125, 61303226]
  2. Central Universities under Grant - China [Z109021108, Z109021109]

Abstract

Sentence clustering has been successfully applied in document summarization to discover the topics conveyed in a collection of documents. However, existing clustering-based summarization approaches seldom target both the diversity and the coverage of summaries, which are believed to be the two key factors that determine summary quality. This work explores a systematic approach that allows diversity and coverage to be tackled within an integrated clustering-based summarization framework. Because each topic can normally be described by a set of keywords, and the choice of keywords is topic-dependent, we take advantage of recently emerged subspace clustering to allow flexible keyword selection and to improve the quality of sentence clustering. On this basis, we develop two clustering-based optimization strategies, namely local optimization and global optimization, to pursue these goals. Experimental results on the DUC datasets demonstrate the effectiveness and robustness of the proposed approach. © 2014 Elsevier Inc. All rights reserved.
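
To make the trade-off between coverage and diversity concrete, the following is a minimal illustrative sketch, not the algorithm from the paper. It assumes the sentences have already been assigned to topic clusters by some clustering step (subspace clustering itself is not implemented here), and it uses a simple greedy rule as a stand-in for the paper's local and global optimization strategies. The names `cosine`, `select_summary`, and `redundancy_weight` are hypothetical and chosen only for this example.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_summary(sentences, labels, budget, redundancy_weight=0.5):
    """Greedily pick up to `budget` sentences (assumed heuristic, not the paper's method).

    Coverage: reward sentences close to their cluster centroid, with a bonus
    for clusters that are not yet represented in the summary.
    Diversity: penalize similarity to sentences that were already chosen.
    """
    vecs = [Counter(s.lower().split()) for s in sentences]

    # Build one term-count centroid per cluster label.
    centroids = {}
    for vec, lab in zip(vecs, labels):
        centroids.setdefault(lab, Counter()).update(vec)

    chosen, covered = [], set()
    while len(chosen) < budget:
        best, best_score = None, float("-inf")
        for i, (vec, lab) in enumerate(zip(vecs, labels)):
            if i in chosen:
                continue
            coverage = cosine(vec, centroids[lab])
            if lab not in covered:
                coverage += 1.0  # bonus for covering a new topic
            redundancy = max((cosine(vec, vecs[j]) for j in chosen), default=0.0)
            score = coverage - redundancy_weight * redundancy
            if score > best_score:
                best, best_score = i, score
        if best is None:  # no sentences left to choose from
            break
        chosen.append(best)
        covered.add(labels[best])
    return [sentences[i] for i in chosen]


# Toy input: two topics with two near-duplicate sentences each.
sentences = [
    "the cat sat on the mat",
    "the cat sat on a mat",
    "stocks fell sharply today",
    "stocks dropped today",
]
labels = [0, 0, 1, 1]
summary = select_summary(sentences, labels, budget=2)
```

With a budget of two, this heuristic picks one sentence from each toy topic, because the new-topic bonus outweighs any candidate from a cluster that is already represented. The weighting between the coverage and redundancy terms is an arbitrary choice for illustration.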
