☆ 4.7 Article

Applying regression models to query-focused multi-document summarization

INFORMATION PROCESSING & MANAGEMENT (2011)

Journal

INFORMATION PROCESSING & MANAGEMENT

Volume 47, Issue 2, Pages 227-237

Publisher

ELSEVIER SCI LTD

DOI: 10.1016/j.ipm.2010.03.005

Keywords

Query-focused summarization; Support Vector Regression; Training data construction

Funding

Hong Kong RGC [PolyU5211/05E, PolyU5217/07E]
NSFC [60603093, 60875042]
973 National Basic Research Program of China [2004CB318102]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Most existing research on applying machine learning techniques to document summarization explores either classification models or learning-to-rank models. This paper presents our recent study on how to apply a different kind of learning models, namely regression models, to query-focused multi-document summarization. We choose to use Support Vector Regression (SVR) to estimate the importance of a sentence in a document set to be summarized through a set of pre-defined features. In order to learn the regression models, we propose several methods to construct the pseudo training data by assigning each sentence with a nearly true importance score calculated with the human summaries that have been provided for the corresponding document set. A series of evaluations on the DUC data sets are conducted to examine the efficiency and the robustness of the proposed approaches. When compared with classification models and ranking models, regression models are consistently preferable. (C) 2010 Elsevier Ltd. All rights reserved.

Applying regression models to query-focused multi-document summarization

Journal

INFORMATION PROCESSING & MANAGEMENT

Publisher

ELSEVIER SCI LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Applying regression models to query-focused multi-document summarization

Journal

INFORMATION PROCESSING & MANAGEMENT

Publisher

ELSEVIER SCI LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper