Journal
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING
Volume 12, Issue 10, Pages 8975-8990
Publisher
SPRINGER HEIDELBERG
DOI: 10.1007/s12652-020-02591-x
Keywords
Text summarization; Extractive summarization; Graph-based; Topic-based; Similarity measure
This paper proposes a new graph-based summarization technique that considers not only the similarity among individual sentences but also the similarity between sentences and the overall document topic. The weight assigned to each edge of the graph reflects both the similarity between the nodes it connects and its similarity to the topics of the overall document. By incorporating a semantic measure of node similarity, the proposed method shows a significant improvement in summary quality over existing text summarization techniques.
In graph-based extractive text summarization, the weights assigned to the edges of the graph are the crucial parameter for sentence ranking. These weights are based on the similarity between sentences (nodes), and most graph-based techniques assign them using a similarity measure based on common words. In this paper, we propose a new graph-based summarization technique that, besides the similarity among individual sentences, also considers the similarity between the sentences and the overall (input) document. When assigning weights to the edges of the graph, we consider two attributes. The first is the similarity among the nodes, which forms the edges of the graph. The second is a component that represents how similar a particular edge is to the topics of the overall document, which we obtain through topic modeling. In addition, we use a semantic measure to compute the similarity among the nodes. The evaluation results show that the proposed method significantly improves summary quality over existing text summarization techniques.
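The two-attribute edge weighting described above can be illustrated with a minimal sketch. It is not the authors' implementation, and several pieces are assumptions made only for illustration: bag-of-words cosine similarity stands in for the semantic measure, the term-frequency vector of the whole document stands in for the topic model, the two attributes are combined linearly with a hypothetical mixing parameter `alpha`, and sentences are ranked with a weighted PageRank. The function names (`edge_weights`, `rank`, `summarize`) are likewise hypothetical.

```python
import math
import re
from collections import Counter


def bow(text):
    """Lowercased bag-of-words count vector for a piece of text."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def edge_weights(sentences, alpha=0.5):
    """Assumed form: alpha * sentence similarity + (1 - alpha) * topic affinity."""
    vecs = [bow(s) for s in sentences]
    doc = bow(" ".join(sentences))  # assumption: stand-in for a topic model
    topic = [cosine(v, doc) for v in vecs]
    n = len(sentences)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            sim = cosine(vecs[i], vecs[j])
            if sim > 0:  # assumption: an edge exists only between similar sentences
                affinity = (topic[i] + topic[j]) / 2  # edge-level topic affinity
                w[i][j] = w[j][i] = alpha * sim + (1 - alpha) * affinity
    return w


def rank(w, damping=0.85, iters=50):
    """Weighted PageRank by power iteration; isolated nodes keep the base score."""
    n = len(w)
    out = [sum(row) for row in w]
    scores = [1.0 / n] * n
    for _ in range(iters):
        scores = [
            (1 - damping) / n
            + damping
            * sum(w[j][i] / out[j] * scores[j] for j in range(n) if out[j] > 0)
            for i in range(n)
        ]
    return scores


def summarize(sentences, k=2, alpha=0.5):
    """Return the k highest-ranked sentences in their original order."""
    scores = rank(edge_weights(sentences, alpha))
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(sentences, k=2)` on a list of sentence strings returns the two top-ranked sentences in document order. Setting `alpha=1.0` reduces the weighting to plain sentence-to-sentence similarity, which makes it easy to see how much the topic component changes the ranking under these assumptions.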