4.6 Article

Using Conceptual Recurrence and Consistency Metrics for Topic Segmentation in Debate

期刊

APPLIED SCIENCES-BASEL
卷 12, 期 6, 页码 -

出版社

MDPI
DOI: 10.3390/app12062952

关键词

topic segmentation; debate; visual analytics; conceptual recurrence plot; natural language processing; text mining

资金

  1. National Research Foundation of Korea - Ministry of Education [NRF5199991014091]
  2. National Research Foundation of Korea (NRF) - Ministry of Science and ICT [NRF-2020R1F1A1075605]

向作者/读者索取更多资源

This paper proposes a topic segmentation model, CSseg, based on conceptual recurrence and debate consistency metrics. It investigates the relationship between conceptual similarity and topic segmentation. CSseg segments transcripts using similarity cohesion methods and weights based on conceptual similarities and debate consistency metrics, providing user-driven topic segmentation. The prototype of CSseg was implemented and compared with a previous model, showing better performance in debates.
We propose a topic segmentation model, CSseg (Conceptual Similarity-segmenter), for debates based on conceptual recurrence and debate consistency metrics. We research whether the conceptual similarity of conceptual recurrence and debate consistency metrics relate to topic segmentation. Conceptual similarity is a similarity between utterances in conceptual recurrence analysis, and debate consistency metrics represent the internal coherence properties that maintain the debate topic in interactions between participants. Based on the research question, CSseg segments transcripts by applying similarity cohesion methods based on conceptual similarities; the topic segmentation is affected by applying weights to conceptual similarities having debate internal consistency properties, including other-continuity, self-continuity, chains of arguments and counterarguments, and the topic guide of moderator. CSseg provides a user-driven topic segmentation by allowing the user to adjust the weights of the similarity cohesion methods and debate consistency metrics. It takes an approach that alleviates the problem whereby each person judges the topic segments differently in debates and multi-party discourse. We implemented the prototype of CSseg by utilizing the Korean TV debate program MBC 100-Minute Debate and analyzed the results by use cases. We compared CSseg and a previous model LCseg (Lexical Cohesion-segmenter) with the evaluation metrics Pk and WD. CSseg had greater performance than LCseg in debates.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据