期刊
APPLIED SCIENCES-BASEL
卷 12, 期 6, 页码 -出版社
MDPI
DOI: 10.3390/app12062952
关键词
topic segmentation; debate; visual analytics; conceptual recurrence plot; natural language processing; text mining
类别
资金
- National Research Foundation of Korea - Ministry of Education [NRF5199991014091]
- National Research Foundation of Korea (NRF) - Ministry of Science and ICT [NRF-2020R1F1A1075605]
This paper proposes a topic segmentation model, CSseg, based on conceptual recurrence and debate consistency metrics. It investigates the relationship between conceptual similarity and topic segmentation. CSseg segments transcripts using similarity cohesion methods and weights based on conceptual similarities and debate consistency metrics, providing user-driven topic segmentation. The prototype of CSseg was implemented and compared with a previous model, showing better performance in debates.
We propose a topic segmentation model, CSseg (Conceptual Similarity-segmenter), for debates based on conceptual recurrence and debate consistency metrics. We research whether the conceptual similarity of conceptual recurrence and debate consistency metrics relate to topic segmentation. Conceptual similarity is a similarity between utterances in conceptual recurrence analysis, and debate consistency metrics represent the internal coherence properties that maintain the debate topic in interactions between participants. Based on the research question, CSseg segments transcripts by applying similarity cohesion methods based on conceptual similarities; the topic segmentation is affected by applying weights to conceptual similarities having debate internal consistency properties, including other-continuity, self-continuity, chains of arguments and counterarguments, and the topic guide of moderator. CSseg provides a user-driven topic segmentation by allowing the user to adjust the weights of the similarity cohesion methods and debate consistency metrics. It takes an approach that alleviates the problem whereby each person judges the topic segments differently in debates and multi-party discourse. We implemented the prototype of CSseg by utilizing the Korean TV debate program MBC 100-Minute Debate and analyzed the results by use cases. We compared CSseg and a previous model LCseg (Lexical Cohesion-segmenter) with the evaluation metrics Pk and WD. CSseg had greater performance than LCseg in debates.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据