☆ 4.6 Article

Time expression recognition and normalization: a survey

ARTIFICIAL INTELLIGENCE REVIEW (2023)

期刊

ARTIFICIAL INTELLIGENCE REVIEW

卷 56, 期 9, 页码 9115-9140

出版社

SPRINGER

DOI: 10.1007/s10462-023-10400-y

关键词

Information extraction; Time expressions; Rule-based methods; Machine-learning methods; Deep-learning methods

类别

Computer Science, Artificial Intelligence

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Time information is crucial in the fields of data mining, information retrieval, and natural language processing. Time expression recognition and normalization (TERN) serves as a fundamental task for other linguistic tasks. This survey reviews previous research, provides an overview of time expression analysis development, and explores the role of time expressions in different domains. Three methods for TERN development are discussed: rule-based, traditional machine-learning, and deep-learning. Additionally, useful datasets, software, and potential future research directions are outlined.

Time information plays an important role in the areas of data mining, information retrieval, and natural language processing. Among the linguistic tasks related to time expressions, time expression recognition and normalization (TERN) is fundamental for other downstream tasks. Researchers from these areas have devoted considerable effort in the last two decades to define the problem of time expression analysis, design the standards for time expression annotation, build annotated corpora for time expressions, and develop methods to identify time expressions from free text. While there are some surveys concerned with the development of time information extraction, retrieval, and reasoning, to the best of our knowledge, there is no survey focusing on the TERN development. We fill in this blank. In this survey, we review previous researches, aiming to draw an overview of the development of time expression analysis and discuss the role that time expressions play in different areas. We focus on the task of recognizing and normalizing time expressions from free text and investigate three kinds of methods that researchers develop for TERN, namely rule-based methods, traditional machine-learning methods, and deep-learning methods. We will also discuss some factors about TERN development, including TIMEX type factor, language factor, and domain and textual factors. After that, we list some useful datasets and softwares for both tasks of TER and TEN as well as TERN and finally outline some potential directions of future research. We hope that this survey can help those researchers who are interested in TERN quickly gain a comprehensive understanding of the development of TERN and its potential research directions.

Time expression recognition and normalization: a survey

期刊

ARTIFICIAL INTELLIGENCE REVIEW

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Time expression recognition and normalization: a survey

期刊

ARTIFICIAL INTELLIGENCE REVIEW

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文