☆ 3.9 Article

Extraction of features from clinical routine data using text mining

OPHTHALMOLOGE (2021)

期刊

OPHTHALMOLOGE

卷 118, 期 3, 页码 264-272

出版社

SPRINGER HEIDELBERG

DOI: 10.1007/s00347-020-01177-4

关键词

Macular degeneration; Natural language processing; Systematized nomenclature of medicine; Electronic health records; Decision support systems

类别

Ophthalmology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study utilized text mining to extract information from clinical texts and found that the extraction performance was good for certain data such as visual acuity, intraocular pressure, and accompanying diagnoses, showing potential for future applications. However, manual adjustments are still necessary to improve accuracy.

Background Anti-VEGF drugs are currently used to treat macular diseases. This has led to a wealth of additional data, which could help understand and predict treatment courses; however, this information is usually only available in free text form. Objective A retrospective study was designed to analyze how far interpretable information can be obtained from clinical texts by automated extraction. The aim was to assess the suitability of a text mining method that was customized for this purpose. Material and methods Data on 3683 patients were available, including 40,485 discharge letters. Some of the data of interest, e.g. visual acuity (VA), intraocular pressure (IOP) and accompanying diagnoses, were not only recorded textually but also entered in a database and could thus serve as a gold standard for text analysis. The text was analyzed using the Averbis Health Discovery text mining platform. To optimize the extraction task, rule knowledge and a German language technical vocabulary linked to the international medical terminology standard systematized nomenclature of medicine (SNOMED CT) was manually added. Results The correspondence between extracted data and the structured database entries is described by the F1 value. There was agreement of 94.7% for VA, 98.3% for IOP and 94.7% for the accompanying diagnoses. Manual analysis of noncorresponding cases showed that in 50% text content did not match the database content for various reasons. After an adjustment, F1 values 1-3% above the previously determined values were obtained. Conclusion Text mining procedures are very well suited for the considered discharge letter corpus and the problem described in order to extract contents from clinical texts in a structured manner for further evaluation.

Extraction of features from clinical routine data using text mining

期刊

OPHTHALMOLOGE

出版社

SPRINGER HEIDELBERG

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Extraction of features from clinical routine data using text mining

期刊

OPHTHALMOLOGE

出版社

SPRINGER HEIDELBERG

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文