☆ 4.6 Article

Data mining information from electronic health records produced high yield and accuracy for current smoking status

JOURNAL OF CLINICAL EPIDEMIOLOGY (2020)

期刊

JOURNAL OF CLINICAL EPIDEMIOLOGY

卷 118, 期 -, 页码 100-106

出版社

ELSEVIER SCIENCE INC

DOI: 10.1016/j.jclinepi.2019.11.006

关键词

Data mining; Electronic health records; Routine clinical data; Learning healthcare system; Data quality; Text mining

类别

Health Care Sciences & Services Public, Environmental & Occupational Health

资金

UMC Utrecht
Netherlands Organisation for Health Research and Development [8480-34001]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Objectives: Researchers are increasingly using routine clinical data for care evaluations and feedback to patients and clinicians. The quality of these evaluations depends on the quality and completeness of the input data. Study Design and Setting: We assessed the performance of an electronic health record (EHR)-based data mining algorithm, using the example of the smoking status in a cardiovascular population. As a reference standard, we used the questionnaire from the Utrecht Cardiovascular Cohort (UCC). To assess diagnostic accuracy, we calculated sensitivity, specificity, negative predictive value (NPV), and positive predictive value (PPV). Results: We analyzed 1,661 patients included in the UCC to January 18, 2019. Of those, 14% (n = 238) had missing information on smoking status in the UCC questionnaire. Data mining provided information on smoking status in 99% of the 1,661 participants. Diagnostic accuracy for current smoking was sensitivity 88%, specificity 92%, NPV 98%, and PPV 63%. From false positives, 85% reported they had quit smoking at the time of the UCC. Conclusion: Data mining showed great potential in retrieving information on smoking (a near complete yield). Its diagnostic performance is good for negative smoking statuses. The implications of misclassihcation with data mining are dependent on the application of the data. (C) 2019 The Authors. Published by Elsevier Inc.

Data mining information from electronic health records produced high yield and accuracy for current smoking status

期刊

JOURNAL OF CLINICAL EPIDEMIOLOGY

出版社

ELSEVIER SCIENCE INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Data mining information from electronic health records produced high yield and accuracy for current smoking status

期刊

JOURNAL OF CLINICAL EPIDEMIOLOGY

出版社

ELSEVIER SCIENCE INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文