4.3 Article

Optimal dynamic treatment regime estimation using information extraction from unstructured clinical text

期刊

BIOMETRICAL JOURNAL
卷 64, 期 4, 页码 805-817

出版社

WILEY
DOI: 10.1002/bimj.202100077

关键词

causal inference; clinical decision making; electronic health record; precision medicine; text mining

资金

  1. National Science Foundation [1416953, 1636840, 1734853, 1916425]
  2. National Institutes ofHealth [R01CA233487, R01MH121079, R01MH126137, T32GM141746, UL1TR002240]
  3. Direct For Computer & Info Scie & Enginr [1636840] Funding Source: National Science Foundation
  4. Direct For Education and Human Resources [1416953] Funding Source: National Science Foundation
  5. Direct For Social, Behav & Economic Scie
  6. Division Of Behavioral and Cognitive Sci [1734853] Funding Source: National Science Foundation
  7. Division Of Undergraduate Education [1416953] Funding Source: National Science Foundation
  8. Office of Advanced Cyberinfrastructure (OAC) [1636840] Funding Source: National Science Foundation
  9. Office of Advanced Cyberinfrastructure (OAC)
  10. Direct For Computer & Info Scie & Enginr [1916425] Funding Source: National Science Foundation

向作者/读者索取更多资源

The wide-scale adoption of electronic health records provides extensive information for precision medicine and personalized healthcare. By leveraging free-text clinical information extraction techniques, optimal dynamic treatment regimes can be estimated, allowing for individualized treatments based on patient characteristics and treatment history.
The wide-scale adoption of electronic health records (EHRs) provides extensive information to support precision medicine and personalized health care. In addition to structured EHRs, we leverage free-text clinical information extraction (IE) techniques to estimate optimal dynamic treatment regimes (DTRs), a sequence of decision rules that dictate how to individualize treatments to patients based on treatment and covariate history. The proposed IE of patient characteristics closely resembles The clinical Text Analysis and Knowledge Extraction System and employs named entity recognition, boundary detection, and negation annotation. It also utilizes regular expressions to extract numerical information. Combining the proposed IE with optimal DTR estimation, we extract derived patient characteristics and use tree-based reinforcement learning (T-RL) to estimate multistage optimal DTRs. IE significantly improved the estimation in counterfactual outcome models compared to using structured EHR data alone, which often include incomplete data, data entry errors, and other potentially unobserved risk factors. Moreover, including IE in optimal DTR estimation provides larger study cohorts and a broader pool of candidate tailoring variables. We demonstrate the performance of our proposed method via simulations and an application using clinical records to guide blood pressure control treatments among critically ill patients with severe acute hypertension. This joint estimation approach improves the accuracy of identifying the optimal treatment sequence by 14-24% compared to traditional inference without using IE, based on our simulations over various scenarios. In the blood pressure control application, we successfully extracted significant blood pressure predictors that are unobserved or partially missing from structured EHR.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据