4.6 Article

Portability of an algorithm to identify rheumatoid arthritis in electronic health records

期刊

出版社

BMJ PUBLISHING GROUP
DOI: 10.1136/amiajnl-2011-000583

关键词

-

资金

  1. Pharmacogenomics Research Network (PGRN) [U01-GM092691]
  2. National Library of Medicine (NLM) [U54-LM008748, R01-LM010685, 5T15LM007450-10]
  3. NIH [U01-GM092691, R01-AR057108, R01-AR056768, R01-AR059648, K08-AR060257, R01-GM079330]
  4. Burroughs Wellcome Fund
  5. NSF [DMS-0854970]
  6. National Center for Research Resources [UL1RR025741]
  7. Vanderbilt CTSA grant from the National Center for Research Resources [1 UL1 RR024975]

向作者/读者索取更多资源

Objectives Electronic health records (EHR) can allow for the generation of large cohorts of individuals with given diseases for clinical and genomic research. A rate-limiting step is the development of electronic phenotype selection algorithms to find such cohorts. This study evaluated the portability of a published phenotype algorithm to identify rheumatoid arthritis (RA) patients from EHR records at three institutions with different EHR systems. Materials and Methods Physicians reviewed charts from three institutions to identify patients with RA. Each institution compiled attributes from various sources in the EHR, including codified data and clinical narratives, which were searched using one of two natural language processing (NLP) systems. The performance of the published model was compared with locally retrained models. Results Applying the previously published model from Partners Healthcare to datasets from Northwestern and Vanderbilt Universities, the area under the receiver operating characteristic curve was found to be 92% for Northwestern and 95% for Vanderbilt, compared with 97% at Partners. Retraining the model improved the average sensitivity at a specificity of 97% to 72% from the original 65%. Both the original logistic regression models and locally retrained models were superior to simple billing code count thresholds. Discussion These results show that a previously published algorithm for RA is portable to two external hospitals using different EHR systems, different NLP systems, and different target NLP vocabularies. Retraining the algorithm primarily increased the sensitivity at each site. Conclusion Electronic phenotype algorithms allow rapid identification of case populations in multiple sites with little retraining.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据