4.7 Article Data Paper

Columbia Open Health Data, clinical concept prevalence and co-occurrence from electronic health records

期刊

SCIENTIFIC DATA
卷 5, 期 -, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/sdata.2018.273

关键词

-

资金

  1. NCATS [OT3TR002027]
  2. NLM [R01LM009886-08A1, R01LM006910]
  3. NIGMS [R01GM107145]
  4. NATIONAL CENTER FOR ADVANCING TRANSLATIONAL SCIENCES [OT3TR002027] Funding Source: NIH RePORTER
  5. NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES [R01GM107145] Funding Source: NIH RePORTER
  6. NATIONAL LIBRARY OF MEDICINE [R01LM006910, R01LM009886] Funding Source: NIH RePORTER

向作者/读者索取更多资源

Columbia Open Health Data (COHD) is a publicly accessible database of electronic health record (EHR) prevalence and co-occurrence frequencies between conditions, drugs, procedures, and demographics. COHD was derived from Columbia University Irving Medical Center's Observational Health Data Sciences and Informatics (OHDSI) database. The lifetime dataset, derived from all records, contains 36,578 single concepts (11,952 conditions, 12,334 drugs, and 10,816 procedures) and 32,788,901 concept pairs from 5,364,781 patients. The 5-year dataset, derived from records from 2013-2017, contains 29,964 single concepts (10,159 conditions, 10,264 drugs, and 8,270 procedures) and 15,927,195 concept pairs from 1,790,431 patients. Exclusion of rare concepts (count <= 10) and Poisson randomization enable data sharing by eliminating risks to patient privacy. EHR prevalences are informative of healthcare consumption rates. Analysis of co-occurrence frequencies via relative frequency analysis and observed-expected frequency ratio are informative of associations between clinical concepts, useful for biomedical research tasks such as drug repurposing and pharmacovigilance. COHD is publicly accessible through a web application-programming interface (API) and downloadable from the Figshare repository. The code is available on GitHub.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据