4.4 Article

Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records

期刊

HEALTH SERVICES RESEARCH
卷 -, 期 -, 页码 -

出版社

WILEY
DOI: 10.1111/1475-6773.14210

关键词

Alzheimer's disease and related dementia; electronic health records; machine learning; natural language processing; social determinants of health

向作者/读者索取更多资源

The objective of this study was to develop a natural language processing (NLP) algorithm that can identify various social determinants of health (SDoH) for patients with Alzheimer's disease and related dementias (ADRD). The algorithm was trained and validated using 1000 medical notes from the social worker records at Michigan Medicine. The rule-based NLP algorithm performed well in identifying most categories of SDoH.
Objective: To develop a natural language processing (NLP) algorithm that identifies social determinants of health (SDoH), including housing, transportation, food, and medication insecurities, social isolation, abuse, neglect, or exploitation, and financial difficulties for patients with Alzheimer's disease and related dementias (ADRD) from unstructured electronic health records (EHRs).Data Sources and Study Setting: We leveraged 1000 medical notes randomly selected from 7401 emergency department and inpatient social worker notes generated between 2015 and 2019 for 231 unique patients diagnosed with ADRD at Michigan Medicine.Study Design: We developed a rule-based NLP algorithm for the identification of seven domains of SDoH noted above. We also compared the rule-based algorithm with deep learning and regularized logistic regression approaches. These models were compared using accuracy, sensitivity, specificity, F1 score, and the area under the receiver operating characteristic curve (AUC). All notes were split into 700 notes for training NLP algorithms, and 300 notes for validation.Data Collection/Extraction MethodsSocial worker notes used in this study were extracted from the Michigan Medicine EHR database.Principal Findings: Of the 700 notes for training, F1 and AUC for the rule-based algorithm were at least 0.94 and 0.95, respectively, for all SDoH categories. Of the 300 notes for validation, F1 and AUC were at least 0.80 and 0.97, respectively, for all SDoH except housing and medication insecurities. The deep learning and regularized logistic regression algorithms had unsatisfactory performance.Conclusions: The rule-based algorithm can accurately extract SDoH information in all seven domains of SDoH except housing and medication insecurities. Findings from the algorithm can be used by clinicians and social workers to proactively address social needs of patients with ADRD and other vulnerable patient populations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据