4.7 Article

Unifying Diagnosis Identification and Prediction Method Embedding the Disease Ontology Structure From Electronic Medical Records

Journal

FRONTIERS IN PUBLIC HEALTH
Volume 9, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/fpubh.2021.793801

Keywords

unifying diagnosis; disease ontology structure; set similarity measure; clustering; electronic medical records

Funding

  1. National Natural Science Foundation of China [71771034, 72101236, 71421001]
  2. Scientific and Technological Innovation Foundation of Dalian [2018J11CY009]
  3. Henan Province Youth Talent Promotion Project [2021HYTP052]
  4. Henan Province Medical Science and Technology Research Plan [LHGJ20200279]
  5. Henan Province Key Scientific Research Projects of Universities [21A320035]

Ask authors/readers for more resources

The objective of this study is to identify and predict a unifying diagnosis (UD) from electronic medical records (EMRs) using a data-driven approach. The results showed that the proposed method effectively extracted a typical diagnosis code co-occurrence pattern, achieved accurate prediction of UD based on patients' diagnostic and admission information, and outperformed other methods overall.
ObjectiveThe reasonable classification of a large number of distinct diagnosis codes can clarify patient diagnostic information and help clinicians to improve their ability to assign and target treatment for primary diseases. Our objective is to identify and predict a unifying diagnosis (UD) from electronic medical records (EMRs). MethodsWe screened 4,418 sepsis patients from a public MIMIC-III database and extracted their diagnostic information for UD identification, their demographic information, laboratory examination information, chief complaint, and history of present illness information for UD prediction. We proposed a data-driven UD identification and prediction method (UDIPM) embedding the disease ontology structure. First, we designed a set similarity measure method embedding the disease ontology structure to generate a patient similarity matrix. Second, we applied affinity propagation clustering to divide patients into different clusters, and extracted a typical diagnosis code co-occurrence pattern from each cluster. Furthermore, we identified a UD by fusing visual analysis and a conditional co-occurrence matrix. Finally, we trained five classifiers in combination with feature fusion and feature selection method to unify the diagnosis prediction. ResultsThe experimental results on a public electronic medical record dataset showed that the UDIPM could extracted a typical diagnosis code co-occurrence pattern effectively, identified and predicted a UD based on patients' diagnostic and admission information, and outperformed other fusion methods overall. ConclusionsThe accurate identification and prediction of the UD from a large number of distinct diagnosis codes and multi-source heterogeneous patient admission information in EMRs can provide a data-driven approach to assist better coding integration of diagnosis.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available