4.5 Article

The critical care data exchange format: a proposed flexible data standard for combining clinical and high-frequency physiologic data in critical care

Journal

PHYSIOLOGICAL MEASUREMENT
Volume 42, Issue 6, Pages -

Publisher

IOP PUBLISHING LTD
DOI: 10.1088/1361-6579/abfc9b

Keywords

critical care; ICU; data science; computation medicine; physiological monitoring; data sharing; precision medicine

Ask authors/readers for more resources

The study aimed to develop a standardized format for exchanging clinical and physiologic data from different sites without the need for specific schemas or tools. A collaborative process was used to establish a standard that allows for interoperability and data sharing with common analysis tools.
Objective. To develop a standardized format for exchanging clinical and physiologic data generated in the intensive care unit. Our goal was to develop a format that would accommodate the data collection pipelines of various sites but would not require dataset-specific schemas or ad-hoc tools for decoding and analysis. Approach. A number of centers had independently developed solutions for storing clinical and physiologic data using Hierarchical Data Format-Version 5 (HDF5), a well-supported standard already in use in multiple other fields. These individual solutions involved design choices that made the data difficult to share despite the underlying common framework. A collaborative process was used to form the basis of a proposed standard that would allow for interoperability and data sharing with common analysis tools. Main Results. We developed the HDF5-based critical care data exchange format which stores multiparametric data in an efficient, self-describing, hierarchical structure and supports real-time streaming and compression. In addition to cardiorespiratory and laboratory data, the format can, in future, accommodate other large datasets such as imaging and genomics. We demonstated the feasibility of a standardized format by converting data from three sites as well as the MIMIC III dataset. Significance. Individual approaches to the storage of multiparametric clinical data are proliferating, representing both a duplication of effort and a missed opportunity for collaboration. Adoption of a standardized format for clinical data exchange will enable the development of a digital biobank, facilitate the external validation of machine learning models and be a powerful tool for sharing multiparametric, high frequency patient level data in multisite clinical trials. Our proposed solution focuses on supporting standardized ontologies such as LOINC allowing easy reading of data regardless of the source and in so doing provides a useful method to integrate large amounts of existing data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available