4.6 Review

How is test laboratory data used and characterised by machine learning models? A systematic review of diagnostic and prognostic models developed for COVID-19 patients using only laboratory data

Journal

CLINICAL CHEMISTRY AND LABORATORY MEDICINE
Volume 60, Issue 12, Pages 1887-1901

Publisher

WALTER DE GRUYTER GMBH
DOI: 10.1515/cclm-2022-0182

Keywords

complete blood count (CBC); COVID-19; diagnostic study; laboratory tests; machine learning; prognostic study; SARS-CoV-2

Ask authors/readers for more resources

The current gold standard for COVID-19 diagnosis, rRT-PCR test, has limitations. Machine learning methods applied to digital imagery have attracted attention. This review focuses on machine learning-based diagnostic and prognostic studies using hematochemical parameters. The reviewed studies show heterogeneity and reporting issues. Closer collaboration between data scientists and medical laboratory professionals is needed.
The current gold standard for COVID-19 diagnosis, the rRT-PCR test, is hampered by long turnaround times, probable reagent shortages, high false-negative rates and high prices. As a result, machine learning (ML) methods have recently piqued interest, particularly when applied to digital imagery (X-rays and CT scans). In this review, the literature on ML-based diagnostic and prognostic studies grounded on hematochemical parameters has been considered. By doing so, a gap in the current literature was addressed concerning the application of machine learning to laboratory medicine. Sixty-eight articles have been included that were extracted from the Scopus and PubMed indexes. These studies were marked by a great deal of heterogeneity in terms of the examined laboratory test and clinical parameters, sample size, reference populations, ML algorithms, and validation approaches. The majority of research was found to be hampered by reporting and replicability issues: only four of the surveyed studies provided complete information on analytic procedures (units of measure, analyzing equipment), while 29 provided no information at all. Only 16 studies included independent external validation. In light of these findings, we discuss the importance of closer collaboration between data scientists and medical laboratory professionals in order to correctly characterise the relevant population, select the most appropriate statistical and analytical methods, ensure reproducibility, enable the proper interpretation of the results, and gain actual utility by using machine learning methods in clinical practice.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available