4.3 Article

External validation of Machine Learning models for COVID-19 detection based on Complete Blood Count

Journal

HEALTH INFORMATION SCIENCE AND SYSTEMS
Volume 9, Issue 1, Pages -

Publisher

SPRINGER
DOI: 10.1007/s13755-021-00167-3

Keywords

COVID-19; Machine Learning; External validation; Calibration; Complete Blood count

Funding

  1. Universita degli Studi di Milano - Bicocca within the CRUI-CARE Agreement

Ask authors/readers for more resources

This study externally validated 6 state-of-the-art diagnostic ML models trained on CBC with an average AUC of 95% and Brier score of 0.11, outperforming existing methods. The best performing model (SVM) achieved an average AUC of 97.5% and good calibration, showing potential for early identification of COVID-19 patients.
Purpose The rRT-PCR for COVID-19 diagnosis is affected by long turnaround time, potential shortage of reagents, high false-negative rates and high costs. Routine hematochemical tests are a faster and less expensive alternative for diagnosis. Thus, Machine Learning (ML) has been applied to hematological parameters to develop diagnostic tools and help clinicians in promptly managing positive patients. However, few ML models have been externally validated, making their real-world applicability unclear. Methods We externally validate 6 state-of-the-art diagnostic ML models, based on Complete Blood Count (CBC) and trained on a dataset encompassing 816 COVID-19 positive cases. The external validation was performed based on two datasets, collected at two different hospitals in northern Italy and encompassing 163 and 104 COVID-19 positive cases, in terms of both error rate and calibration. Results and Conclusion We report an average AUC of 95% and average Brier score of 0.11, out-performing existing ML methods, and showing good cross-site transportability. The best performing model (SVM) reported an average AUC of 97.5% (Sensitivity: 87.5%, Specificity: 94%), comparable with the performance of RT-PCR, and was also the best calibrated. The validated models can be useful in the early identification of potential COVID-19 patients, due to the rapid availability of CBC exams, and in multiple test settings.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available