Machine learning algorithms do not outperform preoperative thresholds in predicting clinically meaningful improvements after total knee arthroplasty

KNEE SURGERY SPORTS TRAUMATOLOGY ARTHROSCOPY (2022)

Journal

KNEE SURGERY SPORTS TRAUMATOLOGY ARTHROSCOPY

Volume 30, Issue 8, Pages 2624-2630

Publisher

SPRINGER

DOI: 10.1007/s00167-021-06642-4

Keywords

Total knee arthroplasty; Machine learning; Artificial intelligence; Patient reported outcome measures; MCID

Automated Summary

This study compared the predictive performance of machine learning algorithms and preoperative PROM thresholds in predicting minimal clinically important difference (MCID) attainment at 2 years after total knee arthroplasty (TKA). Both methods performed similarly, with the patient's preoperative PROM score being the most important predictor of MCID attainment. ROC analysis identified optimal preoperative threshold values for the SF-36 PCS, MCS, and WOMAC, providing insight for future research.

Abstract

Purpose: Patient-reported outcome measures (PROMs) are important measures of success after total knee arthroplasty (TKA), and being able to predict their improvement could enhance preoperative decision-making. Our study aims to compare the predictive performance of machine learning (ML) algorithms and preoperative PROM thresholds in predicting minimal clinically important difference (MCID) attainment at 2 years after TKA.

Methods: Prospectively collected data from 2840 primary TKAs performed between 2008 and 2018 were extracted from our joint replacement registry and split into a training set (80%) and a test set (20%). Using the training set, ML algorithms were developed from patient demographics, comorbidities and preoperative PROMs, whereas the optimal preoperative threshold was determined using ROC analysis. Both methods were used to predict MCID attainment for the SF-36 PCS, MCS and WOMAC at 2 years postoperatively, with predictive performance evaluated on the independent test set.

Results: ML algorithms and preoperative PROM models performed similarly in predicting MCID for the SF-36 PCS (AUC: 0.77 vs 0.74), MCS (AUC: 0.95 vs 0.95) and WOMAC (AUC: 0.89 vs 0.88). For each outcome, the most important predictor of MCID attainment was the patient's preoperative PROM score. ROC analysis also identified optimal preoperative threshold values of 33.6, 54.1 and 72.7 for the SF-36 PCS, MCS and WOMAC, respectively.

Conclusion: ML algorithms did not perform significantly better than preoperative PROM thresholds in predicting MCID attainment after TKA. Future research should routinely compare the predictive ability of ML algorithms with existing methods and determine which types of clinical problems may benefit most from ML.
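The threshold arm of the comparison described in the Methods can be illustrated with a minimal sketch. It is not the authors' code. The abstract says only that the optimal threshold came from "ROC analysis", so the use of Youden's J statistic here is an assumption, as is the direction of prediction (lower preoperative score taken to predict MCID attainment). The cohort is synthetic and the helper names `auc` and `youden_threshold` are hypothetical.

```python
import math
import random

def auc(scores, labels):
    """Area under the ROC curve via the Mann-Whitney statistic (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def youden_threshold(preop, labels):
    """Cutoff t maximising sensitivity + specificity - 1.

    Assumption: MCID attainment is predicted when the preoperative score <= t.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    best_t, best_j = None, -1.0
    for t in sorted(set(preop)):
        tp = sum(1 for x, y in zip(preop, labels) if x <= t and y == 1)
        tn = sum(1 for x, y in zip(preop, labels) if x > t and y == 0)
        j = tp / n_pos + tn / n_neg - 1
        if j > best_j:
            best_t, best_j = t, j
    return best_t, best_j

# Synthetic cohort, NOT registry data: lower preoperative scores are made
# more likely to attain the MCID (an assumption for illustration only).
random.seed(0)
cohort = []
for _ in range(500):
    score = random.gauss(50, 15)
    p_mcid = 1 / (1 + math.exp((score - 55) / 5))
    cohort.append((score, 1 if random.random() < p_mcid else 0))

# 80/20 split, mirroring the proportions stated in the Methods.
random.shuffle(cohort)
split = int(0.8 * len(cohort))
train, test = cohort[:split], cohort[split:]
train_x, train_y = zip(*train)
test_x, test_y = zip(*test)

# Threshold is chosen on the training set only; AUC is computed on the test set.
threshold, j_stat = youden_threshold(train_x, train_y)
test_auc = auc([-x for x in test_x], test_y)  # negate: lower score = higher risk score
print(f"threshold={threshold:.1f}  Youden J={j_stat:.2f}  test AUC={test_auc:.2f}")
```

In a full comparison, the same held-out test set would also be scored by the ML model, and the two AUCs set side by side as in the Results.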
