Machine learning predicts cancer-associated deep vein thrombosis using clinically available variables

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS (2022)

Journal

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS

Volume 161, Issue -, Pages -

Publisher

ELSEVIER IRELAND LTD

DOI: 10.1016/j.ijmedinf.2022.104733

Keywords

Neoplasms; Deep vein thrombosis; Machine learning; Decision making; Risk stratification

Funding

National Key Research and Development Project [2017YFC1309204]
National Natural Science Foundation [81660484]

Automated Summary

This study developed and validated machine learning models for predicting cancer-associated deep vein thrombosis (DVT) using five algorithms. The best recommended model was visualized through a nomogram and web calculator. It provides assistance in evaluating individualized DVT risk and making decisions.

Abstract

Purpose: To develop and validate machine learning (ML) models for cancer-associated deep vein thrombosis (DVT) and to compare the performance of these models with the Khorana score (KS).

Methods: We randomly extracted data of 2100 patients with cancer between Jan. 1, 2017, and Oct. 31, 2019; 1035 patients who underwent Doppler ultrasonography were enrolled. Univariate analysis and Lasso regression were applied to select important predictors. Model training and hyperparameter tuning were implemented on 70% of the data using ten-fold cross-validation. The remaining 30% of the data were used to compare the five ML models (linear discriminant analysis [LDA], logistic regression [LR], classification tree [CT], random forest [RF], and support vector machine [SVM]) and the KS on seven indicators (area under the receiver operating characteristic curve [AUC], sensitivity, specificity, accuracy, balanced accuracy, Brier score, and calibration curve).

Results: The incidence of cancer-associated DVT was 22.3%. According to RF, the top five predictors were D-dimer level, age, Charlson Comorbidity Index (CCI), length of stay (LOS), and history of previous venous thromboembolism (VTE). Only LDA (AUC = 0.773) and LR (AUC = 0.772) outperformed KS (AUC = 0.642), and combination with D-dimer improved performance in all models. A nomogram and web calculator (https://webcalculatorofcancerassociateddvt.shinyapps.io/dynnomapp/) were used to visualize the best recommended LR model.

Conclusion: This study developed and validated cancer-associated DVT predictive models using five ML algorithms and visualized the best recommended model using a nomogram and web calculator. The nomogram and web calculator may assist doctors and nurses in evaluating individualized cancer-associated DVT risk and making decisions. However, prospective cohort studies are needed to externally validate the recommended model.
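The scalar indicators named in the abstract (AUC, sensitivity, specificity, accuracy, balanced accuracy, and Brier score) can all be computed from the true labels and the predicted probabilities on the held-out data. The following is a minimal sketch in plain Python, not the authors' code. The function names, the toy labels and probabilities, and the 0.5 classification threshold are assumptions made for illustration; the abstract does not state which cutoff was used. The calibration curve, being a plot rather than a single number, is left out.

```python
from typing import Dict, Sequence


def auc_score(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs in which the
    positive case gets the higher predicted risk; ties count as 0.5."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def evaluate(y_true: Sequence[int], y_prob: Sequence[float],
             threshold: float = 0.5) -> Dict[str, float]:
    """Compute the scalar indicators for one model on a held-out set.

    `threshold` is an assumed cutoff for turning a probability into a
    DVT / no-DVT prediction; it is not taken from the paper.
    Assumes both classes are present in `y_true`.
    """
    y_pred = [1 if p >= threshold else 0 for p in y_prob]
    tp = sum(1 for y, yh in zip(y_true, y_pred) if y == 1 and yh == 1)
    tn = sum(1 for y, yh in zip(y_true, y_pred) if y == 0 and yh == 0)
    fp = sum(1 for y, yh in zip(y_true, y_pred) if y == 0 and yh == 1)
    fn = sum(1 for y, yh in zip(y_true, y_pred) if y == 1 and yh == 0)

    sensitivity = tp / (tp + fn)  # share of true DVT cases detected
    specificity = tn / (tn + fp)  # share of non-DVT cases correctly ruled out
    return {
        "auc": auc_score(y_true, y_prob),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": (tp + tn) / len(y_true),
        "balanced_accuracy": (sensitivity + specificity) / 2,
        # Brier score: mean squared error of the predicted probabilities
        "brier": sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true),
    }


# Toy held-out set (made-up values, for illustration only)
y_true = [1, 0, 1, 0, 0, 1, 0, 0]
y_prob = [0.9, 0.2, 0.6, 0.4, 0.7, 0.3, 0.1, 0.5]
metrics = evaluate(y_true, y_prob)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Balanced accuracy is reported alongside plain accuracy because the outcome is imbalanced (a DVT incidence of 22.3%): a model that predicts "no DVT" for everyone would reach roughly 78% accuracy while detecting no cases. A lower Brier score indicates better-calibrated probabilities.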
