4.7 Article

The prognostic value of machine learning techniques versus cox regression model for head and neck cancer

Journal

METHODS
Volume 205, Issue -, Pages 123-132

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.ymeth.2022.07.001

Keywords

Head and neck cancer; Machine learning; Cox regression; Prognostic prediction

Ask authors/readers for more resources

This study compared the prognostic value of various machine learning techniques and the statistical Cox regression model for different types of head and neck cancer. The results showed that the random forest model outperformed all other models in prognostic prediction, especially for major salivary gland cancer, oral cavity cancer, and oropharyngeal cancer. Age and tumor size were identified as the most important clinical variables in the random forest model.
Background: Accurate prognostic prediction for head and neck cancer (HNC) is important for the improvement of clinical management. We aimed to compare the prognostic value of various machine learning techniques (MLTs) and statistical Cox regression model for different types of HNC. Methods: Clinical data of HNC patients were extracted from the Surveillance, Epidemiology, and End Results (SEER) database from 1974 to 2016. The prediction performance of five ML models, including random forest (RF), gradient boosting decision tree (GBDT), support vector machine (SVM), neural network (NN) and deep learning (DL), were compared with the statistical Cox regression model by estimating the concordance index (Cindex), integrated Brier score (IBS), time-dependent receiver operating characteristic (ROC) curve and the area under the curve (AUC). Results: Our results showed that the RF model outperformed all other models in prognostic prediction for all tumor sites of HNC, particularly for major salivary gland cancer (MSGC, C-index: 88.730 +/- 0.8700, IBS: 7.680 +/- 0.4800), oral cavity cancer (OCC, C-index: 84.250 +/- 0.6700, IBS: 11.480 +/- 0.3300) and oropharyngeal cancer (OPC, C-index: 82.510 +/- 0.5400, IBS: 10.120 +/- 0.1400). Meanwhile, we analyzed the importance of each clinical variable in the RF model, in which age and tumor size presented the strongest positive prognostic effects. Additionally, similar results can be observed in the internal (6th edition of the AJCC TNM staging system cohort) and external validations (the TCGA HNC cohort). Conclusions: The RF model is a promising prognostic prediction tool for HNC patients, regardless of the anatomic subsites.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available