4.8 Article

Comprehensive Evaluation of Machine Learning Models and Gene Expression Signatures for Prostate Cancer Prognosis Using Large Population Cohorts

Journal

CANCER RESEARCH
Volume 82, Issue 9, Pages 832-843

Publisher

AMER ASSOC CANCER RESEARCH
DOI: 10.1158/0008-5472.CAN-21-3074

Keywords

-

Categories

Funding

  1. University of California (UC) Riverside
  2. UC Cancer Research Coordinating Committee Competition Award
  3. USDA FACT Award [2019-67022-29930]
  4. Science and Technology Project of Guizhou Province in 2017 [(2017)5803]
  5. High-level innovative talent project of Guizhou Province in 2018 [(2018)5639]
  6. Science and Technology Plan Project of Guiyang in 2019 [(2019)2-15]
  7. National Natural Science Foundation of China [82072813, 8157142]
  8. Guangzhou Municipal Science and Technology Project [201803040001]

Ask authors/readers for more resources

This study systematically evaluated the performance of machine learning algorithms and gene expression signatures for prostate cancer prognosis. Survival analysis models outperformed binary classification models, with Cox-Ridge and Cox-PLS methods being the most robust. These findings can assist clinical decision-making.
Overtreatment remains a pervasive problem in prostate cancer management due to the highly variable and often indolent course of disease. Molecular signatures derived from gene expression profiling have played critical roles in guiding prostate cancer treatment decisions. Many gene expression signatures have been developed to improve the risk stratification of prostate cancer and some of them have already been applied to clinical practice. However, no comprehensive evaluation has been performed to compare the performance of these signatures. In this study, we conducted a systematic and unbiased evaluation of 15 machine learning (ML) algorithms and 30 published prostate cancer gene expression-based prognostic signatures leveraging 10 transcriptomics datasets with 1,558 primary patients with prostate cancer from public data repositories. This analysis revealed that survival analysis models outperformed binary classification models for risk assessment, and the performance of the survival analysis methods-Cox model regularized with ridge penalty (Cox-Ridge) and partial least squares (PLS) regression for Cox model (Cox-PLS)-were generally more robust than the other methods. Based on the Cox-Ridge algorithm, several top prognostic signatures displayed comparable or even better performance than commercial panels. These findings will facilitate the identification of existing prognostic signatures that are promising for further validation in prospective studies and promote the development of robust prognostic models to guide clinical decisionmaking. Moreover, this study provides a valuable data resource from large primary prostate cancer cohorts, which can be used to develop, validate, and evaluate novel statistical methodologies and molecular signatures to improve prostate cancer management. Significance: This systematic evaluation of 15 machine learning algorithms and 30 published gene expression signatures for the prognosis of prostate cancer will assist clinical decision-making.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available