4.3 Article

On the Use of Topological Features of Metabolic Networks for the Classification of Cancer Samples

Journal

CURRENT GENOMICS
Volume 22, Issue 2, Pages 88-97

Publisher

BENTHAM SCIENCE PUBL LTD
DOI: 10.2174/1389202922666210301084151

Keywords

Metabolic networks; cancer sample classification; machine learning; RNA-seq data; topological properties; network pruning

Funding

  1. Italian Ministry of University and Research (MIUR) through grant Dipartimenti di Eccel-lenza 2017
  2. CRUK/AECC/AIRC Accelerator Award [22790]
  3. National Council for Scientific and Technological Development (CNPq) [155957/2018-0]
  4. Sao Paulo Research Foundation (FAPESP) [2020/03514-9]
  5. Fundacao de Amparo a Pesquisa do Estado de Sao Paulo (FAPESP) [20/03514-9] Funding Source: FAPESP

Ask authors/readers for more resources

This study investigates the utilization of the topological properties of sample-specific metabolic networks for cancer classification, demonstrating that useful information can be extracted from a relatively limited number of features. Support Vector Machines achieve the best classification accuracy, suggesting the effectiveness of using these topological properties for cancer classification.
Background: The increasing availability of omics data collected from patients affected by severe pathologies, such as cancer, is fostering the development of data science methods for their analysis. Introduction: The combination of data integration and machine learning approaches can provide new powerful instruments to tackle the complexity of cancer development and deliver effective diagnostic and prognostic strategies. Methods: We explore the possibility of exploiting the topological properties of sample-specific metabolic networks as features in a supervised classification task. Such networks are obtained by projecting transcriptomic data from RNA-seq experiments on genome-wide metabolic models to define weighted networks modeling the overall metabolic activity of a given sample. Results: We show the classification results on a labeled breast cancer dataset from the TCGA database, including 210 samples (cancer vs. normal). In particular, we investigate how the performance is affected by a threshold-based pruning of the networks by comparing Artificial Neural Networks, Support Vector Machines and Random Forests. Interestingly, the best classification performance is achieved within a small threshold range for all methods, suggesting that it might represent an effective choice to recover useful information while filtering out noise from data. Overall, the best accuracy is achieved with SVMs, which exhibit performances similar to those obtained when gene expression profiles are used as features. Conclusion: These findings demonstrate that the topological properties of sample-specific metabolic networks are effective in classifying cancer and normal samples, suggesting that useful information can be extracted from a relatively limited number of features.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available