4.7 Article

Accuracy of mutational signature software on correlated signatures

Journal

SCIENTIFIC REPORTS
Volume 12, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41598-021-04207-6

Keywords

-

Funding

  1. Singapore National Medical Research Council [MOH-000032/MOH-CIRG18may-0004, NMRC/CIRG/1422/2015]
  2. Singapore Ministry of Health via the Duke-NUS Signature Research Programmes
  3. Singapore Ministry of Education via the Duke-NUS Signature Research Programmes

Ask authors/readers for more resources

Mutational signatures are important for various fields such as DNA damage and repair, aging, cancer biology, etc. Unsupervised learning can infer mutational signatures, but separating correlated signatures is a challenge. While evaluating different methods, some outperform others, highlighting the complexity of mutational signature extraction.
Mutational signatures are characteristic patterns of mutations generated by exogenous mutagens or by endogenous mutational processes. Mutational signatures are important for research into DNA damage and repair, aging, cancer biology, genetic toxicology, and epidemiology. Unsupervised learning can infer mutational signatures from the somatic mutations in large numbers of tumors, and separating correlated signatures is a notable challenge for this task. To investigate which methods can best meet this challenge, we assessed 18 computational methods for inferring mutational signatures on 20 synthetic data sets that incorporated varying degrees of correlated activity of two common mutational signatures. Performance varied widely, and four methods noticeably outperformed the others: hdp (based on hierarchical Dirichlet processes), SigProExtractor (based on multiple non-negative matrix factorizations over resampled data), TCSM (based on an approach used in document topic analysis), and mutSpec.NMF (also based on non-negative matrix factorization). The results underscored the complexities of mutational signature extraction, including the importance and difficulty of determining the correct number of signatures and the importance of hyperparameters. Our findings indicate directions for improvement of the software and show a need for care when interpreting results from any of these methods, including the need for assessing sensitivity of the results to input parameters.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available