4.8 Article

Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival

Publisher

NATL ACAD SCIENCES
DOI: 10.1073/pnas.1102826108

Keywords

applied topology; p53; systems biology

Funding

  1. National Institutes of Health [I-U54_CA149145-01]
  2. Air Force Office of Scientific Research [FA9550-09-0-1-0531]
  3. Office of Naval Research [N00014-08-1-0931]
  4. National Science Foundation [DMS 0905823]
  5. Breast Cancer Foundation
  6. National Cancer Institute
  7. Direct For Mathematical & Physical Scien
  8. Division Of Mathematical Sciences [905823, 808515] Funding Source: National Science Foundation

Ask authors/readers for more resources

High-throughput biological data, whether generated as sequencing, transcriptional microarrays, proteomic, or other means, continues to require analytic methods that address its high dimensional aspects. Because the computational part of data analysis ultimately identifies shape characteristics in the organization of data sets, the mathematics of shape recognition in high dimensions continues to be a crucial part of data analysis. This article introduces a method that extracts information from high-throughput microarray data and, by using topology, provides greater depth of information than current analytic techniques. The method, termed Progression Analysis of Disease (PAD), first identifies robust aspects of cluster analysis, then goes deeper to find a multitude of biologically meaningful shape characteristics in these data. Additionally, because PAD incorporates a visualization tool, it provides a simple picture or graph that can be used to further explore these data. Although PAD can be applied to a wide range of high-throughput data types, it is used here as an example to analyze breast cancer transcriptional data. This identified a unique subgroup of Estrogen Receptor-positive (ER+) breast cancers that express high levels of c-MYB and low levels of innate inflammatory genes. These patients exhibit 100% survival and no metastasis. No supervised step beyond distinction between tumor and healthy patients was used to identify this subtype. The group has a clear and distinct, statistically significant molecular signature, it highlights coherent biology but is invisible to cluster methods, and does not fit into the accepted classification of Luminal A/B, Normal-like subtypes of ER+ breast cancers. We denote the group as c-MYB+ breast cancer.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available