4.2 Article

Identification of stem cells from large cell populations with topological scoring

Journal

MOLECULAR OMICS
Volume 17, Issue 1, Pages 59-65

Publisher

ROYAL SOC CHEMISTRY
DOI: 10.1039/d0mo00039f

Keywords

-

Funding

  1. Stowers Institute for Medical Research
  2. National Institute of General Medical Sciences of the National Institutes of Health [RO1GM112639]

Ask authors/readers for more resources

Machine learning and topological analysis methods are increasingly utilized for large-scale omics datasets, including flow cytometry data which shares similarities with genomics and proteomics datasets. The TopS approach has been shown to effectively analyze flow cytometry datasets, identifying rare cell populations and providing novel insights into the data patterns. Coupling TopS with various methods like topological data analysis, X-shift clustering, and t-SNE can further enhance the analysis of large-scale flow cytometry datasets.
Machine learning and topological analysis methods are becoming increasingly used on various large-scale omics datasets. Modern high dimensional flow cytometry data sets share many features with other omics datasets like genomics and proteomics. For example, genomics or proteomics datasets can be sparse and have high dimensionality, and flow cytometry datasets can also share these features. This makes flow cytometry data potentially a suitable candidate for employing machine learning and topological scoring strategies, for example, to gain novel insights into patterns within the data. We have previously developed a Topological Score (TopS) and implemented it for the analysis of quantitative protein interaction network datasets. Here we show that TopS approach for large scale data analysis is applicable to the analysis of a previously described flow cytometry sorted human hematopoietic stem cell dataset. We demonstrate that TopS is capable of effectively sorting this dataset into cell populations and identify rare cell populations. We demonstrate the utility of TopS when coupled with multiple approaches including topological data analysis, X-shift clustering, and t-Distributed Stochastic Neighbor Embedding (t-SNE). Our results suggest that TopS could be effectively used to analyze large scale flow cytometry datasets to find rare cell populations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available