4.7 Article

Semi-Supervised Topological Analysis for Elucidating Hidden Structures in High-Dimensional Transcriptome Datasets

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2019.2950657

Keywords

Data models; Bioinformatics; Data analysis; Genomics; Manganese; Data mining; Data structures; Data and knowledge visualization; data mining; bioinformatics (genome or protein) databases

Funding

  1. Center for Individualized Medicine at Mayo Clinic
  2. career enhancement award from the Mayo Clinic Ovarian SPORE [P50 CA136393]

Ask authors/readers for more resources

Topological data analysis is a powerful method for dimensionality reduction, data relationship mining, and data structure representation, but current TDA modeling frameworks do not take into account domain context information and prior knowledge. The developed semi-supervised topological analysis (STA) framework, validated with simulation data, has been successfully applied to real gene expression and ovarian cancer data.
Topological data analysis (TDA) is a powerful method for reducing data dimensionality, mining underlying data relationships, and intuitively representing the data structure. The Mapper algorithm is one such tool that projects high-dimensional data to 1-dimensional space by using a filter function that is subsequently used to reconstruct the data topology relationships. However, domain context information and prior knowledge have not been considered in current TDA modeling frameworks. Here, we report the development and evaluation of a semi-supervised topological analysis (STA) framework that incorporates discrete or continuously labeled data points and selects the most relevant filter functions accordingly. We validate the proposed STA framework with simulation data and then apply it to samples from Genotype-Tissue Expression data and ovarian cancer transcriptome datasets. The graphs generated by STA for these 2 datasets, based on gene expression profiles, are consistent with prior knowledge, thereby supporting the effectiveness of the proposed framework.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available