4.7 Article

Semi-Supervised Topological Analysis for Elucidating Hidden Structures in High-Dimensional Transcriptome Datasets

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2019.2950657

关键词

Data models; Bioinformatics; Data analysis; Genomics; Manganese; Data mining; Data structures; Data and knowledge visualization; data mining; bioinformatics (genome or protein) databases

资金

  1. Center for Individualized Medicine at Mayo Clinic
  2. career enhancement award from the Mayo Clinic Ovarian SPORE [P50 CA136393]

向作者/读者索取更多资源

Topological data analysis is a powerful method for dimensionality reduction, data relationship mining, and data structure representation, but current TDA modeling frameworks do not take into account domain context information and prior knowledge. The developed semi-supervised topological analysis (STA) framework, validated with simulation data, has been successfully applied to real gene expression and ovarian cancer data.
Topological data analysis (TDA) is a powerful method for reducing data dimensionality, mining underlying data relationships, and intuitively representing the data structure. The Mapper algorithm is one such tool that projects high-dimensional data to 1-dimensional space by using a filter function that is subsequently used to reconstruct the data topology relationships. However, domain context information and prior knowledge have not been considered in current TDA modeling frameworks. Here, we report the development and evaluation of a semi-supervised topological analysis (STA) framework that incorporates discrete or continuously labeled data points and selects the most relevant filter functions accordingly. We validate the proposed STA framework with simulation data and then apply it to samples from Genotype-Tissue Expression data and ovarian cancer transcriptome datasets. The graphs generated by STA for these 2 datasets, based on gene expression profiles, are consistent with prior knowledge, thereby supporting the effectiveness of the proposed framework.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据